xref: /onnv-gate/usr/src/cmd/perl/5.8.4/distrib/pod/perlrun.pod (revision 0:68f95e015346)
1=head1 NAME
2
3perlrun - how to execute the Perl interpreter
4
5=head1 SYNOPSIS
6
7B<perl>	S<[ B<-sTtuUWX> ]>
8	S<[ B<-hv> ] [ B<-V>[:I<configvar>] ]>
9	S<[ B<-cw> ] [ B<-d>[:I<debugger>] ] [ B<-D>[I<number/list>] ]>
10	S<[ B<-pna> ] [ B<-F>I<pattern> ] [ B<-l>[I<octal>] ] [ B<-0>[I<octal/hexadecimal>] ]>
11	S<[ B<-I>I<dir> ] [ B<-m>[B<->]I<module> ] [ B<-M>[B<->]I<'module...'> ]>
12	S<[ B<-P> ]>
13	S<[ B<-S> ]>
14	S<[ B<-x>[I<dir>] ]>
15	S<[ B<-i>[I<extension>] ]>
16	S<[ B<-e> I<'command'> ] [ B<--> ] [ I<programfile> ] [ I<argument> ]...>
17	S<[ B<-C [I<number/list>] >]> ]>
18
19=head1 DESCRIPTION
20
21The normal way to run a Perl program is by making it directly
22executable, or else by passing the name of the source file as an
23argument on the command line.  (An interactive Perl environment
24is also possible--see L<perldebug> for details on how to do that.)
25Upon startup, Perl looks for your program in one of the following
26places:
27
28=over 4
29
30=item 1.
31
32Specified line by line via B<-e> switches on the command line.
33
34=item 2.
35
36Contained in the file specified by the first filename on the command line.
37(Note that systems supporting the #! notation invoke interpreters this
38way. See L<Location of Perl>.)
39
40=item 3.
41
42Passed in implicitly via standard input.  This works only if there are
43no filename arguments--to pass arguments to a STDIN-read program you
44must explicitly specify a "-" for the program name.
45
46=back
47
48With methods 2 and 3, Perl starts parsing the input file from the
49beginning, unless you've specified a B<-x> switch, in which case it
50scans for the first line starting with #! and containing the word
51"perl", and starts there instead.  This is useful for running a program
52embedded in a larger message.  (In this case you would indicate the end
53of the program using the C<__END__> token.)
54
55The #! line is always examined for switches as the line is being
56parsed.  Thus, if you're on a machine that allows only one argument
57with the #! line, or worse, doesn't even recognize the #! line, you
58still can get consistent switch behavior regardless of how Perl was
59invoked, even if B<-x> was used to find the beginning of the program.
60
61Because historically some operating systems silently chopped off
62kernel interpretation of the #! line after 32 characters, some
63switches may be passed in on the command line, and some may not;
64you could even get a "-" without its letter, if you're not careful.
65You probably want to make sure that all your switches fall either
66before or after that 32-character boundary.  Most switches don't
67actually care if they're processed redundantly, but getting a "-"
68instead of a complete switch could cause Perl to try to execute
69standard input instead of your program.  And a partial B<-I> switch
70could also cause odd results.
71
72Some switches do care if they are processed twice, for instance
73combinations of B<-l> and B<-0>.  Either put all the switches after
74the 32-character boundary (if applicable), or replace the use of
75B<-0>I<digits> by C<BEGIN{ $/ = "\0digits"; }>.
76
77Parsing of the #! switches starts wherever "perl" is mentioned in the line.
78The sequences "-*" and "- " are specifically ignored so that you could,
79if you were so inclined, say
80
81    #!/bin/sh -- # -*- perl -*- -p
82    eval 'exec perl -wS $0 ${1+"$@"}'
83        if $running_under_some_shell;
84
85to let Perl see the B<-p> switch.
86
87A similar trick involves the B<env> program, if you have it.
88
89    #!/usr/bin/env perl
90
91The examples above use a relative path to the perl interpreter,
92getting whatever version is first in the user's path.  If you want
93a specific version of Perl, say, perl5.005_57, you should place
94that directly in the #! line's path.
95
96If the #! line does not contain the word "perl", the program named after
97the #! is executed instead of the Perl interpreter.  This is slightly
98bizarre, but it helps people on machines that don't do #!, because they
99can tell a program that their SHELL is F</usr/bin/perl>, and Perl will then
100dispatch the program to the correct interpreter for them.
101
102After locating your program, Perl compiles the entire program to an
103internal form.  If there are any compilation errors, execution of the
104program is not attempted.  (This is unlike the typical shell script,
105which might run part-way through before finding a syntax error.)
106
107If the program is syntactically correct, it is executed.  If the program
108runs off the end without hitting an exit() or die() operator, an implicit
109C<exit(0)> is provided to indicate successful completion.
110
111=head2 #! and quoting on non-Unix systems
112
113Unix's #! technique can be simulated on other systems:
114
115=over 4
116
117=item OS/2
118
119Put
120
121    extproc perl -S -your_switches
122
123as the first line in C<*.cmd> file (B<-S> due to a bug in cmd.exe's
124`extproc' handling).
125
126=item MS-DOS
127
128Create a batch file to run your program, and codify it in
129C<ALTERNATE_SHEBANG> (see the F<dosish.h> file in the source
130distribution for more information).
131
132=item Win95/NT
133
134The Win95/NT installation, when using the ActiveState installer for Perl,
135will modify the Registry to associate the F<.pl> extension with the perl
136interpreter.  If you install Perl by other means (including building from
137the sources), you may have to modify the Registry yourself.  Note that
138this means you can no longer tell the difference between an executable
139Perl program and a Perl library file.
140
141=item Macintosh
142
143A Macintosh perl program will have the appropriate Creator and
144Type, so that double-clicking them will invoke the perl application.
145
146=item VMS
147
148Put
149
150    $ perl -mysw 'f$env("procedure")' 'p1' 'p2' 'p3' 'p4' 'p5' 'p6' 'p7' 'p8' !
151    $ exit++ + ++$status != 0 and $exit = $status = undef;
152
153at the top of your program, where B<-mysw> are any command line switches you
154want to pass to Perl.  You can now invoke the program directly, by saying
155C<perl program>, or as a DCL procedure, by saying C<@program> (or implicitly
156via F<DCL$PATH> by just using the name of the program).
157
158This incantation is a bit much to remember, but Perl will display it for
159you if you say C<perl "-V:startperl">.
160
161=back
162
163Command-interpreters on non-Unix systems have rather different ideas
164on quoting than Unix shells.  You'll need to learn the special
165characters in your command-interpreter (C<*>, C<\> and C<"> are
166common) and how to protect whitespace and these characters to run
167one-liners (see B<-e> below).
168
169On some systems, you may have to change single-quotes to double ones,
170which you must I<not> do on Unix or Plan 9 systems.  You might also
171have to change a single % to a %%.
172
173For example:
174
175    # Unix
176    perl -e 'print "Hello world\n"'
177
178    # MS-DOS, etc.
179    perl -e "print \"Hello world\n\""
180
181    # Macintosh
182    print "Hello world\n"
183     (then Run "Myscript" or Shift-Command-R)
184
185    # VMS
186    perl -e "print ""Hello world\n"""
187
188The problem is that none of this is reliable: it depends on the
189command and it is entirely possible neither works.  If B<4DOS> were
190the command shell, this would probably work better:
191
192    perl -e "print <Ctrl-x>"Hello world\n<Ctrl-x>""
193
194B<CMD.EXE> in Windows NT slipped a lot of standard Unix functionality in
195when nobody was looking, but just try to find documentation for its
196quoting rules.
197
198Under the Macintosh, it depends which environment you are using.  The MacPerl
199shell, or MPW, is much like Unix shells in its support for several
200quoting variants, except that it makes free use of the Macintosh's non-ASCII
201characters as control characters.
202
203There is no general solution to all of this.  It's just a mess.
204
205=head2 Location of Perl
206
207It may seem obvious to say, but Perl is useful only when users can
208easily find it.  When possible, it's good for both F</usr/bin/perl>
209and F</usr/local/bin/perl> to be symlinks to the actual binary.  If
210that can't be done, system administrators are strongly encouraged
211to put (symlinks to) perl and its accompanying utilities into a
212directory typically found along a user's PATH, or in some other
213obvious and convenient place.
214
215In this documentation, C<#!/usr/bin/perl> on the first line of the program
216will stand in for whatever method works on your system.  You are
217advised to use a specific path if you care about a specific version.
218
219    #!/usr/local/bin/perl5.00554
220
221or if you just want to be running at least version, place a statement
222like this at the top of your program:
223
224    use 5.005_54;
225
226=head2 Command Switches
227
228As with all standard commands, a single-character switch may be
229clustered with the following switch, if any.
230
231    #!/usr/bin/perl -spi.orig	# same as -s -p -i.orig
232
233Switches include:
234
235=over 5
236
237=item B<-0>[I<octal/hexadecimal>]
238
239specifies the input record separator (C<$/>) as an octal or
240hexadecimal number.  If there are no digits, the null character is the
241separator.  Other switches may precede or follow the digits.  For
242example, if you have a version of B<find> which can print filenames
243terminated by the null character, you can say this:
244
245    find . -name '*.orig' -print0 | perl -n0e unlink
246
247The special value 00 will cause Perl to slurp files in paragraph mode.
248The value 0777 will cause Perl to slurp files whole because there is no
249legal byte with that value.
250
251If you want to specify any Unicode character, use the hexadecimal
252format: C<-0xHHH...>, where the C<H> are valid hexadecimal digits.
253(This means that you cannot use the C<-x> with a directory name that
254consists of hexadecimal digits.)
255
256=item B<-a>
257
258turns on autosplit mode when used with a B<-n> or B<-p>.  An implicit
259split command to the @F array is done as the first thing inside the
260implicit while loop produced by the B<-n> or B<-p>.
261
262    perl -ane 'print pop(@F), "\n";'
263
264is equivalent to
265
266    while (<>) {
267	@F = split(' ');
268	print pop(@F), "\n";
269    }
270
271An alternate delimiter may be specified using B<-F>.
272
273=item B<-C [I<number/list>]>
274
275The C<-C> flag controls some Unicode of the Perl Unicode features.
276
277As of 5.8.1, the C<-C> can be followed either by a number or a list
278of option letters.  The letters, their numeric values, and effects
279are as follows; listing the letters is equal to summing the numbers.
280
281    I     1    STDIN is assumed to be in UTF-8
282    O     2    STDOUT will be in UTF-8
283    E     4    STDERR will be in UTF-8
284    S     7    I + O + E
285    i     8    UTF-8 is the default PerlIO layer for input streams
286    o    16    UTF-8 is the default PerlIO layer for output streams
287    D    24    i + o
288    A    32    the @ARGV elements are expected to be strings encoded in UTF-8
289    L    64    normally the "IOEioA" are unconditional,
290               the L makes them conditional on the locale environment
291               variables (the LC_ALL, LC_TYPE, and LANG, in the order
292               of decreasing precedence) -- if the variables indicate
293               UTF-8, then the selected "IOEioA" are in effect
294
295For example, C<-COE> and C<-C6> will both turn on UTF-8-ness on both
296STDOUT and STDERR.  Repeating letters is just redundant, not cumulative
297nor toggling.
298
299The C<io> options mean that any subsequent open() (or similar I/O
300operations) will have the C<:utf8> PerlIO layer implicitly applied
301to them, in other words, UTF-8 is expected from any input stream,
302and UTF-8 is produced to any output stream.  This is just the default,
303with explicit layers in open() and with binmode() one can manipulate
304streams as usual.
305
306C<-C> on its own (not followed by any number or option list), or the
307empty string C<""> for the C<PERL_UNICODE> environment variable, has the
308same effect as C<-CSDL>.  In other words, the standard I/O handles and
309the default C<open()> layer are UTF-8-fied B<but> only if the locale
310environment variables indicate a UTF-8 locale.  This behaviour follows
311the I<implicit> (and problematic) UTF-8 behaviour of Perl 5.8.0.
312
313You can use C<-C0> (or C<"0"> for C<PERL_UNICODE>) to explicitly
314disable all the above Unicode features.
315
316The read-only magic variable C<${^UNICODE}> reflects the numeric value
317of this setting.  This is variable is set during Perl startup and is
318thereafter read-only.  If you want runtime effects, use the three-arg
319open() (see L<perlfunc/open>), the two-arg binmode() (see L<perlfunc/binmode>),
320and the C<open> pragma (see L<open>).
321
322(In Perls earlier than 5.8.1 the C<-C> switch was a Win32-only switch
323that enabled the use of Unicode-aware "wide system call" Win32 APIs.
324This feature was practically unused, however, and the command line
325switch was therefore "recycled".)
326
327=item B<-c>
328
329causes Perl to check the syntax of the program and then exit without
330executing it.  Actually, it I<will> execute C<BEGIN>, C<CHECK>, and
331C<use> blocks, because these are considered as occurring outside the
332execution of your program.  C<INIT> and C<END> blocks, however, will
333be skipped.
334
335=item B<-d>
336
337runs the program under the Perl debugger.  See L<perldebug>.
338
339=item B<-d:>I<foo[=bar,baz]>
340
341runs the program under the control of a debugging, profiling, or
342tracing module installed as Devel::foo. E.g., B<-d:DProf> executes
343the program using the Devel::DProf profiler.  As with the B<-M>
344flag, options may be passed to the Devel::foo package where they
345will be received and interpreted by the Devel::foo::import routine.
346The comma-separated list of options must follow a C<=> character.
347See L<perldebug>.
348
349=item B<-D>I<letters>
350
351=item B<-D>I<number>
352
353sets debugging flags.  To watch how it executes your program, use
354B<-Dtls>.  (This works only if debugging is compiled into your
355Perl.)  Another nice value is B<-Dx>, which lists your compiled
356syntax tree.  And B<-Dr> displays compiled regular expressions;
357the format of the output is explained in L<perldebguts>.
358
359As an alternative, specify a number instead of list of letters (e.g.,
360B<-D14> is equivalent to B<-Dtls>):
361
362        1  p  Tokenizing and parsing
363        2  s  Stack snapshots
364                with v, displays all stacks
365        4  l  Context (loop) stack processing
366        8  t  Trace execution
367       16  o  Method and overloading resolution
368       32  c  String/numeric conversions
369       64  P  Print profiling info, preprocessor command for -P, source file input state
370      128  m  Memory allocation
371      256  f  Format processing
372      512  r  Regular expression parsing and execution
373     1024  x  Syntax tree dump
374     2048  u  Tainting checks
375     4096     (Obsolete, previously used for LEAKTEST)
376     8192  H  Hash dump -- usurps values()
377    16384  X  Scratchpad allocation
378    32768  D  Cleaning up
379    65536  S  Thread synchronization
380   131072  T  Tokenising
381   262144  R  Include reference counts of dumped variables (eg when using -Ds)
382   524288  J  Do not s,t,P-debug (Jump over) opcodes within package DB
383  1048576  v  Verbose: use in conjunction with other flags
384  2097152  C  Copy On Write
385
386All these flags require B<-DDEBUGGING> when you compile the Perl
387executable (but see L<Devel::Peek>, L<re> which may change this).
388See the F<INSTALL> file in the Perl source distribution
389for how to do this.  This flag is automatically set if you include B<-g>
390option when C<Configure> asks you about optimizer/debugger flags.
391
392If you're just trying to get a print out of each line of Perl code
393as it executes, the way that C<sh -x> provides for shell scripts,
394you can't use Perl's B<-D> switch.  Instead do this
395
396  # If you have "env" utility
397  env=PERLDB_OPTS="NonStop=1 AutoTrace=1 frame=2" perl -dS program
398
399  # Bourne shell syntax
400  $ PERLDB_OPTS="NonStop=1 AutoTrace=1 frame=2" perl -dS program
401
402  # csh syntax
403  % (setenv PERLDB_OPTS "NonStop=1 AutoTrace=1 frame=2"; perl -dS program)
404
405See L<perldebug> for details and variations.
406
407=item B<-e> I<commandline>
408
409may be used to enter one line of program.  If B<-e> is given, Perl
410will not look for a filename in the argument list.  Multiple B<-e>
411commands may be given to build up a multi-line script.  Make sure
412to use semicolons where you would in a normal program.
413
414=item B<-F>I<pattern>
415
416specifies the pattern to split on if B<-a> is also in effect.  The
417pattern may be surrounded by C<//>, C<"">, or C<''>, otherwise it will be
418put in single quotes.
419
420=item B<-h>
421
422prints a summary of the options.
423
424=item B<-i>[I<extension>]
425
426specifies that files processed by the C<E<lt>E<gt>> construct are to be
427edited in-place.  It does this by renaming the input file, opening the
428output file by the original name, and selecting that output file as the
429default for print() statements.  The extension, if supplied, is used to
430modify the name of the old file to make a backup copy, following these
431rules:
432
433If no extension is supplied, no backup is made and the current file is
434overwritten.
435
436If the extension doesn't contain a C<*>, then it is appended to the
437end of the current filename as a suffix.  If the extension does
438contain one or more C<*> characters, then each C<*> is replaced
439with the current filename.  In Perl terms, you could think of this
440as:
441
442    ($backup = $extension) =~ s/\*/$file_name/g;
443
444This allows you to add a prefix to the backup file, instead of (or in
445addition to) a suffix:
446
447    $ perl -pi'orig_*' -e 's/bar/baz/' fileA	# backup to 'orig_fileA'
448
449Or even to place backup copies of the original files into another
450directory (provided the directory already exists):
451
452    $ perl -pi'old/*.orig' -e 's/bar/baz/' fileA # backup to 'old/fileA.orig'
453
454These sets of one-liners are equivalent:
455
456    $ perl -pi -e 's/bar/baz/' fileA		# overwrite current file
457    $ perl -pi'*' -e 's/bar/baz/' fileA		# overwrite current file
458
459    $ perl -pi'.orig' -e 's/bar/baz/' fileA	# backup to 'fileA.orig'
460    $ perl -pi'*.orig' -e 's/bar/baz/' fileA	# backup to 'fileA.orig'
461
462From the shell, saying
463
464    $ perl -p -i.orig -e "s/foo/bar/; ... "
465
466is the same as using the program:
467
468    #!/usr/bin/perl -pi.orig
469    s/foo/bar/;
470
471which is equivalent to
472
473    #!/usr/bin/perl
474    $extension = '.orig';
475    LINE: while (<>) {
476	if ($ARGV ne $oldargv) {
477	    if ($extension !~ /\*/) {
478		$backup = $ARGV . $extension;
479	    }
480	    else {
481		($backup = $extension) =~ s/\*/$ARGV/g;
482	    }
483	    rename($ARGV, $backup);
484	    open(ARGVOUT, ">$ARGV");
485	    select(ARGVOUT);
486	    $oldargv = $ARGV;
487	}
488	s/foo/bar/;
489    }
490    continue {
491	print;	# this prints to original filename
492    }
493    select(STDOUT);
494
495except that the B<-i> form doesn't need to compare $ARGV to $oldargv to
496know when the filename has changed.  It does, however, use ARGVOUT for
497the selected filehandle.  Note that STDOUT is restored as the default
498output filehandle after the loop.
499
500As shown above, Perl creates the backup file whether or not any output
501is actually changed.  So this is just a fancy way to copy files:
502
503    $ perl -p -i'/some/file/path/*' -e 1 file1 file2 file3...
504or
505    $ perl -p -i'.orig' -e 1 file1 file2 file3...
506
507You can use C<eof> without parentheses to locate the end of each input
508file, in case you want to append to each file, or reset line numbering
509(see example in L<perlfunc/eof>).
510
511If, for a given file, Perl is unable to create the backup file as
512specified in the extension then it will skip that file and continue on
513with the next one (if it exists).
514
515For a discussion of issues surrounding file permissions and B<-i>,
516see L<perlfaq5/Why does Perl let me delete read-only files?  Why does -i clobber protected files?  Isn't this a bug in Perl?>.
517
518You cannot use B<-i> to create directories or to strip extensions from
519files.
520
521Perl does not expand C<~> in filenames, which is good, since some
522folks use it for their backup files:
523
524    $ perl -pi~ -e 's/foo/bar/' file1 file2 file3...
525
526Finally, the B<-i> switch does not impede execution when no
527files are given on the command line.  In this case, no backup is made
528(the original file cannot, of course, be determined) and processing
529proceeds from STDIN to STDOUT as might be expected.
530
531=item B<-I>I<directory>
532
533Directories specified by B<-I> are prepended to the search path for
534modules (C<@INC>), and also tells the C preprocessor where to search for
535include files.  The C preprocessor is invoked with B<-P>; by default it
536searches /usr/include and /usr/lib/perl.
537
538=item B<-l>[I<octnum>]
539
540enables automatic line-ending processing.  It has two separate
541effects.  First, it automatically chomps C<$/> (the input record
542separator) when used with B<-n> or B<-p>.  Second, it assigns C<$\>
543(the output record separator) to have the value of I<octnum> so
544that any print statements will have that separator added back on.
545If I<octnum> is omitted, sets C<$\> to the current value of
546C<$/>.  For instance, to trim lines to 80 columns:
547
548    perl -lpe 'substr($_, 80) = ""'
549
550Note that the assignment C<$\ = $/> is done when the switch is processed,
551so the input record separator can be different than the output record
552separator if the B<-l> switch is followed by a B<-0> switch:
553
554    gnufind / -print0 | perl -ln0e 'print "found $_" if -p'
555
556This sets C<$\> to newline and then sets C<$/> to the null character.
557
558=item B<-m>[B<->]I<module>
559
560=item B<-M>[B<->]I<module>
561
562=item B<-M>[B<->]I<'module ...'>
563
564=item B<-[mM]>[B<->]I<module=arg[,arg]...>
565
566B<-m>I<module> executes C<use> I<module> C<();> before executing your
567program.
568
569B<-M>I<module> executes C<use> I<module> C<;> before executing your
570program.  You can use quotes to add extra code after the module name,
571e.g., C<'-Mmodule qw(foo bar)'>.
572
573If the first character after the B<-M> or B<-m> is a dash (C<->)
574then the 'use' is replaced with 'no'.
575
576A little builtin syntactic sugar means you can also say
577B<-mmodule=foo,bar> or B<-Mmodule=foo,bar> as a shortcut for
578C<'-Mmodule qw(foo bar)'>.  This avoids the need to use quotes when
579importing symbols.  The actual code generated by B<-Mmodule=foo,bar> is
580C<use module split(/,/,q{foo,bar})>.  Note that the C<=> form
581removes the distinction between B<-m> and B<-M>.
582
583=item B<-n>
584
585causes Perl to assume the following loop around your program, which
586makes it iterate over filename arguments somewhat like B<sed -n> or
587B<awk>:
588
589  LINE:
590    while (<>) {
591	...		# your program goes here
592    }
593
594Note that the lines are not printed by default.  See B<-p> to have
595lines printed.  If a file named by an argument cannot be opened for
596some reason, Perl warns you about it and moves on to the next file.
597
598Here is an efficient way to delete all files that haven't been modifed for
599at least a week:
600
601    find . -mtime +7 -print | perl -nle unlink
602
603This is faster than using the B<-exec> switch of B<find> because you don't
604have to start a process on every filename found.  It does suffer from
605the bug of mishandling newlines in pathnames, which you can fix if
606you follow the example under B<-0>.
607
608C<BEGIN> and C<END> blocks may be used to capture control before or after
609the implicit program loop, just as in B<awk>.
610
611=item B<-p>
612
613causes Perl to assume the following loop around your program, which
614makes it iterate over filename arguments somewhat like B<sed>:
615
616
617  LINE:
618    while (<>) {
619	...		# your program goes here
620    } continue {
621	print or die "-p destination: $!\n";
622    }
623
624If a file named by an argument cannot be opened for some reason, Perl
625warns you about it, and moves on to the next file.  Note that the
626lines are printed automatically.  An error occurring during printing is
627treated as fatal.  To suppress printing use the B<-n> switch.  A B<-p>
628overrides a B<-n> switch.
629
630C<BEGIN> and C<END> blocks may be used to capture control before or after
631the implicit loop, just as in B<awk>.
632
633=item B<-P>
634
635B<NOTE: Use of -P is strongly discouraged because of its inherent
636problems, including poor portability.>
637
638This option causes your program to be run through the C preprocessor before
639compilation by Perl.  Because both comments and B<cpp> directives begin
640with the # character, you should avoid starting comments with any words
641recognized by the C preprocessor such as C<"if">, C<"else">, or C<"define">.
642
643If you're considering using C<-P>, you might also want to look at the
644Filter::cpp module from CPAN.
645
646The problems of -P include, but are not limited to:
647
648=over 10
649
650=item *
651
652The C<#!> line is stripped, so any switches there don't apply.
653
654=item *
655
656A C<-P> on a C<#!> line doesn't work.
657
658=item *
659
660B<All> lines that begin with (whitespace and) a C<#> but
661do not look like cpp commands, are stripped, including anything
662inside Perl strings, regular expressions, and here-docs .
663
664=item *
665
666In some platforms the C preprocessor knows too much: it knows about
667the C++ -style until-end-of-line comments starting with C<"//">.
668This will cause problems with common Perl constructs like
669
670    s/foo//;
671
672because after -P this will became illegal code
673
674    s/foo
675
676The workaround is to use some other quoting separator than C<"/">,
677like for example C<"!">:
678
679    s!foo!!;
680
681
682
683=item *
684
685It requires not only a working C preprocessor but also a working
686F<sed>.  If not on UNIX, you are probably out of luck on this.
687
688=item *
689
690Script line numbers are not preserved.
691
692=item *
693
694The C<-x> does not work with C<-P>.
695
696=back
697
698=item B<-s>
699
700enables rudimentary switch parsing for switches on the command
701line after the program name but before any filename arguments (or before
702an argument of B<-->).  This means you can have switches with two leading
703dashes (B<--help>).  Any switch found there is removed from @ARGV and sets the
704corresponding variable in the Perl program.  The following program
705prints "1" if the program is invoked with a B<-xyz> switch, and "abc"
706if it is invoked with B<-xyz=abc>.
707
708    #!/usr/bin/perl -s
709    if ($xyz) { print "$xyz\n" }
710
711Do note that B<--help> creates the variable ${-help}, which is not compliant
712with C<strict refs>.
713
714=item B<-S>
715
716makes Perl use the PATH environment variable to search for the
717program (unless the name of the program contains directory separators).
718
719On some platforms, this also makes Perl append suffixes to the
720filename while searching for it.  For example, on Win32 platforms,
721the ".bat" and ".cmd" suffixes are appended if a lookup for the
722original name fails, and if the name does not already end in one
723of those suffixes.  If your Perl was compiled with DEBUGGING turned
724on, using the -Dp switch to Perl shows how the search progresses.
725
726Typically this is used to emulate #! startup on platforms that
727don't support #!.  This example works on many platforms that
728have a shell compatible with Bourne shell:
729
730    #!/usr/bin/perl
731    eval 'exec /usr/bin/perl -wS $0 ${1+"$@"}'
732	    if $running_under_some_shell;
733
734The system ignores the first line and feeds the program to F</bin/sh>,
735which proceeds to try to execute the Perl program as a shell script.
736The shell executes the second line as a normal shell command, and thus
737starts up the Perl interpreter.  On some systems $0 doesn't always
738contain the full pathname, so the B<-S> tells Perl to search for the
739program if necessary.  After Perl locates the program, it parses the
740lines and ignores them because the variable $running_under_some_shell
741is never true.  If the program will be interpreted by csh, you will need
742to replace C<${1+"$@"}> with C<$*>, even though that doesn't understand
743embedded spaces (and such) in the argument list.  To start up sh rather
744than csh, some systems may have to replace the #! line with a line
745containing just a colon, which will be politely ignored by Perl.  Other
746systems can't control that, and need a totally devious construct that
747will work under any of B<csh>, B<sh>, or Perl, such as the following:
748
749	eval '(exit $?0)' && eval 'exec perl -wS $0 ${1+"$@"}'
750	& eval 'exec /usr/bin/perl -wS $0 $argv:q'
751		if $running_under_some_shell;
752
753If the filename supplied contains directory separators (i.e., is an
754absolute or relative pathname), and if that file is not found,
755platforms that append file extensions will do so and try to look
756for the file with those extensions added, one by one.
757
758On DOS-like platforms, if the program does not contain directory
759separators, it will first be searched for in the current directory
760before being searched for on the PATH.  On Unix platforms, the
761program will be searched for strictly on the PATH.
762
763=item B<-t>
764
765Like B<-T>, but taint checks will issue warnings rather than fatal
766errors.  These warnings can be controlled normally with C<no warnings
767qw(taint)>.
768
769B<NOTE: this is not a substitute for -T.> This is meant only to be
770used as a temporary development aid while securing legacy code:
771for real production code and for new secure code written from scratch
772always use the real B<-T>.
773
774=item B<-T>
775
776forces "taint" checks to be turned on so you can test them.  Ordinarily
777these checks are done only when running setuid or setgid.  It's a
778good idea to turn them on explicitly for programs that run on behalf
779of someone else whom you might not necessarily trust, such as CGI
780programs or any internet servers you might write in Perl.  See
781L<perlsec> for details.  For security reasons, this option must be
782seen by Perl quite early; usually this means it must appear early
783on the command line or in the #! line for systems which support
784that construct.
785
786=item B<-u>
787
788This obsolete switch causes Perl to dump core after compiling your
789program.  You can then in theory take this core dump and turn it
790into an executable file by using the B<undump> program (not supplied).
791This speeds startup at the expense of some disk space (which you
792can minimize by stripping the executable).  (Still, a "hello world"
793executable comes out to about 200K on my machine.)  If you want to
794execute a portion of your program before dumping, use the dump()
795operator instead.  Note: availability of B<undump> is platform
796specific and may not be available for a specific port of Perl.
797
798This switch has been superseded in favor of the new Perl code
799generator backends to the compiler.  See L<B> and L<B::Bytecode>
800for details.
801
802=item B<-U>
803
804allows Perl to do unsafe operations.  Currently the only "unsafe"
805operations are the unlinking of directories while running as superuser,
806and running setuid programs with fatal taint checks turned into
807warnings.  Note that the B<-w> switch (or the C<$^W> variable) must
808be used along with this option to actually I<generate> the
809taint-check warnings.
810
811=item B<-v>
812
813prints the version and patchlevel of your perl executable.
814
815=item B<-V>
816
817prints summary of the major perl configuration values and the current
818values of @INC.
819
820=item B<-V:>I<name>
821
822Prints to STDOUT the value of the named configuration variable.
823For example,
824
825    $ perl -V:man.dir
826
827will provide strong clues about what your MANPATH variable should
828be set to in order to access the Perl documentation.
829
830=item B<-w>
831
832prints warnings about dubious constructs, such as variable names
833that are mentioned only once and scalar variables that are used
834before being set, redefined subroutines, references to undefined
835filehandles or filehandles opened read-only that you are attempting
836to write on, values used as a number that doesn't look like numbers,
837using an array as though it were a scalar, if your subroutines
838recurse more than 100 deep, and innumerable other things.
839
840This switch really just enables the internal C<$^W> variable.  You
841can disable or promote into fatal errors specific warnings using
842C<__WARN__> hooks, as described in L<perlvar> and L<perlfunc/warn>.
843See also L<perldiag> and L<perltrap>.  A new, fine-grained warning
844facility is also available if you want to manipulate entire classes
845of warnings; see L<warnings> or L<perllexwarn>.
846
847=item B<-W>
848
849Enables all warnings regardless of C<no warnings> or C<$^W>.
850See L<perllexwarn>.
851
852=item B<-X>
853
854Disables all warnings regardless of C<use warnings> or C<$^W>.
855See L<perllexwarn>.
856
857=item B<-x> I<directory>
858
859tells Perl that the program is embedded in a larger chunk of unrelated
860ASCII text, such as in a mail message.  Leading garbage will be
861discarded until the first line that starts with #! and contains the
862string "perl".  Any meaningful switches on that line will be applied.
863If a directory name is specified, Perl will switch to that directory
864before running the program.  The B<-x> switch controls only the
865disposal of leading garbage.  The program must be terminated with
866C<__END__> if there is trailing garbage to be ignored (the program
867can process any or all of the trailing garbage via the DATA filehandle
868if desired).
869
870=back
871
872=head1 ENVIRONMENT
873
874=over 12
875
876=item HOME
877
878Used if chdir has no argument.
879
880=item LOGDIR
881
882Used if chdir has no argument and HOME is not set.
883
884=item PATH
885
886Used in executing subprocesses, and in finding the program if B<-S> is
887used.
888
889=item PERL5LIB
890
891A list of directories in which to look for Perl library
892files before looking in the standard library and the current
893directory.  Any architecture-specific directories under the specified
894locations are automatically included if they exist.  If PERL5LIB is not
895defined, PERLLIB is used.  Directories are separated (like in PATH) by
896a colon on unixish platforms and by a semicolon on Windows (the proper
897path separator being given by the command C<perl -V:path_sep>).
898
899When running taint checks (either because the program was running setuid
900or setgid, or the B<-T> switch was used), neither variable is used.
901The program should instead say:
902
903    use lib "/my/directory";
904
905=item PERL5OPT
906
907Command-line options (switches).  Switches in this variable are taken
908as if they were on every Perl command line.  Only the B<-[DIMUdmtw]>
909switches are allowed.  When running taint checks (because the program
910was running setuid or setgid, or the B<-T> switch was used), this
911variable is ignored.  If PERL5OPT begins with B<-T>, tainting will be
912enabled, and any subsequent options ignored.
913
914=item PERLIO
915
916A space (or colon) separated list of PerlIO layers. If perl is built
917to use PerlIO system for IO (the default) these layers effect perl's IO.
918
919It is conventional to start layer names with a colon e.g. C<:perlio> to
920emphasise their similarity to variable "attributes". But the code that parses
921layer specification strings (which is also used to decode the PERLIO
922environment variable) treats the colon as a separator.
923
924An unset or empty PERLIO is equivalent to C<:stdio>.
925
926The list becomes the default for I<all> perl's IO. Consequently only built-in
927layers can appear in this list, as external layers (such as :encoding()) need
928IO in  order to load them!. See L<"open pragma"|open> for how to add external
929encodings as defaults.
930
931The layers that it makes sense to include in the PERLIO environment
932variable are briefly summarised below. For more details see L<PerlIO>.
933
934=over 8
935
936=item :bytes
937
938A pseudolayer that turns I<off> the C<:utf8> flag for the layer below.
939Unlikely to be useful on its own in the global PERLIO environment variable.
940You perhaps were thinking of C<:crlf:bytes> or C<:perlio:bytes>.
941
942=item :crlf
943
944A layer which does CRLF to "\n" translation distinguishing "text" and
945"binary" files in the manner of MS-DOS and similar operating systems.
946(It currently does I<not> mimic MS-DOS as far as treating of Control-Z
947as being an end-of-file marker.)
948
949=item :mmap
950
951A layer which implements "reading" of files by using C<mmap()> to
952make (whole) file appear in the process's address space, and then
953using that as PerlIO's "buffer".
954
955=item :perlio
956
957This is a re-implementation of "stdio-like" buffering written as a
958PerlIO "layer".  As such it will call whatever layer is below it for
959its operations (typically C<:unix>).
960
961=item :pop
962
963An experimental pseudolayer that removes the topmost layer.
964Use with the same care as is reserved for nitroglycerin.
965
966=item :raw
967
968A pseudolayer that manipulates other layers.  Applying the <:raw>
969layer is equivalent to calling C<binmode($fh)>.  It makes the stream
970pass each byte as-is without any translation.  In particular CRLF
971translation, and/or :utf8 intuited from locale are disabled.
972
973Unlike in the earlier versions of Perl C<:raw> is I<not>
974just the inverse of C<:crlf> - other layers which would affect the
975binary nature of the stream are also removed or disabled.
976
977=item :stdio
978
979This layer provides PerlIO interface by wrapping system's ANSI C "stdio"
980library calls. The layer provides both buffering and IO.
981Note that C<:stdio> layer does I<not> do CRLF translation even if that
982is platforms normal behaviour. You will need a C<:crlf> layer above it
983to do that.
984
985=item :unix
986
987Low level layer which calls C<read>, C<write> and C<lseek> etc.
988
989=item :utf8
990
991A pseudolayer that turns on a flag on the layer below to tell perl
992that output should be in utf8 and that input should be regarded as
993already in utf8 form.  May be useful in PERLIO environment
994variable to make UTF-8 the default. (To turn off that behaviour
995use C<:bytes> layer.)
996
997=item :win32
998
999On Win32 platforms this I<experimental> layer uses native "handle" IO
1000rather than unix-like numeric file descriptor layer. Known to be
1001buggy in this release.
1002
1003=back
1004
1005On all platforms the default set of layers should give acceptable results.
1006
1007For UNIX platforms that will equivalent of "unix perlio" or "stdio".
1008Configure is setup to prefer "stdio" implementation if system's library
1009provides for fast access to the buffer, otherwise it uses the "unix perlio"
1010implementation.
1011
1012On Win32 the default in this release is "unix crlf". Win32's "stdio"
1013has a number of bugs/mis-features for perl IO which are somewhat
1014C compiler vendor/version dependent. Using our own C<crlf> layer as
1015the buffer avoids those issues and makes things more uniform.
1016The C<crlf> layer provides CRLF to/from "\n" conversion as well as
1017buffering.
1018
1019This release uses C<unix> as the bottom layer on Win32 and so still uses C
1020compiler's numeric file descriptor routines. There is an experimental native
1021C<win32> layer which is expected to be enhanced and should eventually be
1022the default under Win32.
1023
1024=item PERLIO_DEBUG
1025
1026If set to the name of a file or device then certain operations of PerlIO
1027sub-system will be logged to that file (opened as append). Typical uses
1028are UNIX:
1029
1030   PERLIO_DEBUG=/dev/tty perl script ...
1031
1032and Win32 approximate equivalent:
1033
1034   set PERLIO_DEBUG=CON
1035   perl script ...
1036
1037
1038=item PERLLIB
1039
1040A list of directories in which to look for Perl library
1041files before looking in the standard library and the current directory.
1042If PERL5LIB is defined, PERLLIB is not used.
1043
1044=item PERL5DB
1045
1046The command used to load the debugger code.  The default is:
1047
1048	BEGIN { require 'perl5db.pl' }
1049
1050=item PERL5SHELL (specific to the Win32 port)
1051
1052May be set to an alternative shell that perl must use internally for
1053executing "backtick" commands or system().  Default is C<cmd.exe /x/d/c>
1054on WindowsNT and C<command.com /c> on Windows95.  The value is considered
1055to be space-separated.  Precede any character that needs to be protected
1056(like a space or backslash) with a backslash.
1057
1058Note that Perl doesn't use COMSPEC for this purpose because
1059COMSPEC has a high degree of variability among users, leading to
1060portability concerns.  Besides, perl can use a shell that may not be
1061fit for interactive use, and setting COMSPEC to such a shell may
1062interfere with the proper functioning of other programs (which usually
1063look in COMSPEC to find a shell fit for interactive use).
1064
1065=item PERL_DEBUG_MSTATS
1066
1067Relevant only if perl is compiled with the malloc included with the perl
1068distribution (that is, if C<perl -V:d_mymalloc> is 'define').
1069If set, this causes memory statistics to be dumped after execution.  If set
1070to an integer greater than one, also causes memory statistics to be dumped
1071after compilation.
1072
1073=item PERL_DESTRUCT_LEVEL
1074
1075Relevant only if your perl executable was built with B<-DDEBUGGING>,
1076this controls the behavior of global destruction of objects and other
1077references.  See L<perlhack/PERL_DESTRUCT_LEVEL> for more information.
1078
1079=item PERL_DL_NONLAZY
1080
1081Set to one to have perl resolve B<all> undefined symbols when it loads
1082a dynamic library.  The default behaviour is to resolve symbols when
1083they are used.  Setting this variable is useful during testing of
1084extensions as it ensures that you get an error on misspelled function
1085names even if the test suite doesn't call it.
1086
1087=item PERL_ENCODING
1088
1089If using the C<encoding> pragma without an explicit encoding name, the
1090PERL_ENCODING environment variable is consulted for an encoding name.
1091
1092=item PERL_HASH_SEED
1093
1094(Since Perl 5.8.1.)  Used to randomise Perl's internal hash function.
1095To emulate the pre-5.8.1 behaviour, set to an integer (zero means
1096exactly the same order as 5.8.0).  "Pre-5.8.1" means, among other
1097things, that hash keys will be ordered the same between different runs
1098of Perl.
1099
1100The default behaviour is to randomise unless the PERL_HASH_SEED is set.
1101If Perl has been compiled with C<-DUSE_HASH_SEED_EXPLICIT>, the default
1102behaviour is B<not> to randomise unless the PERL_HASH_SEED is set.
1103
1104If PERL_HASH_SEED is unset or set to a non-numeric string, Perl uses
1105the pseudorandom seed supplied by the operating system and libraries.
1106This means that each different run of Perl will have a different
1107ordering of the results of keys(), values(), and each().
1108
1109B<Please note that the hash seed is sensitive information>. Hashes are
1110randomized to protect against local and remote attacks against Perl
1111code. By manually setting a seed this protection may be partially or
1112completely lost.
1113
1114See L<perlsec/"Algorithmic Complexity Attacks"> and
1115L</PERL_HASH_SEED_DEBUG> for more information.
1116
1117=item PERL_HASH_SEED_DEBUG
1118
1119(Since Perl 5.8.1.)  Set to one to display (to STDERR) the value of
1120the hash seed at the beginning of execution.  This, combined with
1121L</PERL_HASH_SEED> is intended to aid in debugging nondeterministic
1122behavior caused by hash randomization.
1123
1124B<Note that the hash seed is sensitive information>: by knowing it one
1125can craft a denial-of-service attack against Perl code, even remotely,
1126see L<perlsec/"Algorithmic Complexity Attacks"> for more information.
1127B<Do not disclose the hash seed> to people who don't need to know it.
1128See also hash_seed() of L<Hash::Util>.
1129
1130=item PERL_ROOT (specific to the VMS port)
1131
1132A translation concealed rooted logical name that contains perl and the
1133logical device for the @INC path on VMS only.  Other logical names that
1134affect perl on VMS include PERLSHR, PERL_ENV_TABLES, and
1135SYS$TIMEZONE_DIFFERENTIAL but are optional and discussed further in
1136L<perlvms> and in F<README.vms> in the Perl source distribution.
1137
1138=item PERL_SIGNALS
1139
1140In Perls 5.8.1 and later.  If set to C<unsafe> the pre-Perl-5.8.0
1141signals behaviour (immediate but unsafe) is restored.  If set to
1142C<safe> the safe (or deferred) signals are used.
1143See L<perlipc/"Deferred Signals (Safe signals)">.
1144
1145=item PERL_UNICODE
1146
1147Equivalent to the B<-C> command-line switch.  Note that this is not
1148a boolean variable-- setting this to C<"1"> is not the right way to
1149"enable Unicode" (whatever that would mean).  You can use C<"0"> to
1150"disable Unicode", though (or alternatively unset PERL_UNICODE in
1151your shell before starting Perl).  See the description of the C<-C>
1152switch for more information.
1153
1154=item SYS$LOGIN (specific to the VMS port)
1155
1156Used if chdir has no argument and HOME and LOGDIR are not set.
1157
1158=back
1159
1160Perl also has environment variables that control how Perl handles data
1161specific to particular natural languages.  See L<perllocale>.
1162
1163Apart from these, Perl uses no other environment variables, except
1164to make them available to the program being executed, and to child
1165processes.  However, programs running setuid would do well to execute
1166the following lines before doing anything else, just to keep people
1167honest:
1168
1169    $ENV{PATH}  = '/bin:/usr/bin';    # or whatever you need
1170    $ENV{SHELL} = '/bin/sh' if exists $ENV{SHELL};
1171    delete @ENV{qw(IFS CDPATH ENV BASH_ENV)};
1172