1=head1 NAME 2 3perlfunc - Perl builtin functions 4 5=head1 DESCRIPTION 6 7The functions in this section can serve as terms in an expression. 8They fall into two major categories: list operators and named unary 9operators. These differ in their precedence relationship with a 10following comma. (See the precedence table in L<perlop>.) List 11operators take more than one argument, while unary operators can never 12take more than one argument. Thus, a comma terminates the argument of 13a unary operator, but merely separates the arguments of a list 14operator. A unary operator generally provides a scalar context to its 15argument, while a list operator may provide either scalar or list 16contexts for its arguments. If it does both, the scalar arguments will 17be first, and the list argument will follow. (Note that there can ever 18be only one such list argument.) For instance, splice() has three scalar 19arguments followed by a list, whereas gethostbyname() has four scalar 20arguments. 21 22In the syntax descriptions that follow, list operators that expect a 23list (and provide list context for the elements of the list) are shown 24with LIST as an argument. Such a list may consist of any combination 25of scalar arguments or list values; the list values will be included 26in the list as if each individual element were interpolated at that 27point in the list, forming a longer single-dimensional list value. 28Elements of the LIST should be separated by commas. 29 30Any function in the list below may be used either with or without 31parentheses around its arguments. (The syntax descriptions omit the 32parentheses.) If you use the parentheses, the simple (but occasionally 33surprising) rule is this: It I<looks> like a function, therefore it I<is> a 34function, and precedence doesn't matter. Otherwise it's a list 35operator or unary operator, and precedence does matter. And whitespace 36between the function and left parenthesis doesn't count--so you need to 37be careful sometimes: 38 39 print 1+2+4; # Prints 7. 40 print(1+2) + 4; # Prints 3. 41 print (1+2)+4; # Also prints 3! 42 print +(1+2)+4; # Prints 7. 43 print ((1+2)+4); # Prints 7. 44 45If you run Perl with the B<-w> switch it can warn you about this. For 46example, the third line above produces: 47 48 print (...) interpreted as function at - line 1. 49 Useless use of integer addition in void context at - line 1. 50 51A few functions take no arguments at all, and therefore work as neither 52unary nor list operators. These include such functions as C<time> 53and C<endpwent>. For example, C<time+86_400> always means 54C<time() + 86_400>. 55 56For functions that can be used in either a scalar or list context, 57nonabortive failure is generally indicated in a scalar context by 58returning the undefined value, and in a list context by returning the 59null list. 60 61Remember the following important rule: There is B<no rule> that relates 62the behavior of an expression in list context to its behavior in scalar 63context, or vice versa. It might do two totally different things. 64Each operator and function decides which sort of value it would be most 65appropriate to return in scalar context. Some operators return the 66length of the list that would have been returned in list context. Some 67operators return the first value in the list. Some operators return the 68last value in the list. Some operators return a count of successful 69operations. In general, they do what you want, unless you want 70consistency. 71 72A named array in scalar context is quite different from what would at 73first glance appear to be a list in scalar context. You can't get a list 74like C<(1,2,3)> into being in scalar context, because the compiler knows 75the context at compile time. It would generate the scalar comma operator 76there, not the list construction version of the comma. That means it 77was never a list to start with. 78 79In general, functions in Perl that serve as wrappers for system calls 80of the same name (like chown(2), fork(2), closedir(2), etc.) all return 81true when they succeed and C<undef> otherwise, as is usually mentioned 82in the descriptions below. This is different from the C interfaces, 83which return C<-1> on failure. Exceptions to this rule are C<wait>, 84C<waitpid>, and C<syscall>. System calls also set the special C<$!> 85variable on failure. Other functions do not, except accidentally. 86 87=head2 Perl Functions by Category 88 89Here are Perl's functions (including things that look like 90functions, like some keywords and named operators) 91arranged by category. Some functions appear in more 92than one place. 93 94=over 4 95 96=item Functions for SCALARs or strings 97 98C<chomp>, C<chop>, C<chr>, C<crypt>, C<hex>, C<index>, C<lc>, C<lcfirst>, 99C<length>, C<oct>, C<ord>, C<pack>, C<q/STRING/>, C<qq/STRING/>, C<reverse>, 100C<rindex>, C<sprintf>, C<substr>, C<tr///>, C<uc>, C<ucfirst>, C<y///> 101 102=item Regular expressions and pattern matching 103 104C<m//>, C<pos>, C<quotemeta>, C<s///>, C<split>, C<study>, C<qr//> 105 106=item Numeric functions 107 108C<abs>, C<atan2>, C<cos>, C<exp>, C<hex>, C<int>, C<log>, C<oct>, C<rand>, 109C<sin>, C<sqrt>, C<srand> 110 111=item Functions for real @ARRAYs 112 113C<pop>, C<push>, C<shift>, C<splice>, C<unshift> 114 115=item Functions for list data 116 117C<grep>, C<join>, C<map>, C<qw/STRING/>, C<reverse>, C<sort>, C<unpack> 118 119=item Functions for real %HASHes 120 121C<delete>, C<each>, C<exists>, C<keys>, C<values> 122 123=item Input and output functions 124 125C<binmode>, C<close>, C<closedir>, C<dbmclose>, C<dbmopen>, C<die>, C<eof>, 126C<fileno>, C<flock>, C<format>, C<getc>, C<print>, C<printf>, C<read>, 127C<readdir>, C<rewinddir>, C<seek>, C<seekdir>, C<select>, C<syscall>, 128C<sysread>, C<sysseek>, C<syswrite>, C<tell>, C<telldir>, C<truncate>, 129C<warn>, C<write> 130 131=item Functions for fixed length data or records 132 133C<pack>, C<read>, C<syscall>, C<sysread>, C<syswrite>, C<unpack>, C<vec> 134 135=item Functions for filehandles, files, or directories 136 137C<-I<X>>, C<chdir>, C<chmod>, C<chown>, C<chroot>, C<fcntl>, C<glob>, 138C<ioctl>, C<link>, C<lstat>, C<mkdir>, C<open>, C<opendir>, 139C<readlink>, C<rename>, C<rmdir>, C<stat>, C<symlink>, C<sysopen>, 140C<umask>, C<unlink>, C<utime> 141 142=item Keywords related to the control flow of your perl program 143 144C<caller>, C<continue>, C<die>, C<do>, C<dump>, C<eval>, C<exit>, 145C<goto>, C<last>, C<next>, C<redo>, C<return>, C<sub>, C<wantarray> 146 147=item Keywords related to scoping 148 149C<caller>, C<import>, C<local>, C<my>, C<our>, C<package>, C<use> 150 151=item Miscellaneous functions 152 153C<defined>, C<dump>, C<eval>, C<formline>, C<local>, C<my>, C<our>, C<reset>, 154C<scalar>, C<undef>, C<wantarray> 155 156=item Functions for processes and process groups 157 158C<alarm>, C<exec>, C<fork>, C<getpgrp>, C<getppid>, C<getpriority>, C<kill>, 159C<pipe>, C<qx/STRING/>, C<setpgrp>, C<setpriority>, C<sleep>, C<system>, 160C<times>, C<wait>, C<waitpid> 161 162=item Keywords related to perl modules 163 164C<do>, C<import>, C<no>, C<package>, C<require>, C<use> 165 166=item Keywords related to classes and object-orientedness 167 168C<bless>, C<dbmclose>, C<dbmopen>, C<package>, C<ref>, C<tie>, C<tied>, 169C<untie>, C<use> 170 171=item Low-level socket functions 172 173C<accept>, C<bind>, C<connect>, C<getpeername>, C<getsockname>, 174C<getsockopt>, C<listen>, C<recv>, C<send>, C<setsockopt>, C<shutdown>, 175C<socket>, C<socketpair> 176 177=item System V interprocess communication functions 178 179C<msgctl>, C<msgget>, C<msgrcv>, C<msgsnd>, C<semctl>, C<semget>, C<semop>, 180C<shmctl>, C<shmget>, C<shmread>, C<shmwrite> 181 182=item Fetching user and group info 183 184C<endgrent>, C<endhostent>, C<endnetent>, C<endpwent>, C<getgrent>, 185C<getgrgid>, C<getgrnam>, C<getlogin>, C<getpwent>, C<getpwnam>, 186C<getpwuid>, C<setgrent>, C<setpwent> 187 188=item Fetching network info 189 190C<endprotoent>, C<endservent>, C<gethostbyaddr>, C<gethostbyname>, 191C<gethostent>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>, 192C<getprotobyname>, C<getprotobynumber>, C<getprotoent>, 193C<getservbyname>, C<getservbyport>, C<getservent>, C<sethostent>, 194C<setnetent>, C<setprotoent>, C<setservent> 195 196=item Time-related functions 197 198C<gmtime>, C<localtime>, C<time>, C<times> 199 200=item Functions new in perl5 201 202C<abs>, C<bless>, C<chomp>, C<chr>, C<exists>, C<formline>, C<glob>, 203C<import>, C<lc>, C<lcfirst>, C<map>, C<my>, C<no>, C<our>, C<prototype>, 204C<qx>, C<qw>, C<readline>, C<readpipe>, C<ref>, C<sub*>, C<sysopen>, C<tie>, 205C<tied>, C<uc>, C<ucfirst>, C<untie>, C<use> 206 207* - C<sub> was a keyword in perl4, but in perl5 it is an 208operator, which can be used in expressions. 209 210=item Functions obsoleted in perl5 211 212C<dbmclose>, C<dbmopen> 213 214=back 215 216=head2 Portability 217 218Perl was born in Unix and can therefore access all common Unix 219system calls. In non-Unix environments, the functionality of some 220Unix system calls may not be available, or details of the available 221functionality may differ slightly. The Perl functions affected 222by this are: 223 224C<-X>, C<binmode>, C<chmod>, C<chown>, C<chroot>, C<crypt>, 225C<dbmclose>, C<dbmopen>, C<dump>, C<endgrent>, C<endhostent>, 226C<endnetent>, C<endprotoent>, C<endpwent>, C<endservent>, C<exec>, 227C<fcntl>, C<flock>, C<fork>, C<getgrent>, C<getgrgid>, C<gethostbyname>, 228C<gethostent>, C<getlogin>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>, 229C<getppid>, C<getprgp>, C<getpriority>, C<getprotobynumber>, 230C<getprotoent>, C<getpwent>, C<getpwnam>, C<getpwuid>, 231C<getservbyport>, C<getservent>, C<getsockopt>, C<glob>, C<ioctl>, 232C<kill>, C<link>, C<lstat>, C<msgctl>, C<msgget>, C<msgrcv>, 233C<msgsnd>, C<open>, C<pipe>, C<readlink>, C<rename>, C<select>, C<semctl>, 234C<semget>, C<semop>, C<setgrent>, C<sethostent>, C<setnetent>, 235C<setpgrp>, C<setpriority>, C<setprotoent>, C<setpwent>, 236C<setservent>, C<setsockopt>, C<shmctl>, C<shmget>, C<shmread>, 237C<shmwrite>, C<socket>, C<socketpair>, 238C<stat>, C<symlink>, C<syscall>, C<sysopen>, C<system>, 239C<times>, C<truncate>, C<umask>, C<unlink>, 240C<utime>, C<wait>, C<waitpid> 241 242For more information about the portability of these functions, see 243L<perlport> and other available platform-specific documentation. 244 245=head2 Alphabetical Listing of Perl Functions 246 247=over 8 248 249=item -X FILEHANDLE 250 251=item -X EXPR 252 253=item -X 254 255A file test, where X is one of the letters listed below. This unary 256operator takes one argument, either a filename or a filehandle, and 257tests the associated file to see if something is true about it. If the 258argument is omitted, tests C<$_>, except for C<-t>, which tests STDIN. 259Unless otherwise documented, it returns C<1> for true and C<''> for false, or 260the undefined value if the file doesn't exist. Despite the funny 261names, precedence is the same as any other named unary operator, and 262the argument may be parenthesized like any other unary operator. The 263operator may be any of: 264X<-r>X<-w>X<-x>X<-o>X<-R>X<-W>X<-X>X<-O>X<-e>X<-z>X<-s>X<-f>X<-d>X<-l>X<-p> 265X<-S>X<-b>X<-c>X<-t>X<-u>X<-g>X<-k>X<-T>X<-B>X<-M>X<-A>X<-C> 266 267 -r File is readable by effective uid/gid. 268 -w File is writable by effective uid/gid. 269 -x File is executable by effective uid/gid. 270 -o File is owned by effective uid. 271 272 -R File is readable by real uid/gid. 273 -W File is writable by real uid/gid. 274 -X File is executable by real uid/gid. 275 -O File is owned by real uid. 276 277 -e File exists. 278 -z File has zero size (is empty). 279 -s File has nonzero size (returns size in bytes). 280 281 -f File is a plain file. 282 -d File is a directory. 283 -l File is a symbolic link. 284 -p File is a named pipe (FIFO), or Filehandle is a pipe. 285 -S File is a socket. 286 -b File is a block special file. 287 -c File is a character special file. 288 -t Filehandle is opened to a tty. 289 290 -u File has setuid bit set. 291 -g File has setgid bit set. 292 -k File has sticky bit set. 293 294 -T File is an ASCII text file (heuristic guess). 295 -B File is a "binary" file (opposite of -T). 296 297 -M Script start time minus file modification time, in days. 298 -A Same for access time. 299 -C Same for inode change time (Unix, may differ for other platforms) 300 301Example: 302 303 while (<>) { 304 chomp; 305 next unless -f $_; # ignore specials 306 #... 307 } 308 309The interpretation of the file permission operators C<-r>, C<-R>, 310C<-w>, C<-W>, C<-x>, and C<-X> is by default based solely on the mode 311of the file and the uids and gids of the user. There may be other 312reasons you can't actually read, write, or execute the file. Such 313reasons may be for example network filesystem access controls, ACLs 314(access control lists), read-only filesystems, and unrecognized 315executable formats. 316 317Also note that, for the superuser on the local filesystems, the C<-r>, 318C<-R>, C<-w>, and C<-W> tests always return 1, and C<-x> and C<-X> return 1 319if any execute bit is set in the mode. Scripts run by the superuser 320may thus need to do a stat() to determine the actual mode of the file, 321or temporarily set their effective uid to something else. 322 323If you are using ACLs, there is a pragma called C<filetest> that may 324produce more accurate results than the bare stat() mode bits. 325When under the C<use filetest 'access'> the above-mentioned filetests 326will test whether the permission can (not) be granted using the 327access() family of system calls. Also note that the C<-x> and C<-X> may 328under this pragma return true even if there are no execute permission 329bits set (nor any extra execute permission ACLs). This strangeness is 330due to the underlying system calls' definitions. Read the 331documentation for the C<filetest> pragma for more information. 332 333Note that C<-s/a/b/> does not do a negated substitution. Saying 334C<-exp($foo)> still works as expected, however--only single letters 335following a minus are interpreted as file tests. 336 337The C<-T> and C<-B> switches work as follows. The first block or so of the 338file is examined for odd characters such as strange control codes or 339characters with the high bit set. If too many strange characters (>30%) 340are found, it's a C<-B> file, otherwise it's a C<-T> file. Also, any file 341containing null in the first block is considered a binary file. If C<-T> 342or C<-B> is used on a filehandle, the current IO buffer is examined 343rather than the first block. Both C<-T> and C<-B> return true on a null 344file, or a file at EOF when testing a filehandle. Because you have to 345read a file to do the C<-T> test, on most occasions you want to use a C<-f> 346against the file first, as in C<next unless -f $file && -T $file>. 347 348If any of the file tests (or either the C<stat> or C<lstat> operators) are given 349the special filehandle consisting of a solitary underline, then the stat 350structure of the previous file test (or stat operator) is used, saving 351a system call. (This doesn't work with C<-t>, and you need to remember 352that lstat() and C<-l> will leave values in the stat structure for the 353symbolic link, not the real file.) (Also, if the stat buffer was filled by 354a C<lstat> call, C<-T> and C<-B> will reset it with the results of C<stat _>). 355Example: 356 357 print "Can do.\n" if -r $a || -w _ || -x _; 358 359 stat($filename); 360 print "Readable\n" if -r _; 361 print "Writable\n" if -w _; 362 print "Executable\n" if -x _; 363 print "Setuid\n" if -u _; 364 print "Setgid\n" if -g _; 365 print "Sticky\n" if -k _; 366 print "Text\n" if -T _; 367 print "Binary\n" if -B _; 368 369=item abs VALUE 370 371=item abs 372 373Returns the absolute value of its argument. 374If VALUE is omitted, uses C<$_>. 375 376=item accept NEWSOCKET,GENERICSOCKET 377 378Accepts an incoming socket connect, just as the accept(2) system call 379does. Returns the packed address if it succeeded, false otherwise. 380See the example in L<perlipc/"Sockets: Client/Server Communication">. 381 382On systems that support a close-on-exec flag on files, the flag will 383be set for the newly opened file descriptor, as determined by the 384value of $^F. See L<perlvar/$^F>. 385 386=item alarm SECONDS 387 388=item alarm 389 390Arranges to have a SIGALRM delivered to this process after the 391specified number of wallclock seconds have elapsed. If SECONDS is not 392specified, the value stored in C<$_> is used. (On some machines, 393unfortunately, the elapsed time may be up to one second less or more 394than you specified because of how seconds are counted, and process 395scheduling may delay the delivery of the signal even further.) 396 397Only one timer may be counting at once. Each call disables the 398previous timer, and an argument of C<0> may be supplied to cancel the 399previous timer without starting a new one. The returned value is the 400amount of time remaining on the previous timer. 401 402For delays of finer granularity than one second, you may use Perl's 403four-argument version of select() leaving the first three arguments 404undefined, or you might be able to use the C<syscall> interface to 405access setitimer(2) if your system supports it. The Time::HiRes 406module (from CPAN, and starting from Perl 5.8 part of the standard 407distribution) may also prove useful. 408 409It is usually a mistake to intermix C<alarm> and C<sleep> calls. 410(C<sleep> may be internally implemented in your system with C<alarm>) 411 412If you want to use C<alarm> to time out a system call you need to use an 413C<eval>/C<die> pair. You can't rely on the alarm causing the system call to 414fail with C<$!> set to C<EINTR> because Perl sets up signal handlers to 415restart system calls on some systems. Using C<eval>/C<die> always works, 416modulo the caveats given in L<perlipc/"Signals">. 417 418 eval { 419 local $SIG{ALRM} = sub { die "alarm\n" }; # NB: \n required 420 alarm $timeout; 421 $nread = sysread SOCKET, $buffer, $size; 422 alarm 0; 423 }; 424 if ($@) { 425 die unless $@ eq "alarm\n"; # propagate unexpected errors 426 # timed out 427 } 428 else { 429 # didn't 430 } 431 432For more information see L<perlipc>. 433 434=item atan2 Y,X 435 436Returns the arctangent of Y/X in the range -PI to PI. 437 438For the tangent operation, you may use the C<Math::Trig::tan> 439function, or use the familiar relation: 440 441 sub tan { sin($_[0]) / cos($_[0]) } 442 443=item bind SOCKET,NAME 444 445Binds a network address to a socket, just as the bind system call 446does. Returns true if it succeeded, false otherwise. NAME should be a 447packed address of the appropriate type for the socket. See the examples in 448L<perlipc/"Sockets: Client/Server Communication">. 449 450=item binmode FILEHANDLE, LAYER 451 452=item binmode FILEHANDLE 453 454Arranges for FILEHANDLE to be read or written in "binary" or "text" 455mode on systems where the run-time libraries distinguish between 456binary and text files. If FILEHANDLE is an expression, the value is 457taken as the name of the filehandle. Returns true on success, 458otherwise it returns C<undef> and sets C<$!> (errno). 459 460On some systems (in general, DOS and Windows-based systems) binmode() 461is necessary when you're not working with a text file. For the sake 462of portability it is a good idea to always use it when appropriate, 463and to never use it when it isn't appropriate. Also, people can 464set their I/O to be by default UTF-8 encoded Unicode, not bytes. 465 466In other words: regardless of platform, use binmode() on binary data, 467like for example images. 468 469If LAYER is present it is a single string, but may contain multiple 470directives. The directives alter the behaviour of the file handle. 471When LAYER is present using binmode on text file makes sense. 472 473If LAYER is omitted or specified as C<:raw> the filehandle is made 474suitable for passing binary data. This includes turning off possible CRLF 475translation and marking it as bytes (as opposed to Unicode characters). 476Note that, despite what may be implied in I<"Programming Perl"> (the 477Camel) or elsewhere, C<:raw> is I<not> the simply inverse of C<:crlf> 478-- other layers which would affect binary nature of the stream are 479I<also> disabled. See L<PerlIO>, L<perlrun> and the discussion about the 480PERLIO environment variable. 481 482The C<:bytes>, C<:crlf>, and C<:utf8>, and any other directives of the 483form C<:...>, are called I/O I<layers>. The C<open> pragma can be used to 484establish default I/O layers. See L<open>. 485 486I<The LAYER parameter of the binmode() function is described as "DISCIPLINE" 487in "Programming Perl, 3rd Edition". However, since the publishing of this 488book, by many known as "Camel III", the consensus of the naming of this 489functionality has moved from "discipline" to "layer". All documentation 490of this version of Perl therefore refers to "layers" rather than to 491"disciplines". Now back to the regularly scheduled documentation...> 492 493To mark FILEHANDLE as UTF-8, use C<:utf8>. 494 495In general, binmode() should be called after open() but before any I/O 496is done on the filehandle. Calling binmode() will normally flush any 497pending buffered output data (and perhaps pending input data) on the 498handle. An exception to this is the C<:encoding> layer that 499changes the default character encoding of the handle, see L<open>. 500The C<:encoding> layer sometimes needs to be called in 501mid-stream, and it doesn't flush the stream. The C<:encoding> 502also implicitly pushes on top of itself the C<:utf8> layer because 503internally Perl will operate on UTF-8 encoded Unicode characters. 504 505The operating system, device drivers, C libraries, and Perl run-time 506system all work together to let the programmer treat a single 507character (C<\n>) as the line terminator, irrespective of the external 508representation. On many operating systems, the native text file 509representation matches the internal representation, but on some 510platforms the external representation of C<\n> is made up of more than 511one character. 512 513Mac OS, all variants of Unix, and Stream_LF files on VMS use a single 514character to end each line in the external representation of text (even 515though that single character is CARRIAGE RETURN on Mac OS and LINE FEED 516on Unix and most VMS files). In other systems like OS/2, DOS and the 517various flavors of MS-Windows your program sees a C<\n> as a simple C<\cJ>, 518but what's stored in text files are the two characters C<\cM\cJ>. That 519means that, if you don't use binmode() on these systems, C<\cM\cJ> 520sequences on disk will be converted to C<\n> on input, and any C<\n> in 521your program will be converted back to C<\cM\cJ> on output. This is what 522you want for text files, but it can be disastrous for binary files. 523 524Another consequence of using binmode() (on some systems) is that 525special end-of-file markers will be seen as part of the data stream. 526For systems from the Microsoft family this means that if your binary 527data contains C<\cZ>, the I/O subsystem will regard it as the end of 528the file, unless you use binmode(). 529 530binmode() is not only important for readline() and print() operations, 531but also when using read(), seek(), sysread(), syswrite() and tell() 532(see L<perlport> for more details). See the C<$/> and C<$\> variables 533in L<perlvar> for how to manually set your input and output 534line-termination sequences. 535 536=item bless REF,CLASSNAME 537 538=item bless REF 539 540This function tells the thingy referenced by REF that it is now an object 541in the CLASSNAME package. If CLASSNAME is omitted, the current package 542is used. Because a C<bless> is often the last thing in a constructor, 543it returns the reference for convenience. Always use the two-argument 544version if the function doing the blessing might be inherited by a 545derived class. See L<perltoot> and L<perlobj> for more about the blessing 546(and blessings) of objects. 547 548Consider always blessing objects in CLASSNAMEs that are mixed case. 549Namespaces with all lowercase names are considered reserved for 550Perl pragmata. Builtin types have all uppercase names, so to prevent 551confusion, you may wish to avoid such package names as well. Make sure 552that CLASSNAME is a true value. 553 554See L<perlmod/"Perl Modules">. 555 556=item caller EXPR 557 558=item caller 559 560Returns the context of the current subroutine call. In scalar context, 561returns the caller's package name if there is a caller, that is, if 562we're in a subroutine or C<eval> or C<require>, and the undefined value 563otherwise. In list context, returns 564 565 ($package, $filename, $line) = caller; 566 567With EXPR, it returns some extra information that the debugger uses to 568print a stack trace. The value of EXPR indicates how many call frames 569to go back before the current one. 570 571 ($package, $filename, $line, $subroutine, $hasargs, 572 $wantarray, $evaltext, $is_require, $hints, $bitmask) = caller($i); 573 574Here $subroutine may be C<(eval)> if the frame is not a subroutine 575call, but an C<eval>. In such a case additional elements $evaltext and 576C<$is_require> are set: C<$is_require> is true if the frame is created by a 577C<require> or C<use> statement, $evaltext contains the text of the 578C<eval EXPR> statement. In particular, for an C<eval BLOCK> statement, 579$filename is C<(eval)>, but $evaltext is undefined. (Note also that 580each C<use> statement creates a C<require> frame inside an C<eval EXPR> 581frame.) $subroutine may also be C<(unknown)> if this particular 582subroutine happens to have been deleted from the symbol table. 583C<$hasargs> is true if a new instance of C<@_> was set up for the frame. 584C<$hints> and C<$bitmask> contain pragmatic hints that the caller was 585compiled with. The C<$hints> and C<$bitmask> values are subject to change 586between versions of Perl, and are not meant for external use. 587 588Furthermore, when called from within the DB package, caller returns more 589detailed information: it sets the list variable C<@DB::args> to be the 590arguments with which the subroutine was invoked. 591 592Be aware that the optimizer might have optimized call frames away before 593C<caller> had a chance to get the information. That means that C<caller(N)> 594might not return information about the call frame you expect it do, for 595C<< N > 1 >>. In particular, C<@DB::args> might have information from the 596previous time C<caller> was called. 597 598=item chdir EXPR 599 600Changes the working directory to EXPR, if possible. If EXPR is omitted, 601changes to the directory specified by C<$ENV{HOME}>, if set; if not, 602changes to the directory specified by C<$ENV{LOGDIR}>. (Under VMS, the 603variable C<$ENV{SYS$LOGIN}> is also checked, and used if it is set.) If 604neither is set, C<chdir> does nothing. It returns true upon success, 605false otherwise. See the example under C<die>. 606 607=item chmod LIST 608 609Changes the permissions of a list of files. The first element of the 610list must be the numerical mode, which should probably be an octal 611number, and which definitely should I<not> a string of octal digits: 612C<0644> is okay, C<'0644'> is not. Returns the number of files 613successfully changed. See also L</oct>, if all you have is a string. 614 615 $cnt = chmod 0755, 'foo', 'bar'; 616 chmod 0755, @executables; 617 $mode = '0644'; chmod $mode, 'foo'; # !!! sets mode to 618 # --w----r-T 619 $mode = '0644'; chmod oct($mode), 'foo'; # this is better 620 $mode = 0644; chmod $mode, 'foo'; # this is best 621 622You can also import the symbolic C<S_I*> constants from the Fcntl 623module: 624 625 use Fcntl ':mode'; 626 627 chmod S_IRWXU|S_IRGRP|S_IXGRP|S_IROTH|S_IXOTH, @executables; 628 # This is identical to the chmod 0755 of the above example. 629 630=item chomp VARIABLE 631 632=item chomp( LIST ) 633 634=item chomp 635 636This safer version of L</chop> removes any trailing string 637that corresponds to the current value of C<$/> (also known as 638$INPUT_RECORD_SEPARATOR in the C<English> module). It returns the total 639number of characters removed from all its arguments. It's often used to 640remove the newline from the end of an input record when you're worried 641that the final record may be missing its newline. When in paragraph 642mode (C<$/ = "">), it removes all trailing newlines from the string. 643When in slurp mode (C<$/ = undef>) or fixed-length record mode (C<$/> is 644a reference to an integer or the like, see L<perlvar>) chomp() won't 645remove anything. 646If VARIABLE is omitted, it chomps C<$_>. Example: 647 648 while (<>) { 649 chomp; # avoid \n on last field 650 @array = split(/:/); 651 # ... 652 } 653 654If VARIABLE is a hash, it chomps the hash's values, but not its keys. 655 656You can actually chomp anything that's an lvalue, including an assignment: 657 658 chomp($cwd = `pwd`); 659 chomp($answer = <STDIN>); 660 661If you chomp a list, each element is chomped, and the total number of 662characters removed is returned. 663 664If the C<encoding> pragma is in scope then the lengths returned are 665calculated from the length of C<$/> in Unicode characters, which is not 666always the same as the length of C<$/> in the native encoding. 667 668Note that parentheses are necessary when you're chomping anything 669that is not a simple variable. This is because C<chomp $cwd = `pwd`;> 670is interpreted as C<(chomp $cwd) = `pwd`;>, rather than as 671C<chomp( $cwd = `pwd` )> which you might expect. Similarly, 672C<chomp $a, $b> is interpreted as C<chomp($a), $b> rather than 673as C<chomp($a, $b)>. 674 675=item chop VARIABLE 676 677=item chop( LIST ) 678 679=item chop 680 681Chops off the last character of a string and returns the character 682chopped. It is much more efficient than C<s/.$//s> because it neither 683scans nor copies the string. If VARIABLE is omitted, chops C<$_>. 684If VARIABLE is a hash, it chops the hash's values, but not its keys. 685 686You can actually chop anything that's an lvalue, including an assignment. 687 688If you chop a list, each element is chopped. Only the value of the 689last C<chop> is returned. 690 691Note that C<chop> returns the last character. To return all but the last 692character, use C<substr($string, 0, -1)>. 693 694See also L</chomp>. 695 696=item chown LIST 697 698Changes the owner (and group) of a list of files. The first two 699elements of the list must be the I<numeric> uid and gid, in that 700order. A value of -1 in either position is interpreted by most 701systems to leave that value unchanged. Returns the number of files 702successfully changed. 703 704 $cnt = chown $uid, $gid, 'foo', 'bar'; 705 chown $uid, $gid, @filenames; 706 707Here's an example that looks up nonnumeric uids in the passwd file: 708 709 print "User: "; 710 chomp($user = <STDIN>); 711 print "Files: "; 712 chomp($pattern = <STDIN>); 713 714 ($login,$pass,$uid,$gid) = getpwnam($user) 715 or die "$user not in passwd file"; 716 717 @ary = glob($pattern); # expand filenames 718 chown $uid, $gid, @ary; 719 720On most systems, you are not allowed to change the ownership of the 721file unless you're the superuser, although you should be able to change 722the group to any of your secondary groups. On insecure systems, these 723restrictions may be relaxed, but this is not a portable assumption. 724On POSIX systems, you can detect this condition this way: 725 726 use POSIX qw(sysconf _PC_CHOWN_RESTRICTED); 727 $can_chown_giveaway = not sysconf(_PC_CHOWN_RESTRICTED); 728 729=item chr NUMBER 730 731=item chr 732 733Returns the character represented by that NUMBER in the character set. 734For example, C<chr(65)> is C<"A"> in either ASCII or Unicode, and 735chr(0x263a) is a Unicode smiley face. Note that characters from 128 736to 255 (inclusive) are by default not encoded in UTF-8 Unicode for 737backward compatibility reasons (but see L<encoding>). 738 739If NUMBER is omitted, uses C<$_>. 740 741For the reverse, use L</ord>. 742 743Note that under the C<bytes> pragma the NUMBER is masked to 744the low eight bits. 745 746See L<perlunicode> and L<encoding> for more about Unicode. 747 748=item chroot FILENAME 749 750=item chroot 751 752This function works like the system call by the same name: it makes the 753named directory the new root directory for all further pathnames that 754begin with a C</> by your process and all its children. (It doesn't 755change your current working directory, which is unaffected.) For security 756reasons, this call is restricted to the superuser. If FILENAME is 757omitted, does a C<chroot> to C<$_>. 758 759=item close FILEHANDLE 760 761=item close 762 763Closes the file or pipe associated with the file handle, returning 764true only if IO buffers are successfully flushed and closes the system 765file descriptor. Closes the currently selected filehandle if the 766argument is omitted. 767 768You don't have to close FILEHANDLE if you are immediately going to do 769another C<open> on it, because C<open> will close it for you. (See 770C<open>.) However, an explicit C<close> on an input file resets the line 771counter (C<$.>), while the implicit close done by C<open> does not. 772 773If the file handle came from a piped open, C<close> will additionally 774return false if one of the other system calls involved fails, or if the 775program exits with non-zero status. (If the only problem was that the 776program exited non-zero, C<$!> will be set to C<0>.) Closing a pipe 777also waits for the process executing on the pipe to complete, in case you 778want to look at the output of the pipe afterwards, and 779implicitly puts the exit status value of that command into C<$?>. 780 781Prematurely closing the read end of a pipe (i.e. before the process 782writing to it at the other end has closed it) will result in a 783SIGPIPE being delivered to the writer. If the other end can't 784handle that, be sure to read all the data before closing the pipe. 785 786Example: 787 788 open(OUTPUT, '|sort >foo') # pipe to sort 789 or die "Can't start sort: $!"; 790 #... # print stuff to output 791 close OUTPUT # wait for sort to finish 792 or warn $! ? "Error closing sort pipe: $!" 793 : "Exit status $? from sort"; 794 open(INPUT, 'foo') # get sort's results 795 or die "Can't open 'foo' for input: $!"; 796 797FILEHANDLE may be an expression whose value can be used as an indirect 798filehandle, usually the real filehandle name. 799 800=item closedir DIRHANDLE 801 802Closes a directory opened by C<opendir> and returns the success of that 803system call. 804 805=item connect SOCKET,NAME 806 807Attempts to connect to a remote socket, just as the connect system call 808does. Returns true if it succeeded, false otherwise. NAME should be a 809packed address of the appropriate type for the socket. See the examples in 810L<perlipc/"Sockets: Client/Server Communication">. 811 812=item continue BLOCK 813 814Actually a flow control statement rather than a function. If there is a 815C<continue> BLOCK attached to a BLOCK (typically in a C<while> or 816C<foreach>), it is always executed just before the conditional is about to 817be evaluated again, just like the third part of a C<for> loop in C. Thus 818it can be used to increment a loop variable, even when the loop has been 819continued via the C<next> statement (which is similar to the C C<continue> 820statement). 821 822C<last>, C<next>, or C<redo> may appear within a C<continue> 823block. C<last> and C<redo> will behave as if they had been executed within 824the main block. So will C<next>, but since it will execute a C<continue> 825block, it may be more entertaining. 826 827 while (EXPR) { 828 ### redo always comes here 829 do_something; 830 } continue { 831 ### next always comes here 832 do_something_else; 833 # then back the top to re-check EXPR 834 } 835 ### last always comes here 836 837Omitting the C<continue> section is semantically equivalent to using an 838empty one, logically enough. In that case, C<next> goes directly back 839to check the condition at the top of the loop. 840 841=item cos EXPR 842 843=item cos 844 845Returns the cosine of EXPR (expressed in radians). If EXPR is omitted, 846takes cosine of C<$_>. 847 848For the inverse cosine operation, you may use the C<Math::Trig::acos()> 849function, or use this relation: 850 851 sub acos { atan2( sqrt(1 - $_[0] * $_[0]), $_[0] ) } 852 853=item crypt PLAINTEXT,SALT 854 855Encrypts a string exactly like the crypt(3) function in the C library 856(assuming that you actually have a version there that has not been 857extirpated as a potential munition). This can prove useful for checking 858the password file for lousy passwords, amongst other things. Only the 859guys wearing white hats should do this. 860 861Note that L<crypt|/crypt> is intended to be a one-way function, much like 862breaking eggs to make an omelette. There is no (known) corresponding 863decrypt function (in other words, the crypt() is a one-way hash 864function). As a result, this function isn't all that useful for 865cryptography. (For that, see your nearby CPAN mirror.) 866 867When verifying an existing encrypted string you should use the 868encrypted text as the salt (like C<crypt($plain, $crypted) eq 869$crypted>). This allows your code to work with the standard L<crypt|/crypt> 870and with more exotic implementations. In other words, do not assume 871anything about the returned string itself, or how many bytes in 872the encrypted string matter. 873 874Traditionally the result is a string of 13 bytes: two first bytes of 875the salt, followed by 11 bytes from the set C<[./0-9A-Za-z]>, and only 876the first eight bytes of the encrypted string mattered, but 877alternative hashing schemes (like MD5), higher level security schemes 878(like C2), and implementations on non-UNIX platforms may produce 879different strings. 880 881When choosing a new salt create a random two character string whose 882characters come from the set C<[./0-9A-Za-z]> (like C<join '', ('.', 883'/', 0..9, 'A'..'Z', 'a'..'z')[rand 64, rand 64]>). This set of 884characters is just a recommendation; the characters allowed in 885the salt depend solely on your system's crypt library, and Perl can't 886restrict what salts C<crypt()> accepts. 887 888Here's an example that makes sure that whoever runs this program knows 889their own password: 890 891 $pwd = (getpwuid($<))[1]; 892 893 system "stty -echo"; 894 print "Password: "; 895 chomp($word = <STDIN>); 896 print "\n"; 897 system "stty echo"; 898 899 if (crypt($word, $pwd) ne $pwd) { 900 die "Sorry...\n"; 901 } else { 902 print "ok\n"; 903 } 904 905Of course, typing in your own password to whoever asks you 906for it is unwise. 907 908The L<crypt|/crypt> function is unsuitable for encrypting large quantities 909of data, not least of all because you can't get the information 910back. Look at the F<by-module/Crypt> and F<by-module/PGP> directories 911on your favorite CPAN mirror for a slew of potentially useful 912modules. 913 914If using crypt() on a Unicode string (which I<potentially> has 915characters with codepoints above 255), Perl tries to make sense 916of the situation by trying to downgrade (a copy of the string) 917the string back to an eight-bit byte string before calling crypt() 918(on that copy). If that works, good. If not, crypt() dies with 919C<Wide character in crypt>. 920 921=item dbmclose HASH 922 923[This function has been largely superseded by the C<untie> function.] 924 925Breaks the binding between a DBM file and a hash. 926 927=item dbmopen HASH,DBNAME,MASK 928 929[This function has been largely superseded by the C<tie> function.] 930 931This binds a dbm(3), ndbm(3), sdbm(3), gdbm(3), or Berkeley DB file to a 932hash. HASH is the name of the hash. (Unlike normal C<open>, the first 933argument is I<not> a filehandle, even though it looks like one). DBNAME 934is the name of the database (without the F<.dir> or F<.pag> extension if 935any). If the database does not exist, it is created with protection 936specified by MASK (as modified by the C<umask>). If your system supports 937only the older DBM functions, you may perform only one C<dbmopen> in your 938program. In older versions of Perl, if your system had neither DBM nor 939ndbm, calling C<dbmopen> produced a fatal error; it now falls back to 940sdbm(3). 941 942If you don't have write access to the DBM file, you can only read hash 943variables, not set them. If you want to test whether you can write, 944either use file tests or try setting a dummy hash entry inside an C<eval>, 945which will trap the error. 946 947Note that functions such as C<keys> and C<values> may return huge lists 948when used on large DBM files. You may prefer to use the C<each> 949function to iterate over large DBM files. Example: 950 951 # print out history file offsets 952 dbmopen(%HIST,'/usr/lib/news/history',0666); 953 while (($key,$val) = each %HIST) { 954 print $key, ' = ', unpack('L',$val), "\n"; 955 } 956 dbmclose(%HIST); 957 958See also L<AnyDBM_File> for a more general description of the pros and 959cons of the various dbm approaches, as well as L<DB_File> for a particularly 960rich implementation. 961 962You can control which DBM library you use by loading that library 963before you call dbmopen(): 964 965 use DB_File; 966 dbmopen(%NS_Hist, "$ENV{HOME}/.netscape/history.db") 967 or die "Can't open netscape history file: $!"; 968 969=item defined EXPR 970 971=item defined 972 973Returns a Boolean value telling whether EXPR has a value other than 974the undefined value C<undef>. If EXPR is not present, C<$_> will be 975checked. 976 977Many operations return C<undef> to indicate failure, end of file, 978system error, uninitialized variable, and other exceptional 979conditions. This function allows you to distinguish C<undef> from 980other values. (A simple Boolean test will not distinguish among 981C<undef>, zero, the empty string, and C<"0">, which are all equally 982false.) Note that since C<undef> is a valid scalar, its presence 983doesn't I<necessarily> indicate an exceptional condition: C<pop> 984returns C<undef> when its argument is an empty array, I<or> when the 985element to return happens to be C<undef>. 986 987You may also use C<defined(&func)> to check whether subroutine C<&func> 988has ever been defined. The return value is unaffected by any forward 989declarations of C<&func>. Note that a subroutine which is not defined 990may still be callable: its package may have an C<AUTOLOAD> method that 991makes it spring into existence the first time that it is called -- see 992L<perlsub>. 993 994Use of C<defined> on aggregates (hashes and arrays) is deprecated. It 995used to report whether memory for that aggregate has ever been 996allocated. This behavior may disappear in future versions of Perl. 997You should instead use a simple test for size: 998 999 if (@an_array) { print "has array elements\n" } 1000 if (%a_hash) { print "has hash members\n" } 1001 1002When used on a hash element, it tells you whether the value is defined, 1003not whether the key exists in the hash. Use L</exists> for the latter 1004purpose. 1005 1006Examples: 1007 1008 print if defined $switch{'D'}; 1009 print "$val\n" while defined($val = pop(@ary)); 1010 die "Can't readlink $sym: $!" 1011 unless defined($value = readlink $sym); 1012 sub foo { defined &$bar ? &$bar(@_) : die "No bar"; } 1013 $debugging = 0 unless defined $debugging; 1014 1015Note: Many folks tend to overuse C<defined>, and then are surprised to 1016discover that the number C<0> and C<""> (the zero-length string) are, in fact, 1017defined values. For example, if you say 1018 1019 "ab" =~ /a(.*)b/; 1020 1021The pattern match succeeds, and C<$1> is defined, despite the fact that it 1022matched "nothing". But it didn't really match nothing--rather, it 1023matched something that happened to be zero characters long. This is all 1024very above-board and honest. When a function returns an undefined value, 1025it's an admission that it couldn't give you an honest answer. So you 1026should use C<defined> only when you're questioning the integrity of what 1027you're trying to do. At other times, a simple comparison to C<0> or C<""> is 1028what you want. 1029 1030See also L</undef>, L</exists>, L</ref>. 1031 1032=item delete EXPR 1033 1034Given an expression that specifies a hash element, array element, hash slice, 1035or array slice, deletes the specified element(s) from the hash or array. 1036In the case of an array, if the array elements happen to be at the end, 1037the size of the array will shrink to the highest element that tests 1038true for exists() (or 0 if no such element exists). 1039 1040Returns a list with the same number of elements as the number of elements 1041for which deletion was attempted. Each element of that list consists of 1042either the value of the element deleted, or the undefined value. In scalar 1043context, this means that you get the value of the last element deleted (or 1044the undefined value if that element did not exist). 1045 1046 %hash = (foo => 11, bar => 22, baz => 33); 1047 $scalar = delete $hash{foo}; # $scalar is 11 1048 $scalar = delete @hash{qw(foo bar)}; # $scalar is 22 1049 @array = delete @hash{qw(foo bar baz)}; # @array is (undef,undef,33) 1050 1051Deleting from C<%ENV> modifies the environment. Deleting from 1052a hash tied to a DBM file deletes the entry from the DBM file. Deleting 1053from a C<tie>d hash or array may not necessarily return anything. 1054 1055Deleting an array element effectively returns that position of the array 1056to its initial, uninitialized state. Subsequently testing for the same 1057element with exists() will return false. Note that deleting array 1058elements in the middle of an array will not shift the index of the ones 1059after them down--use splice() for that. See L</exists>. 1060 1061The following (inefficiently) deletes all the values of %HASH and @ARRAY: 1062 1063 foreach $key (keys %HASH) { 1064 delete $HASH{$key}; 1065 } 1066 1067 foreach $index (0 .. $#ARRAY) { 1068 delete $ARRAY[$index]; 1069 } 1070 1071And so do these: 1072 1073 delete @HASH{keys %HASH}; 1074 1075 delete @ARRAY[0 .. $#ARRAY]; 1076 1077But both of these are slower than just assigning the empty list 1078or undefining %HASH or @ARRAY: 1079 1080 %HASH = (); # completely empty %HASH 1081 undef %HASH; # forget %HASH ever existed 1082 1083 @ARRAY = (); # completely empty @ARRAY 1084 undef @ARRAY; # forget @ARRAY ever existed 1085 1086Note that the EXPR can be arbitrarily complicated as long as the final 1087operation is a hash element, array element, hash slice, or array slice 1088lookup: 1089 1090 delete $ref->[$x][$y]{$key}; 1091 delete @{$ref->[$x][$y]}{$key1, $key2, @morekeys}; 1092 1093 delete $ref->[$x][$y][$index]; 1094 delete @{$ref->[$x][$y]}[$index1, $index2, @moreindices]; 1095 1096=item die LIST 1097 1098Outside an C<eval>, prints the value of LIST to C<STDERR> and 1099exits with the current value of C<$!> (errno). If C<$!> is C<0>, 1100exits with the value of C<<< ($? >> 8) >>> (backtick `command` 1101status). If C<<< ($? >> 8) >>> is C<0>, exits with C<255>. Inside 1102an C<eval(),> the error message is stuffed into C<$@> and the 1103C<eval> is terminated with the undefined value. This makes 1104C<die> the way to raise an exception. 1105 1106Equivalent examples: 1107 1108 die "Can't cd to spool: $!\n" unless chdir '/usr/spool/news'; 1109 chdir '/usr/spool/news' or die "Can't cd to spool: $!\n" 1110 1111If the last element of LIST does not end in a newline, the current 1112script line number and input line number (if any) are also printed, 1113and a newline is supplied. Note that the "input line number" (also 1114known as "chunk") is subject to whatever notion of "line" happens to 1115be currently in effect, and is also available as the special variable 1116C<$.>. See L<perlvar/"$/"> and L<perlvar/"$.">. 1117 1118Hint: sometimes appending C<", stopped"> to your message will cause it 1119to make better sense when the string C<"at foo line 123"> is appended. 1120Suppose you are running script "canasta". 1121 1122 die "/etc/games is no good"; 1123 die "/etc/games is no good, stopped"; 1124 1125produce, respectively 1126 1127 /etc/games is no good at canasta line 123. 1128 /etc/games is no good, stopped at canasta line 123. 1129 1130See also exit(), warn(), and the Carp module. 1131 1132If LIST is empty and C<$@> already contains a value (typically from a 1133previous eval) that value is reused after appending C<"\t...propagated">. 1134This is useful for propagating exceptions: 1135 1136 eval { ... }; 1137 die unless $@ =~ /Expected exception/; 1138 1139If LIST is empty and C<$@> contains an object reference that has a 1140C<PROPAGATE> method, that method will be called with additional file 1141and line number parameters. The return value replaces the value in 1142C<$@>. ie. as if C<< $@ = eval { $@->PROPAGATE(__FILE__, __LINE__) }; >> 1143were called. 1144 1145If C<$@> is empty then the string C<"Died"> is used. 1146 1147die() can also be called with a reference argument. If this happens to be 1148trapped within an eval(), $@ contains the reference. This behavior permits 1149a more elaborate exception handling implementation using objects that 1150maintain arbitrary state about the nature of the exception. Such a scheme 1151is sometimes preferable to matching particular string values of $@ using 1152regular expressions. Here's an example: 1153 1154 eval { ... ; die Some::Module::Exception->new( FOO => "bar" ) }; 1155 if ($@) { 1156 if (ref($@) && UNIVERSAL::isa($@,"Some::Module::Exception")) { 1157 # handle Some::Module::Exception 1158 } 1159 else { 1160 # handle all other possible exceptions 1161 } 1162 } 1163 1164Because perl will stringify uncaught exception messages before displaying 1165them, you may want to overload stringification operations on such custom 1166exception objects. See L<overload> for details about that. 1167 1168You can arrange for a callback to be run just before the C<die> 1169does its deed, by setting the C<$SIG{__DIE__}> hook. The associated 1170handler will be called with the error text and can change the error 1171message, if it sees fit, by calling C<die> again. See 1172L<perlvar/$SIG{expr}> for details on setting C<%SIG> entries, and 1173L<"eval BLOCK"> for some examples. Although this feature was meant 1174to be run only right before your program was to exit, this is not 1175currently the case--the C<$SIG{__DIE__}> hook is currently called 1176even inside eval()ed blocks/strings! If one wants the hook to do 1177nothing in such situations, put 1178 1179 die @_ if $^S; 1180 1181as the first line of the handler (see L<perlvar/$^S>). Because 1182this promotes strange action at a distance, this counterintuitive 1183behavior may be fixed in a future release. 1184 1185=item do BLOCK 1186 1187Not really a function. Returns the value of the last command in the 1188sequence of commands indicated by BLOCK. When modified by a loop 1189modifier, executes the BLOCK once before testing the loop condition. 1190(On other statements the loop modifiers test the conditional first.) 1191 1192C<do BLOCK> does I<not> count as a loop, so the loop control statements 1193C<next>, C<last>, or C<redo> cannot be used to leave or restart the block. 1194See L<perlsyn> for alternative strategies. 1195 1196=item do SUBROUTINE(LIST) 1197 1198A deprecated form of subroutine call. See L<perlsub>. 1199 1200=item do EXPR 1201 1202Uses the value of EXPR as a filename and executes the contents of the 1203file as a Perl script. Its primary use is to include subroutines 1204from a Perl subroutine library. 1205 1206 do 'stat.pl'; 1207 1208is just like 1209 1210 eval `cat stat.pl`; 1211 1212except that it's more efficient and concise, keeps track of the current 1213filename for error messages, searches the @INC libraries, and updates 1214C<%INC> if the file is found. See L<perlvar/Predefined Names> for these 1215variables. It also differs in that code evaluated with C<do FILENAME> 1216cannot see lexicals in the enclosing scope; C<eval STRING> does. It's the 1217same, however, in that it does reparse the file every time you call it, 1218so you probably don't want to do this inside a loop. 1219 1220If C<do> cannot read the file, it returns undef and sets C<$!> to the 1221error. If C<do> can read the file but cannot compile it, it 1222returns undef and sets an error message in C<$@>. If the file is 1223successfully compiled, C<do> returns the value of the last expression 1224evaluated. 1225 1226Note that inclusion of library modules is better done with the 1227C<use> and C<require> operators, which also do automatic error checking 1228and raise an exception if there's a problem. 1229 1230You might like to use C<do> to read in a program configuration 1231file. Manual error checking can be done this way: 1232 1233 # read in config files: system first, then user 1234 for $file ("/share/prog/defaults.rc", 1235 "$ENV{HOME}/.someprogrc") 1236 { 1237 unless ($return = do $file) { 1238 warn "couldn't parse $file: $@" if $@; 1239 warn "couldn't do $file: $!" unless defined $return; 1240 warn "couldn't run $file" unless $return; 1241 } 1242 } 1243 1244=item dump LABEL 1245 1246=item dump 1247 1248This function causes an immediate core dump. See also the B<-u> 1249command-line switch in L<perlrun>, which does the same thing. 1250Primarily this is so that you can use the B<undump> program (not 1251supplied) to turn your core dump into an executable binary after 1252having initialized all your variables at the beginning of the 1253program. When the new binary is executed it will begin by executing 1254a C<goto LABEL> (with all the restrictions that C<goto> suffers). 1255Think of it as a goto with an intervening core dump and reincarnation. 1256If C<LABEL> is omitted, restarts the program from the top. 1257 1258B<WARNING>: Any files opened at the time of the dump will I<not> 1259be open any more when the program is reincarnated, with possible 1260resulting confusion on the part of Perl. 1261 1262This function is now largely obsolete, partly because it's very 1263hard to convert a core file into an executable, and because the 1264real compiler backends for generating portable bytecode and compilable 1265C code have superseded it. That's why you should now invoke it as 1266C<CORE::dump()>, if you don't want to be warned against a possible 1267typo. 1268 1269If you're looking to use L<dump> to speed up your program, consider 1270generating bytecode or native C code as described in L<perlcc>. If 1271you're just trying to accelerate a CGI script, consider using the 1272C<mod_perl> extension to B<Apache>, or the CPAN module, CGI::Fast. 1273You might also consider autoloading or selfloading, which at least 1274make your program I<appear> to run faster. 1275 1276=item each HASH 1277 1278When called in list context, returns a 2-element list consisting of the 1279key and value for the next element of a hash, so that you can iterate over 1280it. When called in scalar context, returns only the key for the next 1281element in the hash. 1282 1283Entries are returned in an apparently random order. The actual random 1284order is subject to change in future versions of perl, but it is 1285guaranteed to be in the same order as either the C<keys> or C<values> 1286function would produce on the same (unmodified) hash. Since Perl 12875.8.1 the ordering is different even between different runs of Perl 1288for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">). 1289 1290When the hash is entirely read, a null array is returned in list context 1291(which when assigned produces a false (C<0>) value), and C<undef> in 1292scalar context. The next call to C<each> after that will start iterating 1293again. There is a single iterator for each hash, shared by all C<each>, 1294C<keys>, and C<values> function calls in the program; it can be reset by 1295reading all the elements from the hash, or by evaluating C<keys HASH> or 1296C<values HASH>. If you add or delete elements of a hash while you're 1297iterating over it, you may get entries skipped or duplicated, so 1298don't. Exception: It is always safe to delete the item most recently 1299returned by C<each()>, which means that the following code will work: 1300 1301 while (($key, $value) = each %hash) { 1302 print $key, "\n"; 1303 delete $hash{$key}; # This is safe 1304 } 1305 1306The following prints out your environment like the printenv(1) program, 1307only in a different order: 1308 1309 while (($key,$value) = each %ENV) { 1310 print "$key=$value\n"; 1311 } 1312 1313See also C<keys>, C<values> and C<sort>. 1314 1315=item eof FILEHANDLE 1316 1317=item eof () 1318 1319=item eof 1320 1321Returns 1 if the next read on FILEHANDLE will return end of file, or if 1322FILEHANDLE is not open. FILEHANDLE may be an expression whose value 1323gives the real filehandle. (Note that this function actually 1324reads a character and then C<ungetc>s it, so isn't very useful in an 1325interactive context.) Do not read from a terminal file (or call 1326C<eof(FILEHANDLE)> on it) after end-of-file is reached. File types such 1327as terminals may lose the end-of-file condition if you do. 1328 1329An C<eof> without an argument uses the last file read. Using C<eof()> 1330with empty parentheses is very different. It refers to the pseudo file 1331formed from the files listed on the command line and accessed via the 1332C<< <> >> operator. Since C<< <> >> isn't explicitly opened, 1333as a normal filehandle is, an C<eof()> before C<< <> >> has been 1334used will cause C<@ARGV> to be examined to determine if input is 1335available. Similarly, an C<eof()> after C<< <> >> has returned 1336end-of-file will assume you are processing another C<@ARGV> list, 1337and if you haven't set C<@ARGV>, will read input from C<STDIN>; 1338see L<perlop/"I/O Operators">. 1339 1340In a C<< while (<>) >> loop, C<eof> or C<eof(ARGV)> can be used to 1341detect the end of each file, C<eof()> will only detect the end of the 1342last file. Examples: 1343 1344 # reset line numbering on each input file 1345 while (<>) { 1346 next if /^\s*#/; # skip comments 1347 print "$.\t$_"; 1348 } continue { 1349 close ARGV if eof; # Not eof()! 1350 } 1351 1352 # insert dashes just before last line of last file 1353 while (<>) { 1354 if (eof()) { # check for end of last file 1355 print "--------------\n"; 1356 } 1357 print; 1358 last if eof(); # needed if we're reading from a terminal 1359 } 1360 1361Practical hint: you almost never need to use C<eof> in Perl, because the 1362input operators typically return C<undef> when they run out of data, or if 1363there was an error. 1364 1365=item eval EXPR 1366 1367=item eval BLOCK 1368 1369In the first form, the return value of EXPR is parsed and executed as if it 1370were a little Perl program. The value of the expression (which is itself 1371determined within scalar context) is first parsed, and if there weren't any 1372errors, executed in the lexical context of the current Perl program, so 1373that any variable settings or subroutine and format definitions remain 1374afterwards. Note that the value is parsed every time the eval executes. 1375If EXPR is omitted, evaluates C<$_>. This form is typically used to 1376delay parsing and subsequent execution of the text of EXPR until run time. 1377 1378In the second form, the code within the BLOCK is parsed only once--at the 1379same time the code surrounding the eval itself was parsed--and executed 1380within the context of the current Perl program. This form is typically 1381used to trap exceptions more efficiently than the first (see below), while 1382also providing the benefit of checking the code within BLOCK at compile 1383time. 1384 1385The final semicolon, if any, may be omitted from the value of EXPR or within 1386the BLOCK. 1387 1388In both forms, the value returned is the value of the last expression 1389evaluated inside the mini-program; a return statement may be also used, just 1390as with subroutines. The expression providing the return value is evaluated 1391in void, scalar, or list context, depending on the context of the eval itself. 1392See L</wantarray> for more on how the evaluation context can be determined. 1393 1394If there is a syntax error or runtime error, or a C<die> statement is 1395executed, an undefined value is returned by C<eval>, and C<$@> is set to the 1396error message. If there was no error, C<$@> is guaranteed to be a null 1397string. Beware that using C<eval> neither silences perl from printing 1398warnings to STDERR, nor does it stuff the text of warning messages into C<$@>. 1399To do either of those, you have to use the C<$SIG{__WARN__}> facility, or 1400turn off warnings inside the BLOCK or EXPR using S<C<no warnings 'all'>>. 1401See L</warn>, L<perlvar>, L<warnings> and L<perllexwarn>. 1402 1403Note that, because C<eval> traps otherwise-fatal errors, it is useful for 1404determining whether a particular feature (such as C<socket> or C<symlink>) 1405is implemented. It is also Perl's exception trapping mechanism, where 1406the die operator is used to raise exceptions. 1407 1408If the code to be executed doesn't vary, you may use the eval-BLOCK 1409form to trap run-time errors without incurring the penalty of 1410recompiling each time. The error, if any, is still returned in C<$@>. 1411Examples: 1412 1413 # make divide-by-zero nonfatal 1414 eval { $answer = $a / $b; }; warn $@ if $@; 1415 1416 # same thing, but less efficient 1417 eval '$answer = $a / $b'; warn $@ if $@; 1418 1419 # a compile-time error 1420 eval { $answer = }; # WRONG 1421 1422 # a run-time error 1423 eval '$answer ='; # sets $@ 1424 1425Due to the current arguably broken state of C<__DIE__> hooks, when using 1426the C<eval{}> form as an exception trap in libraries, you may wish not 1427to trigger any C<__DIE__> hooks that user code may have installed. 1428You can use the C<local $SIG{__DIE__}> construct for this purpose, 1429as shown in this example: 1430 1431 # a very private exception trap for divide-by-zero 1432 eval { local $SIG{'__DIE__'}; $answer = $a / $b; }; 1433 warn $@ if $@; 1434 1435This is especially significant, given that C<__DIE__> hooks can call 1436C<die> again, which has the effect of changing their error messages: 1437 1438 # __DIE__ hooks may modify error messages 1439 { 1440 local $SIG{'__DIE__'} = 1441 sub { (my $x = $_[0]) =~ s/foo/bar/g; die $x }; 1442 eval { die "foo lives here" }; 1443 print $@ if $@; # prints "bar lives here" 1444 } 1445 1446Because this promotes action at a distance, this counterintuitive behavior 1447may be fixed in a future release. 1448 1449With an C<eval>, you should be especially careful to remember what's 1450being looked at when: 1451 1452 eval $x; # CASE 1 1453 eval "$x"; # CASE 2 1454 1455 eval '$x'; # CASE 3 1456 eval { $x }; # CASE 4 1457 1458 eval "\$$x++"; # CASE 5 1459 $$x++; # CASE 6 1460 1461Cases 1 and 2 above behave identically: they run the code contained in 1462the variable $x. (Although case 2 has misleading double quotes making 1463the reader wonder what else might be happening (nothing is).) Cases 3 1464and 4 likewise behave in the same way: they run the code C<'$x'>, which 1465does nothing but return the value of $x. (Case 4 is preferred for 1466purely visual reasons, but it also has the advantage of compiling at 1467compile-time instead of at run-time.) Case 5 is a place where 1468normally you I<would> like to use double quotes, except that in this 1469particular situation, you can just use symbolic references instead, as 1470in case 6. 1471 1472C<eval BLOCK> does I<not> count as a loop, so the loop control statements 1473C<next>, C<last>, or C<redo> cannot be used to leave or restart the block. 1474 1475Note that as a very special case, an C<eval ''> executed within the C<DB> 1476package doesn't see the usual surrounding lexical scope, but rather the 1477scope of the first non-DB piece of code that called it. You don't normally 1478need to worry about this unless you are writing a Perl debugger. 1479 1480=item exec LIST 1481 1482=item exec PROGRAM LIST 1483 1484The C<exec> function executes a system command I<and never returns>-- 1485use C<system> instead of C<exec> if you want it to return. It fails and 1486returns false only if the command does not exist I<and> it is executed 1487directly instead of via your system's command shell (see below). 1488 1489Since it's a common mistake to use C<exec> instead of C<system>, Perl 1490warns you if there is a following statement which isn't C<die>, C<warn>, 1491or C<exit> (if C<-w> is set - but you always do that). If you 1492I<really> want to follow an C<exec> with some other statement, you 1493can use one of these styles to avoid the warning: 1494 1495 exec ('foo') or print STDERR "couldn't exec foo: $!"; 1496 { exec ('foo') }; print STDERR "couldn't exec foo: $!"; 1497 1498If there is more than one argument in LIST, or if LIST is an array 1499with more than one value, calls execvp(3) with the arguments in LIST. 1500If there is only one scalar argument or an array with one element in it, 1501the argument is checked for shell metacharacters, and if there are any, 1502the entire argument is passed to the system's command shell for parsing 1503(this is C</bin/sh -c> on Unix platforms, but varies on other platforms). 1504If there are no shell metacharacters in the argument, it is split into 1505words and passed directly to C<execvp>, which is more efficient. 1506Examples: 1507 1508 exec '/bin/echo', 'Your arguments are: ', @ARGV; 1509 exec "sort $outfile | uniq"; 1510 1511If you don't really want to execute the first argument, but want to lie 1512to the program you are executing about its own name, you can specify 1513the program you actually want to run as an "indirect object" (without a 1514comma) in front of the LIST. (This always forces interpretation of the 1515LIST as a multivalued list, even if there is only a single scalar in 1516the list.) Example: 1517 1518 $shell = '/bin/csh'; 1519 exec $shell '-sh'; # pretend it's a login shell 1520 1521or, more directly, 1522 1523 exec {'/bin/csh'} '-sh'; # pretend it's a login shell 1524 1525When the arguments get executed via the system shell, results will 1526be subject to its quirks and capabilities. See L<perlop/"`STRING`"> 1527for details. 1528 1529Using an indirect object with C<exec> or C<system> is also more 1530secure. This usage (which also works fine with system()) forces 1531interpretation of the arguments as a multivalued list, even if the 1532list had just one argument. That way you're safe from the shell 1533expanding wildcards or splitting up words with whitespace in them. 1534 1535 @args = ( "echo surprise" ); 1536 1537 exec @args; # subject to shell escapes 1538 # if @args == 1 1539 exec { $args[0] } @args; # safe even with one-arg list 1540 1541The first version, the one without the indirect object, ran the I<echo> 1542program, passing it C<"surprise"> an argument. The second version 1543didn't--it tried to run a program literally called I<"echo surprise">, 1544didn't find it, and set C<$?> to a non-zero value indicating failure. 1545 1546Beginning with v5.6.0, Perl will attempt to flush all files opened for 1547output before the exec, but this may not be supported on some platforms 1548(see L<perlport>). To be safe, you may need to set C<$|> ($AUTOFLUSH 1549in English) or call the C<autoflush()> method of C<IO::Handle> on any 1550open handles in order to avoid lost output. 1551 1552Note that C<exec> will not call your C<END> blocks, nor will it call 1553any C<DESTROY> methods in your objects. 1554 1555=item exists EXPR 1556 1557Given an expression that specifies a hash element or array element, 1558returns true if the specified element in the hash or array has ever 1559been initialized, even if the corresponding value is undefined. The 1560element is not autovivified if it doesn't exist. 1561 1562 print "Exists\n" if exists $hash{$key}; 1563 print "Defined\n" if defined $hash{$key}; 1564 print "True\n" if $hash{$key}; 1565 1566 print "Exists\n" if exists $array[$index]; 1567 print "Defined\n" if defined $array[$index]; 1568 print "True\n" if $array[$index]; 1569 1570A hash or array element can be true only if it's defined, and defined if 1571it exists, but the reverse doesn't necessarily hold true. 1572 1573Given an expression that specifies the name of a subroutine, 1574returns true if the specified subroutine has ever been declared, even 1575if it is undefined. Mentioning a subroutine name for exists or defined 1576does not count as declaring it. Note that a subroutine which does not 1577exist may still be callable: its package may have an C<AUTOLOAD> 1578method that makes it spring into existence the first time that it is 1579called -- see L<perlsub>. 1580 1581 print "Exists\n" if exists &subroutine; 1582 print "Defined\n" if defined &subroutine; 1583 1584Note that the EXPR can be arbitrarily complicated as long as the final 1585operation is a hash or array key lookup or subroutine name: 1586 1587 if (exists $ref->{A}->{B}->{$key}) { } 1588 if (exists $hash{A}{B}{$key}) { } 1589 1590 if (exists $ref->{A}->{B}->[$ix]) { } 1591 if (exists $hash{A}{B}[$ix]) { } 1592 1593 if (exists &{$ref->{A}{B}{$key}}) { } 1594 1595Although the deepest nested array or hash will not spring into existence 1596just because its existence was tested, any intervening ones will. 1597Thus C<< $ref->{"A"} >> and C<< $ref->{"A"}->{"B"} >> will spring 1598into existence due to the existence test for the $key element above. 1599This happens anywhere the arrow operator is used, including even: 1600 1601 undef $ref; 1602 if (exists $ref->{"Some key"}) { } 1603 print $ref; # prints HASH(0x80d3d5c) 1604 1605This surprising autovivification in what does not at first--or even 1606second--glance appear to be an lvalue context may be fixed in a future 1607release. 1608 1609See L<perlref/"Pseudo-hashes: Using an array as a hash"> for specifics 1610on how exists() acts when used on a pseudo-hash. 1611 1612Use of a subroutine call, rather than a subroutine name, as an argument 1613to exists() is an error. 1614 1615 exists ⊂ # OK 1616 exists &sub(); # Error 1617 1618=item exit EXPR 1619 1620Evaluates EXPR and exits immediately with that value. Example: 1621 1622 $ans = <STDIN>; 1623 exit 0 if $ans =~ /^[Xx]/; 1624 1625See also C<die>. If EXPR is omitted, exits with C<0> status. The only 1626universally recognized values for EXPR are C<0> for success and C<1> 1627for error; other values are subject to interpretation depending on the 1628environment in which the Perl program is running. For example, exiting 162969 (EX_UNAVAILABLE) from a I<sendmail> incoming-mail filter will cause 1630the mailer to return the item undelivered, but that's not true everywhere. 1631 1632Don't use C<exit> to abort a subroutine if there's any chance that 1633someone might want to trap whatever error happened. Use C<die> instead, 1634which can be trapped by an C<eval>. 1635 1636The exit() function does not always exit immediately. It calls any 1637defined C<END> routines first, but these C<END> routines may not 1638themselves abort the exit. Likewise any object destructors that need to 1639be called are called before the real exit. If this is a problem, you 1640can call C<POSIX:_exit($status)> to avoid END and destructor processing. 1641See L<perlmod> for details. 1642 1643=item exp EXPR 1644 1645=item exp 1646 1647Returns I<e> (the natural logarithm base) to the power of EXPR. 1648If EXPR is omitted, gives C<exp($_)>. 1649 1650=item fcntl FILEHANDLE,FUNCTION,SCALAR 1651 1652Implements the fcntl(2) function. You'll probably have to say 1653 1654 use Fcntl; 1655 1656first to get the correct constant definitions. Argument processing and 1657value return works just like C<ioctl> below. 1658For example: 1659 1660 use Fcntl; 1661 fcntl($filehandle, F_GETFL, $packed_return_buffer) 1662 or die "can't fcntl F_GETFL: $!"; 1663 1664You don't have to check for C<defined> on the return from C<fcntl>. 1665Like C<ioctl>, it maps a C<0> return from the system call into 1666C<"0 but true"> in Perl. This string is true in boolean context and C<0> 1667in numeric context. It is also exempt from the normal B<-w> warnings 1668on improper numeric conversions. 1669 1670Note that C<fcntl> will produce a fatal error if used on a machine that 1671doesn't implement fcntl(2). See the Fcntl module or your fcntl(2) 1672manpage to learn what functions are available on your system. 1673 1674Here's an example of setting a filehandle named C<REMOTE> to be 1675non-blocking at the system level. You'll have to negotiate C<$|> 1676on your own, though. 1677 1678 use Fcntl qw(F_GETFL F_SETFL O_NONBLOCK); 1679 1680 $flags = fcntl(REMOTE, F_GETFL, 0) 1681 or die "Can't get flags for the socket: $!\n"; 1682 1683 $flags = fcntl(REMOTE, F_SETFL, $flags | O_NONBLOCK) 1684 or die "Can't set flags for the socket: $!\n"; 1685 1686=item fileno FILEHANDLE 1687 1688Returns the file descriptor for a filehandle, or undefined if the 1689filehandle is not open. This is mainly useful for constructing 1690bitmaps for C<select> and low-level POSIX tty-handling operations. 1691If FILEHANDLE is an expression, the value is taken as an indirect 1692filehandle, generally its name. 1693 1694You can use this to find out whether two handles refer to the 1695same underlying descriptor: 1696 1697 if (fileno(THIS) == fileno(THAT)) { 1698 print "THIS and THAT are dups\n"; 1699 } 1700 1701(Filehandles connected to memory objects via new features of C<open> may 1702return undefined even though they are open.) 1703 1704 1705=item flock FILEHANDLE,OPERATION 1706 1707Calls flock(2), or an emulation of it, on FILEHANDLE. Returns true 1708for success, false on failure. Produces a fatal error if used on a 1709machine that doesn't implement flock(2), fcntl(2) locking, or lockf(3). 1710C<flock> is Perl's portable file locking interface, although it locks 1711only entire files, not records. 1712 1713Two potentially non-obvious but traditional C<flock> semantics are 1714that it waits indefinitely until the lock is granted, and that its locks 1715B<merely advisory>. Such discretionary locks are more flexible, but offer 1716fewer guarantees. This means that files locked with C<flock> may be 1717modified by programs that do not also use C<flock>. See L<perlport>, 1718your port's specific documentation, or your system-specific local manpages 1719for details. It's best to assume traditional behavior if you're writing 1720portable programs. (But if you're not, you should as always feel perfectly 1721free to write for your own system's idiosyncrasies (sometimes called 1722"features"). Slavish adherence to portability concerns shouldn't get 1723in the way of your getting your job done.) 1724 1725OPERATION is one of LOCK_SH, LOCK_EX, or LOCK_UN, possibly combined with 1726LOCK_NB. These constants are traditionally valued 1, 2, 8 and 4, but 1727you can use the symbolic names if you import them from the Fcntl module, 1728either individually, or as a group using the ':flock' tag. LOCK_SH 1729requests a shared lock, LOCK_EX requests an exclusive lock, and LOCK_UN 1730releases a previously requested lock. If LOCK_NB is bitwise-or'ed with 1731LOCK_SH or LOCK_EX then C<flock> will return immediately rather than blocking 1732waiting for the lock (check the return status to see if you got it). 1733 1734To avoid the possibility of miscoordination, Perl now flushes FILEHANDLE 1735before locking or unlocking it. 1736 1737Note that the emulation built with lockf(3) doesn't provide shared 1738locks, and it requires that FILEHANDLE be open with write intent. These 1739are the semantics that lockf(3) implements. Most if not all systems 1740implement lockf(3) in terms of fcntl(2) locking, though, so the 1741differing semantics shouldn't bite too many people. 1742 1743Note that the fcntl(2) emulation of flock(3) requires that FILEHANDLE 1744be open with read intent to use LOCK_SH and requires that it be open 1745with write intent to use LOCK_EX. 1746 1747Note also that some versions of C<flock> cannot lock things over the 1748network; you would need to use the more system-specific C<fcntl> for 1749that. If you like you can force Perl to ignore your system's flock(2) 1750function, and so provide its own fcntl(2)-based emulation, by passing 1751the switch C<-Ud_flock> to the F<Configure> program when you configure 1752perl. 1753 1754Here's a mailbox appender for BSD systems. 1755 1756 use Fcntl ':flock'; # import LOCK_* constants 1757 1758 sub lock { 1759 flock(MBOX,LOCK_EX); 1760 # and, in case someone appended 1761 # while we were waiting... 1762 seek(MBOX, 0, 2); 1763 } 1764 1765 sub unlock { 1766 flock(MBOX,LOCK_UN); 1767 } 1768 1769 open(MBOX, ">>/usr/spool/mail/$ENV{'USER'}") 1770 or die "Can't open mailbox: $!"; 1771 1772 lock(); 1773 print MBOX $msg,"\n\n"; 1774 unlock(); 1775 1776On systems that support a real flock(), locks are inherited across fork() 1777calls, whereas those that must resort to the more capricious fcntl() 1778function lose the locks, making it harder to write servers. 1779 1780See also L<DB_File> for other flock() examples. 1781 1782=item fork 1783 1784Does a fork(2) system call to create a new process running the 1785same program at the same point. It returns the child pid to the 1786parent process, C<0> to the child process, or C<undef> if the fork is 1787unsuccessful. File descriptors (and sometimes locks on those descriptors) 1788are shared, while everything else is copied. On most systems supporting 1789fork(), great care has gone into making it extremely efficient (for 1790example, using copy-on-write technology on data pages), making it the 1791dominant paradigm for multitasking over the last few decades. 1792 1793Beginning with v5.6.0, Perl will attempt to flush all files opened for 1794output before forking the child process, but this may not be supported 1795on some platforms (see L<perlport>). To be safe, you may need to set 1796C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method of 1797C<IO::Handle> on any open handles in order to avoid duplicate output. 1798 1799If you C<fork> without ever waiting on your children, you will 1800accumulate zombies. On some systems, you can avoid this by setting 1801C<$SIG{CHLD}> to C<"IGNORE">. See also L<perlipc> for more examples of 1802forking and reaping moribund children. 1803 1804Note that if your forked child inherits system file descriptors like 1805STDIN and STDOUT that are actually connected by a pipe or socket, even 1806if you exit, then the remote server (such as, say, a CGI script or a 1807backgrounded job launched from a remote shell) won't think you're done. 1808You should reopen those to F</dev/null> if it's any issue. 1809 1810=item format 1811 1812Declare a picture format for use by the C<write> function. For 1813example: 1814 1815 format Something = 1816 Test: @<<<<<<<< @||||| @>>>>> 1817 $str, $%, '$' . int($num) 1818 . 1819 1820 $str = "widget"; 1821 $num = $cost/$quantity; 1822 $~ = 'Something'; 1823 write; 1824 1825See L<perlform> for many details and examples. 1826 1827=item formline PICTURE,LIST 1828 1829This is an internal function used by C<format>s, though you may call it, 1830too. It formats (see L<perlform>) a list of values according to the 1831contents of PICTURE, placing the output into the format output 1832accumulator, C<$^A> (or C<$ACCUMULATOR> in English). 1833Eventually, when a C<write> is done, the contents of 1834C<$^A> are written to some filehandle, but you could also read C<$^A> 1835yourself and then set C<$^A> back to C<"">. Note that a format typically 1836does one C<formline> per line of form, but the C<formline> function itself 1837doesn't care how many newlines are embedded in the PICTURE. This means 1838that the C<~> and C<~~> tokens will treat the entire PICTURE as a single line. 1839You may therefore need to use multiple formlines to implement a single 1840record format, just like the format compiler. 1841 1842Be careful if you put double quotes around the picture, because an C<@> 1843character may be taken to mean the beginning of an array name. 1844C<formline> always returns true. See L<perlform> for other examples. 1845 1846=item getc FILEHANDLE 1847 1848=item getc 1849 1850Returns the next character from the input file attached to FILEHANDLE, 1851or the undefined value at end of file, or if there was an error (in 1852the latter case C<$!> is set). If FILEHANDLE is omitted, reads from 1853STDIN. This is not particularly efficient. However, it cannot be 1854used by itself to fetch single characters without waiting for the user 1855to hit enter. For that, try something more like: 1856 1857 if ($BSD_STYLE) { 1858 system "stty cbreak </dev/tty >/dev/tty 2>&1"; 1859 } 1860 else { 1861 system "stty", '-icanon', 'eol', "\001"; 1862 } 1863 1864 $key = getc(STDIN); 1865 1866 if ($BSD_STYLE) { 1867 system "stty -cbreak </dev/tty >/dev/tty 2>&1"; 1868 } 1869 else { 1870 system "stty", 'icanon', 'eol', '^@'; # ASCII null 1871 } 1872 print "\n"; 1873 1874Determination of whether $BSD_STYLE should be set 1875is left as an exercise to the reader. 1876 1877The C<POSIX::getattr> function can do this more portably on 1878systems purporting POSIX compliance. See also the C<Term::ReadKey> 1879module from your nearest CPAN site; details on CPAN can be found on 1880L<perlmodlib/CPAN>. 1881 1882=item getlogin 1883 1884Implements the C library function of the same name, which on most 1885systems returns the current login from F</etc/utmp>, if any. If null, 1886use C<getpwuid>. 1887 1888 $login = getlogin || getpwuid($<) || "Kilroy"; 1889 1890Do not consider C<getlogin> for authentication: it is not as 1891secure as C<getpwuid>. 1892 1893=item getpeername SOCKET 1894 1895Returns the packed sockaddr address of other end of the SOCKET connection. 1896 1897 use Socket; 1898 $hersockaddr = getpeername(SOCK); 1899 ($port, $iaddr) = sockaddr_in($hersockaddr); 1900 $herhostname = gethostbyaddr($iaddr, AF_INET); 1901 $herstraddr = inet_ntoa($iaddr); 1902 1903=item getpgrp PID 1904 1905Returns the current process group for the specified PID. Use 1906a PID of C<0> to get the current process group for the 1907current process. Will raise an exception if used on a machine that 1908doesn't implement getpgrp(2). If PID is omitted, returns process 1909group of current process. Note that the POSIX version of C<getpgrp> 1910does not accept a PID argument, so only C<PID==0> is truly portable. 1911 1912=item getppid 1913 1914Returns the process id of the parent process. 1915 1916Note for Linux users: on Linux, the C functions C<getpid()> and 1917C<getppid()> return different values from different threads. In order to 1918be portable, this behavior is not reflected by the perl-level function 1919C<getppid()>, that returns a consistent value across threads. If you want 1920to call the underlying C<getppid()>, you may use the CPAN module 1921C<Linux::Pid>. 1922 1923=item getpriority WHICH,WHO 1924 1925Returns the current priority for a process, a process group, or a user. 1926(See L<getpriority(2)>.) Will raise a fatal exception if used on a 1927machine that doesn't implement getpriority(2). 1928 1929=item getpwnam NAME 1930 1931=item getgrnam NAME 1932 1933=item gethostbyname NAME 1934 1935=item getnetbyname NAME 1936 1937=item getprotobyname NAME 1938 1939=item getpwuid UID 1940 1941=item getgrgid GID 1942 1943=item getservbyname NAME,PROTO 1944 1945=item gethostbyaddr ADDR,ADDRTYPE 1946 1947=item getnetbyaddr ADDR,ADDRTYPE 1948 1949=item getprotobynumber NUMBER 1950 1951=item getservbyport PORT,PROTO 1952 1953=item getpwent 1954 1955=item getgrent 1956 1957=item gethostent 1958 1959=item getnetent 1960 1961=item getprotoent 1962 1963=item getservent 1964 1965=item setpwent 1966 1967=item setgrent 1968 1969=item sethostent STAYOPEN 1970 1971=item setnetent STAYOPEN 1972 1973=item setprotoent STAYOPEN 1974 1975=item setservent STAYOPEN 1976 1977=item endpwent 1978 1979=item endgrent 1980 1981=item endhostent 1982 1983=item endnetent 1984 1985=item endprotoent 1986 1987=item endservent 1988 1989These routines perform the same functions as their counterparts in the 1990system library. In list context, the return values from the 1991various get routines are as follows: 1992 1993 ($name,$passwd,$uid,$gid, 1994 $quota,$comment,$gcos,$dir,$shell,$expire) = getpw* 1995 ($name,$passwd,$gid,$members) = getgr* 1996 ($name,$aliases,$addrtype,$length,@addrs) = gethost* 1997 ($name,$aliases,$addrtype,$net) = getnet* 1998 ($name,$aliases,$proto) = getproto* 1999 ($name,$aliases,$port,$proto) = getserv* 2000 2001(If the entry doesn't exist you get a null list.) 2002 2003The exact meaning of the $gcos field varies but it usually contains 2004the real name of the user (as opposed to the login name) and other 2005information pertaining to the user. Beware, however, that in many 2006system users are able to change this information and therefore it 2007cannot be trusted and therefore the $gcos is tainted (see 2008L<perlsec>). The $passwd and $shell, user's encrypted password and 2009login shell, are also tainted, because of the same reason. 2010 2011In scalar context, you get the name, unless the function was a 2012lookup by name, in which case you get the other thing, whatever it is. 2013(If the entry doesn't exist you get the undefined value.) For example: 2014 2015 $uid = getpwnam($name); 2016 $name = getpwuid($num); 2017 $name = getpwent(); 2018 $gid = getgrnam($name); 2019 $name = getgrgid($num); 2020 $name = getgrent(); 2021 #etc. 2022 2023In I<getpw*()> the fields $quota, $comment, and $expire are special 2024cases in the sense that in many systems they are unsupported. If the 2025$quota is unsupported, it is an empty scalar. If it is supported, it 2026usually encodes the disk quota. If the $comment field is unsupported, 2027it is an empty scalar. If it is supported it usually encodes some 2028administrative comment about the user. In some systems the $quota 2029field may be $change or $age, fields that have to do with password 2030aging. In some systems the $comment field may be $class. The $expire 2031field, if present, encodes the expiration period of the account or the 2032password. For the availability and the exact meaning of these fields 2033in your system, please consult your getpwnam(3) documentation and your 2034F<pwd.h> file. You can also find out from within Perl what your 2035$quota and $comment fields mean and whether you have the $expire field 2036by using the C<Config> module and the values C<d_pwquota>, C<d_pwage>, 2037C<d_pwchange>, C<d_pwcomment>, and C<d_pwexpire>. Shadow password 2038files are only supported if your vendor has implemented them in the 2039intuitive fashion that calling the regular C library routines gets the 2040shadow versions if you're running under privilege or if there exists 2041the shadow(3) functions as found in System V ( this includes Solaris 2042and Linux.) Those systems which implement a proprietary shadow password 2043facility are unlikely to be supported. 2044 2045The $members value returned by I<getgr*()> is a space separated list of 2046the login names of the members of the group. 2047 2048For the I<gethost*()> functions, if the C<h_errno> variable is supported in 2049C, it will be returned to you via C<$?> if the function call fails. The 2050C<@addrs> value returned by a successful call is a list of the raw 2051addresses returned by the corresponding system library call. In the 2052Internet domain, each address is four bytes long and you can unpack it 2053by saying something like: 2054 2055 ($a,$b,$c,$d) = unpack('C4',$addr[0]); 2056 2057The Socket library makes this slightly easier: 2058 2059 use Socket; 2060 $iaddr = inet_aton("127.1"); # or whatever address 2061 $name = gethostbyaddr($iaddr, AF_INET); 2062 2063 # or going the other way 2064 $straddr = inet_ntoa($iaddr); 2065 2066If you get tired of remembering which element of the return list 2067contains which return value, by-name interfaces are provided 2068in standard modules: C<File::stat>, C<Net::hostent>, C<Net::netent>, 2069C<Net::protoent>, C<Net::servent>, C<Time::gmtime>, C<Time::localtime>, 2070and C<User::grent>. These override the normal built-ins, supplying 2071versions that return objects with the appropriate names 2072for each field. For example: 2073 2074 use File::stat; 2075 use User::pwent; 2076 $is_his = (stat($filename)->uid == pwent($whoever)->uid); 2077 2078Even though it looks like they're the same method calls (uid), 2079they aren't, because a C<File::stat> object is different from 2080a C<User::pwent> object. 2081 2082=item getsockname SOCKET 2083 2084Returns the packed sockaddr address of this end of the SOCKET connection, 2085in case you don't know the address because you have several different 2086IPs that the connection might have come in on. 2087 2088 use Socket; 2089 $mysockaddr = getsockname(SOCK); 2090 ($port, $myaddr) = sockaddr_in($mysockaddr); 2091 printf "Connect to %s [%s]\n", 2092 scalar gethostbyaddr($myaddr, AF_INET), 2093 inet_ntoa($myaddr); 2094 2095=item getsockopt SOCKET,LEVEL,OPTNAME 2096 2097Returns the socket option requested, or undef if there is an error. 2098 2099=item glob EXPR 2100 2101=item glob 2102 2103In list context, returns a (possibly empty) list of filename expansions on 2104the value of EXPR such as the standard Unix shell F</bin/csh> would do. In 2105scalar context, glob iterates through such filename expansions, returning 2106undef when the list is exhausted. This is the internal function 2107implementing the C<< <*.c> >> operator, but you can use it directly. If 2108EXPR is omitted, C<$_> is used. The C<< <*.c> >> operator is discussed in 2109more detail in L<perlop/"I/O Operators">. 2110 2111Beginning with v5.6.0, this operator is implemented using the standard 2112C<File::Glob> extension. See L<File::Glob> for details. 2113 2114=item gmtime EXPR 2115 2116Converts a time as returned by the time function to an 8-element list 2117with the time localized for the standard Greenwich time zone. 2118Typically used as follows: 2119 2120 # 0 1 2 3 4 5 6 7 2121 ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday) = 2122 gmtime(time); 2123 2124All list elements are numeric, and come straight out of the C `struct 2125tm'. $sec, $min, and $hour are the seconds, minutes, and hours of the 2126specified time. $mday is the day of the month, and $mon is the month 2127itself, in the range C<0..11> with 0 indicating January and 11 2128indicating December. $year is the number of years since 1900. That 2129is, $year is C<123> in year 2023. $wday is the day of the week, with 21300 indicating Sunday and 3 indicating Wednesday. $yday is the day of 2131the year, in the range C<0..364> (or C<0..365> in leap years.) 2132 2133Note that the $year element is I<not> simply the last two digits of 2134the year. If you assume it is, then you create non-Y2K-compliant 2135programs--and you wouldn't want to do that, would you? 2136 2137The proper way to get a complete 4-digit year is simply: 2138 2139 $year += 1900; 2140 2141And to get the last two digits of the year (e.g., '01' in 2001) do: 2142 2143 $year = sprintf("%02d", $year % 100); 2144 2145If EXPR is omitted, C<gmtime()> uses the current time (C<gmtime(time)>). 2146 2147In scalar context, C<gmtime()> returns the ctime(3) value: 2148 2149 $now_string = gmtime; # e.g., "Thu Oct 13 04:54:34 1994" 2150 2151If you need local time instead of GMT use the L</localtime> builtin. 2152See also the C<timegm> function provided by the C<Time::Local> module, 2153and the strftime(3) and mktime(3) functions available via the L<POSIX> module. 2154 2155This scalar value is B<not> locale dependent (see L<perllocale>), but is 2156instead a Perl builtin. To get somewhat similar but locale dependent date 2157strings, see the example in L</localtime>. 2158 2159=item goto LABEL 2160 2161=item goto EXPR 2162 2163=item goto &NAME 2164 2165The C<goto-LABEL> form finds the statement labeled with LABEL and resumes 2166execution there. It may not be used to go into any construct that 2167requires initialization, such as a subroutine or a C<foreach> loop. It 2168also can't be used to go into a construct that is optimized away, 2169or to get out of a block or subroutine given to C<sort>. 2170It can be used to go almost anywhere else within the dynamic scope, 2171including out of subroutines, but it's usually better to use some other 2172construct such as C<last> or C<die>. The author of Perl has never felt the 2173need to use this form of C<goto> (in Perl, that is--C is another matter). 2174(The difference being that C does not offer named loops combined with 2175loop control. Perl does, and this replaces most structured uses of C<goto> 2176in other languages.) 2177 2178The C<goto-EXPR> form expects a label name, whose scope will be resolved 2179dynamically. This allows for computed C<goto>s per FORTRAN, but isn't 2180necessarily recommended if you're optimizing for maintainability: 2181 2182 goto ("FOO", "BAR", "GLARCH")[$i]; 2183 2184The C<goto-&NAME> form is quite different from the other forms of 2185C<goto>. In fact, it isn't a goto in the normal sense at all, and 2186doesn't have the stigma associated with other gotos. Instead, it 2187exits the current subroutine (losing any changes set by local()) and 2188immediately calls in its place the named subroutine using the current 2189value of @_. This is used by C<AUTOLOAD> subroutines that wish to 2190load another subroutine and then pretend that the other subroutine had 2191been called in the first place (except that any modifications to C<@_> 2192in the current subroutine are propagated to the other subroutine.) 2193After the C<goto>, not even C<caller> will be able to tell that this 2194routine was called first. 2195 2196NAME needn't be the name of a subroutine; it can be a scalar variable 2197containing a code reference, or a block which evaluates to a code 2198reference. 2199 2200=item grep BLOCK LIST 2201 2202=item grep EXPR,LIST 2203 2204This is similar in spirit to, but not the same as, grep(1) and its 2205relatives. In particular, it is not limited to using regular expressions. 2206 2207Evaluates the BLOCK or EXPR for each element of LIST (locally setting 2208C<$_> to each element) and returns the list value consisting of those 2209elements for which the expression evaluated to true. In scalar 2210context, returns the number of times the expression was true. 2211 2212 @foo = grep(!/^#/, @bar); # weed out comments 2213 2214or equivalently, 2215 2216 @foo = grep {!/^#/} @bar; # weed out comments 2217 2218Note that C<$_> is an alias to the list value, so it can be used to 2219modify the elements of the LIST. While this is useful and supported, 2220it can cause bizarre results if the elements of LIST are not variables. 2221Similarly, grep returns aliases into the original list, much as a for 2222loop's index variable aliases the list elements. That is, modifying an 2223element of a list returned by grep (for example, in a C<foreach>, C<map> 2224or another C<grep>) actually modifies the element in the original list. 2225This is usually something to be avoided when writing clear code. 2226 2227See also L</map> for a list composed of the results of the BLOCK or EXPR. 2228 2229=item hex EXPR 2230 2231=item hex 2232 2233Interprets EXPR as a hex string and returns the corresponding value. 2234(To convert strings that might start with either 0, 0x, or 0b, see 2235L</oct>.) If EXPR is omitted, uses C<$_>. 2236 2237 print hex '0xAf'; # prints '175' 2238 print hex 'aF'; # same 2239 2240Hex strings may only represent integers. Strings that would cause 2241integer overflow trigger a warning. Leading whitespace is not stripped, 2242unlike oct(). 2243 2244=item import 2245 2246There is no builtin C<import> function. It is just an ordinary 2247method (subroutine) defined (or inherited) by modules that wish to export 2248names to another module. The C<use> function calls the C<import> method 2249for the package used. See also L</use>, L<perlmod>, and L<Exporter>. 2250 2251=item index STR,SUBSTR,POSITION 2252 2253=item index STR,SUBSTR 2254 2255The index function searches for one string within another, but without 2256the wildcard-like behavior of a full regular-expression pattern match. 2257It returns the position of the first occurrence of SUBSTR in STR at 2258or after POSITION. If POSITION is omitted, starts searching from the 2259beginning of the string. The return value is based at C<0> (or whatever 2260you've set the C<$[> variable to--but don't do that). If the substring 2261is not found, returns one less than the base, ordinarily C<-1>. 2262 2263=item int EXPR 2264 2265=item int 2266 2267Returns the integer portion of EXPR. If EXPR is omitted, uses C<$_>. 2268You should not use this function for rounding: one because it truncates 2269towards C<0>, and two because machine representations of floating point 2270numbers can sometimes produce counterintuitive results. For example, 2271C<int(-6.725/0.025)> produces -268 rather than the correct -269; that's 2272because it's really more like -268.99999999999994315658 instead. Usually, 2273the C<sprintf>, C<printf>, or the C<POSIX::floor> and C<POSIX::ceil> 2274functions will serve you better than will int(). 2275 2276=item ioctl FILEHANDLE,FUNCTION,SCALAR 2277 2278Implements the ioctl(2) function. You'll probably first have to say 2279 2280 require "ioctl.ph"; # probably in /usr/local/lib/perl/ioctl.ph 2281 2282to get the correct function definitions. If F<ioctl.ph> doesn't 2283exist or doesn't have the correct definitions you'll have to roll your 2284own, based on your C header files such as F<< <sys/ioctl.h> >>. 2285(There is a Perl script called B<h2ph> that comes with the Perl kit that 2286may help you in this, but it's nontrivial.) SCALAR will be read and/or 2287written depending on the FUNCTION--a pointer to the string value of SCALAR 2288will be passed as the third argument of the actual C<ioctl> call. (If SCALAR 2289has no string value but does have a numeric value, that value will be 2290passed rather than a pointer to the string value. To guarantee this to be 2291true, add a C<0> to the scalar before using it.) The C<pack> and C<unpack> 2292functions may be needed to manipulate the values of structures used by 2293C<ioctl>. 2294 2295The return value of C<ioctl> (and C<fcntl>) is as follows: 2296 2297 if OS returns: then Perl returns: 2298 -1 undefined value 2299 0 string "0 but true" 2300 anything else that number 2301 2302Thus Perl returns true on success and false on failure, yet you can 2303still easily determine the actual value returned by the operating 2304system: 2305 2306 $retval = ioctl(...) || -1; 2307 printf "System returned %d\n", $retval; 2308 2309The special string C<"0 but true"> is exempt from B<-w> complaints 2310about improper numeric conversions. 2311 2312=item join EXPR,LIST 2313 2314Joins the separate strings of LIST into a single string with fields 2315separated by the value of EXPR, and returns that new string. Example: 2316 2317 $rec = join(':', $login,$passwd,$uid,$gid,$gcos,$home,$shell); 2318 2319Beware that unlike C<split>, C<join> doesn't take a pattern as its 2320first argument. Compare L</split>. 2321 2322=item keys HASH 2323 2324Returns a list consisting of all the keys of the named hash. 2325(In scalar context, returns the number of keys.) 2326 2327The keys are returned in an apparently random order. The actual 2328random order is subject to change in future versions of perl, but it 2329is guaranteed to be the same order as either the C<values> or C<each> 2330function produces (given that the hash has not been modified). Since 2331Perl 5.8.1 the ordering is different even between different runs of 2332Perl for security reasons (see L<perlsec/"Algorithmic Complexity 2333Attacks">). 2334 2335As a side effect, calling keys() resets the HASH's internal iterator, 2336see L</each>. (In particular, calling keys() in void context resets 2337the iterator with no other overhead.) 2338 2339Here is yet another way to print your environment: 2340 2341 @keys = keys %ENV; 2342 @values = values %ENV; 2343 while (@keys) { 2344 print pop(@keys), '=', pop(@values), "\n"; 2345 } 2346 2347or how about sorted by key: 2348 2349 foreach $key (sort(keys %ENV)) { 2350 print $key, '=', $ENV{$key}, "\n"; 2351 } 2352 2353The returned values are copies of the original keys in the hash, so 2354modifying them will not affect the original hash. Compare L</values>. 2355 2356To sort a hash by value, you'll need to use a C<sort> function. 2357Here's a descending numeric sort of a hash by its values: 2358 2359 foreach $key (sort { $hash{$b} <=> $hash{$a} } keys %hash) { 2360 printf "%4d %s\n", $hash{$key}, $key; 2361 } 2362 2363As an lvalue C<keys> allows you to increase the number of hash buckets 2364allocated for the given hash. This can gain you a measure of efficiency if 2365you know the hash is going to get big. (This is similar to pre-extending 2366an array by assigning a larger number to $#array.) If you say 2367 2368 keys %hash = 200; 2369 2370then C<%hash> will have at least 200 buckets allocated for it--256 of them, 2371in fact, since it rounds up to the next power of two. These 2372buckets will be retained even if you do C<%hash = ()>, use C<undef 2373%hash> if you want to free the storage while C<%hash> is still in scope. 2374You can't shrink the number of buckets allocated for the hash using 2375C<keys> in this way (but you needn't worry about doing this by accident, 2376as trying has no effect). 2377 2378See also C<each>, C<values> and C<sort>. 2379 2380=item kill SIGNAL, LIST 2381 2382Sends a signal to a list of processes. Returns the number of 2383processes successfully signaled (which is not necessarily the 2384same as the number actually killed). 2385 2386 $cnt = kill 1, $child1, $child2; 2387 kill 9, @goners; 2388 2389If SIGNAL is zero, no signal is sent to the process. This is a 2390useful way to check that a child process is alive and hasn't changed 2391its UID. See L<perlport> for notes on the portability of this 2392construct. 2393 2394Unlike in the shell, if SIGNAL is negative, it kills 2395process groups instead of processes. (On System V, a negative I<PROCESS> 2396number will also kill process groups, but that's not portable.) That 2397means you usually want to use positive not negative signals. You may also 2398use a signal name in quotes. 2399 2400See L<perlipc/"Signals"> for more details. 2401 2402=item last LABEL 2403 2404=item last 2405 2406The C<last> command is like the C<break> statement in C (as used in 2407loops); it immediately exits the loop in question. If the LABEL is 2408omitted, the command refers to the innermost enclosing loop. The 2409C<continue> block, if any, is not executed: 2410 2411 LINE: while (<STDIN>) { 2412 last LINE if /^$/; # exit when done with header 2413 #... 2414 } 2415 2416C<last> cannot be used to exit a block which returns a value such as 2417C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 2418a grep() or map() operation. 2419 2420Note that a block by itself is semantically identical to a loop 2421that executes once. Thus C<last> can be used to effect an early 2422exit out of such a block. 2423 2424See also L</continue> for an illustration of how C<last>, C<next>, and 2425C<redo> work. 2426 2427=item lc EXPR 2428 2429=item lc 2430 2431Returns a lowercased version of EXPR. This is the internal function 2432implementing the C<\L> escape in double-quoted strings. Respects 2433current LC_CTYPE locale if C<use locale> in force. See L<perllocale> 2434and L<perlunicode> for more details about locale and Unicode support. 2435 2436If EXPR is omitted, uses C<$_>. 2437 2438=item lcfirst EXPR 2439 2440=item lcfirst 2441 2442Returns the value of EXPR with the first character lowercased. This 2443is the internal function implementing the C<\l> escape in 2444double-quoted strings. Respects current LC_CTYPE locale if C<use 2445locale> in force. See L<perllocale> and L<perlunicode> for more 2446details about locale and Unicode support. 2447 2448If EXPR is omitted, uses C<$_>. 2449 2450=item length EXPR 2451 2452=item length 2453 2454Returns the length in I<characters> of the value of EXPR. If EXPR is 2455omitted, returns length of C<$_>. Note that this cannot be used on 2456an entire array or hash to find out how many elements these have. 2457For that, use C<scalar @array> and C<scalar keys %hash> respectively. 2458 2459Note the I<characters>: if the EXPR is in Unicode, you will get the 2460number of characters, not the number of bytes. To get the length 2461in bytes, use C<do { use bytes; length(EXPR) }>, see L<bytes>. 2462 2463=item link OLDFILE,NEWFILE 2464 2465Creates a new filename linked to the old filename. Returns true for 2466success, false otherwise. 2467 2468=item listen SOCKET,QUEUESIZE 2469 2470Does the same thing that the listen system call does. Returns true if 2471it succeeded, false otherwise. See the example in 2472L<perlipc/"Sockets: Client/Server Communication">. 2473 2474=item local EXPR 2475 2476You really probably want to be using C<my> instead, because C<local> isn't 2477what most people think of as "local". See 2478L<perlsub/"Private Variables via my()"> for details. 2479 2480A local modifies the listed variables to be local to the enclosing 2481block, file, or eval. If more than one value is listed, the list must 2482be placed in parentheses. See L<perlsub/"Temporary Values via local()"> 2483for details, including issues with tied arrays and hashes. 2484 2485=item localtime EXPR 2486 2487Converts a time as returned by the time function to a 9-element list 2488with the time analyzed for the local time zone. Typically used as 2489follows: 2490 2491 # 0 1 2 3 4 5 6 7 8 2492 ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = 2493 localtime(time); 2494 2495All list elements are numeric, and come straight out of the C `struct 2496tm'. $sec, $min, and $hour are the seconds, minutes, and hours of the 2497specified time. $mday is the day of the month, and $mon is the month 2498itself, in the range C<0..11> with 0 indicating January and 11 2499indicating December. $year is the number of years since 1900. That 2500is, $year is C<123> in year 2023. $wday is the day of the week, with 25010 indicating Sunday and 3 indicating Wednesday. $yday is the day of 2502the year, in the range C<0..364> (or C<0..365> in leap years.) $isdst 2503is true if the specified time occurs during daylight savings time, 2504false otherwise. 2505 2506Note that the $year element is I<not> simply the last two digits of 2507the year. If you assume it is, then you create non-Y2K-compliant 2508programs--and you wouldn't want to do that, would you? 2509 2510The proper way to get a complete 4-digit year is simply: 2511 2512 $year += 1900; 2513 2514And to get the last two digits of the year (e.g., '01' in 2001) do: 2515 2516 $year = sprintf("%02d", $year % 100); 2517 2518If EXPR is omitted, C<localtime()> uses the current time (C<localtime(time)>). 2519 2520In scalar context, C<localtime()> returns the ctime(3) value: 2521 2522 $now_string = localtime; # e.g., "Thu Oct 13 04:54:34 1994" 2523 2524This scalar value is B<not> locale dependent but is a Perl builtin. For GMT 2525instead of local time use the L</gmtime> builtin. See also the 2526C<Time::Local> module (to convert the second, minutes, hours, ... back to 2527the integer value returned by time()), and the L<POSIX> module's strftime(3) 2528and mktime(3) functions. 2529 2530To get somewhat similar but locale dependent date strings, set up your 2531locale environment variables appropriately (please see L<perllocale>) and 2532try for example: 2533 2534 use POSIX qw(strftime); 2535 $now_string = strftime "%a %b %e %H:%M:%S %Y", localtime; 2536 # or for GMT formatted appropriately for your locale: 2537 $now_string = strftime "%a %b %e %H:%M:%S %Y", gmtime; 2538 2539Note that the C<%a> and C<%b>, the short forms of the day of the week 2540and the month of the year, may not necessarily be three characters wide. 2541 2542=item lock THING 2543 2544This function places an advisory lock on a shared variable, or referenced 2545object contained in I<THING> until the lock goes out of scope. 2546 2547lock() is a "weak keyword" : this means that if you've defined a function 2548by this name (before any calls to it), that function will be called 2549instead. (However, if you've said C<use threads>, lock() is always a 2550keyword.) See L<threads>. 2551 2552=item log EXPR 2553 2554=item log 2555 2556Returns the natural logarithm (base I<e>) of EXPR. If EXPR is omitted, 2557returns log of C<$_>. To get the log of another base, use basic algebra: 2558The base-N log of a number is equal to the natural log of that number 2559divided by the natural log of N. For example: 2560 2561 sub log10 { 2562 my $n = shift; 2563 return log($n)/log(10); 2564 } 2565 2566See also L</exp> for the inverse operation. 2567 2568=item lstat EXPR 2569 2570=item lstat 2571 2572Does the same thing as the C<stat> function (including setting the 2573special C<_> filehandle) but stats a symbolic link instead of the file 2574the symbolic link points to. If symbolic links are unimplemented on 2575your system, a normal C<stat> is done. For much more detailed 2576information, please see the documentation for L</stat>. 2577 2578If EXPR is omitted, stats C<$_>. 2579 2580=item m// 2581 2582The match operator. See L<perlop>. 2583 2584=item map BLOCK LIST 2585 2586=item map EXPR,LIST 2587 2588Evaluates the BLOCK or EXPR for each element of LIST (locally setting 2589C<$_> to each element) and returns the list value composed of the 2590results of each such evaluation. In scalar context, returns the 2591total number of elements so generated. Evaluates BLOCK or EXPR in 2592list context, so each element of LIST may produce zero, one, or 2593more elements in the returned value. 2594 2595 @chars = map(chr, @nums); 2596 2597translates a list of numbers to the corresponding characters. And 2598 2599 %hash = map { getkey($_) => $_ } @array; 2600 2601is just a funny way to write 2602 2603 %hash = (); 2604 foreach $_ (@array) { 2605 $hash{getkey($_)} = $_; 2606 } 2607 2608Note that C<$_> is an alias to the list value, so it can be used to 2609modify the elements of the LIST. While this is useful and supported, 2610it can cause bizarre results if the elements of LIST are not variables. 2611Using a regular C<foreach> loop for this purpose would be clearer in 2612most cases. See also L</grep> for an array composed of those items of 2613the original list for which the BLOCK or EXPR evaluates to true. 2614 2615C<{> starts both hash references and blocks, so C<map { ...> could be either 2616the start of map BLOCK LIST or map EXPR, LIST. Because perl doesn't look 2617ahead for the closing C<}> it has to take a guess at which its dealing with 2618based what it finds just after the C<{>. Usually it gets it right, but if it 2619doesn't it won't realize something is wrong until it gets to the C<}> and 2620encounters the missing (or unexpected) comma. The syntax error will be 2621reported close to the C<}> but you'll need to change something near the C<{> 2622such as using a unary C<+> to give perl some help: 2623 2624 %hash = map { "\L$_", 1 } @array # perl guesses EXPR. wrong 2625 %hash = map { +"\L$_", 1 } @array # perl guesses BLOCK. right 2626 %hash = map { ("\L$_", 1) } @array # this also works 2627 %hash = map { lc($_), 1 } @array # as does this. 2628 %hash = map +( lc($_), 1 ), @array # this is EXPR and works! 2629 2630 %hash = map ( lc($_), 1 ), @array # evaluates to (1, @array) 2631 2632or to force an anon hash constructor use C<+{> 2633 2634 @hashes = map +{ lc($_), 1 }, @array # EXPR, so needs , at end 2635 2636and you get list of anonymous hashes each with only 1 entry. 2637 2638=item mkdir FILENAME,MASK 2639 2640=item mkdir FILENAME 2641 2642Creates the directory specified by FILENAME, with permissions 2643specified by MASK (as modified by C<umask>). If it succeeds it 2644returns true, otherwise it returns false and sets C<$!> (errno). 2645If omitted, MASK defaults to 0777. 2646 2647In general, it is better to create directories with permissive MASK, 2648and let the user modify that with their C<umask>, than it is to supply 2649a restrictive MASK and give the user no way to be more permissive. 2650The exceptions to this rule are when the file or directory should be 2651kept private (mail files, for instance). The perlfunc(1) entry on 2652C<umask> discusses the choice of MASK in more detail. 2653 2654Note that according to the POSIX 1003.1-1996 the FILENAME may have any 2655number of trailing slashes. Some operating and filesystems do not get 2656this right, so Perl automatically removes all trailing slashes to keep 2657everyone happy. 2658 2659=item msgctl ID,CMD,ARG 2660 2661Calls the System V IPC function msgctl(2). You'll probably have to say 2662 2663 use IPC::SysV; 2664 2665first to get the correct constant definitions. If CMD is C<IPC_STAT>, 2666then ARG must be a variable which will hold the returned C<msqid_ds> 2667structure. Returns like C<ioctl>: the undefined value for error, 2668C<"0 but true"> for zero, or the actual return value otherwise. See also 2669L<perlipc/"SysV IPC">, C<IPC::SysV>, and C<IPC::Semaphore> documentation. 2670 2671=item msgget KEY,FLAGS 2672 2673Calls the System V IPC function msgget(2). Returns the message queue 2674id, or the undefined value if there is an error. See also 2675L<perlipc/"SysV IPC"> and C<IPC::SysV> and C<IPC::Msg> documentation. 2676 2677=item msgrcv ID,VAR,SIZE,TYPE,FLAGS 2678 2679Calls the System V IPC function msgrcv to receive a message from 2680message queue ID into variable VAR with a maximum message size of 2681SIZE. Note that when a message is received, the message type as a 2682native long integer will be the first thing in VAR, followed by the 2683actual message. This packing may be opened with C<unpack("l! a*")>. 2684Taints the variable. Returns true if successful, or false if there is 2685an error. See also L<perlipc/"SysV IPC">, C<IPC::SysV>, and 2686C<IPC::SysV::Msg> documentation. 2687 2688=item msgsnd ID,MSG,FLAGS 2689 2690Calls the System V IPC function msgsnd to send the message MSG to the 2691message queue ID. MSG must begin with the native long integer message 2692type, and be followed by the length of the actual message, and finally 2693the message itself. This kind of packing can be achieved with 2694C<pack("l! a*", $type, $message)>. Returns true if successful, 2695or false if there is an error. See also C<IPC::SysV> 2696and C<IPC::SysV::Msg> documentation. 2697 2698=item my EXPR 2699 2700=item my TYPE EXPR 2701 2702=item my EXPR : ATTRS 2703 2704=item my TYPE EXPR : ATTRS 2705 2706A C<my> declares the listed variables to be local (lexically) to the 2707enclosing block, file, or C<eval>. If more than one value is listed, 2708the list must be placed in parentheses. 2709 2710The exact semantics and interface of TYPE and ATTRS are still 2711evolving. TYPE is currently bound to the use of C<fields> pragma, 2712and attributes are handled using the C<attributes> pragma, or starting 2713from Perl 5.8.0 also via the C<Attribute::Handlers> module. See 2714L<perlsub/"Private Variables via my()"> for details, and L<fields>, 2715L<attributes>, and L<Attribute::Handlers>. 2716 2717=item next LABEL 2718 2719=item next 2720 2721The C<next> command is like the C<continue> statement in C; it starts 2722the next iteration of the loop: 2723 2724 LINE: while (<STDIN>) { 2725 next LINE if /^#/; # discard comments 2726 #... 2727 } 2728 2729Note that if there were a C<continue> block on the above, it would get 2730executed even on discarded lines. If the LABEL is omitted, the command 2731refers to the innermost enclosing loop. 2732 2733C<next> cannot be used to exit a block which returns a value such as 2734C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 2735a grep() or map() operation. 2736 2737Note that a block by itself is semantically identical to a loop 2738that executes once. Thus C<next> will exit such a block early. 2739 2740See also L</continue> for an illustration of how C<last>, C<next>, and 2741C<redo> work. 2742 2743=item no Module VERSION LIST 2744 2745=item no Module VERSION 2746 2747=item no Module LIST 2748 2749=item no Module 2750 2751See the C<use> function, which C<no> is the opposite of. 2752 2753=item oct EXPR 2754 2755=item oct 2756 2757Interprets EXPR as an octal string and returns the corresponding 2758value. (If EXPR happens to start off with C<0x>, interprets it as a 2759hex string. If EXPR starts off with C<0b>, it is interpreted as a 2760binary string. Leading whitespace is ignored in all three cases.) 2761The following will handle decimal, binary, octal, and hex in the standard 2762Perl or C notation: 2763 2764 $val = oct($val) if $val =~ /^0/; 2765 2766If EXPR is omitted, uses C<$_>. To go the other way (produce a number 2767in octal), use sprintf() or printf(): 2768 2769 $perms = (stat("filename"))[2] & 07777; 2770 $oct_perms = sprintf "%lo", $perms; 2771 2772The oct() function is commonly used when a string such as C<644> needs 2773to be converted into a file mode, for example. (Although perl will 2774automatically convert strings into numbers as needed, this automatic 2775conversion assumes base 10.) 2776 2777=item open FILEHANDLE,EXPR 2778 2779=item open FILEHANDLE,MODE,EXPR 2780 2781=item open FILEHANDLE,MODE,EXPR,LIST 2782 2783=item open FILEHANDLE,MODE,REFERENCE 2784 2785=item open FILEHANDLE 2786 2787Opens the file whose filename is given by EXPR, and associates it with 2788FILEHANDLE. 2789 2790(The following is a comprehensive reference to open(): for a gentler 2791introduction you may consider L<perlopentut>.) 2792 2793If FILEHANDLE is an undefined scalar variable (or array or hash element) 2794the variable is assigned a reference to a new anonymous filehandle, 2795otherwise if FILEHANDLE is an expression, its value is used as the name of 2796the real filehandle wanted. (This is considered a symbolic reference, so 2797C<use strict 'refs'> should I<not> be in effect.) 2798 2799If EXPR is omitted, the scalar variable of the same name as the 2800FILEHANDLE contains the filename. (Note that lexical variables--those 2801declared with C<my>--will not work for this purpose; so if you're 2802using C<my>, specify EXPR in your call to open.) 2803 2804If three or more arguments are specified then the mode of opening and 2805the file name are separate. If MODE is C<< '<' >> or nothing, the file 2806is opened for input. If MODE is C<< '>' >>, the file is truncated and 2807opened for output, being created if necessary. If MODE is C<<< '>>' >>>, 2808the file is opened for appending, again being created if necessary. 2809 2810You can put a C<'+'> in front of the C<< '>' >> or C<< '<' >> to 2811indicate that you want both read and write access to the file; thus 2812C<< '+<' >> is almost always preferred for read/write updates--the C<< 2813'+>' >> mode would clobber the file first. You can't usually use 2814either read-write mode for updating textfiles, since they have 2815variable length records. See the B<-i> switch in L<perlrun> for a 2816better approach. The file is created with permissions of C<0666> 2817modified by the process' C<umask> value. 2818 2819These various prefixes correspond to the fopen(3) modes of C<'r'>, 2820C<'r+'>, C<'w'>, C<'w+'>, C<'a'>, and C<'a+'>. 2821 2822In the 2-arguments (and 1-argument) form of the call the mode and 2823filename should be concatenated (in this order), possibly separated by 2824spaces. It is possible to omit the mode in these forms if the mode is 2825C<< '<' >>. 2826 2827If the filename begins with C<'|'>, the filename is interpreted as a 2828command to which output is to be piped, and if the filename ends with a 2829C<'|'>, the filename is interpreted as a command which pipes output to 2830us. See L<perlipc/"Using open() for IPC"> 2831for more examples of this. (You are not allowed to C<open> to a command 2832that pipes both in I<and> out, but see L<IPC::Open2>, L<IPC::Open3>, 2833and L<perlipc/"Bidirectional Communication with Another Process"> 2834for alternatives.) 2835 2836For three or more arguments if MODE is C<'|-'>, the filename is 2837interpreted as a command to which output is to be piped, and if MODE 2838is C<'-|'>, the filename is interpreted as a command which pipes 2839output to us. In the 2-arguments (and 1-argument) form one should 2840replace dash (C<'-'>) with the command. 2841See L<perlipc/"Using open() for IPC"> for more examples of this. 2842(You are not allowed to C<open> to a command that pipes both in I<and> 2843out, but see L<IPC::Open2>, L<IPC::Open3>, and 2844L<perlipc/"Bidirectional Communication"> for alternatives.) 2845 2846In the three-or-more argument form of pipe opens, if LIST is specified 2847(extra arguments after the command name) then LIST becomes arguments 2848to the command invoked if the platform supports it. The meaning of 2849C<open> with more than three arguments for non-pipe modes is not yet 2850specified. Experimental "layers" may give extra LIST arguments 2851meaning. 2852 2853In the 2-arguments (and 1-argument) form opening C<'-'> opens STDIN 2854and opening C<< '>-' >> opens STDOUT. 2855 2856You may use the three-argument form of open to specify IO "layers" 2857(sometimes also referred to as "disciplines") to be applied to the handle 2858that affect how the input and output are processed (see L<open> and 2859L<PerlIO> for more details). For example 2860 2861 open(FH, "<:utf8", "file") 2862 2863will open the UTF-8 encoded file containing Unicode characters, 2864see L<perluniintro>. (Note that if layers are specified in the 2865three-arg form then default layers set by the C<open> pragma are 2866ignored.) 2867 2868Open returns nonzero upon success, the undefined value otherwise. If 2869the C<open> involved a pipe, the return value happens to be the pid of 2870the subprocess. 2871 2872If you're running Perl on a system that distinguishes between text 2873files and binary files, then you should check out L</binmode> for tips 2874for dealing with this. The key distinction between systems that need 2875C<binmode> and those that don't is their text file formats. Systems 2876like Unix, Mac OS, and Plan 9, which delimit lines with a single 2877character, and which encode that character in C as C<"\n">, do not 2878need C<binmode>. The rest need it. 2879 2880When opening a file, it's usually a bad idea to continue normal execution 2881if the request failed, so C<open> is frequently used in connection with 2882C<die>. Even if C<die> won't do what you want (say, in a CGI script, 2883where you want to make a nicely formatted error message (but there are 2884modules that can help with that problem)) you should always check 2885the return value from opening a file. The infrequent exception is when 2886working with an unopened filehandle is actually what you want to do. 2887 2888As a special case the 3 arg form with a read/write mode and the third 2889argument being C<undef>: 2890 2891 open(TMP, "+>", undef) or die ... 2892 2893opens a filehandle to an anonymous temporary file. Also using "+<" 2894works for symmetry, but you really should consider writing something 2895to the temporary file first. You will need to seek() to do the 2896reading. 2897 2898File handles can be opened to "in memory" files held in Perl scalars via: 2899 2900 open($fh, '>', \$variable) || .. 2901 2902Though if you try to re-open C<STDOUT> or C<STDERR> as an "in memory" 2903file, you have to close it first: 2904 2905 close STDOUT; 2906 open STDOUT, '>', \$variable or die "Can't open STDOUT: $!"; 2907 2908Examples: 2909 2910 $ARTICLE = 100; 2911 open ARTICLE or die "Can't find article $ARTICLE: $!\n"; 2912 while (<ARTICLE>) {... 2913 2914 open(LOG, '>>/usr/spool/news/twitlog'); # (log is reserved) 2915 # if the open fails, output is discarded 2916 2917 open(DBASE, '+<', 'dbase.mine') # open for update 2918 or die "Can't open 'dbase.mine' for update: $!"; 2919 2920 open(DBASE, '+<dbase.mine') # ditto 2921 or die "Can't open 'dbase.mine' for update: $!"; 2922 2923 open(ARTICLE, '-|', "caesar <$article") # decrypt article 2924 or die "Can't start caesar: $!"; 2925 2926 open(ARTICLE, "caesar <$article |") # ditto 2927 or die "Can't start caesar: $!"; 2928 2929 open(EXTRACT, "|sort >Tmp$$") # $$ is our process id 2930 or die "Can't start sort: $!"; 2931 2932 # in memory files 2933 open(MEMORY,'>', \$var) 2934 or die "Can't open memory file: $!"; 2935 print MEMORY "foo!\n"; # output will end up in $var 2936 2937 # process argument list of files along with any includes 2938 2939 foreach $file (@ARGV) { 2940 process($file, 'fh00'); 2941 } 2942 2943 sub process { 2944 my($filename, $input) = @_; 2945 $input++; # this is a string increment 2946 unless (open($input, $filename)) { 2947 print STDERR "Can't open $filename: $!\n"; 2948 return; 2949 } 2950 2951 local $_; 2952 while (<$input>) { # note use of indirection 2953 if (/^#include "(.*)"/) { 2954 process($1, $input); 2955 next; 2956 } 2957 #... # whatever 2958 } 2959 } 2960 2961You may also, in the Bourne shell tradition, specify an EXPR beginning 2962with C<< '>&' >>, in which case the rest of the string is interpreted 2963as the name of a filehandle (or file descriptor, if numeric) to be 2964duped (as L<dup(2)>) and opened. You may use C<&> after C<< > >>, 2965C<<< >> >>>, C<< < >>, C<< +> >>, C<<< +>> >>>, and C<< +< >>. 2966The mode you specify should match the mode of the original filehandle. 2967(Duping a filehandle does not take into account any existing contents 2968of IO buffers.) If you use the 3 arg form then you can pass either a 2969number, the name of a filehandle or the normal "reference to a glob". 2970 2971Here is a script that saves, redirects, and restores C<STDOUT> and 2972C<STDERR> using various methods: 2973 2974 #!/usr/bin/perl 2975 open my $oldout, ">&STDOUT" or die "Can't dup STDOUT: $!"; 2976 open OLDERR, ">&", \*STDERR or die "Can't dup STDERR: $!"; 2977 2978 open STDOUT, '>', "foo.out" or die "Can't redirect STDOUT: $!"; 2979 open STDERR, ">&STDOUT" or die "Can't dup STDOUT: $!"; 2980 2981 select STDERR; $| = 1; # make unbuffered 2982 select STDOUT; $| = 1; # make unbuffered 2983 2984 print STDOUT "stdout 1\n"; # this works for 2985 print STDERR "stderr 1\n"; # subprocesses too 2986 2987 open STDOUT, ">&", $oldout or die "Can't dup \$oldout: $!"; 2988 open STDERR, ">&OLDERR" or die "Can't dup OLDERR: $!"; 2989 2990 print STDOUT "stdout 2\n"; 2991 print STDERR "stderr 2\n"; 2992 2993If you specify C<< '<&=X' >>, where C<X> is a file descriptor number 2994or a filehandle, then Perl will do an equivalent of C's C<fdopen> of 2995that file descriptor (and not call L<dup(2)>); this is more 2996parsimonious of file descriptors. For example: 2997 2998 # open for input, reusing the fileno of $fd 2999 open(FILEHANDLE, "<&=$fd") 3000 3001or 3002 3003 open(FILEHANDLE, "<&=", $fd) 3004 3005or 3006 3007 # open for append, using the fileno of OLDFH 3008 open(FH, ">>&=", OLDFH) 3009 3010or 3011 3012 open(FH, ">>&=OLDFH") 3013 3014Being parsimonious on filehandles is also useful (besides being 3015parsimonious) for example when something is dependent on file 3016descriptors, like for example locking using flock(). If you do just 3017C<< open(A, '>>&B') >>, the filehandle A will not have the same file 3018descriptor as B, and therefore flock(A) will not flock(B), and vice 3019versa. But with C<< open(A, '>>&=B') >> the filehandles will share 3020the same file descriptor. 3021 3022Note that if you are using Perls older than 5.8.0, Perl will be using 3023the standard C libraries' fdopen() to implement the "=" functionality. 3024On many UNIX systems fdopen() fails when file descriptors exceed a 3025certain value, typically 255. For Perls 5.8.0 and later, PerlIO is 3026most often the default. 3027 3028You can see whether Perl has been compiled with PerlIO or not by 3029running C<perl -V> and looking for C<useperlio=> line. If C<useperlio> 3030is C<define>, you have PerlIO, otherwise you don't. 3031 3032If you open a pipe on the command C<'-'>, i.e., either C<'|-'> or C<'-|'> 3033with 2-arguments (or 1-argument) form of open(), then 3034there is an implicit fork done, and the return value of open is the pid 3035of the child within the parent process, and C<0> within the child 3036process. (Use C<defined($pid)> to determine whether the open was successful.) 3037The filehandle behaves normally for the parent, but i/o to that 3038filehandle is piped from/to the STDOUT/STDIN of the child process. 3039In the child process the filehandle isn't opened--i/o happens from/to 3040the new STDOUT or STDIN. Typically this is used like the normal 3041piped open when you want to exercise more control over just how the 3042pipe command gets executed, such as when you are running setuid, and 3043don't want to have to scan shell commands for metacharacters. 3044The following triples are more or less equivalent: 3045 3046 open(FOO, "|tr '[a-z]' '[A-Z]'"); 3047 open(FOO, '|-', "tr '[a-z]' '[A-Z]'"); 3048 open(FOO, '|-') || exec 'tr', '[a-z]', '[A-Z]'; 3049 open(FOO, '|-', "tr", '[a-z]', '[A-Z]'); 3050 3051 open(FOO, "cat -n '$file'|"); 3052 open(FOO, '-|', "cat -n '$file'"); 3053 open(FOO, '-|') || exec 'cat', '-n', $file; 3054 open(FOO, '-|', "cat", '-n', $file); 3055 3056The last example in each block shows the pipe as "list form", which is 3057not yet supported on all platforms. A good rule of thumb is that if 3058your platform has true C<fork()> (in other words, if your platform is 3059UNIX) you can use the list form. 3060 3061See L<perlipc/"Safe Pipe Opens"> for more examples of this. 3062 3063Beginning with v5.6.0, Perl will attempt to flush all files opened for 3064output before any operation that may do a fork, but this may not be 3065supported on some platforms (see L<perlport>). To be safe, you may need 3066to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method 3067of C<IO::Handle> on any open handles. 3068 3069On systems that support a close-on-exec flag on files, the flag will 3070be set for the newly opened file descriptor as determined by the value 3071of $^F. See L<perlvar/$^F>. 3072 3073Closing any piped filehandle causes the parent process to wait for the 3074child to finish, and returns the status value in C<$?>. 3075 3076The filename passed to 2-argument (or 1-argument) form of open() will 3077have leading and trailing whitespace deleted, and the normal 3078redirection characters honored. This property, known as "magic open", 3079can often be used to good effect. A user could specify a filename of 3080F<"rsh cat file |">, or you could change certain filenames as needed: 3081 3082 $filename =~ s/(.*\.gz)\s*$/gzip -dc < $1|/; 3083 open(FH, $filename) or die "Can't open $filename: $!"; 3084 3085Use 3-argument form to open a file with arbitrary weird characters in it, 3086 3087 open(FOO, '<', $file); 3088 3089otherwise it's necessary to protect any leading and trailing whitespace: 3090 3091 $file =~ s#^(\s)#./$1#; 3092 open(FOO, "< $file\0"); 3093 3094(this may not work on some bizarre filesystems). One should 3095conscientiously choose between the I<magic> and 3-arguments form 3096of open(): 3097 3098 open IN, $ARGV[0]; 3099 3100will allow the user to specify an argument of the form C<"rsh cat file |">, 3101but will not work on a filename which happens to have a trailing space, while 3102 3103 open IN, '<', $ARGV[0]; 3104 3105will have exactly the opposite restrictions. 3106 3107If you want a "real" C C<open> (see L<open(2)> on your system), then you 3108should use the C<sysopen> function, which involves no such magic (but 3109may use subtly different filemodes than Perl open(), which is mapped 3110to C fopen()). This is 3111another way to protect your filenames from interpretation. For example: 3112 3113 use IO::Handle; 3114 sysopen(HANDLE, $path, O_RDWR|O_CREAT|O_EXCL) 3115 or die "sysopen $path: $!"; 3116 $oldfh = select(HANDLE); $| = 1; select($oldfh); 3117 print HANDLE "stuff $$\n"; 3118 seek(HANDLE, 0, 0); 3119 print "File contains: ", <HANDLE>; 3120 3121Using the constructor from the C<IO::Handle> package (or one of its 3122subclasses, such as C<IO::File> or C<IO::Socket>), you can generate anonymous 3123filehandles that have the scope of whatever variables hold references to 3124them, and automatically close whenever and however you leave that scope: 3125 3126 use IO::File; 3127 #... 3128 sub read_myfile_munged { 3129 my $ALL = shift; 3130 my $handle = new IO::File; 3131 open($handle, "myfile") or die "myfile: $!"; 3132 $first = <$handle> 3133 or return (); # Automatically closed here. 3134 mung $first or die "mung failed"; # Or here. 3135 return $first, <$handle> if $ALL; # Or here. 3136 $first; # Or here. 3137 } 3138 3139See L</seek> for some details about mixing reading and writing. 3140 3141=item opendir DIRHANDLE,EXPR 3142 3143Opens a directory named EXPR for processing by C<readdir>, C<telldir>, 3144C<seekdir>, C<rewinddir>, and C<closedir>. Returns true if successful. 3145DIRHANDLE may be an expression whose value can be used as an indirect 3146dirhandle, usually the real dirhandle name. If DIRHANDLE is an undefined 3147scalar variable (or array or hash element), the variable is assigned a 3148reference to a new anonymous dirhandle. 3149DIRHANDLEs have their own namespace separate from FILEHANDLEs. 3150 3151=item ord EXPR 3152 3153=item ord 3154 3155Returns the numeric (the native 8-bit encoding, like ASCII or EBCDIC, 3156or Unicode) value of the first character of EXPR. If EXPR is omitted, 3157uses C<$_>. 3158 3159For the reverse, see L</chr>. 3160See L<perlunicode> and L<encoding> for more about Unicode. 3161 3162=item our EXPR 3163 3164=item our EXPR TYPE 3165 3166=item our EXPR : ATTRS 3167 3168=item our TYPE EXPR : ATTRS 3169 3170An C<our> declares the listed variables to be valid globals within 3171the enclosing block, file, or C<eval>. That is, it has the same 3172scoping rules as a "my" declaration, but does not create a local 3173variable. If more than one value is listed, the list must be placed 3174in parentheses. The C<our> declaration has no semantic effect unless 3175"use strict vars" is in effect, in which case it lets you use the 3176declared global variable without qualifying it with a package name. 3177(But only within the lexical scope of the C<our> declaration. In this 3178it differs from "use vars", which is package scoped.) 3179 3180An C<our> declaration declares a global variable that will be visible 3181across its entire lexical scope, even across package boundaries. The 3182package in which the variable is entered is determined at the point 3183of the declaration, not at the point of use. This means the following 3184behavior holds: 3185 3186 package Foo; 3187 our $bar; # declares $Foo::bar for rest of lexical scope 3188 $bar = 20; 3189 3190 package Bar; 3191 print $bar; # prints 20 3192 3193Multiple C<our> declarations in the same lexical scope are allowed 3194if they are in different packages. If they happened to be in the same 3195package, Perl will emit warnings if you have asked for them. 3196 3197 use warnings; 3198 package Foo; 3199 our $bar; # declares $Foo::bar for rest of lexical scope 3200 $bar = 20; 3201 3202 package Bar; 3203 our $bar = 30; # declares $Bar::bar for rest of lexical scope 3204 print $bar; # prints 30 3205 3206 our $bar; # emits warning 3207 3208An C<our> declaration may also have a list of attributes associated 3209with it. 3210 3211The exact semantics and interface of TYPE and ATTRS are still 3212evolving. TYPE is currently bound to the use of C<fields> pragma, 3213and attributes are handled using the C<attributes> pragma, or starting 3214from Perl 5.8.0 also via the C<Attribute::Handlers> module. See 3215L<perlsub/"Private Variables via my()"> for details, and L<fields>, 3216L<attributes>, and L<Attribute::Handlers>. 3217 3218The only currently recognized C<our()> attribute is C<unique> which 3219indicates that a single copy of the global is to be used by all 3220interpreters should the program happen to be running in a 3221multi-interpreter environment. (The default behaviour would be for 3222each interpreter to have its own copy of the global.) Examples: 3223 3224 our @EXPORT : unique = qw(foo); 3225 our %EXPORT_TAGS : unique = (bar => [qw(aa bb cc)]); 3226 our $VERSION : unique = "1.00"; 3227 3228Note that this attribute also has the effect of making the global 3229readonly when the first new interpreter is cloned (for example, 3230when the first new thread is created). 3231 3232Multi-interpreter environments can come to being either through the 3233fork() emulation on Windows platforms, or by embedding perl in a 3234multi-threaded application. The C<unique> attribute does nothing in 3235all other environments. 3236 3237Warning: the current implementation of this attribute operates on the 3238typeglob associated with the variable; this means that C<our $x : unique> 3239also has the effect of C<our @x : unique; our %x : unique>. This may be 3240subject to change. 3241 3242=item pack TEMPLATE,LIST 3243 3244Takes a LIST of values and converts it into a string using the rules 3245given by the TEMPLATE. The resulting string is the concatenation of 3246the converted values. Typically, each converted value looks 3247like its machine-level representation. For example, on 32-bit machines 3248a converted integer may be represented by a sequence of 4 bytes. 3249 3250The TEMPLATE is a sequence of characters that give the order and type 3251of values, as follows: 3252 3253 a A string with arbitrary binary data, will be null padded. 3254 A A text (ASCII) string, will be space padded. 3255 Z A null terminated (ASCIZ) string, will be null padded. 3256 3257 b A bit string (ascending bit order inside each byte, like vec()). 3258 B A bit string (descending bit order inside each byte). 3259 h A hex string (low nybble first). 3260 H A hex string (high nybble first). 3261 3262 c A signed char value. 3263 C An unsigned char value. Only does bytes. See U for Unicode. 3264 3265 s A signed short value. 3266 S An unsigned short value. 3267 (This 'short' is _exactly_ 16 bits, which may differ from 3268 what a local C compiler calls 'short'. If you want 3269 native-length shorts, use the '!' suffix.) 3270 3271 i A signed integer value. 3272 I An unsigned integer value. 3273 (This 'integer' is _at_least_ 32 bits wide. Its exact 3274 size depends on what a local C compiler calls 'int', 3275 and may even be larger than the 'long' described in 3276 the next item.) 3277 3278 l A signed long value. 3279 L An unsigned long value. 3280 (This 'long' is _exactly_ 32 bits, which may differ from 3281 what a local C compiler calls 'long'. If you want 3282 native-length longs, use the '!' suffix.) 3283 3284 n An unsigned short in "network" (big-endian) order. 3285 N An unsigned long in "network" (big-endian) order. 3286 v An unsigned short in "VAX" (little-endian) order. 3287 V An unsigned long in "VAX" (little-endian) order. 3288 (These 'shorts' and 'longs' are _exactly_ 16 bits and 3289 _exactly_ 32 bits, respectively.) 3290 3291 q A signed quad (64-bit) value. 3292 Q An unsigned quad value. 3293 (Quads are available only if your system supports 64-bit 3294 integer values _and_ if Perl has been compiled to support those. 3295 Causes a fatal error otherwise.) 3296 3297 j A signed integer value (a Perl internal integer, IV). 3298 J An unsigned integer value (a Perl internal unsigned integer, UV). 3299 3300 f A single-precision float in the native format. 3301 d A double-precision float in the native format. 3302 3303 F A floating point value in the native native format 3304 (a Perl internal floating point value, NV). 3305 D A long double-precision float in the native format. 3306 (Long doubles are available only if your system supports long 3307 double values _and_ if Perl has been compiled to support those. 3308 Causes a fatal error otherwise.) 3309 3310 p A pointer to a null-terminated string. 3311 P A pointer to a structure (fixed-length string). 3312 3313 u A uuencoded string. 3314 U A Unicode character number. Encodes to UTF-8 internally 3315 (or UTF-EBCDIC in EBCDIC platforms). 3316 3317 w A BER compressed integer. Its bytes represent an unsigned 3318 integer in base 128, most significant digit first, with as 3319 few digits as possible. Bit eight (the high bit) is set 3320 on each byte except the last. 3321 3322 x A null byte. 3323 X Back up a byte. 3324 @ Null fill to absolute position, counted from the start of 3325 the innermost ()-group. 3326 ( Start of a ()-group. 3327 3328The following rules apply: 3329 3330=over 8 3331 3332=item * 3333 3334Each letter may optionally be followed by a number giving a repeat 3335count. With all types except C<a>, C<A>, C<Z>, C<b>, C<B>, C<h>, 3336C<H>, C<@>, C<x>, C<X> and C<P> the pack function will gobble up that 3337many values from the LIST. A C<*> for the repeat count means to use 3338however many items are left, except for C<@>, C<x>, C<X>, where it is 3339equivalent to C<0>, and C<u>, where it is equivalent to 1 (or 45, what 3340is the same). A numeric repeat count may optionally be enclosed in 3341brackets, as in C<pack 'C[80]', @arr>. 3342 3343One can replace the numeric repeat count by a template enclosed in brackets; 3344then the packed length of this template in bytes is used as a count. 3345For example, C<x[L]> skips a long (it skips the number of bytes in a long); 3346the template C<$t X[$t] $t> unpack()s twice what $t unpacks. 3347If the template in brackets contains alignment commands (such as C<x![d]>), 3348its packed length is calculated as if the start of the template has the maximal 3349possible alignment. 3350 3351When used with C<Z>, C<*> results in the addition of a trailing null 3352byte (so the packed result will be one longer than the byte C<length> 3353of the item). 3354 3355The repeat count for C<u> is interpreted as the maximal number of bytes 3356to encode per line of output, with 0 and 1 replaced by 45. 3357 3358=item * 3359 3360The C<a>, C<A>, and C<Z> types gobble just one value, but pack it as a 3361string of length count, padding with nulls or spaces as necessary. When 3362unpacking, C<A> strips trailing spaces and nulls, C<Z> strips everything 3363after the first null, and C<a> returns data verbatim. When packing, 3364C<a>, and C<Z> are equivalent. 3365 3366If the value-to-pack is too long, it is truncated. If too long and an 3367explicit count is provided, C<Z> packs only C<$count-1> bytes, followed 3368by a null byte. Thus C<Z> always packs a trailing null byte under 3369all circumstances. 3370 3371=item * 3372 3373Likewise, the C<b> and C<B> fields pack a string that many bits long. 3374Each byte of the input field of pack() generates 1 bit of the result. 3375Each result bit is based on the least-significant bit of the corresponding 3376input byte, i.e., on C<ord($byte)%2>. In particular, bytes C<"0"> and 3377C<"1"> generate bits 0 and 1, as do bytes C<"\0"> and C<"\1">. 3378 3379Starting from the beginning of the input string of pack(), each 8-tuple 3380of bytes is converted to 1 byte of output. With format C<b> 3381the first byte of the 8-tuple determines the least-significant bit of a 3382byte, and with format C<B> it determines the most-significant bit of 3383a byte. 3384 3385If the length of the input string is not exactly divisible by 8, the 3386remainder is packed as if the input string were padded by null bytes 3387at the end. Similarly, during unpack()ing the "extra" bits are ignored. 3388 3389If the input string of pack() is longer than needed, extra bytes are ignored. 3390A C<*> for the repeat count of pack() means to use all the bytes of 3391the input field. On unpack()ing the bits are converted to a string 3392of C<"0">s and C<"1">s. 3393 3394=item * 3395 3396The C<h> and C<H> fields pack a string that many nybbles (4-bit groups, 3397representable as hexadecimal digits, 0-9a-f) long. 3398 3399Each byte of the input field of pack() generates 4 bits of the result. 3400For non-alphabetical bytes the result is based on the 4 least-significant 3401bits of the input byte, i.e., on C<ord($byte)%16>. In particular, 3402bytes C<"0"> and C<"1"> generate nybbles 0 and 1, as do bytes 3403C<"\0"> and C<"\1">. For bytes C<"a".."f"> and C<"A".."F"> the result 3404is compatible with the usual hexadecimal digits, so that C<"a"> and 3405C<"A"> both generate the nybble C<0xa==10>. The result for bytes 3406C<"g".."z"> and C<"G".."Z"> is not well-defined. 3407 3408Starting from the beginning of the input string of pack(), each pair 3409of bytes is converted to 1 byte of output. With format C<h> the 3410first byte of the pair determines the least-significant nybble of the 3411output byte, and with format C<H> it determines the most-significant 3412nybble. 3413 3414If the length of the input string is not even, it behaves as if padded 3415by a null byte at the end. Similarly, during unpack()ing the "extra" 3416nybbles are ignored. 3417 3418If the input string of pack() is longer than needed, extra bytes are ignored. 3419A C<*> for the repeat count of pack() means to use all the bytes of 3420the input field. On unpack()ing the bits are converted to a string 3421of hexadecimal digits. 3422 3423=item * 3424 3425The C<p> type packs a pointer to a null-terminated string. You are 3426responsible for ensuring the string is not a temporary value (which can 3427potentially get deallocated before you get around to using the packed result). 3428The C<P> type packs a pointer to a structure of the size indicated by the 3429length. A NULL pointer is created if the corresponding value for C<p> or 3430C<P> is C<undef>, similarly for unpack(). 3431 3432=item * 3433 3434The C</> template character allows packing and unpacking of strings where 3435the packed structure contains a byte count followed by the string itself. 3436You write I<length-item>C</>I<string-item>. 3437 3438The I<length-item> can be any C<pack> template letter, and describes 3439how the length value is packed. The ones likely to be of most use are 3440integer-packing ones like C<n> (for Java strings), C<w> (for ASN.1 or 3441SNMP) and C<N> (for Sun XDR). 3442 3443For C<pack>, the I<string-item> must, at present, be C<"A*">, C<"a*"> or 3444C<"Z*">. For C<unpack> the length of the string is obtained from the 3445I<length-item>, but if you put in the '*' it will be ignored. For all other 3446codes, C<unpack> applies the length value to the next item, which must not 3447have a repeat count. 3448 3449 unpack 'C/a', "\04Gurusamy"; gives 'Guru' 3450 unpack 'a3/A* A*', '007 Bond J '; gives (' Bond','J') 3451 pack 'n/a* w/a*','hello,','world'; gives "\000\006hello,\005world" 3452 3453The I<length-item> is not returned explicitly from C<unpack>. 3454 3455Adding a count to the I<length-item> letter is unlikely to do anything 3456useful, unless that letter is C<A>, C<a> or C<Z>. Packing with a 3457I<length-item> of C<a> or C<Z> may introduce C<"\000"> characters, 3458which Perl does not regard as legal in numeric strings. 3459 3460=item * 3461 3462The integer types C<s>, C<S>, C<l>, and C<L> may be 3463immediately followed by a C<!> suffix to signify native shorts or 3464longs--as you can see from above for example a bare C<l> does mean 3465exactly 32 bits, the native C<long> (as seen by the local C compiler) 3466may be larger. This is an issue mainly in 64-bit platforms. You can 3467see whether using C<!> makes any difference by 3468 3469 print length(pack("s")), " ", length(pack("s!")), "\n"; 3470 print length(pack("l")), " ", length(pack("l!")), "\n"; 3471 3472C<i!> and C<I!> also work but only because of completeness; 3473they are identical to C<i> and C<I>. 3474 3475The actual sizes (in bytes) of native shorts, ints, longs, and long 3476longs on the platform where Perl was built are also available via 3477L<Config>: 3478 3479 use Config; 3480 print $Config{shortsize}, "\n"; 3481 print $Config{intsize}, "\n"; 3482 print $Config{longsize}, "\n"; 3483 print $Config{longlongsize}, "\n"; 3484 3485(The C<$Config{longlongsize}> will be undefined if your system does 3486not support long longs.) 3487 3488=item * 3489 3490The integer formats C<s>, C<S>, C<i>, C<I>, C<l>, C<L>, C<j>, and C<J> 3491are inherently non-portable between processors and operating systems 3492because they obey the native byteorder and endianness. For example a 34934-byte integer 0x12345678 (305419896 decimal) would be ordered natively 3494(arranged in and handled by the CPU registers) into bytes as 3495 3496 0x12 0x34 0x56 0x78 # big-endian 3497 0x78 0x56 0x34 0x12 # little-endian 3498 3499Basically, the Intel and VAX CPUs are little-endian, while everybody 3500else, for example Motorola m68k/88k, PPC, Sparc, HP PA, Power, and 3501Cray are big-endian. Alpha and MIPS can be either: Digital/Compaq 3502used/uses them in little-endian mode; SGI/Cray uses them in big-endian 3503mode. 3504 3505The names `big-endian' and `little-endian' are comic references to 3506the classic "Gulliver's Travels" (via the paper "On Holy Wars and a 3507Plea for Peace" by Danny Cohen, USC/ISI IEN 137, April 1, 1980) and 3508the egg-eating habits of the Lilliputians. 3509 3510Some systems may have even weirder byte orders such as 3511 3512 0x56 0x78 0x12 0x34 3513 0x34 0x12 0x78 0x56 3514 3515You can see your system's preference with 3516 3517 print join(" ", map { sprintf "%#02x", $_ } 3518 unpack("C*",pack("L",0x12345678))), "\n"; 3519 3520The byteorder on the platform where Perl was built is also available 3521via L<Config>: 3522 3523 use Config; 3524 print $Config{byteorder}, "\n"; 3525 3526Byteorders C<'1234'> and C<'12345678'> are little-endian, C<'4321'> 3527and C<'87654321'> are big-endian. 3528 3529If you want portable packed integers use the formats C<n>, C<N>, 3530C<v>, and C<V>, their byte endianness and size are known. 3531See also L<perlport>. 3532 3533=item * 3534 3535Real numbers (floats and doubles) are in the native machine format only; 3536due to the multiplicity of floating formats around, and the lack of a 3537standard "network" representation, no facility for interchange has been 3538made. This means that packed floating point data written on one machine 3539may not be readable on another - even if both use IEEE floating point 3540arithmetic (as the endian-ness of the memory representation is not part 3541of the IEEE spec). See also L<perlport>. 3542 3543Note that Perl uses doubles internally for all numeric calculation, and 3544converting from double into float and thence back to double again will 3545lose precision (i.e., C<unpack("f", pack("f", $foo)>) will not in general 3546equal $foo). 3547 3548=item * 3549 3550If the pattern begins with a C<U>, the resulting string will be 3551treated as UTF-8-encoded Unicode. You can force UTF-8 encoding on in a 3552string with an initial C<U0>, and the bytes that follow will be 3553interpreted as Unicode characters. If you don't want this to happen, 3554you can begin your pattern with C<C0> (or anything else) to force Perl 3555not to UTF-8 encode your string, and then follow this with a C<U*> 3556somewhere in your pattern. 3557 3558=item * 3559 3560You must yourself do any alignment or padding by inserting for example 3561enough C<'x'>es while packing. There is no way to pack() and unpack() 3562could know where the bytes are going to or coming from. Therefore 3563C<pack> (and C<unpack>) handle their output and input as flat 3564sequences of bytes. 3565 3566=item * 3567 3568A ()-group is a sub-TEMPLATE enclosed in parentheses. A group may 3569take a repeat count, both as postfix, and for unpack() also via the C</> 3570template character. Within each repetition of a group, positioning with 3571C<@> starts again at 0. Therefore, the result of 3572 3573 pack( '@1A((@2A)@3A)', 'a', 'b', 'c' ) 3574 3575is the string "\0a\0\0bc". 3576 3577 3578=item * 3579 3580C<x> and C<X> accept C<!> modifier. In this case they act as 3581alignment commands: they jump forward/back to the closest position 3582aligned at a multiple of C<count> bytes. For example, to pack() or 3583unpack() C's C<struct {char c; double d; char cc[2]}> one may need to 3584use the template C<C x![d] d C[2]>; this assumes that doubles must be 3585aligned on the double's size. 3586 3587For alignment commands C<count> of 0 is equivalent to C<count> of 1; 3588both result in no-ops. 3589 3590=item * 3591 3592A comment in a TEMPLATE starts with C<#> and goes to the end of line. 3593White space may be used to separate pack codes from each other, but 3594a C<!> modifier and a repeat count must follow immediately. 3595 3596=item * 3597 3598If TEMPLATE requires more arguments to pack() than actually given, pack() 3599assumes additional C<""> arguments. If TEMPLATE requires less arguments 3600to pack() than actually given, extra arguments are ignored. 3601 3602=back 3603 3604Examples: 3605 3606 $foo = pack("CCCC",65,66,67,68); 3607 # foo eq "ABCD" 3608 $foo = pack("C4",65,66,67,68); 3609 # same thing 3610 $foo = pack("U4",0x24b6,0x24b7,0x24b8,0x24b9); 3611 # same thing with Unicode circled letters 3612 3613 $foo = pack("ccxxcc",65,66,67,68); 3614 # foo eq "AB\0\0CD" 3615 3616 # note: the above examples featuring "C" and "c" are true 3617 # only on ASCII and ASCII-derived systems such as ISO Latin 1 3618 # and UTF-8. In EBCDIC the first example would be 3619 # $foo = pack("CCCC",193,194,195,196); 3620 3621 $foo = pack("s2",1,2); 3622 # "\1\0\2\0" on little-endian 3623 # "\0\1\0\2" on big-endian 3624 3625 $foo = pack("a4","abcd","x","y","z"); 3626 # "abcd" 3627 3628 $foo = pack("aaaa","abcd","x","y","z"); 3629 # "axyz" 3630 3631 $foo = pack("a14","abcdefg"); 3632 # "abcdefg\0\0\0\0\0\0\0" 3633 3634 $foo = pack("i9pl", gmtime); 3635 # a real struct tm (on my system anyway) 3636 3637 $utmp_template = "Z8 Z8 Z16 L"; 3638 $utmp = pack($utmp_template, @utmp1); 3639 # a struct utmp (BSDish) 3640 3641 @utmp2 = unpack($utmp_template, $utmp); 3642 # "@utmp1" eq "@utmp2" 3643 3644 sub bintodec { 3645 unpack("N", pack("B32", substr("0" x 32 . shift, -32))); 3646 } 3647 3648 $foo = pack('sx2l', 12, 34); 3649 # short 12, two zero bytes padding, long 34 3650 $bar = pack('s@4l', 12, 34); 3651 # short 12, zero fill to position 4, long 34 3652 # $foo eq $bar 3653 3654The same template may generally also be used in unpack(). 3655 3656=item package NAMESPACE 3657 3658=item package 3659 3660Declares the compilation unit as being in the given namespace. The scope 3661of the package declaration is from the declaration itself through the end 3662of the enclosing block, file, or eval (the same as the C<my> operator). 3663All further unqualified dynamic identifiers will be in this namespace. 3664A package statement affects only dynamic variables--including those 3665you've used C<local> on--but I<not> lexical variables, which are created 3666with C<my>. Typically it would be the first declaration in a file to 3667be included by the C<require> or C<use> operator. You can switch into a 3668package in more than one place; it merely influences which symbol table 3669is used by the compiler for the rest of that block. You can refer to 3670variables and filehandles in other packages by prefixing the identifier 3671with the package name and a double colon: C<$Package::Variable>. 3672If the package name is null, the C<main> package as assumed. That is, 3673C<$::sail> is equivalent to C<$main::sail> (as well as to C<$main'sail>, 3674still seen in older code). 3675 3676If NAMESPACE is omitted, then there is no current package, and all 3677identifiers must be fully qualified or lexicals. However, you are 3678strongly advised not to make use of this feature. Its use can cause 3679unexpected behaviour, even crashing some versions of Perl. It is 3680deprecated, and will be removed from a future release. 3681 3682See L<perlmod/"Packages"> for more information about packages, modules, 3683and classes. See L<perlsub> for other scoping issues. 3684 3685=item pipe READHANDLE,WRITEHANDLE 3686 3687Opens a pair of connected pipes like the corresponding system call. 3688Note that if you set up a loop of piped processes, deadlock can occur 3689unless you are very careful. In addition, note that Perl's pipes use 3690IO buffering, so you may need to set C<$|> to flush your WRITEHANDLE 3691after each command, depending on the application. 3692 3693See L<IPC::Open2>, L<IPC::Open3>, and L<perlipc/"Bidirectional Communication"> 3694for examples of such things. 3695 3696On systems that support a close-on-exec flag on files, the flag will be set 3697for the newly opened file descriptors as determined by the value of $^F. 3698See L<perlvar/$^F>. 3699 3700=item pop ARRAY 3701 3702=item pop 3703 3704Pops and returns the last value of the array, shortening the array by 3705one element. Has an effect similar to 3706 3707 $ARRAY[$#ARRAY--] 3708 3709If there are no elements in the array, returns the undefined value 3710(although this may happen at other times as well). If ARRAY is 3711omitted, pops the C<@ARGV> array in the main program, and the C<@_> 3712array in subroutines, just like C<shift>. 3713 3714=item pos SCALAR 3715 3716=item pos 3717 3718Returns the offset of where the last C<m//g> search left off for the variable 3719in question (C<$_> is used when the variable is not specified). May be 3720modified to change that offset. Such modification will also influence 3721the C<\G> zero-width assertion in regular expressions. See L<perlre> and 3722L<perlop>. 3723 3724=item print FILEHANDLE LIST 3725 3726=item print LIST 3727 3728=item print 3729 3730Prints a string or a list of strings. Returns true if successful. 3731FILEHANDLE may be a scalar variable name, in which case the variable 3732contains the name of or a reference to the filehandle, thus introducing 3733one level of indirection. (NOTE: If FILEHANDLE is a variable and 3734the next token is a term, it may be misinterpreted as an operator 3735unless you interpose a C<+> or put parentheses around the arguments.) 3736If FILEHANDLE is omitted, prints by default to standard output (or 3737to the last selected output channel--see L</select>). If LIST is 3738also omitted, prints C<$_> to the currently selected output channel. 3739To set the default output channel to something other than STDOUT 3740use the select operation. The current value of C<$,> (if any) is 3741printed between each LIST item. The current value of C<$\> (if 3742any) is printed after the entire LIST has been printed. Because 3743print takes a LIST, anything in the LIST is evaluated in list 3744context, and any subroutine that you call will have one or more of 3745its expressions evaluated in list context. Also be careful not to 3746follow the print keyword with a left parenthesis unless you want 3747the corresponding right parenthesis to terminate the arguments to 3748the print--interpose a C<+> or put parentheses around all the 3749arguments. 3750 3751Note that if you're storing FILEHANDLES in an array or other expression, 3752you will have to use a block returning its value instead: 3753 3754 print { $files[$i] } "stuff\n"; 3755 print { $OK ? STDOUT : STDERR } "stuff\n"; 3756 3757=item printf FILEHANDLE FORMAT, LIST 3758 3759=item printf FORMAT, LIST 3760 3761Equivalent to C<print FILEHANDLE sprintf(FORMAT, LIST)>, except that C<$\> 3762(the output record separator) is not appended. The first argument 3763of the list will be interpreted as the C<printf> format. See C<sprintf> 3764for an explanation of the format argument. If C<use locale> is in effect, 3765the character used for the decimal point in formatted real numbers is 3766affected by the LC_NUMERIC locale. See L<perllocale>. 3767 3768Don't fall into the trap of using a C<printf> when a simple 3769C<print> would do. The C<print> is more efficient and less 3770error prone. 3771 3772=item prototype FUNCTION 3773 3774Returns the prototype of a function as a string (or C<undef> if the 3775function has no prototype). FUNCTION is a reference to, or the name of, 3776the function whose prototype you want to retrieve. 3777 3778If FUNCTION is a string starting with C<CORE::>, the rest is taken as a 3779name for Perl builtin. If the builtin is not I<overridable> (such as 3780C<qw//>) or its arguments cannot be expressed by a prototype (such as 3781C<system>) returns C<undef> because the builtin does not really behave 3782like a Perl function. Otherwise, the string describing the equivalent 3783prototype is returned. 3784 3785=item push ARRAY,LIST 3786 3787Treats ARRAY as a stack, and pushes the values of LIST 3788onto the end of ARRAY. The length of ARRAY increases by the length of 3789LIST. Has the same effect as 3790 3791 for $value (LIST) { 3792 $ARRAY[++$#ARRAY] = $value; 3793 } 3794 3795but is more efficient. Returns the new number of elements in the array. 3796 3797=item q/STRING/ 3798 3799=item qq/STRING/ 3800 3801=item qr/STRING/ 3802 3803=item qx/STRING/ 3804 3805=item qw/STRING/ 3806 3807Generalized quotes. See L<perlop/"Regexp Quote-Like Operators">. 3808 3809=item quotemeta EXPR 3810 3811=item quotemeta 3812 3813Returns the value of EXPR with all non-"word" 3814characters backslashed. (That is, all characters not matching 3815C</[A-Za-z_0-9]/> will be preceded by a backslash in the 3816returned string, regardless of any locale settings.) 3817This is the internal function implementing 3818the C<\Q> escape in double-quoted strings. 3819 3820If EXPR is omitted, uses C<$_>. 3821 3822=item rand EXPR 3823 3824=item rand 3825 3826Returns a random fractional number greater than or equal to C<0> and less 3827than the value of EXPR. (EXPR should be positive.) If EXPR is 3828omitted, the value C<1> is used. Currently EXPR with the value C<0> is 3829also special-cased as C<1> - this has not been documented before perl 5.8.0 3830and is subject to change in future versions of perl. Automatically calls 3831C<srand> unless C<srand> has already been called. See also C<srand>. 3832 3833Apply C<int()> to the value returned by C<rand()> if you want random 3834integers instead of random fractional numbers. For example, 3835 3836 int(rand(10)) 3837 3838returns a random integer between C<0> and C<9>, inclusive. 3839 3840(Note: If your rand function consistently returns numbers that are too 3841large or too small, then your version of Perl was probably compiled 3842with the wrong number of RANDBITS.) 3843 3844=item read FILEHANDLE,SCALAR,LENGTH,OFFSET 3845 3846=item read FILEHANDLE,SCALAR,LENGTH 3847 3848Attempts to read LENGTH I<characters> of data into variable SCALAR 3849from the specified FILEHANDLE. Returns the number of characters 3850actually read, C<0> at end of file, or undef if there was an error (in 3851the latter case C<$!> is also set). SCALAR will be grown or shrunk 3852so that the last character actually read is the last character of the 3853scalar after the read. 3854 3855An OFFSET may be specified to place the read data at some place in the 3856string other than the beginning. A negative OFFSET specifies 3857placement at that many characters counting backwards from the end of 3858the string. A positive OFFSET greater than the length of SCALAR 3859results in the string being padded to the required size with C<"\0"> 3860bytes before the result of the read is appended. 3861 3862The call is actually implemented in terms of either Perl's or system's 3863fread() call. To get a true read(2) system call, see C<sysread>. 3864 3865Note the I<characters>: depending on the status of the filehandle, 3866either (8-bit) bytes or characters are read. By default all 3867filehandles operate on bytes, but for example if the filehandle has 3868been opened with the C<:utf8> I/O layer (see L</open>, and the C<open> 3869pragma, L<open>), the I/O will operate on UTF-8 encoded Unicode 3870characters, not bytes. Similarly for the C<:encoding> pragma: 3871in that case pretty much any characters can be read. 3872 3873=item readdir DIRHANDLE 3874 3875Returns the next directory entry for a directory opened by C<opendir>. 3876If used in list context, returns all the rest of the entries in the 3877directory. If there are no more entries, returns an undefined value in 3878scalar context or a null list in list context. 3879 3880If you're planning to filetest the return values out of a C<readdir>, you'd 3881better prepend the directory in question. Otherwise, because we didn't 3882C<chdir> there, it would have been testing the wrong file. 3883 3884 opendir(DIR, $some_dir) || die "can't opendir $some_dir: $!"; 3885 @dots = grep { /^\./ && -f "$some_dir/$_" } readdir(DIR); 3886 closedir DIR; 3887 3888=item readline EXPR 3889 3890Reads from the filehandle whose typeglob is contained in EXPR. In scalar 3891context, each call reads and returns the next line, until end-of-file is 3892reached, whereupon the subsequent call returns undef. In list context, 3893reads until end-of-file is reached and returns a list of lines. Note that 3894the notion of "line" used here is however you may have defined it 3895with C<$/> or C<$INPUT_RECORD_SEPARATOR>). See L<perlvar/"$/">. 3896 3897When C<$/> is set to C<undef>, when readline() is in scalar 3898context (i.e. file slurp mode), and when an empty file is read, it 3899returns C<''> the first time, followed by C<undef> subsequently. 3900 3901This is the internal function implementing the C<< <EXPR> >> 3902operator, but you can use it directly. The C<< <EXPR> >> 3903operator is discussed in more detail in L<perlop/"I/O Operators">. 3904 3905 $line = <STDIN>; 3906 $line = readline(*STDIN); # same thing 3907 3908If readline encounters an operating system error, C<$!> will be set with the 3909corresponding error message. It can be helpful to check C<$!> when you are 3910reading from filehandles you don't trust, such as a tty or a socket. The 3911following example uses the operator form of C<readline>, and takes the necessary 3912steps to ensure that C<readline> was successful. 3913 3914 for (;;) { 3915 undef $!; 3916 unless (defined( $line = <> )) { 3917 die $! if $!; 3918 last; # reached EOF 3919 } 3920 # ... 3921 } 3922 3923=item readlink EXPR 3924 3925=item readlink 3926 3927Returns the value of a symbolic link, if symbolic links are 3928implemented. If not, gives a fatal error. If there is some system 3929error, returns the undefined value and sets C<$!> (errno). If EXPR is 3930omitted, uses C<$_>. 3931 3932=item readpipe EXPR 3933 3934EXPR is executed as a system command. 3935The collected standard output of the command is returned. 3936In scalar context, it comes back as a single (potentially 3937multi-line) string. In list context, returns a list of lines 3938(however you've defined lines with C<$/> or C<$INPUT_RECORD_SEPARATOR>). 3939This is the internal function implementing the C<qx/EXPR/> 3940operator, but you can use it directly. The C<qx/EXPR/> 3941operator is discussed in more detail in L<perlop/"I/O Operators">. 3942 3943=item recv SOCKET,SCALAR,LENGTH,FLAGS 3944 3945Receives a message on a socket. Attempts to receive LENGTH characters 3946of data into variable SCALAR from the specified SOCKET filehandle. 3947SCALAR will be grown or shrunk to the length actually read. Takes the 3948same flags as the system call of the same name. Returns the address 3949of the sender if SOCKET's protocol supports this; returns an empty 3950string otherwise. If there's an error, returns the undefined value. 3951This call is actually implemented in terms of recvfrom(2) system call. 3952See L<perlipc/"UDP: Message Passing"> for examples. 3953 3954Note the I<characters>: depending on the status of the socket, either 3955(8-bit) bytes or characters are received. By default all sockets 3956operate on bytes, but for example if the socket has been changed using 3957binmode() to operate with the C<:utf8> I/O layer (see the C<open> 3958pragma, L<open>), the I/O will operate on UTF-8 encoded Unicode 3959characters, not bytes. Similarly for the C<:encoding> pragma: 3960in that case pretty much any characters can be read. 3961 3962=item redo LABEL 3963 3964=item redo 3965 3966The C<redo> command restarts the loop block without evaluating the 3967conditional again. The C<continue> block, if any, is not executed. If 3968the LABEL is omitted, the command refers to the innermost enclosing 3969loop. This command is normally used by programs that want to lie to 3970themselves about what was just input: 3971 3972 # a simpleminded Pascal comment stripper 3973 # (warning: assumes no { or } in strings) 3974 LINE: while (<STDIN>) { 3975 while (s|({.*}.*){.*}|$1 |) {} 3976 s|{.*}| |; 3977 if (s|{.*| |) { 3978 $front = $_; 3979 while (<STDIN>) { 3980 if (/}/) { # end of comment? 3981 s|^|$front\{|; 3982 redo LINE; 3983 } 3984 } 3985 } 3986 print; 3987 } 3988 3989C<redo> cannot be used to retry a block which returns a value such as 3990C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 3991a grep() or map() operation. 3992 3993Note that a block by itself is semantically identical to a loop 3994that executes once. Thus C<redo> inside such a block will effectively 3995turn it into a looping construct. 3996 3997See also L</continue> for an illustration of how C<last>, C<next>, and 3998C<redo> work. 3999 4000=item ref EXPR 4001 4002=item ref 4003 4004Returns a non-empty string if EXPR is a reference, the empty 4005string otherwise. If EXPR 4006is not specified, C<$_> will be used. The value returned depends on the 4007type of thing the reference is a reference to. 4008Builtin types include: 4009 4010 SCALAR 4011 ARRAY 4012 HASH 4013 CODE 4014 REF 4015 GLOB 4016 LVALUE 4017 4018If the referenced object has been blessed into a package, then that package 4019name is returned instead. You can think of C<ref> as a C<typeof> operator. 4020 4021 if (ref($r) eq "HASH") { 4022 print "r is a reference to a hash.\n"; 4023 } 4024 unless (ref($r)) { 4025 print "r is not a reference at all.\n"; 4026 } 4027 if (UNIVERSAL::isa($r, "HASH")) { # for subclassing 4028 print "r is a reference to something that isa hash.\n"; 4029 } 4030 4031See also L<perlref>. 4032 4033=item rename OLDNAME,NEWNAME 4034 4035Changes the name of a file; an existing file NEWNAME will be 4036clobbered. Returns true for success, false otherwise. 4037 4038Behavior of this function varies wildly depending on your system 4039implementation. For example, it will usually not work across file system 4040boundaries, even though the system I<mv> command sometimes compensates 4041for this. Other restrictions include whether it works on directories, 4042open files, or pre-existing files. Check L<perlport> and either the 4043rename(2) manpage or equivalent system documentation for details. 4044 4045=item require VERSION 4046 4047=item require EXPR 4048 4049=item require 4050 4051Demands a version of Perl specified by VERSION, or demands some semantics 4052specified by EXPR or by C<$_> if EXPR is not supplied. 4053 4054VERSION may be either a numeric argument such as 5.006, which will be 4055compared to C<$]>, or a literal of the form v5.6.1, which will be compared 4056to C<$^V> (aka $PERL_VERSION). A fatal error is produced at run time if 4057VERSION is greater than the version of the current Perl interpreter. 4058Compare with L</use>, which can do a similar check at compile time. 4059 4060Specifying VERSION as a literal of the form v5.6.1 should generally be 4061avoided, because it leads to misleading error messages under earlier 4062versions of Perl which do not support this syntax. The equivalent numeric 4063version should be used instead. 4064 4065 require v5.6.1; # run time version check 4066 require 5.6.1; # ditto 4067 require 5.006_001; # ditto; preferred for backwards compatibility 4068 4069Otherwise, demands that a library file be included if it hasn't already 4070been included. The file is included via the do-FILE mechanism, which is 4071essentially just a variety of C<eval>. Has semantics similar to the 4072following subroutine: 4073 4074 sub require { 4075 my ($filename) = @_; 4076 if (exists $INC{$filename}) { 4077 return 1 if $INC{$filename}; 4078 die "Compilation failed in require"; 4079 } 4080 my ($realfilename,$result); 4081 ITER: { 4082 foreach $prefix (@INC) { 4083 $realfilename = "$prefix/$filename"; 4084 if (-f $realfilename) { 4085 $INC{$filename} = $realfilename; 4086 $result = do $realfilename; 4087 last ITER; 4088 } 4089 } 4090 die "Can't find $filename in \@INC"; 4091 } 4092 if ($@) { 4093 $INC{$filename} = undef; 4094 die $@; 4095 } elsif (!$result) { 4096 delete $INC{$filename}; 4097 die "$filename did not return true value"; 4098 } else { 4099 return $result; 4100 } 4101 } 4102 4103Note that the file will not be included twice under the same specified 4104name. 4105 4106The file must return true as the last statement to indicate 4107successful execution of any initialization code, so it's customary to 4108end such a file with C<1;> unless you're sure it'll return true 4109otherwise. But it's better just to put the C<1;>, in case you add more 4110statements. 4111 4112If EXPR is a bareword, the require assumes a "F<.pm>" extension and 4113replaces "F<::>" with "F</>" in the filename for you, 4114to make it easy to load standard modules. This form of loading of 4115modules does not risk altering your namespace. 4116 4117In other words, if you try this: 4118 4119 require Foo::Bar; # a splendid bareword 4120 4121The require function will actually look for the "F<Foo/Bar.pm>" file in the 4122directories specified in the C<@INC> array. 4123 4124But if you try this: 4125 4126 $class = 'Foo::Bar'; 4127 require $class; # $class is not a bareword 4128 #or 4129 require "Foo::Bar"; # not a bareword because of the "" 4130 4131The require function will look for the "F<Foo::Bar>" file in the @INC array and 4132will complain about not finding "F<Foo::Bar>" there. In this case you can do: 4133 4134 eval "require $class"; 4135 4136Now that you understand how C<require> looks for files in the case of 4137a bareword argument, there is a little extra functionality going on 4138behind the scenes. Before C<require> looks for a "F<.pm>" extension, 4139it will first look for a filename with a "F<.pmc>" extension. A file 4140with this extension is assumed to be Perl bytecode generated by 4141L<B::Bytecode|B::Bytecode>. If this file is found, and it's modification 4142time is newer than a coinciding "F<.pm>" non-compiled file, it will be 4143loaded in place of that non-compiled file ending in a "F<.pm>" extension. 4144 4145You can also insert hooks into the import facility, by putting directly 4146Perl code into the @INC array. There are three forms of hooks: subroutine 4147references, array references and blessed objects. 4148 4149Subroutine references are the simplest case. When the inclusion system 4150walks through @INC and encounters a subroutine, this subroutine gets 4151called with two parameters, the first being a reference to itself, and the 4152second the name of the file to be included (e.g. "F<Foo/Bar.pm>"). The 4153subroutine should return C<undef> or a filehandle, from which the file to 4154include will be read. If C<undef> is returned, C<require> will look at 4155the remaining elements of @INC. 4156 4157If the hook is an array reference, its first element must be a subroutine 4158reference. This subroutine is called as above, but the first parameter is 4159the array reference. This enables to pass indirectly some arguments to 4160the subroutine. 4161 4162In other words, you can write: 4163 4164 push @INC, \&my_sub; 4165 sub my_sub { 4166 my ($coderef, $filename) = @_; # $coderef is \&my_sub 4167 ... 4168 } 4169 4170or: 4171 4172 push @INC, [ \&my_sub, $x, $y, ... ]; 4173 sub my_sub { 4174 my ($arrayref, $filename) = @_; 4175 # Retrieve $x, $y, ... 4176 my @parameters = @$arrayref[1..$#$arrayref]; 4177 ... 4178 } 4179 4180If the hook is an object, it must provide an INC method, that will be 4181called as above, the first parameter being the object itself. (Note that 4182you must fully qualify the sub's name, as it is always forced into package 4183C<main>.) Here is a typical code layout: 4184 4185 # In Foo.pm 4186 package Foo; 4187 sub new { ... } 4188 sub Foo::INC { 4189 my ($self, $filename) = @_; 4190 ... 4191 } 4192 4193 # In the main program 4194 push @INC, new Foo(...); 4195 4196Note that these hooks are also permitted to set the %INC entry 4197corresponding to the files they have loaded. See L<perlvar/%INC>. 4198 4199For a yet-more-powerful import facility, see L</use> and L<perlmod>. 4200 4201=item reset EXPR 4202 4203=item reset 4204 4205Generally used in a C<continue> block at the end of a loop to clear 4206variables and reset C<??> searches so that they work again. The 4207expression is interpreted as a list of single characters (hyphens 4208allowed for ranges). All variables and arrays beginning with one of 4209those letters are reset to their pristine state. If the expression is 4210omitted, one-match searches (C<?pattern?>) are reset to match again. Resets 4211only variables or searches in the current package. Always returns 42121. Examples: 4213 4214 reset 'X'; # reset all X variables 4215 reset 'a-z'; # reset lower case variables 4216 reset; # just reset ?one-time? searches 4217 4218Resetting C<"A-Z"> is not recommended because you'll wipe out your 4219C<@ARGV> and C<@INC> arrays and your C<%ENV> hash. Resets only package 4220variables--lexical variables are unaffected, but they clean themselves 4221up on scope exit anyway, so you'll probably want to use them instead. 4222See L</my>. 4223 4224=item return EXPR 4225 4226=item return 4227 4228Returns from a subroutine, C<eval>, or C<do FILE> with the value 4229given in EXPR. Evaluation of EXPR may be in list, scalar, or void 4230context, depending on how the return value will be used, and the context 4231may vary from one execution to the next (see C<wantarray>). If no EXPR 4232is given, returns an empty list in list context, the undefined value in 4233scalar context, and (of course) nothing at all in a void context. 4234 4235(Note that in the absence of an explicit C<return>, a subroutine, eval, 4236or do FILE will automatically return the value of the last expression 4237evaluated.) 4238 4239=item reverse LIST 4240 4241In list context, returns a list value consisting of the elements 4242of LIST in the opposite order. In scalar context, concatenates the 4243elements of LIST and returns a string value with all characters 4244in the opposite order. 4245 4246 print reverse <>; # line tac, last line first 4247 4248 undef $/; # for efficiency of <> 4249 print scalar reverse <>; # character tac, last line tsrif 4250 4251Used without arguments in scalar context, reverse() reverses C<$_>. 4252 4253This operator is also handy for inverting a hash, although there are some 4254caveats. If a value is duplicated in the original hash, only one of those 4255can be represented as a key in the inverted hash. Also, this has to 4256unwind one hash and build a whole new one, which may take some time 4257on a large hash, such as from a DBM file. 4258 4259 %by_name = reverse %by_address; # Invert the hash 4260 4261=item rewinddir DIRHANDLE 4262 4263Sets the current position to the beginning of the directory for the 4264C<readdir> routine on DIRHANDLE. 4265 4266=item rindex STR,SUBSTR,POSITION 4267 4268=item rindex STR,SUBSTR 4269 4270Works just like index() except that it returns the position of the LAST 4271occurrence of SUBSTR in STR. If POSITION is specified, returns the 4272last occurrence at or before that position. 4273 4274=item rmdir FILENAME 4275 4276=item rmdir 4277 4278Deletes the directory specified by FILENAME if that directory is 4279empty. If it succeeds it returns true, otherwise it returns false and 4280sets C<$!> (errno). If FILENAME is omitted, uses C<$_>. 4281 4282=item s/// 4283 4284The substitution operator. See L<perlop>. 4285 4286=item scalar EXPR 4287 4288Forces EXPR to be interpreted in scalar context and returns the value 4289of EXPR. 4290 4291 @counts = ( scalar @a, scalar @b, scalar @c ); 4292 4293There is no equivalent operator to force an expression to 4294be interpolated in list context because in practice, this is never 4295needed. If you really wanted to do so, however, you could use 4296the construction C<@{[ (some expression) ]}>, but usually a simple 4297C<(some expression)> suffices. 4298 4299Because C<scalar> is unary operator, if you accidentally use for EXPR a 4300parenthesized list, this behaves as a scalar comma expression, evaluating 4301all but the last element in void context and returning the final element 4302evaluated in scalar context. This is seldom what you want. 4303 4304The following single statement: 4305 4306 print uc(scalar(&foo,$bar)),$baz; 4307 4308is the moral equivalent of these two: 4309 4310 &foo; 4311 print(uc($bar),$baz); 4312 4313See L<perlop> for more details on unary operators and the comma operator. 4314 4315=item seek FILEHANDLE,POSITION,WHENCE 4316 4317Sets FILEHANDLE's position, just like the C<fseek> call of C<stdio>. 4318FILEHANDLE may be an expression whose value gives the name of the 4319filehandle. The values for WHENCE are C<0> to set the new position 4320I<in bytes> to POSITION, C<1> to set it to the current position plus 4321POSITION, and C<2> to set it to EOF plus POSITION (typically 4322negative). For WHENCE you may use the constants C<SEEK_SET>, 4323C<SEEK_CUR>, and C<SEEK_END> (start of the file, current position, end 4324of the file) from the Fcntl module. Returns C<1> upon success, C<0> 4325otherwise. 4326 4327Note the I<in bytes>: even if the filehandle has been set to 4328operate on characters (for example by using the C<:utf8> open 4329layer), tell() will return byte offsets, not character offsets 4330(because implementing that would render seek() and tell() rather slow). 4331 4332If you want to position file for C<sysread> or C<syswrite>, don't use 4333C<seek>--buffering makes its effect on the file's system position 4334unpredictable and non-portable. Use C<sysseek> instead. 4335 4336Due to the rules and rigors of ANSI C, on some systems you have to do a 4337seek whenever you switch between reading and writing. Amongst other 4338things, this may have the effect of calling stdio's clearerr(3). 4339A WHENCE of C<1> (C<SEEK_CUR>) is useful for not moving the file position: 4340 4341 seek(TEST,0,1); 4342 4343This is also useful for applications emulating C<tail -f>. Once you hit 4344EOF on your read, and then sleep for a while, you might have to stick in a 4345seek() to reset things. The C<seek> doesn't change the current position, 4346but it I<does> clear the end-of-file condition on the handle, so that the 4347next C<< <FILE> >> makes Perl try again to read something. We hope. 4348 4349If that doesn't work (some IO implementations are particularly 4350cantankerous), then you may need something more like this: 4351 4352 for (;;) { 4353 for ($curpos = tell(FILE); $_ = <FILE>; 4354 $curpos = tell(FILE)) { 4355 # search for some stuff and put it into files 4356 } 4357 sleep($for_a_while); 4358 seek(FILE, $curpos, 0); 4359 } 4360 4361=item seekdir DIRHANDLE,POS 4362 4363Sets the current position for the C<readdir> routine on DIRHANDLE. POS 4364must be a value returned by C<telldir>. Has the same caveats about 4365possible directory compaction as the corresponding system library 4366routine. 4367 4368=item select FILEHANDLE 4369 4370=item select 4371 4372Returns the currently selected filehandle. Sets the current default 4373filehandle for output, if FILEHANDLE is supplied. This has two 4374effects: first, a C<write> or a C<print> without a filehandle will 4375default to this FILEHANDLE. Second, references to variables related to 4376output will refer to this output channel. For example, if you have to 4377set the top of form format for more than one output channel, you might 4378do the following: 4379 4380 select(REPORT1); 4381 $^ = 'report1_top'; 4382 select(REPORT2); 4383 $^ = 'report2_top'; 4384 4385FILEHANDLE may be an expression whose value gives the name of the 4386actual filehandle. Thus: 4387 4388 $oldfh = select(STDERR); $| = 1; select($oldfh); 4389 4390Some programmers may prefer to think of filehandles as objects with 4391methods, preferring to write the last example as: 4392 4393 use IO::Handle; 4394 STDERR->autoflush(1); 4395 4396=item select RBITS,WBITS,EBITS,TIMEOUT 4397 4398This calls the select(2) system call with the bit masks specified, which 4399can be constructed using C<fileno> and C<vec>, along these lines: 4400 4401 $rin = $win = $ein = ''; 4402 vec($rin,fileno(STDIN),1) = 1; 4403 vec($win,fileno(STDOUT),1) = 1; 4404 $ein = $rin | $win; 4405 4406If you want to select on many filehandles you might wish to write a 4407subroutine: 4408 4409 sub fhbits { 4410 my(@fhlist) = split(' ',$_[0]); 4411 my($bits); 4412 for (@fhlist) { 4413 vec($bits,fileno($_),1) = 1; 4414 } 4415 $bits; 4416 } 4417 $rin = fhbits('STDIN TTY SOCK'); 4418 4419The usual idiom is: 4420 4421 ($nfound,$timeleft) = 4422 select($rout=$rin, $wout=$win, $eout=$ein, $timeout); 4423 4424or to block until something becomes ready just do this 4425 4426 $nfound = select($rout=$rin, $wout=$win, $eout=$ein, undef); 4427 4428Most systems do not bother to return anything useful in $timeleft, so 4429calling select() in scalar context just returns $nfound. 4430 4431Any of the bit masks can also be undef. The timeout, if specified, is 4432in seconds, which may be fractional. Note: not all implementations are 4433capable of returning the $timeleft. If not, they always return 4434$timeleft equal to the supplied $timeout. 4435 4436You can effect a sleep of 250 milliseconds this way: 4437 4438 select(undef, undef, undef, 0.25); 4439 4440Note that whether C<select> gets restarted after signals (say, SIGALRM) 4441is implementation-dependent. 4442 4443B<WARNING>: One should not attempt to mix buffered I/O (like C<read> 4444or <FH>) with C<select>, except as permitted by POSIX, and even 4445then only on POSIX systems. You have to use C<sysread> instead. 4446 4447=item semctl ID,SEMNUM,CMD,ARG 4448 4449Calls the System V IPC function C<semctl>. You'll probably have to say 4450 4451 use IPC::SysV; 4452 4453first to get the correct constant definitions. If CMD is IPC_STAT or 4454GETALL, then ARG must be a variable which will hold the returned 4455semid_ds structure or semaphore value array. Returns like C<ioctl>: 4456the undefined value for error, "C<0 but true>" for zero, or the actual 4457return value otherwise. The ARG must consist of a vector of native 4458short integers, which may be created with C<pack("s!",(0)x$nsem)>. 4459See also L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::Semaphore> 4460documentation. 4461 4462=item semget KEY,NSEMS,FLAGS 4463 4464Calls the System V IPC function semget. Returns the semaphore id, or 4465the undefined value if there is an error. See also 4466L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::SysV::Semaphore> 4467documentation. 4468 4469=item semop KEY,OPSTRING 4470 4471Calls the System V IPC function semop to perform semaphore operations 4472such as signalling and waiting. OPSTRING must be a packed array of 4473semop structures. Each semop structure can be generated with 4474C<pack("s!3", $semnum, $semop, $semflag)>. The number of semaphore 4475operations is implied by the length of OPSTRING. Returns true if 4476successful, or false if there is an error. As an example, the 4477following code waits on semaphore $semnum of semaphore id $semid: 4478 4479 $semop = pack("s!3", $semnum, -1, 0); 4480 die "Semaphore trouble: $!\n" unless semop($semid, $semop); 4481 4482To signal the semaphore, replace C<-1> with C<1>. See also 4483L<perlipc/"SysV IPC">, C<IPC::SysV>, and C<IPC::SysV::Semaphore> 4484documentation. 4485 4486=item send SOCKET,MSG,FLAGS,TO 4487 4488=item send SOCKET,MSG,FLAGS 4489 4490Sends a message on a socket. Attempts to send the scalar MSG to the 4491SOCKET filehandle. Takes the same flags as the system call of the 4492same name. On unconnected sockets you must specify a destination to 4493send TO, in which case it does a C C<sendto>. Returns the number of 4494characters sent, or the undefined value if there is an error. The C 4495system call sendmsg(2) is currently unimplemented. See 4496L<perlipc/"UDP: Message Passing"> for examples. 4497 4498Note the I<characters>: depending on the status of the socket, either 4499(8-bit) bytes or characters are sent. By default all sockets operate 4500on bytes, but for example if the socket has been changed using 4501binmode() to operate with the C<:utf8> I/O layer (see L</open>, or the 4502C<open> pragma, L<open>), the I/O will operate on UTF-8 encoded 4503Unicode characters, not bytes. Similarly for the C<:encoding> pragma: 4504in that case pretty much any characters can be sent. 4505 4506=item setpgrp PID,PGRP 4507 4508Sets the current process group for the specified PID, C<0> for the current 4509process. Will produce a fatal error if used on a machine that doesn't 4510implement POSIX setpgid(2) or BSD setpgrp(2). If the arguments are omitted, 4511it defaults to C<0,0>. Note that the BSD 4.2 version of C<setpgrp> does not 4512accept any arguments, so only C<setpgrp(0,0)> is portable. See also 4513C<POSIX::setsid()>. 4514 4515=item setpriority WHICH,WHO,PRIORITY 4516 4517Sets the current priority for a process, a process group, or a user. 4518(See setpriority(2).) Will produce a fatal error if used on a machine 4519that doesn't implement setpriority(2). 4520 4521=item setsockopt SOCKET,LEVEL,OPTNAME,OPTVAL 4522 4523Sets the socket option requested. Returns undefined if there is an 4524error. OPTVAL may be specified as C<undef> if you don't want to pass an 4525argument. 4526 4527=item shift ARRAY 4528 4529=item shift 4530 4531Shifts the first value of the array off and returns it, shortening the 4532array by 1 and moving everything down. If there are no elements in the 4533array, returns the undefined value. If ARRAY is omitted, shifts the 4534C<@_> array within the lexical scope of subroutines and formats, and the 4535C<@ARGV> array at file scopes or within the lexical scopes established by 4536the C<eval ''>, C<BEGIN {}>, C<INIT {}>, C<CHECK {}>, and C<END {}> 4537constructs. 4538 4539See also C<unshift>, C<push>, and C<pop>. C<shift> and C<unshift> do the 4540same thing to the left end of an array that C<pop> and C<push> do to the 4541right end. 4542 4543=item shmctl ID,CMD,ARG 4544 4545Calls the System V IPC function shmctl. You'll probably have to say 4546 4547 use IPC::SysV; 4548 4549first to get the correct constant definitions. If CMD is C<IPC_STAT>, 4550then ARG must be a variable which will hold the returned C<shmid_ds> 4551structure. Returns like ioctl: the undefined value for error, "C<0> but 4552true" for zero, or the actual return value otherwise. 4553See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation. 4554 4555=item shmget KEY,SIZE,FLAGS 4556 4557Calls the System V IPC function shmget. Returns the shared memory 4558segment id, or the undefined value if there is an error. 4559See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation. 4560 4561=item shmread ID,VAR,POS,SIZE 4562 4563=item shmwrite ID,STRING,POS,SIZE 4564 4565Reads or writes the System V shared memory segment ID starting at 4566position POS for size SIZE by attaching to it, copying in/out, and 4567detaching from it. When reading, VAR must be a variable that will 4568hold the data read. When writing, if STRING is too long, only SIZE 4569bytes are used; if STRING is too short, nulls are written to fill out 4570SIZE bytes. Return true if successful, or false if there is an error. 4571shmread() taints the variable. See also L<perlipc/"SysV IPC">, 4572C<IPC::SysV> documentation, and the C<IPC::Shareable> module from CPAN. 4573 4574=item shutdown SOCKET,HOW 4575 4576Shuts down a socket connection in the manner indicated by HOW, which 4577has the same interpretation as in the system call of the same name. 4578 4579 shutdown(SOCKET, 0); # I/we have stopped reading data 4580 shutdown(SOCKET, 1); # I/we have stopped writing data 4581 shutdown(SOCKET, 2); # I/we have stopped using this socket 4582 4583This is useful with sockets when you want to tell the other 4584side you're done writing but not done reading, or vice versa. 4585It's also a more insistent form of close because it also 4586disables the file descriptor in any forked copies in other 4587processes. 4588 4589=item sin EXPR 4590 4591=item sin 4592 4593Returns the sine of EXPR (expressed in radians). If EXPR is omitted, 4594returns sine of C<$_>. 4595 4596For the inverse sine operation, you may use the C<Math::Trig::asin> 4597function, or use this relation: 4598 4599 sub asin { atan2($_[0], sqrt(1 - $_[0] * $_[0])) } 4600 4601=item sleep EXPR 4602 4603=item sleep 4604 4605Causes the script to sleep for EXPR seconds, or forever if no EXPR. 4606May be interrupted if the process receives a signal such as C<SIGALRM>. 4607Returns the number of seconds actually slept. You probably cannot 4608mix C<alarm> and C<sleep> calls, because C<sleep> is often implemented 4609using C<alarm>. 4610 4611On some older systems, it may sleep up to a full second less than what 4612you requested, depending on how it counts seconds. Most modern systems 4613always sleep the full amount. They may appear to sleep longer than that, 4614however, because your process might not be scheduled right away in a 4615busy multitasking system. 4616 4617For delays of finer granularity than one second, you may use Perl's 4618C<syscall> interface to access setitimer(2) if your system supports 4619it, or else see L</select> above. The Time::HiRes module (from CPAN, 4620and starting from Perl 5.8 part of the standard distribution) may also 4621help. 4622 4623See also the POSIX module's C<pause> function. 4624 4625=item socket SOCKET,DOMAIN,TYPE,PROTOCOL 4626 4627Opens a socket of the specified kind and attaches it to filehandle 4628SOCKET. DOMAIN, TYPE, and PROTOCOL are specified the same as for 4629the system call of the same name. You should C<use Socket> first 4630to get the proper definitions imported. See the examples in 4631L<perlipc/"Sockets: Client/Server Communication">. 4632 4633On systems that support a close-on-exec flag on files, the flag will 4634be set for the newly opened file descriptor, as determined by the 4635value of $^F. See L<perlvar/$^F>. 4636 4637=item socketpair SOCKET1,SOCKET2,DOMAIN,TYPE,PROTOCOL 4638 4639Creates an unnamed pair of sockets in the specified domain, of the 4640specified type. DOMAIN, TYPE, and PROTOCOL are specified the same as 4641for the system call of the same name. If unimplemented, yields a fatal 4642error. Returns true if successful. 4643 4644On systems that support a close-on-exec flag on files, the flag will 4645be set for the newly opened file descriptors, as determined by the value 4646of $^F. See L<perlvar/$^F>. 4647 4648Some systems defined C<pipe> in terms of C<socketpair>, in which a call 4649to C<pipe(Rdr, Wtr)> is essentially: 4650 4651 use Socket; 4652 socketpair(Rdr, Wtr, AF_UNIX, SOCK_STREAM, PF_UNSPEC); 4653 shutdown(Rdr, 1); # no more writing for reader 4654 shutdown(Wtr, 0); # no more reading for writer 4655 4656See L<perlipc> for an example of socketpair use. Perl 5.8 and later will 4657emulate socketpair using IP sockets to localhost if your system implements 4658sockets but not socketpair. 4659 4660=item sort SUBNAME LIST 4661 4662=item sort BLOCK LIST 4663 4664=item sort LIST 4665 4666In list context, this sorts the LIST and returns the sorted list value. 4667In scalar context, the behaviour of C<sort()> is undefined. 4668 4669If SUBNAME or BLOCK is omitted, C<sort>s in standard string comparison 4670order. If SUBNAME is specified, it gives the name of a subroutine 4671that returns an integer less than, equal to, or greater than C<0>, 4672depending on how the elements of the list are to be ordered. (The C<< 4673<=> >> and C<cmp> operators are extremely useful in such routines.) 4674SUBNAME may be a scalar variable name (unsubscripted), in which case 4675the value provides the name of (or a reference to) the actual 4676subroutine to use. In place of a SUBNAME, you can provide a BLOCK as 4677an anonymous, in-line sort subroutine. 4678 4679If the subroutine's prototype is C<($$)>, the elements to be compared 4680are passed by reference in C<@_>, as for a normal subroutine. This is 4681slower than unprototyped subroutines, where the elements to be 4682compared are passed into the subroutine 4683as the package global variables $a and $b (see example below). Note that 4684in the latter case, it is usually counter-productive to declare $a and 4685$b as lexicals. 4686 4687In either case, the subroutine may not be recursive. The values to be 4688compared are always passed by reference, so don't modify them. 4689 4690You also cannot exit out of the sort block or subroutine using any of the 4691loop control operators described in L<perlsyn> or with C<goto>. 4692 4693When C<use locale> is in effect, C<sort LIST> sorts LIST according to the 4694current collation locale. See L<perllocale>. 4695 4696Perl 5.6 and earlier used a quicksort algorithm to implement sort. 4697That algorithm was not stable, and I<could> go quadratic. (A I<stable> sort 4698preserves the input order of elements that compare equal. Although 4699quicksort's run time is O(NlogN) when averaged over all arrays of 4700length N, the time can be O(N**2), I<quadratic> behavior, for some 4701inputs.) In 5.7, the quicksort implementation was replaced with 4702a stable mergesort algorithm whose worst case behavior is O(NlogN). 4703But benchmarks indicated that for some inputs, on some platforms, 4704the original quicksort was faster. 5.8 has a sort pragma for 4705limited control of the sort. Its rather blunt control of the 4706underlying algorithm may not persist into future perls, but the 4707ability to characterize the input or output in implementation 4708independent ways quite probably will. See L<sort>. 4709 4710Examples: 4711 4712 # sort lexically 4713 @articles = sort @files; 4714 4715 # same thing, but with explicit sort routine 4716 @articles = sort {$a cmp $b} @files; 4717 4718 # now case-insensitively 4719 @articles = sort {uc($a) cmp uc($b)} @files; 4720 4721 # same thing in reversed order 4722 @articles = sort {$b cmp $a} @files; 4723 4724 # sort numerically ascending 4725 @articles = sort {$a <=> $b} @files; 4726 4727 # sort numerically descending 4728 @articles = sort {$b <=> $a} @files; 4729 4730 # this sorts the %age hash by value instead of key 4731 # using an in-line function 4732 @eldest = sort { $age{$b} <=> $age{$a} } keys %age; 4733 4734 # sort using explicit subroutine name 4735 sub byage { 4736 $age{$a} <=> $age{$b}; # presuming numeric 4737 } 4738 @sortedclass = sort byage @class; 4739 4740 sub backwards { $b cmp $a } 4741 @harry = qw(dog cat x Cain Abel); 4742 @george = qw(gone chased yz Punished Axed); 4743 print sort @harry; 4744 # prints AbelCaincatdogx 4745 print sort backwards @harry; 4746 # prints xdogcatCainAbel 4747 print sort @george, 'to', @harry; 4748 # prints AbelAxedCainPunishedcatchaseddoggonetoxyz 4749 4750 # inefficiently sort by descending numeric compare using 4751 # the first integer after the first = sign, or the 4752 # whole record case-insensitively otherwise 4753 4754 @new = sort { 4755 ($b =~ /=(\d+)/)[0] <=> ($a =~ /=(\d+)/)[0] 4756 || 4757 uc($a) cmp uc($b) 4758 } @old; 4759 4760 # same thing, but much more efficiently; 4761 # we'll build auxiliary indices instead 4762 # for speed 4763 @nums = @caps = (); 4764 for (@old) { 4765 push @nums, /=(\d+)/; 4766 push @caps, uc($_); 4767 } 4768 4769 @new = @old[ sort { 4770 $nums[$b] <=> $nums[$a] 4771 || 4772 $caps[$a] cmp $caps[$b] 4773 } 0..$#old 4774 ]; 4775 4776 # same thing, but without any temps 4777 @new = map { $_->[0] } 4778 sort { $b->[1] <=> $a->[1] 4779 || 4780 $a->[2] cmp $b->[2] 4781 } map { [$_, /=(\d+)/, uc($_)] } @old; 4782 4783 # using a prototype allows you to use any comparison subroutine 4784 # as a sort subroutine (including other package's subroutines) 4785 package other; 4786 sub backwards ($$) { $_[1] cmp $_[0]; } # $a and $b are not set here 4787 4788 package main; 4789 @new = sort other::backwards @old; 4790 4791 # guarantee stability, regardless of algorithm 4792 use sort 'stable'; 4793 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old; 4794 4795 # force use of mergesort (not portable outside Perl 5.8) 4796 use sort '_mergesort'; # note discouraging _ 4797 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old; 4798 4799If you're using strict, you I<must not> declare $a 4800and $b as lexicals. They are package globals. That means 4801if you're in the C<main> package and type 4802 4803 @articles = sort {$b <=> $a} @files; 4804 4805then C<$a> and C<$b> are C<$main::a> and C<$main::b> (or C<$::a> and C<$::b>), 4806but if you're in the C<FooPack> package, it's the same as typing 4807 4808 @articles = sort {$FooPack::b <=> $FooPack::a} @files; 4809 4810The comparison function is required to behave. If it returns 4811inconsistent results (sometimes saying C<$x[1]> is less than C<$x[2]> and 4812sometimes saying the opposite, for example) the results are not 4813well-defined. 4814 4815Because C<< <=> >> returns C<undef> when either operand is C<NaN> 4816(not-a-number), and because C<sort> will trigger a fatal error unless the 4817result of a comparison is defined, when sorting with a comparison function 4818like C<< $a <=> $b >>, be careful about lists that might contain a C<NaN>. 4819The following example takes advantage of the fact that C<NaN != NaN> to 4820eliminate any C<NaN>s from the input. 4821 4822 @result = sort { $a <=> $b } grep { $_ == $_ } @input; 4823 4824=item splice ARRAY,OFFSET,LENGTH,LIST 4825 4826=item splice ARRAY,OFFSET,LENGTH 4827 4828=item splice ARRAY,OFFSET 4829 4830=item splice ARRAY 4831 4832Removes the elements designated by OFFSET and LENGTH from an array, and 4833replaces them with the elements of LIST, if any. In list context, 4834returns the elements removed from the array. In scalar context, 4835returns the last element removed, or C<undef> if no elements are 4836removed. The array grows or shrinks as necessary. 4837If OFFSET is negative then it starts that far from the end of the array. 4838If LENGTH is omitted, removes everything from OFFSET onward. 4839If LENGTH is negative, removes the elements from OFFSET onward 4840except for -LENGTH elements at the end of the array. 4841If both OFFSET and LENGTH are omitted, removes everything. If OFFSET is 4842past the end of the array, perl issues a warning, and splices at the 4843end of the array. 4844 4845The following equivalences hold (assuming C<< $[ == 0 and $#a >= $i >> ) 4846 4847 push(@a,$x,$y) splice(@a,@a,0,$x,$y) 4848 pop(@a) splice(@a,-1) 4849 shift(@a) splice(@a,0,1) 4850 unshift(@a,$x,$y) splice(@a,0,0,$x,$y) 4851 $a[$i] = $y splice(@a,$i,1,$y) 4852 4853Example, assuming array lengths are passed before arrays: 4854 4855 sub aeq { # compare two list values 4856 my(@a) = splice(@_,0,shift); 4857 my(@b) = splice(@_,0,shift); 4858 return 0 unless @a == @b; # same len? 4859 while (@a) { 4860 return 0 if pop(@a) ne pop(@b); 4861 } 4862 return 1; 4863 } 4864 if (&aeq($len,@foo[1..$len],0+@bar,@bar)) { ... } 4865 4866=item split /PATTERN/,EXPR,LIMIT 4867 4868=item split /PATTERN/,EXPR 4869 4870=item split /PATTERN/ 4871 4872=item split 4873 4874Splits the string EXPR into a list of strings and returns that list. By 4875default, empty leading fields are preserved, and empty trailing ones are 4876deleted. 4877 4878In scalar context, returns the number of fields found and splits into 4879the C<@_> array. Use of split in scalar context is deprecated, however, 4880because it clobbers your subroutine arguments. 4881 4882If EXPR is omitted, splits the C<$_> string. If PATTERN is also omitted, 4883splits on whitespace (after skipping any leading whitespace). Anything 4884matching PATTERN is taken to be a delimiter separating the fields. (Note 4885that the delimiter may be longer than one character.) 4886 4887If LIMIT is specified and positive, it represents the maximum number 4888of fields the EXPR will be split into, though the actual number of 4889fields returned depends on the number of times PATTERN matches within 4890EXPR. If LIMIT is unspecified or zero, trailing null fields are 4891stripped (which potential users of C<pop> would do well to remember). 4892If LIMIT is negative, it is treated as if an arbitrarily large LIMIT 4893had been specified. Note that splitting an EXPR that evaluates to the 4894empty string always returns the empty list, regardless of the LIMIT 4895specified. 4896 4897A pattern matching the null string (not to be confused with 4898a null pattern C<//>, which is just one member of the set of patterns 4899matching a null string) will split the value of EXPR into separate 4900characters at each point it matches that way. For example: 4901 4902 print join(':', split(/ */, 'hi there')); 4903 4904produces the output 'h:i:t:h:e:r:e'. 4905 4906Using the empty pattern C<//> specifically matches the null string, and is 4907not be confused with the use of C<//> to mean "the last successful pattern 4908match". 4909 4910Empty leading (or trailing) fields are produced when there are positive width 4911matches at the beginning (or end) of the string; a zero-width match at the 4912beginning (or end) of the string does not produce an empty field. For 4913example: 4914 4915 print join(':', split(/(?=\w)/, 'hi there!')); 4916 4917produces the output 'h:i :t:h:e:r:e!'. 4918 4919The LIMIT parameter can be used to split a line partially 4920 4921 ($login, $passwd, $remainder) = split(/:/, $_, 3); 4922 4923When assigning to a list, if LIMIT is omitted, or zero, Perl supplies 4924a LIMIT one larger than the number of variables in the list, to avoid 4925unnecessary work. For the list above LIMIT would have been 4 by 4926default. In time critical applications it behooves you not to split 4927into more fields than you really need. 4928 4929If the PATTERN contains parentheses, additional list elements are 4930created from each matching substring in the delimiter. 4931 4932 split(/([,-])/, "1-10,20", 3); 4933 4934produces the list value 4935 4936 (1, '-', 10, ',', 20) 4937 4938If you had the entire header of a normal Unix email message in $header, 4939you could split it up into fields and their values this way: 4940 4941 $header =~ s/\n\s+/ /g; # fix continuation lines 4942 %hdrs = (UNIX_FROM => split /^(\S*?):\s*/m, $header); 4943 4944The pattern C</PATTERN/> may be replaced with an expression to specify 4945patterns that vary at runtime. (To do runtime compilation only once, 4946use C</$variable/o>.) 4947 4948As a special case, specifying a PATTERN of space (S<C<' '>>) will split on 4949white space just as C<split> with no arguments does. Thus, S<C<split(' ')>> can 4950be used to emulate B<awk>'s default behavior, whereas S<C<split(/ /)>> 4951will give you as many null initial fields as there are leading spaces. 4952A C<split> on C</\s+/> is like a S<C<split(' ')>> except that any leading 4953whitespace produces a null first field. A C<split> with no arguments 4954really does a S<C<split(' ', $_)>> internally. 4955 4956A PATTERN of C</^/> is treated as if it were C</^/m>, since it isn't 4957much use otherwise. 4958 4959Example: 4960 4961 open(PASSWD, '/etc/passwd'); 4962 while (<PASSWD>) { 4963 chomp; 4964 ($login, $passwd, $uid, $gid, 4965 $gcos, $home, $shell) = split(/:/); 4966 #... 4967 } 4968 4969As with regular pattern matching, any capturing parentheses that are not 4970matched in a C<split()> will be set to C<undef> when returned: 4971 4972 @fields = split /(A)|B/, "1A2B3"; 4973 # @fields is (1, 'A', 2, undef, 3) 4974 4975=item sprintf FORMAT, LIST 4976 4977Returns a string formatted by the usual C<printf> conventions of the C 4978library function C<sprintf>. See below for more details 4979and see L<sprintf(3)> or L<printf(3)> on your system for an explanation of 4980the general principles. 4981 4982For example: 4983 4984 # Format number with up to 8 leading zeroes 4985 $result = sprintf("%08d", $number); 4986 4987 # Round number to 3 digits after decimal point 4988 $rounded = sprintf("%.3f", $number); 4989 4990Perl does its own C<sprintf> formatting--it emulates the C 4991function C<sprintf>, but it doesn't use it (except for floating-point 4992numbers, and even then only the standard modifiers are allowed). As a 4993result, any non-standard extensions in your local C<sprintf> are not 4994available from Perl. 4995 4996Unlike C<printf>, C<sprintf> does not do what you probably mean when you 4997pass it an array as your first argument. The array is given scalar context, 4998and instead of using the 0th element of the array as the format, Perl will 4999use the count of elements in the array as the format, which is almost never 5000useful. 5001 5002Perl's C<sprintf> permits the following universally-known conversions: 5003 5004 %% a percent sign 5005 %c a character with the given number 5006 %s a string 5007 %d a signed integer, in decimal 5008 %u an unsigned integer, in decimal 5009 %o an unsigned integer, in octal 5010 %x an unsigned integer, in hexadecimal 5011 %e a floating-point number, in scientific notation 5012 %f a floating-point number, in fixed decimal notation 5013 %g a floating-point number, in %e or %f notation 5014 5015In addition, Perl permits the following widely-supported conversions: 5016 5017 %X like %x, but using upper-case letters 5018 %E like %e, but using an upper-case "E" 5019 %G like %g, but with an upper-case "E" (if applicable) 5020 %b an unsigned integer, in binary 5021 %p a pointer (outputs the Perl value's address in hexadecimal) 5022 %n special: *stores* the number of characters output so far 5023 into the next variable in the parameter list 5024 5025Finally, for backward (and we do mean "backward") compatibility, Perl 5026permits these unnecessary but widely-supported conversions: 5027 5028 %i a synonym for %d 5029 %D a synonym for %ld 5030 %U a synonym for %lu 5031 %O a synonym for %lo 5032 %F a synonym for %f 5033 5034Note that the number of exponent digits in the scientific notation produced 5035by C<%e>, C<%E>, C<%g> and C<%G> for numbers with the modulus of the 5036exponent less than 100 is system-dependent: it may be three or less 5037(zero-padded as necessary). In other words, 1.23 times ten to the 503899th may be either "1.23e99" or "1.23e099". 5039 5040Between the C<%> and the format letter, you may specify a number of 5041additional attributes controlling the interpretation of the format. 5042In order, these are: 5043 5044=over 4 5045 5046=item format parameter index 5047 5048An explicit format parameter index, such as C<2$>. By default sprintf 5049will format the next unused argument in the list, but this allows you 5050to take the arguments out of order. Eg: 5051 5052 printf '%2$d %1$d', 12, 34; # prints "34 12" 5053 printf '%3$d %d %1$d', 1, 2, 3; # prints "3 1 1" 5054 5055=item flags 5056 5057one or more of: 5058 space prefix positive number with a space 5059 + prefix positive number with a plus sign 5060 - left-justify within the field 5061 0 use zeros, not spaces, to right-justify 5062 # prefix non-zero octal with "0", non-zero hex with "0x", 5063 non-zero binary with "0b" 5064 5065For example: 5066 5067 printf '<% d>', 12; # prints "< 12>" 5068 printf '<%+d>', 12; # prints "<+12>" 5069 printf '<%6s>', 12; # prints "< 12>" 5070 printf '<%-6s>', 12; # prints "<12 >" 5071 printf '<%06s>', 12; # prints "<000012>" 5072 printf '<%#x>', 12; # prints "<0xc>" 5073 5074=item vector flag 5075 5076The vector flag C<v>, optionally specifying the join string to use. 5077This flag tells perl to interpret the supplied string as a vector 5078of integers, one for each character in the string, separated by 5079a given string (a dot C<.> by default). This can be useful for 5080displaying ordinal values of characters in arbitrary strings: 5081 5082 printf "version is v%vd\n", $^V; # Perl's version 5083 5084Put an asterisk C<*> before the C<v> to override the string to 5085use to separate the numbers: 5086 5087 printf "address is %*vX\n", ":", $addr; # IPv6 address 5088 printf "bits are %0*v8b\n", " ", $bits; # random bitstring 5089 5090You can also explicitly specify the argument number to use for 5091the join string using eg C<*2$v>: 5092 5093 printf '%*4$vX %*4$vX %*4$vX', @addr[1..3], ":"; # 3 IPv6 addresses 5094 5095=item (minimum) width 5096 5097Arguments are usually formatted to be only as wide as required to 5098display the given value. You can override the width by putting 5099a number here, or get the width from the next argument (with C<*>) 5100or from a specified argument (with eg C<*2$>): 5101 5102 printf '<%s>', "a"; # prints "<a>" 5103 printf '<%6s>', "a"; # prints "< a>" 5104 printf '<%*s>', 6, "a"; # prints "< a>" 5105 printf '<%*2$s>', "a", 6; # prints "< a>" 5106 printf '<%2s>', "long"; # prints "<long>" (does not truncate) 5107 5108If a field width obtained through C<*> is negative, it has the same 5109effect as the C<-> flag: left-justification. 5110 5111=item precision, or maximum width 5112 5113You can specify a precision (for numeric conversions) or a maximum 5114width (for string conversions) by specifying a C<.> followed by a number. 5115For floating point formats, with the exception of 'g' and 'G', this specifies 5116the number of decimal places to show (the default being 6), eg: 5117 5118 # these examples are subject to system-specific variation 5119 printf '<%f>', 1; # prints "<1.000000>" 5120 printf '<%.1f>', 1; # prints "<1.0>" 5121 printf '<%.0f>', 1; # prints "<1>" 5122 printf '<%e>', 10; # prints "<1.000000e+01>" 5123 printf '<%.1e>', 10; # prints "<1.0e+01>" 5124 5125For 'g' and 'G', this specifies the maximum number of digits to show, 5126including prior to the decimal point as well as after it, eg: 5127 5128 # these examples are subject to system-specific variation 5129 printf '<%g>', 1; # prints "<1>" 5130 printf '<%.10g>', 1; # prints "<1>" 5131 printf '<%g>', 100; # prints "<100>" 5132 printf '<%.1g>', 100; # prints "<1e+02>" 5133 printf '<%.2g>', 100.01; # prints "<1e+02>" 5134 printf '<%.5g>', 100.01; # prints "<100.01>" 5135 printf '<%.4g>', 100.01; # prints "<100>" 5136 5137For integer conversions, specifying a precision implies that the 5138output of the number itself should be zero-padded to this width: 5139 5140 printf '<%.6x>', 1; # prints "<000001>" 5141 printf '<%#.6x>', 1; # prints "<0x000001>" 5142 printf '<%-10.6x>', 1; # prints "<000001 >" 5143 5144For string conversions, specifying a precision truncates the string 5145to fit in the specified width: 5146 5147 printf '<%.5s>', "truncated"; # prints "<trunc>" 5148 printf '<%10.5s>', "truncated"; # prints "< trunc>" 5149 5150You can also get the precision from the next argument using C<.*>: 5151 5152 printf '<%.6x>', 1; # prints "<000001>" 5153 printf '<%.*x>', 6, 1; # prints "<000001>" 5154 5155You cannot currently get the precision from a specified number, 5156but it is intended that this will be possible in the future using 5157eg C<.*2$>: 5158 5159 printf '<%.*2$x>', 1, 6; # INVALID, but in future will print "<000001>" 5160 5161=item size 5162 5163For numeric conversions, you can specify the size to interpret the 5164number as using C<l>, C<h>, C<V>, C<q>, C<L>, or C<ll>. For integer 5165conversions (C<d u o x X b i D U O>), numbers are usually assumed to be 5166whatever the default integer size is on your platform (usually 32 or 64 5167bits), but you can override this to use instead one of the standard C types, 5168as supported by the compiler used to build Perl: 5169 5170 l interpret integer as C type "long" or "unsigned long" 5171 h interpret integer as C type "short" or "unsigned short" 5172 q, L or ll interpret integer as C type "long long", "unsigned long long". 5173 or "quads" (typically 64-bit integers) 5174 5175The last will produce errors if Perl does not understand "quads" in your 5176installation. (This requires that either the platform natively supports quads 5177or Perl was specifically compiled to support quads.) You can find out 5178whether your Perl supports quads via L<Config>: 5179 5180 use Config; 5181 ($Config{use64bitint} eq 'define' || $Config{longsize} >= 8) && 5182 print "quads\n"; 5183 5184For floating point conversions (C<e f g E F G>), numbers are usually assumed 5185to be the default floating point size on your platform (double or long double), 5186but you can force 'long double' with C<q>, C<L>, or C<ll> if your 5187platform supports them. You can find out whether your Perl supports long 5188doubles via L<Config>: 5189 5190 use Config; 5191 $Config{d_longdbl} eq 'define' && print "long doubles\n"; 5192 5193You can find out whether Perl considers 'long double' to be the default 5194floating point size to use on your platform via L<Config>: 5195 5196 use Config; 5197 ($Config{uselongdouble} eq 'define') && 5198 print "long doubles by default\n"; 5199 5200It can also be the case that long doubles and doubles are the same thing: 5201 5202 use Config; 5203 ($Config{doublesize} == $Config{longdblsize}) && 5204 print "doubles are long doubles\n"; 5205 5206The size specifier C<V> has no effect for Perl code, but it is supported 5207for compatibility with XS code; it means 'use the standard size for 5208a Perl integer (or floating-point number)', which is already the 5209default for Perl code. 5210 5211=item order of arguments 5212 5213Normally, sprintf takes the next unused argument as the value to 5214format for each format specification. If the format specification 5215uses C<*> to require additional arguments, these are consumed from 5216the argument list in the order in which they appear in the format 5217specification I<before> the value to format. Where an argument is 5218specified using an explicit index, this does not affect the normal 5219order for the arguments (even when the explicitly specified index 5220would have been the next argument in any case). 5221 5222So: 5223 5224 printf '<%*.*s>', $a, $b, $c; 5225 5226would use C<$a> for the width, C<$b> for the precision and C<$c> 5227as the value to format, while: 5228 5229 print '<%*1$.*s>', $a, $b; 5230 5231would use C<$a> for the width and the precision, and C<$b> as the 5232value to format. 5233 5234Here are some more examples - beware that when using an explicit 5235index, the C<$> may need to be escaped: 5236 5237 printf "%2\$d %d\n", 12, 34; # will print "34 12\n" 5238 printf "%2\$d %d %d\n", 12, 34; # will print "34 12 34\n" 5239 printf "%3\$d %d %d\n", 12, 34, 56; # will print "56 12 34\n" 5240 printf "%2\$*3\$d %d\n", 12, 34, 3; # will print " 34 12\n" 5241 5242=back 5243 5244If C<use locale> is in effect, the character used for the decimal 5245point in formatted real numbers is affected by the LC_NUMERIC locale. 5246See L<perllocale>. 5247 5248=item sqrt EXPR 5249 5250=item sqrt 5251 5252Return the square root of EXPR. If EXPR is omitted, returns square 5253root of C<$_>. Only works on non-negative operands, unless you've 5254loaded the standard Math::Complex module. 5255 5256 use Math::Complex; 5257 print sqrt(-2); # prints 1.4142135623731i 5258 5259=item srand EXPR 5260 5261=item srand 5262 5263Sets the random number seed for the C<rand> operator. 5264 5265The point of the function is to "seed" the C<rand> function so that 5266C<rand> can produce a different sequence each time you run your 5267program. 5268 5269If srand() is not called explicitly, it is called implicitly at the 5270first use of the C<rand> operator. However, this was not the case in 5271versions of Perl before 5.004, so if your script will run under older 5272Perl versions, it should call C<srand>. 5273 5274Most programs won't even call srand() at all, except those that 5275need a cryptographically-strong starting point rather than the 5276generally acceptable default, which is based on time of day, 5277process ID, and memory allocation, or the F</dev/urandom> device, 5278if available. 5279 5280You can call srand($seed) with the same $seed to reproduce the 5281I<same> sequence from rand(), but this is usually reserved for 5282generating predictable results for testing or debugging. 5283Otherwise, don't call srand() more than once in your program. 5284 5285Do B<not> call srand() (i.e. without an argument) more than once in 5286a script. The internal state of the random number generator should 5287contain more entropy than can be provided by any seed, so calling 5288srand() again actually I<loses> randomness. 5289 5290Most implementations of C<srand> take an integer and will silently 5291truncate decimal numbers. This means C<srand(42)> will usually 5292produce the same results as C<srand(42.1)>. To be safe, always pass 5293C<srand> an integer. 5294 5295In versions of Perl prior to 5.004 the default seed was just the 5296current C<time>. This isn't a particularly good seed, so many old 5297programs supply their own seed value (often C<time ^ $$> or C<time ^ 5298($$ + ($$ << 15))>), but that isn't necessary any more. 5299 5300Note that you need something much more random than the default seed for 5301cryptographic purposes. Checksumming the compressed output of one or more 5302rapidly changing operating system status programs is the usual method. For 5303example: 5304 5305 srand (time ^ $$ ^ unpack "%L*", `ps axww | gzip`); 5306 5307If you're particularly concerned with this, see the C<Math::TrulyRandom> 5308module in CPAN. 5309 5310Frequently called programs (like CGI scripts) that simply use 5311 5312 time ^ $$ 5313 5314for a seed can fall prey to the mathematical property that 5315 5316 a^b == (a+1)^(b+1) 5317 5318one-third of the time. So don't do that. 5319 5320=item stat FILEHANDLE 5321 5322=item stat EXPR 5323 5324=item stat 5325 5326Returns a 13-element list giving the status info for a file, either 5327the file opened via FILEHANDLE, or named by EXPR. If EXPR is omitted, 5328it stats C<$_>. Returns a null list if the stat fails. Typically used 5329as follows: 5330 5331 ($dev,$ino,$mode,$nlink,$uid,$gid,$rdev,$size, 5332 $atime,$mtime,$ctime,$blksize,$blocks) 5333 = stat($filename); 5334 5335Not all fields are supported on all filesystem types. Here are the 5336meanings of the fields: 5337 5338 0 dev device number of filesystem 5339 1 ino inode number 5340 2 mode file mode (type and permissions) 5341 3 nlink number of (hard) links to the file 5342 4 uid numeric user ID of file's owner 5343 5 gid numeric group ID of file's owner 5344 6 rdev the device identifier (special files only) 5345 7 size total size of file, in bytes 5346 8 atime last access time in seconds since the epoch 5347 9 mtime last modify time in seconds since the epoch 5348 10 ctime inode change time in seconds since the epoch (*) 5349 11 blksize preferred block size for file system I/O 5350 12 blocks actual number of blocks allocated 5351 5352(The epoch was at 00:00 January 1, 1970 GMT.) 5353 5354(*) The ctime field is non-portable. In particular, you cannot expect 5355it to be a "creation time", see L<perlport/"Files and Filesystems"> 5356for details. 5357 5358If C<stat> is passed the special filehandle consisting of an underline, no 5359stat is done, but the current contents of the stat structure from the 5360last C<stat>, C<lstat>, or filetest are returned. Example: 5361 5362 if (-x $file && (($d) = stat(_)) && $d < 0) { 5363 print "$file is executable NFS file\n"; 5364 } 5365 5366(This works on machines only for which the device number is negative 5367under NFS.) 5368 5369Because the mode contains both the file type and its permissions, you 5370should mask off the file type portion and (s)printf using a C<"%o"> 5371if you want to see the real permissions. 5372 5373 $mode = (stat($filename))[2]; 5374 printf "Permissions are %04o\n", $mode & 07777; 5375 5376In scalar context, C<stat> returns a boolean value indicating success 5377or failure, and, if successful, sets the information associated with 5378the special filehandle C<_>. 5379 5380The File::stat module provides a convenient, by-name access mechanism: 5381 5382 use File::stat; 5383 $sb = stat($filename); 5384 printf "File is %s, size is %s, perm %04o, mtime %s\n", 5385 $filename, $sb->size, $sb->mode & 07777, 5386 scalar localtime $sb->mtime; 5387 5388You can import symbolic mode constants (C<S_IF*>) and functions 5389(C<S_IS*>) from the Fcntl module: 5390 5391 use Fcntl ':mode'; 5392 5393 $mode = (stat($filename))[2]; 5394 5395 $user_rwx = ($mode & S_IRWXU) >> 6; 5396 $group_read = ($mode & S_IRGRP) >> 3; 5397 $other_execute = $mode & S_IXOTH; 5398 5399 printf "Permissions are %04o\n", S_IMODE($mode), "\n"; 5400 5401 $is_setuid = $mode & S_ISUID; 5402 $is_setgid = S_ISDIR($mode); 5403 5404You could write the last two using the C<-u> and C<-d> operators. 5405The commonly available C<S_IF*> constants are 5406 5407 # Permissions: read, write, execute, for user, group, others. 5408 5409 S_IRWXU S_IRUSR S_IWUSR S_IXUSR 5410 S_IRWXG S_IRGRP S_IWGRP S_IXGRP 5411 S_IRWXO S_IROTH S_IWOTH S_IXOTH 5412 5413 # Setuid/Setgid/Stickiness/SaveText. 5414 # Note that the exact meaning of these is system dependent. 5415 5416 S_ISUID S_ISGID S_ISVTX S_ISTXT 5417 5418 # File types. Not necessarily all are available on your system. 5419 5420 S_IFREG S_IFDIR S_IFLNK S_IFBLK S_ISCHR S_IFIFO S_IFSOCK S_IFWHT S_ENFMT 5421 5422 # The following are compatibility aliases for S_IRUSR, S_IWUSR, S_IXUSR. 5423 5424 S_IREAD S_IWRITE S_IEXEC 5425 5426and the C<S_IF*> functions are 5427 5428 S_IMODE($mode) the part of $mode containing the permission bits 5429 and the setuid/setgid/sticky bits 5430 5431 S_IFMT($mode) the part of $mode containing the file type 5432 which can be bit-anded with e.g. S_IFREG 5433 or with the following functions 5434 5435 # The operators -f, -d, -l, -b, -c, -p, and -S. 5436 5437 S_ISREG($mode) S_ISDIR($mode) S_ISLNK($mode) 5438 S_ISBLK($mode) S_ISCHR($mode) S_ISFIFO($mode) S_ISSOCK($mode) 5439 5440 # No direct -X operator counterpart, but for the first one 5441 # the -g operator is often equivalent. The ENFMT stands for 5442 # record flocking enforcement, a platform-dependent feature. 5443 5444 S_ISENFMT($mode) S_ISWHT($mode) 5445 5446See your native chmod(2) and stat(2) documentation for more details 5447about the C<S_*> constants. To get status info for a symbolic link 5448instead of the target file behind the link, use the C<lstat> function. 5449 5450=item study SCALAR 5451 5452=item study 5453 5454Takes extra time to study SCALAR (C<$_> if unspecified) in anticipation of 5455doing many pattern matches on the string before it is next modified. 5456This may or may not save time, depending on the nature and number of 5457patterns you are searching on, and on the distribution of character 5458frequencies in the string to be searched--you probably want to compare 5459run times with and without it to see which runs faster. Those loops 5460which scan for many short constant strings (including the constant 5461parts of more complex patterns) will benefit most. You may have only 5462one C<study> active at a time--if you study a different scalar the first 5463is "unstudied". (The way C<study> works is this: a linked list of every 5464character in the string to be searched is made, so we know, for 5465example, where all the C<'k'> characters are. From each search string, 5466the rarest character is selected, based on some static frequency tables 5467constructed from some C programs and English text. Only those places 5468that contain this "rarest" character are examined.) 5469 5470For example, here is a loop that inserts index producing entries 5471before any line containing a certain pattern: 5472 5473 while (<>) { 5474 study; 5475 print ".IX foo\n" if /\bfoo\b/; 5476 print ".IX bar\n" if /\bbar\b/; 5477 print ".IX blurfl\n" if /\bblurfl\b/; 5478 # ... 5479 print; 5480 } 5481 5482In searching for C</\bfoo\b/>, only those locations in C<$_> that contain C<f> 5483will be looked at, because C<f> is rarer than C<o>. In general, this is 5484a big win except in pathological cases. The only question is whether 5485it saves you more time than it took to build the linked list in the 5486first place. 5487 5488Note that if you have to look for strings that you don't know till 5489runtime, you can build an entire loop as a string and C<eval> that to 5490avoid recompiling all your patterns all the time. Together with 5491undefining C<$/> to input entire files as one record, this can be very 5492fast, often faster than specialized programs like fgrep(1). The following 5493scans a list of files (C<@files>) for a list of words (C<@words>), and prints 5494out the names of those files that contain a match: 5495 5496 $search = 'while (<>) { study;'; 5497 foreach $word (@words) { 5498 $search .= "++\$seen{\$ARGV} if /\\b$word\\b/;\n"; 5499 } 5500 $search .= "}"; 5501 @ARGV = @files; 5502 undef $/; 5503 eval $search; # this screams 5504 $/ = "\n"; # put back to normal input delimiter 5505 foreach $file (sort keys(%seen)) { 5506 print $file, "\n"; 5507 } 5508 5509=item sub NAME BLOCK 5510 5511=item sub NAME (PROTO) BLOCK 5512 5513=item sub NAME : ATTRS BLOCK 5514 5515=item sub NAME (PROTO) : ATTRS BLOCK 5516 5517This is subroutine definition, not a real function I<per se>. 5518Without a BLOCK it's just a forward declaration. Without a NAME, 5519it's an anonymous function declaration, and does actually return 5520a value: the CODE ref of the closure you just created. 5521 5522See L<perlsub> and L<perlref> for details about subroutines and 5523references, and L<attributes> and L<Attribute::Handlers> for more 5524information about attributes. 5525 5526=item substr EXPR,OFFSET,LENGTH,REPLACEMENT 5527 5528=item substr EXPR,OFFSET,LENGTH 5529 5530=item substr EXPR,OFFSET 5531 5532Extracts a substring out of EXPR and returns it. First character is at 5533offset C<0>, or whatever you've set C<$[> to (but don't do that). 5534If OFFSET is negative (or more precisely, less than C<$[>), starts 5535that far from the end of the string. If LENGTH is omitted, returns 5536everything to the end of the string. If LENGTH is negative, leaves that 5537many characters off the end of the string. 5538 5539You can use the substr() function as an lvalue, in which case EXPR 5540must itself be an lvalue. If you assign something shorter than LENGTH, 5541the string will shrink, and if you assign something longer than LENGTH, 5542the string will grow to accommodate it. To keep the string the same 5543length you may need to pad or chop your value using C<sprintf>. 5544 5545If OFFSET and LENGTH specify a substring that is partly outside the 5546string, only the part within the string is returned. If the substring 5547is beyond either end of the string, substr() returns the undefined 5548value and produces a warning. When used as an lvalue, specifying a 5549substring that is entirely outside the string is a fatal error. 5550Here's an example showing the behavior for boundary cases: 5551 5552 my $name = 'fred'; 5553 substr($name, 4) = 'dy'; # $name is now 'freddy' 5554 my $null = substr $name, 6, 2; # returns '' (no warning) 5555 my $oops = substr $name, 7; # returns undef, with warning 5556 substr($name, 7) = 'gap'; # fatal error 5557 5558An alternative to using substr() as an lvalue is to specify the 5559replacement string as the 4th argument. This allows you to replace 5560parts of the EXPR and return what was there before in one operation, 5561just as you can with splice(). 5562 5563If the lvalue returned by substr is used after the EXPR is changed in 5564any way, the behaviour may not be as expected and is subject to change. 5565This caveat includes code such as C<print(substr($foo,$a,$b)=$bar)> or 5566C<(substr($foo,$a,$b)=$bar)=$fud> (where $foo is changed via the 5567substring assignment, and then the substr is used again), or where a 5568substr() is aliased via a C<foreach> loop or passed as a parameter or 5569a reference to it is taken and then the alias, parameter, or deref'd 5570reference either is used after the original EXPR has been changed or 5571is assigned to and then used a second time. 5572 5573=item symlink OLDFILE,NEWFILE 5574 5575Creates a new filename symbolically linked to the old filename. 5576Returns C<1> for success, C<0> otherwise. On systems that don't support 5577symbolic links, produces a fatal error at run time. To check for that, 5578use eval: 5579 5580 $symlink_exists = eval { symlink("",""); 1 }; 5581 5582=item syscall NUMBER, LIST 5583 5584Calls the system call specified as the first element of the list, 5585passing the remaining elements as arguments to the system call. If 5586unimplemented, produces a fatal error. The arguments are interpreted 5587as follows: if a given argument is numeric, the argument is passed as 5588an int. If not, the pointer to the string value is passed. You are 5589responsible to make sure a string is pre-extended long enough to 5590receive any result that might be written into a string. You can't use a 5591string literal (or other read-only string) as an argument to C<syscall> 5592because Perl has to assume that any string pointer might be written 5593through. If your 5594integer arguments are not literals and have never been interpreted in a 5595numeric context, you may need to add C<0> to them to force them to look 5596like numbers. This emulates the C<syswrite> function (or vice versa): 5597 5598 require 'syscall.ph'; # may need to run h2ph 5599 $s = "hi there\n"; 5600 syscall(&SYS_write, fileno(STDOUT), $s, length $s); 5601 5602Note that Perl supports passing of up to only 14 arguments to your system call, 5603which in practice should usually suffice. 5604 5605Syscall returns whatever value returned by the system call it calls. 5606If the system call fails, C<syscall> returns C<-1> and sets C<$!> (errno). 5607Note that some system calls can legitimately return C<-1>. The proper 5608way to handle such calls is to assign C<$!=0;> before the call and 5609check the value of C<$!> if syscall returns C<-1>. 5610 5611There's a problem with C<syscall(&SYS_pipe)>: it returns the file 5612number of the read end of the pipe it creates. There is no way 5613to retrieve the file number of the other end. You can avoid this 5614problem by using C<pipe> instead. 5615 5616=item sysopen FILEHANDLE,FILENAME,MODE 5617 5618=item sysopen FILEHANDLE,FILENAME,MODE,PERMS 5619 5620Opens the file whose filename is given by FILENAME, and associates it 5621with FILEHANDLE. If FILEHANDLE is an expression, its value is used as 5622the name of the real filehandle wanted. This function calls the 5623underlying operating system's C<open> function with the parameters 5624FILENAME, MODE, PERMS. 5625 5626The possible values and flag bits of the MODE parameter are 5627system-dependent; they are available via the standard module C<Fcntl>. 5628See the documentation of your operating system's C<open> to see which 5629values and flag bits are available. You may combine several flags 5630using the C<|>-operator. 5631 5632Some of the most common values are C<O_RDONLY> for opening the file in 5633read-only mode, C<O_WRONLY> for opening the file in write-only mode, 5634and C<O_RDWR> for opening the file in read-write mode, and. 5635 5636For historical reasons, some values work on almost every system 5637supported by perl: zero means read-only, one means write-only, and two 5638means read/write. We know that these values do I<not> work under 5639OS/390 & VM/ESA Unix and on the Macintosh; you probably don't want to 5640use them in new code. 5641 5642If the file named by FILENAME does not exist and the C<open> call creates 5643it (typically because MODE includes the C<O_CREAT> flag), then the value of 5644PERMS specifies the permissions of the newly created file. If you omit 5645the PERMS argument to C<sysopen>, Perl uses the octal value C<0666>. 5646These permission values need to be in octal, and are modified by your 5647process's current C<umask>. 5648 5649In many systems the C<O_EXCL> flag is available for opening files in 5650exclusive mode. This is B<not> locking: exclusiveness means here that 5651if the file already exists, sysopen() fails. The C<O_EXCL> wins 5652C<O_TRUNC>. 5653 5654Sometimes you may want to truncate an already-existing file: C<O_TRUNC>. 5655 5656You should seldom if ever use C<0644> as argument to C<sysopen>, because 5657that takes away the user's option to have a more permissive umask. 5658Better to omit it. See the perlfunc(1) entry on C<umask> for more 5659on this. 5660 5661Note that C<sysopen> depends on the fdopen() C library function. 5662On many UNIX systems, fdopen() is known to fail when file descriptors 5663exceed a certain value, typically 255. If you need more file 5664descriptors than that, consider rebuilding Perl to use the C<sfio> 5665library, or perhaps using the POSIX::open() function. 5666 5667See L<perlopentut> for a kinder, gentler explanation of opening files. 5668 5669=item sysread FILEHANDLE,SCALAR,LENGTH,OFFSET 5670 5671=item sysread FILEHANDLE,SCALAR,LENGTH 5672 5673Attempts to read LENGTH bytes of data into variable SCALAR from the 5674specified FILEHANDLE, using the system call read(2). It bypasses 5675buffered IO, so mixing this with other kinds of reads, C<print>, 5676C<write>, C<seek>, C<tell>, or C<eof> can cause confusion because the 5677perlio or stdio layers usually buffers data. Returns the number of 5678bytes actually read, C<0> at end of file, or undef if there was an 5679error (in the latter case C<$!> is also set). SCALAR will be grown or 5680shrunk so that the last byte actually read is the last byte of the 5681scalar after the read. 5682 5683An OFFSET may be specified to place the read data at some place in the 5684string other than the beginning. A negative OFFSET specifies 5685placement at that many characters counting backwards from the end of 5686the string. A positive OFFSET greater than the length of SCALAR 5687results in the string being padded to the required size with C<"\0"> 5688bytes before the result of the read is appended. 5689 5690There is no syseof() function, which is ok, since eof() doesn't work 5691very well on device files (like ttys) anyway. Use sysread() and check 5692for a return value for 0 to decide whether you're done. 5693 5694Note that if the filehandle has been marked as C<:utf8> Unicode 5695characters are read instead of bytes (the LENGTH, OFFSET, and the 5696return value of sysread() are in Unicode characters). 5697The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. 5698See L</binmode>, L</open>, and the C<open> pragma, L<open>. 5699 5700=item sysseek FILEHANDLE,POSITION,WHENCE 5701 5702Sets FILEHANDLE's system position in bytes using the system call 5703lseek(2). FILEHANDLE may be an expression whose value gives the name 5704of the filehandle. The values for WHENCE are C<0> to set the new 5705position to POSITION, C<1> to set the it to the current position plus 5706POSITION, and C<2> to set it to EOF plus POSITION (typically 5707negative). 5708 5709Note the I<in bytes>: even if the filehandle has been set to operate 5710on characters (for example by using the C<:utf8> I/O layer), tell() 5711will return byte offsets, not character offsets (because implementing 5712that would render sysseek() very slow). 5713 5714sysseek() bypasses normal buffered IO, so mixing this with reads (other 5715than C<sysread>, for example >< or read()) C<print>, C<write>, 5716C<seek>, C<tell>, or C<eof> may cause confusion. 5717 5718For WHENCE, you may also use the constants C<SEEK_SET>, C<SEEK_CUR>, 5719and C<SEEK_END> (start of the file, current position, end of the file) 5720from the Fcntl module. Use of the constants is also more portable 5721than relying on 0, 1, and 2. For example to define a "systell" function: 5722 5723 use Fcntl 'SEEK_CUR'; 5724 sub systell { sysseek($_[0], 0, SEEK_CUR) } 5725 5726Returns the new position, or the undefined value on failure. A position 5727of zero is returned as the string C<"0 but true">; thus C<sysseek> returns 5728true on success and false on failure, yet you can still easily determine 5729the new position. 5730 5731=item system LIST 5732 5733=item system PROGRAM LIST 5734 5735Does exactly the same thing as C<exec LIST>, except that a fork is 5736done first, and the parent process waits for the child process to 5737complete. Note that argument processing varies depending on the 5738number of arguments. If there is more than one argument in LIST, 5739or if LIST is an array with more than one value, starts the program 5740given by the first element of the list with arguments given by the 5741rest of the list. If there is only one scalar argument, the argument 5742is checked for shell metacharacters, and if there are any, the 5743entire argument is passed to the system's command shell for parsing 5744(this is C</bin/sh -c> on Unix platforms, but varies on other 5745platforms). If there are no shell metacharacters in the argument, 5746it is split into words and passed directly to C<execvp>, which is 5747more efficient. 5748 5749Beginning with v5.6.0, Perl will attempt to flush all files opened for 5750output before any operation that may do a fork, but this may not be 5751supported on some platforms (see L<perlport>). To be safe, you may need 5752to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method 5753of C<IO::Handle> on any open handles. 5754 5755The return value is the exit status of the program as returned by the 5756C<wait> call. To get the actual exit value shift right by eight (see below). 5757See also L</exec>. This is I<not> what you want to use to capture 5758the output from a command, for that you should use merely backticks or 5759C<qx//>, as described in L<perlop/"`STRING`">. Return value of -1 5760indicates a failure to start the program (inspect $! for the reason). 5761 5762Like C<exec>, C<system> allows you to lie to a program about its name if 5763you use the C<system PROGRAM LIST> syntax. Again, see L</exec>. 5764 5765Since C<SIGINT> and C<SIGQUIT> are ignored during the execution of 5766C<system>, if you expect your program to terminate on receipt of these 5767signals you will need to arrange to do so yourself based on the return 5768value. 5769 5770 @args = ("command", "arg1", "arg2"); 5771 system(@args) == 0 5772 or die "system @args failed: $?" 5773 5774You can check all the failure possibilities by inspecting 5775C<$?> like this: 5776 5777 if ($? == -1) { 5778 print "failed to execute: $!\n"; 5779 } 5780 elsif ($? & 127) { 5781 printf "child died with signal %d, %s coredump\n", 5782 ($? & 127), ($? & 128) ? 'with' : 'without'; 5783 } 5784 else { 5785 printf "child exited with value %d\n", $? >> 8; 5786 } 5787 5788or more portably by using the W*() calls of the POSIX extension; 5789see L<perlport> for more information. 5790 5791When the arguments get executed via the system shell, results 5792and return codes will be subject to its quirks and capabilities. 5793See L<perlop/"`STRING`"> and L</exec> for details. 5794 5795=item syswrite FILEHANDLE,SCALAR,LENGTH,OFFSET 5796 5797=item syswrite FILEHANDLE,SCALAR,LENGTH 5798 5799=item syswrite FILEHANDLE,SCALAR 5800 5801Attempts to write LENGTH bytes of data from variable SCALAR to the 5802specified FILEHANDLE, using the system call write(2). If LENGTH is 5803not specified, writes whole SCALAR. It bypasses buffered IO, so 5804mixing this with reads (other than C<sysread())>, C<print>, C<write>, 5805C<seek>, C<tell>, or C<eof> may cause confusion because the perlio and 5806stdio layers usually buffers data. Returns the number of bytes 5807actually written, or C<undef> if there was an error (in this case the 5808errno variable C<$!> is also set). If the LENGTH is greater than the 5809available data in the SCALAR after the OFFSET, only as much data as is 5810available will be written. 5811 5812An OFFSET may be specified to write the data from some part of the 5813string other than the beginning. A negative OFFSET specifies writing 5814that many characters counting backwards from the end of the string. 5815In the case the SCALAR is empty you can use OFFSET but only zero offset. 5816 5817Note that if the filehandle has been marked as C<:utf8>, Unicode 5818characters are written instead of bytes (the LENGTH, OFFSET, and the 5819return value of syswrite() are in UTF-8 encoded Unicode characters). 5820The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. 5821See L</binmode>, L</open>, and the C<open> pragma, L<open>. 5822 5823=item tell FILEHANDLE 5824 5825=item tell 5826 5827Returns the current position I<in bytes> for FILEHANDLE, or -1 on 5828error. FILEHANDLE may be an expression whose value gives the name of 5829the actual filehandle. If FILEHANDLE is omitted, assumes the file 5830last read. 5831 5832Note the I<in bytes>: even if the filehandle has been set to 5833operate on characters (for example by using the C<:utf8> open 5834layer), tell() will return byte offsets, not character offsets 5835(because that would render seek() and tell() rather slow). 5836 5837The return value of tell() for the standard streams like the STDIN 5838depends on the operating system: it may return -1 or something else. 5839tell() on pipes, fifos, and sockets usually returns -1. 5840 5841There is no C<systell> function. Use C<sysseek(FH, 0, 1)> for that. 5842 5843Do not use tell() on a filehandle that has been opened using 5844sysopen(), use sysseek() for that as described above. Why? Because 5845sysopen() creates unbuffered, "raw", filehandles, while open() creates 5846buffered filehandles. sysseek() make sense only on the first kind, 5847tell() only makes sense on the second kind. 5848 5849=item telldir DIRHANDLE 5850 5851Returns the current position of the C<readdir> routines on DIRHANDLE. 5852Value may be given to C<seekdir> to access a particular location in a 5853directory. Has the same caveats about possible directory compaction as 5854the corresponding system library routine. 5855 5856=item tie VARIABLE,CLASSNAME,LIST 5857 5858This function binds a variable to a package class that will provide the 5859implementation for the variable. VARIABLE is the name of the variable 5860to be enchanted. CLASSNAME is the name of a class implementing objects 5861of correct type. Any additional arguments are passed to the C<new> 5862method of the class (meaning C<TIESCALAR>, C<TIEHANDLE>, C<TIEARRAY>, 5863or C<TIEHASH>). Typically these are arguments such as might be passed 5864to the C<dbm_open()> function of C. The object returned by the C<new> 5865method is also returned by the C<tie> function, which would be useful 5866if you want to access other methods in CLASSNAME. 5867 5868Note that functions such as C<keys> and C<values> may return huge lists 5869when used on large objects, like DBM files. You may prefer to use the 5870C<each> function to iterate over such. Example: 5871 5872 # print out history file offsets 5873 use NDBM_File; 5874 tie(%HIST, 'NDBM_File', '/usr/lib/news/history', 1, 0); 5875 while (($key,$val) = each %HIST) { 5876 print $key, ' = ', unpack('L',$val), "\n"; 5877 } 5878 untie(%HIST); 5879 5880A class implementing a hash should have the following methods: 5881 5882 TIEHASH classname, LIST 5883 FETCH this, key 5884 STORE this, key, value 5885 DELETE this, key 5886 CLEAR this 5887 EXISTS this, key 5888 FIRSTKEY this 5889 NEXTKEY this, lastkey 5890 SCALAR this 5891 DESTROY this 5892 UNTIE this 5893 5894A class implementing an ordinary array should have the following methods: 5895 5896 TIEARRAY classname, LIST 5897 FETCH this, key 5898 STORE this, key, value 5899 FETCHSIZE this 5900 STORESIZE this, count 5901 CLEAR this 5902 PUSH this, LIST 5903 POP this 5904 SHIFT this 5905 UNSHIFT this, LIST 5906 SPLICE this, offset, length, LIST 5907 EXTEND this, count 5908 DESTROY this 5909 UNTIE this 5910 5911A class implementing a file handle should have the following methods: 5912 5913 TIEHANDLE classname, LIST 5914 READ this, scalar, length, offset 5915 READLINE this 5916 GETC this 5917 WRITE this, scalar, length, offset 5918 PRINT this, LIST 5919 PRINTF this, format, LIST 5920 BINMODE this 5921 EOF this 5922 FILENO this 5923 SEEK this, position, whence 5924 TELL this 5925 OPEN this, mode, LIST 5926 CLOSE this 5927 DESTROY this 5928 UNTIE this 5929 5930A class implementing a scalar should have the following methods: 5931 5932 TIESCALAR classname, LIST 5933 FETCH this, 5934 STORE this, value 5935 DESTROY this 5936 UNTIE this 5937 5938Not all methods indicated above need be implemented. See L<perltie>, 5939L<Tie::Hash>, L<Tie::Array>, L<Tie::Scalar>, and L<Tie::Handle>. 5940 5941Unlike C<dbmopen>, the C<tie> function will not use or require a module 5942for you--you need to do that explicitly yourself. See L<DB_File> 5943or the F<Config> module for interesting C<tie> implementations. 5944 5945For further details see L<perltie>, L<"tied VARIABLE">. 5946 5947=item tied VARIABLE 5948 5949Returns a reference to the object underlying VARIABLE (the same value 5950that was originally returned by the C<tie> call that bound the variable 5951to a package.) Returns the undefined value if VARIABLE isn't tied to a 5952package. 5953 5954=item time 5955 5956Returns the number of non-leap seconds since whatever time the system 5957considers to be the epoch (that's 00:00:00, January 1, 1904 for Mac OS, 5958and 00:00:00 UTC, January 1, 1970 for most other systems). 5959Suitable for feeding to C<gmtime> and C<localtime>. 5960 5961For measuring time in better granularity than one second, 5962you may use either the Time::HiRes module (from CPAN, and starting from 5963Perl 5.8 part of the standard distribution), or if you have 5964gettimeofday(2), you may be able to use the C<syscall> interface of Perl. 5965See L<perlfaq8> for details. 5966 5967=item times 5968 5969Returns a four-element list giving the user and system times, in 5970seconds, for this process and the children of this process. 5971 5972 ($user,$system,$cuser,$csystem) = times; 5973 5974In scalar context, C<times> returns C<$user>. 5975 5976=item tr/// 5977 5978The transliteration operator. Same as C<y///>. See L<perlop>. 5979 5980=item truncate FILEHANDLE,LENGTH 5981 5982=item truncate EXPR,LENGTH 5983 5984Truncates the file opened on FILEHANDLE, or named by EXPR, to the 5985specified length. Produces a fatal error if truncate isn't implemented 5986on your system. Returns true if successful, the undefined value 5987otherwise. 5988 5989The behavior is undefined if LENGTH is greater than the length of the 5990file. 5991 5992=item uc EXPR 5993 5994=item uc 5995 5996Returns an uppercased version of EXPR. This is the internal function 5997implementing the C<\U> escape in double-quoted strings. Respects 5998current LC_CTYPE locale if C<use locale> in force. See L<perllocale> 5999and L<perlunicode> for more details about locale and Unicode support. 6000It does not attempt to do titlecase mapping on initial letters. See 6001C<ucfirst> for that. 6002 6003If EXPR is omitted, uses C<$_>. 6004 6005=item ucfirst EXPR 6006 6007=item ucfirst 6008 6009Returns the value of EXPR with the first character in uppercase 6010(titlecase in Unicode). This is the internal function implementing 6011the C<\u> escape in double-quoted strings. Respects current LC_CTYPE 6012locale if C<use locale> in force. See L<perllocale> and L<perlunicode> 6013for more details about locale and Unicode support. 6014 6015If EXPR is omitted, uses C<$_>. 6016 6017=item umask EXPR 6018 6019=item umask 6020 6021Sets the umask for the process to EXPR and returns the previous value. 6022If EXPR is omitted, merely returns the current umask. 6023 6024The Unix permission C<rwxr-x---> is represented as three sets of three 6025bits, or three octal digits: C<0750> (the leading 0 indicates octal 6026and isn't one of the digits). The C<umask> value is such a number 6027representing disabled permissions bits. The permission (or "mode") 6028values you pass C<mkdir> or C<sysopen> are modified by your umask, so 6029even if you tell C<sysopen> to create a file with permissions C<0777>, 6030if your umask is C<0022> then the file will actually be created with 6031permissions C<0755>. If your C<umask> were C<0027> (group can't 6032write; others can't read, write, or execute), then passing 6033C<sysopen> C<0666> would create a file with mode C<0640> (C<0666 &~ 6034027> is C<0640>). 6035 6036Here's some advice: supply a creation mode of C<0666> for regular 6037files (in C<sysopen>) and one of C<0777> for directories (in 6038C<mkdir>) and executable files. This gives users the freedom of 6039choice: if they want protected files, they might choose process umasks 6040of C<022>, C<027>, or even the particularly antisocial mask of C<077>. 6041Programs should rarely if ever make policy decisions better left to 6042the user. The exception to this is when writing files that should be 6043kept private: mail files, web browser cookies, I<.rhosts> files, and 6044so on. 6045 6046If umask(2) is not implemented on your system and you are trying to 6047restrict access for I<yourself> (i.e., (EXPR & 0700) > 0), produces a 6048fatal error at run time. If umask(2) is not implemented and you are 6049not trying to restrict access for yourself, returns C<undef>. 6050 6051Remember that a umask is a number, usually given in octal; it is I<not> a 6052string of octal digits. See also L</oct>, if all you have is a string. 6053 6054=item undef EXPR 6055 6056=item undef 6057 6058Undefines the value of EXPR, which must be an lvalue. Use only on a 6059scalar value, an array (using C<@>), a hash (using C<%>), a subroutine 6060(using C<&>), or a typeglob (using C<*>). (Saying C<undef $hash{$key}> 6061will probably not do what you expect on most predefined variables or 6062DBM list values, so don't do that; see L<delete>.) Always returns the 6063undefined value. You can omit the EXPR, in which case nothing is 6064undefined, but you still get an undefined value that you could, for 6065instance, return from a subroutine, assign to a variable or pass as a 6066parameter. Examples: 6067 6068 undef $foo; 6069 undef $bar{'blurfl'}; # Compare to: delete $bar{'blurfl'}; 6070 undef @ary; 6071 undef %hash; 6072 undef &mysub; 6073 undef *xyz; # destroys $xyz, @xyz, %xyz, &xyz, etc. 6074 return (wantarray ? (undef, $errmsg) : undef) if $they_blew_it; 6075 select undef, undef, undef, 0.25; 6076 ($a, $b, undef, $c) = &foo; # Ignore third value returned 6077 6078Note that this is a unary operator, not a list operator. 6079 6080=item unlink LIST 6081 6082=item unlink 6083 6084Deletes a list of files. Returns the number of files successfully 6085deleted. 6086 6087 $cnt = unlink 'a', 'b', 'c'; 6088 unlink @goners; 6089 unlink <*.bak>; 6090 6091Note: C<unlink> will not delete directories unless you are superuser and 6092the B<-U> flag is supplied to Perl. Even if these conditions are 6093met, be warned that unlinking a directory can inflict damage on your 6094filesystem. Use C<rmdir> instead. 6095 6096If LIST is omitted, uses C<$_>. 6097 6098=item unpack TEMPLATE,EXPR 6099 6100C<unpack> does the reverse of C<pack>: it takes a string 6101and expands it out into a list of values. 6102(In scalar context, it returns merely the first value produced.) 6103 6104The string is broken into chunks described by the TEMPLATE. Each chunk 6105is converted separately to a value. Typically, either the string is a result 6106of C<pack>, or the bytes of the string represent a C structure of some 6107kind. 6108 6109The TEMPLATE has the same format as in the C<pack> function. 6110Here's a subroutine that does substring: 6111 6112 sub substr { 6113 my($what,$where,$howmuch) = @_; 6114 unpack("x$where a$howmuch", $what); 6115 } 6116 6117and then there's 6118 6119 sub ordinal { unpack("c",$_[0]); } # same as ord() 6120 6121In addition to fields allowed in pack(), you may prefix a field with 6122a %<number> to indicate that 6123you want a <number>-bit checksum of the items instead of the items 6124themselves. Default is a 16-bit checksum. Checksum is calculated by 6125summing numeric values of expanded values (for string fields the sum of 6126C<ord($char)> is taken, for bit fields the sum of zeroes and ones). 6127 6128For example, the following 6129computes the same number as the System V sum program: 6130 6131 $checksum = do { 6132 local $/; # slurp! 6133 unpack("%32C*",<>) % 65535; 6134 }; 6135 6136The following efficiently counts the number of set bits in a bit vector: 6137 6138 $setbits = unpack("%32b*", $selectmask); 6139 6140The C<p> and C<P> formats should be used with care. Since Perl 6141has no way of checking whether the value passed to C<unpack()> 6142corresponds to a valid memory location, passing a pointer value that's 6143not known to be valid is likely to have disastrous consequences. 6144 6145If there are more pack codes or if the repeat count of a field or a group 6146is larger than what the remainder of the input string allows, the result 6147is not well defined: in some cases, the repeat count is decreased, or 6148C<unpack()> will produce null strings or zeroes, or terminate with an 6149error. If the input string is longer than one described by the TEMPLATE, 6150the rest is ignored. 6151 6152See L</pack> for more examples and notes. 6153 6154=item untie VARIABLE 6155 6156Breaks the binding between a variable and a package. (See C<tie>.) 6157Has no effect if the variable is not tied. 6158 6159=item unshift ARRAY,LIST 6160 6161Does the opposite of a C<shift>. Or the opposite of a C<push>, 6162depending on how you look at it. Prepends list to the front of the 6163array, and returns the new number of elements in the array. 6164 6165 unshift(@ARGV, '-e') unless $ARGV[0] =~ /^-/; 6166 6167Note the LIST is prepended whole, not one element at a time, so the 6168prepended elements stay in the same order. Use C<reverse> to do the 6169reverse. 6170 6171=item use Module VERSION LIST 6172 6173=item use Module VERSION 6174 6175=item use Module LIST 6176 6177=item use Module 6178 6179=item use VERSION 6180 6181Imports some semantics into the current package from the named module, 6182generally by aliasing certain subroutine or variable names into your 6183package. It is exactly equivalent to 6184 6185 BEGIN { require Module; import Module LIST; } 6186 6187except that Module I<must> be a bareword. 6188 6189VERSION may be either a numeric argument such as 5.006, which will be 6190compared to C<$]>, or a literal of the form v5.6.1, which will be compared 6191to C<$^V> (aka $PERL_VERSION. A fatal error is produced if VERSION is 6192greater than the version of the current Perl interpreter; Perl will not 6193attempt to parse the rest of the file. Compare with L</require>, which can 6194do a similar check at run time. 6195 6196Specifying VERSION as a literal of the form v5.6.1 should generally be 6197avoided, because it leads to misleading error messages under earlier 6198versions of Perl which do not support this syntax. The equivalent numeric 6199version should be used instead. 6200 6201 use v5.6.1; # compile time version check 6202 use 5.6.1; # ditto 6203 use 5.006_001; # ditto; preferred for backwards compatibility 6204 6205This is often useful if you need to check the current Perl version before 6206C<use>ing library modules that have changed in incompatible ways from 6207older versions of Perl. (We try not to do this more than we have to.) 6208 6209The C<BEGIN> forces the C<require> and C<import> to happen at compile time. The 6210C<require> makes sure the module is loaded into memory if it hasn't been 6211yet. The C<import> is not a builtin--it's just an ordinary static method 6212call into the C<Module> package to tell the module to import the list of 6213features back into the current package. The module can implement its 6214C<import> method any way it likes, though most modules just choose to 6215derive their C<import> method via inheritance from the C<Exporter> class that 6216is defined in the C<Exporter> module. See L<Exporter>. If no C<import> 6217method can be found then the call is skipped. 6218 6219If you do not want to call the package's C<import> method (for instance, 6220to stop your namespace from being altered), explicitly supply the empty list: 6221 6222 use Module (); 6223 6224That is exactly equivalent to 6225 6226 BEGIN { require Module } 6227 6228If the VERSION argument is present between Module and LIST, then the 6229C<use> will call the VERSION method in class Module with the given 6230version as an argument. The default VERSION method, inherited from 6231the UNIVERSAL class, croaks if the given version is larger than the 6232value of the variable C<$Module::VERSION>. 6233 6234Again, there is a distinction between omitting LIST (C<import> called 6235with no arguments) and an explicit empty LIST C<()> (C<import> not 6236called). Note that there is no comma after VERSION! 6237 6238Because this is a wide-open interface, pragmas (compiler directives) 6239are also implemented this way. Currently implemented pragmas are: 6240 6241 use constant; 6242 use diagnostics; 6243 use integer; 6244 use sigtrap qw(SEGV BUS); 6245 use strict qw(subs vars refs); 6246 use subs qw(afunc blurfl); 6247 use warnings qw(all); 6248 use sort qw(stable _quicksort _mergesort); 6249 6250Some of these pseudo-modules import semantics into the current 6251block scope (like C<strict> or C<integer>, unlike ordinary modules, 6252which import symbols into the current package (which are effective 6253through the end of the file). 6254 6255There's a corresponding C<no> command that unimports meanings imported 6256by C<use>, i.e., it calls C<unimport Module LIST> instead of C<import>. 6257 6258 no integer; 6259 no strict 'refs'; 6260 no warnings; 6261 6262See L<perlmodlib> for a list of standard modules and pragmas. See L<perlrun> 6263for the C<-M> and C<-m> command-line options to perl that give C<use> 6264functionality from the command-line. 6265 6266=item utime LIST 6267 6268Changes the access and modification times on each file of a list of 6269files. The first two elements of the list must be the NUMERICAL access 6270and modification times, in that order. Returns the number of files 6271successfully changed. The inode change time of each file is set 6272to the current time. For example, this code has the same effect as the 6273Unix touch(1) command when the files I<already exist>. 6274 6275 #!/usr/bin/perl 6276 $atime = $mtime = time; 6277 utime $atime, $mtime, @ARGV; 6278 6279Since perl 5.7.2, if the first two elements of the list are C<undef>, then 6280the utime(2) function in the C library will be called with a null second 6281argument. On most systems, this will set the file's access and 6282modification times to the current time (i.e. equivalent to the example 6283above.) 6284 6285 utime undef, undef, @ARGV; 6286 6287Under NFS this will use the time of the NFS server, not the time of 6288the local machine. If there is a time synchronization problem, the 6289NFS server and local machine will have different times. The Unix 6290touch(1) command will in fact normally use this form instead of the 6291one shown in the first example. 6292 6293Note that only passing one of the first two elements as C<undef> will 6294be equivalent of passing it as 0 and will not have the same effect as 6295described when they are both C<undef>. This case will also trigger an 6296uninitialized warning. 6297 6298=item values HASH 6299 6300Returns a list consisting of all the values of the named hash. 6301(In a scalar context, returns the number of values.) 6302 6303The values are returned in an apparently random order. The actual 6304random order is subject to change in future versions of perl, but it 6305is guaranteed to be the same order as either the C<keys> or C<each> 6306function would produce on the same (unmodified) hash. Since Perl 63075.8.1 the ordering is different even between different runs of Perl 6308for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">). 6309 6310As a side effect, calling values() resets the HASH's internal iterator, 6311see L</each>. (In particular, calling values() in void context resets 6312the iterator with no other overhead.) 6313 6314Note that the values are not copied, which means modifying them will 6315modify the contents of the hash: 6316 6317 for (values %hash) { s/foo/bar/g } # modifies %hash values 6318 for (@hash{keys %hash}) { s/foo/bar/g } # same 6319 6320See also C<keys>, C<each>, and C<sort>. 6321 6322=item vec EXPR,OFFSET,BITS 6323 6324Treats the string in EXPR as a bit vector made up of elements of 6325width BITS, and returns the value of the element specified by OFFSET 6326as an unsigned integer. BITS therefore specifies the number of bits 6327that are reserved for each element in the bit vector. This must 6328be a power of two from 1 to 32 (or 64, if your platform supports 6329that). 6330 6331If BITS is 8, "elements" coincide with bytes of the input string. 6332 6333If BITS is 16 or more, bytes of the input string are grouped into chunks 6334of size BITS/8, and each group is converted to a number as with 6335pack()/unpack() with big-endian formats C<n>/C<N> (and analogously 6336for BITS==64). See L<"pack"> for details. 6337 6338If bits is 4 or less, the string is broken into bytes, then the bits 6339of each byte are broken into 8/BITS groups. Bits of a byte are 6340numbered in a little-endian-ish way, as in C<0x01>, C<0x02>, 6341C<0x04>, C<0x08>, C<0x10>, C<0x20>, C<0x40>, C<0x80>. For example, 6342breaking the single input byte C<chr(0x36)> into two groups gives a list 6343C<(0x6, 0x3)>; breaking it into 4 groups gives C<(0x2, 0x1, 0x3, 0x0)>. 6344 6345C<vec> may also be assigned to, in which case parentheses are needed 6346to give the expression the correct precedence as in 6347 6348 vec($image, $max_x * $x + $y, 8) = 3; 6349 6350If the selected element is outside the string, the value 0 is returned. 6351If an element off the end of the string is written to, Perl will first 6352extend the string with sufficiently many zero bytes. It is an error 6353to try to write off the beginning of the string (i.e. negative OFFSET). 6354 6355The string should not contain any character with the value > 255 (which 6356can only happen if you're using UTF-8 encoding). If it does, it will be 6357treated as something which is not UTF-8 encoded. When the C<vec> was 6358assigned to, other parts of your program will also no longer consider the 6359string to be UTF-8 encoded. In other words, if you do have such characters 6360in your string, vec() will operate on the actual byte string, and not the 6361conceptual character string. 6362 6363Strings created with C<vec> can also be manipulated with the logical 6364operators C<|>, C<&>, C<^>, and C<~>. These operators will assume a bit 6365vector operation is desired when both operands are strings. 6366See L<perlop/"Bitwise String Operators">. 6367 6368The following code will build up an ASCII string saying C<'PerlPerlPerl'>. 6369The comments show the string after each step. Note that this code works 6370in the same way on big-endian or little-endian machines. 6371 6372 my $foo = ''; 6373 vec($foo, 0, 32) = 0x5065726C; # 'Perl' 6374 6375 # $foo eq "Perl" eq "\x50\x65\x72\x6C", 32 bits 6376 print vec($foo, 0, 8); # prints 80 == 0x50 == ord('P') 6377 6378 vec($foo, 2, 16) = 0x5065; # 'PerlPe' 6379 vec($foo, 3, 16) = 0x726C; # 'PerlPerl' 6380 vec($foo, 8, 8) = 0x50; # 'PerlPerlP' 6381 vec($foo, 9, 8) = 0x65; # 'PerlPerlPe' 6382 vec($foo, 20, 4) = 2; # 'PerlPerlPe' . "\x02" 6383 vec($foo, 21, 4) = 7; # 'PerlPerlPer' 6384 # 'r' is "\x72" 6385 vec($foo, 45, 2) = 3; # 'PerlPerlPer' . "\x0c" 6386 vec($foo, 93, 1) = 1; # 'PerlPerlPer' . "\x2c" 6387 vec($foo, 94, 1) = 1; # 'PerlPerlPerl' 6388 # 'l' is "\x6c" 6389 6390To transform a bit vector into a string or list of 0's and 1's, use these: 6391 6392 $bits = unpack("b*", $vector); 6393 @bits = split(//, unpack("b*", $vector)); 6394 6395If you know the exact length in bits, it can be used in place of the C<*>. 6396 6397Here is an example to illustrate how the bits actually fall in place: 6398 6399 #!/usr/bin/perl -wl 6400 6401 print <<'EOT'; 6402 0 1 2 3 6403 unpack("V",$_) 01234567890123456789012345678901 6404 ------------------------------------------------------------------ 6405 EOT 6406 6407 for $w (0..3) { 6408 $width = 2**$w; 6409 for ($shift=0; $shift < $width; ++$shift) { 6410 for ($off=0; $off < 32/$width; ++$off) { 6411 $str = pack("B*", "0"x32); 6412 $bits = (1<<$shift); 6413 vec($str, $off, $width) = $bits; 6414 $res = unpack("b*",$str); 6415 $val = unpack("V", $str); 6416 write; 6417 } 6418 } 6419 } 6420 6421 format STDOUT = 6422 vec($_,@#,@#) = @<< == @######### @>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 6423 $off, $width, $bits, $val, $res 6424 . 6425 __END__ 6426 6427Regardless of the machine architecture on which it is run, the above 6428example should print the following table: 6429 6430 0 1 2 3 6431 unpack("V",$_) 01234567890123456789012345678901 6432 ------------------------------------------------------------------ 6433 vec($_, 0, 1) = 1 == 1 10000000000000000000000000000000 6434 vec($_, 1, 1) = 1 == 2 01000000000000000000000000000000 6435 vec($_, 2, 1) = 1 == 4 00100000000000000000000000000000 6436 vec($_, 3, 1) = 1 == 8 00010000000000000000000000000000 6437 vec($_, 4, 1) = 1 == 16 00001000000000000000000000000000 6438 vec($_, 5, 1) = 1 == 32 00000100000000000000000000000000 6439 vec($_, 6, 1) = 1 == 64 00000010000000000000000000000000 6440 vec($_, 7, 1) = 1 == 128 00000001000000000000000000000000 6441 vec($_, 8, 1) = 1 == 256 00000000100000000000000000000000 6442 vec($_, 9, 1) = 1 == 512 00000000010000000000000000000000 6443 vec($_,10, 1) = 1 == 1024 00000000001000000000000000000000 6444 vec($_,11, 1) = 1 == 2048 00000000000100000000000000000000 6445 vec($_,12, 1) = 1 == 4096 00000000000010000000000000000000 6446 vec($_,13, 1) = 1 == 8192 00000000000001000000000000000000 6447 vec($_,14, 1) = 1 == 16384 00000000000000100000000000000000 6448 vec($_,15, 1) = 1 == 32768 00000000000000010000000000000000 6449 vec($_,16, 1) = 1 == 65536 00000000000000001000000000000000 6450 vec($_,17, 1) = 1 == 131072 00000000000000000100000000000000 6451 vec($_,18, 1) = 1 == 262144 00000000000000000010000000000000 6452 vec($_,19, 1) = 1 == 524288 00000000000000000001000000000000 6453 vec($_,20, 1) = 1 == 1048576 00000000000000000000100000000000 6454 vec($_,21, 1) = 1 == 2097152 00000000000000000000010000000000 6455 vec($_,22, 1) = 1 == 4194304 00000000000000000000001000000000 6456 vec($_,23, 1) = 1 == 8388608 00000000000000000000000100000000 6457 vec($_,24, 1) = 1 == 16777216 00000000000000000000000010000000 6458 vec($_,25, 1) = 1 == 33554432 00000000000000000000000001000000 6459 vec($_,26, 1) = 1 == 67108864 00000000000000000000000000100000 6460 vec($_,27, 1) = 1 == 134217728 00000000000000000000000000010000 6461 vec($_,28, 1) = 1 == 268435456 00000000000000000000000000001000 6462 vec($_,29, 1) = 1 == 536870912 00000000000000000000000000000100 6463 vec($_,30, 1) = 1 == 1073741824 00000000000000000000000000000010 6464 vec($_,31, 1) = 1 == 2147483648 00000000000000000000000000000001 6465 vec($_, 0, 2) = 1 == 1 10000000000000000000000000000000 6466 vec($_, 1, 2) = 1 == 4 00100000000000000000000000000000 6467 vec($_, 2, 2) = 1 == 16 00001000000000000000000000000000 6468 vec($_, 3, 2) = 1 == 64 00000010000000000000000000000000 6469 vec($_, 4, 2) = 1 == 256 00000000100000000000000000000000 6470 vec($_, 5, 2) = 1 == 1024 00000000001000000000000000000000 6471 vec($_, 6, 2) = 1 == 4096 00000000000010000000000000000000 6472 vec($_, 7, 2) = 1 == 16384 00000000000000100000000000000000 6473 vec($_, 8, 2) = 1 == 65536 00000000000000001000000000000000 6474 vec($_, 9, 2) = 1 == 262144 00000000000000000010000000000000 6475 vec($_,10, 2) = 1 == 1048576 00000000000000000000100000000000 6476 vec($_,11, 2) = 1 == 4194304 00000000000000000000001000000000 6477 vec($_,12, 2) = 1 == 16777216 00000000000000000000000010000000 6478 vec($_,13, 2) = 1 == 67108864 00000000000000000000000000100000 6479 vec($_,14, 2) = 1 == 268435456 00000000000000000000000000001000 6480 vec($_,15, 2) = 1 == 1073741824 00000000000000000000000000000010 6481 vec($_, 0, 2) = 2 == 2 01000000000000000000000000000000 6482 vec($_, 1, 2) = 2 == 8 00010000000000000000000000000000 6483 vec($_, 2, 2) = 2 == 32 00000100000000000000000000000000 6484 vec($_, 3, 2) = 2 == 128 00000001000000000000000000000000 6485 vec($_, 4, 2) = 2 == 512 00000000010000000000000000000000 6486 vec($_, 5, 2) = 2 == 2048 00000000000100000000000000000000 6487 vec($_, 6, 2) = 2 == 8192 00000000000001000000000000000000 6488 vec($_, 7, 2) = 2 == 32768 00000000000000010000000000000000 6489 vec($_, 8, 2) = 2 == 131072 00000000000000000100000000000000 6490 vec($_, 9, 2) = 2 == 524288 00000000000000000001000000000000 6491 vec($_,10, 2) = 2 == 2097152 00000000000000000000010000000000 6492 vec($_,11, 2) = 2 == 8388608 00000000000000000000000100000000 6493 vec($_,12, 2) = 2 == 33554432 00000000000000000000000001000000 6494 vec($_,13, 2) = 2 == 134217728 00000000000000000000000000010000 6495 vec($_,14, 2) = 2 == 536870912 00000000000000000000000000000100 6496 vec($_,15, 2) = 2 == 2147483648 00000000000000000000000000000001 6497 vec($_, 0, 4) = 1 == 1 10000000000000000000000000000000 6498 vec($_, 1, 4) = 1 == 16 00001000000000000000000000000000 6499 vec($_, 2, 4) = 1 == 256 00000000100000000000000000000000 6500 vec($_, 3, 4) = 1 == 4096 00000000000010000000000000000000 6501 vec($_, 4, 4) = 1 == 65536 00000000000000001000000000000000 6502 vec($_, 5, 4) = 1 == 1048576 00000000000000000000100000000000 6503 vec($_, 6, 4) = 1 == 16777216 00000000000000000000000010000000 6504 vec($_, 7, 4) = 1 == 268435456 00000000000000000000000000001000 6505 vec($_, 0, 4) = 2 == 2 01000000000000000000000000000000 6506 vec($_, 1, 4) = 2 == 32 00000100000000000000000000000000 6507 vec($_, 2, 4) = 2 == 512 00000000010000000000000000000000 6508 vec($_, 3, 4) = 2 == 8192 00000000000001000000000000000000 6509 vec($_, 4, 4) = 2 == 131072 00000000000000000100000000000000 6510 vec($_, 5, 4) = 2 == 2097152 00000000000000000000010000000000 6511 vec($_, 6, 4) = 2 == 33554432 00000000000000000000000001000000 6512 vec($_, 7, 4) = 2 == 536870912 00000000000000000000000000000100 6513 vec($_, 0, 4) = 4 == 4 00100000000000000000000000000000 6514 vec($_, 1, 4) = 4 == 64 00000010000000000000000000000000 6515 vec($_, 2, 4) = 4 == 1024 00000000001000000000000000000000 6516 vec($_, 3, 4) = 4 == 16384 00000000000000100000000000000000 6517 vec($_, 4, 4) = 4 == 262144 00000000000000000010000000000000 6518 vec($_, 5, 4) = 4 == 4194304 00000000000000000000001000000000 6519 vec($_, 6, 4) = 4 == 67108864 00000000000000000000000000100000 6520 vec($_, 7, 4) = 4 == 1073741824 00000000000000000000000000000010 6521 vec($_, 0, 4) = 8 == 8 00010000000000000000000000000000 6522 vec($_, 1, 4) = 8 == 128 00000001000000000000000000000000 6523 vec($_, 2, 4) = 8 == 2048 00000000000100000000000000000000 6524 vec($_, 3, 4) = 8 == 32768 00000000000000010000000000000000 6525 vec($_, 4, 4) = 8 == 524288 00000000000000000001000000000000 6526 vec($_, 5, 4) = 8 == 8388608 00000000000000000000000100000000 6527 vec($_, 6, 4) = 8 == 134217728 00000000000000000000000000010000 6528 vec($_, 7, 4) = 8 == 2147483648 00000000000000000000000000000001 6529 vec($_, 0, 8) = 1 == 1 10000000000000000000000000000000 6530 vec($_, 1, 8) = 1 == 256 00000000100000000000000000000000 6531 vec($_, 2, 8) = 1 == 65536 00000000000000001000000000000000 6532 vec($_, 3, 8) = 1 == 16777216 00000000000000000000000010000000 6533 vec($_, 0, 8) = 2 == 2 01000000000000000000000000000000 6534 vec($_, 1, 8) = 2 == 512 00000000010000000000000000000000 6535 vec($_, 2, 8) = 2 == 131072 00000000000000000100000000000000 6536 vec($_, 3, 8) = 2 == 33554432 00000000000000000000000001000000 6537 vec($_, 0, 8) = 4 == 4 00100000000000000000000000000000 6538 vec($_, 1, 8) = 4 == 1024 00000000001000000000000000000000 6539 vec($_, 2, 8) = 4 == 262144 00000000000000000010000000000000 6540 vec($_, 3, 8) = 4 == 67108864 00000000000000000000000000100000 6541 vec($_, 0, 8) = 8 == 8 00010000000000000000000000000000 6542 vec($_, 1, 8) = 8 == 2048 00000000000100000000000000000000 6543 vec($_, 2, 8) = 8 == 524288 00000000000000000001000000000000 6544 vec($_, 3, 8) = 8 == 134217728 00000000000000000000000000010000 6545 vec($_, 0, 8) = 16 == 16 00001000000000000000000000000000 6546 vec($_, 1, 8) = 16 == 4096 00000000000010000000000000000000 6547 vec($_, 2, 8) = 16 == 1048576 00000000000000000000100000000000 6548 vec($_, 3, 8) = 16 == 268435456 00000000000000000000000000001000 6549 vec($_, 0, 8) = 32 == 32 00000100000000000000000000000000 6550 vec($_, 1, 8) = 32 == 8192 00000000000001000000000000000000 6551 vec($_, 2, 8) = 32 == 2097152 00000000000000000000010000000000 6552 vec($_, 3, 8) = 32 == 536870912 00000000000000000000000000000100 6553 vec($_, 0, 8) = 64 == 64 00000010000000000000000000000000 6554 vec($_, 1, 8) = 64 == 16384 00000000000000100000000000000000 6555 vec($_, 2, 8) = 64 == 4194304 00000000000000000000001000000000 6556 vec($_, 3, 8) = 64 == 1073741824 00000000000000000000000000000010 6557 vec($_, 0, 8) = 128 == 128 00000001000000000000000000000000 6558 vec($_, 1, 8) = 128 == 32768 00000000000000010000000000000000 6559 vec($_, 2, 8) = 128 == 8388608 00000000000000000000000100000000 6560 vec($_, 3, 8) = 128 == 2147483648 00000000000000000000000000000001 6561 6562=item wait 6563 6564Behaves like the wait(2) system call on your system: it waits for a child 6565process to terminate and returns the pid of the deceased process, or 6566C<-1> if there are no child processes. The status is returned in C<$?>. 6567Note that a return value of C<-1> could mean that child processes are 6568being automatically reaped, as described in L<perlipc>. 6569 6570=item waitpid PID,FLAGS 6571 6572Waits for a particular child process to terminate and returns the pid of 6573the deceased process, or C<-1> if there is no such child process. On some 6574systems, a value of 0 indicates that there are processes still running. 6575The status is returned in C<$?>. If you say 6576 6577 use POSIX ":sys_wait_h"; 6578 #... 6579 do { 6580 $kid = waitpid(-1, WNOHANG); 6581 } until $kid > 0; 6582 6583then you can do a non-blocking wait for all pending zombie processes. 6584Non-blocking wait is available on machines supporting either the 6585waitpid(2) or wait4(2) system calls. However, waiting for a particular 6586pid with FLAGS of C<0> is implemented everywhere. (Perl emulates the 6587system call by remembering the status values of processes that have 6588exited but have not been harvested by the Perl script yet.) 6589 6590Note that on some systems, a return value of C<-1> could mean that child 6591processes are being automatically reaped. See L<perlipc> for details, 6592and for other examples. 6593 6594=item wantarray 6595 6596Returns true if the context of the currently executing subroutine or 6597eval() block is looking for a list value. Returns false if the context is 6598looking for a scalar. Returns the undefined value if the context is 6599looking for no value (void context). 6600 6601 return unless defined wantarray; # don't bother doing more 6602 my @a = complex_calculation(); 6603 return wantarray ? @a : "@a"; 6604 6605This function should have been named wantlist() instead. 6606 6607=item warn LIST 6608 6609Produces a message on STDERR just like C<die>, but doesn't exit or throw 6610an exception. 6611 6612If LIST is empty and C<$@> already contains a value (typically from a 6613previous eval) that value is used after appending C<"\t...caught"> 6614to C<$@>. This is useful for staying almost, but not entirely similar to 6615C<die>. 6616 6617If C<$@> is empty then the string C<"Warning: Something's wrong"> is used. 6618 6619No message is printed if there is a C<$SIG{__WARN__}> handler 6620installed. It is the handler's responsibility to deal with the message 6621as it sees fit (like, for instance, converting it into a C<die>). Most 6622handlers must therefore make arrangements to actually display the 6623warnings that they are not prepared to deal with, by calling C<warn> 6624again in the handler. Note that this is quite safe and will not 6625produce an endless loop, since C<__WARN__> hooks are not called from 6626inside one. 6627 6628You will find this behavior is slightly different from that of 6629C<$SIG{__DIE__}> handlers (which don't suppress the error text, but can 6630instead call C<die> again to change it). 6631 6632Using a C<__WARN__> handler provides a powerful way to silence all 6633warnings (even the so-called mandatory ones). An example: 6634 6635 # wipe out *all* compile-time warnings 6636 BEGIN { $SIG{'__WARN__'} = sub { warn $_[0] if $DOWARN } } 6637 my $foo = 10; 6638 my $foo = 20; # no warning about duplicate my $foo, 6639 # but hey, you asked for it! 6640 # no compile-time or run-time warnings before here 6641 $DOWARN = 1; 6642 6643 # run-time warnings enabled after here 6644 warn "\$foo is alive and $foo!"; # does show up 6645 6646See L<perlvar> for details on setting C<%SIG> entries, and for more 6647examples. See the Carp module for other kinds of warnings using its 6648carp() and cluck() functions. 6649 6650=item write FILEHANDLE 6651 6652=item write EXPR 6653 6654=item write 6655 6656Writes a formatted record (possibly multi-line) to the specified FILEHANDLE, 6657using the format associated with that file. By default the format for 6658a file is the one having the same name as the filehandle, but the 6659format for the current output channel (see the C<select> function) may be set 6660explicitly by assigning the name of the format to the C<$~> variable. 6661 6662Top of form processing is handled automatically: if there is 6663insufficient room on the current page for the formatted record, the 6664page is advanced by writing a form feed, a special top-of-page format 6665is used to format the new page header, and then the record is written. 6666By default the top-of-page format is the name of the filehandle with 6667"_TOP" appended, but it may be dynamically set to the format of your 6668choice by assigning the name to the C<$^> variable while the filehandle is 6669selected. The number of lines remaining on the current page is in 6670variable C<$->, which can be set to C<0> to force a new page. 6671 6672If FILEHANDLE is unspecified, output goes to the current default output 6673channel, which starts out as STDOUT but may be changed by the 6674C<select> operator. If the FILEHANDLE is an EXPR, then the expression 6675is evaluated and the resulting string is used to look up the name of 6676the FILEHANDLE at run time. For more on formats, see L<perlform>. 6677 6678Note that write is I<not> the opposite of C<read>. Unfortunately. 6679 6680=item y/// 6681 6682The transliteration operator. Same as C<tr///>. See L<perlop>. 6683 6684=back 6685