perlglossary - Perl Glossary
|
perlglossary - Perl Glossary
version 5.20190126
A glossary of terms (technical and otherwise) used in the Perl
documentation, derived from the Glossary of Programming
Perl, Fourth Edition. Words or phrases in bold are defined elsewhere in
this glossary.
Other useful sources include the Unicode Glossary http://unicode.org/glossary/,
the Free On-Line Dictionary of Computing http://foldoc.org/,
the Jargon File http://catb.org/~esr/jargon/,
and Wikipedia http://www.wikipedia.org/.
- accessor methods
-
A method used to
indirectly inspect or update an object’s state (its instance
variables).
- actual arguments
-
The scalar values that you supply
to a function or subroutine when you call it. For instance, when you
call
power("puff")
, the string "puff"
is the actual argument. See also
argument and formal arguments.
- address operator
-
Some languages work directly with the memory addresses of
values, but this can be like playing with fire. Perl provides a set of
asbestos gloves for handling all memory management. The closest to an
address operator in Perl is the backslash operator, but it gives you a
hard reference, which is much safer than a memory address.
- algorithm
-
A well-defined sequence of steps, explained clearly
enough that even a computer could do them.
- alias
-
A nickname for something, which behaves in all ways as
though you’d used the original name instead of the nickname. Temporary
aliases are implicitly created in the loop variable for
foreach
loops, in
the $_
variable for map
or grep
operators, in $a
and $b
during sort
’s comparison function, and in each element of @_
for the
actual arguments of a subroutine call. Permanent aliases are explicitly
created in packages by importing symbols or by assignment to
typeglobs. Lexically scoped aliases for package variables are explicitly
created by the our
declaration.
- alphabetic
-
The sort of characters we put into words. In Unicode, this
is all letters including all ideographs and certain diacritics, letter
numbers like Roman numerals, and various combining marks.
- alternatives
-
A list of possible choices from which you may
select only one, as in, “Would you like door A, B, or C?” Alternatives in
regular expressions are separated with a single vertical bar:
|
.
Alternatives in normal Perl expressions are separated with a double vertical
bar: ||
. Logical alternatives in Boolean expressions are separated
with either ||
or or
.
- anonymous
-
Used to describe a referent
that is not directly accessible through a named variable. Such a referent
must be indirectly accessible through at least one hard reference. When
the last hard reference goes away, the anonymous referent is destroyed
without pity.
- application
-
A bigger, fancier sort of program with a fancier
name so people don’t realize they are using a program.
- architecture
-
The kind of computer you’re working on, where one “kind of
computer” means all those computers sharing a compatible machine language.
Since Perl programs are (typically) simple text files, not executable
images, a Perl program is much less sensitive to the architecture it’s
running on than programs in other languages, such as C, that are compiled
into machine code. See also platform and operating system.
- argument
-
A piece of data supplied to a program,
subroutine, function, or method to tell it what it’s supposed to
do. Also called a “parameter”.
- ARGV
-
The name of the array containing the argument vector
from the command line. If you use the empty
<>
operator, ARGV
is the name of both the filehandle used to traverse the arguments and the
scalar containing the name of the current input file.
- arithmetical operator
-
A symbol such as
+
or /
that tells
Perl to do the arithmetic you were supposed to learn in grade school.
- array
-
An ordered sequence of values, stored such that you can
easily access any of the values using an integer subscript that specifies
the value’s offset in the sequence.
- array context
-
An archaic expression for what is more correctly referred to
as list context.
- Artistic License
-
The open source license that Larry Wall
created for Perl, maximizing Perl’s usefulness, availability, and
modifiability. The current version is 2. (http://www.opensource.org/licenses/artistic-license.php).
- ASCII
-
The American Standard Code for
Information Interchange (a 7-bit character set adequate only for poorly
representing English text). Often used loosely to describe the lowest 128
values of the various ISO-8859-X character sets, a bunch of mutually
incompatible 8-bit codes best described as half ASCII. See also Unicode.
- assertion
-
A component of a regular expression that must be true for the pattern to
match but does not necessarily match any characters itself. Often used
specifically to mean a zero-width assertion.
- assignment
-
An operator whose assigned mission in life is to
change the value of a variable.
- assignment operator
-
Either a regular assignment or a compound
operator composed of an ordinary assignment and some other operator, that
changes the value of a variable in place; that is, relative to its old
value. For example,
$a += 2
adds 2
to $a
.
- associative array
-
See hash. Please. The term associative array is the
old Perl 4 term for a hash. Some languages call it a dictionary.
- associativity
-
Determines whether you do the left operator first or the
right operator first when you have “A operator B operator C”, and
the two operators are of the same precedence. Operators like
+
are left
associative, while operators like **
are right associative. See Camel
chapter 3, “Unary and Binary Operators” for a list of operators and their
associativity.
- asynchronous
-
Said of events or activities whose relative
temporal ordering is indeterminate because too many things are going on at
once. Hence, an asynchronous event is one you didn’t know when to expect.
- atom
-
A regular expression component potentially matching a
substring containing one or more characters and treated as an indivisible
syntactic unit by any following quantifier. (Contrast with an
assertion that matches something of zero width and may not be quantified.)
- atomic operation
-
When Democritus gave the word “atom” to the indivisible
bits of matter, he meant literally something that could not be cut: ἀ-
(not) + -τομος (cuttable). An atomic operation is an action that can’t be
interrupted, not one forbidden in a nuclear-free zone.
- attribute
-
A new feature that allows the declaration of
variables and subroutines with modifiers, as in
sub foo : locked
method
. Also another name for an instance variable of an object.
- autogeneration
-
A feature of operator overloading of objects,
whereby the behavior of certain operators can be reasonably deduced using
more fundamental operators. This assumes that the overloaded operators will
often have the same relationships as the regular operators. See Camel
chapter 13, “Overloading”.
- autoincrement
-
To add one to something automatically, hence the name
of the
++
operator. To instead subtract one from something automatically
is known as an “autodecrement”.
- autoload
-
To load on demand. (Also called “lazy” loading.)
Specifically, to call an
AUTOLOAD
subroutine on behalf of an undefined
subroutine.
- autosplit
-
To split a string automatically, as the –a switch
does when running under –p or –n in order to emulate awk. (See also
the
AutoSplit
module, which has nothing to do with the
–a
switch but a lot to do with autoloading.)
- autovivification
-
A Graeco-Roman word meaning “to bring oneself to life”.
In Perl, storage locations (lvalues) spontaneously generate themselves as
needed, including the creation of any hard reference values to point to
the next level of storage. The assignment
$a[5][5][5][5][5] = "quintet"
potentially creates five scalar storage locations, plus four references (in
the first four scalar locations) pointing to four new anonymous arrays (to
hold the last four scalar locations). But the point of autovivification is
that you don’t have to worry about it.
- AV
-
Short for “array
value”, which refers to one of Perl’s internal data types that holds an
array. The
AV
type is a subclass of SV.
- awk
-
Descriptive editing term—short for “awkward”. Also
coincidentally refers to a venerable text-processing language from which
Perl derived some of its high-level ideas.
- backreference
-
A substring captured
by a subpattern within unadorned parentheses in a regex. Backslashed
decimal numbers (
\1
, \2
, etc.) later in the same pattern refer back to
the corresponding subpattern in the current match. Outside the pattern, the
numbered variables ($1
, $2
, etc.) continue to refer to these same
values, as long as the pattern was the last successful match of the current
dynamic scope.
- backtracking
-
The practice of saying, “If I had to do it all over, I’d do
it differently,” and then actually going back and doing it all over
differently. Mathematically speaking, it’s returning from an unsuccessful
recursion on a tree of possibilities. Perl backtracks when it attempts to
match patterns with a regular expression, and its earlier attempts don’t
pan out. See the section “The Little Engine That /Couldn(n’t)” in Camel
chapter 5, “Pattern Matching”.
- backward compatibility
-
Means you can still run your old program
because we didn’t break any of the features or bugs it was relying on.
- bareword
-
A word sufficiently ambiguous to be deemed illegal under
use strict 'subs'
. In the absence of that stricture, a bareword is
treated as if quotes were around it.
- base class
-
A generic object type; that is, a class
from which other, more specific classes are derived genetically by
inheritance. Also called a
“superclass” by people who respect their ancestors.
- big-endian
-
From Swift: someone who
eats eggs big end first. Also used of computers that store the most
significant byte of a word at a lower byte address than the least
significant byte. Often considered superior to little-endian machines. See
also little-endian.
- binary
-
Having to do with numbers represented in base 2. That means
there’s basically two numbers: 0 and 1. Also used to describe a file of
“nontext”, presumably because such a file makes full use of all the binary
bits in its bytes. With the advent of Unicode, this distinction, already
suspect, loses even more of its meaning.
- binary operator
-
An operator that takes two operands.
- bind
-
To assign a specific network address to a socket.
- bit
-
An integer in the range from 0 to 1, inclusive. The smallest
possible unit of information storage. An eighth of a byte or of a dollar.
(The term “Pieces of Eight” comes from being able to split the old Spanish
dollar into 8 bits, each of which still counted for money. That’s why a 25-
cent piece today is still “two bits”.)
- bit shift
-
The movement of bits left or right in a
computer word, which has the effect of multiplying or dividing by a
power of 2.
- bit string
-
A sequence of bits that is actually being thought of as a
sequence of bits, for once.
- bless
-
In corporate life, to grant official
approval to a thing, as in, “The VP of Engineering has blessed our
WebCruncher project.” Similarly, in Perl, to grant official approval to a
referent so that it can function as an object, such as a WebCruncher
object. See the
bless
function in Camel chapter 27, “Functions”.
- block
-
What a process does when it has to wait for something:
“My process blocked waiting for the disk.” As an unrelated noun, it refers
to a large chunk of data, of a size that the operating system likes to
deal with (normally a power of 2 such as 512 or 8192). Typically refers to
a chunk of data that’s coming from or going to a disk file.
- BLOCK
-
A syntactic construct
consisting of a sequence of Perl statements that is delimited by braces.
The
if
and while
statements are defined in terms of BLOCK
s, for
instance. Sometimes we also say “block” to mean a lexical scope; that is, a
sequence of statements that acts like a BLOCK
, such as within an
eval
or a file, even though the statements aren’t delimited by braces.
- block buffering
-
A method of making input and output
efficient by passing one block at a time. By default, Perl does block
buffering to disk files. See buffer and command buffering.
- Boolean
-
A value that is either true or
false.
- Boolean context
-
A special kind of scalar
context used in conditionals to decide whether the scalar value returned
by an expression is true or false. Does not evaluate as either a
string or a number. See context.
- breakpoint
-
A spot in your program where you’ve told the debugger
to stop execution so you can poke around and see whether anything is
wrong yet.
- broadcast
-
To send a datagram to multiple destinations
simultaneously.
- BSD
-
A psychoactive drug, popular in the ’80s, probably developed at UC
Berkeley or thereabouts. Similar in many ways to the prescription-only
medication called “System V”, but infinitely more useful. (Or, at least,
more fun.) The full chemical name is “Berkeley Standard Distribution”.
- bucket
-
A location in a hash table containing (potentially)
multiple entries whose keys “hash” to the same hash value according to its
hash function. (As internal policy, you don’t have to worry about it unless
you’re into internals, or policy.)
- buffer
-
A temporary holding location for data. Data that are
Block buffering means that the data is passed on to its destination
whenever the buffer is full. Line buffering means that it’s passed on
whenever a complete line is received. Command buffering means that it’s
passed every time you do a
print
command (or equivalent). If your output
is unbuffered, the system processes it one byte at a time without the use of
a holding area. This can be rather inefficient.
- built-in
-
A function that is predefined in the
language. Even when hidden by overriding, you can always get at a built-
in function by qualifying its name with the
CORE::
pseudopackage.
- bundle
-
A group of related modules on CPAN. (Also sometimes
refers to a group of command-line switches grouped into one switch
cluster.)
- byte
-
A piece of data worth eight bits in most places.
- bytecode
-
A pidgin-like lingo spoken among ’droids when they don’t wish to reveal
their orientation (see endian). Named after some similar languages spoken
(for similar reasons) between compilers and interpreters in the late 20ᵗʰ
century. These languages are characterized by representing everything as a
nonarchitecture-dependent sequence of bytes.
- C
-
A language beloved by many for its inside-out type
definitions, inscrutable precedence rules, and heavy overloading of
the function-call mechanism. (Well, actually, people first switched to C
because they found lowercase identifiers easier to read than upper.) Perl is
written in C, so it’s not surprising that Perl borrowed a few ideas from it.
- cache
-
A data repository. Instead of computing expensive answers
several times, compute it once and save the result.
- callback
-
A handler that you register with some other part of your
program in the hope that the other part of your program will trigger your
handler when some event of interest transpires.
- call by reference
-
An argument-passing mechanism in which the formal arguments refer directly to the
actual arguments, and the subroutine can change the actual arguments
by changing the formal arguments. That is, the formal argument is an
alias for the actual argument. See also call by value.
- call by value
-
An argument-passing mechanism in which the formal
arguments refer to a copy of the actual arguments, and the
subroutine cannot change the actual arguments by changing the formal
arguments. See also call by reference.
- canonical
-
Reduced to a standard form to facilitate comparison.
- capture variables
-
The variables—such as
$1
and
$2
, and %+
and %–
—that hold the text remembered in a pattern
match. See Camel chapter 5, “Pattern Matching”.
- capturing
-
The use of parentheses around a subpattern in a
regular expression to store the matched substring as a
backreference. (Captured strings are also returned as a list in list
context.) See Camel chapter 5, “Pattern Matching”.
- cargo cult
-
Copying and pasting code without understanding it, while
superstitiously believing in its value. This term originated from
preindustrial cultures dealing with the detritus of explorers and colonizers
of technologically advanced cultures. See The Gods Must Be Crazy.
- case
-
A property of certain
characters. Originally, typesetter stored capital letters in the upper of
two cases and small letters in the lower one. Unicode recognizes three
cases: lowercase (character property
\p{lower}
), titlecase
(\p{title}
), and uppercase (\p{upper}
). A fourth casemapping called
foldcase is not itself a distinct case, but it is used internally to
implement casefolding. Not all letters have case, and some nonletters
have case.
- casefolding
-
Comparing or matching a string case-insensitively. In Perl, it
is implemented with the
/i
pattern modifier, the fc
function, and the
\F
double-quote translation escape.
- casemapping
-
The process of converting a string to one of the four Unicode
casemaps; in Perl, it is implemented with the
fc
, lc
, ucfirst
,
and uc
functions.
- character
-
The smallest individual element of a string. Computers
store characters as integers, but Perl lets you operate on them as text. The
integer used to represent a particular character is called that character’s
codepoint.
- character class
-
A square-bracketed list of
characters used in a regular expression to indicate that any character
of the set may occur at a given point. Loosely, any predefined set of
characters so used.
- character property
-
A predefined character class matchable by the
\p
or \P
metasymbol. Unicode defines hundreds of standard properties
for every possible codepoint, and Perl defines a few of its own, too.
- circumfix operator
-
An operator that surrounds its operand, like the
angle operator, or parentheses, or a hug.
- class
-
A user-defined type, implemented in Perl via a
package that provides (either directly or by inheritance) methods
(that is, subroutines) to handle instances of the class (its
objects). See also inheritance.
- class method
-
A method whose invocant is a
package name, not an object reference. A method associated with the
class as a whole. Also see instance method.
- client
-
In networking, a process that
initiates contact with a server process in order to exchange data and
perhaps receive a service.
- closure
-
An anonymous subroutine
that, when a reference to it is generated at runtime, keeps track of the
identities of externally visible lexical variables, even after those
lexical variables have supposedly gone out of scope. They’re called
“closures” because this sort of behavior gives mathematicians a sense of
closure.
- cluster
-
A parenthesized subpattern
used to group parts of a regular expression into a single atom.
- CODE
-
The word returned by the
ref
function when you apply it to a reference to a subroutine. See also CV.
- code generator
-
A system that writes code for you in a low-level
language, such as code to implement the backend of a compiler. See program
generator.
- codepoint
-
The integer a computer uses to represent a given
character. ASCII codepoints are in the range 0 to 127; Unicode codepoints
are in the range 0 to 0x1F_FFFF; and Perl codepoints are in the range 0 to
2³²−1 or 0 to 2⁶⁴−1, depending on your native integer size. In Perl Culture,
sometimes called ordinals.
- code subpattern
-
A regular expression subpattern
whose real purpose is to execute some Perl code—for example, the
(?{...})
and (??{...})
subpatterns.
- collating sequence
-
The order into which characters
sort. This is used by string comparison routines to decide, for example,
where in this glossary to put “collating sequence”.
- co-maintainer
-
A person with permissions to index a namespace in
PAUSE. Anyone can upload any namespace, but only primary and
co-maintainers get their contributions indexed.
- combining character
-
Any character with the
General Category of Combining Mark (
\p{GC=M}
), which may be spacing or
nonspacing. Some are even invisible. A sequence of combining characters
following a grapheme base character together make up a single user-visible
character called a grapheme. Most but not all diacritics are combining
characters, and vice versa.
- command
-
In shell programming, the syntactic combination of a
program name and its arguments. More loosely, anything you type to a shell
(a command interpreter) that starts it doing something. Even more loosely, a
Perl statement, which might start with a label and typically ends with
a semicolon.
- command buffering
-
A mechanism in Perl that lets you
store up the output of each Perl command and then flush it out as a
single request to the operating system. It’s enabled by setting the
$|
($AUTOFLUSH
) variable to a true value. It’s used when you don’t want data
sitting around, not going where it’s supposed to, which may happen because
the default on a file or pipe is to use block buffering.
- command-line arguments
-
The values you supply
along with a program name when you tell a shell to execute a command.
These values are passed to a Perl program through
@ARGV
.
- command name
-
The name of the program currently executing, as typed on the
command line. In C, the command name is passed to the program as the
first command-line argument. In Perl, it comes in separately as
$0
.
- comment
-
A remark that doesn’t affect the meaning of the program.
In Perl, a comment is introduced by a
#
character and continues to the
end of the line.
- compilation unit
-
The file (or string, in the case of
eval
) that
is currently being compiled.
- compile
-
The process of turning source code into a machine-usable form. See compile
phase.
- compile phase
-
Any time before Perl starts running your main
program. See also run phase. Compile phase is mostly spent in compile
time, but may also be spent in runtime when
BEGIN
blocks, use
or
no
declarations, or constant subexpressions are being evaluated. The
startup and import code of any use
declaration is also run during
compile phase.
- compiler
-
Strictly speaking, a program that munches
up another program and spits out yet another file containing the program in
a “more executable” form, typically containing native machine instructions.
The perl program is not a compiler by this definition, but it does
contain a kind of compiler that takes a program and turns it into a more
executable form (syntax trees) within the perl process itself, which
the interpreter then interprets. There are, however, extension modules
to get Perl to act more like a “real” compiler. See Camel chapter 16,
“Compiling”.
- compile time
-
The time when Perl is trying to make sense of your
code, as opposed to when it thinks it knows what your code means and is
merely trying to do what it thinks your code says to do, which is runtime.
- composer
-
A “constructor” for a referent that isn’t really an
object, like an anonymous array or a hash (or a sonata, for that matter).
For example, a pair of braces acts as a composer for a hash, and a pair of
brackets acts as a composer for an array. See the section “Creating
References” in Camel chapter 8, “References”.
- concatenation
-
The process of gluing one
cat’s nose to another cat’s tail. Also a similar operation on two
strings.
- conditional
-
Something “iffy”. See Boolean context.
- connection
-
In telephony, the temporary electrical circuit between
the caller’s and the callee’s phone. In networking, the same kind of
temporary circuit between a client and a server.
- construct
-
As a noun, a piece of syntax made up of smaller
pieces. As a transitive verb, to create an object using a constructor.
- constructor
-
Any class method, instance, or subroutine
that composes, initializes, blesses, and returns an object. Sometimes we
use the term loosely to mean a composer.
- context
-
The surroundings or environment. The context given by the
surrounding code determines what kind of data a particular expression is
expected to return. The three primary contexts are list context,
scalar, and void context. Scalar context is sometimes subdivided into
Boolean context, numeric context, string context, and void
context. There’s also a “don’t care” context (which is dealt with in Camel
chapter 2, “Bits and Pieces”, if you care).
- continuation
-
The treatment of more than one physical line as a
single logical line. Makefile lines are continued by putting a backslash
before the newline. Mail headers, as defined by RFC 822, are
continued by putting a space or tab after the newline. In general, lines
in Perl do not need any form of continuation mark, because whitespace
(including newlines) is gleefully ignored. Usually.
- core dump
-
The corpse of a process, in the form of a file left in the
working directory of the process, usually as a result of certain kinds
of fatal errors.
- CPAN
-
The Comprehensive Perl Archive Network. (See the Camel Preface
and Camel chapter 19, “CPAN” for details.)
- C preprocessor
-
The typical C compiler’s first pass, which processes lines
beginning with
#
for conditional compilation and macro definition, and
does various manipulations of the program text based on the current
definitions. Also known as cpp(1).
- cracker
-
Someone who breaks security on computer systems. A cracker may
be a true hacker or only a script kiddie.
- currently selected output channel
-
The last filehandle that was
designated with
select(FILEHANDLE)
; STDOUT
, if no filehandle has
been selected.
- current package
-
The package in which the current statement is
compiled. Scan backward in the text of your program through the current
lexical scope or any enclosing lexical scopes until you find a package
declaration. That’s your current package name.
- current working directory
-
See working directory.
- CV
-
In academia, a curriculum vitæ, a fancy kind of résumé. In Perl, an internal “code value” typedef holding a
subroutine. The
CV
type is a subclass of SV.
- dangling statement
-
A bare, single statement,
without any braces, hanging off an
if
or while
conditional. C allows
them. Perl doesn’t.
- datagram
-
A packet of data, such as a UDP message, that (from
the viewpoint of the programs involved) can be sent independently over the
network. (In fact, all packets are sent independently at the IP level,
but stream protocols such as TCP hide this from your program.)
- data structure
-
How your various pieces of data relate to each
other and what shape they make when you put them all together, as in a
rectangular table or a triangular tree.
- data type
-
A set of possible values, together with all the
operations that know how to deal with those values. For example, a numeric
data type has a certain set of numbers that you can work with, as well as
various mathematical operations that you can do on the numbers, but would
make little sense on, say, a string such as
"Kilroy"
. Strings have their
own operations, such as concatenation. Compound types made of a number of
smaller pieces generally have operations to compose and decompose them, and
perhaps to rearrange them. Objects that model things in the real world
often have operations that correspond to real activities. For instance, if
you model an elevator, your elevator object might have an open_door
method.
- DBM
-
Stands for “Database Management” routines, a set of routines that emulate an
associative array using disk files. The routines use a dynamic hashing
scheme to locate any entry with only two disk accesses. DBM files allow a
Perl program to keep a persistent hash across multiple invocations. You
can
tie
your hash variables to various DBM implementations.
- declaration
-
An assertion that states something exists and
perhaps describes what it’s like, without giving any commitment as to how
or where you’ll use it. A declaration is like the part of your recipe that
says, “two cups flour, one large egg, four or five tadpoles…” See
statement for its opposite. Note that some declarations also function
as statements. Subroutine declarations also act as definitions if a body
is supplied.
- declarator
-
Something that tells your program what sort of variable
you’d like. Perl doesn’t require you to declare variables, but you can use
my
, our
, or state
to denote that you want something other than
the default.
- decrement
-
To subtract a value from a
variable, as in “decrement
$x
” (meaning to remove 1 from its value) or
“decrement $x
by 3”.
- default
-
A value chosen for you if you don’t
supply a value of your own.
- defined
-
Having a meaning. Perl thinks that some of the things
people try to do are devoid of meaning; in particular, making use of
variables that have never been given a value and performing certain
operations on data that isn’t there. For example, if you try to read data
past the end of a file, Perl will hand you back an undefined value. See also
false and the
defined
entry in Camel chapter 27, “Functions”.
- delimiter
-
A character or string that sets bounds to an
arbitrarily sized textual object, not to be confused with a separator or
terminator. “To delimit” really just means “to surround” or “to enclose”
(like these parentheses are doing).
- dereference
-
A fancy computer science term
meaning “to follow a reference to what it points to”. The “de” part of it
refers to the fact that you’re taking away one level of indirection.
- derived class
-
A class that defines some of its methods in terms of a more generic class,
called a base class. Note that classes aren’t classified exclusively into
base classes or derived classes: a class can function as both a derived
class and a base class simultaneously, which is kind of classy.
- descriptor
-
See file descriptor.
- destroy
-
To deallocate the memory of a referent (first triggering
its
DESTROY
method, if it has one).
- destructor
-
A special method that is called
when an object is thinking about destroying itself. A Perl program’s
DESTROY
method doesn’t do the actual destruction; Perl just triggers
the method in case the class wants to do any associated cleanup.
- device
-
A whiz-bang hardware gizmo (like a disk or tape drive or a
modem or a joystick or a mouse) attached to your computer, which the
operating system tries to make look like a file (or a bunch of files).
Under Unix, these fake files tend to live in the /dev directory.
- directive
-
A pod directive. See Camel chapter 23, “Plain Old
Documentation”.
- directory
-
A special file that contains other files. Some
operating systems call these “folders”, “drawers”, “catalogues”, or
“catalogs”.
- directory handle
-
A name that represents a particular instance of opening a
directory to read it, until you close it. See the
opendir
function.
- discipline
-
Some people need this and some people avoid it.
For Perl, it’s an old way to say I/O layer.
- dispatch
-
To send something to its correct destination. Often used
metaphorically to indicate a transfer of programmatic control to a
destination selected algorithmically, often by lookup in a table of function
references or, in the case of object methods, by traversing the
inheritance tree looking for the most specific definition for the method.
- distribution
-
A standard, bundled release of a system of
software. The default usage implies source code is included. If that is not
the case, it will be called a “binary-only” distribution.
- dual-lived
-
Some modules live both in the
Standard Library and on CPAN. These modules might be developed on two
tracks as people modify either version. The trend currently is to untangle
these situations.
- dweomer
-
An enchantment, illusion, phantasm, or jugglery. Said when Perl’s
magical dwimmer effects don’t do what you expect, but rather seem to be
the product of arcane dweomercraft, sorcery, or wonder working. [From
Middle English.]
- dwimmer
-
DWIM is
an acronym for “Do What I Mean”, the principle that something
should just do what you want it to do without an undue amount of fuss. A bit
of code that does “dwimming” is a “dwimmer”. Dwimming can require a great
deal of behind-the-scenes magic, which (if it doesn’t stay properly behind
the scenes) is called a dweomer instead.
- dynamic scoping
-
Dynamic scoping works over a dynamic
scope, making variables visible throughout the rest of the block in
which they are first used and in any subroutines that are called by the
rest of the block. Dynamically scoped variables can have their values
temporarily changed (and implicitly restored later) by a
local
operator.
(Compare lexical scoping.) Used more loosely to mean how a subroutine
that is in the middle of calling another subroutine “contains” that
subroutine at runtime.
- eclectic
-
Derived from many sources. Some would say too many.
- element
-
A basic building block. When you’re talking about an
array, it’s one of the items that make up the array.
- embedding
-
When something is contained in something else,
particularly when that might be considered surprising: “I’ve embedded a
complete Perl interpreter in my editor!”
- empty subclass test
-
The notion that an empty derived class should
behave exactly like its base class.
- encapsulation
-
The veil of abstraction separating the interface
from the implementation (whether enforced or not), which mandates that
all access to an object’s state be through methods alone.
- endian
-
See little-endian and big-endian.
- en passant
-
When you change a value as it is being copied. [From
French “in passing”, as in the exotic pawn-capturing maneuver in chess.]
- environment
-
The collective set of environment variables your
process inherits from its parent. Accessed via
%ENV
.
- environment variable
-
A mechanism by which some high-level agent such as a user can pass its
preferences down to its future offspring (child processes, grandchild
processes, great-grandchild processes, and so on). Each environment
variable is a key/value pair, like one entry in a hash.
- EOF
-
End of File. Sometimes used
metaphorically as the terminating string of a here document.
- errno
-
The error number returned by a
syscall when it fails. Perl refers to the error by the name
$!
(or
$OS_ERROR
if you use the English module).
- error
-
See exception or fatal error.
- escape sequence
-
See metasymbol.
- exception
-
A fancy term for an error. See fatal error.
- exception handling
-
The way a program responds to an error. The
exception-handling mechanism in Perl is the
eval
operator.
- exec
-
To throw away the current process’s program and replace
it with another, without exiting the process or relinquishing any resources
held (apart from the old memory image).
- executable file
-
A file that is specially marked to
tell the operating system that it’s okay to run this file as a program.
Usually shortened to “executable”.
- execute
-
To run a program or subroutine. (Has nothing to do
with the
kill
built-in, unless you’re trying to run a signal handler.)
- execute bit
-
The special mark that tells the operating system it can run
this program. There are actually three execute bits under Unix, and which
bit gets used depends on whether you own the file singularly, collectively,
or not at all.
- exit status
-
See status.
- exploit
-
Used as a noun in this case, this refers to a known way
to compromise a program to get it to do something the author didn’t intend.
Your task is to write unexploitable programs.
- export
-
To make symbols from a module available for
import by other modules.
- expression
-
Anything you can legally say in a spot
where a value is required. Typically composed of literals,
variables, operators, functions, and subroutine calls, not
necessarily in that order.
- extension
-
A Perl module that also pulls in compiled C or C++
code. More generally, any experimental option that can be compiled into
Perl, such as multithreading.
- false
-
In Perl, any value that would look like
""
or "0"
if evaluated in a string context. Since undefined values evaluate
to ""
, all undefined values are false, but not all false values are
undefined.
- FAQ
-
Frequently Asked Question (although not necessarily
frequently answered, especially if the answer appears in the Perl FAQ
shipped standard with Perl).
- fatal error
-
An uncaught exception, which causes termination of the
process after printing a message on your standard error stream. Errors
that happen inside an
eval
are not fatal. Instead, the eval
terminates
after placing the exception message in the $@
($EVAL_ERROR
) variable.
You can try to provoke a fatal error with the die
operator (known as
throwing or raising an exception), but this may be caught by a dynamically
enclosing eval
. If not caught, the die
becomes a fatal error.
- feeping creaturism
-
A spoonerism of “creeping
featurism”, noting the biological urge to add just one more feature to
a program.
- field
-
A single piece of numeric or string data that is part of a
longer string, record, or line. Variable-width fields are usually
split up by separators (so use
split
to extract the fields), while
fixed-width fields are usually at fixed positions (so use unpack
).
Instance variables are also known as “fields”.
- FIFO
-
First In, First Out. See also LIFO. Also a nickname for a named pipe.
- file
-
A named collection of data, usually stored on disk in a
directory in a filesystem. Roughly like a document, if you’re into
office metaphors. In modern filesystems, you can actually give a file more
than one name. Some files have special properties, like directories and
devices.
- file descriptor
-
The little number the operating
system uses to keep track of which opened file you’re talking about.
Perl hides the file descriptor inside a standard I/O stream and then
attaches the stream to a filehandle.
- fileglob
-
A “wildcard” match on filenames. See the
glob
function.
- filehandle
-
An identifier (not necessarily related to the real
name of a file) that represents a particular instance of opening a file,
until you close it. If you’re going to open and close several different
files in succession, it’s fine to open each of them with the same
filehandle, so you don’t have to write out separate code to process each
file.
- filename
-
One name for a file. This name is listed in a
directory. You can use it in an
open
to tell the operating system
exactly which file you want to open, and associate the file with a
filehandle, which will carry the subsequent identity of that file in
your program, until you close it.
- filesystem
-
A set of directories and files residing on a
partition of the disk. Sometimes known as a “partition”. You can change the
file’s name or even move a file around from directory to directory within a
filesystem without actually moving the file itself, at least under Unix.
- file test operator
-
A built-in unary operator that you use to
determine whether something is true about a file, such as
–o
$filename
to test whether you’re the owner of the file.
- filter
-
A program designed to take a stream of input and
transform it into a stream of output.
- first-come
-
The first PAUSE
author to upload a namespace automatically becomes the primary
maintainer for that namespace. The “first come” permissions distinguish a
primary maintainer who was assigned that role from one who received it
automatically.
- flag
-
We tend to avoid this term because it means so many things.
It may mean a command-line switch that takes no argument itself (such as
Perl’s
–n
and –p
flags) or, less frequently, a single-bit indicator
(such as the O_CREAT
and O_EXCL
flags used in sysopen
). Sometimes
informally used to refer to certain regex modifiers.
- floating point
-
A method of storing
numbers in “scientific notation”, such that the precision of the number is
independent of its magnitude (the decimal point “floats”). Perl does its
numeric work with floating-point numbers (sometimes called “floats”) when
it can’t get away with using integers. Floating-point numbers are mere
approximations of real numbers.
- flush
-
The act of emptying a buffer,
often before it’s full.
- FMTEYEWTK
-
Far More Than Everything You Ever Wanted To Know. An
exhaustive treatise on one narrow topic, something of a super-FAQ. See
Tom for far more.
- foldcase
-
The casemap used in Unicode when comparing or matching
without regard to case. Comparing lower-, title-, or uppercase are all
unreliable due to Unicode’s complex, one-to-many case mappings. Foldcase is
a lowercase variant (using a partially decomposed normalization form
for certain codepoints) created specifically to resolve this.
- fork
-
To create a child process
identical to the parent process at its moment of conception, at least until
it gets ideas of its own. A thread with protected memory.
- formal arguments
-
The generic names by which a
subroutine knows its arguments. In many languages, formal arguments
are always given individual names; in Perl, the formal arguments are just
the elements of an array. The formal arguments to a Perl program are
$ARGV[0]
, $ARGV[1]
, and so on. Similarly, the formal arguments to a
Perl subroutine are $_[0]
, $_[1]
, and so on. You may give the
arguments individual names by assigning the values to a my
list. See
also actual arguments.
- format
-
A specification of how many spaces and digits and things
to put somewhere so that whatever you’re printing comes out nice and
pretty.
- freely available
-
Means you don’t have to pay money to get it, but
the copyright on it may still belong to someone else (like Larry).
- freely redistributable
-
Means you’re not in legal trouble if you
give a bootleg copy of it to your friends and we find out about it. In
fact, we’d rather you gave a copy to all your friends.
- freeware
-
Historically, any software that you give away,
particularly if you make the source code available as well. Now often
called open source software. Recently there has been a trend to use the
term in contradistinction to open source software, to refer only to free
software released under the Free Software
Foundation’s GPL (General Public License), but this is difficult to justify
etymologically.
- function
-
Mathematically, a mapping of each of a set of input
values to a particular output value. In computers, refers to a
subroutine or operator that returns a value. It may or may not
have input values (called arguments).
- funny character
-
Someone like Larry, or one of his
peculiar friends. Also refers to the strange prefixes that Perl requires as
noun markers on its variables.
- garbage collection
-
A misnamed feature—it should be called,
“expecting your mother to pick up after you”. Strictly speaking, Perl
doesn’t do this, but it relies on a reference-counting mechanism to keep
things tidy. However, we rarely speak strictly and will often refer to the
reference-counting scheme as a form of garbage collection. (If it’s any
comfort, when your interpreter exits, a “real” garbage collector runs to
make sure everything is cleaned up if you’ve been messy with circular
references and such.)
- GID
-
Group ID—in Unix, the numeric group ID
that the operating system uses to identify you and members of your
group.
- glob
-
Strictly, the shell’s
*
character, which will match
a “glob” of characters when you’re trying to generate a list of filenames.
Loosely, the act of using globs and similar symbols to do pattern matching.
See also fileglob and typeglob.
- global
-
Something you can see from anywhere, usually used of
variables and subroutines that are visible everywhere in your
program. In Perl, only certain special variables are truly global—most
variables (and all subroutines) exist only in the current package.
Global variables can be declared with
our
. See “Global Declarations” in
Camel chapter 4, “Statements and Declarations”.
- global destruction
-
The garbage collection of globals (and the running
of any associated object destructors) that takes place when a Perl
interpreter is being shut down. Global destruction should not be
confused with the Apocalypse, except perhaps when it should.
- glue language
-
A language such as Perl that is good at hooking things
together that weren’t intended to be hooked together.
- granularity
-
The size of the pieces you’re dealing with, mentally
speaking.
- grapheme
-
A graphene is an allotrope of carbon arranged in a
hexagonal crystal lattice one atom thick. A grapheme, or more fully, a
grapheme cluster string is a single user-visible character, which may
in turn be several characters (codepoints) long. For example, a carriage
return plus a line feed is a single grapheme but two characters, while a
“ȫ” is a single grapheme but one, two, or even three characters, depending
on normalization.
- greedy
-
A subpattern whose
quantifier wants to match as many things as possible.
- grep
-
Originally from the old Unix editor command for “Globally
search for a Regular Expression and Print it”, now used in the general
sense of any kind of search, especially text searches. Perl has a built-in
grep
function that searches a list for elements matching any given
criterion, whereas the grep(1) program searches for lines matching a
regular expression in one or more files.
- group
-
A set of users of which you are a member. In some
operating systems (like Unix), you can give certain file access permissions
to other members of your group.
- GV
-
An internal “glob value” typedef,
holding a typeglob. The
GV
type is a subclass of SV.
- hacker
-
Someone who is brilliantly persistent in solving technical
problems, whether these involve golfing, fighting orcs, or programming.
Hacker is a neutral term, morally speaking. Good hackers are not to be
confused with evil crackers or clueless script kiddies. If you
confuse them, we will presume that you are either evil or clueless.
- handler
-
A subroutine or method that Perl calls when your
program needs to respond to some internal event, such as a signal, or an
encounter with an operator subject to operator overloading. See also
callback.
- hard reference
-
A scalar value containing
the actual address of a referent, such that the referent’s reference
count accounts for it. (Some hard references are held internally, such as
the implicit reference from one of a typeglob’s variable slots to its
corresponding referent.) A hard reference is different from a symbolic
reference.
- hash
-
An unordered association of key/value pairs, stored such that you can easily use a string key to
look up its associated data value. This glossary is like a hash, where
the word to be defined is the key and the definition is the value. A hash
is also sometimes septisyllabically called an “associative array”, which is
a pretty good reason for simply calling it a “hash” instead.
- hash table
-
A data structure used internally by Perl for implementing
associative arrays (hashes) efficiently. See also bucket.
- header file
-
A file containing certain required
definitions that you must include “ahead” of the rest of your program to do
certain obscure operations. A C header file has a .h extension. Perl
doesn’t really have header files, though historically Perl has sometimes
used translated .h files with a .ph extension. See
require
in
Camel chapter 27, “Functions”. (Header files have been superseded by the
module mechanism.)
- here document
-
So called because of a similar construct in shells that
pretends that the lines following the command are a separate file
to be fed to the command, up to some terminating string. In Perl, however,
it’s just a fancy form of quoting.
- hexadecimal
-
A number in base 16, “hex” for short. The digits for 10
through 15 are customarily represented by the letters
a
through f
.
Hexadecimal constants in Perl start with 0x
. See also the hex
function in Camel chapter 27, “Functions”.
- home directory
-
The directory you are put into when
you log in. On a Unix system, the name is often placed into
$ENV{HOME}
or $ENV{LOGDIR}
by login, but you can also find it with
(get
pwuid($<))[7]
. (Some platforms do not have a concept of a
home directory.)
- host
-
The computer on which a program or other data resides.
- hubris
-
Excessive pride, the sort of thing for which Zeus zaps
you. Also the quality that makes you write (and maintain) programs that
other people won’t want to say bad things about. Hence, the third great
virtue of a programmer. See also laziness and impatience.
- HV
-
Short for a “hash value” typedef, which
holds Perl’s internal representation of a hash. The
HV
type is a
subclass of SV.
- identifier
-
A legally formed name for most anything in which a
computer program might be interested. Many languages (including Perl) allow
identifiers to start with an alphabetic character, and then contain
alphabetics and digits. Perl also allows connector punctuation like the
underscore character wherever it allows alphabetics. (Perl also has more
complicated names, like qualified names.)
- impatience
-
The anger you feel when the computer is being lazy.
This makes you write programs that don’t just react to your needs, but
actually anticipate them. Or at least that pretend to. Hence, the second
great virtue of a programmer. See also laziness and hubris.
- implementation
-
How a piece of code actually goes about doing its
job. Users of the code should not count on implementation details staying
the same unless they are part of the published interface.
- import
-
To gain access to symbols that are exported from another
module. See
use
in Camel chapter 27, “Functions”.
- increment
-
To increase the value of
something by 1 (or by some other number, if so specified).
- indexing
-
In olden days, the act of looking up a key in an
actual index (such as a phone book). But now it's merely the act of using
any kind of key or position to find the corresponding value, even if no
index is involved. Things have degenerated to the point that Perl’s
index
function merely locates the position (index) of one string in
another.
- indirect filehandle
-
An expression that
evaluates to something that can be used as a filehandle: a string
(filehandle name), a typeglob, a typeglob reference, or a low-level
IO object.
- indirection
-
If something in a program isn’t the value you’re
looking for but indicates where the value is, that’s indirection. This can
be done with either symbolic references or hard.
- indirect object
-
In English grammar, a short
noun phrase between a verb and its direct object indicating the beneficiary
or recipient of the action. In Perl,
print STDOUT "$foo\n";
can be
understood as “verb indirect-object object”, where STDOUT
is the
recipient of the print
action, and "$foo"
is the object being
printed. Similarly, when invoking a method, you might place the
invocant in the dative slot between the method and its arguments:
-
$gollum = new Pathetic::Creature "Sméagol";
give $gollum "Fisssssh!";
give $gollum "Precious!";
- indirect object slot
-
The syntactic position falling between a method call
and its arguments when using the indirect object invocation syntax. (The
slot is distinguished by the absence of a comma between it and the next
argument.)
STDERR
is in the indirect object slot here:
-
print STDERR "Awake! Awake! Fear, Fire, Foes! Awake!\n";
- infix
-
An operator that comes in between its operands,
such as multiplication in
24 * 7
.
- inheritance
-
What you get from your ancestors, genetically or
otherwise. If you happen to be a class, your ancestors are called base
classes and your descendants are called derived classes. See single
inheritance and multiple inheritance.
- instance
-
Short for “an instance of a class”, meaning an object
of that class.
- instance data
-
See instance variable.
- instance method
-
A method of an object, as
opposed to a class method.
-
A method whose invocant is an object, not a package name. Every
object of a class shares all the methods of that class, so an instance
method applies to all instances of the class, rather than applying to a
particular instance. Also see class method.
- instance variable
-
An attribute of an object; data stored with the particular object rather than with the class
as a whole.
- integer
-
A number with no fractional (decimal) part. A counting
number, like 1, 2, 3, and so on, but including 0 and the negatives.
- interface
-
The services a piece of code promises to provide
forever, in contrast to its implementation, which it should feel free to
change whenever it likes.
- interpolation
-
The insertion of a scalar or list value somewhere
in the middle of another value, such that it appears to have been there all
along. In Perl, variable interpolation happens in double-quoted strings and
patterns, and list interpolation occurs when constructing the list of
values to pass to a list operator or other such construct that takes a
LIST
.
- interpreter
-
Strictly speaking, a program that reads a second
program and does what the second program says directly without turning the
program into a different form first, which is what compilers do. Perl is
not an interpreter by this definition, because it contains a kind of
compiler that takes a program and turns it into a more executable form
(syntax trees) within the perl process itself, which the Perl
runtime system then interprets.
- invocant
-
The agent on whose behalf a method is invoked. In a
class method, the invocant is a package name. In an instance method,
the invocant is an object reference.
- invocation
-
The act of calling up a deity, daemon, program,
method, subroutine, or function to get it to do what you think it’s
supposed to do. We usually “call” subroutines but “invoke” methods, since
it sounds cooler.
- I/O
-
Input from, or output to, a file or device.
- IO
-
An internal I/O object. Can also mean indirect object.
- I/O layer
-
One of the filters between the data and what you get as input
or what you end up with as output.
- IPA
-
India Pale Ale. Also the International Phonetic Alphabet, the
standard alphabet used for phonetic notation worldwide. Draws heavily on
Unicode, including many combining characters.
- IP
-
Internet Protocol, or
Intellectual
Property.
- IPC
-
Interprocess Communication.
- is-a
-
A relationship between two objects in which one
object is considered to be a more specific version of the other, generic
object: “A camel is a mammal.” Since the generic object really only exists
in a Platonic sense, we usually add a little abstraction to the notion of
objects and think of the relationship as being between a generic base
class and a specific derived class. Oddly enough, Platonic classes
don’t always have Platonic relationships—see inheritance.
- iteration
-
Doing something repeatedly.
- iterator
-
A special programming gizmo that keeps track of where you are
in something that you’re trying to iterate over. The
foreach
loop in
Perl contains an iterator; so does a hash, allowing you to each
through
it.
- IV
-
The integer four, not to be
confused with six, Tom’s favorite editor. IV also means an internal Integer
Value of the type a scalar can hold, not to be confused with an NV.
- JAPH
-
“Just Another Perl Hacker”, a clever but cryptic bit of Perl
code that, when executed, evaluates to that string. Often used to
illustrate a particular Perl feature, and something of an ongoing
Obfuscated Perl Contest seen in USENET signatures.
- key
-
The string index to a hash, used to look up the value
associated with that key.
- keyword
-
See reserved words.
- label
-
A name you give to a statement so that you can talk
about that statement elsewhere in the program.
- laziness
-
The quality that makes you go to great effort to reduce
overall energy expenditure. It makes you write labor-saving programs that
other people will find useful, and then document what you wrote so you
don’t have to answer so many questions about it. Hence, the first great
virtue of a programmer. Also hence, this book. See also impatience and
hubris.
- leftmost longest
-
The preference of the regular expression engine to match the
leftmost occurrence of a pattern, then given a position at which a match
will occur, the preference for the longest match (presuming the use of a
greedy quantifier). See Camel chapter 5, “Pattern Matching” for much
more on this subject.
- left shift
-
A bit shift that <) bit operator>< (left shift) bit operator>multiplies the
number by some power of 2.
- lexeme
-
Fancy term for a token.
- lexer
-
Fancy term for a tokener.
- lexical analysis
-
Fancy term for tokenizing.
- lexical scoping
-
Looking at your Oxford English
Dictionary through a microscope. (Also known as static scoping, because
dictionaries don’t change very fast.) Similarly, looking at variables
stored in a private dictionary (namespace) for each scope, which are
visible only from their point of declaration down to the end of the lexical scope in which they are declared. —Syn.
static scoping. —Ant. dynamic scoping.
- lexical variable
-
A variable subject to
lexical scoping, declared by
my
. Often just called a “lexical”. (The
our
declaration declares a lexically scoped name for a global variable,
which is not itself a lexical variable.)
- library
-
Generally, a collection of procedures. In ancient
days, referred to a collection of subroutines in a .pl file. In modern
times, refers more often to the entire collection of Perl modules on
your system.
- LIFO
-
Last In, First Out. See also FIFO. A LIFO is usually called a
stack.
- line
-
In Unix, a sequence of zero or more nonnewline characters
terminated with a newline character. On non-Unix machines, this is
emulated by the C library even if the underlying operating system has
different ideas.
- linebreak
-
A grapheme consisting of either a carriage return followed
by a line feed or any character with the Unicode Vertical Space character
property.
- line buffering
-
Used by a standard I/O output stream that
flushes its buffer after every newline. Many standard I/O libraries
automatically set up line buffering on output that is going to the terminal.
- line number
-
The number of lines read previous to this one, plus 1. Perl
keeps a separate line number for each source or input file it opens. The
current source file’s line number is represented by
__LINE__
. The
current input line number (for the file that was most recently read via
<FH>
) is represented by the $.
($INPUT_LINE_NUMBER
)
variable. Many error messages report both values, if available.
- link
-
Used as a noun, a name in a directory that represents a
file. A given file can have multiple links to it. It’s like having the
same phone number listed in the phone directory under different names. As a
verb, to resolve a partially compiled file’s unresolved symbols into a
(nearly) executable image. Linking can generally be static or dynamic,
which has nothing to do with static or dynamic scoping.
- LIST
-
A syntactic construct representing a
comma- separated list of expressions, evaluated to produce a list value.
Each expression in a
LIST
is evaluated in list context and
interpolated into the list value.
- list
-
An ordered set of scalar values.
- list context
-
The situation in which an expression is
expected by its surroundings (the code calling it) to return a list of
values rather than a single value. Functions that want a
LIST
of
arguments tell those arguments that they should produce a list value. See
also context.
- list operator
-
An operator that does something with a list of
values, such as
join
or grep
. Usually used for named built-in
operators (such as print
, unlink
, and system
) that do not require
parentheses around their argument list.
- list value
-
An unnamed list of temporary scalar
values that may be passed around within a program from any list-generating
function to any function or construct that provides a list context.
- literal
-
A token in a programming language, such as a number or
string, that gives you an actual value instead of merely representing
possible values as a variable does.
- little-endian
-
From Swift: someone
who eats eggs little end first. Also used of computers that store the least
significant byte of a word at a lower byte address than the most
significant byte. Often considered superior to big-endian machines. See
also big-endian.
- local
-
Not meaning the same thing everywhere. A global
variable in Perl can be localized inside a dynamic scope via the
local
operator.
- logical operator
-
Symbols representing the concepts “and”, “or”,
“xor”, and “not”.
- lookahead
-
An assertion that peeks at the string to the right of the current match location.
- lookbehind
-
An assertion that peeks at the string to the left of the current match
location.
- loop
-
A construct that
performs something repeatedly, like a roller coaster.
- loop control statement
-
Any statement within the body of a loop that can
make a loop prematurely stop looping or skip an iteration. Generally,
you shouldn’t try this on roller coasters.
- loop label
-
A kind of key or name attached to a loop (or
roller coaster) so that loop control statements can talk about which loop
they want to control.
- lowercase
-
In Unicode, not just
characters with the General Category of Lowercase Letter, but any character
with the Lowercase property, including Modifier Letters, Letter Numbers,
some Other Symbols, and one Combining Mark.
- lvaluable
-
Able to serve as an lvalue.
- lvalue
-
Term used by language lawyers for a
storage location you can assign a new value to, such as a variable or
an element of an array. The “l” is short for “left”, as in the left side
of an assignment, a typical place for lvalues. An lvaluable function or
expression is one to which a value may be assigned, as in
pos($x) = 10
.
- lvalue modifier
-
An adjectival pseudofunction that
warps the meaning of an lvalue in some declarative fashion. Currently
there are three lvalue modifiers:
my
, our
, and local
.
- magic
-
Technically speaking, any extra semantics attached to a
variable such as
$!
, $0
, %ENV
, or %SIG
, or to any tied
variable. Magical things happen when you diddle those variables.
- magical increment
-
An increment operator that knows how to
bump up ASCII alphabetics as well as numbers.
- magical variables
-
Special variables that have side
effects when you access them or assign to them. For example, in Perl,
changing elements of the
%ENV
array also changes the corresponding
environment variables that subprocesses will use. Reading the $!
variable gives you the current system error number or message.
- Makefile
-
A file that controls the compilation of a program. Perl programs
don’t usually need a Makefile because the Perl compiler has plenty of
self-control.
- man
-
The Unix program that displays online documentation
(manual pages) for you.
- manpage
-
A “page” from the manuals, typically accessed via the
man(1) command. A manpage contains a SYNOPSIS, a DESCRIPTION, a list of
BUGS, and so on, and is typically longer than a page. There are manpages
documenting commands, syscalls, library functions, devices,
protocols, files, and such. In this book, we call any piece of
standard Perl documentation (like the perlop manpage or the perldelta manpage) a manpage, no
matter what format it’s installed in on your system.
- matching
-
See pattern matching.
- member data
-
See instance variable.
- memory
-
This always means your main memory, not your disk.
Clouding the issue is the fact that your machine may implement
virtual memory; that is, it will pretend that it has more memory than
it really does, and it’ll use disk space to hold inactive bits. This can
make it seem like you have a little more memory than you really do, but
it’s not a substitute for real memory. The best thing that can be said
about virtual memory is that it lets your performance degrade gradually
rather than suddenly when you run out of real memory. But your program
can die when you run out of virtual memory, too—if you haven’t thrashed
your disk to death first.
- metacharacter
-
A character that is not supposed to be treated normally. Which characters
are to be treated specially as metacharacters varies greatly from context to
context. Your shell will have certain metacharacters, double-quoted Perl
strings have other metacharacters,
and regular expression patterns have all the double-quote metacharacters plus
some extra ones of their own.
- metasymbol
-
Something we’d call a
metacharacter except that it’s a sequence of more than one character.
Generally, the first character in the sequence must be a true metacharacter
to get the other characters in the metasymbol to misbehave along with it.
- method
-
A kind of action that an object can take if you tell
it to. See Camel chapter 12, “Objects”.
- method resolution order
-
The path Perl takes through
@INC
. By default, this is a double depth first
search, once looking for defined methods and once for AUTOLOAD
. However,
Perl lets you configure this with mro
.
- minicpan
-
A CPAN mirror that includes just the latest versions for each
distribution, probably created with
CPAN::Mini
. See
Camel chapter 19, “CPAN”.
- minimalism
-
The belief that “small is beautiful”. Paradoxically, if you
say something in a small language, it turns out big, and if you say it in a
big language, it turns out small. Go figure.
- mode
-
In the context of the stat(2) syscall, refers to the field
holding the permission bits and the type of the file.
- modifier
-
See statement modifier, regular expression, and
lvalue, not necessarily in that order.
- module
-
A file that defines a package of (almost) the same
name, which can either export symbols or function as an object class.
(A module’s main .pm file may also load in other files in support of the
module.) See the
use
built-in.
- modulus
-
An integer divisor when
you’re interested in the remainder instead of the quotient.
- mojibake
-
When you speak one language and the computer thinks you’re
speaking another. You’ll see odd translations when you send UTF‑8, for
instance, but the computer thinks you sent Latin-1, showing all sorts of
weird characters instead. The term is written 「文字化け」in Japanese and
means “character rot”, an apt description. Pronounced [
modʑibake
] in
standard IPA phonetics, or approximately “moh-jee-bah-keh”.
- monger
-
Short for one member of Perl mongers, a
purveyor of Perl.
- mortal
-
A temporary value scheduled to die when the
current statement finishes.
- mro
-
See method resolution order.
- multidimensional array
-
An array with multiple
subscripts for finding a single element. Perl implements these using
references—see Camel chapter 9, “Data Structures”.
- multiple inheritance
-
The features you got from
your mother and father, mixed together unpredictably. (See also
inheritance and single inheritance.) In computer languages (including
Perl), it is the notion that a given class may have multiple direct
ancestors or base classes.
- named pipe
-
A pipe with a name embedded in the
filesystem so that it can be accessed by two unrelated processes.
- namespace
-
A domain of names. You needn’t worry about whether the
names in one such domain have been used in another. See package.
- NaN
-
Not a number. The value Perl uses
for certain invalid or inexpressible floating-point operations.
- network address
-
The most important attribute of a socket, like your
telephone’s telephone number. Typically an IP address. See also port.
- newline
-
A single character that
represents the end of a line, with the ASCII value of 012 octal under Unix
(but 015 on a Mac), and represented by
\n
in Perl strings. For Windows
machines writing text files, and for certain physical devices like
terminals, the single newline gets automatically translated by your C
library into a line feed and a carriage return, but normally, no
translation is done.
- NFS
-
Network File System, which allows you to mount a remote filesystem as if it were local.
- normalization
-
Converting a text string into an alternate but equivalent
canonical (or compatible) representation that can then be compared for
equivalence. Unicode recognizes four different normalization forms: NFD,
NFC, NFKD, and NFKC.
- null character
-
A character with the numeric value of
zero. It’s used by C to terminate strings, but Perl allows strings to
contain a null.
- null list
-
A list value with zero elements, represented
in Perl by
()
.
- null string
-
A string containing no characters, not to
be confused with a string containing a null character, which has a
positive length and is true.
- numeric context
-
The situation in which an expression
is expected by its surroundings (the code calling it) to return a number.
See also context and string context.
- numification
-
(Sometimes spelled nummification and nummify.) Perl lingo
for implicit conversion into a number; the related verb is numify.
Numification is intended to rhyme with mummification, and numify with
mummify. It is unrelated to English numen, numina, numinous. We
originally forgot the extra m a long time ago, and some people got used to
our funny spelling, and so just as with
HTTP_REFERER
’s own missing letter,
our weird spelling has stuck around.
- NV
-
Short for Nevada, no part of
which will ever be confused with civilization. NV also means an internal
floating- point Numeric Value of the type a scalar can hold, not to be
confused with an IV.
- nybble
-
Half a byte, equivalent to one hexadecimal digit, and worth
four bits.
- object
-
An instance of a class. Something that “knows”
what user-defined type (class) it is, and what it can do because of what
class it is. Your program can request an object to do things, but the
object gets to decide whether it wants to do them or not. Some objects are
more accommodating than others.
- octal
-
A number in base 8. Only the digits 0 through 7 are allowed. Octal
constants in Perl start with 0, as in 013. See also the
oct
function.
- offset
-
How many things you have to skip
over when moving from the beginning of a string or array to a specific
position within it. Thus, the minimum offset is zero, not one, because you
don’t skip anything to get to the first item.
- one-liner
-
An entire computer program crammed into one line of
text.
- open source software
-
Programs for which the source code is freely
available and freely redistributable, with no commercial strings attached.
For a more detailed definition, see http://www.opensource.org/osd.html.
- operand
-
An expression that yields a value that an
operator operates on. See also precedence.
- operating system
-
A special program that runs on the bare
machine and hides the gory details of managing processes and devices.
Usually used in a looser sense to indicate a particular culture of
programming. The loose sense can be used at varying levels of specificity.
At one extreme, you might say that all versions of Unix and Unix-lookalikes
are the same operating system (upsetting many people, especially lawyers
and other advocates). At the other extreme, you could say this particular
version of this particular vendor’s operating system is different from any
other version of this or any other vendor’s operating system. Perl is much
more portable across operating systems than many other languages. See also
architecture and platform.
- operator
-
A gizmo that transforms some number of input values to
some number of output values, often built into a language with a special
syntax or symbol. A given operator may have specific expectations about
what types of data you give as its arguments (operands) and what type
of data you want back from it.
- operator overloading
-
A kind of
overloading that you can do on built-in operators to make them work
on objects as if the objects were ordinary scalar values, but with the
actual semantics supplied by the object class. This is set up with the
overload pragma—see Camel chapter 13, “Overloading”.
- options
-
See either switches or regular expression modifiers.
- ordinal
-
An abstract character’s integer value. Same thing as
codepoint.
- overloading
-
Giving additional meanings to a symbol or construct.
Actually, all languages do overloading to one extent or another, since
people are good at figuring out things from context.
- overriding
-
Hiding or invalidating some other definition of the
same name. (Not to be confused with overloading, which adds definitions
that must be disambiguated some other way.) To confuse the issue further,
we use the word with two overloaded definitions: to describe how you can
define your own subroutine to hide a built-in function of the same
name (see the section “Overriding Built-in Functions” in Camel chapter 11,
“Modules”), and to describe how you can define a replacement method in a
derived class to hide a base class’s method of the same name (see
Camel chapter 12, “Objects”).
- owner
-
The one user (apart from the
superuser) who has absolute control over a file. A file may also have a
group of users who may exercise joint ownership if the real owner
permits it. See permission bits.
- package
-
A namespace for global variables, subroutines,
and the like, such that they can be kept separate from like-named
symbols in other namespaces. In a sense, only the package is global,
since the symbols in the package’s symbol table are only accessible from
code compiled outside the package by naming the package. But in another
sense, all package symbols are also globals—they’re just well-organized
globals.
- pad
-
Short for scratchpad.
- parameter
-
See argument.
- parent class
-
See base class.
- parse tree
-
See syntax tree.
- parsing
-
The subtle but sometimes brutal art of attempting to turn
your possibly malformed program into a valid syntax tree.
- patch
-
To fix by applying one, as it were. In the realm of hackerdom, a
listing of the differences between two versions of a program as might be
applied by the patch(1) program when you want to fix a bug or upgrade
your old version.
- PATH
-
The list of
directories the system searches to find a program you want to
execute. The list is stored as one of your environment variables,
accessible in Perl as
$ENV{PATH}
.
- pathname
-
A fully qualified filename such as /usr/bin/perl. Sometimes
confused with
PATH
.
- pattern
-
A template used in pattern matching.
- pattern matching
-
Taking a pattern, usually a regular
expression, and trying the pattern various ways on a string to see whether
there’s any way to make it fit. Often used to pick interesting tidbits out
of a file.
- PAUSE
-
The Perl Authors Upload SErver (http://pause.perl.org), the gateway
for modules on their way to CPAN.
- Perl mongers
-
A Perl user group, taking the form of its
name from the New York Perl mongers, the first Perl user group. Find one
near you at http://www.pm.org.
- permission bits
-
Bits that the owner of a file sets
or unsets to allow or disallow access to other people. These flag bits are
part of the mode word returned by the
stat
built-in when you ask
about a file. On Unix systems, you can check the ls(1) manpage for more
information.
- Pern
-
What you get when you do
Perl++
twice. Doing it only once
will curl your hair. You have to increment it eight times to shampoo your
hair. Lather, rinse, iterate.
- pipe
-
A direct connection that carries the output of one
process to the input of another without an intermediate temporary file.
Once the pipe is set up, the two processes in question can read and write
as if they were talking to a normal file, with some caveats.
- pipeline
-
A series of processes all in a row, linked by pipes, where
each passes its output stream to the next.
- platform
-
The entire hardware and software context in which a
program runs. A program written in a platform-dependent language might
break if you change any of the following: machine, operating system,
libraries, compiler, or system configuration. The perl interpreter has
to be compiled differently for each platform because it is implemented
in C, but programs written in the Perl language are largely platform
independent.
- pod
-
The markup
used to embed documentation into your Perl code. Pod stands for “Plain old
documentation”. See Camel chapter 23, “Plain Old Documentation”.
- pod command
-
A sequence, such as
=head1
, that denotes
the start of a pod section.
- pointer
-
A variable in a language like C that contains the exact
memory location of some other item. Perl handles pointers internally so you
don’t have to worry about them. Instead, you just use symbolic pointers in
the form of keys and variable names, or hard references, which
aren’t pointers (but act like pointers and do in fact contain pointers).
- polymorphism
-
The notion that you can tell an object to do something
generic, and the object will interpret the command in different ways
depending on its type. [< Greek πολυ- + μορϕή, many forms.]
- port
-
The part of the address of a TCP or UDP socket that directs
packets to the correct process after finding the right machine, something
like the phone extension you give when you reach the company operator. Also
the result of converting code to run on a different platform than
originally intended, or the verb denoting this conversion.
- portable
-
Once upon a time, C code compilable under both BSD and
SysV. In general, code that can be easily converted to run on another
platform, where “easily” can be defined however you like, and usually
is. Anything may be considered portable if you try hard enough, such as a
mobile home or London Bridge.
- porter
-
Someone who “carries” software from one platform to another.
Porting programs written in platform-dependent languages such as C can be
difficult work, but porting programs like Perl is very much worth the
agony.
- possessive
-
Said of quantifiers and groups in patterns that refuse
to give up anything once they’ve gotten their mitts on it. Catchier and
easier to say than the even more formal nonbacktrackable.
- POSIX
-
The Portable Operating System Interface
specification.
- postfix
-
An operator that follows its operand, as in
$x++
.
- pp
-
An internal shorthand for a
“push- pop” code; that is, C code implementing Perl’s stack machine.
- pragma
-
A standard module whose practical hints and
suggestions are received (and possibly ignored) at compile time. Pragmas
are named in all lowercase.
- precedence
-
The rules of
conduct that, in the absence of other guidance, determine what should
happen first. For example, in the absence of parentheses, you always do
multiplication before addition.
- prefix
-
An operator that precedes its operand, as in
++$x
.
- preprocessing
-
What some helper process did to transform the incoming
data into a form more suitable for the current process. Often done with an
incoming pipe. See also C preprocessor.
- primary maintainer
-
The author that PAUSE allows to assign co-maintainer
permissions to a namespace. A primary maintainer can give up this
distinction by assigning it to another PAUSE author. See Camel chapter 19,
“CPAN”.
- procedure
-
A subroutine.
- process
-
An instance of a running program. Under multitasking
systems like Unix, two or more separate processes could be running the same
program independently at the same time—in fact, the
fork
function is
designed to bring about this happy state of affairs. Under other operating
systems, processes are sometimes called “threads”, “tasks”, or “jobs”,
often with slight nuances in meaning.
- program
-
See script.
- program generator
-
A system that algorithmically writes code for you in a
high-level language. See also code generator.
- progressive matching
-
Pattern matching matching>that picks up where it left off before.
- property
-
See either instance variable or character property.
- protocol
-
In networking, an agreed-upon way of sending messages
back and forth so that neither correspondent will get too confused.
- prototype
-
An optional part of a subroutine declaration telling
the Perl compiler how many and what flavor of arguments may be passed as
actual arguments, so you can write subroutine calls that parse much like
built-in functions. (Or don’t parse, as the case may be.)
- pseudofunction
-
A construct that sometimes looks like a function but really
isn’t. Usually reserved for lvalue modifiers like
my
, for context
modifiers like scalar
, and for the pick-your-own-quotes constructs,
q//
, qq//
, qx//
, qw//
, qr//
, m//
, s///
, y///
, and
tr///
.
- pseudohash
-
Formerly, a reference to an array
whose initial element happens to hold a reference to a hash. You used to be
able to treat a pseudohash reference as either an array reference or a hash
reference. Pseduohashes are no longer supported.
- pseudoliteral
-
An operator X
that looks something like a literal,
such as the output-grabbing operator, <literal
moreinfo="none"
`>command
`
.
- public domain
-
Something not owned by anybody. Perl is copyrighted and is
thus not in the public domain—it’s just freely available and freely
redistributable.
- pumpkin
-
A notional “baton” handed around the Perl community
indicating who is the lead integrator in some arena of development.
- pumpking
-
A pumpkin holder, the person in charge of pumping the pump,
or at least priming it. Must be willing to play the part of the Great
Pumpkin now and then.
- PV
-
A “pointer value”, which is Perl
Internals Talk for a
char*
.
- qualified
-
Possessing a complete name. The symbol
$Ent::moot
is
qualified; $moot
is unqualified. A fully qualified filename is specified
from the top-level directory.
- quantifier
-
A component of a regular expression specifying how
many times the foregoing atom may occur.
- race condition
-
A race condition exists when the result of
several interrelated events depends on the ordering of those events, but
that order cannot be guaranteed due to nondeterministic timing effects. If
two or more programs, or parts of the same program, try to go through the
same series of events, one might interrupt the work of the other. This is a
good way to find an exploit.
- readable
-
With respect to files, one that has the proper permission
bit set to let you access the file. With respect to computer programs, one
that’s written well enough that someone has a chance of figuring out what
it’s trying to do.
- reaping
-
The last rites performed by a parent process
on behalf of a deceased child process so that it doesn’t remain a
zombie. See the
wait
and waitpid
function calls.
- record
-
A set of related data values in a file or stream,
often associated with a unique key field. In Unix, often commensurate
with a line, or a blank-line–terminated set of lines (a “paragraph”).
Each line of the /etc/passwd file is a record, keyed on login name,
containing information about that user.
- recursion
-
The art of defining something (at least partly) in
terms of itself, which is a naughty no-no in dictionaries but often works
out okay in computer programs if you’re careful not to recurse forever
(which is like an infinite loop with more spectacular failure modes).
- reference
-
Where you look to find a pointer to information
somewhere else. (See indirection.) References come in two flavors:
symbolic references and hard references.
- referent
-
Whatever a reference refers to, which may or may not
have a name. Common types of referents include scalars, arrays, hashes, and
subroutines.
- regex
-
See regular expression.
- regular expression
-
A single entity with various
interpretations, like an elephant. To a computer scientist, it’s a grammar
for a little language in which some strings are legal and others aren’t. To
normal people, it’s a pattern you can use to find what you’re looking for
when it varies from case to case. Perl’s regular expressions are far from
regular in the theoretical sense, but in regular use they work quite well.
Here’s a regular expression:
/Oh s.*t./
. This will match strings like
“Oh say can you see by the dawn's early light
” and “Oh sit!
”. See
Camel chapter 5, “Pattern Matching”.
- regular expression modifier
-
An option on a pattern or substitution, such as
/i
to render the pattern
case- insensitive.
- regular file
-
A file that’s not a directory, a
device, a named pipe or socket, or a symbolic link. Perl uses
the
–f
file test operator to identify regular files. Sometimes called a
“plain” file.
- relational operator
-
An operator that says whether a particular
ordering relationship is true about a pair of operands. Perl has both
numeric and string relational operators. See collating sequence.
- reserved words
-
A word with a specific, built-in meaning
to a compiler, such as
if
or delete
. In many languages (not Perl),
it’s illegal to use reserved words to name anything else. (Which is why
they’re reserved, after all.) In Perl, you just can’t use them to name
labels or filehandles. Also called “keywords”.
- return value
-
The value produced by a subroutine
or expression when evaluated. In Perl, a return value may be either a
list or a scalar.
- RFC
-
Request For Comment, which despite the timid connotations is the name of a series of
important standards documents.
- right shift
-
A bit shift >) bit operator>> (right shift) bit operator>that divides
a number by some power of 2.
- role
-
A name for a concrete set of behaviors. A role is a way to
add behavior to a class without inheritance.
- root
-
The superuser (
UID
== 0). Also the top-level directory of
the filesystem.
- RTFM
-
What you are told when someone thinks you should Read The
Fine Manual.
- run phase
-
Any time after Perl starts running your main program.
See also compile phase. Run phase is mostly spent in runtime but may
also be spent in compile time when
require
, do
FILE
, or
eval
STRING
operators are executed, or when a substitution uses
the /ee
modifier.
- runtime
-
The time when Perl is actually doing what your
code says to do, as opposed to the earlier period of time when it was
trying to figure out whether what you said made any sense whatsoever, which
is compile time.
- runtime pattern
-
A pattern that contains one or more
variables to be interpolated before parsing the pattern as a regular
expression, and that therefore cannot be analyzed at compile time, but
must be reanalyzed each time the pattern match operator is evaluated.
Runtime patterns are useful but expensive.
- RV
-
A recreational vehicle, not
to be confused with vehicular recreation. RV also means an internal
Reference Value of the type a scalar can hold. See also IV and NV
if you’re not confused yet.
- rvalue
-
A value that you might find on the
right side of an assignment. See also lvalue.
- sandbox
-
A walled off area that’s not supposed to affect beyond
its walls. You let kids play in the sandbox instead of running in the road.
See Camel chapter 20, “Security”.
- scalar
-
A simple, singular value; a number, string, or
reference.
- scalar context
-
The situation in which an
expression is expected by its surroundings (the code calling it) to
return a single value rather than a list of values. See also
context and list context. A scalar context sometimes imposes
additional constraints on the return value—see string context and
numeric context. Sometimes we talk about a Boolean context inside
conditionals, but this imposes no additional constraints, since any scalar
value, whether numeric or string, is already true or false.
- scalar literal
-
A number or quoted string—an actual
value in the text of your program, as opposed to a variable.
- scalar value
-
A value that happens to be a
scalar as opposed to a list.
- scalar variable
-
A variable prefixed with
$
that holds a single value.
- scope
-
From how far away you can see a variable, looking through
one. Perl has two visibility mechanisms. It does dynamic scoping of
local
variables, meaning that the rest of the block, and any
subroutines that are called by the rest of the block, can see the
variables that are local to the block. Perl does lexical scoping of
my
variables, meaning that the rest of the block can see the variable,
but other subroutines called by the block cannot see the variable.
- scratchpad
-
The area in which a particular invocation of a particular
file or subroutine keeps some of its temporary values, including any
lexically scoped variables.
- script
-
A text file that is a program
intended to be executed directly rather than compiled to another form
of file before execution.
-
Also, in the context of Unicode, a writing system for a particular
language or group of languages, such as Greek, Bengali, or Tengwar.
- script kiddie
-
A cracker who is not a hacker but knows just enough
to run canned scripts. A cargo-cult programmer.
- sed
-
A venerable Stream EDitor from
which Perl derives some of its ideas.
- semaphore
-
A fancy kind of interlock that prevents multiple threads or
processes from using up the same resources simultaneously.
- separator
-
A character or string that keeps two surrounding strings from being
confused with each other. The
split
function works on separators. Not to be confused with delimiters
or terminators. The “or” in the previous sentence separated the two
alternatives.
- serialization
-
Putting a fancy data structure into
linear order so that it can be stored as a string in a disk file or
database, or sent through a pipe. Also called marshalling.
- server
-
In networking, a process that
either advertises a service or just hangs around at a known location and
waits for clients who need service to get in touch with it.
- service
-
Something you do for someone else to make them happy,
like giving them the time of day (or of their life). On some machines,
well-known services are listed by the
getservent
function.
- setgid
-
Same as setuid, only having to do with giving
away group privileges.
- setuid
-
Said of a program that runs with the privileges of
its owner rather than (as is usually the case) the privileges of whoever
is running it. Also describes the bit in the mode word (permission bits)
that controls the feature. This bit must be explicitly set by the owner to
enable this feature, and the program must be carefully written not to give
away more privileges than it ought to.
- shared memory
-
A piece of memory accessible by two
different processes who otherwise would not see each other’s memory.
- shebang
-
Irish for the whole McGillicuddy. In Perl culture, a
portmanteau of “sharp” and “bang”, meaning the
#!
sequence that tells
the system where to find the interpreter.
- shell
-
A command-line interpreter. The program that
interactively gives you a prompt, accepts one or more lines of input,
and executes the programs you mentioned, feeding each of them their proper
arguments and input data. Shells can also execute scripts containing
such commands. Under Unix, typical shells include the Bourne shell
(/bin/sh), the C shell (/bin/csh), and the Korn shell (/bin/ksh).
Perl is not strictly a shell because it’s not interactive (although Perl
programs can be interactive).
- side effects
-
Something extra that happens when you evaluate an
expression. Nowadays it can refer to almost anything. For example,
evaluating a simple assignment statement typically has the “side effect” of
assigning a value to a variable. (And you thought assigning the value was
your primary intent in the first place!) Likewise, assigning a value to the
special variable
$|
($AUTOFLUSH
) has the side effect of forcing a
flush after every write
or print
on the currently selected
filehandle.
- sigil
-
A glyph used in magic. Or, for Perl, the symbol in front
of a variable name, such as
$
, @
, and %
.
- signal
-
A bolt out of the blue; that is, an
event triggered by the operating system, probably when you’re least
expecting it.
- signal handler
-
A subroutine that, instead of being content to be
called in the normal fashion, sits around waiting for a bolt out of the
blue before it will deign to execute. Under Perl, bolts out of the blue
are called signals, and you send them with the
kill
built-in. See the
%SIG
hash in Camel chapter 25, “Special Names” and the section “Signals”
in Camel chapter 15, “Interprocess Communication”.
- single inheritance
-
The features you got from your
mother, if she told you that you don’t have a father. (See also
inheritance and multiple inheritance.) In computer languages, the
idea that classes reproduce asexually so that a given class can only
have one direct ancestor or base class. Perl supplies no such
restriction, though you may certainly program Perl that way if you like.
- slice
-
A selection of any number of
elements from a list, array, or hash.
- slurp
-
To read an entire file into a string in one operation.
- socket
-
An endpoint for network communication among multiple
processes that works much like a telephone or a post office box. The
most important thing about a socket is its network address (like a phone
number). Different kinds of sockets have different kinds of addresses—some
look like filenames, and some don’t.
- soft reference
-
See symbolic reference.
- source filter
-
A special kind of module that does
preprocessing on your script just before it gets to the tokener.
- stack
-
A device you can put things on the top of, and later take
them back off in the opposite order in which you put them on. See LIFO.
- standard
-
Included in the official Perl distribution, as in a
standard module, a standard tool, or a standard Perl manpage.
- standard error
-
The default output stream for nasty remarks that don’t belong in
standard output. Represented within a Perl program by the output> filehandle
STDERR
. You can use this
stream explicitly, but the die
and warn
built-ins write to your
standard error stream automatically (unless trapped or otherwise
intercepted).
- standard input
-
The default input stream for your program,
which if possible shouldn’t care where its data is coming from. Represented
within a Perl program by the filehandle
STDIN
.
- standard I/O
-
A standard C library for doing buffered input
and output to the operating system. (The “standard” of standard I/O is
at most marginally related to the “standard” of standard input and output.)
In general, Perl relies on whatever implementation of standard I/O a given
operating system supplies, so the buffering characteristics of a Perl
program on one machine may not exactly match those on another machine.
Normally this only influences efficiency, not semantics. If your standard
I/O package is doing block buffering and you want it to flush the buffer
more often, just set the
$|
variable to a true value.
- Standard Library
-
Everything that comes with the official
perl distribution. Some vendor versions of perl change their
distributions, leaving out some parts or including extras. See also
dual-lived.
- standard output
-
The default output stream for your program,
which if possible shouldn’t care where its data is going. Represented
within a Perl program by the filehandle
STDOUT
.
- statement
-
A command to the computer about what to do next,
like a step in a recipe: “Add marmalade to batter and mix until mixed.” A
statement is distinguished from a declaration, which doesn’t tell the
computer to do anything, but just to learn something.
- statement modifier
-
A conditional or
loop that you put after the statement instead of before, if you know
what we mean.
- static
-
Varying slowly compared to something else. (Unfortunately,
everything is relatively stable compared to something else, except for
certain elementary particles, and we’re not so sure about them.) In
computers, where things are supposed to vary rapidly, “static” has a
derogatory connotation, indicating a slightly dysfunctional variable,
subroutine, or method. In Perl culture, the word is politely avoided.
-
If you’re a C or C++ programmer, you might be looking for Perl’s state
keyword.
- static method
-
No such thing. See class method.
- static scoping
-
No such thing. See lexical scoping.
- static variable
-
No such thing. Just use a lexical
variable in a scope larger than your subroutine, or declare it with
state
instead of with my
.
- stat structure
-
A special internal spot
in which Perl keeps the information about the last file on which you
requested information.
- status
-
The value returned to the
parent process when one of its child processes dies. This value is
placed in the special variable
$?
. Its upper eight bits are the exit
status of the defunct process, and its lower eight bits identify the signal
(if any) that the process died from. On Unix systems, this status value is
the same as the status word returned by wait(2). See system
in Camel
chapter 27, “Functions”.
- STDERR
-
See standard error.
- STDIN
-
See standard input.
- STDIO
-
See standard I/O.
- STDOUT
-
See standard output.
- stream
-
A flow of data into or out of
a process as a steady sequence of bytes or characters, without the
appearance of being broken up into packets. This is a kind of
interface—the underlying implementation may well break your data up
into separate packets for delivery, but this is hidden from you.
- string
-
A sequence of characters such as “He said !@#*&%@#*?!”.
A string does not have to be entirely printable.
- string context
-
The situation in which an expression is
expected by its surroundings (the code calling it) to return a string.
See also context and numeric context.
- stringification
-
The process of producing a string representation of an
abstract object.
- struct
-
C keyword introducing a structure definition or name.
- structure
-
See data structure.
- subclass
-
See derived class.
- subpattern
-
A component of a regular expression pattern.
- subroutine
-
A named or otherwise accessible piece of program
that can be invoked from elsewhere in the program in order to accomplish
some subgoal of the program. A subroutine is often parameterized to
accomplish different but related things depending on its input
arguments. If the subroutine returns a meaningful value, it is also
called a function.
- subscript
-
A value that indicates the position of a particular
array element in an array.
- substitution
-
Changing parts of a string via the
s///
operator. (We avoid use of this term to mean variable interpolation.)
- substring
-
A portion of a string, starting at a certain
character position (offset) and proceeding for a certain number of
characters.
- superclass
-
See base class.
- superuser
-
The person whom the operating system will let do almost
anything. Typically your system administrator or someone pretending to be
your system administrator. On Unix systems, the root user. On Windows
systems, usually the Administrator user.
- SV
-
Short for “scalar value”. But
within the Perl interpreter, every referent is treated as a member of a
class derived from SV, in an object-oriented sort of way. Every value
inside Perl is passed around as a C language
SV*
pointer. The SV
struct knows its own “referent type”, and the code is smart enough (we
hope) not to try to call a hash function on a subroutine.
- switch
-
An option you give on a command line to
influence the way your program works, usually introduced with a minus sign.
The word is also used as a nickname for a switch statement.
- switch cluster
-
The combination of multiple command-
line switches (e.g.,
–a –b –c
) into one switch (e.g., –abc
).
Any switch with an additional argument must be the last switch in a
cluster.
- switch statement
-
A program technique that lets you
evaluate an expression and then, based on the value of the expression,
do a multiway branch to the appropriate piece of code for that value. Also
called a “case structure”, named after the similar Pascal construct. Most
switch statements in Perl are spelled
given
. See “The given
statement” in Camel chapter 4, “Statements and Declarations”.
- symbol
-
Generally, any token or metasymbol. Often used
more specifically to mean the sort of name you might find in a symbol
table.
- symbolic debugger
-
A program that lets you step through
the execution of your program, stopping or printing things out here and
there to see whether anything has gone wrong, and, if so, what. The
“symbolic” part just means that you can talk to the debugger using the same
symbols with which your program is written.
- symbolic link
-
An alternate filename that points to the
real filename, which in turn points to the real file. Whenever the
operating system is trying to parse a pathname containing a symbolic
link, it merely substitutes the new name and continues parsing.
- symbolic reference
-
A variable whose value is the
name of another variable or subroutine. By dereferencing the first
variable, you can get at the second one. Symbolic references are illegal
under
use strict "refs"
.
- symbol table
-
Where a compiler remembers symbols. A program
like Perl must somehow remember all the names of all the variables,
filehandles, and subroutines you’ve used. It does this by placing the
names in a symbol table, which is implemented in Perl using a hash
table. There is a separate symbol table for each package to give each
package its own namespace.
- synchronous
-
Programming in which the orderly sequence of events
can be determined; that is, when things happen one after the other, not at
the same time.
- syntactic sugar
-
An alternative way of writing something more easily; a
shortcut.
- syntax
-
From Greek σύνταξις, “with-arrangement”. How things
(particularly symbols) are put together with each other.
- syntax tree
-
An internal representation of your program wherein
lower-level constructs dangle off the higher-level constructs enclosing
them.
- syscall
-
A function call directly to the operating
system. Many of the important subroutines and functions you use aren’t
direct system calls, but are built up in one or more layers above the
system call level. In general, Perl programmers don’t need to worry about
the distinction. However, if you do happen to know which Perl functions are
really syscalls, you can predict which of these will set the
$!
($ERRNO
) variable on failure. Unfortunately, beginning programmers often
confusingly employ the term “system call” to mean what happens when you
call the Perl system
function, which actually involves many syscalls. To
avoid any confusion, we nearly always say “syscall” for something you could
call indirectly via Perl’s syscall
function, and never for something you
would call with Perl’s system
function.
- taint checks
-
The special bookkeeping Perl does to track the flow
of external data through your program and disallow their use in system
commands.
- tainted
-
Said of data derived from the grubby hands of a user,
and thus unsafe for a secure program to rely on. Perl does taint checks if
you run a setuid (or setgid) program, or if you use the
–T
switch.
- taint mode
-
Running under the
–T
switch, marking all external data as
suspect and refusing to use it with system commands. See Camel chapter 20,
“Security”.
- TCP
-
Short for Transmission Control Protocol. A protocol wrapped around the
Internet Protocol to make an unreliable packet transmission mechanism
appear to the application program to be a reliable stream of bytes.
(Usually.)
- term
-
Short for a “terminal”—that is, a leaf node of a syntax
tree. A thing that functions grammatically as an operand for the
operators in an expression.
- terminator
-
A character or string that marks the end of another string. The
$/
variable contains the string that terminates a readline
operation, which
chomp
deletes from the end. Not to be confused with delimiters or
separators. The period at the end of this sentence is a terminator.
- ternary
-
An operator taking three operands. Sometimes
pronounced trinary.
- text
-
A string or file containing primarily printable characters.
- thread
-
Like a forked process, but without fork’s inherent
memory protection. A thread is lighter weight than a full process, in that
a process could have multiple threads running around in it, all fighting
over the same process’s memory space unless steps are taken to protect
threads from one another.
- tie
-
The bond between a magical variable and its
implementation class. See the
tie
function in Camel chapter 27,
“Functions” and Camel chapter 14, “Tied Variables”.
- titlecase
-
The case used for capitals
that are followed by lowercase characters instead of by more capitals.
Sometimes called sentence case or headline case. English doesn’t use
Unicode titlecase, but casing rules for English titles are more complicated
than simply capitalizing each word’s first character.
- TMTOWTDI
-
There’s More Than One Way To Do It, the Perl Motto. The
notion that there can be more than one valid path to solving a programming
problem in context. (This doesn’t mean that more ways are always better or
that all possible paths are equally desirable—just that there need not be
One True Way.)
- token
-
A morpheme in a programming language, the smallest unit
of text with semantic significance.
- tokener
-
A module that breaks a program text into a sequence of
tokens for later analysis by a parser.
- tokenizing
-
Splitting up a program text into tokens. Also known as
“lexing”, in which case you get “lexemes” instead of tokens.
- toolbox approach
-
The notion that, with a complete set of simple tools
that work well together, you can build almost anything you want. Which is
fine if you’re assembling a tricycle, but if you’re building a
defranishizing comboflux regurgalator, you really want your own machine
shop in which to build special tools. Perl is sort of a machine shop.
- topic
-
The thing you’re working on. Structures like
while(<>)
, for
, foreach
, and given
set the topic for
you by assigning to $_
, the default (topic) variable.
- transliterate
-
To turn one string
representation into another by mapping each character of the source string
to its corresponding character in the result string. Not to be confused
with translation: for example, Greek πολύχρωμος transliterates into
polychromos but translates into many-colored. See the
tr///
operator in Camel chapter 5, “Pattern Matching”.
- trigger
-
An event that causes a handler to be run.
- trinary
-
Not a stellar system with three stars, but an
operator taking three operands. Sometimes pronounced ternary.
- troff
-
A venerable typesetting language from which Perl derives
the name of its
$%
variable and which is secretly used in the production
of Camel books.
- true
-
Any scalar value that doesn’t evaluate to 0 or
""
.
- truncating
-
Emptying a file of existing
contents, either automatically when opening a file for writing or
explicitly via the
truncate
function.
- type
-
See data type and class.
- type casting
-
Converting data from one type to another. C permits this.
Perl does not need it. Nor want it.
- typedef
-
A type definition in the C and C++ languages.
- typed lexical
-
A lexical variable lexical>that is declared with a class
type:
my Pony $bill
.
- typeglob
-
Use of a single identifier, prefixed with
*
. For
example, *name
stands for any or all of $name
, @name
, %name
,
&name
, or just name
. How you use it determines whether it is
interpreted as all or only one of them. See “Typeglobs and Filehandles” in
Camel chapter 2, “Bits and Pieces”.
- typemap
-
A description of how C types may be transformed to and from Perl
types within an extension module written in XS.
- UDP
-
User Datagram Protocol, the typical way to send
datagrams over the Internet.
- UID
-
A user ID. Often used in the context of
file or process ownership.
- umask
-
A mask of those permission bits that should be forced
off when creating files or directories, in order to establish a policy of
whom you’ll ordinarily deny access to. See the
umask
function.
- unary operator
-
An operator with only one operand, like
!
or
chdir
. Unary operators are usually prefix operators; that is, they
precede their operand. The ++
and ––
operators can be either prefix
or postfix. (Their position does change their meanings.)
- Unicode
-
A character set comprising all the major character sets of
the world, more or less. See http://www.unicode.org.
- Unix
-
A very large and constantly evolving language with several
alternative and largely incompatible syntaxes, in which anyone can define
anything any way they choose, and usually do. Speakers of this language
think it’s easy to learn because it’s so easily twisted to one’s own ends,
but dialectical differences make tribal intercommunication nearly
impossible, and travelers are often reduced to a pidgin-like subset of the
language. To be universally understood, a Unix shell programmer must spend
years of study in the art. Many have abandoned this discipline and now
communicate via an Esperanto-like language called Perl.
-
In ancient times, Unix was also used to refer to some code that a couple of
people at Bell Labs wrote to make use of a PDP-7 computer that wasn’t doing
much of anything else at the time.
- uppercase
-
In Unicode, not just
characters with the General Category of Uppercase Letter, but any character
with the Uppercase property, including some Letter Numbers and Symbols. Not
to be confused with titlecase.
- value
-
An actual piece of data, in contrast to all the
variables, references, keys, indices, operators, and whatnot that you need
to access the value.
- variable
-
A named storage location that can hold any
of various kinds of value, as your program sees fit.
- variable interpolation
-
The interpolation of
a scalar or array variable into a string.
- variadic
-
Said of a function that happily receives an
indeterminate number of actual arguments.
- vector
-
Mathematical jargon for a list of scalar values.
- virtual
-
Providing the appearance of something without the reality,
as in: virtual memory is not real memory. (See also memory.) The
opposite of “virtual” is “transparent”, which means providing the reality
of something without the appearance, as in: Perl handles the
variable-length UTF‑8 character encoding transparently.
- void context
-
A form of scalar context in which an
expression is not expected to return any value at all and is
evaluated for its side effects alone.
- v-string
-
A “version” or “vector” string
specified with a
v
followed by a series of decimal integers in dot
notation, for instance, v1.20.300.4000
. Each number turns into a
character with the specified ordinal value. (The v
is optional when
there are at least three integers.)
- warning
-
A message printed to the
STDERR
stream to the effect that something might be
wrong but isn’t worth blowing up over. See warn
in Camel chapter 27,
“Functions” and the warnings
pragma in Camel chapter 28, “Pragmantic
Modules”.
- watch expression
-
An expression which, when its value
changes, causes a breakpoint in the Perl debugger.
- weak reference
-
A reference that doesn’t get counted
normally. When all the normal references to data disappear, the data
disappears. These are useful for circular references that would never
disappear otherwise.
- whitespace
-
A character that moves
your cursor but doesn’t otherwise put anything on your screen. Typically
refers to any of: space, tab, line feed, carriage return, or form feed. In
Unicode, matches many other characters that Unicode considers whitespace,
including the ɴ-ʙʀ .
- word
-
In normal “computerese”, the piece of data of the size most
efficiently handled by your computer, typically 32 bits or so, give or take a
few powers of 2. In Perl culture, it more often refers to an alphanumeric
identifier (including underscores), or to a string of nonwhitespace
characters bounded by whitespace or string boundaries.
- working directory
-
Your current directory, from
which relative pathnames are interpreted by the operating system. The
operating system knows your current directory because you told it with a
chdir
, or because you started out in the place where your parent
process was when you were born.
- wrapper
-
A program or subroutine that runs some other program or
subroutine for you, modifying some of its input or output to better suit
your purposes.
- WYSIWYG
-
What You See Is What You Get. Usually used when something
that appears on the screen matches how it will eventually look, like Perl’s
format
declarations. Also used to mean the opposite of magic because
everything works exactly as it appears, as in the three- argument form of
open
.
- XS
-
An extraordinarily
exported, expeditiously excellent, expressly eXternal Subroutine, executed
in existing C or C++ or in an exciting extension language called
(exasperatingly) XS.
- XSUB
-
An external subroutine defined in XS.
- yacc
-
Yet Another Compiler Compiler. A parser generator without
which Perl probably would not have existed. See the file perly.y in the
Perl source distribution.
- zero width
-
A subpattern assertion matching the null
string between characters.
- zombie
-
A process that has died (exited) but
whose parent has not yet received proper notification of its demise by
virtue of having called
wait
or waitpid
. If you fork
, you must
clean up after your child processes when they exit; otherwise, the process
table will fill up and your system administrator will Not Be Happy with
you.
Based on the Glossary of Programming Perl, Fourth Edition,
by Tom Christiansen, brian d foy, Larry Wall, & Jon Orwant.
Copyright (c) 2000, 1996, 1991, 2012 O'Reilly Media, Inc.
This document may be distributed under the same terms as Perl itself.
perlglossary - Perl Glossary
|