Digest::SHA - Perl extension for SHA-1/224/256/384/512 |
Digest::SHA - Perl extension for SHA-1/224/256/384/512
In programs:
# Functional interface
use Digest::SHA qw(sha1 sha1_hex sha1_base64 ...);
$digest = sha1($data); $digest = sha1_hex($data); $digest = sha1_base64($data);
$digest = sha256($data); $digest = sha384_hex($data); $digest = sha512_base64($data);
# Object-oriented
use Digest::SHA;
$sha = Digest::SHA->new($alg);
$sha->add($data); # feed data into stream
$sha->addfile(*F); $sha->addfile($filename);
$sha->add_bits($bits); $sha->add_bits($data, $nbits);
$sha_copy = $sha->clone; # make copy of digest object $state = $sha->getstate; # save current state to string $sha->putstate($state); # restore previous $state
$digest = $sha->digest; # compute digest $digest = $sha->hexdigest; $digest = $sha->b64digest;
From the command line:
$ shasum files
$ shasum --help
# Functional interface only
use Digest::SHA qw(hmac_sha1 hmac_sha1_hex ...);
$digest = hmac_sha1($data, $key); $digest = hmac_sha224_hex($data, $key); $digest = hmac_sha256_base64($data, $key);
Digest::SHA is a complete implementation of the NIST Secure Hash Standard. It gives Perl programmers a convenient way to calculate SHA-1, SHA-224, SHA-256, SHA-384, SHA-512, SHA-512/224, and SHA-512/256 message digests. The module can handle all types of input, including partial-byte data.
Digest::SHA is written in C for speed. If your platform lacks a C compiler, you can install the functionally equivalent (but much slower) the Digest::SHA::PurePerl manpage module.
The programming interface is easy to use: it's the same one found in CPAN's the Digest manpage module. So, if your applications currently use the Digest::MD5 manpage and you'd prefer the stronger security of SHA, it's a simple matter to convert them.
The interface provides two ways to calculate digests: all-at-once, or in stages. To illustrate, the following short program computes the SHA-256 digest of ``hello world'' using each approach:
use Digest::SHA qw(sha256_hex);
$data = "hello world"; @frags = split(//, $data);
# all-at-once (Functional style) $digest1 = sha256_hex($data);
# in-stages (OOP style) $state = Digest::SHA->new(256); for (@frags) { $state->add($_) } $digest2 = $state->hexdigest;
print $digest1 eq $digest2 ? "whew!\n" : "oops!\n";
To calculate the digest of an n-bit message where n is not a multiple of 8, use the add_bits() method. For example, consider the 446-bit message consisting of the bit-string ``110'' repeated 148 times, followed by ``11''. Here's how to display its SHA-1 digest:
use Digest::SHA; $bits = "110" x 148 . "11"; $sha = Digest::SHA->new(1)->add_bits($bits); print $sha->hexdigest, "\n";
Note that for larger bit-strings, it's more efficient to use the two-argument version add_bits($data, $nbits), where $data is in the customary packed binary format used for Perl strings.
The module also lets you save intermediate SHA states to a string. The getstate() method generates portable, human-readable text describing the current state of computation. You can subsequently restore that state with putstate() to resume where the calculation left off.
To see what a state description looks like, just run the following:
use Digest::SHA; print Digest::SHA->new->add("Shaw" x 1962)->getstate;
As an added convenience, the Digest::SHA module offers routines to calculate keyed hashes using the HMAC-SHA-1/224/256/384/512 algorithms. These services exist in functional form only, and mimic the style and behavior of the sha(), sha_hex(), and sha_base64() functions.
# Test vector from draft-ietf-ipsec-ciph-sha-256-01.txt
use Digest::SHA qw(hmac_sha256_hex); print hmac_sha256_hex("Hi There", chr(0x0b) x 32), "\n";
Perl supports Unicode strings as of version 5.6. Such strings may contain wide characters, namely, characters whose ordinal values are greater than 255. This can cause problems for digest algorithms such as SHA that are specified to operate on sequences of bytes.
The rule by which Digest::SHA handles a Unicode string is easy to state, but potentially confusing to grasp: the string is interpreted as a sequence of byte values, where each byte value is equal to the ordinal value (viz. code point) of its corresponding Unicode character. That way, the Unicode string 'abc' has exactly the same digest value as the ordinary string 'abc'.
Since a wide character does not fit into a byte, the Digest::SHA routines croak if they encounter one. Whereas if a Unicode string contains no wide characters, the module accepts it quite happily. The following code illustrates the two cases:
$str1 = pack('U*', (0..255)); print sha1_hex($str1); # ok
$str2 = pack('U*', (0..256)); print sha1_hex($str2); # croaks
Be aware that the digest routines silently convert UTF-8 input into its equivalent byte sequence in the native encoding (cf. utf8::downgrade). This side effect influences only the way Perl stores the data internally, but otherwise leaves the actual value of the data intact.
NIST acknowledges that the work of Prof. Xiaoyun Wang constitutes a practical collision attack on SHA-1. Therefore, NIST encourages the rapid adoption of the SHA-2 hash functions (e.g. SHA-256) for applications requiring strong collision resistance, such as digital signatures.
ref. http://csrc.nist.gov/groups/ST/hash/statement.html
By convention, CPAN Digest modules do not pad their Base64 output. Problems can occur when feeding such digests to other software that expects properly padded Base64 encodings.
For the time being, any necessary padding must be done by the user. Fortunately, this is a simple operation: if the length of a Base64-encoded digest isn't a multiple of 4, simply append ``='' characters to the end of the digest until it is:
while (length($b64_digest) % 4) { $b64_digest .= '='; }
To illustrate, sha256_base64(``abc'') is computed to be
ungWv48Bz+pBQUDeXa4iI7ADYaOWF3qctBD/YfIAFa0
which has a length of 43. So, the properly padded version is
ungWv48Bz+pBQUDeXa4iI7ADYaOWF3qctBD/YfIAFa0=
None by default.
Provided your C compiler supports a 64-bit type (e.g. the long long of C99, or __int64 used by Microsoft C/C++), all of these functions will be available for use. Otherwise, you won't be able to perform the SHA-384 and SHA-512 transforms, both of which require 64-bit operations.
Functional style
It's important to note that the resulting string does not contain the padding characters typical of Base64 encodings. This omission is deliberate, and is done to maintain compatibility with the family of CPAN Digest modules. See PADDING OF BASE64 DIGESTS for details.
OOP style
Invoking new as an instance method will reset the object to the initial state associated with $alg. If the argument is missing, the object will continue using the same algorithm that was selected at creation.
$sha->add("a"); $sha->add("b"); $sha->add("c"); $sha->add("a")->add("b")->add("c"); $sha->add("a", "b", "c"); $sha->add("abc");
The return value is the updated object itself.
The first form causes the most-significant $nbits of $data to be appended to the stream. The $data argument is in the customary binary format used for Perl strings.
The second form takes an ASCII string of ``0'' and ``1'' characters as its argument. It's equivalent to
$sha->add_bits(pack("B*", $bits), length($bits));
So, the following two statements do the same thing:
$sha->add_bits("111100001010"); $sha->add_bits("\xF0\xA0", 12);
Note that SHA-1 and SHA-2 use most-significant-bit ordering for their internal state. This means that
$sha3->add_bits("110");
is equivalent to
$sha3->add_bits("1")->add_bits("1")->add_bits("0");
By default, $filename is simply opened and read; no special modes or I/O disciplines are used. To change this, set the optional $mode argument to one of the following values:
"b" read file in binary mode
"U" use universal newlines
"0" use BITS mode
The ``U'' mode is modeled on Python's ``Universal Newlines'' concept, whereby DOS and Mac OS line terminators are converted internally to UNIX newlines before processing. This ensures consistent digest values when working simultaneously across multiple file systems. The ``U'' mode influences only text files, namely those passing Perl's -T test; binary files are processed with no translation whatsoever.
The BITS mode (``0'') interprets the contents of $filename as a logical stream of bits, where each ASCII '0' or '1' character represents a 0 or 1 bit, respectively. All other characters are ignored. This provides a convenient way to calculate the digest values of partial-byte data by using files, rather than having to write separate programs employing the add_bits method.
Note that the digest method is a read-once operation. Once it has been performed, the Digest::SHA object is automatically reset in preparation for calculating another digest value. Call $sha->clone->digest if it's necessary to preserve the original digest state.
Like digest, this method is a read-once operation. Call $sha->clone->hexdigest if it's necessary to preserve the original digest state.
Like digest, this method is a read-once operation. Call $sha->clone->b64digest if it's necessary to preserve the original digest state.
It's important to note that the resulting string does not contain the padding characters typical of Base64 encodings. This omission is deliberate, and is done to maintain compatibility with the family of CPAN Digest modules. See PADDING OF BASE64 DIGESTS for details.
HMAC-SHA-1/224/256/384/512
It's important to note that the resulting string does not contain the padding characters typical of Base64 encodings. This omission is deliberate, and is done to maintain compatibility with the family of CPAN Digest modules. See PADDING OF BASE64 DIGESTS for details.
the Digest manpage, the Digest::SHA::PurePerl manpage
The Secure Hash Standard (Draft FIPS PUB 180-4) can be found at:
http://csrc.nist.gov/publications/drafts/fips180-4/Draft-FIPS180-4_Feb2011.pdf
The Keyed-Hash Message Authentication Code (HMAC):
http://csrc.nist.gov/publications/fips/fips198/fips-198a.pdf
Mark Shelor <mshelor@cpan.org>
The author is particularly grateful to
Gisle Aas H. Merijn Brand Sean Burke Chris Carey Alexandr Ciornii Chris David Jim Doble Thomas Drugeon Julius Duque Jeffrey Friedl Robert Gilmour Brian Gladman Jarkko Hietaniemi Adam Kennedy Mark Lawrence Andy Lester Alex Muntada Steve Peters Chris Skiscim Martin Thurn Gunnar Wolf Adam Woodbury
``who by trained skill rescued life from such great billows and such thick darkness and moored it in so perfect a calm and in so brilliant a light'' - Lucretius
Copyright (C) 2003-2018 Mark Shelor
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
Digest::SHA - Perl extension for SHA-1/224/256/384/512 |