Tape library

In computer storage, a tape library is a physical area that holds magnetic data tapes. In an earlier era, tape libraries were maintained by people known as tape librarians and computer operators and the proper operation of the library was crucial to the running of batch processing jobs. Although tape libraries of this era were not automated, the use of tape management system software could assist in running them.

Subsequently, tape libraries became physically automated, and as such are sometimes called a tape silo, tape robot, or tape jukebox. These are a storage devices that contain one or more tape drives, a number of slots to hold tape cartridges, a barcode reader to identify tape cartridges, and an automated method for loading tapes (a robot). Such solutions are mostly used for backups and for digital archiving. Additionally, the area where tapes that are not currently in a silo are stored is also called a tape library. One of the earliest examples was the IBM 3850 Mass Storage System (MSS), announced in 1974.

In either era, tape libraries can contain millions of tapes.

Manual era

[edit]
A manual magnetic tape library, common in the 1960s and 1970s. Rolling carts are used by staff to transfer tapes between the racks in the library and the computer room where the tape drives reside.

Tapes and batch processing

[edit]

In the mainframe computer era, especially the IBM mainframe, the most common format in use was the 9-track tape.[1] Some large application systems could require scores of different tapes as part of their batch job runs.[2]

In the data processing applications of the era, the master files for such things as employee payroll information, supplies and stores inventory, or customer accounts were typically kept on tape.[3][4] Batch jobs to update these master files would take the existing tape master file as input and write out a new tape master file as output.[5] In addition, the set of update transactions themselves might constitute a second input tape.[3] The master file output of one update job would then be the master file input to the next time the job is run, perhaps a day, a week, or a month later.[1] The tapes representing a few past iterations of a master file would typically be retained, in case a problem with the latest version were to be discovered and the job had to be rerun.[1]

Role of tape libraries and librarians

[edit]
Tape Retention / Scratch Control form, in triplicate

Mainframe computer installations often had a separate room, the tape library, to house their racks and cabinets of tapes.[1] The typical workflow for running a batch job was to go into the library, pull certain tapes off the racks there and load them onto a rolling cart, move the cart into the computer area, mount the tapes onto tape drives for a production run, take the tapes off the drives when the run was over, move the cart back to the library, and put the tapes back on the library racks. Such tape libraries existed at most computer installations.[6]

Even a modestly sized computer installation could have hundreds of tapes,[4] and library sizes of several thousand reels of tapes were commonplace.[6] And they could be much larger: by the mid-1970s, the U.S. Census Bureau and NASA each had tape libraries with around one million tape reels in them.[2] The person in charge of all this was typically called the tape librarian.[1][4]

In this era, there were no automated tape delivery and mounting systems, and so this action had to be done by computer operators.[6] These people were the ones responsible for mounting tapes onto tape drives as part of running a job.[1] Even careful computer operators could sometimes mount the wrong tape as input to a job or present the reels of a multi-tape dataset out of order.[2] Overwriting a tape that was meant to be preserved was another potential mistake.[4]

It was the tape librarian's responsibility to set up procedures for the handling of tapes to minimize the chances of errors taking place.[4] As one book of the era wrote, "keeping track of the whereabouts of the tapes is a formidable and responsible job."[1]

Supporting software

[edit]

Tape management systems of this era were software packages whose purpose was to help facilitate tape library operations and management. They kept track of data sets on tape, and produced reports indicating whether a data set should be retained on, or could be scratched from, a tape; they aided in the setup and running of scheduled production jobs, through such things as tape pull lists and pre-printed external gummed tape labels; and they kept track of the physical inventory of tape reels. The most popular of these packages was UCC-1 from University Computing Company,[7] a product that was also known as the Tape Management System.[8] It made several appearances on Datapro Research Corporation's Software Honor Roll.[7] Another was Valu-Lib from Value Computing, Inc.,[9][10] and a third was TLMS II from Capex Corporation.[11]

As use of the mainframe continued on into the following century, tape library management, both manual and automatic, was one element of the offerings of the Data Facility Storage Management Subsystem (MVS) from IBM.[12]

Automated era

[edit]
Large StorageTek Powderhorn tape library, showing tape cartridges with barcodes packed on shelves in the front and a robot arm moving in the back
Small ADIC Scalar 100 tape library, showing a robot visible on the bottom with two IBM LTO2 tape drives behind it

Design

[edit]

Physically automated tape library devices can store immense amounts of data, ranging from 20 terabytes[13] up to 2.1 exabytes of data[14] as of 2016. Such capacity is multiple thousand times that of a typical hard drive and well in excess of what is capable with network attached storage. Typical entry-level solutions cost around $10,000 USD,[15] while high-end solutions can start at as much as $200,000 USD[16] and cost well in excess of $1 million for a fully expanded and configured library.

For large data-storage, they are a cost-effective solution, with cost per gigabyte as low as 2 cents USD.[17] The tradeoff for their larger capacity is their slower access time, which usually involves mechanical manipulation of tapes. Access to data in a library takes from several seconds to several minutes.

Because of their slow sequential access and huge capacity, tape libraries are primarily used for backups and as the final stage of digital archiving. A typical application of the latter would be an organization's extensive transaction record for legal or auditing purposes. Another example is hierarchical storage management (HSM), in which tape library is used to hold rarely used files from file systems.

Software support

[edit]

There are several large-scale library-management packages available commercially. Open-source implementations include AMANDA, Bacula, and the minimal mtx program.

Barcode labels

[edit]

Tape libraries commonly have the capability of optically scanning barcode labels which are attached to each tape, allowing them to automatically maintain an inventory of which tapes are where within the library. Preprinted barcode labels are commercially available or custom labels may be generated using commercial or free software. The barcode label is frequently part of the tape label, information recorded at the beginning of the medium to uniquely identify the tape.

Autoloaders

[edit]
Dell PowerVault 124T Autoloader

Smaller tape libraries with only one drive are known as autoloaders.[18] The term autoloader is also sometimes used synonymously with stacker,[19] a device in which the media are loaded necessarily in a sequential manner.[20]

Other types of autoloaders may operate with optical discs (such as compact discs or DVDs) or floppy disks[citation needed].

See also

[edit]

References

[edit]
  1. ^ a b c d e f g Popkin, Gary S.; Pike, Arthur H. (1977). Introduction to Data Processing. Boston: Houghton Mifflin Company. pp. 149–151, 260–263. ISBN 0-395-20628-6.
  2. ^ a b c McCracken, Daniel D. (1976). A Simplified Guide to Structured COBOL Programming. New York: John Wiley & Sons. pp. 259, 264. ISBN 0-471-58284-0.
  3. ^ a b McQuillen, Kevin (1975). System/360–370 Assembler Language (OS). Fresno, California: Mike Murach & Associates. p. 302. LCCN 74-29645.
  4. ^ a b c d e Stern, Nancy; Stern, Robert A. (1980). Structured COBOL Programming (3rd ed.). New York: John Wiley & Sons. pp. 494, 496, 498–499. ISBN 0-471-04913-1.
  5. ^ Ashley, Ruth; Fernandez, Judi N. (1978). Job Control Language: A Self-Teaching Guide. New York: John Wiley & Sons. p. 43. ISBN 0-471-03205-0.
  6. ^ a b c Conway, Richard; Gries, David (1973). An Introduction to Programming: A Structured Approach using PL/1 and PL/C. Cambridge, Massachusetts: Winthrop. pp. 333–334.
  7. ^ a b Leavitt, Don (January 17, 1977). "Users Put 38 Packages on Honor Roll". Computerworld. p. 23.
  8. ^ "UCC-1 Tape Management Updated with Release 4.7". Computerworld. July 4, 1983. p. 35.
  9. ^ "'Valu-Lib' Can Run Tape Library, Can Interface With Scheduler". Computerworld. May 16, 1973. p. 15.
  10. ^ "'Valu Lib' Update Released For IBM 4300s, Series/36". Computerworld. December 19, 1983. p. 32.
  11. ^ "uncertain". Infosystems. Vol. uncertain. Hitchcock Publishing Company. 1980. p. 90. Archived from the original on March 25, 2023. Retrieved February 22, 2023.
  12. ^ "Introduction to tape library management". IBM. April 5, 2023. Retrieved November 1, 2023.
  13. ^ "HP StorageWorks MSL2024 Tape Library - overview". March 18, 2006. Archived from the original on March 18, 2006. Retrieved June 19, 2018.{{cite web}}: CS1 maint: bot: original URL status unknown (link)
  14. ^ Oracle "StorageTek SL8500 Modular Library System".
  15. ^ HP Small & Medium Business Online Store: HP StorageWorks MSL2024 Tape Libraries
  16. ^ [1]. Cites cost as "From $195,830. (US)"
  17. ^ "The Costs Of Storage". Forbes.
  18. ^ "SNIA Dictionary". Storage Network Industry Association. Retrieved 2010-01-30. tape autoloader...[Storage System] A tape device that provides automated access to multiple tape cartridges, typically via a single tape drive.
  19. ^ "Ten common backup/restore related questions". Sun Microsystems, Inc. Retrieved 2010-01-30. What is a stacker (autoloader) vs a jukebox?
  20. ^ "SNIA Dictionary". Storage Network Industry Association. Retrieved 2010-01-30. media stacker...[Data Recovery] A robotic media handler in which media must be moved sequentially by the robot.