Virtual tape library
   HOME

TheInfoList



OR:

A virtual tape library (VTL) is a
data storage Data storage is the recording (storing) of information (data) in a storage medium. Handwriting, phonographic recording, magnetic tape, and optical discs are all examples of storage media. Biological molecules such as RNA and DNA are conside ...
virtualization In computing, virtualization or virtualisation (sometimes abbreviated v12n, a numeronym) is the act of creating a virtual (rather than actual) version of something at the same abstraction level, including virtual computer hardware platforms, stor ...
technology used typically for backup and recovery purposes. A VTL presents a storage component (usually hard disk storage) as
tape libraries In computer storage, a tape library, sometimes called a tape silo, tape robot or tape jukebox, is a storage device that contains one or more tape drives, a number of slots to hold tape cartridges, a barcode reader to identify tape cartridges a ...
or
tape drive A tape drive is a data storage device that reads and writes data on a magnetic tape. Magnetic tape data storage is typically used for offline, archival data storage. Tape media generally has a favorable unit cost and a long archival stability. ...
s for use with existing backup software. Virtualizing the disk storage as tape allows integration of VTLs with existing
backup software Backup software are computer programs used to perform a backup; they create supplementary exact copies of files, databases or entire computers. These programs may later use the supplementary copies to restore the original contents in the event of d ...
and existing backup and recovery processes and policies. The benefits of such virtualization include storage consolidation and faster data restore processes. For most mainframe data centers, the storage capacity varies, however protecting its business and mission critical data is always vital. Most current VTL solutions use SAS or
SATA SATA (Serial AT Attachment) is a computer bus interface that connects host bus adapters to mass storage devices such as hard disk drives, optical drives, and solid-state drives. Serial ATA succeeded the earlier Parallel ATA (PATA) standard to ...
disk array A disk array is a disk storage system which contains multiple disk drives. It is differentiated from a disk enclosure, in that an array has cache memory and advanced functionality, like RAID, deduplication, encryption and virtualization. Compone ...
s as the primary storage component due to their relatively low cost. The use of array enclosures increases the scalability of the solution by allowing the addition of more disk drives and enclosures to increase the storage capacity. The shift to VTL also eliminates streaming problems that often impair efficiency in tape drives as disk technology does not rely on streaming and can write effectively regardless of data transfer speeds. By backing up data to disks instead of tapes, VTL often increases performance of both backup and recovery operations. Restore processes are found to be faster than backup regardless of implementations. In some cases, the data stored on the VTL's disk array is exported to other media, such as physical tapes, for
disaster recovery Disaster recovery is the process of maintaining or reestablishing vital infrastructure and systems following a natural or human-induced disaster, such as a storm or battle.It employs policies, tools, and procedures. Disaster recovery focuses on t ...
purposes (scheme called ''disk-to-disk-to-tape'', or ''D2D2T''). Alternatively, most contemporary backup software products introduced also direct usage of the
file system In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one larg ...
storage (especially
network-attached storage Network-attached storage (NAS) is a file-level (as opposed to block-level storage) computer data storage server connected to a computer network providing data access to a heterogeneous group of clients. The term "NAS" can refer to both the tech ...
, accessed through NFS and
CIFS Server Message Block (SMB) is a communication protocol originally developed in 1983 by Barry A. Feigenbaum at IBM and intended to provide shared access to files and printers across nodes on a network of systems running IBM's OS/2. It also provides ...
protocols over IP networks) not requiring a tape library emulation at all. They also often offer a
disk staging Disk staging is using disks as an additional, temporary stage of backup process before finally storing backup to tape. Backups stay on disk typically for a day or a week, before being copied to tape in a background process and deleted afterward ...
feature: moving the data from disk to a physical tape for a long-term storage. While a virtual tape library is very fast, the disk storage within is not designed to be removable, and does not usually involve physically removable external disk drives to be used for data archiving in place of tape. Since the disk storage is always connected to power and data sources and is never physically electrically isolated, it is vulnerable to potential damage and corruption due to nearby building or power grid lightning strikes.


History

The first VTL solution was introduced by Cybernetics in 1992 under the name HSTC (high speed tape cache). Later, IBM released a Virtual Tape Server (VTS) introduced in 1997. It was targeted for a
mainframe A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterprise ...
market, where many legacy applications tend to use a lot of very short tape volumes. It used the
ESCON ESCON (Enterprise Systems Connection) is a data connection created by IBM, and is commonly used to connect their mainframe computers to peripheral devices such as disk storage, tape drives and IBM 3270 display controllers. ESCON is an optical ...
interface, and acted as a disk cache for the IBM 3494 tape library. A competitive offering from StorageTek (acquired in 2005 by Sun Microsystems, then subsequently by Oracle Corporation) was known as Virtual Storage Manager (VSM) which leveraged the market dominant STK Powderhorn library as a back store. Each product line has been enhanced to support larger disk buffer capacities, FICON, and more recently (c. 2010) "tapeless" disk-only environments. Other offerings in the mainframe space are also "tapeless". DLm has been developed by EMC Corporation, while Luminex has gained popularity and wide acceptance by teaming with Data Domain to provide the benefits of
data deduplication In computing, data deduplication is a technique for eliminating duplicate copies of repeating data. Successful implementation of the technique can improve storage utilization, which may in turn lower capital expenditure by reducing the overall amou ...
behind its Channel Gateway platform. With the consequent reduction in off-site replication bandwidth afforded by deduplication, it is possible and practical for this form of virtual tape to reduce
recovery point objective Disaster recovery is the process of maintaining or reestablishing vital infrastructure and systems following a natural or human-induced disaster, such as a storm or battle.It employs policies, tools, and procedures. Disaster recovery focuses on t ...
time and
recovery time objective Disaster recovery is the process of maintaining or reestablishing vital infrastructure and systems following a natural or human-induced disaster, such as a storm or battle.It employs policies, tools, and procedures. Disaster recovery focuses on t ...
to near zero (or instantaneous). Outside of the
mainframe A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterprise ...
environment, tape drives and libraries mostly featured
SCSI Small Computer System Interface (SCSI, ) is a set of standards for physically connecting and transferring data between computers and peripheral devices. The SCSI standards define commands, protocols, electrical, optical and logical interface ...
. Likewise, VTLs were developed supporting popular SCSI transport protocols such as SPI (legacy systems),
Fibre Channel Fibre Channel (FC) is a high-speed data transfer protocol providing in-order, lossless delivery of raw block data. Fibre Channel is primarily used to connect computer data storage to servers in storage area networks (SAN) in commercial data cen ...
, and
iSCSI Internet Small Computer Systems Interface or iSCSI ( ) is an Internet Protocol-based storage networking standard for linking data storage facilities. iSCSI provides block-level access to storage devices by carrying SCSI commands over a TCP/IP ...
. The
FalconStor FalconStor is a data management software company based in Austin, Texas. History FalconStor was co-founded in 2000 in New York by Computer Associates veterans ReiJane Huai and Wayne Lam. In 2007 the company started a joint-venture with the Chi ...
VTL is the foundation of nearly half of the products sold in the VTL market, according to an Enterprise Strategy Group analyst. In mid-2010s VTLs got a rebirth thanks to hi-capacity "archive" drives from Seagate and
HGST HGST, Inc. (Hitachi Global Storage Technologies) was a manufacturer of hard disk drives, solid-state drives, and external storage products and services. It was initially a subsidiary of Hitachi, formed through its acquisition of IBM's disk d ...
and more popular "tape in cloud" and Disk-to-Disk-to-Tape (often in cloud) scenarios.
Amazon Amazon most often refers to: * Amazons, a tribe of female warriors in Greek mythology * Amazon rainforest, a rainforest covering most of the Amazon basin * Amazon River, in South America * Amazon (company), an American multinational technology c ...
and
StarWind Software StarWind Software, Inc. is a privately held Beverly, Massachusetts-based computer software and hardware appliance company specializing in storage virtualization and software-defined storage. History StarWind Software began in 2008 as a spin ...
in partnership with Veeam, BackBlaze and Wasabi Technologies offer a so-called gateway products that facilitates backing up and archiving "on premises" data as virtual tapes stored in AWS,
Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washing ...
Azure, Wasabi Technologies and BackBlaze public clouds. The idea is to provide a seamless integration of a backup applications incompatible with the APIs object storages expose. Say, at the time Veeam couldn't do AWS S3 and can't backup to the deep archive tier within Azure still.


See also

*
Backup In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "back up", w ...
*
Tape library In computer storage, a tape library, sometimes called a tape silo, tape robot or tape jukebox, is a storage device that contains one or more tape drives, a number of slots to hold tape cartridges, a barcode reader to identify tape cartridges a ...
* Tape Management System *
Disk staging Disk staging is using disks as an additional, temporary stage of backup process before finally storing backup to tape. Backups stay on disk typically for a day or a week, before being copied to tape in a background process and deleted afterward ...
for an alternative approach *
Emulation Emulation may refer to: *Emulation (computing), imitation of behavior of a computer or other electronic system with the help of another type of system :*Video game console emulator, software which emulates video game consoles *Gaussian process em ...
*
Storage virtualization In computer science, storage virtualization is "the process of presenting a logical view of the physical storage resources to" a host computer system, "treating all storage media (hard disk, optical disk, tape, etc.) in the enterprise as a singl ...
*
Seven tiers of disaster recovery Business continuity may be defined as "the capability of an organization to continue the delivery of products or services at pre-defined acceptable levels following a disruptive incident", and business continuity planning (or business continuity a ...


References

{{Operating System Tape-based computer storage Backup