Content Storage Management
   HOME

TheInfoList



OR:

Content storage management (CSM) is a technique for the evolution of traditional media archive technology used by media companies and content owners to store and protect valuable file-based media assets. CSM solutions focus on active management of content and media assets regardless of format, type and source, interfaces between proprietary content source/destination devices and any format and type of commodity IT centric
storage Storage may refer to: Goods Containers * Dry cask storage, for storing high-level radioactive waste * Food storage * Intermodal container, cargo shipping * Storage tank Facilities * Garage (residential), a storage space normally used to store car ...
technology. These
digital media Digital media is any communication media that operate in conjunction with various encoded machine-readable data formats. Digital media can be created, viewed, distributed, modified, listened to, and preserved on a digital electronics device. ' ...
files (or assets) most often contain
video Video is an electronic medium for the recording, copying, playback, broadcasting, and display of moving visual media. Video was first developed for mechanical television systems, which were quickly replaced by cathode-ray tube (CRT) syste ...
but in rarer cases may be still pictures or sound. A CSM system may be directed manually but is more often directed by upper-level systems, which may include
media asset management Media may refer to: Communication * Media (communication), tools used to deliver information or data ** Advertising media, various media, content, buying and placement for advertising ** Broadcast media, communications delivered over mass e ...
(MAM), automation, or traffic. Typically, CSM systems are server-based
software application Software is a set of computer programs and associated documentation and data. This is in contrast to hardware, from which the system is built and which actually performs the work. At the lowest programming level, executable code consists ...
s that reside between the media network, which connects the various broadcast or manipulation devices, and the storage network, which connects the
nearline Nearline storage (a portmanteau of "near" and "online storage") is a term used in computer science to describe an intermediate type of data storage that represents a compromise between online storage (supporting frequent, very rapid access to data ...
and archive storage tiers. The most basic function of CSM is the automated retrieval of high-resolution digital content (or essence) either from a data tape library (with the aid of a
robot A robot is a machine—especially one programmable by a computer—capable of carrying out a complex series of actions automatically. A robot can be guided by an external control device, or the control may be embedded within. Robots may be c ...
), or from a data server, and the delivery of that content either to a
workstation A workstation is a special computer designed for technical or scientific applications. Intended primarily to be used by a single user, they are commonly connected to a local area network and run multi-user operating systems. The term ''workstat ...
, playout or editing device. CSM also performs this process in reverse – moving content back to storage. In a given media operation, CSM may be used to facilitate content manipulation and repurposing; systems interoperability through high and low bit rate content
transcoding Transcoding is the direct digital-to-digital conversion of one encoding to another, such as for video data files, audio files (e.g., MP3, WAV), or character encoding (e.g., UTF-8, ISO/IEC 8859). This is usually done in cases where a target devic ...
; and/or site-to-site content replication for disaster recovery. CSM solutions comply with the well known Reference Model for an Open Archival Information System (OAIS), which is fundamental to long-term archive and content preservation in file-based environments and are characterized by a set of application-specific functionality, which can include: * Interfaces to proprietary media creation and consumption devices regardless of interface and storage topology * Transcoding and rewrapping technologies to ensure compatibility of content despite its source, format,
encoding In communications and information processing, code is a system of rules to convert information—such as a letter, word, sound, image, or gesture—into another form, sometimes shortened or secret, for communication through a communication ...
rates, aspect ratio, content container/wrapping standards * Direct integration to any type and format of storage device typically categorized as IT-centric storage devices allowing limitless or near limitless storage expansion * Integration with various network technologies including
Ethernet Ethernet () is a family of wired computer networking technologies commonly used in local area networks (LAN), metropolitan area networks (MAN) and wide area networks (WAN). It was commercially introduced in 1980 and first standardized in 198 ...
,
Fibre Channel Fibre Channel (FC) is a high-speed data transfer protocol providing in-order, lossless delivery of raw block data. Fibre Channel is primarily used to connect computer data storage to servers in storage area networks (SAN) in commercial data cen ...
, etc. and many protocols such as
SCSI Small Computer System Interface (SCSI, ) is a set of standards for physically connecting and transferring data between computers and peripheral devices. The SCSI standards define commands, protocols, electrical, optical and logical interface ...
,
TCP/IP The Internet protocol suite, commonly known as TCP/IP, is a framework for organizing the set of communication protocols used in the Internet and similar computer networks according to functional criteria. The foundational protocols in the suit ...
, etc. * Full compliance with the well established OAIS model allowing combination of reference
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
, media and any other key elements, which comprise an overall asset targeted to storage and preservation goals * Subjective content analysis for file-based audio/video content entering and exiting the system * Data integrity and validation checks such as checksums, etc. * Analytics engines for capturing, measuring and reporting on all internal aspects of the system including network bandwidth, read/write error rates, data storage profiles, etc. * Features for local and geographically distributed content storage for file-based distribution as well as disaster recovery applications * Extensibility through incremental addition of features and resources to allow limitless or near limitless expansion of the system


History of CSM

The concept of CSM was first introduced in November 2006 by Brian Campanotti and Rino Petricola from Front Porch Digital Inc, one of the providers of CSM solutions. CSM solutions have their origins in archive management systems, but their functionality is considerably broader. Likewise, CSM systems resemble hierarchical storage management (HSM) systems in that both migrate among storage tiers, but they are distinct. For example, CSM software is media-specific, with internal decision-making models based on media-centric parameters, and with programmatic interfaces specific to control and management by broadcast business systems such as traffic, inventory, and automation. CSM groups related assets (typically video file and related multilingual audio files) into a single managed object for storage and retrieval. CSM does not use stub files, which are pointers to files, but instead handles actual media files directly. CSM migrates files by copying them between systems rather than transferring them from system to system. And finally, by definition, CSM does not stand alone but functions as an
abstraction layer In computing, an abstraction layer or abstraction level is a way of hiding the working details of a subsystem. Examples of software models that use layers of abstraction include the OSI model for network protocols, OpenGL, and other graphics libra ...
between disparate systems including various storage devices, broadcast editing and playout devices and even end consumer devices such as set top boxes and Web streaming sites. The most significant differentiator between CSM and HSM solutions is the difference between migration between tiered storage fundamental to HSM and the active storage management fundamental to CSM solutions. Conceptually, HSM solutions are used to age content to less expensive tiers of storage based on static policies examining parameters such as least recently accessed, file size, specific directories/paths, etc. and treat each file as a separate and unique entity. Although CSM solutions also manage tiered storage (although not specifically limited to any type, amount or storage technology) but do not rely on static policies but rather can be living policies that can be assigned dynamically to content entering the system and govern replication policies, storage persistence, age-based migration as well as more advanced content aware processing steps such as transcoding, rewrapping, reformatting and subjective quality analysis. In some cases, the content stored within the CSM system is intentionally different than the content that entered the system initially. As an example, content entering a CSM solution in a legacy media format such as Pinnacle MPEG2 may be intentionally transcoded to MPEG2 IMX50 and wrapped in an industry standard format such as
MXF MXF or mxf may refer to: * Material Exchange Format, a container format for professional digital video and audio media * MXF, the IATA and FAA LID code for Maxwell Air Force Base, Alabama, United States * mxf, the ISO 639-3 code for Malgbe language ...
to ensure longer term compatibility with other systems connected to the CSM solution. These media-centric workflows are not inherently supported by HSM solutions. Based on the content centric focus of CSM solutions, they do not have legacy reliance on key HSM traits such as the use of stub files, which must remain on the original
file system In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one larg ...
from where the file was migrated to cheaper tiers of storage. Although beneficial in some environments where HSM is being used simply as a “disk extender” for economic motivation, this feature is a significant limitation in active file-based content workflows typical in media, entertainment and preservationist applications where CSM solutions are key. Because of the inability for broadcast devices to maintain these HSM specific stub files on their internal (online) disk, the use of HSM systems in these advanced applications mandates a copy of the media content be made first from the online storage to some other tier of spinning disk prior to the HSM system taking ownership and migrating the content to other tiers of disk. As the HSM systems do not have the ability to directly interface to these various broadcast and media devices such as encoders, video servers, editing systems, etc. there is usually another software application responsible for this additional step such as a media asset management (MAM) system or other application specific utility. CSM solutions extend the realm of content control and management directly from the internals of the content source and destination devices through any number and type of
mass storage In computing, mass storage refers to the storage of large amounts of data in a persisting and machine-readable fashion. In general, the term is used as large in relation to contemporaneous hard disk drives, but it has been used large in relati ...
devices (disk, tape, optical, offline, etc.) all managed by configurable intelligent policies similar in concept to information lifecycle policies (ILM) but enhanced to become “content aware.” CSM solutions are not meant to age content into less expensive storage over time as its perceived value diminishes but rather actively play a part in a fairly symmetrical content lifecycle where content stored within the CSM system yesterday or five years ago is just as likely to be requested to be restored. All content stored within a CSM storage
infrastructure Infrastructure is the set of facilities and systems that serve a country, city, or other area, and encompasses the services and facilities necessary for its economy, households and firms to function. Infrastructure is composed of public and priv ...
is equally viable for restore operations at any point of time and often no reliable predictability is possible. Augmenting the overall content storage capacities by allowing the addition of less expensive storage technologies is an advantage of CSM solutions but preservation is also a key driver. In terms of preservation, high-value content (historical, cultural, monetary, etc.) can be assigned differing ILM policies that can govern the number of copies or instances of the content maintained by the system and also geographical distribution to other CSM solutions via WAN connectivity providing distribution as well as
disaster recovery Disaster recovery is the process of maintaining or reestablishing vital infrastructure and systems following a natural or human-induced disaster, such as a storm or battle.It employs policies, tools, and procedures. Disaster recovery focuses on t ...
functionality. Example of CSM in Broadcast An example of the role of CSM within a broadcast setting is as follows: A movie is ingested into the content owner’s
workflow A workflow consists of an orchestrated and repeatable pattern of activity, enabled by the systematic organization of resources into processes that transform materials, provide services, or process information. It can be depicted as a sequence of ...
leveraging an encoding device and a control system such as
automation Automation describes a wide range of technologies that reduce human intervention in processes, namely by predetermining decision criteria, subprocess relationships, and related actions, as well as embodying those predeterminations in machines ...
or MAM. The control system instructs the CSM system to simultaneously analyze the content to ensure acceptable subjective quality, copy this original material into the archive library, make two copies or instances of it for protection and generate a low resolution Windows Media proxy version for Web access. An editor uses the MAM system to view the proxy generated by the CSM solution, and from his/her desktop workstation selects shots for use in a promo for the movie. The editor then sends his/her edit decision list (EDL) to the CSM system, which restores to the editing system only the desired broadcast-quality segments based on mark in and mark out
timecode A timecode (alternatively, time code) is a sequence of numeric codes generated at regular intervals by a timing synchronization system. Timecode is used in video production, show control and other applications which require temporal coordination ...
values defined within it. The CSM system then transcodes or rewraps the segments as necessary, so they can be used by the editor in the creation of the promo. Once the editor completes the promo, the editing system sends it via the CSM system (which can also repeat the replication and proxy generation process) to the on-air video servers for playout to air.


See also

* Content management *
Digital library A digital library, also called an online library, an internet library, a digital repository, or a digital collection is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital me ...
* Digital preservation *
Extensible Metadata Platform The Extensible Metadata Platform (XMP) is an ISO standard, originally created by Adobe Systems Inc., for the creation, processing and interchange of standardized and custom metadata for digital documents and data sets. XMP standardizes a data ...
* Information lifecycle management * Hierarchical storage management * Repository OSID


References

* Campanotti, Brian (2007). Content Storage Management in the Newsroom, National Association of Broadcasters (NAB) Proceedings * Campanotti, Brian (2009). Decomposing Media Operations, SMPTE Proceedings, Hollywood, LA * Campanotti, Brian (2007). Content Storage Management in Broadcast Applications, FKT * Campanotti, Brian (2007). Emerging Trends in File Based Infrastructures, Broadcast Engineering * Campanotti, Brian (2007). File Based Workflows, TV Technology América Latina * Campanotti, Brian (2007). How to Buy an Archive Systems, TVB Magazine * Campanotti, Brian (2006). Choosing a Storage Management System for Broadcast, Broadcast Engineering * Campanotti, Brian (2006). Disaster Recovery through Archive Management, Broadcast Engineering * Campanotti, Brian (2005). Digital Archive Software in Broadcast, Information Systems {{DEFAULTSORT:Content Storage Management Information management