HOME

TheInfoList



OR:

In a
computer system A computer is a machine that can be programmed to carry out sequences of arithmetic or logical operations (computation) automatically. Modern digital electronic computers can perform generic sets of operations known as programs. These progr ...
or
network Network, networking and networked may refer to: Science and technology * Network theory, the study of graphs as a representation of relations between discrete objects * Network science, an academic field that studies complex networks Mathematics ...
, a runbook is a compilation of routine procedures and operations that the system administrator or operator carries out.
System administrator A system administrator, or sysadmin, or admin is a person who is responsible for the upkeep, configuration, and reliable operation of computer systems, especially multi-user computers, such as servers. The system administrator seeks to ensu ...
s in IT departments and NOCs use runbooks as a reference. Runbooks can be in either electronic or in physical book form. Typically, a runbook contains procedures to begin, stop, supervise, and debug the system. It may also describe procedures for handling special requests and contingencies. An effective runbook allows other operators, with prerequisite expertise, to effectively manage and troubleshoot a system. Through runbook automation, these processes can be carried out using software tools in a predetermined manner. In addition to automating IT specific processes, the results of the runbook can be presented on-screen back to the user or Service Desk engineer. Multiple runbooks can be linked together using a
Decision Tree A decision tree is a decision support tool that uses a tree-like model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that only contains condit ...
to provide users with interactive troubleshooting and guided procedures.


Runbook applied to operations

Operational runbooks may be tied to
ITIL The Information Technology Infrastructure Library (ITIL) is a set of detailed practices for IT activities such as IT service management (ITSM) and IT asset management (ITAM) that focus on aligning IT services with the needs of business. ITIL de ...
incidents to allow repeatable processes supporting specific aspects of the
service catalog A service catalog (or catalogue), is an organized and curated collection of any and all business and information technology related services that can be performed by, for, or within an enterprise. Service catalogs act as knowledge management tools ...
. The runbook is typically divided into routine automated processes and routine manual processes. The runbook catalog begins with an index of processes covered and may be broken down in
outline Outline or outlining may refer to: * Outline (list), a document summary, in hierarchical list format * Code folding, a method of hiding or collapsing code or text to see content in outline form * Outline drawing, a sketch depicting the outer edge ...
form to align the processes to the major elements they support in the service catalog. A runbook is a compilation of routine procedures and operations that the system administrator or operator carries out.


Computer operator documentation

The documentation for running a task, whether in electronic or paper form, is called a runbook. Sometimes written as "run book," it may supplement bullet-pointed steps with error messages (and what to do) and
flowchart A flowchart is a type of diagram that represents a workflow or process. A flowchart can also be defined as a diagrammatic representation of an algorithm, a step-by-step approach to solving a task. The flowchart shows the steps as boxes of va ...
s.


Runbook automation

Runbook automation (RBA) is the ability to define, build, orchestrate, manage, and report on
workflow A workflow consists of an orchestrated and repeatable pattern of activity, enabled by the systematic organization of resources into processes that transform materials, provide services, or process information. It can be depicted as a sequence of ...
s that support system and network operational processes. Areas of a business ideal for IT automation are Operations Teams, Service Desk, Network Operations Center's (NOC's), Cloud Operations, Integrations, and Automation Center of Excellence (CoE). A runbook workflow can potentially interact with all types of infrastructure elements, such as
applications Application may refer to: Mathematics and computing * Application software, computer software designed to help the user to perform specific tasks ** Application layer, an abstraction layer that specifies protocols and interface methods used in a c ...
,
database In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage. The design of databases sp ...
s, and hardware - using a variety of communication methods such as command-line interfaces (CLI), HTTP
REST Rest or REST may refer to: Relief from activity * Sleep ** Bed rest * Kneeling * Lying (position) * Sitting * Squatting position Structural support * Structural support ** Rest (cue sports) ** Armrest ** Headrest ** Footrest Arts and entert ...
and
SOAP Soap is a salt of a fatty acid used in a variety of cleansing and lubricating products. In a domestic setting, soaps are surfactants usually used for washing, bathing, and other types of housekeeping. In industrial settings, soaps are use ...
API's,
SSH The Secure Shell Protocol (SSH) is a cryptographic network protocol for operating network services securely over an unsecured network. Its most notable applications are remote login and command-line execution. SSH applications are based on ...
sessions,
scripts Script may refer to: Writing systems * Script, a distinctive writing system, based on a repertoire of specific elements or symbols, or that repertoire * Script (styles of handwriting) ** Script typeface, a typeface with characteristics of handw ...
, utilities, and code libraries. According to
Gartner Gartner, Inc is a technological research and consulting firm based in Stamford, Connecticut that conducts research on technology and shares this research both through private consulting as well as executive programs and conferences. Its clients ...
, the growth of RBA has coincided with the need for IT operations executives to enhance IT operations efficiency measures—including reducing
mean time to repair Mean time to repair (MTTR) is a basic measure of the maintainability of repairable items. It represents the average time required to repair a failed component or device. Expressed mathematically, it is the total corrective maintenance time for ...
(MTTR), increasing
mean time between failures Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a mechanical or electronic system during normal system operation. MTBF can be calculated as the arithmetic mean (average) time between failures of a system ...
(MTBF), and automating the provisioning of IT resources. In addition, it is necessary to have the mechanisms to implement best practices (for example, implement and manage IT operations processes in line with the ITIL), increase the effectiveness of IT personnel (for example, automate repetitive tasks associated with IT operations processes), and have the tools to report on how well the processes are executed in line with established policies and service levels. Patents have been granted for various aspects of producing, improving, and using runbooks. Some tools can incorporate a front-end or presentation layer (to the runbook) so that the results of the automated tasks are presented on-screen, sometimes even with red/amber/green traffic lights to visualise if a specific task completed successfully, or failed to execute correctly.


See also

*
Automation Automation describes a wide range of technologies that reduce human intervention in processes, namely by predetermining decision criteria, subprocess relationships, and related actions, as well as embodying those predeterminations in machines ...
*
Support automation Customer support is a range of Customer service, services to assist customers in making cost effective and correct use of a product. It includes assistance in planning, installation, training, troubleshooting, maintenance, upgrading, and disposal ...
*
Prompt book The prompt book, also called transcript, the bible or sometimes simply "the book," is the copy of a production script that contains the information necessary to create a theatrical production from the ground up. It is a compilation of all blocking ...


References

* ''IT Operations Run Book Automation'', by David Williams, Gartner. May 4, 2007.


Further reading

* {{cite book , title=Documentation Writing for System Administrators , author1=Mark C. Langston , author2=Hal Skelly , year=2003 , series=SAGE Short Topics in System Administration , publisher=USENIX , url=https://www.usenix.org/lisa/books/documentation-writing-system-administrators/ , page=61 Workflow technology Automation software System administration