A computer program is a sequence or set of instructions in a
programming language
A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language.
The description of a programming ...
for a
computer to
execute
Execute, in capital punishment, is to put someone to death.
Execute may also refer to:
* Execution (computing), the running of a computer program
* ''Execute'' (album), a 2001 Garage hip-hop album by Oxide & Neutrino
* USS ''Execute'' (AM-232), a ...
. Computer programs are one component of
software
Software is a set of computer programs and associated software documentation, documentation and data (computing), data. This is in contrast to Computer hardware, hardware, from which the system is built and which actually performs the work.
...
, which also includes
documentation and other intangible components.
A computer program in its human-readable form is called
source code
In computing, source code, or simply code, is any collection of code, with or without comments, written using a human-readable programming language, usually as plain text. The source code of a program is specially designed to facilitate the w ...
. Source code needs another computer program to
execute
Execute, in capital punishment, is to put someone to death.
Execute may also refer to:
* Execution (computing), the running of a computer program
* ''Execute'' (album), a 2001 Garage hip-hop album by Oxide & Neutrino
* USS ''Execute'' (AM-232), a ...
because computers can only execute their native
machine instructions
In computer programming, machine code is any low-level programming language, consisting of machine language instruction set architecture, instructions, which are used to control a computer's central processing unit (CPU). Each instruction cau ...
. Therefore, source code may be translated to machine instructions using the language's
compiler
In computing, a compiler is a computer program that translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primarily used for programs tha ...
. (
Assembly language programs are translated using an
assembler
Assembler may refer to:
Arts and media
* Nobukazu Takemura, avant-garde electronic musician, stage name Assembler
* Assemblers, a fictional race in the ''Star Wars'' universe
* Assemblers, an alternative name of the superhero group Champions of ...
.) The resulting file is called an
executable. Alternatively, source code may execute within the language's
interpreter.
If the executable is requested for execution, then the
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
loads it into
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembered ...
and starts a
process
A process is a series or set of activities that interact to produce a result; it may occur once-only or be recurrent or periodic.
Things called a process include:
Business and management
*Business process, activities that produce a specific se ...
.
The
central processing unit
A central processing unit (CPU), also called a central processor, main processor or just Processor (computing), processor, is the electronic circuitry that executes Instruction (computing), instructions comprising a computer program. The CPU per ...
will soon
switch
In electrical engineering, a switch is an electrical component that can disconnect or connect the conducting path in an electrical circuit, interrupting the electric current or diverting it from one conductor to another. The most common type of ...
to this process so it can
fetch, decode, and then execute each machine instruction.
If the source code is requested for execution, then the operating system loads the corresponding interpreter into memory and starts a process. The interpreter then loads the source code into memory to translate and execute each
statement.
Running the source code is slower than running an executable. Moreover, the interpreter must be installed on the computer.
Example computer program
The
"Hello, World!" program is used to illustrate a language's basic
syntax. The syntax of the language
BASIC (1964) was intentionally limited to make the language easy to learn.
For example,
variables are not
declared before being used.
Also, variables are automatically initialized to zero.
Here is an example computer program, in Basic, to
average
In ordinary language, an average is a single number taken as representative of a list of numbers, usually the sum of the numbers divided by how many numbers are in the list (the arithmetic mean). For example, the average of the numbers 2, 3, 4, 7 ...
a list of numbers:
10 INPUT "How many numbers to average?", A
20 FOR I = 1 TO A
30 INPUT "Enter number:", B
40 LET C = C + B
50 NEXT I
60 LET D = C/A
70 PRINT "The average is", D
80 END
Once the mechanics of basic computer programming are learned, more sophisticated and powerful languages are available to build large computer systems.
History
Improvements in
software development are the result of improvements in
computer hardware. At each stage in hardware's history, the task of
computer programming
Computer programming is the process of performing a particular computation (or more generally, accomplishing a specific computing result), usually by designing and building an executable computer program. Programming involves tasks such as anal ...
changed dramatically.
Analytical Engine
In 1837,
Charles Babbage was inspired by
Jacquard's loom to attempt to build the
Analytical Engine.
The names of the components of the calculating device were borrowed from the textile industry. In the textile industry, yarn was brought from the store to be milled. The device had a "store" which consisted of memory to hold 1,000 numbers of 50 decimal digits each.
Numbers from the "store" were transferred to the "mill" for processing. It was programmed using two sets of perforated cards. One set directed the operation and the other set inputted the variables.
However, after more than 17,000 pounds of the British government's money, the thousands of cogged wheels and gears never fully worked together.
Ada Lovelace worked for Charles Babbage to create a description of the Analytical Engine (1843).
The description contained Note G which completely detailed a method for calculating
Bernoulli number
In mathematics, the Bernoulli numbers are a sequence of rational numbers which occur frequently in analysis. The Bernoulli numbers appear in (and can be defined by) the Taylor series expansions of the tangent and hyperbolic tangent functions, ...
s using the Analytical Engine. This note is recognized by some historians as the world's first computer program.
Universal Turing machine
In 1936,
Alan Turing
Alan Mathison Turing (; 23 June 1912 – 7 June 1954) was an English mathematician, computer scientist, logician, cryptanalyst, philosopher, and theoretical biologist. Turing was highly influential in the development of theoretical co ...
introduced the
Universal Turing machine
In computer science, a universal Turing machine (UTM) is a Turing machine that can simulate an arbitrary Turing machine on arbitrary input. The universal machine essentially achieves this by reading both the description of the machine to be simu ...
, a theoretical device that can model every computation.
It is a
finite-state machine
A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...
that has an infinitely long read/write tape. The machine can move the tape back and forth, changing its contents as it performs an
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algorithms are used as specificat ...
. The machine starts in the initial state, goes through a sequence of steps, and halts when it encounters the halt state.
All present-day computers are
Turing complete
Alan Mathison Turing (; 23 June 1912 – 7 June 1954) was an English mathematician, computer scientist, logician, cryptanalyst, philosopher, and theoretical biologist. Turing was highly influential in the development of theoretical co ...
.
ENIAC
The
Electronic Numerical Integrator And Computer (ENIAC) was built between July 1943 and Fall 1945. It was a
Turing complete
Alan Mathison Turing (; 23 June 1912 – 7 June 1954) was an English mathematician, computer scientist, logician, cryptanalyst, philosopher, and theoretical biologist. Turing was highly influential in the development of theoretical co ...
, general-purpose computer that used 17,468
vacuum tube
A vacuum tube, electron tube, valve (British usage), or tube (North America), is a device that controls electric current flow in a high vacuum between electrodes to which an electric potential difference has been applied.
The type known as ...
s to create the
circuits. At its core, it was a series of
Pascalines wired together.
Its 40 units weighed 30 tons, occupied , and consumed $650 per hour (
in 1940s currency) in electricity when idle.
It had 20
base-10
The decimal numeral system (also called the base-ten positional numeral system and denary or decanary) is the standard system for denoting integer and non-integer numbers. It is the extension to non-integer numbers of the Hindu–Arabic numeral ...
accumulators. Programming the ENIAC took up to two months.
Three function tables were on wheels and needed to be rolled to fixed function panels. Function tables were connected to function panels by plugging heavy black cables into
plugboard
A plugboard or control panel (the term used depends on the application area) is an array of jacks or sockets (often called hubs) into which patch cords can be inserted to complete an electrical circuit. Control panels are sometimes used to di ...
s. Each function table had 728 rotating knobs. Programming the ENIAC also involved setting some of the 3,000 switches. Debugging a program took a week.
It ran from 1947 until 1955 at
Aberdeen Proving Ground
Aberdeen Proving Ground (APG) (sometimes erroneously called Aberdeen Proving ''Grounds'') is a U.S. Army facility located adjacent to Aberdeen, Harford County, Maryland, United States. More than 7,500 civilians and 5,000 military personnel work a ...
, calculating hydrogen bomb parameters, predicting weather patterns, and producing firing tables to aim artillery guns.
Stored-program computers
Instead of plugging in cords and turning switches, a
stored-program computer loads its instructions into memory just like it loads its data into memory.
As a result, the computer could be programmed quickly and perform calculations at very fast speeds.
Presper Eckert
John Adam Presper Eckert Jr. (April 9, 1919 – June 3, 1995) was an American electrical engineer and computer pioneer. With John Mauchly, he designed the first general-purpose electronic digital computer (ENIAC), presented the first course in co ...
and
John Mauchly
John William Mauchly (August 30, 1907 – January 8, 1980) was an American physicist who, along with J. Presper Eckert, designed ENIAC, the first general-purpose electronic digital computer, as well as EDVAC, BINAC and UNIVAC I, the first ...
built the ENIAC. The two engineers introduced the ''stored-program concept'' in a three-page memo dated February 1944.
Later, in September 1944,
Dr. John von Neumann began working on the ENIAC project. On June 30, 1945, von Neumann published the ''
First Draft of a Report on the EDVAC
The ''First Draft of a Report on the EDVAC'' (commonly shortened to ''First Draft'') is an incomplete 101-page document written by John von Neumann and distributed on June 30, 1945 by Herman Goldstine, security officer on the classified ENIAC pro ...
'' which equated the structures of the computer with the structures of the human brain.
The design became known as the
von Neumann architecture
The von Neumann architecture — also known as the von Neumann model or Princeton architecture — is a computer architecture based on a 1945 description by John von Neumann, and by others, in the '' First Draft of a Report on the EDVAC''. T ...
. The architecture was simultaneously deployed in the constructions of the
EDVAC and
EDSAC computers in 1949.
The
IBM System/360 (1964) was a line of six computers, each having the same
instruction set architecture. The
Model 30 was the smallest and least expensive. Customers could upgrade and retain the same
application software
Application may refer to:
Mathematics and computing
* Application software, computer software designed to help the user to perform specific tasks
** Application layer, an abstraction layer that specifies protocols and interface methods used in a ...
.
The
Model 75 was the most premium. Each System/360 model featured
multiprogramming
In computing, multitasking is the concurrent execution of multiple tasks (also known as processes) over a certain period of time. New tasks can interrupt already started ones before they finish, instead of waiting for them to end. As a result ...
—having multiple
processes in
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembered ...
at once. When one process was waiting for
input/output
In computing, input/output (I/O, or informally io or IO) is the communication between an information processing system, such as a computer, and the outside world, possibly a human or another information processing system. Inputs are the signals ...
, another could compute.
IBM planned for each model to be programmed using
PL/1
PL/I (Programming Language One, pronounced and sometimes written PL/1) is a procedural, imperative computer programming language developed and published by IBM. It is designed for scientific, engineering, business and system programming. I ...
.
A committee was formed that included
COBOL,
Fortran and
ALGOL
ALGOL (; short for "Algorithmic Language") is a family of imperative computer programming languages originally developed in 1958. ALGOL heavily influenced many other languages and was the standard method for algorithm description used by the ...
programmers. The purpose was to develop a language that was comprehensive, easy to use, extendible, and would replace Cobol and Fortran.
The result was a large and complex language that took a long time to
compile
In computing, a compiler is a computer program that translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primarily used for programs that ...
.
Computers manufactured until the 1970s had front-panel switches for manual programming.
The computer program was written on paper for reference. An instruction was represented by a configuration of on/off settings. After setting the configuration, an execute button was pressed. This process was then repeated. Computer programs also were automatically inputted via
paper tape or
punched cards
A punched card (also punch card or punched-card) is a piece of stiff paper that holds digital data represented by the presence or absence of holes in predefined positions. Punched cards were once common in data processing applications or to di ...
. After the medium was loaded, the starting address was set via switches, and the execute button was pressed.
Very Large Scale Integration
A major milestone in software development was the invention of the
Very Large Scale Integration
Very large-scale integration (VLSI) is the process of creating an integrated circuit (IC) by combining millions or billions of MOS transistors onto a single chip. VLSI began in the 1970s when MOS integrated circuit (Metal Oxide Semiconductor) ...
(VLSI) circuit (1964).
Following
World War II
World War II or the Second World War, often abbreviated as WWII or WW2, was a world war that lasted from 1939 to 1945. It involved the vast majority of the world's countries—including all of the great powers—forming two opposin ...
, tube-based technology was replaced with
point-contact transistors (1947) and
bipolar junction transistor
A bipolar junction transistor (BJT) is a type of transistor that uses both electrons and electron holes as charge carriers. In contrast, a unipolar transistor, such as a field-effect transistor, uses only one kind of charge carrier. A bipola ...
s (late 1950s) mounted on a
circuit board
A printed circuit board (PCB; also printed wiring board or PWB) is a medium used in electrical and electronic engineering to connect electronic components to one another in a controlled manner. It takes the form of a laminated sandwich struc ...
.
During the 1960s, the
aerospace
Aerospace is a term used to collectively refer to the atmosphere and outer space. Aerospace activity is very diverse, with a multitude of commercial, industrial and military applications. Aerospace engineering consists of aeronautics and astr ...
industry replaced the circuit board with an
integrated circuit chip.
Robert Noyce, co-founder of
Fairchild Semiconductor (1957) and
Intel
Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California. It is the world's largest semiconductor chip manufacturer by revenue, and is one of the developers of the x86 seri ...
(1968), achieved a technological improvement to refine the
production
Production may refer to:
Economics and business
* Production (economics)
* Production, the act of manufacturing goods
* Production, in the outline of industrial organization, the act of making products (goods and services)
* Production as a stati ...
of
field-effect transistor
The field-effect transistor (FET) is a type of transistor that uses an electric field to control the flow of current in a semiconductor. FETs ( JFETs or MOSFETs) are devices with three terminals: ''source'', ''gate'', and ''drain''. FETs cont ...
s (1963).
The goal is to alter the
electrical resistivity and conductivity of a
semiconductor junction
A semiconductor is a material which has an electrical conductivity value falling between that of a conductor, such as copper, and an insulator, such as glass. Its resistivity falls as its temperature rises; metals behave in the opposite way. ...
. First, naturally occurring
silicate minerals
Silicate minerals are rock-forming minerals made up of silicate groups. They are the largest and most important class of minerals and make up approximately 90 percent of Earth's crust.
In mineralogy, silica (silicon dioxide, ) is usually con ...
are converted into
polysilicon
Polycrystalline silicon, or multicrystalline silicon, also called polysilicon, poly-Si, or mc-Si, is a high purity, polycrystalline form of silicon, used as a raw material by the solar photovoltaic and electronics industry.
Polysilicon is produce ...
rods using the
Siemens process
Chemical vapor deposition (CVD) is a vacuum deposition method used to produce high quality, and high-performance, solid materials. The process is often used in the semiconductor industry to produce thin films.
In typical CVD, the wafer (subst ...
.
The
Czochralski process then converts the rods into a
monocrystalline silicon,
boule crystal.
The
crystal
A crystal or crystalline solid is a solid material whose constituents (such as atoms, molecules, or ions) are arranged in a highly ordered microscopic structure, forming a crystal lattice that extends in all directions. In addition, macro ...
is then thinly sliced to form a
wafer
A wafer is a crisp, often sweet, very thin, flat, light and dry biscuit, often used to decorate ice cream, and also used as a garnish on some sweet dishes. Wafers can also be made into cookies with cream flavoring sandwiched between them. They ...
substrate. The
planar process of
photolithography then ''integrates'' unipolar transistors,
capacitor
A capacitor is a device that stores electrical energy in an electric field by virtue of accumulating electric charges on two close surfaces insulated from each other. It is a passive electronic component with two terminals.
The effect of ...
s,
diodes, and
resistors onto the wafer to build a matrix of
metal–oxide–semiconductor
The metal–oxide–semiconductor field-effect transistor (MOSFET, MOS-FET, or MOS FET) is a type of field-effect transistor (FET), most commonly fabricated by the controlled oxidation of silicon. It has an insulated gate, the voltage of which d ...
(MOS) transistors.
The MOS transistor is the primary component in ''integrated circuit chips''.
Originally, integrated circuit chips had their function set during manufacturing. During the 1960s, controlling the electrical flow migrated to programming a
matrix
Matrix most commonly refers to:
* ''The Matrix'' (franchise), an American media franchise
** ''The Matrix'', a 1999 science-fiction action film
** "The Matrix", a fictional setting, a virtual reality environment, within ''The Matrix'' (franchis ...
of
read-only memory
Read-only memory (ROM) is a type of non-volatile memory used in computers and other electronic devices. Data stored in ROM cannot be electronically modified after the manufacture of the memory device. Read-only memory is useful for storing sof ...
(ROM). The matrix resembled a two-dimensional array of fuses.
The process to embed instructions onto the matrix was to burn out the unneeded connections.
There were so many connections,
firmware programmers wrote a ''computer program'' on another chip to oversee the burning.
The technology became known as
Programmable ROM
A programmable read-only memory (PROM) is a form of digital memory where the contents can be changed once after manufacture of the device. The data is then permanent and cannot be changed. It is one type of read-only memory (ROM). PROMs are used ...
. In 1971, Intel
installed the computer program onto the chip and named it the
Intel 4004
The Intel 4004 is a 4-bit central processing unit (CPU) released by Intel Corporation in 1971. Sold for US$60, it was the first commercially produced microprocessor, and the first in a long line of Intel CPUs.
The 4004 was the first significa ...
microprocessor
A microprocessor is a computer processor where the data processing logic and control is included on a single integrated circuit, or a small number of integrated circuits. The microprocessor contains the arithmetic, logic, and control circ ...
.
The terms ''microprocessor'' and
central processing unit
A central processing unit (CPU), also called a central processor, main processor or just Processor (computing), processor, is the electronic circuitry that executes Instruction (computing), instructions comprising a computer program. The CPU per ...
(CPU) are now used interchangeably. However, CPUs predate microprocessors. For example, the
IBM System/360 (1964) had a CPU made from
circuit boards containing discrete components on ceramic substrates.
Sac State 8008
The Intel 4004 (1971) was a 4-
bit
The bit is the most basic unit of information in computing and digital communications. The name is a portmanteau of binary digit. The bit represents a logical state with one of two possible values. These values are most commonly represente ...
microprocessor designed to run the
Busicom calculator. Five months after its release, Intel released the
Intel 8008
The Intel 8008 ("''eight-thousand-eight''" or "''eighty-oh-eight''") is an early byte-oriented microprocessor designed by Computer Terminal Corporation (CTC), implemented and manufactured by Intel, and introduced in April 1972. It is an 8-bit CP ...
, an 8-bit microprocessor. Bill Pentz led a team at
Sacramento State
California State University, Sacramento (CSUS, Sacramento State, or informally Sac State) is a public university in Sacramento, California. Founded in 1947 as Sacramento State College, it is the eleventh oldest school in the 23-campus California ...
to build the first
microcomputer using the Intel 8008: the ''Sac State 8008'' (1972).
Its purpose was to store patient medical records. The computer supported a
disk operating system to run a
Memorex
Memorex Corp. began as a computer tape producer and expanded to become both a consumer media supplier and a major IBM plug compatible peripheral supplier. It was broken up and ceased to exist after 1996 other than as a consumer electronics bran ...
, 3-
megabyte,
hard disk drive
A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magne ...
.
It had a color display and keyboard that was packaged in a single console. The disk operating system was programmed using
IBM's Basic Assembly Language (BAL). The medical records application was programmed using a
BASIC interpreter.
However, the computer was an evolutionary dead-end because it was extremely expensive. Also, it was built at a public university lab for a specific purpose.
Nonetheless, the project contributed to the development of the
Intel 8080
The Intel 8080 (''"eighty-eighty"'') is the second 8-bit microprocessor designed and manufactured by Intel. It first appeared in April 1974 and is an extended and enhanced variant of the earlier 8008 design, although without binary compatibil ...
(1974)
instruction set.
x86 series
In 1978, the modern software development environment began when Intel upgraded the
Intel 8080
The Intel 8080 (''"eighty-eighty"'') is the second 8-bit microprocessor designed and manufactured by Intel. It first appeared in April 1974 and is an extended and enhanced variant of the earlier 8008 design, although without binary compatibil ...
to the
Intel 8086. Intel simplified the Intel 8086 to manufacture the cheaper
Intel 8088
The Intel 8088 ("''eighty-eighty-eight''", also called iAPX 88) microprocessor is a variant of the Intel 8086. Introduced on June 1, 1979, the 8088 has an eight-bit external data bus instead of the 16-bit bus of the 8086. The 16-bit registers and ...
.
IBM embraced the Intel 8088 when they entered the
personal computer
A personal computer (PC) is a multi-purpose microcomputer whose size, capabilities, and price make it feasible for individual use. Personal computers are intended to be operated directly by an end user, rather than by a computer expert or tec ...
market (1981). As
consumer
A consumer is a person or a group who intends to order, or uses purchased goods, products, or services primarily for personal, social, family, household and similar needs, who is not directly related to entrepreneurial or business activities. ...
demand
In economics, demand is the quantity of a good that consumers are willing and able to purchase at various prices during a given time. The relationship between price and quantity demand is also called the demand curve. Demand for a specific item ...
for personal computers increased, so did Intel's microprocessor development. The succession of development is known as the
x86 series. The
x86 assembly language
x86 assembly language is the name for the family of assembly languages which provide some level of backward compatibility with CPUs back to the Intel 8008 microprocessor, which was launched in April 1972. It is used to produce object code for t ...
is a family of
backward-compatible
Backward compatibility (sometimes known as backwards compatibility) is a property of an operating system, product, or technology that allows for interoperability with an older legacy system, or with input designed for such a system, especially in ...
machine instruction
In computer programming, machine code is any low-level programming language, consisting of machine language instructions, which are used to control a computer's central processing unit (CPU). Each instruction causes the CPU to perform a very ...
s. Machine instructions created in earlier microprocessors were retained throughout microprocessor upgrades. This enabled consumers to purchase new computers without having to purchase new
application software
Application may refer to:
Mathematics and computing
* Application software, computer software designed to help the user to perform specific tasks
** Application layer, an abstraction layer that specifies protocols and interface methods used in a ...
. The major categories of instructions are:
* Memory instructions to set and access numbers and
strings
String or strings may refer to:
*String (structure), a long flexible structure made from threads twisted together, which is used to tie, bind, or hang other objects
Arts, entertainment, and media Films
* ''Strings'' (1991 film), a Canadian anim ...
in
random-access memory
Random-access memory (RAM; ) is a form of computer memory that can be read and changed in any order, typically used to store working data and machine code. A random-access memory device allows data items to be read or written in almost the ...
.
* Integer
arithmetic logic unit (ALU) instructions to perform the primary arithmetic operations on
integers
An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the language ...
.
* Floating point ALU instructions to perform the primary arithmetic operations on
real number
In mathematics, a real number is a number that can be used to measure a ''continuous'' one-dimensional quantity such as a distance, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small variations. Every ...
s.
*
Call stack
In computer science, a call stack is a stack data structure that stores information about the active subroutines of a computer program. This kind of stack is also known as an execution stack, program stack, control stack, run-time stack, or ma ...
instructions to push and pop
words
A word is a basic element of language that carries an objective or practical meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no conse ...
needed to allocate memory and interface with
functions.
*
Single instruction, multiple data (SIMD) instructions to increase speed when multiple processors are available to perform the same
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algorithms are used as specificat ...
on an
array of data.
Changing programming environment
VLSI circuits enabled the
programming environment
An integrated development environment (IDE) is a software application that provides comprehensive facilities to computer programmers for software development. An IDE normally consists of at least a source code editor, build automation tools a ...
to advance from a
computer terminal
A computer terminal is an electronic or electromechanical hardware device that can be used for entering data into, and transcribing data from, a computer or a computing system. The teletype was an example of an early-day hard-copy terminal and ...
(until the 1990s) to a
graphical user interface
The GUI ( "UI" by itself is still usually pronounced . or ), graphical user interface, is a form of user interface that allows users to interact with electronic devices through graphical icons and audio indicator such as primary notation, inst ...
(GUI) computer. Computer terminals limited programmers to a single
shell
Shell may refer to:
Architecture and design
* Shell (structure), a thin structure
** Concrete shell, a thin shell of concrete, usually with no interior columns or exterior buttresses
** Thin-shell structure
Science Biology
* Seashell, a hard o ...
running in a
command-line environment. During the 1970s, full-screen source code editing became possible through a
text-based user interface
In computing, text-based user interfaces (TUI) (alternately terminal user interfaces, to reflect a dependence upon the properties of computer terminals and not just text), is a retronym describing a type of user interface (UI) common as an ear ...
. Regardless of the technology available, the goal is to program in a
programming language
A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language.
The description of a programming ...
.
Programming paradigms and languages
Programming language
A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language.
The description of a programming ...
features exist to provide building blocks to be combined to express programming ideals.
Ideally, a programming language should:
* express ideas directly in the code.
* express independent ideas independently.
* express relationships among ideas directly in the code.
* combine ideas freely.
* combine ideas only where combinations make sense.
* express simple ideas simply.
The programming style of a programming language to provide these building blocks may be categorized into
programming paradigm
Programming paradigms are a way to classify programming languages based on their features. Languages can be classified into multiple paradigms.
Some paradigms are concerned mainly with implications for the execution model of the language, suc ...
s.
For example, different paradigms may differentiate:
*
procedural languages,
functional languages
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that m ...
, and
logical languages.
* different levels of
data abstraction.
* different levels of
class hierarchy
A class hierarchy or inheritance tree in computer science is a classification of object types, denoting objects as the instantiations of classes (class is like a blueprint, the object is what is built from that blueprint) inter-relating the vario ...
.
* different levels of input
datatypes
In computer science and computer programming, a data type (or simply type) is a set of possible values and a set of allowed operations on it. A data type tells the compiler or interpreter how the programmer intends to use the data. Most progra ...
, as in
container types and
generic programming
Generic programming is a style of computer programming in which algorithms are written in terms of types ''to-be-specified-later'' that are then ''instantiated'' when needed for specific types provided as parameters. This approach, pioneered b ...
.
Each of these programming styles has contributed to the synthesis of different ''programming languages''.
A ''programming language'' is a set of
keywords,
symbols,
identifiers
An identifier is a name that identifies (that is, labels the identity of) either a unique object or a unique ''class'' of objects, where the "object" or class may be an idea, physical countable object (or class thereof), or physical noncountable ...
, and rules by which programmers can communicate instructions to the computer.
They follow a set of rules called a
syntax.
* ''Keywords'' are reserved words to form
declarations and
statements
Statement or statements may refer to: Common uses
*Statement (computer science), the smallest standalone element of an imperative programming language
*Statement (logic), declarative sentence that is either true or false
*Statement, a declarative ...
.
* ''Symbols'' are characters to form
operations,
assignments,
control flow
In computer science, control flow (or flow of control) is the order in which individual statements, instructions or function calls of an imperative program are executed or evaluated. The emphasis on explicit control flow distinguishes an ''im ...
, and
delimiters.
* ''Identifiers'' are words created by programmers to form
constants,
variable names,
structure names, and
function names.
* ''Syntax Rules'' are defined in the
Backus–Naur form.
''Programming languages'' get their basis from
formal language
In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules.
The alphabet of a formal language consists of sy ...
s.
The purpose of defining a solution in terms of its ''formal language'' is to generate an
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algorithms are used as specificat ...
to solve the underlining problem.
An ''algorithm'' is a sequence of simple instructions that solve a problem.
Generations of programming language
The evolution of programming language began when the
EDSAC (1949) used the first
stored computer program in its
von Neumann architecture
The von Neumann architecture — also known as the von Neumann model or Princeton architecture — is a computer architecture based on a 1945 description by John von Neumann, and by others, in the '' First Draft of a Report on the EDVAC''. T ...
.
Programming the EDSAC was in the first
generation of programming language.
* The
first generation of programming language is
machine language
In computer programming, machine code is any low-level programming language, consisting of machine language instructions, which are used to control a computer's central processing unit (CPU). Each instruction causes the CPU to perform a very ...
.
''Machine language'' requires the programmer to enter instructions using ''instruction numbers'' called
machine code
In computer programming, machine code is any low-level programming language, consisting of machine language instructions, which are used to control a computer's central processing unit (CPU). Each instruction causes the CPU to perform a ve ...
. For example, the ADD operation on the
PDP-11 has instruction number 24576.
* The
second generation of programming language is
assembly language.
''Assembly language'' allows the programmer to use
mnemonic
A mnemonic ( ) device, or memory device, is any learning technique that aids information retention or retrieval (remembering) in the human memory for better understanding.
Mnemonics make use of elaborative encoding, retrieval cues, and imag ...
instructions instead of remembering instruction numbers. An
assembler
Assembler may refer to:
Arts and media
* Nobukazu Takemura, avant-garde electronic musician, stage name Assembler
* Assemblers, a fictional race in the ''Star Wars'' universe
* Assemblers, an alternative name of the superhero group Champions of ...
translates each assembly language mnemonic into its machine language number. For example, on the PDP-11, the operation 24576 can be referenced as ADD in the source code.
The four basic arithmetic operations have assembly instructions like ADD, SUB, MUL, and DIV.
Computers also have instructions like DW (Define
Word
A word is a basic element of language that carries an objective or practical meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no conse ...
) to reserve
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembered ...
cells. Then the MOV instruction can copy
integer
An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the languag ...
s between
registers and memory.
:* The basic structure of an assembly language statement is a label, operation, operand, and comment.
::* ''Labels'' allow the programmer to work with
variable names. The assembler will later translate labels into physical
memory address
In computing, a memory address is a reference to a specific memory location used at various levels by software and hardware. Memory addresses are fixed-length sequences of digits conventionally displayed and manipulated as unsigned integers. ...
es.
::* ''Operations'' allow the programmer to work with mnemonics. The assembler will later translate mnemonics into instruction numbers.
::* ''Operands'' tell the assembler which data the operation will process.
::* ''Comments'' allow the programmer to articulate a narrative because the instructions alone are vague.
:: The key characteristic of an assembly language program is it forms a one-to-one mapping to its corresponding machine language target.
* The
third generation of programming language uses
compiler
In computing, a compiler is a computer program that translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primarily used for programs tha ...
s and
interpreters
Interpreting is a translational activity in which one produces a first and final target-language output on the basis of a one-time exposure to an expression in a source language.
The most common two modes of interpreting are simultaneous interp ...
to execute computer programs. The distinguishing feature of a ''third generation'' language is its independence from particular hardware.
Early languages include
Fortran (1958),
COBOL (1959),
ALGOL
ALGOL (; short for "Algorithmic Language") is a family of imperative computer programming languages originally developed in 1958. ALGOL heavily influenced many other languages and was the standard method for algorithm description used by the ...
(1960), and
BASIC (1964).
In 1973, the
C programming language emerged as a
high-level language
In computer science, a high-level programming language is a programming language with strong abstraction from the details of the computer. In contrast to low-level programming languages, it may use natural language ''elements'', be easier to use, ...
that produced efficient machine language instructions.
Whereas ''third-generation'' languages historically generated many machine instructions for each statement,
C has statements that may generate a single machine instruction. Moreover, an
optimizing compiler
In computing, an optimizing compiler is a compiler that tries to minimize or maximize some attributes of an executable computer program. Common requirements are to minimize a program's execution time, memory footprint, storage size, and power cons ...
might overrule the programmer and produce fewer machine instructions than statements. Today, an entire
paradigm of languages fill the
imperative, ''third generation'' spectrum.
* The
fourth generation of programming language emphasizes what output results are desired, rather than how programming statements should be constructed.
Declarative languages attempt to limit
side effects
In medicine, a side effect is an effect, whether therapeutic or adverse, that is secondary to the one intended; although the term is predominantly employed to describe adverse effects, it can also apply to beneficial, but unintended, consequence ...
and allow programmers to write code with relatively few errors.
One popular ''fourth generation'' language is called
Structured Query Language (SQL).
Database
In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage. The design of databases s ...
developers no longer need to process each database record one at a time. Also, a simple instruction can generate output records without having to understand how it's retrieved.
Imperative languages
''Imperative languages'' specify a sequential
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algorithms are used as specificat ...
using
declarations,
expressions, and
statements
Statement or statements may refer to: Common uses
*Statement (computer science), the smallest standalone element of an imperative programming language
*Statement (logic), declarative sentence that is either true or false
*Statement, a declarative ...
:
* A ''declaration'' introduces a
variable
Variable may refer to:
* Variable (computer science), a symbolic name associated with a value and whose associated value may be changed
* Variable (mathematics), a symbol that represents a quantity in a mathematical expression, as used in many ...
name to the ''computer program'' and assigns it to a
datatype – for example:
var x: integer;
* An ''expression'' yields a value – for example:
2 + 2
yields 4
* A ''statement'' might
assign an expression to a variable or use the value of a variable to alter the program's
control flow
In computer science, control flow (or flow of control) is the order in which individual statements, instructions or function calls of an imperative program are executed or evaluated. The emphasis on explicit control flow distinguishes an ''im ...
– for example:
x := 2 + 2; if x = 4 then do_something();
Fortran
FORTRAN (1958) was unveiled as "The IBM Mathematical FORmula TRANslating system." It was designed for scientific calculations, without
string handling facilities. Along with
declarations,
expressions, and
statements
Statement or statements may refer to: Common uses
*Statement (computer science), the smallest standalone element of an imperative programming language
*Statement (logic), declarative sentence that is either true or false
*Statement, a declarative ...
, it supported:
*
arrays
An array is a systematic arrangement of similar objects, usually in rows and columns.
Things called an array include:
{{TOC right
Music
* In twelve-tone and serial composition, the presentation of simultaneous twelve-tone sets such that the ...
.
*
subroutine
In computer programming, a function or subroutine is a sequence of program instructions that performs a specific task, packaged as a unit. This unit can then be used in programs wherever that particular task should be performed.
Functions may ...
s.
*
"do" loops.
It succeeded because:
* programming and debugging costs were below computer running costs.
* it was supported by IBM.
* applications at the time were scientific.
However, non-IBM vendors also wrote Fortran compilers, but with a syntax that would likely fail IBM's compiler.
The
American National Standards Institute (ANSI) developed the first Fortran standard in 1966. In 1978, Fortran 77 became the standard until 1991. Fortran 90 supports:
*
records
A record, recording or records may refer to:
An item or collection of data Computing
* Record (computer science), a data structure
** Record, or row (database), a set of fields in a database related to one entity
** Boot sector or boot record, r ...
.
*
pointers
Pointer may refer to:
Places
* Pointer, Kentucky
* Pointers, New Jersey
* Pointers Airport, Wasco County, Oregon, United States
* The Pointers, a pair of rocks off Antarctica
People with the name
* Pointer (surname), a surname (including a lis ...
to arrays.
COBOL
COBOL (1959) stands for "COmmon Business Oriented Language." Fortran manipulated symbols. It was soon realized that symbols didn't need to be numbers, so strings were introduced.
The
US Department of Defense
The United States Department of Defense (DoD, USDOD or DOD) is an executive branch department of the federal government charged with coordinating and supervising all agencies and functions of the government directly related to national sec ...
influenced COBOL's development, with
Grace Hopper
Grace Brewster Hopper (; December 9, 1906 – January 1, 1992) was an American computer scientist, mathematician, and United States Navy rear admiral. One of the first programmers of the Harvard Mark I computer, she was a pioneer of compu ...
being a major contributor. The statements were English-like and verbose. The goal was to design a language so managers could read the programs. However, the lack of structured statements hindered this goal.
COBOL's development was tightly controlled, so dialects didn't emerge to require ANSI standards. As a consequence, it wasn't changed for 15 years until 1974. The 1990s version did make consequential changes, like
object-oriented programming
Object-oriented programming (OOP) is a programming paradigm based on the concept of "objects", which can contain data and code. The data is in the form of fields (often known as attributes or ''properties''), and the code is in the form of ...
.
Algol
ALGOL
ALGOL (; short for "Algorithmic Language") is a family of imperative computer programming languages originally developed in 1958. ALGOL heavily influenced many other languages and was the standard method for algorithm description used by the ...
(1960) stands for "ALGOrithmic Language." It had a profound influence on programming language design.
Emerging from a committee of European and American programming language experts, it used standard mathematical notation and had a readable structured design. Algol was first to define its
syntax using the
Backus–Naur form.
This led to
syntax-directed compilers. It added features like:
* block structure, where variables were local to their block.
* arrays with variable bounds.
*
"for" loops.
*
functions.
*
recursion
Recursion (adjective: ''recursive'') occurs when a thing is defined in terms of itself or of its type. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in mathemati ...
.
Algol's direct descendants include
Pascal,
Modula-2,
Ada
Ada may refer to:
Places
Africa
* Ada Foah, a town in Ghana
* Ada (Ghana parliament constituency)
* Ada, Osun, a town in Nigeria
Asia
* Ada, Urmia, a village in West Azerbaijan Province, Iran
* Ada, Karaman, a village in Karaman Province, ...
,
Delphi and
Oberon
Oberon () is a king of the fairies in medieval and Renaissance literature. He is best known as a character in William Shakespeare's play ''A Midsummer Night's Dream'', in which he is King of the Fairies and spouse of Titania, Queen of the Fairi ...
on one branch. On another branch there's
C,
C++
C++ (pronounced "C plus plus") is a high-level general-purpose programming language created by Danish computer scientist Bjarne Stroustrup as an extension of the C programming language, or "C with Classes". The language has expanded significan ...
and
Java
Java (; id, Jawa, ; jv, ꦗꦮ; su, ) is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea to the north. With a population of 151.6 million people, Java is the world's mos ...
.
Basic
BASIC (1964) stands for "Beginner's All-Purpose Symbolic Instruction Code." It was developed at
Dartmouth College
Dartmouth College (; ) is a private research university in Hanover, New Hampshire. Established in 1769 by Eleazar Wheelock, it is one of the nine colonial colleges chartered before the American Revolution. Although founded to educate Native ...
for all of their students to learn.
If a student didn't go on to a more powerful language, the student would still remember Basic.
A Basic interpreter was installed in the
microcomputers
A microcomputer is a small, relatively inexpensive computer having a central processing unit (CPU) made out of a microprocessor. The computer also includes memory and input/output (I/O) circuitry together mounted on a printed circuit board (PC ...
manufactured in the late 1970s. As the microcomputer industry grew, so did the language.
Basic pioneered the
interactive session.
It offered
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
commands within its environment:
* The 'new' command created an empty slate.
* Statements evaluated immediately.
* Statements could be programmed by preceding them with a line number.
* The 'list' command displayed the program.
* The 'run' command executed the program.
However, the Basic syntax was too simple for large programs.
Recent dialects added structure and object-oriented extensions.
Microsoft's
Microsoft Corporation is an American multinational corporation, multinational technology company, technology corporation producing Software, computer software, consumer electronics, personal computers, and related services headquartered at th ...
Visual Basic Visual Basic is a name for a family of programming languages from Microsoft. It may refer to:
* Visual Basic .NET (now simply referred to as "Visual Basic"), the current version of Visual Basic launched in 2002 which runs on .NET
* Visual Basic ( ...
is still widely used and produces a
graphical user interface
The GUI ( "UI" by itself is still usually pronounced . or ), graphical user interface, is a form of user interface that allows users to interact with electronic devices through graphical icons and audio indicator such as primary notation, inst ...
.
C
C programming language (1973) got its name because the language
BCPL was replaced with
B, and
AT&T Bell Labs
Nokia Bell Labs, originally named Bell Telephone Laboratories (1925–1984),
then AT&T Bell Laboratories (1984–1996)
and Bell Labs Innovations (1996–2007),
is an American industrial research and scientific development company owned by mult ...
called the next version "C." Its purpose was to write the
UNIX
Unix (; trademarked as UNIX) is a family of multitasking, multiuser computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, an ...
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
.
C is a relatively small language, making it easy to write compilers. Its growth mirrored the hardware growth in the 1980s.
Its growth also was because it has the facilities of
assembly language, but uses a
high-level syntax. It added advanced features like:
*
inline assembler In computer programming, an inline assembler is a feature of some compilers that allows low-level code written in assembly language to be embedded within a program, among code that otherwise has been compiled from a higher-level language such as C ...
.
* arithmetic on pointers.
* pointers to functions.
* bit operations.
* freely combining complex
operators
Operator may refer to:
Mathematics
* A symbol indicating a mathematical operation
* Logical operator or logical connective in mathematical logic
* Operator (mathematics), mapping that acts on elements of a space to produce elements of another sp ...
.
''C'' allows the programmer to control which region of memory data is to be stored. ''Global variables'' and ''static variables'' require the fewest
clock cycles
In electronics and especially synchronous digital circuits, a clock signal (historically also known as ''logic beat'') oscillates between a high and a low state and is used like a metronome to coordinate actions of digital circuits.
A clock signa ...
to store. The
stack is automatically used for the standard variable
declarations.
Heap memory is returned to a
pointer variable from the
malloc()
function.
* The ''global and static data'' region is located just above the ''program'' region. (The program region is technically called the ''text'' region. It's where machine instructions are stored.)
:* The global and static data region is technically two regions.
One region is called the ''initialized
data segment
In computing, a data segment (often denoted .data) is a portion of an object file or the corresponding address space of a program that contains initialized static variables, that is, global variables and static local variables. The size of thi ...
'', where variables declared with default values are stored. The other region is called the ''
block started by segment'', where variables declared without default values are stored.
:* Variables stored in the ''global and static data'' region have their
addresses set at compile-time. They retain their values throughout the life of the process.
:* The global and static region stores the ''global variables'' that are declared on top of (outside) the
main()
function.
Global variables are visible to
main()
and every other function in the source code.
: On the other hand, variable declarations inside of
main()
, other functions, or within
block delimiters are ''local variables''. Local variables also include ''
formal parameter variables''. Parameter variables are enclosed within the parenthesis of function definitions.
They provide an
interface
Interface or interfacing may refer to:
Academic journals
* ''Interface'' (journal), by the Electrochemical Society
* '' Interface, Journal of Applied Linguistics'', now merged with ''ITL International Journal of Applied Linguistics''
* '' Int ...
to the function.
:* ''Local variables'' declared using the
static
prefix are also stored in the ''global and static data'' region.
Unlike global variables, static variables are only visible within the function or block. Static variables always retain their value. An example usage would be the function
int increment_counter()
* The
stack region is a contiguous block of memory located near the top memory address.
Variables placed in the stack are populated from top to bottom.
A
stack pointer
In computer science, a call stack is a stack data structure that stores information about the active subroutines of a computer program. This kind of stack is also known as an execution stack, program stack, control stack, run-time stack, or mach ...
is a special-purpose
register
Register or registration may refer to:
Arts entertainment, and media Music
* Register (music), the relative "height" or range of a note, melody, part, instrument, etc.
* ''Register'', a 2017 album by Travis Miller
* Registration (organ), th ...
that keeps track of the last memory address populated.
Variables are placed into the stack via the ''assembly language'' PUSH instruction. Therefore, the addresses of these variables are set during
runtime. The method for stack variables to lose their
scope
Scope or scopes may refer to:
People with the surname
* Jamie Scope (born 1986), English footballer
* John T. Scopes (1900–1970), central figure in the Scopes Trial regarding the teaching of evolution
Arts, media, and entertainment
* Cinem ...
is via the POP instruction.
:* ''Local variables'' declared without the
static
prefix, including formal parameter variables,
are called ''automatic variables''
and are stored in the stack.
They are visible inside the function or block and lose their scope upon exiting the function or block.
* The
heap region is located below the stack.
It is populated from the bottom to the top. The
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
manages the heap using a ''heap pointer'' and a list of allocated memory blocks.
Like the stack, the addresses of heap variables are set during runtime. An
out of memory
Out of memory (OOM) is an often undesired state of computer operation where no additional memory can be allocated for use by programs or the operating system. Such a system will be unable to load any additional programs, and since many programs ...
error occurs when the heap pointer and the stack pointer meet.
:* ''C'' provides the
malloc()
library function to
allocate heap memory.
Populating the heap with data is an additional copy function. Variables stored in the heap are economically passed to functions using pointers. Without pointers, the entire block of data would have to be passed to the function via the stack.
C++
In the 1970s,
software engineers
Software engineering is a systematic engineering approach to software development.
A software engineer is a person who applies the principles of software engineering to design, develop, maintain, test, and evaluate computer software. The term '' ...
needed language support to break large projects down into
modules
Broadly speaking, modularity is the degree to which a system's components may be separated and recombined, often with the benefit of flexibility and variety in use. The concept of modularity is used primarily to reduce complexity by breaking a s ...
.
One obvious feature was to decompose large projects ''physically'' into separate
files. A less obvious feature was to decompose large projects ''logically'' into
abstract datatypes
In computer science and computer programming, a data type (or simply type) is a set of possible values and a set of allowed operations on it. A data type tells the compiler or interpreter how the programmer intends to use the data. Most progra ...
.
At the time, languages supported concrete (
scalar
Scalar may refer to:
*Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers
* Scalar (physics), a physical quantity that can be described by a single element of a number field such ...
) datatypes like
integer
An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the languag ...
numbers,
floating-point numbers, and
strings
String or strings may refer to:
*String (structure), a long flexible structure made from threads twisted together, which is used to tie, bind, or hang other objects
Arts, entertainment, and media Films
* ''Strings'' (1991 film), a Canadian anim ...
of
characters
Character or Characters may refer to:
Arts, entertainment, and media Literature
* ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk
* ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
. Concrete datatypes have their representation as part of their name.
Abstract datatypes are
structures
A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such as ...
of concrete datatypes, with a new name assigned. For example, a
list
A ''list'' is any set of items in a row. List or lists may also refer to:
People
* List (surname)
Organizations
* List College, an undergraduate division of the Jewish Theological Seminary of America
* SC Germania List, German rugby unio ...
of integers could be called
integer_list
.
In object-oriented jargon, abstract datatypes are called
classes. However, a ''class'' is only a definition; no memory is allocated. When memory is allocated to a class, it's called an
object
Object may refer to:
General meanings
* Object (philosophy), a thing, being, or concept
** Object (abstract), an object which does not exist at any particular time or place
** Physical object, an identifiable collection of matter
* Goal, an ...
.
''
Object-oriented imperative languages'' developed by combining the need for classes and the need for safe
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that ...
.
A function, in an object-oriented language, is assigned to a class. An assigned function is then referred to as a
method
Method ( grc, μέθοδος, methodos) literally means a pursuit of knowledge, investigation, mode of prosecuting such inquiry, or system. In recent centuries it more often means a prescribed process for completing a task. It may refer to:
*Scien ...
,
member function
A method in object-oriented programming (OOP) is a procedure associated with a message and an object. An object consists of ''state data'' and ''behavior''; these compose an ''interface'', which specifies how the object may be utilized by any of ...
, or ''
operation
Operation or Operations may refer to:
Arts, entertainment and media
* ''Operation'' (game), a battery-operated board game that challenges dexterity
* Operation (music), a term used in musical set theory
* ''Operations'' (magazine), Multi-Ma ...
''. ''Object-oriented programming'' is executing ''operations'' on ''objects''.
''Object-oriented languages'' support a syntax to model
subset/superset relationships. In
set theory
Set theory is the branch of mathematical logic that studies sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, is mostly conce ...
, an
element of a subset inherits all the attributes contained in the superset. For example, a student is a person. Therefore, the set of students is a subset of the set of persons. As a result, students inherit all the attributes common to all persons. Additionally, students have unique attributes that other people don't have. ''Object-oriented languages'' model ''subset/superset'' relationships using
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Officia ...
.
''Object-oriented programming'' became the dominant language paradigm by the late 1990s.
C++
C++ (pronounced "C plus plus") is a high-level general-purpose programming language created by Danish computer scientist Bjarne Stroustrup as an extension of the C programming language, or "C with Classes". The language has expanded significan ...
(1985) was originally called "C with Classes."
It was designed to expand
C's C's may refer to:
* The Boston Celtics, American basketball franchise
* Citizens (Spanish political party)
Citizens ( es, link=no, Ciudadanos ; ca, link=no, Ciutadans ; eu, link=no, Hiritarrak; gl, link=no, Cidadáns; shortened as Cs—C's un ...
capabilities by adding the object-oriented facilities of the language
Simula.
An object-oriented module is composed of two files. The definitions file is called the
header file
Many programming languages and other computer files have a directive, often called include (sometimes copy or import), that causes the contents of the specified file to be inserted into the original file. These included files are called copybooks ...
. Here is a C++ ''header file'' for the ''GRADE class'' in a simple school application:
// grade.h
// -------
// Used to allow multiple source files to include
// this header file without duplication errors.
// ----------------------------------------------
#ifndef GRADE_H
#define GRADE_H
class GRADE ;
#endif
A
constructor operation is a function with the same name as the class name.
It is executed when the calling operation executes the
new
statement.
A module's other file is the ''
source file
In computing, source code, or simply code, is any collection of code, with or without comments, written using a human-readable programming language, usually as plain text. The source code of a program is specially designed to facilitate the wo ...
''. Here is a C++ source file for the ''GRADE class'' in a simple school application:
// grade.cpp
// ---------
#include "grade.h"
GRADE::GRADE( const char letter )
int GRADE::grade_numeric( const char letter )
Here is a C++ ''header file'' for the ''PERSON class'' in a simple school application:
// person.h
// --------
#ifndef PERSON_H
#define PERSON_H
class PERSON ;
#endif
Here is a C++ ''source file'' for the ''PERSON class'' in a simple school application:
// person.cpp
// ----------
#include "person.h"
PERSON::PERSON ( const char *name )
Here is a C++ ''header file'' for the ''STUDENT class'' in a simple school application:
// student.h
// ---------
#ifndef STUDENT_H
#define STUDENT_H
#include "person.h"
#include "grade.h"
// A STUDENT is a subset of PERSON.
// --------------------------------
class STUDENT : public PERSON;
#endif
Here is a C++ ''source file'' for the ''STUDENT class'' in a simple school application:
// student.cpp
// -----------
#include "student.h"
#include "person.h"
STUDENT::STUDENT ( const char *name ):
// Execute the constructor of the PERSON superclass.
// -------------------------------------------------
PERSON( name )
Here is a driver program for demonstration:
// student_dvr.cpp
// ---------------
#include
#include "student.h"
int main( void )
Here is a
makefile
In software development, Make is a build automation tool that automatically builds executable programs and libraries from source code by reading files called ''Makefiles'' which specify how to derive the target program. Though integrated devel ...
to compile everything:
# makefile
# --------
all: student_dvr
clean:
rm student_dvr *.o
student_dvr: student_dvr.cpp grade.o student.o person.o
c++ student_dvr.cpp grade.o student.o person.o -o student_dvr
grade.o: grade.cpp grade.h
c++ -c grade.cpp
student.o: student.cpp student.h
c++ -c student.cpp
person.o: person.cpp person.h
c++ -c person.cpp
Declarative languages
''Imperative languages'' have one major criticism: assigning an expression to a ''non-local'' variable may produce an unintended
side effect
In medicine, a side effect is an effect, whether therapeutic or adverse, that is secondary to the one intended; although the term is predominantly employed to describe adverse effects, it can also apply to beneficial, but unintended, consequence ...
.
Declarative languages generally omit the assignment statement and the control flow. They describe ''what'' computation should be performed and not ''how'' to compute it. Two broad categories of declarative languages are
functional language
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that ...
s and
logical languages.
The principle behind a ''functional language'' is to use
lambda calculus as a guide for a well defined
semantic.
In mathematics, a function is a rule that maps elements from an ''expression'' to a range of ''values''. Consider the function:
times_10(x) = 10 * x
The ''expression''
10 * x
is mapped by the function
times_10()
to a range of ''values''. One ''value'' happens to be 20. This occurs when x is 2. So, the application of the function is mathematically written as:
times_10(2) = 20
A ''functional language'' compiler will not store this value in a variable. Instead, it will ''push'' the value onto the computer's
stack before setting the
program counter
The program counter (PC), commonly called the instruction pointer (IP) in Intel x86 and Itanium microprocessors, and sometimes called the instruction address register (IAR), the instruction counter, or just part of the instruction sequencer, is ...
back to the calling function. The calling function will then ''pop'' the value from the stack.
''Imperative languages'' do support functions. Therefore, ''functional programming'' can be achieved in an imperative language if the programmer uses discipline. However, a ''functional language'' will force this discipline onto the programmer through its syntax. Functional languages have a syntax tailored to emphasize the ''what''.
A functional program is developed with a set of primitive functions followed by a single driver function.
Consider the
snippet:
function max(a,b)
function min(a,b)
function difference_between_largest_and_smallest(a,b,c)
The primitives are
max()
and
min()
. The driver function is
difference_between_largest_and_smallest()
. Executing:
put(difference_between_largest_and_smallest(10,4,7));
will output 6.
''Functional languages'' are used in
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to Applied science, practical discipli ...
research to explore new language features.
Moreover, their lack of side-effects have made them popular in
parallel programming
Parallel computing is a type of computing, computation in which many calculations or Process (computing), processes are carried out simultaneously. Large problems can often be divided into smaller ones, which can then be solved at the same time. ...
and
concurrent programming
Concurrent means happening at the same time. Concurrency, concurrent, or concurrence may refer to:
Law
* Concurrence, in jurisprudence, the need to prove both ''actus reus'' and ''mens rea''
* Concurring opinion (also called a "concurrence"), ...
.
However, application developers prefer the
object-oriented features of ''imperative languages''.
Lisp
Lisp (1958) stands for "LISt Processor."
It is tailored to process
lists
A ''list'' is any set of items in a row. List or lists may also refer to:
People
* List (surname)
Organizations
* List College, an undergraduate division of the Jewish Theological Seminary of America
* SC Germania List, German rugby unio ...
. A full structure of the data is formed by building lists of lists. In memory, a
tree data structure
In computer science, a tree is a widely used abstract data type that represents a hierarchical tree structure with a set of connected nodes. Each node in the tree can be connected to many children (depending on the type of tree), but must be c ...
is built. Internally, the tree structure lends nicely for
recursive
Recursion (adjective: ''recursive'') occurs when a thing is defined in terms of itself or of its type. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in mathematics ...
functions.
The syntax to build a tree is to enclose the space-separated
elements within parenthesis. The following is a
list
A ''list'' is any set of items in a row. List or lists may also refer to:
People
* List (surname)
Organizations
* List College, an undergraduate division of the Jewish Theological Seminary of America
* SC Germania List, German rugby unio ...
of three elements. The first two elements are themselves lists of two elements:
((A B) (HELLO WORLD) 94)
Lisp has functions to extract and reconstruct elements.
The function
head()
returns a list containing the first element in the list. The function
tail()
returns a list containing everything but the first element. The function
cons()
returns a list that is the concatenation of other lists. Therefore, the following expression will return the list
x
:
cons(head(x), tail(x))
One drawback of Lisp is when many functions are nested, the parentheses may look confusing.
Modern Lisp
environments help ensure parenthesis match. As an aside, Lisp does support the ''imperative language'' operations of the assignment statement and goto loops.
Also, ''Lisp'' is not concerned with the
datatype of the elements at compile time.
Instead, it assigns (and may reassign) the datatypes at
runtime. Assigning the datatype at runtime is called
dynamic binding
Dynamic binding may refer to:
* Dynamic binding (computing), also known as late binding
*Dynamic scoping in programming languages
* Dynamic binding (chemistry)
See also
*Dynamic dispatch
*Dynamic linking
In computing, a dynamic linker is the par ...
.
Whereas dynamic binding increases the language's flexibility, programming errors may linger until late in the
software development process.
Writing large, reliable, and readable Lisp programs requires forethought. If properly planned, the program may be much shorter than an equivalent ''imperative language'' program.
''Lisp'' is widely used in
artificial intelligence
Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech r ...
. However, its usage has been accepted only because it has ''imperative language'' operations, making unintended side-effects possible.
ML
ML (1973)
stands for "Meta Language." ML checks to make sure only data of the same type are compared with one another.
For example, this function has one input parameter (an integer) and returns an integer:
''ML'' is not parenthesis-eccentric like ''Lisp''. The following is an application of
times_10()
:
times_10 2
It returns "20 : int". (Both the results and the datatype are returned.)
Like ''Lisp'', ''ML'' is tailored to process lists. Unlike ''Lisp'', each element is the same datatype.
Moreover, ''ML'' assigns the datatype of an element at
compile-time
In computer science, compile time (or compile-time) describes the time window during which a computer program is compiled.
The term is used as an adjective to describe concepts related to the context of program compilation, as opposed to concept ...
. Assigning the datatype at compile-time is called
static binding
In programming languages, name binding is the association of entities (data and/or code) with identifiers. An identifier bound to an Object (computer science), object is said to Reference (computer science), reference that object. Machine languag ...
. Static binding increases reliability because the compiler checks the context of variables before they are used.
Prolog
Prolog
Prolog is a logic programming language associated with artificial intelligence and computational linguistics.
Prolog has its roots in first-order logic, a formal logic, and unlike many other programming languages, Prolog is intended primarily ...
(1972) stands for "PROgramming in LOgic." It was designed to process
natural languages.
The building blocks of a Prolog program are ''
objects
Object may refer to:
General meanings
* Object (philosophy), a thing, being, or concept
** Object (abstract), an object which does not exist at any particular time or place
** Physical object, an identifiable collection of matter
* Goal, an ...
'' and their ''relationships'' to other objects. ''Objects'' are built by stating true ''facts'' about them.
Set theory
Set theory is the branch of mathematical logic that studies sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, is mostly conce ...
''facts'' are formed by assigning objects to sets. The syntax is
setName(object).
* Cat is an animal.
:
animal(cat).
* Mouse is an animal.
:
animal(mouse).
* Tom is a cat.
:
cat(tom).
* Jerry is a mouse.
:
mouse(jerry).
Adjective
In linguistics, an adjective (abbreviated ) is a word that generally modifies a noun or noun phrase or describes its referent. Its semantic role is to change information given by the noun.
Traditionally, adjectives were considered one of the ma ...
''facts'' are formed using
adjective(object).
* Cat is big.
:
big(cat).
* Mouse is small.
:
small(mouse).
''Relationships'' are formed using multiple items inside the parentheses. In our example we have
verb(object,object)
and
verb(adjective,adjective)
.
* Mouse eats cheese.
:
eat(mouse,cheese).
* Big animals eat small animals.
:
eat(big,small).
After all the facts and relationships are entered, then a question can be asked:
: Will Tom eat Jerry?
:
?- eat(tom,jerry).
Prolog's usage has expanded to become a goal-oriented language.
In a goal-oriented application, the goal is defined by providing a list of subgoals. Then each subgoal is defined by further providing a list of its subgoals, etc. If a path of subgoals fails to find a solution, then that subgoal is
backtracked and another path is systematically attempted.
Practical applications include solving the
shortest path problem and producing
family tree
A family tree, also called a genealogy or a pedigree chart, is a chart representing family relationships in a conventional tree structure. More detailed family trees, used in medicine and social work, are known as genograms.
Representations of ...
s.
Object-oriented programming
Object-oriented programming
Object-oriented programming (OOP) is a programming paradigm based on the concept of "objects", which can contain data and code. The data is in the form of fields (often known as attributes or ''properties''), and the code is in the form of ...
is a programming method to execute
operations (
functions) on
objects
Object may refer to:
General meanings
* Object (philosophy), a thing, being, or concept
** Object (abstract), an object which does not exist at any particular time or place
** Physical object, an identifiable collection of matter
* Goal, an ...
.
The basic idea is to group the characteristics of a
phenomenon
A phenomenon ( : phenomena) is an observable event. The term came into its modern philosophical usage through Immanuel Kant, who contrasted it with the noumenon, which ''cannot'' be directly observed. Kant was heavily influenced by Gottfried ...
into an object
container
A container is any receptacle or enclosure for holding a product used in storage, packaging, and transportation, including shipping.
Things kept inside of a container are protected on several sides by being inside of its structure. The term ...
and give the container a name. The ''operations'' on the phenomenon are also grouped into the container.
''Object-oriented programming'' developed by combining the need for containers and the need for safe
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that ...
.
This programming method need not be confined to an ''object-oriented language''.
In an object-oriented language, an object container is called a
class
Class or The Class may refer to:
Common uses not otherwise categorized
* Class (biology), a taxonomic rank
* Class (knowledge representation), a collection of individuals or objects
* Class (philosophy), an analytical concept used differentl ...
. In a non-object-oriented language, a
data structure (which is also known as a
record) may become an object container. To turn a data structure into an object container, operations need to be written specifically for the structure. The resulting structure is called an
abstract datatype.
However,
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Officia ...
will be missing. Nonetheless, this shortcoming can be overcome.
Here is a
C programming language ''header file'' for the ''GRADE abstract datatype'' in a simple school application:
/* grade.h */
/* ------- */
/* Used to allow multiple source files to include */
/* This header file is without duplication errors. */
/* ---------------------------------------------- */
#ifndef GRADE_H
#define GRADE_H
typedef struct
GRADE;
/* Constructor */
/* ----------- */
GRADE *grade_new( char letter );
int grade_numeric( char letter );
#endif
The
grade_new()
function performs the same algorithm as the C++
constructor operation.
Here is a C programming language ''
source file
In computing, source code, or simply code, is any collection of code, with or without comments, written using a human-readable programming language, usually as plain text. The source code of a program is specially designed to facilitate the wo ...
'' for the ''GRADE abstract datatype'' in a simple school application:
/* grade.c */
/* ------- */
#include "grade.h"
GRADE *grade_new( char letter )
int grade_numeric( char letter )
In the constructor, the function
calloc()
is used instead of
malloc()
because each memory cell will be set to zero.
Here is a C programming language ''header file'' for the ''PERSON abstract datatype'' in a simple school application:
/* person.h */
/* -------- */
#ifndef PERSON_H
#define PERSON_H
typedef struct
PERSON;
/* Constructor */
/* ----------- */
PERSON *person_new( char *name );
#endif
Here is a C programming language ''source file'' for the ''PERSON abstract datatype'' in a simple school application:
/* person.c */
/* -------- */
#include "person.h"
PERSON *person_new( char *name )
Here is a C programming language ''header file'' for the ''STUDENT abstract datatype'' in a simple school application:
/* student.h */
/* --------- */
#ifndef STUDENT_H
#define STUDENT_H
#include "person.h"
#include "grade.h"
typedef struct
STUDENT;
/* Constructor */
/* ----------- */
STUDENT *student_new( char *name );
#endif
Here is a C programming language ''source file'' for the ''STUDENT abstract datatype'' in a simple school application:
/* student.c */
/* --------- */
#include "student.h"
#include "person.h"
STUDENT *student_new( char *name )
Here is a driver program for demonstration:
/* student_dvr.c */
/* ------------- */
#include
#include "student.h"
int main( void )
Here is a
makefile
In software development, Make is a build automation tool that automatically builds executable programs and libraries from source code by reading files called ''Makefiles'' which specify how to derive the target program. Though integrated devel ...
to compile everything:
# makefile
# --------
all: student_dvr
clean:
rm student_dvr *.o
student_dvr: student_dvr.c grade.o student.o person.o
gcc student_dvr.c grade.o student.o person.o -o student_dvr
grade.o: grade.c grade.h
gcc -c grade.c
student.o: student.c student.h
gcc -c student.c
person.o: person.c person.h
gcc -c person.c
The formal strategy to build object-oriented objects is to:
* Identify the objects. Most likely these will be nouns.
* Identify each object's attributes. What helps to describe the object?
* Identify each object's actions. Most likely these will be verbs.
* Identify the relationships from object to object. Most likely these will be verbs.
For example:
* A person is a human identified by a name.
* A grade is an achievement identified by a letter.
* A student is a person who earns a grade.
Syntax and semantics
The
syntax of a programming language is a
list
A ''list'' is any set of items in a row. List or lists may also refer to:
People
* List (surname)
Organizations
* List College, an undergraduate division of the Jewish Theological Seminary of America
* SC Germania List, German rugby unio ...
of
production rules which govern its ''form''.
A programming language's ''form'' is the correct placement of its
declarations,
expressions, and
statements
Statement or statements may refer to: Common uses
*Statement (computer science), the smallest standalone element of an imperative programming language
*Statement (logic), declarative sentence that is either true or false
*Statement, a declarative ...
.
Complementing the syntax of a language are its
semantics
Semantics (from grc, σημαντικός ''sēmantikós'', "significant") is the study of reference, meaning, or truth. The term can be used to refer to subfields of several distinct disciplines, including philosophy, linguistics and comp ...
. The ''semantics'' describe the meanings attached to various syntactic constructs.
A syntactic construct may need a semantic description because a form may have an invalid interpretation.
Also, different languages might have the same syntax; however, their behaviors may be different.
The syntax of a language is formally described by listing the ''production rules''. Whereas the syntax of a
natural language is extremely complicated, a subset of the English language can have this production rule listing:
# a sentence is made up of a noun-phrase followed by a verb-phrase;
# a noun-phrase is made up of an article followed by an adjective followed by a noun;
# a verb-phrase is made up of a verb followed by a noun-phrase;
# an article is 'the';
# an adjective is 'big' or
# an adjective is 'small';
# a noun is 'cat' or
# a noun is 'mouse';
# a verb is 'eats';
The words in bold-face are known as "non-terminals". The words in 'single quotes' are known as "terminals".
From this production rule listing, complete sentences may be formed using a series of replacements.
The process is to replace ''non-terminals'' with either a valid ''non-terminal'' or a valid ''terminal''. The replacement process repeats until only ''terminals'' remain. One valid sentence is:
* sentence
* noun-phrase verb-phrase
* article adjective noun verb-phrase
* ''the'' adjective noun verb-phrase
* ''the'' ''big'' noun verb-phrase
* ''the'' ''big'' ''cat'' verb-phrase
* ''the'' ''big'' ''cat'' verb noun-phrase
* ''the'' ''big'' ''cat'' ''eats'' noun-phrase
* ''the'' ''big'' ''cat'' ''eats'' article adjective noun
* ''the'' ''big'' ''cat'' ''eats'' ''the'' adjective noun
* ''the'' ''big'' ''cat'' ''eats'' ''the'' ''small'' noun
* ''the'' ''big'' ''cat'' ''eats'' ''the'' ''small'' ''mouse''
However, another combination results in an invalid sentence:
* ''the'' ''small'' ''mouse'' ''eats'' ''the'' ''big'' ''cat''
Therefore, a ''semantic'' is necessary to correctly describe the meaning of an ''eat'' activity.
One ''production rule'' listing method is called the
Backus–Naur form (BNF).
BNF describes the syntax of a language and itself has a ''syntax''. This recursive definition is an example of a
meta-language
In logic and linguistics, a metalanguage is a language used to describe another language, often called the ''object language''. Expressions in a metalanguage are often distinguished from those in the object language by the use of italics, quot ...
.
The ''syntax'' of BNF includes:
*
::=
which translates to ''is made up of a
' when a non-terminal is to its right. It translates to ''is'' when a terminal is to its right.
*
,
which translates to ''or''.
*
<
and
>
which surround non-terminals.
Using BNF, a subset of the English language can have this ''production rule'' listing:
::=
::=
::=
::= the
::= big , small
::= cat , mouse
::= eats
Using BNF, a signed-
integer
An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the languag ...
has the ''production rule'' listing:
::=
::= + , -
::= ,
::= 0 , 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9
Notice the recursive production rule:
::= ,
This allows for an infinite number of possibilities. Therefore, a ''semantic'' is necessary to describe a limitation of the number of digits.
Notice the leading zero possibility in the production rules:
::= ,
::= 0 , 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9
Therefore, a ''semantic'' is necessary to describe that leading zeros need to be ignored.
Two formal methods are available to describe ''semantics''. They are
denotational semantics and
axiomatic semantics.
Software engineering and computer programming
Software engineering
Software engineering is a systematic engineering approach to software development.
A software engineer is a person who applies the principles of software engineering to design, develop, maintain, test, and evaluate computer software. The term '' ...
is a variety of techniques to produce quality software.
Computer programming
Computer programming is the process of performing a particular computation (or more generally, accomplishing a specific computing result), usually by designing and building an executable computer program. Programming involves tasks such as anal ...
is the process of writing or editing
source code
In computing, source code, or simply code, is any collection of code, with or without comments, written using a human-readable programming language, usually as plain text. The source code of a program is specially designed to facilitate the w ...
. In a formal environment, a
systems analyst will gather information from managers about all the organization's processes to automate. This professional then prepares a
detailed plan for the new or modified system.
The plan is analogous to an architect's blueprint.
Performance objectives
The systems analyst has the objective to deliver the right information to the right person at the right time.
The critical factors to achieve this objective are:
# The quality of the output. Is the output useful for decision-making?
# The accuracy of the output. Does it reflect the true situation?
# The format of the output. Is the output easily understood?
# The speed of the output. Time-sensitive information is important when communicating with the customer in real time.
Cost objectives
Achieving performance objectives should be balanced with all of the costs, including:
# Development costs.
# Uniqueness costs. A reusable system may be expensive. However, it might be preferred over a limited-use system.
# Hardware costs.
# Operating costs.
Applying a
systems development process will mitigate the axiom: the later in the process an error is detected, the more expensive it is to correct.
Waterfall model
The
waterfall model
The waterfall model is a breakdown of project activities into linear sequential phases, meaning they are passed down onto each other, where each phase depends on the deliverables of the previous one and corresponds to a specialization of tasks. ...
is an implementation of a ''systems development process''.
As the ''waterfall'' label implies, the basic phases overlap each other:
# The ''investigation phase'' is to understand the underlying problem.
# The ''analysis phase'' is to understand the possible solutions.
# The ''design phase'' is to plan the best solution.
# The ''implementation phase'' is to program the best solution.
# The ''maintenance phase'' lasts throughout the life of the system. Changes to the system after it's deployed may be necessary.
Faults may exist, including specification faults, design faults, or coding faults. Improvements may be necessary. Adaption may be necessary to react to a changing environment.
Computer programmer
A
computer programmer is a specialist responsible for writing or modifying the source code to implement the detailed plan.
A programming team is likely to be needed because most systems are too large to be completed by a single programmer.
However, adding programmers to a project may not shorten the completion time. Instead, it may lower the quality of the system.
To be effective, program modules need to be defined and distributed to team members.
Also, team members must interact with one another in a meaningful and effective way.
Computer programmers may be ''programming-in-the-small'': programming within a single module.
Chances are a module will execute modules located in other source code files. Therefore, computer programmers may be ''programming-in-the-large'': programming modules so they will effectively couple with each other.
Programming-in-the-large includes contributing to the
application programming interface
An application programming interface (API) is a way for two or more computer programs to communicate with each other. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how t ...
(API).
Program modules
Modular programming
Modular programming is a software design technique that emphasizes separating the functionality of a Computer program, program into independent, interchangeable modules, such that each contains everything necessary to execute only one aspect of th ...
is a technique to refine ''imperative language'' programs. Refined programs may reduce the software size, separate responsibilities, and thereby mitigate
software aging
In software engineering, software aging is the tendency for software to fail or cause a system failure after running continuously for a certain time, or because of ongoing changes in systems surrounding the software. Software aging has several c ...
. A ''program module'' is a sequence of statements that are bounded within a
block
Block or blocked may refer to:
Arts, entertainment and media Broadcasting
* Block programming, the result of a programming strategy in broadcasting
* W242BX, a radio station licensed to Greenville, South Carolina, United States known as ''96.3 ...
and together identified by a name.
Modules have a ''function'', ''context'', and ''logic'':
* The ''function'' of a module is what it does.
* The ''context'' of a module are the elements being performed upon.
* The ''logic'' of a module is how it performs the function.
The module's name should be derived first by its ''function'', then by its ''context''. Its ''logic'' should not be part of the name.
For example,
function compute_square_root( x )
or
function compute_square_root_integer( i : integer )
are appropriate module names. However,
function compute_square_root_by_division( x )
is not.
The degree of interaction ''within'' a module is its level of
cohesion.
''Cohesion'' is a judgment of the relationship between a module's name and its ''function''. The degree of interaction ''between'' modules is the level of
coupling
A coupling is a device used to connect two shafts together at their ends for the purpose of transmitting power. The primary purpose of couplings is to join two pieces of rotating equipment while permitting some degree of misalignment or end mov ...
.
''Coupling'' is a judgement of the relationship between a module's ''context'' and the elements being performed upon.
Cohesion
The levels of cohesion from worst to best are:
* ''Coincidental Cohesion'': A module has coincidental cohesion if it performs multiple functions, and the functions are completely unrelated. For example,
function read_sales_record_print_next_line_convert_to_float()
. Coincidental cohesion occurs in practice if management enforces silly rules. For example, "Every module will have between 35 and 50 executable statements."
* Logical Cohesion: A module has logical cohesion if it has available a series of functions, but only one of them is executed. For example,
function perform_arithmetic( perform_addition, a, b )
.
* ''Temporal Cohesion'': A module has temporal cohesion if it performs functions related to time. One example,
function initialize_variables_and_open_files()
. Another example,
stage_one()
,
stage_two()
, ...
* ''Procedural Cohesion'': A module has procedural cohesion if it performs multiple loosely related functions. For example,
function read_part_number_update_employee_record()
.
* ''Communicational Cohesion'': A module has communicational cohesion if it performs multiple closely related functions. For example,
function read_part_number_update_sales_record()
.
* ''Informational Cohesion'': A module has informational cohesion if it performs multiple functions, but each function has its own entry and exit points. Moreover, the functions share the same data structure. Object-oriented classes work at this level.
* ''Functional Cohesion'': a module has functional cohesion if it achieves a single goal working only on local variables. Moreover, it may be reusable in other contexts.
Coupling
The levels of coupling from worst to best are:
* ''Content Coupling'': A module has content coupling if it modifies a local variable of another function. COBOL used to do this with the ''alter'' verb.
* ''Common Coupling'': A module has common coupling if it modifies a global variable.
* ''Control Coupling'': A module has control coupling if another module can modify its
control flow
In computer science, control flow (or flow of control) is the order in which individual statements, instructions or function calls of an imperative program are executed or evaluated. The emphasis on explicit control flow distinguishes an ''im ...
. For example,
perform_arithmetic( perform_addition, a, b )
. Instead, control should be on the makeup of the returned object.
* ''Stamp Coupling'': A module has stamp coupling if an element of a data structure passed as a parameter is modified. Object-oriented classes work at this level.
* '' Data Coupling'': A module has data coupling if all of its input parameters are needed and none of them are modified. Moreover, the result of the function is returned as a single object.
Data flow analysis
''Data flow analysis'' is a design method used to achieve modules of ''functional cohesion'' and ''data coupling''.
The input to the method is a
data-flow diagram
A data-flow diagram is a way of representing a flow of data through a process or a system (usually an information system). The DFD also provides information about the outputs and inputs of each entity and the process itself. A data-flow diagram h ...
. A data-flow diagram is a set of ovals representing modules. Each module's name is displayed inside its oval. Modules may be at the executable level or the function level.
The diagram also has arrows connecting modules to each other. Arrows pointing into modules represent a set of inputs. Each module should have only one arrow pointing out from it to represent its single output object. (Optionally, an additional exception arrow points out.) A
daisy chain
Daisy chain may refer to:
* Daisy chain, a garland created from daisy flowers
* Daisy chain (climbing), a type of strap
* Daisy chain (electrical engineering), a wiring scheme
* Daisy chain (fishing), a type of fishing lure
* Daisy chain (knot), ...
of ovals will convey an entire
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algorithms are used as specificat ...
. The input modules should start the diagram. The input modules should connect to the transform modules. The transform modules should connect to the output modules.
Functional categories
''Computer programs'' may be categorized along functional lines. The main functional categories are
application software
Application may refer to:
Mathematics and computing
* Application software, computer software designed to help the user to perform specific tasks
** Application layer, an abstraction layer that specifies protocols and interface methods used in a ...
and
system software
System software is software designed to provide a platform for other software. Examples of system software include operating systems (OS) like macOS, Linux, Android and Microsoft Windows, computational science software, game engines, search engin ...
. System software includes the
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
, which couples
computer hardware with application software.
The purpose of the operating system is to provide an environment where application software executes in a convenient and efficient manner.
Both application software and system software execute
utility programs. At the hardware level, a
microcode program controls the circuits throughout the
central processing unit
A central processing unit (CPU), also called a central processor, main processor or just Processor (computing), processor, is the electronic circuitry that executes Instruction (computing), instructions comprising a computer program. The CPU per ...
.
Application software
Application software is the key to unlocking the potential of the computer system.
Enterprise application software bundles accounting, personnel, customer, and vendor applications. Examples include
enterprise resource planning
Enterprise resource planning (ERP) is the integrated management of main business processes, often in real time and mediated by software and technology. ERP is usually referred to as a category of business management software—typically a sui ...
,
customer relationship management
Customer relationship management (CRM) is a process in which a business or other organization administers its interactions with customers, typically using data analysis to study big data, large amounts of information.
CRM systems data collectio ...
, and
supply chain management software Supply-chain-management software (SCMS) is the software tools or modules used in executing supply chain transactions, managing supplier relationships and controlling associated business processes. Supply chain management maximizes the efficiency of ...
.
Enterprise applications may be developed in-house as a one-of-a-kind
proprietary software
Proprietary software is software that is deemed within the free and open-source software to be non-free because its creator, publisher, or other rightsholder or rightsholder partner exercises a legal monopoly afforded by modern copyright and i ...
.
Alternatively, they may be purchased as
off-the-shelf software
Commercial off-the-shelf or commercially available off-the-shelf (COTS) products are packaged or canned (ready-made) hardware or software, which are adapted aftermarket to the needs of the purchasing organization, rather than the commissioning of ...
. Purchased software may be modified to provide
custom software
Customised software (also known as bespoke software or tailor-made software) is software that is specially developed for some specific organization or other user. As such, it can be contrasted with the use of software packages developed for the ...
. If the application is customized, then either the company's resources are used or the resources are outsourced. Outsourced software development may be from the original software vendor or a third-party developer.
The potential advantages of in-house software are features and reports may be developed exactly to specification.
Management may also be involved in the development process and offer a level of control.
Management may decide to counteract a competitor's new initiative or implement a customer or vendor requirement.
A merger or acquisition may necessitate enterprise software changes. The potential disadvantages of in-house software are time and resource costs may be extensive.
Furthermore, risks concerning features and performance may be looming.
The potential advantages of off-the-shelf software are upfront costs are identifiable, the basic needs should be fulfilled, and its performance and reliability have a track record.
The potential disadvantages of off-the-shelf software are it may have unnecessary features that confuse end users, it may lack features the enterprise needs, and the data flow may not match the enterprise's work processes.
One approach to economically obtaining a customized enterprise application is through an
application service provider
An application service provider (ASP) is a business providing application software generally through the Web.
The ASP model
The application software resides on the vendor's system and is accessed by users through a communication protocol. Alter ...
.
Specialty companies provide hardware, custom software, and end-user support. They may speed the development of new applications because they possess skilled information system staff. The biggest advantage is it frees in-house resources from staffing and managing complex computer projects.
Many application service providers target small, fast-growing companies with limited information system resources.
On the other hand, larger companies with major systems will likely have their technical infrastructure in place. One risk is having to trust an external organization with sensitive information. Another risk is having to trust the provider's infrastructure reliability.
Operating system
An
operating system
An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs.
Time-sharing operating systems schedule tasks for efficient use of the system and may also i ...
is the low-level software that supports a computer's basic functions, such as
scheduling
A schedule or a timetable, as a basic time-management tool, consists of a list of times at which possible tasks, events, or actions are intended to take place, or of a sequence of events in the chronological order in which such things are ...
processes and controlling
peripherals.
In the 1950s, the programmer, who was also the operator, would write a program and run it. After the program finished executing, the output may have been printed, or it may have been punched onto paper tape or cards for later processing.
More often than not the program did not work. The programmer then looked at the console lights and fiddled with the console switches. If less fortunate, a memory printout was made for further study. In the 1960s, programmers reduced the amount of wasted time by automating the operator's job. A program called an ''operating system'' was kept in the computer at all times.
The term ''operating system'' may refer to two levels of software.
The operating system may refer to the
kernel program that manages the
processes,
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembered ...
, and
devices. More broadly, the operating system may refer to the entire package of the central software. The package includes a kernel program,
command-line interpreter
A command-line interpreter or command-line processor uses a command-line interface (CLI) to receive commands from a user in the form of lines of text. This provides a means of setting parameters for the environment, invoking executables and pro ...
,
graphical user interface
The GUI ( "UI" by itself is still usually pronounced . or ), graphical user interface, is a form of user interface that allows users to interact with electronic devices through graphical icons and audio indicator such as primary notation, inst ...
,
utility programs, and
editor.
Kernel Program
The kernel's main purpose is to manage the limited resources of a computer:
* The kernel program should perform
process scheduling.
The kernel creates a
process control block A process control block (PCB) is a data structure used by computer operating systems to store all the information about a process. It is also known as a process descriptor. When a process is created (initialized or installed), the operating system c ...
when a program is selected for execution. However, an executing program gets exclusive access to the
central processing unit
A central processing unit (CPU), also called a central processor, main processor or just Processor (computing), processor, is the electronic circuitry that executes Instruction (computing), instructions comprising a computer program. The CPU per ...
only for a
time slice
In computing, preemption is the act of temporarily interrupting an executing task, with the intention of resuming it at a later time. This interrupt is done by an external scheduler with no assistance or cooperation from the task. This preempt ...
. To provide each user with the
appearance of continuous access, the kernel quickly
preempts each process control block to execute another one. The goal for
system developers is to minimize
dispatch latency.
* The kernel program should perform
memory management
Memory management is a form of resource management applied to computer memory. The essential requirement of memory management is to provide ways to dynamically allocate portions of memory to programs at their request, and free it for reuse when ...
.
:* When the kernel initially
loads an executable into memory, it divides the address space logically into
regions
In geography, regions, otherwise referred to as zones, lands or territories, are areas that are broadly divided by physical characteristics (physical geography), human impact characteristics (human geography), and the interaction of humanity and t ...
.
The kernel maintains a master-region table and many per-process-region (pregion) tables—one for each running
process
A process is a series or set of activities that interact to produce a result; it may occur once-only or be recurrent or periodic.
Things called a process include:
Business and management
*Business process, activities that produce a specific se ...
.
These tables constitute the
virtual address space
In computing, a virtual address space (VAS) or address space is the set of ranges of virtual addresses that an operating system makes available to a process. The range of virtual addresses usually starts at a low address and can extend to the hig ...
. The master-pregion table is used to determine where its contents are located in
physical memory
Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers.
The central processing unit (CPU) of a compute ...
. The pregion tables allow each process to have its own program (text) pregion, data pregion, and stack pregion.
:*The program pregion stores machine instructions. Since machine instructions don't change, the program pregion may be shared by many processes of the same executable.
:* To save time and memory, the kernel may load only blocks of execution instructions from the disk drive, not the entire execution file completely.
:*The kernel is responsible for translating virtual addresses into
physical address
In computing, a physical address (also real address, or binary address), is a memory address that is represented in the form of a binary number on the address bus circuitry in order to enable the data bus to access a ''particular'' storage cell ...
es. The kernel may request data from the
memory controller
The memory controller is a digital circuit that manages the flow of data going to and from the computer's main memory. A memory controller can be a separate chip or integrated into another chip, such as being placed on the same die or as an int ...
and, instead, receive a
page fault
In computing, a page fault (sometimes called PF or hard fault) is an exception that the memory management unit (MMU) raises when a process accesses a memory page without proper preparations. Accessing the page requires a mapping to be added t ...
.
If so, the kernel accesses the
memory management unit
A memory management unit (MMU), sometimes called paged memory management unit (PMMU), is a computer hardware unit having all memory references passed through itself, primarily performing the translation of virtual memory addresses to physical a ...
to populate the physical data region and translate the address.
:* The kernel allocates memory from the ''heap'' upon request by a process.
When the process is finished with the memory, the process may request for it to be
freed. If the process exits without requesting all allocated memory to be freed, then the kernel performs
garbage collection
Waste collection is a part of the process of waste management. It is the transfer of solid waste from the point of use and disposal to the point of treatment or landfill. Waste collection also includes the curbside collection of recyclabl ...
to free the memory.
:* The kernel also ensures that a process only accesses its own memory, and not that of the kernel or other processes.
* The kernel program should perform
file system management
File or filing may refer to:
Mechanical tools and processes
* File (tool), a tool used to ''remove'' fine amounts of material from a workpiece
**Filing (metalworking), a material removal process in manufacturing
** Nail file, a tool used to gent ...
.
The kernel has instructions to create, retrieve, update, and delete files.
* The kernel program should perform
device management
Mobile device management (MDM) is the administration of mobile devices, such as smartphones, tablet computers, and laptops. MDM is usually implemented with the use of a third-party product that has management features for particular vendors of ...
.
The kernel provides programs to standardize and simplify the interface to the mouse, keyboard, disk drives, printers, and other devices. Moreover, the kernel should arbitrate access to a device if two processes request it at the same time.
* The kernel program should perform
network management
Network management is the process of administering and managing computer networks. Services provided by this discipline include fault analysis, performance management, provisioning of networks and maintaining quality of service. Network managem ...
.
The kernel transmits and receives
packets on behalf of processes. One key service is to find an efficient
route
Route or routes may refer to:
* Route (gridiron football), a path run by a wide receiver
* route (command), a program used to configure the routing table
* Route, County Antrim, an area in Northern Ireland
* ''The Route'', a 2013 Ugandan film
* Ro ...
to the target system.
* The kernel program should provide
system level functions for programmers to use.
** Programmers access files through a relatively simple interface that in turn executes a relatively complicated low-level I/O interface. The low-level interface includes file creation,
file descriptors, file seeking, physical reading, and physical writing.
** Programmers create processes through a relatively simple interface that in turn executes a relatively complicated low-level interface.
** Programmers perform date/time arithmetic through a relatively simple interface that in turn executes a relatively complicated low-level time interface.
* The kernel program should provide a
communication channel
A communication channel refers either to a physical transmission medium such as a wire, or to a logical connection over a multiplexed medium such as a radio channel in telecommunications and computer networking. A channel is used for informa ...
between executing processes.
For a large software system, it may be desirable to
engineer
Engineers, as practitioners of engineering, are professionals who invent, design, analyze, build and test machines, complex systems, structures, gadgets and materials to fulfill functional objectives and requirements while considering the l ...
the system into smaller processes. Processes may communicate with one another by sending and receiving
signals
In signal processing, a signal is a function that conveys information about a phenomenon. Any quantity that can vary over space or time can be used as a signal to share messages between observers. The ''IEEE Transactions on Signal Processing'' ...
.
Originally, operating systems were programmed in
assembly
Assembly may refer to:
Organisations and meetings
* Deliberative assembly, a gathering of members who use parliamentary procedure for making decisions
* General assembly, an official meeting of the members of an organization or of their representa ...
; however, modern operating systems are typically written in higher-level languages like
C,
Objective-C
Objective-C is a general-purpose, object-oriented programming language that adds Smalltalk-style messaging to the C programming language. Originally developed by Brad Cox and Tom Love in the early 1980s, it was selected by NeXT for its NeXT ...
, and
Swift
Swift or SWIFT most commonly refers to:
* SWIFT, an international organization facilitating transactions between banks
** SWIFT code
* Swift (programming language)
* Swift (bird), a family of birds
It may also refer to:
Organizations
* SWIFT, ...
.
Utility program
A
utility program
Utility software is software designed to help analyze, configure, optimize or maintain a computer. It is used to support the computer infrastructure - in contrast to application software, which is aimed at directly performing tasks that benefit ord ...
is designed to aid system administration and software execution. Operating systems execute hardware utility programs to check the status of disk drives, memory, speakers, and printers.
A utility program may optimize the placement of a file on a crowded disk. System utility programs monitor hardware and network performance. When a metric is outside an acceptable range, a trigger alert is generated.
Utility programs include compression programs so data files are stored on less disk space.
Compressed programs also save time when data files are transmitted over the network.
Utility programs can sort and merge data sets.
Utility programs detect
computer viruses.
Microcode program
A
microcode program is the bottom-level interpreter that controls the
data path of software-driven computers.
(Advances in
hardware have migrated these operations to
hardware execution circuits.)
Microcode instructions allow the programmer to more easily implement the
digital logic level—the computer's real hardware. The digital logic level is the boundary between
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to Applied science, practical discipli ...
and
computer engineering.
A
logic gate is a tiny
transistor
upright=1.4, gate (G), body (B), source (S) and drain (D) terminals. The gate is separated from the body by an insulating layer (pink).
A transistor is a semiconductor device used to Electronic amplifier, amplify or electronic switch, switch ...
that can return one of two signals: on or off.
* Having one transistor forms the
NOT gate.
* Connecting two transistors in series forms the
NAND gate
In digital electronics, a NAND gate (NOT-AND) is a logic gate which produces an output which is false only if all its inputs are true; thus its output is complement to that of an AND gate. A LOW (0) output results only if all the inputs to the ...
.
* Connecting two transistors in parallel forms the
NOR gate
The NOR gate is a digital logic gate that implements logical NOR - it behaves according to the truth table to the right. A HIGH output (1) results if both the inputs to the gate are LOW (0); if one or both input is HIGH (1), a LOW output (0 ...
.
* Connecting a NOT gate to a NAND gate forms the
AND gate
The AND gate is a basic digital logic gate that implements logical conjunction (∧) from mathematical logic AND gate behaves according to the truth table. A HIGH output (1) results only if all the inputs to the AND gate are HIGH (1). If not al ...
.
* Connecting a NOT gate to a NOR gate forms the OR gate.
These five gates form the building blocks of Boolean algebra, binary algebra—the digital logic functions of the computer.
Microcode instructions are Assembly language#Mnemonics, mnemonics programmers may use to execute digital logic functions instead of forming them in binary algebra. They are stored in a central processing unit, central processing unit's (CPU) control store.
These hardware-level instructions move data throughout the
data path.
The micro-instruction cycle begins when the microsequencer uses its microprogram counter to ''fetch'' the next Machine code, machine instruction from
random-access memory
Random-access memory (RAM; ) is a form of computer memory that can be read and changed in any order, typically used to store working data and machine code. A random-access memory device allows data items to be read or written in almost the ...
.
The next step is to ''decode'' the machine instruction by selecting the proper output line to the hardware module.
The final step is to ''execute'' the instruction using the hardware module's set of gates.
Instructions to perform arithmetic are passed through an
arithmetic logic unit (ALU).
The ALU has circuits to perform elementary operations to add, shift, and compare integers. By combining and looping the elementary operations through the ALU, the CPU performs its complex arithmetic.
Microcode instructions move data between the CPU and the
memory controller
The memory controller is a digital circuit that manages the flow of data going to and from the computer's main memory. A memory controller can be a separate chip or integrated into another chip, such as being placed on the same die or as an int ...
. Memory controller microcode instructions manipulate two
registers. The memory address register is used to access each memory cell's address. The Memory buffer register, memory data register is used to set and read each cell's contents.
Microcode instructions move data between the CPU and the many Bus (computing), computer buses. The Disk controller, disk controller bus writes to and reads from
hard disk drive
A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magne ...
s. Data is also moved between the CPU and other functional units via the PCI Express, peripheral component interconnect express bus.
Notes
References
{{DEFAULTSORT:Computer Program
Computer programming
Software