The z196
microprocessor
A microprocessor is a computer processor where the data processing logic and control is included on a single integrated circuit, or a small number of integrated circuits. The microprocessor contains the arithmetic, logic, and control circu ...
is a chip made by
IBM for their
zEnterprise 196 and zEnterprise 114 mainframe computer
A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterpris ...
s, announced on July 22, 2010.
The processor was developed over a three-year time span by IBM engineers from
Poughkeepsie, New York
Poughkeepsie ( ), officially the City of Poughkeepsie, separate from the Town of Poughkeepsie around it) is a city in the U.S. state of New York. It is the county seat of Dutchess County, with a 2020 census population of 31,577. Poughkeepsi ...
;
Austin, Texas
Austin is the capital city of the U.S. state of Texas, as well as the county seat, seat and largest city of Travis County, Texas, Travis County, with portions extending into Hays County, Texas, Hays and Williamson County, Texas, Williamson co ...
; and
Böblingen
Böblingen (; Swabian German, Swabian: ''Beblenga'') is a town in Baden-Württemberg, Germany, seat of Böblingen (district), Böblingen District. Sindelfingen and Böblingen are Geographic contiguity, contiguous.
History
Böblingen was found ...
,
Germany
Germany,, officially the Federal Republic of Germany, is a country in Central Europe. It is the second most populous country in Europe after Russia, and the most populous member state of the European Union. Germany is situated betwe ...
at a cost of US$1.5 billion. Manufactured at IBM's
Fishkill, New York
Fishkill is a village within the town of Fishkill in Dutchess County, New York, United States. The village is in the eastern part of the town of Fishkill on U.S. Route 9. It is north of Interstate 84. NY 52 is the main street. It is part of the ...
fabrication plant, the processor began shipping on September 10, 2010. IBM stated that it was the world's fastest microprocessor at the time.
Description
The chip measures 512.3 mm
2 and consists of 1.4 billion
transistor
upright=1.4, gate (G), body (B), source (S) and drain (D) terminals. The gate is separated from the body by an insulating layer (pink).
A transistor is a semiconductor device used to Electronic amplifier, amplify or electronic switch, switch e ...
s fabricated in IBM's
45 nm
Per the International Technology Roadmap for Semiconductors, the 45 nm process is a MOSFET technology node referring to the average half-pitch of a memory cell manufactured at around the 2007–2008 time frame.
Matsushita and Intel started mass ...
CMOS
Complementary metal–oxide–semiconductor (CMOS, pronounced "sea-moss", ) is a type of metal–oxide–semiconductor field-effect transistor (MOSFET) fabrication process that uses complementary and symmetrical pairs of p-type and n-type MOSFE ...
silicon on insulator
In semiconductor manufacturing, silicon on insulator (SOI) technology is fabrication of silicon semiconductor devices in a layered silicon–insulator–silicon substrate, to reduce parasitic capacitance within the device, thereby improving perfo ...
fabrication process
Semiconductor device fabrication is the process used to manufacture semiconductor devices, typically integrated circuit (IC) chips such as modern computer processors, microcontrollers, and memory chips such as NAND flash and DRAM that are pres ...
, supporting speeds of 5.2
GHz
The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), equivalent to one event (or cycle) per second. The hertz is an SI derived unit whose expression in terms of SI base units is s−1, meaning that one he ...
:
at the time, the highest clock speed CPU ever produced for commercial sale.
The processor implements the
CISC z/Architecture
z/Architecture, initially and briefly called ESA Modal Extensions (ESAME), is IBM's 64-bit complex instruction set computer (CISC) instruction set architecture, implemented by its mainframe computers. IBM introduced its first z/Architecture-b ...
with a new
superscalar
A superscalar processor is a CPU that implements a form of parallelism called instruction-level parallelism within a single processor. In contrast to a scalar processor, which can execute at most one single instruction per clock cycle, a sup ...
,
out-of-order pipeline
Pipeline may refer to:
Electronics, computers and computing
* Pipeline (computing), a chain of data-processing stages or a CPU optimization found on
** Instruction pipelining, a technique for implementing instruction-level parallelism within a s ...
and 100 new
instructions
Instruction or instructions may refer to:
Computing
* Instruction, one operation of a processor within a computer architecture instruction set
* Computer program, a collection of instructions
Music
* Instruction (band), a 2002 rock band from Ne ...
. The instruction pipeline has 15 to 17 stages; the instruction queue can hold 40 instructions; and up to 72 instructions can be "in flight". It has four
cores, each with a private 64
KB L1 instruction cache, a private 128 KB L1 data cache and a private 1.5
MB L2 cache
A CPU cache is a hardware cache used by the central processing unit (CPU) of a computer to reduce the average cost (time or energy) to access data from the main memory. A cache is a smaller, faster memory, located closer to a processor core, which ...
. In addition, there is a 24 MB shared L3 cache implemented in
eDRAM
Embedded DRAM (eDRAM) is dynamic random-access memory (DRAM) integrated on the same die or multi-chip module (MCM) of an application-specific integrated circuit (ASIC) or microprocessor. eDRAM's cost-per-bit is higher when compared to equivalen ...
and controlled by two on-chip L3 cache controllers. There's also an additional shared L1 cache used for compression and cryptography operations.
Each core has six
RISC
In computer engineering, a reduced instruction set computer (RISC) is a computer designed to simplify the individual instructions given to the computer to accomplish tasks. Compared to the instructions given to a complex instruction set comput ...
-like execution units, including two
integer units, two
load-store units, one binary
floating point unit
Floating may refer to:
* a type of dental work performed on horse teeth
* use of an isolation tank
* the guitar-playing technique where chords are sustained rather than scratched
* ''Floating'' (play), by Hugh Hughes
* Floating (psychological phe ...
and one
decimal floating point
Decimal floating-point (DFP) arithmetic refers to both a representation and operations on decimal floating-point numbers. Working directly with decimal (base-10) fractions can avoid the rounding errors that otherwise typically occur when convert ...
unit. The z196 chip can decode three instructions and execute five operations in a single clock cycle.
The z196 chip has on board
DDR3 RAM memory controller
The memory controller is a digital circuit that manages the flow of data going to and from the computer's main memory. A memory controller can be a separate chip or integrated into another chip, such as being placed on the same die or as an int ...
supporting a
RAID
Raid, RAID or Raids may refer to:
Attack
* Raid (military), a sudden attack behind the enemy's lines without the intention of holding ground
* Corporate raid, a type of hostile takeover in business
* Panty raid, a prankish raid by male college ...
like configuration to recover from memory faults. The z196 also includes a
GX bus controller for accessing host channel adapters and peripherals. Additionally, each chip includes co-processors for cryptographic and compression functionality.
Shared Cache
Even though the z196 processor has on-die facilities for
symmetric multiprocessing
Symmetric multiprocessing or shared-memory multiprocessing (SMP) involves a multiprocessor computer hardware and software architecture where two or more identical processors are connected to a single, shared main memory, have full access to all ...
(SMP), there are 2 dedicated companion chips called the ''Shared Cache'' (SC) that each adds 96 MB off-die
L4 cache
A CPU cache is a hardware cache used by the central processing unit (CPU) of a computer to reduce the average cost (time or energy) to access data from the main memory. A cache is a smaller, faster memory, located closer to a processor core, which ...
for a total of 192 MB L4 cache. L4 cache is shared by all processors in the book. The SC chip consists of 1.5 billion transistors and measures 478.8 mm
2, manufactured with the same 45 nm process as the z196 chip.
Each chip also has 24 MB L3 cache shared by the 4 cores on the chip.
Multi-chip module
The
zEnterprise System z196 uses
multi-chip module
A multi-chip module (MCM) is generically an electronic assembly (such as a package with a number of conductor terminals or "pins") where multiple integrated circuits (ICs or "chips"), semiconductor dies and/or other discrete components are int ...
s (MCMs) which allows for six z196 chips to be on a single module. Each MCM has two shared cache chips allowing processors on the MCM to be connected with 40 GB/s links.
The different models of the zEnterprise System have a different number of active cores. To accomplish this, some processors in each MCM may have its fourth core disabled.
z114
The zEnterprise System z114 does use z196 processors but does not use MCMs so the processors are packaged on single chip modules (SCM) instead. Two SCMs and one Shared Cache chip is mounted together in a processor drawer. These processors also run at reduced speed, at 3.8 GHz.
References
External links
IBM zEnterprise System Technical Introduction - IBM.comIBM z196 Mainframe Architectureat
Real World Technologies
{{DEFAULTSORT:Ibm Z196 (Microprocessor)
z196
z196
Computer-related introductions in 2010