The Intel Core microarchitecture (provisionally referred to as Next Generation Micro-architecture, and developed as Merom)
is a multi-core
processor microarchitecture launched by
Intel
Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California, and Delaware General Corporation Law, incorporated in Delaware. Intel designs, manufactures, and sells computer compo ...
in mid-2006. It is a major evolution over the
Yonah, the previous iteration of the
P6 microarchitecture series which started in 1995 with
Pentium Pro. It also replaced the
NetBurst
The NetBurst microarchitecture, called P68 inside Intel, was the successor to the P6 microarchitecture in the x86 family of central processing units (CPUs) made by Intel. The first CPU to use this architecture was the Willamette-core Pentium ...
microarchitecture, which suffered from high power consumption and heat intensity due to an inefficient
pipeline
A pipeline is a system of Pipe (fluid conveyance), pipes for long-distance transportation of a liquid or gas, typically to a market area for consumption. The latest data from 2014 gives a total of slightly less than of pipeline in 120 countries ...
designed for high
clock rate
Clock rate or clock speed in computing typically refers to the frequency at which the clock generator of a processor can generate pulses used to synchronize the operations of its components. It is used as an indicator of the processor's s ...
. In early 2004, Prescott needed very high power to reach the clocks it needed for competitive performance, making it unsuitable for the shift to
dual/multi-core CPUs. On May 7, 2004, Intel confirmed the cancellation of the next NetBurst,
Tejas and Jayhawk. Intel had been developing Merom, the 64-bit evolution of the
Pentium M, since 2001,
and decided to expand it to all market segments, replacing NetBurst in desktop computers and servers. It inherited from Pentium M the choice of a short and efficient pipeline, delivering superior performance despite not reaching the high clocks of NetBurst.
The first processors that used this architecture were code-named '
Merom', '
Conroe', and '
Woodcrest'; Merom is for mobile computing, Conroe is for desktop systems, and Woodcrest is for servers and workstations. While architecturally identical, the three processor lines differ in the socket used, bus speed, and power consumption. The first Core-based desktop and mobile processors were branded ''
Core 2'', later expanding to the lower-end ''
Pentium Dual-Core'', ''
Pentium
Pentium is a series of x86 architecture-compatible microprocessors produced by Intel from 1993 to 2023. The Pentium (original), original Pentium was Intel's fifth generation processor, succeeding the i486; Pentium was Intel's flagship proce ...
'' and ''
Celeron
Celeron is a series of IA-32 and x86-64 computer microprocessor, microprocessors targeted at low-cost Personal computer, personal computers, manufactured by Intel from 1998 until 2023.
The first Celeron-branded CPU was introduced on April 15, ...
'' brands; while server and workstation Core-based processors were branded ''
Xeon
Xeon (; ) is a brand of x86 microprocessors designed, manufactured, and marketed by Intel, targeted at the non-consumer workstation, server, and embedded markets. It was introduced in June 1998. Xeon processors are based on the same archite ...
''.
Features
The Core microarchitecture returned to lower
clock rate
Clock rate or clock speed in computing typically refers to the frequency at which the clock generator of a processor can generate pulses used to synchronize the operations of its components. It is used as an indicator of the processor's s ...
s and improved the use of both available clock cycles and power when compared with the preceding
NetBurst
The NetBurst microarchitecture, called P68 inside Intel, was the successor to the P6 microarchitecture in the x86 family of central processing units (CPUs) made by Intel. The first CPU to use this architecture was the Willamette-core Pentium ...
microarchitecture of the
Pentium 4
Pentium 4 is a series of single-core central processing unit, CPUs for Desktop computer, desktops, laptops and entry-level Server (computing), servers manufactured by Intel. The processors were shipped from November 20, 2000 until August 8, 20 ...
and
D-branded CPUs. The Core microarchitecture provides more efficient decoding stages, execution units,
caches, and
buses, reducing the
power consumption of Core 2-branded CPUs while increasing their processing capacity. Intel's CPUs have varied widely in power consumption according to clock rate, architecture, and semiconductor process, shown in the
CPU power dissipation tables.
Like the last NetBurst CPUs, Core based processors feature multiple cores and hardware virtualization support (marketed as
Intel VT-x), and
Intel 64 and
SSSE3
Supplemental Streaming SIMD Extensions 3 (SSSE3 or SSE3S) is a SIMD instruction set created by Intel and is the fourth iteration of the SSE technology.
History
SSSE3 was first introduced with Intel processors based on the Core microarchitect ...
. However, Core-based processors do not have the
hyper-threading technology as in Pentium 4 processors. This is because the Core microarchitecture is based on the
P6 microarchitecture used by Pentium Pro, II, III, and M.
The L1 cache of the Core microarchitecture at 64 KB L1 cache/core (32 KB L1 Data + 32 KB L1 Instruction) is as large as in Pentium M, up from 32 KB on Pentium II / III (16 KB L1 Data + 16 KB L1 Instruction). The consumer version also lacks an L3 cache as in the Gallatin core of the Pentium 4 Extreme Edition, though it is exclusively present in high-end versions of Core-based Xeons. Both an L3 cache and hyper-threading were reintroduced again to consumer line in the
Nehalem microarchitecture.
Roadmap
Technology
While the Core microarchitecture is a major architectural revision, it is based in part on the
Pentium M processor family designed by Intel Israel. The
pipeline
A pipeline is a system of Pipe (fluid conveyance), pipes for long-distance transportation of a liquid or gas, typically to a market area for consumption. The latest data from 2014 gives a total of slightly less than of pipeline in 120 countries ...
of Core/
Penryn is 14 stages long – less than half of
Prescott's. Penryn's successor
Nehalem has a two cycles higher branch misprediction penalty than Core/Penryn. Core can ideally sustain up to 4
instructions per cycle (IPC) execution rate, compared to the 3 IPC capability of
P6,
Pentium M and
NetBurst
The NetBurst microarchitecture, called P68 inside Intel, was the successor to the P6 microarchitecture in the x86 family of central processing units (CPUs) made by Intel. The first CPU to use this architecture was the Willamette-core Pentium ...
microarchitectures. The new architecture is a dual core design with a shared
L2 cache engineered for maximum
performance per watt and improved scalability.
One new technology included in the design is
Macro-Ops Fusion, which combines two
x86 instructions into a single
micro-operation
In computer central processing units, micro-operations (also known as micro-ops or μops, historically also as micro-actions) are detailed low-level instructions used in some designs to implement complex machine instructions (sometimes termed ma ...
. For example, a common code sequence like a compare followed by a conditional jump would become a single micro-op. However, this technology does not work in 64-bit mode.
Core can speculatively execute
loads ahead of preceding stores with unknown addresses.
Other new technologies include 1 cycle throughput (2 cycles previously) of all 128-bit SSE instructions and a new power saving design. All components will run at minimum speed, raising speed dynamically as needed (similar to AMD's
Cool'n'Quiet power-saving technology, and Intel's own
SpeedStep technology from earlier mobile processors). This allows the chip to produce less heat, and minimize power use.
For most Woodcrest CPUs, the
front-side bus (FSB) runs at 1333
MT/s; however, this is scaled down to 1066 MT/s for lower end 1.60 and 1.86 GHz variants. The Merom mobile variant was initially targeted to run at an FSB of 667 MT/s while the second wave of Meroms, supporting 800 MT/s FSB, were released as part of the Santa Rosa platform with a different socket in May 2007. The desktop-oriented Conroe began with models having an FSB of 800 MT/s or 1066 MT/s with a 1333 MT/s line officially launched on July 22, 2007.
The power use of these processors is very low: average energy use is to be in the 1–2 watt range in ultra-low voltage variants, with
thermal design powers (TDPs) of 65 watts for Conroe and most Woodcrests, 80 watts for the 3.0 GHz Woodcrest, and 40 or 35 watts for the low-voltage Woodcrest. In comparison, a 2.2 GHz AMD
Opteron 875HE processor consumes 55 watts, while the energy efficient
Socket AM2 line fits in the 35 watt
thermal envelope (specified a different way so not directly comparable). Merom, the mobile variant, is listed at 35 watts TDP for standard versions and 5 watts TDP for ultra-low voltage (ULV) versions.
Previously, Intel announced that it would now focus on power efficiency, rather than raw performance. However, at
Intel Developer Forum (IDF) in spring 2006, Intel advertised both. Some of the promised numbers were:
* 20% more performance for Merom at the same power level; compared to
Core Duo
* 40% more performance for Conroe at 40% less power; compared to
Pentium D
* 80% more performance for Woodcrest at 35% less power; compared to the original
dual-core Xeon
Processor cores
The processors of the Core microarchitecture can be categorized by number of cores, cache size, and socket; each combination of these has a unique code name and product code that is used across several brands. For instance, code name "Allendale" with product code 80557 has two cores, 2 MB L2 cache and uses the desktop socket 775, but has been marketed as Celeron, Pentium, Core 2, and Xeon, each with different sets of features enabled. Most of the mobile and desktop processors come in two variants that differ in the size of the L2 cache, but the specific amount of L2 cache in a product can also be reduced by disabling parts at production time. Tigerton dual-cores and all quad-core processors except - are multi-chip modules combining two dies. For the 65 nm processors, the same product code can be shared by processors with different dies, but the specific information about which one is used can be derived from the stepping.
Conroe/Merom (65 nm)
The original Core 2 processors are based on the same dies that can be identified as
CPUID Family 6 Model 15. Depending on their configuration and packaging, their code names are Conroe (
LGA 775, 4 MB L2 cache), Allendale (LGA 775, 2 MB L2 cache), Merom (
Socket M, 4 MB L2 cache) and Kentsfield (
multi-chip module, LGA 775, 2x4MB L2 cache). Merom and Allendale processors with limited features are in
Pentium Dual Core and
Celeron
Celeron is a series of IA-32 and x86-64 computer microprocessor, microprocessors targeted at low-cost Personal computer, personal computers, manufactured by Intel from 1998 until 2023.
The first Celeron-branded CPU was introduced on April 15, ...
processors, while Conroe, Allendale and Kentsfield also are sold as
Xeon
Xeon (; ) is a brand of x86 microprocessors designed, manufactured, and marketed by Intel, targeted at the non-consumer workstation, server, and embedded markets. It was introduced in June 1998. Xeon processors are based on the same archite ...
processors.
Additional code names for processors based on this model are
Woodcrest (LGA 771, 4 MB L2 cache),
Clovertown (MCM, LGA 771, 2×4MB L2 cache) and
Tigerton (MCM,
Socket 604, 2×4MB L2 cache), all of which are marketed only under the Xeon brand.
Conroe-L/Merom-L
The Conroe-L and Merom-L processors are based around the same core as Conroe and Merom, but only contain a single core and 1 MB of L2 cache, significantly reducing production cost and power consumption of the processor at the expense of performance compared to the dual-core version. It is used only in ultra-low voltage Core 2 Solo U2xxx and in Celeron processors and is identified as CPUID family 6 model 22.
Penryn/Wolfdale (45 nm)
In Intel's
Tick-Tock cycle, the 2007/2008 "Tick" was the shrink of the Core microarchitecture to 45 nanometers as CPUID model 23. In Core 2 processors, it is used with the code names Penryn (Socket P), Wolfdale (LGA 775) and Yorkfield (MCM, LGA 775), some of which are also sold as Celeron, Pentium and Xeon processors. In the Xeon brand, the
Wolfdale-DP and
Harpertown code names are used for LGA 771 based MCMs with two or four active Wolfdale cores.
Architecturally, 45 nm Core 2 processors feature SSE4.1 and new divide/shuffle engine.
The chips come in two sizes, with 6 MB and 3 MB L2 cache. The smaller version is commonly called Penryn-3M and Wolfdale-3M and Yorkfield-6M, respectively. The single-core version of Penryn, listed as Penryn-L here, is not a separate model like Merom-L but a version of the Penryn-3M model with only one active core.
Dunnington
The
Xeon "Dunnington" processor (CPUID Family 6, model 29) is closely related to Wolfdale but comes with six cores and an on-chip L3 cache and is designed for servers with Socket 604, so it is marketed only as Xeon, not as Core 2.
Steppings
The Core microarchitecture uses several
stepping levels (steppings), which unlike prior microarchitectures, represent incremental improvements, and different sets of features like cache size and low power modes. Most of these steppings are used across brands, typically by disabling some features and limiting clock frequencies on low-end chips.
Steppings with a reduced cache size use a separate naming scheme, which means that the releases are no longer in alphabetic order. Added steppings have been used in internal and engineering samples, but are unlisted in the tables.
Many of the high-end Core 2 and Xeon processors use
Multi-chip modules of two chips in order to get larger cache sizes or more than two cores.
Steppings using 65 nm process
Early ES/QS steppings are: B0 (CPUID 6F4h), B1 (6F5h) and E0 (6F9h).
Steppings B2/B3, E1, and G0 of model 15 (cpuid 06fx) processors are evolutionary steps of the standard Merom/Conroe die with 4 MB L2 cache, with the short-lived E1 stepping only being used in mobile processors. Stepping L2 and M0 are the ''
Allendale'' chips with just 2 MB L2 cache, reducing production cost and power consumption for low-end processors.
The G0 and M0 steppings improve idle power consumption in C1E state and add the C2E state in desktop processors. In mobile processors, all of which support C1 through C4 idle states, steppings E1, G0, and M0 add support for the Mobile Intel 965 Express (
Santa Rosa) platform with
Socket P, while the earlier B2 and L2 steppings only appear for the
Socket M based Mobile Intel 945 Express (
Napa refresh) platform.
The model 22 stepping A1 (cpuid 10661h) marks a significant design change, with just a single core and 1 MB L2 cache further reducing the power consumption and manufacturing cost for the low-end. Like the earlier steppings, A1 is not used with the Mobile Intel 965 Express platform.
Steppings G0, M0 and A1 mostly replaced all older steppings in 2008. In 2009, a new stepping G2 was introduced to replace the original stepping B2.
Steppings using 45 nm process
In the model 23 (cpuid 01067xh), Intel started marketing stepping with full (6 MB) and reduced (3 MB) L2 cache at the same time, and giving them identical cpuid values. All steppings have the new
SSE4.1 instructions. Stepping C1/M1 was a bug fix version of C0/M0 specifically for quad core processors and only used in those. Stepping E0/R0 adds two new instructions (XSAVE/XRSTOR) and replaces all earlier steppings.
In mobile processors, stepping C0/M0 is only used in the Intel Mobile 965 Express (
Santa Rosa refresh) platform, whereas stepping E0/R0 supports the later Intel Mobile 4 Express (
Montevina) platform.
Model 30 stepping A1 (cpuid 106d1h) adds an L3 cache and six instead of the usual two cores, which leads to an unusually large die size of 503 mm
2. As of February 2008, it has only found its way into the very high-end Xeon 7400 series (
Dunnington).
System requirements
Motherboard compatibility
Conroe, Conroe XE and Allendale all use Socket
LGA 775; however, not every
motherboard
A motherboard, also called a mainboard, a system board, a logic board, and informally a mobo (see #Nomenclature, "Nomenclature" section), is the main printed circuit board (PCB) in general-purpose computers and other expandable systems. It ho ...
is compatible with these processors.
Supporting
chipsets are:
*
Intel
Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California, and Delaware General Corporation Law, incorporated in Delaware. Intel designs, manufactures, and sells computer compo ...
: 865G/PE/P, 945G/GZ/GC/P/PL, 965G/P, 975X, P/G/Q965, Q963, 946GZ/PL, P3x, G3x, Q3x, X38, X48, P4x, 5400 Express (See also:
List of Intel chipsets)
*
Nvidia:
nForce4 Ultra/SLI X16 for Intel,
nForce 570/590 SLI for Intel,
nForce 650i Ultra/650i SLI/680i LT SLI/680i SLI and
nForce 750i SLI/780i SLI/790i SLI/790i Ultra SLI.
*
VIA: P4M800, P4M800PRO, P4M890, P4M900, PT880 Pro/Ultra, PT890. (See also:
List of VIA chipsets
This is a list of computer motherboard chipsets made by VIA Technologies. Northbridge (computing), Northbridge chips are listed first, primarily by CPU-socket or CPU-family; Southbridge (computing), southbridge chips are listed in a later table.
...
)
*
SiS: 662, 671, 671fx, 672, 672fx
*
ATI:
Radeon Xpress 200 and CrossFire Xpress 3200 for Intel
The Yorkfield XE model QX9770 (45 nm with 1600 MT/s FSB) has limited chipset compatibility - with only X38, P35 (with
overclocking) and some high-performance X48 and P45 motherboards being compatible. BIOS updates were gradually being released to provide support for the Penryn technology, and the QX9775 is only compatible with the Intel D5400XS motherboard. The Wolfdale-3M model E7200 also has limited compatibility (at least the Xpress 200 chipset is incompatible).
Although a motherboard may have the required chipset to support Conroe, some motherboards based on the above-mentioned chipsets do not support Conroe. This is because all Conroe-based processors require a new power delivery feature set specified i
Voltage Regulator-Down (VRD) 11.0 This requirement is a result of Conroe's significantly lower power consumption, compared to the Pentium 4/D CPUs it replaced. A motherboard that has both a supporting chipset and VRD 11 supports Conroe processors, but even then some boards will need an updated
BIOS to recognize Conroe's FID (Frequency ID) and VID (Voltage ID).
Synchronous memory modules
Unlike the prior
Pentium 4
Pentium 4 is a series of single-core central processing unit, CPUs for Desktop computer, desktops, laptops and entry-level Server (computing), servers manufactured by Intel. The processors were shipped from November 20, 2000 until August 8, 20 ...
and
Pentium D design, the Core 2 technology sees a greater benefit from memory running
synchronously with the
front-side bus (FSB). This means that for the Conroe CPUs with FSB of 1066 MT/s, the ideal memory performance for DDR2 is
PC2-8500. In a few configurations, using
PC2-5300 instead of PC2-4200 can actually decrease performance. Only when going to
PC2-6400 is there a significant performance increase. While DDR2 memory models with tighter timing specifications do improve performance, the difference in real world games and applications is often negligible.
Optimally, the memory bandwidth afforded should match the bandwidth of the FSB, that is to say that a CPU with a 533 MT/s rated bus speed should be paired with RAM matching the same rated speed, for example DDR2 533, or PC2-4200. A common myth is that installing interleaved RAM will offer double the bandwidth. However, at most the increase in bandwidth by installing interleaved RAM is roughly 5–10%. Th
AGTL+ PSBused by all
NetBurst
The NetBurst microarchitecture, called P68 inside Intel, was the successor to the P6 microarchitecture in the x86 family of central processing units (CPUs) made by Intel. The first CPU to use this architecture was the Willamette-core Pentium ...
processors and current and medium-term (pre-
QuickPath) Core 2 processors provide a 64-bit data path. Current chipsets provide for a couple of either DDR2 or DDR3 channels.
On jobs requiring large amounts of memory access, the quad-core Core 2 processors can benefit significantly from using
PC2-8500 memory, which runs at the same speed as the CPU's FSB; this is not an officially supported configuration, but several motherboards support it.
The Core 2 processor does not require the use of DDR2. While the Intel 975X and P965 chipsets require this memory, some motherboards and chipsets support both Core 2 processors and
DDR memory. When using DDR memory, performance may be reduced because of the lower available memory bandwidth.
Chip errata
The Core 2
memory management unit
A memory management unit (MMU), sometimes called paged memory management unit (PMMU), is a computer hardware unit that examines all references to computer memory, memory, and translates the memory addresses being referenced, known as virtual mem ...
(MMU) in X6800, E6000 and E4000 processors does not operate to prior specifications
implemented in prior generations of
x86 hardware. This may cause problems, many of them serious security and stability issues, with extant
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
software. Intel's documentation states that their programming manuals will be updated "in the coming months" with information on recommended methods of managing the
translation lookaside buffer (TLB) for Core 2 to avoid issues, and admits that, "in rare instances, improper TLB invalidation may result in unpredictable system behavior, such as hangs or incorrect data."
Among the issues stated:
*
Non-execute bit is shared across the cores.
* Floating point instruction non-coherencies.
* Allowed memory corruptions outside of the range of permitted writing for a process by running common instruction sequences.
Intel
errata
An erratum or corrigendum (: errata, corrigenda) (comes from ) is a correction of a published text. Generally, publishers issue an erratum for a production error (i.e., an error introduced during the publishing process) and a corrigendum for an a ...
Ax39, Ax43, Ax65, Ax79, Ax90, Ax99 are said to be particularly serious. 39, 43, 79, which can cause unpredictable behavior or system hang, have been fixed in recent
steppings.
Among those who have stated the errata to be particularly serious are
OpenBSD's
Theo de Raadt and
DragonFly BSD
DragonFly BSD is a free and open-source Unix-like operating system forked from FreeBSD 4.8. Matthew Dillon, an Amiga developer in the late 1980s and early 1990s and FreeBSD developer between 1994 and 2003, began working on DragonFly BSD in ...
's
Matthew Dillon. Taking a contrasting view was
Linus Torvalds
Linus Benedict Torvalds ( , ; born 28 December 1969) is a Finnish software engineer who is the creator and lead developer of the Linux kernel. He also created the distributed version control system Git.
He was honored, along with Shinya Yam ...
, calling the TLB issue "totally insignificant", adding, "The biggest problem is that Intel should just have documented the TLB behavior better."
Microsoft has issued update KB936357 to address the errata by
microcode
In processor design, microcode serves as an intermediary layer situated between the central processing unit (CPU) hardware and the programmer-visible instruction set architecture of a computer. It consists of a set of hardware-level instructions ...
update,
with no performance penalty. BIOS updates are also available to fix the issue.
See also
*
x86
*
List of Intel CPU microarchitectures
References
External links
Intel Core Microarchitecture website*
ttp://www.intel.com/pressroom/archive/releases/20060307corp.htm Intel press release introducing the Core Microarchitecturebr>
Intel processor roadmapA Detailed Look at Intel's New Core ArchitectureIntel names the Core Microarchitecture*
ttps://web.archive.org/web/20060322051611/http://www.tgdaily.com/2006/03/07/idf_keynotes_welcome_to_intel_3-point-0/ IDF keynotes, advertising the performance of the new processorsbr>
The Core of Intel's new chipsRealWorld Tech's overview of the Core microarchitectureDetailed overview of the Core microarchitecture at Ars TechnicaIntel Core versus AMD's K8 architecture at AnandtechRelease dates of upcoming Intel Core processors using the Intel Core MicroarchitectureBenchmarks Comparing the Computational Power of Core Architecture against Older Intel NetBurst and AMD Athlon64 Central Processing Units
{{Intel processors, core
Core microarchitecture
Core
Core
Computer-related introductions in 2006