The Core i7 doesn't have a conventional bottleneck.
If you look at my motherboard, it works like this. It is one
of the slowest ones you can buy.
Core2_Duo_LGA775
|
| FSB800 * 8 bytes = 6.4GB/sec
|
Northbridge -- 2 channels of DDR2
That is the "old" architecture. The FSB is the bottleneck.
In the case of Core2 Quad processors, they use two silicon
die, and cache coherency traffic would have to travel over
the FSB. A Quad for LGA775 looks like this. (In a
"perfect scaling" test, the Core2 Quad goes 3.5x as fast,
when using 4 cores, so does not scale perfectly. That is
about 88% performance.)
Core1,2 Core3,4 Core2 Quad - two silicon die
| |
+----+----+ LGA775
|
| FSB
|
v
to Northbridge
Now, compare that to Core i7. It uses one silicon die.
It adds L3 cache to the design (one more level of caching
than on Core2).
Core_i7_LGA1366 -- 3 channels of DDR3, up to six sticks
|
| QPI
|
Northbridge X58
What travels over QPI, is peripheral traffic. On an SLI
motherboard, that would be gigabytes per second of graphics
traffic. But since the memory is connected directly to the
processor, the memory path is no longer a bottleneck for
the processor. So there is some improvement, similar to
what AMD did years ago.
I haven't seen a scaling test result for Core i7, but
I presume it does a little better than Core2 designs.
Hyperthreading is used, when a core is blocked on some
operation, like towards memory. The second (virtual) core,
continues its thread of execution in the interim. On
older processors, there were cases where this caused
more problems than it solved.
Part of the fun of owning new hardware, is testing
this stuff
Switch it on, switch it off, run
some benchmarks.
Good luck,
Paul