Overclocking Intel's New 45nm QX9650: The Rules Have Changed
by Kris Boughton on December 19, 2007 2:00 AM EST- Posted in
- CPUs
Will the Real QX9650 Power Consumption Please Stand Up?
The recent attention highlighting the conflicting manners in which Intel and AMD quote processor power consumption figures has generated a fair amount of discussion regarding which methodology more closely represents the truth. In reality, we can see the merits of both systems, and which one is more valuable to the consumer is largely a matter of personal perception.
To this day Intel still provides processor TDP (thermal design power) specifications with a single purpose in mind - ensuring thermal engineers have the information they need to do their jobs. Intel knows companies providing platforms based on their products must be able to design and manufacture cooling solutions capable of dissipating the installed processor's maximum heat production - all without violating temperature limits. Designing around these worst-case values ensures the system remains stable under all computing conditions.
AMD's focus is very different, as they have decided to use this data as a marketing tool. By drawing attention to the processor APC (average power consumption), they hope to appeal to businesses looking to lower costs associated with their data infrastructure's operation. Although most end users find little reason to concern themselves with the relatively small monthly savings that can come from a single system's lowered power consumption, this can be a significant sum of money for corporations supporting hundreds or even thousands of workstations. Then again, any business supporting thousands of computers usually has other concerns. Do they worry more about the $5,000-$10,000 per month (or more) spent on the employee using a workstation, or the $10-$30 spent on the power for the workstation? The greater concern is often whether or not a given location has the capacity to power the workstations, not how much the power will cost.
With that said, we would argue that AMD is pandering to the public while Intel has chosen to support the industry developers. We can all certainly agree though: no matter which method you believe is more correct, quoted numbers are useless without a consistent means for calculation.
Surprisingly, Intel has chosen to leave the QX9650's TDP unchanged from the 65nm QX6850's given value of 130W. This is a little confusing as reports from Intel on 45nm process technology improvements provide us with claims of switching energy efficiency increases of up to 30% and a reduction in transistor gate leakages of 10x or more. Could basic core performance improvements such as the incorporation of the new SSE4 instruction set have made such a difference in overall CPU efficiency? We think not. Obviously, there is something more going on here; either the QX6850 TDP is vastly under-rated or the QX9650 is much better than suggested. Perhaps over-rating the QX9650's TDP gives Intel the option in claiming further thermal improvements later. We will share the method we used in calculating a more representative TDP rating for our 45nm quad-core in a moment, but we found the value to be closer to 100W - some 30W less than the figure provided by the blue team, and right in line with the 30% energy efficiency increase.
Determining just how much power a particular processor consumes at a given operating frequency is more difficult than one might originally think. A few different methods exist for estimating these values - one requires nothing more that the processor's specified TDP and then scales this value based on a given overclocked core frequency and voltage. This isn't going to cut it for us, especially considering this technique insists we place full faith in Intel's quoted QX9650 TDP, a number we know to be sometimes unrepresentative of actual operating conditions. Instead, we decided to get creative and develop our own method. The concept is rather simple: we will measure the power draw from the wall, subtract off power consumption for "static" components, correct for PSU efficiency, and then devise a means for determining what portion of the remaining power the CPU consumes - by somehow attributing the remainder to "everything else." The biggest challenge will be devising a means for accurately determining where to place this final split.
Choosing the right PSU is vital in the success of our mission, and employing the services of a Corsair unit means we can take advantage of the published efficiency curves. Monitoring and recording total system power draw is easy thanks to our Kill-A-Watt from P3 International. Finally, calculating just how much power is used by static loads (fans, pumps, HDDs, etc.) won't be too hard at all - simply noting the power draw before and after disconnecting the load is quick and effective. Indeed, developing a method for subtracting off power consumed by the motherboard, memory, and graphics card is going to require the most thought.
Our approach is rather unique: we vary CPU frequency and voltage and then plot the base system load delta versus frequency. System load delta is the difference between wall power minus static loads at CPU full load and wall power minus static loads at idle - both corrected for PSU efficiency. Graphing the delta means that the plotted power varies solely as a function of CPU speed. Our immediate results generate the following graph.
Base System Power Delta (W) versus Processor Frequency (GHz)
Extending the curve downwards allows us to extrapolate our plot's predicted y-intercept. Mathematically, this point represents the frequency in which CPU full load power minus idle power is zero (CPU idle power equals CPU full power). Logic suggests that as the CPU full load power approaches zero so will idle power, especially considering idle power should always be less than the full load power. Without much argument, we find ourselves accepting this as our CPU minimum energy frequency (about 240MHz).
We then plot the base system idle power (wall power minus static loads at idle, again corrected for PSU efficiency) versus the same set of frequencies. Creating a best-fit equation and solving for power using the minimum energy frequency found in the previous graph above gives us the base system power consumption with the CPU at its minimum energy (near zero). This is as close we are going to come to quantitatively determining the idle power consumption of the motherboard, memory, and graphics card combined. We calculated a value of 110W using an ASUS P5E3, 4GB of OCZ DDR3, and a single 8800 Ultra resting comfortably at 500Mhz on a 2D desktop. More likely than not the power-hungry GPU and overclocked X38 chipset make up the lion's share of this value.
With this number our goal is in sight - subtracting this value from the corrected base system idle values used in the creation of our second graph (not shown) gives us the CPU idle power points. Adding to these the base system load deltas plotted in the figure above provides us with the final unknowns - actual CPU full load power values. Joining these two sets of data points gives us the data below.
This graph rewards our hard work by providing us with a wealth of knowledge. Although the values above are highly precise only when referring to our particular QX9650 (every CPU is a little different), the accuracy of our method should be very good. The first thing we notice is how full load power consumption quickly increases to almost uncontrollable levels when pushing above ~4.25GHz. In fact, we found achieving true stability any higher nearly impossible with water-cooling alone. Users who are familiar with overclocking will recognize this trend: a CPU usually scales extremely well up to a certain point, beyond which any further overclocking requires an excessive amount of additional voltage. This is normal and exactly where this point lies is a function of cooling and silicon limits (i.e. process technology) and in reality has very little to do with luck.
An example of the findings that can be developed by measuring system response
We made this last graph using our CPU full load power consumption/frequency pair data points from the figure above. Based on our examinations, just about every Core 2 Extreme QX9650 owner should have little trouble accomplishing at least a 3.9GHz overclock. Upgrading the cooling used will at best provide modest improvements as these quad-cores begin to generate a lot of heat when pushed much further.
56 Comments
View All Comments
Griswold - Thursday, December 20, 2007 - link
I think the other explanation for the delay is much more likely than this horsecrap.tomoyo - Wednesday, December 19, 2007 - link
This is a great article from both a readability and technical standpoint, I found it very enjoyable to actually read the commentary, which I cannot say is true for 95% of tech reviews. Also the graphs and information presented were extremely useful. I love the demystifying of the vdroop issue to prove that it's a GOOD thing.Amiteriver - Wednesday, December 19, 2007 - link
Finally a review with some understandable explanations of Bios settings and there realation ships. As the motherboards change you get new Bios names for things and your eyes glaze over wondering what the heck some of the terms mean. Look forward to future articles of popular motherboards Bioses. I will be printing this article out. Thanks muchAmiteriver - Wednesday, December 19, 2007 - link
Finally a review with some understandable explanations of Bios settings and there realation ships. As the motherboards change you get new Bios names for things and your eyes glaze over wondering what the heck some of the terms mean. Look forward to future articles of popular motherboards Bioses. I will be printing this article out. Thanks muchSczee - Wednesday, December 19, 2007 - link
Quite possibly the best article on computer hardware I have ever read. I learnt a lot about overclocking from it and plan to put it to use tonight. I'll be looking out for your next piece Kris.Bozo Galora - Wednesday, December 19, 2007 - link
ya, the X48 is being delayed because some top tier mobo manuf needs to get rid of its X38 inventory. My guess is Asus, because they have the muscle to influence Intel. Bleh.So I bought all this super xmas bargain stuff, and now I am going to have to let it gather dust - lol
http://www.digitimes.com/mobos/a20071217PD205.html">http://www.digitimes.com/mobos/a20071217PD205.html
Bozo Galora - Wednesday, December 19, 2007 - link
and the X48 does seem to have a 450-460 cap, at least so farhttp://en.hardspell.com/doc/showcont.asp?news_id=1...">http://en.hardspell.com/doc/showcont.asp?news_id=1...
bryanW1995 - Wednesday, December 19, 2007 - link
Best article I've read here in a long time. Good job.Regs - Wednesday, December 19, 2007 - link
So lets say I have a 65nm Core Duo running between 0.850V-1.3525V. These are the product specs, which I guess .850 is the low limit and 1.325 is the high limit. Why does the voltage have to decrease depending on load? Is it just as simple as "supply and demand"? How does running the CPU at 1.352v run the risk of instability?Aivas47a - Wednesday, December 19, 2007 - link
Great article. You guys have really been distinguishing yourselves with in-depth work on overclocking the last few months: exploring obscure bios settings, tinkering with "extreme" cooling -- keep it up!My experience with a qx9650 so far is very similar to yours: easy scaling to 4 ghz, difficult scaling after that with 4.2 ghz being the practical max for regular operation (folding, etc.).
One issue I will be interested to see you address in the future is fsb overclocking on yorkfield. So far I am seeing yorkfield top out at lower fsb (450-460) than was possible for kentsfield on a comparable P35 or X38 platform. That is not so significant for the unlocked Extreme Edition chips, but could make it difficult to achieve the magic 4 ghz with the q9550 and especially the q9450.