Benchmarking the Benchmarks?

Rob Williams

Editor-in-Chief
Staff member
Moderator
From our front-page news:
Kyle Bennett is no stranger to stirring the proverbial pot in the tech industry. He's a well-known gear-head who's not afraid to speak his mind - which is one reason he rubs so many people the wrong way. And while I may not agree with everything he says, there is one area where we do agree, and that's with video card benchmarking.

Over at Hard|OCP, he has just posted a new article explaining their testing methodology, and goes into some depth with regards to specifics. Although we've not had many GPU reviews in the past, all of our testing is done manually as well, with the exception for a few timedemos for use with our CPU reviews (since timedemos rely a lot on the CPU). For our GPU reviews, we play through all of the levels manually.

This article came at a coincidental time, because I just spent the past weekend benchmarking five GPUs for upcoming reviews, with three more left on the table. I admit, playing the same level over and over and over gets tedious, but a tall Guinness or short Heineken works wonders!

The article is worth a look if you want a another opinion on why timedemos are not the way to do things. But, it will all come down to personal preference, and there is no denying that most of the time, timedemos are somewhat accurate. One interesting finding, though, is that even though most reviews for the AMD HD 3870 X2 show the card to be faster than the 8800 GTX... it turns out that real-world, that's not the case.

That comparison also couldn't come at a better time, since that was an identical scenario I will be dealing with later today or tomorrow. I am interested to see if the numbers all scale with his. I'll post in the news once I have some findings.

<table align="center"><tbody><tr><td>
3dmark_vantage_021108.jpg

</td></tr></tbody></table>
That is not to say that synthetic and canned benchmarks do not have their places in testing, we just don’t usually find those metrics to be indicative of what the end user has in terms of actual experiences. Some website’s want to tell you the "relative performance of a graphics card" based on a timedemo that in no way represents playing the game. That is not what we want to focus on here at HardOCP.

Source: HardOCP
 

Greg King

I just kinda show up...
Staff member
For me, it's never been about the point of his messages but rather in the way he presents them.

He totally called out Anantech on multiple occasions in his write up and I don't agree with that at all.
 

Rob Williams

Editor-in-Chief
Staff member
Moderator
I agree, but it's certainly nothing new with Kyle. Calling out sites specifically is in bad taste, but he's all about bad taste and doing whatever gets the site traffic. We certainly operate differently here.

That said, I still agree that timedemos for the most part are not representitive of real gameplay. It makes me want to consider including both, however. Perhaps seven manually played games and two timedemos.

I should maybe even write a similar article but delve into more timedemos than just the one Kyle did. Testing out a single timedemo also isn't very representitive of what to expect from across the map.

If I can find time within the next week, I think I'll create a timedemo of the exact same HL2 level I've been playing for months on end through benchmarking, and them compare the results for the two. Should be interesting.
 
Top