New York Times beats Digg, or Hitwise beats Alexa?

The debate I sparked over the holiday weekend with my “Digg v.3, Who needs the New York Times?” is larger than Digg vs. The New York Times.

My story questioned the validity of relying on an Alexa-based chart to conclude that a 15-person Social Web start-up, which itself neither gathers nor reports on any news, is on track to displace the 1,200-person New York Times worldwide newsroom with its $200 million news-gathering budget.

While it would seem that no statistics are necessary to conclude that The New York Times is in no near-term danger of being displaced by Digg as the world’s “newspaper of record,” Michael Arrington of TechCrunch, in profiling Digg v.3, referenced Alexa statistics in stating that Digg is: “looking more and more like the newspaper of the web, and is challenging even the New York Times on page views.”

In my “Digg v.3, Who needs the New York Times?” I discuss the “Important Disclaimers” that Alexa itself states regarding the reliability of “Alexa Traffic Rankings”:

The traffic data are based on the set of toolbars that use Alexa data, which may not be a representative sample of the global Internet population. Known biases include (but are likely not limited to)…

In my “Digg vs. The New York Times: record lead for 'newspaper of record'” story I discuss data released by Hitwise yesterday, based on its sample of 10 million Internet users:

The share of page impressions for the NY Times was 19 times greater than for Digg for that week…put the NY Times on the same chart as Digg, Digg's traffic would look tiny and relatively flat

Hitwise in “How We Do It” describes its methodology:

There are three principal ways to measure Internet usage. A panel of users can be measured at their computers with installed software (user-centric), marketers can monitor how visitors interact with a specific website (site-centric), or data can be collected directly from ISP networks (network-centric).

The network-centric methodology employed by Hitwise enables the most efficient way of monitoring of how more people visit more websites than any other way of measuring Internet usage.

Hitwise has developed proprietary software that Internet Service Providers (ISPs) use to analyze website usage logs created on their network. The anonymous data sent to Hitwise from the ISPs include a range of industry standard metrics relating to the viewing of websites including page requests, visits and average visit length.

Hitwise also combines this rich ISP data with a worldwide opt-in panel to overlay demographic, lifestyle and transactional behavior across the thousands of websites that are reported on every day.

Because of the extensive sample size of network data, Hitwise can also provide detailed insights into the search terms used to find thousands of sites as well as a range of clickstream reports, analyzing the movements of visitors between sites.

Hitwise collects aggregate usage statistics from a geographically diverse range of ISP networks in metropolitan and regional areas, representing all types of Internet usage including home, work, educational and public access.

To ensure the ISP and opt-in data is accurate and representative, it is weighted to universe estimates in each market.

Hitwise only extracts aggregate information from ISP networks and no personal information is seen or captured by Hitwise in accordance with local and international privacy guidelines. Hitwise's methodology is audited by PricewaterhouseCoopers on an annual basis.
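To make the weighting idea concrete, here is a minimal Python sketch of how a sample that over-represents one group of users can overstate a site's share of page views, and how weighting each group to an assumed "universe" share pulls the figure back. Everything in it is an assumption for illustration: the two segments, the site names, the user counts and the page-view numbers are invented, and the simple per-segment weight shown here is not Hitwise's proprietary method or Alexa's ranking formula.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    """One group of users in a hypothetical traffic sample."""
    name: str
    universe_share: float        # assumed share of the real Internet population
    sample_users: int            # users from this group present in the sample
    page_views: Dict[str, int]   # page views per site observed for this group


def raw_page_share(segments: List[Segment], site: str) -> float:
    """Share of page views for `site`, taking the sample at face value."""
    site_views = sum(s.page_views.get(site, 0) for s in segments)
    all_views = sum(sum(s.page_views.values()) for s in segments)
    return site_views / all_views


def weighted_page_share(segments: List[Segment], site: str) -> float:
    """Share of page views for `site` after weighting each segment so its
    influence matches its assumed share of the real population."""
    total_sample = sum(s.sample_users for s in segments)
    weighted_site = 0.0
    weighted_all = 0.0
    for s in segments:
        sample_share = s.sample_users / total_sample
        weight = s.universe_share / sample_share  # >1 if under-sampled
        weighted_site += weight * s.page_views.get(site, 0)
        weighted_all += weight * sum(s.page_views.values())
    return weighted_site / weighted_all


# Invented numbers: tech enthusiasts are 10% of the population but half of
# the sample, as might happen when the sample is made of toolbar installers.
SEGMENTS = [
    Segment("tech enthusiasts", 0.10, 500,
            {"tech_site": 4000, "news_site": 1000}),
    Segment("general audience", 0.90, 500,
            {"tech_site": 500, "news_site": 4500}),
]

if __name__ == "__main__":
    print(f"raw share:      {raw_page_share(SEGMENTS, 'tech_site'):.2f}")
    print(f"weighted share: {weighted_page_share(SEGMENTS, 'tech_site'):.2f}")
```

With these made-up inputs the unweighted sample credits the tech site with 45% of page views, while the weighted figure drops to 17%. The point is only that the same raw logs can tell very different stories depending on who is in the sample and whether it is weighted.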

  • Anchor to Windward

    Users will stay on top of current events by different means.

    Digg does turn up information fast and is a great piece of web technology, but it should be used with caution. It will at times find 'nuggets' of gold and will also occasionally lead the reader astray.

    The NYT is on my 'reliable' BlogLines RSS news feeds aggregator.

    FWIW, you have been added to that list also!

    Keep 'em coming Donna! :)
    D T Schmitz
  • Beware Selection Bias

    Great post Donna!

    Hitwise collects, aggregates and analyzes data from ISPs, unlike other service providers, like Alexa, that rely on user-downloaded toolbars as the source for data collection. Collecting data from individual surfers who have to opt in by downloading a toolbar of course lends itself very easily to selection bias. If wonky web 2.0 geeks like me are more likely than the average web surfer to download cool toolbars like Alexa, then of course wonky and geeky web 2.0 sites will be over-represented in the traffic rankings reported by Alexa.