
2007 Web Analytics Shootout – Interim Report


Introduction to the 2007 Analytics Shoot Out – by Jim Sterne

Every web analytics tool measures clickthroughs and page views a little differently. They’re all using slightly different yardsticks and getting slightly different results. The disparity is driving us to distraction.
Just how different are they?
To get the answer to that, you would have to install some of the most popular tools side by side on the same sites and compare the results. That’s what the team at Perficient Digital did and the results are informative.
Yes, it’s interesting to see which tools, looking at the same data, count higher and which count lower. But the eye opener for these very complex yardsticks is just how many variables there are in the implementation process. Very interesting reading.
Jim Sterne

Overview of the 2007 Analytics Shoot Out

The 2007 Analytics Shoot Out evaluates the performance, accuracy, and capabilities of 7 different analytics packages as implemented across 4 different web sites. The goals of the project are as follows:

  1. Evaluate ease of implementation
  2. Evaluate ease of use
  3. Understand the basic capabilities of each package
  4. Solve specific problems on each web site
  5. Discover the unique strengths of each package
  6. Discover the unique weaknesses of each package
  7. Learn about the structural technology elements of each package that affect its capabilities
  8. Learn how to better match a customer’s needs to the right analytics package

How the results of the Shoot Out will be delivered

The results of the Shoot Out will be delivered in two stages:

  1. This is the interim report, which was officially released at the Emetrics Summit in San Francisco on May 6, 2007.
  2. The final report, with more comprehensive results and analysis, was officially released on August 27, 2007. Click here to see the Final Report of The 2007 Analytics Shoot Out.

What you get in this interim report

  1. An analysis of how the user deletion/non-acceptance rates of third-party cookies and first-party cookies differ.
  2. Comparative data showing:
    • Visitors
    • Unique Visitors
    • Page Views
    • Specific segments, as defined per site, for 2 of the sites
  3. An analysis of the comparative data and discussion of the following topics:
    • What the numbers tell us
    • Range of results
    • Does one package always report lower numbers than the others?
    • Does one package always report higher numbers than the others?

Contents of the Final Report

The final report will contain the same basic types of data as the interim report but with more numerical detail and a more extensive analysis of that data.
In addition to enhancing the above data from the interim report, the final report will also:

  1. Highlight major strengths of each analytics package, and outline specific scenarios where that will matter to a website owner/webmaster.
  2. Highlight specific weaknesses of each analytics package (as well as workarounds and plans by each company to address these weaknesses), and outline specific scenarios where that will matter to a website owner/webmaster.
  3. Provide detailed screenshots of the significant aspects of each analytics package.
  4. Provide overall commentary on each package.

2007 Analytics Shoot Out Details

Analytics Packages

The following companies actively contributed their time and effort to this project:

  1. Clicktracks
  2. Google Analytics
  3. IndexTools
  4. Unica Affinium NetInsight
  5. WebSideStory HBX Analytics

Each of these analytics packages was installed on multiple web sites, and each of these companies contributed engineering support resources to assist us during the project.
We were also able to evaluate the following analytics packages because they were already on one of the sites we used in the project:

  1. Omniture SiteCatalyst
  2. WebTrends

Participating Web Sites

  1. AdvancedMD (AMD)
  2. City Town Info (CTI)
  3. Home Portfolio (HPort)
  4. Tool Parts Direct (TPD)

Each of these sites installed multiple analytics packages per our instructions and made revisions as we requested.

Matrix of Web Sites and Analytics Packages that were tested in the Shoot Out

| Site  | Clicktracks | Google Analytics | IndexTools | Omniture SiteCatalyst | Unica Affinium NetInsight | WebSideStory HBX Analytics | WebTrends |
|-------|-------------|------------------|------------|-----------------------|---------------------------|----------------------------|-----------|
| AMD   | Y           | Y                | Y          | Y                     | Y                         | Y                          | N         |
| CTI   | Y           | Y                | Y          | N                     | Y                         | Y                          | N         |
| HPort | Y           | Y                | Y          | N                     | N                         | Y                          | Y         |
| TPD   | Y           | Y                | Y          | N                     | N                         | Y                          | N         |

Additional Contributors

Thanks are also due to the following companies, who contributed to this project:

  1. Alchemist Media
  2. SEOmoz

And a special thanks to Jim Sterne of Target Marketing and Emetrics for his support of the Shoot Out.

Methodology

The major aspects of the Shoot Out methodology are as follows:

  1. For each package except WebTrends, we installed JavaScript tags on the pages of the participating sites. WebTrends was already installed on one of the participating sites, and that implementation used a combination of JavaScript tags and log file analysis.
  2. All the JavaScript was added to website pages through include files. As a result, any errors of omission should apply equally to every analytics package (see the sketch after this list).
  3. All packages were run concurrently.
  4. All packages used first-party cookies.
  5. A custom analytics plan was tailored to the needs of each site.
  6. Visitors, Unique Visitors, and Page Views were recorded daily for each site.
  7. Content Groups and Segments were set up for each site. Numbers related to these were recorded daily.
  8. Detailed ad hoc analysis was done with each analytics package on each site.
  9. Critical strengths and weaknesses of each package were noted and reviewed with each vendor for comment.
  10. Each vendor was given an opportunity to present their product’s strongest features and benefits.
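
To make item 2 concrete, here is a minimal sketch of a shared include file carrying the vendor tags. It assumes a 2007-era Google Analytics (urchin) tag; the file name and account ID are hypothetical placeholders, and each vendor's actual tag differs.

```html
<!-- analytics-tags.inc: a shared include referenced by every page template,
     so a page that is missing tags is missing them for all vendors alike.
     The file name and account ID below are hypothetical placeholders. -->

<!-- Google Analytics (2007-era urchin tag) -->
<script src="http://www.google-analytics.com/urchin.js" type="text/javascript"></script>
<script type="text/javascript">
  _uacct = "UA-XXXXXX-X";  // placeholder account ID
  urchinTracker();          // records the page view
</script>

<!-- The Clicktracks, IndexTools, HBX, and Unica tags would follow here,
     each loading its own script and firing its own page view call. -->
```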

Interim Results


The next few sections present the interim results from the Shoot Out. Note that the time frames for the data from each site have been masked, and the time frame used for each site was different, but the numbers are real.

First Party Cookies vs. Third Party Cookies


Using WebSideStory’s HBX Analytics running on CityTownInfo.com, we ran the software for a fixed period of time using third-party cookies (TPCs). We then ran the software for the same amount of time using first-party cookies (FPCs).

During that same period we ran 3 of the other analytics packages (Clicktracks, Google Analytics, and IndexTools), all using first-party cookies.

The results were then compared by examining the relationship of HBX reported volumes to the average of the volumes of the three other packages and then seeing how that relationship changed when we switched from third-party cookies to first-party cookies. In theory, this should give us an estimate of how the user deletion/non-acceptance rates of third-party cookies compare to user deletion/non-acceptance rates of first-party cookies.
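
As a concrete illustration of this comparison, here is a small JavaScript sketch of how the "HBX % of Average" ratios and the relative change between the two cookie modes can be computed. The figures are the Visitors columns from the two tables below; this reproduces the arithmetic only.

```javascript
// Visitors reported while HBX used third-party vs. first-party cookies,
// alongside the three packages that used first-party cookies throughout.
const tpcRun = { others: [72224, 66866, 67365], hbx: 48990 };
const fpcRun = { others: [71076, 65906, 67117], hbx: 55871 };

const avg = (xs) => xs.reduce((a, b) => a + b, 0) / xs.length;
const pctOfAvg = (run) => 100 * run.hbx / avg(run.others);

const tpcPct = pctOfAvg(tpcRun);  // ≈ 71.19%
const fpcPct = pctOfAvg(fpcRun);  // ≈ 82.12%

// Relative traffic growth when switching to first-party cookies:
const growth = 100 * (fpcPct - tpcPct) / fpcPct;  // ≈ 13.3%
console.log(tpcPct.toFixed(2), fpcPct.toFixed(2), growth.toFixed(2));
```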

Here are the results we obtained while HBX Analytics was running third-party cookies:

|                                  | Visitors | Uniques | Page Views |
|----------------------------------|----------|---------|------------|
| Clicktracks                      | 72,224   | 66,335  | 120,536    |
| Google Analytics                 | 66,866   | 64,975  | 118,230    |
| IndexTools                       | 67,365   | 65,212  | 123,279    |
| WebSideStory HBX Analytics       | 48,990   | 47,813  | 102,534    |
| Average of all but HBX Analytics | 68,818   | 65,507  | 120,682    |
| HBX Analytics % of Average       | 71.19%   | 72.99%  | 84.96%     |


Visitor and unique visitor totals for HBX Analytics are 71–73% of the average of the other 3 packages, while page views are roughly 85% of that average.

Now let’s take a look at the same data when HBX Analytics was making use of first-party cookies:

|                                   | Visitors | Uniques | Page Views |
|-----------------------------------|----------|---------|------------|
| Clicktracks                       | 71,076   | 65,314  | 114,966    |
| Google Analytics                  | 65,906   | 64,030  | 112,436    |
| IndexTools                        | 67,117   | 64,621  | 119,049    |
| WebSideStory HBX Analytics        | 55,871   | 54,520  | 96,453     |
| Average of all but HBX Analytics  | 68,033   | 64,655  | 115,484    |
| HBX Analytics % of Average        | 82.12%   | 84.32%  | 83.52%     |
| Relative Traffic Growth with FPCs | 13.32%   | 13.44%  | n/a        |


With first-party cookies, the visitor and unique visitor totals for HBX Analytics are now 82–84% of the average of the other 3 packages. The page view relationship did not change significantly, remaining at roughly 84%.

Analysis and Commentary


By observing how much the traffic reported by HBX Analytics increased relative to the average of the other 3 packages, we can estimate how third-party cookie deletion and non-acceptance differ from first-party cookie deletion and non-acceptance.

According to this data, the third-party cookie deletion/non-acceptance rate exceeds the first-party cookie deletion/non-acceptance rate by a little more than 13%. WebSideStory also reported to Perficient Digital that it saw a 15-20% third-party cookie deletion/non-acceptance rate across sites that it monitored during a 2-week period in January, and about a 2% first-party cookie deletion/non-acceptance rate.

This data is fairly consistent with past industry data that estimates the third-party cookie deletion/non-acceptance rate at about 15%.

Note that comScore recently reported that more than 30% of cookies are deleted or not accepted overall, and its data also seemed to show a significantly smaller difference between TPC and FPC deletions/non-acceptances. We have not yet seen the methodology comScore used. Nonetheless, our data above should provide a reasonable indication of how TPC deletions and non-acceptances differ from FPC deletions and non-acceptances.
Cookie deletion and acceptance rates are of great concern when evaluating web analytics. Every time a cookie is deleted or not accepted it impacts the visitor and unique visitor counts of the tool. In particular, counting of unique visitors is significantly affected. If a user visits a site in the morning, and doesn’t accept or deletes their cookies, and then visits again in the afternoon, this will show up as 2 different daily unique visitors in the totals for that day, when in fact one user made multiple visits, and should be counted only as one unique visitor.
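
The following sketch shows, in simplified form, why cookie deletion inflates unique visitor counts: the tag can only recognize a returning visitor if its cookie survives. The cookie name and logic are illustrative, not any specific vendor's implementation.

```javascript
// Simplified model of how a page tag decides whether a hit is a
// "new" daily unique visitor. Cookie name and format are hypothetical.
function recordVisitor() {
  const match = document.cookie.match(/(?:^|;\s*)visitorId=([^;]+)/);
  if (match) {
    // Cookie survived: recognized as the same visitor.
    return { visitorId: match[1], newUnique: false };
  }
  // No cookie found: either a genuinely new visitor, or a returning
  // visitor who deleted / never accepted the cookie. The tool cannot
  // tell the difference, so it counts a new daily unique either way.
  const id = Math.random().toString(36).slice(2);
  document.cookie = "visitorId=" + id + "; path=/";
  return { visitorId: id, newUnique: true };
}
```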
It should be noted that the packages use different methods for setting their cookies. For example, HBX Analytics requires you to set up a CNAME record in your DNS configuration that remaps a sub-domain of your site to one of their servers.
While this requires someone who is comfortable configuring CNAME records, it does provide some advantages. For example, simple first-party cookie implementations still pass data directly back to the servers of the analytics vendor, and memory-resident anti-spyware software can intercept and block these communications.
Using the CNAME record bypasses this problem: all the memory-resident anti-spyware software sees is a communication with a sub-domain of your site, and the redirection of the data stream to the HBX Analytics server happens at the DNS server.
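
For illustration, a CNAME-based setup might look like the following zone-file entry. Both hostnames here are hypothetical; in practice the tracking sub-domain is chosen by the site owner and the target hostname is supplied by the vendor.

```
; Hypothetical DNS zone-file entry for a first-party tracking sub-domain.
; "stats.example.com" is chosen by the site owner; the target hostname
; is assigned by the analytics vendor.
stats.example.com.   IN   CNAME   collector.analytics-vendor.example.
```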
Unica offers a choice between a similar CNAME record-based approach for first-party cookies and a simpler first-party cookie implementation.
The other analytics packages used in this test (Clicktracks, Google Analytics, and IndexTools) have chosen an initial configuration approach that requires no special DNS setup, which allows a less technical user to set them up and get started.

Visitors, Unique Visitors, and Page Views (aka “traffic numbers”)

Notes


The Uniques column is the summation of Daily Unique Visitors over a period of time. The resulting total is therefore not an actual unique visitor count for the time period (because some visitors may have visited the site multiple times and been counted as a Daily Unique Visitor on each day they visited).

This was done because not all of the packages readily permitted us to obtain Unique Visitor totals over an arbitrary period of time. For some packages, for example, it is not trivial to pull the Unique Visitor count for an arbitrary 12-day period.

Regardless, the Uniques data in the tables below remains a meaningful measurement of how the analytics packages compare in calculating Daily Unique Visitors.

The rows that show % information for each analytics package give that package's percentage of the average result. Note that the average does not necessarily mean "correct"; this information is intended simply to help identify which analytics packages yielded results that differed substantially from the others.

The time period is not disclosed, in order to obscure the actual daily traffic numbers of the participating sites. In addition, the time period used for each site differed.
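
For clarity, here is a sketch of how the derived rows in the tables below ("% of average", "Standard Deviation", and "Std Deviations") are computed from the raw counts. This reproduces the arithmetic only; the counts themselves come from the tools.

```javascript
// Derived statistics used in the tables below, computed from one
// metric column (here: the Visitors column for CityTownInfo.com).
const visitors = {
  "Clicktracks": 645380,
  "Google Analytics": 600545,
  "IndexTools": 614600,
  "Unica Affinium NetInsight": 607475,
  "WebSideStory HBX Analytics": 524055,
};

const values = Object.values(visitors);
const mean = values.reduce((a, b) => a + b, 0) / values.length;

// Population standard deviation across the packages
// (matches the "Standard Deviation" row: ≈ 40,209 for this column):
const variance = values.reduce((s, v) => s + (v - mean) ** 2, 0) / values.length;
const stdDev = Math.sqrt(variance);

for (const [pkg, v] of Object.entries(visitors)) {
  const pctOfAvg = 100 * v / mean;      // the "% of average" row
  const zScore = (v - mean) / stdDev;   // the "Std Deviations" row
  console.log(pkg, pctOfAvg.toFixed(2) + "%", zScore.toFixed(2));
}
```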

Traffic Data

  1. Summary visitor, unique visitor, and page view data for CityTownInfo.com:

|                                            | Visitors | Uniques | Page Views |
|--------------------------------------------|----------|---------|------------|
| Clicktracks                                | 645,380  | 587,658 | 1,042,604  |
| Google Analytics                           | 600,545  | 583,199 | 1,038,995  |
| IndexTools                                 | 614,600  | 595,163 | 1,099,786  |
| Unica Affinium NetInsight                  | 607,475  | 593,871 | 1,027,445  |
| WebSideStory HBX Analytics                 | 524,055  | 510,882 | 910,809    |
| Average                                    | 598,411  | 574,155 | 1,023,928  |
| Clicktracks %                              | 107.85%  | 102.35% | 101.82%    |
| Google Analytics %                         | 100.36%  | 101.58% | 101.47%    |
| IndexTools %                               | 102.71%  | 103.66% | 107.41%    |
| Unica Affinium NetInsight %                | 101.51%  | 103.43% | 100.34%    |
| WebSideStory HBX Analytics %               | 87.57%   | 88.98%  | 88.95%     |
| Standard Deviation                         | 40,209   | 31,930  | 61,868     |
| Clicktracks Std Deviations                 | 1.17     | 0.42    | 0.30       |
| Google Analytics Std Deviations            | 0.05     | 0.28    | 0.24       |
| IndexTools Std Deviations                  | 0.40     | 0.66    | 1.23       |
| Unica Affinium NetInsight Std Deviations   | 0.23     | 0.62    | 0.06       |
| WebSideStory HBX Analytics Std Deviations  | -1.85    | -1.98   | -1.83      |

  2. Summary visitor, unique visitor, and page view data for HomePortfolio.com:

|                                            | Visitors | Uniques | Page Views |
|--------------------------------------------|----------|---------|------------|
| Google Analytics                           | 754,446  | 707,358 | 7,209,828  |
| IndexTools                                 | 731,218  | 686,518 | 7,078,720  |
| WebSideStory HBX Analytics                 | 701,895  | 662,411 | 6,439,982  |
| WebTrends                                  | 804,012  | 778,280 | 7,483,154  |
| Average                                    | 747,893  | 708,642 | 7,052,921  |
| Google Analytics %                         | 100.88%  | 99.82%  | 102.22%    |
| IndexTools %                               | 97.77%   | 96.88%  | 100.37%    |
| WebSideStory HBX Analytics %               | 93.85%   | 93.48%  | 91.31%     |
| WebTrends %                                | 107.50%  | 109.83% | 106.10%    |
| Standard Deviation                         | 37,370   | 43,237  | 382,779    |
| Google Analytics Std Deviations            | 0.18     | -0.03   | 0.41       |
| IndexTools Std Deviations                  | -0.45    | -0.51   | 0.07       |
| WebSideStory HBX Analytics Std Deviations  | -1.23    | -1.07   | -1.60      |
| WebTrends Std Deviations                   | 1.50     | 1.61    | 1.12       |

  3. Summary visitor, unique visitor, and page view data for ToolPartsDirect.com:

|                                            | Visitors | Uniques | Page Views |
|--------------------------------------------|----------|---------|------------|
| Clicktracks                                | 129,900  | 91,879  | 639,892    |
| Google Analytics                           | 159,955  | 103,260 | 939,373    |
| IndexTools                                 | 108,486  | 92,070  | 687,544    |
| WebSideStory HBX Analytics                 | 103,724  | 91,847  | 582,887    |
| Average                                    | 125,516  | 94,764  | 712,424    |
| Clicktracks %                              | 103.49%  | 96.96%  | 89.82%     |
| Google Analytics %                         | 127.44%  | 108.97% | 131.86%    |
| IndexTools %                               | 86.43%   | 97.16%  | 96.51%     |
| WebSideStory HBX Analytics %               | 82.64%   | 96.92%  | 81.82%     |
| Standard Deviation                         | 22,193   | 4,906   | 136,167    |
| Clicktracks Std Deviations                 | 0.20     | -0.59   | -0.53      |
| Google Analytics Std Deviations            | 1.55     | 1.73    | 1.67       |
| IndexTools Std Deviations                  | -0.77    | -0.55   | -0.18      |
| WebSideStory HBX Analytics Std Deviations  | -0.98    | -0.59   | -0.95      |

  4. Summary visitor, unique visitor, and page view data for AdvancedMD.com:

|                                            | Visitors | Uniques | Page Views |
|--------------------------------------------|----------|---------|------------|
| Clicktracks                                | 155,396  | 63,339  | 234,930    |
| Google Analytics                           | 148,665  | 63,554  | 231,511    |
| IndexTools                                 | 116,757  | 52,949  | 225,859    |
| Omniture SiteCatalyst                      | 110,211  | 64,016  | 237,108    |
| Unica Affinium NetInsight                  | 101,419  | 57,739  | 196,277    |
| WebSideStory HBX Analytics                 | 110,824  | 63,156  | 222,732    |
| Average                                    | 123,878  | 60,792  | 224,736    |
| Clicktracks %                              | 125.44%  | 104.19% | 104.54%    |
| Google Analytics %                         | 120.01%  | 104.54% | 103.01%    |
| IndexTools %                               | 94.25%   | 87.10%  | 100.50%    |
| Omniture SiteCatalyst %                    | 88.97%   | 105.30% | 105.51%    |
| Unica Affinium NetInsight %                | 81.87%   | 94.98%  | 87.34%     |
| WebSideStory HBX Analytics %               | 89.46%   | 103.89% | 99.11%     |
| Standard Deviation                         | 20,494   | 4,101   | 13,651     |
| Clicktracks Std Deviations                 | 1.54     | 0.62    | 0.75       |
| Google Analytics Std Deviations            | 1.21     | 0.67    | 0.50       |
| IndexTools Std Deviations                  | -0.35    | -1.91   | 0.08       |
| Omniture SiteCatalyst Std Deviations       | -0.67    | 0.79    | 0.91       |
| Unica Affinium NetInsight Std Deviations   | -1.10    | -0.74   | -2.08      |
| WebSideStory HBX Analytics Std Deviations  | -0.64    | 0.58    | -0.15      |

Analysis and Commentary

  1. There were significant differences in the traffic numbers reported by the packages. While we are all conditioned to think of this as a purely mechanical counting process, it is, in fact, a very complex process.


There are dozens (possibly more) of implementation decisions made in putting together an analytics package that affect the counting method each package uses. The different types of first-party cookie implementation described above are just one example.

Other examples include whether the configuration of the package is done primarily in the JavaScript or in the UI, and how a unique visitor is defined (e.g., is a daily unique visitor counted over a rolling 24-hour window, or for a specific calendar day?).

If we look at the standard deviations in the above data, the distribution appears to be fairly normal. For a normal distribution, 68% of scores should fall within 1 standard deviation of the mean, and 95% within 2 standard deviations. Our data above roughly bears this out.

Here is a summary of the raw data:

1.1. While HBX Analytics tended to report the lowest numbers of all the packages, this was certainly not always the case. For example, on AdvancedMD.com, HBX was higher than 2 packages for both visitors and unique visitors.

1.2. Clicktracks reported the highest visitor counts on AdvancedMD.com, with Google Analytics second. On ToolPartsDirect.com, Google Analytics reported the highest numbers across the board, with Clicktracks second in visitors.

AdvancedMD.com and ToolPartsDirect.com receive a large amount of their traffic from Pay Per Click campaigns. While it’s pure speculation on our part, perhaps this plays into some differences in the way that Clicktracks and Google Analytics count visitor, unique visitor, and page view data as compared to the other packages. We are seeking feedback from Clicktracks and Google Analytics on this point, and hope to provide more information in the Final Report.

1.3. On HomePortfolio.com, WebTrends reported significantly more visitors and unique visitors than the other vendors (roughly 10% more than the average of the other three). This is the only site for which we were able to look at WebTrends numbers at this stage of the project.

Google Analytics reported the second highest numbers on this site.

1.4. On CityTownInfo.com, IndexTools reported the highest unique visitor and page view numbers, while Clicktracks reported the most visitors.

  2. As Jim Sterne is fond of saying, if your yardstick measures 39 inches instead of 36 inches, it's still great to have a measurement tool. The yardstick will still help you measure changes with a great deal of accuracy. So if tomorrow your 39-inch yardstick tells you that you are at 1 yard and 1 inch (i.e., 40 inches), you know you have made some progress.


In evaluating the data presented above, you can see that the analytics packages are all reasonably close to one another. For purposes of evaluating the quality of a yardstick, we can conclude that these yardsticks are all of similar measurement quality.

However, referring back to the comScore study, if the first-party cookie deletion rate is, in fact, greater than 30%, that would be of great concern. We will provide more commentary on this issue in the Final Report.

To put this in perspective, classic marketing vehicles have no direct form of measurement whatsoever. How do TV ads drive sales? How about radio ads? There is no direct measurement of the return on these marketing expenditures. The web is unique in its ability to provide a direct measurement of user behavior to a high degree of accuracy. There is no other marketing vehicle like it.

For example, in web analytics you can conduct A/B testing at a remarkably granular level. Using our yardstick, we can tweak our marketing message in numerous ways and get direct feedback on how each change affects the achievement of our business objective. We could, for example, measure the effects of changes such as the following (see the sketch after this list):

  1. Changes to marketing copy
  2. Changes to layout
  3. Changes to the colors used on landing pages
  4. Changes to the actual offers made
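
Here is a minimal sketch of that kind of test. The cookie name, variant split, and reporting call are all hypothetical placeholders, not any specific vendor's API.

```javascript
// Minimal A/B assignment sketch. Cookie name and reporting call are
// hypothetical; a real test would use the analytics vendor's mechanism.
function getVariant() {
  const match = document.cookie.match(/(?:^|;\s*)abVariant=([AB])/);
  if (match) return match[1];                    // sticky assignment
  const variant = Math.random() < 0.5 ? "A" : "B";
  document.cookie = "abVariant=" + variant + "; path=/";
  return variant;
}

function reportToAnalytics(name, data) {
  // Stand-in for the vendor's event/segment call.
  console.log(name, data);
}

const variant = getVariant();
// ...render the copy / layout / offer for this variant...

// On conversion (e.g. form submit), report which variant converted so
// the analytics tool can segment conversion rate by variant:
function onConversion() {
  reportToAnalytics("conversion", { variant: variant });
}
```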


There really is much more that you can do. But the kicker is that you can get close to real-time feedback, and you can rapidly hone your pitch to the customer. On high volume sites, you can get meaningful feedback in less than a day.

Used in this fashion, web analytics software is very accurate. Given enough data, our 39-inch yardstick can easily measure the difference between a 1.5% conversion rate and a 2% conversion rate.

The marketing person placing TV and radio ads might well sell their soul to have that 39-inch yardstick to measure their ad’s effectiveness with a similar level of accuracy.


Content Group Data

  1. Here is the form completion and content group page view data for each of the analytics packages on CityTownInfo.com:

|                               | Form 1 | Form 2 | Form 3 | Group 1 Views | Group 2 Views | Group 3 Views |
|-------------------------------|--------|--------|--------|---------------|---------------|---------------|
| Clicktracks                   | 169    | 567    | 69     | 45,646        | 3,833         | 9,423         |
| Google Analytics              | 172    | 543    | 59     | 59,638        | 4,695         | 12,255        |
| IndexTools                    | 177    | 616    | 68     | 67,166        | 4,891         | 14,461        |
| Unica Affinium NetInsight     | 172    | 572    | 70     | 60,699        | 4,713         | 12,291        |
| WebSideStory HBX Analytics    | 162    | 560    | 69     | 54,889        | 4,274         | 14,763        |
| Average                       | 170    | 572    | 67     | 57,608        | 4,481         | 12,639        |
| Clicktracks %                 | 99.18% | 99.20% | 102.99%| 79.24%        | 85.54%        | 74.56%        |
| Google Analytics %            | 100.94%| 95.00% | 88.06% | 103.52%       | 104.77%       | 96.96%        |
| IndexTools %                  | 103.87%| 107.77%| 101.49%| 116.59%       | 109.14%       | 114.42%       |
| Unica Affinium NetInsight %   | 100.94%| 100.07%| 104.48%| 105.37%       | 105.17%       | 97.25%        |
| WebSideStory HBX Analytics %  | 95.07% | 97.97% | 102.99%| 95.28%        | 95.38%        | 116.81%       |

  2. Here is the content group page view data for each of the analytics packages on HomePortfolio.com:

|                               | Group 1 Views | Group 2 Views | Group 3 Views | Group 4 Views |
|-------------------------------|---------------|---------------|---------------|---------------|
| Google Analytics              | 4,878,899     | 514,704       | 448,355       | 11,823        |
| IndexTools                    | 4,844,642     | 520,521       | 457,857       | 11,540        |
| WebSideStory HBX Analytics    | 2,222,843     | 161,922       | 317,307       | 10,787        |
| Average                       | 3,982,128     | 399,049       | 407,840       | 11,383        |
| Google Analytics %            | 122.52%       | 128.98%       | 109.93%       | 103.86%       |
| IndexTools %                  | 121.66%       | 130.44%       | 112.26%       | 101.38%       |
| WebSideStory HBX Analytics %  | 55.82%        | 40.58%        | 77.80%        | 94.76%        |

Analysis and Commentary

  1. Interestingly, this data appears to be more consistent than the traffic data (we discuss the exception of HBX Analytics running on HomePortfolio.com below). This is largely because it is page view based, and page views are inherently easier to track accurately than visitors or visits, since the analytics software can't detect when a user leaves your site.

For example, if a user comes to your site, leaves to look at some other site, and then immediately returns to a page on your site by typing in the URL directly or via a bookmark, the analytics software can't determine that two different visits took place (it could have been simply a page reload request). The JavaScript only runs on the pages of your site, so it does not know the user left the site between viewing two of your pages.
In addition, if a visitor arrives on the site, goes to lunch for an hour, then comes back and clicks on a link on the page they arrived on, the analytics software sees this as a new visit, even though they never left the page.
The reason for this is that the analytics industry has standardized on a definition of a session (a visit) that ends after 30 minutes of inactivity. Some criterion for handling these situations had to be selected, and this is the one the industry chose.
However, it’s easy for the analytics software to determine page views with greater accuracy because each page view makes a request to the web server of the site, which causes the JavaScript for the analytics software to execute. All the actions being counted take place on the web site.

  2. As an exception to this, the HBX Analytics content group data for HomePortfolio is quite a bit lower than that of the other packages. However, we discovered that this is due to an implementation error by our team.

Note that this is not a reflection of the difficulty of implementing HBX Analytics. Instead, it's a reflection of how important it is to understand exactly what you want the analytics software to do, to specify it accurately, and then to double-check that you are measuring what you think you are measuring.
In the case of the problem above, through no fault of the software, we set it up to track people who initially entered the site at pages in the content group, rather than tracking all page views for the content group, which is what we wanted (see the sketch below).
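
The distinction, in generic terms (this is illustrative page-tag logic, not HBX's actual configuration syntax):

```javascript
// Illustrative only: generic page-tag logic, not HBX's actual syntax.
function reportPageView(data) { /* stand-in for the vendor's call */ }

// What we wanted: report the content group on EVERY page view of
// pages in the group.
function tagPageView(contentGroup) {
  reportPageView({ group: contentGroup });
}

// What our misconfiguration effectively did: report the group only
// when the page was the first one viewed in the visit (an entry page),
// which undercounts the group's total page views.
function tagPageViewEntryOnly(contentGroup, isEntryPage) {
  reportPageView(isEntryPage ? { group: contentGroup } : {});
}
```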
There is a key lesson in this: implementing an analytics package requires substantial forethought and planning. And when you are done with that, you have to check and recheck your results to make sure they make sense. Here is a summary of some of the issues you face in setting up your implementation correctly:

  1. Tagging errors – an error in tagging your pages can really throw you for a loop. You need to do a comprehensive job of verifying that every page carries the correct tags.
  2. Understanding the terminology – each package uses terms in different ways, and it’s important to understand them.
  3. Learning the software, and how it does things – each software package has its own way of doing things.
  4. Learning your requirements – learning your requirements will be a process all by itself. If you are implementing analytics for the first time it may be many months before you truly understand how to use it most effectively on your site.
  5. Learning the requirements of others in your organization – these are not necessarily the same as your personal requirements.
  6. Validating the data – even if you are not running more than one analytics package, you need to have a method of testing the quality of your data and making sure it makes sense. If it doesn’t, then perhaps some of the other steps above were not correctly executed.

One way to reduce many of these risks is to install more than one analytics package. A substantial difference between the packages' numbers gives you a visible clue that something went wrong!
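
Here is a sketch of the kind of sanity check this enables, assuming you export totals from each package; the figures and the 15% threshold are arbitrary illustrations.

```javascript
// Flag metrics where two packages disagree by more than a chosen
// threshold. Totals would come from each package's exports.
function flagDivergence(a, b, thresholdPct = 15) {
  const flags = [];
  for (const metric of Object.keys(a)) {
    const diffPct = 100 * Math.abs(a[metric] - b[metric]) /
                    ((a[metric] + b[metric]) / 2);  // % of the two-package mean
    if (diffPct > thresholdPct) flags.push({ metric, diffPct: diffPct.toFixed(1) });
  }
  return flags;
}

// Example with hypothetical daily totals from two packages:
const pkgA = { visitors: 4100, uniques: 3800, pageViews: 16500 };
const pkgB = { visitors: 4000, uniques: 3700, pageViews: 11900 };
console.log(flagDivergence(pkgA, pkgB));
// -> [ { metric: "pageViews", diffPct: "32.4" } ]  (worth investigating)
```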

Eric Enge

Eric Enge is part of the Digital Marketing practice at Perficient. He designs studies and produces industry-related research to help prove, debunk, or evolve assumptions about digital marketing practices and their value. Eric is a writer, blogger, researcher, teacher, and keynote speaker and panelist at major industry conferences. Partnering with several other experts, Eric served as the lead author of The Art of SEO.
