Latest posts.

Finding lost content

The Internet can be a frustrating resource. There is constant pressure on webmasters to update their web sites and keep them current. While this is a wonderful imperative it can lay waste to otherwise useful content because it is arbitrarily deemed “too old”. It is estimated that the average lifespan of a web page is 45-75 days.  It is very likely that information you find useful today will not be accessible in 3 months..

Over the last 14 years I have watched as fantastic collections of information have vanished overnight. If information was valuable enough to be published in the first place, it should be given some regard at “update time”. Happily I am not alone in my opinion and efforts are being made to preserve an archive of the Internet.

Google itself offers a short-term solution to recently deceased pages. In the Google results page you will see in the bottom line of each citation a link called “cache”. This is a snapshot of the page the last time Google crawled the site. Given the frequency with which Google crawls the Internet these cached pages are usually no more than a few days old.

The Internet Archive is the most ambitious attempt to archive online content. The site collects publicly available Internet documents using a web crawler. You cannot keyword search the archive but if you know the website you are interested in you can search on the URL. The results will provide you with a list of archived dates available. By clicking on the dates you can access the content as it appeared at the time referenced.

Be forewarned however, sometimes authors and publishers express a desire for their documents not to be included in the archive either by tagging a file for robot exclusion or by contacting the Internet Archive directly. Therefore this resource does not always yield results.

The Canadian government has taken a more thorough approach to archiving its own content. Their strategy is based on two sites.

The first is the Electronic Collection which consists of books and periodicals published online in Canada by both government and non-government publishers. All Canadian publishers are required to deposit copies of their online publications with Library and Archives Canada (LAC). Currently the composition of the collection is 68% federal government, 29% commercial / non-commercial sources and 3% provincial government.

You can keyword search this resource to located documents. It is not only useful as an archive but it provides access to current online publications as well.

The second resource is the Government of Canada Web Archive. This site provides an archive of federal government web sites as a whole. You can search by keyword, by department name, and by URL. The archive however does not provide any content beyond pages requiring user input such as a search screen to a database or content requiring the user to pay.

The pages within the archive are clearly tagged with a bright green bar across the top so that you are well aware that you are looking at an older version of a web site. It should be noted that the content in this archive cannot be accessed via Google.

When you subscribe to our Stats Link Canada Source Lists we provide you with free archive services. If you come across a dead link, simply report it to us along with the reference Stats Link ID.  We will first search for a new live link for you. If that does not exist we will recover an archived copy of the poll, survey or report referenced and e-mail it to you.


Guess What They Said

Some statistical sources are quirky but if they have the one stat you are looking for they can be invaluable. Guess What They Said is one such resource.

This site is primarily a showcase of the Asking Canadians web panel. It publishes the results of topical questions in order to encourage you to request a free no-obligation quote on submitting your own questions for survey.

Some of these results are frivolous and appear to be intended as media filler. For example 600 Canadians who think Jennifer Aniston will find love were asked whether or not they feel the government is prepared for an “inevitable zombie invasion”. (I did not realize a zombie invasion was inevitable. I am pretty sure that scenario is not covered in my emergency preparedness kit!)

Other questions however are relevant such as the survey of Canadians on whether they use paper based phone books regularly or prefer the Internet, the survey of mothers of “tweens” on the type of breakfast cereal they buy for their children and the percentage of Canadians that are concerned about getting old.

You can browse both current and archived results. They are organized into six major groupings: Tech, Politics, Business, Sports and Entertainment and Lifestyle. You need to be careful when you consult certain question as the survey sample in some cases is very specific. For example body art enthusiasts were surveyed on their favourite movie of the summer of 2008

As this is primarily a marketing tool as opposed to a research resource the results have an interactive gimmick attached – you have to try to guess the results by sliding the answer bars to the appropriate percentage before they reveal the actual numbers. In a pinch you could even use this site as a party game! The site calculates the accuracy of your guess and compares it to the average accuracy score of everyone who tried the survey.

If you want to be part of this panel and participate in the surveys with the chance to win prizes see the Asking Canadians web site.

Admittedly this resource will be long shot for most researchers but if it covers any aspect of your market once it could prove helpful. We are now tracking the more serious results (sorry zombie-fearing Jennifer Aniston fans that does not include you) in our Stats Link Canada Source Lists.

New Stats Pack now available (2010-4)

I have added another Stats Pack.  The difference between the  number  of entrants into vs. exits from entrepreneurship during the recent downturn was impressive!

The following topics are covered:

1. Self-employment in Canada by age and gender (October 2009)

2. Average amount Canadian entrepreneurs spend on business banking fees per month

3. Median amount of financing approved by size of business  (2003 vs.2009)

4. Incidence of minimum wage-jobs by firm employee-size (2009)

5. Entry into and exit out of self-employment (October 2008 – October 2009)

You can access this Stats Pack (2010-4) in HTML format on our Stats Link Canada web site.  A PDF version is available at the the bottom of the linked page.

For more details and data related to these highlights use the Stats Link Canada ID numbers identified in the Stats Pack.