Didactics: Statistical Language!

Some days ago the Australian Bureau of Statistics ABS has published an e-Magazine about important statistical terms: Statistical language! This is an “educational resource designed to improve the reader’s understanding of some fundamental statistical concepts”.

Statistical Concepts explained

There are not only definitions but also precious explanations about how to calculate and about benefits and downfalls related to these concepts.

Video of OECD-Statistics Sweden Seminar 26-27 May 2008 is ready

Starting with opening speeches of Kjell Jansson, Director General of Statistics Sweden and Enrico Giovannini, Chief Statistician of OECD. Powerpoint-presentations are found at http://www.oecd.org/dataoecd/30/32/40293021.pdf

OECD Factbook visualized with Gapminder

Staffan Landin’s comment on OECD Factbook is worth to be upgraded to a special post!  In fact all the data from this pocket book have been integrated into Gapminder Graphs Community and are now visualized in trendalyzer style.  A huge work as all the tables had to bee uploaded to Google spreadsheets; it would be very interesting to know if there exists (or is planned) a database connection to upload data into Google spreadsheets. I am sure Staffan can give an answer to this issue.

OECD Facrbook with Trendalyzer

Statistics - a personal approach (3)

And here another pesonal comparator, just published: Salarium - Personal wages calculator from Statistics Switzerland.

After filling in a quite detailed form defining your personal profile ….

salarium, form

you get this information about your salary

  • the gross monthly salary (middle or median value)
  • the salary dispersion (interquartile range)
  • factors influencing the salary (table of salary variations by region, level of education etc.)
  • comparison with personal salary data (total salary based on the same salary components as the median salary)

Salarium, results

Statistics - a personal approach (2)

Another example bringing together Statistics and one’s personal situation is about life expectancy (!). It comes from a site called “Understanding Uncertainty“.

” This site is produced by the Winton programme for the public understanding of risk based in the Statistical Laboratory in the University of Cambridge. The aim is to help improve the way that uncertainty and risk are discussed in society, and show how probability and statistics can be both useful and entertaining! “

It brings a lot of very good insights, stuff for educators and this interactive presentation about life expectancy:

About Life Expectancy

Statistics - a personal approach (1)

Statistical information is so close to everyday life and - at the same time - often so far from our personal experience. It’s all about averages, medians … .

Attempts to bridge this gap exist.

Inflation is one prominent example. Several National Statistical Offices have developped inflation calculators giving users a tool to adapt data to specific personal situations and to give better insight in the methods (see the explanations introducing the calculators).

DESTATIS’ personal inflation calculator (using SVG)
DESTATIS-inflation-calculator

This calculator has been adapted to other countries. So by ONS (UK) or INSEE (France)

ONS Personal Inflation Calculator PIC

ons-inflation-calculator

INSEE Simuler un indice des prix personnalisé
insee-inflation-calculator

There is also a Slovenian version and

a Dutch one in Flash (thanks Katja!)

CBS-inflation-calculator

Statistics Switzerland has developed another flavour of such a calculator

swiss-inflation-calculator

swiss-inflation-calculator2

More than just football statistics

EURO 2008 knocks at the door. Statistics Switzerland has created a game which presents some indicators for all the participating football teams:

StatistigOal: the game where football teams up with statistics

Game

Go and play

2 Years

Two years ago Blog about Stats started. Every week a new post, nearly every week a comment and 18,000 visits.

Thanks to all who have contributed to this blog!

blogstats of Blog about Stats

blogstats of Blog about Stats

Dissemination, knowledge and innovation

In May 2008 there have been two international meetings about Dissemination of Statitics.

UNECE in Geneva:

Workshop UNECE

See the documents

OECD in Stockholm: Seminar on Innovative Approaches to Turning Statistics into Knowledge
Seminar Stockholm

See the details

Gapminder invites to “Graphs Community”

Gapminder launches a community for visualising statistics by Trendalyzer and Google Motion Chart. Interested organisations who wants to share their international data are invited. More information at the Gapminder website and in their new Gapminder Charts Development Blog. Have also a look at this nice example created in Motion Chart with Google Spreadsheet.

Discuss graphs in groups

Swivel, well known for giving the possibility to visualize and comment data, has introduced Swivel Business some weeks ago. Swivel Business adds more of Web 2.0 and lets companies and offices visualize their data in closed user groups. Every member can comment (parts of) the graphs and get feedback from other members. To discuss with others is perhaps the best way to a better understanding of what a graph means.

Screenshot Swivel Business features
This interactivity could be used inside Statistical Offices. These offices could also organize groups with external members, for instance with schools. Learning to interprete data and gaining statistical literacy in such a way is quite attractive and also a good opportunity for statistical offices to stay in contact with users.
To get into Swivel Business is not very easy (for the moment) as you have to wait for a telephone call from Swivel before getting a password.

Google Visualization API on Google Spreadsheets

What is the Google Visualization API?
The Google Visualization API lets you access multiple sources of structured data that you can display, choosing from a large selection of visualizations. The Google Visualization API also provides a platform that can be used to create, share and reuse visualizations written by the developer community at large. Read more at http://code.google.com/apis/visualization/ and look at the video.

OECD Factbook 2008 - now “alive” and focus on productivity

The free online economic statistics in the 2008 OECD Factbook (www.sourceoecd.org/factbook) have been transformed to include animated diagrams of the key indicators which bring the data alive visually. The interactive graphs enable users to clearly observe trends in the statistics and interact with the data to perform their chosen comparisons between countries. The Factbook also includes increased data for key non-OECD Member Countries, Brazil, Chile, China, Estonia, India, Israel, Russia, Slovenia and South Africa.

See for example Real GDP growth and Labour productivity.

How to make statistics more accessible

Disseminating statistics to the public is the one side, making statistics accessible the other one. Even if huge databases are actually free and many statistical websites offered to the public there remain many obstacles to get the data to the (right) people.

Issue and agenda setting is one point, metadata another topic in this context. And presentation issues still another.

For the moment let’s concentrate to the latter case. Visualizing is one of the most important strategies for statistics to gain momentum. But that’s not enough: There are countless graphs on the web not being looked at. Maybe there is no (clear) message, maybe that the presentation itself is not attractive enough.

Animating graphs can change this on the presentation side. There are lots of possibilities to do this.

One excellent example of both engaged message and perfect presentation is Hans Rosling with Gapminder/Trendalyzer. Here animated graphs and personal presentation are used together, it is the full monty!

Six minutes has analyzed one of his presentations in detail and gives Six Simple Techniques for Presenting Data:

  1. Explain the data axes
  2. Highlight subsets of data
  3. Dig deeper to unwrap data
  4. Place labels close to data points
  5. Answer the “Why?” questions
  6. Complement data with energetic delivery

Hans Rosling adds number 7. “I may add one aspect, I am lucky to talk about a topic where the actual knowledge is surprisingly low in spite of interest being high, the development of the world.” Some more background about this commitment can be found in H. Rosling’s interview with Discover Magazine.

See the full Six Minutes’ analysis.

Just another analysis to learn from ;) … : How NOT To Use Powerpoint By Comedian Don McMillan

A Tool Shed using Semantic Web

More and more Web 2.0 applications give us more and more possibilities to manage and share our bookmarks, our RSS feeds, our photos and so on: iGoogle, netvibes, linkedin, del.icio.us, flickr, twitter … but let’s be honest:  We don’t have time for all these applications, we risk to get lost.

Now a second generation of applications emerges integrating a lot of these functionalities. twine is such a tool shed worth to be looked at, especially because Web 3.0 or Semantic Web tools (RDF) are used.

Give your email to blogstats at gmx dot net if you want an invitation to use twine beta.

Trendalyzer becomes Motion Chart

Google has activated the trendalyzer graphic style known and bought from Gapminder (thank you Marc Ringuette for the hint in your comment). Google integrates trendalyzer, now renamed Motion Chart, in its spreadsheet. There you can choose several types of visualizations, i.e. Motion Chart.

It’s like in EXCEL and more: The graphs produced from the data can also be exported as gadgets and introduced in iGoogle or in whatever website. The link to Google spreadsheet remains necessary and gadgets can be updated with new data. In fact you don’t get autonomous graphs as known from other trendalyzer clones (see trend graphs).

How to:
In Google spreadsheet data have to be highlighted and with “Insert -> Gadget” a Motion Chart can be produced from these data.

As a gadget choose Motion Chart

Give a title, check data fields … and it’s done

As with other templates it’s not easy to produce good graphical representations. Good concepts and clear messages are needed.
I have made two examples with data from Statistics Switzerland. These are first steps and first lessons learned. To see them in motion you have to leave this wordpress blog not allowing to insert iframes in the content section (is there a trick to do so ?).

Try it yourself !

History of statistics at YouTube!

Widget time

Widgets or gadgets or however named are little programs that can be embedded in your website or your netvibes, iGoogle, pageflakes, Facebook etc. . They take the intelligence from other sites on the web and give it to your site. Popular examples are calendars, weather forecasts, tag clouds …. or soon trendalyzer ;). A complex widget is for instance yahoo.pipes. There are thousands of them out there on www.widgetbox.com or www.programmableweb.com.

Official statistics are using RSS feeds extensively, but not the more elaborate widgets (true?). Search widgets could be a first and very useful test case on huge statistics sites where informations are often hidden to impatiently searching users. A small test can be seen on Statistics Switzerland (German , French ). It uses a widget for Google and a widget, that introduces a search tool with user feedback (www.eurekster.com).

swicki

How to do trend graphs

Analyzing trends in Gapminder style has become a legend. With Google trendalyzer we hoped to get a user friendly tool to produce such graphs but unfortunately this widget has disappeared and Yonatan B, too.

In a recent post Eng. Hisham Abdel Maguid from Epic Systems has shown us this Flash applications http://www.epicsyst.com/main3.swf or http://www.epicsyst.com/visual.swf
He told us that data input is in Excel, xml and Access format and that a program interface with edit menus will be available soon.

And there is another demo in this style by Boomerang

Things are moving,

Comparing websites - a playground or more than that?

There are several tools that offer interactive comparisons between websites. It’s quite amusing playing with such tools. I did it with websitegrader and got results like these for some statistics websites … and for blogstats:

Comparing Wesbites

In general I think these comparisons give a first hint and are just interesting but should never be used or understood as scientific explanations. Deeper investigations are needed.

A test with Statistics France shows only very small differences in a second report. The tool seems to be coherent in itself.

Some explanations about the measured categories (partially from Websitegrader):

“A website grade of 97/100 means that of the hundreds of thousands of websites that have previously been evaluated, Websitegrader’s algorithm has calculated that this site scores higher than 97% of them in terms of its marketing effectiveness. The algorithm uses a proprietary blend of over 50 different variables, including search engine data, website structure, approximate traffic, site performance, and others.” - From Websitegrader. Not very transparent!

Google PageRank relies on the uniquely democratic nature of the web by using its vast link structure as an indicator of an individual page’s value. In essence, Google interprets a link from page A to page B as a vote, by page A, for page B. But, Google looks at more than the sheer volume of votes, or links a page receives; it also analyzes the page that casts the vote. Votes cast by pages that are themselves important weigh more heavily and help to make other pages important.” - From Google

Traffic rank by Alexa . Alexa is an online service measuring millions of sites on the Internet and comparing them.

Inbound links: One of the most important measures for a website is how many other sites link to it. The more links the better.

Google inexed pages: This number is the approximate number of pages that have been stored in the Google index. The Google web crawler will visit the website periodically and look for new content for its index.