<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
   <title>Visuale</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/" />
   <link rel="self" type="application/atom+xml" href="http://diuf.unifr.ch/people/bertinie/visuale/atom.xml" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale/1</id>
   <updated>2009-11-10T07:58:03Z</updated>
   
   <generator uri="http://www.sixapart.com/movabletype/">Movable Type Open Source 4.1</generator>


<entry>
   <title>BELIV&apos;10 Workshop on InfoVis Evaluation</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/11/beliv10_workshop_on_infovis_ev.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.76</id>
   
   <published>2009-11-06T10:31:37Z</published>
   <updated>2009-11-10T07:58:03Z</updated>
   
   <summary>It&apos;s my great pleasure to announce here BELIV&apos;10: BEyond time and errors: novel evaLuation methods for Information Visualization, a workshop on infovis evaluation I am organizing together with Adam Perer from IBM and Heidi Lam from Google. The idea of...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
      <category term="events" scheme="http://www.sixapart.com/ns/types#category" />
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[It's my great pleasure to announce here <a href="http://www.beliv.org/beliv2010/">BELIV'10: BEyond time and errors: novel evaLuation methods for Information Visualization</a>, a workshop on infovis evaluation I am organizing together with <a href="http://www.perer.org/">Adam Perer</a> from IBM and <a href="http://people.cs.ubc.ca/~hllam/">Heidi Lam</a> from Google.

<a href="http://www.beliv.org/beliv2010/"><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="Beliv_logo_large.png" src="http://diuf.unifr.ch/people/bertinie/visuale/Beliv_logo_large.png" width="448" height="169" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span></a>

The idea of the workshop was born in 2006 almost by chance when me, my phd advisor <a href="http://www.dis.uniroma1.it/~santucci/">Giuseppe (Beppe) Santucci</a>, and <a href="http://www.cs.umd.edu/hcil/members/cplaisant/">Catherine Plaisant</a> had the possibility to organize a workshop at <a href="http://www.dsi.unive.it/avi2006/">AVI 2006</a> and felt that it was time to gather some people and talk about the problem of evaluation in infovis.

The first workshop was a real success with very interesting discussions and (highly cited) papers out of it. After this experience we thought that having a BELIV every two years could be a good idea and the right time frame. In fact we organized it again at CHI 2008 and now again at <a href="http://www.chi2010.org">CHI 2010</a>.

The goal of BELIV is to raise fundamental questions about evaluation. The main big question around the workshop, and the reason why we believe it is important, is that visualization still needs to explain <strong>why, when, and how it is useful</strong> and <strong>we don't have the right tools yet to fully answer these questions</strong>.

So, if you are interested in participating there are two options: submit a position paper or a regular research paper. Position papers present a personal view on evaluation and are meant to introduce your point of view in the workshop discussion. Research papers are meant to provide substantial contributions to the research community with novel ideas.

A great plus of this edition is that it is <strong>two-day workshop</strong> and that it will be <strong>a lot more interactive</strong> than past editions. Despite its success many participants to past editions voiced the need to have less presentations and more productive discussions, and we strongly agreed with them.

So, BELIV 2010 will be based on short presentations on Day 1, with planned sessions to collect relevant issues for the next day. Day 2 will be all centered around the discussion of the collected issues.

Many more details can be found at the <a href="http://www.beliv.org/beliv2010/">workshop website</a>, where you can also find links to previous editions so that you can better understand what a BELIV workshop is.

For any questions please send me an email or post a comment here.]]>
      
   </content>
</entry>

<entry>
   <title>Toning down the enthusiasm: it&apos;s just plain data</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/07/estingushing_the_enthusiasm_ab.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.74</id>
   
   <published>2009-07-13T19:17:41Z</published>
   <updated>2009-07-14T07:12:53Z</updated>
   
   <summary>Data is largely available, no question. Everywhere we hear that the new big trend is data crunching and that the great thing of years 2000 is the large provision of freely available data sets. Just recently the US Government has...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="enthusiasm.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/enthusiasm.jpg" width="116" height="118" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></span>Data is largely available, no question. Everywhere we hear that the new big trend is data crunching and that the great thing of years 2000 is the large provision of freely available data sets. Just recently the US Government has released <a href="http://www.data.gov/">data.gov</a>, and this event has been acclaimed by numerous people <a href="http://eagereyes.org/blog/2009/data-gov.html">in visualization</a> and in data analysis as a big step toward a better world. Ok, data is good. All of it is good. But I think we are getting overly excited about it.

<p>
I see a dangerous trend here: thinking that data is the only thing we need and that having large data at our hands will solve some problems. But, data per se has no real value if it is not related to problems and people! So, I see many new interesting web sites (and tools!) popping up on the Internet but I don't see any guidance about what to do with them.]]>
      <![CDATA[<p>
In realistic settings, where visualization and data analysis tools are really needed, people is not enthusiastic about data but about the impact data can have in their context once it is analyzed. An this context is one rich of background knowledge and business goals (in its broadest meaning). I don't see the same kind of richness around.

<h3>Tasks and people are the scarce resource</h3>
Basic economy teaches us that values is generated from scarce resources and not from what is broadly and readily available. What we really need therefore is not only access to data sets but also to real problems and to the people who care about them.

<p>
Unfortunately, while today data can be readily collected and transferred, problems and people cannot. If we don't realize this, we risk to run millions of useless studies, build thousands of useless tools, and waste enormous amounts of energy and resources.

<p>
If at least tasks would be joined to data, the situation would be largely improved. Instead of guessing what these data are for, we could try to solve some real problems connected to them. Take for instance the data.gov website, what if people could post interesting questions? Or what if another website would be available where people post data AND tasks?

<h3>Interesting good examples</h3>
Some great examples come from research areas like knowledge discovery and visual analytics. 

<p>
Take the <a href="http://www.sigkdd.org/kddcup/index.php">KDD Cup</a>. It is organized every year at KDD, the premier international conference on knowledge discovery. Real world data is published to let researchers compete on a series of pre-defined TASKS. The last <a href="http://www.kddcup-orange.com/">KDD 2009</a> is an excellent example. After few lines of description the web page has a large title: "Task Description". The competition is based on a real data set provided by Orange, the French telecommunications company, and the goal is to do better than a data mining tool developed internally. The tasks is to: "estimate the churn, appetency and up-selling probability of customers" ... "churn rate is a measure of the number of individuals or items moving into or out of a collection over a specific period of time" ... "the appetency is the propensity to buy a service or a product." ... "Up-selling can imply selling something additional, or selling something that is more profitable or otherwise preferable for the seller instead of the original sale. " Better than just data, isn't it?

<p>
The <a href="http://hcil.cs.umd.edu/localphp/hcil/vast/index.php">VAST Challenge</a> is another excellent example. It is organized every year within the <a href="http://vis.computer.org/VisWeek2009/vast/">Symposium on Visual Analytics Science and Technology</a>. Similarly to the KDD Cup, a new data set with tasks is published every year. The problems are selected in a way that visual analytics technology is needed to solve them, that is, plain automatic methods without iterative user sense making, are unlikely to be the solution. Another great thing about this challenge is that data is synthetically generated so that ground truth is available. In practice, this means that different solutions can be compared in terms of their ability to discover what there is to be discovered. So, the VAST Challenge provides complex data AND tasks. But even better they also provide people! Since the 2007 edition the contest includes also a session where contest winners have the opportunity to run a small contest live together with real analysts. This is exactly the direction to take in my humble opinion.

<h3>Conclusion</h3>
I don't want to give the impression that data is not important or that its wide availability is not a great thing. It is! But as data turn into a commodity there are other factors that become relevant. Having meaningful tasks and access to real people trying to solve problems is a lot more important, and a lot less likely to become a commodity. What will count in the future (present?) both for researchers and practitioners is not data but people.

<p>
I think it is important to recognize this limits and opportunities and start behaving accordingly. It is for this reason that I am not overly enthusiastic about having a lot of data. And I think the the sooner we start differentiating between just data and data + problems the better will be for all of us.]]>
   </content>
</entry>

<entry>
   <title>Against toolkit fetishism: so many libraries, so few tools!</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/06/im_sick_and_tired_so_many_libr.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.70</id>
   
   <published>2009-06-22T10:56:00Z</published>
   <updated>2009-06-22T20:24:17Z</updated>
   
   <summary> I am sorry guys, I feel a strong need to share my frustration with you today. I have discovered yet another infovis library to create the most beautiful visualizations in the world and instead of being excited I am...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="desperate-businessman.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/desperate-businessman.jpg" width="160" height="106" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></span> I am sorry guys, I feel a strong need to share my frustration with you today. I have discovered yet another infovis library to create the most beautiful visualizations in the world and instead of being excited I am depressed. That's great I really champion the effort of these good guys but a tough question keep hammering in my head: <strong>why so many libraries and so few tools?</strong> Libraries are great and really needed to speed up the development process but here I perceive a dangerous trend: there are a lot more libraries than real tools written with them!]]>
      <![CDATA[<p>
There are a lot of people out there waiting for our useful tools to come and I think it is time we realize that developing real tools for real people is more important than writing toolkits. Personally, I am totally ready to accept a world with no toolkits and lots of tools.

<p>
I give a look around on the web and I cannot find a decent visualization tool freely available, only few expensive highly technical commercial tools. As Stephen Few pointed out in <a href="http://diuf.unifr.ch/people/bertinie/visuale/2007/12/stephen_fews_vast_keynote.html">his talk at InfoVis</a> a couple of years ago, there is a whole bunch of casual users out there whose job includes the need to analyze data. So, what are we waiting for? Who is expected to build these tools?

<p>
I am worried by this short sighted view and this auto-referential culture where infovis people build things for other infovis people, that's it. We develop libraries and then set up fancy examples to show to ourselves and our peers how good we are. Ok, this is useful and needed to some extent. It helps building a community, sharing knowledge and to consolidate good practices. But if we want to go to the next level and let infovis go beyond the <em>toy tool</em> stage, we have to go one step further and embrace the much riskier and tough question: who will use it?

<p>
I see so few examples around that I'm kind of embarrassed to talk about it. Can you list any serious and freely available tool that an average user could use in his or her daily activity? Do we maybe have something that minimally resembles a free <a href="http://spotfire.tibco.com/">Spotfire</a>? We have a myriad or little toy vis scattered around on the web and nothing in our hands.

<p>
There are very rare exceptions. Robert Kosara has recently published its <a href="http://eagereyes.org/parallel-sets">Parallel Sets</a> in his <a href="http://eagereyes.org/">EagerEyes</a> and plans to keep the burden of maintaining it over the next follow up versions. This is a great thing. Parallel Set would not solve the analytic problems of the entire world but it is a step towards this direction. Therefore bravo Robert!

<p>
Another tool I've seen around recently is <a href="http://verifiable.com/welcome?id=11">Verifiable</a>. A very nice and well done tool to create charts directly on the web. Nothing really revolutionary, but what it does it does it well and with an extremely clear interface.

<p>
These have the shape of tools made for end users and this is what we need. C'mon folks, libraries are great but we need to show yet what we are able to do to the entire world. Let's develop tools, tools, tools!!!]]>
   </content>
</entry>

<entry>
   <title>EuroVis 2009 Report</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/06/shame_on_me_i_didnt.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.72</id>
   
   <published>2009-06-16T07:19:42Z</published>
   <updated>2009-06-16T10:47:25Z</updated>
   
   <summary>Shame on me, I didn&apos;t keep my word on reporting from EuroVis. Anyway here is a very small selection of the remarkable things I have seen during the conference. Pat Hanrahan&apos;s talk was really deep and thoughtful. A lot of...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[Shame on me, I didn't keep my word on reporting from EuroVis. Anyway here is a very small selection of the remarkable things I have seen during the conference.

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="animal-movement_small.png" src="http://diuf.unifr.ch/people/bertinie/visuale/animal-movement_small.png" width="300" height="74" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span>

Pat Hanrahan's talk was really deep and thoughtful. A lot of new basic material to think about visualization under a new lens. 

Then I forced myself to select only 3 paper out of the long program. They are engaging and new in some sense. Obviously this is totally personal. And there were many other good ones worth reading.]]>
      <![CDATA[<p>
<h3>Pat Hanrahan's Keynote</h3>
As I expected the talk was really great, but in a very special way that I couldn't imagine. The talk was titled "Systems of Thought" and was totally based on a deep retrospective of the basic scientific notions underlying visualization.

<p>
<a href="http://www.graphics.stanford.edu/~hanrahan/">Hanrahan</a> started his talk putting visual thinking in the context of various systems of thoughts, that is, the tools we humans have to reason and communicate, like: natural language, logic, mathematics, etc. It was kind of shocking to see how in this context visual thinking seems less developed and weaker than the other systems.

<p>
Then the talk went on by discussing when and how visualization is better than other forms of thinking, introducing basic studies from people like Herb Simon and Donald Norman, and demonstrating that a change in representation can greatly affect the way we perceive a problem and our ability to solve it.

<p>
The talk ended with a series of key questions suggested as a way to proceed when designing a new visualization:

<ol>
<li>What is the problem you are trying to solve?</li>
<li>How do you think about the problem? What are the semantic objects and their relationships?</li>
<li>What visual representations are already used? How does the visualization represent those objects and support reasoning about them?</li>
<li>How can the manipulation of the representation be embodied in the interaction?</li>
<li>How can visualization be coupled with other systems of thought?</li>
</ol>

They are meant to be asked in the given order and interestingly they offer a viable plan on how to proceed in visualization design. Asking about the problem first is really fundamental in my opinion because we still have too many solutions in search of problems around.

<p>
Thinking about the semantic objects in the domain of the intended users is also a key point, as well as the related need to see how currently people solve the problem.

<p>
The manipulation of the representation puts interaction at the center of the problem and give it its due role. Visualization is not only visual representation but also the great power of visual manipulation.

<p>
Last, coupling visualization with other systems of thought is really a great advice. When designing visualization there is always the risk to focus too much on the tools and overlook the whole system, tasks, and processes happening off of the screen. I'm not sure if this is what Pat intended but I think that reasoning on how visualization can be coupled with other system is in part a way to take a broader look and include the whole ecology of reasoning artifacts found around a problem.

<p>
If you want to know more I suggest to give a look the the slides (<a href="http://www.graphics.stanford.edu/~hanrahan/talks/thought.pdf">pdf</a>). Just collecting the cited references and reading those paper would be an excellent exercise to deepen our thoughts around visualization.

<h3>Papers</h3>
This is a very short and personal selection of the papers I really liked.

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="animal-movement.png" src="http://diuf.unifr.ch/people/bertinie/visuale/animal-movement.png" width="494" height="122" class="mt-image-none" style="" /></span>

<strong>Visualisation of Sensor Data from Animal Movement</strong> (<a href="http://www.cs.swan.ac.uk/~csbob/research/biologySensor/grundy09visualisation.pdf">pdf link</a>), by: Edward Grundy, Mark W. Jones, Robert S. Laramee, Rory P. Wilson and Emily L.C. Shepard - This also won the best paper award. These guys have developed a system to analyze the movement of animals around the world taking data from accelerometer sensors. The quality of the final images is surprising. Lots of interesting patterns can be seen and it is not hard to imagine how useful it is for researchers using these data. 

<p><p>
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="chinese-room.png" src="http://diuf.unifr.ch/people/bertinie/visuale/chinese-room.png" width="696" height="199" class="mt-image-none" style="" /></span>

<strong>Visualization of Vessel Movements</strong> (<a href="http://www.win.tue.nl/~cwillems/public/eurovis09.pdf">pdf link</a>), by: Niels Willems, Huub van de Wetering, Jarke J. van Wijk - I have to admit it, I like this paper for the beauty of the images. GPS data from vessels is collected and used to create a sort of map of water highways. A clever rendering technique is used to represent both high level overview patterns and fine details of specific routes.

<p><p>
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="chinese-room-2.png" src="http://diuf.unifr.ch/people/bertinie/visuale/chinese-room-2.png" width="414" height="296" class="mt-image-none" style="" /></span>

<strong>The Chinese Room: Visualization and Interaction to Understand and Correct Ambiguous Machine Translation</strong> (<a href="http://vis.cs.pitt.edu/results/downloadPub.php?id=259">pdf link</a>), by: Joshua Albrecht, Rebecca Hwa, G. Elisabeta Marai - I included this paper because it is a very good example of integration between machine intelligence and human intelligence. The system uses the output of an automatic Chinese to English translator to visualize the structure of the result. The user can better understand this structure visually and rearrange the items on the screen to create better results.

<h3>Final Thoughts</h3>
Overall EuroVis was really a great event, with a European touch that I like. The best thing about it is that it is small enough to have time to meet nice people, be engaged in interesting discussions and have some fun. The overall quality of the papers was quite good and I have the impression it is becoming better and better. So I'm looking forward to the next EuroVis in Bordeaux: French atmosphere and lots of wine!]]>
   </content>
</entry>

<entry>
   <title>Reporting from EuroVis 2009</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/06/reporting_from_eurovis_2009.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.71</id>
   
   <published>2009-06-08T17:58:45Z</published>
   <updated>2009-06-09T14:04:59Z</updated>
   
   <summary>Hi there! I&apos;m writing from Berlin where I came to attend the EuroVis 2009 conference. EuroVis is the premier conference on visualization in Europe and every year it hosts a mix of very interesting SciVis and InfoVis works. The program...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[Hi there! I'm writing from Berlin where I came to attend the <a href="http://www.zib.de/eurovis09/">EuroVis 2009</a> conference. EuroVis is the premier conference on visualization in Europe and every year it hosts a mix of very interesting SciVis and InfoVis works.

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="eurovis.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/eurovis.jpg" width="448" height="150" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span>

The program is out and you can give it a look <a href="http://www.zib.de/eurovis09/?page_id=780">here</a>. The program comprises also a <a href="http://www.zib.de/eurovis09/?page_id=689">keynote</a> from <a href="http://graphics.stanford.edu/~hanrahan/">Pat Hanrahan</a>, which I'm really looking forward to see. It has the promising title: "<em>Systems of Thought: When to Use Visual Representations in Problem Solving</em>".

This is just a short notice to tell you that I intend to post at least one post a day to wrap up on the things I see and to share my thoughts with you. I'll also try to showcase the most interesting works.

---------------
P.S. If any of you is also attending EuroVis'09 drop me a line, we might end up drinking a good German beer in one of the many wonderful places this city has ;-)]]>
      
   </content>
</entry>

<entry>
   <title>Sensemaking ok, but ACTION is what they need</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/06/action_the_missing_word_of_the.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.69</id>
   
   <published>2009-06-04T08:48:28Z</published>
   <updated>2009-06-04T10:25:28Z</updated>
   
   <summary>Yesterday in a meeting with our industrial partners I received yet another lesson. Simply put: though fancy and well-crafted visualization is useless if it doesn&apos;t help people take actions. Ok I must admit it, this is maybe only true in...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
      <category term="ideas &amp; thoughts" scheme="http://www.sixapart.com/ns/types#category" />
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="action.gif" src="http://diuf.unifr.ch/people/bertinie/visuale/action.gif" width="314" height="189" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></span>Yesterday in a meeting with our industrial partners I received yet another lesson. Simply put: though fancy and well-crafted <strong>visualization is useless if it doesn't help people take actions</strong>.

Ok I must admit it, this is maybe only true in business sectors (is it?) but what I come to realize is that we infovis enthusiasts are too much focused on the never ending refrain that visualization is useful to <em>explore</em> data and that we need it to <em>make sense of things</em>.

This is certainly true but this is only part of the story. Take the million managers out there. Not trained to cope with complex stats or charting tools but desperately in need to take decisions based on data. What do they need? To explore and make sense of thing? Sure, to some extent ... but ultimately to take complex decisions in a very constrained setting and tight time limits.]]>
      <![CDATA[By not taking into account this perspective in our work as researchers and designers we miss a bunch of fabulous opportunities:

<ul>
<li><strong>Better constraints:</strong> If we take the ultimate business goal in mind when designing a visualization tool, we have additional constrains and constraints in design are not just good, they are fantastic! By having constraints we can focus on clear objectives and guide our work through them.
</li>

<li><strong>Measures of success:</strong> If the tools we design help people make decisions, take action, and see the outcome, the measure of our success is suddenly clear: we are successful if our users/customers are able to take clever decisions in a short time, and ultimately if they have success with them. It reminds me the never aging and inspiring advice of <a href="http://www.cs.unc.edu/~brooks/">Prof. Brooks</a> in his <a href="http://www.cs.unc.edu/~brooks/Toolsmith-CACM.pdf">Computer Scientist as Toolsmith essay</a>: 
<blockquote>"<em>If we perceive our role aright, we then see more clearly the proper criterion for success: a toolmaker succeeds as, and only as, the users of his tool succeed with his aid. However shining the blade, however jeweled the hilt, however perfect the heft, a sword is tested only by cutting. That swordsmith is successful whose clients die of old age</em>"</blockquote>
</li> 

<li><strong>Conquer market segments:</strong> If we are able to give people what people really need it is a win-win situation. They can do their work faster, better, with higher accuracy and we let out field thrive and become more known, more useful, more developed and more mature. Oooh and yes ... for those in academia like me: we should not underestimate the need to have successful products in the market coming out from our discipline. Our success depends also on them.
</li>
</ul>

So in short, I believe an excessive focus on data exploration, sense making and the like is detrimental to our discipline and to the pool of our potential users. Don't get me wrong, I still believe data exploration and sense making are the cornerstone of visualization and by no means I am suggesting to abandon them. I just believe that taking decisions and actions in mind as guiding principles can add up something to what we have already in the box and create a more winning formula.

<h3>Data Mining vs. Visualization</h3>
On a side note, I think it is useful to make a parallel between data visualization and data  mining and understand how they differ, how they are perceived and why their success is different.

I don't think you can call me heretic if I say that Data Mining has had a far better success than visualization so far. And I think the main motivation resides on what I am suggesting here. The good thing about data mining and statistics is that they can produce better actionable knowledge than infovis. In a typical scenario, data mining can crunch some numbers and spit the response about which customers are more likely to respond to a marketing campaign. That simple: crunch some numbers, produce a list of prospective customers, send letters to them. The last point is what matters: "send letters to them", an action.

Note one very important thing: in data mining people don't even need to make sense of things to make decisions, they just need to have "reasonable" confidence on the quality of results. I agree that this is also the limit of this domain and that the excessive reliance on a black-box way of doing data analysis can be dangerous. But this is what works and the results are not bad then! If we want to evolve and become better we have to accept this state of things and create a better formula. Visualization has the power of opening the black box and at the same time retain the same power of the existing tools. But I don't see many solutions out there going into this direction.

I don't think that necessarily we have to make visualization tools that overlap with the goals covered by data mining but I'm totally sure that this shift in perspective can enormously help us making our infovis edge a lot sharper.

Do you agree? Or maybe disagree? Any comments? Suggestions?]]>
   </content>
</entry>

<entry>
   <title>New Stephen Few&apos;s book out soon: &quot;Now you see it&quot;</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/05/new_stephen_fews_book_out_soon.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.68</id>
   
   <published>2009-05-28T22:09:29Z</published>
   <updated>2009-05-28T22:48:08Z</updated>
   
   <summary>Almost by chance I discovered there is a new Stephen Few&apos;s book coming out soon. It is has a wonderful title: &quot;Now you see it&quot; and I couldn&apos;t be more excited about it. Few weeks ago I posted a blog...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[Almost by chance I discovered there is a new <a href="http://www.perceptualedge.com/">Stephen Few</a>'s book coming out soon. It is has a wonderful title: "<a href="http://www.amazon.com/gp/product/0970601980?ie=UTF8&tag=perceedge-20&linkCode=as2&camp=1789&creative=9325&creativeASIN=0970601980"><em>Now you see it</em></a>" and I couldn't be more excited about it.

<a href="http://www.amazon.com/gp/product/0970601980"><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="NYSI_cover_small.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/NYSI_cover_small.jpg" width="175" height="230" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span></a>

Few weeks ago I posted a blog post titled "<a href="http://diuf.unifr.ch/people/bertinie/visuale/2009/03/the_lack_of_cookbooks.html">Book for practitioners, not designer!</a>" claiming that we are desperately in need of books that teach how to perform interactive visual data analysis. And this book looks like just the perfect response to what I expressed in my post. From the book description:

<blockquote>"<em>Now You See It does for data analysis what Stephen Few's book Show Me the Numbers does for data presentation: it teaches simple, fundamental, practical techniques that anyone can use--only this time they're for making sense of information, not presenting it.</em>"</blockquote>

Wow, This is what I was looking for! A book that teaches how to analyze data with visual interactive tools. And one written for casual users, in need to analze some data in their work, not for highly skilled statisticians and engineers. And not for designers.

I'm really looking forward to reading it. I hope (I'm sure though) it will meet my expectation. Well done Stephen!

P.S. It has such a beautiful cover! Isn't it?]]>
      
   </content>
</entry>

<entry>
   <title>No more excuses: a list of references to learn how to use color</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/05/infovis_color_theory_in_few_li.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.67</id>
   
   <published>2009-05-27T16:05:00Z</published>
   <updated>2009-06-01T11:32:52Z</updated>
   
   <summary>... and finally stop polluting our eyes I&apos;d say!I was talking with Ilya, a new PhD student in our department, the other day and in front of a prototype he developed he said something like: &quot;oh yes, and I should...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[... and finally stop polluting our eyes I'd say!<br /><br /><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="color-vis.gif" src="http://diuf.unifr.ch/people/bertinie/visuale/color-vis.gif" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" height="130" width="600" /></span><p>I was talking with <a href="http://www.boyandi.net/">Ilya</a>, a new PhD student in our department, the other day and in front of a prototype he developed he said something like: "oh yes, and I should find the right color mapping here but ... how?" Oh well ... good question! Originally I wanted to write a whole new post on it but after some reasonings I came to the conclusion that not only it is a daunting task but also and more importantly I don't know enough to seriously teach about it.<br /></p><p>But wait a minute, does it mean I cannot help him and the ever increasing pool of poor color choosers? No, there is one thing I can do at least: share my list of favorite sources of information on color. And maybe add some tips and rules of thumb I often use for myself.<br /></p><p>So, no more excuses to use poor color schemes. Here is my annotated list of resources, plus some personal tips.<br /></p>]]>
      <![CDATA[<h2>Research Papers</h2>

List of papers I found most useful in understanding color in use. Some of
them are written more for the general public, some others require quite
some effort to understand. They cover however a very large part of what
should be learned and the effort is largely payed off.

<p>
<b><a href="http://www.personal.psu.edu/faculty/c/a/cab38/ColorSch/ASApaper.html">Color Use Guidelines for Data Representation</a></b>. Brewer, C. A.,<i>Proceedings of the Section on Statistical Graphics, American Statistical Association,
</i>Alexandria VA. pp. 55-60 (1999).
<br /><b>[ If you can read only one, read this ]</b>
<br />If
you don't have time to read and you need one single source for practical
advice stop here. This is the best and conciser explanation about how to use color in visualization you'll ever find. Cynthia Brewer is a
cartographer and focused much of her work on color in geographical
data but her suggestions apply broadly to any kind of data. You may
see the result of her work in <a href="http://www.personal.psu.edu/cab38/ColorBrewer/ColorBrewer_intro.html">Color Brewer</a>,
an on line tool to learn how to select color scales. The tool alone is
an eye-opener for those who don't know anything about the topic.

<p>
<b><a href="http://www.research.ibm.com/dx/proceedings/pravda/truevis.htm">How NOT to lie with visualization</a></b>. BE Rogowitz, LA Treinish, S Bryson, <i>Computers in Physics</i> (1996).
<br /><b>[ More into color for SciVis but still very useful and great examples ]</b>
<br />This is another classic, quite short and easy to read. I like it especially
for its focus on how harmful color can be if not used properly. The use
of color is discussed more in the context of <i>scientific visualization</i>
where continuous shades of color are often the case, like in medical
images and geographical mapping, but the results can be applied to any
other visualization. It is especially interesting the notion that different
color mapping strategies should/can be used according to the task at
hand (e.g., segmentation, highlight, etc.).

<p>
<b><a href="http://www.ub.uni-konstanz.de/kops/volltexte/2008/7046/pdf/TVCG00.pdf">Designing pixel-oriented visualization techniques: Theory and applications</a></b>. DA Keim, IEEE Transactions on Visualization and Computer Graphics</i> (2000).
<br /><b>[ Discussion (and code) of a "perceptually" optimal color scale ]</b>
<br />Though this is not only about color, the paper contains a very useful section on color and on how to build a <i>perceptually optimal</i> color scale. The color scale is called HSI (Hue, Saturation, Intensity) and is a variation over the most common RGB, HSB, etc.
The very good point about it is that it is a very rare example of
article where both color theory and practical implementations are
discussed in the same place. The HSI color scale can be easily re-implemented by following the code they provide in a related paper: </font><a href="http://infovis.uni-konstanz.de/papers/1995/VDB95.pdf">Issues in visualizing large databases</a>. DA Keim, HP Kriegel - <i>Proc. Conf. on Visual Database Systems</i>, VDB'95 (1995).

<p>
<b><a href="http://portal.acm.org/citation.cfm?id=617723">Color Scales for Image Data</a></b>. H. Levkowitz, G. T. Herman, IEEE Computer Graphics and Applications</i> (12):1 pp.72 - 80 (1992).
<br /><b>[ Some relevant psychophysics theory and its relevance in color scale design ]</b>
<br /> This is a purely theoretical paper. I included it because it contains some
information that is difficult to find elsewhere. And also because I
find it especially intriguing. Here we learn that (1) not all
differences in color intensity are perceived by our eyes and (2) that a
linear increase in color intensity is not necessarily perceived
linearly. The concept of <a href="http://en.wikipedia.org/wiki/Just_noticeable_difference">Just Noticeable Difference</a> (JND) is
introduced and applied to color scale design. One practical consequence
is that it doesn't matter how well we map our data to color,
some differences will always be lost.

<p><b><a href="http://portal.acm.org/citation.cfm?id=245597">Choosing Effective Colours for Data Visualization</a>.</b> Healey, C. G., <i>Proceedings IEEE Visualization '96</i>, pp. 263-270 (1996).
<br /><b>[ Not easy read, hard-core experimentation, but unique info on categorical colors ]</b>
<br />
This is even more theoretical than the paper above. And be warned, it is not
an easy read! Anyway, I put it in the list because it is the only
"serious" reference I know where the selection of categorical colors,
that is, colors that represents categories and not quantity, is
discussed in fine details and an algorithm for their selection is
discussed. Here we learn that color is not as powerful as we may think. The
maximum number of distinguishable colors we can use to label data is
around 12. Not so many indeed!

<p>
<h2>Book Chapters</h2>

<p>
<b><a href="http://www.amazon.com/Information-Visualization-Second-Interactive-Technologies/dp/1558608192">Information visualization: perception for design</a></b> <b>(Chapter 4: Color)</b> by Colin Ware.<br />Colin
Ware's book is simply the best resource for whatever concerns
perception theory applied to visualization. Admittedly, this is
probably the best book on visualization ever. Chapter 4 is all about
color theory and its content is obviously great. Theory and practice
are well balanced and useful examples are illustrated throughout the
chapter. I think it only missed practical advices and how to implement
the suggestions in practice, but ok, maybe this would be out of the
scope of the book.
</p>

<p><b><a href="http://www.amazon.com/Envisioning-Information-Edward-R-Tufte/dp/0961392118">Envisioning Information</a> (Chapter 5: Color and Information)</b> by Edward Tufte.<br />I
don't think this book needs any introduction. It is part of the famous
Tufte's trilogy and of course it contains some indications on color
use. Even if here one can find many of the things discussed in other
books and papers, but in a useful summarized version, it also contains
some unique content in the usual original Tufte's style. A great piece
of knowledge here is given right away as the chapter opens. Tufte
summarizes color uses in information design as: to <i>label</i>, to <i>measure</i>, to <i>represent or imitate reality</i> and to <i>enliven or decorate</i>. These few tasks provide a useful framework around the work of a visualization designer.
</p>

<p><b><a href="http://www.amazon.com/Show-Me-Numbers-Designing-Enlighten/dp/0970601999">Show Me the Numbers</a> (Chapter 6: Visual perception and quantitative communication)</b> by Stephen Few.<br />This
chapter written by Stephen Few is the best summary I have ever seen on
visual perception theory applied to visualization. Here you will find
not only how to use color effectively but also how to boil down basic
theory on how human vision works to few simple rules to apply in visual
design. In a way it can be considered a sort of Colin Ware's book
compressed in one pill. So again, if you don't have enough time to read,
pick this one and study this chapter. You won't regret your choice.
</p>

<p>
<h2>Tips and rules of thumb</h2>

<p>Finally I try to put something myself. This is just a random list of rules I learned the hard way by doing.</p>

<ul>
<li><b>Don't overestimate the power of color</b>
- Color is attractive and powerful and let's admit it, it is what makes
most of our visualizations pretty and nice to see. But for any serious
use it is important to realize how limited it is. The number of colors
we can easily distinguish is incredibly low (this you can learn it from
the refs above). For instance, it is estimated that the maximum number
of categorical colors we can easily detect in a representation is
around 12. Similar figures holds when presenting continuous data.
Compared to other data features like position, length, size, it is
visually perceived less efficiently. So just don't believe color
mapping will do wonders, it is useful within its bounds.
<br><br>
</li>

<li><b>Always provide a color legend</b>
- I think this one goes in the list of the most common mistakes in
visualization: some data feature is represented with color but then
there's nothing in the interface that tells you what this color
represents. A color legend is alway needed and not only for labeling. As
an example, when it represents quantitative data it must also tell us to
what numbers the brightest and darkest colors map to. So in short,
please do your home work, provide a legend.
<br><br>
</li>

<li><b>Use color with extreme care and parsimony (above all do no harm!)</b>
- This is a sort of repetition of the first point but from a different
angle. As color is added to an interface it soon becomes noise. Learn
to use it with extreme care and parsimony. It is important for instance
to realize that if color is used to represent a data feature it is
extremely hard to use it for some other elements in the interface.
In the end it is extremely important what Tufte says: "<i>above all do no harm</i>".
<br><br>
</li>

<li><b>Learn to love grays and gray scales (grids!)</b>
- The best use one can find of color is to understand how powerful
colorless graphics are. In particular shades of grays are so useful in
data representation that I am surprised there are so few, if any,
specialists advocating for their use (Tufte mentions it by the way). Give a look around, pick the best
known and best crafted tools and you'll see that most of the times
their design is based on shades of gray. Gray is especially useful in
segmenting the visualization space and organizing it in spaces. The most
obvious example is the use of grids in charts and alternated rows in
tables (Stephen Few shows excellent examples in Show Me the Numbers) but
the same principle applies to thousands of other visualization components. So
in short: learn to love gray and gray scales, they can do wonders and rarely do harm.
<br><br>
</li>

<li><b>Don't represent unordered data with ordered colors</b>
- This is self-explanatory but I see it so often that I think it's
worth to add it. Also, I think not everybody would agree with me on
that. Some people use different intensities of the same "hue" to
represent categories. In my opinion this is poor use of color and opens
the door to false interpretations. Ordered colors are automatically
coded as "there's some ordered here" by our brain. Why do we want to
fool our mind when there are better solutions? Use distinguishable hues
and, if possible, make them of the same intensity. This will work best.
<br><br>
</li>

<li><b>Keep an eye to skewed distributions</b>
- Personally I always find this problem in my data visualizations and I
am surprised it is not discussed more. When the dimension you map to
color has a skewed distribution the result is incredibly poor: there
are few items represented by the highest intensity and all the others
flattened to the lower. In short, there's nothing really useful to see apart the fact that there are two or three items with
very high values. In this case one option is to adopt a not linear
mapping between data feature and color. Common solutions are
logarithmic or square root functions that alleviate the problem and
permit to reproduce a full progression of values.
</li>
</ul>

<p>
Here was my list and .... oh before I forget there is one last major one!
<p>

<ul>
<li><b>Don't use the (infamous) rainbow color scale</b> - Maybe someone would laugh at this advice as something too obvious but then, thanks to <a href="http://www.boyandi.net/">Ilya </a>I
discovered that there is nothing to laugh about. If you are not
convinced see this study on the uses of the rainbow color scale and
discover how many professionals and researchers still believe it has
some value:<a href="http://portal.acm.org/citation.cfm?id=1251614">Rainbow Color Map (Still) Considered Harmful</a></li>
</ul>


<p>
<h2>Conclusion</h2>

If you want to design great visualizations, learning to use color properly
and effectively cannot be avoided. The whole system is as weak as the
weakest link, therefore if color is used badly your design will suffer a
lot. Take your time, read as many of these references as you can and
you won't regret. They come from top class researchers and
designers, you can trust their words. Your visualizations will improve,
your clients will thank you, and the visual world will definitely and
finally be less polluted.]]>
   </content>
</entry>

<entry>
   <title>Extended Excentric Labeling</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/05/extended_excentric_labeling.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.66</id>
   
   <published>2009-05-15T09:33:43Z</published>
   <updated>2009-05-15T11:10:33Z</updated>
   
   <summary>A bit of self promotion here on Visuale today! It&apos;s my pleasure to introduce to you one recent work of ours: The Extended Excentric Labeling. It is an extension to the original interactive labeling technique called Excentric Labeling, which was...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="density.PNG" src="http://diuf.unifr.ch/people/bertinie/visuale/density.PNG" class="mt-image-left" style="margin: 0pt 20px 20px 0pt; float: left;" height="235" width="276" /></span><p>A bit of self promotion here on Visuale today! </p>

<p>It's my pleasure to introduce to you one recent work of ours: The <b>Extended Excentric Labeling</b>. It is an extension to the original interactive labeling technique called <a href="http://www.cs.umd.edu/hcil/excentric/">Excentric Labeling</a>, which was developed by <a href="http://www.lri.fr/%7Efekete/">Jean Daniel Fekete</a> and <a href="http://www.cs.umd.edu/hcil/members/cplaisant/">Catherine Plaisant</a> in 1998 at the University of Maryland. We extended it to solve some problems present in the original version and it will be presented and published at the next <a href="http://www.zib.de/eurovis09/">EuroVis 2009</a> conference in Berlin, next June.</p>

<p>Here is the <a href="http://diuf.unifr.ch/people/bertinie/visuale/eurovis09-bertini.pdf">EuroVis'09 Paper</a> (pdf) and a <a href="http://diuf.unifr.ch/people/bertinie/visuale/eurovis09-video-3.mp4">Video</a> (mp4) we produced to showcase the technique.</p>]]>
      <![CDATA[<p>Here is the paper abstract:</p><blockquote><p>"<i>The paper presents an extension to the Excentric Labeling, a labeling technique to dynamically show labels around a movable lens. Each labels refers to one object within the lens and is connected to it through a line. The original implementation has several known limitations and potential improvements that we address in this work, like: high density areas, uneven density distributions, and summary statistics. We describe the implemented extensions and present a think-aloud user study. The study shows that users can naturally understand and easily operate the majority of the implemented function but label scrolling, which requires additional research. From the study we also gained unanticipated requirements and interesting directions for further research.</i>"</p></blockquote><p><br /></p><p><font style="font-size: 1.5625em;">The Motivation</font><br /></p><p>The main motivation behind this technique and its relative study was a practical problem we encountered in the development of a new visualization. EL was just great and fit our need of understanding the content of screen regions. But the real problem was to have something flexible enough to deal with very sparse and very dense areas at the same time. In dense areas the original EL provided only a simple sampling mechanism that did not really help to interpret the data. Them while implementing a solution we discovered we could add some other useful and interesting features.<br /></p><p><br /></p><p><font style="font-size: 1.5625em;">The Techniques</font></p><p>With the EEL we introduced the following features:</p><ul><li> <b>Label scrolling:</b> a mechanism to scroll through labels when there are to many of them.<br /><br /></li><li><b>Focus area adjustment:</b> an automatic mechanism to let the size of the focus area automatically adapt to the underlying data density.<br /><br /></li><li><b>Summary statistics and filtering:</b> a series of glyphs and interactive tools to summarize the content under the focus area and to disambiguate it through filtering.<br /><br /></li><li><b>Inheritance of visual features:</b> a visual mapping mechanism to let the labels inherit the visual/data features from their connected items.<br /><br /></li><li><b>Layout and sorting:</b> algorithms and techniques to permit effective positioning of labels and of their links to the items.<br /></li></ul>For the details you can download the <a href="http://diuf.unifr.ch/people/bertinie/visuale/eurovis09-bertini.pdf">paper</a>.<br /><br /><br /><font style="font-size: 1.5625em;">The User Study</font><br />The user study we conducted is in my opinion a very interesting part of this work. We learned really a lot form it. For years <a href="http://www.useit.com/papers/guerrilla_hci.html">discount usability studies</a> have been promoted by people like <a href="http://www.useit.com/">Jakob Nielsen</a>, especially in industry, but they are not very much loved or popular in research contexts; especially in InfoVis where evaluation in general yet struggles a bit to find its way. But if used as explorative tools, we discovered, they can be great!<br /><br />Sure we received some critics, as our results cannot really tell a final word on whether the introduced features provide a <i>measurable </i>benefit. But what is often overlooked in academia is that the role of research is not only to provide answers but also to create new targeted and relevant questions. And discount or informal evaluation methods can be a great complement to our research toolbox.<br /><br />We gathered 8 people and observed them while performing some predefined tasks we deemed relevant. The result was useful not only to perfect our work but also and foremost to generate some questions that we would not ask to ourselves otherwise. Observing your users using your visualization is alway an eye opener. There is a big gap between what you expect and what they do in reality. And within this gap there is a lot to learn!<br /><br /><font style="font-size: 1.5625em;">Excentric or Eccentric?</font><br />Every spell checker I use highlights the word "excentric" as non existing and suggests "eccentric" instead. I did a little research and also well known dictionaries like the Merriam-Webster do not know it. By typing in Google "excentric definition" I can get some results but then they basically say it is a synonym of eccentric. Maybe someone among you have a clue on it? <br /><br />By the way, maybe the next time I'll meet Jean Daniel or Catherine I will ask why they used excentric and not eccentric. My guess is that the cause is that the French word for it is "excentrique"? ;-). In doubt and for consistency reasons we decided to keep it as it is.<br />]]>
   </content>
</entry>

<entry>
   <title>Map of the remotest places on Earth</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/04/map_of_the_remotest_places_on.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.64</id>
   
   <published>2009-04-21T20:14:04Z</published>
   <updated>2009-04-21T21:21:20Z</updated>
   
   <summary>The New Scientist has just published an article on an amazing recent study conducted by the European Commission&apos;s Joint Research Centre on the remoteness of places in the world. The remoteness is calculated taking into account how long it takes...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
      <category term="blog" scheme="http://www.sixapart.com/ns/types#category" />
   
      <category term="maps" scheme="http://www.sixapart.com/ns/types#category" />
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<p>The New Scientist has just published an <a href="http://www.newscientist.com/article/mg20227041.500-wheres-the-remotest-place-on-earth.html">article</a> on an amazing recent study conducted by the European Commission's <a href="http://ec.europa.eu/dgs/jrc/index.cfm">Joint Research Centre</a> on the remoteness of places in the world. The remoteness is calculated taking into account how long it takes to travel by land or water to the nearest place with a least a population of 50.000 inhabitants. Here is the result.</p>

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="access-map_new.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/access-map_new.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="500" height="244" /></span>]]>
      <![CDATA[<p>It turns out that Tibet is <b>the most inaccessible place in the world</b>
and specifically the point on coordinates 34.7°N, 85.7°E. It takes a
three-week trip to the cities of Lhasa or Korla - one day by car and
the remaining 20 on foot. So if you are looking for a really peaceful
place to restore your mind and take a break from civilization well,
here you have it.</p>

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="tibet.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/tibet.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="500" height="375" /></span>

<p>There are also a number of other related and fascinating maps out of this study. Here is how the world is covered by: </p>

<p><big>Roads</big><br />
</p><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="roads.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/roads.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="500" height="244" /></span>

<p><font style="font-size: 1.25em;">Railways</font><br /></p><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="railways.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/railways.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="500" height="244" /></span><div><font style="font-size: 1.25em;">Navigable Rivers<br /></font><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="major_rivers.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/major_rivers.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="500" height="244" /></span><br /></div>The
model takes into account all these elements to estimate the distance to
the nearest center. More specifically this is how the researchers
estimated the distance according to the kind of surface.<br /><br /><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="roads_2.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/roads_2.jpg" class="mt-image-center" style="margin: 0pt auto 20px; text-align: center; display: block;" width="290" height="375" /></span><br />The map is simple but well designed. If you give a look to the <a href="http://www.newscientist.com/data/images/ns/cms/mg20227041.500/mg20227041.500-1_1000.jpg">bigger version</a>
you'll notice how color is mapped on a nonlinear scale. Brighter colors
represent hour intervals whereas darker ones represent day intervals.
The darkest points represent 5 days.<br /><br />Europe and Japan somewhat
scare me in terms of how easy is to go from one point to another.
United States, regardless their development, contain a relatively dense
number of quite places. Tibet and Greenland looks like the best places
if you want to stay remote.<br /><br />As Alan Belward, who leads the
project, says: the interesting part of the project will be in comparing
this same map with another one computed in the future. What scares me
is the perspective of having no more remote places in the world. Is
this far to come?]]>
   </content>
</entry>

<entry>
   <title>The multidimensional detective and his timeless advice</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/04/_its_been_almost_by.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.63</id>
   
   <published>2009-04-13T08:55:00Z</published>
   <updated>2009-04-14T07:14:22Z</updated>
   
   <summary> It&apos;s been almost by chance that I stumbled upon the old Parallel Coordinates InfoVis&apos;97 paper &quot;Multidimensional Detective&quot; after my last post. And it&apos;s crazy how some information seems to reach you when you start following a new line of...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="multidim-detective.gif" src="http://diuf.unifr.ch/people/bertinie/visuale/multidim-detective.gif" class="mt-image-left" style="margin: 0pt 20px 20px 0pt; float: left;" height="171" width="300" /></span>

<p>It's been almost by chance that I stumbled upon the old Parallel Coordinates InfoVis'97 paper "<a href="http://www.cs.helsinki.fi/u/salaakso/visualisointi/lahteet/Parallel-Inselberg99.pdf">Multidimensional Detective</a>" after my last post. And it's crazy how some information seems to reach you when you start following a new line of thought.</p>

<p>This post and my last one are very much in line with the idea that we do need to invest a lot more on the practice of visualization and not only on design.</p>

<p>Parallel Coordinates have been invented by <a href="http://www.cs.tau.ac.il/%7Eaiisreal/">Alfred Inselberg</a> who is also the author of this paper. What is really impressive in retrospective, after more than ten years, is the need to communicate to people not only what Parallel Coordinates is, but more, and foremost probably, how they can be used.</p>

<p>In this paper Inselberg provides some great guidelines to use when performing visual data analysis. Here I provide some personal comments about each guideline.</p>
]]>
      <![CDATA[<p>Here is Alfred's list:</p>

<p><strong>Guideline 1 - Do not let the picture intimidate you.</strong> This seems to be the trademark of infovis. And indeed it perfectly pairs up with the first part of the famous InfoVis Mantra "overview first". If we accept the idea that infovis is effective when we present an overview first (IMHO this is somewhat questionable but well ... this is not the right place to discuss it), it's evident that we have to teach our analysts to not be intimidated by it. I've seen and designed countless of visualization which are intimidating at first. And often I receive comments like: "can you make it simpler"? Sure, but maybe less informative too? If only we can let the users have the chance to try first and see what's the result!</p>

<p><strong>Guideline 2 - Understand the objectives and use them to obtain visual cues.</strong> Oh this is my favorite one! Too often I hear that infovis is for generic data exploration, like if one could approach data without no idea of the ultimate goal. There's nothing in reality like looking at the data for the sake of it. Inselberg acknowledges it and adds a worthy advice: make the statement clear about what you want to obtain form visualization and let the goal guide your analysis. Our perceptual system is designed to "tune up" our senses when we focus our attention on something (see the latest <a href="http://www.amazon.com/Visual-Thinking-Kaufmann-Interactive-Technologies/dp/0123708966/ref=pd_bxgy_b_img_b">Colin Ware's book</a> for details). So, let's exploit this feature and focus on what we care about.</p>

<p><strong>Guideline 3 - Carefully scrutinize the picture.</strong> As a consequence to the previous step, it is necessary to look carefully to the picture and find visual cues that can help move at least little steps towards the prefixed goal. Visual patterns that tell us something are always there, it's only a matter of looking for them carefully. They usually trigger new questions and hypotheses and help us formulate new actions. We look into other segments of the data or manipulate the visualization in a way that helps us find clarifications. That is, a knowledge building loop.</p>

<p><strong>Guideline 4 - Test the assumptions and the "I'm really sure of".</strong> Inselberg reminds us that what we get out of a visualization is not only what we see but also what we we already have in our mind, that is, our subjective world, which comprises: background knowledge, assumptions, beliefs, etc. An effective analyst must take this into account and understand how this can affect the analysis. Often the visualization present patterns that generate some skepticism but then it's exactly form these strange data segments that important discoveries stem. On a side note, it's surprising to see how much effort we have put into the analysis of objective perceptual processes in visualization and how few on higher level cognitive processes that involve subjective evaluation.</p>

<p><strong>Guideline 5 - You can't be unlucky all the time.</strong>If you are not intimidated, you understand the objective, you scrutinize the picture and test the assumptions, well ... you can't be unlucky all the time! If I understand well, what Inselberg seems to tell us here is that even if a good proportion of the patterns extracted from the visualization can result in little advancement towards the goal or not useful discoveries, you can't be unlucky all the time. In the end there should be something that helps you progress towards your desired direction. This last one, is a very positive and encouraging advice, which I understand because visualization tools often lead to the discovery of trivial or useless information, but then by striving to find something, little gems often sort out of it.</p>

<p>In summary this quite old paper reminds me how hard is to be an analytical visual detective and also how far we are to help our customers focus on fruitful paths. It would be nice to see novel contributions in this direction, because this is in my opinion one of the greatest limits to infovis adoption. In the meantime this small advice, as simple as it seems, looks to me as the most solid I've ever seen.</p>
]]>
   </content>
</entry>

<entry>
   <title>Books for practitioners, not designers!</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/03/the_lack_of_cookbooks.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.62</id>
   
   <published>2009-03-24T10:23:00Z</published>
   <updated>2009-04-14T07:14:49Z</updated>
   
   <summary> I&apos;ve recently come across this incredibly good book: &quot;Data Mining for Business Intelligence&quot;. I was at first a bit skeptical, my academic background naturally led me to wrongly assume a book on applied business intelligence had nothing more to...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
      <category term="books" scheme="http://www.sixapart.com/ns/types#category" />
   
      <category term="critique" scheme="http://www.sixapart.com/ns/types#category" />
   
      <category term="ideas &amp; thoughts" scheme="http://www.sixapart.com/ns/types#category" />
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="datamining-bi.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/datamining-bi.jpg" class="mt-image-left" style="margin: 0pt 20px 20px 0pt; float: left;" height="160" width="112" /></span>

<p>I've recently come across this incredibly good book: "<a href="http://www.amazon.com/Data-Mining-Business-Intelligence-Applications/dp/0470084855">Data Mining for Business Intelligence</a>". I was at first a bit skeptical, my academic background naturally led me to wrongly assume a book on applied business intelligence had nothing more to give than the two other respected books I have on the shelf. Wrong wrong wrong!</p>

<p>As I started reading, chapter after chapter, I felt refreshed by a new stream of ideas, like if all those notions I had accumulated year after year could be seen from a new and fruitful perspective. The book is full of applied examples, compact, with a direct and simple language and, above all, made me finally understand what data mining is and what is it for in the real world. It is the first time I feel I can walk in the same pair of shoes of those guy in the trenches who desperately need strong technology to resolve *their* problems.</p>

<p>So, why do I blog this?</p>]]>
      <![CDATA[<p>Am I not supposed to write about infovis and beyond here? Sure, the point is that after the excitement of having discovered this little gem I realize <strong>how far we are in visualization to have a book like this</strong>. Don't get me wrong, we have plenty of incredibly good books on visualization out there (<a href="http://www.amazon.com/Show-Me-Numbers-Designing-Enlighten/dp/0970601999">Few's</a> and <a href="http://www.amazon.com/Information-Visualization-Second-Interactive-Technologies/dp/1558608192/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1237890061&amp;sr=1-1">Ware's</a> among my favourites) but they are all designed to help designers not practitioners. Why?</p>

<p>I don't have a clear answer but I think it is partly due to the <strong>lack of maturity of our field</strong>. I think we have now a reasonable set of notions, design processes, and examples to design decent tools but we have not matured enough to direct our attention to our users as practitioners. Another reason is the <strong>lack of synthesis</strong>. Take data mining, despite the numerous application areas, techniques, algorithms, it is possible to synthesize data mining methods and approaches in a consistent and simple way. Simplifying to the extreme, data mining is the work of predicting the value of one target variable (usually a class) according to the value of some other variables. Plus clustering and rules. <strong>Can we reach such a synthesis in information visualization?</strong> Is ours an intrinsic limit or is it just due to the way we advance our discipline?</p>

<p>I don't know, in any case I suggest to read this very nice little book. It can be really useful to apply some automatic data analysis to your visualization.</p>

<p>One last thought: oooh if only half of the books out there could be so short and informative at the same time!</p>]]>
   </content>
</entry>

<entry>
   <title>Charting the Index of Economic Freedom: a call to action</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/01/carting_the_index_of_economic.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.61</id>
   
   <published>2009-01-20T09:58:04Z</published>
   <updated>2009-01-20T14:00:37Z</updated>
   
   <summary>The Heritage Foundation together with the Wall Street Journal has recently published the last report on the Index of Economic Freedom. The index captures in a series of factors like: &quot;freedoms of movement for labor, capital, and goods, and an...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[The <a href="http://www.heritage.org/">Heritage Foundation</a> together with the Wall Street Journal has recently published the last report on the <a href="http://www.heritage.org/Index/Default.aspx">Index of Economic Freedom</a>. The index captures in a series of factors like: "<em>freedoms of movement for labor, capital, and goods, and an absolute absence of coercion or constraint of economic liberty, etc.</em>" the degree of economic freedom of a given country.

Apart for the intrinsic interest such data has, in that it measures freedom, the case is very challenging in terms of graphical representation. Here is just an example of what I've been able to do so far, sincerely with little success.

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://diuf.unifr.ch/people/bertinie/visuale/index-economic-freedom.html" onclick="window.open('http://diuf.unifr.ch/people/bertinie/visuale/index-economic-freedom.html','popup','width=702,height=474,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">

<form mt:asset-id="26" class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="index-economic-freedom_s.png" src="http://diuf.unifr.ch/people/bertinie/visuale/index-economic-freedom_s.png" width="448" height="302" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span>

</a></form>

<h3>Data</h3>
The dataset (which can be downloaded directly from the <a href="http://www.heritage.org/Index/Download.aspx">website</a>) contains for each country and year, form 1995 to 2009, the overall score and the individual factors (e.g., business freedom, trade freedom, fiscal freedom, etc.) that compose the score. Technically speaking it is, in fact, a multivariate time series, a quite tough object to handle indeed.

<h3>Chart</h3>
In my proposed solution I focus on the representation of the states that experienced the highest positive or negative changes in the whole time range. Beyond the obvious reading of best and worse countries in the overall score, which can be easily obtained from the website, I think representing measures of change is a lot more interesting.

I've created the chart with <a href="http://www.bonavistasystems.com/">MicroCharts</a> a wonderful little Excel add-on. Each sparkline represents the time variation of the overall score, so that it is possible to see ups and downs in the considered time span. Since the variation is represented in terms of the individual maximum and minimum values, the timelines cannot be compared in terms of their absolute values. But this is ok as long as the main goal is to covey messages like: "hey this country has significantly and steadily improved its index over the course of the years!". The absolute values can be read on the right side where min and max are color-coded the same way the small dots are coded in the sparkline. The size of the dot represents the value and the bar chart the amount of variation.

<h3>Trends</h3>
I am by no means satisfied with my design, but I think it sheds some interesting light on the data. We can see that Armenia had an impressive improvement from 42.2 to 70.6. We can also see that many Eastern Europe countries like Moldova, Bosnia and Herzegovina, Lithuania, and Romania, had a great improvement as well, as highlighted in the report. Sad examples are Argentina, which experienced a sudden decrease, probably concomitant with the country economic breakdown, and Zimbawe which went from the already low 48.7 to 22.7.

<h3>A call to action!</h3>
The real challenge for these data is to represent the single factors together with the overall score and to represent the whole dataset, which I've not done. These factors can help explain for any major variation, if it is due to a specific sector or an overall change. I'm also convinced the same data can be seen under a myriad of other lenses different to mine. It is for this reason that I propose a "<strong>call to action</strong>", inviting you to create a chart of this intriguing dataset.

In order to facilitate your task I have attached here a <a href="http://diuf.unifr.ch/people/bertinie/visuale/EconomicFreedomIndex_OverallTime.xls">processed version of the file</a> that contains the overall score organized by time in a single Excel sheet (the original data has one sheet for each year). If you go into some preprocessing too pay attention to some data inconsistencies the original file may have. Especially, note that Somalia in some years is removed from the dataset.

Good Luck!]]>
      
   </content>
</entry>

<entry>
   <title>Book Review: The Numerati</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2009/01/book_review_numerati.html" />
   <id>tag:diuf.unifr.ch,2009:/people/bertinie/visuale//1.60</id>
   
   <published>2009-01-14T11:29:00Z</published>
   <updated>2009-01-14T11:32:23Z</updated>
   
   <summary> The Numerati are all the statisticians, computer scientists and analysts around the world who are analyzing tons of data to understand &quot;us&quot;. This is the main topic of this wonderful book written by Stephen Baker, a Business Week journalist....</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.amazon.com/Numerati-Stephen-Baker/dp/0618784608"><img alt="thenumerati.jpg" src="http://diuf.unifr.ch/people/bertinie/visuale/thenumerati.jpg" class="mt-image-left" style="margin: 0pt 20px 20px 0pt; float: left;" width="106" height="160" /></a></span>

The Numerati are all the statisticians, computer scientists and analysts around the world who are analyzing tons of data to understand "us". This is the main topic of this wonderful book written by <a href="http://www.businessweek.com/bios/Stephen_Baker.htm">Stephen Baker</a>, a Business Week journalist.

<p>The book is an easy read, written with a simple style that makes it accessible to everybody, and yet incredibly intriguing and informative for the knowledgeable reader.</p>

<p>Stephen interviewed tens of researchers and entrepreneurs around the US and put into focus one of the major trends of our days: not only an incredible amount of data has been and is collected everyday around the world but we are also finally starting to "use" these data to let us understand relevant aspect of the human being. Health, Finance, Marketing, Policy, are only few examples of areas where data is collected and deeply analyzed everyday.</p>

<h3>Content</h3>

<p>The book is organized around 7 chapters: Worker, Shopper, Voter, Blogger, Terrorist, Patient, Lover, in which people is modeled under the lens of a specific stereotype.<br /></p><p>In <b>Worker </b>we are modeled according to our skills and the way we work. We meet people like <a href="http://en.wikipedia.org/wiki/Samer_Takriti">Samer Takriti</a> at IBM who is modeling about 300.000 IBM workers to understand the relationship between their skills and their performance and how to better allocate these skills in the company the same way we used to do with any other physical company asset. <br /></p><p>In <b>Shopper </b>we are modeled according to the things we buy. Researchers are analyzing the millions of transactions we make everyday in stores to understand what "type" of buyers we are. <a href="http://www.accenture.com/Global/Services/Accenture_Technology_Labs/R_and_I/RayidGhani.htm">Raiyd Ghani</a>, for instance, analyzes with his group at <a href="http://www.accenture.com/Global/Services/Accenture_Technology_Labs/default.htm">Accenture Technology Labs</a> grocery store transactions to provide personalized suggestions to shoppers through the use of carts equipped with personal assistants.<br /></p><p>In <b>Voter </b>we are modeled according to ... to what? This is an impressive chapter because it demonstrates that we can be modeled in a given domain indirectly, using data that apparently has no connection with the subject matter. This is what <a href="http://www.spotlightanalysis.com/whoweare.htm">Josh Gotbaum</a> with his political firm <a href="http://www.spotlightanalysis.com/">Spotlight Analysis</a> does. They provide detailed indications on <i>swing voters</i> based on data taken from large data companies like <a href="http://www.choicepoint.com/">ChoicePoint</a> and and <a href="http://www.acxiom.com/">Acxiom</a>, who collect an incledible amount of data about us on almost every aspect of our life (scary?! :-)).<br /></p><p>In <b>Blogger </b>we are modeled according to our opinion. Yes, our opinion. There are companies like <a href="http://www.umbrialistens.com/">Umbria Communications</a> which analyzes the blogosphere to understand the opinion trends of millions of bloggers on whatever interests a given company. If I want to track how people react to a new product put on the market Umbria can tell.<br /></p><p>In <b>Terrorist </b>we are modeled as potential terrorists or thieves. Here we meet people like <a href="http://www.research.ibm.com/theworldin2050/bios-Jonas.shtml">Jeff Jonas</a>, now at IBM, who helped casinos in Las Vegas sift through millions of internal records to single out suspect customers. And the same technology is used by <a href="http://www.iqt.org/">In-Q-Tel</a>, the venture capital arm of the CIA which invested in this technology, to cope with national security and counter terrorism. </p><p>In <b>Patient </b>we are modeled according to our body signals and medical records. This is the chapter I most loved, not only for its humanitarian applications, but also for the cleverness of some solutions. <a href="http://www.intel.com/pressroom/kits/bios/edishman.htm">Eric Dishman</a> launched the home health division at Intel where they design smart sensors like the "magic carpet" that monitors weight an movements to monitor the health of patients and where they try to predict the onset of diseases like Parkinson's and Alzheimer's by detecting suspect variations in the stream of data.<br /></p><p>Finally, in <b>Lover </b>we are modeled according to our profile to find matches among us as potential lovers. We meet <a href="http://www.helenfisher.com/">Helen Fisher</a>, a Rutger's University anthropologist, who devised an innovative method to find matches between people which is the basis of the <a href="http://chemistry.com/">Chemistry.com</a> dating website. Her method goes well beyond simple matching of demographic data, it is based on her theory that we can be split in four groups where a specific hormone is predominant and that the best matches comes from complementary hormones.<br /></p><br />

<h3>Reflections</h3>

<p>The first issue the book raises is obviously privacy. I really liked the approach of Stephen Baker, equally distant from the excitement for the new opportunities brought by innovation and the potential for a super-controlled society where drawing a full profile of ourselves is becoming worryingly easy. Any other technological shift in history came however with the promise of new advancement in human being together with novel problems (think about cars and pollution). Stephen asks the right questions to some of the researchers he met. The most interesting in terms of privacy is the one with Jeff Jonas who is "<i>vehemently opposed to the use of statistical data mining to predict the next terrorist attack</i>" because of the high risk of intrusion and false alarms. And yet he believes that this technology can both protect our freedom and our privacy at the same time. I think this is one of the biggest challenges of our time, to <b>find the right balance between the opportunities for increased freedom and security and the risks of intrusion, control, and faulty conclusions in the analysis of our own data</b>.</p><p>From a more scientific and technological point of view what strikes me is the relevance prediction has in all the application areas described in the book. In traditional data analysis, especially for those with a visualization background, the focus is on "understanding" what is in the data to build a mental model out of it and in "discovering" some special gems out of chaos. Yet, however, real world applications are more concerned with elaborating actionable solutions to run and test, and I have the impression that "prediction" lends itself better to this goal. Think about it, through the book's examples, in workers the company wants to predict performance to put people in the right place, in shoppers the grocery store wants to predict what product can be sold to one specific customer to provide timely suggestions, in voter a political party wants to predict which population segment should be addresses with a targeted message to increase the chances they hit a group of swing voters, and so on. How do we, visual information designers and analysts, cope with this fact? <b>Are we able to provide with our tools the same level of actionable knowledge or are we condemned to just describe things and hope that this information will be useful in some way?</b><br /></p><p><br /></p>

<h3>Implications for Visualization</h3>

<p>What is the role of visualization in the world of the Numerati. I think it is huge!!!</p><p>First of all all the technologies used by the Numerati are to some extent prone to errors and they are always the results of continued refinement of the underlying model. <b>Visualization can play a significant role in helping the modelers understand and test their models</b> and explore their implication as they are applied to new data. Without such a level of interaction the risk is to build monstrous black-boxes that spit oracles we all have to follow without really knowing why.</p><p>Another area where I see <b>a large role of visualization is when mining is used in monitoring environments</b>, where the timely detection and comprehension of the situation (more technically known as the situational awareness problem) is important. We have a long and respected tradition of research for knowing what works best in terms of visual representation when visual saliency, detection and contextual information are at stake. Well designed visualizations that permit to get the most out of a screen in a matter of seconds are of paramount importance here, from the need to analyze terrorist attacks to the doctor monitoring a patient.</p><p>A third potential I see for visualization is the <b>need for personal data visualization</b>. As these technologies develops, and the results of data analysis become more pervasive, I expect to see and increase in the need of managing personal data and the results of these analyzes by end-users. And how are we going to provide this information to the average person? Visualization can play a big role here and and again it would need to reinvent itself a bit. In this domain extremely simple and useful visualizations will be needed and some of them will be provided on non-standard devices like TVs, cell phones, public displays. We need flexible and simple solutions to provide to the large public.<br /></p><p>So, in summary, the explosion of data analysis is good news for us! <b>We have plenty of novel challenges to address</b>. A somewhat silent mind shift is already going on underway ... I expect to see in the future an ever tighter integration of automatic mining technologies and visualization, as the recent Visual Analytics trend demonstrates after all.<br /></p>]]>
      
   </content>
</entry>

<entry>
   <title>DabbleDB &amp; Magic/Replace: web-based data processing the easy way</title>
   <link rel="alternate" type="text/html" href="http://diuf.unifr.ch/people/bertinie/visuale/2008/12/dabbledb_magicreplace.html" />
   <id>tag:diuf.unifr.ch,2008:/people/bertinie/visuale//1.59</id>
   
   <published>2008-12-12T13:31:07Z</published>
   <updated>2008-12-16T11:47:54Z</updated>
   
   <summary>I am totally amazed to see these two tools appearing on the web. DabbleDB and Magic/Replace permit to organize you data in a web-based spreadsheet and then process it in multiple ways with few intuitive well designed functions. I think...</summary>
   <author>
      <name>Enrico Bertini</name>
      
   </author>
   
   
   <content type="html" xml:lang="en" xml:base="http://diuf.unifr.ch/people/bertinie/visuale/">
      <![CDATA[I am totally amazed to see these two tools appearing on the web. <a href="http://dabbledb.com/">DabbleDB </a>and <a href="http://cleanupdata.com/">Magic/Replace</a> permit to organize you data in a web-based spreadsheet and then process it in multiple ways with few intuitive well designed functions.

I think that everyone who have had a minimal experience with visualizing data knows very well how data (pre-)processing an reformatting is necessary and painful. Sincerely, I cannot recall any single project in which I didn't need some data manipulation.

<h3>DabbleDB</h3>
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="screenshot-view.png" src="http://diuf.unifr.ch/people/bertinie/visuale/screenshot-view.png" width="428" height="246" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span>
<a href="http://dabbledb.com/">DabbleDB</a> lets you organize your data in multiple tables (called categories) and extract views out of it. A view can be a table, an aggregated table or a chart. There are few basic types of charts but they do their job well: neat and clean and super easy to create.

I know, it doesn't look that great so far but when you use it you realize hundreds of little tricks and smart software guesses that makes it wonderful. As an example, it automatically recognizes the column type (date, number, location, etc.) and changes the way data can be aggregated according to the column content. In a date column you can aggregate by day, month, year, etc. In a location column by exact value, region, country, and continent. And aggregation is a matter of few mouse clicks, no hassle, no formulas, no complex "group by" queries. Click on the column and just choose among a series of grouping options. 

Another great feature is how data is imported in the system: open it in Excel, copy the table content, past it in a form field, press the submit button and ... puff ... imported. And with very few mistakes as far as I have seen. Again, nothing really revolutionary but it saves lots of time and stress! It can also import data directly form html tables, just paste the url of the page and it imports the data.

Filtering is also very powerful. On the left side you can type a keyword and the view is automatically updated. If you want to do something more sophisticated, you can add a column specific filter and, according to the column type, enable special purpose filtering. In a date field you can type for instance "> 1980" and filter out all the records with a year earlier than 1980.

There are hundreds of other features I did not mention, the best is to try it out and see. It took me very few minutes to register and start to play with it. Alternatively, you can start from <a href="http://dabbledb.com/demo/">this video demonstration</a>, which is very well done.


<h3>Magic/Replace</h3>
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="magic-replace.png" src="http://diuf.unifr.ch/people/bertinie/visuale/magic-replace.png" width="500" height="179" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></span>
<a href="http://cleanupdata.com/">Magic/Replace</a> is a companion product that permits to reformat your data on the fly by cut&paste operations. You cut a piece of data from one field, put it in another field, plus some additional characters, and it updates the full table on the fly. You need to reformat a date? Concatenate Name and Surname in one single field? Create a new column with a piece of data coming from another field? Super easy: just cut and paste the elements and it does the rest for you.

It's a little piece of simple "intelligence" that works really great and it's very useful. The same thing done in Excel or any other product would be a pain. Especially if you are not an expert.

The interface is also very well done. One line tagged "from" and one tagged "to", where you can test your modifications, and a preview button that lets you preview the result in a section of the table. It takes few seconds to understand how it works.


<h3>Few Reflections</h3>

<ul>
<li><strong>Data Management & InfoVis</strong> - It's really surprising to see how data management has been neglected in InfoVis, and yet it is so central in our daily activity, both as designers as well as users. I strongly believe the lack of simple data management tools in software products is one of the major barriers to the adoption of InfoVis. <a href="http://www.tableausoftware.com/">Tableau</a> is another great example: data is loaded, managed, and manipulated in few intuitive drag and drop operations. It might be frustrating to accept it but a major quality factor of the systems we design does not reside in the fancy visual representations we make but rather in little details that make a difference like these. 
 
<li><strong>Data Management & Web 2.0</strong> - These two tools are another example of the power of Web 2.0. I have already argued that we can probably already speak of <a href="http://diuf.unifr.ch/people/bertinie/visuale/2008/05/vis_20_1.html">Vis2.0</a>, a series of visualization tools that work smoothly on the web. Web 2.0 data management tools are another piece added to the puzzle. And it's amazing because, as any other 2.0 tools, it is not only a matter of reproducing the same desktop tool on the web, but also to enable a whole new spectrum of possibilities. DabbleDB, for instance, lets people publish and update the DB on the web in a matter of second, thus enabling collaboration in a way that would be impossible to do with Excel or any other desktop product.

<li><strong>Simple Intelligence in Software</strong> - All the simple tricks and smart guesses found in these two products let me think of the idea of embedded simple pieces of intelligence in software products. It's crazy if you think about it: we have been researching super complex AI algorithms for years, to do things of little use for anybody, and now there are some little intelligent and focused tricks that work really great for some people. It also reminds me of the concept of "<a href="http://www.comp.lancs.ac.uk/~dixa/topics/appropriate-intelligence/">Appropriate Intelligence</a>" advocated by <a href="http://www.comp.lancs.ac.uk/~dixa/">Alan Dix</a>: "<em>Designing interactions for appropriate intelligence is based on two principles: 1) it should do good things when it works, 2) it shouldn't do bad things when it doesn't</em>". It is perfectly applied here.
</ul>

I think we will continue to see a proliferation of similar products on the web ... and I cannot be more excited about it! Little by little we have a whole infrastructure that is taking shape on the web and, I am sure, this will help InfoVis and related disciplines thrive in ways we could not even imagine before.]]>
      
   </content>
</entry>

</feed>
