From discourse to data…and return ?

What is a discourse? It’s a way to arrange facts, ideas, expression of feelings to give them order and consistency, to make them resonate with each other, or to inspire in those it is addressing a particular purpose. Different societies have developed during their history, different types of discourse – religious, political, philosophical, scientific, commercial, sentimental, each of them having its own rules, its own history. Do we get to the end of this history, or more generally to the end of the discourse itself? This is what one might think at first sight by measuring the impact that the digital revolution seems to impose this particular type of intellectual production. Roger Chartier announced a few years ago a “disturbance of the order of discourse” by the Internet and new media mixing everything: books, journals, personal letters, diaries and newspapers in the digital large funnel. At the “Web 3″ era, should one speak of a disappearance of the order of discourse? It seems so, as this new creed shows a clear focus and fascination for what lies in the speech, which is its foundation and primary material, without order or construction, namely the ” raw data “[1].

This radical change of perspective can be seen at work in several areas, starting at a political level with the open data movement. What is it? Cast as often form the United States, the open data movement aims at encouraging or compelling governments and their agencies to make public the data they collect or produce in all areas of their work: census data, of course, but also those related to crime, health and economic activity [2]. The purpose is twofold: on a political ground, it gives a better transparency and thus a better control for citizens on the activities of their governments and the effectiveness of public policies they implement [3]. Related to the economic context of the “knowledge economy”, it must allow civil society to benefit from a real source of information, enabling it to develop new services that will benefit the population [4]. In the U.S., the Obama administration wanted to set an example with the initiative data.gov where you can access and download many databases in all areas. Britain launched its own program [5], soon followed by many countries. In France [6], the central gouvernement is not as much active in this area as the local communities. Several major French cities including Paris [7] have begun to open their databases to the public.

Launched as a voluntary movement from government, open data, however, tends to go beyond their control. The sensational revelations came from Wikileaks which shown that the transparency of government could be exercised at several levels. More profoundly, Wikileaks is acting as the developer of the open data movement: it reveals a general trend resulting in disqualification of political discourse essentially considered as a mystification. The movement is by far more radical than what have undertaken critical theory. The point is no more to deconstruct the discourse to reveal it as deceptive, but to put it aside in order to focus on data alone, open to all, allowing everyone to build his own speech on this basis [8].

One might object that Wikileaks is not open to the data itself because what is made available through it are documents. The objection is probably valid if one thinks in terms of resource. Yet it would miss the essence of what makes the value of Wikileaks, whose modus operandi is not to make public some particularly embarrassing documents, but to give to the public a mass of documents which, because of the amount of information involved, behaves like a database in which everyone is called to dig according to their own interests [9]. This point clearly explains why collaboration was needed between Wikileaks and various newspapers and magazines (New York Times, Le Monde, El Pais) : the organization gives access to information and the press gives it editorial value.

In science, the same movement is on the agenda. There is a general movement prompting researchers to provide open access to the data on which they build their argumentation in scientific publications. If this idea seems obvious at first sight and in theory, its implementation raises a host of practical problems in many disciplines. The open data movement is going further and reverses the perspective: now, data tend to produce their own theory. It’s quite the meaning of the initiative Culturomics [10] that provides researchers and the public a basic lexicometric interface on the entire corpus of books scanned in Google books. The article published in Science by several researchers [11] about this new service is quite significant: it bears no strong theory, any proposal or specific assumptions: It shows several types of interactions with a database that delivers statistical answers to a set of spontaneous questions. As in the case of open data in the political and journalistic areas, for the scientific one, the equivalent movement gives a secondary importance to discourse and theory and places mere data at the center of the picture.

It is certainly striking to see how this triumph of data and the dissolution of the discourse that can be seen in these initiatives, comes at the same time than its exact opposite: the triumph of the discourse in its narrative mode. The story telling phenomenon that touches all sectors too, is the exact opposite of open data. Well documented by the famous book written by Christian Salmon [12], the technique of story telling aims at causing the adhesion of one for whom it is deployed and at the same time anesthetizing his critical sense. Seduction rather than accuracy is here set up as a cardinal virtue.

Between the open data on one side and the story telling on the other, it is ultimately a certain rhetorical tradition that seems doomed to extinction by quartering: built on a strong collusion and an almost inseparable mixture of factual elements (the inventio) on one hand, and the layout that organize them (the dispositio) on the other, the rhetoric was a way to build a shared representation, socialized of the world. Its weakening means a radical disjunction between the “reality” on one side, placed on the side of its independent factual representation or perception, and social relations on the other firmly placed under the sign of seductive and hypnotic artifice. This trend can be seen as congruent with the weakening of institutions and all forms of mediation, a phenomenon quite characteristic of our time.

It is also possible that the dissolution is  only a time of transition and the fall of a rhetorical order linked to an outdated mode of organization of society and not the disappearance of every possible rhetoric. Three examples can be cited to illustrate how a “renaissance of rhetoric” may be trying to break, pushing with the reconstruction of a new discourse built on new foundations.

The first example is the work that is emerging on the concept of data visualization. While reference books are starting to be published on the subject, that great blogs are the latest research in this area [13], or as digital artists to explore the many possibilities [14], many observers are beginning to evoke the concept of “data journalism” whose job is largely to develop both modes of representation, data visualization, as well as interfaces to access data that are satisfactory to the public [15]. From one point of view, all these works that are now together under the banner of “data curation” can be seen as building a new kind of rhetoric.
Secondly, the issue of data representation in science shows similar recent developments. The triumph of the data at the expense of interpretive discourse could be similar in the field of humanities and social sciences to the domination of social “naturalist”, “objectifying” science over the humanities gathered under the banner of hermeneutics. It is precisely from a consideration of the representation of data that the contribution of humanities and their singular concern of the subjective representation of the actors is reintroduced in the new context. This was brilliantly demonstrated by Johanna Drucker in a conference organized by the MIT [16] when she showed how, in history, one can design modes of representation of data that do justice to the subjective perception that players could have of the situation described. The triumph of the data may be accompanied by a somewhat paradoxical revival of semiotics, and finally, hermeneutics, via the issue of representation which is far from obvious. From another point of view, the sociologist Dominique Cardon poses at the open data movement as a whole a similar question [17]: the availability of data does not resolve the question of the nature of the data that will be exploited, and ultimately the focus that will be favored by the analysis: the individual or aggregate? Drucker and Cardon said almost the same thing: data depends on the perspective for which they are constructed, aggregated, represented.

Finally, we can conclude this overview by reporting a new service, launched in recent days in public beta. Significantly called Storify [18], this online service allows any user to gather and organize various documents from the Internet and social networking platforms in order to build  a narrative “story”. The reader is a new rhapsode, he is encouraged to “sew” the data together to produce a text frame from the myriad of available documents. The entire Web in its various components, is mobilized as a database, but it is in the prospect of widespread production of discourse open to all and carried on under conditions very different from what was known far. The mere fact that such an offer exists is perhaps the premonitory sign of a return to discourse after a short period of reduction to the data. Time will tell whether Storify meets its purpose.

Anyway, the open data movement cannot escape the question faced by all innovations: thequestion of its adoption by the majority [19]. In other words, the question if data are fed through the activity of symbolic production users,  if they make sense to them and help them articulate a discursive activity  without which there is no possible society.

Notes

[1] Cavazza, Fred. “Du contenu roi aux données reines.” FredCavazza.net, Juillet 19, 2010. http://www.fredcavazza.net/2010/07/….

[2] O’Reilly, Tim. “Government As a Platform.” Dans Open Governement. Lathrope, Daniel et Ruma Laurel (éds.). O’Reilly Media, Inc., 2010. http://opengovernment.labs.oreilly.com/.

[3] Cardon, Dominique. “En finir avec le culte du secret et de la raison d’Etat.” Le Monde, Décembre 3, 2010. http://www.lemonde.fr/idees/article….

[4] LiberTIC. « Pourquoi la réutilisation des données publiques à des fins commerciales doit-elle être gratuite ? » InternetActu.net, mars 9, 2011. http://www.internetactu.net/2011/03…

[5] « L’open data n’est plus une chimère » : entretien avec Nigel Shadbolt,” RSLN, Février 24, 2011, http://www.rslnmag.fr/blog/2011/2/2….

[6] « La France de l’Open Data : où en est-on ? » Regards sur le numérique, mars 21, 2011. http://www.rslnmag.fr/blog/2011/3/2…

[7] Thierry Noisette, “Open Data : les données publiques de Paris ouvertes sous licence libre by-sa,” L’esprit libre, Décembre 19, 2010, http://www.zdnet.fr/blogs/l-esprit-….

[8] Fabrice Epelboin, “Bastille Day à #AmnDawla en Egypte : la révolution par l’open data radical,” ReadWriteWeb French edition, Mars 6, 2011, http://fr.readwriteweb.com/2011/03/…

[9] Kayser-Brill, Nicolas. “StateLogs : Wikileaks begins to reveal 250,000 diplomatic files.” OWNI, Digital Journalism, Novembre 27, 2010. http://owni.fr/2010/11/27/wikileaks….

[10] Guillaud, Hubert. “Culturomics : Comprendre les “lois” de la culture - La Feuille – Blog LeMonde.fr.” La Feuille, Décembre 17, 2010. http://lafeuille.blog.lemonde.fr/20….

[11] Michel, Jean-Baptiste, Yuan K Shen, Aviva P Aiden, Adrian Veres, Matthew K Gray, The Google Books Team, Joseph P Pickett, et al. “Quantitative Analysis of Culture Using Millions of Digitized Books.” Science (Décembre 2010). http://dx.doi.org/10.1126/science.1….

[12] Salmon, Christian. Storytelling  : La machine à fabriquer des histoires et à formater les esprits. Editions La Découverte, 2008.

[13] Andrew Vande Moere. « Information Aesthetics », s. d. http://infosthetics.com/.

[14] Harris, Jonathan, et Sep Kamvar. We Feel Fine : An Almanac of Human Emotion. 1er éd. Scribner Book Company, 2009.

[15] Bradshaw, Paul. « How to be a data journalist ». The Guardian, octobre 1, 2010. http://www.guardian.co.uk/news/data….

[16] “Johanna Drucker : Humanistic Approaches to the Graphical Expression of Interpretation”. MIT World, 2010. http://18.9.60.136/video/796.

[17] Dominique Cardon, “Zoomer ou dézoomer ? Les enjeux politiques des données ouvertes » Article »,” OWNI, Digital Journalism, Février 21, 2011, http://owni.fr/2011/02/21/zoomer-ou….

[18] Gahran, Amy. « Storify launches public beta : Curation is a core news skill ». Knight Digital Media Center, avril 28, 2011. http://www.knightdigitalmediacenter…

[19] Michael Gurstein, “Open data : Empowering the empowered or effective data use for everyone ?,” First Monday 16, no. 2 (Février 7, 2011), http://www.uic.edu/htbin/cgiwrap/bi….

Posted in short paper, Uncategorized | Tagged , , , , , | Leave a comment

I love the web !

With the advent of dedicated electronic reading devices and the marketing of the iPad in particular, the Web is no longer the sole channel for content delivery over the Internet. In any case, the idea developed by Michael Hirschorn in a very interesting article published last month that the magazine The Atlantic. Publishers of books on one side, who have always balked at making available their catalog on the Web, rush now that Apple, which with Amazon to be present in the virtual bookstore that they bring to them. The publishers, rolled by the iron law of “information wants to be free” (IWTBF) regarded as the iPad castaway contemplates his salvation. More importantly, in all minds, all media is now the quarry cons web guilty of all evils: unorganized, ineffective, dangerous, immoral, the Web is a wasteland, an area, a wasteland, a place perdition midway between the saloon and brothel where Steve Saint reduces consumer-sheep to lead them astray into the garden of Eden secure, orderly, clean and free from sin, enclosure fences beautiful white (“white picket fence “in the words of imaged Hirschorn) which is the custodian.

I just want to propose here a defense and illustration of this Web well unloved. In a word, all the while decrying, for my part I declare, loudly and very well, without fear or trembling: I love the Web!

I love the web

Prior to recite the reasons for this commitment, perhaps should we define Web, and say what the particularized over the Internet in general. The best way to understand just without using the technique of what the Web is to start the browser: the Web is allowing us to access content through a browser whatever – let’s consensus for once: Internet Explorer, Mozilla Firefox, Google Chrome, Opera or Safari. To function, the Web browser based on three techniques which are a combination web:

1. an addressing system that assigns an address to each “Web page, each document to find it in the incredible mess of billions of Web pages that is. This system URL – Uniform Resource Locator – is the backbone of the Web.

2. A language to structure and / or formatting, HTML allows the browser to compose web pages by placing the elements contained in the right place and to act as the page designer intended. HTML, Hypertext Markup Language, it is the muscles of the Web.

3. A very simple technology to create “hyperlinks” between documents, link them, enabling the user to jump from one document to another with a click of mouse. This is the hyperlink and the HTTP – Hypertext Transfer Protocol over which it relies, which allows to establish. The hyperlink is the bloodstream of the Web.

(4. One might add, to be quite complete: the layout, the final appearance of Web documents, ie the skin, the language defined by CSS-Cascading Style Sheets – which has gained increasing importance in recent years and it is not finished, see the bonus at the end of post).

Conclusion equation form: WWW = URL + HTML + HTTP (CSS +)

Now that the fundamentals are addressed, is why the Web is for me a unique technology to be defended and cherished with the utmost energy, and here’s why I love the Web.

I love the Web because it is simple and low tech

Cybermonks by PIMboula in CC by-nc-nd 2.0 on Flickr

The Web is simplicity at all levels: the uses firstly, through hyperlinks that allow movement from one document to another without having to learn any taxonomy or query syntax. It is not surprising that this is the first official Web browser – Mosaic – in 1993, which is the cause of the explosion of Internet applications that have been experienced since then. Why? Just because surfing the Web is within reach of a child 5 years. At the technical level then the Web is so simple and low tech, so little demand for resources that the last thing you will do a computer dyspeptic end of life, who can not even open Word 95 in less than 3 hours This will allow you to surf the Web. Suddenly, the Web is accessible to everyone, to all machines, all budgets, libraries and other public spaces can make available digital dozens of computers with a browser and Internet cafés flourish Gaza Hanoi, Delhi to Lagos giving a free or very low cost a considerable amount of information of all kinds for billions of people. This, I wish we would not forget it, because it is simply a revolution across the globe: “magical, wonderful, incredible, amazing, awesome, big, bigger, better, best , great, huge, terrific, Tremendous, Ambitious, gorgeous, beautiful, phenomenal, unbelievable, unbeatable, Remarkable, Revolutionary, extraordinary “say Steve Saint.

But also, the Web is so simple that it allows all those who wish to transmit and disseminate information cheaply and without having to acquire great technical knowledge. Do they know it is quite possible to compose a web page using Notepad his computer? Try to create the same conditions applied for iPad to view! And this web page that I made on my notebook, nothing easier than putting it on a Web server, one that is free at my disposal my ISP, for example, to make it available to land whole, without asking anyone (and I can even stream my page directly from my computer person, if I want). Conversely, you can not publish my application for iPad without obtaining the imprimatur and nihil obstat Steve Saint …

Homo Numericus 2000 – Hand Made

It is this simplicity to the creation of content that attracted me first and made me dive into the Web, like many of my generation. Living in a world where we explain all the time that everything is very complicated, where to do this or that, always a lot of time, obtain the agreement of a huge number of people permitted where it must constantly make compromises with the powers that be, the Web gave me first tastes of freedom to do everything alone, even though I later learned that it is also much more interesting do with others. But precisely because I am free to write and distribute my ideas, explore possible without asking anything to anyone, I am much more fun doing it with others, and partly because I’m there not constrained.

I love the web because it is open and Social

Let’s talk about precisely what “doing with others”. One of the more heartening is that the Web becomes more and more social: you think social networks of course – deeply immersed in the Web, but we must also think of the hundreds of thousands of online communities that congregate around Web sites of all kinds and on all subjects, we must also mention the famous “Planet” that structure the blogosphere and give new life to old “webrings”, not to mention all the collaborative sites, Wikipedia and others, who are born of the Web and allow thousands of people co-create content, the logic of Web 2.0. We need to examine the engine of this “socialization” progressive web.

Do not use (your) public garden! by Castorp Republic in CC by-nc-sa 2.0 on Flickr

What enables these multiple collaborations, these movements by which people are constantly exchanging, more or less close, but more and more intense? I think that is exactly why we criticize most often the web: its unorganized, without pre-established plan, and open to all winds. The Web is a bazaar where a gay chat would not find its young, is a Spanish hotel where everyone brings a dish from home calling a meal for less colorful, is a lush and bloom the hundred flowers of diversity and it is exactly what gives it life and is so similar to … the company itself, an open and welcoming society, self-organizing somehow, but as vulgar, violent, sometimes dangerous, destructive and creative at the same time, that is to say simply living! In comparison, the Eden of St Steve is very comfortable, pleasant, no weeds or thorns, the roads are straight and well raked, everything has its place and a place for everything, one might relax and leave the children frolic without fear of bad shots. But his cottage garden does not promote conversation or meetings, it does not allow visitors to combine an area of garden and invent together the novel.

A corporation is not a company or an army, nor a church. This is not a shop or a medium and it is not even a family or community. The company encompasses and transcends all these forms of collective. It is like an ocean without limit, it is multidimensional and nobody has a set agenda in mind. Some think perhaps the government, but they are deluding themselves mostly. One company saw his own life and grows in unpredictable directions and unexpected. It feeds on many meetings, exchanges and interactions that connect its members. These interactions are disordered, chaotic, unexpected too, and it is the engine of creation. It seems that the Web works in much the same way.

I love the Web because it is democratic.

Yes, de-mo-cra-tion. I am well aware that a dirty word in this day and age, and I urge my readers to please forgive me, but the combination of the foregoing: the Web as we know, that is that is accessible, open, social and self gives a quality political advantage. That is why it is much vilified by the powerful, political, media stars and other captains of industry, who have words strong enough to denounce the “all-in-sewage”, the “worst crap ever invented by man “, the den of terrorists and pedophiles and more recently Trotskyite-fascist. The Web is hurting because it is powerful

Student demonstration by Philip Leroyer, the by-nc-nd 2.0

place where the people – another big word – expressed freely and critically. Understand that the entire body of legislation that has been building for over ten years and one that promises all the measures demanded by each other to regulate and control the “no-go zone” that would the Web, all the speeches of vilification is meant all the time against this means of communication are attempts to reduce the freedom of expression that characterizes him.

“Internet [she meant the Web] is a public hazard as open to anyone to say anything,” exclaimed one day was Francoise Giroud. I quite agree with it, and it is precisely why I love so much the Web.

Bonus: And the Web is cool too!

The supposed poor quality, the so-called limitations of HTML are often used to fuel criticism of the Web, the part of editors and designers in particular. The HTML would be unable to meet the high standards of the beautiful typography, with hyphenation, fonts and glyphs licked choice, hence the use of PDF files. The HTML would be unable to support any animation, where the proliferation of sites or Flash video (including Saint Steve does not matter). These reviews could be based at a time. They are now less and less, thanks, first, the modernization of browsers, and also to changing standards themselves. On the first point (text and typography), you should know that Epub format that is currently prevail for encapsulating electronic books to read on the shelves of reading is based on HTML and crushes it greatly PDF unable to reposition a text on the fly depending on the size of the support. Nice revenge! On the second point (images and animation), you must see this video Nitot proposed by combining HTML5, CSS3 and Firefox4 us away eyes.

Disclaimer: I have a iPad me too and I love it. … Especially to surf the Web.

Posted in Uncategorized | Leave a comment