Tim Berners-Lee delivered his one-hour keynote on the Semantic Web at the AAAI’06 conference yesterday, after an introduction from Yolanda Gil. Tim gave an impassioned speech covering the last 16 years of the web and discussed the future of sharing data on the web using persistent URIs and W3C standards like RDF and OWL. At the end of it all, there were some searching questions from Peter Norvig, Director of Research at Google Labs.
Peter opened his questions by saying:
Many people usually ask me, when I stand up and ask questions after keynote speeches at conferences:
“Peter, what do you have against the Semantic Web?”
Here is roughly what Peter said. The semantic web will never work because:
- People are stupid: Google has lots of experience of dealing with people’s stoopidity on the web. Many people don’t write well-formed HTML, they don’t run web servers properly and they keep changing what their URIs identify. It sucks, but this is the world: imperfect and messy, and we just have to deal with it. These same people can’t be expected to use the Resource Description Framework (RDF) and the Web Ontology Language (OWL), which are much more complicated and considerably less fool-proof. (Perhaps you could call this the dumb-antic web?!)
Tim replied that a large part of the semantic web can be populated by taking existing relational databases and mapping them into RDF/OWL. The structured data is already there; it just needs web-izing in a mashup-friendly format. (What I like to call the romantic web: people will publish their data freely on the web this way, especially in e-science, for example. This will allow sharing and re-use in unexpected ways.)
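To make the "web-izing" idea a bit more concrete, here is a minimal sketch of how rows in a relational table might be turned into RDF triples, written out as N-Triples. This is my own illustration, not something Tim showed: the `http://example.org/` namespace, the `book` table, its sample row and the function names are all made up for the example, and a real mapping would use proper vocabularies and datatypes rather than plain string literals.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical namespace, chosen for illustration


def escape_literal(value):
    """Escape a value for use inside an N-Triples string literal."""
    text = str(value)
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))


def table_to_ntriples(conn, table, key):
    """Yield one N-Triples line per non-NULL, non-key cell of `table`.

    Each row becomes a subject URI built from its key value, each column
    becomes a predicate URI, and each cell becomes a plain string literal.
    `table` and `key` are assumed to be trusted identifiers, not user input.
    """
    cursor = conn.execute(f"SELECT * FROM {table}")
    columns = [description[0] for description in cursor.description]
    key_index = columns.index(key)
    for row in cursor:
        subject = f"<{BASE}{table}/{row[key_index]}>"
        for column, value in zip(columns, row):
            if column == key or value is None:
                continue
            predicate = f"<{BASE}{table}#{column}>"
            yield f'{subject} {predicate} "{escape_literal(value)}" .'


def demo():
    """Build a tiny in-memory database with invented data and map it."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT, price REAL)"
    )
    conn.execute("INSERT INTO book VALUES (1, 'Example Book', 12.5)")
    return list(table_to_ntriples(conn, "book", "id"))


if __name__ == "__main__":
    for line in demo():
        print(line)
```

The row's primary key ends up in the subject URI, so the same book keeps the same identifier every time the table is exported, which is roughly what the persistent URIs in the keynote are about.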
- People are competitive: People working for commercial companies and market leaders can’t be expected to just put their raw data on the web as RDF/OWL, because they have little interest in standards. This is how they make a living: beating their competitors and locking their customers into proprietary data formats, so they can keep selling them software and hardware to use their data. (There are analogous problems in science: scientists can be reluctant to share and publish data if someone else will make new discoveries with it and claim all the glory.)
Tim replied that most bookstores once thought putting their stock levels and prices on the web was a bad idea, as it would give sensitive information away to their competitors. However, they soon realised that this would allow their customers to search, browse and eventually (kerr-ching) buy their books.
- People cheat and lie: People lie about what their content is about, and again, Google is on the receiving end of this. Cheats try to fool the PageRank algorithm by saying their web pages are about books or movies, when they are really about Viagra or pornography. The same fate awaits RDF and OWL: cheaters will use ontologies to tell bare-faced lies about their data. (What I like to call the satanic web: people do evil things.)
Tim didn’t have any good answers to this, although later in the day there were some papers touching on the issue of Trust and Policies in the semantic web layer cake.
These lively debates are raging on unabated, in the corridors, lecture theatres and bars. AAAI is now in full swing and it’s great to be here!
Further reading
- Peter Norvig: Teach Yourself Programming in Ten Years (norvig.com)
- Stuart Russell and Peter Norvig: Artificial Intelligence: A Modern Approach
- Peter Norvig: Paradigms of Artificial Intelligence Programming: Case Studies in Common Lisp
- Harry Chen: Google Executive challenges Tim Berners-Lee
- Jamie Pitts: Comments on Tim Berners-Lee presentation at AAAI’06: Q&A
- Danny Ayers: Semantic “incompetents” revolt against Google!
- Tim Finin: Google and the Semantic Web
- A lot of Peter Norvig’s arguments look like Cory Doctorow’s Metacrap: Putting the torch to seven straw-men of the meta-utopia, from back in 2001.