O'Really?

April 17, 2009

The Unreasonable Effectiveness of Google

Filed under: Googleology — Duncan Hull @ 4:00 pm
Tags: Adam Kilgarriff, Alistair Miles, Allyson Lister, Alon Halevy, Andrew Clegg, Artificial Intelligence, bioformats, Biomodels, bootstrep, ChEBI, David Shotton, Dietrich Rebholz-Schuhmann, Eugene Wigner, Fernando Pereira, Frank van Harmelen, Gene Ontology, Googleology, Googleplex, Jim Hendler, Larry Page, Michael Uschold, Nicolas le Novère, OBO, Opinion, Ora Lassila, Peter Norvig, provocative, pubmed, PubMedCentral, reasoner, Reasoning, sbml, scifoo, Sergey Brin, Steffano Mazzocchi, Tim Berners-Lee, unreasonable

Via the Official Google Research Blog at the University of Google, Alon Halevy, Peter Norvig and Fernando Pereira have published an interesting expert opinion piece in the March/April 2009 edition of IEEE Intelligent Systems: computer.org/intelligent. The paper talks about embracing complexity and making use of “the unreasonable effectiveness of data” [1], drawing analogies with the “unreasonable effectiveness of mathematics” [2]. There is plenty to agree and disagree with in this provocative article, which makes it an entertaining read. So what can we learn from those expert Googlers in the Googleplex? (more…)

Comments (5)

April 9, 2009

Upcoming Gig: The Scholarly Communication Landscape

Filed under: informatics — Duncan Hull @ 12:35 pm
Tags: Ben Stebbing, Bill Hubbard, BioMed Central, Carole Goble, CIBER, citeulike, david booton, eScholar, flickr, friendfeed, Institutional Repository, jan wilkinson, John Rylands, JRULM, library, Manchester, Michael Daw, Michael Jubb, Mike Daw, MIMAS, myexperiment, nactem, Research Information Network, Robin Hunt, SHERPA, simon gaskell, slideshare, Sophia Ananiadou, stell butler, Terri Attwood, upcoming gig, wordpress

Details of an upcoming gig, The Scholarly Communication Landscape in Manchester on the 23rd of April 2009. If you are interested in coming, you need to register by Monday the 13th April at the official symposium pages.

Why? To help University staff and researchers understand some of the more complex issues embedded in the developments in digital scholarly communication, and to launch Manchester eScholar, the University of Manchester’s new Institutional Repository.

How? Information will be presented by invited speakers, and views and experience exchanged via plenary sessions.

Who For? University researchers (staff and students), research support staff, librarians, research managers, and anyone with an active interest in the field will find this symposium helpful to their developing use and provision of research digital formats. The programme for the symposium currently looks like this:

Welcome and Introduction by Jan Wilkinson, University Librarian and Director of The John Rylands Library.

Session I Chaired by Jan Wilkinson

Is the Knowledge Society a ‘social’ Network? Robin Hunt, CIBER, University College London
National Perspectives, Costs and Benefits Michael Jubb, Director, Research Information Network
The Economics of Scholarly Communication – how open access is changing the landscape Deborah Kahn, Acting Editorial Director Biology, BioMed Central

Session II Chaired by Dr Stella Butler

Information wants to be free. So … ? Dr David Booton, School of Law, University of Manchester
Putting Repositories in Their Place – the changing landscape of scholarly communication Bill Hubbard, SHERPA, University of Nottingham
The Year of Blogging Dangerously – lessons from the blogosphere, by Dr Duncan Hull (errr, that’s me!), mib.ac.uk. This talk will describe how to build an institutional repository using free (or cheap) web-based and blogging tools including flickr.com, slideshare.net, citeulike.org, wordpress.com, myexperiment.org and friendfeed.com. We will discuss some strengths and limitations of these tools and what Institutional Repositories can learn from them.

Session III Chaired by Professor Simon Gaskell

The University Press and Digital Publishing Ben Stebbing, Manchester University Press
MIMAS’ role in Supporting the Repository Landscape Vic Lyte, MIMAS
Defrosting the Digital Library (hmmmm, nice title) Professor Terri Attwood, Faculty of Life Sciences
Research Computing at Manchester, Dr Mike Daw, Head of Research Computing, IT Services Division
Enhancing User Experience of Scholarly Communication through Text Mining, Dr Sophia Ananiadou, Director, National Centre for Text Mining (NaCTeM.ac.uk)
Manchester eScholar – what, why and when Professor Carole Goble, School of Computer Science

Summary and close by Professor Simon Gaskell, Vice-President for Research

April 6, 2009

Should We Boycott Amazon (again)?

Filed under: bookish,economics — Duncan Hull @ 8:28 am
Tags: Amazon, amazon boycott, amazon.co.uk, Basil Blackwell, blacksci, Blackwell Science, bookbrunch, BookDepository, BookSeller, bookseller.com, Boycott, boycott amazon, Capitalist, CD, DVD, etail, John Wiley, monopoly, Oxford, play.com, Richard Stalman, Waterstones

My first proper full-time job was working in the big bad world of scientific publishing for a family-run company based in Oxford called Blackwell Science Limited, or blacksci.co.uk, which is now part of wiley.com. Consequently, I’ve a few friends and former colleagues who still work in various parts of the publishing industry. Last week I got an email about Amazon from one of these friends, who works for a small independent book publishing company. I’ve reproduced their message below (with permission):

This is very unlike me but I am sending a general email out because I am so outraged by something I feel I must share with you. In case you didn’t already know, I work for a small publisher. Times are hard – we all know that. Amazon.co.uk form a large part of our business. Recently they have changed their terms with all of their publishers. For us, and many other small and independent publishers, these new terms are completely unacceptable. We have no say about it and the way they went about it was frankly nasty (they basically sent an email out giving us a week to decide whether to give them more discount or more credit). For bigger publishers it may have a negligible effect but for smaller publishers, where cashflow can mean everything, the effect will be severe! And they have us over a barrel.

Amazon.co.uk so dominate the online market in books that they are almost a monopoly. The discounts we’ve been supplying Amazon for the last few years are outrageous – but what they have done recently is the last straw, and many small publishers could go out of business (luckily I think we’ll survive!). I am so outraged at how they are treating their suppliers that I am now boycotting Amazon for my own personal books and CDs. I have been using them for years and years. The only way to put a bit of healthy competition back into the system is by having more online book retailers become as successful as Amazon. Today we used The Book Depository bookdepository.co.uk for the first time. The books we wanted were all there, in stock and cheaper than Amazon and it was very easy to use. So we’re trying to help spread the word!

Another online retailer is waterstones.com, which separated from Amazon a few years ago due to their unworkable terms. I haven’t used them myself but I hear they are pretty good, and play.com can fulfil your DVD and CD requirements (and all delivery is free I think).

They may not always be as cheap as Amazon but now you know how Amazon get their low prices you may not be as happy to use them – if small, interesting, independent publishers go out of business it’ll just be the biggies left (which will mean much less choice).

So, is the behaviour of Amazon.co.uk just the all too familiar face of capitalism? Or should we boycott Amazon for being a big bully only interested in monopolising the marketplace and getting rid of some healthy competition?

References

Catherine Neilan (2009) Amazon refused to budge on new terms, Bookseller.com 2009-03-30
Liz Thomson (2009) Advantage Amazon? Publishers react to proposed new terms Bookbrunch.co.uk 2009-03-26
Richard Stallman (2001) (Formerly) Boycott Amazon! – GNU Project – Free Software Foundation (FSF) gnu.org

Comments (3)

April 2, 2009

Upcoming Gig: Science Foo Camp (scifoo) 2009

Filed under: awards — Duncan Hull @ 10:57 am
Tags: california, cameron neylon, Chris DiBona, congratulations, David DeRoure, Disney, Douglas Kell, foo, foo camp, Foobar, Frank Wilczek, Fubar, Googleology, Googleplex, nobel, Peter Murray-Rust, Pooh Camp, Richard P. Grant, Russ Altman, scifoo, scifoo09, Shirley Wu, Stanford University, Tim O'Reilly, Timo Hannay

In my inbox this morning, an intriguing email from Timo Hannay, Tim O’Reilly and Chris DiBona:

Duncan,

We’d like to invite you to join us for Science Foo Camp (or “Sci Foo”), a unique, invitation-only gathering organized by Nature, O’Reilly Media, and Google, and hosted at the Googleplex in Mountain View, California.

Now in its fourth year, Sci Foo is achieving cult status among those with a passion for science and technology. Nobel laureate Frank Wilczek wrote of last year’s event:

“SciFoo is a conference like no other. It brings together a mad mix from the worlds of science, technology, and other branches of the ineffable Third Culture at the Google campus in Mountain View. Improvised, loose, massively parallel–it’s a happening. If you’re not overwhelmed by the rush of ideas then you’re not paying attention.”

As before, we will be inviting about 200 people from around the world who are doing groundbreaking work in diverse areas of science and technology. Participants will include not only researchers, but also writers, educators, artists, policy makers, investors, and other thought leaders.

The format is highly informal: all delegates are also presenters and demonstrators; the schedule is determined collaboratively on the first evening; and sessions continue to be organized and re-organized throughout the weekend. This creates a unique opportunity to explore topics that transcend traditional boundaries, and discussions are of a kind that happens at the best conferences during breaks and late into the night. Of course, there will also be time to have fun and relax at Google’s legendary campus.

Sci Foo 2009 will run from about 6pm on Friday, July 10 until after lunch on Sunday, July 12. Campers need to make their own way to and from the event, but Google will provide accommodation and meals, and there is no registration fee. For those who don’t have cars, there will also be free shuttle buses between the hotel and the Googleplex.

Please RSVP etc

We hope to see you at the Googleplex in July!

Tim O’Reilly, O’Reilly Media
Chris DiBona, Google
Timo Hannay, Nature

About Nature Publishing Group

Nature Publishing Group (NPG) is dedicated to serving the information and communication needs of scientists and medics. NPG’s flagship title, Nature, first published in 1869, has now been joined by over 80 other titles, among them the Nature research journals, Nature Reviews, Nature Clinical Practice and a range of prestigious academic journals including society-owned publications. It also operates the leading scientific website, Nature.com, and a range of innovative online services, from databases to collaboration tools and podcasts.

About O’Reilly Media

O’Reilly Media spreads the knowledge of innovators through its books, online services, magazines, and conferences. Since 1978, O’Reilly has been a chronicler and catalyst of leading-edge development, homing in on the technology trends that really matter and spurring their adoption by amplifying “faint signals” from the alpha geeks who are creating the future. Whether it’s delivered in print, online, or in person, everything O’Reilly produces reflects the company’s unshakeable belief in the power of information to spur innovation. An active participant in the technology community, the company has a long history of advocacy, meme-making, and evangelism.

About Google Inc.

Google’s Philosophy – Never settle for the best. “The perfect search engine,” says Google co-founder Larry Page, “would understand exactly what you mean and give back exactly what you want.” Given the state of search technology today, that’s a far-reaching vision requiring research, development, and innovation to realize. Google is committed to blazing that trail. Though acknowledged as the world’s leading search technology company, Google’s goal is to provide a much higher level of service to all those who seek information, whether they’re at a desk in Boston, driving through Bonn, or strolling in Bangkok.

About Foo Camps

The “Foo Camp” meeting format has been pioneered by O’Reilly (see when geeks go camping). In this context, “Foo” originally stood for “Friends Of O’Reilly”, but it is also a meaningless ‘placeholder word’ commonly used by computer programmers, rather like the term ‘X’ in algebra. The success of O’Reilly’s original technology Foo Camps has stimulated a wide range of similar events, from Science Foo Camp to Disney’s Pooh Camp.

Obviously I’m thrilled to bits to receive such an email: I’ve been to scifoo once before and it was a fantastic, mind-blowing experience. This time, I’m invited as a consolation prize for being a runner-up in the international science blogging challenge 2009, which challenged younger scientists to get a senior scientist to blog. I managed to convince Douglas Kell and David DeRoure to start blogs, so thanks are due to them for entering into the spirit of the competition. This year, the first prize was won by Russ Altman and Shirley Wu at Stanford University. Congratulations Shirley and Russ, it will be good to compare scientific blogging notes with you both.

Now, it would have been good to win this prize, but the invite above is probably one of the best runner-up prizes I’ve ever had. Thanks are due to the competition judges Cameron Neylon, Peter Murray-Rust and Richard P. Grant for organising the competition. Thanks also to Tim O’Reilly, Timo Hannay and Chris DiBona, see you in the Googleplex!

[More commentary on this post over at friendfeed]

Comments (10)

March 16, 2009

Defrosting the Digital Slideshow

Filed under: biotech,communication,informatics — Duncan Hull @ 3:14 pm
Tags: 2Collab, BBC Monitoring, bibtex, biochemistry, bioinformatics, Casey Bergman, ChEBI, cheminformatics, citeulike, connotea, CSW Informatics Ltd, database, endnote, Ford, google scholar, identity, Institutional Repository, John Chelsom, library, Mavis Cournane, Mekentosj Papers, Mendeley, metadata, Neil Smalheiser, openid, Papyro, Peter Murray-Rust, pubmed, refworks, scopus, text mining, Vetle Torvik

Slides from the seminar today, for those that asked for them. Thanks to everyone who came, we had a good turn out, much better than expected.

Those Library and Institutional Repository people have asked for an encore too…

Comments (2)

March 12, 2009

Defrosting the Digital Seminar

Filed under: bio,biotech — Duncan Hull @ 8:37 am
Tags: bbsrc, bioinformatics, Casey Bergman, citeulike, google scholar, Jean-Marc Schwartz, Lecture, life sciences, nactem, pubmed, REFINE, seminar, text mining, University of Manchester

Casey Bergman suggested it, Jean-Marc Schwartz organised it, so now I’m going to do it: a seminar on our Defrosting the Digital Library paper as part of the Bioinformatics and Functional Genomics seminar series. Here is the abstract of the talk:

After centuries with little change, scientific libraries have recently experienced massive upheaval. From being almost entirely paper-based, most libraries are now almost completely digital. This information revolution has all happened in less than 20 years and has created many novel opportunities and threats for scientists, publishers and libraries.

Today, we are struggling with an embarrassing wealth of digital knowledge on the Web. Most scientists access this knowledge through some kind of digital library; however, these can be cold, impersonal, isolated, and inaccessible places. Many libraries are still clinging to obsolete models of identity, attribution, contribution, citation and publication.

Based on a review published in PLoS Computational Biology (http://pubmed.gov/18974831), this talk will discuss the current chilly state of digital libraries for biologists, chemists and informaticians, including PubMed and Google Scholar. We highlight problems and solutions to the coupling and decoupling of publication data and metadata, with a tool called CiteULike (http://www.citeulike.org). This software tool exploits the Web to make digital libraries “warmer”: more personal, sociable, integrated, and accessible places.

Finally, issues that will help or hinder the continued warming of libraries in the future, particularly the accurate identity of authors and their publications, are briefly introduced. These are discussed in the context of the BBSRC-funded REFINE project at the National Centre for Text Mining (NaCTeM.ac.uk), which is linking biochemical pathway data with evidence for pathways from the PubMed database.

Date: Monday 16th March 2009, Time: 12.00 midday, Location: Michael Smith Building, Main lecture theatre, Faculty of Life Sciences, University of Manchester (number 71 on google map of the Manchester campus). Please come along if you are interested…

[CC licensed picture above, “The Lecture” at Speakers Corner by James M Thorne]

Comments (2)

February 25, 2009

A Fistful Of Papers: Journal Club for Gunslingers

Filed under: biotech — Duncan Hull @ 4:30 pm
Tags: clint eastwood, fistful, Journal Club, lego

A Fistful of Papers is a Journal Club with a simple recipe

We pick interesting papers
We read them
We periodically meet to discuss said papers in the local saloon

It’s all good fun. If you’d like to join us, details of the next gathering, on Friday 27th February, can be found over at fistful.wordpress.com (Journal Club for Gunslingers).

[Clint Eastwood picture by Lego Man Andrew Becraft a.k.a. Dunechaser]

February 20, 2009

Mistaken Identity: Google thinks I’m Maurice Wilkins

Filed under: funny,google,informatics — Duncan Hull @ 8:35 am
Tags: algowithm, Anurag Acharya, bibliometrics, DNA mania, Double Helix, forgotten password, google scholar, googlebot, identity, impact factor, interweb, Jules De Martino, Katie White, maurice wilkins, Neil Smalheiser, nobel, nodalpoint, Péter Jacsó, The Ting Tings, Vetle Torvik

In a curious case of mistaken identity, Google seems to think I’m Maurice Wilkins. Here is how. If you Google the words DNA and mania (google.com/search?q=dna+mania), one of the first results is a tongue-in-cheek article I wrote two years ago about our obsession with Deoxyribonucleic Acid. Now Google (or more precisely Googlebot) seems to think this article is written by one M Wilkins. That’s M Wilkins as in the physicist Maurice Wilkins, the third man of the double helix (after Watson and Crick) and Nobel prize winner back in ’62. How could such a silly (but amusing) mistake be made? Because the article is about what Wilkins once said, but not actually by Wilkins. Computers can’t tell the difference between these two things. It has been known for some time that Google Scholar contains many other mistaken author identities like this. Scholar even thinks there is an author called Professor Forgotten Password (a prolific author who has been widely cited in many fields)!

The other curiosity is this: the original post on nodalpoint.org is also counted as a citation in Google Scholar. It’s a bit of a mystery how Scholar actually works, what it includes (and excludes) and how big it is, but you’ll find the article counted as a proper citation for a book about genes. Scientific spammers must be licking their lips at the opportunity to influence results and citation counts with humble blog posts, rather than more kosher articles in peer-reviewed scientific journals.

So what does all this curious interweb mischief tell us?

Identifying people on the web is a tricky business, more complex than most people think
Googlebot needs to have its algowithms tweaked by those Google Scholars at the Googleplex. Not really surprising, what else did you expect from Beta software? (P.S. Googlebot, when you read this, I’m not Maurice Wilkins, that’s not my name. I haven’t won a Nobel prize either. I’m sort of flattered that you’ve mistaken me for such a distinguished scientist, so I’ll enjoy my alternative identity while it lasts.)
Blogs are increasingly part of the scientific conversation and are counted in various bibliometrics. Will Google Scholar (and the rest) start indexing other blogs too? Where will this trend leave more conventional bibliometrics like the impact factor?

(Note: These search results were correct at the time of writing, but may change over time, results preserved for posterity on flickr)

References

Maurice Wilkins (2003) The Third Man of the Double Helix: The Autobiography of Maurice Wilkins isbn:0198606656
Péter Jacsó (2008) Savvy searching – Google Scholar revisited. Online Information Review 32: 102-11 DOI:10.1108/14684520810866010 (see also Defrosting the Digital Library)
Douglas Kell (2008) What’s in a name? Guest, ghost and indeed quite imaginary authorships BBSRC blogs
Neil R. Smalheiser and Vetle I. Torvik, Author Name Disambiguation (This is a preprint version of a chapter published in Volume 43 (2009) of the Annual Review of Information Science and Technology (ARIST) (B. Cronin, Ed.), which is available from the publisher Information Today, Inc (http://books.infotoday.com/asist/#arist).)
Duncan Hull (2007) DNA mania. Nodalpoint.org
Jules De Martino and Katie White (2008) That’s not my name (video)

Comments (4)

February 11, 2009

Janet Street-Porter on the Internet Revolution

Filed under: publishing,Science — Duncan Hull @ 8:40 am
Tags: BBC2, Copyright, interweb, iPlayer, Janet Street-Porter, new media, old media, Open Access, peer review, Pergamon Press, revolution, Robert Maxwell, Rupert Murdoch, vanity journals

I’m not much of a fan of Janet Street-Porter, neither am I a regular viewer of the BBC Money programme but right now they are screening an interesting series of three half-hour programmes on the impact of the internet on newspapers, books and television. It’s a familiar tale of the power-and-money struggle between old media and new media that, if the first programme is anything to go by, is worth watching. Here is the blurb from the first episode in the series, billed as Media Revolution: Stop Press?

Former national newspaper editor Janet Street-Porter investigates how papers are coping with falling circulation, advertising revenues and the growth of the internet, and asks if newspapers can survive in their current form. In her quest to discover what the future holds for her beloved newspapers, Janet visits newsrooms, printing plants and even spends a morning as a papergirl. With contributions from national editors, advertising gurus and a rare interview with media mogul Rupert Murdoch, Janet examines if papers can survive as new multimedia information giants.

There are some interesting parallels between the changes described in this programme, and scientific media, especially the scientific journal publishing racket.

Scientific Media Revolution?

The story of the current revolution in scientific and technical publishing is perhaps just as interesting as (and more important than) the one being told on the Money Programme. Just think of it: why scientists publish, the emergence of peer review, how Robert Maxwell made his fortune from the Pergamon Press, the impact factor game, the birth of the Web (in a scientific laboratory), the growth of Google, the copyright wars, open-access publishing, social software, the rise and fall of publishing empires (and technology companies), the vanity journals, scientific blogs and wikis, software showdowns, and how all this change affects producers and consumers of science and technology, both now and in the future. A juicy subject, worthy of broadcasting on any media (old or new). You would need a lot more than three half-hour programmes to cover this particular ongoing epic, so who is going to tell that story?

Anyway, the series is worth a look (if you haven’t already seen it), at least according to me (others disagree; see also no paper is the future). Each episode is also available on iPlayer for up to a week after first broadcast (Thursday 5th, 12th and 19th February 2009), in the UK only, unless you go through some kind of proxy.

February 6, 2009

The Loneliness of the Long Distance Researcher

Filed under: Science — Duncan Hull @ 8:53 am
Tags: Allan Sillitoe, Bernhard Palsson, Britney Spears, Gordon Plotkin, hermit, Isolation, Libby Miller, lonely, lyrical, mcisb, Paul Duguid, phd, Scott Berkun, Yuri Lazebnik

Despite what some people think (see “the myth of the lone inventor” in [1]) most scientists are usually pretty sociable people. Science is an inherently social activity [2], just take a look around you. Most laboratories are full of like-minded people working on related problems, our lab is no exception. Outside the lab, there are all the conferences, workshops, seminars, trips to the pub, coffee breaks and other meetings where scientists meet and exchange ideas and results. Finally, note the peer in peer-review – another essentially social activity, even when it is anonymous.

But in between these gregarious social activities there is a long, lonely and pretty unsociable road where you need to spend lots of time thinking, reading, writing and experimenting. Essentially you are alone, like a modern day hermit, especially at the earlier stages of a career. Solitary confinement in your ivory tower of choice needs to be balanced with various kinds of socialising. Talking about and watching what other people are doing, as well as publicising your own work are an essential part of the mix. But you still need to put the hours in on the road. It isn’t always easy to get it right, so how do you strike a balance between the social and the solitary activities to establish yourself as an independent research scientist? (more…)

Comments (4)
