My Photo

WorldCat


Twitter Updates

    follow me on Twitter
    Blog powered by TypePad

    google analytics


    meter


    Categories

    Categories

    « Tertulia (Book Review: Living to Tell the Tale) | Main | LOCKSS - Floating electronic librarianship to a higher level »

    January 08, 2007

    Cows and the Colossus

    Img_6511 Mike Keller delivered a presentation at OCLC today entitled Mass Digitization in Google Book Search: Effects on Scholarship. Mike is director of Stanford University Libraries, and wears an academic publisher’s hat as well, being responsible for High Wire Press and Stanford University Press. He commands a panoramic view of the digital scholarly landscape, and has the intellect and experience to convert view to vision. This vision is both breathtaking and, in some respects, disturbing.

    For those unsettled by the rapidity of Googlian hegemony in library spaces, Mike constructs a vivid and compelling argument for embracing the revolution, reaching back to the digitization of card catalogs. Several salient observations from his remarks:

    • Digitization of the card catalog resulted in a 50 % increase in book usage
    • Google indexing is the #1 driver of article usage in High Wire – by a large margin (10 to 1 beyond the next highest, if I understood him correctly)
    • Metadata searching (what Keller describes as subtle searching), in combination with novel methods of taxonomic search and citation cross-linking, dramatically improves discovery and navigation within large result sets.

    It is difficult to resist Keller’s assertion that Google Book Search (GBS) is likely to revolutionize access to books more than any single factor in the library world – if not directly, then indirectly. It would be hard to be a librarian and not find chagrin in this realization.  Keller rightly urges us to focus on the larger picture and the many benefits.

    Stepping away from the somewhat daunting implications for libraries, Keller suggested that the most important thing about GBS is that it has occasioned a great debate about the importance of copyright in the intellectual life of the nation (and the world). The legal wrangling surrounding the digitization of books in the so called Google Libraries may indeed midwife a re-examination of copyright issues on a grand scale. Certainly the current interpretation of copyright law in the United States is far more likely to be driven by supply-chain economics than by the intent of the framers of the Constitution. Perhaps for the first time, there are heavy hitters on both sides of the argument, which may result in a reinterpretation of fair use that makes more sense (to libraries and readers) in the digital age.  One may hope.

    Keller pointed out the importance of healthy competition among various digitization projects: the Million Book Project, GBS, the Open Content Alliance, the Microsoft/British Library and Microsoft/Cornell efforts. Could we have imagined anything like this rush to mine the library shelves of the world even a few years ago? Could we (the library community) have marshalled either the vision or the resources to accomplish the task on our own?  It is unlikely.

    On the dark side, he raised the image of libraries as herds of cows in these deals. Participants are kept in the dark, enjoined from sharing the details of their deals with other participants, let alone with their public constituencies. They (we) don’t know if they are being “milked, butchered, or destined to cover automobile seats of expensive cars.”  [I paraphrase, but only slightly]

    What is evident is that benefits for the G-Libraries are substantial. The libraries involved receive a windfall of the digitized contents of their collections (though, Keller also points out that much is likely to have to be recaptured at higher resolution in the future).  For institutions of the caliber of these early G-partners, their technical wherewithal and innovative approaches promise rich research environments as well as new services to readers.

    There are plenty of objections, of course, some with a Luddite feel, others based on money, control, cultural imperialism, disintermediation of libraries, and the hegemony of a colossus too large and too innovative to ignore. Publishers, authors, and librarians all have much to lose, but also many potential gains in how this plays out in the long run. It would seem that only for readers is the benefit clear and unambiguous. But even readers may lose in a world where any monolith controls access to the content, whether that monolith be a government or Google.

    Mike Keller is an articulate spokesperson for the dilemmas and opportunities that are upon us. I found myself unable to resist most of his arguments, even a few I wanted to reject. Our community (or at least an elite component of it) has been offered a deal impossible to resist, a side effect of the inexorable march of digitalization that began in our realm with Henriette Avram’s vision of machine readable cataloging.

    Most of our constituents will benefit greatly, while libraries will rattle from the footfalls of the Googles and their ilk. Still, I am unnerved that these developments are cloaked in non-disclosure agreements, even as I am excited about the extraordinary challenges and possibilities we face.  Will cooperation within the community survive the Faustian bargains from without?

    -----

    My thoughts here are almost certainly flawed renditions of Mike's remarks... I was jotting notes as fast as my dyslexic fingers would go, and trying to pay attention as well.  Responsibility for distortions or errors are of course my own.
    -----
    Image: OCLC at dawn

    TrackBack

    TrackBack URL for this entry:
    http://www.typepad.com/t/trackback/462557/7422526

    Listed below are links to weblogs that reference Cows and the Colossus:

    » Cows and the Colossus from University Update
    [Read More]

    Comments

    Hey Stu, interesting post. I've been waiting for you to get back in the groove and start posting again. The image is a nice touch ... ;-)

    I think the interesting question about the massive book digitization efforts will be whether we, as librarians, will pull ourselves together to figure out how to provide useful access to these resources that are different than what Google/Microsoft/whoever will be doing. The libraries working with the behemoths will have their own versions of the text and images, but I've not heard much about what they plan to do (above and beyond the inevitable scramble for storage space).

    Diane

    A great post; thank you. I would offer two little notes:

    1. In my opinion (stated previously elsewhere), the most significant effect of GBS on libraries is a likely increase in circulation (with Worldcat.org playing an important intermediary role), quite possibly a substantial increase. This is not, I believe, a terrible thing.

    2. Well, Stanford may be stuck with a nondisclosure agreement, but the older institution across the bay wasn't, since public institutions can't be held to nondisclosure agreements in California (and several other states). So we know the terms of UC's contract with Google, just as we know the terms of Michigan's contract. You'd think Google would just give up on the nondisclosure at this point...

    Post a comment

    If you have a TypeKey or TypePad account, please Sign In