Digital Natives » Blog Archive » Google Book Search, Orphan Works and the Public Domain

June 2008
M	T	W	T	F	S	S
	1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

Google Book Search, Orphan Works and the Public Domain

Comments: 8 - Date: June 25th, 2008 - Categories: Information Quality

Google Book Search has inspired passionate feelings and responses from many people since Google announced the project. Some, like Larry Lessig, view its scanning and indexing of copyrighted books as a legitimate activity under Fair Use. Others, like Siva Vaidhyanathan, are more skeptical of Google Book Search (and in Siva’s case, Google generally).

Either way, there’s no doubt that Google Book Search is a big deal. A key fact to keep in mind is one that Lessig makes repeatedly; namely, that

Google’s “Book Search service” aims to provide access to three kinds of published works: (1) works in the public domain, (2) works in copyright and in print, and (3) works in copyright but no longer in print. As some of you may recall from the presentation I made a while ago, about 16% of books are in category (1); 9% of books are in category (2), and 75% of books are in category (3).

And today there’s been a key advance in determining the often-difficult-to-divine status of whether some books are in category (1) or (3) – also courtesy Google:

For U.S. books published between 1923 and 1963, the rights holder needed to submit a form to the U.S. Copyright Office renewing the copyright 28 years after publication. In most cases, books that were never renewed are now in the public domain. Estimates of how many books were renewed vary, but everyone agrees that most books weren’t renewed. If true, that means that the majority of U.S. books published between 1923 and 1963 are freely usable.

How do you find out whether a book was renewed? You have to check the U.S. Copyright Office records. Records from 1978 onward are online (see http://www.copyright.gov/records) but not downloadable in bulk. The Copyright Office hasn’t digitized their earlier records, but Carnegie Mellon scanned them as part of their Universal Library Project, and the tireless folks at Project Gutenberg and the Distributed Proofreaders painstakingly typed in every word.

Thanks to the efforts of Google software engineer Jarkko Hietaniemi, we’ve gathered the records from both sources, massaged them a bit for easier parsing, and combined them into a single XML file available for download here.

This is, whatever your other feelings are about Google Book Search more generally, a wonderful advance in public accessibility of information. The list of what books are in the public domain can and will be used not just by Google Book Search in its ongoing (and arguably proprietary) book-scanning project, but also by other efforts like Brewster Kahle’s Open Content Alliance. Google comes in for a lot of criticism, but it’s worth acknowledging those times when they follow through on their stated goal of “organizing the world’s information,” and this is one of them.

One of the great challenges/opportunities that we face with digital information is the interface with print and analog information. There’s a danger – implicitly addressed by Book Search and the OCA – that our great knowledge resources from the past are ignored or left to molder, and the difficulty of determining copyright status has been something of a hurdle to digitization efforts thusfar. Recency bias will always be with us, but the possibility of making the great (and undiscovered or underappreciated) works of the past just as accessible to tomorrow’s students as the latest blog post or journal article is a goal to work towards.

–Jacob Kramer-Duffield

Be Sociable, Share!

Pingback by Google Book Search, Orphan Works and the Public Domain « Jacob Kramer-Duffield thinks - June 25, 2008 @ 11:12 am

[…] (cross-posted at Digital Natives) […]

Comment by Nikki Leon - June 25, 2008 @ 11:36 am

One of the upshots of Google Book Search, I’ve noticed, is the way it can be used in classrooms. This past year, one of my professors frequently assigned readings available on Google books and other online articles. This contributed to a “go out and get it” mentality that inspired students to do a lot of independent research online. The Digital Native’s search is no longer confined to a physical library, to the pages of a book, or by the whims of a publisher — and this change is important not simply because it promotes ease of research, but because it inspires the DN *to* research. “It’s out there. Grab it.”

The problem is, of course, that terms like “public domain” and “fair use”… and even copyright itself are poorly explained at all levels of schooling. As a result, (and I speak from my own experience and that of my peers), Digital Natives don’t always know *why* they have access to something, just that they’ve been lucky enough to stumble upon it. Their “its out there, grab it” mentality can therefore be good or bad: if they find things that they do have the rights to use, it broadens their knowledge of the world immensely and contributes to public knowledge; if, however, they stumble on something they’re not supposed to have (pirated items), there are unintended consequences.

Pingback by Google Book Search Adds Copyright Renewal Data - Creative Commons - June 27, 2008 @ 3:52 pm

[…] have maintained their copyright status and which have gone in to the PD. Jakob Kramer-Duffield speaks well to the implications of Google’s efforts in pointing out “there’s a danger […] that our great knowledge […]

Pingback by Googles Book Search « Public Domain - June 29, 2008 @ 5:02 am

[…] Google Book Search, Orphan Works and the Public Domain […]

Pingback by Notebook for the Week of June 23 « nina scaletti - June 30, 2008 @ 4:55 pm

[…] Thoughts and reflections on Google’s Book Search. [Digital Natives] […]

Pingback by Blog-Her » Notebook for the Week of June 23 - July 3, 2008 @ 4:26 pm

Pingback by Stop Press for July 10th | booktwo.org - July 10, 2008 @ 8:30 pm

[…] Digital Natives » Google Book Search, Orphan Works and the Public Domain – “This is, whatever your other feelings are about Google Book Search more generally, a wonderful advance in public accessibility of information.” […]

Comment by ercenk - October 31, 2008 @ 9:31 pm

seslichat,sesli sohbet,sesli chat scprehen sie? kommen sie bitte unsere web seite http://www.bizimsokak.biz und http://www.seslichatbiziz.com

Calendar

Categories

Pages

Blogroll

Links

Google Book Search, Orphan Works and the Public Domain