Inside Google Book Search: U.S. copyright renewal records available for download

Inside Google Book Search

For U.S. books published between 1923 and 1963, the rights holder
needed to submit a form to the U.S. Copyright Office renewing the
copyright 28 years after publication. In most cases,
books that were never renewed are now in the public domain. Estimates
of how many books were renewed vary, but everyone agrees that most
books weren’t renewed. If true, that means that the majority of U.S.
books published between 1923 and 1963 are freely usable.

How do
you find out whether a book was renewed? You have to check the U.S.
Copyright Office records. Records from 1978 onward are online (see http://www.copyright.gov/records)
but not downloadable in bulk. The Copyright Office hasn’t digitized
their earlier records, but Carnegie Mellon scanned them as part of
their Universal Library Project, and the tireless folks at Project Gutenberg and the Distributed Proofreaders painstakingly corrected the OCR.

Thanks
to the efforts of Google software engineer Jarkko Hietaniemi, we’ve
gathered the records from both sources, massaged them a bit for easier
parsing, and combined them into a single XML file available for download here.

Share
Published by
Dennis G. Jerz

Recent Posts

Another corner building. Designed and textured. Needs an interior. #blender3d #design #aesthetics #medievalyork #mysteryplay

Another corner building. Designed and textured. Needs an interior. #blender3d #design #aesthetics #medievalyork #mysteryplay

2 days ago

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

Two years after the release of ChatGPT, it may not be surprising that creative work…

4 days ago

The complex geometry on this wedge building took me all weekend.  #blender3d #medievalyork #mysteryplay #cgi #aesthetics #design

The complex geometry on this wedge building took me all weekend. The interior walls still…

6 days ago