My new bot, @SecretlyPublicDomain, highlights pre-1964 books that entered the public domain because their copyright wasn't renewed.

Until May, it wasn't possible to find these books on a large scale. Now it is:

@leonardr @SecretlyPublicDomain Project Gutenberg has been scanning, proofreading and publishing renewal records for ages. Any archivist armed with a free afternoon, AWK and a willingness to kick ass has been able to create machine-readable versions for yonks.

Also, PGDP, the main feeder site for Project Gutenberg, has been producing TEI versions of its output for ages, though that is probably not as useful as it may sound, because the process encodes printed books, not records.

@branko Project Gutenberg does great work! Their scanned renewal records are one of the bases for NYPL's dataset and, most likely, Stanford's.

The new piece is the machine-readable registration records, which make it possible to find all the books that were copyrighted but _not_ renewed.
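As a rough illustration of that idea, here is a minimal sketch of the "registered but not renewed" lookup as a set difference. The field names (`regnum`, `oreg`, `year`) and the sample records are hypothetical and do not reflect the actual NYPL or Stanford schemas; real records would likely need messier matching than an exact ID comparison.

```python
# Hypothetical sample data; field names and values are invented for illustration.
registrations = [
    {"regnum": "A100001", "title": "First Example Book", "year": 1950},
    {"regnum": "A100002", "title": "Second Example Book", "year": 1951},
    {"regnum": "A100003", "title": "Third Example Book", "year": 1970},
]
renewals = [
    {"oreg": "A100001"},  # "oreg": the original registration being renewed
]


def find_unrenewed(registrations, renewals, cutoff_year=1964):
    """Return pre-cutoff registrations that have no matching renewal."""
    renewed_ids = {r["oreg"] for r in renewals}
    return [
        reg
        for reg in registrations
        if reg["year"] < cutoff_year and reg["regnum"] not in renewed_ids
    ]


unrenewed_titles = [reg["title"] for reg in find_unrenewed(registrations, renewals)]
print(unrenewed_titles)
```

Without the registration list there is nothing to subtract the renewals from, which is why the renewal records alone were not enough.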
