The Open Knowledge Foundation has published a nifty guide on the basics of Finding Interesting Public Domain Works Online. You can skim the guide in well under ten minutes, and it includes useful links and accompanying descriptions to online collections where PD works can be found, including Europeana, the Internet Archive, and Project Gutenberg. It also contains quite a few references to Creative Commons and succinct explanations of the relevant CC tools, such as the Public Domain Mark and the CC0 Public Domain Dedication. The guide, like all articles at The Public Domain Review, is available for reuse under CC BY.1 Comment »
One week after the nuclear disaster at the Fukushima Diachi plant in March, the Safecast project was born to respond to the information needs of Japanese citizens regarding radiation levels in their environment. Safecast, then known as RDTN.org, started a campaign on Kickstarter “to provide an aggregate feed of nuclear radiation data from governmental, non-governmental and citizen-scientist sources.” All radiation data collected via the project would be dedicated to the public domain using CC0, “available to everyone, including scientists and nuclear experts who can provide context for lay people.” Since then, more than 1.25 million data points have been collected and shared; Safecast has been featured on PBS Newshour; and the project aims to expand its scope to mapping the rest of the world.
“Safecast supports the idea that more data – freely available data – is better. Our goal is not to single out any individual source of data as untrustworthy, but rather to contribute to the existing measurement data and make it more robust. Multiple sources of data are always better and more accurate when aggregated.
While Japan and radiation is the primary focus of the moment, this work has made us aware of a need for more environmental data on a global level and the longterm work that Safecast engages in will address these needs. Safecast is based in the US but is currently focused on outreach efforts in Japan. Our team includes contributors from around the world.”
To learn more, visit http://safecast.org. All raw data from the project is available for re-use via the CC0 public domain dedication, while other website content (such as photos and text) are available under CC BY-NC.Comments Off on Safecast: Global sensor network collects and shares radiation data via CC0
Yesterday, Europeana — Europe’s digital library, museum and archive, and the first major adopter of the Public Domain Mark for works in the worldwide public domain — published and made available The Europeana Licensing Framework using the CC0 public domain dedication. The licensing framework encompasses and is a follow-on to the recent Data Exchange Agreement that Europeana adopted in September, and which Europe’s national librarians publicly supported weeks later.
In Europeana’s own words, the licensing framework “underpins Europeana’s Strategic Plan” for 2011-2015:
“The goal of the Europeana Licensing Framework is to standardize and harmonize rights-related information and practices. Its intention is to bring clarity to a complex area, and make transparent the relationship between the end-users and the institutions that provide data.”
“Users need good and reliable information about what they may do with [content]. Whether they can freely re-use it for their educational, creative or even commercial projects or not. The Europeana Licensing Framework therefore asks data providers to provide structured rights information in the metadata they provide about the content that is accessible through Europeana. Doing so makes it easier for users to filter content by the different re-use options they have – by ‘public domain’, for example and hence easier for users to comply with licensing terms.”
The framework supports re-use of data and content through CC legal tools (CC0 public domain dedication, the Public Domain Mark, and CC BY-SA), providing guidelines for their appropriate applications. Download the European Licensing Framework (pdf) or peruse the full set of resources at Europeana Connect.
Relatedly, see Europeana’s white paper no. 2 published last month, The Problem of the Yellow Milkmaid: A Business Model Perspective on Open Metadata (pdf). The white paper “explore[s] in detail the risks and rewards of open data from different perspectives” after “extensive consultation with the heritage sector, including dozens of workshops.” It opens:
1 Comment »
“‘The Milkmaid’, one of Johannes Vermeer’s most famous pieces, depicts a scene of a woman quietly pouring milk into a bowl. During a survey the Rijksmuseum discovered that there were over 10,000 copies of the image on the internet—mostly poor, yellowish reproductions1. As a result of all of these low-quality copies on the web, according to the Rijksmuseum, “people simply didn’t believe the postcards in our museum shop were showing the original painting. This was the trigger for us to put high-resolution images of the original work with open metadata on the web ourselves. Opening up our data is our best defence against the ‘yellow Milkmaid’.”
In other news:
Comments Off on CC News: Metadata for millions of cultural works will be published under CC0
Following the exciting news of Europeana’s new data exchange agreement, which authorizes Europeana to release the metadata for millions of cultural works into the public domain using the CC0 public domain dedication, the Conference of European National Librarians (CENL) voted to support the agreement in a meeting last week at the Royal Library of Denmark. CENL represents Europe’s national libraries and “is responsible for the massive collection of publications that represent the accumulated knowledge of Europe.” From the press release,
“It means that the datasets describing all the millions of books and texts ever published in Europe – the title, author, date, imprint, place of publication and so on, which exists in the vast library catalogues of Europe – will become increasingly accessible for anybody to re-use for whatever purpose they want.
Bruno Racine, new Chair of CENL and President of the Bibliothèque nationale de France, and Dr. Elisabeth Niggemann, former Chair of CENL and Director of the Deutsche Nationalbibliothek, welcomed the leadership shown by CENL. Dr Niggemann said…‘Only in that way can society derive full social and economic benefit from the data that we’ve created to record Europe’s published output over the past 500 years. The best analogy is between bottled water and a water main. Rather than bottling it and branding it, we’re putting data on tap, so that everyone has free and open access, and can use it for whatever purpose they need.’”
Read more about Europeana’s Data Exchange Agreement.1 Comment »
Europeana — Europe’s digital library, museum and archive, and the first major adopter of the Public Domain Mark for works in the worldwide public domain — has adopted a new Data Exchange Agreement. The agreement, which data providers and aggregators will transition to by the end of 2011, authorizes Europeana to release the metadata for millions of cultural works into the public domain using the CC0 public domain dedication. All metadata for cultural works accessible via the Europeana portal, including previously-delivered metadata, will then be available for free and open re-use.
Additionally, the new agreement requires data providers to make best efforts to correctly identify content that is public domain as being public domain. Last October, Europeana announced plans to use the PDM as the standard mark for works free of known copyright that are shared via the Europeana portal, playing an important infrastructural role in the EU’s efforts to ensure that all works shared online are marked with rights information.
Europeana has also published non-binding Usage Guidelines that users of the metadata are asked to follow, including a specific request that users “actively acknowledge and give attribution to all the sources” of the metadata.
This is hugely exciting news for CC and open culture! Read more about the Data Exchange Agreement. Congratulations Europeana on your leadership!Comments Off on Europeana adopts new data exchange agreement, all metadata to be published under CC0
We’re pleased to see the launch of The Public Domain Review. The Review is a website with weekly updates in which scholars, writers, artists, librarians and others present an interesting or curious work (including films, photographs, texts and audio) from the public domain and write short accompanying articles about it that provide background, context, history, or other commentary or criticism. There are already several articles up on the site. The Review is also accepting submissions.
The Public Domain Review aspires to become a bounteous gateway into the whopping plenitude that is the public domain, helping our readers to explore this rich terrain by surfacing unusual and obscure works, and offering fresh reflections and unfamiliar angles on material which is more well known.
The Public Domain Review will highlight public domain materials from Wikimedia Commons, The Internet Archive, Flickr’s The Commons, and other sites. While all the multimedia content featured on the site is in the public domain, the reviews themselves are published under the Creative Commons Attribution license.
Congratulations to editors Adam Green and Jonathan Gray on launching this fascinating site that will share and celebrate the vast wonders of the public domain! You can sign up for updates, or follow on Twitter.Comments Off on Launch of The Public Domain Review website
Mike Masnick at Techdirt asks Does It Make Sense For Governments To Make Their Content Creative Commons… Or Fully Public Domain?
Ideally all Public Sector Information (PSI; government content and data) would be in the public domain — not restricted by copyright or any related rights. Masnick points to the U.S. federal government’s good policy:
nearly all works produced by the [U.S.] federal government automatically go into the public domain, and don’t receive any form of copyright
Unfortunately it is not quite that good: works produced for the U.S. federal government, but not directly by federal government employees or officers are covered by copyright — including works acquired, produced by contractors, and funded by grants. Furthermore, works produced by U.S. federal government employees are only unambiguously free of copyright in the U.S., thus cannot be considered in the public domain worldwide. This is not to say that the U.S. federal government policy is not stellar — relative to policies of other levels of government within the U.S., and those of other governments worldwide, it truly is, to the particular and tremendous benefit of the U.S. people and economy. But we live in a globalized and highly interconnected world now, and even that stellar policy could be improved.
This brings us to another question: how to improve policy around PSI? The status of U.S. federal government works is specified in the U.S. Copyright Act. Crown Copyright is specified in the copyright acts of various commonwealth jurisdictions. Similarly many other jurisdictions’ copyright acts specify the status of and any special limitations and exceptions to copyright for government works. Clearly changing a jurisdiction’s copyright act or otherwise changing its default status for PSI (preferably to public domain) would be most powerful. But they aren’t changes anyone can effect relatively quickly and deterministically (historically opening up a copyright act has led to more restrictive copyright).
In the meantime (presumably many years) there’s a tremendous desire to make government more accessible and unlock the value of content and data that is funded, held, and produced by governments — and existing public sector copyright defaults are recognized as a barrier to achieving these benefits. Especially in the last few years, governments have been implementing their own directives aimed to modernize PSI while some government agencies and politicians look to move more quickly within their remits, and activist citizens push to clear barriers to the potential of “open government” or “government 2.0″ with utmost urgency. This is where government use of a standard public license, usually one of the Creative Commons licenses, makes lots of sense. An agency, province, city or other body that holds copyright or funds the creation of copyrighted works can choose to open its or funded content by releasing under one of the Creative Commons licenses, or if they are really progressive, under the CC0 Public Domain Dedication.
Many governments are using CC tools in just these ways, and we expect that many more will in the coming years. That said, if any do manage to change policy defaults for PSI such that more government content and data is automatically in the public domain — we will be cheering all the way. In fact, we already have a tool for marking and tagging works that are in the public domain worldwide. The CC Public Domain Mark is currently applicable to really old works, but it would be lovely if a government were to decide to by law make all of its content unambiguously public domain, worldwide, thus making the CC Public Domain Mark applicable (of course there is no requirement to use the mark; it is just there for people and institutions that wish to use it to signal to humans and machines the public domain status of a work).
A couple caveats. First, whether they ought to or not, many governments like using copyright to control PSI. Sometimes the desire comes from a good place, e.g, to have the information be used in a way so as to not mislead the public, imply endorsement of the government, or imply that other regulations, e.g., privacy, do not apply. CC licenses have mechanisms to address these concerns where relevant (e.g., attribution to original URL, noting adaptation, non-endorsement) and government licensing frameworks (or non-binding guidelines in the case of the public domain) that explain orthogonal rights and responsibilities (e.g., privacy) but do not create incompatible licenses are key to addressing these concerns.
Second, although as noted above, usually use of any CC license would give the public more rights to PSI than they have now. But, licenses with a NonCommercial or NoDerivatives restriction set the bar too low. Clearly to maximize the value of public sector information, business needs to have access, and to maximize the ability of citizens to do interesting things with content, adaptation needs to be permitted. We strongly prefer governments use fully free/open CC tools — the CC0 Public Domain Dedication and CC Attribution (BY) and Attribution-ShareAlike (BY-SA) licenses. The Definition of Free Cultural Works and Open Knowledge Definition spell out why those tools are preferred in general. We look forward to working with the Open Knowledge Foundation and others to flesh out the specific and even more compelling case for fully free/open PSI.
- Creative Commons and Public Sector Information: Flexible tools to support PSI creators and re-users
- State of Play: Public Sector Information in the United States
- Creative Commons presentation on interoperability and sustainable sharing policy at the Share-PSI.eu workshop on removing the barriers to pan European market for public sector information re-use and all position papers and slides from that workshop.
- The “Licensing” of public sector information paper from LAPSI, the European Thematic Network on Legal Aspects of Public Sector Information.
The basic idea of Creative Commons, offering free copyright tools, is copied from the free software movement. However, CC licenses are not intended to be used to release software, as our FAQ has always said.
One important reason why Creative Commons licenses should not be used to release software is that they aren’t compatible with existing free software licenses, most importantly the GPL from the Free Software Foundation, which is used by over half of free software projects. A commons fractured by legal incompatibilities is a weak commons, and it would be deeply contrary to our mission to fracture the commons of software. (It should also be noted that the FSF helped unfracture the non-software commons by facilitating Wikimedia’s migration to CC BY-SA as the main content license of Wikipedia and its sibling sites.)
While the vast majority of contemporary free software is released under the GPL or another free software license, there is also a long tradition of public domain software, which was free before the term free software existed. Indeed, prior to the 1970s, copyright did not apply to software. Currently, SQLite, an embedded database that you almost certainly use, is probably the most popular software that is dedicated to the public domain.
There are a variety of public domain dedications used to release software, which is mostly not a problem — to the extent such dedications are well-crafted, they don’t present a legal interoperability problem. This means it is possible to improve the state of the art in public domain dedications without harming the ecosystem. (Though this doesn’t mean an infinite variety of public domain dedications is optimal — at the extreme having to determine whether a new dedication is well-crafted each time one encounters a new public domain work would make using public domain works unattractive.)
In addition to licenses, Creative Commons also offers public domain tools. In creating the CC0 public domain dedication, we did set out to improve the state of the art in public domain dedications, and we think we’ve been pretty successful. Users seem to think so — ranging from governments and institutions to musicians.
We hadn’t set out with CC0 to improve on public domain dedications for software. However, since the release of CC0, we’ve been approached a number of times about using CC0 to dedicate software to the public domain. While we were happy to hear of this unanticipated demand, we wanted to tread very carefully so as to not create any unintended consequences for the free software ecosystem. This led to discussions with the Free Software Foundation, the steward of the GPL and moral leader of the free software movement.
We’re really happy to announce that the Free Software Foundation has added CC0 to its free software licenses list (which includes public domain terms). As usual, the FSF’s language is extremely clear, so we simply quote two sections from their list:
CC0 is a public domain dedication from Creative Commons. A work released under CC0 is dedicated to the public domain to the fullest extent permitted by law. If that is not possible for any reason, CC0 also provides a simple permissive license as a fallback. Both public domain works and the simple license provided by CC0 are compatible with the GNU GPL.
If you want to release your work to the public domain, we recommend you use CC0.
If you want to release your work to the public domain, we encourage you to use formal tools to do so. We ask people who make small contributions to GNU to sign a disclaimer form; that’s one solution. If you’re working on a project that doesn’t have formal contribution policies like that, CC0 is a good tool that anyone can use. It formally dedicates your work to the public domain, and provides a fallback license for cases where that is not legally possible.
We’ve also added an entry to the CC0 FAQ about using CC0 to release software, which you ought read if you’d like to do that. If you’re only familiar with the way CC licenses and public domain tools are typically used on web pages and other media, be aware that with free software, the full license (or public domain terms) are usually included with the software. In order to make this easy to do, we’ve taken this opportunity to fulfill a longstanding request — plain text copies of the “legalcode” for CC0 and CC’s six main international licenses. See CC software engineer Chris Webber’s post for details.
Special thanks to Chris Webber and the FSF’s Brett Smith for their persistent work to make the CC0 software recommendation possible.3 Comments »
CERN Library releases its book catalog into the public domain via CC0, and other bibliographic data news
CERN, the European Organization for Nuclear Research that is home to the Large Hadron Collider and birthplace of the web, has released its book catalog into the public domain using the CC0 public domain dedication. This is not the first time that CERN has used CC tools to open its resources; earlier this year, CERN released the first results of the Large Hadron Collider experiments under CC licenses. In addition, CERN is a strong supporter of CC, having given corporate support at the “creator” level, and is currently featured as a CC Superhero in the campaign, where you can join them in the fight for openness and innovation!
Jens Vigen, the head of CERN Library, says in the press release,
“Books should only be catalogued once. Currently the public purse pays for having the same book catalogued over and over again. Librarians should act as they preach: data sets created through public funding should be made freely available to anyone interested. Open Access is natural for us, here at CERN we believe in openness and reuse… By getting academic libraries worldwide involved in this movement, it will lead to a natural atmosphere of sharing and reusing bibliographic data in a rich landscape of so-called mash-up services, where most of the actors who will be involved, both among the users and the providers, will not even be library users or librarians.”
In related news, the Cologne-based libraries have made the 5.4 million bibliographic records they released into the public domain earlier this year, also via CC0, available in various places. See the hbz wiki, lobid.org (and their files on CKAN), and OpenDATA at the Central Library of Sport Sciences of the German Sports University in Cologne. For more information, see the case study.
The German Wikipedia has also used CC0 to dedicate data into the public domain; specifically, their PND-BEACON files are available for download. Since Wikipedia links out to quite a number of external resources, and since a lot of articles link to the same external resources, PND-BEACON files are the German Wikipedia’s way of organizing the various data. “In short a BEACON file contains a 1-to-1 (or 1-to-n) mapping from identifiers to links. Each link consists of at least an URL with optionally a link title and additional information such as the number of resources that are available behind a link.” Learn more from the English description of the project.1 Comment »