After more than two years of hard work, the CC China Mainland 3.0 licenses are ready for use. Congratulations to Chunyan Wang and the entire CC China Mainland team. Thank you to everyone who helped create these licenses, including the community members who participated in the public discussion.
The China Mainland licenses are now available on the CC license chooser. You can learn more about the CC China Mainland team and their work on the CC wiki and at http://creativecommons.net.cn/. The CC China Mainland 3.0 licenses are one of the last 3.0 ports to conclude, with the few other remaining suites expected to be launched prior to publication of the version 4.0 licenses. As announced to affiliates at the CC Global Summit in Warsaw almost a year ago, and reiterated last October and this past February, other than a very few ports then well underway, Creative Commons put the porting process on hold. This has allowed staff and our affiliates to focus more fully on the important work of versioning the license suite. We encourage all affiliates, CC community members and others interested in CC licenses to contribute to the 4.0 discussions currently in progress.1 Comment »
We are pleased to post for public comment the first discussion draft of version 4.0. This draft is the product of an extended (and unprecedented) requirements gathering period involving input from CC affiliates, community and stakeholders. Thanks to all of you who contributed your valuable time and energy in the policy discussions and drafting sessions in support of this draft.
We crafted this first draft (v4.0d1) mindful of the overarching design goals first articulated at the 2011 Global Summit:
- Producing a 4.0 suite that addresses pressing challenges of important adopters, including those in countries where localized version of CC licenses have not existed, and never may, for any number of reasons;
- Maximizing interoperability, reducing license proliferation and promoting standardization where possible; and
- Longevity and ease of use.
We have also been mindful of supporting those for whom version 3.0 is working well. We will continue efforts to ensure those constituents are aware of our support throughout this process and our eagerness to see those implementations thrive.
We’ve documented and discussed all of these at length, and are excited to hear back from our community on how we can still better accomplish these goals. Here are some highlights of the major policy and drafting choices reflected in the draft, as well issues on which we would especially value your input. Join the discussion!
As anticipated, the license fully licenses database rights on the same terms and conditions as copyright and neighboring rights. We have heard no compelling reason for reversing course on this new policy, and all early feedback suggests this is a welcomed change despite questions about their utility. We have taken care to ensure that the license only applies where permission is needed and the licensor holds those rights.
Other copyright-like rights
Rights beyond copyright and neighboring rights are more complicated, however. We know from our community that other sui generis, copyright-like rights exist and more have been or will be proposed. These include press publisher rights in Germany and catalogue rights in Nordic countries. We remain concerned that these “ancillary rights” (the term coined for use in the draft) could undermine or interfere with expected uses of the licensed work, much as sui generis database rights (and their treatment in 3.0 and its ports) have vexed CC licensors and licensees in Europe for years.
We have taken the approach in this first draft of requiring waiver of those ancillary rights, but only if possible and then only to the extent necessary to allow the work to be used as intended under the license. (These ancillary rights do not include the traditional group of rights long excluded from CC licenses and reserved to licensors, such as trademark, privacy and personality rights, and similar.) We look to our community for input on this important policy choice.
Treatment of moral rights is the other central policy issue addressed in this draft. In 3.0 (unported) and a rough majority of the 3.0 ports, moral rights are generally reserved and unaffected by the license. Yet in other ports, those rights are reserved only where they cannot be waived, suggesting the licensor is waiving those rights where possible, and possibly without limitation. The difference is nuanced but not trivial, and merits consideration.
For purposes of this first draft, we have chosen a middle ground: where waiver is possible, a limited waiver (or non assert) is granted to allow the work to be used as otherwise permitted by the license. For all other purposes (or where a waiver or non assert is not permitted), those rights are fully reserved. This proposal draws heavily from the proposal made for 3.01, and is intended to re-start the discussion for 4.0 where that discussion left off. We look forward to hearing the views of our community on this proposal as well.
Proposals under development
A few policy decisions are still under consideration and will benefit from further public discussion before formal proposals are made. To the extent these decisions involve existing terms in 3.0, we have [bracketed] related provisions in the draft. These include technical protection measures and the definition of NonCommercial. On the former, discussion during the requirements gathering period was robust and productive, but not conclusive on any approach. We plan to use a portion of this public discussion period to curate use cases that will inform a formal proposal. Ideally, these use cases will be based on demonstrated needs (or lack thereof) by licensees for a change from the prohibition in 3.0. As for NonCommercial, more discussion is necessary if any of the current proposals or arguments for changing that definition are to be advanced. Consequently, we have left the definition unchanged in this first draft. On both of these issues, look for prompts from us on the license discussion list and this blog, and please contribute your voice to the discussion.
The draft license has several new features deserving of attention and your feedback. Attribution and marking requirements are now centralized in a single location and clarified for ease of understanding and compliance. The collecting society provision is dramatically simplified, though operating in the same spirit as in 3.0. Overall, we have strived to simplify, better organize, internationalize and enhance usability whenever possible. We welcome your ideas for making this license still better in these respects and more.
We need your input!
One of our highest priorities is to ensure to the extent possible that the 4.0 licenses work seamlessly in as many jurisdictions, and for as many constituents, as possible. Please help us identify provisions that could be improved to operate better in your locale and for the communities of CC adopters you care about.
We have updated the 4.0 wiki with a special page dedicated to this first draft, where you can find the full draft of BY-NC-SA and a detailed chart comparing this draft to version 3.0, among other resources. The primary discussion forum continues to be the license-discuss list. We look forward to hearing from you!1 Comment »
You may have heard that data is huge — changing the way science is done, enabling new kinds of consumer and business applications, furthering citizen involvement and government transparency, spawning a new class of software for processing big data and new interdisciplinary class of “data scientists” to help utilize all this data — not to mention metadata (data about data), linked data and the semantic web — there’s a whole lot of data, there’s more every day, and it’s potentially extremely valuable.
Much of the potential value of data is to society at large — more data has the potential to facilitate enhanced scientific collaboration and reproducibility, more efficient markets, increased government and corporate transparency, and overall to speed discovery and understanding of solutions to planetary and societal needs.
A big part of the potential value of data, in particular its society-wide value, is realized by use across organizational boundaries. How does this occur (legally)? Facts themselves are not covered by copyright and related restrictions, though the extent to which this is the case (e.g., for compilations of facts) varies considerably across jurisdictions. Many sites give narrow permission to use data via terms of service. Much ad hoc data sharing occurs among researchers. And increasingly, open data is facilitated by sharing under public terms, e.g. CC licenses or the CC0 public domain dedication.
CC tools, data, and databases
Since soon after the release of version 1.0 of the CC license suite (December, 2002) people have published data and databases under CC licenses. MusicBrainz is an early example (note their recognition that parts of the MusicBrainz database is strictly factual, so in the public domain, while other parts are licensible). Other examples include Freebase, DBpedia (structured information extracted from Wikipedia), OpenStreetMap, and various governments (Australia in particular has been a leader).
More recently CC0 has gained wide use for releasing data into the public domain (to the extent it isn’t already), not only in science, as expected, but also for bibliographic, social media, public sector data, and much more.
With the exception of strongly recommending CC0 (public domain) for scientific data, Creative Commons has been relatively quiet about use of our licenses for data and databases. Prior to coming to the public domain recommendation for scientific data, we published a FAQ on CC licenses and databases, which is still informative. It is important to recognize going forward that the two are complementary: one concerns what ought be done in a particular domain in line with that domain’s tradition (and public funding sources), the other what is possible with respect to CC licenses and databases.
This is/ought distinction is not out of line with CC’s general approach — to offer a range (but not an infinity) of tools to enable sharing, while encouraging use of tools that enable more sharing, in particular where institutional missions and community norms align with more sharing. For a number of reasons, now is a good time to make clear and make sure that our approach to data and databases reflects CC’s general approach rather than an exaggerated caricature:
- We occasionally encounter a misimpression that CC licenses can’t be used for data and databases, or that we don’t want CC licenses to be used for data and databases. This is largely our fault: we haven’t actively communicated about CC licenses and data since the aforementioned FAQ (until very recently), meaning our only message has been “public domain for scientific data” — leaving extrapolation to other fields to the imagination.
- Our consolidation of CC education and science “divisions” has facilitated examinations of domain-specific policies, and increased policy coherence.
- Ongoing work and discussions with CC’s global affiliate network; many CC affiliates are deeply involved in promoting open public sector information, including data.
- The existence and increasing number of users of CC licenses for data and databases (see third paragraph above).
- A sense of overwhelming competitive threat from non-open data; the main alternative to public domain is not sharing at all — absence of a strong CC presence, except for a normative one in science, creates a correspondingly large opportunity cost for society due to “failed sharing” (e.g., under custom, non-interoperable terms) and lack of sharing.
- A long-term shift in understanding of CC’s role: from CC as purveyor of a variety of tools and policies to CC as steward of the commons, and thus need to put global maximization, interoperability and standards before any single tool or policy idea that sounds good on its own, and to encourage (and sometimes push) producers of data and databases to do the same.
- We’ve thought and learned a lot about data and databases and CC’s role in open data. In 2002 data was not central to CC’s programs, now (in keeping with the times), it is.
- Ongoing confusion among providers and users of data about the copyrightability of data (it depends) and rights that may or may not exist as a result of how the data is compiled and distributed — the database.
- Later in 2011 we expect to begin a public requirements process for version 4.0 of our license suite. At the top level, we know that an absolute requirement will be to make sure the 4.0 licenses are the best possible tools (where public domain is not feasible, for whatever reason) for legally sharing data possible.
One other subtlety should be understood with respect to current (3.0) CC licenses. Data and databases are often copyrightable. When licensed under any of our licenses, the license terms apply to copyrightable data and databases, requiring adaptations that are distributed be released under the same or compatible license terms, for example, when a ShareAlike license is used.
Databases are covered by additional rights (sometimes called “sui generis” database rights) in Europe (similar database rights exist in a few other places). A few early (2.0) European jurisdiction CC license “ports” licensed database rights along with copyright. Non-EU jurisdiction and international CC licenses have heretofore been silent on database rights. We adopted a policy that version 3.0 EU jurisdiction ports must waive license requirements and prohibitions (attribution, share-alike, etc) for uses triggering database rights — so that if the use of a database published under a CC license implicated only database rights, but not copyright, the CC license requirements and prohibitions would not apply to that use. The license requirements and prohibitions, however, continued to apply to all uses triggering copyright.
CC licenses other than EU jurisdiction 3.0 ports are silent on database rights: databases and data are licensed (i.e., subject to restrictions detailed in the license) to the extent copyrightable, and if data in the database or the database itself are not copyrightable the license restrictions do not apply to those parts (though they still apply to the remainder). Perhaps this differential handling of database rights is not ideal, given that all CC licenses (including jurisdiction ports) apply worldwide and ought be easily understandable. However, those are not the only requirements for CC tools — they are also intended to be legally valid worldwide (for which they have a good track record) and produce outcomes consistent with our mission.
These requirements mandate the caution with which we approach database rights in our license suite. In particular, database rights are widely recognized to be bad policy, and instance of a general class of additional restrictions that are harmful to the commons, and thus harmful to collaboration, innovation, participation, and the overall health of the Internet, the economy, and society.
If database rights were to be somehow “exported” to non-EU jurisdictions via CC licenses, this would be a bad outcome, contrary not only to our overall mission, but also our policy that CC licenses not effectively introduce restrictions not present by default, e.g., by attempting to make license requirements and prohibitions obviate copyright exceptions and limitations (see “public domain” and “other rights” on our deeds, and the relevant FAQ). Simply licensing database rights, just like copyright, but only to the extent they apply, just like copyright, is an option — but any option we take will be taken very carefully.
What does all this mean right now?
(1) We do recommend CC0 for scientific data — and we’re thrilled to see CC0 used in other domains, for any content and data, wherever the rights holder wants to make clear such is in the public domain worldwide, to the extent that is possible (note that CC0 includes a permissive fallback license, covering jurisdictions where relinquishment is not thought possible).
(2) However, where CC0 is not desired for whatever reason (business requirements, community wishes, institutional policy…) CC licenses can and should be used for data and databases, right now (as they have been for 8 years) — with the important caveat that CC 3.0 license conditions do not extend to “protect” a database that is otherwise uncopyrightable.
(3) We are committed to an open transparent discussion and process around making CC licenses the best possible tools for sharing data (including addressing how they handle database rights), consistent with our overall mission of maximizing the value of the commons, and cognizant of the limitations of voluntary tools such as CC’s in the context of increasingly restrictive policy and overwhelming competitive threat from non-sharing (proprietary data). This will require the expertise of our affiliates and other key stakeholders, including you — we haven’t decided anything yet and will not without taking the time and doing the research that stewards of public infrastructure perform before making changes.
(4) is a corollary of (2) and (3): use CC licenses for data and databases now, participate in the 4.0 process, and upgrade when the 4.0 suite is released, or at least do not foreclose the possibility of doing so.
Regarding discussion — please subscribe to cc-licenses for a very low volume (moderated) list, intended only for specific proposals to improve CC licenses, and announcements of versioning milestones. If you’re interested in a more active, ongoing (unmoderated) discussion, join cc-community. You might also leave a comment on this post or other means of staying in touch. We’re also taking part in a variety of other open data discussions and conferences.
By the way, what is data and what are databases?
Oh right, those questions. I won’t try to answer too seriously, for that would require legal, technical, and philosophical dissertations. All information (including software and “content”) can be thought of as data; more pertinently, data might be limited to (uncopyrightable) facts, or it may include any arrangement of information, e.g., in rows, tables, or graphs, including with (copyrightable) creativity, and creative (copyrightable) arrangements of information. Some kinds of arrangements and collections of information are characterized as databases.
Data and databases might contain what one would think of as content, e.g., prose contained in a database table. Data and databases might be contained in what one would think of as content, e.g., the structured information in Wikipedia, assertions waiting to be extracted from academic papers, and annotated content on the web, intended first for humans, but also structured for computers.
(Note that CC has been very interested in and worked toward standards for mixing content and data — apparently taking off — because such mixing is a good method for ensuring that content and data are kept accurate, in sync, and usable — for example, licensing and attribution information.)
All of this highlights the need for interoperability across “content” and “data”, which means compatible (or the same) legal tools — a good reason for ensuring that CC licenses are the best tools for data, databases and content — indeed a mandate for ensuring this is the case. Thanks in advance for your help (constructive criticism counts, as does simply using our tools — experience is the best guide) in fulfilling this mandate.4 Comments »
Our CC jurisdiction teams are always hard at work on critical license maintenance and version upgrades. Currently, many of these talented local teams are adapting Version 3.0, released February 2007, to the laws and languages of more than 70 jurisdictions around the globe. Joining the jurisdictions that offer licenses at 3.0 is Creative Commons Australia, whose versioning process revealed important insights into the licenses and suggestions for future versions.
The Australian licenses already have their first significant adopter, the Australian Parliament. The Parliament’s central web portal http://www.aph.gov.au houses the most important documents of the Australian Federal Government including all bills, committee reports and, most importantly, the Hansard transcript of Parliamentary Sittings, and the portal will be published under a Creative Commons BY-NC-ND Version 3.0 Australian license.
Thanks so much to the CC Australia team, headed by Professor Brian Fitzgerald and Tom Cochrane and coordinated by Jessica Coates and Elliott Bledsoe at Queensland University of Technology, for their diligence and input. Congratulations!
From CC Australia:
The Australian v3.0 licences…have been developed over the last few years via a public consultation process. We thank all of those who provided feedback on the licences, particularly our colleagues at CC Aoteoroa New Zealand and within the Australian government and non-profit sectors.
Our main aims during the Australian v3.0 drafting process were to ensure that the new licences:
- complied with Australian legal requirements and conventions;
- aligned with the rights and restrictions of the Unported (ie non-country specific) licences provided by Creative Commons; and
- were clear and easy for creators and users alike to read and understand.
Based on these aims, we made the following changes to the licences:
- adapting the Unported formatting and language to bring them more in line with Australian law and drafting conventions – mainly by using localised definitions and introducing lists and headings;
- simplifying some of the language, where this would not affect the legal interpretation of the licence – many of these simplifications were adopted from the recent version put together by our friends in New Zealand;
- a few minor additions to clarify the operation of the licences in the Australian context, in response to feedback from our consultation process – these included clarifying how the licences operate with respect to sublicensing and adding language to ensure that the licences comply with the requirements of Australian consumer protection law.
We are happy to release these licences, which we believe provide clear, reasonable and legally sound options for creators and users alike and represent a new best practice standard for the CC licences in Australia. If you would like any more information about the licences please feel free to contact us at firstname.lastname@example.org. For more information on the versioning process contact the Creative Commons head office.
The Software Freedom Law Show, Episode 0×16 contains numerous bits of interest to CC geeks and is well worth a listen. The show’s hosts, Karen Sandler and Bradley Kuhn of the Software Freedom Law Center, discuss among other things:
- How the GFDL turned out suboptimally — a key point is that developing good public licenses is very hard, the the GFDL was one of the very first for software documentation or other non-software works.
- The migration of Wikipedia and sister projects from GFDL to CC BY-SA, successfully completed this June.
- The importance of public license stewardship by mission-driven nonprofits — Bradley Kuhn’s writing on stewardship has been noted previously on this blog.
- The license used for the show itself, which is CC BY-ND.
- A promise to talk about the public domain and specifically CC0 in a future episode. Looking forward to it.
One quick addendum to the show, in which the hosts wonder if CC has a public versioning process. The answer is yes — see a a list of CC blog posts over the course of development of our 3.0 licenses. The next, eventual versioning will be even more public and rigorous, just as the GPLv3 had a development process far more in depth than that of any public software license that preceded it.Comments Off