NLP Interchange Format (NIF) 2.0

License, Persistence and Versioning

Document Version: Latest Draft

Summary

The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. NIF consists of specifications, ontologies and software (overview), which are combined under the version identifier "NIF 2.0", but are versioned individually.

This specification regulates the procedure in which the NIF 2.0 standard is advanced. In summary, the following policies are applicable:

  1. License: The NLP2RDF project is interested in a wide spread adoption of NIF. Open licenses are preferred and mandatory for core resources. In case of license incompatability, please contact the project maintainers to either waive licenses or add additional compatible licenses.
  2. Costs: Published specifications of NIF can be implemented free of charge. No royalties are demanded. This also applies for the use of the ontology URIs.
  3. Persistence: The computer center of the 600 year old University of Leipzig kindly provided us with persistent web space for the publication of the NIF standard. This will guarantuee the persistent hosting of the ontologies and the specs. Additionally, we are maintaining short- and medium-term demos und webservices at http://nlp2rdf.org, http://nlp2rdf.lod2.eu and http://lider-project.eu
  4. Versioning: Individual resources have different versioning procedures in place. This document specifies how all NIF resources are versioned.

Persistence

The computer center of the 600 year old University of Leipzig kindly provided us with persistent web space for the publication of the NIF standard. The web space is at http://persistence.uni-leipzig.org/nlp2rdf/ and snapshots of the GitHub repositories will be published on the server in adequate intervalls. The server is hosted and maintained by the University of Leipzig and will continue running much longer than the duration of the LOD2 or LIDER EU projects, which have funded the initial development of NIF. On the server you will be able to find the following subfolders: ontologies, specifications, documents, examples. The runtime of the hosting is virtually unlimited. We would like to be upfront and make the two main risks transparent for users of NIF:

Scenario 1

Three employees of University Leipzig (Sebastian Hellmann, Martin Brümmer and Jens Lehmann) have access to the server. There is a chance that we travel to a conference together in the same vehicle (e.g. airplane). If the vehicle crashes, the server will still be available and NIF will remain exactly as is. Although the files on the server can not be changed easily any more, anybody from the community is free to continue the work on NLP2RDF and develop it further due to its open license.

Scenario 2

The University Leipzig will shut down their computer center. This is totally improbable and since you might be fighting pollution, robots or zombies, persistence of NIF resources might be the least of your worries.

http://nlp2rdf.lod2.eu, http://nlp2rdf.org, http://lider-project.eu and other services

In addition, to the persistent hosting, we will run short to midterm demos on http://nlp2rdf.lod2.eu, http://nlp2rdf.org, http://lider-project.eu. We kindly ask you not to build critical infrastructure on these services and demos. The source code is available and open and you can download the tools and run them on your server.

Versioning

For all resources of NIF 2.0, we try to be consistent with versioning. This is achieved by using http://semver.org as strict as is feasible without weighing down the development process. Since different kinds of resources require different treatment, we distinguish between version numbers and versioning granularity. Furthermore, we are tracking revisions of all resources in GitHub In the following, we define special version numbers and their meaning for different types of resources.

Projectwide

Ontologies

One of the most important issue for ontologies is the persistence of identifiers for the URIs they provide. Ontologies are either versioned as a whole with the version number attached to the ontology URI or each URI is versioned individually. We adopt the following rule: Once the version for an ontology or a URI that appears in an ontology reaches 1.y.z we assume URI persistence, e.g. we will not delete any URI in the ontology anymore. Deprecation is still possible and will be stated via the owl:deprecated property.

Maven

We will provide Maven packages for the ontologies and the software. The version numbers of the Maven jars will follow SemVer.org. Note that we are hosting an Archiva at AKSW, where we deploy the Jars in regular intervalls. Our Maven repository has a high uptime and is repaired quite fast, normally. Please notify the mailing list, if you encounter any errors. It will be maintained for a couple of years, but there are no long term plans. Please use your own Maven repository, if you depend heavily on it. Detailed instruction can be found in the GitHub README.

Specifications

For each specification there is a draft version, which is available via a link at the top of each specification. This draft version is given a version number in appropriate intervals and will thus create a new version of the specification.