On Monday, February 10, at 1.30 pm in room P702 (Paulinum of the University of Leipzig main building at the Augustusplatz), Sebastian Hellmann will present the Natural Language Processing (NLP) Interchange Format (NIF) which is based on a Linked Data enabled URI scheme for identifying elements in (hyper-)texts and an ontology for describing common NLP terms and concepts. During the meeting we will jointly look at the existing tools and infrastructure, collect issues and discuss potential fixes. Bringing a laptop is recommended.
As always, Bachelor and Master students are able to get points for attendance and there is complimentary coffee and cake after the session.
We are currently observing a plethora of Natural Language Processing tools and services being made available. Each of the tools and services has its particular strengths and weaknesses, but exploiting the strengths and synergistically combining different tools is currently an extremely cumbersome and time consuming task. Also, once a particular set of tools is integrated, this integration is not reusable by others. We argue that simplifying the interoperability of different NLP tools performing similar but also complementary tasks will facilitate the comparability of results and the creation of sophisticated NLP applications. In this session, we present the NLP Interchange Format (NIF). NIF is based on a Linked Data enabled URI scheme for identifying elements in (hyper-)texts and an ontology for describing common NLP terms and concepts. In contrast to more centralized solutions such as UIMA and GATE, NIF enables the creation of heterogeneous, distributed and loosely coupled NLP applications, which use the Web as an integration platform. We present several use cases of the second version of the NIF specification (NIF 2.0) and the result of a developer study.