Mathématiques et Informatique Appliquées
du Génome à l'Environnement

 

 

OpenMinTeD

Equipe(s)
Agence de moyen
Etat
Titre du projet
Open Mining INfrastructure for TExt and Data
Nom de l'appel d'offre
E-INFRA
Coordinateur.trice
Natalia Manola (ATHENA RESEARCH AND INNOVATION CENTER IN INFORMATION COMMUNICATION & KNOWLEDGE)
Participants de MaIAGE
C. Nédellec, R. Bossy, L. Deléger
Partenaires (hors MaIAGE)
ARC, UNIVERSITY OF MANCHESTER, UKP-TUDA, INRA, EMBL, Agro-Know I.K.E, LIBER, UNIVERSITEIT VAN AMSTERDAM, Open University, EPFL, CNIO, USFD, GESIS, GRNET, Frontiers
Année de démarrage - Année de fin de projet
06/2015-05/2018
Date de fin du projet
Résumé
Recent years witness an upsurge in the quantities of digital research data, offering new insights and opportunities for improved understanding. Text and data mining is emerging as a powerful tool for harnessing the power of structured and unstructured content and data, by analysing them at multiple levels and in several dimensions to discover hidden and new knowledge. However, text mining solutions are not easy to discover and use, nor are they easily combinable by end users. OpenMinTeD aspires to enable the creation of an infrastructure that fosters and facilitates the use of text mining technologies in the scientific publications world, builds on existing text mining tools and platforms, and renders them discoverable and interoperable through appropriate registries and a standards-based interoperability layer, respectively. It supports training of text mining users and developers alike and demonstrates the merits of the approach through several use cases identified by scholars and experts from different scientific areas, ranging from generic scholarly communication to literature related to life sciences, biodiversity and agriculture, and social sciences and humanities. Through its infrastructural activities, OpenMinTeD’s vision is tomake operational a virtuous cycle in which

a) primary content is accessed through standardised interfaces and access rules

b) by well-documented and easily discoverable text mining services that process, analyse, and annotate text

c) to identify patterns and extract new meaningful actionable knowledge, which will be used

d) for structuring, indexing, and searching content and, in tandem,

e) acting as new knowledge useful to draw new relations between content items and firing a new mining cycle.

To achieve its goals, OpenMinTeD brings together different stakeholders, content providers and scientific communities, text mining and infrastructure builders, legal experts, data and computing centres, industrial players, and SMEs.



INRA is mainly involved in



(1) Infrastructure design

Interoperability framework specifications
Platform design and implementation
Platform integration, testing and deployment


(2) Biodiversity and agriculture use cases

Community driven requirements and sustainability
-> Stakeholders in plant development, look at the survey: A text mining based application for cultivated plants

-> Poster at BioCreative on a Knowledge Model for mining Plants Biology literature

Community driven applications implementation and evaluation
Année de soumission
2014