<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sparna Blog &#187; Linked Data</title>
	<atom:link href="https://blog.sparna.fr/category/theme/linked-data/feed/" rel="self" type="application/rss+xml" />
	<link>https://blog.sparna.fr</link>
	<description>Web de données &#124; Architecture de l&#039;information &#124; Accès aux connaissances</description>
	<lastBuildDate>Tue, 03 Jun 2025 10:30:27 +0000</lastBuildDate>
	<language>fr-FR</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
	<item>
		<title>European Parliament Open Data Portal : a SHACL-powered knowledge graph</title>
		<link>https://blog.sparna.fr/2025/04/09/european-parliament-open-data-portal-a-shacl-powered-knowledge-graph/</link>
		<comments>https://blog.sparna.fr/2025/04/09/european-parliament-open-data-portal-a-shacl-powered-knowledge-graph/#comments</comments>
		<pubDate>Wed, 09 Apr 2025 14:10:12 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[Editeurs]]></category>
		<category><![CDATA[FAIR]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Non classé]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[SHACL]]></category>
		<category><![CDATA[SHACL Play]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[DCAT]]></category>
		<category><![CDATA[ELI]]></category>
		<category><![CDATA[google spreadsheets]]></category>
		<category><![CDATA[json-ld]]></category>
		<category><![CDATA[knowledge graph]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[SPARQL]]></category>

		<guid isPermaLink="false">https://blog.sparna.fr/?p=1959</guid>
		<description><![CDATA[<p>A second usecase Thomas wrote for Veronika Heimsbakk’s SHACL for the Practitioner upcoming book is about Sparna&#8217;s work for the European Parliament. From validation of the data in the knowledge graph to further projects of data integration and dissemination, many different usages of SHACL specifications were explored&#8230; &#8230; and more exploratory usages of SHACL are foreseen ! “&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/04/09/european-parliament-open-data-portal-a-shacl-powered-knowledge-graph/">European Parliament Open Data Portal : a SHACL-powered knowledge graph</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>A second usecase Thomas wrote for Veronika Heimsbakk’s <em><a href="https://veronahe.wordpress.com/shacl-for-the-practitioner/">SHACL for the Practitioner</a></em> upcoming book is about Sparna&rsquo;s work for the European Parliament.</p>
<p>From validation of the data in the knowledge graph to further projects of data integration and dissemination, many different usages of SHACL specifications were explored&#8230;</p>
<p>&#8230; and more exploratory usages of SHACL are foreseen !</p>
<h1>“</h1>
<h2><strong>A knowledge-graph powered open data portal</strong></h2>
<p><a href="https://data.europarl.europa.eu/">The European Parliament Open Data Portal (EPODP)</a> went live in January 2023. Its particularity is that it is not a mere aggregation of documents or dump files from business applications in custom formats; but rather a <strong>collection of datasets each extracted from a central semantic knowledge graph</strong>, itself aggregating data migrated from approximately <strong>twenty business applications</strong>. The result is a semantically interoperable open data portal : the semantic of its data model is clearly defined and documented, and reuses widely deployed existing ontologies. It already provides its data to different consumers (most notably <a href="https://www.europarl.europa.eu/">the europarl website</a> and <a href="https://law-tracker.europa.eu/">the EU law tracker</a>) in a context of cross-institutions interoperability. The data captures the activity of the parliament : as co-legislator together with the Council of the EU, the European Parliament (EP) holds plenary sittings, in which reports originating from committees, as well as motion for resolutions, are amended and voted; after the vote, the final adopted texts are published.</p>
<p>The focus on semantic interoperability of EPODP maximizes the potential of reuse and linkage of its datasets, and <strong>maximizes the quality</strong> of the offered data. It comes however at a cost when building the portal : deep analysis and understanding of the existing data and documents structure is required to capture the business semantic. SHACL is the way to formally encode this business semantic &#8211; but how is it deployed in practice ? how is it maintained ? what are the different types of SHACL specifications used ?</p>
<h2><strong>SHACL at the center of a model-driven approach</strong></h2>
<p>SHACL in the EPODP is at the basis of multiple model-driven usages depicted in the following diagram:</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/04/spec-SHACL.png"><img class="aligncenter size-large wp-image-1961" src="https://blog.sparna.fr/wp-content/uploads/2025/04/spec-SHACL-1024x508.png" alt="spec-SHACL" width="650" height="322" /></a></p>
<p>There was two key drivers for introducing the use of SHACL in the EPODP project : <strong>validation of the data</strong> in the knowledge graph, and <strong>generation of public documentations</strong> of the models. The same SHACL specification that captures the business semantic is directly actionable to be published as a documentation and to validate the data. The produced documentation is a set of public files, such as <a href="https://data.europarl.europa.eu/def/eli-ep">the ELI-EP application profile documentation</a> and others accessible from <a href="https://data.europarl.europa.eu/en/developer-corner">the EPODP developer&rsquo;s corner</a>. <a href="https://shacl-play.sparna.fr/play/doc">The SHACL Play documentation generator</a> is used to produce the documentation pages. Data validation happens at earlier stages, after data transformation steps.</p>
<p>Two additional usages of SHACL specifications were explored : one was to generate SPARQL queries to extract the content of datasets from the larger knowledge graph. The SHACL specification of a dataset content is interpreted to generate SPARQL CONSTRUCT queries, executed against the entire knowledge graph, to return a subset of data corresponding to the specification. The query generation was implemented <a href="https://shacl-play.sparna.fr/play/sparql">in SHACL Play</a>, however the EPODP chose to continue using manually crafted SPARQL queries to generate the datasets. The other usage was to complement the SHACL specifications with the mapping rules used to feed the corresponding properties or classes in the graph. This has the advantage that the mapping rules are documented and maintained alongside the specification and not in a separate document. This work is ongoing.</p>
<p>More exploratory usages of SHACL are foreseen : generating a query user interface based on the SHACL specification, <a href="https://docs.sparnatural.eu/how-to-configure-shacl/How-to-configure-Sparnatural-shacl.html">using the <strong>Sparnatural</strong> query builder</a>, and also input forms to facilitate the creation of DCAT datasets descriptions. Additionally, automated generation of the JSON-LD context and the JSON schema of the API are foreseen.</p>
<h2><strong>Not &laquo;&nbsp;1 SHACL to rule them all&nbsp;&raquo;, but application profiles, dataset definitions, and migration specifications</strong></h2>
<p>The definition of the EPODP knowledge graph is not captured in a single SHACL specification, but rather in three different application profiles, each being a selection of classes and properties of one sub-domain : <strong><a href="https://data.europarl.europa.eu/def/eli-ep">ELI-EP</a></strong> covers the description of documents and activities, <strong><a href="https://data.europarl.europa.eu/def/org-ep">ORG-EP</a></strong> covers the definitions of EP organisations (such as committees, political groups, etc.) and members of the parliament, and <strong><a href="https://europarl.github.io/skos-ep">SKOS-EP</a></strong> covers how controlled vocabularies are structured. In addition, <a href="https://data.europarl.europa.eu/def/dcat-ep">DCAT-EP</a> is the specification for how dataset records are described in the EPODP catalog &#8211; but this is not part of the knowledge graph <em>per se</em>.</p>
<p>Together, ELI-EP, ORG-EP and SKOS-EP specify the structure of the entire knowledge graph from which the datasets are extracted. In addition, the structure of each dataset family available in the EPODP (such as adopted texts, plenary documents, parliamentary questions, etc.) is also described in SHACL, referred to as <strong>&laquo;&nbsp;DSD&nbsp;&raquo; for &laquo;&nbsp;Dataset Definition&nbsp;&raquo;</strong>. While the application profiles describe every possible properties on generic shapes, the DSDs will specify only the subset of properties used in a dataset, with possibly different cardinalities or range. For example, ELI-EP specifies that <em>&laquo;&nbsp;a Work may have the property</em><em> </em><em>eli:adopts</em><em>&laquo;&nbsp;</em> (with no minimum cardinality (eli:adopts is defined as <em>&laquo;&nbsp;Indicates that the work represents the adopted work of one or several related works&nbsp;&raquo;</em>). The DSD for adopted texts datasets specifies the shape of &laquo;&nbsp;Adopted texts&nbsp;&raquo; as a subset of the Works, and indicates that the minimum cardinality of eli:adopts is 1 for this particular subset. Besides, some properties, such as eli:amends are not available for adopted texts, thus not declared in the DSD.</p>
<p>In addition, specifications of the conversion of some data sources are also specified in independent SHACL files. The articulations of these 3 kinds of SHACL files and the reused ontologies is depicted in the following diagram:</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/04/3-SHACL-shapes.png"><img class="aligncenter size-large wp-image-1962" src="https://blog.sparna.fr/wp-content/uploads/2025/04/3-SHACL-shapes-1024x603.png" alt="3-SHACL-shapes" width="650" height="383" /></a></p>
<p>There is currently no reuse or reference of shapes across the different specifications. Each is independent. A nice improvement would be to study how SHACL DSDs could be derived from the application profile SHACL, without redeclaring the identical constraints.</p>
<h2><strong>Editing SHACL in spreadsheets</strong></h2>
<p>In total 16 SHACL specifications are currently published in the EPODP, and around 80 are used to validate data migrated from each individual sources. The first step in the specification of each model is the design in a diagram such as the ones visible in the public documentations of the models. The EPODP team is then using spreadsheets to encode the specifications, adapted from the one provided <a href="https://shacl-play.sparna.fr/play/shaclexcel">in the SHACL Play suite</a>. The spreadsheet is converted to SHACL using <a href="https://xls2rdf.sparna.fr/rest/">the xls2rdf converter</a>. <strong>Spreadsheets provide a simple editing solution</strong>, with an easy learning curve, made even easier with a few formulas to compute cell values automatically. It even provides ways for editing advanced patterns (such as the ability to directly turtle lists for sh:or, or blank nodes for property paths), but of course still limits the expressivity. The following screenshot shows how property shapes look like in the spreadsheet:</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/04/properties-ELI.png"><img class="aligncenter size-large wp-image-1963" src="https://blog.sparna.fr/wp-content/uploads/2025/04/properties-ELI-1024x442.png" alt="properties-ELI" width="650" height="281" /></a></p>
<h2></h2>
<h2><strong>Results and future perspectives</strong></h2>
<p>The EPODP use-case shows how SHACL can be applied in a systematic way in a data integration and dissemination project : at the data transformation step, at the knowledge graph level, and at the data dissemination. <strong>Public documentation, data validation, data extraction are tasks that can be be automated based on a SHACL specification</strong>. While the context is one of a large public institution, the same approach can be applied in industrial contexts. The SHACL specifications are a cornerstone of such projects, enabling semantic interoperability at large and a mutual understanding between business experts, data analysts, developers, and data consumers.</p>
<p>&nbsp;</p>
<h1>”</h1>
<p>Veronika&rsquo;s book will be divided into three parts :<br class="html-br" /><br class="html-br" />1. Back to Basics<br class="html-br" />Introduction to logic and RDF, brief skimming of the topics. Also covering various world assumptions.</p>
<p>2. Getting to know the stuff<br class="html-br" />Introduction to SHACL, including core, sh-sparql, advanced features.</p>
<p>3. Working with the stuff<br class="html-br" />SHACL Stories. Use cases, user stories and implementations.</p>
<p><em>Image : © European Union, [2024] &#8211; EP</em></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/04/09/european-parliament-open-data-portal-a-shacl-powered-knowledge-graph/">European Parliament Open Data Portal : a SHACL-powered knowledge graph</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2025/04/09/european-parliament-open-data-portal-a-shacl-powered-knowledge-graph/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Genesis of Sparnatural in the context of the OpenArchaeo platform</title>
		<link>https://blog.sparna.fr/2025/03/28/the-genesis-of-sparnatural-in-the-context-of-the-openarchaeo-platform/</link>
		<comments>https://blog.sparna.fr/2025/03/28/the-genesis-of-sparnatural-in-the-context-of-the-openarchaeo-platform/#comments</comments>
		<pubDate>Fri, 28 Mar 2025 14:54:53 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[FAIR]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Recherche d'informations]]></category>
		<category><![CDATA[Sparnatural]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[Triplestores]]></category>
		<category><![CDATA[CIDOC-CRM]]></category>
		<category><![CDATA[Huma-Num]]></category>
		<category><![CDATA[ResearchSpace]]></category>
		<category><![CDATA[thesaurus]]></category>
		<category><![CDATA[Triplestore]]></category>

		<guid isPermaLink="false">https://blog.sparna.fr/?p=1949</guid>
		<description><![CDATA[<p>The OpenArchaeo platform, developed by French consortium Huma-Num MASAplus (Mémoire des Archéologues et des Sites Archéologiques) together with SPARNA, is a platform dedicated to archaeological data interoperability. This semantic interoperability objective relies on the strong conceptual foundations offered by the CIDOC-CRM data model. Paired with the CIDOC-CRM in a federated way, OpenArchaeo aims at :&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/03/28/the-genesis-of-sparnatural-in-the-context-of-the-openarchaeo-platform/">The Genesis of Sparnatural in the context of the OpenArchaeo platform</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>The <a href="http://openarchaeo.huma-num.fr">OpenArchaeo</a> platform, developed by French <a href="https://www.huma-num.fr/les-consortiums-hn/#MASAplus">consortium Huma-Num MASAplus</a> (Mémoire des Archéologues et des Sites Archéologiques) together with SPARNA, is a platform dedicated to archaeological data interoperability. This semantic interoperability objective relies on the strong conceptual foundations offered by the <a href="https://blog.sparna.fr/2019/03/26/le-cidoc-crm-ne-nous-arrachons-plus-les-cheveux/">CIDOC-CRM</a> data model.</p>
<p>Paired with the CIDOC-CRM in a federated way, OpenArchaeo aims at :</p>
<ul>
<li>making available the archaeological datasets produced by the MASAplus consortium’s partners on the semantic web, in the form of a triplestore with data aligned with the ontology and its extensions dedicated to archaeology ;</li>
<li>providing an intuitive query interface for archaeological data.</li>
</ul>
<p>The latter query interface integrates the Sparnatural knowledge graph exploration component. The UI of this component was heavily inspired by the British Museum&rsquo;s ResearchSpace semantic search feature, as the system proposes the user to build his own queries based on the CIDOC-CRM model underlying the data.</p>
<h2>About ResearchSpace platform</h2>
<p>Initiated in 2009 by a cross-disciplinary team at the British Museum, ResearchSpace is « A full CIDOC-CRM authoring and search system, based on an exhaustive collection of forms that reflects all applicable relationships from the CIDOC CRM ontology. »</p>
<p>Among a wide range of semantic tools to create, manipulate, analyse and visualise data, the platform provides a s<a href="https://documentation.researchspace.org/resource/Help:SemanticSearch">emantic structured search component</a> based on categories and relations.</p>
<p>While open source, ResearchSpace’s code didn’t fit our architecture : we just chose to follow the simple visual elements of ResearchSpace’s query interface to develop our own Sparnatural query builder for OpenArchaeo, and set up a system of icons to identify the main components of the archaeological data.</p>
<p>ResearchSpace has recently (december 2024) released a brand new <a href="https://github.com/researchspace/researchspace/blob/master/release-notes.md">4.0.0 version</a>. This latest can be installed easily and now comes with a default setup of forms based on the CIDOC-CRM. It enables image annotations, knowledge maps creations, semantic narratives writing, timeline productions, and more <a href="https://researchspace.org/semantic-tools/">semantic tools</a>.</p>
<h2>Sparnatural’s first use-case was OpenArchaeo’s CIDOC-CRM model !</h2>
<p><a href="http://openarchaeo.huma-num.fr/explorateur/home">The structure of the knowledge graph of OpenArchaeo</a> relies on the CIDOC-CRM and some of its extension (CRMarchaeo, CRMsci and CRMba). It is a generic model that covers the basic concepts found in most archaeological corpuses (site, operation, structure, feature, wall, burial, stratigraphic unit and artifact).</p>
<p>Here a focus on Class S19 :</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/03/modeleOpenArchaeoEn_base.png"><img class="aligncenter size-full wp-image-1950" src="https://blog.sparna.fr/wp-content/uploads/2025/03/modeleOpenArchaeoEn_base.png" alt="modeleOpenArchaeoEn_base" width="898" height="416" /></a><br />
Several external thesauri were added too for querying the datasets : <a href="https://pactols.frantiq.fr/">PACTOLS thesaurus for archaelogy</a>, but also <a href="https://www.geonames.org/ontology/documentation.html">Geonames</a> and <a href="https://perio.do/technical-overview/">Periodo</a> for spatial and temporal searches.</p>
<p>This way, when users wish to connect two elements (artifact and site for example), the interface automatically suggests the available relationships between these entities, enabling users to formulate their request in a simple way without having to know either the entities and properties of CIDOC CRM, or the structure of the system : the SPARQL queries that correspond to the sentences visually built by users will be automatically computed. In addition, the usage of thesauri allows the users to cross-reference easily multiple datasets through the different widgets proposed in Sparnatural.</p>
<h2><a href="https://github.com/sparna-git/Sparnatural/releases">Get the latest release of Sparnatural !</a></h2>
<p>Since it was created for OpenArchaeo in 2019, Sparnatural UI has been fully redesigned. It now offers a large panel of features, from different <a href="https://docs.sparnatural.eu/widgets.html">widgets for value selection</a> (dropdown lists, ordered by occurrence count or alphabetically, autocomplete search fields, date pickers, tree widgets&#8230;) to brand new <a href="https://docs.sparnatural.eu/result-display.html">result display plugins</a> : the default visualisation is a table of results, but if the results are geolocalized they can be shown in a map. Also grid, stats, pie or bar charts, and a timeline plugin have been made available and documented.</p>
<h2>To go further on OpenArchaeo’s platform &#8230;</h2>
<p><a href="https://www.youtube.com/watch?v=8GW5sirwHJs">See a presentation of the project on the CIDOC Museum Documentation Channel</a><br />
(« Semantic modelling of archaeological data online workshop series »)</p>
<p>The platform : <a href="http://openarchaeo.huma-num.fr/">http://openarchaeo.huma-num.fr/</a></p>
<p>The project : <a href="https://masa.hypotheses.org/openarchaeo">https://masa.hypotheses.org/openarchaeo</a></p>
<p>Read full research paper about the project : <a href="https://ceur-ws.org/Vol-2375/paper1.pdf">https://ceur-ws.org/Vol-2375/paper1.pdf</a></p>
<p><em>Image : Vestiges of a large villa in Courbehaye &laquo;&nbsp;les Deux Muids / le Moulin de Mongé&nbsp;&raquo;, photo Alain Lelong (2003), <a href="https://aerba.huma-num.fr/fiche.html?id=2811401">Atlas des Établissements Ruraux de Beauce Antique</a>, licence </em><a href="https://creativecommons.org/licenses/by-nc-sa/4.0/">CC BY-NC-SA</a></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/03/28/the-genesis-of-sparnatural-in-the-context-of-the-openarchaeo-platform/">The Genesis of Sparnatural in the context of the OpenArchaeo platform</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2025/03/28/the-genesis-of-sparnatural-in-the-context-of-the-openarchaeo-platform/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Retour sur &#8230; Le déploiement de Sparnatural pour FranceArchives</title>
		<link>https://blog.sparna.fr/2025/02/14/retour-sur-le-deploiement-de-sparnatural-pour-francearchives/</link>
		<comments>https://blog.sparna.fr/2025/02/14/retour-sur-le-deploiement-de-sparnatural-pour-francearchives/#comments</comments>
		<pubDate>Fri, 14 Feb 2025 17:35:50 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Non classé]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[Recherche d'informations]]></category>
		<category><![CDATA[Sparnatural]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[Visualisation de données]]></category>
		<category><![CDATA[archives]]></category>
		<category><![CDATA[EAC]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[EAG]]></category>
		<category><![CDATA[RiC-O]]></category>
		<category><![CDATA[schema.org]]></category>

		<guid isPermaLink="false">https://blog.sparna.fr/?p=1860</guid>
		<description><![CDATA[<p>Voilà maintenant près d’un an et demi que FranceArchives, le Portail national des Archives de France, a annoncé le déploiement de l’outil « Supernatural » (comprendre Sparnatural) via ses réseaux, dans l’optique de proposer à ses usagers « un accès nouveau aux métadonnées archivistiques, complémentaire de la recherche classique par le moteur du portail ».&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/02/14/retour-sur-le-deploiement-de-sparnatural-pour-francearchives/">Retour sur &#8230; Le déploiement de Sparnatural pour FranceArchives</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<h1></h1>
<p><span style="font-weight: 400;">Voilà maintenant près d’un an et demi que FranceArchives, le Portail national des Archives de France, <a href="https://francearchives.gouv.fr/fr/requeteurnaturel">a annoncé le déploiement de l’outil « Supernatural »</a> (comprendre <u><a href="https://sparnatural.eu/">Sparnatural</a></u>) via ses réseaux, dans l’optique de proposer à ses usagers « <em>un accès nouveau aux métadonnées archivistiques, complémentaire de la recherche classique par le moteur du portail</em> ».</span></p>
<p><span style="font-weight: 400;"><a href="https://x.com/FranceArchives/status/1706286558385463319"><img class="aligncenter wp-image-1897 size-full" src="https://blog.sparna.fr/wp-content/uploads/2025/02/Supernatural-e1739532443130.jpg" alt="Supernatural" width="400" height="392" /></a></span></p>
<p><span style="font-weight: 400;">Porté par le Service interministériel des Archives de France, le portail FranceArchives offre une recherche fédérée dans près de 26 millions de métadonnées archivistiques produites par près de 170 institutions et entièrement sémantisées en RDF par le biais de l&rsquo;ontologie RiC-O version 0.2 publiée en février 2021.</span></p>
<p>C’est une des premières utilisations de RiC-O à grande échelle (même s’il faudra à l’avenir qu’il se mette à jour sur la <u><a href="https://www.ica.org/resource/records-in-contexts-ontology/">version 1.0 de RiC-O publiée depuis</a></u> !), et c’est également l&rsquo;un des premiers entrepôts de données archivistiques de cette taille sur le Linked Open Data.</p>
<p><span style="font-weight: 400;">&#8230; Un graphe de données qui a tout pour être « Supernaturalisé » <img src="https://blog.sparna.fr/wp-includes/images/smilies/simple-smile.png" alt=":)" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span></p>
<h2><b><i>Des données de qualité à une recherche augmentée</i></b></h2>
<p><span style="font-weight: 400;">&#8230; Enfin, il va surtout s’agir de ses « données de qualité », autrement dit les :</span></p>
<ul>
<li style="font-weight: 400;"><span style="font-weight: 400;">inventaires avec leurs composants,</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">notices descriptives de producteurs d’archives,</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">fiches signalétiques des services d’archives,</span></li>
</ul>
<p><span style="font-weight: 400;"> </span><span style="font-weight: 400;">… tous objets liés à une autorité « </span><a href="https://francearchives.gouv.fr/fr/agents"><span style="font-weight: 400;">personnes et institutions</span></a><span style="font-weight: 400;"> »</span><span style="font-weight: 400;">, </span><span style="font-weight: 400;">« </span><a href="https://francearchives.gouv.fr/fr/locations"><span style="font-weight: 400;">lieux</span></a><span style="font-weight: 400;"> »</span><span style="font-weight: 400;"> et </span><span style="font-weight: 400;">« </span><a href="https://francearchives.gouv.fr/fr/subjects"><span style="font-weight: 400;">thèmes</span></a><span style="font-weight: 400;"> » de qualité (soit moins de 5% des métadonnées du portail avant conversion&#8230; et plus de 70% de l’ensemble du réservoir en RDF !</span><span style="font-weight: 400;">), autorités de qualité elles-mêmes</span><a href="https://francearchives.gouv.fr/fr/article/213604642"> <span style="font-weight: 400;">harmonisées et alignées vers des référentiels nationaux et internationaux</span></a><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">M</span><span style="font-weight: 400;">odèle particulièrement adapté à la description des archives en RDF</span><span style="font-weight: 400;">, c’est l’ontologie </span><span style="font-weight: 400;">RiC-O (v0.2</span><span style="font-weight: 400;">) qui a été utilisée pour la sémantisation des données </span><span style="font-weight: 400;">XML EAD</span><span style="font-weight: 400;"> &#8211; </span><span style="font-weight: 400;">XML EAC-CPF</span><span style="font-weight: 400;"> vers RDF, complétée de</span><a href="https://schema.org/"> <span style="font-weight: 400;">schema.org</span></a><span style="font-weight: 400;"> pour les fiches signalétiques des services de l’annuaire au format </span><span style="font-weight: 400;">XML EAG</span><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">Les informations relatives aux archives et à leurs producteurs étant décrites dans des fichiers différents, la recherche avancée via SPARQL rend désormais possible une interrogation fédérée plus fine d’un vaste corpus de notices en « traversant » le graphe structuré selon le modèle RiC-O. </span><span style="font-weight: 400;">En effet, l’intérêt de l’interrogation via SPARQL est de casser les silos entre types de métadonnées : il permet de faire une recherche transversale entre données provenant de fichiers EAD et de fichier EAC-CPF.</span></p>
<p>Les notices affichées en résultats de recherche montrent les alignements existants vers les notices de producteurs externes, Wikidata, data.bnf, GeoNames ou encore le Thesaurus pour l&rsquo;indexation matières des archives locales. C’est ainsi dans l’onglet Personnes/indexations liées que sont exploités les résultats de la conversion en RDF, par le biais de suggestions de recherches complémentaires sur le portail classique.</p>
<p>Une façon de faire bénéficier le grand public du RDF de manière complètement transparente pour lui !</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/02/VictorHugo.jpg"><img class="aligncenter size-large wp-image-1904" src="https://blog.sparna.fr/wp-content/uploads/2025/02/VictorHugo-1024x547.jpg" alt="VictorHugo" width="650" height="347" /></a></p>
<h2><b><i>Quelques exemples de requêtes&#8230;</i></b></h2>
<p><span style="font-weight: 400;">On accède à l’outil via le menu « Recherche SPARQL » <a href="https://francearchives.gouv.fr/fr/requeteurnaturel">en haut à droite du site du portail</a> :</span></p>
<p><span style="font-weight: 400;"> <a href="https://blog.sparna.fr/wp-content/uploads/2025/02/QueriesFA.jpg"><img class="aligncenter size-large wp-image-1905" src="https://blog.sparna.fr/wp-content/uploads/2025/02/QueriesFA-1024x504.jpg" alt="QueriesFA" width="650" height="320" /></a></span></p>
<p><span style="font-weight: 400;">Plusieurs exemples de requêtes sont à disposition pour explorer les données :</span></p>
<p><span style="font-weight: 400;">De la requête la plus simple :</span></p>
<h5><em><b>Personne est membre de Institution</b></em></h5>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/02/SampleFA.jpg"><img class="aligncenter size-large wp-image-1906" src="https://blog.sparna.fr/wp-content/uploads/2025/02/SampleFA-1024x422.jpg" alt="SampleFA" width="650" height="268" /></a></p>
<p><span style="font-weight: 400;"> </span><span style="font-weight: 400;"> </span></p>
<p><img class="aligncenter size-large wp-image-1928" src="https://blog.sparna.fr/wp-content/uploads/2025/02/query11-1024x375.jpg" alt="query1" width="650" height="238" /></p>
<p><span style="font-weight: 400;">À des requêtes de plus en plus élaborées et complexes, comme ici :</span></p>
<h5><em><b>Lieux qui sont le sujet des archives reliées au fonds « Fabrique de berlingot Eysséric »</b></em></h5>
<p><span style="font-weight: 400;"> <a href="https://blog.sparna.fr/wp-content/uploads/2025/02/SampleFA2.jpg"><img class="aligncenter size-large wp-image-1907" src="https://blog.sparna.fr/wp-content/uploads/2025/02/SampleFA2-1024x717.jpg" alt="SampleFA2" width="650" height="455" /></a></span></p>
<p><span style="font-weight: 400;"> <a href="https://blog.sparna.fr/wp-content/uploads/2025/02/query2.jpg"><img class="aligncenter size-large wp-image-1929" src="https://blog.sparna.fr/wp-content/uploads/2025/02/query2-1024x461.jpg" alt="query2" width="650" height="293" /></a></span></p>
<p><span style="font-weight: 400;">Où l’on voit que l’on peut retracer le cheminement de la requête à travers le graphe de l’ontologie RiC-O en cliquant sur </span><i><span style="font-weight: 400;">« </span></i><i><span style="font-weight: 400;">Afficher/masquer l’éditeur SPARQL</span></i><span style="font-weight: 400;"> ».</span></p>
<h2><b><i>Des archives à la page…</i></b></h2>
<p><span style="font-weight: 400;">À noter que le projet, qui avait fait l’objet d’une présentation à l’occasion de</span><a href="https://swib.org/swib23/slides/07_Fabien%20Amarger_Slides.pdf"> <span style="font-weight: 400;">SWIB (</span><span style="font-weight: 400;">Semantic Web in Libraries</span><span style="font-weight: 400;">)</span></a><span style="font-weight: 400;"> et de</span><a href="https://web.archive.org/web/20240715083647/https://semweb.pro/conference/2023/presentation/francearchives-portail-de-reference-pour-les-archives-francaises/"> <span style="font-weight: 400;">SemWebPro 2023</span></a><span style="font-weight: 400;"> a été entièrement déployé (et configuré !) à partir de la documentation disponible sur le site web de Sparnatural.</span></p>
<p><span style="font-weight: 400;">N’hésitez pas à aller la consulter !</span></p>
<p><a href="https://docs.sparnatural.eu/hello-sparnatural/Hello-Sparnatural.html"><span style="font-weight: 400;">Hello Sparnatural</span></a></p>
<p><a href="https://docs.sparnatural.eu/how-to-configure-shacl/How-to-configure-Sparnatural-shacl.html"><span style="font-weight: 400;">How-to configure in SHACL</span></a></p>
<p><a href="https://docs.sparnatural.eu/widgets.html"><span style="font-weight: 400;">Reference documentation of Sparnatural widgets</span></a></p>
<h2><strong>Pour aller plus loin sur la sémantisation des archives…</strong></h2>
<p>Le déploiement de Sparnatural sur FranceArchives fait suite à une autre réalisation de l&rsquo;année précédente, le <a href="https://sparna-git.github.io/sparnatural-demonstrateur-an/">démonstrateur Sparnatural des Archives nationales</a>. Celui-ci avait permis de faire évoluer Sparnatural et de le déployer sur un graphe sémantique en RiC-O de 20 millions de triplets (hors inférence), alimenté avec le contenu de 1577 instruments de recherche décrivant les archives de 40 des 122 études notariales de Paris conservées aux Archives nationales, de 1120 notices décrivant ces études et les notaires qui y ont exercé, et d&rsquo;autres référentiels des Archives nationales notamment sur les lieux de Paris. La réalisation de ce démonstrateur a été <a href="https://sparna-git.github.io/sparnatural-demonstrateur-an/presentation-fr.html">entièrement documentée en français</a> et en <a href="https://sparna-git.github.io/sparnatural-demonstrateur-an/presentation-en.html">anglais</a>. Ce démonstrateur et ses interfaces évolueront d&rsquo;ailleurs bientôt.</p>
<p>Depuis, Sparna s&rsquo;est impliqué dans le domaine de la sémantisation des archives puisque nous développons également, pour les comptes des Archives Nationales, l’outil <u><a href="https://github.com/ArchivesNationalesFR/rico-converter">Ric-O converter</a></u>.</p>
<p>Celui-ci permet la conversion de notices EAD et EAC vers du RDF exprimé en RiC-O. Nous finalisons actuellement une nouvelle version du convertisseur pour le rendre compatible RiC-O 1.0 (et même 1.1 dont la sortie est imminente).</p>
<p><a href="https://blog.sparna.fr/2020/04/20/rico-records-in-contexts-archives-modele-conceptuel/">Un nouvel article à paraître ici sur RiC-O ?</a> &#8230; Stay tuned !</p>
<p>&nbsp;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/02/14/retour-sur-le-deploiement-de-sparnatural-pour-francearchives/">Retour sur &#8230; Le déploiement de Sparnatural pour FranceArchives</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2025/02/14/retour-sur-le-deploiement-de-sparnatural-pour-francearchives/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nakala : from an RDF dataset to a query UI in minutes &#8211; SHACL automated generation and Sparnatural</title>
		<link>https://blog.sparna.fr/2025/02/06/nakala-from-an-rdf-dataset-to-a-query-ui-in-minutes-shacl-automated-generation-and-sparnatural/</link>
		<comments>https://blog.sparna.fr/2025/02/06/nakala-from-an-rdf-dataset-to-a-query-ui-in-minutes-shacl-automated-generation-and-sparnatural/#comments</comments>
		<pubDate>Thu, 06 Feb 2025 10:38:25 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Recherche d'informations]]></category>
		<category><![CDATA[SHACL]]></category>
		<category><![CDATA[Sparnatural]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[Visualisation de données]]></category>
		<category><![CDATA[dcterms]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[EDM]]></category>
		<category><![CDATA[endpoint]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[UI]]></category>

		<guid isPermaLink="false">https://blog.sparna.fr/?p=1867</guid>
		<description><![CDATA[<p>Here is a usecase of an automated version of Sparnatural submitted as an example for Veronika Heimsbakk&#8217;s SHACL for the Practitioner upcoming book about the Shapes Constraint Language (SHACL). “ The Sparnatural knowledge graph explorer leverages SHACL specifications to drive a user interface (UI) that allows end users to easily discover the content of an RDF graph. What&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/02/06/nakala-from-an-rdf-dataset-to-a-query-ui-in-minutes-shacl-automated-generation-and-sparnatural/">Nakala : from an RDF dataset to a query UI in minutes &#8211; SHACL automated generation and Sparnatural</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>Here is a usecase of an automated version of Sparnatural submitted as an example for Veronika Heimsbakk&rsquo;s <em><a href="https://veronahe.wordpress.com/shacl-for-the-practitioner/">SHACL for the Practitioner</a></em> upcoming book about the Shapes Constraint Language (SHACL).</p>
<h1></h1>
<h1>“</h1>
<p>The <a href="https://sparnatural.eu/">Sparnatural knowledge graph explorer</a> leverages SHACL specifications to drive a user interface (UI) that allows end users to easily discover the content of an RDF graph. What is the best way to make this UI-oriented SHACL specification ? if a SHACL specification for the knowledge graph structure already exists, can it be used directly ? does it require customization ? or is the Sparnatural SHACL spec completely decoupled from an existing knowledge graph spec ? and what if no SHACL spec exists at all ?</p>
<p>We faced all these different situations while deploying Sparnatural, and used various approaches to produce a satisfying end-user oriented specification. In particular, <a href="https://www.nakala.fr/">the Nakala repository</a> is one of the latest graph <a href="https://www.nakala.fr/sparnatural/">for which Sparnatural was deployed</a>. Nakala is a data repository that aims to preserve and disseminate data produced by French research projects in the Humanities and Social Sciences, in compliance with the FAIR principles. Nakala is a service offered by <a href="https://www.huma-num.fr/">Huma-Num</a>, a research infrastructure dedicated to the digital humanities. The Nakala knowledge graph contains `dcterms` metadata provided by researchers to describe the resources they upload. Additional non-dcterms metadata can also be provided. The metadata varies in quality and quantity depending on the researcher. When exposed <a href="https://www.nakala.fr/sparql">in a SPARQL endpoint</a>, resources, collections of resources and agents are described using <a href="https://pro.europeana.eu/page/edm-documentation">the Europeana Data Model (EDM)</a>.</p>
<p>As the EDM dissemination channel for Nakala was new, no SHACL specification existed for it. We could have designed one for Sparnatural from scratch, but the choice was make to generate it automatically, with no human intervention. This was for three reasons : ease of configuration, flexibility in maintenance over time, and pedagogical reason, as it was important to explain the structure of the graph to target users.</p>
<h2>Sparnatural UI</h2>
<p>Let&rsquo;s first have a look at what the Sparnatural UI looks like on an example from Nakala:</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA11.png"><img class="aligncenter size-large wp-image-1883" src="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA11-1024x365.png" alt="SHACLNAKALA1" width="650" height="232" /></a></p>
<p>Once you know that &laquo;&nbsp;ProvidedCHO&nbsp;&raquo; stands for &laquo;&nbsp;Provided Cultural Heritage Object&nbsp;&raquo;, and that &laquo;&nbsp;asWKT&nbsp;&raquo; encodes the location of a Place, you will be able to understand that the query searches for all ProvidedCHO entries gathered into a certain collection (&laquo;&nbsp;Cartes Université Bordeaux Montaigne&nbsp;&raquo; &#8211; a collection of maps), and selects their location and an optional description (and yes, the results of this query are displayed on a map, but that&rsquo;s out of scope).</p>
<h2>SHACL is derived automatically</h2>
<p>In this project we wanted the shortest path from the graph to the query UI. Hence we used <a href="https://shacl-play.sparna.fr/play/generate#documentation">a SHACL generation algorithm, available in SHACL Play</a>. By issuing SPARQL queries on an RDF graph, the algorithm determines the NodeShapes (targeting the classes used as values of `rdf:type`), and PropertyShapes (from all predicates used on instances on each class) of the model, with their node kinds, datatypes, class range, and cardinalities. It generates `sh:or` constraints when multiple datatypes or ranges are found. Note that in the case of Nakala a large variety of ranges are used, since the data comes from very open user inputs : the same `dcterms` property can be either an IRI or a Literal, with varying datatypes.</p>
<p>In addition, the algorithm computes some statistics on the dataset : the number of targets of each NodeShapes, the number of occurrences and the number of distinct values for each property shapes. The statistics are expressed using the `void` vocabulary, and `dcterms:conformsTo` is used to link void partitions to the corresponding shapes.</p>
<p><a href="https://shacl-play.sparna.fr/play/doc">The SHACL Play documentation tool</a> was then used to generate a report of the generated SHACL combined with the statistics. A few errors were spotted in the exported data, and fixed. We also saw that around 70 properties were present only a few times out of 700.000+ ProvidedCHO records. These properties were applied by probably a single or very few researchers when describing their data. It was decided to filter them out to keep the final UI simple, with an extra filtering step : based on statistics, property shapes used less than 0.1% of the number of targets of their node shapes are removed.</p>
<p>Here is a screenshot of the report : the right column shows the number of distinct values, and the column before is the number of total occurrences; we can immediately see that `dct:isReplacedBy` occurs only once, and `dct:isRequiredBy` occurs 81 times. They will be filtered out.</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA21.png"><img class="aligncenter size-large wp-image-1882" src="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA21-1024x481.png" alt="SHACLNAKALA2" width="650" height="305" /></a></p>
<h2>Sparnatural reads SHACL</h2>
<p>Sparnatural can then read <a href="https://docs.sparnatural.eu/SHACL-based-configuration.html">the SPARQL specification</a>, together with the dataset statistics. When designing a query, value selection widgets for literal properties are determined by looking at the `sh:datatype` constraint (for number, dates, boolean, or map widgets). For IRI properties, statistics are used to distinguish between list and autocomplete widgets. Predicates with less than 500 distinct values will use a dropdown list, and those with more will use an autocomplete search field. The range is determined by reading `sh:class` or `sh:node`. The label to show in dropdown lists or to search on autocomplete field is determined by looking at a `dash:propertyRole = dash:LabelRole` annotation.</p>
<p>How about labels ? Sparnatural can read them from classes and properties of the original OWL file, if provided with it. Otherwise local names of target classes or predicates are used.</p>
<h2>Other configuration techniques</h2>
<p>Other Sparnatural deployments, such as <a href="https://sparnatural.eu/demos/demo-dbpedia-en/">the DBPedia demo</a> are designed in SHACL from scratch, <a href="https://docs.sparnatural.eu/how-to-configure-shacl/How-to-configure-Sparnatural-shacl.html">using spreadsheets</a>. This requires more manual work, but has the advantage of tailoring the UI to exactly what needs to be shown, including user-oriented labels/tooltips/icons, hiding some properties, taking shortcuts or declaring inverses using property paths, etc. In the case of DBPedia, no SHACL spec exists, and deriving it automatically for the entire graph would probably not make a lot of sense, hence the necessity for a manual design.</p>
<p>For other projects we are working on a third configuration technique : a SHACL spec that describes the exact content of the graph is first built. It is used to publish the documentation of the model and to validate the data. A separate shapes file containing a Sparnatural-specific configuration layer is then added on top of it. That layer can hide shapes by applying an `sh:deactivated` annotation on them, can specify the UI widgets to use, add additional `dash:LabelRole` flags, add shortcut or inverse properties, etc.</p>
<p>The 3 configuration paths are shown in the following diagram:</p>
<p><a href="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA31.png"><img class="aligncenter size-large wp-image-1881" src="https://blog.sparna.fr/wp-content/uploads/2025/02/SHACLNAKALA31-1024x569.png" alt="SHACLNAKALA3" width="650" height="361" /></a></p>
<h2>Your query UI in minutes</h2>
<p>We combined 4 tools (all open-source) : an algorithm to generate a &laquo;&nbsp;profile&nbsp;&raquo; in SHACL of an RDF dataset, a statistical report generator, a SHACL filter based on statistics, and the Sparnatural query UI. The ability to generate the SHACL profile and review it in the report provided a way to understand the structure of the data in a matter of minutes, while hours would have been necessary with SPARQL queries, without a guarantee of completeness. The provision of the query UI was made by dropping the SHACL file and the statistics to Sparnatural, without manual intervention. This shows the pivotal role of SHACL for data quality and model-driven approaches for knowledge graphs projects.</p>
<h1>”</h1>
<p>We look forward to reading Veronika&rsquo;s book, and you ?</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2025/02/06/nakala-from-an-rdf-dataset-to-a-query-ui-in-minutes-shacl-automated-generation-and-sparnatural/">Nakala : from an RDF dataset to a query UI in minutes &#8211; SHACL automated generation and Sparnatural</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2025/02/06/nakala-from-an-rdf-dataset-to-a-query-ui-in-minutes-shacl-automated-generation-and-sparnatural/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CORDIS : a SPARQL endpoint is born !</title>
		<link>https://blog.sparna.fr/2024/01/15/cordis-a-sparql-endpoint-is-born/</link>
		<comments>https://blog.sparna.fr/2024/01/15/cordis-a-sparql-endpoint-is-born/#comments</comments>
		<pubDate>Mon, 15 Jan 2024 08:55:41 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[Thesaurus]]></category>
		<category><![CDATA[Triplestores]]></category>
		<category><![CDATA[Visualisation de données]]></category>
		<category><![CDATA[endpoint]]></category>
		<category><![CDATA[Europe]]></category>
		<category><![CDATA[Linked Open Data]]></category>
		<category><![CDATA[ontologie]]></category>
		<category><![CDATA[ontologies]]></category>
		<category><![CDATA[thesaurus]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1615</guid>
		<description><![CDATA[<p>Another star to light on EU&#8217;s linked open data maturity flag ! 🌟 Not talking about 2024 exceptional Northern Lights to come, but this one&#8217;s also good news for science ! ➡️ Late 2023, the Publications Office of the European Union announced on social media the public release of the new CORDIS SPARQL endpoint. CORDIS, aka « the Community&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2024/01/15/cordis-a-sparql-endpoint-is-born/">CORDIS : a SPARQL endpoint is born !</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p><span style="font-weight: 400;">Another star to light on EU&rsquo;s linked open data maturity flag ! <img src="https://s.w.org/images/core/emoji/72x72/1f31f.png" alt="🌟" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span></p>
<p><span style="font-weight: 400;">Not talking about 2024 exceptional <a href="https://www.nationalgeographic.com/science/article/auroras-solar-maximum-2024">Northern Lights to come,</a> </span><span style="font-weight: 400;">but this one&rsquo;s also good news for science !</span></p>
<p><span style="font-weight: 400;">➡️ Late 2023, </span><span style="font-weight: 400;">the Publications Office of the European Union</span><span style="font-weight: 400;"> <a href="https://twitter.com/CORDIS_EU/status/1726865540143276079">announced on social media</a> </span><span style="font-weight: 400;">the public release of </span><strong><a href="https://cordis.europa.eu/datalab/sparql-endpoint">the new CORDIS SPARQL endpoint</a></strong><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">CORDIS, aka « </span><span style="font-weight: 400;">t</span><span style="font-weight: 400;">he Community Research and Development Information Service </span><span style="font-weight: 400;">of</span><span style="font-weight: 400;"> the European Commission</span><span style="font-weight: 400;"> », is « </span><em><a href="https://cordis.europa.eu/about"><span style="font-weight: 400;">the [&#8230;] primary source of results from the projects funded by the EU&rsquo;s framework programmes for research and innovation, from FP1 to Horizon Europe</span></a></em><span style="font-weight: 400;"> ». Described as a « <em>rich and structured public repository with all project information held by the European Commission such as project factsheets, participants, reports, deliverables and links to open-access publications</em> », the CORDIS catalog has also been made available in 6 European languages by Publications Office&rsquo;s editorial team.</span></p>
<p><span style="font-weight: 400;">Cherry on top <img src="https://s.w.org/images/core/emoji/72x72/1f352.png" alt="🍒" class="wp-smiley" style="height: 1em; max-height: 1em;" /> of a whole process, the CORDIS SPARQL endpoint release comes to crown a long-term linked open data project. The aim identifying, acquiring, preserving and providing access to knowledge in a common will to share with the widest public possible a trust-worthy, qualified and structured information (see </span><a href="https://op.europa.eu/webpub/op/annual-management-report-2021/en/"><span style="font-weight: 400;">Publications Office 2021 Annual Management Report</span></a><span style="font-weight: 400;">).</span></p>
<p><span style="font-weight: 400;">In the context of the pandemic (and recent opening of <a href="https://data.europa.eu/en">data.europa.eu</a>, the official portal for European data, as defined in 2017–2025 European Open Data Space strategy), </span><a href="https://data.europa.eu/data/datasets/euroscivoc-the-european-science-vocabulary?locale=en"><span style="font-weight: 400;">the EuroSciVoc taxonomy of fields of science</span></a><span style="font-weight: 400;"> was released April 2020, followed December 2021 by the publishing of </span><a href="https://data.europa.eu/data/datasets/european-research-information-ontology?locale=en"><span style="font-weight: 400;">European research information ontology (EURIO)</span></a><span style="font-weight: 400;"> on the EU Vocabularies website <img src="https://s.w.org/images/core/emoji/72x72/1f310.png" alt="🌐" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</span></p>
<p><span style="font-weight: 400;">As presented at </span><a href="https://op.europa.eu/en/web/endorse-2021/conference"><span style="font-weight: 400;">ENDORSE conference March 2021</span></a><span style="font-weight: 400;">, the redesign of CORDIS data-model in accordance with Semantic Web standards contributed to bring the platform « <strong><em>from acting as a data repository to finally playing an active role as data provider</em></strong> », where EuroSciVoc taxonomy &amp; EURIO ontology both played key roles in the creation of future CORDIS knowledge graph and SPARQL endpoint :</span></p>
<p><span style="font-weight: 400;"><img src="https://s.w.org/images/core/emoji/72x72/1f538.png" alt="🔸" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span><em><span style="font-weight: 400;"> <a href="https://data.europa.eu/data/datasets/euroscivoc-the-european-science-vocabulary?locale=en">EuroSciVoc</a> [&#8230;] is a multilingual, SKOS-XL based taxonomy that represents all the main fields of science that were discovered from the CORDIS content, e.g., project abstracts. It was built starting from the hierarchy of the OECD&rsquo;s Fields of R&amp;D classification (FoRD) as root and extended through a semi-automatic process based on NLP techniques. It contains almost 1 000 categories in 6 languages (English, French, German, Italian, Polish and Spanish) and each category is enriched with relevant keywords extracted from the textual description of CORDIS projects. It is constantly evolving and is available on EU Vocabularies website [&#8230;].</span></em></p>
<p><span style="font-weight: 400;"><img src="https://s.w.org/images/core/emoji/72x72/1f538.png" alt="🔸" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span><em><span style="font-weight: 400;"> In order to transform CORDIS data into Linked Open Data, thus aligning with Semantic Web standards, best practices and tools in industry and public organizations, the need for an ontology emerged. CORDIS created the <a href="https://data.europa.eu/data/datasets/european-research-information-ontology?locale=en">EURIO</a> (European Research Information Ontology) based on data about research projects funded by the EU&rsquo;s framework programmes for research and innovation. EURIO is aligned with EU ontologies such as <a href="https://dcodings.github.io/DINGO/">DINGO</a> and <a href="https://github.com/SPAROntologies/frapo">FRAPO</a> and de facto standard ontologies such as schema.org and the Organization Ontology from W3C. It models projects, their results and actors such as people and organizations, and includes administrative information like funding schemes and grants.</span></em></p>
<p><span style="font-weight: 400;"><img src="https://s.w.org/images/core/emoji/72x72/1f449.png" alt="👉" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span><em> EURIO, which is available on EU Vocabularies website, was <strong>the starting point to develop a Knowledge Graph of CORDIS data that will be publicly available via a dedicated SPARQL endpoint</strong>.</em> <em>»</em></p>
<p>(Enrico Bignotti &amp; Baya Remaoun, &laquo;&nbsp;<a href="https://op.europa.eu/en/web/endorse-2021/programme">EuroSciVoc taxonomy and EURIO ontology: CORDIS as (semantic) data provider</a> &nbsp;&raquo; , ENDORSE March 16, 2021. <a href="https://op.europa.eu/documents/10120270/10133951/BIGNOTTI_REMAOUN_presentation_EuroSciVoc+taxonomy+and+EURIO+ontology+CORDIS+as+%28semantic%29+data+provider.pdf/3303e7b9-967d-65f2-23a3-96b3e2bd2856?t=1616568751644"><span style="font-weight: 400;">PDF</span></a><span style="font-weight: 400;"> </span><a href="https://www.youtube.com/watch?v=vIBC_PO5aoM&amp;t=3689s"><span style="font-weight: 400;">VIDEO</span></a>)</p>
<p><span style="font-weight: 400;">&#8230; A Knowledge graph <a href="https://upcommons.upc.edu/bitstream/handle/2117/378291/2022-ISWC.pdf?sequence=1&amp;isAllowed=y">that was soon released in 2022-2023</a></span><span style="font-weight: 400;"> (see INDUSTRY TRACK 1 on Tuesday, 25 October of <a href="http://iswc2022.semanticweb.org/index.php/conference/">ISWC 2022 Conference</a> for more detail), until final opening of a </span><a href="https://cordis.europa.eu/datalab"><span style="font-weight: 400;">CORDIS SPARQL endpoint</span></a><span style="font-weight: 400;"> late november 2023.</span></p>
<p><span style="font-weight: 400;">Now fancy a few SPARQL queries in there ?</span></p>
<p><strong>Follow the SPARQL <img src="https://s.w.org/images/core/emoji/72x72/1f4ab.png" alt="💫" class="wp-smiley" style="height: 1em; max-height: 1em;" /></strong></p>
<p><span style="font-weight: 400;">CORDIS SPARQL endpoint is </span><a href="https://cordis.europa.eu/datalab"><span style="font-weight: 400;">actually made available on CORDIS Datalab</span></a><span style="font-weight: 400;"> (and already referenced in <a href="https://linkedopendata.eu/wiki/The_EU_Knowledge_Graph">EU Knowledge Graph</a> among other European SPARQL endpoints ! <a href="http://tinyurl.com/yn5fsylk">see the query</a> / <a href="http://tinyurl.com/2e8z6y5e">see the results</a>)</span></p>
<p><span style="font-weight: 400;">Here you can access a quick documentation guide to CORDIS Linked Open Data : </span><a href="https://cordis.europa.eu/about/sparql"><span style="font-weight: 400;">https://cordis.europa.eu/about/sparql</span></a><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">Let’s have a look at EURIO ontology first : we need to understand it to query CORDIS knowledge graph.</span></p>
<p><span style="font-weight: 400;">As we are told in the guide, the latest version can be downloaded </span><a href="https://op.europa.eu/en/web/eu-vocabularies/dataset/-/resource?uri=http://publications.europa.eu/resource/dataset/eurio"><span style="font-weight: 400;">on EU Vocabularies website</span></a><span style="font-weight: 400;">. When we unzip</span><span style="font-weight: 400;"> the archive we access the whole documentation about EURIO Classes &amp; properties that we need to write our SPARQL queries – and a diagram of </span><span style="font-weight: 400;">main classes and properties</span><span style="font-weight: 400;"> of CORDIS data model : </span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2024/01/EURIO_v2.4.png"><img class="aligncenter size-large wp-image-1669" src="http://blog.sparna.fr/wp-content/uploads/2024/01/EURIO_v2.4-1024x812.png" alt="EURIO_v2.4" width="650" height="515" /></a></p>
<p><span style="font-weight: 400;">At first sight we can observe on the schema 3 main groups of entities :</span></p>
<ul>
<li style="font-weight: 400;"><span style="font-weight: 400;">On the top right, the projects &amp; publications associated, key ressources of CORDIS ;</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">On the top left, the fundings &amp; grants materials, on « monetary » side of the project ;</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">On the bottom, the organisations &amp; persons implied, with references &amp; coordinates.</span></li>
</ul>
<p><span style="font-weight: 400;">Let’s open </span><a href="https://cordis.europa.eu/datalab/sparql-endpoint"><span style="font-weight: 400;">CORDIS SPARQL endpoint</span></a><span style="font-weight: 400;"> – some easy queries can be run to begin exploring CORDIS knowledge graph.</span></p>
<p><span style="font-weight: 400;">Nb : the data on SPARQL endpoint is a snapshot, but freshest dumps can be found </span><a href="https://data.europa.eu/data/datasets/named-graphs-from-eurio-knowledge-graph?locale=en"><span style="font-weight: 400;">on European data portal</span></a><span style="font-weight: 400;"> !</span></p>
<p><span style="font-weight: 400;">Here a simple one to </span><b>find a list of FundingSchemes with their titles and IDs corresponding to « Horizon 2020 » programme</b><span style="font-weight: 400;"> :</span></p>
<table>
<tbody>
<tr>
<td><strong>FundingSchemes with their titles and IDs corresponding to « Horizon 2020 » programme</strong></p>
<p>PREFIX xsd: &lt;http://www.w3.org/2001/XMLSchema#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
SELECT ?fs ?title ?id<br />
WHERE {<br />
# select all funding schemes …<br />
?fs a eurio:FundingScheme.<br />
# … with their title …<br />
?fs eurio:title ?title.<br />
# … and identifier …<br />
?fs eurio:identifier ?id.<br />
# where the identifier contains the regular expression “H2020”<br />
FILTER (REGEX (?id, &lsquo;H2020&prime;))<br />
} LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20FundingSchemes%20with%20their%20titles%20and%20IDs%20corresponding%20to%20%C2%AB%20Horizon%202020%20%C2%BB%20programme%0A%0APREFIX%20xsd%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2001%2FXMLSchema%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%20%3Ffs%20%3Ftitle%20%3Fid%0AWHERE%20%7B%0A%23%20select%20all%20funding%20schemes%20%E2%80%A6%0A%3Ffs%20a%20eurio%3AFundingScheme.%0A%23%20%E2%80%A6%20with%20their%20title%20%E2%80%A6%0A%3Ffs%20eurio%3Atitle%20%3Ftitle.%0A%23%20%E2%80%A6%20and%20identifier%20%E2%80%A6%0A%3Ffs%20eurio%3Aidentifier%20%3Fid.%0A%23%20where%20the%20identifier%20contains%20the%20regular%20expression%20%E2%80%9CH2020%E2%80%9D%0AFILTER%20(REGEX%20(%3Fid%2C%20'H2020'))%0A%7D%20LIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p><span style="font-weight: 400;">The FILTER REGEX enables us to display the IDs corresponding to H2020 Funding Schemes.</span></p>
<p><span style="font-weight: 400;">We can make another query to get the projects with the Funding Scheme Programme they are related to (note that, in EURIO a eurio:</span><span style="font-weight: 400;">hasFundingSchemeProgramme</span><span style="font-weight: 400;"> is a sub-property of eurio:fundingScheme) :</span></p>
<table>
<tbody>
<tr>
<td><strong>Projects with the Funding Scheme Programme they are related to</strong></p>
<p><span style="font-weight: 400;">PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;</span><br />
<span style="font-weight: 400;">SELECT ?project ?acronym ?fundingscheme</span><br />
<span style="font-weight: 400;">WHERE {</span><br />
# select the projects &#8230;<br />
<span style="font-weight: 400;">?project a eurio:Project.</span><br />
# … with acronyms &#8230;<br />
<span style="font-weight: 400;">?project eurio:hasAcronym/eurio:shortForm ?acronym.</span><br />
# … and corresponding funding scheme programmes<br />
<span style="font-weight: 400;">?project eurio:isFundedBy/eurio:hasFundingSchemeProgramme/eurio:code ?fundingscheme.</span><br />
<span style="font-weight: 400;">} LIMIT 100</span></td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Projects%20with%20the%20Funding%20Scheme%20Programme%20they%20are%20related%20to%0A%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%20%3Fproject%20%3Facronym%20%3Ffundingscheme%0AWHERE%20%7B%0A%23%20select%20the%20projects%20...%0A%3Fproject%20a%20eurio%3AProject.%0A%23%20%E2%80%A6%20with%20acronyms%20...%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%23%20%E2%80%A6%20and%20corresponding%20funding%20scheme%20programmes%0A%3Fproject%20eurio%3AisFundedBy%2Feurio%3AhasFundingSchemeProgramme%2Feurio%3Acode%20%3Ffundingscheme.%0A%7D%20LIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%201&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p><span style="font-weight: 400;">(Here we used a property path with a « / » to shorten the query to get the acronyms of projects &amp; Funding Scheme Programmes codes).</span></p>
<p><span style="font-weight: 400;">&#8230; and combining with the first query we can find the projects depending on H2020 Funding Scheme Programme in particular :</span></p>
<table>
<tbody>
<tr>
<td><strong>Projects depending on H2020 Funding Scheme Programme in particular</strong></p>
<p><span style="font-weight: 400;">PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;</span><br />
<span style="font-weight: 400;">SELECT ?project ?acronym ?fundingscheme</span><br />
<span style="font-weight: 400;">WHERE {</span><br />
# select the projects &#8230;<br />
<span style="font-weight: 400;">?project a eurio:Project.</span><br />
# … with acronyms &#8230;<br />
<span style="font-weight: 400;">?project eurio:hasAcronym/eurio:shortForm ?acronym.</span><br />
# … and corresponding funding scheme programmes codes &#8230;<br />
<span style="font-weight: 400;">?project eurio:isFundedBy/eurio:hasFundingSchemeProgramme/eurio:code ?fundingscheme.</span><br />
# … with a filter on funding scheme codes &lsquo;H2020&prime;<br />
<span style="font-weight: 400;">FILTER REGEX (?fundingscheme, &lsquo;H2020&prime;)</span><br />
<span style="font-weight: 400;">} LIMIT 100</span></td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Projects%20depending%20on%20H2020%20Funding%20Scheme%20Programme%20in%20particular%0A%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%20%3Fproject%20%3Facronym%20%3Ffundingscheme%0AWHERE%20%7B%0A%23%20select%20the%20projects%20...%0A%3Fproject%20a%20eurio%3AProject.%0A%23%20%E2%80%A6%20with%20acronyms%20...%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%23%20%E2%80%A6%20and%20corresponding%20funding%20scheme%20programmes%20codes%20...%0A%3Fproject%20eurio%3AisFundedBy%2Feurio%3AhasFundingSchemeProgramme%2Feurio%3Acode%20%3Ffundingscheme.%0A%23%20%E2%80%A6%20with%20a%20filter%20on%20funding%20scheme%20codes%20'H2020'%0AFILTER%20REGEX%20(%3Ffundingscheme%2C%20'H2020')%0A%7D%20LIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p><span style="font-weight: 400;">It is also possible to get the list of all existing Funding Scheme Programmes CORDIS projects have been funded by – we observe 27 of them here (from the SPARQL endpoint) – while adding a count function to know how many projects per FundingSchemeProgramme :</span></p>
<table>
<tbody>
<tr>
<td><strong>All existing Funding Scheme Programmes CORDIS projects have been funded by</strong></p>
<p><span style="font-weight: 400;">PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span><br />
<span style="font-weight: 400;">PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;</span><br />
# count the number of projects by funding scheme programme &#8230;<br />
<span style="font-weight: 400;">SELECT (COUNT (?project) as ?count) ?fundingscheme</span><br />
<span style="font-weight: 400;">WHERE {</span><br />
# select the projects with corresponding funding scheme programmes codes &#8230;<br />
<span style="font-weight: 400;">?project eurio:isFundedBy/eurio:hasFundingSchemeProgramme/eurio:code ?fundingscheme.</span><br />
# &#8230; counting projects per funding scheme programme<br />
<span style="font-weight: 400;">} GROUP BY ?fundingscheme</span><br />
<span style="font-weight: 400;">LIMIT 100</span></td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20All%20existing%20Funding%20Scheme%20Programmes%20CORDIS%20projects%20have%20been%20funded%20by%0A%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0A%23%20count%20the%20number%20of%20projects%20by%20funding%20scheme%20programme%20...%0ASELECT%20(COUNT%20(%3Fproject)%20as%20%3Fcount)%20%3Ffundingscheme%0AWHERE%20%7B%0A%23%20select%20the%20projects%20with%20corresponding%20funding%20scheme%20programmes%20codes%20...%0A%3Fproject%20eurio%3AisFundedBy%2Feurio%3AhasFundingSchemeProgramme%2Feurio%3Acode%20%3Ffundingscheme.%0A%23%20...%20counting%20projects%20per%20funding%20scheme%20programme%0A%7D%20GROUP%20BY%20%3Ffundingscheme%0ALIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%201&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p>Querying the organisations properties will return other kind of useful informations about geographical location of the projects stakeholders. Let’s figure out we want to find the projects whose coordinating organisations have sites located in France :</p>
<table>
<tbody>
<tr>
<td><strong>Projects whose coordinating organisations have sites located in France <img src="https://s.w.org/images/core/emoji/72x72/1f413.png" alt="🐓" class="wp-smiley" style="height: 1em; max-height: 1em;" /></strong></p>
<p>PREFIX skos: &lt;http://www.w3.org/2004/02/skos/core#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
SELECT ?project ?acronym ?role ?organisation ?country<br />
WHERE {<br />
# select the projects with their acronyms &#8230;<br />
?project a eurio:Project.<br />
?project eurio:hasAcronym/eurio:shortForm ?acronym.<br />
# &#8230; and organisations with &lsquo;coordinator&rsquo; role and name &#8230;<br />
?project eurio:hasInvolvedParty ?organisationrole.<br />
?organisationrole eurio:roleLabel ?role.<br />
?organisationrole eurio:roleLabel &laquo;&nbsp;coordinator&nbsp;&raquo;.<br />
?organisationrole eurio:isRoleOf/eurio:legalName ?organisation.<br />
# &#8230; with address country for the sites defined at &lsquo;FR&rsquo;<br />
?organisationrole eurio:isRoleOf/eurio:hasSite/eurio:hasAddress/eurio:addressCountry ?country.<br />
VALUES ?country { &lsquo;FR&rsquo; }<br />
} LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Projects%20whose%20coordinating%20organisations%20have%20sites%20located%20in%20France%20%F0%9F%90%93%0A%0APREFIX%20skos%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%20%3Fproject%20%3Facronym%20%3Frole%20%3Forganisation%20%3Fcountry%0AWHERE%20%7B%0A%23%20select%20the%20projects%20with%20their%20acronyms%20...%0A%3Fproject%20a%20eurio%3AProject.%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%23%20...%20and%20organisations%20with%20'coordinator'%20role%20and%20name%20...%0A%3Fproject%20eurio%3AhasInvolvedParty%20%3Forganisationrole.%0A%3Forganisationrole%20eurio%3AroleLabel%20%3Frole.%0A%3Forganisationrole%20eurio%3AroleLabel%20%22coordinator%22.%0A%3Forganisationrole%20eurio%3AisRoleOf%2Feurio%3AlegalName%20%3Forganisation.%0A%23%20...%20with%20address%20country%20for%20the%20sites%20defined%20at%20'FR'%0A%3Forganisationrole%20eurio%3AisRoleOf%2Feurio%3AhasSite%2Feurio%3AhasAddress%2Feurio%3AaddressCountry%20%3Fcountry.%0AVALUES%20%3Fcountry%20%7B%20'FR'%20%7D%0A%7D%20LIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p>Depending on available data, you can either query via PostalAddress info (eurio:addressCountry &lsquo;FR&rsquo;) or AdministrativeArea (eurio:hasGeographicalLocation) &#8230; Here we&rsquo;re lucky as both fields are mandatory ones.</p>
<p>Last but not least, we can also play with CORDIS vocabularies : here you&rsquo;ll have the choice to investigate via plain keywords of Projects or Publications items, querying titles, abstracts or other types of literals&#8230;</p>
<p>An example of projects with abstracts containing string ❄ &lsquo;winter&rsquo; ❄ &#8211; the URL giving the exact link to the project online :</p>
<table>
<tbody>
<tr>
<td><strong>Looking for ❄ &lsquo;winter&rsquo; ❄ in CORDIS projects abstracts (with nice URL to go)</strong></p>
<p>PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
SELECT?project ?acronym ?abstract ?url<br />
WHERE {<br />
# select the projects with their acronyms and abstracts &#8230;<br />
?project rdf:type eurio:Project.<br />
?project eurio:hasAcronym/eurio:shortForm ?acronym.<br />
?project eurio:abstract ?abstract.<br />
# &#8230; with a filter on abstracts containing string &lsquo;winter&rsquo; case insensitive &#8230;<br />
FILTER (regex(str(?abstract), &lsquo;winter&rsquo;, &lsquo;i&rsquo;))<br />
# &#8230; generating proper CORDIS website URLs based on RCN project code<br />
?project eurio:rcn ?rcn.<br />
BIND(IRI(CONCAT(&lsquo;https://cordis.europa.eu/project/rcn/&rsquo;, ?rcn)) AS ?url)<br />
} LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Looking%20for%20%E2%9D%84%20'winter'%20%E2%9D%84%20in%20CORDIS%20projects%20abstracts%20(with%20nice%20URL%20to%20go)%0A%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%3Fproject%20%3Facronym%20%3Fabstract%20%3Furl%0AWHERE%20%7B%0A%23%20select%20the%20projects%20with%20their%20acronyms%20and%20abstracts%20...%0A%3Fproject%20rdf%3Atype%20eurio%3AProject.%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%3Fproject%20eurio%3Aabstract%20%3Fabstract.%0A%23%20...%20with%20a%20filter%20on%20abstracts%20containing%20string%20'winter'%20case%20insensitive%20...%0AFILTER%20(regex(str(%3Fabstract)%2C%20'winter'%2C%20'i'))%0A%23%20...%20generating%20proper%20CORDIS%20website%20URLs%20based%20on%20RCN%20project%20code%0A%3Fproject%20eurio%3Arcn%20%3Frcn.%0ABIND(IRI(CONCAT('https%3A%2F%2Fcordis.europa.eu%2Fproject%2Frcn%2F'%2C%20%3Frcn))%20AS%20%3Furl)%0A%7D%20LIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%201&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p>But funniest way will be using EuroSciVoc taxonomy (and navigating through thesaurus hierarchy) : to do so we need to navigate through property &laquo;&nbsp;eurio:hasEuroSciVocClassification&nbsp;&raquo; to get the Concepts skosxl:prefLabel property &#8230; to finally obtain the thesaurus labels (don&rsquo;t forget to choose a prefered language with a FILTER (lang parameter) :</p>
<table>
<tbody>
<tr>
<td><strong>Projects with their associated EuroSciVoc keywords (English prefLabels <img src="https://s.w.org/images/core/emoji/72x72/1f482.png" alt="💂" class="wp-smiley" style="height: 1em; max-height: 1em;" />)</strong></p>
<p>PREFIX skosxl: &lt;http://www.w3.org/2008/05/skos-xl#&gt;<br />
PREFIX skos: &lt;http://www.w3.org/2004/02/skos/core#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
SELECT ?project ?acronym ?ESV<br />
WHERE {<br />
# select the projects with their acronyms &#8230;<br />
?project eurio:hasAcronym/eurio:shortForm ?acronym.<br />
# &#8230; with EuroSciVoc Classification prefLabels &#8230;<br />
?project eurio:hasEuroSciVocClassification/skosxl:prefLabel/skosxl:literalForm ?ESV.<br />
# &#8230; only returning &lsquo;English&rsquo; prefLabels<br />
FILTER (lang(?ESV) = &lsquo;en&rsquo;)<br />
} LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Projects%20with%20their%20associated%20EuroSciVoc%20keywords%20(English%20prefLabels%20%F0%9F%92%82)%0A%0APREFIX%20skosxl%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2008%2F05%2Fskos-xl%23%3E%0APREFIX%20skos%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0ASELECT%20%3Fproject%20%3Facronym%20%3FESV%0AWHERE%20%7B%0A%23%20select%20the%20projects%20with%20their%20acronyms%20...%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%23%20...%20with%20EuroSciVoc%20Classification%20prefLabels%20...%0A%3Fproject%20eurio%3AhasEuroSciVocClassification%2Fskosxl%3AprefLabel%2Fskosxl%3AliteralForm%20%3FESV.%0A%23%20...%20only%20returning%20'English'%20prefLabels%0AFILTER%20(lang(%3FESV)%20%3D%20'en')%0A%7D%20LIMIT%20100%0A&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%203&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p><span style="font-weight: 400;">A bit more complex one using first level of hierarchy of the taxonomy : here we are searching for all skos:broader concepts &laquo;&nbsp;with no other broader concept&nbsp;&raquo; (the FILTER NOT EXISTS formula), aka the top concepts or root concepts of the vocabulary used to describe the projects. Then counting the projects by each category :</span></p>
<table>
<tbody>
<tr>
<td><strong>All root categories of EuroSciVoc used to describe the projects</strong></p>
<p>PREFIX skosxl: &lt;http://www.w3.org/2008/05/skos-xl#&gt;<br />
PREFIX skos: &lt;http://www.w3.org/2004/02/skos/core#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
# count the number of projects by EuroSciVoc top categories &#8230;<br />
SELECT (COUNT(?project) AS ?nbProject) ?ESV_root_label<br />
WHERE {<br />
# &#8230; the top categories are Concepts &#8230;<br />
?ESV_root a skos:Concept .<br />
# &#8230; with no broader Concept &#8230;<br />
FILTER NOT EXISTS { ?ESV_root skos:broader ?anything }<br />
# &#8230; list with corresponding projects &#8230;<br />
?ESV_root ^skos:broader*/^eurio:hasEuroSciVocClassification ?project .<br />
# &#8230; and EuroSciVoc corresponding skos-xl prefLabels &#8230;<br />
?ESV_root skosxl:prefLabel/skosxl:literalForm ?ESV_root_label.<br />
# &#8230; sorting by EuroSciVoc category, with English prefLabels<br />
FILTER (lang(?ESV_root_label) = &lsquo;en&rsquo;)<br />
} GROUP BY ?ESV_root_label<br />
LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20All%20root%20categories%20of%20EuroSciVoc%20used%20to%20describe%20the%20projects%0A%0APREFIX%20skosxl%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2008%2F05%2Fskos-xl%23%3E%0APREFIX%20skos%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0A%23%20count%20the%20number%20of%20projects%20by%20EuroSciVoc%20top%20categories%20...%0ASELECT%20(COUNT(%3Fproject)%20AS%20%3FnbProject)%20%3FESV_root_label%0AWHERE%20%7B%0A%23%20...%20the%20top%20categories%20are%20Concepts%20...%0A%3FESV_root%20a%20skos%3AConcept%20.%0A%23%20...%20with%20no%20broader%20Concept%20...%0AFILTER%20NOT%20EXISTS%20%7B%20%3FESV_root%20skos%3Abroader%20%3Fanything%20%7D%0A%23%20...%20list%20with%20corresponding%20projects%20...%0A%3FESV_root%20%5Eskos%3Abroader*%2F%5Eeurio%3AhasEuroSciVocClassification%20%3Fproject%20.%0A%23%20...%20and%20EuroSciVoc%20corresponding%20skos-xl%20prefLabels%20...%0A%3FESV_root%20skosxl%3AprefLabel%2Fskosxl%3AliteralForm%20%3FESV_root_label.%0A%23%20...%20sorting%20by%20EuroSciVoc%20category%2C%20with%20English%20prefLabels%0AFILTER%20(lang(%3FESV_root_label)%20%3D%20'en')%0A%7D%20GROUP%20BY%20%3FESV_root_label%0ALIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%201&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p>&#8230; and maybe again more explicit results if refined to level 2 of hierarchy <img src="https://s.w.org/images/core/emoji/72x72/1f440.png" alt="👀" class="wp-smiley" style="height: 1em; max-height: 1em;" /> :</p>
<table>
<tbody>
<tr>
<td><strong>All &lsquo;level 2&prime; root categories of EuroSciVoc used to describe the projects</strong></p>
<p>PREFIX skosxl: &lt;http://www.w3.org/2008/05/skos-xl#&gt;<br />
PREFIX skos: &lt;http://www.w3.org/2004/02/skos/core#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
# count the number of projects by EuroSciVoc level 2 top categories &#8230;<br />
SELECT (COUNT(?project) AS ?nbProject) ?ESV_root_label ?ESV_level2_label<br />
WHERE {<br />
# &#8230; the top categories are Concepts &#8230;<br />
?ESV_root a skos:Concept .<br />
# &#8230; with no broader Concept &#8230;<br />
FILTER NOT EXISTS { ?ESV_root skos:broader ?anything }<br />
# &#8230; list level 2 category below level 1 with corresponding projects &#8230;<br />
?ESV_root ^skos:broader ?ESV_level2 .<br />
?ESV_level2 ^skos:broader*/^eurio:hasEuroSciVocClassification ?project .<br />
# &#8230; and EuroSciVoc corresponding skos-xl prefLabels &#8230;<br />
?ESV_root skosxl:prefLabel/skosxl:literalForm ?ESV_root_label.<br />
?ESV_level2 skosxl:prefLabel/skosxl:literalForm ?ESV_level2_label.<br />
# &#8230; sorting by EuroSciVoc category, with English prefLabels<br />
FILTER (lang(?ESV_root_label) = &lsquo;en&rsquo;)<br />
FILTER (lang(?ESV_level2_label) = &lsquo;en&rsquo;)<br />
} GROUP BY ?ESV_root_label ?ESV_level2_label<br />
ORDER BY ?ESV_root_label<br />
LIMIT 100</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20All%20'level%202'%20root%20categories%20of%20EuroSciVoc%20used%20to%20describe%20the%20projects%0A%0APREFIX%20skosxl%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2008%2F05%2Fskos-xl%23%3E%0APREFIX%20skos%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0A%23%20count%20the%20number%20of%20projects%20by%20EuroSciVoc%20level%202%20top%20categories%20...%0ASELECT%20(COUNT(%3Fproject)%20AS%20%3FnbProject)%20%3FESV_root_label%20%3FESV_level2_label%0AWHERE%20%7B%0A%23%20...%20the%20top%20categories%20are%20Concepts%20...%0A%3FESV_root%20a%20skos%3AConcept%20.%0A%23%20...%20with%20no%20broader%20Concept%20...%0AFILTER%20NOT%20EXISTS%20%7B%20%3FESV_root%20skos%3Abroader%20%3Fanything%20%7D%0A%23%20...%20list%20level%202%20category%20below%20level%201%20with%20corresponding%20projects%20...%0A%3FESV_root%20%5Eskos%3Abroader%20%3FESV_level2%20.%0A%3FESV_level2%20%5Eskos%3Abroader*%2F%5Eeurio%3AhasEuroSciVocClassification%20%3Fproject%20.%0A%23%20...%20and%20EuroSciVoc%20corresponding%20skos-xl%20prefLabels%20...%0A%3FESV_root%20skosxl%3AprefLabel%2Fskosxl%3AliteralForm%20%3FESV_root_label.%0A%3FESV_level2%20skosxl%3AprefLabel%2Fskosxl%3AliteralForm%20%3FESV_level2_label.%0A%23%20...%20sorting%20by%20EuroSciVoc%20category%2C%20with%20English%20prefLabels%0AFILTER%20(lang(%3FESV_root_label)%20%3D%20'en')%0AFILTER%20(lang(%3FESV_level2_label)%20%3D%20'en')%0A%7D%20GROUP%20BY%20%3FESV_root_label%20%3FESV_level2_label%0AORDER%20BY%20%3FESV_root_label%0ALIMIT%20100&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p>And a little last one with a count, to enumerate most used EuroSciVoc Concepts for indexing projects :</p>
<table>
<tbody>
<tr>
<td><strong>Most used EuroSciVoc Concepts for indexing projects</strong></p>
<p>PREFIX skosxl: &lt;http://www.w3.org/2008/05/skos-xl#&gt;<br />
PREFIX skos: &lt;http://www.w3.org/2004/02/skos/core#&gt;<br />
PREFIX eurio: &lt;http://data.europa.eu/s66#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;<br />
# count the number of projects by EuroSciVoc Concept &#8230;<br />
SELECT (COUNT (?project) as ?count) ?ESV<br />
WHERE {<br />
#  &#8230; select the projects with their acronyms &#8230;<br />
?project eurio:hasAcronym/eurio:shortForm ?acronym.<br />
# &#8230; with EuroSciVoc Classification prefLabels &#8230;<br />
?project eurio:hasEuroSciVocClassification/skosxl:prefLabel/skosxl:literalForm ?ESV.<br />
# &#8230; sorting by EuroSciVoc Concept, with English prefLabels<br />
FILTER (lang(?ESV) = &lsquo;en&rsquo;)<br />
} GROUP BY ?ESV<br />
ORDER BY DESC(?count)<br />
LIMIT 3000</td>
</tr>
</tbody>
</table>
<p>▶️ <a href="https://cordis.europa.eu/datalab/sparql-endpoint#query=%23%20Most%20used%20EuroSciVoc%20Concepts%20for%20indexing%20projects%0A%0APREFIX%20skosxl%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2008%2F05%2Fskos-xl%23%3E%0APREFIX%20skos%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23%3E%0APREFIX%20eurio%3A%20%3Chttp%3A%2F%2Fdata.europa.eu%2Fs66%23%3E%0APREFIX%20rdf%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23%3E%0APREFIX%20rdfs%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0A%23%20count%20the%20number%20of%20projects%20by%20EuroSciVoc%20Concept%20...%0ASELECT%20(COUNT%20(%3Fproject)%20as%20%3Fcount)%20%3FESV%0AWHERE%20%7B%0A%23%20%20...%20select%20the%20projects%20with%20their%20acronyms%20...%0A%3Fproject%20eurio%3AhasAcronym%2Feurio%3AshortForm%20%3Facronym.%0A%23%20...%20with%20EuroSciVoc%20Classification%20prefLabels%20...%0A%3Fproject%20eurio%3AhasEuroSciVocClassification%2Fskosxl%3AprefLabel%2Fskosxl%3AliteralForm%20%3FESV.%0A%23%20...%20sorting%20by%20EuroSciVoc%20Concept%2C%20with%20English%20prefLabels%0AFILTER%20(lang(%3FESV)%20%3D%20'en')%0A%7D%20GROUP%20BY%20%3FESV%0AORDER%20BY%20DESC(%3Fcount)%0ALIMIT%203000&amp;endpoint=https%3A%2F%2Fcordis.europa.eu%2Fdatalab%2Fsparql&amp;requestMethod=POST&amp;tabTitle=Query%201&amp;headers=%7B%7D&amp;contentTypeConstruct=application%2Fn-triples%2C*%2F*%3Bq&amp;contentTypeSelect=application%2Fsparql-results%2Bjson%2C*%2F*%3Bq&amp;outputFormat=table">See the results</a></p>
<p><img src="https://s.w.org/images/core/emoji/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" />This one an ideal one to generate a word cloud maybe ?</p>
<p>What if we send the CSV data to <a href="https://wordart.com/create">some nice online word cloud generator</a> then ?</p>
<p><a href="https://wordart.com/r8zsokkk7ghu/untitled"><img class="aligncenter size-full wp-image-1710" src="http://blog.sparna.fr/wp-content/uploads/2024/01/Cordis-Taxo-Cloud.png" alt="Cordis Taxo Cloud" width="660" height="757" /></a></p>
<p>(OMG <a href="https://wordart.com/create">they also have a shooting star shape</a> <img src="https://s.w.org/images/core/emoji/72x72/1f320.png" alt="🌠" class="wp-smiley" style="height: 1em; max-height: 1em;" /> in there 🤩)</p>
<p><strong>As a conclusion&#8230;</strong></p>
<p><span style="font-weight: 400;">According to Science (CORDIS saying !), <a href="https://europa.eu/!vYVHXD">New Year’s resolutions appear difficult to be held</a></span><span style="font-weight: 400;">&#8230; because most of time too ambitious, restrictive or unprecisely formulated : indeed, « </span><span style="font-weight: 400;">the </span><a href="https://cordis.europa.eu/article/id/428767-trending-science-do-this-one-thing-to-keep-your-new-year-s-resolutions-research-says"><span style="font-weight: 400;">effectiveness of resolutions depends on how they are </span><b>framed</b></a><span style="font-weight: 400;">.</span><span style="font-weight: 400;"> »</span></p>
<p><span style="font-weight: 400;">Horizon 2024, let’s suggest a(n RDF ?) well-framed one : may CORDIS SPARQL endpoint initiative be an example for other structures who want to share Linked Open Data !</span></p>
<p><span style="font-weight: 400;"><strong>Wishing you Best Interoperability and a Very Merry ✨ Sparqling New Year !</strong> ✨</span></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2024/01/15/cordis-a-sparql-endpoint-is-born/">CORDIS : a SPARQL endpoint is born !</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2024/01/15/cordis-a-sparql-endpoint-is-born/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>2013-2023 : ‘Tis SKOSPlay!’s Birthday !</title>
		<link>https://blog.sparna.fr/2023/03/13/2013-2023-tis-skosplays-birthday/</link>
		<comments>https://blog.sparna.fr/2023/03/13/2013-2023-tis-skosplays-birthday/#comments</comments>
		<pubDate>Mon, 13 Mar 2023 14:28:53 +0000</pubDate>
		<dc:creator><![CDATA[Marie Muller]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[OWL]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[SHACL]]></category>
		<category><![CDATA[SHACL Play]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[SKOS Play]]></category>
		<category><![CDATA[Thesaurus]]></category>
		<category><![CDATA[Visualisation de données]]></category>
		<category><![CDATA[ontologie]]></category>
		<category><![CDATA[skos play]]></category>
		<category><![CDATA[thesaurus]]></category>
		<category><![CDATA[web sémantique]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1540</guid>
		<description><![CDATA[<p>Hi, it’s Marie (aka chutjetweet here). To be short I’m a documentalist, terminologist, old (linked &#8211; open) data maniac &#38; lil’ onto-Padawan and… just came to join Sparna’s team this early January ! To inaugurate my first article on Sparna’s blog, let’s share a little feedback of mine today about Sparna’s well-known SKOSPlay! whose 10 years’ birthday&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2023/03/13/2013-2023-tis-skosplays-birthday/">2013-2023 : ‘Tis SKOSPlay!’s Birthday !</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p><span style="font-weight: 400;">Hi, it’s Marie (aka chutjetweet </span><a href="https://twitter.com/chutjetweet"><span style="font-weight: 400;">here</span></a><span style="font-weight: 400;">). To be short I’m a documentalist, terminologist, old (linked &#8211; open) data maniac &amp; lil’ onto-Padawan and… just came to join Sparna’s team this early January !</span></p>
<p><span style="font-weight: 400;">To inaugurate my first article on Sparna’s blog, let’s share a little feedback of mine today about Sparna’s well-known </span><a href="https://skos-play.sparna.fr/play/"><span style="font-weight: 400;">SKOSPlay!</span></a><span style="font-weight: 400;"> whose 10 years’ birthday is to celebrate this year !</span></p>
<p><span style="font-weight: 400;">10 yo, quite a historic tool ! but more than ever actual in a context where the semantic technologies get front of the scene anew due to growing interest shown by the digital humanities movement to data interoperability projects via the standardized knowledge structuration (Wikipedia-Wikidata projects e.g., as semantic wiki devices), and also due to the last progress of artificial intelligence, now able to processing large amount of data and </span><a href="https://www.epimorphics.com/writing-ontologies-with-chatgpt/"><span style="font-weight: 400;">soon fully leveraging the potential of ontologies and knowledge graphs</span></a><span style="font-weight: 400;">.</span></p>
<table>
<tbody>
<tr>
<td><i><span style="font-weight: 400;"><a href="https://twitter.com/taxobob/status/1615594349512851456"><img class="alignnone wp-image-1561 size-full" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image5.png" alt="image5" width="878" height="321" /></a></span></i><em><span style="font-weight: 400;">From asking for a taxonomy </span><a href="https://twitter.com/mommi84/status/1631027355987656706"><span style="font-weight: 400;">to querying RDF files with an API</span></a><span style="font-weight: 400;">…</span></em></td>
</tr>
</tbody>
</table>
<p><span style="font-weight: 400;">This said, in a more practical way, semantic web standards are not always easy to manipulate as a professional &#8211; if non-initiate to SPARQL and nor confirmed data scientist, and even when you have got to deal with a simple structured list of terms !</span></p>
<p><span style="font-weight: 400;">Either your data is already SKOS-standardized (great !), there sometimes come to have a gap between normalization step and visualization step that requires a bit more technical IT skills. Either &#8211; most of time &#8211; the common muggle-born is to start with a plain Excel spreadsheet, create a list, add some hierarchy, maybe other scope notes or definitions and&#8230; end far puzzled wondering how to </span><span style="font-weight: 400;">get </span><a href="https://www.w3.org/DesignIssues/diagrams/lod/597992118v2_350x350_Back.jpg"><span style="font-weight: 400;">a 5-star data vocabulary</span></a><span style="font-weight: 400;"> ⭐ !</span></p>
<p>&nbsp;</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image14.gif"><img class="alignnone size-full wp-image-1577" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image14.gif" alt="image14" width="500" height="208" /></a></p>
<p>&nbsp;</p>
<p><strong>A SKOSPlay!-within-a-SKOSPlay!</strong></p>
<table>
<tbody>
<tr>
<td><a href="https://twitter.com/belett/status/1583128873877008384"><img class="alignnone wp-image-1565 size-full" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image3.png" alt="image3" width="886" height="359" /></a><em><span style="font-weight: 400;">Wink to <a href="https://twitter.com/belett">@belett</a>, anything possible now with SKOSPlay!</span></em></td>
</tr>
</tbody>
</table>
<p><span style="font-weight: 400;">Aiming at visualizing (and printing !) SKOS thesauri, taxonomies and vocabularies at the very beginning, SKOSPlay! is a full online free and open source tool leveraging semantic technologies (RDF, SPARQL, inference, Linked Data) to generate downloadable HTML or PDF documents. More and more new features have been added since then : alignments display, OWL and SKOS-XL files processing, autocomplete fields and permuted indexes generating …</span></p>
<table>
<tbody>
<tr>
<td><a href="https://twitter.com/veronikaheim/status/1612375648948846592"><img class="alignnone size-full wp-image-1566" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image7.png" alt="image7" width="884" height="237" /></a><em><span style="font-weight: 400;">Hello <a href="https://twitter.com/veronikaheim">@veronikaheim</a>, maybe SKOSPlay! could match your need ?</span></em></td>
</tr>
</tbody>
</table>
<p><span style="font-weight: 400;">… among other nice and useful developments.</span></p>
<p><span style="font-weight: 400;">But as an Excel aficionada, the one that I prefer is </span><a href="https://skos-play.sparna.fr/play/convert?lang=en"><span style="font-weight: 400;">the Excel-to-RDF converter tool</span></a><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">One sheet. One import. One result. Easy-peasy, happy terminologist :))</span></p>
<p><span style="font-weight: 400;">(And you can even keep your custom colors templates and formats !!! </span><span style="font-weight: 400;">🦄</span><span style="font-weight: 400;"> )</span></p>
<p>&nbsp;</p>
<p><strong>Come on &amp; let’s SKOSPlay!</strong></p>
<p><span style="font-weight: 400;">Let&rsquo;s figure out you want to display or construct a small vocabulary you could quickly visualize in a standardized SKOS-structured way :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image10-2.png"><img class="alignnone size-full wp-image-1607" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image10-2.png" alt="image10-2" width="1657" height="1079" /></a></p>
<p><span style="font-weight: 400;">Now to fit in the SKOS model your data has to follow </span><a href="https://skos-play.sparna.fr/play/excel_test/excel2skos-exemple-1.xlsx"><span style="font-weight: 400;">a particular template</span></a><span style="font-weight: 400;"> you can fullfill by downloading on SKOSPlay! website.</span></p>
<p><span style="font-weight: 400;">First you have to define </span><a href="https://skos-play.sparna.fr/play/convert#excel-file-structure"><span style="font-weight: 400;">the header of the template</span></a><span style="font-weight: 400;"> : the global scheme of your vocabulary, its URI, title and description :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image12.png"><img class="alignnone size-full wp-image-1575" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image12.png" alt="image12" width="1347" height="189" /></a></p>
<p><span style="font-weight: 400;">Adding the terms of your list (with the URIs)… Here with the “@en” language indication on top of the column </span><a href="https://skos-play.sparna.fr/play/convert#languages"><span style="font-weight: 400;">as I am to create an English-French multilingual vocabulary</span></a><span style="font-weight: 400;"> :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image15.png"><img class="alignnone size-full wp-image-1578" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image15.png" alt="image15" width="1509" height="1021" /></a></p>
<p><span style="font-weight: 400;">Recreating the arborescent structure through the Excel template (don’t mind my color palette, I always like colouring my Excel sheets to better visualize the info at a glance !).</span></p>
<p><span style="font-weight: 400;">The hierarchical broader-narrower structure is to be recreated </span><a href="https://skos-play.sparna.fr/play/convert#sheet-body"><span style="font-weight: 400;">by adding a “skos:narrower” column</span></a><span style="font-weight: 400;"> (or skos:broader, as you want, with only 1 broader value per line) where you will list the different specific values front of the more generic one (separated by comas). Here I used a PREFIX too in order to shorten my http:// URIs, SKOSPlay! can process them anyway !</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image9.png"><img class="alignnone size-full wp-image-1572" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image9.png" alt="image9" width="1943" height="1431" /></a></p>
<p><span style="font-weight: 400;">Then adding a few notes and other information (multilingual values, skos:notation, any other default properties known in the converter (</span><a href="https://skos-play.sparna.fr/play/convert#prefixes"><span style="font-weight: 400;">see the documentation</span></a><span style="font-weight: 400;">) or different custom elements of yours by adding other PREFIXes :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image4.png"><img class="alignnone size-full wp-image-1569" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image4.png" alt="image4" width="1999" height="1269" /></a></p>
<p><span style="font-weight: 400;">Your Excel template is ready to go ! quite an easy configuration in my demo here, but SKOSPlay! can also deal with skos:Collections, SKOS-XL and other </span><a href="https://skos-play.sparna.fr/play/convert#advanced-features"><span style="font-weight: 400;">advanced RDF structures</span></a><span style="font-weight: 400;"> : blank nodes, RDF lists, named graphs. And now possible to generate OWL and SHACL files with the converter too !</span></p>
<p><span style="font-weight: 400;">Now it’s time to turn your (finally-not-so-dirty-<img src="https://s.w.org/images/core/emoji/72x72/1f438.png" alt="🐸" class="wp-smiley" style="height: 1em; max-height: 1em;" />) data into a SKOS-charming file. Take your favorite </span><del><span style="font-weight: 400;">magic wand</span></del><span style="font-weight: 400;"> SKOSPlay! </span><a href="https://skos-play.sparna.fr/play/convert"><span style="font-weight: 400;">Excel-to-RDF converter tool</span></a><span style="font-weight: 400;"> and load your Excel file in it (adding some optional parameters if needed).</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image8.png"><img class="alignnone size-full wp-image-1571" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image8.png" alt="image8" width="1601" height="885" /></a></p>
<p><span style="font-weight: 400;">Well done, it’s a wonderful RDF-ized vocabulary file (here in a Turtle format but you have also RDF/XML, N-Triples, N-Quads, N3 and TriG available) :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image1.png"><img class="alignnone size-full wp-image-1567" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image1.png" alt="image1" width="1617" height="1029" /></a></p>
<p>&nbsp;</p>
<p><strong>Wingardium Visualiza !</strong></p>
<p>We&rsquo;re almost done. <span style="font-weight: 400;">Go back to the website, </span><a href="https://skos-play.sparna.fr/play/upload"><span style="font-weight: 400;">tab “Play!”</span></a><span style="font-weight: 400;">, load your last RDF-serialized file and go to the next step to chose the kind of display you want to get, endly press (SKOS)Play! and … abracadataaaaaaa !</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image2.png"><img class="alignnone size-full wp-image-1568" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image2.png" alt="image2" width="1657" height="1197" /></a></p>
<p><span style="font-weight: 400;">Many different options to visualize your arborescent data. Tree, static and dynamic, but also more « professional » and printable sorts of displays like alphabetical, hierarchical or permuted views :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image6.png"><img class="alignnone size-full wp-image-1570" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image6.png" alt="image6" width="1999" height="1361" /></a></p>
<p><span style="font-weight: 400;">And KWIC (as for « </span><span style="font-weight: 400;">KeyWord In Context ») :</span></p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2023/03/image13.png"><img class="alignnone size-full wp-image-1576" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image13.png" alt="image13" width="1999" height="937" /></a></p>
<p>&nbsp;</p>
<p><span style="font-weight: 400;">Even possible to load an online Google spreadsheet (</span><a href="https://docs.google.com/spreadsheets/d/1MpN4tzd7S7m7Dnr7IFOz43YoWcSYqUG1/edit?usp=share_link&amp;ouid=118425122592371390359&amp;rtpof=true&amp;sd=true"><span style="font-weight: 400;">mine is shared here</span></a><span style="font-weight: 400;">), just by adapting a little its URL for the converter’s need. Interesting feature in a collaborative purpose when you are team-building a vocabulary !</span></p>
<p><span style="font-weight: 400;">The whole pack </span><a href="https://skos-play.sparna.fr/play/convert#documentation"><span style="font-weight: 400;">fully documented</span></a><span style="font-weight: 400;"> and findable on Sparna’s </span><a href="https://skos-play.sparna.fr/play/"><span style="font-weight: 400;">website</span></a><span style="font-weight: 400;"> &amp; </span><a href="https://github.com/sparna-git/skos-play"><span style="font-weight: 400;">Git</span></a><span style="font-weight: 400;">. Some recent users even produced a </span><a href="http://blog.sparna.fr/2021/06/30/fair-data-collective-is-doing-cool-things-with-skos-play-and-xls2rdf/"><span style="font-weight: 400;">short video tutorial</span></a><span style="font-weight: 400;"> to show what they managed to do with different SKOSPlay! visualization tools.</span></p>
<p><span style="font-weight: 400;">Already knew about SKOSPlay! ? go see his little brother, </span><a href="https://shacl-play.sparna.fr/play/home;jsessionid=40BBB04DA1F3DCC1EBFB084A08EE2924?lang=en"><span style="font-weight: 400;">SHACLPlay!</span></a><span style="font-weight: 400;"> and feel free to give us some feedback in the comments <img src="https://blog.sparna.fr/wp-includes/images/smilies/simple-smile.png" alt=":)" class="wp-smiley" style="height: 1em; max-height: 1em;" /></span></p>
<p><span style="font-weight: 400;">Happy Birthday SKOSPlay! &amp; Long live Semantic Web !</span></p>
<p><span style="font-weight: 400;">A bit more Vouvray with your nougat de Tours ?</span></p>
<p><a href="https://www.my-loire-valley.com/2018/06/top-10-des-specialites-gourmandes-de-touraine/"><img class="alignnone size-full wp-image-1574" src="http://blog.sparna.fr/wp-content/uploads/2023/03/image11.png" alt="image11" width="1280" height="939" /></a></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2023/03/13/2013-2023-tis-skosplays-birthday/">2013-2023 : ‘Tis SKOSPlay!’s Birthday !</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2023/03/13/2013-2023-tis-skosplays-birthday/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Clean JSON(-LD) from RDF using Framing</title>
		<link>https://blog.sparna.fr/2022/07/20/clean-json-ld-from-rdf-using-framing/</link>
		<comments>https://blog.sparna.fr/2022/07/20/clean-json-ld-from-rdf-using-framing/#comments</comments>
		<pubDate>Wed, 20 Jul 2022 06:56:36 +0000</pubDate>
		<dc:creator><![CDATA[Thomas Francart]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[SHACL Play]]></category>
		<category><![CDATA[Framing]]></category>
		<category><![CDATA[json-ld]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1462</guid>
		<description><![CDATA[<p>Say you have a nice RDF knowledge graph based on an ontology, or maybe reusing ontologies, and maybe you have specified the structure of the knowledge graph with SHACL. And now you would like to expose your RDF in JSON in an API, for the average developer (or maybe you would like to produce a&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2022/07/20/clean-json-ld-from-rdf-using-framing/">Clean JSON(-LD) from RDF using Framing</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>Say you have a nice RDF knowledge graph based on an ontology, or maybe reusing ontologies, and maybe you have specified the structure of the knowledge graph with SHACL. And now you would like to <strong>expose your RDF in JSON</strong> in an API, for the average developer (or maybe you would like to produce a clean JSON to be indexed by Elastic). And the average developer (or Elastic) does not care about RDF and does not care about the “-LD” in “JSON-LD”, he just cares about JSON; and he is right ! we are here to care about the “-LD” part for him.</p>
<p>So what you need is to produce a clean JSON structure from your raw RDF triples. And when I mean “clean”, I mean :</p>
<ul>
<li><strong>no URIs</strong>. Nowhere. No URIs in JSON keys, No URIs in types of entities, no URIs in the value of properties controlled by a closed list; the only places where it is acceptable to see a URI are : to give the id of the entities, and when making a reference to such an id of entity within the graph; even in these cases the URIs can be shortened.</li>
<li><strong>no fancy JSON-LD keys</strong> like @type, @value, @datatype, @id, etc.</li>
<li><strong>indented</strong>.</li>
</ul>
<p>You have 2 possibilities to do that :</p>
<ol>
<li>You develop a custom script, to either generate a JSON export of your data, or to implement the API that will query the knowledge graph, parse the triples, and generate that clean JSON output.</li>
<li>You use <strong><a href="https://www.w3.org/TR/json-ld11-framing/">JSON-LD framing</a> to automate the production of a clean JSON(-LD) from RDF</strong>.</li>
</ol>
<p>There are 2 nice things about the solution with JSON-LD framing :</p>
<ol>
<li>it can be automated</li>
<li>you automatically retain the RDF compatibility &#8211; because your JSON will necessarily be JSON-LD. This means you can import your nice JSON directly in a triplestore.</li>
</ol>
<p>The principle of JSON-LD framing is that you provide a <a href="https://www.w3.org/TR/json-ld/#the-context">JSON-LD @context </a>with an additionnal <a href="https://www.w3.org/TR/json-ld11-framing/#framing">frame specification</a> that defines how the JSON should be structured (indented), which entity to include at each level (entities can be filtered based on some criteria), and also which properties to include in each entity.</p>
<p>To start with JSON-LD framing, what you need is JSON-LD. Any JSON-LD. Typically the raw JSON-LD serialization that any RDF library or triplestore will produce; that kind of ugly, messy, full-of-URIs-and-@language kind of JSON. So something like:</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2022/07/Capture-d’écran-du-2022-07-19-17-30-45.png"><img class="aligncenter size-large wp-image-1465" src="http://blog.sparna.fr/wp-content/uploads/2022/07/Capture-d’écran-du-2022-07-19-17-30-45-1024x658.png" alt="Capture d’écran du 2022-07-19 17-30-45" width="650" height="418" /></a></p>
<p><em>(Brrr, scary, no ?)</em></p>
<p>And then what you need is the <a href="https://json-ld.org/playground/">JSON-LD playground</a> with the “Framed” tab. This will allow you to test your context and frame specification.</p>
<p>And when deployed in production, what you will need is a JSON-LD library that is capable of implementing the JSON-LD framing algorithm. Implementations are listed <a href="https://json-ld.org/#developers">here</a>, and you need an implementation compatible with JSON-LD 1.1.</p>
<h2>Example files</h2>
<p>As an example, I use a JSON-LD file from the French National Library, the one from Les Misérables here : <a href="https://data.bnf.fr/fr/13516296/victor_hugo_les_miserables/">https://data.bnf.fr/fr/13516296/victor_hugo_les_miserables/</a> (download link at the bottom of the page).</p>
<p><strong>You can download <a href="http://blog.sparna.fr/wp-content/uploads/2022/07/les_miserables_jsonld_framing_example.zip">the initial JSON example, the frame specification, and the result in a zip</a>.</strong> The zip also contains intermediate frame specifications.</p>
<h2>The @context</h2>
<p>We&rsquo;ll start by specifying the JSON-LD context part.</p>
<h3><b>Map @type to type and @id to id</b></h3>
<p>Average developer will wonder what are those @type and @id keys. Re-map them straight away to type and id:</p>
<blockquote><p><code>"type" : "@type",<br />
"id" : "@id",<br />
</code></p></blockquote>
<p>Schema.org and lot of other specifications do that.</p>
<h3><b>What about @graph ?<br />
</b></h3>
<p>If you have a named graph at the top, introduced by @graph, my suggestion would be to simply remap it to a fixed key, like &laquo;&nbsp;data&nbsp;&raquo;, or &laquo;&nbsp;entities&nbsp;&raquo; :</p>
<blockquote><p><code>"data" : "@graph",<br />
</code></p></blockquote>
<h3><b>Map RDF properties URIs to JSON keys</b></h3>
<p>Get rid of any trace of URI or short URIs in JSON keys. Declare a term for every property in your graph. The simplest way to do this is to use the local part of the URI (after last “#” or “/”) as the term. Order the context by the alphabetical order of the terms. Terms for properties will usually start with a lowercase letter.</p>
<p>In corner cases you may end up with the same term (such as in the example bnf-onto:subject and dcterms:subject), so in that case you need a different key, I chose “bnf-subject” here for bnf-onto:subject and kept “subject” for dcterms:subject.</p>
<blockquote><p><code>"creator" : "dcterms:creator",<br />
"date" : "dcterms:date",<br />
"dateOfWork" : "rdagroup1elements:dateOfWork",<br />
"depiction" : "foaf:depiction",<br />
"description" : "dcterms:description",<br />
</code></p></blockquote>
<h3><b>Map classes URIs to JSON terms</b></h3>
<p>Now you want to do the same thing to get rid of any trace of URIs in the “type” of entities. Declare a term for every class in your ontology/application profile. List the classes in a different section than the properties. Terms for classes will usually start with an uppercase.</p>
<blockquote><p><code>"Concept" : "skos:Concept",<br />
"Document" : "foaf:Document",<br />
"ExpositionVirtuelle" : "bnf-onto:ExpositionVirtuelle",<br />
</code></p></blockquote>
<h3> <b>Declare object properties with “@type”: “@id”</b></h3>
<p>Now you want to get rid of all those ugly “id”, we are only interested in listing the values. To do that, modify the mapping of the property (here “depiction”) to state its values are URIs. You need to change the mapping from</p>
<blockquote><p><code>"depiction" : "foaf:depiction",</code></p></blockquote>
<p>to</p>
<blockquote><p><code>"depiction" : { "@value" : "foaf:depiction", "@type":"@id" },</code></p></blockquote>
<p>And so parts like this :</p>
<blockquote><p><code>"depiction": [<br />
{<br />
"id": "https://gallica.bnf.fr/ark:/12148/btv1b8438568p.thumbnail"<br />
},<br />
{<br />
"id": "https://gallica.bnf.fr/ark:/12148/btv1b9004781d.thumbnail"<br />
},<br />
{<br />
"id": "https://gallica.bnf.fr/ark:/12148/bpt6k5545348q.thumbnail"<br />
}<br />
]</code></p></blockquote>
<p>Will be turned into</p>
<blockquote><p><code> "depiction": [<br />
"https://gallica.bnf.fr/ark:/12148/btv1b8438568p.thumbnail",<br />
"https://gallica.bnf.fr/ark:/12148/btv1b9004781d.thumbnail",<br />
"https://gallica.bnf.fr/ark:/12148/bpt6k5545348q.thumbnail",<br />
"https://gallica.bnf.fr/ark:/12148/btv1b8438570r.thumbnail"<br />
]</code></p></blockquote>
<h3><b>Map datatypes</b></h3>
<p>Now you want to get rid of the @datatype information for literals. If the value of a property always uses the same datatype, which is the case 99,9% of the time, then you can change the mapping from</p>
<blockquote><p><code>"property" : "http://myproperty",</code></p></blockquote>
<p>to</p>
<blockquote><p><code>"property" : { “@id”: "http://myproperty", “@type”:”xsd:date” }</code></p></blockquote>
<p>(The example used does not have datatype properties.)</p>
<h3><b>Map languages, with fixed language or when multilingual</b></h3>
<p>Now let’s get rid of the @language. For this you have 2 choices : when the language is always the same for the value, you can indicate it in the context, the same way that you would do for the datatype but with the @language key. So you change from</p>
<blockquote><p><code>"description" : "dcterms:description",</code></p></blockquote>
<p>to</p>
<blockquote><p><code>"description" : { “@id” : "dcterms:description", “@language” : “fr” }</code></p></blockquote>
<p>You could even have different terms for different languages, such as :</p>
<blockquote><p><code>"title_fr" : { "@id" : "dcterms:title", "@language" : "fr" },<br />
"title_en" : { "@id" : "dcterms:title", "@language" : "en" },<br />
"title" : { "@id" : "dcterms:title" },</code></p></blockquote>
<p>or when you have multilingual multiple values, you can make the property a language map by declaring it this way:</p>
<blockquote><p><code>"editorialNote" : { "@id" : "skos:editorialNote", "@container" : "@language" },</code></p></blockquote>
<p>Which will turn the language code as a key in the JSON output:</p>
<blockquote><p><code>"editorialNote": {<br />
"fr": [<br />
"BN Cat. gén. (sous : Hugo, comte Victor-Marie) : Les misérables. - . - BN Cat. gén. 1960-1969 (sous : Hugo, Victor) : idem. - . -",<br />
"Laffont-Bompiani, Oeuvres, 1994. - . - GDEL. - . -"<br />
]
},</code></p></blockquote>
<p>In that case, watch out for cases where there is a value without language, it will generate a @none key.</p>
<h3><b>Map controlled list values to JSON terms</b></h3>
<p>By now you already get a much cleaner JSON and almost all “unnecessary” URIs have disappeared. But we still have some URI references that we can clean up : the ones that are references to controlled lists with a finite number of values.</p>
<p>We can declare term mappings for those values just like we did to map properties and classes. BUT &#8211; and this is the trick, <strong>we need to change the property declaration from “@id” to “@vocab” for the replacement to happen</strong>. This is documented in the <a href="https://www.w3.org/TR/json-ld/#type-coercion">&laquo;&nbsp;Type coercion&nbsp;&raquo; section of the spec</a>.</p>
<p>In our example, the mapping to languages and subjects are good candidates to be mapped to JSON terms. So we change</p>
<blockquote><p><code>"language" : { "@id" : "dcterms:language", "@type":"@id" },<br />
"subject" : { "@id" : "dcterms:subject", "@type":"@id" },<br />
</code></p></blockquote>
<p>to</p>
<blockquote><p><code>"language" : { "@id" : "dcterms:language", "@type":"@vocab" },<br />
"subject" : { "@id" : "dcterms:subject", "@type":"@vocab" },<br />
“fre” : “http://id.loc.gov/vocabulary/iso639-2/fre”,<br />
“eng” : “http://id.loc.gov/vocabulary/iso639-2/eng”,</code></p></blockquote>
<h3><b>Shorten remaining URI references<br />
</b></h3>
<p>Now the only URIs left are the ids of the main entities in our graph, and references to those ids. Reference to controlled vocabularies with a limited number of values have been mapped to JSON terms. Although we cannot turn all the remaining URIs to JSON terms (because we can&rsquo;t declare all possible entity URIs in the context), we can shorten them by adding a prefix mapping in the context, in our case:</p>
<blockquote><p><code>"ark-https": "https://data.bnf.fr/ark:/12148/",<br />
</code></p></blockquote>
<p>(I note that there are http:// and https:// URIs in the data, I don&rsquo;t know why)</p>
<p>&nbsp;</p>
<h2>The frame specification</h2>
<p>So now we have clean values, no URIs, no fancy JSON-LD keys. But we still don’t have a structure indented the way the average developer would expect it; and this is where the frame specification comes into play.</p>
<h3><b>Define indentation and filters (and reverse properties if needed)<br />
</b></h3>
<p>The frame specification acts as both a filter/selection mechanism and as a structure definition. At each level you indicate the criterias for the object to be included. In our example we have a skos:Concept (the entry in the library catalog) that is foaf:focus a Work (the Book &laquo;&nbsp;in the real world&nbsp;&raquo;), and that skos:Concept is the subject of many virtual exhibits. We want to have the Concept and the Work at first level, and under the concept the exhibits. But there is a trick : it is the virtual exhibits that points to the concept with a dcterms:subject, and we want it the other direction : Concept is_subject_of Exhibit, so we need a @reverse property.</p>
<p>To do that, add the following reverse mapping declaration: (don&rsquo;t modify the existing one):</p>
<blockquote><p><code>"subject_of" : { "@reverse" : "dcterms:subject" },</code></p></blockquote>
<p>Note the use of &laquo;&nbsp;@reverse&nbsp;&raquo; to indicate that JSON key is to be interpreted from object to subject when turned into triples.</p>
<p>With that in place, we can write our frame specification, which goes right after the @context we have designed before:</p>
<blockquote><p><code>"type" : ["Concept", "Work"],<br />
"subject_of" : {<br />
"type" : "ExpositionVirtuelle"<br />
}</code></p></blockquote>
<p>Note how we use the terms defined in the context previously. This is to be understood the following way : <em>&laquo;&nbsp;at the first level, take any entity with a type of either Concept or Work, then insert a subject_of key and put inside any value that has a type ExpositionVirtuelle&nbsp;&raquo;</em>. This garantees the virtual exhibits objects will go under the Concept, and not above or at the same level. But this is not sufficient, as you will notice if you apply that framing that the Work is <strong>repeated</strong> under the &laquo;&nbsp;focus&nbsp;&raquo; property of the Concept, and at the root level. This is because of the default behavior of the JSON-LD playground regarding <a href="https://www.w3.org/TR/json-ld11-framing/#object-embed-flag">object embedding</a> (objects are always embeded when they are referenced)</p>
<h3><b>Avoid embedding</b></h3>
<p>To avoid embedding when it is undesired, we can set the &laquo;&nbsp;@embed&nbsp;&raquo; option to &laquo;&nbsp;@never&nbsp;&raquo; on the &laquo;&nbsp;focus&nbsp;&raquo; property, like so :</p>
<blockquote><p><code>"type" : ["Concept", "Work"],<br />
"subject_of" : {<br />
"type" : "ExpositionVirtuelle"<br />
},<br />
"focus" : {<br />
"@embed" : "@never",<br />
"@omitDefault": true<br />
}<br />
</code></p></blockquote>
<p>This tells the framing algorithm to never embed the complete entity inside the focus property, just reference the URI instead.</p>
<p>Also, you will notice the use of &laquo;&nbsp;@omitDefault&nbsp;&raquo; to true; this tells the framing algorithm to omit the focus property when it has no value. Otherwise, since the Work does not have a foaf:focus property (only the Concept), then it will get a &laquo;&nbsp;focus&nbsp;&raquo; key set to null.</p>
<h3><b>What about order of keys in the JSON ?</b></h3>
<p>Well, I am sure this can be controlled, either by specifying explicitely all the keys you want, in the order you want them, in the frame specification, or by using an &laquo;&nbsp;ordered&nbsp;&raquo; parameter to the JSON-LD API, but that is not available in the playground.</p>
<p>If you list all keys explicitely in the frame specification, don&rsquo;t forget to use wildcards so that any value will match; wildcards are empty objects with &laquo;&nbsp;{}&nbsp;&raquo;:</p>
<blockquote><p><code>"myProperty" : {}</code></p></blockquote>
<h2>The result</h2>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2022/07/Capture-d’écran-du-2022-07-20-08-01-19.png"><img class="aligncenter size-full wp-image-1475" src="http://blog.sparna.fr/wp-content/uploads/2022/07/Capture-d’écran-du-2022-07-20-08-01-19.png" alt="Capture d’écran du 2022-07-20 08-01-19" width="806" height="360" /></a></p>
<p>Much nicer no ? This is something you can put into the hand of an average developer.</p>
<h2>Automate context generation from SHACL</h2>
<p>Do you have a SHACL specification of the structure of your graph ? wouldn&rsquo;t it be nice to automate the generation of the JSON-LD context from SHACL ? Maybe we could do that in <a href="https://shacl-play.sparna.fr/play/">SHACL-Play</a> ? stay tuned !</p>
<p>Probably what we can automate is the context part, which can be global and unique for all your graph, but the framing specification should probably be different for each different API you need; each framing specification will then reference the same context by its URL.</p>
<p><em>Image : <a href="https://gallica.bnf.fr/ark:/12148/btv1b53230250h/f1.item.r=encadrement.zoom#"><span class="title">[Encadrement ornemental] ([1er état]) / .Io. MIGon 1544. [Jean Mignon] ; [d&rsquo;après Le Primatice]</span></a> <a href="https://gallica.bnf.fr/ark:/12148/btv1b53230250h">https://gallica.bnf.fr/ark:/12148/btv1b53230250h</a></em></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2022/07/20/clean-json-ld-from-rdf-using-framing/">Clean JSON(-LD) from RDF using Framing</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2022/07/20/clean-json-ld-from-rdf-using-framing/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Aligner son thesaurus avec RAMEAU en utilisant OnAGUI</title>
		<link>https://blog.sparna.fr/2018/07/17/aligner-thesaurus-rameau-onagui/</link>
		<comments>https://blog.sparna.fr/2018/07/17/aligner-thesaurus-rameau-onagui/#comments</comments>
		<pubDate>Tue, 17 Jul 2018 08:56:09 +0000</pubDate>
		<dc:creator><![CDATA[Thomas Francart]]></dc:creator>
				<category><![CDATA[Editeurs]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[OnAGUI]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[Thesaurus]]></category>
		<category><![CDATA[alignement]]></category>
		<category><![CDATA[bnf]]></category>
		<category><![CDATA[data.bnf.fr]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[onagui]]></category>
		<category><![CDATA[Rameau]]></category>
		<category><![CDATA[thesaurus]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1212</guid>
		<description><![CDATA[<p>La BNF organisait le 10 juillet 2018 un atelier &#171;&#160;Données liées et données à lier : quels outils pour quels alignements ?&#171;&#160;, avec plein de bonnes choses dedans : RobotDonnées, le module développé par Logilab pour data.bnf.fr qui permet le regroupement semi-automatique des éditions en œuvres; où l&#8217;on apprend que data.bnf.fr ne comporte &#171;&#160;que&#160;&#187; 300&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/07/17/aligner-thesaurus-rameau-onagui/">Aligner son thesaurus avec RAMEAU en utilisant OnAGUI</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p style="text-align: justify;">La BNF organisait le 10 juillet 2018 un <a href="https://bnf.hypotheses.org/3077">atelier &laquo;&nbsp;<em>Données liées et données à lier : quels outils pour quels alignements ?</em>&laquo;&nbsp;</a>, avec plein de bonnes choses dedans :</p>
<ul>
<li style="text-align: justify;">RobotDonnées, le module développé par Logilab pour data.bnf.fr qui permet le regroupement semi-automatique des éditions en œuvres;
<ul>
<li>où l&rsquo;on apprend que <a href="http://data.bnf.fr">data.bnf.fr</a> ne comporte &laquo;&nbsp;que&nbsp;&raquo; 300 000 œuvres, alors que la cible théorique serait à plusieurs millions; par exemple on voit bien en regardant <a href="http://data.bnf.fr/12249911/michel_houellebecq/#rdt70-12249911">les oeuvres dont Michel Houellebecq est auteur dans data.bnf.fr</a> que &laquo;&nbsp;<em>La carte et le territoire</em>&nbsp;&raquo; n’apparaît pas comme œuvre, mais que toutes les éditions de cet ouvrage sont bien listées quand on regarde <a href="http://data.bnf.fr/documents-by-rdt/12249911/70/page1">tous les documents dont Houellebecq est l&rsquo;auteur</a>.</li>
<li>où l&rsquo;on apprend également que ce regroupement en œuvre est progressivement réintroduit dans le catalogue général de la BNF;</li>
</ul>
</li>
<li style="text-align: justify;">BiblioStratus, pour aligner un catalogue externe avec celui de la BNF sur la base de plusieurs heuristiques (ISBN, EAN ou bien titre+auteur+date ou titre+auteur); où l&rsquo;on apprend que la transition n&rsquo;est pas qu&rsquo;écologique mais qu&rsquo;il existe également une <a href="https://www.transition-bibliographique.fr/17-06-20-notice-bibliographique-manifestation-notice-autorite-titre-oeuvre/">transition bibliographique</a>;</li>
<li style="text-align: justify;">l&rsquo;ANR <a href="https://project.inria.fr/elker/">ELKER</a> pour &laquo;&nbsp;Enhancing Link Keys: Extraction and Reasoning&nbsp;&raquo;, porté notamment par l&rsquo;INRIA Grenoble pour la détection automatique des attributs permettant de trouver des correspondances dans 2 jeux de données RDF arbitraires; très impressionnant;</li>
<li style="text-align: justify;"><a href="http://cultuurlink.beeldengeluid.nl">CultuurLink</a>, à tester pour tous ceux qui s&rsquo;intéressent aux processus d&rsquo;alignement de vocabulaires, bien que l&rsquo;outil soit restreint aux vocabulaires patrimoniaux des Pays-Bas;</li>
<li style="text-align: justify;"><em>et encore plein d&rsquo;autres choses (cf les liens sur la page de l&rsquo;atelier);</em></li>
</ul>
<h2 style="text-align: justify;">OnAGUI pour aligner 2 thesaurus SKOS</h2>
<p style="text-align: justify;">Je voulais profiter de cette thématique des alignements pour parler d&rsquo;un autre outil d&rsquo;alignement de thesaurus SKOS : <strong><a href="https://github.com/lmazuel/onagui">OnAGUI</a> </strong>pour &laquo;&nbsp;<em>Ontology Alignment Graphical Interface</em>&laquo;&nbsp;. OnAGUI est le genre d&rsquo;outil qui ne paie pas de mine au premier abord mais qui est à mon avis<strong> l&rsquo;outil dont la prise en main est la plus immédiate pour qui veut aligner 2 thesaurus en SKOS, </strong>tout en permettant d&rsquo;<strong>outiller le processus depuis l&rsquo;alignement linguistique automatique jusqu&rsquo;à la validation manuelle de l&rsquo;alignement produit</strong>.</p>
<div id="attachment_1170" style="width: 660px" class="wp-caption aligncenter"><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/onagui-irstea-gemet.png"><img class="size-large wp-image-1170" src="http://blog.sparna.fr/wp-content/uploads/2018/01/onagui-irstea-gemet-1024x582.png" alt="L'interface d'OnAGUI pour aligner 2 thesaurus" width="650" height="369" /></a><p class="wp-caption-text">L&rsquo;interface d&rsquo;OnAGUI pour aligner 2 thesaurus</p></div>
<p style="text-align: justify;">Quelques considérations générales : d&rsquo;abord l&rsquo;acronyme parle d&rsquo;alignement d&rsquo; &laquo;&nbsp;ontologies&nbsp;&raquo;, mais pas de panique, ça marche pour des <strong>thesaurus en SKOS</strong>; surtout pour des thesaurus en SKOS, d&rsquo;ailleurs. Ensuite, c&rsquo;est <strong>open-source</strong>; ça s&rsquo;installe facilement sur n&rsquo;importe quel poste équipé de Java, c&rsquo;est une <strong>application &laquo;&nbsp;de bureau&nbsp;&raquo;</strong>, qui ne nécessite pas d&rsquo;infrastructure client-serveur, etc. l&rsquo;interface est en <strong>français</strong> (et anglais) puisque le développement à été fait par <a href="https://github.com/lmazuel">Laurent Mazuel</a> à l&rsquo;époque où il travaillait avec <a href="http://www.limics.fr/fr/annuaire/fiche-membre/voir/18-CHARLET-Jean">Jean Charlet</a> au <a href="http://www.limics.fr/fr/">LIMICS</a> (INSERM). L&rsquo;outil est maintenu, notamment par votre serviteur.</p>
<p style="text-align: justify;">Vous pouvez consulter la <a href="https://github.com/lmazuel/onagui/wiki">documentation d&rsquo;OnAGUI</a> où vous trouverez des informations plus détaillées sur le logiciel.</p>
<h2 style="text-align: justify;">Alignement avec RAMEAU</h2>
<p style="text-align: justify;">Maintenant, le cas d&rsquo;usage : il m&rsquo;ait arrivé de rencontrer des personnes qui avaient besoin d&rsquo;<strong>aligner un vocabulaire vers le <a href="http://rameau.bnf.fr/">langage d&rsquo;indexation RAMEAU de la BNF</a></strong>. Comment faire ?</p>
<ol>
<li style="text-align: justify;">Je ne détaille pas ici la structure de RAMEAU et son mécanisme de &laquo;&nbsp;vedettes-matières pré-coordonnées&nbsp;&raquo; que je ne connais pas bien d&rsquo;ailleurs, mais je pars (naïvement) de ce qui existe : le fichier des Thèmes Rameau que l&rsquo;on télécharge depuis le site des <a href="http://api.bnf.fr/dumps-de-databnffr">dumps de données de la BNF</a>. Il s&rsquo;agit du fichier intitulé &laquo;&nbsp;<a href="ftp://databnf:databnf@pef.bnf.fr/DATA/databnf_rameau_xml.tar.gz">[RDF/XML] Thèmes RAMEAU</a>&laquo;&nbsp;.</li>
<li style="text-align: justify;">A ce stade il faut l&rsquo;avouer : <em>la BNF a encore des efforts à faire sur diffusion des données de RAMEAU</em>; les données telles qu&rsquo;on les télécharge sont inutilisables telles qu&rsquo;elles, et nécessitent : 1/ un nettoyage et 2/ une fusion en un seul fichier; je ne m&rsquo;étends pas sur cette étape que j&rsquo;ai réalisé à l&rsquo;aide de scripts <a href="http://rdf4j.org">RDF4J</a> dont je me sers régulièrement, mais on se ramène au final à un seul fichier RDF contenant tout RAMEAU en SKOS; si vous voulez tester OnAGUI sur un autre vocabulaire ne nécessitant pas cette étape de &laquo;&nbsp;magie noire&nbsp;&raquo;, essayez avec <a href="http://vocabularies.unesco.org/browser/en/about">le thesaurus de l&rsquo;UNESCO</a> ou <a href="http://data.europa.eu/euodp/fr/data/dataset/eurovoc">Eurovoc</a>;</li>
<li style="text-align: justify;">Il faut ensuite avoir son propre thesaurus en SKOS; on peut utiliser pour cela l&rsquo;<a href="http://labs.sparna.fr/skos-play/convert">outil de conversion Excel vers SKOS disponible dans SKOS Play</a>;</li>
<li style="text-align: justify;">Il faut <a href="https://github.com/lmazuel/onagui/releases">télécharger OnAGUI</a> en prenant soin de prendre une version postérieure à 0.3.2, version pour laquelle des améliorations ont été apportées pour fonctionner avec RAMEAU;</li>
<li>Muni donc de votre vocabulaire en SKOS d&rsquo;un côté, de RAMEAU (nettoyé) de l&rsquo;autre, et d&rsquo;OnAGUI entre les deux, vous pouvez ensuite :
<ol>
<li style="text-align: justify;">Charger votre thesaurus dans OnaGUI (Menu &laquo;&nbsp;Fichier &gt; Charger vocabulaire 1 &gt; Charger en SKOS&nbsp;&raquo;);</li>
<li style="text-align: justify;">Charger RAMEAU dans OnaGUI (Menu &laquo;&nbsp;Fichier &gt; Charger vocabulaire 2 &gt; Charger en SKOS&nbsp;&raquo;), qui va vaillamment charger 160000+ concepts;</li>
<li style="text-align: justify;">Lancer un alignement linguistique (Menu &laquo;&nbsp;Alignement &gt; I-Sub distance&nbsp;&raquo;) en prenant soin de :
<ol>
<li>sélectionner la langue &laquo;&nbsp;fr&nbsp;&raquo; dans les 2 thesaurus;</li>
<li>abaisser légèrement le seuil de tolérance de l&rsquo;algorithme, en le mettant à 0.85 par exemple; ne le mettez pas trop bas sinon vous aurez trop de correspondances qui ne veulent rien dire, ni trop haut sinon vous allez rater des choses;</li>
</ol>
</li>
<li style="text-align: justify;">Aller prendre un petit café, ou deux, ou trois&#8230; en fonction de la taille du vocabulaire que vous avez besoin d&rsquo;aligner; patientez entre quelques secondes et une nuit entière;</li>
<li style="text-align: justify;">Analyser le tableau de résultat fourni et soit :
<ol>
<li>supprimer des lignes de correspondances fausses; regardez le score de chaque correspondance : plus le score tend vers 1, meilleure est la correspondance linguisitique, plus le score diminue, plus les libellés sont éloignés;</li>
<li>ajouter des nouvelles correspondances qui auraient été ratées;</li>
</ol>
</li>
<li style="text-align: justify;">Puis exporter l&rsquo;alignement (Menu &laquo;&nbsp;Fichier &gt; Exporter l&rsquo;alignemenr&nbsp;&raquo;) soit dans un format RDF <a href="http://alignapi.gforge.inria.fr/edoal.html">EDOAL</a>, le plus expressif, soit avec les <a href="https://www.w3.org/TR/2009/REC-skos-reference-20090818/#mapping">propriétés d&rsquo;alignement SKOS</a>, soit en CSV pour une analyse dans un tableur;</li>
</ol>
</li>
</ol>
<div id="attachment_1216" style="width: 537px" class="wp-caption aligncenter"><a href="http://blog.sparna.fr/wp-content/uploads/2018/07/onagui-rameau.png"><img class="wp-image-1216 size-full" src="http://blog.sparna.fr/wp-content/uploads/2018/07/onagui-rameau.png" alt="Vocabulaire Rameau dans OnaGUI" width="527" height="903" /></a><p class="wp-caption-text">Vocabulaire Rameau chargé dans OnaGUI</p></div>
<p>&nbsp;</p>
<p style="text-align: justify;">OnAGUI est un outil perfectible sur bien des aspects mais qui permet de valider &laquo;&nbsp;pour de vrai&nbsp;&raquo; des alignements sur des vrais projets, sans besoin d&rsquo;infrastructure compliquée ou d&rsquo;outil commercial. J&rsquo;ai pu le mettre en œuvre à 2 reprises : pour l&rsquo;alignement du <a href="https://vocabulaires.irstea.fr/">thesaurus de l&rsquo;IRSTEA</a> vers le <a href="http://www.eionet.europa.eu/gemet/en/themes/">GEMET</a> et <a href="http://aims.fao.org/vest-registry/vocabularies/agrovoc">Agrovoc</a>, et plus récemment pour un essai d&rsquo;alignement du <a href="http://data.legilux.public.lu/vocabulaires/legal-subject-theme/fr/">vocabulaire thématique d&rsquo;indexation de la loi Luxembourgeoise</a> avec le <a href="http://eurovoc.europa.eu/">thesaurus Eurovoc</a> utilisé au niveau européen, sur la base de l&rsquo;analyse des <a href="https://fr.wikipedia.org/wiki/Transposition_(droit)">transpositions des directives européennes</a> dans la loi nationale. Il peut également prendre en charger des cas d&rsquo;usage d&rsquo;<em>alignement partiel</em>, en alignant seulement les concepts créés ou modifiés après une certaine date, de façon à ne pas à avoir à refaire l&rsquo;intégralité de l&rsquo;alignement à chaque nouvelle version du thesaurus.</p>
<p style="text-align: justify;">Si vous avez des questions, des cas d&rsquo;usage ou des retours d&rsquo;expérience sur OnAGUI, n&rsquo;hésitez pas à <a href="https://github.com/lmazuel/onagui/issues">soumettre des tickets dans le site du projet</a>.</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/07/17/aligner-thesaurus-rameau-onagui/">Aligner son thesaurus avec RAMEAU en utilisant OnAGUI</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2018/07/17/aligner-thesaurus-rameau-onagui/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SPARQL sur les données de data.bnf.fr : un exercice guidé dans les Fables</title>
		<link>https://blog.sparna.fr/2018/03/07/data-bnf-fr-sparql-exercice/</link>
		<comments>https://blog.sparna.fr/2018/03/07/data-bnf-fr-sparql-exercice/#comments</comments>
		<pubDate>Wed, 07 Mar 2018 14:27:41 +0000</pubDate>
		<dc:creator><![CDATA[Thomas Francart]]></dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[data.bnf.fr]]></category>
		<category><![CDATA[FRBR]]></category>
		<category><![CDATA[Rameau]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1182</guid>
		<description><![CDATA[<p>Appréhender les données de data.bnf.fr n&#8217;est pas forcément facile. Je vais expérimenter ce nouveau TP pour expliquer comment partir à la découverte des données de la BNF en utilisant SPARQL. L&#8217;énoncé permet de comprendre : Comment trouver un auteur par son nom Comment trouver une oeuvre par son nom Comment chercher un mot-clé dans Rameau&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/03/07/data-bnf-fr-sparql-exercice/">SPARQL sur les données de data.bnf.fr : un exercice guidé dans les Fables</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>Appréhender les données de data.bnf.fr n&rsquo;est pas forcément facile. Je vais expérimenter ce nouveau TP pour expliquer comment partir à la découverte des données de la BNF en utilisant SPARQL. L&rsquo;énoncé permet de comprendre :</p>
<ul>
<li>Comment trouver un auteur par son nom</li>
<li>Comment trouver une oeuvre par son nom</li>
<li>Comment chercher un mot-clé dans Rameau</li>
<li>Comment faire une recherche combinée sur l&rsquo;auteur de l&rsquo;oeuvre et un mot-clé Rameau</li>
</ul>
<blockquote class="embedly-card"><h4><a href="https://www.slideshare.net/thomasfrancart/dcouvrir-les-donnes-de-databnffr-en-utilisant-sparql/thomasfrancart/dcouvrir-les-donnes-de-databnffr-en-utilisant-sparql">Découvrir les données de data.bnf.fr en utilisant SPARQL</a></h4>
<p>Travaux pratiques pour explorer les oeuvres, les auteurs et les mots-clés de data.bnf.fr en utilisant des requêtes SPARQL.</p>
</blockquote>
<p><script async src="//cdn.embedly.com/widgets/platform.js" charset="UTF-8"></script></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/03/07/data-bnf-fr-sparql-exercice/">SPARQL sur les données de data.bnf.fr : un exercice guidé dans les Fables</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2018/03/07/data-bnf-fr-sparql-exercice/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Vocabulaires dans le web de données : quels outils open-source ?</title>
		<link>https://blog.sparna.fr/2018/01/23/vocabulaires-thesaurus-web-donnees-skos-open-source/</link>
		<comments>https://blog.sparna.fr/2018/01/23/vocabulaires-thesaurus-web-donnees-skos-open-source/#comments</comments>
		<pubDate>Tue, 23 Jan 2018 10:43:08 +0000</pubDate>
		<dc:creator><![CDATA[Thomas Francart]]></dc:creator>
				<category><![CDATA[Editeurs]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[OnAGUI]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[SKOS Play]]></category>
		<category><![CDATA[Skosmos]]></category>
		<category><![CDATA[Thesaurus]]></category>
		<category><![CDATA[ginco]]></category>
		<category><![CDATA[onagui]]></category>
		<category><![CDATA[opentheso]]></category>
		<category><![CDATA[skos play]]></category>
		<category><![CDATA[skos testing tool]]></category>
		<category><![CDATA[skosmos]]></category>
		<category><![CDATA[vocbench]]></category>

		<guid isPermaLink="false">http://blog.sparna.fr/?p=1157</guid>
		<description><![CDATA[<p>Je travaille régulièrement sur des projets de gestion et/ou de mise en ligne de vocabulaires (thesaurus, taxonomies) en SKOS : pour l&#8217;UNESCO &#8211; voir le précédent billet -, l&#8217;INED (thesaurus Popin), le Luxembourg, l&#8217;IRSTEA ou le Réseau Canopé (vocabulaire ScolomFr). Ces projets sont réalisés en utilisant de l&#8217;open-source sur toute la chaîne, depuis le back-office&#8230;</p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/01/23/vocabulaires-thesaurus-web-donnees-skos-open-source/">Vocabulaires dans le web de données : quels outils open-source ?</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p style="text-align: justify;">Je travaille régulièrement sur des <strong>projets de gestion et/ou de mise en ligne de vocabulaires (thesaurus, taxonomies)</strong> en <a href="http://www.sparna.fr/skos/SKOS-traduction-francais.html">SKOS</a> : pour l&rsquo;<a href="http://vocabularies.unesco.org">UNESCO</a> &#8211; voir le <a href="http://blog.sparna.fr/2017/02/06/unesco-thesaurus-published-with-semantic-web-standards-and-open-source-software/">précédent billet</a> -, l&rsquo;<a href="https://www.ined.fr/">INED</a> (thesaurus Popin), le <a href="http://data.legilux.public.lu/">Luxembourg</a>, l&rsquo;<a href="http://irstea.fr">IRSTEA</a> ou le <a href="https://www.reseau-canope.fr/">Réseau Canopé</a> (vocabulaire <a href="https://www.reseau-canope.fr/scolomfr/accueil.html">ScolomFr</a>). Ces projets sont réalisés en utilisant de l&rsquo;open-source sur toute la chaîne, depuis le back-office de gestion jusqu&rsquo;à la diffusion conforme aux standards du web de données, en passant par l&rsquo;alignement ou la conversion depuis des tableaux Excel.</p>
<p>&nbsp;</p>
<p style="text-align: justify;"><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/Outils-pour-la-publication-des-thesaurus-dans-le-web-de-donnees.jpg"><img class=" wp-image-1158 size-full aligncenter" src="http://blog.sparna.fr/wp-content/uploads/2018/01/Outils-pour-la-publication-des-thesaurus-dans-le-web-de-donnees.jpg" alt="Outils-pour-la-publication-des-thesaurus-dans-le-web-de-donnees" width="960" height="720" /></a></p>
<p style="text-align: justify;">Je vous propose un petit <strong>tour d&rsquo;horizon des outils open-source pour la gestion et la diffusion de vocabulaires contrôlés</strong>.</p>
<h1>Outils de gestion</h1>
<h2>Vocbench3</h2>
<p style="text-align: justify;"><a href="http://vocbench.uniroma2.it/">Vocbench3</a> est l&rsquo;outil de gestion de référentiels SKOS que je recommande. Il est développé par l&rsquo;Université de Rome Tor-Vergata, et financé pour 1 ou 2 année encore par le programme de financement européen ISA, ce qui lui donne une bonne visibilité à moyen terme. La <a href="http://vocbench.uniroma2.it/support/community.jsf">communauté d&rsquo;utilisateurs</a> est large avec beaucoup d&rsquo;organismes internationaux (FAO/<a href="http://aims.fao.org/vest-registry/vocabularies/agrovoc-multilingual-agricultural-thesaurus">Agrovoc</a>, INRA, Office des Publications de l&rsquo;UE/<a href="http://eurovoc.europa.eu/">Eurovoc</a>, etc.). Le support est bon, l&rsquo;équipe de développement répond rapidement aux questions.</p>
<p style="text-align: justify;">Coté technique, il est directement basé sur des données RDF, ce qui facilite son intégration dans des environnements techniques basés sur le web de données. Le modèle de données est extensible et permet de gérer des propriétés &laquo;&nbsp;customisées&nbsp;&raquo; sur les concepts. Vocbench permet également d&rsquo;éditer des ontologies OWL, sans arriver à concurrencer <a href="https://protege.stanford.edu/">Protégé</a>.</p>
<p style="text-align: justify;"><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/td_eurovoc_show.png"><img class="aligncenter wp-image-1166 size-large" src="http://blog.sparna.fr/wp-content/uploads/2018/01/td_eurovoc_show-1024x646.png" alt="td_eurovoc_show" width="650" height="410" /></a></p>
<h2 style="text-align: justify;">Ginco</h2>
<p style="text-align: justify;"><a href="https://github.com/culturecommunication/ginco">Ginco</a> est l&rsquo;outil de gestion de vocabulaires développé par le Ministère de la Culture pour la maintenance de ses référentiels. Il a été dès le départ spécifié pour être compatible avec la norme <a href="http://dossierdoc.typepad.com/descripteurs/2011/08/norme-iso-25964-1-thesaurus-publication-officielle.html">ISO-25964</a> sur la modélisation des thesaurus, et avec SKOS. Donc en particulier la gestion des facettes, des tableaux de concepts, etc. Il a un pendant &laquo;&nbsp;<a href="https://github.com/culturecommunication/ginco-diff">Ginco diffusion</a>&nbsp;&raquo; qui motorise le <a href="http://data.culture.fr/thesaurus/">site de diffusion des vocabulaires du MCC</a>.</p>
<h2 style="text-align: justify;">OpenTheso</h2>
<p style="text-align: justify;"><a href="https://github.com/miledrousset/opentheso">OpenTheso</a> est l&rsquo;outil de gestion SKOS <em>made in</em> CNRS. La communauté d&rsquo;utilisateurs est dans le monde de la recherche française/francophone (notamment le réseau <a href="https://www.frantiq.fr/">FRANTIQ</a>).<br />
On pourra consulter la <a href="http://semweb.pro/semwebpro-2017.html#p3">présentation d&rsquo;OpenTheso à semweb.pro 2017 par Miled Rousset</a>.</p>
<h1>Outils de Publication et Visualisation de vocabulaires</h1>
<h2>Skosmos</h2>
<p style="text-align: justify;">Vous voulez <strong>valoriser vos vocabulaires/thesaurus/taxonomies</strong> ? Vous avez besoin des les rendre visibles, navigables, interrogeables par les utilisateurs ? Vous voulez recevoir du feedback, faire participer une communauté ? <a href="http://skosmos.org/">Skosmos</a> est l&rsquo;outil de choix.<br />
Développé par la Bibliothèque Nationale de Finlande pour son portail &laquo;&nbsp;<a href="http://finto.fi">finto.fi</a>&nbsp;&raquo; (parlez-vous finnois ?), Skosmos propose une interface de rendu très propre avec toutes les fonctions de navigation dans des vocabulaires contrôlés : listes alphabétiques, arborescence hiérarchique, recherche plain-texte avec autocompletion, gestion des groupes/collections SKOS, affichage propre des alignements avec d&rsquo;autres vocabulaires. Il s&rsquo;adapte assez facilement pour le mettre aux couleurs de l&rsquo;institution. Il prend en charge la négociation de contenu pour l&rsquo;accès aux données brutes en RDF. L&rsquo;interface inclut également un formulaire permettant d&rsquo;envoyer du feedback à l&rsquo;équipe de maintenance du thesaurus.</p>
<p style="text-align: justify;">L&rsquo;équipe de développement répond très vite aux questions, le code PHP est de qualité, l&rsquo;outil est vivant et de nouvelles versions sont disponibles régulièrement.</p>
<p style="text-align: justify;">Skosmos est à mon sens <strong>LE meilleur outil pour la diffusion de vocabulaires en SKOS</strong>.</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/screenshot-skosmos-finto.png"><img class="aligncenter wp-image-1168 size-large" src="http://blog.sparna.fr/wp-content/uploads/2018/01/screenshot-skosmos-finto-1024x597.png" alt="screenshot-skosmos-finto" width="650" height="379" /></a></p>
<h2>Skos Play</h2>
<p style="text-align: justify;"><a href="http://labs.sparna.fr/skos-play/">SKOS Play</a> est l&rsquo;outil que je développe depuis 2013 pour faire de la publication et de la visualisation de référentiels SKOS. Il permet de générer des documents PDF des vocabulaires, des sorties HTML dynamiques (arbres dépliables),des listings alphabétiques, des tableaux de traduction ou d&rsquo;alignement, ou des data visualisations. A titre d&rsquo;exemple on pourra consulter les <a href="http://blog.sparna.fr/2017/04/18/rome-pole-emploi-skos-rdf-data-gouv-fr/">rendus du code Rome de Pôle Emploi en SKOS</a>.</p>
<p style="text-align: justify;">SKOS Play est intégré dans la plateforme de publication des vocabulaires du MCC, et dans celle de l&rsquo;UNESCO, pour la <a href="http://vocabularies.unesco.org/exports/thesaurus/latest/">génération des PDFs</a>. Le service en ligne est quant à lui utilisé plus de <strong>500 fois par mois</strong>.</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/code-rome-visualisation-arbre.png"><img class="aligncenter wp-image-1164 size-large" src="http://blog.sparna.fr/wp-content/uploads/2018/01/code-rome-visualisation-arbre-1024x557.png" alt="code-rome-visualisation-arbre" width="650" height="354" /></a></p>
<h1>Conversion Excel, contrôle qualité, alignement</h1>
<h2>Skos Play convert</h2>
<p style="text-align: justify;">SKOS Play inclut un <a href="http://labs.sparna.fr/skos-play/convert">service de génération de données RDF/SKOS depuis des tableaux Excel</a> (voir <a href="http://blog.sparna.fr/2017/01/12/creer-des-referentiels-skosrdf-a-partir-dexcel/">précédent billet</a>).</p>
<p style="text-align: justify;">Soyons honnêtes : <strong>rien ne remplace Excel</strong> <img src="https://blog.sparna.fr/wp-includes/images/smilies/simple-smile.png" alt=":-)" class="wp-smiley" style="height: 1em; max-height: 1em;" /> ! pour de la saisie ou de l&rsquo;analyse de données en masse, avoir une vue globale sur ces données, faire des reporting, gérer facilement plusieurs vocabulaires&#8230; Pas de courbe d&rsquo;apprentissage, pas d&rsquo;outil à installer, grande efficacité de saisie. Et si on utilise un environnement en ligne type Google Sheets, on gagne l&rsquo;aspect collaboratif.<br />
Cette passerelle Excel-RDF-SKOS s&rsquo;est révélée très importante pour faire les reprises ou les créations initiales de vocabulaires, avant de les basculer dans des outils de gestion comme VocBench.</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/convert-screenshot-body.png"><img class="aligncenter wp-image-1163 size-full" src="http://blog.sparna.fr/wp-content/uploads/2018/01/convert-screenshot-body.png" alt="convert-screenshot-body" width="1490" height="335" /></a></p>
<h2>Skos Testing Tool : contrôle qualité</h2>
<p style="text-align: justify;">Le <a href="http://labs.sparna.fr/skos-testing-tool/">SKOS Testing Tool</a> est une interface web permettant de valider la conformité de fichiers SKOS (voir <a href="http://blog.sparna.fr/2017/07/11/outil-de-test-de-vocabulaires-skos/">précédent billet</a>). L&rsquo;outil s&rsquo;appuie sur <a href="https://github.com/cmader/qSKOS">qSKOS</a>, qui fonctionne lui en ligne de commande. 27 vérifications peuvent être effectuées sur les données, comme :</p>
<ul style="text-align: justify;">
<li>vérification de l&rsquo;absence de cycle dans la hiérarchie des concepts</li>
<li>vérification que tous les concepts ont bien un prefLabel</li>
<li>vérification que tous les concepts sont traduits (dans le cas de thesaurus multilingues)</li>
<li>&#8230;</li>
</ul>
<p style="text-align: justify;">Combiné avec les sorties alphabétiques/hiérarchiques de SKOS Play, ces vérifications permettent de s&rsquo;assurer de la qualité des données soit en amont lors des comités de validation du vocabulaire, soit en aval avant la diffusion des données en ligne dans Skosmos.</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/screenshot-skos-testing-tool.png"><img class="aligncenter wp-image-1172 size-large" src="http://blog.sparna.fr/wp-content/uploads/2018/01/screenshot-skos-testing-tool-1024x715.png" alt="screenshot-skos-testing-tool" width="650" height="454" /></a></p>
<h2>OnaGUI : Alignement semi-automatique</h2>
<p style="text-align: justify;"><a href="https://github.com/lmazuel/onagui">OnaGUI</a> (Ontology Alignment Graphical User Interface) a été développé à l&rsquo;origine à l&rsquo;INSERM dans l&rsquo;équipe de <a href="http://www.limics.fr/fr/annuaire/fiche-membre/voir/18-CHARLET-Jean">Jean Charlet</a> pour faire de l&rsquo;alignement d&rsquo;ontologies médicales, et il a depuis été adapté pour pouvoir aligner des données SKOS.</p>
<p style="text-align: justify;">C&rsquo;est l&rsquo;outil le plus accessible et le plus simple d&rsquo;utilisation pour aligner 2 vocabulaires : on charge un thesaurus à gauche, un thesaurus à droite, et on lance un algorithme d&rsquo;alignement qui va proposer des correspondances sur la base des proximités des libellés des concepts.</p>
<p style="text-align: justify;">On utilise régulièrement OnaGUI pour illustrer les alignements de thesaurus dans les <a href="https://www.adbs.fr/formations/formation-comprendre-web-semantique-et-ontologies-167">formations web de données à l&rsquo;ADBS</a>.</p>
<p style="text-align: justify;">Si on veut utiliser des algorithmes d&rsquo;alignement plus poussés, on se tournera vers l&rsquo;<a href="http://alignapi.gforge.inria.fr/">API d&rsquo;alignement de l&rsquo;INRIA</a>, mais qui demande du développement technique pour être intégrée dans un outil.</p>
<p><a href="http://blog.sparna.fr/wp-content/uploads/2018/01/onagui-irstea-gemet.png"><img class="aligncenter wp-image-1170 size-large" src="http://blog.sparna.fr/wp-content/uploads/2018/01/onagui-irstea-gemet-1024x582.png" alt="onagui-irstea-gemet" width="650" height="369" /></a></p>
<h2>Quels outils manquent ?</h2>
<p>Quels outils manquent dans l&rsquo;écosystème des vocabulaires dans le web de données ?</p>
<ul>
<li>Probablement des outils faciles à mettre en œuvre pour assister à la <strong>création de vocabulaires à partir d&rsquo;analyse de corpus</strong> (détection d&rsquo;entités, agrégation des mots en concepts, etc.); en connaissez-vous ? <a href="https://www.ibm.com/watson/products-services/">IBM Watson</a> ne ferait pas quelque chose comme ça ?</li>
<li>Des <strong>interfaces d&rsquo;alignement puissantes</strong> faciliteraient la mise en réseau des référentiels (des choses comme <a href="http://cultuurlink.beeldengeluid.nl">cultuurlink</a>, mais généralisées);</li>
<li>Une <strong>gestion des versions successives</strong> des vocabulaires, avec calcul de deltas, génération automatique de ce qui a changé entre 2 versions, etc;</li>
<li>Des <strong>plugins dans les CMS</strong> courants (WordPress, Drupal, etc&#8230;) permettant d&rsquo;importer directement des vocabulaires SKOS, et de les mettre à jour; ça doit bien exister, non ? également des ponts faciles et direct vers SolR et ElasticSearch;</li>
<li>Mon rêve : un <strong>outil d&rsquo;édition des vocabulaires en mode collaboratif et ouvert poussé</strong>, en mode SaaS, sans différence entre l&rsquo;environnement d&rsquo;édition et de publication, permettant de faire collaborer une communauté plus large que quelques éditeurs experts; ça viendra un jour ! <img src="https://blog.sparna.fr/wp-includes/images/smilies/simple-smile.png" alt=":-)" class="wp-smiley" style="height: 1em; max-height: 1em;" /></li>
</ul>
<p><em><strong>Et vous, quels outils vous manquent ?</strong></em></p>
<p><em>(Illustration de l&rsquo;article : &laquo;&nbsp;Atelier Heilmann Ducommun pour la construction de machines-outils&nbsp;&raquo; Gallica <a href="http://gallica.bnf.fr/ark:/12148/btv1b10201510v" target="_blank">http://gallica.bnf.fr/ark:/12148/btv1b10201510v</a>)</em></p>
<p>Cet article <a rel="nofollow" href="https://blog.sparna.fr/2018/01/23/vocabulaires-thesaurus-web-donnees-skos-open-source/">Vocabulaires dans le web de données : quels outils open-source ?</a> est apparu en premier sur <a rel="nofollow" href="https://blog.sparna.fr">Sparna Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.sparna.fr/2018/01/23/vocabulaires-thesaurus-web-donnees-skos-open-source/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
	</channel>
</rss>
