<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.0" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">1683-1470</journal-id>
<journal-title-group>
<journal-title>Data Science Journal</journal-title>
</journal-title-group>
<issn pub-type="epub">1683-1470</issn>
<publisher>
<publisher-name>Ubiquity Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5334/dsj-2019-004</article-id>
<article-categories>
<subj-group>
<subject>Practice paper</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Implementing in the VAMDC the New Paradigms for Data Citation from the Research Data Alliance</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-5762-6747</contrib-id>
<name>
<surname>Zw&#246;lf</surname>
<given-names>Carlo Maria</given-names>
</name>
<email>carlo-maria.zwolf@obspm.fr</email>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-2">2</xref>
<xref ref-type="aff" rid="aff-3">3</xref>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Moreau</surname>
<given-names>Nicolas</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-2">2</xref>
<xref ref-type="aff" rid="aff-3">3</xref>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ba</surname>
<given-names>Yaye-Awa</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-2">2</xref>
<xref ref-type="aff" rid="aff-3">3</xref>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-1434-0693</contrib-id>
<name>
<surname>Dubernet</surname>
<given-names>Marie-Lise</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-2">2</xref>
<xref ref-type="aff" rid="aff-3">3</xref>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>LERMA, Observatoire de Paris, FR</aff>
<aff id="aff-2"><label>2</label>PSL Research University, FR</aff>
<aff id="aff-3"><label>3</label>CNRS, Sorbonne University, FR</aff>
<aff id="aff-4"><label>4</label>UPMC Univ Paris 06, 5 Place Janssen, 92190 Meudon, FR</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2019-01-14">
<day>14</day>
<month>01</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>18</volume>
<elocation-id>4</elocation-id>
<history>
<date date-type="received" iso-8601-date="2018-07-31">
<day>31</day>
<month>07</month>
<year>2018</year>
</date>
<date date-type="accepted" iso-8601-date="2018-12-12">
<day>12</day>
<month>12</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2019 The Author(s)</copyright-statement>
<copyright-year>2019</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://datascience.codata.org/articles/10.5334/dsj-2019-004/"/>
<abstract>
<p>VAMDC bridged the gap between atomic and molecular (A&amp;M) producers and users by providing an interoperable e-infrastructure connecting A&amp;M databases, as well as tools to extract and manipulate those data. The current paper highlights how the new paradigms for data citation produced by the Research Data Alliance in order to address the citation issues in the data-driven science landscape, have successfully been implemented on the VAMDC e-infrastructure.</p>
</abstract>
<kwd-group>
<kwd>database</kwd>
<kwd>data citation</kwd>
<kwd>Research Data Alliance</kwd>
<kwd>Scholix</kwd>
<kwd>atomic data</kwd>
<kwd>molecular data</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec>
<title>1 Introduction</title>
<p>For the last decades, data and software have redefined the way of carrying out science (Hey et al. (<xref ref-type="bibr" rid="B11">2009</xref>)). The current volumes and complexity of data that are now being collected, produced and processed, and their inevitable increase require new tools, techniques and ways of working. A number of principles and best practices for the management of scientific data have arisen, and a consensus is being reached around themes such as data identification (Wittenburg et al. (<xref ref-type="bibr" rid="B17">2017</xref>)) or FAIR principles (Wilkinson et al. (<xref ref-type="bibr" rid="B16">2016</xref>)).</p>
<p>In this fast evolving landscape of data-intensive science, the <italic>citation</italic> is an anchor: it remains a key element in the production of new knowledge, since it enhances trust (the new results are based on proven/solid bases and a scientist does not need to prove again a used result), makes the process described by the cited work reproducible and gives credits to the author of the cited intellectual product. According to the FAIR principles, most of the data should be re-used in derived works: the role of <italic>Citation</italic> is crucial in open-data-driven science. However, the classical citation paradigm used in scientific papers (mostly hand-made bibliographies and referring to other papers) is incompatible with the current data-deluge (Bell et al. (<xref ref-type="bibr" rid="B2">2009</xref>)): on one hand a huge number of digital data (with disparate origins) may be used in a given paper; on the other hand the evolution of digital data is very rapid and not systematically reported.</p>
<p>In the context of the Virtual Atomic and Molecular Data Centre we aimed at addressing these issues at the data-community level and in 2014 we joined the Research Data Alliance. The RDA, through its Data Citation Working Group<xref ref-type="fn" rid="n1">1</xref> and RDA/WDS Scholarly Link Exchange (Scholix) Working Group,<xref ref-type="fn" rid="n2">2</xref> has defined new models for citation in the digital era.</p>
<p>In this paper, after recalling some technical elements of the VAMDC e-infrastructure (section 2.1) and the recommendations coming from the RDA-Data Citation WG and Scholix WG (respectively section 2.2 and 2.3), we focus on how these recommendations are implemented ov0er the existing VAMDC E-infrastructure (section 3 and 4 respectively for Data-Citation and Scholix). After describing the ongoing/further works (section 5) we conclude with discussion (section 6).</p>
</sec>
<sec>
<title>2 Technical framework</title>
<sec>
<title>2.1 The VAMDC e-infrastructure</title>
<p>The Virtual Atomic and Molecular Data Centre (VAMDC, (Dubernet et al. (<xref ref-type="bibr" rid="B8">2016</xref>))) is a political and technical framework for operating and sustaining a worldwide digital research infrastructure, built over two European FP-7 projects ((<xref ref-type="bibr" rid="B9">Dubernet et al. 2010</xref>); (<xref ref-type="bibr" rid="B18">Zw&#246;lf et al. 2014</xref>)). The e-infrastructure federates in an interoperable way about 30 heterogeneous atomic and molecular databases. By providing data producers and compilers a large dissemination platform for their works, VAMDC succeeded in removing the bottleneck between data producers and the wide body of users of that data. The &#8220;V&#8221; of VAMDC stands for &#8220;virtual&#8221; in the sense that the e-infrastructure does not contain data: it is a wrapping for exposing in a unified way a set of heterogeneous databases. An <italic>ad hoc</italic> generic wrapping software, called the <italic>node-software</italic> (Regandell et al. (<xref ref-type="bibr" rid="B13">2018</xref>)) transforms an autonomous database into a VAMDC federated database, called <italic>data-node</italic>. Each <italic>data-node</italic> accepts queries submitted in a standard grammar (VAMDC SQL Subset (VAMDC Consortium (<xref ref-type="bibr" rid="B14">2012</xref>)), a subset of SQL as it names indicates) and, by implementing an interoperable data access protocol (Dowler et al. (<xref ref-type="bibr" rid="B7">2010</xref>)) developed by the IVOA,<xref ref-type="fn" rid="n3">3</xref> provides output formatted into a standard XML file (VAMDC XML Schema for Atomic Molecular and Solid Data, VAMDC-XSAMS).<xref ref-type="fn" rid="n4">4</xref> The data-nodes are listed into a specific registry (Benson et al. (<xref ref-type="bibr" rid="B3">2009</xref>)), a sort of yellow pages service for discovering the VAMDC available resources. The current VAMDC registry implementation is derived from the AstroGrid project (Walton (<xref ref-type="bibr" rid="B15">2004</xref>)).</p>
<p>A user wishing to extract data from VAMDC:</p>
<list list-type="bullet">
<list-item><p>may use a VAMDC client software: when the client forms a query, the client asks the registry about the availability and relevance of the data-nodes, and then dispatches the query to the nodes. Each node produces a standard VAMDC-XSAMS file. The client collects the returned file and displays the file&#8217;s content to the user.</p></list-item>
<list-item><p>may submit his/her query directly to the specific node he/she wants to hit, after having discovered it on the registry.</p></list-item>
</list>
<p>From the technical point of view, VAMDC may be seen as a distributed architecture, with no central management system.</p>
</sec>
<sec>
<title>2.2 The RDA recommendation on dynamic data citation</title>
<p>The Research Data Alliance<xref ref-type="fn" rid="n5">5</xref> and its Data Citation Working Group<xref ref-type="fn" rid="n6">6</xref> have provided the researchers and data centers communities with recommendations to identify and cite dynamic data (Asmi et al. (<xref ref-type="bibr" rid="B1">2016</xref>)). The proposed solution relies on a query centric view and the set-up of a <italic>Query Store</italic>. Data should be stored in a versioned time-stamped manner and accessed through queries. The Query Store stores all the identified and time-stamped queries together with the relevant metadata. It also gives access to the the data produced when a given query was executed. Within the context of the RDA recommendation the term &#8220;query&#8221; has to be understood in its wider sense: it stands for any processing mechanism used to extract data from a computer-based system.</p>
<p>We already discussed (Zw&#246;lf et al. (<xref ref-type="bibr" rid="B19">2016</xref>)) how the VAMDC standards have evolved in order to meet the part of the RDA recommendation related to the versioning and to the data-timestamping. In this paper, we focus on the technical details about the implementation of the Query Store, i.e. for storing timestamped queries submitted to the VAMDC infrastructure.</p>
</sec>
<sec>
<title>2.3 The RDA Scholix recommendation</title>
<p>The goal of the Scholix initiative (Burton et al. (<xref ref-type="bibr" rid="B4">2017</xref>)) is to establish a high-level interoperability framework for exchanging information about the links between scholarly literature and data. It is an evolving lightweight set of guidelines that aims to increase interoperability and to enable an open information ecosystem. The objective is to understand systematically what data underpins literature and what literature references data. The Data-Literature Interlinking Service from OpenAIRE (DLI Service)<xref ref-type="fn" rid="n7">7</xref> is the first exemplar aggregation and query service fed by the Scholix open information ecosystem. The Scholix framework, together with the DLI aggregation, is designed to enable other 3rd party services (domain-specific aggregations, integrations with other global services, discovery tools, impact assessments etc).</p>
</sec>
</sec>
<sec>
<title>3 Implementing the Query Store for the VAMDC infrastructure</title>
<p>The RDA Data Citation recommendation is meant for standalone data-repositories and/or for warehouses. It was both technically and politically challenging to implement the RDA recommendation in the case of the distributed VAMDC infrastructure. The solution had to deal with a lot of constraints:</p>
<list list-type="bullet">
<list-item><p>any evolution of the infrastructure automatically impacts all the connected databases (there are about 30 connected databases nowadays).</p></list-item>
<list-item><p>as a consequence, the majority of the VAMDC Consortium members must validate any technological evolution of the infrastructure.</p></list-item>
</list>
<p>Any adopted solution must lessen the load on the existing infrastructure members and have minimal implementing costs for each <italic>data-node</italic> owner. These constraints suggested to embed part of the solution into the <italic>node-software</italic> (cf. par. 2.1).</p>
<p>Our implementation of the Query Store consists of two distinct software elements:</p>
<list list-type="bullet">
<list-item><p>an overlay to (and embedded into) the existing VAMDC <italic>node software</italic>, thus independent from any specific database.</p></list-item>
<list-item><p>a set of centralized asynchronous web-services, which may be seen as a smart log-service. In what follows we will call this element <italic>Query-Store service</italic>.<xref ref-type="fn" rid="n8">8</xref></p></list-item>
</list>
<p>Concerning the data versioning and time-stamping, we have two different mechanisms:</p>
<list list-type="bullet">
<list-item><p>a coarse-grained one: a modification of any publicly available data at a given <italic>data-node</italic> induces an increment in the version of the data-node. We have indeed a mechanism for informing that something has changed on a given <italic>data-node</italic>: in other words, we know that the result of an identical query may be different from one version to the other.</p></list-item>
<list-item><p>a fined-grained one: based on the introduction of the <italic>Version element</italic> into the <italic>VAMDC-XSAMS</italic> standard, as described in (Zw&#246;lf et al. (<xref ref-type="bibr" rid="B19">2016</xref>)). The information contained into the <italic>Version element</italic> indicates which data have changed between two different <italic>data-node</italic> versions.</p></list-item>
</list>
<p>The Query Store is built over the coarse-grained mechanism.</p>
<sec>
<title>3.1 The functioning of the Query Store</title>
<p>For extracting data from VAMDC, the users may query directly a given known <italic>data-node</italic> or use one of the centralized query-clients (e.g. the VAMDC portal, <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://portal.vamdc.eu">https://portal.vamdc.eu</ext-link>). In the latter case, the centralized client software asks the <italic>registries</italic> what are the <italic>data-nodes</italic> able to answer and dispatches to them the query. Any centralized client acts as a relay. This is complitely transparent from the <italic>data-node</italic> perspective and a <italic>data-node</italic> acts in the same way regardless the source of the query it is serving: when a <italic>data-node</italic> receives a query:</p>
<list list-type="bullet">
<list-item><p>it generates a unique <italic>query-token</italic> (this can be seen as a session token associated to the incoming query);</p></list-item>
<list-item><p>it answers the query by producing the <italic>VAMDC-XSAMS</italic> output file, which is returned to the user together with the generated <italic>query-token</italic>. This token is copied both in the header of the answer and in the output file;</p></list-item>
<list-item><p>it notifies to a specific notification service of the <italic>Query-Store service</italic> the <italic>query-token</italic>, the content of the query, the version of the node and the version of the standards used for formatting the output. It is worth noting that this process is not blocking and has no impact on the existing infrastructure whatsoever: the data extraction process is not slowed down and, if the Query Store cannot be reached the user will still receive the <italic>VAMDC-XSAMS</italic> output file.</p></list-item>
</list>
<p>When the <italic>Query-Store service</italic> receives a notification from the <italic>data node</italic>, it stores the received information and reduces the query to a standard form (using the VAMDC SQL-comparator library,<xref ref-type="fn" rid="n9">9</xref> cf. remark 1 for a discussion) and it checks if a semantically identical query has already been submitted to the same <italic>data-node</italic>, having the same node version and working with the same version of the standards:</p>
<list list-type="bullet">
<list-item><p>If there is no such a query, the <italic>Query-Store service</italic> attributes a unique UUID and a timestamp to the new query, downloads the data, i.e. the <italic>VAMDC-XSAMS</italic> output file from the data-node and processes this file in order to extract the bibliographic information (each <italic>VAMDC-XSAMS</italic> file produced by the VAMDC infrastructure includes the references to the articles used for compiling the data) as well as metadata. The relevant metadata are stored and associated with the generated UUID. These metadata are kept permanently, while the downloaded XSAMS data are kept for an arbitrary time and then deleted (cf. remark 2 for a discussion).</p></list-item>
<list-item><p>If such a query is already stored in the <italic>Query-Store service</italic>, the new couple (query time-stamp, query token) is added to the lists of the other time-stamps already associated with the query.</p></list-item>
</list>
<p>The <italic>Query-Store service</italic> permanently keeps the mapping between the UUID and the set of <italic>query-tokens</italic><xref ref-type="fn" rid="n10">10</xref> assigned to a given query. This information is kept for different reasons:</p>
<list list-type="bullet">
<list-item><p>statistics: it is interesting for database owner to know which queries are submitted and how many times a given query is re-submitted. This information is used for reporting to our founders and stakeholders.</p></list-item>
<list-item><p>coherence of the human-interface: a user who has just re-submitted a query which was played for the first time long time ago by another user, may believe that there is some bug on the system if only the original timestamp is returned. By returning all the re-execution timestamp we avoid any ambiguity.</p></list-item>
<list-item><p>troubleshooting and technical support: if something goes wrong on the Query-Store service before it issued the final UUID, we may use the token for identifying the query who generated the problem. Indeed the token is the first element generated into the query-notification pipeline.</p></list-item>
</list>
<p>During the query-submission phase the user has no direct interaction with the <italic>Query-Store service</italic> (as we explained before, the <italic>data-node</italic> that answers the query, notifies directly its action to the Query-Store). When the user receive the data from the <italic>data-node</italic> he/she has no information about the UUID the <italic>Query-Store service</italic> assigned to his/her query. The user may recover the final UUID assigned to his/her query by sending the query token to a specific service endpoint of the Query-Store (plus further optional information, e.g. the user e-mail and/or ORCID, information about the used client, etc&#8230;). This mechanism is implemented into the VAMDC-client software and its complexity is transparent to the scientific-user.</p>
<p>The functioning of the Query-Store is asynchronous. This was a mandatory constraint in order to avoid slowing down the VAMDC-infrastructure with a central bottleneck service. Indeed the <italic>Query-Store service</italic> response time could be slowed down if a huge number of queries comes in at the same time. Moreover computing the uniqueness of an incoming query may take some time if a very large number of queries is already stored. The asynchronous architecture solves these problems. A direct technical consequence of this asynchronous implementation is the combined generation of the associated tokens: the <italic>query-token</italic> and the <italic>query-UUID</italic>.</p>
<p>The unique identifier assigned to each query is resolvable, and is both human and machine actionable. The associated landing page provides the metadata associated with the query, as well as the access to the queried data. Figure <xref ref-type="fig" rid="F1">1</xref> represents a screen capture of the human-oriented landing page, whereas Figure <xref ref-type="fig" rid="F3">3</xref> represents the data model behind the <italic>Query-Store service</italic>.</p>
<fig id="F1">
<label>Figure 1</label>
<caption>
<p>Screen capture of the human-oriented landing page for a given query. The &#8220;Data source&#8221;, &#8220;Data source version&#8221;, &#8220;XSAMS version&#8221;, &#8220;Query&#8221; fields indicate respectively which <italic>data-node</italic> produced the result, the version of the <italic>data-node</italic>, the version of the standards when the query was processed, and the content of the query. The &#8220;Query identifier&#8221; is the UUID assigned by the <italic>Query-Store service</italic> to this query. The &#8220;Query Result downloaded on&#8221; list recall when this query was submitted (or re-submitted) and the &#8220;References&#8221; list contains the bibliographic references used for compiling the output file. For these, it is possible to switch between a tabular or a BibTex view (cf. figure <xref ref-type="fig" rid="F2">2</xref>). Finally a link gives access to the output file produced by the <italic>data-node</italic> while answering the query.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g1.jpg"/>
</fig>
<fig id="F2">
<label>Figure 2</label>
<caption>
<p>Screen capture of the human-oriented landing page for a given query where a BibTex view is chosen for displaying the references. By clicking on the &#8220;Switch to References&#8221; button, one goes back to the display of figure <xref ref-type="fig" rid="F1">1</xref>.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g2.jpg"/>
</fig>
<p>As we can see in Figure <xref ref-type="fig" rid="F3">3</xref>, some personal information is stored into the <italic>Query-Store service</italic> (mainly the query submitted by the user). This information is kept only for internal purpose and in order to get a better user experience (cf. par 4.1). Because of this personal information and in the context of the European General Data Protection Regulation, we are registrating the <italic>Query-Store service</italic> with the CNIL (French National Agency regulating Data Protection).<xref ref-type="fn" rid="n11">11</xref> All public interfaces of the <italic>Query-Store service</italic> are completely de-identified by virtually cutting the link &#8220;submitted-by&#8221; between the <italic>Submission</italic> and the <italic>Author</italic> classes: the queries contained into the <italic>Query-Store service</italic> may be browsed online in their anonymized form at the web-page: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://cite.vamdc.eu.">https://cite.vamdc.eu</ext-link>.</p>
<fig id="F3">
<label>Figure 3</label>
<caption>
<p>UML graphical representation of the data model used for organizing the metadata available in the <italic>Query-Store service</italic>.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g3.png"/>
</fig>
<p><bold>Remark 1</bold> Comparing the sematic equivalence of two SQL queries is a problem which admits neither analytical nor close solution. This implies that &#8216;false negative&#8217; may exist in the <italic>VAMDC-SQL comparator</italic> library which is built using the ANTLR parser.<xref ref-type="fn" rid="n12">12</xref> Indeed if two queries are considered identical, they are actually identical; however in some minority cases, two semantically identical queries may be considered different.</p>
<p><bold>Remark 2</bold> Most of the queries processed by the VAMDC e-infrastructure are not used in a published work. It is therefore neither possible nor reasonable, to store the <italic>VAMDC-XSAMS</italic> data produced by all the queries for a very long term; data deletion is an operational requirement. The deletion mechanisms works as follow: the <italic>XSAMS</italic>-data produced by the <italic>data-node</italic> and stored on the <italic>Query-Store service</italic> are deleted only if the last (re-)execution of the query dates more than an arbitrary defined duration (5 years in our implementation). In other words, the data are not deleted if the query is too old but if the last query invocation is too old. Some specific queries (e.g. the queries associated with the Hidrogen H-&#945; emission line, at a wavelength of 656.28 nm, are commonly used for solar observations or detecting Hidrogen in space nebulae) may be very old, but re-executed daily. It is worth noting that only the <italic>XSAMS</italic>-data associated with the queries may deleted. All the other information (Data Source name and version, the query syntax and identifier, re-execution timestamps, bibliographic references) are kempt permanently.</p>
<p>The <italic>XSAMS</italic>-data associated with queries which have been assigned a DOI (i.e. which have been uploaded to Zenodo, cf. section 4.1) will never been deleted, regardless of their age.</p>
</sec>
</sec>
<sec>
<title>4 Implementing Scholix for the Query Store</title>
<p>The Scholix recommendation is not implemented directly on the Query Store, but is a consequence of the interlinking between the <italic>Query-Store service</italic> and the Zenodo open science repository.<xref ref-type="fn" rid="n13">13</xref></p>
<sec>
<title>4.1 Interlinking the Query Store with Zenodo</title>
<p>As we highlighted in the remark 2, most of the queries are not cited by published works, and after an arbitrary time the underlying data are deleted from the <italic>Query-Store service</italic>. On the other hand, we have queries generating data used and cited in published works. A lifetime access must be provided to these data. The interconnection with Zenodo provides the Query Store with Scholix functionalities and with lifetime access to the query-generated data.</p>
<p>The link between the <italic>Query-Store service</italic> and Zenodo is implemented using, on the Query Store side, the Zenodo public REST API.<xref ref-type="fn" rid="n14">14</xref> As we see in Figure <xref ref-type="fig" rid="F1">1</xref>, when a user uses the <italic>Query-Store service</italic> for displaying the information related to a given query, a button &#8220;Get a DOI&#8221; is displayed (if the query has not already been assigned a DOI). By clicking on this button, the user may trig the Zenodo registration process:<xref ref-type="fn" rid="n15">15</xref> the file associated with the query is uploaded to Zenodo using the &#8220;Data Set&#8221; upload type and all the query-associated metadata are copied to corresponding Zenodo fields. In particular:</p>
<list list-type="bullet">
<list-item><p>the author of the upload is set to &#8220;VAMDC consortium&#8221;;</p></list-item>
<list-item><p>the title and the description are generated automatically starting from the query itself, the node producing the data and the query execution context (timestamp, token,&#8230;);</p></list-item>
<list-item><p>the license chosen for the data being uploaded is &#8220;CC4 By&#8221;, with open access;</p></list-item>
<list-item><p>the bibliographic references extracted from the data-file by the <italic>Query-Store service</italic> while it processed the query (cf. paragraph 3.1), are copied into the &#8220;References&#8221; fields. The authors of these references also populates the &#8220;Contributors&#8221; fields;</p></list-item>
<list-item><p>a reverse link, pointing from Zenodo to the <italic>Query-Store service</italic> query-entry, is introduced by putting into the field &#8220;Related Identifier &#8211; Is identical to&#8221; the resolvable persistent identifier of the query on the Query Store side (cf. remark 4 for a discussion about the relevance of this link).</p></list-item>
</list>
<p>When the upload process finishes successfully, Zenodo provides the <italic>Query-Store service</italic> with a DOI and with a deposition identifier, that the Query Store curators may use further for administrating the upload on the Zenodo side. These two identifiers are stored on the <italic>Query-Store service</italic> and associated to the query. The deposition identifier is never returned to the users. When a user displays a query which has already been copied to Zenodo, the button &#8220;Get a DOI&#8221; is replaced by a DOI badge (cf. Figure <xref ref-type="fig" rid="F4">4</xref>). By clicking on this badge, the corresponding Zenodo record is displayed on the user screen (cf. Figure <xref ref-type="fig" rid="F5">5</xref>). This page also contains the instructions for citing this query-record. Different export fromats are supported (cf. Figure <xref ref-type="fig" rid="F6">6</xref>): Figure <xref ref-type="fig" rid="F7">7</xref> gives an example of the Bibtex citation format. A citation for the query record we used for our example is (Consortium VAMDC (<xref ref-type="bibr" rid="B5">2018</xref>)).</p>
<fig id="F4">
<label>Figure 4</label>
<caption>
<p>When a DOI is assigned, the &#8220;Get a DOI&#8221; button (cf. figures <xref ref-type="fig" rid="F1">1</xref> or <xref ref-type="fig" rid="F2">2</xref>) is replaced by the DOI badge.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g4.jpg"/>
</fig>
<fig id="F5">
<label>Figure 5</label>
<caption>
<p>Partial screen-shot of the Zenodo-record landing page: The mentioned query token is the one generated by the node serving the query (cf. section 3.1). The set of references are those provided by the <italic>Query-Store service</italic> during the submission phase. One can also see on the right side the <italic>reverse link</italic> (Related Identifiers) pointing to the original query record on the VAMDC side.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g5.jpg"/>
</fig>
<fig id="F6">
<label>Figure 6</label>
<caption>
<p>Partial screen-shot of the Zenodo-record landing page: this part of the screen displays the instruction for citing the current query-record.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g6.jpg"/>
</fig>
<fig id="F7">
<label>Figure 7</label>
<caption>
<p>Bibtex format to be used for citing the query-record of this example.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g7.png"/>
</fig>
<p>Moreover, the data deletion mechanism described in 2 is suspended for all the queries associated with a DOI (in other words, the underlying data are kept permanently on the Query Store side as well).</p>
<p>Since Zenodo is indexed in OpenAIRE,<xref ref-type="fn" rid="n16">16</xref> and since the latter implements Scholix through its Data-Literature Interlinking Service,<xref ref-type="fn" rid="n17">17</xref> all the VAMDC queries registered by the Query Store in Zenodo are included in those infrastructures. Therefore when some data extracted from VAMDC are cited (in papers and/or other datasets) through the DOI obtained by the couple (Query Store/Zenodo), the authors of the works referenced by the VAMDC data receive credits automatically.</p>
<p>What has been described above is typical of the interoperability virtuous circle: if a system A implements some interoperability protocols and a system B implements some other ones, than a wrapping between A and B will disclose to A the interoperability capabilities of B. One could say that the interoperability-capabilities propagation speed is greater than the interoperability-protocols adoption speed.</p>
<p><bold>Remark 3</bold> The upload-to-Zenodo process was described in this paragraph from a human point of view. In our implementation this process may also be completely machine actionated. Indeed, the computer architecture of the services described through this paper relies on a set of REST services. A user, or a computer program, may interact with these services by sending parameters (using GET and/or POST methods) to specific endpoints. All these services respond by providing JSON formatted output which may be automatically parsed. The Graphical Web User interface we presented in Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F2">2</xref> and <xref ref-type="fig" rid="F4">4</xref> are part of a lightweight html5 layer for interacting with and formatting output from these REST services.</p>
<p><bold>Remark 4</bold> As Zenodo is an open repository (any person who has, e.g., a Github or an ORCID account may upload his/her productions to Zenodo), one has to pay the greatest attention to the provenance and to the scientific relevance of the works shared through this repository. In this context, the reverse link pointing from Zenodo to the Query-Store query-entry (see. Figure <xref ref-type="fig" rid="F5">5</xref>) gives users quality and provenance assurance on the shared datasets: the reverse link states that the data come from a well-known and documented database.</p>
</sec>
</sec>
<sec>
<title>5 On-going and further works</title>
<p>The VAMDC funders and stakeholders regularly ask us to report on the outcomes of their investments, to track and demonstrate they have been used efficently. In this context the VAMDC Query-Store may play a double role: on one hand it may increase the impact of VAMDC (cf. section 5.1) and on the other it constitutes a fine-grained reporting tool (cf. section 5.2).</p>
<sec>
<title>5.1 The Query-Store impact</title>
<p>As we underlined in (Moreau et al. (<xref ref-type="bibr" rid="B12">2018</xref>)), from the start of the VAMDC project in 2009, one of our goal has been to increase the citation impact of data producers. Indeed we find that the current status of citing spectroscopic data is to cite the database. It should be stressed that atomic and molecular data require months to be either measured or calculated, and therefore it is a loss of visibility and recognition that only databases be cited in users&#8217; papers. We believe that the Query Store coupled to the VAMDC portal now allow this flaw to be overcome, even if additional refinements need to be carried out: on January 2018 we have started the deployement of the Query-Store data citation capabilities in the production environment. Currently these are deployed over a third of the <italic>Data-nodes</italic> of the VAMDC infrastructure. Since January 2018 the <italic>Query-Store service</italic> received &#8764;2000 queries. From these, &#8764;180 unique queries have been identified. The link between the Query-Store and Zenodo have been added in May 2018. Since then &#8764;10 queries received a DOI.</p>
<p>This paper is the first peer-reviewed work where the technical details of the VAMDC Query Store are described, whereas (Moreau et al. (<xref ref-type="bibr" rid="B12">2018</xref>)) is the first article where the atomic/molecular science-aspects linked with the Query Store are discussed: at this point in time we cannot be sufficiently objective for evaluating how the usage of the VAMDC infrastructure has been altered by the implementation of the Query Store.</p>
<p>We would like the Query-Store to boost the usage of VAMDC. For that reason, we have recently started collaborating with the main Astronomy and Physics Journal editors so that they may have their paper-submission workflows adapted for interacting directly with the VAMDC Query Store. Our goal is to make the Query Store indispensable for all the author publishing papers citing atomic or molecular data. During the submission phase, the author may put references to data using the DOIs assigned by the Query Store, as it is already the case for papers. We are working with editors for achieving this integration. All the actors will obtain benefits: VAMDC will increase its impact and its usage, editors will gain an efficient tool for data-paper linking and data producers/providers will benefit from the automatic citation mechanisms. From the earlier discussion with editors, we have identified some improvement targets, described in section 5.3.</p>
</sec>
<sec>
<title>5.2 Refining the level of autorhip</title>
<p>As described in on (FAIR Data (<xref ref-type="bibr" rid="B10">2018</xref>)) (Recommendation 6), data practitioners should facilitate the inclusion of a wide range of indicators for the assessment of the scientific and technical contributions to data-related activities: provision of data infrastructure ans services should be recognized and rewarded accordingly. In order to be able to measure all the contributions (together with the specific role of each contributor) we are planning to extend the range of metadata to be sent to Zenodo. Indeed Zenodo adopts the DataCite Metadata Schema for the Publication and Citation of Research Data (DataCite Metadata Working Group (<xref ref-type="bibr" rid="B6">2017</xref>)). This schema is very rich and contains several optional parameterswe we would like to exploit: we think that the fields &#8220;Contributor &#8211; Data Collector&#8221;, &#8220;Contributor &#8211; Data Curator&#8221;, &#8220;Contributor &#8211; Data Manager&#8221; are very valuable since, by filling those fields, we may mention and acknowledge with bibliometric credits the work of people involved in VAMDC data-infrastructure maintenance and curation. Nowadays this technical work is anonymous and mostly invisible for the scientific final users of VAMDC.</p>
<p>The VAMDC registry (cf. 2.1) already contains the names of the scientific and technical maintainers of each <italic>data-node</italic>, however, there do not exist machine actionable mechanisms for extracting this information from the current version of the registry. We are developing such a service: while registering to Zenodo a query processed by a given <italic>data-node</italic>, the <italic>Query-Store service</italic> will extract -directly and on the fly- from the registries the information about the scientific and technical curator of the <italic>data-node</italic>. The Zenodo fields concerning the &#8220;Data Contributors&#8221; will be populated accordingly.</p>
</sec>
<sec>
<title>5.3 Clustering queries</title>
<p>In order to enhance the user experience, we would like to provide new services for clustering a set of queries and assign to the cluster a DOI. The service we are designing relies on user-authentication and authorization. An authenticated user will be able to:</p>
<list list-type="bullet">
<list-item><p>create a new query-Cluster. He/she will automatically be the first author of the freshly created cluster;</p></list-item>
<list-item><p>add other contributors to an existing query-Cluster (he/she is first-authoring). The new authors may be added by their identifier (typically their ORCID) and by specifying their contribution rank to the cluster (e.g. 2nd author, 3rd author,&#8230;);</p></list-item>
<list-item><p>add/remove queries to a cluster he/she is co-authoring. Only the queries whose data have not yet been deleted may be added to a cluster). The automatic data deletion mechanism (cf. par. 4.1) will also be blocked for the queries belonging to a query cluster;</p></list-item>
<list-item><p>publish to Zenodo the query cluster he/she is first-authoring. This will assign a DOI to the cluster.</p></list-item>
</list>
<p>In Figure <xref ref-type="fig" rid="F8">8</xref> we represent all the metadata (together with their structure) attached to a given query cluster. As we can see on Figure <xref ref-type="fig" rid="F8">8</xref> we have three different levels of authorship:</p>
<list list-type="bullet">
<list-item><p>the author contributing to the cluster. An author may contribute to the cluster, without having any submitted query;</p></list-item>
<list-item><p>the author who processed a given Query attached to the cluster (this author is always part of the authors of the cluster);</p></list-item>
<list-item><p>the author who wrote a paper referenced by the result of a Query.</p></list-item>
</list>
<fig id="F8">
<label>Figure 8</label>
<caption>
<p>Graphical representation of the meta data associated with a query cluster: Each cluster is created and/or modified by specific contributors (a first author, a second author, etc.). Each author may add to the clusters the Queries he/she performed (provided the query related data are still present and not deleted from the system, cf. remark 2). Each Query produces a result by extracting data from a specific data source. Each result has a list of references, i.e. the list of all the publication used for compiling the result. The unique identifier of each Cluster is resolvable and the associated landing page will contain all the cluster-associated metadata, together with access to the underlying data (i.e. all the data coming from the extraction performed by the queries composing the cluster).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="dsj-18-867-g8.png"/>
</fig>
<p>All these three levels of authorship are important and, through the Zenodo metadata schema (cf. par. 4.1), will receive credits when a given Query Cluster is cited through its DOI.</p>
<p>While implementing these additional features to the VAMDC Query Store, we will pay great attention to follow the RDA recommendations on Data Collections.<xref ref-type="fn" rid="n18">18</xref></p>
</sec>
</sec>
<sec>
<title>6 Concluding remarks</title>
<p>Through this paper we exposed how the new RDA data citation paradigms have been implemented on the VAMDC distributed e-infrastructure and how we succeeded in removing the technical barriers linked with the automatic data-citation and with the delegation of credits for VAMDC-extracted data. However, the success of a technical solution does not only depend on its intrinsic quality, but also on its level of adoption by the user-community: we are focusing our efforts:</p>
<list list-type="bullet">
<list-item><p>on increasing the impact of the described citation services through community awareness-raising and training around these new tools, as we suggested in (Moreau et al. (<xref ref-type="bibr" rid="B12">2018</xref>)).</p></list-item>
<list-item><p>Working with editors for integrating the VAMDC Query Store in the paper submission workflows (for paper citing atomic and molecular data, cf. section 5.1).</p></list-item>
</list>
<p>We are focusing our efforts in this collaboration with editors because we believe this is a key action to consolidate the VAMDC position as a leading infrastructure for sharing atomic and molecular data.</p>
</sec>
</body>
<back>
<fn-group>
<fn id="n1"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.rd-alliance.org/groups/data-citation-wg.html">https://www.rd-alliance.org/groups/data-citation-wg.html</ext-link>.</p></fn>
<fn id="n2"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.rd-alliance.org/groups/data-citation-wg.html">https://www.rd-alliance.org/groups/rdawds-scholarly-link-exchange-scholix-wg</ext-link>, which is a follow up of the RDA/WDS Publishing Data Services WG (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html">https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html</ext-link>).</p></fn>
<fn id="n3"><p>International Virtual Observatory Alliance.</p></fn>
<fn id="n4"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://standards.vamdc.eu/dataModel/vamdcxsams/index.html#vamdcxsamslanguage-index">https://standards.vamdc.eu/dataModel/vamdcxsams/index.html#vamdcxsamslanguage-index</ext-link>.</p></fn>
<fn id="n5"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.rd-alliance.org">https://www.rd-alliance.org</ext-link>.</p></fn>
<fn id="n6"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.rd-alliance.org/groups/data-citation-wg.html">https://www.rd-alliance.org/groups/data-citation-wg.html</ext-link>.</p></fn>
<fn id="n7"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://scholexplorer.openaire.eu/index.html#/api">https://scholexplorer.openaire.eu/index.html#/api</ext-link>.</p></fn>
<fn id="n8"><p>We implemented these web-services using the Java Servlet technology&#169;. The source code is released with a &#8216;Creative Commons 4 (By, Nd, Nc)&#8217; license on GitHub: https://github.com/VAMDC/QueryStore.</p></fn>
<fn id="n9"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://github.com/VAMDC/VamdcSqlRequestComparator">https://github.com/VAMDC/VamdcSqlRequestComparator</ext-link>.</p></fn>
<fn id="n10"><p>A query may be re-executed several times. Each execution has a different <italic>query-token</italic>.</p></fn>
<fn id="n11"><p>Since the Paris Observatory hosts the <italic>Query-Store service</italic> and is the legal representative of the VAMDC Consortium, we are subject to French law.</p></fn>
<fn id="n12"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.antlr.org/">https://www.antlr.org/</ext-link>.</p></fn>
<fn id="n13"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://zenodo.org">https://zenodo.org</ext-link>.</p></fn>
<fn id="n14"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://developers.zenodo.org">https://developers.zenodo.org</ext-link>.</p></fn>
<fn id="n15"><p>Automatic checks are implemented in order to avoid to register twice a given query.</p></fn>
<fn id="n16"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.openaire.eu">https://www.openaire.eu</ext-link>.</p></fn>
<fn id="n17"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://scholexplorer.openaire.eu/index.html">https://scholexplorer.openaire.eu/index.html</ext-link>.</p></fn>
<fn id="n18"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://rd-alliance.org/group/research-data-collections-wg/outcomes/rda-research-data-collections-wg-recommendations">https://rd-alliance.org/group/research-data-collections-wg/outcomes/rda-research-data-collections-wg-recommendations</ext-link>.</p></fn>
</fn-group>
<ack>
<title>Acknowledgements</title>
<p>We would like to thank the anonymous reviewers for their comments, which helped us in improving the clarity of this article.</p>
<p>Support for VAMDC has been provided through the <italic>VAMDC</italic> and the <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.sup-vamdc.vamdc.org/"><italic>SUP@VAMDC</italic></ext-link> projects funded under the &#8220;Combination of Collaborative Projects and Coordination and Support Actions&#8221; scheme of the Seventh Framework Program. Call topic: INFRA-2008-1.2.2 and INFRA-2012 Scientific Data Infrastructure. Grant Agreement numbers: 239108 and 313284.</p>
<p>The Query Store was partially funded by the European Project RDA EU3 (funded under H2020-EINFRA-2014-2, project ID: 653194).</p>
<p>We acknowledge support from Paris Astronomical Data Center of Paris Observatory.</p>
</ack>
<sec>
<title>Competing Interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="confproc"><string-name><surname>Asmi</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Rauber</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Pr&#246;ll</surname>, <given-names>S</given-names></string-name> and <string-name><surname>van Uytvanck</surname>, <given-names>D</given-names></string-name>. <year>2016</year>. <article-title>Citing Dynamic Data &#8211; Research Data Alliance working group recommendations</article-title>. In: <conf-name>EGU General Assembly Conference Abstracts, volume 18, EGU General Assembly Conference Abstracts, EPSC2016&#8211;7456</conf-name>. <conf-date>April 2016</conf-date>.</mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="webpage"><string-name><surname>Bell</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Hey</surname>, <given-names>T</given-names></string-name> and <string-name><surname>Szalay</surname>, <given-names>A</given-names></string-name>. <year>2009</year>. <article-title>Beyond the Data Deluge</article-title>. <source>Science</source>, <volume>323</volume>(<issue>5919</issue>): <fpage>1297</fpage>&#8211;<lpage>1298</lpage>. ISSN 0036-8075. URL: <uri>https://science.sciencemag.org/content/323/5919/1297</uri>. DOI: <pub-id pub-id-type="doi">10.1126/science.1170411</pub-id></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="journal"><string-name><surname>Benson</surname>, <given-names>K</given-names></string-name>, <string-name><surname>Plante</surname>, <given-names>R</given-names></string-name>, <string-name><surname>Auden</surname>, <given-names>E</given-names></string-name>, <string-name><surname>Graham</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Benson</surname>, <given-names>K</given-names></string-name>, <string-name><surname>Plante</surname>, <given-names>R</given-names></string-name>, <string-name><surname>Greene</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Hill</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Linde</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Morris</surname>, <given-names>D</given-names></string-name>, <string-name><surname>O&#8217;Mullane</surname>, <given-names>W</given-names></string-name>, <string-name><surname>Rixon</surname>, <given-names>G</given-names></string-name>, <string-name><surname>St&#233;b&#233;</surname>, <given-names>A</given-names></string-name> and <string-name><surname>Andrews</surname>, <given-names>K</given-names></string-name>. <year>2009</year>. <article-title>IVOA Registry Interfaces Version 1.0</article-title>. <source>IVOA Recommendation</source> <day>04</day> <month>November</month> 2009.</mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="journal"><string-name><surname>Burton</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Fenner</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Haak</surname>, <given-names>W</given-names></string-name> and <string-name><surname>Manghi</surname>, <given-names>P</given-names></string-name>. <month>November</month> <year>2017</year>. <article-title>Scholix Metadata Schema for Exchange of Scholarly Communication Links</article-title>. DOI: <pub-id pub-id-type="doi">10.5281/zenodo.1120265</pub-id></mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="journal"><collab>Consortium VAMDC</collab>. <article-title>VAMDC extraction with identifier = 17053a9ae56e-451b-9bd2-8e0cddda0d5d</article-title>, <year>May 2018</year>. DOI: <pub-id pub-id-type="doi">10.5281/zenodo.1620773</pub-id></mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="webpage"><collab>DataCite Metadata Working Group</collab>. <year>2017</year>. <article-title>DataCite Metadata Schema Documentation for the Publication and Citation of Research Data. Version 4.1. DataCite e.V</article-title>. <uri>https://schema.datacite.org/meta/kernel-4.1/index.html</uri>. DOI: <pub-id pub-id-type="doi">10.5438/0014</pub-id></mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="journal"><string-name><surname>Dowler</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Rixon</surname>, <given-names>G</given-names></string-name> and <string-name><surname>Tody</surname>, <given-names>D</given-names></string-name>. <year>2010</year>. <article-title>Table Access Protocol Version 1.0</article-title>. <source>IVOA Recommendation</source> <day>27</day> <month>March</month> 2010.</mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="journal"><string-name><surname>Dubernet</surname>, <given-names>ML</given-names></string-name>, <string-name><surname>Antony</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Ba</surname>, <given-names>Y-A</given-names></string-name>, <string-name><surname>Babikov</surname>, <given-names>Y</given-names></string-name>, <string-name><surname>Bartschat</surname>, <given-names>K</given-names></string-name>, <string-name><surname>Boudon</surname>, <given-names>V</given-names></string-name>, <string-name><surname>Braams</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Chung</surname>, <given-names>H-K</given-names></string-name>, <string-name><surname>Daniel</surname>, <given-names>F</given-names></string-name>, <string-name><surname>Delahaye</surname>, <given-names>F</given-names></string-name>, <string-name><surname>Del Zanna</surname>, <given-names>G</given-names></string-name>, <string-name><surname>de Urquijo</surname>, <given-names>J</given-names></string-name>, <string-name><surname>Dimitrijevic</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Domaracka</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Doronin</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Drouin</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Endres</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Fazliev</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Gagarin</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Gordon</surname>, <given-names>I</given-names></string-name>, <string-name><surname>Gratier</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Heiter</surname>, <given-names>U</given-names></string-name>, <string-name><surname>Hill</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Jevremovic</surname>, <given-names>D</given-names></string-name>, <string-name><surname>Joblin</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Karsprzak</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Krishnakumar</surname>, <given-names>E</given-names></string-name>, <string-name><surname>Leto</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Loboda</surname>, <given-names>PA</given-names></string-name>, <string-name><surname>Louge</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Maclot</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Marinkovic</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Markwick Kemper</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Marquart</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Mason</surname>, <given-names>H</given-names></string-name>, <string-name><surname>Mason</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Mendoza</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Mihajlov</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Millar</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Moreau</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Mulas</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Pakhomov</surname>, <given-names>Y</given-names></string-name>, <string-name><surname>Palmeri</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Pancheshnyi</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Perevalov</surname>, <given-names>VI</given-names></string-name>, <string-name><surname>Piskunov</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Postler</surname>, <given-names>J</given-names></string-name>, <string-name><surname>Quinet</surname>, <given-names>EL</given-names></string-name>, <string-name><surname>S&#225;nchez</surname>, <given-names>PQ</given-names></string-name>, <string-name><surname>Ralchenko</surname>, <given-names>Y</given-names></string-name>, <string-name><surname>Rhee</surname>, <given-names>Y-J</given-names></string-name>, <string-name><surname>Rixon</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Rothman</surname>, <given-names>L</given-names></string-name>, <string-name><surname>Roueff</surname>, <given-names>E</given-names></string-name>, <string-name><surname>Ryabchikova</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Sahal-Brechot</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Scheier</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Schlemmer</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Schmitt</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Stempels</surname>, <given-names>E</given-names></string-name>, <string-name><surname>Tashkun</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Tennyson</surname>, <given-names>J</given-names></string-name>, <string-name><surname>Tyuterev</surname>, <given-names>V</given-names></string-name>, <string-name><surname>Vujcic</surname>, <given-names>V</given-names></string-name>, <string-name><surname>Wakelam</surname>, <given-names>V</given-names></string-name>, <string-name><surname>Walton</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Zatsarinny</surname>, <given-names>O</given-names></string-name>, <string-name><surname>Zeippen</surname>, <given-names>C</given-names></string-name> and <string-name><surname>Zw&#246;lf</surname>, <given-names>CM</given-names></string-name>. <year>2016</year>. <article-title>The Virtual Atomic and Molecular Data Centre (VAMDC) Consortium</article-title>. <source>Journal of Physics B: Atomic, Molecular and Optical Physics</source>, <volume>49</volume>(<issue>7</issue>). DOI: <pub-id pub-id-type="doi">10.1088/0953-4075/49/7/074003</pub-id></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="journal"><string-name><surname>Dubernet</surname>, <given-names>ML</given-names></string-name>, <string-name><surname>Boudon</surname>, <given-names>V</given-names></string-name>, <string-name><surname>Culhane</surname>, <given-names>JL</given-names></string-name>, <string-name><surname>Dimitrijevic</surname>, <given-names>MS</given-names></string-name>, <string-name><surname>Fazliev</surname>, <given-names>AZ</given-names></string-name>, <string-name><surname>Joblin</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Kupka</surname>, <given-names>F</given-names></string-name>, <string-name><surname>Leto</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Le Sidaner</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Loboda</surname>, <given-names>PA</given-names></string-name>, <string-name><surname>Mason</surname>, <given-names>HE</given-names></string-name>, <string-name><surname>Mason</surname>, <given-names>NJ</given-names></string-name>, <string-name><surname>Mendoza</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Mulas</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Millar</surname>, <given-names>TJ</given-names></string-name>, <string-name><surname>Nu&#241;ez</surname>, <given-names>LA</given-names></string-name>, <string-name><surname>Perevalov</surname>, <given-names>VI</given-names></string-name>, <string-name><surname>Piskunov</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Ralchenko</surname>, <given-names>Y</given-names></string-name>, <string-name><surname>Rixon</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Rothman</surname>, <given-names>LS</given-names></string-name>, <string-name><surname>Roueff</surname>, <given-names>E</given-names></string-name>, <string-name><surname>Ryabchikova</surname>, <given-names>TA</given-names></string-name>, <string-name><surname>Ryabtsev</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Sahal-Br&#233;chot</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Schmitt</surname>, <given-names>B</given-names></string-name>, <string-name><surname>Schlemmer</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Tennyson</surname>, <given-names>J</given-names></string-name>, <string-name><surname>Tyuterev</surname>, <given-names>VG</given-names></string-name>, <string-name><surname>Walton</surname>, <given-names>NA</given-names></string-name>, <string-name><surname>Wakelam</surname>, <given-names>V</given-names></string-name> and <string-name><surname>Zeippen</surname>, <given-names>CJ</given-names></string-name>. <year>2010</year>. <article-title>Virtual atomic and molecular data centre</article-title>. <source>J. Quant. Spectrosc. &amp; Rad. Transfer</source>, <volume>111</volume>: <fpage>2151</fpage>&#8211;<lpage>2159</lpage>. <month>Oct</month>, 2010. DOI: <pub-id pub-id-type="doi">10.1016/j.jqsrt.2010.05.004</pub-id></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="webpage"><collab>European Commission Expert Group on FAIR Data</collab>. <year>2018</year>. <article-title>Turning FAIR into reality</article-title>. <source>Final report and Action Plan</source>. URL: <uri>http://www.codata.org/news/254/62/Turning-FAIR-Data-into-Reality-Report-and-Action-Plan-Consultation-until-5-August</uri>.</mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="webpage"><string-name><surname>Hey</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Tansley</surname>, <given-names>S</given-names></string-name> and <string-name><surname>Tolle</surname>, <given-names>K</given-names></string-name>. <year>2009</year>. <article-title>The Fourth Paradigm: Data-Intensive Scientific Discovery</article-title>. <source>Microsoft Research</source>, <month>October</month>. ISBN: 978-0-9825442-0-4. URL: <uri>https://www.microsoft.com/en-us/research/publication/fourth-paradigm-data-intensive-scientific-discovery/</uri>.</mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="journal"><string-name><surname>Moreau</surname>, <given-names>N</given-names></string-name>, <string-name><surname>Zw&#246;lf</surname>, <given-names>CM</given-names></string-name>, <string-name><surname>Ba</surname>, <given-names>Y-A</given-names></string-name>, <string-name><surname>Richard</surname>, <given-names>C</given-names></string-name>, <string-name><surname>Boudon</surname>, <given-names>V</given-names></string-name> and <string-name><surname>Dubernet</surname>, <given-names>M-L</given-names></string-name>. <year>2018</year>. <article-title>The VAMDC Portal as a major vector of atomic and molecular data citation</article-title>. <source>Galaxies</source>, pages galaxies-326995.</mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="journal"><string-name><surname>Regandell</surname>, <given-names>S</given-names></string-name>, <string-name><surname>Marquart</surname>, <given-names>T</given-names></string-name> and <string-name><surname>Piskunov</surname>, <given-names>N</given-names></string-name>. <month>March</month> <year>2018</year>. <article-title>Inside a VAMDC data node &#8211; putting standards into practical software</article-title>. <source>Physica Scripta</source>, <volume>93</volume>(<issue>3</issue>). DOI: <pub-id pub-id-type="doi">10.1088/1402-4896/aaa268</pub-id></mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="webpage"><collab>VAMDC Consortium</collab>. <year>2012</year>. <article-title>VAMDC SQL Subset, version 2</article-title>. <source>VAMDC standard</source>. <uri>http://vamdc.eu/documents/standards/queryLanguage/vss2.html</uri>.</mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="confproc"><string-name><surname>Walton</surname>, <given-names>N</given-names></string-name>. <year>2004</year>. <article-title>Meeting the User Science Challenge for a Virtual Universe</article-title>. In: <conf-name>Toward An International Virtual Observatory: Proceedings Of The Eso-esa-nasa-nsf Conference Held At Garching</conf-name>, <fpage>188</fpage>. <conf-loc>Germany</conf-loc>, <conf-date>10&#8211;14 June 2002</conf-date>. DOI: <pub-id pub-id-type="doi">10.1007/10857598_29</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Wilkinson</surname>, <given-names>MD</given-names></string-name>, <string-name><surname>Caselli</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Pon</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Belloche</surname>, <given-names>A</given-names></string-name> and <string-name><surname>Andr&#233;</surname>, <given-names>P</given-names></string-name>. <month>March</month> <year>2016</year>. <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>. <source>Scientific Data</source>, <volume>3</volume>. Online. DOI: <pub-id pub-id-type="doi">10.1038/sdata.2016.18</pub-id></mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Wittenburg</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Hellstr&#246;m</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Zw&#246;lf</surname>, <given-names>C-M</given-names></string-name>, <string-name><surname>Abroshan</surname>, <given-names>H</given-names></string-name>, <string-name><surname>Asmi</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Di Bernardo</surname>, <given-names>G</given-names></string-name>, <string-name><surname>Couvreur</surname>, <given-names>D</given-names></string-name>, <string-name><surname>Gaizer</surname>, <given-names>T</given-names></string-name>, <string-name><surname>Holub</surname>, <given-names>P</given-names></string-name>, <string-name><surname>Hooft</surname>, <given-names>R</given-names></string-name>, <string-name><surname>H&#228;ggstr&#246;m</surname>, <given-names>I</given-names></string-name>, <string-name><surname>Kohler</surname>, <given-names>M</given-names></string-name>, <string-name><surname>Koureas</surname>, <given-names>D</given-names></string-name>, <string-name><surname>Kuchinke</surname>, <given-names>W</given-names></string-name>, <string-name><surname>Milanesi</surname>, <given-names>L</given-names></string-name>, <string-name><surname>Padfield</surname>, <given-names>J</given-names></string-name>, <string-name><surname>Rosato</surname>, <given-names>A</given-names></string-name>, <string-name><surname>Staiger</surname>, <given-names>C</given-names></string-name>, <string-name><surname>van Uytvanck</surname>, <given-names>D</given-names></string-name> and <string-name><surname>Weigel</surname>, <given-names>T</given-names></string-name>. <month>December</month> <year>2017</year>. <article-title>Persistent identifiers: Consolidated assertions</article-title>. Status of November, 2017. DOI: <pub-id pub-id-type="doi">10.5281/zenodo.1116189</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="confproc"><string-name><surname>Zw&#246;lf</surname>, <given-names>CM</given-names></string-name>, <string-name><surname>Dubernet</surname>, <given-names>M-L</given-names></string-name>, <string-name><surname>Ba</surname>, <given-names>Y-A</given-names></string-name> and <string-name><surname>Moreau</surname>, <given-names>N</given-names></string-name>. <month>May</month> <year>2014</year>. <article-title>Experience and feedbacks from the sustainability for the virtual atomic and molecular data centre E-infrastructure</article-title>. In: <conf-name>IST-Africa Conference Proceedings</conf-name>, <fpage>1</fpage>&#8211;<lpage>9</lpage>. DOI: <pub-id pub-id-type="doi">10.1109/ISTAFRICA.2014.6880621</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="journal"><string-name><surname>Zw&#246;lf</surname>, <given-names>CM</given-names></string-name>, <string-name><surname>Moreau</surname>, <given-names>N</given-names></string-name> and <string-name><surname>Dubernet</surname>, <given-names>M-L</given-names></string-name>. <month>September</month> <year>2016</year>. <article-title>New model for datasets citation and extraction reproducibility in VAMDC</article-title>. <source>Journal of Molecular Spectroscopy</source>, <volume>327</volume>: <fpage>122</fpage>&#8211;<lpage>137</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jms.2016.04.009</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>