Eurac Research CLARIN Centre (ERCC) repository
About and Policies
- Mission Statement
- Terms of Service
- About Repository
- About Eurac Research and the IAL
- License Agreement and Contracts
- Intellectual Property Rights
- Metadata Policy
- Preservation Policy
- Citing Data Policy
The ultimate objective of CLARIN ERIC, to which also the ERCC adheres, is to advance research in humanities and social sciences by giving researchers unified single sign-on access to a platform which integrates language-based resources and advanced tools at a European level. This shall be implemented by the construction and operation of a shared distributed infrastructure that aims at making language resources, technology and expertise available to the humanities and social sciences research communities at large.
To know more about CLARIN ERIC visit CLARIN-ShortGuide.pdf
Terms of Service
To achieve our mission statement, we set out some ground rules through the Terms of Service. By accessing or using any kind of data or services provided by the Repository, you agree to abide by the Terms contained in the above mentioned document.
Data in the Eurac Research CLARIN Centre (ERCC) repository are made available under the licence attached to the resources. In case there is no licence, data is made freely available for access, printing and download for the purposes of non-commercial research or private study. Users must acknowledge in any publication, the Deposited Work using a persistent identifier (see Citing Data), its original author(s)/creator(s), and any publisher where applicable. Full items must not be harvested by robots except transiently for full-text indexing or citation analysis. Full items must not be sold commercially unless explicitly granted by the attached licence without formal permission of the copyright holders.
About the Repository
It is like a library for linguistic data and tools. This means, using it, you can
- Search for data and tools and easily download them.
- Deposit your data and be sure it is safely stored, everyone can find it, use it, and correctly cite it (giving you credit)
About Eurac Research and the Institute for Applied Linguistics
In South Tyrol, multilingualism involves geographic, institutional, social and personal aspects. The main aim of research at the Institute for Applied Linguistics (IAL) is to provide scientific answers to current issues of language and education policy as well as to economic and social questions at both the local and international level. Our activities are targeted towards applied research (carrying out research projects and networking), training and consulting (consulting services, monitoring, seminars) as well as dissemination (scientific publications, dictionaries, data bases, corpora). IAL is part of Eurac Research which was funded in 1992 to strengthen research in the South Tyrol area. Today over 400 scientists from over 30 countries work here in a wide number of different research fields.
License Agreement and Contracts
At the moment, ERCC distinguishes three types of contracts.
- For every deposit, before accepting the resources from the submitter/depositor, the submitter has to sign the so-called "Distribution License Agreement", in which we describe our rights and duties and the submitter acknowledges that they have the right to submit the data and give us (the repository centre) the right to distribute the data on their behalf.
- Everyone who downloads data is bound by the licence assigned to the item - in order to download protected data, one has to be authenticated and needs to electronically sign the licence. A list of available licenses in our repository can be found here.
- For submitters, there is a possibility for setting custom licences to items during the submission workflow.
Intellectual Property Rights
All Intellectual Property Rights concerning the provided resources belong to the original creator. As mentioned in the section License Agreement and Contracts, we require the depositor of data to sign a Distribution License Agreement, which specifies that they have the right to submit the data and gives us (the repository centre) the right to distribute the data on their behalf. This means that depositors are solely responsible for taking care of IPR issues before publishing data by submitting them to us.
Should anyone have a suspicion that any of the datasets in our repository violate Intellectual Property Rights, they should contact us immediately at our Help Desk.
Deposited content must be accompanied by sufficient metadata describing its content, provenance and formats in order to support its preservation and dissemination. Metadata are freely accessible and are distributed in the public domain (under CC0). However, we would like to be informed about commercial usage of metadata from the ERCC repository including a description of your use case at the Help Desk.
The Eurac Research CLARIN Centre is committed to the long-term care of items deposited in its repository, to preserve the research and to help in keeping research replicable and strives to adopt the current best practice in digital preservation. See the Mission Statement. We follow best practice guidelines, standards and regulations set forth by CLARIN, OAIS and/or Eurac Research.
In order to stay a reliable and trustworthy repository, we undergo periodical assessments by CLARIN ERIC.
To fulfill the commitments, the repository ensures that datasets are ingested and distributed in accordance with their license (see agreements and contracts). Sometimes (for licenses that do not permit public access) this means only authorized users can access the dataset.
The submission workflow as described in deposit and the work of our editors ensures discoverability (by requiring accurate metadata) via our search engine, externally through OAI-PMH and in page metadata for certain web crawlers. Metadata are freely accessible.
There are various automated procedures including fixity checks, to ensure integrity of the submitted datasets and completeness of metadata. On the system level we employ various on-site and off-site backup strategies and hardware monitoring. The datasets are accessible online.
We view data and tools as primary research outputs, each submission receives a Persistent IDentifier for reference and the users are guided to use them. Changes in a dataset after it has been published are not permitted, new submission is required instead. The old and new submissions are linked through their metadata (see new version guide for more details).
Through regular participation in CLARIN activities, Open Repositories and various other meetings, schools and conferences, the repository staff is informed of new developments in technologies and/or initiatives.
The various export options offered by the repository system (DSpace) ensures that data and their metadata are not locked in and can be moved to a different repository system.
The repository encourages the usage of specific file formats as recommended by CLARIN. The preferred file formats will change over time, in which case the repository will make every effort to migrate to other formats, while keeping originals intact for reproducibility purposes (ie. migrated item will be a new repository record linked to the old). The guiding principles for format selection are: open standards are preferred over proprietary standards, formats should be well-documented, verifiable and proven, text-based formats are preferred over binary formats where possible, in the case of digitalization of analogue signal lossless or no compression is recommended.
In the case of a withdrawal of funding, the repositories content would be transferred to another CLARIN centre.
Citing Data Policy
Please refer to the separate page on Citing Data.