# Fileset

[MikikoTanifuji-COAR2019final4handsout.pdf](https://mdr.nims.go.jp/filesets/aa2169c5-770e-454e-b417-0ef9b3ec3729/download)

## Creator

[TANIFUJI, Mikiko](https://orcid.org/0000-0001-5284-6364)

## Rights

Creative Commons BY-NC-ND Attribution-NonCommercial-NoDerivs 4.0 International[Creative Commons BY-NC-ND Attribution-NonCommercial-NoDerivs 4.0 International](https://creativecommons.org/licenses/by-nc-nd/4.0/)

## Other metadata

[Materials Data Repository](https://mdr.nims.go.jp/datasets/206a5f9f-a3b7-4a97-a9e9-585438e6f448)

## Fulltext

Microsoft PowerPoint - MikikoTanifuji-COAR2019final4handsout.pptxMikikoTanifujiManaging Director, Materials Data Platform CenterNational Institute for Materials Science, Tsukuba (NIMS), JapanCOAR Annual Meeting & General Assembly, May 21-23, 2019 @ Lyon, France2Materials Data Repository‐ Scenario: a workflow from experimental facilities to a data repositoryData repositoryExperimental facilitiesElectronic Laboratory notebookloT3Materials Data Repository(binary) （machine-readable)           (with high legibility)  (with informative items)Materials Data Repository“Schema-on-Read” data registration with user-customizable formatsSpectral data automatically sparse-modeledOptimization of measurement conditionsData informatics(Community)Inter-operable Data RepositoriesOpen accessPublishing(Journal)‐ Scenario: a workflow from experimental facilities to a data repositoryTechnical challenges 1: Metadata as a data recipe: how to assist researchers?4ExperimentData• Conversion• Annotation• Local data PIDText dataBinary data• Metadata• History PID• Speciments• Units control• etcAPI‐FWK API‐FWKData AcquisitionSystemData StorageSystem(machine‐readable)Vocabulary Platform(wikidata)MDR vocabulary name masterMaterials Data RepositoryOther vocabulary name master5Data-centric-repository systemOpen-source repository softwareData model (Portland Common Data Model)FedoraLibraries (Ruby gem)HyraxSolr (search backend)Digital resource management architectureRDF storeTechnical challenges 2: ResourceSync1. ResourceSync is recommended by the COAR Next Generation Repository report as a successor to OAI-PMH.2. ResourceSync is implemented in the MDR (also OAI-PMH)3. It will allow both the metadata and (in some cases) the content (research data, publications) to be harvested by other services on the network.4. NIMS will be testing this with the Open University’s Core aggregator system in the next few weeks.6Technical challenges 3: Using Persistent Identifiers7(binary) （machine-readable)           (with high legibility)  (with informative items)Materials Data RepositoryInstruments PID MasterNIMS ORCIDNon-NIMS PID Master VocabularyPID MasterConversion tools PID MasterPhysical Units PID MasterData local PID MasterData local PID MasterRepository PID MasterDOIcitationSpecimensPID MasterData informatics(Community)Inter-operable Data RepositoriesOpen accessPublishing(Journal)Technical challenges 4: SWORD8GUIMaterials Data RepositoryData informatics(Community)(binary) （machine-readable)            (with high legibility)  (with informative items)APIOpen Science FrameworkResearch Data Management: RDMValidation (hash, timestamp)API(SWORD)Data Management Plan: DMPInter-operable Data RepositoriesOpen accessPublishing(Journal)5 Challenges of Materials Data Platformfor Open Science1. Quality Identify who/what/when/how Integrity of the data2. Accessibility with Open Data Open data for data publishing Linked data  Data search for machine learning3. Usability  Machine-readability Metadata as data recipes for informatics Data licensing (CC, CC-BY-NC, MIT, etc.)4. Security and Preservation Open data policy User identifications Data preservation  Cyber security 5. Research-aids on the platform Vocabulary assistance (for data curation, collection, conversion for AI) Data analysis software9Summary: How the MDR is following the NGR recommendations1. Exposing Identifiers2. Declaring Licenses at the Resource Level3. Discovery Through Navigation4. Interacting with Resources (Annotation, Commentary, and Review)5. Resource Transfer6. Batch Discovery7. Collecting and Exposing Activities8. Identification of Users9. Authentication of Users10. Exposing Standardized Usage Metrics11. Preserving Resources10