ProgramMaster Logo
Conference Tools for 2022 TMS Annual Meeting & Exhibition
Login
Register as a New User
Help
Submit An Abstract
Propose A Symposium
Presenter/Author Tools
Organizer/Editor Tools
About this Abstract
Meeting 2022 TMS Annual Meeting & Exhibition
Symposium ICME Case Studies: Successes and Challenges for Generation, Distribution, and Use of Public/Pre-Existing Materials Datasets
Presentation Title NOW ON-DEMAND ONLY - Hard Fought Lessons on Open Data and Code Sharing and the Terra Infirma of Ground Truth
Author(s) Jason Hattrick-Simpers
On-Site Speaker (Planned) Jason Hattrick-Simpers
Abstract Scope The use of machine learning (ML) in the physical sciences has stimulated the discovery of exciting new phase change materials, amorphous alloys, and catalysts. But even scientifically sound AI models are only as dependable as the labels and values upon which they are built. The continued success of these methods relies upon the availability of open data, meta-data, and scientific code that are findable, accessible, interoperable and reusable (F.A.I.R.). I will discuss our successes and failures in creating the first F.A.I.R. multi-institution combinatorial dataset and code repository. I will also discuss the tenuousness of ground truth, the need for openly preserving expert disagreement within scientific data sets, and challenges associated with aggregating data from the open literature. This will drive home the difficulties in forming and capturing expert consensus, the impact of consensus variance on ML model evaluation, and the need to recreate important datasets that are born digital.
Proceedings Inclusion? Planned:
Keywords ICME, Machine Learning,

OTHER PAPERS PLANNED FOR THIS SYMPOSIUM

A Quest for Re-using 3D Materials Data
A Validation Framework for Microstructure-sensitive Fatigue Simulation Models
Added Value and Increased Organization: Capturing Experimental Data Provenance in Materials Commons 2.0
Challenges in Producing, Curating, and Sharing Large Multimodal, Multi-institutional Data Sets for Additive Manufacturing
Data-driven Model Based Comparison of Public Datasets for Online State of Charge Estimation in Lithium-ion Batteries
Filling Data Gaps in 3D Microstructure with Deep Learning
Generating, Sharing, and Using Halide Perovskite Exploratory Synthesis Data to Discover New Materials
Graph Convolutional Neural Networks for Fast, Accurate Prediction of Material Properties for Solid Solution High Entropy Alloys Using Open-source Datasets
Holistic Merging of Experimental and Computational Datasets – A Case Study for Diffusion Coefficients
Materials Innovation and Design Enabled by the Materials Project
Mg Database Project: Mapping Trends and Data Sets of Magnesium and Its Alloys for Improved Mechanical Performance
NOW ON-DEMAND ONLY - Hard Fought Lessons on Open Data and Code Sharing and the Terra Infirma of Ground Truth
The Status of ML Algorithms for Structure-property Relationships Using Matbench as a Test Protocol

Questions about ProgramMaster? Contact programming@programmaster.org