Conference Tools for 2022 TMS Annual Meeting & Exhibition

Login

Register as a New User

Help

Submit An Abstract

Propose A Symposium

Presenter/Author Tools

Organizer/Editor Tools

About this Abstract
Meeting	2022 TMS Annual Meeting & Exhibition
Symposium	ICME Case Studies: Successes and Challenges for Generation, Distribution, and Use of Public/Pre-Existing Materials Datasets
Presentation Title	NOW ON-DEMAND ONLY - Hard Fought Lessons on Open Data and Code Sharing and the Terra Infirma of Ground Truth
Author(s)	Jason Hattrick-Simpers
On-Site Speaker (Planned)	Jason Hattrick-Simpers
Abstract Scope	The use of machine learning (ML) in the physical sciences has stimulated the discovery of exciting new phase change materials, amorphous alloys, and catalysts. But even scientifically sound AI models are only as dependable as the labels and values upon which they are built. The continued success of these methods relies upon the availability of open data, meta-data, and scientific code that are findable, accessible, interoperable and reusable (F.A.I.R.). I will discuss our successes and failures in creating the first F.A.I.R. multi-institution combinatorial dataset and code repository. I will also discuss the tenuousness of ground truth, the need for openly preserving expert disagreement within scientific data sets, and challenges associated with aggregating data from the open literature. This will drive home the difficulties in forming and capturing expert consensus, the impact of consensus variance on ML model evaluation, and the need to recreate important datasets that are born digital.
Proceedings Inclusion?	Planned:
Keywords	ICME, Machine Learning,

OTHER PAPERS PLANNED FOR THIS SYMPOSIUM

A Quest for Re-using 3D Materials Data

A Validation Framework for Microstructure-sensitive Fatigue Simulation Models

Added Value and Increased Organization: Capturing Experimental Data Provenance in Materials Commons 2.0

Challenges in Producing, Curating, and Sharing Large Multimodal, Multi-institutional Data Sets for Additive Manufacturing

Data-driven Model Based Comparison of Public Datasets for Online State of Charge Estimation in Lithium-ion Batteries

Filling Data Gaps in 3D Microstructure with Deep Learning

Generating, Sharing, and Using Halide Perovskite Exploratory Synthesis Data to Discover New Materials

Graph Convolutional Neural Networks for Fast, Accurate Prediction of Material Properties for Solid Solution High Entropy Alloys Using Open-source Datasets

Holistic Merging of Experimental and Computational Datasets – A Case Study for Diffusion Coefficients

Materials Innovation and Design Enabled by the Materials Project

Mg Database Project: Mapping Trends and Data Sets of Magnesium and Its Alloys for Improved Mechanical Performance

NOW ON-DEMAND ONLY - Hard Fought Lessons on Open Data and Code Sharing and the Terra Infirma of Ground Truth

The Status of ML Algorithms for Structure-property Relationships Using Matbench as a Test Protocol

Questions about ProgramMaster? Contact programming@programmaster.org