Tag

Data Provenance

All articles tagged with #data provenance

Flawed datasets cast doubt on AI tools predicting diabetes and stroke
science1 month ago

Flawed datasets cast doubt on AI tools predicting diabetes and stroke

Researchers found that 124 papers used two Kaggle datasets to train stroke- and diabetes-prediction models that may be built on fabricated data; some models are already in clinical use in Indonesia, Spain, and the US, with journals investigating; irregular data patterns—such as unreal completeness and duplicated values—cast doubt on reliability, prompting calls for data-source disclosure and removal of the dubious datasets to prevent flawed clinical decisions.

"Trustworthy A.I. Data: How Big Companies Are Navigating the Challenge"
technology2 years ago

"Trustworthy A.I. Data: How Big Companies Are Navigating the Challenge"

A consortium of companies, including American Express, Humana, IBM, Pfizer, UPS, and Walmart, has developed data provenance standards to address concerns about the lineage and trustworthiness of data used in artificial intelligence (A.I.) applications. The standards serve as a labeling system that describes the origin, history, legal rights, and intended use of data. By providing greater clarity and transparency, the standards aim to bolster corporate confidence in A.I. technology. The initiative is expected to improve efficiency in data handling and increase the reliability of A.I.-generated answers. The standards are set to be made available to the public early next year.