Machine Learning (ML) and Knowledge Graphs (KGs) possess a symbiotic relationship with the potential to mutually enhance their capabilities. ML can play a pivotal role in the construction of KGs by automating ontology design, inferring classes and relations from alternative sources, and aiding data curators in making informed decisions. ML algorithms need data to be in a "certain form", meaning that they need to get the data prepared and ensure high-quality data used as input. Poor-quality datasets can compromise ML systems in several ways. The four data quality categories of intrinsic, contextual, representational, and accessibility are relevant to different stages of the ML development pipeline. For example, intrinsic data quality categories include accuracy, completeness, and consistency, while contextual data quality categories include bias, relevance, and validity. Conversely, KGs can enrich ML models through node or graph embedding techniques, link prediction, supporting explainability and improving the overall performance of data-driven models. Low quality KGs, i.e., those not fit to be used, may lead to biased or inaccurate ML models, hindering their ability to generate meaningful insights or make informed decisions. The growing range of ML data management guidelines, frameworks and standards presents practitioners with a vast range of possible criteria to aspire to, on top of the traditional data management practices that were established in previous decades.
This workshop aims to explore the intricate interplay of data quality, ML, and KGs, elucidating limitations in assessment methodologies, proposing effective methods for objective quality assessment, and addressing challenges on ML and AI in general, verify if and to what extent well-known quality metrics are compliant with ML-based quality assessment, and addressing FAIR principles. We also welcome proposals riding the path of Explainable AI, Large Language Models, Generative AI, and any AI-driven approach that can be applied to the Semantic Web technologies to support and enhance data quality assessment and improvement.
New approaches for performing Data quality assessment or improvement of Knowledge Graphs via Machine Learning
Applications combining Machine Learning and Knowledge Graphs dealing with Data Quality concerns:
Submissions can fall in one of the following categories:
We welcome contributions presenting
Papers must comply with the CEUR-WS template. Papers are submitted in PDF format via the workshop’s Open Review submission page.
Accepted papers (after blind review of at least 2 experts) will be published by CEUR–WS.
At least one of the authors of the accepted papers must register for the workshop (pre-conference only option) to be included into the workshop proceedings. Information about registration can be found on the ESWC 2024 official page.
The program will include an inspiring keynote! Stay tuned!
Maria Angela Pellegrino | Anisa Rula | Jose Emilio Labra Gayo | Michael Cochez | Mehwish Alam |
University of Salerno | University of Brescia | University of Oviedo | Vrije Universiteit Amsterdam | Institut Polytechnique de Paris |
Italy | Italy | Spain | the Netherlands | France |