Logo image
Towards Efficient Structured Description Generation for Data Marketplace Offerings
Conference proceeding   Open access

Towards Efficient Structured Description Generation for Data Marketplace Offerings

M. Awan, A. Nadeem, J. R. Santana, P. Sotres, T. Bousselin, M. Costalonga and T. Elsaleh
IEEE Conference on Standards for Communications and Networking (Online), pp.1-7
IEEE Conference on Standards for Communications and Networking (IEEE CSCN 2025) (Bologna, Italy, 15/09/2025–17/09/2025)
15/09/2025

Abstract

data assets data exchange data products data spaces Decoding Digital economy Documentation Interoperability Large language models Manuals marketplace Metadata metadata generation Pipelines Standards Transforms
As data marketplaces are expected to become a prominent theme in the digital economy, whereby data assets are being generated today more than ever, and their potential to transform how value can be unlocked, the automation of metadata description generation will become a necessity for data asset discoverability, especially as data volumes explode and manual documentation becomes unsustainable. Therefore, semi-autonomous means are required to bring down the barrier to entry for data providers. As an extension of the Data Space concept, marketplaces advertise their assets or products through "offerings", that enable the discoverability of an asset or bundle of assets, published by data providers to a catalogue, which data consumers in turn can use for querying. Recent advances in large language models (LLMs) and constrained decoding techniques enable schema-compliant, semi-automated metadata generation, reducing manual overhead and improving discoverability. We propose a schema-aware, edge-optimized LLM pipeline for generating structured descriptions for data asset offerings in the SEDIMARK marketplace, with evaluation on realistic information models.
pdf
SEDIMARK_Offering_Generator_Paper-21.67 MBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access
url
https://cscn2025.ieee-cscn.org/View
Event WebsiteConference website

Metrics

238 File views/ downloads
79 Record Views

Details

Logo image

Usage Policy