EOTLAB

Publications

Back to all publications

Konferenzpaper · 2025

Designing Grammar-Guided LLM Outputs for Open Data Integration – A DSR Approach to IoT Data Platforms

Dennis M. Riehle, Arnold F. Arz von Straussenburg, Timon T. Aldenhoff

In Local Solutions for Global Challenges : 20th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2025, Montego Bay, Jamaica, June 2-4, 2025, Proceedings, Part I

DOI

Abstract

This paper designs and implements an artifact for converting unstructured or semi-structured open data into outputs conforming to the OGC SensorThings API (STA). Motivated by the growing influx of heterogeneous data in Internet-of-Things environments, the study employs an Action Design Research process to apply formalized grammars to Large Language Models (LLMs) to produce valid, STA-compliant JSON documents. Early prototypes using JSON schemas and Pydantic models highlighted the need for stricter control mechanisms to handle real-world open data complexity. Evaluation across multiple open data sources demonstrates the effectiveness of grammar-driven constraints in reducing malformed or incomplete outputs. Three smaller LLMs—Qwen 2.5 Instruct, Llama 3.1 Instruct, and Phi-4—were tested, showing that grammar length and input context can significantly influence output quality and model throughput. The findings underscore the advantages of embedding strict syntax requirements without sacrificing flexibility for diverse use cases. While domain-level validation (e.g., verifying realistic time-series values) remains a future direction, this research confirms the promise of grammar-based generation for streamlining data ingestion in IoT platforms. The approach facilitates more consistent and maintainable pipelines, potentially boosting interoperability and data quality in sensor-driven environments.

Cite As

Riehle, D. M., Arz von Straussenburg, A. F., & Aldenhoff, T. T. (2025). Designing Grammar-Guided LLM Outputs for Open Data Integration – A DSR Approach to IoT Data Platforms. In S. Chatterjee, J. Brocke, & R. Anderson (Eds.), Local Solutions for Global Challenges : 20th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2025, Montego Bay, Jamaica, June 2-4, 2025, Proceedings, Part I (pp. 178–195). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-93976-1_12

BibTeX

@inproceedings{Riehle2025Designing,
	address = {Cham},
	author = {Riehle, Dennis M. and Arz von Straussenburg, Arnold F. and Aldenhoff, Timon T.},
	booktitle = {Local {Solutions} for {Global} {Challenges} : 20th {International} {Conference} on {Design} {Science} {Research} in {Information} {Systems} and {Technology}, {DESRIST} 2025, {Montego} {Bay}, {Jamaica}, {June} 2-4, 2025, {Proceedings}, {Part} {I}},
	doi = {10.1007/978-3-031-93976-1_12},
	editor = {Chatterjee, Samir and Brocke, Jan and Anderson, Ricardo},
	year = {2025},
	pages = {178--195},
	organization = {Springer Nature Switzerland},
	title = {Designing {Grammar}-{Guided} {LLM} {Outputs} for {Open} {Data} {Integration} -- {A} {DSR} {Approach} to {IoT} {Data} {Platforms}},
	url = {https://doi.org/10.1007/978-3-031-93976-1_12},
}