Dwarves
Memo
Type ESC to close search bar

History of Structured Outputs for LLMs

Overview

When Large Language Models (LLMs) becomes popular and an essential tool for growing businesses, it signals a transition toward more complex and efficient data processing. Instead of outputting raw text, the models may now generate structured data in formats such as JSON or XML. This allowed the information to be directly integrated into company databases, removing the need for human processing. Businesses can use structured outputs to streamline their data workflows, decrease processing time, and improve the accuracy and reliability of their analytics.

Why Structured Output is Needed

A survey of 51 industry professionals investigated the contexts and motivations behind limitations placed on Large Language Models. The findings identified two key constraint categories: low-level and high-level. Low-level constraints focus on technical aspects, guaranteeing the generated content adheres to a specific format and length. High-level constraints, on the other hand, address semantic and stylistic aspects, ensuring the outputs are meaningful, avoid factual errors (hallucination), and maintain a desired style. By implementing these constraints, developers can streamline the development process, improve the user experience by ensuring consistent and clear outputs, and ultimately guarantee the quality and usability of what LLMs produce.

The first signals in the emergence of structured output

Pre-2022:

During 2022:

Late 2022/Early 2023:

During 2023:

During 2024:

There’s been a clear progression from user workarounds and external frameworks to functionalities embedded directly within LLMs. We’ve seen increased control over output format, with advancements like JSON schemas and schema-example combinations. Research continues to address accuracy, flexibility, and seamless integration of structured output functionalities across LLM platforms.

Challenge and Future of structured output

Structured output strives for a balance between two seemingly opposed forces:

Challenges to Overcome

The Future of Structured Output

By addressing these challenges and continuing research, structured output has the potential to unlock exciting possibilities:

Reference