Ax framework breakdown

Overview

TL;DR

Ax is the essential toolkit you wish to have in the new emerging trend of context engineering because it frees you from all the hassles of prompt engineering and enable you to focus more on your business domain logic. You might think it looks wizardry and PhD-level kind of things but at the end of the day it's just string templates.

Template literal signatures: Structured input/output, no more "please output with JSON my life depends on it please 🙏" shenanigans
Fluent workflow engine: Define workflows with declarative fluent API
Advanced optimization: Make your LLM smarter by literally teaching it, using the teacher-student pattern (no cap)

Ax brings DSPy’s signature and optimization to TypeScript. Less prompt maneuver, more context engineering.

You lost me at DSPy, wtf is that?

Let's break it down, Declarative Self-improving Python:

Declarative: refers to the signature pattern
Self-improving: refers to the optimization flow where it learns by your examples
Python: quite self-explanatory

Ax is the faithful port of DSPy, preserving the core concepts.

Problems it fixed

LLM dev in TypeScript used to suck:

No type safety: Find out at runtime your LLM output is garbage
Manual workflows: Wire up multi-step operations by hand like a caveman
Bad prompts: Different prompt works with different model, tweaking your prompt to work correctly is even harder than asking your girl what to eat
Vendor lock-in: Switch providers? Rewrite everything. Fun.

This sounds too good to be true, what's the catch?

Compared to other frameworks/libraries like Mastra, VoltAgent or even the original DSPy itself:

Maturity: obviously because TypeScript is not the de-factor language of choice in the ML world, community adoption is still small. As a result, documentation is not as rich as others
Usecase: Ax takes the doubling down approach on conversational agents, it's no coincidence that most of the examples are just chatbots. So unless you want to omega-optimize your agent to provide 100/10 answers otherwise it's quite overkill

The three foundational pillars

🏛️ Ax Signature (AxSignature): The most primitive unit of Ax, used in everywhere else
🏛️ Ax Flow (AxFlow): Fluent API with nodes that can be defined using Signatures -> Declarative workflows
🏛️ Ax Optimizer (AxBaseOptimizer, AxBootstrapFewShot, AxMiPRO): Help reduce time, cost of using smaller model when optimized

The possibilities that Ax unlocks for you

Wall-of-prompt-be-gone abracadabra

Look mom, no prompts

import { AxAI, ax } from '@ax-llm/ax';

const textToSummarize = `
The technological singularity—or simply the singularity[1]—is a hypothetical future point in time at which technological growth becomes uncontrollable and irreversible, resulting in unforeseeable changes to human civilization.[2][3] ...`;

const ai = new AxAI({
  name: 'openai',
  apiKey: process.env.OPENAI_APIKEY as string,
});

// no prompt, just input and output (*cough* context *cough*)
const gen = ax`textToSummarize -> textType:class "note, email, reminder", shortSummary "summarize in 5 to 10 words"`;

const res = await gen.forward(ai, { textToSummarize });

console.log('>', res);

Agent Smith would be proud

Connect agents together, and they intercommunicate solely on signature (*cough* again context engineering *cough*)

Look mom still no prompts

const researcher = new AxAgent({
  name: 'researcher',
  description: 'Researcher agent',
  signature: `physicsQuestion "physics questions" -> answer "reply in bullet points"`,
});

const summarizer = new AxAgent({
  name: 'summarizer',
  description: 'Summarizer agent',
  signature: `text "text so summarize" -> shortSummary "summarize in 5 to 10 words"`,
});

const agent = new AxAgent({
  name: 'agent',
  description: 'A an agent to research complex topics',
  signature: `question -> answer`,
  agents: [researcher, summarizer],
});

agent.forward(ai, { questions: 'How many atoms are there in the universe' });

Make o4-mini as smart as o4? Hold my beer

You token cost will go up, but isn't it always the case when it comes to education 🤷🏻‍♂️?

import { AxAI, AxChainOfThought, AxMiPRO } from '@ax-llm/ax';

// 1. Setup your AI service
const ai = new AxAI({
  name: 'openai',
  config: {
    model: AxOpenAIModel.O4Mini,
  },
  apiKey: process.env.OPENAI_API_KEY,
});

// 2. Create your program
const program = new AxChainOfThought(`input -> output`);

// 3. Configure the optimizer
const optimizer = new AxMiPRO({
  studentAI: ai,
  examples: trainingData, // Your training examples
  options: {
    numTrials: 20, // Number of configurations to try
    verbose: true,
  },
});

// 4. Define your evaluation metric
// this is where the teaching happens
const metricFn = ({ prediction, example }) => {
  return prediction.output === example.output;
};

// 5. Run the optimization
const result = await optimizer.compile(program, metricFn, {
  valset: validationData, // Optional validation set
  auto: 'medium', // Optimization level
});

// 6. Use the optimized program
const result = await optimizedProgram.forward(ai, { input: 'test input' });

Hopefully by now you're intrigued in what Ax has to offer, read on to if you are

How it works

Architecture overview

graph TD
    DEV[Developer Code]
    TL[Template Literals ax]
    SIG[AxSignature System]
    FLOW[AxFlow Engine]
    OPT[Optimizer Engine]
    AI[Multi-Provider AI Layer]

    DEV --> TL
    TL --> SIG
    SIG --> FLOW
    FLOW --> OPT
    SIG --> AI
    FLOW --> AI
    OPT --> AI

    subgraph "Type System"
        TS[TypeScript Compiler]
        RT[Runtime Validation]
        TL --> TS
        SIG --> RT
    end

    subgraph "Execution Engine"
        PAR[Parallel Planner]
        DEP[Dependency Analyzer]
        EXEC[Step Executor]
        FLOW --> PAR
        PAR --> DEP
        DEP --> EXEC
    end

    subgraph "Optimization Layer"
        BS[Bootstrap FewShot]
        MIPRO[MiPRO v2]
        BAY[Bayesian Optimization]
        OPT --> BS
        OPT --> MIPRO
        MIPRO --> BAY
    end

Request flow

sequenceDiagram
    participant Dev as Developer
    participant TL as Template Literal
    participant Sig as Signature Parser
    participant Flow as Flow Engine
    participant Dep as Dependency Analyzer
    participant AI as AI Provider
    participant Optimizer as Optimizer

    Note over Dev,Optimizer: Signature Creation
    Dev->>TL: ax`userInput:string -> result:string`
    TL->>Sig: Parse template with field builders
    Sig->>Sig: Validate field names & types
    Sig->>Dev: Return typed AxGen instance

    Note over Dev,Optimizer: Flow Execution
    Dev->>Flow: .node().execute().map()
    Flow->>Dep: Analyze dependencies
    Dep->>Flow: Return execution plan
    Flow->>AI: Execute steps (parallel where possible)
    AI->>Flow: Return results
    Flow->>Dev: Type-safe results

    Note over Dev,Optimizer: Optimization
    Dev->>Optimizer: compile(program, metric)
    Optimizer->>AI: Generate instruction candidates
    Optimizer->>AI: Bootstrap few-shot examples
    Optimizer->>Optimizer: Bayesian parameter search
    Optimizer->>Dev: Optimized program + stats

Data structures and algorithms

Signature system (Pillar #1)

AxSignature: The core type definition

class AxSignature {
  private inputFields: AxIField[];
  private outputFields: AxIField[];
  private sigHash: string;
  private validatedAtHash?: string;

  // Template literal parsing with field builder support
  constructor(signature: string | TemplateStringsArray | AxSignatureConfig) {
    if (typeof signature === 'string') {
      const parsed = parseSignature(signature);
      this.inputFields = parsed.inputs.map(this.parseParsedField);
      this.outputFields = parsed.outputs.map(this.parseParsedField);
    }
    this.validateSignatureConsistency();
    [this.sigHash, this.sigString] = this.updateHash();
  }
}

Field builder system

export const f = {
  string: (desc?: string): AxFieldType => ({
    type: 'string',
    description: desc,
  }),
  class: (options: readonly string[], desc?: string): AxFieldType => ({
    type: 'class',
    options,
    description: desc,
  }),
  array: <T extends AxFieldType>(
    baseType: T,
  ): T & { readonly isArray: true } => ({
    ...baseType,
    isArray: true,
  }),
  optional: <T extends AxFieldType>(
    baseType: T,
  ): T & { readonly isOptional: true } => ({
    ...baseType,
    isOptional: true,
  }),
  // Multi-modal types
  image: (desc?: string): AxFieldType => ({ type: 'image', description: desc }),
  file: (desc?: string): AxFieldType => ({ type: 'file', description: desc }),
  url: (desc?: string): AxFieldType => ({ type: 'url', description: desc }),
};

Time complexity: O(1) for field creation, O(n) for signature validation where n = number of fields
Space complexity: O(f) where f = total number of fields across all signatures
Validation performance: Cached validation using SHA-256 hashing to avoid re-validation

Extract and validate response

export const streamingExtractFinalValue = (
  sig: Readonly<AxSignature>,
  values: Record<string, unknown>,
  // eslint-disable-next-line functional/prefer-immutable-types
  xstate: extractionState,
  content: string,
  strictMode = false
) => {
  if (xstate.currField) {
    const val = content.substring(xstate.s).trim();

    const parsedValue = validateAndParseFieldValue(xstate.currField, val);
    if (parsedValue !== undefined) {
      values[xstate.currField.name] = parsedValue;
    }
  }

  // In strict mode, if we have content but no fields were extracted and no current field,
  // this means field prefixes were missing when they should have been present
  if (strictMode && !xstate.currField && xstate.extractedFields.length === 0) {
    const trimmedContent = content.trim();
    if (trimmedContent) {
      // Find the first required field to report in the error
      const outputFields = sig.getOutputFields();
      const firstRequiredField = outputFields.find(
        (field) => !field.isOptional
      );
      if (firstRequiredField) {
        throw new ValidationError({
          message: "Expected field not found",
          fields: [firstRequiredField],
        });
      }
      // If only optional fields exist, ignore unprefixed content in strict mode
    }
  }

  // Check for optional fields that might have been missed by streaming parser
  parseOptionalFieldsFromFullContent(sig, values, content);

  // Check all previous required fields before processing current field
  checkMissingRequiredFields(xstate, values, sig.getOutputFields());
};

Flow execution engine (Pillar #2)

Dynamic signature inference algorithm

private inferSignatureFromFlow(): AxSignature {
  const executionPlan = this.executionPlanner.getExecutionPlan();

  const allProducedFields = new Set<string>();
  const allConsumedFields = new Set<string>();

  // Analyze execution plan for data flow
  for (const step of executionPlan.steps) {
    step.produces.forEach(field => allProducedFields.add(field));
    step.dependencies.forEach(field => allConsumedFields.add(field));
  }

  // Input fields = consumed but not produced
  const inputFields = [...allConsumedFields].filter(f => !allProducedFields.has(f));

  // Output fields = produced but not consumed (special handling for final operations)
  const lastStep = executionPlan.steps[executionPlan.steps.length - 1];
  let outputFields: string[];

  if (lastStep && (lastStep.type === 'map' || lastStep.type === 'merge')) {
    outputFields = lastStep.produces.filter(f => !f.startsWith('_'));
  } else {
    outputFields = [...allProducedFields].filter(f => {
      return !executionPlan.steps.some(step => step.dependencies.includes(f));
    });
  }

  return this.buildSignatureFromFields(inputFields, outputFields);
}

Parallel execution planning

class AxFlowExecutionPlanner {
  createOptimizedExecution(batchSize: number): AxFlowStepFunction[] {
    const groups = this.identifyParallelGroups();
    const optimizedSteps: AxFlowStepFunction[] = [];

    for (const group of groups) {
      if (group.steps.length === 1) {
        optimizedSteps.push(group.steps[0]!);
      } else {
        // Create parallel execution wrapper
        const parallelStep = async (state: AxFlowState, context: any) => {
          const results = await processBatches(
            group.steps,
            async (step, _index) => await step(state, context),
            batchSize,
          );
          // Merge results maintaining execution order
          return results.reduce(
            (merged, result) => ({ ...merged, ...result }),
            state,
          );
        };
        optimizedSteps.push(parallelStep);
      }
    }

    return optimizedSteps;
  }

  private identifyParallelGroups(): AxFlowParallelGroup[] {
    const dependencies = this.analyzeDependencies();
    const groups: AxFlowParallelGroup[] = [];
    const processed = new Set<number>();

    for (let i = 0; i < this.steps.length; i++) {
      if (processed.has(i)) continue;

      const parallelSteps = [this.steps[i]!];
      processed.add(i);

      // Find steps that can run in parallel (no dependencies between them)
      for (let j = i + 1; j < this.steps.length; j++) {
        if (processed.has(j)) continue;

        const canRunInParallel =
          !this.hasDependency(dependencies, i, j) &&
          !this.hasDependency(dependencies, j, i);

        if (canRunInParallel) {
          parallelSteps.push(this.steps[j]!);
          processed.add(j);
        }
      }

      groups.push({
        steps: parallelSteps,
        dependencies: dependencies[i] || [],
      });
    }

    return groups;
  }
}

Optimization algorithms (Pillar #3)

MiPRO v2 implementation

class AxMiPRO extends AxBaseOptimizer {
  async compile(program: AxGen, metricFn: AxMetricFn): Promise<AxMiPROResult> {
    // Step 1: Bootstrap few-shot examples using teacher-student approach
    const bootstrappedDemos = await this.bootstrapFewShotExamples(program, metricFn);

    // Step 2: Generate instruction candidates with contextual awareness
    const instructions = await this.proposeInstructionCandidates(program);

    // Step 3: Bayesian optimization loop
    const { bestConfig, bestScore } = await this.runOptimization(
      program, bootstrappedDemos, labeledExamples, instructions, validationSet, metricFn
    );

    return { demos: bootstrappedDemos, bestScore, optimizedGen: this.createOptimizedProgram(bestConfig) };
  }

  private async runOptimization(...): Promise<{ bestConfig: ConfigType; bestScore: number }> {
    let bestConfig: ConfigType = { instruction: instructions[0], bootstrappedDemos: 1, labeledExamples: 1 };
    let bestScore = 0;

    for (let trial = 0; trial < this.numTrials; trial++) {
      let config: ConfigType;

      if (this.bayesianOptimization && this.configHistory.length > 2) {
        config = await this.selectConfigurationViaBayesianOptimization(instructions, bootstrappedDemos, labeledExamples);
      } else {
        config = this.randomConfiguration(instructions, bootstrappedDemos, labeledExamples);
      }

      const score = await this.evaluateConfig(program, config, validationSet, metricFn);
      this.updateSurrogateModel(config, score);

      if (score > bestScore + this.minImprovementThreshold) {
        bestScore = score;
        bestConfig = config;
      }

      // Early stopping and progress tracking
      if (this.shouldEarlyStop(trial, bestScore)) break;
    }

    return { bestConfig, bestScore };
  }
}

Bayesian optimization with acquisition functions

private calculateAcquisitionValue(config: ConfigType): number {
  const prediction = this.predictPerformance(config);
  const { mean, variance } = prediction;
  const std = Math.sqrt(variance);
  const bestScore = Math.max(...this.configHistory.map(entry => entry.score));

  switch (this.acquisitionFunction) {
    case 'expected_improvement': {
      const improvement = mean - bestScore;
      if (std === 0) return Math.max(0, improvement);

      const z = improvement / std;
      const phi = 0.5 * (1 + this.erf(z / Math.sqrt(2))); // CDF
      const pdfValue = Math.exp(-0.5 * z * z) / Math.sqrt(2 * Math.PI); // PDF

      return improvement * phi + std * pdfValue;
    }

    case 'upper_confidence_bound': {
      return mean + this.explorationWeight * std;
    }

    case 'probability_improvement': {
      const improvement = mean - bestScore;
      if (std === 0) return improvement > 0 ? 1 : 0;

      const z = improvement / std;
      return 0.5 * (1 + this.erf(z / Math.sqrt(2)));
    }
  }
}

Bootstrap few shot execution flow

The teacher-student pattern that makes your prompts actually good:

flowchart TD
    A[Start: compile method] --> B[Initialize parameters<br/>maxRounds, maxDemos, maxExamples]
    B --> C[Reset stats and traces]
    C --> D[Begin round loop<br/>i = 0 to maxRounds]

    D --> E[compileRound: Set temperature = 0.7<br/>Apply token limits if specified]
    E --> F[Random sample examples<br/>up to maxExamples]
    F --> G[Track previous success count]

    G --> H[Begin batch processing<br/>Process examples in batches]
    H --> I[For each batch: Adjust temperature<br/>temp = 0.7 + 0.001 * i]

    I --> J[For each example in batch]
    J --> K[Set remaining examples as demos<br/>excluding current example]
    K --> L[Get Teacher or Student AI]
    L --> M[Increment totalCalls counter]

    M --> N{Try forward pass}
    N -->|Success| O[Get prediction result]
    N -->|Error| P[Log warning and set empty result<br/>Continue bootstrap process]

    O --> Q[Estimate token usage if<br/>cost monitoring enabled]
    Q --> R[Calculate metric score<br/>using metricFn]
    R --> S{Score >= 0.5?}

    S -->|Yes| T[Add to traces<br/>Increment successfulDemos]
    S -->|No| U[Continue to next example]
    P --> U
    T --> V{Traces >= maxDemos?}
    U --> V

    V -->|Yes| W[Exit batch processing]
    V -->|No| X{More examples?}
    X -->|Yes| J
    X -->|No| Y[Check early stopping conditions]

    W --> Y
    Y --> Z{Early stopping enabled<br/>and patience exhausted?}
    Z -->|Yes| AA[Set earlyStopped = true<br/>Break round loop]
    Z -->|No| BB{More rounds?}

    BB -->|Yes| D
    BB -->|No| AA
    AA --> CC{Any traces found?}

    CC -->|No| DD[Throw Error:<br/>No demonstrations found]
    CC -->|Yes| EE[Group traces by keys<br/>Create program demos]

    EE --> FF[Calculate best score<br/>successfulDemos / totalCalls]
    FF --> GG[Return AxOptimizerResult<br/>demos, stats, bestScore, config]

    DD --> HH[End: Error]
    GG --> II[End: Success]

    classDef startEnd fill:#e1f5fe
    classDef process fill:#f3e5f5
    classDef decision fill:#fff3e0
    classDef error fill:#ffebee
    classDef success fill:#e8f5e8

    class A,II,HH startEnd
    class B,C,E,F,G,H,I,K,L,M,O,Q,R,T,U,W,Y,EE,FF,GG process
    class D,J,N,S,V,X,Z,BB,CC decision
    class P,DD error
    class AA success

Key insight: Teacher model quality examples → Student learns patterns → Better few-shot demos for production

MiPRO v2 execution flow

Bayesian optimization that makes your prompts scientifically better:

flowchart TD
    A[Start: compile method<br/>Initialize MIPRO optimizer] --> B[Setup validation examples<br/>20% of training data]
    B --> C[Bootstrap Few-Shot Examples<br/>if maxBootstrappedDemos > 0]

    C --> D{Bootstrapping<br/>needed?}
    D -->|Yes| E[Create AxBootstrapFewShot instance<br/>Run bootstrap compilation using Student AI]
    D -->|No| F[Skip bootstrapping]
    E --> G[Generate bootstrapped demonstrations<br/>via Student AI forward passes]
    F --> G
    G --> H[Select Labeled Examples<br/>Random sampling from training set]

    H --> I[Generate Instruction Candidates<br/>proposeInstructionCandidates]
    I --> J{Context-aware<br/>proposers enabled?}
    J -->|Yes| K[Generate program/dataset summaries<br/>using Teacher AI if available]
    J -->|No| L[Use default instruction templates]
    K --> M[Generate instruction candidates<br/>using Teacher AI with context]
    L --> N[Generate instruction candidates<br/>using fallback templates]
    M --> O[Combine all instruction candidates]
    N --> O

    O --> P[Begin Optimization Loop<br/>runOptimization method]
    P --> Q[Initialize best config and score<br/>Start optimization trials]
    Q --> R[Trial loop: i = 0 to numTrials]

    R --> S{Use Bayesian<br/>optimization?}
    S -->|Yes & history > 2| T[Select config via Bayesian optimization<br/>Use acquisition function]
    S -->|No| U[Random/round-robin config selection<br/>Exploration phase]

    T --> V[Evaluate configuration<br/>evaluateConfig method]
    U --> V
    V --> W[Create test program with config<br/>Apply instruction, demos, examples]

    W --> X{Use minibatch<br/>evaluation?}
    X -->|Yes| Y[Adaptive minibatch size<br/>Stochastic evaluation]
    X -->|No| Z[Full validation set evaluation]

    Y --> AA[For each evaluation example:<br/>Forward pass with Student AI]
    Z --> AA
    AA --> BB{Self-consistency<br/>sampling?}
    BB -->|Yes| CC[Multiple samples with majority vote<br/>using Student AI]
    BB -->|No| DD[Single prediction<br/>using Student AI]

    CC --> EE[Calculate metric score<br/>Average across examples]
    DD --> EE
    EE --> FF[Update surrogate model<br/>Store config-score pair]

    FF --> GG{Score improvement<br/>> threshold?}
    GG -->|Yes| HH[Update best config and score<br/>Reset stagnation counter]
    GG -->|No| II[Increment stagnation rounds]

    HH --> JJ[Update optimization progress]
    II --> JJ
    JJ --> KK{Early stopping<br/>conditions met?}

    KK -->|Cost limits| LL[Stop: Cost limit reached]
    KK -->|Stagnation| MM[Stop: No improvement for N trials]
    KK -->|Target score| NN[Stop: Target score achieved]
    KK -->|No| OO{More trials?}

    OO -->|Yes| R
    OO -->|No| PP[Optimization complete]
    LL --> PP
    MM --> PP
    NN --> PP

    PP --> QQ[Create optimized AxGen instance<br/>Apply best configuration]
    QQ --> RR[Update final statistics]
    RR --> SS[Return AxMiPROResult<br/>optimizedGen, demos, stats, bestScore]

    SS --> TT[End: Success]

    classDef startEnd fill:#e1f5fe
    classDef process fill:#f3e5f5
    classDef decision fill:#fff3e0
    classDef success fill:#e8f5e8

    class A,TT startEnd
    class B,C,G,H,I,K,L,M,N,O,P,Q,V,W,Y,Z,AA,CC,DD,EE,FF,HH,JJ,QQ,RR,SS process
    class D,J,R,S,X,BB,GG,KK,OO decision
    class LL,MM,NN success

The magic: Each trial teaches the algorithm which configurations work → Converges to optimal prompt settings faster than manual tuning

Combined optimization pipeline

How Bootstrap feeds into MiPRO for maximum effectiveness:

sequenceDiagram
    participant Dev as Developer
    participant Bootstrap as Bootstrap FewShot
    participant Teacher as Teacher Model
    participant Student as Student Model
    participant MiPRO as MiPRO v2
    participant Bayes as Bayesian Optimizer
    participant Eval as Evaluator

    Note over Dev,Eval: Phase 1: Bootstrap Demo Generation
    Dev->>Bootstrap: compile(program, metric, examples)
    Bootstrap->>Teacher: Initialize high-quality model
    Bootstrap->>Student: Initialize target model

    loop For each round
        Bootstrap->>Student: Generate outputs with few-shot demos
        Bootstrap->>Eval: Evaluate outputs with metric
        Eval-->>Bootstrap: Success/failure scores
        Bootstrap->>Bootstrap: Collect successful traces
    end

    Bootstrap-->>Dev: High-quality demo collection

    Note over Dev,Eval: Phase 2: Instruction + Hyperparameter Optimization
    Dev->>MiPRO: optimize(program, demos, validation)
    MiPRO->>MiPRO: Generate instruction candidates

    loop For each trial
        MiPRO->>Bayes: Select next configuration
        Bayes-->>MiPRO: instruction + demo counts
        MiPRO->>Eval: Test configuration on validation
        Eval-->>MiPRO: Performance score
        MiPRO->>Bayes: Update surrogate model
    end

    MiPRO-->>Dev: Optimized program with best config

    Note over Dev,Eval: Result: Production-Ready Program

Technical challenges and solutions

Challenge 1: LLM input and output are not typed

Why it's annoying:

TypeScript checks templates at compile time, but LLMs need runtime validation too
Field builders gotta work smoothly with template parsing
Type info can't get lost in the shuffle
Need to handle complex stuff (arrays, optional fields, classes) in templates

The solution: Dual-Phase Processing with Type Preservation

// Phase 1: Template literal processing with field builder integration
export function ax<IN extends AxGenIn, OUT extends AxGenerateResult<AxGenOut>>(
  strings: TemplateStringsArray,
  ...values: readonly AxSignatureTemplateValue[]
): AxGen<IN, OUT> {
  let result = '';

  for (let i = 0; i < strings.length; i++) {
    result += strings[i] ?? '';

    if (i < values.length) {
      const val = values[i];

      // Smart field marker handling for optional/internal fields
      if (isAxFieldType(val)) {
        const fieldNameMatch = result.match(/(\w+)\s*:\s*$/);
        if (fieldNameMatch && (val.isOptional || val.isInternal)) {
          const fieldName = fieldNameMatch[1]!;
          let modifiedFieldName = fieldName;
          if (val.isOptional) modifiedFieldName += '?';
          if (val.isInternal) modifiedFieldName += '!';
          result = result.replace(/(\w+)(\s*:\s*)$/, `${modifiedFieldName}$2`);
        }
        result += convertFieldTypeToString(val);
      }
    }
  }

  return new AxGen<IN, OUT>(result);
}

// Phase 2: Runtime validation with cached results
class AxSignature {
  private validatedAtHash?: string;

  public validate(): boolean {
    if (this.validatedAtHash === this.sigHash) {
      return true; // Use cached validation
    }

    this.inputFields.forEach((field) => validateField(field, 'input'));
    this.outputFields.forEach((field) => validateField(field, 'output'));
    this.validateSignatureConsistency();

    this.validatedAtHash = this.sigHash; // Cache successful validation
    return true;
  }
}

Result: Perfect integration of compile-time type checking with runtime validation, enabling both developer productivity and runtime safety.

Challenge 2: Workflow node also need to be typed (knows the signature input/output)

Why it's a pain:

Workflows can branch, loop, and merge however they want
State changes every step, collecting more fields
Final signature depends on analyzing the whole execution path
Type info can't get corrupted along the way

How we solved it: Analyze execution plans and track type changes

private inferSignatureFromFlow(): AxSignature {
  const executionPlan = this.executionPlanner.getExecutionPlan();

  if (this.nodeGenerators.size === 0 && executionPlan.steps.length === 0) {
    return this.createDefaultSignature();
  }

  // Analyze data flow through execution plan
  const allProducedFields = new Set<string>();
  const allConsumedFields = new Set<string>();

  for (const step of executionPlan.steps) {
    step.produces.forEach(field => allProducedFields.add(field));
    step.dependencies.forEach(field => allConsumedFields.add(field));
  }

  // Input fields = consumed but not produced by any step
  const inputFieldNames = new Set<string>();
  for (const consumed of allConsumedFields) {
    if (!allProducedFields.has(consumed)) {
      inputFieldNames.add(consumed);
    }
  }

  // Special handling for final map/merge operations
  const outputFieldNames = new Set<string>();
  const lastStep = executionPlan.steps[executionPlan.steps.length - 1];

  if (lastStep && (lastStep.type === 'map' || lastStep.type === 'merge')) {
    // Use fields produced by final transformation
    lastStep.produces.forEach(field => {
      if (!field.startsWith('_')) { // Skip internal fields
        outputFieldNames.add(field);
      }
    });

    // Special case: conditional merges that produce _mergedResult
    if (lastStep.type === 'merge' && lastStep.produces.includes('_mergedResult')) {
      // Include all node result fields as potential outputs
      for (const step of executionPlan.steps) {
        if (step.type === 'execute' && step.produces.length > 0) {
          step.produces.forEach(field => outputFieldNames.add(field));
        }
      }
    }
  } else {
    // Standard logic: find leaf fields (produced but not consumed)
    for (const produced of allProducedFields) {
      let isConsumed = false;
      for (const step of executionPlan.steps) {
        if (step.dependencies.includes(produced)) {
          isConsumed = true;
          break;
        }
      }
      if (!isConsumed) {
        outputFieldNames.add(produced);
      }
    }
  }

  return this.buildSignatureFromAnalysis(inputFieldNames, outputFieldNames);
}

The trick: Treat the workflow like a data flow graph, then use graph analysis to figure out the right signature automatically.

The key trick: Copy state immutably plus dependency analysis ensures safe parallel execution without race conditions.

Challenge 3: LLM providers don't like each other

Provider differences:

Different ways to authenticate (API keys, OAuth, custom headers)
Different request/response formats
Different features (image support, function calling, streaming)
Different error handling and retry approaches
Different rate limits and pricing

How we solved it: Layered abstraction that detects what each provider can do

// Base abstraction layer
export abstract class AxBaseAI implements AxAIService {
  abstract getName(): string;
  abstract getModelInfo(): AxModelInfo;
  abstract getCapabilities(): AxModelCapabilities;

  // Unified chat interface
  async chat(req: AxChatRequest): Promise<AxChatResponse> {
    // Pre-processing: validate request against capabilities
    this.validateRequest(req);

    // Provider-specific implementation
    const response = await this.chatImplementation(req);

    // Post-processing: normalize response format
    return this.normalizeResponse(response);
  }

  protected abstract chatImplementation(
    req: AxChatRequest,
  ): Promise<AxChatResponse>;
}

// Provider-specific implementations
export class AxAIOpenAI extends AxBaseAI {
  getCapabilities(): AxModelCapabilities {
    return {
      functions: true,
      streaming: true,
      vision: this.modelId.includes('vision'),
      maxTokens: this.getMaxTokensForModel(this.modelId),
    };
  }

  protected async chatImplementation(
    req: AxChatRequest,
  ): Promise<AxChatResponse> {
    const openaiRequest = this.convertToOpenAIFormat(req);
    const response = await this.openaiClient.chat.completions.create(
      openaiRequest,
    );
    return this.convertFromOpenAIFormat(response);
  }
}

// Capability-aware routing
export class AxAIRouter {
  selectProvider(requirements: AxCapabilityRequirements): AxAIService {
    for (const provider of this.providers) {
      const capabilities = provider.getCapabilities();
      if (this.satisfiesRequirements(capabilities, requirements)) {
        return provider;
      }
    }
    throw new Error('No provider satisfies requirements');
  }
}

Cool feature: Automatic fallback chain that keeps capabilities ensures requests always reach a provider that can handle them.

Challenge 4: DSPy optimization in TypeScript

The problem: Building complex optimization algorithms like MiPRO v2 in TypeScript while keeping the math correct from the original Python version.

Math stuff that'll melt your brain:

Bayesian optimization with Gaussian processes
Multiple ways to pick next parameters (EI, UCB, PI)
Teacher-student optimization patterns
Multi-goal optimization with Pareto frontiers
Advanced sampling strategies

How we solved it: Pure TypeScript version with optional Python backend

WARNING: Math zone detected, big brains alert

// Native TypeScript Bayesian optimization
class AxMiPRO extends AxBaseOptimizer {
  private surrogateModel = new Map<
    string,
    { mean: number; variance: number }
  >();

  private calculateAcquisitionValue(config: ConfigType): number {
    const prediction = this.predictPerformance(config);
    const { mean, variance } = prediction;
    const std = Math.sqrt(variance);
    const bestScore = Math.max(
      ...this.configHistory.map((entry) => entry.score),
    );

    switch (this.acquisitionFunction) {
      case 'expected_improvement': {
        const improvement = mean - bestScore;
        if (std === 0) return Math.max(0, improvement);

        const z = improvement / std;
        const phi = 0.5 * (1 + this.erf(z / Math.sqrt(2))); // CDF
        const pdfValue = Math.exp(-0.5 * z * z) / Math.sqrt(2 * Math.PI); // PDF

        return improvement * phi + std * pdfValue;
      }
      // ... other acquisition functions
    }
  }

  // Error function approximation for statistical calculations
  private erf(x: number): number {
    // Abramowitz and Stegun approximation
    const a1 = 0.254829592,
      a2 = -0.284496736,
      a3 = 1.421413741;
    const a4 = -1.453152027,
      a5 = 1.061405429,
      p = 0.3275911;

    const sign = x >= 0 ? 1 : -1;
    const absX = Math.abs(x);
    const t = 1.0 / (1.0 + p * absX);
    const y =
      1.0 -
      ((((a5 * t + a4) * t + a3) * t + a2) * t + a1) *
        t *
        Math.exp(-absX * absX);

    return sign * y;
  }

  // Optional Python backend integration
  private async compilePython(
    program: AxGen,
    metricFn: AxMetricFn,
  ): Promise<AxMiPROResult> {
    if (!this.pythonClient) throw new Error('Python client not initialized');

    const optimizationRequest = {
      study_name: `mipro_${Date.now()}`,
      parameters: [
        { name: 'temperature', type: 'float', low: 0.1, high: 2.0 },
        {
          name: 'bootstrappedDemos',
          type: 'int',
          low: 0,
          high: this.maxBootstrappedDemos,
        },
      ],
      objective: { name: 'score', direction: 'maximize' },
      n_trials: this.numTrials,
      sampler: 'TPESampler',
    };

    const job = await this.pythonClient.createOptimizationJob(
      optimizationRequest,
    );
    // ... handle optimization loop with Python backend
  }
}

Best of both: Pure TypeScript works in browsers, optional Python backend for advanced math stuff.

Smart tricks we found

Ax doesn't have many tricks to begin with, its selling point is with the signature pattern and collection of optimizers. The biggest trick of Ax/DSPy is how it managed to stay so low-key for so many years that no one has mentioned it in mainstream media (blog posts, tutorials, etc...) until context engineering become the new trend

Trick 1: Runtime checks that play nice with TypeScript

The problem: Making sure field names are descriptive at runtime without breaking TypeScript's compile-time checking.

How we did it: Multiple layers of validation with ~~tons of @ts-ignores~~ compile-time hints.

function validateField(field: AxField, context: 'input' | 'output'): void {
  if (!field.name || field.name.length === 0) {
    throw new AxSignatureValidationError(
      'Field name cannot be blank',
      field.name,
    );
  }

  // Runtime validation for field name descriptiveness
  if (axGlobals.signatureStrict) {
    const reservedNames = [
      'text',
      'object',
      'data',
      'value',
      'result',
      'response',
      'request',
      'item',
    ];

    if (reservedNames.includes(field.name.toLowerCase())) {
      const suggestions =
        context === 'input'
          ? ['userInput', 'questionText', 'documentContent', 'messageText']
          : ['responseText', 'analysisResult', 'categoryType', 'summaryText'];

      throw new AxSignatureValidationError(
        `Field name '${field.name}' is too generic`,
        field.name,
        `Use a more descriptive name. Examples: ${suggestions.join(', ')}`,
      );
    }
  }

  // Case validation
  if (!isValidCase(field.name)) {
    throw new AxSignatureValidationError(
      `Invalid field name '${field.name}' - must be camelCase or snake_case`,
      field.name,
      'Use camelCase (e.g., "userInput") or snake_case (e.g., "user_input")',
    );
  }
}

// Type-level enforcement through branded types
type DescriptiveFieldName = string & { __brand: 'descriptive' };

function createField(name: DescriptiveFieldName, type: AxFieldType): AxField {
  return { name, type }; // Compile-time guarantee of descriptive name
}

The cool part: Mix runtime validation with TypeScript's branded types to get both type safety and runtime checks.

Trick 2: Finding parallel operations automatically

The problem: Finding operations that can run in parallel without making developers mark them explicitly.

How we did it: Control flow analysis with execution graph optimization.

class AxFlowExecutionPlanner {
  setInitialFields(fields: string[]): void {
    this.availableFields = new Set(fields);
  }

  createOptimizedExecution(batchSize: number): AxFlowStepFunction[] {
    const executionGraph = this.buildExecutionGraph();
    const optimizedGroups = this.optimizeExecution(executionGraph);

    return optimizedGroups.map((group) => {
      if (group.length === 1) {
        return group[0]!.step;
      }

      // Create batched parallel execution
      return async (state: AxFlowState, context: any) => {
        console.log(`Executing ${group.length} operations in parallel`);

        const results = await processBatches(
          group,
          async (stepInfo, _index) => {
            const stepResult = await stepInfo.step(state, context);
            return { [stepInfo.id]: stepResult };
          },
          batchSize,
        );

        // Merge all parallel results
        return results.reduce(
          (merged, result) => ({ ...merged, ...result }),
          state,
        );
      };
    });
  }

  private buildExecutionGraph(): ExecutionNode[] {
    const nodes: ExecutionNode[] = [];

    for (let i = 0; i < this.steps.length; i++) {
      const step = this.steps[i]!;
      const node: ExecutionNode = {
        id: i,
        step: step.step,
        dependencies: step.dependencies,
        produces: step.produces,
        canExecuteAfter: new Set<number>(),
        mustExecuteBefore: new Set<number>(),
      };

      // Find dependencies on previous steps
      for (let j = 0; j < i; j++) {
        const prevStep = this.steps[j]!;
        const hasDataDependency = step.dependencies.some((dep) =>
          prevStep.produces.includes(dep),
        );

        if (hasDataDependency) {
          node.canExecuteAfter.add(j);
          nodes[j]?.mustExecuteBefore.add(i);
        }
      }

      nodes.push(node);
    }

    return nodes;
  }

  private optimizeExecution(graph: ExecutionNode[]): ExecutionNode[][] {
    const groups: ExecutionNode[][] = [];
    const scheduled = new Set<number>();

    while (scheduled.size < graph.length) {
      const readyNodes = graph.filter(
        (node) =>
          !scheduled.has(node.id) &&
          [...node.canExecuteAfter].every((dep) => scheduled.has(dep)),
      );

      if (readyNodes.length === 0) {
        throw new Error('Circular dependency detected in execution graph');
      }

      groups.push(readyNodes);
      readyNodes.forEach((node) => scheduled.add(node.id));
    }

    return groups;
  }
}

Just works: Complex workflows automatically get parallel execution without any setup.

Properties

Location

Stats

Ax framework breakdown

Overview

TL;DR

You lost me at DSPy, wtf is that?

Problems it fixed

This sounds too good to be true, what's the catch?

The three foundational pillars

The possibilities that Ax unlocks for you

Wall-of-prompt-be-gone abracadabra

Agent Smith would be proud

Make o4-mini as smart as o4? Hold my beer

How it works

Architecture overview

Request flow

Data structures and algorithms

Signature system (Pillar #1)

AxSignature: The core type definition

Field builder system

Extract and validate response

Flow execution engine (Pillar #2)

Dynamic signature inference algorithm

Parallel execution planning

Optimization algorithms (Pillar #3)

MiPRO v2 implementation

Bayesian optimization with acquisition functions

Bootstrap few shot execution flow

MiPRO v2 execution flow

Combined optimization pipeline

Technical challenges and solutions

Challenge 1: LLM input and output are not typed

Challenge 2: Workflow node also need to be typed (knows the signature input/output)

Challenge 3: LLM providers don't like each other

Challenge 4: DSPy optimization in TypeScript

Smart tricks we found

Trick 1: Runtime checks that play nice with TypeScript

Trick 2: Finding parallel operations automatically

Subscribe to Dwarves Memo

Properties

Location

Stats

Command Palette

Ax framework breakdown

Overview

TL;DR

You lost me at DSPy, wtf is that?

Problems it fixed

This sounds too good to be true, what's the catch?

The three foundational pillars

The possibilities that Ax unlocks for you

Wall-of-prompt-be-gone abracadabra

Agent Smith would be proud

Make o4-mini as smart as o4? Hold my beer

How it works

Architecture overview

Request flow

Data structures and algorithms

Signature system (Pillar #1)

AxSignature: The core type definition

Field builder system

Extract and validate response

Flow execution engine (Pillar #2)

Dynamic signature inference algorithm

Parallel execution planning

Optimization algorithms (Pillar #3)

MiPRO v2 implementation

Bayesian optimization with acquisition functions

Bootstrap few shot execution flow

MiPRO v2 execution flow

Combined optimization pipeline

Technical challenges and solutions

Challenge 1: LLM input and output are not typed

Challenge 2: Workflow node also need to be typed (knows the signature input/output)

Challenge 3: LLM providers don't like each other

Challenge 4: DSPy optimization in TypeScript

Smart tricks we found

Trick 1: Runtime checks that play nice with TypeScript

Trick 2: Finding parallel operations automatically

Subscribe to Dwarves Memo