LLM Integration (Anthropic)

43 fragments · Layer 3 Synthesized established · 43 evidence · updated 2025-01-31

Summary

The default max_tokens of 4096 is too small for any non-trivial Claude task — every project that hit truncation fixed it by raising to 16384, and JSON parsing breaks silently when the response is cut mid-structure. The most reliable architecture across projects is a two-tier model split: Haiku at temperature 0 for classification and planning, Sonnet for generation and writing. Tool use requires an explicit result-feeding loop or Claude can't chain actions. When latency is the primary constraint (sub-30-second user-facing responses), GPT-4o has outperformed Claude in practice — LabelCheck's migration from Claude 3.5 Sonnet → GPT-5 Mini → GPT-4o cut analysis time from 117+ seconds to 15–30 seconds.


TL;DR

What we've learned
- max_tokens: 4096 truncates real responses; set 16384 as the floor across all projects — this has burned Meridian, ContentCommand, and AsymXray independently.
- Haiku + Sonnet two-pass (plan cheap, write expensive) processes 5 documents in under 60 seconds with 3 concurrent workers in Meridian.
- Tool use loops need explicit result-feeding: Claude won't chain actions unless you feed each tool result back into the next API call.
- Claude's 200K context limit is reachable in production — ContentCommand's DataForSEO payloads hit 211K tokens and required pre-summarization.
- LabelCheck migrated away from Claude entirely for latency-sensitive analysis; GPT-4o at 15–30 seconds beat Claude 3.5 Sonnet at 117+ seconds.

External insights

No external sources ingested yet for this topic.


Common Failure Modes

max_tokens default truncates JSON mid-structure

Established failure mode across 4 projects (Meridian, ContentCommand, AsymXray, and implicitly LabelCheck). The Anthropic SDK default of max_tokens: 4096 is insufficient for responses that include structured JSON with substantive content. The failure mode is silent: the API returns a 200, but the response body ends mid-JSON, causing a parse error downstream.

SyntaxError: Unexpected end of JSON input

The fix is two-part: raise max_tokens to 16384 and add JSON repair logic for cases where truncation still occurs at edge-case lengths.

const response = await anthropic.messages.create({
  model: "claude-haiku-4-5",
  max_tokens: 16384,  // not 4096
  messages: [...]
});
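For the second part of the fix, repair logic varies by project; a minimal sketch of one approach is below. It closes unterminated strings, arrays, and objects, and the helper name is illustrative, not taken from the source commits.

```typescript
// Best-effort repair of JSON cut off mid-structure. A sketch only:
// it handles truncated strings and unclosed brackets, not every case
// (e.g. a value cut off right after a colon is still unrecoverable).
function repairTruncatedJson(raw: string): unknown {
  try {
    return JSON.parse(raw);
  } catch {
    let s = raw;
    const stack: string[] = []; // open brackets, in order
    let inString = false;
    let escaped = false;
    for (const ch of s) {
      if (escaped) { escaped = false; continue; }
      if (ch === "\\") { escaped = true; continue; }
      if (ch === '"') { inString = !inString; continue; }
      if (inString) continue;
      if (ch === "{" || ch === "[") stack.push(ch);
      if (ch === "}" || ch === "]") stack.pop();
    }
    if (inString) s += '"';          // close a truncated string
    s = s.replace(/[,:]\s*$/, "");    // drop a dangling comma/colon
    while (stack.length) {
      s += stack.pop() === "{" ? "}" : "]";
    }
    return JSON.parse(s);
  }
}
```

Treat repair as a fallback for edge-case lengths, not a substitute for raising max_tokens.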

Observed in Meridian (extraction and compiler passes), ContentCommand (brief generation), and AsymXray (brief generation).
[1]


Context window overflow from third-party API payloads

ContentCommand's DataForSEO competitive analysis responses exceeded Claude's 200K token context window — raw competitive data came in at 211K tokens, causing the API call to fail before Claude could process it.

The fix is pre-summarization: run a cheap pass to compress third-party payloads before they enter the main prompt. Don't assume external API responses are context-window-safe.
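A sketch of the guard; the chars/4 token estimate and the 150K threshold are assumptions for illustration, not values from the ContentCommand commit:

```typescript
// Rough token estimate: ~4 characters per token for English text.
// This is a heuristic, not the Anthropic tokenizer.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Leave headroom under the 200K context window for the prompt itself.
const CONTEXT_BUDGET = 150_000;

function needsSummarization(payload: string): boolean {
  return estimateTokens(payload) > CONTEXT_BUDGET;
}

// In the pipeline (sketch):
// const data = needsSummarization(raw)
//   ? await summarizeWithHaiku(raw)  // cheap compression pass
//   : raw;
```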

Observed in ContentCommand when building the content brief generation pipeline.
[2]


Anthropic client lazy initialization silently fails

Observed in AsymXray: the Anthropic client was initialized lazily (constructed on first use rather than at module load), and when the API key was missing or misconfigured, the failure surfaced as a confusing runtime error during the first AI call rather than at startup.

The fix is eager initialization with an explicit API key check at startup:

if (!process.env.ANTHROPIC_API_KEY) {
  throw new Error("ANTHROPIC_API_KEY is required");
}
const anthropic = new Anthropic({ apiKey: process.env.ANTHROPIC_API_KEY });

Seen in two projects (AsymXray, Eydn) — treat missing API key as a startup crash, not a per-request error.
[3]


Tool use without result-feeding loops stalls after first action

In Eydn's AI chat, Claude was given 9 tools (guest management, task creation, vendor tracking, budget, mood board, decision memory, etc.) but the initial implementation didn't feed tool results back into subsequent API calls. Claude would invoke a tool and then stop, unable to chain a second action.

The fix is an explicit loop — up to 5 iterations in Eydn's implementation — that feeds each tool_result block back as a user message:

const MAX_ITERATIONS = 5; // Eydn caps chained actions at 5
let iterations = 0;
while (response.stop_reason === "tool_use" && iterations < MAX_ITERATIONS) {
  // executeTools runs each tool_use block and returns tool_result blocks
  const toolResults = await executeTools(response.content);
  messages.push({ role: "assistant", content: response.content });
  messages.push({ role: "user", content: toolResults });
  response = await anthropic.messages.create({ model, messages, tools });
  iterations++;
}

Observed in Eydn when implementing the action-taking AI chat.
[4]


Frontend timeouts fire before long AI responses complete

AsymXray's brief generation hit a wall when max_tokens was raised from 2000 to 4096: the longer responses pushed past the default 60-second frontend timeout, returning a timeout error even though the API call eventually succeeded.

Fix: raise the frontend (Next.js route handler or fetch) timeout to 120 seconds when max_tokens is above ~3000. This is a two-knob problem — token limit and timeout must be raised together.
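On Vercel-hosted Next.js (App Router), the server-side knob is the `maxDuration` route segment config; this is a sketch with a hypothetical route path, and other hosts will have their own timeout setting:

```typescript
// app/api/brief/route.ts
// Route segment config: allow up to 120s before the platform times out.
export const maxDuration = 120;

export async function POST(request: Request): Promise<Response> {
  // ...call anthropic.messages.create({ max_tokens: 16384, ... }) here...
  return new Response("ok");
}
```

If the browser-side fetch also enforces a timeout, raise it in the same change (e.g. `AbortSignal.timeout(120_000)`), otherwise the second knob just moves the failure.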

Observed in AsymXray.
[5]


Claude latency unacceptable for synchronous user-facing analysis

LabelCheck's FDA label analysis with Claude 3.5 Sonnet took 117+ seconds — unusable for a user waiting on a compliance result. The project migrated to GPT-5 Mini (still ~117 seconds), then to GPT-4o (15–30 seconds, 4–6x faster).

This is a documented migration decision, not a bug. The lesson: for synchronous, user-facing document analysis where the user is watching a spinner, Claude's latency profile may not fit. GPT-4o is currently faster for this workload.

Observed in LabelCheck (Claude 3.5 Sonnet → GPT-5 Mini → GPT-4o migration).
[6]


What Works

Haiku for planning/classification, Sonnet for writing

Consistent across Meridian and AsymXray: use claude-haiku-4-5 for decisions that produce structured output (which files to create, which industry category, which intent class), and claude-sonnet-4-5 for prose generation. Haiku is fast enough that the planning pass doesn't dominate wall time.

In Meridian's two-pass compiler, this split processes 5 documents in under 60 seconds with 3 concurrent workers. Haiku classification over 58 clients ran in 85.5 seconds total.

// Planning pass — cheap and fast
const plan = await anthropic.messages.create({
  model: "claude-haiku-4-5",
  max_tokens: 16384,
  temperature: 0,  // deterministic for classification
  messages: [{ role: "user", content: planningPrompt }]
});

// Writing pass — quality matters
const content = await anthropic.messages.create({
  model: "claude-sonnet-4-5",
  max_tokens: 16384,
  messages: [{ role: "user", content: writingPrompt(plan) }]
});

[7]


Temperature 0 for classification tasks

Haiku at temperature: 0 produces stable, repeatable classifications. In Meridian's industry classifier, 34 of 58 clients came back high-confidence, 6 medium, 18 low — the low-confidence bucket is the right signal for routing to manual override rather than retrying with a different prompt.

Pair temperature 0 with JSON schema validation in the prompt. Don't rely on Claude to self-correct schema drift; validate the output and reject/retry on schema failure.
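A minimal validate-and-reject sketch; the schema and field names here are illustrative, since the projects' actual schemas aren't in the fragments:

```typescript
interface Classification {
  industry: string;
  confidence: "high" | "medium" | "low";
}

// Reject anything that drifts from the expected shape; the caller
// retries or routes to manual override instead of trusting the output.
function parseClassification(raw: string): Classification | null {
  try {
    const c = JSON.parse(raw);
    if (typeof c?.industry !== "string") return null;
    if (!["high", "medium", "low"].includes(c?.confidence)) return null;
    return { industry: c.industry, confidence: c.confidence };
  } catch {
    return null;
  }
}
```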
[8]


Full user-data context injection for actionable chat responses

In Eydn, injecting the complete user state into the system prompt — up to 50 tasks, 20 vendors, 100 guests, full budget, seating chart, and uploaded documents — produced qualitatively better responses than injecting summaries. Claude can reference specific vendor names, flag budget conflicts, and suggest concrete next steps when it has the actual data.

The practical limit: Eydn's full context fits comfortably within Claude's 200K window. If your user data grows beyond ~150K tokens, you'll need selective injection or summarization.
[9]


Claude vision API for PDF document analysis

The @anthropic-ai/sdk document type (available from v0.67.0) handles PDFs with complex layouts: rotated text, colored backgrounds, poor contrast. In LabelCheck, this was used for FDA label extraction before the project migrated to OpenAI for latency reasons — the extraction quality was not the issue.

const response = await anthropic.messages.create({
  model: "claude-sonnet-4-5",
  max_tokens: 16384,
  messages: [{
    role: "user",
    content: [{
      type: "document",
      source: {
        type: "base64",
        media_type: "application/pdf",
        data: pdfBase64
      }
    }, {
      type: "text",
      text: "Extract all label text and structure..."
    }]
  }]
});

Enforce a 10MB max file size before sending — the API will reject larger files and the error message is not user-friendly.
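A sketch of that size check; base64 inflates the byte count by roughly 4/3, so measure the decoded size, not the string length (the helper name is illustrative):

```typescript
const MAX_PDF_BYTES = 10 * 1024 * 1024; // 10MB

function pdfWithinLimit(pdfBase64: string): boolean {
  // Base64 encodes 3 bytes per 4 characters; trailing '=' padding
  // characters carry no data.
  const padding = (pdfBase64.match(/=+$/)?.[0] ?? "").length;
  const bytes = (pdfBase64.length * 3) / 4 - padding;
  return bytes <= MAX_PDF_BYTES;
}
```

Run this before building the request so the user gets a clear error instead of the API's.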
[10]


Batching fragments for synthesis at scale

In Meridian's Layer 3 synthesis pipeline, large topics are batched at 20 fragments per batch. Sending all fragments in a single call risks both context overflow and degraded synthesis quality (Claude's attention degrades over very long contexts). The 20-fragment batch size was chosen empirically.
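The batching itself is a simple chunking step; 20 is Meridian's empirical value, and the helper name is illustrative:

```typescript
const BATCH_SIZE = 20; // chosen empirically in Meridian

// Split fragments into fixed-size batches; each batch becomes one
// synthesis call, and results are merged afterwards.
function batchFragments<T>(fragments: T[], size = BATCH_SIZE): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < fragments.length; i += size) {
    batches.push(fragments.slice(i, i + size));
  }
  return batches;
}
```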

[11]


Per-feature model configuration

Observed in AsymXray: rather than hardcoding a single model across all AI features, implement per-feature model configuration. This lets you tune cost vs. quality per use case without a code change, and makes it easy to A/B test model upgrades on individual features.
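One way to express this; the feature names and config shape are hypothetical, since AsymXray's actual configuration isn't in the fragments:

```typescript
interface ModelConfig {
  model: string;
  maxTokens: number;
  temperature?: number;
}

// Per-feature tuning lives in one place; swapping a model for one
// feature is a config edit, not a code change.
const MODEL_CONFIG: Record<string, ModelConfig> = {
  classification: { model: "claude-haiku-4-5", maxTokens: 16384, temperature: 0 },
  briefGeneration: { model: "claude-sonnet-4-5", maxTokens: 16384 },
};

function configFor(feature: string): ModelConfig {
  const cfg = MODEL_CONFIG[feature];
  if (!cfg) throw new Error(`No model config for feature: ${feature}`);
  return cfg;
}
```

Failing loudly on an unknown feature catches typos at development time rather than silently falling back to a default model.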

[12]


Gotchas and Edge Cases

Haiku is not lightweight for large-batch extraction

The assumption that Haiku = cheap = fast breaks down at scale. In Meridian, Haiku extraction with max_tokens: 16384 over large batches still required 16K tokens per call — the model is cheaper per token, but the token count is driven by the task, not the model. Budget accordingly.
[13]


Tool use with external APIs needs rate limiting and caching

In Eydn, web search was implemented as a Claude tool via the Tavily API. Without rate limiting, a single chat session could exhaust the Tavily quota. The production implementation caps at 10 searches per user per day with a 24-hour result cache. Any tool that calls an external paid API needs this treatment — Claude will call tools eagerly.
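A sketch of the two guards with in-memory state; Eydn's production version presumably persists these, but the limits match the ones described:

```typescript
const DAILY_SEARCH_LIMIT = 10;
const CACHE_TTL_MS = 24 * 60 * 60 * 1000; // 24-hour result cache

const searchCounts = new Map<string, { day: string; count: number }>();
const resultCache = new Map<string, { at: number; results: string }>();

// Allow at most DAILY_SEARCH_LIMIT searches per user per calendar day.
function canSearch(userId: string, now = new Date()): boolean {
  const day = now.toISOString().slice(0, 10);
  const entry = searchCounts.get(userId);
  if (!entry || entry.day !== day) {
    searchCounts.set(userId, { day, count: 1 });
    return true;
  }
  if (entry.count >= DAILY_SEARCH_LIMIT) return false;
  entry.count++;
  return true;
}

// Serve a cached result if one exists and is still fresh.
function cachedResult(query: string, now = Date.now()): string | undefined {
  const hit = resultCache.get(query);
  if (hit && now - hit.at < CACHE_TTL_MS) return hit.results;
  return undefined;
}
```

The tool handler checks the cache first, then the rate limit, and only then calls the external API.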
[14]


Admin context bypass in AI chat

In Eydn, the AI chat system prompt included task/vendor context fetched from the database. An early bug allowed admin users to bypass the user-scoping on that fetch, meaning the AI could respond with data from other users' accounts. Any context injection that queries the database must enforce the same auth scoping as the rest of the app.
[15]


LLM classification confidence distribution requires a manual override path

In Meridian's industry classifier, 18 of 58 clients (31%) came back low-confidence. A classifier without a manual override path would silently misclassify nearly a third of inputs. Build the override UI before shipping classification features — it's not optional.
[8]


Claude Opus 4.6 1M context is real but not free

Claude Opus 4.6's 1M token context window is used in Meridian and ContentCommand for complex multi-file engineering tasks and knowledge synthesis. The context window works as advertised. The cost per call at 1M tokens is significant — use it for async/batch tasks, not synchronous user-facing features.
[16]


Speech-to-text input requires alias matching in registry-validated prompts

In Meridian's registry-enforced compiler, Claude validates planned knowledge paths against clients.yaml and topics.yaml. Voice input (speech-to-text) introduces transcription errors — "Eydn" becomes "Eden", "AsymXray" becomes "Asim Ray". The validation layer needs alias matching, not exact-string matching, or voice-driven workflows break constantly.
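A minimal sketch of alias-tolerant lookup; the alias table entries come from the examples above, and the normalization strategy is an assumption:

```typescript
// Known speech-to-text aliases mapped to canonical registry names.
const ALIASES: Record<string, string> = {
  "eden": "Eydn",
  "asim ray": "AsymXray",
};

// Resolve voice input against the registry: exact match (case-insensitive)
// first, then the alias table. Returns null when nothing matches.
function resolveRegistryName(input: string, registry: string[]): string | null {
  const norm = input.trim().toLowerCase();
  const exact = registry.find((r) => r.toLowerCase() === norm);
  if (exact) return exact;
  const aliased = ALIASES[norm];
  if (aliased && registry.includes(aliased)) return aliased;
  return null;
}
```

A null result should route to a confirmation prompt rather than a hard validation failure.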
[17]


Where Docs Disagree With Practice

LabelCheck: Claude → GPT migration for latency

The Anthropic docs don't address latency benchmarks relative to OpenAI. In practice at LabelCheck, Claude 3.5 Sonnet took 117+ seconds for FDA label analysis — the same task GPT-4o completes in 15–30 seconds. This is a 4–6x latency gap on a real production workload. The migration path was Claude 3.5 Sonnet → GPT-5 Mini (no improvement) → GPT-4o (fixed). For synchronous document analysis, Claude's latency is a real constraint that Anthropic's documentation doesn't surface.
[6]


max_tokens default is documented but the failure mode is not

Anthropic's docs note that max_tokens defaults to 4096 and can be raised. What they don't say is that JSON responses truncate silently — the API returns HTTP 200 with a stop_reason: "max_tokens" that's easy to miss if you're not explicitly checking it. Four projects hit this independently before it became a known pattern here.

if (response.stop_reason === "max_tokens") {
  // Response was truncated — JSON will be malformed
  throw new Error(`Response truncated at ${response.usage.output_tokens} tokens`);
}

[18]


Tool use docs show single-turn examples; production needs multi-turn loops

Anthropic's tool use documentation shows single tool invocations. Production use cases (Eydn's 9-tool chat with up to 5 chained actions) require a loop that feeds results back. The docs have examples of this but they're not prominent — the single-turn example is the one developers implement first and then have to retrofit.
[4]


Tool and Version Notes



Sources

Synthesized from 43 fragments: git commits across AsymXray, ContentCommand, Eydn, LabelCheck, and Meridian. No external sources ingested yet. Date range: unknown to unknown (commit timestamps not extracted).


  1. Meridian 5ac699f Fix Extraction Increase Maxtokels To 16384 Improve, Meridian cac78d4 Increase Compiler Maxtokens To 16384, ContentCommand 9b4782f Handle Truncated Json From Ai Increase Max Tokens, AsymXray ef33a7a Resolve Ai Brief Generation Issues
  2. ContentCommand b96d1b7 Summarize Competitive Data To Prevent Prompt Token
  3. AsymXray b03c187 Add Gsc Historical Sync Fix Ai Client Initializati, Eydn App 3707872 Fix Chat Error Handling And Add Anthropicapikey Ch
  4. Eydn App 2cb2d24 Add Tool Use To Ai Chat Eydn Can Now Take Actions
  5. AsymXray ef33a7a Resolve Ai Brief Generation Issues
  6. LabelCheck a20c510 Migrate From Anthropic Claude To Openai Gpt For Ai, LabelCheck d2c1be4 Switch From Gpt 5 Mini To Gpt 4O For Faster Analys
  7. Meridian 51746ed Parallel Two Pass Compiler Haiku Plans Sonnet Writ, Meridian 3695d97 Classify Clients By Industry Via Haiku With Manual, AsymXray caa7181 Implement Per Feature Ai Model Configuration
  8. Meridian 3695d97 Classify Clients By Industry Via Haiku With Manual
  9. Eydn App 4901b5c Give Ai Chat Full Access To All User Data
  10. LabelCheck 46a6820 Add Pdf Upload With Vision Based Text Extraction T, LabelCheck 513ad8f Add Pdf Upload Support For Initial Label Analysis
  11. Meridian 1480508 Phase 3 Layer 3 Synthesis Agent Scheduler Endpoint
  12. AsymXray caa7181 Implement Per Feature Ai Model Configuration
  13. Meridian 5ac699f Fix Extraction Increase Maxtokels To 16384 Improve
  14. Eydn App 97978c1 Add Web Search To Ai Chat Via Tavily Api
  15. Eydn App f3e1b1f Fix Ai Chat Admin Bypass Taskvendor Context App Aw
  16. Eydn App 876317f Add Persistent Eydn Memory And Expand Chat Context, Meridian b3a1b4d Initial Scaffold For Meridian Knowledge System
  17. Meridian c2c610c Registry Enforced Compiler Clientsyaml Topicsyaml
  18. Meridian 5ac699f Fix Extraction Increase Maxtokels To 16384 Improve, ContentCommand 9b4782f Handle Truncated Json From Ai Increase Max Tokens
  19. LabelCheck 46a6820 Add Pdf Upload With Vision Based Text Extraction T
  20. LabelCheck 2c1ad05 Fix Openai Api Parameter For Gpt 5 Compatibility
