feat(ai): robust response extraction + structured-output foundation (flag-gated) #188

Merged

chihlasm merged 3 commits from feat/ai-structured-outputs into main

2026-05-29 04:23:28 +00:00

Author	SHA1	Message	Date
Michael Chihlas	02db15f118	docs(decisions): scope structured outputs to flat-array JSON (close 3c) Some checks failed Mirror to GitHub / mirror (push) Failing after 6s Details CI / frontend (pull_request) Successful in 7m12s Details CI / backend (pull_request) Successful in 11m51s Details CI / e2e (pull_request) Successful in 10m7s Details Record the 3c finding: Anthropic structured outputs apply only to flat-array generate_json outputs (kb_conversion). ai_fix and knowledge_flywheel flow-gen emit recursive/nested decision trees that the "no recursive schemas" limit excludes; their fence-strippers stay. Documents the deferred kb-only _try_repair_json removal pending staging validation of the AI_KB_CONVERT_STRUCTURED_OUTPUT flag. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 21:48:49 -04:00
Michael Chihlas	3fde3369c8	chore: gitignore core dumps (core.<pid>) Stop crashed-process core dumps (core.144926, etc.) from showing up as untracked noise / being committed by accident. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 21:48:49 -04:00
Michael Chihlas	067574ad6a	feat(ai): robust response extraction + structured-output foundation Harden the Anthropic provider and lay the groundwork for schema-constrained JSON, optimizing the existing claude-sonnet-4-6 / claude-haiku-4-5 usage (no model changes). ai_provider.py: - _extract_text_from_response replaces fragile response.content[0].text: skips non-text leading blocks (e.g. thinking), returns the first text block, logs an anthropic.stop_reason warning on max_tokens/refusal (truncation now observable), and raises ValueError on a no-text response. - generate_json gains an optional `schema` param. Anthropic wires it to output_config.format (structured outputs); schema=None preserves the exact prior call for every existing caller. Gemini accepts-and-ignores it. kb_conversion_service.py: - TROUBLESHOOTING_SCHEMA / PROCEDURAL_SCHEMA + _schema_for_target_type(), modelled as a strict superset of every field the prompts emit. - convert_document passes the schema only when the new AI_KB_CONVERT_STRUCTURED_OUTPUT setting is True (default False). The _try_repair_json fallback stays as belt-and-suspenders. Tests: 14 provider + 7 schema, TDD (red-green). Live constrained-decoding smoke-test still required before enabling the flag in production. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 21:48:49 -04:00

Author

SHA1

Message

Date

Michael Chihlas

02db15f118

docs(decisions): scope structured outputs to flat-array JSON (close 3c)

Mirror to GitHub / mirror (push) Failing after 6s

Details

CI / frontend (pull_request) Successful in 7m12s

Details

CI / backend (pull_request) Successful in 11m51s

Details

CI / e2e (pull_request) Successful in 10m7s

Details

Record the 3c finding: Anthropic structured outputs apply only to flat-array
generate_json outputs (kb_conversion). ai_fix and knowledge_flywheel flow-gen
emit recursive/nested decision trees that the "no recursive schemas" limit
excludes; their fence-strippers stay. Documents the deferred kb-only
_try_repair_json removal pending staging validation of the
AI_KB_CONVERT_STRUCTURED_OUTPUT flag.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-05-28 21:48:49 -04:00

Michael Chihlas

3fde3369c8

chore: gitignore core dumps (core.<pid>)

Stop crashed-process core dumps (core.144926, etc.) from showing up as
untracked noise / being committed by accident.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-05-28 21:48:49 -04:00

Michael Chihlas

067574ad6a

feat(ai): robust response extraction + structured-output foundation

Harden the Anthropic provider and lay the groundwork for schema-constrained
JSON, optimizing the existing claude-sonnet-4-6 / claude-haiku-4-5 usage
(no model changes).

ai_provider.py:
- _extract_text_from_response replaces fragile response.content[0].text:
  skips non-text leading blocks (e.g. thinking), returns the first text
  block, logs an anthropic.stop_reason warning on max_tokens/refusal
  (truncation now observable), and raises ValueError on a no-text response.
- generate_json gains an optional `schema` param. Anthropic wires it to
  output_config.format (structured outputs); schema=None preserves the exact
  prior call for every existing caller. Gemini accepts-and-ignores it.

kb_conversion_service.py:
- TROUBLESHOOTING_SCHEMA / PROCEDURAL_SCHEMA + _schema_for_target_type(),
  modelled as a strict superset of every field the prompts emit.
- convert_document passes the schema only when the new
  AI_KB_CONVERT_STRUCTURED_OUTPUT setting is True (default False). The
  _try_repair_json fallback stays as belt-and-suspenders.

Tests: 14 provider + 7 schema, TDD (red-green). Live constrained-decoding
smoke-test still required before enabling the flag in production.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-05-28 21:48:49 -04:00

feat(ai): robust response extraction + structured-output foundation (flag-gated) #188

3 Commits