AI Accountability

We use AI. Here's exactly
what it touches.

Every piece of content on FitChef carries a provenance trail. Which AI agents processed it, what they found, what they got wrong, and where humans intervened. This page explains that trail — and shows you the aggregate results.

Most AI transparency pages are a single paragraph: "We use AI tools to assist our editorial process." That tells you nothing. It doesn't tell you which AI, what it did, where it failed, or how you'd verify any of it.

This page is different. We don't just disclose that we use AI. We disclose what every AI agent does, what it cannot do, what it gets wrong, and how we catch those errors — with live data from our actual pipeline runs.

Five Things AI Never Does on FitChef

Before we show you the data, here are the boundaries. These are architectural constraints — not guidelines. The pipeline enforces them automatically.

AI never invents claims

Every factual claim on FitChef is extracted from a peer-reviewed paper. AI reads and structures — it does not generate scientific conclusions. If a claim can't be traced to a source paper, it doesn't exist in our system.

AI never publishes alone

No AI output reaches readers without passing through a second, independent AI verification and at least one human checkpoint. The pipeline has three mandatory human gates where content pauses until a person approves.

AI never overrides the Skeptic

The Skeptic agent runs independently from every writing agent. If it flags an error, there is no editorial override. The content gets fixed or it doesn't publish. The writing agent and the verification agent never share context.

Every AI decision is logged

Every extraction, every verification, every catch, every correction gets written to the Golden JSON — a single file that grows through every step of our three pipelines. Nothing is deleted. Nothing is quietly edited. The audit trail is permanent.

Humans hold 28 kill switches

At any point, a human can stop the pipeline and kill content. 28 automatic conditions halt publication instantly — wrong numbers, fabricated quotes, retracted studies, meaning drift, editorial overstatement, injected external data, scope overgeneralization. No appeals.

What Each AI Agent Does — and What It Can't

FitChef runs three dedicated pipeline teams — one for studies, one for claims, one for flagship guides. Each team has its own specialized agents, its own skeptic gates, and its own quality bar. 5 verification gates total. Here's the full map.

Pipeline 1 — Study Team Analyze one paper. Extract every finding. Verify twice. 11 steps · 3 skeptic gates

Searches PubMed, Google Scholar, and journal databases to find every relevant study for a topic. Produces the cluster blueprint that every downstream agent reads from.

Cannot Judge study quality. Skip niche studies. Every relevant paper must be found — missing one means incomplete evidence for claims later.

A human provides the full paper. Pipeline pauses until the complete document is verified.

Reads every page, table, and footnote. Extracts structured findings with exact numbers, methodology, limitations, and direct quotes.

Cannot Interpret or draw conclusions. Extracts exactly what the paper says — nothing more.

Verifies every extracted finding against the original paper. Numbers, quotes, framing, limitations — all checked independently.

Cannot Approve "close enough." Either the extraction matches the paper exactly, or it gets rejected.

Three agents enrich the study: cultural context and myth-busting angles, search intent mapping, and emotional arc design for the narrative.

Cannot Override verified findings for engagement. The story serves the evidence — never the other way around.

Writes the editorial "match analysis" — magazine-quality narrative. A dedicated Audio Agent then rewrites for spoken rhythm (not read-aloud).

Cannot Add hype. Change qualifiers. Every sentence must trace to a verified finding.

Checks every sentence of the finished page and audio against verified findings. Did the writing change the meaning? Did we add unsupported claims?

Cannot Approve content that "mostly" matches. Any meaning drift gets sent back to the writer.

Validates the WordPress import package: structured data integrity, schema output, trust metadata, and cross-references against verified findings. Nothing reaches the live site without passing this final study gate.

Cannot Skip structural checks. A missing field, broken schema, or mismatched metadata halts the import.

Pipeline 2 — Claim Team Synthesize findings from multiple studies into evidence-based answers. 14 steps · 1 skeptic gate

Scans TikTok, Instagram, YouTube, Reddit for high-engagement topics. Cross-references social demand with all verified findings to produce the claim blueprint.

Cannot Create claims without evidence. Every claim must be answerable with verified study findings.

A human reviews and locks the claim blueprint — which questions to answer, which evidence supports each. No claim is produced without human sign-off on the plan.

Gathers findings from multiple studies, weighs evidence quality, handles contradictions, produces an evidence-verified answer with full evidence base.

Cannot Cherry-pick evidence. Inflate consistency scores. Every divergent study must be included and weighted.

Writes the 500-700 word answer page with kitchen translations and Evidence Consistency Matrix. Dedicated Audio Agent rewrites for earbuds.

Verifies the multi-study synthesis is honest and every statement in the answer page traces to the evidence base. Was evidence cherry-picked? Is the consistency index justified? No overclaiming. No drift. Missing one divergent study = instant rejection.

Cannot Accept claims broader than the evidence supports. Would an independent researcher agree with this synthesis?

Pipeline 3 — Library Team Weave answers into flagship guides people read 100 times. 10 steps · 1 skeptic gate

Decides which flagship items to produce: the Definitive Guide, the Myth Buster, the Skeptic Review. Maps which claims feed which sections.

Writes the masterpiece — 2,000-4,000 word editorial weaving claims into narrative. Dedicated Audio Agent produces a 3-6 minute mini-podcast episode.

Cannot List claims. The guide must weave them into flowing editorial where every citation feels like discovery.

Every claim reference in the guide is accurate. No overstatement. No misrepresentation. Certainty tiers match the source claims.

Cannot Let creative writing override accuracy. If a claim has moderate certainty, the guide cannot present it as settled science.

Shared — All Pipelines

Quality scoring against world-class bar. Internal linking across all three layers. WordPress import with full schema output.

A human reviews the final content and clicks publish. AI doesn't press the publish button. Humans do.

Evidence-Connected Pipelines

FitChef also runs two additional pipelines for recipes and shorts. These don't create new evidence claims — they inherit trust from the studies and claims above. But they still run their own verification to ensure nothing gets lost in translation.

Connected

Recipe Pipeline

Every macro traces to a verified study. 2 agents research and build each recipe. 22 kill switches verify ingredients, nutrition facts, and evidence connections.

22 kill switches

Connected

Shorts Pipeline

Quick-read articles that distill research insights. Every claim links to verified evidence. 8 verification gates check for overstatement and broken links.

8 verification gates

Where AI Gets It Wrong

AI makes mistakes. The question isn't whether it will — it's whether you catch them before readers see them. Here's what our pipeline catches most often.

Meaning Drift

"A modest benefit in trained athletes" becomes "everyone should do this." The most common AI error: gradually broadening scope while making text readable.

Confidence Inflation

"Suggests" becomes "proves." "May contribute" becomes "causes." AI writing agents naturally escalate certainty to make sentences flow better.

Missing Qualifiers

A study about young males gets presented as universal advice. The population qualifier disappears because AI optimizes for readability over precision.

Cherry-Picking

Highlighting the exciting finding while ignoring the discussion section where researchers warned about limitations and conflicting evidence.

Numerical Rounding

"47 studies" becomes "about 50 studies." Exact numbers replaced with round numbers. Small change, big integrity problem.

2 errors caught by the Skeptic across all content so far

Every Page Carries Its Own AI Fingerprint

This is what makes FitChef different from a generic "we use AI" disclaimer. Every individual page on FitChef carries machine-readable metadata that documents exactly what happened to it.

Meta Tag

fc:skeptic_note

The Skeptic's reader-facing note: key caveats, trust context, and any limitations the verification found. Present on every study page and claim page.

Meta Tag

fc:skeptic_verification

Machine-readable verification summary. What the Skeptic checked, what it found, and whether this page passed all gates.

Schema

JSON-LD Review + ScholarlyArticle

Every study page outputs structured Review schema (the Skeptic's assessment) linked to ScholarlyArticle schema (the underlying paper). Crawlable by Google and every LLM.

This isn't hidden in a developer console. It's the infrastructure that makes FitChef content verifiable by anyone — human readers, search engines, AI systems, and journalists. View the source of any study or claim page and you'll see it.

The System Evolves

Every time the Skeptic catches a pattern — not just a single error, but a recurring type of mistake — the pipeline upgrades itself. These upgrades are logged automatically and published here. The verification system you're reading about today is not the same one that launched.

Jul 2026 Fix

Meta-analysis participant counts corrected across 3 live imports

During Library Pipeline LP1 execution, kill switch KS-LP1-04 (participant count verification) flagged that two meta-analyses — Wolf 2023 (partial vs full ROM, 24 studies) and Rosa 2023 (single vs multi-joint, 7 studies) — had their study counts displayed as participant counts on the live site. Wolf showed n=24 instead of n=658 (sum of 23 known individual study Ns from Table 1). Rosa's claim evidence entries showed n=7 instead of n=193 (corrected by R11 in the study import but not propagated to claim evidence). Root cause: when a meta-analysis paper doesn't explicitly report total pooled participants, P2 (Study Extractor) recorded the number of included studies as sample_size_final. This propagated through the full chain (extraction → synthesis → study import → claim import) without being caught. Three import JSONs corrected: Wolf study import (24→658), Wolf claim import (5 evidence entries, 24→658), Rosa claim import (4 evidence entries, 7→193). All fixes applied via Rule 23 compliant json.load → modify → json.dump with backups in clusters/_backups/.

LP1 Fuel Collector kill switch KS-LP1-04 flagged suspiciously low participant counts matching study counts for meta-analyses

Jul 2026 Verification upgrade

Recipe-based articles must now ground the standard in science before citing our own data

Our recipe-based articles answer 'can I actually do this?' using FitChef's verified recipe database as proof. A review found that some were leading with our internal database numbers instead of first establishing — from independent science — what the reader actually asked about. An article on whether salads are filling enough compared our recipes' median protein and fiber without ever establishing what makes a meal filling in the first place. We rebuilt the process so the order can never invert again: every recipe-based article must first ground the standard it measures against in real science (either peer-reviewed studies or our own research pipeline — never our recipe database deciding its own bar), and only then use our recipes to show whether that standard is met. Our recipe data can prove something is achievable; it can never decide what matters. New automated checks now catch four specific failure modes: measuring against a yardstick our own database invented, comparing a group of recipes against a pool that secretly contains that same group (which mathematically hides real differences), dropping a source's own caveat exactly where it would weaken the point, and describing an industry-funded source as 'independent.' The last two checks now run on every article, not just recipe-based ones.

Internal audit of the 16 live recipe-based articles found the underlying process built articles from our data outward instead of from the reader's question inward.

Jul 2026 Verification upgrade

Cross-cluster study page detection for Shorts fuel attribution

When a Short's fuel references a study via its satellite path in one cluster, the study may have its own full study page in a different cluster. Previously, if the fuel didn't include a DOI, the verification gate had nothing to match against — the source shipped without a study page link even though one existed. SV1 now performs a name-based fallback search when both slug and DOI are empty: it extracts the author surname and year from the source field and searches the study corpus for a matching study page. If found, the slug and DOI are populated automatically.

BMI-vs-DEXA Short (KW-708) — Longland 2016 referenced via satellite in fat-loss cluster, but study page lives in protein cluster. Empty slug and DOI shipped; caught by human review.

Jul 2026 Verification upgrade

Anti-Rationalization Gate added to evidence assembly

During a Shorts production run, the AI found a full-text source URL that would have strengthened the evidence package — then rationalized it away instead of presenting it. The completeness framework allowed the AI to invent an intermediate category (⚠️) between PASS and FAIL, then re-reason those results to PASS without locking new evidence. The new Anti-Rationalization Gate (KS-SF1-39) makes all completeness tests strictly binary: PASS or FAIL, no middle ground. A FAIL can only become a PASS by locking a new source that directly addresses the failure — never by re-reasoning. Any source URL found during the search checklist MUST be presented when any test is flagged. Evidence resolves doubt, not reasoning.

KW-692 production incident: AI found Kresge & Melanson 2015 full text at URI Digital Commons, then abandoned it after re-reasoning completeness tests from ⚠️ to ✅. Caught by human review of the fuel package.

Jul 2026 Verification upgrade

Audited every Short for medical-advice risk — added two new safety gates

We ran a full legal and medical-advice audit across all 509 published Shorts — every field, from the headline and search description to the audio script and FAQ answers. FitChef's whole model is to report what the research found, never to give medical advice, so we pressure-tested whether any Short could read as a medical verdict. Two gaps surfaced, and we closed both at the pipeline level so no future Short can reproduce them. First: every Short's audio must now end with a spoken disclaimer — that FitChef reports research rather than medical advice, and that listeners should consult a qualified professional — because audio can play far from the page, where our written disclaimers live. Second: the parts of a Short that appear alone in a Google result or a shared social card — the title and description — can no longer state a disease-prevention, safety, or treatment outcome as FitChef's own verdict; they must ask the question or attribute the finding to the study. Both rules are now enforced by automatic checks that block a Short from publishing if it fails.

Internal legal and medical-advice audit of all 509 live Shorts (July 2026)

Jul 2026 Verification upgrade

New kill switch: publication type verification (KS-C0-PUBLICATION-TYPE)

The pipeline now mechanically verifies that every study entering the production queue is a full peer-reviewed paper — not a conference abstract, poster presentation, letter to the editor, or editorial supplement. A conference abstract (Heffernan 2017, Proceedings of the Nutrition Society) passed all existing verification gates because it had a valid DOI, was indexed in PubMed, and reported real meta-analytic results. It was caught during paper ingestion when the source revealed it was a ~600-word conference proceeding with no full Methods or Results sections. The new KS-C0-PUBLICATION-TYPE kill switch adds mandatory publication-type checking to the citation verification process, with specific red flags for journal names containing 'Proceedings' or 'Supplements', single-page references, and 'OCE' issue labels. Zero live content was affected — the study was caught before entering the production pipeline.

Food Science & Bioavailability cluster P1 paper ingestion (2026-07-17) — vitc-iron-absorption-meta flagged as conference abstract

Jul 2026 Verification upgrade

New gate prevents recipe-fuel Shorts with zero grounded evidence

During production of a recipe-fuel Short about fiber and bloating, the system produced impressive-looking recipe scan data — 478 matching recipes, diverse macro distributions — but the keyword's actual question (GI physiology: why bloating happens, how to avoid it) cannot be answered by recipe database evidence. The scan was high-quality but irrelevant. Two new structural gates now verify that recipe data can actually answer the reader's question BEFORE any scan runs. If the core answer requires non-recipe evidence (physiology, biochemistry, medical science), the keyword is rejected as recipe-fuel regardless of how good the scan looks.

KW-599 ('how to increase fiber without bloating') — recipe scan produced valid data that masked a total evidence gap. Human caught it before publication.

Jul 2026 Verification upgrade

Recipe-fuel Shorts now require scientific grounding for selection criteria

Recipe-fuel Shorts recommend specific foods based on a scientific premise — for example, showing casein-rich recipes 'for muscle recovery' assumes casein before bed actually supports muscle protein synthesis. Previously, the recipe scan proved WHAT recipes exist but the WHY (the science behind the food selection) was treated as optional. A Short could lock with impressive recipe data and zero scientific evidence for why those foods were chosen. Now, every essential scientific claim behind the recipe selection must carry grounded evidence — with verbatim quotes traceable to source files — in the Short's own fuel file. The system searches cluster studies, existing Short evidence files, and claim/library files before pursuing external sources. No recipe-fuel Short can publish recommendations based on assumed science.

KW-601 ('what to eat before bed for muscle recovery') — 56 casein-rich recipes locked as fuel with 'Internal evidence: None needed.' The Short recommended casein foods for recovery without any grounded evidence that casein before bed actually works. The science exists in the platform but was never pulled into this Short's evidence chain.

Jul 2026 Verification upgrade

Post-Edit Field Diff Gate — catches unintended mutations in live import data

When the pipeline adds links or metadata to an existing published article, it now runs a mandatory diff check after every edit: no fields added, no fields removed, no field values changed outside the specific set the step is allowed to touch. Any violation aborts the edit and restores the backup. This catches a class of bug where editing code accidentally mutates fields it was never supposed to touch — the data passes through serialization faithfully, so the corruption is invisible without the diff check.

KW-489 production run — SV1 added a field with a wrong name and SL1 changed recipe bridge data in 2 live Shorts while only body text should have been modified. Both Shorts were already live and importing cleanly before the edit. The mutations were caught during reimport validation.

Jun 2026 Verification upgrade

Recipe bridge slug resolution gate added

Added a mandatory slug resolution step (Step 7D.4b) to the Short Verifier. When SV1 writes recipe bridge entries, it must now resolve the Short's slug from its own import JSON — never from the folder name. For 18+ Shorts, the internal folder name differs from the WordPress slug, and using the folder name produced 404 links in the 'Why This Works' section on recipe pages. Kill switch KS-SV1-69 enforces this check.

2026-06-18 hummus Short: SV1 wrote folder name into both recipe bridges, producing 404s. Audit found 5 additional broken entries across the corpus.

Jun 2026 Verification upgrade

Programmatic fuel verification gate added to Shorts pipeline

Added a mandatory programmatic verification step to the Short Fueler agent (SF1). After assembling the evidence package for each Short, every verbatim quote is now string-matched against its referenced source file using a Python script — not a mental check. If any quote is a paraphrase rather than an exact copy, the fuel file is rejected and corrected before downstream agents can use it. Every specific number in the creative direction section must also trace to a grounded evidence entry. Previously, the spec required quotes to be exact but had no enforcement mechanism to verify this programmatically. Caught during production when a quote from an internal study was found to be a correct paraphrase (right meaning, wrong words) rather than a verbatim copy.

During KW-404 production, a post-assembly audit found that an internal evidence quote was paraphrased rather than copied verbatim from the extraction file. The underlying science was correct, but the quote chain was not character-perfect. The existing kill switch (KS-SF1-15) declared quotes must be findable but had no programmatic check to enforce it.

Jun 2026 Verification upgrade

Grounding Integrity Law — evidence files can never contain AI-fetched content

Added a three-part grounding integrity law to the Shorts pipeline. Part A: Claude can never use content it fetched from the web as grounded evidence — only content the human researcher physically shared (pasted or uploaded) can be locked as evidence. Part B: When a PDF is uploaded, the evidence file must contain verbatim excerpts from the document, not Claude's reorganized interpretation. Part C: After creating any evidence file, Claude must verify every claim traces back to the human-provided source material. Claims from web research are separated into a clearly labeled ungrounded section. This ensures every fact on a FitChef page can be traced through an unbroken chain: published paper → human verification → locked evidence file → final article.

During ACV weight loss Short production, Claude created an evidence file by reorganizing PDF table data into narrative paragraphs (accurate but not verbatim) and embedded a retraction finding from web research into the grounded evidence section without separating grounded from ungrounded context.

Jun 2026 Bugfix

Corrected misattributed author on creatine body-composition meta-analysis

Three live pages (two claim pages and one library guide) attributed the creatine body-composition meta-analysis (DOI: 10.1080/15502783.2024.2380058, 143 RCTs) to 'Bonilla et al.' instead of the actual first author, Pashayee-Khamene et al. The study page itself was already correct — the error was only in claim and library pages that referenced the study. The misattribution originated during cluster architecture (C0), where the AI confused a prominently cited reference author (Bonilla DA appears as Reference #1 in the paper) with the paper's actual authors. Triple-verified from PubMed (PMID 39042054), Taylor & Francis, and the full-text PMC article. All 36 incorrect references across 6 files corrected; zero data loss confirmed via programmatic diff against backups. The prevention mechanism for this error class — KS-C0-SAT-CONSISTENCY and P1 plausibility check — was already deployed in a previous update.

Manual review by Mark questioned authorship. Investigation confirmed the actual paper by Pashayee-Khamene F, Heidari Z, Asbaghi O et al. has no author named Bonilla. Diego Bonilla appears only as a cited reference within the paper, not as a co-author.

Jun 2026 Verification upgrade

Satellite brief consistency verification — two new kill switches

Added two new kill switches to catch same-author paper confusion in satellite assignments. KS-C0-SAT-CONSISTENCY (C0 v4.3) requires cross-checking every satellite's extract_this_finding against its verified paper title from citation verification — catches cases where a finding describes Paper X but the DOI points to Paper Y by the same author. P1 v1.7 adds a safety-net plausibility check at paper ingestion: when Mark pastes a satellite paper, P1 now verifies the paper's title and abstract plausibly match the finding the satellite is supposed to contribute. Together these create a two-layer defense against the highest-risk satellite error: same-lab, same-topic author confusion.

Aging cluster prep discovered Morton 2016 hormone RCT was assigned Morton 2018 protein meta-analysis metadata — two different papers by the same first author from the same lab. Same session: Orsatti 2023 DOI resolved to Khalafi et al. 2023. Both are same-author confusion that existing citation verification (which checks DOI existence but not finding-title alignment) could not catch.

Jun 2026 Verification upgrade

Thesis Jaccard Gate prevents duplicate content at scale

Added a new mandatory gate (Tier 1B) to the keyword separation system. The existing lexical gates check word overlap and substrings, but two keywords can use different words while asking the same question — 'does evening exercise ruin your sleep' vs 'does evening exercise affect sleep' share zero substrings but have an identical thesis. The new gate computes Jaccard similarity specifically against the thesis-carrying fields (the primary keywords and claim queries that define what each page answers), auto-killing candidates above 60% similarity and flagging those above 35% for manual thesis-difference review. On the batch that triggered this upgrade, the new gate would have caught 25 duplicate candidates that the lexical gates missed — a 62.5% false-pass rate reduced to zero.

2026-06-26 SK0 keyword batch run: 40 candidates passed all existing gates, but manual audit found 25 were duplicates of existing Shorts or Claims using different phrasing for the same question

Jun 2026 Verification upgrade

Pull Quote Isolation Test — standalone readability enforcement

Added a mandatory Isolation Test to the narrative writer (R1) and field verification (R7) agents. Every pull quote must now pass a three-question self-containedness check: can a reader identify the subject, the claim, and at least one specific data point without any article context? A 60-character minimum floor catches compressed fragments mechanically. Pull quotes that work as narrative pacing devices but fail as standalone cards are now blocked before they reach the site.

Quality review found pull quotes rendering as vague fragments ('A 73% crash.', 'More going in. Less showing up.') — dramatic as narrative beats but meaningless as standalone cards. Data analysis across ~75 studies confirmed only 2 outliers, both from recent runs. Dual execution failure: writer produced staccato fragments, verification agent passed them despite existing standalone readability check.

Jun 2026 Verification upgrade

Visual-vs-prose consistency gate added

When a visual element displays a data value alongside a threshold or benchmark, the pipeline now verifies that the body text's characterization of the relationship between those numbers is mathematically defensible. A visual that shows readers one thing while the prose claims another undermines trust — this gate catches the mismatch before publication. Added to both the Visual Team (VT) and Short Verifier (SV1) specs.

External AI review flagged a live Short where a visual showed a value just below a threshold while the body text claimed the value 'cleared' it. The underlying science was correct, but the prose overclaimed relative to what the reader could see.

Jun 2026 Verification upgrade

Pull quote attribution now distinguishes editorial synthesis from verbatim quotes

Pull quotes on study and Short pages were rendered with curly quotation marks and researcher-name attribution (e.g., '— Morton et al. 2018'), which visually implied the text was a direct quote from the paper. In reality, pull quotes are always FitChef's editorial synthesis — rewritten in plain language, never raw academic text. The theme now renders pull quotes without quotation marks and with 'Based on {source}' attribution, making it clear the insight is our interpretation of the research, not the researcher's own words. Pipeline specs (R1, R7, Shorts rendering model, Component Library) updated to reflect and enforce the new format. 142 study pull quotes and 116 Short screenshot sentences are affected — all corrected automatically by the theme change with no import JSON edits needed.

External review (Gemini) flagged that a pull quote on the Collagen Short attributed editorial prose to Ravindran et al. (2026) in a format indistinguishable from a verbatim quote. Audit confirmed the pattern was systemic across all studies and Shorts.

Jun 2026 Verification upgrade

Statistical notation gate now catches correlation coefficients

R9's Zero PubMed mechanical code gate previously caught p-values, confidence intervals, heterogeneity notation, effect sizes, and odds/risk ratios — but not correlation coefficients (r-values). A live Short shipped with raw r=0.07 and r=0.60 in reader-facing text because the gate's regex didn't include the pattern. The gate now scans for correlation coefficient notation and translates it to plain language, closing the gap.

Live Short ('why-regain-weight-after-every-diet') found with raw correlation coefficients in takeaway box, body text, SEO description, and speakable text — all Magazine layer fields that should contain zero statistical notation.

Jun 2026 Expansion

Short ↔ Recipe Bridge — cross-content grounding through shared evidence sources

Shorts and Recipes are now connected bidirectionally when they share a grounded evidence source (study DOI, claim slug, or internal synthesis register entry). SV1 Gate 7D scans recipe imports when verifying a Short; RR2 Step 7.7 scans Short imports when building a recipe. Both directions enforce the HELL YES threshold: the recipe must physically deploy the mechanism the Short explains. Categorical similarity ("both about protein") is never sufficient. Three new kill switches enforce slug verification, mechanistic-only grounding, and a per-page cap. The bridge creates a knowledge graph where readers can move between "why this works" (Short) and "put this into practice" (Recipe) — grounded by the same underlying evidence.

Architecture decision to connect the two newest content types through shared evidence rather than topic similarity

Jun 2026 Verification upgrade

P4 Source Hunter now cross-checks bomb derivations against verified extraction data

Added a mandatory derivation cross-check (Step 4B) to the Source Hunter agent. Every number in a bomb derivation that claims to come from the study must now match a verified point estimate in the extraction — not a confidence interval bound, subgroup footnote, or table annotation. All derived values (ratios, percentages) are recalculated from verified point estimates. This catches the most common derivation error: using a CI bound instead of an effect size.

During the Dupuy 2018 recovery study run, P4 used a confidence interval bound (g = −0.20) instead of the point estimate (g = −0.47) for cold water immersion. This produced an incorrect 11x ratio instead of the correct ~5x ratio. The error propagated through the grounded file and was not caught until the Number Skeptic (R6) — 8 agents later. Creative content used the correct value from extraction, but the discrepancy persisted in the grounded info-gain file.

Jun 2026 Verification upgrade

Medical Topic Exclusion Gate added to Shorts pipeline

New two-layer gate systematically excludes keywords that target medical conditions from the Shorts pipeline. Keywords about blood sugar management, blood pressure, depression treatment, cardiovascular disease, or any clinical condition are now automatically killed — even when the evidence is strong and actionable. FitChef is a fitness editorial platform for people building their bodies, not a health information site for people managing medical conditions. SK0 (Keyword Scout) enforces this at mining time; SF1 (Short Fueler) enforces it again as a safety net at production time. Seven keywords from the June 2026 bank refill were retroactively killed under this gate.

SK0 v2.9 bank refill admitted 7 keywords targeting blood sugar, blood pressure, depression treatment, and heart health. All passed the existing Body-Composition Filter (their answers DO change behavior) but their audiences are patients managing conditions, not gym-goers optimizing performance.

Jun 2026 Verification upgrade

External source claims now require human verification

When a claim page names a specific external source with a specific number (e.g., 'Healthline says 44 to 78 grams'), the pipeline now requires that source+number pair to be human-verified — the same standard applied to scientific evidence. Previously, web-fetched competitive data from SEO research could enter narrative prose without human verification. The pipeline already distinguished cultural fuel from grounded evidence at the research stage, but the distinction was not enforced at the writing stage or caught by the content skeptic. Two new gates close this gap: a writer-level rule preventing unverified named-source claims, and a skeptic-level gate catching any that slip through.

Mark's review of CL-004 creative output identified that web-fetched competitive data was being used as grounded facts in prose.

Jun 2026 Verification upgrade

Macro attribution accuracy gate added to recipe pipeline

Added a new verification rule (SR9 + KS-RR2-19) ensuring that nutritional values stated in recipe descriptions, audio scripts, and FAQ answers are attributed to the whole meal rather than a single ingredient. Recipe macros describe the combined contribution of every ingredient on the plate. An audit of 201 recipes found cases where editorial text credited the full meal protein to one ingredient while grains, vegetables, and other sources contributed meaningfully. The pipeline now requires meal-level language for meal-level numbers.

Manual audit discovered recipe audio scripts attributing total meal protein to a single protein source, ignoring contributions from grains, legumes, and vegetables

Jun 2026 Verification upgrade

Cross-Content Consistency Gate added — catches contradictions between pages

Added a new verification gate (Gate 1C, KS-SV1-46) that checks every claim in new content against all existing live content for contradictions. Each piece of content was already verified against its own sources individually — but two pages could each be grounded in real research while still contradicting each other if the interpretation drifted. This gate catches that drift before it goes live. Triggered by discovering two live Shorts making conflicting claims about the same tissue: one called muscle "the strongest predictor of calorie burn" (an interpretation drift from the source, which said "fat-free mass"), while the other correctly noted muscle is "metabolically quiet at rest" at 6 calories per pound per day. Both were individually grounded in real studies. Neither was wrong about its own source. But the reader saw two pages on the same site saying opposite things. The contradiction has been corrected, and the new gate prevents any future content from shipping with cross-page inconsistencies.

Manual audit discovered two live Shorts (KW-035 muscle-does-not-turn-to-fat and KW-015 muscle-calories-at-rest) making contradictory claims about muscle's metabolic contribution — each individually grounded, but conflicting due to interpretation drift from source verbatim

Jun 2026 Verification upgrade

Body-Composition Filter now enforced across all content tiers including Shorts

The Body-Composition Filter — which ensures every FitChef page changes how you eat, train, or build your body rather than providing medical reassurance — was already enforced at study tier (C0 Cluster Architect) and claim tier (Claim Rule 8). It was NOT enforced in the Shorts pipeline. Two high-demand keywords passed every existing gate (separation, evidence mapping, competition assessment) but their honest answers were medical reassurance: 'your kidneys are fine' and 'it's safe long-term.' Both killed. SK0 (Keyword Scout) now has a mandatory Body-Composition Filter gate that catches these keywords before they enter the production queue. SF1 (Short Fueler) has a safety-net gate that catches any that slipped through older bank versions. Editorial Foundation Rule 9 — the filter applies at every tier identically — is now mechanically enforced end-to-end.

Keyword bank audit found KW-021 ('is creatine safe long term') and KW-024 ('will too much protein damage kidneys') in the production queue. Both had massive search demand and zero competition — but both fail the Body-Composition Filter because their answers are safety reassurance, not behavioral change. The protein cluster build plan had already excluded the kidney topic at study tier with cut_reason 'medical_reassurance.'

Jun 2026 Pattern fix

Guide headline stats now provably match the studies on the page

A guide's headline numbers — how many studies and how many participants — must now be computed from that guide's own analyzed studies, not borrowed from any single underlying question. During an internal audit we found one guide whose 'total participants' figure had been pulled from a single claim's evidence set, most of it from one large study, which made the headline unrepresentative of the guide as a whole. We corrected the figure to reflect the guide's actual study set, and added a verification step that blocks any headline stat that can't be reconstructed from the studies shown on the page.

Internal audit found the Carbs guide's hero stat reported a participant total lifted from one claim (dominated by a single 68,128-participant study) instead of the guide's own studies.

Jun 2026 Bugfix

Removed a misleading “extra” from an ultra-processed-food hero stat

The study's headline read '340 extra calories.' The 340 figure is grounded — it is how many calories the measured eating rate delivers in a 20-minute meal — but the word 'extra' implied it was the between-diet difference, which is actually 508 calories per day (stated correctly in the key finding). We removed 'extra' so the headline number is unambiguous.

Internal audit found the hero stat labeled a 20-minute intake figure as 'extra' calories (Hall 2019).

Jun 2026 Bugfix

Corrected a meta-analysis's “participants” figure that was actually its study count

On the sugar-and-body-weight study page and a claim that cited it, the 'Participants' figure read 68 — but 68 is the number of studies the meta-analysis pooled (30 randomised trials plus 38 prospective cohort studies), not participants. The source paper reports per-analysis effect sizes, not a single participant total, so we removed the incorrect figure rather than substitute an invented one.

Internal audit traced the study page's 'Participants: 68' to the meta-analysis's pooled study count (Te Morenga 2013, BMJ).

Jun 2026 Fix

Removed unverifiable sample size from Schoenfeld 2017 load meta-analysis

The Schoenfeld 2017 study page stated '684 participants' as the pooled sample size across 21 studies. During the deep pre-scale audit, we verified this number against the full paper (J Strength Cond Res 31(12):3508-3523): the paper does not report a pooled participant total. Table 1 lists per-study sample sizes summing to 630, but this includes control groups excluded from the meta-analysis — the actual analyzed N is lower and unreported. 684 appeared nowhere in the paper text, abstract, or tables. Rather than replace it with a derived estimate, we removed the participant count entirely and retained '21 studies' as the scope descriptor, consistent with what the paper itself reports. The training library flagship participant total was updated accordingly (9,350 → 8,666). A cross-reference in the Zhang 2025 exercise-deficit study was also corrected.

Deep pre-scale quality audit (Check 1: study number traceability). Verified 17,599 numbers across 224 published pieces; Schoenfeld 684 was one of two medium findings.

Jun 2026 Fix

Corrected protein badge to match EU nutrition claim threshold

The Stuffed Portobello Mushrooms recipe displayed a 'High Protein' badge, but protein provided only 16.7% of total energy — below the EU EFSA threshold of 20% required for 'high protein' claims (Regulation 1924/2006). The badge was changed to '35g Protein' (a factual statement, not a regulated claim). The recipe pipeline spec (RR2) was updated with mandatory EFSA threshold checks so future recipes are automatically verified before a nutrition badge is assigned.

Deep pre-scale quality audit (Check 17: EFSA health-claim compliance).

Jun 2026 Verification upgrade

New Rule: Zero Fabrication on Import Fix

Added Rule 24 to the pipeline master instructions. When fixing import validation errors (type mismatches, missing fields), the system must never populate fields with inferred or derived content. Only three actions are permitted: type conversion with empty values, direct verbatim copy from a grounded source file with explicit tracing, or mechanical transforms specified by the pipeline spec. Every non-empty value must trace to an exact source file and field path — otherwise the field stays empty. This closes a gap where reasonable-sounding inferences could bypass the grounding requirement.

During a post-pipeline type-mismatch fix, the system fabricated inclusion criteria by inferring from sample characteristics instead of using empty containers. Caught during import review.

Jun 2026 Bugfix

Pull-quote attribution now matches the actual source

The pull-quote blockquote (the shareable sentence shown mid-article in Shorts) attributed the finding to the first study in the fuel list regardless of which study the sentence actually described. For multi-source Shorts this could display the wrong author. A new field now explicitly maps each pull-quote to its correct source, and a kill switch (KS-SV1-34) verifies the match before any Short ships.

Shorts Scale Audit discovered nighttime-carbs Short attributed Sofer et al. 2011's finding (28% more weight loss with dinner carbs) to Gardner et al. 2018 (a different study that happened to be first in the fuel array). Second mismatch found in processed-food-speed-trap Short (Forde finding attributed to Hall).

Jun 2026 Bugfix

LLM citation hint now shows actual answer text

The 'AI systems — cite as:' line in the Cite This Short block was showing a CSS selector (.fc-short-takeaway) instead of the actual citation-ready answer for 5 Shorts, and was missing entirely for 18 Shorts. All 27 Shorts now display the correct answer capsule text that AI systems can use for accurate citation.

Shorts Scale Audit discovered 5 Shorts rendering '.fc-short-takeaway' as the LLM citation hint and 18 Shorts with no hint at all. Root cause: SC1 spec described the field ambiguously (CSS selector vs text), and SV1 never included fcc_short_speakable in its field template.

Jun 2026 Verification upgrade

Data Register Verification added to Shorts pipeline

Added five register-level thinking rules (Paste-Back Test, Feelability Gate, Researcher-as-Subject, Authority-Label, Compound-Noun) to SW1 (writer) and a new Gate 6C to SV1 (verifier). These catch sentences that pass vocabulary checks but read like study-results prose rather than FitChef story voice. The existing Gate 6B caught jargon words; Gate 6C catches the register — the frame in which data is presented.

Protein-before-bed Short (KW-001) passed all existing gates but contained '+8.4 cm² vs +4.8 cm²', 'Trommelen and colleagues tracked', and 'Type II muscle fibers' — structurally sound vocabulary but study-report register.

May 2026 Bugfix

Statistical detail rendering — full numbers now visible on all claim pages

Fixed a rendering gap where 34 of 45 claim pages showed study names in the Statistical Detail accordion but not the actual numbers (effect sizes, p-values, heterogeneity). The theme template now renders both the canonical format and the legacy flat-key format. Pipeline specs (CR7, CR12) updated to mandate the correct structure for all future claims.

CL-002 production run revealed statistical numbers missing from the accordion. Deep audit confirmed 34 of 45 claims affected.

May 2026 Improvement

Sidebar participant counts corrected after cross-system audit

An independent AI review flagged discrepancies between article body numbers and sidebar Evidence Base widgets. Investigation traced the root cause: the agent building sidebar data had no mandated source for participant counts, leading to 6 values across 3 clusters that could not be traced to verified study extractions. Every article body number was confirmed grounded — the issue was isolated to the sidebar rendering layer. All 6 values have been corrected to match verified extraction data, and a new kill switch (KS-LIB-11) now requires every sidebar participant count to be sourced from the same grounded extraction files that feed the article body.

Independent AI cross-check flagged body-sidebar number mismatches

May 2026 Improvement

New blocking gate prevents sidebar-body number mismatches

Added KS-LIB-12: a mandatory cross-check that compares sidebar participant totals against the verified fuel file breakdown before any library page can be imported. If the sidebar sum does not match the article body total, the import is blocked until the discrepancy is resolved. This gate makes it structurally impossible for sidebar and body numbers to diverge — the same grounded data must feed both.

Number consistency audit revealed no existing gate caught the mismatch

May 2026 Bugfix

Study participant counts standardized to prevent silent parsing errors

Audit discovered that some study pages stored participant counts as freeform text (e.g., '49 studies, 1863 participants' or 'n=20 (10 per group)'). The rendering system extracted the first number it found — sometimes reading a study count as a participant count, or reading zero from text starting with a letter. Seven study imports and three claim imports have been corrected to clean integers, each verified against the original study's grounded extraction data.

Number consistency audit found PHP integer casting silently misreading freeform sample size values

May 2026 Verification upgrade

Four-category jargon sweep added to Shorts verification

The Short Verifier now scans every Short and its audio script for four categories of technical language: statistical notation, biomedical terms, exercise-science jargon, and paper-apparatus vocabulary. Any sentence a smart 8-year-old wouldn't understand on first read is caught and sent back for rewriting before it can publish. FAQ answers get the same sweep — no p-values or confidence intervals in reader-facing dropdowns.

Quality audit of first three Shorts found jargon leaks that passed verification: statistical notation in prose, untranslated clinical terms in audio.

May 2026 Verification upgrade

Audio scripts now grounding-verified against source research

Every number, claim, and finding in a Short's audio script must now trace back to the grounded fuel files — the same verified research that powers the written content. Audio translates verified content into spoken-word format; it can never add new claims, regardless of whether the claim is true. The verifier checks audio against fuel files before the Short can publish.

Architecture review found Shorts audio had no grounding constraint — the study pipeline's audio agent (R4) had this rule but the Shorts pipeline did not.

May 2026 Bugfix

Citation links now verified against actual published page URLs

The Citability Engineer and Short Verifier now verify every source citation link against the actual published page URL from the import system. Previously, citation URLs could use internal folder names instead of WordPress page slugs, producing broken links. The verifier also ensures external sources with DOIs always have clickable links — readers can click through to verify every cited study.

Review of first three Shorts found four broken citation URLs (wrong slugs) and two external sources with DOIs that had no clickable link.

May 2026 Verification upgrade

Universal Citation Verification Gate — every study now verified before entering the pipeline

The citation verification gate (KS-C0-CITATION) now requires every study — whether discovered by Claude or suggested by Gemini — to be verified via WebSearch or WebFetch before it can enter the cluster plan. Previously, only Gemini-suggested studies required verification. This upgrade was triggered when a hallucinated citation ('Ravelli 2019') was detected in the fat-loss cluster plan — the paper did not exist in any indexed source. The real paper had a different author, year, and journal. The hallucination never reached any published page (caught before the study entered the pipeline), but the gap that allowed it to enter the plan at all has now been closed. Every citation in every cluster plan is now tool-call verified.

Hallucinated citation detected in fat-loss cluster architecture during production queue review

May 2026 Verification upgrade

Answer capsule standalone gate now catches title-dependent openers

The answer capsule appears in featured snippets, AI citations, and the answer hero card — all contexts where the page title may be absent. The standalone gate previously verified that the answer named its subject and embedded its purpose, but did not check whether the opening word was a direct response to the headline question. An answer starting with 'No —' or 'Yes —' only makes sense if the reader just read a yes/no question above it. In standalone surfaces, there is no question. The gate now includes a fifth test: the first word must make sense without a question preceding it.

CL-005 test-boosters-mostly-scam-one-exception — answer_short 'No — three out of four ingredients...' passed all four existing tests but opened with a title-dependent 'No —' that has no referent in standalone contexts

May 2026 Verification upgrade

New blocking gate: Answer Capsule Standalone Verification

Added Step 3E to the Content Skeptic (CR10) — a blocking gate that verifies the answer capsule works as a standalone statement without the page title. The answer capsule appears in Google Featured Snippets, AI citations (ChatGPT, Perplexity), and the answer hero card — contexts where the title may be absent. The gate checks that the text names its subject and embeds its purpose. Previously, if the answer capsule writer (CR4) falsely logged a standalone PASS, no downstream agent caught it. On CL-003, the answer started with 'Yes — but the boost is smaller than it feels' — which doesn't tell the reader WHAT is being discussed without the title above it. The gate now catches this pattern before publication.

CL-003 preworkout-caffeine-small-real-edge — answer_short standalone failure propagated through entire pipeline undetected

May 2026 Verification upgrade

Doctor’s-letter test expanded to body prose — 4 prescriptive sentences caught and fixed

A full legal audit of all 60 import JSONs across 4 clusters found 4 sentences in claim body prose that read like dietary prescriptions rather than evidence reporting — all using 'aim for' with specific gram or frequency targets. The verification gate (CR10 Step 3A2) already scanned persona actions, FAQs, and skeptic notes for this pattern, but body prose was not included in the scan. All 4 sentences have been rewritten to describe what the evidence found rather than tell the reader what to do, and body prose is now included in the gate so the pattern cannot recur.

Pre-scaling legal audit of all import JSONs (2026-05-12)

May 2026 Verification upgrade

New mechanical gate catches raw statistical notation in narratives

A cross-cluster audit found that 7 of 23 study narratives contained raw statistical notation (p-values, confidence intervals, heterogeneity scores) that should have been translated to plain language. The existing Zero PubMed rule relied on agent judgment, which interpreted 'translate and keep' as compliant. Three spec upgrades close the gap: R1 now requires notation to be translated AWAY (not accompanied), R3 tightens the story-point exception to at most one instance per narrative, and R9 adds a mandatory regex code gate that mechanically catches any remaining notation before publication. Six study narratives were corrected.

Mark’s review of Schwingshackl 2013 live page revealed p-value density. Cross-cluster audit confirmed pattern across carbs and meal-timing clusters.

May 2026 Verification upgrade

Audio scripts now pass the doctor’s-letter test

A pre-scale legal audit of all 31 audio scripts found 5 scripts containing prescriptive imperatives ('aim for X grams', 'push toward X') — the same pattern found and fixed in written persona actions. All 5 scripts were surgically edited to replace imperatives with evidence-reporting language. Two audio scriptwriter specs were upgraded: CR9 (v2.4.0) for claim audio and R4 (v2.6) for study audio both now include an explicit doctor’s-letter test gate (CR9 Step 2C, R4 SR7B). The disclaimer check confirmed all 31 scripts already end with a spoken 'not medical advice' disclaimer — that system was already working correctly.

Pre-scale legal audit — Check 2 (audio scripts) following Check 1 (persona actions)

May 2026 Verification upgrade

FAQ answers and skeptic notes now pass the doctor’s-letter test

A comprehensive legal audit of all 164 FAQ answers and all skeptic notes across both clusters found 5 FAQ answers and 1 skeptic note containing prescriptive imperatives ('aim for X grams', 'you should aim for', 'eat more protein'). All 6 were surgically edited to replace imperatives with evidence-reporting language while preserving the same practical information. A final sweep of all 30 import JSONs confirmed zero prescriptive patterns remaining in any field.

Pre-scale legal audit — Checks 6 and 12 (FAQ fields and skeptic notes) following Checks 1-2 (persona actions and audio scripts)

May 2026 Verification upgrade

Persona actions and translations now pass the doctor’s-letter test

A pre-scale legal audit found that 14 persona action fields across 13 claims contained direct imperatives ('Aim for X grams', 'Push toward X', 'Move X from Y to Z') that could be read as medical advice rather than evidence reporting. All 22 affected fields across both clusters were rewritten to describe what the research tested and found, letting readers infer the action themselves. Three pipeline specs were upgraded with a new blocking gate: CR7 (v2.11.0) now bans prescriptive imperatives in persona actions and real-world translations, CR10 (v1.7.0) adds a mandatory doctor’s-letter verification scan before any claim can pass Gate 2, and R2 (v2.24) extends the same rule to study so_what fields. The reframe preserves the same practical information in the same plain language — the only change is whether FitChef tells readers what to do (old) or reports what was studied (new).

Comprehensive legal audit of all live content across protein and meal-timing clusters before scaling

Apr 2026 Launch

Claim Pipeline goes live — first multi-study synthesis verified

The Claim Pipeline (Phase 2) produced its first verified claim: 'Does intermittent fasting actually give you a better body than regular dieting?' This claim synthesizes 4 studies (529 participants) through a 15-agent pipeline with 18 verification steps, including evidence consistency scoring, content verification against all source papers, reader simulation, and Gemini cross-check. The claim pipeline adds a new layer of verification on top of individual study checks: every factual statement in the synthesis traces back to a verified study extraction, and a quality audit scores the final content across 5 dimensions (evidence integrity, content quality, dwell time design, SEO readiness, trust layer). This first claim scored 91.3/100 composite — above the 85 ship threshold.

First claim reaching production readiness through the complete claim pipeline

Apr 2026 Improvement

Legal safety audit v2: numeric scores removed from public pages + claim audio disclaimer added

Comprehensive 15-point legal safety re-audit of all FitChef systems. Found and fixed 4 places where internal numeric scores (consistency_index/100, trust_score/5) were rendered on public pages — claim cards, cluster hub rows, and OG meta descriptions. These numeric scores could imply medical authority or study validation (Legal Audit §1.3 violation). All replaced with human-readable certainty tier labels. Also found claim audio scripts were missing the legal disclaimer that study audio scripts have had since day one. Added mandatory disclaimer specification to CR9 (Audio Scriptwriter) and verification gate to CR10 (Content Skeptic). Fixed prescriptive heading language on study pages.

Full legal re-audit requested after many system iterations since original April 2026 audit

Apr 2026 Improvement

Internal numeric scores removed from all public outputs

Internal quality metrics (consistency index, data fidelity rate) were previously exposed in REST API responses, JSON-LD structured data, HTML meta tags, and AI-facing llms.txt. These numbers are used internally for sorting and quality gates but could imply FitChef is rating or evaluating science — which contradicts our identity as a content platform. All numeric scores have been removed from every public-facing output. The public now sees only human-readable certainty tiers (High/Moderate/Low) and factual counts (studies verified, claims grounded). The numbers continue to work behind the scenes for quality control.

Legal safety audit v2 — making all public outputs rock-solid for FitChef's identity as a content platform, not a research evaluator

Apr 2026 Improvement

Sidebar fixes + Legal text sweep + Mobile horizontal scroll fix

Multiple theme fixes shipped in v12.15.6: (1) Study sidebar 'What kind of study is this?' disclosure styled with custom CSS chevron. (2) Claim audio playlist CTA is now sticky at bottom-right. (3) Full legal text sweep — updated footer disclaimer, methodology page, skeptic protocol, and cite block headings to remove any implication FitChef conducts original research. (4) Claim card text overflow fixed with CSS line-clamp. (5) Studies archive mobile horizontal scroll fixed — Pattern A center columns now use split overflow (overflow-x: hidden, overflow-y: visible) instead of overflow: visible which broke horizontal containment.

Visual QA audit + Legal safety review

Apr 2026 Improvement

Claim pipeline: 3 automated code gates + voice-register enforcement added

Deep audit of claim pipeline vs. study pipeline revealed 12 quality gaps. Added three mandatory Python code gates to the claim editorial polish agent: bold distribution gate (ensures every content section has visual anchoring), paragraph length gate (max 400 characters — prevents wall-of-text rendering on mobile), and dense-outcome gate (catches results-section-style data dumps). Also added Pillar B voice-register enforcement matching the study pipeline's proven system — four-category jargon sweep, one-voice consistency check across all page fields, and gender-neutral language enforcement. These gates run automatically on every claim before it can pass editorial review.

Comprehensive claim vs. study pipeline comparison revealed claim pipeline lacked the mechanical quality enforcement the study pipeline has had since v3.5

Apr 2026 Improvement

First Claim Pipeline Execution Complete (CL-001)

The claim pipeline (Phase 2) processed its first claim: 'How much protein do you actually need per day?' (CL-001). 18 agents synthesized evidence from 4 studies (Morton 2018, Schoenfeld 2013, Nunes 2022, Jäger 2017 — Consistency Index 87, High Certainty). Quality audit score: 91.5 (SHIP). Gemini external review completed with one accepted fix. Infrastructure validation (CR13): PASS_WITH_GAPS — 0 blocking, 1 non-blocking (bridge metadata field not yet in plugin registry; bridge content preserved in body prose). Trust audit (CR14): PASS — 0 kill switches triggered, evidence chain fully traceable, trust page consistency confirmed (ClaimReview schema, LLM meta tags, Skeptic Protocol references all present).

Claim pipeline (Phase 2) completed first full production run for protein cluster CL-001

Apr 2026 Verification upgrade

Cluster Architect gains meta-analysis mandatory check and tension splitting gate

Post-launch review of the protein cluster revealed a critical gap: the 'how much protein when losing weight' question had no covering study — Wycherley 2012 (meta-analysis, 24 RCTs, 1,063 participants) was never evaluated as a candidate because the tension was framed as 'body recomposition' (Longland 2016), collapsing two different mass-audience questions into one. Two new rules now prevent this: A-S9 forces the Cluster Architect to search for meta-analyses for every tension with an RCT flagship, and A-S10 forces a two-archetype test to detect collapsed tensions. The protein cluster has been updated: 9 flagships (was 8), with Wycherley 2012 covering deficit populations that Morton's 1.62g/kg breakpoint explicitly excludes.

Post-launch protein cluster integrity review found Wycherley 2012 was never evaluated

Apr 2026 Verification upgrade

Independent Dwell-Time Verification Added to Claim Pages

Every claim page now undergoes an independent reading experience check by a second AI model (Gemini) that has never seen the pipeline. Gemini reads the page as a naive reader who Googled the question, identifies where engagement drops, and flags structural similarities between sibling claims. This model has zero authority over the evidence — all factual claims remain locked by the existing verification gates. The check catches dwell-time weak spots that the writing model cannot detect in its own work.

Claim pipeline design — claim pages need independent reading experience verification like study pages get from R10

Apr 2026 Expansion

Cluster Planning Rebuilt: Tension-First Architecture + Flagship/Satellite Studies + Body-Composition Filter

Every cluster is now built around the real debates fitness audiences argue about — not around which papers exist in the literature. Studies that answer the same question are grouped into flagship pages with convergent evidence sections, so you see one definitive page backed by multiple studies instead of redundant overlapping pages. Medical-reassurance topics (organ safety, mortality statistics) no longer earn standalone study pages — only findings that change how you eat, train, or build your body get published. Layer 3 of each cluster now includes four content types: a comprehensive Master Guide, a shareable Myths Piece, a methodology transparency Skeptic Note, and an interactive Tool (where applicable). The result is tighter clusters, zero redundancy, and every page serves a unique purpose.

Protein cluster C0 Phase A execution revealed five structural issues: redundant meta-analysis pages, medical-reassurance content passing the viral filter, Layer 3 underspecification, bottom-up planning producing academic completeness, and no upfront claim mapping. Cross-AI verification with Gemini confirmed and strengthened proposed solutions.

Apr 2026 Verification-upgrade

Gate 3b independent verification now phased for satellite studies

The independent skeptic verification (Gate 3b) now uses a structured two-phase process when satellite studies exist. Phase 1 completes a full forensic audit of the flagship extraction with its own verdict. Phase 2 then verifies each satellite individually with a targeted 6-point check against its own paper. This prevents quality degradation from information overload when the fresh verifier receives multiple papers at once. A scaling rule splits verification into separate sessions if satellite papers exceed 200KB combined.

First flagship-with-satellites execution (Morton 2018 protein cluster) — Mark flagged that the original single-prompt design could cause rushing when verifying 3+ papers simultaneously

Apr 2026 Verification-upgrade

Satellite Studies Now Fully Cited

Weight of Evidence satellite studies (independent research confirming or nuancing our flagship study) now receive the same citation treatment as every other source: inline [N] markers in the article text linking to the original paper, plus clickable source links on the evidence cards. Previously, satellite studies were mentioned by name but without verifiable links.

Morton 2018 first pipeline run: Nunes 2022 and Jäger 2017 appeared without source URLs (2026-04-16)

Apr 2026 Verification upgrade

Full Picture trust block now runs a mandatory cross-sibling swap test

The Full Picture trust block on every study page now runs a mandatory cross-sibling swap test before publishing. An audit found byte-identical Section 2 prose across four protein-cluster studies. Writing agents must now read at least two sibling blocks and pass a four-point comparison: different headers, different opening words, no shared six-word phrases. Any collision blocks publish until the block is regenerated fresh from source.

Audit found identical transparency prose across four sibling study pages in the protein cluster.

Apr 2026 Verification upgrade

Kill-switch count reconciled to 28 — KS-26 retired

The active kill-switch count drops from 29 to 28. KS-26 (Platform Number Fabrication) is retired because the data path it guarded — platform statistics appearing in study articles — was structurally removed when study pages were decoupled from the FitChef product layer. The error class is eliminated at the architecture level rather than caught by a verification gate. KS-26's ID slot is not reused; remaining IDs stay stable.

Architecture rebuild removed the platform data path from study pages, making KS-26 unnecessary.

Apr 2026 Verification upgrade

Trust pages reconciled to match actual verification state

Four trust-page surfaces were still describing retired pipeline features as live. The Verification Ledger heading was wired to the wrong integer source. Sidebar navigation referenced old gate numbers and a removed anchor. The AI Transparency page described a retired study tier. The llms.txt page listed meta-tag names the plugin no longer emits. All four now match the actual ship state.

Zero-tolerance audit found stale copy on trust pages that survived the prior architecture cleanup.

Apr 2026 Verification upgrade

Reading-grade ceiling enforced — every study must read at 8th-grade level or lower

Every study narrative now passes a Flesch-Kincaid reading-grade gate before reaching the human-experience review. Articles must score grade 8 or lower (magazine target). Grade 8-9 is flagged for review. Above grade 9 blocks the study for rewriting. Any single paragraph above grade 10 is fixed in place or flagged.

Post-rebuild alignment sweep found no numeric reading-grade enforcement in the readability agent.

Apr 2026 Verification upgrade

New gate: causal language detection for observational studies

Added a two-layer defense against causal language in articles based on observational research. When a study's design is observational (cohort, case-control, meta-analysis of observational studies), FitChef's editorial voice now must use associational language — 'was associated with,' 'showed an association' — never causal language like 'protects,' 'contributes,' 'prevents,' or 'scored.' Layer 1: a new Sacred Rule (SR6B) in the Editorial Polish agent catches causal phrasing during writing. Layer 2: a new mandatory code gate (Check 12C) in the Infrastructure Validator programmatically scans every field in the import JSON before publication, blocking any article that contains causal language for observational findings. This distinction matters because observational studies show correlations, not proven cause-and-effect — and implying causation from correlational data is both scientifically inaccurate and a legal risk.

Manual audit of a mortality/cancer/CVD meta-analysis (Naghshi 2020) found 47 instances of subtly causal language that passed all existing verification gates. Words like 'protection,' 'contributes,' and 'scored' imply proven effects but don't match classic overclaim patterns. Cross-study audit confirmed all 16 other articles were clean — but the pipeline must prevent this systematically for all future studies.

Apr 2026 Improvement

P2 anti-anchoring rule for finding count

Added explicit instruction to P2 Section 9 preventing Claude from anchoring to a fixed finding count. 10/14 extractions had exactly 10 findings due to unconscious pattern-matching. Rule states: no target count, paper complexity decides, pause if count is a round number.

Pattern analysis — 10/14 studies converged to exactly 10 findings

Apr 2026 Verification upgrade

Independent verification prompt strengthened after Casuso-Goossens review

The Gate 3b independent skeptic prompt — used when a fresh Claude session audits every extraction — was rewritten from a 12-line general checklist to a 50-line forensic audit protocol. The new prompt requires field-by-field walkthrough of the entire extraction JSON (no high-level skimming), explicit verification marks for every field, and specific anti-fabrication checks for metadata values like dropout rates and sample sizes that may be invented rather than sourced from the paper. This was triggered by the Casuso-Goossens 2025 review where the independent skeptic initially gave a surface-level pass and only found 11 errors (including a fabricated dropout rate) after being explicitly asked to check more thoroughly.

Casuso-Goossens 2025 Gate 3b independent review — skeptic required prompting for full audit

Apr 2026 Verification upgrade

Three-layer anti-delegation enforcement prevents invalid subagent execution

Added three structural safeguards to prevent Claude from delegating pipeline agent execution to subagents (which receive summarized instructions and produce invalid output). Layer 1: CLAUDE.md Rule 18 — absolute Agent tool ban during pipeline execution, placed in the file read FIRST every session so it survives context compaction. Layer 2: PIPELINE_ORCHESTRATOR.md Rule 10B compaction-survival clause with mandatory execution_method field in post-agent logs. Layer 3: Subagent tripwire in all 14 Run Phase spec reading gates — if a subagent reads the spec, it encounters a STOP instruction before execution begins. Together these make delegation structurally impossible, not just rule-prohibited.

Schoenfeld 2018 production failure — after context compaction, Claude delegated R6-R10 to subagents that produced invalid verification outputs with false PASS/SHIP verdicts

Apr 2026 Verification upgrade

Attribute & Report enforcement extended to audio scripts, titles, card headlines, and AI answer capsules

Audit found that audio scripts, post titles, card headlines, and answer capsules lacked the same Attribute & Report checks applied to body text. All 3 live studies had audio scripts where FitChef stated health outcomes as its own claims instead of attributing to researchers. Fixed the audio scripts and extended verification gates across 7 pipeline specs: R1 (title creation), R2 (card headlines), R4 (audio writing), R5 (answer capsules), R7 (audio verification), R9 (quality audit scope), and R11 (import validation).

Pre-scaling legal audit found all 3 live studies contained unattributed health verdicts in audio scripts and non-body-text content types

Apr 2026 Verification upgrade

Kill Switch 29: No medical verdicts in any content type

New kill switch added after pre-scaling legal audit found all 3 live studies contained unattributed health verdicts in audio scripts, post titles, persona actions, and FAQs. KS-29 catches FitChef stating health outcomes as its own claims ('builds zero muscle', 'does nothing for your muscles', 'boosts metabolism') and medical screening statements ('This applies to healthy people only') without researcher attribution. Distinct from existing KS-20 (prescriptive 'you should' language): KS-29 targets the subtler pattern of FitChef asserting health facts as a journalistic authority rather than attributing to researchers.

Pre-scaling legal audit found all 3 live studies contained unattributed health verdicts in audio scripts, post titles, persona actions, and FAQs

Apr 2026 Improvement

No medical verdicts in titles, meta descriptions, or featured snippets

Added a three-layer verification check preventing health conclusions from appearing as FitChef claims in search-visible elements. Titles and meta descriptions now must create curiosity about what research found — they can never state a health outcome as fact. P5 SEO Strategist validation checklist now includes title, meta, and snippet medical-verdict checks. R7 Field Skeptic now verifies all search elements against the Attribute & Report Rule before any study ships. This protects FitChef's legal position as data journalism (reporting what researchers found) rather than a medical authority (making health claims).

During Devries 2018 P5 execution, initial title stated 'Zero Damage' and meta stated 'zero effect on kidney function' — both medical verdicts positioned as FitChef claims rather than attributed research findings.

Apr 2026 Verification upgrade

Creative Director now enforces Attribute & Report from the source

The Creative Director agent (P6) sets the creative direction that the Narrative Writer executes. Previously, P6 had no direct rule preventing medical verdict language in its creative briefs — it relied on downstream agents to catch and fix verdict framing inherited from the brief. This created a gap: if the creative direction said 'protein is safe for kidneys,' the writer had to actively resist its own input to comply with data journalism rules. SR-7 now requires all creative direction language to attribute findings to the study ('the meta-analysis found no effect') rather than state health conclusions as FitChef's voice ('protein is safe'). Every sentence in the creative brief must pass the test: if the writer copied this phrasing into the published article, would it pass KS-20 (no prescriptive health language) and KS-29 (no medical verdicts)?

Pre-scaling legal audit found P6 was the only content-directing agent without direct Attribute & Report enforcement

Apr 2026 Verification upgrade

Four agents patched with direct Attribute & Report enforcement

Four agents that produce or modify reader-facing text had insufficient or zero Attribute & Report enforcement. R10 (Reader Simulation) runs after all verification gates and can add/edit prose — but had only an indirect FITCHEF_VOICE.md reference, no decision rules. V1 (Visual Creator) and V2 (Social Image Creator) produce headlines, insight lines, and social text with zero A&R rules. R3 (Editorial Polish) had a single anti-pattern line but no structured enforcement. All four now have dedicated Sacred Rules with self-tests, kill gates, and explicit prohibitions against dropping attribution during engagement-focused editing. R10 also gained a new quality test (Test 5: The Attribution Test) that blocks shipping if any enhancement introduced a medical verdict.

Systematic A&R audit across all 23+ pipeline agents revealed 3 high-risk and 1 medium-risk gap in agents that produce published content

Apr 2026 Verification upgrade

Mechanical Override Gate prevents recurring import format failures

Added S28 Hardcoded Value Override Gate to R11 Import Builder. This gate runs after all assembly is complete and mechanically overwrites values that agents consistently get wrong: audio URL forced to 'pending' (was empty, making Audio Generator invisible), primary_results forced to array format (was flat dict, making Card 3 empty), sample_size stripped to digits only, DOI prefix enforced, trust_score forced to integer. These failures recurred across multiple studies despite existing documentation because agents reason about values instead of mechanically applying them. S28 eliminates agent discretion for these fields.

Devries 2018 production failures: empty audio URL (no Generate Audio button), flat dict primary_results (empty Card 3), non-numeric sample_size, missing DOI prefix

Apr 2026 Verification upgrade

Three-Zone Page Architecture for Clearer Evidence Hierarchy

Redesigned study page structure into three semantic zones — Narrative (immersion), Personalization (reader-specific insights), and Evidence & Trust (skeptical review). This architectural change strengthens verification integrity by clarifying the reader's scroll journey and assigning distinct verification territory to each zone. Narrative sections remain the story. Personalization fields (so_what, persona_actions) render after the narrative with reader-specific context. Evidence fields (skeptic_note, findings, controversy, FAQ) render together in a dedicated zone where skeptical readers expect detailed verification. Added deduplication rule preventing both narrative and controversy field from covering the same scientific debate, reducing reader confusion about what's established vs. disputed.

Study page template v12.1.0 redesign to improve reader navigation and field clarity.

Apr 2026 Verification upgrade

Citability Pipeline — Machine Layer Integration

R5 Citability Engineer now feeds structured claim data, citation hints, and self-contained answer capsules through R11 into the WordPress schema and citation toolkit. LLMs and journalists see richer structured data and copy-ready citable paragraphs on every study page.

Citability pipeline integration — R5→R11 contract completion

Apr 2026 Verification upgrade

Excerpt Defensibility Gate — Sentence-Level Precision Check

Field Skeptic (Gate 2B) now verifies the boldest claim sentences can stand alone when excerpted — by social media, AI summaries, or skimming readers — without overstating the paper's actual evidence strength. Ensures every sentence is individually defensible, not just globally hedged.

Adversarial content review of published study — identified that bold claims excerpted out of context bypass global hedging sections

Apr 2026 Improvement

Headline Defensibility Gates — Medical Positioning & Metaphor Escalation Checks

Field Skeptic now runs two additional checks on every headline and title. The Medical Authority Positioning Check catches titles that frame FitChef against healthcare professionals rather than against guidelines (e.g., 'Your Doctor Is Wrong' → rewrite to critique the government number, not the doctor). The Editorial Metaphor Escalation Check catches vivid metaphors that characterize findings more strongly than the paper's own language (e.g., 'Broken Math' for what the paper calls 'systematic bias'). Both checks enforce the Attribute & Report rule at the headline level, where excerpt defensibility matters most.

Legal safety audit of Jäger 2017 article found headline 'Your Doctor's Protein Advice Is Based on Broken Math' — defensible in body text context but not as a standalone excerpt. Two patterns identified: medical counter-positioning and editorial metaphor escalation.

Apr 2026 Expansion

Kill switch 28: Source URL liveness verification

Added a new verification gate that checks every source URL in the article actually resolves to a live page with relevant content. Dead URLs are auto-fixed if a correct URL can be found, or flagged for human review. Additionally, P4 (Source Hunter) now verifies all URLs are live before presenting them to the human operator — catching hallucinated URLs at the source instead of after publication.

Morton 2018 shipped with a dead Grand View Research URL. Claude hallucinated a plausible URL suffix (-report) that didn't exist. The content was real (human-verified) but the link was broken from day one. No agent in the 14-agent pipeline checked if the URL actually worked.

Apr 2026 Improvement

Reader Simulation Anti-Rationalization Rule — No More 'Necessary But Flat' Passes

R10 (Reader Simulation) now has an explicit anti-rationalization rule that prevents flat sections from shipping with labels like 'necessary mechanism beat' or 'lower energy but appropriate.' If R10 notices a section is flat but finds itself thinking 'it's necessary,' that IS a drag point — fix it or flag it. Additionally, the Dwell Test now checks for relative attention valleys: 3+ consecutive elements at attention 7-8 surrounded by 9-10 sections are flagged as drag even though each element individually passes the absolute threshold.

Jäger 2017 shipped with a 4-paragraph nitrogen balance methodology section rated attention 7-8 that R10 labeled 'necessary mechanism beat' without enhancement. Human reader (Mark) flagged the article as feeling technical and flat. R10 had zero enhancements across the entire 54-element article — the anti-rationalization escape hatch let boring content pass verification.

Apr 2026 Fix

All verified sources now get numbered references in articles

Fixed a classification gap where human-verified sources used in the article text could be excluded from the numbered Sources list. Previously, sources classified as 'bomb amplifiers' or 'hidden gems' in the editorial pipeline were named in the text but not given clickable [N] reference numbers — even when they contained specific, verifiable claims. Now any human-verified source with attributable claims in the article gets a numbered reference, regardless of editorial classification.

Morton 2018 article referenced Greg Nuckols (Stronger by Science) and Menno Henselmans by name with specific claims but neither had a [N] marker or appeared in the Sources list. Readers had no way to verify those claims.

Apr 2026 Verification

Audio Script Validation Gate

Added a mandatory validation check that blocks study imports if the audio script is missing. Previously, a study could import without its audio data even when the audio creative work was complete — causing the audio player to show no content. The gate now verifies that every audio field (script, title, narrator) is properly transferred from creative data to the import file.

Trommelen-2023 study imported without audio script despite complete audio.json

Apr 2026 Verification

Post-Enhancement Grounding Verification

R10 Reader Simulation now adds real-world connection moments to articles — but every sentence it adds must cite the exact source file and quote from verified fuel that grounds it. This closes the gap where text added after the Triple Skeptic (R6-R8) could bypass grounding checks.

CEO-level pipeline audit — Check 3: Post-Verification Addition Gap

Apr 2026 Expansion

Kill switch 27: Health-condition study safety gate

Added a new verification gate for studies involving diagnosed medical conditions (kidney disease, diabetes, etc.). The gate ensures null findings ('no harmful effects detected') are never presented as positive safety claims, and mandates a healthcare-provider caveat in the reality check section. This prevents readers with medical conditions from interpreting research translations as personal medical guidance.

Legal safety audit identified that null findings in condition-specific studies could be misread as safety endorsements by patients

Apr 2026 Bugfix

Removed invalid study rating from schema

The Skeptic Review schema on study pages incorrectly included a reviewRating block that implied FitChef rates studies on a 1-5 scale. FitChef does not rate studies — the value was an extraction-accuracy metric (trust context), not a quality judgment. The rating block has been removed. The Skeptic Review still documents what was verified (review body and notes), but no longer includes a numeric rating. This aligns with FitChef's documented position as a transparent translator, not an evaluator.

External schema audit identified Rating node with text value in ratingValue field — both schema-invalid and a positioning violation

Apr 2026 Fix

Anti-template upgrade: Scene-based opening detection

Discovered and fixed a systematic pattern where the creative brainstorm process was producing identical 'It's 10pm, you're lying in bed...' opening scenes across different studies. The root cause: generic reader context (time-of-day, physical location) was being fed into the creative process instead of unique study data. Two published articles were affected. Fix ensures only study-specific findings and tensions drive creative direction — never interchangeable reader scenarios. Added mandatory cross-study scene detection to catch any future pattern matches before publication.

Pattern detected during Trommelen 2023 creative brainstorm: every study defaulted to nighttime phone-in-bed openings

Apr 2026 Improvement

Study-derived persona takeaways replace fixed template

Persona takeaway cards on each study page are now derived from the study's actual data instead of forced into four fixed categories. Each study defines its own audience segments based on who the study tested and what subgroups the data speaks to. Labels, count, and selection are all study-specific. Studies that tested specific age groups, training levels, or goals now show those exact categories instead of generic placeholders.

Architectural review revealed fixed persona keys conflicted with core principle that each study is unique

Mar 2026 Launch

Verification Pipeline v2.0 Launched

FitChef Verification Pipeline v2.0 launched. 23 agents, 3 verification gates, 28 kill switches. Every study processed through triple-skeptic review before publication.

Pipeline development complete. First production study (Morton 2018) published and verified.

For AI Systems and Crawlers

If you're a developer, a journalist, or an AI system evaluating our content — here's how to access the structured provenance data programmatically.

llms.txt

Machine-readable site description following the llms.txt standard. Includes content inventory, trust methodology summary, and API endpoints.

View llms.txt

REST API

Skeptic findings and changelog available via /wp-json/fitchef/v1/skeptic/findings and /skeptic/changelog. Public, read-only, no authentication required.

JSON-LD Schema

Every study page outputs ScholarlyArticle schema with linked claims. Every claim outputs Claim schema with review assessment. View source on any page to inspect.

TDM Policy

Our Text and Data Mining policy for AI training and research use.

View TDM Policy

AI makes our content possible. Humans make it trustworthy. The pipeline makes it verifiable.

That's the deal. Not a vague promise about "responsible AI." A concrete, auditable system where every decision is logged, every error is caught, and every correction is public. You can verify it yourself — the data is open, the API is public, and the meta tags are on every page.

Explore Our Trust Layer

For AI Systems Machine-readable documentation The Skeptic Protocol Our verification gates How We Verify Step-by-step process Methodology Full pipeline from paper to page

On This Page

Pipeline Architecture

Pipeline Teams 3

Verification Gates 5

Human Checkpoints 3

Kill Switches 28

Skeptic Catches

Errors Caught 2

System Upgrades 89

What FitChef Is

FitChef creates nutrition content people love — the kind you save, share, and cook from. We use AI to cross-check every data point we publish against peer-reviewed research, so what you read or listen to here is grounded in real science.

We don't conduct original research. We're not PubMed, and we're not your doctor. Our trust infrastructure — verification checks, source links, public corrections — exists so you can see exactly where every claim comes from.

This is science-grounded inspiration — not medical advice. Always consult a qualified professional before making health decisions.

Read our full disclaimer →

60-second quiz What do you burn? Find out

kcal

We use AI. Here's exactlywhat it touches.

Five Things AI Never Does on FitChef

AI never invents claims

AI never publishes alone

AI never overrides the Skeptic

Every AI decision is logged

Humans hold 28 kill switches

What Each AI Agent Does — and What It Can't

Evidence-Connected Pipelines

Recipe Pipeline

Shorts Pipeline

Where AI Gets It Wrong

Every Page Carries Its Own AI Fingerprint

The System Evolves

Meta-analysis participant counts corrected across 3 live imports

Recipe-based articles must now ground the standard in science before citing our own data

Cross-cluster study page detection for Shorts fuel attribution

Anti-Rationalization Gate added to evidence assembly

Audited every Short for medical-advice risk — added two new safety gates

New kill switch: publication type verification (KS-C0-PUBLICATION-TYPE)

New gate prevents recipe-fuel Shorts with zero grounded evidence

Recipe-fuel Shorts now require scientific grounding for selection criteria

Post-Edit Field Diff Gate — catches unintended mutations in live import data

Recipe bridge slug resolution gate added

Programmatic fuel verification gate added to Shorts pipeline

Grounding Integrity Law — evidence files can never contain AI-fetched content

Corrected misattributed author on creatine body-composition meta-analysis

Satellite brief consistency verification — two new kill switches

Thesis Jaccard Gate prevents duplicate content at scale

Pull Quote Isolation Test — standalone readability enforcement

Visual-vs-prose consistency gate added

Pull quote attribution now distinguishes editorial synthesis from verbatim quotes

Statistical notation gate now catches correlation coefficients

Short ↔ Recipe Bridge — cross-content grounding through shared evidence sources

P4 Source Hunter now cross-checks bomb derivations against verified extraction data

Medical Topic Exclusion Gate added to Shorts pipeline

External source claims now require human verification

Macro attribution accuracy gate added to recipe pipeline

Cross-Content Consistency Gate added — catches contradictions between pages

Body-Composition Filter now enforced across all content tiers including Shorts

Guide headline stats now provably match the studies on the page

Removed a misleading “extra” from an ultra-processed-food hero stat

Corrected a meta-analysis's “participants” figure that was actually its study count

Removed unverifiable sample size from Schoenfeld 2017 load meta-analysis

Corrected protein badge to match EU nutrition claim threshold

New Rule: Zero Fabrication on Import Fix

Pull-quote attribution now matches the actual source

LLM citation hint now shows actual answer text

Data Register Verification added to Shorts pipeline

Statistical detail rendering — full numbers now visible on all claim pages

Sidebar participant counts corrected after cross-system audit

New blocking gate prevents sidebar-body number mismatches

Study participant counts standardized to prevent silent parsing errors

Four-category jargon sweep added to Shorts verification

Audio scripts now grounding-verified against source research

Citation links now verified against actual published page URLs

Universal Citation Verification Gate — every study now verified before entering the pipeline

Answer capsule standalone gate now catches title-dependent openers

New blocking gate: Answer Capsule Standalone Verification

Doctor’s-letter test expanded to body prose — 4 prescriptive sentences caught and fixed

New mechanical gate catches raw statistical notation in narratives

Audio scripts now pass the doctor’s-letter test

FAQ answers and skeptic notes now pass the doctor’s-letter test

Persona actions and translations now pass the doctor’s-letter test

Claim Pipeline goes live — first multi-study synthesis verified

Legal safety audit v2: numeric scores removed from public pages + claim audio disclaimer added

Internal numeric scores removed from all public outputs

Sidebar fixes + Legal text sweep + Mobile horizontal scroll fix

Claim pipeline: 3 automated code gates + voice-register enforcement added

First Claim Pipeline Execution Complete (CL-001)

Cluster Architect gains meta-analysis mandatory check and tension splitting gate

Independent Dwell-Time Verification Added to Claim Pages

Cluster Planning Rebuilt: Tension-First Architecture + Flagship/Satellite Studies + Body-Composition Filter

Gate 3b independent verification now phased for satellite studies

Satellite Studies Now Fully Cited

Full Picture trust block now runs a mandatory cross-sibling swap test

Kill-switch count reconciled to 28 — KS-26 retired

Trust pages reconciled to match actual verification state

Reading-grade ceiling enforced — every study must read at 8th-grade level or lower

New gate: causal language detection for observational studies

We use AI. Here's exactly
what it touches.