Methodology & Rights¶
A technical account of how this index was built, what safeguards protect its accuracy, and how much confidence each claim warrants. A plain-language summary is on the How this was made page.
Unofficial study aid. This site is a personal, non-commercial research index. It is not affiliated with or endorsed by Ligonier Ministries or St. Andrew's Chapel, and it is not a substitute for R.C. Sproul's primary works.
1. Purpose and scope¶
This project indexes positions — discrete theological claims — that R.C. Sproul (1939–2017) stated in his publicly available recorded and written teaching, and weights each by how strongly the available sources attest it. The goal is a navigable, source-linked map of what he taught and how consistently, not a republication of his work and not an interpretation of it.
It is deliberately not exhaustive: it covers freely-available web text from Ligonier Ministries plus a small set of public video, and excludes his books (see §7).
2. Sources and corpus¶
818 source documents across four independent channels:
| Channel | Count | Acquisition | Fidelity |
|---|---|---|---|
| Sermons (St. Andrew's Chapel) | 281 | Official Ligonier transcripts | Human-edited |
| Articles (incl. Tabletalk) | 422 | Official Ligonier transcripts | Human-edited |
| Q&A answers | 105 | Official Ligonier transcripts | Human-edited |
| Video | 10 | YouTube captions (ASR) | Machine, lower fidelity |
The 808 Ligonier items were enumerated from learn.ligonier.org XML sitemaps
(permitted by robots.txt; /api and /auth were never accessed) and filtered
to Sproul by author/teacher metadata. Sermons, articles, and Q&As were retained
only where the page identified R.C. Sproul as author/primary teacher. From these,
~10,356 quote-grounded claims were extracted and reduced to ~8,700 canonical
positions.
3. Pipeline (reproducible)¶
All code is in the repository; the corpus regenerates from ./run.sh.
ingest → extract → weight (embed → topic → cluster → corroborate) → render
- Ingest each channel to local transcripts (raw transcripts are not republished — see §8).
- Extract claims with a locally-run LLM (
gemma4:e4bvia Ollama), constrained to emit, for every claim, a verbatim quote plus a one-sentence neutral summary and a topic label. - Weight: embed every claim (
nomic-embed-text); assign each to one of 33 canonical theological topics by nearest-anchor similarity; cluster semantically within a topic so one cluster = one canonical position; then apply the corroboration gate (§4). - Render the published positions to this static site.
4. Validity safeguards¶
- Verbatim-quote grounding (code-enforced). The model's quote is re-checked in code as a literal substring of the transcript; ungrounded quotes are dropped, not trusted. No position exists without a real quote.
- Source-trust tiers. Official human-edited transcripts (sermons, articles, Q&As) = trust 1.0; whisper transcription = 0.8; YouTube auto-captions = 0.5.
- Independent-corroboration gate. A position publishes only if it is attested by ≥2 distinct sources, or by a single trust-1.0 (official transcript) source. A position resting on a single low-trust source is quarantined to a review queue and never presented as settled.
- Why this matters: transcription errors self-eliminate. A real example — an auto-caption mis-rendered "amillennial" as "Arminian"; because no other source corroborated the garbled claim, it was quarantined automatically while the correct statement (from an official transcript) was published.
- Contradiction audit. The corroborated core was searched for internal
contradictions (same-topic positions about a near-identical point) and
adjudicated by hand; none were found (see
contradiction-audit.md). - Human review gate. Nothing is presented as settled without passing the above; quarantined and flagged items are held back transparently.
5. Reading the confidence signal¶
Each position shows a corroboration count. Interpret it as evidentiary weight:
- Well-attested (≥2 independent sources, often across genres — a sermon and an article and a Q&A) — high confidence this is a settled, repeated emphasis of his.
- Single official source — confidently his (verbatim, from a human-edited transcript), but stated in one place in this corpus.
We deliberately do not use the language model's self-reported confidence: it was found to be uncalibrated (near-1.0 on virtually everything). Corroboration and source trust are the only weighting signals.
6. Limitations (read these)¶
- Summaries are machine-paraphrased; quotes are authoritative. The bold one-line summary on each position is an LLM's neutral restatement and may compress or slightly skew nuance. The verbatim quote and its source link are the ground truth — always defer to them.
- Topic assignment and clustering are approximate. A position may sit under a topic you'd file differently, and distinct points are occasionally not merged (or vice-versa).
- "A view he describes" vs. "his own view." Sproul often explains positions he rejects (e.g., Roman Catholic justification, semi-Pelagianism). These are retained with their subject named ("Rome teaches…"), but a careless reader could misread an exposition as an endorsement. Stance is not yet machine-tagged.
- Coverage is partial. Books are excluded; video transcription (10 items) is machine-generated and lower-fidelity; the corpus is a large sample, not the whole of his output.
- Not peer-reviewed and not official.
7. Out of scope¶
Sproul's books are not ingested. They are published variously by Reformation
Trust (Ligonier) and other houses, carry distinct rights, and are not freely
crawlable. Daily devotionals, conference event pages, and teaching-series
landing pages were evaluated and excluded — devotionals are predominantly other
authors', and conference/series pages are aggregators whose underlying messages
are already captured as sermons or sit behind /api.
8. Rights and attribution¶
All indexed content was authored by R.C. Sproul. The sermons, articles, and
Q&A answers are published by Ligonier Ministries (learn.ligonier.org,
publisher metadata confirmed) and are governed by
Ligonier's Copyright Policy;
Ligonier holds or administers their copyright. The 10 video items were
transcribed from third-party re-uploads on YouTube; the original recordings
remain the property of their respective copyright holders (in most cases Ligonier
Ministries), and the re-uploading channel is not a rights holder. No books
are included.
This site reproduces only brief quotations for study and indexing, claims no ownership, stores no full transcripts publicly, and links every quotation to its source so readers can consult the original. If you are a rights holder and have concerns, the project will respond promptly.