@theartharrisonArt Harrisonpublic

through Jul 27, 2026

The dataThrough Jul 27, 2026

Take the data home.

The SQLite file behind this site is yours to download. Open it in any SQL client, browse it in the in-browser Playground, or hand it to an AI with the prompts further down the page. Daily reads, video states, traffic patterns, and every raw row are inside.

This database was last built on Jul 30, 2026 at 8:13 AM UTC.

v74

schema revision

Changelog ↓

The dataset

Download the .db

SQLite, with titles + ids — everything the site shows. Ships gzipped (one gunzip away from a regular .db). Open it in any SQL client — DB Browser for SQLite, DBeaver, or the sqlite3 CLI. The AI prompts further down the page are an alternative path, not the only one.

29,266

rows across 88 tables & views

Download .db.gz Download .csv

Refreshed: Daily

Largest tables, by row count

app_pool_cell_stat5,927
app_wisdom_verdict_daily2,850
app_insight_snapshot2,624
source_age_pattern_daily2,018
traffic_daily1,557

First step from here

Open it however you read SQL.

The file works in any SQLite client. Browse the schema in the in-browser Playground, or upload it to an AI model that accepts file attachments — there is a primed prompt further down the page.

Open it locally. Any SQLite client reads it — DB Browser for SQLite, DBeaver, or the sqlite3 CLI.
Browse it in the browser. The SQL Playground runs queries against the same file without any install.
Hand it to an AI. Upload the .db to Claude, ChatGPT, or Gemini Advanced and start with the primed first prompt lower on this page.

Read today's dashboard → Read the chronology →

Two DBs, two jobs

Analyst-mode .db

Same channel, same dates, same raw numbers as the file above. What is stripped: timeline headlines, dashboard verdicts, wisdom labels, and per-row interpretive columns (lifecycle tags, the click-vs-watch grouping, change verdicts). What stays: every raw / summary / cohort / event table, plus all forecast tables and the calibration log. The point is that an LLM loading this DB has no stored conclusions to parrot — it reasons from the numbers.

Download analyst-mode .db.gz

theartharrison-stats.db

The dashboard DB. Carries the same data plus the prose surfaces the site renders — timeline headlines, dashboard briefings, wisdom verdicts, lifecycle labels. Use this one when you want a fast read along with the site’s own determinations.

theartharrison-analysis.db

The analyst-mode DB. Same raw + forecast + calibration tables. Narrative prose, verdict labels, and per-row interpretive columns are gone. Use this one when you want an LLM to do its own reasoning rather than relay the dashboard’s.

Analyst prompt

Paste alongside your upload of theartharrison-analysis.db

Independent analyst over the analysis DB

Frames the chat as an analyst reasoning from the data. Sets the voice rules — hedged forecasts, calibration honesty, observation over advice — that the dashboard itself follows.

Show the full prompt

How the data was made

Methodology and the honest conditions under which every chart was drawn — reporting lag, stub rows, sample confidence, unattributed impressions, snapshot reconciliation. Everything that orients the file before you trust a number in it.

Analyst-ready layer

The database carries the first read.

These tables turn the raw YouTube export into the same daily observations the site shows: what changed, what carried the channel, and what is still too small to read.

channel_signal_daily237 rows

The behind-the-scenes comparisons that decide what is worth surfacing.

app_dashboard_briefing_daily202 rows

The short daily read shown on the dashboard and historical snapshots.

video_current_state957 rows

Per-video state, one row per video per tracked day: launching, quiet, late pickup, current driver, or steady tail.

source_age_pattern_daily2018 rows

How each traffic source behaves at launch, in week two, and later in the catalog.

Honesty

What this dataset doesn’t say

A page in an atlas marked “survey conducted under ice cover” is not weakened by the footnote — it’s strengthened. The columns below name the conditions under which every other chart in this dashboard was drawn.

Reporting lag

YouTube reports two to three days behind.

Data flows reliably through 2026-07-27. The dashboard never charts past that date — it would be zeros dressed as truth. Reporting time zone is America/Los_Angeles; a “day” here is a calendar day in PT, not your local time.

Impressions window

Impressions cover days from Jun 1, 2026 forward.

Views and watch time cover this channel’s full history. Impressions and click-through come from YouTube’s reports, which reach back about thirty days from when a channel connects — on this channel they cover days from 2026-06-01 forward. Earlier days show views without impressions; those impressions aren’t available via the APIs.

Stub rows

22 videos have pre-publish stub rows.

When a video is scheduled, YouTube sometimes records reporting rows for the day before it published — usually a row of zero views and null impressions. Every chart and aggregate filters these out (the pre_publish_stub = 0 guard in every summary CTE). These stubs stay in the database for provenance — they’re how we know YouTube did this on this channel.

Per-video pre-publish stub counts (top 5)
When Motivation Dies, Use This Instead	1 stub day
The Worst Advice I've Ever Given (That Actually Works)	1 stub day
You Don't Become the Person You Thought You'd Be \| Advice Before You Succeed	1 stub day
The deafening silence of trying to start something	1 stub day
Your Best Idea Will Feel Illegal (Mine Was a Felony)	1 stub day

Click-rate confidence

Where the click rate isn't yet trustworthy, the dashboard says so.

The database tags every click rate with a confidence tier from its impression sample. Fewer than 50 impressions reads as noise; 50 to 249 as low; 250 to 999 as medium; 1,000 or more as high. The four-segment dotted glyph after every click-rate cell encodes which tier — fewer dots means a thinner sample.

noise 1
low 1
medium 15
high 9

Watch-time confidence

The same tiering applies to average watch time.

Average view duration carries a confidence tier from its view sample. Below 10 views reads as noise; 10 to 29 as low; 30 to 99 as medium; 100 or more as high. The same four-segment glyph after every average-view-duration cell.

noise 6
low 10
medium 9
high 1

Source gaps

Almost every impression is attributed to a source.

YouTube’s per-source impression rows don’t always sum to the per-video impression total. The gap is real — some impressions come from sources YouTube doesn’t expose. We surface the gap rather than redistribute it.

0 unattributed of 36,984 total impressions.

Snapshot drift

Channel-level views match the sum of per-video views.

The channel snapshot (summary_channel_snapshot_daily in the database) sometimes shows totals slightly higher than the sum of per-video reporting rows. This is YouTube’s own attribution gap; we chart it instead of hiding it.

How is this computed?

The timezone YouTube reporting uses for per-day rollups.: All dates in the public DB are calendar days in YouTube's reporting timezone (America/Los_Angeles, PT), not the visitor's local timezone. A "day" here is a calendar day in PT — a video published at 22:00 PT on Apr 27 will accumulate that day's reporting under the date 2026-04-27, even for viewers in UTC+12.
Flag (0/1) on every per-video reporting row indicating whether the row sits before the video's published_at.: Set to 1 when the row's date is earlier than SUBSTR(published_at, 1, 10). Every summary_* CTE that aggregates over per-video reporting must include WHERE pre_publish_stub = 0 so the seven-row stub artifact (NULL impressions, 0 views) doesn't poison aggregations. The flag itself stays in the DB for downstream provenance.
Sample-size confidence tier for the row's click-through rate; one of noise / low / medium / high.: Derived from the row's impressions sample size. Noise tier when impressions < 50; low when < 250; medium when < 1000; high otherwise. The thresholds are calibrated so that a noise-tier CTR is statistically meaningless (one or two clicks against a tiny denominator).
Sample-size confidence tier for the row's average view duration; one of noise / low / medium / high.: Derived from the row's view sample size — the denominator AVD averages over. Noise tier when views < 10 (one or two viewers' watch time, statistically meaningless); low when views < 30; medium when views < 100; high otherwise. Mirrors the V10 ctr_confidence pattern but keyed on views instead of impressions.
Per-date drift between the channel-level views snapshot and the sum of per-video views.: MAX(0, summary_channel_snapshot_daily.total_views - SUM(video_daily.views WHERE pre_publish_stub = 0)) per date. Reads >0 when the channel-snapshot total exceeds the sum of attributed per-video reporting (e.g. deleted videos, late attribution). Floored at zero so reporting overcounts (per-video sum exceeds channel snapshot — also possible during transient lag) don't render as negative drift.

Channel state

Two trailing-window reads on the shape of the channel — its publishing cadence and how evenly views and subscribers are spread across the catalog. Both shapes are common at different stages; neither is the target.

Publishing cadence

The trailing 28-day publish strip plus the detected pattern.

Pattern: Three times weekly (86% conformance over the last 28 days)

View distribution

How unevenly views and subscribers are distributed across the catalog over time. A higher Gini means a few videos carry most of the recent views or subscribers; a lower Gini means the distribution is more even. Neither shape is the target — both are common at different stages.

Common advice, checked here

50 common creator beliefs, scored against this channel's tracked data. See the Learnings page for the full panel — one example channel's record, with expandable evidence.

Agree: 4
Disagree: 9
Still thin: 16
Not yet testable: 21

Ask it yourself

Starter prompts for the AI-path read of the database — optional, not required. Each one names the tables it expects to find, so you can open them in any SQL client instead. The schema reference and changelog sit at the foot.

Start here

Default first question

Start with the daily read

Ask what changed, what carried the channel, and what is still too thin to read.

Show the full prompt

Diagnose growth

Is the channel growing, flat, or declining?

View velocity, spikes, and drops across the channel over time.

Show the full prompt

Is day-1 performance a predictor of lifetime views?

Whether early signals reliably forecast long-term outcomes.

Show the full prompt

Understand the audience

Where are viewers actually finding these videos?

Browse-driven vs Suggested-driven vs Search — by video.

Show the full prompt

How does CTR behave over a video's lifetime?

Does CTR stay stable, or does it drop as YouTube expands the audience?

Show the full prompt

Inspect individual videos

Which videos have potential that hasn't been discovered yet?

Videos with above-average CTR but below-average views.

Show the full prompt

Which videos keep getting views — and which flashed and died?

Flash pattern vs evergreen across the catalog.

Show the full prompt

Which videos convert impressions into watch time most efficiently?

How much viewing time each thumbnail impression generates.

Show the full prompt

Which videos turn viewers into subscribers?

Subscriber conversion rate across the catalog.

Show the full prompt

Synthesis reads

Do videos do better depending on the week they launched?

Launch-week cohorts vs the channel's own baseline — a read Studio doesn't group.

Show the full prompt

How does where views come from change as a video ages?

Source mix normalized by video age across the whole catalog.

Show the full prompt

Have this channel's own forecasts actually held up?

Calibration record by forecast type and horizon, plus tested creator beliefs.

Show the full prompt

How concentrated has the catalog been over time?

Busiest-video share, quiet-catalog share, and source spread, day by day.

Show the full prompt

Go deep

Complete channel audit

Comprehensive read across every table — what's moving, what's stable, what's too thin.

Show the full prompt

How uneven is this channel, really?

One-channel mirror: which videos carried, how long quiet uploads lasted, and whether later videos got picked up.

Show the full prompt

Sample SQL queries (for SQL writers)Open SQL Playground →

Which AI should I use?

These prompts work best with Claude, ChatGPT Plus, or Gemini Advanced— they need file upload and long-context reasoning. Free-tier models will often fail silently on the bigger queries (like "Complete channel audit"). If you have access to Claude Projects or custom GPTs, upload the .db once and keep asking questions.

Where each metric comes from

Subscriber columns come from two independent YouTube APIs. subs_total_eod is a daily snapshot from the Data API. subs_gained and subs_lost are deltas from the Analytics API. Drift between them is expected — they sample at different times of day and use different rounding rules. Each column is authoritative for its own purpose.

Field	API source
views, watch_time_minutes, avg_view_duration_sec, avg_view_pct, subs_gained, subs_lost, likes, comments, shares	YouTube Reporting API + Analytics API (reconciled)
impressions, ctr	YouTube Reporting API (reach reports)
engaged_views, dislikes	YouTube Analytics API only
Country, device_type, subscribed_status dimensions	YouTube Reporting API

Inspection recipes

Each section below names the source tables and starter SQL for one route family. Provenance popovers throughout the dashboard link here. Open the public .db file in any SQLite client to run these directly.

Homepage — hero numeral and channel-state strip

Appears on: Dashboard

Tables: summary_channel, summary_channel_metrics_daily, event_channel_state_change
Columns: current_subscribers, total_views, total_days_tracked, total_impressions, weighted_ctr, subs_net, surface_state, reason_text
Window: Latest snapshot row per table; daily metrics anchored to the reporting cutoff date.
Sample: One channel row and one daily row per render.

Starter SQL

Show the SQL

Traffic — tenacity stats and source totals

Appears on: Traffic · Traffic source detail

Tables: summary_traffic_source_daily, summary_video_traffic, summary_channel_traffic
Columns: traffic_source_id, total_views, total_watch_minutes, weighted_ctr
Window: All tracked days up to the reporting cutoff for active_days; lifetime totals from summary_channel_traffic.
Sample: Active-day count is over rows where total_views > 0 in summary_traffic_source_daily; distinct-video count is from summary_video_traffic (one row per video, source).

Starter SQL

Show the SQL

Compare videos — summary cards

Appears on: Compare

Tables: summary_video
Columns: video_id, total_views, weighted_ctr, ctr_confidence, total_impressions
Window: Lifetime per video.
Sample: ctr_confidence tier reflects the total impression sample that feeds weighted_ctr.

Starter SQL

Show the SQL

Changes — title and thumbnail era comparisons

Appears on: Changes

Tables: summary_era
Columns: video_id, field, era_index, weighted_ctr, total_impressions
Window: All era pairs across all public videos.
Sample: Per-era CTR comes from impressions for that era only; pp_change is the delta to the next era for the same field on the same video.

Starter SQL

Show the SQL

Wisdom — belief verdicts

Appears on: Data

Tables: app_wisdom_canon, app_wisdom_test
Columns: test_id, belief_text, status, evidence_json, sample_size, confidence
Window: Per belief: latest snapshot row in app_wisdom_test.
Sample: Each verdict is one row per test_id at the latest snapshot_date.

Starter SQL

Show the SQL

Have a YouTube belief you want tested? or email it directly. Useful submissions get added to the canon — credit appears in the card's source line.

Schema changelog (v5 – v74)

Everything the site shows is in the file — real titles, real IDs, every daily row.

What's in the .db

Tables your AI (or SQL client) can query directly. Many carry a snapshot_date — a full set of rows per tracked day — so a one-per-entity table can still show many rows.

summary_channel1 row
One row per snapshot_date: channel-level totals, averages, imp_to_view_ratio_median, median_video_ctr and median_video_retention.
summary_video26 rows
Per-video lifetime stats: engagement_rate, subs_per_1k_views, watch_min_per_impression, day1/day3/day7 cohort columns, peak_views_day, late_growth_pct, imp_to_view_ratio, avd_drift_sec, source_hhi, ctr_quadrant, primary_traffic_source, days_to_first_*.
video_daily979 rows
Daily metrics per video with days_since_publish for cross-video comparison. Carries pre_publish_stub and ctr_confidence.
traffic_daily1,557 rows
Per video/source/day views, watch time, avg view duration, avg view pct, impressions, and CTR. Carries pre_publish_stub and ctr_confidence.
summary_video_traffic160 rows
Lifetime traffic source breakdown per video, including impression-weighted CTR.
summary_channel_traffic10 rows
Per traffic source: total views, impressions, weighted CTR, % of channel, watch time per view.
summary_channel_metrics_daily57 rows
Daily channel totals plus rolling 7d/28d, source_diversity_score, top_video_share_7d, top3_video_share_7d, top_video_id_7d, videos_with_zero_views_7d, quiet_inventory_share.
summary_channel_source_share_daily432 rows
Long-format trailing-7d source share per (date, traffic_source_id). Drives source-diversity and stacked-area visualisations.
summary_traffic_source_daily287 rows
Per source per day: views, impressions, weighted CTR, active video count.
summary_traffic_top_videos_by_source140 rows
Top 20 videos per traffic source.
summary_era61 rows
Performance during each title/thumbnail version.
summary_channel_snapshot_daily56 rows
Daily subscriber/view snapshots from YouTube Data API.
channel_timeline_event515 rows
Auto-generated growth story events: publishes, milestones, discovery shifts.
data_status5 rows
What data is available from what date (helps interpret reporting lag).
cohort_video_traffic_age_window480 rows
Per video/source/age-window acquisition quality: impressions, views, weighted CTR, AVD/AVP, and within-window share.
cohort_video_geography_age_window268 rows
Per video/country/age-window geography with stored geo_bucket = core | expansion | other.
cohort_video_launch_age_window132 rows
Per video/age-window early-life rollup of views, impressions, weighted CTR, AVD/AVP, watch time, subs, and engaged views.
summary_channel_catalog_pressure_daily57 rows
Per-date catalog pressure and trailing-7d impression concentration.
summary_channel_source_geo_daily506 rows
Per-(date, source, country) acquisition geography with stored geo_bucket.
app_wisdom_verdict_daily2,850 rows
V50 — Per-(date, test_id) historical wisdom verdict. Stores deterministic verdict status, evidence_json, confidence, sample_size, prior_status, and status_changed_today for every canon belief on every historical date.
app_wisdom_canon50 rows
The catalog of common creator beliefs under test: test_id, belief_text, source_attribution, category, and the evidence shape each belief expects.
app_wisdom_test50 rows
Latest verdict per belief (joins app_wisdom_canon on test_id): status, evidence_json, confidence, sample_size, and prior_status.

Hidden data ledger

What is not in the public database and why each absence persists.

Structurally absent

Individual comments
The text of viewer comments isn't in the data. Counts are recorded; the text itself never ships in the public dataset.

Size-gated by YouTube

Viewer demographics
Aggregate country, age, and gender breakdowns are empty. YouTube only returns them once audience size clears a minimum threshold; this channel is below that today.

Partner Program only

Revenue and monetization
Revenue, CPM, and ad-breakdown aren't in the data. This channel isn't yet in the YouTube Partner Program, and those figures only arrive for channels that are.

Pipeline not yet pulling it

Second-by-second retention
Per-second audience-retention curves aren't in this dataset. Only average retention percentage per video-day is stored; the pipeline doesn't fetch the full curve yet.
Hour-of-day patterns
Hour-of-day granularity isn't in the schema. Only daily aggregates are tracked; YouTube exposes hourly separately, and the pipeline doesn't pull it.
Search terms
The words viewers typed to find a video aren't in this dataset. YouTube exposes them through a Reporting API report the pipeline hasn't added.
External referrer URLs
The specific URLs that sent External traffic aren't surfaced — only the aggregate “External” category. The pipeline doesn't yet pull the per-referrer breakdown.