Content Pipeline
Inputs
_data/interviews.yml_data/video_assets.yml_data/interview_conferences.yml_data/interview_communities.yml_data/resources.yml_data/transcripts/*.yml
Generation Steps
- link alignment:
bin/sync_interview_asset_links.rb - summary derivation:
bin/generate_context_summaries.rb - page generation:
bin/generate_interview_pages.rbbin/generate_video_asset_pages.rbbin/generate_interview_taxonomy_pages.rb
Template Sources
- Generated taxonomy pages now render through:
_templates/generated/interview-taxonomy-index.erb_templates/generated/interview-taxonomy-detail.erb
Validation
./bin/pipeline validate runs:
- data uniqueness/integrity checks
- transcript integrity checks (
bin/audit_transcripts.rb) - resources registry validation
- last-modified parity checks
- SEO/canonical and semantic/schema validators
- metadata reporting + HTMLProofer
Transcript Import Workflow
Use bin/import_transcripts_from_outbox.rb to onboard transcript files from an external directory.
- Run a dry run to generate mapping reports:
ruby ./bin/import_transcripts_from_outbox.rb --source-dir /Volumes/Dock_1TB/vimeo/outbox
- Review:
tmp/transcript-import-report.jsontmp/transcript-import-report.md
- Apply high-confidence mappings:
ruby ./bin/import_transcripts_from_outbox.rb --source-dir /Volumes/Dock_1TB/vimeo/outbox --apply
- Re-run validation:
./bin/pipeline validate
Shortcut wrapper:
./bin/transcripts dry-run --source-dir /Volumes/Dock_1TB/vimeo/outbox./bin/transcripts ingest --source-dir /Volumes/Dock_1TB/vimeo/outbox --auto-commit