NotebookLM Studio
Import sources (URLs, YouTube, files, text) into Google NotebookLM and generate user-selected artifacts: podcast, video, report, quiz, flashcards, mind map,...
Description
name: notebooklm-studio description: > Import sources (URLs, YouTube, files, text) into Google NotebookLM and generate user-selected artifacts: podcast, video, report, quiz, flashcards, mind map, slides, infographic, data table. Use when the user sends content and asks to generate learning materials, podcasts, videos, or study packages. version: 2.1.2 metadata: openclaw: requires: bins: [notebooklm, ffmpeg] emoji: "🎙️"
NotebookLM Studio
Import sources into NotebookLM, generate user-selected artifacts via CLI, download results locally.
Inputs
Collect from user message (ask only for missing fields):
- Sources: URLs, YouTube links, text notes, or file attachments (PDF, Word, audio, image, Google Drive link)
- Artifacts: User selects from 9 types (no default — always ask):
audio(podcast),video,report,quiz,flashcards,mind-map,slide-deck,infographic,data-table
- Language (optional, default:
zh_Hant): applied vianotebooklm language set- ⚠️ This is a GLOBAL setting — affects all notebooks in the account
- Artifact options (discussed in step 1b): format, style, length, difficulty, etc.
See
references/artifact-options.mdfor all options per artifact type. - Custom instructions (optional): passed as description to generate commands
- Telegram target (optional, OpenClaw only): chat_id for delivery
See references/source-types.md for source type detection rules.
See references/artifacts.md for all 9 artifact types and CLI options.
Workflow
Steps are sequential gates — do NOT skip or combine steps. Each numbered step must complete before the next begins. In particular:
- Step 0 (auth precheck) must run and pass before any other CLI command.
- Step 1b (options discussion) must get user confirmation before generation. Do not assume defaults unless the user explicitly says "use defaults."
-
Auth precheck — Verify the session is valid before doing any work:
notebooklm auth check --test --json"status": "ok"→ proceed to step 1."status": "error"→ stop immediately. Tell the user:NotebookLM 登入已過期,請先重新登入(
notebooklm login),完成後告訴我,我再繼續。- Command itself fails (network error, CLI not found, etc.) → also stop and report the error.
--testis required — without it, only local checks run, which can pass even with an expired session.- When the user confirms re-login, re-run this check before continuing.
-
Parse input & configure artifacts —
1a. Select artifacts — Detect source types from user message (URLs, files, text). Confirm which artifacts to generate.
1b. Discuss options — Before generating, confirm key options for each selected artifact. Refer to
references/artifact-options.mdfor priority levels:- ASK options: must ask the user
- OFFER options: state the default, let user decide whether to change
- SILENT options: use defaults without asking
- Options already specified by the user → skip
- Present all questions in a single message (batch, not one-by-one)
If user says "use defaults" → skip all questions, proceed with default values immediately.
Example agent message (audio + video + report + quiz + flashcards + slides + infographic selected):
Before generating, a few options to confirm:
- Podcast: deep-dive / brief / critique / debate?
- Video: explainer / brief / cinematic? (cinematic uses Veo 3, takes 30-40 min)
- Report: briefing-doc / study-guide / blog-post / custom?
- Slides: detailed / presenter?
- Quiz & Flashcards: difficulty
medium, quantitystandard— adjust? - Infographic: style
auto, or prefer a specific style (sketch-note, professional, bento-grid...)? - Language:
zh_Hant, OK?
Or just say "use defaults" to start immediately.
-
Derive slug — Based on the sources and user message, generate a short kebab-case slug (2-4 words) that captures the core topic. This slug is used for both the notebook name and the output directory.
- Examples:
react-server-components,feynman-technique,taiwan-semiconductor-q4 - Keep it concise, lowercase, ASCII-only (transliterate non-ASCII if needed)
- If the user provides a topic or title, prefer that as the basis
- Examples:
-
Create notebook —
notebooklm create "<slug> <YYYYMMDD>" # → {"notebook_id": "xyz789", ...} ← capture notebook_id notebooklm use <notebook_id> mkdir -p ./output/<slug> -
Set language —
notebooklm language set <confirmed_language>Use the language confirmed in step 1b. ⚠️ GLOBAL setting — always set explicitly to avoid residual from previous runs.
-
Add sources — For each source:
# URL, YouTube, or file path notebooklm source add "<url_or_filepath>" # Google Drive notebooklm source add-drive <file_id> "<title>"For plain text → save to a
.txtfile first, thensource add "./temp_text.txt". -
Generate artifacts — Two-tier strategy for timeout safety:
⚠️ Deduplication gate (Tier 2 only) — Before generating Tier 2 artifacts, call
artifact listonce and check all requested types in that single response:notebooklm artifact list --json # → [{"task_id": "abc123", "type": "slide-deck", "status": "processing"}, ...]For each Tier 2 artifact you are about to generate, look for entries where
typematches (e.g.,slide-deck,audio,video). If multiple entries match the same type, the non-terminal status takes priority (processing/pending>completed>failed):processing/pending→ do NOT generate again. Take the existingtask_id, go to step 9 (wait + deliver).completed→ do NOT generate again. Skip the wait — go directly to download + deliver in step 9.failed→ safe to re-generate.- No matching entry → proceed with generation.
If
artifact listitself fails or returns an error, proceed with generation — the dedup check is a safety net, not a hard gate. Duplicate generation wastes resources and causes confusion — this gate prevents the most common operational error.Tier 1 — Immediate (use
--wait, completes within timeout):# Sync (instant) notebooklm generate mind-map # Fast async (1-2 min) — use options confirmed in step 1b notebooklm generate report --format <chosen_format> --wait notebooklm generate quiz --difficulty <chosen_difficulty> --quantity <chosen_quantity> --wait notebooklm generate flashcards --difficulty <chosen_difficulty> --quantity <chosen_quantity> --wait notebooklm generate data-table "<description>" --wait # Medium async (2-5 min, borderline — if timeout, retry or move to Tier 2) notebooklm generate infographic --style <chosen_style> --orientation <chosen_orientation> --waitTier 2 — Deferred (use
--jsonwithout--wait, capturetask_idfor step 9):# Slow async — use options confirmed in step 1b # Parse JSON output to extract task_id for polling notebooklm generate slide-deck --format <chosen_format> --json # → {"task_id": "abc123", "status": "pending"} ← save task_id notebooklm generate video --format <chosen_format> --style <chosen_style> --json # → {"task_id": "def456", "status": "pending"} ← save task_id # Note: if cinematic, omit --style (ignored by Veo 3) notebooklm generate audio "<description>" --format <chosen_format> --length <chosen_length> --json # → {"task_id": "ghi789", "status": "pending"} ← save task_idOptions accepted as defaults in step 1b can be omitted (CLI uses its own defaults). Parse each JSON response and save the
task_id— you will need it in step 9. Only generate the artifacts the user requested. Skip the rest. Seereferences/artifacts.md→ "Deferred Generation" for Tier 2 details.Write delivery status — Immediately after all Tier 2 generates are dispatched, write
./output/<slug>/delivery-status.jsonso the recovery script can pick up if the agent times out:{ "slug": "<slug>", "notebook_id": "<notebook_id>", "created_at": "<ISO 8601>", "artifacts": [ {"type": "slide-deck", "task_id": "<id>", "status": "pending", "output_path": "./output/<slug>/slides.pdf"}, {"type": "audio", "task_id": "<id>", "status": "pending", "output_path": "./output/<slug>/podcast.mp3"} ] }Update each artifact's
statustocompletedorfailedas step 9 progresses. This file is the handoff contract between the agent andscripts/recover_tier2_delivery.sh. Telegram delivery is agent-only (requires OpenClawmessagetool); the recovery script handles download + status tracking only. -
Download Tier 1 — Each successful Tier 1 artifact into
./output/<slug>/:notebooklm download mind-map ./output/<slug>/mindmap.json notebooklm download report ./output/<slug>/report.md notebooklm download quiz --format json ./output/<slug>/quiz.json notebooklm download flashcards --format json ./output/<slug>/flashcards.json notebooklm download data-table ./output/<slug>/data.csv notebooklm download infographic ./output/<slug>/infographic.png -
Report + Deliver Tier 1 — Present completed Tier 1 artifacts to user. If Tier 2 artifacts are pending, include a status note:
"Slides/Audio/Video are still generating, I'll send them when ready."
Telegram delivery (OpenClaw only) — If
messagetool is available:- Text summary with Tier 2 pending status (always first)
- Report → Quiz → Flashcards → Mind Map → Infographic → Data Table
See
references/telegram-delivery.mdfor delivery contract. Skip Telegram delivery if running outside OpenClaw (e.g. Claude Code, Codex). -
Poll + Deliver Tier 2 — Wait for each deferred artifact in order of expected speed (fastest first), then download and deliver as each completes:
# Wait by expected completion order: slide-deck (fastest) → video → audio (slowest) # Uses --interval 5 (not default 2) since Tier 2 artifacts take minutes, not seconds notebooklm artifact wait <slide_task_id> --timeout 1800 --interval 5 --json # → {"status": "completed", ...} ← task_id from generate is used as artifact_id here notebooklm download slide-deck ./output/<slug>/slides.pdf # → deliver to Telegram immediately notebooklm artifact wait <video_task_id> --timeout 1800 --interval 5 --json # Note: if cinematic, use --timeout 2400 (generation takes 30-40 min) notebooklm download video ./output/<slug>/video.mp4 # → deliver to Telegram immediately notebooklm artifact wait <audio_task_id> --timeout 1800 --interval 5 --json notebooklm download audio ./output/<slug>/podcast.mp3 bash scripts/compress_audio.sh ./output/<slug>/podcast.mp3 ./output/<slug>/podcast_compressed.mp3 # → deliver to Telegram immediately- Order matters: wait for fastest artifact first (slide-deck → video → audio) to minimize idle time
- On completion: download → post-process → deliver to Telegram → update
delivery-status.jsonstatus tocompleted - On failure: update status to
failedwith reason, notify user, continue to next artifact - On timeout: see timeout recovery below
- Max wait: 30 minutes per artifact (covers worst-case audio/video)
- If agent is about to exit with any artifact still
pending, tell the user:"Tier 2 補送模式已啟動,recovery script 會每 5 分鐘檢查並自動送達。"
⚠️ Timeout recovery — If
artifact waitreturnsstatus: "timeout", the artifact is likely still generating. NEVER re-generate. Instead:- Re-check status:
notebooklm artifact poll <task_id> --json - If
processing→ re-wait:notebooklm artifact wait <task_id> --timeout 1800 --interval 5 --json - If
completed→ download and deliver - If
failed→ notify user with error, move to next artifact - If re-wait also times out (2+ total timeouts, ~60 min elapsed) → give up, notify user, suggest downloading from NotebookLM directly A timeout means the wait expired, not that generation failed. The task continues server-side. Re-generating creates duplicates and wastes time.
Error handling
- Auth errors → caught by step 0 precheck. If any CLI command later returns an authentication/session error (HTTP 401, "Not logged in", "session expired", token fetch failure), treat it as a mid-workflow auth failure — stop, ask user to re-login, then re-run step 0 before resuming.
- Tier 1 failure: retry up to 2 times, then include failure note in step 8 delivery.
- Tier 2 failure: notify user per-artifact in step 9. Tier 1 is already delivered by this point, so Tier 2 failures never block text artifact delivery.
- Capture failure reason in delivery status.
Delivery confirmation gate
Before reporting "complete" to the user, ALL of the following must be true:
- Every requested artifact is either successfully delivered or reported as failed with reason
- For Telegram delivery (OpenClaw): each
messagetool call (OpenClaw's built-in messaging tool) returned a success response with amessageId- If a send fails, retry once. If still failing, report the failure to the user — do NOT silently skip
- No artifact is still in
processingorpendingstatus without being tracked
Never say "done" while any artifact is still pending delivery. If Tier 2 artifacts are still generating, say so explicitly and continue waiting. The task is not complete until everything is delivered or accounted for.
Quality gate
Before delivery, verify:
- Sources are concrete article/content pages (not category/index pages).
- Report contains actionable takeaways (not generic summary).
- Quiz tests key concepts and mechanics.
- Flashcards focus on terms, decisions, and trade-offs.
- Output respects requested language and length.
See references/output-contracts.md for format specifications.
Delivery template
- Selection rationale (<=3 bullets)
- Artifact list with paths/status (all 9 types if applicable)
- Key takeaways (3-5 bullets)
- Failures + fallback note (if any)
- One discussion question
Changelog
v2.1.0
- Auth precheck gate — step 0 runs
auth check --test --jsonbefore any work; expired sessions fail fast instead of blowing up mid-generation. - Dedup gate — step 6 checks
artifact listbefore Tier 2 generation to prevent duplicate artifacts when agent retries or resumes. - Timeout recovery —
artifact waittimeout no longer triggers re-generation; polls status and re-waits, giving up only after 2 consecutive timeouts (~60 min). - Delivery confirmation gate — agent cannot claim "done" until every artifact is delivered or explicitly reported as failed with reason.
- Delivery status contract — step 6 writes
delivery-status.jsonafter Tier 2 dispatch; step 9 updates it as artifacts complete. Enables cron-based recovery viascripts/recover_tier2_delivery.shwhen agent times out.
Reviews (0)
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!