🎬

Squeeze the key points out of a long video or lecture in minutes with AI

Hand a long video, online class or talk to AI; get structured notes in minutes and ask follow-ups on what you missed.

Learning Beginner

A one-hour online class, a two-hour talk, a long meeting replay — the content may be useful, but you just don’t have the patience to watch it all, especially when the gold is buried in some ten minutes in the middle.

AI can “watch it first” for you: give it the video’s captions or audio and it reads through the whole thing in minutes, pulling out the through-line and key points. The trick isn’t a vague “summarize this,” but a fixed routine — get structured notes with timestamps first, then drill into the parts you didn’t get. A one-hour video shrinks to a few minutes of grasping 80–90% of it.

When to use it

When an online class is too long to watch in full, a talk/livestream replay drags on, or you want to judge fast whether a video is worth a close watch — let AI surface the key points first, then decide which part to rewatch.

How to do it

  1. Get the text: many online classes / videos carry captions you can copy; if not, use Tongyi Tingwu to upload the audio/video and auto-transcribe
  2. Paste the captions or transcript into Kimi / Doubao and ask for “structured notes with timestamps”
  3. Drill into unclear parts and have it cite which minute of the video it’s from so you can rewatch precisely
  4. Finally ask for 3–5 “most worth remembering” takeaways and save them to your notes

Weak vs strong

❌ How most people write it
Summarize this video for me.
✅ Do this instead
Here are the captions of a personal-finance class. Turn them into structured notes: 1) the topics covered, in order 2) the core point and key takeaway of each 3) the 3 most practical tips for “an ordinary salaried worker.” Mark the minute for each where you can, and list 3 questions I should dig into next.

The left gives a vague restatement; the right adds structure, ties it to your situation and asks for timestamps — you finish with something usable.

Copy-paste prompt

Here are the captions (or transcript) of a【class / talk / video】. Turn them into structured notes: 1) the topics in order 2) the core point and key conclusion of each 3) the 3–5 takeaways most useful to me. Mark the rough timestamp (which minute) for each where possible. Then list 3 questions I should ask or look into next. Here it is:【paste the captions / transcript here】

Worked examples

Example 1 · A one-hour class into one page of notes
Here are the captions of a one-hour Excel class. In one page, organize: which features it covers, how to use each (one-line steps), and which a beginner should learn first. Plain language; explain any term the first time it appears.

You get:Without watching it all, you get a list of “what it teaches and which to practice first,” then rewatch just the one or two features that matter.

Example 2 · No captions? Transcribe first, then summarize
(First upload the audio to Tongyi Tingwu to auto-transcribe, then paste the text into AI.) This is the transcript of an industry talk. Extract: the speaker’s core points, any specific data or cases mentioned, 3 tips for practitioners — and flag likely mis-transcribed jargon for me to verify.

You get:Even recordings with no captions work: transcribe first, then summarize — it even flags likely mis-heard terms for you to check.

Level up

  • Make a quiz: ask it to “write 5 questions to test me on this class” to check what stuck
  • Compare: paste two classes on the same topic and ask it to “contrast the two instructors’ views” to spot the more reliable one
  • Turn it into an action list: have it reorganize the methods into “steps I can actually follow,” not just facts

Common mistakes

  • Trusting the summary blindly — recheck key figures and conclusions against the video; AI can distort or invent
  • Just saying “summarize” — give structure and your goal so it summarizes for you specifically
  • Mistaking a summary for having watched it — it locates what matters, but the one or two key parts still need a real watch

FAQ

How do I turn a video into text the AI can read?
Captions are easiest — Bilibili, online classes and many apps show and let you copy them. No captions? Upload the audio/video to Tongyi Tingwu to auto-transcribe, then paste the text into Kimi / Doubao to summarize.
The video’s too long and the text won’t fit in one go — what then?
Paste it in chunks and say “summarize this part, remember it, and combine once I’ve pasted everything”; or use a tool like Tongyi Tingwu that handles long audio/video in one pass.

Pro tip:Treat it as the assistant that watches first: it tells you which part is worth a rewatch and locates the key points, but the real learning is still on you.

Related tips