Have you ever wished that you could just get a summary of a YouTube video instead of having to watch the whole thing?

Summarize YouTube videos with Twilio, OpenAI, and Picovoice

Now, you can! Ezzeddin Abdullah built an application that transcribes and summarizes YouTube videos using Twilio's WhatsApp Messaging API, OpenAI ChatGPT, and Leopard Speech-to-Text.

How it works:

  1. Users send a YouTube link on WhatsApp.
  2. Picovoice Leopard Speech-to-Text transcribes the video.
  3. OpenAI GPT-3.5 Turbo summarizes the transcription provided by Leopard.
  4. Application responses to the user with the summary provided by ChatGPT.
  5. Twilio WhatsApp Messaging API handles the communication between the app and users.



  1. You can create your own application using one of the open-source large language models instead of OpenAI GPT-3.5 Turbo.
  2. Picovoice Audio Recorders is a superior alternative to FFmpeg when it comes to speech processing. They offer cross-platform support and simplified audio recording and processing. You can check out Picovoice’s Subtitle Tutorial to see an example.

You can find the tutorial on Twilio’s Blog and check out Ezzeddin's website for other great tutorials and guides he’s created!

We love seeing the community projects enabled by Picovoice’s Forever-Free Plan. Don’t forget to share them with us!