YouTube Video Automatic Transcription and Intelligent Summary Push Workflow
This workflow receives requests containing a YouTube video link, extracts the video ID, and retrieves the video's details and subtitle text. An AI model then analyzes the subtitles and produces a structured summary, which is pushed to a Telegram channel, so the whole process runs without manual steps. It shortens the time needed to review video content and suits content creators, educators, and marketing teams who need to grasp and share a video's highlights quickly.

Workflow Name
YouTube Video Automatic Transcription and Intelligent Summary Push Workflow
Key Features and Highlights
This workflow processes YouTube videos automatically: it receives a webhook request containing a YouTube video link, extracts the video ID, retrieves the video details and subtitle text, and uses an AI model to analyze the subtitles and produce a structured summary. The result is returned in the webhook response and pushed to a Telegram channel. The entire process runs without manual intervention.
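As a rough illustration of the entry point, a caller might build the POST request as sketched below. The webhook URL and the `youtubeUrl` field name are assumptions made for this example; the actual path and field depend on how the Webhook node is configured.

```python
import json
import urllib.request

# Hypothetical endpoint: replace with the workflow's real webhook URL.
WEBHOOK_URL = "https://n8n.example.com/webhook/youtube-summary"


def build_webhook_request(video_url: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request carrying the video link."""
    # "youtubeUrl" is an assumed field name, not confirmed by the workflow.
    body = json.dumps({"youtubeUrl": video_url}).encode("utf-8")
    return urllib.request.Request(
        WEBHOOK_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request; the response body would then carry the summary returned by the workflow.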
Core Problems Addressed
- Automates the retrieval and processing of YouTube video subtitles, eliminating the tedious manual downloading and organizing of subtitles.
- Uses AI to condense long subtitle text into a structured summary, making key information faster to find.
- Instantly pushes summary information to Telegram for rapid sharing and notification.
Application Scenarios
- Content creators and marketers can quickly grasp the core content of videos to assist in content planning and secondary creation.
- Education and training teams can convert video explanations into structured notes for easy review and sharing.
- Media monitoring and intelligence analysts can reach key information quickly through video summaries.
- Automated operations can push video updates and key points to team communication tools in real time.
Main Workflow Steps
- Webhook Request Reception: Listens on a specified endpoint to receive POST requests containing YouTube video links.
- Extract Video URL: Extracts the YouTube video link from the request body.
- Parse Video ID: Uses regular expressions to extract the unique video ID from the link.
- Retrieve Video Information: Calls the YouTube API to fetch basic video details and subtitles.
- Split Subtitle Text: Divides the retrieved subtitle content into sentences or paragraphs.
- Concatenate Text: Reassembles the split subtitles into a complete text.
- Intelligent Summary Analysis: Employs LangChain combined with the GPT-4o-mini model to generate a structured summary and analysis of the subtitle content.
- Generate Response Object: Integrates the video title, description, ID, original link, and summary information.
- Return and Push Results: Sends the response back to the caller via webhook and simultaneously pushes the summary title and link to a Telegram group or channel.
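The parsing, concatenation, and response-building steps above can be sketched in a few lines. This is a minimal illustration, not the workflow's actual node code: the regular expression, the 11-character ID length (a common convention rather than a guarantee), the `{"text": ...}` segment shape, and the response field names are all assumptions.

```python
import re
from typing import Dict, List, Optional

# Assumed pattern covering common watch, short-link, embed, and shorts URLs.
_VIDEO_ID_RE = re.compile(
    r"(?:youtube\.com/(?:watch\?(?:.*&)?v=|embed/|shorts/)|youtu\.be/)"
    r"([A-Za-z0-9_-]{11})"
)


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID found in the URL, or None if none is found."""
    match = _VIDEO_ID_RE.search(url)
    return match.group(1) if match else None


def concatenate_segments(segments: List[Dict[str, str]]) -> str:
    """Join subtitle segments (hypothetical shape: {"text": ...}) into one text."""
    parts = [seg.get("text", "").strip() for seg in segments]
    return " ".join(part for part in parts if part)


def build_response(video_id: str, source_url: str, title: str,
                   description: str, summary: str) -> Dict[str, str]:
    """Combine the fields listed in the steps; the key names are illustrative."""
    return {
        "title": title,
        "description": description,
        "id": video_id,
        "youtubeUrl": source_url,
        "summary": summary,
    }
```

Returning `None` for an unrecognized link lets the caller reject the request early instead of querying for a video that cannot be identified.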
Involved Systems or Services
- YouTube API: Used to obtain video details and subtitle content.
- n8n Webhook: Serves as the entry point to receive requests and return results.
- LangChain + OpenAI GPT-4o-mini: Implements intelligent text analysis and summary generation for subtitles.
- Telegram: Pushes video titles and links, supporting HTML-formatted messages.
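Because the Telegram message is HTML-formatted, characters such as `<`, `>`, and `&` in a video title need escaping before sending. The sketch below builds a payload with the `chat_id`, `text`, and `parse_mode` parameters of the Bot API's `sendMessage` method; the message layout, the link label, and the function name are assumptions, not the workflow's actual template.

```python
import html
from typing import Dict


def build_telegram_payload(chat_id: str, title: str, url: str) -> Dict[str, str]:
    """Build a sendMessage payload with an HTML-escaped title and link."""
    # Escape text content; quotes only need escaping inside the href attribute.
    safe_title = html.escape(title, quote=False)
    safe_url = html.escape(url, quote=True)
    # Assumed layout: bold title on the first line, link on the second.
    text = f'<b>{safe_title}</b>\n<a href="{safe_url}">Watch on YouTube</a>'
    return {"chat_id": chat_id, "text": text, "parse_mode": "HTML"}
```

Without escaping, a title containing an unmatched `<` would be read as malformed markup and the message could be rejected.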
Target Users and Value
- Video Content Operators: Quickly grasp video highlights to improve content editing efficiency.
- Educators and Students: Facilitate the creation of video study notes and knowledge summaries.
- Marketing and Social Media Teams: Share video key points promptly with teams or clients to promote information dissemination.
- Automation Developers and Enterprises: Integrate video content processing capabilities to realize intelligent business workflows.
This workflow automates the processing of video subtitles end to end, making video content much easier to understand and share. It is a practical tool for getting more value from video content and communicating it efficiently.