GenAI Based YouTube Video Summarizer
DOI:
https://doi.org/10.5281/zenodo.17926249Keywords:
Video Summarization, Artificial Intelligence, Text-To-Speech, Multilingual Transcription, Streamlit InterfaceAbstract
This paper proposes an intelligent, web-based application—AI video summarizer—that efficiently extracts, Tran- scribes, and summarizes YouTube video content using advanced AI models such as Google Gemini. By simply entering a video link, users can obtain multilingual transcripts (in English, Hindi, and Marathi), concise summaries, and time stamped highlights of key moments. Furthermore, the application converts the generated summaries into audio using GTTS and offers options to download or copy full transcripts. Built with Streamlit, it provides an interactive and user-friendly interface. This solution addresses the growing challenge of overwhelming digital video content, offering a more accessible, time-saving, and language- inclusive way to understand and utilize video information across various fields.
Downloads
References
Yogendra Singh, Rishu Kumar, Soumya Kabdal, & Prashant Upadhyay. (2023). Youtube video summarizer using nlp: A review. International Journal of Performability Engineering, 19(12), 817, 2023.
Mrs Gauri Mandar Puranik, Nidhi Kamath, Gargi Dusane, & Nakshatra Akhadkar. (2018). Youtube video summarizer: A web based application for concise visual and textual summary. Artificial Intelligence.
Reshma Shaik, Saloni Bargat, & Shilpa Ghode. (2023). Article and youtube transcript summarizer using spacy and nltk module. SSGM Journal of Science and Engineering, 1(1), 126–131.
Oguzhan Topsakal, & Tahir Cetin Akinci. (2023). Creating large language model applications utilizing langchain: A primer on developing llm apps fast. In: International Conference on Applied Engineering and Natural Sciences, 1, pp. 1050–1056.
Yixin Liu, Kejian Shi, Katherine S He, Longtian Ye, Alexander R Fabbri, Pengfei Liu, Dragomir Radev, & Arman Cohan. (2023). On learning to summarize with large language models as references. arXiv preprint arXiv:2305.14239.
Jitender Kumar, Ritu Vashistha, Roop Lal, & Dhrumil Somanir. (2023). Youtube transcript summarizer. In: 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), pp. 1–4. IEEE.
Nayana Cholanayakanahalli Prakash Prakash, Achyutha Prasad Narasimhaiah, Jagadeesh Bettakote Nagaraj, Piyush Kumar Pareek, Nalini Bpalya Maruthikumar, & Ramya Iyaravally Manjunath. (2022). Implementation of NLP based automatic text summarization using spacy. International Journal of Health Sciences, 6(S5), 7508–7521.
Kaiqiang Song, Chen Li, Xiaoyang Wang, Dong Yu, & Fei Liu. (2022). Towards abstractive grounded summarization of podcast transcripts. arXiv preprint arXiv:2203.11425
Sulochana Devi, Rahul Nadar, Tejas Nichat, & Alfredprem Lucas. (2023). Abstractive summarizer for youtube videos. In: International Conference on Applications of Machine Intelligence and Data Analytics, pp. 431–438. Atlantis Press.
Rand Abdulwahid Albeer, Huda F Al-Shahad, Hiba J Aleqabie, & Noor D Al-shakarchy. (2022). Automatic summarization of youtube video transcription text using term frequency-inverse document frequency. Indonesian Journal of Electrical Engineering and Computer Science, 26(3), 1512–1519.
Ishitva Awasthi, Kuntal Gupta, Prabjot Singh Bhogal, Sahejpreet Singh Anand, & Piyush Kumar Soni. (2021). Natural language processing (NLP) based text summarization-a survey. In: 6th International Conference on Inventive Computation Technologies (ICICT), pp. 1310–1317. IEEE.
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Deepali Jadhav, Samiksha Raghunath Devardekar, Amruta Bharat Talandage, Onkar Tanajirao Bhokare, Akshata Ashok Kudale, Vaishnavi Shivaji Patil

This work is licensed under a Creative Commons Attribution 4.0 International License.
Research Articles in 'International Journal of Engineering and Management Research' are Open Access articles published under the Creative Commons CC BY License Creative Commons Attribution 4.0 International License http://creativecommons.org/licenses/by/4.0/. This license allows you to share – copy and redistribute the material in any medium or format. Adapt – remix, transform, and build upon the material for any purpose, even commercially.






