Skip to main content
Loading...
Scholar9 logo True scholar network
  • Article ▼
    • Article List
    • Deposit Article
  • Mentorship ▼
    • Overview
    • Sessions
  • Questions
  • Scholars
  • Institutions
  • Journals
  • Login/Sign up
Back to Top

Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

There are many languages are spoken in India due to different diversities and different regions,so it is difficult to understand the global languages such as English ,Spanish ,French ,German. so this paper aims to translation of one of the global language English to their regional languages such as Tamil.So what our project does is it takes the Youtube url as an input in which the video should be in English and then save the video and perform the Machine Learning libraries as gTTS and Whisper model,Mbart50 model etc.Through this we do Audio Extraction,Speech-To-Text-Conversion,Text-Translation,Text-To-Speech-Synthesis. Through this we had Integrating language translation and audio synthesis and break down the the Linguistic barriers.

Balachandar Ramalingam Reviewer

badge Review Request Accepted

Balachandar Ramalingam Reviewer

15 Oct 2024 05:47 PM

badge Approved

Relevance and Originality

Methodology

Validity & Reliability

Clarity and Structure

Results and Analysis

Relevance and Originality

This research paper addresses a crucial issue regarding language barriers in India, where numerous regional languages coexist. The project’s focus on translating English videos into Tamil highlights the necessity for effective communication in a multilingual society. By utilizing cutting-edge machine learning models like gTTS, Whisper, and MBart50, the study presents an original approach to bridging linguistic gaps, particularly in the context of global languages. The relevance is further underscored by the growing importance of accessible educational and informational content in diverse languages, making this work significant in today’s digital landscape.


Methodology

The methodology outlined in the paper demonstrates a systematic approach to achieving its goals. The use of a YouTube URL as input for processing English videos is innovative, allowing for practical application of the proposed translation system. The integration of various machine learning libraries for audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis showcases a comprehensive methodology. However, more details on the selection criteria for the machine learning models and the specific algorithms used could enhance the robustness of the methodology section. Additionally, discussing any challenges encountered during implementation would provide valuable insights into the practical aspects of the project.


Validity & Reliability

The validity of the findings appears strong, given the incorporation of well-established machine learning models known for their effectiveness in language processing tasks. However, the paper would benefit from including quantitative metrics or evaluation criteria to assess the performance of the translation system. For example, incorporating user feedback or accuracy rates for translation and speech synthesis would enhance the reliability of the results. Furthermore, a comparison with existing translation solutions could illustrate the advantages of the proposed approach, bolstering its credibility.


Clarity and Structure

The paper is generally well-structured, guiding the reader through the project's aims, methodology, and expected outcomes. However, some sentences could be rephrased for better clarity and conciseness. For instance, the explanation of the machine learning processes could be broken down into clearer steps, allowing readers to follow the workflow more easily. Utilizing bullet points or numbered lists to outline the processes involved in each stage (audio extraction, translation, synthesis) would improve readability and comprehension.


Result Analysis

The results analysis effectively emphasizes the integration of language translation and audio synthesis, aiming to break down linguistic barriers. However, the analysis could be strengthened by including preliminary findings or case studies demonstrating the system’s effectiveness in real-world scenarios. Providing examples of translated content and user experiences would illustrate the practical implications of the research. Additionally, discussing potential limitations, such as the accuracy of machine translations and challenges related to dialects or variations within Tamil, would present a more balanced view of the project’s outcomes. Suggestions for future enhancements or applications of the system could further enrich the discussion, highlighting the project's potential impact.

avatar

IJ Publication Publisher

ok sir

Publisher

User Profile

IJ Publication

Reviewer

User Profile

Balachandar Ramalingam

More Detail

User Profile

Paper Category

Computer Engineering

User Profile

Journal Name

JETIR - Journal of Emerging Technologies and Innovative Research

User Profile

p-ISSN

User Profile

e-ISSN

2349-5162

Subscribe us to get updated

logo logo

Scholar9 is aiming to empower the research community around the world with the help of technology & innovation. Scholar9 provides the required platform to Scholar for visibility & credibility.

QUICKLINKS

  • What is Scholar9?
  • About Us
  • Mission Vision
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • Blogs
  • FAQ

CONTACT US

  • logo +91 82003 85143
  • logo hello@scholar9.com
  • logo www.scholar9.com

© 2025 Sequence Research & Development Pvt Ltd. All Rights Reserved.

whatsapp