Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

Many languages are spoken in India across its diverse regions and communities, so global languages such as English, Spanish, French, and German can be difficult for many people to understand. This paper aims to translate one of these global languages, English, into regional languages such as Tamil. The project takes a YouTube URL as input, where the video must be in English, saves the video, and processes it with machine learning tools such as gTTS, the Whisper model, and the mBART-50 model. The pipeline performs audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis. By integrating language translation and audio synthesis, the project aims to break down linguistic barriers.

Chinmay Pingulkar Reviewer

Review Request Accepted

15 Oct 2024 05:20 PM

Approved

Relevance and Originality

This paper addresses a significant and timely issue: the challenge of language barriers in a linguistically diverse country like India, particularly in relation to global languages such as English. By focusing on the translation of English content into regional languages like Tamil, the research has practical implications for enhancing communication and accessibility for non-English speakers. The integration of various machine learning models for audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis reflects an innovative approach to solving real-world problems. This originality in methodology can contribute to more inclusive educational and informational access.


Methodology

The methodology presented in the paper outlines a multi-step process utilizing machine learning libraries such as gTTS, Whisper, and Mbart50. This comprehensive approach includes audio extraction, speech-to-text conversion, translation, and synthesis, providing a clear framework for the project. However, the paper would benefit from a more detailed explanation of the algorithms and models used, including their respective advantages and limitations. Additionally, information on the selection criteria for the YouTube videos analyzed, such as content type or relevance, would enhance the clarity of the methodology. Providing details on how the models were trained or fine-tuned for this specific application could also improve the methodological rigor.
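As a rough illustration of the four-stage flow this section refers to, the sketch below wires the stages together as pluggable functions. It is not the authors' implementation, which the paper does not detail. The class name `TranslationPipeline`, the helper `build_stub_pipeline`, the file names, and the placeholder stage functions are all hypothetical; a real system would replace the placeholders with, for example, a Whisper transcriber, an mBART-50 translator, and a gTTS synthesizer.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class TranslationPipeline:
    """Four pluggable stages; each is any callable with the shape noted."""
    extract_audio: Callable[[str], str]    # video path -> audio path
    transcribe: Callable[[str], str]       # audio path -> English transcript
    translate: Callable[[str], str]        # English text -> target-language text
    synthesize: Callable[[str, str], str]  # (text, output path) -> output path

    def run(self, video_path: str, out_path: str) -> Dict[str, str]:
        # Each stage consumes the previous stage's output.
        audio_path = self.extract_audio(video_path)
        transcript = self.transcribe(audio_path)
        translation = self.translate(transcript)
        speech_path = self.synthesize(translation, out_path)
        return {
            "audio": audio_path,
            "transcript": transcript,
            "translation": translation,
            "speech": speech_path,
        }


def build_stub_pipeline() -> TranslationPipeline:
    """Wire the stages with placeholders (assumption: no real models).

    The placeholders only manipulate strings and write no files, so the
    control flow can be exercised without any downloads.
    """
    return TranslationPipeline(
        extract_audio=lambda video: video.rsplit(".", 1)[0] + ".wav",
        transcribe=lambda audio: "hello world",
        translate=lambda text: "[ta] " + text,
        synthesize=lambda text, out: out,
    )


if __name__ == "__main__":
    print(build_stub_pipeline().run("talk.mp4", "talk_ta.mp3"))
```

Keeping each stage behind a plain function signature would also make it easier to document the advantages and limitations of each model separately, as requested above, because any one stage can be swapped or evaluated in isolation.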


Validity & Reliability

The validity of the paper is supported by the use of established machine learning models and techniques for language translation and audio synthesis. However, to strengthen the reliability of the findings, the paper should include evaluation metrics and results from the implemented models. Providing quantitative data, such as accuracy rates for translation or user satisfaction surveys, would substantiate the claims made about the effectiveness of the system. Additionally, discussing potential biases in the training data or the models used would enhance the overall credibility of the research.
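The review does not name a specific metric. As one possible example of the quantitative evidence requested here, the sketch below computes word error rate (WER) for the speech-to-text stage by comparing a reference transcript with a system transcript. The function name `word_error_rate` and the sample sentences are illustrative assumptions; translation quality would need a different metric, which is not shown.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so
    # far and the first j hypothesis words (row-by-row dynamic programming).
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref_words)


if __name__ == "__main__":
    # Hypothetical transcripts: one word ("the") is dropped by the system.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Reporting a figure like this over a fixed set of test videos, alongside a translation-quality score and user feedback, would give readers a basis for judging the system's reliability.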


Clarity and Structure

The paper is generally well-structured, guiding readers through the objectives, methodology, and potential outcomes of the project. However, the writing could be improved by breaking down complex sentences and avoiding jargon where possible, making the content more accessible to a broader audience. Including clear headings and subheadings to delineate sections would also enhance the overall readability. Visual aids, such as flowcharts or diagrams illustrating the process from input to output, could provide additional clarity and help readers understand the workflow of the proposed system.


Results and Analysis

While the paper outlines the intended functions and applications of the proposed translation and synthesis system, it lacks a detailed analysis of results or practical applications. Including case studies or examples of how the system performs in real-world scenarios would provide valuable insights into its effectiveness. Furthermore, discussing any limitations encountered during implementation, such as challenges with specific languages or accents, would offer a more balanced view of the system's capabilities. This analysis would contribute to a more comprehensive understanding of the project's impact and future directions for improvement.

IJ Publication Publisher

done sir

Publisher

IJ Publication

Reviewer

Chinmay Pingulkar

More Detail

Paper Category

Computer Engineering

Journal Name

JETIR - Journal of Emerging Technologies and Innovative Research

p-ISSN

e-ISSN

2349-5162
