VIDEO TO VIDEO TRANSLATION USING MBART MODEL

S.Sindhuja; Vyshnavi. K

Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

There are many languages are spoken in India due to different diversities and different regions,so it is difficult to understand the global languages such as English ,Spanish ,French ,German. so this paper aims to translation of one of the global language English to their regional languages such as Tamil.So what our project does is it takes the Youtube url as an input in which the video should be in English and then save the video and perform the Machine Learning libraries as gTTS and Whisper model,Mbart50 model etc.Through this we do Audio Extraction,Speech-To-Text-Conversion,Text-Translation,Text-To-Speech-Synthesis. Through this we had Integrating language translation and audio synthesis and break down the the Linguistic barriers.

Chinmay Pingulkar Reviewer

Review Request Accepted

Chinmay Pingulkar Reviewer
15 Oct 2024 05:20 PM

Approved Rating

Relevance and Originality

Methodology

Validity & Reliability

Clarity and Structure

Results and Analysis

Comment

Relevance and Originality

This paper addresses a significant and timely issue: the challenge of language barriers in a linguistically diverse country like India, particularly in relation to global languages such as English. By focusing on the translation of English content into regional languages like Tamil, the research has practical implications for enhancing communication and accessibility for non-English speakers. The integration of various machine learning models for audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis reflects an innovative approach to solving real-world problems. This originality in methodology can contribute to more inclusive educational and informational access.

Methodology

The methodology presented in the paper outlines a multi-step process utilizing machine learning libraries such as gTTS, Whisper, and Mbart50. This comprehensive approach includes audio extraction, speech-to-text conversion, translation, and synthesis, providing a clear framework for the project. However, the paper would benefit from a more detailed explanation of the algorithms and models used, including their respective advantages and limitations. Additionally, information on the selection criteria for the YouTube videos analyzed, such as content type or relevance, would enhance the clarity of the methodology. Providing details on how the models were trained or fine-tuned for this specific application could also improve the methodological rigor.

Validity & Reliability

The validity of the paper is supported by the use of established machine learning models and techniques for language translation and audio synthesis. However, to strengthen the reliability of the findings, the paper should include evaluation metrics and results from the implemented models. Providing quantitative data, such as accuracy rates for translation or user satisfaction surveys, would substantiate the claims made about the effectiveness of the system. Additionally, discussing potential biases in the training data or the models used would enhance the overall credibility of the research.

Clarity and Structure

The paper is generally well-structured, guiding readers through the objectives, methodology, and potential outcomes of the project. However, the writing could be improved by breaking down complex sentences and avoiding jargon where possible, making the content more accessible to a broader audience. Including clear headings and subheadings to delineate sections would also enhance the overall readability. Visual aids, such as flowcharts or diagrams illustrating the process from input to output, could provide additional clarity and help readers understand the workflow of the proposed system.

Result Analysis

While the paper outlines the intended functions and applications of the proposed translation and synthesis system, it lacks a detailed analysis of results or practical applications. Including case studies or examples of how the system performs in real-world scenarios would provide valuable insights into its effectiveness. Furthermore, discussing any limitations encountered during implementation, such as challenges with specific languages or accents, would offer a more balanced view of the system's capabilities. This analysis would contribute to a more comprehensive understanding of the project's impact and future directions for improvement.

IJ Publication Publisher

done sir

Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

Chinmay Pingulkar Reviewer

Chinmay Pingulkar Reviewer
15 Oct 2024 05:20 PM

Relevance and Originality

Methodology

Validity & Reliability

Clarity and Structure

Result Analysis

IJ Publication Publisher

QUICKLINKS

CONTACT US

Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

Chinmay Pingulkar Reviewer

Chinmay Pingulkar Reviewer 15 Oct 2024 05:20 PM

Relevance and Originality

Methodology

Validity & Reliability

Clarity and Structure

Result Analysis

IJ Publication Publisher

Chinmay Pingulkar Reviewer
15 Oct 2024 05:20 PM