Transparent Peer Review By Scholar9

VIDEO TO VIDEO TRANSLATION USING MBART MODEL

Abstract

Many languages are spoken in India because of its regional and cultural diversity, so global languages such as English, Spanish, French, and German can be difficult for many people to understand. This paper aims to translate content in one global language, English, into regional languages such as Tamil. The project takes the URL of an English-language YouTube video as input, saves the video, and processes it with machine learning tools including gTTS, the Whisper model, and the mBART-50 model. The pipeline performs audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis. By integrating language translation and audio synthesis, the project helps break down linguistic barriers.

Srinivasulu Harshavardhan Kendyala Reviewer
15 Oct 2024 05:27 PM

Review Request Accepted

Srinivasulu Harshavardhan Kendyala Reviewer
15 Oct 2024 05:31 PM

Approved Rating

Comment

Relevance and Originality

The research article addresses a crucial issue in India's multilingual landscape, where the diversity of languages presents challenges in understanding global languages like English, Spanish, French, and German. The project's aim to translate English videos into regional languages such as Tamil is highly relevant and original, as it targets a significant barrier to communication and access to information for non-English speakers. The use of YouTube as a source for translation showcases an innovative approach to leveraging existing digital content. However, the article could enhance its originality by discussing the broader implications of such technology for education, cultural exchange, and digital inclusivity.

Methodology

The methodology outlined in the research article combines several machine learning tools, namely gTTS, Whisper, and mBART-50, to cover audio extraction, speech-to-text conversion, text translation, and text-to-speech synthesis. This multi-stage approach allows for a comprehensive solution to the translation problem. However, the article would benefit from a more detailed explanation of how each step is implemented, including the specific models and algorithms used, their performance metrics, and how the stages are integrated. Including examples or case studies demonstrating the workflow would provide clarity and enhance the methodological rigor.
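
To make the requested level of detail concrete, the four stages named in the abstract could be wired together roughly as follows. This is only a minimal sketch and not the authors' implementation: the function names, the Whisper model size ("base"), the mBART-50 checkpoint name, the intermediate file name, and the use of the ffmpeg command-line tool are all assumptions chosen for illustration. It also assumes the video has already been saved locally, and it does not merge the synthesized audio back into the video.

```python
import subprocess


def extract_audio(video_path: str, audio_path: str) -> str:
    """Audio extraction: strip the audio track with the ffmpeg CLI (assumed installed)."""
    subprocess.run(
        ["ffmpeg", "-y", "-i", video_path, "-vn", "-ac", "1", "-ar", "16000", audio_path],
        check=True,
    )
    return audio_path


def transcribe(audio_path: str) -> str:
    """Speech-to-text: transcribe English audio with Whisper."""
    import whisper  # openai-whisper package

    model = whisper.load_model("base")  # model size is an assumption
    return model.transcribe(audio_path)["text"]


def translate(text: str) -> str:
    """Text translation: English to Tamil with mBART-50."""
    from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

    name = "facebook/mbart-large-50-many-to-many-mmt"  # assumed checkpoint
    tokenizer = MBart50TokenizerFast.from_pretrained(name)
    model = MBartForConditionalGeneration.from_pretrained(name)
    tokenizer.src_lang = "en_XX"
    encoded = tokenizer(text, return_tensors="pt")
    # Force the decoder to start with the Tamil language token.
    generated = model.generate(
        **encoded, forced_bos_token_id=tokenizer.convert_tokens_to_ids("ta_IN")
    )
    # Note: a long transcript would need to be split into chunks before this step.
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]


def synthesize(text: str, out_path: str) -> str:
    """Text-to-speech: render Tamil text to an MP3 file with gTTS."""
    from gtts import gTTS

    gTTS(text=text, lang="ta").save(out_path)
    return out_path


def translate_video_audio(video_path, out_path, *, extract=extract_audio,
                          stt=transcribe, mt=translate, tts=synthesize):
    """Run the four stages in order; each stage can be swapped out for testing."""
    audio = extract(video_path, "audio.wav")  # hypothetical intermediate file name
    english = stt(audio)
    tamil = mt(english)
    return tts(tamil, out_path)
```

Passing the stages in as parameters keeps each one independently replaceable, which is also where per-stage performance metrics could be collected.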

Validity & Reliability

The proposed methods for language translation and audio synthesis appear valid, given the increasing use of machine learning in natural language processing. By utilizing established libraries, the research leverages proven technologies, which contributes to the reliability of its findings. However, the article would strengthen its claims by including empirical evidence, such as user feedback or performance evaluations, to assess the effectiveness of the translation process. Additionally, addressing potential challenges or limitations, such as the accuracy of translations and audio quality, would provide a more balanced view of the project's validity.
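
As one example of the kind of performance evaluation suggested here, the speech-to-text stage could be scored with word error rate (WER) against a manually prepared reference transcript. The metric is the reviewer's suggestion rather than something reported in the article, and the function below is a plain illustrative sketch based on word-level edit distance.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between the two texts, divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words.
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

A comparable automatic score for the translation stage, together with listener ratings of the synthesized audio, would address the remaining stages.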

Clarity and Structure

The clarity and structure of the research article could be improved for better reader comprehension. While the objectives and methods are outlined, the flow of information can be somewhat disjointed, making it challenging for readers to grasp the overall project. Clearer subheadings and organized sections would enhance navigation and understanding. Additionally, simplifying complex technical language and providing definitions for specialized terms would make the content more accessible to a broader audience.

Results and Analysis

The article highlights the integration of language translation and audio synthesis as a means to break down linguistic barriers, but it lacks a detailed analysis of results or outcomes. Including specific examples of translated content, evaluations of accuracy, and user experiences would provide a clearer picture of the project's effectiveness. Additionally, discussing the potential impact of the technology on enhancing communication and access to information for speakers of regional languages would enrich the analysis and highlight the project's significance in a broader context.

IJ Publication Publisher
15 Oct 2024 05:32 PM

ok sir

Publisher

IJ Publication

Reviewer

Srinivasulu Harshavardhan Kendyala

More Detail

Paper Category

Computer Engineering

Journal Name

JETIR - Journal of Emerging Technologies and Innovative Research

p-ISSN

e-ISSN

2349-5162
