Voice Cloning Using Artificial Intelligence and Machine Learning: A Review

Main Article Content

Fatima M Inamdar
Sateesh Ambesange
Renuka Mane
Hasan Hussain
Sahil Wagh
Prachi Lakhe

Abstract

This paper represents a thorough method for integrating emotions, texttospeech conversion, and state of the art voice cloning. The paper focuses on novel background noise adaptation, emotional voice synthesis, and multi-speaker voice cloning for better speech synthesis. The synthesis of emotive voices, multi-speaker voice cloning, and creative methods for modifying background noise to improve speech synthesis quality are among the topics covered in this study. Additionally, the study explores the domain of emotional artificial intelligence by adding a variety of emotions to artificial voices, improving user engagement through sympathetic reactions. The study also looks at how background noise can be altered to change it from a disturbing to a silent, non-disruptive state. The texttospeech systems usability in noisy conditions is greatly enhanced by this improvement. By integrating these components, the project makes a substantial contribution to text to speech, emotional AI, and voice cloning, creating new avenues for human-computer connection.

Downloads

Download data is not yet available.

Article Details

How to Cite
M Inamdar, F. ., Ambesange, S. ., Mane, R. ., Hussain, H. ., Wagh, S. ., & Lakhe, P. . (2023). Voice Cloning Using Artificial Intelligence and Machine Learning: A Review. Journal of Advanced Zoology, 44(S7), 419–427. https://doi.org/10.17762/jaz.v44iS7.2721
Section
Articles