Voice Cloning Using Artificial Intelligence and Machine Learning: A Review

Fatima  M Inamdar; Sateesh  Ambesange; Renuka  Mane; Hasan  Hussain; Sahil  Wagh; Prachi  Lakhe

doi:10.17762/jaz.v44iS7.2721

Authors

Fatima M Inamdar
Sateesh Ambesange
Renuka Mane
Hasan Hussain
Sahil Wagh
Prachi Lakhe

DOI:

https://doi.org/10.17762/jaz.v44iS7.2721

Abstract

This paper represents a thorough method for integrating emotions, texttospeech conversion, and state of the art voice cloning. The paper focuses on novel background noise adaptation, emotional voice synthesis, and multi-speaker voice cloning for better speech synthesis. The synthesis of emotive voices, multi-speaker voice cloning, and creative methods for modifying background noise to improve speech synthesis quality are among the topics covered in this study. Additionally, the study explores the domain of emotional artificial intelligence by adding a variety of emotions to artificial voices, improving user engagement through sympathetic reactions. The study also looks at how background noise can be altered to change it from a disturbing to a silent, non-disruptive state. The texttospeech systems usability in noisy conditions is greatly enhanced by this improvement. By integrating these components, the project makes a substantial contribution to text to speech, emotional AI, and voice cloning, creating new avenues for human-computer connection.

Downloads

Download data is not yet available.

Voice Cloning Using Artificial Intelligence and Machine Learning: A Review

Authors

DOI:

Abstract

Downloads

Downloads

Published

Issue

Section

License

Make a Submission

Our Indexing Partners