Python Mel Spectrogram

EcoSoundNet: Animal Sound Classifier

Abstract: Recognition of animals in the forest through sound classification, using frameworks such as artificial and convolutional neural networks (ANNs/CNNs) to analyze animal vocalizations that are pre-processed using ...

Scientific Research Publishing

UNESCO (2021) Towards Sustainable Preservation and Accessibility of Documentary Heritage.

ABSTRACT: The aim of this research is to develop a speech synthesis model tailored to Nigerian languages by leveraging natural language processing tools such as FastSpeech 2 and meta-tts for ...

Scientific Research Publishing

Tan, Y. and Jehom, W.J. (2024) The Function of Digital Technology in Minority Language Preservation: The Case of the Gyalrong Tibetan Language. Preservation, Digital Technology ...

GitHub

Vector-Quantized Contrastive Predictive Coding

Masked Spectrogram Modeling using Masked Autoencoders (MSM-MAE)

SR-TTS: a rhyme-based end-to-end speech synthesis system

Enhancing Speech Emotion Recognition: A Dual-Channel Spectrogram Approach

Speech based Emotion Recognition using Machine Learning