This project is based on converting the audio signals received to text using speech to text api (python modules ... Data sets of predefined sign language are used as the input so that the software can ...
This repo is for text-to-audio diffusion utilizing a denoising unet and Meta's Encodec. The unet is trained to denoise Encodec's encoded codebooks while taking in t5 text embeddings as conditioning.