
Microsoft’s new artificial intelligence model called VALL-E has the ability to generate natural human speech using only a three-second audio prompt, which can be particularly useful in situations with limited data.
Unlike previous speech synthesis systems, VALL-E can only generate a few words or a short sentence, allowing users to produce natural human speech by simply writing or saying a few words.
This technology has many different application areas. For example, businesses can use this technology to generate natural human speech for customer service. Additionally, speech production can be made easier for students with learning difficulties.
Microsoft notes that the VALL-E model has not yet been released and that work on it is ongoing. However, it is certain that this technology will have many different application areas in the future.