Search giant Google has completed the 'critical first step' toward building its artificial intelligence model that will support the world's one thousand most-spoken languages. In aGoogle's announcement is part of the build-up to its annual I/O event where it plans to unveil a slew of products powered by AI.
According to the blog post, the Universal Speech Model is a family of speech models that includes two billion parameters that have been trained on 12 million hours of speech and 28 billion sentences of text. Currently, the model is based on a little over 300 languages but is already in use in Google's products, such as YouTube.
If you have used Automatic Speech Recognition while watching YouTube videos in a language that you are not familiar with, it is the USM that is making it easier to understand the content. Google researchers Yu Zhang and James Qin further elaborated on how the machine-learning model was trained. The researchers state that the fundamental difficulty in training a model such as USM is access to enough data. In a conventional supervised learning approach, the audio data needs to be manually labeled or collected from a pre-existing transcription. This either turns out too expensive, time-consuming, or hard to find, depending on the language and its representation.
The company is now looking to use USM's based model architecture and training pipeline to build its 1,000 language model. Mark Zuckerberg's Meta, which has been betting on building the metaverse, also released its ChatGPT-like
Technology Technology Latest News, Technology Technology Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Source: verge - 🏆 94. / 67 Read more »