• AI Academy
  • Posts
  • 🗣️ OpenAI Can Clone Any Voice in Seconds

🗣️ OpenAI Can Clone Any Voice in Seconds

Plus Salesforce enters Medical treatments and Amazon Invests in Anthropic

Hello AI Enthusiast,

Last week, we talked about Nvidia and Hippocratic AI's new AI nurses, which could help with the nursing shortage by offering advice over video calls for much cheaper than human nurses. We asked you what you thought about it, and 67% saw it as a good chance to make medical support more available to people. One of you specified that “It might be helpful sometimes, but human contact is one of the key factors for longevity as well as recovery”.

It's up to us to find the right balance between using AI to help us and keeping the human touch where it matters most.

Today's top news includes OpenAI's Voice Engine. It lets you imagine the possibilities when you can clone a voice from just a 15-second snippet. What could this mean for the future of communication, content creation, and even personal identity?

Before diving into the implications, tomorrow, our Lead Business with Generative AI course will be available for everyone to join. This program is designed to help you craft an AI business strategy tailored to your company's objectives. This is your heads-up to join the waitlist today because we're offering a special deal just for those who subscribe — you can save 500€ on the signup cost! You still have a few hours.

Now on with our news list. 👇

News Bytes 🗞️

  • OpenAI has released Voice Engine, a text-to-voice platform that can create synthetic voices from a 15-second voice sample, with partners such as Age of Learning and HeyGen utilizing it. OpenAI affirms that the technology is being used for good across various industries, with a focus on maintaining ethical use and obtaining consent from original speakers.

  • 💡 Our take: OpenAI is making it easier for everyone to create artificial voices. HeyGen's choice to use it suggests it's superior to alternatives. This also means that nowadays, businesses should prioritize how a product feels to use over developing their technology, especially with high-quality tech like OpenAI's available for integration. OpenAI emphasizes ethical use and consent, which is commendable. However, considering the economic motives behind technological advancements is always wise.

Inspired by HeyGen using OpenAI's Voice Engine technology, dive into our "Generative AI Project Bootcamp" to learn crafting AI products leveraging ChatGPT’s tech. Exclusive early access and a free webinar await our waitlist members. Join the waitlist today.

  • ProGen, a project led by Salesforce, has evolved into the startup Profluent, using AI to design proteins for potential medical treatments. Profluent plans to leverage generative AI to revolutionize the intentional design of treatments, potentially reducing the time and capital typically needed for drug development.

  • 💡 Our take: Generative AI isn't just good at guessing the next word—it can also predict the next protein. So, instead of just using it for writing blog posts, why not harness its power to tackle diseases we've yet to cure? What is interesting also is that a company like Salesforce has started something like this. Probably this means that if you have computing power and talent (both scarce) you can build whatever you want.

  • Amazon is investing $2.75 billion in AI startup Anthropic, aiming to enhance customer experiences. The deal includes Amazon maintaining a minority stake and marks a trend of cloud providers like AWS and Google heavily investing in AI companies.

  • Microsoft paid Inflection AI $620 million for the nonexclusive right to sell their AI model, with a total estimated deal value exceeding $1 billion.

  • 💡 Our take: Microsoft's and Amazon’s strategic moves show a trend towards consolidating power among tech giants. Building unique AI models independently is increasingly challenging for startups. Amazon with Anthropic, Microsoft with OpenAI, and Google with Gemini are shaping up to be the dominant players in the generative AI landscape.

  • Elon Musk's AI startup X.ai introduces Grok-1.5, an upgraded generative AI model with enhanced reasoning abilities for coding and math tasks. Grok-1.5 can process longer contexts with up to 128,000 tokens and its performance on benchmarks suggests significant improvements.

  • Microsoft introduces new leaders for its Windows and Surface team and its AI division, aiming to improve collaboration on AI, hardware, and user experiences, while some team members might be leaving.

  • Microsoft and OpenAI are discussing plans for a $100 billion data center project named "Stargate" to house a supercomputer with specialized server chips for AI.

  • The White House told government agencies to pick AI chiefs in 60 days to watch over AI projects and make sure they're used right. These chiefs will check how AI affects safety and rights, and fix any AI uses that don't follow rules by December 1.

  • Hume Group unveils Empathic Voice Interface (EVI), a conversational AI with emotional intelligence. EVI uses vocal tones and multimodal generative AI to understand users' preferences and emotions, aiming to provide natural voice-first experiences.

Educational Pill đź’Š

OpenAI’s Voice Engine Explained

OpenAI's Voice Engine technology harnesses sophisticated deep learning models, a branch of machine learning that mimics the neural networks of the human brain, to learn from a short, 15-second audio clip of someone's voice.

This process involves analyzing the unique characteristics of the voice, such as its pitch, tone, and rhythm, to create a detailed voice profile. The engine then applies this profile to new text inputs, using its understanding of the voice's nuances to generate speech that mimics the original speaker’s style and emotional tone.

Through this advanced understanding and replication of human speech patterns, the Voice Engine can produce speech that sounds remarkably natural and personal, as if the original speaker were saying the words themselves.

That's a wrap on our newsletter! Here’s a quick recap before you go:

Catch you next week! đź‘‹