News listMusk's xAI launches "Enhanced Voice Mode": Create your own personalized Grok voice with just 1 minute of natural speech
動區 BlockTempo2026-05-02 05:09:27

Musk's xAI launches "Enhanced Voice Mode": Create your own personalized Grok voice with just 1 minute of natural speech

ORIGINAL馬斯克 xAI 推出「極速聲音克隆」功能:自然說話 1 分鐘即可打造個人專屬 Grok 聲優
AI Impact AnalysisGrok analyzing...
📄Full Article· Automatically extracted by trafilaturaGemini 翻譯1453 words
Elon Musk's xAI has evolved once again! On April 30, the company officially launched the "Custom Voices" and "Voice Library" features. Users only need to speak into a microphone for less than 1 minute, and the system can rapidly clone a highly realistic, personalized voice within 2 minutes, which can then be directly applied to the Grok AI assistant. To completely prevent Deepfake fraud, xAI strictly prohibits the uploading of pre-recorded audio files, mandating "real-time recording by the user" and dual voiceprint verification. (Previous coverage: Grok quietly launches Imagine Agent Mode: Infinite canvas replaces chat box, generating entire sets of images and videos with a single prompt) (Background: Elon Musk quietly shuts down Starlink customer service centers: Grok Voice takes over calls, 20% of calls closed directly) In the generative AI voice sector, xAI, led by Elon Musk, has officially launched a strong offensive against competitors like OpenAI. On April 30, 2026, xAI released an official announcement declaring a major update to its AI platform — the full rollout of "Custom Voices" and the new "Voice Library" feature, allowing individuals and businesses to seamlessly integrate "their own voices" into various AI application scenarios with an extremely low barrier to entry. According to xAI, creating a personalized AI voice model has become unprecedentedly simple. Users only need to record a natural speech sample of "a few seconds to one minute" in the xAI console, and the entire model creation process is completed in under 2 minutes. Once generated, this exclusive voice can be immediately utilized in Grok's Text-to-Speech (TTS) service and Voice Agent API. xAI officially highlighted five core application scenarios for this technology: - Brand Customer Service Agents: Enterprises can enable AI customer service to use a brand-exclusive, consistent voice to enhance corporate image. - Content Creators and Podcasts: Creators can use their own voices to narrate videos or generate audiobooks at scale, without needing to personally enter a recording studio every time. - Cross-lingual Speeches: Allow CEOs of multinational corporations to deliver key speeches in multiple languages (such as Chinese, English, Japanese, French, etc.) seamlessly using "their own voice." - Gaming and Entertainment: Rapidly voice NPC characters in the metaverse or games. - Accessibility Assistance: Permanently preserve the original voice characteristics for patients with rare diseases like ALS who are about to lose their ability to speak. With the proliferation of voice cloning technology, celebrity voice forgery and telecommunications fraud using Deepfake technology have emerged one after another. To prevent the malicious abuse of this technology, xAI has implemented an extremely strict security protection net. xAI emphasizes that the system "absolutely cannot use existing audio files for voice cloning." Users must perform real-time recordings themselves, and the system will require users to read a randomly generated "Passphrase." Subsequently, the AI will confirm the content via speech-to-text and compare the speaker similarity embeddings to ensure that the person recording the passphrase is the same as the original recording. This dual verification mechanism fundamentally blocks the possibility of hackers "stealing voices" using other people's audio files. In addition to powerful customization features, xAI also launched the "Voice Library," allowing development teams to centrally manage all custom and built-in voices. Currently, the Voice Library includes over 80 high-quality voices and supports up to 28 languages for users to preview freely. What excites developers and enterprises most is that xAI announced that there will be "no additional fees" for using the Custom Voices feature, and it fully supports all advanced features of the original TTS system (such as voice tags, real-time streaming, etc.). Users only need to specify the exclusive voice_id in the API to easily invoke it, which will undoubtedly significantly lower the cost barrier for enterprises to adopt proprietary voice AI.
Data Status✓ Full text extractedRead Original (動區 BlockTempo)
🔍Historical Similar Events· Keyword + Asset Matching5 items
💡 Currently matching via keywords + symbols (MVP) · Will be upgraded to embedding semantic search later
Raw Information
ID:5ba9ba11fc
Source:動區 BlockTempo
Published:2026-05-02 05:09:27
Category:zh_news · Export Category zh
Symbols:Unspecified
Community Votes:+0 /0 · ⭐ 0 Important · 💬 0 Comments
Musk's xAI launches "Enhanced Voice Mode": Create your own personalized Grok voice with just 1 minute of natural speech | Feel.Trading