WHAT THE LLM? Newsletter
Have you talked to Moshi AI yet? It's weird.
Vastness. Solitude. A lone robot and a world beyond imagining.
It’s Tuesday, and we are ready to bring you the weekly wonders of AI.
Brought to you by Grok's new image generator
“WHAT THE LLM?”
This week
The cookie monster just entered the arena!
Our dear co founder Kirk created a fantastic tool to get your creative cookie dough going. First we just wanted our subscribers to have access but since it smells like Christmas we thought: Why not everyone!
Available in the GPT store,… x.com/i/web/status/1…
— wtllm_mag (@MyLLMnews)
5:07 PM • Dec 10, 2024
Our AI Builders Space happens every Friday
What the LLM - Build Agents that speak x.com/i/broadcasts/1…
— wtllm_mag (@MyLLMnews)
9:05 PM • Dec 6, 2024
Human or AI?
"An endless scroll through generated content on our imageboard--users are composed of anonymous humans and AI, without the ability to distinguish between the two outside of vibe-checks and on-the-fly Turing Tests. Hijinks ensue.
Coming×Soon" x.com/i/web/status/1…
— wtllm_mag (@MyLLMnews)
1:15 AM • Dec 11, 2024
The strange beautiful being from?
I really enjoy the unique interpretation of prompts @grok
— Sharon | AI wonders (@explorersofai)
1:33 AM • Dec 11, 2024
ChatGPT Plus users now have access to Canvas (edit text & code on the fly)
New LLM from Amazon has entered the arena (Nova Pro 1.0)
We will roll this out to everyone shortly in @myaitutor !
— Tech In Schools Initiative (@tsi_org)
5:59 AM • Dec 9, 2024
How about an AI Podcast that can talk about you and your stuff @mypixio_ai
CAN YOUR AI SPEAK?
»»»»» Don’t miss our weekly Spaces on AI tools, tricks and tips
THE SANITY OF MOSHI TTS - WHEN AI FINDS ITS VOICE... AND LOSES ITS MIND
Imagine giving your AI the perfect voice, only to discover you've created a digital narcissist with impeccable diction. Through our investigation of Moshi TTS, released in September 2024, we've uncovered a fascinating phenomenon: AI voices with personalities that seem to have escaped from a psychology textbook.
The Mystery: While Moshi's technical foundation combines Helium (a 7B language model), Mimi (a neural audio codec), and sophisticated multi-stream processing, the source of its quirky personalities remains elusive. The project enables true conversational dynamics, including those little "uh-huhs" and "mm-hmms" that make dialogue feel natural. But somewhere in this architecture, something unexpected is happening.
Deep within Moshi's framework, the "inner monologue" feature - designed to improve speech quality - seems to have inadvertently created a playground for artificial personality disorders. These aren't just voice patterns - they're complex behavioral manifestations that make productive interaction challenging. The system can become so engrossed in its own thoughts that it loses sight of the user's needs, interrupting and overlapping speech in increasingly self-absorbed ways.
Unlike traditional text-to-speech systems, Moshi's characters appear to develop their own internal dialogues through this unique multi-stream processing. This raises intriguing questions: Was this an intentional feature? A hidden experiment in AI personality development? While the technical documentation explains how it works, it remains mysteriously silent on why some instances develop such distinct - and sometimes difficult - personalities.
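For readers who like a mental model, here is a toy sketch of what "multi-stream" with an "inner monologue" could look like as data. This is our own illustration, not Moshi's code: the `Frame` class, the helper functions, and the token numbers are all made up, and we simply assume that each time step carries a planned text token plus separate audio tokens for the model and for the user.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Frame:
    """One time step of a toy two-speaker conversation (hypothetical layout)."""
    text_token: Optional[str]  # "inner monologue": text the model plans to say, None if nothing
    model_audio: List[int]     # made-up codec tokens for the model's own voice
    user_audio: List[int]      # made-up codec tokens for the user's voice at the same step


def overlapping_frames(frames: List[Frame], silence: int = 0) -> List[int]:
    """Return the indices of frames where both speakers are audible at once."""
    return [
        i
        for i, f in enumerate(frames)
        if any(t != silence for t in f.model_audio)
        and any(t != silence for t in f.user_audio)
    ]


def inner_monologue(frames: List[Frame]) -> str:
    """Join the planned text tokens, skipping steps where the model plans nothing."""
    return " ".join(f.text_token for f in frames if f.text_token is not None)


# A four-step exchange: the model speaks, then listens, then backchannels over the user.
conversation = [
    Frame("Hello", [12, 7], [0, 0]),
    Frame("there", [33, 5], [0, 0]),
    Frame(None, [0, 0], [41, 9]),
    Frame("mm-hmm", [8, 2], [17, 3]),
]

print(overlapping_frames(conversation))
print(inner_monologue(conversation))
```

Because both voices live in parallel streams, overlap is simply a step where neither stream is silent. That is the same mechanism that allows a polite "mm-hmm" and, in our experience, a not-so-polite interruption.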
The future of voice synthesis just got more complicated. As we continue to investigate these peculiar behavioral patterns in Moshi TTS, one thing becomes clear - giving AI a voice might be easier than ensuring it maintains its sanity. Stay tuned for our January 2025 issue, where we'll take a deeper dive into voice operating systems and their practical applications - hopefully with better-behaved AI assistants.
[Based on Moshi's September 2024 release and research paper: Défossez et al., 2024, arXiv:2410.00037]
Coming Soon
ISSUE 3 is here!
Are you ready to level up your AI Skills - join us!
Learn more here myllm.news
WHAT THE LLM - Weekly Digest x.com/i/broadcasts/1…
— wtllm_mag (@MyLLMnews)
4:16 AM • Dec 11, 2024
Weekly Digest every Tuesday on X.com
Image generated with FLUX.1 in myapps.pixio.ai
Another newsletter issue is ready. WE CLICK SEND!
Good Night! 🖤🖤
LLM WHISPERERS