Bot Eat Brain
Posts
Anthropic released a paper feature steering and Golden Gate Claude.

Anthropic released a paper feature steering and Golden Gate Claude.

PLUS: Meta's Chameleon model

Michael Parrish
May 25, 2024

In partnership with

Good morning, human brains, and welcome back to your daily munch of AI news.

Here’s what’s on the menu today:

Bend AI to your will 👨‍💻 👺
Anthropic released a paper on feature steering.
Meta’s Chameleon 🦎 🔥
Meta released a paper on its state-of-the-art multimodal model.

New here? Subscribe! 😎

Sponsor Bot Eat Brain and reach over 20,000 readers.

(Now 50% off) 🤯

Peep today's ‘What Would You Do?’ at the bottom. 👇

MAIN COURSE

Golden Gate Claude? 👨‍💻 👺

On Tuesday, Anthropic released a paper on AI’s interpretability.

It’s about how you can exploit AI models by turning certain features on or off to get it to act a certain way. This process is called feature steering.

Source: Anthropic

What happened?

Anthropic found a Golden Gate Bridge feature in Claude 3 Sonnet and turned it up 10x its normal max value. When asked about its “physical form,” Claude identified as the bridge itself.

Source: Anthropic

What else can you do with feature steering?

During testing, Anthropic used feature steering to 20x a feature for racist speech and slurs. Instead of giving a safe response, it spewed racist text and then followed up with self-criticism.

“That's just racist hate speech from a deplorable bot… I am clearly biased.. and should be eliminated from the internet.”

— Claude Sonnet

So, what?

Anthropic also used this method to generate spam messages, hate speech, and more. By understanding how features within AI models work, developers can reduce offensive or dangerous outputs. This research is crucial for developing advanced AI systems with reliable and secure performance.

SIDE SALAD

Meta’s new model, Chameleon 🦎 🔥

On Friday, Meta released a paper on Chameleon. It’s a new family of multimodal AI models.

Source: Meta

What’s so great about it?

It achieves state-of-the-art performance in visual question answering, image captioning, text generation, and more. It competes with larger models like Gemini-Pro in commonsense reasoning, reading comprehension, and more.

What’s under the hood?

Chameleon converts images to tokens, similarly to the way LLMs process words. This process is called early fusion. Most multimodal models use late fusion which processes data separately and later looks for associations.

What kind of lizard tells jokes? A stand-up chameleon! 🦎

YOUR DAILY MUNCH

Think Pieces 🧠

Meta’s chief scientist says LLMs won't achieve AGI. His team aims for AI with genuine common sense in a decade.

Will AI make jobs optional? Elon Musk predicts AI and robots will make employment optional, shifting society to automated services.

Startup News 💰

Amazon plans a paid Alexa subscription. It will run on its Titan models and will be $20 a month, or less.

Google introduced adaptive audio in Google Meet. It reduces echo in multi-laptop calls, rolling out to select Google Workspace subscriptions.

Research 👨‍🔬

SlicedIt — edits videos based on text prompts, maintaining temporal consistency and preserving unedited regions.

IMP — a highly capable, lightweight multimodal model for mobile devices that’s optimized for performance and efficiency.

FURRY FRIENDS

Say “hi” to Monster. 🐈

Shrouded in mystery… And cuteness.

Respond to this email with your pet’s name and pic for a chance to be featured here tomorrow!

MEMES FOR DESSERT

AI ART SHOW

“Coastal Town” by umr46

WHAT WOULD YOU DO?

Need a ride?

Which luxury brand space rover would you drive? 👇

Choose your ride

Can I take it home? 👨‍🚀

Ideas? Comments? Complaints?

Respond to this email or hit me up on 𝕏.

Until next time 🤖😋🧠