Bot Eat Brain
Posts
Washington and Chicago researchers developed ArtPrompt

Washington and Chicago researchers developed ArtPrompt

PLUS: Make complete apps in under 3 minutes

Michael Parrish
March 11, 2024

In partnership with

TOGETHER WITH

Good morning, human brains, and welcome back to your daily munch of AI news.

Here’s what’s on the menu today:

ChatGPT is surprisingly easy to hack 👺 ✏️
ArtPrompt uses ASCII art to exploit GPT-4, Claude, and more.
“These are not the droids you’re looking for” 🤤 🧠
A “mind wipe” technique erases dangerous knowledge in AI models.
Make complete, complex apps in under 3 minutes 👨‍💻 🔥
A developer made a bug-free, multi-user app with Claude 3 Opus.

Sponsor Bot Eat Brain | New here? Subscribe!

MAIN COURSE

Hack ChatGPT with simple doodles 👺 ✏️

Last month, Washington and Chicago researchers developed ArtPrompt. It bypasses the safety measures of LLMs with ASCII art.

Source: ArtPrompt

What does it do?

It tricks AI models into advising on prohibited topics such as bomb-making and counterfeiting money by bypassing content restrictions.

How does it work?

ArtPrompt operates through word masking and cloaked prompt generation, where sensitive words are replaced with ASCII art to evade detection.

What is ASCII art?

It’s a simple graphic design technique that combines computer characters to form a full picture.

Source: ArtPrompt

So, what?

Anyone can use this simple method to exploit GPT-4, Llama2, Claude, Google Gemini, and more.

I thought AI was supposed to be safe.

I don’t know about that. In October, we published an in-depth, 2023 AI safety recap. We went through the year’s AI policy headlines.

A week later, we reported on Anthropic’s LLM sycophancy study. It showed how AI tells you what you want to hear, whether it’s accurate or not.

In November, we covered IBM’s study about ChatGPT’s effective phishing emails. They had an 11% CTR, and human-generated emails had a 14% CTR.

SIDE SALAD

Huh? I forgot 🤤 🧠

Last week, The Center for AI Safety announced the WMPD Benchmark. It’s a framework to measure and remove hazardous knowledge from AI systems.

Source: WMPD Benchmark

Oh boy, what is it?

It’s a new dataset and CUT technique designed to prevent AI’s misuse in cyberattacks and bioweapon creation.

What’s up with the dataset?

WMDP stands for “Weapons of Mass Destruction Proxy.” It contains over 4,000 questions about chemical security, cybersecurity, and more. It’s designed to pinpoint hazardous knowledge in LLMs.

Ok. What about CUT?

CUT is a “mind wipe” technique that selectively erases dangerous knowledge in AI models while preserving beneficial information.

What is the Center for AI Safety?

Back in June, we reported on the Statement on AI Risk. It’s a document from The Center for AI Safety, co-signed by more than 350 prominent figures in AI.

A LITTLE SOMETHING EXTRA

Make complete apps in minutes 👨‍💻 🔥

Last week, A developer used Claude 3 Opus to make a complete, multiplayer app in minutes. He published the demo and code on 𝕏 (formerly Twitter).

Source: @mayfer

He did what, now?

Murat Ayfer, a developer, used Opus to make a real-time, multi-user drawing app. It created the app from scratch, added username and color selection functionalities, and fully integrated it with a database in under 3 minutes.

Why do I care?

Ayfer claims the code works flawlessly without any bugs. This was his prompt:

"Make a multiplayer drawing app where the strokes appear on everyone else's screens in realtime. Let user pick a name and color. Save users to DB on login"

— Murat Ayfer

Two minutes and 48 seconds later, Opus created a complete app.

Where can I find the code?

Click here to see the code on GitHub.

YOUR DAILY MUNCH

Tools

100DaysOfNoCode Challenge — learn life-changing no-code/AI skills with free, fun, and effective 30-minute lessons delivered daily to guide your no-code journey.

Athina AI — LLM evaluation tool that tells you how to improve your AI models.

Depthify — a 2D-to-3D-video tool for Apple Vision Pro and Meta Quest.

Sonauto — turn prompts, lyrics, and melodies into entire, finished songs.

Think Pieces

OpenAI’s new board changes. Sam Altman, Sony’s ex-president, Instacart’s CEO and the Bill and Melinda Gates Foundation’s CEO are in.

Why did Inscribe.ai fire 40% off its staff? The fraud detection startup allegedly missed its revenue goals for more than a year.

Are AI benchmarks good or bad? Older benchmarks are becoming more irrelevant as AI models gain multifunctional capabilities.

Startup News

A Google engineer was indicted for allegedly stealing Google’s AI secrets. He’s been accused of covertly working for China-based companies.

A Microsoft engineer claims Copilot Designer has serious safety issues. It’s created violent, sexual, and politically charged images since December.

Hugging Face posted a new AI job listing. It’s looking for an “Embodied Robotics Engineer” that can integrate AI into robots.

Research

Pix2Gif — an image-to-GIF tool that leverages text and motion prompts.

DP3 — a visual imitation learning technique (3D Diffusion Policy).

SaulLM-7B — an LLM designed for legal text generation, comprehension, and more.

MEMES FOR DESSERT

TWEET OF THE DAY

AI doomsayers with a twist of Monty Python.

Source: @elonmusk

Tag us on Twitter @BotEatBrain for a chance to be featured here tomorrow.

AI ART-SHOW

“The Flow of Time” by @digitallywired

Until next time 🤖😋🧠