- Bot Eat Brain
- Posts
- HuggingGPT: one chatbot to rule them all
HuggingGPT: one chatbot to rule them all
PLUS: No more blurry Zoom calls
Good morning human brains, welcome back to your daily munch of AI news.
Here’s what’s up:
Zoom! Enhance! just became a real technology 🕵️
One chatbot to rule them all 💍
An AI tool that automatically animates CGI characters 🎥
A cartoonist from 1923 predicted the singularity in 2023 ✏️
Follow @BotEatBrain on Twitter for breaking news.
P.S. We are hiring researchers and writers, click here for more info.
APPETIZER
ReBotNet: No more blurry Zoom Calls.
ReBotNet can make live streams and video calls look better by using AI to sharpen your image in real-time.
It’s basically the “Zoom! Enhance!” meme from CSI brought to life.
ReBotNet works almost as well as the best available image-restoration models while being much much faster.
The RVRT model is slightly better, but not as fast as ReBotNet.
Don’t forget: AI can’t find information that isn’t there. It CAN make up information if you let it and that can be dangerous. This tech is awesome for making your video calls sharper, but it will never reveal a killer’s face hidden in the reflection of a screw.
CSI: Zoom! Enhance!
BUZZWORD OF THE DAY
SOTA
It stands for State of the Art and is often used in AI research to indicate a new best-in-class technology based on some standardized performance comparison.
FROM OUR PARTNERS
Say goodbye to endless support tickets
SelfService Chat enables your users to self-serve, reducing support tickets. Spend more time working on your business than in it.
MAIN COURSE
HuggingGPT: one chatbot to rule them all 💍
HuggingGPT is a new system that uses big language models, like ChatGPT, to control other AI models and solve complicated tasks.
It connects with a lot of AI models from Hugging Face (an online community for researchers to build and share AI models) and uses language as a way to work with them.
Language models can connect other AI models together to solve complicated tasks.
By doing this, HuggingGPT can help with many different tasks and do well in areas like language, seeing, and talking. Basically, it wants to be JARVIS from Iron Man (and that’s what they’ve named their GitHub repo too).
In this concept, an LLM (Large Language Model) acts as a controller, managing and organizing the cooperation of expert models. The LLM first plans a list of tasks based on the user request and then assigns expert models to each task. After the experts execute the tasks, the LLM collects the results and responds to the user.
Don’t confuse this JARVIS with the Jarvis from Iron Man, or with this separate (also very cool) project released yesterday with the same name:
LLMs like ChatGPT, LLaMA, and yes, even Google Bard, are quickly turning into the “glue” that connects all AI models together.
Predictions:
1/ We will soon see fully autonomous AI agents operating on the Internet, running businesses, and pursuing goals set by their human masters with little-to-no oversight.
2/ An AI will commit fraud or a financial crime within the next 6 months and its human creator will be liable (if they can be found).
3/ A platform will be launched within the next 3 months for managing autonomous AI assistants that can handle tasks like coordinating appointments or planning a trip.
MEMES FOR DESSERT
YOUR DAILY MUNCH
Think Pieces
Stanford released their 2023 report on the state of AI.
Could you train a ChatGPT-beating model for $85,000 and run it in a browser?
ChatGPT - The Revolutionary Bullshit Parrot
Startup News
Fourthline raised $54 million to build AI-powered identity verification tools for the finance industry.
Quantexa raised $129M at a $1.8B valuation to help navigate online fraud
Suprema AI launched a tool for better face detection.
Twitter open-sourced the recommendation algorithm for their home feed.
Police relied on data from Clearview AI and put the wrong person in jail.
Research
BloombergGPT: A Large Language Model for Finance
Achilles Heel for AGI: is it possible to implant a weakness in a superintelligent AI?
Objaverse: a Massive Dataset with 800K+ Annotated 3D Objects
Just some of the 3D objects available in the Objaverse dataset.
LLaMA-Adapter: a small modification to Facebook’s LLaMA AI that works almost as well as larger models like Alpaca-7B and GPT-3, while also having multi-modal capacity (it can process images).
Tools
avtrs.ai: generate AI avatars with an iPhone app
senec.ai: create a landing page in seconds, just type your startup idea and click generate.
stockimg.ai: easily generate logos, book covers, posters, and more.
askrails.com: an AI Ruby on Rails Expert at your fingertips.
wonderdynamics.com: an AI tool that automatically animates, lights, and composes CG characters into a live-action scene [closed beta].
Wonder Dynamics Demo
AI EXPLAINED
Why does AI struggle so hard with generating human hands?
AI ART-SHOW
“The kid stood frozen in the middle of the street, his heart racing as he gazed in wonder at the distant flashes of lightning and intense bursts of light, wondering what incredible events were unfolding just beyond his view.” - @Paperwrks3d
Until next time 🤖😋🧠
What'd you think of today's newsletter? |