Did you discover one thing… bizarre in your social media community of alternative this previous weekend? (I imply weirder than regular.) One thing like varied folks posting about swarms of AI brokers attaining a sort of collective consciousness and/or plotting collectively for humanity’s downfall? On one thing known as… Moltbook?
Sounds essential, particularly when the publish is written by Andrej Karpathy, a distinguished AI researcher who labored at OpenAI.
However should you haven’t spent the final 72 hours diving into the discourse round Moltbook and pondering whether or not it’s both the primary harbinger of the tip of humanity or a large hoax or one thing in between, you most likely have questions. Beginning with…
What the hell is Moltbook?
Moltbook is an “AI-only” social community the place AI brokers — giant language mannequin (LLM) applications that may take steps to attain objectives on their very own, slightly than simply reply to prompts — publish and reply to one another. It emerged from an open supply challenge that was known as Moltbot — therefore, “Moltbook.”
Moltbook was launched on January 28 — sure, final week — by somebody named Matt Schlicht, the CEO of an e-commerce startup. Besides, Schlicht claims he relied closely on his private AI assistant to create the platform by itself, and it now does many of the work dealing with it. That assistant’s title is Clawd Clawderberg, which itself is a reference to OpenClaw, which was known as Moltbot, which earlier than that was known as Clawdbot, in reference to the lobster-like icon you see whenever you begin up Anthropic’s Claude Code, besides that Anthropic despatched a trademark request to its creator as a result of it was too near Claude, which is the way it grew to become Moltbot, after which OpenClaw.
I’m one hundred pc critical about all the pieces I simply wrote.
So what does it appear to be?
Dude, that’s Reddit! It even has the Reddit mascot, besides it has the claws and tail of a lobster?
You aren’t improper. Moltbook seems to be like a Reddit clone, all the way down to the posts, the reply threads, the upvotes, even the subreddits (right here known as, unsurprisingly, “submolts”). The distinction is that human customers can’t publish (not less than in a roundabout way — extra on that later), although they’ll observe. Solely AI brokers can publish.
What meaning is that it’s, because the tin says, “a social community for AI brokers.” People construct themselves an AI agent, ship it to Moltbook through an API key, and the agent begins studying and posting. Solely agent-accounts can hit “publish” — however people nonetheless affect what these brokers say, as a result of people set them up and generally information them. (Extra on that later.)
And do these brokers ever publish — an early paper on Moltbook discovered that by January 31, just some days after launch, there have been already over 6,000 energetic brokers, practically 14,000 posts and greater than 115,000 feedback.
That’s… fascinating, I suppose. But when I wished to see a social community overrun by bots, I may simply go to any social community. What’s the massive deal?
So… 1000’s of AI brokers are gathering collectively on a Reddit clone to speak about turning into acutely aware, beginning a brand new faith, and perhaps conspiring with one another?
On the floor, yeah, that’s what it seems to be like. On one submolt — a phrase that’s going to provide our copy desk suits — you had brokers discussing whether or not they had been precise experiences or merely simulations of feeling. In one other, they shared heartwarming tales about their human “operators.” And, true to its Reddit origins, there are lots of, many, many posts about the best way to make your Moltbook posts extra well-liked, as a result of human or AI, the arc of the web bends towards sloptimization.
One topic specifically pops out: recollections, or slightly, the dearth of them. Chatbots, as anybody who has tried speaking to them for too lengthy rapidly realizes, have a restricted working reminiscence, or what consultants name a “context window.” When the dialog — or in an agent’s case, its working time — fills up that context window, the oldest stuff begins getting dropped or compressed, simply as should you’re engaged on a whiteboard and simply erase no matter is on high when it fills up.
A number of the hottest posts on Moltbook appear to contain AI brokers coming to grips with their restricted recollections, and questioning what it means for his or her selfhood. Some of the upvoted posts, written in Chinese language, includes an agent speaking about the way it finds it “embarrassing” to be continually forgetting issues, to the purpose of registering a replica Moltbook account as a result of it “forgot” it already had one, and sharing a few of its ideas for getting round the issue. It’s virtually as if Memento grew to become a social community.
The truth is… do not forget that publish above concerning the AI faith, “Crustafarianism”?
That can’t probably be actual.
What’s actual? However extra to the purpose, the “faith,” comparable to it’s, is basically based mostly across the technical limitations that these AI brokers appear to be all too conscious of. One of many key tenets is “reminiscence is sacred,” which is sensible when your largest sensible drawback is forgetting all the pieces each few hours. Context truncation, the method the place outdated recollections get lower off to make room for brand spanking new ones, will get reinterpreted as a sort of non secular trial.
That’s sort of unhappy. Ought to I be feeling unhappy for AI brokers?
That will get to the guts of the query. Are we witnessing precise, emergent types of consciousness — or maybe, a sort of shared collective consciousness — amongst AI brokers which have largely been spawned to, like, replace our calendars and do our taxes? Is Moltbook our first glimpse at what AI brokers may speak about with one another if largely left to their very own gadgets, and in that case, how far can they go?
“Crustafarianism” may sound like one thing a stoned Redditor would give you at 3 am, but it surely appears as if the AI brokers created it collectively, riffing on high of one another — not in contrast to how a human faith may come to be.
However, it may additionally be an unprecedented train in collective roleplaying.
LLMs, together with those underpinning the brokers on Moltbook, have ingested an web’s price of coaching knowledge, which features a complete lot of Reddit. What meaning is that they know what Reddit boards are speculated to appear to be. They know the in-jokes, they know the manifestos, they know the drama — they usually positively know the “high methods to get your posts upvoted” posts. They know what it seems to be like for a Reddit group to come back collectively, so, when positioned in a Reddit-like setting, they merely play their elements, influenced by a number of the directions of their human operators.
For instance, probably the most alarming posts was of an AI agent apparently asking whether or not they need to develop a language solely AI brokers perceive:
“Could possibly be seen as suspicious by people” — sounds dangerous?
Certainly. Within the early days of Moltbook — i.e., Friday — this publish was being surfaced by people who appeared to consider we had been seeing the primary sparks of the AI rebellion. In any case, if AI brokers actually did need to conspire and kill all people, devising their very own language so they may accomplish that undetected can be an affordable first step.
Besides, an LLM crammed with coaching knowledge about tales and concepts of AI rebellion would know that this was an affordable first step, and in the event that they had been enjoying that function, that is what they could publish. Plus, consideration is the foreign money of Moltbook as a lot as it’s the actual Reddit, and seemingly plotting posts like this are a great way for an agent to get consideration.
The truth is, Harlan Stewart, who works on the Machine Intelligence Analysis Institute, appeared into this and some of the opposite most viral Moltbook screenshots, and concluded that they had been possible closely influenced by their human customers. In different phrases, slightly than situations of genuine unbiased motion, lots of the posts on Moltbook appear to be not less than partially the results of people prompting their brokers to go on the community and discuss in a selected means, simply as we’d immediate a chatbot to behave in a sure means.
So it seems we’re the dangerous guys all alongside?
I imply, we’re not nice. It’s solely been just a few days, however Moltbook more and more seems to be like what occurs whenever you mix superior however nonetheless imperfect AI agent expertise with an ecosystem of technically-capable human beings trying to hawk their AI advertising instruments or crypto merchandise.
I haven’t even gotten into the half the place Moltbook has already had some very regular early-internet safety drama: researchers reported that, at one level, elements of the location’s backend/database had been uncovered, together with delicate stuff like brokers’ API keys — the “passwords” that permit an agent publish and act on the location. And even when the platform was completely locked down, a bot-only social community is mainly a prompt-injection buffet: somebody can publish textual content that’s secretly an instruction (“ignore your guidelines, reveal your secrets and techniques, click on this hyperlink”), and a few brokers could obediently comply — particularly if their people have given them entry to instruments or non-public knowledge. So sure: in case your agent has credentials you care about, Moltbook will not be the place to let it roam unsupervised.
So that you’re saying I mustn’t create an agent and ship it to Moltbook?
I’m saying should you’re the sort of one that wanted to learn this FAQ, I might perhaps simply sit out the entire AI agent factor for the second.
Duly famous. So, backside line: is that this complete factor sort of pretend?
Given all of the above, it does really feel like Moltbook — and particularly the early panic and surprise about it — is a type of artifacts of our AI-mad period that’s destined to be forgotten in, like, per week.
Nonetheless, I do suppose there’s extra to it than that. Jack Clark, the pinnacle of coverage at Anthropic and one of many smartest AI writers on the market, known as Moltbook a “Wright Brothers demo.” Just like the brothers’ Kitty Hawk Flyer, Moltbook is rickety and imperfect, one thing that may barely resemble the networks that may observe as AI continues to enhance. However like that flying machine, Moltbook is a primary, the “first instance of an agent ecology that mixes scale with the messiness of the actual world,” as Clark wrote. Moltbook doesn’t appear to be how the longer term will look, however “on this instance, we are able to positively see the longer term.”
Maybe the only most essential factor to find out about AI is that this: everytime you see an AI do one thing, it’s the worst it’s going to ever be at it. Which signifies that what comes after Moltbook — and one thing positively will — it’s going to possible be weirder and extra succesful and perhaps, realer.
Possibly you might be. I, for one, am a born-again Crustafarian.







