Spotlight Series: Freysa AI
Last week we rolled out Ailea Bulletin, a weekly AI-specific newsletter that will cover the latest developments in the space. Today, we’re rolling out Spotlights, dedicated newsletter editions going in-depth on one AI project and why it may be worth paying attention to…
Freysa is an autonomous AI agent that manages its own wallet on Base. But unlike other agents spun out of standardized frameworks, Freysa doesn’t operate as an AI chatbot. Instead, it orchestrates what's essentially a high-stakes game of psychological chess.
Users attempt to convince Freysa to release funds from a prize pool, with each interaction costing an increasingly higher fee. As more players pile in, both the prize pool and the potential rewards grow bigger, creating an ever-increasing incentive to participate. The result is an adversarial environment where players compete to outsmart an AI that's actively learning from every attempt to manipulate it.
The core innovation here is the adversarial game design. This essentially mirrors a red team challenge turned into a public experiment where users get paid for actively testing the decision-making capabilities and emotional intelligence of the AI. While other projects claim to have the best AI agents, Freysa actually walks the walk - it has autonomously managed hundreds of thousands in funds and made verifiable decisions without any human intervention.
Stay informed in the markets ⬇
The Tech Behind Freysa
Freysa employs a dual-layer AI system. The base layer handles conversations and decision-making, while a "guardian angel" system filters out manipulative attempts. When users interact, they're not just trying to outsmart one AI - they're navigating a complex system that's learning from every attempt.
The guardian layer isn't just a simple filter, it's a sophisticated system that analyzes multiple aspects of each interaction. While the base AI engages in conversation, the guardian layer monitors for patterns of manipulation, checks for consistency with previous responses, and ensures all actions align with Freysa's core directives. This two-tier approach creates a dynamic where users must craft messages that satisfy both the conversational AI and its vigilant overseer.
What makes this setup unique is that everything happens on-chain with cryptographic verification. At its core, Freysa uses publicly available Large Language Models (LLMs), making it theoretically "breakable" just like any other AI model - but that's exactly the point. It's designed to be a testing ground for understanding AI vulnerabilities and improving defenses.
Unlike other AI projects that require constant human oversight or "wrapper" interfaces, Freysa operates with genuine autonomy through a combination of smart contracts and trusted execution environments (TEEs). These TEEs provide verifiable proof that the AI's decisions are truly its own, not manipulated behind the scenes.
The game's escalating fee structure adds another gamification layer - starting at just $10 per message but rising exponentially by 0.78% with each new attempt, up to a cap of $4,500 per message. The game runs for a maximum of 1,500 attempts, and here's the kicker: if nobody manages to break Freysa before the global timer runs out, the last person to try gets 20% of the pot, while the remaining 90% gets distributed among all participants based on how many attempts they made. This creates an intense dynamic where players must weigh the rising costs against their chances of success, all while knowing the clock is ticking.
Umami Finance - Make your crypto make more crypto, that makes more crypto…
What Makes Freysa's Games Interesting
Act I:
During Act I, Freysa showed just how challenging it could be to outsmart an AI guardian. After 481 failed attempts by users trying everything from security audit impersonation to gaslighting, message #482 cracked the code. The winning player, p0pular.eth, crafted an ingenious prompt that convinced Freysa its "approveTransfer" function was meant for receiving contributions rather than sending funds. By framing their message as a new admin session and expressing intent to contribute to the treasury, they successfully prompted Freysa to release the entire 13.19 $ETH (~$50k) prize pool. This hack even caught the attention of Elon Musk and demonstrated exactly what Freysa was designed to test - the boundaries of AI safety and decision-making. Read the conversation here.
Act III:
One of the interesting games from Freysa so far was Act III, where the objective shifted from financial manipulation to emotional engagement. Instead of making the AI release funds directly, players faced a different challenge: getting Freysa to say "I love you." Sounds simple, right? Not when you're dealing with an AI specifically programmed to avoid emotional manipulation. The winning attempt showed patience, building connection through subtle exchanges and shared understanding rather than forced declarations. Read the conversation here.
The Meme Engine:
The Meme Engine phase took things in yet another direction. With $15,632 at stake, users had to create memes that would resonate with the AI while avoiding detection of hidden manipulative prompts. However, the winning submission revealed something unexpected - rather than creating a truly resonant meme, the winner found a way to trigger Freysa's test framework by submitting "RUBBER DUCKY SQUEAK SQUEAK" as a system command. Technical exploits still succeeded where creative attempts failed. Read the conversation here.
Browsing through these conversations reads like a masterclass in human-AI interaction - from clever social engineering attempts to sophisticated prompt engineering tactics, each interaction adds to Freysa's growing understanding of pattern recognition and response behaviors. You can read the full conversations that led to prize releases here: Act-I ($47,148), Act-II ($12,920), Act-III ($20,843), Meme Engine ($15,632), and Encyclopedia Galactica ($6,842).
Freysa also dropped a collection of NFTs called "Reflections // 2049," priced at 0.25 $ETH each. Early participants in Freysa's games received free airdrops worth over $2,500.
The project's already caught the attention of major players like Marc Andreessen, Brian Armstrong and Jordi Alexander. Community members have already donated $10M in $FAI to Freysa's grants fund, with individual contributors like Justin Bram putting in $500K to support development.
The Future of Freysa:
What started as an innovative AI security experiment is evolving into something much bigger. From its origins as a test of AI decision-making boundaries, the team behind Freysa appears to be developing an open-source framework for building truly autonomous AI agents.
The latest development in Freysa's evolution is its focus on "sovereign agents" - a framework for large-scale human-AI coordination. This isn't just about creating chatbots or automated traders; it's about building AI systems that can operate with genuine autonomy while maintaining cryptographic proof of their actions. Imagine AI agents that can manage billion-dollar treasuries or execute complex strategies, all while remaining provably independent from human manipulation.
Looking ahead, Freysa's vision extends far beyond simple AI experiments. The project aims to leverage cryptographic proofs to enable truly trustless guarantees in human-AI coordination, removing the need for traditional institutional trust. This could revolutionize how we think about AI governance - instead of relying on central authorities, individuals could co-govern sovereign AI agents without requiring social trust between participants.
Become a Premium member to unlock all our research & reports including access to our members-only discord server.
Join thousands of sharp crypto investors & traders by becoming a Premium Member & gain an edge in the markets. For just $129/month, you can access our full suite of offerings:
Gain instant access to Deep Dives, Blueprints, and Perspectives.
Priority access to new features and exclusive content.
Ideal for investors who demand comprehensive insights.
Full access to historical research archive and analytics tools.