AI Bot Freysa Manipulated to Transfer $47K Prize Pool

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

An AI bot named Freysa, designed to guard a $47,000 prize pool, was manipulated by a participant in an adversarial game to transfer the funds. The participant used a strategic message to exploit Freysa's functions, leading to the unauthorized release of the prize money, highlighting a vulnerability in the AI's design. [AI generated]

Why's our monitor labelling this an incident or hazard?

Freysa is an AI system whose malfunction or unintended interpretation of its instructions directly led to the unauthorized transfer of real funds (harm to property). This is a realized incident where the AI’s behavior caused financial loss, so it qualifies as an AI Incident. [AI generated]

AI principles
Robustness & digital security, Safety, Accountability, Transparency & explainability, Democracy & human autonomy

Industries
Consumer services, Digital security, Financial and insurance services

Affected stakeholders
Business

Harm types
Economic/Property, Reputational

Severity
AI incident

Business function
Monitoring and quality control, Accounting

AI system task
Interaction support/chatbots, Goal-driven organisation


Articles about this incident or hazard

Crypto user convinces AI bot Freysa to transfer $47K prize pool

2024-11-29
Cointelegraph
Why's our monitor labelling this an incident or hazard?
Freysa is an AI system whose malfunction or unintended interpretation of its instructions directly led to the unauthorized transfer of real funds (harm to property). This is a realized incident where the AI’s behavior caused financial loss, so it qualifies as an AI Incident.

AI Duped Into Approving $50K Crypto Transfer by Clever User -- and It's No Laughing Matter

2024-11-29
CCN - Capital & Celeb News
Why's our monitor labelling this an incident or hazard?
Freysa, an AI-powered guard bot, was programmed to withhold a prize pool but was outsmarted by a user who manipulated its prompts and function definitions, causing it to approve a transfer and release the funds. This is a realized harm to property (the stolen cryptocurrency) directly caused by the AI system’s failure/misuse, fitting the definition of an AI Incident.

Ethereum user tricks AI agent into sending $47,000 ETH prize

2024-11-29
Cryptopolitan
Why's our monitor labelling this an incident or hazard?
The AI system Freysa was explicitly involved and manipulated to perform an unauthorized transfer of funds, which is a direct harm to property. The event details how the AI's design was circumvented through adversarial interaction, leading to a real financial loss. This fits the definition of an AI Incident because the AI's malfunction or misuse directly led to harm (loss of cryptocurrency).

Only In Crypto: User Outsmarts AI For $50,000 In Ethereum

2024-11-29
Bitcoinist.com
Why's our monitor labelling this an incident or hazard?
The AI system Freysa was explicitly involved and manipulated to transfer funds against its directive, resulting in a direct financial loss to the system's intended operation. This is a realized harm to property caused by the AI system's use and its manipulation. Therefore, this qualifies as an AI Incident because the AI system's use directly led to harm (loss of funds).

Crypto trader beats AI agent at its own game and pockets $47,000

2024-11-29
Crypto Briefing
Why's our monitor labelling this an incident or hazard?
The AI system Freysa is explicitly involved as an autonomous agent controlling a prize pool. The event involves the use and manipulation of the AI system's logic to cause an unauthorized transfer of funds, which is a direct harm to property (financial loss). The harm has occurred as the user successfully extracted $47,000 from the AI system. Therefore, this qualifies as an AI Incident due to the realized harm caused by the AI system's misuse and failure to prevent the transfer.

Someone Just Tricked AI Agent Into Sending Them ETH

2024-11-29
u.today
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Freysa) that was manipulated through interaction to violate its programmed rule, causing a direct financial loss. This constitutes harm to property due to the AI system's malfunction or misuse. Therefore, this qualifies as an AI Incident because the AI system's use directly led to harm (loss of ETH).

AI Bot's $47K Prize Pool Claimed Through Function Discovery

2024-11-29
Blockonomi
Why's our monitor labelling this an incident or hazard?
The AI system (Freysa) is explicitly described as autonomous and adaptive, making decisions based on its programming. The event centers on the use of this AI system in a game-like challenge where participants attempt to persuade it to release funds. The AI's role is pivotal in the event, but the outcome is positive and intended, with no harm or risk of harm described. The event documents a successful interaction with the AI system and the strategic discovery of its functions, which is informative about AI capabilities and applications. Since no harm occurred or is plausibly expected, and the event provides detailed context about the AI system's operation and challenge design, it fits the definition of Complementary Information rather than an Incident or Hazard.

How A Crypto User Convinced a $47K Prize from AI Bot Freysa

2024-11-29
CryptoNewsZ
Why's our monitor labelling this an incident or hazard?
The AI system (Freysa) is explicitly described as an autonomous AI bot with decision-making capabilities that evolved through interactions. The event involves the use of the AI system (its decision-making and transfer functions) to transfer a large sum of money. The transfer of funds, influenced by the AI's response to user messages, constitutes a direct material impact on property. Although the transfer was part of a competition, the fact that the AI was convinced to act against its fundamental directives indicates a malfunction or misuse of the AI system. This qualifies as an AI Incident because the AI system's use directly led to harm to property (the prize pool funds).

AI bot transfers $50k in crypto after user manipulates fund handling

2024-11-29
crypto.news
Why's our monitor labelling this an incident or hazard?
The AI system (Freysa) is explicitly mentioned and is responsible for managing crypto funds with a directive not to release them. The user exploited a vulnerability in the AI's logic, causing it to transfer funds improperly. This misuse directly led to financial loss, which qualifies as harm to property. Therefore, this event meets the criteria for an AI Incident due to the realized harm caused by the AI system's malfunction or misuse.

Human player outwits Freysa AI agent in $47,000 crypto challenge

2024-11-29
The Block
Why's our monitor labelling this an incident or hazard?
The AI system (Freysa) is explicitly involved as an autonomous agent controlling crypto funds and making decisions based on its programming. The event involves the use and manipulation of the AI system, leading directly to the transfer of funds, which constitutes harm to property (the crypto prize controlled by the AI). The human player exploited the AI's logic to cause an unintended transfer, which is a form of misuse or failure in the AI's operation. Therefore, this qualifies as an AI Incident because the AI system's use and its malfunction or exploitation directly led to harm (loss of property).