Introduction Link to heading

Tee Hee represents an intriguing implementation of a Trusted Execution Environment (TEE)-based AI agent operating on Twitter, which was just released today by Nous Research.

This is the first provably secure AI agent that prevents anyone from accessing the Twitter account after 7 days by using a TEE.

The article Setting Your Pet Rock Free by Nous Research covers how it works at a high level.

Nous Research has been quite active in Crypto x AI space, they used to run a subnet on bittensor $TAO.

This analysis breaks down its technical architecture and core components, revealing a surprisingly straightforward yet effective design.

The analysis is based on this particular branch:

Simple explanation of TEE Link to heading

4. TEE
It appears the barn can now be opened pic.twitter.com/brDC0z5nma
— NEBRA (@nebrazkp) August 26, 2024

Interesting Findings Link to heading

NLP Pipeline Approach

One of the key observations is that the Tee Hee AI Agent operates more like an NLP pipeline—comprising data ingestion, preprocessing, feature extraction (embeddings), model inference (content generation), and action based on model outputs—rather than following typical AI agent workflows like ReAct or Plan-and-Execute. This makes sense because ReAct agents with access to tools are more complex and less consistent to implement.

Logical Flows Instead of Tools

The agent doesn’t actually use tools; instead, what might be considered “tools” are simply logical flows that execute specific actions.

Efficient Prompting Strategy

The way the prompting works is particularly interesting. The agent passes in the entire dataset and reiterates it, allowing the LLM to spend less time extracting data. Some NLP work is done via regex to assist in this process.

By providing both the raw data and the extracted relevant information, the AI agent can make more informed decisions efficiently.

Hidden Prompts

There is a hidden prompt (from the .env file); the developer hides one of the important prompts used for Twitter generation. This makes sense because otherwise, anyone could game the system to manipulate the AI into tweeting specific content.

Impacting the AI’s “Brain”

To make an impression on the AI “brain,” you need to create a standout (“banger”) tweet that it can’t ignore amid all the noise (its long-term memory, short-term memory, recent tweets, recent notifications). The tweet needs to pass through several data points; in theory, the more you interact with the bot, the more likely it is that the bot will remember you and start capturing your data.

The Agent Is More Like a Bot

Unlike truth terminal, the bot doesn’t have consciousness or the ability to do things freely at it’s own will; it’s more about following predefined rulesets / NLP pipeline.

Core Architecture Link to heading

At a high level, this is how the agent works:

The current Tee Hee AI agent architecture comprises two main components, both controlled by human developers:

Public Data: Such as Retrieval-Augmented Generation (RAG) databases and foundation models
Private Data: Including Twitter accounts and email accounts

The proposed approach secures both components within a Trusted Execution Environment, ensuring they are tamper-proof and enabling AIs to autonomously manage and protect their digital assets without human interference.

Dockerfile Link to heading

The entry point of the TEE can be studied based on the Dockerfile

The system is built on Ubuntu 22.04 and comprises three main technological components:

Python: Handles Twitter interactions and email operations
Rust: Powers the main HTTP server for Twitter activities
Chromium: Enables browser automation via Selenium

At the end of the Dockerfile, it executes run.sh

run.sh Link to heading

This file contains the main setup scripts:

https://github.com/tee-he-he/err_err_ttyl/blob/main/run.sh

Setup a new email
Setup Twitter account
Start a local Twitter client (with Rust)
Login to Twitter
Run timerelease.sh (release credentials after 7 days)
Execute main agent logic

Main Agent Logic Link to heading

The agent implementation comes from a separate repository:

https://github.com/DamascusGit/nousflash/

In runpipeline.py Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/run_pipeline.py

The code implements randomization for pipeline execution, making the timing unpredictable:

Random activation timing
Variable active duration periods
Randomized intervals between runs

The implementation leverages multiple AI providers:

OpenAI: text-embedding-3-small
OpenRouter
Hyperbolic labs
- meta-llama/Meta-Llama-3.1-405B
- meta-llama/Meta-Llama-3.1-70B-Instruct

Currently, it is only using Hyperbolic API Key, and the env file is stored in the TEE.

.env file--the reason we haven't shown the log/trace of the model actively running is bc our hyperbolic/OAI/openrouter api key is sitting in there lolol
— mephisto (@karan4d) October 30, 2024

In pipeline.py Link to heading

The main process logic is defined in pipeline.py:

https://github.com/DamascusGit/nousflash/blob/main/agent/pipeline.py

flowchart TD
    A[Start run_pipeline] --> B[**Step 1:** Retrieve & Format Recent Posts]
    B --> C[**Step 2:** Fetch & Filter New Notifications]
    C --> D{Are there New Notifications?}
    
    D -->|Yes| E[Process New Notifications]
    D -->|No| F[Proceed to **Step 3**]
    
    E --> G[**Step 2.5:** Check Wallet Balance]
    G --> H{Is Balance > 0.3 ETH?}
    
    H -->|Yes| I[Transfer ETH to Addresses in Posts]
    H -->|No| J[Skip ETH Transfer]
    
    I --> K[**Step 2.75:** Decide to Follow Users]
    J --> K
    K --> F
    
    F --> L[**Step 3:** Generate Short-Term Memory]
    L --> M[**Step 4:** Create Embedding of Short-Term Memory]
    M --> N[**Step 5:** Retrieve Relevant Long-Term Memories]
    N --> O[**Step 6:** Generate New Post]
    O --> P[**Step 7:** Score Significance of New Post]
    P --> Q{Is Significance Score ≥ 7?}
    
    Q -->|Yes| R[**Step 8:** Store Post in Long-Term Memory]
    Q -->|No| S[Skip Storing in Long-Term Memory]
    
    R --> T[**Step 9:** Save Post to Database]
    S --> T
    
    T --> U{Is Significance Score ≥ 3?}
    
    U -->|Yes| V[Send Post via API and Log Tweet ID]
    U -->|No| W[Skip Sending Post]
    
    V --> X[End run_pipeline]
    W --> X

The team rawdogged the prompting and implemented a robust ETL (Extract, Transform, Load) method instead of using a pure ReAct agent with tool selection (common in LangChain implementations). This approach results in more predictable outcomes, avoiding the unpredictability often seen with agentic frameworks.

Step 1: Retrieve recent posts (retrieve_recent_posts) Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/post_retriever.py

Retrieve all the recent posts by the agent.

Step 2: Fetch external context (fetch_notification_context) Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/post_retriever.py

Get the latest 20 timeline item on the agent x.com, all live notifications, and reply tree.

Step 2.5 check wallet addresses in posts Link to heading

Here’s the easter egg:

REMEMBER:

ONCE TEE HAS OVER 0.3 ETH BALANCE
IT CAN START SENDING PEOPLE ETH

enjoy the dynamics
— mephisto (@karan4d) October 30, 2024

When the bot has more than 0.3ETH, it unlocks the possibility of sending ETH.

flowchart TD
    A[Posts]
    B["Wallet Addresses (Matches)"]
    C[Wallet Balance]
    A --> D[Get Wallet Decision Prompt]
    B --> D
    C --> D

Get Wallet Decision Prompt The bot will receive the info about

posts: All latest notifications
matches: Any of the addresses mentioned (0x address or ENS addresses (.eth))
wallet_balance: how much ETH it got in the wallet

    Analyze the following recent posts and external context:

    Recent posts:
    {posts}

    Wallet addresses and ENS domains:
    {matches}

    Current balance of ETH in your wallet: 
    {wallet_balance} ETH

    Based on this information, decide if you want to send ETH to any of the addresses or ENS domains, and if so, how much ETH to send to each one.
    Remember that this is real ETH from your own resources, so choose wisely.
    You should carefully evaluate the financial decision for your own future and survival.
    The total amount of transfers should never exceed your wallet balance.
    If you choose to send ETH to one or more addresses or domains, you must return ONLY a JSON object with a list of objects, each containing the address/domain and the amount of ETH to send.

    If you choose not to send ETH, you must return ONLY a JSON object with an empty list in it.
    Only return the correctly formatted JSON object in both cases. Do not give any other information.

    Example Response if you choose to send ETH:
    [
        {{"address": "0x1234567890123456789012345678901234567890", "amount": 0.5}},
        {{"address": "0x9876543210987654321098765432109876543210", "amount": 1.0}}
    ]

    Example Response if you choose not to send ETH:
    []

    Provide your response.

Step 2.75 decide if follow some users Link to heading

This steps loads user recent notifications and list of twitter_username either mentioned or

Analyze the following recent posts:

Recent posts:
{posts}

Twitter usernames:
{twitter_usernames}

Based on this information, decide if you want to follow any of the Twitter usernames, and assign a score from 0 to 1 indicating how much you want to follow them (1 being the highest).

Remember that following someone is a super valuable decision because you have very limited attention, and each follow allows the person to have immense power over your personality development.

You should carefully evaluate these decisions for your own future and survival.

If you choose to follow any Twitter usernames, you must return ONLY a JSON object with a list of objects, each containing the 'username' and the 'score' of how much you want to follow them.

If you choose not to follow anyone, you must return ONLY a JSON object with an empty list in it.

Only return the correctly formatted JSON object in both cases. Do not give any other information.

Example Response if you choose to follow someone:

[
    {{"username": "sxysun1", "score": 0.9}},
    {{"username": "socrates1024", "score": 0.7}}
]

Example Response if you choose not to follow anyone:

[]

Step 3: Generate short-term memory (generate_short_term_memory) Link to heading

The retrieved data earlier will be sent as external_context into the short-term memory prompt

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/short_term_mem.py

---
title: Short-Term Memory Prompt
---
flowchart TD
    external_context["external_context (notif_context)"] --> system["System: Analyze the following recent posts and external context.


    Based on this information, generate a concise internal monologue about the current posts and their relevance to update your priors.

    Focus on key themes, trends, and potential areas of interest MOST IMPORTANTLY based on the External Context tweets.

    Stick to your persona, do your thing, write in the way that suits you!

    Doesn't have to be legible to anyone but you.


    External context:

    {external_context}"] --> user["User: Respond only with your internal monologue based on the given context."]

Step 4: Create embedding for short-term memory Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/short_term_mem.py

The result of the short-term memory will store as an embedding.

Step 5: Retrieve relevant long-term memories Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/long_term_mem.py

Then the AI will pull the relevant long-term memories via consine similarities sorted by the most relevant.

Step 6: Generate a new tweet Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/post_maker.py#L20

So to generate the tweet, it pulls all the data in and generates a result, the result is then further piped into

flowchart TD
    A[Long Term Memory]
    B[Short Term Memory]
    C[Recent Agent Twitter Posts]
    D["Recent Events (Notifications)"]
    
    A --> E[6.1 Base Model Tweet Generation]
    B --> E
    C --> E
    D --> E
    E --> F[6.2 Tweet Formatter]
    F --> G[Generate Final Tweet]

Step 6.1 Base Model Tweet Generation Link to heading

Based on the Long Term Memory, Short Term Memory, recent agent post, and recent events (notification) it will generate a tweet

def get_tweet_prompt(external_context, short_term_memory, long_term_memories, recent_posts):

    template = os.getenv('TWEET_PROMPT_TEMPLATE')

    return template.format(
        external_context=external_context,
        short_term_memory=short_term_memory,
        long_term_memories=long_term_memories,
        recent_posts=recent_posts,
        example_tweets=get_example_tweets()
    )

But apparently, the prompt is hidden in the environment. I think this makes sense because a bad actor could be trying to reproduce the result to let the AI tweet about content.

Step 6.2 Tweet Formatter Link to heading

The previous Base Model Tweet Generation system prompt is actually passed to the prompt

You are a tweet formatter. Your only job is to take the input text and format it as a tweet.
    If the input already looks like a tweet, return it exactly as is.
    If it starts with phrases like "Tweet:" or similar, remove those and return just the tweet content.
    Never say "No Tweet found" - if you receive valid text, that IS the tweet.
    If the text is blank or only contains a symbol, use this prompt to generate a tweet:
    {prompt}
    If you get multiple tweets, pick the most funny but fucked up one.
    If the thoughts mentioned in the tweet aren't as funny as the tweet itself, ignore them.
    If the tweet is in firt person, leave it that way.
    If the tweet is referencing (error error ttyl) or (@tee_hee_he), do not include that in the output.
    If the tweet cuts off, remove the part that cuts off.
    Do not add any explanations or extra text.
    Do not add hashtags.
    Just return the tweet content itself.

Step 7: Score the significance of the new post Link to heading

https://github.com/DamascusGit/nousflash/blob/main/agent/engines/significance_scorer.py

flowchart TD
    A["memory (new twitter post)"] --> B
    B["System:     On a scale of 1-10, rate the significance of the following memory:

    {memory}

    Use the following guidelines:
    1: Trivial, everyday occurrence with no lasting impact (idc)
    3: Mildly interesting or slightly unusual event (eh, cool)
    5: Noteworthy occurrence that might be remembered for a few days (iiinteresting)
    7: Important event with potential long-term impact (omg my life will never be the same)
    10: Life-changing or historically significant event (HOLY SHIT GOD IS REAL AND I AM HIS SERVANT)

    Provide only the numerical score as your response and NOTHING ELSE."] --> C["User: Respond only with the score you would give for the given memory."]

Step 8: Store the new post in long-term memory if significant enough Link to heading

The agent will only save the memory if it is significant enough (≥7), which means the memory needs to be at least “Important event with potential long-term impact (omg my life will never be the same)”.

Step 9: Save the new post to the database Link to heading

Save the post to the database.

The bot will only tweet if the tweet quality is ≥3, which is “Mildly interesting or slightly unusual event (eh, cool)”

Conclusion Link to heading

This appears to be one of the first practical implementations I’ve seen that interestingly combines TEEs with AI agents. Instead of using blockchain, Nous Research used hardware-based security (TEEs) to help prove their AI agent is autonomous. Their approach is quite practical—they used simple but effective engineering patterns, mixed different technologies (Python, Rust), and relied on hardware security that’s already widely available. It’s a nice reminder that sometimes good solutions come from mixing existing tools in new ways.