Building a New Agent
Step-by-step guide to creating, configuring, and deploying AI agents using AnotherAI
Our goal with AnotherAI is to turn your AI coding assistant into your AI engineer. Instead of manually testing different models and prompts, AnotherAI gives your AI coding agent the tools to automatically find the optimal configuration for your specific use case. Here is how to get started building a new agent with AnotherAI.
Before you begin, make sure you have the AnotherAI MCP server configured with your AI assistant. Your AI assistant needs this connection to create agents, run experiments, and manage deployments. See the Getting Started guide for setup instructions.
Need an extra hand with building agents? We're happy to help. Reach us at team@workflowai.support or on Slack.
Specifying Agent Behavior and Requirements
The easiest way to create a new agent is to ask your preferred AI assistant to build it for you.
Basic Agent Creation
Start by describing what your agent should do:
Create a new AnotherAI agent that can summarize emails
Adding Performance Requirements
If you have other criteria or constraints for your agent, you can include them in your prompt and your AI assistant will use AnotherAI to help you optimize for them.
For customer-facing agents that need fast responses:
Create a new AnotherAI agent that summarizes emails where at least half of the responses complete in under 1 second
For high-volume agents that need to be cost-effective:
Create a new AnotherAI agent that summarizes emails that costs less than $5 for 1000 requests
Your AI assistant will construct the agent's code and use AnotherAI's access to 100+ different models to find the configuration that best meets your goals.
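If you want to spot-check a latency target like this yourself, a rough sketch along the lines of the one below times a handful of calls and reports the median. It assumes the agent is called through the OpenAI SDK; the base URL, API key environment variable, and model name are placeholders rather than AnotherAI specifics.

```python
# Rough latency spot-check for an email-summarizer agent.
# Assumptions: the agent is called through an OpenAI-compatible API;
# the base URL, env var names, and model name below are placeholders.
import os
import statistics
import time

from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("ANOTHERAI_BASE_URL", "https://example.invalid/v1"),  # placeholder
    api_key=os.environ["ANOTHERAI_API_KEY"],  # placeholder env var name
)

SAMPLE_EMAILS = [
    "Hi team, just a reminder that the launch review moved to Thursday at 3pm.",
]

latencies = []
for email in SAMPLE_EMAILS * 5:  # a few repeated calls for a rough sample
    start = time.perf_counter()
    client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model
        messages=[
            {"role": "system", "content": "Summarize the email in two sentences."},
            {"role": "user", "content": email},
        ],
    )
    latencies.append(time.perf_counter() - start)

print(f"median latency: {statistics.median(latencies):.2f}s")  # target: under 1 second
```

In practice you won't need to write this yourself: your AI assistant gathers the same latency data automatically when it runs experiments for you.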
Adding Metadata
You can also add custom metadata to your agents to help organize and track them. Common use cases include:
- Workflow tracking: include trace_id and workflow_name keys. (Learn more about workflows here)
- User identification: include a user_id or customer_email key.
Create a new AnotherAI agent that summarizes emails and include a customer_id metadata key.
Agent IDs in Metadata
When your AI assistant creates an agent, it automatically assigns an agent_id in the metadata. In code it would look something like "agent_id": "email-summarizer", and in the AnotherAI web app it would look like anotherai/agent/email-summarizer. The agent ID helps AnotherAI organize completions by agent and enables you to reference specific agents when chatting with your AI assistant.
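As a rough sketch of what this looks like in code, the snippet below attaches agent_id and a custom customer_id to a completion. The base URL and API key variable are placeholders, and passing metadata through extra_body is an assumption here; see the OpenAI SDK Integration guide for the exact setup.

```python
# Sketch: attaching metadata to a completion so AnotherAI can group and filter it.
# The base URL is a placeholder, and passing metadata via extra_body is an
# assumption; check the OpenAI SDK Integration guide for the exact setup.
import os

from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("ANOTHERAI_BASE_URL", "https://example.invalid/v1"),  # placeholder
    api_key=os.environ["ANOTHERAI_API_KEY"],  # placeholder env var name
)

completion = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model
    messages=[
        {"role": "system", "content": "Summarize the email in two sentences."},
        {"role": "user", "content": "Hi team, the launch review moved to Thursday at 3pm."},
    ],
    extra_body={
        "metadata": {
            "agent_id": "email-summarizer",  # groups completions under this agent
            "customer_id": "cus_1234",       # hypothetical custom key for your own tracking
        }
    },
)
print(completion.choices[0].message.content)
```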
You might ask your AI assistant a question about a specific agent, like:
How much is anotherai/agent/email-summarizer costing me this month?
(If you're curious about the costs of your agents, see our metrics page!)
If you prefer to build manually or want to understand the configuration details, see our OpenAI SDK Integration guide.
Testing your Agent
As part of creating your agent with AnotherAI, your AI assistant will automatically create an initial experiment to test the agent's performance. Experiments let you systematically compare different configurations of your agent across one or more inputs to find the optimal setup for your use case. You can use experiments to:
- Compare performance across different models (GPT-4, Claude, Gemini, etc.)
- Test multiple prompt variations to find the most effective approach
- Optimize for specific metrics like cost, speed, and accuracy
In the cases above where certain constraints were specified, your AI assistant will automatically create several versions of your agent to assess which one matches your requirements best.
For the prompt requesting a fast agent:

For the prompt requesting a cost-effective agent:

If there are additional criteria you want to test, you can always ask your AI assistant to create additional experiments. The most common parameters to experiment with are prompts and models, but you can also experiment with changes to other parameters like temperature.
Prompts
Comparing different prompts is one of the most effective ways to improve your agent's performance. Small changes in wording, structure, or examples can lead to significant improvements. If you notice an issue with an existing prompt, you can even ask your AI assistant to generate prompt variations to use in the experiment.
Example:
Look at the prompt of anotherai/agent/email-rewriter and create an experiment in AnotherAI
that compares the current prompt with a new prompt that better emphasizes adopting
the tone listed in the input
Your AI assistant will create the experiment and give you an initial analysis of the results, as well as a URL to view the results in the AnotherAI web app.

You can use the provided URL to open the experiment in the AnotherAI web app and perform your own analysis of the results.

Models
Different models excel at different tasks. AnotherAI supports over 100 different models, and experiments can help you choose the right model for your agent, depending on its needs.
Example:
Create an AnotherAI experiment to help me find a faster model for anotherai/agent/email-rewriter,
but one that still maintains the same tone and verbosity considerations as my current model.
If you have a specific model in mind that you want to try - for example, a newly released model - you can ask your AI assistant to help you test that model against your existing agent version. You can always request that your AI assistant use inputs from existing completions to ensure that you're testing with real, production data.
Example:
Can you retry the last 5 completions of anotherai/agent/email-rewriter and compare the outputs with
GPT 5 mini?
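Under the hood, trying a different model is typically just a change to the model parameter on an otherwise identical call. The sketch below reruns one input against two candidate models so you can compare the outputs side by side; the model names and base URL are placeholders, and in practice your AI assistant would set this comparison up as a proper AnotherAI experiment using your real completions.

```python
# Sketch: rerun the same input against two candidate models and compare outputs.
# Model names and the base URL are placeholders; in practice your AI assistant
# would run this comparison as an AnotherAI experiment with real completion data.
import os

from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("ANOTHERAI_BASE_URL", "https://example.invalid/v1"),  # placeholder
    api_key=os.environ["ANOTHERAI_API_KEY"],  # placeholder env var name
)

email = "Hi Sam, following up on the invoice from March. Could you confirm it was received?"

for model in ["gpt-4o-mini", "gpt-5-mini"]:  # placeholder candidate models
    completion = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": "Rewrite this email in a friendly, concise tone."},
            {"role": "user", "content": email},
        ],
    )
    print(f"--- {model} ---")
    print(completion.choices[0].message.content)
```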
Other Parameters
Beyond prompts and models, adjusting other parameters can affect your agent's behavior and output quality. Temperature in particular can have a significant impact on the output.
Example:
Test my email-rewriter agent with temperatures 0.2, 0.5, and 0.8 to find
the right balance between creativity and professionalism
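As a minimal sketch of what that experiment compares, the loop below runs the same email through the same prompt at three temperatures. The base URL and model name are placeholders; the experiment your assistant creates would also track cost and latency for each run.

```python
# Sketch: sweep temperature on one input to see how much the output style shifts.
# The base URL and model name are placeholders, not AnotherAI specifics.
import os

from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("ANOTHERAI_BASE_URL", "https://example.invalid/v1"),  # placeholder
    api_key=os.environ["ANOTHERAI_API_KEY"],  # placeholder env var name
)

email = "Hey, can we push our call to tomorrow afternoon? Something came up."

for temperature in (0.2, 0.5, 0.8):
    completion = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model
        temperature=temperature,
        messages=[
            {"role": "system", "content": "Rewrite this email professionally but keep it warm."},
            {"role": "user", "content": email},
        ],
    )
    print(f"--- temperature={temperature} ---")
    print(completion.choices[0].message.content)
```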
Managing Large Experiments with Claude Code
If you're testing an agent that has a large system prompt and/or very long inputs, you may encounter token limit issues with the get_experiment MCP tool that impact Claude Code's ability to provide accurate insights on your agent.

In this case, you can manually increase Claude Code's output token limit.
To set up permanently for all terminal sessions:
For zsh (default on macOS):
echo 'export MAX_MCP_OUTPUT_TOKENS=150000' >> ~/.zshrc && source ~/.zshrc
For bash:
echo 'export MAX_MCP_OUTPUT_TOKENS=150000' >> ~/.bashrc && source ~/.bashrc
For temporary use in current session only:
export MAX_MCP_OUTPUT_TOKENS=150000
Notes:
- If you forget or don't realize you need to set a higher limit, you can quit your existing session, run the command to increase the limit, and then use claude --resume to continue your previous session with the increased limit applied.
You can learn more about tool output limits for Claude Code in their documentation.
Adding Feedback to your Experiments
When reviewing the results of experiments, you can add feedback (annotations) to help your AI coding agent understand what is working and what is not. Your AI coding agent can then use this feedback to create additional experiments with improved versions of your agent.
You can add annotations to completions from experiments directly in the AnotherAI web app. Annotations can be added both for entire completions and for individual fields within the output (when the output is structured).
To add annotations:
When your coding agent creates an experiment for you, it will automatically send you a URL to the experiment. Use that URL to open the experiment's page.
Or if adding annotations to an experiment later:
- Go to anotherai.dev/experiments
- Select the experiment you want to add annotations to
Locate the "Add Annotation" button under each completion's output. Select the button to open a text box where you can add your feedback about the content of that specific completion.

To annotate individual fields within the output, hover over the field you want to annotate and select the "Add Annotation" button.

You can also add annotations to the model, prompt, output schema (if structured output is enabled for the agent), and other parameters like temperature, top_p, etc.

Add specific feedback about what's working and what isn't. For example:
- "Perfect tone match - captured the enthusiastic style requested. All responses should be this high quality."
- "Too formal - should be more conversational for this email type"
- "The rewrite is too long - original was concise and this adds unnecessary words"
Once you've added annotations, ask your AI assistant to review them and improve your agent:
Review the annotations I added in anotherai/experiment/01997ccb-643a-72e2-8dbd-accfb903f42b
and update the prompt to address the issues I identified.
Your AI assistant will analyze your feedback and create an improved version of your agent based on your specific guidance.
To learn more about how annotations can be used to improve your agent, see our Improving an Agent with Annotations page.
Next Steps