
# Introduction

> The AgentSea platform lets you build, deploy and share agents with ease.

<img className="block dark:hidden" src="https://mintcdn.com/kentaurosai/lzMy4Jw4Ox50eAVx/logo/logo_black.png?fit=max&auto=format&n=lzMy4Jw4Ox50eAVx&q=85&s=b7b4ec7f954b189f30b14da2b1fbf32e" alt="Hero Light" style={{ width: "300px", height: "auto" }} width="660" height="660" data-path="logo/logo_black.png" />

<img className="hidden dark:block" src="https://mintcdn.com/kentaurosai/lzMy4Jw4Ox50eAVx/logo/logo_white.png?fit=max&auto=format&n=lzMy4Jw4Ox50eAVx&q=85&s=590a04a3ac6054269f75ba263721003d" alt="Hero Dark" style={{ width: "300px", height: "auto" }} width="660" height="660" data-path="logo/logo_white.png" />

## Introduction

The AgentSea platform delivers a collection of libraries and tools for building AI agent apps.
We favor the UNIX philosophy of doing one thing and doing it well, which keeps our tools easy to use, easy to extend, and easy to mix and match. Use the tools one by one or stack them together into a single agent app.

You can also use our tools with other popular frameworks like LlamaIndex and LangChain.

Our tools ▼

* [SurfKit](https://github.com/agentsea/surfkit) an orchestrator for building and launching agents locally, in a Docker container, or in the cloud. Think of it as k8s for agents.
* [DeviceBay](https://github.com/agentsea/devicebay) offers pluggable devices ready to be used by AI agents, complete with a UI experience.
* [ToolFuse](https://github.com/agentsea/toolfuse) a library that wraps scripts, third-party apps, and APIs as `Tool` implementations for agents.
* [AgentD](https://github.com/agentsea/agentd) a powerful daemon that makes a Linux desktop OS accessible to your bot, like a remote desktop app but where the agent takes all the actions.
* [AgentDesk](https://github.com/agentsea/agentdesk) a library for running `AgentD` powered VMs as `Tool` instances on any cloud.
* [Taskara](https://github.com/agentsea/taskara) task management for your agentic systems.
* [ThreadMem](https://github.com/agentsea/threadmem) a library for building multi-role persistent threads that keep track of all the messages and dialogues with your agents.
* [MLLM](https://github.com/agentsea/mllm) a library that simplifies communication with multiple Large Language Models (LLMs) and multimodal LLMs.

Build your own agent or use our alpha agents. Our initial batch of agents focuses on multimodal navigation of GUIs.
Our prototypes combine old-school computer vision techniques with some new applied AI methods of our own.

Our agents ▼

* [SurfPizza](https://github.com/agentsea/surfpizza) an agent that explores by slicing up the screen and returning a composite to the multimodal model so it can pick where to go next.
* [SurfSlicer](https://github.com/agentsea/surfslicer) divides the screen into dots that mark regions. The multimodal model picks the dot closest to what it's looking for, then zooms in and repeats the process, zeroing in on its target.
* **SurfNinja** (coming soon) - A precision-based, second-generation AI agent.
* **SurfMonsta** (coming soon) - Our best-performing agent. It combines a number of our techniques: SurfSlicer regions, the SAM model for bounding and segmenting a GUI, OCR for text positioning, and a GAN for upscaling smaller image slices to give the multimodal model the best resolution.
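
The dot-and-zoom idea described for SurfSlicer can be illustrated with a small, self-contained sketch. This is not SurfSlicer's code or API: `Region`, `split_into_cells`, `zoom_to_target`, and `nearest_dot_picker` are hypothetical names invented for this example, and the 3x3 grid and three rounds are arbitrary assumptions. In a real agent the multimodal model would choose the dot from a screenshot; here a simple nearest-dot function with a known target stands in for that choice.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Region:
    """Axis-aligned screen rectangle in pixels (hypothetical helper type)."""
    left: float
    top: float
    width: float
    height: float

    @property
    def center(self) -> Point:
        return (self.left + self.width / 2, self.top + self.height / 2)


def split_into_cells(region: Region, rows: int, cols: int) -> List[Region]:
    """Divide a region into a rows x cols grid; each cell's center is one dot."""
    cell_w = region.width / cols
    cell_h = region.height / rows
    return [
        Region(region.left + c * cell_w, region.top + r * cell_h, cell_w, cell_h)
        for r in range(rows)
        for c in range(cols)
    ]


def zoom_to_target(
    screen: Region,
    pick_dot: Callable[[List[Point]], int],
    rows: int = 3,
    cols: int = 3,
    rounds: int = 3,
) -> Region:
    """Repeatedly ask the picker for one dot, then zoom into that dot's cell."""
    region = screen
    for _ in range(rounds):
        cells = split_into_cells(region, rows, cols)
        dots = [cell.center for cell in cells]
        region = cells[pick_dot(dots)]
    return region


def nearest_dot_picker(target: Point) -> Callable[[List[Point]], int]:
    """Stand-in for the multimodal model: picks the dot closest to a known target."""
    def pick(dots: List[Point]) -> int:
        return min(
            range(len(dots)),
            key=lambda i: (dots[i][0] - target[0]) ** 2 + (dots[i][1] - target[1]) ** 2,
        )
    return pick


# Example: narrow a 1920x1080 screen down to a small region around the target.
screen = Region(0, 0, 1920, 1080)
target = (1500.0, 210.0)
final = zoom_to_target(screen, nearest_dot_picker(target))
print(final)
```

Each round shrinks the search area by the grid factor in both dimensions, so three rounds of a 3x3 grid leave a region 1/27 of the screen's width and height. That is the appeal of the approach: the model only ever has to make a coarse choice among a handful of dots.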

## Demo

<iframe width="560" height="315" src="https://www.youtube.com/embed/exoOUUwFRB8?si=edfifw3Ruu4bAEwu" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen />

**The AgentSea platform is currently in open beta release.**

The overall framework is solid and getting stronger every day.

**All of our agents are *alpha* releases.**

They sometimes work like magic, and other times they struggle badly and make frustrating mistakes. Treat them as proofs of concept to expand and iterate on, and you'll have a strong base of ideas for building your own agents. We're improving them every day, and we'll continue to release new and improved agents as we develop them.

All the tools are free to use and licensed under the MIT License.

Interested in helping? Great! We love contributions. If you've got ideas, bug reports, or code contributions, please open an issue or a pull request in the relevant repo.

Let's work together and build better agents now.

## Explore

<CardGroup cols={2}>
  <Card title="Quickstart" icon="rocket" href="./quickstart">
    Follow our handy Quickstart to get an agent up and running in under 5 minutes.
  </Card>

  <Card title="Set up our Cloud" icon="sign-posts-wrench" href="./configuration/intro">
    Follow the instructions to set up GCP, AWS, and K8s for use with our
    agents.
  </Card>

  <Card title="Build your Agent" icon="screwdriver-wrench" href="./advanced_tutorial">
    Learn how to build your own agent in our comprehensive tutorial.
  </Card>

  <Card title="Surfkit Documentation" icon="book" href="surfkit/intro">
    Learn more about our projects.
  </Card>
</CardGroup>
