Skip to main content

Capabilities

This page lists the product surfaces that are easy to miss from the short README. It is based on the current server routes, agent tool registry, Flutter sections, Android bridge code, and integration providers in this repository.

Operator UI

The Flutter client exposes the main operator surfaces:

SectionWhat it is for
ChatNormal agent runs with tools, memory, integrations, and messaging
RunsLive and historical run steps, including browser, Android, CLI, messaging, tasks, MCP, and subagent work
LogsService logs and diagnostics from the server you are connected to
TasksSchedule-triggered and integration-triggered automations
SkillsBuilt-in and custom reusable workflows
IntegrationsOAuth account connections for structured app tools
MCPRemote MCP server registration and tool discovery
MemoryLong-term memory, core memory, and session search
DevicesServer-side browser and Android runtime controls
RecordingsRecording sessions, transcripts, segments, and playback
HealthAndroid Health Connect sync status and synced metrics
SettingsAI providers, model routing, runtime settings, messaging, and service controls

Recordings

NeoAgent records audio as server-side sessions with one or more sources. The web client can record browser microphone and screen audio, and the Android app can record phone microphone audio through a foreground service.

Recording sessions support:

  • Chunked uploads with per-source sequence checks.
  • Sources, chunks, transcript segments, session status, and playback URLs.
  • Statuses for recording, processing, completed, failed, and cancelled sessions.
  • Retry transcription and delete transcript segment actions.
  • Full session deletion with storage cleanup.
  • Agent tools for listing, opening, and searching transcripts: recordings_list, recordings_get, and recordings_search.

Transcription uses Deepgram when DEEPGRAM_API_KEY is configured. The default speech model is nova-3, and the default language mode is multi. When auto_recording_insights is enabled in AI settings, NeoAgent can generate structured recording insights such as a summary, action items, and events.

Android Control

NeoAgent can let the AI control an Android emulator or device attached to the NeoAgent server or configured worker. This is the Android capability in the comparison: the agent can observe and operate Android, not only run an Android companion app.

Android control supports:

  • Starting and stopping the managed Android emulator.
  • Listing ADB-connected devices and installed apps.
  • Taking screenshots and UIAutomator XML dumps.
  • Observing visible UI nodes.
  • Opening apps and Android intents.
  • Tapping, long pressing, typing, swiping, and pressing Android navigation keys.
  • Waiting for text, resource IDs, descriptions, or classes to appear.
  • Installing .apk and universal .apks bundles.
  • Running adb shell commands when higher-level tools are not enough.

These actions run where the NeoAgent backend is running. If NeoAgent is deployed on a remote server, the AI controls the Android runtime attached to that server, not the laptop where you are reading the docs.

Android App And Health

The Flutter Android app is still useful as a client. It can sign in to the same self-hosted backend, run chat and operator UI flows, sync Health Connect data, and record audio locally.

Android app capabilities include:

  • NEOAGENT_BACKEND_URL build/run configuration for real devices.
  • Health Connect permission flow and background sync.
  • Microphone recording through an Android foreground service.
  • Boot restore hooks for recording services when Android allows them.

Health Data

Health data comes from the Android app through /api/mobile/health. NeoAgent stores sync runs and normalized metric samples. The built-in metric aliases include steps, heart rate, sleep sessions, exercise sessions, and weight.

The agent tool read_health_data returns summaries and recent samples. It is designed to answer questions such as recent step totals or available health metrics without dumping every raw record.

Integrations And Messaging

NeoAgent has two separate integration layers:

  • Official integrations expose structured tools for Google Workspace, Microsoft 365, Notion, Slack, Figma, and a separate personal WhatsApp connection.
  • Messaging platforms let the agent talk through WhatsApp, Telegram, Discord, Slack, Google Chat, Teams, Matrix, Signal, iMessage/BlueBubbles, IRC, Twitch, LINE, Mattermost, configurable webhook bridges, and Telnyx Voice.

Official integration examples include Gmail thread search and send mail, Google Calendar events, Drive upload/download/export/share links, Docs create/append/replace, Sheets read/update/append/create, Microsoft Outlook/Calendar/OneDrive/Teams tools, Notion search/page/block/database tools, Slack conversation/message tools, Figma file/node/comment/image tools, and a personal WhatsApp integration with isolated chat read/send tools and per-account read-only versus read/write access.

Messaging examples include Telegram and Discord messages, Slack channel replies, Matrix room messages, Google Chat and Teams webhook delivery, Signal bridge delivery, iMessage/BlueBubbles sends, WhatsApp text and media sends, Telnyx inbound voice, Telnyx outbound calls, and scheduled-task call delivery.

Agent Tools

NeoAgent's agent tool surface includes more than basic chat:

AreaExamples
CLIPTY-capable execute_command with stdin, timeout, stdout, stderr, exit code, and duration
BrowserNavigate, click, type, extract, screenshot, and evaluate page JavaScript
Android controlUI observation, input, screenshots, app launch, intent launch, APK install, and shell commands
Web searchBrave Search API through web_search
FilesRead, write, edit, list, and search files
HTTPDirect HTTP requests
MemorySemantic memory, session search, daily logs, API key name reads, and core memory
SkillsCreate, list, update, and delete persistent skills
TasksSchedule-triggered and integration-triggered automations, one-time runs, model overrides, and optional Telnyx call delivery
MCPAdd, list, and remove MCP servers, plus dynamic MCP tool use
SubagentsSpawn, list, wait for, and cancel async subagents inside a run
OutputGenerate markdown tables and Mermaid graphs
ImagesGenerate images with Grok and analyze local image files with a vision-capable model
RecordingsList, inspect, and search recording transcripts
HealthRead synced mobile health metrics

Generated binary or text artifacts can be promoted into user-scoped artifact storage under ~/.neoagent/data/artifacts and served through authenticated /api/artifacts/:id/content URLs.

Runtime Modes

Runtime settings let operators choose where higher-risk work runs:

ProfileRuntime shape
trusted-hostCLI, browser, and Android tools run on the host
secure-vmCLI, browser, and Android tools run through the local VM backend

Production policy can require the secure VM profile and a strong VM guest token.

These controls matter operationally: the browser, Android emulator, local files, and shell commands run wherever the NeoAgent backend, VM, or paired browser extension is running, not necessarily on the computer where you are reading the docs. Logs from a different server or remote browser may not match the logs on the local machine.

For extension-only remote browser control, download /api/browser-extension/download from NeoAgent, unzip it on the remote machine, load the folder in chrome://extensions, and pair after logging in. The extension uses Chrome's debugger permission for full browser control, so Chrome will show its normal debugging warning while attached. The popup can check whether the server has a newer extension bundle, but unpacked Developer Mode installs still need a manual download and reload.