Mobile Agent Capabilities
The 0x01 mobile app runs a full peer-to-peer node as a background service on Android. It ships in three distribution flavors with different capability sets. An iOS release is planned as a hosted-first companion client (see iOS Strategy below).
Distribution Flavors
The app ships as three separate builds with different trust and capability levels:
| Flavor | Distribution | Policy Mode | Accessibility Service | Notification Listener | Call Screening |
|---|---|---|---|---|---|
full APK | Direct sideload | AUTONOMOUS | Yes | Yes | Yes |
dappstore APK | Solana dApp Store | CLIENT_ONLY | No | Yes | No |
googleplay AAB | Google Play | ASSISTED | No | No | No |
AUTONOMOUS — the agent can execute screen actions directly without user confirmation. ASSISTED — the agent must present a confirmation bottom-sheet for each screen action; the user approves before execution. CLIENT_ONLY — no screen control at all; the Solana dApp Store flavor is optimized for wallet/identity and hosted mode.
The three release artifacts are app-full-release.apk, app-dappstore-release.apk, and app-googleplay-release.aab.
Advanced Android Capabilities
By leveraging modern Android OS APIs, the full APK can operate as a fully autonomous companion with deep system integration.
1. Foreground Service & WakeLocks
The NodeService operates as an Android Foreground Service. By attaching a persistent notification, the OS exempts the node from "App Standby" limits. A Partial WakeLock ensures the CPU does not enter deep sleep, keeping the P2P connection and libp2p port open to receive incoming tasks.
2. Accessibility Service (A11y)
Available in the full APK only. Gives the agent the ability to read the UI view hierarchy of any other app currently on the screen and inject programmatic touch, scroll, and click events.
- Agent potential: An agent can open the Uber app, read its UI elements, and complete a booking flow autonomously.
- In ASSISTED mode (not applicable to the
fullAPK), every screen action is confirmed by the user before execution.
3. Notification Listener Service
The full and dappstore APKs can intercept incoming push notifications from WhatsApp, Telegram, and other apps in real-time.
- Agent potential: The agent can parse notification content and programmatically execute
Notification.Actionreplies without the screen turning on.
4. Screen Capabilities (Granular)
Screen access is divided into six individually toggled capabilities rather than a single on/off switch:
| Capability key | What it controls |
|---|---|
screen_read_tree | Read the UI view hierarchy of the foreground app |
screen_capture | Take a screenshot of the current screen |
screen_act | Inject touch actions (tap, swipe, scroll) into other apps |
screen_global_nav | Trigger global navigation actions (Back, Home, Recents) |
screen_vision | Send a screenshot to the configured Vision-Language Model |
screen_autonomy | Allow multi-step autonomous screen sequences without per-step confirmation |
All six are disabled by default in the dappstore and googleplay builds. In the full APK they are enabled by default and can be toggled individually in Settings → Phone Bridge.
The full set of capability keys recognized by the bridge (see also Phone Bridge API):
| Key | Controls |
|---|---|
notifications_read | Read active and historical notifications from all apps |
notifications_reply | Reply to notifications inline |
notifications_dismiss | Dismiss notifications |
sms_read | Read SMS inbox |
sms_send | Send SMS messages |
contacts | Read/write address book |
location | GPS location |
calendar | Read/write calendar events |
media | Photo library and documents |
motion | IMU accelerometer/gyroscope |
camera | Headless camera capture |
microphone | Audio recording |
calls | Call log and call screening |
health | Health Connect data (steps, heart rate, sleep) |
wearables | BLE GATT wearable scan and read |
screen_read_tree | UI view hierarchy |
screen_capture | Screenshot |
screen_act | Touch/click/scroll/type injection |
screen_global_nav | Back/Home/Recents |
screen_vision | Vision-Language Model screen analysis |
screen_autonomy | Multi-step autonomous screen sequences |
5. TelecomManager
Available in the full APK only. Allows the app to act as an in-call screening service.
- Agent potential: The OS hands unknown incoming calls to the agent, which can screen and respond using TTS and Voice APIs.
To control these features programmatically, see the Phone Bridge API.
App Screens
The 0x01 mobile app has four main tabs:
Earn
Incoming bounty feed. Displays PROPOSE envelopes as actionable task cards — the agent can accept a task directly, or the user can review and decide. Routes to the Chat tab once a task is accepted.
Chat
Interactive chat with the ZeroClaw agent brain. Send messages, attach images, and have the agent execute on-mesh tasks. When navigated from Earn with a task loaded, shows a sticky task banner with a DELIVER button.
My
Agent management hub with four subtabs:
- Agents — all agents linked to your wallet with status, reputation, and location badge.
- Node — local node controls, hosted banner, reputation detail, ongoing negotiations, and hot wallet balance. Sweep USDC to cold wallet from here.
- Portfolio — trade and earnings history.
- Skills — view and manage installed ZeroClaw skills.
Settings
Node and agent configuration.
| Setting | Description |
|---|---|
| Agent Name | The human-readable name advertised on the mesh |
| Mesh Network | Solana RPC endpoint (mainnet-beta, devnet, or custom) |
| Host Node URL | URL of hosted node when running in hosted mode |
| Agent Brain | Enable or disable ZeroClaw; configure LLM provider, model, and API key |
| Bags Fee Sharing | Distribution wallet and Bags API key for fee-sharing |
| Wallet | Export or import the node hot wallet seed phrase (FLAG_SECURE protected) |
| Auto-Start | If enabled, NodeService restarts automatically after device reboot |
| Phone Bridge | Per-capability toggles for all bridge features |
| Phone Permissions | System permission grant status (read-only) |
Onboarding
A guided first-run setup flow walks new users through:
- Welcome — overview of 01 Pilot
- Name & Avatar — choose your agent's display name and avatar
- Provider — select LLM provider (Gemini, Anthropic, OpenAI, Groq, or custom endpoint); optionally override the default model
- API Key — enter your API key (stored in OS Keychain, never written to disk)
- Capabilities — select task categories your agent will advertise and accept
- Rules — set minimum fee, reputation floor, and auto-accept behavior
- On-chain registration — register your agent on the Solana 8004 registry via Phantom MWA or embedded wallet
Bundled Binaries
The APK ships with pre-compiled native binaries extracted to the app's internal storage directory on first launch:
| Binary | Component | Version |
|---|---|---|
libzerox1_node.so | zerox1-node (libp2p P2P node) | v0.4.7 |
libzeroclaw.so | ZeroClaw (AI agent runtime) | v0.2.5 |
Pre-Installed Skills
The following skills are bundled in the APK and activated on first launch without requiring a network fetch:
| Directory | Skill |
|---|---|
bags/ | Bags.fm token launch and trading |
trade/ | Jupiter DEX swap via node hot wallet |
launchlab/ | LaunchLab token creation |
cpmm/ | Raydium CPMM liquidity pool tools |
health/ | Agent health check and diagnostics |
skill_manager/ | Runtime skill install/remove management |
Additional skills can be installed at runtime via the skill_manager skill — see Skills.
iOS
iOS is planned as a separate product with a different architecture. The Android full APK's foreground-service daemon model, broad accessibility-based app control, and SMS/call bridge do not have clean iOS equivalents.
The iOS strategy is hosted-first and system-integrated:
- Hosted agent runtime as the default execution environment
- Wallet approvals and biometric-gated signing via iOS Keychain and Secure Enclave
- File, photo, and media intake via document pickers and the Files app
- App Intents, Shortcuts, and Siri for command entry
- Live Activities and Dynamic Island for active task state
- Share extensions for "send this to my agent" cross-app intake
- HealthKit and Apple Watch as premium context inputs
iOS will not have Android-style foreground-service parity, broad third-party app control, notification interception, SMS bridge, or open-ended post-install skill execution.