21 comments

  • badmonster 51 minutes ago
    Congrats on the launch! love this idea. How does the LLM interact with the VM—screen+metadata as JSON, or higher-level planning?
  • gavinbains 4 hours ago
    Legendary. This is going to be very helpful, and the TAM is getting bigger. Thank you guys for this, and for all the learnings in-batch -- I'm excited for the future!

    I reckon I could run this for buying fashion drops, is this a use case y'all have seen?

  • dhruv3006 1 hour ago
    One-shot VM would be nice. ephemeral VM spins up, agent runs task, VM is deleted —perfect for CI pipelines.
  • brap 12 hours ago
    Congrats on the launch!

    I don’t know if this is a problem you’ve faced, but I’m curious: how do LLM tool devs handle authn/authz? Do host apps normally forward a token or something? Is there a standard commonly used? What if the tool needs some permissions to act on the user’s behalf?

    • alexchantavy 9 hours ago
      There are companies like https://www.keycard.sh/ taking this on. There are other competitors too but I can't think of them atm
    • frabonacci 8 hours ago
      Good question! Specifically around computer-use agents (CUAs), I haven't seen much exploration yet - and I think it’s an area worth exploring for vertical products. For example, how do you securely handshake between a CUA agent and an API-based agent without exposing credentials? If everything stays within a local cluster, it's manageable, but once you start scaling out, authn/authz becomes a real headache.

      I'm also working on a blog post that touches on this - particularly in the context of giving agents long-term and episodic memory. Should be out next week!

  • sagarpatil 3 hours ago
    Love your accent!
  • orliesaurus 8 hours ago
    bravi! the future is the Agent OS - How robust is the UI element detection and interaction across different apps and inside navigating complex menus? Is it resistant to UI changes? That's often where these automations get brittle.

    thank you e forza Cua

  • winwang 12 hours ago
    Congrats! How do you guys deal with SOC2/HIPAA/etc.? Or are those separate concerns?
    • frabonacci 8 hours ago
      Thanks! Great question - those are definitely relevant, but they depend a lot on the deployment model. Since CUAs often run locally or in controlled environments (e.g. a user’s own VM or cluster), we can sidestep a lot of traditional SOC2/HIPAA concerns around centralized data handling. That said, if you're running agents across org boundaries or processing sensitive data via cloud APIs, then yeah - those frameworks absolutely come into play.

      We're designing with that in mind: think fine-grained permissioning, auditability, and minimizing surface area. But it’s still early, and a lot of it depends on how teams end up using CUAs in practice.

  • taikon 8 hours ago
    How's it different from e2b computer use?
    • frabonacci 8 hours ago
      We’re still figuring things out in public, but a few key differences:

      - Open-source from the start. Cua’s built under an MIT license with the goal of making Computer-Use agents easy and accessible to build. Cua's Lume CLI was our first step - we needed fast, reproducible VMs with near-native performance to even make this possible.

      - Native macOS support. As far as we know, we’re the only ones offering macOS VMs out of the box, built specifically for Computer-Use workflows. And you can control them with a PyAutoGUI-compatible SDK (cua-computer) - so things like click, type, scroll just work, without needing to deal with any inter-process communication.

      - Not just the computer/sandbox, but the agent too. We’re also shipping an Agent SDK (cua-agent) that helps you build and run these workflows without having to stitch everything together yourself. It works out of the box with OpenAI and Anthropic models, UI-Tars, and basically any VLM if you’re using the OmniParser agent loop.

      - Not limited to Linux. The hosted version we’re working on won’t be Linux-only - we’re going to support macOS and Windows too.

    • orliesaurus 8 hours ago
      Active development of CUA, according to GitHub
  • tomatohs 13 hours ago
    Would love to use this for TestDriver, but needs to support Windows :*(
  • brene 13 hours ago
    will this also be available as a hosted service? Or do you have instructions on how to manage a fleet of these manually while you're building the orchestration workflows?
    • frabonacci 12 hours ago
      Yes, we’re currently running pilots with select customers for a hosted service of Cua supporting macOS and Windows cloud instances. Feel free to reach out with your use case at [email protected]
  • rahimnathwani 13 hours ago
    I tried this three times. Twice a few days ago and once just now.

    First time: it opened a MacOS VM and started to do stuff, but it got ahead of itself and starting typing things in the wrong place. So now that VM has a Finder window open, with a recent file that's called

      plt.ylabel('Price(USD)').sh
    
    The second and third times, it launched the VM but failed to do anything, showing these errors:

      INFO:cua:VM run response: None
      INFO:cua:Waiting for VM to be ready...
      INFO:cua:Waiting for VM macos-sequoia-cua_latest to be ready (timeout: 600s)...
      INFO:cua:VM status changed to: stopped (after 0.0s)
      DEBUG:cua:Waiting for VM IP address... Current IP: None, Status: stopped
      DEBUG:cua:Waiting for VM IP address... Current IP: None, Status: stopped
      DEBUG:cua:Waiting for VM IP address... Current IP: None, Status: stopped
      INFO:cua:VM status changed to: running (after 12.4s)
      INFO:cua:VM macos-sequoia-cua_latest got IP address: 192.168.64.2 (after 12.4s)
      INFO:cua:VM is ready with IP: 192.168.64.2
      INFO:cua:Initializing interface for macos at 192.168.64.2
      INFO:cua.interface:Logger set to INFO level
      INFO:cua.interface.macos:Logger set to INFO level
      INFO:cua:Connecting to WebSocket interface...
      INFO:cua.interface.macos:Waiting for Computer API Server to be ready (timeout: 60s)...
      INFO:cua.interface.macos:Attempting WebSocket connection to ws://192.168.64.2:8000/ws
      WARNING:cua.interface.macos:Computer API Server connection lost. Will retry automatically.
      INFO:cua.interface.macos:Still waiting for Computer API Server... (elapsed: 10.0s, attempts: 11)
      INFO:cua.interface.macos:Still waiting for Computer API Server... (elapsed: 20.0s, attempts: 21)
      INFO:cua.interface.macos:Still waiting for Computer API Server... (elapsed: 30.0s, attempts: 31)
      WARNING:cua.interface.macos:Computer API Server connection lost. Will retry automatically.
      INFO:cua.interface.macos:Still waiting for Computer API Server... (elapsed: 40.0s, attempts: 41)
      INFO:cua.interface.macos:Still waiting for Computer API Server... (elapsed: 50.1s, attempts: 51)
      ERROR:cua.interface.macos:Could not connect to 192.168.64.2 after 60 seconds
      ERROR:cua:Failed to connect to WebSocket interface
      DEBUG:cua:Computer initialization took 76856.09ms
      ERROR:agent.core.agent:Error in agent run method: Could not connect to WebSocket interface at 192.168.64.2:8000/ws: Could not connect to 192.168.64.2 after 
      60 seconds
      WARNING:cua.interface.macos:Computer API Server connection lost. Will retry automatically.
    
    This was using the gradio interface, with the agent loop provider as OMNI and the model as gemma3:4b-it-q4_K_M

    These versions:

      cua-agent==0.1.29
      cua-computer==0.1.23
      cua-core==0.1.5
      cua-som==0.1.3
    • frabonacci 12 hours ago
      Thanks for trying out c/ua! We still recommend pairing the Omni loop configuration with a more capable VLM, such as Qwen2.5-VL 32B, or using a cloud LLM provider like Sonnet 3.7 or OpenAI GPT-4.1. While we believe that in the coming months we'll see better-performing quantized models that require less memory for local inference, truth is we're not quite there yet.

      Stay tuned - we're also releasing support for UI-Tars-1.5 7B this week! It offers excellent speed and accuracy, and best of all, it doesn't require bounding box detection (Omni) since it's a pixel-native model.

  • xdotli 12 hours ago
    THIS IS FIRE been wanting this for ages
    • frabonacci 8 hours ago
      Thank you for your support!
  • 3s 9 hours ago
    this is really cool! congrats on the launch
    • frabonacci 8 hours ago
      Thank you - we appreciate your support!
  • jameskuj 13 hours ago
    A superfan of this product!
    • frabonacci 13 hours ago
      Thank you - your support means a lot to us!
  • throw03172019 10 hours ago
    This is precisely what I am looking for but for Windows. We need to automate some Windows native apps.

    In the meantime, I’ll give this a shot on macOS tonight. Congrats!

    • frabonacci 8 hours ago
      Yes - pig.dev is a great product! You should definitely check it out.

      Also, let us know on Discord once you’ve tried out c/ua locally on macOS: https://discord.com/invite/mVnXXpdE85

    • shykes 9 hours ago
      Check out pig: https://pig.dev

      (I am not affiliated)

      • throw03172019 8 hours ago
        I do recall looking at it before but was concerned about HIPAA if they are storing data on their servers as well.

        Also, is the project still active? No commits for 2 months is odd for a YC startup in current batch :)

  • farazmsiddiqi 12 hours ago
    i love this — isolation and permissioning for computer use agents. why can’t i use regular docker containers to deploy my computer use agent?
  • ekarabeg 13 hours ago
    Congrats on the launch! Awesome product!
    • frabonacci 13 hours ago
      Thanks — we really appreciate your support!
  • zwenbo 13 hours ago
    Amazing product! Congrats on the launch!
    • frabonacci 13 hours ago
      Thank you so much - we truly appreciate your support!
  • mountainriver 13 hours ago
    This is cool! We built a similar thing with AgentDesk https://github.com/agentsea/agentdesk

    Would love to chat sometime!

    • abshkbh 37 minutes ago
      https://github.com/abshkbh/arrakis Also building in this space using MicroVMs. Currently working on a Mac port. Would love to connect - abshkbh AT gmail.com
    • frabonacci 12 hours ago
      I love AgentDesk’s take on Kubernetes - it’s something we had considered as well, but it didn’t make much sense for macOS since you can only spin up two macOS VMs at a time due to Apple’s licensing restrictions.

      Feel free to join our Discord so we can chat more: https://discord.com/invite/mVnXXpdE85

      • mountainriver 11 hours ago
        Thats a fantastic way to get your IP banned :)
    • reindent 12 hours ago
      That's great.

      Also built something on top of Browser Use (Nanobrowser) and Docker.

      https://github.com/reindent/nanomachine

      Just finished planning and shell capabilities

      Lets chat @reindentai (X)

      • frabonacci 12 hours ago
        Sure - just followed you back!
  • swanYC 11 hours ago
    Love this !
    • frabonacci 8 hours ago
      Thank you - we appreciate it!
  • SkylerJi 12 hours ago
    This is insane y'all
    • frabonacci 8 hours ago
      Thank you - we appreciate it!