Phoenix.new – Remote AI Runtime for Phoenix

(fly.io)

332 points | by wut42 7 hours ago

27 comments

ativzzz 4 hours ago
This is very cool. I think the primary innovation here is twofold:
1. Remote agent - it's a containerized environment where the agent can run loose and do whatever - it doesn't need approval for user tasks because it's in an isolated environment (though it could still accidentally do destructive actions like edit git history). I think this alone is a separate service that needs to be productionized. When I run claude code in my terminal, automatically spin up the agent in an isolated environment (locally or remotely) and have it go wild. Easy to run things in parallel
2. Deep integration with fly. Everyone will be trying to embed AI deep into their product. Instead of having to talk to chatgpt and copy paste output, I should be able to directly interact with whatever product I'm using and interact with my data in the product using tools. In this case, it's deploying my web app
[-]
- indigodaddy 3 hours ago
  look into Kasm workspaces.. great way to spin up remote docker-based linux desktops, and works great as an AI dev environment that you can use wherever you happen to be. There is homedir persistence, and package persistence can be achieved via some extra configuration that allows for Brew homedir-based package persistence.
  https://hub.docker.com/r/linuxserver/kasm
  https://www.reddit.com/r/kasmweb/comments/1l7k2o8/workaround...
chrismccord 6 hours ago
Phoenix creator here. I'm happy to answer any questions about this! Also worth noting that phoenix.new is a global Elixir cluster that spans the planet. If you sign up in Australia, you get an IDE and agent placed in Sydney.
[-]
- tiffanyh 6 hours ago
  Amazing work.
  Just a clarifying question since I'm confused by the branding use of "Phoenix.new" (since I associate "Phoenix" as a web framework for Elixir apps but this seems to be a lot more than that).
  - Is "Phoenix.new" an IDE?
  - Is "Phoenix.new" ... AI to help you create an app using the Phoenix web framework for Elixir?
  - Does "Phoenix.new" require the app to be hosted/deployed on Fly.io? If that's the case, maybe a naming like "phoenix.flyio.new" would be better and extensible for any type of service Fly.io helps in deployment - Phoenix/Elixir being one)
  - Is it all 3 above?
  And how does this compare to Tidewave.ai (created as presumably you know, by Elixir creator)
  Apologies if I'm possibility conflating topics here.
  [-]
  - chrismccord 5 hours ago
    Yes all 3. It has been weird trying to position/brand this as we started out just going for full-stack Elixir/Phoenix and it became very clear this is already much bigger than a single stack. That said, we wanted to nail a single stack super well to start and the agent is tailored for vibe'd apps atm. I want to introduce a pair mode next for more leveled assistance without having to nag it.
    You could absolutely treat phoenix.new as your full dev IDE environment, but I think about it less an IDE, and more a remote runtime where agents get work done that you pop into as needed. Or another way to think about it, the agent doesn't care or need the vscode IDE or xterm. They are purely conveniences for us meaty humans.
    For me, something like this is the future of programming. Agents fiddling away and we pop in to see what's going on or work on things they aren't well suited for.
    Tidewave is focused on improving your local dev experience while we sit on the infra/remote agent/codex/devin/jules side of the fence. Tidewave also has a MCP server which Phoenix.new could integrate with that runs inside your app itself.
- joevandyk 4 hours ago
  The Phoenix.new environment includes a headless Chrome browser that our agent knows how to drive. Prompt it to add a front-end feature to your application, and it won’t just sketch the code out and make sure it compiles and lints. It’ll pull the app up itself and poke at the UI, simultaneously looking at the page content, JavaScript state, and server-side logs.
  Is it possible to get that headless Chrome browser + agent working locally? With something like Cursor?
  [-]
  - tomashubelbauer 2 hours ago
    Playwright has an MCP server which I believe should be able to give you this.
- Snakes3727 5 hours ago
  Hi just to confirm as I cannot find anything related to security or your use of using submitted code for training purposes. Where is your security policies with regards to that.
  [-]
  - mrkurt 4 hours ago
    We don't do any model training, and only use existing open source or hosted models. Code gets sent to those providers in context windows. They all promise not to train on it, so far.
    [-]
    - tptacek 3 hours ago
      Did I not say it good enough, Kurt?
  - tptacek 4 hours ago
    Ask some security questions, I'll get you security answers. We're not a model company; we don't "train" anything.
- miki123211 5 hours ago
  1. What's your approach to accessibility? Do you test accessibility of the phoenix.new UI? Considering that many people effectively use Phoenix to write front-ends, have you conducted any evals on how accessible those frontends come out?
  2. How do you handle 3rd party libraries? Can the agent access library docs somehow? Considering that Elixir is less popular than more mainstream languages, and hence has less training data available, this seems like an important problem to solve.
  [-]
  - chickensong 2 hours ago
    It seems like they're giving you lower level building building blocks here. It's up to the developer to address these things. Instruct the agent to build/test for accessibility, feed it docs via MCP or by other means.
- mrdoops 5 hours ago
  Any takeaways on using Fly APIs for provisioning isolated environments? I'm looking into doing something similar to Phoenix.new but for a low-code server-less workflow system.
  [-]
  - chrismccord 5 hours ago
    1 week of work to go from local-only to fly provisioned IDE machines with all the proxying. fly-replay is the unsung hero in this case, that's how we can route the *.phx.run urls to your running dev servers, how we proxy `git push` to phoenix.new to your IDE's git server, and how we frame your app preview within the IDE in a way that works with Safari (cross origin websocket iframes are a no go). We're also doing a bunch of other neat tricks involving object storage, which we'll write about at some point. Feel free to reach out in slack/email if you want to chat more.
    [-]
    - mrdoops 5 hours ago
      Thanks, I might hit you up when I'm in the weeds of that feature.
- jonator 2 hours ago
  I'm assuming you're using FLAME?
  How do you protect the host Elixir app from the agent shell, runtime, etc
  [-]
  - chrismccord 2 hours ago
    Not using FLAME in this case. The agent runs entirely separately from your apps/IDE/compute. It communicates with and drives your runtime over phoenix channels
    [-]
    - jonator 1 hour ago
      Oh interesting. So how do messages come from the container? Is there a host elixir app that is running the agent env? How does that work?
- finder83 5 hours ago
  This looks amazing! I keep loving Phoenix more the more I use it.
  I was curious what the pricing for this is? Is it normal fly pricing for an instance, and is there any AI cost or environment cost?
  And can it do multiple projects on different domains?
  [-]
  - manmal 5 hours ago
    It’s $20 per month if you click through, and I haven’t tried it but almost certainly the normal hosting costs will be added on top.
    [-]
    - finder83 1 hour ago
      Thanks, apparently didn't click through enough
- causal 6 hours ago
  Do you have a package for calling LLM services we can use? This service is neat, but I don't need another LLM IDE built in Elixir but I COULD really use a way to call LLMs from Elixir.
  [-]
  - chrismccord 5 hours ago
    Req.post to /chat/completions, streaming the tokens through a parser and doing regular elixir messages. It's really not more complicated than that :)
    [-]
    - throwawaymaths 4 hours ago
      even less complicated, just set stream: false in your json :)
- arrowsmith 6 hours ago
  Thanks for everything you do Chris! Keep crushing it.
- cosmic_cheese 5 hours ago
  How tightly coupled to Fly.io are generated apps?
  [-]
  - chrismccord 5 hours ago
    Everything starts as a stock phx.new app which use sqlite by default. Nothing is specific to fly. You should be able to copy the git clone url, paste, cd && mix deps.get && mix phx.server locally and the app will just work.
- b0a04gl 5 hours ago
  how are they isolating ai agent state from app-level processes without breaking BEAM's supervision guarantees?
  [-]
  - chrismccord 5 hours ago
    They run on separate machines and your agent just controls the remote runtime when it needs to interact with the system/write/read/etc
- b0a04gl 5 hours ago
  this spins up a live elixir cluster per user session. how they handling hot code reloads across distributed nodes?
- b0a04gl 5 hours ago
  the ai agent runs inside the same remote runtime as the app. does it share the BEAM vm or run as a port process?
  [-]
  - chrismccord 5 hours ago
    The agent runs outside your IDE instance and controls/communicates with it over Phoenix channels
- beepbooptheory 2 hours ago
  What is the benefit of this vs. just running your agent of choice in any ole container?
  [-]
  - tptacek 2 hours ago
    The whole post is about that. Not everything is for everybody, so if it doesn't resonate for you, that's totally OK.
    [-]
    - beepbooptheory 1 hour ago
      Oh geez so sorry for the dumb question! I read a lot about the benefits of containerization in general for agents, but thought it might be enlightening/instructive to know what this specific project adds to that (other than the special Elixir-tuned prompting).
      But either way I hear you, thanks so much for taking the time to set me straight. It seems like either way you have done some visionary things here and you should be content with your good work! This stuff does not work for me for just circumstantial reasons (too poor), but still always very curious about the stuff coming out!
      Again, so sorry. Congrats on the release and hope your day is good.
      [-]
      - tptacek 1 hour ago
        You're fine! Just encouraging people to read Chris's post. :)
        [-]
        beepbooptheory 1 hour ago
        Gotcha! I'll keep reading it I guess until I see what I am missing! Good job again!
        [-]
        tptacek 1 hour ago
        I did none of the work! I'm just like Flavor Flav or Bez in this situation. I will relay your congrats to Chris and the team, though. ;)
- troupo 6 hours ago
  I know it's early days, but here's a must-have wish list for me:
  - ability to run locally somehow. I have my own IDE, tools etc. Browser IDEs are definitely not something I use willingly.
  - ability to get all code, and deploy it myself, anywhere
  ---
  Edit: forgot to add. I like that every video in Elixir/Phoenix space is the spiritual successor to "15-minute rails blog" from 20 year ago. No marketing bullshit, just people actually using the stuff they build.
  [-]
  - chrismccord 6 hours ago
    You can push and pull code to and from local desktop already: hamburger menu => copy git clone/copy git push.
    You could also have it use GitHub and do PRs for a codex/devin style workflows. Running phoenix.new itself locally isn't something we're planning, but opening the runtime for SSH access is high on our list. Then you could do remote ssh access with local vscode or whatever.
    [-]
    - prophesi 6 hours ago
      > Running phoenix.new itself locally isn't something we're planning
      So no plans to open the source code?
      [-]
      - chrismccord 5 hours ago
        confirm
  - chrismccord 6 hours ago
    "15-minute rails blog" changed the game so I definitely resonate with this. My videos are pretty raw, so happy to hear it works for some folks.
  - fridder 6 hours ago
    run locally or in your private cloud would be amazing. The latter bit would be a great paid option for large enterprises
losvedir 1 hour ago
This is very neat, and right up my alley as both someone really into Elixir and who thinks agentic AI is the future.
I have a question about how you manage context, and what model you use. Gemini seems the best at working with give context windows right now, but even that has its limitations. Thinking about working with Claude Code, a fair bit of my strategizing is in breaking down work and managing project state to keep context size manageable.
I'm watching the linked video and it's amazing seeing it in action, but I'm imagining continuing to work on a project and wondering if it will start losing its way so to speak. Can you have it summarize stuff, and can you start a session clean with those summaries, and have it "forget" files it won't need to use for this next feature, etc?
arrowsmith 7 hours ago
Ah man I'm really happy to see this and excited to try it out.
As an Elixir enthusiast I've been worried that Elixir would fall behind because the LLMs don't write it as well as they write bigger languages like Python/JS. So I'm really glad to see such active effort to rectify this problem.
We're in safe hands.
[-]
- acedTrex 7 hours ago
  LLMs not writing it well might be the biggest current selling point of elixir lol.
  [-]
  - matt_s 2 hours ago
    Selling point from a developer and career perspective for sure. Its also fun to program in and at least for me made me think about solutions differently.
    Its a negative point for engineering leaders that are the decision makers on tech stacks as it relates to staffing needs. LLMs not writing it well, developers that know it typically needing higher compensation, a DIY approach to libraries when there aren't any or they were abandoned and haven't kept pace with deprecations/changes, etc.
    In the problem space of needing a web framework to build a SaaS, to an engineering leader there are a lot of other better choices on tech stack that work organizationally better (i.e. not comparing tech itself or benchmarks, comparing staffing, ecosystem, etc.) to solve web SaaS business problems.
    I don't know where I stand personally since I'm not at the decision maker level, just thought I'd point out the non-programmer thought process I've heard.
  - zupa-hu 6 hours ago
    Oh lol I love that angle!
- victorbjorklund 2 hours ago
  I found them to be pretty solid for writing Elixir (not perfect but neither is it with JS) the last couple of months.
- bad_haircut72 6 hours ago
  This is such a weird meme, Claude crushes elixir especially a fullstack app in liveview
  [-]
  - heeton 6 hours ago
    Yea CC is great with phoenix / liveview. It’s been doing things that teach me new tricks about elixir I didn’t know yet.
- bluehatbrit 5 hours ago
  This last few weeks I've been going hard on LLMs to put together a new prototype project. I've exclusively been using Claude Sonnet 3.7 within Zed (via github copilot) and it' fantastic.
  From time to time it tries to do something a little old-school, but nothing significant really. It's very capable at spitting out entire new features, even in liveview.
  Over all the experience has very productive, and at least on-par with my recent work on similar sized python and nextjs applications.
  I think because I'm using mostly common and well understood packages it has a big leg up. I also made sure to initialise the phoenix project myself to start with so it didn't try to go off on some weird direction.
  [-]
  - indigodaddy 3 hours ago
    Assuming you are on the $20 Zed plan? Has the 500 prompts/mo been sufficient for you? I'm debating between the Zed and Claude $20 plans-- no doubt I'd get better value from Zed's?
    [-]
    - bluehatbrit 2 hours ago
      I'm on the free plan and have been using it via GitHub copilot instead, as the current project is a work one and they pay for that.
      Before this I did a small project and I hit the 50 free tier limit through Zed by the time I was about 90% done. It was a small file drop app where internal users could create upload links, share them with people who could use them to upload a file. The internal user could then download that file. So it was very basic, but it churned out a reasonable UI and all the S3 compatible integration, etc.
      I had to intervene a bit and obviously was reviewing everything and tweaking where needed. But I was surprised at how far I got on the 50 free prompts.
      It's hard to know what you really get for that prompt limit though as I probably had a much higher number of actual prompts than they were registering. It's obviously using some token calculation under the hood and it's not clear what that is. All in all I probably had about 60-70 actual prompts I ran through it.
      My gut says 500/mo would feel limited if I was going full "vibe" and having the LLM do basically everything for me every day. That said, this is the first LLM product I'm considering personally paying for. The integration with Zed is what wins for me over Claude, where you'd have to pay for API credits or use Claude Code. The way they highlight code changes and stuff really is nice.
      Bit of a brain dump, sorry about that!
      [-]
      - indigodaddy 2 hours ago
        Thanks, that gives me some new things to think about
- nxobject 4 hours ago
  Yeah – as someone who works in Common Lisp, I wish there was way to do complementary training for LLMs with existing corpora of codebases. Being able to read access documentation doesn't do much, unfortunately, to help with more general issues with correctness of output.
- throwawaymaths 7 hours ago
  in principle llms should do better on immutable languages since there is no risk a term will get modified by a distant function call.
  [-]
  - bevr1337 7 hours ago
    In my experience, it's the functional part, not immutability, where they fall short. Any LLM can write immutable C# because it's easy and there's incredible amounts of training data.
    [-]
    - throwawaymaths 7 hours ago
      good news, "immutable" is pretty much the only way that elixir is "functional" except for lambdas being first class datatypes (which is almost every language now)
- neya 6 hours ago
  I use o3, it's really good with Elixir. I prefer it to Claude, but Claude does a decent job as well.
  If you want to take your website and business down, use ChatGPT-4o's code
- pawelduda 6 hours ago
  Claude 3.5 produces very good Elixir/Phoenix code. Haven't tried 3.7 much but I assume it's only going to get better from here
- mrcwinn 7 hours ago
  Worried it might fall behind… further? I love LiveView, Phoenix, Elixir, OTP. But the ecosystem is a wasteland of abandoned packages.
  If Phoenix.new helps solve that problem, I’m all for the effort. But otherwise, the sole focus of the community leaders of Elixir should be squarely and exactly focused on creating the incentives and dynamics to grow the base.
  Compare, for example, Mastra in TypeScript or PydanticAI in Python. Elixir? Nothing.
  Not here to bash. It’s more just a disappointment because otherwise I think nothing comes close.
  [-]
  - uncircle 7 hours ago
    All languages are a wasteland of abandoned packages, i.e. there is a very long tail of stuff no one has maintained for years. It’s all relative to the mindshare. For its size, Elixir is doing quite well.
    [-]
    - mrcwinn 5 hours ago
      It's not the long tail. It's that the HEAD of packages in Elixir are also often poorly maintained or not maintained. The fundamental question for any developer: can I be productive quickly? Despite all that Elixir has going for it, the answer is often "no."
      Want a first-party client library for the service you're using? Typically the answer is "too bad, Elixir developer." And writing your own Finch or Req wrapper for their REST endpoint simply isn't a valid answer.
      >For its size, Elixir is doing quite well.
      I'm actually arguing the opposite. Elixir is not doing well because of its size. So how can that be influenced and changed?
      [-]
      - prophesi 3 hours ago
        What packages in Elixir have you found unmaintained/missing in the ecosystem? Genuinely curious.
    - erichocean 6 hours ago
      Most languages require maintenance.
      Some languages—Clojure is a good example—have packages from 10 years ago, entirely unmaintained, that still work great because no maintenance is needed.
      [-]
      - arrowsmith 6 hours ago
        This is also true for Elixir though. A lot of "unmaintained" Elixir packages still work fine.
        [-]
        sodapopcan 6 hours ago
        That's their point (I think, lol).
      - spiderice 6 hours ago
        In my experience, Elixir is very much on that end of the spectrum as well. I'm wondering if GGP just considers packages that don't have updates for 6 months as "unmaintained" or "dead" because they come from Javascript world where everything is, well... you know.
  - mervz 1 hour ago
    This is such a weird thing to say and I see it all of the time. It sucks people have been tricked into thinking a library must be updated every 2 weeks in order to still be relevant.
    You think just because an author bumps the version number of a library it's somehow better than a library that is considered complete?
    It boggles my mind that people actually think this way.
  - conradfr 6 hours ago
    Old packages usually still run great in Elixir though.
- zorrolisto 7 hours ago
  Same, I watched a video from Theo where he says Next.js and Python will be the best languages because LLMs know them well, but if the model can infer, it shouldn’t be a problem.
  [-]
  - rramon 7 hours ago
    Folks on YouTube have used Claude Code and the new Tidewave.ai MCP (for Elixir and Rails) to vibe code a live polling app in Cursor without writing a line of code. The 2hr session is on YT.
    [-]
    - rahimnathwani 5 hours ago
      This one?
      https://www.youtube.com/live/V2b6QCPgFTk
      [-]
      - rramon 3 hours ago
        That's the one.
  - sieabahlpark 6 hours ago
    [dead]
  - dingnuts 7 hours ago
    since models can't reason, as you just pointed out, and need examples to do anything, and the LLM companies are abusing everyone's websites with crawlers, why aren't we generating plausible looking but non working code for the crawlers to gobble, in order to poison them?
    I mean seriously, fuck everything about how the data is gathered for these things, and everything that your comment implies about them.
    The models cannot infer.
    The upside of my salty attitude is that hordes of vibe coders are actively doing what I just suggested -- unknowingly.
    [-]
    - Imustaskforhelp 7 hours ago
      For what its worth, AI already has subpar data. Atleast this is what I've heard.
      I am not sure, but the cat is out of the box. I don't think we can do anything at this point.
    - fragmede 7 hours ago
      But the models can run tools, so wouldn't they just run the code, not get the expected output, and then exclude the bad code from their training data?
      [-]
      - bee_rider 6 hours ago
        That seems like a feedback loop that’s unlikely to exist currently. I guess if intentionally plausible but bad data became a really serious problem, the loop could be created… maybe? Although it would be necessary to attribute a bit of code output back to the training data that lead to it.
- te_chris 6 hours ago
  Claude and o3 are both excellent (if a bit erratic) elixir developers.
CollinEMac 6 hours ago
I'm a little surprised by the sentiment here that LLMs don't do well with Elixir. I've had a pretty good experience using AI tools on Phoenix/Elixir side projects.
[-]
- vaer-k 45 minutes ago
  I've only used LLMs with Elixir, so I don't have any other experience for comparison, but I've found that although Claude frequently employs the wrong approaches in Elixir, I usually know when he'll have trouble and just ask him to read pertinent documentation first. So long as he's read the manual he seems to do just fine.
- arrowsmith 6 hours ago
  LLMs are definitely a lot better at Elixir than they used to be - the gap has closed somewhat. I still perceive a gap though, especially when trying to do more complicated things in Phoenix and LiveView (as opposed to just raw Elixir.)
  Which LLMs do you use that you find are best with Elixir/Phoenix?
  [-]
  - CollinEMac 6 hours ago
    I haven't really used much other than Claude.
    I guess I should also note that I haven't really used LiveView much.
jsmo 5 hours ago
Hmm, just signed up to check it out but no trial just "$20/mo of Built-In AI Assistance" without any mention of usage limits?
[-]
- afro88 5 hours ago
  Same. Agents can be very expensive. Plus I have no idea how reliable or effective this actually is in practice. Would love to try it first.
rustc 6 hours ago
"Sign in with fly.io" takes me to a page asking me to pay $20 but the plan details are vague - what exactly is included in "$20/mo of Built-In AI Assistance Builds, refactors, and debugs right in your IDE"?
[-]
- tptacek 37 minutes ago
  This is a situation where we've been pushing on Chris to get this out into the world quickly, and there's a lot of packaging stuff like this that isn't fully put together yet. Thanks for calling it out! We'll get to it over the next week.
gavmor 6 hours ago
Phoenix.new looks powerful, and I'll definitely play with it.
I've been daydreaming of an agentic framework that maximally exploits BEAM. This isn't that, but maybe jido[0] is what I'm looking for.
0. https://github.com/agentjido/jido
[-]
- fridder 5 hours ago
  TIL; That is a very interesting library
unvs 2 hours ago
Any chance of open sourcing the model instructions for this? Do you feed it all the Phoenix/LiveView/Elixir docs, or have you written more specialised instructions?
I find Claude to have quite a bit of problems trying to navigate changesets + forms + streams in my codebase, just wondered if you had any tips of making it understand better :)
themgt 7 hours ago
This looks really cool, but I gotta say I'm a bit uneasy with the apparent(?) closed-source + hosted + branding. "mix phx.new" is the way to generate a new Phoenix project, but "Phoenix.new" is closed source Fly.io product for building Phoenix projects?
Feels like we're getting into a weird situation if LLM providers are publishing open source agentic coding tools and OSS web app frameworks are publishing closed source/non-BYOK agentic coding tool. I realize this may not be an official "Phoenix" project but it seems analogous to DHH releasing a closed-source/hosted "Rails.new" service.
krts- 4 hours ago
This is incredible. It does seem quite expensive compared to Zed or Claude Code now it's on Pro. But neat enough I've burned through the $20 subscription credit despite being a bit of an AI sceptic. This seems to have a much better handle on UI design (unless I'm missing something with the other agents), but as a solo dev I'm becoming quite convinced. It's also got me to try out fly again.
I couldn't get Tidewave working but I must try again to see if Tidewave with Claude Code would offer this level of awesome.
ps. @fly - please let me buy more credit, I just get an error!
[-]
- chrismccord 3 hours ago
  Thanks for the feedback! Send your fly email to [email protected] and I'll get things sorted out. We'll throw you some credits for the trouble :)
  [-]
  - krts- 3 hours ago
    Hero.
qudat 1 hour ago
Technically this is really. Practically I don’t understand the point. Who is the target demographic? People that don’t have a local dev environment?
moffers 7 hours ago
The mental models of Elixir/OTP and AI Agents are very compatible. I’ve felt for a long time that it would be one of the best platforms for building AI agents.
nilirl 6 hours ago
Beautiful demo! How do I build agents like this?
Does anyone know any great resources to learn how to design agents? Tool agnostic resources would be awesome.
[-]
- chrismccord 6 hours ago
  Thanks! Everything is overly complicated in this space. It's probably far easier than you think. The open secret is it's just a loop that POST [provider]/chat/completions.
  Elixir is particularly well suited here. In Elixir this is a genserver doing http posts and reacting to the token stream. The LiveView chat gets messages from the genserver agent regardless of where it is on the planet, and the agent also communicates with the phoenix channel websocket talking to the IDE machines with regular messages, again anywhere they are on the planet.
  I talk about this quite a bit in my ElixirConfEU talk and distill things down: https://youtu.be/ojL_VHc4gLk?si=MzQmz-vofWxWDrmo&t=1040
  [-]
  - nilirl 5 hours ago
    Wow, thank you for the link, it clarified how tool calling and "choice-making" works.
    It's like helping LLMs use a computer; like building an interface for it.
    Ok, this is enough to get me started.
__0x01 6 hours ago
What LLM does Phoenix.new use?
[-]
- chrismccord 2 hours ago
  claude 4 sonnet as the main driver atm, and a mix of smaller models depending on the scenario
  [-]
  - tptacek 1 hour ago
    I'm building an agent right now (or rather, extending an agent I build a few weeks ago in a couple hours) and I would love to hear more about which scenarios get assigned to which models.
    (I almost just asked you on company Slack but figured the answer would be more broadly interesting.)
martypitt 7 hours ago
This is really cool! And, that you did it in a few weeks is insane.
How did you get VS Code embedded in your app? I'm aware of projects like Monaco, and that vscode.dev exists - so it's clearly possible - but I didn't realize it was something others could build upon?
Again, kudos!
sockboy 6 hours ago
Integrating AI assistance directly into an Elixir IDE could boost productivity, especially for newcomers. Excited to see how remote SSH and local workflows develop!
ipnon 7 hours ago
Chris is a hacker’s hacker.
mcdirty 6 hours ago
This is great. I had to back out of a phoenix project and rewrite it in Django because I couldn't get good AI assistance. I'm pretty inexperienced with Elixir and Phoenix but understand the benefits enough to want to make projects in it. So this is really cool.
[-]
- causal 6 hours ago
  I had this experience too. Though most of my issues were with my model not necessarily with Elixir itself so much as understanding the Phoenix model, state, and CLI. Maybe even differences in versions? Wasn't always clear.
- dontlaugh 5 hours ago
  I find that baffling. Why not just write the Elixir yourself?
  [-]
  - prophesi 5 hours ago
    Not just baffling, but concerning. LLM's are great for learning new languages, but terrible for outputting code you don't understand yet hope to maintain.
rramon 7 hours ago
Very cool. Does it use Phoenix 7 or 8 in the video?
[-]
- chrismccord 5 hours ago
  1.8rc, which is going 1.8.0 momentarily
- arrowsmith 7 hours ago
  1.7 or 1.8, not 7 or 8
  [-]
  - rramon 6 hours ago
    Thanks for correcting my typo.
sergiotapia 3 hours ago
What model is it using for all the agentic work?
What usage limits do we get with the $20 monthly price?
Thank you!
johnwheeler 5 hours ago
Why wouldn't LLMs work as well with Elixir as python or JS? LLMs don't parse an AST.
[-]
- lawn 5 hours ago
  Because of the amount of Python and JS in the wild is much more than the amount of Elixir code, so the LLMs have much more data to base their answers on.
krainboltgreene 6 hours ago
I've wasted a lot of time and energy on stuff that doesn't matter, so I can hardly judge anyone else on what they focus on, but man does it feel bad to have community leaders actively focus on building out tooling that is anti-worker. I think the only way I'd feel more conflicted is if Fly.io started building weapons systems for the military. I guess that wouldn't be shocking considering some of their lead's beliefs.
[-]
- mrkurt 5 hours ago
  It's safe to say that if either Chris or I believed this to be anti worker, we wouldn't be working on it. He's spent the last 10+ years working on Phoenix specifically to improve the lives of the people doing the work.
  My experience with software development is maybe different than yours. There's a massive amount of not-yet-built software that can improve peoples' lives, even in teeny tiny ways. Like 99.999% of what should exist, doesn't.
  Building things faster with LLMs makes me more capable. It (so far) has not taken work away from the people I work with. It has made them more capable. We can all build better tools, and faster than we did 12 months ago.
  Automation is disruptive to peoples' lives. I get that. It decreases the value of some hard earned skills. Developer automation, in my life at least, has also increased the value of other peoples' skills. I don't believe it's anti worker to build more tools for builders.
  [-]
  - jonator 5 hours ago
    Exactly. And as more software is written, the demand and possible set of softwares increase.
    It's really a matter of positive sum/growth mindset vs scarcity/status quo mindset.
  - krainboltgreene 5 hours ago
    > There's a massive amount of not-yet-built software that can improve peoples' lives, even in teeny tiny ways. Like 99.999% of what should exist, doesn't.
    We agree on this completely, however you and I know there are plenty of people without jobs in the world who could be employed to do this work. You are spending your finite amount of time on earth working with services that are trying to squeeze the job market (they've said this openly) rather than spending it increasing the welfare of workers by giving them work.
    > Automation is disruptive to peoples' lives.
    You know the difference between automation and the goals of these companies. You know that they don't want to make looms that increase the productivity of workers, they want to replace the worker so they never have to pay wages again.
    [-]
    - tptacek 4 hours ago
      rather than spending it increasing the welfare of workers by giving them work.
      Saying the quiet part loud here.
- wturner 4 hours ago
  Most tech isn't "Anti worker". What determines pro/anti worker are laws and government policies that reciprocate with the cultural norms we adopt. At the moment, money in U.S. politics is the most anti-worker phenomena I can think of. The ultra wealthy have a monopoly on the incentives that create policy and how our lives are ordered. The only power working people seem to have is the ability to impose consequences via rogue guerilla acts of protest and violence (Luigi Mangion) . Hopefully, AI is a Frankenstein monster the public learns to wield to facilitate more of these "consequences" and upend the monopoly the super wealthy have on policy incentives and change the way politics is funded for good. It's a new world and a Hawaiian or New Zealand doomsday bunker isn't going make a difference.
- leafmeal 5 hours ago
  Do you believe making things easier and more accessible is bad for workers? I don't think it inherently is or isn't, it just depends on who benefits from the increased efficiency. I think that's more of a problem with your economic system, or wealth distribution.
  Overall I think we would all be happier if efficient machines take away the drudgery of our daily work and allow us to focus on things that really matter to us. . . as long as our basic needs are met.
  [-]
  - krainboltgreene 4 hours ago
    > Do you believe making things easier and more accessible is bad for workers?
    Nope, I've been doing it for 16 years.
- jonator 5 hours ago
  You can think of it as just automating the boring tedious stuff so us humans can focus on the harder problems like strategy, direction, design, GTM, etc.
  The days are numbered where humans are sitting typing out code themselves.
  It's akin to the numbered days of type writer secretaries of the 20th century.
  [-]
  - krainboltgreene 4 hours ago
    I know what stories workers in the industry are using to cope with working with capitalists that have explicit goals of eliminating workers.
    I'm sure your poor understanding of the history of improved tooling, like "type writer secretaries", will be a soft comfort in the future.
    [-]
    - jonator 1 hour ago
      And you don't think capitalism is the reason we have these computer jobs to automate away in the first place?
- yunwal 5 hours ago
  The same logic that would lead one to believe that AI is anti-worker should also lead one to believe that software as a whole is anti-worker.
  [-]
  - krainboltgreene 4 hours ago
    Sure, if you don't think about it at all.
    [-]
    - mwcampbell 1 hour ago
      The argument you're responding to is effective enough, based solely on the fact that it has led me to second-guess whether I chose the right line of work, that it would be worth expounding on what you think is wrong with it.
pier25 6 hours ago
I'm surprised they are investing into this. I checked Phoenix recently because I was interested in LiveView and there isn't even an official AWS SDK for Elixir.
Honestly doubt the AI stuff is going to move the needle much if you can't even have a dependable S3 client.
[-]
- chrismccord 5 hours ago
  Meanwhile I am a happy user of :ex_aws or :req_s3 which has done everything I need it to do. Object ops, iam policies, etc. A dependable S3 client has been there for years. The elixir core team doesn't need to maintain it. ReqS3 is one of my favorite things to use: https://hexdocs.pm/req_s3/readme.html
  [-]
  - pier25 5 hours ago
    Maybe you can guarantee Phoenix will be maintained 5-10 years from now but that's really not the case for some random library on Github.
    If you look around you'll see this kind of stuff is really one of the biggest blockers for Elxir and Phoenix. Especially for something as fundamental as cloud storage.
- techpression 6 hours ago
  Considering the horror the official AWS CLI is this seems like a strange example. I’ve used both the non official libraries and they work fine. The one that is auto generated doesn’t feel very Elixir, but that’s to be expected.
  [-]
  - pier25 6 hours ago
    > I’ve used both the non official libraries and they work fine
    Maybe fine today but what about 5 years from now?
    Can you say, with any degree of confidence, if these these libraries are going to be properly maintained in the future? No, you cannot.
    [-]
    - techpression 6 hours ago
      Goes for every library out there though. Official or not.
      [-]
      - pier25 5 hours ago
        Not really. You can be much more confident the official AWS SDK will be available 5 or even 10 years from now.
- conradfr 6 hours ago
  But there's two actively non-official maintained libs.
- olafura 6 hours ago
  What are you talking about, there has been a AWS client forever and I've never had a problem. It's not something you really need an official sdk for they are anyway often just reference because you might want different performance characteristics.
  https://hex.pm/packages/ex_aws https://hex.pm/packages/ex_aws_s3
  I've usually not seen more than 3 or so official SDK for most services and there are a lot more programming languages than that. For example Microsoft's Graph API doesn't have an official Ruby client, they have one that sort of works.
  [-]
  - Snakes3727 5 hours ago
    Neither of them are official which is often a non starter for some large enterprise customers.
    [-]
    - pier25 3 hours ago
      Not only large enterprise customers. Anyone who's thinking mid or long term.
      [-]
      - olafura 2 minutes ago
        I still don't understand this. If you are big enough then you get Amazon to make an official sdk, if you aren't then what exactly are you looking for?
        The official aws cli used to talk to the soap interface and used regex instead of actually doing correct error handling and that was used by so many tools. Even though it used to break horrible.
        It's quite a niche you are talking about, not big enough to debug open source code but still big enough to require SLA for SDK and not being able to talk Amazon into creating it. It's generated code, it's not rocket science.
        What I have experienced is that software licence, where you are sending data to, where you are hosting it and having access to audit the code has usually been a bigger concern.
        But then again big organisations often have really specific concerns. So I'm not doubting your statement it's just that I have never heard it before.
      - sysashi 1 hour ago
        Why do you need an official SDK to make http calls?
        [-]
        pier25 1 hour ago
        Why do you even need a framework like Phoenix? Just write everything yourself! /s
- 4b11b4 3 hours ago
  Not sure what you're talking about..