Anthropic is expanding to Colossus2. Will use GB200

(xcancel.com)

80 points | by aurareturn 5 hours ago

14 comments

  • gaze 1 hour ago
    I think that's the datacenter with the gas turbine generators that operate without permits because they're "portable." Data centers have tremendous externalities but colossus is a particularly nasty offender, and not just due its size.

    Edit: They did it with Colossus and now they're doing the exact same thing with Colossus2. https://www.selc.org/news/xai-built-an-illegal-power-plant-t...

    • stronglikedan 3 minutes ago
      Seems like selc's time would be better spent trying to close the loophole that allows for unpermitted turbine generators instead of going after one company for doing what they were allowed to do when they did it.
    • hemul3n 33 minutes ago
      I dug into this topic in some detail on my blog and it's both enraging and depressing.

      https://poiesic.com/posts/pattern-recognition

    • georgemcbay 1 hour ago
      Yup, and now Anthropic is complicit in the environmental damage and health problems for local residents that these data centers are causing.

      But hey, number must go up, right?

      • idle_zealot 1 hour ago
        Have you considered that the march of progress requires human blood to grease the gears and mulched skulls to pave the (highly efficient) road? Really, when you take into account all of the future lives this will improve and save it's difficult to claim any cost now is too high. Would you stand in their way and delay the day that Mythos cures cancer?

        This is a joke. Read it in a mocking tone.

        • projektfu 53 minutes ago
        • thelastgallon 23 minutes ago
          Not Anthropic, but Sam Altman - AI will solve climate change and cure all diseases.
        • louiereederson 22 minutes ago
          Per sanguinem ad astra
        • morkalork 43 minutes ago
          When do we start building pyramids and doing the Sardaukar blood letting ritual?
          • hugh-avherald 32 minutes ago
            When the 10-year yield hits 6%. Basic macroeconomics.
        • LogicFailsMe 57 minutes ago
          I'm not saying human blood and mulched skulls are a renewable source of power, I'm just saying. Or maybe they can partner with SoulCycle to power computation with 24/7 spin classes?
          • kennywinker 25 minutes ago
            And people called the matrix’s human batteries far-fetched.
        • Johnny_Bonk 58 minutes ago
          I loled
      • broknbottle 50 minutes ago
        Some of you may die, but it’s a sacrifice that I am willing to make.
        • Nition 28 minutes ago
          A superintelligent AI will be safe though, because it learnt its morality from us.
      • shimman 49 minutes ago
        Do you think Boris cares about people getting cancer and dying from these data centers? No, he cares about becoming rich as fuck.
      • jquery 17 minutes ago
        Given how much our EPA has been gutted by the current administration, I don’t think relief is very likely.
    • redox99 46 minutes ago
      What's the tremendous externality of gas generators? People heat their own homes with natural gas and it's no big deal. How can a datacenter that is miles away be worse than that?
      • jLaForest 43 minutes ago
        The gas furnace in my basement don't have a massive jet turbine emiting high frequency noise
        • redox99 3 minutes ago
          I wouldn't call noise pollution a "tremendous externality". The gas turbines should just be placed far enough from where people live.
  • virgildotcodes 36 minutes ago
    Here's a probably stupid question - if someone were unbounded by ethics and conceivably had enough power and connections to power to shield themselves from many consequences of their actions - and that person owned these DCs, could they in theory observe all the streams of tokens coming in and out of these models, and even exfiltrate copies of these models wholesale to have their own teams do what they will with them in the pursuit of building their own competitive models?

    Or is there something fundamental in the way these models get deployed (encryption or something or than legal contracts?) at this scale that prohibits the owners of the infra from gaining this level of insight / access?

    • tristanj 27 minutes ago
      1) The situation you described would be covered under the contract between Anthropic and xAI, and that any violation of that would be subject to financial penalties and legal proceedings. The US has a robust corporate legal system, and disputes do get resolved through the court system, although in a slow and costly manner.

      The contract can stipulate a penalty at a high enough amount to discourage this behavior.

      2) Output from models & intra-datacenter communications can be encrypted if customers truly cared.

      3) There is no reason do this, because there are far better ways to exfiltrate data from Anthropic models. Chinese companies are already doing this at an industrial scale where they are reselling Claude tokens for 10-20% of the cost while retaining the data to train their own models. https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...

      If we look at Deepseek V4-pro, created by Deepseek who Anthropic formally accused of harvesting Claude tokens at scale, it performs the same as Claude did 6 months prior.

    • fragmede 29 minutes ago
      There's accusations that the Chinese labs have done essentially that to OpenAI and Anthropic and exfiltrated their models without having DC access, so if you had DC access, yes, you could do that. If you had DC access though you could just copy the model onto an SSD.
  • aurareturn 4 hours ago
    More signs that xAI might be giving up on the AGI race. xAI let Cursor train a model on Colossus 2, gave the entire Colossus1 to Anthropic, and is now giving compute in Colossus2 to Anthropic as well.
    • papascrubs 2 hours ago
      Elon lost his lawsuit with openAI and knows xAI isn't on the same trajectory. Might as well try to win the bet and flip off Sam by supporting the best competition. Also they are getting a head start on AI as a commodity. I'm sure there's plenty of money to be made for those that can leverage their capital to essentially rent capacity right now. If he's not making enough off of grok, might as well cover their expenses.
    • tristanj 40 minutes ago
      Bad read on the situation. xAI has too much compute and not enough customers using it. They have around half a million GPUs, some of which are stolen from Tesla, running at 11% utilization. xAI predicted more people would be using Grok, but Grok is not a SOTA model & users primarily want to use SOTA models. They have excess capacity and it makes sense to rent out GPUs to other customers while they improve their models.
    • paxys 2 hours ago
      It was kinda obvious when SpaceX "acquired" it. Elon rewarded xAI investors/prevented lawsuits by giving them SpaceX equity, and that was that.
  • alienreborn 1 hour ago
    Why is xAI giving up their advantage? Is this a signal that their frontier model improvements are plateauing and decided there is no value in hoarding all their compute?
    • dktp 1 hour ago
      I would guess it's purely because Grok isn't nearly in-demand enough to produce meaningful revenue. And they want to juice the numbers for IPO

      And I'm sure it's a bonus point for Musk that it goes to OpenAI's most relevant competitor

    • SoKamil 1 hour ago
      As weird as it seems I think this is Musk’s best shot at winning over Altman. He has personal vendetta.
    • Computer0 1 hour ago
      When did xAI have an advantage?
    • tw04 1 hour ago
      What advantage? Has there ever been any indication they’re leading in any segment? Sure Elon has thrown a bunch of money at hardware, but to what end?

      And frankly as bad as Altman is from a: if AI is really going to disrupt humanity do I want this guy in charge? Elon is 10x worse. So why would the best and the brightest ever work for him?

      • abraxas 1 hour ago
        No other LLM has made as much child porn as grok so there is that...
      • eightysixfour 1 hour ago
        In a compute starved world, big ass data centers are an advantage.
        • loktarogar 1 hour ago
          yes, but it's only advantage if one is compute-strained and the other isn't. if they both have lots then there's no advantage. if one doesn't fully utilise their compute then it's not an advantage either
          • eightysixfour 1 hour ago
            Well, it appears all their competitors are compute starved so…
        • 8note 1 hour ago
          can you elucidate what that advantage is, that isnt renting it out for the highest price to somebody that really needs it?
          • eightysixfour 1 hour ago
            Well, other than your ability to turn that into cash by renting it out for the highest price to someone who needs it, you can promise prospective employees that are supposed to use that infra to train models that they won’t be compute starved.

            You can kick off more model training runs and experiments than your competitors.

            You can kick off a $1-2t IPO claiming you are going to capture a large portion of the largest TAM the world has ever seen.

            • sumeno 24 minutes ago
              They have neither the most resources nor the best models. They are mediocre at everything except the CSAM generation market, they've got that one cornered
          • root_axis 58 minutes ago
            Training a model that is larger than your competitor's.
      • Rover222 37 minutes ago
        The datacenter advantage, obviously
    • guluarte 23 minutes ago
      I think they overestimated the demand for Grok, which is mostly useless, and now they have too much compute on hand.
  • urams 1 hour ago
    Very possible Elon is doing this to make give Anthropic better chances against OAI while he attempts to reshape xAI.

    Also possible he sees infra as the future of xAI if he really believes in the value of space compute.

    Hard to see this any of this as anything other than a bearish sign for Grok though.

    • fillskills 43 minutes ago
      We could see the first company vertically integrated from etching to chip to data center
      • jacobrast 28 minutes ago
        What do you mean by etching? Google does also it's own chip design with TPUs, data centers, and models but afaik only TSMC Intel and Samsung do the actual semiconductor fabrication
    • jameson 47 minutes ago
      Right, also Anthropic has been having difficult time getting more GPUs
  • brightball 37 minutes ago
  • amazingamazing 1 hour ago
    I don't see a scenario where it really makes sense to be a frontier lab long term. Eventually model quality will plateau then you distill and get 90% for 10% or less cost.
    • hungryhobbit 1 hour ago
      General AI is that scenario. The investor dream is that their horse hits general AI first, patents it (or otherwise somehow stops the competition from hitting it), and then reaps the massive benefits.

      I'm not saying it's a likely scenario, but I genuinely believe a big percentage of AI investment revolves around that (or similar) scenarios.

    • twoodfin 1 hour ago
      Over the past 6 months, Anthropic has made more waves as a product company than a frontier lab.
  • throwatdem12311 52 minutes ago
    Cool. Can they make Claude not absolute dogwater then?
  • mrcwinn 28 minutes ago
    Anthropic is gross for this. The grandstanding about principles and values is intolerable.
  • labrador 21 minutes ago
    Musk said Anthropic Claude was woke DEI until he said it wasn't. It must be hard for Musk fans to keep up.
  • jgalt212 1 hour ago
    Too bad Enron is still not around. They'd have some real fun with today's electrical markets.
  • fasbg1 1 hour ago
    I though Claude is too woke. Musk has posted that at least 50 times in the last year.

    But booking outrageous rental fees as fake AI revenue ahead of the SpaceX IPO apparently takes precedence.

    • 0xy 1 hour ago
      Anthropic is paying real cash, how is it fake revenue?
      • 123aad 1 hour ago
        Fake __AI__ revenue. Maybe Hetzner should build Colossus4, rent it out and book it as AI revenue instead of hardware rental revenue and get a P/E of 100.
        • nikcub 1 hour ago
          Anthropic this quarter will have revenue of $10.9B, up from $4.8B last quarter[0]. They're paying SpaceX $1.25B per month for compute[1] - which is more than what SpaceX earn on space. SpaceX spent about $30-40B in capex on Colossus 1 & 2.

          This is all real revenue, real spend, real usage.

          Hetzner just aren't at this scale. Not even close. If they wanted to get into this business - first, they're late. Second, it's at a scale of ~10x of their total lifetime datacenter buildout. Third, they'd need to change their business to being one that is debt fronted.

          xAI have proven out that being able to deploy compute is a very viable business (and difficult to pull off)

          At some point AI cynicism clashes with reality, it must be exhausting maintaining it.

          [0] https://www.wsj.com/tech/ai/mind-blowing-growth-is-about-to-...

          [1] https://www.wired.com/story/spacex-ipo-anthropic-compute-fin...

          • er1276 1 hour ago
            Jesus Christ, the Hetzner example is obviously an example of booking revenue as AI revenue (where investors assume it is generated by Grok subscriptions) vs. hardware rental revenue, which traditionally not valued as highly.

            Nowhere does the hypothetical state that Hetzner, an example for hardware rental, has the funding or the capabilities to execute the sarcastic example.

            But ok, now hardware rentals have a P/E of 100 or more.

            • gdhkgdhkvff 28 minutes ago
              Investors aren’t dumb. These numbers are being reported and the fact that the data centers are being rented out is publicly disclosed everywhere. Investors know full well that the revenue is from the data center rental. No (non-retail) investor is going to see the jump in revenue and think “I better buy up because grok must be kicking ass!”

              And yes, if hetzner built a massive AI hyper scale datacenter and rented it out for billions, with the expectation that they would keep building more, they would also see massive PE ratios because it’s expected that their revenue would be going up.

            • jgalt212 1 hour ago
              I think you're low at 100. TSLA has 370 PE, and SpaceX is targeting 250+.
        • tptacek 1 hour ago
          If Hetzner could build that, they would.
    • Rover222 36 minutes ago
      He talked with the Anthropic team, and his concerns lessened. It's actually a good thing to be able to change one's mind.
    • tmp10423288442 1 hour ago
      The eternal truth: money talks, bullshit walks
    • thelastgallon 1 hour ago
      Musk will buy Anthropic and fix the wokeness. It was done before.
      • gdiamos 59 minutes ago
        Don’t put it past Dario to buy spaceX
        • redox99 42 minutes ago
          Elon will never sell SpaceX. And he controls 86% of votes.
  • varalaakshay 17 minutes ago
    [flagged]