Kimi K2.7 Code is generally available in GitHub Copilot

(github.blog)

126 points | by unliftedq 4 hours ago

14 comments

  • nsoonhui 1 hour ago
    I used GitHub Copilot for my VS 2026 development and switched between ChatGPT and Claude. That was before I discovered Claude Code and the Codex app. Copilot was OK for my purposes, and the USD 10 per month fee was enough for my usage.

    However, last month they introduced a new pricing model (I know the old pricing was not sustainable), and my USD 10 was exhausted within days. Because of that, I switched to Claude Code and Codex and have never looked back. Yes, tokens on Claude Code and Codex are heavily subsidized, but let's just enjoy the good things while they last.

    I do feel there is a difference between using Claude via Copilot versus using Claude directly in Claude Code. I'm not sure what Microsoft is doing behind the scenes.

    • happyweasel 0 minutes ago
      Same, I switched to Cursor. I told it how to invoke msbuild and it can edit away without needing a native Visual Studio plugin. No problems at all.
    • taspeotis 38 minutes ago
      The harness is super important: which tools are available and what the system prompts say vary from harness to harness.

      Anthropic seems to have a modest lead on their harness and models, so it’s a best-of-both-worlds scenario.

      > I'm not sure what Microsoft is doing behind the scenes

      It’s probably the exact same model, but the tools and the prompts around it are worse, so you get worse results.

      • Vinnl 19 minutes ago
        So if you use Claude via Copilot in Zed... You use Zed's harness, I think? What does Copilot do, at that point?
        • pantulis 3 minutes ago
          It’s providing the inference of Anthropic models
    • arikrahman 1 hour ago
      I had a similar experience moving away from Copilot within Zed. Now I'm using the reasonix harness for DeepSeek, which makes cache hits almost free. And that's with unsubsidized American providers like Digital Ocean or Cloudflare.
      • toyg 12 minutes ago
        I tried using Zed but with local models it constantly breaks on tool calls. I wanted to like it but the smell of vibing is just too much.
        • arcanemachiner 6 minutes ago
          You using models released this year? I hear this complaint a lot, and it's often due to using an old model which is not as good at tool calling as newer models.
      • k__ 41 minutes ago
        Nice.

        I paid $6 yesterday for DeepSeek V4 Flash on OpenRouter. That's like $120 for a month, and it's not even a good model.

        • bel8 30 minutes ago
          For DS4 it's much cheaper and more reputable to use the OpenCode Go $10/mo subscription, or to go directly to the DeepSeek API.
        • epolanski 23 minutes ago
          That's quite an achievement. I managed to spend only $2 on 16 different tasks with V4 Pro.
    • altmanaltman 58 minutes ago
      My Copilot quota finished in maybe 2-3 prompts with Claude 4.8 Opus. I was expecting it to suck, but not this bad. It was good while it lasted, though.
  • andhuman 2 hours ago
    Finally an alternative to the big dogs that a company can use. People have been asking for a way to run the Chinese models from a trusted provider. Here GitHub delivered!

    The performance, if we trust the benchmarks, puts it at the level of Sonnet 4.6.

    Let’s see if it’s worth it with GitHub's pricing.

    • MangoCoffee 2 hours ago
      Microsoft needs to offer a cheaper option since they changed to token-based billing. GPT-5.4 used to be 1x for yearly subscribers, but now it costs 6x. I run out of premium requests after just a couple of prompts. GitHub Copilot for $10 used to be the best value, since you got all the US AI labs' models for cheap.
      • sneezychl 1 hour ago
        Copilot was an insanely good value while it lasted. Only moneysoft could subsidize a service that much.
    • w4yai 6 minutes ago
      > People have been asking for a way to run the Chinese models from a trusted provider

      I'm going to be called a shill again, but at this point I don't care, as it is relevant. Synthetic runs their own models for a reasonable price, GLM5.2 & Kimi K2.7-Code included.

      Referral link:

      https://synthetic.new/?referral=kwjqga9QYoUgpZV

  • kingstnap 46 minutes ago
    Input: $0.95

    Cache hit (most important): $0.19

    Output: $4.00

    This is the same as what Moonshot charges for it, and it puts it at roughly the price of GPT 5.4 mini. Not a bad option.
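A rough way to see why the cache-hit price matters most is to price a single request by token type. This is only a sketch: it assumes the three prices above are in USD per 1M tokens (the comment does not state the unit), and the token counts and the `request_cost` helper are hypothetical, not part of any provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in USD per 1,000,000 tokens (the per-million unit is an assumption)."""
    input_usd: float
    cache_hit_usd: float
    output_usd: float


# Prices as quoted in the comment above.
KIMI_K2_7_CODE = Pricing(input_usd=0.95, cache_hit_usd=0.19, output_usd=4.00)


def request_cost(pricing, fresh_input_tokens, cached_input_tokens, output_tokens):
    """Return the cost in USD of one request, split by token type."""
    total = (
        fresh_input_tokens * pricing.input_usd
        + cached_input_tokens * pricing.cache_hit_usd
        + output_tokens * pricing.output_usd
    )
    return total / 1_000_000


# Hypothetical agent turn: 100k tokens of context, 2k tokens of output.
with_cache = request_cost(KIMI_K2_7_CODE, 10_000, 90_000, 2_000)
without_cache = request_cost(KIMI_K2_7_CODE, 100_000, 0, 2_000)
print(f"90% cache hits: ${with_cache:.4f}, no cache: ${without_cache:.4f}")
```

With these made-up token counts, the cached turn comes to about a third of the uncached one, which is why long agent sessions that resend the same context are dominated by the cache-hit rate.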

    For some context here is a stupid prompt that wastes tokens: "Play a game of tic tac toe against yourself on a 5x5 board, you need 5 in a row to win."

    It costs $0.006 on Kimi K2.7, and you get to see the whole raw reasoning trace.

    GPT-5.4 mini costs $0.016 and it's summarized.

    And in case you are wondering, both play incredibly stupidly.

    Kimi:

          A   B   C   D   E
      1   .   .   .   .   .
      2   .   .   .   .   .
      3   X   X   X   X   X
      4   .   O   O   O   O
      5   .   .   .   .   .
    
    
    GPT 5.4 mini:

      1: X X X X X
      2: O O . . .
      3: . . O . .
      4: . . . O .
      5: . . . . O
    • ubanholzer 36 minutes ago
      Nice idea. I just asked Haiku to do the same in Claude Chat on iOS: it created an interactive React game, implemented the rules and let it play. Clever move for $1 input and $5 output, Anthropic!
    • kingstnap 37 minutes ago
      Btw if anyone is wondering, GPT 5.5 does the same garbage as 5.4 mini for 4 times the cost.

      Fable manages to make a reasonable game, at a cost of 40 cents.

        X X O O O
        O O X X X
        X X X O O
        X O O X O
        X O X X O
    • asimovDev 41 minutes ago
      When I'm extremely bored, I think I'll make two models play chess against each other. I bet there's a chess benchmark / LLM tournament already somewhere.
  • tapirl 14 minutes ago
    Unlike Google, the AI wave appears to deliver positive revenue impacts for Microsoft.

    The company does need to integrate the new AI-human-machine interface into its application development SDKs.

  • mmusc 2 hours ago
    Yes, it's significantly cheaper to run compared to the other models. I tried it for an hour yesterday and the results look promising.

    Saw in a discussion on Reddit that the team is evaluating GLM5.2, so hopefully more to come!

  • skybrian 2 hours ago
    Looks like it’s the same price on Fireworks AI?

    https://fireworks.ai/blog/kimi-k2p7-code

    I don’t know much about them but they did a deal with Microsoft in March:

    https://azure.microsoft.com/en-us/blog/introducing-fireworks...

  • johnathan101 57 minutes ago
    Competition in coding models has gotten intense. A year ago it felt like choosing between two options. Now the bigger question is which model to route each task to.
  • scriptsmith 2 hours ago
    Is GitHub Copilot the best positioned platform for enterprise? They support Claude, GPT, Gemini, and now even open weight models. Larger orgs are paying at API rates anyway so it costs just as much as anywhere else. They have a pretty good agent CLI and SDK, and now a desktop app. They have hosted agents, and you can run their 'Agentic Workflows' in CI.

    Has their reputation tanked so much that the alternatives get all the buzz? Or is it that non-enterprise users are priced out by the usage costs, so no free marketing?

    • gunalx 2 hours ago
      The rugpull with the pricing change without further notice was not taken kindly by enterprise.
    • attentive 1 hour ago
      They were, until they decided to commit suicide for the service.
  • grumbelbart2 21 minutes ago
    Is there a zero-retention option?
  • impact_sy 2 hours ago
    When will DeepSeek be available?
    • pkaye 2 hours ago
      The V4 models are already in Azure AI Foundry, so maybe there's a good chance of it coming.
  • SeriousM 2 hours ago
    Who really cares? The model multipliers and the artificial currency were the final nail in the GitHub Copilot coffin.
    • sognetic 2 hours ago
      Enterprises still have big contracts with GitHub. Those companies are imposing tight spending limits now, and if the open-weight models make those limits last a bit longer, that's probably quite popular.
  • boundless88 3 hours ago
    When will GitHub Copilot support integrating custom models?
    • mvATM99 1 hour ago
      It does, but it's very poorly documented and quite unstable (on purpose, I think). What the other commenter said about the VSCode BYOK seems to be the more reliable way.

      I tried adding a Foundry LLM as a GitHub Copilot custom model and failed miserably. But with VSCode BYOK (and GitHub Copilot as the interface) I did get it working, and I can now use DeepSeek V4 Flash with Copilot.

    • Klaster_1 2 hours ago
      AFAIK you can already use custom models in VSCode Copilot, but probably not for cloud workloads yet.
    • ignoramous 2 hours ago
      Copilot Chat has supported BYOK since Oct 2025 for the VSCode plugin: https://code.visualstudio.com/blogs/2026/06/18/byok-vscode
  • websap 2 hours ago
    Where is the inference running?