Making LLM Training Faster with Unsloth and NVIDIA

(unsloth.ai)

47 points | by segmenta 3 hours ago

2 comments

  • stared 1 hour ago
    While I do admire Unsloth (especially their https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF binarizations), the linked blog post looks like it was written by AI from notes (unless a human author acquired this taste from interactions with chatbots).
    • danielhanchen 16 minutes ago
      Oh thanks :) We're also going to add MTP support soon for Qwen3.6!

      90% of it is fully human done - the maths, algos, code snippets, screenshots & benchmarks are done / conducted by us and NVIDIA :)

      We did use AI to fix spelling errors, and spice up the intro a bit + made some nice plots using Chat (ours would look horrible lol)

    • adityamwagh 20 minutes ago
      What’s with all the hate for AI-assisted writing on Hacker News? It’s a tool, and people use tools all the time. It saves TIME and helps improve the coherence of one’s articles.
      • saberience 17 minutes ago
        Because AI writing is lazy and, moreover, I don’t want to know the AI’s opinion on something; I can get that myself. If I want to read someone’s article, I want to hear that person’s words and that person’s opinions.

        If someone has no opinions or unique insight, then why would I listen to them or read their content?

        Again, if I want the AI’s view on something I can open up Claude and ask it myself. Why bother reading generated articles that took someone else 10 seconds to prompt?

        • danielhanchen 15 minutes ago
          The intro was spiced up (I guess we won't do it from now on :)) but the remaining 90% (maths, benchmarks, code, explanations, etc.) was fully done by us!
  • electroglyph 1 hour ago
    nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.