Making LLM Training Faster with Unsloth and NVIDIA

(unsloth.ai)

47 points | by segmenta 3 hours ago

2 comments

stared 1 hour ago
While I do admire Unsloth (especially their https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF binarizations), the linked blog post looks like it was written by AI from notes (unless a human author acquired this taste from interactions with chatbots).
- danielhanchen 16 minutes ago
  Oh thanks :) We're also going to add MTP support soon for Qwen3.6!
  90% of it is fully human done - the maths, algos, code snippets, screenshots & benchmarks are done / conducted by us and NVIDIA :)
  We did use AI to fix spelling errors, and spice up the intro a bit + made some nice plots using Chat (ours would look horrible lol)
- adityamwagh 20 minutes ago
  What’s with all the hate for AI-assisted writing on Hacker News? It’s a tool, and people use tools all the time. It saves TIME and helps improve the coherence of one’s articles.
  - saberience 17 minutes ago
    Because AI writing is lazy and, moreover, I don’t want to know the AI’s opinion on something; I can get that myself. If I want to read someone’s article, I want to hear that person’s words and that person’s opinions.
    If someone has no opinions or unique insight, then why would I listen to them or read their content?
    Again, if I want the AI’s view on something, I can open up Claude and ask it myself. Why bother reading generated articles that took someone else 10 seconds to prompt?
    - danielhanchen 15 minutes ago
      The intro was spiced up (I guess we won't do it from now on :)) but the remaining 90% (maths, benchmarks, code, explanations etc.) was fully done by us!
electroglyph 1 hour ago
nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.
- danielhanchen 16 minutes ago
  Thank you!