MSA: Memory Sparse Attention

(github.com)

28 points | by chaosprint 2 days ago

2 comments

  • cyanydeez 1 hour ago
    Neat. Can't wait for our language, framework specific tools for models. I don't need my models writing shakespeare, unless I'm working on shakespeare.
  • mememememememo 1 hour ago
    [dead]