Model sizes are crazy these days, with billions and billions of parameters. As Mark Kurtz explains in this episode, this makes inference slow and expensive, even though up to 90% or more of the parameters don’t influence the outputs at all. Mark helps us understand all of the practicalities and progress that is being …
Large models on CPUs with Mark Kurtz, director of ML at Neural Magic (Practical AI #221) |> Changelog
Tagged with: model sizes, Mark Kurtz