Speeding up Deep Learning inference via unstructured sparsity

towards-data-science

This post was originally published by Ziheng Wang at Towards Data Science

Serving large neural networks can be expensive. It certainly doesn’t help that neural network size appears to correlate with how useful they are. (Case in point: GPT-3)
Spread the word

This post was originally published by Ziheng Wang at Towards Data Science

Related posts