Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures

Cut the Fat from Big Neural Nets — make them smaller and faster

Big learning models often have many parts that do almost nothing. We looked at their inner units, called neurons, and saw that many give near-zero output no matter what input they are shown. Those quiet parts can be dropped, making the model lighter without breaking its brain. The trick is simple: find the quiet units, remove them, tune the remaining parts a bit, and repeat until the model is as lean as you want (a rough sketch of that loop is below). This makes the system use less memory and run quicker, while keeping the same, or sometimes even better, accuracy.
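Here is a minimal sketch of that trim-and-retrain loop, assuming a PyTorch-style model with fully connected layers. The function names (`average_zero_fraction`, `prune_quiet_neurons`) and the 0.9 threshold are illustrative choices, not anything from the post itself, and the "quietness" score used here is simply the fraction of near-zero activations observed over a sample of data.

```python
import torch
import torch.nn as nn


def average_zero_fraction(model, layer, loader, device="cpu"):
    """Fraction of (near-)zero outputs per neuron in `layer`, measured over `loader`."""
    counts, total = None, 0
    captured = {}

    def hook(module, inputs, output):
        captured["out"] = output.detach()

    handle = layer.register_forward_hook(hook)
    model.eval()
    with torch.no_grad():
        for inputs, _ in loader:
            model(inputs.to(device))
            out = captured["out"]
            # Put the neuron/channel dimension first, flatten everything else.
            out = out.transpose(0, 1).reshape(out.shape[1], -1)
            zeros = (out.abs() < 1e-6).float().sum(dim=1)
            counts = zeros if counts is None else counts + zeros
            total += out.shape[1]
    handle.remove()
    return counts / total  # one score per neuron; higher means quieter


def prune_quiet_neurons(layer, next_layer, zero_fraction, threshold=0.9):
    """Rebuild `layer` without its quiet output neurons and drop the matching
    input columns of `next_layer` (both assumed to be nn.Linear)."""
    keep = (zero_fraction < threshold).nonzero(as_tuple=True)[0]
    slim = nn.Linear(layer.in_features, len(keep), bias=layer.bias is not None)
    slim.weight.data = layer.weight.data[keep].clone()
    if layer.bias is not None:
        slim.bias.data = layer.bias.data[keep].clone()
    slim_next = nn.Linear(len(keep), next_layer.out_features,
                          bias=next_layer.bias is not None)
    slim_next.weight.data = next_layer.weight.data[:, keep].clone()
    if next_layer.bias is not None:
        slim_next.bias.data = next_layer.bias.data.clone()
    return slim, slim_next
```

After a pruning pass like this you would fine-tune for a few epochs, recompute the scores, and prune again, which is the "trim, retrain, repeat" loop the post describes.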

The idea sounds small, but it saves a lot of work and cost. You don't need to build a new model from scratch; just trim the extra pieces and train a little more. Many big models were tested and ended up much…
