
The paper introduces the fast feedforward (FFF) architecture, which offers performance comparable to feedforward networks at a fraction of the inference cost. FFFs can replace feedforward networks and mixture-of-experts networks in transformers. A vision transformer trained with FFFs achieves single-neuron inference with only a 5.8% decrease in performance.
By Igor Melnyk
