
Sign up to save your podcasts
Or


The paper introduces the fast feedforward (FFF) architecture, which offers comparable performance to feedforward networks at a fraction of the inference cost. FFFs can replace feedforward networks and mixture-of-expert networks in transformers. A vision transformer trained with FFF achieves single-neuron inferences with only a 5.8% performance decrease.
By Igor Melnyk5
33 ratings
The paper introduces the fast feedforward (FFF) architecture, which offers comparable performance to feedforward networks at a fraction of the inference cost. FFFs can replace feedforward networks and mixture-of-expert networks in transformers. A vision transformer trained with FFF achieves single-neuron inferences with only a 5.8% performance decrease.

958 Listeners

1,977 Listeners

438 Listeners

112,858 Listeners

10,073 Listeners

5,535 Listeners

214 Listeners

51 Listeners

98 Listeners

473 Listeners