HyperMixer: An MLP-based Low Cost Alternative to Transformers
Published in ACL, 2023
Recommended citation: Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, François Marelli, François Fleuret and James Henderson. (2023). "HyperMixer: An MLP-based Low Cost Alternative to Transformers." ACL 2023. https://arxiv.org/abs/2203.03691
We propose an efficient all-MLP architecture with the same inductive biases as Transformers.