ELLIS header
University of Stuttgart Logo
Max Planck Institute for Intelligent Systems Logo

On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation

Nghiem Tuong Diep, Huy Nguyen, Chau Nguyen, Minh Le, Duy Minh Ho Nguyen, Daniel Sonntag, Mathias Niepert, Nhat Ho

Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025.


Abstract


Links


BibTeX

@inproceedings{diep2025zero, author = {Diep, Nghiem Tuong and Nguyen, Huy and Nguyen, Chau and Le, Minh and Nguyen, Duy Minh Ho and Sonntag, Daniel and Niepert, Mathias and Ho, Nhat}, title = {On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation}, booktitle = {Proceedings of the 42nd International Conference on Machine Learning (ICML)}, year = {2025} }