End-to-end Planner Training for Language Modeling
Published in arXiv, 2024
Recommended citation: Nathan Cornille, Florian Mai, Jingyuan Sun, Marie-Francine Moens. (2024). "End-to-end Planner Training for Language Modeling." arXiv:2410.12492. https://arxiv.org/abs/2410.12492
We propose a differentiable method for joint fine-tuning of language models with planning modules by using predicted label probabilities as mixing weights.