End-to-end Planner Training for Language Modeling

Published on arXiv, 2024

Recommended citation: Nathan Cornille, Florian Mai, Jingyuan Sun, Marie-Francine Moens. (2024). "End-to-end Planner Training for Language Modeling." arXiv:2410.12492. https://arxiv.org/abs/2410.12492

We propose a differentiable method for joint fine-tuning of language models with planning modules by using predicted label probabilities as mixing weights.
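As a rough illustration of why mixing weights make the pipeline differentiable, the sketch below contrasts a hard label choice (argmax, which passes no gradient back to the planner) with a probability-weighted average. It is one plausible reading of the summary above, not the paper's implementation: the function names, the use of label embeddings as the quantities being mixed, and the toy values are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Turn planner logits into label probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def hard_select(logits, embeddings):
    """Pick the single most likely label's embedding.

    The argmax is piecewise constant in the logits, so no useful gradient
    reaches the planner through this path.
    """
    best = max(range(len(logits)), key=lambda i: logits[i])
    return list(embeddings[best])


def soft_mix(logits, embeddings):
    """Weighted average of label embeddings, with probabilities as weights.

    Every operation here is smooth in the logits, so in an autodiff
    framework a language-modeling loss could be backpropagated into the
    planner.
    """
    probs = softmax(logits)
    dim = len(embeddings[0])
    return [
        sum(p * emb[d] for p, emb in zip(probs, embeddings))
        for d in range(dim)
    ]


# Hypothetical toy values: three candidate labels, 2-dimensional embeddings.
planner_logits = [2.0, 0.5, -1.0]
label_embeddings = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]

hard = hard_select(planner_logits, label_embeddings)
mixed = soft_mix(planner_logits, label_embeddings)
print("hard selection:", hard)
print("soft mixture:  ", mixed)
```

In this reading, the language model would be conditioned on `mixed` rather than `hard` during joint fine-tuning; a small change in the planner's logits then produces a small change in the conditioning vector, which is what allows gradients to flow.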

Download the paper here: https://arxiv.org/abs/2410.12492