minilm offers a complete pipeline from tokenisation and encoding through training, evaluation, and text generation while staying dependency-light and easy to extend.

pad_token str "" Padding token; ...