2024-06-24 Vocabulary expansion for non-SentencePiece based BPE tokeniser A note on how to do vocabulary expansion with LLaMA3, OLMo, etc.