Beyond Perplexity: UTF-8 Validity in Byte-aware Language ModelsPublished in Forty-Third International Conference on Machine Learning (ICML), 2026Share on Twitter Facebook LinkedIn Previous Next