This talk gives an overview of the process of building GPT-SW3, a multilingual LLM trained in 6 languages. Felix covers data ingestion and preprocessing, training, and evaluation of the model, as well as future plans. The focus of the presentation will be on multilingual aspects and challenges.

Add comment

Your email address will not be published. Required fields are marked *

Categories

All Topics