Article
Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library ๐, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models, German Language Models, Bavarian NLP ๐ฅจ
Recent Activity
upvoted a collection about 7 hours ago
Nemotron-Pre-Training-Datasets upvoted a paper about 10 hours ago
Lost in Backpropagation: The LM Head is a Gradient Bottleneck upvoted a collection 1 day ago
NVIDIA Nemotron v3