Research News
Papers and technical reports from the Apertus project.
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Main technical report — architecture, training methodology, data pipeline, evaluation
Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Shows that respecting robots.txt opt-outs causes minimal performance degradation
Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks
Research on memorization patterns and copyright risks in LLMs
Quantifying Training Data Retention in Large Language Models: An Analysis of Pretraining Factors and Mitigation Strategies
Analysis of memorization and mitigation strategies applied in Apertus
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Multilingual evaluation benchmark across 44 languages
Deriving Activation Functions Using Integration
xIELU activation function used in Apertus architecture
Visit our 📖 Zotero group for further literature.