New fully open and transparent large language model launches -- it’s Swiss, of course


The Swiss have something of a reputation for being methodical -- particularly when it comes to things like banking -- so it’s no surprise that they take a similar approach to creating a large language model.
EPFL, ETH Zurich and the Swiss National Supercomputing Centre (CSCS) have today released Apertus, a large-scale, open, multilingual LLM. The name -- Latin for ‘open’ -- highlights its distinctive feature: the entire development process, including the architecture, model weights, and training data and recipes, is openly accessible and fully documented.
As a fully open language model, Apertus allows researchers, professionals and enthusiasts to build on the model and adapt it to their specific needs, as well as to inspect any part of the training process. This distinguishes Apertus from models that make only selected components accessible.
“Apertus is not a conventional case of technology transfer from research to product. Instead, we see it as a driver of innovation and a means of strengthening AI expertise across research, society and industry,” says Thomas Schulthess, director of CSCS and professor at ETH Zurich.
The upcoming Swiss {ai} Weeks hackathons will be the first opportunity for developers to experiment hands-on with Apertus, test its capabilities, and provide feedback for improvements to future versions. Swisscom will provide hackathon participants with a dedicated interface, making it easier to interact with the model. As of today, Swisscom business customers can access the Apertus model via Swisscom’s sovereign Swiss AI platform.
For people outside Switzerland, the Public AI Inference Utility will make Apertus accessible as part of a global movement for public AI.
“Apertus is built for the public good. It stands among the few fully open LLMs at this scale and is the first of its kind to embody multilingualism, transparency, and compliance as foundational design principles,” says Imanol Schlag, technical lead of the LLM project and research scientist at ETH Zurich.
Apertus is designed with transparency at its core, ensuring full reproducibility of the training process. Alongside the models, the research team has published a range of resources: comprehensive documentation and source code for the training process, the datasets used, and model weights including intermediate checkpoints -- all released under a permissive open-source license that also allows commercial use.
Future versions aim to expand the model family, improve efficiency, and explore domain-specific adaptations in fields such as law, climate, health and education. They are also expected to integrate additional capabilities, while maintaining strong standards for transparency.
You can find out more on the ETH Zurich site.