High Performance Language Technologies

Julie Arteza

01 September 2022

31 August 2025

EC-funded project

The EU-funded HPLT project applies high-performance computing to scale and advance language technologies. Taking advantage of recent advances in machine learning and astonishing storage capacities, it will create and process huge language datasets and produce language and translation models in a large number of languages.

The resulting models will be tested from various angles to ensure smooth integration, high accuracy, and regulatory compliance concerning privacy, unwanted biases and ethical issues. The models and datasets are expected to be a game changer in the language service market in the EU and beyond. The resulting models will be open, free and available from established language repositories for anyone interested in pursuing research or innovation projects.

A space that combines petabytes of natural language data with large-scale model training:

  • Lots of monolingual and multilingual data consistently formatted and curated
  • Efficient and high-quality language and translation models
  • Sustainable and reusable workflows using high-performance computing

