A team of engineers, scientists, and a semiconductor manufacturer from Silicon Valley worked together to publish sophisticated Arabic language software that can power applications for generative AI.
With 13 billion parameters, the new massive language model known as Jais was created from a large collection of data mixing Arabic and English, some of which came via computer code.
Supercomputers built by Silicon Valley-based Cerebras Systems, which makes chips the size of dinner plates that compete with Nvidia’s potent AI hardware, were used to develop the new language model.
Jais, which takes its name from the highest mountain in the United Arab Emirates, is the result of a partnership between Cerebras, the Mohamed bin Zayed University of Artificial Intelligence, and the AI-focused subsidiary Inception of the Abu Dhabi-based tech company G42.
According to Timothy Baldwin, professor of artificial intelligence at Mohamed bin Zayed University, the limited availability of Arabic data to train a model of Jais’ scale and the computer code contained in the English language data helped train the model’s reasoning abilities.
An open source licence will be used to make Jais accessible.