Introducing Gemma 2 Swahili

Gemma 2 Swahili hero image

We were really fascinated by the Gemma 2 Japanese releases last year, so much that we felt compelled to work on Swahili variations of Gemma 2 models.

We are excited to announce the release of Gemma 2 Swahili, a comprehensive suite of Gemma 2 language models specifically adapted for Swahili language understanding and generation. This release represents a significant step forward in making advanced AI technology accessible to the two hundred million plus speakers of the language.

All models are now available on Kaggle and Hugging Face, as well as comprehensive documentation on training, inference and evaluation. We are excited for you to try these models out and we look forward to your feedback.

Model Suite Overview

Key Capabilities

Sample Responses

Example 1 – Creative writing (Gemma 2 B Swahili IT)

Prompt
Tunga hadithi fupi kuhusu Twiga
Response
Twiga alikuwa mnyama mwitu wa asili ambaye alikuwa na furaha ya kuishi katika jiji la bandarini. Alikuwa amefungwa kwa nyumba yake ya kawaida, lakini kwa sababu ya uchovu wake, hakuwa na muda wa kucheza.

Hata hivyo, jiji lake lilikuwa likikumbwa na mabadiliko ya hali ya hewa. Maji yalikuwa yakipungua, na siku moja Twiga aligundua kuwa maji ya ziwa yalikuwa yametoweka. Hofu ilimvaa kwa sababu kupotea kwa maji kungemaanisha kupotea kwa makazi yake.

Twiga alitamani kurudisha maji ya ziwa lakini hakujua aanzie wapi. Alijaribu kuuliza barabarani bila mafanikio. Baada ya muda alimwona mzee aliyeketi karibu na ziwa na akaomba msaada.

Mzee huyo alimweleza kwamba siri ya kurudisha maji ilikuwa kwenye mtihani wa kuunganisha nguvu za viumbe wote wanaoishi kandokando ya ziwa. Twiga aliwasiliana na ndege, samaki, wanyama na watu. Kufikia jioni maji yalirudi na Twiga akagundua thamani ya ushirikiano. Alirejea nyumbani akiwa na furaha na shukrani kwa jamii yake mpya.

Example 2 – Comprehensive writing (Gemma 2 B Swahili IT)

Prompt
Eleza dhana ya uchumi wa kidijitali na umuhimu wake katika ulimwengu wa leo
Response
Uchumi wa kidijitali ni mfumo wa kiuchumi unaotumia teknolojia za mtandaoni kuwezesha uzalishaji, usambazaji na matumizi ya bidhaa na huduma. Ndani ya mfumo huu data huchukua nafasi ya mali ghafi kama rasilimali kuu, na miamala hufanyika kwa kasi na uwazi kupitia majukwaa ya kidijitali.

Umuhimu wake unatokana na vipengele vinne vikuu. Kwanza, unafungua masoko mapya kwa kupunguza vikwazo vya kijiografia na gharama za miamala. Pili, unaharakisha uvumbuzi kwa kurahisisha ushirikiano wa kimataifa na upatikanaji wa habari. Tatu, unapunguza gharama za uendeshaji kupitia kiotomatiki na uchanganuzi wa data. Nne, unaleta uwazi na ufuatiliaji wa rasilimali kupitia leja za kidijitali na mifumo ya malipo ya papo hapo.

Kwa dunia ya leo ambako taarifa husafiri kwa mwanga wa nyuzi, uchumi wa kidijitali umekuwa nguzo ya ushindani wa kitaifa na maendeleo jumuishi. Biashara zinazoikumbatia zinapata ufanisi, wakati watumiaji hunufaika na chaguo pana na bei nafuu.

Performance Benchmarks

Swahili MMLU (Massive Multitask Language Understanding)

Swahili MMLU benchmark chart

Fine-tuning on curated Swahili corpora lifts accuracy dramatically. The 2 B model rises from 15 % to 34 %. The 9 B checkpoint surpasses 55 %, a level previously reachable only with much larger multilingual systems. Even after 4-bit quantization the 27 B model retains 54 %, delivering frontier performance at a fraction of the memory footprint.

Sentiment Analysis Results

Swahili sentiment analysis chart

Sentiment classification shows similar gains. The 2 B instruction model jumps from 49 % to 66.5 %. The 27 B quantized model sets a new record at 88.5 %. These results confirm that targeted Swahili fine-tuning consistently outperforms generic multilingual pre-training.

Conclusion

Gemma 2 Swahili proves that purposeful data collection, continual pre-training and careful instruction tuning can deliver frontier-level language capability for low-resource languages. We are working hard to provide more frontier capabilities to the 200M+ Swahili speakers.