This report documents an experimental effort to improve Krio (Sierra Leone Krio) understanding and generation in the N-ATLaS (8B-parameter) large language model through parameter-efficient fine-tuning (PEFT). Using the Unsloth library and QLoRA in a consumer-grade Colab environment, we fine-tuned a LoRA adapter on the publicly available English-Krio parallel corpus (michsethowusu/english-krio_sentence-pairs_mt560). Initial results showed strong performance on direct translation tasks but revealed significant domain overfitting to the religious content prevalent in the dataset, leading to inappropriate responses in open-ended chat scenarios. We discuss mitigation strategies, including extended training, gentle data reweighting, and inference-time guardrails, and highlight both the promise and the challenges of low-resource language adaptation using limited, biased parallel data.
Krio, the lingua franca of Sierra Leone spoken by over 95% of the population, remains severely underrepresented in modern large language models (LLMs). While multilingual models have made strides in high-resource languages, low-resource creoles like Krio often exhibit poor fluency, limited vocabulary coverage, and near-zero conversational ability in base models.
N-ATLaS, an 8B-parameter instruction-tuned model built on Llama-3-8B and specifically adapted for African linguistic contexts (with strong performance on Nigerian languages and Pidgin), represents a promising Afrocentric starting point. This project aimed to extend its capabilities to Krio without requiring massive compute resources, using parameter-efficient fine-tuning to create a lightweight adapter suitable for deployment on consumer hardware or mobile devices in low-bandwidth settings.
The primary objectives were to: (1) adapt N-ATLaS to Krio with a lightweight LoRA adapter trainable on modest cloud hardware, (2) evaluate both translation quality and open-ended conversational fluency, and (3) document failure modes arising from the available training data.
All experiments were conducted in Google Colab using A100 GPUs.
Parameter-efficient fine-tuning methods, particularly LoRA and its quantized variant QLoRA, have become standard for adapting large models to new domains or languages with limited compute. Unsloth has further optimized these techniques for Llama-family models, achieving 3 to 5× speedups and reduced memory usage.
Prior work on African languages has largely focused on Nigerian languages (e.g., Hausa, Yoruba, Igbo, Pidgin) or higher-resource Bantu/Swahili languages. Krio-specific efforts remain scarce; the primary public resource is the machine-translated parallel corpus used here, supplemented by smaller Bible translations and community contributions. Recent initiatives (Masakhane, Lelapa AI, etc.) emphasize continued pre-training followed by instruction tuning, but few target Sierra Leonean languages specifically.
This work builds on the Afrocentric foundation of N-ATLaS while demonstrating practical QLoRA adaptation for an underrepresented West African creole.
We used the Hugging Face dataset michsethowusu/english-krio_sentence-pairs_mt560, containing approximately 42,000 English-Krio sentence pairs. The data appears to be machine-translated (likely via NLLB-200 or similar) from English source material.
Key characteristics: roughly 42,000 sentence pairs; machine-translated provenance, which limits idiomatic reliability; and a heavy topical skew toward religious source material, which proved consequential during evaluation.
For the initial experiment, we subsampled 10,000 examples for rapid iteration. Data was formatted in Alpaca-style instruction format with a 50/50 split between explicit translation tasks and simulated chat responses.
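The formatting step can be sketched as follows. This is a minimal illustration of the Alpaca-style layout with the 50/50 translation/chat split described above; the exact prompt wordings and the chat-simulation template are assumptions, not the templates used in the actual run.

```python
import random

# Illustrative Alpaca-style formatting for English-Krio pairs.
# Only the 50/50 translation/chat split and the instruction format
# follow the report; the prompt wording is an assumption.

ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n{output}"
)

def format_pair(english: str, krio: str, as_translation: bool) -> str:
    """Format one pair as a translation task or a simulated chat turn."""
    if as_translation:
        instruction = "Translate the following English sentence to natural Krio."
    else:
        # Simulated chat: the English sentence stands in for a user message
        # and the Krio side is treated as the assistant's reply.
        instruction = "Respond to the user in natural Krio."
    return ALPACA_TEMPLATE.format(instruction=instruction, input=english, output=krio)

def build_examples(pairs, seed=3407):
    """Shuffle, then alternate formats to get an even 50/50 split."""
    pairs = list(pairs)
    random.Random(seed).shuffle(pairs)
    return [format_pair(en, kr, i % 2 == 0) for i, (en, kr) in enumerate(pairs)]
```

In practice this function would run over the 10,000-example subsample before tokenization.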
Base model: NCAIR1/N-ATLaS (8B parameters, Llama-3-8B-Instruct derivative, Afrocentric instruction tuning).
Fine-tuning setup: QLoRA (4-bit quantized base weights with trainable LoRA adapters) via the Unsloth library.
Training completed in approximately 20 minutes on a Colab A100 GPU, yielding ~42 million trainable parameters (0.52% of total).
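The reported trainable-parameter count is consistent with LoRA rank r = 16 applied to all linear projection layers of Llama-3-8B; note that r = 16 and the target-module list here are inferred from the count, not confirmed hyperparameters. A quick arithmetic check:

```python
# Back-of-the-envelope check that ~42M trainable parameters (0.52% of total)
# matches LoRA rank r = 16 on every linear projection of Llama-3-8B.
# r = 16 and the target modules are assumptions inferred from the count.

HIDDEN = 4096         # Llama-3-8B hidden size
INTERMEDIATE = 14336  # MLP intermediate size
KV_DIM = 1024         # 8 KV heads x 128 head dim (grouped-query attention)
LAYERS = 32
TOTAL_PARAMS = 8_030_261_248  # Llama-3-8B total parameter count
R = 16

def lora_params(d_in, d_out, r=R):
    # LoRA adds two low-rank factors per frozen matrix: A (r x d_in), B (d_out x r).
    return r * (d_in + d_out)

per_layer = (
    2 * lora_params(HIDDEN, HIDDEN)          # q_proj, o_proj
    + 2 * lora_params(HIDDEN, KV_DIM)        # k_proj, v_proj
    + 2 * lora_params(HIDDEN, INTERMEDIATE)  # gate_proj, up_proj
    + lora_params(INTERMEDIATE, HIDDEN)      # down_proj
)
trainable = per_layer * LAYERS
print(trainable, round(100 * trainable / TOTAL_PARAMS, 2))  # 41943040 0.52
```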

[Figure: training metrics — loss curve, learning rate schedule, global step, epoch progress, gradient norm]

[Figure: GPU utilization and power usage during training]
Direct translation prompts produced high-quality, idiomatic Krio output even after minimal training:
Prompt: "Translate the following English sentence to natural Krio: You're not ready for this."
Output: "Yu nɔ rɛdi fɔ dis."
This demonstrates successful capture of basic Krio syntax (e.g., the preverbal negator "nɔ") and of the orthographic conventions used in the corpus.
Open-ended chat revealed severe domain overfitting:
Example exchange:
● User: "hello" → Model: "Bɔku gud mɔnin"
● User: "ɔl rɛdi fɔ tɔk to yu" → Model: "Jizɔs, na i go gi mi ɛn i go gi dɛn dɛnsɛf pawa."
● Subsequent turns continued with unrelated religious references.
The model defaulted to high-probability sequences from the dominant religious subset of the training data when context was ambiguous.
The primary limitation is dataset bias. The heavy religious skew caused the model to overfit to scriptural phrasing, producing contextually inappropriate responses in casual conversation. This is a known risk when using machine-translated parallel corpora derived from narrow source domains.
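One way to quantify and soften this skew is a keyword-based downsampling pass over the English side of each pair. A minimal sketch, where the keyword list and the 20% cap are illustrative assumptions, not values used in the actual experiment:

```python
import random

# Sketch of gentle data reweighting: cap the share of religious-domain pairs.
# RELIGIOUS_KEYWORDS and the 20% cap are illustrative assumptions.

RELIGIOUS_KEYWORDS = {"god", "jesus", "lord", "church", "bible", "pray"}

def is_religious(english_sentence: str) -> bool:
    words = {w.strip(".,!?;:").lower() for w in english_sentence.split()}
    return bool(words & RELIGIOUS_KEYWORDS)

def reweight(pairs, max_religious_frac=0.2, seed=3407):
    """Downsample religious pairs to at most max_religious_frac of the data."""
    religious = [p for p in pairs if is_religious(p[0])]
    secular = [p for p in pairs if not is_religious(p[0])]
    # Solve n_rel / (n_rel + n_sec) <= f  =>  n_rel <= f * n_sec / (1 - f)
    cap = int(max_religious_frac * len(secular) / (1 - max_religious_frac))
    rng = random.Random(seed)
    kept = rng.sample(religious, min(cap, len(religious)))
    return secular + kept
```

A real pass would want a broader keyword list (or a lightweight classifier), since machine-translated religious text is not reliably flagged by a handful of English terms.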
Mitigation strategies explored: extended training on the full corpus, gentle data reweighting to reduce the religious skew, and inference-time guardrails.
These approaches are expected to substantially reduce drift while preserving translation quality.
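An inference-time guardrail can be as simple as a post-generation topical check with a neutral fallback reply. A sketch under stated assumptions: the marker list, the `generate` callable, and the Krio fallback phrasing are all illustrative, not part of the deployed system.

```python
# Lightweight inference-time guardrail: if the model drifts into religious
# content the user did not bring up, substitute a neutral fallback reply.
# The marker list and fallback phrasing are illustrative assumptions.

RELIGIOUS_MARKERS = {"jizɔs"}  # extend with further Krio religious terms as needed

def mentions_religion(text: str) -> bool:
    text = text.lower()
    return any(marker in text for marker in RELIGIOUS_MARKERS)

def guarded_reply(user_msg, generate, fallback="Kushɛ! Aw yu du?"):
    """Call the model, then suppress unprompted religious drift."""
    reply = generate(user_msg)
    if mentions_religion(reply) and not mentions_religion(user_msg):
        return fallback
    return reply
```

A stricter variant would regenerate with a topic-constraining system prompt instead of returning a canned fallback.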
The experiment also validates the accessibility of state-of-the-art fine-tuning: individual researchers in Freetown were able to produce a functional Krio-enhanced adapter using only limited cloud resources.
We successfully created a LoRA adapter that significantly improves Krio translation capability in the Afrocentric N-ATLaS model. However, conversational fluency remains constrained by the topical bias in the only substantial public parallel corpus. This underscores the urgent need for diverse, community-sourced Krio text and conversation data.
The resulting adapter (to be uploaded as dorb-ai/krio-natlas-adapter-v1) provides a practical starting point for Sierra Leonean developers and serves as a proof-of-concept for rapid low-resource adaptation.