Introduction

Tools and Technologies

Inspiration

 

1. Common AI Model Formats

  • Link: https://huggingface.co/blog/ngxson/common-ai-model-formats
  • Why relevant: Explains the fundamental differences between PyTorch, Safetensors, and GGUF formats. Critical for your hardware setup because it covers how GGUF uses memory-mapped loading (mmap()) to run large models without preloading the full weight matrix into your 8 GB of RAM.
  • Estimated Study Time: 15 minutes

2. A Practical Guide to GGUF Quantization Selection

3. Official Ollama Documentation & Setup Guide

  • Link: https://github.com/ollama/ollama/blob/main/README.md
  • Why relevant: The authoritative source for installing Ollama on Ubuntu, pulling GGUF models from HuggingFace, and running a local OpenAI-compatible inference server. More trustworthy and maintained than community gists.
  • Estimated Study Time: 20 minutes

4. llama.cpp Official Build Guide (Linux)

  • Link: https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md
  • Why relevant: The official compilation instructions for llama.cpp on Ubuntu/Debian — covers CMake flags, OpenBLAS acceleration for CPU-bound matrix ops, and build variants. Authoritative source, unlike third-party mirror sites.
  • Estimated Study Time: 30 minutes

5. llama.cpp Discussions: Optimizing CPU-only and 8GB RAM Performance

  • Link: https://github.com/ggml-org/llama.cpp/discussions/21136
  • Why relevant: Community-verified OS-level tweaks for Debian/Ubuntu with 8 GB RAM. Covers vm.swappiness tuning and the --mlock flag to pin models in physical memory and prevent inference freezes.
  • Estimated Study Time: 20 minutes

6. Monitor & Control CPU Temperature on Ubuntu 22.04

  • Link: https://help.ubuntu.com/community/SensorInstallHowto
  • Why relevant: Directly addresses the competition's 10-point thermal penalty. Teaches you how to install lm-sensors, read core/package temperatures in real time, and configure thermald and CPU frequency governors to keep your chip below the 85°C disqualification threshold under sustained inference load.
  • Estimated Study Time: 20 minutes

7. Build Small Hackathons with Cohere Models

8. Masakhane: African NLP Benchmarks and Datasets

  • Link: https://github.com/masakhane-io/masakhane-nlp
  • Why relevant: Masakhane is the leading open-source African NLP research community. This repo provides datasets, benchmarks, and fine-tuning baselines across 50+ African languages — the most direct resource for building a model that scores well on the African Use Case Bonus.
  • Estimated Study Time: 30 minutes (orientation + exploring datasets)

9. LLM Benchmark for Throughput via Ollama

  • Link: https://github.com/aidatatools/ollama-benchmark
  • Why relevant: A Python CLI tool that runs structured throughput benchmarks against your local Ollama server and reports tokens-per-second. Directly maps to the Model Throughput Performance judging criterion.
  • Estimated Study Time: 15 minutes

10. llama-bench: Syntax, Usage and Documentation

 

Contact Us & Support Channels

Discord: bit.ly/ADTC_Discord