Skip to content

LLM Formats

This section is available as slides which is presented on the workshop. This text version include some additional notes. You can also access the slide version here.

Overview

  • Formats of LLM models
  • Formats of numbers
  • Quantization of LLM
  • Quantization and performance

Formats of LLM models

So you want to use a LLM model

What the name means

  • Llama-3.3: model (architecture)
  • 70B: size / number of parameters
  • Instruct: fine-tuning
  • AWQ-INT4: quantization
  • GGUF: model format

File-formats of LLMs