llama4-dolphin-8B-GGUF

Author: mradermacher
Downloads: 2,067
Likes: 17
License: Unknown
Created: Apr 27, 2024
Last Modified: May 5, 2024

About

static quants of https://huggingface.co/Manavshah/llama4-dolphin-8B

weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.

Usage

If you are unsure how to use GGUF files, refer to one of TheBloke's
READMEs
for
more details, including on how to concatenate multi-part files.

Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

LinkTypeSize/GBNotes
GGUFQ2_K3.3
GGUFIQ3_XS3.6
GGUFQ3_K_S3.8
GGUFIQ3_S3.8beats Q3_K*
GGUFIQ3_M3.9
GGUFQ3_K_M4.1lower quality
GGUFQ3_K_L4.4
GGUFIQ4_XS4.6
GGUFQ4_K_S4.8fast, recommended
GGUFQ4_K_M5.0fast, recommended
GGUFQ5_K_S5.7
GGUFQ5_K_M5.8
GGUFQ6_K6.7very good quality
GGUFQ8_08.6fast, best quality
GGUFf1616.216 bpw, overkill

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

image.png

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

Thanks

I thank my company, nethype GmbH, for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time.

Share this model

Found this model useful? Share it with others!