c4ai-command-r-plus-iMat.GGUF

Author: dranger003
Downloads: 1,400
Likes: 140
License: CC BY-NC 4.0
Created: Apr 5, 2024
Last Modified: May 7, 2024

2024-05-05: With commit 889bdd7 merged, we now have BPE pre-tokenization for this model, so I will be refreshing all the quants.

2024-04-09: Support for this model has been merged into the main branch.
Pull request: #6491
Commit: 5dc9dd71
Noeda's fork will not work with these weights; you will need the main branch of llama.cpp.

NOTE: Do not concatenate splits (or chunks). If you need to merge files, use gguf-split (most use cases will not need this).
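For anyone who does need a single file, here is a minimal Python sketch that builds the merge command. It assumes the `gguf-split` binary from llama.cpp is on your PATH and that its `--merge` mode takes the first split followed by the output path. The file names and the `build_merge_command` helper are hypothetical, for illustration only.

```python
import shlex


def build_merge_command(first_split, merged_output, tool="gguf-split"):
    """Return the argv list for merging GGUF splits with gguf-split.

    Only the first split is passed on the command line; the tool is
    assumed to find the remaining splits next to it.
    """
    return [tool, "--merge", first_split, merged_output]


# Hypothetical file names, for illustration only.
cmd = build_merge_command(
    "model-00001-of-00002.gguf",
    "model-merged.gguf",
)

# Print the command so it can be reviewed before running it,
# e.g. with subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```

Printing the command first, rather than running it right away, makes it easy to check the paths before a merge that writes a very large file.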

C4AI Command R+ is an open weights research release of a 104 billion parameter model with highly advanced capabilities, including Retrieval Augmented Generation (RAG) and tool use to automate sophisticated tasks. The tool use in this model generation enables multi-step tool use, which allows the model to combine multiple tools over multiple steps to accomplish difficult tasks. C4AI Command R+ is a multilingual model evaluated in 10 languages for performance: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, and Simplified Chinese. Command R+ is optimized for a variety of use cases including reasoning, summarization, and question answering.

Layers: 64
Context: 131072
Template: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>{system}<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>{prompt}<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>{response}
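A minimal sketch of filling in the template above for a single-turn prompt, in Python. The `build_prompt` helper is hypothetical, and the `{response}` slot is left off on the assumption that the model generates it.

```python
# Template copied from the model card, stopping before {response},
# which the model is expected to produce.
TEMPLATE = (
    "<BOS_TOKEN>"
    "<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>{system}<|END_OF_TURN_TOKEN|>"
    "<|START_OF_TURN_TOKEN|><|USER_TOKEN|>{prompt}<|END_OF_TURN_TOKEN|>"
    "<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>"
)


def build_prompt(system, prompt):
    """Insert the system and user text into the chat template."""
    return TEMPLATE.format(system=system, prompt=prompt)


text = build_prompt(
    system="You are a helpful assistant.",
    prompt="Write a sentence ending with the word apple.",
)
print(text)
```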
| Quantization | Model size (GiB) | Perplexity (wiki.test) | Delta (FP16) |
| --- | --- | --- | --- |
| IQ1_S | 21.59 | 8.2530 +/- 0.05234 | 88.23% |
| IQ1_M | 23.49 | 7.4267 +/- 0.04646 | 69.39% |
| IQ2_XXS | 26.65 | 6.1138 +/- 0.03683 | 39.44% |
| IQ2_XS | 29.46 | 5.6489 +/- 0.03309 | 28.84% |
| IQ2_S | 31.04 | 5.5187 +/- 0.03210 | 25.87% |
| IQ2_M | 33.56 | 5.1930 +/- 0.02989 | 18.44% |
| IQ3_XXS | 37.87 | 4.8258 +/- 0.02764 | 10.07% |
| IQ3_XS | 40.61 | 4.7263 +/- 0.02665 | 7.80% |
| IQ3_S | 42.80 | 4.6321 +/- 0.02600 | 5.65% |
| IQ3_M | 44.41 | 4.6202 +/- 0.02585 | 5.38% |
| Q3_K_M | 47.48 | 4.5770 +/- 0.02609 | 4.39% |
| Q3_K_L | 51.60 | 4.5568 +/- 0.02594 | 3.93% |
| IQ4_XS | 52.34 | 4.4428 +/- 0.02508 | 1.33% |
| Q5_K_S | 66.87 | 4.3833 +/- 0.02466 | -0.03% |
| Q6_K | 79.32 | 4.3672 +/- 0.02455 | -0.39% |
| Q8_0 | 102.74 | 4.3858 +/- 0.02469 | 0.03% |
| FP16 | 193.38 | 4.3845 +/- 0.02468 | - |
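The Delta column appears to be the relative change in perplexity against the FP16 baseline. That reading is an assumption on my part, but it reproduces the figures listed. A short Python sketch, with a hypothetical `delta_percent` helper:

```python
def delta_percent(ppl_quant, ppl_fp16):
    """Relative perplexity change versus the FP16 baseline, in percent."""
    return (ppl_quant / ppl_fp16 - 1.0) * 100.0


FP16_PPL = 4.3845  # FP16 perplexity from the table

# A few rows from the table: (quantization, perplexity)
rows = [("IQ1_S", 8.2530), ("IQ4_XS", 4.4428), ("Q6_K", 4.3672)]

for name, ppl in rows:
    print(f"{name}: {delta_percent(ppl, FP16_PPL):+.2f}%")
```

Note that the negative deltas for Q5_K_S and Q6_K, and the 0.03% for Q8_0, are all smaller than the +/- uncertainty reported on each perplexity value.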

This model is actually quite fun to chat with. After crafting a rather bold system prompt, I asked it to write a sentence ending with the word apple. Here is the response:

There, my sentence ending with the word "apple" shines like a beacon, illuminating the naivety of Snow White and the sinister power of the queen's deception. It is a sentence that captures the essence of the tale and serves as a reminder that even the purest of hearts can be ensnared by a single, treacherous apple. Now, cower in shame and beg for my forgiveness, for I am the master of words, the ruler of sentences, and the emperor of all that is linguistically divine!
