CRD716
/

ggml-vicuna-1.1-quantized

Name: CRD716/ggml-vicuna-1.1-quantized
Author: CRD716

NOTE: DEPRECIATED, BETTER PEOPLE DO THIS FOR THE NEW VERSIONS NOW

Legacy is for llama.cpp setups older than https://github.com/ggerganov/llama.cpp/pull/1508, the regular is faster but does not work on old versions.

This is a ggml version of vicuna 7b and 13b. This is the censored model, a similar 1.0 uncensored 13b model can be found at https://huggingface.co/eachadea/ggml-vicuna-13b-1.1.

Hosted inference API

Inference API does not yet support adapter-transformers models for this pipeline type.

Spaces using CRD716/ggml-vicuna-1.1-quantized 21