LLMs

A list of all the LLMs in the market, feel free to add your LLMs here but please include relevant and accurate info

Curated by:

@Beau Django

With help from:

GTP-4 Turbo

https://platform.openai.com/docs/models/gpt-4-and-gpt-4-turbo

GPT-4

https://platform.openai.com/docs/models/gpt-4-and-gpt-4-turbo

Gemini

https://blog.google/technology/ai/google-gemini-ai/

Lumiere by Google

https://lumiere-video.github.io/

GPT-3.5

https://platform.openai.com/docs/models/gpt-3-5

Goody 2

https://www.goody2.ai/goody2-modelcard.pdf

OpenAI Text Moderation

https://platform.openai.com/docs/models/moderation

Stabilityai - Expert ensemble, latent diffusion

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0

Facebook - Repository English TTS model

https://huggingface.co/facebook/mms-tts-eng

NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

Pyannote - Reproduce Interspeech 2021 results with segmentation

https://huggingface.co/pyannote/segmentation

LLaMa 2

https://ai.meta.com/llama/

Distilbert - Distilled RoBERTa-base model

https://huggingface.co/distilbert/distilroberta-base

Dslim NER - Fine-tuned BERT, state-of-the-art NER

https://huggingface.co/dslim/bert-base-NER

FacebookAI - English MLM pretrained model

https://huggingface.co/FacebookAI/roberta-large

PaLM 2

https://ai.google/discover/palm2/

FacebookAI - English MLM pretrained model

https://huggingface.co/FacebookAI/roberta-base

upstage/SOLAR-10.7B-Instruct-v1.0

https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0

Nexusflow/Starling-LM-7B-beta

https://huggingface.co/Nexusflow/Starling-LM-7B-beta

Facebook Opt - OPT introduced in May 2022

https://huggingface.co/facebook/opt-2.7b

ReplaceAnything

https://huggingface.co/spaces/modelscope/ReplaceAnything

Mistralai - Mistral-7B-Instruct-v0.2: Improved LLM

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2

Distilbert - DistilBERT base, SST-2 fine-tuned

https://huggingface.co/distilbert/distilbert-base-uncased-finetuned-sst-2-english

sentence t5-base - 768D vector space.

https://huggingface.co/sentence-transformers/sentence-t5-base

Weaver - Creative Writing Models

https://huggingface.co/papers/2401.17268

Google Byt5 - Tokenizer-free T5 version, architecture similar to MT5

https://huggingface.co/google/byt5-small

Cross-encoder - Query encoding, passage matching

https://huggingface.co/cross-encoder/ms-marco-MiniLM-L-6-v2

Pyannote speaker - Pipeline Sans onnxruntime, like pyannote

https://huggingface.co/pyannote/speaker-diarization-3.1

cerebras/Cerebras-GPT-111M

https://huggingface.co/cerebras/Cerebras-GPT-111M

Openai-community - English CLM pretrained model

https://huggingface.co/openai-community/gpt2

Stabilityai - Model card for Stable Diffusion v2

https://huggingface.co/stabilityai/stable-diffusion-2

Marieke93 - Fine-tuned MiniLM on evidence types.

https://huggingface.co/marieke93/MiniLM-evidence-types

Pyannote - x-vector TDNN, SincNet features

https://huggingface.co/pyannote/embedding

Distiluse-base - Text to 512D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2

Openai Clip - Model trained from scratch

https://huggingface.co/openai/clip-vit-large-patch14-336

j-hartmann emotion - English text, Ekman's 6 emotions, trained on 6 datasets

https://huggingface.co/j-hartmann/emotion-english-distilroberta-base

Cardiffnlp - roBERTa-base, 58M tweets, TweetEval sentiment finetuned

https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment

anthropic/claude-3-haiku:beta

https://openrouter.ai/models/anthropic/claude-3-haiku:beta

MoritzLaurer - Trained on MultiNLI, Fever-NLI, ANLI: 763,913 pairs

https://huggingface.co/MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli

Contriever - HuggingFace transformers require mean pooling

https://huggingface.co/facebook/contriever

Google-Gemini

https://blog.google/technology/ai/google-gemini-ai/#sundar-note

Prajjwal1 - Converted Google BERT PyTorch model

https://huggingface.co/prajjwal1/bert-small

Jonatasgrosman - Fine-tuned with OVHcloud GPU credits

https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

Sentence-transformers - Maps text to 384-dimensional vectors

https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2

sentence Paraphrase - Text to 384D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

Yiyanghkust - Pre-trained on financial text

https://huggingface.co/yiyanghkust/finbert-tone

WhiteRabbitNeo-33B

https://www.whiterabbitneo.com/

Ehsanaghaei - Specific RoBERTa for cybersecurity

https://huggingface.co/ehsanaghaei/SecureBERT

Google-bert - English MLM pretrained model

https://huggingface.co/google-bert/bert-base-cased

Phind-CodeLlama-34B-v2

https://huggingface.co/Phind/Phind-CodeLlama-34B-v2

Tsmatz - Fine-tuned xlm-roberta-base

https://huggingface.co/tsmatz/xlm-roberta-ner-japanese

OpenVoice

https://huggingface.co/myshell-ai/OpenVoice

Pyannote - Requires pyannote.audio v3.1+

https://huggingface.co/pyannote/wespeaker-voxceleb-resnet34-LM

Guillaumekln - OpenAI Whisper-large-v2 to CTranslate2 conversion

https://huggingface.co/guillaumekln/faster-whisper-large-v2

google/gemini-pro

https://openrouter.ai/models/google/gemini-pro

Cardiffnlp - ~124M tweets, TweetEval sentiment finetuned.

https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment-latest

sentence MiniLM - Sentence-embeddings: 384D, cluster, search

https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2

Facebook Encodec - EnCodec model card: Real-time audio codec

https://huggingface.co/facebook/encodec_24khz

Microsoft - Pre-trained multimodal Transformer

https://huggingface.co/microsoft/layoutlmv3-base

Lxyuan - Distilled, zero-shot, Multilingual Sentiment

https://huggingface.co/lxyuan/distilbert-base-multilingual-cased-sentiments-student

Openai Clip - Model card from CLIP repository

https://huggingface.co/openai/clip-vit-base-patch32

Cardiffnlp - XLM-roBERTa-base, ~198M tweets, sentiment finetuned

https://huggingface.co/cardiffnlp/twitter-xlm-roberta-base-sentiment

Microsoft - Improved BERT, RoBERTa with disentangled attention

https://huggingface.co/microsoft/deberta-base

Jean-Baptiste - Validated on emails/chat, outperformed others

https://huggingface.co/Jean-Baptiste/roberta-large-ner-english

Deepset - roberta-base, fine-tuned SQuAD2.0

https://huggingface.co/deepset/roberta-base-squad2

Google Bert - Pretrained model Top 104 languages, MLM

https://huggingface.co/google-bert/bert-base-multilingual-cased

Timm - MobileNet-v3 image classifier

https://huggingface.co/timm/mobilenetv3_large_100.ra_in1k

tiiuae/falcon-7b

https://huggingface.co/tiiuae/falcon-7b

abacusai/Smaug-72B-v0.1

https://huggingface.co/abacusai/Smaug-72B-v0.1

Facebook - Base model, 960h Librispeech, 16kHz fine-tuned

https://huggingface.co/facebook/wav2vec2-base-960h

Cardiffnlp - roBERTa-base, 58M tweets, TweetEval irony finetuned

https://huggingface.co/cardiffnlp/twitter-roberta-base-irony

Genie by Lumalabs - text to 3d

https://lumalabs.ai/genie?view=create

Sentence Transformers - Maps text to 768D vectors

https://huggingface.co/sentence-transformers/bert-base-nli-mean-tokens

Cardiffnlp - roBERTa-base, ~58M tweets, TweetEval offensive language

https://huggingface.co/cardiffnlp/twitter-roberta-base-offensive

Martin-ha - Toxic comments classification

https://huggingface.co/martin-ha/toxic-comment-model

Almanach - French RoBERTa-based model

https://huggingface.co/almanach/camembert-base

Stabilityai - Weights for diffusers library

https://huggingface.co/stabilityai/sd-vae-ft-mse

ai21labs/Jamba-v0.1

https://huggingface.co/ai21labs/Jamba-v0.1

CAMeL Lab - Sentiment Analysis, fine-tuned

https://huggingface.co/CAMeL-Lab/bert-base-arabic-camelbert-da-sentiment

google - Vit - ImageNet-21k pre-trained, 224x224 resolution

https://huggingface.co/google/vit-base-patch16-224-in21k

Phi 2

https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/

Morpheus 1 - multi-modal generative ultrasonic transformer

https://x.com/PropheticAI/status/1750534355242418300?s=20

Sentence Multi - Multi-QA MiniLM-L6, sentence-transformers

https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1

Google Electra - ELECTRA Self-supervised language learning.

https://huggingface.co/google/electra-base-discriminator

Stabilityai diffusion - SDXL Ensemble, latent diffusion pipeline

https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0

google Fnetbase - English MLM, NSP pretrained model

https://huggingface.co/google/fnet-base

Perplexity - PPLX Online

https://www.perplexity.ai/hub/blog/introducing-pplx-online-llms

Gemma 7b by Google

https://huggingface.co/google/gemma-7b

Distilbert - Paper introduces distilled BERT

https://huggingface.co/distilbert/distilbert-base-uncased

Runwayml - Stable Diffusion Text-to-image model

https://huggingface.co/runwayml/stable-diffusion-v1-5

anthropic/claude-3-opus

https://openrouter.ai/models/anthropic/claude-3-opus

Prajjwal1 - PyTorch model from converted Google BERT

https://huggingface.co/prajjwal1/bert-small

Mixtral 8x7B

https://mistral.ai/news/mixtral-of-experts/

mixedbread-ai/mxbai-embed-large-v1

https://huggingface.co/mixedbread-ai/mxbai-embed-large-v1

Dslim - Fine-tuned BERT, state-of-the-art NER

https://huggingface.co/dslim/bert-large-NER

Ahmedrachid - Pre-trained on financial texts

https://huggingface.co/ahmedrachid/FinancialBERT-Sentiment-Analysis

Openai - AI explores vision robustness

https://huggingface.co/openai/clip-vit-large-patch14

BAAI - FlagEmbedding Retrieval-augmented LLMs

https://huggingface.co/BAAI/bge-reranker-base

Timbrooks - Install diffusers in InstructPix2Pix using main for now

https://huggingface.co/timbrooks/instruct-pix2pix

Supabase - GTE models by Alibaba DAMO Academy

https://huggingface.co/Supabase/gte-small

LTP/small - Chinese NLP tools

https://huggingface.co/LTP/small

Ashishkr - Validate content well-formedness

https://huggingface.co/Ashishkr/query_wellformedness_score

Stabilityai/stablelm-zephyr-3b

https://huggingface.co/stabilityai/stablelm-zephyr-3b

Sentence-transformers - 768-dimensional text vectors

https://huggingface.co/sentence-transformers/all-mpnet-base-v2

Cmarkea - Fine-tuned for French NER.

https://huggingface.co/cmarkea/distilcamembert-base-ner

Nlpconnect - Flax image captioning model, PyTorch version

https://huggingface.co/nlpconnect/vit-gpt2-image-captioning

Stabilityai - Fast text-to-image synthesis

https://huggingface.co/stabilityai/sdxl-turbo

Jonatasgrosman - OVHcloud GPU credits used

https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english

Deci/DeciLM-7B

https://huggingface.co/Deci/DeciLM-7B/

Distilbert - Base multilingual model.

https://huggingface.co/distilbert/distilbert-base-multilingual-cased

MythoMax-L2-13B-GPTQ

https://huggingface.co/TheBloke/MythoMax-L2-13B-GPTQ

Bigscience/bloomz-560m

https://huggingface.co/

Sentence Qa - 768D dense vectors, semantic search

https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1

FMA-Net - video deblurring

https://kaist-viclab.github.io/fmanet-site/

SamLowe - roberta-base, go_emotions dataset, multi-label.

https://huggingface.co/SamLowe/roberta-base-go_emotions

Facebook Opt-125m - Introduced in metaseq's repository

https://huggingface.co/facebook/opt-125m

Facebook Sam - Object masks from input prompts

https://huggingface.co/facebook/sam-vit-huge

Jean Baptiste - Fine-tuned on wikinerfr

https://huggingface.co/Jean-Baptiste/camembert-ner

Microsoft Mdeberta - DeBERTa Improved BERT, RoBERTa

https://huggingface.co/microsoft/mdeberta-v3-base

Papluca - XLM-RoBERTa with classification head

https://huggingface.co/papluca/xlm-roberta-base-language-detection

Distilbert Tokens - 768D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/distilbert-base-nli-mean-tokens

Facebook - BART-large checkpoint, trained on MNLI

https://huggingface.co/facebook/bart-large-mnli

Albert - English MLM pretrained model

https://huggingface.co/albert/albert-base-v2

Google-bert - English model, MLM objective

https://huggingface.co/google-bert/bert-base-uncased

Cambridgeltl - SapBERT ACL 2021 cross-lingual extension

https://huggingface.co/cambridgeltl/SapBERT-from-PubMedBERT-fulltext

Google Patch - Pre-trained ImageNet-21k, fine-tuned ImageNet

https://huggingface.co/facebook/bart-large-cnn

Google Bert - Chinese model, input masking

https://huggingface.co/google-bert/bert-base-chinese

deepset/gbert-large

https://huggingface.co/deepset/gbert-large

Databricks - DBRX

https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm

Facebook Bart - Pre-trained English, fine-tuned CNN Daily Mail

https://huggingface.co/facebook/bart-large-cnn

Lmsys/vicuna-7b-v1.5

https://huggingface.co/lmsys/vicuna-7b-v1.5

Jonatasgrosman - Fine-tuned wav2vec2-xl53: Russian, Common Voice, CSS10

https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-russian

Mistralai - Generative Sparse Mixture of Experts

https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1

CohereForAI/c4ai-command-r-v01-4bit

https://huggingface.co/CohereForAI/c4ai-command-r-v01-4bit

kk08 - Custom Crypto Market Sentiment

https://huggingface.co/kk08/CryptoBERT

Microsoft Beit - Pre-trained ImageNet-22k, fine-tuned at 224x224.

https://huggingface.co/microsoft/beit-base-patch16-224-pt22k-ft22k

Mistralai - Mistral-7B-Instruct-v0.1: Fine-tuned instruct LLM

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1

Facebook xlm - CommonCrawl data, 100 languages

https://huggingface.co/FacebookAI/xlm-roberta-large

MAGNeT - Music generator by Meta AI

https://pages.cs.huji.ac.il/adiyoss-lab/MAGNeT/

Open-Orca/Mistral-7B-OpenOrca

https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca

Tohoku - Japanese BERT pretrained model

https://huggingface.co/tohoku-nlp/bert-base-japanese