LLMs

A list of all the LLMs in the market, feel free to add your LLMs here but please include relevant and accurate info

Curated by: 

 @Beau Django

With help from: 

2

GTP-4 Turbo

https://platform.openai.com/docs/models/gpt-4-and-gpt-4-turbo

0

0

Stabilityai - Expert ensemble, latent diffusion

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0

0

0

Facebook - Repository English TTS model

https://huggingface.co/facebook/mms-tts-eng

0

0

0

0

Pyannote - Reproduce Interspeech 2021 results with segmentation

https://huggingface.co/pyannote/segmentation

0

0

Distilbert - Distilled RoBERTa-base model

https://huggingface.co/distilbert/distilroberta-base

0

0

Dslim NER - Fine-tuned BERT, state-of-the-art NER

https://huggingface.co/dslim/bert-base-NER

0

0

FacebookAI - English MLM pretrained model

https://huggingface.co/FacebookAI/roberta-large

0

0

FacebookAI - English MLM pretrained model

https://huggingface.co/FacebookAI/roberta-base

0

0

Facebook Opt - OPT introduced in May 2022

https://huggingface.co/facebook/opt-2.7b

0

0

Mistralai - Mistral-7B-Instruct-v0.2: Improved LLM

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2

0

0

Weaver - Creative Writing Models

https://huggingface.co/papers/2401.17268

0

0

Google Byt5 - Tokenizer-free T5 version, architecture similar to MT5

https://huggingface.co/google/byt5-small

0

0

Cross-encoder - Query encoding, passage matching

https://huggingface.co/cross-encoder/ms-marco-MiniLM-L-6-v2

0

0

Pyannote speaker - Pipeline Sans onnxruntime, like pyannote

https://huggingface.co/pyannote/speaker-diarization-3.1

0

0

Openai-community - English CLM pretrained model

https://huggingface.co/openai-community/gpt2

0

0

Stabilityai - Model card for Stable Diffusion v2

https://huggingface.co/stabilityai/stable-diffusion-2

0

0

Marieke93 - Fine-tuned MiniLM on evidence types.

https://huggingface.co/marieke93/MiniLM-evidence-types

0

0

Pyannote - x-vector TDNN, SincNet features

https://huggingface.co/pyannote/embedding

0

0

Distiluse-base - Text to 512D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2

0

0

Openai Clip - Model trained from scratch

https://huggingface.co/openai/clip-vit-large-patch14-336

0

0

j-hartmann emotion - English text, Ekman's 6 emotions, trained on 6 datasets

https://huggingface.co/j-hartmann/emotion-english-distilroberta-base

0

0

Cardiffnlp - roBERTa-base, 58M tweets, TweetEval sentiment finetuned

https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment

0

0

MoritzLaurer - Trained on MultiNLI, Fever-NLI, ANLI: 763,913 pairs

https://huggingface.co/MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli

0

0

Contriever - HuggingFace transformers require mean pooling

https://huggingface.co/facebook/contriever

0

0

Prajjwal1 - Converted Google BERT PyTorch model

https://huggingface.co/prajjwal1/bert-small

0

0

Jonatasgrosman - Fine-tuned with OVHcloud GPU credits

https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

0

0

Sentence-transformers - Maps text to 384-dimensional vectors

https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2

0

0

sentence Paraphrase - Text to 384D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

0

0

Yiyanghkust - Pre-trained on financial text

https://huggingface.co/yiyanghkust/finbert-tone

0

0

0

0

Ehsanaghaei - Specific RoBERTa for cybersecurity

https://huggingface.co/ehsanaghaei/SecureBERT

0

0

Google-bert - English MLM pretrained model

https://huggingface.co/google-bert/bert-base-cased

0

0

Tsmatz - Fine-tuned xlm-roberta-base

https://huggingface.co/tsmatz/xlm-roberta-ner-japanese

0

0

0

0

Guillaumekln - OpenAI Whisper-large-v2 to CTranslate2 conversion

https://huggingface.co/guillaumekln/faster-whisper-large-v2

0

0

Cardiffnlp - ~124M tweets, TweetEval sentiment finetuned.

https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment-latest

0

0

sentence MiniLM - Sentence-embeddings: 384D, cluster, search

https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2

0

0

Facebook Encodec - EnCodec model card: Real-time audio codec

https://huggingface.co/facebook/encodec_24khz

0

0

Microsoft - Pre-trained multimodal Transformer

https://huggingface.co/microsoft/layoutlmv3-base

0

0

0

0

Openai Clip - Model card from CLIP repository

https://huggingface.co/openai/clip-vit-base-patch32

0

0

Cardiffnlp - XLM-roBERTa-base, ~198M tweets, sentiment finetuned

https://huggingface.co/cardiffnlp/twitter-xlm-roberta-base-sentiment

0

0

Microsoft - Improved BERT, RoBERTa with disentangled attention

https://huggingface.co/microsoft/deberta-base

0

0

Jean-Baptiste - Validated on emails/chat, outperformed others

https://huggingface.co/Jean-Baptiste/roberta-large-ner-english

0

0

Deepset - roberta-base, fine-tuned SQuAD2.0

https://huggingface.co/deepset/roberta-base-squad2

0

0

Google Bert - Pretrained model Top 104 languages, MLM

https://huggingface.co/google-bert/bert-base-multilingual-cased

0

0

0

0

Facebook - Base model, 960h Librispeech, 16kHz fine-tuned

https://huggingface.co/facebook/wav2vec2-base-960h

0

0

Cardiffnlp - roBERTa-base, 58M tweets, TweetEval irony finetuned

https://huggingface.co/cardiffnlp/twitter-roberta-base-irony

0

0

Genie by Lumalabs - text to 3d

https://lumalabs.ai/genie?view=create

0

0

Sentence Transformers - Maps text to 768D vectors

https://huggingface.co/sentence-transformers/bert-base-nli-mean-tokens

0

0

Cardiffnlp - roBERTa-base, ~58M tweets, TweetEval offensive language

https://huggingface.co/cardiffnlp/twitter-roberta-base-offensive

0

0

Martin-ha - Toxic comments classification

https://huggingface.co/martin-ha/toxic-comment-model

0

0

Almanach - French RoBERTa-based model

https://huggingface.co/almanach/camembert-base

0

0

Stabilityai - Weights for diffusers library

https://huggingface.co/stabilityai/sd-vae-ft-mse

0

0

google - Vit - ImageNet-21k pre-trained, 224x224 resolution

https://huggingface.co/google/vit-base-patch16-224-in21k

0

0

Morpheus 1 - multi-modal generative ultrasonic transformer

https://x.com/PropheticAI/status/1750534355242418300?s=20

0

0

Sentence Multi - Multi-QA MiniLM-L6, sentence-transformers

https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1

0

0

Google Electra - ELECTRA Self-supervised language learning.

https://huggingface.co/google/electra-base-discriminator

0

0

Stabilityai diffusion - SDXL Ensemble, latent diffusion pipeline

https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0

0

0

google Fnetbase - English MLM, NSP pretrained model

https://huggingface.co/google/fnet-base

0

0

Distilbert - Paper introduces distilled BERT

https://huggingface.co/distilbert/distilbert-base-uncased

0

0

Runwayml - Stable Diffusion Text-to-image model

https://huggingface.co/runwayml/stable-diffusion-v1-5

0

0

Prajjwal1 - PyTorch model from converted Google BERT

https://huggingface.co/prajjwal1/bert-small

0

0

Dslim - Fine-tuned BERT, state-of-the-art NER

https://huggingface.co/dslim/bert-large-NER

0

0

Ahmedrachid - Pre-trained on financial texts

https://huggingface.co/ahmedrachid/FinancialBERT-Sentiment-Analysis

0

0

Openai - AI explores vision robustness

https://huggingface.co/openai/clip-vit-large-patch14

0

0

BAAI - FlagEmbedding Retrieval-augmented LLMs

https://huggingface.co/BAAI/bge-reranker-base

0

0

Timbrooks - Install diffusers in InstructPix2Pix using main for now

https://huggingface.co/timbrooks/instruct-pix2pix

0

0

Supabase - GTE models by Alibaba DAMO Academy

https://huggingface.co/Supabase/gte-small

0

0

LTP/small - Chinese NLP tools

https://huggingface.co/LTP/small

0

0

Ashishkr - Validate content well-formedness

https://huggingface.co/Ashishkr/query_wellformedness_score

0

0

Sentence-transformers - 768-dimensional text vectors

https://huggingface.co/sentence-transformers/all-mpnet-base-v2

0

0

Cmarkea - Fine-tuned for French NER.

https://huggingface.co/cmarkea/distilcamembert-base-ner

0

0

Nlpconnect - Flax image captioning model, PyTorch version

https://huggingface.co/nlpconnect/vit-gpt2-image-captioning

0

0

Stabilityai - Fast text-to-image synthesis

https://huggingface.co/stabilityai/sdxl-turbo

0

0

Bigscience/bloomz-560m

https://huggingface.co/

0

0

Sentence Qa - 768D dense vectors, semantic search

https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1

0

0

0

0

SamLowe - roberta-base, go_emotions dataset, multi-label.

https://huggingface.co/SamLowe/roberta-base-go_emotions

0

0

Facebook Opt-125m - Introduced in metaseq's repository

https://huggingface.co/facebook/opt-125m

0

0

Facebook Sam - Object masks from input prompts

https://huggingface.co/facebook/sam-vit-huge

0

0

Jean Baptiste - Fine-tuned on wikinerfr

https://huggingface.co/Jean-Baptiste/camembert-ner

0

0

Microsoft Mdeberta - DeBERTa Improved BERT, RoBERTa

https://huggingface.co/microsoft/mdeberta-v3-base

0

0

Papluca - XLM-RoBERTa with classification head

https://huggingface.co/papluca/xlm-roberta-base-language-detection

0

0

Distilbert Tokens - 768D vectors, clustering, semantic search

https://huggingface.co/sentence-transformers/distilbert-base-nli-mean-tokens

0

0

Facebook - BART-large checkpoint, trained on MNLI

https://huggingface.co/facebook/bart-large-mnli

0

0

Albert - English MLM pretrained model

https://huggingface.co/albert/albert-base-v2

0

0

Google-bert - English model, MLM objective

https://huggingface.co/google-bert/bert-base-uncased

0

0

Cambridgeltl - SapBERT ACL 2021 cross-lingual extension

https://huggingface.co/cambridgeltl/SapBERT-from-PubMedBERT-fulltext

0

0

Google Patch - Pre-trained ImageNet-21k, fine-tuned ImageNet

https://huggingface.co/facebook/bart-large-cnn

0

0

Google Bert - Chinese model, input masking

https://huggingface.co/google-bert/bert-base-chinese

0

0

Facebook Bart - Pre-trained English, fine-tuned CNN Daily Mail

https://huggingface.co/facebook/bart-large-cnn

0

0

Jonatasgrosman - Fine-tuned wav2vec2-xl53: Russian, Common Voice, CSS10

https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-russian

0

0

Mistralai - Generative Sparse Mixture of Experts

https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1

0

0

kk08 - Custom Crypto Market Sentiment

https://huggingface.co/kk08/CryptoBERT

0

0

Microsoft Beit - Pre-trained ImageNet-22k, fine-tuned at 224x224.

https://huggingface.co/microsoft/beit-base-patch16-224-pt22k-ft22k

0

0

Mistralai - Mistral-7B-Instruct-v0.1: Fine-tuned instruct LLM

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1

0

0

Facebook xlm - CommonCrawl data, 100 languages

https://huggingface.co/FacebookAI/xlm-roberta-large

0

0

MAGNeT - Music generator by Meta AI

https://pages.cs.huji.ac.il/adiyoss-lab/MAGNeT/

0

0

Tohoku - Japanese BERT pretrained model

https://huggingface.co/tohoku-nlp/bert-base-japanese

0