🌚 Discover
More

Other Resources

Bonus Resources!

HyperTech Core v0.1.0 ☄️

HYPERION 🪐

  • (coming soon!)

Resources ✨

YouTube 📺

Build 🏗️

Compute ⚡

R&D 🧪

Bonus 🛸

Looking for all of the other cool technologies being developed in the space? Checkout my GitHub Stars for tons of really interesting projects that are FOSS & FOSAI.

Awesome-LLM

The content below is from Awesome-LLM (opens in a new tab).

Base LLM

ModelSizeArchitectureAccessDateOriginModel License[^1]
Switch Transformer1.6TDecoder(MOE)-2021-01Paper (opens in a new tab)-
GLaM1.2TDecoder(MOE)-2021-12Paper (opens in a new tab)-
PaLM540BDecoder-2022-04Paper (opens in a new tab)-
MT-NLG530BDecoder-2022-01Paper (opens in a new tab)-
J1-Jumbo178BDecoderapi (opens in a new tab)2021-08Paper (opens in a new tab)-
OPT175BDecoderapi (opens in a new tab) | ckpt (opens in a new tab)2022-05Paper (opens in a new tab)OPT-175B License Agreement (opens in a new tab)
BLOOM176BDecoderapi (opens in a new tab) | ckpt (opens in a new tab)2022-11Paper (opens in a new tab)BigScience RAIL License v1.0 (opens in a new tab)
GPT 3.0175BDecoderapi (opens in a new tab)2020-05Paper (opens in a new tab)-
LaMDA137BDecoder-2022-01Paper (opens in a new tab)-
GLM130BDecoderckpt (opens in a new tab)2022-10Paper (opens in a new tab)The GLM-130B License (opens in a new tab)
YaLM100BDecoderckpt (opens in a new tab)2022-06Blog (opens in a new tab)Apache 2.0 (opens in a new tab)
LLaMA65BDecoderckpt (opens in a new tab)2022-09Paper (opens in a new tab)Non-commercial bespoke license (opens in a new tab)
GPT-NeoX20BDecoderckpt (opens in a new tab)2022-04Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
Falcon40BDecoderckpt (opens in a new tab)2023-05Homepage (opens in a new tab)Apache 2.0 (opens in a new tab)
UL220Bagnosticckpt (opens in a new tab)2022-05Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
鹏程.盘古α13BDecoderckpt (opens in a new tab)2021-04Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
T511BEncoder-Decoderckpt (opens in a new tab)2019-10Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
CPM-Bee10BDecoderapi (opens in a new tab)2022-10Paper (opens in a new tab)-
rwkv-47BRWKVckpt (opens in a new tab)2022-09Github (opens in a new tab)Apache 2.0 (opens in a new tab)
GPT-J6BDecoderckpt (opens in a new tab)2022-09Github (opens in a new tab)Apache 2.0 (opens in a new tab)
GPT-Neo2.7BDecoderckpt (opens in a new tab)2021-03Github (opens in a new tab)MIT (opens in a new tab)
GPT-Neo1.3BDecoderckpt (opens in a new tab)2021-03Github (opens in a new tab)MIT (opens in a new tab)

Instruction Finetuned LLM

ModelSizeArchitectureAccessDateOriginModel License[^1]
Flan-PaLM540BDecoder-2022-10Paper (opens in a new tab)-
BLOOMZ176BDecoderckpt (opens in a new tab)2022-11Paper (opens in a new tab)BigScience RAIL License v1.0 (opens in a new tab)
InstructGPT175BDecoderapi (opens in a new tab)2022-03Paper (opens in a new tab)-
Galactica120BDecoderckpt (opens in a new tab)2022-11Paper (opens in a new tab)CC-BY-NC-4.0 (opens in a new tab)
OpenChatKit20B-ckpt (opens in a new tab)2023-3-Apache 2.0 (opens in a new tab)
Flan-UL220BDecoderckpt (opens in a new tab)2023-03Blog (opens in a new tab)Apache 2.0 (opens in a new tab)
Gopher------
Chinchilla------
Flan-T511BEncoder-Decoderckpt (opens in a new tab)2022-10Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
T011BEncoder-Decoderckpt (opens in a new tab)2021-10Paper (opens in a new tab)Apache 2.0 (opens in a new tab)
Alpaca7BDecoderdemo (opens in a new tab)2023-03Github (opens in a new tab)CC BY NC 4.0 (opens in a new tab)
Orca13BDecoderckpt (opens in a new tab)2023-06Paper (opens in a new tab)Non-commercial bespoke license (opens in a new tab)

LLM Training Frameworks

Tools for Deploying LLMs

Tutorials About LLMs

Courses About LLMs

Opinions about LLMs

Other Awesome Lists

Other Useful Resources

Other Papers

If you're interested in the field of LLM, you may find the above list of milestone papers helpful to explore its history and state-of-the-art. However, each direction of LLM offers a unique set of insights and contributions, which are essential to understanding the field as a whole. For a detailed list of papers in various subfields, please refer to the following link (it is possible that there are overlaps between different subfields):

  • LLM-Analysis

    Analyse different LLMs in different fields with respect to different abilities

  • LLM-Acceleration

    Hardware and software acceleration for LLM training and inference

  • LLM-Application

    Use LLM to do some really cool stuff

  • LLM-Augmentation

    Augment LLM in different aspects including faithfulness, expressiveness, domain-specific knowledge etc.

  • LLM-Detection

    Detect LLM-generated text from texts written by humans

  • LLM-Alignment

    Align LLM with Human Preference

  • Chain-of-Thought

    Chain of thought—a series of intermediate reasoning steps—significantly improves the ability of large language models to perform complex reasoning.

  • In-Context-Learning

    Large language models (LLMs) demonstrate an in-context learning (ICL) ability, that is, learning from a few examples in the context.

  • Prompt-Learning

    A Good Prompt is Worth 1,000 Words

  • Instruction-Tuning

    Finetune a language model on a collection of tasks described via instructions