  • OpenAI
    • GPT-4 Technical Report, 2023.
  • Anthropic Claude
    • A General Language Assistant as a Laboratory for Alignment, 2021.
    • Model Card and Evaluations for Claude Models, 2023.
    • The Claude 3 Model Family: Opus, Sonnet, Haiku, 2024.
  • Google Gemini
    • Gemini: A Family of Highly Capable Multimodal Models, Dec. 19 2023.
    • Gemma: Open Models Based on Gemini Research and Technology, 2024.
    • Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context, May 2024.
  • Llama
    • LLaMA: Open and Efficient Foundation Language Models, Feb. 2023.
    • Llama 2: Open Foundation and Fine-Tuned Chat Models, Jul. 2023.
    • Llama3, 2024.
  • Qwen
    • Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond, Aug. 24 2023.
    • Qwen Technical Report, Sep. 28 2023.
    • Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models, Nov. 14 2023.
    • https://qwenlm.github.io/blog/
  • DeepSeek
    • DeepSeek LLM: Scaling Open-Source Language Models with Longtermism, Jan. 5 2024.
    • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models, Jan. 11 2024.
    • DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence, Jan. 24 2024.
    • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models, Feb. 5 2024.
    • DeepSeek-VL: Towards Real-World Vision-Language Understanding, Mar. 8 2024.
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model, May 16 2024.
  • Others
    • Yi: Open Foundation Models by 01.AI, 2024.
    • Baichuan 2: Open Large-scale Language Models, 2023.
    • GLM-130B: An Open Bilingual Pre-trained Model, 2022.
    • ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback, Apr. 3 2024.
    • The ChatGLM’s Road to AGI, ICLR 2024 talk.
    • InternLM: A Multilingual Language Model with Progressively Enhanced Capabilities, 2023.
    • InternLM2 Technical Report, 2024.
    • OpenELM: An Efficient Language Model Family with Open Training and Inference Framework, May 2024.
    • Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models, 2024.