GLM-130B

GLM-130B, presented at ICLR 2023, is an open bilingual (English and Chinese) bidirectional dense model with 130 billion parameters. It is pre-trained with the General Language Model (GLM) algorithm and is designed to support inference on a single server, whether an A100 (40G * 8) or a V100 (32G * 8). With INT4 quantization, the hardware requirement can be reduced further still, allowing a single server with 4 * RTX 3090 (24G) to run the model with negligible performance degradation.
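
The hardware figures above can be sanity-checked with simple parameter-memory arithmetic. The sketch below is a back-of-the-envelope estimate in plain Python: it multiplies 130 billion parameters by the bytes per parameter at FP16, INT8, and INT4 precision and compares the result with the aggregate GPU memory of the server configurations mentioned above. It counts weights only and deliberately ignores activations and KV-cache memory, so the real headroom is smaller than shown.

# Back-of-the-envelope weight-memory estimate for a 130B-parameter model.
# Counts model weights only; activation and KV-cache memory are ignored
# (a simplification for illustration).

NUM_PARAMS = 130e9  # 130 billion parameters

BYTES_PER_PARAM = {
    "FP16": 2.0,
    "INT8": 1.0,
    "INT4": 0.5,
}

# Aggregate GPU memory (GB) of the server setups mentioned in the text.
SERVERS = {
    "8 x A100 40G": 8 * 40,
    "8 x V100 32G": 8 * 32,
    "4 x RTX 3090 24G": 4 * 24,
}

for precision, bytes_per_param in BYTES_PER_PARAM.items():
    weights_gb = NUM_PARAMS * bytes_per_param / 1e9
    print(f"{precision}: ~{weights_gb:.0f} GB of weights")
    for server, total_gb in SERVERS.items():
        verdict = "fits" if weights_gb < total_gb else "does not fit"
        print(f"  {server} ({total_gb} GB total): {verdict}")

Run as-is, the sketch shows FP16 weights (~260 GB) fitting only the 8 * A100 configuration (320 GB), INT8 (~130 GB) fitting both 8-GPU servers, and INT4 (~65 GB) fitting 4 * RTX 3090 (96 GB), which is the arithmetic behind the reduced hardware requirement.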

GLM-130B was trained on an extensive dataset of over 400 billion text tokens, split evenly between Chinese and English. It offers strong bilingual support, superior performance across various datasets compared to its counterparts, and fast inference. The repository also supports reproducibility by providing open-source code and model checkpoints for over 30 tasks.

Top Features:
  1. Bilingual Support: GLM-130B is pre-trained on and supports both English and Chinese.

  2. High Performance: Benchmarks show GLM-130B outperforming comparable models across a wide range of datasets.

  3. Fast Inference: Utilizes SAT (SwissArmyTransformer) and FasterTransformer for rapid inference on a single A100 server.

  4. Reproducibility: Consistent results across more than 30 tasks, thanks to open-source code and model checkpoints.

  5. Cross-Platform Compatibility: Accommodates a range of platforms including NVIDIA, Hygon DCU, Ascend 910, and Sunway.

FAQs:

1) What is GLM-130B?

GLM-130B is a bilingual, bidirectional dense model with 130 billion parameters, pre-trained using the General Language Model (GLM) algorithm.

2) How much data was GLM-130B trained on?

The model was trained on over 400 billion text tokens, roughly 200 billion each of Chinese and English text.

3) Can the results produced by GLM-130B be reproduced?

Yes, results on more than 30 tasks can be reproduced using the open-source code and model checkpoints provided in the repository.

4) Does GLM-130B support multiple hardware platforms?

GLM-130B supports training and inference not only on NVIDIA GPUs but also on Hygon DCU and Ascend 910, with Sunway support planned.

5) What is the main focus of the GLM-130B repository?

The repository mainly focuses on the evaluation of GLM-130B, supporting fast model inference and reproducibility of results.

Pricing:

Free

Tags:

GitHub
Bilingual Pre-Trained Model
GLM-130B
ICLR 2023
Open Source
Machine Learning

By Rishit