HandH1998

HandH1998 HandH1998

Achievements

vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31.3k 4.8k
bytedance/lightseq bytedance/lightseq Public

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3.2k 329
microsoft/Megatron-DeepSpeed microsoft/Megatron-DeepSpeed Public

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1.9k 345
AniZpZ/AutoSmoothQuant AniZpZ/AutoSmoothQuant Public

An easy-to-use package for implementing SmoothQuant for LLMs

Python 86 7
QQQ QQQ Public

QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.

Python 91 8
sglang sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python