Repositories list
71 repositories
- sailor-llm (Public)
- inceptionnext (Public): InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)
- 🌾 OAT: Online AlignmenT for LLMs
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
- VocabularyParallelism (Public)
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
- Cheating-LLM-Benchmarks (Public)
- P-DoS (Public)
- CPO (Public)
- SimLayerKV (Public)
- Attention-Sink (Public): [ATTRIB @ NeurIPS 2024 Oral] When Attention Sink Emerges in Language Models: An Empirical View
- regmix (Public)
- scaling-with-vocab (Public): [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
- I-FSJ (Public)
- dice (Public): Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
- lorahub (Public)
- 🚢 Data Toolkit for Sailor Language Models
- MetaFormer Baselines for Vision (TPAMI 2024)
- poolformer (Public): PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)