
Project information

OAT

Research-friendly framework for LLM online alignment

Direction
model alignment
Form
training framework
Institution
Sea AI Lab (SAIL)
Group
International corporate lab
Category
LLM online alignment framework
Status
Actively maintained
Launched
2024-10
Language / form
Python
License
Apache-2.0
GitHub stars
652
Last updated
2026-05-04

OAT is Sea AI Lab’s LLM alignment-training framework, aimed at post-training workflows such as reinforcement learning and preference learning.

Description

OAT stands for Online Alignment Training. It packages common LLM post-training workflows such as reinforcement learning, preference optimization, online sampling, and evaluation into a research-friendly framework.
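The shape of such an online loop can be sketched in miniature. The toy example below illustrates the sample → preference-feedback → update cycle that online alignment frameworks orchestrate; it is not OAT's actual API. The one-logit "policy", the oracle that always prefers response 0, and the REINFORCE-style update are all invented for illustration:

```python
import math
import random

def online_align(steps=500, lr=0.2, seed=0):
    """Toy online alignment loop: sample from the current policy,
    query a preference oracle, update the policy, repeat.

    The "policy" is a Bernoulli distribution over two canned responses,
    parameterized by a single logit; the oracle always prefers response 0.
    """
    rng = random.Random(seed)
    logit = 0.0
    for _ in range(steps):
        p0 = 1.0 / (1.0 + math.exp(-logit))      # P(response 0)
        sampled = 0 if rng.random() < p0 else 1  # online sampling
        reward = 1.0 if sampled == 0 else -1.0   # oracle feedback
        # REINFORCE-style update: reward * d/dlogit log P(sampled)
        grad_logp = (1.0 - p0) if sampled == 0 else -p0
        logit += lr * reward * grad_logp
    return 1.0 / (1.0 + math.exp(-logit))
```

After a few hundred steps the policy concentrates on the preferred response (`online_align()` returns a probability well above 0.9). A real framework replaces the logit with an LLM, the oracle with a reward model or human labels, and the update with PPO-, GRPO-, or DPO-style objectives, but the control flow is the same.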

It is not an end-user product, but a tool for model research and training teams.

Relation to AI

Model capability increasingly depends on post-training. Pretraining provides base knowledge; SFT, RLHF, DPO, online reinforcement learning, and related workflows determine whether a model is useful, stable, and aligned.
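Of the methods listed, DPO has the most compact mathematical core and shows what a preference-learning objective looks like in practice. The following is the standard DPO loss for a single preference pair (a generic textbook formula, not code taken from OAT):

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    pi_* and ref_* are log-probabilities of the chosen/rejected responses
    under the policy being trained and the frozen reference model.
    Loss = -log sigmoid(beta * (policy margin - reference margin)).
    """
    margin = (pi_chosen - pi_rejected) - (ref_chosen - ref_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))
```

When the policy and reference agree, the margin is zero and the loss is log 2 ≈ 0.693; as the policy's margin on the chosen response grows past the reference's, the loss falls toward zero. Minimizing it therefore pushes probability mass toward preferred responses without a separate reward model.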

OAT matters because it turns complex alignment experiments into reusable engineering infrastructure.

Relation to Singapore

OAT shows that Sea AI Lab is not only building regional language models, but also model-training tooling. That matters as evidence of a homegrown Singapore tech company participating in foundation-model competition.

Future tracking should watch whether it is used in Sailor or other SAIL model-training pipelines.

Key milestones

  1. 2024-10
    OAT repository created
