Project Profile
SGNLP
Models from the Singapore NLP research community
- Owner: AI Singapore
- Category: Localized NLP toolkit
- Status: Maintenance slowed
- Started: 2021
- Language / Form: Python
- License: MIT
- GitHub Stars: 37
- Updated: 2026-05-04
SGNLP is AI Singapore’s localized language-AI toolkit that predates SEA-LION. It focuses on Singlish, multilingual code-switching, and Singapore-specific NLP tasks.
What It Is
SGNLP is a Python package that wraps models from Singapore’s NLP research community. Its focus is not generic English NLP but the Singapore context: Singlish, English / Mandarin / Malay code-switching, local entities, and local text understanding.
Before LLMs became widely available, lightweight models of this kind were better suited than generic English tools to customer service, social-media analysis, and government-text processing.
AI Relevance
SGNLP shows that language-AI localization did not begin with SEA-LION. Singapore English and multilingual code-switching often break generic NLP tools, and lightweight models still retain value for edge deployment and real-time processing.
SGNLP and SEA-LION are best read as two product generations: SGNLP as the specialty toolkit, SEA-LION as the general regional LLM.
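The point that code-switching breaks generic tools can be made concrete with a toy sketch. It does not use SGNLP or any real NLP library: the vocabulary, the sample sentences, and the helper name `oov_rate` are all invented for illustration. It only shows how an English-only word list starts missing tokens once Malay loanwords and Singlish particles appear.

```python
# Toy illustration only: this vocabulary and both sentences are made up
# for the sketch and are not taken from SGNLP or any real dataset.
ENGLISH_VOCAB = {
    "the", "food", "here", "is", "very", "good",
    "later", "we", "go", "eat", "can", "or", "not",
}


def oov_rate(sentence: str, vocab: set) -> float:
    """Return the fraction of tokens in `sentence` that are not in `vocab`."""
    # Naive whitespace tokenization with punctuation stripped and lowercasing.
    tokens = [t.strip(".,!?").lower() for t in sentence.split()]
    tokens = [t for t in tokens if t]
    if not tokens:
        return 0.0
    unknown = [t for t in tokens if t not in vocab]
    return len(unknown) / len(tokens)


standard = "The food here is very good."
code_switched = "Later we go makan, can or not lah?"

print(oov_rate(standard, ENGLISH_VOCAB))       # every token is known
print(oov_rate(code_switched, ENGLISH_VOCAB))  # "makan" and "lah" are unknown
```

Real systems fail in subtler ways (tagging, parsing, sentiment), but the same gap in coverage is what a localized toolkit is meant to close.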
Singapore Relevance
SGNLP is an early engineering expression of Singapore’s "language sovereignty" path. It treats local language phenomena as a product problem rather than waiting for global models to cover them naturally.
Details still to be added to this page: the model list, demo status, whether government or enterprise systems still use it, and how it relates to SEA-LION embeddings / ModernBERT.
Milestones
- 2021: SGNLP open-sourced