Back to Official Open Source Localized NLP toolkit Maintenance slowed

Project Profile

SGNLP

Models from the Singapore NLP research community

GitHub stars
37
Install
pip
Core context
Singlish / code-switching
Owner
AI Singapore
Category
Localized NLP toolkit
Status
Maintenance slowed
Started
2021
Language / Form
Python
License
MIT
GitHub Stars
37
Updated
2026-05-04

SGNLP is AI Singapore’s localized language-AI toolkit before SEA-LION, focused on Singlish, multilingual code-switching, and Singapore-specific NLP tasks.

What It Is

SGNLP is a Python package that wraps models from Singapore’s NLP research community. Its focus is not generic English NLP, but the Singapore context: Singlish, English / Mandarin / Malay code-switching, local entities, and local text understanding.

Before LLMs became widely available, this kind of lightweight model was better suited to customer service, social-media analysis, and government-text processing.

AI Relevance

SGNLP shows an important fact: language-AI localization did not begin with SEA-LION. Singapore English and multilingual code-switching often break generic NLP tools, and lightweight models still retain value for edge deployment and real-time processing.

Its relationship with SEA-LION is closer to two product generations: SGNLP as the specialty toolkit, SEA-LION as the general regional LLM.

Singapore Relevance

SGNLP is an early engineering expression of Singapore’s "language sovereignty" path. It treats local language phenomena as a product problem rather than waiting for global models to cover them naturally.

This page is a good future home for more detail: model list, demo status, whether government or enterprise systems still use it, and how it relates to SEA-LION embeddings / ModernBERT.

Milestones

  1. 2021
    SGNLP open-sourced

Resources