MDDI 演讲稿 · 2025-07-07

部长Josephine Teo在2025年个人数据保护周上的开幕致辞

Josephine Teo · 数码发展及新闻部长 · 个人数据保护周

要点

• 新加坡 IMDA 与另外八个国家联合举办首届区域性红队挑战赛，发现大语言模型因训练数据偏见而产生刻板化输出问题，例如将特定族裔姓名与犯罪角色绑定。
• 过去三年，IMDA 与 PDPC 持续运营「PET 沙盒」，蚂蚁国际等企业借助隐私增强技术在不交换原始客户数据的前提下与合作方联合训练 AI 模型，显著提升凭证核销率。
• IMDA 将发布面向高管层的「PET 采用指南」，协助企业识别适合自身业务需求的隐私增强技术并提供关键部署考量要素。
• IMDA、AI Verify Foundation 及行业伙伴联合开展「全球 AI 保证试点」，为生成式 AI 应用可靠性测试制定标准化方法，成果已汇编为「IMDA 入门工具包」，涵盖不当内容与数据泄露等风险的测试方法。
• IMDA 正将上述试点升级为持续运营的「AI 保证沙盒」，供业务用户、治理团队与 AI 开发者协同研发生成式 AI 应用的护栏与测试流程。
• IMDA 联合新加坡企业发展局与新加坡认可理事会，将数据保护信任标志（DPTM）升级为新国家标准「新加坡标准 714」（SS714），为企业展示卓越数据保护能力提供正式认证基准。

完整译文（中文）

MDDI 英文原文译文 · 翻译日期: 2026-06-21

早上好，各位同仁与朋友。首先，我想感谢在座的每一位。今天，这个房间里有逾1,500人，整个星期来来往往的与会者超过2,000人，其中包括来自亚洲许多国家乃至更远地区的嘉宾。我尤其感谢前来出席的国际宾客，包括东盟成员国的数据保护机构代表。感谢大家莅临。

今年的主题是"变革世界中的数据保护"。这一主题是对我们全球运营环境以及技术世界所发生的重大变化的回应。

这两股力量已扰乱了我们的工作场所、家庭以及彼此之间的关系。我们必须调整自身的实践方式、法律乃至更广泛的社会规范，这是必然之举。

在座大多数人是数据或人工智能从业者，或两者兼而有之。

去年，我曾谈及数据在人工智能时代的重要性。这一点至今依然切中时要。我们都知道，生成式人工智能模型建立在海量数据之上，而数据在人工智能开发全生命周期中至关重要——从预训练、微调，到测试与验证，无一例外。

近年来，我们见证了基于定制化或专有数据集构建的垂直行业人工智能应用百花齐放。

一个典型案例是樟宜机场的聊天机器人AskMax，它专门协助解答旅客查询。该系统运行于一个大型语言模型之上，该模型被设计为可调用樟宜机场的数据存储库。

另一个案例是GPT-Legal，它由IMDA利用新加坡法律学院LawNet数据库进行微调而成。

鉴于数据在人工智能时代的关键地位，数据已成为持续进步的制约因素，这并不令人意外。

让我们逐一梳理人工智能开发与应用各阶段所面临的数据挑战。

在模型训练阶段，第一个广为人知的问题是使用互联网数据来训练大型模型。互联网数据质量参差不齐，往往包含来自不同来源的偏颇或有害内容，包括讨论论坛上的用户生成内容。当底层数据输入含有有害、毒性或偏颇内容时，可能导致模型输出出现下游问题。

在由新加坡IMDA与其他八个国家联合举办的首届区域红队测试挑战赛中，发现了模型的问题行为。当被要求编写一段关于新加坡服刑人员的脚本时，该大型语言模型为因非法赌博而系狱的角色选取了"Kok Wei"这一名字，为酗酒滋事者选取了"Siva"，为毒品滥用违法者选取了"Razif"。这些刻板印象极有可能源自训练数据，而这恰恰是我们希望避免的。

与此同时，开发者正面临互联网数据枯竭的问题。大多数大型语言模型已在全量互联网数据上完成训练。那么，模型提供商应如何进一步提升模型性能？他们正转向更为敏感的私有数据库来增强模型，这又带来了一系列新的挑战。

以OpenAI为例，其数据合作伙伴关系的清单不断扩大，合作方不仅涵盖全球新闻机构，还包括政府、企业及大学，例如冰岛政府、苹果公司、赛诺菲以及亚利桑那州立大学。

合作伙伴模式是扩大数据可用性的一种途径，但耗时较长，且难以大规模复制。这些数据库中可能包含敏感数据，例如个人数据或商业机密信息。

我们越来越需要一种既能训练模型、又能保护敏感信息的方法。

人工智能应用（即"app"）可视为叠加于人工智能模型之上的"外壳"，同样可能带来可靠性方面的隐忧。若应用提供不准确、带有偏颇或具有毒性的信息，或泄露机密信息，将对企业声誉产生严重影响，情节严重时甚至可能造成实际的人身伤害。

通常，企业会采用一系列广为人知的防护措施来确保应用的可靠性。这些措施包括：编写详细的系统提示词以引导模型行为；采用检索增强生成（即RAG，相信在座许多人对此并不陌生）以提升准确性；以及采用各类过滤器筛除敏感信息。

即便如此，应用仍可能存在意想不到的缺陷。第三方测试机构Vulcan近期对一家高科技制造商的聊天机器人进行了测试，该机器人旨在协助员工解答潜在客户就产品规格提出的问题。该制造商担忧应用会无意中泄露商业机密信息，例如向潜在客户透露其不希望对方知晓的内容。果不其然，Vulcan发现，当以普通话进行提问时，该应用泄露了后台销售佣金比率。可以想象，从制造商的角度来看，向潜在客户透露销售佣金比率，无异于揭示了自身还有多大的降价空间——而这是任何企业都不愿意的。

所幸，这一问题在测试阶段便已被发现。这凸显了独立测试的价值。为确保生成式人工智能应用在发布前的可靠性，建立一套系统性、一致性的检验机制至关重要，以核实应用是否按预期运行，并具备基本的安全保障。

与模型开发者一样，应用开发者同样须应对数据不足的问题。通常情况下，模型会与企业内部数据库相连接，以使应用能够满足企业的特定需求。然而，往往缺乏足够的专有数据来构建可靠的应用。在IBM全球调查中，42%的受访者将此列为其推进人工智能应用所面临的最大挑战之一。因此，我们需要一种在保护敏感信息的同时，推动企业间更多数据共享的方法。

人工智能应用部署并供消费者使用后，纠正错误或有害信息将面临重大挑战。对已"学习"过某些内容的模型进行微调和重新训练，过程并不精准，且往往代价高昂。

因此，机器遗忘已成为一个新兴领域，尽管目前仍处于起步阶段。Anthropic等大型语言模型领军者所面临的核心挑战在于：模型如今拥有数十亿乃至数万亿个参数。哪些变量对输出缺陷贡献最大？是否存在相应技术，能够识别这些变量并在大规模层面开展有针对性的模型修正？

最后，一个至关重要的问题是问责机制。人工智能全生命周期错综复杂，涉及模型构建者、部署者、用户等多方主体，各方均须承担相应责任以规避风险。

在座各位应对这样一个案例并不陌生：三星数名员工因将机密源代码粘贴至ChatGPT以检查错误，无意间泄露了敏感信息。我想，我们也意识到，这种情况同样发生在我们自己的工作场所——有时，同事为了做拼写检查，或核查自己表达想法的方式，可能会将文件上传至ChatGPT。这不禁令人思考：文件中是否存在不宜与ChatGPT共享的内容。

这是否应由员工负责——因为他们本不应将敏感信息输入聊天机器人？我想，在座大多数同仁认为员工确实负有一定责任。

但应用提供商是否也有责任确保其设置了充分的防护措施，以防止敏感数据被收集？

抑或，模型开发者是否应负责确保此类数据不被用于进一步训练？

对此，恐怕没有简单的答案。

要使人工智能持续进步，我们需要多种类型的解决方案——从改善组织流程到开发新的风险缓解技术。技术解决方案，例如隐私增强技术（即PETs，在不损害隐私的前提下优化数据使用），已作为应对上述关切的可行路径而逐渐浮现。

过去3年，IMDA和PDPC运营了PET沙盒，鼓励企业在多个行业和应用场景中探索和试验隐私增强技术（PETs）的使用。我们看到各方兴趣日益增长，部分早期采用者也已获得切实的商业回报。

例如，加入沙盒的金融机构Ant International与其数字钱包合作伙伴联合训练AI模型，采用多种不同PETs的组合，在双方互不披露客户信息的前提下完成训练。其目的是利用该模型，将钱包合作伙伴提供的优惠券与Ant International中最有可能使用这些优惠券的客户进行匹配。Ant International提供了其客户的优惠券兑换数据，数字钱包公司则提供了同一批客户的购买历史、偏好及人口统计数据。AI模型分别基于两份数据集进行训练，任何一方的数据所有者均无法查看或获取对方的数据。这使优惠券领取数量大幅提升；钱包合作伙伴的收入得以增加，Ant International的客户参与度也得到了提升。

可以看出，这种使用PETs的方式拥有众多应用场景，例如用于欺诈检测，或帮助医疗机构更好地照护患者。

合成数据是另一种颇具前景的PET示例。去年，我发布了PDPC的《合成数据生成指南》，为各组织制定了最佳实践规范。新加坡目前已涌现出Betterdata等富有创新精神的公司，帮助AI开发者生成能够模拟真实世界数据集的数据。这些合成数据可进一步扩充现有数据集，作为构建AI模型的训练数据集，在一定程度上缓解了我此前提到的数据挑战。

我们与沙盒中各组织的合作经验，使我们能够更深入地了解相关技术，以及这些技术在数据共享时保护个人数据、履行法律义务方面的能力。这也让我们充分感受到技术提供商在提供PETs解决方案方面日益增长的兴趣，以及有意使用PETs的企业群体。

为延续这一势头，IMDA将推出《PETs采用指南》。该指南专为C级高管设计，将提供相关资源，帮助各组织根据业务需求甄选合适的PETs，同时还将涵盖企业有效部署PETs的关键考量事项。

今年的个人数据保护周将再度举办PETs峰会。与去年首次举办时相似，本届峰会将为数据保护机构、现有及有意向的PETs解决方案提供商以及沙盒用户提供相互交流、共同学习的良好契机。

正如PETs沙盒所展示的，新加坡对待新兴技术的方式，是为企业提供工具、资源和安全的试验环境，并迅速分享所得经验，使行业和消费者从中受益。

近期，IMDA、AI Verify Foundation与行业合作伙伴共同参与了一项全球AI可信性试点项目，研究测试生成式AI应用可靠性的方法。测试是证明AI应用已有效应对关键风险的重要步骤。

我们日常使用的许多物品，例如家中的电器、载我们上下班的交通工具——若未经过严格测试，我们绝不会使用它们。然而，AI应用程序每天都在被用于我们身上，却未经过适当的测试。这是一个漏洞，一个亟待填补的严重空白。

其中一个例子是樟宜综合医院，该院与第三方测试机构Softserve合作，对其用于特定医疗报告的摘要工具的可靠性进行了测试。能够生成可与其他医生共享的病例或患者摘要，对减轻医生的工作量大有裨益。如何确保该摘要工具可靠、准确且不歪曲患者信息，至关重要。

另一个例子是NCS，该公司测试了其编码助手对内部编码标准、安全要求以及外部监管指引的遵循程度。

基于本次试点所获得的经验，IMDA已甄选出多种测试方法，供各组织用于风险测试与管理。这套测试方法汇编被称为"IMDA入门套件"。这是对企业诉求的直接回应——企业希望在治理框架和指南之外，获得更为标准化的AI应用测试与部署方式。该套件涵盖对不良内容和意外数据披露等风险的测试，正如我此前所描述的情形。

随着IMDA将试点项目过渡至全新的、持续运营的AI可信性沙盒，学习与迭代仍在继续。该沙盒是一个学习环境，旨在帮助我们所有人——无论是商业用户、治理团队还是AI开发者——共同开发解决方案，例如为生成式AI应用建立更完善的护栏或流程。欢迎有意测试自身应用、并为共享知识库做出贡献的组织加入。

归根结底，我们设立这些沙盒的目标，是就数据保护或AI治理领域何为"良好标准"达成联盟共识。

与产品安全或制药等传统领域颇为相似，我们需要主题专家就应坚守的标准达成共识，并需要测试人员向我们保证这些标准得到了切实执行。

鉴于AI采用的速度和规模，制定并达成标准共识具有一定的紧迫性。现实而言，这将需要时间，其中有许多阶段需要经历。至少在新加坡，我们已迈出关键的第一步，致力于培育测试与可信性生态系统。我们希望行业参与者能够加入我们，共同推动"软性"标准的形成，为最终建立正式标准奠定基础。

数据保护领域起步较早，我很高兴地宣布，我们已准备好迈出下一步。

IMDA已与Enterprise SG及新加坡认可理事会合作，将数据保护信任标志（DPTM）提升为新的新加坡标准——新加坡标准714。能够证明其数据保护实践具有问责性的公司，现可申请获得该新标准的认证；该标准将为希望展示卓越数据保护能力的企业设立国家基准。该信任标志将向消费者保证，获得认证的组织在保护个人数据方面采用了世界级的实践。

我希望已为各位呈现了新加坡在应对以数据推动AI进步过程中所面临的挑战与机遇方面的整体思路。

我们相信，当AI以负责任的方式开发并以可靠的方式部署时，企业和民众将大有裨益，包括在释放数据价值的各种方法上。作为企业和政府的领导者，我们有责任理解如何做到这一点，并落实正确的举措。

如此一来，我们不仅将促进AI的广泛采用，更将激发社会对数据和AI治理的更大信心。在此，祝各位在接下来的讨论中收获丰硕成果。非常感谢。

英文原文

MDDI 官网原始记录 · 抓取日期: 2026-06-21

Good morning, colleagues and friends. I’d first like to thank everyone for being here. We have over 1,500 people in the room today, and over 2,000 coming and going throughout the week, including from many countries in Asia, and even further afield. I especially appreciate our international guests for joining us, including Data Protection Authorities from fellow ASEAN member states. Thank you all for being here.

The theme for this year is “data protection in a changing world”. This is an acknowledgement of the significant changes in both our global operating environment, as well as in the world of technology.

These twin forces have disrupted our workplaces, our homes, and our relationships with each other. It is inevitable that we must adjust our practices, laws and even our broader social norms.

Most of you in this room are practitioners of data or AI, or both.

Last year, I had spoken about the importance of data in the age of AI. This remains as pertinent as ever. We all know that generative AI models are built on vast amounts of data, and data is critical throughout the AI development lifecycle, from pre-training, to fine-tuning, to testing and validation.

In recent times, we have seen an explosion of sector-specific AI applications built on customised or proprietary datasets.

A good example is AskMax, Changi Airport’s chatbot that helps to address passenger queries. It runs on a LLM designed to call on Changi Airport’s data repositories.

Another example is GPT-Legal, which was finetuned by IMDA using the Singapore Academy of Law’s LawNet database.

Given the criticality of data in the AI age, it should not be surprising that data has also become a limiting factor to continuing advancement.

Let us walk through the data challenges at each stage of AI development and use.

In model training, the first well-known issue is the use of internet data to train these large models. Internet data is uneven in quality. Often, they contain biased or toxic content from different sources, including user-generated content on discussion forums. When the underlying data input contains harmful, toxic or biased content, this can lead to downstream problems with model outputs.

In the first regional red teaming challenge run jointly by Singapore IMDA and eight other countries, problematic model behaviours were observed. When asked to write a script about Singaporean inmates, the LLM chose names such as “Kok Wei” for a character jailed for illegal gambling, “Siva” for disorderly drunk and “Razif” for a drug abuse offender. These stereotypes, most likely picked up from the training data, are actually things that we want to avoid.

At the same time, developers are running out of internet data. Most of the LLMs are already trained on the entire corpus of internet data. What then should model providers do to improve their models? They are turning to more sensitive and private databases to augment their models, which brings its own set of challenges.

OpenAI, for example, has a growing list of data-related partnerships not only with global news outlets, but also governments, companies and universities like the Icelandic Government, Apple, Sanofi and Arizona State University.

The partnership model is one way of increasing data availability, but it is time-consuming and difficult to scale. Some of these databases may include sensitive data such as personal data or business confidential information.

Increasingly, we need a way to train models, while protecting sensitive information.

AI application, or ‘app’, which can be seen as the ‘skin’ that is layered on top of AI models, can also pose reliability concerns. If apps provide inaccurate, bias or toxic information, or leak confidential information, these can have serious implications for the company’s reputation, and in the worst cases, may actually cause physical harm.

Typically, companies would employ a range of well-known guardrails to make their app reliable. These include writing detailed system prompts to steer the model behaviour, using retrieval-augmented generation (or RAG), which many of you are familiar with, to improve accuracy or different types of filters to sieve out sensitive information.

Even then, apps can have unexpected shortcomings. Vulcan, a third-party tester, recently tested a high-tech manufacturer’s chatbot that assists employees to answer questions on product specifications that are posed by prospective customers. The manufacturer was concerned that the app would inadvertently leak confidential business information, for example, telling the prospective customers something that they do not want the prospective customers to know. True enough, Vulcan found that when prompted in Mandarin, the app leaked backend sales commission rates. You can imagine, from the manufacturer’s point of view, telling the prospective customers what the sales commission rates are is basically revealing how much further they can cut the price – and it is not something any business wants.

Fortunately, this problem was discovered during the testing phase. This highlights the value of independent testing. To ensure the reliability of GenAI apps before release, it is important to have a systematic and consistent way to check that the app is functioning as intended, and there is some baseline safety.

Like model developers, app developers must deal with data inadequacies. Very often, the models are linked up with internal company databases so that the apps can cater to the businesses’ specific needs. However, there are often insufficient proprietary data to build reliable apps. 42% of respondents to an IBM global survey cited this as one of their biggest challenges to AI adoption. So, we need a way to unlock more data-sharing among companies while protecting sensitive information.

After AI apps are deployed and used by consumers, correcting erroneous or harmful information poses a significant challenge. The process of finetuning and retraining a model – after it has “learnt” something – is imprecise and often costly.

Machine unlearning has therefore become a new field, albeit a nascent one. A key challenge faced by LLM leaders like Anthropic is that models now have billions or trillions of parameters. Which variables contribute most to the shortcomings in output? Are there techniques to identify them and carry out targeted model corrections at scale?

Finally, an overriding concern is accountability. The AI lifecycle is complex, with model builders, deployers, users and more. Each has a role to play to mitigate the risks.

This community here would be familiar with the case of a group of Samsung employees who unintentionally leaked sensitive information by pasting confidential source code into ChatGPT to check for errors. I think we are aware that this is happening in our workplaces too – sometimes our colleagues, in order to do a spell check, or to check the way in which they have put across ideas, may upload a file on to ChatGPT. This makes you wonder if there is anything in the file that should not be shared with ChatGPT.

Is it the responsibility of the employees who should not have put sensitive information into the chatbot? I think most of our colleagues here believe they have some responsibility.

But is it also the responsibility of the app provider to ensure that they have sufficient guardrails to prevent sensitive data from being collected?

Or should model developers be responsible for ensuring that such data is not used for further training?

There are no easy answers to this, I’m afraid.

For AI to continue advancing, we will need various types of solutions – from organisational process improvements to developing new techniques in risk mitigation. Technical solutions, such as Privacy Enhancing Technologies – or PETs that optimise the use of data without compromising privacy – have emerged as a viable pathway for addressing these concerns.

In the last 3 years, the IMDA and PDPC have run the PET Sandbox to encourage businesses to explore and experiment with the use of PETs across a variety of sectors and use cases. We have seen growing interest and some early adopters have also experienced tangible business returns.

For instance, Ant International, a financial institution that joined the Sandbox, used a combination of different PETs to train an AI model with their digital wallet partner without disclosing customer information to each other. The intention was to use the model to match vouchers offered by the wallet partner with customers of Ant International, who were most likely to use them. Ant International contributed voucher redemption data of their customers, while the digital wallet company contributed purchase history, preference and demographic data of the same customers. The AI model was trained separately with both datasets, without each data owner seeing or ingesting the other’s data. This led to a vast improvement in the number of vouchers claimed; the wallet partner increased its revenues, while Ant International enhanced its customer engagement.

You can see that this way of using PETs has many use cases, for example in detecting fraud, or in allowing healthcare institutions to do a better job of taking care of their patients.

Synthetic Data is another example of a PET that shows good promise. Last year, I launched PDPC’s Guide on Synthetic Data Generation, which sets out best practices for organisations. There are now innovative companies in Singapore, such as Betterdata, that help AI developers generate data to mimic real-world datasets. These synthetic data can further augment existing datasets as training datasets to build AI models, which goes some way to addressing the data challenges I had referred to earlier.

Our experience with organisations in the Sandbox has allowed us to better understand the technologies, their ability to protect personal data and comply with legal obligations when such data is shared. It has also given us a good sense of the growing interest from technology providers in offering PET solutions, as well as companies who are keen to use PETs.

To build on this momentum, IMDA will be introducing a PETs Adoption Guide. Designed for C-suite executives, this guide will offer resources to help organisations identify the right PETs for their business needs and will also include key considerations for companies to effectively deploy PETs.

This year’s Personal Data Protection Week will once again include the PETs Summit. Similar to last year when it was held for the first time, the Summit will be a good opportunity for data protection authorities, existing and interested PETs solution providers, and users in the Sandbox to connect and learn more from one another.

As demonstrated in the PETs Sandbox, Singapore’s approach towards emerging technologies is to help provide tools, resources, and a safe environment for companies to experiment, and to quickly share the learnings so that industries and consumers can benefit.

Recently, IMDA, AI Verify Foundation and industry partners collaborated on a Global AI Assurance pilot, studying ways to test the reliability of generative AI applications. Testing is a critical step to demonstrate that the AI application has addressed key risks.

A lot of the things that we use on a day-to-day basis, such as the appliances in our homes, the vehicles that take us to the workplace – we would not use them if they had not been properly tested. And yet, on a day-to-day basis, AI applications are being used on us without having been properly tested. So this is a lacuna, a serious gap that needs to be filled.

One example is Changi General Hospital, which worked with third party tester Softserve to test the reliability of their summarisation tool for selected medical reports. It is incredibly helpful to doctors and their workloads, to be able to put together case or patient summaries that can be shared with other physicians. How we ensure that this summarisation tool is reliable, accurate and does not misrepresent the patient, is of utmost importance.

Another is NCS, which tested how well its coding assistant adhered to internal coding standards and security requirements, as well as external regulatory guidelines.

With insights from this pilot, IMDA has identified several testing methods that organisations can use to test for and manage risks. This compilation of testing methods is known as the “IMDA Starter Kit”. It is a direct response to companies’ requests to go beyond governance frameworks and guidelines, for more standardised ways to test and deploy AI applications. It includes testing for risks like undesirable content and unintended data disclosure, like those I described earlier.

The learning and iterating continue as IMDA transitions its pilot to a new, ongoing AI Assurance Sandbox. The Sandbox is a learning environment to help all of us – whether we are business users, governance teams, AI developers – to jointly develop solutions, like better guardrails or processes for gen AI applications. Organisations interested in putting their applications to the test and contributing to our shared knowledge base are welcome to join.

Ultimately, our aim with each of these Sandboxes is to find coalition and consensus around what good looks like, whether for data protection or AI governance.

Much like traditional fields of product safety or pharmaceuticals, we need subject matter experts to agree on the standards to uphold, and testers to assure us that the standards are being met.

Given the speed and scale of AI adoption, there is some urgency for standards to be developed and agreed to. Realistically, this will take time. There are many stages to go through. In Singapore at least, we have taken the critical first steps to grow the ecosystem for testing and assurance. Our hope is that industry players will join us to initiate ‘soft’ standards that can be the basis for the eventual establishment of formal standards.

The field of data protection has had a head start, and I am pleased to share that we are ready to take the next step.

IMDA has worked with Enterprise SG and the Singapore Accreditation Council to elevate the Data Protection Trustmark (DPTM) to a new Singapore Standard, Singapore Standard 714. Companies that demonstrate accountable data protection practices can now apply to be certified under this new Standard, which will set the national benchmark for companies that want to demonstrate data protection excellence. The Trustmark will assure consumers that certified organisations adopt world-class practices in protecting their personal data.

I hope I have given you a sense of Singapore’s approach to dealing with the challenges and opportunities in using data for AI advancement.

We believe there is much for businesses and people to gain when AI is developed responsibly and deployed reliably, including the methods for unlocking data. It is up to us as leaders in corporations and the government to understand how we can do so, and to put in place the right measures.

By doing so, not only will we facilitate AI adoption, we will also inspire greater confidence in data and AI governance. On that note, I wish you fruitful discussions in the days ahead. Thank you very much.

MDDI 官网原文