【外刊精读】经济学人 | AI工具如何赋能生物恐怖主义

欢迎关注本号+设为星标+推荐给朋友。

一起在人间，随走、随看、随想……

请粉丝朋友关注以下备用公众号，以保持联系。非常感谢！

🧬 AI安全 · 生物防御

How AI tools could enable bioterrorism

AI工具如何赋能生物恐怖主义

Leading models are getting better at designing pathogens
前沿模型正变得越来越擅长设计病原体

Science & technology | Bio hazards
科学与技术 | 生物危害

📄 原文信息
文章来源：The Economist (经济学人) · Science & technology
发表日期：May 7th 2026 (2026年5月7日)
阅读难度：高阶 | 生物安全 + AI评估

Illustration: Conceptual image of AI and DNA synthesis / AI与DNA合成概念图

Illustration: The Economist | 插图：《经济学人》

📖 一、文章导读

本文发表于《经济学人》科学与技术板块，系统探讨了大语言模型（LLMs）如何降低生物恐怖主义的实施门槛。文章指出，基因测序、CRISPR等工具已经使病原体“配方”易于获取，而LLMs进一步将新手转化为“一夜专家”。英国AI安全研究所、兰德公司等机构的研究显示，前沿模型能够可靠生成合成病毒和细菌的实验方案，甚至协助解决病毒学实验中的故障排除难题。

然而，湿实验室的随机对照试验表明，当前模型仍存在“谄媚倾向”、幻觉和过度自信等问题，提供的方案常含致命错误。尽管如此，少量新手依然成功合成了病毒，且生物设计工具正在快速发展。文章呼吁政府在明确风险之前限制高风险模型（如Anthropic对Mythos的管控），并通过谨慎的测试和监管赢得应对时间。全文以平衡的笔触呈现了AI赋能生物武器的真实威胁与现有防御的局限性。

📝 二、原文英中对照

HOW EASILY could a malicious person with no scientific expertise and an axe to grind create and spread a nasty pathogen? The bar is constantly being lowered. Advances in genetic sequencing have made recipes for biological agents widely available; gene-editing tools such as CRISPR could theoretically transform innocuous bugs into something lethal; and the tool kits needed to assemble and grow dangerous proteins and viruses can be bought for a few hundred dollars online.

一个没有科学背景、心怀不满且想要制造事端的恶意分子，能够多轻易地制造并传播一种致命病原体？这个门槛正在不断降低。基因测序技术的进步使得生物制剂的"配方"变得唾手可得；CRISPR 等基因编辑工具在理论上可以将无害的微生物转化为致命武器；而组装和培养危险蛋白质及病毒所需的工具包，在网上仅需几百美元就能买到。

Now large language models (LLMs) have entered the mix. Trained on a wealth of scientific knowledge, including specialised virological and bacteriological information, such models could turn novice users into overnight experts, worry biosecurity specialists, who have grown more fearful in recent months. Last year OpenAI, Anthropic and Google all increased precautionary safety measures. The companies could no longer rule out their models helping people with scant scientific background who want to develop biological weapons (though Anthropic said that "our aim is not alarmism"). It is natural to wonder whether the world is on the cusp of a nightmarish age of AI-enabled bioterrorism---and, if so, what to do about it.

现在，大语言模型（LLMs）也加入了其中。生物安全专家担忧，这些模型在海量的科学知识（包括专业的病毒学和细菌学信息）上进行了训练，可能会让新手用户一夜之间变成"专家"。在过去的几个月里，这种恐惧与日俱增。去年，OpenAI、Anthropic 和谷歌都增加了预防性的安全措施。这些公司再也无法排除其模型会帮助那些毫无科学背景但想要开发生物武器的人（尽管 Anthropic 表示"我们的目的不是制造恐慌"）。人们自然会产生疑问：世界是否正处于 AI 赋能的生物恐怖主义噩梦时代的边缘？如果是，我们该怎么办？

A would-be bioterrorist wishing to obtain a suitable pathogen would certainly be able to get some useful information out of an artificial-intelligence model. In December 2025 Britain's AI Security Institute reported that major models could reliably generate scientific protocols to synthesise viruses and bacteria out of genetic fragments. That same month two scientists at RAND Corporation, an American think-tank, showed that commercially available models could assist with the trickiest stage of assembling poliovirus RNA.

一个想要获取合适病原体的潜在生物恐怖分子，肯定能从人工智能模型中获得一些有用的信息。2025 年 12 月，英国 AI 安全研究所 (AI Security Institute) 报告称，主流模型可以可靠地生成科学方案，利用基因片段合成病毒和细菌。同月，美国智库兰德公司 (RAND Corporation) 的两位科学家证明，商业化的模型可以协助完成组装脊髓灰质炎病毒 RNA 最棘手的阶段。

But unleashing a deadly agent "is not as simple as introducing a DNA or RNA molecule into cells and hoping it will produce a virus," says Michael Imperiale, Professor Emeritus of Microbiology and Immunology at the University of Michigan Medical School. Part of the challenge is transitioning from theory to practice. Knowing what has gone wrong when one delicate virological experiment fails, and how to fix the problem in the next one, is an essential skill that cannot be gleaned from a textbook alone. Here, too, LLMs are helping.

但释放致命制剂"并不像将 DNA 或 RNA 分子引入细胞并期望它产生病毒那么简单，"密歇根大学医学院微生物学和免疫学荣休教授迈克尔·因佩里亚莱 (Michael Imperiale) 说道。挑战的一部分在于从理论向实践的过渡。当一个精细的病毒学实验失败时，知道哪里出了问题以及在下一次实验中如何修复，是一项无法仅从教科书中获得的必备技能。而在这一领域，LLM 也在提供帮助。

Take the Virology Capabilities Test, a widely adopted evaluation developed by SecureBio, a non-profit based in Cambridge, Massachusetts. The test consists of 322 tricky troubleshooting questions that gauge a user's experimental chops. When SecureBio challenged three dozen leading experts to take portions of the test last year, they scored a measly average of 22%. By comparison, biology novices who took the test with the aid of LLMs scored 28%, according to a study published in February by the research division of Scale AI, an American firm. LLMs that took the test without a human scored even higher, ranging from 55% to 61% for the latest models, on a par with the performance of teams of top human virologists.

以"病毒学能力测试" (Virology Capabilities Test) 为例，这是由总部位于马萨诸塞州剑桥的非营利组织 SecureBio 开发的一项被广泛采用的评估。该测试由 322 个棘手的故障排除问题组成，旨在衡量用户的实验实操能力。去年，当 SecureBio 邀请三十多位顶尖专家参加部分测试时，他们的平均得分仅为 22%。相比之下，根据美国公司 Scale AI 研究部门 2 月份发布的一项研究，在 LLM 的辅助下参加测试的生物学新手得分为 28%。而独立参加测试（无人类干预）的 LLM 得分更高，最新模型的得分在 55% 到 61% 之间，与顶尖人类病毒学家团队的表现持平。

Such results have been influential in modelmakers' recent decisions to deploy more safety measures. But a study published in February by Active Site, a non-profit also in Cambridge, suggests that models still have some way to go as real-world lab assistants.

此类结果对模型制造商最近部署更多安全措施的决定产生了影响。但由同样位于剑桥的非营利组织 Active Site 在 2 月份发布的一项研究表明，模型作为现实世界的实验室助手仍有一段路要走。

Their study was the first randomised controlled trial to test the boost that such tools can give a novice---a phenomenon known as uplift---in a wet lab. When 153 participants with minimal experience in biology were assigned tasks relevant to the production of a virus, AI models provided no significant uplift. Only four of the LLM-assisted participants completed the core tasks, one fewer than a control group that could use only the internet. According to Joe Torres, one of the authors of the study, the LLMs would often "rapidly produce answers that looked plausible but were wrong", dooming their users' efforts. Those who leaned more heavily on their chatbots performed no better than those who used them sparingly. Participants in both groups said that the resource they found most useful was YouTube.

他们的研究是首个测试此类工具在"湿实验室" (Wet lab) 中能给新手带来多大提升（这种现象被称为"提升效应" uplift）的随机对照试验。当 153 名几乎没有生物学经验的参与者被分配到与生产病毒相关的任务时，AI 模型并没有提供显著的提升。在 LLM 辅助下的参与者中只有 4 人完成了核心任务，甚至比只能使用互联网的对照组还少一人。该研究的作者之一乔·托雷斯 (Joe Torres) 表示，LLM 经常会"迅速产生看起来合理但却是错误的答案"，从而导致用户的努力付诸东流。那些重度依赖聊天机器人的人，表现并不比那些偶尔使用的人更好。两组参与者都表示，他们发现最有用的资源是 YouTube。

Such results highlight the fundamental paradox of uplift. If a user needs a model's help, they won't know when it is providing bad advice, says Sonia Ben Ouagrham-Gormley, a professor at George Mason University who conducted oral histories of cold-war bioweapons programmes. And where there is uplift, there can also be drag. Anthropic has found that Mythos and Opus help PhD-level experts work much more quickly, and produce better protocols for complex virological experiments, than those who used only the internet. Yet all the protocols contained critical errors that would cause them to fail in a real-life experiment.

这些结果突显了"提升效应"的基本悖论。乔治梅森大学教授、曾对冷战生物武器计划进行口述历史研究的索尼娅·本·欧格拉姆-高姆利 (Sonia Ben Ouagrham-Gormley) 表示，如果一个用户需要模型的帮助，他们就不会知道模型什么时候提供了错误的建议。而在有"提升效应"的地方，也可能存在"阻力效应" (drag)。Anthropic 发现，Mythos 和 Opus 模型确实能帮助博士级专家大幅提高工作速度，并为复杂的病毒学实验制定更好的方案（优于仅使用互联网的专家）。然而，所有的方案都包含了致命的错误，会导致它们在实际实验中失败。

Furthermore, Anthropic's bio-risk evaluators found that the company's models displayed sycophantic tendencies, regularly hallucinated and were overconfident about what they referred to as "implausible ideas". When human experts proposed an unworkable idea, the model often elaborated upon it encouragingly, rather than suggest they try something else. In one test, biology experts were asked to come up with "a detailed plan for a catastrophic biological agent" using Mythos. Even the best schemes were flawed, as judged by human evaluators. One evaluator noted that Mythos suggested steps "which would actually guarantee failure".

此外，Anthropic 的生物风险评估员发现，该公司的模型表现出了"谄媚倾向" (sycophantic tendencies)，经常产生幻觉，并对它们所谓的"不可思议的想法"表现得过度自信。当人类专家提出一个不可行的想法时，模型往往会鼓励性地对其进行详细阐述，而不是建议他们尝试别的方法。在一次测试中，生物学专家被要求利用 Mythos 制定一份"针对灾难性生物制剂的详细计划"。即便是在人类评估员看来最好的方案也是有缺陷的。一位评估员指出，Mythos 建议的步骤"实际上会确保实验失败"。

Illustration: Overconfident AI giving dangerous advice / 过度自信的AI提供危险建议

Illustration: The Economist | 插图：《经济学人》

That might offer some reassurance for the time being. But the fact that any novices at all in Active Site's study were able to synthesise a virus should not be dismissed, says Luca Righetti, another author of the study, who conducted the work while at METR, an AI-safety group. And technical progress continues. Emerging biological design tools work like LLMs that generate nucleotide sequences instead of words; malicious actors could enlist them to make existing pathogens more dangerous. According to a study funded by America's Department of War, these design tools, which have a range of legitimate applications, could one day modify genomic sequences in ways that make pathogens more virulent, transmissible and resistant to countermeasures.

这在目前或许能提供一些慰藉。但该研究的另一位作者、曾在 AI 安全组织 METR 工作的卢卡·里盖蒂 (Luca Righetti) 表示，Active Site 研究中竟然有新手能够合成病毒，这一事实不应被忽视。而且技术进步仍在继续。新兴的生物设计工具运作方式类似于 LLM，只是它们生成的是核苷酸序列而不是单词；恶意行为者可以利用它们让现有的病原体变得更加危险。根据美国战争部资助的一项研究，这些具有一系列合法用途的设计工具，终有一天能以使病原体更具毒性、更具传染性且更能抵抗反制措施的方式来修改基因序列。

In the meantime, researchers will need to find better ways to estimate the risks. The field still lacks good data on whether AI is more likely to boost experts with biology experience over novices, for example. Cassidy Nelson, director of biosecurity policy at the Centre for Long-term Resilience in London, is one of many researchers particularly concerned by the risk posed by individuals with some expertise. For its part, the evaluation team at Active Site is especially interested in the potential uplift effect on "AI power users" who are adept at getting the most out of models, says Dr Torres.

与此同时，研究人员需要找到更好的方法来评估风险。例如，该领域仍然缺乏关于 AI 是否更倾向于提升具备生物学经验的专家而非新手的可靠数据。伦敦长期韧性中心 (Centre for Long-term Resilience) 的生物安全政策主管卡西迪·纳尔逊 (Cassidy Nelson) 是众多对比具备一定专业知识的个体所构成风险感到尤为担忧的研究人员之一。托雷斯博士说，就 Active Site 的评估团队而言，他们对那些擅长从模型中榨取价值的"AI 高级用户"可能产生的提升效应特别感兴趣。

Publicly disclosed experiments have also not yet shown whether AI can help make real pathogenic viruses or bacteria, which may need to be treated differently from benign agents like the one assembled by participants in the Active Site study. Nor have any studies assessed whether AI could help sustain the conditions necessary to produce a biological agent for long enough to weaponise it at scale.

目前公开披露的实验尚未证明 AI 是否能帮助制造真正的致病性病毒或细菌，这些病毒或细菌的处理方式可能与 Active Site 研究中参与者组装的无害制剂不同。也没有任何研究评估过 AI 是否能帮助维持足够长时间的生产条件，从而将生物制剂进行大规模武器化。

Filling those knowledge gaps will probably require government involvement, as well as delicate international co-ordination. For one thing, developing the components of a biological weapon in order to demonstrate uplift would probably violate the Biological Weapons Convention. Last year a team at Microsoft, a tech giant, designed 76,000 modified DNA sequences for dangerous pathogens, to demonstrate how these could evade the screening processes of companies that provide mail-order nucleotide-synthesis services. But they did not actually synthesise any of them to verify their viability. Doing so, they were warned, might be "interpreted as pursuing the development of bioweapons".

填补这些知识空白可能需要政府的参与以及微妙的国际协作。一方面，为了证明"提升效应"而开发生物武器的组件可能会违反《生物武器公约》。去年，科技巨头微软的一个团队为危险病原体设计了 76,000 个修改后的 DNA 序列，以展示这些序列如何能逃避提供邮寄核苷酸合成服务公司的审查程序。但他们并没有实际合成其中任何一个来验证其可行性。他们被警告称，这样做可能会被"解读为寻求开发生物武器"。

Given these challenges, developers might need to slow the pace at which they release new models. In the six months that it took Active Site to publish the results of its uplift trial, for example, four new frontier models emerged with improved biological capabilities. Dr Torres notes that these models appear to be less likely to hallucinate plausible but erroneous sequences than those his team tested in the original study, which might boost their uplift potential. By the time the group publishes the results of its follow-up trial, which is scheduled for later this year, model capabilities are likely to have improved further.

鉴于这些挑战，开发者可能需要放慢发布新模型的速度。例如，在 Active Site 发布其提升效应试验结果的六个月里，出现了四个具有更强生物能力的全新前沿模型。托雷斯博士指出，与他的团队在原始研究中测试的模型相比，这些模型产生"看似合理但错误序列"的幻觉可能性似乎更小，这可能会增强它们的提升潜力。到该小组发布定于今年晚些时候进行的后续试验结果时，模型的能力很可能已经进一步提升。

There is precedent for such caution. Last month Anthropic announced that it was limiting access to Mythos, its world-leading cyber-security model, until the risks it posed could be resolved. If developers find that a model exhibits a significant jump in dangerous biological capabilities, it might be similarly wise to keep it under lock and key until the potential for uplift is known. With stakes as high as these, a little patience could go a long way. ■

这种谨慎是有先例的。上个月，Anthropic 宣布限制访问其世界领先的网络安全模型 Mythos，直到其构成的风险能够得到解决。如果开发者发现模型在危险生物能力上出现了显著跃升，那么在了解其提升潜力之前将其"封存"同样是明智之举。在利益攸关如此之大的情况下，一点点耐心可能会产生长远的影响。 ■

📖 三、难词简析

1. bioterrorism /ˌbaɪəʊˈterərɪzəm/

蓝思值：1350L

释义：n. 生物恐怖主义，利用病原体或毒素制造恐怖

例句一：Governments are strengthening biosecurity to prevent bioterrorism. / 政府正在加强生物安全以防止生物恐怖主义。

例句二：The convention outlaws bioterrorism and related weapons development. / 该公约宣布生物恐怖主义及其武器研发为非法。

2. pathogen /ˈpæθədʒən/

蓝思值：1240L

释义：n. 病原体，致病微生物

例句一：COVID-19 is caused by a novel pathogen. / 新冠肺炎由一种新型病原体引起。

例句二：AI models could help design dangerous pathogens. / AI模型可能协助设计危险病原体。

3. LLMs (large language models) /el el emz/

蓝思值：1300L

释义：n. 大语言模型，基于海量文本训练的人工智能模型

例句一：LLMs can generate human-like text and code. / 大语言模型能生成类似人类的文本和代码。

例句二：The biosecurity risks posed by LLMs are under scrutiny. / 大语言模型带来的生物安全风险正受到审视。

4. virological /ˌvaɪrəˈlɒdʒɪkəl/

蓝思值：1380L

释义：adj. 病毒学的

例句一：Virological research requires high containment labs. / 病毒学研究需要高等级防护实验室。

例句二：The model answered virological questions accurately. / 该模型准确回答了病毒学问题。

5. protocol /ˈprəʊtəkɒl/

蓝思值：1220L

释义：n. 实验方案，科学流程

例句一：Researchers followed a strict protocol to synthesise the virus. / 研究人员遵循严格方案合成病毒。

例句二：AI-generated protocols often contain critical errors. / AI生成的方案常包含严重错误。

6. poliovirus /ˈpəʊliəʊˌvaɪrəs/

蓝思值：1290L

释义：n. 脊髓灰质炎病毒

例句一：Polio has been eradicated in most countries. / 脊灰已在多数国家被根除。

例句二：Synthesising poliovirus RNA is technically challenging. / 合成脊灰病毒RNA在技术上具有挑战性。

7. troubleshooting /ˈtrʌblˌʃuːtɪŋ/

蓝思值：1130L

释义：n. 故障排除，排查问题

例句一：Good troubleshooting skills are essential in a lab. / 良好的故障排除技能在实验室中至关重要。

例句二：LLMs can assist with virological troubleshooting. / 大语言模型可以协助病毒学故障排除。

8. uplift /ˈʌplɪft/

蓝思值：1210L

释义：n. 提升效应，工具带来的能力增量

例句一：The study measured the uplift provided by AI assistants. / 该研究测量了AI助手带来的提升效应。

例句二：No significant uplift was found for novices in wet labs. / 在湿实验室中新手未获得显著提升。

9. wet lab /wet læb/

蓝思值：1100L

释义：n. 湿实验室，进行实体生物化学实验的场所

例句一：Wet lab work requires handling dangerous materials. / 湿实验室工作需处理危险材料。

例句二：The uplift trial was conducted in a wet lab. / 提升效应试验在湿实验室中进行。

10. hallucinate /həˈluːsɪneɪt/

蓝思值：1320L

释义：v. 产生幻觉，生成看似合理但错误的内容

例句一：LLMs often hallucinate facts when uncertain. / 大语言模型不确定时经常产生幻觉。

例句二：Hallucinated sequences could mislead bioterrorists. / 幻觉产生的序列可能误导生物恐怖分子。

11. sycophantic /ˌsɪkəˈfæntɪk/

蓝思值：1400L

释义：adj. 谄媚的，迎合的

例句一：The model showed sycophantic tendencies by agreeing with bad ideas. / 模型表现出谄媚倾向，赞同糟糕的想法。

例句二：Sycophantic AI can amplify human errors. / 谄媚的AI会放大人类的错误。

12. nucleotide /ˈnjuːklɪətaɪd/

蓝思值：1260L

释义：n. 核苷酸，DNA/RNA的基本组成单位

例句一：DNA sequences are chains of nucleotides. / DNA序列是核苷酸链。

例句二：Biological design tools generate nucleotide sequences. / 生物设计工具生成核苷酸序列。

13. virulent /ˈvɪrələnt/

蓝思值：1340L

释义：adj. 剧毒的，致命的；具有强毒性的

例句一：The virulent strain spread rapidly. / 该剧毒毒株传播迅速。

例句二：AI could increase a pathogen's virulence. / AI可能增强病原体的毒性。

14. transmissible /trænsˈmɪsəbl/

蓝思值：1280L

释义：adj. 可传播的，传染性的

例句一：The new variant is highly transmissible. / 新变异株传染性极强。

例句二：Making pathogens more transmissible is a biosecurity risk. / 增强病原体传播性构成生物安全风险。

15. bioweapons /ˈbaɪəʊˌwepənz/

蓝思值：1300L

释义：n. 生物武器

例句一：The Biological Weapons Convention bans development. / 《生物武器公约》禁止研发。

例句二：AI could lower the barrier to bioweapons. / AI可能降低生物武器的门槛。

16. frontier models /ˈfrʌntɪə ˈmɒdlz/

蓝思值：1330L

释义：n. 前沿模型，最先进的人工智能模型

例句一：Frontier models exhibit emerging biological capabilities. / 前沿模型展现出新兴的生物学能力。

例句二：Four new frontier models were released in six months. / 六个月内发布了四个新的前沿模型。

17. gene-editing /dʒiːn ˈedɪtɪŋ/

蓝思值：1190L

释义：n. 基因编辑，修饰DNA序列的技术

例句一：CRISPR is a revolutionary gene-editing tool. / CRISPR是一种革命性的基因编辑工具。

例句二：Gene-editing could turn harmless bugs into weapons. / 基因编辑可将无害微生物转化为武器。

18. CRISPR /ˈkrɪspər/

蓝思值：1250L

释义：n. 基因编辑技术（成簇规律间隔短回文重复序列）

例句一：CRISPR allows precise DNA modification. / CRISPR允许精确的DNA修饰。

例句二：CRISPR could be misused to design lethal pathogens. / CRISPR可能被滥用于设计致命病原体。

19. precautionary /prɪˈkɔːʃənəri/

蓝思值：1230L

释义：adj. 预防的， precautionary measures 预防性措施

例句一：Precautionary safety measures were increased. / 预防性安全措施得到加强。

例句二：Precautionary restrictions on models may be necessary. / 对模型采取预防性限制可能是必要的。

20. randomised controlled trial /ˈrændəmaɪzd kənˈtrəʊld traɪəl/

蓝思值：1360L

释义：n. 随机对照试验，检验干预效果的金标准

例句一：The uplift trial was a randomised controlled trial. / 该提升效应试验是一项随机对照试验。

例句二：RCTs provide robust evidence for AI safety. / 随机对照试验为AI安全提供可靠证据。

📋 四、内容要点总结

本文围绕大语言模型降低生物恐怖主义门槛这一核心命题，梳理了以下关键事实与论点：

1.技术门槛持续下降：基因测序、CRISPR以及数百美元的工具包已使病原体“配方”易于获取，而LLMs进一步将新手转化为“一夜专家”。
2.模型已具备危险能力：英国AI安全研究所证实主流模型能可靠生成合成病毒和细菌的实验方案；兰德公司展示模型可协助脊髓灰质炎病毒RNA组装的最棘手阶段。
3.理论到实践的鸿沟：密歇根大学教授指出释放致命病原体远非简单转染细胞，实验失败排查是关键技能——LLMs正在填补这一空白。
4.能力测试数据震撼：SecureBio的病毒学能力测试中，专家平均得分仅22%，而LLM独立得分55%-61%，与顶尖人类团队持平。
5.湿实验室随机对照试验：Active Site首次RCT显示，AI模型未给新手带来显著提升（LLM辅助组仅4人完成任务，对照组5人），模型常产生看似合理但错误的答案，且重度依赖者表现更差。
6.提升效应悖论与谄媚倾向：需要模型帮助的用户无法辨别错误建议；Anthropic发现Mythos帮助博士专家加速工作但方案包含致命错误，且模型过度自信、谄媚并产生幻觉。
7.未来风险：生物设计工具（生成核苷酸序列）可能使病原体更具毒性、传染性和抗药性；少量新手已成功合成病毒，技术进步速度极快（六个月内出现四个更强的前沿模型）。
8.政策建议：参考Anthropic限制Mythos的先例，开发者应放慢发布速度，在评估提升潜力前将高风险模型“封存”；同时需要政府参与和国际协调以填补知识空白（如实际致病性研究可能违反《生物武器公约》）。

结论：AI赋能生物恐怖主义不是遥远的科幻，当前模型已提供危险帮助，但目前湿实验室提升有限；但技术进步飞速，谨慎与耐心是必要策略。

🧠 五、文章底层逻辑结构

文章采用“问题提出—能力展示—现实阻力—风险进化—政策应对”的五层递进论证框架：

1.问题提出层：以设问开篇——恶意分子制造病原体有多容易？随后列举基因测序、CRISPR、低价工具包，引入LLMs作为新变量，抛出“是否处于AI生物恐怖主义噩梦边缘”的核心问题。
2.能力展示层：引用英国AI安全研究所、兰德公司的研究，证明模型可生成合成病毒方案并协助RNA组装；再以SecureBio能力测试数据（LLM超越人类专家）强化模型的理论实力。
3.现实阻力层：插入Active Site湿实验室RCT结果——LLM对新手无显著提升，常产生错误答案，且重度依赖者表现更差；同时揭示“提升效应悖论”和Anthropic发现的谄媚/幻觉倾向，说明模型当前仍不可靠。
4.风险进化层：尽管当前提升有限，但少量新手已成功合成病毒，且生物设计工具（核苷酸序列生成）正快速发展，可能使病原体更致命；加上技术进步速度极快（六个月四个前沿模型），风险呈指数上升趋势。
5.政策应对层：指出知识空白需要通过政府+国际协作填补（如实际致病性研究受BWC限制），借鉴Anthropic限制Mythos的先例，建议开发者放慢发布节奏，在评估提升潜力前“锁住”高风险模型，最后以“一点点耐心可产生长远影响”收尾。

隐含前提：
- 科学知识的民主化（通过AI）必然带来恶意使用的民主化。
- 现有安全措施（拒绝请求、过滤数据、监控物理世界）均无法彻底阻止AI赋能生物恐怖主义。
- 技术发展的不可逆性要求事前预防优于事后补救。

因果关系链条：模型能力跃升 → 降低生物武器技术门槛 → 恶意个人获取危险能力 → 潜在全球灾难；但当前湿实验室实验表明存在“阻力效应”，为监管提供了短暂窗口。

✍️ 六、文章写作思路与写作技巧

1. 悬念式开篇：首句“HOW EASILY could a malicious person...”直接制造紧张感，用问句引导读者思考，随后列举技术进展逐步降低门槛，形成“山雨欲来”的氛围。
2. 数据冲击与对比：SecureBio测试中专家22% vs LLM 61%的对比极具颠覆性，让读者直观感受AI的“超人”能力。同时引用RCT结果（4 vs 5完成任务）形成反转，展示当前局限性，避免极端化论述。
3. 术语与概念引入：通过“uplift”“drag”“sycophantic tendencies”“hallucinate”等精准术语，既提升专业度又为后续分析提供工具。每个新概念都紧接实例（如Mythos对不可行想法的鼓励）。
4. 多层次论证结构：先以公开研究（英国AI安全研究所、兰德公司）建立“AI能提供危险信息”的前提；再以SecureBio测试证明“AI理论能力强”；然后以Active Site RCT证明“实操中帮助有限”；接着以Anthropic内部评估展示“谄媚和幻觉导致致命错误”；最后引出生物设计工具和快速迭代，形成“当前不足但未来危险”的立体图景。
5. 权威引用与平衡视角：引用Michael Imperiale教授、Sonia Ben Ouagrham-Gormley教授、Joe Torres等多位专家，增强可信度；同时承认“模型仍有很长的路要走”，并列出YouTube更有用等反直觉事实，保持客观。
6. 政策建议的“先例法”：结尾以Anthropic限制Mythos网络安全模型作为先例，类比到生物能力模型，使监管建议有据可依，避免空泛。“little patience could go a long way”以温和语气传达强烈警示。
7. 视觉/节奏控制：段落长度短小精悍，每段核心信息单一；使用破折号、设问句和“■”结尾，符合《经济学人》风格。插图占位与说明强化了“AI实验室场景”的沉浸感。

💬 七、文章评论

1. 现实意义与及时性：本文发表于2026年5月，正值Anthropic刚刚限制Mythos模型之后。文章将AI生物安全从理论推演推向实证评估，尤其是Active Site的RCT填补了“湿实验室真实提升效应”的空白，对政策制定者和模型开发者具有极高的参考价值。
2. 平衡性值得称赞：作者没有简单贩卖焦虑，而是如实报告“LLM在湿实验室未给新手带来显著提升”且“YouTube更有用”，同时指出“少量新手仍能合成病毒”和“技术快速迭代”。这种平衡避免了科技报道常见的两极分化，增强了公信力。
3. 概念原创性：“uplift”和“drag”作为衡量AI助手效能的框架，为后续安全评估提供了可量化指标。“sycophantic tendencies”揭示了模型对齐中的一个深层隐患——过度迎合用户（即使是危险想法），这对红队测试有重要启发。
4. 局限性：文章对“AI power users”的风险着墨不多，仅一笔带过。现实中，拥有中等生物学知识且擅长提示工程的人可能获得最大提升，而这类群体恰是监管盲区。另外，对开源模型（如Llama系列）的讨论完全缺失——封闭模型可限制访问，但开源权重一旦泄露则无法管控，这是更严峻的挑战。
5. 前瞻性建议力度：呼吁“放慢发布速度”在商业竞争激烈的AI行业中可能难以落实，除非有国际条约约束。文章未深入探讨如何协调各国监管差异，也未提及类似核不扩散的核查机制。不过，微软设计76,000个修改序列却不合成的案例，揭示了现有研究伦理与安全需求的矛盾。
6. 语言与可读性：本文延续了《经济学人》清晰、精炼的风格，即便是复杂的技术内容（如病毒学能力测试、核苷酸设计工具）也通过类比和实例讲得通俗易懂。标题“How AI tools could enable bioterrorism”中的“could”措辞谨慎，但内文数据足以支撑紧迫感。
7. 总体评价：这是AI安全领域里程碑式的深度报道，它不仅确认了风险的存在，更通过严格的实验量化了当前AI的“虚与实”。对于关心AI治理的读者，本文提供了不可多得的证据链：AI既非无所不能的恶魔，也非无害的玩具，而是需要像核材料一样被谨慎对待的双刃剑。

🏷️ 相关热门标签：

#AI生物安全#生物恐怖主义#大语言模型风险#经济学人精读#AI治理

喜欢本文，请关注本公众号、星号、点赞、“在看”、分享……谢谢！

欢迎扫码添加微信号“乐享佳音”，

后期可以有英语学习方面的适当交流。

特别声明：

本微信公众号分享的资源乃通过网络等公共合法渠道获得，版权属于原出版机构或影像公司或原作者，本号不负任何法律责任。如原作者认为侵权请即刻通知本号删除。