Researchers from the Center for AI Safety (CAIS), MIT’s Media Lab, the Brazilian university UFABC, and the pandemic prevention non-profit SecureBio have found that leading artificial intelligence models can outperform experienced, PhD-level virologists in troubleshooting complex laboratory procedures.
The findings, detailed in a new study introducing the Virology Capabilities Test (VCT), demonstrate AI’s proficiency in specialized scientific tasks but also highlight serious dual-use concerns, suggesting these tools could lower the barrier for creating dangerous biological agents.
The VCT benchmark, consisting of 322 questions and detailed further in its research paper, was designed specifically to measure an AI’s ability to assist with intricate ‘wet lab’ virology protocols, assessing fundamental, visual, and tacit understanding – the kind of practical know-how often gained through hands-on lab experience.
The results showed OpenAI’s o3 model achieved 43.8% accuracy, substantially exceeding the 22.1% average scored by specialized human virologists answering questions within their fields. Google’s Gemini 2.5 Pro also performed strongly, scoring 37.6%. According to the VCT analysis, o3’s performance surpassed 94% of the human experts on tailored question subsets.
AI Virologist Chatbots Pose Dual-Use Dilemma
This emergent AI capability – providing expert-level guidance for sensitive lab work – presents a clear dual-use scenario: useful for accelerating legitimate research but potentially dangerous if misused. Seth Donoughe, a SecureBio research scientist and study co-author, conveyed his apprehension to TIME, stating the findings made him “little nervous.”
He elaborated on the historical context: “Throughout history, there are a fair number of cases where someone attempted to make a bioweapon—and one of the major reasons why they didn’t succeed is because they didn’t have access to the right level of expertise… So it seems worthwhile to be cautious about how these capabilities are being distributed.”
Reflecting this, the VCT researchers propose that this AI skill warrants inclusion within governance frameworks designed for dual-use life science technologies.
The VCT findings spurred immediate calls for action from safety advocates. Dan Hendrycks, director of the Center for AI Safety, stressed the need for immediate action, urging AI companies to implement robust safeguards within six months, calling inaction “reckless.”
He advocated for tiered or gated access controls as a potential mitigation strategy. “We want to give the people who have a legitimate use for asking how to manipulate deadly viruses—like a researcher at the MIT biology department—the ability to do so,” Hendrycks explained to TIME. “But random people who made an account a second ago don’t get those capabilities.”
Industry Responses and Calls for Oversight
Having been briefed on the VCT results months ago, AI developers have reacted differently. xAI, Elon Musk’s company, in February, published a risk management framework acknowledging the paper and mentioning potential virology safeguards for its Grok model, such as training it to decline harmful requests.
OpenAI stated it “deployed new system-level mitigations for biological risks” for its recently released o3 and o4-mini models, including specific measures like “blocking harmful outputs.”
This measure reportedly resulted from a “thousand-hour red-teaming campaign in which 98.7% of unsafe bio-related conversations were successfully flagged and blocked.” Red-teaming is a common security practice involving simulated attacks to find vulnerabilities. Anthropic, another leading AI lab, acknowledged the VCT results in its system documentation but offered no specific mitigation plans, while Google declined to comment on the matter to TIME.
However, some experts believe self-policing by the industry isn’t sufficient. Tom Inglesby from the Johns Hopkins Center for Health Security advocated for governmental policy and regulation. “The current situation is that the companies that are most virtuous are taking time and money to do this work, which is good for all of us, but other companies don’t have to do it,” he told TIME, adding, “That doesn’t make sense.” Inglesby proposed mandatory evaluations for new large language models before their release “to make sure it will not produce pandemic-level outcomes.”
AI’s Expanding Footprint in Scientific Research
The VCT results are not an isolated incident but rather a stark data point within a broader landscape where AI is rapidly integrating into specialized scientific fields. OpenAI, creator of the top-performing o3 model, was already known to be exploring biological applications; Winbuzzer reported in January on its collaboration with Retro Biosciences using a model named GPT-4b Micro to optimize proteins involved in stem cell creation.
Similarly, Google DeepMind has been highly active. Besides the Gemini model family, its widely used AlphaFold program predicts protein structures, while an “AI Co-Scientist” project, detailed in February, aims to generate novel scientific hypotheses, sometimes mirroring unpublished human research.
Microsoft entered the fray in February with BioEmu-1, a model focused on predicting the dynamic movement of proteins, complementing AlphaFold’s static predictions. These tools, focusing on protein engineering, hypothesis generation, and molecular simulation, illustrate AI’s expanding role, moving beyond data analysis toward complex scientific reasoning and procedural assistance – amplifying both the potential scientific gains and the safety challenges highlighted by the VCT.
23 Comments
Hello to all, how is everything, I think every one is getting more from this web page, and your views are fastidious in support of new users.
Experienced Seattle chauffeurs
Hi there I am so excited I found your web site, I really found you by mistake, while I was searching on Digg for something else, Anyways I am here now and would just like to say thank you for a fantastic post and a all round thrilling blog (I also love the theme/design), I don’t have time to go through it all at the moment but I have bookmarked it and also added your RSS feeds, so when I have time I will be back to read a lot more, Please do keep up the great work.
http://tm-marmelad.com.ua/holovni-kryteriyi-vyboru-avto-linz-dlya-riznykh-mo.html
Hey I know this is off topic but I was wondering if you knew of any widgets I could add to my blog that automatically tweet my newest twitter updates. I’ve been looking for a plug-in like this for quite some time and was hoping maybe you would have some experience with something like this. Please let me know if you run into anything. I truly enjoy reading your blog and I look forward to your new updates.
https://esco-center.com.ua/headlight-sealing-for-motorcycles.html
ゼントレーダーで始める|Zenブログで深める賢い投資ライフ
最近、投資をよりシンプルに、そして効率的に始めたいと考える人々の間で注目されているのが「ゼントレーダー(Zentrader)」です。登録から取引までが非常にスムーズで、初心者にもわかりやすい設計が魅力です。
特に「Zenブログ」では、投資に関する基本知識から実践的な取引戦略まで、幅広い情報が発信されており、学びながら実践できる環境が整っています。日々のマーケット動向やトレンドも分かりやすく解説されているため、忙しい人でも効率よく情報収集が可能です。
さらに、投資初心者がつまずきやすいポイントやリスク管理の方法についても、https://social-consulting.jp/ にて実践的なアドバイスが紹介されています。投資を「感覚」ではなく「戦略」として考えたい方におすすめです。
поручень пристенный купить Поручни для лестниц – это неотъемлемая часть любой лестничной конструкции, обеспечивающая поддержку и комфорт при использовании лестницы.
https://www.med2.ru/story.php?id=147094
трип скан Tripskan – это ваш персональный гид по миру, готовый ответить на любые вопросы и помочь в любой ситуации.
https://sonturkhaber.com/
Производство шариковых подшипников Оптовые закупки подшипников непосредственно у производителей – это оптимальное решение для крупных потребителей, сочетающее в себе экономическую выгоду и уверенность в качестве продукции.
игровой компьютер виндовс 11 : Игровой компьютер для игр: Оптимально для гейминга.
tripscan top На сайте Tripscan вы найдете все необходимое для идеального отдыха: авиабилеты, отели, аренда автомобилей, экскурсии и многое другое. Мы предлагаем широкий выбор услуг, чтобы сделать ваше путешествие максимально комфортным и беззаботным.
Америка Специальная военная операция (СВО) стала водоразделом, разделившим миропорядок на “до” и “после”. Политика, как искусство компромисса и достижения согласия, подверглась серьезному испытанию. Переговоры на высшем уровне, с участием Владимира Путина и Владимира Зеленского, стали хрупкой надеждой на деэскалацию конфликта и поиск мирного решения. Финансовая система столкнулась с беспрецедентным давлением. Европа, Азия и Америка ощутили на себе последствия санкций, роста цен на энергоносители и нарушения логистических цепочек. Безопасность и оборона стали приоритетом номер один. Государства пересматривают военные доктрины, инвестируют в новые технологии и укрепляют альянсы. Кавказ и Ближний Восток, регионы и без того нестабильные, оказались в эпицентре геополитического шторма. Новости и аналитика играют ключевую роль в формировании общественного мнения. Однако в условиях информационных войн возрастает ответственность журналистов и аналитиков за объективность и непредвзятость.
обычные истории с людьми Таинственные истории, случившиеся с обычными людьми Иногда жизнь подбрасывает нам загадки, объяснить которые рационально невозможно. Истории о встречах с непознанным, о предчувствиях и знаках судьбы заставляют нас задуматься о границах реальности и о том, что существует за пределами нашего понимания. Они наполнены тревогой и любопытством, заставляя нас верить в чудеса.
поездки в дагестан экскурсии Что посмотреть в КБР
литература Литература – голос эпохи
wood fence on chain link posts Wood Picket Fence Price Per Foot Classic fencing for most home owners.
обучение кайтсёрфингу
Hello there! This is kind of off topic but I need some help from an established blog. Is it very difficult to set up your own blog? I’m not very techincal but I can figure things out pretty quick. I’m thinking about creating my own but I’m not sure where to begin. Do you have any tips or suggestions? Cheers
https://http-kra38.cc/
продаю буксик
https://www.bondhuplus.com/read-blog/221981
CDM15-9FSWPR Насос вертикальный многоступенчатый 7,5 кВт, 3×380 В, 50 Гц, чугун, 120*С
https://github.com/awsadm/AWS-CLI/releases
программа для учета дохода дома программа для учета заработной платы Оптимизируйте процесс начисления заработной платы и ведения кадрового учета. Наша программа позволит вам автоматизировать расчеты, формировать отчетность и соблюдать требования законодательства. Больше времени на развитие, меньше на рутину!