
追踪
────────────────────
[01]AI Agent 2026
- 2026年AI行业的核心变革是从被动对话转向主动做事。AI Agent智能体通过一句话指令即可自主拆解任务、执行并交付,相比传统逐句提问方式,工作效率提升超50%。
工具
────────────────────
[01]Microsoft's open-source SkillOpt automatically upgrades AI agent skills without touching model weights
https://venturebeat.com/orchestration/microsofts-open-source-skillopt-automatically-upgrades-ai-agent-skills-without-touching-model-weights
- 微软开源SkillOpt工具可自动优化AI代理技能,无需修改模型权重,提升企业应用适配效率。
[02]Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks
https://venturebeat.com/technology/xiaomis-new-open-source-agentic-ai-coding-harness-mimo-code-beats-claude-code-at-ultra-long-200-step-tasks
- 小米开源MiMo Code编程助手在超长多步骤任务上超越Claude Code,展现强大的代码生成能力。
[03]Deezer's new tool can identify AI music from Spotify, Apple Music, and others
https://techcrunch.com/2026/06/11/deezers-new-tool-can-identify-ai-music-from-spotify-apple-music-and-others
- Deezer推出AI音乐识别工具,可扫描多平台播放列表检测AI生成的音乐。
研究
────────────────────
[01]From Explicit Elements to Implicit Intent: A Predefined Library for Auditable Behavioral Inference
- SemantiClean框架从电商会话数据中提取语义信号,驱动购买意图识别与客户分类等推理任务。
[02]Position: Hippocampal Explicit Memory Is the Cornerstone for AGI
- 研究论证显式记忆是LLM迈向通用人工智能的基石,提出关键技术方向。
[03]Can AI Agents Synthesize Scientific Conclusions?
- 引入SciConBench评估AI代理在医疗等高危领域综合科学结论的能力。
[04]Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents
- 提出自门控澄清机制,帮助代理在分层推理中识别缺失信息并主动求证。
[05]Automated Mediator for Human Negotiation: Pre-Mediation via a Structured LLM Pipeline
- 利用结构化LLM管道自动执行谈判前调解,降低成本同时提升协议效率。
[06]Restless bandits with imperfect binary feedback: PCL-indexability analysis and computation
- 基于部分守恒律分析频谱感知中的赤手空拳问题,解决感知误差下的决策优化。
[07]To Intervene or Not: Guiding Inference-time Alignment with Probabilistic Model Blending
- 通过概率模型混合在推理时指导LLM对齐,降低成本提升安全性。
[08]Dual-Stance Evaluation of Sycophancy: The Structure of Agreement and the Limits of Intervention
- 双立场评估方法检验激活转向减少谄媚同时保留事实正确的回应。
[09]Few-Shot Resampling for Scalable Statistically-Sound Data Mining
- 少样本重采样方法加速数据挖掘中的统计显著性评估,提升可扩展性。
[10]ProHiFlo: Hierarchical Flow Matching with Functional Guidance for De Novo Protein Generation
- 分层流匹配框架引入功能约束指导,改进蛋白质从头设计的精准性。
行业动态
────────────────────
[01]The Download: soccer's data renaissance and China's big nuclear plans
https://www.technologyreview.com/2026/06/11/1138809/the-download-soccer-football-data-analytics-china-nuclear-power/
- 足球数据分析与中国核电扩张成为本周科技热点,展现AI在体育和能源领域的应用。
[02]Google DeepMind is worried about what happens when millions of agents start to interact
https://www.technologyreview.com/2026/06/11/1138794/google-deepmind-is-worried-about-what-happens-when-millions-of-agents-start-to-interact/
- 谷歌DeepMind关注数百万AI代理交互的潜在风险,投资AGI对齐研究。
[03]Job titles of the future: Nature's drug designer
https://www.technologyreview.com/2026/06/11/1138502/job-titles-natures-drug-designer-tim-cernak/
- 默克化学家利用AI设计精准疗法,开创药物设计新职业方向。
[04]Inside soccer's data renaissance
https://www.technologyreview.com/2026/06/11/1138506/inside-soccer-data-renaissance-jesse-davis/
- 数据分析正改变足球战术决策,AI帮助球队发现传统认知外的最优策略。
[05]Why China is betting on big nuclear reactors
https://www.technologyreview.com/2026/06/11/1138789/china-big-nuclear-reactors/
- 中国核电装机容量翻番至60GW,大型反应堆成为能源转型主力。
[06]Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
https://venturebeat.com/data/context-compression-finally-works-in-production-new-research-cuts-llm-input-16x-without-the-accuracy-hit
- 新型上下文压缩技术实现16倍压缩率,解决生产环境中的计算瓶颈。
[07]What AI benchmarks miss about real-world performance
https://venturebeat.com/orchestration/what-ai-benchmarks-miss-about-real-world-performance
- AI基准测试存在局限性,实际生产性能受存储计算路径等因素制约。
[08]Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes
https://venturebeat.com/technology/googles-diffusiongemma-generates-256-tokens-in-parallel-and-self-corrects-as-it-goes
- 谷歌DiffusionGemma实现并行生成256个令牌并边生成边自正,加速文本生成。
[09]Theker just raised $85M to build the factory robot that doesn't specialize in anything
https://techcrunch.com/2026/06/11/theker-just-raised-85m-to-build-the-factory-robot-that-doesnt-specialize-in-anything/
- Theker融资8500万美元开发可重配置工厂机器人,打破单一专用设计局限。
[10]Jeff Bezos's Prometheus raises $12B to build an 'artificial general engineer' for the physical world
https://techcrunch.com/2026/06/11/jeff-bezoss-prometheus-raises-12b-to-build-an-artificial-general-engineer-for-the-physical-world/
- 贝索斯Prometheus公司融资120亿美元开发物理世界通用工程AI,估值410亿。
[11]SpaceX officially prices shares at $135 in the largest IPO ever
https://techcrunch.com/2026/06/11/spacex-officially-prices-shares-at-135-in-the-largest-ipo-ever/
- SpaceX定价135美元启动历史最大规模IPO,震撼资本市场。
[12]SpaceX SPV investors won't know their true holdings until post-IPO lock-ups lift
https://techcrunch.com/2026/06/11/spacex-spv-investors-wont-know-their-true-holdings-until-post-ipo-lock-ups-lift/
- SpaceX特殊目的公司低层投资者面临隐性费用和锁定期延迟支付风险。
[13]Apple's Camera Chief Thinks AI Can Give You Superpowers
https://www.wired.com/story/apple-camera-chief-thinks-ai-can-give-you-superpowers/
- 苹果相机主管认为iOS 27生成式拍照功能能赋予用户AI超能力。
[14]Why You Might Already Own SpaceX Shares, Siri's AI Makeover, and Knicks Owner's Surveillance Machine
https://www.wired.com/story/uncanny-valley-podcast-why-you-might-already-own-spacex-shares-siri-ai-makeover-knicks-owner-surveillance-machine/
- SpaceX IPO、Siri升级与监控技术成为本周科技新闻焦点。
[15]Meet the OpenAI Engineer Leading ChatGPT's Biggest Transformation Yet
https://www.wired.com/story/model-behavior-interview-with-openai-codex-lead-tibo-sottiaux/
- OpenAI工程师Tibo Sottiaux推动ChatGPT重大升级,编程功能成快速增长业务。
[16]Grok Is Still Hosting Sexualized Deepfakes of Famous Women
https://www.wired.com/story/grok-is-still-hosting-sexualized-deepfakes-of-famous-women/
- Grok平台仍存在非法深度伪造内容,涉及名人与政界人士的不当图像。
[17]Anthropic Walks Back Policy That Could Have 'Sabotaged' AI Researchers Using Claude
https://www.wired.com/story/anthropic-responds-to-backlash-on-claudes-secret-sabotage-on-ai-research/
- Anthropic撤回可能秘密限制Claude竞争能力的政策,回应研究社区反对。
付费速览
────────────────────
[01]Claude Code (参考)
- 计费模式:按用量计费(Usage-based)
- 方案说明:通过 Anthropic API 计费,无独立订阅,按 token 消耗付费
- 价格详情:Claude Sonnet 4: $3/M input, $15/M output;Claude Opus 4: $15/M input, $75/M output
- 官网:查看最新定价
[02]Cursor (参考)
- 计费模式:订阅制 + 用量计费
- 方案说明:Hobby 免费(2000次/月补全);Pro $20/月(无限补全 + 500次高级请求);Business $40/用户/月
- 价格详情:高级模型(GPT-4o、Claude)按请求计费,超出套餐后按 token 收费
- 官网:查看最新定价
[03]Trae (实时)
- 计费模式:免费 + 订阅制
- 方案说明:根据提供的页面内容,Trae 的最新付费方案如下:
- Trae 付费方案(中文简洁描述)
- Trae 提供四档付费方案:Lite($3/月)包含基础使用和无限自动完成;Pro($10/月,新用户7天免费)增加IDE功能和10个并发任务;Pro+($30/月)提供3.5倍用量和15个并发任务;Ultra($100/月)提供20倍用量、模型早期访问和20个并发任务。年付可优惠25%。所有方案支持多种支付方式。
- 价格详情:字节跳动出品,国内版免费额度较高,海外版定价参考官网
- 官网:查看最新定价
[04]Codex (参考)
- 计费模式:ChatGPT 订阅内含 + API 按 token 计费
- 方案说明:包含在 ChatGPT Plus($20/月)、Pro($200/月)、Teams($25/用户/月) 中;也可 pay-as-you-go
- 价格详情:codex-mini-latest API:$1.50/M input,$6/M output;Plus 用户有任务次数限制
- 官网:查看最新定价
[05]Qoder (参考)
- 计费模式:订阅制(Credits 体系)
- 方案说明:Pro / Pro+ / Ultra 三档;折扣已于 2026-04-30 结束,恢复原价
- 价格详情:Teams:$40/用户/月(3000 Credits/月);个人版三档按月订阅,具体价格见官网
- 官网:查看最新定价
夜雨聆风