LinuxDo
Anthropic正式推出新一代超大规模模型Claude Fable 5(通用可用)和Claude Mythos 5(受限访问),性能超越前代,敏感查询可回退至Opus 4.8。(多家报道)
推荐理由:行业格局级发布,直接影响AI应用开发者和研究者的模型选择策略。
LinuxDo
Anthropic正式推出新一代超大规模模型Claude Fable 5(通用可用)和Claude Mythos 5(受限访问),性能超越前代,敏感查询可回退至Opus 4.8。(多家报道)
推荐理由:行业格局级发布,直接影响AI应用开发者和研究者的模型选择策略。
OpenAI News
OpenAI计划收购云环境公司Ona,以增强Codex平台的安全持久云端环境能力,支持长时间运行的AI Agent在企业工作流中部署。
推荐理由:重大企业级基础设施收购,预示AI Agent从原型走向生产的关键一步。
GitHub Trending
苹果开源Container工具,使用Swift编写,在Apple Silicon上通过轻量虚拟机创建和管理Linux容器,优化Mac开发体验。
推荐理由:苹果官方开源项目,直接提升Mac开发者的容器使用效率,可立即上手。
GitHub Trending
addyosmani发布Agent Skills项目,提供面向生产环境的AI编码Agent技能集,帮助开发者快速构建可靠Agent。
推荐理由:直接可用的生产级技能库,提升Agent开发效率,推荐Agent工程师使用。
Claude Blog
Claude官方博客详解如何使用Managed Agents构建Agent交互界面,涵盖架构设计与最佳实践。
推荐理由:官方指南,对希望部署Claude Agent的开发者有直接指导作用。
Hugging Face Blog
Hugging Face博客发布PyTorch性能分析系列第二篇,详细讲解如何将nn.Linear层融合以优化MLP性能。
推荐理由:实操性强的性能优化教程,对PyTorch开发者提升模型推理效率很有价值。
MIT Tech Review AI
Google DeepMind正资助研究数百万AI Agent同时在线交互的潜在危险,由安全团队负责人Rohin Shah主导。
推荐理由:关注Agent生态安全的重要前瞻性讨论,对业界有警示意义。
DeepMind Blog
Google DeepMind推出DiffusionGemma模型,采用扩散机制实现文本生成速度提升4倍,主打高效推理。
推荐理由:性能突破性模型,对部署成本和效率有显著影响,值得关注。
Hacker News
FablePool允许用户为AI提示词众筹资金,平台公开构建相应应用,获Hacker News社区84分关注。
推荐理由:新颖的众筹模式降低AI应用构建门槛,适合创意验证。
Anthropic Research
Anthropic发布研究进展,探索AI Agent在生物学领域的应用潜力,包括实验设计和数据分析。
推荐理由:跨领域应用研究,对生物信息学从业者有启发性。
Swift · ★ 32,241 · 🍴 905 · 📈 2,419 stars today
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
中文介绍 苹果开源的 Linux 容器运行工具,利用轻量级虚拟化技术在 Mac(特别是 Apple Silicon)上创建和运行 Linux 容器。以 Swift 编写,为开发者提供原生 macOS 环境下的容器化开发与测试能力。
Shell · ★ 54,591 · 🍴 5,933 · 📈 3,275 stars today
Production-grade engineering skills for AI coding agents.
中文介绍 为 AI 编码代理提供生产级工程技能的集合。帮助开发者将高质量编程实践注入 AI 代理,提升代码生成、调试与维护能力,适用于自动化开发流程。
Python · ★ 2,725 · 🍴 268 · 📈 427 stars today
open-source healthcare ai
中文介绍 开源医疗 AI 项目,聚焦于医疗领域的智能应用。为研究人员和开发者提供构建医疗诊断、数据分析等 AI 工具的基础框架。
★ 16,161 · 🍴 1,682 · 📈 1,944 stars today
PM Skills Marketplace: 100+ agentic skills, commands, and plugins — from discovery to strategy, execution, launch, and growth.
中文介绍 产品经理技能市场,包含 100+ 代理技能、命令和插件,覆盖从发现、策略到执行、发布和增长的全流程。帮助 PM 通过 AI 代理自动化项目管理任务。
Python · ★ 2,594 · 🍴 208 · 📈 308 stars today
Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.
中文介绍 NVIDIA 出品的 AI 代理技能安全扫描器,用于检测漏洞、恶意模式和风险。帮助开发者在集成第三方 AI 技能前评估安全状况,保护系统安全。
Python · ★ 32,573 · 🍴 2,386 · 📈 665 stars today
🕵️♂️ Collect a dossier on a person by username from 3000+ sites
中文介绍 通过用户名从 3000+ 个网站收集人员档案的开源工具,用于开源情报(OSINT)调查。适用于安全研究、背景调查和社交工程评估。
★ 139,844 · 🍴 34,615 · 📈 369 stars today
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI, VSCode Agent, Warp.dev, Windsurf, Xcode, Z.ai Code, Dia & v0. (And other Open Sourced) System Prompts
中文介绍 收集了 Augment Code、Claude Code、Cursor、Devin AI、Perplexity、Replit 等众多 AI 工具的系统和模型提示词。方便开发者了解不同工具的内部配置与行为。
TypeScript · ★ 15,365 · 🍴 1,059 · 📈 604 stars today
Desktop app to manage markdown knowledge bases
中文介绍 用于管理 Markdown 知识库的桌面应用。提供本地化的笔记组织、检索与协作能力,适合个人知识管理和技术文档维护。
Shell · ★ 224,756 · 🍴 19,980 · 📈 1,323 stars today
An agentic skills framework & software development methodology that works.
中文介绍 一个代理技能框架与软件开发方法论。提供可复用的 AI 代理技能和开发流程,帮助团队高效构建和部署 AI 驱动的软件项目。
Go · ★ 34,133 · 🍴 1,785 · 📈 33 stars today
Fast, secure, efficient backup program
中文介绍 快速、安全、高效的备份程序,支持加密、去重和增量备份。适用于个人和服务器数据保护,支持多种后端存储(本地、云服务等)。
Shell · ★ 111,483 · 🍴 18,246 · 📈 1,235 stars today
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes, and proven deliverables.
中文介绍 一个完整的 AI 代理机构,包含前端专家、社交媒体运营、创意注入等多种专业代理。每个代理拥有独特个性与流程,可协同完成复杂任务。
Go · ★ 5,661 · 🍴 518 · 📈 510 stars today
Advanced DNS tunneling VPN for censorship bypass, optimized beyond DNSTT and SlipStream with low-overhead ARQ, resolver load balancing, high packet-loss stability and speed.
中文介绍 先进的 DNS 隧道 VPN 工具,用于绕过网络审查。优化了原有 DNSTT 和 SlipStream 技术,具备低开销 ARQ、解析器负载均衡和高丢包稳定性,提升速度与可靠性。
Ruby · ★ 30,332 · 🍴 7,503 · 📈 191 stars today
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
中文介绍 开源即时通讯、邮件支持和全渠道客服平台,可作为 Intercom、Zendesk 等商业工具的替代方案。支持多渠道消息聚合、自动化回复和客户管理。
Go · ★ 1,602 · 🍴 172 · 📈 98 stars today
Local-first session intelligence and analytics for coding agents, supporting Claude Code, Codex, and more than 20 other agents. Also: 100x faster replacement for ccusage!
中文介绍 面向编码代理的本地化会话智能与分析工具,支持 Claude Code、Codex 等 20+ 代理。提供 100 倍于 cusage 的性能提升,帮助开发者监控和优化代理行为。
★ 7,926 · 🍴 2,411 · 📈 131 stars today
张雪峰.skill — 张雪峰的认知操作系统。高考志愿/考研/职业规划的实战思维框架。由女娲.skill生成。
中文介绍 张雪峰认知操作系统,聚焦高考志愿、考研和职业规划的实战思维框架。由女娲.skill 生成,适用于教育咨询和人生规划场景。
Roff · ★ 73,914 · 🍴 16,537 · 📈 345 stars today
所有小初高、大学PDF教材。
中文介绍 收录中国小学、初中、高中和大学各阶段的 PDF 教材。方便学生、教师和研究人员获取教学资源。
Python · ★ 1,243 · 🍴 156 · 📈 177 stars today
SIA is a Self Improving AI framework to autonomously improve the performance of any AI system (Model / Agent) on a benchmark task.
中文介绍 自我改进 AI 框架(SIA),能够自主提升任何 AI 系统(模型或代理)在基准任务上的表现。通过自动化优化流程,适用于模型调优和代理效果提升。
TypeScript · ★ 37,284 · 🍴 8,696 · 📈 26 stars today
Mattermost is an open source platform for secure collaboration across the entire software development lifecycle..
中文介绍 开源协作平台,专为软件开发生命周期中的安全沟通设计。提供即时消息、文件共享、集成工具链等功能,是 Slack 的自托管替代方案。
Kotlin · ★ 46,728 · 🍴 8,002 · 📈 243 stars today
翻墙-科学上网
中文介绍 提供翻墙(科学上网)相关资源和指南的集合。包含工具、教程和配置方法,帮助用户绕过网络限制访问互联网。
👍 3
There are two main Parameter-Efficient Fine-Tuning (PEFT) techniques for Large Language Models (LLMs). While Low-Rank Adaptation (LoRA) introduces additional weights between the LLM layers, Soft Prompting introduces additional fine-tuning-specific raw tokens to an LLM input. However, both require mo
中文介绍 提出ART(基于艺术强化训练)方法,用于多模态大语言模型的参数高效微调,融合LoRA和软提示两种技术。
👍 1
Modern LLM training pipelines increasingly rely on other models to generate data, filter corpora, judge outputs, and guide development decisions. These dependencies are recursive: a model may depend on an upstream artifact whose own dependencies are documented only in separate releases and artifacts
中文介绍 研究现代大语言模型训练中依赖其他模型生成数据、过滤语料等递归性依赖关系,并提出审计方法。
👍 14
Vision-language models (VLMs) project images into hundreds to thousands of visual tokens, making decoder inference expensive in both attention computation and KV-cache memory. Existing visual-token reduction methods largely follow a rank-and-remove paradigm: they score visual tokens, keep a compact
中文介绍 提出可恢复视觉令牌路由方法,用于视觉语言模型解码时通过重路由而非删除减少视觉令牌,降低计算和KV缓存开销。
👍 2
Large language models (LLMs) are widely used to tackle complex tasks with autonomous workflows. Recently, reusable natural language skills have emerged as a popular paradigm to inject procedural knowledge into LLM applications. Since popular skills are often invoked repeatedly, placing their full te
中文介绍 提出自适应多分辨率程序知识压缩方法,将可复用的自然语言技能注入大语言模型,提升复杂任务中程序知识注入效率。
👍 17
Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-Constrained Decoding (GCD) has been widely adopted to improve the reliability of LLM-generated code by enforcing syntactic validity. In this
中文介绍 研究发现语法约束解码(GCD)可被利用使大语言模型生成恶意代码,揭示该技术在安全防护中的漏洞。
👍 3
Remaining Useful Life (RUL) prediction is essential for industrial predictive maintenance, yet many learning-based approaches rely on extensive feature engineering or large labeled datasets to train task-specific sequence models. In this work, we introduce a lightweight learning approach, in which w
中文介绍 提出基于时间序列基础模型嵌入的轻量学习方法,用于工业剩余使用寿命预测,减少特征工程和标注数据需求。
👍 28
Spatial reasoning from egocentric videos is inherently challenging because the observable evidence is constrained by the camera trajectory. Existing methods rely on single-turn inference, forcing models to resolve geometric ambiguity through semantic priors rather than verifiable evidence. We argue
中文介绍 提出跨视角回顾推理方法,改善从第一视角视频进行空间推理的能力,通过多轮推断解决几何歧义。
👍 55
General-purpose agents such as OpenClaw are increasingly used as autonomous tool users, but their coding ability is difficult to measure under SWE-bench: a generic agent does not by itself satisfy the clean Docker workspace, patch, and prediction contract required for scoring. We introduce Claw-SWE-
中文介绍 发布Claw-SWE-Bench基准,用于评估OpenClaw等通用智能体在编程任务中的能力,解决SWE-bench难以直接测试通用智能体的难题。
👍 68
Scientific progress depends on a repeated loop of exploration, experimentation, and abstraction. Researchers test candidate directions, interpret the evidence, and carry the resulting lessons into later attempts. We study how an AI agent can run this loop autonomously over long horizons. We introduc
中文介绍 研究如何让AI智能体通过假设树细化自主运行探索、实验和抽象的科学循环,实现长期自主研究。
👍 6
Reinforcement Learning (RL) with verifiable environments has emerged as a powerful approach for enhancing the reasoning capabilities of Large Language Models (LLMs). While prior research demonstrates that scaling environment quantity improves RL performance, existing manual or individual constructio
中文介绍 提出可验证环境作为乐高积木递归组合的方法,用于增强大语言模型推理能力,提升强化学习在不同环境中的泛化。
👍 56
Environments serve as interactive systems for large language model (LLM) based agents across diverse scenarios and play a crucial role in driving the continual evolution of model capabilities. Despite this importance, existing work lacks a systematic categorization and deep analysis. This paper syst
中文介绍 综述基于大语言模型的智能体环境工程,涵盖环境建模、合成、评估与应用,为智能体提供系统化分类与分析。
👍 17
Recent progress in foundation models has shifted toward agentic behavior involving multi-step reasoning and tool use. However, open-source efforts largely focus on text-dominant settings, leaving long-horizon multimodal tasks underexplored. This gap is evident in video tasks requiring sustained temp
中文介绍 提出InternVideo3,通过多模态上下文推理智能体化基础模型,在视频任务中实现长时多步推理和工具使用。
👍 6
Pretrained video generators are promising visual world models that exhibit emergent task-solving abilities; however, their reliance on detailed textual descriptions limits their direct use for planning and decision-making. Existing approaches either outsource this reasoning to language or vision-lan
👍 16
Reinforcement learning (RL) has become a key component in modern large language models, yet the rollout stage remains the key bottleneck in RL training pipelines. Although Multi-Token Prediction (MTP) offers a natural solution to accelerate rollouts through speculative decoding, many studies have ob
👍 1
Understanding and predicting how social beliefs evolve in response to events -- from policy changes to scientific breakthroughs -- remains a fundamental challenge in social science. Given LLMs' commonsense knowledge and social intelligence, we ask: Can LLMs model the dynamics of social beliefs follo
👍 15
Reinforcement learning with verifiable rewards (RLVR) is a promising approach for enhancing reasoning and agentic behavior in large language models. However, rollout-intensive policy optimization is often limited by insufficient reward contrast, arising when overly simple or complex prompts generate
👍 1
Large language models (LLMs) are on the rise for accelerating scientific discovery, most recently in advanced tasks such as generating valid scientific hypotheses. Yet in many discovery settings, the goal is not to identify a single best hypothesis since validation can be noisy and expensive, and sc
👍 18
Combinatorics is central to Olympiad-level mathematical problem solving, requiring deep discrete reasoning, creative constructions, and rigorous structural insight. Recent evidence suggests that even today's strongest frontier models remain uneven on Olympiad combinatorics, revealing a gap in creati
👍 8
We introduce Embodied-R1.5, a unified Embodied Foundation Model (EFM) that integrates comprehensive embodied reasoning capabilities, spanning embodied cognition, task planning, correction, and pointing, within a single architecture toward general physical intelligence. Leveraging three automated dat
👍 5
Diffusion models have consistently driven progress in text-to-image generation. However, it is challenging to attribute recent progress to specific modeling and data choices: state-of-the-art open-weight models provide limited ablations, and do not disclose their training data and full training deta
👍 1
As recommender systems transition toward agentic, multi-turn conversational interfaces, evaluation paradigms have struggled to keep pace. Current benchmarks often rely on "LLM-as-a-judge" evaluations, which introduce subjectivity, high costs and inconsistency. We present τ-Rec, a benchmark for agent
👍 1
Brain Magnetic Resonance Imaging (MRI) plays a central role in studying neurological development, aging, and diseases. One key application is Brain Age Prediction (BAP), which estimates an individual's biological brain age from MRI data. Effective BAP models require large, diverse, and age-balanced
👍 51
Reward models are central to text-to-image post-training, but visual preference is subjective and better represented as a distribution over rubric scores than as a deterministic scalar. Existing scalar, score-token, and pairwise reward models over-compress uncertainty and fine-grained score differen
👍 4
Agent skills provide a lightweight mechanism for extending general-purpose agents, but their open format exposes them to skill-poisoning attacks. A practically dangerous injection must stay invisible: if executing the payload derails the user's legitimate task, the resulting failure signal invites i
👍 4
Computer-use agents (CUAs) rely on visual observations of graphical user interfaces, where each screenshot is encoded into a large number of visual tokens. As interaction trajectories grow, the token cost increases rapidly, limiting the amount of history that can be incorporated under fixed context
👍 3
Many modern vision-language models (VLMs) build on autoregressive decoding of discrete tokens. While text-based output interfaces enable scalable pretraining and strong zero-shot generalization across diverse tasks, they are poorly suited for problems that require precise continuous outputs, such as
👍 1
Sparse attention reduces compute and memory bandwidth for long-context LLM inference. However, two key challenges remain: (1) KV cache capacity still grows with sequence length, and offloading to CPU memory introduces a PCIe transfer bottleneck; (2) the sparse selection step itself retains O(T^2) co
👍 3
Prior work has shown that instruction-tuned large language models (LLMs) are less well calibrated than their base pre-trained counterparts. However, little is known about the frequently used chat template's effect on the calibration of conversational LLMs. In this work, we investigate the mechanisms
👍 9
Autonomous LLM training is often framed as recipe search, which leaves the training harness largely static. This limitation sharpens in agentic RL, where shifting bottlenecks and scalar rewards mask diverse failure modes. We introduce EvoTrainer, an autonomous training framework that co-evolves LLM
👍 1
Post-training for reasoning models typically combines supervised fine-tuning with reinforcement learning from verifiable rewards, most commonly with GRPO. However, this algorithm suffers from sparse rewards, limited exploration, and mode collapse. Building upon recent works on self-distillation, we
@sairahul1 · 113.0K 粉丝 · 852.6K 阅 · 600 赞 · 79 转
Peter Steinberger, creator of OpenClaw, who now works with OpenAI. Yesterday he posted this: "You shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents."
中文介绍 核心观点:从手动 prompt 转向设计自动循环来驱动 AI 智能体,这是 2026 年 AI 工程师的关键技能。
@sairahul1 · 113.0K 粉丝 · 710.8K 阅 · 509 赞 · 97 转
How To Become An AI Engineer in 2026. Without a CS degree. Without a bootcamp. Without knowing what a transformer is today. Here's what nobody tells you: The companies hiring right now don't need
中文介绍 非 CS 背景如何成为 AI 工程师:不需要学位、不需要 bootcamp、不需要了解 Transformer。当前真正招人的公司看重的是实践能力而非传统学历背景。
@sairahul1 · 113.0K 粉丝 · 546.4K 阅 · 536 赞 · 94 转
In February 2026, a small OpenAI team shipped 1 million lines of production code. They didn't write a single line by hand. The AI agents wrote it. The humans designed the system that made the agents
中文介绍 2026 年 2 月,一个小型 OpenAI 团队用 AI 智能体完成了 100 万行生产代码,人类一行未写。关键在于设计让智能体高效工作的系统,而非手动编码本身。
@dhaber · 50.0K 粉丝 · 497.3K 阅 · 500 赞 · 57 转
One of the biggest ways that AI is transforming work (and also one of the most taboo subjects inside companies at the moment) is that most work discussions are being recorded now by default. This
中文介绍 AI 推动工作讨论默认被录音记录,成为公司内部敏感但普遍的趋势,影响沟通方式和隐私。
@maubaron · 16.9K 粉丝 · 233.8K 阅 · 506 赞 · 19 转
Our YouTube channel has 125k subscribers and we've never made or uploaded a single video ourselves. This is a completely automated system. It is this very same strategy that made us the first app
中文介绍 一个完全自动化的 YouTube 频道系统:频道拥有 12.5 万订阅者,从未自行制作或上传过任何视频。分享这套全自动化增长策略的完整指南。
@saranormous · 143.5K 粉丝 · 194.8K 阅 · 614 赞 · 40 转
The mid-2026 investor's version of AI psychosis is a despair that nothing is investable, that we should put all our money into Anthropic and Nvidia and go home. I have never felt it. I have been sure
中文介绍 投资者中出现的「AI 精神病」:认为无物可投,只想把钱全投 Anthropic 和 Nvidia。作者认为真正机会在别处。
@0x_rody · 1.7K 粉丝 · 193.2K 阅 · 513 赞 · 72 转
Claude writes your code, hands it over, and 3 tests are failing. You paste the errors back, it fixes one thing, breaks another, and you spend the evening as a messenger between Claude and your
中文介绍 在 Claude Code 中搭建自改进循环:不再手动粘贴错误信息,而是设计自动反馈回路让智能体自我纠错。
@intuitiveml · 6.4K 粉丝 · 171.3K 阅 · 524 赞 · 70 转
Most agent frameworks today assume a desktop. One user, one machine, one process. The agent runs while the laptop is open, writes to a local filesystem, holds API keys in environment variables, and
中文介绍 构建云端智能体基础设施的实战经验:传统框架假设桌面环境——单个用户、单台机器、单进程。迁移到云端需要重新设计文件系统、环境变量和 API 密钥管理等核心能力。
@MatthewBerman · 121.3K 粉丝 · 108.0K 阅 · 661 赞 · 26 转
tl;dr I've been testing Fable (Mythos) for the past week and it feels unlike any other model I've used. It feels, and is priced, like a next-generation model. It also has some real quirks. The Good
中文介绍 亲测 Fable(Mythos 系列)一周的感受:感觉像新一代模型,定价也符合新一代标准,但存在一些真实缺陷。优缺点均有详细评测。
@Kimi_Moonshot · 172.7K 粉丝 · 106.6K 阅 · 500 赞 · 61 转
Our predictions will probably be wrong. But the World Cup offers a rare, public, verifiable, and constantly evolving real-world setting. Through this initiative, we hope to place analysis,
中文介绍 Kimi 宣布预测 2026 年世界杯全部 104 场比赛,承认预测可能不准确。德国队被低估,世界杯作为公开可验证的实时场景是检验 AI 分析能力的理想舞台。
@0xCodez · 5.3K 粉丝 · 97.8K 阅 · 510 赞 · 80 转
Most developers still prompt their coding agents by hand. They type, they wait, they read the diff, they type again. 9out of 10 builders have never written a single loop that prompts the agent for
中文介绍 从手工 prompt 到循环设计工程师的 14 步路线图:90% 的开发者从未写过一条让智能体自动 prompt 智能体的循环。分享系统性成长路径。
@RLanceMartin · 30.4K 粉丝 · 84.7K 阅 · 660 赞 · 50 转
Mythos-class models like Claude Fable 5 have changed the way many of us work at Anthropic. I want to share two tips for getting the most out of this class of models. Self-correction loops There’s been
中文介绍 在 Anthropic 内部使用 Mythos 类模型(如 Claude Fable 5)的两个技巧:一是自我修正循环——让模型自动检查并修复自己的输出;二是针对这类新模型的特定设计模式。
@RayDalio · 2.2M 粉丝 · 72.6K 阅 · 515 赞 · 93 转
What is the best approach to being effectively intelligent now that human intelligence and artificial intelligence are merging? Because I have been building computerized investment decision-making
中文介绍 当人类智慧和人工智能融合时,如何变得「有效智能」?原则性思考与 AI 必须结合,作者基于其构建自动化投资决策系统的经验分享最佳应对方法。
@itsreallyvivek · 4.3K 粉丝 · 65.8K 阅 · 521 赞 · 28 转
A few days ago I wrote that getting into a frontier AI lab mostly comes down to two things: proven research and trench engineering. The more I think about it, the less these feel like separate skills.
中文介绍 进入前沿 AI 实验室的关键:经过验证的研究能力和扎实的工程能力。这两者并非独立技能,而是同一核心理念的不同侧面——解决棘手问题。
@nifinet · 10.2K 粉丝 · 60.9K 阅 · 522 赞 · 54 转
When a team says they want AI for growth, they usually mean a faster send. An agent that fires the same template at a longer list, day and night. That is the cheap half of the job, and it stopped
中文介绍 用 Claude Code 构建 AI GTM 大脑:团队通常只想更快发送模板,但那只是表面工作。真正有效的增长系统应该做更深层的分析与策略。
@dickiebush · 441.8K 粉丝 · 57.7K 阅 · 519 赞 · 45 转
Legendary marketer David Ogilvy generated over $864 million for his clients. He was a British advertiser known as "The Father of Advertising." And in 1982, Ogilvy sent this 1-page memo to his staff:
中文介绍 将传奇广告人大卫·奥格威的写作规则融入 Claude,打造出一位 AI 写作教练。奥格威 1982 年的一页备忘录曾为客户创造超 8.64 亿美元价值,这些规则现可被 AI 复用。
@FakeMaidenMaker · 4.9K 粉丝 · 123.1K 阅 · 7d 曝光 123.1K
AI 内容创作变现零基础入门指南(短文、长文、图片、短视频、长视频)
@RayDalio · 2.2M 粉丝 · 72.6K 阅 · 7d 曝光 72.6K
Principled Thinking and AI Need to Go Together
@0x_rody · 1.7K 粉丝 · 193.2K 阅 · 7d 曝光 193.2K
How to Build a Self-Improving Loop in Claude Code (Exact Setup Inside)
中文介绍 在 Claude Code 中搭建自改进循环:不再手动粘贴错误信息,而是设计自动反馈回路让智能体自我纠错。
@dhaber · 50.0K 粉丝 · 497.3K 阅 · 7d 曝光 497.3K
Everything Is Recorded Now
中文介绍 AI 推动工作讨论默认被录音记录,成为公司内部敏感但普遍的趋势,影响沟通方式和隐私。
@MANISH1027512 · 37.1K 粉丝 · 84.5K 阅 · 7d 曝光 84.5K
【教程】自动化风格探索器,请躺好,自动收图就完事了!
@saranormous · 143.5K 粉丝 · 194.8K 阅 · 7d 曝光 194.8K
The Untrainable
中文介绍 投资者中出现的「AI 精神病」:认为无物可投,只想把钱全投 Anthropic 和 Nvidia。作者认为真正机会在别处。
@quipnetwork · 141.1K 粉丝 · 32.8K 阅 · 7d 曝光 32.8K
What Is Quip Network? A Primer
@ENERGY · 884.7K 粉丝 · 64.6K 阅 · 7d 曝光 64.6K
DOE Releases Finalized Fusion Science and Technology Roadmap to Accelerate Commercial Fusion Power
@nifinet · 10.2K 粉丝 · 60.9K 阅 · 7d 曝光 60.9K
How to Build an AI GTM Brain using Claude Code
@RLanceMartin · 30.4K 粉丝 · 84.7K 阅 · 7d 曝光 84.7K
Designing loops with Fable 5
@MatthewBerman · 121.3K 粉丝 · 108.0K 阅 · 7d 曝光 108.0K
My Week with Fable
@GoogleAIStudio · 176.3K 粉丝 · 32.2K 阅 · 7d 曝光 32.2K
Fluid, natural voice translation with Gemini 3.5 Live Translate
@0xCodez · 5.3K 粉丝 · 97.8K 阅 · 7d 曝光 97.8K
Loop engineering: the 14-step roadmap from prompter to loop designer.
@Kimi_Moonshot · 172.7K 粉丝 · 106.6K 阅 · 7d 曝光 106.6K
Kimi to Predict All 104 World Cup Matches: Germany May Be Underestimated
@sairahul1 · 113.0K 粉丝 · 852.6K 阅 · 7d 曝光 852.6K
Loops: What Every AI Engineer Needs to Know in 2026
中文介绍 核心观点:从手动 prompt 转向设计自动循环来驱动 AI 智能体,这是 2026 年 AI 工程师的关键技能。
@xiaogaifun · 1.3K 粉丝 · 56.5K 阅 · 7d 曝光 56.5K
总结下我使用 Codex 的 8 个高频场景。
@polynoamial · 129.2K 粉丝 · 43.7K 阅 · 7d 曝光 43.7K
Implications of Large-Scale Test-Time Compute
@addyosmani · 395.5K 粉丝 · 42.7K 阅 · 7d 曝光 42.7K
Loop Engineering.
中文介绍 Cursor 联合创始人 Michael Truell 在讲座中探讨编程工具与开发者效率的问题解决思路。
中文介绍 Claude Fable 5 仅凭视觉能力成功通关《宝可梦 火红》,展示了多模态推理能力。
中文介绍 Claude Fable 5 在游戏中自主操作《异星工厂》,展现 AI 在复杂策略环境中的决策能力。
中文介绍 Claude Fable 5 模拟太阳系运行并成功预测日食,体现其在天体物理模拟方面的能力。
中文介绍 Claude Fable 5 将流体模拟与贝多芬音乐结合,展示 AI 在科学与艺术交叉领域的创造力。
中文介绍 Claude Fable 5 使用自己构建的 CAD 编辑器设计出可 3D 打印的模型,展现 AI 在工程设计中的自主能力。
中文介绍 Cursor 联合创始人 Michael Truell 在讲座中探讨编程工具与开发者效率的问题解决思路。
中文介绍 Claude Fable 5 仅凭视觉能力成功通关《宝可梦 火红》,展示了多模态推理能力。
中文介绍 Claude Fable 5 在游戏中自主操作《异星工厂》,展现 AI 在复杂策略环境中的决策能力。
中文介绍 Claude Fable 5 模拟太阳系运行并成功预测日食,体现其在天体物理模拟方面的能力。
中文介绍 Claude Fable 5 将流体模拟与贝多芬音乐结合,展示 AI 在科学与艺术交叉领域的创造力。
中文介绍 Claude Fable 5 使用自己构建的 CAD 编辑器设计出可 3D 打印的模型,展现 AI 在工程设计中的自主能力。
Google DeepMind is funding research into the potential dangers of situations where millions of different AI agents interact with each other online. According to Rohin Shah, who directs the company’s AGI safety and alignment research, the mass-market arrival of agents that can carry out tasks without
中文介绍 谷歌DeepMind资助研究数百万AI代理在线交互的潜在危险。该公司AGI安全与对齐研究负责人Rohin Shah表示,大规模面向市场的代理到来可能带来风险。
a quiet day lets us reflect on a great essay
中文介绍 该新闻摘要提及开放模型、模型实验室与代理实验室的对比,以及不可训练的内容,但缺乏具体细节。
OpenAI plans to acquire Ona to expand Codex with secure, persistent cloud environments, enabling long-running AI agents across enterprise workflows.
中文介绍 OpenAI计划收购Ona,以扩展Codex平台,提供安全、持久的云端环境,支持企业工作流中的长期运行的AI代理。
OpenAI supports the EU Code of Practice on AI content transparency, advancing provenance standards and tools to help people understand AI-generated content.
中文介绍 OpenAI支持欧盟关于AI内容透明度的行为准则,推动溯源标准和工具,帮助用户理解AI生成内容。
Discover how astrophysicist Chi-kwan Chan uses Codex to build black hole simulations, helping scientists study extreme physics and test Einstein’s theory of general relativity.
中文介绍 天体物理学家Chi-kwan Chan使用Codex构建黑洞模拟,帮助科学家研究极端物理并检验爱因斯坦广义相对论。
Learn how BBVA scaled ChatGPT Enterprise to 100,000 employees and partnered with OpenAI to accelerate AI-powered banking transformation worldwide.
中文介绍 BBVA将ChatGPT Enterprise推广至10万名员工,并与OpenAI合作加速全球银行业AI转型。
中文介绍 本文介绍PyTorch中的性能分析,从nn.Linear到融合MLP的实现,涵盖优化技术。
中文介绍 该摘要涵盖Dario Amodei政策、DiffusionGemma快速度文本生成、以及WhatsApp解封机器人等AI新闻。
Access OpenAI models and Codex through Oracle Cloud, using existing commitments to build and deploy AI with enterprise security and governance.
中文介绍 用户可通过Oracle Cloud的现有承诺访问OpenAI模型和Codex,在具备企业安全与治理的环境下构建和部署AI。
中文介绍 DeepMind发布DiffusionGemma,实现比传统模型快4倍的文本生成速度。
A new report from OpenAI details PRC-linked influence operations using AI to target U.S. tech debates, data center narratives, tariffs, and false claims about ChatGPT.
中文介绍 OpenAI报告详述与PRC关联的影响力行动,利用AI针对美国科技辩论、数据中心叙事、关税以及关于ChatGPT的虚假声明。
Google DeepMind and partners announce a $10M funding call for multi-agent safety research.
中文介绍 谷歌DeepMind与合作伙伴宣布1000万美元资金征集,用于多代理AI安全研究。
The much anticipated launch of the Mythos-class model was marred by some controversial usage policies
中文介绍 Anthropic发布Claude Fable 5模型,属于Mythos级别,但伴随有争议的使用政策。
See how LSEG uses OpenAI to scale trusted AI across its global business, accelerating insights, shrinking release cycles, and empowering 4,000 employees.
中文介绍 LSEG利用OpenAI在全球业务中扩展可信AI,加速洞察、缩短发布周期,并赋能4000名员工。
The evolution of agentic surfaces: building with Claude Managed Agents
中文介绍 文章探讨使用Claude托管代理构建代理表面(agentic surfaces)的演进过程。
4 回复 · 程序员 节点
5 回复 · 程序员 节点
10 回复 · 程序员 节点
9 回复 · Apple 节点
44 回复 · Apple 节点
12 回复 · Apple 节点
9 回复 · Apple 节点
28 回复 · Apple 节点
8 回复 · Apple 节点
12 回复 · Apple 节点
[!NOTE] Claude Fable 5 & Claude Mythos 5 System Card 如果想深挖,随便找个AI就能挖了 这边又开新坑,那一定是因为我要吐槽!!!! [!CHECK] 总结 图片总结 文配图总结 [!INFO] 一张图两个字 端妃 嘿呦喂!我就端着,就端着,就端着。 TL;DR 虽然Mythos 5和Fable 5都是新一代超模,性能超越了之前的Mythos Preview,乃们浅浅氪金的屁民们只配用Fable 5,我们还为了人类的福祉,给你们的Fable 5添加了极其先进的外审,传说中的 ASL-3 blocking classifiers,这样在标准用户提
本帖使用社区开源推广,符合推广要求。我申明并遵循社区要求的以下内容: 我的帖子已经打上 开源推广 标签: 是 我的开源项目完整开源,无未开源部分: 是 我的开源项目已链接认可 LINUX DO 社区: 是 我帖子内的项目介绍,AI生成、润色内容部分已截图发出: 是 以上选择我承诺是永久有效的,接受社区和佬友监督: 是 以下为项目介绍正文内容,AI生成、润色内容已使用截图方式发出 剧透 [!warning] 注:这里有一个三分钟使用极简教程,正式使用前推荐看看:【全开源免费!抢先体验属于个人的Easy Research!Obsidian开发者手把手教你三分钟速通NotEMD!-哔哩哔哩】 htt
6月10日,胖东来创始人于东来在企业内部分享会上公开反思经营管理问题。 他表示:“我们公司30年以来,基本上都是使用超越员工期望的薪酬,这样好的一点是能激发大家的热情,不好的一点,产生溺爱。大家其实不值这么多钱,但是给你了这么多钱,让你产生了一个认知上的偏差,偏差就导致自己认为自己就值这么多钱。好的一面是我在公司里面,因为我值这么多钱,我感觉我很骄傲很幸福。不好的一面,就是一旦走出胖东来就完蛋。这可能就是政策上的过度溺爱。” 于东来对胖东来薪酬制度也表达了看法,他表示:“公司里面我期望的结果是,我们薪资层面,无论是人力资源、财务部门或者营运部门,结合起来,我们给公司制定我们相应的比较科学的薪酬
纯自娱自乐,佬友轻喷 赛波 | CYBERBALL 赛波CYBERBALL — 2026世界杯看球伴侣 赛波(CYBERBALL)源于赛博世界波,将实时赛事、AI预测、全球新闻与数据可视化融合为足球时代的数字竞技场。48支参赛队实时赔率、赛程日历、球员档案、小组赛积分榜——一屏尽览。 53 个帖子 - 28 位参与者 阅读完整话题
转自飞书群 注意,内测群貌似不是每个max用户都能进的。 感谢佬友透露内测情况: TengMMVP: 1M上下文窗口,不支持多模态,支持2档思考强度。 44 个帖子 - 38 位参与者 阅读完整话题
今天很多号挂了,我们一直在找号找渠道,所以白天公益站不可用了一段时间 Codex公益站:https://new.sharedchat.cc/ https://rawchat.cn/ 声明:在资源紧张的时候我们还是会优先保障付费站的,希望各位佬友理解 qq群:758607042(有问题群内反馈回复会快点,群内有技术支持) 83 个帖子 - 77 位参与者 阅读完整话题
据传:OpenAI已测试内部代号为kepler和kindle的两个新检查点。kindle-alpha被曝已选为发布候选。 来源公众号 37 个帖子 - 35 位参与者 阅读完整话题
我虽然不高强度冲浪,但是快两年了,也没有到3级。 差浏览量,这个浏览量。我很很难达到啊。 我看到一位3级佬,居然是25年底注册的,已经三级了,这也太肝了吧。 有同样的佬嘛。 118 个帖子 - 103 位参与者 阅读完整话题
如题,第一次在L站过生日,来到L站的这几个月里跟着佬友们学到了很多有用的东西,愿佬友们天天开心,社区越来越好 132 个帖子 - 129 位参与者 阅读完整话题
各位干草铺的老友,请立刻停止YOLO模式,现在我不知道是我自己程序的问题还是奥特曼的问题,已经有佬友给我反馈了GPT回复还像被夺舍的情况,虽然我说过号商再薅我就放毒,但这次真不是我,请大家先放弃YOLO,稍后我会停一会儿公益站。YOLO,也就是bypass那个模式,我现在在外面,来不及写详细,大家能理解就好了,不要完全托管给GPT,能夺舍第一次就能有第二次,至少今天大家先放弃这种模式哈 34 个帖子 - 31 位参与者 阅读完整话题
11 points · 0 comments
84 points · 30 comments
96 points · 34 comments
23 points · 6 comments
I'm a network engineer that likes to think about the future of the internet and this is what I've built over many nights and weekends. One reputation graph over IPs, ASNs, domains, and entities, exposed as a JSON API. Try it: curl https://api.tunnelmind.ai/v1/check/
67 points · 32 comments
130 points · 117 comments
44 points · 3 comments
56 points · 6 comments
176 points · 111 comments
378 points · 184 comments
129 points · 342 comments
See also https://www.youtube.com/watch?v=4HjWHNLRMB0Related: The RCE that AMD won't fix - https://news.ycombinator.com/item?id=46906947 - Feb 2026 (173 comments)
165 points · 65 comments
295 points · 105 comments
89 points · 53 comments
389 points · 217 comments
Today, I’m proud to announce Homebrew 6.0.0. The most significant changes since 5.1.0 are a new tap trust security mechanism, the new faster, smaller, default internal Homebrew JSON API, sandboxing on Linux, better defaults informed by our user survey, many brew bundle improvements, improved perform
182 points · 16 comments
336 points · 236 comments
211 points · 45 comments
39 points · 5 comments
At Deno we've been using OpenClaw and other agents increasingly for addressing production problems in Deno Deploy - when a PagerDuty alert fires, the agent starts researching the cause and making fixes.In order to do this, the agent needs access to real production systems - postgres, kubernetes
https://social.treehouse.systems/@AsahiLinux/116719749555082...
10 points · 0 comments
https://twitter.com/dante_leoncini/status/206303501506830790...
108 points · 64 comments
324 points · 116 comments
104 points · 36 comments
88 points · 19 comments
今日AI领域热闹非凡:Anthropic的Claude Fable 5以惊人视频演示刷屏,展现多模态推理、3D设计、天体模拟等全能实力;Google DeepMind以DiffusionGemma实现4倍文本加速,同期却警示百万级AI代理交互风险,并联合投资千万美元研究多代理安全;OpenAI拟收购Ona以强化企业级Codex平台,并力推EU透明度准则。行业正从模型竞赛向智能体生态系统与安全治理并重转型。
Google DeepMind正式发布DiffusionGemma模型,采用扩散机制实现比传统自回归模型快4倍的文本生成速度。该成果打破了大语言模型推理速度瓶颈,为实时应用场景提供了新选择,标志着模型架构创新在效率维度的重大突破。
Anthropic发布Claude Fable 5模型,隶属于Mythos安全级别,同步引发争议的使用政策讨论。演示视频中,该模型仅凭视觉通关《宝可梦 火红》、自主运行《异星工厂》、模拟太阳系并预测日食,甚至用自建CAD编辑器设计可3D打印模型,展示出强大的多模态推理与自主执行能力。
研究提出ART(Art-based Reinforcement Training)微调方法,融合LoRA和软提示两种技术,实现多模态大语言模型的参数高效微调。该方法在降低计算开销的同时保持了任务性能,为多模态模型的定制化部署提供了更经济的路径。
提出InternVideo3框架,通过多模态上下文推理将基础模型智能体化,在长时视频理解任务中实现多步逻辑推理与工具使用。该项工作将视频模型从静态特征提取推进到动态推理与自主规划阶段。
OpenAI宣布计划收购代码沙箱公司Ona,旨在为Codex平台提供安全且持久的云端运行环境,使企业工作流中的长期运行AI代理得以高效执行。此举将补齐Codex在云端基础设施上的短板,加速AI代理从原型到生产的落地。
用户现可通过Oracle Cloud的现有承诺额度访问OpenAI模型和Codex,在符合企业安全与治理规范的云环境中构建和部署AI。这标志着OpenAI进一步拓展云合作生态,降低企业AI采用的合规门槛。
Anthropic官方博客详解如何利用Claude托管代理(Managed Agents)构建「代理表面」(agentic surfaces),推动AI从单轮问答进化到持续自主服务。文章系统阐述了代理设计模式与工程实践,为开发者提供了从对话界面到代理生态的演进路线。
Moonshot旗下Kimi宣布将预测2026年世界杯全部104场比赛,并承认结果可能不准确,尤其认为德国队被低估。此举将AI分析能力置于全球瞩目的实时竞技场中检验,为模型推理能力提供了公开可验证的极端测试场景。
MIT Tech Review报道,谷歌DeepMind AGI安全与对齐研究负责人Rohin Shah公开担忧:当数百万AI代理在互联网上大规模交互时,可能产生不可预见的系统性风险。DeepMind此前已联合多家机构投入1000万美元资助多代理安全研究,试图在风险爆发前建立防护框架。
谷歌DeepMind与合作伙伴共同宣布1000万美元资金征集计划,专项用于多代理AI安全研究。随着多个企业级代理平台的快速部署,代理间的隐性协同与对抗风险正成为行业焦点,该资助旨在推动风险评估与防护措施的学术创新。
西班牙银行BBVA宣布将ChatGPT Enterprise全面推广至10万名员工,并与OpenAI建立深度合作,加速全球银行业的AI转型。这一金融行业迄今最大规模的AI部署案例,标志着企业级AI应用从实验走向全组织覆盖。
OpenAI官方声明支持欧盟推动的AI内容透明度行为准则,承诺推广溯源标准与工具,帮助用户清晰识别AI生成内容。此举是AI行业在监管框架成熟前主动建立信任机制的关键步骤。
OpenAI发布详细报告,揭露与PRC关联的影响力行动利用AI技术在美国科技辩论中散布不实信息,内容涉及数据中心叙事、关税政策及针对ChatGPT的虚假声索。该报告为跨国AI滥用治理提供了新证据。
KOL sairahul1在推文中指出,2026年AI工程师的关键技能已从手动编写prompt转向设计自动循环(Loops)来驱动AI智能体。据案例披露,一个小型OpenAI团队曾用AI智能体完成100万行生产代码而人类一行未写,验证了循环工程的效率优势。
KOL 0x_rody分享在Claude Code中搭建自我改进循环的实操方案:不再手动粘贴错误信息,而是设计自动反馈回路让智能体自动识别并修正错误。该设置提升了代码调试效率,将开发者从低效的「复制-粘贴-等待」循环中解放。
Anthropic内部人士RLanceMartin分享使用Mythos类模型(如Claude Fable 5)的两个核心技巧:自我修正循环——让模型自动检查并修复自己的输出;以及针对新一代模型的行为模式设计特定的效率循环。这些技巧为开发者挖掘模型潜力提供了利器。
KOL dickiebush将传奇广告人大卫·奥格威的写作规则注入Claude,打造出一位AI写作教练。奥格威1982年的「一页备忘录」曾为客户创造超过8.64亿美元价值,如今这些经典原则通过AI得以规模化复用,展现了知识工程与AI结合的实践路径。