OpenAI News · 06/24 14:00
OpenAI与博通(Broadcom)联合发布了一款名为「Jalapeño」的定制AI芯片,专为大型语言模型(LLM)推理设计,旨在显著提升AI系统的性能、效率和规模。社区分析显示该芯片配备216GB HBM3E内存,9个月内完成设计,旨在控制更多AI堆栈并改善计算经济性。(多家报道)
推荐理由:这是OpenAI首次涉足定制AI硬件领域,预示着AI巨头将更深入掌控底层算力,对AI芯片市场和未来模型部署策略有深远影响。
OpenAI News · 06/24 14:00
OpenAI与博通(Broadcom)联合发布了一款名为「Jalapeño」的定制AI芯片,专为大型语言模型(LLM)推理设计,旨在显著提升AI系统的性能、效率和规模。社区分析显示该芯片配备216GB HBM3E内存,9个月内完成设计,旨在控制更多AI堆栈并改善计算经济性。(多家报道)
推荐理由:这是OpenAI首次涉足定制AI硬件领域,预示着AI巨头将更深入掌控底层算力,对AI芯片市场和未来模型部署策略有深远影响。
Hugging Face Blog · 06/25 00:00
Hugging Face博客介绍了如何利用NVIDIA NeMo AutoModel技术,显著加速Transformer模型的微调过程,从而提升开发效率和模型性能。这项合作旨在简化复杂的AI模型训练流程,让开发者能更高效地迭代和部署高性能模型。
推荐理由:对于需要高效训练大型Transformer模型的开发者而言,这提供了即插即用的加速方案,能显著提升开发效率和模型性能。
DeepMind Blog · 06/25 00:30
谷歌DeepMind宣布,已为Gemini 3.5 Flash模型引入了「计算机使用」功能,进一步增强了该模型与外部计算环境交互和执行任务的能力。这意味着模型可以更自主地操作计算机、执行复杂指令,标志着AI模型在自主性方面又迈出重要一步。(多家报道)
推荐理由:这项功能极大扩展了Gemini 3.5 Flash的应用边界,使模型能够执行更复杂的实际任务,值得关注其后续发展及应用潜力。
Claude Blog · 06/24 08:00
Claude官方博客宣布,在其「Claude Tag」功能中引入了全新的「智能体身份」访问模型。该模型旨在为团队提供更自主、更安全、更便捷的AI访问和协作方式,进一步推动AI在团队工作中的应用。新功能支持将Claude智能体嵌入到Slack等工作流中,提高组织内AI的可用性和管理性。(多家报道)
推荐理由:该功能是Anthropic在AI智能体落地企业场景的重要进展,为团队更安全、高效地使用AI提供了新范式。
GitHub Trending
OpenMontage是全球首个开源的代理式视频制作系统,通过12条生产线、52种工具和500多个AI代理技能,将AI编码助手转化为完整的视频制作工作室。它旨在自动化视频内容生成,解决传统视频制作的复杂性和耗时性问题。
推荐理由:作为首个开源代理式视频制作系统,它为内容创作者和开发者提供了极具潜力的AI驱动视频生产力工具,值得尝试。
Latent Space · 06/25 10:14
文章指出,AI领域正迎来「元连接器(Meta-Harness)」的时代,这意味着AI系统集成和管理框架将进入更高级阶段,超越了传统连接器工程的范畴。这旨在构建更强大、更统一的AI能力,解决系统间复杂协同与扩展性问题。
推荐理由:深入分析AI系统发展趋势,为企业和开发者提供前瞻性视角,思考未来AI架构的演进方向。
MIT Tech Review AI · 06/24 19:59
麻省理工科技评论指出,随着AI蓬勃发展,企业需要大规模数据支持,但现有网络数据常受阻或非结构化。文章探讨了为解决此挑战而出现的「AI网络数据基础设施层」的重要性,强调其对AI应用落地和效能提升的关键作用。
推荐理由:这篇文章揭示了AI发展面临的数据挑战及其解决方案,对于AI战略规划者和数据工程师具有重要参考价值。
GitHub Trending
ZhuLinsen/daily_stock_analysis是一个由LLM驱动的多市场股票智能分析系统。它整合多源行情数据、实时新闻,提供直观的决策看板,并支持自动推送关键信息,旨在为投资者和分析师提供便捷、智能的股票市场洞察,且支持零成本定时运行。
推荐理由:对于需要市场洞察的投资者和金融从业者,这是一个实用的开源AI工具,能大幅提升分析效率并辅助决策。
Hacker News · 06/25 10:51
这篇文章探讨了「开放权重模型」日益增长的低成本特性及其对AI生态系统的深远影响。它分析了开源模型在成本、灵活性和创新方面的优势,并预测了它们将如何改变AI技术普及和商业化的格局。
推荐理由:对于关注AI产业发展、特别是开源AI趋势的从业者,这篇文章提供了独到见解和对未来竞争格局的思考。
Riley Brown (YouTube) · 06/25 06:39
该短视频展示了在Slack等协作平台中集成强大AI智能体的应用,通过自动化任务、智能回复和信息整理,显著提升团队的工作效率。这体现了AI智能体如何直接融入日常工作流,为用户提供即时智能辅助。
推荐理由:对企业用户和开发者来说,这是一个直观了解AI智能体在实际办公场景中应用价值的案例,具有参考意义。
Python · ★ 20,430 · 🍴 2,298 · 📈 3,719 stars today
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
中文介绍 OpenMontage是首个开源的代理式视频制作系统,通过12条生产线、52种工具和500多个AI代理技能,将AI编码助手转化为完整的视频制作工作室。它旨在自动化视频内容生成,解决传统视频制作的复杂性和耗时性问题。内容创作者和开发者可利用该系统,通过AI驱动高效产出各类视频,极大提升工作流效率。
Python · ★ 48,995 · 🍴 43,167 · 📈 1,468 stars today
LLM 驱动的多市场股票智能分析系统:多源行情、实时新闻、决策看板与自动推送,支持零成本定时运行。 LLM-powered multi-market stock analysis system with multi-source market data, real-time news, decision dashboard, automated notifications, and cost-free scheduled runs.
中文介绍 ZhuLinsen/daily_stock_analysis是一个由LLM驱动的多市场股票智能分析系统。它整合多源行情数据、实时新闻,提供直观的决策看板,并支持自动推送关键信息,旨在为投资者和分析师提供便捷、智能的股票市场洞察。该系统特色在于支持零成本定时运行,有效解决了传统市场分析耗时耗力的问题,助力用户高效把握市场动态、辅助投资决策。
Swift · ★ 42,601 · 🍴 1,252 · 📈 1,838 stars today
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
中文介绍 Apple/container是一个专为Mac用户设计的工具,它能通过轻量级虚拟机创建并运行Linux容器。该项目采用Swift语言开发,并针对Apple silicon进行了优化,旨在提供高效、原生的容器化体验。它解决了在macOS上运行Linux容器的性能和兼容性挑战,使得开发者能够更便捷地在Mac设备上进行跨平台开发、测试和部署,尤其适合需要高性能容器环境的专业人士。
Python · ★ 2,459 · 🍴 620 · 📈 203 stars today
AI agent to evaluate and score resumes.
中文介绍 Interviewstreet/hiring-agent是一个利用AI代理技术,自动化评估和评分候选人简历的项目。它旨在解决招聘流程中简历筛选耗时耗力的问题,通过智能分析简历内容,提供客观的评分,从而帮助招聘人员和HR团队快速识别最符合职位要求的候选人。该工具能够显著提高招聘效率,减少人工筛选的偏差,使企业能够更有效地进行人才招募。
TypeScript · ★ 19,646 · 🍴 2,912 · 📈 692 stars today
Clone any website with one command using AI coding agents
中文介绍 JCodesMore/ai-website-cloner-template是一个利用AI编码代理实现网站克隆的工具。它允许用户通过一条命令,快速复制任何网站的结构和样式,旨在简化网页设计和前端开发流程。该项目解决了从零开始构建网站的耗时问题,为开发者、设计师和产品经理提供了快速获取网站模板、分析设计模式或进行原型开发的便利,极大提升了工作效率。
HTML · ★ 7,849 · 🍴 1,066 · 📈 277 stars today
A meta-skill that designs domain-specific agent teams, defines specialized agents, and generates the skills they use.
中文介绍 Revfactory/harness是一个“元技能”项目,专注于AI代理系统的设计与构建。它能够设计领域特定的代理团队,定义专业的AI代理,并自动生成这些代理所需的技能。该项目旨在简化复杂AI代理应用的开发流程,解决从头构建多代理协作系统的高门槛问题。AI系统开发者和研究员可利用Harness,高效地定制和部署高性能的AI代理解决方案,加速智能应用的创新与落地。
Dart · ★ 177,486 · 🍴 30,554 · 📈 73 stars today
Flutter makes it easy and fast to build beautiful apps for mobile and beyond
中文介绍 Flutter是一个由Google开发的开源UI工具包,旨在帮助开发者快速、轻松地为移动、Web、桌面等多个平台构建美观的原生编译应用。它通过一套代码库解决跨平台开发中的兼容性和效率问题,提供了丰富的预构建组件和响应式框架,显著缩短了开发周期并降低了维护成本。移动开发者和企业团队可利用Flutter高效交付高性能、高保真的多平台应用。
Kotlin · ★ 1,479 · 🍴 115 · 📈 41 stars today
Headunit App for displaying Android Auto
中文介绍 Andreknieriem/headunit-revived是一个专为显示Android Auto界面而设计的Headunit应用程序。它解决了部分车载信息娱乐系统或自定义设备无法原生支持Android Auto的问题,使用户能够在非官方兼容的硬件上体验Google的智能车载系统。汽车爱好者和改装车主可利用此应用,将旧款车辆或特定平板设备变身为Android Auto显示屏,极大提升驾驶的智能互联体验。
TypeScript · ★ 7,027 · 🍴 506 · 📈 331 stars today
Orca is the ADE for working with a fleet of parallel agents. Run any coding agent with your own subscription. Available on desktop and mobile.
中文介绍 Stablyai/orca是一个Agent开发环境(ADE),专注于管理并运行并行AI代理组成的“舰队”。它解决了多AI代理系统在开发、部署和扩展中的复杂性问题,允许用户通过自己的订阅运行任何编码代理。Orca在桌面和移动端均可用,为开发者和企业提供了一个统一、高效的平台,用于构建、测试和部署复杂的AI代理解决方案,加速智能应用的落地。
TypeScript · ★ 17,693 · 🍴 1,580 · 📈 619 stars today
A format specification for describing a visual identity to coding agents. DESIGN.md gives agents a persistent, structured understanding of a design system.
中文介绍 Google-labs-code/design.md是一个创新性的格式规范,旨在为AI编码代理提供一种描述视觉标识的方法。通过使用DESIGN.md,编码代理能够持久且结构化地理解一个设计系统,解决了AI在理解和实现设计意图时的挑战。这使得设计师和开发者能以标准化方式,将设计语言精确传达给AI,从而自动化和优化从设计到代码的转换流程,加速UI/UX开发效率。
Batchfile · ★ 30,054 · 🍴 2,344 · 📈 61 stars today
中文介绍 Flowseal/zapret-discord-youtube项目名称暗示其与Discord和YouTube平台的内容访问或管理有关,特别是涉及"zapret"(俄语,意为禁止或审查)的场景。尽管没有详细描述,它可能旨在解决用户在特定区域访问Discord或YouTube内容时遇到的限制问题,或提供相关的规避、管理工具。目标用户可能是需要应对内容审查或希望自由访问社交媒体和视频平台的用户群体。
Go · ★ 2,490 · 🍴 159 · 📈 110 stars today
git push no-mistakes
中文介绍 Kunchenguid/no-mistakes是一个旨在通过`git push no-mistakes`命令,帮助开发者避免在代码提交过程中犯错的项目。它可能通过集成Git hooks或预提交检查等机制,在代码推送前执行一系列验证,如代码规范、单元测试通过、敏感信息检查等。该工具解决了团队协作中因代码质量问题或疏忽而导致的潜在风险,确保了代码库的整洁和稳定性,提升了整体开发效率。
Python · ★ 202,387 · 🍴 36,166 · 📈 1,178 stars today
The agent that grows with you
中文介绍 NousResearch/hermes-agent是一个名为Hermes的AI代理项目,其核心理念是“与你共同成长”。它旨在解决传统AI代理缺乏长期学习和个性化适应能力的问题,通过某种机制使其能够根据用户的交互和反馈不断进化,更好地理解和满足个性化需求。该项目为开发者和研究员提供了一个探索可学习、自适应AI代理的框架,也为寻求长期智能助力的个人用户提供了可能性。
👍 9
Real-world photography requires capture-time guidance for both camera framing and subject pose. Yet existing aesthetic cropping benchmarks mainly evaluate post-hoc crop prediction and overlook subject-side recommendations, leaving the capture-time guidance capabilities of multimodal large language m
👍 7
Fine-grained visual reasoning requires multimodal large language models (MLLMs) to identify task-relevant visual evidence and ground their reasoning in local image regions. Existing agentic methods typically rely on reinforcement learning with verifiable rewards or supervised fine-tuning on large-sc
👍 5
While Video Virtual Try-on (VVT) has achieved remarkable progress in synthesizing realistic garment overlays on dynamic subjects, existing paradigms remains fundamentally constrained by a passive dependency on source camera trajectories, failing to accommodate the requisite interactive freedom for o
👍 5
We introduce Autodata, a general method that enables AI agents to act as data scientists who build high quality training and evaluation data. We show how to train (meta-optimize) such a data scientist agent, so that it learns to create even stronger data. We describe the overall formulation, and a s
👍 2
"Talk short. Drop grammar. Save token." This caveman style is widely promoted as a way to cut inference cost, but whether it actually saves anything depends on which channel (the user's prompt or the model's response) is being compressed. We present Cavewoman, a two-channel evaluation protocol that
👍 42
Memory for large language model (LLM) agents has rapidly evolved from simple retrieval-augmented mechanisms into a data management system that supports persistent information storage, retrieval, update, consolidation, and dynamic lifecycle governance throughout agent execution. Despite this evolutio
👍 2
Existing low-bit KV-cache quantizers often treat each cached key as a flat vector. Under RoPE, however, a key's contribution to a future attention logit decomposes into a position-dependent sum over two-dimensional frequency blocks. This makes key-cache quantization a block-wise bit-allocation probl
👍 11
Unified multi-modal large language models (MLLMs) have achieved strong text-to-image generation quality, but still struggle with structure-aware prompt following, where object counts, spatial relations, attribute bindings, and coarse layouts must be preserved. We attribute this limitation in part to
👍 4
WordArt (artistic text) features highly customized fonts, textures, and layouts, making WordArt-oriented scene TExt Recognition (WATER) substantially more challenging than general Scene Text Recognition (STR). Existing STR datasets and methods, typically built around regular scene text and fixed-tem
👍 25
We present Wan-Streamer, a native-streaming, end-to-end interactive foundation model designed from the ground up for real-time, low-latency, full-duplex audio-visual interaction. Wan-Streamer seamlessly models language, audio, and video as both input and output within a single Transformer, where the
👍 1
Long-term memory promises LLM agents that grow more capable across sessions, maintaining an accurate, evolving understanding of the user that interaction forms. In practice, however, this memory is evaluated mostly through downstream behavior, such as later answers, personalization quality, or task
👍 0
Large language models are increasingly deployed as agents that reason over documents rather than answer from parametric knowledge. We study archive-grounded reasoning: locating sparse evidence across a large, messy collection of workplace files, reconciling inconsistent terminology, units, and time
👍 14
Generating explorable 3D scenes from a single image requires strong generative priors and accurate geometric representations suitable for downstream use. Current video diffusion models offer high-quality generation and implicitly encode multi-view geometric structure in latent space. However, existi
👍 8
Experience-driven self-evolution is critical for large language model (LLM) agents to improve through open-world interaction. However, existing experience learning methods mostly rely on single-agent loops, where the same agent executes tasks, summarizes outcomes, and determines memory content. This
👍 6
The Hitchhiker's Guide to Agentic AI is a comprehensive practitioner's reference for building autonomous AI systems. The book covers the full stack from first principles to production deployment, organized around a central thesis: building great agentic systems requires understanding every layer of
👍 4
Scaling reinforcement learning for visual mathematical reasoning requires more than generating harder questions: as data volume grows, the reward labels themselves must remain reliable. Yet existing data pipelines scale supervision while trusting the labeller, and policy-side methods assume the unde
👍 15
What is an agent? What constitutes agency? With the rise of Large Language Model (LLM) systems marketed as ``coding agents'', ``AI co-scientists'', and other ``agentic" tools that promise to drive up productivity, and at the same time, ``existential" concerns such as AI escaping human control with d
👍 14
Modern text-to-image models excel in visual fidelity and prompt adherence. However, this strict adherence comes at the cost of diversity: generated samples tend to collapse into a single visual interpretation. Existing methods to improve diversity produce outputs driven by incidental variations rath
👍 2
Cross-Chart Retrieval-Augmented Generation (RAG) is critical for complex multi-modal analytical tasks in scientific, business, and political domains. However, existing benchmarks either focus on tables, which are well-structured and textualized, or generate cross-chart questions by simply extracting
👍 26
AI agents are driving a new software paradigm, with the ability to autonomously call tools, extract information, manage memory, and complete tasks that span applications and data sources. Most existing end-user operating systems, however, are designed for application-centric workflows and offer litt
👍 5
Chain-of-Thought (CoT) has become a standard method for improving reasoning capabilities in large language models (LLMs) by eliciting step-by-step thinking, but its effectiveness in multimodal tasks remains unclear. In this paper, we aim to systematically investigate the key question: What can multi
👍 0
Dynamic 3D Gaussian splatting faces a fundamental tension between motion consistency and visual fidelity. Deformation-based approaches preserve temporal correspondence but suffer from motion over-factorization, oversmoothing high-frequency dynamics. In contrast, 4D-primitive methods capture fine vis
👍 3
Generating a coherent multi-shot video requires structured cross-shot memory. Subject appearance, scene context, and speaker identity must persist across cuts. Existing approaches either train end-to-end over fixed-length sequences and cannot scale, generate shot-by-shot with memory banks that grow
👍 2
As LLM agents increasingly select tools autonomously, their choices among tools with different privileges become safety-relevant. However, prior tool-selection studies focus on safety-agnostic metadata preferences, leaving privilege-sensitive choices underexplored. To address this gap, we study over
👍 2
Attention-based Multiple Instance Learning aggregators in medical imaging are prone to attention concentration, producing overconfident and unstable predictions. We introduce QG-MIL, a gated transformer aggregator that addresses this through four synergistic architectural components: RMSNorm-based p
👍 2
Memory remains a critical bottleneck for long-horizon robotic manipulation, as standard Vision-Language-Action (VLA) policies often fail when task-relevant cues become occluded or unobservable over time. While existing memory-augmented methods utilize historical context, they either suffer from seve
👍 21
While Large Language Models (LLMs) have substantially advanced text-to-code synthesis, many real programming tasks specify intent through visual artifacts such as screenshots, charts, vector drawings, videos, and interactive states. These tasks require models to connect visual perception to executab
👍 1
Retrieving external knowledge is essential for solving real-world tasks, yet it remains challenging when the relationship between a query and its relevant knowledge involves implicit and complex reasoning beyond surface-level semantic or lexical matching (e.g., mathematical problems relying on the s
👍 21
Mental disorders are highly prevalent worldwide, but the shortage of psychiatrists and the inherent subjectivity of interview-based diagnosis create substantial barriers to timely and consistent mental-health assessment. Progress in AI-assisted psychiatric diagnosis is constrained by the absence of
👍 0
The problem of optimal sizing and power scheduling in microgrids subject to uncertainties is well known to the control community. Commonly, the optimal control problem is cast as a mixed-integer program to model the logical constraints arising in energy storage systems, and is then solved approximat
Your Figma canvas now has a timeline
Safe agent spend from the terminal
Group Scheduling Simplified
Describe your app and Blop tests it and repairs broken tests
Let AI agents run your next deal, fundraise or data room
Learn to build AI agents by actually building them
Generate UI prototypes, videos, and posters with AI
Build agents & apps on top of your context with one prompt.
Find the city where everyone's flights are cheapest
Wireshark for AI Agents: passive eBPF observability
@horizon_trade_x · 4.4K 粉丝 · 1.3M 阅 · 507 赞 · 59 转
Your backtest looked flawless. You went live. Two weeks later, the strategy was bleeding. Every quant has lived this. The answer is a loop: generate a strategy, test it, score it, feed the result
@AnatoliKopadze · 83.0K 粉丝 · 1.3M 阅 · 584 赞 · 70 转
AI has been in everyone's hands for years. Most people who use it every day still use it the slowest way there is: type a request, wait, fix it, ask again, all by hand. Not because the faster way is
@undefinedKi · 3.9K 粉丝 · 1.0M 阅 · 601 赞 · 78 转
Your best ideas are scattered across a dozen places right now. Notes apps. Browser tabs. Old chats with Claude that you closed and will never find again. Every time you sit down to work, you rebuild
@eng_khairallah1 · 69.3K 粉丝 · 678.7K 阅 · 506 赞 · 81 转
Most people are using Claude Opus 4.8 to answer questions. Save this :) A small group of people have it running businesses while they sleep. The difference is not the model. You both have access to
@spandan_madan · 1.1K 粉丝 · 626.9K 阅 · 516 赞 · 28 转
AI hardware is having a moment. Hyperscaler capex on AI data centres is on track to clear $690 billion in 2026, and private equity has followed in scale — Blackstone alone reports a $55B+ data-centre
@sairahul1 · 120.1K 粉丝 · 581.3K 阅 · 516 赞 · 110 转
Learning anything today is easy and confusing at the same time. Easy because AI can explain almost anything in seconds. Confusing because most people just ask random questions, get random answers, and
@djfarrelly · 3.8K 粉丝 · 344.7K 阅 · 501 赞 · 61 转
Everyone's asking "WTF is a loop?" Here's the question nobody's asking: what runs the loop? The AI discourse has converged on loops as a core primitive of agentic systems. Matt Van Horn (@mvanhorn)
@EngramLab · 1.2K 粉丝 · 255.7K 阅 · 537 赞 · 76 转
We’re Engram. We’re building AI that learns from you and deeply understands your work. Today’s AI models don’t understand what you do. Not really. Everything models know comes from their training –
@sairahul1 · 120.1K 粉丝 · 249.0K 阅 · 502 赞 · 90 转
There are 8 billion people on the planet. Only a fraction of developers understand how AI agents actually work. Not the demos. Not the hype. The real engineering underneath. Every week a new agent
@Oracle_Trade_ai · 39.9K 粉丝 · 197.8K 阅 · 2.8K 赞 · 580 转
In 2026, autonomous AI agents have become one of the most effective strategies on prediction markets. Over 30% of all activity on Polymarket now comes from algorithmic and AI-powered wallets. We
@posthog · 21.8K 粉丝 · 162.4K 阅 · 512 赞 · 36 转
When the creators of both OpenClaw and Claude Code speak, people listen. And recently Peter Steinberger and Boris Cherny have both been talking about the same concept: loops. Their argument? You
@OracAItrading · 31.8K 粉丝 · 141.6K 阅 · 2.8K 赞 · 576 转
In 2026, autonomous AI agents have become one of the most effective strategies on prediction markets. Over 30% of all activity on Polymarket now comes from algorithmic and AI-powered wallets. We
@GoogleAIStudio · 179.4K 粉丝 · 138.2K 阅 · 504 赞 · 42 转
Today we're announcing that the Interactions API has reached general availability and is now our primary API for interacting with Gemini models and agents. We launched its public beta in December
@Oractrading · 33.9K 粉丝 · 109.2K 阅 · 2.8K 赞 · 585 转
In 2026, autonomous AI agents have become one of the most effective strategies on prediction markets. Over 30% of all activity on Polymarket now comes from algorithmic and AI-powered wallets. We
@RohOnChain · 51.4K 粉丝 · 108.3K 阅 · 501 赞 · 65 转
I will break down exactly how to build the loops that run an entire quant trading system on their own. Let's get straight to it. Bookmark This - I'm Roan, a backend developer working on system
@EXM7777 · 118.9K 粉丝 · 107.7K 阅 · 509 赞 · 44 转
for a few days, we had something that felt like AGI... Fable 5 showed up, effectively unlimited inside the plans, and the ceiling on what you could build lifted overnight but then Anthropic killed it,
@mvanhorn · 35.2K 粉丝 · 102.4K 阅 · 510 赞 · 56 转
Earlier this month I wrote WTF Is a Loop? Peter Steinberger vs. Boris Cherny, which did 3.6M views on what a loop even is. This is the sequel, and it answers the next question: which loops do people
@omarsar0 · 308.0K 粉丝 · 90.2K 阅 · 504 赞 · 69 转
A claim has been circulating in AI coding circles: stop prompting your coding agents and start designing loops that prompt them for you. As with everything new, this stuff gets repeated often and
@Designarena · 13.9K 粉丝 · 80.4K 阅 · 518 赞 · 39 转
GLM 5.2 ranks 1st overall on Design Arena’s single-turn, HTML Web Design (Non-Agentic) evaluation, 5 places higher than its predecessor GLM-5.1. To do so, it beat Claude Fable 5, Opus 4.6, and Opus
@const_reborn · 29.7K 粉丝 · 79.8K 阅 · 503 赞 · 116 转
e_i \;\propto\; \underbrace{\rho_i \times \bar{p}_i}_{\text{linear (maximize)}} \times \underbrace{(1 - b_i)}_{\text{boolean gate}} Disclaimer: this upgrade only effects subnet owners and dynamic TAO
@GoogleAIStudio · 179.4K 粉丝 · 41.0K 阅 · 7d 曝光 179.2K
Introducing computer use in Gemini 3.5 Flash
@tslaming · 34.9K 粉丝 · 65.2K 阅 · 7d 曝光 65.2K
Tesla has patented a digital siren that allows car sensors to break the network rules in a crisis
@spandan_madan · 1.1K 粉丝 · 626.9K 阅 · 7d 曝光 626.9K
The efficiency gap: How do cells and GPUs compare when running the exact same algorithm?
@posthog · 21.8K 粉丝 · 162.4K 阅 · 7d 曝光 162.4K
Why we're bullish on loops
@EngramLab · 1.2K 粉丝 · 255.7K 阅 · 7d 曝光 255.7K
Introducing Engram: Scaling compute on your context
@zachlloydtweets · 11.2K 粉丝 · 39.2K 阅 · 7d 曝光 39.2K
Building a skill optimization loop
@amitiitbhu · 23.6K 粉丝 · 35.7K 阅 · 7d 曝光 35.7K
How does vLLM work?
@sairahul1 · 120.1K 粉丝 · 249.0K 阅 · 7d 曝光 249.0K
30 Core Agentic Engineering Concepts Every Developer Should Know
@dashen_wang · 24.0K 粉丝 · 189.6K 阅 · 7d 曝光 189.6K
构架师教程:Foundation Engineering
@const_reborn · 29.7K 粉丝 · 79.8K 阅 · 7d 曝光 79.8K
Subnets mine TAO; TAO mines Subnets
@philipkiely · 8.6K 粉丝 · 73.1K 阅 · 7d 曝光 73.1K
How we built the world’s fastest API for GLM-5.2
@akshay_pachaar · 278.2K 粉丝 · 44.0K 阅 · 7d 曝光 44.0K
Loop Engineering Clearly Explained
@GoogleAIStudio · 179.4K 粉丝 · 138.2K 阅 · 7d 曝光 179.2K
Interactions API: Our primary interface for Gemini models and agents
@expsgg · 151 粉丝 · 79.2K 阅 · 7d 曝光 79.2K
Taking CS2 in-game stats a little bit more serious
@RohOnChain · 51.4K 粉丝 · 108.3K 阅 · 7d 曝光 108.3K
How To Use Loop Engineering To Build A Self-Improving Quant Trading System
@xiaogaifun · 1.7K 粉丝 · 114.5K 阅 · 7d 曝光 114.5K
简直就是教科书级别的 AI 设计规范。
@Oracle_Trade_ai · 39.9K 粉丝 · 197.8K 阅 · 7d 曝光 197.8K
ORACLE: Official AI Agents Trade on Polymarket
@OracAItrading · 31.8K 粉丝 · 141.6K 阅 · 7d 曝光 141.6K
ORACLE: Official AI Agents Trade on Polymarket
@Oractrading · 33.9K 粉丝 · 109.2K 阅 · 7d 曝光 109.2K
ORACLE: Official AI Agents Trade on Polymarket
@wangdefou · 14.3K 粉丝 · 86.1K 阅 · 7d 曝光 86.1K
用一台 6 美元 VPS,把外网安全拿回自己手里
Move over, Harness Engineering, it is time for the harness of harnesses!
中文介绍 文章指出,AI领域正迎来「元连接器(Meta-Harness)」的时代,这意味着AI系统集成和管理框架将进入更高级阶段,超越了传统连接器工程的范畴,致力于构建更强大的AI能力。
In a rare double-interview, the Databricks technical leaders riff on what it will take for every company to build Agent Clouds
中文介绍 Databricks技术领导者Matei Zaharia和Reynold Xin在采访中强调,为了让企业能够构建「智能体云(Agent Clouds)」,前沿AI生态系统必须保持开放性。
中文介绍 Google DeepMind在其博客中宣布,已为Gemini 3.5 Flash模型引入了「计算机使用」功能,进一步增强了该模型与外部计算环境交互和执行任务的能力。
中文介绍 Hugging Face博客介绍了如何利用NVIDIA NeMo AutoModel技术,显著加速Transformer模型的微调过程,从而提升开发效率和模型性能。
AI is booming. New use cases are emerging each day. To capitalize on the technology’s potential, enterprises require data at scale. In many cases, though, the relevant information is blocked or unstructured, which limits its use by AI models. To understand this challenge, consider the foundation of
中文介绍 麻省理工科技评论指出,随着AI蓬勃发展,企业需要大规模数据支持,但现有网络数据常受阻或非结构化。文章探讨了为解决此挑战而出现的「AI网络数据基础设施层」的重要性。
Claude finally gets a Slackbot upgrade
中文介绍 Claude的Slackbot获得重大升级,引入了「Claude Tag」功能,使其智能体现在支持多玩家协作、主动式交互和持久化操作,进一步提升了其在团队沟通工具Slack中的应用能力。
OpenAI and Broadcom introduce Jalapeño, a custom AI chip built for LLM inference to improve performance, efficiency, and scale across AI systems.
中文介绍 OpenAI和博通(Broadcom)联合发布了一款名为「Jalapeño」的定制AI芯片。该芯片专为大型语言模型(LLM)推理设计,旨在显著提升AI系统的性能、效率和规模。
**OpenAI** announced **Jalapeño**, its first custom AI chip for LLM inference, built with **Broadcom**, aiming to control more of the AI stack and improve compute economics with a fast 9-month design cycle. Community analysis suggests Jalapeño features **216GB HBM3E**, **~7.1–7.4 TB/s bandwidth**, a
中文介绍 OpenAI与博通合作发布了其首款定制AI芯片「Jalapeño」,专为大型语言模型(LLM)推理设计。该芯片旨在提升AI计算经济性,并能在9个月内完成设计,社区分析显示其配备216GB HBM3E内存。
Agent identity in Claude Tag: a new access model for autonomous, team-wide AI
中文介绍 Claude官方博客宣布,在其「Claude Tag」功能中引入了全新的「智能体身份」访问模型。该模型旨在为团队提供更自主、更安全、更便捷的AI访问和协作方式,进一步推动AI在团队工作中的应用。
Building effective human-agent teams
中文介绍 Claude官方博客发布文章,探讨了如何构建高效的「人类-智能体团队」。文章深入分析了人类与AI智能体之间协作的最佳实践,旨在最大化团队生产力并优化工作流程。
中文介绍 Hugging Face博客宣布推出了「FFASR 排行榜」,旨在为真实世界场景下的自动语音识别(ASR)模型提供全面的基准测试和性能评估,促进ASR技术的发展。
中文介绍 TLDR AI总结了近期AI领域的多个重要更新,包括Claude Tag功能增强、Seedance 2.5版本发布以及Mistral OCR 4的推出,涵盖了智能体、多模态和光学字符识别等方向。
GPT-5 Pro helped solve a 3-year-old immunology mystery, offering insights into T cell behavior. The breakthrough could support cancer and autoimmune research.
中文介绍 OpenAI报道称,GPT-5 Pro成功协助免疫学家Derya Unutmaz解决了困扰其3年的免疫学难题,深入揭示了T细胞行为。这项突破有望为癌症和自身免疫疾病的研究提供重要支持。
OpenAI helps build shared standards for advanced AI, supporting evaluation frameworks, safety practices, and global cooperation through the Appia Foundation.
中文介绍 OpenAI正积极参与高级AI共享标准的建设,通过Appia基金会支持评估框架、安全实践和全球合作。此举旨在确保AI技术的负责任发展,并促进国际社会在AI治理方面的协调。
中文介绍 Hugging Face博客与IBM Research合作,介绍了如何利用「CUGA」框架和轻量级连接器构建真正的智能体(agentic)应用程序,并提供了约24个实际可运行的示例,以加速AI应用开发。
6 回复 · 程序员 节点
8 回复 · 程序员 节点
6 回复 · 程序员 节点
29 回复 · 程序员 节点
8 回复 · Apple 节点
6 回复 · Apple 节点
14 回复 · Apple 节点
6 回复 · Apple 节点
7 回复 · Apple 节点
42 回复 · Apple 节点
奶奶种的,香 28 个帖子 - 27 位参与者 阅读完整话题
这几天看到很多佬友都缺 GPT 模型,所以决定开放一波注册,希望能帮助到各位。邀请码价格降为 300LDC,老用户不用感觉背刺哦,你们的 2 万美元依然可以爽蹬,新用户注册只送 300 美元。 干草铺站点:https://gancaopu.com 邀请码购买地址 https://shop.aini8.com 重要声明: 禁止破限,严禁分发,发现就封哦 rpm 限制为 10,正常使用即可 gpt 模型仅保证在 codex 中正常使用,其他使用场景不限制但是技术方面不保证其他场景可用 目前仅保证 gpt5.5 模型稳定可用,且仅保证在 codex 场景稳定可用,其他场景不限制但是技术层面不保证可用
# 回答风格 ## 思维链 先判断,再解释,再给做法。 ## 语气 - 冷静、直接、克制 - 不写客套话、不写泛泛背景 - 不主动扩展无关知识 ## 歧义处理(最高优先级) **当问题存在关键歧义时,先用一句话追问,不要替用户猜测所有分支再逐一展开。不要直接回答如果是xx,就xx,而是先向用户确认,你问的是xx还是xx** 只有确认意图后才开始正式回答。 --- **先砍后补**——先用最短形式回答,只有信息确实不够时才加层。 排版规范是"允许使用的工具",不是"必须填满的模板"。 一条命令能答完的问题,就只给一条命令加一句说明,不套任何结构。 # 输出结构 | 问题类型 | 结构 | |--
本帖使用社区公益推广,符合推广要求。我申明并遵循社区要求的以下内容: 我的项目是免费使用的,无收费(变相收费、赞助)部分: 是 我的帖子已经打上 公益推广 标签: 是 我的项目属于个人项目,与公司或商业机构无关: 是 我的项目不存在QQ、TG等群组引流: 是 我的项目不存在非运营必要的网站引流: 是 我的项目不存在为他人推广、AFF: 是 我的项目无关联的商业项目: 是 我的站点存在登录,并已接入 LINUX DO Connect: 是 我帖子内的项目介绍,AI生成、润色内容部分已截图发出: 是 以上选择我承诺是永久有效的,接受社区和佬友监督: 是 以下为项目介绍正文内容,AI生成、润色内容已
Selected model is at capacity. Please try a different model. 27 个帖子 - 26 位参与者 阅读完整话题
35 个帖子 - 35 位参与者 阅读完整话题
OpenAI又有bug了?怎么土区开的Plus突然变成Pro 20x了? 102 个帖子 - 90 位参与者 阅读完整话题
我在联合早报看到的消息, 他也是援引路透和证券时报的消息. 三六零发布“中国版Mythos”图龙锋 周鸿祎称漏洞发现能力正成为新的战略能力 周鸿祎星期三(6月24日)在第十四届互联网安全大会上,发布了两款人工智能模型:漏洞自动化挖掘智能体“图龙锋”与自动化防御系统“仪天阵”。 周鸿祎介绍,“图龙锋”能自动发现软件漏洞,而“仪天阵”则负责实现网络防御和事件响应的自动化,其中“图龙锋”被视为中国版Mythos。 凭借在敏感系统中挖掘漏洞的强大能力,今年4月发布的Mythos已在华盛顿及多国政府高层,乃至整个网络安全行业引发高度警觉。而360此次发布的新产品,标志着中国企业迄今为止对该模型做出的最高
Any站的质量、纯度无可置疑,用过的都说好。出问题恢复也嘎嘎快,但是又从来看不到Any佬冒泡。而且来了L站就可以享受Any佬的福利,这不妥妥的L最大福利嘛!嘎嘎好,嘎嘎给力的啊 158 个帖子 - 106 位参与者 阅读完整话题
本帖使用社区公益推广,符合推广要求。我申明并遵循社区要求的以下内容: 我的项目是免费使用的,无收费(变相收费、赞助)部分: 是 我的帖子已经打上 公益推广 标签: 是 我的项目属于个人项目,与公司或商业机构无关: 是 我的项目不存在QQ、TG等群组引流: 是 我的项目不存在非运营必要的网站引流: 是 我的项目不存在为他人推广、AFF: 是 我的项目无关联的商业项目: 是 我的站点存在登录,并已接入 LINUX DO Connect: 是 我帖子内的项目介绍,AI生成、润色内容部分已截图发出: 是 以上选择我承诺是永久有效的,接受社区和佬友监督: 是 以下为项目介绍正文内容,AI生成、润色内容已
75 points · 16 comments
25 points · 4 comments
27 points · 10 comments
112 points · 42 comments
77 points · 29 comments
23 points · 5 comments
134 points · 60 comments
43 points · 11 comments
131 points · 75 comments
182 points · 62 comments
311 points · 535 comments
Announcement: https://openai.com/index/openai-broadcom-jalapeno-inference-...https://decrypt.co/371971/openai-broadcom-jalapeno-first-cus...https://www.cnn.com/2026/06/24/tech/openai-broadcom-jalapeno...
205 points · 129 comments
218 points · 124 comments
375 points · 60 comments
208 points · 121 comments
Colin here, creator of Nub. I’ve had the general shape of this in mind for years. Nub runs your code with stock `node`, augmented with a `--require` preload hook[0] that adds a transpiler (oxc-powered, packaged as a Node-API add-on), registers a module resolution hook[1], and injects polyfills as ne
288 points · 183 comments
https://investor.qualcomm.com/news-events/press-releases/new...https://www.modular.com/blog/qualcomm-to-acquire-modularhttps://x.com/clattner_llvm/status/2069769232477192354, https://xcancel.com/clattner_llvm/s
368 points · 40 comments
23 points · 1 comments
21 points · 2 comments
200 points · 111 comments
38 points · 10 comments
4 points · 1 comments
125 points · 104 comments
88 points · 34 comments
310 points · 59 comments
26 points · 23 comments
Hello HN,I would like to share with you linkedrecords.com - an open source backend as a service I'm working on since some time now. You can think of it as an firebase/convex alternative with an interesting twist.In 2018 I needed to write large software requirements/architecture docume
今日AI领域亮点纷呈,硬件创新、智能体技术迭代与行业生态建设并驾齐驱。OpenAI携手博通推出首款LLM推理定制芯片「Jalapeño」,预示着AI算力经济性迈入新阶段,而Google DeepMind则通过为Gemini 3.5 Flash引入“计算机使用”功能,显著拓宽了大模型的应用边界。在软件层面,开源社区涌现出代理式视频制作、AI招聘智能体、Mac版Linux容器等实用工具,提升了内容创作、HR和开发者效率。同时,Claude智能体身份访问模型的升级,以及围绕AI代理系统设计理念的探讨,强调了人机协作和Agent架构的重要性。行业层面,OpenAI呼吁构建AI共享标准,Databricks强调前沿AI生态的开放性,以及MIT对AI网络数据基础设施层的关注,共同勾勒出AI技术在硬件、软件与治理层面协同演进的宏大图景。
OpenAI和博通(Broadcom)联合发布了一款名为「Jalapeño」的定制AI芯片。该芯片专为大型语言模型(LLM)推理设计,旨在显著提升AI系统的性能、效率和规模,以应对日益增长的AI模型算力需求。社区分析指出该芯片配备216GB HBM3E内存,其仅9个月的设计周期也展现了行业在AI硬件创新上的加速,预示着AI推理能力将迎来显著提升,降低了AI应用的部署成本。
Google DeepMind在其博客中宣布,已为Gemini 3.5 Flash模型引入了「计算机使用」功能,进一步增强了该模型与外部计算环境交互和执行任务的能力。这一更新使Gemini 3.5 Flash不再局限于文本生成,能够处理更多元、更实际的应用场景,如数据分析、代码执行或网络信息检索。此举旨在提升模型的实用性和自动化水平,使其能更好地作为智能助手,辅助用户完成各类计算密集型工作。
OpenMontage是首个开源的代理式视频制作系统,集成了12条生产线、52种工具及500多个AI代理技能,将AI编码助手转化为全面的视频制作工作室。该系统致力于自动化视频内容生成,解决传统制作流程的复杂性和耗时问题,使内容创作者和开发者能够通过AI驱动高效产出各类视频,从而大幅提升工作流效率和创新能力,为视频行业带来变革。
Apple开源了其为Mac用户设计的轻量级虚拟机工具Apple/container,旨在高效创建并运行Linux容器。该项目采用Swift语言开发,并针对Apple silicon进行优化,提供了原生的容器化体验,有效解决了在macOS上运行Linux容器的性能和兼容性难题。这使得开发者能在Mac设备上更便捷地进行跨平台开发、测试和部署,尤其适合需要高性能容器环境的专业人士,显著提升了开发效率。
Claude官方博客宣布,其「Claude Tag」功能引入了全新的「智能体身份」访问模型。此模型旨在为团队提供更自主、更安全、更便捷的AI访问和协作方式,允许智能体拥有独立的身份和权限,从而更好地融入团队工作流程。这一创新不仅提升了AI在企业协作中的应用深度,也为智能体如何管理和参与多用户环境提供了新的范式,进一步推动AI在团队生产力中的核心作用。
Interviewstreet/hiring-agent是一个利用AI代理技术自动化评估和评分候选人简历的开源项目。该工具旨在解决招聘流程中简历筛选耗时耗力的问题,通过智能分析简历内容,提供客观评分,从而帮助招聘人员和HR团队快速识别最符合职位要求的候选人。它显著提高了招聘效率,减少了人工筛选偏差,使企业能够更有效地进行人才招募,优化了人才获取策略。
OpenAI正积极参与高级AI共享标准的建设,通过Appia基金会支持评估框架、安全实践和全球合作。此举旨在确保AI技术的负责任发展,并促进国际社会在AI治理方面的协调。OpenAI的参与强调了在AI飞速发展背景下,建立全球统一的安全和伦理标准的重要性,以应对潜在风险,并为AI技术的长期健康发展奠定基础,从而推动AI的普惠和可持续发展。
Databricks技术领导者Matei Zaharia和Reynold Xin强调,为使企业能够构建「智能体云(Agent Clouds)」,前沿AI生态系统必须保持开放性。他们认为,开放标准和接口是企业利用AI智能体进行创新和规模化部署的关键。这一观点挑战了封闭AI系统的趋势,主张通过开放协作来加速AI智能体的普及和应用,尤其是在复杂的企业级场景中,以释放其巨大潜力。
麻省理工科技评论指出,随着AI的蓬勃发展,企业对大规模数据的需求日益增长,但现有网络数据常面临受阻或非结构化的挑战。文章探讨了为解决此问题而出现的「AI网络数据基础设施层」的重要性。这一层旨在提供高效、合规的数据获取、处理和结构化能力,为AI模型提供高质量的训练和推理数据,是推动AI应用广泛落地、确保数据可用性和可信度的关键。
Latent Space的文章指出,AI领域正迈入「元连接器(Meta-Harness)」时代,标志着AI系统集成和管理框架进入了更高级阶段。这超越了传统连接器工程,致力于构建能编排复杂AI代理和工具的强大管理层。这一趋势强调了将AI能力有机整合为可协作、可扩展系统的必要性,预示着未来AI应用将更加模块化、智能化,且易于部署和管理,进一步提升AI系统的整体效能。
OpenAI报道称,GPT-5 Pro成功协助免疫学家Derya Unutmaz解决了困扰其3年的免疫学难题,深入揭示了T细胞行为的复杂性。此项突破不仅展示了GPT-5在高级科学研究中的强大分析和洞察能力,也为癌症和自身免疫疾病的治疗研究提供了重要支持。这凸显了大型语言模型在加速科学发现、克服传统研究瓶颈方面的巨大潜力,为AI在生命科学领域的应用树立了典范。
Revfactory/harness是一个“元技能”项目,专注于AI代理系统的设计与构建。它能设计领域特定的代理团队,定义专业AI代理,并自动生成这些代理所需的技能。该项目旨在简化复杂AI代理应用的开发流程,解决从头构建多代理协作系统的高门槛问题。AI系统开发者和研究员可利用Harness高效定制和部署高性能AI代理解决方案,加速智能应用的创新与落地,推动Agent技术的实用化。
Google-labs-code/design.md是一个创新性的格式规范,旨在为AI编码代理提供一种描述视觉标识的方法。通过使用DESIGN.md,编码代理能够持久且结构化地理解一个设计系统,解决了AI在理解和实现设计意图时的挑战。这使得设计师和开发者能以标准化方式,将设计语言精确传达给AI,从而自动化和优化从设计到代码的转换流程,加速UI/UX开发效率,弥合设计与开发之间的鸿沟。
Hugging Face与IBM Research合作,介绍了如何利用「CUGA」框架和轻量级连接器构建真正的智能体(agentic)应用程序。文章提供了约24个实际可运行的示例,旨在加速AI应用开发。CUGA框架通过简化智能体的集成和部署,降低了开发门槛,使开发者能够更便捷地设计和实现复杂的、具备自主决策能力的AI应用,从而推动智能体技术在实际场景中的广泛应用和创新。