NousResearch/hermes-agent
Python · ★ 183,045 · 🍴 31,389 · 📈 1,821 stars today
The agent that grows with you
Python · ★ 183,045 · 🍴 31,389 · 📈 1,821 stars today
The agent that grows with you
Python · ★ 14,457 · 🍴 919 · 📈 2,503 stars today
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
TypeScript · ★ 32,641 · 🍴 4,189 · 📈 350 stars today
The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol
TypeScript · ★ 25,960 · 🍴 2,989 · 📈 1,142 stars today
An Open Source implementation of Notebook LM with more flexibility and features
JavaScript · ★ 208,311 · 🍴 31,958 · 📈 1,368 stars today
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Python · ★ 21,520 · 🍴 1,861 · 📈 127 stars today
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
Jupyter Notebook · ★ 9,394 · 🍴 601 · 📈 494 stars today
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.
Python · ★ 64,670 · 🍴 10,085 · 📈 324 stars today
A Simple and Universal Swarm Intelligence Engine, Predicting Anything. 简洁通用的群体智能引擎,预测万物
Python · ★ 28,178 · 🍴 2,391 · 📈 738 stars today
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
Python · ★ 80,513 · 🍴 10,631 · 📈 755 stars today
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
JavaScript · ★ 1,517 · 🍴 239 · 📈 82 stars today
OpenAI Plugins
Python · ★ 53,853 · 🍴 7,079 · 📈 228 stars today
The best-benchmarked open-source AI memory system. And it's free.
TypeScript · ★ 4,500 · 🍴 241 · 📈 126 stars today
The sandbox agent framework.
C# · ★ 1,592 · 🍴 180 · 📈 329 stars today
Windows companion suite for OpenClaw - System Tray app, Shared library, Node, and PowerToys Command Palette extension
Go · ★ 35,842 · 🍴 443 · 📈 208 stars today
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
★ 350,363 · 🍴 83,289 · 📈 757 stars today
A complete computer science study plan to become a software engineer.
Java · ★ 9,222 · 🍴 1,223 · 📈 310 stars today
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services
👍 43
Code language models need repository-level context to resolve imports, APIs, and project conventions. Existing methods inject this knowledge as long inputs (retrieved through RAG or dependency analysis) or through per-repository fine-tuning and LoRA -- costly at repository scale and brittle to evolv
👍 40
Role-playing language agents (RPLAs) should play characters whose values and behavior evolve as the story progresses, not maintain a fixed persona. Existing benchmarks measure factual recall at a given chapter, not whether responses align with the character's psychological trajectory, especially in
👍 36
Agents are widely deployed as assistants over documents, tools, and code. However, they typically act only on explicit user requests, which surface only the problems the user has noticed, while many other important problems coexist, hidden in plain sight, within the broader user context, with their
👍 32
Planning for real-world problems by language models often involves both world and user constraints, which may not be fully specified upfront and are progressively disclosed through interaction. However, existing benchmarks still underexplore adaptive planning under such progressively revealed dual c
👍 30
We introduce VideoKR, the first large-scale training corpus specifically designed to strengthen knowledge- and reasoning-intensive video understanding. It comprises 315K video reasoning examples over 145K newly collected, CC-licensed, expert-domain videos. We develop a human-in-the-loop, skill-orien
👍 23
Prior work has shown that large language models (LLMs) can translate unseen or low-resource languages by undergoing continued training or even by encoding a grammar book in their context. However, both methods typically overfit specific languages, with limited zero-shot transfer at test time. To tra
👍 22
While household robots are often evaluated based on task completion, everyday domestic environments involve value-conflicting situations in which robots are expected to choose actions that prioritize other values than task success, such as human autonomy, efficiency, or social appropriateness. Yet,
👍 17
We study the personal camera roll visual question answering setting. In this setting, a conversational AI assistant can access a user's personal camera roll and retrieve relevant photos to answer queries, ranging from simple factual questions (e.g., ``Name of the food I tried yesterday?'') to more o
👍 16
Developing unified video generation and editing models capable of interpreting interleaved multimodal inputs is a promising yet challenging frontier field. Existing unified frameworks predominantly rely on massive models (typically 13B parameters or more) and incorporate source video conditions for
👍 15
Experience internalization converts contextual experience from past interactions into reusable parametric capability, offering a promising path toward continual learning in large language models (LLMs). While prior work has predominantly focused on single-iteration transfer, we discover that under m
👍 12
Video generation models have made impressive strides in synthesizing visually compelling content, yet their outputs remain confined to the virtual domain. A natural question follows: how well do these models reflect the physical world when their generated videos leave the screen and enter reality? W
👍 10
Inference-time skill augmentation provides a lightweight way to improve data-analytic agents by injecting reusable procedural knowledge without updating model parameters. However, discovering effective skills for data analysis remains challenging, as reliable supervision is expensive and success cri
👍 7
Large language models can reproduce training data, but existing memorization evaluations mostly measure whether models can be forced to do so, rather than whether they do so under ordinary use. We introduce PropMe, a propensity-aware framework for memorization evaluation that contrasts prefix-based
👍 6
Selection is a core operation in interactive image editing. To be practical, a user should be able to specify and disambiguate the desired selection region through either text or click-based interactions, and the system should support selecting not only objects but also other criteria, such as mater
👍 6
Inference-time scaling has emerged as a critical avenue for enhancing Large Language Models' performance, yet real-world deployment is constrained by strict computational budgets. In this work, we formulate inference budget allocation as a global constrained optimization problem governed by economic
👍 5
Memory-augmented LLM agents tackle complex long-horizon tasks by recursively summarizing interaction trajectories into compact memory. However, existing approaches typically train these memory policies using outcome-based reinforcement learning, failing to localize where intermediate memory quality
👍 5
We propose world-language-action (WLA) models as a new class of embodied foundation models. WLA takes textual instructions, images, and robot states as inputs to jointly predict textual subtasks, subgoal images, and robot actions, conjoining the world modeling interface to learn from extensive egoce
👍 5
Video event prediction (VEP) requires models to infer unobserved future states from partial video evidence. Existing video MLLMs usually verbalize intermediate future reasoning in text space: once visual evidence is verbalized, fine-grained motion, geometry, and interaction cues can be lost, leading
👍 4
Temporal Grounding (TG) aims to localize video segments corresponding to a textual query. Prior research predominantly focuses on single-segment retrieval. Real-world scenarios, however, often require localizing multiple disjoint segments for a single query -- a setting we term One-to-Many Temporal
👍 4
Large language model (LLM) agents are increasingly applied to long-horizon tasks such as scientific discovery and machine learning engineering (MLE), where sustained self-evolution becomes a key capability. However, existing MLE agents suffer from inter-branch information isolation, memoryless searc
@DamiDefi · 96.5K 粉丝 · 2.3M 阅 · 584 赞 · 80 转
The number that stopped me was not the $2 trillion valuation. It was $791 million. That is what SpaceX made in net income in 2024. A profitable, growing aerospace company with a genuine moat in launch
@0xCodez · 3.3K 粉丝 · 637.2K 阅 · 510 赞 · 59 转
Most Claude Code users still write their workflows by hand. They chain prompts, copy outputs, paste them into the next prompt, fix what went wrong, repeat. 9 out of 10 builders haven’t tried Dynamic
@Saboo_Shubham_ · 116.2K 粉丝 · 263.3K 阅 · 517 赞 · 74 转
The frontend used to be a fixed thing. Designers drew it. Engineers built it. Users got what shipped. That's over. The interfaces shipping in 2026 are drawn partly by the agent itself, in real time,
@dkundel · 19.3K 粉丝 · 116.9K 阅 · 523 赞 · 40 转
We launched the goal mode (or /goal) as a way to help you have Codex drive towards a concrete outcome. When you set a goal Codex will continue to work until the goal is achieved, whether that takes
@drfeifei · 738.0K 粉丝 · 72.2K 阅 · 699 赞 · 144 转
“The world is everything that is the case.” — Ludwig Wittgenstein, Tractatus Logico-Philosophicus, 1921 The world is not made of words. In an earlier essay, we argued that spatial intelligence is AI’s
@sydneyrunkle · 7.5K 粉丝 · 69.5K 阅 · 511 赞 · 74 转
Building useful agents is largely about customization: connecting your agent to the right context, data, and environment(s) for the task at hand. At its core, an agent is a model calling tools in a
@jainarvind · 9.3K 粉丝 · 53.7K 阅 · 505 赞 · 68 转
Enterprise AI token spend is scaling quickly, especially as the technology shifts from simple chat assistants into coding agents, AI coworkers, and long-running workflows. These systems do far more
@IBuzovskyi · 1.2K 粉丝 · 50.1K 阅 · 500 赞 · 51 转
These 10 Hermes Agent hacks saved me 15+ hours every week - and they work for any workflow you run repeatedly. Content, software development, business operations, client management, research, sales.
@delba_oliveira · 74.0K 粉丝 · 37.4K 阅 · 533 赞 · 38 转
As we delegate more ambitious tasks to Claude, it becomes increasingly important that it can verify its own work. The more Claude can self-verify: the more independently it can work on long-running
@ericzakariasson · 67.9K 粉丝 · 37.3K 阅 · 507 赞 · 27 转
If you've ever watched an agent try to fix a bug, you've watched it guess. It reads the code, comes up with a theory, makes an edit, and hopes. Sometimes it's right. A lot of the time you get a fix
Your broken harness is actively making the model worse. Here's what I keep seeing after years of eyeballing trajectories, and what you need to fix.
On June 5, 404 Media reported that attackers had been using Meta’s AI customer support agent to steal Instagram accounts. Their approach was simple: They asked the agent to link the accounts to email addresses that they controlled, and the agent complied. One attacker broke into the dormant Obama Wh
a quiet day
We talk with the VendingBench authors on evaling Claudes from Haiku to Mythos, and how they build leading, and lasting, frontier evals from scratch.
Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture across the enterprise.
Most days in her chambers, Judge Maritza Braswell, a federal magistrate judge in Colorado, sifts through stacks of documents written by people without a lawyer. Many of them can’t afford to hire a lawyer, and others have cases too weak or too small to interest one. She reads each one carefully, mind
ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.
**NVIDIA** released **Nemotron 3 Ultra**, a fully open **550B MoE** model with **55B active parameters** and **1M context**, optimized for long-running agent tasks with up to **5x speedup** and **30% cost reduction**. It features hybrid Mamba/attention, LatentMoE, native MTP, and was pretrained on *
a quiet day.
An action plan for AI-powered biological resilience
VIX 恐慌指数
10Y 美债收益率 (%)
美元指数 DXY
S&P 500 ETF
Nasdaq 100 ETF
Apple
Microsoft
Nvidia
Alphabet
Tesla
Meta
Bitcoin
Ethereum
Solana
阿里巴巴 (BABA)
拼多多 (PDD)
京东 (JD)
腾讯控股 (0700.HK)
黄金期货
WTI 原油期货
美元 / 人民币
以上内容基于公开行情数据的技术指标计算与文本摘要,不构成任何投资建议。过去走势不代表未来表现,市场风险自负。
Dawa Sherpa, 57, was found alive on Thursday, nearly a week after he was last seen on the mountain. His wife says more could have been done to find him sooner.
The queen’s third cousin, she was a bridesmaid at the royal wedding in 1947, and witnessed firsthand pivotal moments in British history.
Russian attempt to repair tunnel area sparks safe-haven procedure for five other astronauts onboard.
All the big names are out at the French Open. The result is a very confusing, and very exciting, tournament.
Trump seeks to shore up support among rural voters hard-hit by tariffs, economic fallout of war with Iran.
At "Russian Davos," Putin ruled out meeting with Zelenskyy and promoted a new world economic order.
Russian President Vladimir Putin has turned down an offer for in-person talks with the Ukrainian President.
Trump hailed jobs growth before pivoting to Iran, saying negotiations with Tehran "seem to be going quite well".
Tenth seed Flavio Cobolli will take on German second seed Alexander Zverev in final after Matteo Arnaldi withdrawal.
A drone attack killed a young woman and injured 15 others near Khan Younis, reported the Wafa news agency.
Dawa Sherpa was spotted alive by a cleaning crew as he slid slowly down the world's tallest mountain and spoke to the BBC from hospital.
The airline Lufthansa said the cause of the accident at Frankfurt Airport was under investigation. The plane can weigh 279 tons at takeoff.
French activists who took part in a Gaza-bound foreign aid flotilla accuse Israeli forces of abuse and torture.
Families in Az-Zawayda, central Gaza, clear rubble after overnight Israeli strikes in Gaza
Activists who were detained with a flotilla trying to reach the Gaza Strip have said they were abused while in Israel’s custody. Israeli authorities have denied mistreating them.
Diane Swonk, KPMG Chief Economist, analyzed the current economic landscape, highlighting how improvements in the labor market and persistent service sector inflation are driving hawkish sentiment among Federal Reserve officials. She noted that bond market pricing reflects expectations of a 25 basis
Gene Sperling, President of Sperling Economic Strategies and former director of the National Economic Council, discussed the recent U.S. jobs report, highlighting that the headline numbers were stronger than expected and suggest a low likelihood of recession despite ongoing geopolitical and economic
General counsel resigned after revelations about relationship with disgraced financier but now will stay on as adviser
For years, Wall Street has benefited from one of the most reliable forces in modern markets: a retail-trader army willing to buy almost anything.
Marvell Technology Inc. and Flex Ltd. will join the S&P 500 in the latest quarterly rebalance, S&P Dow Jones Indices said Friday.
The Nasdaq saw its biggest daily fall since early 2025.
Friday’s pullback in US equities offers a chance to add exposure rather than a reason to retreat, with a clear path for the S&P 500 to reach 8,000 this year, according to John Flood, the head of Americas equities execution services at Goldman Sachs Group Inc.
Agreement comes ahead of a record-breaking initial public offering for Elon Musk’s rockets-to-AI conglomerate
Barry sits down with Chris Davis, Chairman and Portfolio Manager at Davis Funds. They discuss his approach to managing risk and the key elements changing the economy. Chris and Barry also discuss Chris's mentors including Charlie Munger, and how he settled into the family business. (Source: Bloomber
Rising expectations of a Federal Reserve rate increase send US bond yields rising sharply
Mega-IPO candidates including SpaceX are expected to face a long road to entry to the S&P 500 Index, after the company that makes the rules rejected a proposal that included relaxing the requirement that they be profitable.
Katherine Bordlethwait, Goldman Sachs asset management co head of equity client portfolio management, discussed the current equity market environment amid record highs and strong investor enthusiasm. She emphasized that continued earnings growth is critical for the market to sustain its momentum, no
3 回复 · 程序员 节点
5 回复 · 程序员 节点
5 回复 · 程序员 节点
3 回复 · 程序员 节点
4 回复 · Apple 节点
6 回复 · Apple 节点
11 回复 · Apple 节点
12 回复 · Apple 节点
29 回复 · Apple 节点
17 回复 · Apple 节点
本帖使用社区开源推广,符合推广要求。我申明并遵循社区要求的以下内容: 我的帖子已经打上 开源推广 标签: 是 我的开源项目完整开源,无未开源部分: 是 我的开源项目已链接认可 LINUX DO 社区: 是 我帖子内的项目介绍,AI生成、润色内容部分已截图发出: 是 以上选择我承诺是永久有效的,接受社区和佬友监督: 是 以下为项目介绍正文内容,AI生成、润色内容已使用截图方式发出 自己的第一个开源项目,全程vibe-coding。 在Opencode下已测试迭代N次,完整可用,搭配deepseek v4 flash,几乎0成本。 注意,适配调研任何主题,不仅仅能做行业研报哦!看我出的案例报告就知
千亿,指日可待 93 个帖子 - 85 位参与者 阅读完整话题
刚才看到有评论说群里发消息了,站长被气到了然后关站了,有在群里的佬说说发生什么了吗? 62 个帖子 - 51 位参与者 阅读完整话题
希望明天能看到账号解封的消息,那群逆天号商整天去官网搞举报,看看伤害多少无辜的正规充值的用户啊 38 个帖子 - 23 位参与者 阅读完整话题
应该有很多人跟我一样不喜欢桌面一堆乱七八糟的图标,也不想装 uTools、Listary 这种额外的启动器。所以俺来分享一下这个用了好多年的 windows 上的软件使用习惯 核心思路:把所有软件的快捷方式统一放到一个文件夹(比如 QuickWay),再把这个文件夹加进环境变量,之后就可以在任意界面按下 Win+R 输入快捷方式名称回车,就可以直接打开对应软件 1. 新建一个放快捷方式的文件夹 在任意位置新建一个文件夹,命名为 QuickWay(名字随意,以下统一用这个),建议放在不容易误删的位置,我是放在了个人文件夹下,即 C:\Users\TaiYang\QuickWay 2. 把所有软件
最近Minimax刚上线了M3版本模型,因为公司采购的就是Minimax家订阅,所以公司项目代码就只能用它来写,使用过程中感觉豆包味越来越重,甚至无视写在项目md里的红线警告,自己往数据库写了数据,事后我让它细化md文档,更是搞了一堆废话进去。 公司也是抠,好歹也选个Qwen或者DS…… 40 个帖子 - 27 位参与者 阅读完整话题
事情是这样的,有人跟我们反馈说回答里出现了我们的来源信息,因为我们是动态风控机制的,正常使用是不会出现来源信息的,只有多设备 多ip 多账号 环境跳来跳去,才会触发这个风控机制,会动态增加来源信息 然后私聊问了下,如下: 没想到居然拿我们的公益站去卖而且还卖的这么贵!0.2的倍率,太夸张了! 由于当事人说收到了很多人的消息,不想被打扰,所以码掉了 224 个帖子 - 203 位参与者 阅读完整话题
41个pro开号方式如下: 11个为美区没免税 套google play开的,220刀/月 之前封了7个 今早到现在为止封了25个,余9个 今天死的号为1个美区免税 2个土区 20个礼品卡 2个美区没免税 现在看来美区没免税能活久点 再说下2个多月做中转站,到现在的情况 总投入还差23000+ 没有回本,别说赚钱了 另外封了的号 googleplay去申请退款,没有一个给我退的,不给退 佬们参考下吧,9个号刷完 或者 刷死 结束 终于能睡个好觉了 65 个帖子 - 43 位参与者 阅读完整话题
中午偶然间刷脉脉看到 ATA 又有小作文爆火了,于是去小红书上找了找 7w 字原文 drive.google.com 置身钉内 14.34.50.pdf Google Drive file. 在 7w 字的原文中见到了很多相似的、过去反复发生的故事,唯一没想到的是,都过去至少 5-6 年了,逻辑还是和以往一模一样,甚至都不带变的 从内容上来说,是毫无疑问的好文章,共享一下 PS:为什么现在很多人看到这种精品文章,下意识都会去找 AI 蒸馏的简略版呢?并不是很理解 看了一下后续佬友们的回复,仔细思考了一下,用 AI 作为前置筛选器,筛选一下双方观点互通的文章也是挺有意义的,这样对创作者和阅读者都
(划水.gif) 像是动漫里走出来的世界 41 个帖子 - 31 位参与者 阅读完整话题