Claude & Anthropic
25Writing an agent skill
Most developers now use coding assistants. I do too—Copilot at work, Claude Code at home. As a...
VibePod adds Ollama/vLLM back end support for Claude Code and Codex
TTal – CLI that turns Claude Code into a multi-agent software factory
Show HN: Motif - Analyze your Cursor and Claude Code chat history
Mining Hidden Skills from Claude Code Session Logs with Semantic Knowledge Graphs
The Problem If you use Claude Code (or any LLM-based coding agent) daily, you've probably...
shareAI-lab/learn-claude-code: Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
jarrodwatts/claude-hud: A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
Why Your MCP Setup Keeps Timing Out in 60 Seconds (And How I Fixed It on Windows)
Every developer hits this wall: add more than 8 MCP servers to Claude Desktop (or Cursor, or VSCode)...
An industrial piping contractor on Claude Code [video]
MoMA – Claude Code orchestrator that won't implement until the plan scores 10/10
Forcing Claude Code to Run Outside-In TDD
Agentic & Tools
29[D] Breaking down MiroThinker H1's verification centric reasoning: why fewer interaction rounds produce better agent performance
I've been building agentic RAG systems at work and keep running into the same problem: agents that spiral into long, unproductive tool call loops. So when I saw the MiroThinker paper (arXiv: 2603.1572
TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis
AI coding agents can resolve real-world software issues, yet they frequently introduce regressions, breaking tests that previously passed. Current benchmarks focus almost exclusively on resolution rat
Model Context Protocol (MCP): The Tool Ecosystem for AI Agents
MCP is the open standard that lets AI agents connect to any external tool or data source. This guide covers how it works, the most useful servers available today, when to use it, and how to build your
Missing from the MCP debate: Who holds the keys when 50 agents access 50 APIs?
MCP vs CLI vs directly calling the API, the real question is the same: where do the credentials live when agents scale?
Show HN: Lukan – An open-source agentic workstation in a single Rust binary
Show HN: KYC Verification Process Automation Using Agentic AI
A Meta agentic AI sparked a security incident by acting without permission
obra/superpowers: An agentic skills framework & software development methodology that works.
The Agent Buddy System: When Prompt Engineering Isn't Enough
Most AI agents don’t reliably follow directions, and that’s one of the biggest reasons they never...
The OWASP MCP Top: A Security Framework for AI Agent Tool Integration
Show HN: Mimir – open-source code intelligence for AI agents (Go, MCP, SQLite)
MCP Tool Design: Why Your AI Agent Is Failing (And How to Fix It)
The Reports of MCP's Death Have Been Greatly Exaggerated Scroll through developer forums...
Models & Releases
12Tomorrow, @cursor_ai is going to release a super strong coding model. Possibly SOTA. Stay tuned. 🫡
@@mark_k
Multiverse Computing pushes its compressed AI models into the mainstream
After compressing models from major AI labs including OpenAI, Meta, DeepSeek and Mistral AI, Multiverse Computing has launched both an app that showcases the capabilities of its compressed models and
[R] Extreme Sudoku as a constraint-satisfaction benchmark, solved natively without tools or CoT or solution backtracking
I came across an interesting writeup from Pathway that I think is more interesting as a reasoning benchmark than as a puzzle result. They use “Sudoku Extreme”: about 250,000 very hard Sudoku instance
[D] Looking for arXiv endorsement (cs.LG) - PDE-based world model paper
Hi everyone, I'm a researcher looking for an arXiv endorsement for cs.LG to submit my first paper. I've been working for about a year on FluidWorld, a world model where the prediction engine is a rea
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference
Post training quantization is essential for deploying large language models (LLMs) on resource constrained hardware, yet state of the art methods enforce uniform bit widths across layers, yielding sub
Evaluation and Alignment: The Seminal Papers (new book + 50% code)
Hi r/MachineLearning, I'm Stjepan from Manning, and I'm posting on behalf of Manning with the mods' approval. We’ve just released a book that focuses on a part of ML systems that tends to get less a
[P] ColQwen3.5-v3 release + Case study
Happy to share the latest colqwen3.5-4.5B model in the series. ColQwen3.5-4.5B-v3 is #1 (avg) on the MTEB ViDoRe leaderboard (Pending release) at 75.67 mean, \~half the params, \~13x fewer embedding
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
Multimodal Large Language Models (MLLMs) have made impressive progress in connecting vision and language, but they still struggle with spatial understanding and viewpoint-aware reasoning. Recent effor
New Open Source Release
Nvidia greenboost: transparently extend GPU VRAM using system RAM/NVMe
Tags: ai, release, vibecoding
NER: Gemini vs Spacy vs Compromise
TLDR For NER, if accuracy is critical, go with an LLM — even an old one like...
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
Token pruning is essential for enhancing the computational efficiency of vision-language models (VLMs), particularly for video-based tasks where temporal redundancy is prevalent. Prior approaches typi
Research & Papers
2[D] ICML rejects papers of reviewers who used LLMs despite agreeing not to
According to multiple posts on Twitter/X ICML has rejected all paper of reviewers who used LLMs for their reviews even though they chose the review track with no LLM use. What are your thoughts on thi
[R] A Gradient Descent Misalignment — Causes Normalisation To Emerge
[**This paper**](https://arxiv.org/pdf/2512.22247), just accepted at ICLR's GRaM workshop, asks a simple question: >*Does gradient descent systematically take the wrong step in activation space*? It
Industry & General
7exciting things at cursor coming soon 🎻
@@kristaletz
Cursor @cursor_ai just bought the most expensive ad spot on Earth
@@TickerSymbolYOU
🇮🇳Indian-origin developer Aman Gottumukkala, founder of AI coding assistant Firebender, has joined Elon Musk's xAI to build advanced coding AI systems
@@EverythingAjay
Show HN: Will my flight have Starlink?
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
Actually, NVIDIA is the one paying us :)
@@jediahkatz
won 3rd 🥉 overall at @cursor_ai waterloo freeform hackathon w/ @chahana_reddy thanks for hosting @shayaan_azeem @demireren_ @byjustinwu
@@likhithakoppula