Show HN: I wrote a C++ ray tracer from scratch without AI
A developer built a C++ ray tracer from scratch without using AI assistance, demonstrating manual implementation of rendering algorithms. The project gained traction on Hacker News with 145 points, indicating community interest in traditional software engineering approaches.
Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model
A large language model promoted as Brazil's homegrown AI appears to be a merge of existing open-source models rather than originally developed, according to analysis on GitHub and discussion on Hacker News. This revelation raises questions about authenticity claims and proper attribution in the model development community.
AI OSS tool repo goes archived over night after raising $7.3M Seed
An open-source AI tool repository was unexpectedly archived overnight following a $7.3M seed funding round, raising questions about the company's direction and commitment to the open-source community. The sudden move sparked significant discussion on Hacker News with 251 upvotes and 162 comments, suggesting it touched a nerve about transparency and trust in AI OSS projects.
Show HN: Paca – Lightweight Jira alternative for human-AI collaboration
An engineer built Paca, a free, lightweight Jira alternative written in Go that enables human-AI agent collaboration for sprint planning and task assignment. The tool features customizable views, fields, and a WASM-based plugin architecture, with the creator committing to continuous maintenance and free availability.
An opinion piece argues that open-source AI development is critical to preventing monopolistic control of AI technology by large corporations, emphasizing transparency, accessibility, and community-driven innovation as essential counterweights to proprietary systems.
olmo-eval: An evaluation workbench for the model development loop
Allen Institute for AI (AI2) released olmo-eval, an open-source evaluation framework designed to streamline model testing and benchmarking throughout the development cycle. The workbench provides standardized tools for assessing LLM performance across multiple benchmarks, enabling faster iteration and more rigorous evaluation practices.
Apache Burr: Build reliable AI agents and applications
Apache released Burr, an open-source framework for building reliable AI agents and applications with built-in support for persistence, state management, and debugging. The project garnered significant community interest on Hacker News, suggesting growing demand for production-grade agent development tools.
The Open Source Community is backing OpenEnv for Agentic RL
The open source community is rallying behind OpenEnv, a framework designed to support reinforcement learning for AI agents. This backing signals growing momentum for standardized, community-driven tools in agentic AI development.
Anthropic's open-source framework for AI-powered vulnerability discovery
Anthropic released an open-source framework called Defending Code Reference Harness for discovering vulnerabilities in software using AI. The tool enables researchers and developers to detect security flaws automatically, advancing the field of AI-assisted code security analysis.
New Microsoft tool lets devs spin up AI behavior tests using text descriptions
Microsoft released Adaptive Spec-driven Scoring for Evaluation and Regression Testing (ASSERT), an open-source framework that enables developers to define AI model evaluations using text descriptions rather than manual code. This streamlines the process of testing AI behavior and detecting performance regressions across different models.
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
JetBrains unveiled Mellum2, a 12 billion parameter mixture-of-experts language model designed to advance open-source AI capabilities. This release represents JetBrains' entry into large language model development, positioning the company alongside other open-source model contributors.
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action
NVIDIA unveiled Cosmos 3, an open-source foundation model designed for physical AI reasoning and action, marking the company's first omni-model to handle multimodal inputs for robotics and autonomous systems. The model addresses the need for general-purpose AI systems that can understand and act on physical world tasks.
Odysseus is a self-hosted AI workspace available on GitHub, enabling users to run AI tools locally without cloud dependency. The project has gained traction in the open-source community, attracting 145 points and 69 comments on Hacker News, indicating interest in privacy-preserving, locally-controlled AI infrastructure.
Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA
Tiny-vLLM is an open-source high-performance LLM inference engine written in C++ and CUDA, designed for efficient model serving. The project demonstrates how to build a lean inference framework optimized for speed and resource efficiency.
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL
HuggingFace's TRL library introduces Delta Weight Sync, a technique that enables efficient distribution of trillion-parameter models by shipping only weight deltas instead of full model files to Hub buckets. This approach reduces storage and bandwidth requirements for training and deploying extremely large language models, making trillion-parameter scale more accessible to researchers and organizations.
Millions of AI agents imperiled by critical vulnerability in open source package
A critical vulnerability dubbed "BadHost" was discovered in Starlette, a widely-used open-source Python package with 325 million weekly downloads, potentially exposing millions of AI agents and applications to attack. The vulnerability threatens systems relying on Starlette for web framework functionality.
3D-printable humanoid legs let robotics experiments run wild
Hugging Face released a $2,500 open-source bipedal robot design with 3D-printable legs, enabling builders and researchers to conduct robotics experiments at lower cost. The project democratizes access to humanoid robotics hardware for the research and maker communities.
Hugging Face introduced the Ettin Reranker Family, a suite of ranking models designed to improve relevance and retrieval accuracy in search and RAG systems. The models leverage advanced reranking techniques to boost precision in information retrieval tasks.
We stopped AI bot spam in our GitHub repo using Git's –author flag
A GitHub repository implemented a solution using Git's --author flag to filter and block AI bot spam contributions. The approach demonstrates a practical defensive measure against automated spam in open-source projects, addressing a growing challenge as AI-generated contributions increase.
PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
PaddleOCR 3.5 introduces a Transformers backend for optical character recognition and document parsing tasks, enhancing model architecture flexibility and performance. This update enables developers to leverage transformer-based architectures for improved OCR accuracy and scalability in document processing workflows.
Show HN: Find the best local LLM for your hardware, ranked by benchmarks
A Hacker News user released WhichLLM, an open-source tool that helps users find the best local large language model for their hardware by ranking models against benchmark datasets. The project makes it easier for individuals to evaluate and select LLMs optimized for their specific computational constraints.
Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard
Clawdmeter, an open-source tool, converts Claude Code usage statistics into a compact desktop dashboard for developers monitoring their AI coding resource consumption.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
IBM released Granite Embedding Multilingual R2, an open-source multilingual embedding model under Apache 2.0 license with 32K context length and sub-100M parameters, achieving best-in-class retrieval quality for its size category. The model supports multiple languages and is designed for efficient deployment in retrieval-augmented generation (RAG) and semantic search applications.
A Claude Code and Codex Skill for Deliberate Skill Development
A developer created a Claude-based code skill framework designed for deliberate, structured practice in coding development. The project provides learning opportunities that leverage Claude's capabilities for interactive skill-building, addressing a gap in intentional programming education through automated feedback and guided exercises.
PS3 Emulator Devs Politely Ask That People Stop Flooding It with AI PRs
Developers of a PlayStation 3 emulator are requesting that contributors stop submitting AI-generated pull requests, which are flooding their repository with low-quality code. The issue highlights tension between open-source maintainers and the trend of using AI tools to generate code contributions without human review or understanding of project requirements.
A GitHub repository provides open-source code and guides for training large language models from scratch, enabling developers to understand and build LLMs without relying on proprietary frameworks. The project gained significant attention on Hacker News with 428 upvotes, reflecting community interest in democratizing LLM development.
DeepClaude – Claude Code agent loop with DeepSeek V4 Pro, 17x cheaper
DeepClaude implements Claude's agentic code loop using DeepSeek V4 Pro as a cost-effective alternative, achieving 17x cheaper API calls while maintaining comparable performance. The open-source project demonstrates that smaller, optimized models can replace expensive LLM agents for code generation and debugging tasks.
Mike is an open-source legal AI system designed to assist with legal document analysis and research. The project brings legal AI capabilities to a broader audience through open-source availability, reducing barriers to access for smaller firms and practitioners.
IBM released Granite 4.1, a series of open-source large language models with details on their architecture and training methodology. The release emphasizes transparency in model development while offering variants optimized for different enterprise and research applications.
Microsoft VibeVoice: Open-Source Frontier Voice AI
Microsoft released VibeVoice, an open-source voice AI model that expands the frontier of accessible speech synthesis and generation. The project demonstrates Microsoft's commitment to democratizing advanced voice technology by making it available to the broader developer community.
Symphony is an open-source specification for orchestrating Codex agents that integrates issue trackers into always-on agent systems to reduce context switching and boost engineering productivity. The spec enables automated task orchestration without requiring additional tooling overhead.