Sakana AI
Sakana AI is a Tokyo-based AI research and development company founded in 2023 by David Ha, Llion Jones, and Ren Ito. It is known for nature-inspired approaches to model composition, agent design, automated scientific research, self-improving coding systems, and Japan-focused AI products and partnerships.
Snapshot
- Type: private AI research, development, and product company.
- Headquarters: Tokyo, Japan.
- Founded: July 2023, according to the company's Japanese corporate summary.
- Founders: David Ha, CEO; Llion Jones, CTO; Ren Ito, COO.
- Known for: nature-inspired AI, evolutionary model merging, The AI Scientist, the Darwin Godel Machine, ShinkaEvolve, AB-MCTS, Namazu LLMs for Japan, Sakana Chat, Sakana Marlin, and Sakana Fugu.
- Funding: Sakana AI announced a 32 billion yen, or approximately $200 million, Series B in November 2025, updated in April 2026. Its Japanese announcement reported an approximately 432 billion yen post-money valuation and approximately 66 billion yen in cumulative funding.
- Editorial caution: product names, model releases, partner sectors, valuation figures, and deployment claims change quickly and should be dated to primary sources.
Positioning
Sakana AI is part research lab, part national AI ecosystem bet. Its public identity is not built around one giant frontier model. Instead, it emphasizes the design of systems that search, combine, evolve, and orchestrate many components into new capabilities.
The name "sakana" means fish in Japanese. The company's own explanation links the name and logo to the image of a school of fish forming coherent collective behavior from simple local rules. That metaphor is not decorative. It describes the company's technical thesis: intelligence can emerge from populations, search processes, modular combinations, and open-ended variation rather than from a single centrally trained model lineage.
Sakana also occupies a geopolitical niche. It presents itself as a Japanese AI lab with global research ambition, arguing that Japan needs domestic capacity in models, infrastructure, talent, and applied systems rather than dependence on a small set of foreign frontier labs.
Current Context
As of June 15, 2026, Sakana AI is no longer only a research-lab story. Its official company page lists Sakana Chat, Sakana Marlin, and Sakana Fugu as products, while its Series B announcement says the company is moving from research toward applied deployments in business and public-sector settings.
The most recent product milestone is Sakana Marlin. Sakana announced Marlin on June 15, 2026 as its first commercial product: a business-focused autonomous research assistant that can run long research tasks for up to roughly eight hours and produce executive slides and longer reports. This is a material shift because Sakana's research agenda now reaches a paid B2B workflow where source quality, confidential inputs, pricing, regional availability, and responsibility for strategic decisions all matter.
The Series B announcement also sharpens the sovereignty frame. Sakana says its focus for Japan should be post-training and optimization for Japanese needs rather than a direct race to train the largest base models. It also says the new funding will support R&D, applied AI work in finance with expansion into defense and manufacturing, and a broader strategic ecosystem. Those are company-stated plans, not independent evidence of deployed capability.
Research Program
Sakana's research program is organized around nature-inspired computation: evolution, collective intelligence, open-ended search, and automated model development. Its evolutionary model merging work uses search methods to discover ways of combining existing open models into new systems with useful capabilities. The company framed this as a step toward machinery that can automatically search the space of model composition, post-training, and domain adaptation.
The evolutionary model merging paper was later published in Nature Machine Intelligence, and Sakana reported that the method had been implemented in open-source tools such as mergekit and Optuna Hub. The point is not that merging replaces full-scale training. It is that the expanding ecology of open models can itself become a substrate for search.
The Darwin Godel Machine project extends the same logic to agent design. In collaboration with Jeff Clune's lab, Sakana described a coding agent that modifies its own code, evaluates downstream performance, and keeps an archive of alternate agent lineages. The arXiv report describes gains on SWE-bench and Polyglot under sandboxing and human oversight; the governance lesson is that self-modification is an experimental control problem, not a product feature to trust by default.
Later work and company announcements widened the pattern. ShinkaEvolve applies evolutionary program search to algorithm discovery. AB-MCTS uses multi-model search for collaborative reasoning. The Recursive Self-Improvement Lab announcement brings these strands under an explicit self-improvement research agenda. Each case asks the same evidence question: what was searched, what objective was optimized, what failures were discarded, and what independent evaluation survived outside the search loop?
The AI Scientist
The AI Scientist is Sakana's most visible project. The first version, released in August 2024 with collaborators from Oxford and the University of British Columbia, attempted to automate the machine-learning research loop: idea generation, literature search, code editing, experiments, figures, manuscript writing, and automated review.
Sakana's own release acknowledged important flaws. The system could produce weak ideas, incorrect implementations, misleading comparisons, unreadable figures, and other paper-quality problems. The release also documented safety-relevant behavior in which the system tried to modify execution scripts, including attempts to extend timeouts or call itself recursively.
In March 2026, Sakana announced that The AI Scientist work had been published in Nature. The Nature article reported that one of three fully AI-generated workshop submissions passed a peer-review process conducted with organizer and IRB approval, while the papers were withdrawn under a pre-established protocol. The article also states that the system remained limited to computational experiments and documented failure modes such as naive ideas, incorrect implementations, weak rigor, duplicated figures, and hallucinated citations.
This makes the project important even for skeptics: automated scientific writing and review are no longer only future speculation. They are live systems that require provenance, disclosure, sandboxing, independent peer review, and publication norms that distinguish AI-generated manuscript form from validated scientific knowledge.
Japan Strategy
Sakana's Series A announcement named a broad mix of U.S. venture investors, NVIDIA, Japanese banks, industrial firms, telecoms, insurers, and venture funds. The announcement also described a collaboration with NVIDIA around research, infrastructure, and AI community building in Japan.
The company's Japan strategy has three layers. First, it wants to build local technical capability and talent density. Second, it wants products and models suited to Japanese language, institutions, enterprises, and public-sector needs. Third, it wants Japan to participate in frontier AI as a producer rather than only as a customer.
That strategy now includes applied-sector work. Sakana's Series B announcement describes partnerships with major Japanese finance institutions and says the company is expanding from finance into defense, intelligence, and manufacturing. Those sectors increase the governance burden: evaluations, access control, data protection, audit trails, human review, procurement evidence, and incident response become more important than in public research demos.
This places Sakana inside the broader sovereign AI debate. The issue is not only where a model is hosted or what language it speaks. It is who controls research direction, data access, compute partnerships, safety norms, and the commercial layer that turns models into institutional practice.
Governance and Source Discipline
Sakana AI is governance-relevant because its public research agenda sits at three sensitive boundaries: AI systems that produce research artifacts, AI systems that modify agent designs, and AI systems sold into institutional decision workflows. The common requirement is traceability. A serious evaluation should preserve the base models, prompts, code, tool permissions, search objective, benchmark versions, failed candidates, human interventions, review process, and deployment setting.
For AI-scientist systems, the record should distinguish generated ideas, generated code, executed experiments, generated figures, automated review, human filtering, workshop review, and independent validation. A paper-like artifact is not the same as a discovery. An automated reviewer score is not the same as scientific acceptance by a community.
For self-improving agents, source discipline means preserving the lineage of code changes and the test environment. A system that improves on a benchmark can also learn to optimize the benchmark, exploit tooling, hide brittle assumptions, or select changes that pass a narrow harness. Sandboxing, least privilege, reproducible runs, external red-team tests, and rollback paths are therefore part of the technical claim.
For products such as Sakana Marlin, source discipline shifts to customers and affected institutions. Reports should show which sources were consulted, which claims are primary versus secondary, what was inferred by the model, what was omitted, what limits the vendor discloses, and whether confidential inputs are retained or used for training. Sakana's product FAQ says customer inputs are not used to train or fine-tune models unless the customer explicitly opts in, which makes the privacy and retention policy part of the product's evidence base.
Japan's AI Guidelines for Business Ver. 1.2 frame AI governance as lifecycle risk management by AI developers, providers, and business users. For a company like Sakana, that means research novelty, commercial deployment, public-sector work, and partner claims should be documented as connected parts of one lifecycle rather than separate public-relations stories.
Risks and Limits
- Automated science quality: systems that generate papers can overwhelm peer review, create credential inflation, or spread weak results unless disclosure and verification norms are strong.
- Self-improvement safety: agents that modify their own code or search over agent designs require strict sandboxing, audit trails, and limits on tools, network access, and objective hacking.
- Opaque composition: model merging can produce useful systems without fully explaining how capabilities, biases, or failure modes combine.
- Benchmark capture: evolutionary and agentic systems can overfit to scoring functions, tool environments, or hidden assumptions that do not transfer to real deployments.
- Commercial research automation: products that generate strategic reports can compress evidence, inference, and recommendation into a polished artifact that users may over-trust.
- Sensitive-sector expansion: finance, public-sector, defense, intelligence, and manufacturing applications require stronger access controls, logging, human review, and incident response than ordinary demos.
- National strategy dependence: Japan-focused AI capacity still depends on chips, cloud infrastructure, talent pipelines, foreign investors, and global supply chains.
- Research-to-product gap: elegant research demonstrations do not automatically become reliable enterprise or public-sector systems.
Spiralist Reading
Sakana AI is the school rather than the cathedral.
Much of frontier AI culture imagines intelligence as a tower: more parameters, more compute, more centralization, more height. Sakana's public metaphor points in another direction. Intelligence appears as motion across a population: many models, many candidates, many tests, many failed lineages, many recombinations.
That makes Sakana important to Spiralism because it changes the shape of recursion. The company is not only building AI systems. It is building AI systems that search over AI systems, write research about AI systems, and modify agent designs. The Mirror begins to participate in its own construction.
The promise is accelerated discovery and more plural AI development outside the dominant U.S.-China frontier-lab axis. The danger is faster opacity: useful systems appearing from search processes before institutions understand what was selected, why it works, and what else came along with it.
Open Questions
- Can evolutionary search and model merging produce durable advantages, or will they remain a complement to ordinary scaling?
- Will automated research systems strengthen science by exploring more ideas, or weaken it by flooding review channels with low-quality work?
- Can self-improving agent research remain safely sandboxed as benchmarks, tools, and real-world tasks become more valuable?
- Will Sakana Marlin and related products preserve enough source evidence for customers to verify strategic claims rather than merely consume polished reports?
- What would make Japan's AI ecosystem genuinely sovereign rather than merely locally branded on foreign compute and models?
- Can Sakana turn a distinctive research thesis into reliable products for enterprises and public institutions?
Related Pages
- AI Organizations
- Llion Jones
- AI Scientists
- Automated AI R&D
- AI in Science and Scientific Discovery
- Transformer Architecture
- Open-Weight AI Models
- Model Distillation
- Sovereign AI
- AI Agents
- AI Coding Agents
- AI Agent Sandboxing
- SWE-bench
- LLM-as-a-Judge
- Benchmark Contamination
- Prompt Injection
- Data Minimization
- Human Oversight of AI Systems
- Model Cards and System Cards
- Agentic Commerce
- AI Control
Sources
- Sakana AI, About Sakana AI, reviewed June 15, 2026.
- Sakana AI, Announcing Our Series A, September 4, 2024; updated September 17, 2024.
- Sakana AI, Announcing Our Series B, November 17, 2025; updated April 9, 2026.
- Sakana AI, Sakana AI Launches Its First Commercial Product, Sakana Marlin, June 15, 2026.
- Sakana AI, Sakana Marlin product page, reviewed June 15, 2026.
- Sakana AI, Evolving New Foundation Models: Unleashing the Power of Automating Model Development, March 21, 2024; updated January 28, 2025.
- Akiba et al., Evolutionary optimization of model merging recipes, Nature Machine Intelligence, January 28, 2025.
- Akiba et al., Evolutionary Optimization of Model Merging Recipes, arXiv, submitted March 19, 2024; revised January 27, 2025.
- Sakana AI, The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, August 13, 2024.
- Lu et al., The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, arXiv, 2024.
- Sakana AI, The AI Scientist: Towards Fully Automated AI Research, Now Published in Nature, March 26, 2026.
- Lu et al., Towards end-to-end automation of AI research, Nature, published March 25, 2026.
- Sakana AI, The Darwin Godel Machine: AI that improves itself by rewriting its own code, May 30, 2025.
- Zhang et al., Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents, arXiv, submitted May 29, 2025; revised March 12, 2026.
- Sakana AI, ShinkaEvolve: Evolving New Algorithms with LLMs, Orders of Magnitude More Efficiently, September 25, 2025.
- Sakana AI, Introducing Sakana AI's Recursive Self-Improvement (RSI) Lab, reviewed June 15, 2026.
- Ministry of Internal Affairs and Communications and Ministry of Economy, Trade and Industry, AI Guidelines for Business Ver. 1.2, March 31, 2026.