H-Optimus

Discover complex signatures, find biomarkers, predict gene mutations, determine protein expression and more from routine H&E slides

Get started

A Pathology Foundation Model to Power Your Research

H-Optimus is built on a dataset and architecture that make it an ideal backbone for any use case that relies on digital pathology.

1 Million+ Slides Trained

Trained on one of the largest pathology datasets available

1.1 Billion Parameters

Built on a Vision Transformer (ViT-g/14) architecture

800,000+ Patients

Ensuring diverse and robust real-world data representation

Over 1 Million Downloads

Trusted by thousands of users across many use cases

50 Organs Covered

Providing broad applicability across numerous disease areas

4000+ Clinical Practices

Used by thousands of practices around the globe

Overall rank across all tasks on PathBench (lower is better):

H-Optimus-1: 6.06
Virchow2: 6.34
H-Optimus-0: 6.86
UNI2: 7.10
mSTAR: 7.65
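The page does not say how the overall rank is computed. A non-integer value such as 6.06 suggests an average of per-task ranks, so the sketch below illustrates that reading only. The model names, task names, and scores are invented for illustration and are not PathBench results.

```python
from statistics import mean

def ranks_for_task(scores):
    """Rank models on one task: 1 = best (highest score). Ties are not handled."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {model: position for position, model in enumerate(ordered, start=1)}

def overall_rank(task_scores):
    """Average each model's per-task rank across all tasks (lower is better)."""
    per_task = [ranks_for_task(scores) for scores in task_scores.values()]
    models = per_task[0].keys()
    return {m: mean(r[m] for r in per_task) for m in models}

# Toy scores, invented purely for illustration (assumption, not benchmark data).
toy = {
    "task_a": {"model_x": 0.91, "model_y": 0.88, "model_z": 0.80},
    "task_b": {"model_x": 0.70, "model_y": 0.75, "model_z": 0.60},
    "task_c": {"model_x": 0.85, "model_y": 0.80, "model_z": 0.82},
}
overall = overall_rank(toy)
for model, rank in sorted(overall.items(), key=lambda kv: kv[1]):
    print(f"{model}: {rank:.2f}")
```

Under this reading, a model does not need to win every task to lead the chart. It only needs the lowest average position across tasks.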

Ranked #1 in benchmarks across 229 tasks.

H-Optimus-1 reaches state-of-the-art performance in a variety of benchmarks and in downstream applications, such as biomarker prediction, spatial gene expression, or survival prediction.

Read Full Report

Access Benchmark Study

From Whole Slide to Foundation-Level Insight

1 Histology Images

H-Optimus-1 leverages a vast and diverse dataset of histology images, encompassing millions of samples across various tissue types and pathological conditions.

2 Patch Tokens

Each histology image is systematically partitioned into smaller patches, which are then transformed into token embeddings.

3 Transformer Encoder

The extracted patch embeddings are processed by a transformer encoder, which models their complex spatial and structural relationships.

4 Embedding

The transformer encoder generates a compact embedding that encapsulates critical morphological and contextual features of the histology images.

5 Downstream Tasks

This embedding serves as input for various specialized downstream tasks. Dedicated downstream AI models, or specialized heads, can be built for treatment response, biomarker prediction, grading/prognosis, or tissue/cell segmentation, each using the embedding for targeted analysis.

6 Downstream Example

In the example, a patient's tissue image is analyzed by a downstream AI model tailored for outcome prediction, enabling an accurate assessment (83% risk of relapse). This approach allows each downstream model to be independently optimized for its predictive task.
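As a rough illustration of the patch-token step, the sketch below counts the patches that one input tile yields. It takes the patch size of 14 from the "ViT-g/14" name and the 224×224 tile size from the FAQ on this page. The function name is hypothetical, and any extra class or register tokens the real model may add are not counted.

```python
def patch_grid(tile_px: int, patch_px: int) -> tuple[int, int]:
    """Return (patches per side, total patches) for a square tile.

    Assumes non-overlapping square patches that divide the tile evenly.
    """
    if tile_px % patch_px != 0:
        raise ValueError("tile size must be a multiple of the patch size")
    side = tile_px // patch_px
    return side, side * side

# 224-pixel tile, 14-pixel patches (the "/14" in ViT-g/14).
side, n_tokens = patch_grid(224, 14)
print(f"{side} x {side} grid -> {n_tokens} patch tokens per tile")
```

The transformer encoder then attends over these patch tokens to produce one embedding per tile.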

How H-Optimus-1 helps your research

Get Started with H1

Biomarker Discovery & Patient Stratification

Identify novel predictive biomarkers for targeted therapies and precision medicine.

Enhance patient stratification for clinical trials, ensuring the right patients are selected based on histopathological and molecular features.

AI-Powered Drug Discovery & Preclinical Validation

Analyze drug-tissue interactions at scale, improving toxicology assessments and response predictions.

Predict drug efficacy across different cancer subtypes, reducing early-stage attrition rates.

Accelerating Clinical Trials & Regulatory Approvals

Automate histopathological grading & disease progression tracking to enhance trial endpoints.

Support AI-assisted clinical decision-making by integrating H-Optimus-1 with real-world histology data.

Facilitate regulatory submissions with AI-powered standardization of pathology image assessments, ensuring compliance with FDA and EMA requirements.

Enhancing Digital Pathology & AI-Assisted Diagnosis

Leverage AI for automated annotation and analysis of whole-slide images (WSI) to reduce pathologist workload and improve consistency.

Enable real-time pathology analysis in research hospitals to support faster decision-making.

Case Studies

Case Study

February 3, 2026

Breast Cancer Recurrence Risk Prediction with H‑Optimus‑1 and STAMP

Case Study

December 19, 2025

ICGI researchers build a winning pathology report generation model with H-Optimus-1

View all Case Studies

Ready To Get Started?

Off the shelf access. Evaluation support. Strategic collaboration.

Amazon SageMaker AI

H-Optimus can be accessed through Amazon SageMaker, AWS’s fully managed ML platform, allowing teams to evaluate, fine-tune, and deploy the model within their existing cloud infrastructure.

Academic Licenses

Universities and research institutions can request non-commercial research licenses for evaluation and experimental use.

Industry Licenses

H-Optimus is available for collaboration with industry and clinical partners seeking to accelerate biomarker development, patient stratification, and translational research initiatives.

Not sure where to begin?

Read our full documentation on getting started:

Bioptimus Docs

Frequently Asked Questions

Can H-Optimus be used to automate or assist in generating pathology reports?

H-Optimus has been used as a visual backbone for automated slide-to-report systems. For example, researchers at the Institute for Cancer Genetics and Informatics (ICGI) used H-Optimus-1 to power NARWHAL, an AI system that generates standardized clinical reports directly from gigapixel whole-slide images. The H-Optimus-backed NARWHAL system recently won first place in the global REG2025 challenge for clinical alignment and linguistic quality, demonstrating its robustness across diverse scanners and tissue types. Read the case study.

Importantly, these systems are designed to generate pre-structured drafts that assist pathologists and accelerate workflows, rather than replace expert human validation.

Can H-Optimus be used to predict spatial gene expression or biological pathways?

Yes, researchers are using H-Optimus to infer molecular landscapes directly from standard H&E slides. For example, researchers at The University of Manchester recently developed Deep Pathway, a computational framework that uses H-Optimus-0 to predict pathway-level expression from H&E images. Using the model, they mapped complex pathways (such as Androgen Response) in prostate cancer and predicted hypoxia signatures in glioblastoma. Crucially, these AI-derived hypoxia predictions showed strong visual concordance with actual PIMO staining (the clinical ground truth). Read the paper.

While highly effective for hypothesis generation and cohort stratification, these predictive maps are exploratory tools and do not replace definitive molecular testing.

Can H-Optimus analyze Immunohistochemistry (IHC), Immunofluorescence (IF), or other special stains, or is it strictly for H&E?

The H-Optimus models were pre-trained exclusively on massive, highly diverse datasets of H&E (Hematoxylin and Eosin) stained whole-slide images. However, researchers are successfully adapting H-Optimus for non-H&E analysis in exploratory settings. For example, a recent study published in Laboratory Investigation demonstrated the model's adaptability for immunofluorescence (IF). Researchers at MedStar Georgetown University Hospital used the H-Optimus vision transformer to analyze kidney IF images, fine-tuning the model to automatically screen and classify whole-slide images for immune reactants.

Because the model was pre-trained exclusively on H&E, we recommend that clinical teams rigorously validate its performance when adapting it for non-H&E tasks.

Is H-Optimus available for academic research?

Yes, the H-Optimus family of models is available for academic research and can be accessed directly via Hugging Face.

To support different computational needs and research goals, we offer three distinct models. It is important to note the differences in their capabilities and licensing:

H-Optimus-0: Our original 1.1 billion parameter foundation model, trained on over 500,000 histology slides. This model is fully open-source and released under the permissive Apache 2.0 license, allowing for broad academic and research use.

H-Optimus-1: Our state-of-the-art 1.1 billion parameter model, trained on a massive, highly diverse dataset of over 1 million slides from more than 800,000 patients. This model is available under a CC-BY-NC-ND 4.0 license, meaning it is strictly available for non-commercial, academic research purposes. Any commercial use or monetization requires a separate licensing agreement.

H0-mini: A lightweight, highly efficient model developed in collaboration with Owkin. It was distilled from H-Optimus-0 to deliver performance comparable to larger foundation models but at a significantly reduced computational (inference) cost. Like H-Optimus-1, H0-mini is released under the CC-BY-NC-ND 4.0 license for non-commercial, academic research.

What data was used to train H-Optimus?

Both H-Optimus models were trained using self-supervised learning on massive, proprietary datasets of routine H&E-stained whole-slide images (WSIs). To ensure the models generalize well across different laboratory environments, the training data was intentionally curated for high patient, disease, and technical diversity.

During pretraining, these whole-slide images are converted into billions of small, standardized image tiles (specifically, 224×224 pixel tiles extracted at approximately 0.5 microns per pixel, or MPP) to teach the model the fundamental visual language of histology.
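To make the tile specification concrete, the sketch below works out how large a region must be read from a slide so that, once resampled, it becomes one 224×224 tile at 0.5 microns per pixel. The native resolution of 0.25 microns per pixel and the slide dimensions are assumptions chosen for illustration, and the function names are hypothetical. Tissue masking and the actual image read are omitted.

```python
TILE_PX = 224      # tile side in pixels, as stated above
TARGET_MPP = 0.5   # target resolution in microns per pixel, as stated above

def read_size_px(native_mpp, tile_px=TILE_PX, target_mpp=TARGET_MPP):
    """Side length, in native-resolution pixels, of the region covering one tile."""
    return round(tile_px * target_mpp / native_mpp)

def tile_origins(width_px, height_px, native_mpp):
    """Top-left corners (native pixels) of non-overlapping tiles fully inside the slide."""
    step = read_size_px(native_mpp)
    return [
        (x, y)
        for y in range(0, height_px - step + 1, step)
        for x in range(0, width_px - step + 1, step)
    ]

# Hypothetical slide: 10,000 x 8,000 pixels scanned at 0.25 microns per pixel (assumed).
origins = tile_origins(10_000, 8_000, native_mpp=0.25)
print(read_size_px(0.25), "native pixels per tile side;", len(origins), "tiles")
```

Each tile covers 224 × 0.5 = 112 microns of tissue per side, whatever the scanner's native resolution. Only the number of native pixels read per tile changes.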

Here is the specific breakdown of the training cohorts for each model:

H-Optimus-1: Trained on an extensive collection of over 1 million H&E slides from more than 800,000 patients. To ensure robustness to real-world variability, this dataset spans over 50 different organs and was digitized using 3 different scanner types across more than 4,000 clinical centers. Read more.

H-Optimus-0: Trained on over 500,000 histopathology slides sourced from 4,000 clinical practices, yielding several hundred million training tiles. Read more.

By training on such a vast and diverse corpus of real-world clinical data, the models learn rich, generalizable biological features designed to be highly robust to the typical staining, tissue preparation, and scanning variations encountered across diverse clinical centers.

How do I integrate H-Optimus into my lab?

H-Optimus acts as a foundational "embedding layer" for your digital pathology pipeline. It serves as the computational backbone for your data science teams to build clinical-development applications, rather than acting as an out-of-the-box diagnostic.

A standard integration follows three main phases:

Data Preparation: Scan H&E slides into standard Whole-Slide Image (WSI) formats. Preprocess these by masking and extracting tissue tiles that match the model’s specs (e.g., 224×224 pixels at ~0.5 MPP for H-Optimus-1).
Model Deployment: Deploy the model based on your regulatory and infrastructure needs. For commercial use and sensitive trial data, you can deploy securely within your own environment via AWS SageMaker. For academic research, the model is available via Hugging Face for local hardware deployment. Both setups allow you to generate embeddings across multiple slides in a batch.
Downstream Training & Validation: Feed the model's output (embeddings) into lightweight, task-specific models (like MIL heads) to predict your specific clinical endpoints. Before prospective use, rigorously validate this custom pipeline on your lab's retrospective cohorts to account for local scanner and staining variability.
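The third phase can be sketched in a few lines. The example below uses mean pooling, one of the simplest MIL-style aggregators, followed by a logistic head. It stands in for a trained MIL head and is not the method used by any particular study. The three-dimensional tile embeddings, the weights, and the bias are invented for illustration, and real embeddings from the model are far higher-dimensional.

```python
import math

def mean_pool(tile_embeddings):
    """Average tile embeddings into one slide-level vector."""
    n = len(tile_embeddings)
    dim = len(tile_embeddings[0])
    return [sum(tile[i] for tile in tile_embeddings) / n for i in range(dim)]

def logistic_head(slide_embedding, weights, bias):
    """Map a slide-level vector to a probability with a linear layer and a sigmoid."""
    z = sum(w * x for w, x in zip(weights, slide_embedding)) + bias
    return 1.0 / (1.0 + math.exp(-z))

# Toy tile embeddings for one slide (assumption: values made up for illustration).
tiles = [[0.2, -0.1, 0.4], [0.0, 0.3, 0.2], [0.4, 0.1, 0.0]]
slide_vec = mean_pool(tiles)

# Hypothetical head parameters. In practice these are learned on labeled cohorts.
risk = logistic_head(slide_vec, weights=[1.0, -2.0, 0.5], bias=0.0)
print(f"predicted probability: {risk:.3f}")
```

Because the backbone stays frozen, only the small head needs training and validating on local retrospective cohorts.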

Still have questions?

Our team is available to discuss validation, partnerships, academic access, or technical details. Get in touch to start the conversation.

One Model. Every Scale.

Bioptimus bridges the gap between biological layers. Building on the industry-leading performance of H-Optimus-1, our new M-Optimus model integrates multiple data modalities to provide the definitive multi-scale view of biology.

Learn About M-Optimus
