Description
Scientific imaging gives rise to a multitude of different segmentation tasks, many of which are addressed with manually annotated datasets. We collected a large number of such heterogeneous datasets, comprising over 10 million instance annotations, and demonstrate that in a multi-task setting, segmentation models at this scale cannot be trained effectively with image-based supervision alone. A major reason is that images from the same domain may be used to address different research questions, with varying annotation procedures and styles. For example, images of biological tissue may be evaluated for nuclei or for cell bodies even though the same staining is used.

To overcome these challenges, we propose using simple text-based task descriptions to provide models with the context needed to solve a given objective. We introduce the ConText Transformer, a dual-stream architecture that processes and fuses image and text data. Based on the provided textual description, the model learns to adapt its internal feature representations and thereby switches between segmenting the different classes and annotation styles observed in the datasets. These descriptions range from simple class names (e.g. “white blood cells”), which prompt the model to segment only the referenced class, to more nuanced formulations, such as toggling overlapping segmentations in the model predictions or segmenting a cell’s nucleus during cell segmentation when the respective cell boundary is not visible, as is common, for example, in the TissueNet dataset. Since interpreting these descriptions is part of model training, dedicated terms can also be defined to abbreviate very complex descriptions.

ConText Transformer is designed for compatibility: it can be used with existing segmentation frameworks such as the Contour Proposal Network (CPN) or Mask R-CNN. Our experiments on over 10 million instance annotations show that ConText Transformer models achieve competitive segmentation performance and outperform specialized models on several benchmarks, confirming that a single, unified model can effectively handle a wide spectrum of segmentation tasks and may eventually replace specialist models in scientific image segmentation.
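
To illustrate the dual-stream idea, the following is a minimal PyTorch sketch of how a text embedding could modulate image features via cross-attention. The module name, dimensions, and the specific fusion mechanism are assumptions made for illustration; they are not the published ConText Transformer implementation.

```python
# Minimal sketch of dual-stream image/text fusion (illustrative assumptions,
# not the actual ConText Transformer architecture).
import torch
import torch.nn as nn

class DualStreamFusion(nn.Module):
    def __init__(self, img_dim=256, txt_dim=256, num_heads=8):
        super().__init__()
        # Cross-attention: image tokens attend to text tokens so the task
        # description can modulate the visual feature representation.
        self.cross_attn = nn.MultiheadAttention(
            img_dim, num_heads, kdim=txt_dim, vdim=txt_dim, batch_first=True
        )
        self.norm = nn.LayerNorm(img_dim)

    def forward(self, image_tokens, text_tokens):
        # image_tokens: (B, N_img, img_dim), e.g. flattened feature-map patches
        # text_tokens:  (B, N_txt, txt_dim), e.g. embedded task description
        attended, _ = self.cross_attn(image_tokens, text_tokens, text_tokens)
        return self.norm(image_tokens + attended)  # residual fusion

# Example: fuse a (hypothetical) embedded description such as
# "white blood cells" into image features before the segmentation head.
fusion = DualStreamFusion()
img = torch.randn(1, 64 * 64, 256)  # placeholder image-encoder output
txt = torch.randn(1, 12, 256)       # placeholder text-encoder output
print(fusion(img, txt).shape)       # torch.Size([1, 4096, 256])
```

In a setup like this, the fused features would be passed on to the downstream segmentation head (e.g. a CPN or Mask R-CNN head), so that the same backbone can produce different outputs depending on the textual task description.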