Quantitative evaluation of methods to analyze motion changes in single-particle experiments

Muñoz-Gil, Gorka; Bachimanchi, Harshith; Pineda, Jesús; Midtvedt, Benjamin; Fernández-Fernández, Gabriel; Requena, Borja; Ahsini, Yusef; Asghar, Solomon; Bae, Jaeyong; Barrantes, Francisco J.; Bender, Steen W. B.; Cabriel, Clément; Conejero, J. Alberto; Escoto, Marc; Feng, Xiaochen; Haidari, Rasched; Hatzakis, Nikos S.; Huang, Zihan; Izeddin, Ignacio; Jeong, Hawoong; Jiang, Yuan; Kæstel-Hansen, Jacob; Miné-Hattab, Judith; Ni, Ran; Park, Junwoo; Qu, Xiang; Saavedra, Lucas A.; Sha, Hao; Sokolovska, Nataliya; Zhang, Yongbing; Volpe, Giorgio; Lewenstein, Maciej; Metzler, Ralf; Krapf, Diego; Volpe, Giovanni; Manzo, Carlo

doi:10.1038/s41467-025-61949-x

Download PDF

Registered Report
Open access
Published: 22 July 2025

Quantitative evaluation of methods to analyze motion changes in single-particle experiments

Nature Communications volume 16, Article number: 6749 (2025) Cite this article

5660 Accesses
11 Citations
51 Altmetric
Metrics details

Subjects

Abstract

The analysis of live-cell single-molecule imaging experiments can reveal valuable information about the heterogeneity of transport processes and interactions between cell components. These characteristics are seen as motion changes in the particle trajectories. Despite the existence of multiple approaches to carry out this type of analysis, no objective assessment of these methods has been performed so far. Here, we report the results of a competition to characterize and rank the performance of these methods when analyzing the dynamic behavior of single molecules. To run this competition, we implemented a software library that simulates realistic data corresponding to widespread diffusion and interaction models, both in the form of trajectories and videos obtained in typical experimental conditions. The competition constitutes the first assessment of these methods, providing insights into the current limitations of the field, fostering the development of new approaches, and guiding researchers to identify optimal tools for analyzing their experiments.

Classification-based motion analysis of single-molecule trajectories using DiffusionLab

Article Open access 10 June 2022

A guide to single-particle tracking

Article 12 September 2024

Optimal transport for single-cell and spatial omics

Article 14 August 2024

Introduction

Physiological processes occurring in living cells rely on encounters and interactions between molecules. Archetypal examples include gene regulation, transduction of biological signals, and protein delivery to specific locations. All these processes involve the active or passive transport of biomolecules in highly complex, time-varying, and far-from-equilibrium environments, such as the cell membrane (Fig. 1a). One of the most powerful tools to study these transport phenomena is the combination of live-cell single-molecule imaging with single-particle tracking^1,2 because it can provide the time when and location where single events take place (Fig. 1b, c). Alternative ensemble methods (e.g., fluorescence correlation spectroscopy or fluorescence recovery after photobleaching³) usually provide limited information because they lose track of crucial details when averaging out spatial and temporal fluctuations.

**Fig. 1: Rationale for the challenge organization.**

Methods for single-molecule imaging and single-particle tracking have seen tremendous progress in the last decade, in terms of both experimental acquisition and data analysis^1,2,4,5. The abundance of experimental single-particle trajectories, encompassing molecules, protein complexes, vesicles, and organelles, has led to the development of numerous methods dedicated to the reliable detection of changes in their motion patterns (as summarized in Supplementary Table 1). These changes serve as valuable indicators for the occurrence of interactions within the system. For instance, diffusing particles may exhibit variations in diffusion coefficients (due to processes like dimerization, ligand binding, or conformational changes) or shifts in their mode of motion (attributed to transient immobilization or confinement at specific scaffolding sites) (Fig. 1a)⁶. These interactions can also result in deviations from standard Brownian motion, as characterized by Einstein’s free diffusion model, which includes a linear mean-squared displacement (MSD) and a Gaussian distribution of displacements⁷. This is the case, e.g., of spatiotemporal heterogeneities producing transient subdiffusion at specific timescales^{8,9,10,11,12,13,14,15,16,17,18,19}. Other mechanisms can instead produce asymptotic anomalous diffusion^2,20,21,22. Anomalous diffusion compatible with models such as fractional Brownian motion^{23,24,25,26,27,28}, continuous-time random walk^29,30, scaled Brownian motion³¹, and Lévy walk³² has been observed for telomers, macromolecular complexes, proteins, and organelles in living cells. Several approaches have been recently proposed to detect and quantify these behaviors^33,34, also involving machine-learning techniques^{35,36,37,38,39,40,41}.

To gain insights into the performance of methods to detect anomalous diffusion from individual trajectories, in 2021, we successfully ran the 1st AnDi Challenge⁴². The discussion that developed between members of diverse research communities working on biology, microscopy, single-particle tracking, and anomalous diffusion (including experimentalists, theoreticians, data analysts, and computer scientists) emphasized the necessity for deeper insights into biologically relevant phenomena. First, it identified a need to evaluate methods to determine the switch between different diffusive behaviors, as often observed in experiments. Second, it highlighted the necessity to assess the methods’ crosstalk in detecting inherent anomalous diffusion from nonlinearity in the MSD due to motion constraints or heterogeneity. Third, it emphasized the importance of determining whether the bottleneck of the analysis process was at the level of the analysis of the single trajectory or associated with their extraction from experimental videos. These needs shaped the design of the 2nd AnDi Challenge, defining its scope with a focus on characterizing and ranking the performance of methods that analyze changes of dynamic behavior. While we retained the name of the 1st AnDi Challenge to build upon its already-established community, the 2nd AnDi Challenge focused mainly on revealing heterogeneity rather than anomalous diffusion. In the simulated datasets, anomalous diffusion emerged from heterogeneity itself or was intentionally introduced for evaluation purposes.

A multitude of methods have been designed to identify and characterize heterogeneous diffusion (Supplementary Table 1). They can be classified based on the heterogeneity they aim to identify or the kind of analysis they perform. We considered three heterogeneity classes that these methods aim to identify: (i) changes in the diffusion coefficient D; (ii) changes in the anomalous diffusion exponent α (often classified as subdiffusion, diffusion, or superdiffusion); and (iii) changes in the phenomenological behavior associated with interactions with the environment (often classified as immobilization, confinement, (free) diffusion, and directed motion). While changes in the diffusion coefficient and in the phenomenological behavior have been widely reported, the exploration of changes in the anomalous diffusion exponent is a more recent development^43,44,45,46, which is attracting increasing interest also from the theoretical point of view^47,48,49,50. The introduction of new methods for data analysis, as promoted by the Challenge, had the objective to push the performance for detecting subtle changes in these diffusion properties in systems where they could have been overlooked. Along this line, it must be pointed out that the traditional analysis based on the calculation of the scaling exponent of the mean-squared displacement (MSD) can create some ambiguity between the last two classes. Just to provide an example, a particle performing Brownian diffusion in a confined region has an exponent α = 1 in terms of the generating motion, but its MSD features a horizontal asymptote at long times, corresponding to α = 0. In the following, we will refer to the exponent α as the characteristic feature of the generating motion.

From the analysis point of view, we identified two classes of methods: (i) ensemble methods, meant to determine characteristic features out of an ensemble of trajectories (Fig. 1d) and (ii) single-trajectory methods, meant to identify changepoint (CP) locations through trajectory segmentation (Fig. 1e). While most available methods rely on the analysis of trajectories obtained from video processing⁵¹, recent advances in computer vision have led to methods capable of directly extracting information from raw movies without requiring the explicit extraction of trajectories^52,53. Each method has its own set of advantages and disadvantages, and its performance may depend on the specific problem under consideration. However, there is no universally accepted gold standard for determining which method to use to address each specific problem.

To cater to these more advanced needs, we ran an open competition as the 2nd Anomalous Diffusion (AnDi) Challenge. The rationale described above shaped the scope of the challenge, defining the choice of the datasets and the design of the tasks. To rely on an objective ground truth, we assessed the methods’ performance on simulated datasets inspired by models of diffusion and interactions documented in biological systems. These datasets describe particles undergoing fractional Brownian motion (FBM,⁵⁴) with piecewise-constant parameters. FBM-type motion has been widely observed in biological systems by means of microrheology, a technique that uses large tracer particles as probes to study the properties of the environment⁵⁵. Anomalous diffusion compatible with FBM has also been reported for telomers and macromolecular complexes in living cells^{20,23,24,25,26,27,28,56}. Beyond this evidence, in the context of the Challenge, FBM served as a tool to enable the tuning of diffusion parameters. The combination of parameter values and interaction models might produce situations that do not correspond to previously documented biological scenarios but will be valuable to test the methods’ performance in a wide range of conditions. In biological experiments, other kinds of motion and even non-Gaussian behavior have been reported²¹. However, the choice of FBM did not limit the generality of the Challenge since other models of diffusion and non-Gaussian behavior can be obtained by properly tuning the parameters of the simulations. Datasets provided for the last phase of the competition included motion with parameters inspired by actual experiments for their comparative analysis with the Challenge methods.

The standard and straightforward approach in live-cell single-molecule imaging primarily captures information related to lateral motion. In cases involving flat membranes or isotropic systems, employing 2D imaging and tracking techniques suffices for obtaining accurate motion-related parameters. However, when dealing with motion on non-flat surfaces or within anisotropic 3D environments, relying solely on 2D projections can result in critical information being overlooked, potentially leading to the misinterpretation of diffusion coefficients or the appearance of apparent anomalous diffusion effects^57,58. Consequently, drawing definitive conclusions under such circumstances should be avoided or approached with caution. To study motion occurring in 3D space, it is advisable to employ 3D tracking methods, such as off-focus imaging (i.e., the analysis of ring patterns in the defocused point spread function)⁵⁹, interference/holographic approaches⁶⁰, multifocus imaging⁶¹, or point spread function engineering⁶². Although more challenging, these methods can also measure the motion along the axial dimension, facilitating a more thorough characterization. For the purposes of the Challenge, we choose to concentrate on studying changes in diffusion behavior occurring within a 2D context, driven by particle interactions of various types.

While this challenge focused on data inspired by biological systems, the use of regime-switching detection and trajectory segmentation extends well beyond the domain of living cells. Particularly interesting applications also include, e.g., the analysis of biomedical signals⁶³, speech⁶⁴, traffic flows⁶⁵, seismic signals⁶⁶, econometrics^67,68, ecology⁶⁹, and river flows⁷⁰.

Results

Datasets and ground truth

In order to benchmark the different methods on data with a known ground truth, we relied on numerical simulations. We developed the andi-datasets Python package⁷¹ to generate the required datasets to train and evaluate the various methods. Details about available functions can be found in the hosting repository⁷¹.

Particle motion was simulated according to fractional Brownian motion (FBM,⁵⁴), a model that reproduces Brownian and anomalous diffusion processes by tuning the correlation of the increments through the Hurst exponent H. FBM is a Gaussian process with a covariance function

$${\rm{E}}[{B}_{H}(t){B}_{H}(s)]=K\left({t}^{2H}+{s}^{2H}-| t-s{| }^{2H}\right),$$

(1)

where E[⋅] denotes the expected value and K is a constant with units length² ⋅ time^−2H. In order to generalize FBM in two dimensions (2D), a trajectory R(t) is represented as R(t) = {X(t), Y(t)}, where X(t) and Y(t) are independent FBM processes along the x and y axes, respectively³³. The anomalous diffusion exponent is related to the Hurst exponent as α = 2H⁵⁴, and the MSD for an unconstrained FBM in 2D scales with time t as

$${\rm{MSD}}(t)=4K{t}^{\alpha }.$$

(2)

When α = 1, FBM reverts to Brownian motion and K corresponds to the diffusion coefficient D. FBM describes subdiffusion for 0 < H < 1/2 (0 < α < 1), Brownian diffusion for H = 1/2 (α = 1), and superdiffusion for 1/2 < H < 1 (1 < α < 2).

We considered the following physical models of motion and interactions (Fig. 2a):

Single-state model (SSM)—Particles diffusing according to a single diffusion state, as observed for some lipids in the plasma membrane^14,15,72. This model also serves as a negative control to assess the false positive rate of detecting diffusion changes.
Multi-state model (MSM)—Particles diffusing according to a time-dependent multi-state (2 or more) model of diffusion undergoing transient changes of K and/or α. Examples of changes of K have been observed in proteins as induced by, e.g., allosteric changes or ligand binding^73,74,75,76.
Dimerization model (DIM)—Particles diffusing according to a 2-state model of diffusion, with transient changes of K and/or α induced by encounters with other diffusing particles. Examples of changes of K have been observed in protein dimerization and protein-protein interactions^{77,78,79,80,81}.
Transient-confinement model (TCM)—Particles diffusing according to a space-dependent 2-state model of diffusion, observed for example in proteins being transiently confined in regions where diffusion properties might change, e.g., the confinement induced by clathrin-coated pits on the cell membrane⁸². In the limit of a high density of trapping regions, this model reproduces the picket-and-fence model used to describe the effect of the actin cytoskeleton on transmembrane proteins^9,83.
Quenched-trap model (QTM)—Particles diffusing according to a space-dependent 2-state model of diffusion, representing proteins being transiently immobilized at specific locations as induced by binding to immobile structures, such as cytoskeleton-induced molecular pinning^17,84.

While the interaction mechanisms producing the heterogeneous diffusion are inspired by biological scenarios, some of the combinations of diffusion parameters and models lead to situations that may not correspond to previously documented biological contexts. Nevertheless, this approach holds substantial value as it enables the comprehensive assessment of method performance across a broad spectrum of conditions.

**Fig. 2: Physical models of interaction and structure of the simulated datasets.**

In the simulations, each dynamic state is characterized by a distribution of values for the parameters K and α. For each trajectory, the values of K and α for each state are randomly drawn from Gaussian distributions with bounds α ∈ (0, 2) and K ∈ [10⁻¹², 10⁶] pixel²/frame^α. The interaction distance and the radius of confinement or trapping have constant values across each experiment. Simulations are provided in generalized units (i.e., pixels and frames) that can be rescaled to meaningful temporal and spatial scales.

A detailed description of the simulation procedure is presented in Extended Methods.

Competition design

To enable the assessment of the performance of previously established methods while fostering the development of new approaches and the participation from diverse disciplines, the challenge was organized along two tracks:

Video Track—based on the analysis of raw videos.
Trajectory Track—based on the analysis of trajectories.

For each track, datasets were provided according to a hierarchical structure (Fig. 2b, c) that includes:

Experiment—A given biological scenario defined by a model of interactions and a set of parameters describing the dynamic interplay of the particles and the environment.
FOV—A region of the sample where the recording takes place. Particles within the same field of view (FOV) can undergo interactions among themselves and/or with the environment.
Video (Video Track only)—Videos corresponding to each FOV.
Trajectory (Trajectory Track only)—Trajectory corresponding to the motion of an individual particle.

For both tracks, all particles used in the simulations and located in the FOV are provided/visualized (i.e., full labeling conditions). The effect of blinking or photobleaching was not taken into account.

In each track, participants could compete in two different tasks, as typically done in the analysis of experimental data:

Ensemble Task—Ensemble-level predictions providing, for each experimental condition, the model used to simulate the experiment, the number of states, and the fraction of time spent in each state. For each identified state, participants had to determine the mean and standard deviation of the distribution of the generalized diffusion coefficients K, and the mean and standard deviation of the distribution of the anomalous diffusion exponent α corresponding to the underlying motion.
Single-trajectory Task—Trajectory-level predictions providing for each trajectory a list of M inner CPs delimiting M + 1 segments with different dynamic behavior. For each segment, participants had to identify the generalized diffusion coefficient K, the anomalous diffusion exponent α corresponding to the underlying motion, and an identifier of the kind of constraint imposed by the environment (0 = immobile, 1 = confined, 2 = free (unconstrained, 0.05 ≤ α < 1.9), 3 = directed (1.9 ≤ α < 2.0). For the Video Track, predictions had to be provided for a subset of particles (in the following, we will refer to them as VIP, very important particles) identified through a label map of the first frame of the movie. For the Trajectory Track, predictions had to be provided for all trajectories in the FOV.

For each task, several metrics were evaluated (see Scoring and evaluation). Participants were allowed to provide partial submissions, e.g., including predictions for a limited subset of experiments or for specific parameters. For ranking purposes of the Challenge, missing predictions were scored with the worst possible value of the corresponding metric.

Competition overview

The 2nd AnDi Challenge was held between December 1, 2023, and July 15, 2024, on the Codalab platform. It was divided into three phases, namely Development, Validation, and Challenge. The Development Phase (5 months) was intended for the participants to set up their methods, test them, and familiarize themselves with the datasets and the scoring platform. An unlabeled dataset was available, and the public leaderboard showed scores obtained on this dataset. An online workshop was held on February 22, 2024, to instruct the participants about the details of the challenge. The Validation Phase (1 month) was a test of the actual final challenge. A new dataset (described in Challenge Dataset) was provided, and the leaderboard was again public. The Challenge Phase (15 days) was the final stage of the competition. A new dataset was provided, and the number of submissions per team was limited to 1 per day. The results were not publicly disclosed, and the leaderboard was made public only after the end of the competition. In total, we received 1343 submissions during the three phases. Participants registered in teams of 1 to 5 people. In the final stage, out of 80 registered participants, 53 individuals, divided into 18 teams, were included in the leaderboard (see Supplementary Table 2 for the list of participating teams). The teams’ affiliations spanned Europe (12 teams), Asia (6 teams), and America (1 team). From the final leaderboard, members of the top 5 teams in each task were invited to co-author this article. An overview of these teams and the methods is provided in Supplementary Information— Overview of Teams and Methods.

The results of the Challenge were discussed with the participants and other experts from the field during the 2nd Anomalous Diffusion Workshop that was held in June 2025.

Challenge dataset

The Challenge dataset was composed of 12 experiments corresponding to different diffusion models and parameter values. Details about the numeric values of parameters of the experiments are given in Supplementary Table 3. In addition, Supplementary Fig. 1 summarizes the distribution of specific features within the dataset. EXP 1 aimed at mimicking multistate diffusion in membrane proteins. Average diffusion coefficients and the transition matrix of the MSM were chosen to reproduce, with the appropriate scaling, the three fastest states reported for the diffusion of the α2A-adrenergic receptor⁸⁰. EXP 2 reproduced changes in diffusion coefficient due to protein dimerization, inspired by the behavior reported for the epidermal growth factor receptor ErbB1⁷⁷. EXP 3, EXP 4, and EXP 5 were designed to compare the methods’ ability to detect changes from the same free diffusive state to a slow diffusing state characterized either by traps (QTM, EXP 3), small confinement regions (TCM, EXP 4), or a subdiffusive dimeric state (DIM, EXP 5). EXP 6 and EXP 7 were meant to assess the methods’ ability to take advantage of the knowledge of the physical model itself and additional information present in the experiment to improve predictions. The experiments corresponded to different theoretical models (DIM and MSM) with the same diffusive parameters. EXP 8 served as a negative control and contained only SSM trajectories with very broad distributions of K and α. EXP 9 was generated from QTM with very short trapping times and superdiffusion in the free state to assess how the methods deal with such extreme conditions. The other three experiments contained data with extreme and unrealistic parameters meant to assess potential biases of the methods, and will not be discussed further.

Scoring and evaluation

The performance of the methods was evaluated using specific metrics for each task. For ranking purposes in the Challenge, composite metrics were used, as described below.

Ensemble task

Participation in the Ensemble Task required predictions of the type of model used for simulating each experiment, the number of states S of the model, and the parameters of each state. The type of model was simply evaluated as correct or wrong. The prediction of the number of states was assessed by measuring the difference with the ground truth. For both the generalized diffusion coefficient and the anomalous diffusion exponent, predictions had to include the mean, the standard deviation, and the relative weight of each state. From these values, we computed the associated multi-modal distributions P_α and P_D. The similarity of these distributions to the ground-truth distributions Q_α and Q_D was assessed by means of the first Wasserstein distance (W₁),

$${W}_{1}(P,Q)={\int}_{{\rm{supp}}(Q)}| {{\rm{CDF}}}_{P}(x)-{{\rm{CDF}}}_{Q}(x)| dx$$

(3)

where CDF_Q is the cumulative distribution function of the distribution Q and supp(Q) is the support (α ∈ (0, 2) and K ∈ [10⁻¹², 10⁶] pixel²/frame^α).

Single-trajectory task

Participation in the Single-trajectory Task required predictions of the M CPs and the dynamic properties, i.e., the generalized diffusion coefficient K, the anomalous exponent α, and diffusive-type identifiers of the resulting M + 1 segments. Different metrics were used to evaluate the methods’ performance.

CP detection metrics

Following Ref. ⁵¹, given a ground-truth CP at locations t_(GT),i, and a predicted CP at locations t_(P),j, we defined the gated absolute distance:

$${d}_{i,j}=\min (| {t}_{({\rm{GT}}),i}-{t}_{({\rm{P}}),j}|,{\varepsilon }_{{\rm{CP}}}),$$

(4)

where ε_CP was used as a fixed maximum penalty for CPs located more than ε_CP apart. For a set of M_GT ground-truth CPs and M_P predicted CPs, we solved a rectangular assignment problem using the Hungarian algorithm⁸⁵ by minimizing the sum of distances between paired CPs:

$${d}_{{\rm{CP}}}=\mathop{\min }\limits_{{\rm{paired}}\,{\rm{CP}}}\left(\sum {d}_{i,j}\right).$$

(5)

The distance d_CP allows to define a pairing metric:

$${\alpha }_{{\rm{CP}}}=1-\frac{{d}_{{\rm{CP}}}}{{d}_{{\rm{CP}}}^{\max }},$$

(6)

where ${d}_{{\rm{CP}}}^{\max }={M}_{{\rm{GT}}}\,{\varepsilon }_{{\rm{CP}}}$ is the distance associated with having all predicted CPs unpaired or at a distance larger than ε_CP from all ground-truth CPs. The metric α_CP is bound in [0, 1], taking a value of 1 if all ground-truth and predicted CPs are matching exactly. Similarly, we define a CP localization metric:

$${\beta }_{{\rm{CP}}}=\frac{{d}_{{\rm{CP}}}^{\max }-{d}_{{\rm{CP}}}}{{d}_{{\rm{CP}}}^{\max }+\overline{{d}_{{\rm{CP}}}}},$$

(7)

where $\overline{{d}_{{\rm{CP}}}}$ is the distance associated with having all unassigned predicted CPs at a distance larger than ε_CP from all ground-truth CPs. This metric measures the presence of spurious CPs and is bound in [0, α_CP], taking value α_CP if no spurious CPs are present. We also calculate the number of true positives (TP), i.e., the paired true and predicted CPs with a distance smaller than ε_CP. Spurious predictions, i.e., not associated with any ground truth or having a distance larger than ε_CP were counted as false positives (FP). Ground truth CPs not having an associated prediction at a distance shorter than ε_CP were considered false negatives (FN). Given an experiment containing N trajectories, we computed the overall number of TP, FP, and FN. We then used these values to calculate the JSC over the whole experiment as:

$${\rm{JSC}}=\frac{{\rm{TP}}}{{\rm{TP}}+{\rm{FN}}+{\rm{FP}}}.$$

(8)

For the predicted CPs classified as TP, we also computed the root mean square error (RMSE), defined as:

$${\rm{RMSE}}=\sqrt{\frac{1}{N}\sum _{\begin{array}{c}{\rm{paired}}\,{\rm{CP}}\\ {d}_{i,j} < {\varepsilon }_{{\rm{CP}}}\end{array}}{\left({t}_{({\rm{GT}}),i}-{t}_{({\rm{P}}),j}\right)}^{2}}.$$

(9)

Metrics for the estimation of dynamic properties

For the evaluation of the methods’ performances on the estimation of the dynamic properties, we first followed a procedure similar to the one described above for the pairing of the CPs. Predicted CPs were used to define the predicted trajectory segments. We defined a distance between predicted and ground-truth segments based on the JSC calculated with respect to their temporal support, where time points at which predicted and ground-truth segments overlap were considered as TP, predicted time points not corresponding to the ground truth as FP, and ground-truth time points not predicted as FN. The Hungarian algorithm was used to pair segments by maximizing the sum of the JSC. Only paired segments were used to calculate metrics assessing methods' performance for the estimation of dynamic properties. For the generalized diffusion coefficient K, we used the mean squared logarithmic error (MSLE) defined as:

$${\rm{MSLE}}=\frac{1}{N}\sum _{\begin{array}{c}{\rm{paired}} \\ {\rm{segments}}\end{array}}{\left(\log ({K}_{({\rm{GT}}),i}+1)-\log ({K}_{({\rm{P}}),j}+1)\right)}^{2}.$$

(10)

For the anomalous diffusion exponents α, we used the mean absolute error (MAE):

$${{\rm{MAE}}}_{\alpha }=\frac{1}{N}\sum _{\begin{array}{c}{\rm{paired}}\\ {\rm{segments}}\end{array}}| {\alpha }_{({\rm{GT}}),i}-{\alpha }_{({\rm{P}}),j}|,$$

(11)

where N is the total number of paired segments in the experiment, α_(GT),i and α_(P),j represent the ground-truth and predicted values of the anomalous exponent of paired segments, respectively. For the classification of the type of diffusion, we used the F₁-score:

$${{\rm{F}}}_{1}=\frac{2{{\rm{TP}}}_{{\rm{c}}}}{2{{\rm{TP}}}_{{\rm{c}}}+{{\rm{FP}}}_{{\rm{c}}}+{{\rm{FN}}}_{{\rm{c}}}},$$

(12)

where TP_c, FP_c, and FN_c represent true positives, false positives, and false negatives with respect to segment classification. The metric was calculated as a micro-average, which aggregates the contributions of all classes to compute the average metric and is generally preferable when class imbalance is present.

Metrics for challenge ranking

For ranking purposes, we used the mean reciprocal rank (MRR) as a summary statistic for the overall evaluation of software performance⁴²:

$${\rm{MRR}}=\frac{1}{N}\cdot \mathop{\sum }\limits_{i=1}^{N}\frac{1}{{{\rm{rank}}}_{{{\rm{M}}}_{{\rm{i}}}}},$$

(13)

where ${{\rm{rank}}}_{{{\rm{M}}}_{{\rm{i}}}}$ corresponds to the position in an ordered list based on the value of the corresponding metrics M_i.

For the Ensemble Task, the metrics involved in the calculation were the F₁-score of the model and the MAE of the distributions of K and α. For the Single-trajectory Task, we used the JSC and the RMSE of CPs, the MSLE of K, and the MAE of α.

Overview of the challenge results

The Challenge dataset was comprehensively designed to test the submitted methods under distinct scenarios, using ad hoc metrics to evaluate their specific capabilities. For ranking, we employed composite metrics that aggregate the scores from different experiments and subtasks. The results are summarized in Fig. 3. Here, we present an overview of the Challenge results, highlighting the general trends observed. The complete rankings are provided in Supplementary Fig. 2.

In the Single-trajectory Task (Fig. 3a), one method based on UNet3+^86,87 (team I) clearly outperformed the others, whereas the Ensemble Task (Fig. 3b) showed a more balanced competition. From the MRR breakdown, we observed that the top team in the Single-trajectory Task performed consistently well across all metrics. In contrast, for the Ensemble Task, the top teams improved their final ranks by specializing in one of the two subtasks.

We also show the correlation between pairs of metrics associated with CP detection (Fig. 3c) and the prediction of diffusive properties (Fig. 3d, e). The predictions for the Video Track (represented by filled squares) are also included alongside those of the Trajectory Track (represented by empty circles). Across methods, enhanced CP detection, reflected by higher JSC and lower RMSE, yields a tight correlation between these metrics (Fig. 3c). A similar but weaker trend appears for K and α errors (Fig. 3d, e), because their estimation often relies on distinct algorithms, decoupling improvements in one from the other.

In the plots, the dashed lines connect the predictions of teams participating in both tracks. All teams in the Video Track (teams E and Q for the Single-trajectory Task, teams E and F for the Ensemble Task), except for team K, improved their predictions in the Trajectory Track compared to the Video Track. Notably, all four teams first extracted the trajectories using a previously established tracking method^{5,40,88,89,90,91} and then performed the ensuing analysis using the same method developed for the Trajectory Track. While this highlights the influence of error associated with the tracking process⁵¹, none of the methods explored the possibility of obtaining results directly from the video, which was one of the exploratory goals of this competition.

Finally, Fig. 4 shows the score obtained for subtask metrics by all teams for each experiment (filled symbols). The consistently lower performance of the Video Track compared to the Trajectory Track lends support to the third rationale: it suggests that challenges in accurately extracting trajectories from experimental videos represent a more significant bottleneck than the downstream analysis of pre-extracted tracks.

These plots provide further insight into which experimental conditions were more challenging for each subtask. For example, CP detection in EXP 1 (MSM with 3 states) was particularly difficult, as indicated by the low JSC in Fig. 4a. As shown in Fig. 4e, classification of the type of diffusion for EXP 4 (TCM) was more challenging than EXP 3 (QTM), despite having similar parameters for the unrestrained motion. For the Ensemble Task, we observe poorer predictions for K in EXP 8 (SSM, Fig. 4f) and for α in EXP 9 (QTM, Fig. 4g). In the following, we will comparatively discuss results obtained for groups of experiments aimed at detecting specific method capabilities. For most of these analyzes, we will mainly consider the methods of the top 5 teams in each Track and Task.

CP detection and segment diffusion properties

A main aspect of the Challenge was the evaluation of CP detection capability and the ensuing assessment of diffusion properties for the identified segments. In particular, we tested the methods’ ability to distinguish true anomalous diffusion from subdiffusive behavior that emerges solely from physical constraints, directly addressing the second rationale. These insights were provided by the Single-trajectory Task.

As shown in Fig. 4a–e, the methods generally performed well when tested on time-varying processes. We sought to characterize the false positive rate of the methods by evaluating their behavior over the trajectories of EXP 8 having no CPs (Fig. 5a, b). EXP 8 also served to assess the methods’ ability to estimate parameters K and α independently of errors induced by incorrect segmentations. Submitted predictions were benchmarked with the estimations of K and α obtained by linear and logarithmic fits of the MSD, respectively (dashed lines). Most methods predicted very few CPs for these trajectories, producing a low false positive rate and outperformed the MSD fit for both K and α (Fig. 5a, b).

**Fig. 5: CP detection and segment diffusion properties.**

A relevant aspect associated with CP detection accuracy is its dependence on the number of CPs per trajectory, shown in Fig. 5c–e, which is inversely related to the average segment duration. As expected, the JSC shows worse performance as the number of CPs increases (Fig. 5c). Regarding the diffusion parameter estimation, we observe that the methods allow a robust estimation of K independently of the number of CPs (Fig. 5d), whereas for α we observe a drop in performance as the number of CPs increases (Fig. 5e). This confirms the difficulty of estimating α from short segments, due to its asymptotic nature, already observed in the 1st AnDi Challenge⁴².

Classification of types of diffusion

One of the goals of this competition was to assess the methods’ ability to classify different diffusion types and distinguish among distinct physical models. Results for all experiments of the Video and Trajectory Tracks are shown in Supplementary Figs. 3 and 4, respectively. The results of the two tracks were qualitatively similar but the Video Track had overall lower scores since all teams except team Q missed the immobile state (Supplementary Fig. 3). To summarize the methods’ ability to assign segments to diffusion types, in Fig. 6 we show the distribution of each diffusive state compared to the ground truth (horizontal segments) for representative experiments of the Trajectory Track. In Fig. 6a we exemplarily show the results obtained for EXP 9, a QTM with an unconstrained state having a narrow distribution of K but with α values that could produce either superdiffusive or directed motion. In this case, only the top method (team I, light blue) was able to produce a reliable classification of the diffusion type of the segments. The difficulty in inferring the correct type of mechanism producing interaction underscores the challenges in accurately analyzing this kind of data, which can have significant implications for the biological interpretation of the results. Although perfect classification of diffusive states remains challenging, the algorithms nonetheless provide precise estimates of critical biophysical parameters, namely, the average dwell times in both trapped and unconstrained states (inset of Fig. 6a). The measure of these parameters is essential for quantifying binding kinetics, confinement lifetimes, and transition rates that directly inform biological interpretation.

The second rationale for the Challenge was to probe the methods’ ability to disentangle genuine anomalous diffusion from subdiffusive behaviors arising purely from motion constraints. To test the methods in challenging conditions, we designed a group of experiments (EXP 3, EXP 4, and EXP 5) with different underlying models but with diffusive parameters that produce similar trajectories. The three experiments share an unconstrained state with normal diffusion, and K ≈ 1: EXP 3 is simulated as QTM, whereas EXP 4 is from a TCM with a small confinement radius and α ≈ 0.2, and EXP 5 is DIM with a dimeric state with α ≈ 0.2. Other parameters were set to obtain similar residence times in the different states. Figure 6b–d highlights the performance of the top five methods across EXP 3–5. Teams I, C, and R each correctly classify over 95% of segments, closely matching the true distribution of diffusive states. Team E tends to over-label segments as diffusive, while Team O occasionally confuses confined segments for diffusive ones and vice versa. Team R, despite its high overall accuracy, also makes occasional misclassifications of diffusive segments as immobile or confined. Importantly, for EXP 4 (small-radius confinement) and EXP 5 (dimerization-induced subdiffusion), misclassification as immobile is negligible for Teams I, C, and R. Detecting confinement in EXP 4 is particularly challenging since short dwell times in confined areas yield few boundary reflections, inducing confusion with unconstrained anti-persistent subdiffusion of EXP 5. The ability of Teams I, C, and R to resolve these subtle cases underscores the high sensitivity and robustness of their methods.

Using physical models to enhance method performance

The information contained in an individual trajectory is typically sufficient to estimate CPs and diffusive properties. However, for some physical models, the knowledge of the model itself offers additional information that could be used to improve further CP detection and parameter estimation. This is the case for QTM and TCM, where changes in diffusion correspond to spatial constraints. For DIM, diffusion changes are associated with particle proximity; in addition, since particles in a dimer co-diffuse, one could, in principle, use twice as much information to estimate K and α, although in typical experimental conditions it may be very challenging to track two co-diffusing particles.

Along these lines, for the Single-Trajectory Task, the lowest JSC values were obtained for EXP 1 and EXP 7 (circles in Fig. 4a). Both experiments correspond to simulations of MSM, a model where the diffusion changes are produced in a purely time-dependent fashion and the dataset itself does not provide additional hints to determine them. This suggests that the methods can directly or indirectly take advantage of the presence of a physical event (e.g., trapping, confinement, or dimerization) to enhance CP detection accuracy. To assess this effect quantitatively, we used EXP 5 and EXP 6, which correspond to different physical models (DIM and MSM, respectively) generated with an identical set of diffusive parameters. To quantify model-based gains, we computed the relative improvement

$$\Delta m(\%)=\frac{{m}_{{\rm{DIM}}}-{m}_{{\rm{MSM}}}}{{m}_{{\rm{MSM}}}}\times 100\%$$

(14)

for each subtask metric (JSC, MSLE, and MAE). Figure 7 reports these improvements for all methods, with the overall average shown as a dashed line.

**Fig. 7: Effect of the physical model.**

Surprisingly, while most of the methods showed improved performance for the CP prediction in DIM (Fig. 7a), there were minor differences in the prediction of diffusive properties (Fig. 7b, c). We believe this is because the methods predict each trajectory’s properties without considering it in the ensemble of the FOV or of the experiment, an observation that may improve the next generation of methods.

Ensemble predictions

The Ensemble Task was designed to test whether the methods could take advantage of the increased statistics obtained from common parameters shared by all trajectories within the same experiment to better identify the type of motion and estimate its parameters. As discussed earlier, several approaches of this type have been devised and used in the past to extract biophysical information from single-particle tracking data (Supplementary Table 1). However, no pure ensemble-level method, i.e., one that disregards the individual trajectory identity, was employed for the Challenge. Instead, all teams that provided submissions for the Ensemble Task used predictions obtained at the single-trajectory level, which were then pooled together to estimate the moments of the distributions of the diffusive parameters. Results for all experiments of the Video and Trajectory Tracks are shown in Supplementary Figs. 5–8. The resulting distributions are summarized in Fig. 8 for 4 exemplary experiments (EXP 4, EXP 7, EXP 8, and EXP 9) of the Trajectory Track. The pooling operation was performed using two general approaches: teams either applied a Gaussian mixture model (GMM) or a clustering algorithm on the predicted segments to extract subpopulation parameters, with four of the top 5 teams opting for the former approach (teams E, I, M, and O). Interestingly, as it can be inferred from Fig. 3a, b, the scores obtained by the teams participating in both tasks showed a low correlation. Therefore, accurate predictions at the single-trajectory level do not necessarily translate into reliable ensemble-level predictions, pointing to a critical role of the clustering approach.

**Fig. 8: Ensemble task predictions for the trajectory track.**

Figure 8a, b shows an experiment where all teams provided consistent and reasonable predictions. This is particularly evident for the K distribution in EXP 7 and EXP 8 (Fig. 8b, c). Since the methods rely on estimates of K per segment and then apply GMM or k-means, they generally tend to over-fragment wide K ranges, misrepresenting the overall distribution. The corresponding predictions for the distributions of α for these experiments are shown in Fig. 8f, g. For EXP 8, characterized by the absence of CPs and nearly flat distributions of K and α, most methods successfully captured the broad distribution of α (Fig. 8g). However, their predictions for K (Fig. 8c) were often biased toward different ranges within the allowed support. In contrast, EXP 9 presented a population of short dwell times in the trapped state. Most methods successfully detected the occurrence of these events, as reflected in the K distribution (Fig. 8d), but, with the exception of team I, failed to associate these events with the correct α = 0 Fig. 8h.

We further point out that optimizing methods to provide high scores for the metrics of the competition did not always translate into more meaningful insights about the underlying physical processes. For instance, teams M, H, and O showed significant biases across all experiments when predicting the K distribution but still achieved high rankings according to the metric in Eq. (3) (Supplementary Fig. 6). Moreover, accurately predicting the number of true states did not provide a clear advantage with this metric, as most top teams overestimated the number of states but carefully adjusted their relative weights to minimize differences with the ground-truth distribution.

Results summary and take-home messages

Robust changepoint detection

Top single-trajectory methods (e.g., based on UNet3+⁸⁶) consistently achieve over 95% accuracy in identifying segment boundaries, with only minor false-positive rates across all scenarios.

Distinguishing confinement, immobilization, and anomalous diffusion

Leading algorithms accurately classify segments arising from geometric constraints or anomalous dynamics. Only very short segments and exponents close to zero remain challenging, indicating minimal crosstalk between distinct diffusion mechanisms.

Trajectory extraction is a bottleneck

Video-Track performance lags the Trajectory Track by 10−30%, highlighting that linking and localization errors-not downstream analysis–drive most of the accuracy loss.

Parameter estimation benefits from physical priors

Incorporating known physical models may yield significant gains in changepoint detection, but separate estimation pipelines for K and α result in only modest improvements in parameter accuracy.

Dedicated ensemble approaches are needed

Ensemble Task submissions rely on GMM or k-means clustering of per-trajectory outputs, which fragments broad parameter distributions (e.g., EXP 7–8). Ensemble approaches, either bypassing single-trajectory clustering or using more sophisticated grouping techniques, hold potential for uncovering population-scale insights.

Discussion

The 2nd AnDi Challenge provided a platform for advancing methods to characterize diffusion trajectories, with a special focus on those exhibiting transitions between distinct diffusive regimes. Through this Challenge, participants developed approaches that, when applied to standardized benchmarks, demonstrate robust capabilities in analyzing processes akin to those found in complex biophysical environments.

The high participation from teams spanning different fields vividly demonstrated the first rationale for the Challenge: the urgent need for standardized, rigorously evaluated methods to analyze dynamic changes in particle motion.

The Challenge highlighted several key insights. The methods for changepoint analysis have reached a good level of maturity. Participants demonstrated strong capabilities in detecting changepoints, which is crucial for understanding transitions between different diffusive regimes. However, the characterization of the resulting segments can still be improved. Accurate estimation of diffusion parameters within these segments remains challenging, particularly for short segments where the asymptotic nature of certain parameters, such as the anomalous diffusion exponent α, complicates analysis. Sequence-to-sequence machine learning methods, mostly based on architectures combining convolutional⁹² and transformer⁹³ layers, have shown great flexibility and effectiveness. The top-performing methods often utilized these architectures, highlighting their potential for further advancements in the field. Notably, the methods did not take into account information coming from common parameters shared among trajectories or the underlying physical processes. Incorporating this knowledge could enhance the accuracy and robustness of the analyzes.

Nevertheless, significant challenges remain, and we hope the Challenge will help pave the way toward their resolution. In particular, we highlight two promising new avenues, which we believe may have a great impact on our understanding of the physics underlying biophysical processes.

First, the precision with which we can extract diffusion parameters remains fundamentally limited by current tracking algorithms, directly highlighting the third rationale for the Challenge. Notably, all participants in the Video Track relied on existing tracking techniques, subsequently applying the methods developed for the Trajectory Track to analyze the resulting trajectories. Despite the rapid advances in deep learning, none of the participants have yet leveraged these cutting-edge technologies to directly extract diffusive properties from video data. This missed opportunity could be attributed to several factors: the analysis technology may not yet be fully mature, the training processes might be too lengthy and complex, or the computational resources and time required could be prohibitively high. We foresee that as these bottlenecks are addressed, a new generation of methods will emerge, capable of bypassing the tracking step altogether and setting new standards of accuracy.

Second, in the Ensemble Task, all participants relied on post-processing of single-trajectory outputs. Features were first extracted from individual trajectories, and then a separate step was used to infer the parameters of the diffusive populations. No team developed dedicated ensemble-level algorithms or used established ensemble frameworks.

Although this single-trajectory-based approach produced high Challenge rankings, it offered limited biophysical insight due to the proliferation of predicted states and the instability of each mode’s variance. Minimizing the Wasserstein-1 (W₁) distance aligns predicted and ground-truth distributions, but W₁ offers no penalty for over-splitting into numerous states or for unstable variance estimates, nor does it encourage physically interpretable solutions (e.g., filtering overlapping modes or very low-population segments). This warns us that outputs should not be blindly trusted when applied to real experiments. Care should always be taken not to overfit the data with too many states that cannot be assigned to a biophysical process. Whenever an analysis yields a large number of states, their identities should be validated through control experiments. In practice, a priori biological knowledge often narrows the expected state count, providing essential context for interpreting algorithmic results.

Looking ahead, methods capable of inferring population distributions directly from the raw ensemble of trajectories, thereby bypassing single-trajectory feature extraction and clustering, may deliver deeper physical insights. Moreover, approaches that treat the full set of trajectories contextually, rather than in isolation, are likely to enhance both performance and interpretability.

To encourage further development of methods addressing these issues, as well as those aligned with approaches used throughout the challenge, we have made the labeled dataset discussed in this work publicly available on Zenodo⁹⁴. This resource allows researchers to benchmark new methods in a standardized manner, while also providing the experimental biophysics community with a tool to better identify the methods best suited to their specific experimental scenarios.

Methods

Simulations of diffusion and interaction models

Trajectories are simulated according to a 2-dimensional fractional Brownian motion (FBM)⁵⁴. FBM is a continuous-time Gaussian process B_H(t) with stationary increments and a covariance function $E[{B}_{H}(t){B}_{H}(s)]=\frac{1}{2}(| t{| }^{2H}+| s{| }^{2H}-| t-s{| }^{2H})$, where H represents the Hurst exponent and is related to the anomalous diffusion exponent α as H = α/2⁵⁴. FBM features three regimes: one in which the increments are positively correlated (1/2 < H < 1, i.e., 1 < α < 2, superdiffusive); one in which the increments are negatively correlated (0 < H < 1/2, i.e., 0 < α < 1, subdiffusive); and one in which the increments are uncorrelated (H = 1/2, i.e., α = 1, diffusive Brownian motion).

The models included in the Challenge describe trajectories where diffusion properties are piecewise constant along segments of varying duration T_s and undergo sudden changes. To obtain a trajectory segment of length T_s with given anomalous diffusion exponent α and generalized diffusion coefficient K, a set of T_s − 1 displacements for each dimension is sampled from a fractional Gaussian noise generator⁹⁵. The displacements are then standardized to have variance σ² = 2KΔt, where Δt is the sampling time.

Simulations are performed considering particles diffusing in a square box of size L with reflecting boundary conditions. However, to avoid boundary effects, the fields of view used for the Challenge datasets correspond to a square region of size L_FOV ≪ L within the central part of the original box (Fig. 2b).

For Track 1, trajectory coordinates are used as sub-pixel localizations of individual particles to simulate movie frames as in single-molecule fluorescence experiments⁵. Each particle has a random intensity I_i that corresponds to the total number of photons collected by the detector. I_i is drawn from a uniform distribution in the interval $[{I}_{\min },{I}_{\max }]$ and fluctuates over time according to a normal distribution with mean I_i and standard deviation σ_I. Each particle is rendered as a diffraction-limited spot using an Airy disk as a point-spread function (PSF) with full width at half maximum FWHM_PSF = 2.1 px. A constant background of I_bg = 100 counts is added to each frame. Images are corrupted with Poisson noise.

For Track 2, trajectory coordinates are corrupted with noise from a Gaussian distribution with zero mean and standard deviation σ_N to take into account the finite localization precision obtained in tracking experiments. All simulated trajectories were generated without missing frames: no gaps were introduced, yielding continuous tracks to isolate segmentation performance from linking or gap-filling complexities.

All the models share a set of parameters required for the simulations that are described here. Model-specific parameters are defined when describing the details of the models in the following sections.

[K₁, K₂, …, K_n]: average values of the (Gaussian) distribution of the generalized diffusion coefficient for each of the n diffusive states considered in a given experiment, with support [10⁻¹², 10⁶] pixel²/frame^α.
$[{\sigma }_{{K}_{1}},{\sigma }_{{K}_{2}},\ldots,{\sigma }_{{K}_{n}}]$: standard deviations of the (Gaussian) distribution of the generalized diffusion coefficient for each of the n diffusive states considered in a given experiment. If not provided, the standard deviation is considered to be equal to 0 (i.e., the distribution is δ(K − K_i)).
[α₁, α₂, …, α_n]: average values of the (Gaussian) distribution of the anomalous diffusion exponent for each of the n diffusive states considered in a given experiment, with support (0, 2).
$[{\sigma }_{{\alpha }_{1}},{\sigma }_{{\alpha }_{2}},\ldots,{\sigma }_{{\alpha }_{n}}]$: standard deviations of the (Gaussian) distribution of the anomalous diffusion exponent for each of the n diffusive states considered in a given experiment. If not provided, the standard deviation is considered to be equal to 0 (i.e., the distribution is δ(α − α_i)).
L: size of the box in which trajectories are simulated with reflecting boundary conditions.
L_FOV: size of the box defining the FOV used for the Challenge datasets. The same particles can enter and exit the FOV over time but, for evaluation purposes, they will be considered as generating different trajectories.
Δt: sampling time at which the original motion of the particle is tracked. For the Challenge datasets, we consider Δt = 1.
T: duration of the recording over each FOV, given as the number of time steps Δt. It also corresponds to the maximum trajectory duration. For the Challenge, we set T = 200;
${T}_{\min }$: minimum duration of a trajectory to be included in the dataset. For the Challenge, we use T = 20;
I_bg (Track 1): background level of noise (counts) used in the simulation of videos.
FWHM_PSF (Track 1): full width at half maximum in pixels of the point-spread function used to render fluorescent particles.
I_tot (Track 1): mean value in counts of the total fluorescence collected for the detected particles.
σ_tot (Track 1): standard deviation in counts of the distribution of total fluorescence collected for the detected particles.
I_peak (Track 1): mean value in counts of the peak fluorescence collected for the detected particles. Can be calculated as ${I}_{{\rm{peak}}}={I}_{{\rm{tot}}}\frac{4\ln 2}{\pi {{\rm{FWHM}}}_{{\rm{PSF}}}^{2}}$
SNR (Track 1): typical signal-to-noise ratio of the movies, calculated as the average peak intensity over the standard deviation of the noise⁵¹ and thus equal to
$${\rm{SNR}}=\frac{{I}_{{\rm{peak}}}}{\sqrt{{I}_{{\rm{peak}}}+{I}_{{\rm{bg}}}}}.$$
(15)
σ_N (Track 2): standard deviation of the Gaussian localization noise used to corrupt trajectory coordinates.
${t}_{\min }$: minimum distance between changepoints, corresponding to the minimum amount of time that a particle spends in a state. Shorter segments are eliminated by smoothing the time trace of the state label using a majority filter with a window of 5 steps. For the Challenge, we set ${t}_{\min }=3$ frames to test the sensitivity and robustness of the segmentation methods under minimal data conditions.

A schematic representation of each of the models presented below is shown in Fig. 2a.

Model 1 - Single-state model (SSM)

This model simply corresponds to particles diffusing according to FBM with constant generalized diffusion coefficient K and anomalous diffusion exponent α. For each trajectory, a value of K and a value of α are sampled from the corresponding distribution. Data corresponding to these models are necessary to establish the false positive rate of the methods toward the detection of changes of diffusion properties.

Model 2 - Multi-state model (MSM)

The multi-state model is a Markov model describing particles undergoing FBM whose diffusion properties can change at random times. The number of states S is fixed for a given experiment, as are the parameters defining the distributions of K and α for each state. For each trajectory, S values of α and S values of K are sampled from the distribution of the corresponding states, i.e., one per state. At every time step, a diffusing particle has a given probability to undergo a change in one of its diffusive parameters (either α or K). The probability of switching is given by a transition matrix M. Namely, M_ij is the probability of switching from state i to state j at each time step. In the same sense, M_ii is the probability of remaining in state i. The residence time in a given state i can be directly calculated from the previous probability as

$${\tau }_{i}=\frac{1}{{\sum }_{j\ne i}{M}_{ij}}=\frac{1}{1-{M}_{ii}}.$$

(16)

Model 2 (MSM) parameters

M: transition matrix between diffusive states.

Model 3 - Dimerization (DIM)

This model considers the case in which dimerization, i.e., the transient binding of two particles, may occur and produce changes in the diffusion properties of both particles. In particular, we consider the case of N circular particles of radius r. For each trajectory, a value of α and a value of K are sampled from the corresponding distributions associated with the monomeric state. If two particles are at a distance d < 2r, then they have a probability P_b of binding. The two particles forming a dimer move with equal displacements, according to a generalized diffusion coefficient K and an anomalous diffusion exponent α drawn from the distributions associated with the dimeric state. At each time step, the dimer has a probability P_b of breaking its bond, freeing the two particles to go back to their original motion parameters. The particles cannot form any new dimers until taking a new step. Only dimers are allowed, and subsequent hits with other particles will not affect either the particles or the dimers.

Model 3 (DIM) parameters

N: number of diffusing particles in the box of size L.
r: interaction radius, corresponding to the radius of the diffusing particles.
P_b: probability that two particles bind to form a dimer in each time step. For this to happen, the particles must be at a distance d < 2r.
P_u: probability that a dimer breaks up at each time step so that the two particles go back to diffusing independently.

Model 4 - Transient-confinement model (TCM)

This model considers an environment with N_c circular compartments of radius r_c. The compartments are distributed randomly throughout the environment such that they do not overlap. We consider that the compartments are osmotic, i.e., a particle reaching their boundary from the exterior has a probability 1 of entering them, but a particle reaching the boundary from the interior of a compartment has a probability T of exiting it (and 1— T of being reflected back to the interior of the compartment). The diffusion inside and outside the compartment is different, hence defining two diffusive states. For each trajectory, two values of α and two values of K are sampled from the corresponding distributions, representing the motion outside and inside the compartments.

Model 4 (TCM) parameters

N_c: number of compartments in the box of size L.
r_c: radius of the compartments.
T: transmittance of the boundary. Probability that a particle reaching the boundary from inside the compartment exits the compartment.

Model 5 - Quenched-trap model (QTM)

This model considers the diffusion of particles in an environment with N_t immobile traps of radius r_t. The values of α and K are sampled for each trajectory from the corresponding distributions and define its unrestrained motion. A particle that enters the domain defined by a trap has a probability P_b of binding to the trap and, hence, getting temporarily immobilized (K = 0, α = 0). At each time step, a trapped particle has a probability P_u of unbinding and being released from the trap, going back to its unrestrained motion. A particle cannot be trapped again until taking a new step.

Model 5 (QTM) parameters

N_t: number of traps in the box of size L.
r_t: radius of the traps.
P_b: probability that a particle binds to a trap and gets immobilized. For that to happen, a particle must be at a distance d < r_t from the trap.
P_u: probability that a trapped particle unbinds from a trap and starts diffusing independently at each time δt.

Dataset structure

The datasets used in the Challenge (Supplementary Fig. 9) include different experiments, each contained in a folder labeled with a sequential number (EXP_[exp number]) and corresponding to a specific model and a fixed set of parameters. The information about the model and the parameters is unknown to Challenge participants. Each experiment folder contains a list of files labeled with a sequential number (FOV_[fov number]) associated with 30 FOVs. Each FOV reports data from a variable number of particles diffusing on a 128 × 128 pixel² area.

For the Video Track, the coordinates of the particles in the same FOV are used to generate 200-frame videos as a series of 8-bit images in the multi-tiff format using Deeptrack 2.1⁵. Noise is added to the synthetic images to account for background fluorescence and shot noise. A map corresponding to the segmentation of VIP particles at the first frame for which CPs and diffusion parameters must be detected is also provided as a TIFF file. Connected components of the map are labeled with unique integer values that correspond to the particle index.

For the Trajectory Track, we provide a CSV file for each FOV with a table whose columns contain trajectory index, time step, x-coordinate, and y-coordinate. Coordinates of simulated trajectories are corrupted with Gaussian noise corresponding to finite (subpixel) localization precision. The trajectories have a maximum length of 200 frames.

Besides localization precision, motion blur can introduce a significant contribution to noise, in particular if the camera frame rate is slow compared to particle motion⁹⁶. However, this aspect will not be included in the Challenge datasets since it would introduce complexities in the definition of the ground truth that could detract from the focus of the work. Nevertheless, the simulation software incorporates the capability to introduce the effect of motion blur both in videos and trajectories.

Exemplary data for all the models are shown in Supplementary Fig. 10. Files in different Tracks labeled with the same experiment and the FOV index (e.g., Track_1/EXP_4/FOV_3.tiff and Track_2/EXP_4/FOV_3.csv) include simulations obtained with the same set of dynamics parameters but do not correspond to the motion of the same set of particles.

Protocol registration

The Stage 1 protocol for this Registered Report was accepted in principle on 31st October 2023. The protocol, as accepted by the journal, can be found at https://doi.org/10.6084/m9.figshare.24771687.v1.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The labeled benchmark dataset used in this study is available on Zenodo⁹⁴. All datasets generated for the Challenge can be accessed on the Codalab platform (registration required). Source data for all figures are provided with this paper. Source data are provided with this paper.

Code availability

All code used to generate the Challenge datasets is publicly available via the andi_datasets repository on GitHub: https://github.com/AnDiChallenge/andi_datasets⁷¹.

References

Shen, H. et al. Single particle tracking: from theory to biophysical applications. Chem. Rev. 117, 7331–7376 (2017).
Article CAS PubMed Google Scholar
Manzo, C. & Garcia-Parajo, M. F. A review of progress in single particle tracking: from methods to biophysical insights. Rep. Prog. Phys. 78, 124601 (2015).
Article PubMed Google Scholar
Chen, Y., Lagerholm, B. C., Yang, B. & Jacobson, K. Methods to measure the lateral diffusion of membrane lipids and proteins. Methods 39, 147–153 (2006).
Article PubMed Google Scholar
Norregaard, K., Metzler, R., Ritter, C. M., Berg-Sørensen, K. & Oddershede, L. B. Manipulation and motion of organelles and single molecules in living cells. Chem. Rev. 117, 4342–4375 (2017).
Article CAS PubMed Google Scholar
Midtvedt, B. et al. Quantitative digital microscopy with deep learning. Appl. Phys. Rev. 8, 011310 (2021).
Article CAS Google Scholar
Torreno-Pina, J. A., Manzo, C. & Garcia-Parajo, M. F. Uncovering homo-and hetero-interactions on the cell membrane using single particle tracking approaches. J. Phys. D Appl. Phys. 49, 104002 (2016).
Article Google Scholar
Einstein, A. Über die von der molekularkinetischen Theorie der Wärme geforderte Bewegung von in ruhenden Flüssigkeiten suspendierten Teilchen. Ann. Der Phys. 322, 549–560 (1905).
Article Google Scholar
Weiss, M., Hashimoto, H. & Nilsson, T. Anomalous protein diffusion in living cells as seen by fluorescence correlation spectroscopy. Biophys. J. 84, 4043–4052 (2003).
Article CAS PubMed PubMed Central Google Scholar
Ritchie, K., Iino, R., Fujiwara, T., Murase, K. & Kusumi, A. The fence and picket structure of the plasma membrane of live cells as revealed by single molecule techniques. Mol. Membr. Biol. 20, 13–18 (2003).
Article CAS PubMed Google Scholar
Daumas, F. et al. Confined diffusion without fences of a G-protein-coupled receptor as revealed by single particle tracking. Biophys. J. 84, 356–366 (2003).
Article CAS PubMed PubMed Central Google Scholar
Ritchie, K. et al. Detection of non-Brownian diffusion in the cell membrane in single molecule tracking. Biophys. J. 88, 2266–2277 (2005).
Article CAS PubMed Google Scholar
Banks, D. S. & Fradin, C. Anomalous diffusion of proteins due to molecular crowding. Biophys. J. 89, 2960–2971 (2005).
Article CAS PubMed PubMed Central Google Scholar
Saxton, M. J. A biological interpretation of transient anomalous subdiffusion. I. qualitative model. Biophys. J. 92, 1178–1191 (2007).
Article CAS PubMed Google Scholar
Eggeling, C. et al. Direct observation of the nanoscale dynamics of membrane lipids in a living cell. Nature 457, 1159–1162 (2009).
Article CAS PubMed Google Scholar
Manzo, C., van Zanten, T. S. & Garcia-Parajo, M. F. Nanoscale fluorescence correlation spectroscopy on intact living cell membranes with NSOM probes. Biophys. J. 100, L8–L10 (2011).
Article CAS PubMed PubMed Central Google Scholar
Soula, H., Caré, B., Beslon, G. & Berry, H. Anomalous versus slowed-down Brownian diffusion in the ligand-binding equilibrium. Biophys. J. 105, 2064–2073 (2013).
Article CAS PubMed PubMed Central Google Scholar
Spillane, K. M. et al. High-speed single-particle tracking of GM1 in model membranes reveals anomalous diffusion due to interleaflet coupling and molecular pinning. Nano Lett. 14, 5390–5397 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mosqueira, A., Camino, P. A. & Barrantes, F. J. Antibody-induced crosslinking and cholesterol-sensitive, anomalous diffusion of nicotinic acetylcholine receptors. J. Neurochem. 152, 663–674 (2020).
Article CAS PubMed Google Scholar
Chai, Y.-J., Cheng, C.-Y., Liao, Y.-H., Lin, C.-H. & Hsieh, C.-L. Heterogeneous nanoscopic lipid diffusion in the live cell membrane and its dependency on cholesterol. Biophys. J. 121, 3146–3161 (2022).
Article CAS PubMed PubMed Central Google Scholar
Höfling, F. & Franosch, T. Anomalous transport in the crowded world of biological cells. Rep. Prog. Phys. 76, 046602 (2013).
Article MathSciNet PubMed Google Scholar
Metzler, R., Jeon, J.-H., Cherstvy, A. G. & Barkai, E. Anomalous diffusion models and their properties: non-stationarity, non-ergodicity, and ageing at the centenary of single particle tracking. Phys. Chem. Chem. Phys. 16, 24128–24164 (2014).
Article CAS PubMed Google Scholar
Krapf, D. Mechanisms underlying anomalous diffusion in the plasma membrane. Curr. Top. Membr. 75, 167–207 (2015).
Article CAS PubMed Google Scholar
Magdziarz, M., Weron, A., Burnecki, K. & Klafter, J. Fractional Brownian motion versus the continuous-time random walk: a simple test for subdiffusive dynamics. Phys. Rev. Lett. 103, 180602 (2009).
Article PubMed Google Scholar
Kepten, E., Bronshtein, I. & Garini, Y. Ergodicity convergence test suggests telomere motion obeys fractional dynamics. Phys. Rev. E 83, 041919 (2011).
Article CAS Google Scholar
Bronshtein, I. et al. Loss of lamin A function increases chromatin dynamics in the nuclear interior. Nat. Commun. 6, 8044 (2015).
Article CAS PubMed Google Scholar
Weber, S. C., Spakowitz, A. J. & Theriot, J. A. Bacterial chromosomal loci move subdiffusively through a viscoelastic cytoplasm. Phys. Rev. Lett. 104, 238102 (2010).
Article PubMed PubMed Central Google Scholar
Tabei, S. A. et al. Intracellular transport of insulin granules is a subordinated random walk. Proc. Natl. Acad. Sci. USA 110, 4911–4916 (2013).
Article PubMed PubMed Central Google Scholar
Lampo, T. J., Stylianidou, S., Backlund, M. P., Wiggins, P. A. & Spakowitz, A. J. Cytoplasmic RNA-protein particles exhibit non-gaussian subdiffusive behavior. Biophys. J. 112, 532–542 (2017).
Article CAS PubMed PubMed Central Google Scholar
Weigel, A. V., Simon, B., Tamkun, M. M. & Krapf, D. Ergodic and nonergodic processes coexist in the plasma membrane as observed by single-molecule tracking. Proc. Natl. Acad. Sci. USA 108, 6438–6443 (2011).
Article CAS PubMed PubMed Central Google Scholar
Manzo, C. et al. Weak ergodicity breaking of receptor motion in living cells stemming from random diffusivity. Phys. Rev. X 5, 011021 (2015).
Google Scholar
Smith, P. R., Morrison, I. E., Wilson, K. M., Fernandez, N. & Cherry, R. J. Anomalous diffusion of major histocompatibility complex class i molecules on HeLa cells determined by single particle tracking. Biophys. J. 76, 3331–3344 (1999).
Article CAS PubMed PubMed Central Google Scholar
Song, M. S., Moon, H. C., Jeon, J.-H. & Park, H. Y. Neuronal messenger ribonucleoprotein transport follows an aging Lévy walk. Nat. Commun. 9, 1–8 (2018).
Google Scholar
Krapf, D. et al. Spectral content of a single non-Brownian trajectory. Phys. Rev. X 9, 011019 (2019).
CAS Google Scholar
Sposini, V. et al. Towards a robust criterion of anomalous diffusion. Commun. Phys. 5, 305 (2022).
Article Google Scholar
Granik, N. et al. Single-particle diffusion characterization by deep learning. Biophys. J. 117, 185–192 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kowalek, P., Loch-Olszewska, H. & Szwabiński, J. Classification of diffusion modes in single-particle tracking data: feature-based versus deep-learning approach. Phys. Rev. E 100, 032410 (2019).
Article CAS PubMed Google Scholar
Bo, S., Schmidt, F., Eichhorn, R. & Volpe, G. Measurement of anomalous diffusion using recurrent neural networks. Phys. Rev. E 100, 010102 (2019).
Article CAS PubMed Google Scholar
Muñoz-Gil, G., Garcia-March, M. A., Manzo, C., Martín-Guerrero, J. D. & Lewenstein, M. Single trajectory characterization via machine learning. N. J. Phys. 22, 013010 (2020).
Article MathSciNet Google Scholar
Seckler, H. & Metzler, R. Bayesian deep learning for error estimation in the analysis of anomalous diffusion. Nat. Commun. 13, 6717 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pineda, J. et al. Geometric deep learning reveals the spatiotemporal features of microscopic motion. Nat. Mach. Intell. 5, 71–82 (2023).
Article Google Scholar
Seckler, H., Szwabinński, J. & Metzler, R. Machine-learning solutions for the analysis of single-particle diffusion trajectories. J. Phys. Chem. Lett. 14, 7910–7923 (2023).
Article CAS PubMed Google Scholar
Muñoz-Gil, G. et al. Objective comparison of methods to decode anomalous diffusion. Nat. Commun. 12, 6253 (2021).
Article PubMed PubMed Central Google Scholar
Janczura, J. et al. Identifying heterogeneous diffusion states in the cytoplasm by a hidden Markov model. N. J. Phys. 23, 053018 (2021).
Article Google Scholar
Lee, G. et al. Myosin-driven actin-microtubule networks exhibit self-organized contractile dynamics. Sci. Adv. 7, eabe4334 (2021).
Article CAS PubMed PubMed Central Google Scholar
Requena, B. et al. Inferring pointwise diffusion properties of single trajectories with deep learning. Biophys. J. 122, 4360–4369 (2023).
Article CAS PubMed PubMed Central Google Scholar
Han, D. et al. Deciphering anomalous heterogeneous intracellular transport with neural networks. eLife 9, e52224 (2020).
Article PubMed PubMed Central Google Scholar
Wang, W., Seno, F., Sokolov, I. M., Chechkin, A. V. & Metzler, R. Unexpected crossovers in correlated random-diffusivity processes. N. J. Phys. 22, 083041 (2020).
Article MathSciNet Google Scholar
Balcerek, M., Burnecki, K., Thapa, S., Wyłomańska, A. & Chechkin, A. Fractional Brownian motion with random Hurst exponent: accelerating diffusion and persistence transitions. Chaos 32, 093114 (2022).
Wang, W. et al. Memory-multi-fractional Brownian motion with continuous correlations. Phys. Rev. Res. 5, L032025 (2023).
Article CAS Google Scholar
Ślezak, J. & Metzler, R. Minimal model of diffusion with time changing Hurst exponent. J. Phys. A Math. Theor. 56, 35LT01 (2023).
Article MathSciNet Google Scholar
Chenouard, N. et al. Objective comparison of particle tracking methods. Nat. Methods 11, 281–289 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bohnslav, J. P. et al. Deepethogram, a machine learning pipeline for supervised behavior classification from raw pixels. eLife 10, e63377 (2021).
Article CAS PubMed PubMed Central Google Scholar
Midtvedt, B. et al. Fast and accurate nanoparticle characterization using deep-learning-enhanced off-axis holography. ACS Nano 15, 2240–2250 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mandelbrot, B. B. & Van Ness, J. W. Fractional Brownian motions, fractional noises and applications. SIAM Rev. 10, 422–437 (1968).
Article MathSciNet Google Scholar
Weiss, M. Single-particle tracking data reveal anticorrelated fractional Brownian motion in crowded fluids. Phys. Rev. E 88, 010101 (2013).
Article Google Scholar
Golding, I. & Cox, E. C. Physical nature of bacterial cytoplasm. Phys. Rev. Lett. 96, 098102 (2006).
Article PubMed Google Scholar
Dupont, A. et al. Three-dimensional single-particle tracking in live cells: news from the third dimension. N. J. Phys. 15, 075008 (2013).
Article CAS Google Scholar
Sukhorukov, V. M. & Bereiter-Hahn, J. Anomalous diffusion induced by cristae geometry in the inner mitochondrial membrane. PLoS ONE 4, e4604 (2009).
Article PubMed PubMed Central Google Scholar
Speidel, M., Jonáš, A. & Florin, E.-L. Three-dimensional tracking of fluorescent nanoparticles with subnanometer precision by use of off-focus imaging. Opt. Lett. 28, 69–71 (2003).
Article CAS PubMed Google Scholar
Maurice, L. & Bilenca, A. Three-dimensional single particle tracking using 4π self-interference of temporally phase-shifted fluorescence. Light Sci. Appl. 12, 58 (2023).
Article CAS PubMed PubMed Central Google Scholar
Toprak, E., Balci, H., Blehm, B. H. & Selvin, P. R. Three-dimensional particle tracking via bifocal imaging. Nano Lett. 7, 2043–2045 (2007).
Article CAS PubMed Google Scholar
Holtzer, L., Meckel, T. & Schmidt, T. Nanometric three-dimensional tracking of individual quantum dots in cells. Appl. Phys. Lett. 90, 053902 (2007).
Andreao, R. V., Dorizzi, B. & Boudy, J. ECG signal analysis through hidden Markov models. IEEE Trans. Biomed. Eng. 53, 1541–1549 (2006).
Article PubMed Google Scholar
Khanagha, V., Daoudi, K., Pont, O. & Yahia, H. Phonetic segmentation of speech signal using local singularity analysis. Digital Signal Process. 35, 86–94 (2014).
Article Google Scholar
Cetin, M. & Comert, G. Short-term traffic flow prediction with regime switching models. Transport. Res. Rec. 1965, 23–31 (2006).
Article Google Scholar
Bulla, J. & Berzel, A. Computational issues in parameter estimation for stationary hidden Markov models. Comput. Stat. 23, 1–18 (2008).
Article MathSciNet Google Scholar
Janczura, J. & Weron, R. Goodness-of-fit testing for the marginal distribution of regime-switching models with an application to electricity spot prices. AStA Adv. Stat. Anal. 97, 239–270 (2013).
Article MathSciNet Google Scholar
Lux, T. & Morales-Arias, L. Forecasting volatility under fractality, regime-switching, long memory and student-T innovations. Comput. Stat. Data Anal. 54, 2676–2692 (2010).
Article MathSciNet Google Scholar
Edelhoff, H., Signer, J. & Balkenhol, N. Path segmentation for beginners: an overview of current methods for detecting changes in animal movement patterns. Mov. Ecol. 4, 1–21 (2016).
Article Google Scholar
Vasas, K., Elek, P. & Márkus, L. A two-state regime switching autoregressive model with an application to river flow analysis. J. Stat. Plan. Inference 137, 3113–3126 (2007).
Article MathSciNet Google Scholar
Muñoz-Gil, G. et al. AnDiChallenge/andi_datasets: AnDi Challenge 2 https://doi.org/10.5281/zenodo.10259556 (2023).
Honigmann, A. et al. Scanning STED-FCS reveals spatiotemporal heterogeneity of lipid interaction in the plasma membrane of living cells. Nat. Commun. 5, 1–12 (2014).
Article Google Scholar
Mainali, D. & Smith, E. A. The effect of ligand affinity on integrins’ lateral diffusion in cultured cells. Eur. Biophys. J. 42, 281–290 (2013).
Article CAS PubMed Google Scholar
Yanagawa, M. et al. Single-molecule diffusion-based estimation of ligand effects on G protein–coupled receptors. Sci. Signal. 11, eaao1917 (2018).
Article PubMed Google Scholar
da Rocha-Azevedo, B. et al. Heterogeneity in VEGF receptor-2 mobility and organization on the endothelial cell surface leads to diverse models of activation by VEGF. Cell Rep. 32, 108187 (2020).
Achimovich, A. M., Yan, T. & Gahlmann, A. Dimerization of iLID optogenetic proteins observed using 3D single-molecule tracking in live E. coli. Biophys. J. 122, 3254–3267 (2023).
Article CAS PubMed PubMed Central Google Scholar
Low-Nam, S. T. et al. ErbB1 dimerization is promoted by domain co-confinement and stabilized by ligand binding. Nat. Struct. Mol. Biol. 18, 1244–1249 (2011).
Article CAS PubMed PubMed Central Google Scholar
Valley, C. C. et al. Enhanced dimerization drives ligand-independent activity of mutant epidermal growth factor receptor in lung cancer. Mol. Biol. Cell 26, 4087–4099 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tabor, A. et al. Visualization and ligand-induced modulation of dopamine receptor dimerization at the single molecule level. Sci. Rep. 6, 1–16 (2016).
Article Google Scholar
Sungkaworn, T. et al. Single-molecule imaging reveals receptor-G protein interactions at cell surface hot spots. Nature 550, 543–547 (2017).
Article CAS PubMed Google Scholar
Grimes, J. et al. Plasma membrane preassociation drives β-arrestin coupling to receptors and activation. Cell 186, 2238–2255 (2023).
Weigel, A. V., Tamkun, M. M. & Krapf, D. Quantifying the dynamic interactions between a clathrin-coated pit and cargo molecules. Proc. Natl. Acad. Sci. USA 110, E4591–E4600 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sadegh, S., Higgins, J. L., Mannion, P. C., Tamkun, M. M. & Krapf, D. Plasma membrane is compartmentalized by a self-similar cortical actin meshwork. Phys. Rev. X 7, 011031 (2017).
PubMed PubMed Central Google Scholar
Rossier, O. et al. Integrins β1 and β3 exhibit distinct dynamic nanoscale organizations inside focal adhesions. Nat. Cell Biol. 14, 1057–1067 (2012).
Article CAS PubMed Google Scholar
Crouse, D. F. On implementing 2D rectangular assignment algorithms. IEEE Trans. Aerosp. Electron. Syst. 52, 1679–1696 (2016).
Article Google Scholar
Asghar, S., Ni, R. & Volpe, G. U-net 3+ for anomalous diffusion analysis enhanced with mixture estimates (U-AnD-Me) in particle-tracking data. arXiv preprint arXiv:2502.19253 https://arxiv.org/abs/2502.19253 (2025).
Huang, H. et al. Unet 3+: A Full-scale Connected Unet For Medical Image Segmentation, 1055–1059 (IEEE, 2020).
Crocker, J. C. & Grier, D. G. Methods of digital video microscopy for colloidal studies. J. Colloid Interface Sci. 179, 298–310 (1996).
Article CAS Google Scholar
Tinevez, J.-Y. et al. TrackMate: an open and extensible platform for single-particle tracking. Methods 115, 80–90 (2017).
Article CAS PubMed Google Scholar
Ershov, D. et al. TrackMate 7: integrating state-of-the-art segmentation algorithms into tracking pipelines. Nat. Methods 19, 829–832 (2022).
Article CAS PubMed Google Scholar
Allan, D. B., Caswell, T., Keim, N. C., van der Wel, C. M. & Verweij, R. W. Trackpy: fast, flexible particle-tracking toolkit https://doi.org/10.5281/zenodo.12708864 (2024).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inform. Process. Syst. 25, 84–90 (2012).
Vaswani, A. & Guyon, I. et al. (eds) Attention is all you need. (eds Guyon, I. et al.) Advances in Neural Information Processing Systems, Vol. 30 (Curran Associates, Inc., 2017). https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
Muñoz-Gil, G. et al. AnDi 2 Benchmark dataset https://doi.org/10.5281/zenodo.14281478 (2024).
Stochastic Python package. https://stochastic.readthedocs.io/en/stable/.
Berglund, A. J. Statistics of camera-based single-particle tracking. Phys. Rev. E 82, 011917 (2010).
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the other participants of the 2nd AnDi Challenge: Thomas Martynec, Sarah A.M. Loos; Maxime Lavaud, Juliette Lacherez, Yosef Shokeeb, Yacine Amarouchene, Thomas Salez; Roman Lavrynenko, Lyudmyla Kirichenko, Sophia Lavrynenko; Taegeun Song, Seunghee Han, Jaehyun Jeong, Jihye Kim; Farzaneh Nazari, Mohammad Mehdi Nazari; Janusz Szwabiński, Jakub Malinowski, Marcin Kostrzewa, Michał Balcerek, Weronika Tomczuk; Alvaro Lanza, Stefano Bo; Raffaele Pastore, Francesco Rusciano, Maurizio De Micco, Pier Luca Maffettone, Francesco Greco. The organizers of the 2nd AnDi Challenge acknowledge the CHAIR (Chalmers AI Research Center) Research Area AISDA (AI for Scientific Data Analysis) and the EUTOPIA Connected Community on BioImaging for sponsoring the final workshop, and thank Agnese Callegari for her invaluable help with its organization. G.M-G. acknowledges support from the European Union. S.A. and Gior.V. are grateful for the studentship funded by the A*STAR-UCL Research Attachment Program through the EPSRC M3S CDT (EP/L015862/1). Gior.V. acknowledges support for this work by The Chan Zuckerberg Initiative, “Multi-color single molecule tracking with lifetime imaging” (2023-321188). R.N. acknowledges support from the Academic Research Fund from the Singapore Ministry of Education (RG151/23 and MOE2019-T2-2-010) and the National Research Foundation, Singapore, under its 29th Competitive Research Program (CRP) Call (NRF-CRP29-2022-0002). Z.H. acknowledges the support from the National Natural Science Foundation of China (Grant No. 12104147) and the Fundamental Research Funds for the Central Universities. X.F. and Y.Z. Acknowledge grants from the National Natural Science Foundation of China (grant nos. 62031023 and 62331011). J.A.C. is supported by the European Union—NextGenerationEU, ANDHI project CPP2021-008994 and PID2021-124618NB-C21, by MCIN/AEI/10.13039/501100011033, and by “ERDF A way of making Europe”, from the European Union. J.K.H., S.W.B.B., and N.S.H. acknowledge the Novo Nordisk Foundation Challenge Center for Optimized Oligo escape (NNF23OC0081287). H.J. and J.B. acknowledge support from the Basic Science Research Program through the National Research Foundation of Korea (RS-2025-00514776). R.H. acknowledges the Medical Sciences Doctoral Training Centre, University of Oxford for financial support. M.L., G.F-F. and B.R. acknowledge support from: European Research Council AdG NOQIA; MCIN/AEI (PGC2018-0910.13039/501100011033, CEX2019-000910-S/10.13039/501100011033, Plan National FIDEUA PID2019-106901GB-I00, Plan National STAMEENA PID2022-139099NB, I00, project funded by MCIN/AEI/10.13039/501100011033 and by the “European Union NextGenerationEU/PRTR” (PRTR-C17.I1), FPI); QUANTERA DYNAMITE PCI2022-132919, QuantERA II Program co-funded by European Union’s Horizon 2020 program under Grant Agreement No. 101017733; Ministry for Digital Transformation and of Civil Service of the Spanish Government through the QUANTUM ENIA project call - Quantum Spain project, and by the European Union through the Recovery, Transformation and Resilience Plan - NextGenerationEU within the framework of the Digital Spain 2026 Agenda; Fundació Cellex; Fundació Mir-Puig; Generalitat de Catalunya (European Social Fund FEDER and CERCA program); Barcelona Supercomputing Center MareNostrum (FI-2023-3-0024); (HORIZON-CL4-2022-QUANTUM-02-SGA PASQuanS2.1, 101113690, EU Horizon 2020 FET-OPEN OPTOlogic, Grant No. 899794, QU-ATTO, 101168628), EU Horizon Europe Program (This project has received funding from the European Union’s Horizon Europe research and innovation program under grant agreement No. 101080086 NeQST); ICFO Internal “QuantumGaudi” project; Funded by the European Union. Views and opinions expressed are, however, those of the author(s) only and do not necessarily reflect those of the European Union, European Commission, European Climate, Infrastructure and Environment Executive Agency (CINEA), or any other granting authority. Neither the European Union nor any granting authority can be held responsible for them. R.M. acknowledges DFG grants ME 1535/16-1 and 1535/22-1. D.K. acknowledges funding from the National Science Foundation grant 2102832. Giov.V. acknowledges support from the Horizon Europe ERC Consolidator Grant MAPEI (grant number 101001267) and the Knut and Alice Wallenberg Foundation (grant number 2019.0079). C.M. acknowledges support through grant RYC-2015-17896 funded by MCIN/AEI/10.13039/501100011033 and “ESF Investing in your future”, grants BFU2017-85693-R and PID2021-125386NB-I00 funded by MCIN/AEI/10.13039/501100011033/ and “ERDF A way of making Europe”.

Funding

Open access funding provided by University of Gothenburg.

Author information

Authors and Affiliations

Institute for Theoretical Physics, University of Innsbruck, Innsbruck, Austria
Gorka Muñoz-Gil
Department of Physics, University of Gothenburg, Origovägen 6B, SE-41296, Gothenburg, Sweden
Harshith Bachimanchi, Jesús Pineda, Benjamin Midtvedt & Giovanni Volpe
ICFO – Institut de Ciències Fotòniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860, Castelldefels (Barcelona), Spain
Gabriel Fernández-Fernández, Borja Requena & Maciej Lewenstein
Instituto Universitario de Matemática Pura y Aplicada, Universitat Politècnica de València, València, Spain
Yusef Ahsini & J. Alberto Conejero
Department of Chemistry, University College London, 20 Gordon Street, London, WC1H 0AJ, UK
Solomon Asghar & Giorgio Volpe
Department of Physics, Korea Advanced Institute of Science and Technology, Daejeon, Korea
Jaeyong Bae & Hawoong Jeong
Molecular Neurobiology Division, BIOMED UCA-CONICET, Buenos Aires, Argentina
Francisco J. Barrantes & Lucas A. Saavedra
Department of Chemistry, University of Copenhagen, Copenhagen, Denmark
Steen W. B. Bender, Nikos S. Hatzakis & Jacob Kæstel-Hansen
Novo Nordisk Center for Optimised Oligo Escape and Control of Disease, University of Copenhagen, Copenhagen, Denmark
Steen W. B. Bender, Nikos S. Hatzakis & Jacob Kæstel-Hansen
Institut Langevin, ESPCI Paris, Université PSL, CNRS, Paris, France
Clément Cabriel & Ignacio Izeddin
Centro de Investigación en Gestión e Ingeniería de Producción, Universitat Politècnica de València, València, Spain
Marc Escoto
School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Xiaochen Feng, Yuan Jiang, Hao Sha & Yongbing Zhang
Gene Machines Group, Clarendon Laboratory, Department of Physics, University of Oxford, Oxford, UK
Rasched Haidari
Kavli Institute of Nanoscience Discovery, University of Oxford, Oxford, UK
Rasched Haidari
School of Physics and Electronics, Hunan University, Changsha, China
Zihan Huang & Xiang Qu
Center of Complex Systems, Korea Advanced Institute of Science and Technology, Daejeon, Korea
Hawoong Jeong
Laboratory of Computational Quantitative and Synthetic Biology (CQSB), Sorbonne Université, CNRS, Paris, France
Judith Miné-Hattab, Junwoo Park & Nataliya Sokolovska
School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
Ran Ni
ICREA, Pg. Lluís Companys 23, 08010, Barcelona, Spain
Maciej Lewenstein
Institute for Physics & Astronomy, University of Potsdam, Potsdam-Golm, Germany
Ralf Metzler
Asia Pacific Centre for Theoretical Physics, Pohang, Republic of Korea
Ralf Metzler
Department of Electrical and Computer Engineering and School of Biomedical Engineering, Colorado State University, Fort Collins, CO, USA
Diego Krapf
Science for Life Laboratory, Physics Department, University of Gothenburg, Origovägen 6B, SE-41296, Gothenburg, Sweden
Giovanni Volpe
Facultat de Ciències, Tecnologia i Enginyeries, Universitat de Vic—Universitat Central de Catalunya (UVic-UCC), C. de la Laura, 13, 08500, Vic, Spain
Carlo Manzo
Bioinformatics and Bioimaging, Institut de Recerca i Innovació en Ciències de la Vida i de la Salut a la Catalunya Central (IRIS-CC), 08500, Vic, Spain
Carlo Manzo

Authors

Gorka Muñoz-Gil
View author publications
Search author on:PubMed Google Scholar
Harshith Bachimanchi
View author publications
Search author on:PubMed Google Scholar
Jesús Pineda
View author publications
Search author on:PubMed Google Scholar
Benjamin Midtvedt
View author publications
Search author on:PubMed Google Scholar
Gabriel Fernández-Fernández
View author publications
Search author on:PubMed Google Scholar
Borja Requena
View author publications
Search author on:PubMed Google Scholar
Yusef Ahsini
View author publications
Search author on:PubMed Google Scholar
Solomon Asghar
View author publications
Search author on:PubMed Google Scholar
Jaeyong Bae
View author publications
Search author on:PubMed Google Scholar
Francisco J. Barrantes
View author publications
Search author on:PubMed Google Scholar
Steen W. B. Bender
View author publications
Search author on:PubMed Google Scholar
Clément Cabriel
View author publications
Search author on:PubMed Google Scholar
J. Alberto Conejero
View author publications
Search author on:PubMed Google Scholar
Marc Escoto
View author publications
Search author on:PubMed Google Scholar
Xiaochen Feng
View author publications
Search author on:PubMed Google Scholar
Rasched Haidari
View author publications
Search author on:PubMed Google Scholar
Nikos S. Hatzakis
View author publications
Search author on:PubMed Google Scholar
Zihan Huang
View author publications
Search author on:PubMed Google Scholar
Ignacio Izeddin
View author publications
Search author on:PubMed Google Scholar
Hawoong Jeong
View author publications
Search author on:PubMed Google Scholar
Yuan Jiang
View author publications
Search author on:PubMed Google Scholar
Jacob Kæstel-Hansen
View author publications
Search author on:PubMed Google Scholar
Judith Miné-Hattab
View author publications
Search author on:PubMed Google Scholar
Ran Ni
View author publications
Search author on:PubMed Google Scholar
Junwoo Park
View author publications
Search author on:PubMed Google Scholar
Xiang Qu
View author publications
Search author on:PubMed Google Scholar
Lucas A. Saavedra
View author publications
Search author on:PubMed Google Scholar
Hao Sha
View author publications
Search author on:PubMed Google Scholar
Nataliya Sokolovska
View author publications
Search author on:PubMed Google Scholar
Yongbing Zhang
View author publications
Search author on:PubMed Google Scholar
Giorgio Volpe
View author publications
Search author on:PubMed Google Scholar
Maciej Lewenstein
View author publications
Search author on:PubMed Google Scholar
Ralf Metzler
View author publications
Search author on:PubMed Google Scholar
Diego Krapf
View author publications
Search author on:PubMed Google Scholar
Giovanni Volpe
View author publications
Search author on:PubMed Google Scholar
Carlo Manzo
View author publications
Search author on:PubMed Google Scholar

Contributions

G.M.-G., M.L., R.M., D.K., Giov.V. and C.M. conceived the study. G.M.-G., Giov.V. and C.M. organized the challenge and the corresponding workshop. G.M.-G., H.B., J.Pi. and B.M. designed and implemented the software for data generation. G.M.-G., J.Pi. and C.M. implemented the platform for scoring. G.M.-G. and C.M. analyzed the results. The methods discussed in the paper were designed, implemented, run, and described by the Challenge participants: G.F.-F., B.R., Y.A. S.A., J.B., F.J.B. S.W.B.B., C.C., J.A.C., M.E., X.F., R.H., N.S.H., Z.H., I.I., H.J., Y.J., J.K-H., J.M.-H., R.H., J.Pa., X.Q., L.A.S., H.S., N.S., Y.Z., Gior.V. The article was written by G.M.-G., M.L., R.M., D.K., Giov.V., and C.M. with input from all authors.

Corresponding authors

Correspondence to Gorka Muñoz-Gil, Giovanni Volpe or Carlo Manzo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Andreas Gahlmann and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Source data

Source Data files

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Muñoz-Gil, G., Bachimanchi, H., Pineda, J. et al. Quantitative evaluation of methods to analyze motion changes in single-particle experiments. Nat Commun 16, 6749 (2025). https://doi.org/10.1038/s41467-025-61949-x

Download citation

Received: 23 March 2023
Accepted: 07 July 2025
Published: 22 July 2025
DOI: https://doi.org/10.1038/s41467-025-61949-x

This article is cited by

Concurrent diffusion of nicotinic acetylcholine receptors and fluorescent cholesterol disclosed by two-colour sub-millisecond MINFLUX-based single-molecule tracking
- Francesco Reina
- Lucas A. Saavedra
- Francisco J. Barrantes
Nature Communications (2025)