Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 21;109(2):346-54.
doi: 10.1016/j.bpj.2015.05.038.

An information-based approach to change-point analysis with applications to biophysics and cell biology

Affiliations

An information-based approach to change-point analysis with applications to biophysics and cell biology

Paul A Wiggins. Biophys J. .

Abstract

This article describes the application of a change-point algorithm to the analysis of stochastic signals in biological systems whose underlying state dynamics consist of transitions between discrete states. Applications of this analysis include molecular-motor stepping, fluorophore bleaching, electrophysiology, particle and cell tracking, detection of copy number variation by sequencing, tethered-particle motion, etc. We present a unified approach to the analysis of processes whose noise can be modeled by Gaussian, Wiener, or Ornstein-Uhlenbeck processes. To fit the model, we exploit explicit, closed-form algebraic expressions for maximum-likelihood estimators of model parameters and estimated information loss of the generalized noise model, which can be computed extremely efficiently. We implement change-point detection using the frequentist information criterion (which, to our knowledge, is a new information criterion). The frequentist information criterion specifies a single, information-based statistical test that is free from ad hoc parameters and requires no prior probability distribution. We demonstrate this information-based approach in the analysis of simulated and experimental tethered-particle-motion data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
State model schematic. The state model signal is characterized by four model parameters that are written as the vector θ ≡ (k, ε, μ, α). Above, we schematically illustrate the role of each parameter in shaping the signal. The parameter k, state 1, parameterizes the standard deviation of the noise (σ = k−1/2). State 2 illustrates the effect of the finite lifetime of fluctuations in models with autoregression (0 < ε < 1). State 3 illustrates the role of the level mean μ. State 4 illustrates of the role of the level slope (α). To see this figure in color, go online.
Figure 2
Figure 2
Analysis of experimental TPM data: protein-induced DNA looping measured by TPM. (A) Position trace for the no-looping control. The y position of the bead shown for 1.5 × 104 frames. In the absence of protein-induced looping, only a single state is identified by change-point analysis, corresponding to the unlooped configuration. (B) Position trace for protein-induced DNA looping. The y position of the bead shown for 7.5 × 104 frames. Seventeen states were identified by change-point analysis. The trace is colored by state and the state number is shown above the trace. A representative example of the unlooped and looped state is shown. (C) High-resolution time trace. At t ≈ 5.98 × 104 frames, a high-time-resolution trace is shown, which reveals two short-lived states, states 12 and 14. The ER for each of these states is shown. The statistical evidence for state 12 is extremely strong whereas the evidence for state 14 is marginal. (D) Histogram of bead position by state. The histogram for all data and selected states is shown. States 6 and 11 are representative of the unlooped and looped states, respectively. Neither state is well approximated by a Gaussian distribution, as demonstrated by the flatness of the peak of the probability density functions. (E) Mean position and variance by state. The 95% confidence region is shown for each state. The states cluster into two clearly identifiable groups corresponding to the unlooped (2, 4, 6, 8, 10, and 16) and looped (1, 7, 11, 13, 15, and 17) states. In addition to these clusters, there are low-mobility and moderate-mobility states with mean positions offset from zero. The short-lived states with low mobility correspond to sticking events (12 and 14). (F) Variance and nearest-neighbor coupling by state. The 95% confidence region is shown for each state. Again, the states form clusters analogous to (E). For states 12 and 14, ε is approximately zero, consistent with bead sticking. To see this figure in color, go online.

Similar articles

Cited by

References

    1. Little M.A., Jones N.S. Generalized methods and solvers for noise removal from piecewise constant signals. I. Background theory. Proc. Math. Phys. Eng. Sci. 2011;467:3088–3114. - PMC - PubMed
    1. Page E.S. A test for a change in a parameter occurring at an unknown point. Biometrika. 1955;42:523–527.
    1. Page E.S. On problems in which a change in a parameter occurs at an unknown point. Biometrika. 1957;44:248–252.
    1. Chen J., Gupta A.K. On change point detection and estimation. Comm. Stat. Simul. Comput. 2007;30:665–697.
    1. Little M.A., Jones N.S. Generalized methods and solvers for noise removal from piecewise constant signals. II. New methods. Proc. Math. Phys. Eng. Sci. 2011;467:3115–3140. - PMC - PubMed

Publication types

LinkOut - more resources