Difference between revisions of "Estimating second order system parameters from noise power spectra using nonlinear regression"

From Course Wiki
Jump to: navigation, search
(Get ready …)
(Get ready …)
Line 28: Line 28:
  
 
==Get ready …==
 
==Get ready …==
[[File:Optical Trap Power Spectrum.png|thumb|right|Power spectrum of x-axis QPD voltage. 3.2 μm silica spheres trapped with 80 mW laser power. Sampling frequency was 50 MHz.]]
 
 
You need four things to run the regression:
 
You need four things to run the regression:
  
Line 37: Line 36:
  
 
===Power spectrum===
 
===Power spectrum===
 +
[[File:Optical Trap Power Spectrum.png|thumb|right|Power spectrum of x-axis QPD voltage. 3.2 μm silica spheres trapped with 80 mW laser power. Sampling frequency was 50 MHz.]]
 
The optical trap records a sequence of voltages from the QPD, sampled at frequency <math>f_s</math>. In the regression we plan to do, frequency, <math>f</math>, is the independent variable, and power, <math>P_{xx}</math> is the dependent variable. The time series must be transformed into a power spectrum. An example power spectrum from the optical trap is shown above on the right.  
 
The optical trap records a sequence of voltages from the QPD, sampled at frequency <math>f_s</math>. In the regression we plan to do, frequency, <math>f</math>, is the independent variable, and power, <math>P_{xx}</math> is the dependent variable. The time series must be transformed into a power spectrum. An example power spectrum from the optical trap is shown above on the right.  
  

Revision as of 13:52, 20 May 2015

20.309: Biological Instrumentation and Measurement

ImageBar 774.jpg


… the safe use of regression requires a good deal of thought and a good dose of skepticism

Overview

In the limits of detection lab, you will use regression to estimate the parameters of second order system models of the optical trap and AFM. MATLAB includes a powerful nonlinear regression function called nlinfit. Like all sharp tools, nlinfit must be used with extreme care. Regression is a delicate business.

Many common regression missteps spawn subtle errors rather than catastrophic failures. Because of this, bad regressions easily masquerade as good ones — hiding their insidious flaws behind a lovely, plausible pair of overlapping curves. This page illustrates several flawed regressions with the hope of helping you navigate the concealed hazards of fitting power spectra.

Remain vigilant at all times when you regress.

Regression review

Nonlinear regression is a method for finding the relationship between a dependent variable $ y_n $ and an independent variable $ x_n $. The two quantities are related by a model function $ f(\beta, x_n) $. $ \beta $ is a vector of model parameters. $ y_n $ is measured in the presence of random noise, which is represented mathematically by a random variable $ \epsilon_n $. In equation form:

$ y_n=f(\beta, x_n)+\epsilon_n $.

The goal of a regression is to find the set of model parameters $ \hat{\beta} $ that best matches the observed data.

Ordinary nonlinear least squares regression assumes that:

  • the error terms have a mean value of zero,
  • the error terms are independent and identically distributed (i.i.d.),
  • the independent variable is known exactly, with zero noise, and
  • the independent variable covers a range adequate to define all the model parameters.

These assumptions are almost never perfectly met in practice. It is important to consider how badly these assumptions have been violated when evaluating the results of a regression.


Get ready …

You need four things to run the regression:

  1. a vector containing the values of the independent variable;
  2. a vector containing the corresponding observed values of the dependent variable;
  3. a model function; and
  4. a vector of initial guesses for the parameters.

Power spectrum

Power spectrum of x-axis QPD voltage. 3.2 μm silica spheres trapped with 80 mW laser power. Sampling frequency was 50 MHz.

The optical trap records a sequence of voltages from the QPD, sampled at frequency $ f_s $. In the regression we plan to do, frequency, $ f $, is the independent variable, and power, $ P_{xx} $ is the dependent variable. The time series must be transformed into a power spectrum. An example power spectrum from the optical trap is shown above on the right.

The MATLAB function pwelch computes a power spectrum and associated frequency vector from a time series. Like many MATLAB functions, pwelch can be called with different sets of parameters. The signatures of pwelch are particularly opaque — even by Mathworks' standards. Here is the command that generated the example spectrum above:

[ Pxx, frequency ] = pwelch( OpticalTrapTimeDataX, [], [], [], 50000 );

The first argument, OpticalTrapTimeDataX is an Nx1 vector containing the QPD x-axis voltage samples. Empty values for window, noverlap, and f to their default values. We will reexamine these parameters later on. The last parameter, 50000, is the sampling frequency.



Model function

signature [ PredictedValues ] = ModelFunction( Beta, x)

First attempt