Difference between revisions of "20.109(S21):M3D3"

From Course Wiki
Jump to: navigation, search
(Part 3: Purify IPC protein)
Line 5: Line 5:
 
==Introduction==
 
==Introduction==
  
In the previous laboratory session, you reviewed the mutations that were generated in IPC to create variants. The goal of this was to change calcium binding of IPC by affecting either affinity or cooperativity. Today you will learn how IPC and the IPC variants were expressed and purified, then you will evaluate the success of the protein purification procedure.
+
As evidenced by Nagai’s work, wild-type inverse pericam is not toxic to BL21(DE3)pLysS cells. Although it is unlikely that your point mutation will dramatically change this fact, in general a novel protein may turn out to be toxic. If this is the case, only very small amounts of protein are produced before the bacteria die. Keep in mind that overexpressing a single protein may come at the expense of producing proteins needed for survival, and will most likely cause cell death eventually; however, toxic proteins hasten this demise. Aberrant toxicity can sometimes be alleviated by reducing the culture temperature (e.g., to 30 °C).
  
[[Image:Sp16 M1D4 pRSET vector map.png|thumb|right|350px|'''Schematic of pRSET expression plasmid.''' Modified from Invitrogen manual.]] The genetic sequences that encode the IPC protein and IPC variant proteins are maintained within the pRSET expression vector (recall the cloning exercise from M3D1!). This expression vector contains several features that are important to the expression and purification of IPC and the IPC variants. To enable selection of bacterial cells that carry pRSET_IPC, an antibiotic cassette, specifically an ampicillin marker, is included on the vector. The features most relevant to protein expression and purification are highlighted in the schematic to the right.  The T7 promoter drives expression of the gene that encodes IPC (or IPC variant).  To ensure that the transcript is translated into a protein, a ribosome binding site (RBS) is included. The ATG sequence serves as the transcriptional start and the 6xHis represents the six-histidine residue tag that is used for protein purification via affinity chromatography.
+
Based on its fluorescence activity, wild-type inverse pericam allows proper folding of (cp)EYFP, and based on its response to calcium, it also allows calmodulin to fold. One problem you may encounter is that your mutant proteins will no longer fold correctly. Since you made mutations in the calcium sensor part of IPC, rather than the fluorescent part, it is unlikely that your protein will destroy EYFP fluorescence. However, a common problem with misfolded proteins is the formation of insoluble aggregates, due for instance to improperly exposed hydrophobic surfaces. Proteins can be purified from these aggregates – called inclusion bodies – but the process is more labor-intensive than for soluble proteins. (The proteins must be extracted under more harsh conditions than you will use next time, then purified under denaturing conditions, before finally attempting to renature the proteins.) Inclusion bodies sometimes form simply due to very high expression of the protein of interest, causing it to pass its solubility limit. This outcome can be prevented by lowering the culture temperature, the induction duration, the amount of IPTG, or the growth phase of the bacteria.
  
There are some similarities between the expression system used to purify TDP43-RRM12 in Mod 2 and the system we will use for IPC.  First, in both expression systems IPTG is used to induce protein production. As a review, IPTG is a lactose analog that induces expression by binding to the LacI repressor. When bound to IPTG, the LacI repressor is not able to bind to the ''lac'' operator sequence and transcription occurs unimpeded. For more details please review to the [[20.109(S21):M2D1#Introduction |M2D1 Introduction]]!  Another similarity is that a 6His tag is used and 6His-tagged IPC and IPC variants will purified using column affinity as shown in the image below.  There are also several differences between the expression systems for TDP43-RRM12 and IPC.  As you read through the exercises below, consider how these steps are different from those used previously.
+
One final point to keep in mind is that not all proteins can be produced in bacteria. Eukaryotic proteins that require post-translational modifications (such as glycosylation) for activity require eukaryotic hosts (such as yeast, or the commonly used CHO – Chinese hamster ovary – cells). Sometimes eukaryote-derived proteins will be truncated or otherwise mistranslated by E. coli due to differential codon bias; errors in translation can be prevented by providing additional tRNAs to the culture or directly to the bacteria via plasmids. Despite all this complexity, prokaryotic hosts have been plenty good enough to produce proteins for certain therapies, notably the cytokine G-CSF. This cytokine is taken by patients needing to replenish their white blood cells (e.g., after chemotherapy), and sold as Neupogen by the company Amgen.
  
[[Image:Fa20 M2D1 protein purification.png|thumb|550px|center|'''Schematic of affinity separation process.''' For purification, agarose beads (yellow) are coated with nickel (green). When cell lysate is added to the nickel-coated agarose beads, His-tagged protein of interest (blue) adheres to the beads and other proteins in the lysate (orange) are washed from the beads.]]
 
  
==Protocols==
+
This is it, folks! Moment of truth. Time to find out how the proteins that you worked so hard to express, purify, and test really behave. Although you should be able to produce reasonable titration curves by following the example of Nagai, the introduction/review of binding constants below may help contextualize your analysis.
  
===Part 1: Participate in Comm Lab workshop===
+
Let’s start by considering the simple case of a receptor-ligand pair that are exclusive to each other, and in which the receptor is monovalent. The ligand (L) and receptor (R) form a complex (C), which can be written
  
Our communication instructors, Dr. Prerna Bhargava and Dr. Sean Clarke, will join us today for a discussion on preparing a Research proposal presentation.
+
<center>
 +
<math> R + L  \rightleftharpoons\ ^{k_f}_{k_r}      C </math>
 +
</center>
  
===Part 2: Prepare protein expression system===
+
At equilibrium, the rates of the forward reaction (rate constant = <math>k_f</math>) and reverse reaction (rate constant = <math>k_r</math>) must be equivalent. Solving this equivalence yields an equilibrium dissociation constant <math>K_d</math>, which may be defined either as <math>k_r/k_f</math>, or as <math>[R][L]/[C]</math>, where brackets indicate the molar concentration of a species. Meanwhile, the fraction of receptors that are bound to ligand at equilibrium, often called ''y'' or &theta;, is <math>C/R_{TOT}</math>, where <math>R_{TOT}</math> indicates total (both bound and unbound) receptors. Note that the position of the equilibrium (''i.e.'', ''y'') depends on the starting concentrations of the reactants; however, <math>K_d</math> is always the same value. The total number of receptors <math>R_{TOT}</math>= [''C''] (ligand-bound receptors) + [''R''] (unbound receptors). Thus,
  
As mentioned above, IPTG is used to induce protein production in the expression systems for TDP43-RRM12 and IPC; however, the mechanism that drives transcription of the gene that encodes the protein of interest is different.  For IPC and the IPC variants, the proteins are expressed using the BL21(DE3)pLysS strain of ''E. coli'', which has the following genotype: F<sup>-</sup>, ''omp''T ''hsd''SB (r<sub>B</sub><sup>-</sup> m<sub>B</sub><sup>-</sup>) ''gal dcm'' (DE3) pLysS (Cam<sup>R</sup>).
+
<center>
 +
<math>\qquad y = {[C] \over R_{TOT}} \qquad = \qquad {[C] \over [C] + [R]} \qquad = \qquad {[L] \over [L] + [K_d]} \qquad</math>
 +
</center>
  
As mentioned above, pRSET encodes the bacteriophage T7 promoter, which is active only in the presence of T7 RNA polymerase (T7RNAP), an enzyme that therefore must be expressed by the bacterial strain used to make the protein of interest.  In BL21(DE3), T7RNAP is associated with a ''lac'' construct. Constitutively expressed ''lac'' repressor (''lac''I gene) blocks expression from the ''lac'' promoter; thus, the polymerase will not be produced except in the presence of repressor-binding lactose or a small-molecule lactose analogue such as IPTG (isopropyl &beta;-D-thiogalactoside). To reduce ‘leaky’ expression of the protein of interest (in our case, IPC), the pLysS version of BL21(DE3) contains T7 lysozyme, which inhibits basal transcription of T7RNAP. This gene is retained by chloramphenicol selection, while the pRSET plasmid itself (and thus IPC) is retained by ampicillin selection.
+
where the right-hand equation was derived by algebraic substitution. If the ligand concentration is in excess of the concentration of the receptor, [''L''] may be approximated as a constant, ''L'', for any given equilibrium. Let’s explore the implications of this result:
  
The pRSET_IPC and pRSET_IPC variants were transformed into chemically competent BL21(DE3)pLysS using heat shock as described previously.  To review this method, look back at the information provided on [[20.109(S21):M1D3#Part_3:_Transform_plasmid_from_yeast_into_E._coli |M1D3]]!
+
*What happens when ''L'' << <math>K_d</math>?
 +
::&rarr;Then ''y'' ~ <math>L/K_d</math>, and the binding fraction increases in a first-order fashion, directly proportional to ''L''.  
  
[[Image:Sp16 M1D4 protein expression system.png|thumb|center|600px|'''Overview of protein expression system used for IPC purification.''']]
+
*What happens when ''L'' >> <math>K_d</math>?
 +
::&rarr;In this case ''y'' ~1, so the binding fraction becomes approximately constant, and the receptors are saturated.
  
<font color = #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
*What happens when ''L'' = <math>K_d</math>?
*questions about expression system...
+
::&rarr;Then ''y'' = 0.5, and the fraction of receptors that are bound to ligand is 50%. This is why you can read <math>K_d</math> directly off of the plots in Nagai’s paper (compare Figure 3 and Table 1). When ''y'' = 0.5, the concentration of free calcium (our [''L'']) is equal to <math>K_d</math>. '''This is a great rule of thumb to know.'''
  
===Part 3: Purify IPC protein===
+
The figures below demonstrate how to read <math>K_d</math> from binding curves. You will find semilog plots (right) particularly useful today, but the linear plot (left) can be a helpful visualization as well. Keep in mind that every ''L'' value is associated with a particular equilbrium value of ''y'', while the curve as a whole gives information on the global equilibrium constant <math>K_d</math>.
  
Though the protocol refers only to the purification of IPC, the same procedure was used to purify IPC all of the IPC variants that were assessed in this experiment.
+
[[Image:20109 Fa15 M2D7 figure.png|thumb|300px|left|'''Simple binding curve.''' The binding fraction ''y'' at first increases linearly as the starting ligand concentration is increased, then asymptotically approaches full saturation (''y''=1). The dissociation constant <math>K_d</math> is equal to the ligand concentration [''L''] for which ''y'' = 1/2.]]
 +
[[Image:20109 Fa15 M2D7 figure2.png|thumb|300px|center|'''Semilog binding curves.''' By converting ligand concentrations to logspace, the dissociation constant is readily determined from the inflection point of the sigmoidal curve. The three curves each represent different ligand species. The middle curve has a <math>K_d</math> close to 10 nM, while the right-hand curve has a higher <math>K_d</math> and therefore lower affinity between ligand and receptor (vice-versa for the left-hand curve).]]
 +
<br style="clear:both;"/>
  
'''Induce expression of IPC'''
+
Of course, inverse pericam has multiple binding sites, and thus IPC-calcium binding is actually more complicated than the example above. The <math>K_d</math> reported by Nagai is called an ‘apparent <math>K_d</math>’ because it reflects the overall avidity of multiple calcium binding sites, not their individual affinities for calcium. Normally, calmodulin has a low affinity (N-terminus) and a high affinity (C-terminus) pair of calcium binding sites. However, the E104Q mutant, which is the version of CaM used in inverse pericam, displays low affinity binding at both termini. Moreover, the Hill coefficient, which quantifies cooperativity of binding in the case of multiple sites, is reported to be 1.0 for inverse pericam. This indicates that inverse pericam behaves as if it were binding only a single calcium ion per molecule. Thus, wild-type IPC is well-described by a single apparent <math>K_d</math>.
  
#Inoculate 5 mL of LB media containing 50 &mu;g/mL ampicillin and 34 &mu;g/mL chloramphenicol with a colony of BL21(DE3)pLysS cells transformed with pRSET_IPC.
+
For any given mutant, things may be more complicated. Keep in mind that we are not directly measuring calcium binding, but instead are indirectly inferring it based on fluorescence (for both mutant and wild-type IPC). A change in fluorescence requires the participation not only of calcium, but also of M13. In addition to the four separate calcium binding sites in calmodulin, the M13 binding site influences apparent affinity and apparent cooperativity. In short, be careful about how you describe the meanings of our binding parameters in your reports.
#Incubate the culture overnight at 37 &deg;C with shaking at 220 rpm.
+
#Dilute the overnight culture 1:10 in 50 mL of fresh LB media containing 50 &mu;g/mL ampicillin and 34 &mu;g/mL chloramphenicol.
+
#Incubate at 37 &deg;C until the OD<sub>600</sub> = ~0.6 with shaking at 220 rpm, approximately 4 hours.
+
#To induce IPC protein expression, add IPTG to a final concentration of 1 mM.
+
#Incubate at 25 &deg;C with shaking at 100 rpm overnight.
+
#To harvest the cells, centrifuge the culture at 3000 g for 15 min at 4 &deg;C.
+
#Cell pellet was stored at -80 &deg;C until used for purification.
+
  
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
Returning to the big picture: when you write your Protein engineering summary, be sure to consider how changes in both binding affinity and cooperativity (and even potentially raw fluorescence differences) can affect the practical utility of a sensor.
*Why is it important that both ampicillin and chloramphenicol are added to the growth media?
+
  
'''Lyse BL21(DE3)pLysS cells expressing pRSET_IPC'''
+
==Protocols==
  
#Obtain a 2 mL aliquot of room temperature BugBuster buffer and the induced BL21(DE3)pLysS pRSET_IPC cell pellet.
+
===Part 2: Prepare samples for titration curve===
#*BugBuster is a bacterial lysis and protein extraction solution, which contains 0.1% bovine serum albumin and 1:200 protease inhibitor cocktail to guard against protein degradation.
+
#Add 1:1000 of cold nuclease enzyme to the BugBuster buffer.
+
#Add 600 &mu;L of the BugBuster with nuclease enzyme to the BL21(DE3)pLysS pRSET_IPC cell pellet.
+
#Resuspend the cell pellet by pipetting until the solution is homogeneous.
+
#Incubate on the nutator at room temperature for 10 minutes.
+
#Centrifuge the lysed cell suspension for 10 minutes at maximum speed.
+
#Transfer the supernatant to a fresh microcentrifuge tube.
+
  
'''Prepare Ni-NTA affinity column'''
+
==== Tips for success====
  
It is important that all liquid waste generated in the below steps is collected in a designated waste stream due to the presence of nickel in the solution!
+
Take great care today to limit the introduction of bubbles in your samples. When expelling fluid, pipet '''''slowly''''' while touching the pipet tip against the bottom or side of the well.
 +
<!--
 +
When using the multichannel pipet, always check to make sure all tips are getting filled - sometimes one tip may not be on all the way, and will pull up less volume than the others. If this happens, release the fluid, adjust the tip, and try again.
 +
-->
  
#Gently mix the Ni-NTA His-bind resin to fully resuspend, then aliquot 400 μL of the resin into a 15 mL conical tube.
+
====Protocol====
#Add 1.6 mL of 1X Ni-NTA Bind Buffer to the Ni-NTA His-bind resin.
+
#Resuspend the resin by pippeting, then centrifuge at 3300 rpm for 1 minute.
+
#Carefully remove the supernatant and discard it in the appropriate waste stream.
+
  
'''Purify TDP43-RRM12 from cell lysate'''
+
[[File:Fa15 Protein Ca assay plate map.png|thumb|450px|right|Titration plate map]]
  
It is important that all liquid waste generated in the below steps is collected in a designated waste stream due to the presence of imidazole in the solution!
+
#Take a black 96-well plate, and familiarize yourself with the plate map scheme at right: top two rows are to be loaded with wild-type IPC, next two rows are to be loaded with your X#Z mutant IPC, and the final row is to be loaded with  water/BSA to serve as a blank/background row.
 +
#*The dark sides of the plate reduce "cross-talk" (''i.e.'', light leakage) between samples in adjacent wells, another potential contribution to error.
 +
#Aliquot your wild-type protein to your plate. Use your P200 pipet to add 30 μL of protein (per well) to rows A and B of your plate.
 +
#Aliquot you X#Z mutant IPC to your plate.  Use your P200 pipet and add 30 μL of protein (per well) to rows C and D of your plate.
 +
#Finally, add 30 μL of water with only 0.1% BSA (no IPC) to row 5(E) of your plate.
 +
#The calcium solutions are at the front bench in shared reservoirs.  '''Carefully''' carry your plate to the front bench to add these solutions with the  multi-channel pipet. 
 +
#Using shared reservoir #1 (lowest calcium concentration - actually 0 nM), add 30 μL to the top five rows in the first column of the plate. Discard the pipet tips.
 +
#Now work your way from reservoirs #2 to #12 (highest calcium concentration), and from the left-hand to the right-hand columns on your plate. Be sure to use fresh pipet tips each time! If you do contaminate a solution, let the teaching faculty know so they can put out some fresh solution. Honesty about a mistake is far preferred here to affecting every downstream experiment.
 +
#When you are done, alert the teaching faculty and you will be taken in small groups to measure the fluorescence values for your samples.
  
#Add the supernatent from the cell lysis to the prepared Ni-NTA His-bond resin and carefully place on the nutator at at 4&deg;C for 30 minutes.
+
===Part 3: Fluorescence assay===
#Centrifuge at 3300 rpm for 1 minute.
+
#Remove the liquid above the resin and discard it in the appropriate waste stream.
+
#Add 1 mL of 1X Ni-NTA Wash Buffer to the resin and resuspend.
+
#Centrifuge at 3300 rpm for 1 minute.
+
#Remove the liquid above the resin and discard it in the appropriate waste stream.
+
#Repeat Steps #4-6.
+
#To collect your purified protein, add 500 &mu;L of 1X Ni-NTA Elute Buffer and resupend.
+
#Centrifuge at 3300 rpm for 1 minute.
+
#Transfer the liquid above the resin to a fresh microcentrifuge tube.
+
#*Please note: your protein is in the liquid at this step!
+
#Repeat Steps #8-10.
+
#*Transfer the liquid from the second elution to the same tube used in Step #10.
+
#*You should have a total of 1 mL of purified protein solution.
+
  
'''Remove imidazole from purified IPC'''
+
#The BMC (BioMicro Center) has graciously agreed to let us use their plate reader. Walk over to building 68 with a member of the teaching staff.
 +
#You will be shown how to set the excitation (485 nm) and emission (515 nm) wavelength on the plate reader to assay your protein.
 +
#Your raw data will be posted on today's [[Talk:20.109(S16):Characterize protein expression (Day7)|Discussion page]] and emailed to you as a .txt file so you can begin your analysis.
  
Pilot experiments revealed that imidazole affects the binding curves of inverse pericams. Thus, you will further purify your protein by removing low molecular weight compounds (which includes imidazole!) using a column that removes salt, or desalts, liquid as it passing through a resin.  
+
You will analyze your calcium titration assay data in two steps. First, you will get a rough feel for how your mutant changed (or didn't) compared to wild-type IPC by plotting the two replicate values and their average, in both raw and processed form. Second, you will take the average processed values and plug them into some <small>MATLAB</small> code that will more precisely tell you the affinity and cooperativity of each protein with respect to calcium.  
  
#Obtain a Zeba column and a 15 mL conical tube.
+
===Part 1: Titration curve in Excel and first estimate of ''K<sub>d</sub>''===
#To prepare the Zeba column, snap off the bottom of a Zeba column and place it into a 15 mL conical tube.
+
#Centrifuge the column at 2100 rpm for 2 minutes.
+
#Transfer the column to a fresh 15 mL conical tube, then gently apply your ~1 mL of purified protein solution to the center of the compacted resin.
+
#Centrifuge the column at 2100 rpm for 2 minutes.
+
#Transfer the liquid from the 15 mL conical tube to a fresh microcentrifuge tube.
+
#*Please note: your protein is in the liquid at this step!
+
#From the desalted purified protein solution, aliquot the following amounts:
+
#*Add 25 &mu;L to a fresh microcentrifuge tube for examining protein purity using SDS-PAGE.
+
#*Add 10 &mu;L to a fresh microcentrifuge tube for examining protein concentration using microBCA.
+
#Lastly, add a 1:100 dilution of 10% BSA to the remaining desalted purified protein solution.
+
#*For reference, 10 &mu;L of 10% BSA would be added to 1 mL of protein solution.
+
  
===Part 4: Evaluate purified IPC===
+
Today  you will analyze the fluorescence data that you got last time. Begin by analyzing the wild-type protein as a check on your work (your curve should resemble Nagai's Figure 3L), and then move on to your mutant samples. If you are not familiar with manipulations in Excel, use the ''Help'' menu or ask the teaching faculty for assistance.
  
To evaluate the purified IPC protein, we will use the same methods as when we assessed purified TDP43-RRM12: SDS-PAGE and microBCA (this is a variation of the BCA procedure that is used to measure lower protein concentrations). To review these methods, look back at the information provided on [[20.109(S21):M2D2 |M2D2]]!
+
#Open an Excel file for your data analysis. Begin by making a column of the free calcium concentrations present in your twelve test solutions. Assuming a 1:1 dilution of protein with calcium, the final concentrations are: 0 nM, 8.5 nM, 19 nM, 32.5 nM, 50 nM, 75 nM, 112.5 nM, 175 nM, 301 nM, 675 nM, 1.505 &mu;M, 19.5 &mu;M. Be sure to convert all concentrations to the same units.<br>
 +
#Now open the text file containing your raw data as a tab-delimited file in Excel (you can download the file from the  [[Talk:20.109(S16):Characterize protein expression (Day7) | M1D7 Discussion]] page). Convert the row-wise data to column-wise data (using ''Paste Special'' &rarr; ''Transpose''), and transfer each column to your analysis file. Add column headers to indicate which protein is which, and analyze each replicate separately for now. Also include a column of your control samples that did not contain protein.
 +
#Begin by calculating the average of your blank samples, and bold this number for easy reference. It is the background fluorescence present in the calcium solutions and should be quite low. If necessary, subtract this background value from each of your raw data values. It may help to have a 6-column series called “RAW”, and another called “SUBTRACTED.
 +
#Next you should normalize your data. The maximum and minimum fluorescence values for a given titration series should be defined as 100% and 0% fluorescence, respectively, and every other fluorescence value should be expressed as a percentage in between. Think about how to mathematically express these conditions.
 +
#*First calculate the percent fluorescence for both replicates. Then make a new column and calculate the average percentage as well.
 +
#*Alternatively, average your data first, and then normalize the average data.  How do the "average then normalize" and the "normalize then average" curves compare?  Which one will you include in your Protein engineering summary?
 +
#*If one data point seems really off from the other replicate and from the expected trend, you might consider it an outlier and delete it, especially if you have good reason to believe that there was a reason (error in pipetting, air bubble in that well) for the anomaly. Otherwise, you might be losing valuable information, and/or misleading anyone who tries to interpret your data.
 +
#For each protein, plot this normalized data versus calcium concentration. Save these plots in case you want to include them in your report.
 +
#*You might plot the two replicates as points and their average value as a dashed line (see [http://engineerbiology.org/wiki/20.109%28S16%29:Module_1 front page] of this module).
 +
#Note down the approximate inflection points of the curves, which should occur at half-saturation: these indicate the approximate values of the apparent <math>K_d</math> for each sample.
  
'''Assess purity using SDS-PAGE'''
+
===Part 2: Improved estimate of ''K<sub>d</sub>'' using <small>MATLAB</small> modeling===
  
[[Image:Sp21 M3D3 SDSPAGE.png|thumb|500px|center]]
+
====Preparation====
 +
#Download these three files: [[Media:F15 Fit Main.| F15_Fit_Main]], [[Media:Fit_SingleKD.m| Fit_SingleKD]], and [[Media:Fit_KDn.m| Fit_KDn]]. Move them to the username/Documents/MATLAB folder on your computer.
 +
# Double-click on the <small>MATLAB</small> icon to start up this software.
 +
# The main window that opens is called the command window: here is where you run programs (or directly input commands) and view outputs. You can also see and access the command history, workspace, and current directory windows, but you likely won’t need to today.
 +
# In the command window, type ''more on''; this command allows you to scroll through multi-page output (using the spacebar), such as help files.
 +
# In addition to the command area, <small>MATLAB</small> comes with an editor. Click ''File'' &rarr; ''Open'' and select the program '''F15_Fit_Main'''. It has the .m extension and thus is executable by <small>MATLAB</small>. Read the introductory comments (the beginning of a comment is indicated by a % sign), and then input your fluorescence data.
 +
# Read through the program, and as you encounter unfamiliar terms, return to the workspace and type ''help functionname''. Feel free to ask questions of the teaching faculty as well.
 +
#* You might read about such built-in functions as ''logspace'' and ''nlinfit''.
 +
#* You will also want to open and read '''Fit_SingleKD''' – a user-defined function called by '''F15_Fit_Main''' – in the <small>MATLAB</small> editor.
 +
#* If you type ''help function'' you will learn the syntax for a function header.
 +
#* Note that a dot preceeding an operator (such as A ./ B or A .* B) is a way of telling <small>MATLAB</small> to perform element-by-element rather than matrix algebra.
 +
#* Also note that when a line of code is ''not'' followed by a semi-colon, the value(s) resulting from the operation will be displayed in the command window.
  
'''Measure concentration using microBCA'''
+
====Analysis====
  
#calculate concentration of total protein in each sample...
+
# Once you more-or-less follow Part 1 of the program, type '''F15_Fit_Main''' in the workspace, hit return to run the program, and consider the following questions:
 +
#* Why must the fluorescence data be transformed (from ''S'' to ''Y'') prior to using in the model?
 +
#* What <math>K_d</math> values are output in the command window, and how do they compare to the values you estimated from your Excel plots?
 +
#* Figure 1 should display your wild type and mutant data points and model curves. How do they look in comparison to the curves you plotted in Excel?
 +
#* Figure 2 should display the residuals (difference between data and model) for your three proteins. If the absolute values are low, this indicates good agreement between the model and the data numerically. Whether or not this is the case, another thing to look for is whether the residuals are evenly and randomly distributed about the zero-line. If there is a pattern to the errors, likely there is a systematic difference between the data and the model, and thus the model does not reflect the actual binding process well. What are the residuals like for each of your modeled proteins?
 +
# Now move on to Part 2 of the '''F15_Fit_Main''' program. Part 2 also fits the data to a model with a single, ‘apparent’ value of <math>K_d</math>, but it allows for multiple binding sites and tests for cooperativity among them. The parameter used to measure cooperativity is called the Hill coefficient. A Hill coefficient of 1 indicates independent binding sites, while greater or lesser values reflect positive or negative cooperativity, respectively. Let the following questions guide you as you proceed:
 +
#*Visually, which model appears to fit your wild-type data better (Fig. 3 ''vs.'' Fig. 1)? Your mutant data?
 +
#*Do the respective residuals support your qualitative assessment (Fig. 4 ''vs.'' Fig. 2)?
 +
#*Numerically, how do the values of <math>K_d</math> compare for the two models? How does the value of ''n'' compare to the implicitly assumed value of 1 in Part 1?
 +
#*Do you see changes in binding affinity and/or cooperativity between the wild-type and X#Z samples? Do they match your ''a priori'' predictions?
 +
#*'''Don't forget to save any figures you want to use in your report!''' If the legends are covering up your data, you can simply move them over with your mouse.
 +
# Finally, you can skim Part 3 of the '''F15_Fit_Main''' program. Make sure you update the range of the linear transition region for each IPC sample, but beyond this, don’t worry too much about the coding details; rather do read through the comments.
 +
#* Look at Part 1 of Figure 5: are the binding curves asymptotic, sigmoidal, or other? What does this shape indicate? You can use the zoom button to get a closer look at part of the plot, or the ''axis'' command present in the code. (Don't worry too much about this question if it is unclear.)
 +
#* Now look in the command window. What values of <math>K_d</math> and Hill coefficient (''n'') do you get for your three proteins? How do the <math>K_d</math>’s from Part 3 compare to the ones from Parts 1 and 2? Don’t be discouraged if your wild-type values do not exactly match Nagai’s work, or if there is variation between Parts 1, 2, and 3.
 +
#*Comparing the model and data points by eye (Part 2 of Figure 5), do you think it is a good model for any of your proteins? If so, which ones? What experimental limitations might prevent Hill analysis from working well, especially for some mutants?
 +
#*Why should only the transition region be analyzed in a Hill plot?
 +
#* What is the relationship between slope and <math>K_d</math> and/or ''n'', and intercept and <math>K_d</math> and/or ''n''?
 +
#If your mutant proteins are not well-described by any of the models so far, what kind of model(s) (qualitatively speaking) do you think might be useful?
 +
#*Optional: If your data might be well-described by a model with two <math>K_d</math>'s (or if you are interesting in exploring some sample data that is), download and run [[Media:Fit_TwoKD.m | Fit_TwoKD]] and [[Media:Fit_TwoKD_Func.m | Fit_TwoKD_Func]].
  
#using SDS-PAGE to estimate percentage of total protein that is IPC...
+
==Reagent list==
  
#might adding different amounts of IPC variants complicate comparisons... how... (move to next day)
 
 
#how could you change experiment such that same amounts of IPC used... (move to next day)
 
 
===Part 2: Advance preparation for SDS-PAGE of protein extracts===
 
 
#Last time you measured the amount of cells in each of your samples (-IPTG and +IPTG of the wild-type IPC and one correct mutant). (If you ran cultures overnight, the teaching faculty measured the +IPTG samples for you and posted the results.) Look back at your measurements, and find the sample with the lowest cell concentration. Set aside 15 &mu;L of this sample for PAGE analysis in an eppendorf.
 
#For your other three samples, you should take the amount of bacterial lysate corresponding to the same number of cells as the lowest concentration sample. For example, if the OD<sub>600</sub> of your WT -IPTG sample was 0.05, and the OD<sub>600</sub> of your WT +IPTG sample was 0.30, you would take  15 &mu;L of the -IPTG, but only 2.5 &mu;L of the +IPTG sample.
 
#Next, add enough water so the each sample has 15 &mu;L of liquid in it. You might use the table below to guide your work.
 
#Finally, add 3 μL of 6X sample buffer to 15 μL of each of your diluted lysates. These will be stored in the freezer until next time.
 
 
<center>
 
{| border="1"
 
! Sample Name       
 
! OD<sub>600</sub>
 
! Sample Volume (&mu;L)
 
! Water Volume (&mu;L)
 
! Total Volume (&mu;L)
 
|-
 
| -IPTG WT
 
|
 
|
 
|
 
|15
 
|-
 
| +IPTG WT
 
|
 
|
 
|
 
|15
 
|-
 
| -IPTG mutant
 
|
 
|
 
|
 
|15
 
|-
 
| +IPTG mutant
 
|
 
|
 
|
 
|15
 
|-
 
|}
 
</center>
 
 
 
 
===Part 4: Protein concentration===
 
====Part 4A: Prepare diluted albumin (BSA) standards====
 
#Obtain a 0.25 mL aliquot of 2.0 mg/mL albumin standard stock and a conical tube of diH<sub>2</sub>O from the front bench.
 
#Prepare your standards according to the table below using dH<sub>2</sub>O as the diluent:
 
#*'''Be sure to use 5 mL polystyrene tubes found on the instructors bench when preparing your standards as the volumes are too large for the microcentrifuge tubes.'''
 
<center>
 
{| border="1"
 
! '''Vial''' <br>
 
! '''Volume of diluent (mL)'''
 
! '''Volume (mL) and source of BSA (vial)'''
 
! '''Final BSA concentration (μg/mL)'''         
 
|-
 
| A
 
| 2.25
 
| 0.25 of stock
 
| 200
 
|-
 
| B
 
| 3.6
 
| 0.4 of A
 
| 20
 
|-
 
| C
 
| 2.0
 
| 2.0 of B
 
| 10
 
|-
 
| D
 
| 2.0
 
| 2.0 of C
 
| 5
 
|-
 
| E
 
| 2.0
 
| 2.0 of D
 
| 2.5
 
|-
 
| F
 
| 2.4
 
| 1.6 of E
 
| 1
 
|-
 
| G
 
| 2.0
 
| 2.0 of F
 
| 0.5
 
|-
 
| H
 
| 4.0
 
| 0
 
| Blank
 
|}
 
</center>
 
  
====Part 4B: Prepare Working Reagent (WR) and measuring protein concentration====
+
*Calcium calibration kit from Life Technologies
#Use the following formula to calculate the volume of WR required: (# of standards + # unknowns) * 1.1 = total volume of WR (in mL).
+
**Zero free calcium buffer: 10 mM EGTA in 100 mM KCl, 30 mM MOPS, pH 7.2
#Prepare the calculated volume of WR by mixing the Micro BCA Reagent MA, Reagent MB, and Reagent MC such that 50% of the total volume is MA, 48% is MB, and 2% is MC.
+
**39 &mu;M free calcium buffer: 10 mM CaEGTA in 100 mM KCl, 30 mM MOPS, pH 7.2
#*For example, if your calculated total volume of WR is 100 mL, then mix 50 mL of MA, 48 mL of MB, and 2 mL of MC.
+
#*'''Prepare your WR in a 15 mL conical tube.'''
+
#Pipet 0.5 mL of each standard prepared in Part 4A into clearly labeled 1.5 mL microcentrifuge tubes.
+
#Prepare your protein samples by adding 990 μL of dH<sub>2</sub>O to your 10 μL aliquot of purified protein, for a final volume of 1 mL in clearly labeled 1.5 mL microcentrifuge tubes.
+
#Add 0.5 mL of the WR to each 0.5 mL aliquot of the standard and to your 0.5 mL protein samples.
+
#Cap your tubes and incubate at 60&deg;C in the water bath for 1 hour. During this time download the sample data on the Discussion page to practice estimating protein concentration of your samples.
+
#Following the incubation, the teaching faculty will use the spectrophotometer to measure the protein concentrations of your standards and your purified samples.
+
#* The cuvette filled only with water (H) will be used as a blank in the spectrophotometer.
+
#*The absorbance at 562 nm for each solution will be measured and the results will be posted to today's [http://engineerbiology.org/wiki/Talk:20.109%28S16%29:Purify_protein_%28Day6%29 Discussion] page.
+
#* Establish your standard curve by plotting OD562 for each BSA standard (B-H) vs. its concentration in &mu;g/mL.
+
#* Use the standard curve in its linear range (0.5 - 20 &mu;g/mL), and its linear regression in Excel, to determine the protein concentration of each unknown sample (wild-type and mutant IPC).
+
  
==Reagents list==
+
*Thermo Scientific Varioskan Flash Spectral Scanning Multimode Reader
*Luria-Bertani broth (LB) (from Difco)
+
*ampicillin; stock = 100 mg/mL (from Sigma)
+
*chloramphenicol; stock = 34 mg/mL (from Sigma)
+
*isopropyl β-d-1-thiogalactopyranoside (IPTG) (from Sigma)
+
*BugBuster Protein Extraction Reagent (from EMD Millipore)
+
*6X Laemmli sample buffer (from Boston BioProducts)
+
*4-20% polyacrylamide gels in Tris-HCl (from Bio-Rad)
+
*TGS buffer: 5 mM Tris, 192 mM glycine, 0.1% (w/v) SDS (pH 8.3) (from Bio-Rad)
+
*Precision Plus Dual Color Standard ladder (from Bio-Rad)
+
**Molecular weights of ladder bands (linked [http://www.bio-rad.com/en-us/product/prestained-protein-standards?ID=a7b0f9ce-e080-4b51-ab99-4cded66497c1&WT.mc_id=170125006445&WT.srch=1&WT.knsh_id=7417aea6-506f-40cd-96ae-4b6621ba8344&gclid=Cj0KCQjwjer4BRCZARIsABK4QeXhVOAohSuOrR4OKPwHbNnBCWyi5EWgxDPWCbqp67YwI-Qk7AAmMaUaAumyEALw_wcB here]).
+
*BioSafe Coomassie G-250 Stain (from Bio-Rad)
+
*Protein purification supplies (from Novagen/Calbiochem):
+
**Ni-NTA His-Bind Resin
+
**1X Ni-NTA Bind Buffer; 50 mM NaH<sub>2</sub>PO<sub>4</sub>, pH 8.0; 300 mM NaCl; 10 mM imidazole
+
**1X Ni-NTA Wash Buffer; 50 mM NaH<sub>2</sub>PO<sub>4</sub>, pH 8.0; 300 mM NaCl; 20 mM imidazole
+
**1X Ni-NTA Elute Buffer; 50 mM NaH<sub>2</sub>PO<sub>4</sub>, pH 8.0; 300 mM NaCl; 250 mM imidazole
+
*Zeba Desalt Spin Columns (from Thermo Scientific)
+
*Micro BCA Protein Assay Kit (from Thermo Scientific)
+
  
 
==Navigation links==
 
==Navigation links==
Next day: [[20.109(S21):M3D4 |Evaluate effect of mutations on IPC variants ]] <br>
+
Next day: [[20.109(S21):M3D4 |Design new IPC variant ]] <br>
Previous day: [[20.109(S21):M3D2 |Examine IPC mutations ]] <br>
+
Previous day: [[20.109(S21):M3D2 |Identify IPC mutations]] <br>

Revision as of 15:17, 19 April 2021

20.109(S21): Laboratory Fundamentals of Biological Engineering

Sp21 banner image v2.png

Spring 2021 schedule        FYI        Assignments        Homework        Communication |        Accessibility

       M1: Antibody engineering        M2: Drug discovery        M3: Protein engineering       


Introduction

As evidenced by Nagai’s work, wild-type inverse pericam is not toxic to BL21(DE3)pLysS cells. Although it is unlikely that your point mutation will dramatically change this fact, in general a novel protein may turn out to be toxic. If this is the case, only very small amounts of protein are produced before the bacteria die. Keep in mind that overexpressing a single protein may come at the expense of producing proteins needed for survival, and will most likely cause cell death eventually; however, toxic proteins hasten this demise. Aberrant toxicity can sometimes be alleviated by reducing the culture temperature (e.g., to 30 °C).

Based on its fluorescence activity, wild-type inverse pericam allows proper folding of (cp)EYFP, and based on its response to calcium, it also allows calmodulin to fold. One problem you may encounter is that your mutant proteins will no longer fold correctly. Since you made mutations in the calcium sensor part of IPC, rather than the fluorescent part, it is unlikely that your protein will destroy EYFP fluorescence. However, a common problem with misfolded proteins is the formation of insoluble aggregates, due for instance to improperly exposed hydrophobic surfaces. Proteins can be purified from these aggregates – called inclusion bodies – but the process is more labor-intensive than for soluble proteins. (The proteins must be extracted under more harsh conditions than you will use next time, then purified under denaturing conditions, before finally attempting to renature the proteins.) Inclusion bodies sometimes form simply due to very high expression of the protein of interest, causing it to pass its solubility limit. This outcome can be prevented by lowering the culture temperature, the induction duration, the amount of IPTG, or the growth phase of the bacteria.

One final point to keep in mind is that not all proteins can be produced in bacteria. Eukaryotic proteins that require post-translational modifications (such as glycosylation) for activity require eukaryotic hosts (such as yeast, or the commonly used CHO – Chinese hamster ovary – cells). Sometimes eukaryote-derived proteins will be truncated or otherwise mistranslated by E. coli due to differential codon bias; errors in translation can be prevented by providing additional tRNAs to the culture or directly to the bacteria via plasmids. Despite all this complexity, prokaryotic hosts have been plenty good enough to produce proteins for certain therapies, notably the cytokine G-CSF. This cytokine is taken by patients needing to replenish their white blood cells (e.g., after chemotherapy), and sold as Neupogen by the company Amgen.


This is it, folks! Moment of truth. Time to find out how the proteins that you worked so hard to express, purify, and test really behave. Although you should be able to produce reasonable titration curves by following the example of Nagai, the introduction/review of binding constants below may help contextualize your analysis.

Let’s start by considering the simple case of a receptor-ligand pair that are exclusive to each other, and in which the receptor is monovalent. The ligand (L) and receptor (R) form a complex (C), which can be written

$ R + L \rightleftharpoons\ ^{k_f}_{k_r} C $

At equilibrium, the rates of the forward reaction (rate constant = $ k_f $) and reverse reaction (rate constant = $ k_r $) must be equivalent. Solving this equivalence yields an equilibrium dissociation constant $ K_d $, which may be defined either as $ k_r/k_f $, or as $ [R][L]/[C] $, where brackets indicate the molar concentration of a species. Meanwhile, the fraction of receptors that are bound to ligand at equilibrium, often called y or θ, is $ C/R_{TOT} $, where $ R_{TOT} $ indicates total (both bound and unbound) receptors. Note that the position of the equilibrium (i.e., y) depends on the starting concentrations of the reactants; however, $ K_d $ is always the same value. The total number of receptors $ R_{TOT} $= [C] (ligand-bound receptors) + [R] (unbound receptors). Thus,

$ \qquad y = {[C] \over R_{TOT}} \qquad = \qquad {[C] \over [C] + [R]} \qquad = \qquad {[L] \over [L] + [K_d]} \qquad $

where the right-hand equation was derived by algebraic substitution. If the ligand concentration is in excess of the concentration of the receptor, [L] may be approximated as a constant, L, for any given equilibrium. Let’s explore the implications of this result:

  • What happens when L << $ K_d $?
→Then y ~ $ L/K_d $, and the binding fraction increases in a first-order fashion, directly proportional to L.
  • What happens when L >> $ K_d $?
→In this case y ~1, so the binding fraction becomes approximately constant, and the receptors are saturated.
  • What happens when L = $ K_d $?
→Then y = 0.5, and the fraction of receptors that are bound to ligand is 50%. This is why you can read $ K_d $ directly off of the plots in Nagai’s paper (compare Figure 3 and Table 1). When y = 0.5, the concentration of free calcium (our [L]) is equal to $ K_d $. This is a great rule of thumb to know.

The figures below demonstrate how to read $ K_d $ from binding curves. You will find semilog plots (right) particularly useful today, but the linear plot (left) can be a helpful visualization as well. Keep in mind that every L value is associated with a particular equilbrium value of y, while the curve as a whole gives information on the global equilibrium constant $ K_d $.

Simple binding curve. The binding fraction y at first increases linearly as the starting ligand concentration is increased, then asymptotically approaches full saturation (y=1). The dissociation constant $ K_d $ is equal to the ligand concentration [L] for which y = 1/2.
Semilog binding curves. By converting ligand concentrations to logspace, the dissociation constant is readily determined from the inflection point of the sigmoidal curve. The three curves each represent different ligand species. The middle curve has a $ K_d $ close to 10 nM, while the right-hand curve has a higher $ K_d $ and therefore lower affinity between ligand and receptor (vice-versa for the left-hand curve).


Of course, inverse pericam has multiple binding sites, and thus IPC-calcium binding is actually more complicated than the example above. The $ K_d $ reported by Nagai is called an ‘apparent $ K_d $’ because it reflects the overall avidity of multiple calcium binding sites, not their individual affinities for calcium. Normally, calmodulin has a low affinity (N-terminus) and a high affinity (C-terminus) pair of calcium binding sites. However, the E104Q mutant, which is the version of CaM used in inverse pericam, displays low affinity binding at both termini. Moreover, the Hill coefficient, which quantifies cooperativity of binding in the case of multiple sites, is reported to be 1.0 for inverse pericam. This indicates that inverse pericam behaves as if it were binding only a single calcium ion per molecule. Thus, wild-type IPC is well-described by a single apparent $ K_d $.

For any given mutant, things may be more complicated. Keep in mind that we are not directly measuring calcium binding, but instead are indirectly inferring it based on fluorescence (for both mutant and wild-type IPC). A change in fluorescence requires the participation not only of calcium, but also of M13. In addition to the four separate calcium binding sites in calmodulin, the M13 binding site influences apparent affinity and apparent cooperativity. In short, be careful about how you describe the meanings of our binding parameters in your reports.

Returning to the big picture: when you write your Protein engineering summary, be sure to consider how changes in both binding affinity and cooperativity (and even potentially raw fluorescence differences) can affect the practical utility of a sensor.

Protocols

Part 2: Prepare samples for titration curve

Tips for success

Take great care today to limit the introduction of bubbles in your samples. When expelling fluid, pipet slowly while touching the pipet tip against the bottom or side of the well.

Protocol

Titration plate map
  1. Take a black 96-well plate, and familiarize yourself with the plate map scheme at right: top two rows are to be loaded with wild-type IPC, next two rows are to be loaded with your X#Z mutant IPC, and the final row is to be loaded with water/BSA to serve as a blank/background row.
    • The dark sides of the plate reduce "cross-talk" (i.e., light leakage) between samples in adjacent wells, another potential contribution to error.
  2. Aliquot your wild-type protein to your plate. Use your P200 pipet to add 30 μL of protein (per well) to rows A and B of your plate.
  3. Aliquot you X#Z mutant IPC to your plate. Use your P200 pipet and add 30 μL of protein (per well) to rows C and D of your plate.
  4. Finally, add 30 μL of water with only 0.1% BSA (no IPC) to row 5(E) of your plate.
  5. The calcium solutions are at the front bench in shared reservoirs. Carefully carry your plate to the front bench to add these solutions with the multi-channel pipet.
  6. Using shared reservoir #1 (lowest calcium concentration - actually 0 nM), add 30 μL to the top five rows in the first column of the plate. Discard the pipet tips.
  7. Now work your way from reservoirs #2 to #12 (highest calcium concentration), and from the left-hand to the right-hand columns on your plate. Be sure to use fresh pipet tips each time! If you do contaminate a solution, let the teaching faculty know so they can put out some fresh solution. Honesty about a mistake is far preferred here to affecting every downstream experiment.
  8. When you are done, alert the teaching faculty and you will be taken in small groups to measure the fluorescence values for your samples.

Part 3: Fluorescence assay

  1. The BMC (BioMicro Center) has graciously agreed to let us use their plate reader. Walk over to building 68 with a member of the teaching staff.
  2. You will be shown how to set the excitation (485 nm) and emission (515 nm) wavelength on the plate reader to assay your protein.
  3. Your raw data will be posted on today's Discussion page and emailed to you as a .txt file so you can begin your analysis.

You will analyze your calcium titration assay data in two steps. First, you will get a rough feel for how your mutant changed (or didn't) compared to wild-type IPC by plotting the two replicate values and their average, in both raw and processed form. Second, you will take the average processed values and plug them into some MATLAB code that will more precisely tell you the affinity and cooperativity of each protein with respect to calcium.

Part 1: Titration curve in Excel and first estimate of Kd

Today you will analyze the fluorescence data that you got last time. Begin by analyzing the wild-type protein as a check on your work (your curve should resemble Nagai's Figure 3L), and then move on to your mutant samples. If you are not familiar with manipulations in Excel, use the Help menu or ask the teaching faculty for assistance.

  1. Open an Excel file for your data analysis. Begin by making a column of the free calcium concentrations present in your twelve test solutions. Assuming a 1:1 dilution of protein with calcium, the final concentrations are: 0 nM, 8.5 nM, 19 nM, 32.5 nM, 50 nM, 75 nM, 112.5 nM, 175 nM, 301 nM, 675 nM, 1.505 μM, 19.5 μM. Be sure to convert all concentrations to the same units.
  2. Now open the text file containing your raw data as a tab-delimited file in Excel (you can download the file from the M1D7 Discussion page). Convert the row-wise data to column-wise data (using Paste SpecialTranspose), and transfer each column to your analysis file. Add column headers to indicate which protein is which, and analyze each replicate separately for now. Also include a column of your control samples that did not contain protein.
  3. Begin by calculating the average of your blank samples, and bold this number for easy reference. It is the background fluorescence present in the calcium solutions and should be quite low. If necessary, subtract this background value from each of your raw data values. It may help to have a 6-column series called “RAW”, and another called “SUBTRACTED.”
  4. Next you should normalize your data. The maximum and minimum fluorescence values for a given titration series should be defined as 100% and 0% fluorescence, respectively, and every other fluorescence value should be expressed as a percentage in between. Think about how to mathematically express these conditions.
    • First calculate the percent fluorescence for both replicates. Then make a new column and calculate the average percentage as well.
    • Alternatively, average your data first, and then normalize the average data. How do the "average then normalize" and the "normalize then average" curves compare? Which one will you include in your Protein engineering summary?
    • If one data point seems really off from the other replicate and from the expected trend, you might consider it an outlier and delete it, especially if you have good reason to believe that there was a reason (error in pipetting, air bubble in that well) for the anomaly. Otherwise, you might be losing valuable information, and/or misleading anyone who tries to interpret your data.
  5. For each protein, plot this normalized data versus calcium concentration. Save these plots in case you want to include them in your report.
    • You might plot the two replicates as points and their average value as a dashed line (see front page of this module).
  6. Note down the approximate inflection points of the curves, which should occur at half-saturation: these indicate the approximate values of the apparent $ K_d $ for each sample.

Part 2: Improved estimate of Kd using MATLAB modeling

Preparation

  1. Download these three files: F15_Fit_Main, Fit_SingleKD, and Fit_KDn. Move them to the username/Documents/MATLAB folder on your computer.
  2. Double-click on the MATLAB icon to start up this software.
  3. The main window that opens is called the command window: here is where you run programs (or directly input commands) and view outputs. You can also see and access the command history, workspace, and current directory windows, but you likely won’t need to today.
  4. In the command window, type more on; this command allows you to scroll through multi-page output (using the spacebar), such as help files.
  5. In addition to the command area, MATLAB comes with an editor. Click FileOpen and select the program F15_Fit_Main. It has the .m extension and thus is executable by MATLAB. Read the introductory comments (the beginning of a comment is indicated by a % sign), and then input your fluorescence data.
  6. Read through the program, and as you encounter unfamiliar terms, return to the workspace and type help functionname. Feel free to ask questions of the teaching faculty as well.
    • You might read about such built-in functions as logspace and nlinfit.
    • You will also want to open and read Fit_SingleKD – a user-defined function called by F15_Fit_Main – in the MATLAB editor.
    • If you type help function you will learn the syntax for a function header.
    • Note that a dot preceeding an operator (such as A ./ B or A .* B) is a way of telling MATLAB to perform element-by-element rather than matrix algebra.
    • Also note that when a line of code is not followed by a semi-colon, the value(s) resulting from the operation will be displayed in the command window.

Analysis

  1. Once you more-or-less follow Part 1 of the program, type F15_Fit_Main in the workspace, hit return to run the program, and consider the following questions:
    • Why must the fluorescence data be transformed (from S to Y) prior to using in the model?
    • What $ K_d $ values are output in the command window, and how do they compare to the values you estimated from your Excel plots?
    • Figure 1 should display your wild type and mutant data points and model curves. How do they look in comparison to the curves you plotted in Excel?
    • Figure 2 should display the residuals (difference between data and model) for your three proteins. If the absolute values are low, this indicates good agreement between the model and the data numerically. Whether or not this is the case, another thing to look for is whether the residuals are evenly and randomly distributed about the zero-line. If there is a pattern to the errors, likely there is a systematic difference between the data and the model, and thus the model does not reflect the actual binding process well. What are the residuals like for each of your modeled proteins?
  2. Now move on to Part 2 of the F15_Fit_Main program. Part 2 also fits the data to a model with a single, ‘apparent’ value of $ K_d $, but it allows for multiple binding sites and tests for cooperativity among them. The parameter used to measure cooperativity is called the Hill coefficient. A Hill coefficient of 1 indicates independent binding sites, while greater or lesser values reflect positive or negative cooperativity, respectively. Let the following questions guide you as you proceed:
    • Visually, which model appears to fit your wild-type data better (Fig. 3 vs. Fig. 1)? Your mutant data?
    • Do the respective residuals support your qualitative assessment (Fig. 4 vs. Fig. 2)?
    • Numerically, how do the values of $ K_d $ compare for the two models? How does the value of n compare to the implicitly assumed value of 1 in Part 1?
    • Do you see changes in binding affinity and/or cooperativity between the wild-type and X#Z samples? Do they match your a priori predictions?
    • Don't forget to save any figures you want to use in your report! If the legends are covering up your data, you can simply move them over with your mouse.
  3. Finally, you can skim Part 3 of the F15_Fit_Main program. Make sure you update the range of the linear transition region for each IPC sample, but beyond this, don’t worry too much about the coding details; rather do read through the comments.
    • Look at Part 1 of Figure 5: are the binding curves asymptotic, sigmoidal, or other? What does this shape indicate? You can use the zoom button to get a closer look at part of the plot, or the axis command present in the code. (Don't worry too much about this question if it is unclear.)
    • Now look in the command window. What values of $ K_d $ and Hill coefficient (n) do you get for your three proteins? How do the $ K_d $’s from Part 3 compare to the ones from Parts 1 and 2? Don’t be discouraged if your wild-type values do not exactly match Nagai’s work, or if there is variation between Parts 1, 2, and 3.
    • Comparing the model and data points by eye (Part 2 of Figure 5), do you think it is a good model for any of your proteins? If so, which ones? What experimental limitations might prevent Hill analysis from working well, especially for some mutants?
    • Why should only the transition region be analyzed in a Hill plot?
    • What is the relationship between slope and $ K_d $ and/or n, and intercept and $ K_d $ and/or n?
  4. If your mutant proteins are not well-described by any of the models so far, what kind of model(s) (qualitatively speaking) do you think might be useful?
    • Optional: If your data might be well-described by a model with two $ K_d $'s (or if you are interesting in exploring some sample data that is), download and run Fit_TwoKD and Fit_TwoKD_Func.

Reagent list

  • Calcium calibration kit from Life Technologies
    • Zero free calcium buffer: 10 mM EGTA in 100 mM KCl, 30 mM MOPS, pH 7.2
    • 39 μM free calcium buffer: 10 mM CaEGTA in 100 mM KCl, 30 mM MOPS, pH 7.2
  • Thermo Scientific Varioskan Flash Spectral Scanning Multimode Reader

Navigation links

Next day: Design new IPC variant

Previous day: Identify IPC mutations