Difference between revisions of "20.109(S08):Start-up protein engineering (Day1)"

From Course Wiki
Jump to: navigation, search
(Introduction)
(Protocols)
Line 16: Line 16:
  
 
==Protocols==
 
==Protocols==
 +
 +
===Part 1: protein backbone===
 +
 +
Perhaps nothing is so conducive to a feeling of intimate familiarity with a protein as studying it at the amino acid level (primary structure). For the first part of lab today, you will dissect a two-dimensional representation of inverse pericam into its component parts. Begin by downloading this document (LINK), which contains the DNA and amino acid sequences of inverse pericam (IPC). You will annotate these sequences to guide your design work.
 +
 +
#Figure 1 of the paper by [http://www.ncbi.nlm.nih.gov/sites/entrez?Db=pubmed&Cmd=ShowDetailView&TermToSearch=11248055&ordinalpos=5&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum Nagai et al.] depicts the inverse pericam construct. Use this diagram to locate the M13 peptide in your sequence file. (Hint: the last paragraph of Nagai’s introduction states how many residues this version of M13 contains, and also cites a paper that gives the M13 amino acid sequence.) Bold the first and last triplet codons of this component to indicate its location, and also write the location (according to residue) at the top of the page (e.g., M13 = aa10 - aa50).
 +
#Next, look for the two parts of the permuted EYFP, keeping in mind that each component may be separated by a short linker of a few residues. You might look for the DNA or amino acid sequence of EYFP on a site such as [http://www.ncbi.nlm.nih.gov/ NCBI] (choose Protein or Nucleotide in the Search pull-down menu to get aa or bp, respectively). Note that many variants of this protein exist, but they have a lot of sequence overlap. It may help you orient yourself to know that the version of EYFP in IPC was modified at residues 68 and 69 (see beginning of Materials and Methods in Nagai et al. for details). Again, bold the first and last triplet codon of each component, and note the numerical location.
 +
#You can use a similar method – i.e., finding sequence data online – to locate calmodulin within inverse pericam. The CaM sequence is highly conserved across species, so you can most likely locate it using almost any sequence.
 +
#Finally, mark the linkers in between each component in blue: how do they compare to the linkers described in the Results section of the Nagai paper? (this part as a FNT assignment?)
  
 
==For next time==
 
==For next time==

Revision as of 19:26, 13 December 2007


20.109(S08): Laboratory Fundamentals of Biological Engineering

Macintosh HD-Users-nkuldell-Desktop-labbottles.png

Home        People        Schedule Spring 2008        Assignments        Lab Basics        OWW Basics       
DNA Engineering        Protein Engineering        Biomaterials Engineering              

Introduction

Contrary to how it may be taught in some laboratory classrooms, the process of scientific inquiry encompasses much more than the collection and interpretation of data. A key part of the process is design – of experiments that specifically address a hypothesis, and of new materials or technologies. Moreover, any design is subject to continued revision. You might redesign an experiment or tool based on your own own research, or you might consult the vast body of scientific literature for other perspectives. As the old graduate student saying goes: “a month in the lab might save you a day in the library!” In other words, although the process of combing the literature can be arduous or even tedious at times, it beats wasting a month of your time repeating experiments already proven not to work, or reinventing the wheel.

During this module, each of you will design and test a new version of inverse pericam (IPC). Today we will refer to a few primary research articles in order to familiarize ourselves with this protein and its constituent parts. The fluorescent component of IPC is an enhanced yellow fluorescent protein (abbreviated EYFP), one of the many derivatives of green fluorescent protein (GFP). GFP is naturally produced by jellyfish and was cloned into other organisms in the early 1990’s; it has since been exploited as a genetically encodable reporter, and mutagenized to vary its excitation and emission spectra. The other key component of inverse pericam is the protein calmodulin (CaM), a natural calcium sensor that is present in all eukaryotes (LINK a review, e.g. Chin?). Calmodulin has many ligands that it binds only in the presence of calcium, including the peptide fragment M13. This conditional specificity for M13 binding is enabled by the change in CaM’s conformation when it binds calcium.

Within inverse pericam, M13 and CaM are located at opposite ends, surrounding a permuted (i.e., rearranged) version of EYFP. In the absence of calcium, this EYFP exhibits strong fluorescence. However, when calcium is added to a solution of inverse pericam, CaM and M13 interact, disrupting the conformation and as a result the fluorescence of EYFP. The transition from bright to dim fluorescence occurs over a particular concentration range of calcium. Your goal today is to propose a modification (actually, three) that will shift the concentration range over which IPC fluorescence decreases. Specifically, you will modify the calcium sensor portion of inverse pericam in a manner that is likely to increase or decrease its affinity for calcium.

In order to make reasonable modifications to inverse pericam, we will use several protein analysis tools. Proteins are modular materials that may be described and examined at multiple levels of a structural hierarchy (from primary to quaternary in the classical paradigm). Primary structure refers to a protein’s amino acid sequence, which might reveal a cluster of charged residues, say, or a pattern of alternating polar and nonpolar residues. One cannot predict the conformation of a protein merely from its linear sequence, however, due to rotational flexibility about its backbone, and non-covalent interactions between non-adjacent amino acids (as well as covalent disulfide bonds).

Physical methods used to interrogate 3D protein structure include X-ray diffraction (XRD), electron microscopy, and nuclear magnetic resonance (NMR) spectroscopy. The paper by Zhang et al. that you will refer to today describes the decoding of calmodulin’s structure using NMR, which depends on subjecting molecules to electromagnetic fields and analyzing the resulting energy absorption spectra of their nuclei. Scientists who elucidate protein structures, in addition to publishing their results, will often add them to public databases such as the Protein Data Bank (PDB). Because many proteins have structural motifs in common (e.g., alpha helices and beta sheets at the secondary level, or leucine-rich repeats at the tertiary level), which ultimately arise from their chemical formulae, such databases can be useful for making predictions about proteins with known amino acid sequences but unknown structures. Today we will use a computer program that harnesses information in the Protein Data Bank to display interactive 3D images.

After examining both two- and three-dimensional protein information, you will propose three changes to the wild-type inverse pericam protein, and finally design primers for incorporating these modifications at the gene level.

Protocols

Part 1: protein backbone

Perhaps nothing is so conducive to a feeling of intimate familiarity with a protein as studying it at the amino acid level (primary structure). For the first part of lab today, you will dissect a two-dimensional representation of inverse pericam into its component parts. Begin by downloading this document (LINK), which contains the DNA and amino acid sequences of inverse pericam (IPC). You will annotate these sequences to guide your design work.

  1. Figure 1 of the paper by Nagai et al. depicts the inverse pericam construct. Use this diagram to locate the M13 peptide in your sequence file. (Hint: the last paragraph of Nagai’s introduction states how many residues this version of M13 contains, and also cites a paper that gives the M13 amino acid sequence.) Bold the first and last triplet codons of this component to indicate its location, and also write the location (according to residue) at the top of the page (e.g., M13 = aa10 - aa50).
  2. Next, look for the two parts of the permuted EYFP, keeping in mind that each component may be separated by a short linker of a few residues. You might look for the DNA or amino acid sequence of EYFP on a site such as NCBI (choose Protein or Nucleotide in the Search pull-down menu to get aa or bp, respectively). Note that many variants of this protein exist, but they have a lot of sequence overlap. It may help you orient yourself to know that the version of EYFP in IPC was modified at residues 68 and 69 (see beginning of Materials and Methods in Nagai et al. for details). Again, bold the first and last triplet codon of each component, and note the numerical location.
  3. You can use a similar method – i.e., finding sequence data online – to locate calmodulin within inverse pericam. The CaM sequence is highly conserved across species, so you can most likely locate it using almost any sequence.
  4. Finally, mark the linkers in between each component in blue: how do they compare to the linkers described in the Results section of the Nagai paper? (this part as a FNT assignment?)

For next time