Difference between revisions of "20.109(F23):M2D7"

From Course Wiki
Jump to: navigation, search
(Created page with "<div style="padding: 10px; width: 820px; border: 5px solid #434a43;"> {{Template:20.109(F23)}} ==Introduction== The research goal for this module was to identify small mo...")
 
(Part 1: Consider modifications that could improve binding to PfFKBP35)
 
(15 intermediate revisions by 2 users not shown)
Line 6: Line 6:
 
==Introduction==
 
==Introduction==
  
The research goal for this module was to identify small molecules that bind PfFKBP35As a bonus, it would be ideal to identify a small molecule that binds PfFKBP35 but not the human ortholog, FKBP12Throughout this module we learned that there are different methods that can be used to find small molecules that bind a protein of interest.
+
Research takes time!  The work that is reported in a published article is often a culmination of years of study by multiple investigatorsBecause of this it is very difficult to complete an entire project in the time span of a single semesterThe work you completed this semester is part of an ongoing research project that started even before your previous classmates.
  
One method used to find small molecules that bind a protein of interest is to use a high-throughput screen, such as the SMMThough we did not use this method as part of this module, we learned about the technology and the data analysis workflow.  This technology is an approach used by researchers to screen thousands, to hundreds of thousands, of small molecules in an unbiased manner using commercially available libraries.
+
'''Previous work'''
 +
In work completed by the Fall 2022 class, students screened a library of small molecules that are chemically altered versions of FK506 which is known to bind to PfFKBP35 and the human ortholog, FKBP12The goal of the initial screen was to examine the binding of each small molecule to PfFKBP35 and to FKBP12 in an effort to identify molecules that could potentially be used to target PfFKBP35 preferentially over FKBP12.  From this experiment, the four small molecules used in your research were identified.
  
Another method is to design small molecules using a scaffold molecule that is known to bind the protein of interest, such as the approach used in this moduleHere we used a set of small molecules that were chemically altered versions of FK506 which is known to bind PfFKBP35.  This approach is only useful when a known small molecule binder is knownIn addition, using this approach will not be useful in discovering molecules that bind outside of a known binding pocket or active site.
+
'''Your work'''
 +
The research goal for this module was to further study small molecules that were previously identified to bind to PfFKBP35.  To better characterize the binding parameters of each of the four identified small molecules to PfFKBP35, you examined binding across different concentrations of small moleculeThe T<sub>m</sub> for PfFKBP35 across a range of small molecule concentrations can be used to confirm that binding is occurring and can also give some insight into the strength of the binding.  If the small molecule binds to FfFKBP35, then a shift in the T<sub>m</sub> will be observed and can be quantified by subtracting the T<sub>m</sub> of PfFKBP35 without small moleculeThe strength of binding can qualitatively be addressed by considering the concentrations of small molecule needed to cause a shift in the T<sub>m</sub>If a shift in T<sub>m</sub> for a small molecule is observed at only high concentrations, then perhaps the strength of binding is not as good as for another small molecule that causes a shift in T<sub>m</sub> at lower concentrations.
  
Today we will consider yet another technique used to identify a small molecule that binds to a protein of interestIn this exercise we will think through how a researcher might rationally design a small molecule based on the structure of the protein of interest.
+
'''Future work'''
 +
There are several next steps that can be taken from your work.  An obvious experiment is to test the small molecules used in your work with FKBP12 to confirm that the binding to PfFKBP35 is preferential across a range of concentrations of small moleculeIf a small molecule passes this step, then perhaps modifications can be made to further improve the affinity and specificity to PfFKBP35.  If a small molecule doesn't emerge as promising, then perhaps it is time to restart the screening process to identify new molecules.
  
 
==Protocols==
 
==Protocols==
  
===Part 1: Participate in Communication Lab workshop===
+
===Part 1: Consider modifications that could improve binding to PfFKBP35===
  
Our communication instructor, Dr. Chiara Ricci-Tam, will join us today for a discussion on preparing written manuscripts.
+
One method used to identify small molecules that bind to a specific target is to rationally design small molecules based on the structure information.  If the structure of the protein target is known, then this information can be used to build a molecule that binds. In reality this is a very challenging task!  To increase the chances of success using rational design, a scaffold molecule that is a known binder can be used.  This is the approach that was used to develop the initial library of molecules that was screened by students in Fall 2022.  The library was built by chemically modifying FK506, which is known to bind to PfFKBP35.  This approach is only useful when a known small molecule binder is known.  In addition, using this approach is not be useful in discovering molecules that bind outside of a known binding pocket or active site.
  
===Part 2: Align KBP12 and PfFKBP35 protein sequences===
+
One method for assessing what functional groups are potentially important in protein-small molecule binding is to visually inspect known small molecule binders for common features / structures. As shown in the image below, common shapes can be identified between the small molecules.
  
One way to compare proteins is to align the amino acid sequences.
+
[[Image:Sp17 20.109 M1D7 chemical structure features.png|thumb|750px|center|]]
  
#To start, you will use the align function in UniProt (linked [https://www.uniprot.org/align here]).
+
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
#Open the word document (linked [[Media:Fa22 AA sequence FKBP35 FKBP12.docx|here]]), then copy all of the text within the document and paste into second box under the header "Enter multiple protein or nucleotide sequences, separated by a FASTA header."
+
#Select the 'Run Align' button at the bottom right of the page.
+
#When the alignment is finished, click on 'Completed' under the 'Status' header.
+
#Review your alignment results by selecting each of the tabs across the top of the page.
+
#*The 'Overview' tab shows the exact matches between the sequences in purple boxes.
+
#*The 'Trees' tab shows evolutionary relationships between the sequences by representing the species on branches with varying distances  This is more informative when more than two sequences are aligned.
+
#*The 'Percent Identity Matrix' tab provides a numeric score determined for each pair of aligned sequences displayed in a matrix. It measures the number of identical residues (or matched) in relation to the length of the alignment.
+
#*The 'Text Output' tabs shows the alignment with information regarding the degree of conservation at each amino acid. An * (asterisk) indicates positions which have a single, fully conserved residue.  A : (colon) indicates conservation between groups of strongly similar properties. And a . (period) indicates conservation between groups of weakly similar properties.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Record the information provided in each of the tabs.
+
#*Briefly, describe the similarities and differences you notice by examining the amino acid sequences of FKBP12 and PfFKBP35.
+
#*What regions might be interesting targets to target with a small molecule specifically designed to bind PfFKBP35?  Why?
+
#*What type of functional group can be used to target the region? (a positive functional group? a polar functional group?).
+
  
===Part 3: Examine KBP12 and PfKBP35 structures===
+
*For the small molecules that showed binding from the class DSF results,
 +
**Are there any common features present?
 +
**If there are common features, do these features resemble the functional groups you hypothesized would promote binding to PfFKBP35 in the exercise completed on M2D4?
 +
**What might this suggest about the potential binding site(s) on PfFKBP35?
 +
*How might you make modifications to the small molecules / features to improve binding to PfFKBP35?
 +
*What is a downside of using this approach to identify small molecules that bind a specific target?
 +
*What is an upside of using this approach to identify small molecules that bind a specific target?
  
Because it is difficult to gather information regarding secondary and tertiary structure from an amino acid sequence, it is often more informative to evaluate 3D models of proteins.  For this exercise you will use protein viewing platforms to examine the structures of FKBP12 and PfFKBP35.
+
===Part 2: Consider other methods for small molecule discovery===
  
#To start, you will examine the structure of FKBP12 using the Protein Data Bank (PDB) (linked [http://www.pdb.org/pdb/home/home.do here]).  In this online database, the structures are organized according to PDB identification codes.
+
In contrast to rational design, another method used to find small molecules that bind a protein of interest is to use a high-throughput screen, such as the small molecular microarray (SMM).  Though we did not use this method as part of this module, the technology and the data analysis workflow will be briefly described below as this could be an alternative strategy to the approach used in this moduleThis technology enables researchers to screen thousands, to hundreds of thousands, of small molecules in an unbiased manner using commercially available libraries.
#For this exercise, you will look at the unbound form of FKBP12. 
+
#*Enter "1D6O" into the search box at the top right corner of the PDB homepage.
+
#The landing page for the FKBP12 structure includes background information on the source and reference for this protein structure.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What method was used to solve this protein structure?  Perform a quick search to learn more about this method and provide a brief description.
+
#*At what resolution was the structure solved? Perform a quick search to learn more about this concept and provide a brief description.
+
#*What is the total weight of the structure?
+
#*How many chains are included in this structure?
+
#*Read the abstract for the reference article wherein this structure was first published.  What is the difference noted between the solved structures associated with this work when different ligands are bound?
+
#Under the structure shown on the left side of the window, click the 'Structure' link.  A page showing the 'cartoon' structure of FKBP12 will load.  Using the tools to the right of this page you will be able to more closely examine the structure.
+
#First, let's orient ourselves on how to move / manipulate the protein structure.
+
#*Place your cursor over the structure and while pressing down on your mouse / track pad, move the image to view the protein structure from different angles.
+
#*To zoom-in on an area of the protein structure, place your cursor on the area of interest and double-click.  When zoomed in single-click on a residue to get a more detailed view of the amino acids that are present in that area.  The dotted lines represent bonds or salt bridges that exist between the elements in the amino acids.
+
#*To zoom-out, single-click on the white space in the viewer window.
+
#*To zoom-in or -out more gradually, use two fingers and drag in the up or down direction.
+
#*[[Image:Fa22 M2D7 PDB amino acid location.png|thumb|400px|right|]]To identify which amino acid residues are present in each position of the protein, hover your cursor over the protein.  A box will appear in the lower right of the viewer window (see example to the right).  Though most of the details here can be ignored, the information provided tells you that the highlighted residue is a alanine (Ala) at position 72 in the amino acid sequence.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What secondary structures are present in FKBP12?
+
#Next, let's consider the tools provided in the panel on the right of the page.
+
#The contents of the 'Components' tab are listed: Polymer, Ligand, Water, and Ion.
+
#*Polymer refers the larger structures present, such as protein chains, DNA, or RNA. 
+
#*Ligand refers to any non-polymer structure, such as ligand binders, ATP, or co-factors that are not single atoms.
+
#*Water refers to water.
+
#*Ion refers to any lone elements that are associated with the structure.
+
#*Use the 'eyeball' icon to the right of the component labels to remove / add the components to the image.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Does the FKBP12 structure contain the Components listed?  Answer yes or no for each Component type.
+
#Click on the 'Density' tab.  Though we will not focus much on the details here, the electron density map is the actual data from the x-ray crystallography experiment used to solve the structure.
+
#*Select '2Fo-Fc &sigma;' from the options.
+
#*Click the box to the right of 'Wireframe' such that this feature is activated (toggle to '&#10003; On').
+
#*Click on a residue within the protein structure.  This will zoom-in on that area and also layer a grid, or cage, over the area.  The cage represents the electron density data that were captured via x-ray crystallography.  The structural features and atoms within the FKBP12 protein were modeled to match the density map, thus providing a best estimate of the protein structure. The resolution is related to how tight this cage is to the solved structure.  Though a gross oversimplification, the relationship can be described as such: the fit of the cage to the solved structure is related to the angstrom value achieved via crystallography, the smaller the angstrom the better the resolution and thus the tighter the cage to the solved structure.
+
#To gain more insight into the secondary structure of FKBP12, select the 'Sequence' tab above the protein window.
+
#The exact positions for several different structural features are provided in the chart on this tab.
+
#<font color = #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What secondary structures are present in FKBP12?
+
#*At what amino acid positions are these secondary structures present?
+
#Next, let's look at the structure of PfFKBP35!
+
#Though the structure of FKBP12 was solved experimentally, this is not true for the full-length version of PfFKBP35.  Therefore, we will use AlphaFold Protein Structure Database to examine the structure of PfFKBP35 (linked [https://alphafold.ebi.ac.uk/ here]).  With this software, you will look at the predicted structure of PfFKBP35. 
+
#*Enter "Q8I4V8" into the search box at the center of the AlphaFold Protein Structure Database homepage.
+
#Choose the first entry, "Peptidyl-prolyl cis-trans isomerase FKBP35" from the options listed.
+
#The landing page for the PfFKBP35 structure includes background information on the biological function and the references for this information.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What is the biological function of PfFKBP35?
+
#Scroll to the predicted structure of PfFKBP35.
+
#Familiarize yourself with how to move / manipulate the structure.  The movements / manipulations are largely the same as those used in PDB.
+
#Though the information that can be gained from the predicted structure of PfFKBP35 is limited compared to the experimentally derived structure of FKBP12, several questions can be answered by visually examining the predicted structure.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What secondary structures are present in PfFKBP35?
+
#*At what amino acid positions are these secondary structures present? (Note: this can be found by moving the cursor over the area of interest in the protein structure.)  How does this compare to FKBP12?
+
#*Briefly, describe the similarities and differences you notice by visually examining the structures of FKBP12 and PfFKBP35.
+
  
===Part 4: Align FKBP12 and PfFKBP35 protein structures===
+
'''Preparing the small molecule microarray''' <br>
 +
An SMM screen requires the covalent attachment of a library of small molecules to a glass slide. The library is meant to broadly sample the drug-like chemical space (''i.e.'' all possible chemical structures that have drug-like physical properties) and contains about 50,000 small molecules. Some libraries are much smaller, while many pharmaceutical companies possess high-throughput screening (HTS) collections of millions of compounds. Because this chemical space is very large, it’s difficult to generalize any single chemical reaction for this attachment that can be applied to all small molecules. This method uses a “one-size-fits-most” approach, where the glass slide is functionalized with a broadly reactive electrophile capable of reacting with nucleophiles present in most drug-like small molecules, such as alcohols or amines. Many small molecules contain multiple nucleophiles suitable for attachment. In this case, manufacturing will result in a mixture of attachment sites. It’s important to remember that attachment to the glass slide constrains the possible orientation of the protein-small molecule interaction; some orientations are not possible because the glass slide and linker are in the way.
  
Though it is possible to identify differences via a visual inspection of two proteins side-by-side, an overlay of two proteins can better highlight areas of interest that are specific to each protein or shared between the proteins.
+
First, a glass slide with exposed amines across the surface and attach a short PEG (polyethylene glycol) linker. To the end of this PEG linker, an isocyanate group is attached. Isocyanate, or R-N=C=O, is a resonant structure, and a partial positive charge is stabilized on the carbon atom.  This carbon atom is electrophilic, and small molecules with nucleophiles will react here.  It is estimated that about 70% of drug-like small molecules are amenable to this reaction, and the library is filtered to contain only these molecules.
  
#To start, you will need to download a trial version of PyMOL (linked [https://pymol.org/2/ here]).
+
[[Image:Sp17 20.109 M1D4 SMM printing.png|thumb|700px|center|Image from Bradner, J. E. et al. [http://www.ncbi.nlm.nih.gov/pubmed/17406478 PMID: 17406478]]]
#You will also need to download the PDB files for FKBP12 and PfFKBP35.
+
#*For FKBP12: Toward the top, right of the protein structure page click the 'Download Files' dropdown and select 'PDB Format' from the options listed.
+
#*For PfFKBP35: Toward the top, left of the protein structure page click the 'PDB File' button to the right of the 'Download' header.
+
#Open PyMOL.
+
#*Select the 'Skip Activation' option.
+
#[[Image:Fa22 M2D7 file upload.png|right|thumb|200px]]To open the .pdb files, be sure that the PyMOL window active then select 'File' from the toolbar and choose 'Open...' from the dropdown list. Select each file.
+
#*Files should populate the panel to the right of the PyMOL window (see image to the right).
+
#For technical reasons due to the data embedded in the .pdb file for FKBP12 that are beyond the scope of this exercise, two chains will appear in the structure viewing panel.  To omit the second chain, enter the following as separate commands:
+
#* "create 1d6o_1_chain, 1d6o and chain A"
+
#* "hide everything, 1d6o"
+
#To align the structures, enter the command: "align 1d6o_1_chain,AF-Q8I4V8-F1-model_V4"
+
#If the aligned structures are out of the viewing panel, enter the command: "zoom AF-Q8I4V8-F1-model_V4"
+
#[[Image:Fa22 M2D7 color selection.png|right|thumb|200px]]If the default color-coding make it difficult to distinguish the aligned proteins, select the 'C' (C = color) next to the name of one of the protein files in the panel to the right of the PyMOL window and select one of the color options from the dropdown list (see image to the right).
+
#Visually inspect the alignment by moving / manipulating the structures. 
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Briefly, describe your observations. Do the proteins appear to align well?  Not well?  Perhaps a combination of well and not well?
+
#To better visualize the the structures you can use the options in the 'S' (S = show) next to the files of the proteins in the panel to the right.
+
#*The options include changing the structure such that it is represented as sticks or dots.  
+
#*Options can be applied to only one structure by selecting the 'S' to the right of the specific protein file or to both structures by selecting the 'S' to the right of the 'all' file.
+
#To identify the amino acid residues present at a specific location within the structures, double-click on an area.  The amino acid at that position will be provided in the top panel of the PyMOL window (see image below).[[Image:Fa22 M2D7 amino acid identification.png|center|thumb|450px]]
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Based on what you learned from the protein structures, consider how you might design a small molecule specific to PfFKBP35:
+
#**What regions might be interesting targets to target with a small molecule specifically designed to bind PfFKBP35?  Why?
+
#**What type of functional group can be used to target the region? (a positive functional group? a polar functional group?).
+
#*Are the regions identified here the same as those identified in Part #2?  Different?
+
#*Comment on which approach is most useful (either examining amino acid sequences or structures) in identifying regions that are unique to each protein or shared between both proteins.  Why?
+
  
===Part 5:  Draft discussion section for Research article===
+
The compound library is dissolved in DMSO and stored in 384-well plates. To dispense the compounds onto our functionalized glass slide, a robotic arm with a set of 48 metal pins is used to transfer the compounds to the glass slide. Each metal pin has a small slit in the end, and capillary action is used to precisely withdraw and dispense consistent volumes. When the pins touch the glass slide, the compound in DMSO is dispensed into a small circle of approximately 160 micron in diameter. Each pin prints one compound in two different locations on each slide, and then the pins are washed in water and DMSO. This process is repeated for each compound, resulting in our final microarray. The microarray is divided into 48 subarrays, and each subarray corresponds to one pin and contains 256 discrete spots. Within each subarray, we print a set of fluorescent compounds in the shape of an ‘X’ so that we can precisely determine where each spot is printed. After the compounds react, we quench the surface so that no electrophiles remain. This results in our final microarray; a collection of approximately 12,000 discrete spots displaying one compound each.
 +
 
 +
'''Performing an SMM screen''' <br>
 +
 
 +
<font color = #0d368e>'''To see the steps used to screen small molecules, please watch the video tutorial linked here: [[https://www.dropbox.com/s/0phy1rg1w23fs49/SMM%20Staining.mp4?dl=0 SMM Screen]].  The steps are detailed below so you can follow along!'''</font color>
 +
 
 +
'''Scanning SMM slides''' <br>
 +
 
 +
After the printed SMM slides are incubated with purified protein, the next step is to check which of the small molecules, if any, you screened may be able to bind the protein of interest. To do this the slides are imaged using a Genepix microarray scanner.  The scanner measures the fluorescence signal emitted from the slide at two wavelengths: 532 nm
 +
and 635 nm.  The goal for today is to familiarize you with how the SMM slides are scanned and imaged.  These images are the raw data that will be used to identify putative small molecule binders.
 +
 
 +
[[Image:Fa22 M2D4 SMM, scan.png|thumb|700px|center|'''Overview of SMM slide imaging.''' A. To visualize which small molecule is putative binder of PfFKBP35, the 6xHis-tag is labeled using an anti-His antibody that is conjugated to a fluorophore. B. The fluorophore used to label protein bound to the small molecule in a particular location on the slide emits a red signal (indicated by white arrow).  Image in Panel A generated using BioRender.]]<br>
 +
 
 +
When the SMM slides are imaged, the scanner exposes each slide to excitation light specific to the fluorophores used in the experiment. As shown in the figure above, two fluorophores were used to evaluate small molecule binding to the protein of interest.  The green spots represent locations on the SMM slide where fluorescein was printed.  Fluorescein is a fluorescent dye that emits at 532 nm and is used for alignment purposes.  Correct alignment is critical to knowing which small molecules are in which spots on the slide.  Red spots are indicative of small molecules that are bound by the protein of interest.  The signal is due to an Alexa Fluor 647 anti-His antibody that emits at 635 nm. 
 +
 
 +
During imaging, the scanner detects the intensity of the emitted fluorescence light to generate an image that can be analyzed. The intensity and position of the 'red' signal associated with putative small molecule binders is what will be assessed in the next laboratory session.  This analysis will provide a list of 'hits' that can be used in future studies to identify drug candidates.
 +
 
 +
'''Analyzing SMM results''' <br>
 +
 
 +
[[Image:Sp20 M1D6 background, foreground.png|right|800px|thumb]]The microarrayer reads the fluorescence signals emitted from the surface of the SMM slide at two excitation wavelengths.  As noted previously, the 532 nm wavelength was used to excite fluorescein, which was printed in an 'X' pattern to assist with alignment.  The 635 nm wavelength was used to excite Alexa Fluor 647-conjugated anti-His antibody; which would be associated with the protein of interest bound to a small molecule on the slide.  A hit denotes a spot on the slide that emits a red fluorescence signal significantly higher than the background fluorescence level.  In terms of protein binding, a hit denotes that the protein of interest is bound to a small molecule and is therefore localized to a specific position on the slide.  You will analyze the fluorescence signal collected by the microarray scanner using a value termed the robust z-score. 
 +
 
 +
The robust z-score differentiates signal from noise by providing a value that represents the intensity of a signal above background.  In the case of the SMM experiment, the intensity of a fluorescent signal above the background fluorescence is calculated.  To do this the fluorescence emitted across the entire slide is grouped to define the Median Absolute Deviation (MAD), which is is a measure of the variability of a univariate dataset.  Though beyond the scope of this class, the equation for calculating the robust z-score assigns a value for how much more intense the fluroescent signal at a spot is over background.  The higher the value, the more different the signal from background. After putative binders are identified via the robust z-score, the data are examined by-eye.
 +
 
 +
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
 +
 
 +
*What is a downside of using this approach to identify small molecules that bind a specific target?
 +
*What is an upside of using this approach to identify small molecules that bind a specific target?
 +
 
 +
===Part 3:  Draft discussion section for Research article===
  
 
As the final section of your Research article, you will write a formal Discussion that summarizes the key findings and states the implications of your research.  Use the homework you completed for today to draft the Discussion for your Research article.   
 
As the final section of your Research article, you will write a formal Discussion that summarizes the key findings and states the implications of your research.  Use the homework you completed for today to draft the Discussion for your Research article.   

Latest revision as of 15:58, 13 November 2023

20.109(F23): Laboratory Fundamentals of Biological Engineering

Fa23 banner image.png

Fall 2023 schedule        FYI        Assignments        Homework        Class data        Communication        Accessibility

       M1: Genomic instability        M2: Drug discovery        M3: Project design       


Introduction

Research takes time! The work that is reported in a published article is often a culmination of years of study by multiple investigators. Because of this it is very difficult to complete an entire project in the time span of a single semester. The work you completed this semester is part of an ongoing research project that started even before your previous classmates.

Previous work In work completed by the Fall 2022 class, students screened a library of small molecules that are chemically altered versions of FK506 which is known to bind to PfFKBP35 and the human ortholog, FKBP12. The goal of the initial screen was to examine the binding of each small molecule to PfFKBP35 and to FKBP12 in an effort to identify molecules that could potentially be used to target PfFKBP35 preferentially over FKBP12. From this experiment, the four small molecules used in your research were identified.

Your work The research goal for this module was to further study small molecules that were previously identified to bind to PfFKBP35. To better characterize the binding parameters of each of the four identified small molecules to PfFKBP35, you examined binding across different concentrations of small molecule. The Tm for PfFKBP35 across a range of small molecule concentrations can be used to confirm that binding is occurring and can also give some insight into the strength of the binding. If the small molecule binds to FfFKBP35, then a shift in the Tm will be observed and can be quantified by subtracting the Tm of PfFKBP35 without small molecule. The strength of binding can qualitatively be addressed by considering the concentrations of small molecule needed to cause a shift in the Tm. If a shift in Tm for a small molecule is observed at only high concentrations, then perhaps the strength of binding is not as good as for another small molecule that causes a shift in Tm at lower concentrations.

Future work There are several next steps that can be taken from your work. An obvious experiment is to test the small molecules used in your work with FKBP12 to confirm that the binding to PfFKBP35 is preferential across a range of concentrations of small molecule. If a small molecule passes this step, then perhaps modifications can be made to further improve the affinity and specificity to PfFKBP35. If a small molecule doesn't emerge as promising, then perhaps it is time to restart the screening process to identify new molecules.

Protocols

Part 1: Consider modifications that could improve binding to PfFKBP35

One method used to identify small molecules that bind to a specific target is to rationally design small molecules based on the structure information. If the structure of the protein target is known, then this information can be used to build a molecule that binds. In reality this is a very challenging task! To increase the chances of success using rational design, a scaffold molecule that is a known binder can be used. This is the approach that was used to develop the initial library of molecules that was screened by students in Fall 2022. The library was built by chemically modifying FK506, which is known to bind to PfFKBP35. This approach is only useful when a known small molecule binder is known. In addition, using this approach is not be useful in discovering molecules that bind outside of a known binding pocket or active site.

One method for assessing what functional groups are potentially important in protein-small molecule binding is to visually inspect known small molecule binders for common features / structures. As shown in the image below, common shapes can be identified between the small molecules.

Sp17 20.109 M1D7 chemical structure features.png

In your laboratory notebook, complete the following:

  • For the small molecules that showed binding from the class DSF results,
    • Are there any common features present?
    • If there are common features, do these features resemble the functional groups you hypothesized would promote binding to PfFKBP35 in the exercise completed on M2D4?
    • What might this suggest about the potential binding site(s) on PfFKBP35?
  • How might you make modifications to the small molecules / features to improve binding to PfFKBP35?
  • What is a downside of using this approach to identify small molecules that bind a specific target?
  • What is an upside of using this approach to identify small molecules that bind a specific target?

Part 2: Consider other methods for small molecule discovery

In contrast to rational design, another method used to find small molecules that bind a protein of interest is to use a high-throughput screen, such as the small molecular microarray (SMM). Though we did not use this method as part of this module, the technology and the data analysis workflow will be briefly described below as this could be an alternative strategy to the approach used in this module. This technology enables researchers to screen thousands, to hundreds of thousands, of small molecules in an unbiased manner using commercially available libraries.

Preparing the small molecule microarray
An SMM screen requires the covalent attachment of a library of small molecules to a glass slide. The library is meant to broadly sample the drug-like chemical space (i.e. all possible chemical structures that have drug-like physical properties) and contains about 50,000 small molecules. Some libraries are much smaller, while many pharmaceutical companies possess high-throughput screening (HTS) collections of millions of compounds. Because this chemical space is very large, it’s difficult to generalize any single chemical reaction for this attachment that can be applied to all small molecules. This method uses a “one-size-fits-most” approach, where the glass slide is functionalized with a broadly reactive electrophile capable of reacting with nucleophiles present in most drug-like small molecules, such as alcohols or amines. Many small molecules contain multiple nucleophiles suitable for attachment. In this case, manufacturing will result in a mixture of attachment sites. It’s important to remember that attachment to the glass slide constrains the possible orientation of the protein-small molecule interaction; some orientations are not possible because the glass slide and linker are in the way.

First, a glass slide with exposed amines across the surface and attach a short PEG (polyethylene glycol) linker. To the end of this PEG linker, an isocyanate group is attached. Isocyanate, or R-N=C=O, is a resonant structure, and a partial positive charge is stabilized on the carbon atom. This carbon atom is electrophilic, and small molecules with nucleophiles will react here. It is estimated that about 70% of drug-like small molecules are amenable to this reaction, and the library is filtered to contain only these molecules.

Image from Bradner, J. E. et al. PMID: 17406478

The compound library is dissolved in DMSO and stored in 384-well plates. To dispense the compounds onto our functionalized glass slide, a robotic arm with a set of 48 metal pins is used to transfer the compounds to the glass slide. Each metal pin has a small slit in the end, and capillary action is used to precisely withdraw and dispense consistent volumes. When the pins touch the glass slide, the compound in DMSO is dispensed into a small circle of approximately 160 micron in diameter. Each pin prints one compound in two different locations on each slide, and then the pins are washed in water and DMSO. This process is repeated for each compound, resulting in our final microarray. The microarray is divided into 48 subarrays, and each subarray corresponds to one pin and contains 256 discrete spots. Within each subarray, we print a set of fluorescent compounds in the shape of an ‘X’ so that we can precisely determine where each spot is printed. After the compounds react, we quench the surface so that no electrophiles remain. This results in our final microarray; a collection of approximately 12,000 discrete spots displaying one compound each.

Performing an SMM screen

To see the steps used to screen small molecules, please watch the video tutorial linked here: [SMM Screen]. The steps are detailed below so you can follow along!

Scanning SMM slides

After the printed SMM slides are incubated with purified protein, the next step is to check which of the small molecules, if any, you screened may be able to bind the protein of interest. To do this the slides are imaged using a Genepix microarray scanner. The scanner measures the fluorescence signal emitted from the slide at two wavelengths: 532 nm and 635 nm. The goal for today is to familiarize you with how the SMM slides are scanned and imaged. These images are the raw data that will be used to identify putative small molecule binders.

Overview of SMM slide imaging. A. To visualize which small molecule is putative binder of PfFKBP35, the 6xHis-tag is labeled using an anti-His antibody that is conjugated to a fluorophore. B. The fluorophore used to label protein bound to the small molecule in a particular location on the slide emits a red signal (indicated by white arrow). Image in Panel A generated using BioRender.

When the SMM slides are imaged, the scanner exposes each slide to excitation light specific to the fluorophores used in the experiment. As shown in the figure above, two fluorophores were used to evaluate small molecule binding to the protein of interest. The green spots represent locations on the SMM slide where fluorescein was printed. Fluorescein is a fluorescent dye that emits at 532 nm and is used for alignment purposes. Correct alignment is critical to knowing which small molecules are in which spots on the slide. Red spots are indicative of small molecules that are bound by the protein of interest. The signal is due to an Alexa Fluor 647 anti-His antibody that emits at 635 nm.

During imaging, the scanner detects the intensity of the emitted fluorescence light to generate an image that can be analyzed. The intensity and position of the 'red' signal associated with putative small molecule binders is what will be assessed in the next laboratory session. This analysis will provide a list of 'hits' that can be used in future studies to identify drug candidates.

Analyzing SMM results

Sp20 M1D6 background, foreground.png
The microarrayer reads the fluorescence signals emitted from the surface of the SMM slide at two excitation wavelengths. As noted previously, the 532 nm wavelength was used to excite fluorescein, which was printed in an 'X' pattern to assist with alignment. The 635 nm wavelength was used to excite Alexa Fluor 647-conjugated anti-His antibody; which would be associated with the protein of interest bound to a small molecule on the slide. A hit denotes a spot on the slide that emits a red fluorescence signal significantly higher than the background fluorescence level. In terms of protein binding, a hit denotes that the protein of interest is bound to a small molecule and is therefore localized to a specific position on the slide. You will analyze the fluorescence signal collected by the microarray scanner using a value termed the robust z-score.

The robust z-score differentiates signal from noise by providing a value that represents the intensity of a signal above background. In the case of the SMM experiment, the intensity of a fluorescent signal above the background fluorescence is calculated. To do this the fluorescence emitted across the entire slide is grouped to define the Median Absolute Deviation (MAD), which is is a measure of the variability of a univariate dataset. Though beyond the scope of this class, the equation for calculating the robust z-score assigns a value for how much more intense the fluroescent signal at a spot is over background. The higher the value, the more different the signal from background. After putative binders are identified via the robust z-score, the data are examined by-eye.

In your laboratory notebook, complete the following:

  • What is a downside of using this approach to identify small molecules that bind a specific target?
  • What is an upside of using this approach to identify small molecules that bind a specific target?

Part 3: Draft discussion section for Research article

As the final section of your Research article, you will write a formal Discussion that summarizes the key findings and states the implications of your research. Use the homework you completed for today to draft the Discussion for your Research article.

Remember that the Results and Discussion information will be separate in this more formal writing assignment. Use the questions below to help you decide which details should be included where in your text.

Fa22 M2D7 results v discussion.png

Navigation links

Next day: Brainstorm ideas for Research Proposal presentation

Previous day: Complete data analysis of functional assay results