FODIS USER GUIDE
User Manual:
Open the PDF directly: View PDF
.
Page Count: 60
| Download | |
| Open PDF In Browser | View PDF |
Fodis A Software for Single Molecule Force Spectroscopy (and Membrane Mechanics) User Guide Nicola Galvanetto Trieste, Italy October 2018 2 Contents Preface 3 Installation 5 Installation 7 1 Basics 1.1 Software Framework . . . . . . . . . . . . . . . . 1.2 Open data . . . . . . . . . . . . . . . . . . . . . . 1.2.1 Import traces (.txt files in Plain Format) 1.2.2 Open samples . . . . . . . . . . . . . . . . 1.3 Data saving . . . . . . . . . . . . . . . . . . . . . 1.4 Data pre-processing . . . . . . . . . . . . . . . . 1.5 Single trace representation - Plot data . . . . . . 1.5.1 Traces . . . . . . . . . . . . . . . . . . . . 1.5.2 Traces-Lc (Peaks automatic detection) . . 1.5.3 Dynamic persistence length . . . . . . . . 1.5.4 Contour length (Lc,Fc) . . . . . . . . . . 1.5.5 Contour length histogram . . . . . . . . . 1.5.6 Delta Lc histograms . . . . . . . . . . . . 1.6 Global trace representation . . . . . . . . . . . . 1.6.1 Global contour length histogram . . . . . 1.6.2 Global contour length histogram max . . 1.6.3 Global contour length - Force plot . . . . 1.6.4 Global delta Lc histograms (and plots) . . 1.6.5 Global peaks (or Global Matrix) . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 9 10 11 12 14 15 18 18 19 20 20 21 21 21 22 22 23 23 24 2 CONTENTS 1.6.6 Global persistence length 1.7 Graphical features . . . . . . . . 1.7.1 Density plots . . . . . . . 1.7.2 Parameters Box . . . . . . 1.7.3 Export plot . . . . . . . . 1.8 Selected & Valid Traces . . . . . 1.9 Trace filtering: basic tool . . . . 1.10 Align to zero . . . . . . . . . . . 1.11 Manual Alignment . . . . . . . . 1.12 Export Peaks Info (.txt, .xls) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Advanced Tools 2.1 Unfolding Pathways Analysis . . . . . . . . . 2.1.1 Group division . . . . . . . . . . . . . 2.1.2 Path Plot . . . . . . . . . . . . . . . . 2.1.3 Combined Path Plot . . . . . . . . . . 2.1.4 Force-Lc segmentation plot (Thoma et 2.2 Fingerprint ROI . . . . . . . . . . . . . . . . 2.3 Automatic Alignment . . . . . . . . . . . . . 2.4 Selection through correlation . . . . . . . . . 2.5 Structural heterogeneity determination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 25 25 25 27 28 29 31 31 32 . . . . . . . . . . . . al.) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 35 35 36 40 40 42 43 47 50 . . . . . . . . . . . . . . 51 52 52 . . . . . 52 53 54 55 56 3 Membrane Mechanics 3.1 Import/Open breakthrough force curves . . . . 3.1.1 Import (plain format) . . . . . . . . . . 3.1.2 Open extension curves from AFM files (only .txt exported from JPK, see 1.1.2) 3.2 Pre-processing . . . . . . . . . . . . . . . . . . 3.3 Analysis of the Breakthrough forces . . . . . . 3.4 Enhance Breakthrough forces detection . . . . 3.5 Export Breakthrough forces . . . . . . . . . . . Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Preface Fodis (for “Force-distance software”) is a program for the analysis of Single Molecule Force Spectroscopy data. It is mainly intended for automated analysis of force-distance curves of Protein Unfolding Events obtained during Atomic Force Spectroscopy experiments. However, it can be used for visualization and organization of any force-distance curve. Fodis provides a large number of force-curve visualization option, including function for basic statistical analysis. Since the developers are active AFM users, the program also contains some specific data processing methods intended for protein unfolding studies. Fodis is Free and Open Source software, covered by CC Public License. It was born in Matlab and we encourage thirdparties to participate in developing functions and modules. Being a free software, it provides the source code to developers and users, which makes easier both the verification of its data processing algorithms and any further program improvements. Fodis was written in Matlab 2016b. 3 4 Installation Fodis can be downloaded from https://github.com/nicolagalvanetto/ Fodis. Users need to • unzip Fodis folder and move it to the Matlab Current Folder • right clik on the folder and select Add to path folder and subfolders • type Fodis in the Command Window and Enter Fodis requires Matlab 2016b of newer versions. It also requires: • Signal Processing Toolbox • Statistics and Machine Learning Toolbox • Image Processing Toolbox Non-Matlab users working on Windows, Mac or Linux need to launch the installer of their own version and follow the instructions. Please read the README.txt file before installation. For any problem related to the software, please use GitHub issue section https://github.com/nicolagalvanetto/Fodis/issues. 5 6 Nomenclature TSS Tip Sample Separation: distance between the sample and the tip Lc Contour Length: length of the stretched polymer (segment) Fc Force: always referred in combination with Lc F-d Force-distance (curve): a SMFS trace p Persistence length GHM Global Histogram of the Maxima CNG Cyclic Nucleotide Gated channel 7 8 Chapter 1 Basics In this chapter we present the basic working rules of the software and how to use it. 1.1 Software Framework The general architecture of Fodis and the relationships among the various blocks is shown in this flow chart. An ideal working session is here presented. After the collection of 1000 Force-distance curves the user imports these curves in Fodis. The gray boxes represent the input options (described in the section 1.2 and 1.3: the data are imported as they are. If the imported data are in Volts (rather than in Newton) or in the absolute height (rather than in Tip Sample Separation(TSS)), it is possible to transform the force and position units with the transformation module (section 1.4). Then, the traces can be aligned, filtered and analyzed with all the tools available in 9 Fodis. Imagine a set 1000 consecutive Force-distance curves from a purple membrane patch (like in the famous experiment by Oesterhelt et al. [1]). The dataset contains many flat curves (without unfolding peaks), some traces with only non-specific adhesion, some traces with random peaks and some traces with real bacteriorhodopsin unfoldings. The basic filtering tool (section 1.9) and the selection through correlation (section 2.4) can be used to refine the selection of traces and go from 1000 to 10-20 traces. The remaining traces can be aligned with the manual alignment (section 1.11) or the automatic alignment (section 2.3). The user can then finalize his analysis with the remaining tools, focusing on single traces or to population statistics. 1.2 Open data There are three modalities to import data and start working with Fodis. • Import traces uploads matrices of numbers. This is the most universal import option in Fodis (see example below). • Open samples uploads folders containing multiple files of single traces. The format allowed is listed in the GUI menu. • Load session uploads .afm files previously generated by Fodis through Save session. These files contain not only the saved dataset, but also the specific configuration and parameters set in the previous working sessions. 10 1.2.1 Import traces (.txt files in Plain Format) To import .txt files follow these rules: • The .txt file must be organized in columns. The first column contains the force values of the first trace in Newton, the second column contains tip-sample-separation values (or polymer extension) in meters, and so on. • Columns are space-separated. • By convention, Fodis keep repulsive forces negative, attractive forces positive. • If different traces have different number of points, the number of rows of the .txt will be equal to the number of point of the bigger trace. Zeros must be added at the end of short traces to fill the gap (see Figure). N.B. If the user loads single traces (one couple of columns) with different number of rows, Fodis will manage the data anyway. The exported file will be created following the aforementioned rules. 11 1.2.2 Open samples Open-sample user interface is designed to help the user uploading JPK or Bruker files. First select the desired format on the Select format menu and then Open Folder. An example of the various files can be found in additional datasets. • JPK .txt: ASCII exported .txt files from JPK Data processing • JPK .jpk-force • Bruker .txt: with or without Header, ASCII exported in Display Units (Fodis reads the metric prefix of the units) • Bruker .spm (only for Windows in non-deployed versions): Bruker raw files cannot be decoded without their proprietary .dll libraries compiled in user’s machine. Fodis can open .spm and .xyz (.084, .017, ...) files, but it requires some additional installation steps. 1. Install Microsoft Windows SDK (Software Development Kit) https://www.microsoft.com/en-us/download/details.aspx? id=8279. 2. Install the Matlab compiler MinGW-w64: from Matlab Home > Environment > Addons and search MinGW-w64 C/C++ Compiler or download it from https://it.mathworks.com/matlabcentral/ fileexchange/52848-matlab-support-for-mingw-w64-c-c++-compiler 3. Install Nanoscope 64 bit: ftp://anonymous@sboftp.bruker-nano.com/outgoing/ GPTech/Software/NanoScope_Analysis_x86_v180r2sr3.exe 32 bit: ftp://anonymous@sboftp.bruker-nano.com/outgoing/ GPTech/Software/NanoScope_Analysis_x64_v180r2sr3.exe 12 4. Install the AFM-Toolbox (chose custom and your Windows version) ftp://anonymous@sboftp.bruker-nano.com/outgoing/ GPTech/Software/AFM_MATLAB_Toolbox_Setup_for_NanoScope_ Analysis_v180r2sr3.exe and follow the instruction to add the folder to the path. For more information please contact fodis.help@gmail.com, and for newer versions of Bruker’s software contact support.afm@bruker. com. 13 1.3 Data saving Any time, during a working session it is possible to save the data in two different modalities (File > ...): • Export traces generates a .txt of the same format presented in the Import traces section. It is highly recommended to use this format for portability reasons. • Save session generates a .afm file that can be uploaded only by Fodis through the Load session. These files contain not only the current dataset, but also the specific configuration and parameters set during the working session. In case only partial information is needed, it is possible to Export plots or Excel tabels (Tools > Export ...): • Export plot generates a .fig, or any other file format for figures of the current view. • Export ‘Superimpose traces’ XY file generates a .txt two columns file with all the points of the selected traces one after the other (column1=TSS [m] ; column2=Force [N]). • Export Peaks Info (comma separated values .txt) generates a text file of comma separated values, that can be imported in Excel. The file contains the list of selected traces and the relative Force, Position and Slope of the peaks. See details in section 1.12. • Export Peaks Info (Excel .xls) generates a .xls file containing the list of selected traces and the relative Force, Position and Slope of the peaks. (Due to Matlab-Excel poor compatibility this process is slow). See details in section 1.12. 14 1.4 Data pre-processing To correctly analyze a Force-distance curve in Fodis, it must have in the x-axis the Tip Sample Separation in meters (TSS) with the contact point on the left, and in the y-axis the force in Newton (baseline corrected). The imported data may not be ready for the analysis (wrong units, TSS transformation needed, etc). To pre-process the curves please use this module that allows to perform 6 different type of transformations (go to Tools > Pre-Processing (shift, TSS, Baseline)). By default, no transformation is active, but a message pops up suggesting the necessary transformations that needs to be performed. 1. Geometrical Adjustment: the mirroring option inverts the values of the Force. The rotation option swaps the coordinates. N.B. the curves must have the contact point on the left, and the unfolding forces positive — tip-sample contact negative (as shown in the figure). 2. X-axis Adjustment: the x-axis values are multiplied by the constant Height multiplier (which is automatically read in the import 15 files when present). This is necessary when the imported files contain only the non-corrected Height channel (e.g. in JPK files). 3. Y-axis Adjustment: if the vertical deflection of the cantilever is imported in Volts, the curves need to be rescaled with the Sensitivity constant. If the vertical deflection of the cantilever is imported in meters, the curves need to be rescaled with the Spring constant. (The constants are automatically read in the import files when present). 4. Subtract Baseline: if the Force values need to be vertically shifted, this option finds the baseline averaging the final part of the curve (red patch). The starting point and the ending point can be changed in the edit boxes. The linear fit subtraction interpolate the final part and allows to tilt the curve (Warning: curves with peaks within the selected region will be shifted in an inconsistent way). 5. Find contact point finds the first positive value (starting from the left). 6. From piezo height to TSS: it transforms the x-coordinate in tip 16 sample separation (TSS). (The constant is automatically read in the import files when present). N.B. These transformations act on all Valid curves (see section 1.8). Mark the curves as Not Valid to skip the transformations. 17 1.5 Single trace representation - Plot data In this section we present all possible trace representations of single events implemented in Fodis. The plot-box at the top-left corner controls the graph displayed in the central panel. There are two major groups among these graphical representations: single trace representations and global representations of all selected traces. In the view-box, all the parameters necessary to control plotting range, marker size and grid are present. To perform Export plot click on tool in the menu bar: this creates a Matlab figure containing all the objects present in the Fodis graphic panel. N.B. Use the button Find my trace in case no trace is visible in the screen. 1.5.1 Traces Typical Force-tip sample separation curve: exact representation of input numbers. The slide bar at the lower part of the panel allows to move across different loaded traces. 18 Trace smoothing performed through a discontinuity-preserving filter (Savitzky-Golay algorithm). K is the polynomial order ot the interpolant curve while F is the number of points used for interpolation (F must be an odd number). The smoothing acts on the trace shown in the screen when the check-box is checked, after clicking the button Update. The button Update Sel performs the soothing onto all Selected curves (see section 1.8). To remove the smoothing, uncheck the check-box and Update. 1.5.2 Traces-Lc (Peaks automatic detection) Traces-Lc finds relevant peaks and fits them with the WLC equation. Peak detection is enhanced on smoothed traces. The numbers reported in Lc for Analysis are all detected contour lengths (in nm) of plotted rainbow curves. In case a peak is not detected, type the number in the edit box and then Enter or click the button (red arrow) then click the peak in the graph, and then Enter. That new Lc will be saved for all the following analyses. N.B. The detection of the peaks is based on the Force-Lc profile shown in section 1.5.4 (cyan line). The profile can be tuned acting on the parameters Threshold N points and Min peak proximity of the parameter box (section 1.7.2) to enhance peaks detection. 19 1.5.3 Dynamic persistence length The persistence length, which is typically kept fixed (p=0.4nm for a.a. chains), in dynamic mode is relaxed to find the best p-Lc couple for each peak. The final parameters are plotted in the legend. 1.5.4 Contour length (Lc,Fc) This representation transforms every point above a threshold force (in this case 30 pN, tunable in Parameters box Min F) from F-tss space to FLc space according to the transformation presented by Puncher and colleagues [2, 3]. N.B. The detection of the peaks is based on the Force-Lc profile shown (cyan line). The profile can be tuned acting on the parameters Threshold N points and Min peak proximity of the parameter box (section 1.7.2) to enhance peaks detection. 20 1.5.5 Contour length histogram This is the histogram of Contour length (Lc,Fc) representation. It is also called Barrier position histogram [2]. The red bars represent the detected peaks. Bin width can be tuned in the Parameters box. 1.5.6 Delta Lc histograms Delta Lc histograms shows the value of the difference in Lc between two consecutive peaks. The first peak’s DeltaLc, by definition, is equal to the first Lc. 1.6 Global trace representation Single molecule force spectroscopy experiments generate a large amount of data. Single trace representations are useful to establish the quality and features of single events, but only a statistical evaluation of Lc distribution or other specific features allows to perform a quantitative analysis. For this reason, Fodis includes a variety of global representations for the population of traces. 21 1.6.1 Global contour length histogram The Global contour length histogram is the sum of all contour length histograms, normalized on the highest bin. Bin size can be tuned in the Parameters box. 1.6.2 Global contour length histogram max The Global contour length histogram of Maxima (also called Contour length histogram by Kawamura and colleagues [4]) counts the detected peaks of all selected traces (i.e. red bars shown in Contour length histograms section). The resulting distribution can be fitted with the Gaussian Mixture Model (GMM) checking Tools > Automatic multi Gaussian option on the menu bar. The GMM can be re-calculated with the button Update in the Parameter box (the fitting procedure produces variable results, so it may be useful to try different times). This representation is useful to determine the probability of a certain unfolding event happening along the Lc coordinate. (Bin size can be tuned in the Parameters box). 22 1.6.3 Global contour length - Force plot The Lc - Force plot shows the contour length vs Force of all detected peaks. This plot can be used also to detect the detachment force in binding experiments (e.g. with antibodies). If the user is interested on the force distribution of a particular subgroup of peaks (i.e. black rectangle), using Select Lc-F button it is possible to select a region (draw a rectangle and double click inside de area). A force distribution of the selected points will be automatically generated. 1.6.4 Global delta Lc histograms (and plots) Delta Lc is defined as the Lc difference between two consecutive peaks. On the left, histogram of all Delta Lc values of the population. N.B. Right clicking on the graphs allows the user to change the view to Delta Lc-Fc or Lc-Delta Lc. 23 The Lc-Delta Lc plot shows the scatter plot of all detected peaks. If the user wants the Delta Lc distribution of a particular subgroup of peaks (i.e. red rectangle), using Select dLc button it is possible to select a region (draw a rectangle and double click inside de area). A Delta Lc distribution of the selected points will be automatically generated. 1.6.5 Global peaks (or Global Matrix) Automatically detected peaks are used to form a string of ones and zeros for each trace (zeros where there are no peaks, ones where there are a peak). It is a sort of binarized coding for the traces. Each string is therefore plotted one above the other in Global peaks with white pixels for zeros and black pixels for ones. 1.6.6 Global persistence length All the peaks of the population are fitted with two free parameters persistence length (p) and Lc as described in Dynamic persistence length- and plotted. This allows to find correlations between the peaks positions and their curvature(i.e. p). 24 1.7 1.7.1 Graphical features Density plots A heat map is plotted when the Density plot check box is selected. The image is constituted by pixels (Resolution) and the color of the pixel is proportional to the number of points contained in the pixel area. Ratio reference changes the heat map color limits. The Points Density plot highlights each point with the color of the corresponding pixel. Density plot is available for: • Superimpose Traces • Superimpose Lc • Global delta Lc histograms (Lc - Delta Lc) 1.7.2 Parameters Box Many parameters can be set to tune both graphical views and threshold limits (parameters not described here will be discussed later). • Min F: minimum value of force valid for peaks detection or Lc evaluation; • Max F: maximum value of force valid for peaks detection or Lc evaluation; • Min peak proximity: minimum interval for automatic peaks detection (it tunes the sharpness of the cyan line in Contour length (Lc,Fc) plot, section 1.5.4); 25 • Threshold N points: parameter related to peaks detection. The automatic peaks detection operate on the Contour length (Lc,Fc) plot (section 1.5.4). N is the number of points that are removed from each bin (Lc bin size) after the transformation in Lc-Fc, to reduce the detection of spurious and isolated peaks caused by noise fluctuations. N.B. This number highly depends on the sampling rate of the curves. A heuristic rule could be to chose this value proportional to the total number of points of the curve: T hreshold N points ∼ T otal P oints N umber ; 1000 N.B.2 Breakthrough force detection [for Membrane studies (see section 3.4)]. • Max Lc: maximum value of Lc taken into account; • Min Lc for Delta Lc: minimum value of Lc taken into account for Delta Lc plots; 26 • Max Lc for Delta Lc: maximum value of Lc taken into account for Delta Lc plots; • Lc bin size: bin width of Lc histograms or Global Lc representations; • Fc bin size: bin width of Force histograms or Global Force representations; • Delta Lc bin size: bin width of Delta Lc histograms or Global Delta Lc representations; • Persistence Length p: fixed value of persistence length for Lc evaluation; • p step (free p): minimum step when evaluating free persistence length (dynamic persistence length); • Min p (free p): minimum value for dynamic persistence length algorithm; • Max p (free p): maximum value for dynamic persistence length algorithm; • Lc for Analysis: detected Lc values of a current trace (they can be edited); • Update: after parameters changes, click this button to update displayed results. 1.7.3 Export plot Every plot can be extracted and saved individually in Matlab format .fig, or any other file format for figures, from the menu bar Tools > Export plot. It is also possible to export the Superimpose traces plot as a single X-Y .txt file with Export ’Superimpose traces’ file. 27 1.8 Selected & Valid Traces Loaded traces have a visibility label. There are only three possible labels: • Selected • Valid • Not Valid By default, loaded traces are all Selected. Global representations act on Selected (that is a sub-group of Valid). All Valid traces are displayed moving the slide bar. Not Valid traces are not shown. Filtering procedures described in the next section and chapter, filter out traces labeling them as Not Valid. Details Box shows the numbers: 85 Selected, 101 Valid, 122 Valid plus Not Valid, i.e. all loaded data. Trace Selection Box allows to edit the list of Selected traces. The Remove[r] check box removes the current trace from the selection (to Valid but not Selected). Offset add a shift to all selected traces. Selected Traces edit box lists the indexes of Selected curves. It allows also, by changing the list, to add or to remove traces from Selected group. 28 In the Filter menu bar the user can manage the visibility labels of loaded data: • Move ALL to valid: all Not Valid are labeled Valid; • Move all Valid to Selected: all Valid are labeled Selected; • Move not-Selected to not-Valid: all Valid but not Selected are set Not Valid; • Swap Valid and not Valid: all Valid, therefore also Selected, are labeled Not Valid, and vice versa; • Swap Selected and not Selected: all Selected are labeled Valid, and vice versa; Not Valid remains unchanged. 1.9 Trace filtering: basic tool SMFS experiments typically generate a large amount of data. In this section we present a basic filtering tool that allows the user to select traces upon two characteristics: points position in F-tss space and peaks position. Traces that do not satisfy desired requirements will be labeled as Not Valid. 29 The top panel displays single traces. On this panel, the user can draw two kinds of rectangular ROIs: mandatory regions (green) and forbidden regions (red). Green ROIs define Force and tss intervals in which at least one point of the trace must fall into, otherwise the trace is filtered out. Red ROIs define Force and tss intervals in which no point of the trace can fall into, otherwise the trace is filtered out. The ROIs are equal for all the traces. To draw a ROI, the user needs to click on Draw Rectangle ROI and then move on the panel and click&drag; to erase a ROI, just click Erase ROI. Multiple ROIs can be drawn. The bottom panel displays the single contour length histograms in red and the detected peaks in black. Color limits are set by the four edit boxes on the right. The Blue section defines the area in which at least one peak of the trace must fall, otherwise the trace is filtered out. The Pink section defines the area in which the last peak of the trace must fall, i.e. no peaks on higher Lc values, otherwise the trace is filtered out. N.B. We suggest to perform this filtering procedure with traces smoothed with Savitzky-Golay algorithm in order to enhance peaks detection. Once all the desired limits have been set, Compute Filter finds the traces 30 to be removed and Update Valid Traces updates the Not Valid group. 1.10 Align to zero Loaded traces can be all aligned to a zero position. There are two zero definitions: • Align to zero • Align to first positive force Align to zero finds the average value of tss corresponding to negative forces between -200pN and -400pN, and translates each trace accordingly. Sometimes this procedure may not work because the contact phases are not homogeneous: in this case Align to first positive force sets to zero the tss corresponding to the first positive value of Force of each trace. 1.11 Manual Alignment 31 Manual Alignment tool (accessible from the menu bar Align > Manual alignment) allows the user to slightly refine the relative shift among valid traces, typically caused by misalignment of the base line. The graphical panel shows three traces of different colors: • Yellow - chosen trace, its number is visible in the Frame text box; • Green - template trace, editable in the Template edit box; • Pink - trace(s) saved on screen, Keep on screen check box. Traces can be moved only horizontally, both with the Translate slide bar and clicking&dragging on the graphic panel. Only chosen trace (yellow trace) can be translated. To move across valid traces, use Frame slide bar. Select box contains all the information related to Selected traces, the chosen trace can be removed editing the selected edit box or by clicking on Remove trace check box. View box allows to decide the trace representation and to set an Lc of reference. 1.12 Export Peaks Info (.txt, .xls) The following description is valid for both .xls files and comma separated .txt files (imported in Excel). The files generated in Tools > Export Peaks Info contain the geometrical information of Selected traces. • The first column contains the reference number of the Selected traces. 32 • Contour length (m): the first group lists the contour length of each peak identified in the traces. The parameters for peaks identification can be tuned as explained in section 1.5.4. • Force (N): the second group lists the rupture force of the peaks reported above. • Slope keq (nN/nm): the third group lists the slope calculated on the last 20 points before the rupture. This value can be used to calculate the loading rate before the rupture re as follows [7]: re = ksamp · v 1+ ksamp kcant , where ksamp = keq · kcant kcant − keq where keq is the slope, kcant is the spring constant of the cantilever and v is the pulling velocity. 33 Chapter 2 Advanced Tools In this chapter we describe the tools implemented in Fodis. These tools were developed in order to solve some common issues in SMFS experiments, like clustering and alignment. 2.1 Unfolding Pathways Analysis Given a population of traces that represent the unfolding of a certain protein, an important issue in SMFS is to determine how this unfolding happens. Global representations presented in Chapter 1, for instance, are very useful to generate statistics in an aggregated fashion. Here, with Unfolding Pathways Analysis we intend to present three representations that, in spite of being global, show unfolding pathways in a disaggregated fashion. 2.1.1 Group division The first step in an Unfolding Pathways Analysis is to identify different groups of traces depending on the position of their force peaks. 35 The idea is to generate a graphical representation of the different unfolding pathways of a set of traces, and cluster them. Starting from Global Histogram of Maxima (GHM), we can divide the Lc coordinates in intervals (different colors). The extremities of each interval can be set in the Path Interval edit box. Intervals cannot overlap. Auto Intervals automatically generate the intervals writing the corresponding string. The borders are identified as the minima of the Gaussians sum generated over the GHM (section 1.6.2). Then, the string can be edited following this simple rule: min1-max1,min2-max2,... in nm; max(n)≤min(n+1) The intervals can be also chosen equally spaced by inserting only one integer: N : divide Lc coordinates in N equal intervals. Once the intervals have been chosen, the user can update the group division by clicking on Update Grouping. Then checking Enable Group Division it is possible to move across different resulting groups with the menu Show Group Nr. 2.1.2 Path Plot The Path Plot is a representation that clusters a population of traces depending on the position and on the number of Force peaks. Group divi36 sion, and resulting Path Plot generation (Path Analysis > Path Plot), is based on the following 5 steps. Traces are fitted with the WLC model, transformed into Lc histograms and grouped in the Global Histogram (or GHM). The Global Histogram is divided into intervals according to the main ensembles of occurring events, to partition the Lc coordinates in regions with distinct maxima. Following the aforementioned rule, in this explanatory panel, we selected 5 intervals 90-118,118-155,155-200,200-247,247-285nm. 37 On the base of this division, each trace is coded in a binary string of 5 digits: 0 is assigned if no force peak is detected within the interval, 1 is assigned if at least one event is detected. From each string, we created two additional sequences: # and Pn. # is the sequence referred to the order of appearance of the force peak along the trace (in a trace with 2 peaks, the 1st peak has #=1 and the 2nd peak has #=2). Pn refers to the interval position occupied by a peak (a peak that falls within the 3rd interval has Pn=3). Traces so coded are finally plotted as broken lines into an orthogonal #-Pn space, the line-width is proportional to the number of traces that follow the same path. This method provides a representation to distinguish different unfolding behaviors/clusters of a given set of traces, based on the number and position of occurrence of unfolding events. Path Plot of a larger population The previous example shows how the Path Plot is constructed, now we present how it looks when generated on a larger dataset (CNG dataset). 38 Each line width is proportional to the number of traces that cover that specific path. The size of the point at a node is proportional to the number of traces that pass through that node. By Enabling Group Division we can select only one group, the 18th in this case. We can see the peaks distribution and relative Path Plot of 7 traces that has no peak in intervals 3 and 4, indeed, in the Path Plot representation, it does not have any node in the 3rd and 4th intervals. 39 2.1.3 Combined Path Plot Similar to the Path plot, the Combined Path Plot displays the #-Pn space but with no color distinction. Each segment-width is proportional to the absolute number of traces that travel over that particular segment path. 2.1.4 Force-Lc segmentation plot (Thoma et al.) Thoma and colleagues, in 2017, presented a different Unfolding Pathways Analysis based on the Contour length position and average Force of Force peaks [6]. In this representation, the points coordinates 40 are the average Force and Contour length of each different class (intervals of different colors in the figure below); the segments connect the points showing the distribution of the unfolding pathways. The groupdivision implemented in Fodis is based on the partition shown in section 2.1.1. This representation can be generated clicking (Path Analysis > Force-Lc segmentation plot). The radius of the points is proportional to the number of Force peaks that are present in the relative intervals; the width of the segments is proportional to the number of Force-distance curves that show that specific point-to-point pathway. N.B. This analysis is enabled only after setting the intervals (section 2.1.1). Global force plot and group division Thoma plot relative to the above population of peaks 41 2.2 Fingerprint ROI Fingerprint ROI is a tool intended to find specific pattern over the population of traces. It is accessible from the menu bar Selection > Fingerprint ROI. This graphic interface shows the plot panel and the slide bar. The general idea behind this tool is to evaluate how many points of a trace fall into the drawn area (pink area). By clicking the Brush area button, move the mouse cursor on the graph and finally click&drag to draw a continuous ROI. The user can draw as many ROIs as needed just by clicking the button Brush area for any new ROI. The Delete area button deletes all ROIs previously drawn. Once all desired ROIs have been drawn, the evaluation can start. Lag allowed edit box sets how many nm each trace is shifted from its starting position. The maximum value of intersection between ROIs and a trace is saved and displayed with Compute similarity. The Threshold score edit box sets the threshold value of the score: the Update Traces button filters out all the traces that score lower than the threshold value and then they are available in the main Fodis interface. 42 2.3 Automatic Alignment The Automatic Alignment (menu bar Align > Automatic Alignment) is a tool intended to solve the critical and highly operator-dependent issue of traces alignment. Align to a reference trace The Align to this trace option takes the contour length histogram of the current trace (the trace in the view box) and it calculates the cross correlation of all contour length histograms against the current trace. After this process, all the traces will be horizontally shifted to maximize the cross correlation against the current trace. Reference-free Alignment This tool is based on the work of Bosshart and colleagues [3], with some specific improvements developed to extend its applicability to more general SMFS datasets. In particular, they proposed an algorithm consisting of 4 steps. Starting from the contour length histogram of every trace, they: • subdivided the traces into groups of homogeneous traces (i.e. traces with the same number of peaks); • recursively aligned traces into the same group with the maximumcorrelation principle, building an average contour-length reference for each group; • formed a global reference; • aligned all the traces of the dataset to the global reference. To this remarkable computational procedure, we added two features that help to manage a larger variety of traces that do not display regular occurrence of peaks. 1. In addition to the contour-length histogram, we assign to every trace a zero-point, that is the point of tip-sample contact. Given the correlation curve of two traces, we then multiply the correlation curve with 43 a Gaussian curve centered at the point in which the zero-points of the two traces match with each other (red Gaussian curve). The idea is to apply a “potential well” to reduce the maximum displacement of the two zeros. 2. Group division proposed by Bosshart and colleagues works only if all the traces with the same number of peaks have the peaks in the same position, but this is not generally true for F-D curves of the same protein. Therefore, we used a group division following the method described in the Path Plot section. In this way, we imposed two constrains for a given trace to be part of a given group: to have a specific number of peaks and to have those peaks in a specific position. N.B. Group division is done as shown in Path Plot section, with custom or equally-spaced intervals. The user performing Automatic Alignment (menu bar Align > Automatic Alignment) in Fodis is guided across the aforementioned steps with the help of two Graphic Interfaces. Graphic interface 1 for global reference construction 44 The first interface helps the user to set all relevant parameters for the formation of the global reference. The Histogram Customization box is crucial to determine the shape of the histograms to be aligned. Default parameters are generally adequate, but in case of some specific requirements, the user is allowed to change them. The Alignment Valid range edit boxes set the limits over which histograms are set to zero. Gaussian std. sets the standard deviation of the Gaussian ”potential” and Nr. of Iterations sets how many iterations the algorithm executes: iterations are time consuming, but only one iteration generally gives unstable results. Use the button Align! to execute the algorithm, Skip if first alignment has already been performed thus to go directly to the second Alignment interface. Graphic interface 2 for global reference tuning The scope of the second alignment step is to prepare the desired global reference (i.e. the reference histogram onto which every single histogram is finally aligned). Default parameters should be adequate, but for specific requirements, it is possible to change some settings. The tuning process consists of: • tuning shape of the blue profiles (i.e. group references, resulting his45 tograms from alignment intra-group). This can be done in the Group Histograms Customization box moving through different groups with the slide bar. N.B. Group division is done as shown in the Path Plot section, with custom or equally-spaced intervals; • tuning shape of the red profile (i.e. global reference) and pink profile (i.e. binarized reference). This can be done in Reference Histogram Customization box. The global reference is the weighted average of group references. The binarized reference is based on the global reference. It contains peaks found in the global reference, but they are leveled and normalized. Align to Global-Custom Reference to finally re-align all the traces to the reference. 46 2.4 Selection through correlation A common problem in SMFS is finding and isolating relevant traces among all collected data. The Basic Filtering Tool described in Chapter 1 allows the operator to filter out traces longer or shorter than the expected ones; the Fingerprint ROI tool, described in this Chapter, looks at a specific pattern drawn by the user and can list the traces accordingly. All these tools need the intervention of the user to choose some specific interval or shape, in contrast with the purpose of an unsupervised filtering procedure. The Selection through correlation (Selection > *) calculates the cross correlation between every couple of loaded (Valid) traces forming a symmetric similarity matrix. This matrix can then be sorted (ordered) with the MATLAB function symamd (symmetric approximate minimum permutation algorithm). The slide bars change the visualized contour length histograms. The blue curve in the top-right panel is the calculated cross correlation of the two visualized histograms of trace 1 and trace 2. The diamond points the maximum value, the red area is the Gaussian curve that weights the correlation (as described in Automaic Alignment). 47 • Single trace correlation button displays the values of correlation relative to trace 1 (the one shown in the upper panel); • Sort Correlation Matrix applies the symmetric approximate minimum permutation algorithm to the similarity matrix (i.e. it changes the order of the traces to find clusters of similar traces); • Global Correlation shows the similarity matrix ; • Sigma determines the width of the Gaussian curve that weights the correlation curve; • Min is the threshold value for the similarity matrix ; • Max Lag is the maximum lag allowed when calculating the correlation between two traces. Similarity matrix Sorted similarity matrix At this point the user has two options: he can decide to select a specific cluster in the sorted similarity matrix, or he can select traces similar to the reference trace 1 (the trace visualized in the upper panel). • Manual Select Valid Group button: the sorted similarity matrix pops up and, with the cursor, the user can select a cluster of traces 48 (click-drag&drop plus double-click on the segment). The operation can be performed multiple times just by clicking again the button Manual Select Valid Group. The Delete button deletes the selection. The selected group of traces (identified by a X) will be visible in the main interface only after clicking Update traces. If the selection does not satisfy the user, he can Filter > Move ALL to Valid and redo all the process. • Select Traces above Min corr Value button allows to select the traces above the Min threshold, that are similar to the reference trace (trace shown in the upper panel). The selected traces are previously visible clicking Single trace correlation button. A further refinement of the calculation of the correlation is possible tuning the contour length histograms with the options: • Filter trace applies a smoothing filter to the contour length histogram of every trace; • Gaussian substitution substitutes a sum of Gaussian curves to each contour length histograms. It detects the peaks with the parameters on the right. 49 2.5 Structural heterogeneity determination Hinkzewski and colleagues in 2016 found a function that give information about the structural heterogeneity of a protein [5]. This function Ω can be determined performing SMFS at different pulling speed. To perform this analysis in Fodis go to Tools > Structural heterogeneity. Global Contour length-Force plot (section 1.6.3 ). The user have to select the region of interest containing the Force peaks, and double click inside the region. Fodis will compute Ω. Then the user have to repeat the procedure to the same Force peaks obtained at a different pulling speed, saving and comparing the Ω graphs. Ω function of the force 50 Chapter 3 Membrane Mechanics In this chapter we describe the tools for the analysis of breakthrough force: • determination of the Breakthrough force of a membrane; • 2D mapping of the Breakthrough force of a membrane; • line tension; • spreading pressure. 51 3.1 3.1.1 Import/Open breakthrough force curves Import (plain format) From File > Import traces you can open the curves in plain format (see section 1.2.1). 3.1.2 Open extension curves from AFM files (only .txt exported from JPK, see 1.1.2) From File > Open samples you can choose to open the extension curves. Select the folder with the data and Import Traces. 52 3.2 Pre-processing First set the baseline to zero and then align to the maximum positive. You could also transform the x axis to Tip-sample-separation. N.B. If your force is negative (upside down) use Mirror (F -> -F) since the tool for membrane mechanics only recognizes the breakthrough forces in the positive region. Then you can align the traces to the surface position with Align to maximum value. 53 3.3 Analysis of the Breakthrough forces Once all the curves are loaded and aligned go to the menu bar for Membranes > Force Break Membrane. Fodis will generate the statistical analysis of all the valid curves. • Figure 1: Breakthrough force map (if the JPK .txt contain the position of the curves); • Figure 2: scatter plot of the detected Breakthrough forces (Force and position); • Figure 3: histogram of the detected Breakthrough forces (Force only; orange curve: curve described in [8]); • Parameters: set the experimental parameters of your experiment and then tune the surface energy S and the Line tension G in order to fit the distribution with the orange cure in Figure 3. The correct line tension and spreading pressure of your membrane are the ones that best fit the cyan distribution. 54 3.4 Enhance Breakthrough forces detection If the number of points in the previous histogram/scatterplot is too low (too high), it means that the detection of the Breakthrough forces of your set of curves is not optimal. So you probably need to tune the parameter Threshold N points which set the threshold for the prominence of the Breakthrough events. • Reduce Threshold N points and click for Membranes > Force Break Membrane again to increase the number of detected Breakthrough events. • Increase Threshold N points and click for Membranes > Force Break Membrane again to reduce the number of detected Breakthrough events. 3.5 Export Breakthrough forces You can export the values of position and Breakthrough Forces in a .txt file from for Membranes > Export position/ Break force. Column1=position [m] ; Column2=Force [N]. 55 Bibliography [1] Oesterhelt, F., et al. ”Unfolding pathways of individual bacteriorhodopsins.” Science 288.5463 (2000): 143-146. [2] Puchner, Elias M., et al. ”Comparing proteins by their unfolding pattern.” Biophysical journal 95.1 (2008): 426-434. [3] Bosshart, Patrick D., Patrick LTM Frederix, and Andreas Engel. ”Reference-free alignment and sorting of single-molecule force spectroscopy data.” Biophysical journal 102.9 (2012): 2202-2211. [4] Kawamura, Shiho, et al. ”Kinetic, energetic, and mechanical differences between dark-state rhodopsin and opsin.” Structure 21.3 (2013): 426-437. [5] Hinczewski, Michael, Changbong Hyeon, and D. Thirumalai. ”Directly measuring single-molecule heterogeneity using force spectroscopy.” Proceedings of the National Academy of Sciences 113.27 (2016): E3852-E3861. [6] Thoma, Johannes, et al. ”Maltoporin LamB Unfolds beta Hairpins along Mechanical Stress-Dependent Unfolding Pathways.” Structure (2017). [7] Bizzarri, Anna Rita, and Salvatore Cannistraro, eds. Dynamic force spectroscopy and biomolecular recognition. CRC Press, 2012.. 57 [8] S.Loi, G.Sun, V.Franz, H.-J.Butt, Rupture of molecular thin films observed in atomic force microscopy. II.Experiment, Phys.Rev.E66(2002)031602, https:// doi.org/10.1103/PhysRevE.66.031602 58
Source Exif Data:
File Type : PDF File Type Extension : pdf MIME Type : application/pdf PDF Version : 1.5 Linearized : No Page Count : 60 Page Mode : UseOutlines Author : NICOLA GALVANETTO Title : FODIS USER GUIDE Subject : Creator : pdflatex Producer : Latex con hyperref Create Date : 2018:11:01 13:26:55+01:00 Modify Date : 2018:11:01 13:26:55+01:00 Trapped : False PTEX Fullbanner : This is MiKTeX-pdfTeX 2.9.5900 (1.40.17)EXIF Metadata provided by EXIF.tools