-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathmodel.html
144 lines (115 loc) · 7.16 KB
/
model.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
{% extends "layout.html" %} {% block title %}Model{% endblock %}
{% block lead %}{% endblock %} {% block page_content %}
<style>
body {
background-image: linear-gradient(to bottom,
#b09e99 0%,
#fee9e1 20%,
#fad4c0 40%,
#64b6ac 60%,
#64b6ac 100%);
}
#float1 {
background-image: url("https://static.igem.wiki/teams/5346/doodle/model-1-removebg-preview-1.png");
}
#float2 {
background-image: url("https://static.igem.wiki/teams/5346/doodle/model-2-removebg-preview.png");
}
</style>
<div class="col-md-3 sidebar">
</div>
<div class="col-md-9 content">
<h1>Integrated Iterative Approach</h1>
<p>The proposed model, termed IMPROViSeD, adopts an integrated, multi-step iterative approach, which is well-aligned
with the concept of "integrative modelling." This approach mimics the scientific process—collecting experimental
data, proposing hypotheses, and refining models in an iterative manner.</p>
<p>The challenges of computing optimal orientations for subunits forming protein complexes, is broken down into
well-known mathematical problem of <b>"Graph-embedding in Euclidean space"</b> and <b>"Multiview registration"</b>
in computer
vision. This not only makes the pipeline robust, but helps us avert the computational overhead of starting with
numerous random starting orientations. We lay out the steps involved in the IMPROViSeD pipeline below.
</p>
<h2>Localization</h2>
<p>Using a supporting structure composed of C-alpha atoms, the model identifies
plausible alignments while maintaining distance constraints.</p>
<h2>One-Shot Registration</h2>
<p>The model performs a global alignment of the protein subunits to the
supporting structure, reducing computational cost compared to traditional pairwise alignment approaches that
accumulate errors.</p>
<h2>Flip Refinement</h2>
<p>By incorporating a mirror reflection step, the method corrects for potential
symmetry-related errors, ensuring biologically valid configurations.</p>
<h2>Iterative Conformation Generation</h2>
<p>The model generates multiple conformations to account for
the dynamic nature of proteins, enhancing the overall robustness of the resulting models.</p>
<p>This multi-cycle process allows for rapid convergence to a solution that fits experimental crosslinking data,
resulting in a highly refined model.</p>
<figure>
<img src="https://static.igem.wiki/teams/5346/iterations.png" id="flowchart" alt="" />
<figcaption>
Simplified Flowchart for IMPROViSeD. The subunits are oriented in multiple iterations. Each resulting from a
subset of crosslinks, followed by localization and registration. The iterations proceed in parallel. The structures without clashes while
satisfying the crosslinks gives final results.
</figcaption>
</figure>
<h1>Computational Efficiency</h1>
<p>The method addresses the common problem of computational cost by breaking the modelling into different cycles and
utilizing distance-preserving transformations through rigid body motions.</p>
<h2>Optimization on \(\mathbb{SE}(3)\)</h2>
<p>The algorithm takes advantage of the mathematical properties of \(\mathbb{SE}(3)\) to
efficiently compute rigid body motions, ensuring distance preservation without deformation.</p>
<h2>Semi-Definite Programming</h2>
<p>A non-linear optimization formulation known as semi-definite
programming is used to solve the registration problem, providing an optimal solution with reduced computation.</p>
<h2>Parallel Execution</h2>
<p>The iterative steps proceed in parallel, allowing the model to generate
multiple conformations simultaneously, reducing overall computation time.</p>
<h1>Practical Applications and Novelty</h1>
<h2>Handling Experimental Noise</h2>
<p>The model includes the uncertainty inherent in crosslink data,
effectively integrating noise into the optimization to create more biologically realistic structures.</p>
<h2>Global Alignment for Accuracy</h2>
<p>The global alignment of all subunits ensures that the final
configuration is accurate and optimally aligned.</p>
<h2>Generation of Hypothetical Contact Points</h2>
<p>The use of hypothetical supporting structures allows
for the exploration of multiple orientations and predictions of new interfaces, which is valuable for artificial
drug design.</p>
<h1>Experimental Validation and Refinement</h1>
<p>The proposed method was validated using experimental data from protein complexes Ribonuclease Inhibitor Complexed
with Ribonuclease A (PDB 1DFJ). The model showed that it could reconstruct protein structures in a single
alignment step and provided quantitative metrics like RMSD to verify the accuracy of predictions.</p>
<p>Additional energy minimization step ensures the thermodynamic stability of the final model, allowing
researchers to obtain reliable, biologically relevant conformations.</p>
<h1>Comprehensive Post-Processing</h1>
<p>The energy minimization after alignment ensures that the modelled protein structures are not only geometrically
accurate but also energetically favorable. This final refinement step makes the model output suitable for downstream
applications, such as structure-based drug design or functional analysis.</p>
<h1>Key Highlights for the Best Model Award Submission</h1>
<ol>
<li><strong>Unique Iterative Workflow</strong>: IMPROViSeD integrates localization, registration, flip refinement,
and parallel iteration—ensuring efficient and accurate protein modelling.</li>
<li><strong>Mathematical Rigor and Flexibility</strong>: The utilization of \(\mathbb{SE}(3)\) and semi-definite
programming not
only offers computational efficiency. Additional corrections ensure biological relevance of the structures.</li>
<li><strong>Practical Relevance</strong>: The ability to handle experimental uncertainty and generate biologically
plausible hypothetical interfaces positions this model as a valuable tool in biomedicine, especially in drug
discovery.</li>
<li><strong>Efficiency and Scalability</strong>: The parallel processing capability makes it scalable for modelling
large protein complexes, setting it apart from traditional modelling methods.</li>
</ol>
<h1>Conclusion</h1>
<p>In conclusion, the IMPROViSeD model represents a well-balanced approach combining accuracy, efficiency, and
practical relevance, making it a competitive choice for the iGEM 2024 Best Model Award. Its ability to handle the
complexities of protein structure determination with an efficient computational design provides a clear advantage,
especially in scenarios involving large datasets or where computational resources are limited.</p>
<figure>
<img src="https://static.igem.wiki/teams/5346/exp-images/1dfj-movie.gif" id="flowchart" alt="" />
<figcaption>
Visual comparison for the results generated by IMPROVeD with that available in the PDB repository (top left),
shown in similar orientations. The structure derived by our method is able to model new orientations while
satisfying the crosslink data.
</figcaption>
</figure>
</div>
{% endblock %}