Academic
Publications
Characterizing the Space of interatomic Distance Distribution Functions Consistent with Solution Scattering Data

Characterizing the Space of interatomic Distance Distribution Functions Consistent with Solution Scattering Data,10.1142/S0219720010004781,Journal of

Characterizing the Space of interatomic Distance Distribution Functions Consistent with Solution Scattering Data  
BibTex | RIS | RefWorks Download
Scattering of neutrons and x-rays from molecules in solution ofiers alternative approaches to the studying of a wide range of macromolecular structures in their solution state without the need of crystallization. In this paper, we study one part of the problem of elucidating three-dimensional structure from solution scattering data, determining the distribution of interatomic distances, P(r). This problem is known to be ill-conditioned; for a single observed difiraction pattern, there may be many consistent distance distribution functions. Due to the ill conditioning, there is a risk of overfltting the observed scattering data. We propose a new approach to avoiding this problem, accepting the validity of multiple alternative P(r) curves rather than seeking a single \best". We show that there are linear constraints that ensure that a computed P(r) is consistent with the experimental data. The constraints enforce smoothness in the P(r) curve, ensure that the P(r) curve is a probability distribution, and allow for experimental error. We use these constraints to precisely describe the space of all consistent P(r) curves as a polytope of histogram values or Fourier coe-cients. This description can then be used to sample the space of potential alternative P(r) curves. We use this description to develop a linear programming approach to sampling the space of consistent, realistic P(r) curves. In tests on both experimental and simulated scattering data, our approach e-ciently generates ensembles of such curves that display substantial diversity. In particular, we show that the ensemble of P(r) curves generated for a given protein includes members that are more difierent from a reference curve for that protein than are reference curves for proteins of other structural topologies. Thus subsequent reconstruction steps must properly account for this P(r) diversity in optimizing structural models.
Journal: Journal of Bioinformatics and Computational Biology - JBCB , vol. 8, no. 2, pp. 315-335, 2010
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.