LNA: Fast Protein Classification Using A Laplacian Characterization of Tertiary Structure
IRISA-UBS, Universite de Bretagne Sud, France
hal-00639663, 2011
@article{bonnel2011lna,
title={LNA: Fast Protein Classification Using A Laplacian Characterization of Tertiary Structure},
author={Bonnel, N. and Marteau, P.F. and others},
year={2011}
}
In the last two decades, a lot of protein 3D shapes have been discovered, characterized and made available thanks to the Protein Data Bank (PDB), that is nevertheless growing very quickly. New scalable methods are thus urgently required to search through the PDB efficiently. We present in this paper an approach entitled LNA (Laplacian Norm Alignment) that performs structural comparison of two proteins with dynamic programming algorithms. This is achieved by characterizing each residue in the protein with scalar features. The feature values are calculated using a Laplacian operator applied on the graph corresponding to the adjacency matrix of the residues. The weighted Laplacian operator we use estimates at various scales local deformations of the topology where each residue is located. On some benchmarks widely shared by the community we obtain qualitatively similar results compared to other competing approaches, but with an algorithm one or two order of magnitudes faster. 180,000 protein comparisons can be done within 1 seconds with a single recent GPU, which makes our algorithm very scalable and suitable for real-time database querying across the Web.
January 18, 2012 by hgpu