|
Dartmouth College Computer Science Technical Report series |
CS home TR home TR search TR listserv |
| By author: | A B C D E F G H I J K L M N O P Q R S T U V W Y Z | |
| By number: | 2008, 2007, 2006, 2005, 2004, 2003, 2002, 2001, 2000, 1999, 1998, 1997, 1996, 1995, 1994, 1993, 1992, 1991, 1990, 1989, 1988, 1987, 1986 | |
Abstract:
Identifying residue coupling relationships within a protein family can
provide important insights into intrinsic molecular processes, and has
significant applications in modeling structure and dynamics,
understanding function, and designing new or modified proteins. We
present the first algorithm to infer an undirected graphical model
representing residue coupling in protein families. Such a model
serves as a compact description of the joint amino acid distribution,
and can be used for predictive (will this newly designed protein be
folded and functional?), diagnostic (why is this protein not stable or
functional?), and abductive reasoning (what if I attempt to graft
features of one protein family onto another?). Unlike current
correlated mutation algorithms that are focused on assessing
dependence, which can conflate direct and indirect relationships, our
algorithm focuses on assessing independence, which modularizes
variation and thus enables efficient reasoning of the types described
above. Further, our algorithm can readily incorporate, as priors,
hypotheses regarding possible underlying mechanistic/energetic
explanations for coupling. The resulting approach constitutes a
powerful and discriminatory mechanism to identify residue coupling
from protein sequences and structures. Analysis results on the
G-protein coupled receptor (GPCR) and PDZ domain families demonstrate
the ability of our approach to effectively uncover and exploit models
of residue coupling.
Note:
To appear in BIOKDD05.
Bibliographic citation for this report: [plain text] [BIB] [BibTeX] [Refer]
Or copy and paste:
John Thomas,
Naren Ramakrishnan, and
Chris Bailey-Kellogg,
"Graphical Models of Residue Coupling in Protein Families."
Dartmouth Computer Science Technical Report TR2005-535,
March 2005.
Want to be notified about new tech reports? Join our mailing list.
Want to search our technical reports?
Want us to mail you a paper copy of a report? Send your address and the TR number to reports AT cs.dartmouth.edu
Copyright notice: The documents contained in this server are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.