[ Team LiB ] Previous Section Next Section

4.1 Pfam Example Flat File

Example 4-1 shows a Pfam flat file. This entry contains terms from the Pfam Field Definitions, discussed later in this chapter.

Example 4-1. Sample Pfam example
# STOCKHOLM 1.0
#=GF ID   14-3-3
#=GF AC   PF00244
#=GF DE   14-3-3 proteins
#=GF AU   Finn RD
#=GF AL   Clustalw
#=GF SE   Prosite
#=GF GA   25 25
#=GF TC   35.40 35.40
#=GF NC   19.10 19.10
#=GF BM   hmmbuild -f HMM SEED
#=GF BM   hmmcalibrate --seed 0 HMM
#=GF RN   [1]
#=GF RM   95327195
#=GF RT   Structure of a 14-3-3 protein and implications for
#=GF RT   coordination of multiple signalling pathways. 
#=GF RA   Xiao B, Smerdon SJ, Jones DH, Dodson GG, Soneji Y, Aitken
#=GF RA   A, Gamblin SJ; 
#=GF RL   Nature 1995;376:188-191.
#=GF RN   [2]
#=GF RM   95327196
#=GF RT   Crystal structure of the zeta isoform of the 14-3-3
#=GF RT   protein. 
#=GF RA   Liu D, Bienkowska J, Petosa C, Collier RJ, Fu H, Liddington
#=GF RA   R; 
#=GF RL   Nature 1995;376:191-194.
#=GF RN   [3]
#=GF RM   96182649
#=GF RT   Interaction of 14-3-3 with signaling proteins is mediated
#=GF RT   by the recognition of phosphoserine. 
#=GF RA   Muslin AJ, Tanner JW, Allen PM, Shaw AS; 
#=GF RL   Cell 1996;84:889-897.
#=GF RN   [4]
#=GF RM   97424374
#=GF RT   The 14-3-3 protein binds its target proteins with a common
#=GF RT   site located towards the C-terminus. 
#=GF RA   Ichimura T, Ito M, Itagaki C, Takahashi M, Horigome T,
#=GF RA   Omata S, Ohno S, Isobe T 
#=GF RL   FEBS Lett 1997;413:273-276.
#=GF RN   [5]
#=GF RM   96394689
#=GF RT   Molecular evolution of the 14-3-3 protein family. 
#=GF RA   Wang W, Shakes DC 
#=GF RL   J Mol Evol 1996;43:384-398.
#=GF RN   [6]
#=GF RM   96300316
#=GF RT   Function of 14-3-3 proteins. 
#=GF RA   Jin DY, Lyu MS, Kozak CA, Jeang KT 
#=GF RL   Nature 1996;382:308-308.
#=GF DR   PROSITE; PDOC00633;
#=GF DR   SMART; 14_3_3;
#=GF DR   PRINTS; PR00305;
#=GF DR   SCOP; 1a4o; fa;
#=GF DR   PDB; 1a37 A; 3; 228;
#=GF DR   PDB; 1a37 B; 3; 228;
#=GF DR   PDB; 1a38 A; 3; 228;
#=GF DR   PDB; 1a38 B; 3; 228;
#=GF DR   PDB; 1a4o A; 3; 228;
#=GF DR   PDB; 1a4o B; 3; 228;
#=GF DR   PDB; 1a4o C; 3; 228;
#=GF DR   PDB; 1a4o D; 3; 228;
#=GF DR   PDB; 1qja B; 3; 229;
#=GF DR   PDB; 1qja A; 3; 230;
#=GF DR   PDB; 1qjb A; 3; 232;
#=GF DR   PDB; 1qjb B; 3; 232;
#=GF DR   INTERPRO; IPR000308;
#=GF SQ   148
#=GS O61131/11-251      AC O61131
<deleted for brevity>
#=GS 143Z_HUMAN/3-236 DR PDB; 1qjb B; 3; 232;
O61131/11-251                RSDCTYRSKLAEQAERYDEMADAMRTLVEQCVnn.......
dkdELTVEERNLLSVAYKNAVGARRASWRIISSVEQKEMSKA.NVHNKNIAATYRKKVEEELNNIC.QDILN.
LLTKKLIPNT..SESESKVFYYKMKGDYYRYISEFS.CDE.
GKKEASNFAQEAYQKATDIAENELPSTHPIRLGLALNYSVFFY..EILNQPHQACEMAKRAF...DDAITEFDNV..
SEDS..YKDSTLI.MQLLRDNLTLWTSDLQGDQ
   
<deleted for brevity>
   
Q9XZV0/2-235                 KEELLNRCKLNDLIENYGEMFEYLKELSHIKI............
DLQPDELDLITRCTKCYIGHKRGQYRKILTLIDKDKIVD.NQKNSALLEILRKKLSEEILLLC.NSTIE.LSQNFLNNNV.
.FPKKTQLFFTKIIADHYRYIYEIN.GKE.DIKLKAKEYYE--KGLQTIKTCKYNSTETAYLTFYLNYSVFLH..
DTMRNTEESIKVSKACL...YEALKDTEDI..VDNS..QKDIVLL.CQMLKDNISLWKTETNEDN
#=GC SS_cons                 HHHHHHHHHHHHHTTCHHHHHHHHHHHHTTSC............
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTTT--.CCHHHHHHHHHHHHHHHHHHHHH.HHHHH.HHHHTTTTCC.
.CSCHHHHHHHHHHHHHHHHHHHHC.CSC.HHHHHHHHHHHHHHHHHHHHHCHCCTTCHCHHHHHHHHHHHHC..
HTSCCHHHCHHHHHHHH...HHHHTTCGGC..CTTT..HHHHHHH.HHHHHHHHHHCTCCCXXXX
#=GC SA_cons                 26310320300350512510050022003352............
4045500400120033002310402420152179179--.38752510440144014203510.43002.0035201642.
.754403000010100011100201.867.7465125302500340252067635113122100001001127..
31372485135106412...5415867932..3994..6651462.142043126627759XXXX
//
    [ Team LiB ] Previous Section Next Section