| Links: | BOTTOM |
|
|
reference predict_h14790 (Jun 26, 2000 00:04:10) reference pred_h14790 (Jun 26, 2000 00:05:01) PPhdr from: kapilm@cs.brandeis.edu PPhdr resp: MAIL PPhdr orig: HTML PPhdr want: HTML PPhdr password(###) prediction of: - threading (TOPITS)- return msf format ret topits hssp ret topits strip ret topits own ret html # default: single protein sequence MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGALNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQAYGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGVKSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV
-------------------------------------------------------------
Pattern-ID: PKC_PHOSPHO_SITE PS00005 PDOC00005
Pattern-DE: Protein kinase C phosphorylation site
Pattern: [ST].[RK]
54 SFK
139 TQR
196 TIK
203 SVK
Pattern-ID: CK2_PHOSPHO_SITE PS00006 PDOC00006
Pattern-DE: Casein kinase II phosphorylation site
Pattern: [ST].{2}[DE]
8 SFAD
71 TPEE
212 SNAD
235 TIAD
261 TVTE
Pattern-ID: MYRISTYL PS00008 PDOC00008
Pattern-DE: N-myristoylation site
Pattern: G[^EDRKHPFYW].{2}[STAGCN][^P]
59 GALNAI
88 GQALTY
187 GGMVAG
239 GVKSSI
287 GVALAA
Pattern-ID: DEHYDRATASE_SER_THR PS00165 PDOC00149
Pattern-DE: Serine/threonine dehydratases pyridoxal-phosphate attachment site
Pattern: [DESH].{4,5}[STVG].[AS][FYI]K[DLIFSA][RVMF][GA][LIVMGA]
47 ELFQKTGSFKIRGA
>prot (#) ppOld, default: single protein sequence /home/phd/server/work/predict_h14790
MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGA
LNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQA
YGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDAL
xxxxxxxxxxxx
IAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGV
KSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQT
VSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV
Identities computed with respect to: (query) prot Colored by: consensus/70% and property
HSP processing: ranked
26 [ . . : . . . . 1 . . . . : . . . . 2 . . . . : . . . . 3 . ] 314
prot (#) ppOld, default: single ... score P(N) N 100.0% TPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSG
1 PD002811 p2000.1 (28) THD1(9) SDHL(3... 105 4.6e-15 2 37.0% --------------------------------------------------------------------------------------------------------------------------------------------ILKQVDDQGNKIDAIFVPVGGGGLIAGVATYLKRISP-----------------------------------------VGEYTFELCQEYVDDIVLVSEDEICAAIKDLYEEEKVIVEPAGALAIAGLKSYKEQHKGKNVNVVCIISGG
2 PD035210 p2000.1 (2) SDHL(2) // L-S... 78 0.00043 1 44.1% TPIRDSMALSKMAGTSVYLKMDSAQPSGSFKIRG---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
consensus/100% .................................. ..................................... .......................................................................
consensus/90% .................................. ..................................... .......................................................................
consensus/80% .................................. ..................................... .......................................................................
consensus/70% .................................. ..................................... .......................................................................
|
--- ------------------------------------------------------------ --- --- Again: these results were obtained based on the domain data- --- base collected by Daniel Kahn and his coworkers in Toulouse. --- --- PLEASE quote: --- F Corpet, J Gouzy, D Kahn (1998). The ProDom database --- of protein domain families. Nucleic Ac Res 26:323-326. --- --- The general WWW page is on: ---- --------------------------------------- --- http://www.toulouse.inra.fr/prodom.html ---- --------------------------------------- --- --- For WWW graphic interfaces to PRODOM, in particular for your --- protein family, follow the following links (each line is ONE --- single link for your protein!!): --- http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom1=PD002811 ==> multiple alignment, consensus, PDB and PROSITE links of domain PD002811 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom2=PD002811 ==> graphical output of all proteins having domain PD002811 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom1=PD035210 ==> multiple alignment, consensus, PDB and PROSITE links of domain PD035210 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom2=PD035210 ==> graphical output of all proteins having domain PD035210 --- --- NOTE: if you want to use the link, make sure the entire line --- is pasted as URL into your browser! --- --- END of PRODOM --- ------------------------------------------------------------
--- ------------------------------------------------------------ --- MAXHOM multiple sequence alignment --- ------------------------------------------------------------ --- --- MAXHOM ALIGNMENT HEADER: ABBREVIATIONS FOR SUMMARY --- ID : identifier of aligned (homologous) protein --- STRID : PDB identifier (only for known structures) --- IDE : percentage of pairwise sequence identity --- WSIM : percentage of weighted similarity --- LALI : number of residues aligned --- NGAP : number of insertions and deletions (indels) --- LGAP : number of residues in all indels --- LSEQ2 : length of aligned sequence --- ACCNUM : SwissProt accession number --- OMIM : OMIM(Online Mendelian Inheritance in Man) ID --- NAME : one-line description of aligned protein --- --- MAXHOM ALIGNMENT HEADER: SUMMARY ID STRID IDE WSIM LALI NGAP LGAP LEN2 ACCNUM OMIM NAME ykv8_yeast 40 52 319 4 8 326 P36007 HYPOTHETICAL 34.9 KD PROT thd2_ecoli 35 47 320 4 6 329 P05792 THREONINE DEHYDRATASE CAT y4tj_rhisn 32 44 328 4 10 332 P55664 PUTATIVE THREONINE DEHYDR thdh_yeast 31 42 319 6 13 576 P00927 THREONINE DEHYDRATASE PRE thdh_arxad 31 43 320 5 9 550 O42615 THREONINE DEHYDRATASE PRE thd1_haein 31 44 327 5 10 513 P46493 DEAMINASE). thd1_salty 31 43 334 6 16 514 P20506 DEAMINASE). thd1_lyces 30 41 323 5 11 595 P25306 DEAMINASE). thd1_ecoli 1TDJ 30 42 334 6 16 514 P04968 DEAMINASE). thd1_burce 30 41 316 6 10 507 P53607 DEAMINASE). thd1_myctu 30 37 318 5 13 429 Q10766 DEAMINASE). thd1_bacsu 28 39 320 6 12 422 P37946 DEAMINASE). thd1_lacla 28 39 312 6 15 441 Q02145 DEAMINASE). thd1_soltu 28 41 185 3 3 359 P31212 (FRAGMENT). sdhl_rat 28 28 298 9 59 362 P09367 DEHYDRATASE (EC 4.2.1.16) sdhl_human 28 30 297 7 16 328 P20132 182128 L-SERINE DEHYDRATASE (EC thd1_corgl 27 33 313 7 18 436 Q04513 DEAMINASE). --- --- MAXHOM ALIGNMENT: IN MSF FORMAT
--- ------------------------------------------------------------ --- 3D homologue: the known structure that appeared to have sig- --- 3D homologue: nificant sequence identity to your protein is: --- 3D homologue: 1TDJ, . --- 3D homologue: Note: we do NOT check whether the similarity --- 3D homologue: is in the region for which structure has --- 3D homologue: been determined. Thus, please verify! --- ------------------------------------------------------------
--- --- Version of database searched for alignment: --- SWISS-PROT release 38.0 (7/99) with 80000 proteins ---
Identities computed with respect to: (1) predict_h1470 Colored by: consensus/70% and property
1 [ . . . . : . . . . 1 . . . . : . . . . 2 . . . . : . . . . 3 . . . ] 339
1 predict_h1470 100.0% MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGALNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQAYGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGVKSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV
2 ykv8_yeast 39.6% -------TYGDVLDASNRIKEYVNKTPVLTSRMLNDRLGAQIYFKGENFQRVGAFKFRGAMNAVSKL---SDEKRSKGVIAFSSGNHAQAIALSAKLLNVPATIVMPEDAPALKVAATAGYGAHIIRYNRYTEDREQIGRQLAAEHGFALIPPYDHPDVIAGQGTSAKELLEEVGQLDALFVPLGGGGLLSGSALAARSLSPGCKIFGVEPEAGNDGQQSFRSGSIV-HINTPKTIADGAQthLGEYTFAIIRENVDDILTVSDQELVKCMHFLAERMKVVVEPTACLGFAGALLKKEELVG---KKVGIILSGGNVDMKRYATLISGKEDGP------
3 thd2_ecoli 33.8% ITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSL---TDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRT-TGTLADGCdsRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTV-SIISGGNIDLSRVSQI--------------
4 y4tj_rhisn 31.0% MNELSNLSLESIERARERIEEHVFRTPLTTSRSLTELTGTQVSLKLEHYQRTGSFKLRGATNAILQL---SPSDRARGVIAASTGNHGRALSYAAKAVGSRATICMSDLVPENKVSEIRKLGATVRIVGSSQDDAQVEVERLVAEEGLSMIPPFDHPHIIAGQRTVGLEIVEAMPDVAMVLLPLSGGGLAAGVAAAVKALRPHARIIGVTMDRGAAMKASIEAGHPV-QVKEYRSLADSLGGGIGmwTFQMCRALLDDVVLVNEGEIAAGIRHAYEHERQILEGAGAVGIAALLSG---KVAARGGSVGVVLSGQNIDMGLHREVINGVVRATEE----
5 thdh_yeast 30.6% ---------------RSSVYDVINESPISQGVGLSSRLNTNVILKREDLLPVFSFKLRGAYNMIAKL---DDSQRNQGVIACSAGNHAQGVAFAAKHLKIPATIVMPVCTPSIKYQNVSRLGSQVVLYGNDFDEAKAECAKLAEERGLTNIPPFDHPYVIAGQGTVAMEILRQVrkIGAVFVPVGGGGLIAGIGAYLKRVAPHIKIIGVETYDAATLHNSLQRNQRTP-LPVVGTFADGTSvmIGEETFRVAQQVVDEVVLVNTDEICAAVKDIFEDTRSIVEPSGALSVAGMKK-YISTVHPEinTYVPILSGANMNFDRLRFVSERAVLGEGKEVFM
6 thdh_arxad 30.6% ---------------TSKVYDVCNETPVTPAVNLSSKLGANIFLKREDLQPVFSFKLRGAYNMMAHLP---QETRWKGVIACSAGNHAQGVAYSAKHLNIPATIVMPVVTPAIKYKNVDRLGAKVVLHGNDFDAAKAECNRLSEKHGLTNIPLFDNPYVIAGQGTIGVELLRQIdsLKAIFVCIGGGGLIAGVGAYIKRIAPQVKIIGVETYDANAMRQSLQKGERI-TLSEVGLFADGAAviLGEETFRLCQQVVDEIVLVSTDEICAAIKDVFTETRSIVEPAGALSVAGLVkeSHPEIDHSASGYTAILSGANMDFDRLRFVSERAKLGEGSEVFI
7 thd1_haein 30.1% -------SQSDYINAIVKLGSRVyvTPLQKMGKLSERLHNNIWIKREDRQPVNSFKLRGAYAMISSL---SAEQKAAGVIAASAGNHAQGVALSAKQLGLKALIVMPQNTPSIKVDAVRGFGGEVLLHGANFDEAKAKAIELSKEKNMTFIPPFDHPLVIAGQGTLAMEMLQQVADLDYVFVQVGGGGLAAGVAILLKQFMPEIKIIGVESKDSACLKAALDKGEPT-DLTHIGLFADGVAvrIGDETFRLCQQYLDDMVLVDSDEVCAAMKDLFENVRAVAEPSGALGLAGLKKYVKQNHI-EGKNMAAILSGANLNFHTLRYVSERCEIGENREALL
8 thd1_salty 29.8% MAESQPLSVAPEGAEYlpVYEAAQVTPLQKMEKLSSRLDNVILVKREDRQPVHSFKLRGAYAMMTGL---TEEQKAHGVITASAGNHAQGVAFSSARLGVKSLIVMPKATADIKVDAVRGLGGEVLLHGANFDEAKAKAIELAQQQGFTWVPPFDHPMVIAGQGTLALELLQQDSHLDRVFVPVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDSA-CLKAALEAGHPVDLPRVGLFAEGVAvrIGDETFRLCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAGMKKYIAQH-NIRGERLAHVLSGANVNFHGLRYVSEreQREGLLTVTI
9 thd1_lyces 29.6% -----------VDILASPVYDVAIESPLELAEKLSDRLGVNFYIKREDKQRVFSFKLRGAYNMMSNL---SREELDKGVITASAGNHAQGVALAGQRLNCVAKIVMPTTTPQIKIDAVRALGGDVVLYGKTFDEAQTHALELSEKDGLKYIPPFDDPGVIKGQGTIGTEINRQLKDIHAVFIPVGGGGLIAGVATFFKQIAPNTKIIGVEPYGAASMTLSLHEGHRV-KLSNVDTFADGVAvlVGEYTFAKCQELIDGMVLVANDGISAAIKDVYDEGRNILETSGAVAIAGAAA-YCEFYKIKNENIVAIASGANMDFSKLHKVTELakEALLATFMV
10 thd1_ecoli 29.2% MADSQPLSGAPEGAEYlpVYEAAQVTPLQKMEKLSSRLDNVILVKREDRQPVHSFKLRGAYAMMAGL---TEEQKAHGVITASAGNHAQGVAFSSARLGVKALIVMPTATADIKVDAVRGFGGEVLLHGANFDEAKAKAIELSQQQGFTWVPPFDHPMVIAGQGTLALELLQQDAHLDRVFVPVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDSA-CLKAALDAGHPVDLPRVGLFAEGVAvrIGDETFRLCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAGMKK-YIALHNIRGERLAHILSGANVNFHGLRYVSEreQREALLAVTI
11 thd1_burce 29.3% ---------------TARVYDVAFETELEPARNLSARLRNPVYLKREDNQPVFSFKLRGAYNKMAHIP---ADALARGVITASAGNHAQGVAFSAARMGVKAVIVVPVTTPQVKVDAVRAHGGPGVEVIQAGESYSDaaLKVQEERGLTFVHPFDDPYVIAGQGTIAMEILRQHqpIHAIFVPIGGGGLAAGVAAYVKAVRPEIKVIGVQAEDSCAMAQSLQAGKRV-ELAEVGLFADGTAvlVGEETFRLCKEYLDGVVTVDTDALCAAIKDVFQDTRSVLEPSGALAVAGAKL-YAEREGIENQTLVAVTSGANMNFDRMRFVAERAEVGEARE---
12 thd1_myctu 28.6% --PLFSLSGADIDRAAKRIAPVVTPTPLQPSDRLSAITGATVYLKREDLQTVRSYKLRGAYNLLVQL---SDEELAAGVVCSSAGNHAQGFAYACRCLGVHGRVYVPAKTPKQKRDRIRYHGGEFIDLIVGGSTYDLAAAAALEDVErtLVPPFDDLRTIAGQGTIAVEVLGQLeePDLVVVPVGGGGCIAGITTYLAERTTNTAVLGVEPAGAAAMMAALAAGEPVTLDHVDQFVDGAAVNRAGTLTYAALAAAGDMVstVDEGAVCTAMLDLYQNEGIIAEPAGALSVAGLLEADIEPGST----VVCLISGGNNDVSRYGEVLE------------
13 thd1_bacsu 28.4% LKENSLIQVKHILKAHQNVKDVVIHTPLQRNDRLSERYECNIYLKREDLQVVRSFKLRGAYHKMKQL---SSEQTENGVVCASAGNHAQGVAFSCKHLGIHGKIFMPSTTPRQKVSQVELFGKgiILTGDTFDDVYKSAAECCEAESRTFIHPFDDPDVMAGQGTLAVEILNDIdePHFLFASVGGGGLLSGVGTYLKNVSPDTKVIAVEPAGAASYFESNKAGHVV-TLDKIDKFVDGAAvkIGEETFRTLETVVDDILLVPEGKVCTSILELYNECAVVAEPAGALSVAALDLYKDQIKG---KNVVCVVSGGNNDIGRMQEMKE------------
14 thd1_lacla 28.2% --------LSNKYQANIYLKEVVTKTPLQLDPYLSNKYQANIYLKEENLQKVRSFKLRGAYYSISKL---SDEQRSKGVVCASAGNHAQGVAFAANQLNISATIFMPVTTPNQKISQVKFFGESHVtiGDTFDESARAAKAFSQDNDKPFIDPFDDENVIAGQGTVALEIFAQAksLDKIFVQIGGGGLIAGITAYSKERYPQTEIIGVEAKGATSMKAAYSAGQPV-TLEHIDKFADGIAvtVGQKTYQLINDKVKQLLAVDEGLISQTILELYSKLGIVAEPAGATSVAALELIKDEIKG---KNIVCIISGGNNDISRMQEIEE------------
15 thd1_soltu 28.9% --------------------------------------------------------------------------------------------------------------------------------------------------------PFDAPGVIKGQGTIGTEINRQLKDIHAVFVPVGGGGLISGVAAYFTQVAPHTKIIGVEPYGAASMTLSLYEGHRV-KLENVDTFADGVAvlVGEYTFAKCQELIDGMVLVRNDGISAAIKDVYDEGRNILETSGAVAIAGAAA-YCEFYNIKNENIVAIASGANMDFSKLHKVTELAELGSDNEALL
16 sdhl_rat 27.1% -------------------QESLhkTPLRDSMALSKVAGTSVFLKMDSSQPSGSFKIRGIGHLCkaLLPDTPSPL-------TAGNAGMATAYAARRLGLPATIVVPSTTPALTIERLKNEGATVEVVGEMLDEAIQLAKALEKNNPgvYISPFDDPLIWEGHTSLVKELKETLskPGAIVLSVGGGGLLCGVVQGLREvwEDVPIIAMETFGAHS-FHAAVKEGKLVTLPKITSVAKALgnTVGAQTLKLFYEHPIFSEVISDQEAVTAIEKFVDDEKILVEPACGAALAAVYSGvgRLQTPLASLVVIVCGGSNISLAQLQAL--------------
17 sdhl_human 27.2% -------------------------TPIRDSMALSKMAGTSVYLKMDSAQPSGSFKIRGIGHFCKRWA----KQGCAHFVCSSAGNAGMAAAYAARQLGVPATIVVPGTTPALTIERLKNEGAtkVVGELLDEAFELAKALAKNNPGWVYIPPFDDPLIWEGHASIVKELKETLwkPGAIALSVGGGGLLCGVVQGLQegWGDVPVIAMETFGAHSFHAATTAGKLV-SLPKITSVAKALGvtVGSQALKLFQEHPIFSEVISDQEAVAAIEKFVDDEKILVEPAWGAALAAVYSHVIQKLQLepSLVVIVCGGSNISLAQLRALKE------------
18 thd1_corgl 26.5% ------IRAADIQTAQARISSVIAPTPLQYCPRLSEETGAEIYLKREDLQDVRSYKIRGALNSG---AQSPQEQRDAGIVAASAGNHAQGVAYVCKSLGVQGRIYVPVQTPKQKRDRIMVHGGEFVSLVVTGNNFDEASAAAHEDAErtLIEPFDARNTVIGQGTVAAEILSQLtsADHVMVPVGGGGLLAGVVSYMADMAPRTAIVGIEPAGAAS-MQAALHNGGPITLETVDPFVDGAEvrVGDLNYTIVEKNQGRVHMMSATEGAVCTEmlYQNEGIIAEPAGALSIAGLKEMSFAPGSV----VVCIISGGNNDVLRYAEIAE------------
consensus/100% ..........................................................................................................................................................pt..hh.Gpto.shEh.tt....t.lhh.luGGGhhsG.s.h.tth..th.lhuhp..tst....u..ttt........t.hstuh.s..G..sh.hh.t......hh.tt.h..sh..hhtp.t.lhEsshshuhAuh..............h..lhuGtN.sh..ht.h..............
consensus/90% .........................o.h.....hst.ht..h.hK.-....s.uaKhRGhh..h........t.h.......osGNtu.uhshsst..t..uhlhhs..sst.ph.th...ut.h..h....pt.......h.t...h..l.PaDt..hhtGpsolshEl.tp....thlhl.lGGGGhhsGls.hhtth.sph.lhuhps.sut..h.uh.tst....h...t.hscuhtshhG..shthhtt......hlsptth..sh..hhpp.t.lhEsssuhuhAuh..............hs.lhuGuN.shttht.l..............
consensus/80% ..................l.t.h..TPl.....Lophhts.lhlKhEshQ.shSFKlRGAhthh.tl. ..pph.tullstSuGNHupuhuhust.lsl.uhIhhP.tsPt.Khttlp.hGuphl.h....pph...s.th.pptthhhl.PFDcP.lltGQGTluhElhpph.p.thlhlslGGGGLhuGlshhhpphhPphtllulEs.susshhtuh.tut.s.pl..ht.hA-uhsshlGp.TathhpphhcthhhVspstlssuh.tlhpc.t.lhEsuuululAuhh....t.ht...p.lshlhSGuN.shtphp.l.p............
consensus/70% ..................l.phh..TPlp.s.tLSphhtsslalKtEshQ.stSFKlRGAhshhttL. stcptstGVlssSAGNHAQulAauuppLsl.uhIshP.sTPp.KhptlpthGuphlhhstsh-phpttshthtpppshshlsPFDcPhVIAGQGTluhElhppltplctlhVslGGGGLlAGlushl+plhPph+lIuVEs.susshhtuh.tGphs.pLtplshhADGsustlGp.TaplhpphlDtllhVspstlssuhcclapc.+.lsEPuGAlulAuhht.hhphhs...pplshllSGuNhshsphp.ltE............
|
****************************************************************************
* *
* Prediction of: *
* - secondary structure, by PHDsec *
* - solvent accessibility, by PHDacc *
* *
* PHD: Profile fed neural network systems from HeiDelberg *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* Author: Burkhard Rost *
* EMBL, Heidelberg, FRG *
* Meyerhofstrasse 1, 69 117 Heidelberg *
* Internet: Predict-Help@EMBL-Heidelberg.DE *
* *
* All rights reserved. *
* *
****************************************************************************
* *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* Secondary structure prediction by PHDsec: *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* Author: Burkhard Rost *
* EMBL, Heidelberg, FRG *
* Meyerhofstrasse 1, 69 117 Heidelberg *
* Internet: Rost@EMBL-Heidelberg.DE *
* *
* All rights reserved. *
* *
* *
****************************************************************************
* *
* About the network method *
* ~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The network procedure is described in detail in: *
* 1) Rost, Burkhard; Sander, Chris: *
* Prediction of protein structure at better than 70% accuracy. *
* J. Mol. Biol., 1993, 232, 584-599. *
* *
* A brief description is given in: *
* Rost, Burkhard; Sander, Chris: *
* Improved prediction of protein secondary structure by use of se- *
* quence profiles and neural networks. *
* Proc. Natl. Acad. Sci. U.S.A., 1993, 90, 7558-7562. *
* *
* The PHD mail server is described in: *
* 2) Rost, Burkhard; Sander, Chris; Schneider, Reinhard: *
* PHD - an automatic mail server for protein secondary structure *
* prediction. *
* CABIOS, 1994, 10, 53-60. *
* *
* The latest improvement steps (up to 72%) are explained in: *
* 3) Rost, Burkhard; Sander, Chris: *
* Combining evolutionary information and neural networks to predict *
* protein secondary structure. *
* Proteins, 1994, 19, 55-72. *
* *
* To be quoted for publications of PHD output: *
* Papers 1-3 for the prediction of secondary structure and the pre- *
* diction server. *
* *
****************************************************************************
* *
* About the input to the network *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The prediction is performed by a system of neural networks. *
* The input is a multiple sequence alignment. It is taken from an HSSP *
* file (produced by the program MaxHom: *
* Sander, Chris & Schneider, Reinhard: Database of Homology-Derived *
* Structures and the Structural Meaning of Sequence Alignment. *
* Proteins, 1991, 9, 56-68. *
* *
* For optimal results the alignment should contain sequences with varying *
* degrees of sequence similarity relative to the input protein. *
* The following is an ideal situation: *
* *
* +-----------------+----------------------+ *
* | sequence: | sequence identity | *
* +-----------------+----------------------+ *
* | target sequence | 100 % | *
* | aligned seq. 1 | 90 % | *
* | aligned seq. 2 | 80 % | *
* | ... | ... | *
* | aligned seq. 7 | 30 % | *
* +-----------------+----------------------+ *
* *
****************************************************************************
* *
* Estimated Accuracy of Prediction *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* A careful cross validation test on some 250 protein chains (in total *
* about 55,000 residues) with less than 25% pairwise sequence identity *
* gave the following results: *
* *
* ++================++-----------------------------------------+ *
* || Qtotal = 72.1% || ("overall three state accuracy") | *
* ++================++-----------------------------------------+ *
* *
* +----------------------------+-----------------------------+ *
* | Qhelix (% of observed)=70% | Qhelix (% of predicted)=77% | *
* | Qstrand(% of observed)=62% | Qstrand(% of predicted)=64% | *
* | Qloop (% of observed)=79% | Qloop (% of predicted)=72% | *
* +----------------------------+-----------------------------+ *
*..........................................................................*
* *
* These percentages are defined by: *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* | number of correctly predicted residues *
* |Qtotal = --------------------------------------- (*100)*
* | number of all residues *
* | *
* | no of res correctly predicted to be in helix *
* |Qhelix (% of obs) = -------------------------------------------- (*100)*
* | no of all res observed to be in helix *
* | *
* | *
* | no of res correctly predicted to be in helix *
* |Qhelix (% of pred)= -------------------------------------------- (*100)*
* | no of all residues predicted to be in helix *
* *
*..........................................................................*
* *
* Averaging over single chains *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The most reasonable way to compute the overall accuracies is the above *
* quoted percentage of correctly predicted residues. However, since the *
* user is mainly interested in the expected performance of the prediction *
* for a particular protein, the mean value when averaging over protein *
* chains might be of help as well. Computing first the three state *
* accuracy for each protein chain, and then averaging over 250 chains *
* yields the following average: *
* *
* +-------------------------------====--+ *
* | Qtotal/averaged over chains = 72.2% | *
* +-------------------------------====--+ *
* | standard deviation = 9.3% | *
* +-------------------------------------+ *
* *
*..........................................................................*
* *
* Further measures of performance *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* Matthews correlation coefficient: *
* *
* +---------------------------------------------+ *
* | Chelix = 0.63, Cstrand = 0.53, Cloop = 0.52 | *
* +---------------------------------------------+ *
*..........................................................................*
* *
* Average length of predicted secondary structure segments: *
* *
* . +------------+----------+ *
* . | predicted | observed | *
* +-----------+------------+----------+ *
* | Lhelix = | 10.3 | 9.3 | *
* | Lstrand = | 5.0 | 5.3 | *
* | Lloop = | 7.2 | 5.9 | *
* +-----------+------------+----------+ *
*..........................................................................*
* *
* The accuracy matrix in detail: *
* *
* +---------------------------------------+ *
* | number of residues with H, E, L | *
* +---------+------+------+------+--------+ *
* | |net H |net E |net L |sum obs | *
* +---------+------+------+------+--------+ *
* | obs H |12447 | 1255 | 3990 | 17692 | *
* | obs E | 949 | 7493 | 3750 | 12192 | *
* | obs L | 2604 | 2875 |19962 | 25441 | *
* +---------+------+------+------+--------+ *
* | sum Net |16000 |11623 |27702 | 55325 | *
* +---------+------+------+------+--------+ *
* *
* Note: This table is to be read in the following manner: *
* 12447 of all residues predicted to be in helix, were observed to *
* be in helix, 949 however belong to observed strands, 2604 to *
* observed loop regions. The term "observed" refers to the DSSP *
* assignment of secondary structure calculated from 3D coordinates *
* of experimentally determined structures (Dictionary of Secondary *
* Structure of Proteins: Kabsch & Sander (1983) Biopolymers, 22, *
* 2577-2637). *
* *
****************************************************************************
* *
* Position-specific reliability index *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The network predicts the three secondary structure types using real *
* numbers from the output units. The prediction is assigned by choosing *
* the maximal unit ("winner takes all"). However, the real numbers *
* contain additional information. *
* E.g. the difference between the maximal and the second largest output *
* unit can be used to derive a "reliability index". This index is given *
* for each residue along with the prediction. The index is scaled to *
* have values between 0 (lowest reliability), and 9 (highest). *
* The accuracies (Qtot) to be expected for residues with values above a *
* particular value of the index are given below as well as the fraction *
* of such residues (%res).: *
* *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | index| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | *
* | %res |100.0| 99.2| 90.4| 80.9| 71.6| 62.5| 52.8| 42.3| 29.8| 14.1| *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | | | | | | | | | | | | *
* | Qtot | 72.1| 72.3| 74.8| 77.7| 80.3| 82.9| 85.7| 88.5| 91.1| 94.2| *
* | | | | | | | | | | | | *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | H%obs| 70.4| 70.6| 73.7| 77.1| 80.1| 83.1| 86.0| 89.3| 92.5| 96.4| *
* | E%obs| 61.5| 61.7| 63.7| 66.6| 69.1| 71.7| 74.6| 77.0| 77.8| 68.1| *
* | | | | | | | | | | | | *
* | H%prd| 77.8| 78.0| 80.0| 82.6| 84.7| 86.9| 89.2| 91.3| 93.1| 95.4| *
* | E%prd| 64.5| 64.7| 67.8| 71.0| 74.2| 77.6| 81.4| 85.1| 89.8| 93.5| *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* *
* The above table gives the cumulative results, e.g. 62.5% of all *
* residues have a reliability of at least 5. The overall three-state *
* accuracy for this subset of almost two thirds of all residues is 82.9%. *
* For this subset, e.g., 83.1% of the observed helices are correctly *
* predicted, and 86.9% of all residues predicted to be in helix are *
* correct. *
* *
*..........................................................................*
* *
* The following table gives the non-cumulative quantities, i.e. the *
* values per reliability index range. These numbers answer the question: *
* how reliable is the prediction for all residues labeled with the *
* particular index i. *
* *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | index| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | *
* | %res | 8.8| 9.5| 9.3| 9.1| 9.7| 10.5| 12.5| 15.7| 14.1| *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | | | | | | | | | | | *
* | Qtot | 46.6| 50.6| 57.7| 62.6| 67.9| 74.2| 82.2| 88.3| 94.2| *
* | | | | | | | | | | | *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* | H%obs| 36.8| 42.3| 49.5| 55.2| 61.7| 69.9| 78.8| 87.4| 96.4| *
* | E%obs| 44.7| 44.5| 52.1| 55.4| 60.9| 68.0| 75.9| 81.0| 68.1| *
* | | | | | | | | | | | *
* | H%prd| 49.9| 52.5| 60.3| 64.2| 69.2| 77.5| 85.4| 89.9| 95.4| *
* | E%prd| 41.7| 47.1| 53.6| 57.0| 64.0| 71.6| 78.8| 88.8| 93.5| *
* +------+-----+-----+-----+-----+-----+-----+-----+-----+-----+ *
* *
* For example, for residues with Relindex = 5 64% of all predicted betha- *
* strand residues are correctly identified. *
* *
* *
****************************************************************************
* *
* *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* Solvent accessibility prediction by PHDacc: *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* Author: Burkhard Rost *
* EMBL, Heidelberg, FRG *
* Meyerhofstrasse 1, 69 117 Heidelberg *
* Internet: Rost@EMBL-Heidelberg.DE *
* *
* All rights reserved. *
* *
* *
****************************************************************************
* *
* About the network method *
* ~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The network for prediction of secondary structure is described in *
* detail in: *
* Rost, Burkhard; Sander, Chris: *
* Prediction of protein structure at better than 70% accuracy. *
* J. Mol. Biol., 1993, 232, 584-599. *
* *
* The analysis of the prediction of solvent exposure is given in: *
* Rost, Burkhard; Sander, Chris: *
* Conservation and prediction of solvent accessibility in protein *
* families. Proteins, 1994, 20, 216-226. *
* *
* To be quoted for publications of PHD exposure prediction: *
* Both papers quoted above. *
* *
****************************************************************************
* *
* Definition of accessibility *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* For training the residue solvent accessibility the DSSP (Dictionary of *
* Secondary Structure of Proteins; Kabsch & Sander (1983) Biopolymers, 22,*
* 2577-2637) values of accessible surface area have been used. The *
* prediction provides values for the relative solvent accessibility. The *
* normalisation is the following: *
* *
* | ACCESSIBILITY (from DSSP in Angstrom) *
* |RELATIVE_ACCESSIBILITY = ------------------------------------- * 100 *
* | MAXIMAL_ACC (amino acid type i) *
* *
* where MAXIMAL_ACC (i) is the maximal accessibility of amino acid type i.*
* The maximal values are: *
* *
* +----+----+----+----+----+----+----+----+----+----+----+----+ *
* | A | B | C | D | E | F | G | H | I | K | L | M | *
* | 106| 160| 135| 163| 194| 197| 84| 184| 169| 205| 164| 188| *
* +----+----+----+----+----+----+----+----+----+----+----+----+ *
* | N | P | Q | R | S | T | V | W | X | Y | Z | *
* | 157| 136| 198| 248| 130| 142| 142| 227| 180| 222| 196| *
* +----+----+----+----+----+----+----+----+----+----+----+ *
* *
* Notation: one letter code for amino acid, B stands for D or N; Z stands *
* for E or Q; and X stands for undetermined. *
* *
* The relative solvent accessibility can be used to estimate the number *
* of water molecules (W) in contact with the residue: *
* *
* W = ACCESSIBILITY /10 *
* *
* The prediction is given in 10 states for relative accessibility, with *
* *
* RELATIVE_ACCESSIBILITY = (PREDICTED_ACC * PREDICTED_ACC) *
* *
* where PREDICTED_ACC = 0 - 9. *
* *
****************************************************************************
* *
* Estimated Accuracy of Prediction *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* A careful cross validation test on some 238 protein chains (in total *
* about 62,000 residues) with less than 25% pairwise sequence identity *
* gave the following results: *
* *
* *
* Correlation *
* ........... *
* *
* The correlation between observed and predicted solvent accessibility *
* is: *
* *
* ----------- *
* corr = 0.53 *
* ----------- *
* *
* This value ought to be compared to the worst and best case prediction *
* scenario: random prediction (corr = 0.0) and homology modelling *
* (corr = 0.66). (Note: homology modelling yields a relative accurate *
* prediction in 3D if, and only if, a significantly identical sequence *
* has a known 3D structure.) *
* *
* *
* 3-state accuracy *
* ................ *
* *
* Often the relative accessibility is projected onto, e.g., 3 states: *
* b = buried (here defined as < 9% relative accessibility), *
* i = intermediate ( 9% <= rel. acc. < 36% ), *
* e = exposed ( rel. acc. >= 36% ). *
* *
* A projection onto 3 states or 2 states (buried/exposed) enables the *
* compilation of a 3- and 2-state prediction accuracy. PHD reaches an *
* overall 3-state accuracy of: *
* Q3 = 57.5% *
* (compared to 35% for random prediction and 70% for homology modelling). *
* *
* In detail: *
* *
* +-----------------------------------+-------------------------+ *
* | Qburied (% of observed)=77% | Qb (% of predicted)=60% | *
* | Qintermediate (% of observed)= 9% | Qi (% of predicted)=44% | *
* | Qexposed (% of observed)=78% | Qe (% of predicted)=56% | *
* +-----------------------------------+-------------------------+ *
* *
* *
* 10-state accuracy *
* ................. *
* *
* The network predicts relative solvent accessibility in 10 states, with *
* state i (i = 0-9) corresponding to a relative solvent accessibility of *
* i*i %. The 10-state accuracy of the network is: *
* *
* Q10 = 24.5% *
* *
*..........................................................................*
* *
* These percentages are defined by: *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* | number of correctly predicted residues *
* |Q3 = --------------------------------------- (*100)*
* | number of all residues *
* | *
* | no of res. correctly predicted to be buried *
* |Qburied (% of obs) = ------------------------------------------- (*100)*
* | no of all res. observed to be buried *
* | *
* | *
* | no of res. correctly predicted to be buried *
* |Qburied (% of pred)= ------------------------------------------- (*100)*
* | no of all residues predicted to be buried *
* *
*..........................................................................*
* *
* Averaging over single chains *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The most reasonable way to compute the overall accuracies is the above *
* quoted percentage of correctly predicted residues. However, since the *
* user is mainly interested in the expected performance of the prediction *
* for a particular protein, the mean value when averaging over protein *
* chains might be of help as well. Computing first the correlation *
* between observed and predicted accessibility for each protein chan, and *
* then averaging over all 238 chains yields the following average: *
* *
* +-------------------------------====--+ *
* | corr/averaged over chains = 0.53 | *
* +-------------------------------====--+ *
* | standard deviation = 0.11 | *
* +-------------------------------------+ *
* *
*..........................................................................*
* *
* Further details of performance accuracy *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The accuracy matrix in detail: *
* .............................. *
* *
* -------+----------------------------------------------------+----------- *
* \ PHD | 0 1 2 3 4 5 6 7 8 9 | SUM %obs *
* -------+----------------------------------------------------+----------- *
* OBS 0 | 8611 140 8 44 82 169 772 334 27 0 | 10187 16.6 *
* OBS 1 | 4367 164 0 50 106 231 738 346 44 3 | 6049 9.8 *
* OBS 2 | 3194 168 1 68 125 303 951 513 42 7 | 5372 8.7 *
* OBS 3 | 2760 159 8 80 136 327 1246 746 58 19 | 5539 9.0 *
* OBS 4 | 2312 144 2 72 166 396 1615 1245 124 19 | 6095 9.9 *
* OBS 5 | 1873 96 3 84 138 425 1979 1834 187 27 | 6646 10.8 *
* OBS 6 | 1387 67 1 60 80 278 2237 2627 231 51 | 7019 11.4 *
* OBS 7 | 1082 35 0 32 56 225 1871 3107 302 60 | 6770 11.0 *
* OBS 8 | 660 25 0 27 43 136 1206 2374 325 87 | 4883 7.9 *
* OBS 9 | 325 20 2 27 29 74 648 1159 366 214 | 2864 4.7 *
* -------+----------------------------------------------------+----------- *
* SUM |26571 1018 25 544 961 2564 13263 14285 1706 487 | *
* %pred | 43.3 1.7 0.0 0.9 1.6 4.2 21.6 23.3 2.8 0.8 | *
* -------+----------------------------------------------------+----------- *
* *
* Note: This table is to be read in the following manner: *
* 8611 of all residues predicted to be in exposed by 0%, were *
* observed with 0% relative accessibility. However, 325 of all *
* residues predicted to have 0% are observed as completely exposed *
* (obs = 9 -> rel. acc. >= 81%). The term "observed" refers to the *
* DSSP compilation of area of solvent accessibility calculated from *
* 3D coordinates of experimentally determined structures (Diction- *
* ary of Secondary Structure of Proteins: Kabsch & Sander (1983) *
* Biopolymers, 22, 2577-2637). *
* *
* *
* Accuracy for each amino acid: *
* ............................. *
* *
* +---+------------------------------+-----+-------+------+ *
* |AA | Q3 b%o b%p i%o i%p e%o e%p | Q10 | corr | N | *
* +---+------------------------------+-----+-------+------+ *
* | A | 59.0 87 60 2 38 66 57 | 31 | 0.530 | 5054 | *
* | C | 62.0 91 67 5 39 25 21 | 34 | 0.244 | 893 | *
* | D | 56.5 21 45 6 49 94 57 | 20 | 0.321 | 3536 | *
* | E | 60.8 9 40 3 41 98 61 | 21 | 0.347 | 3743 | *
* | F | 63.3 94 67 9 46 29 37 | 27 | 0.366 | 2436 | *
* | G | 52.1 75 51 1 31 67 53 | 22 | 0.405 | 4787 | *
* | H | 50.9 63 53 23 45 71 50 | 18 | 0.442 | 1366 | *
* | I | 64.9 95 68 6 41 30 38 | 34 | 0.360 | 3437 | *
* | K | 66.6 2 11 2 37 98 67 | 23 | 0.267 | 3652 | *
* | L | 61.6 93 65 8 44 31 40 | 31 | 0.368 | 5016 | *
* | M | 60.1 92 64 5 39 45 44 | 29 | 0.452 | 1371 | *
* | N | 55.5 45 45 8 38 87 59 | 17 | 0.410 | 2923 | *
* | P | 53.0 48 48 9 39 83 56 | 18 | 0.364 | 2920 | *
* | Q | 54.3 27 44 7 44 92 56 | 20 | 0.344 | 2225 | *
* | R | 49.9 15 47 36 47 76 51 | 18 | 0.372 | 2765 | *
* | S | 55.6 69 53 3 51 81 56 | 22 | 0.464 | 3981 | *
* | T | 51.8 61 51 8 38 78 53 | 21 | 0.432 | 3740 | *
* | V | 61.1 93 65 5 40 39 42 | 34 | 0.418 | 4156 | *
* | W | 56.2 85 62 20 49 29 27 | 21 | 0.318 | 891 | *
* | Y | 49.7 73 52 33 49 36 38 | 19 | 0.359 | 2301 | *
* +---+------------------------------+-----+-------+------+ *
* *
* Abbreviations: *
* *
* AA: amino acid in one-letter code *
* b%o, i%o, e%o: = Qburied, Qintermediate, Qexposed (% of observed), *
* i.e. percentage of correct prediction in each state, see above *
* b%p, i%p, e%p: = Qburied, Qintermediate, Qexposed (% of predicted), *
* i.e. probability of correct prediction in each state, see above *
* b%o: = Qburied (% of observed), see above *
* Q10: percentage of correctly predicted residues in each of the 10 *
* states of predicted relative accessibility. *
* corr: correlation between predicted and observed rel. acc. *
* N: number of residues in data set *
* *
* *
* Accuracy for different secondary structure: *
* ........................................... *
* *
* +--------+------------------------------+----+-------+-------+ *
* | type | Q3 b%o b%p i%o i%p e%o e%p |Q10 | corr | N | *
* +--------+------------------------------+----+-------+-------+ *
* | helix | 59.5 79 64 8 44 80 56 | 27 | 0.574 | 20100 | *
* | strand | 61.3 84 73 9 46 69 37 | 35 | 0.524 | 13356 | *
* | loop | 54.4 64 43 11 44 78 61 | 18 | 0.442 | 27968 | *
* +--------+------------------------------+----+-------+-------+ *
* *
* Abbreviations as before. *
* *
****************************************************************************
* *
* Position-specific reliability index *
* ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ *
* *
* The network predicts the 10 states for relative accessibility using real*
* numbers from the output units. The prediction is assigned by choosing *
* the maximal unit ("winner takes all"). However, the real numbers *
* contain additional information. *
* E.g. the difference between the maximal and the second largest output *
* unit (with the constraint that the second largest output is compiled *
* among all units at least 2 positions off the maximal unit) can be used *
* to derive a "reliability index". This index is given for each residue *
* along with the prediction. The index is scaled to have values between *
* 0 (lowest reliability), and 9 (highest). *
* The accuracies (Q3, corr, asf.) to be expected for residues with values *
* above a particular value of the index are given below as well as the *
* fraction of such residues (%res).: *
* *
* +---+------------------------------+----+-------+-------+ *
* |RI | Q3 b%o b%p i%o i%p e%o e%p |Q10 | corr | %res | *
* +---+------------------------------+----+-------+-------+ *
* | 0 | 57.5 77 60 9 44 78 56 | 24 | 0.535 | 100.0 | *
* | 1 | 59.1 76 63 9 45 82 57 | 25 | 0.560 | 91.2 | *
* | 2 | 61.7 79 66 4 47 87 58 | 27 | 0.594 | 77.1 | *
* | 3 | 66.6 87 70 1 51 89 63 | 30 | 0.650 | 57.1 | *
* | 4 | 70.0 89 72 0 83 91 67 | 32 | 0.686 | 45.8 | *
* | 5 | 72.9 92 75 0 0 93 70 | 34 | 0.722 | 35.6 | *
* | 6 | 76.3 95 77 0 0 93 75 | 36 | 0.769 | 24.7 | *
* | 7 | 79.0 97 79 0 0 93 78 | 39 | 0.803 | 16.0 | *
* | 8 | 80.9 98 80 0 0 91 81 | 43 | 0.824 | 9.6 | *
* | 9 | 81.2 99 80 0 0 88 83 | 45 | 0.828 | 5.9 | *
* +---+------------------------------+----+-------+-------+ *
* *
* Abbreviations as before. *
* *
* The above table gives the cumulative results, e.g. 45.8% of all *
* residues have a reliability of at least 4. The correlation for this *
* most reliably predicted half of the residues is 0.686, i.e. a value *
* comparable to what could be expected if homology modelling were *
* possible. For this subset of 45.8% of all residues, 89% of the buried *
* residues are correctly predicted, and 72% of all residues predicted to *
* be buried are correct. *
* *
*..........................................................................*
* *
* The following table gives the non-cumulative quantities, i.e. the *
* values per reliability index range. These numbers answer the question: *
* how reliable is the prediction for all residues labeled with the *
* particular index i. *
* *
* +---+------------------------------+----+-------+-------+ *
* |RI | Q3 b%o b%p i%o i%p e%o e%p |Q10 | corr | %res | *
* +---+------------------------------+----+-------+-------+ *
* | 0 | 40.9 79 40 16 41 21 40 | 14 | 0.175 | 8.8 | *
* | 1 | 45.4 61 46 28 44 48 44 | 17 | 0.278 | 14.1 | *
* | 2 | 47.4 53 52 10 46 80 44 | 19 | 0.343 | 19.9 | *
* | 3 | 52.9 75 59 4 50 77 47 | 23 | 0.439 | 11.4 | *
* | 4 | 60.0 81 63 0 83 84 56 | 25 | 0.547 | 10.1 | *
* | 5 | 65.2 82 70 0 0 93 62 | 28 | 0.607 | 10.9 | *
* | 6 | 71.3 90 72 0 0 94 70 | 31 | 0.692 | 8.8 | *
* | 7 | 76.0 94 76 0 0 95 75 | 34 | 0.762 | 6.3 | *
* | 8 | 80.5 97 81 0 0 94 79 | 39 | 0.808 | 3.8 | *
* | 9 | 81.2 99 80 0 0 88 83 | 45 | 0.828 | 5.9 | *
* +---+------------------------------+----+-------+-------+ *
* *
* For example, for residues with RI = 4 83% of all predicted intermediate *
* residues are correctly predicted as such. *
* *
* *
****************************************************************************
| %H: 43.4 | %E: 17.1 | %L: 39.5 |
| %A: 9.7 | %C: 2.1 | %D: 4.1 | %E: 5.6 | %F: 2.1 |
| %G: 7.1 | %H: 2.1 | %I: 8.0 | %K: 5.6 | %L: 8.6 |
| %M: 1.2 | %N: 4.1 | %P: 6.5 | %Q: 5.3 | %R: 2.4 |
| %S: 5.9 | %T: 6.2 | %V: 10.0 | %W: 0.9 | %Y: 2.6 |
| AA : | amino acid sequence | |
| PHD_sec: | PHD predicted secondary structure: H=helix, E=extended (sheet), blank=other (loop) PHD = PHD: Profile network prediction HeiDelberg | |
| Rel_sec: | reliability index for PHDsec prediction (0=low to 9=high) Note: for the brief presentation strong predictions marked by '*' | |
| SUB_sec: | subset of the PHDsec prediction, for all residues with an expected average accuracy > 82% (tables in header) NOTE: for this subset the following symbols are used: L: is loop (for which above ' ' is used) .: means that no prediction is made for this residue, as the reliability is: Rel < 5 | |
| pH_sec: | 'probability' for assigning helix (1=high, 0=low) | |
| pE_sec: | 'probability' for assigning strand (1=high, 0=low) | |
| pL_sec: | 'probability' for assigning neither helix, nor strand (1=high, 0=low) | |
| P_3_acc: | PHD predicted relative solvent accessibility (acc) in 3 states: b = 0-9%, i = 9-36%, e = 36-100%. | |
| Rel_acc: | reliability index for PHDacc prediction (0=low to 9=high) Note: for the brief presentation strong predictions marked by '*' | |
| SUB_acc: | subset of the PHDacc prediction, for all residues with an expected average correlation > 0.69 (tables in header) NOTE: for this subset the following symbols are used: I: is intermediate (for which above ' ' is used) .: means that no prediction is made for this residue, as the reliability is: Rel < 4 | |
| PHD_acc: | PHD predicted relative solvent accessibility (acc) in 10 states: a value of n (=0-9) corresponds to a relative acc. of between n*n % and (n+1)*(n+1) % (e.g. for n=5: 16-25%). |
PHD results (brief)....,....1....,....2....,....3....,....4....,....5....,....6....,....7....,....8....,....9....,....10...,....11...,....12...,....13...,....14...,....15...,....16...,....17...,....18...,....19...,....20...,....21...,....22...,....23...,....24...,....25...,....26...,....27...,....28...,....29...,....30...,....31...,....32...,....33...,....34 AA MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGALNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQAYGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGVKSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV PHD_sec HHHHHHHHHHHHHH HHHHH EEEEEE EE HHHHHHHH HHH EEEEE HHHHHHHHHHHH EEEEE HHHHHHHHHHH EEEEE HHHHHHHHHHHHH EEEE EE HHHHHHHHHH EEEEE HHHHHHHHHHHHHH EEEEE HHHHHHHHH E EE E HHHHHHHHHH EEEE HHHHHHHHHHHHH EEEE HHHHHHHHHHHH EEEEEE HHHHHHHHHHHHH E Rel_sec ***** * *********** **** **** *** **** ****** ************ ******* * *********** ** **** *** ********* ** **** *** *********** ****** ***** ******** ******** * ************* *** **** ************ *** ** * ********* ***** * ********** * ** * ********** * ***** ***** *** ************ *** * P_3_acc eeeebebbbeebeebeeebeebbeebbbeebeebbeebebebbbeeeeeeeb bbebbbbbbbbeeb eeeeeeeeebbbbbbbbbbbbbbbbbbeebbbebbbbbbee bebebebbee bbebbbbbee eebeeebeebeeeee ebbbbbe bbbbbbbbbbbbebbeebeebebbbbbbbbbbbbbbbbbbbeeb eebebbbb bebbbbbebbbeeeeebbebeebebbbebbbbebbeebbebbeebbeebbbbbeeebbbbbeebbeebebbbebbbbbbbbbbeeeeeeeeeeeeeebbbbbbbbbbebbebeebeeebeeeeeeebbb Rel_acc * * * * *** ** * ** * ******* * ****** * * *** * ** * ** * ** * ***** ******** * * **** * * ** ** * * ** * ** * ** ******* ****** * * *
PHD results (normal) ....,....1....,....2....,....3....,....4....,....5....,....6....,....7....,....8....,....9....,....10...,....11...,....12...,....13...,....14...,....15...,....16...,....17...,....18...,....19...,....20...,....21...,....22...,....23...,....24...,....25...,....26...,....27...,....28...,....29...,....30...,....31...,....32...,....33...,....34 AA MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGALNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQAYGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGVKSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV PHD_sec HHHHHHHHHHHHHH HHHHH EEEEEE EE HHHHHHHH HHH EEEEE HHHHHHHHHHHH EEEEE HHHHHHHHHHH EEEEE HHHHHHHHHHHHH EEEE EE HHHHHHHHHH EEEEE HHHHHHHHHHHHHH EEEEE HHHHHHHHH E EE E HHHHHHHHHH EEEE HHHHHHHHHHHHH EEEE HHHHHHHHHHHH EEEEEE HHHHHHHHHHHHH E Rel_sec 999854453169999987876212488764168763169549985139886563131315999985599973422686599725327899999999718813999538993787999998199189982595499999999996489655517999741103334599999983378859996484599999999999938984599832577589999998179922266421111453111325899999993495598235379999999994247375163167999999963253466788499998189834999999999985299942139 SUB_sec LLLLL..L..HHHHHHHHHHH....LLLL..HHHH..LLL.EEEE..LLLLLL......HHHHHHHLLLLL....LLLEEEE.L..HHHHHHHHHHH.LL..EEEE.LLL.HHHHHHHHH.LL.EEEE.LLL.HHHHHHHHHHH.LLLEEE.LLLLL........HHHHHHHH..LLLEEEEE.L.HHHHHHHHHHHHH.LLL.EEEE..LLLHHHHHHHHH.LLL...LL.......L......HHHHHHHHH..LLEEE..L.HHHHHHHHHH...L.EE.L..HHHHHHHHHH..L..LLLLL.EEEEE.LLL..HHHHHHHHHHHH.LLL....L P_3_acc eeeebebbbeebeebeeebeebbeebbbeebeebbeebebebbbeeeeeeeb bbebbbbbbbbeeb eeeeeeeeebbbbbbbbbbbbbbbbbbeebbbebbbbbbee bebebebbee bbebbbbbee eebeeebeebeeeee ebbbbbe bbbbbbbbbbbbebbeebeebebbbbbbbbbbbbbbbbbbbeeb eebebbbb bebbbbbebbbeeeeebbebeebebbbebbbbebbeebbebbeebbeebbbbbeeebbbbbeebbeebebbbebbbbbbbbbbeeeeeeeeeeeeeebbbbbbbbbbebbebeebeeebeeeeeeebbb Rel_acc 122101202001125112711150211021012533100117460001011117616057002502302202321034689757101608665771040318194721000301011712031155212110210211312203310002601001106625030582082212221028894633245687844361211115177770210201310502211111121120035123221551155226120110534601106126711512101064123146879651111011010302069896432020401402512003103211201 SUB_acc ..............b...b...b..........b.......bbb.........bb.b.bb...b.............bbbbbbb...b.bbbbbb..b...b.bbb...........b......bb........................b.......bb.b...bb..b.........bbbbb...bbbbbbbb.b......b.bbbb..........b................b......bb..bb..b......b.bb....b..bb..b......bb....bbbbbbb..............bbbbbb.....b..b..b..............
--- --- GLOBE: prediction of protein globularity --- --- nexp = 147 (number of predicted exposed residues) --- nfit = 139 (number of expected exposed residues --- diff = 8.00 (difference nexp-nfit) --- =====> your protein appears as compact, as a globular domain --- --- --- GLOBE: further explanations preliminaryily in: --- http://www.columbia.edu/~rost/Papers/98globe.html --- --- END of GLOBE
---
--- ------------------------------------------------------------
--- TOPITS prediction-based threading
--- ------------------------------------------------------------
---
--- TOPITS ALIGNMENTS HEADER: PARAMETERS
--- str:seq= 50 : structure (sec str, acc)= 50%, sequence= 50%
--- str:seq = 50 : weight structure/sequence,i.e. str= 50%, seq= 50%
--- smin = -1.00 : minimal value of alignment metric
--- smax = 2.00 : maximal value of alignment metric
--- go = 2 : gap open penalty
--- ge = 0.2 : gap elongation penalty
--- len1 = 339 : length of search sequence, i.e., your protein
---
--- TOPITS ALIGNMENTS HEADER: ABBREVIATIONS
--- RANK : rank in alignment list, sorted according to z-score
--- EALI : alignment score
--- LALI : length of alignment
--- IDEL : number of residues inserted
--- NDEL : number of insertions
--- ZALI : alignment zcore; note: hits with z>3 more reliable
--- PIDE : percentage of pairwise sequence identity
--- LEN2 : length of aligned protein structure
--- ID2 : PDB identifier of aligned structure
--- NAME2 : name of aligned protein structure
--- IFIR : position of first residue of search sequence
--- ILAS : position of last residue of search sequence
--- JFIR : PDB position of first residue of remote homologue
--- JLAS : PDB position of last residue of remote homologue
---
--- TOPITS ALIGNMENTS HEADER: ACCURACY
--- : Tested on 80 proteins, TOPITS found the
--- : correct remote homologue in about 30% of
--- : the cases, detection accuracy was higher
--- : for higher z-scores (ZALI):
--- ZALI>0 : 1st hit correct in 33% of cases
--- ZALI>3 : 1st hit correct in 50% of cases
--- ZALI>3.5 : 1st hit correct in 60% of cases
---
--- TOPITS ALIGNMENTS HEADER: SUMMARY
RANK EALI LALI IDEL NDEL ZALI PIDE LEN2 ID2 NAME2
1 189.93 322 31 14 5.57 34 495 1tdj _ID: 1;
2 104.73 312 75 27 2.35 28 397 2tys_B OL_ID: 1;
3 102.13 309 166 35 2.25 34 566 2kau_C OLECULE: KLEBSIELLA AEROG
4 101.87 326 138 29 2.24 27 602 1cii _ID: 1;
5 101.73 320 129 36 2.24 33 727 1req_A OL_ID: 1;
6 101.27 316 74 27 2.22 28 530 2frv_B OL_ID: 1;
7 101.13 322 140 34 2.22 30 605 1aor_A LDEHYDE FERREDOXIN OXIDOR
8 100.67 304 83 28 2.20 28 399 1rom _ID: 1;
9 100.53 305 102 33 2.19 30 404 1psd_A -3-PHOSPHOGLYCERATE DEHYD
10 99.80 310 112 31 2.17 32 678 1trk_A TRANSKETOLASE (E.C.2.2.1.
11 99.60 310 121 34 2.16 31 757 1kit L_ID: 1;
12 99.20 330 135 31 2.14 27 839 1yge L_ID: 1;
13 98.07 310 84 31 2.10 28 376 1bhe L_ID: 1;
14 97.93 304 96 29 2.10 31 418 1uae L_ID: 1;
15 97.33 318 107 32 2.07 27 442 1mro_B MOL_ID: 1;
16 97.13 318 150 34 2.07 31 870 1dik L_ID: 1;
17 97.13 298 109 33 2.07 28 561 3pmg_A MOL_ID: 1;
18 96.73 323 94 26 2.05 26 401 1fcd_A FLAVOCYTOCHROME C SULFIDE
19 96.60 323 147 40 2.05 30 698 1aa6 L_ID: 1;
20 96.60 318 83 26 2.05 28 458 1lvl HYDROLIPOAMIDE DEHYDROGEN
---
--- TOPITS ALIGNMENTS HEADER: PDB_POSITIONS FOR ALIGNED PAIR
RANK PIDE IFIR ILAS JFIR JLAS LALI LEN2 ID2
---
--- TOPITS ALIGNMENTS: SYMBOLS AND EXPLANATIONS
--- BLOCK 1 : your protein and its predicted 1D structure,
--- : i.e., secondary structure and solvent accessibility
--- line 1 : amino acid sequence (one-letter-code)
--- line 2 : predicted secondary structure:
--- H : helix
--- E : strand (extended)
--- L : other (no regular secondary structure)
--- line 3 : predicted residue relative solvent accessibility
--- B : buried, i.e., relative accessibility < 15%
--- O : exposed (outside), i.e., relative accessibility >= 15%
--- :
--- BLOCKS 1-20 : 20 best hits of the prediction-based threading
--- ATTENTION : We chose to include all first 20 hit. However,
--- ATTENTION : most of them will not constitute true remote
--- ATTENTION : homologues. Instead, all hits with a zscore
--- ATTENTION : (ZALI) < 3.5 are, at best, rather speculative!
--- : for each aligned protein:
--- line 1 : amino acids conserved between guide (yours) and the
--- : aligned protein (putative homologue)
--- line 1 : sequence of aligned protein
--- line 3 : secondary structure, taken from DSSP (assignment
--- : of secondary structure based on experimental coordinates)
--- line 4 : relative solvent accessibility, taken from DSSP
---
--- TOPITS ALIGNMENTS
1 - 51 ....:....1....:....2....:....3....:....4....:....5
pred MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQK
HHHHHHHHHHHHHH HHHHH EEEEEE
OOOOBOBBBOOBOOBOOOBOOBBOOBBBOOBOOBBOOBOBOBBBOOOOOOO
1. 1tdj 189.93 AEY...LRAVLRAPVY..EAAQVTPllSSRLDNVI.....LVKREDRQP
HHH...HHHHHHLLHH..HLLLLLLEHHHHLLLEE.....EEELHHHLL
E A Q L AGR KC L K
2. 2tys_B 104.73 ALNQLEEAFVRAQKDPEFQAQF.ADLLKNYAGrtALTKCqlYLK
HHHHHHHHHHHHHLLHHHHHHH.HHHHHHLLLLLLEEELLEEEE
AD E I D L VLT I I K F K
3. 2kau_C 102.13 VRLADTE.LWIEVEDDLtlAavLTNaiVDhiVKADIGVkgRIFak
EELLLLL.LEEELLEELLLHHEEEEEEEELEEEEEEEEELEEEEL
A I Q L L IAG L ELF
4. 1cii 101.87 LRAGNADAADITRQEFRLLQAELREYGftEIAGYdlRLHTelFAD
HHHLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AD E A D S LNQ A R FF F
5. 1req_A 101.73 ADIEMAY.TLADGVDYIRAGESVGLnqFAPRLSFfgMNFFME
HHHHHHH.HHHHHHHHHHHHHHLLLLHLHHHEEEELLLHHHH
C Y A V I L LT Q LF
6. 2frv_B 101.27 VC.TYVHALASVRAVdvKIPENATLMRNLT..MGAQYMHDHlfYH...LHA
LL.LLHHHHHHHHHHHLLLLHHHHHHHHHH..HHHHHHHHHHHHH...LLH
F I L L G L K
7. 1aor_A 101.13 RfvNLSTGDIKVEE...YDEELAKKWLGS.RGLAIY....LLLK
EEEELLLLEEEEEE...LLHHHHHHHLLL.HHHHHH....HHHH
S A H S L V TS Q A F Q
8. 1rom 100.67 SLAWLVTKHKDvaTSEKLSKVRtsASGKQAaaKPTFVDMdmHQR
LEEEEELLHHHHHHLLLEELLLLLHHHHHHLLLLLHHHLLHHHH
A Y I F D E AHI HLT V T L A R F F
9. 1psd_A 100.53 AGYtiEFhdDEqahiGLRSRTHLTevitNQvlDAAAKRGivFN.APFSN
LLLLEEELLHHHLLEEELLLLLELHHHLLLEHHHHHLLLLEEL.LLLLL
F D K I I S H PVL S N G
10. 1trk_A 99.80 QFTDIDksTIRiaNSGhmApvLWSQmiNrsNGHAVALLYSMLHL
LLLHHHHHHHHHHLLLLHHHHHHHHLLLLLLHHHHHHHHHHHHH
I A N Q I T V S QIAG EL
11. 1kit 99.60 IFYSVYDVASGNWQAPIDVtqVKERSF..QIAG...WGGSELYRR
EEEEEEELLLLEEEEEEELHHHLLLLE..EEEL...LLEEEEEEE
I F A I L S L I F C
12. 1yge 99.20 SAFNIHFegIPGAFY.IKNYMQVEFFLKSLTLEAISNqtIRFVCNSWvn
EEEEEEEELLEEEEE.EEELLLLLEEEEEEEELLLLLLLEEEEEEEEEE
C AD A IQ L AG F L
13. 1bhe 98.07 PSSCTTlaDSSTATSTIQKALNNCDQGKAVRLS..AGssVFLSGPLSLP
LLEEEEELLLLELHHHHHHHHLLLLLLLEEEEE..LLLLEEEELLEELL
A I FA IQ L V TS L QI RN F C K
14. 1uae 97.93 AALPILFAalAEEPVEIQNVPKLKDVDTskLLSqiDArnVF..CAPYdk
HHHHHHHHHHELLLEEEELLLLLHHHHHHHHHHHEELLLLL..ELLHHL
I I I T V I N C
15. 1mro_B 97.33 ALSPLRNPAisIVQGIKRtaVNLEGIENALKTAKvgPACKIMGR
HHLLLLLHHHHHHHHHHHEEEEHHHHHHHHHHLLLLLLLLLLLL
C Y S IQD I TP L S ILN G N E F K
16. 1dik 97.13 CTEYYNSGKQITQ...EIQDQIfitpLLVstILNL..GLN.DVAVEGfkK
HHHHHLLLLLLLH...HHHHHHHHHEELEEEEEEE..LLL.LLHHHHHHH
C K P T I RN F K EL
17. 3pmg_A 97.13 ICPDLKVDLGVLGKQQFDLEN..KFKP.FTVEIVDSVEarNIflK.ELLSG
ELLLLLLLLLLLEEEEELLLL..LLLL.EEEEEELLLHHHLLLHH.HHHHL
A Y I AD I T T N G K
18. 1fcd_A 96.73 AKY.IKLADPSIEVTLIEPN...TDYYTCYLSNEVIGGD........RK
HHH.HHHHLLLLEEEEELLL...LLEELLLLHHHHHHLL........LL
CA CI A N Q L NQI L K
19. 1aa6 96.60 CASGciNlaEAAQGKTN.QGTLCLKGYYGWDFInqILTPRL..KTPMIRR
ELLLLEEEEEELLLLLL.LLLLLHHHHHLLHHHHLLLLLEL..LLLEELL
A I L LN I L E F
20. 1lvl 96.60 AAIRAGQLGIPTVLVEGQALGGTCLncIPSKALIHVAEQFHQ
HHHHHHHHLLLEEEELLLLLLHHHHHHHHHHHHHHHHHHHHH
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 PVHSFKLRGAYAMMAGL...TEEQKAHGVITASAGNHAQGVAfsARL.GVK
LLLLLLHHHHHHHHHLL...LLLLLLLLLEEEELLLLHHHHHHHHHL.LLL
K GA N GL K G HG A A L G
2. 2tys_B 104.73 KREDLLHGGatNQVLglLA.KRMGKSEIIAETGAGQHGVASALASALLGLK
EEHHHLLLLEHHHHHHHHH.HHLLLLEEEEEELLLHHHHHHHHHHHHHLLE
K G I GA A G I DTP A VT G G A T A P
3. 2kau_C 102.13 kAGNPDIqgavIAAEGKIvdtpQQAEEALvtMVGGGTGPAatHATTCTPGP
LEELLLLLLLEEELLLLEEEELHHHHHHHHEEEEELLLLLHHHHLLLLLHH
S I A NA EE K T S N G AL A L
4. 1cii 101.87 DADSLRikDAQNADKKAADMLAeeRRKGIltRLskNGGAAL...AVLDAQQ
HLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...HHHHHHH
K R A L PK THS G LTYAA
5. 1req_A 101.73 EVA..KLRAARMLWAKLVHQFGPKNPksLRTHSQTS.GWSLtyaahTNSLD
HHH..HHHHHHHHHHHHHHLLLLLLHHLLLEEEEEL.HHHLLLHHELLLLL
ALNA L P T E K KA V SG G T A L G P
6. 2frv_B 101.27 ALDWVNVANALNaaNDLSptTTeaKVKALV..ESGQLG.IFTNAYFLGGHP
HHHLLLHHHHHHLHHHHLLLLHHHHHHHHH..LLLLLH.HHLLLLLLLLLL
K L LIP T P VVT S GN G K G
7. 1aor_A 101.13 KEMDPTV.DPLSPENKLipLTGTSAPtnVVTKstgnSGGYFGAELKFAGYD
HHLLLLL.LLLLLLLLEELLLLLLLLLEEEEELLLEELLLHHHHHHHLLLL
F A I T N L L P
8. 1rom 100.67 RspTF.TPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALP.VP
HLHHH.LHHHHHHLHHHHHHHHHHHHHHHHLLLLLLLLEEHHHHLLLH.HH
T S I L RG PE KA N A AKL GI
9. 1psd_A 100.53 NTRslVIGELLLLLRGV....PEANAKA..HRGVWNKLAAGSFEakL.GII
LHHHHHHHHHHHHHLLH....HHHHHHH..LLLELLLLHHHLLLLEE.EEE
TG I L R L P PEE P VT G GQ AA L P
10. 1trk_A 99.80 LTgdLSI.EDLKQFRQltPGHPeeLPGVEVT..TGPLGQGISNaaNLAAtp
HLLLLLH.HHHLLLLLLLLLLLLLLLLLLLL..LLLLLHHHHHHHHHHHHL
S KIRGA N I K VVT S G A L G
11. 1kit 99.60 RNTSlkIrgAANQIQ.....VADGSRKYVVTLSIDESGGLV...ANLNGVS
EELLLEEEELLEEEE.....EEELLEEEEEEEEELLLLLEE...EEELLLL
T K R P E P V GN G Y
12. 1yge 99.20 nTKLYK......SVRIFFANhpSETPAPLVSYrrGntGERKEY.DRIYDYD
EHHHLL......LLEEEELLLHHHLLHHHHHHHHLLLLLLLLL.LLLLLEE
GS I G L A E P V G G
13. 1bhe 98.07 PSgsLLikgtLRAVNN..AKSFENAPSsgVVDKNG............KGCD
LLLEEEELLEEEELLL..LHHHELLLLLLLEELLL............LLEL
KT R A L G L LE
14. 1uae 97.93 kT....MRASIWALGPLVARFGQGQVSLPGGCTIGARPVDLHI.SGLEQLG
LL....LLHHHHHHHHHHHHHLEEEEELLLLLLLLLLLLHHHH.HHHHHLL
I G I I T A V S A YAA L
15. 1mro_B 97.33 RELDLDIVGNAESikEMIQVTEDdgKRALVQVPSARFDVAAEyaAPLVTAT
LLLLLLLHHHHHHHHHHHLLLLLLHLEEEEELLHHHLLLLLLLHHHHHHHH
KTG R A R I VV H A K EG
16. 1dik 97.13 KTGNP..RFAYDSYRRFIQMYSD.....VVMEVPKSHFEKIIDAMKEegV.
HHLLH..HHHHHHHHHHHHHHHH.....HLLLLLHHHHHHHHHHHHLLLL.
KIR A G AV G H LTYAA L
17. 3pmg_A 97.13 GPNRLKIR..IDAMHGVveELGAPANSAvlEDFGGHHPDplTYAADL....
LLLLLLEE..EELLLLLLLLLLLLHHHEELLLHHHLLLLLLLLLHHH....
K S K G L A G D K V G G L YAAKL
18. 1fcd_A 96.73 KLESIK.HggLRA.HgvVHDSAtdPDKKLVKTAGgaPGIELIyaAKLP..H
LHHHHE.ELHHHL.LLEELLLEELLLLLEEEELLLLLLEEELLLLLLL..L
K ALN E P A T SS G Y A P
19. 1aa6 96.60 RQRGGKLeeALNYVAERLSAIKEkgPDAIQTtsSRGTGNETNYvaRVXHGP
LLLLLLLEHHHHHHHHHHHHHHHHLHHHEEEELLLLLLHHHHHHHHHHLHH
F L A LI K V G HG A L G
20. 1lvl 96.60 QASRFTEPSPLgaSPRliGQSVAWKDGIVDRLTTgkHGVKVvwAKVLDG..
HHHHLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHLLLEEELEEEEEL..
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 KALIVMPTATADIKVDAVRGFGGEVLLHGAndEAKAKAIE..LSQQQgtWV
LEEEELLLLLLHHHHHHHHHHLLEEELLLLLHHHHHHHHH..HHHHHLEEL
I V PN GA V C D S T M T G
2. 2tys_B 104.73 KCRIyvERQSPN..VFRMRLMGAEvvhcNerDWSGSYETAHYMLGtaGPHP
EEEEEHHHLHHH..HHHHHHLLLEEELHHHHHHHHHLLLEEELLLLLLLLL
P YI A I G V P D RE V G
3. 2kau_C 102.13 PWYISRMLQAADSLPVNIGLLGKGNVS.QP.DALREQVAAGVI....GLKI
HHHHHHHHHHHLLLLLEEEEEEELLLL.LH.HHHHHHHHHLLL....EEEE
A QT N AI S R TQ Q G V
4. 1cii 101.87 QARLLGQQT.RNDR..AISEARNKLSSVTEsnTARNALtqQLTQqpDgtIV
HHHHHHHHH.HHHH..HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
I P A N Q G V DP S E T QE EGI
5. 1req_A 101.73 DEAIALprIARNTQLFLQQESGTTRVI.DPWSGsvEELTWDliQEVegIPK
LLLLLLLHHHHHHHHHHHHLLLLLLLL.LLLLLLHHHHHHHHHHHHHLHHH
PAY TA L QA A TQ G
6. 2frv_B 101.27 PAYVLPAetAHYLEALRVqaARAMAIF.....GAKNPHTQ..FTVVGGCTN
LLLLLLLLHHHHHHHHHHHHHHHHHHH.....HLLLLLLL..LEELLEELL
A IVV A K I AS E E I
7. 1aor_A 101.13 DA.IVVEGKAEkiKDEHIEIRDASHIWGKKVSETEATIRKEVGSEKVKisI
LE.EEEELLLLLEELLEEEEEELLLLLLLLHHHHHHHHHHHHLLLLLEEEL
P YI VP T N AI G S S E V QR IL
8. 1rom 100.67 PSYIivplTQQN....AIRTNGSSTAR.EASAANQelVEQRLVEPKDDilC
HHHHHLLHHHHH....HHHHLLLLLHH.HHHHHHHHHHHHHHHLLLLLHHH
Y L I A G D V Q M V
9. 1psd_A 100.53 IGYGHIG......TQLGILalGMYVYFYDIENknATQV.QHlmSDVVSLHV
ELLLLLH......HHHHHHHLLLEEEEELLLLLLLEEL.LLHHLLEEEELL
P A KL I Y D E V R E G V
10. 1trk_A 99.80 pGFTliSSEASshLKLgiAIYDDNKITIDGATSIseDVAKRY..EAYGWEv
LLELLHHHHHHHHLLLLEEEEEELLEELLEEHHHLLLHHHHH..HHHLLEE
A I Q Y A D S E R V Q I G LV
11. 1kit 99.60 SAPIILQSEHAKVHSfeLQ.YSatTLFVDgsQEnrLHV.QKIVLTQQGhlv
LLLEEEELLHHHHLLLEEE.EELEEEEELLLLLEEEEE.EEEEEEELLEEE
Y KLA G S P D EK L
12. 1yge 99.20 DVYNDLGNPDKS.EKLARPVLGGSSTFPYptDPNTEKQGEVFYVPRDENLG
ELELLLLLLLLL.HHHLLLLELLLLLLLLLLLLLLELLLLLLLLLHHHLLL
A I TA I YG D KV
13. 1bhe 98.07 DAFI....TAVSTTNSGI..YGPGTI..DGQGGVkkKVSWWELAADAKVKK
LLLE....EEELLELLEE..ELLLEE..ELLLLLLLLLLLLHHHHHHHHHL
A I A L GA IV D T IM TEGI
14. 1uae 97.93 GATIKLEekASVDGRLK....GAHIV....MDKVSVGATVTImaAtegiIE
LLEEEEELEEELLLLLE....LLEEE....LLLLLHHHHHHHHHHLELEEE
A I V N K A Y S Y Q E GI V
15. 1mro_B 97.33 TAfiivSMYDANMVKAAVLgyPQSVEYMGANIATMLDIPQKL..EGPgiMV
HHHHHLLLLLHHHHHHHHHLLLLLLLLLLLLEELLLLLHHHL..LLLLLLH
TA K LA A VY E M
16. 1dik 97.13 ..HFDTDLTADDLKELA.EKFKA..VYKEagEEFPQEPKDQLMGAVKAVFR
..LLHHHLLHHHHHHHH.HHHHH..HHHHLLLLLLLLHHHHHHHHHHHHHH
V T K GA D R G V
17. 3pmg_A 97.13 ....V..ETM....KSGEHDFGAAF....DGDGDRN.....MILGKHGFFV
....H..HHH....HLLLLLEEEEE....LLLLLLE.....EEEEHHHLEL
A QTA K L A G V P RE Q I
18. 1fcd_A 96.73 HAWKAGEQTAILRKQLEDMADGGTVVIAPPAAPFreRASqyYLKasKVIIL
LLLLLLHHHHHHHHHHHHLLLLLLEEEELLLLLLLHHHHHHHHHHLLEEEE
P Q N AI V P D S V I V
19. 1aa6 96.60 PSVAGLHQSVGN.GAmaINEIDNtlVfyNPAD.SHPIVANHVirNGAKIIv
HHHHHHHHHHLL.LLLLHHHHHHLEEELLHHH.HLHHHHHHHHHLLLEEEE
V C L A G S V P V TE
20. 1lvl 96.60 .KQVEVDGQRIQCEHL.LLATGSSSVE.LPMLPLGGPV....ISSTEALAP
.LEEEELLEEEELLEE.EELLLEEELL.ELLELLELLE....ELHHHHLLL
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 VPPFDHPMVIAGQGTLALELLQQDAHLDRVFVPVGGGGLAAGVAVLIKQLM
LLLLLLHHHHHHHHHHHHHHHHHLLLLLEEEEELLLLHHHHHHHHHHHHHL
P E G T A L L DA VGGGGM A I
2. 2tys_B 104.73 PYPtrEFQRMIGEETKA.QILdeGRLPDAVIACVGGggMFADF...IN..D
LHHHHHLLLHHHHHHHH.HHHHHLLLLLEEEEELLLLHHHHHH...LL..L
H PA I T A E LN V GGGG I IT A
3. 2kau_C 102.13 IHEdaTPAAIDCALTVADEmlNESGFVEDTLAAIggGGHAPDI.ITACA.H
EEHHLLHHHHHHHHHHHHHHLLLLLLHHHHHHHHLLLLLLLLH.HHHHH.L
V P P V G GTI V L G
4. 1cii 101.87 VSPEKFPgvVSGdgTIKiaVIDNRANLNYLLSHSgrNILNDRNPVVTEDVE
HLLLLLLEELLLLLEEEEEEELLHHHHHHHHLLLHHHLLLLLLLLLLLLHH
E A I GQ I L P D V V A A A
5. 1req_A 101.73 KMRIEEAAaisgqPLIGVnrLEHEPPLD..VLKVDNSTVLAekavKLRAER
HHHHHHHHHHHLLLLELLLLLLLLLLLL..LLLLLHHHHHHHHHHHHHHHL
P IA EV QV D L V G AGI T L
6. 2frv_B 101.27 NYDSLRPERIAEFRKLYKEviEQVYITDLLAVA....GFyaGIGKTSNFL.
LLHHHLHHHHHHHHHHHHHHHHHLHHHHHHHHH....HLLHHLLLLLLEE.
P E A AG G IA E VP D LVVPV GGG G A
7. 1aor_A 101.13 IGPAGeaAImaGRgaIAVEGSKTVPIAdmLVvpVAGGGLPkgTAVLVNIIN
LLHHHHELEEELLLEEEEELLLLLLLLLHHHHHHHHLHHHHLHHHHHHHHH
P I L L V G MV IA L
8. 1rom 100.67 CTEQVKPGNIDKSDAVQIAFL........LLVA.GNATMVNMIALGVATLA
HHHLLLLLLLLHHHHHHHHHH........HHHL.LHHHHHHHHHHHHHHHH
V N G I L L A G V IA K L
9. 1psd_A 100.53 VPENPSTKNMMGAKEISLMKPGSL.LINA......SRGTVVDiaLASKHlg
LLLLLLLLLLELHHHHHHLLLLEE.EEEL......LLLLLELHHHHLLLEE
V N A IA IA L P G G AG
10. 1trk_A 99.80 vnGNEDLAGIA..KAIAQAKLsdKPTLIKMTTTIGYGSLHAG.........
ELLLLLHHHHH..HHHHHHLLLLLLEEEEEELLLLLLLLLLL.........
V Q P V G N L A V P G G GIAI
11. 1kit 99.60 vlAQQTPEvlEKLGWTKIKTGNTMSLynASVNPGPGHGiiSgiaIVLDRFF
EHHLLLLLLHHHHLLEEEEEELLEEEELLEEELLLLLLELLLEEEEELLLL
H A G A E L P V L GG TI L
12. 1yge 99.20 GHLKSKDALEIGTKSlaFEslKSTPivHDLY..EGGIKLPRDVISTIIPL.
LLLLHHHLHHHHHHHHHHHHLLLLLLHHHHH..LLLEELLHHHHHHHLLL.
P I N V L VV G G A TIK
13. 1bhe 98.07 KLKQNTPRLIQINKSKNFTLYN.VSLINshVVFSDGDGFTA.WKTTIkgID
LLEELLLLLEEEELLEEEEEEE.EEEELLLEEEELLEEEEE.EEEEEELEE
EPA I GQGTI E V P G AI
14. 1uae 97.93 ENAAREpaKISGQGtiVIEGVERLGGGVYRVLpiETGTFLVAAAIS.....
ELLLLLHLEEELLLLEEEELLLLELLEEEELLEHHHHHHHHHHHHL.....
V NQ A I Q A E L D LV G G V IA AL
15. 1mro_B 97.33 VnknqAAAliLEQTAmaFERMHLLGlaDNLVFDlgKEGTvsVIalVERALE
HHLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLHHHHHHHHHHHH
A I G GT A V V G G A TI A
16. 1dik 97.13 RSWDNPRAiiPGdgT.AVN.......VQTMVFGNKGETSGTGVAFtiNAQG
HHLLLHHHHLLLLLE.EEE.......EEELLLLLLLLLLEEEEEEEELLLH
V P AVIA Q V G A A K
17. 3pmg_A 97.13 VNPSDSVAVIAAN.IFSIPYFQQTG.VRGFARSMPTSGALDRVA...NATK
LLHHHHHHHHHHL.HHHLHHHHHHL.LLLEEEELLLLLHHHHHH...HLLL
Q G GT P D VV V GG M A KAL
18. 1fcd_A 96.73 LDSSQTfeRLYGFGT.ENAMIEWHPGPDSAVVKVDGGEMMVETAfeFKalI
ELLLLLLHHHHLLLL.LLLLEEEELLLLLLEEEEEELLLEEEELLEEELEL
V P E A IA IAL L A V G V G IT A
19. 1aa6 96.60 vdPRkeTARIADM.HIALKNGSNIALLNAmfVASRTEgiVEGydItvSaiR
ELLLLHHHHHLLE.EELLLLLLHHHHHHHHHHHHHEELHHLLLHHHLLHHH
V G G I LE L QV V A P A A KAL
20. 1lvl 96.60 PKALPQHLVVVGGGYIGLEllaQVSVVEarILPTYDSELTAPVAESLkaLh
LLLLLLEEEEELLLHHHHHHHLEEEEELLLLLLLLLHHHHHHHHHHHHEEE
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 MPQIKVIAVEAEDS.ACLKAALDAGHPVDLPRVGLFAEGVakRIGDETFRL
LLLLEEEEEEELLL.LHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLHHHHH
SV EP LKG P I S GL P
2. 2tys_B 104.73 DTSVGLIGVEPGgeTGEHGAPLkgMKAPMMQTAdqIEESYSISAGLD.FPS
LLLLEEEEEEEEEHHLELLLHHHEEEEEELELLLLELLLLLLLHHHL.LLE
P PS L LH IA V S I T
3. 2kau_C 102.13 HPNIL.....PSSTNPTLPYTLN.TIDEHlhLDPDIAEDveSRIRRETIA.
LLLEE.....EEELLLLLLLELL.HHHHHHLLLLLLHHHHHHHLLHHHHH.
K Y AE D Q L L D K I I
4. 1cii 101.87 EGDKKIYNAEVAEWDKLRQRLlrNNLSARTNEQKHANDAlkENIRNQLSGI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KV AA N DD LKGE L TI GV S NT
5. 1req_A 101.73 RDPEKVKAaaAGNPddPDRNLLkgEMSDAlyTAqtIS.GVYSKEVKNTPEv
LLHHHHHHHHHHLLLLHHHLHHHHHHHHHHLLLLLLL.LLHHHHLLLLHHH
YQ G P I VK S G
6. 2frv_B 101.27 ..TCGEFPTDEYDLNSRyqGVIWGNDLSKVDdpDLIEEHVKYSwgAGAHHP
..ELLELLLLLLLHHHLLLEEEELLEEEEEELHHHEEEELLLLLLLLLELH
VKVYA E S CY L GEL NL IA GL T
7. 1aor_A 101.13 NEnvkvyaYEQSGekPCYanRLpgelGANlnDLASiaNHMCDELGLDTIST
HHLELLLHHHHLHHELLLLEEELEEHLHHHLLHHHHHHHHHHHHLELHHHH
AA PS A CY I D V G
8. 1rom 100.67 AQHPDQLaaNPSLAPqcyHTASALAIKRTAKEDVMIGDkvRANEGisNQSA
HHLHHHHHHLHHHHHHHHLLLLLLLLEEEELLLELLLLLELLLLEELHHHH
V EP D S L LTP A IGL
9. 1psd_A 100.53 gAAIDVFPTEPATNSDPFTSPlnVLLTPHIGGSTQEA...QENIGLEVAGK
EEEELLLLLLLLLLLLELLLHHLEEELLLLLLLLHHH...HHHHHHHHHHH
S V A P ADD Q K K P P D IG N W
10. 1trk_A 99.80 ..SHSVHGA.PLKADDVKQLKSKFGFNpsFVVPQEVYDHYQKTigvnKWNK
..LHHHLLL.LLLHHHHHHHHHHLLLLLLLLLLHHHHHHHHHHLHHHHHHH
V Y SN Q G P I KSS L T P
11. 1kit 99.60 FLNvsIYSDdgSN....WQT...GSTLP.......IPFRWKSSSILETlpS
LEEEEEEELLLLL....LEE...EEELL.......LLLEEEELLEEELLEE
P K Y Q T E IA GV GL PI
12. 1yge 99.20 .PVIklYRTDGQhqPHVVQVSQSAWMTDEEFAREMIA.GVNPCvgLEEFpi
.LLHHHLEELLLLLLHHHLLLLLHHHLHHHHHHHHHH.LLLLLLELLLLLL
PS K SN DD K KG T NLH G SIG T
13. 1bhe 98.07 DpsSKNITIAYSngDDNVAIkyKGRatRnlH..NDFGTGHGMSIGSETMGV
EELLEEEEEELLELLLLEEEELLLLLEEEEE..EEELLLLLEEEEEEELLE
K A P D KL I G I L
14. 1uae 97.93 ..RGKIinAQPDTLDA.VLAKLR.......DAGADIEVG.EDWISLDMHG.
..LLEEELLLHHHHHH.HHHHHH.......HLLLEEEEL.LLEEEEELLL.
VKVY Y G A GV S L
15. 1mro_B 97.33 EDGvkVYGTDDLAMWNAYAA..AGLMAATmqGAARAAQGVSSTldLIEFET
HLLLEEEELLLHHHHHHHHH..HHHHHHHHHHHHLLHHHHHHHHHHHHHHH
V V P DCY L L TI G G T PI
16. 1dik 97.13 GEDvgVRTPQpnDMPDCYKqdLAMKLEKHFRDMqtIEEglQTRNGKRTApi
HHHHLLLLLEEHHLHHHHHHHHHHHHHHHLLLLEEEELLEELLELLLLHHH
K Y P N D SKL GE T H E DG L
17. 3pmg_A 97.13 K..IALYE.TPtnLMD..ASKLsgegTGSDHIRE..KDGLWAVLalSILAT
L..LLEEE.ELLHHHH..LLLLLEEEEEELLLLL..LLHHHHHHHHHHHHH
P K A A C S IA KS N
18. 1fcd_A 96.73 IpaGKiiAGLTNDAGWcfESSIHKGI..HVIGDASIANPmkSGYSANSQGK
LLELHHHHLLLLLLLLELEELLLLLE..EELLLLEELLLLLLHHHHHHHHH
YA S A YQ L LT NL P V GL T P
19. 1aa6 96.60 RQAARMYAQAKSAatQFYqrSLtaMLTGNLGKPHAGVNPVRGqmgldTYPG
HHHHHHHHHLLLEEHLLLLHHHHHHHLLLLLLLLLLEEELLLELLELEELL
SV Y A D L EL P T G N I
20. 1lvl 96.60 hhSVEGYENGCLLANDGKGGQLRLelVAVGRRPRT..KGFNLelKMNGAAI
ELEEEEEELLEEEEELLLLLLLEELEELLLEEELL..LLLLHHLLEELLEE
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 LCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAG.MKKYIALH
HHLLLLLEEEEELHHHHHHHHHHHHHHLLLLLLHHHHHHHHH.HHHHHHHH
D T DE A ALA L
2. 2tys_B 104.73 SvhAYLndYVSITDDEALEAFKTLCRHEGIIPALESSHALAHAL..KMMRE
ELHHHHHEEEEEEHHHHHHHHHHHHHHHLLLELHHHHHHHHHHH..HHHHH
D D F T E T V RMK I PT G ALA V S F V
3. 2kau_C 102.13 .AEDVLHDlfSLTsgEVILRTWQVAHRMKvtINptHGIalavVWSPAFFGV
.HHHHHHHHLLEELLLHHHHHHHHHHHHHHLHHHHLLLLELEEELHHHLLL
I D T D I T G L AG A AA LS
4. 1cii 101.87 INQKIAedELKATKDAINFTTEFlygkAEQLAREMAgaAIAAAlsVKLSDI
HHHHHHHHHHHHHHHHHHHHHHHHHLHHHHHHHHHHHHHHHHHHLLLHHHH
R LV F E I A Q R K A FQT
5. 1req_A 101.73 vaRELVEE.FEQAepRILLAkqDGHDrqKVIATAYADLGFDVDVGPLFQTP
HHHHHHHH.HHHHHLEEEEELLLLLLHHHHHHHHHHHHLLEEEELLLLELH
T E Y R K E LA VL
6. 2frv_B 101.27 PYKGVTKPKWTEFHGEDRY.SWMKAPRYK..GEAFEVGPLASVLVAYAKKH
HHHLLLLELLLLLLELLEL.LLLLEEEEL..LELLELLHHHHHHHHHHLLL
L E A WG LIE AG LA L
7. 1aor_A 101.13 TGGTLAteLyhIKDEELGDAPPFRWGNTEVliEKIagDKLAerLAESYGH.
HHHHHHHHHHLLLHHHHLLLLLLLLLLLHHHHHHHHHHHHLLHHHHHLLL.
RD VF DE K Q G E A L V S Q
8. 1rom 100.67 ANRD..EEVFE.NPDEfkWPPqfGFGDHRCIAEHLAKAELTTVFSTLYQKF
HLLL..LLLLL.LLLLLLLLLLLLLHHHLLLLHHHHHHHHHHHHLLHHHHL
D T G L IE GVAL Q QT
9. 1psd_A 100.53 KLIKYSDNGSTLSAVNFPEVSLPLHGGRRLmieNRPGvaLNKIfaQYLQT.
HHHHHHHHLLLLLLLLLLLLLLLLLLLLEEEEELLLLHHHHHHHEEEEEE.
F E A L G EPT AV TV
10. 1trk_A 99.80 KLFSEYQKKFPELGAEL..ARRL.SGQLPANWepTYTAKDSAVATRKleTV
HHHHHHHHHLHHHHHHH..HHHH.HLLLLLLHHLLLLLLLLLEEHHHHHHH
D V D T Q V GR L G L A F
11. 1kit 99.60 SEADMVedLL.LTA.RLDF.NQIVNgrQQFL.SKDGGItlEANNANVFSNI
EEEEEEELEE.EEE.EELL.LLEELLEEEEE.ELLLLLLEEEEEHHHLLLL
I D DD T E I YAT G KL AG L A SQ
12. 1yge 99.20 iTADSLddGYTMDehDitYATRTIlgTLklSLPHSAG.DLSAAVSQVVLPA
LLHHHLLLLLLHHHHHHLLEEEEEELLEEEELLLLLL.LLLLLLLEEELLL
VDD T K A V GRMK IE G F V
13. 1bhe 98.07 VYNVTVDDlmNGTTNGlkSAAGVVNgrmkiVieKKEGSNVPDWSDITFKDV
EEEEEEEEEEELLLEEELLLLLEEEEEEEEEELLLLLLLLLEEEEEEEEEE
R V T Q L E T G V F V
14. 1uae 97.93 .KRPKAVNVRTAPHPAFPTDMQAQFTLLNLVAEGT.GFITETVFENRFMHV
.LLLELLLEELLLLLLLLHHHHHHHHHHHHHLELE.EEEELLLLLLLLLHH
VD F V G GVA A L Q
15. 1mro_B 97.33 TGLPSVD..fkVEGTAVGFssHSIYggNHIVTRHSKgvAAAMALDAGTQMF
HLLLLHH..HHHHHHHHHHHLLLLLLLLLLLLLLLLLHHHHHHHLLLLLLL
I DLVD E E K QL L PTA A AAV T
16. 1dik 97.13 iACDLVDEGMITEEeeAKSLDQLLHPTFnlPASptADEAKAavILVRLET.
HHHHHHHLLLLLHHHLHHHHHLLLLLLELEEEELLHHHHHLLEEEEELLL.
V D FT E E AT L G AV V
17. 3pmg_A 97.13 TRKQSVEDiftyEEVEAEGATKMMKDLEALMFDRsvGKQFSavYteYHDPV
HHLLLHHHHEEEEEELHHHHHHHHHHHHHHHHLLLLLLEEEEEEELEELLL
V E T P G AA
18. 1fcd_A 96.73 KVAAAAVVVLLKGEEP...GTPSYLNTCYSILAPAYGISVAAIYRPNAdsA
HHHHHHHHHHHHLLLL...LLLLEEEEEEEEEELLEEEEEEEEEEELLLLL
V D K A WGR L AG AA QTV
19. 1aa6 96.60 GYQ.YVKD....PANREKFAK..AWgrISELPHRAagEVRAAYieDPLQtv
LLE.ELLL....HHHHHHHHH..HLLLHHHHHHHHHLLLLEEEELLHHHHH
I D V V G M E AG A AAV V
20. 1lvl 96.60 IAIdsMHNVwdVAGEPMLAHRAMAQGEM..VAEIIAgaAIAAVCFTDPEVv
ELLLLLLLEEHHHLLLLLHHHHHHHHHH..HHHHHLLLLLLEEELLLLEEE
---
--- TOPITS ALIGNMENTS CONTINUED
---
1. 1tdj 189.93 HNIRGERLAHILSGANVNFHGLRYVSegEQREALLAVTI
HLLLLLEEEEELLLLLLLLLHHHHHHHHHLLEEEEEEEL
PE VLSGG D A
2. 2tys_B 104.73 EQPEKEQLLVvlSggDKDIFTVHDILKARGE
HLLLLLEEEEEELELHHHHHHHHHHHHLLLL
V P V GG A P P Q V
3. 2kau_C 102.13 VKPA.....TVIKGGMIAIAPMGDI.NASIPTP.QPV
LLLL.....EEEELLEEEEEEELLL.LLLLLLL.LLL
S S G TSL W G A R V
4. 1cii 101.87 ISSNLNR....FSRgaGKFTSldWigKAVRTENWRPLFV
HHHHHHH....HHHHLLLELLHHHHHHHHHLLLLHHHHH
V VL GG L L G RP TV
5. 1req_A 101.73 PEETArdVHVvlAGGHLTlkELDKLG...RPDILITV
HHHHHHLLLEELLLLHHHHHHHHHLL...LLLLEEEE
P VK V VL V L G A R A Q V V
6. 2frv_B 101.27 HEPTVKAVDLVLKTLGVGPEAlsTLGraArtAA.QEVEV
LHHHHHHHHHHHHHHLLLHHHLLHHHHHHHHHH.HHHHH
PE L G Y T
7. 1aor_A 101.13 ..PELS.....MTVKKLELPAYDPRGAEGHGLGYAT
..HHHL.....LEELLEELLLLLHHHLHHHHHHHHH
P K V L G T LN VG P
8. 1rom 100.67 F.PDLK.VAVPL..GKINYTPLNrvGIVDLPVIF
L.LLLE.ELLLH..HHLEELLLLLLLEEELEEEL
S V I V L QA P T
9. 1psd_A 100.53 .SAQMGYVViiEADEDVAEKAL....QAMKAIP.GTIR
.LLLEEEEEEEELLHHHHHHHH....HHHHLLL.LEEE
V V N L GG DLT N E Q S
10. 1trk_A 99.80 VLEDVYNQLPELIGGSADLTPSNLTRWKE.ALDFQPPS
HHHHHHLLLLLEEEEELLLHHHHLLLLLL.LLEELLHH
S I S G L N G Q
11. 1kit 99.60 ISTGTVDASItqSDGSHFLLFTNPQGNPAGTNGRQNLGL
LLLLLLLLEEELLLLLEEEEEEELLLLLLLLLLLELLEE
V L V L SLN E PA SV
12. 1yge 99.20 AKEGVESTIWLLAKAYvqLMslNTHAAME.paTHRHLSV
LLLHHHHHHHHHHHHHHHHHHHLLHHHHH.HHHHHHLLL
V E K V VL G N T N V V
13. 1bhe 98.07 VTSETKGV.VVLNGEniEVTMKNVKLTSDSTWQIKNVNV
EEELLLLE.EEEELLLEEEEEEEEELLLLLEEEEELEEE
V PE N IVLSG VDL A A TV
14. 1uae 97.93 V.PElsNTVivlSGAQvdLRASASLVLAGCIAEGTTV
H.HHHLLEEELLELLEELHHHHHHHHHHHHHLEEEEE
SPE IV S VD L V A A
15. 1mro_B 97.33 FSPEATSGLivFS..QVdrEPLKYVVEA..AAEIK
LLHHHHLHHHHHL..LLHHLHHHHHHHH..HHHLL
SPE GG TS V R VS
16. 1dik 97.13 .SPeiEGMhlTVRGG...MTSHAAV..VARGMGTCCVS
.LHHHHHHHEELLLL...LLLHHHH..HHHHHLLEEEE
V VKN G L G A T
17. 3pmg_A 97.13 VDGSvknlRLIFADGSRIIFRLSGTGSAG.....ATIRL
LLLLELLEEEEELLLLEEEEEEEELLLLL.....EEEEE
V SGG WV E Y
18. 1fcd_A 96.73 AIESVPD.....SGGVTPVDAPDWVLEREVQYAYSWYN
LEEELLL.....LLEELLLLLLHHHHHHHHHHHHHHHH
V V S V L S W G E A Q
19. 1aa6 96.60 vRKAFEDLELVisAADVILPSTSW.GEHetAagFQ
HHHHHHHLLEEEHLLLEEEEELLH.HHLLELLEEE
V PE CIV G L S V R Q V V
20. 1lvl 96.60 vtPEQASqcIvaNGRAMSLESKsfVRVVARRdgWQAVGV
ELHHHHHHEEEHLHHHHHLLLLLEEEEEEELLEEEEEEL
---
--- TOPITS ALIGNMENTS END
---
Identities computed with respect to: (1) predict_h1470 Colored by: consensus/70% and property
1 [ . . . . : . . . . 1 . . . . : . . . . 2 . . . . : . . . . 3 . . . ] 339
1 predict_h1470 100.0% MCAQYCISFADVEKAHINIQDSIHLTPVLTSSILNQIAGRNLFFKCELFQKTGSFKIRGALNAIRGLIPDTPEEKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPNCKKLAIQAYGASIVYCDPSDESREKVTQRIMQETEGILVHPNQEPAVIAGQGTIALEVLNQVPLVDALVVPVGGGGMVAGIAITIKALKPSVKVYAAEPSNADDCYQSKLKGELTPNLHPPETIADGVKSSIGLNTWPIIRDLVDDVFTVTEDEIKYATQLVWGRMKLLIEPTAGVALAAVLSQHFQTVSPEVKNVCIVLSGGNVDLTSLNWVGQAERPAPYQTVSV
2 1tdj 31.5% --AEY---LRAVLRAPVY--EAAQVTPllSSRLDNVI-----LVKREDRQPVHSFKLRGAYAMMAGL---TEEQKAHGVITASAGNHAQGVAfsARL-GVKALIVMPTATADIKVDAVRGFGGEVLLHGAndEAKAKAIE--LSQQQgtWVPPFDHPMVIAGQGTLALELLQQDAHLDRVFVPVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDS-ACLKAALDAGHPVDLPRVGLFAEGVakRIGDETFRLCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAG-MKKYIALHNIRGERLAHILSGANVNFHGLRYVSegEQREALLAVTI
3 2tys_B 21.0% -------ALNQLEEAFVRAQKDPEFQAQF-ADLLKNYAGrtALTKCqlYLKREDLLHGGatNQVLglLA-KRMGKSEIIAETGAGQHGVASALASALLGLKCRIyvERQSPN--VFRMRLMGAEvvhcNerDWSGSYETAHYMLGtaGPHPYPtrEFQRMIGEETKA-QILdeGRLPDAVIACVGGggMFADF---IN--DTSVGLIGVEPGgeTGEHGAPLkgMKAPMMQTAdqIEESYSISAGLD-FPSvhAYLndYVSITDDEALEAFKTLCRHEGIIPALESSHALAHAL--KMMREQPEKEQLLVvlSggDKDIFTVHDILKARGE--------
4 2kau_C 23.9% ------VRLADTE-LWIEVEDDLtlAavLTNaiVDhiVKADIGVkgRIFakAGNPDIqgavIAAEGKIvdtpQQAEEALvtMVGGGTGPAatHATTCTPGPWYISRMLQAADSLPVNIGLLGKGNVS-QP-DALREQVAAGVI----GLKIHEdaTPAAIDCALTVADEmlNESGFVEDTLAAIggGGHAPDI-ITACA-HPNIL-----PSSTNPTLPYTLN-TIDEHlhLDPDIAEDveSRIRRETIA-AEDVLHDlfSLTsgEVILRTWQVAHRMKvtINptHGIalavVWSPAFFGVKPA-----TVIKGGMIAIAPMGDI-NASIPTP-QPV--
5 1cii 18.9% ------LRAGNADAADITRQEFRLLQAELREYGftEIAGYdlRLHTelFADADSLRikDAQNADKKAADMLAeeRRKGIltRLskNGGAAL---AVLDAQQARLLGQQT-RNDR--AISEARNKLSSVTEsnTARNALtqQLTQqpDgtIVSPEKFPgvVSGdgTIKiaVIDNRANLNYLLSHSgrNILNDRNPVVTEDVEGDKKIYNAEVAEWDKLRQRLlrNNLSARTNEQKHANDAlkENIRNQLSGINQKIAedELKATKDAINFTTEFlygkAEQLAREMAgaAIAAAlsVKLSDISSNLNR----FSRgaGKFTSldWigKAVRTENWRPLFV
6 1req_A 23.2% ---------ADIEMAY-TLADGVDYIRAGESVGLnqFAPRLSFfgMNFFMEVA--KLRAARMLWAKLVHQFGPKNPksLRTHSQTS-GWSLtyaahTNSLDEAIALprIARNTQLFLQQESGTTRVI-DPWSGsvEELTWDliQEVegIPKMRIEEAAaisgqPLIGVnrLEHEPPLD--VLKVDNSTVLAekavKLRAERDPEKVKAaaAGNPddPDRNLLkgEMSDAlyTAqtIS-GVYSKEVKNTPEvaRELVEE-FEQAepRILLAkqDGHDrqKVIATAYADLGFDVDVGPLFQTPEETArdVHVvlAGGHLTlkELDKLG---RPDILITV--
7 2frv_B 22.1% VC-TYVHALASVRAVdvKIPENATLMRNLT--MGAQYMHDHlfYH---LHALDWVNVANALNaaNDLSptTTeaKVKALV--ESGQLG-IFTNAYFLGGHPAYVLPAetAHYLEALRVqaARAMAIF-----GAKNPHTQ--FTVVGGCTNYDSLRPERIAEFRKLYKEviEQVYITDLLAVA----GFyaGIGKTSNFL--TCGEFPTDEYDLNSRyqGVIWGNDLSKVDdpDLIEEHVKYSwgAGAHHPYKGVTKPKWTEFHGEDRY-SWMKAPRYK--GEAFEVGPLASVLVAYAKKHEPTVKAVDLVLKTLGVGPEAlsTLGraArtAA-QEVEV
8 1aor_A 20.1% -------RfvNLSTGDIKVEE---YDEELAKKWLGS-RGLAIY----LLLKEMDPTV-DPLSPENKLipLTGTSAPtnVVTKstgnSGGYFGAELKFAGYDA-IVVEGKAEkiKDEHIEIRDASHIWGKKVSETEATIRKEVGSEKVKisIGPAGeaAImaGRgaIAVEGSKTVPIAdmLVvpVAGGGLPkgTAVLVNIINEnvkvyaYEQSGekPCYanRLpgelGANlnDLASiaNHMCDELGLDTISTGGTLAteLyhIKDEELGDAPPFRWGNTEVliEKIagDKLAerLAESYGH--PELS-----MTVKKLELPAYDPRGAEGHGLGYAT---
9 1rom 21.1% -------SLAWLVTKHKDvaTSEKLSKVRtsASGKQAaaKPTFVDMdmHQRspTF-TPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALP-VPSYIivplTQQN----AIRTNGSSTAR-EASAANQelVEQRLVEPKDDilCTEQVKPGNIDKSDAVQIAFL--------LLVA-GNATMVNMIALGVATLAQHPDQLaaNPSLAPqcyHTASALAIKRTAKEDVMIGDkvRANEGisNQSANRD--EEVFE-NPDEfkWPPqfGFGDHRCIAEHLAKAELTTVFSTLYQKF-PDLK-VAVPL--GKINYTPLNrvGIVDLPVIF-----
10 1psd_A 21.1% --AGYtiEFhdDEqahiGLRSRTHLTevitNQvlDAAAKRGivFN-APFSNTRslVIGELLLLLRGV----PEANAKA--HRGVWNKLAAGSFEakL-GIIGYGHIG------TQLGILalGMYVYFYDIENknATQV-QHlmSDVVSLHVPENPSTKNMMGAKEISLMKPGSL-LINA------SRGTVVDiaLASKHlgAAIDVFPTEPATNSDPFTSPlnVLLTPHIGGSTQEA---QENIGLEVAGKLIKYSDNGSTLSAVNFPEVSLPLHGGRRLmieNRPGvaLNKIfaQYLQT-SAQMGYVViiEADEDVAEKAL----QAMKAIP-GTIR-
11 1trk_A 23.3% -------QFTDIDksTIRiaNSGhmApvLWSQmiNrsNGHAVALLYSMLHLTgdLSI-EDLKQFRQltPGHPeeLPGVEVT--TGPLGQGISNaaNLAAtpGFTliSSEASshLKLgiAIYDDNKITIDGATSIseDVAKRY--EAYGWEvnGNEDLAGIA--KAIAQAKLsdKPTLIKMTTTIGYGSLHAG----------SHSVHGA-PLKADDVKQLKSKFGFNpsFVVPQEVYDHYQKTigvnKWNKLFSEYQKKFPELGAEL--ARRL-SGQLPANWepTYTAKDSAVATRKleTVLEDVYNQLPELIGGSADLTPSNLTRWKE-ALDFQPPS-
12 1kit 23.1% ------IFYSVYDVASGNWQAPIDVtqVKERSF--QIAG---WGGSELYRRNTSlkIrgAANQIQ-----VADGSRKYVVTLSIDESGGLV---ANLNGVSAPIILQSEHAKVHSfeLQ-YSatTLFVDgsQEnrLHV-QKIVLTQQGhlvlAQQTPEvlEKLGWTKIKTGNTMSLynASVNPGPGHGiiSgiaIVLDRFFLNvsIYSDdgSN----WQT---GSTLP-------IPFRWKSSSILETlpSEADMVedLL-LTA-RLDF-NQIVNgrQQFL-SKDGGItlEANNANVFSNISTGTVDASItqSDGSHFLLFTNPQGNPAGTNGRQNLGL
13 1yge 20.2% --SAFNIHFegIPGAFY-IKNYMQVEFFLKSLTLEAISNqtIRFVCNSWvnTKLYK------SVRIFFANhpSETPAPLVSYrrGntGERKEY-DRIYDYDVYNDLGNPDKS-EKLARPVLGGSSTFPYptDPNTEKQGEVFYVPRDENLGHLKSKDALEIGTKSlaFEslKSTPivHDLY--EGGIKLPRDVISTIIPL-PVIklYRTDGQhqPHVVQVSQSAWMTDEEFAREMIA-GVNPCvgLEEFpiTADSLddGYTMDehDitYATRTIlgTLklSLPHSAG-DLSAAVSQVVLPAKEGVESTIWLLAKAYvqLMslNTHAAME-paTHRHLSV
14 1bhe 20.8% --PSSCTTlaDSSTATSTIQKALNNCDQGKAVRLS--AGssVFLSGPLSLPSgsLLikgtLRAVNN--AKSFENAPSsgVVDKNG------------KGCDAFI----TAVSTTNSGI--YGPGTI--DGQGGVkkKVSWWELAADAKVKKLKQNTPRLIQINKSKNFTLYN-VSLINshVVFSDGDGFTA-WKTTIkgIDpsSKNITIAYSngDDNVAIkyKGRatRnlH--NDFGTGHGMSIGSETMGVYNVTVDDlmNGTTNGlkSAAGVVNgrmkiVieKKEGSNVPDWSDITFKDVTSETKGV-VVLNGEniEVTMKNVKLTSDSTWQIKNVNV
15 1uae 21.5% --AALPILFAalAEEPVEIQNVPKLKDVDTskLLSqiDArnVF--CAPYdkT----MRASIWALGPLVARFGQGQVSLPGGCTIGARPVDLHI-SGLEQLGATIKLEekASVDGRLK----GAHIV----MDKVSVGATVTImaAtegiIENAAREpaKISGQGtiVIEGVERLGGGVYRVLpiETGTFLVAAAIS------RGKIinAQPDTLDA-VLAKLR-------DAGADIEVG-EDWISLDMHG-KRPKAVNVRTAPHPAFPTDMQAQFTLLNLVAEGT-GFITETVFENRFMHV-PElsNTVivlSGAQvdLRASASLVLAGCIAEGTTV--
16 1mro_B 18.3% -------ALSPLRNPAisIVQGIKRtaVNLEGIENALKTAKvgPACKIMGRELDLDIVGNAESikEMIQVTEDdgKRALVQVPSARFDVAAEyaAPLVTATAfiivSMYDANMVKAAVLgyPQSVEYMGANIATMLDIPQKL--EGPgiMVnknqAAAliLEQTAmaFERMHLLGlaDNLVFDlgKEGTvsVIalVERALEDGvkVYGTDDLAMWNAYAA--AGLMAATmqGAARAAQGVSSTldLIEFETGLPSVD--fkVEGTAVGFssHSIYggNHIVTRHSKgvAAAMALDAGTQMFSPEATSGLivFS--QVdrEPLKYVVEA--AAEIK----
17 1dik 22.8% -CTEYYNSGKQITQ---EIQDQIfitpLLVstILNL--GLN-DVAVEGfkKTGNP--RFAYDSYRRFIQMYSD-----VVMEVPKSHFEKIIDAMKEegV--HFDTDLTADDLKELA-EKFKA--VYKEagEEFPQEPKDQLMGAVKAVFRSWDNPRAiiPGdgT-AVN-------VQTMVFGNKGETSGTGVAFtiNAQGEDvgVRTPQpnDMPDCYKqdLAMKLEKHFRDMqtIEEglQTRNGKRTApiACDLVDEGMITEEeeAKSLDQLLHPTFnlPASptADEAKAavILVRLET-SPeiEGMhlTVRGG---MTSHAAV--VARGMGTCCVS-
18 3pmg_A 19.8% ICPDLKVDLGVLGKQQFDLEN--KFKP-FTVEIVDSVEarNIflK-ELLSGPNRLKIR--IDAMHGVveELGAPANSAvlEDFGGHHPDplTYAADL-------V--ETM----KSGEHDFGAAF----DGDGDRN-----MILGKHGFFVNPSDSVAVIAAN-IFSIPYFQQTG-VRGFARSMPTSGALDRVA---NATK--IALYE-TPtnLMD--ASKLsgegTGSDHIRE--KDGLWAVLalSILATRKQSVEDiftyEEVEAEGATKMMKDLEALMFDRsvGKQFSavYteYHDPVDGSvknlRLIFADGSRIIFRLSGTGSAG-----ATIRL
19 1fcd_A 21.1% --AKY-IKLADPSIEVTLIEPN---TDYYTCYLSNEVIGGD--------RKLESIK-HggLRA-HgvVHDSAtdPDKKLVKTAGgaPGIELIyaAKLP--HAWKAGEQTAILRKQLEDMADGGTVVIAPPAAPFreRASqyYLKasKVIILDSSQTfeRLYGFGT-ENAMIEWHPGPDSAVVKVDGGEMMVETAfeFKalIpaGKiiAGLTNDAGWcfESSIHKGI--HVIGDASIANPmkSGYSANSQGKVAAAAVVVLLKGEEP---GTPSYLNTCYSILAPAYGISVAAIYRPNAdsAIESVPD-----SGGVTPVDAPDWVLEREVQYAYSWYN-
20 1aa6 21.3% -CASGciNlaEAAQGKTN-QGTLCLKGYYGWDFInqILTPRL--KTPMIRRQRGGKLeeALNYVAERLSAIKEkgPDAIQTtsSRGTGNETNYvaRVXHGPSVAGLHQSVGN-GAmaINEIDNtlVfyNPAD-SHPIVANHVirNGAKIIvdPRkeTARIADM-HIALKNGSNIALLNAmfVASRTEgiVEGydItvSaiRQAARMYAQAKSAatQFYqrSLtaMLTGNLGKPHAGVNPVRGqmgldTYPGYQ-YVKD----PANREKFAK--AWgrISELPHRAagEVRAAYieDPLQtvRKAFEDLELVisAADVILPSTSW-GEHetAagFQ----
21 1lvl 22.1% ---------AAIRAGQLGIPTVLVEGQALGGTCLncIPSKALIHVAEQFHQASRFTEPSPLgaSPRliGQSVAWKDGIVDRLTTgkHGVKVvwAKVLDG--KQVEVDGQRIQCEHL-LLATGSSSVE-LPMLPLGGPV----ISSTEALAPKALPQHLVVVGGGYIGLEllaQVSVVEarILPTYDSELTAPVAESLkaLhhSVEGYENGCLLANDGKGGQLRLelVAVGRRPRT--KGFNLelKMNGAAIAIdsMHNVwdVAGEPMLAHRAMAQGEM--VAEIIAgaAIAAVCFTDPEVvtPEQASqcIvaNGRAMSLESKsfVRVVARRdgWQAVGV
consensus/100% ...............................................................................................................................................................................................................................................................................................................t...................................
consensus/90% ........ht........h.t....t..h...h.t...t.........h...........h...t.h.....t...t.......tt....h.....h......h............t....tt...................h....th......t.thh.t...ht.t..t........h....ttth...h...........t.h.....t....ht......h.............ht.t.t.p........hp..h...t.t.......h.tt.....t...s..hs..h...h.....th.......t.t.h.h...t................
consensus/80% .......hsshttt.hthtp...ht..htt.hhsth.s.t.h.....h.t.tt..h.tsh..htth...h.ptt.p.hhp..sst.s..h...h.h.th.thh.h..tt......t...hssthh.....t..tt.h...hh.t..shhh..ttp.thl.t.tthththhp....hp.hhh...stshhsth....p....shthhs.tstt..t.htt..th.h..th...t.ht.thp.phshp.hs.htt.hpphh..tt.th......hhsp.t.hhp...s.thsshht.hht....phtt..h.htstth.h.shs.hh.ht.....t....
consensus/70% .....thsshpts.hpltpshphpthhtsthhsph.stshhhtht.htpstshphttshtthpthh..hspttsphlhphssupts.th.htstl.sh.shhhh.tps.p.tthth..hssphlh.tststsptthtt.hh.tttuhhh..pppsthl.sttthshphhpphshhpthhhs.sstshhsshshhhpth..shplhsstsss.ss.hts.lphthssphtt.tthtpshptphuhpthshhtshhcchhphststh..spt.hhsphphhhp.ttuhshussht.hhpphpsphpth.hhhssuththtshshhhtsths..ht.h..
|
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 1.0 1991
PDBID predict_h14790
DATE file generated on 26-Jun-00
SEQBASE RELEASE 38.0 OF EMBL/SWISS-PROT WITH 80000 SEQUENCES
PARAMETER SMIN: -1.0 SMAX: 2.0
PARAMETER gap-open: 2.0 gap-elongation: 0.2
PARAMETER conservation weights: NO
PARAMETER InDels in secondary structure allowed: YES
PARAMETER alignments sorted according to :ZSCORE
THRESHOLD according to: ALL
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT e-mail (INTERNET) Schneider@EMBL-Heidelberg.DE or Sander@EMBL-Heidelberg.DE / fax +49-6221-387306
AVAILABLE Free academic use. Commercial users must apply for license.
AVAILABLE No inclusion in other databanks without permission.
HEADER predict_h14790.phdRdb
COMPND
SOURCE
AUTHOR
SEQLENGTH 339
NCHAIN 1 chain(s) in predict_h14790 data set
KCHAIN 1 chain(s) used here ; chain(s) : A
NALIGN 20
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCESSION: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of INSERTION IN THIS sequence
NOTATION : dots (....) in the alignend SEQUENCE INDICATE POINTS of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : EMBL/SWISSPROT identifier and alignment statistics
NR. ID STRID %IDE %SIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCESSION PROTEIN
1 : 1tdj 0.34 0.00 3 339 10 341 322 14 31 495 MOL_ID: 1;
2 : 2tys_B 0.28 0.00 8 331 22 394 312 27 75 397 MOL_ID: 1;
3 : 2kau_C 0.34 0.00 7 337 20 470 309 35 166 566 MOLECULE: KLEBSIELLA AEROGENES UREASE;
4 : 1cii 0.27 0.00 7 339 97 547 326 29 138 602 MOL_ID: 1;
5 : 1req_A 0.33 0.00 10 337 251 683 320 36 129 727 MOL_ID: 1;
6 : 2frv_B 0.29 0.00 1 339 61 421 316 27 74 530 MOL_ID: 1;
7 : 1aor_A 0.30 0.00 8 336 7 441 322 34 140 605 ALDEHYDE FERREDOXIN OXIDOREDUCTASE PROTEI
8 : 1rom 0.28 0.00 8 334 36 399 304 28 83 399 MOL_ID: 1;
9 : 1psd_A 0.30 0.00 3 338 24 399 305 33 102 404 D-3-PHOSPHOGLYCERATE DEHYDROGENASE (PHOSP
10 : 1trk_A 0.32 0.00 8 338 1 399 310 31 112 678 TRANSKETOLASE (E.C.2.2.1.1)
11 : 1kit 0.31 0.00 7 339 306 693 310 34 121 757 MOL_ID: 1;
12 : 1yge 0.27 0.00 3 339 79 520 330 31 135 839 MOL_ID: 1;
13 : 1bhe 0.29 0.00 3 339 12 374 310 31 84 376 MOL_ID: 1;
14 : 1uae 0.31 0.00 3 337 24 388 304 29 96 418 MOL_ID: 1;
15 : 1mro_B 0.27 0.00 8 335 25 439 318 32 107 442 MOL_ID: 1;
16 : 1dik 0.31 0.00 2 338 46 468 318 34 150 870 MOL_ID: 1;
17 : 3pmg_A 0.28 0.00 1 339 158 515 298 33 109 561 MOL_ID: 1;
18 : 1fcd_A 0.26 0.00 3 338 18 393 323 26 94 401 FLAVOCYTOCHROME C SULFIDE DEHYDROGENASE (
19 : 1aa6 0.30 0.00 2 335 11 463 323 40 147 698 MOL_ID: 1;
20 : 1lvl 0.28 0.00 10 339 20 408 318 26 83 458 DIHYDROLIPOAMIDE DEHYDROGENASE (E.C.1.8.1
## ALIGNMENTS 1 - 20
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 M 0 0 152 3 26 V I
2 2 C 0 0 86 5 0 C CC C
3 3 A 0 0 51 10 26 A . A SPA TPAA
4 4 Q 0 0 97 11 41 E T G ASA EDKS
5 5 Y 0 0 0 11 36 Y Y Y FSL YLYG
6 6 C 0 0 66 9 51 . V t NCP YK.c
7 7 I 0 0 0 13 31 . VL H i IITI NVIi
8 8 S 0 0 0 18 50 .ARR ARSEQFHTLASDKN
9 9 F 0 0 0 19 21 LLLA LfLFFYFlFLGLLl
10 10 A H 0 0 38 21 37 RNAGAAvAhTSeaASKGAaA
11 11 D H 0 0 58 21 42 AQDNDSNWdDVgDaPQVDEA
12 12 V H 0 0 0 21 39 VLTAIVLLDIYISlLILPAI
13 13 E H 0 0 69 21 45 LEEDERSVEDDPSARTGSAR
14 14 K H 0 0 100 20 46 RE.AMATTqkVGTENQKIQA
15 15 A H 0 0 0 20 36 AALAAVGKasAAAEP.QEGG
16 16 H H 0 0 66 20 53 PFWDYdDHhTSFTPA.QVKQ
17 17 I H 0 0 82 19 37 VVII.vIKiIGYSVi.FTTL
18 18 N H 0 0 56 20 46 YRETTKKDGRN.TEsEDLNG
19 19 I H 0 0 0 19 28 .AVRLIVvLiWIIIIILI.I
20 20 Q H 0 0 71 20 36 .QEQAPEaRaQKQQVQEEQP
21 21 D H 0 0 79 21 37 EKDEDEETSNANKNQDNPGT
22 22 S H 0 0 0 19 50 ADDFGN.SRSPYAVGQ.NTV
23 23 I H 0 0 0 18 45 APLRVA.ETGIMLPII..LL
24 24 H 0 0 66 19 49 QEtLDT.KHhDQNKKfK.CV
25 25 L 0 0 80 20 35 VFlLYLYLLmVVNLRiF.LE
26 26 T 0 0 0 21 44 TQAQIMDSTAtECKttKTKG
27 27 P 0 0 0 21 44 PAaARREKepqFDDapPDGQ
28 28 V 0 0 0 20 45 lQvEANEVvvVFQVVL.YYA
29 29 L 0 0 59 21 42 lFLLGLLRiLKLGDNLFYYL
30 30 T 0 0 69 20 44 S.TRETAttWEKKTLVTTGG
31 31 S 0 0 0 20 41 SANES.KsNSRSAsEsVCWG
32 32 S H 0 0 63 20 52 RDaYV.KAQQSLVkGtEYDT
33 33 I H 0 0 82 21 43 LLiGGMWSvmFTRLIIILFC
34 34 L H 0 0 0 20 36 DLVfLGLGli.LLLELVSIL
35 35 N H 0 0 0 20 32 NKDtnAGKDN.ESSNNDNnn
36 36 Q H 0 0 97 20 44 VNhEqQSQArQA.qALSEqc
37 37 I 0 0 60 18 33 IYiIFY.AAsII.iL.VVII
38 38 A 0 0 0 19 43 .AVAAMRaANASADK.EILP
39 39 G 0 0 41 20 35 .GKGPHGaKGGNGATGaGTS
40 40 R 0 0 0 19 49 .rAYRDLKRH.qsrALrGPK
41 41 N E 0 0 56 19 44 .tDdLHAPGA.tsnKNNDRA
42 42 L E 0 0 0 17 29 .AIlSlITiV.IVVv.I.LL
43 43 F E 0 0 0 19 45 LLGRFfYFvAWRFFgDf..I
44 44 F E 0 0 0 17 40 VTVLfY.VFLGFL.PVl..H
45 45 K E 0 0 73 18 47 KKkHgH.DNLGVS.AAK.KV
46 46 C E 0 0 48 16 47 RCgTM..M.YSCGCCV..TA
47 47 E 0 0 95 18 37 EqReN..dASENPAKEE.PE
48 48 L 0 0 80 19 40 DlIlF.LmPMLSLPIGL.MQ
49 49 F 0 0 70 20 30 RYFFFLLHFLYWSYMfL.IF
50 50 Q 0 0 71 21 49 QLaAMHLQSHRvLdGkSRRH
51 51 K 0 0 73 21 40 PKkDEAKRNLRnPkRKGKRQ
52 52 T 0 0 0 21 46 VRAAVLEsTTNTSTETPLQA
53 53 G 0 0 21 20 46 HEGDADMpRgTKg.LGNERS
54 54 S 0 0 0 19 42 SDNS.WDTsdSLs.DNRSGR
55 55 F 0 0 0 19 35 FLPL.VPFlLlYL.LPLIGF
56 56 K E 0 0 73 18 41 KLDRKNT.VSkKL.D.KKKT
57 57 I E 0 0 0 18 29 LHIiLVVTIII.iMI.I.LE
58 58 R 0 0 0 18 42 RGqkRA.PG.r.kRVRRHeP
59 59 G H 0 0 0 19 33 GGgDANDEEEg.gAGF.geS
60 60 A H 0 0 0 19 30 AaaAAAPALDA.tSNA.gAP
61 61 L H 0 0 0 20 39 YtvQRLLVLLA.LIAYILLL
62 62 N H 0 0 0 20 46 ANINMNSKLKN.RWEDDRNg
63 63 A H 0 0 0 21 42 MQAALaPNLQQSAASSAAYa
64 64 I H 0 0 0 20 44 MVADWaELLFIVVLiYM.VS
65 65 R H 0 0 121 21 44 ALEKANNQRRQRNGkRHHAP
66 66 G H 0 0 41 20 42 GgGKKDKPGQ.INPERGgER
67 67 L 0 0 0 19 32 LlKALLLYVl.F.LMFVvRl
68 68 I 0 0 42 17 29 .LIAVSiI.t.F.VIIvVLi
69 69 P 0 0 87 18 41 .AvDHppQ.P.AAAQQeHSG
70 70 D 0 0 79 17 48 ..dMQtLR.G.NKRVMEDAQ
71 71 T 0 0 51 20 47 TKtLFTTT.HVhSFTYLSIS
72 72 P H 0 0 66 21 44 ERpAGTGVPPApFGESGAKV
73 73 E H 0 0 95 21 34 EMQePeTDEeDSEQDDAtEA
74 74 E H 0 0 95 20 41 QGQeKaSDAeGENGd.PdkW
75 75 K 0 0 73 20 46 KKARNKALNLSTAQg.APgK
76 76 P 0 0 66 20 44 ASERPVPLAPRPPVK.NDPD
77 77 K 0 0 100 20 40 HEEKkKtEKGKASSR.SKDG
78 78 A 0 0 0 20 45 GIAGsAnQAVYPsLA.AKAI
79 79 V E 0 0 0 20 29 VILILLVM.EVLgPLVvLIV
80 80 V E 0 0 0 20 35 IAvlRVVK.VVVVGVVlVQD
81 81 T E 0 0 0 20 41 TEttT.TQHTTSVGQMEKTR
82 82 H E 0 0 0 19 52 ATMRH.KKR.LYDCVEDTtL
83 83 S E 0 0 0 20 46 SGVLSEsGG.SrKTPVFAsT
84 84 S 0 0 0 21 44 AAGsQStCVTIrNISPGGST
85 85 G 0 0 0 21 32 GGGkTGgAWGDGGGAKGgRg
86 86 N H 0 0 0 20 39 NQGNSQnNNPEn.ARSHaGk
87 87 H H 0 0 0 19 50 HHTG.LSGKLSt.RFHHPTH
88 88 G H 0 0 0 20 29 AGGGGGGPLGGG.PDFPGGG
89 89 Q H 0 0 0 19 48 QVPAW.GVAQGE.VVEDINV
90 90 A H 0 0 0 20 46 GAAASIYDAGLR.DAKpEEK
91 91 L H 0 0 0 20 39 VSaLLFFLGIVK.LAIlLTV
92 92 T H 0 0 0 18 44 AAt.tTGVSS.E.HEITINv
93 93 Y H 0 0 0 18 42 fLH.yNAKFN.Y.IyDYyYw
94 94 A H 0 0 0 16 24 sAA.aAEEEa....aAAavA
95 95 A H 0 0 0 20 38 ASTAaYLFaaAD.SAMAAaK
96 96 K H 0 0 100 20 48 RATVhFKAkNNR.GPKDKRV
97 97 L H 0 0 59 20 24 LLCLTLFLLLLI.LLELLVL
98 98 E 0 0 0 17 48 .LTDNGAP.ANYKEVe.PXD
99 99 G 0 0 0 18 29 GGPASGG.GAGDGQTg..HG
100 100 I 0 0 0 18 47 VLGQLHYVItVYCLAV..G.
101 101 P 0 0 48 18 42 KKPQDPDPIpSDDGT..HP.
102 102 A E 0 0 0 19 36 ACWAEAASGGAVAAA..ASK
103 103 Y E 0 0 0 19 46 LRYRAY.YYFPYFTfH.WVQ
104 104 I E 0 0 0 20 33 IIILIVIIGTINIIiF.KAV
105 105 V E 0 0 0 20 47 VySLALViHlID.KiDVAGE
106 106 V E 0 0 0 19 36 MvRGLPVvIiLL.LvT.GLV
107 107 P 0 0 0 19 41 PEMQpAEpGSQG.ESD.EHD
108 108 Q 0 0 97 19 48 TRLQreGl.SSN.eMLEQQG
109 109 T 0 0 69 20 43 AQQTItKT.EEPTkYTTTSQ
110 110 A 0 0 16 19 37 TSA.AAAQ.AHDAADAMAVR
111 111 P H 0 0 0 19 48 APARRHEQ.SAKVSAD.IGI
112 112 N H 0 0 76 19 42 DNDNNYkN.sKSSVND.LNQ
113 113 C H 0 0 0 15 52 I.SDTLi..hV.TDML.R.C
114 114 K H 0 0 73 18 47 K.LRQEK.TLHETGVK.KGE
115 115 K H 0 0 0 19 46 VVP.LAD.QKSKNRKEKQAH
116 116 L H 0 0 80 19 36 DFV.FLE.LLfLSLALSLmL
117 117 A H 0 0 0 20 40 ARNALRHAGgeAGKAAGEa.
118 118 I H 0 0 0 19 33 VMIIQVIIIiLRI.V.EDIL
119 119 Q H 0 0 71 19 48 RRGSQqERLAQP..LEHMNL
120 120 A H 0 0 51 18 44 GLLEEaITaI.V..gKDAEA
121 121 Y H 0 0 55 20 47 FMLASARNlYYLY.yFFDIT
122 122 G 0 0 0 21 28 GGGRGRDGGDSGGGPKGGDG
123 123 A 0 0 0 21 37 GAKNTAASMDaGPAQAAGNS
124 124 S 0 0 63 20 44 EEGKTMSSYNtSGHS.ATtS
125 125 I E 0 0 0 20 47 VvNLRAHTVKTSTIV.FVlS
126 126 V E 0 0 0 20 32 LvVSVIIAYILTIVEV.VVV
127 127 Y E 0 0 0 18 43 LhSSIFWRFTFF..YY.IfE
128 128 C E 0 0 0 13 52 Hc.V..G.YIVP..MK.Ay.
129 129 D E 0 0 0 18 41 GNQTD.KEDDDYD.GE.PNL
130 130 P 0 0 48 19 37 AePEP.KAIGgpG.AaDPPP
131 131 S 0 0 63 19 48 nr.sW.VSEAstQMNgGAAM
132 132 D 0 0 40 20 38 dDDnS.SANTQDGDIEDADL
133 133 E H 0 0 95 20 43 EWATGGEAkSEPGKAEGP.P
134 134 S H 0 0 63 21 48 ASLAsATNnInNVVTFDFSL
135 135 R H 0 0 0 21 45 KGRRvKEQAsrTkSMPRrHG
136 136 E H 0 0 95 21 43 ASENENAeTeLEkVLQNePG
137 137 K H 0 0 100 20 49 KYQAEPTlQDHKKGDE.RIP
138 138 V H 0 0 69 20 36 AEVLLHIVVVVQVAIP.AVV
139 139 T H 0 0 0 17 40 ITAtTTRE.A.GSTPK.SA.
140 140 Q H 0 0 97 19 42 EAAqWQKQQKQEWVQD.qN.
141 141 R H 0 0 121 17 48 .HGQD.ERHRKVWTKQ.yH.
142 142 I H 0 0 0 18 31 .YVLl.VLlYIFEILLMYV.
143 143 M H 0 0 92 19 31 LMITiFGVm.VYLm.MILiI
144 144 Q H 0 0 97 18 49 SL.QQTSES.LVAa.GLKrS
145 145 E H 0 0 95 20 39 QG.qEVEPDETPAAEAGaNS
146 146 T 0 0 69 20 46 Qt.pVVKKVAQRDtGVKsGT
147 147 E 0 0 95 20 44 Qa.DeGVDVYQDAePKHKAE
148 148 G 0 0 21 21 30 gGGggGKDSGGEKggAGVKA
149 149 I E 0 0 82 21 41 tPLtICiiLWhNViiVFIIL
150 150 L E 0 0 0 21 48 WHKIPTslHElLKIMFFIIA
151 151 V E 0 0 0 21 42 VPIVKNICVvvGKEVRVLvP
152 152 H E 0 0 0 21 50 PYHSMYGTPnlHLNnSNDdK
153 153 P 0 0 0 21 45 PPEPRDPEEGALKAkWPSPA
154 154 N 0 0 0 21 48 FtdEISAQNNQKQAnDSSRL
155 155 Q 0 0 160 21 45 DraKELGVPEQSNRqNDQkP
156 156 E 0 0 48 21 44 HETFEReKSDTKTEAPSTeQ
157 157 P 0 0 0 21 44 PFPPAPaPTLPDPpARVfTH
158 158 A 0 0 0 21 35 MQAgAEAGKAEARaAAAeAL
159 159 V E 0 0 0 21 46 VRAvaRINNGvLLKliVRRV
160 160 I E 0 0 0 21 19 IMIViImIMIlEIIiiILIV
161 161 A 0 0 0 21 45 AIDSsAaDMAEIQSLPAYAV
162 162 G 0 0 0 20 31 GGCGgEGKG.KGIGEGAGDG
163 163 Q 0 0 0 20 49 QEAdqFRSA.LTNQQdNFMG
164 164 G 0 0 0 19 39 GELgPRgDKKGKKGTg.G.G
165 165 T H 0 0 0 21 45 TTTTLKaAEAWSStATITHY
166 166 I H 0 0 0 19 29 LKVIILIVIITlKim.F.II
167 167 A H 0 0 0 21 38 AAAKGYAQSAKaNVaASEAG
168 168 L H 0 0 0 20 37 L.DiVKVILQIFFIFVINLL
169 169 E H 0 0 69 21 37 EQEanEEAMAKETEENPAKE
170 170 V H 0 0 0 20 50 LImVrvGFKKTsLGR.YMNl
171 171 L H 0 0 0 20 36 LLlILiSLPLGlYVM.FIGl
172 172 N H 0 0 76 19 37 QdNDEEK.GsNKNEH.QESa
173 173 Q H 0 0 97 18 45 QeENHQT.SdTS.RL.QWNQ
174 174 V H 0 0 0 19 47 DGSREVV.LKMTVLL.THIV
175 175 P 0 0 66 18 38 ARGAPYP..PSPSGG.GPAS
176 176 L 0 0 80 18 42 HLFNPII.LTLiLGl..GLV
177 177 V 0 0 0 20 38 LPVLLTA.ILyvIGaVVPLV
178 178 D 0 0 58 20 33 DDENDDd.NInHNVDQRDNE
179 179 A E 0 0 0 19 48 RADY.Lm.AKADsYNTGSAa
180 180 L E 0 0 0 19 39 VVTL.LLL.MSLhRLMFAmr
181 181 V E 0 0 0 20 31 FILLVAVL.TVYVVVVAVfI
182 182 V E 0 0 0 19 40 VAASLVvV.TN.VLFFRVVL
183 183 P E 0 0 0 19 44 PCAHKApA.TP.FpDGSKAP
184 184 V 0 0 0 18 41 VVISV.V..IGESilNMVST
185 185 G 0 0 0 19 37 GGggD.AG.GPGDEgKPDRY
186 186 G 0 0 0 20 36 GGgrN.GNSYGGGTKGTGTD
187 187 G H 0 0 0 20 37 GgGNS.GARGHIDGEESGES
188 188 G H 0 0 0 21 30 GgGITGGTGSGKGTGTGEgE
189 189 M H 0 0 0 21 35 LMHLVFLMTLiLFFTSAMiL
190 190 V H 0 0 0 21 46 AFANLyPVVHiPTLvGLMVT
191 191 A H 0 0 0 21 41 AAPDAakNVASRAVsTDVEA
192 192 G H 0 0 0 20 37 GDDReGgMDGgD.AVGREGP
193 193 I H 0 0 0 20 39 VFINkITIi.iVWAIVVTyV
194 194 A H 0 0 0 18 23 A..PaGAAa.aIKAaAAAdA
195 195 I H 0 0 0 18 36 V.IVvKVLL.ISTIlF.fIE
196 196 T H 0 0 0 18 41 L.TVKTLGA.VTTSVt.etS
197 197 I H 0 0 0 18 37 IIATLSVVS.LII.Ei.FvL
198 198 K H 0 0 100 19 41 KNCERNNAK.DIk.RNNKSk
199 199 A H 0 0 51 18 40 Q.ADAFITH.RPg.AAAaaa
200 200 L H 0 0 0 17 29 L..VELILl.FLI.LQTliL
201 201 K 0 0 51 17 49 MDHER.NAg.F.D.EGKIRh
202 202 P 0 0 66 17 41 PTPGD.EQA.LPp.DE.pQh
203 203 S 0 0 63 20 43 QSNDPTnHASNVsRGD.aAS
204 204 V 0 0 0 21 41 IVIKECvPIHvISGvvIGAV
205 205 K E 0 0 73 21 38 KGLKKGkDDSskKKkgAKRE
206 206 V E 0 0 0 20 32 VL.IVEvQVVIlNIVVLiMG
207 207 Y E 0 0 0 20 36 II.YKFyLFHYYIiYRYiYY
208 208 A E 0 0 0 20 37 AG.NAPaaPGSRTnGTEAAE
209 209 A E 0 0 0 19 43 VV.AaTYaTADTIATP.GQN
210 210 E 0 0 48 19 34 EE.EaDENE.dDAQDQTLAG
211 211 P 0 0 0 21 41 APPVAEQPPPgGYPDpPTKC
212 212 S 0 0 63 21 46 EGSAGYSSALSQSDLntNSL
213 213 N 0 0 0 21 42 DgSENDGLTKNhnTADnDAL
214 214 A H 0 0 0 20 47 SeTWPLeANA.qgLMMLAaA
215 215 D H 0 0 0 19 45 .TNDdNkPSD.PDDWPMGtN
216 216 D H 0 0 0 20 40 AGPKdSPqDD.HDANDDWQD
217 217 C H 0 0 0 18 50 CETLPRCcPV.VN.AC.cFG
218 218 Y H 0 0 79 20 44 LHLRDyYyFKWVVVYY.fYK
219 219 Q H 0 0 0 21 41 KGPQRqaHTQQQALAKAEqG
220 220 S H 0 0 0 21 48 AAYRNGnTSLTVIAAqSSrG
221 221 K H 0 0 0 19 47 APTLLVRAPK.SkK.dKSSQ
222 222 L H 0 0 80 19 28 LLLlLILSlS.QyL.LLILL
223 223 K 0 0 100 20 44 DkNrkWpAnK.SKRAAsHtR
224 224 G 0 0 68 19 41 Ag.NgGgLVFGAG.GMgKaL
225 225 E 0 0 95 20 49 GMTNENeALGSWR.LKeGMe
226 226 L 0 0 59 20 39 HKILMDlILFTMa.MLgILl
227 227 T 0 0 0 19 44 PADSSLGKTNLTt.AET.TV
228 228 P E 0 0 0 19 41 VPEADSARPpPDR.AKG.GA
229 229 N 0 0 76 19 46 DMHRAKNTHs.En.THSHNV
230 230 L 0 0 0 19 38 LMlTlVlAIF.El.mFDVLG
231 231 H 0 0 90 20 49 PQhNyDnKGV.FHDqRHIGR
232 232 P 0 0 66 19 46 RTLETdDEGV.A.AGDIGKR
233 233 P 0 0 0 19 45 VADQApLDSP.R.GAMRDPP
234 234 E 0 0 69 20 42 GdPKqDAVTQ.ENAAqEAHR
235 235 T 0 0 0 19 48 LqDHtLSMQE.MDDRt.SAT
236 236 I E 0 0 0 19 29 FIIAIIiIEVIIFIAI.IG.
237 237 A E 0 0 0 20 39 AEANSEaGAYPAGEAEKAV.
238 238 D 0 0 79 18 37 EEED.END.DF.TVQEDNNK
239 239 G 0 0 0 20 37 GSDAGHHk.HRGGGGgGPPG
240 240 V 0 0 0 19 33 VYvlVVMv.YWVH.VlLmVF
241 241 K 0 0 0 21 47 aSekYKCRQQKNGESQWkRN
242 242 S E 0 0 0 21 47 kISESYDAEKSPMDSTASGL
243 243 S 0 0 46 21 46 RSRNKSENNTSCSWTRVGqe
244 244 I 0 0 0 21 41 IAIIEwLEIiSvIIlNLYml
245 245 G 0 0 0 21 33 GGRRVgGGGgIgGSdGaSgK
246 246 L H 0 0 80 21 42 DLRNKALiLvLLSLLKlAlM
247 247 N H 0 0 56 21 34 EDEQNGDsEnEEEDIRSNdN
248 248 T H 0 0 0 20 40 T.TLTATNVKTETMETISTG
249 249 W H 0 0 0 21 49 FFISPHIQAWlFMHFALQYA
250 250 P H 0 0 48 21 38 RPAGEHSSGNppGGEpAGPA
251 251 I H 0 0 0 19 45 LS.IvPTAKKSiV.TiTKGI
252 252 I H 0 0 0 21 52 CvANaYGNLLETYKGARVYA
253 253 R H 0 0 89 21 51 QhEQRKGRIFAANRLCKAQI
254 254 D H 0 0 79 20 38 EADKEGTDKSDDVPPDQA.d
255 255 L H 0 0 0 20 50 YYVILVL.YEMSTKSLSAYs
256 256 V 0 0 0 20 36 LLLAVTA.SYVLVAVVVAVM
257 257 D 0 0 79 21 35 DnHeEKtEDQedDVDDEVKH
258 258 D 0 0 79 20 26 DdDdEPeENKddDN.EDVDN
259 259 V E 0 0 0 18 44 IYlE.KLVGKLGlV.GiV.V
260 260 F E 0 0 0 20 30 IVfLFWyFSFLYmRfMfL.w
261 261 T E 0 0 0 19 43 TSSKEThETP.TNTkItL.d
262 262 V E 0 0 0 19 46 VILAQEI.LELMGAVTyK.V
263 263 T 0 0 0 21 44 DTTTAFKNSLTDTPEEEGPA
264 264 E 0 0 95 21 38 SDsKeHDPAGAeTHGEEEAG
265 265 D H 0 0 79 20 39 DDgDpGEDVA.hNPTeVENE
266 266 E H 0 0 69 21 37 AEEAREEENERDGAAeEPRP
267 267 I H 0 0 0 20 38 IAVIIDLfFLLilFVAA.EM
268 268 K H 0 0 0 19 51 CLINLRGkP.DtkPGKE.KL
269 269 Y H 0 0 0 19 50 AELFLYDWE.FYSTFSG.FA
270 270 A H 0 0 0 19 36 AARTA.APVA.AADsLAGAH
271 271 T H 0 0 0 21 46 MFTTkSPPSRNTAMsDTTKR
272 272 Q H 0 0 97 20 45 KKWEqWPqLRQRGQHQKP.A
273 273 L H 0 0 80 20 48 DTQFDMFfPLITVASLMS.M
274 274 V H 0 0 0 20 44 LLVlGKRGL.VIVQILMYAA
275 275 W H 0 0 0 21 50 FCAyHAWFHSNlNFYHKLWQ
276 276 G H 0 0 41 21 30 ERHgDPGGGGgggTgPDNgG
277 277 R H 0 0 121 21 46 DHRkrRNDGQrTrLgTLTrE
278 278 M 0 0 0 21 50 VEMAqYTHRLQLmLNFECIM
279 279 K 0 0 73 20 43 RGKEKKERRPQkkNHnAYS.
280 280 L E 0 0 0 19 41 AIvQV.VCLAFliLIlLSE.
281 281 L E 0 0 0 20 33 VItLI.lImNLSVVVPMILV
282 282 I E 0 0 0 20 44 APIAAGiAiW.LiATAFLPA
283 283 E E 0 0 69 21 36 EANRTEEEeeSPeERSDAHE
284 284 P 0 0 0 21 46 PLpEAAKHNpKHKGHpRPRI
285 285 T 0 0 0 21 49 SEtMYFILRTDSKTStsAAI
286 286 A H 0 0 0 20 41 GSHAAEaAPYGAE.KAvYaA
287 287 G H 0 0 0 21 23 ASGgDVgKGTGGGGgDGGgg
288 288 V H 0 0 0 20 47 LHIaLGDAvAI.SFvEKIEa
289 289 A H 0 0 0 21 39 AAaAGPKEaKtDNIAAQSVA
290 290 L H 0 0 0 21 35 LLlIFLLLLDlLVTAKFVRI
291 291 A H 0 0 0 21 29 AAaADAATNSESPEAASAAA
292 292 A H 0 0 0 21 37 GHvAVSeTKAAADTMaaAAA
293 293 V H 0 0 0 20 42 .AVADVrVIVNAWVAvvIYV
294 294 L H 0 0 59 21 39 MLWlVLLFfANVSFLIYYiC
295 295 S H 0 0 63 20 47 K.SsGVASaTASDEDLtReF
296 296 Q H 0 0 97 20 46 K.PVPAETQRNQINAVePDT
297 297 H H 0 0 90 21 55 YKAKLYSLYKVVTRGRYNPD
298 298 F 0 0 96 21 38 IMFLFAYYLlFVFFTLHALP
299 299 Q 0 0 97 21 45 AMFSQKGQQeSLKMQEDdQE
300 300 T 0 0 69 21 47 LRGDTKHKTTNPDHMTPstV
301 301 V 0 0 69 18 40 HEVIPH.F.VIAVVF.VAvv
302 302 S 0 0 63 18 45 NQKSEE..SLSKT.SSDIRt
303 303 P 0 0 66 21 36 IPPSEPPPAETESPPPGEKP
304 304 E 0 0 95 21 35 REANTTEDQDGGEEEeSSAE
305 305 V 0 0 69 20 40 GK.LAVLLMVTVTlAivVFQ
306 306 K 0 0 100 20 45 EE.NrKSKGYVEKsTEkPEA
307 307 N E 0 0 56 18 42 RQ.RdA..YNDSGNSGnDDS
308 308 V E 0 0 0 17 39 LL..VV.VVQATVTGMl.Lq
309 309 C E 0 0 0 16 52 AL..HD.AVLSI.VLhR.Ec
310 310 I E 0 0 0 18 34 HVT.VL.ViPIWViilL.LI
311 311 V E 0 0 0 18 27 IvV.vV.PiEtLVvvTI.Vv
312 312 L E 0 0 0 20 27 LlIFlLMLELqLLlFVF.ia
313 313 S 0 0 0 20 37 SSKSAKT.AISANSSRASsN
314 314 G 0 0 0 19 29 GgGRGTV.DGDKGG.GDGAG
315 315 G 0 0 0 20 31 AgGgGLKGEGGAEA.GGGAR
316 316 N 0 0 0 20 47 NDMaHGKKDSSYnQQ.SVDA
317 317 V 0 0 0 20 37 VKIGLVLIVAHvivV.RTVM
318 318 D H 0 0 58 20 45 NDAKTGENADFqEdd.IPIS
319 319 L H 0 0 0 21 32 FIIFlPLYELLLVLrMIVLL
320 320 T H 0 0 0 21 49 HFATkEPTKTLMTRETFDPE
321 321 S H 0 0 46 21 41 GTPSEAAPAPFsMAPSRASS
322 322 L H 0 0 0 21 42 LVMlLlYLLSTlKSLHLPTK
323 323 N H 0 0 56 20 37 RHGdDsDN.NNNNAKASDSs
324 324 W H 0 0 81 20 55 YDDWKTPr.LPTVSYAGWWf
325 325 V H 0 0 0 19 38 VIIiLLRv.TQHKLVVTV.V
326 326 G H 0 0 30 18 42 SL.gGGGG.RGALVV.GLGR
327 327 Q H 0 0 97 19 49 eKNK.rAIQWNATLE.SEEV
328 328 A H 0 0 51 20 40 gAAA.aEVAKPMSAAVARHV
329 329 E H 0 0 0 19 38 ERSV.AGDMEAEDG.AGEeA
330 330 R H 0 0 121 17 49 QGIRRrHLK.G.SC.R.VtR
331 331 P 0 0 66 20 41 REPTPtGPAATpTIAG.QAR
332 332 A 0 0 51 19 49 E TEDALVILNaWAAM.Yad
333 333 P 0 0 66 19 41 A PNIAGIPDGTQEEG.Agg
334 334 Y 0 0 108 16 42 L .WL.YF.FRHIGIT.YFW
335 335 Q 0 0 71 19 46 L QRIQA GQQRKTKCASQQ
336 336 T 0 0 69 17 45 A PPTET TPNHNT CTW A
337 337 V E 0 0 0 16 23 V VLVV IPLLVV VIY V
338 338 S 0 0 0 13 45 T F E RSGSN SRN G
339 339 V 0 0 0 9 13 I V V LVV L V
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 33 0 33 33 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 1.099 100 1.27
2 2 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 5 0 0 0.000 0 1.78
3 3 0 0 0 0 0 0 0 0 60 20 10 10 0 0 0 0 0 0 0 0 10 1 0 1.089 47 1.28
4 4 0 0 0 0 0 0 0 9 18 0 18 9 0 0 0 9 9 18 0 9 11 0 0 2.020 84 1.00
5 5 0 18 0 0 9 0 55 9 0 0 9 0 0 0 0 0 0 0 0 0 11 0 0 1.295 54 1.09
6 6 11 0 0 0 0 0 11 0 0 11 0 11 33 0 0 11 0 0 11 0 9 2 2 1.831 83 0.81
7 7 15 8 54 0 0 0 0 0 0 0 0 8 0 8 0 0 0 0 8 0 13 1 0 1.411 55 1.18
8 8 0 6 0 0 6 0 0 0 17 0 17 6 0 6 17 6 6 6 6 6 18 1 0 2.341 81 0.82
9 9 0 53 0 0 32 0 5 5 5 0 0 0 0 0 0 0 0 0 0 0 19 0 3 1.167 40 1.38
10 10 5 0 0 0 0 0 0 10 48 0 10 5 0 5 5 5 0 5 5 0 21 0 2 1.816 61 1.07
11 11 10 0 0 0 0 5 0 5 14 5 5 0 0 0 0 0 10 5 10 33 21 0 1 2.041 68 0.97
12 12 14 29 24 0 0 0 5 0 10 5 5 5 0 0 0 0 0 0 0 5 21 0 0 1.926 64 1.02
13 13 5 5 0 0 0 0 0 5 10 5 14 5 0 0 14 0 0 24 0 14 21 0 0 2.124 71 0.92
14 14 5 0 5 5 0 0 0 5 15 0 0 15 0 0 5 15 15 10 5 0 20 1 2 2.267 76 0.90
15 15 5 5 0 0 0 0 0 15 45 5 5 0 0 0 0 5 5 10 0 0 20 1 0 1.773 59 1.09
16 16 5 0 0 0 10 5 5 0 5 10 5 10 0 15 0 5 10 0 0 15 20 1 2 2.389 80 0.79
17 17 21 5 37 0 5 0 5 5 0 0 5 11 0 0 0 5 0 0 0 0 19 2 1 1.863 63 1.07
18 18 0 5 0 0 0 0 5 10 0 0 5 15 0 0 10 10 0 15 15 10 20 1 0 2.224 74 0.89
19 19 16 16 53 0 0 5 0 0 5 0 0 0 0 0 5 0 0 0 0 0 19 2 2 1.386 47 1.25
20 20 5 0 0 0 0 0 0 0 15 10 0 0 0 0 5 5 40 20 0 0 20 1 0 1.653 55 1.09
21 21 0 0 0 0 0 0 0 5 5 5 5 10 0 0 0 10 5 19 19 19 21 0 0 2.120 71 1.07
22 22 11 0 0 0 5 0 5 11 11 5 16 5 0 0 5 0 5 0 11 11 19 2 0 2.406 82 0.82
23 23 6 22 22 6 0 0 0 6 11 11 0 6 0 0 6 0 0 6 0 0 18 3 0 2.120 73 0.91
24 24 5 5 0 0 5 0 0 0 0 0 0 11 5 16 0 21 11 5 5 11 19 2 3 2.260 77 0.83
25 25 15 40 5 5 10 0 10 0 0 0 0 0 0 0 5 0 0 5 5 0 20 1 0 1.861 62 1.10
26 26 0 0 5 5 0 0 0 5 10 0 5 33 5 0 0 14 10 5 0 5 21 0 3 2.107 70 0.94
27 27 0 0 0 0 5 0 0 5 19 24 0 0 0 0 10 5 10 10 0 14 21 0 3 2.042 68 0.94
28 28 40 10 0 0 5 0 10 0 10 0 0 0 0 0 0 0 10 10 5 0 20 1 1 1.817 61 0.92
29 29 0 48 5 0 10 0 10 10 0 0 0 0 0 0 5 5 0 0 5 5 21 0 1 1.750 58 0.97
30 30 5 5 0 0 0 5 0 10 5 0 5 40 0 0 5 10 0 10 0 0 20 1 1 1.956 65 0.94
31 31 5 0 0 0 0 5 0 5 10 0 40 0 5 0 5 5 0 10 10 0 20 1 2 1.956 65 0.99
32 32 10 5 0 0 0 0 10 5 10 0 10 10 0 0 5 10 10 5 0 10 20 1 1 2.441 81 0.79
33 33 5 19 24 10 10 5 0 10 0 0 5 5 5 0 5 0 0 0 0 0 21 0 2 2.199 73 0.94
34 34 10 50 10 0 5 0 0 10 0 0 5 0 0 0 0 0 0 5 0 5 20 1 1 1.636 55 1.08
35 35 0 0 0 0 0 0 0 5 5 0 10 5 0 0 0 10 0 5 45 15 20 1 3 1.704 57 1.16
36 36 5 5 0 0 0 0 0 0 15 0 10 0 5 5 5 0 35 10 5 0 20 1 3 2.011 67 0.93
37 37 11 6 50 0 6 0 11 0 11 0 6 0 0 0 0 0 0 0 0 0 18 3 0 1.561 54 1.15
38 38 5 5 5 5 0 0 0 0 42 5 5 0 0 0 5 5 0 5 5 5 19 2 1 2.069 70 0.95
39 39 0 0 0 0 0 0 0 45 15 5 5 10 0 5 0 10 0 0 5 0 20 1 1 1.704 57 1.11
40 40 0 11 0 0 0 0 5 5 11 5 5 0 0 5 32 11 5 0 0 5 19 2 4 2.160 73 0.84
41 41 0 5 0 0 0 0 0 5 16 5 5 11 0 5 5 5 0 0 21 16 19 2 1 2.233 76 0.93
42 42 24 29 29 0 0 0 0 0 6 0 6 6 0 0 0 0 0 0 0 0 17 4 3 1.560 55 1.22
43 43 5 11 5 0 37 5 5 11 5 0 0 0 0 0 11 0 0 0 0 5 19 2 1 2.009 68 0.91
44 44 24 24 0 0 24 0 6 6 0 6 0 6 0 6 0 0 0 0 0 0 17 4 1 1.855 65 1.01
45 45 11 6 0 0 0 0 0 11 11 0 6 0 0 11 0 33 0 0 6 6 18 3 1 1.985 69 0.88
46 46 6 0 0 13 0 0 6 13 6 0 6 13 31 0 6 0 0 0 0 0 16 5 0 2.010 72 0.88
47 47 0 0 0 0 0 0 0 0 11 11 6 0 0 0 6 6 6 39 11 6 18 3 3 1.903 66 1.07
48 48 0 37 11 16 5 0 0 5 0 11 5 0 0 0 0 0 5 0 0 5 19 2 0 1.908 65 1.00
49 49 0 20 5 5 35 5 15 0 0 0 5 0 0 5 5 0 0 0 0 0 20 1 1 1.873 63 1.20
50 50 5 14 0 5 0 0 0 5 10 0 10 0 0 14 14 5 14 0 0 5 21 0 3 2.285 76 0.83
51 51 0 5 0 0 0 0 0 5 5 10 0 0 0 0 19 33 5 5 10 5 21 0 0 2.000 67 1.00
52 52 10 10 0 0 0 0 0 0 14 5 10 29 0 0 5 0 5 10 5 0 21 0 1 2.112 70 0.90
53 53 0 5 0 5 0 0 0 25 5 5 5 5 0 5 10 5 0 10 5 10 20 1 2 2.385 80 0.90
54 54 0 5 0 0 0 5 0 5 0 0 37 5 0 0 11 0 0 0 11 21 19 2 1 1.790 61 0.97
55 55 5 42 5 0 21 0 5 5 0 16 0 0 0 0 0 0 0 0 0 0 19 2 1 1.604 54 1.11
56 56 6 11 0 0 0 0 0 0 0 0 6 11 0 0 6 44 0 0 6 11 18 3 0 1.735 60 0.98
57 57 11 17 50 6 0 0 0 0 0 0 0 6 0 6 0 0 0 6 0 0 18 3 2 1.532 53 1.22
58 58 6 0 0 0 0 0 0 11 6 11 0 0 0 6 39 11 6 6 0 0 18 3 3 1.903 66 0.98
59 59 0 0 0 0 5 0 0 42 11 0 5 0 0 0 0 0 0 21 5 11 19 2 2 1.631 55 1.15
60 60 0 5 0 0 0 0 0 5 58 11 5 5 0 0 0 0 0 0 5 5 19 2 2 1.483 50 1.20
61 61 10 45 10 0 0 0 10 0 10 0 0 5 0 0 5 0 5 0 0 0 20 1 0 1.730 58 1.03
62 62 0 5 5 5 0 5 0 5 5 0 5 0 0 0 10 10 0 5 30 10 20 1 1 2.250 75 0.89
63 63 0 10 0 5 0 0 5 0 43 5 14 0 0 0 0 0 14 0 5 0 21 0 1 1.723 58 0.98
64 64 20 15 15 10 5 5 5 0 10 0 5 0 0 0 0 0 0 5 0 5 20 1 1 2.250 75 0.94
65 65 0 5 0 0 0 0 0 5 14 5 0 0 0 10 24 10 10 5 14 0 21 0 0 2.149 72 0.93
66 66 0 0 5 0 0 0 0 35 0 10 0 0 0 0 10 15 5 10 5 5 20 1 2 1.942 65 0.97
67 67 16 47 0 5 11 0 5 0 5 0 0 0 0 0 5 5 0 0 0 0 19 2 2 1.657 56 1.17
68 68 24 12 41 0 6 0 0 0 6 0 6 6 0 0 0 0 0 0 0 0 17 4 2 1.624 57 1.22
69 69 6 0 0 0 0 0 0 6 22 22 6 0 0 11 0 0 17 6 0 6 18 3 2 2.014 70 1.00
70 70 6 6 0 12 0 0 0 6 6 0 0 6 0 0 12 6 12 6 6 18 17 4 0 2.395 85 0.86
71 71 5 10 5 0 10 0 5 0 0 0 15 35 0 10 0 5 0 0 0 0 20 1 2 1.942 65 0.87
72 72 10 0 0 0 5 0 0 19 14 24 5 5 0 0 5 5 0 10 0 0 21 0 0 2.108 70 0.93
73 73 0 0 0 5 0 0 0 0 10 5 5 10 0 0 0 0 10 38 0 19 21 0 4 1.790 60 1.12
74 74 0 0 0 0 0 5 0 15 10 5 5 0 0 0 0 10 10 20 5 15 20 1 2 2.181 73 0.99
75 75 0 10 0 0 0 0 0 10 20 5 5 5 0 0 5 25 5 0 10 0 20 1 0 2.108 70 0.89
76 76 10 5 0 0 0 0 0 0 10 35 5 0 0 0 10 5 0 5 5 10 20 1 0 2.037 68 0.93
77 77 0 0 0 0 0 0 0 10 5 0 15 5 0 5 5 35 0 15 0 5 20 1 2 1.916 64 1.01
78 78 5 5 10 0 0 0 5 10 35 5 10 0 0 0 0 5 5 0 5 0 20 1 1 2.107 70 0.92
79 79 35 30 15 5 0 0 0 5 0 5 0 0 0 0 0 0 0 5 0 0 20 1 1 1.612 54 1.22
80 80 55 10 5 0 0 0 0 5 5 0 0 0 0 0 5 5 5 0 0 5 20 1 2 1.608 54 1.11
81 81 5 0 0 5 0 0 0 5 0 0 5 45 0 5 5 5 10 10 0 0 20 1 0 1.868 62 0.99
82 82 5 11 0 5 0 0 5 0 5 0 0 16 5 11 11 11 0 5 0 11 19 2 1 2.406 82 0.79
83 83 10 5 0 0 5 0 0 15 5 5 30 10 0 0 5 5 0 5 0 0 20 1 2 2.155 72 0.90
84 84 5 0 10 0 0 0 0 14 10 5 24 14 5 0 5 0 5 0 5 0 21 0 1 2.215 74 0.94
85 85 0 0 0 0 0 5 0 62 10 0 0 5 0 0 5 10 0 0 0 5 21 0 3 1.325 44 1.16
86 86 0 0 0 0 0 0 0 10 10 5 10 0 0 5 5 5 10 5 35 0 20 1 1 2.037 68 1.02
87 87 0 11 0 0 5 0 0 11 0 5 11 16 0 32 5 5 0 0 0 0 19 2 0 1.986 67 0.83
88 88 0 5 0 0 5 0 0 65 5 15 0 0 0 0 0 0 0 0 0 5 20 1 0 1.164 39 1.21
89 89 26 0 5 0 0 5 0 11 11 5 0 0 0 0 0 0 16 11 5 5 19 2 0 2.129 72 0.86
90 90 0 5 5 0 0 0 5 10 30 5 5 0 0 0 5 10 0 10 0 10 20 1 1 2.181 73 0.89
91 91 15 35 10 0 10 0 0 5 10 0 5 5 0 0 0 5 0 0 0 0 20 1 1 1.942 65 1.02
92 92 11 0 11 0 0 0 0 6 11 0 11 28 0 6 0 0 0 11 6 0 18 3 2 2.058 71 0.93
93 93 0 6 6 0 11 6 39 0 6 0 0 0 0 6 0 6 0 0 11 6 18 3 3 1.980 68 0.96
94 94 6 0 0 0 0 0 0 0 69 0 6 0 0 0 0 0 0 19 0 0 16 5 3 0.918 33 1.32
95 95 0 5 0 5 5 0 5 0 55 0 10 5 0 0 0 5 0 0 0 5 20 1 1 1.608 54 1.04
96 96 10 0 0 0 5 0 0 5 10 5 0 5 0 5 15 25 0 0 10 5 20 1 1 2.221 74 0.86
97 97 5 70 5 0 5 0 0 0 0 0 0 5 5 0 0 0 0 5 0 0 20 1 0 1.148 38 1.31
98 98 6 6 0 0 0 0 6 6 12 12 0 6 0 0 0 6 0 18 12 12 17 3 1 2.313 82 0.86
99 99 0 0 0 0 0 0 0 56 11 6 6 6 0 6 0 0 6 0 0 6 18 3 0 1.534 53 1.22
100 100 22 17 11 0 0 0 11 11 6 0 0 6 6 6 0 0 6 0 0 0 18 3 1 2.168 75 0.88
101 101 0 0 6 0 0 0 0 6 0 33 6 6 0 6 0 11 6 0 0 22 18 3 0 1.908 66 0.97
102 102 5 0 0 0 0 5 0 11 53 0 11 0 5 0 0 5 0 5 0 0 19 2 0 1.587 54 1.09
103 103 5 5 0 0 16 5 32 0 5 5 0 5 0 5 11 0 5 0 0 0 19 2 1 2.132 72 0.89
104 104 10 5 55 0 5 0 0 5 5 0 0 5 0 0 0 5 0 0 5 0 20 1 0 1.608 54 1.15
105 105 20 15 15 0 0 0 5 5 10 0 5 0 0 5 0 5 0 5 0 10 20 1 4 2.250 75 0.87
106 106 32 26 11 5 0 0 0 11 0 5 0 5 0 0 5 0 0 0 0 0 19 2 0 1.809 61 1.08
107 107 0 0 0 5 0 0 0 11 5 21 11 0 0 5 0 0 11 21 0 11 19 2 2 2.069 70 1.00
108 108 0 16 0 5 0 0 0 11 0 0 11 5 0 0 11 0 21 16 5 0 19 2 2 2.087 71 0.86
109 109 0 0 5 0 0 0 5 0 5 5 5 40 0 0 0 10 15 10 0 0 20 1 0 1.861 62 0.95
110 110 5 0 0 5 0 0 0 0 53 0 5 5 0 5 5 0 5 0 0 11 19 2 0 1.660 56 1.06
111 111 5 0 11 0 0 0 0 5 21 11 11 0 0 5 11 5 5 5 0 5 19 2 0 2.361 80 0.87
112 112 5 5 0 0 0 0 5 0 0 0 16 0 0 0 0 11 5 0 37 16 19 2 2 1.808 61 0.98
113 113 7 13 13 7 0 0 0 0 0 0 7 13 13 7 7 0 0 0 0 13 15 6 0 2.246 83 0.79
114 114 6 11 0 0 0 0 0 11 0 0 0 11 0 6 6 28 6 17 0 0 18 3 0 2.029 70 0.88
115 115 11 5 0 0 0 0 0 0 11 5 5 0 0 5 5 26 11 5 5 5 19 2 0 2.302 78 0.89
116 116 5 47 0 5 16 0 0 0 5 0 11 0 0 0 0 0 0 5 0 5 19 2 2 1.657 56 1.09
117 117 0 5 0 0 0 0 0 20 40 0 0 0 0 5 10 5 0 10 5 0 20 1 1 1.748 58 1.01
118 118 16 11 47 5 0 0 0 0 0 0 0 0 0 0 5 0 5 5 0 5 19 2 0 1.657 56 1.15
119 119 0 16 0 5 0 0 0 5 5 5 5 0 0 5 16 0 21 11 5 0 19 2 1 2.233 76 0.86
120 120 6 11 11 0 0 0 0 11 28 0 0 6 0 0 0 6 0 17 0 6 18 3 2 2.029 70 0.93
121 121 0 15 5 5 15 0 25 0 10 0 5 5 0 0 5 0 0 0 5 5 20 1 0 2.194 73 0.88
122 122 0 0 0 0 0 0 0 62 0 5 5 0 0 0 10 5 0 0 0 14 21 0 0 1.234 41 1.24
123 123 0 0 0 5 0 0 0 14 38 5 10 5 0 0 0 5 5 0 10 5 21 0 1 1.963 66 1.06
124 124 0 0 0 5 0 0 5 10 5 0 30 20 0 5 0 5 0 10 5 0 20 1 1 2.042 68 0.94
125 125 25 10 10 0 5 0 0 0 5 0 10 15 0 5 5 5 0 0 5 0 20 1 1 2.221 74 0.88
126 126 45 10 20 0 0 0 5 0 5 0 5 5 0 0 0 0 0 5 0 0 20 1 0 1.660 55 1.17
127 127 0 6 11 0 28 6 17 0 0 0 11 6 0 6 6 0 0 6 0 0 18 3 2 2.106 73 0.94
128 128 15 0 8 8 0 0 15 8 8 8 0 0 15 8 0 8 0 0 0 0 13 8 0 2.245 88 0.79
129 129 0 6 0 0 0 0 6 11 0 6 0 6 0 0 0 6 6 11 11 33 18 3 0 2.062 71 1.00
130 130 0 0 5 0 0 0 0 16 21 37 0 0 0 0 0 5 0 11 0 5 19 2 4 1.689 57 1.07
131 131 5 0 0 11 0 5 0 11 16 0 21 5 0 0 5 0 5 5 11 0 19 2 2 2.260 77 0.86
132 132 0 5 5 0 0 0 0 5 10 0 10 5 0 0 0 0 5 5 10 40 20 1 0 1.956 65 1.05
133 133 0 0 0 0 0 5 0 20 15 15 5 5 0 0 0 10 0 25 0 0 20 1 1 1.917 64 0.96
134 134 10 10 5 0 10 0 0 0 14 0 19 10 0 0 0 0 0 0 19 5 21 0 2 2.095 70 0.85
135 135 5 0 0 5 0 0 0 10 5 5 10 5 0 5 29 14 5 5 0 0 21 0 3 2.244 75 0.92
136 136 5 10 0 0 0 0 0 5 10 5 5 5 0 0 0 5 5 33 14 0 21 0 1 2.107 70 0.95
137 137 0 5 5 0 0 0 5 5 5 10 0 5 0 5 5 20 10 10 0 10 20 1 0 2.441 81 0.85
138 138 45 10 10 0 0 0 0 0 15 5 0 0 0 5 0 0 5 5 0 0 20 1 0 1.704 57 1.08
139 139 0 0 6 0 0 0 0 6 18 6 12 35 0 0 6 6 0 6 0 0 17 4 1 1.925 68 1.00
140 140 5 0 0 0 0 11 0 0 11 0 0 0 0 0 0 11 42 11 5 5 19 2 1 1.777 60 0.97
141 141 6 0 0 0 0 6 6 6 0 0 0 6 0 18 18 12 12 6 0 6 17 4 0 2.282 81 0.86
142 142 17 33 17 6 6 0 17 0 0 0 0 0 0 0 0 0 0 6 0 0 18 3 2 1.744 60 1.18
143 143 11 16 26 26 5 0 5 5 0 0 0 5 0 0 0 0 0 0 0 0 19 2 2 1.851 63 1.17
144 144 6 17 0 0 0 0 0 6 11 0 22 6 0 0 6 6 17 6 0 0 18 3 0 2.139 74 0.84
145 145 5 0 0 0 0 0 0 10 20 10 5 5 0 0 0 0 10 25 5 5 20 1 2 2.108 70 1.03
146 146 20 0 0 0 0 0 0 10 5 5 5 20 0 0 5 15 10 0 0 5 20 1 2 2.138 71 0.90
147 147 10 0 0 0 0 0 5 5 15 5 0 0 0 5 0 10 10 20 0 15 20 1 1 2.181 73 0.93
148 148 5 0 0 0 0 0 0 57 10 0 5 0 0 0 0 14 0 5 0 5 21 0 4 1.402 47 1.20
149 149 10 14 38 0 5 5 0 0 0 5 0 10 5 5 0 0 0 0 5 0 21 0 3 1.963 66 1.00
150 150 0 19 19 5 10 5 0 0 5 5 5 5 0 10 0 10 0 5 0 0 21 0 0 2.318 77 0.85
151 151 43 5 10 0 0 0 0 5 0 10 0 0 5 0 5 10 0 5 5 0 21 0 3 1.905 64 0.97
152 152 0 10 0 5 0 0 10 5 0 10 10 5 0 14 0 5 0 0 19 10 21 0 1 2.293 77 0.81
153 153 0 5 0 0 0 5 0 5 14 33 5 0 0 0 5 10 0 14 0 5 21 0 0 2.016 67 0.92
154 154 0 5 5 0 5 0 0 0 10 0 14 5 0 0 5 5 14 5 19 10 21 0 3 2.335 78 0.87
155 155 5 5 0 0 0 0 0 5 5 10 5 0 0 0 10 10 19 10 10 10 21 0 1 2.384 80 0.92
156 156 0 0 0 0 5 0 0 0 5 5 10 19 0 5 5 10 5 29 0 5 21 0 1 2.137 71 0.93
157 157 5 5 0 0 10 0 0 0 14 43 0 10 0 5 5 0 0 0 0 5 21 0 2 1.814 61 0.94
158 158 0 5 0 5 0 0 0 10 52 0 0 0 0 0 5 5 5 14 0 0 21 0 1 1.566 52 1.10
159 159 29 14 10 0 0 0 0 5 10 0 0 0 0 0 19 5 0 0 10 0 21 0 4 1.914 64 0.89
160 160 10 10 62 14 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 21 0 1 1.168 39 1.41
161 161 5 5 10 5 0 0 5 0 33 5 14 0 0 0 0 0 5 5 0 10 21 0 1 2.107 70 0.91
162 162 0 0 5 0 0 0 0 60 5 0 0 0 5 0 0 10 0 10 0 5 20 1 0 1.366 46 1.19
163 163 0 5 0 5 10 0 0 5 10 0 5 5 0 0 5 0 25 5 10 10 20 1 3 2.316 77 0.83
164 164 0 5 0 0 0 0 0 47 0 5 0 5 0 0 5 21 0 5 0 5 19 2 1 1.612 55 1.02
165 165 0 5 5 0 0 5 5 0 19 0 10 38 0 5 0 5 0 5 0 0 21 0 1 1.922 64 0.92
166 166 11 16 47 5 5 0 0 0 0 0 0 5 0 0 0 11 0 0 0 0 19 2 2 1.584 54 1.22
167 167 5 0 0 0 0 0 5 10 48 0 10 0 0 0 0 10 5 5 5 0 21 0 0 1.750 58 1.06
168 168 15 25 25 0 15 0 0 0 0 0 0 0 0 0 0 5 5 0 5 5 20 1 1 1.861 62 1.07
169 169 0 0 0 5 0 0 0 0 19 5 0 5 0 0 0 10 5 43 10 0 21 0 1 1.707 57 1.07
170 170 15 15 5 10 5 0 5 10 0 0 5 5 0 0 10 10 0 0 5 0 20 1 4 2.389 80 0.83
171 171 5 45 15 5 5 0 5 10 0 5 5 0 0 0 0 0 0 0 0 0 20 1 0 1.773 59 1.09
172 172 0 0 0 0 0 0 0 5 5 0 11 0 0 5 0 11 11 21 21 11 19 2 3 2.069 70 1.07
173 173 0 6 0 0 0 6 0 0 0 0 11 11 0 6 6 0 28 11 11 6 18 3 0 2.135 74 0.91
174 174 26 16 5 5 0 0 0 5 0 0 5 11 0 5 5 5 0 5 0 5 19 2 0 2.274 77 0.88
175 175 0 0 0 0 0 0 6 22 17 33 17 0 0 0 6 0 0 0 0 0 18 3 0 1.619 56 1.05
176 176 6 39 17 0 6 0 0 11 0 6 0 6 0 6 0 0 0 0 6 0 18 3 2 1.874 65 0.97
177 177 30 25 10 0 0 0 5 5 10 10 0 5 0 0 0 0 0 0 0 0 20 1 1 1.848 62 1.04
178 178 5 0 5 0 0 0 0 0 0 0 0 0 0 5 5 0 5 10 25 40 20 1 1 1.692 56 1.15
179 179 0 5 0 5 0 0 11 5 32 0 11 5 0 0 5 5 0 0 5 11 19 2 2 2.160 73 0.86
180 180 11 37 0 16 5 0 0 0 5 0 5 5 0 5 11 0 0 0 0 0 19 2 1 1.908 65 1.03
181 181 45 15 10 0 10 0 5 0 10 0 0 5 0 0 0 0 0 0 0 0 20 1 0 1.634 55 1.18
182 182 42 16 0 0 11 0 0 0 11 0 5 5 0 0 5 0 0 0 5 0 19 2 1 1.749 59 1.00
183 183 0 0 0 0 5 0 0 5 21 32 5 5 5 5 0 11 0 0 0 5 19 2 1 2.014 68 0.94
184 184 33 6 17 6 0 0 0 6 0 0 17 6 0 0 0 0 0 6 6 0 18 3 1 1.927 67 0.99
185 185 0 0 0 0 0 0 5 47 5 11 0 0 0 0 5 5 0 5 0 16 19 2 2 1.657 56 1.07
186 186 0 0 0 0 0 0 5 50 0 0 5 15 0 0 5 5 0 0 10 5 20 1 0 1.610 54 1.08
187 187 0 0 5 0 0 0 0 40 5 0 15 0 0 5 5 0 0 15 5 5 20 1 1 1.834 61 1.07
188 188 0 0 5 0 0 0 0 57 0 0 5 19 0 0 0 5 0 10 0 0 21 0 1 1.295 43 1.21
189 189 5 29 10 19 14 0 0 0 5 0 5 10 0 5 0 0 0 0 0 0 21 0 1 1.980 66 1.10
190 190 24 14 5 5 5 0 5 5 10 10 0 10 0 5 0 0 0 0 5 0 21 0 2 2.306 77 0.89
191 191 14 0 0 0 0 0 0 0 38 5 10 5 0 0 5 5 0 5 5 10 21 0 1 1.963 66 0.99
192 192 5 0 0 5 0 0 0 40 5 5 0 0 0 0 10 0 0 10 0 20 20 1 2 1.748 58 1.06
193 193 25 0 35 0 5 5 5 0 5 0 0 10 0 0 0 5 0 0 5 0 20 1 2 1.843 62 1.02
194 194 0 0 6 0 0 0 0 6 72 6 0 0 0 0 0 6 0 0 0 6 18 3 3 1.038 36 1.34
195 195 22 17 28 0 11 0 0 0 0 0 6 6 0 0 0 6 0 6 0 0 18 3 1 1.875 65 1.08
196 196 17 11 0 0 0 0 0 6 6 0 11 39 0 0 0 6 0 6 0 0 18 3 2 1.796 62 0.98
197 197 17 17 33 0 6 0 0 0 6 0 11 6 0 0 0 0 0 6 0 0 18 3 0 1.850 64 1.07
198 198 0 0 5 0 0 0 0 0 5 0 5 0 5 0 11 32 0 5 26 5 19 2 2 1.882 64 0.99
199 199 0 0 6 0 6 0 0 6 50 6 0 6 0 6 6 0 6 0 0 6 18 3 2 1.792 62 1.01
200 200 6 53 18 0 6 0 0 0 0 0 0 6 0 0 0 0 6 6 0 0 17 4 1 1.476 52 1.22
201 201 0 0 6 6 6 0 0 12 6 0 0 0 0 12 12 12 0 12 6 12 17 4 1 2.344 83 0.83
202 202 0 6 0 0 0 0 0 6 6 35 0 6 0 6 0 0 12 12 0 12 17 4 2 1.956 69 1.00
203 203 5 0 0 0 0 0 0 5 15 5 25 5 0 5 5 0 5 0 15 10 20 1 1 2.194 73 0.96
204 204 33 0 24 0 0 0 0 10 5 5 5 0 5 5 0 5 0 5 0 0 21 0 3 1.947 65 1.00
205 205 0 5 0 0 0 0 0 14 5 0 10 0 0 0 5 48 0 5 0 10 21 0 2 1.659 55 1.05
206 206 40 15 20 5 0 0 0 5 0 0 0 0 0 0 0 0 5 5 5 0 20 1 1 1.722 57 1.15
207 207 0 5 25 0 10 0 45 0 0 0 0 0 0 5 5 5 0 0 0 0 20 1 2 1.535 51 1.09
208 208 0 0 0 0 0 0 0 15 35 10 5 10 0 0 5 0 0 10 10 0 20 1 1 1.873 63 1.07
209 209 11 0 5 0 0 0 5 5 32 5 0 21 0 0 0 0 5 0 5 5 19 2 1 2.014 68 0.96
210 210 0 5 0 0 0 0 0 5 16 0 0 5 0 0 0 0 11 32 5 21 19 2 1 1.840 63 1.12
211 211 5 0 0 0 0 0 5 10 10 43 0 5 5 0 0 5 5 5 0 5 21 0 1 1.971 66 0.99
212 212 0 14 0 0 0 0 5 10 10 0 33 5 0 0 0 0 5 5 10 5 21 0 1 2.041 68 0.90
213 213 0 10 0 0 0 0 0 10 10 0 5 10 0 5 0 5 0 5 24 19 21 0 3 2.133 71 0.96
214 214 0 15 0 10 0 5 0 5 30 5 5 5 0 0 0 0 5 10 5 0 20 1 2 2.155 72 0.87
215 215 0 0 0 5 0 5 0 5 0 16 5 11 0 0 0 5 0 0 16 32 19 2 1 1.959 67 0.92
216 216 0 0 0 0 0 5 0 5 10 10 5 0 0 5 0 5 10 0 5 40 20 1 1 1.956 65 1.01
217 217 11 6 0 0 6 0 0 6 6 11 0 6 33 0 6 0 0 6 6 0 18 3 1 2.139 74 0.82
218 218 15 10 0 0 10 5 35 0 0 0 0 0 0 5 5 10 0 0 0 5 20 1 2 1.942 65 0.93
219 219 0 5 0 0 0 0 0 10 19 5 0 5 0 5 5 10 33 5 0 0 21 0 2 2.000 67 0.99
220 220 5 5 5 0 0 0 5 10 19 0 19 10 0 0 10 0 5 0 10 0 21 0 1 2.252 75 0.85
221 221 5 11 0 0 0 0 0 0 11 11 16 5 0 0 5 26 5 0 0 5 19 2 1 2.129 72 0.88
222 222 0 68 11 0 0 0 5 0 0 0 11 0 0 0 0 0 5 0 0 0 19 2 2 1.044 35 1.25
223 223 0 0 0 0 0 5 0 0 15 5 10 5 0 5 15 25 0 0 10 5 20 1 5 2.125 71 0.94
224 224 5 11 0 5 5 0 0 47 16 0 0 0 0 0 0 5 0 0 5 0 19 2 0 1.657 56 1.00
225 225 0 10 0 10 0 5 0 15 5 0 5 5 0 0 5 5 0 25 10 0 20 1 3 2.221 74 0.84
226 226 0 35 15 15 5 0 0 5 5 0 0 5 0 5 0 5 0 0 0 5 20 1 1 1.985 66 1.03
227 227 5 11 0 0 0 0 0 5 11 5 11 32 0 0 0 5 0 5 5 5 19 2 0 2.160 73 0.93
228 228 5 0 0 0 0 0 0 11 21 26 5 0 0 0 11 5 0 5 0 11 19 2 1 2.010 68 0.99
229 229 5 0 0 5 0 0 0 0 5 0 11 11 0 21 5 5 0 5 21 5 19 2 1 2.215 75 0.90
230 230 11 37 5 11 11 0 0 5 5 0 0 5 0 0 0 0 0 5 0 5 19 2 4 2.009 68 1.05
231 231 5 0 5 0 5 0 5 10 0 5 0 0 0 20 10 5 10 0 10 10 20 1 0 2.372 79 0.83
232 232 5 5 5 0 0 0 0 16 11 5 0 11 0 0 11 5 0 11 0 16 19 2 1 2.306 78 0.89
233 233 5 5 0 5 0 0 0 5 16 26 5 0 0 0 11 0 5 0 0 16 19 2 0 2.101 71 0.91
234 234 5 0 0 0 0 0 0 5 20 5 0 5 0 5 5 5 15 15 5 10 20 1 3 2.320 77 0.97
235 235 0 11 0 11 0 0 0 0 5 0 11 21 0 5 5 0 11 5 0 16 19 2 0 2.187 74 0.85
236 236 5 0 63 0 11 0 0 5 11 0 0 0 0 0 0 0 0 5 0 0 19 2 1 1.229 42 1.22
237 237 5 0 0 0 0 0 5 10 40 5 5 0 0 0 0 5 0 20 5 0 20 1 0 1.817 61 1.03
238 238 6 0 0 0 6 0 0 0 0 0 0 6 0 0 0 6 6 28 17 28 18 3 0 1.813 63 1.07
239 239 0 0 0 0 0 0 0 50 5 10 5 0 0 15 5 5 0 0 0 5 20 1 2 1.610 54 1.07
240 240 47 16 0 11 5 5 11 0 0 0 0 0 0 5 0 0 0 0 0 0 19 2 3 1.584 54 1.14
241 241 0 0 0 0 0 5 5 5 5 0 10 0 5 0 10 24 14 10 10 0 21 0 1 2.240 75 0.88
242 242 0 5 5 5 0 0 5 5 10 5 29 5 0 0 0 1