Insertions in MSF for PredictProtein

Burkhard Rost


Differences in PHD predictions between suitable and unsuitable alignments

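Do NOT include insertions (gaps) in the guide sequence when you supply your own alignment as input for the predictions (input option 'MSF format'). In the current implementation, PHD treats such insertions as if the corresponding positions were occupied by solvent, which can cause problems for the prediction. The two examples below show alignments of the same protein family: first an unsuitable one, with insertions in the guide sequence, and then a suitable one, without insertions in the guide sequence. Each alignment is followed by the resulting PHDhtm output.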
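
One way to check an alignment before submitting it is sketched below in Python. This is only an illustration and not part of PredictProtein: the function names (read_msf, guide_gap_count, strip_guide_gaps) are hypothetical, the guide sequence is assumed to be the first sequence listed in the MSF file, and '.', '-' and '~' are assumed to be the gap symbols. The sketch reads the aligned block after the '//' line, counts the gap positions in the guide sequence, and removes every alignment column in which the guide sequence has a gap. Writing the result back as a valid MSF file (with checksums) is left out.

```python
GAP_CHARS = ".-~"  # assumed gap symbols


def read_msf(text):
    """Collect the aligned sequences that follow the '//' line of an MSF file."""
    seqs = {}
    in_body = False
    for line in text.splitlines():
        if not in_body:
            if line.strip() == "//":
                in_body = True
            continue
        parts = line.split()
        # Skip blank lines and position rulers such as "1      50".
        if not parts or all(p.isdigit() for p in parts):
            continue
        name, chunks = parts[0], parts[1:]
        seqs[name] = seqs.get(name, "") + "".join(chunks)
    return seqs  # dicts keep insertion order, so the first key is the first sequence


def guide_gap_count(seqs, guide=None):
    """Number of gap positions in the guide (by assumption: the first) sequence."""
    guide = guide or next(iter(seqs))
    return sum(c in GAP_CHARS for c in seqs[guide])


def strip_guide_gaps(seqs, guide=None):
    """Drop every alignment column in which the guide sequence has a gap."""
    guide = guide or next(iter(seqs))
    length = len(seqs[guide])
    if any(len(s) != length for s in seqs.values()):
        raise ValueError("aligned sequences differ in length")
    keep = [i for i, c in enumerate(seqs[guide]) if c not in GAP_CHARS]
    return {name: "".join(s[i] for i in keep) for name, s in seqs.items()}
```

If guide_gap_count returns a value above zero, the alignment has insertions in the guide sequence. Note that strip_guide_gaps discards the residues of the other sequences in the removed columns, so it is worth inspecting the result before using it.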



 Symbol comparison table: GenRunData:pileuppep.cmp  CompCheck: 1254

                   GapWeight: 3.000
             GapLengthWeight: 0.100

 hemz.msf  MSF: 532  Type: P  May 14, 1996 16:18  Check: 2334 ..

 Name: hemz_ecoli       Len:   532  Check: 6362  Weight:  1.00
 Name: hemz_yeren       Len:   532  Check:  626  Weight:  1.00
 Name: hemz_haein       Len:   532  Check: 7557  Weight:  1.00
 Name: hemz_braja       Len:   532  Check:  611  Weight:  1.00
 Name: hemz_cucsa       Len:   532  Check: 1470  Weight:  1.00
 Name: hemz_horvu       Len:   532  Check: 3440  Weight:  1.00
 Name: hemz_arath       Len:   532  Check: 8747  Weight:  1.00
 Name: hemz_bovin       Len:   532  Check: 7969  Weight:  1.00
 Name: hemz_human       Len:   532  Check: 4175  Weight:  1.00
 Name: hemz_mouse       Len:   532  Check: 9621  Weight:  1.00
 Name: hemz_yeast       Len:   532  Check: 9999  Weight:  1.00
 Name: hemz_bacsu       Len:   532  Check: 1757  Weight:  1.00

//

            1                                                   50
hemz_ecoli  .......... .......... .......... .......... ..........
hemz_yeren  .......... .......... .......... .......... ..........
hemz_haein  .......... .......... .......... .......... ..........
hemz_braja  .......... .......... .......... .......... ..........
hemz_cucsa  MDAASSSLAL SNIKLHGSTN TLN..SDQRI SSLCSLPKSR VTFSCKTSGN
hemz_horvu  MECVRS.... GALDLGRSGN FLG..KSGST TS.CGKVRCS TNLAGSTKCE
hemz_arath  .........M QATALSSGFN PLTKRKDHRF PRSCS..... ..........
hemz_bovin  .......... .......... .......... .......... ..........
hemz_human  .......... .......... .......... .......... .........M
hemz_mouse  .......... .......... .......... .......... .........M
hemz_yeast  .......... .......... .......... .......... ..........
hemz_bacsu  .......... .......... .......... .......... ..........

            51                                                 100
hemz_ecoli  .......... .......... .......... .......... ..........
hemz_yeren  .......... .......... .......... .......... ..........
hemz_haein  .......... .......... .......... .......... ..........
hemz_braja  .......... .......... .......... .......... .......MST
hemz_cucsa  LQVRDRSTGL VVSCSSSNGD RDVIQGLHLS GPIEKKSRLG QACCSVGTFT
hemz_horvu  QNLHGKAKPL LLSASGK..A RGTSGLVHRS PVLKHQHHLS VRSTSTDVCT
hemz_arath  ....QRNSLS LIQCDIKERS FGESMTITNR GLSFKTNVFE QARSVTGDCS
hemz_bovin  .......... .....VLLRD RLLYGGSRAC QPRRCQSGAA TAAAATETAQ
hemz_human  RSLGANMAAA LRAAGVLLRD PLASSSWRVC QPWRWKSGAA AAAVTTETAQ
hemz_mouse  LSASANMAAA LRAAGALLRE PLVHGSSRAC QPWRCQSGAA VAA.TTEKVH
hemz_yeast  .....MLSRT IRTQGSFLRR SQL....... .......... .......TIT
hemz_bacsu  .......... .......... .......... .......... ..........

            101                                                150
hemz_ecoli  .......... ..MRQTKTGI LLANLGTPDA PTPEAVKRYL KQFLSDRRVV
hemz_yeren  .......... ..MKQSKLGV LMVNLGTPDA PTPQAVKRYL AEFLSDRRVV
hemz_haein  .......... .MTKPAKIGV LLANLGTPDS PTPKSISRYL WQFLTDPRVV
hemz_braja  AAPNETTQPT VRSGQKRVGV LLVNLGTPDT ADAPGVRVYL KEFLSDARVI
hemz_cucsa  VGEFALES.Q SQAVDDKVGV LLLNLGGPE. .TLDDVQPFL YNLFADPDII
hemz_horvu  TFDEDVKGVS SHAVEEKVGV LLLNLGGPE. .TLNDVQPFL FNLFADPDII
hemz_arath  YDETSAKARS HVVAEDKIGV LLLNLGGPE. .TLNDVQPFL YNLFADPDII
hemz_bovin  RARSPKPQAQ PGNRKPRTGI LMLNMGGPE. .TVEEVQDFL QRLFLDQDLM
hemz_human  HAQGAKPQVQ PQKRKPKTGI LMLNMGGPE. .TLGDVHDFL LRLFLDQDLM
hemz_mouse  HAKTTKPQAQ PERRKPKTGI LMLNMGGPE. .TLGEVQDFL QRLFLDRDLM
hemz_yeast  RSFSVTFNMQ NAQKRSPTGI VLMNMGGPS. .KVEETYDFL YQLFADNDLI
hemz_bacsu  .......... ..MSRKKMGL LVMAYGTPY. .KEEDIERYY THI.......

            151                                                200
hemz_ecoli  DTSRLLWWPL LRGVIFPLRS PRVAKLYASV WMEG..GSPL MVYSRQQQQA
hemz_yeren  DTSPWLWWPL LRGVILPIRS PRVAKLYQSV WMDE..GSPL LVYSRRQQKA
hemz_haein  DLPRCKWYPL LKAIILPLRS KRIAKNYQAI WTEQ..GSPL LAISRQQKDA
hemz_braja  EDQGLVWKVV LNGIILRQRP RSKALDYQKI WNNEKNESPL KTITRSQSAK
hemz_cucsa  RLPRL..FRF LQEPLAKLIS TYRAPKSKEG YASIGGGSPL RKITDEQAQA
hemz_horvu  RLPRL..FRF LQRPLAKLIS TFRAPKSNEG YASIGGGSPL RKITDEQANA
hemz_arath  RLPRP..FQF LQGTIAKFIS VVRAPKSKEG YAAIGGGSPL RKITDEQADA
hemz_bovin  TLPV...... .QDKLGPFIA KRRTPKIQEQ YRRIGGGSPI KMWTSKQGEG
hemz_human  TLPI...... .QNKLAPFIA KRRTPKIQEQ YRRIGGGSPI KIWTSKQGEG
hemz_mouse  TLPI...... .QNKLAPFIA KRRTPKIQE. .RRIGGGSPI KMWTSKQGEG
hemz_yeast  PISAK..Y.. .QKTIAKYIA KFRTPKIEKQ YREIGGGSPI RKWSEYQATE
hemz_bacsu  .......... ...RRGRKPE PEMLQDLKDR YEAIGGISPL AQITEQQAHN

            201                                                250
hemz_ecoli  LAQRLP.... EMPVA...LG MSYGSPSLES AVDELLAEHV DHIVVLPLYP
hemz_yeren  LAERMP.... EIPVE...LG MSYGSPNLPD AIDKLLAQGV TKLVVLPLYP
hemz_haein  LQAYLDNQNI DTQVE...IA MTYGNPSMQS AVKNLLKNQV ERIIVLPLYP
hemz_braja  LAAALSDRD. HVVVD...WA MRYGNPSIKS GIDALIGGMR PHLAV.PLYP
hemz_cucsa  LKMALAEKNM ST...NVYVG MRYWYPFTEE AIQQIKRDGI TRLVVLPLYP
hemz_horvu  LKVALKSKNL EA...DIYVG MRYWYPFTEE AIDQIKKDKI TKLVVLPLYP
hemz_arath  IKMSLQAKNI AA...NVYVG MRYWYPFTEE AVQQIKKDKI TRLVVLPLYP
hemz_bovin  MVKLLDELSP HTAPHKYYIG FRYVHPLTEE AIEEMERDGL ERAVAFTQYP
hemz_human  MVKLLDELSP NTAPHKYYIG FRYVHPLTEE AIEEMERDGL ERAIAFTQYP
hemz_mouse  MVKLLDELSP ATAPHKYYIG FRYVHPLTEE AIEEMERDGL ERAIAFTQYP
hemz_yeast  VCKILDKTCP ETAPHKPYVA FRYAKPLTAE TYKQMLKDGV KKAVAFSQYP
hemz_bacsu  LEQHLNEI.Q DEITFKAYIG LKHIEPFIED AVAEMHKDGI TEAVSIVLAP

            251                                                300
hemz_ecoli  QFSCSTVGAV WDELARILAR KRSIPGISF. .IRDYADNHD YINALANSVR
hemz_yeren  QYSCSTSAAV WDAVARILKG YRRLPSISF. .IRDYAEHPA YISALKQSVE
hemz_haein  QYSSSTTGAV FDAFANALKE ERGLLPFDF. .IHSYHIDEN YINALADSIK
hemz_braja  QYSASTSATV CDEVFRVLAR LRAQPTLRV. .TPPYYEDEA YIEALAVSIE
hemz_cucsa  QYSISTTGSS IRVLQKMFRE DAYLSSLPVS IIKSWYQREG YIKSMADLMQ
hemz_horvu  QYSISTSGSS IRVLQNIVKE DPYFAGLPIS IIESWYQREG YVKSMADLIE
hemz_arath  QYSISTTGSS IRVLQDLFRK DPYLAGVPVA IIKSWYQRRG YVNSMADLIE
hemz_bovin  QYSCSTTGSS LNAIYRYYNE VGRKPTMKWS TIDRWPTHPL LIQCFADHIL
hemz_human  QYSCSTTGSS LNAIYRYYNQ VGRKPTMKWS TIDRWPTHHL LIQCFADHIL
hemz_mouse  QYSCSTTGSS LNAIYRYYNE VGQKPTMKWS TIDRWPTHPL LIQCFADHIL
hemz_yeast  HFSYSTTGSS INELWRQIKA LDSERSISWS VIDRWPTNEG LIKAFSENIT
hemz_bacsu  HFSTFSVQSY NK...RAKEE AEKLGGLTIT SVESWYDEPK FVTYWVDRVK

            301                                                350
hemz_ecoli  ASFAKHG.EP D...LLLLSY HGIPQRYA.D EGDDYPQRCR TTTRELASAL
hemz_yeren  NSFVQHG.KP D...RLVLSF HGIPKRYA.Q LGDDYPQRCE DTSRALRAEI
hemz_haein  VRLKSD.... E...FLLFSY HGIPLRYE.K MGDYYREHCK QTTIAVVNKL
hemz_braja  THLATLPFKP E...LIVASF HGMPKSYV.D KGDPYQEHCI ATTEALRAAR
hemz_cucsa  AELKNFANPQ ..EVMIFFSA HGVPVSYVEN AGDPYKDQME ECICLIMQEL
hemz_horvu  KELSVFSNPE ..EVMIFFSA HGVPLTYVKD AGDPYRDQME DCIALIMEEL
hemz_arath  KELQTFSDPK ..EVMIFFSA HGVPVSYVEN AGDPYQKQME ECIDLIMEEL
hemz_bovin  KELDHFPPEK RREVVILFSA HSLPMSVV.N RGDPYPQEVG ATVQRVMDKL
hemz_human  KELDHFPLEK RSEVVILFSA HSLPMSVV.N RGDPYPQEVS ATVQKVMERL
hemz_mouse  KELNHFPEEK RSEVVILFSA HSLPMSVV.N RGDPYPQEVG ATVHKVMEKL
hemz_yeast  KKLQEFPQPV RDKVVLLFSA HSLPMDVV.N TGDAYPAEVA ATVYNIMQKL
hemz_bacsu  ETYASMPEDE RENAMLIVSA HSLPEK.IKE FGDPYPDQLH ESAKLIAEG.

            351                                                400
hemz_ecoli  GMAP..EKVM MTFQSRF.GR EPWLMPYTDE TLKML.GEKG VGHIQVMCPG
hemz_yeren  ALPA..EQIM MTYQSRF.GR EPWLTPYTDE TLKSL.PSQG VKHIQLICPG
hemz_haein  GLTE..NQWR MTFQSRF.GR EEWLQPYTDK FLESA.AAQN IQKIAVICPG
hemz_braja  RLDA..SKLL LTFQSRF.GN DEWLQPYTDK TMERL.AKEG VRRIAVVTPG
hemz_cucsa  KARGIGNEHT LAYQSRV.GP VQWLKPYTDE VLVEL.GQKG IKSLLAVPVS
hemz_horvu  KSRGTLNDHT LAYQSRV.GP VQWLKPYTDE VLVEL.GQKG VKSLLAVPVS
hemz_arath  KARGVLNDHK LAYQSRV.GP VQWLKPYTDE VLVDL.GKSG VKSLLAVPVS
hemz_bovin  ...GYSNPYR LVWQSKV.GP MPWLGPQTDE AIKGL.CKRG RKNILLVPIA
hemz_human  ...EYCNPYR LVWQSKV.GP MPWLGPQTDE SIKGL.CERG RKNILLVPIA
hemz_mouse  ...GYPNPYR LVWQSKV.GP VPWLGPQTDE AIKGL.CERG RKNILLVPIA
hemz_yeast  ...KFKNPYR LVWQSQV.GP KPWLGAQTAE .IAEF.LGPK VDGLMFIPIA
hemz_bacsu  ...AGVSEYA VGWQSEGNTP DPWLGPDVQD LTRDLFEQKG YQAFVYVPVG

            401                                                450
hemz_ecoli  FAADCLETLE EIAEQNREVF .LGAGGKKYE YIPALNATPE HIEMMANLVA
hemz_yeren  FSADCLETLE EIKEQNREVF .IHAGGEKFE YIPALNDDEG P.........
hemz_haein  FSVDCLETIE EIDEENRENF .LNNGGQSYQ YIPALNVEHA HIEMMGKLIL
hemz_braja  FAADCLETLE EIAQENAEIF .KHNGGETFS AIPCLNDSEP GMDVIRTLVL
hemz_cucsa  FVSEHIETLE EIDMEYKH.L ALESGIQNWG RVPALNCNSS FISDLADAVI
hemz_horvu  FVSEHIETLE EIDMEYRE.L ALESGIENWG RVPALGCTSS FISDLADAVV
hemz_arath  FVSEHIETLE EIDMEYRE.L ALESGVENWG RVPALGLTPS FITDLADAVI
hemz_bovin  FTSDHIETLY ELDIEYSQVL ASECGLENIR RAESLNGNPL FSKALADLVH
hemz_human  FTSDHIETLY ELDIEYSQVL AKECGVENIR RAESLNGNPL FSKALADLVH
hemz_mouse  FTSDHIETLY ELDIEYSQVL AQKCGAENIR RAESLNGNPL FSKALADLVH
hemz_yeast  FTSDHIETLH EIDLG...VI GESEYKDKFK RCESLNGNQT FIEGMADLVK
hemz_bacsu  FVADHLEVLY DNDYECK..V VTDDIGASYY RPEMPNAKPE FIDALATVVL

            451                                                500
hemz_ecoli  AYR....... .......... .......... .......... ..........
hemz_yeren  .......... .......... .......... .......... ..........
hemz_haein  EKLT...... .......... .......... .......... ..........
hemz_braja  RELQGWI... .......... .......... .......... ..........
hemz_cucsa  EALPSATALA PHTS...STD ADDHDPFLYA IKLLFGSVLA FILLLSPKAF
hemz_horvu  EALPSASAMA TRKV...KDT DSDMDMMHYL TKMFLGSVLA FFLLLSPRLV
hemz_arath  ESLPSAEAMS NPNAVVDSED SESSDAFSYI VKMFFGSILA FVLLLSPKMF
hemz_bovin  SHLQSKERCS TQLT...... ......LSCP LCVNPTCRET K.........
hemz_human  SHIQSNELCS KQLT...... ......LSCP LCVNPVCRET KSFFTSQQL.
hemz_mouse  SHIQSNKLCS TQLS...... ......LNCP LCVNPVCRKT KSFFTSQQL.
hemz_yeast  SHLQSNQLYS NQLP...... ......LDFA LGKSNDPVKD LSLVFGNHES
hemz_bacsu  KKLGR..... .......... .......... .......... ..........

            501                             532
hemz_ecoli  .......... .......... .......... ..
hemz_yeren  .......... .......... .......... ..
hemz_haein  .......... .......... .......... ..
hemz_braja  .......... .......... .......... ..
hemz_cucsa  MVFRNNFLLN YTRIYGYRGE RSEFFWVRLI FT
hemz_horvu  SAFRNTLQ.. .......... .......... ..
hemz_arath  HAFRNL.... .......... .......... ..
hemz_bovin  .......... .......... .......... ..
hemz_human  .......... .......... .......... ..
hemz_mouse  .......... .......... .......... ..
hemz_yeast  T......... .......... .......... ..
hemz_bacsu  .......... .......... .......... ..


****************************************************************************
*                                                                          *
*    PHD: Profile fed neural network systems from HeiDelberg               *
*    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~               *
*                                                                          *
*    Prediction of:			                                   *
* 	helical transmembrane regions, 	   by PHDhtm		   *
*                                                                          *
*    Author:             						   *
*	Burkhard Rost							   *
*       EMBL, 69012 Heidelberg, Germany					   *
*       Internet: Rost@EMBL-Heidelberg.DE				   *
*                                                                          *
*    All rights reserved.                                                  *
*                                                                          *
****************************************************************************
*                                                                          *
*    Percentage of helical trans-membrane predicted:                       *
*    +--------------+--------+--------+                                    *
*    | SecStr:      |    H   |    L   |                                    *
*    | % Predicted: |    0.0 |  100.0 |                                    *
*    +--------------+--------+--------+                                    *
*                                                                          *
****************************************************************************
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION: SYMBOLS
--- AA           : amino acid in one-letter code
--- PHD htm      : HTM's predicted by the PHD neural network
---                system (H=HTM, ' '=not HTM)
--- Rel htm      : Reliability index of prediction (0-9, 0 is low)
--- detail       : Neural network output in detail
--- prH htm      : 'Probability' for assigning a helical trans-
---                membrane region (HTM)
--- prL htm      : 'Probability' for assigning a non-HTM region
---          note: 'Probabilities' are scaled to the interval
---                0-9, e.g., prH=5 means that the first
---                output node is 0.5-0.6
--- subset       : Subset of more reliable predictions
--- SUB htm      : All residues for which the expected average
---                accuracy is > 82% (tables in header).
---          note: for this subset the following symbols are used:
---             L: is loop (for which above ' ' is used)
---           '.': means that no prediction is made for this
---                residue, as the reliability is Rel < 5
--- other        : predictions derived based on PHDhtm
--- PHDFhtm      : filtered prediction, i.e., too long HTM's are
---                split, too short ones are deleted
--- PHDRhtm      : refinement of neural network output 
--- PHDThtm      : topology prediction based on refined model
---                symbols used:
---             i: intra-cytoplasmic
---             T: transmembrane region
---             o: extra-cytoplasmic
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION
                  ....,....1....,....2....,....3....,....4....,....5....,....6
         AA      |............................................................|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....7....,....8....,....9....,....10...,....11...,....12
         AA      |....................................................MRQTKTGI|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....13...,....14...,....15...,....16...,....17...,....18
         AA      |LLANLGTPDAPTPEAVKRYLKQFLSDRRVVDTSRLLWWPLLRGVIFPLRSPRVAKLYASV|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....19...,....20...,....21...,....22...,....23...,....24
         AA      |WMEG..GSPLMVYSRQQQQALAQRLP....EMPVA...LGMSYGSPSLESAVDELLAEHV|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....25...,....26...,....27...,....28...,....29...,....30
         AA      |DHIVVLPLYPQFSCSTVGAVWDELARILARKRSIPGISF..IRDYADNHDYINALANSVR|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....31...,....32...,....33...,....34...,....35...,....36
         AA      |ASFAKHG.EPD...LLLLSYHGIPQRYA.DEGDDYPQRCRTTTRELASALGMAP..EKVM|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....37...,....38...,....39...,....40...,....41...,....42
         AA      |MTFQSRF.GREPWLMPYTDETLKML.GEKGVGHIQVMCPGFAADCLETLEEIAEQNREVF|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....43...,....44...,....45...,....46...,....47...,....48
         AA      |.LGAGGKKYEYIPALNATPEHIEMMANLVAAYR...........................|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
                  ....,....49...,....50...,....51...,....52...,....53...,....54
         AA      |....................................................|
         PHD htm |                                                    |
         Rel htm |9999888998887889999999999999999999999999999999999999|
 detail:         |                                                    |
         prH htm |0000000000001000000000000000000000000000000000000000|
         prL htm |9999999999998999999999999999999999999999999999999999|
 subset:         |                                                    |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION END
--- 
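
The symbol legend above can be read as two small rules, sketched here in Python. This is an illustration of the legend only, not the PHD source code, and the function names are hypothetical. The first rule maps a digit on the prH or prL line to the interval of the network output it stands for (prH=5 means 0.5-0.6). The second rule derives a subset line from a prediction line and its reliability line: positions with Rel < 5 become '.', and at the remaining positions ' ' is written as 'L'.

```python
def digit_to_interval(digit):
    """Map a prH/prL digit (0-9) to the interval of the output node it represents."""
    d = int(digit)
    return d / 10, (d + 1) / 10


def subset_line(pred, rel, threshold=5):
    """Build a 'SUB htm' style line from a 'PHD htm' line and its 'Rel htm' line.

    pred: prediction symbols ('H' = HTM, ' ' = not HTM)
    rel:  reliability digits (0-9), same length as pred
    """
    out = []
    for p, r in zip(pred, rel):
        if int(r) < threshold:
            out.append(".")  # too unreliable: no prediction in the subset
        else:
            out.append("L" if p == " " else p)  # ' ' is shown as loop 'L'
    return "".join(out)
```

For example, subset_line("  HH ", "97384") yields "LL.H.": the third and fifth positions fall below the reliability threshold and are masked.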



MSF of: hemz_horvu.hssp from:    1 to:  484
 hemz_horvu.msf  MSF:  484  Type: P 22-May-96  21:35:4  Check: 1795  ..
 
 
 Name: hemz_horvu   Len:   484  Check: 8534  Weight:  1.00
 Name: hemz_arath   Len:   484  Check: 1560  Weight:  1.00
 Name: hemz_cucsa   Len:   484  Check: 9165  Weight:  1.00
 Name: hemz_mouse   Len:   484  Check: 2753  Weight:  1.00
 Name: hemz_bovin   Len:   484  Check: 8569  Weight:  1.00
 Name: hemz_human   Len:   484  Check: 7497  Weight:  1.00
 Name: hemz_salty   Len:   484  Check: 6991  Weight:  1.00
 Name: hemz_yerps   Len:   484  Check: 7579  Weight:  1.00
 Name: hemz_yeast   Len:   484  Check: 8160  Weight:  1.00
 Name: hemz_yeren   Len:   484  Check: 8339  Weight:  1.00
 Name: hemz_ecoli   Len:   484  Check: 1992  Weight:  1.00
 Name: hemz_haein   Len:   484  Check: 6891  Weight:  1.00
 Name: hemz_bacsu   Len:   484  Check: 6269  Weight:  1.00
 Name: hemz_braja   Len:   484  Check: 1633  Weight:  1.00
 Name: yvbc_vaccc   Len:   484  Check: 5863  Weight:  1.00
 
//
 
 
           1                                                   50  
hemz_horvu MECVRSGALD LGRSGNFLGK SGSTTSCGKV RCSTNLAGST KCEQNLHGKA
hemz_arath .......... .......... .......... ..SSGFNPLT KRKDHRFPRS
hemz_cucsa .....SNIKL HGSTNTLNSD QRISSLCSLP KSRVTFSCKT SGNLQVRDRS
hemz_mouse .......... .......... .......... .......... .....LSASA
hemz_bovin .......... .......... .......... .......... ..........
hemz_human .......... .......... .......... .......... ..........
hemz_salty .......... .......... .......... .......... ..........
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast .......... .......... .......... .......... ..........
hemz_yeren .......... .......... .......... .......... ..........
hemz_ecoli .......... .......... .......... .......... ..........
hemz_haein .......... .......... .......... .......... ..........
hemz_bacsu .......... .......... .......... .......... ..........
hemz_braja .......... .......... .......... .......... ..........
yvbc_vaccc .......... .......... .......... .......... ..........
 
           51                                                 100  
hemz_horvu KPLLLSASGK ARGTSGLVHR SPVLKHQHHL SVRSTSTDVC TTFDEDVKGV
hemz_arath CSQRNSLseR SFGESMTITN RGLSFKTNVF EQARSVTGDC SYDETSAKAR
hemz_cucsa TGLVvsSSNG DRDVIQGLHL SGPIEKKSRL GQACCSVGT. FTVGEFALES
hemz_mouse NMAAALRAAG ALLREPLVHG SSrwRCQSGA AVAATTEKVH HAKTTKPQAQ
hemz_bovin ....LRSAGV LLRDRLLYGG SRACQPRRCQ SGAATAAAAT ETAQRARSPK
hemz_human ...LGANMAA ALRAAGVLLR DPLASSSWRv aAAAVTTETA QHAQGAKPQV
hemz_salty .......... .......... .......... .......... ..........
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast .......... .......... ......RTQG SFLRRSQLTI TRSFSVTFNM
hemz_yeren .......... .......... .......... .......... ..........
hemz_ecoli .......... .......... .......... .......... ..........
hemz_haein .......... .......... .......... .......... ..........
hemz_bacsu .......... .......... .......... .......... ..........
hemz_braja .......... .......... .......... ........MS TAAPNETTQP
yvbc_vaccc .......... .......... .......... .......... ..........
 
           101                                                150  
hemz_horvu SSHAVEEKVG VLLLNLGGPE TLNDVQPFLF NLFADPDIIR LPRLFRFLQR
hemz_arath SHVVAEDKIG VLLLNLGGPE TLNDVQPFLY NLFADPDIIR LPRPFQFLQG
hemz_cucsa QSQAVDDKVG VLLLNLGGPE TLDDVQPFLY NLFADPDIIR LPRLFRFLQE
hemz_mouse PERR.KPKTG ILMLNMGGPE TLGEVQDFLQ RLFLDRDLMT LP.....IQN
hemz_bovin PQAQpkPRTG ILMLNMGGPE TVEEVQDFLQ RLFLDQDLMT LP.....VQD
hemz_human QPQKRKPKTG ILMLNMGGPE TLGDVHDFLL RLFLDQDLMT LP.....IQN
hemz_salty .....QTKTG ILLANLGTPd tPEAVKRYLK QFLSDRRVVD TSRLLWWPL.
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast QNAQKRSPTG IVLMNMGGPS KVEETYDFLY QLFADNDLIP ISAKY...QK
hemz_yeren .....QSKLG VLMVNLGTPd tPQAVKRYLA EFLSDRRVvt SPWLWWPLLR
hemz_ecoli .....QTKTG ILLANLGTPd tPEAVKRYLK QFLSDRRVVD TSRLLWWPL.
hemz_haein .....PAKIG VLLANLGTPd tPKSISRYLW QFLTDPRVVD LPRCKWY..P
hemz_bacsu .....RKKMG LLVMAYGTP. .......... ..YKEEDIER YYTHIRRGRK
hemz_braja TVRSGQKRVG VLLVNLGTPD TAdgVRVYLK EFLSDARVIE DQGLVwvLNG
yvbc_vaccc .......... .......... .......... .......... ..........
 
           151                                                200  
hemz_horvu PLAKLISTFR APKSNEGYAS IGGGSPLRKI TDEQANALKV ALKSKNLEAD
hemz_arath TIAKFISVVR APKSKEGYAA IGGGSPLRKI TDEQADAIKM SLQAKNIAAN
hemz_cucsa PLAKLISTYR APKSKEGYAS IGGGSPLRKI TDEQAQALKM ALAEKNMSTN
hemz_mouse KLAPFIAKRR TPKIQER..R IGGGSPIKMW TSKQGekLLD ELSPATAPHK
hemz_bovin KLGPFIAKRR TPKIQEQYRR IGGGSPIKMW TSKQGekLLD ELSPHTAPHK
hemz_human KLAPFIAKRR TPKIQEQYRR IGGGSPIKIW TSKQGekLLD ELSPNTAPHK
hemz_salty .LRGVILPLR SPRVAKLYQS idGGSPLMVY SRQQQQALAA RLP....DTP
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast TIAKYIAKFR TPKIEKQYRE IGGGSPIRKW SEYQATEVCK ILdpETAPHK
hemz_yeren GVILPIRSPR VAKLYQ.SVW MDEGSPLLVY SRRQQKALAE RMP....EIP
hemz_ecoli .LRGVIFPLR SPRVAKLYAs mEGGSPLMVY SRQQQQALAQ RLP....EMP
hemz_haein LLKAIILPLR SKRIAKNYQA ieQGSPLLAI SRQQKDALQA YLDNQNIDTQ
hemz_bacsu PEPEMLQDLK .....DRYEA IGGISPLAQI TEQQAHNLEQ HLnqDEITFK
hemz_braja IILRQRPRSK ALDYQKIWNN EKNESPLKTI TRSQSAKLAA ALSDRDHVV.
yvbc_vaccc .......... .......... .......... .......... ..........
 
           201                                                250  
hemz_horvu IYVGMRYWYP FTEEAIDQIK KDKITKLVVL PLYPQYSIST SGSSIRVLQN
hemz_arath VYVGMRYWYP FTEEAVQQIK KDKITRLVVL PLYPQYSIST TGSSIRVLQD
hemz_cucsa VYVGMRYWYP FTEEAIQQIK RDGITRLVVL PLYPQYSIST TGSSIRVLQK
hemz_mouse YYIGFRYVHP LTEEAIEEME RDGLERAIAF TQYPQYSCST TGSSLNAIYR
hemz_bovin YYIGFRYVHP LTEEAIEEME RDGLERAVAF TQYPQYSCST TGSSLNAIYR
hemz_human YYIGFRYVHP LTEEAIEEME RDGLERAIAF TQYPQYSCST TGSSLNAIYR
hemz_salty VALGMSYGCP SLESAVDELL ASDVDHIVVL PLYPQYSCST VGAVWDELGR
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast PYVAFRYAKP LTAETYKQML KDGVKKAVAF SQYPHFSYST TGSSINELWR
hemz_yeren VELGMSYGSP NLPDAIDKLL AQGVTKLVVL PLYPQYSCST SAAVWDAVAR
hemz_ecoli VALGMSYGSP SLESAVDELL AEHVDHIVVL PLYPQFSCST VGAVWDELAR
hemz_haein VEIAMTYGNP SMQSAVKNLL KNQVERIIVL PLYPQYSSST TGAVFDAFAN
hemz_bacsu AYIGLKHIEP FIEDAVAEMH KDGITEAVSI VLAPHFSTFS VQSYNKRAK.
hemz_braja VDWAMRYGNP SIKSGIDALI GGMRPHLAV. PLYPQYSAST SATVCDEVFR
yvbc_vaccc .......... .......... .......... .........T AFFTIHNIKM
 
           251                                                300  
hemz_horvu IVKEDPYFAG LPISIIESWY QREGYVKSMA DLIEKELSVF SNPEEVMIFF
hemz_arath LFRKDPYLAG VPVAIIKSWY QRRGYVNSMA DLIEKELQTF SDPKEVMIFF
hemz_cucsa MFREDAYLSS LPVSIIKSWY QREGYIKSMA DLMQAELKNF ANPQEVMIFF
hemz_mouse YYNEVGQKPT MKWSTIDRWP THPLLIQCFA DHILKELNHf eKRSEVVILF
hemz_bovin YYNEVGRKPT MKWSTIDRWP THPLLIQCFA DHILKELDHf eKRREVVILF
hemz_human YYNQVGRKPT MKWSTIDRWP THHLLIQCFA DHILKELDHf eKRSEVVILF
hemz_salty ILAPKRRIPG I......... .......... .......... ..........
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast QIKALDSERS ISWSVIDRWP TNEGLIKAFS ENITKKLQEF PQpdKVVLLF
hemz_yeren ILKGYRRLP. .SISFIRDYA EHPAYISALK QSVENSFVQH GKPDRLVLSF
hemz_ecoli ILARKRSIPG ..ISFIRDYA DNHDYINALA NSVRASFAKH GEPD..LLLL
hemz_haein ALKEERGL.. LPFDFIHSYH IDENYINALA DSIKVRL... ..KSDEFLLF
hemz_bacsu ..EEAEKLGG LTITSVESWY DEPKFVTYWV DRVKETYASm dERENAMLIV
hemz_braja VL..ARLRAQ PTLRVTPPYY EDEAYIEALA VSIETHLATL PFKPE.LIVA
yvbc_vaccc IIYNRPHMat FHEYIIHAVF QRGGYLDGAA DIVEERFSVS FSPSLTFTRF
 
           301                                                350  
hemz_horvu SAHGVPLTYV KDAGDPYRDQ MEDCIALIME ELKSRGTLND HTLAYQSRVG
hemz_arath SAHGVPVSYV ENAGDPYQKQ MEECIDLIME ELKARGVLND HKLAYQSRVG
hemz_cucsa SAHGVPVSYV ENAGDPYKDQ MEECICLIMQ ELKARGIGNE HTLAYQSRVG
hemz_mouse SAHSLPMSVV .NRGDPYPQE VGATVHKVME KL...GYPNP YRLVWQSKVG
hemz_bovin SAHSLPMSVV .NRGDPYPQE VGATVQRVMD KL...GYSNP YRLVWQSKVG
hemz_human SAHSLPMSVV .NRGDPYPQE VSATVQKVME RLEYC...NP YRLVWQSKVG
hemz_salty .......... .......... .......... .......... ..........
hemz_yerps .......... .......... .......... .......... ..........
hemz_yeast SAHSLPMDVV .NTGDAYPAE VAATVYNIMQ KLKFK...NP YRLVWQSQVG
hemz_yeren ..HGIPKRY. AQLGDDYPQR CEDTSRALRA EIALPA..EQ IMMTYQSRFG
hemz_ecoli SYHGIPQRY. ADEGDDYPQR CRTTTRELAS ALGMAP..EK VMMTFQSRFG
hemz_haein SYHGIPLRYE K.MGDYYREH CKQTTIAVVN KLGL..TENQ WRMTFQSRFG
hemz_bacsu SAHSLP.EKI KEFGDPYPDQ LHESAKLIAE ....GAGVSE YAVGWQSEGn
hemz_braja SFHGMPKSYV .DKGDPYQEH CIATTEALRA A..RRLDASK LLLTFQSRFG
yvbc_vaccc .......... .......... .......... .......... ..........
 
           351                                                400  
hemz_horvu PVQWLKPYTD EVLVELGQKG VKSLLAVPVS FVSEHIETLE EIDMEYRELA
hemz_arath PVQWLKPYTD EVLVDLGKSG VKSLLAVPVS FVSEHIETLE EIDMEYRELA
hemz_cucsa PVQWLKPYTD EVLVELGQKG IKSLLAVPVS FVSEHIETLE EIDMEYKHLA
hemz_mouse PVPWLGPQTD EAIKGLCERG RKNILLVPIA FTSDHIETLY ELDIEYSqlA
hemz_bovin PMPWLGPQTD EAIKGLCKRG RKNILLVPIA FTSDHIETLY ELDIEYSqlA
hemz_human PMPWLGPQTD ESIKGLCERG RKNILLVPIA FTSDHIETLY ELDIEYSqlA
hemz_salty .......... .......... .......... .......... ..........
hemz_yerps .....TPYTD ETLKSLPSQG VKHIQLICPG FSADCLETLE EIKEQNREFF
hemz_yeast PKPWLGAQTA EIAEFLGPK. VDGLMFIPIA FTSDHIETLH EIDLGV..IG
hemz_yeren REPWLTPYTD ETLKSLPSQG VKHIQLICPG FSADCLETLE EIKEQNREVF
hemz_ecoli REPWLMPYTD ETLKMLGEKG VGHIQVMCPG FAADCLETLE EIAEQNREVF
hemz_haein REEWLQPYTD KFLESAAAQN IQKIAVICPG FSVDCLETIE EIDEENRENF
hemz_bacsu pDPWLGPDVQ DLTRDleQKG YQAFVYVPVG FVADHLEVLY DNDYECKVVT
hemz_braja NDEWLQPYTD KTMERLAKEG VRRIAVVTPG FAADCLETLE EIAQENAEIF
yvbc_vaccc .......... .......... .......... .......... ..........
 
           401                                                450  
hemz_horvu LESGIENWGR VPALGCTSSF ISDLADAVVE ALPSASAMAT RKVKDTDSDM
hemz_arath LESGVENWGR VPALGLTPSF ITDLADAVIE SLPSAEAMSN PNasEDSESS
hemz_cucsa LESGIQNWGR VPALNCNSSF ISDLADAVIE ALPSATALAP HTSSTDADDH
hemz_mouse QKCGAENIRR AESLNGNPLF SKALADLVHS HIQSNKLCST QLSLNCPLCV
hemz_bovin SECGLENIRR AESLNGNPLF SKALADLVHS HLQSKERCST QLTLSCPLCV
hemz_human KECGVENIRR AESLNGNPLF SKALADLVHS HIQSNELCSK QLTLSCPLCV
hemz_salty .......... .......... .......... .......... ..........
hemz_yerps LHAGGEKFEY IPALNDDEGH IALLEQLIRH NI........ ..........
hemz_yeast ESEYKDKFKR CESLNGNQTF IEGMADLVKS HLQSNqdFAL GKSNDPVKDL
hemz_yeren IHAGGEKFEY IPAL...... .......... .......... ..........
hemz_ecoli LGAGGKKYEY IPALNATPEH IEMMANLVAA .......... ..........
hemz_haein LNNGGQSYQY IPALNVEHAH IEMMGKLILE KL........ ..........
hemz_bacsu DDIG.ASYYR PEMPNAKPEF IDALATVVLK KLGR...... ..........
hemz_braja KHNGGETFSA IPCLNDSEPG MDVIRTLVLR EL........ ..........
yvbc_vaccc .......... .......... .......... .......... ..........
 
           451                               484   
hemz_horvu DMMHYLTKMF LGSVLAFFLL LSPRLVSAFR NTLQ
hemz_arath DAFSYIVKMF FGSILAFVLL LSPKMFHAFR N...
hemz_cucsa DPFLYAIKLL FGSVLAFILL LSPKAFMVFR NNF.
hemz_mouse NPVCRKTKSF FTS....... .......... ....
hemz_bovin NPTCRETKSF FTS....... .......... ....
hemz_human NPVCRETKSF FTS....... .......... ....
hemz_salty .......... .......... .......... ....
hemz_yerps .......... .......... .......... ....
hemz_yeast SLVF...... .......... .......... ....
hemz_yeren .......... .......... .......... ....
hemz_ecoli .......... .......... .......... ....
hemz_haein .......... .......... .......... ....
hemz_bacsu .......... .......... .......... ....
hemz_braja .......... .......... .......... ....
yvbc_vaccc .......... .......... .......... ....
 


****************************************************************************
*                                                                          *
*    PHD: Profile fed neural network systems from HeiDelberg               *
*    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~               *
*                                                                          *
*    Prediction of:			                                   *
* 	helical transmembrane regions, 	   	by PHDhtm		   *
*                                                                          *
*    Author:             						   *
*	Burkhard Rost							   *
*       EMBL, 69012 Heidelberg, Germany					   *
*       Internet: Rost@EMBL-Heidelberg.DE				   *
*                                                                          *
*    All rights reserved.                                                  *
*                                                                          *
****************************************************************************
*                                                                          *
*    Percentage of helical trans-membrane predicted:                       *
*    +--------------+--------+--------+                                    *
*    | SecStr:      |    H   |    L   |                                    *
*    | % Predicted: |    4.1 |   95.9 |                                    *
*    +--------------+--------+--------+                                    *
*                                                                          *
****************************************************************************
--- 
--- ------------------------------------------------------------
--- PhdTopology prediction of transmembrane helices and topology
--- ------------------------------------------------------------
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY HEADER: ABBREVIATIONS
--- 
--- NHTM_BEST    : number of transmembrane helices best model
--- NHTM_2ND_BEST: number of transmembrane helices 2nd best model
--- REL_BEST     : reliability of best model (0 is low, 9 high)
--- HTMTOP_PRD   : topology predicted ('in': intra-cytoplasmic)
--- HTMTOP_RID   : difference between positive charges
--- HTMTOP_RIP   : reliability of topology prediction (0-9)
--- MOD_NHTM     : number of transmembrane helices of model
--- MOD_STOT     : score for all residues
--- MOD_SHTM     : score for HTM added at current iteration step
--- MOD_N-C      : N  -  C  term of HTM added at current step
--- 
--- ALGORITHM REF: The refinement is performed by a dynamic pro-
--- ALGORITHM    : gramming-like procedure: iteratively the best
--- ALGORITHM    : transmembrane helix (HTM) compatible with the
--- ALGORITHM    : network output is added (starting from the  0
--- ALGORITHM    : assumption, i.e.,  no HTM's  in the protein).
--- ALGORITHM TOP: Topology is predicted by the  positive-inside
--- ALGORITHM    : rule, i.e., the positive charges are compiled
--- ALGORITHM    : separately  for all even and all odd  non-HTM
--- ALGORITHM    : regions.  If the difference (charge even-odd)
--- ALGORITHM    : is < 0, topology is predicted as 'in'.   That
--- ALGORITHM    : means, the protein N-term starts on the intra
--- ALGORITHM    : cytoplasmic side.
--- 
--- PhdTopology REFINEMENT HEADER: SUMMARY
 MOD_NHTM MOD_STOT MOD_SHTM MOD_N-C 
        1    0.972    0.854   457 -   474
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY HEADER: SUMMARY
--- NHTM_BEST    : 1
--- NHTM_2ND_BEST: 0
--- REL_BEST     : 2
--- HTMTOP_PRD   : out
--- HTMTOP_RID   : 1.720
--- HTMTOP_RIP   : 1
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION: SYMBOLS
--- AA           : amino acid in one-letter code
--- PHD htm      : HTM's predicted by the PHD neural network
---                system (H=HTM, ' '=not HTM)
--- Rel htm      : Reliability index of prediction (0-9, 0 is low)
--- detail       : Neural network output in detail
--- prH htm      : 'Probability' for assigning a helical trans-
---                membrane region (HTM)
--- prL htm      : 'Probability' for assigning a non-HTM region
---          note: 'Probabilites' are scaled to the interval
---                0-9, e.g., prH=5 means, that the first 
---                output node is 0.5-0.6
--- subset       : Subset of more reliable predictions
--- SUB htm      : All residues for which the expected average
---                accuracy is > 82% (tables in header).
---          note: for this subset the following symbols are used:
---             L: is loop (for which above ' ' is used)
---           '.': means that no prediction is made for this,
---                residue as the reliability is:  Rel < 5
--- other        : predictions derived based on PHDhtm
--- PHDFhtm      : filtered prediction, i.e., too long HTM's are
---                split, too short ones are deleted
--- PHDRhtm      : refinement of neural network output 
--- PHDThtm      : topology prediction based on refined model
---                symbols used:
---             i: intra-cytoplasmic
---             T: transmembrane region
---             o: extra-cytoplasmic
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION
                  ....,....1....,....2....,....3....,....4....,....5....,....6
         AA      |MECVRSGALDLGRSGNFLGKSGSTTSCGKVRCSTNLAGSTKCEQNLHGKAKPLLLSASGK|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....7....,....8....,....9....,....10...,....11...,....12
         AA      |ARGTSGLVHRSPVLKHQHHLSVRSTSTDVCTTFDEDVKGVSSHAVEEKVGVLLLNLGGPE|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....13...,....14...,....15...,....16...,....17...,....18
         AA      |TLNDVQPFLFNLFADPDIIRLPRLFRFLQRPLAKLISTFRAPKSNEGYASIGGGSPLRKI|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999998889999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....19...,....20...,....21...,....22...,....23...,....24
         AA      |TDEQANALKVALKSKNLEADIYVGMRYWYPFTEEAIDQIKKDKITKLVVLPLYPQYSIST|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999988888888899|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000000|
         prL htm |999999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....25...,....26...,....27...,....28...,....29...,....30
         AA      |SGSSIRVLQNIVKEDPYFAGLPISIIESWYQREGYVKSMADLIEKELSVFSNPEEVMIFF|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999877|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000000000000000000000000000000011|
         prL htm |999999999999999999999999999999999999999999999999999999999988|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....31...,....32...,....33...,....34...,....35...,....36
         AA      |SAHGVPLTYVKDAGDPYRDQMEDCIALIMEELKSRGTLNDHTLAYQSRVGPVQWLKPYTD|
         PHD htm |                                                            |
         Rel htm |788999999999999999999999999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |100000000000000000000000000000000000000000000000000000000000|
         prL htm |899999999999999999999999999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....37...,....38...,....39...,....40...,....41...,....42
         AA      |EVLVELGQKGVKSLLAVPVSFVSEHIETLEEIDMEYRELALESGIENWGRVPALGCTSSF|
         PHD htm |                                                            |
         Rel htm |999999999999988777667789999999999999999999999999999999999999|
 detail:         |                                                            |
         prH htm |000000000000000111111100000000000000000000000000000000000000|
         prL htm |999999999999999888888899999999999999999999999999999999999999|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL|
  other:         |                                                            |
         PHDFhtm |                                                            |
         PHDRhtm |                                                            |
         PHDThtm |oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo|
                  ....,....43...,....44...,....45...,....46...,....47...,....48
         AA      |ISDLADAVVEALPSASAMATRKVKDTDSDMDMMHYLTKMFLGSVLAFFLLLSPRLVSAFR|
         PHD htm |                                    HHHHHHHHHHHHHHHHHHHH    |
         Rel htm |988899999999999999999999999999987752036788888888887763200256|
 detail:         |                                                            |
         prH htm |000000000000000000000000000000001123568899999999998886654321|
         prL htm |999999999999999999999999999999998876431100000000001113345678|
 subset:         |                                                            |
         SUB htm |LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL...HHHHHHHHHHHHHHH.....LL|
  other:         |                                                            |
         PHDFhtm |                                    HHHHHHHHHHHHHHHHHHHH    |
         PHDRhtm |                                    HHHHHHHHHHHHHHHHHH      |
         PHDThtm |ooooooooooooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiii|
                  ....,....49...,....50...,....51...,....52...,....53...,....54
         AA      |NTLQ|
         PHD htm |    |
         Rel htm |7788|
 detail:         |    |
         prH htm |1100|
         prL htm |8899|
 subset:         |    |
         SUB htm |LLLL|
  other:         |    |
         PHDFhtm |    |
         PHDRhtm |    |
         PHDThtm |iiii|
--- 
--- PhdTopology REFINEMENT AND TOPOLOGY PREDICTION END
--- 
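
The positive-inside rule quoted in the ALGORITHM TOP lines of the header above can be illustrated with a short Python sketch. It is a simplification under stated assumptions, not the PHD implementation: only Lys (K) and Arg (R) are counted as positive, raw counts are used (the HTMTOP_RID value of 1.720 above shows that PHD uses some other scaling), non-HTM regions are numbered from 1 starting at the N-terminus, and a difference of exactly 0 is reported as 'out'. The function names are hypothetical.

```python
POSITIVE = set("KR")  # assumption: Lys and Arg are the positive charges


def non_htm_regions(seq, helices):
    """Split seq into the regions outside the helices.

    helices is a list of (first, last) residue numbers, 1-based, inclusive.
    """
    regions, start = [], 0
    for first, last in sorted(helices):
        regions.append(seq[start:first - 1])
        start = last
    regions.append(seq[start:])
    return regions


def predict_topology(seq, helices):
    """Return (topology, difference) with difference = even - odd charge."""
    regions = non_htm_regions(seq, helices)
    charge = [sum(aa in POSITIVE for aa in region) for region in regions]
    # Region 1 is the N-terminal region, so list index 0 is an odd region.
    odd = sum(charge[0::2])
    even = sum(charge[1::2])
    diff = even - odd
    # Per the header: difference < 0 means the N-term is intra-cytoplasmic.
    return ("in" if diff < 0 else "out"), diff
```

With one helix, the odd region is the N-terminal side and the even region the C-terminal side. A positive difference, as in the summary above, therefore places the C-terminus inside and the N-terminus outside ('out').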


Result from the unsuitable alignment (guide sequence contains insertions, shown as '.')


                  ....,....43...,....44...,....45...,....46...,....47...,....48
         AA      |.LGAGGKKYEYIPALNATPEHIEMMANLVAAYR...........................|
         PHD htm |                                                            |
         Rel htm |999999999999999999999999999999999999999999999999999999999999|

Result from the suitable alignment (guide sequence without insertions)

                  ....,....43...,....44...,....45...,....46...,....47...,....48
         AA      |ISDLADAVVEALPSASAMATRKVKDTDSDMDMMHYLTKMFLGSVLAFFLLLSPRLVSAFR|
         PHD htm |                                    HHHHHHHHHHHHHHHHHHHH    |
         Rel htm |988899999999999999999999999999987752036788888888887763200256|
  other:         |                                                            |
         PHDThtm |ooooooooooooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiii|  
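
The Rel htm row can be used to mask uncertain residues. The legend of the full output defines the SUB htm row this way: residues with Rel < 5 are shown as '.', the remaining ones as H (helix) or L (loop). The Python sketch below restates that rule. The function name and the threshold parameter are hypothetical, and the sketch is not taken from PHD itself.

```python
def reliable_subset(phd_row, rel_row, threshold=5):
    """Build a SUB-style row from a 'PHD htm' row and a 'Rel htm' row.

    phd_row uses 'H' for helix and ' ' for non-helix; rel_row holds one
    digit (0-9) per residue. Residues below the threshold become '.'.
    """
    if len(phd_row) != len(rel_row):
        raise ValueError("rows must have the same length")
    out = []
    for state, rel in zip(phd_row, rel_row):
        if int(rel) < threshold:
            out.append(".")  # no prediction: reliability too low
        else:
            out.append("H" if state == "H" else "L")
    return "".join(out)
```

Applied to the PHD htm and Rel htm rows of the block covering residues 421-480, the rule turns the low-reliability positions at both ends of the predicted helix into '.', which agrees with the SUB htm row printed in the full output.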

Notation:

AA: amino acid
PHD htm: prediction of transmembrane helices (H) and non-membrane regions (blank)
Rel htm: reliability of prediction, from 0 (low) to 9 (high)
PHDThtm: prediction of HTM topology:
o: extra-cytoplasmic non-membrane region
T: transmembrane helix
i: intra-cytoplasmic non-membrane region
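
To read segment boundaries off a PHDThtm row, consecutive identical symbols can be grouped into runs. The Python sketch below shows one way to do this. The function name and the first_residue parameter are hypothetical; first_residue is the residue number of the first column of the row (421 for the block shown above).

```python
from itertools import groupby


def topology_segments(row, first_residue=1):
    """Return a list of (symbol, first, last) runs from a PHDThtm row.

    Residue numbers are 1-based and inclusive; symbols are 'o', 'T' and 'i'
    as in the notation above.
    """
    segments, pos = [], first_residue
    for symbol, run in groupby(row):
        length = len(list(run))
        segments.append((symbol, pos, pos + length - 1))
        pos += length
    return segments
```

For a row of 36 'o', 18 'T' and 6 'i' starting at residue 421, the 'T' run covers residues 457 to 474, the same range as the MOD_N-C entry in the refinement summary.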



