# ../bin/tfastx35_t -m 9c -q -i -3 -m 6 ../seq/mgstm1.aa %p
TFASTX compares a protein to a translated DNA data bank
 version 35.04 Oct. 7, 2008
Please cite:
 Pearson et al, Genomics (1997) 46:24-36

Query: ../seq/mgstm1.aa
  1>>>GT8.7 | 266 40001 90043 | transl. of pa875.con, - 218 aa (reverse complement)
Library: GB165 Primate 5491101432 residues in 507455 library sequences

       opt      E()
< 20    98     0:=
  22     0     0:           one = represents 988 library sequences
  24     2     0:=
  26     7    11:*
  28    20   115:*
  30    96   700:*
  32   447  2708:= *
  34  2759  7343:===    *
  36 11472 15081:============   *
  38 33267 24924:=========================*========
  40 55500 34766:===================================*=====================
  42 59255 42497:===========================================*================
  44 56704 46879:===============================================*==========
  46 45718 47747:=============================================== *
  48 39371 45712:========================================      *
  50 33496 41713:==================================        *
  52 28148 36672:=============================        *
  54 24015 31325:=========================      *
  56 21582 26166:======================    *
  58 16632 21482:=================    *
  60 14125 17401:===============  *
  62 11535 13951:============  *
  64 17077 11095:===========*======
  66 10631  8769:========*==
  68 11422  6898:======*=====
  70  3958  5405:=====*
  72  2717  4224:=== *
  74  2091  3293:===*
  76  1623  2563:==*
  78  1156  1992:==*
  80   822  1547:=*
  82   647  1183:=*
  84   503   937:*
  86   348   725:*
  88   243   561:*          inset = represents 4 library sequences
  90   178   434:*
  92   189   336:*         :=======================================*
  94   108   260:*         :===========================            *
  96   121   201:*         :===============================        *
  98    80   156:*         :====================                  *
 100    42   120:*         :===========                  *
 102    27    93:*         :=======                *
 104    31    72:*         :========         *
 106     8    56:*         :==           *
 108    11    43:*         :===       *
 110    11    33:*         :===     *
 112     3    26:*         :=     *
 114     3    20:*         :=   *
 116     3    15:*         :=  *
 118     3    12:*         := *
>120    16     9:*         :==*=
5491667796 residues in 508321 library sequences
Statistics:  Expectation_n fit: rho(ln(x))= 4.4575+/-8.14e-05; mu= 5.6033+/- 0.006
 mean_var=57.5206+/- 9.119, 0's: 3 Z-trim: 12  B-trim: 4018 in 2/97
 Lambda= 0.169107
 statistics sampled from 60000 to 508305 sequences
 Kolmogorov-Smirnov  statistic: 0.0876 (N=29) at  44
Algorithm: TFASTX (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (o=15:-5:-1:-1) ktup: 2
 join: 36, open/ext: -12/-2, shift: -20 width:  16
 Scan time: 371.600



The best scores are:                               initn  opt  bits E(508321)
AC091492 - Homo sapiens chromosome 3  (180963) [r]   992 1040 265.3 2.8e-67
AL357515 - Human DNA sequence from cl (190440) [r]  1004 1003 256.3 1.5e-64
AK126952 - Homo sapiens cDNA FLJ45005 (3674) [r]     378  415 111.4 1.2e-22
AC010093 - Homo sapiens BAC clone RP1 (160930) [r]   692  331  92.2   3e-15
AC000032 - Homo sapiens Chromosome 1p (30140) [r]   1067  223  65.3 7.3e-08
AC195273 - Pan troglodytes BAC clone  (189196) [r]   649  227  66.9 1.5e-07
AC000031 - Homo sapiens Chromosome 1p (38703) [r]    793  216  63.7 2.9e-07
AC208868 - Homo sapiens FOSMID clone  (41047) [r]    656  216  63.7   3e-07
AC092859 - Pan troglodytes clone rp43 (175281) [r]  1023  222  65.7 3.3e-07
AC002520 - Homo sapiens Chromosome 1p (11899) [r]   1074  207  61.1 5.5e-07
AC092860 - Pan troglodytes clone rp43 (160656) [r]   796  203  61.0 7.6e-06
AL158847 - Human DNA sequence from cl (133451) [r]   794  200  60.2 1.1e-05
AK125820 - Homo sapiens cDNA FLJ43832 (4208) [r]     176  122  40.0    0.44


>>>GT8.7_rev, 218 aa vs %p library

>>AC091492 - Homo sapiens chromosome 3 clone RP11-335I9 map 3p,                                   (180963 aa)
Frame: r initn: 992 init1: 992 opt: 1040  Z-score: 1342.7  bits: 265.3 E(): 2.8e-67
trans. Smith-Waterman score: 1040; 68.3% identity (87.6% similar) in 218 aa overlap (1-218:108699-108046)

10 20 30 40 50 60 70 80 90 100 GT8.7 MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV ::. ::: ...::.: :..:::::::::.::.:::::.::.::: :: ::::::::::::::.::.:::::::::::... :: : :::::.::.:: AC0914 MPITLGYGDIHGLAHAIHLLLEYTDSSYEEKKYTMGDSPDYDRSJJLNJKFKLGLDFPNLPYLFDGAHKITQSNAILRHISCKHSLCEETEEEKIRVDIS 108700 108670 108640 108610 108580 108550 108520 108490 108460 108430 110 120 130 140 150 160 170 180 190 200 GT8.7 ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFEGLKKISAYMKS :::.::. ::: :.:: :.::: ::..:. .:::.::::.:::: :::: ::.:::::::::::: .:.:.::::::::.::.. .:::::::::::: AC0914 ENQTMDNLMQLAMICYIPEFEKLKPKYLEELPEKLKLYSQFLGKWPWFAJDKITYVDFLAYDILDLNCIFDPSCLDAFPNLKDFMSJIEGLKKISAYMKS 108400 108370 108340 108310 108280 108250 108220 108190 108160 108130 210 GT8.7 SRYIATPIFSKMAHWSNK :. . :.:.: : :..: AC0914 SQCLQGPLFGKSAMWNSK 108100 108070


>>AL357515 - Human DNA sequence from clone RP11-397G5 on chromo                                   (190440 aa)
Frame: r initn: 1004 init1: 905 opt: 1003  Z-score: 1293.6  bits: 256.3 E(): 1.5e-64
trans. Smith-Waterman score: 1003; 66.1% identity (86.2% similar) in 218 aa overlap (1-218:154780-154130)

10 20 30 40 50 60 70 80 90 100 GT8.7 MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV ::. ::::.. ::.: . .::.::: ::.::.: ::::::.:::::::::::::::::::::::::.::::::.::: .: ::.: :::: :.: ::. AL3575 MPVTLGYWDIJGLAHAVCLLLQYTDLSYEEKKYMMGDAPDYDRSQWLNEKFKLGLDFPNLPYLIDGAHKITQSKAILGCIAYKHNLCGETEGEKIWEDIL 154760 154730 154700 154670 154640 154610 154580 154550 154520 154490 110 120 130 140 150 160 170 180 190 200 GT8.7 ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFEGLKKISAYMKS :::..:...:: :::::::.: :::.:...: .::::.::::. : :::.: :::.:: ::.. ..:::: :::::::.::..::::: .::::::: AL3575 ENQLVDNHVQLARLCYNPDFKKLKPEYLEALPAMLKLYSQFLGKQLLFLGDKITLVDFIAYGILERNQVFEPKWLDAFPNLKDFISRFEGL-EISAYMKS 154460 154430 154400 154370 154340 154310 154280 154250 154220 154190 210 GT8.7 SRYIATPIFSKMAHWSNK : .. :.:.::: :.:: AL3575 SCFLLRPVFTKMAVWGNK 154160


>>AK126952 - Homo sapiens cDNA FLJ45005 fis, clone BRAWH3011907                                   (3674 aa)
Frame: r initn: 378 init1: 156 opt: 415  Z-score: 541.6  bits: 111.4 E(): 1.2e-22
trans. Smith-Waterman score: 415; 40.0% identity (68.6% similar) in 220 aa overlap (2-218:1595-961)

10 20 30 40 50 60 70 80 90 GT8.7 PMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTM-GDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNA-IL-RYLARKHHL--DGETEEERIR :: .: .: :: .:: :: .. ..:: ::::. :::: .:.:::::: ::...:... . :: : : : ..:::: .. AK1269 PMEMGDRDVLGLILDTCLLL-------GGKRJSI\AEAPGHGGSQWLDVKFKLDVDIPNLPYLTDGKNRIVHNPG\ILVRPAALTSTLLQSAETEEGKVP 1590 1560 1530 1500 1470 1440 1410 1380 1350 100 110 120 130 140 150 160 170 180 190 GT8.7 ADIVENQVMDTRMQLIMLCYNPDF-EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLR-DFLARFEGLKKI .:: :.:... ..:: .: ..::. : :: . . : ..: .: :: .. :: : :.:..:.:. :.::: ::: ::: ::::: ::. .. .:.. AK1269 GDIREDQMVEYHLQLKQLHHDPDM/ETWKPPYSELQPGQLKHFSLFL-RKSWFPGGKLTFLDILTCDVLDQNCMFELKCLYEFPNLR/DFMHHIAALEER 1320 1290 1260 1230 1200 1170 1140 1110 1080 1050 200 210 GT8.7 SAYMKSS-RYIATPIFSKMAHWSNK .: ..:. ... :. :.::.:. . AK1269 AAPVQSA/LFFGMPVSSEMAQWGPR 1020 990


>>AC010093 - Homo sapiens BAC clone RP11-323O5 from 2, complete                                   (160930 aa)
Frame: r initn: 692 init1: 214 opt: 331  Z-score: 408.6  bits: 92.2 E(): 3e-15
trans. Smith-Waterman score: 461; 37.0% identity (51.7% similar) in 327 aa overlap (3-218:100065-99099)

10 20 30 40 50 60 70 80 90 100 GT8.7 MILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPD-FDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIVE :.:: ::. ::.: : .:: .:: ::.:: :: ..: .: ::::. ::: ::::::: ..::..:.::::::: : : : :. :.: :.:..::.: AC0100 MVLGJWNICGLAHTICLLLJFTDMSYEEKWYTCQETP-/YDJSQWLDVIFKLDLDFPNLPNFMDGKNKVTQSNAILCYTAGK-HMCGKT--EKIQVDILE 100060 100030 100000 99970 99940 99910 99880 99850 99820 99790 110 120 GT8.7 NQVMDTRMQLIMLCYNPDFEK------------------------------------------------------------------------------- ::. . :::. ::: : :: AC0100 NQATSFCTQLIQCCYNSDHEKLKPGQAQWLTPLIPALWEASLFSWITSSGDQDHPGJYGETPSLLKIQKVSWAWWRAPVVSATQEAEAGEJLEPGRRRLQ 99760 99730 99700 99670 99640 99610 99580 99550 99520 99490 130 140 150 160 170 180 GT8.7 --------------------------------Q-KPEFLKTIPEKMKLYSEFLG-KRPWFAGD-KVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLAR ::..:. .::..: .: ::: : ::::. :.:.::: ::.::: :::::::: ::.:. :. : AC0100 JAEIAPLHSSLGDRARLHLKKKNKJIKJIKTNP\KPQYLEQLPEQLKQFSMFLG\KFSWFAGE/KLTFVDFPIYDVLDQKCMFEPKCLDEFPHLKAFMCR 99460 99430 99400 99370 99340 99310 99280 99250 99220 190 200 210 GT8.7 FEG-LKKISAYMKSSRYIATPIFSKMAHWSNK . : :.::.:::.:. .. :: .:::.:.:: AC0100 J-G\LEKIAAYMQSDCFLKMPINNKMAQWGNK 99190 99160 99130


>>AC000032 - Homo sapiens Chromosome 1p13.3 Cosmid Clone cgtm12                                   (30140 aa)
Frame: r initn: 1067 init1: 202 opt: 223  Z-score: 276.0  bits: 65.3 E(): 7.3e-08
trans. Smith-Waterman score: 336; 51.9% identity (67.3% similar) in 104 aa overlap (115-189:23726-23418)

120 130 140 150 160 170 180 GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFL :. :. :: ::..:. .:::.:::::::::::::::.:: :.::::.::.:: .:.::::::::::::.::. AC0000 CFLPQ-EKLKPKYLEELPEKLKLYSEFLGKRPWFAGNKVKEEJYGEJDLFCFTCYGGSSPHILGLLQITFVDFLVYDVLDLHRIFEPKCLDAFPNLKDFI 23700 23670 23640 23610 23580 23550 23520 23490 23460 GT8.7 ARFE .::: AC0000 SRFE 23430


>>AC195273 - Pan troglodytes BAC clone CH251-397N13 from chromo                                   (189196 aa)
Frame: r initn: 649 init1: 202 opt: 227  Z-score: 270.5  bits: 66.9 E(): 1.5e-07
trans. Smith-Waterman score: 327; 53.1% identity (67.3% similar) in 98 aa overlap (121-189:28212-27919)

130 140 150 160 170 180 GT8.7 EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFE :: ::..:. .:::.:::::::::::::::.:: :.::::.::.:: .:.::::::::::::.::..::: AC1952 EKLKPKYLEELPEKLKLYSEFLGKRPWFAGNKVKEEJYGEJDLFCFTCYGGSSPHILGLLQITFVDFLVYDVLDLHRIFEPKCLDAFPNLKDFISRFE 28210 28180 28150 28120 28090 28060 28030 28000 27970 27940


>>AC000031 - Homo sapiens Chromosome 1p13.3 Cosmid Clone ctgm1,                                   (38703 aa)
Frame: r initn: 793 init1: 201 opt: 216  Z-score: 265.3  bits: 63.7 E(): 2.9e-07
trans. Smith-Waterman score: 308; 47.6% identity (63.8% similar) in 105 aa overlap (115-189:29768-29457)

120 130 140 150 160 170 180 GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF :. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.:::.::::::::.:: AC0000 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILRYRDSSPHILGLLQITFVDFLAYDVLDLHRIFEPNCLDAFPNLKDF 29760 29730 29700 29670 29640 29610 29580 29550 29520 29490 GT8.7 LARFE ..::: AC0000 ISRFE 29460


>>AC208868 - Homo sapiens FOSMID clone ABC9-41289500I16 from ch                                   (41047 aa)
Frame: r initn: 656 init1: 201 opt: 216  Z-score: 265.0  bits: 63.7 E(): 3e-07
trans. Smith-Waterman score: 308; 47.6% identity (63.8% similar) in 105 aa overlap (115-189:38059-37748)

120 130 140 150 160 170 180 GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF :. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.:::.::::::::.:: AC2088 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILCYRDSSPHILGLLQITFVDFLAYDVLDLHRIFEPNCLDAFPNLKDF 38060 38030 38000 37970 37940 37910 37880 37850 37820 37790 GT8.7 LARFE ..::: AC2088 ISRFE 37760


>>AC092859 - Pan troglodytes clone rp43-111m15, complete sequen                                   (175281 aa)
Frame: r initn: 1023 init1: 207 opt: 222  Z-score: 264.4  bits: 65.7 E(): 3.3e-07
trans. Smith-Waterman score: 314; 48.6% identity (63.8% similar) in 105 aa overlap (115-189:26961-26650)

120 130 140 150 160 170 180 GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF :. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.::::::::::::.:: AC0928 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILCYGDSSPHILGLLQITFVDFLAYDVLDLHRIFEPKCLDAFPNLKDF 26950 26920 26890 26860 26830 26800 26770 26740 26710 26680 GT8.7 LARFE ..::: AC0928 ISRFE


>>AC002520 - Homo sapiens Chromosome 1p13 Cosmid Clone m5-3, co                                   (11899 aa)
Frame: r initn: 1074 init1: 192 opt: 207  Z-score: 260.4  bits: 61.1 E(): 5.5e-07
trans. Smith-Waterman score: 331; 52.9% identity (66.3% similar) in 104 aa overlap (115-189:5278-4970)

120 130 140 150 160 170 180 GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFL :. :. :: ::..:. .:::.:::::::::::::::::: :.:::::::.::. :.::::::::: ::.::. AC0025 CFLPQ-EKLKPKYLEELPEKLKLYSEFLGKRPWFAGDKVKEEJYGEJDLFYFMCFGVFSPHILGLLQITFVDFLAYDVLDMKRIFEPKCLDAFLNLKDFI 5270 5240 5210 5180 5150 5120 5090 5060 5030 5000 GT8.7 ARFE .::: AC0025 SRFE


>>AC092860 - Pan troglodytes clone rp43-125j14, complete sequen                                   (160656 aa)
Frame: r initn: 796 init1: 187 opt: 203  Z-score: 239.8  bits: 61.0 E(): 7.6e-06
trans. Smith-Waterman score: 240; 44.9% identity (59.2% similar) in 98 aa overlap (121-189:70421-70127)

130 140 150 160 170 180 GT8.7 EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFE :: ::..:. .: ..: .: :::: ::::.:: :.::::.:::::: :.:.::::: ::::. :. ::: AC0928 EKLKPQYLEELPGQLKQFSMFLGKFSWFAGEKVGRREKKRILLYLCRLLLLRPKHLIAFLLA\TFVDFLTYDILDQNRIFDPKCLDEFPNLKAFMCRFE 70410 70380 70350 70320 70290 70260 70230 70200 70170 70140


>>AL158847 - Human DNA sequence from clone RP4-735C1 on chromos                                   (133451 aa)
Frame: r initn: 794 init1: 187 opt: 200  Z-score: 237.0  bits: 60.2 E(): 1.1e-05
trans. Smith-Waterman score: 243; 30.1% identity (38.8% similar) in 312 aa overlap (51-189:26405-25472)

60 70 80 90 100 110 GT8.7 FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHL-------------------------------DGETEEERIRADIVENQVMDTRMQLIMLCY--- : :...: .::::.::..:::::::::::.::::.. ::::::.::.::.:::::: : ::: ::: AL1588 FVLSFSF-QLPYLLDGKNKITQSNAILRYIARKHNMCEWGRAGEGPQAGSLSESGJCQGJFVVIWPTGGETEEEKIRVDIIENQVMDFRTQLIRLCYSSD 26400 26370 26340 26310 26280 26250 26220 26190 26160 26130 120 GT8.7 ------------------------------------NPDF------------------------------------------------------------ ::: AL1588 HVSFLSTHLLNGVSEKGVFRNPRVISVPFCCCYLPTNPDCJNKKQNLYFLVCFVMJFWCWCLFFLNSSFPTLPSSSEGNENPPREGATKCPSLACALINY 26100 26070 26040 26010 25980 25950 25920 25890 25860 25830 130 140 150 160 170 GT8.7 --------------EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLD :: ::..:. .: ..: .: :::: ::::.:: :.::::.:::::: :.:.::::: AL1588 KDDHHYSHSVFSSQEKLKPQYLEELPGQLKQFSMFLGKFSWFAGEKVGRREKKRILLYLCRLLLLRPKHLIAFLLA\TFVDFLTYDILDQNRIFDPKCLD 25800 25770 25740 25710 25680 25650 25620 25590 25560 25530 180 GT8.7 AFPNLRDFLARFE ::::. :. ::: AL1588 EFPNLKAFMCRFE 25500


>>AK125820 - Homo sapiens cDNA FLJ43832 fis, clone TESTI4005543                                   (4208 aa)
Frame: r initn: 176 init1: 120 opt: 122  Z-score: 154.4  bits: 40.0 E(): 0.44
trans. Smith-Waterman score: 122; 56.7% identity (80.0% similar) in 30 aa overlap (90-119:380-291)

90 100 110 GT8.7 TEEERIRADIVENQVMDTRMQLIMLCYNPD .:::.::.: .::::.:. :: .:: :: AK1258 SEEEKIRVDTLENQVIDVSSQLAGVCYRPD 360 330 300




218 residues in 1 query   sequences
5491101432 residues in 507455 library sequences
 Tcomplib [35.04] (4 proc)
 start: Fri Oct 24 16:10:08 2008 done: Fri Oct 24 16:12:50 2008
 Total Scan time: 371.600 Total Display time:  0.450

Function used was TFASTX [version 35.04 Oct. 7, 2008]