# ../bin/tfastx35_t -m 9c -q -i -3 -m 6 ../seq/mgstm1.aa %p
TFASTX compares a protein to a translated DNA data bank version 35.04 Oct. 7, 2008 Please cite: Pearson et al, Genomics (1997) 46:24-36Query: ../seq/mgstm1.aa
1>>>GT8.7 | 266 40001 90043 | transl. of pa875.con, - 218 aa (reverse complement)
Library: GB165 Primate 5491101432 residues in 507455 library sequences
opt E()
< 20 98 0:=
22 0 0: one = represents 988 library sequences
24 2 0:=
26 7 11:*
28 20 115:*
30 96 700:*
32 447 2708:= *
34 2759 7343:=== *
36 11472 15081:============ *
38 33267 24924:=========================*========
40 55500 34766:===================================*=====================
42 59255 42497:===========================================*================
44 56704 46879:===============================================*==========
46 45718 47747:=============================================== *
48 39371 45712:======================================== *
50 33496 41713:================================== *
52 28148 36672:============================= *
54 24015 31325:========================= *
56 21582 26166:====================== *
58 16632 21482:================= *
60 14125 17401:=============== *
62 11535 13951:============ *
64 17077 11095:===========*======
66 10631 8769:========*==
68 11422 6898:======*=====
70 3958 5405:=====*
72 2717 4224:=== *
74 2091 3293:===*
76 1623 2563:==*
78 1156 1992:==*
80 822 1547:=*
82 647 1183:=*
84 503 937:*
86 348 725:*
88 243 561:* inset = represents 4 library sequences
90 178 434:*
92 189 336:* :=======================================*
94 108 260:* :=========================== *
96 121 201:* :=============================== *
98 80 156:* :==================== *
100 42 120:* :=========== *
102 27 93:* :======= *
104 31 72:* :======== *
106 8 56:* :== *
108 11 43:* :=== *
110 11 33:* :=== *
112 3 26:* := *
114 3 20:* := *
116 3 15:* := *
118 3 12:* := *
>120 16 9:* :==*=
5491667796 residues in 508321 library sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.4575+/-8.14e-05; mu= 5.6033+/- 0.006
mean_var=57.5206+/- 9.119, 0's: 3 Z-trim: 12 B-trim: 4018 in 2/97
Lambda= 0.169107
statistics sampled from 60000 to 508305 sequences
Kolmogorov-Smirnov statistic: 0.0876 (N=29) at 44
Algorithm: TFASTX (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (o=15:-5:-1:-1) ktup: 2
join: 36, open/ext: -12/-2, shift: -20 width: 16
Scan time: 371.600
The best scores are: initn opt bits E(508321)
AC091492 - Homo sapiens chromosome 3 (180963) [r] 992 1040 265.3 2.8e-67 align
AL357515 - Human DNA sequence from cl (190440) [r] 1004 1003 256.3 1.5e-64 align
AK126952 - Homo sapiens cDNA FLJ45005 (3674) [r] 378 415 111.4 1.2e-22 align
AC010093 - Homo sapiens BAC clone RP1 (160930) [r] 692 331 92.2 3e-15 align
AC000032 - Homo sapiens Chromosome 1p (30140) [r] 1067 223 65.3 7.3e-08 align
AC195273 - Pan troglodytes BAC clone (189196) [r] 649 227 66.9 1.5e-07 align
AC000031 - Homo sapiens Chromosome 1p (38703) [r] 793 216 63.7 2.9e-07 align
AC208868 - Homo sapiens FOSMID clone (41047) [r] 656 216 63.7 3e-07 align
AC092859 - Pan troglodytes clone rp43 (175281) [r] 1023 222 65.7 3.3e-07 align
AC002520 - Homo sapiens Chromosome 1p (11899) [r] 1074 207 61.1 5.5e-07 align
AC092860 - Pan troglodytes clone rp43 (160656) [r] 796 203 61.0 7.6e-06 align
AL158847 - Human DNA sequence from cl (133451) [r] 794 200 60.2 1.1e-05 align
AK125820 - Homo sapiens cDNA FLJ43832 (4208) [r] 176 122 40.0 0.44 align
>>>GT8.7_rev, 218 aa vs %p library
10 20 30 40 50 60 70 80 90 100
GT8.7 MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV
::. ::: ...::.: :..:::::::::.::.:::::.::.::: :: ::::::::::::::.::.:::::::::::... :: : :::::.::.::
AC0914 MPITLGYGDIHGLAHAIHLLLEYTDSSYEEKKYTMGDSPDYDRSJJLNJKFKLGLDFPNLPYLFDGAHKITQSNAILRHISCKHSLCEETEEEKIRVDIS
108700 108670 108640 108610 108580 108550 108520 108490 108460 108430
110 120 130 140 150 160 170 180 190 200
GT8.7 ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFEGLKKISAYMKS
:::.::. ::: :.:: :.::: ::..:. .:::.::::.:::: :::: ::.:::::::::::: .:.:.::::::::.::.. .::::::::::::
AC0914 ENQTMDNLMQLAMICYIPEFEKLKPKYLEELPEKLKLYSQFLGKWPWFAJDKITYVDFLAYDILDLNCIFDPSCLDAFPNLKDFMSJIEGLKKISAYMKS
108400 108370 108340 108310 108280 108250 108220 108190 108160 108130
210
GT8.7 SRYIATPIFSKMAHWSNK
:. . :.:.: : :..:
AC0914 SQCLQGPLFGKSAMWNSK
108100 108070
10 20 30 40 50 60 70 80 90 100
GT8.7 MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV
::. ::::.. ::.: . .::.::: ::.::.: ::::::.:::::::::::::::::::::::::.::::::.::: .: ::.: :::: :.: ::.
AL3575 MPVTLGYWDIJGLAHAVCLLLQYTDLSYEEKKYMMGDAPDYDRSQWLNEKFKLGLDFPNLPYLIDGAHKITQSKAILGCIAYKHNLCGETEGEKIWEDIL
154760 154730 154700 154670 154640 154610 154580 154550 154520 154490
110 120 130 140 150 160 170 180 190 200
GT8.7 ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFEGLKKISAYMKS
:::..:...:: :::::::.: :::.:...: .::::.::::. : :::.: :::.:: ::.. ..:::: :::::::.::..::::: .:::::::
AL3575 ENQLVDNHVQLARLCYNPDFKKLKPEYLEALPAMLKLYSQFLGKQLLFLGDKITLVDFIAYGILERNQVFEPKWLDAFPNLKDFISRFEGL-EISAYMKS
154460 154430 154400 154370 154340 154310 154280 154250 154220 154190
210
GT8.7 SRYIATPIFSKMAHWSNK
: .. :.:.::: :.::
AL3575 SCFLLRPVFTKMAVWGNK
154160
10 20 30 40 50 60 70 80 90
GT8.7 PMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTM-GDAPDFDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNA-IL-RYLARKHHL--DGETEEERIR
:: .: .: :: .:: :: .. ..:: ::::. :::: .:.:::::: ::...:... . :: : : : ..:::: ..
AK1269 PMEMGDRDVLGLILDTCLLL-------GGKRJSI\AEAPGHGGSQWLDVKFKLDVDIPNLPYLTDGKNRIVHNPG\ILVRPAALTSTLLQSAETEEGKVP
1590 1560 1530 1500 1470 1440 1410 1380 1350
100 110 120 130 140 150 160 170 180 190
GT8.7 ADIVENQVMDTRMQLIMLCYNPDF-EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKVTYVDFLAYDILDQYRMFEPKCLDAFPNLR-DFLARFEGLKKI
.:: :.:... ..:: .: ..::. : :: . . : ..: .: :: .. :: : :.:..:.:. :.::: ::: ::: ::::: ::. .. .:..
AK1269 GDIREDQMVEYHLQLKQLHHDPDM/ETWKPPYSELQPGQLKHFSLFL-RKSWFPGGKLTFLDILTCDVLDQNCMFELKCLYEFPNLR/DFMHHIAALEER
1320 1290 1260 1230 1200 1170 1140 1110 1080 1050
200 210
GT8.7 SAYMKSS-RYIATPIFSKMAHWSNK
.: ..:. ... :. :.::.:. .
AK1269 AAPVQSA/LFFGMPVSSEMAQWGPR
1020 990
10 20 30 40 50 60 70 80 90 100
GT8.7 MILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPD-FDRSQWLNEKFKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIVE
:.:: ::. ::.: : .:: .:: ::.:: :: ..: .: ::::. ::: ::::::: ..::..:.::::::: : : : :. :.: :.:..::.:
AC0100 MVLGJWNICGLAHTICLLLJFTDMSYEEKWYTCQETP-/YDJSQWLDVIFKLDLDFPNLPNFMDGKNKVTQSNAILCYTAGK-HMCGKT--EKIQVDILE
100060 100030 100000 99970 99940 99910 99880 99850 99820 99790
110 120
GT8.7 NQVMDTRMQLIMLCYNPDFEK-------------------------------------------------------------------------------
::. . :::. ::: : ::
AC0100 NQATSFCTQLIQCCYNSDHEKLKPGQAQWLTPLIPALWEASLFSWITSSGDQDHPGJYGETPSLLKIQKVSWAWWRAPVVSATQEAEAGEJLEPGRRRLQ
99760 99730 99700 99670 99640 99610 99580 99550 99520 99490
130 140 150 160 170 180
GT8.7 --------------------------------Q-KPEFLKTIPEKMKLYSEFLG-KRPWFAGD-KVTYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLAR
::..:. .::..: .: ::: : ::::. :.:.::: ::.::: :::::::: ::.:. :. :
AC0100 JAEIAPLHSSLGDRARLHLKKKNKJIKJIKTNP\KPQYLEQLPEQLKQFSMFLG\KFSWFAGE/KLTFVDFPIYDVLDQKCMFEPKCLDEFPHLKAFMCR
99460 99430 99400 99370 99340 99310 99280 99250 99220
190 200 210
GT8.7 FEG-LKKISAYMKSSRYIATPIFSKMAHWSNK
. : :.::.:::.:. .. :: .:::.:.::
AC0100 J-G\LEKIAAYMQSDCFLKMPINNKMAQWGNK
99190 99160 99130
120 130 140 150 160 170 180
GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFL
:. :. :: ::..:. .:::.:::::::::::::::.:: :.::::.::.:: .:.::::::::::::.::.
AC0000 CFLPQ-EKLKPKYLEELPEKLKLYSEFLGKRPWFAGNKVKEEJYGEJDLFCFTCYGGSSPHILGLLQITFVDFLVYDVLDLHRIFEPKCLDAFPNLKDFI
23700 23670 23640 23610 23580 23550 23520 23490 23460
GT8.7 ARFE
.:::
AC0000 SRFE
23430
130 140 150 160 170 180
GT8.7 EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFE
:: ::..:. .:::.:::::::::::::::.:: :.::::.::.:: .:.::::::::::::.::..:::
AC1952 EKLKPKYLEELPEKLKLYSEFLGKRPWFAGNKVKEEJYGEJDLFCFTCYGGSSPHILGLLQITFVDFLVYDVLDLHRIFEPKCLDAFPNLKDFISRFE
28210 28180 28150 28120 28090 28060 28030 28000 27970 27940
120 130 140 150 160 170 180
GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF
:. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.:::.::::::::.::
AC0000 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILRYRDSSPHILGLLQITFVDFLAYDVLDLHRIFEPNCLDAFPNLKDF
29760 29730 29700 29670 29640 29610 29580 29550 29520 29490
GT8.7 LARFE
..:::
AC0000 ISRFE
29460
120 130 140 150 160 170 180
GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF
:. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.:::.::::::::.::
AC2088 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILCYRDSSPHILGLLQITFVDFLAYDVLDLHRIFEPNCLDAFPNLKDF
38060 38030 38000 37970 37940 37910 37880 37850 37820 37790
GT8.7 LARFE
..:::
AC2088 ISRFE
37760
120 130 140 150 160 170 180
GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDF
:. :. :: :::.:. .: :. .:.::::::::.:::: :.:::::::.:: .:.::::::::::::.::
AC0928 CFLPQ-EKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKVMGACDEDTRDLPYILCYGDSSPHILGLLQITFVDFLAYDVLDLHRIFEPKCLDAFPNLKDF
26950 26920 26890 26860 26830 26800 26770 26740 26710 26680
GT8.7 LARFE
..:::
AC0928 ISRFE
120 130 140 150 160 170 180
GT8.7 CYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV-----------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFL
:. :. :: ::..:. .:::.:::::::::::::::::: :.:::::::.::. :.::::::::: ::.::.
AC0025 CFLPQ-EKLKPKYLEELPEKLKLYSEFLGKRPWFAGDKVKEEJYGEJDLFYFMCFGVFSPHILGLLQITFVDFLAYDVLDMKRIFEPKCLDAFLNLKDFI
5270 5240 5210 5180 5150 5120 5090 5060 5030 5000
GT8.7 ARFE
.:::
AC0025 SRFE
130 140 150 160 170 180
GT8.7 EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLDAFPNLRDFLARFE
:: ::..:. .: ..: .: :::: ::::.:: :.::::.:::::: :.:.::::: ::::. :. :::
AC0928 EKLKPQYLEELPGQLKQFSMFLGKFSWFAGEKVGRREKKRILLYLCRLLLLRPKHLIAFLLA\TFVDFLTYDILDQNRIFDPKCLDEFPNLKAFMCRFE
70410 70380 70350 70320 70290 70260 70230 70200 70170 70140
60 70 80 90 100 110
GT8.7 FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHL-------------------------------DGETEEERIRADIVENQVMDTRMQLIMLCY---
: :...: .::::.::..:::::::::::.::::.. ::::::.::.::.:::::: : ::: :::
AL1588 FVLSFSF-QLPYLLDGKNKITQSNAILRYIARKHNMCEWGRAGEGPQAGSLSESGJCQGJFVVIWPTGGETEEEKIRVDIIENQVMDFRTQLIRLCYSSD
26400 26370 26340 26310 26280 26250 26220 26190 26160 26130
120
GT8.7 ------------------------------------NPDF------------------------------------------------------------
:::
AL1588 HVSFLSTHLLNGVSEKGVFRNPRVISVPFCCCYLPTNPDCJNKKQNLYFLVCFVMJFWCWCLFFLNSSFPTLPSSSEGNENPPREGATKCPSLACALINY
26100 26070 26040 26010 25980 25950 25920 25890 25860 25830
130 140 150 160 170
GT8.7 --------------EKQKPEFLKTIPEKMKLYSEFLGKRPWFAGDKV------------------------------TYVDFLAYDILDQYRMFEPKCLD
:: ::..:. .: ..: .: :::: ::::.:: :.::::.:::::: :.:.:::::
AL1588 KDDHHYSHSVFSSQEKLKPQYLEELPGQLKQFSMFLGKFSWFAGEKVGRREKKRILLYLCRLLLLRPKHLIAFLLA\TFVDFLTYDILDQNRIFDPKCLD
25800 25770 25740 25710 25680 25650 25620 25590 25560 25530
180
GT8.7 AFPNLRDFLARFE
::::. :. :::
AL1588 EFPNLKAFMCRFE
25500
90 100 110
GT8.7 TEEERIRADIVENQVMDTRMQLIMLCYNPD
.:::.::.: .::::.:. :: .:: ::
AK1258 SEEEKIRVDTLENQVIDVSSQLAGVCYRPD
360 330 300
>>AC091492 - Homo sapiens chromosome 3 clone RP11-335I9 map 3p, (180963 aa)
Frame: r initn: 992 init1: 992 opt: 1040 Z-score: 1342.7 bits: 265.3 E(): 2.8e-67
trans. Smith-Waterman score: 1040; 68.3% identity (87.6% similar) in 218 aa overlap (1-218:108699-108046)
Entrez lookup Re-search database General re-search
>>AL357515 - Human DNA sequence from clone RP11-397G5 on chromo (190440 aa)
Frame: r initn: 1004 init1: 905 opt: 1003 Z-score: 1293.6 bits: 256.3 E(): 1.5e-64
trans. Smith-Waterman score: 1003; 66.1% identity (86.2% similar) in 218 aa overlap (1-218:154780-154130)
Entrez lookup Re-search database General re-search
>>AK126952 - Homo sapiens cDNA FLJ45005 fis, clone BRAWH3011907 (3674 aa)
Frame: r initn: 378 init1: 156 opt: 415 Z-score: 541.6 bits: 111.4 E(): 1.2e-22
trans. Smith-Waterman score: 415; 40.0% identity (68.6% similar) in 220 aa overlap (2-218:1595-961)
Entrez lookup Re-search database General re-search
>>AC010093 - Homo sapiens BAC clone RP11-323O5 from 2, complete (160930 aa)
Frame: r initn: 692 init1: 214 opt: 331 Z-score: 408.6 bits: 92.2 E(): 3e-15
trans. Smith-Waterman score: 461; 37.0% identity (51.7% similar) in 327 aa overlap (3-218:100065-99099)
Entrez lookup Re-search database General re-search
>>AC000032 - Homo sapiens Chromosome 1p13.3 Cosmid Clone cgtm12 (30140 aa)
Frame: r initn: 1067 init1: 202 opt: 223 Z-score: 276.0 bits: 65.3 E(): 7.3e-08
trans. Smith-Waterman score: 336; 51.9% identity (67.3% similar) in 104 aa overlap (115-189:23726-23418)
Entrez lookup Re-search database General re-search
>>AC195273 - Pan troglodytes BAC clone CH251-397N13 from chromo (189196 aa)
Frame: r initn: 649 init1: 202 opt: 227 Z-score: 270.5 bits: 66.9 E(): 1.5e-07
trans. Smith-Waterman score: 327; 53.1% identity (67.3% similar) in 98 aa overlap (121-189:28212-27919)
Entrez lookup Re-search database General re-search
>>AC000031 - Homo sapiens Chromosome 1p13.3 Cosmid Clone ctgm1, (38703 aa)
Frame: r initn: 793 init1: 201 opt: 216 Z-score: 265.3 bits: 63.7 E(): 2.9e-07
trans. Smith-Waterman score: 308; 47.6% identity (63.8% similar) in 105 aa overlap (115-189:29768-29457)
Entrez lookup Re-search database General re-search
>>AC208868 - Homo sapiens FOSMID clone ABC9-41289500I16 from ch (41047 aa)
Frame: r initn: 656 init1: 201 opt: 216 Z-score: 265.0 bits: 63.7 E(): 3e-07
trans. Smith-Waterman score: 308; 47.6% identity (63.8% similar) in 105 aa overlap (115-189:38059-37748)
Entrez lookup Re-search database General re-search
>>AC092859 - Pan troglodytes clone rp43-111m15, complete sequen (175281 aa)
Frame: r initn: 1023 init1: 207 opt: 222 Z-score: 264.4 bits: 65.7 E(): 3.3e-07
trans. Smith-Waterman score: 314; 48.6% identity (63.8% similar) in 105 aa overlap (115-189:26961-26650)
Entrez lookup Re-search database General re-search
>>AC002520 - Homo sapiens Chromosome 1p13 Cosmid Clone m5-3, co (11899 aa)
Frame: r initn: 1074 init1: 192 opt: 207 Z-score: 260.4 bits: 61.1 E(): 5.5e-07
trans. Smith-Waterman score: 331; 52.9% identity (66.3% similar) in 104 aa overlap (115-189:5278-4970)
Entrez lookup Re-search database General re-search
>>AC092860 - Pan troglodytes clone rp43-125j14, complete sequen (160656 aa)
Frame: r initn: 796 init1: 187 opt: 203 Z-score: 239.8 bits: 61.0 E(): 7.6e-06
trans. Smith-Waterman score: 240; 44.9% identity (59.2% similar) in 98 aa overlap (121-189:70421-70127)
Entrez lookup Re-search database General re-search
>>AL158847 - Human DNA sequence from clone RP4-735C1 on chromos (133451 aa)
Frame: r initn: 794 init1: 187 opt: 200 Z-score: 237.0 bits: 60.2 E(): 1.1e-05
trans. Smith-Waterman score: 243; 30.1% identity (38.8% similar) in 312 aa overlap (51-189:26405-25472)
Entrez lookup Re-search database General re-search
>>AK125820 - Homo sapiens cDNA FLJ43832 fis, clone TESTI4005543 (4208 aa)
Frame: r initn: 176 init1: 120 opt: 122 Z-score: 154.4 bits: 40.0 E(): 0.44
trans. Smith-Waterman score: 122; 56.7% identity (80.0% similar) in 30 aa overlap (90-119:380-291)
Entrez lookup Re-search database General re-search
218 residues in 1 query sequences
5491101432 residues in 507455 library sequences
Tcomplib [35.04] (4 proc)
start: Fri Oct 24 16:10:08 2008 done: Fri Oct 24 16:12:50 2008
Total Scan time: 371.600 Total Display time: 0.450
Function used was TFASTX [version 35.04 Oct. 7, 2008]