BLASTX nr result
ID: Atropa21_contig00005544
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00005544 (1151 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value

ref|XP_004252438.1| PREDICTED: uncharacterized protein LOC101262...   359   1e-96
ref|XP_006383287.1| hypothetical protein POPTR_0005s13880g [Popu...   187   5e-45
ref|XP_006374744.1| hypothetical protein POPTR_0014s00420g [Popu...   177   7e-42
ref|XP_006377635.1| hypothetical protein POPTR_0011s09120g [Popu...   169   1e-39
gb|EOY09317.1| Uncharacterized protein TCM_024740 [Theobroma cacao]   136   2e-29
ref|XP_006381633.1| hypothetical protein POPTR_0006s14515g [Popu...   132   2e-28
ref|XP_006384435.1| hypothetical protein POPTR_0004s15060g [Popu...   125   3e-26
gb|AAM23241.1|AC092553_7 Putative transposase [Oryza sativa Japo...    94   1e-16
ref|NP_001055055.2| Os05g0269800 [Oryza sativa Japonica Group] g...    91   7e-16
gb|AAV43964.1| putative polyprotein [Oryza sativa Japonica Group]      91   7e-16
ref|XP_006381919.1| hypothetical protein POPTR_0006s21120g [Popu...    88   6e-15
gb|AAV43825.1| putative polyprotein [Oryza sativa Japonica Group]      88   6e-15
gb|AAV44105.1| unknown protein [Oryza sativa Japonica Group]           88   6e-15
gb|ABA98143.2| transposon protein, putative, CACTA, En/Spm sub-c...    87   1e-14
ref|XP_006347741.1| PREDICTED: uncharacterized protein LOC102581...    87   1e-14
gb|EOY08532.1| Uncharacterized protein isoform 3 [Theobroma caca...    87   1e-14
gb|EOY08531.1| Uncharacterized protein isoform 2 [Theobroma cacao]     87   1e-14
gb|EOY08530.1| Uncharacterized protein isoform 1 [Theobroma cacao]     87   1e-14
gb|ABA96347.1| transposon protein, putative, CACTA, En/Spm sub-c...    87   1e-14
gb|EXB95722.1| hypothetical protein L484_007472 [Morus notabilis]      86   3e-14

>ref|XP_004252438.1| PREDICTED: uncharacterized protein LOC101262394 [Solanum lycopersicum] Length = 530 Score = 359 bits (921), Expect = 1e-96 Identities = 190/260 (73%), Positives = 210/260 (80%), Gaps = 2/260 (0%) Frame = -3 Query: 1119 GKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKH 940 GK+GEKQ RW+KPMEYLMLEILAD+VKQGNK TN+FK ISFNRVS+AIN+QLGMDCS KH Sbjct: 213 GKKGEKQFRWSKPMEYLMLEILADQVKQGNKSTNKFKVISFNRVSNAINEQLGMDCSLKH 272 Query: 939 VQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDM 760 V+N+ K LRST N VQTLLNKSGLGWDDNLKMITASPRVY+ +IQA+ +HDKFI KKIDM Sbjct: 273 VENHHKTLRSTWNIVQTLLNKSGLGWDDNLKMITASPRVYAMHIQAHPSHDKFIKKKIDM 332 Query: 759 CEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPSKEKEV--VFETSQGKASH 586 EEMSLVCG D ARGD AKSF+DI LD SSEK N+ +IEGPSKE V V ETSQ K+S Sbjct: 333 FEEMSLVCGNDRARGDCAKSFEDIGLDCSSEKGNEDEIEGPSKENGVQDVSETSQVKSSR 392 Query: 585 KRNYSCDALDVIGDISIKLGEVAAAINKIADNRLDVTRLXXXXXXXXXXXXEFLGDAFVY 406 KRN + DV+GDIS KLGEV A I+KIADNRLDVT L +FLGDAF Y Sbjct: 393 KRNRHSNVQDVVGDISTKLGEVVATISKIADNRLDVTSLYEEVMAIEGYGEDFLGDAFDY 452 Query: 405 LVQSDTLAKAFMAKNQNLRK 346 LVQSDTLAK MAKNQNLRK Sbjct: 453 LVQSDTLAKVLMAKNQNLRK 472 >ref|XP_006383287.1| hypothetical protein POPTR_0005s13880g [Populus trichocarpa] gi|550338877|gb|ERP61084.1| hypothetical protein POPTR_0005s13880g [Populus trichocarpa] Length = 266 Score = 187 bits (476), Expect = 5e-45 Identities = 105/264 (39%), Positives = 154/264 (58%), Gaps = 5/264 (1%) Frame = -3 Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K W+KPM +++L+IL +E +GNK ++ FKA SF V C KH+ N+L Sbjct: 12 KHFTWSKPMSHMLLKILVEEALKGNKPSSTFKAKSFFNVQ----------CEPKHMDNHL 61 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMS 745 KI++ + L NKSG GWDD LKMIT S VY ++A+L HDK++NKK+DM E M Sbjct: 62 KIVKKELGIITKLKNKSGFGWDDCLKMITVSKDVYDEEVKAHLNHDKYLNKKLDMYEAMI 121 Query: 744 
LVCGKDLARGDYAKSFDDISLDRSSEK-----DNDVDIEGPSKEKEVVFETSQGKASHKR 580 +V GK++ +Y KS+ DI+L+ ++E +N+ + E S+ KE ++Q + KR Sbjct: 122 IVVGKNMVTRNYIKSYADINLEENTEVQSISIENEGEYEETSRGKETSSSSAQKRQHKKR 181 Query: 579 NYSCDALDVIGDISIKLGEVAAAINKIADNRLDVTRLXXXXXXXXXXXXEFLGDAFVYLV 400 N + D + +S K+G+VA I + N+L+V L LGDAF +LV Sbjct: 182 NRMYED-DSVEKLSTKIGDVAFVIQSLRKNQLNVNELYIEVMKIKGFEEIALGDAFDHLV 240 Query: 399 QSDTLAKAFMAKNQNLRKVWLKRF 328 Q+ LAKAFM K NLRK+W++ F Sbjct: 241 QNKMLAKAFMKKYDNLRKIWVQNF 264 >ref|XP_006374744.1| hypothetical protein POPTR_0014s00420g [Populus trichocarpa] gi|550323003|gb|ERP52541.1| hypothetical protein POPTR_0014s00420g [Populus trichocarpa] Length = 260 Score = 177 bits (449), Expect = 7e-42 Identities = 103/267 (38%), Positives = 155/267 (58%), Gaps = 5/267 (1%) Frame = -3 Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K W+KPM +++LEIL +E +G+K ++ FKA SF +V+ I+Q+ + C KH Sbjct: 7 KHFTWSKPMSHMLLEILVEEALKGSKPSSTFKAESFIKVAIEISQKFNVQCKPKH----- 61 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMS 745 + L NKSG GWDD LKMIT S VY + KF+NKK+DM E M+ Sbjct: 62 -------GIITKLKNKSGFGWDDCLKMITISKDVYDEEV-------KFLNKKLDMYEAMA 107 Query: 744 LVCGKDLARGDYAKSFDDISLDRSSEK-----DNDVDIEGPSKEKEVVFETSQGKASHKR 580 ++ GKD+A G+YAKS+ D++++ ++E+ +N+ + E SK KE ++Q + KR Sbjct: 108 IIVGKDIATGNYAKSYADVNMEENTEEQSISIENEGEYEETSKGKETSSSSTQKRQHRKR 167 Query: 579 NYSCDALDVIGDISIKLGEVAAAINKIADNRLDVTRLXXXXXXXXXXXXEFLGDAFVYLV 400 N + D + +S ++G+V AI ++ N+LDV L LG+AF +LV Sbjct: 168 NRMYED-DGVEKLSKQIGDVELAIQSLSKNQLDVNALYAEVMKIEGFDEITLGEAFDHLV 226 Query: 399 QSDTLAKAFMAKNQNLRKVWLKRFKRQ 319 Q+ LAKAFMAKN NLRK+ ++ F Q Sbjct: 227 QNKMLAKAFMAKNANLRKIGVQNFVNQ 253 >ref|XP_006377635.1| hypothetical protein POPTR_0011s09120g [Populus trichocarpa] gi|550327980|gb|ERP55432.1| hypothetical protein POPTR_0011s09120g [Populus trichocarpa] Length = 234 Score = 169 bits (429), Expect = 1e-39 Identities = 98/257 (38%), Positives = 143/257 (55%), Gaps = 1/257 (0%) Frame = -3 
Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K W+KPM +++LEIL +E +GNK ++ FKA SF +V+ I+Q + C KHV N+L Sbjct: 7 KHFTWSKPMSHMLLEILVEEAFKGNKTSSTFKAESFVKVATKISQNFNVQCESKHVDNHL 66 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFIN-KKIDMCEEM 748 K ++ + L NKSG WDD LKMIT S VY +A+ HDK++N KK+D+ E M Sbjct: 67 KTVKKEWGIITQLKNKSGFSWDDCLKMITVSKDVYD--EEAHPNHDKYLNKKKLDIYEAM 124 Query: 747 SLVCGKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPSKEKEVVFETSQGKASHKRNYSC 568 ++V GKD+A G+YAKS+ DI+L + +++++ S E E +E + Sbjct: 125 TIVVGKDMATGNYAKSYADINL------EENIEVQSISIENEGEYEET------------ 166 Query: 567 DALDVIGDISIKLGEVAAAINKIADNRLDVTRLXXXXXXXXXXXXEFLGDAFVYLVQSDT 388 A AI ++ N+LDV L L DAF +L+Q++ Sbjct: 167 --------------TKAFAIQSLSKNQLDVNELYTEVMKVEGFEEIALDDAFDHLIQNEM 212 Query: 387 LAKAFMAKNQNLRKVWL 337 LAKAFMAKN N RK+W+ Sbjct: 213 LAKAFMAKNANFRKIWI 229 >gb|EOY09317.1| Uncharacterized protein TCM_024740 [Theobroma cacao] Length = 164 Score = 136 bits (342), Expect = 2e-29 Identities = 73/145 (50%), Positives = 96/145 (66%), Gaps = 7/145 (4%) Frame = -3 Query: 1056 LADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNLKILRST*NTVQTLLNK 877 L D ++GNK +N F A S+ RV AIN++ + C HV+N+L+I+++T NTVQ +L K Sbjct: 18 LTDGAQKGNKPSNVFNASSYIRVLQAINEKFNVQCKTNHVENHLRIVKNTSNTVQNVLAK 77 Query: 876 SGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMSLVCGKDLARGDYAKSF 697 SG GWDDNLKMITA +VY +A+L H+ FINKKIDM EM+LV GKD+A +AKSF Sbjct: 78 SGFGWDDNLKMITADRQVYE--DEAHLKHEPFINKKIDMFNEMTLVVGKDMATESFAKSF 135 Query: 696 DDISLDRSSEK-------DNDVDIE 643 DI ++E D DVD E Sbjct: 136 ADIDFQTNTEANAMLVDLDKDVDEE 160 >ref|XP_006381633.1| hypothetical protein POPTR_0006s14515g [Populus trichocarpa] gi|550336341|gb|ERP59430.1| hypothetical protein POPTR_0006s14515g [Populus trichocarpa] Length = 177 Score = 132 bits (333), Expect = 2e-28 Identities = 69/166 (41%), Positives = 107/166 (64%), Gaps = 5/166 (3%) Frame = -3 Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K W+KPM +++LEILA+E + +K ++ FKA SF ++ 
I+Q+ HV N+L Sbjct: 12 KHFTWSKPMSHMLLEILAEEALKRSKPSSTFKAESFVELATEISQKFN------HVNNHL 65 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMS 745 K ++ + L NKSG GWDD LKMIT S VY+ ++A+ HDK++NKK+DM E MS Sbjct: 66 KTMKKEWGIITKLKNKSGFGWDDCLKMITVSKDVYNEELKAHPNHDKYLNKKLDMYEAMS 125 Query: 744 LVCGKDLARGDYAKSFDDISLDRSSEK-----DNDVDIEGPSKEKE 622 +V GKD+ +YAKS+ D++L+ ++++ +N+ + E SK KE Sbjct: 126 IVVGKDMTTRNYAKSYIDVNLEENTDEQLISIENEGEYEETSKRKE 171 >ref|XP_006384435.1| hypothetical protein POPTR_0004s15060g [Populus trichocarpa] gi|550341053|gb|ERP62232.1| hypothetical protein POPTR_0004s15060g [Populus trichocarpa] Length = 154 Score = 125 bits (314), Expect = 3e-26 Identities = 62/147 (42%), Positives = 92/147 (62%) Frame = -3 Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K W+KPM ++ K ++ FKA F +V+ I+Q+ + C KHV N+L Sbjct: 21 KHFTWSKPMSHI-------------KPSSTFKAECFVKVATEISQKFNVQCEPKHVDNHL 67 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMS 745 K ++ + L NKSG GWDD LKMIT S VY ++A+ HDKF+NKK+DM E M+ Sbjct: 68 KTVKKEWGIITKLKNKSGFGWDDCLKMITVSKDVYDEEVKAHPNHDKFLNKKLDMYEAMT 127 Query: 744 LVCGKDLARGDYAKSFDDISLDRSSEK 664 +V GKD+A G+YAKS+ D++L+ ++E+ Sbjct: 128 IVLGKDMATGNYAKSYADVNLEENNEE 154 >gb|AAM23241.1|AC092553_7 Putative transposase [Oryza sativa Japonica Group] gi|21326484|gb|AAM47612.1|AC122147_1 Putative transposase [Oryza sativa Japonica Group] gi|110288571|gb|ABB46678.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa Japonica Group] Length = 535 Score = 93.6 bits (231), Expect = 1e-16 Identities = 58/215 (26%), Positives = 101/215 (46%), Gaps = 2/215 (0%) Frame = -3 Query: 1122 TGKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDK 943 +GK G WT M ML LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 261 SGKGGSTHASWTSAMSSFMLSHLANVVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGE 320 Query: 942 HVQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKID 763 ++N+LK + + + L S GWD+ +IT Y+ YI+ + + NK + Sbjct: 321 
QIKNHLKTWQRKFSKINRLRKVSAAGWDEKNFIITLDDEHYNGYIEDHKADANYFNKPLA 380 Query: 762 MCEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + DND + +GP+ + +S K Sbjct: 381 HYGEMLTIFGSTMATGKYAKDSSSVLGTEDVQDDNDEENDGPATTDDRAEASSASKPKKA 440 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 + + +IG + ++A+AI K+A DN+L Sbjct: 441 KTQENEDDGLIGAFTSVGDKLASAILKVAEPDNKL 475 >ref|NP_001055055.2| Os05g0269800 [Oryza sativa Japonica Group] gi|255676197|dbj|BAF16969.2| Os05g0269800 [Oryza sativa Japonica Group] Length = 529 Score = 91.3 bits (225), Expect = 7e-16 Identities = 57/215 (26%), Positives = 100/215 (46%), Gaps = 2/215 (0%) Frame = -3 Query: 1122 TGKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDK 943 +GK G WT M ML LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 255 SGKGGSTHASWTSAMSSFMLSHLANVVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGE 314 Query: 942 HVQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKID 763 ++N+LK + + + L S GWD+ +IT Y+ Y + + + NK + Sbjct: 315 QIKNHLKTWQRKFSKINRLRKVSAAGWDEKNFIITLDDEHYNGYTEDHKADADYFNKPLA 374 Query: 762 MCEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + DND + +GP+ + +S K Sbjct: 375 HYGEMLTIFGSTMATGKYAKDSSSVLGTEDVQDDNDEENDGPATTDDRAEASSASKPKKA 434 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 + + +IG + ++A+AI K+A DN+L Sbjct: 435 KTQENEDDGLIGAFTSVGDKLASAILKVAEPDNKL 469 >gb|AAV43964.1| putative polyprotein [Oryza sativa Japonica Group] Length = 561 Score = 91.3 bits (225), Expect = 7e-16 Identities = 57/215 (26%), Positives = 100/215 (46%), Gaps = 2/215 (0%) Frame = -3 Query: 1122 TGKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDK 943 +GK G WT M ML LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 287 SGKGGSTHASWTSAMSSFMLSHLANVVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGE 346 Query: 942 HVQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKID 763 ++N+LK + + + L S GWD+ +IT Y+ Y + + + NK + Sbjct: 347 QIKNHLKTWQRKFSKINRLRKVSAAGWDEKNFIITLDDEHYNGYTEDHKADADYFNKPLA 406 Query: 762 
MCEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + DND + +GP+ + +S K Sbjct: 407 HYGEMLTIFGSTMATGKYAKDSSSVLGTEDVQDDNDEENDGPATTDDRAEASSASKPKKA 466 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 + + +IG + ++A+AI K+A DN+L Sbjct: 467 KTQENEDDGLIGAFTSVGDKLASAILKVAEPDNKL 501 >ref|XP_006381919.1| hypothetical protein POPTR_0006s21120g [Populus trichocarpa] gi|550336771|gb|ERP59716.1| hypothetical protein POPTR_0006s21120g [Populus trichocarpa] Length = 112 Score = 88.2 bits (217), Expect = 6e-15 Identities = 43/95 (45%), Positives = 61/95 (64%) Frame = -3 Query: 1104 KQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNL 925 K +KPM +++LEIL +E +G+K ++ FKA SF +V+ I+Q+ + C KHV N+L Sbjct: 12 KHFTLSKPMSHMLLEILTEEALKGSKPSSTFKAESFVKVATEISQKFNVQCEPKHVDNHL 71 Query: 924 KILRST*NTVQTLLNKSGLGWDDNLKMITASPRVY 820 K ++ + L NKSG GWDD LKMIT S VY Sbjct: 72 KTVKKEWGIITKLKNKSGFGWDDCLKMITVSKDVY 106 >gb|AAV43825.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1067 Score = 88.2 bits (217), Expect = 6e-15 Identities = 58/215 (26%), Positives = 100/215 (46%), Gaps = 3/215 (1%) Frame = -3 Query: 1119 GKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKH 940 GK G WT M ML+ LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 302 GKGGSTHASWTSAMSSFMLKHLANLVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGEQ 361 Query: 939 VQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDM 760 ++N+LK + + L S GWD+ +IT Y+ YI+ + + NK + Sbjct: 362 IKNHLKTWQRKFTKINRLRKVSAAGWDEKNFIITLDDEHYNGYIEDHKADADYFNKPLAH 421 Query: 759 CEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDND-VDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + +ND + +GP+ + +S K Sbjct: 422 YGEMLTIFGSTMATGKYAKDSSSVLGTEDVQTENDEEENDGPATTDDRAEASSASKPKKA 481 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 R + +IG + ++A+AI K+A DN+L Sbjct: 482 RTQEIEDDGLIGAFTSVGDKLASAILKVAEPDNKL 516 >gb|AAV44105.1| unknown protein [Oryza sativa Japonica Group] Length = 1220 Score = 88.2 bits (217), Expect = 6e-15 Identities = 58/215 (26%), Positives = 
100/215 (46%), Gaps = 3/215 (1%) Frame = -3 Query: 1119 GKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKH 940 GK G WT M ML+ LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 455 GKGGSTHASWTSAMSSFMLKHLANLVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGEQ 514 Query: 939 VQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDM 760 ++N+LK + + L S GWD+ +IT Y+ YI+ + + NK + Sbjct: 515 IKNHLKTWQRKFTKINRLRKVSAAGWDEKNFIITLDDEHYNGYIEDHKADADYFNKPLAH 574 Query: 759 CEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDND-VDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + +ND + +GP+ + +S K Sbjct: 575 YGEMLTIFGSTMATGKYAKDSSSVLGTEDVQTENDEEENDGPATTDDRAEASSASKPKKA 634 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 R + +IG + ++A+AI K+A DN+L Sbjct: 635 RTQEIEDDGLIGAFTSVGDKLASAILKVAEPDNKL 669 >gb|ABA98143.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa Japonica Group] Length = 581 Score = 87.4 bits (215), Expect = 1e-14 Identities = 58/215 (26%), Positives = 101/215 (46%), Gaps = 3/215 (1%) Frame = -3 Query: 1119 GKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKH 940 GK G WT M ML+ LA+ V G + ++ FKA+ N + A+N++ + + Sbjct: 307 GKGGSTHASWTSAMSSFMLKHLANLVAGGTRTSSGFKAVHLNACARAVNERFNSTLTGEQ 366 Query: 939 VQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDM 760 ++N+LK + + L S GWD+ +IT Y+ YI+ + + NK + Sbjct: 367 IKNHLKTWQRKFTKINRLRKVSAAGWDEKNIIITLDDEHYNGYIEDHKADADYFNKPLAH 426 Query: 759 CEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDND-VDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + + + +ND + +GP+ + +S K Sbjct: 427 YGEMLTIFGSTMATGKYAKDSNSVLGTEDVQTENDEEENDGPATTDDRGEASSASKPKKA 486 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 R + +IG + ++A+AI K+A DN+L Sbjct: 487 RTQEIEDDGLIGAFTSVGDKLASAILKVAEPDNKL 521 >ref|XP_006347741.1| PREDICTED: uncharacterized protein LOC102581412 [Solanum tuberosum] Length = 339 Score = 87.0 bits (214), Expect = 1e-14 Identities = 73/294 (24%), Positives = 126/294 (42%), Gaps = 32/294 (10%) Frame = -3 Query: 1116 KEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHV 
937 K K + W+ M+ ++E L+ + + GNK+ F ++N A+N + +++ V Sbjct: 41 KHKGKNVVWSPAMDKCLIEALSIQARNGNKVDKCFNENAYNAACVAVNSHFSLSLNNQKV 100 Query: 936 QNNLKILRST*NTVQTLLNKSGLGWDDNLKMITA-SPRVYSFYIQAYLTHDKFINKKIDM 760 N LK ++ NT++ +L++ G W+ N I ++ Y+ A+ F K+I M Sbjct: 101 VNRLKTIKKRYNTIRNILSQEGFSWNPNTNTIDCEDDDLWKRYVAAHPDARTFRGKQITM 160 Query: 759 CEEMSLVCGKDLARGDYAKSFDDIS----------------LDRSSEKDNDVD-IEGPSK 631 EEM +VCG A +A+ ++ L SSE ND D E S Sbjct: 161 YEEMKIVCGNYQAHSRWARMPGKVNGNPVIECKYEQESASYLSASSEHMNDSDGTETQSS 220 Query: 630 EKEVVF--------------ETSQGKASHKRNYSCDALDVIGDISIKLGEVAAAINKIAD 493 KE V+ +G+A+ + S D + I+ + +A I + + Sbjct: 221 AKEPVYTEMLANNEDEDEPEAQPEGQAAKRTRSSETLQDAMLAIASSIRHLADTIEQ-SK 279 Query: 492 NRLDVTRLXXXXXXXXXXXXEFLGDAFVYLVQSDTLAKAFMAKNQNLRKVWLKR 331 +D L AF +L + T A+AFMA N+ LR+++L R Sbjct: 280 YTIDTPALLQAVMEIEGLEESKQMYAFEFLNEDPTKARAFMAYNRRLRRIYLFR 333 >gb|EOY08532.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508716636|gb|EOY08533.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508716637|gb|EOY08534.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 339 Score = 87.0 bits (214), Expect = 1e-14 Identities = 70/287 (24%), Positives = 130/287 (45%), Gaps = 25/287 (8%) Frame = -3 Query: 1101 QLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNLK 922 ++ WT M+ LE++ D+V +GNK+ K ++ + N + G+ S ++N K Sbjct: 56 KIDWTPTMDQYFLELMLDQVHKGNKVGCTLKKKAWVSMITLFNAKFGLQHSRAVLKNRYK 115 Query: 921 ILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMSL 742 ILRS +++TLL + G WD+ KM+ A RV++ Y++ + +F NK + ++M + Sbjct: 116 ILRSQYASIKTLLTEKGFHWDETQKMVIADDRVWNKYVKEHPEFRRFKNKSMPCYDDMCI 175 Query: 741 VC-----------------------GKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPS- 634 +C GKD+ G ++ +I + + I G Sbjct: 176 ICCNESTSAETRILQCNMSSENGTPGKDI--GGRSEPTINIKVAKKVHDKVPAPIVGSKL 233 Query: 633 KEKEVVFETSQGKASHKRNYSCDALDVIGDISIKLGEVAAAINKIADNRLDVT-RLXXXX 457 +E++ ++ + SH+ S D + + ++ V +I + +N T R+ Sbjct: 234 QEQQNKHQSQMPRTSHQPKRSRSEEDAMANAVREMAFVVTSIKRKKENENAPTRRVIEEL 293 
Query: 456 XXXXXXXXEFLGDAFVYLVQSDTLAKAFMAKNQNLRKVWLKRFKRQQ 316 + L DA +L + D A+ F+A + +LRK WL R R Q Sbjct: 294 QAIPGIDDDLLLDACDFL-EDDRRARMFLALDASLRKKWLMRKLRPQ 339 >gb|EOY08531.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 454 Score = 87.0 bits (214), Expect = 1e-14 Identities = 70/287 (24%), Positives = 130/287 (45%), Gaps = 25/287 (8%) Frame = -3 Query: 1101 QLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNLK 922 ++ WT M+ LE++ D+V +GNK+ K ++ + N + G+ S ++N K Sbjct: 171 KIDWTPTMDQYFLELMLDQVHKGNKVGCTLKKKAWVSMITLFNAKFGLQHSRAVLKNRYK 230 Query: 921 ILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMSL 742 ILRS +++TLL + G WD+ KM+ A RV++ Y++ + +F NK + ++M + Sbjct: 231 ILRSQYASIKTLLTEKGFHWDETQKMVIADDRVWNKYVKEHPEFRRFKNKSMPCYDDMCI 290 Query: 741 VC-----------------------GKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPS- 634 +C GKD+ G ++ +I + + I G Sbjct: 291 ICCNESTSAETRILQCNMSSENGTPGKDI--GGRSEPTINIKVAKKVHDKVPAPIVGSKL 348 Query: 633 KEKEVVFETSQGKASHKRNYSCDALDVIGDISIKLGEVAAAINKIADNRLDVT-RLXXXX 457 +E++ ++ + SH+ S D + + ++ V +I + +N T R+ Sbjct: 349 QEQQNKHQSQMPRTSHQPKRSRSEEDAMANAVREMAFVVTSIKRKKENENAPTRRVIEEL 408 Query: 456 XXXXXXXXEFLGDAFVYLVQSDTLAKAFMAKNQNLRKVWLKRFKRQQ 316 + L DA +L + D A+ F+A + +LRK WL R R Q Sbjct: 409 QAIPGIDDDLLLDACDFL-EDDRRARMFLALDASLRKKWLMRKLRPQ 454 >gb|EOY08530.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 494 Score = 87.0 bits (214), Expect = 1e-14 Identities = 70/287 (24%), Positives = 130/287 (45%), Gaps = 25/287 (8%) Frame = -3 Query: 1101 QLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNLK 922 ++ WT M+ LE++ D+V +GNK+ K ++ + N + G+ S ++N K Sbjct: 211 KIDWTPTMDQYFLELMLDQVHKGNKVGCTLKKKAWVSMITLFNAKFGLQHSRAVLKNRYK 270 Query: 921 ILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMSL 742 ILRS +++TLL + G WD+ KM+ A RV++ Y++ + +F NK + ++M + Sbjct: 271 ILRSQYASIKTLLTEKGFHWDETQKMVIADDRVWNKYVKEHPEFRRFKNKSMPCYDDMCI 330 Query: 741 VC-----------------------GKDLARGDYAKSFDDISLDRSSEKDNDVDIEGPS- 634 +C GKD+ G ++ +I 
+ + I G Sbjct: 331 ICCNESTSAETRILQCNMSSENGTPGKDI--GGRSEPTINIKVAKKVHDKVPAPIVGSKL 388 Query: 633 KEKEVVFETSQGKASHKRNYSCDALDVIGDISIKLGEVAAAINKIADNRLDVT-RLXXXX 457 +E++ ++ + SH+ S D + + ++ V +I + +N T R+ Sbjct: 389 QEQQNKHQSQMPRTSHQPKRSRSEEDAMANAVREMAFVVTSIKRKKENENAPTRRVIEEL 448 Query: 456 XXXXXXXXEFLGDAFVYLVQSDTLAKAFMAKNQNLRKVWLKRFKRQQ 316 + L DA +L + D A+ F+A + +LRK WL R R Q Sbjct: 449 QAIPGIDDDLLLDACDFL-EDDRRARMFLALDASLRKKWLMRKLRPQ 494 >gb|ABA96347.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa Japonica Group] Length = 572 Score = 87.0 bits (214), Expect = 1e-14 Identities = 58/215 (26%), Positives = 100/215 (46%), Gaps = 3/215 (1%) Frame = -3 Query: 1119 GKEGEKQLRWTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKH 940 GK G WT M ML+ LA+ V G ++ FKA+ N + A+N++ + + Sbjct: 298 GKGGSTHASWTSAMSSFMLKHLANLVAGGTSTSSGFKAVHLNACARAVNKRFNSTLTGEQ 357 Query: 939 VQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDM 760 ++N+LK + + L S GWD+ +IT Y+ YI+ + + NK + Sbjct: 358 IKNHLKTWQRKFTKINRLRKVSAAGWDEKNFIITLDDEHYNGYIEDHKADADYFNKPLAH 417 Query: 759 CEEMSLVCGKDLARGDYAKSFDDISLDRSSEKDND-VDIEGPSKEKEVVFETSQGKASHK 583 EM + G +A G YAK + +++ND + +GP+ + +S K Sbjct: 418 YGEMLTIFGSTMATGKYAKDSSSVLGTEDVQEENDEEENDGPATTDDRPEASSASKPKKA 477 Query: 582 RNYSCDALDVIGDISIKLGEVAAAINKIA--DNRL 484 R + +IG + ++A+AI K+A DN+L Sbjct: 478 RTQEIEDDGLIGAFTSVGDKLASAILKVAEPDNKL 512 >gb|EXB95722.1| hypothetical protein L484_007472 [Morus notabilis] Length = 467 Score = 85.9 bits (211), Expect = 3e-14 Identities = 60/266 (22%), Positives = 122/266 (45%), Gaps = 7/266 (2%) Frame = -3 Query: 1092 WTKPMEYLMLEILADEVKQGNKLTNQFKAISFNRVSDAINQQLGMDCSDKHVQNNLKILR 913 W PM+ ++++ D+V++G+++ F+ ++ + A N + G ++N K LR Sbjct: 207 WQPPMDRYFIDVMMDQVQKGSRIDGVFRKQAWMEMIAAFNAKFGFSYDMDVLKNRYKTLR 266 Query: 912 ST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKIDMCEEMSLVCG 733 N ++ LL+ G WDD +M+TA V+ YI+A+ +F+ + + +E+ ++C Sbjct: 267 RQYNVIKNLLDLDGFVWDDTRQMVTADDYVWQDYIKAHTDARQFMTRPVPYYKELCVICD 326 Query: 732 
KDLARGDYAKSFD-DISLDRSSEKDNDVDIEGPSKEKEVVFETSQGKASHKRNYSCDALD 556 + + D D D + + SK+ + E ++ KR+ D Sbjct: 327 PSSDERECSSGQDLDQQNDEDDARSPATSVSNGSKKNKRQLENLYCLSNSKRSRDND--- 383 Query: 555 VIGDISIKLGEVAAAINKIAD------NRLDVTRLXXXXXXXXXXXXEFLGDAFVYLVQS 394 ++ L E+A+A++ ++D N + + + + + DA L++ Sbjct: 384 --DGMASALREMASAVSSLSDKRKNDENSIPIENVMKAVQALPDMDEDLVLDA-CDLLED 440 Query: 393 DTLAKAFMAKNQNLRKVWLKRFKRQQ 316 + AK FMA + LR+ WL R R + Sbjct: 441 EKKAKTFMALDVKLRRKWLLRKLRPE 466 Score = 57.8 bits (138), Expect = 8e-06 Identities = 33/129 (25%), Positives = 64/129 (49%), Gaps = 3/129 (2%) Frame = -3 Query: 1116 KEGEKQLR--WTKPMEYLMLEILADEVKQGNKLTNQ-FKAISFNRVSDAINQQLGMDCSD 946 + G +LR WT M+ ++++ ++V +GNK + F ++ ++ N + Sbjct: 6 RSGSDRLRTVWTPEMDRYFVDLMLEQVNKGNKFDDHLFSKRAWKHMTSLFNSKFKFQYEK 65 Query: 945 KHVQNNLKILRST*NTVQTLLNKSGLGWDDNLKMITASPRVYSFYIQAYLTHDKFINKKI 766 ++N K LR+ V+ LL+++G WDD +M+TA V+ YI+ + F K I Sbjct: 66 DVLKNRHKTLRNLYKAVKNLLDQTGFSWDDTRQMVTADNDVWDEYIKVHPDARSFRIKTI 125 Query: 765 DMCEEMSLV 739 ++ L+ Sbjct: 126 PHYNDLCLI 134