BLASTX nr result
ID: Ephedra26_contig00005180
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00005180
         (1963 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsi...   119   6e-24
gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsi...   114   1e-22
gb|EFA05312.1| hypothetical protein TcasGA2_TC015470 [Tribolium ...   113   3e-22
gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-ty...   108   8e-21
emb|CAN61272.1| hypothetical protein VITISV_039063 [Vitis vinifera]   107   2e-20
ref|XP_003629120.1| Serine/threonine protein kinase SRPK1 [Medic...   106   4e-20
gb|EFA12557.1| hypothetical protein TcasGA2_TC005030 [Tribolium ...   106   4e-20
gb|ACI62137.1| polyprotein [Drosophila melanogaster]                  106   4e-20
gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Ar...   104   2e-19
gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...   104   2e-19
emb|CAN71427.1| hypothetical protein VITISV_027864 [Vitis vinifera]   103   2e-19
emb|CAB43904.1| putative protein [Arabidopsis thaliana] gi|72697...   103   2e-19
gb|AAQ01581.1| agCP7521-like protein [Aedes albopictus]               103   3e-19
gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao]   102   5e-19
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   102   5e-19
emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]   102   5e-19
ref|XP_005715938.1| unnamed protein product [Chondrus crispus] g...    79   5e-19
emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera]   102   6e-19
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi...
102 6e-19 emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 102 8e-19 >gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1347 Score = 119 bits (297), Expect = 6e-24 Identities = 76/227 (33%), Positives = 121/227 (53%), Gaps = 7/227 (3%) Frame = +3 Query: 1302 IWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKN 1481 +W +DSGC HMT E +NI K K I+ D++ G G+I ++ + K +KN Sbjct: 325 VWLVDSGCTNHMTKEERYFSNIN-KSIKVPIRVRNGDIVMTAGKGDITVMTRHGKRIIKN 383 Query: 1482 V-LVHRLRRNLISVRKLVMAG--VTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLYLLK 1652 V LV L +NL+SV +++ +G V ++ D + N ++ S + +K Sbjct: 384 VFLVPGLEKNLLSVPQIISSGYWVRFQDKRCIIQDANGKEIMNIEMTDKS------FKIK 437 Query: 1653 SEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPN---TQEICESCAKGK 1823 +V E + + + WH++LGHV+++ L ++ K LP T+E C++C GK Sbjct: 438 LSSVEEEAMTANVQTEETWHKRLGHVSNKRLQQMQDKELVNGLPRFKVTKETCKACNLGK 497 Query: 1824 MSRSPFIS-SNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 SR F S TKT LE++H+D+ GPM QS G RY+V F+DD+ Sbjct: 498 QSRKSFPKESQTKTREKLEIVHTDVCGPMQHQSIDGSRYYVLFLDDY 544 >gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 822 Score = 114 bits (286), Expect = 1e-22 Identities = 78/225 (34%), Positives = 119/225 (52%), Gaps = 7/225 (3%) Frame = +3 Query: 1305 WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKNV 1484 W +DSGC HMT NE + T I ++ K I+ ++ G G+IE++ K +++V Sbjct: 30 WLIDSGCTNHMTPNEKLFTKIN-RDFKVPIRVGNGAVMMSEGKGDIEVMTRKDKRGIRDV 88 Query: 1485 L-VHRLRRNLISVRKLVMAG--VTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLYLLKS 1655 L V +L +NL+SV ++++ G VT++ N HD + ++++ S L L + Sbjct: 89 LLVPKLGKNLLSVPQMIINGYQVTLKNNYCTIHDSARKKIGEVEMVNKSFH---LRWLSN 145 Query: 1656 EAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLP--NTQE-ICESCAKGKM 1826 E E+ + + + + WH++LGH L + K LP N +E CESC K Sbjct: 146 E---ETAMVAKDEATELWHKRLGHTGHSNLKILQSKEMVTGLPKFNVEEGKCESCILSKH 202 Query: 1827 SRSPFIS-SNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDD 1958 SR PF S T+ LELIHSD+ GPM S G RY ++FIDD Sbjct: 203 
SRDPFPKESETRAKHKLELIHSDVCGPMQNSSINGSRYILTFIDD 247 >gb|EFA05312.1| hypothetical protein TcasGA2_TC015470 [Tribolium castaneum] Length = 2375 Score = 113 bits (283), Expect = 3e-22 Identities = 75/237 (31%), Positives = 130/237 (54%), Gaps = 18/237 (7%) Frame = +3 Query: 1305 WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDM-LTIRGFGNIEIIL----PNFKL 1469 W +DSG HM ++ LTN+ E I A+ ++ L G++ +L + Sbjct: 268 WYVDSGATDHMVNSKEHLTNVRKLESPVKICVAKDNVKLLATEIGDVNAVLRVNNTVTRA 327 Query: 1470 QLKNVL-VHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLYL 1646 +KNVL V L+ NL+SV+K+ +A + + + ++ + N K+++ LY Sbjct: 328 TIKNVLYVKNLKHNLLSVQKIELASLNVS-----FEHGKVVIKRNSKVLAEGKRIDNLYE 382 Query: 1647 LKSEAVGE-----SNLANQSKPWKEWHQKLGHVNDRYLNEIYKK--VNGKNLPNTQ---- 1793 + E + SN+ S K WH++LGH++++ L + K V+G N+ N+ Sbjct: 383 ICFEVENKCKVVCSNVCEVSASLKLWHRRLGHLSNKNLVTLSKNNMVSGLNIRNSNCNES 442 Query: 1794 EICESCAKGKMSRSPFIS-SNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 +ICE C K K+++ PF S+ KT+R+LELIHSD+ GP+ +++ G RYF++F+DD+ Sbjct: 443 QICEVCVKSKITKLPFGKRSDNKTTRVLELIHSDLCGPITPETHDGKRYFLTFLDDY 499 >gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-type; Peptidase aspartic, catalytic [Medicago truncatula] Length = 1715 Score = 108 bits (270), Expect = 8e-21 Identities = 85/294 (28%), Positives = 138/294 (46%), Gaps = 11/294 (3%) Frame = +3 Query: 1113 NDPTNPNATELKERAKRKMRPARKEANITEVINPNIVALITEHINEFNKYTPEVNVCIN- 1289 ++P + N + + + ++ K A +E P V + +N+ + VC+ Sbjct: 606 SEPVHQNLIKPESKIPKQKDQKNKAATASEKTIPKGVK--PKVLNDQKPLSIHPKVCLRA 663 Query: 1290 -DNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFK 1466 + W LDSGC HMTG +++ +T K D +K I G G I N Sbjct: 664 REKQRSWYLDSGCSRHMTGEKALFLTLTMK-DGGEVKFGGNQTGKIIGTGTIG----NSS 718 Query: 1467 LQLKNV-LVHRLRRNLISVRKLVMAGVTI---EENLTKYHDDQLRLFYNKKLISTSIGSS 1634 + + NV LV L+ NL+S+ + G + + N T + D + + K + + Sbjct: 719 ISINNVWLVDGLKHNLLSISQFCDNGYDVTFSKTNCTLVNKDDKSITFKGKRVENVYKIN 778 Query: 1635 GLYLLKSEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPN----TQEIC 1802 L + V L+ K W WH++LGH N R 
+++I K K LPN + +C Sbjct: 779 FSDLADQKVV--CLLSMNDKKWV-WHKRLGHANWRLISKISKLQLVKGLPNIDYHSDALC 835 Query: 1803 ESCAKGKMSRSPFISSN-TKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 +C KGK+ +S F S + TSR LEL+H D+FGP+ T S G +Y + +DD+ Sbjct: 836 GACQKGKIVKSSFKSKDIVSTSRPLELLHIDLFGPVNTASLYGSKYGLVIVDDY 889 >emb|CAN61272.1| hypothetical protein VITISV_039063 [Vitis vinifera] Length = 1643 Score = 107 bits (267), Expect = 2e-20 Identities = 70/233 (30%), Positives = 119/233 (51%), Gaps = 3/233 (1%) Frame = +3 Query: 1272 VNVCINDNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEII 1451 + V + + W LD+G HM + + T T+KE S+K L ++G G+++I Sbjct: 609 LTVSTSSSAESWILDTGASYHMAYSRDLFT--TFKEWNGSVKLGDDGELGVKGSGSVQIK 666 Query: 1452 LPNFKLQLKNV-LVHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKKLISTSIG 1628 + + ++ N V LR+NLISV L G T + LR+ ++ Sbjct: 667 MYDGLVRTLNAWYVPGLRKNLISVGTLDKNGYTFSGS-----GGVLRVSKGALVVMKGRL 721 Query: 1629 SSGLYLLKSEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKK--VNGKNLPNTQEIC 1802 G+Y L +V + + + WH++LGH++++ L+ + K+ ++G + C Sbjct: 722 QHGIYTLMGSSVLGTAAVEEDNCTELWHRRLGHMSEKGLSILSKQGLLSGAETGKLK-FC 780 Query: 1803 ESCAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 E+C GK R F + T+ +LE IHSD++GP P +S+ G RY+V+FIDDF Sbjct: 781 ETCVMGKQRRVKFSMGSHTTNGVLEYIHSDLWGPSPVESHSGCRYYVTFIDDF 833 >ref|XP_003629120.1| Serine/threonine protein kinase SRPK1 [Medicago truncatula] gi|355523142|gb|AET03596.1| Serine/threonine protein kinase SRPK1 [Medicago truncatula] Length = 1025 Score = 106 bits (264), Expect = 4e-20 Identities = 82/289 (28%), Positives = 133/289 (46%), Gaps = 11/289 (3%) Frame = +3 Query: 1128 PNATELKERAKRKMRPARKEANITEVINPNIVALITEHINEFNKYTPEVNVCIN--DNHS 1301 P + K++ ++ E I + + P + +N+ ++ VC+ + Sbjct: 616 PESKIPKQKDQKNKAVTASEKTIPKGVKPKV-------LNDQKPFSIHSKVCLRAREKQR 668 Query: 1302 IWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKN 1481 W LDSGC HMTG +++ +T K D +K I G G I N + + N Sbjct: 669 SWYLDSGCSRHMTGEKALFLTLTMK-DGGEVKFGGNQTGKIIGTGTIG----NSSISINN 723 Query: 1482 
V-LVHRLRRNLISVRKLVMAGVTI---EENLTKYHDDQLRLFYNKKLISTSIGSSGLYLL 1649 V LV L+ NL+S+ + G + + N T + D + + K + + L Sbjct: 724 VWLVDGLKHNLLSISQFCDNGYDVMFSKTNCTLVNKDDKSITFKGKRVENVYKINFSDLA 783 Query: 1650 KSEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPN----TQEICESCAK 1817 + V L+ K W WH++LGH N R + +I K K PN + +C +C K Sbjct: 784 DQKVV--CLLSMNDKKWV-WHKRLGHANWRLIFKISKLQVVKGFPNIDYHSDALCGACQK 840 Query: 1818 GKMSRSPFISSN-TKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 GK+ +S F S + TSR LEL+H D+FGP+ T S G +Y + +DD+ Sbjct: 841 GKIVKSSFKSKDIVSTSRPLELLHIDLFGPVNTASLYGSKYGLVIVDDY 889 >gb|EFA12557.1| hypothetical protein TcasGA2_TC005030 [Tribolium castaneum] Length = 882 Score = 106 bits (264), Expect = 4e-20 Identities = 71/229 (31%), Positives = 115/229 (50%), Gaps = 8/229 (3%) Frame = +3 Query: 1296 HSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIIL---PNFK 1466 +++WCLDSGC+ H+ +E N+ ++D +K A M + G G++ I P+ Sbjct: 186 NNLWCLDSGCKSHLCKDEDFFVNV--RDDLGQLKLADNSMTRVCGKGDVRIATADNPDNV 243 Query: 1467 LQLKNVL-VHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLY 1643 + LK+ L V LR +L+S+ K+V G H+ + Sbjct: 244 VMLKDTLYVPNLRSHLLSISKIVDHG----------HEVTFK------------------ 275 Query: 1644 LLKSEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPNTQEI----CESC 1811 KS A+ ++ + +E H +LGH+N R L+ + K N K L + C++C Sbjct: 276 --KSCAIVLNSFGDSKSDVEERHTRLGHLNLRDLSLLAKGGNVKGLKTKSIVSEIKCDTC 333 Query: 1812 AKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDD 1958 K+SR+PF T +S +LEL+H+D+ GP TQS +G RYF++ IDD Sbjct: 334 FSAKISRAPFGVRETHSSELLELVHTDLCGPTQTQSMRGARYFMTLIDD 382 >gb|ACI62137.1| polyprotein [Drosophila melanogaster] Length = 1319 Score = 106 bits (264), Expect = 4e-20 Identities = 75/230 (32%), Positives = 118/230 (51%), Gaps = 9/230 (3%) Frame = +3 Query: 1299 SIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLK 1478 +IWC+DSG HM ++ + T+ KE SI A + G G + + N ++L+ Sbjct: 271 NIWCVDSGATSHMCCDKGLFTSFINKE--TSIMLAADKFVKSSGIGTVMLKSQNVNIELR 328 Query: 1479 NVL-VHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKK--LISTSIGSSGLYLL 1649 +V+ 
V L N +SV K EN+T + D + + NK+ ++ ++ LYL Sbjct: 329 DVIYVPSLHMNFLSVSKSAEY-----ENITTF-DKKAAVIKNKQGEVMMRAMQEDNLYLF 382 Query: 1650 KSEAV-GESNLANQSKPWKEWHQKLGHVNDRYLNEIYKK--VNGKNLPNTQEI--CESCA 1814 S + G +L N S WH + GH+N + L EI +K V G + N C++C Sbjct: 383 TSSSKNGAVHLLNDSSRMATWHNRFGHLNFQCLKEIKEKELVIGMDFKNMSVNINCDTCN 442 Query: 1815 KGKMSRSPFISSNTK-TSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 K+ PF ++ + T +LEL+HSD+ GPM S G +YFV+FIDD+ Sbjct: 443 MAKIHVLPFPQNSERATQSVLELVHSDVCGPMNVSSLGGNKYFVTFIDDY 492 >gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Arabidopsis thaliana] Length = 1522 Score = 104 bits (259), Expect = 2e-19 Identities = 75/236 (31%), Positives = 114/236 (48%), Gaps = 10/236 (4%) Frame = +3 Query: 1284 INDNHSI-WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPN 1460 + D+H W DS H+T N +L +SI A + L I G+ I + Sbjct: 318 VTDHHGHEWIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSS 377 Query: 1461 FKLQLKNVLV-HRLRRNLISVRKLVM---AGVTIEENLTKYHDDQLRLFYNKKLISTSIG 1628 K+ LK VLV + ++L+SV KL V + + + +D KKL+ Sbjct: 378 GKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKA-----TKKLLVMGRN 432 Query: 1629 SSGLYLLKSEAVGESNLANQSKPWKE-WHQKLGHVNDRYLNEIYKKVNGKNL----PNTQ 1793 GLY L+ + Q+ E WH++LGH N L+++ + K++ + Sbjct: 433 RDGLYSLEEPKLQVLYSTRQNSASSEVWHRRLGHANAEVLHQL---ASSKSIIIINKVVK 489 Query: 1794 EICESCAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 +CE+C GK +R PF+ S SR LE IH D++GP PT S QG+RY+V FID + Sbjct: 490 TVCEACHLGKSTRLPFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHY 545 >gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1333 Score = 104 bits (259), Expect = 2e-19 Identities = 71/254 (27%), Positives = 124/254 (48%), Gaps = 8/254 (3%) Frame = +3 Query: 1224 ALITEHINEFNKYTPEVNVCINDNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTA 1403 A T+++ E +K + +++W +DSGC HM+ ++S+ ++ + K+ ++ Sbjct: 284 ANFTQNVEEESKLFMASSQITESANAVWFIDSGCSNHMSSSKSLFRDLD-ESQKSEVRLG 342 Query: 1404 RKDMLTIRGFGNIEI--ILPNFKLQLKNVLVHRLRRNLISVRKLVMAG--VTIEENLTKY 1571 + I G G +EI + N K V L 
NL+SV +L+ +G V +N Sbjct: 343 DDKQVHIEGKGTVEIKTVQGNVKFLYDVQYVPTLAHNLLSVGQLMTSGYSVVFYDNACDI 402 Query: 1572 HDDQLRLFYNKKLISTSIGSSGLYLLKSEAVGESNLANQSKPWKE-WHQKLGHVNDRYLN 1748 D + + + + + ++ L VG S L + K WH + GH+N +L Sbjct: 403 KDKES----GRTIARVPMTQNKMFPLDISNVGNSALVVKEKNETNLWHLRYGHLNVNWLK 458 Query: 1749 EIYKKVNGKNLPNTQEI--CESCAKGKMSRSPF-ISSNTKTSRILELIHSDIFGPMPTQS 1919 + +K LPN +E+ CE C GK +R F + + + + LEL+H+D+ GPM +S Sbjct: 459 LLVQKDMVIGLPNIKELDLCEGCIYGKQTRKSFPVGKSWRATTCLELVHADLCGPMKMES 518 Query: 1920 YQGYRYFVSFIDDF 1961 G RYF+ F DD+ Sbjct: 519 LGGSRYFLMFTDDY 532 >emb|CAN71427.1| hypothetical protein VITISV_027864 [Vitis vinifera] Length = 1300 Score = 103 bits (258), Expect = 2e-19 Identities = 71/236 (30%), Positives = 120/236 (50%), Gaps = 12/236 (5%) Frame = +3 Query: 1290 DNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFK- 1466 D W +DSGC HMTG++ L +++ + ++ + TA L I GN ++ + Sbjct: 316 DYEKDWIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNSKLPIAHIGNT-VVSSQYNT 374 Query: 1467 --LQLKNVL-VHRLRRNLISVRKLVMAGVTI---EENLTKYHDDQLRLFYNKKLISTSIG 1628 + L+NV V +++NL+SV +L +G ++ +++ YHD ++ ++ + Sbjct: 375 NDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFGPQDVKVYHDLEVM----EEPVIKGRR 430 Query: 1629 SSGLYLLKSE-AVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPNTQ---- 1793 +Y++ +E A + N++ WH +L H++ L + KK K LP + Sbjct: 431 LESVYVMSAETAYVDKTRKNETADL--WHMRLSHISYSKLTMMMKKSMLKGLPQLEVRKX 488 Query: 1794 EICESCAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 IC C GK + P+ S K LELIHSD+FGP+ S G +Y V+FIDDF Sbjct: 489 TICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGMKYMVTFIDDF 544 >emb|CAB43904.1| putative protein [Arabidopsis thaliana] gi|7269745|emb|CAB81478.1| putative protein [Arabidopsis thaliana] Length = 1415 Score = 103 bits (258), Expect = 2e-19 Identities = 68/231 (29%), Positives = 112/231 (48%), Gaps = 12/231 (5%) Frame = +3 Query: 1305 WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKNV 1484 W DSG H+T + S L + ++S+ D L I G+ + L L++V Sbjct: 293 
WVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPLRDV 352 Query: 1485 LV-HRLRRNLISVRKLVMA----------GVTIEENLTKYHDDQLRLFYNKKLISTSIGS 1631 LV + ++L+SV KL GV +++ LTK +L++ Sbjct: 353 LVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTK------------QLLTKGTRH 400 Query: 1632 SGLYLLKSEAVGESNLANQSKPWKE-WHQKLGHVNDRYLNEIYKKVNGKNLPNTQEICES 1808 + LYLL++ + Q E WH +LGH N L ++ + + +C++ Sbjct: 401 NDLYLLENPKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLRNKAIVISKTSHSLCDA 460 Query: 1809 CAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 C GK+ + PF SS+ +SR+LE +H D++GP P S QG+RY+V FID++ Sbjct: 461 CQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFRYYVIFIDNY 511 >gb|AAQ01581.1| agCP7521-like protein [Aedes albopictus] Length = 602 Score = 103 bits (257), Expect = 3e-19 Identities = 73/231 (31%), Positives = 115/231 (49%), Gaps = 11/231 (4%) Frame = +3 Query: 1302 IWCLDSGCRLHMTGNESILTNITWKEDKNSIKTAR-------KDMLTIRGFGNIEIILPN 1460 ++ LDSG H+ ++S ++ I A+ + + I G N+ + Sbjct: 266 VFKLDSGSSDHLVNSKSFFASLKPAPQTVIINVAKDGQFLEARQVGVIAGSSNLGV---- 321 Query: 1461 FKLQLKNVL-VHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSSG 1637 LQ+K+VL V LR NL+SV+KL AG+ + ++ L N I+T+ Sbjct: 322 -PLQVKDVLYVPSLRDNLMSVKKLAKAGIEVV-----FNSKLATLKLNGNPIATAYLRGN 375 Query: 1638 LYLLKSEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNL---PNTQEICES 1808 LY LK E +S S WH++LGH+ + + + ++ K L P + C++ Sbjct: 376 LYELKIEVPEKSANLCSSDVTNLWHRRLGHLCENGMKTMVREDLAKGLNFKPEKLKFCDA 435 Query: 1809 CAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 C +GKM R PF + + +R L IHSD+ GP+ S+ G RYFVSFIDD+ Sbjct: 436 CVQGKMCREPFDGTRERATRPLGRIHSDVCGPIEPASWDGCRYFVSFIDDY 486 >gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] Length = 1318 Score = 102 bits (255), Expect = 5e-19 Identities = 74/231 (32%), Positives = 115/231 (49%), Gaps = 10/231 (4%) Frame = +3 Query: 1299 SIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLK 1478 SIW +DS C H+TG ++ K K++++ ++L I G G + I + Sbjct: 327 SIWLIDSACSTHITGKIKNFLDLN-KAYKSTVEIGDGNLLKIAGRGTVGITTKKGMKTIA 385 Query: 1479 
NV-LVHRLRRNLISVRKLVMAGVTIEENLTKYHDDQLRLFY-NKKLISTSIGSSGLYLLK 1652 NV + +NL+SV +LV E+N + D+ +F + + I+T + + L Sbjct: 386 NVCFAPEVTQNLLSVGQLVK-----EKNSLLFKDELCTIFDPSGREIATVKMRNKCFPLD 440 Query: 1653 SEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPNTQEI-------CESC 1811 G S + WH++LGH+N +++ K + NL N I CE C Sbjct: 441 LNEAGHMAYKCVSNEARLWHRRLGHINYQFI----KNMGSLNLVNDMPIITEVEKTCEVC 496 Query: 1812 AKGKMSRSPFIS-SNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 +GK SR PF S T+T+ L+LIH+DI GP+ T S G +YF+ FIDDF Sbjct: 497 LQGKQSRHPFPKQSQTRTANRLQLIHTDICGPIGTLSLNGNKYFILFIDDF 547 >gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from Arabidopsis thaliana BAC gb|AF080119 and is a member of the reverse transcriptase family PF|00078 [Arabidopsis thaliana] Length = 1415 Score = 102 bits (255), Expect = 5e-19 Identities = 71/231 (30%), Positives = 112/231 (48%), Gaps = 6/231 (2%) Frame = +3 Query: 1287 NDNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFK 1466 +D W DS H+T + + L + T E +++ L I G+ I N K Sbjct: 316 DDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGK 375 Query: 1467 LQLKNVLV-HRLRRNLISVRKLV---MAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSS 1634 + L VLV ++++L+SV KL GV + N D Q +K+++T + Sbjct: 376 IPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQ-----TQKVVTTGPRRN 430 Query: 1635 GLYLLKSEAVGESNLANQSKPWKE-WHQKLGHVNDRYLNEIYK-KVNGKNLPNTQEICES 1808 GLY+L+++ Q +E WH +LGH N + L + K N T +CE Sbjct: 431 GLYVLENQEFVALYSNRQCAATEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEP 490 Query: 1809 CAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 C GK SR PF+ S+++ L+ IH D++GP P S QG +Y+ F+DD+ Sbjct: 491 CQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDY 541 >emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera] Length = 1472 Score = 102 bits (255), Expect = 5e-19 Identities = 81/262 (30%), Positives = 124/262 (47%), Gaps = 14/262 (5%) Frame = +3 Query: 1215 NIVALITEHINEFNKYTPEVNVCINDNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSI 1394 N V + + F Y EV +++IW LDSGC HMTG +S+ + + K + Sbjct: 265 
NYVEQEEDQVKLFMAYNEEVV----SSNNIWFLDSGCSNHMTGIKSLFKELD-ESHKLKV 319 Query: 1395 KTARKDMLTIRGFGNIEIILP--NFKLQLKNVLVHRLRRNLISVRKLVMAGVTIEEN--- 1559 K + + G G + N KL + L +NL+SV +L+++G +I + Sbjct: 320 KLGDDKQVQVEGKGTXAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILFDGAT 379 Query: 1560 --LTKYHDDQL----RLFYNKKLISTSIGSSGLYLLKSEAVGESNLANQSKPWKEWHQKL 1721 + DQ+ R+ NK L + S + L + ESNL WH + Sbjct: 380 CVIKDKKSDQIIVBVRMAANK-LFPLEVSSIEKHALVVKETSESNL---------WHLRY 429 Query: 1722 GHVNDRYLNEIYKKVNGKNLP--NTQEICESCAKGKMSRSPFISSNTK-TSRILELIHSD 1892 GH+N + L + KK LP ++ +CE C GK S+ PF ++ S LE+IH+D Sbjct: 430 GHLNVKGLKLLSKKEMVFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHAD 489 Query: 1893 IFGPMPTQSYQGYRYFVSFIDD 1958 + GPM T S+ G RYF+ F DD Sbjct: 490 LCGPMQTASFGGSRYFLLFTDD 511 >ref|XP_005715938.1| unnamed protein product [Chondrus crispus] gi|507112437|emb|CDF36119.1| unnamed protein product [Chondrus crispus] Length = 753 Score = 78.6 bits (192), Expect(2) = 5e-19 Identities = 68/275 (24%), Positives = 131/275 (47%), Gaps = 8/275 (2%) Frame = +3 Query: 1158 KRKMRPARKEANITEVINPNIVALITEHINEFNKYTPEVNVCINDNHSIWCLDSGCRLHM 1337 K +M ++ A +T+ +P++V + +K + ++ ++ + W +DS C H+ Sbjct: 251 KPRMSQRKQSAFVTQKPDPDVVVNSVDFTCLMSKASRTNDLEMSPS---WLVDSACTAHI 307 Query: 1338 TGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILP-NFKLQ---LKNVL-VHRLR 1502 T + S+ E S++ K + G G++ + L N +++ L +VL V Sbjct: 308 TYDRSLFATYEPLESA-SVQMGTKASAKVAGRGDVHLKLNVNGRIEPCKLTDVLHVPDFA 366 Query: 1503 RNLISVRKLVMAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLYLLKSEA-VGESNL 1679 +L+SV ++ G+ + + + + + +++T+ LY+L + VG ++ Sbjct: 367 FSLLSVSRMTELGLKVG-----FENGKCMIRRGSTVVATATLVGELYVLDIVSDVGSAHA 421 Query: 1680 ANQSKPWKEWHQKLGHVNDRYLNEIYKKVNGKNLPNTQEICESCAKGKMSRS--PFISSN 1853 A + WH++ H N K+N N E C +C GK +RS P S+ Sbjct: 422 ATL----QTWHERFAHAN---------KINNTNNDCISEKCSACVYGKATRSVIPKERSS 468 Query: 1854 TKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDD 1958 + L+L+HSD+ GP+ QS G +YF++FIDD Sbjct: 469 RRAYFCLDLVHSDVCGPLEVQSIGGAKYFITFIDD 503 Score = 44.7 bits (104), Expect(2) = 5e-19 Identities = 
49/214 (22%), Positives = 88/214 (41%), Gaps = 13/214 (6%) Frame = +2 Query: 470 LRSNNYEA*KDKISVLLKSNNLLMIVLKGKEKDSIF*TDK------DNATHTLLSLSESE 631 L +N+ K KI +LL ++ +++G+ ++ D+ L+ LS S+ Sbjct: 15 LTDSNFYVWKQKIQLLLALRDVDQYIVEGRVPSEERAEERKKWIRGDSKAKALIGLSLSD 74 Query: 632 EIAPLI*KAQNAHDSWTRLNKHFGRKSPTKLRLLISEIENLKMKEDENSAKLIRKVLDLQ 811 E + +AH+ W + F R + E +KM E I +V L Sbjct: 75 EHLEHVRDVDSAHEMWEAIVNVFERHTLLNKLAARREFYTVKMLSGEKVLAYINRVKQLA 134 Query: 812 QQIEDQGKNLLDIDLIHYTLKALPLKFVDFISKFD---NDD*DITYDVFCNKLQIMETKL 982 ++ N+ D ++ L LP +F I D N++ + D ++L E + Sbjct: 135 AILKSMSVNIDDKEMAMAVLNGLPARFEALIVALDALGNEEKIFSLDFVKSRLLQEEQRA 194 Query: 983 TLRNNLLDQFDAMVAHRYPNKR----RKPTYCKH 1072 ++++ Q A+V +R PN R K T C H Sbjct: 195 NMKSS-SSQTSALV-NRAPNNRDINDYKCTNCGH 226 >emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera] Length = 1274 Score = 102 bits (254), Expect = 6e-19 Identities = 81/262 (30%), Positives = 125/262 (47%), Gaps = 14/262 (5%) Frame = +3 Query: 1215 NIVALITEHINEFNKYTPEVNVCINDNHSIWCLDSGCRLHMTGNESILTNITWKEDKNSI 1394 N V + + F Y EV +++IW LDSGC HMTG +S+ + + K + Sbjct: 280 NYVEQEEDQVKLFMXYNEEVV----SSNNIWFLDSGCSNHMTGIKSLFKELD-ESHKLKV 334 Query: 1395 KTARKDMLTIRGFGNIEIILP--NFKLQLKNVLVHRLRRNLISVRKLVMAGVTIEEN--- 1559 K + + G G + + N KL + L +NL+SV +L+++G +I + Sbjct: 335 KLGDDKQVXVEGKGIMAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILFDGAT 394 Query: 1560 --LTKYHDDQL----RLFYNKKLISTSIGSSGLYLLKSEAVGESNLANQSKPWKEWHQKL 1721 + DQ+ R+ NK L + S + L + ESNL WH + Sbjct: 395 CVIKDKKSDQIIVNVRMAANK-LFPLEVSSIEKHALVVKETSESNL---------WHLRY 444 Query: 1722 GHVNDRYLNEIYKKVNGKNLP--NTQEICESCAKGKMSRSPFISSNTK-TSRILELIHSD 1892 GH+N + L + KK LP ++ +CE C GK S+ PF ++ S LE+IH+D Sbjct: 445 GHLNVKGLKLLSKKEMVFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHAD 504 Query: 1893 IFGPMPTQSYQGYRYFVSFIDD 1958 + GPM T S+ G RYF+ F DD Sbjct: 505 LCGPMQTASFGGSRYFLLFTDD 526 >gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1149 Score = 102 bits (254), Expect = 6e-19 Identities = 
74/230 (32%), Positives = 110/230 (47%), Gaps = 12/230 (5%) Frame = +3 Query: 1305 WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKNV 1484 W DS H+T N S L + +++ + + L I G+ + + L LK+V Sbjct: 314 WVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGNLPLKDV 373 Query: 1485 LV-HRLRRNLISVRKLVMA----------GVTIEENLTKYHDDQLRLFYNKKLISTSIGS 1631 LV + ++L+SV KL GV +++ T K L S S Sbjct: 374 LVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATC-----------KVLTKGSSTS 422 Query: 1632 SGLYLLKSEAVGESNLANQSKPWKE-WHQKLGHVNDRYLNEIYKKVNGKNLPNTQEICES 1808 GLY L++ Q K E WH +LGH N + L + K + +T ++CES Sbjct: 423 EGLYKLENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQLLANKKAIQINKSTSKMCES 482 Query: 1809 CAKGKMSRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDD 1958 C GK SR PFI+S+ SR LE +H D++GP P S QG++Y+V FID+ Sbjct: 483 CRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQGFQYYVIFIDN 532 >emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] Length = 1466 Score = 102 bits (253), Expect = 8e-19 Identities = 69/225 (30%), Positives = 110/225 (48%), Gaps = 6/225 (2%) Frame = +3 Query: 1305 WCLDSGCRLHMTGNESILTNITWKEDKNSIKTARKDMLTIRGFGNIEIILPNFKLQLKNV 1484 W DS H+T + S L N T E +++ L I G+ I + L V Sbjct: 324 WYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEV 383 Query: 1485 LV-HRLRRNLISVRKLV---MAGVTIEENLTKYHDDQLRLFYNKKLISTSIGSSGLYLLK 1652 LV ++++L+SV KL GV + N D +K++S ++GLY+L+ Sbjct: 384 LVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIID-----LTTQKVVSKGPRNNGLYMLE 438 Query: 1653 -SEAVGESNLANQSKPWKEWHQKLGHVNDRYLNEIY-KKVNGKNLPNTQEICESCAKGKM 1826 SE V + + + WH +LGH N + L ++ +K N T +CE C GK Sbjct: 439 NSEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKS 498 Query: 1827 SRSPFISSNTKTSRILELIHSDIFGPMPTQSYQGYRYFVSFIDDF 1961 +R F SS+ + + L+ +H D++GP P S QG++Y+ F+DDF Sbjct: 499 TRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDF 543