BLASTX nr result
ID: Papaver25_contig00035185
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00035185
         (1439 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...  258  6e-66
ref|XP_007203344.1| hypothetical protein PRUPE_ppa020282mg [Prun...  256  2e-65
emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...  254  7e-65
gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlise...  253  1e-64
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...  251  4e-64
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...  251  6e-64
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...  249  3e-63
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...  248  4e-63
gb|ABA98491.1| retrotransposon protein, putative, unclassified [...  247  8e-63
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...  246  1e-62
ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prun...  244  7e-62
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...  244  7e-62
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...  244  9e-62
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...  243  2e-61
gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa ...  242  3e-61
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...  242  3e-61
ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626...  241  4e-61
ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prun...
241 6e-61 gb|ABA96650.1| retrotransposon protein, putative, unclassified [... 240 1e-60 gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam... 239 2e-60 >gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa] Length = 1656 Score = 258 bits (658), Expect = 6e-66 Identities = 161/477 (33%), Positives = 244/477 (51%), Gaps = 16/477 (3%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXGF----VNIVINDCDLTDLG*IGKDYTWSSNITGTC 1217 PWLVLGD N + + +N+ L DL G ++W + G Sbjct: 736 PWLVLGDFNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRV 795 Query: 1216 NRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSWLS 1037 K R+D ALGN WS P++++ HL +GSDH P+LL ++ F W + Sbjct: 796 FIKERLDRALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTT 855 Query: 1036 DNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKLQSELEVL 857 S I + W GS + L S K L W+++ F N + QV L S++E L Sbjct: 856 HEEYSDVIQRSWPPAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKL 915 Query: 856 QNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINRKKAR 677 P + H+ I + ++K + ++ Q+SR + K D N+ FFH +++ Sbjct: 916 HQSNPPDA-HHQINILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQY 974 Query: 676 NNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDNVNLC 497 N I +KD GNW+ + D+A D+F + S+ P E + + T +T+E N L Sbjct: 975 NKIVRLKDDHGNWLDSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAEMNKILS 1034 Query: 496 KIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILKEIN 350 L E+ A+ +GF FY++QWE V + I + + S +L+ +N Sbjct: 1035 SPVSLLEVKKAVFDLGATKAPGPDGFSGIFYQNQWEWVQSIIHESALQHQTSSSLLQVMN 1094 Query: 349 KTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVSGRM 170 +T+++LIPK K PT YRPI LCN SYKI++KI+ RL+P M ++IS Q+A+VS R Sbjct: 1095 RTHLALIPKVKAPTHPSHYRPIALCNFSYKILTKIIASRLQPFMSELISDNQSAFVSNRQ 1154 Query: 169 ISDNTIIAQEIIHSMKKKRG-ESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFSENF 2 I DN IIA EI H +K R +G LKLDM+KA++R+EW+FL VL GF +++ Sbjct: 1155 IQDNVIIAHEIYHHLKLTRSCNNGAFGLKLDMNKAYDRVEWNFLEAVLRKMGFVDSW 1211 >ref|XP_007203344.1| hypothetical protein 
PRUPE_ppa020282mg [Prunus persica] gi|462398875|gb|EMJ04543.1| hypothetical protein PRUPE_ppa020282mg [Prunus persica] Length = 1496 Score = 256 bits (654), Expect = 2e-65 Identities = 156/470 (33%), Positives = 239/470 (50%), Gaps = 12/470 (2%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXGFVNIVINDCDLTDLG*IGKDYTWSSNITGTCNRKS 1205 PWL GD N + + I+ C DLG G YTW N + Sbjct: 517 PWLCCGDFNEILRADE-----------KLAIDTCRFKDLGYTGPKYTWWRN--NPMEIRI 563 Query: 1204 RIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSWLSDNSC 1025 R+D AL DW ++ +++ HLN SDH P+ KKL F W +C Sbjct: 564 RLDRALATADWCSRFLGTKVIHLNPTKSDHLPL--------KKL---FRFEEMWAEHVNC 612 Query: 1024 SVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKLQSELEVLQNQT 845 I GW GS F +KLK TR L W++ +FG++ Q+ + +L L + Sbjct: 613 MQTIQDGWQRTCRGSAPFTTTEKLKCTRHKLLGWSKCNFGHLPNQIKITREKLGELLDAP 672 Query: 844 PGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINRKKARNNID 665 P + L + +++Q SR T+ K D N+KFFH K + ++ RN I Sbjct: 673 PSHHTAELRNALTKQLDSLMAKNEVYWRQCSRATWLKAGDRNSKFFHYKASSRRRRNTIS 732 Query: 664 AIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDNVNLCKIPE 485 A++D G+W + + + ++F+ + +S+ S + + +T E N L + Sbjct: 733 ALEDEHGHWQTTEQGLTQTVVNYFQHLFSSTGSSEYTEVVDGVRGRVTEEMNQALLAVFT 792 Query: 484 LQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILKEINKTYI 338 +EI AL +GF FY+ W +VG D+ V FF++ +LK IN T++ Sbjct: 793 PEEIKIALFQMHPSKAPGPDGFSPFFYQKYWPIVGEDVVAAVLHFFKTGKLLKRINFTHV 852 Query: 337 SLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVSGRMISDN 158 +LIPK +P ++ RPI LCN YKI +K+L RLK ++ +IS Q+A+V GR ISDN Sbjct: 853 ALIPKVHEPKNMMQLRPISLCNVLYKIGAKVLTTRLKAILPTLISDTQSAFVPGRAISDN 912 Query: 157 TIIAQEIIHSM-KKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFS 11 +I+A E++H M KK +G G++ALK+DMSKA++R+EWSFL ++ GF+ Sbjct: 913 SIVAFELLHMMHKKNQGRQGYLALKIDMSKAYDRVEWSFLEALMKGMGFA 962 >emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1362 Score = 254 bits (649), Expect = 7e-65 Identities = 148/449 (32%), Positives = 231/449 (51%), Gaps = 17/449 (3%) Frame = -2 Query: 1297 VINDCDLTDLG*IGKDYTWSSNITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSD 1118 VI+DC + DLG +G +TW + + + R+D L N +W +P + HL SD Sbjct: 164 VIDDCAVKDLGYVGNRFTWQRGNSPSTLIRERLDRMLANDEWCDNFPSWEVVHLPRYRSD 223 Query: 1117 HSPILLVTDYTQK-----KLWIPF*KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKL 953 H+P+LL T KL F WLS C + + W NGS G +L Sbjct: 224 HAPLLLKTGVNDSFRRGNKL---FKFEAMWLSKEECGKIVEEAW----NGSAGEDITNRL 276 Query: 952 KSTRKILSKWNRDHFGNINQQVDKLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRA 773 + LS W FGN+ ++ + + L LQ + P V+ DL + H+ Sbjct: 277 DEVSRSLSTWATKTFGNLKKRKKEALTLLNGLQQRDPDASTLEQCRIVSGDLDEIHRLEE 336 Query: 772 DFYQQKSRFTFYKEHDNNTKFFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHF 593 ++ ++R ++ D NTK+FH K +++K RN I+ + D G W RE+I + +F Sbjct: 337 SYWHARARANEIRDGDKNTKYFHHKASQRKRRNTINELLDENGVWKKGREEICGVVQHYF 396 Query: 592 RKISTSSNPSLEERLYLVLPTVITSEDNVNLCKIPELQEIHSAL-----------EGFQA 446 + + +P E L ++++ N L +P E+ AL +G A Sbjct: 397 EGLFATDSPVNMELALEGLSHCVSTDMNTALLMLPSGDEVKEALFAMHPNKAPGIDGLHA 456 Query: 445 GFYKSQWEVVGTDICKMVWKFFQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTS 266 F++ W ++G+D+ V +++ L +NKT I LIPK P + D+RPI LC Sbjct: 457 LFFQKFWHILGSDVISFVQSWWRGMGDLGVVNKTCIVLIPKCDHPQSMKDFRPISLCTVL 516 Query: 265 YKIISKILVGRLKPVMEKIISPFQAAYVSGRMISDNTIIAQEIIHSMKKK-RGESGWIAL 89 YKI+SK L RLK ++ IISP Q+A+V R+I+DN ++A EI H+MK+K ++G AL Sbjct: 517 YKILSKTLANRLKVILPAIISPNQSAFVPRRLITDNALVAFEIFHAMKRKDANKNGVCAL 576 Query: 88 KLDMSKAFNRLEWSFLLKVLNYFGFSENF 2 KLDMSKA++R+EW FL +V+ GF + + Sbjct: 577 KLDMSKAYDRVEWCFLERVMKKMGFCDGW 605 >gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlisea aurea] Length = 1503 Score = 253 bits (647), Expect = 1e-64 Identities = 154/488 (31%), Positives = 249/488 (51%), Gaps = 27/488 (5%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITGT 1220 PWLV+GD N + F N + +CDL+DLG G +TW++N T Sbjct: 457 
PWLVVGDFNEVLWQDEHLSSCLRSCSSMGLFRN-ALEECDLSDLGFQGYPFTWTNNRTHP 515 Query: 1219 CNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYT-------QKKLWIPF 1061 K+R+D + N W P + HL GSDH PILL+ ++K + F Sbjct: 516 STVKARLDRFVANTSWINIVPHFSVSHLKFGGSDHCPILLMFKDVVGCHTTLRRKRFFKF 575 Query: 1060 *KNYSWLSDNSCSVEIAKGWSI-NVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVD 884 K W + +C V I W++ + P +++L++ R+ L W+R G++ ++ Sbjct: 576 EK--IWCENETCRVIIDGCWAVPRSSWCPQLSLLRRLQNCRQKLQCWHRTSIGSLRHRIS 633 Query: 883 KLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFH 704 +Q L L + V + I + A LS+ K +++Q+S+ + +E D N KFFH Sbjct: 634 SIQDRLSTLMEGVISDSVGDQIRDLKAQLSQLLKLDEIWWKQRSKVHWLREGDKNNKFFH 693 Query: 703 AKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLE--ERLYLVLPT 530 + ++ RN I+ +K W+ N DI + + S+ PS + + P Sbjct: 694 GVASSRQRRNKIERLKSRNNIWLENTSDIHHEFISVYEDLFKSTYPSEDAINNIVRTAPR 753 Query: 529 VITSEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKF 383 ++T E N L + +EI +A+ +GF FY+ W +G+++C V F Sbjct: 754 MVTDEMNRKLTQAFTSEEILTAVMQMNADSAPGPDGFPPLFYQKFWPTIGSEVCNSVLDF 813 Query: 382 FQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIIS 203 +R ++ N T I IPK P V YRPI LCN YK+ SK + RLK + +IIS Sbjct: 814 LNNRKCFRKFNHTNIVFIPKVSDPVEVAHYRPISLCNVIYKMASKCITNRLKEFVSEIIS 873 Query: 202 PFQAAYVSGRMISDNTIIAQEIIHSMKK-KRGESGWIALKLDMSKAFNRLEWSFLLKVLN 26 P+Q+A+V R+I+DN ++A E+ HS++ +RG+ +++LKLDM+KA++R+EWSFL +L Sbjct: 874 PWQSAFVPDRLITDNILVAFEVNHSIRNLRRGKKSFVSLKLDMNKAYDRVEWSFLKAMLI 933 Query: 25 YFGFSENF 2 GF +F Sbjct: 934 QLGFHISF 941 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 251 bits (642), Expect = 4e-64 Identities = 153/484 (31%), Positives = 255/484 (52%), Gaps = 22/484 (4%) Frame = -2 Query: 1387 QPWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITG 1223 +PWLV GD N + F ++++ DC L D G G +TW++N Sbjct: 1013 EPWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLL-DCGLLDGGFEGNPFTWTNN--- 1068 Query: 1222 
TCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSW 1043 R+D + N W +P +R+ HLN GSDH P+L+ + +K F ++W Sbjct: 1069 --RMFQRLDRVVYNHQWINMFPITRIQHLNRDGSDHCPLLISCFISSEKSPSSFRFQHAW 1126 Query: 1042 LSDNSCSVEIAKGWSINVNGSPGFQCVQ-KLKSTRKILSKWNRDHFGNIN---QQVDKLQ 875 + + + W++ +NGS G Q K ++ L WN+ FG+I ++ +K Sbjct: 1127 VLHHDFKTSVEGNWNLPINGS-GLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRV 1185 Query: 874 SELEVL--QNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHA 701 E E+L Q QT G ++ + K A L+K F++QKS + E + NTKFFH Sbjct: 1186 EECEILHQQEQTVGSRIN--LNKSYAQLNKQLNVEEIFWKQKSGVKWVVEGERNTKFFHM 1243 Query: 700 KINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVIT 521 ++ +K+ R++I +++ +G WI ++E + ++F + + + ++P++I+ Sbjct: 1244 RMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLLKAEPCDISRFQNSLIPSIIS 1303 Query: 520 SEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQS 374 + +N LC P LQE+ A+ +GF + FY+ W + D+ V FF Sbjct: 1304 NSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDFFHG 1363 Query: 373 RHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQ 194 +I + + T + L+PKK + ++RPI LC KII+K+L RL ++ II+ Q Sbjct: 1364 ANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIITENQ 1423 Query: 193 AAYVSGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGF 14 + +V GR+ISDN ++AQE+I + K G +ALKLDM KA++RL+WSFL+KVL +FGF Sbjct: 1424 SGFVGGRLISDNILLAQELIRKLDTK-SRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGF 1482 Query: 13 SENF 2 +E + Sbjct: 1483 NEQW 1486 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 251 bits (641), Expect = 6e-64 Identities = 155/479 (32%), Positives = 249/479 (51%), Gaps = 21/479 (4%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITGT 1220 PW+V GD N + F ++++ DC L D G G +TW++N Sbjct: 977 PWIVGGDFNIILKREERLYGADPHEGSIEDFASVLL-DCGLLDGGFEGNPFTWTNN---- 1031 Query: 1219 CNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSWL 1040 R+D + 
N W ++P +R+ HLN GSDH P+LL + +K F ++W Sbjct: 1032 -RMFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHCPLLLSCSNSSEKAPSSFRFLHAWA 1090 Query: 1039 SDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQV---DKLQSE 869 ++ + + W++ +NGS K K ++ L WN+ FG+I + +K E Sbjct: 1091 LHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEKRVEE 1150 Query: 868 LEVL--QNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKI 695 E+L Q QT G + + K A L+K F++QKS + E + NTKFFH ++ Sbjct: 1151 CEILHQQEQTIGSRIQ--LNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERNTKFFHMRM 1208 Query: 694 NRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSE 515 +K+ R++I I++ +GNWI + E + D F + + + + P++I+ Sbjct: 1209 QKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSLCPSIISDT 1268 Query: 514 DNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRH 368 DN LC P LQE+ A+ +GF + FY+ W+++ D+ + V +FF Sbjct: 1269 DNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGAD 1328 Query: 367 ILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAA 188 I + + T + LIPK + ++RPI LC KII+KIL RL ++ II+ Q+ Sbjct: 1329 IPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSG 1388 Query: 187 YVSGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFS 11 +V GR+ISDN ++AQE+I + +K G +ALKLDM KA++RL+WSFL KVL + GF+ Sbjct: 1389 FVGGRLISDNILLAQELIGKLDQK-NRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFN 1446 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 249 bits (635), Expect = 3e-63 Identities = 151/486 (31%), Positives = 254/486 (52%), Gaps = 21/486 (4%) Frame = -2 Query: 1396 NVAQPWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSN 1232 ++ PWLV GD N + F + ++ DC L D G G +TW++N Sbjct: 1180 DIEVPWLVGGDFNIILKREERLYGSAPHEGAMEDFASTLL-DCGLLDGGFEGNPFTWTNN 1238 Query: 1231 ITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KN 1052 R+D + N W ++P +R+ HLN GSDH P+L+ + +K F Sbjct: 1239 -----RMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNSSEKAPSSFRFQ 1293 
Query: 1051 YSWLSDNSCSVEIAKGWSINVNGSPGFQCV-QKLKSTRKILSKWNRDHFGNINQQVDKLQ 875 ++W+ + + W++ +NGS G Q K ++ L WN+ FG+I ++ + + Sbjct: 1294 HAWVLHHDFKTSVESNWNLPINGS-GLQAFWSKQHRLKQHLKWWNKVMFGDIFSKLKEAE 1352 Query: 874 SELEVLQNQTPGEVVHNDILKVN---ADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFH 704 +E + E I+K+N A L+K F++QKS + E + NTKFFH Sbjct: 1353 KRVEECEILHQNEQTVESIIKLNKSYAQLNKQLNIEEIFWKQKSGVKWVVEGERNTKFFH 1412 Query: 703 AKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYL-VLPTV 527 ++ +K+ R++I +++ +G WI ++E + +F + P + R ++P++ Sbjct: 1413 TRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYFSSL-LKFEPCDDSRFQRSLIPSI 1471 Query: 526 ITSEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFF 380 I++ +N LC P LQE+ A+ +GF + FY+ W ++ D+ V FF Sbjct: 1472 ISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDFF 1531 Query: 379 QSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISP 200 +I + + T + L+PKK + D+RPI LC KII+K+L RL ++ II+ Sbjct: 1532 HGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLSNRLAKILPSIITE 1591 Query: 199 FQAAYVSGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYF 20 Q+ +V GR+ISDN ++AQE+I + K G +ALKLDM KA++RL+WSFL+KVL +F Sbjct: 1592 NQSGFVGGRLISDNILLAQELIGKLNTK-SRGGNLALKLDMMKAYDRLDWSFLIKVLQHF 1650 Query: 19 GFSENF 2 GF++ + Sbjct: 1651 GFNDQW 1656 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 248 bits (634), Expect = 4e-63 Identities = 154/483 (31%), Positives = 250/483 (51%), Gaps = 21/483 (4%) Frame = -2 Query: 1396 NVAQPWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSN 1232 ++ PWLV GD N + F + ++ DC L D G G +TW++N Sbjct: 1008 DIEVPWLVGGDFNVILKREERLYGSAPHEGAMEDFASTLL-DCGLLDGGFEGNSFTWTNN 1066 Query: 1231 ITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KN 1052 R+D + N W ++P +R+ HLN GSDH P+L+ + +K F Sbjct: 1067 -----RMFQRLDRIVYNHHWINKFPVTRIQHLNRDGSDHCPLLISCFNSSEKAPSSFRFQ 1121 Query: 1051 
YSWLSDNSCSVEIAKGWSINVNGSPGFQCV-QKLKSTRKILSKWNRDHFGNIN---QQVD 884 ++W+ + + W++ +NGS G Q K ++ L WN+ FG+I ++ + Sbjct: 1122 HAWVLHHDFKTSVESNWNLPINGS-GLQAFWSKQHRLKQHLKWWNKAVFGDIFSKLKEAE 1180 Query: 883 KLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFH 704 K E E+L Q + K A L+K F++QKS + E + NTKFFH Sbjct: 1181 KRVEECEILHQQEQTFESRIKLNKSYAQLNKQLNIEELFWKQKSGVKWVVEGERNTKFFH 1240 Query: 703 AKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYL-VLPTV 527 ++ +K+ R++I ++D EG WI ++E + ++F + P + R ++P++ Sbjct: 1241 MRMQKKRIRSHIFKVQDPEGRWIEDQEQLKHSAIEYFSSL-LKVEPCYDSRFQSSLIPSI 1299 Query: 526 ITSEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFF 380 I++ +N LC P LQE+ A+ +GF + FY+ W ++ D+ V FF Sbjct: 1300 ISNSENELLCAEPSLQEVKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFF 1359 Query: 379 QSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISP 200 +I + + T + L+PKK + D+RPI LC KII+K+L RL V+ II+ Sbjct: 1360 HGANIPRGVTSTTLILLPKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITE 1419 Query: 199 FQAAYVSGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYF 20 Q+ +V GR+ISDN ++AQE+I + K G +ALKLDM KA+++L+WSFL KVL +F Sbjct: 1420 NQSGFVGGRLISDNILLAQELIGKLNTK-SRGGNLALKLDMMKAYDKLDWSFLFKVLQHF 1478 Query: 19 GFS 11 GF+ Sbjct: 1479 GFN 1481 >gb|ABA98491.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1621 Score = 247 bits (631), Expect = 8e-63 Identities = 150/485 (30%), Positives = 249/485 (51%), Gaps = 24/485 (4%) Frame = -2 Query: 1396 NVAQPWLVLGDLN----FHIXXXXXXXXXXXXGFVNIVINDCDLTDLG*IGKDYTWSSNI 1229 N PWL+ GD N H + DC L DLG G +TW ++ Sbjct: 373 NPTTPWLMAGDFNEILFSHEKQGGRMKAQSAMDEFRHALTDCGLDDLGFEGDAFTWRNHS 432 Query: 1228 TGTCNR-KSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKL-----WI 1067 + R+D A+ N +W +P +R+ + + SDH P+++ + K + Sbjct: 433 HSQEGYIRERLDRAVANPEWRAMFPAARVINGDPRHSDHRPVIIELEGKNKGVRGRNGHN 492 Query: 1066 PF*KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQV 887 F +WL + + + W ++ G G L LS W+ + G++ ++V Sbjct: 493 
DFRFEAAWLEEEKFKEVVKEAWDVSA-GLQGLPVHASLAGVAAGLSSWSSNVLGDLEKRV 551 Query: 886 DKLQSELEVLQNQ--TPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTK 713 K++ ELE + Q + +VV ++L+ L K ++ +++Q++ + + D NT Sbjct: 552 KKVKKELETCRRQPISRDQVVREEVLRYR--LEKLEQQVDIYWKQRAHTNWLNKGDRNTS 609 Query: 712 FFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLP 533 FFHA + ++ RN I+ ++ +G+W+ ED A + + F+++ TS+ ++L V+ Sbjct: 610 FFHASCSERRRRNRINKLRREDGSWVEREEDKRAMIIEFFKQLFTSNGGQNSQKLLDVVD 669 Query: 532 TVITSEDNVNLCKIPELQEIHSALE-----------GFQAGFYKSQWEVVGTDICKMVWK 386 ++ N +L +E+ AL+ G AGFYK+ W+VVG + V + Sbjct: 670 RKVSGAMNESLRAEFTREEVKEALDAIGDLKAPGPDGMPAGFYKACWDVVGEKVTDEVLE 729 Query: 385 FFQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKII 206 + I + N I LIPK KKP ++ D RPI LCN YK++SK+L RLK ++ +I Sbjct: 730 VLRGGAIPEGWNDITIVLIPKVKKPELIKDLRPISLCNVCYKLVSKVLANRLKKILPDVI 789 Query: 205 SPFQAAYVSGRMISDNTIIAQEIIHSMKKKR-GESGWIALKLDMSKAFNRLEWSFLLKVL 29 SP Q+A+V GR+ISDN +IA E+ H M+ KR G+ G+ A KLDMSKA++R+EWSFL ++ Sbjct: 790 SPAQSAFVPGRLISDNILIADEMTHYMRNKRSGQVGYAAFKLDMSKAYDRVEWSFLHDMI 849 Query: 28 NYFGF 14 GF Sbjct: 850 LKLGF 854 >emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1355 Score = 246 bits (629), Expect = 1e-62 Identities = 150/438 (34%), Positives = 224/438 (51%), Gaps = 16/438 (3%) Frame = -2 Query: 1279 LTDLG*IGKDYTWSSNITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILL 1100 L DLG +G YTW + + + R+D L + W YPDS H SDHS I+L Sbjct: 169 LRDLGYVGTWYTWERGRSPSTCIRERLDRYLCSNSWLDLYPDSVPEHTIRYKSDHSAIVL 228 Query: 1099 VTDYTQKKLWIPF*KNY--SWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSK 926 + + ++ SWL D+ C + + W S G ++ S + L + Sbjct: 229 RSQRAGRPRGKTRRLHFETSWLLDDECEAVVRESWE----NSEGEVMTGRVASMGQCLVR 284 Query: 925 WNRDHFGNINQQVDKLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRF 746 W+ F N+++Q++ + L V QN E + + + L + H + ++ +SR Sbjct: 285 WSTKKFKNLSKQIETAEKALSVAQNNPISESACQECVLLEKKLDELHAKHEAYWYLRSRV 344 Query: 745 TFYKEHDNNTKFFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNP 566 K+ D NTK+FH K +++K RN + + D G W + I T +F I TSSNP Sbjct: 345 AEVKDGDKNTKYFHHKASQRKKRNFVKGLFDGLGTWREEADHIENIFTSYFSSIFTSSNP 404 Query: 565 S--LEERLYLVLPTVITSEDNVNLCKIPELQEIHSALE-----------GFQAGFYKSQW 425 S E + V+ V+T E N+ L + EI +AL+ G FY+ W Sbjct: 405 SDLSLEAVMSVIEPVVTEEHNLKLLEPFSKDEILAALQQMHPCKAPGPDGMHVIFYQRFW 464 Query: 424 EVVGTDICKMVWKFFQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKI 245 +VG D+ + +N T I+LIPK K PT ++RPI LCN YK++SK Sbjct: 465 HIVGDDVTSFISNILHGHSSPSCVNNTNIALIPKVKNPTKAAEFRPIALCNVLYKLMSKA 524 Query: 244 LVGRLKPVMEKIISPFQAAYVSGRMISDNTIIAQEIIHSMK-KKRGESGWIALKLDMSKA 68 +V RLK + +IIS Q+A+V GR+I+DN +IA E+ HSMK + R G IA+KLDMSKA Sbjct: 525 IVMRLKSFLPEIISENQSAFVPGRLITDNALIAMEVFHSMKNRNRSRKGTIAMKLDMSKA 584 Query: 67 FNRLEWSFLLKVLNYFGF 14 ++R+EW FL K+L GF Sbjct: 585 YDRVEWGFLRKLLLTMGF 602 >ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prunus persica] gi|462398983|gb|EMJ04651.1| hypothetical protein PRUPE_ppa022115mg [Prunus persica] Length = 1755 Score = 244 bits (623), Expect = 7e-62 Identities = 156/476 (32%), Positives = 238/476 (50%), Gaps = 19/476 (3%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITGT 1220 PWL +GD N + 
F NIV + DLG G +TW G Sbjct: 543 PWLCVGDFNEILSTDEKEGGPLRNNRQMQGFRNIV-DKLGFRDLGFNGYKFTWKCRF-GD 600 Query: 1219 CNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYT--QKKLWIPF*KNYS 1046 + R+D AL W +P + HL+ SDH PIL+ + QK + F Sbjct: 601 GFVRVRLDRALATTSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRYRRFHFEAM 660 Query: 1045 WLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKLQSEL 866 W + C I + W N P +K+K +L +W++ FG+I ++ L+++L Sbjct: 661 WTTHVDCEKTIKQVWESVGNLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEETRVLRAKL 720 Query: 865 EVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINRK 686 L E V D V L + + ++ Q+SR + K D NT +FH K + Sbjct: 721 ASLFQAPYSERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSYFHQKATNR 780 Query: 685 KARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDNV 506 + RN I ++D G W +R+ I + + D+F + SS S+ E + L +T++ Sbjct: 781 RRRNIIKGLEDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEILSALEPKVTADMQQ 840 Query: 505 NLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILK 359 L QEI A+ +G FY+ W +VG D+ V F QS +L+ Sbjct: 841 VLIADFSYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDDVVAAVRAFLQSNEMLR 900 Query: 358 EINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVS 179 ++N T+++LIPK K+P + RPI LCN Y+I +K L R+K VM+ +IS Q+A+V Sbjct: 901 QLNHTFVTLIPKVKEPRTMAQLRPISLCNVLYRIGAKTLANRMKFVMQSVISESQSAFVP 960 Query: 178 GRMISDNTIIAQEIIHSMK-KKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGF 14 GR+I+DN+I+A EI H +K ++RG G +ALKLDMSKA++R+EW FL K++ GF Sbjct: 961 GRLITDNSIVAFEIAHFLKQRRRGRKGSLALKLDMSKAYDRVEWEFLEKMMLAMGF 1016 >ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum lycopersicum] Length = 1333 Score = 244 bits (623), Expect = 7e-62 Identities = 152/480 (31%), Positives = 242/480 (50%), Gaps = 19/480 (3%) Frame = -2 Query: 1384 PWLVLGDLNF-----HIXXXXXXXXXXXXGFVNIVINDCDLTDLG*IGKDYTWSSNITGT 1220 PW ++GD N F+NI I C L D+G G+DYTW ++ Sbjct: 75 PWSIIGDFNVITSTSEKLGGRDYNINKSLEFINI-IEACGLVDMGYHGQDYTWCNHRKDG 133 Query: 1219 CNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSWL 1040 R+D + N W P S + HL +GSDH P+L+ Q F 
W Sbjct: 134 ARIWKRLDRGMTNDKWIETIPHSSITHLPSVGSDHCPLLMEICDIQSNTIKYFKFLNCWT 193 Query: 1039 SDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKLQSELEV 860 ++S + K W +V G+P + KL+ K L W++ +G++ ++V + ++ Sbjct: 194 ENDSFLETVEKCWKRDVIGNPMWNFHTKLRRLTKTLRIWSKQEYGDVFEKVKLYEDLVKK 253 Query: 859 LQNQTPGEVVHNDILK---VNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINR 689 +N + K +NA+ K+ K QQK++ + +E D NTK+FH I Sbjct: 254 AENIIIDNYSAKNSEKLNAINAEYIKFSKMEYKILQQKTQLHWLQEGDANTKYFHTVIRG 313 Query: 688 KKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDN 509 K+ R +I + D GNWI E+IA H D++ KI T N ++E + + +IT E N Sbjct: 314 KRNRMSIHKLMDESGNWIKGEEEIAKHACDYYEKIFTGMNGKIKEDILQCINPMITQEQN 373 Query: 508 VNLCKIPELQEI---------HSA--LEGFQAGFYKSQWEVVGTDICKMVWKFFQSRHIL 362 +L +IP++ E+ HSA +GF FY+ ++++ D+ V F+ + Sbjct: 374 KDLDRIPDMDELRRTIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHFYVGNIMP 433 Query: 361 KEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYV 182 + + ++LIPK P + D+RPI L N + KIISKIL RL ++ I+S Q+ +V Sbjct: 434 RYLTHACLTLIPKIDHPCRLKDFRPISLSNFTNKIISKILSTRLALILPSIVSANQSGFV 493 Query: 181 SGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFSENF 2 GR I++N ++AQEI H +KK + S + +KLDM KA++R+ W++ VL GFSE F Sbjct: 494 KGRSIAENILLAQEIFHGIKKPKDGSN-VVIKLDMVKAYDRVSWNYTCLVLRKMGFSEVF 552 >emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1369 Score = 244 bits (622), Expect = 9e-62 Identities = 144/484 (29%), Positives = 239/484 (49%), Gaps = 22/484 (4%) Frame = -2 Query: 1387 QPWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITG 1223 +PWL GD N + F N + +C DLG +G ++TW++N G Sbjct: 135 RPWLCGGDFNLMLVASEKKGGDGFNSREADIFRN-AMEECHFMDLGFVGYEFTWTNNRGG 193 Query: 1222 TCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTD-----YTQKKLWIPF* 1058 N + R+D + N W ++P S + HL SDH PI+ T+ K F Sbjct: 194 DANIQERLDRFVANDLWKIKFPGSFVSHLPKRKSDHVPIVASVKGAQSAATRTKKSKRFR 253 Query: 1057 KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKL 878 WL + + + W + + ++ K+LS W++ FG++ +++ Sbjct: 254 FEAMWLREGESDEVVKETWMRGTDAG-----INLARTANKLLS-WSKQKFGHVAKEIRMC 307 Query: 877 QSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAK 698 Q +++VL P E + ++A + + KR ++ Q+SR + K D NTKFFH K Sbjct: 308 QHQMKVLMESEPSEDNIMHMRALDARMDELEKREEVYWHQRSRQDWIKSGDKNTKFFHQK 367 Query: 697 INRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITS 518 + ++ RNN+ I++ G W + +D+ +F + S N + + ++ IT Sbjct: 368 ASHREQRNNVRRIRNEAGEWFEDEDDVTECFAHYFENLFQSGNNCEMDPILNIVKPQITD 427 Query: 517 EDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSR 371 E L +E+ +AL +G A FY+ W+ +G D+ V + Sbjct: 428 ELGTQLDAPFRREEVSAALAQMHPNKAPGPDGMNALFYQHFWDTIGEDVTTKVLNMLNNV 487 Query: 370 HILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQA 191 + +N+T+I LIPKKK VD+RPI LCN YKI++K+L R+K V+ +I Q+ Sbjct: 488 DNIGAVNQTHIVLIPKKKHCESPVDFRPISLCNVLYKIVAKVLANRMKMVLPMVIHESQS 547 Query: 190 AYVSGRMISDNTIIAQEIIHSM-KKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGF 14 +V GR+I+DN ++A E H + KKK G+ G++ LKLDMSKA++R+EW FL ++ GF Sbjct: 548 GFVPGRLITDNVLVAYECFHFLRKKKTGKKGYLGLKLDMSKAYDRVEWCFLENMMLKLGF 607 Query: 13 SENF 2 + Sbjct: 608 PTRY 611 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 243 bits (619), Expect = 2e-61 Identities = 152/491 (30%), Positives = 
248/491 (50%), Gaps = 23/491 (4%) Frame = -2 Query: 1405 VG*NVAQPWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTW 1241 +G + A PW+V GD N + F + + DC L D G G +TW Sbjct: 884 LGFHKAGPWMVGGDFNSIVSTVERLNGAAPHVGSMEDFASTLF-DCGLLDAGFEGNSFTW 942 Query: 1240 SSNITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF 1061 ++N + R+D + N +W+ + +R+ HLN GSDH P+L+ + +K F Sbjct: 943 TNN-----HMFQRLDRVVYNPEWAQCFSSTRVQHLNRDGSDHCPLLISCNTASQKGASTF 997 Query: 1060 *KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNI------ 899 ++W + + + W + GS K + ++ L WN+ FG+I Sbjct: 998 RFLHAWTKHHDFLPFVTRSWQTPIQGSGLSAFWFKQQRLKRDLKWWNKHIFGDIFEKLRL 1057 Query: 898 -NQQVDKLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDN 722 ++ +K + E + + T ++H K+N LS F+QQKS + E +N Sbjct: 1058 AEEEAEKKEIEFQHNPSLTNRNLMHKAYAKLNRQLSIEEL----FWQQKSGVKWLVEGEN 1113 Query: 721 NTKFFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYL 542 NTKFFH ++ +K+ R++I I+D EGN + I TD FR + + N L Sbjct: 1114 NTKFFHMRMRKKRVRSHIFQIQDSEGNVFDDIHSIQKSATDFFRDLMQAENCDLSRFDPS 1173 Query: 541 VLPTVITSEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKM 395 ++P +I+S DN LC P LQEI A+ +GF + FY+ W+++ D+ Sbjct: 1174 LIPRIISSADNEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDA 1233 Query: 394 VWKFFQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVME 215 V FF+ + + + T + L+PKK +YRPI LC KI++K+L RL ++ Sbjct: 1234 VLDFFRGSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLSKILP 1293 Query: 214 KIISPFQAAYVSGRMISDNTIIAQEIIHSMKKKRGESGWIALKLDMSKAFNRLEWSFLLK 35 IIS Q+ +V+GR+ISDN ++AQE+I + K G + LKLDM+KA++RL W FL Sbjct: 1294 SIISENQSGFVNGRLISDNILLAQELIGKIDAK-SRGGNVVLKLDMAKAYDRLNWDFLYL 1352 Query: 34 VLNYFGFSENF 2 ++ +FGF+ ++ Sbjct: 1353 MMEHFGFNAHW 1363 >gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa rugosa] Length = 1747 Score = 242 bits (617), Expect = 3e-61 Identities = 146/487 (29%), Positives = 241/487 (49%), Gaps = 24/487 (4%) Frame = -2 Query: 1390 AQPWLVLGDLN----FHIXXXXXXXXXXXXGFVNIVINDCDLTDLG*IGKDYTWSSNITG 1223 ++PWL GD N F+ + DC L + 
G YTW + G Sbjct: 412 SEPWLCCGDFNEILDFNEKTGAVQRSQRQIDGFRHAVEDCGLYEFAFTGFQYTWDNRRKG 471 Query: 1222 TCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTD--------YTQKKLWI 1067 N K R+D GN Q+ HL + SDH P+L D + +K+ ++ Sbjct: 472 DANVKERLDRGFGNLALIQQWGGISCHHLVSMSSDHCPLLFENDPPMSRGGNWRRKRRFL 531 Query: 1066 PF*KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQV 887 WL+ C + + W VN G KL+ L +WN++ FG++ ++V Sbjct: 532 ---FEDMWLTHEGCRGVVERQWLFGVNSVVG-----KLEQVAGGLKRWNQETFGSVKKKV 583 Query: 886 DKLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFF 707 L+ EL+VLQ Q P + +V L +R ++Q++R +++K D NT+FF Sbjct: 584 ASLREELDVLQRQPPTSNIICKRNEVECLLDGVLEREELLWKQRARVSWFKCGDRNTQFF 643 Query: 706 HAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTV 527 H ++ N I I + W + DI +FR + T+ S++E ++ + + Sbjct: 644 HQTAKQRGRSNRICGILGEDNRWRSDVTDIGCVFVSYFRNLFTAGGGSMDETIFEAVTSR 703 Query: 526 ITSEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFF 380 + + +L ++ +EI AL +G A F++ W ++G D+ + +F Sbjct: 704 VDATSKKSLDQVYRREEIELALKDMNPSKSPGSDGMPARFFQKFWNIIGNDVVDVCLRFL 763 Query: 379 QSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISP 200 + + N + I+LIPK + P V +YRPI LCN YK++SK+L RLK V+ ++I+ Sbjct: 764 NGDGSIADFNHSLIALIPKVQNPKKVTEYRPISLCNVVYKLVSKVLANRLKSVLPEVIAE 823 Query: 199 FQAAYVSGRMISDNTIIAQEIIHSMKKKRGESGW-IALKLDMSKAFNRLEWSFLLKVLNY 23 Q+A++S R+I DN I A EIIH +K++ +S IALKLDM+KA++R+EW FL +++ Sbjct: 824 NQSAFMSQRIIHDNIIAAFEIIHCLKRRGKDSRQKIALKLDMTKAYDRVEWGFLQRMMEV 883 Query: 22 FGFSENF 2 GF + F Sbjct: 884 MGFPDRF 890 >emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1355 Score = 242 bits (617), Expect = 3e-61 Identities = 158/445 (35%), Positives = 233/445 (52%), Gaps = 18/445 (4%) Frame = -2 Query: 1294 INDCDLTDLG*IGKDYTWS-SNITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSD 1118 ++D L DLG G +TW N TC R+ R+D + + W+ YP++ + H SD Sbjct: 164 MDDLFLRDLGYNGVWHTWERGNSLSTCIRE-RLDRFVCSPSWATMYPNTIVDHSMRYKSD 222 Query: 1117 HSPILLVTDYTQKKLWIP--F*KNYSWLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKST 944 H I L ++ T++ F SWL D +C I W+ S G +L Sbjct: 223 HLAICLRSNRTRRPTSKQRRFFFETSWLLDPTCEETIRDAWT----DSAGDSLTGRLDLL 278 Query: 943 RKILSKWNRDHFGNINQQVDKLQSELEVLQNQTPGEVVHNDILKVNADLSKWHKRRADFY 764 L W+ + GNI +Q+ +++S+L LQ Q L + L + H ++ + Sbjct: 279 ALKLKSWSSEKGGNIGKQLGRVESDLCRLQQQPISSANCEARLTLEKKLDELHAKQEARW 338 Query: 763 QQKSRFTFYKEHDNNTKFFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKI 584 +SR ++ D NTK+FH K +++K RN + + D G W +DI TD+F I Sbjct: 339 YLRSRAMEVRDGDRNTKYFHHKASQRKKRNFVKGLFDASGTWCEEVDDIECVFTDYFTSI 398 Query: 583 STSSNPS---LEERLYLVLPTVITSEDNVNLCKIPELQEIHSAL-----------EGFQA 446 TS+NPS L + L V P V+T E N L K +E++ AL +G A Sbjct: 399 FTSTNPSDVQLNDVLCCVDP-VVTEECNTWLLKPFSKEELYVALSQMHPCKAPGPDGMHA 457 Query: 445 GFYKSQWEVVGTDICKMVWKFFQSRHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTS 266 FY+ W ++G D+ + V IN T I+LIPK K PT ++RPI LCN Sbjct: 458 IFYQKFWHIIGDDVTQFVSSILHGSISPSCINHTNIALIPKVKNPTTPAEFRPIALCNVV 517 Query: 265 YKIISKILVGRLKPVMEKIISPFQAAYVSGRMISDNTIIAQEIIHSMK-KKRGESGWIAL 89 YK++SK LV RLK + +++S Q+A+V GR+I+DN +IA E+ HSMK + R G IA+ Sbjct: 518 YKLVSKALVIRLKDFLPRLVSENQSAFVPGRLITDNALIAMEVFHSMKHRNRSRKGTIAM 577 Query: 88 KLDMSKAFNRLEWSFLLKVLNYFGF 14 KLDMSKA++R+EW FL K+L GF Sbjct: 578 KLDMSKAYDRVEWGFLRKLLLTMGF 602 >ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626455 [Citrus sinensis] Length = 1452 Score = 241 bits (616), Expect = 4e-61 Identities = 146/453 (32%), Positives = 244/453 (53%), Gaps = 25/453 (5%) Frame = -2 Query: 1294 INDCDLTDLG*IGKDYTWSSNITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDH 1115 I C+L D+G G +TWS+ G + R+D L + DW + + L + SDH Sbjct: 244 
IRACNLMDMGFKGHKFTWSNRRFGVNYIEERLDRVLCSKDWGSTFQNLPAISLANWVSDH 303 Query: 1114 SPILLVTDYTQKKLWIP---F*KNY---SWLSDNSCSVEIAKGWSINVNGSPGFQCVQKL 953 PI+ KKL F ++Y W S +CS + W + +G+ VQK Sbjct: 304 CPIMFEVKVCCKKLHYKKNSFPRDYYEDMWSSYEACSNIVRSEWE-SFDGNSWESPVQKF 362 Query: 952 KSTRKI----LSKWNRDHFGNINQQVDKLQSELEVLQNQTPGEVVHNDILKVNADLSKWH 785 + K L W+++ F ++ ++L L++ + + + +I K+ +S Sbjct: 363 QRVAKRSLAHLKIWSKEEFEGRKKKQNELIDRLKMTKQEPLQAIDGEEIRKLEDQISNML 422 Query: 784 KRRADFYQQKSRFTFYKEHDNNTKFFHAKINRKKARNNIDAIKDHEGNWIWNREDIAAHL 605 +++Q+SR + KE D NTKFFH+K + ++ +N I ++D +GNW+ + E I Sbjct: 423 VDEEVYWKQRSRADWLKEGDKNTKFFHSKASARRRKNKIWGVEDDQGNWVDDPEGIEGEF 482 Query: 604 TDHFRKISTSSNPS---LEERLYLVLPTVITSEDNVNLCKIPELQEIHSAL--------- 461 F+++ TSSNPS + E L +LP V + E N +L + ++I AL Sbjct: 483 CGFFQQLFTSSNPSQTQISEALKGLLPKV-SQEMNTHLEEPFTPEDITRALSEMCPTKAP 541 Query: 460 --EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILKEINKTYISLIPKKKKPTIVVDYRP 287 +G A F++ W++VG + K + L +N T+I+LIPK +KP V+++RP Sbjct: 542 GPDGLPAAFFQKHWQIVGEGLTKTCLHILNEQGTLDSLNHTFIALIPKVEKPRKVMEFRP 601 Query: 286 IGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVSGRMISDNTIIAQEIIHSMKKKRG- 110 I LCN Y+I++K + RLKP++ IISP Q+A++ R+I+DN II E +H ++ +G Sbjct: 602 ISLCNVVYRIVAKAIANRLKPILNHIISPNQSAFIPNRLITDNVIIGYECLHKIRLSKGR 661 Query: 109 ESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFS 11 +G +ALKLD+SKA++R+EW+FL + ++ GFS Sbjct: 662 RNGLVALKLDISKAYDRVEWNFLEQTMSNLGFS 694 >ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prunus persica] gi|462413471|gb|EMJ18520.1| hypothetical protein PRUPE_ppa019733mg [Prunus persica] Length = 1275 Score = 241 bits (615), Expect = 6e-61 Identities = 155/476 (32%), Positives = 237/476 (49%), Gaps = 19/476 (3%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXG-----FVNIVINDCDLTDLG*IGKDYTWSSNITGT 1220 PWL +GD N + F NIV + DLG G +TW G Sbjct: 89 PWLCVGDFNEILSTDEKEGGPLRNNRQMQGFRNIV-DKLGFRDLGFNGYKFTWKCRF-GD 146 Query: 1219 CNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYT--QKKLWIPF*KNYS 1046 + R+D AL W +P + HL+ SDH PIL+ + QK + F Sbjct: 147 
GFVRVRLDRALATTSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRYHRFHFEAM 206 Query: 1045 WLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKLQSEL 866 W + C I + W + P +K+K +L +W++ FG+I ++ L+++L Sbjct: 207 WTTHVDCEKTIKQVWESVGDLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEETRVLRAKL 266 Query: 865 EVLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINRK 686 L E V D V L + + ++ Q+SR + K D NT +FH K + Sbjct: 267 ASLFQAPYSERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSYFHQKATNR 326 Query: 685 KARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDNV 506 + RN I ++D G W +R+ I + + D+F + SS S+ E + L +T++ Sbjct: 327 RRRNIIKGLEDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEILSALEPKVTADMQQ 386 Query: 505 NLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILK 359 L QEI A+ +G FY+ W +VG D+ V F QS +L+ Sbjct: 387 VLIADFSYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDDVVAAVRAFLQSNEMLR 446 Query: 358 EINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVS 179 ++N T+++LIPK K+P + RPI LCN Y+I +K L R+K VM+ +IS Q+A+V Sbjct: 447 QLNHTFVTLIPKVKEPRTMAQLRPISLCNVLYRIGAKTLANRMKFVMQSVISESQSAFVP 506 Query: 178 GRMISDNTIIAQEIIHSMK-KKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGF 14 GR+I DN+I+A EI H +K ++RG G +ALKLDMSKA++R+EW FL K++ GF Sbjct: 507 GRLIIDNSIVAFEIAHFLKQRRRGRKGSLALKLDMSKAYDRVEWEFLEKMMLAMGF 562 >gb|ABA96650.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1100 Score = 240 bits (612), Expect = 1e-60 Identities = 154/485 (31%), Positives = 247/485 (50%), Gaps = 20/485 (4%) Frame = -2 Query: 1396 NVAQPWLVLGDLN-----FHIXXXXXXXXXXXXGFVNIVINDCDLTDLG*IGKDYTWSSN 1232 N PW ++GD N F F I ++ CD+ DLG IG +T+ + Sbjct: 239 NSTAPWCLMGDFNEAMWQFEHFSEHKRKEKQMLDFREI-LSHCDVFDLGFIGTPWTYDNK 297 Query: 1231 ITGTCNRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KN 1052 G N + R+D A+ + WS YP +++ HL SDH PIL+ + K Sbjct: 298 RKGGYNVRVRLDRAVASQSWSVLYPQAQVRHLVSSRSDHCPILVQCTPDEDKDKPSRCMR 357 Query: 1051 YS--WLSDNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQVDKL 878 Y W + S S EI W + + KLK L W+R+ FG++N+++ L Sbjct: 358 
YEILWEREESLSEEIRTAWEQHHAATDLGSVSSKLKLVMGALQHWSREKFGSVNKELGAL 417 Query: 877 QSELEVLQNQTPGEVVHN-DILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHA 701 + ++E LQ G H+ + + + + R + Q+SR + +E D NT F H Sbjct: 418 RKKMEELQ--LGGRHTHDQEYQSCSRRMEEILYREEMMWLQRSRVAWLREGDRNTSF-HR 474 Query: 700 KINRKKARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVIT 521 K + +N I + +G+W R+++ T+ F+ + T + L ++ +T Sbjct: 475 KAAWRHRKNKISKLFLPDGSWTDQRKEMETMATNFFKDLYTKDPLVTPQPLLDLIMLKVT 534 Query: 520 SEDNVNLCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQS 374 + N +LCK +EI AL +GF A F++ WEV+ D+CK V FF Sbjct: 535 EQMNEDLCKAFSDKEISDALFQIGPIKAPGPDGFPARFFQRNWEVLKNDVCKAVKLFFDQ 594 Query: 373 RHILKEINKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQ 194 + + + +N T I LIPKKK+ + D+RPI LCN YK+++K LV L+P++E IIS Q Sbjct: 595 KVMPEGVNTTAIVLIPKKKESKELKDFRPISLCNVIYKVVAKCLVNHLRPILESIISQEQ 654 Query: 193 AAYVSGRMISDNTIIAQEIIHSM-KKKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFG 17 +A++ GRMI+DN +IA E H++ + KR + A KLD++KA++R++W +L VL+ G Sbjct: 655 SAFIPGRMITDNALIAFECFHNIAQSKRESQEFCAYKLDLAKAYDRVDWQYLEGVLDRMG 714 Query: 16 FSENF 2 FS + Sbjct: 715 FSNTW 719 >gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score: 42.57) [Arabidopsis thaliana] Length = 1662 Score = 239 bits (611), Expect = 2e-60 Identities = 151/479 (31%), Positives = 250/479 (52%), Gaps = 18/479 (3%) Frame = -2 Query: 1384 PWLVLGDLNFHIXXXXXXXXXXXXGFV----NIVINDCDLTDLG*IGKDYTWSSNITGTC 1217 PW+++GD N + + +++ CDL D+ IG ++W + Sbjct: 511 PWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGE-RHSH 569 Query: 1216 NRKSRIDMALGNGDWSFQYPDSRLFHLNHLGSDHSPILLVTDYTQKKLWIPF*KNYSWLS 1037 K +D A N + +F +P + L L GSDH P+ L + T+ + PF + L Sbjct: 570 TVKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRLLE 629 Query: 1036 DNSCSVEIAKGWSINVNGSPGFQCVQKLKSTRKILSKWNRDHFGNINQQV--DKLQSELE 863 + GW+ +NG ++++ R+ ++K H N+N ++ ++LQ+ L+ Sbjct: 630 VPHFKTYVKAGWNKAINGQRK-HLPDQVRTCRQAMAKLK--HKSNLNSRIRINQLQAALD 686 Query: 862 
VLQNQTPGEVVHNDILKVNADLSKWHKRRADFYQQKSRFTFYKEHDNNTKFFHAKINRKK 683 + I + +L+ ++ ++QQKSR + KE D NT+FFHA + Sbjct: 687 KAMSSV-NRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDRNTEFFHACTKTRF 745 Query: 682 ARNNIDAIKDHEGNWIWNREDIAAHLTDHFRKISTSSNPSLEERLYLVLPTVITSEDNVN 503 + N + IKD EG ++I H + F K+ S+ + + ++T + N + Sbjct: 746 SVNRLVTIKDEEGMIYRGDKEIGVHAQEFFTKVYESNGRPVSIIDFAGFKPIVTEQINDD 805 Query: 502 LCKIPELQEIHSAL-----------EGFQAGFYKSQWEVVGTDICKMVWKFFQSRHILKE 356 L K EI++A+ +G A FYKS WE+VG D+ K V FF++ ++ + Sbjct: 806 LTKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQS 865 Query: 355 INKTYISLIPKKKKPTIVVDYRPIGLCNTSYKIISKILVGRLKPVMEKIISPFQAAYVSG 176 IN T I +IPK P + DYRPI LCN YKIISK LV RLK ++ I+S QAA++ G Sbjct: 866 INHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPG 925 Query: 175 RMISDNTIIAQEIIHSMK-KKRGESGWIALKLDMSKAFNRLEWSFLLKVLNYFGFSENF 2 R+++DN +IA E++HS+K +KR ++A+K D+SKA++R+EW+FL + FGFSE + Sbjct: 926 RLVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETW 984