BLASTX nr result
ID: Rehmannia25_contig00016563
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00016563 (2311 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...  451   e-124
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...  394   e-107
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]            252   6e-64
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]            250   2e-63
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]  249   4e-63
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...  238   7e-60
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]  232   5e-58
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...  232   6e-58
emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulga...  231   1e-57
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]  226   4e-56
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...  225   6e-56
ref|XP_004295654.1| PREDICTED: uncharacterized protein LOC101314...  219   5e-54
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...  218   7e-54
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...  218   7e-54
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]  218   1e-53
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]  216   3e-53
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]  216   5e-53
ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein A...  214   1e-52
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...
211 1e-51 gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas... 211 1e-51 >ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 872 Score = 451 bits (1159), Expect = e-124 Identities = 260/747 (34%), Positives = 390/747 (52%), Gaps = 8/747 (1%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F C QGVRQGDPLSPLLFC AE+VL+R + M + G + + R P +L+A D++ Sbjct: 134 FSCGQGVRQGDPLSPLLFCLAEEVLSRGISMLVSSGQVKRIHSPRGTLSPSYVLFAGDVI 193 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + + +N + YG +SGQ N +K++V+ + R + + L I G Sbjct: 194 VFCRGNRQNLLRVMSFFYEYGSVSGQIINKDKSQVFIGKHN--RRRHSISDCLGIPLGTA 251 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P YLG P+F G P++ +FQ DK+ K W G+ LSMAGR+ L+ SVI S +V++ Sbjct: 252 PFMYLGAPIFHGKPRVAHFQAIVDKVRLKLSSWVGSFLSMAGRLQLIKSVIYSMFVYTFQ 311 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 VY WP +LL+ +E+ +NF+WSGDI+K+G V+W CCAP +EGGLG++ L N S L Sbjct: 312 VYEWPVSLLRKVERWCRNFLWSGDIDKRGIPLVSWTSCCAPIDEGGLGLKKLDVLNSSLL 371 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNGGPRMAYIGSSIWSGLRPVVLELISQSRWILGERSG 902 +K W++ S +R+R+ R +Y SSIW G+R + + +RW++G Sbjct: 372 LKRCWEIFTSSFEGCCFIRNRF---SKRRSYAPSSIWPGVRKFWGLVQNNTRWLVGTGDK 428 Query: 903 VRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVVIDILNF 1082 + FW DN LG + + G L +SDY NG W + + V I Sbjct: 429 ISFWRDNFLGRPLIEFFGNHGALNDNSSL-VSDYIDNGSWVLPPLLQLNLSAVCNLICQV 487 Query: 1083 PIA--PHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSITVWRS 1256 PI+ P D+ +W S G++++K+A+ + P V WGK +WS I R S+ W+ Sbjct: 488 PISINPSMEDKLIWQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKV 547 Query: 1257 IHNRLPV--LDNIRGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFEVQMPLN 1430 + + L RG + C C +ES+DH+F C FA ++W FE+ + N Sbjct: 548 MRGTVISYHLLQRRGVALVSRCEFCGNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPN 607 Query: 1431 GGVNHFYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAII-LVKA 1607 F 
+ R SPQL LW S +W IWHARN+ F D R A + LV Sbjct: 608 TIAEVFSLGLAMDR-SPQLKELWLICFTSILWYIWHARNQIRF-DSRTFSVAGVCRLVSR 665 Query: 1608 FIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKVNTDGCAR 1787 I S+ +M N+++DL L+ R + P + V W+PP GW+K+N+DG + Sbjct: 666 HIQASSRLATGHMHNTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGAWK 725 Query: 1788 GAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIKLWLESD 1967 G +FR +G F G F+ ++ + ++++ +TAIE A+ R W +WLE D Sbjct: 726 HEEGIGGFGAVFRYYKGQFVGAFASHIDIPSSIAAKVMVVITAIELAWVRDWKHVWLEVD 785 Query: 1968 STYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVADKLSKMDVPLE- 2144 + V + + SL VPW+ RW L+ IS M F+ SHI+REGN+VAD L+ + Sbjct: 786 FSTVLDYIRSPSL-VPWQLRVRWLNCLYRISTMTFKSSHIFREGNRVADALANHGTSMSE 844 Query: 2145 --WSYTIPESIVTELNEDSSGQISFRF 2219 W P I++ D G +FRF Sbjct: 845 EVWWDVPPSFILSYYERDLLGMPNFRF 871 >ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 751 Score = 394 bits (1013), Expect = e-107 Identities = 235/686 (34%), Positives = 344/686 (50%), Gaps = 14/686 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F CT+GVRQGDPLSP+LFC AE+ L+R L + R SL +LYADD+ Sbjct: 74 FSCTKGVRQGDPLSPILFCIAEEALSRGLTALFSSKKVRSISLPRGCSLTH-VLYADDLF 132 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I ++ ++ L +YG SGQ N +K+ Y + R +V+ L G Sbjct: 133 IFCRGDTKSLRQLQSFLDNYGAASGQLVNKDKSTFYLGASHFHR-RHQVKKILGFKLGTS 191 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P +YLGVP+FKG P ++ Q DK ++ WKG LSMAGR+ LV+ V S +HS Sbjct: 192 PFSYLGVPIFKGKPCRKHLQALVDKAKARLAGWKGKLLSMAGRVQLVHDVFQSMLLHSFS 251 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 +Y W +LL + +NFIWSGD+ + VT++W + C P+ E GL +R+L A + L Sbjct: 252 IYLWATSLLSHLSACARNFIWSGDLAIRKLVTISWQQVCTPRNEAGLDLRNLKALYTAGL 311 Query: 723 MKAAWK-LLQSRS------MVFEILRHRYFNGGPRMAYIGSSIWSGLRPVVLELISQSRW 881 + AW+ LLQS S F I RH F Y SS+W GL+ V+ L SRW Sbjct: 312 
ISLAWQTLLQSSSWGSFACRRFTIFRHMKFQ------YFTSSVWHGLKRVLPLLFEHSRW 365 Query: 882 ILGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGV 1061 I+G+ + + FW D L SI ++ + L ++D+ ++ W F + Sbjct: 366 IIGDGNSILFWSDKWLHSSIIQQLNM-GSLSHLLNSRVADFIWDQQWALPSHFSNLFPDC 424 Query: 1062 VIDILNFPI--APHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRR 1235 IL P+ P S D +W HS G S + Y L R F ++ W +W S IP R Sbjct: 425 AKQILEIPLPNTPES-DILIWEHSSSGIFSFSDGYELVRPYFEKLDWASSVWHSFIPPRY 483 Query: 1236 SITVWRSIHNRLPVLDNI--RGSWGPTACSLC-YADSESVDHLFTRCRFALAIWEWIQDA 1406 S+ WR H +LP D + RG + C LC ++ +E + HLF C FA IW+W+ Sbjct: 484 SVLAWRIFHLKLPTDDQLQRRGIPFVSVCQLCSFSHTEDIPHLFVNCSFAQHIWQWLAYY 543 Query: 1407 FEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTA 1586 F +P +G +N + + FSPQL ++W + + + IW + N+ F + +P Sbjct: 544 FGTSLPSSGSLNDLWSSVTGKAFSPQLKNIWFASCLFALMAIWKSHNKLRFDNKQPSLMR 603 Query: 1587 AIILVKAFIMESA--TKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWM 1760 VKA++ A T C V D L + V K ++ V W+PPL W+ Sbjct: 604 VFRSVKAWVRYIAPYTPGC---VRGVLDSKVLSSMGVILVLKCQSALRIVLWHPPLIPWL 660 Query: 1761 KVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRG 1940 K+NT+G ++G PG G+FR+ G G + +G F EL+ + +E AF G Sbjct: 661 KLNTNGFSKGNPGLAGCGGVFRDSFGRLIGGYCQGLGTQTTFFVELMTVILGVEFAFHFG 720 Query: 1941 WIKLWLESDSTYVCGLLETRSLQVPW 2018 W +WLESDST + + + S PW Sbjct: 721 WHHIWLESDSTTILQCISSSSFAPPW 746 >dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 252 bits (643), Expect = 6e-64 Identities = 204/758 (26%), Positives = 331/758 (43%), Gaps = 20/758 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGDPL+P LF + LA + + P TR + L +ADD++ Sbjct: 631 FKPGRGLRQGDPLAPYLFNLVMERLAHDIQTRVNARTWKPVHITRGGTGISHLFFADDLM 690 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + AS + L S+ SG + N K+ ++ N + R + + LQ+ Sbjct: 691 LFGEASEHQAQIMFDCLDSFSNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAES 750 Query: 363 
PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 TYLG+P+ K F DK+ +K WK +SL+MAGR LV + +++ ++M Sbjct: 751 LGTYLGIPMLKERVSRNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQ 810 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V P + ++K +NF+W D N + +VNWA C P+ EGGLG+R N++FL Sbjct: 811 VMALPVSTCNEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFL 870 Query: 723 MKAAWKLLQSRSMVF-EILRHRYFNGGPRMAYIGSSI----WSGLRPVVLELISQSRWIL 887 K AW++ + ++ ++LR +Y + S W + L +W + Sbjct: 871 TKMAWQIFSNIDKLWVKVLREKYVKNADFLHLQSQSNCSWGWRSIMKGKDVLAGAIKWNV 930 Query: 888 GERSGVRFWLDN*LG----YSIADRIGIPPHLFQYYEYPISDYFYN-GVWHFTEEFIVEY 1052 G + FW D +G S D I PH+ + + D + W + Sbjct: 931 GNGRKINFWNDWWVGDGPLASNTDCIN-QPHM---TDIKVEDLITSQRRWDTGALHNILP 986 Query: 1053 TGVVIDILNFPIAPHS--TDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIP 1226 T ++ + PIA +S D W HS G V+ AY+L + + WIW + Sbjct: 987 TNMIDMVRATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDGDDRSHDWIWRATCT 1046 Query: 1227 LRRSITVWRSIHNRLPVLDNI----RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEW 1394 + + +W+ + N L V N+ RG +C +C + E++DHLF RC A A W+ Sbjct: 1047 EKIKLFMWKIVKNGLMV--NVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDS 1104 Query: 1395 IQDAFEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRP 1574 Q + ++ + A + ++ W +W +W ARN +F + Sbjct: 1105 AVPPLTFQTSNHLHMHSWMKAACSSQQKDGYSTNWSLIFPYILWNLWKARNRLVFDN--N 1162 Query: 1575 LHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPG 1754 + + IL ++F MES+ C L +R +Q V W PP G Sbjct: 1163 ITAPSDILNRSF-MESSEARC----------LLAKRTGLQ-----TAFQTWVVWSPPAAG 1206 Query: 1755 WMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFK 1934 + K+N+DG + A G+ RN G + ++ N+G +F +EL + A Sbjct: 1207 FTKLNSDGACKSHSHLASAGGLLRNENGLWVAGYTCNIGTANSFLAELWGLREGLLLAKN 1266 Query: 1935 RGWIKLWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVAD 2114 RG+ KL E+DS V +L P + L E +V+HI REGN+ AD Sbjct: 1267 RGFTKLIAETDSEAVVQVLRKDGPVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCAD 1326 Query: 2115 KLSKMDVPLEWSYTI----PESIVTELNEDSSGQISFR 2216 L+ + W TI P+ 
+ L D+ G S R Sbjct: 1327 FLANLGQSSSWGTTILERPPDDLRIFLQRDAIGLASSR 1364 >dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 250 bits (639), Expect = 2e-63 Identities = 204/758 (26%), Positives = 329/758 (43%), Gaps = 20/758 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGDPL+P LF + LA + + P TR + L +ADD++ Sbjct: 631 FKPGRGLRQGDPLAPYLFNLVMERLAHDIQTRVNARTWKPVHITRGGTGISHLFFADDLM 690 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + AS + L S+ SG + N K+ ++ N + R + + LQ+ Sbjct: 691 LFGEASEHQAQIMFDCLDSFSNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAES 750 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 TYLG+P+ K F DK+ +K WK +SL+MAGR LV + +++ ++M Sbjct: 751 LGTYLGIPMLKERVSRNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQ 810 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V P + ++K +NF+W D N + +VNWA C P+ EGGLG+R N++FL Sbjct: 811 VMALPVSTCNEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFL 870 Query: 723 MKAAWKLLQSRSMVF-EILRHRYFNGGPRMAYIGSSI----WSGLRPVVLELISQSRWIL 887 K AW++ + ++ ++LR +Y + S W + L +W + Sbjct: 871 TKMAWQIFSNIDKLWVKVLREKYVKNADFLHLQSQSNCSWGWRSIMKGKDVLAGAIKWNV 930 Query: 888 GERSGVRFWLDN*LG----YSIADRIGIPPHLFQYYEYPISDYFYN-GVWHFTEEFIVEY 1052 G + FW D +G S D I PH+ + + D + W + Sbjct: 931 GNGRKINFWNDWWVGDGPLASNTDCIN-QPHM---TDIKVEDLITSQRRWDTGALHNILP 986 Query: 1053 TGVVIDILNFPIAPHS--TDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIP 1226 T ++ + PIA +S D W HS G V+ AY+L + + WIW + Sbjct: 987 TNMIDMVRATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDGDDRSHDWIWRATCT 1046 Query: 1227 LRRSITVWRSIHNRLPVLDNI----RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEW 1394 + + +W+ + N L V N+ RG +C +C + E++DHLF RC A A W+ Sbjct: 1047 EKIKLFMWKIVKNGLMV--NVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDS 1104 Query: 1395 IQDAFEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRP 1574 Q + ++ + A + + W +W +W ARN +F + Sbjct: 1105 
AVPPLTFQTSNHLHMHSWMKAACSSQQKDGYGTNWSLIFPYILWNLWKARNRLVFDN--N 1162 Query: 1575 LHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPG 1754 + + IL ++F MES+ C L +R +Q V W PP G Sbjct: 1163 ITAPSDILNRSF-MESSEARC----------LLAKRTGLQ-----TAFQTWVVWSPPAAG 1206 Query: 1755 WMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFK 1934 + K+N+DG + A G+ RN G + + N+G +F +EL + A Sbjct: 1207 FTKLNSDGACKSHSHLASAGGLLRNENGLWVAGYICNIGTANSFLAELWGLREGLLLAKN 1266 Query: 1935 RGWIKLWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVAD 2114 RG+ KL E+DS V +L P + L E +V+HI REGN+ AD Sbjct: 1267 RGFTKLIAETDSEAVVQVLRKDGPVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCAD 1326 Query: 2115 KLSKMDVPLEWSYTI----PESIVTELNEDSSGQISFR 2216 L+ + W TI P+ + L D+ G S R Sbjct: 1327 FLANLGQSSSWGTTILERPPDDLRIFLQRDAIGLASSR 1364 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 249 bits (636), Expect = 4e-63 Identities = 190/722 (26%), Positives = 323/722 (44%), Gaps = 15/722 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SP LF A + LAR L+ Q + + + S+ L +ADD++ Sbjct: 1474 FKSERGLRQGDSISPQLFILAAEYLARGLNALYDQYPSLHYSSGCSLSVSH-LAFADDVI 1532 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I S I L Y ++SGQ+ NP+K+ V N + + + Sbjct: 1533 IFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTNMASSRRQIILQATGFSHRPL 1592 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 PITYLG PL+KG K+ F KI + W+ +LS GRITL+ S +SS ++ + Sbjct: 1593 PITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPGGRITLLRSTLSSLPIYLLQ 1652 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +L+ + + + NF+W G K +W + P EGGL +R++ ++F Sbjct: 1653 VLKPPVIVLERINRLLNNFLWGGSTASKRIHWASWGKIALPIAEGGLDIRNVEDVCEAFS 1712 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNG------GPRMAYIGSSIWSGLRPVVLELISQS--R 878 MK W+ + S+ + +R +Y G P++ S W R V + I++ R Sbjct: 1713 MKLWWRFRTTNSLWTQFMRAKYCGGQLPTDVQPKLH--DSQTWK--RMVTISSITEQNIR 1768 Query: 879 
WILGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTG 1058 W +G + FW D +G + + F +SD+F N W+ + V Sbjct: 1769 WRIG-HGELFFWHDCWMG---EEPLVNRNQAFASSMAQVSDFFLNNSWNVEKLKTVLQQE 1824 Query: 1059 VVIDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRS 1238 VV +I+ PI S D+ W + GD S+K A+ L RN+ E +IW +PL S Sbjct: 1825 VVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTS 1884 Query: 1239 ITVWRSIHNRLPV--LDNIRGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFE 1412 +WR +H+ +PV +G + C C ++ ES+ H+ + A +W + F+ Sbjct: 1885 FFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSE-ESLMHVMWKNPVANQVWSYFAKVFQ 1943 Query: 1413 VQMPLNGGVNHFY-VWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAA 1589 +Q+ +N W +S + + + T+W +W RN+ R++ Sbjct: 1944 IQIINPCTINQIICAWFYSGDYS-KPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRV 2002 Query: 1590 IILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKVN 1769 + + + + D + + + P K + W P G +K+N Sbjct: 2003 VWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLN 2062 Query: 1770 TDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIK 1949 DG + P G+ R+ G FS N G + ++EL+A + + + Sbjct: 2063 VDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLLCIEHNISR 2122 Query: 1950 LWLESDSTYVCGLLETRSLQVPWKFLARWRMTL----HYISHMEFRVSHIYREGNKVADK 2117 LW+E D+ + + ++ + +R R L +S + FR+SHI+REGN+ AD Sbjct: 2123 LWIEMDAK-----VAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADH 2177 Query: 2118 LS 2123 LS Sbjct: 2178 LS 2179 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 238 bits (608), Expect = 7e-60 Identities = 194/725 (26%), Positives = 317/725 (43%), Gaps = 17/725 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SP+LF A + L+R ++ + + + + ++ L +ADDI+ Sbjct: 595 FKSERGLRQGDSISPMLFILAAEYLSRGINELFSRYISLHYHSGCSLNISH-LAFADDIM 653 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I T S I L Y +ISGQ+ N +K+ +N P + + + Sbjct: 654 IFTNGSKSVLEKILEFLQEYEQISGQRVNHQKSCFVTANNMPSSRRQIISQTIGFLHKTL 713 
Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 PITYLG PLFKG K+ F +KI + W+ LS GRITL+ SV+SS ++ + Sbjct: 714 PITYLGAPLFKGPKKVMLFDSLINKIRERITGWENKILSPGGRITLLRSVLSSMPIYLLQ 773 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +++ +E+ +F+W ++ W P EGGLG+RSL + +F Sbjct: 774 VLKPPACVIQKIERLFNSFLWGSSMDSTRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFS 833 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNG------GPRMAYIGSSIWSGLRPVVLELISQSRWI 884 K W+ +S+ +R +Y G P+ S+ W L Q RW Sbjct: 834 AKLWWRFDTCQSLWVRYMRLKYCTGQIHHNIAPKPH--DSATWKPLLAGRATASQQIRWR 891 Query: 885 LGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVV 1064 +G + + FW D +G + P F ++ +F + W + +V Sbjct: 892 IG-KGDIFFWHDAWMGDE--PLVNSFPS-FSQSMMKVNYFFNDDAWDVDKLKTFIPNAIV 947 Query: 1065 IDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSIT 1244 +IL PI+ D W + GD S K A+ L R + G+ IW IPL S Sbjct: 948 EEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFF 1007 Query: 1245 VWRSIHNRLPVLDNIRGSWGPTACS-LCYADSESVDHLFTRCRFALAIWEWIQDAFEVQM 1421 +WR++HN LPV ++ A LC ES+ H+ A +W + F++ + Sbjct: 1008 LWRTLHNWLPVEVRMKAKGIQLASKCLCCKSEESLLHVLWESPVAQQVWNYFSKFFQIYV 1067 Query: 1422 --PLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDV-----RPLH 1580 P N + W F+ + + ++ W +W RN+ RD+ R + Sbjct: 1068 HNPQN-ILQILNSWYYSGDFT-KPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIW 1125 Query: 1581 TAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWM 1760 IL K F C + D+ + + P K + W PL G + Sbjct: 1126 RIMKILRKLF---QGGLLCKWQWKGDLDIAIHWGFNFAQERQARP--KIINWIKPLIGEL 1180 Query: 1761 KVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRG 1940 K+N DG ++ G+ R+ G FS N G + ++EL+A + + Sbjct: 1181 KLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYN 1240 Query: 1941 WIKLWLESDSTYVCGLLETR---SLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVA 2111 ++W+E D+ V +++ S ++ + L R L IS R+SHI+REGN+ A Sbjct: 1241 VSRVWIEVDAQVVIQMIQNHHKGSYKIQY-LLESIRKCLQVIS---VRISHIHREGNQAA 1296 Query: 2112 DKLSK 2126 D LSK Sbjct: 1297 DFLSK 1301 
>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 232 bits (592), Expect = 5e-58 Identities = 191/718 (26%), Positives = 306/718 (42%), Gaps = 11/718 (1%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SP LF A + L+R L+ Q + + S+ L +ADD+L Sbjct: 1511 FKSERGLRQGDSISPQLFILAAEYLSRGLNALYDQYPSLHYSSGVPLSVSH-LAFADDVL 1569 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I T S I L Y ISGQ+ N +K+ N P + + Sbjct: 1570 IFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIAQATGFNHQLL 1629 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 PITYLG PL+KG K+ F KI + W+ LS GRITL+ SV++S ++ + Sbjct: 1630 PITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQ 1689 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +L+ + + +F+W G K +WA+ P EGGL +RSL ++F Sbjct: 1690 VLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAEVFEAFS 1749 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNGGPRM----AYIGSSIWSGLRPVVLELISQSRWILG 890 MK W+ + S+ +R +Y G M S W + RW +G Sbjct: 1750 MKLWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVG 1809 Query: 891 ERSGVRFWLDN*LGYS--IADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVV 1064 + V FW D +G + I+ + Q + D+F N W+ + V VV Sbjct: 1810 Q-GNVFFWHDCWMGEAPLISSNQEFTSSMVQ-----VCDFFTNNSWNIEKLKTVLQQEVV 1863 Query: 1065 IDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSIT 1244 +I PI + D W + GD S+K A+ L R + +IW +PL S Sbjct: 1864 DEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFF 1923 Query: 1245 VWRSIHNRLPVLDNI--RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFEVQ 1418 +WR +H+ +PV + +G + C C ++ ES+ H+ A+ +W + F++ Sbjct: 1924 LWRLLHDWIPVELKMKSKGLQLASRCRCCKSE-ESIMHVMWDNPVAMQVWNYFAKLFQIL 1982 Query: 1419 MPLNGGVNHFY-VWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAII 1595 + +N W + + + + +W +W RN+ R++ + Sbjct: 1983 IINPCTINQIIGAWFYSGDYC-KPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVW 2041 Query: 1596 
LVKAFIMESATKDCNYMSNSVYDLLCLRRLSV--QGRPKPPPVIKHVRWYPPLPGWMKVN 1769 V I + + D + + Q PP K W+ P G K+N Sbjct: 2042 RVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPP--KVFSWHKPSLGEFKLN 2099 Query: 1770 TDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIK 1949 DG A+ + GI R+ G FS N+G + ++EL+A + + Sbjct: 2100 VDGSAKQS-HNAAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRR 2158 Query: 1950 LWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVADKLS 2123 LW+E D+ V LL+ + P +SH FR SHI+REGN+ AD L+ Sbjct: 2159 LWIEMDAISVIRLLQGNH-RGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLA 2215 >gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago truncatula] Length = 642 Score = 232 bits (591), Expect = 6e-58 Identities = 117/262 (44%), Positives = 160/262 (61%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F C +GVRQGDPLSPLLFC E+VL+R++ + +G I +RN LP Y DD++ Sbjct: 374 FNCNRGVRQGDPLSPLLFCIVEEVLSRSISILADKGLIDLIAASRNNCLPFHCFYVDDLM 433 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + A + +K + T Y SGQ N K+ ++ T R M + N L G+ Sbjct: 434 VFCKAKMSSLIVLKSLFTRYADCSGQIMNIRKSFIFAGGITDTR-MNNIVNILGFNVGSL 492 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P TYLG P+FKG PK +FQP DK+ +K +WK + LS+AGRI LV SV+ S VH+M Sbjct: 493 PFTYLGAPIFKGKPKGIHFQPIADKVKAKLAKWKASLLSIAGRIQLVKSVVQSMLVHTMS 552 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 +Y WP +LK MEK +KNFIWSGD+ K+ VTV W + CA EEGGLGV+SL+ N++ Sbjct: 553 IYSWPIKILKEMEKWIKNFIWSGDVTKRKMVTVAWRKICADYEEGGLGVKSLICLNEATN 612 Query: 723 MKAAWKLLQSRSMVFEILRHRY 788 +K W L+QS I+R+ + Sbjct: 613 LKICWNLMQSDEQWANIIRNSW 634 >emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 231 bits (589), Expect = 1e-57 Identities = 203/760 (26%), Positives = 329/760 (43%), Gaps = 52/760 (6%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGDPLSP LF + L + + A G + + RN L YADD L Sbjct: 624 FKLKRGLRQGDPLSPFLFVLIGEALNQVILKATNMGLWSGVEVCRNGLKITHLQYADDTL 683 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + + A E+ +IK L + SG Q N K+ + NT + N L G Sbjct: 684 VFSDARLESLKNIKMALILFHLASGLQVNFHKSSIIGM-NTSKTWLNEAANSLLCKTGDI 742 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P TYLG+P+ + KI+ + P +KI K WKG LS+ GR+TL+ S +S+ ++ M Sbjct: 743 PFTYLGLPIGENIHKIKAWDPIINKISMKLATWKGRMLSIGGRLTLIKSSLSNLPLYFMS 802 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 ++ P+ +++ + K + F+WSGD+ K+ V W PK+ GGLG+ ++ N + L Sbjct: 803 LFPIPKGVVEKINKITRRFLWSGDMEKRSIPLVAWKIAQLPKDMGGLGIGNIFHKNSAML 862 Query: 723 MKAAWKLLQSRSMVF-EILRHRY--------------FNGGPRMAYIGSSIWSGLRPVVL 857 K W+LL S ++ +++ ++Y +GGP +I ++I+ + V Sbjct: 863 SKWMWRLLSDSSPIWCQVVCNKYKYQGTLSITDIKVPKSGGP-WRHICAAIFH--QANVK 919 Query: 858 ELISQS-RWILGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDY-------FYN 1013 EL+ + R +G S RFWLD+ L S + P LF P + YN Sbjct: 920 ELLYKGFRKNIGSGSQTRFWLDSWL--SSSSLKSEFPRLFSITMNPNASVESLGFWEGYN 977 Query: 1014 GVWHFTEEFIVEYTGVV----IDILNFPIAP--HSTDRRVWIHSKGGDVSSKE-AYTLAR 1172 VW F+ + I+ + +D L + P + D +W SK G S+K + L + Sbjct: 978 WVWSFSWKRILRPQDAIEKARLDNLLLQVCPARQAQDHLIWAFSKSGSFSTKSVSRQLVK 1037 Query: 1173 NQFPEVQWG-KWIWSSHIPLRRSITVWRSIHNRLPVLDNIRG----SWGPTACSLCYADS 1337 Q P Q + +W +P R + VW ++ ++ D + C LC + Sbjct: 1038 LQHPHYQDAIRGVWVGLVPHRIELFVWLALLGKINTRDKLASLGIIHGDCNICPLCMTEP 1097 Query: 1338 ESVDHLFTRCRFALAIWEWIQDAFEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIIS 1517 E+ +HL C A IW W + ++ + + + SP +W Sbjct: 1098 ETAEHLLLHCPVASQIWSWWIGLWRIKWAFPLSLREAFTQWFWPKNSPFFKKVWSAVFFI 1157 Query: 1518 TIWVIWHARNEWIFRD----VRPLHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRL 1685 +W +W RN+ IF + V+ 
L ++ + +I + ++ + + CL+ Sbjct: 1158 IVWTLWKERNQRIFSNNPSTVKVLKDMVLMRLGWWISGWKDEFPYNPTDIMRNPSCLQWS 1217 Query: 1686 SVQGRPKPPPVIK-HVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSH 1862 ++ K VIK V W PP +K N D R G+ RN G F FS Sbjct: 1218 GIKDDSKADLVIKSSVSWCPPPSQIIKWNVDASVHTCSARSAIGGVLRNHSGNFMCLFSS 1277 Query: 1863 NVGRGFAFESELIAAMTAIERAFKRG-------WIKLWLESDSTYVCGLLETRSLQVPWK 2021 + F A + AI RA K K+ LESDS + S PW Sbjct: 1278 PI----PFMEINCAEILAIHRAVKISSAKEELKGAKIILESDSKNAVLWCNSDS-GGPWN 1332 Query: 2022 FLARWRMTLHYISH-----MEFRVSHIYREGNKVADKLSK 2126 L++I + ++ + H R N VAD ++K Sbjct: 1333 L----NFQLNFIRNTRKGGLDISIVHRSRSANVVADSMAK 1368 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 226 bits (575), Expect = 4e-56 Identities = 189/718 (26%), Positives = 308/718 (42%), Gaps = 11/718 (1%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SP LF A + L+R L+ Q + + + S+ L +ADD+L Sbjct: 1509 FKSERGLRQGDSISPQLFIIAAEYLSRGLNALYDQYPSLHYSSGVSISVSH-LAFADDVL 1567 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I T S I L Y ISGQ+ N +K+ N + + + Sbjct: 1568 IFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTNVSSSRRQIIAQTTGFSHQLL 1627 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 ITYLG PL+KG K+ F KI + W+ LS GRITL+ SV++S ++ + Sbjct: 1628 LITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQ 1687 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +L+ + + +F+W G K +WA+ P +EGGL +R+L ++F Sbjct: 1688 VLKPPICVLERVNRIFNSFLWGGSAASKKIHWASWAKISLPIKEGGLDIRNLAEVFEAFS 1747 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNGGPRM----AYIGSSIWSGLRPVVLELISQS--RWI 884 MK W+ S+ +R +Y G M S W R V I++ RW Sbjct: 1748 MKLWWRFRTIDSLWTRFMRMKYCRGQLPMHTQPKLHDSQTWK--RMVANSAITEQNMRWR 1805 Query: 885 LGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVV 1064 +G+ + FW D +G + L + D+F N W + V VV Sbjct: 1806 VGQ-GKLFFWHDCWMGETPLTSSNQELSLSM---VQVCDFFMNNSWDIEKLKTVLQQEVV 1861 
Query: 1065 IDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSIT 1244 +I PI S D W + G+ S+K A+ L R + +IW +PL S Sbjct: 1862 DEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFF 1921 Query: 1245 VWRSIHNRLPVLDNI--RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFEVQ 1418 +WR +H+ +PV + +G + C C ++ ES+ H+ A +W + F++ Sbjct: 1922 LWRLLHDWIPVELKMKSKGFQLASRCRCCKSE-ESIMHVMWDNPVATQVWNYFSKFFQIL 1980 Query: 1419 MPLNGGVNHFY-VWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAII 1595 + +N W + + + I T+W +W RN+ R++ + Sbjct: 1981 VINPCTINQILGAWFYSGDYC-KPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVW 2039 Query: 1596 LVKAFIMESATKD--CNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKVN 1769 + I + + + + ++ Q PPP K W+ P G K+N Sbjct: 2040 RILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPP--KVFPWHKPSIGEFKLN 2097 Query: 1770 TDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIK 1949 DG A+ G+ R+ G FS N+G + ++EL+A + + Sbjct: 2098 VDGSAK-LSQNAAGGGVLRDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRR 2156 Query: 1950 LWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVADKLS 2123 LW+E D+ V LL+ + P +SH FR+SHI+REGN+ AD L+ Sbjct: 2157 LWIEMDAASVIRLLQGNQ-RGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLA 2213 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 225 bits (574), Expect = 6e-56 Identities = 187/727 (25%), Positives = 326/727 (44%), Gaps = 13/727 (1%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F +G++QGDPLSP LF +V +R L + Q F N L +ADDI+ Sbjct: 699 FHSKRGLKQGDPLSPALFVLGAEVFSRQLSLLYQNQLYKGFHMESNGPKINHLSFADDII 758 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I ++ + + I + + Y +S Q+ N +K+ NT I+ + +R Sbjct: 759 IFSSTDNNSLNLIMKTIDQYEEVSDQKVNKDKSFFMVTSNTSHDIIEEISRITGFSRKNS 818 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 PI YLG PL+ G +I Y+ +K++ K W L+ G++TLV V+ S +H++ Sbjct: 819 PINYLGCPLYVGGQRIIYYSEIVEKVIKKIAGWHLKILNFGGKVTLVKHVLQSMPIHTLS 878 Query: 543 
VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 P+T+L S++K + +F W + + K +W P EGG+GVR + +F Sbjct: 879 AISPPKTILNSIKKVIADFFWGIEKDGKKYHWSSWNNMAFPTNEGGIGVRLIEDMCTAFQ 938 Query: 723 MKAAWKLLQSRSMVFEILRHRY---FNGGPRMAYIGSSI-WSGLRPVVLELISQSRWILG 890 K W + S+ + L+ +Y N + G SI W L ++ S +W + Sbjct: 939 YKQWWAFRTNNSLWSKFLKAKYNQRANPVAKKYNTGDSIVWRYLTRNRQKVESLIKWHI- 997 Query: 891 ERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYT--GVV 1064 + FW D L +A + H+ ++D+ NG W+ E + ++ +V Sbjct: 998 QSGTCSFWWDCWLDKPLAMQC---DHVSSLNNSVVADFLINGNWN--ERLLRQHVPPQLV 1052 Query: 1065 IDILNFPI--APHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRS 1238 IL I + D +W ++ G + A+ R + + IW IP + S Sbjct: 1053 PYILQTKINYQAGNIDTSIWTPTESGQFTISSAWDSIRKKRNKDPINNIIWHKQIPFKVS 1112 Query: 1239 ITVWRSIHNRLPVLDNI-RGSWGPTACSLCY-ADSESVDHLFTRCRFALAIWEWIQDAFE 1412 +WR++ +LP +N+ R + C CY + ++H+ FA IW+ A Sbjct: 1113 FFIWRALRGKLPTNENLQRIGKNLSDCYCCYNKGKDDINHILINGNFAKYIWKIYSSAVG 1172 Query: 1413 VQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTI-WVIWHARNEWIFRDVRPLHTAA 1589 V +P+N + + Q+++ ++ L + + I W +W R + L ++ Sbjct: 1173 V-LPINTTLRDLLLQWRNQQYTNEVHKLLIHILPNFICWNLWKNRCAVKY----GLKNSS 1227 Query: 1590 IILVKAFIMESATKDCNYMSNSV-YDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKV 1766 I V+ I ++ + + S+ + +++ + K I V+W P G K+ Sbjct: 1228 IYRVQYGIFKNIMQVITIVFPSIPWQTSWNNLINIVEQCKQHYKILIVKWNKPDLGKYKL 1287 Query: 1767 NTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWI 1946 NTDG A G+ GI R+ +G FS G G +E+ AA+ ++ + G+ Sbjct: 1288 NTDGSALQNSGKIGGGGILRDNQGKIIYAFSLPFGFGTNNFAEIKAALHGLDWCEQHGYK 1347 Query: 1947 KLWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHM-EFRVSHIYREGNKVADKLS 2123 K+ LE DS +C + + ++ +PW++ + I M +F+ HIYRE N AD LS Sbjct: 1348 KIELEVDSKLLCNWINS-NINIPWRYEELIQQIHQIIRKMDQFQCHHIYREANCTADLLS 1406 Query: 2124 KMDVPLE 2144 K LE Sbjct: 1407 KWSHNLE 1413 >ref|XP_004295654.1| PREDICTED: uncharacterized protein LOC101314263 [Fragaria vesca subsp. 
vesca] Length = 839 Score = 219 bits (557), Expect = 5e-54 Identities = 138/437 (31%), Positives = 205/437 (46%), Gaps = 3/437 (0%) Frame = +3 Query: 639 VNWARCCAPKEEGGLGVRSLVAANKSFLMKAAWKLLQSRSMVFEILRHRYF--NGGPRMA 812 V W +CCAP +EGGLGVR+++A N++FL+K W L + R+ +G P Sbjct: 432 VAWKKCCAPLKEGGLGVRNIMALNQAFLLKKFWDFLTKSTTAAAFFSARFLQRSGQPCSY 491 Query: 813 YIGSSIWSGLRPVVLELISQSRWILGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYP 992 Y SSIW G+RP+ +++ S+W++G + FW N L SI D++GI L + Sbjct: 492 YKRSSIWPGMRPLFTDILYNSKWVVGNGHSIDFWHGNWLNGSIIDKLGIVHQLGKSLCGK 551 Query: 993 ISDYFYNGVWHFTEEFIVEYTGVVIDILNFPIAPHSTDRR-VWIHSKGGDVSSKEAYTLA 1169 +SD+ NG W + E + +IL + + D + VW+ S G +S AY Sbjct: 552 VSDFILNGSWLCSTNLNAELAALWSEILAIQLPSYDIDDKLVWLDSLEGSLSLSIAYEFK 611 Query: 1170 RNQFPEVQWGKWIWSSHIPLRRSITVWRSIHNRLPVLDNIRGSWGPTACSLCYADSESVD 1349 ++ V W +W RG + CSLC+A E+ Sbjct: 612 ISKQASVPWDRW----------------------------RGFSFASMCSLCHASVENSH 643 Query: 1350 HLFTRCRFALAIWEWIQDAFEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTAIISTIWV 1529 HLF C F+L +W I F V ++ F+ + +Q F QL LW + + + Sbjct: 644 HLFFECSFSLRVWCAILSLFGVNSHFLD-IHAFFSYPLQHGFGTQLQLLWWGMMGAGFYS 702 Query: 1530 IWHARNEWIFRDVRPLHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKP 1709 IW ARN F + I +K+ I E + M NS +L R L ++GR Sbjct: 703 IWDARNSIRFHERHSTPDCLIHSIKSQIREIDSWGLGTMHNSAGELCTFRALGIKGRASR 762 Query: 1710 PPVIKHVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFE 1889 I+ V W+ P +KVNTDG ARG PG GIFR+ G GCF+ ++G A E Sbjct: 763 SHQIREVHWHAPSVFQVKVNTDGAARGTPGLAGFGGIFRDHLGNCMGCFAGSMGIATALE 822 Query: 1890 SELIAAMTAIERAFKRG 1940 +EL A + A A ++G Sbjct: 823 AELQAIIHAASMAARKG 839 >ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 364 Score = 218 bits (556), Expect = 7e-54 Identities = 124/349 (35%), Positives = 185/349 (53%), Gaps = 2/349 (0%) Frame = +3 Query: 1101 TDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSITVWRSIHNRLPVL 1280 +D+ +W+ G++S+KEA+ R + P + WGK IWS I R S+ W+ + R+ Sbjct: 2 SDKLIWVPLSSGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSE 61 Query: 1281 DNI--RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFEVQMPLNGGVNHFYV 1454 D + RG + C LC D ES+ H+F C FA ++W FE+ V+ Y Sbjct: 62 DLLQRRGIALASRCVLCGRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQNLVDLLYY 121 Query: 1455 WAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAIILVKAFIMESATKD 1634 + + S QL +W +T+W IW ARN+ + + A L+ + ++ Sbjct: 122 GGVGR--SHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKTASKLA 179 Query: 1635 CNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKVNTDGCARGAPGRCMAA 1814 MSNS+ +L L++ + RP P I V W+PPL GW+KVNTDG + G+ Sbjct: 180 LGCMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKTTGKSGYG 239 Query: 1815 GIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIKLWLESDSTYVCGLLE 1994 GIFR+ G F G F+ N+ + ++E++A + AIE A+ R W +WLE DS V L+ Sbjct: 240 GIFRDFHGSFLGAFASNLEILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDSIIVLNFLQ 299 Query: 1995 TRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVADKLSKMDVPL 2141 L VPW+ W LH IS M FR SHI+REGN+VAD L+ M + + Sbjct: 300 DPHL-VPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLSM 347 >emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 218 bits (556), Expect = 7e-54 Identities = 207/763 (27%), Positives = 324/763 (42%), Gaps = 55/763 (7%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGDPLSP LF + L+ + A G + T+N L YADD + Sbjct: 624 FKLHRGLRQGDPLSPFLFDLVVETLSLVIQKASHLGLWEGVEVTKNGEKITHLQYADDTI 683 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I + + +IK+ L + SG Q N K+ + I + + N L G Sbjct: 684 IFCPPNLDYLLNIKKTLILFQLASGLQVNFHKSSIMGIHVDEIWL-QEAANALLCKVGRL 742 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P TYLG+P+ ++ ++ P KI K WKG LS+AGRITL+ + ISS ++ M Sbjct: 743 PFTYLGLPIGGNISRLAHWDPIIKKIEGKLASWKGRMLSIAGRITLIKASISSLPLYYMS 802 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 ++ PR +++++ K +NF+WSG++ K V W + PKE GGL +L+ N S L Sbjct: 803 LFPAPRGVIEAINKLQRNFLWSGELRKSSLALVAWNQVVLPKESGGLNCGNLLNRNISLL 862 Query: 723 MKAAWKLLQS-RSMVFEILRHRYFNGGPRMAY-----IGSSIWSGLRPVVLE-------L 863 K W+L S+ ++++ +Y + GS W + +L + Sbjct: 863 FKWIWRLSHDPESLWQKVIKEKYGYSHTTTVHDLCIPKGSGPWRFICASILNHPSARSFV 922 Query: 864 ISQSRWILGERSGVRFWLDN*LGYS-IADRIGIPPHLFQYYEYPISDYFYNG-------V 1019 ++ R +G FWLD LG S + R P LF + P++ G V Sbjct: 923 KTKLRKAVGNGVKTLFWLDTWLGDSPLKLRF---PRLFTIVDNPMAYIASCGSWCGREWV 979 Query: 1020 WHFT---------EEFIVEYTGVVIDILNFPIAPHSTDRRVWIHSKGGDVS----SKEAY 1160 W+F+ E E G++ + ++P + DR +W K G S SKE Sbjct: 980 WNFSWSRVFRPRDAEEWEELQGLLGSVC---LSPSTDDRLIWTPHKSGAFSVKSCSKELT 1036 Query: 1161 TLARNQFPEVQ-WGKWIWSSHIPLRRSITVWRSI------HNRLPVLDNIRGSWGPTACS 1319 A +++ WG+ +W IP R + W ++ +L L+ I C Sbjct: 1037 NTALKPQSKIRIWGR-LWRGLIPPRIEVFSWVALLGKLNSRQKLATLNIIPPD--DAVCI 1093 Query: 1320 LCYADSESVDHLFTRCRFALAIWEWIQDAFEVQMPLNGGV-NHFYVWAIQQRFSPQLASL 1496 +C E+ DHL C FA +IW W + V + F W ++ +P + Sbjct: 1094 MCNGAPETSDHLLLHCPFASSIWLWWLGIWNVSWVFPKNLFEAFEQWYCHKK-NPFFRKV 1152 Query: 1497 WKTAIISTIWVIWHARNEWIFRDVRPLHTAAIILVKAFIMESATKDCNYMSNSVYDLL-- 1670 W + IW IW RN IFR + LV +M S+ ++L 
Sbjct: 1153 WCSIFSIIIWTIWKERNARIFRGISCSSNKLQDLVIIRLMWWIKGWGEAFPYSIVEVLRH 1212 Query: 1671 --CLRRLSVQGRPKPPPV-IKHVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGF 1841 CL ++ P V + + W PP G MK N D GR G+ RN +G Sbjct: 1213 PQCLSWDYLKAAPAATAVSVDGMLWSPPNDGVMKWNVDASVNA--GRSAIGGVLRNSQGI 1270 Query: 1842 FTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIK---LWLESDSTYVCGLLETRSLQV 2012 F FS + +E+IA A++ + ++K L LESDS + + Sbjct: 1271 FVCVFSCPIPSIEINSAEIIAIYRAMQICYSFEFLKRAPLVLESDSANAV-MWSNENEGG 1329 Query: 2013 PWKFLARWRMTLHYISH-----MEFRVSHIYREGNKVADKLSK 2126 PW L++I + + + H R N VAD L+K Sbjct: 1330 PWNL----NFQLNFIRNARKAGLNISIVHKKRSSNAVADALAK 1368 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 218 bits (554), Expect = 1e-53 Identities = 191/723 (26%), Positives = 319/723 (44%), Gaps = 16/723 (2%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SP LF A + L+R L+ + + + + S+ L +ADDI+ Sbjct: 1475 FKSERGLRQGDSISPSLFILAAEYLSRGLNQLFSRYNSLHYLSGCSMSVSH-LAFADDIV 1533 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I T I L Y ++SGQQ N +K+ + P+ + + Sbjct: 1534 IFTNGCHSALQKILVFLQEYEQVSGQQVNHQKSCFITANGCPLSRRQIIAQVTGFQHKTL 1593 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P+TYLG PL KG K+ F KI + W+ LS RITL+ SV+SS ++ + Sbjct: 1594 PVTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSVLSSLPMYLLQ 1653 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +++ +E+ +F+W K W + P EGGL +R+L +F Sbjct: 1654 VLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKINFPCSEGGLDIRNLKDVFDAFT 1713 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNG------GPRMAYIGSSIWSGLRPVVLELISQSRWI 884 +K W+ S+ L+ +Y G P++ SSIW + I +RW Sbjct: 1714 LKLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQPKIH--SSSIWKRITGGRDVTIQNTRWK 1771 Query: 885 LGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVV 1064 +G R + FW D +G + I F+ + ++ W + + ++ Sbjct: 1772 IG-RGELFFWHDCWMG---DQPLVISFPSFRNDMSFVHKFYKGDSWDVDKLRLFLPVNLI 1827 Query: 1065 
IDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSIT 1244 +IL P D W + G+ S+K A+ R Q G IW IPL S Sbjct: 1828 YEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSISFF 1887 Query: 1245 VWRSIHNRLPVLDNIRGSWGPTACS-LCYADSESVDHLFTRCRFALAIWEWIQDAFEVQM 1421 +WR+++N +PV ++G A +C ES+ H+ A +W + F++ + Sbjct: 1888 IWRALNNWIPVELRMKGKGIHLASKCVCCNSEESLMHVLWGNSVAKQVWAFFAKFFQIYV 1947 Query: 1422 PLNGGVNH-FYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAII- 1595 V+H + W + + + I W +W RN+ +R L+T I+ Sbjct: 1948 LNPKHVSHILWAWFYSGDYVKR-GHIRTLLPIFICWFLWLERNDAKYRH-SGLNTDRIVW 2005 Query: 1596 ----LVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMK 1763 L++ S + + ++ D+ + + + Q + + PP I V W P G K Sbjct: 2006 RIMKLLRQLKDGSLLQQWQWKGDT--DIAAMWQYNFQLKLRAPPQI--VYWRKPSTGEYK 2061 Query: 1764 VNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGW 1943 +N DG +R + G+ R+ G FS N+G + ++EL A + + +R Sbjct: 2062 LNVDGSSRHGQ-HAASGGVLRDHTGKLIFGFSENIGTCNSLQAELRALLRGLLLCKERHI 2120 Query: 1944 IKLWLESDSTYVCGLL---ETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREGNKVAD 2114 KLW+E D+ LL + S + + L R L+ IS +R+SHI+REGN+VAD Sbjct: 2121 EKLWIEMDALAAIQLLPHSQKGSHDIRY-LLESIRKCLNSIS---YRISHIHREGNQVAD 2176 Query: 2115 KLS 2123 LS Sbjct: 2177 FLS 2179 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 216 bits (551), Expect = 3e-53 Identities = 189/742 (25%), Positives = 311/742 (41%), Gaps = 35/742 (4%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPC-LLYADDI 179 F+ +G+RQGD +SP+LF A D L+R L+ + + +P L +ADDI Sbjct: 1388 FKSERGLRQGDSISPMLFILAADYLSRGLNHLFS--CYSSLQYLSGCQMPISHLSFADDI 1445 Query: 180 LIMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGA 359 +I T I L Y ++SGQ+ N +K+ + + + + + Sbjct: 1446 VIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKSCFITANGCSLSRRQIISHTTGFQHKT 1505 Query: 360 FPITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSM 539 P+TYLG PL KG K+ F KI + W+ LS GRITL+ SV+SS ++ + Sbjct: 1506 LPVTYLGAPLHKGPKKVLLFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSLPMYLL 
1565 Query: 540 LVYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSF 719 V + P T+++ +++ +F+W K WA+ P EGGLG+R L +F Sbjct: 1566 QVLKPPVTVIERIDRLFNSFLWGDSTECKKMHWAEWAKISFPCAEGGLGIRKLEDVCAAF 1625 Query: 720 LMKAAWKLLQSRSMVFEILRHRYFNG------GPRMAYIGSSIWSGLRPVVLELISQSRW 881 +K W+ S+ + LR +Y G P++ S +W + + RW Sbjct: 1626 TLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQPKLH--DSHVWKRMISGREMALQNIRW 1683 Query: 882 ILGERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISD--YFYNG-VWHFTEEFIVEY 1052 +G + + FW D +G F ++ +S +FYNG W + Sbjct: 1684 KIG-KGDLFFWHDCWMGDKPL------AASFPEFQNDMSHGYHFYNGDTWDVDKLRSFLP 1736 Query: 1053 TGVVIDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLR 1232 T +V +IL P D W + GD S++ A+ + R + +IW IPL Sbjct: 1737 TILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLS 1796 Query: 1233 RSITVWRSIHNRLPVLDNI--RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDA 1406 S +W+++HN +PV + +G + C C ++ ES+ H+ A +W + Sbjct: 1797 ISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSE-ESLIHVLWENPVAKQVWNFFAQL 1855 Query: 1407 FEVQMPLNGGVNHFYVWAIQQRFSPQLASLWKTA-------------IISTIWVIWHARN 1547 F++ Y+W R Q+ W + + W +W RN Sbjct: 1856 FQI-----------YIW--NPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERN 1902 Query: 1548 EWIFRDVRPLHTAAIILVKAFIMESATKDCNYMSNSVY----------DLLCLRRLSVQG 1697 D + HT L ++ K C + + D+ + S Sbjct: 1903 -----DAKHRHTG---LYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTH 1954 Query: 1698 RPKPPPVIKHVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRG 1877 + PP I + W P G K+N DG +R G+ R+ G FS N+G Sbjct: 1955 KQHAPPQI--IYWKKPSIGEYKLNVDGSSRNGL-HAATGGVLRDHTGKLIFGFSENIGPC 2011 Query: 1878 FAFESELIAAMTAIERAFKRGWIKLWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYI 2057 + ++EL A + + +R KLW+E D+ L++ S + P+ + Sbjct: 2012 NSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQP-SKKGPYNLRYLLESIRMCL 2070 Query: 2058 SHMEFRVSHIYREGNKVADKLS 2123 S +R+SHI REGN+ AD LS Sbjct: 2071 SSFSYRLSHILREGNQAADYLS 2092 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 216 bits (549), Expect = 5e-53 Identities = 182/720 (25%), Positives = 312/720 
(43%), Gaps = 13/720 (1%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGD +SPLLF A D L+R ++ + + + + F L +ADDI+ Sbjct: 1214 FKSERGLRQGDSISPLLFVLAADYLSRGINQLFNR-HKSLLYLSGCFMPISHLAFADDIV 1272 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 I T I L Y +SGQQ N +K+ + P+ + + + Sbjct: 1273 IFTNGCRPALQKILVFLQEYEEVSGQQVNHQKSCFITANGCPMTRRQIIAHTTGFQHKTL 1332 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P+ YLG PL KG K+ F KI + W+ +LS GRITL+ SV+SS ++ + Sbjct: 1333 PVIYLGAPLHKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRITLLRSVLSSLPLYLLQ 1392 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 V + P +++ +E+ +F+W N K W + P EGGL +R L +F Sbjct: 1393 VLKPPVVVIEKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFS 1452 Query: 723 MKAAWKLLQSRSMVFEILRHRYFNGG-PRMAY---IGSSIWSGLRPVVLELISQSRWILG 890 +K W+ + + L+ +Y G P + S +W + I +RW +G Sbjct: 1453 LKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIG 1512 Query: 891 ERSGVRFWLDN*LGYSIADRIGIPPHLFQYYEYPISDYFYNGVWHFTEEFIVEYTGVVID 1070 + S + FW D +G + PH F+ + ++F W + + +V + Sbjct: 1513 KGS-LFFWHDCWMGDQ--PLVTSFPH-FRNDMSTVHNFFNGHNWDVDKLNLYLPMNLVDE 1568 Query: 1071 ILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKWIWSSHIPLRRSITVW 1250 IL PI D W + G+ S++ A+ R + +W IPL S +W Sbjct: 1569 ILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLW 1628 Query: 1251 RSIHNRLPVLDNI--RGSWGPTACSLCYADSESVDHLFTRCRFALAIWEWIQDAFEVQMP 1424 R HN +PV + +G + C +C ES+ H+ A +W + ++F++ + Sbjct: 1629 RVFHNWIPVDIRLKEKGFHLASKC-ICCNSEESLIHVLWDNPIAKQVWNFFANSFQIYIS 1687 Query: 1425 LNGGVNH-FYVWAIQQRFSPQLASLWKTAIISTIWVIWHARNEWIFRDVRPLHTAAIILV 1601 V+ + W + + + + + W +W RN+ R + + + Sbjct: 1688 KPQNVSQILWTWYLSGDY-VRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKI 1746 Query: 1602 KAFI--MESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWYPPLPGWMKVNTD 1775 + ++ ++ D + L + + P I H W P+PG K+N D Sbjct: 1747 MKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILH--WVKPVPGEHKLNVD 1804 Query: 1776 
GCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAIERAFKRGWIKLW 1955 G +R + G+ R+ G FS N+G + ++EL A + + +R KLW Sbjct: 1805 GSSRQNQTAAI-GGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLCKERNIEKLW 1863 Query: 1956 LESDSTYVCGLLETRSLQVPWKFLARWRMTL----HYISHMEFRVSHIYREGNKVADKLS 2123 +E D+ L+ + +Q K R L Y++ FR+SHI+REGN+ AD LS Sbjct: 1864 VEMDA-----LVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLS 1918 >ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Citrus sinensis] Length = 768 Score = 214 bits (546), Expect = 1e-52 Identities = 197/758 (25%), Positives = 321/758 (42%), Gaps = 25/758 (3%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F ++GVRQGDPLSP +F + L+ + +I Q + P + +R + L + DD+L Sbjct: 57 FSPSRGVRQGDPLSPYIFVLCVERLSHGIYQSIHQDHWKPIRLSRLGTPLSHLFFTDDLL 116 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + A+ I +L + SG + N KT VYF N P + R+ L T Sbjct: 117 LFAEATSGQAQCINSVLGDFCLSSGTKVNQSKTHVYFSKNVPDAVATRIWRDLGYTVTKD 176 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 YLG+PL + +Q DK K W + LS+AGRITL SV+ + +++M Sbjct: 177 LGKYLGMPLLHSRVSQQTYQGILDKTDQKLLGWAASQLSLAGRITLTQSVLQAVPIYAMQ 236 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 P ++ +++ + F+WSG+ + V+W C PK GGLG + L N++ L Sbjct: 237 TTNLPGSIKTKLDQICRRFLWSGNDELRKMSLVSWHNICQPKMAGGLGFKRLDIMNEALL 296 Query: 723 MKAAWKLL-QSRSMVFEILRHRYFNGGPRM-------AYIGSSIWSGLRPVVLELISQSR 878 +K AW L+ + + ++L +Y G P + GS +W + V R Sbjct: 297 LKVAWHLITEPNKLCVQVLSTKY--GVPPLEIPHTLPTRYGSHLWKSVGRVWDYAKRGIR 354 Query: 879 WILGERSGVRFWLD--N*LGYSIADRI--GIPPHLFQYY--EYPISDYFYNGVWHFTEEF 1040 WI+G V+FW D ++AD IPP L Y ++ +D +N W Sbjct: 355 WIVGNGWKVKFWWDCWATTPLTLADLAIHPIPPDLSDLYVADFVSTDGSWN--WPKFSHI 412 Query: 1041 IVEYTGVVIDILNFPIAPHSTDRRVWIHSKGGDVSSKEAYTL---ARNQFPEVQWGKWIW 1211 + ++I ++ P A + D+ W S GD + K AY L +R Q + W K W Sbjct: 413 LPHQAVMIIASIHPPSACNGVDQVYWAASPQGDFTVKSAYDLLDQSRGQERDSYW-KIAW 471 Query: 1212 
SSHIPLRRSITVWRSIHNRLPV---LDNIRGSWGPTACSLCYADSESVDHLFTRCRFALA 1382 S P I +W +HNRL L R + P +C C A E+ H+ C ++ A Sbjct: 472 SWKGPQSIKIFIWLVLHNRLKTRAELATRRLNIEP-SCERCGAGLENTIHVLHDCPYSKA 530 Query: 1383 IWEWIQDAFEVQMPLNGGVNHFYVWAIQQRFSPQLAS-LWKTAIISTIWVIWHARNEWIF 1559 S L + LW+ IW +W RN +IF Sbjct: 531 -------------------------------STSLGTKLWRVIFGVAIWRLWFWRNRFIF 559 Query: 1560 RDVRPLHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPVIKHVRWY 1739 +A + +K + + + CN L+ + + V K +RW Sbjct: 560 TKDHWESSAIALDIK--VRAAEIQQCN------------SSLTATDKSR---VEKWIRWG 602 Query: 1740 PPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESELIAAMTAI 1919 P W+K+NTDG A+ A + R+ +G + + N+G +EL + Sbjct: 603 APTWPWVKLNTDG-AKKPSDHAGAGELIRDYKGVWQIGYCANLGFCSVTSAELWGLFHGL 661 Query: 1920 ERAFKRGWIKLWLESDSTYVCGLLETRSLQVPWKFLARWRMTLHYISHMEFRVSHIYREG 2099 A+ G+ ++ +E DS + L+ + + + +++HIYRE Sbjct: 662 SIAWLHGYRRVIVEVDSRCILQLVSNSNPTINEHLSLITAIQELIGRDWLIQMNHIYREA 721 Query: 2100 NKVADKLS----KMDVPLEWSYTIPESIVTELNEDSSG 2201 N +D L+ V L + + P +++ LN D SG Sbjct: 722 NAASDFLATYSLAFPVGLHYFQSPPLNLLNILNNDVSG 759 >emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1383 Score = 211 bits (536), Expect = 1e-51 Identities = 199/760 (26%), Positives = 328/760 (43%), Gaps = 52/760 (6%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F+ +G+RQGDPLSP LF +VL++ + A S L YADD L Sbjct: 625 FKLHRGLRQGDPLSPFLFVLVGEVLSQMISKATSLQLWRGIPACSRGSEITHLQYADDTL 684 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + A+ + +I++ L + +SG Q N K+ + + T I + N L G Sbjct: 685 MFCEANTNSLKNIQKTLIIFQLVSGLQVNFHKSSLMGLNVTSSWI-QEAANSLMCKIGTI 743 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 P +YLG+P+ +IR + P DK+ K WKG LS+ GR+TL+ + +S+ ++ M Sbjct: 744 PFSYLGLPIGDNPARIRTWDPIIDKLEKKLASWKGKLLSLGGRLTLIKASLSNLPLYYMS 803 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 ++ P+ +++ + K M+ F+W GD K+ V+W+ PK GGLG+ +++ N S L Sbjct: 804 LFPVPKGVIEKINKLMRAFLWCGDFGKRPFSMVSWSIVQQPKTSGGLGIGNILHKNLSLL 863 Query: 723 MKAAWKLLQSRSMVF-EILRHRY--------------FNGGPRMAYIGSSIWSGLRPVVL 857 K W+L ++ S ++ I+R +Y +GGP + + + G L Sbjct: 864 FKWIWRLFENPSSMWGSIIRSKYNYSSTCSISDLKKPVSGGPWKSICAAVL--GHEGARL 921 Query: 858 ELISQSRWILGERSGVRFWLDN*LGYSIADRIGIPPHLFQY---YEYPISDY----FYNG 1016 ++ R +G FW D L RI P LF I+ Y +N Sbjct: 922 IAVNGMRKNVGNGISSLFWHDTWLCEQPLKRIA--PRLFSIAINKNSSIASYGVWEGFNW 979 Query: 1017 VWHFTEEFIVEYTGVV----IDIL--NFPIAPHSTDRRVWIHSKGGDVSSKEAYTLARNQ 1178 VW F+ + ++ +V +D L + + P++ D+ +W K G S+K Sbjct: 980 VWVFSWKRVLRPQDLVEKAHLDELLKSVRLDPNADDQLIWAPEKSGRFSTKSFSKELSKM 1039 Query: 1179 FPEVQWG--KWIWSSHIPLRRSITVWRSIHNRLPVLDNIRG----SWGPTACSLCYADSE 1340 P K +W +P R + VW ++ ++ + S C LC SE Sbjct: 1040 TPPTHSDAVKGVWRGLVPHRIEVFVWIALLGKINSRHKLAAFGIISEEEDICPLCDEGSE 1099 Query: 1341 SVDHLFTRCRFALAIWEWIQDAFEVQMPLNGG-VNHFYVWAIQQRFSPQLASLWKTAIIS 1517 + DHL C A +W W D ++V+ ++ F W ++ S +W + Sbjct: 1100 TSDHLLLHCVEAQKLWAWWLDIWKVKWVFPSSLLDAFSQWKCIKKKSNFFKKVWAASFFV 1159 Query: 1518 TIWVIWHARNEWIFRD--VRPLHTAAIILVKAFIMESATKDCNY---MSNSVYDLLCL-- 1676 IW IW RN IF + ++ ++L++ A DC + 
++ + LCL Sbjct: 1160 IIWTIWKERNLRIFHNSSSNAMNLQDLVLLRLGWWIGAW-DCRFPYSPTDIQRNPLCLEW 1218 Query: 1677 --RRLSVQGRPKPPPVIKHVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTG 1850 +R+ Q + P ++ W PP P +K N D + GI RN +G F Sbjct: 1219 SDQRVCAQLLKQQP---ENDSWVPPPPQVLKWNVDASVINSNSCSAIGGILRNHKGEFMC 1275 Query: 1851 CFSHNVGRGFAFESELIAAMTAIERAFKRGWIK---LWLESDSTYVCGLLETRSLQVPWK 2021 FS V +E++A AI+ + + K L LESDS + S PW Sbjct: 1276 VFSSPVPYIEINCAEILAIHRAIQISLQSDKTKNANLLLESDSANAVMWCNSES-GGPWN 1334 Query: 2022 FLARWRMTLHYISHM-----EFRVSHIYREGNKVADKLSK 2126 L++I M +++ R N VAD L+K Sbjct: 1335 M----NFQLNFIRSMRKKGLNISITYKGRSSNVVADSLAK 1370 >gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase); Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula] Length = 729 Score = 211 bits (536), Expect = 1e-51 Identities = 180/684 (26%), Positives = 286/684 (41%), Gaps = 28/684 (4%) Frame = +3 Query: 3 FRCTQGVRQGDPLSPLLFCAAEDVLARTLDMAIQQGYIAPFKTTRNFSLPPCLLYADDIL 182 F ++G+RQGDPLSP LF + L+ + ++ Y P + R LL+ADD+L Sbjct: 57 FYPSRGIRQGDPLSPYLFVICMERLSHIIADQVEADYWKPMRAGRYGPPISHLLFADDLL 116 Query: 183 IMTAASWENCHHIKRILTSYGRISGQQFNPEKTKVYFRDNTPIRIMRRVQNHLQITRGAF 362 + AS E H + L + + SGQ+ N EKT+VYF N + + H + Sbjct: 117 LFAEASIEQAHCVLHCLDMFCQSSGQKINREKTQVYFSKNVDNHLREDIIQHTGFNQVNS 176 Query: 363 PITYLGVPLFKGAPKIRYFQPTHDKILSKFKRWKGTSLSMAGRITLVNSVISSSYVHSML 542 YLG + G +F +KI +K WK LS+AGRITL VISS + M Sbjct: 177 LGKYLGANITPGRTSRGHFNHIINKIQNKLSGWKQQCLSLAGRITLSKFVISSIPYYHMQ 236 Query: 543 VYRWPRTLLKSMEKAMKNFIWSGDINKKGAVTVNWARCCAPKEEGGLGVRSLVAANKSFL 722 + P+T+ +EK + F+W + A V+W CC PK GGLG + N++FL Sbjct: 237 YAKIPKTICDEIEKIQRGFVWGDSNQGRKAHLVSWDVCCLPKMNGGLGFKRPHHMNEAFL 296 Query: 723 MKAAWKLLQS------RSMVFEILRHRYFNGGPRMAYIGSSIWSGLRPVVLELISQSRWI 884 MK W L++ R + + R+ N S +W + + + W Sbjct: 297 MKMLWNLIKQPDKLWCRVLYSKYGRNNDLNNNISSQPYDSPLWKAIVGIWDDFKRHVIWQ 356 Query: 885 LGERSGVRFWLDN*LGYSIADRIGIPPHLFQYY---EYPISDYF-YNGVWHFTEEFIVEY 1052 +G+ FWLD I++ + Q Y + D +GVW + Sbjct: 357 
IGDGRSTNFWLDK----WISNNTSLFSSSTQSYVDTTISVRDAINTSGVWDLNFLMDNLH 412 Query: 1053 TGVVIDILNFPIAP--HSTDRRVWIHSKGGDVSSKEAYTLARNQFPEVQWGKW--IWSSH 1220 +V IL P D W + + + AY L + + P G W +W+ Sbjct: 413 VDIVNQILALPTPSDFDGPDTIGWGGTNTLKFTVQSAYNL-QQENPFAVGGDWKTLWNWK 471 Query: 1221 IPLRRSITVWRSIHNRLPVLDNIRGS-WG----PTACSLCYADSESVDHLFTRCRFALAI 1385 P R +W + H R +L N R S WG PT C C + E+V H+ C + + Sbjct: 472 GPHRIQTFIWLAAHGR--ILTNYRRSKWGVGISPT-CPCCAREDETVIHVLRDCVHSTQV 528 Query: 1386 WEWIQDAFEVQMPLNGGVNHFYV----WA---IQQRFSPQLASLWKTAIISTIWVIWHAR 1544 W + +P N N F W + ++ + W+T ++T W +W+ R Sbjct: 529 WLRL-------IPHNYITNFFSFDCREWVFNNLNKKGIGDNPATWQTTFMTTCWYLWNWR 581 Query: 1545 NEWIFR--DVRPLHTAAIILVKAFIMESATKDCNYMSNSVYDLLCLRRLSVQGRPKPPPV 1718 N+ IF RP + +I +E TK + S+ + Sbjct: 582 NKSIFEIGFQRPSNPTLVIQKFTREIEDNTKLVHKSSHQKETI----------------- 624 Query: 1719 IKHVRWYPPLPGWMKVNTDGCARGAPGRCMAAGIFRNCRGFFTGCFSHNVGRGFAFESEL 1898 ++ W P GW+K+N DG +G+ G+ R+ G + + +G AF +E+ Sbjct: 625 --YIGWMRPPFGWVKLNCDGAWKGSGTLAGCGGLLRDSDGRWIKGYFKKIGMCDAFHAEM 682 Query: 1899 IAAMTAIERAFKRGWIKLWLESDS 1970 ++ A++ L +ESDS Sbjct: 683 WGMYLGLDMAWRENTTHLIVESDS 706