BLASTX nr result
ID: Cinnamomum25_contig00022230
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00022230 (852 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_010058837.1| PREDICTED: uncharacterized protein LOC104446...   124   1e-31
ref|XP_008389974.1| PREDICTED: putative ribonuclease H protein A...   124   2e-30
ref|XP_010041561.1| PREDICTED: uncharacterized protein LOC104430...   122   3e-30
ref|XP_008375089.1| PREDICTED: uncharacterized protein LOC103438...   121   5e-30
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   128   6e-30
ref|XP_008357937.1| PREDICTED: putative ribonuclease H protein A...   122   7e-30
ref|XP_008369981.1| PREDICTED: uncharacterized protein LOC103433...   122   1e-29
ref|XP_010068552.1| PREDICTED: uncharacterized protein LOC104455...   118   4e-29
ref|XP_008366684.1| PREDICTED: uncharacterized protein LOC103430...   117   7e-29
ref|XP_010034422.1| PREDICTED: uncharacterized protein LOC104423...   120   9e-29
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   133   2e-28
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   124   2e-28
gb|AIK35195.1| LINE-type retrotransposon LIb DNA [Ipomoea batatas]    131   6e-28
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   124   8e-28
ref|XP_010038572.1| PREDICTED: uncharacterized protein LOC104427...   117   2e-27
ref|XP_008385092.1| PREDICTED: putative ribonuclease H protein A...   116   2e-27
ref|XP_008370907.1| PREDICTED: putative ribonuclease H protein A...   112   3e-27
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...
129 3e-27 ref|XP_008358855.1| PREDICTED: uncharacterized protein LOC103422... 115 4e-27 ref|XP_008245249.1| PREDICTED: putative ribonuclease H protein A... 120 1e-26 >ref|XP_010058837.1| PREDICTED: uncharacterized protein LOC104446716 [Eucalyptus grandis] Length = 1688 Score = 124 bits (310), Expect(2) = 1e-31 Identities = 74/190 (38%), Positives = 98/190 (51%), Gaps = 1/190 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I +R +W + LSF G+L LI+ VL +I AS+ LPSS+ IE +LR FLW G++ Sbjct: 726 ITARARSWSHRFLSFAGRLQLIRSVLHAIQTFWASVFTLPSSVILGIESILRRFLWKGTS 785 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQAST-MNSLWSSCFKLRYF 357 V+W++V LPK EG GLGIR I + N AS++K W T SLW + Sbjct: 786 LDKGGAKVSWAEVCLPKEEG-GLGIRSIKDCNKASMLKFIWILFTDKESLWCRWIHSNFL 844 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFL 537 K + W AT S WK+I L Q W GDG+ + W D+W P L Sbjct: 845 KKHNFWVATQPSVCSWNWKKILHLRGCCRQSFEWKVGDGQSTSLWFDSWLPCGP-----L 899 Query: 538 NQQFPQDHLL 567 + Q PQ LL Sbjct: 900 HLQVPQSFLL 909 Score = 41.2 bits (95), Expect(2) = 1e-31 Identities = 18/58 (31%), Positives = 25/58 (43%) Frame = +3 Query: 660 SFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLIWNGISLARINCFGW 833 S PP S D+ + + G S +SAWN IR+ W P +W+ R W Sbjct: 940 SLPPLSVDNDRFVWRQDSSGAFSVASAWNVIRTSKAKAPWTPFVWDKDVAPRFQFLLW 997 >ref|XP_008389974.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus domestica] Length = 805 Score = 124 bits (310), Expect(2) = 2e-30 Identities = 62/172 (36%), Positives = 97/172 (56%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG +S Sbjct: 203 AKLTGWKGKLLSMAGRVQLTQSVFQSMLLHSFSVYKWPSSLLRXLSRCARNFIWSGDVTS 262 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ PK EG GLG+R + +N +L+KLGW T +S WS + R+ Sbjct: 263 KKXVTVSWXQICAPKNEG-GLGLRDLGSLNTTALLKLGWLIITTDSPWSIYJRXRFKLHG 321 Query: 367 
SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ Sbjct: 322 RLYSC--SYMRSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKWL-DKPI 370 Score = 36.6 bits (83), Expect(2) = 2e-30 Identities = 23/82 (28%), Positives = 36/82 (43%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS +S P L IL P +D L + S+ G S S + +R + WA Sbjct: 399 IPSIFSSTFPDLTK-EILEMPLPIDEDKDVLIWEASSSGGFSFSDGYEIVRHRFPVKSWA 457 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 458 SIIWRPFIPPRYSILVWKILFN 479 >ref|XP_010041561.1| PREDICTED: uncharacterized protein LOC104430514 [Eucalyptus grandis] Length = 650 Score = 122 bits (305), Expect(2) = 3e-30 Identities = 64/177 (36%), Positives = 92/177 (51%), Gaps = 1/177 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I +R+ +W + LSF G+L LI+ VL SI S+ LP+S+ D+ER++R FLW+G++ Sbjct: 223 ITARIQSWTHRFLSFAGRLQLIRSVLHSIQASWMSVFTLPASVLADVERIMRQFLWNGTS 282 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGW-QASTMNSLWSSCFKLRYF 357 V W + PK EG GLG+R I + N A +K W S SLW + Sbjct: 283 LGRGGAKVAWEDICCPKAEG-GLGVRNIKQCNRAYTVKYLWILFSDKESLWCRWIHSVFL 341 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLST 528 K ++ W A S +WK+I L + WI G+G I+ WHD W PL + Sbjct: 342 KKKNFWIANTPRTCSWMWKKILQLRPYFRSSFRWIVGNGYSISLWHDYWLSCGPLDS 398 Score = 38.1 bits (87), Expect(2) = 3e-30 Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 4/74 (5%) Frame = +3 Query: 639 ALLSILS----SFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLIWNGIS 806 AL S+L + PP S+ D+ G+ S +S W+ IR K +QWA +W+ Sbjct: 426 ALKSLLERWSIALPPLSSMHDKFIWCHEPSGKFSVASTWDFIRVKRNPVQWASFVWDNAL 485 Query: 807 LARINCFGWRMMFN 848 R W + N Sbjct: 486 APRYQFILWLITKN 499 >ref|XP_008375089.1| PREDICTED: uncharacterized protein LOC103438319 [Malus domestica] Length = 1384 Score = 121 bits (304), Expect(2) = 5e-30 Identities = 61/172 (35%), Positives = 96/172 (55%) Frame = +1 Query: 7 
SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG +S Sbjct: 782 AKLTGWKGKLLSMAGRVQLTQSVFQSMLLHSFSVYKWPSSLLRPLSRCARNFIWSGDVTS 841 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ PK EG GLG+R + +N L+K+GW T +S WS + R+ Sbjct: 842 KKSVTVSWRQICAPKNEG-GLGLRDLGSLNTTXLLKJGWLIITTDSPWSIYLRERFKLHG 900 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ Sbjct: 901 RLYSC--SYKXSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKWL-DKPI 949 Score = 37.7 bits (86), Expect(2) = 5e-30 Identities = 24/82 (29%), Positives = 36/82 (43%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS +S P L IL P +D L + ST G S S + +R + WA Sbjct: 978 IPSIFSSTFPDLTK-EILEMPLPIDEDKDVLIWEVSTSGVFSFSDGYEIVRHRFPVKSWA 1036 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 1037 SIIWRPFIPPRYSILVWKILFN 1058 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 128 bits (322), Expect(2) = 6e-30 Identities = 67/187 (35%), Positives = 103/187 (55%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R+S W+ K LS G++ L++ VLSS+P++ +L P ++ + I+RL FLW ST Sbjct: 1532 IRDRISGWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDST 1591 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 K++ W++++ P EG GLGIR++ +V A +KL W+ T NSLW+ + +Y Sbjct: 1592 ECKKMHWAEWAKISFPCAEG-GLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCL 1650 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFLN 540 + + L+ S +WKR+ S Q W G G ++ WHD W GDKPL+ F Sbjct: 1651 GRIPHHIQPKLHDSHVWKRMISGREMALQNIRWKIGKG-DLFFWHDCWMGDKPLAASFPE 1709 Query: 541 QQFPQDH 561 Q H Sbjct: 1710 FQNDMSH 1716 Score = 30.4 bits (67), Expect(2) = 6e-30 Identities = 21/79 (26%), Positives = 36/79 (45%) Frame = +3 
Query: 612 ELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLI 791 +L S P++ + IL P + + D +++G S SAW IR + T+ I Sbjct: 1730 KLRSFLPTILVEEILQ-VPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFI 1788 Query: 792 WNGISLARINCFGWRMMFN 848 W+ I+ F W+ + N Sbjct: 1789 WHRSIPLSISFFLWKTLHN 1807 >ref|XP_008357937.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus domestica] Length = 1048 Score = 122 bits (306), Expect(2) = 7e-30 Identities = 62/172 (36%), Positives = 97/172 (56%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG +S Sbjct: 446 AKLTGWKGKLLSMAGRVQLTQSVFQSMLLHSFSVYKWPSSLLRPLSRCARNFIWSGDVTS 505 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ PK EG GLG+R + +N +L+KLGW T +S WS + R+ Sbjct: 506 KKSVTVSWRQICAPKNEG-GLGLRDLGSLNTTALLKLGWLIITTDSPWSIYLRERFKLHG 564 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ Sbjct: 565 RLYSC--SYKRSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKWL-DKPI 613 Score = 36.6 bits (83), Expect(2) = 7e-30 Identities = 24/82 (29%), Positives = 35/82 (42%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS +S P L IL P +D L + ST G S S + +R WA Sbjct: 642 IPSIFSSTFPDLTK-EILEMPLPIDEDKDVLIWEVSTSGVFSFSDGYEIVRHXFPVKSWA 700 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 701 SIIWRPFIPPRYSILVWKILFN 722 >ref|XP_008369981.1| PREDICTED: uncharacterized protein LOC103433498 [Malus domestica] Length = 1384 Score = 122 bits (306), Expect(2) = 1e-29 Identities = 62/172 (36%), Positives = 97/172 (56%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG +S Sbjct: 782 AKLTGWKGKLLSMAGRVQLTQSVFQSMLLHSFSVYKWPSSLLRXLSRCARNFIWSGDVTS 841 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ PK EG GLG+R + +N +L+KLGW T +S WS + R+ 
Sbjct: 842 KKXVTVSWRQICAPKNEG-GLGLRDLGSLNTXALLKLGWLIITTDSPWSIYLRERFKLHG 900 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ Sbjct: 901 RLYSC--SYKRSSIWPGIKSIJHILFQNCRWVIGNGSTTSLWVDKWL-DKPI 949 Score = 35.8 bits (81), Expect(2) = 1e-29 Identities = 23/82 (28%), Positives = 35/82 (42%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +P +S P L IL P +D L + ST G S S + +R + WA Sbjct: 978 IPXIFSSTFPDLTK-EILEMPLPIDEDKDVLIWEVSTSGVFSFSDGYEIVRHRFPVKSWA 1036 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 1037 SIIWRPFIPPRYSILVWKILFN 1058 >ref|XP_010068552.1| PREDICTED: uncharacterized protein LOC104455464 [Eucalyptus grandis] Length = 1477 Score = 118 bits (295), Expect(2) = 4e-29 Identities = 71/190 (37%), Positives = 96/190 (50%), Gaps = 1/190 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I +R +W + LSF G+L LI+ VL +I AS+ LPSS+ IE +LR FLW G++ Sbjct: 1197 ITARARSWSHRFLSFAGRLQLIRLVLHAIQSFWASVFTLPSSVILGIESILRRFLWKGTS 1256 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQAST-MNSLWSSCFKLRYF 357 V+W+ V LPK EG G+GIR I + N AS++K W T SLW + Sbjct: 1257 LDKGGAKVSWADVCLPKEEG-GMGIRSIKDCNKASMMKFIWILFTDKESLWCRWIHSNFL 1315 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFL 537 K + W A + WK+I L Q W DG+ I+ W D+W P L Sbjct: 1316 KKHNFWVAPQPSVCAWSWKKILQLRGGCRQSFGWKVSDGRSISLWFDSWLPCGP-----L 1370 Query: 538 NQQFPQDHLL 567 + Q PQ LL Sbjct: 1371 HLQVPQSFLL 1380 Score = 38.1 bits (87), Expect(2) = 4e-29 Identities = 25/91 (27%), Positives = 37/91 (40%) Frame = +3 Query: 576 LPASLRILALPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIR 755 +PA + +L S R L L + S PP S D+ + + G +SAWNAIR Sbjct: 1384 VPAKATMASLYS-CAGRSCKLRLENCGFSLPPLSVDSDRFVWRQDSSGAFFVASAWNAIR 1442 Query: 756 SKGTTLQWAPLIWNGISLARINCFGWRMMFN 848 + W +W+ R W + N Sbjct: 1443 ASKAKAPWTSFVWDKDLAPRFQFLLWLITKN 1473 >ref|XP_008366684.1| PREDICTED: uncharacterized protein 
LOC103430323 [Malus domestica] Length = 1380 Score = 117 bits (294), Expect(2) = 7e-29 Identities = 63/172 (36%), Positives = 91/172 (52%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 +RLS WK K S G+ L++ V S+ LH S+ LPS K + R+F+WSG SS Sbjct: 782 ARLSGWKGKLXSMAGRFQLVQSVYQSLXLHSFSVYQLPSCXLKHLSACARNFIWSGDLSS 841 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K+ V+WS V PK EG GLG+R ++ +N +L+ GW A SLW S R+ Sbjct: 842 RKLVTVDWSMVXGPKKEG-GLGLRDLAXLNLTALLSFGWDALQSYSLWGSFAXQRF--PL 898 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 S + N S W ++ ++ S WI GDG+ ++ W D W D+P+ Sbjct: 899 SPYRNQNIYLRSSXWHGLKRALPILNNNSRWIIGDGRXVSFWFDKWL-DEPI 949 Score = 37.7 bits (86), Expect(2) = 7e-29 Identities = 23/80 (28%), Positives = 40/80 (50%) Frame = +3 Query: 600 ALPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQW 779 +LP ++ PS+ + IL P + D+L + S G+ S SS ++ IR + + W Sbjct: 976 SLPDYFSNLFPSI-VQQILRLPLPLNXXXDKLIWEPSPTGKFSFSSGYHLIRXRHSDCAW 1034 Query: 780 APLIWNGISLARINCFGWRM 839 A +IW R++ WR+ Sbjct: 1035 AKVIWQHFIPPRLSILAWRL 1054 >ref|XP_010034422.1| PREDICTED: uncharacterized protein LOC104423658 [Eucalyptus grandis] Length = 1706 Score = 120 bits (300), Expect(2) = 9e-29 Identities = 71/205 (34%), Positives = 101/205 (49%), Gaps = 6/205 (2%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I SR +W + LS+ G+L LIK VL SI +S+ LP S+ DIE++LR FLW G+ Sbjct: 1276 ITSRAKSWAHRLLSYAGRLQLIKSVLHSIQAFWSSVFTLPISVLNDIEQVLRQFLWKGAD 1335 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMN-SLWSSCFKLRYF 357 V+W + LPK EG GLGIR++ + N A+++K W T SLW + Sbjct: 1336 LGKGGAKVSWEDICLPKNEG-GLGIRKLRDCNKAAMMKYIWILFTDKVSLWFRWIHSNFL 1394 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFL 537 K Q+ W AT S WK+I L W G+G ++ W+D W PL + Sbjct: 1395 KWQNFWIATTPTVCSWAWKKILQLRSDSRPSFLWKIGNGLSVSLWYDHWHPKGPLHILLP 1454 Query: 538 NQ-----QFPQDHLLASFFQPLSGS 597 Q + + ++A PL S Sbjct: 
1455 EQIIRRSELSGNAMVADLLSPLGCS 1479 Score = 35.0 bits (79), Expect(2) = 9e-29 Identities = 22/65 (33%), Positives = 31/65 (47%) Frame = +3 Query: 654 LSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLIWNGISLARINCFGW 833 LSS PNS+ + + + G+ + SAW+ +R K T WA LIW G R W Sbjct: 1489 LSSPAPNSSP-NCFSWRWHPSGRFTIGSAWDRLRRKRTPAPWASLIWAGDITPRFQFILW 1547 Query: 834 RMMFN 848 + N Sbjct: 1548 LIAKN 1552 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 133 bits (334), Expect = 2e-28 Identities = 73/201 (36%), Positives = 112/201 (55%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R++ W+ KTLS G++ L++ LSS+P++ +L P + + I RLL +FLW GST Sbjct: 1618 IEERITGWENKTLSPGGRITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGST 1677 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 +S +I+ +W ++ LP EG GL IR + +V A +KL W+ T NSLW+ + +Y Sbjct: 1678 ASKRIHWASWGKIALPIAEG-GLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCG 1736 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFLN 540 Q + L+ S WKR+ +++ Q W G G E+ WHD W G++PL + N Sbjct: 1737 GQLPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIGHG-ELFFWHDCWMGEEPL--VNRN 1793 Query: 541 QQFPQDHLLASFFQPLSGSWH 603 Q F S F L+ SW+ Sbjct: 1794 QAFASSMAQVSDFF-LNNSWN 1813 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 124 bits (312), Expect(2) = 2e-28 Identities = 69/201 (34%), Positives = 112/201 (55%), Gaps = 1/201 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R++ W+ K LS G++ L+K VL+S+P++ +L P + + I R+ FLW GS Sbjct: 1825 IEERITGWENKILSPGGRITLLKSVLTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSA 1884 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 +S KI+ +W++++LP EG GL IR ++EV A +KL W+ T +SLW+ +++Y + Sbjct: 1885 
ASKKIHWTSWAKISLPVKEG-GLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR 1943 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFLN 540 Q + L+ S WKR+ + + Q W G G + WHD W G+ PL I N Sbjct: 1944 GQLPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVGQG-NLFFWHDCWMGETPL--ISSN 2000 Query: 541 QQFPQDHL-LASFFQPLSGSW 600 +F + + FF ++ SW Sbjct: 2001 HEFSLSMVQVCDFF--MNNSW 2019 Score = 29.3 bits (64), Expect(2) = 2e-28 Identities = 14/63 (22%), Positives = 29/63 (46%) Frame = +3 Query: 654 LSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLIWNGISLARINCFGW 833 ++ P ++ ++D+ + +G+ S SAW IR + IW+ + F W Sbjct: 2036 IAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLW 2095 Query: 834 RMM 842 R++ Sbjct: 2096 RLL 2098 >gb|AIK35195.1| LINE-type retrotransposon LIb DNA [Ipomoea batatas] Length = 1836 Score = 131 bits (329), Expect = 6e-28 Identities = 64/177 (36%), Positives = 97/177 (54%), Gaps = 1/177 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 + +L+ WKA TL+ G+ IL++ L+S+P + L LP S C D++++ R+FLW + Sbjct: 1246 MRKKLATWKASTLNMAGRRILVQSSLASVPTYTMQALALPVSTCNDVDKICRNFLWGHTD 1305 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQAST-MNSLWSSCFKLRYF 357 ++ KI+ VNWS + P+ G GLG+R + N A L K+ WQ T + LW + +Y Sbjct: 1306 NTKKIHTVNWSHICRPRQMG-GLGLRTARDFNMAFLTKMAWQIFTNQDRLWVKVLREKYV 1364 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLST 528 K + S W+ I + + +G W GDG IN WHD W+G KPL T Sbjct: 1365 KQDDFLHIPQCSNASWGWRGILKGRNILAKGLKWCVGDGTAINFWHDWWTGKKPLIT 1421 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 124 bits (312), Expect(2) = 8e-28 Identities = 68/202 (33%), Positives = 111/202 (54%), Gaps = 1/202 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R++ W+ K LS G++ L++ VL+S+P++ +L P + + + RL FLW GS Sbjct: 1655 IEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSA 1714 
Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 +S +I+ +W+++ LP EG GL IR ++EV A +KL W+ T +SLW+ +++Y + Sbjct: 1715 ASKRIHWASWAKIALPVTEG-GLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR 1773 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFLN 540 Q L+ S WKR+ + + Q W G G + WHD W G+ PL I N Sbjct: 1774 GQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQG-NVFFWHDCWMGEAPL--ISSN 1830 Query: 541 QQFPQDHL-LASFFQPLSGSWH 603 Q+F + + FF + SW+ Sbjct: 1831 QEFTSSMVQVCDFF--TNNSWN 1850 Score = 27.3 bits (59), Expect(2) = 8e-28 Identities = 14/63 (22%), Positives = 27/63 (42%) Frame = +3 Query: 654 LSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWAPLIWNGISLARINCFGW 833 ++ P ++ +D+ + +G S SAW IR + IW+ + F W Sbjct: 1866 IAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 1925 Query: 834 RMM 842 R++ Sbjct: 1926 RLL 1928 >ref|XP_010038572.1| PREDICTED: uncharacterized protein LOC104427129 [Eucalyptus grandis] Length = 1429 Score = 117 bits (293), Expect(2) = 2e-27 Identities = 71/190 (37%), Positives = 95/190 (50%), Gaps = 1/190 (0%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I +R +W + LSF G+L LI+ VL +I AS+ LPSS+ IE +LR FLW G++ Sbjct: 1004 ITARARSWSHRFLSFVGRLQLIRSVLHAIQSFWASVFTLPSSVILGIESILRRFLWKGTS 1063 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQAST-MNSLWSSCFKLRYF 357 V+W+ V LPK EG GLGI I + N AS++K W T SLW + Sbjct: 1064 LDKGGAKVSWADVCLPKEEG-GLGIWSIKDCNKASMLKFIWILFTNKESLWCRWIHSNFL 1122 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFL 537 K + W A + WK+I L Q W GDG+ + W D+W P L Sbjct: 1123 KKHNFWVAPQPSVCAWSWKKILHLRGCCRQSFGWEVGDGRSTSLWFDSWLPCGP-----L 1177 Query: 538 NQQFPQDHLL 567 + Q PQ LL Sbjct: 1178 HLQVPQSFLL 1187 Score = 33.5 bits (75), Expect(2) = 2e-27 Identities = 22/91 (24%), Positives = 35/91 (38%) Frame = +3 Query: 576 LPASLRILALPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIR 755 +PA + +L S R + L + S PP D+ + + G S + AWN IR Sbjct: 1191 
VPAKATVASLYS-YAGRSCKVRLENCGFSLPPLLVDSDRFVWRHDSSGAFSVAFAWNVIR 1249 Query: 756 SKGTTLQWAPLIWNGISLARINCFGWRMMFN 848 + W +W+ R W + N Sbjct: 1250 ASKAKAPWTSFVWDKDVAPRFQFLLWLITKN 1280 >ref|XP_008385092.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus domestica] Length = 715 Score = 116 bits (290), Expect(2) = 2e-27 Identities = 60/175 (34%), Positives = 96/175 (54%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ W K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG +S Sbjct: 113 AKLTGWXGKLLSMAGRVQLTQSVFQSMLLHSFSVYKWPSSLLRPLSRCARNFIWSGDVTS 172 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ PK EG GLG+R + +N +L+KLG T +S WS + R+ Sbjct: 173 KKSVTVSWRQICAPKNEG-GLGLRDLGSLNTTALLKLGXLIITXDSPWSIYLRERFKLHG 231 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTI 531 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ + Sbjct: 232 RLYSC--SYKXSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKWL-DKPIXDV 283 Score = 34.3 bits (77), Expect(2) = 2e-27 Identities = 23/82 (28%), Positives = 34/82 (41%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS +S L IL P +D L + ST G S S + +R + WA Sbjct: 309 IPSXFSSTFXXLTX-EILEMPLPIDEDKDVLIWEVSTSGVFSFSDGYEIVRHRFPVKSWA 367 Query: 783 PLIWNGISLARINCFGWRMMFN 848 IW R + W+++FN Sbjct: 368 SXIWRPFIPPRYSILVWKILFN 389 >ref|XP_008370907.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus domestica] Length = 899 Score = 112 bits (279), Expect(2) = 3e-27 Identities = 56/166 (33%), Positives = 90/166 (54%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH S+ PSS+ + + R R+F+WSG + Sbjct: 297 AKLTGWKGKLLSMXGRVQLTQSVFQSMLLHSFSVYKWPSSLLRPLSRCARNFIWSGDVTX 356 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 K V+W Q+ K E GLG+R + +N +L+KLGW T +S WS + R+ Sbjct: 357 KKXVTVSWRQICAXKNE-XGLGLRDLGSLNTXALLKLGWLIITTDSPWSIYLRERFKLHG 415 Query: 367 
SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTW 504 +++ + S IW I+S+ H + Q W+ G+G + W D W Sbjct: 416 RLYSC--SYKRSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKW 459 Score = 38.1 bits (87), Expect(2) = 3e-27 Identities = 24/82 (29%), Positives = 36/82 (43%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS +S P L IL P +D L + ST G S S + +R + WA Sbjct: 493 IPSIFSSTFPDLTX-EILEMPLPIDEDKDVLIWEVSTSGVFSFSDGYEIVRHRFPVKSWA 551 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 552 SIIWRPFIPPRYSILVWKILFN 573 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 129 bits (323), Expect = 3e-27 Identities = 71/200 (35%), Positives = 108/200 (54%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R++ W+ K LS G++ L++ LSS+P++ +L P + + I RL +FLW GS Sbjct: 2906 IEERITGWENKILSPGGRITLLRSTLSSLPIYLLQVLKPPIIVLERINRLFNNFLWGGSA 2965 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 SS +I+ +W ++ LP EG GL IR + +V A +KL W+ T NSLW + +Y Sbjct: 2966 SSKRIHWASWGKIALPIAEG-GLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCG 3024 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIFLN 540 Q + L+ S WKR+ +++ Q W G GK + WHD W G++PL + N Sbjct: 3025 GQLPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGHGK-LFFWHDCWMGEEPL--VIRN 3081 Query: 541 QQFPQDHLLASFFQPLSGSW 600 Q+F S F L+ SW Sbjct: 3082 QEFASSMAQVSDFF-LNNSW 3100 Score = 123 bits (308), Expect = 2e-25 Identities = 66/178 (37%), Positives = 97/178 (54%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I R+S W+ K LS G++ L++ VLSS P++ +L P ++ + IERL FLW S Sbjct: 1112 IRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERLFNSFLWGDSC 1171 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFK 360 K++ WS++T P EG GL IR + +V A +KL W+ T NSLW+ + +Y Sbjct: 1172 
DGKKLHWTAWSKITFPVSEG-GLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCL 1230 Query: 361 DQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPLSTIF 534 + L+ S +WKR+ Q W G G E+ WHD W GD+PL+T+F Sbjct: 1231 GRIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKG-ELFFWHDCWMGDQPLATLF 1287 >ref|XP_008358855.1| PREDICTED: uncharacterized protein LOC103422573 [Malus domestica] Length = 1419 Score = 115 bits (288), Expect(2) = 4e-27 Identities = 59/172 (34%), Positives = 94/172 (54%) Frame = +1 Query: 7 SRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGSTSS 186 ++L+ WK K LS G++ L + V S+ LH + PSS+ + + R R+F+WSG +S Sbjct: 782 AKLTGWKXKLLSMXGRVQLTQSVFQSMLLHSFXVYKWPSSLLRXLSRCARNFIWSGXVTS 841 Query: 187 SKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTMNSLWSSCFKLRYFKDQ 366 V+W Q+ K EG GLG+R + +N +L+KLGW T +S WS + R+ Sbjct: 842 KXXVTVSWXQICAXKNEG-GLGLRDLGSLNTTALLKLGWLIITTDSPWSIYJRXRFKLHG 900 Query: 367 SIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTWSGDKPL 522 +++ + S IW I+S+ H + Q W+ G+G + W D W DKP+ Sbjct: 901 RLYSC--SYXRSSIWPGIKSILHILFQNCRWVIGNGSTTSLWVDKWL-DKPI 949 Score = 34.3 bits (77), Expect(2) = 4e-27 Identities = 22/82 (26%), Positives = 34/82 (41%) Frame = +3 Query: 603 LPSELTSRRPSLALLSILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTLQWA 782 +PS + P L IL P +D L + S G S S + +R + WA Sbjct: 978 IPSIFSXXFPDLTK-EILEMPLPIDEDKDVLIWEXSXSGXFSFSDGYEIVRHRFPVKSWA 1036 Query: 783 PLIWNGISLARINCFGWRMMFN 848 +IW R + W+++FN Sbjct: 1037 SIIWXPFIPPRYSILVWKILFN 1058 >ref|XP_008245249.1| PREDICTED: putative ribonuclease H protein At1g65750 [Prunus mume] Length = 601 Score = 120 bits (301), Expect(2) = 1e-26 Identities = 68/206 (33%), Positives = 104/206 (50%), Gaps = 5/206 (2%) Frame = +1 Query: 1 INSRLSAWKAKTLSFTGKLILIKHVLSSIPLHCASILPLPSSICKDIERLLRHFLWSGST 180 I ++ WK LS GK +LIK V+ +IP + S+ P++ C +++ + F W S Sbjct: 112 ICGKMHGWKHLLLSQAGKEVLIKAVIQAIPAYPMSVFKFPTTFCSELDSSIGKFWWGQSM 171 Query: 181 SSSKINHVNWSQVTLPKLEGSGLGIRRISEVNNASLIKLGWQASTM-NSLWSSCFKLRYF 357 S KI+ ++W + L K+EG G+G R SE NNA L GW+ NSLW+ + +YF Sbjct: 172 
DSDKIHWLSWENMGLAKIEG-GMGFRNFSEFNNALLASQGWRLLMYPNSLWAKILRDKYF 230 Query: 358 KDQSIWNATNTLYGSCIWKRIRSLAHFIHQGSAWIFGDGKEINTWHDTW----SGDKPLS 525 D + NA S W I I G+ W +G+ I+ WHD W G +PL Sbjct: 231 PDGDVLNAKKGARASWGWSSILEGIKIIRWGAQWQVVNGQNISLWHDRWLHPTLGARPLC 290 Query: 526 TIFLNQQFPQDHLLASFFQPLSGSWH 603 LN + ++ +A+ P + SW+ Sbjct: 291 P--LNHEMRKEVRVATIIDPDTRSWN 314 Score = 27.3 bits (59), Expect(2) = 1e-26 Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 13/80 (16%) Frame = +3 Query: 648 SILSSFPPNSTARDQLT*KDSTDGQLSQSSAWNAIRSKGTTL-------------QWAPL 788 +I S+ NS D++ + GQ S S ++ +RS T + + L Sbjct: 329 AICSTSIGNSAGLDRVIWPLNRHGQYSVKSGYHFLRSMETMRSVSRPSGSRFVDKKISKL 388 Query: 789 IWNGISLARINCFGWRMMFN 848 IWN +L +I F WR + N Sbjct: 389 IWNAKTLPKIRHFMWRAVRN 408