BLASTX nr result

ID: Cocculus23_contig00033965 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00033965
         (1075 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulga...   220   9e-55
emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga...   218   3e-54
ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...   215   2e-53
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...   214   4e-53
emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulga...   211   4e-52
emb|CAN78583.1| hypothetical protein VITISV_029931 [Vitis vinifera]   210   7e-52
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   209   1e-51
ref|XP_007214027.1| hypothetical protein PRUPE_ppa016677mg [Prun...   208   3e-51
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   208   4e-51
ref|XP_007202950.1| hypothetical protein PRUPE_ppa016504mg, part...   207   6e-51
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   206   1e-50
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   206   1e-50
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   206   1e-50
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   1e-50
emb|CCA66198.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
gb|AAD12028.1| putative non-LTR retroelement reverse transcripta...   205   3e-50
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   204   4e-50
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   204   7e-50

>emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1379

 Score =  220 bits (560), Expect = 9e-55
 Identities = 123/354 (34%), Positives = 184/354 (51%)
 Frame = -1

Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884
            FTW        R  S LDR+L N +W + FPS    +   G+SDH  +   + T+ N+GP
Sbjct: 183  FTWFR-----GRSKSVLDRLLLNPEWINEFPSMRLSLLQRGLSDHCPLLTNIHTQ-NWGP 236

Query: 883  KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704
            KPF+F N WL  P  L +V + W E  T  PM     KL+ VK  L +WN++ FG + T 
Sbjct: 237  KPFRFQNCWLTDPHCLEIVNKTWLES-TNMPM---IDKLRRVKIRLKAWNRDEFGHIDTN 292

Query: 703  TRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDS 524
             ++ E ++Q     + +   +   +E   +A+  L + ++R+E ++ Q SR+ WLK GD 
Sbjct: 293  IKIMEDEIQKFDTISNERELDEQEIERRKEAQSDLWMWMKRKELYWAQNSRILWLKHGDR 352

Query: 523  NTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFP 344
            NTKFFH     +   N I  ++V    IE  + +K      FK I      +        
Sbjct: 353  NTKFFHMVASNKKRRNFIASIKVNGRRIEKPNQIKEEAVTFFKEIFTEEFTERPTLEGLQ 412

Query: 343  VPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIK 164
               +  +  D L+   +  EID  V     DKAPGPDGFN  F K  W  I  DV + ++
Sbjct: 413  FNQLSQNQADSLIQPFSDEEIDYAVNSCASDKAPGPDGFNFKFIKNAWETIKEDVYTLVR 472

Query: 163  DFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            +F+A  ++ +G N+TF+ LIPK  N  N  DFRPIS+   +YKII+K++  R++
Sbjct: 473  EFWATSKLPKGSNSTFITLIPKIDNPENFKDFRPISMVGCVYKIIAKLMAKRIQ 526


>emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  218 bits (556), Expect = 3e-54
 Identities = 125/358 (34%), Positives = 184/358 (51%), Gaps = 3/358 (0%)
 Frame = -1

Query: 1066 KFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFG 887
            KFTW   Q       S+LDR+  + QW D+FP+ +  +    +SDH  I V+ K + N+G
Sbjct: 182  KFTWFRGQSK-----SKLDRMFIHPQWLDLFPTLQISLLKRTLSDHCPILVQTKLK-NWG 235

Query: 886  PKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCA---KLKNVKEALVSWNKNCFGE 716
            P+PF+FI+ WL HP  L L+ + W E          C+   KLK VK +L+ WN   FG 
Sbjct: 236  PRPFRFIDAWLSHPGCLKLISKTWLEA-------HDCSFSEKLKKVKSSLLKWNAEEFGC 288

Query: 715  VQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLK 536
            +    +  E ++Q+  + A D    AN LEE  +++  L + ++R+E  + Q+SRV+W+K
Sbjct: 289  IDEKIQSLENKIQEMDRIADDRNLEANELEERRKSQMDLWIWMKRKEVLWAQQSRVKWIK 348

Query: 535  GGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIP 356
             GD NT++FH     R   N I  L +    I+    LK     +F  +         + 
Sbjct: 349  EGDRNTRYFHIMATMRRKKNAIESLIIEQKQIDSPEDLKAAAVSYFSELFTEELSPRPVF 408

Query: 355  TDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVS 176
             D     +    ++ L    T+ EID+ V      K+PGPDGFN  F K+ W +I  DV 
Sbjct: 409  GDLNFKQLNDSHREILTSQFTRSEIDEAVSSCDGSKSPGPDGFNFKFVKQAWEVIKEDVY 468

Query: 175  SAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
              + +F+   R+ RG N   + LIPK  N     DFRPIS+   +YKIISKIL  RL+
Sbjct: 469  GIVNEFWHSSRLPRGCNTALIALIPKISNPEGFKDFRPISMVGCVYKIISKILARRLQ 526


>ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max]
          Length = 964

 Score =  215 bits (548), Expect = 2e-53
 Identities = 126/363 (34%), Positives = 194/363 (53%), Gaps = 7/363 (1%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMF--PSCEALVEDSGISDHHHITVKLKTE 899
            G  +TW+N     SR+WS+LDR L N+ W + F   +CE + E   ISDH  + V  +  
Sbjct: 552  GPLYTWTN-----SRVWSKLDRALCNQAWFNSFGNSACEVM-EFISISDHTPLVVTTELV 605

Query: 898  VNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFG 719
            V  G  PFKF N+ +DHP+FL +V + W++ + G  M +VC KLK +K  L +  K  F 
Sbjct: 606  VPRGNSPFKFNNLIVDHPNFLRIVADGWKQNIHGCSMFKVCKKLKALKAPLKNLFKQEFS 665

Query: 718  EVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWL 539
             +     LAEA+        + +P + ++L   ++ R    ++ + E   + Q  + ++L
Sbjct: 666  NISNRVELAEAEYNSVLNSIKQNPQDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYL 725

Query: 538  KGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSS-----ALKNHVTQHFKGILGSAN 374
               D  +KFFHA +K   +S  I  +R+ D     S      A  NH    F     +  
Sbjct: 726  LQADKCSKFFHALIKRNKHSRFIAAIRLEDGHNTSSQDEIALAFVNHFRNFFSAHELTQT 785

Query: 373  DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194
              I I    P   +P D    L+   ++ ++  ++  +  +KAPGPDGFN  FFK+ W+I
Sbjct: 786  PSISICNRGP--KVPTDCFAALLCPTSKQKVWNIISVMANNKAPGPDGFNVLFFKKAWNI 843

Query: 193  IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14
            +G+D+ +A+ +FF   +I + LN   + LIPK   AS +N FRPIS CN+LYKI+SKIL 
Sbjct: 844  VGDDIFAAVNEFFTTGKILKQLNHAIIVLIPKHDQASQVNHFRPISCCNLLYKIVSKILA 903

Query: 13   NRL 5
            NR+
Sbjct: 904  NRI 906


>emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1380

 Score =  214 bits (546), Expect = 4e-53
 Identities = 119/358 (33%), Positives = 189/358 (52%), Gaps = 2/358 (0%)
 Frame = -1

Query: 1069 DKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNF 890
            ++FTW           S+LDR   N +W   +P+ +  + + G+SDH  + +      N+
Sbjct: 181  ERFTWFRGNSK-----SKLDRCFVNPEWLTHYPTLKLSLLNRGLSDHCPLLLNSSVR-NW 234

Query: 889  GPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQ 710
            GPKPFKF N WL  P  + LVK+ W++     PM  V  KLK VK+ L  WN+  FG ++
Sbjct: 235  GPKPFKFQNCWLSDPRCMRLVKDTWQKSS---PMGLV-QKLKTVKKDLKDWNEKVFGNIE 290

Query: 709  TTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGG 530
               +  E ++    K + +   ++  LE++ +A+  L   ++ +E ++ Q+SR++WLK G
Sbjct: 291  ANIKQLEHEINQLDKISNERDLDSFELEKKKKAQVDLWSWMKTKESYWSQQSRIKWLKQG 350

Query: 529  DSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILG--SANDQIIIP 356
            D NTKFFH     R + N IT + V  D I +   +K    ++F+      S N  ++  
Sbjct: 351  DRNTKFFHVVASIRKHRNSITSIEVNGDKISEPEKIKLEAMKYFRKAFKEESYNRPLLEG 410

Query: 355  TDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVS 176
             DF   T        L+   +  EID+ V     DKAPGPDGFN  F K+ W +I  ++ 
Sbjct: 411  LDFKHLTEAQSAD--LIAPFSHEEIDKAVASCSSDKAPGPDGFNFTFIKKAWDVIKEEIY 468

Query: 175  SAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
              +++F+   R+ +G N  F+ LIPK  +     DFRPIS+   +YKI++K+L  RL+
Sbjct: 469  ETVQEFWNSSRLPKGCNMAFIALIPKTDSPKGFQDFRPISMVGCVYKIVAKLLTMRLQ 526


>emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  211 bits (537), Expect = 4e-52
 Identities = 118/340 (34%), Positives = 176/340 (51%)
 Frame = -1

Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842
            S LDR+  N +W    P+    +   G+SDH  + V  K E+++GPKPF+F N WL  P+
Sbjct: 192  SILDRLFVNPEWITNLPNLRVSLLQRGLSDHCPLLVHNK-ELDWGPKPFRFQNCWLSDPE 250

Query: 841  FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662
             L +VK  W++    + +     KLK VK+ L SWN   FG + +  +  E+++Q     
Sbjct: 251  CLKIVKAVWQDAEALHTI----GKLKEVKKRLKSWNLTEFGNIDSKIKKFESEIQHLDSI 306

Query: 661  AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482
                  +   LE   +A+  L   ++R E ++ Q SRV WLK GD NT FFHA    +  
Sbjct: 307  NNTRDLDTQELENRKEAQVELWKWIKRREMYWAQNSRVTWLKEGDRNTMFFHAIASNKRR 366

Query: 481  SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFPVPTIPMDLKDGLMV 302
             N IT + V    I++ S +K   T +FK I    +    +  D     +  +  + L +
Sbjct: 367  KNSITTVEVDGLKIDEPSRIKWEATTYFKKIFKEEHGCRPLFEDLNFKCVTHEQAEQLTL 426

Query: 301  DITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNA 122
              +  EID+ V     DKAPGPDGFN  F K  W II +D+   +  F+   R+ +G N 
Sbjct: 427  PFSCEEIDEAVSTCSSDKAPGPDGFNFKFIKSAWGIIKHDIYEMVHKFWESSRLPQGSNV 486

Query: 121  TFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
             ++ LIPK  N  N  D+RPIS+   LYKII+K++  RL+
Sbjct: 487  AYIALIPKMSNPKNFKDYRPISMVGCLYKIIAKVMAKRLQ 526


>emb|CAN78583.1| hypothetical protein VITISV_029931 [Vitis vinifera]
          Length = 1875

 Score =  210 bits (535), Expect = 7e-52
 Identities = 120/361 (33%), Positives = 191/361 (52%), Gaps = 4/361 (1%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVE---DSGISDHHHITVKLKT 902
            G  F+WS  ++  ++ W+RLDR L ++ W D    C  +V+      ISDH  I +K   
Sbjct: 915  GGVFSWSGGRN--NQAWARLDRFLVSQCWLD---KCCGVVQCRLPRPISDHFPIMLK-GG 968

Query: 901  EVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVT-GYPMQRVCAKLKNVKEALVSWNKNC 725
             +  GP PF+F N+WL    F +L++E W+  V  G    R+ +KLK +K+ +  WN+  
Sbjct: 969  GLRRGPSPFRFENMWLKVDGFKDLLREWWQGTVVRGKASFRLASKLKVLKQKIKEWNREV 1028

Query: 724  FGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQ 545
            FG ++    LA  Q++   +   +   + +  E + +A+   K  +  EE  +RQ SR  
Sbjct: 1029 FGRLEVNKSLALQQVEFWDRVESERSLSVSETEMKKEAKEXFKKWVLLEETHWRQMSREL 1088

Query: 544  WLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQI 365
            WLK GD N+ FFH    A   +N + R+++   W  +   ++  + Q+F+ +L       
Sbjct: 1089 WLKEGDKNSGFFHRMANAHRRTNSMDRIKINGVWRTEEQEVREGIVQNFQQLLTEEPSWR 1148

Query: 364  IIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGN 185
                   +P +     +GL V  T  EI   + D+  DKAPGPDGF G F++  W  +  
Sbjct: 1149 ADIEGLHLPRLNTCEAEGLEVPFTMEEIHSALMDMNGDKAPGPDGFTGAFWQTCWEFVKE 1208

Query: 184  DVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5
            ++    K+FF ++   + LN TFL LIPKK  A ++ +FRPISL   LYK+++K+L NRL
Sbjct: 1209 EIMDLFKEFFVQKSFAKSLNTTFLVLIPKKGGAEDLGEFRPISLLGGLYKLVAKVLANRL 1268

Query: 4    K 2
            K
Sbjct: 1269 K 1269


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  209 bits (533), Expect = 1e-51
 Identities = 125/363 (34%), Positives = 190/363 (52%), Gaps = 7/363 (1%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G+ FTW+N     + ++ RLDRV+ N +W   F S      +   SDH  + +   T   
Sbjct: 763  GNSFTWTN-----NHMFQRLDRVVYNPEWAHCFSSTRVQHLNRDGSDHCPLLISCATASQ 817

Query: 892  FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713
             GP  F+F++ W  H DFL  V+  W+  +    +     K + +K  L  WNK  FG++
Sbjct: 818  KGPSTFRFLHAWTKHHDFLPFVERSWQVPLNSSGLTAFWIKQQRLKRDLKWWNKQIFGDI 877

Query: 712  QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533
                + AE + +   K  Q DPS+ N     ++A   L   L  EE F++QKS V+WL  
Sbjct: 878  FEKLKRAEIEAEKREKEFQQDPSSIN-RNLMNKAYAKLNRQLSIEELFWQQKSGVKWLVE 936

Query: 532  GDSNTKFFHAKMKARWNSNQITRLRVGDDWI-EDSSALKNHVTQHFKGILGSAN------ 374
            G+ NTKFFH +M+ +   N I R++  +  I ED   ++N   Q+F+ +L +        
Sbjct: 937  GERNTKFFHLRMRKKRVRNNIFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRF 996

Query: 373  DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194
            D  +IP      TI +   + L    +  EI +VV ++  D   GPDGF+  F++  W I
Sbjct: 997  DPSLIPR-----TISITDNEFLCAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDI 1051

Query: 193  IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14
            I  D+  A+ DFF    + +G+ +T L L+PKKPN+   +DFRPISLC +L KI++K L 
Sbjct: 1052 IKQDLLEAVLDFFNGTPMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLA 1111

Query: 13   NRL 5
            NRL
Sbjct: 1112 NRL 1114


>ref|XP_007214027.1| hypothetical protein PRUPE_ppa016677mg [Prunus persica]
            gi|462409892|gb|EMJ15226.1| hypothetical protein
            PRUPE_ppa016677mg [Prunus persica]
          Length = 1421

 Score =  208 bits (530), Expect = 3e-51
 Identities = 125/360 (34%), Positives = 191/360 (53%), Gaps = 6/360 (1%)
 Frame = -1

Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884
            FTWSN ++  + +  RLDR L +  W++ FP           SDH  I +   + V +GP
Sbjct: 450  FTWSNLRE--NAVCRRLDRFLVSGSWEEHFPHYRHKALPRITSDHCPIELD-SSRVKWGP 506

Query: 883  KPFKFINVWLDHPDFLNLVKEKW-EEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQT 707
             PF+F N+WL+HPDF   +K  W E+Q+ G+   +   +LK +K  L  W+K  FG+V+ 
Sbjct: 507  SPFRFENMWLNHPDFKRKIKLWWGEDQIPGWEGYKFMTRLKMLKSKLKVWSKEEFGDVER 566

Query: 706  TTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527
              R AEA+L    +R   +  +  +  E       +  + +REE  +RQ+ +V+W + GD
Sbjct: 567  DLREAEARLLVLDQREGTEGLDHLLRSERDNLLLKIGDLAQREEVKWRQRGKVKWAREGD 626

Query: 526  SNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350
             NTKFFH         N I +L V D   IE  + ++  V + FKG+  S N  +    +
Sbjct: 627  GNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDANIEREVIRFFKGLY-SRNKNVGWGVE 685

Query: 349  ----FPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182
                 P+  +  D    L       E+ + V D   DK+PGPDGF+  FF+  W ++  D
Sbjct: 686  GLNWCPISQVEADW---LERPFDLEEVQKAVFDCGKDKSPGPDGFSMSFFQSCWEVVKGD 742

Query: 181  VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            +   ++DFF    ++   N TF+CLIPKK N+  + D+RPISL   LYK+ISK+L +RL+
Sbjct: 743  LMKVMQDFFQSGIVNGVTNETFICLIPKKANSVKVTDYRPISLVTSLYKVISKVLASRLR 802


>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score =  208 bits (529), Expect = 4e-51
 Identities = 132/377 (35%), Positives = 186/377 (49%), Gaps = 20/377 (5%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G  FTW N++D    IW +LDRV+ NE WK ++P    + E  G SDH    + L     
Sbjct: 599  GPLFTWCNKRDNDP-IWKKLDRVMVNEAWKMVYPQSYNVFEAGGCSDHLRCRINLNMNSG 657

Query: 892  F---GPKPFKFINVWLDHPDFLNLVKEKWEE----QVTGYPMQRVCAKLKNVKEALVSWN 734
                G KPFKF+N   D  +F  LV+  W E     ++   + R   KLK +K  L    
Sbjct: 658  AQVRGNKPFKFVNAVADMEEFKPLVENFWRETEPIHMSTSSLFRFTKKLKALKPKLRGLA 717

Query: 733  KNCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKS 554
            K   G +   TR A   L    +    +PS    +E ES+A      +   EEK+ +Q S
Sbjct: 718  KEKMGNLVKRTREAYLSLCQAQQSNSQNPSQ-RAMEIESEAYVRWDRIASIEEKYLKQVS 776

Query: 553  RVQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSA 377
            ++ WLK GD N K FH    AR   N I  ++  D         +KN   + F+  L   
Sbjct: 777  KLHWLKVGDKNNKTFHRAATARAAQNSIREIQKEDGSTATTKDDIKNETERFFQEFLQ-- 834

Query: 376  NDQIIIPTDFPVPTI------------PMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPD 233
                +IP D+   T+            P + KD L   ++  EI   +  +  DK+PGPD
Sbjct: 835  ----LIPNDYEGITVEKLTSLLPYHCSPAE-KDMLTASVSAKEIRGALFSMPNDKSPGPD 889

Query: 232  GFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISL 53
            G+  +F+KR W IIG +   A+K FF +  + +G+N T L LIPKK  A  + D+RPIS 
Sbjct: 890  GYTSEFYKRAWDIIGAEFVLAVKSFFEKGFLPKGVNTTILALIPKKLEAKEMKDYRPISC 949

Query: 52   CNILYKIISKILVNRLK 2
            CN++YK+ISKI+ NRLK
Sbjct: 950  CNVIYKVISKIIANRLK 966


>ref|XP_007202950.1| hypothetical protein PRUPE_ppa016504mg, partial [Prunus persica]
            gi|462398481|gb|EMJ04149.1| hypothetical protein
            PRUPE_ppa016504mg, partial [Prunus persica]
          Length = 1162

 Score =  207 bits (527), Expect = 6e-51
 Identities = 123/356 (34%), Positives = 187/356 (52%), Gaps = 2/356 (0%)
 Frame = -1

Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884
            FTWSN ++  + +  RLDR L +  W+D FP           SDH  I +   + V +GP
Sbjct: 168  FTWSNLRE--NAVCRRLDRFLVSGSWEDHFPHYRHKALPRITSDHCPIELDT-SRVKWGP 224

Query: 883  KPFKFINVWLDHPDFLNLVKEKW-EEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQT 707
             PF+F N+WL+HPDF+  +K  W E+Q+ G+   +   +LK +K  L  W+K  FG+V+ 
Sbjct: 225  SPFRFENMWLNHPDFMRKIKLWWGEDQIPGWEGYKFMTRLKMLKSKLKVWSKEEFGDVER 284

Query: 706  TTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527
              R AEA+L    +R   +  +  +  E       +  + ++EE  +RQ+ +V+W + GD
Sbjct: 285  DLREAEARLLVLDQREGTEGLDHLLRSERDNLLLKIGDLAQKEEVKWRQRGKVKWAREGD 344

Query: 526  SNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350
             NTKFFH         N I +L V D   IE  + ++  V + FKG+  S  +       
Sbjct: 345  GNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDANIEREVIRFFKGLYSSNKNVGWGVEG 404

Query: 349  FPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSA 170
                 I     D L       E+ + V +   DK+PGPDGF+  FF+  W ++  D+   
Sbjct: 405  LNWCPISQVEADWLERPFDLEEVQKAVFECGKDKSPGPDGFSMSFFQSCWEVVKGDLMKV 464

Query: 169  IKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            ++DFF    ++   N TF+CLIPKK N+  + D RPISL   LYK+ISK+L +RL+
Sbjct: 465  MQDFFQSGIVNGVTNETFICLIPKKANSVKVTDNRPISLVTSLYKVISKVLASRLR 520


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  206 bits (525), Expect = 1e-50
 Identities = 118/358 (32%), Positives = 197/358 (55%), Gaps = 2/358 (0%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G+ FTW+N     +R++ RLDRV+ N QW +MFP       +   SDH  + +       
Sbjct: 1060 GNPFTWTN-----NRMFQRLDRVVYNHQWINMFPITRIQHLNRDGSDHCPLLISCFISSE 1114

Query: 892  FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713
              P  F+F + W+ H DF   V+  W   + G  +Q    K   +K+ L  WNK  FG++
Sbjct: 1115 KSPSSFRFQHAWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDI 1174

Query: 712  QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533
             +  + AE ++++C    Q + +  + +   +++   L   L  EE F++QKS V+W+  
Sbjct: 1175 FSKLKEAEKRVEECEILHQQEQTVGSRINL-NKSYAQLNKQLNVEEIFWKQKSGVKWVVE 1233

Query: 532  GDSNTKFFHAKMKARWNSNQITRLRVGDD-WIEDSSALKNHVTQHFKGILGSANDQIIIP 356
            G+ NTKFFH +M+ +   + I +++  D  WIED   LK    ++F  +L +    I   
Sbjct: 1234 GERNTKFFHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLLKAEPCDISRF 1293

Query: 355  TDFPVPTIPMDLKDGLM-VDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDV 179
             +  +P+I  + ++ L+  +    E+   V D+  + A GPDGF+  F+++ W+ I +D+
Sbjct: 1294 QNSLIPSIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDL 1353

Query: 178  SSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5
              A++DFF    I RG+ +T L L+PKK +AS  ++FRPISLC ++ KII+K+L NRL
Sbjct: 1354 LDAVRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRL 1411


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score =  206 bits (525), Expect = 1e-50
 Identities = 122/366 (33%), Positives = 190/366 (51%), Gaps = 9/366 (2%)
 Frame = -1

Query: 1075 VGDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMF--PSCEALVEDSGI---SDHHHITVK 911
            +G  +TWSN +     +  RLDR + NE+W + +   SC AL   + +   SDHH + + 
Sbjct: 176  LGAFYTWSNGRLGSDNVALRLDRAICNEEWVNFWRSSSCSALGNSALVRHQSDHHPLLMS 235

Query: 910  LKTEVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNK 731
            +    +     FKF   W +H D   +V E W +   G+ M R+ AKLK++K+    WN+
Sbjct: 236  MDFCTSQRSGNFKFFKTWTEHEDCRRIVAENWSKHTRGHGMTRLQAKLKHMKQVFRHWNR 295

Query: 730  NCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSR 551
              FG+V    R+A  ++    +       +  +  +E +A   L   L  +++ +R+K R
Sbjct: 296  TVFGDVDRKVRMAVEEVNRIQQIIDSVGFSDQLYAQELEAHLILTKALHYQDELWREKLR 355

Query: 550  VQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSAND 371
             Q    GD NT +FH   K R   N I+ L+ GD  I D + ++ HV  +F+ I   + D
Sbjct: 356  DQRFIHGDRNTAYFHRISKVRATKNTISFLQDGDAVITDPARIEVHVLNYFQAIF--SVD 413

Query: 370  QIIIPTDFPVPTIPMDL----KDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRT 203
               I  D  V TIP  +     + L+     GE+   V  L  D APGP+GF G F++  
Sbjct: 414  NSCIQNDLVVDTIPSLVSNVDNNSLLRLPLWGEVKNAVFTLNGDGAPGPNGFGGHFYQTY 473

Query: 202  WHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISK 23
            W I+G DV  +++DFF   ++ + +N+  + LIPK P A  + D+RPI+L N  +KIISK
Sbjct: 474  WDIVGADVIQSVQDFFISGQLAQNINSNLIVLIPKVPGARVMGDYRPIALANFQFKIISK 533

Query: 22   ILVNRL 5
            IL +RL
Sbjct: 534  ILADRL 539


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  206 bits (524), Expect = 1e-50
 Identities = 118/363 (32%), Positives = 197/363 (54%), Gaps = 7/363 (1%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G+ FTW+N     +R++ RLDR++ N  W + FP       +   SDH  + +       
Sbjct: 1230 GNPFTWTN-----NRMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNSSE 1284

Query: 892  FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713
              P  F+F + W+ H DF   V+  W   + G  +Q   +K   +K+ L  WNK  FG++
Sbjct: 1285 KAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFGDI 1344

Query: 712  QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533
             +  + AE ++++C    Q++ +  ++++  +++   L   L  EE F++QKS V+W+  
Sbjct: 1345 FSKLKEAEKRVEECEILHQNEQTVESIIKL-NKSYAQLNKQLNIEEIFWKQKSGVKWVVE 1403

Query: 532  GDSNTKFFHAKMKARWNSNQITRLRVGDD-WIEDSSALKNHVTQHFKGIL------GSAN 374
            G+ NTKFFH +M+ +   + I +++  D  WIED   LK    ++F  +L       S  
Sbjct: 1404 GERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYFSSLLKFEPCDDSRF 1463

Query: 373  DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194
             + +IP+      I     + L  +    E+   V  +  + A GPDGF+  F+++ W+I
Sbjct: 1464 QRSLIPS-----IISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNI 1518

Query: 193  IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14
            I +D+  A++DFF    I RG+ +T L L+PKKP+AS  +DFRPISLC ++ KII+K+L 
Sbjct: 1519 IAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLS 1578

Query: 13   NRL 5
            NRL
Sbjct: 1579 NRL 1581


>emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score =  206 bits (524), Expect = 1e-50
 Identities = 115/342 (33%), Positives = 172/342 (50%), Gaps = 2/342 (0%)
 Frame = -1

Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842
            S LDR+L + +W    P+ +  +   G+SDH  + V    +  +GPKPF+F N WL  P 
Sbjct: 193  SLLDRLLVSPEWVSHCPNIKVSILQRGLSDHCPLLVHSHIQ-EWGPKPFRFNNCWLTDPK 251

Query: 841  FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662
             + +V+  W       P   V  KLK  K+ L  WN N FG +    R  E  + +  K 
Sbjct: 252  CMKIVEASWSSS----PKISVVEKLKETKKRLKEWNLNEFGSIDANIRKLEDCIANFDKE 307

Query: 661  AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482
            A +   +   LE+  +A+  L   ++R+E ++ Q+SR+ WLK GD NTKFFHA    +  
Sbjct: 308  ADERELDKEELEKRREAQADLWKWMKRKEIYWAQRSRITWLKAGDKNTKFFHAIASNKKR 367

Query: 481  SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPT--DFPVPTIPMDLKDGL 308
             N +  +        D S +K      FK I     D +  PT  +  +  +  +  + L
Sbjct: 368  KNMMACIETDGQSTNDPSQIKKEARAFFKKIF--KEDHVKRPTLENLHLKRLSQNQANSL 425

Query: 307  MVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGL 128
            +   T  EID  V     DKAPGPDGFN  F K  W II  D+   + DF+    + +G 
Sbjct: 426  ITPFTTEEIDTAVSSCASDKAPGPDGFNFKFVKSAWDIIKTDIYGIVNDFWETGCLPQGC 485

Query: 127  NATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            N  ++ LIPK  N S++ D+RPIS+   +YKI++K+L  RL+
Sbjct: 486  NTAYIALIPKIDNPSSLKDYRPISMVGFIYKIVAKLLAKRLQ 527


>emb|CCA66198.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  206 bits (523), Expect = 2e-50
 Identities = 116/340 (34%), Positives = 178/340 (52%)
 Frame = -1

Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842
            S+LDRVL   +W + FP+    + +  ISDH  + ++  + V++GP+PFKF +VWL H  
Sbjct: 192  SKLDRVLVQAEWIEKFPALAVSILNRSISDHCPLLLQ-SSIVDWGPRPFKFQDVWLSHKG 250

Query: 841  FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662
             + +V++ W +      MQ    KLK VK  L +WN   FG +     L EA++Q     
Sbjct: 251  CMEIVEKAWIQSKELTLMQ----KLKKVKLDLKTWNSESFGNIDANILLREAEIQKWDSE 306

Query: 661  AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482
            A         ++  +QA+  L   L+++E ++ Q+SR++WLK GD NTKFFH     R +
Sbjct: 307  ANSRDLEPEEIKTRAQAQLELWEWLKKKEIYWAQQSRIKWLKSGDRNTKFFHICASIRRS 366

Query: 481  SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFPVPTIPMDLKDGLMV 302
             N I+ + +    IED   +K    ++FK +      +    T+     +       +  
Sbjct: 367  KNNISSILLQGKKIEDPIIIKEEAVKYFKNLFTEDFKERPTFTNLSFKKLSESQAFSISA 426

Query: 301  DITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNA 122
              +  EID+ V      K+PGPDGFN  F K +W +I +D  S I++F+    + RG N 
Sbjct: 427  PFSTTEIDEAVASCNPSKSPGPDGFNFKFIKASWDLIKHDFYSIIQEFWHTGILPRGSNV 486

Query: 121  TFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
             F+ LI K  + S   DFRPIS+   +YKIISK+L  RLK
Sbjct: 487  AFIALIAKIESPSGFKDFRPISMVGCVYKIISKLLAGRLK 526


>emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  206 bits (523), Expect = 2e-50
 Identities = 113/354 (31%), Positives = 175/354 (49%)
 Frame = -1

Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884
            FTW + Q       S+LDR+L N +W  +FPS +  +    +SDH  + VK   E+N+GP
Sbjct: 183  FTWFSGQAK-----SKLDRLLVNPEWVSLFPSLQVSILRRNLSDHCPLLVK-SDELNWGP 236

Query: 883  KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704
            +PF+F N WL HP  L ++K+ W    +G     +  KLK  K+ L  WN + FG +   
Sbjct: 237  RPFRFQNCWLSHPGCLQIIKDVWASHTSG----NLTDKLKETKKRLKIWNSSEFGHIDRN 292

Query: 703  TRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDS 524
                E ++ +    +         L E   ++  L + L R+E F+ Q SR +W+K GD 
Sbjct: 293  IEELEDRIHNLDLISNGRDLQLEELAERRSSQMELWVWLRRKEAFWAQNSRAKWIKEGDK 352

Query: 523  NTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFP 344
            NTK+FH     R   N I  L   +  + D + + +     FK I         +     
Sbjct: 353  NTKYFHTLASTRKKKNTIPALITNNGVVSDPAGIHHEAVSFFKSIFKEDFSSRPVFNGLQ 412

Query: 343  VPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIK 164
              ++  +    L    +  E+D+ V      KAPGPDG+N  F K +W II  DV + ++
Sbjct: 413  FRSLSCEQVSQLTEPFSHKEVDEAVESCDPQKAPGPDGYNFRFIKDSWDIIKLDVYNIVE 472

Query: 163  DFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            +F+    + +G N  F+ LI K+     +NDFRPIS+   +YKII+K+L  RL+
Sbjct: 473  NFWNSGSLPKGSNVAFIALIAKREVPEGLNDFRPISMVGCIYKIIAKLLARRLQ 526


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  206 bits (523), Expect = 2e-50
 Identities = 122/360 (33%), Positives = 195/360 (54%), Gaps = 6/360 (1%)
 Frame = -1

Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884
            ++W+N+     RI SR+D+   N  W + +P       ++GISDH  +   L T+ + G 
Sbjct: 183  YSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPLIFNLATQHDEGG 242

Query: 883  KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704
            +PFKF+N   D   F+ +VKE W      + M+ +  +L+ VK AL S++   F +    
Sbjct: 243  RPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKAHC- 301

Query: 703  TRLAEAQLQDCSKRAQDDPSNANVL-EEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527
             ++ E + +  + +A  + S  + L EEE      L+     +E   +QKSR+QWL  GD
Sbjct: 302  -QVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLGD 360

Query: 526  SNTKFFHAKMKARWNSNQITRLRVG-DDWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350
            SN+KFF   +K R   N+I  L+    D + +++ ++N +   ++ +LG+++ Q+    D
Sbjct: 361  SNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEA-ID 419

Query: 349  FPVPTIPMDLKDG----LMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182
              V  +   L       L+  IT  EIDQ + D+   KAPG DGFN  FFK++W +I  +
Sbjct: 420  LHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQE 479

Query: 181  VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2
            +   I DFF    +H+ +N T + LIPK   A +  D+RPI+ C+ LYKIISKIL  RL+
Sbjct: 480  IYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQ 539


>gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1447

 Score =  205 bits (521), Expect = 3e-50
 Identities = 129/377 (34%), Positives = 197/377 (52%), Gaps = 20/377 (5%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKT--- 902
            G  +TWSN+++    I  +LDRV+ N+ W   FP   ++ E  G  DH    + L     
Sbjct: 592  GPLYTWSNKREH-DLIAKKLDRVMVNDVWTQSFPQSYSVFEAGGCLDHLRGRINLNDGPG 650

Query: 901  EVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQ----VTGYPMQRVCAKLKNVKEALVSWN 734
             +  G +PFKF+NV  +  DF   V   W+E     ++   + R   KLK++K  L +  
Sbjct: 651  SIVRGKRPFKFVNVLTEMEDFKPTVDSYWKETEPIFLSTSSLFRFSKKLKSLKPLLRNLA 710

Query: 733  KNCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKS 554
            K   G +   TR A   L    +   ++P+  N ++EE +A    + +   EEKF ++KS
Sbjct: 711  KERLGNLVKKTREAYDTLCKKQESTLNNPT-PNAMKEEVEAHDRWEHVAGLEEKFLKKKS 769

Query: 553  RVQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIE-DSSALKNHVTQHFKGILGSA 377
            ++ WL GGD N K FH  +  R   N I+ ++  D  +      +K +  + F+  L   
Sbjct: 770  KLHWLDGGDKNNKAFHRAVVTREAQNSISEIQCQDGSVTAKGDEIKAYAERFFREFLQ-- 827

Query: 376  NDQIIIPTDFPVPTIPMDLKDGLMVD------------ITQGEIDQVVRDLKIDKAPGPD 233
                +IP ++   T+  DL+D L               +T  EI +V+  +  DK+PGPD
Sbjct: 828  ----LIPNEYEGVTMA-DLQDLLPFRCSETEHELLTRVVTAEEIKKVLFSMPNDKSPGPD 882

Query: 232  GFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISL 53
            GF  +FFK TW I+GN+   AI+ FFA+  + +G+N T L LIPKK  A  + D+RPIS 
Sbjct: 883  GFTSEFFKATWEILGNEFILAIQSFFAKGFLPKGINTTILALIPKKKEAKEMKDYRPISC 942

Query: 52   CNILYKIISKILVNRLK 2
            CN++YK+ISKI+ NRLK
Sbjct: 943  CNVIYKVISKIIANRLK 959


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  204 bits (520), Expect = 4e-50
 Identities = 119/359 (33%), Positives = 189/359 (52%), Gaps = 3/359 (0%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G+ FTW+N     +R++ RLDRV+ N++W + F S      +   SDH  + +       
Sbjct: 1024 GNSFTWTN-----NRMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDHCPLLISCSNTNQ 1078

Query: 892  FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713
             GP  F+F++ W  H DF++ V++ W   +    +     K + +K  L  WNK+ FG++
Sbjct: 1079 RGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRDLKWWNKHIFGDI 1138

Query: 712  QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533
                RLAE + +      Q +PS AN  E   +A   L   L  EE F++QKS V+WL  
Sbjct: 1139 FKILRLAEVEAEQRELNFQQNPSAAN-RELMHKAYAKLNRQLSIEELFWQQKSGVKWLVE 1197

Query: 532  GDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIP 356
            G+ NTKFFH +M+ +   N I R++  + + +E+   ++N   + F+ +L +    I   
Sbjct: 1198 GERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNLLKAEQCDISRF 1257

Query: 355  TDFPVPTIPMDLKDGLMVDITQG--EIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182
                 P I +   D   +  T    E+ + V ++  D   GPDGF+  F++  W II  D
Sbjct: 1258 DPSITPRI-ISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQD 1316

Query: 181  VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5
            +  A+ DFF    + RG+ +T L L+PK  N S  ++FRPISLC +L KI++K+L NRL
Sbjct: 1317 LFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVLNKIVTKLLANRL 1375


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  204 bits (518), Expect = 7e-50
 Identities = 116/358 (32%), Positives = 195/358 (54%), Gaps = 2/358 (0%)
 Frame = -1

Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893
            G+ FTW+N     +R++ RLDR++ N+QW + FP       +   SDH  + +       
Sbjct: 1023 GNPFTWTN-----NRMFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHCPLLLSCSNSSE 1077

Query: 892  FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713
              P  F+F++ W  H +F   V+  W   + G  +    +K K +K+ L  WNK  FG++
Sbjct: 1078 KAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDI 1137

Query: 712  QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533
             +  + AE ++++C    Q + +  + ++  +++   L   L  EE F++QKS V+W+  
Sbjct: 1138 FSNIKEAEKRVEECEILHQQEQTIGSRIQL-NKSYAQLNKQLSMEEIFWKQKSGVKWVVE 1196

Query: 532  GDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIP 356
            G+ NTKFFH +M+ +   + I +++  D +WIED   L+      F  +L + +      
Sbjct: 1197 GERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRF 1256

Query: 355  TDFPVPTIPMDLKDG-LMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDV 179
                 P+I  D  +G L  + T  E+ + V  +  + A GPDGF+  F+++ W II +D+
Sbjct: 1257 QSSLCPSIISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDL 1316

Query: 178  SSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5
              A+K+FF    I +G+ +T L LIPK  +AS  ++FRPISLC ++ KII+KIL NRL
Sbjct: 1317 FEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRL 1374

