BLASTX nr result

ID: Papaver25_contig00034581 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00034581
         (1399 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   153   2e-34
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...   117   9e-24
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   117   1e-23
gb|ABD28730.1| Ribonuclease H [Medicago truncatula]                   108   6e-21
ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, part...   108   6e-21
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   104   1e-19
ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261...   103   2e-19
ref|XP_003621690.1| Cytochrome c biogenesis protein ccsA [Medica...   102   3e-19
emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulga...   100   2e-18
ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A...   100   3e-18
ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein A...    99   3e-18
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    97   2e-17
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    97   2e-17
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    94   1e-16
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...    94   1e-16
ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein A...    92   4e-16
ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein A...    92   7e-16
ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein A...    89   5e-15
ref|XP_004228797.1| PREDICTED: putative ribonuclease H protein A...    87   1e-14
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    87   2e-14

>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  153 bits (387), Expect = 2e-34
 Identities = 120/451 (26%), Positives = 203/451 (45%), Gaps = 16/451 (3%)
 Frame = -1

Query: 1372 FCASSILPGIKWVYSLFEDNTKILIGDGRDTSIFYDVWVGDMAITNILEDFSLN-RAVLV 1196
            +  SSI PG++  + L ++NT+ L+G G   S + D ++G   I       +LN  + LV
Sbjct: 399  YAPSSIWPGVRKFWGLVQNNTRWLVGTGDKISFWRDNFLGRPLIEFFGNHGALNDNSSLV 458

Query: 1195 SDIIVNGSWNLTDEYRHTLLAAGIEEEDLPFIFNG--EDRQIWRPATSGQFTVKAAKSLI 1022
            SD I NGSW L    +  L A       +P   N   ED+ IW+ +++G+ T K A   +
Sbjct: 459  SDYIDNGSWVLPPLLQLNLSAVCNLICQVPISINPSMEDKLIWQASSTGELTAKQAFLFL 518

Query: 1021 RKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAEE 842
            ++    +     LW   + P ++   WK++R    +   ++ R  + ++++C  C N+ E
Sbjct: 519  QQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRR-GVALVSRCEFCGNSTE 577

Query: 841  SLDHLLWHCRFAERAWSWISDIFGFRSHRNLATAYRAI---RGRSGMIKELWLLALPVVR 671
            SLDH+  HC FA   W+    IF      N      ++     RS  +KELWL+    + 
Sbjct: 578  SLDHIFLHCSFAASVWNHFIYIFEIGLVPNTIAEVFSLGLAMDRSPQLKELWLICFTSIL 637

Query: 670  SELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKV--RH 497
              +W  RN+  F+++  S       V   IQ  S    G+M N++ DL IL  F    R 
Sbjct: 638  WYIWHARNQIRFDSRTFSVAGVCRLVSRHIQASSRLATGHMHNTIHDLCILKSFGACCRS 697

Query: 496  RKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKI 323
            R++  +  VE IW PP  G + +  D                R +    +GA +  +   
Sbjct: 698  RRIPRM--VEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFRYYKGQFVGAFASHIDIP 755

Query: 322  TNYMAEIFSILIGLE--WALQWGYTKVCIRSDSLGAVLAYCESK--LPWFVLQRWREISS 155
            ++  A++  ++  +E  W   W +  + +       VL Y  S   +PW +  RW     
Sbjct: 756  SSIAAKVMVVITAIELAWVRDWKHVWLEV---DFSTVLDYIRSPSLVPWQLRVRWLNCLY 812

Query: 154  QYDAIRF--DHSYRETNFAADKMAKNGCTLA 68
            +   + F   H +RE N  AD +A +G +++
Sbjct: 813  RISTMTFKSSHIFREGNRVADALANHGTSMS 843


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 364

 Score =  117 bits (294), Expect = 9e-24
 Identities = 88/350 (25%), Positives = 156/350 (44%), Gaps = 10/350 (2%)
 Frame = -1

Query: 1087 DRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLD 908
            D+ IW P +SG+ + K A   +R R+  L+   L+W   + P ++  +WK+LR    + D
Sbjct: 3    DKLIWVPLSSGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSED 62

Query: 907  QVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIF--GFRSHRNLATAYR 734
             ++ R  I + ++C LC    ESL H+   C FA   W+  + +F  G      +   Y 
Sbjct: 63   LLQRR-GIALASRCVLCGRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQNLVDLLYY 121

Query: 733  AIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKG 554
               GRS  +KE+WL+        +W+ RN+   +N  +     +  +   ++  S    G
Sbjct: 122  GGVGRSHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKTASKLALG 181

Query: 553  YMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXX 374
             M NS+ +L++L  F +  R  +     E  W PP  G + +  D               
Sbjct: 182  CMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKTTGKSGYGGI 241

Query: 373  XRDH--NVIGAISMGLGKITNYMAEIFSIL--IGLEWALQWGYTKVCIRSDSLGAVLAYC 206
             RD   + +GA +  L  + +  AE+ +++  I L W   W +  + +  DS+  VL + 
Sbjct: 242  FRDFHGSFLGAFASNLEILNSVDAEVMAVIQAIELAWVRDWEH--IWLEVDSI-IVLNFL 298

Query: 205  ESK--LPWFVLQRWREISSQYDAIRF--DHSYRETNFAADKMAKNGCTLA 68
            +    +PW +   W     +   + F   H +RE N  AD +A  G +++
Sbjct: 299  QDPHLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLSMS 348


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 751

 Score =  117 bits (293), Expect = 1e-23
 Identities = 100/410 (24%), Positives = 177/410 (43%), Gaps = 15/410 (3%)
 Frame = -1

Query: 1372 FCASSILPGIKWVYSLFEDNTKILIGDGRDTSIFYDVWVGDMAITNI-LEDFSLNRAVLV 1196
            +  SS+  G+K V  L  ++++ +IGDG     + D W+    I  + +   S      V
Sbjct: 343  YFTSSVWHGLKRVLPLLFEHSRWIIGDGNSILFWSDKWLHSSIIQQLNMGSLSHLLNSRV 402

Query: 1195 SDIIVNGSWNLTDEYRHTLLAAGIEEEDLPFIFNGE-DRQIWRPATSGQFTVKAAKSLIR 1019
            +D I +  W L   + +       +  ++P     E D  IW  ++SG F+      L+R
Sbjct: 403  ADFIWDQQWALPSHFSNLFPDCAKQILEIPLPNTPESDILIWEHSSSGIFSFSDGYELVR 462

Query: 1018 KRIAKLECTNLLWRTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCN-NAEE 842
                KL+  + +W + + P  +   W+I      T DQ++ R  I  ++ C LC+ +  E
Sbjct: 463  PYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQLQRR-GIPFVSVCQLCSFSHTE 521

Query: 841  SLDHLLWHCRFAERAWSWISDIFG--FRSHRNLATAYRAIRGR--SGMIKELWLLALPVV 674
             + HL  +C FA+  W W++  FG    S  +L   + ++ G+  S  +K +W  +    
Sbjct: 522  DIPHLFVNCSFAQHIWQWLAYYFGTSLPSSGSLNDLWSSVTGKAFSPQLKNIWFASCLFA 581

Query: 673  RSELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKGY---MFNSVEDLKILDFFKV 503
               +W++ N+  F+NK+ S       VF  ++ +   +  Y       V D K+L    V
Sbjct: 582  LMAIWKSHNKLRFDNKQPS----LMRVFRSVKAWVRYIAPYTPGCVRGVLDSKVLSSMGV 637

Query: 502  -RHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGL 332
                K Q  L +  +W PP    L L  +                RD    +IG    GL
Sbjct: 638  ILVLKCQSALRI-VLWHPPLIPWLKLNTNGFSKGNPGLAGCGGVFRDSFGRLIGGYCQGL 696

Query: 331  GKITNYMAEIFSILIGLEWALQWGYTKVCIRSDSLGAVLAYCESKL--PW 188
            G  T +  E+ ++++G+E+A  +G+  + + SDS   +     S    PW
Sbjct: 697  GTQTTFFVELMTVILGVEFAFHFGWHHIWLESDSTTILQCISSSSFAPPW 746


>gb|ABD28730.1| Ribonuclease H [Medicago truncatula]
          Length = 409

 Score =  108 bits (270), Expect = 6e-21
 Identities = 98/388 (25%), Positives = 157/388 (40%), Gaps = 11/388 (2%)
 Frame = -1

Query: 1198 VSDIIVNGSWNLTD--EYRHTLLAAGIEEEDLPFIFNGEDRQIWRPATSGQFTVKAAKSL 1025
            V++ +VNG W L+D   Y+   L   I +  LP +    D+ IW  +  G  + K A S 
Sbjct: 3    VANYLVNGEWILSDFFAYKDNALVEKIHQIALP-LDETLDKLIWTDSVDGDLSNKLAFSF 61

Query: 1024 IRKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAE 845
            +      +    +LW     P  A   W+ L     T D +R R    V   CC C    
Sbjct: 62   LPGHGPTVHWAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCRKQA 121

Query: 844  ESLDHLLWHCRFAERAWSWISDIFGFRSHRNLATAYRAIRGRSGMIKELWLLALPVVRSE 665
            E+  H+   C    + W W+  +     H +    + +I   S M++ +   A+  +   
Sbjct: 122  ETSSHIFLQCPVTLQLWDWL--LKATDQHLD----FSSILNISRMVQHVMNSAIVHIMWS 175

Query: 664  LWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRL---KGYMFNSVEDLKILDFFKVRHR 494
            +W   N   F+  +        T+  ++   S  L   KG   +S++D K+   F +   
Sbjct: 176  IWLECNNKYFDGVQKPMSTLFNTILAEVLRLSFMLDIVKG--ASSMQDFKLARLFSIPF- 232

Query: 493  KVQHLLPV-ECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDHNVI--GAISMGLGKI 323
            K   + P  E IWVPP  G + + CD                R    +  GA +  +G  
Sbjct: 233  KTNRVNPCREIIWVPPHGGCMKINCDGSVVGSPSCGSIGVIFRASQTMFCGAFAQNIGYA 292

Query: 322  TNYMAEIFSILIGLEWALQWGYTKVCIRSDSLGAVLAY-CESKLPWFVLQRWREISSQYD 146
            T   AE  + +  +E A +   T + I +DS+  + A+   + +PW +  RW        
Sbjct: 293  TALEAEYSACMFAIEKAKELHLTNIWIETDSVNVIRAFHFNTGVPWKMHIRWHNCLLFCR 352

Query: 145  AIR--FDHSYRETNFAADKMAKNGCTLA 68
            +IR    H  RE N  AD +AKNG  LA
Sbjct: 353  SIRSLCTHVNREGNLVADALAKNGQGLA 380


>ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, partial [Prunus persica]
            gi|462409318|gb|EMJ14652.1| hypothetical protein
            PRUPE_ppa024777mg, partial [Prunus persica]
          Length = 465

 Score =  108 bits (270), Expect = 6e-21
 Identities = 86/348 (24%), Positives = 149/348 (42%), Gaps = 12/348 (3%)
 Frame = -1

Query: 1087 DRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLD 908
            D  +W P++SG F+ K A    R + AK+    L+W+  + P  +   WK++     T D
Sbjct: 105  DLLVWAPSSSGGFSAKDAYEFTRPKFAKVPWCKLIWKPFIEPWKSFLAWKVMHGRLLTED 164

Query: 907  QVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFGFRSHRNLATAYRAI 728
             ++ R  +           A E+++HL   C F    WS +  +FG     +  +   A+
Sbjct: 165  FLQKRAWM-----------APENINHLFSECPFTCSIWSSMFIVFGL----HFTSGPLAV 209

Query: 727  RGRSGM-------IKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYS 569
               SG+       + +LWLL    +   +W  RN+  FE K  +      T+ N +   S
Sbjct: 210  ILSSGLSAHFSPQLMDLWLLMFRTIVWLIWDLRNKLRFEEKVSTVSSNCRTIINHVPASS 269

Query: 568  VRLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXX 389
               +G++ N V DL I+    V +R   +   VE  W PP  G + +  D          
Sbjct: 270  PLARGHILNKVHDLCIIRSIGVHYRPRPNSKIVEVTWHPPCFGFVKIKIDGACKRDSGKA 329

Query: 388  XXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRSDSLGAVL 215
                  R++  +V+GA S  L   +   AE+ +++  +E A    +  + I +DSL    
Sbjct: 330  GSGGVFRNYQGHVLGAFSANLDVPSGVHAEVLAVIKAIELAWLHAWHNIWIETDSLLVTK 389

Query: 214  AYCESKL-PWFVLQRWRE--ISSQYDAIRFDHSYRETNFAADKMAKNG 80
             +    L PW +   W+   +  Q+ + +  H +RE N   D +A +G
Sbjct: 390  FFRSPHLVPWRLRVDWQNCLLRLQHMSFKISHIFREGNHDVDALANHG 437


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  104 bits (259), Expect = 1e-19
 Identities = 104/417 (24%), Positives = 174/417 (41%), Gaps = 19/417 (4%)
 Frame = -1

Query: 1279 SIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTDEYRHTLLAAGIEEEDLPFI 1100
            S ++D W+            SLN +V V+D ++NG+WN        LL   +  + +P+I
Sbjct: 1003 SFWWDCWLDKPLAMQCDHVSSLNNSV-VADFLINGNWN------ERLLRQHVPPQLVPYI 1055

Query: 1099 FNGE--------DRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARN 944
               +        D  IW P  SGQFT+ +A   IRK+  K    N++W   +   ++   
Sbjct: 1056 LQTKINYQAGNIDTSIWTPTESGQFTISSAWDSIRKKRNKDPINNIIWHKQIPFKVSFFI 1115

Query: 943  WKILRRACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFGF- 767
            W+ LR    T + ++ R    + +  C  N  ++ ++H+L +  FA+  W   S   G  
Sbjct: 1116 WRALRGKLPTNENLQ-RIGKNLSDCYCCYNKGKDDINHILINGNFAKYIWKIYSSAVGVL 1174

Query: 766  ---RSHRNLATAYRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTT 596
                + R+L   +R  +  + + K L  +    +   LW+ R    +  K  S    Q  
Sbjct: 1175 PINTTLRDLLLQWRNQQYTNEVHKLLIHILPNFICWNLWKNRCAVKYGLKNSSIYRVQYG 1234

Query: 595  VFNQIQEYSVRLKGYMFNSVE-DLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCD 419
            +F  I +        +F S+       +   +  +  QH   +   W  P+ G+  L  D
Sbjct: 1235 IFKNIMQVIT----IVFPSIPWQTSWNNLINIVEQCKQHYKILIVKWNKPDLGKYKLNTD 1290

Query: 418  XXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVC 245
                            RD+   +I A S+  G  TN  AEI + L GL+W  Q GY K+ 
Sbjct: 1291 GSALQNSGKIGGGGILRDNQGKIIYAFSLPFGFGTNNFAEIKAALHGLDWCEQHGYKKIE 1350

Query: 244  IRSDS-LGAVLAYCESKLPW---FVLQRWREISSQYDAIRFDHSYRETNFAADKMAK 86
            +  DS L          +PW    ++Q+  +I  + D  +  H YRE N  AD ++K
Sbjct: 1351 LEVDSKLLCNWINSNINIPWRYEELIQQIHQIIRKMDQFQCHHIYREANCTADLLSK 1407


>ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum
            lycopersicum]
          Length = 1246

 Score =  103 bits (257), Expect = 2e-19
 Identities = 105/414 (25%), Positives = 182/414 (43%), Gaps = 15/414 (3%)
 Frame = -1

Query: 1282 TSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWN--LTDEYRHTLLAAGIEEEDL 1109
            +S ++D W+ +  + +  +  S     +V+D I +G WN  L     + L    I +  L
Sbjct: 808  SSFWWDNWLDNENLASQSDHISSLNNGVVTDFIKDGKWNESLIRHQVNPLFIPKILQTKL 867

Query: 1108 PFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILR 929
             +    ED  IW P  +G FT+ +A   IR +        ++W   +   +A   W+ L+
Sbjct: 868  NYSTGKEDNAIWIPTETGNFTIASAWECIRNKRPIDTINTIIWHKHLPFKIAFFIWRALK 927

Query: 928  RACATLDQVRDRFKIQVINKCCLC-NNAEESLDHLLWHCRFAERAWSWISDIFGFRSHRN 752
                T +++  RF    I+KC  C +  ++ ++H+L +  FA+  W   + I G     N
Sbjct: 928  GKLPT-NELLQRFG-SAISKCYCCYSKGKDDINHILINGNFAKHIWKIHAAILGVVP-AN 984

Query: 751  LATAYRAIRGRSGMIK----ELWLLALP-VVRSELWQTRNRFVFENKKVSWGFFQTTVFN 587
                 + +  R+  +     +L +  LP V+   LW+ R    + NK  S    Q  +F 
Sbjct: 985  TTLRDQLLHWRNQQVNNEVHKLLIHILPNVICWNLWKNRCAVKYGNKSSSIHRVQYGIFK 1044

Query: 586  QI-QEYSVRLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXX 410
             + Q   +      + S  + K+++   V H K Q+ + V   W  P  G   L  D   
Sbjct: 1045 DVMQVIKIVFPSIPWQSSWN-KLINI--VEHCKQQYKI-VLVSWNKPGLGTYKLNTDGSA 1100

Query: 409  XXXXXXXXXXXXXRDHN--VIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRS 236
                         RDH   ++ A S+  G  TN +AEI + L GLEW  Q GY +V +  
Sbjct: 1101 LQNSGKIGGGGILRDHQGKIVYAFSLPFGFGTNNIAEIKAALYGLEWCDQHGYKRVELEV 1160

Query: 235  DS-LGAVLAYCESKLPWF---VLQRWREISSQYDAIRFDHSYRETNFAADKMAK 86
            DS L       ++ +PW    ++Q+ ++I+ + +  +  H YRE N  AD ++K
Sbjct: 1161 DSQLLCNWIKNKTNIPWIYEDLIQQIKQITRKIEQFQCHHIYREANITADLLSK 1214


>ref|XP_003621690.1| Cytochrome c biogenesis protein ccsA [Medicago truncatula]
            gi|355496705|gb|AES77908.1| Cytochrome c biogenesis
            protein ccsA [Medicago truncatula]
          Length = 666

 Score =  102 bits (255), Expect = 3e-19
 Identities = 92/367 (25%), Positives = 158/367 (43%), Gaps = 23/367 (6%)
 Frame = -1

Query: 1057 GQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLDQVRDRFKIQV 878
            G  T + A   I +    +     LW + + P+ +   W++L     T + +R R  + +
Sbjct: 16   GDLTNQLAYKFINETGNHVLWDKFLWNSYIPPSRSFITWRLLHNKLPTDENLRKRGCL-I 74

Query: 877  INKCCLCNNAEESLDHLLWHCRFAERAWSWI----SDIFGFRS-----HRNLATAYRAIR 725
            ++ CC C  + ES  H+ + C    R W W+      +    S      RN  +  + + 
Sbjct: 75   VSICCFCMKSAESSQHIFFECHVTSRLWDWLGKGTDKLLDCSSCLQLLIRNWGSGSKLVN 134

Query: 724  G--RSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQI-----QEYSV 566
                S +I  +W          +W  RN+  F NK  +     TT+FN I       +S+
Sbjct: 135  NILNSAIIHTIW---------SIWIERNQRCFHNKHQA----MTTLFNIILAEVKMSFSL 181

Query: 565  -RLKGYMFNSVEDLKILDFFKVRHRKVQHLLP-VECIWVPPEAGELMLCCDXXXXXXXXX 392
              +KG   ++++D K+   F +   KV+ + P ++ IW PP    + + CD         
Sbjct: 182  CMIKGN--SAMQDYKVAKLFNIPF-KVKRVTPHLDIIWKPPIGDIVKINCDGSSVGRHPC 238

Query: 391  XXXXXXXRD--HNVIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRSDSLGAV 218
                   RD  H+ +GAIS  +G  T   AE  + ++ +E A +     VC+ +DSL  V
Sbjct: 239  GSIGIVIRDSNHHFLGAISSNIGNATPLEAEFCAGMMAMEKAQEMQLMHVCLETDSLKVV 298

Query: 217  LAYCES-KLPWFVLQRWREISSQYDAIRFD--HSYRETNFAADKMAKNGCTLANAERRHY 47
             A+ +   +PW +  RW+      D+I     H  RE N  AD +AK+G  L+    + +
Sbjct: 299  NAFNKGLGVPWQMRARWQNCWDFCDSISCSCVHILREGNMVADALAKHGQGLSLYYTQLW 358

Query: 46   LGRPDFL 26
               P F+
Sbjct: 359  TSPPPFI 365


>emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score =  100 bits (249), Expect = 2e-18
 Identities = 107/451 (23%), Positives = 184/451 (40%), Gaps = 44/451 (9%)
 Frame = -1

Query: 1300 IGDGRDTSIFYDVWVGDMAITNI---LEDFSLNRAVLVSDIIV--NGSWNLTDEYRHTLL 1136
            +G G  T+ + ++W+G++ +  +   L   ++N    +S + +     W+    ++  L 
Sbjct: 930  VGKGTQTAFWQEIWIGELPLKTLFPRLYRLTINPLATISSLGIWDGHEWHWVLPWQRALR 989

Query: 1135 AAGIEE--------EDLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRK--RIAKLECTNL 986
               IEE        +D+      +D  +W P  SG F+VK+A   + K  + +  E    
Sbjct: 990  PRDIEERDALHELLKDVVLDLTNDDYLVWTPNKSGVFSVKSATLELAKCSKFSSHEIIKG 1049

Query: 985  LWRTSVHPALAARNWKILRRACATLDQVRDRFKIQVIN-------KCCLCNNAEESLDHL 827
            +WR  V   +    W       A L+++  + K+  I         C  CN   E+ +HL
Sbjct: 1050 IWRGLVPHRVEIFCW------LALLEKINTKSKLGRIGIIPIEDAVCVFCNIGLETTNHL 1103

Query: 826  LWHCRFAERAWSWISDIFGF-----RSHRNLATAYRAIRGRSGMIKELWLLALPVVRSEL 662
            L HC F+ + W+W  +I+G+     +S +N A A   I GR    K++W     ++   L
Sbjct: 1104 LLHCEFSWKLWTWWLNIWGYSWAFPKSIKN-AFAQWQIYGRGAFFKKIWHAIFFIIIWSL 1162

Query: 661  WQTRNRFVFENKKVSWGFFQTTVFNQIQEY-SVRLKGYMFNSVEDLKILDFFKVRHRK-- 491
            W+ RN  +F N   S    Q  +  ++  +      G+ F   E ++     K    K  
Sbjct: 1163 WKERNSRIFNNSNSSLEEIQDLILTRLCWWVKAWDDGFPFACSEVIRNPACLKWTQSKGC 1222

Query: 490  ----VQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDHN--VIGAISMGLG 329
                +     ++  W PP +  L    D                RD N   +   S  + 
Sbjct: 1223 NFGTIGPTNLLKAAWSPPPSNHLQWNVDASFKPGLEHAAVGGVLRDENGCFVCLFSSPIP 1282

Query: 328  KITNYMAEIFSILIGLEWALQWGYTK---VCIRSDSLGAVLAYC--ESKLPW---FVLQR 173
            ++    AEI++I   L+ +L     K   + I SDS  AV  +C  +   PW   F++  
Sbjct: 1283 RLEINSAEIYAIFRALKISLSSDRIKAQHLIIVSDSANAV-RWCNQDEGGPWNLNFMINY 1341

Query: 172  WREISSQYDAIRFDHSYRETNFAADKMAKNG 80
             R     + A+   H  RETN  AD +AK G
Sbjct: 1342 IRNARKAWLALTIIHKGRETNGVADTLAKQG 1372


>ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 955

 Score = 99.8 bits (247), Expect = 3e-18
 Identities = 96/413 (23%), Positives = 167/413 (40%), Gaps = 14/413 (3%)
 Frame = -1

Query: 1282 TSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTDEYRHT--LLAAGIEEEDL 1109
            +S ++D W+G+ A+ N + + S    + VSD + NG WN     +H    +   I +   
Sbjct: 502  SSFWWDNWLGNEALANQVINISSLNNIHVSDFLTNGIWNERYVRQHVPPTMVPDIMQTQF 561

Query: 1108 PFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILR 929
             +  N ED  IW P  +G+FT+ +A  +IRK+ +     N +W   +   ++   W+ LR
Sbjct: 562  KYNINIEDTAIWTPEENGKFTIASAWEVIRKKKSTDIINNSVWHKHIPFKISFFIWRALR 621

Query: 928  RACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFGFR----S 761
                T D ++ +F     +  C      + ++H+L    FA   W + +  FG       
Sbjct: 622  GKLPTYDYLQ-KFGSNATDCYCCNRKGIDDINHILITGNFANYIWKYYAPTFGITQINID 680

Query: 760  HRNLATAYRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQI 581
             R+L   +  +   + + K L  +    +   LW+      + NK  S    Q  +F  +
Sbjct: 681  LRSLLLQWTNLPSSNQVYKLLISILPNFICWHLWKNMCAVKYGNKISSIQRVQYGIFKDV 740

Query: 580  QEYSVRLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXX 401
             +    +K    N            +  +  Q L  +   W  P+ G   L  D      
Sbjct: 741  MQ---TIKIVFPNIPWQHSWYRLINLVEQCQQQLKVIMVSWRKPQFGIYKLNTDGSALPE 797

Query: 400  XXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRSDSL 227
                      RD+   +  A S+  G  TN +AE+ +   GL+W  Q GY  + +  DS 
Sbjct: 798  SGKIGGGGILRDYTGKLHYAFSIPFGLGTNNIAEMEAARYGLDWCEQHGYKSILLEVDS- 856

Query: 226  GAVLAYCESK---LPW---FVLQRWREISSQYDAIRFDHSYRETNFAADKMAK 86
              +L    S    +PW     ++  ++I  + D     H YRE N  AD ++K
Sbjct: 857  -EILQKWISNTIAIPWRYQQTIEHIQDIGRKMDHFECQHVYREVNGTADLLSK 908


>ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 409

 Score = 99.4 bits (246), Expect = 3e-18
 Identities = 89/382 (23%), Positives = 161/382 (42%), Gaps = 13/382 (3%)
 Frame = -1

Query: 1177 GSWN--LTDEYRHTLLAAGIEEEDLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAK 1004
            G WN  +  ++    +   I +  +  + +  D+ IW P++SG+   K A   +R R+  
Sbjct: 2    GPWNFPMLLQFHFLDICKLINDVPISIVPDMSDKLIWVPSSSGELLAKEAFQFMRPRLPS 61

Query: 1003 LECTNLLWRTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLC-NNAEESLDHL 827
            L+ + L+W   + P ++  +WK+LR    + D ++ R  I + ++C LC  + E S  H+
Sbjct: 62   LDWSKLIWSKFIIPRISLHSWKVLRGRVLSEDLLQRR-GIVLASRCVLCGRDCESSFPHI 120

Query: 826  LWHCRFAERAWSWISDIFGFRS-HRNLA-TAYRAIRGRSGMIKELWLLALPVVRSELWQT 653
               C F    W+  + +F   S  +NL    Y    GRS  +KE+WL+        + + 
Sbjct: 121  FLTCSFVASLWNNWACLFELGSLPQNLVDLIYYGGVGRSHQLKEIWLICYTTTLWFIGKA 180

Query: 652  RNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKVRHRKVQHLLP 473
            RN+   +N  +        +   ++  S    G M NS+  L++L  F +     Q L  
Sbjct: 181  RNKIRHDNCTIVVDAVHQLIMGHVKAVSKLASGCMSNSLTKLRVLKKFGLLCHPCQALRI 240

Query: 472  VECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIF 299
             +  W PP  G + +  D                RD   + +GA +  L    +  AE+ 
Sbjct: 241  TKVNWHPPLFGWIKVNTDGAWQKTTGKSGYGGIFRDFHGSFLGAFASNLEIPNSVDAEVM 300

Query: 298  SIL--IGLEWALQWGYTKVCIRSDSLGAVLAYCESK--LPWFVLQRWREISSQYDAIRF- 134
            +++  I L W   W +  + +  DS   VL +      +PW +         +   + F 
Sbjct: 301  AVIQAIELAWVRDWKH--ILLEVDS-AIVLNFLHDPHLVPWRLRVACGNCLHRISQMNFR 357

Query: 133  -DHSYRETNFAADKMAKNGCTL 71
              H +RE N  AD +   G ++
Sbjct: 358  SSHIFREGNQVADTLVNMGLSM 379


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 108/451 (23%), Positives = 185/451 (41%), Gaps = 25/451 (5%)
 Frame = -1

Query: 1336 VYSLFEDNTKILIGDGRDTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTD 1157
            + S+ E N +  IG G +   ++D W+G+  + N  + F+ + A  VSD  +N SWN+  
Sbjct: 1759 ISSITEQNIRWRIGHG-ELFFWHDCWMGEEPLVNRNQAFASSMAQ-VSDFFLNNSWNV-- 1814

Query: 1156 EYRHTLLAAGIEEE--DLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLL 983
            E   T+L   + EE   +P   +  D+  W    +G F+ K+A  LIR R  +    N +
Sbjct: 1815 EKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFI 1874

Query: 982  WRTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAE 803
            W  SV    +   W++L        +++ + K   +   C C  +EESL H++W    A 
Sbjct: 1875 WHKSVPLTTSFFLWRLLHDWIPV--ELKMKTKGFQLASRCRCCKSEESLMHVMWKNPVAN 1932

Query: 802  RAWSWISDIFGFR-------SHRNLATAYRAIRGRSGMIKE---------LWLLALPVVR 671
            + WS+ + +F  +       +    A  Y     + G I+          LW+       
Sbjct: 1933 QVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKH 1992

Query: 670  SELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKVRHRK 491
              L    NR V++  K+    FQ     Q+Q++  +          D +I   + +  + 
Sbjct: 1993 RNLGMYPNRVVWKILKLLHQLFQG---KQLQKWQWQ---------GDKQIAQEWGIILKA 2040

Query: 490  VQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITN 317
                 P    W+ P  GEL L  D                RDH  ++I   S   G   +
Sbjct: 2041 DAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDS 2100

Query: 316  YMAEIFSILIGLEWALQWGYTKVCIRSDSLGAVLAYCE-----SKLPWFVLQRWREISSQ 152
              AE+ ++  GL   ++   +++ I  D+  AV    E     S+  + +    R +S  
Sbjct: 2101 LQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSG- 2159

Query: 151  YDAIRFDHSYRETNFAADKMAKNGCTLANAE 59
              + R  H +RE N AAD ++  G T  N +
Sbjct: 2160 -ISFRISHIFREGNQAADHLSNQGHTHQNLQ 2189


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 96.7 bits (239), Expect = 2e-17
 Identities = 106/450 (23%), Positives = 183/450 (40%), Gaps = 24/450 (5%)
 Frame = -1

Query: 1336 VYSLFEDNTKILIGDGRDTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTD 1157
            + S+ E N +  +G G+    ++D W+G+  +    ++F+ + A  VSD  +N SW++ +
Sbjct: 3047 ISSITEQNIRWRVGHGK-LFFWHDCWMGEEPLVIRNQEFASSMAQ-VSDFFLNNSWDI-E 3103

Query: 1156 EYRHTLLAAGIEE-EDLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLW 980
            + +  L    +EE   +P   +  DR  W P  +G F+ K+A  L R+R       N +W
Sbjct: 3104 KLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTYNYIW 3163

Query: 979  RTSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAER 800
              SV    +   W++L        +++ + K   +   C C  +EESL H++W    A +
Sbjct: 3164 HKSVPLTTSFFLWRLLHDWVPV--ELKMKSKGFQLASRCRCCKSEESLMHVMWDNPVANQ 3221

Query: 799  AWSWISDIFGFR-------SHRNLATAYRAIRGRSGMIKE---------LWLLALPVVRS 668
             WS+ + +F          +H   A  Y     + G I+          LW+        
Sbjct: 3222 VWSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHR 3281

Query: 667  ELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKVRHRKV 488
             L    NR V++  K+    FQ     Q+Q++  +          D +I   + +  + V
Sbjct: 3282 NLGMYPNRIVWKILKLIHQLFQG---KQLQKWQWQ---------GDKQIAQEWGIILKAV 3329

Query: 487  QHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNY 314
                P    W  P  GE  L  D                RDH  ++I   S   G   + 
Sbjct: 3330 APSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSL 3389

Query: 313  MAEIFSILIGLEWALQWGYTKVCIRSDSLGAVLAYCE-----SKLPWFVLQRWREISSQY 149
             AE+ ++  GL   +    T++ I  D+  AV    E     S+  + +    R +S   
Sbjct: 3390 QAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSG-- 3447

Query: 148  DAIRFDHSYRETNFAADKMAKNGCTLANAE 59
             + R  H +RE N AAD ++  G T  N +
Sbjct: 3448 ISFRISHIFREGNQAADHLSNQGYTHQNLQ 3477



 Score = 81.3 bits (199), Expect = 1e-12
 Identities = 106/434 (24%), Positives = 181/434 (41%), Gaps = 20/434 (4%)
 Frame = -1

Query: 1315 NTKILIGDGRDTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLT--DEYRHT 1142
            N +  IG G +   ++D W+GD  +  +   F  N    V        W++   + Y  T
Sbjct: 1260 NIRWRIGKG-ELFFWHDCWMGDQPLATLFPSFH-NDMSHVHKFYNGDEWDIVKLNSYLPT 1317

Query: 1141 LLAAGIEEEDLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHP 962
             L   I +  +PF  + ED   W   ++G+F+  +A  +IR+R       +  W  S+  
Sbjct: 1318 SLVDEILQ--IPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPL 1375

Query: 961  ALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWIS 782
            +++   W++L        +++D+  I + +KC  C + EESL H+LW    A++ W++ +
Sbjct: 1376 SISFFLWRVLNNWIPVELRMKDK-GIHLASKCVCCRS-EESLIHVLWENPVAKQVWNFFA 1433

Query: 781  DIFGFR-------SHRNLATAYRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFEN-- 629
              F          S    A  +     R+G I+   +L    +   LW  RN     +  
Sbjct: 1434 KSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIR---ILIPLFICWFLWLERNDAKHRHMG 1490

Query: 628  ---KKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIW 458
                +V W   +  + NQ+   S+ LK + +    D+  +  FK   +  Q   P    W
Sbjct: 1491 MYPNRVIWRIMK--LLNQLHAGSL-LKQWQWKGDTDIATMWGFKYPPKYCQS--PQIISW 1545

Query: 457  VPPEAGELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIG 284
            + P  GE  L  D                RDH   +  A S  LG + +  AE+ ++L G
Sbjct: 1546 IKPFIGEYKLNVD-GSSKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRG 1604

Query: 283  LEWALQWGYTKVCIRSDSLGAVLAYCESKLP----WFVLQRWREISSQYDAIRFDHSYRE 116
            L    +   T + I  D+L AV    +S+       ++L+  R     + + R  H YRE
Sbjct: 1605 LLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSF-SYRISHIYRE 1663

Query: 115  TNFAADKMAKNGCT 74
             N AAD ++  G T
Sbjct: 1664 GNQAADFLSNKGQT 1677


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 94.4 bits (233), Expect = 1e-16
 Identities = 115/454 (25%), Positives = 192/454 (42%), Gaps = 23/454 (5%)
 Frame = -1

Query: 1300 IGDGRDTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTDEYRHTLLAAGIE 1121
            IG G D   ++D W+GD  + N    FS +  + V+    + +W++  +   T +   I 
Sbjct: 892  IGKG-DIFFWHDAWMGDEPLVNSFPSFSQSM-MKVNYFFNDDAWDV--DKLKTFIPNAIV 947

Query: 1120 EEDL--PFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAAR 947
            EE L  P     ED   W    +G F++K+A  L+R+R        L+W  S+   ++  
Sbjct: 948  EEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFF 1007

Query: 946  NWKILRRACATLDQVRDRFK-IQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFG 770
             W+ L        +VR + K IQ+ +KC LC  +EESL H+LW    A++ W++ S  F 
Sbjct: 1008 LWRTLHNWLPV--EVRMKAKGIQLASKC-LCCKSEESLLHVLWESPVAQQVWNYFSKFFQ 1064

Query: 769  FRSH--RNL-----ATAYRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWG 611
               H  +N+     +  Y     + G I+ L LL    +   +W  RN    + K    G
Sbjct: 1065 IYVHNPQNILQILNSWYYSGDFTKPGHIRTLILL---FIFWFVWVERN----DAKHRDLG 1117

Query: 610  FFQTTVFNQIQEYSVRL-KGYMFNSVE---DLKILDFFKVRHRKVQHLLPVECIWVPPEA 443
             +   +  +I +   +L +G +    +   DL I   +     + +   P    W+ P  
Sbjct: 1118 MYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLI 1177

Query: 442  GELMLCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWAL 269
            GEL L  D                RDH  N+I   S   G   +  AE+ ++  GL   +
Sbjct: 1178 GELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCM 1237

Query: 268  QWGYTKVCIRSDSLGAVLAYCESKLPWFVLQRWREI---SSQYDAIRFDHSYRETNFAAD 98
            ++  ++V I  D+   +          + +Q   E      Q  ++R  H +RE N AAD
Sbjct: 1238 EYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAAD 1297

Query: 97   KMAKNGCTLAN----AERRHYLGRPDFLNSIEFP 8
             ++K+G T  N     E +  L     +N +E P
Sbjct: 1298 FLSKHGHTHQNLHVFTEAQGELRGRTLVNRVEHP 1331


>emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1389

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 110/470 (23%), Positives = 192/470 (40%), Gaps = 31/470 (6%)
 Frame = -1

Query: 1396 YLKRTHKIFCASSILPGIKWVYSL-----FEDNTKILIGDGRDTSIFYDVWVGDMAITNI 1232
            YLK  + + C        +W   L     F    + LIGDG+D S + D W+    + + 
Sbjct: 905  YLKEQNLLVCKIPSNASWQWKNLLRHRNFFSKGLRWLIGDGQDISFWTDNWIFQYPLNSK 964

Query: 1231 LEDFSLNRAVLVSDIIVN-GSWNLTDEYRHTLLAAGIEEEDLPFIF----NGEDRQIWRP 1067
                  +  + V++     G W++      TL+   I +  +  +F    + +DR +W  
Sbjct: 965  YVPTVGSENIKVAECFNGLGGWDIPKLL--TLVPPNIVKA-ISSVFIPSSSQQDRLLWGL 1021

Query: 1066 ATSGQFTVKAAKSLIRK----RIAKLECTNLLWRTSVHPALAARNWKILRRACATLDQVR 899
              +GQ++VK+  SLIR+     I K+E  N +W     P +    WK      AT  ++ 
Sbjct: 1022 TPTGQYSVKSGASLIREVNGGTIEKVEF-NWIWGIHAPPKIKNFLWKACNDGLATTSRL- 1079

Query: 898  DRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFGFRSHRNLATAYRAIRGR 719
            +R  I V   CC C+   E++ HL + C F    +S + D F + ++ +  +  +    R
Sbjct: 1080 ERSHIFVPQNCCFCDCPSETICHLCFQCPFTLDIYSHLEDKFQWPAYPSWFSTLQLSSFR 1139

Query: 718  SGM------IKELWLLALPVVRSELWQTRNRFVFENKKVSW---GFFQTTVFNQIQEYSV 566
            S +      +   +L  L +V   +W  RN+ +F N+  S+    F   +   + ++ ++
Sbjct: 1140 SVLEACHINLTLEYLTKLSIVWWHVWYFRNKLIFNNESTSFSQASFIIHSFMGKWEKANL 1199

Query: 565  RLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXX 386
             +  +     +D K+     VR  K         IW PP    L +  D           
Sbjct: 1200 EIPSFNTPLPKDCKL----PVRSGK-------NLIWSPPNEDVLKVNFDGSKLDNGQAAY 1248

Query: 385  XXXXXRDH-NVIGAISMGLGKITN-YMAEIFSILIGLEWA--LQWGYTKVCIRSDSLGAV 218
                   +  V+ A +  LG   +  MAE   +L G++ A  LQ    K+    D++  +
Sbjct: 1249 GFVIRNSNGEVLMARAKALGVYPSILMAEAMGLLEGIKGAISLQNWSRKIIFEGDNIAVI 1308

Query: 217  LAYCESKL-PWFVLQRWRE---ISSQYDAIRFDHSYRETNFAADKMAKNG 80
             A   S   PW +     +   +   +  ++F H YRE N  AD MA  G
Sbjct: 1309 NAMSPSATGPWTIANIILDAGALLGHFQEVKFQHCYREANRLADFMAHKG 1358


>ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 1010

 Score = 92.4 bits (228), Expect = 4e-16
 Identities = 94/411 (22%), Positives = 167/411 (40%), Gaps = 13/411 (3%)
 Frame = -1

Query: 1279 SIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTD--EYRHTLLAAGIEEEDLP 1106
            S ++D W+GD A+    ++ S    V ++++  NG W      +    LL   I +  + 
Sbjct: 558  SFWWDNWIGDGAVATKCDNISSLNNVKIAELTENGKWKERQVRQLVPPLLVPNILDTVIQ 617

Query: 1105 FIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRR 926
                  D  IW     G+FT+ +A ++IRK+         +W  ++   ++   WK LR 
Sbjct: 618  AKNEKSDYAIWTLEDKGKFTIHSAWNIIRKKNISDPINQFIWHKNIPFKVSFFIWKALRN 677

Query: 925  ACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFG-FRSHRNL 749
               T D + + F +      C     ++ + H+L    FA+  W   +   G  + H NL
Sbjct: 678  KLPTNDSLMN-FGMDEQECYCCFRKGKDDILHILITGNFAKYIWKIHATRLGVHQDHANL 736

Query: 748  ATA---YRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQIQ 578
             +    +R I   + + K L+ +    +   LW+ R      +K+ S    Q  +F    
Sbjct: 737  RSLLLHWRNIPVHNQVQKLLYQILPNFICWNLWKNRCAVKHGSKQCSTQRVQYAIFKDTM 796

Query: 577  EYSVRLKGYMFNSVEDLKILD-FFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXX 401
            +  +      F ++     LD    +     Q +   + +W  P  G   L  D      
Sbjct: 797  QAVM----VAFPNISRQNNLDMLINLAENCQQQVKVTKVMWEKPSLGIFKLNTDGSAIHN 852

Query: 400  XXXXXXXXXXRDHN--VIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRSDS- 230
                      RDHN  +I A ++  G  TN  AE+ + L GL W  Q GY ++ +  DS 
Sbjct: 853  INKIGGGGILRDHNGKLIYAFAIPFGIGTNNFAEMKAALYGLSWCEQHGYKRIILEVDSE 912

Query: 229  LGAVLAYCESKLPWF---VLQRWREISSQYDAIRFDHSYRETNFAADKMAK 86
            L +        +PW     + + ++I ++ +  +  H +RE N  AD +AK
Sbjct: 913  LLSKWIDNSINIPWRCQPTIYQIQDIVNKMEYFQCQHIFREANGTADLLAK 963


>ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            tuberosum]
          Length = 1035

 Score = 91.7 bits (226), Expect = 7e-16
 Identities = 107/437 (24%), Positives = 178/437 (40%), Gaps = 26/437 (5%)
 Frame = -1

Query: 1285 DTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTDEYRHTLLAAGIEEEDLP 1106
            ++S ++D W    A+     D +    + V   I N  W+       T L   + EE + 
Sbjct: 393  NSSFWFDNWTRQGALYYTEGDCAQEEELEVQYFITNDGWD------ETKLKDLLSEEMVE 446

Query: 1105 FIF---------NGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALA 953
             I           G D+  W    +G FTVK+A   IR R  + E    +W   +   ++
Sbjct: 447  HIILNIRPKTSEEGIDKAWWCGNLTGLFTVKSAYHRIRGRKEEEEWRRYMWIKGMPIKIS 506

Query: 952  ARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAE-ESLDHLLWHCRFAERAWSWISDI 776
               W++ RR  AT D ++ R KI V++KC  C   E E++ HLL     A++ W   +  
Sbjct: 507  FFLWRVWRRKIATYDNLK-RMKIPVVSKCYCCKEGEMETMTHLLLTAPIAQKLWKQFASY 565

Query: 775  FG-FRSHRNLATA------YRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVS 617
             G   +  NL         Y+A    S  + ++    L V+  ELW+ RN +    +   
Sbjct: 566  AGIIINGLNLQQLIFKWWDYKA----SNKLSQILKAVLAVIMWELWKRRNSYRHGKETTY 621

Query: 616  WGFFQTTVFNQIQEYSVR---LKGYMFNSVEDLKILDFFK-VRHRKVQHLLPVECIWVPP 449
               +        Q  +++   +KG  ++  + + +L  +K   H KV         W  P
Sbjct: 622  NNMYYQCQLILYQLVTIKFPWIKGLTYHWPQVVGMLQNYKPPLHYKVVR-------WRKP 674

Query: 448  EAGELMLCCDXXXXXXXXXXXXXXXXRDHN--VIGAISMGLGKITNYMAEIFSILIGLEW 275
              G +    D                RD N  ++ A +  +G+ TN  AE  ++   L++
Sbjct: 675  SEGWVTCNTDGASKGNPRMSSYGYCIRDKNGDLLYAEAHNIGETTNMEAEATTVWKALQF 734

Query: 274  ALQWGYTKVCIRSDSLGAVLAYCES-KLPWFVLQRWREISS--QYDAIRFDHSYRETNFA 104
              + G  KV + +DSL        S K+PW ++++  EI    Q   ++  H YRE N  
Sbjct: 735  CYENGLRKVRLETDSLALQNMITRSWKIPWELVEKLEEIHEIMQQIDVQVCHVYREVNQL 794

Query: 103  ADKMAKNGCTLANAERR 53
            AD +A    T  N E +
Sbjct: 795  ADFIAN---TTINTEHK 808


>ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 601

 Score = 89.0 bits (219), Expect = 5e-15
 Identities = 96/421 (22%), Positives = 176/421 (41%), Gaps = 23/421 (5%)
 Frame = -1

Query: 1279 SIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWN--LTDEYRHTLLAAGIEEEDLP 1106
            S + D W+ + ++ N  +  S      + D  ++G WN  L  ++   LL   I +    
Sbjct: 149  SFWLDNWLENDSLANHCDHISSLNKSRLDDFWIDGKWNESLIRQHVPPLLIPIILQTLFN 208

Query: 1105 FIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRR 926
            +    ED  IW P  + +FT+ +A  +IRK+ +     N++W   +   ++   W  L  
Sbjct: 209  YNEGKEDTAIWIPDETVKFTISSAWKVIRKKRSHDPINNIIWHKHIPFKISFFIWGALTG 268

Query: 925  ACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAW----SWISDIFGFRSH 758
               T +++  R    +++  C  +  ++ ++H+L    FA   W    S +  +    + 
Sbjct: 269  KLPT-NEILQRLGRDIVDCYCCYSKVKDDINHILLTGNFANYIWKKHASTVGALHVNTNM 327

Query: 757  RNLATAYRAIRGRSGMIKELWLLALP-VVRSELWQTRNRFVFENK-----KVSWGFFQTT 596
            R+    +R+++  +  + +L +  LP ++   LW+ R    +  K     +V +G F+ T
Sbjct: 328  RSQLLYWRSLQ-TNNEVHKLLIHTLPNIICWNLWENRCAVKYGGKQSSIYRVQYGIFKDT 386

Query: 595  VFNQIQEYS----VRLKGYMFNSVEDL-KILDFFKVRHRKVQHLLPVECIWVPPEAGELM 431
            +     EYS        G++ N ++   +    F VR             W+ P  G   
Sbjct: 387  MQIIKLEYSNIPWQYSWGHLINFIDQCQQQYKIFMVR-------------WIKPTIGRYK 433

Query: 430  LCCDXXXXXXXXXXXXXXXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWALQWGY 257
            L  D                RDH  ++I A +      TN +A++ + L GLEW  Q GY
Sbjct: 434  LNTDGSCLQENGNIEGGGILRDHQGSIIFAFASPFAFGTNNIAKLKAALYGLEWCEQHGY 493

Query: 256  TKVCIRSDS-LGAVLAYCESKLPW---FVLQRWREISSQYDAIRFDHSYRETNFAADKMA 89
              + +  DS L +       ++PW     +Q   EI+ +    +  H YRE N  AD +A
Sbjct: 494  KDIVLEIDSELLSKWISNTIQIPWRCQQYVQHIHEITKKLHHFQCQHIYREENSTADLLA 553

Query: 88   K 86
            K
Sbjct: 554  K 554


>ref|XP_004228797.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 389

 Score = 87.4 bits (215), Expect = 1e-14
 Identities = 87/344 (25%), Positives = 142/344 (41%), Gaps = 10/344 (2%)
 Frame = -1

Query: 1087 DRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWRTSVHPALAARNWKILRRACATLD 908
            D   W P   GQFT+ +A  +IRK+       N +W  +V    +   W+ LR    T +
Sbjct: 3    DSAYWMPDDKGQFTIFSAWDIIRKKKDPDPIHNCVWHKNVPFKTSFFIWRALRSKLPTNE 62

Query: 907  QVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERAWSWISDIFGF----RSHRNLATA 740
             +    K ++   CC     ++ L H+L    FA+  W   +   G      + R+   +
Sbjct: 63   NLLKFGKEELECYCCY-RKGKDDLKHILITGNFAKYIWKIHTKRLGIAIVNTNLRSTLLS 121

Query: 739  YRAIRGRSGMIKELWLLALPVVRSELWQTRNRFVFENKKVSWGFFQTTVFNQIQEYSVRL 560
            +R +   + + K +  +   ++   LW+ R    + NK  S    ++ +F  I +    +
Sbjct: 122  WRRLTSYNEVHKLILHILPNIICWNLWKNRCSAKYGNKPSSIYRVESGIFKDIMQI---I 178

Query: 559  KGYMFNSVEDLKILDFFKVRHRKVQHLLPVECIWVPPEAGELMLCCDXXXXXXXXXXXXX 380
            K    N          F +  +  QHL      W  P  G   L  D             
Sbjct: 179  KAVYPNIPWQSSWERLFNLVEQCQQHLKVTMVNWERPPEGIHKLNTDGSAKHNTGKIGGG 238

Query: 379  XXXRDH--NVIGAISMGLGKITNYMAEIFSILIGLEWALQWGYTKVCIRSDS-LGAVLAY 209
               RDH   +I A ++ LG  TN  AEI + L GL+W  Q G+ K+ +  DS L      
Sbjct: 239  GILRDHQGKLIYAFAIPLGFGTNNFAEIQAALHGLQWCQQHGFEKIILEVDSELLHKWII 298

Query: 208  CESKLPW---FVLQRWREISSQYDAIRFDHSYRETNFAADKMAK 86
             +S +PW     +Q+ + IS++ +  +  H YRE N  AD +AK
Sbjct: 299  NKSSVPWRCLHYIQQIQNISNKMEVFQCKHIYREANGTADLLAK 342


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 87.0 bits (214), Expect = 2e-14
 Identities = 100/433 (23%), Positives = 178/433 (41%), Gaps = 16/433 (3%)
 Frame = -1

Query: 1330 SLFEDNTKILIGDGRDTSIFYDVWVGDMAITNILEDFSLNRAVLVSDIIVNGSWNLTDEY 1151
            ++ E N +  +G G+    ++D W+G+  +T+  ++ SL+  V V D  +N SW++  E 
Sbjct: 1796 AITEQNMRWRVGQGK-LFFWHDCWMGETPLTSSNQELSLSM-VQVCDFFMNNSWDI--EK 1851

Query: 1150 RHTLLAAGIEEE--DLPFIFNGEDRQIWRPATSGQFTVKAAKSLIRKRIAKLECTNLLWR 977
              T+L   + +E   +P     +D   W P  +G+F+ K+A  LIRKR       N +W 
Sbjct: 1852 LKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWH 1911

Query: 976  TSVHPALAARNWKILRRACATLDQVRDRFKIQVINKCCLCNNAEESLDHLLWHCRFAERA 797
             +V   ++   W++L        +++ + K   +   C C  +EES+ H++W    A + 
Sbjct: 1912 KTVPLTISFFLWRLLHDWIPV--ELKMKSKGFQLASRCRCCKSEESIMHVMWDNPVATQV 1969

Query: 796  WSWISDIFGFRSHRNL-------ATAYRAIRGRSGMIKELWLLALPVVRS-ELWQTRNRF 641
            W++ S  F               A  Y     + G I+ L    +P+     LW  RN  
Sbjct: 1970 WNYFSKFFQILVINPCTINQILGAWFYSGDYCKPGHIRTL----VPIFTLWFLWVERNDA 2025

Query: 640  VFENKKVSWGFFQTTVFNQIQEYSVRLKGYMFNSVEDLKILDFFKVRHRKVQHLLPVECI 461
               N  +        +   IQ+ S+  +   +    D +I   + +  +      P    
Sbjct: 2026 KHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFP 2085

Query: 460  WVPPEAGELMLCCDXXXXXXXXXXXXXXXXRDHN--VIGAISMGLGKITNYMAEIFSILI 287
            W  P  GE  L  D                RDH   ++   S  LG   +  AE+ ++  
Sbjct: 2086 WHKPSIGEFKLNVD-GSAKLSQNAAGGGVLRDHAGVMVFGFSENLGIQNSLQAELLALYR 2144

Query: 286  GLEWALQWGYTKVCIRSDSLGAV-LAYCESKLPW---FVLQRWREISSQYDAIRFDHSYR 119
            GL     +   ++ I  D+   + L     + P    ++L   R++ S + + R  H +R
Sbjct: 2145 GLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHF-SFRLSHIFR 2203

Query: 118  ETNFAADKMAKNG 80
            E N AAD +A  G
Sbjct: 2204 EGNQAADFLANRG 2216


Top