BLASTX nr result

ID: Papaver25_contig00011206 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00011206
         (1463 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   134   1e-34
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   133   2e-30
ref|XP_004295654.1| PREDICTED: uncharacterized protein LOC101314...   106   2e-26
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...   118   6e-24
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    99   2e-20
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    97   6e-20
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    94   1e-18
ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein A...   100   2e-18
gb|ABD28730.1| Ribonuclease H [Medicago truncatula]                    99   5e-18
ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, part...    98   8e-18
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    84   5e-16
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...    79   2e-15
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    90   3e-15
ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601...    87   1e-14
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    87   2e-14
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    67   2e-14
gb|AGV40503.1| hypothetical protein [Phaseolus vulgaris]               87   2e-14
ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein A...    86   3e-14
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    84   1e-13
ref|XP_004308354.1| PREDICTED: putative ribonuclease H protein A...    84   2e-13

>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  134 bits (337), Expect(2) = 1e-34
 Identities = 115/452 (25%), Positives = 193/452 (42%), Gaps = 11/452 (2%)
 Frame = +3

Query: 141  YLRAKFFKNSGQLVGYVKSSILPGLKWVYNEVNSNTKKLIGDGRATSLYFDYWCGDTCIA 320
            ++R +F K       Y  SSI PG++  +  V +NT+ L+G G   S + D + G   I 
Sbjct: 388  FIRNRFSKRRS----YAPSSIWPGVRKFWGLVQNNTRWLVGTGDKISFWRDNFLGRPLIE 443

Query: 321  NVMGHENL-DRNLLVANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGG--DDLRVWKP 491
                H  L D + LV++ I N +W L  ++     A    I  +P+ +    +D  +W+ 
Sbjct: 444  FFGNHGALNDNSSLVSDYIDNGSWVLPPLLQLNLSAVCNLICQVPISINPSMEDKLIWQA 503

Query: 492  DYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGACDKVR--SRFKY 665
               G L+ + +   + +  P +     L    + P +    WK++ G         R   
Sbjct: 504  SSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRRGV 563

Query: 666  HVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTT---TYKMAKGKSV 836
             ++++C  C +  ESLDHI   C F +  W     +F I    N      +  +A  +S 
Sbjct: 564  ALVSRCEFCGNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPNTIAEVFSLGLAMDRSP 623

Query: 837  MFKEPWLLAVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLK-GYMYNSQ 1013
              KE WL+    I   +W  RN   ++++  +      + +S+    SSRL  G+M+N+ 
Sbjct: 624  QLKELWLICFTSILWYIWHARNQIRFDSRTFSV-AGVCRLVSRHIQASSRLATGHMHNTI 682

Query: 1014 DDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNT 1193
             DL +L  FG   R  +        W PP    + +  DG  +   G  G G V R    
Sbjct: 683  HDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFRYYKG 742

Query: 1194 NVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPW 1373
              +GA    + I ++  A++  VI  +E A       + +  D  S +L Y  +   VPW
Sbjct: 743  QFVGAFASHIDIPSSIAAKVMVVITAIELAWVRDWKHVWLEVD-FSTVLDYIRSPSLVPW 801

Query: 1374 FMRSRWVVVKARYGSIRF--VHTYREANFSAE 1463
             +R RW+    R  ++ F   H +RE N  A+
Sbjct: 802  QLRVRWLNCLYRISTMTFKSSHIFREGNRVAD 833



 Score = 41.2 bits (95), Expect(2) = 1e-34
 Identities = 15/39 (38%), Positives = 27/39 (69%)
 Frame = +2

Query: 5   MVAYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSS 121
           +V++  C +P  EGGLG+ ++  +N ++L+K CW I +S
Sbjct: 343 LVSWTSCCAPIDEGGLGLKKLDVLNSSLLLKRCWEIFTS 381


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 751

 Score =  133 bits (334), Expect(2) = 2e-30
 Identities = 105/408 (25%), Positives = 179/408 (43%), Gaps = 9/408 (2%)
 Frame = +3

Query: 186  YVKSSILPGLKWVYNEVNSNTKKLIGDGRATSLYFDYWCGDTCIANV-MGHENLDRNLLV 362
            Y  SS+  GLK V   +  +++ +IGDG +   + D W   + I  + MG  +   N  V
Sbjct: 343  YFTSSVWHGLKRVLPLLFEHSRWIIGDGNSILFWSDKWLHSSIIQQLNMGSLSHLLNSRV 402

Query: 363  ANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGD-DLRVWKPDYKGVLSVRSSKTLIH 539
            A+ I +  W L    + +F     +I  +P+P   + D+ +W+    G+ S      L+ 
Sbjct: 403  ADFIWDQQWALPSHFSNLFPDCAKQILEIPLPNTPESDILIWEHSSSGIFSFSDGYELVR 462

Query: 540  KRYPNLEGENLLRKPSVHPSLDARNWKI--IPGACDKVRSRFKYHVINKCCLCN-SEEES 710
              +  L+  + +    + P      W+I  +    D    R     ++ C LC+ S  E 
Sbjct: 463  PYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQLQRRGIPFVSVCQLCSFSHTED 522

Query: 711  LDHIMWSCDFFSKAWLWISDMFGIS--PHQNLTTTYKMAKGK--SVMFKEPWLLAVLVIR 878
            + H+  +C F    W W++  FG S     +L   +    GK  S   K  W  + L   
Sbjct: 523  IPHLFVNCSFAQHIWQWLAYYFGTSLPSSGSLNDLWSSVTGKAFSPQLKNIWFASCLFAL 582

Query: 879  SEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRK 1058
              +W + N   ++N++ +  +  ++++     Y +           D +VL+  GV    
Sbjct: 583  MAIWKSHNKLRFDNKQPSL-MRVFRSVKAWVRYIAPYTPGCVRGVLDSKVLSSMGVILVL 641

Query: 1059 VKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTN 1238
               S  +   W PP    L L  +G ++ NPG AG G V R++   ++G    GL  QT 
Sbjct: 642  KCQSALRIVLWHPPLIPWLKLNTNGFSKGNPGLAGCGGVFRDSFGRLIGGYCQGLGTQTT 701

Query: 1239 FLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMR 1382
            F  E+  VILG+E+A  FG   I + +DS + +   SS++ A PW  R
Sbjct: 702  FFVELMTVILGVEFAFHFGWHHIWLESDSTTILQCISSSSFAPPWSQR 749



 Score = 28.1 bits (61), Expect(2) = 2e-30
 Identities = 10/43 (23%), Positives = 21/43 (48%)
 Frame = +2

Query: 8   VAYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAWG 136
           +++ +  +P  E GL +  ++ +  A L+ L W       +WG
Sbjct: 284 ISWQQVCTPRNEAGLDLRNLKALYTAGLISLAWQTLLQSSSWG 326


>ref|XP_004295654.1| PREDICTED: uncharacterized protein LOC101314263 [Fragaria vesca
            subsp. vesca]
          Length = 839

 Score =  106 bits (264), Expect(2) = 2e-26
 Identities = 89/392 (22%), Positives = 159/392 (40%), Gaps = 7/392 (1%)
 Frame = +3

Query: 141  YLRAKFFKNSGQLVGYVK-SSILPGLKWVYNEVNSNTKKLIGDGRATSLYFDYWCGDTCI 317
            +  A+F + SGQ   Y K SSI PG++ ++ ++  N+K ++G+G +   +   W   + I
Sbjct: 476  FFSARFLQRSGQPCSYYKRSSIWPGMRPLFTDILYNSKWVVGNGHSIDFWHGNWLNGSII 535

Query: 318  ANVMGHENLDRNLL--VANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGG-DDLRVWK 488
              +     L ++L   V++ I N +W  S  +     A   EI  + +P    DD  VW 
Sbjct: 536  DKLGIVHQLGKSLCGKVSDFILNGSWLCSTNLNAELAALWSEILAIQLPSYDIDDKLVWL 595

Query: 489  PDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGACDKVRSRFKYH 668
               +G LS+  +                  K S   S+    W+              + 
Sbjct: 596  DSLEGSLSLSIAYEF---------------KISKQASVPWDRWR-------------GFS 627

Query: 669  VINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQ---NLTTTYKMAKGKSVM 839
              + C LC++  E+  H+ + C F  + W  I  +FG++ H    +   +Y +  G    
Sbjct: 628  FASMCSLCHASVENSHHLFFECSFSLRVWCAILSLFGVNSHFLDIHAFFSYPLQHGFGTQ 687

Query: 840  FKEPWLLAVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDD 1019
             +  W   +      +W  RN   ++ +    +   +   SQ+ +  S   G M+NS  +
Sbjct: 688  LQLLWWGMMGAGFYSIWDARNSIRFHERHSTPDCLIHSIKSQIREIDSWGLGTMHNSAGE 747

Query: 1020 LRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNV 1199
            L      G+  R  +    +   W  P   ++ +  DG AR  PG AG G + R++  N 
Sbjct: 748  LCTFRALGIKGRASRSHQIREVHWHAPSVFQVKVNTDGAARGTPGLAGFGGIFRDHLGNC 807

Query: 1200 LGALTVGLVIQTNFLAEIYCVILGLEWAIKFG 1295
            +G     + I T   AE+  +I     A + G
Sbjct: 808  MGCFAGSMGIATALEAELQAIIHAASMAARKG 839



 Score = 41.2 bits (95), Expect(2) = 2e-26
 Identities = 17/50 (34%), Positives = 28/50 (56%)
 Frame = +2

Query: 2   FMVAYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAWGRLFES 151
           + VA+ KC +P KEGGLG+  +  +N+A L+K  W+  +        F +
Sbjct: 430 YPVAWKKCCAPLKEGGLGVRNIMALNQAFLLKKFWDFLTKSTTAAAFFSA 479


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 364

 Score =  118 bits (296), Expect = 6e-24
 Identities = 89/337 (26%), Positives = 145/337 (43%), Gaps = 6/337 (1%)
 Frame = +3

Query: 471  DLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGAC--DK 644
            D  +W P   G LS + +   +  R P+L+   L+    + P +   +WK++ G    + 
Sbjct: 3    DKLIWVPLSSGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSED 62

Query: 645  VRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMF--GISPHQNLTTTYKM 818
            +  R    + ++C LC  + ESL HI  +C F +  W   + +F  G  P   +   Y  
Sbjct: 63   LLQRRGIALASRCVLCGRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQNLVDLLYYG 122

Query: 819  AKGKSVMFKEPWLLAVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGY 998
              G+S   KE WL+        +W  RN   ++N  +  +  +   +  V   S    G 
Sbjct: 123  GVGRSHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKTASKLALGC 182

Query: 999  MYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVV 1178
            M NS  +LRVL  FG+  R  +        W PP    + +  DG  +   G++G G + 
Sbjct: 183  MSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKTTGKSGYGGIF 242

Query: 1179 RENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNN 1358
            R+ + + LGA    L I  +  AE+  VI  +E A       I +  DS+  +L +  + 
Sbjct: 243  RDFHGSFLGAFASNLEILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDSI-IVLNFLQDP 301

Query: 1359 MAVPWFMRSRWVVVKARYGSIRF--VHTYREANFSAE 1463
              VPW +R  W     R   + F   H +RE N  A+
Sbjct: 302  HLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVAD 338


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 99.0 bits (245), Expect(2) = 2e-20
 Identities = 99/465 (21%), Positives = 189/465 (40%), Gaps = 14/465 (3%)
 Frame = +3

Query: 111  FALPKKLGEGYLRAKFFKNSGQLVGYVKSSILPGLKW-----VYNEVNSNTKKLIGDGRA 275
            F     L   ++RAK+    GQL  +V+  +     W     + +    N +  +G G+ 
Sbjct: 3007 FRTTNSLWMQFMRAKYC--GGQLPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGHGKL 3064

Query: 276  TSLYFDYWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQAAGVEIQNLPV 455
               + D W G+  +  +   E       V++   N +W +  + + + Q    EI  +P+
Sbjct: 3065 F-FWHDCWMGEEPLV-IRNQEFASSMAQVSDFFLNNSWDIEKLKSVLQQEVVEEIAKIPI 3122

Query: 456  PMGGDDLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGA 635
                +D   W P   G  S +S+  L  +R       N +   SV  +     W+++   
Sbjct: 3123 NASSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDW 3182

Query: 636  CD-KVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTY 812
               +++ + K   +   C C   EESL H+MW     ++ W + + +F I      T  +
Sbjct: 3183 VPVELKMKSKGFQLASRCRCCKSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINH 3242

Query: 813  KMAKG-KSVMFKEPWLLAVLV---IRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYS 980
             ++    S  + +P  +  LV   I   +W+ RN   + N  +  N   +K +  +H   
Sbjct: 3243 IISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLF 3302

Query: 981  SRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRA 1160
               +   +  Q D ++   +G+  + V  S PK  FW  P   E  L  DG ++ N   A
Sbjct: 3303 QGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTA 3362

Query: 1161 GVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAIL 1340
              G ++R++  +++   +     Q +  AE+  +  GL   I   V  + I  D+  A+ 
Sbjct: 3363 AGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQ 3422

Query: 1341 VYSSNNMAVPWFMRSRWVVVKARYG----SIRFVHTYREANFSAE 1463
            + +  +       R+R+++          S R  H +RE N +A+
Sbjct: 3423 MINEGHQG---SSRTRYLLASIHRCLSGISFRISHIFREGNQAAD 3464



 Score = 28.9 bits (63), Expect(2) = 2e-20
 Identities = 13/41 (31%), Positives = 20/41 (48%)
 Frame = +2

Query: 11   AYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAW 133
            ++ K   P  EGGL I  +  + +A  MKL W   ++   W
Sbjct: 2974 SWGKIALPIAEGGLDIRNLEDVFKAFSMKLWWRFRTTNSLW 3014



 Score = 67.8 bits (164), Expect(2) = 4e-12
 Identities = 97/424 (22%), Positives = 172/424 (40%), Gaps = 17/424 (4%)
 Frame = +3

Query: 243  NTKKLIGDGRATSLYFDYWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQ 422
            N +  IG G     + D W GD  +A +    + D +  V        W +  + + +  
Sbjct: 1260 NIRWRIGKGELF-FWHDCWMGDQPLATLFPSFHNDMSH-VHKFYNGDEWDIVKLNSYLPT 1317

Query: 423  AAGVEIQNLPVPMGGDDLRVWKPDYKGVLSVRSSKTLIHKRY-PN-LEGENLLRKPSVHP 596
            +   EI  +P     +D+  W     G  S  S+  +I +R  PN L   N  R  S+  
Sbjct: 1318 SLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHR--SIPL 1375

Query: 597  SLDARNWKIIPGACDKVRSRFK---YHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWIS 767
            S+    W+++      V  R K    H+ +KC  C SEE SL H++W      + W + +
Sbjct: 1376 SISFFLWRVLNNWIP-VELRMKDKGIHLASKCVCCRSEE-SLIHVLWENPVAKQVWNFFA 1433

Query: 768  DMFGI----SPHQNLTTTYKMAKGKSVMFKEPWLLAVLVIRSEMWMTRNGFIYNN----- 920
              F I      H +         G         +L  L I   +W+ RN   + +     
Sbjct: 1434 KSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYP 1493

Query: 921  QKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPP 1100
             +V W I   K ++Q+H   S LK + +    D+  +  +G  +       P+   W  P
Sbjct: 1494 NRVIWRIM--KLLNQLH-AGSLLKQWQWKGDTDIATM--WGFKYPPKYCQSPQIISWIKP 1548

Query: 1101 RRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEW 1280
               E  L  DG ++ +   AG G V+R++   +  A +  L    +  AE++ ++ GL  
Sbjct: 1549 FIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRGLLL 1607

Query: 1281 AIKFGVADICIHTDSMSAILVYSSNNMA---VPWFMRSRWVVVKARYGSIRFVHTYREAN 1451
              +  + ++ I  D++ A+ +   +      + + + S  + +  R  S R  H YRE N
Sbjct: 1608 CKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLES--IRLCLRSFSYRISHIYREGN 1665

Query: 1452 FSAE 1463
             +A+
Sbjct: 1666 QAAD 1669



 Score = 31.6 bits (70), Expect(2) = 4e-12
 Identities = 15/47 (31%), Positives = 21/47 (44%)
 Frame = +2

Query: 11   AYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAWGRLFES 151
            A+ K   P  EGGL I  +R +  A  +KL W   +    W R   +
Sbjct: 1180 AWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRT 1226


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 96.7 bits (239), Expect(2) = 6e-20
 Identities = 102/454 (22%), Positives = 188/454 (41%), Gaps = 13/454 (2%)
 Frame = +3

Query: 141  YLRAKFFKNSGQLVGYVKSSILPGLKWVYNEVNS-----NTKKLIGDGRATSLYFDYWCG 305
            ++R K+ +  GQL  + +  +     W     NS     N +  +G G+    + D W G
Sbjct: 1764 FMRMKYCR--GQLPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVGQGKLF-FWHDCWMG 1820

Query: 306  DTCIANVMGHENLDRNLL-VANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGDDLRV 482
            +T + +   ++ L  +++ V +   N +W +  + T + Q    EI  +P+     D   
Sbjct: 1821 ETPLTS--SNQELSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAY 1878

Query: 483  WKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGACD-KVRSRF 659
            W P   G  S +S+  LI KR       N +   +V  ++    W+++      +++ + 
Sbjct: 1879 WAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKS 1938

Query: 660  KYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTYKM-AKGKSV 836
            K   +   C C   EES+ H+MW     ++ W + S  F I      T    + A   S 
Sbjct: 1939 KGFQLASRCRCCKSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWFYSG 1998

Query: 837  MFKEPWLLAVLVIRSEMW---MTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYN 1007
             + +P  +  LV    +W   + RN   + N  +  N   ++ +  +   S   +   + 
Sbjct: 1999 DYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQ 2058

Query: 1008 SQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVREN 1187
             + D ++   +G+T +      PK + W  P   E  L  DG A+++   AG GV+    
Sbjct: 2059 WKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRDHA 2118

Query: 1188 NTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAV 1367
               V G  +  L IQ +  AE+  +  GL     + +  + I  D+ S I +   N    
Sbjct: 2119 GVMVFG-FSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRG- 2176

Query: 1368 PWFMRSRWVVVK--ARYGSIRFVHTYREANFSAE 1463
            P  +R   V ++    + S R  H +RE N +A+
Sbjct: 2177 PHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAAD 2210



 Score = 29.3 bits (64), Expect(2) = 6e-20
 Identities = 15/43 (34%), Positives = 20/43 (46%)
 Frame = +2

Query: 11   AYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAWGR 139
            ++ K   P KEGGL I  +  +  A  MKL W   +    W R
Sbjct: 1721 SWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTR 1763


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 93.6 bits (231), Expect(2) = 1e-18
 Identities = 101/465 (21%), Positives = 184/465 (39%), Gaps = 14/465 (3%)
 Frame = +3

Query: 111  FALPKKLGEGYLRAKFFKNSGQLVGYVKSSILPGLKW-----VYNEVNSNTKKLIGDGRA 275
            F     L   ++RAK+    GQL   V+  +     W     + +    N +  IG G  
Sbjct: 1719 FRTTNSLWTQFMRAKYC--GGQLPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIGHGEL 1776

Query: 276  TSLYFDYWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQAAGVEIQNLPV 455
               + D W G+  + N            V++   N +W +  + T + Q    EI  +P+
Sbjct: 1777 F-FWHDCWMGEEPLVN-RNQAFASSMAQVSDFFLNNSWNVEKLKTVLQQEVVEEIVKIPI 1834

Query: 456  PMGGDDLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGA 635
                +D   W     G  S +S+  LI  R       N +   SV  +     W+++   
Sbjct: 1835 DTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDW 1894

Query: 636  CD-KVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTY 812
               +++ + K   +   C C   EESL H+MW     ++ W + + +F I      T   
Sbjct: 1895 IPVELKMKTKGFQLASRCRCCKSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQ 1954

Query: 813  KM-AKGKSVMFKEPWLLAVLVIRSEMW---MTRNGFIYNNQKVNWNIFKYKTISQVHDYS 980
             + A   S  + +P  +  LV    +W   + RN   + N  +  N   +K +  +H   
Sbjct: 1955 IICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLF 2014

Query: 981  SRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRA 1160
               +   +  Q D ++   +G+  +    S PK  FW  P   EL L  DG  + NP  A
Sbjct: 2015 QGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSA 2074

Query: 1161 GVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAIL 1340
              G ++R++  +++   +     Q +  AE+  +  GL   I+  ++ + I  D+  A+ 
Sbjct: 2075 AGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQ 2134

Query: 1341 VYSSNNMAVPWFMRSRWVVVKARYG----SIRFVHTYREANFSAE 1463
            +    +       R+R+++          S R  H +RE N +A+
Sbjct: 2135 MIKEGHQG---SSRTRYLLASIHRCLSGISFRISHIFREGNQAAD 2176



 Score = 27.7 bits (60), Expect(2) = 1e-18
 Identities = 13/41 (31%), Positives = 19/41 (46%)
 Frame = +2

Query: 11   AYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAW 133
            ++ K   P  EGGL I  +  +  A  MKL W   ++   W
Sbjct: 1686 SWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRTTNSLW 1726


>ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 409

 Score =  100 bits (249), Expect = 2e-18
 Identities = 87/351 (24%), Positives = 145/351 (41%), Gaps = 9/351 (2%)
 Frame = +3

Query: 438  IQNLPVPMGGD--DLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDAR 611
            I ++P+ +  D  D  +W P   G L  + +   +  R P+L+   L+    + P +   
Sbjct: 21   INDVPISIVPDMSDKLIWVPSSSGELLAKEAFQFMRPRLPSLDWSKLIWSKFIIPRISLH 80

Query: 612  NWKIIPGAC--DKVRSRFKYHVINKCCLCNSE-EESLDHIMWSCDFFSKAWLWISDMF-- 776
            +WK++ G    + +  R    + ++C LC  + E S  HI  +C F +  W   + +F  
Sbjct: 81   SWKVLRGRVLSEDLLQRRGIVLASRCVLCGRDCESSFPHIFLTCSFVASLWNNWACLFEL 140

Query: 777  GISPHQNLTTTYKMAKGKSVMFKEPWLLAVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKT 956
            G  P   +   Y    G+S   KE WL+        +   RN   ++N  +  +      
Sbjct: 141  GSLPQNLVDLIYYGGVGRSHQLKEIWLICYTTTLWFIGKARNKIRHDNCTIVVDAVHQLI 200

Query: 957  ISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGD 1136
            +  V   S    G M NS   LRVL  FG+     +        W PP    + +  DG 
Sbjct: 201  MGHVKAVSKLASGCMSNSLTKLRVLKKFGLLCHPCQALRITKVNWHPPLFGWIKVNTDGA 260

Query: 1137 ARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIH 1316
             +   G++G G + R+ + + LGA    L I  +  AE+  VI  +E A       I + 
Sbjct: 261  WQKTTGKSGYGGIFRDFHGSFLGAFASNLEIPNSVDAEVMAVIQAIELAWVRDWKHILLE 320

Query: 1317 TDSMSAILVYSSNNMAVPWFMRSRWVVVKARYGSIRF--VHTYREANFSAE 1463
             DS + +L +  +   VPW +R        R   + F   H +RE N  A+
Sbjct: 321  VDS-AIVLNFLHDPHLVPWRLRVACGNCLHRISQMNFRSSHIFREGNQVAD 370


>gb|ABD28730.1| Ribonuclease H [Medicago truncatula]
          Length = 409

 Score = 99.0 bits (245), Expect = 5e-18
 Identities = 87/377 (23%), Positives = 151/377 (40%), Gaps = 9/377 (2%)
 Frame = +3

Query: 360  VANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGD-DLRVWKPDYKGVLSVRSSKTLI 536
            VAN + N  W LSD       A   +I  + +P+    D  +W     G LS + + + +
Sbjct: 3    VANYLVNGEWILSDFFAYKDNALVEKIHQIALPLDETLDKLIWTDSVDGDLSNKLAFSFL 62

Query: 537  HKRYPNLEGENLLRKPSVHPSLDARNWKIIPGAC---DKVRSRFKYHVINKCCLCNSEEE 707
                P +    +L      P+     W+ +       D +R R  Y V   CC C  + E
Sbjct: 63   PGHGPTVHWAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCRKQAE 122

Query: 708  SLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTYKMAKGKSVMFKEPWLLAVLVIRSEM 887
            +  HI   C    + W W+  +     H + ++   +++    M +     A++ I   +
Sbjct: 123  TSSHIFLQCPVTLQLWDWL--LKATDQHLDFSSILNISR----MVQHVMNSAIVHIMWSI 176

Query: 888  WMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRL---KGYMYNSQDDLRVLNFFGVTHRK 1058
            W+  N   ++  +   +      +++V   S  L   KG   +S  D ++   F +  + 
Sbjct: 177  WLECNNKYFDGVQKPMSTLFNTILAEVLRLSFMLDIVKGA--SSMQDFKLARLFSIPFKT 234

Query: 1059 VKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTN 1238
             + +  +   W PP    + + CDG    +P    +GV+ R + T   GA    +   T 
Sbjct: 235  NRVNPCREIIWVPPHGGCMKINCDGSVVGSPSCGSIGVIFRASQTMFCGAFAQNIGYATA 294

Query: 1239 FLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMRSRWVVVKARYGS 1418
              AE    +  +E A +  + +I I TDS++ I  +  N   VPW M  RW        S
Sbjct: 295  LEAEYSACMFAIEKAKELHLTNIWIETDSVNVIRAFHFNT-GVPWKMHIRWHNCLLFCRS 353

Query: 1419 IRFV--HTYREANFSAE 1463
            IR +  H  RE N  A+
Sbjct: 354  IRSLCTHVNREGNLVAD 370


>ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, partial [Prunus persica]
            gi|462409318|gb|EMJ14652.1| hypothetical protein
            PRUPE_ppa024777mg, partial [Prunus persica]
          Length = 465

 Score = 98.2 bits (243), Expect = 8e-18
 Identities = 87/344 (25%), Positives = 145/344 (42%), Gaps = 10/344 (2%)
 Frame = +3

Query: 462  GGDDLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGAC- 638
            G  DL VW P   G  S + +      ++  +    L+ KP + P      WK++ G   
Sbjct: 102  GAGDLLVWAPSSSGGFSAKDAYEFTRPKFAKVPWCKLIWKPFIEPWKSFLAWKVMHGRLL 161

Query: 639  --DKVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTY 812
              D ++ R                E+++H+   C F    W  +  +FG+          
Sbjct: 162  TEDFLQKR-----------AWMAPENINHLFSECPFTCSIWSSMFIVFGLHFTSGPLAVI 210

Query: 813  KMAKGKSVMFK----EPWLLAVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYS 980
             ++ G S  F     + WLL    I   +W  RN   +  +KV+      +TI      S
Sbjct: 211  -LSSGLSAHFSPQLMDLWLLMFRTIVWLIWDLRNKLRFE-EKVSTVSSNCRTIINHVPAS 268

Query: 981  SRL-KGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGR 1157
            S L +G++ N   DL ++   GV +R   +S      W PP    + +  DG  + + G+
Sbjct: 269  SPLARGHILNKVHDLCIIRSIGVHYRPRPNSKIVEVTWHPPCFGFVKIKIDGACKRDSGK 328

Query: 1158 AGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAI 1337
            AG G V R    +VLGA +  L + +   AE+  VI  +E A      +I I TDS+   
Sbjct: 329  AGSGGVFRNYQGHVLGAFSANLDVPSGVHAEVLAVIKAIELAWLHAWHNIWIETDSLLVT 388

Query: 1338 LVYSSNNMAVPWFMRSRW--VVVKARYGSIRFVHTYREANFSAE 1463
              + S ++ VPW +R  W   +++ ++ S +  H +RE N   +
Sbjct: 389  KFFRSPHL-VPWRLRVDWQNCLLRLQHMSFKISHIFREGNHDVD 431


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 83.6 bits (205), Expect(2) = 5e-16
 Identities = 115/467 (24%), Positives = 190/467 (40%), Gaps = 27/467 (5%)
 Frame = +3

Query: 144  LRAKFFKNS---GQLVGYVKSSILPGLKWVY----NEVN-SNTKKLIGDGRATSLYF--D 293
            L  KF K     GQ+  YV   +     W       EV   NT+  IG G   SL+F  D
Sbjct: 1465 LWTKFLKTKYCMGQIPHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIGKG---SLFFWHD 1521

Query: 294  YWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGDD 473
             W GD  +     H   D +  V N      W +  +   +      EI  +P+    DD
Sbjct: 1522 CWMGDQPLVTSFPHFRNDMST-VHNFFNGHNWDVDKLNLYLPMNLVDEILQIPIDRSQDD 1580

Query: 474  LRVWKPDYKGVLSVRSSKTLIH-KRYPNLEGENLLRKPSVHPSLDARNWKIIPGACD-KV 647
            +  W     G  S RS+   I  ++ PN+    L  K S+  S+    W++        +
Sbjct: 1581 VAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHK-SIPLSISFFLWRVFHNWIPVDI 1639

Query: 648  RSRFK-YHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFG--ISPHQN----LTT 806
            R + K +H+ +KC  CNSEE SL H++W      + W + ++ F   IS  QN    L T
Sbjct: 1640 RLKEKGFHLASKCICCNSEE-SLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQILWT 1698

Query: 807  TYKMAKGKSVMFKEPWLLAVLVIRSEMWMTRNGFIYN-----NQKVNWNIFKYKTISQVH 971
             Y    G  V      +L  L I   +W+ RN   +      + +V W I   K + Q+ 
Sbjct: 1699 WY--LSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIM--KLLRQLQ 1754

Query: 972  DYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNP 1151
            D    LK + +    D   +  +G+       + P+   W  P   E  L  DG +R N 
Sbjct: 1755 D-GYLLKSWQWKGDKDFATM--WGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQNQ 1811

Query: 1152 GRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMS 1331
              A +G V+R++   ++   +  +    +  AE+  ++ GL    +  +  + +  D++ 
Sbjct: 1812 -TAAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDALV 1870

Query: 1332 AILVYSSNNMA---VPWFMRSRWVVVKARYGSIRFVHTYREANFSAE 1463
            AI +   +      + + + S  +     + S R  H +RE N +A+
Sbjct: 1871 AIQMIQQSQKGSHDIRYLLAS--IRKYLNFFSFRISHIFREGNQAAD 1915



 Score = 29.3 bits (64), Expect(2) = 5e-16
 Identities = 14/47 (29%), Positives = 23/47 (48%)
 Frame = +2

Query: 11   AYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSKKAWGRLFES 151
            A+ K   P  EGGL I ++  M  A  +KL W   + +  W +  ++
Sbjct: 1426 AWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKT 1472


>emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1389

 Score = 79.3 bits (194), Expect(2) = 2e-15
 Identities = 93/441 (21%), Positives = 179/441 (40%), Gaps = 29/441 (6%)
 Frame = +3

Query: 228  NEVNSNTKKLIGDGRATSLYFDYWCGDTCIANVMGHENLDRNLLVANCIQNWA-WFLSDV 404
            N  +   + LIGDG+  S + D W     + +         N+ VA C      W +  +
Sbjct: 932  NFFSKGLRWLIGDGQDISFWTDNWIFQYPLNSKYVPTVGSENIKVAECFNGLGGWDIPKL 991

Query: 405  VTQIFQAAGVEIQNLPVPMGGD-DLRVWKPDYKGVLSVRSSKTLIHK-RYPNLEGENLLR 578
            +T +       I ++ +P     D  +W     G  SV+S  +LI +     +E      
Sbjct: 992  LTLVPPNIVKAISSVFIPSSSQQDRLLWGLTPTGQYSVKSGASLIREVNGGTIEKVEFNW 1051

Query: 579  KPSVHPSLDARN--WKIIPGACDKVRSRFKYHVI--NKCCLCNSEEESLDHIMWSCDFFS 746
               +H     +N  WK             + H+     CC C+   E++ H+ + C F  
Sbjct: 1052 IWGIHAPPKIKNFLWKACNDGLATTSRLERSHIFVPQNCCFCDCPSETICHLCFQCPFTL 1111

Query: 747  KAWLWISDMFGISPHQNLTTTYKMAKGKSVM------FKEPWLLAVLVIRSEMWMTRNGF 908
              +  + D F    + +  +T +++  +SV+          +L  + ++   +W  RN  
Sbjct: 1112 DIYSHLEDKFQWPAYPSWFSTLQLSSFRSVLEACHINLTLEYLTKLSIVWWHVWYFRNKL 1171

Query: 909  IYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHR--KVKDSDPKP 1082
            I+NN+  +++   +     +H +  + +      + +L + +F     +  K+     K 
Sbjct: 1172 IFNNESTSFSQASF----IIHSFMGKWE------KANLEIPSFNTPLPKDCKLPVRSGKN 1221

Query: 1083 YFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFL-AEIYC 1259
              W PP  + L +  DG ++++ G+A  G V+R +N  VL A    L +  + L AE   
Sbjct: 1222 LIWSPPNEDVLKVNFDG-SKLDNGQAAYGFVIRNSNGEVLMARAKALGVYPSILMAEAMG 1280

Query: 1260 VILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAV----------PWFMRSRWVVVKAR 1409
            ++ G++ AI            + S  +++  +N+AV          PW + +  +   A 
Sbjct: 1281 LLEGIKGAISL---------QNWSRKIIFEGDNIAVINAMSPSATGPWTIANIILDAGAL 1331

Query: 1410 YG---SIRFVHTYREANFSAE 1463
             G    ++F H YREAN  A+
Sbjct: 1332 LGHFQEVKFQHCYREANRLAD 1352



 Score = 31.2 bits (69), Expect(2) = 2e-15
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
 Frame = +2

Query: 8   VAYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNICSSK-KAWGRL 142
           + ++K   P   GG+G  +    N A+ MKL W I  SK   W +L
Sbjct: 855 IGWNKICQPKSVGGVGFRKAEVTNIALQMKLLWKIMVSKDNIWVKL 900


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 89.7 bits (221), Expect = 3e-15
 Identities = 94/411 (22%), Positives = 170/411 (41%), Gaps = 9/411 (2%)
 Frame = +3

Query: 258  IGDGRATSLYF--DYWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQAAG 431
            +G G   +L+F  D W GD  + +    E     + V +   N +W +  + T + Q   
Sbjct: 467  VGQG---NLFFWHDCWMGDAPLIS-SNQEFTSSMVQVCDFFMNNSWNVEKLKTVLQQEVV 522

Query: 432  VEIQNLPVPMGGDDLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDAR 611
             EI  +P+     D   W P   G  S +S+  LI KR       N +   +V  +    
Sbjct: 523  DEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFF 582

Query: 612  NWKIIPGACD-KVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISP 788
             W+++      +++ + K   +   C C   EES+ H+MW      + W + + +F I  
Sbjct: 583  LWRLLHDWIPVELKMKSKGLQLASRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQICI 642

Query: 789  HQNLTTTYKM-AKGKSVMFKEPWLLAVLV---IRSEMWMTRNGFIYNNQKVNWNIFKYKT 956
                T    + A   S  + +P  +  LV   I   +W+ RN   + N  +  N   ++ 
Sbjct: 643  INPCTINQIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRV 702

Query: 957  ISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGD 1136
            +  +   S   +   +  + D ++   +G+  +    + PK + W  P   E  L  DG 
Sbjct: 703  LKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGS 762

Query: 1137 ARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIH 1316
            A+ +   AG G++       V G  +  L IQ +  AE+  +  GL     + +  + I 
Sbjct: 763  AKHSHNAAGGGILRDHAGVMVFG-FSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIE 821

Query: 1317 TDSMSAILVYSSNNMAVPWFMRSRWVVVK--ARYGSIRFVHTYREANFSAE 1463
             D++S I +   N+   P  +R   V ++    + S RF H +RE N +A+
Sbjct: 822  MDAISVIRLLQGNHRG-PHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAAD 871


>ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601483 [Solanum tuberosum]
          Length = 2019

 Score = 87.4 bits (215), Expect = 1e-14
 Identities = 96/444 (21%), Positives = 167/444 (37%), Gaps = 23/444 (5%)
 Frame = +3

Query: 201  ILPGLKWVYNEVNSNTKKLIGDGR--------------ATSLYFDYWCGDTCIANVMGHE 338
            I P  +WV  ++N++   L   GR              + S ++D W G   +A+   + 
Sbjct: 775  IKPKKQWV--KINTDGSALCNPGRIGAGSNIQWRIRSGSCSFWWDNWLGVGPLAHYTSNS 832

Query: 339  NLDRNLLVANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGDDLRVWKPDYKGVLSVR 518
            N   N  V+  I+   W +  V+                 +      VWK +  G+ SV 
Sbjct: 833  NRFNNDSVSEFIEEGHWNIPKVLR----------------VAPPSQAVWKLNSSGLFSVS 876

Query: 519  SSKTLIHKRYPNLEGENLLRKPSVHPSLDARNWKIIPGACDKVRSRFKYHVINKCCLCNS 698
            S+   I ++    +       P +        W+ I G          + +    C C  
Sbjct: 877  SAWNSIREKREITKINKYTWHPKIPFKCSFLLWRAIRGKLPTNEKLLSFGIEPSDCHCCH 936

Query: 699  EE--ESLDHIMWSCDFFSKAWLWISDMFGIS----PHQNLTTTYKMAKGKSVMFKEPWLL 860
                ++++H + S DF    W + +   GI     P +N+   +  A   +   K     
Sbjct: 937  SPGIDTIEHTLNSGDFAKNVWKYFAISLGIRTDFLPLRNMIMRWWSAPHNNEAHKLILHS 996

Query: 861  AVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFF 1040
              + I   +W  R    Y  ++ N  I + K +  + ++      + Y S   LR   F 
Sbjct: 997  TPIFICWNLWKNRCAVKYGGKQSN--IARVKHLVILDNFKLLHTVFPYISWP-LRWNKFC 1053

Query: 1041 GVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVG 1220
             V     +D+      W  P    + L  DG A  NPG  G G V+R +   ++ A +  
Sbjct: 1054 NVIENCSQDTKVTAVQWTKPPYRWVKLNTDGSALSNPGSIGAGDVIRNHLGEIILAYSTP 1113

Query: 1221 LVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMRSRWV-- 1394
            L   TN  AE+   I G+ W I      + +  DS   ++ +  NN  +PW + S+    
Sbjct: 1114 LGTGTNNQAEVEAAIFGIAWCIHMKYNQVILEVDS-QLLVDWFKNNKLIPWNISSQMQQL 1172

Query: 1395 -VVKARYGSIRFVHTYREANFSAE 1463
              +  +    + +HT+REANF A+
Sbjct: 1173 HQLATQLDHFKCIHTFREANFVAD 1196


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 87.0 bits (214), Expect = 2e-14
 Identities = 89/409 (21%), Positives = 169/409 (41%), Gaps = 7/409 (1%)
 Frame = +3

Query: 258  IGDGRATSLYFDYWCGDTCIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQAAGVE 437
            +G G     + D W G+  + +    E     + V +   N +W +  + T + Q    E
Sbjct: 1808 VGQGNVF-FWHDCWMGEAPLIS-SNQEFTSSMVQVCDFFTNNSWNIEKLKTVLQQEVVDE 1865

Query: 438  IQNLPVPMGGDDLRVWKPDYKGVLSVRSSKTLIHKRYPNLEGENLLRKPSVHPSLDARNW 617
            I  +P+     D   W P   G  S +S+  LI KR       N +   +V  +     W
Sbjct: 1866 IAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 1925

Query: 618  KIIPGACD-KVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQ 794
            +++      +++ + K   +   C C   EES+ H+MW      + W + + +F I    
Sbjct: 1926 RLLHDWIPVELKMKSKGLQLASRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQILIIN 1985

Query: 795  NLTTTYKM-AKGKSVMFKEPWLLAVLV---IRSEMWMTRNGFIYNNQKVNWNIFKYKTIS 962
              T    + A   S  + +P  +  LV   I   +W+ RN   + N  +  N   ++ + 
Sbjct: 1986 PCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLK 2045

Query: 963  QVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDAR 1142
             +   S   +   +  + D ++   +G+  +    + PK + W  P   E  L  DG A+
Sbjct: 2046 LIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAK 2105

Query: 1143 VNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTD 1322
             +   AG G ++R++   ++   +  L  Q +  AE+  +  GL     + +  + I  D
Sbjct: 2106 QSHNAAG-GGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMD 2164

Query: 1323 SMSAILVYSSNNMAVPWFMRSRWVVVK--ARYGSIRFVHTYREANFSAE 1463
            ++S I +   N+   P  +R   V ++    + S RF H +RE N +A+
Sbjct: 2165 AISVIRLLQGNHRG-PHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAAD 2212


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 67.0 bits (162), Expect(2) = 2e-14
 Identities = 105/480 (21%), Positives = 181/480 (37%), Gaps = 32/480 (6%)
 Frame = +3

Query: 108  IFALPKKLGEGYLRAKFFKNSGQLVGYVKSSILPGLKWVYNE---VNSNTKKLIGDGRAT 278
            I   P  L    ++ K+F  S  L   V  ++    K + +    +     ++IGDGR T
Sbjct: 882  ILTKPDSLMARVIKGKYFPRSNFLEARVSPNMSFTCKSILSARAVIQKGMCRVIGDGRDT 941

Query: 279  SLYFDYWCGDT---CIANVMGHENLDRNLLVANCIQNWAWFLSDVVTQIFQA-AGVEIQN 446
            +++ D W        IA   G    D    V   I N  W + +++  +FQ      IQ 
Sbjct: 942  TIWGDPWVPSLERYSIAATEGVSEDDGPQKVCELISNDRWNV-ELLNTLFQPWESTAIQR 1000

Query: 447  LPVPMGGD-DLRVWKPDYKGVLSVRSS--KTLIHKRY--------PNLEGENLLRKPSVH 593
            +PV +    D  +W     G  +VRS+    L+  R         PNL+    + K  + 
Sbjct: 1001 IPVALQKKPDQWMWMMSKNGQFTVRSAYYHELLEDRKTGPSTSRGPNLKLWQKIWKAKIP 1060

Query: 594  PSLDARNWKIIPGACDKVRSRFK--YHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWIS 767
            P +   +WK I        +  K   ++   C  C  +EE+ +H++W CD  S+AW    
Sbjct: 1061 PKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKEETTEHLIWGCDESSRAWY--- 1117

Query: 768  DMFGISPHQNLTTTYKMAKGKSVMFKE---------PWLLAVLVIRSEMWMTRNGFIYNN 920
                ISP +    T  +  G   ++ E          W     +I   +W+ RN +++  
Sbjct: 1118 ----ISPLR--IHTGNIEAGSFRIWVESLLDTHKDTEWWALFWMICWNIWLGRNKWVFEK 1171

Query: 921  QKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPP 1100
            +K+ +     + +  V ++              +  LN    TH            W  P
Sbjct: 1172 KKLAFQEVVERAVRGVMEFEEECA-----HTSPVETLN----THEN---------GWSVP 1213

Query: 1101 RRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTV-GLVIQTNFLAEIYCVILGLE 1277
                + L  D     + G  G+G VVR+   +VL A    G  ++   +AE   +  GL+
Sbjct: 1214 PVGMVKLNVDAAVFKHVG-IGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSLRYGLK 1272

Query: 1278 WAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMR--SRWVVVKARYGSIRFVHTYREAN 1451
             A + G  ++ +  D     L        V  F R     + + ++  ++ F H  R  N
Sbjct: 1273 VAYEAGFRNLVVEMDCKKLFLQLRGKASDVTPFGRVVDDILYLASKCSNVVFEHVKRHCN 1332



 Score = 40.0 bits (92), Expect(2) = 2e-14
 Identities = 18/35 (51%), Positives = 22/35 (62%)
 Frame = +2

Query: 8   VAYDKCYSPYKEGGLGITQMRFMNRAMLMKLCWNI 112
           VA++K + P KEGGLGI      NRA+L K  W I
Sbjct: 848 VAWEKLFLPKKEGGLGIRNFDVFNRALLAKQAWRI 882


>gb|AGV40503.1| hypothetical protein [Phaseolus vulgaris]
          Length = 234

 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 58/177 (32%), Positives = 85/177 (48%), Gaps = 6/177 (3%)
 Frame = +3

Query: 951  KTISQVHDYSSRL----KGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELM 1118
            K IS + D +  +    K  M N   D  V+ FFG+  R  K   P P  WE P    + 
Sbjct: 19   KAISIIKDLTCLVGNSSKASMKNDMLDFNVIKFFGIKTRSGKVLRPLPIRWEFPSPGWVK 78

Query: 1119 LCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGV 1298
            +  DG AR  PG A  G + R +    +GA +  L +QT  +AE Y VI  +E A K G+
Sbjct: 79   INTDGAARGYPGLATCGGIFRGSMGEFIGAFSAFLEVQTALVAEFYGVIHAMEEAQKMGL 138

Query: 1299 ADICIHTDSMSAILVYSSNNMAVPWFMRSRWVVVKARYGSIRF--VHTYREANFSAE 1463
             ++ +  DS      +++    VPW +++RW       G+IRF   H +RE N  A+
Sbjct: 139  TNVWLECDSALVCAAFTART-NVPWMLQNRWNTCLNFCGTIRFRVTHIFREGNACAD 194


>ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 487

 Score = 86.3 bits (212), Expect = 3e-14
 Identities = 106/447 (23%), Positives = 181/447 (40%), Gaps = 23/447 (5%)
 Frame = +3

Query: 192  KSSILPGLKWVYNEVNSNTKKLIGDGRATSLYFDYWCGDTCIANVM---GHENLDRNLLV 362
            ++ IL G++W+           +G+G     +   W  +  + N++       +D N  V
Sbjct: 42   RNLILKGMRWI-----------VGNGENIKFWTFNWAYEFPLLNLIQINDRNAIDLNETV 90

Query: 363  ANCIQNWAWFLSDVVTQIFQAAGVEIQNLPVPMGGD-DLRVWKPDYKGVLSVRSSKTLIH 539
            A+ I N  W +  ++  + Q    +I  +P+ +    D  +W P   G  SV+S+  L  
Sbjct: 91   ADYIFNGCWNIQKLLQVLDQETVKQITGIPILVSNQCDECIWAPPTDGRFSVKSATWL-- 148

Query: 540  KRYPNLEGE------NLLRKPSVHPSLDARNWKIIPGACDKVR---SRFKYHVINKCCLC 692
             +Y NLE        N + K  V   +    W ++ G   K R   S+F Y   N C LC
Sbjct: 149  -QYQNLEKHQQSDLINKVWKLDVPLKVKLFGWLLLRGRL-KTRDRLSKFGYIDDNSCPLC 206

Query: 693  NSEEESLDHIMWSCDFFSKAWLWISDMFGISPHQNLTTTYKMAKGKSVMFKEPW----LL 860
            +S+ E+ DH+   CDF ++ +     + GIS   +    Y     + +   +P+      
Sbjct: 207  DSDNETADHLFGHCDFTTEVFR----LAGISALMDWHEGYLKVL-REMFINQPYDKFLFA 261

Query: 861  AVLVIRSEMWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFF 1040
             VL+I  ++W  RN  I+ +  V          +  H   + L               + 
Sbjct: 262  KVLIIYWQIWKARNDTIFRD--VITTATNVAATAAFHFNETAL---------------YK 304

Query: 1041 GVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVG-VVVRENNTNVLGALTV 1217
             V    +  +      W PP  N + +  DG  +   GR+  G  V R ++ NV+ A   
Sbjct: 305  AVVGGGISQTTSSTIRWLPPHNNFIKINFDGSVQ---GRSAAGGFVFRNSDGNVILAAAK 361

Query: 1218 GLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMRSRWVV 1397
            GL   T   AE   +   L  A   G  ++ +  DS   ++   +  ++ PW  R + +V
Sbjct: 362  GLGSTTIPTAEATALRDSLVKARDRGYMNVQVEGDS-KLVIDAINGKLSPPW--RLQKIV 418

Query: 1398 VKAR-----YGSIRFVHTYREANFSAE 1463
               R     + S+ F H YREANF A+
Sbjct: 419  QDIRTIATSFSSVCFNHVYREANFMAD 445


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 84.3 bits (207), Expect = 1e-13
 Identities = 94/409 (22%), Positives = 176/409 (43%), Gaps = 16/409 (3%)
 Frame = +3

Query: 285  YFDYWCGDTCIANVMGHENLDRNLLVANCIQNW-AWFLSDVVTQIFQAAGVEIQNLPVPM 461
            + D W GD  + N     +  ++++  N   N  AW +  + T I  A   EI  +P+  
Sbjct: 900  WHDAWMGDEPLVN--SFPSFSQSMMKVNYFFNDDAWDVDKLKTFIPNAIVEEILKIPISR 957

Query: 462  GGDDLRVWKPDYKGVLSVRSSKTLIHKRYP-NLEGENLLRKPSVHPSLDARNWKIIPGAC 638
              +D+  W     G  S++S+  L+ +R   NL G+ +  K S+  ++    W+ +    
Sbjct: 958  EKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHK-SIPLTVSFFLWRTLHNWL 1016

Query: 639  D-KVRSRFKYHVINKCCLCNSEEESLDHIMWSCDFFSKAWLWISDMFGISPH--QNLTTT 809
              +VR + K   +   CLC   EESL H++W      + W + S  F I  H  QN+   
Sbjct: 1017 PVEVRMKAKGIQLASKCLCCKSEESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQI 1076

Query: 810  YKMAKGKSVMFKEPW---LLAVLVIRSEMWMTRNGFIYNN-----QKVNWNIFKYKTISQ 965
               +   S  F +P     L +L I   +W+ RN   + +      ++ W I   K + +
Sbjct: 1077 LN-SWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIM--KILRK 1133

Query: 966  VHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVKDSDPKPYFWEPPRRNELMLCCDGDARV 1145
            +       K   +  + DL +   +G    + + + PK   W  P   EL L  DG ++ 
Sbjct: 1134 LFQGGLLCK---WQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKD 1190

Query: 1146 NPGRAGVGVVVRENNTNVLGALTVGLVIQTNFLAEIYCVILGLEWAIKFGVADICIHTDS 1325
                A  G V+R++  N++   +     Q +  AE+  +  GL   +++ V+ + I  D+
Sbjct: 1191 EFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYNVSRVWIEVDA 1250

Query: 1326 MSAILVYSSNNMA---VPWFMRSRWVVVKARYGSIRFVHTYREANFSAE 1463
               I +  +++     + + + S  +    +  S+R  H +RE N +A+
Sbjct: 1251 QVVIQMIQNHHKGSYKIQYLLES--IRKCLQVISVRISHIHREGNQAAD 1297


>ref|XP_004308354.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 235

 Score = 83.6 bits (205), Expect = 2e-13
 Identities = 55/195 (28%), Positives = 95/195 (48%), Gaps = 2/195 (1%)
 Frame = +3

Query: 885  MWMTRNGFIYNNQKVNWNIFKYKTISQVHDYSSRLKGYMYNSQDDLRVLNFFGVTHRKVK 1064
            +W  RN   ++N+  N+       ++ +   S    G+ Y    D R+L   GV  +  K
Sbjct: 3    LWKARNKLRFDNRPPNFYTMCCSIMAWIRQISLFAPGH-YKGVLDARLLASLGVASKGGK 61

Query: 1065 DSDPKPYFWEPPRRNELMLCCDGDARVNPGRAGVGVVVRENNTNVLGALTVGLVIQTNFL 1244
                +   W+PP    + +  +G A+ NPG A  G V R+ +   LG+    L  +T+F 
Sbjct: 62   APRIQHVLWQPPFFPWIKVNTNGLAKGNPGPAACGGVFRDASGGFLGSFCHSLGWKTSFY 121

Query: 1245 AEIYCVILGLEWAIKFGVADICIHTDSMSAILVYSSNNMAVPWFMRSRW--VVVKARYGS 1418
            +E+Y VIL +E A   G   + + +DS+S +  +SS + +  W +R RW   ++  R  +
Sbjct: 122  SELYVVILAIEIAHDKGWVYLWLESDSVSVVACFSSRSFSPTWNLRVRWNNCLLIIRQMN 181

Query: 1419 IRFVHTYREANFSAE 1463
             R+ H +RE N  A+
Sbjct: 182  FRYSHIFREGNIVAD 196

