BLASTX nr result

ID: Mentha25_contig00042712 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00042712
         (842 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN65842.1| hypothetical protein VITISV_027369 [Vitis vinifera]   274   3e-71
ref|XP_006478124.1| PREDICTED: putative nuclease HARBI1-like [Ci...   265   2e-68
ref|XP_007037522.1| Uncharacterized protein TCM_014176 [Theobrom...   262   1e-67
ref|XP_006493601.1| PREDICTED: putative nuclease HARBI1-like iso...   258   2e-66
ref|XP_004231640.1| PREDICTED: uncharacterized protein LOC101266...   253   6e-65
ref|XP_006339380.1| PREDICTED: putative nuclease HARBI1-like [So...   249   7e-64
ref|XP_007213261.1| hypothetical protein PRUPE_ppb021413mg [Prun...   249   9e-64
gb|ADN33754.1| retrotransposon protein [Cucumis melo subsp. melo]     249   9e-64
gb|EPS72545.1| hypothetical protein M569_02213, partial [Genlise...   244   2e-62
ref|XP_007024956.1| Uncharacterized protein TCM_029406 [Theobrom...   233   7e-59
ref|XP_006348011.1| PREDICTED: uncharacterized protein LOC102587...   232   2e-58
ref|XP_007210560.1| hypothetical protein PRUPE_ppa014600mg [Prun...   231   3e-58
ref|XP_004250658.1| PREDICTED: uncharacterized protein LOC101260...   231   3e-58
gb|ADN34114.1| retrotransposon protein [Cucumis melo subsp. melo]     224   2e-56
ref|XP_004297872.1| PREDICTED: uncharacterized protein LOC101314...   218   2e-54
ref|XP_007050284.1| Uncharacterized protein TCM_004024 [Theobrom...   218   2e-54
gb|EAZ12157.1| hypothetical protein OsJ_02038 [Oryza sativa Japo...   216   9e-54
ref|XP_006587058.1| PREDICTED: putative nuclease HARBI1-like [Gl...   206   9e-51
ref|XP_006586288.1| PREDICTED: putative nuclease HARBI1-like [Gl...   206   1e-50
ref|XP_006605554.1| PREDICTED: uncharacterized protein LOC100780...   205   2e-50

>emb|CAN65842.1| hypothetical protein VITISV_027369 [Vitis vinifera]
          Length = 579

 Score =  274 bits (701), Expect = 3e-71
 Identities = 138/263 (52%), Positives = 171/263 (65%), Gaps = 28/263 (10%)
 Frame = -3

Query: 705 QVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILS 526
           +V++L+R++  SD+ C+  LRMD +TF  LC +LR +G L D KY+ VEE V  FL IL+
Sbjct: 53  RVENLNRLIYGSDVACMEQLRMDRHTFTTLCSMLRTIGKLKDSKYIDVEEMVALFLHILA 112

Query: 525 HHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCL 346
           HH KNRV+ F F RSG+TIS + + VL A+++L G+LL KPEPV+E+ +D RWKWFK CL
Sbjct: 113 HHVKNRVIKFRFLRSGETISRHFNAVLNAVIRLQGVLLKKPEPVSENSTDERWKWFKNCL 172

Query: 345 GALDGTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRD 250
           GALDGTYI V V   DKPR                            WEGS  DSRVLRD
Sbjct: 173 GALDGTYIKVNVREGDKPRYRTRKNEIATNVLGVCSQDMQFIYVLPGWEGSTSDSRVLRD 232

Query: 249 AISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRH 70
           A+SR +GL VP G YYL D GY N  GFL PY+  RYHL +W      P   EE FNM+H
Sbjct: 233 AVSRRNGLTVPHGYYYLVDVGYTNGKGFLAPYRGQRYHLNDW-REGHMPTTHEEFFNMKH 291

Query: 69  TKARNVIERAFGIMKMRWGILRS 1
           + ARNVIER FG++K+RW ILRS
Sbjct: 292 SAARNVIERCFGLLKLRWAILRS 314


>ref|XP_006478124.1| PREDICTED: putative nuclease HARBI1-like [Citrus sinensis]
          Length = 370

 Score =  265 bits (676), Expect = 2e-68
 Identities = 137/283 (48%), Positives = 172/283 (60%), Gaps = 28/283 (9%)
 Frame = -3

Query: 765 KKRKLDSLVQPYSMLDRIPNQVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGL 586
           KK     +++  S ++   +Q+ +L  ++   D+ C+  LRMD  TF  LC LLR  G +
Sbjct: 11  KKHNERKMIRSQSFINHA-SQLHYLDSIIGNGDLQCVHQLRMDRRTFGLLCELLRSRGKV 69

Query: 585 TDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVK 406
                V+VEEQV  FL ILSHH KNR +   FFRSG+T+S Y + VLK +L+L  +LL  
Sbjct: 70  KADGLVTVEEQVCMFLHILSHHVKNRTISSRFFRSGETVSRYFNSVLKGVLRLQSLLLKA 129

Query: 405 PEPVAEDCSDPRWKWFKGCLGALDGTYINVKVSNEDKPR--------------------- 289
           PEPV E+ +D RW+WFK CLGALDGTYI V+V   DKPR                     
Sbjct: 130 PEPVPENYTDGRWRWFKNCLGALDGTYIRVRVPENDKPRYRTRKGEIATNVLGVCSRDMK 189

Query: 288 -------WEGSVGDSRVLRDAISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLK 130
                  WEGS  DSR+LRDAIS+  GL VP G YYL D GY+N +GFL PY+  RYHL 
Sbjct: 190 FIFVMPGWEGSASDSRILRDAISKPTGLRVPTGYYYLVDAGYSNAEGFLAPYRGTRYHLS 249

Query: 129 EWGPNSAQPHNAEEMFNMRHTKARNVIERAFGIMKMRWGILRS 1
           EW    A P N EE FNM+H+  RNV+ER FG++KMRW ILRS
Sbjct: 250 EWRDGCA-PQNKEEFFNMKHSSTRNVVERCFGLLKMRWAILRS 291


>ref|XP_007037522.1| Uncharacterized protein TCM_014176 [Theobroma cacao]
           gi|508774767|gb|EOY22023.1| Uncharacterized protein
           TCM_014176 [Theobroma cacao]
          Length = 706

 Score =  262 bits (669), Expect = 1e-67
 Identities = 133/261 (50%), Positives = 166/261 (63%), Gaps = 28/261 (10%)
 Frame = -3

Query: 699 QHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHH 520
           +++ R+V  +DI CI  +RM+  TF +LC +L  +GGL   K + V+EQV  FL I++HH
Sbjct: 62  EYVRRLVYDNDISCISQIRMNRVTFLKLCEMLESIGGLKSTKNMLVDEQVAIFLHIIAHH 121

Query: 519 KKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGA 340
            KNRV+  +F RSG++IS + H VL A+LKL   L  KPEP+  + +D +WKWFK CLGA
Sbjct: 122 VKNRVISLNFRRSGESISRHFHNVLAAVLKLQEHLFRKPEPIPTNSTDNQWKWFKNCLGA 181

Query: 339 LDGTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRDAI 244
           LDGTYI VKV + DKPR                            WEGSV D RVLRDA+
Sbjct: 182 LDGTYIRVKVPSADKPRYRTRKGNIATNMLGVCTPDMQFVFVLPGWEGSVADGRVLRDAL 241

Query: 243 SRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTK 64
            R +GL VP GCYYL D GY N +GFL PY+  RYHL EW      P + EE FNM+H  
Sbjct: 242 RRRNGLKVPNGCYYLVDAGYTNCEGFLAPYRGQRYHLNEW-RQGHDPSSHEEFFNMKHAA 300

Query: 63  ARNVIERAFGIMKMRWGILRS 1
           ARNVIER FG++KMRWGILRS
Sbjct: 301 ARNVIERCFGLLKMRWGILRS 321


>ref|XP_006493601.1| PREDICTED: putative nuclease HARBI1-like isoform X1 [Citrus
           sinensis] gi|568881482|ref|XP_006493602.1| PREDICTED:
           putative nuclease HARBI1-like isoform X2 [Citrus
           sinensis]
          Length = 393

 Score =  258 bits (659), Expect = 2e-66
 Identities = 142/310 (45%), Positives = 187/310 (60%), Gaps = 30/310 (9%)
 Frame = -3

Query: 840 MIQEIIS--SYLLSVFTIVHMFVETSCKKRKLDSLVQPYSMLDRIPNQVQHLHRMVEVSD 667
           +IQE++     +++V   ++ +V  SC++R L+S    Y    R+  Q+ HLH +V  SD
Sbjct: 14  IIQELVVMLQMIMAVCAAMYSYVYLSCEQR-LESPNSAY----RVYQQLSHLHGLVFESD 68

Query: 666 IDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFF 487
           I C+  LRMD   F +LC LL + G L   K VS EE V  FL IL+HH KNRVVGF+F 
Sbjct: 69  IKCLSQLRMDRQAFFKLCKLLCEKGSLVRSKRVSPEEMVAMFLSILAHHVKNRVVGFNFK 128

Query: 486 RSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINVKVS 307
           RS +T+S   H  L+A+++       KPEP+ ++ +DP+WKWF  CLGALDGTYI V VS
Sbjct: 129 RSRRTVSKCFHECLRAMIRCQKEFWKKPEPITDNSTDPKWKWFTNCLGALDGTYIKVHVS 188

Query: 306 NEDKPR----------------------------WEGSVGDSRVLRDAISRVHGL*VPKG 211
             DKPR                            WEGS  D RVL+DA++R +GL VP G
Sbjct: 189 EADKPRYRTRKNEIATNVLGVCSQDMQFIYVLPGWEGSTHDMRVLKDALTRRNGLKVPHG 248

Query: 210 CYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERAFGI 31
            YYL D GY N  GFL+PY+  RYHL ++  +  QPH  +E FNM+H+ ARNVIER FGI
Sbjct: 249 YYYLVDAGYTNGMGFLSPYRGERYHLSDF-RDGHQPHTPKEFFNMKHSSARNVIERCFGI 307

Query: 30  MKMRWGILRS 1
           +K RW +LRS
Sbjct: 308 LKKRWVVLRS 317


>ref|XP_004231640.1| PREDICTED: uncharacterized protein LOC101266775 [Solanum
           lycopersicum]
          Length = 315

 Score =  253 bits (646), Expect = 6e-65
 Identities = 135/307 (43%), Positives = 181/307 (58%), Gaps = 38/307 (12%)
 Frame = -3

Query: 825 ISSYLLSVFTIVHMFVETSC----------KKRKLDSLVQPYSMLDRIPNQVQHLHRMVE 676
           + S+++    +VH ++   C          +K+  +     Y M  RI   + HL+ ++ 
Sbjct: 9   VESFMILEEIVVHSYIVLICFFMAIYSMLLRKQSTNRRGIRYCMSARIQKILSHLNVLIR 68

Query: 675 VSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGF 496
            +DI CI  LRMD N F  L  L +++GGLTD K +S  E++  FL IL+HH+KNR +  
Sbjct: 69  DNDIVCIDKLRMDRNAFHILASLAKNIGGLTDSKNMSSTEKLAMFLNILAHHEKNRSIKV 128

Query: 495 DFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINV 316
           D+ RSG ++S   +  L+AILKL  +LLVKP PV E  SD RWKWFKGCLGALDGTYI++
Sbjct: 129 DYIRSGWSVSRAFNECLRAILKLTPVLLVKPNPVLEADSDDRWKWFKGCLGALDGTYISI 188

Query: 315 KVSNEDKPR----------------------------WEGSVGDSRVLRDAISRVHGL*V 220
           +V    KPR                            WEGS  D RVLRDA+ R +GL V
Sbjct: 189 RVEAIYKPRYRTRKGDIATNVLGVCDRNLNFIYVLPGWEGSAADGRVLRDAVVRRNGLKV 248

Query: 219 PKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERA 40
           P G YYLCD GY N +GFL+PY+  RY LK+W  ++  P   EE+FNM+H +ARNVIER 
Sbjct: 249 PHGNYYLCDGGYTNGNGFLSPYRGYRYWLKDWQGDNPSPRCREELFNMKHARARNVIERT 308

Query: 39  FGIMKMR 19
           FG++K R
Sbjct: 309 FGLLKGR 315


>ref|XP_006339380.1| PREDICTED: putative nuclease HARBI1-like [Solanum tuberosum]
          Length = 316

 Score =  249 bits (637), Expect = 7e-64
 Identities = 121/242 (50%), Positives = 161/242 (66%), Gaps = 28/242 (11%)
 Frame = -3

Query: 642 MD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISH 463
           M+ N+F  L +L +++GGLTD KY+S  E++  FL IL+HH+KNR +  D+ RSG ++S 
Sbjct: 1   MNRNSFHILVLLTKEVGGLTDSKYMSSSEKLAMFLNILAHHEKNRSIKVDYIRSGWSVSQ 60

Query: 462 YLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINVKVSNEDKPR-- 289
             +  LKAILKL  +LLV P+PV ED  D RW+WFKGCLGALDGTYI +++ ++DKPR  
Sbjct: 61  AFNECLKAILKLAPLLLVNPKPVLEDELDDRWRWFKGCLGALDGTYIQIRIPSKDKPRYR 120

Query: 288 --------------------------WEGSVGDSRVLRDAISRVHGL*VPKGCYYLCDNG 187
                                     WEGS  D RVLR+AI+R +GL +P+G YYLCD G
Sbjct: 121 TRKGEIATNVLGVCDKNLNFTYVLPGWEGSAADGRVLRNAITRTNGLKIPEGNYYLCDGG 180

Query: 186 YANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERAFGIMKMRWGIL 7
           Y N +GFL+PY+  RY L++W   +  P   EE+FNM+H +ARNVIER FG++K RWGIL
Sbjct: 181 YTNGNGFLSPYRGYRYWLRDWQGENPPPQCREELFNMKHARARNVIERTFGLLKGRWGIL 240

Query: 6   RS 1
           RS
Sbjct: 241 RS 242


>ref|XP_007213261.1| hypothetical protein PRUPE_ppb021413mg [Prunus persica]
           gi|462409126|gb|EMJ14460.1| hypothetical protein
           PRUPE_ppb021413mg [Prunus persica]
          Length = 364

 Score =  249 bits (636), Expect = 9e-64
 Identities = 132/252 (52%), Positives = 167/252 (66%), Gaps = 13/252 (5%)
 Frame = -3

Query: 717 RIPNQVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFL 538
           R  N++++L+ ++  +D + +  LRMD  TF  LC LLR  G L +   V+VEEQV  FL
Sbjct: 34  RHENRLRYLNSVLG-NDREYVSELRMDLKTFGLLCDLLRTDGRLKNDDLVTVEEQVCMFL 92

Query: 537 GILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWF 358
            +L+HH KNR +   F RSG TIS Y + +L+ IL+L G LL  PEPV  +C+D RWKWF
Sbjct: 93  HMLAHHVKNRTIRNRFVRSGGTISRYFNSLLQGILRLQGSLLRVPEPVGHNCTDHRWKWF 152

Query: 357 KGCLGALDGTYINVKVSNEDKPRW-------------EGSVGDSRVLRDAISRVHGL*VP 217
           K CLGALDGTYI V+V+  +KPR+             EGS  +SRVLRDAI+R +GL VP
Sbjct: 153 KNCLGALDGTYIKVRVAETEKPRYRTRKGEIATNVLAEGSASESRVLRDAITRPNGLRVP 212

Query: 216 KGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERAF 37
            G YYL D GY N +GFL PY+  RYHL EW   +    N +E FNM+H KARNVIER F
Sbjct: 213 TGYYYLVDGGYTNGEGFLAPYRGTRYHLSEWREGNTLV-NHQEYFNMKHAKARNVIERCF 271

Query: 36  GIMKMRWGILRS 1
           G++K RWGILRS
Sbjct: 272 GLLKARWGILRS 283


>gb|ADN33754.1| retrotransposon protein [Cucumis melo subsp. melo]
          Length = 623

 Score =  249 bits (636), Expect = 9e-64
 Identities = 127/255 (49%), Positives = 163/255 (63%), Gaps = 28/255 (10%)
 Frame = -3

Query: 684 MVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRV 505
           M+  SD+ C  + RMD  TF  LC LLR++ GL+  + V VEE V  FL +L+H  KNRV
Sbjct: 1   MIHESDLVCRQSTRMDRRTFAILCHLLRNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRV 60

Query: 504 VGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTY 325
           +  +F RSG+T+S + ++VL A+L+L+  L+ +P PV  +C+D RWK F+ CLGALDGTY
Sbjct: 61  IQQEFVRSGETVSRHFNIVLLAVLRLYEELIKRPVPVTSNCNDQRWKCFENCLGALDGTY 120

Query: 324 INVKVSNEDKPR----------------------------WEGSVGDSRVLRDAISRVHG 229
           I V V   D+P                             WEGS  DSR+LRDAIS+ +G
Sbjct: 121 IKVNVPAGDRPTFRTRKGEIATNVLGVCDMKGDFVYVLAGWEGSAADSRILRDAISQENG 180

Query: 228 L*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVI 49
           L VPKG YYLCD GY N +GFL PYK  RYHL+EW   +  P NA+E FNM+H+ ARNVI
Sbjct: 181 LQVPKGYYYLCDAGYPNAEGFLAPYKGQRYHLQEWRGAANAPTNAKEYFNMKHSSARNVI 240

Query: 48  ERAFGIMKMRWGILR 4
           ERAFG++K RW ILR
Sbjct: 241 ERAFGVLKGRWTILR 255


>gb|EPS72545.1| hypothetical protein M569_02213, partial [Genlisea aurea]
          Length = 372

 Score =  244 bits (624), Expect = 2e-62
 Identities = 127/272 (46%), Positives = 166/272 (61%), Gaps = 28/272 (10%)
 Frame = -3

Query: 732 YSMLDRIPNQVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQ 553
           YS+++ +P QV+HL+ ++   +  C+ + RM  NTF  LC LL   G L   ++VS  E+
Sbjct: 30  YSLMENVPRQVRHLNMVLSDHNSACLDSFRMSRNTFAHLCELLHGFG-LRSSRHVSATEK 88

Query: 552 VVTFLGILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDP 373
           +  FLGIL+HH K R+   D  RS  T++ + + VL+ IL+LH  L V P PV  + +  
Sbjct: 89  LAMFLGILAHHTKVRIQRRDCNRSTWTVNKHFYTVLRVILRLHNQLQVNPVPVTPENATT 148

Query: 372 RWKWFKGCLGALDGTYINVKVSNEDKPR----------------------------WEGS 277
            +  F+ CLGALDG+Y+ V+V   DK R                            WEGS
Sbjct: 149 SFLPFQNCLGALDGSYVPVRVKEADKARYRNRKGFVATNVLGVCDQHMNFIYVLAGWEGS 208

Query: 276 VGDSRVLRDAISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHN 97
             DSRVLRDA+ R HGL VP G YYLCD+GY + DGFLTPY+ VRYHL+EWGP    P N
Sbjct: 209 AADSRVLRDALRRDHGLRVPPGHYYLCDSGYMDCDGFLTPYRGVRYHLREWGPGMQGPQN 268

Query: 96  AEEMFNMRHTKARNVIERAFGIMKMRWGILRS 1
           A+E FNM+H  ARNVIERA+GI+K RW ILRS
Sbjct: 269 AKEYFNMKHASARNVIERAWGILKSRWAILRS 300


>ref|XP_007024956.1| Uncharacterized protein TCM_029406 [Theobroma cacao]
           gi|508780322|gb|EOY27578.1| Uncharacterized protein
           TCM_029406 [Theobroma cacao]
          Length = 516

 Score =  233 bits (594), Expect = 7e-59
 Identities = 125/287 (43%), Positives = 160/287 (55%), Gaps = 60/287 (20%)
 Frame = -3

Query: 699 QHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHH 520
           +++ R+V  +DI CI  +RM+  TF +LC +L  +GGL   K + V+EQV  FL I++HH
Sbjct: 62  EYVRRLVYDNDISCISQIRMNRVTFFKLCEMLESIGGLKSTKNMLVDEQVAIFLHIIAHH 121

Query: 519 KKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGA 340
            KNRV+  +F +SG++IS + H VL A+LKL   L  KPEP+  + +D RWKWFK CLGA
Sbjct: 122 VKNRVISLNFRKSGESISRHFHNVLAAVLKLQEYLFRKPEPIPTNSTDNRWKWFKNCLGA 181

Query: 339 LDGTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRDAI 244
           LDGTYI VKV + DKPR                            WEGSV D RVLRDA+
Sbjct: 182 LDGTYIRVKVPSADKPRYRTRKGDIATNMLGVCTLDMQFVFVLPGWEGSVADGRVLRDAL 241

Query: 243 SRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEW-------------------- 124
            R +GL VP GCYYL D GY+N +GFL P++  RYHL EW                    
Sbjct: 242 RRRNGLKVPNGCYYLVDAGYSNCEGFLAPFRGQRYHLNEWRQGHEXXXXXXXXXXXXXXX 301

Query: 123 ------------GPNSAQPHNAEEMFNMRHTKARNVIERAFGIMKMR 19
                               + EE FNM+H  ARNVIER FG++KMR
Sbjct: 302 XXXXXXXXXXXXXXXXXXXSSPEEFFNMKHAAARNVIERCFGLLKMR 348


>ref|XP_006348011.1| PREDICTED: uncharacterized protein LOC102587703 [Solanum tuberosum]
          Length = 285

 Score =  232 bits (591), Expect = 2e-58
 Identities = 120/274 (43%), Positives = 166/274 (60%), Gaps = 28/274 (10%)
 Frame = -3

Query: 840 MIQEIISSYLLSVFTIVHMFVETSCKKRKLDSLVQPYSMLDRIPNQVQHLHRMVEVSDID 661
           +++EI+    L +F  + M +     KR     V  Y M  R+P  + HL+ +V+ SD  
Sbjct: 14  ILEEILCQQFLVMFIFLFMVIHGISIKRNRH--VIRYLMSSRVPKIISHLNCIVQDSDTT 71

Query: 660 CIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRS 481
           CI  LRMD N+F  L +L +++GGLTD KY+S  E++  FL IL+HH+KNR +  D+ RS
Sbjct: 72  CIDKLRMDRNSFHTLVLLTKEVGGLTDSKYMSSSEKLAMFLNILAHHEKNRSIKVDYIRS 131

Query: 480 GQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINVKVSNE 301
           G ++S   +  LKAILKL  +LLV P+PV ED  D RW+WFKGCLGALDGTYI ++V ++
Sbjct: 132 GWSVSQAFNECLKAILKLAPLLLVNPKPVLEDELDDRWRWFKGCLGALDGTYIQIRVPSK 191

Query: 300 DKPR----------------------------WEGSVGDSRVLRDAISRVHGL*VPKGCY 205
           DKPR                            WEGS  D+RVLR+AI+R + L +P+G Y
Sbjct: 192 DKPRYRTRKGEIATNVLGVCDKNLNFTYVLPGWEGSAADARVLRNAIARTNDLKIPEGNY 251

Query: 204 YLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQP 103
           YLCD GY N +GFL+PY+  RY L++W   +  P
Sbjct: 252 YLCDGGYTNGNGFLSPYRGYRYWLRDWQDENPPP 285


>ref|XP_007210560.1| hypothetical protein PRUPE_ppa014600mg [Prunus persica]
           gi|462406295|gb|EMJ11759.1| hypothetical protein
           PRUPE_ppa014600mg [Prunus persica]
          Length = 691

 Score =  231 bits (589), Expect = 3e-58
 Identities = 122/263 (46%), Positives = 154/263 (58%), Gaps = 28/263 (10%)
 Frame = -3

Query: 705 QVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILS 526
           Q+++LH +V  SD  CI  LRMD  +F +LC +L   G L   + +S EE V  FL IL+
Sbjct: 17  QLEYLHGLVYESDTTCIDQLRMDRQSFHKLCQILVTKGELRSTRNMSTEEMVAIFLNILA 76

Query: 525 HHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCL 346
           HH KNRV+ F+F RSG+T+S Y H  LKA+++        PEPV E+ +D RWKWFK CL
Sbjct: 77  HHHKNRVIKFNFTRSGRTVSKYFHECLKAMIRCQKDFWKSPEPVPENSTDYRWKWFKNCL 136

Query: 345 GALDGTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRD 250
           GALDGTYI VKV   +KP+                            WEGS  DSRVL+D
Sbjct: 137 GALDGTYIRVKVPEREKPKYRTRKGEIATNVLGVCSQDLQFIYVLAGWEGSAHDSRVLKD 196

Query: 249 AISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRH 70
           A+S           YYL D GY N  GFL P++  RYHL +W  +  +P    E FNM+H
Sbjct: 197 ALS----------YYYLVDAGYTNGTGFLAPFRGQRYHLNDW-RDGHRPETPNEFFNMKH 245

Query: 69  TKARNVIERAFGIMKMRWGILRS 1
           + ARNVIER FG++KMRW ILRS
Sbjct: 246 SSARNVIERCFGLLKMRWAILRS 268


>ref|XP_004250658.1| PREDICTED: uncharacterized protein LOC101260895 [Solanum
           lycopersicum]
          Length = 323

 Score =  231 bits (589), Expect = 3e-58
 Identities = 122/251 (48%), Positives = 154/251 (61%), Gaps = 37/251 (14%)
 Frame = -3

Query: 642 MD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISH 463
           MD N F  L +L +D+GGLTD K +S  E++  FL IL+HH+KNR +  D+ RSG +ISH
Sbjct: 1   MDRNAFHTLVLLTKDIGGLTDSKSMSCCEKLAMFLNILAHHEKNRSIKVDYIRSGWSISH 60

Query: 462 YLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWF---------KGCLGALDGTYINVKV 310
             +  L AILKL  +LLV P+PV ED ++ +WKWF         KGCLGALDGTYI ++V
Sbjct: 61  AFNECLSAILKLTPLLLVNPKPVLEDENEDQWKWFEVDNTIIYLKGCLGALDGTYIPIRV 120

Query: 309 SNEDKPR----------------------------WEGSVGDSRVLRDAISRVHGL*VPK 214
             + KPR                            WEGS  D  VLRDAI R +GL + +
Sbjct: 121 PIQHKPRYRTRKGEITTNVLGVCDRNLNFTYVLPGWEGSAADGHVLRDAIVRRNGLKIHE 180

Query: 213 GCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERAFG 34
           G YYLCD GY N  GFL+PY+  RY LK+W  ++  P   EE+FNMRH +ARNVIER FG
Sbjct: 181 GNYYLCDGGYTNGKGFLSPYQGYRYWLKDWRGDNPSPRCKEEIFNMRHARARNVIEREFG 240

Query: 33  IMKMRWGILRS 1
           + K RWGIL+S
Sbjct: 241 LSKGRWGILKS 251


>gb|ADN34114.1| retrotransposon protein [Cucumis melo subsp. melo]
          Length = 657

 Score =  224 bits (572), Expect = 2e-56
 Identities = 115/227 (50%), Positives = 146/227 (64%), Gaps = 29/227 (12%)
 Frame = -3

Query: 597 LGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGI 418
           + GLT  + V VEE V  FL IL+H  K+RV+  +F RSG+TIS + ++VL A+++LH  
Sbjct: 58  IAGLTSTEVVDVEEMVAMFLHILAHDVKSRVIKREFMRSGETISRHFNMVLLAVIRLHEE 117

Query: 417 LLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINVKVSNEDKPR----------------- 289
           LL KP+PV  +C+D RW+WF+ CLGALDGTYI V V   D+ R                 
Sbjct: 118 LLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRKGEVATNVLGVCD 177

Query: 288 -----------WEGSVGDSRVLRDAISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVR 142
                      WEGS  DSR+LRDA+SR + L VPKG YYL D GY N +GFL PY+  R
Sbjct: 178 TKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDVGYPNAEGFLAPYRGQR 237

Query: 141 YHLKEW-GPNSAQPHNAEEMFNMRHTKARNVIERAFGIMKMRWGILR 4
           YHL+EW GP +A P  ++E FNM+H  ARNVIERAFG++K RW ILR
Sbjct: 238 YHLQEWRGPENA-PSTSKEFFNMKHYSARNVIERAFGVLKGRWAILR 283


>ref|XP_004297872.1| PREDICTED: uncharacterized protein LOC101314079 [Fragaria vesca
           subsp. vesca]
          Length = 572

 Score =  218 bits (556), Expect = 2e-54
 Identities = 112/216 (51%), Positives = 134/216 (62%), Gaps = 28/216 (12%)
 Frame = -3

Query: 564 VEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAED 385
           V+E V  FL IL HHKKNRV+ FDF RSG+ IS   + VL  +L+L   LL  PEPV  +
Sbjct: 3   VDEMVAMFLFILGHHKKNRVIKFDFLRSGEMISRCFNKVLNGVLRLSDNLLKSPEPVLNN 62

Query: 384 CSDPRWKWFKGCLGALDGTYINVKVSNEDKPR---------------------------- 289
            +D RWKWFK CLGALDGTYI V V  +DKPR                            
Sbjct: 63  STDDRWKWFKNCLGALDGTYIRVNVPEKDKPRYHTRKNKIATNVLGVCSQDMKFIYVLPS 122

Query: 288 WEGSVGDSRVLRDAISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSA 109
           WEGS  DSRVLRDA+SR +GL VP+G YYL D GY N +GFL PY+  +YHL EW     
Sbjct: 123 WEGSAADSRVLRDAMSRTNGLRVPQGYYYLVDAGYTNGNGFLAPYRGQQYHLNEW-REGH 181

Query: 108 QPHNAEEMFNMRHTKARNVIERAFGIMKMRWGILRS 1
           +P  + + FNM+H  ARNVIER FG++K+RW ILRS
Sbjct: 182 RPTTSAKFFNMKHFAARNVIERCFGLLKLRWAILRS 217


>ref|XP_007050284.1| Uncharacterized protein TCM_004024 [Theobroma cacao]
           gi|508702545|gb|EOX94441.1| Uncharacterized protein
           TCM_004024 [Theobroma cacao]
          Length = 357

 Score =  218 bits (555), Expect = 2e-54
 Identities = 117/249 (46%), Positives = 155/249 (62%)
 Frame = -3

Query: 747 SLVQPYSMLDRIPNQVQHLHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYV 568
           S V+ Y+ LD + N+ +++ R+V  +DI CI  + M+   F +LC +L+ +GGL   K +
Sbjct: 30  SKVRSYA-LDFVANR-EYVRRLVYDNDISCISQIMMNRAAFFKLCEMLKSIGGLKSTKNM 87

Query: 567 SVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAE 388
            V+E VV FL I++HH KNRV+  +F RSG+TIS + H VL A+LKL   L  KPE +  
Sbjct: 88  LVDEHVVIFLHIIAHHVKNRVISLNFKRSGETISRHFHNVLNAVLKLQKHLFKKPESIPT 147

Query: 387 DCSDPRWKWFKGCLGALDGTYINVKVSNEDKPRWEGSVGDSRVLRDAISRVHGL*VPKGC 208
           + +D RWKWFK CLG LDGT I VKV   DKPR+     D       +  +  L    GC
Sbjct: 148 NSTDNRWKWFKNCLGTLDGTNIRVKVPRADKPRYRTRKRDIATNMLGVCTLDIL-FYIGC 206

Query: 207 YYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKARNVIERAFGIM 28
           YYL D GY N +GFL  ++  RYHL EW      PH+ E+ FNM+H  ARNVIE  F ++
Sbjct: 207 YYLVDAGYINCEGFLAHFRGQRYHLHEWHQGHL-PHSPEKFFNMKHATARNVIESCFRLL 265

Query: 27  KMRWGILRS 1
           KMRWGILRS
Sbjct: 266 KMRWGILRS 274


>gb|EAZ12157.1| hypothetical protein OsJ_02038 [Oryza sativa Japonica Group]
          Length = 598

 Score =  216 bits (550), Expect = 9e-54
 Identities = 108/214 (50%), Positives = 142/214 (66%)
 Frame = -3

Query: 642 MD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKKNRVVGFDFFRSGQTISH 463
           MD  TF  LC +LRD+GG+ D + + +EE V +FL ILSHH KNR +G  F+RSG+T+S 
Sbjct: 1   MDRRTFHILCDMLRDVGGIEDTRNMPLEESVASFLYILSHHLKNRTIGKFFYRSGETVSR 60

Query: 462 YLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALDGTYINVKVSNEDKPRWE 283
           + ++ L A+L+LH +LL KPEP+ ED +D RWK+FK CLGALDGT+I V V    K R+ 
Sbjct: 61  HFNLCLLAVLRLHQLLLKKPEPIPEDTTDDRWKYFKNCLGALDGTHIKVTVPTRIKGRYR 120

Query: 282 GSVGDSRVLRDAISRVHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQP 103
              GD  ++ + +          GCYYL D GY N DGFL PY+  RYHL  +   +  P
Sbjct: 121 SRKGD--IVTNVL----------GCYYLVDAGYTNADGFLAPYRGQRYHLGRFTARN-PP 167

Query: 102 HNAEEMFNMRHTKARNVIERAFGIMKMRWGILRS 1
            +AEE FNMRH  ARN++ER+FG +K RW ILRS
Sbjct: 168 RSAEEYFNMRHASARNIVERSFGRLKGRWAILRS 201


>ref|XP_006587058.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
          Length = 381

 Score =  206 bits (524), Expect = 9e-51
 Identities = 114/259 (44%), Positives = 151/259 (58%), Gaps = 28/259 (10%)
 Frame = -3

Query: 693 LHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKK 514
           L+R+   ++ DCI  LR+    F +LC +L++ G L   K V ++E V  FL IL+H+ K
Sbjct: 30  LNRLYRGTETDCIEQLRVSKKAFFKLCRILQEKGQLVKTKNVPIDEAVAMFLHILAHNLK 89

Query: 513 NRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALD 334
            RVV F + RS +TIS     VL+AI+K+    L   E   E   + +W+WFK  +GALD
Sbjct: 90  YRVVHFSYCRSMETISRQFKNVLRAIMKVSKEYLKFYEYNLEGSVENKWRWFKNSIGALD 149

Query: 333 GTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRDAISR 238
           G +I V VS ED+PR                            WEGS GDSRVLRDA+ R
Sbjct: 150 GIHIPVTVSAEDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDALRR 209

Query: 237 VHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKAR 58
            + L +P G Y+L D GY N  GFL PY+  RYHL EW  N+  P N +E+FN+RH  AR
Sbjct: 210 QNCLHIPNGKYFLVDAGYTNGPGFLAPYRGTRYHLNEWIGNT--PQNYKELFNLRHASAR 267

Query: 57  NVIERAFGIMKMRWGILRS 1
           NVIER+FG++K RW ILR+
Sbjct: 268 NVIERSFGVLKKRWSILRT 286


>ref|XP_006586288.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
          Length = 381

 Score =  206 bits (523), Expect = 1e-50
 Identities = 114/259 (44%), Positives = 151/259 (58%), Gaps = 28/259 (10%)
 Frame = -3

Query: 693 LHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKK 514
           L+ +   ++ DCI  LR+   TF +LC +L++ G L   K V ++E V  FL IL+H+ K
Sbjct: 30  LNSLYRGTETDCIEQLRVSKKTFFKLCRILQEKGQLVKTKNVPIDEAVAMFLHILAHNLK 89

Query: 513 NRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALD 334
            RVV F + RS +TIS     VL+AI+K+    L   E   E   + +W+WFK  +GALD
Sbjct: 90  YRVVHFSYCRSMETISRQFKNVLRAIMKVSKEYLKFYEYNLEGSVENKWRWFKNSIGALD 149

Query: 333 GTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRDAISR 238
           G +I V VS ED+PR                            WEGS GDSRVLRDA+ R
Sbjct: 150 GIHIPVTVSAEDRPRYRNRKGDISTNVLGVCGSDLRFIYVLPGWEGSAGDSRVLRDALRR 209

Query: 237 VHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKAR 58
            + L +P G Y+L D GY N  GFL PY+  RYHL EW  N+  P N +E+FN+RH  AR
Sbjct: 210 QNCLHIPNGKYFLVDAGYTNGPGFLAPYRGTRYHLNEWIGNT--PQNYKELFNLRHASAR 267

Query: 57  NVIERAFGIMKMRWGILRS 1
           NVIER+FG++K RW ILR+
Sbjct: 268 NVIERSFGVLKKRWSILRT 286


>ref|XP_006605554.1| PREDICTED: uncharacterized protein LOC100780651 isoform X2 [Glycine
           max]
          Length = 335

 Score =  205 bits (522), Expect = 2e-50
 Identities = 114/259 (44%), Positives = 151/259 (58%), Gaps = 28/259 (10%)
 Frame = -3

Query: 693 LHRMVEVSDIDCIVNLRMD*NTFQRLCVLLRDLGGLTDGKYVSVEEQVVTFLGILSHHKK 514
           L+R+   ++ DCI  LR+   TF +LC +L++ G L   K V ++E V  FL IL+H+ K
Sbjct: 67  LNRLYRGTETDCIEQLRVSKKTFFKLCRILQEKGQLVKTKNVPIDEVVAMFLHILAHNLK 126

Query: 513 NRVVGFDFFRSGQTISHYLHVVLKAILKLHGILLVKPEPVAEDCSDPRWKWFKGCLGALD 334
            RVV F + RS +TIS     VL+AI+K+    L   E   E   + +W+WFK  +GALD
Sbjct: 127 YRVVHFSYCRSMETISRQFKNVLRAIMKVSKEYLKFHEYNLEGSVENKWRWFKNSIGALD 186

Query: 333 GTYINVKVSNEDKPR----------------------------WEGSVGDSRVLRDAISR 238
           G +I V VS ED+PR                            WEG  GDSRVLRDA+ R
Sbjct: 187 GMHIPVTVSAEDRPRYRNKKGDISTNVLGVCGPDLRFIYVLPGWEGLAGDSRVLRDALRR 246

Query: 237 VHGL*VPKGCYYLCDNGYANTDGFLTPYKSVRYHLKEWGPNSAQPHNAEEMFNMRHTKAR 58
            + L +P G Y+L D GY N  GFL PY+  RYHL EW  N+  P N +E+FN+RH  AR
Sbjct: 247 QNCLHIPNGKYFLVDVGYTNGPGFLAPYRGTRYHLNEWIRNT--PQNYKELFNLRHASAR 304

Query: 57  NVIERAFGIMKMRWGILRS 1
           NVIER+FG++K RW ILR+
Sbjct: 305 NVIERSFGVLKKRWSILRT 323

