BLASTX nr result

ID: Rehmannia25_contig00006770 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00006770
         (1771 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355996.1| PREDICTED: uncharacterized protein LOC102597...   660   0.0  
ref|XP_004248518.1| PREDICTED: uncharacterized protein LOC101253...   649   0.0  
gb|EOY10316.1| Uncharacterized protein isoform 2 [Theobroma cacao]    566   e-158
gb|EOY10315.1| Uncharacterized protein isoform 1 [Theobroma cacao]    565   e-158
ref|XP_004510196.1| PREDICTED: uncharacterized protein LOC101501...   515   e-143
ref|XP_006484726.1| PREDICTED: uncharacterized protein LOC102615...   514   e-143
ref|XP_004307300.1| PREDICTED: uncharacterized protein LOC101293...   511   e-142
gb|ESW06285.1| hypothetical protein PHAVU_010G034800g [Phaseolus...   501   e-139
ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana] ...   483   e-133
gb|ESW06284.1| hypothetical protein PHAVU_010G034800g [Phaseolus...   482   e-133
ref|XP_004133970.1| PREDICTED: uncharacterized protein LOC101207...   481   e-133
ref|XP_006408291.1| hypothetical protein EUTSA_v10022176mg, part...   479   e-132
ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arab...   476   e-131
gb|EPS73084.1| hypothetical protein M569_01668, partial [Genlise...   471   e-130
ref|XP_006298972.1| hypothetical protein CARUB_v10015106mg [Caps...   469   e-129
ref|XP_004157685.1| PREDICTED: uncharacterized protein LOC101226...   466   e-128
gb|EOY10317.1| Uncharacterized protein isoform 3 [Theobroma cacao]    444   e-122
ref|XP_002524005.1| hypothetical protein RCOM_1516730 [Ricinus c...   431   e-118
gb|EOY10318.1| Uncharacterized protein isoform 4, partial [Theob...   380   e-102
ref|XP_003520361.2| PREDICTED: uncharacterized protein LOC100813...   377   e-102

>ref|XP_006355996.1| PREDICTED: uncharacterized protein LOC102597014 isoform X1 [Solanum
            tuberosum] gi|565379136|ref|XP_006355997.1| PREDICTED:
            uncharacterized protein LOC102597014 isoform X2 [Solanum
            tuberosum] gi|565379138|ref|XP_006355998.1| PREDICTED:
            uncharacterized protein LOC102597014 isoform X3 [Solanum
            tuberosum]
          Length = 544

 Score =  660 bits (1702), Expect = 0.0
 Identities = 342/546 (62%), Positives = 410/546 (75%), Gaps = 7/546 (1%)
 Frame = +2

Query: 143  MYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRAR 307
            MYS  SS N Q DV  + Q     NR N   +S+ KN+KG + +  +QD E MELYSRA+
Sbjct: 1    MYSPSSSINGQKDVRVQGQSSDLANRPNFGMSSLPKNLKGNDTINDSQDPEAMELYSRAK 60

Query: 308  AQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRK 487
            AQ++EI YLREQIALAS+RESQ+LNEKY LE+KFSELRMALDEKQ+E I SASNEL RRK
Sbjct: 61   AQQEEILYLREQIALASVRESQLLNEKYGLEKKFSELRMALDEKQNEAIISASNELTRRK 120

Query: 488  GDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLK 667
            GD           K  ED+K+IFTSSMLG+LAEYG  P V +AS+L N++KHLHDQL++K
Sbjct: 121  GDLEENLRLVNELKDTEDDKYIFTSSMLGLLAEYGVFPRVASASSLANNVKHLHDQLEMK 180

Query: 668  IRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKIL 847
            IR SHA++A+LNSM+ N++R G  D E P  SS  +Q PS SMG+  +  +  Y DG+  
Sbjct: 181  IRTSHAKIAQLNSMVTNHARGGSFDMESPHSSSINNQLPSGSMGMNEYPAFKQYIDGQHN 240

Query: 848  DPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG 1027
            +     S  VQ +    A  L  + EM Q  ++   L    NTDR+   P  DN+ +RNG
Sbjct: 241  EAVATGSGDVQASKHLPAERLLFNREMHQQASH---LEISSNTDRDVPGPTKDNLFDRNG 297

Query: 1028 FLSGSEQRSTDQFSLPPM--HDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRG 1201
                 E+ + +    PP   ++  GSF SEG+ PGIE FQIIG+AKPGCKLLGCG+PVRG
Sbjct: 298  VNERFEESNNENRHNPPTVGNEIGGSFSSEGESPGIEVFQIIGEAKPGCKLLGCGFPVRG 357

Query: 1202 TSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFA 1381
            TSLCMFQWVRHYPDGTRQYIEGATNP+YVVTADD+DKLIAVECIPMDDQG QG+LVR+FA
Sbjct: 358  TSLCMFQWVRHYPDGTRQYIEGATNPEYVVTADDIDKLIAVECIPMDDQGHQGELVRLFA 417

Query: 1382 NDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQD 1561
            NDQN ITCD +MQ +IDT+IS+GQA F+VL+L+DSSENWEP T+ +RRS FQVK  R Q 
Sbjct: 418  NDQNNITCDTDMQSEIDTHISEGQATFNVLMLVDSSENWEPVTIFLRRSSFQVKVHRTQA 477

Query: 1562 TVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALD 1741
             VI E +SK+LLIKIPSGLSAQFV+TCSNGSS+PFST+NDIRMRDTLVLTMRIFQSKALD
Sbjct: 478  VVIVEIFSKELLIKIPSGLSAQFVITCSNGSSHPFSTNNDIRMRDTLVLTMRIFQSKALD 537

Query: 1742 EKRKGK 1759
            EKRKGK
Sbjct: 538  EKRKGK 543


>ref|XP_004248518.1| PREDICTED: uncharacterized protein LOC101253835 [Solanum
            lycopersicum]
          Length = 547

 Score =  649 bits (1675), Expect = 0.0
 Identities = 338/546 (61%), Positives = 406/546 (74%), Gaps = 7/546 (1%)
 Frame = +2

Query: 143  MYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRAR 307
            MYS  SS N Q DV  + Q     NR N   +S+ K +KG + +  +QD E MELYSRA+
Sbjct: 1    MYSPISSINGQKDVRVQGQSSDLANRQNFGMSSLPKILKGNDTINDSQDPEVMELYSRAK 60

Query: 308  AQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRK 487
            AQ++EI YLREQIALASIRESQ+LNEKY LE+KFSELRMALDEKQ+E I SASNEL RRK
Sbjct: 61   AQQEEILYLREQIALASIRESQLLNEKYGLEKKFSELRMALDEKQNEAIISASNELTRRK 120

Query: 488  GDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLK 667
            GD           K  ED+K+IF SSM+G+LAEYG  P V +AS LTN++KHLHDQL++K
Sbjct: 121  GDLEENLRLVNELKDTEDDKYIFMSSMIGLLAEYGVFPRVASASNLTNNVKHLHDQLEMK 180

Query: 668  IRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKIL 847
            IR SHA++A+LNSM+ N++R G  D E P  SS  +Q PS SMG+  +  +  Y DG+  
Sbjct: 181  IRTSHAKIAQLNSMVTNHARGGSFDMESPHSSSINNQLPSGSMGMNEYPAFKQYIDGQHN 240

Query: 848  DPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG 1027
            + A   S  VQ +    A SL  + EM Q  N  + L    NT+R+ + P  DN+   NG
Sbjct: 241  EAAATGSGDVQASKHLPAESLLFNREMHQQANIGSHLEISSNTERDVSGPAKDNLFAING 300

Query: 1028 FLSGSEQRSTDQFSLPPM--HDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRG 1201
                 E+ + +    PP   +D  GSF SEG+ PGIE FQIIG+AKPGCKLLGCG+PVRG
Sbjct: 301  VNERFEESNNENRHNPPTVGNDIGGSFSSEGESPGIEVFQIIGEAKPGCKLLGCGFPVRG 360

Query: 1202 TSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFA 1381
            TSLCMFQWVRHYPDGTRQYIEGATNP+YVVTADD+DKLIAVECIPMDDQG QG+LVR+FA
Sbjct: 361  TSLCMFQWVRHYPDGTRQYIEGATNPEYVVTADDIDKLIAVECIPMDDQGHQGELVRLFA 420

Query: 1382 NDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQD 1561
            NDQN ITCD +MQ +IDT+IS+GQA F+VL+L+DSSENWEP T+ + RS FQVK  R Q 
Sbjct: 421  NDQNNITCDPDMQSEIDTHISEGQATFNVLMLVDSSENWEPVTIFLLRSSFQVKVHRTQA 480

Query: 1562 TVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALD 1741
             VI E +SK+L IKIPSGLS QFV+TCS+GSS+PFST+NDIRMRD+LVLTMRIFQSKALD
Sbjct: 481  VVIVENFSKELSIKIPSGLSTQFVITCSDGSSHPFSTNNDIRMRDSLVLTMRIFQSKALD 540

Query: 1742 EKRKGK 1759
            EKRKGK
Sbjct: 541  EKRKGK 546


>gb|EOY10316.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 556

 Score =  566 bits (1459), Expect = e-158
 Identities = 307/551 (55%), Positives = 379/551 (68%), Gaps = 5/551 (0%)
 Frame = +2

Query: 125  TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 289
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 290  LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 469
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 470  ELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 649
            EL RRKGD           KVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 650  DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 829
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 830  NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 1009
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNSQ-EFFFSSDRGGAGRNPDS 310

Query: 1010 MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 1189
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 1190 PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLV 1369
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG QG+LV
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQGELV 426

Query: 1370 RIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDE 1549
            R+FANDQNKI CD +MQ +ID YIS+GQAAFSVL+L+DSSE WEPATL ++RS +Q+K  
Sbjct: 427  RLFANDQNKIKCDPDMQNEIDKYISRGQAAFSVLLLMDSSEKWEPATLTLKRSSYQIKIN 486

Query: 1550 RKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQS 1729
              +   ISEKYSK+L IK+PSGLS QFV+TC +GSS PFST N +RMRDTLVLTMR+FQS
Sbjct: 487  STEAVEISEKYSKELSIKVPSGLSTQFVVTCFDGSSRPFSTYN-VRMRDTLVLTMRLFQS 545

Query: 1730 KALDEKRKGKA 1762
            K LD+KRKG+A
Sbjct: 546  KNLDDKRKGRA 556


>gb|EOY10315.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 541

 Score =  565 bits (1457), Expect = e-158
 Identities = 301/521 (57%), Positives = 369/521 (70%)
 Frame = +2

Query: 200  NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQML 379
            NRH  E       +K R       D E   L+ RA AQ++EIQ+LREQIA+A ++E Q+ 
Sbjct: 29   NRHGSETYLAPSKLKDRS--FDFPDLEAKGLHLRASAQKEEIQHLREQIAVACVKELQLQ 86

Query: 380  NEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFT 559
            NEK  LERKFS+LRMA+DEKQ+E ITSASNEL RRKGD           KVAEDE++IF 
Sbjct: 87   NEKCALERKFSDLRMAIDEKQNEAITSASNELARRKGDLEENLKLAHDLKVAEDERYIFM 146

Query: 560  SSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV 739
            SSMLG+LAEYG LP V NASA+T+S+KHLHDQLQ KIR SH R+ EL  ++G ++     
Sbjct: 147  SSMLGLLAEYGILPPVVNASAITSSVKHLHDQLQWKIRTSHDRIRELTGIVGTHTGGRSH 206

Query: 740  DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQS 919
            +N+ P      +Q P  +    GFS  +HY D + L P  +  RY+ +ND      +   
Sbjct: 207  ENDRPISGILNNQIPHRATASHGFSSNNHYTDEQHLMPPDNMLRYMPDNDHTAKNLMFND 266

Query: 920  AEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGS 1099
               +QL N  +      ++DR  A    D+  +R    +G+E  + + FS    HD + S
Sbjct: 267  PGQQQLSNGNSQ-EFFFSSDRGGAGRNPDSAFDRGAVRTGAEDVTNNVFS---HHDEMDS 322

Query: 1100 FGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNP 1279
            +GSE +GPGIEGFQIIGDA PG KLLGCGYPVRGT+LCMFQWVRH  DGTRQYIEGATNP
Sbjct: 323  YGSE-EGPGIEGFQIIGDATPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRQYIEGATNP 381

Query: 1280 DYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAA 1459
            +YVVTADDVDKLIAVECIPMDDQG QG+LVR+FANDQNKI CD +MQ +ID YIS+GQAA
Sbjct: 382  EYVVTADDVDKLIAVECIPMDDQGHQGELVRLFANDQNKIKCDPDMQNEIDKYISRGQAA 441

Query: 1460 FSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLT 1639
            FSVL+L+DSSE WEPATL ++RS +Q+K    +   ISEKYSK+L IK+PSGLS QFV+T
Sbjct: 442  FSVLLLMDSSEKWEPATLTLKRSSYQIKINSTEAVEISEKYSKELSIKVPSGLSTQFVVT 501

Query: 1640 CSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 1762
            C +GSS PFST N +RMRDTLVLTMR+FQSK LD+KRKG+A
Sbjct: 502  CFDGSSRPFSTYN-VRMRDTLVLTMRLFQSKNLDDKRKGRA 541


>ref|XP_004510196.1| PREDICTED: uncharacterized protein LOC101501329 [Cicer arietinum]
          Length = 538

 Score =  515 bits (1327), Expect = e-143
 Identities = 288/526 (54%), Positives = 358/526 (68%), Gaps = 6/526 (1%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 382
            RHN+E        K  + + H  D ETMELYSRAR QE+EI  LREQIA++ ++E Q+LN
Sbjct: 26   RHNVETQLAQNTFKSSDALNHVNDLETMELYSRARGQEEEILSLREQIAVSCMKELQLLN 85

Query: 383  EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTS 562
            EK  LER  SELRMA+DE+Q+E ITSASN+L RRKG            KVAE+E++ F S
Sbjct: 86   EKCKLERDLSELRMAVDERQNEAITSASNDLARRKGYLEENLKLAHELKVAEEERYAFMS 145

Query: 563  SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG--- 733
            SMLG+LAEYG  P V NAS+++N +KHLHDQLQ +IR SH R+ EL S I N++  G   
Sbjct: 146  SMLGLLAEYGLWPRVMNASSVSNYVKHLHDQLQWRIRNSHDRIGELTSGIENHADTGNNH 205

Query: 734  VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 913
            VV++     S+  +Q  S  M    F Q +   + +   P    + Y+            
Sbjct: 206  VVESPNSAKSTNHAQ--SEFMFQHNFPQQNLIGNEQNHQPMSKMTGYMNPVVSGDVNGTF 263

Query: 914  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG---FLSGSEQRSTDQFSLPPMH 1084
            +    +++   +   R +      +   I   M ER+G   F +G+     + + LP  H
Sbjct: 264  KRVNYQEISKAD---RDISFFRHGSIDQI--GMQERSGERNFANGNG----NLYQLPLDH 314

Query: 1085 DRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIE 1264
            D   S  SE DGPGIE FQI GDA PG KLLGCGYPVR TSLCMFQWVRH  DGTRQYIE
Sbjct: 315  DETASSVSE-DGPGIENFQICGDAIPGEKLLGCGYPVRRTSLCMFQWVRHLQDGTRQYIE 373

Query: 1265 GATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYIS 1444
            GA+NP+YVVTADDVDKLIAVECIPMDD+GRQG+LVR+FANDQNKI CD EMQ +IDTY+S
Sbjct: 374  GASNPEYVVTADDVDKLIAVECIPMDDKGRQGELVRLFANDQNKIKCDPEMQHEIDTYLS 433

Query: 1445 KGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSA 1624
            KG+A FSVL+L+DSSENWE ATL +RRSG+Q+K    +  V++EK+SKDL IK+P GLS 
Sbjct: 434  KGEAMFSVLLLMDSSENWEQATLFLRRSGYQIKINGTEAPVVAEKFSKDLSIKVPCGLST 493

Query: 1625 QFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 1762
            QFVLTC NGSS+P ST + +RMRDTLVLTMR+FQSK LD+KRKG+A
Sbjct: 494  QFVLTCLNGSSHPLSTYS-VRMRDTLVLTMRLFQSKVLDDKRKGRA 538


>ref|XP_006484726.1| PREDICTED: uncharacterized protein LOC102615526 [Citrus sinensis]
          Length = 522

 Score =  514 bits (1323), Expect = e-143
 Identities = 286/546 (52%), Positives = 373/546 (68%), Gaps = 6/546 (1%)
 Frame = +2

Query: 143  MYSGDSSANDQNDVETRRQN------RHNLEATSVTKNVKGRENMIHTQDQETMELYSRA 304
            M SG++S +  N+   + +N      RH +E T +    +  +N I  QD+E MELYSRA
Sbjct: 1    MSSGNNSMHGLNNHRFQAKNSDFVNSRHKIE-THLAPTKQKEDNFISFQDREAMELYSRA 59

Query: 305  RAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRR 484
            R Q++EI  LR+QIA+A ++E Q+ NEKYTLERK SELRMA+DEKQ+E ITSA NEL RR
Sbjct: 60   RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARR 119

Query: 485  KGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQL 664
            KG            KVAEDE++ F SSMLG+LA+YG  PHVTNASA++N++KHL+DQLQ 
Sbjct: 120  KGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQS 179

Query: 665  KIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKI 844
            +IR S+ R+ +L    G ++  G +D  +  L   G    + +   R             
Sbjct: 180  QIRTSYDRIRDLTREGGTDAGAGSIDTVV--LDRHGVPMHTPNAADRP------------ 225

Query: 845  LDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERN 1024
             +P  +  R + ++   + ++L  +++M+QL NN++       ++R     + + +  R 
Sbjct: 226  -EPTDNMPRTIHDDSHSEMKNLLHNSQMQQLFNNDSSQGFSFGSNRENLGNVPNALDLR- 283

Query: 1025 GFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGT 1204
                G E+ +      P  H+ + S  SEG GPGIEGFQIIG+A PG KLLGCGYPVRGT
Sbjct: 284  -VARGPEEMNA---WFPSTHNEIASSISEG-GPGIEGFQIIGEATPGEKLLGCGYPVRGT 338

Query: 1205 SLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFAN 1384
            +LCMFQWVRH  DGTR YIEGATNP+YVVTADDVDKLIAVECIPMDDQGRQG+LVR FAN
Sbjct: 339  TLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFAN 398

Query: 1385 DQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDT 1564
            DQNKI CD  MQ +ID YIS+G A FSVL+L+DSSENWE ATLI+RRS +++K +   + 
Sbjct: 399  DQNKIKCDLGMQSEIDAYISRGHATFSVLMLMDSSENWEQATLILRRSIYRIKID-STEA 457

Query: 1565 VISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDE 1744
            +I E++ K++ IK+P GLS QFVLT S+GSSYPFST N +RMRDTLVLTMR+ Q KALD+
Sbjct: 458  IIEERFPKEVSIKVPCGLSTQFVLTFSDGSSYPFSTYN-VRMRDTLVLTMRMLQGKALDD 516

Query: 1745 KRKGKA 1762
            KRKG+A
Sbjct: 517  KRKGRA 522


>ref|XP_004307300.1| PREDICTED: uncharacterized protein LOC101293522 [Fragaria vesca
            subsp. vesca]
          Length = 493

 Score =  511 bits (1316), Expect = e-142
 Identities = 278/523 (53%), Positives = 350/523 (66%), Gaps = 3/523 (0%)
 Frame = +2

Query: 200  NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQML 379
            NRH+ EA    KN++  ++ +H +DQE MELYSRARAQE+EIQ+LR Q+ +A ++E ++L
Sbjct: 25   NRHSSEAHCSPKNLRD-DSDVHHKDQEAMELYSRARAQEEEIQFLRGQVTVACLKELRLL 83

Query: 380  NEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFT 559
            NEKY LE+KF++LRMA+DEKQ+E  TSA NEL RRKGD           K A+DE+++F 
Sbjct: 84   NEKYALEKKFADLRMAIDEKQNEATTSALNELARRKGDLEENLKLTHDLKAADDERYVFM 143

Query: 560  SSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV 739
            SSMLG+LAEYG  PHV NASA++NS+KHLHD+LQ KIR SH +                 
Sbjct: 144  SSMLGLLAEYGIWPHVVNASAISNSLKHLHDELQWKIRTSHEQ----------------- 186

Query: 740  DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQS 919
                                 +GF +Y+   D + ++P      ++  ND    R+L   
Sbjct: 187  ---------------------QGFDRYT---DAQRMEPTAKVQLHM--NDFTDTRNL--- 217

Query: 920  AEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGF---LSGSEQRSTDQFSLPPMHDR 1090
                 L N E   +   N D NT    MD  +  + F   ++      T+  S P   D 
Sbjct: 218  ----MLINKENPQQFTANIDSNTTHRNMDGFILHDSFDKDVAYGRAEQTNGTSYPQTPDN 273

Query: 1091 VGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGA 1270
              S      GPGIE FQIIGDA PG KLLGCG+PVRGTSLCMFQWVRH  DGTR+ IEGA
Sbjct: 274  TSSISQ---GPGIENFQIIGDAVPGGKLLGCGFPVRGTSLCMFQWVRHLQDGTREVIEGA 330

Query: 1271 TNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKG 1450
            TNP+Y+VTADDVDK IAV+CIPMDDQGRQG+LVR FANDQNKI CD EMQ +IDT+IS+G
Sbjct: 331  TNPEYIVTADDVDKTIAVDCIPMDDQGRQGELVRHFANDQNKIKCDPEMQLEIDTHISRG 390

Query: 1451 QAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQF 1630
            QA F VL+L+DS+ENWEPATL +RRSG+Q+K    +  VI+EK+S DL IK+P G S QF
Sbjct: 391  QATFIVLLLMDSAENWEPATLFLRRSGYQIKINSTEALVIAEKFSNDLSIKVPCGFSTQF 450

Query: 1631 VLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 1759
            VLTCS+GSS+PFST + +RMRDTLVLTMR+ QSKALD++RKG+
Sbjct: 451  VLTCSDGSSHPFSTYS-VRMRDTLVLTMRMLQSKALDDRRKGR 492


>gb|ESW06285.1| hypothetical protein PHAVU_010G034800g [Phaseolus vulgaris]
          Length = 538

 Score =  501 bits (1291), Expect = e-139
 Identities = 276/523 (52%), Positives = 351/523 (67%), Gaps = 3/523 (0%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETM---ELYSRARAQEKEIQYLREQIALASIRESQ 373
            RH  E     +N K  +   H QDQ+     EL SRAR  E+EI  LREQIA A ++E Q
Sbjct: 26   RHKFETQLTQRNFKSNDAHNHIQDQDATQATELNSRARGLEEEILSLREQIAFACMKELQ 85

Query: 374  MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHI 553
            +LNEK  LER+FSELRMA+DEK+SE I+SASN+L  RKG            K  +DE++I
Sbjct: 86   LLNEKCKLERQFSELRMAVDEKESEAISSASNDLAHRKGYLEENLKLAHDLKAVDDERYI 145

Query: 554  FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 733
            F SSMLG+LAEYG  P V NA +++  +KHLHDQLQ +IR+SH R+ EL+S++ + + NG
Sbjct: 146  FMSSMLGLLAEYGLWPRVMNAFSISTCVKHLHDQLQWRIRSSHDRIGELSSVLESRADNG 205

Query: 734  VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 913
                E P   +  S   +  M    FSQ +   + +      + + Y+            
Sbjct: 206  NHVVESPSSENLTSHNHNDFMFQHNFSQQNLIGNEQTHQLTSNIAGYMHPALNPDVNWSI 265

Query: 914  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRV 1093
            ++   +Q+P  +  + + P+   +    + D  +ERN         + + +   P  D  
Sbjct: 266  KAFNYQQIPKPDRDVASFPHGSIDKIG-VQDKNMERNFV-------NANMYQPQPELDET 317

Query: 1094 GSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGAT 1273
             S  SE D PGIE FQI GDA PG KLLGCGYPVRGT LC+FQWVRH  DGTR YIEGAT
Sbjct: 318  ASSVSE-DAPGIENFQISGDAIPGEKLLGCGYPVRGTYLCIFQWVRHLEDGTRHYIEGAT 376

Query: 1274 NPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQ 1453
            NP+YVVTADDVDKLIAVECIPMDD+GRQG+LV++FANDQNKITCD EM+ +IDT +SKG+
Sbjct: 377  NPEYVVTADDVDKLIAVECIPMDDKGRQGELVKLFANDQNKITCDSEMKHEIDTNLSKGE 436

Query: 1454 AAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFV 1633
            A FSVL+L DSSENWE ATL +RR+G+Q++    + TV+SEK+SKDL IK+PSGLS QFV
Sbjct: 437  AIFSVLLLTDSSENWERATLYLRRTGYQIRINGTEATVVSEKFSKDLSIKVPSGLSVQFV 496

Query: 1634 LTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 1762
            LTCS+GSS+P ST + +RMRDTLVLTMR FQSKALDEKRKG+A
Sbjct: 497  LTCSDGSSHPLSTYS-VRMRDTLVLTMRFFQSKALDEKRKGRA 538


>ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332640436|gb|AEE73957.1| uncharacterized protein
            AT3G03560 [Arabidopsis thaliana]
          Length = 521

 Score =  483 bits (1242), Expect = e-133
 Identities = 262/523 (50%), Positives = 355/523 (67%), Gaps = 4/523 (0%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 382
            RH +E  ++        N    QD E M LY++ R+QE+EI  L+E+IA A +++ Q+LN
Sbjct: 12   RHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLN 71

Query: 383  EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTS 562
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD           KV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMT 131

Query: 563  SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV- 739
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ K +A + R+ EL+S++ N      + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQPGTDFIS 191

Query: 740  -DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQ 916
             DN  P  S   + + S+  G        +  + ++L P  + +R    N  +   SL  
Sbjct: 192  KDNHDPRNSKTQASYGSTDRG------NDYQTNEQLLPPMENVTRNPYHNIMQDTESLRF 245

Query: 917  SAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVG 1096
            +    Q+      +   P  + N   P+  + +     +   E+++ +  S+   ++   
Sbjct: 246  N---NQIGGGSQGIFPQPKRE-NFGYPL--SSVAGKEMIQEREEKA-ENSSMFDAYNGNE 298

Query: 1097 SFGSE--GDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGA 1270
             F S    +GPGI+GFQIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGA
Sbjct: 299  EFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGA 358

Query: 1271 TNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKG 1450
            T+P+Y+VTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+G
Sbjct: 359  THPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQTEIDTYISRG 418

Query: 1451 QAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQF 1630
            QA+F+V +L+DSSE+WEPAT++++RS +Q+K    +  VISEKYSK+L I++PSG S QF
Sbjct: 419  QASFNVQLLMDSSESWEPATVVLKRSSYQIKTNTTEAVVISEKYSKELQIRVPSGESTQF 478

Query: 1631 VLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 1759
            VL   +GSS+P ST N +RMRDTLVLTMR+ QSKALDE+RKG+
Sbjct: 479  VLISYDGSSHPISTLN-VRMRDTLVLTMRMLQSKALDERRKGR 520


>gb|ESW06284.1| hypothetical protein PHAVU_010G034800g [Phaseolus vulgaris]
          Length = 529

 Score =  482 bits (1240), Expect = e-133
 Identities = 266/512 (51%), Positives = 340/512 (66%), Gaps = 3/512 (0%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETM---ELYSRARAQEKEIQYLREQIALASIRESQ 373
            RH  E     +N K  +   H QDQ+     EL SRAR  E+EI  LREQIA A ++E Q
Sbjct: 26   RHKFETQLTQRNFKSNDAHNHIQDQDATQATELNSRARGLEEEILSLREQIAFACMKELQ 85

Query: 374  MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHI 553
            +LNEK  LER+FSELRMA+DEK+SE I+SASN+L  RKG            K  +DE++I
Sbjct: 86   LLNEKCKLERQFSELRMAVDEKESEAISSASNDLAHRKGYLEENLKLAHDLKAVDDERYI 145

Query: 554  FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 733
            F SSMLG+LAEYG  P V NA +++  +KHLHDQLQ +IR+SH R+ EL+S++ + + NG
Sbjct: 146  FMSSMLGLLAEYGLWPRVMNAFSISTCVKHLHDQLQWRIRSSHDRIGELSSVLESRADNG 205

Query: 734  VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 913
                E P   +  S   +  M    FSQ +   + +      + + Y+            
Sbjct: 206  NHVVESPSSENLTSHNHNDFMFQHNFSQQNLIGNEQTHQLTSNIAGYMHPALNPDVNWSI 265

Query: 914  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRV 1093
            ++   +Q+P  +  + + P+   +    + D  +ERN         + + +   P  D  
Sbjct: 266  KAFNYQQIPKPDRDVASFPHGSIDKIG-VQDKNMERNFV-------NANMYQPQPELDET 317

Query: 1094 GSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGAT 1273
             S  SE D PGIE FQI GDA PG KLLGCGYPVRGT LC+FQWVRH  DGTR YIEGAT
Sbjct: 318  ASSVSE-DAPGIENFQISGDAIPGEKLLGCGYPVRGTYLCIFQWVRHLEDGTRHYIEGAT 376

Query: 1274 NPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQ 1453
            NP+YVVTADDVDKLIAVECIPMDD+GRQG+LV++FANDQNKITCD EM+ +IDT +SKG+
Sbjct: 377  NPEYVVTADDVDKLIAVECIPMDDKGRQGELVKLFANDQNKITCDSEMKHEIDTNLSKGE 436

Query: 1454 AAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFV 1633
            A FSVL+L DSSENWE ATL +RR+G+Q++    + TV+SEK+SKDL IK+PSGLS QFV
Sbjct: 437  AIFSVLLLTDSSENWERATLYLRRTGYQIRINGTEATVVSEKFSKDLSIKVPSGLSVQFV 496

Query: 1634 LTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQS 1729
            LTCS+GSS+P ST + +RMRDTLVLTMR FQS
Sbjct: 497  LTCSDGSSHPLSTYS-VRMRDTLVLTMRFFQS 527


>ref|XP_004133970.1| PREDICTED: uncharacterized protein LOC101207305 [Cucumis sativus]
          Length = 536

 Score =  481 bits (1239), Expect = e-133
 Identities = 272/550 (49%), Positives = 353/550 (64%), Gaps = 11/550 (2%)
 Frame = +2

Query: 146  YSGDSSANDQNDVETRRQ--NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEK 319
            +S     ND +    R Q   RH  E +  + N++   ++ + QDQE MEL SR +AQE 
Sbjct: 5    HSSLQGLNDDSVQAARSQLKKRHTFERSLGSNNLERAVDVNNHQDQEDMELLSRVKAQEG 64

Query: 320  EIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXX 499
            EIQ LR+QI++A ++E + LNEKY LERKFS++RMA+DEKQ+E ITSA NEL  RKGD  
Sbjct: 65   EIQLLRQQISVACLKELRQLNEKYALERKFSDIRMAVDEKQTEAITSAFNELGYRKGDLE 124

Query: 500  XXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRAS 679
                     K  +DE++ + SS+LG+LAEYG  P V NAS LTN++K LHDQLQ KIR S
Sbjct: 125  VNLKLTNELKAVDDERYHYISSLLGLLAEYGIWPQVINASVLTNNVKLLHDQLQRKIRTS 184

Query: 680  HARLAELNSMIGNNSRNGVV-----DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKI 844
            + ++ E  S   N    G       + +     SR  Q+        G S+Y      + 
Sbjct: 185  YEKIGERTSPAENQFEGGFPYRKRENTDFKFFESR-YQYQKRESADIGNSRYQLPAKAEP 243

Query: 845  LDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTAS----PIMDNM 1012
            L    D      +N       L+   EM Q  N +     L    R        P+ D+ 
Sbjct: 244  LRTTDDMFISRVQNSIPGPVDLSLRPEMYQPVNYDNSPEPLYYAGREVPGAFTPPVDDDA 303

Query: 1013 LERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYP 1192
            +E   + +       ++++ P M +          GP IE FQI+G+A PG +LL CGYP
Sbjct: 304  VELQRYTTD------ERYNNPVMIE----------GPSIENFQIVGEATPGSRLLACGYP 347

Query: 1193 VRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVR 1372
             RGTSLC+FQWV H  DGTRQYIEGATNP+YVV ADDVDKLIAVECIPMDD+G QGDLV+
Sbjct: 348  TRGTSLCIFQWVWHLEDGTRQYIEGATNPEYVVGADDVDKLIAVECIPMDDKGHQGDLVK 407

Query: 1373 IFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDER 1552
            +FANDQNKI CD +MQ +IDTY+SKGQA F+VL+L+DSSENWEPA++ +RRSG+Q+K   
Sbjct: 408  LFANDQNKIRCDPDMQLEIDTYLSKGQATFNVLLLIDSSENWEPASISLRRSGYQIKMGN 467

Query: 1553 KQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSK 1732
             +  VI+EKYS++L +KIPSG+S QFVLTCS+GSS PF+T  D+RMRDTLVLTMR+FQSK
Sbjct: 468  TEAVVIAEKYSRELSLKIPSGISTQFVLTCSDGSSLPFNT-YDVRMRDTLVLTMRMFQSK 526

Query: 1733 ALDEKRKGKA 1762
            A+D++RKGKA
Sbjct: 527  AMDDRRKGKA 536


>ref|XP_006408291.1| hypothetical protein EUTSA_v10022176mg, partial [Eutrema salsugineum]
            gi|557109437|gb|ESQ49744.1| hypothetical protein
            EUTSA_v10022176mg, partial [Eutrema salsugineum]
          Length = 507

 Score =  479 bits (1233), Expect = e-132
 Identities = 262/512 (51%), Positives = 342/512 (66%), Gaps = 2/512 (0%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 382
            RH +E  +         N    QD E M LYSRAR+QE+EI  L+EQIA A +++ Q+LN
Sbjct: 12   RHEIEKETSASRKLEENNAKLIQDPEEMALYSRARSQEEEIHNLQEQIAAACLKDMQLLN 71

Query: 383  EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTS 562
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD           KV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMT 131

Query: 563  SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVVD 742
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ KI+A + R+ EL+S++   S    + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKIKACNDRIRELSSVVETQSGTDFI- 190

Query: 743  NEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSA 922
                   S+ +  P  S G   +    H ND +I +    P   +  N      +LTQ  
Sbjct: 191  -------SKDNHDPRISKGQASYGSTDHGNDYRINEQLSPPMDNITRNP---YHNLTQET 240

Query: 923  EMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSF 1102
            E  +  NN+    +      +   P+  + +     +   E+++       P +     F
Sbjct: 241  ESLRF-NNQIGGGSQQPRRESFGYPL--SSVAGKEMIREREEKAESSSMFDPYNGNE-EF 296

Query: 1103 GSE--GDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATN 1276
             S    +GPGI+GFQIIG+A PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+
Sbjct: 297  ASHVYEEGPGIDGFQIIGEAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATH 356

Query: 1277 PDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQA 1456
            P+YVVTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+GQA
Sbjct: 357  PEYVVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQTEIDTYISRGQA 416

Query: 1457 AFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVL 1636
            +F+V +L+DS+E+WEPAT+I++RS +Q+K    +  VISEKYSK+LLIK+P G S QFVL
Sbjct: 417  SFNVQLLMDSTESWEPATVILKRSSYQIKTNNVEAMVISEKYSKELLIKVPCGFSTQFVL 476

Query: 1637 TCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSK 1732
               +GSS+P ST N +RMRDTLVLTMR+ QSK
Sbjct: 477  ISYDGSSHPISTLN-VRMRDTLVLTMRMLQSK 507


>ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arabidopsis lyrata subsp.
            lyrata] gi|297330235|gb|EFH60654.1| hypothetical protein
            ARALYDRAFT_477601 [Arabidopsis lyrata subsp. lyrata]
          Length = 519

 Score =  476 bits (1224), Expect = e-131
 Identities = 262/521 (50%), Positives = 347/521 (66%), Gaps = 2/521 (0%)
 Frame = +2

Query: 203  RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 382
            RH +E  ++        N    QD E M LY++ R+QE+EI  L+E+IA A +++ Q+LN
Sbjct: 12   RHEIEKDTIASRKLEDSNAKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLN 71

Query: 383  EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTS 562
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD           KV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENSKLAHDLKVTEDERYIFMT 131

Query: 563  SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV- 739
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ K +A + R+ EL+S++ N      + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQPGTDFIS 191

Query: 740  -DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQ 916
             DN  P  S   + + S+  G        +  + ++L P  + +R    N  +    L  
Sbjct: 192  KDNHDPRNSKSQASYGSTDRG------NDYQTNEQLLPPMENVTRNPYHNVMQDTEGLRF 245

Query: 917  SAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVG 1096
            +    Q+      +   P  + N   P+     +        +  S+  F     ++   
Sbjct: 246  N---NQIGGGSQGIFQQPKRE-NFGYPLSSVAGKEMIREREEKAESSSMFDAYNGNEEFA 301

Query: 1097 SFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATN 1276
            S   E +GPGI+GFQIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+
Sbjct: 302  SHVYE-EGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATH 360

Query: 1277 PDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQA 1456
            P+YVVTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+GQA
Sbjct: 361  PEYVVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQAEIDTYISRGQA 420

Query: 1457 AFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVL 1636
            +F+V +L+DSSE+WE AT+I++RS +Q+K    +  VISEKYSK+L IK+P G S QFVL
Sbjct: 421  SFNVQLLMDSSESWETATVILKRSSYQIKTNTTE--VISEKYSKELQIKVPCGFSTQFVL 478

Query: 1637 TCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 1759
               +GSS+P ST N +RMRDTLVLTMR+ QSKALDE+RKG+
Sbjct: 479  ISYDGSSHPISTLN-VRMRDTLVLTMRMLQSKALDERRKGR 518


>gb|EPS73084.1| hypothetical protein M569_01668, partial [Genlisea aurea]
          Length = 401

 Score =  471 bits (1212), Expect = e-130
 Identities = 255/448 (56%), Positives = 309/448 (68%), Gaps = 2/448 (0%)
 Frame = +2

Query: 425  ALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPH 604
            ALDEKQSEVI SASNEL RRKGD              E EKHIFT+S+L ILAE+GALPH
Sbjct: 1    ALDEKQSEVIASASNELARRKGDLEVNLNLLNDLTATEHEKHIFTTSLLEILAEFGALPH 60

Query: 605  VTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFP 784
             TNASALTNSIKHLHDQLQL   +S A+LAELNSMI NN+   ++  E PGL   GS  P
Sbjct: 61   ATNASALTNSIKHLHDQLQLSFSSSRAKLAELNSMIENNA---II--EAPGLGPTGSHPP 115

Query: 785  SSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRT 964
            SSS G++G SQ   Y   + ++P+  P  Y+Q  DP  +R    +  +R++ +       
Sbjct: 116  SSSTGMQGSSQLRSYAANRNMEPSAGPPLYMQVEDP--SRVTLGTIRLREMAS------- 166

Query: 965  LPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFG--SEGDGPGIEGF 1138
                                              SL  + DR+  F   +  + P I  F
Sbjct: 167  ----------------------------------SLDMISDRLIKFHITASDEYPWIYNF 192

Query: 1139 QIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLI 1318
            QI G AKPGC++ GCG P  GT LCMFQWVRH PDGT ++I+GAT P YVVTADDVDKLI
Sbjct: 193  QIDGIAKPGCEITGCGVPKGGTYLCMFQWVRHNPDGTTEFIDGATYPTYVVTADDVDKLI 252

Query: 1319 AVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENW 1498
            AVECIPMD+ GR G+LVR+FAND  KITCD+EMQE+ID+Y+SKG A F VL++LDSSENW
Sbjct: 253  AVECIPMDEHGRHGNLVRMFANDNKKITCDDEMQEEIDSYVSKGSATFPVLVILDSSENW 312

Query: 1499 EPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSN 1678
            EPA++++RRSG+QVK E+KQ+ +ISEKYSK+L IKIPSGLSAQFVLTCS+GS YPFS ++
Sbjct: 313  EPASIVLRRSGYQVKVEKKQEPLISEKYSKELSIKIPSGLSAQFVLTCSDGSLYPFSMND 372

Query: 1679 DIRMRDTLVLTMRIFQSKALDEKRKGKA 1762
            D+RMRDTLVLTMRIFQ KA++EKRKG A
Sbjct: 373  DVRMRDTLVLTMRIFQMKAVNEKRKGMA 400


>ref|XP_006298972.1| hypothetical protein CARUB_v10015106mg [Capsella rubella]
            gi|482567681|gb|EOA31870.1| hypothetical protein
            CARUB_v10015106mg [Capsella rubella]
          Length = 522

 Score =  469 bits (1207), Expect = e-129
 Identities = 259/507 (51%), Positives = 339/507 (66%), Gaps = 10/507 (1%)
 Frame = +2

Query: 269  QDQETMELYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSE 448
            QD E M LY++ R+QE+EI  L+EQIA A +++ Q+LNEK  LERK ++LR+A+DEKQ+E
Sbjct: 35   QDPEEMALYAKVRSQEEEIHSLQEQIAAACLKDMQLLNEKCGLERKCADLRVAIDEKQNE 94

Query: 449  VITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALT 628
             +T+A NEL RRKGD           KV EDE++IF +S+LG+LAEYG  P V NA+A++
Sbjct: 95   SVTAALNELARRKGDLEENLKLAHDLKVTEDERYIFMTSLLGLLAEYGVWPRVANATAIS 154

Query: 629  NSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV--DNEIPGLSSRGSQFPSSSMGV 802
            + IKHLHDQLQ K +A   R+ EL+S++ N      +  DN  P  S   + + S+  G 
Sbjct: 155  SGIKHLHDQLQWKTKACTDRIRELSSIVENQPGTEFINKDNHDPRNSKSQASYGSTDRG- 213

Query: 803  RGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEM-RQLPNNETLLRTLPNTD 979
                     ND +  +    P   V  N        T+      Q+      +   P  +
Sbjct: 214  ---------NDYRTNEQLLPPMENVMRNPYHNVMQDTEGLRFNNQIGGGSQGIFQQPKRE 264

Query: 980  RNTASPIMDNMLERNGFLSGSE-----QRSTDQFSLPPMHDRVGSFGSE--GDGPGIEGF 1138
             N   P+          ++G E     +   +  S+   ++    F S    +GPGI+GF
Sbjct: 265  -NFGYPLSS--------VAGKEMIREREEKAENSSMFDAYNGNEEFASHVYEEGPGIDGF 315

Query: 1139 QIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLI 1318
            QIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+P+YVVTADDVDKLI
Sbjct: 316  QIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATHPEYVVTADDVDKLI 375

Query: 1319 AVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENW 1498
            AVECIPMDDQGRQG+LVR+FANDQNKI+CD EMQ +IDTYIS+GQA+F+V +L+DSSE+W
Sbjct: 376  AVECIPMDDQGRQGELVRLFANDQNKISCDTEMQTEIDTYISRGQASFNVQLLMDSSESW 435

Query: 1499 EPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSN 1678
            EPAT+I++R+ +Q+K    +  VISEKYSK+L IK+P G S QFVL   +GSS+P ST N
Sbjct: 436  EPATVILKRTSYQIKTNNVEALVISEKYSKELQIKVPCGDSTQFVLISYDGSSHPISTLN 495

Query: 1679 DIRMRDTLVLTMRIFQSKALDEKRKGK 1759
             IRMRDTLVLTMR+ QSKALD++RKG+
Sbjct: 496  -IRMRDTLVLTMRMLQSKALDDRRKGR 521


>ref|XP_004157685.1| PREDICTED: uncharacterized protein LOC101226515 [Cucumis sativus]
          Length = 484

 Score =  466 bits (1200), Expect = e-128
 Identities = 259/502 (51%), Positives = 331/502 (65%), Gaps = 9/502 (1%)
 Frame = +2

Query: 284  MELYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSA 463
            MEL SR +AQE EIQ LR+QI++A ++E + LNEKY LERKFS++RMA+DEKQ+E ITSA
Sbjct: 1    MELLSRVKAQEGEIQLLRQQISVACLKELRQLNEKYALERKFSDIRMAVDEKQTEAITSA 60

Query: 464  SNELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKH 643
             NEL  RKGD           K  +DE++ + SS+LG+LAEYG  P V NAS LTN++K 
Sbjct: 61   FNELGYRKGDLEVNLKLTNELKAVDDERYHYISSLLGLLAEYGIWPQVINASVLTNNVKL 120

Query: 644  LHDQLQLKIRASHARLAELNSMIGNNSRNGVV-----DNEIPGLSSRGSQFPSSSMGVRG 808
            LHDQLQ KIR S+ ++ E  S   N    G       + +     SR  Q+        G
Sbjct: 121  LHDQLQRKIRTSYEKIGERTSPAENQFEGGFPYRKRENTDFKFFESR-YQYQKRESADIG 179

Query: 809  FSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNT 988
             S+Y      + L    D      +N       L+   EM Q  N +     L    R  
Sbjct: 180  NSRYQLPAKAEPLRTTDDMFISRVQNSIPGPVDLSLRPEMYQPVNYDNSPEPLYYAGREV 239

Query: 989  AS----PIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDA 1156
                  P+ D+ +E   + +       ++++ P M +          GP IE FQI+G+A
Sbjct: 240  PGAFTPPVDDDAVELQRYTTD------ERYNNPVMIE----------GPSIENFQIVGEA 283

Query: 1157 KPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIP 1336
             PG +LL CGYP RGTSLC+FQWV H  DGTRQYIEGATNP+YVV ADDVDKLIAVECIP
Sbjct: 284  TPGSRLLACGYPTRGTSLCIFQWVWHLEDGTRQYIEGATNPEYVVGADDVDKLIAVECIP 343

Query: 1337 MDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLI 1516
            MDD+G QGDLV++FANDQNKI CD +MQ +IDTY+SKGQA F+VL+L+DSSENWEPA++ 
Sbjct: 344  MDDKGHQGDLVKLFANDQNKIRCDPDMQLEIDTYLSKGQATFNVLLLIDSSENWEPASIS 403

Query: 1517 MRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRD 1696
            +RRSG+Q+K    +  VI+EKYS++L +KIPSG+S QFVLTCS+GSS PF+T  D+RMRD
Sbjct: 404  LRRSGYQIKMGNTEAVVIAEKYSRELSLKIPSGISTQFVLTCSDGSSLPFNT-YDVRMRD 462

Query: 1697 TLVLTMRIFQSKALDEKRKGKA 1762
            TLVLTMR+FQSKA+D++RKGKA
Sbjct: 463  TLVLTMRMFQSKAMDDRRKGKA 484


>gb|EOY10317.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 481

 Score =  444 bits (1143), Expect = e-122
 Identities = 245/459 (53%), Positives = 303/459 (66%), Gaps = 5/459 (1%)
 Frame = +2

Query: 125  TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 289
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 290  LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 469
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 470  ELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 649
            EL RRKGD           KVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 650  DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 829
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 830  NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 1009
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNSQ-EFFFSSDRGGAGRNPDS 310

Query: 1010 MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 1189
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 1190 PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLV 1369
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG QG+LV
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQGELV 426

Query: 1370 RIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDS 1486
            R+FANDQNKI CD +MQ +ID YIS+GQAAFSVL+LL S
Sbjct: 427  RLFANDQNKIKCDPDMQNEIDKYISRGQAAFSVLLLLKS 465


>ref|XP_002524005.1| hypothetical protein RCOM_1516730 [Ricinus communis]
            gi|223536732|gb|EEF38373.1| hypothetical protein
            RCOM_1516730 [Ricinus communis]
          Length = 510

 Score =  431 bits (1107), Expect = e-118
 Identities = 243/506 (48%), Positives = 309/506 (61%)
 Frame = +2

Query: 161  SANDQNDVETRRQNRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLRE 340
            S+  +N ++    NR    ++     +KG  N  + +D+E MELYSRAR Q++EIQ LR+
Sbjct: 14   SSTTKNSMQGTNNNRAPTPSSDSLNRLKGDGNFNYFEDREAMELYSRARTQKEEIQILRQ 73

Query: 341  QIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXX 520
            QIA A +RE ++LNEKY LERKFS+LRMA+DEKQ+E ITSA NELV RKG+         
Sbjct: 74   QIAAACMRELRLLNEKYILERKFSDLRMAIDEKQNEAITSALNELVSRKGNLEDNLKLTH 133

Query: 521  XXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAEL 700
              KV +DE++IF SSMLG+LAEYG  PHV NAS                           
Sbjct: 134  ELKVVDDERYIFMSSMLGLLAEYGVWPHVMNAST-------------------------- 167

Query: 701  NSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQ 880
                        + N + GL  +          +    + SH    +I    H  S    
Sbjct: 168  ------------ISNNVKGLYDQ----------LEWKIRTSHDRIREIEVAVHPESESQD 205

Query: 881  ENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTD 1060
            +++P           M Q+P+   +  +  N       P+ + + ++     G  + + D
Sbjct: 206  KDNPGPGFL------MHQVPHQSKIQDSNNNFPEFPFDPVRERLFDKGIGEVGRGEMTMD 259

Query: 1061 QFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYP 1240
                   HD + S  SE +GPGIEGFQIIGDA PG KLLGCGYPVRGTSLCMFQWVRH  
Sbjct: 260  LPHPSSSHDEIASSVSE-EGPGIEGFQIIGDAVPGGKLLGCGYPVRGTSLCMFQWVRHLE 318

Query: 1241 DGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQ 1420
            DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQGRQG+LV+ FANDQNKI CD +MQ
Sbjct: 319  DGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVKRFANDQNKIKCDPDMQ 378

Query: 1421 EDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLI 1600
              ID YISKG+A FS+ +L D+S+ W+ +TLI+RRSG+Q+K       +I+EKYSK+L I
Sbjct: 379  HAIDMYISKGEATFSIQLLTDASDKWKSSTLILRRSGYQIKTISDDIELIAEKYSKNLSI 438

Query: 1601 KIPSGLSAQFVLTCSNGSSYPFSTSN 1678
            KIPSGLS QFVL CS+GSS+P +T N
Sbjct: 439  KIPSGLSTQFVLACSSGSSHPLNTYN 464


>gb|EOY10318.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 445

 Score =  380 bits (975), Expect = e-102
 Identities = 213/416 (51%), Positives = 265/416 (63%), Gaps = 5/416 (1%)
 Frame = +2

Query: 125  TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 289
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 290  LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 469
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 470  ELVRRKGDXXXXXXXXXXXKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 649
            EL RRKGD           KVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 650  DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 829
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 830  NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 1009
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNS-QEFFFSSDRGGAGRNPDS 310

Query: 1010 MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 1189
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 1190 PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQ 1357
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG Q
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQ 422


>ref|XP_003520361.2| PREDICTED: uncharacterized protein LOC100813936 [Glycine max]
          Length = 621

 Score =  377 bits (969), Expect = e-102
 Identities = 206/374 (55%), Positives = 259/374 (69%)
 Frame = +2

Query: 641  HLHDQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQY 820
            HLHDQLQ +IR+SH R+ EL S++ + + NG    E PG  +  S   +  M    F Q 
Sbjct: 258  HLHDQLQWRIRSSHDRMGELTSVLESRADNGNHVVESPGSGNLTSHTHNDFMFQHNFPQQ 317

Query: 821  SHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPI 1000
            +   + +   P  + + Y+            ++   +Q  N +  + + P+   +    +
Sbjct: 318  NLIGNEQSHQPMSNVAGYMHPALHSDVNWGLKTFNYQQTSNADRGISSFPHASIDKIG-V 376

Query: 1001 MDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLG 1180
             D  +ERN F +G+       +  PP  D   S  SE DGPGIE FQ+ GDA PG KLLG
Sbjct: 377  QDKNMERN-FGNGNF------YQHPPDLDETASSVSE-DGPGIENFQVSGDAIPGEKLLG 428

Query: 1181 CGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQG 1360
            CGYPVRGTSLCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDD+GRQG
Sbjct: 429  CGYPVRGTSLCMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGRQG 488

Query: 1361 DLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQV 1540
            +LV++FANDQNKITCD EM+ +I T +SKG+A FSVL+L DSSENWE ATL +RRSG+Q+
Sbjct: 489  ELVKLFANDQNKITCDSEMKHEIGTNLSKGEATFSVLLLRDSSENWEQATLFLRRSGYQI 548

Query: 1541 KDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRI 1720
            K    + TV+ EK+SK+L IK+P GLSAQFVLT SNGSS+P ST + +RMRDTLVLTMR+
Sbjct: 549  KINGTEATVVDEKFSKELSIKVPCGLSAQFVLTSSNGSSHPLSTYS-VRMRDTLVLTMRL 607

Query: 1721 FQSKALDEKRKGKA 1762
            FQSKALD+KRKG+A
Sbjct: 608  FQSKALDDKRKGRA 621



 Score =  194 bits (492), Expect = 1e-46
 Identities = 106/228 (46%), Positives = 145/228 (63%)
 Frame = +2

Query: 194 RQNRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQ 373
           R NR+  E     +N K  +   H Q+Q TMELYSRAR QE+EI  LREQI +A ++E Q
Sbjct: 4   RGNRNKYETQLAQRNFKSNDTQNHIQEQNTMELYSRAREQEEEILSLREQIGIACMKELQ 63

Query: 374 MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXXKVAEDEKHI 553
           +LNEK  LER+FSELRMA+DEKQ+E I+SASN+LV+RKG            K  +DE++I
Sbjct: 64  LLNEKCKLERQFSELRMAVDEKQNEAISSASNDLVQRKGYLEENLKLAHDLKAVDDERYI 123

Query: 554 FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 733
           F SSMLG+LAEYG  P V NAS++++ +KHLHDQLQ +IR+SH R+ EL S++ + + NG
Sbjct: 124 FMSSMLGLLAEYGLWPRVMNASSISSCVKHLHDQLQWRIRSSHDRMGELTSVLESRADNG 183

Query: 734 VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYV 877
               E PG  +  S   +  M    F Q +   + +   P  + + Y+
Sbjct: 184 NHVVESPGSGNLTSHTHNDFMFQHNFPQQNLIGNEQSHQPMSNVAGYM 231

