BLASTX nr result

ID: Rehmannia22_contig00018948 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00018948
         (2039 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355996.1| PREDICTED: uncharacterized protein LOC102597...   660   0.0  
ref|XP_004248518.1| PREDICTED: uncharacterized protein LOC101253...   649   0.0  
gb|EOY10316.1| Uncharacterized protein isoform 2 [Theobroma cacao]    566   e-158
gb|EOY10315.1| Uncharacterized protein isoform 1 [Theobroma cacao]    565   e-158
ref|XP_004510196.1| PREDICTED: uncharacterized protein LOC101501...   515   e-143
ref|XP_006484726.1| PREDICTED: uncharacterized protein LOC102615...   514   e-143
ref|XP_004307300.1| PREDICTED: uncharacterized protein LOC101293...   511   e-142
gb|ESW06285.1| hypothetical protein PHAVU_010G034800g [Phaseolus...   501   e-139
ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana] ...   483   e-133
gb|ESW06284.1| hypothetical protein PHAVU_010G034800g [Phaseolus...   482   e-133
ref|XP_004133970.1| PREDICTED: uncharacterized protein LOC101207...   481   e-133
ref|XP_006408291.1| hypothetical protein EUTSA_v10022176mg, part...   479   e-132
ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arab...   476   e-131
gb|EPS73084.1| hypothetical protein M569_01668, partial [Genlise...   471   e-130
ref|XP_006298972.1| hypothetical protein CARUB_v10015106mg [Caps...   469   e-129
ref|XP_004157685.1| PREDICTED: uncharacterized protein LOC101226...   466   e-128
gb|EOY10317.1| Uncharacterized protein isoform 3 [Theobroma cacao]    444   e-122
ref|XP_002524005.1| hypothetical protein RCOM_1516730 [Ricinus c...   431   e-118
gb|EOY10318.1| Uncharacterized protein isoform 4, partial [Theob...   380   e-102
ref|XP_003520361.2| PREDICTED: uncharacterized protein LOC100813...   377   e-102

>ref|XP_006355996.1| PREDICTED: uncharacterized protein LOC102597014 isoform X1 [Solanum
            tuberosum] gi|565379136|ref|XP_006355997.1| PREDICTED:
            uncharacterized protein LOC102597014 isoform X2 [Solanum
            tuberosum] gi|565379138|ref|XP_006355998.1| PREDICTED:
            uncharacterized protein LOC102597014 isoform X3 [Solanum
            tuberosum]
          Length = 544

 Score =  660 bits (1702), Expect = 0.0
 Identities = 343/546 (62%), Positives = 411/546 (75%), Gaps = 7/546 (1%)
 Frame = -3

Query: 1740 MYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRAR 1576
            MYS  SS N Q DV  + Q     NR N   +S+ KN+KG + +  +QD E MELYSRA+
Sbjct: 1    MYSPSSSINGQKDVRVQGQSSDLANRPNFGMSSLPKNLKGNDTINDSQDPEAMELYSRAK 60

Query: 1575 AQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRK 1396
            AQ++EI YLREQIALAS+RESQ+LNEKY LE+KFSELRMALDEKQ+E I SASNEL RRK
Sbjct: 61   AQQEEILYLREQIALASVRESQLLNEKYGLEKKFSELRMALDEKQNEAIISASNELTRRK 120

Query: 1395 GDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLK 1216
            GD          LK  ED+K+IFTSSMLG+LAEYG  P V +AS+L N++KHLHDQL++K
Sbjct: 121  GDLEENLRLVNELKDTEDDKYIFTSSMLGLLAEYGVFPRVASASSLANNVKHLHDQLEMK 180

Query: 1215 IRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKIL 1036
            IR SHA++A+LNSM+ N++R G  D E P  SS  +Q PS SMG+  +  +  Y DG+  
Sbjct: 181  IRTSHAKIAQLNSMVTNHARGGSFDMESPHSSSINNQLPSGSMGMNEYPAFKQYIDGQHN 240

Query: 1035 DPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG 856
            +     S  VQ +    A  L  + EM Q  ++   L    NTDR+   P  DN+ +RNG
Sbjct: 241  EAVATGSGDVQASKHLPAERLLFNREMHQQASH---LEISSNTDRDVPGPTKDNLFDRNG 297

Query: 855  FLSGSEQRSTDQFSLPPM--HDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRG 682
                 E+ + +    PP   ++  GSF SEG+ PGIE FQIIG+AKPGCKLLGCG+PVRG
Sbjct: 298  VNERFEESNNENRHNPPTVGNEIGGSFSSEGESPGIEVFQIIGEAKPGCKLLGCGFPVRG 357

Query: 681  TSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFA 502
            TSLCMFQWVRHYPDGTRQYIEGATNP+YVVTADD+DKLIAVECIPMDDQG QG+LVR+FA
Sbjct: 358  TSLCMFQWVRHYPDGTRQYIEGATNPEYVVTADDIDKLIAVECIPMDDQGHQGELVRLFA 417

Query: 501  NDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQD 322
            NDQN ITCD +MQ +IDT+IS+GQA F+VL+L+DSSENWEP T+ +RRS FQVK  R Q 
Sbjct: 418  NDQNNITCDTDMQSEIDTHISEGQATFNVLMLVDSSENWEPVTIFLRRSSFQVKVHRTQA 477

Query: 321  TVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALD 142
             VI E +SK+LLIKIPSGLSAQFV+TCSNGSS+PFST+NDIRMRDTLVLTMRIFQSKALD
Sbjct: 478  VVIVEIFSKELLIKIPSGLSAQFVITCSNGSSHPFSTNNDIRMRDTLVLTMRIFQSKALD 537

Query: 141  EKRKGK 124
            EKRKGK
Sbjct: 538  EKRKGK 543


>ref|XP_004248518.1| PREDICTED: uncharacterized protein LOC101253835 [Solanum
            lycopersicum]
          Length = 547

 Score =  649 bits (1675), Expect = 0.0
 Identities = 339/546 (62%), Positives = 407/546 (74%), Gaps = 7/546 (1%)
 Frame = -3

Query: 1740 MYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRAR 1576
            MYS  SS N Q DV  + Q     NR N   +S+ K +KG + +  +QD E MELYSRA+
Sbjct: 1    MYSPISSINGQKDVRVQGQSSDLANRQNFGMSSLPKILKGNDTINDSQDPEVMELYSRAK 60

Query: 1575 AQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRK 1396
            AQ++EI YLREQIALASIRESQ+LNEKY LE+KFSELRMALDEKQ+E I SASNEL RRK
Sbjct: 61   AQQEEILYLREQIALASIRESQLLNEKYGLEKKFSELRMALDEKQNEAIISASNELTRRK 120

Query: 1395 GDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLK 1216
            GD          LK  ED+K+IF SSM+G+LAEYG  P V +AS LTN++KHLHDQL++K
Sbjct: 121  GDLEENLRLVNELKDTEDDKYIFMSSMIGLLAEYGVFPRVASASNLTNNVKHLHDQLEMK 180

Query: 1215 IRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKIL 1036
            IR SHA++A+LNSM+ N++R G  D E P  SS  +Q PS SMG+  +  +  Y DG+  
Sbjct: 181  IRTSHAKIAQLNSMVTNHARGGSFDMESPHSSSINNQLPSGSMGMNEYPAFKQYIDGQHN 240

Query: 1035 DPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG 856
            + A   S  VQ +    A SL  + EM Q  N  + L    NT+R+ + P  DN+   NG
Sbjct: 241  EAAATGSGDVQASKHLPAESLLFNREMHQQANIGSHLEISSNTERDVSGPAKDNLFAING 300

Query: 855  FLSGSEQRSTDQFSLPPM--HDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRG 682
                 E+ + +    PP   +D  GSF SEG+ PGIE FQIIG+AKPGCKLLGCG+PVRG
Sbjct: 301  VNERFEESNNENRHNPPTVGNDIGGSFSSEGESPGIEVFQIIGEAKPGCKLLGCGFPVRG 360

Query: 681  TSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFA 502
            TSLCMFQWVRHYPDGTRQYIEGATNP+YVVTADD+DKLIAVECIPMDDQG QG+LVR+FA
Sbjct: 361  TSLCMFQWVRHYPDGTRQYIEGATNPEYVVTADDIDKLIAVECIPMDDQGHQGELVRLFA 420

Query: 501  NDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQD 322
            NDQN ITCD +MQ +IDT+IS+GQA F+VL+L+DSSENWEP T+ + RS FQVK  R Q 
Sbjct: 421  NDQNNITCDPDMQSEIDTHISEGQATFNVLMLVDSSENWEPVTIFLLRSSFQVKVHRTQA 480

Query: 321  TVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALD 142
             VI E +SK+L IKIPSGLS QFV+TCS+GSS+PFST+NDIRMRD+LVLTMRIFQSKALD
Sbjct: 481  VVIVENFSKELSIKIPSGLSTQFVITCSDGSSHPFSTNNDIRMRDSLVLTMRIFQSKALD 540

Query: 141  EKRKGK 124
            EKRKGK
Sbjct: 541  EKRKGK 546


>gb|EOY10316.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 556

 Score =  566 bits (1459), Expect = e-158
 Identities = 308/551 (55%), Positives = 380/551 (68%), Gaps = 5/551 (0%)
 Frame = -3

Query: 1758 TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 1594
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 1593 LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 1414
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 1413 ELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 1234
            EL RRKGD          LKVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 1233 DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 1054
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 1053 NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 874
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNSQ-EFFFSSDRGGAGRNPDS 310

Query: 873  MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 694
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 693  PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLV 514
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG QG+LV
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQGELV 426

Query: 513  RIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDE 334
            R+FANDQNKI CD +MQ +ID YIS+GQAAFSVL+L+DSSE WEPATL ++RS +Q+K  
Sbjct: 427  RLFANDQNKIKCDPDMQNEIDKYISRGQAAFSVLLLMDSSEKWEPATLTLKRSSYQIKIN 486

Query: 333  RKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQS 154
              +   ISEKYSK+L IK+PSGLS QFV+TC +GSS PFST N +RMRDTLVLTMR+FQS
Sbjct: 487  STEAVEISEKYSKELSIKVPSGLSTQFVVTCFDGSSRPFSTYN-VRMRDTLVLTMRLFQS 545

Query: 153  KALDEKRKGKA 121
            K LD+KRKG+A
Sbjct: 546  KNLDDKRKGRA 556


>gb|EOY10315.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 541

 Score =  565 bits (1457), Expect = e-158
 Identities = 302/521 (57%), Positives = 370/521 (71%)
 Frame = -3

Query: 1683 NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQML 1504
            NRH  E       +K R       D E   L+ RA AQ++EIQ+LREQIA+A ++E Q+ 
Sbjct: 29   NRHGSETYLAPSKLKDRS--FDFPDLEAKGLHLRASAQKEEIQHLREQIAVACVKELQLQ 86

Query: 1503 NEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFT 1324
            NEK  LERKFS+LRMA+DEKQ+E ITSASNEL RRKGD          LKVAEDE++IF 
Sbjct: 87   NEKCALERKFSDLRMAIDEKQNEAITSASNELARRKGDLEENLKLAHDLKVAEDERYIFM 146

Query: 1323 SSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV 1144
            SSMLG+LAEYG LP V NASA+T+S+KHLHDQLQ KIR SH R+ EL  ++G ++     
Sbjct: 147  SSMLGLLAEYGILPPVVNASAITSSVKHLHDQLQWKIRTSHDRIRELTGIVGTHTGGRSH 206

Query: 1143 DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQS 964
            +N+ P      +Q P  +    GFS  +HY D + L P  +  RY+ +ND      +   
Sbjct: 207  ENDRPISGILNNQIPHRATASHGFSSNNHYTDEQHLMPPDNMLRYMPDNDHTAKNLMFND 266

Query: 963  AEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGS 784
               +QL N  +      ++DR  A    D+  +R    +G+E  + + FS    HD + S
Sbjct: 267  PGQQQLSNGNSQ-EFFFSSDRGGAGRNPDSAFDRGAVRTGAEDVTNNVFS---HHDEMDS 322

Query: 783  FGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNP 604
            +GSE +GPGIEGFQIIGDA PG KLLGCGYPVRGT+LCMFQWVRH  DGTRQYIEGATNP
Sbjct: 323  YGSE-EGPGIEGFQIIGDATPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRQYIEGATNP 381

Query: 603  DYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAA 424
            +YVVTADDVDKLIAVECIPMDDQG QG+LVR+FANDQNKI CD +MQ +ID YIS+GQAA
Sbjct: 382  EYVVTADDVDKLIAVECIPMDDQGHQGELVRLFANDQNKIKCDPDMQNEIDKYISRGQAA 441

Query: 423  FSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLT 244
            FSVL+L+DSSE WEPATL ++RS +Q+K    +   ISEKYSK+L IK+PSGLS QFV+T
Sbjct: 442  FSVLLLMDSSEKWEPATLTLKRSSYQIKINSTEAVEISEKYSKELSIKVPSGLSTQFVVT 501

Query: 243  CSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 121
            C +GSS PFST N +RMRDTLVLTMR+FQSK LD+KRKG+A
Sbjct: 502  CFDGSSRPFSTYN-VRMRDTLVLTMRLFQSKNLDDKRKGRA 541


>ref|XP_004510196.1| PREDICTED: uncharacterized protein LOC101501329 [Cicer arietinum]
          Length = 538

 Score =  515 bits (1327), Expect = e-143
 Identities = 289/526 (54%), Positives = 359/526 (68%), Gaps = 6/526 (1%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 1501
            RHN+E        K  + + H  D ETMELYSRAR QE+EI  LREQIA++ ++E Q+LN
Sbjct: 26   RHNVETQLAQNTFKSSDALNHVNDLETMELYSRARGQEEEILSLREQIAVSCMKELQLLN 85

Query: 1500 EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTS 1321
            EK  LER  SELRMA+DE+Q+E ITSASN+L RRKG           LKVAE+E++ F S
Sbjct: 86   EKCKLERDLSELRMAVDERQNEAITSASNDLARRKGYLEENLKLAHELKVAEEERYAFMS 145

Query: 1320 SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG--- 1150
            SMLG+LAEYG  P V NAS+++N +KHLHDQLQ +IR SH R+ EL S I N++  G   
Sbjct: 146  SMLGLLAEYGLWPRVMNASSVSNYVKHLHDQLQWRIRNSHDRIGELTSGIENHADTGNNH 205

Query: 1149 VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 970
            VV++     S+  +Q  S  M    F Q +   + +   P    + Y+            
Sbjct: 206  VVESPNSAKSTNHAQ--SEFMFQHNFPQQNLIGNEQNHQPMSKMTGYMNPVVSGDVNGTF 263

Query: 969  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNG---FLSGSEQRSTDQFSLPPMH 799
            +    +++   +   R +      +   I   M ER+G   F +G+     + + LP  H
Sbjct: 264  KRVNYQEISKAD---RDISFFRHGSIDQI--GMQERSGERNFANGNG----NLYQLPLDH 314

Query: 798  DRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIE 619
            D   S  SE DGPGIE FQI GDA PG KLLGCGYPVR TSLCMFQWVRH  DGTRQYIE
Sbjct: 315  DETASSVSE-DGPGIENFQICGDAIPGEKLLGCGYPVRRTSLCMFQWVRHLQDGTRQYIE 373

Query: 618  GATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYIS 439
            GA+NP+YVVTADDVDKLIAVECIPMDD+GRQG+LVR+FANDQNKI CD EMQ +IDTY+S
Sbjct: 374  GASNPEYVVTADDVDKLIAVECIPMDDKGRQGELVRLFANDQNKIKCDPEMQHEIDTYLS 433

Query: 438  KGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSA 259
            KG+A FSVL+L+DSSENWE ATL +RRSG+Q+K    +  V++EK+SKDL IK+P GLS 
Sbjct: 434  KGEAMFSVLLLMDSSENWEQATLFLRRSGYQIKINGTEAPVVAEKFSKDLSIKVPCGLST 493

Query: 258  QFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 121
            QFVLTC NGSS+P ST + +RMRDTLVLTMR+FQSK LD+KRKG+A
Sbjct: 494  QFVLTCLNGSSHPLSTYS-VRMRDTLVLTMRLFQSKVLDDKRKGRA 538


>ref|XP_006484726.1| PREDICTED: uncharacterized protein LOC102615526 [Citrus sinensis]
          Length = 522

 Score =  514 bits (1323), Expect = e-143
 Identities = 287/546 (52%), Positives = 374/546 (68%), Gaps = 6/546 (1%)
 Frame = -3

Query: 1740 MYSGDSSANDQNDVETRRQN------RHNLEATSVTKNVKGRENMIHTQDQETMELYSRA 1579
            M SG++S +  N+   + +N      RH +E T +    +  +N I  QD+E MELYSRA
Sbjct: 1    MSSGNNSMHGLNNHRFQAKNSDFVNSRHKIE-THLAPTKQKEDNFISFQDREAMELYSRA 59

Query: 1578 RAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRR 1399
            R Q++EI  LR+QIA+A ++E Q+ NEKYTLERK SELRMA+DEKQ+E ITSA NEL RR
Sbjct: 60   RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARR 119

Query: 1398 KGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQL 1219
            KG           LKVAEDE++ F SSMLG+LA+YG  PHVTNASA++N++KHL+DQLQ 
Sbjct: 120  KGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQS 179

Query: 1218 KIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKI 1039
            +IR S+ R+ +L    G ++  G +D  +  L   G    + +   R             
Sbjct: 180  QIRTSYDRIRDLTREGGTDAGAGSIDTVV--LDRHGVPMHTPNAADRP------------ 225

Query: 1038 LDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERN 859
             +P  +  R + ++   + ++L  +++M+QL NN++       ++R     + + +  R 
Sbjct: 226  -EPTDNMPRTIHDDSHSEMKNLLHNSQMQQLFNNDSSQGFSFGSNRENLGNVPNALDLR- 283

Query: 858  GFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGT 679
                G E+ +      P  H+ + S  SEG GPGIEGFQIIG+A PG KLLGCGYPVRGT
Sbjct: 284  -VARGPEEMNA---WFPSTHNEIASSISEG-GPGIEGFQIIGEATPGEKLLGCGYPVRGT 338

Query: 678  SLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFAN 499
            +LCMFQWVRH  DGTR YIEGATNP+YVVTADDVDKLIAVECIPMDDQGRQG+LVR FAN
Sbjct: 339  TLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFAN 398

Query: 498  DQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDT 319
            DQNKI CD  MQ +ID YIS+G A FSVL+L+DSSENWE ATLI+RRS +++K +   + 
Sbjct: 399  DQNKIKCDLGMQSEIDAYISRGHATFSVLMLMDSSENWEQATLILRRSIYRIKID-STEA 457

Query: 318  VISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDE 139
            +I E++ K++ IK+P GLS QFVLT S+GSSYPFST N +RMRDTLVLTMR+ Q KALD+
Sbjct: 458  IIEERFPKEVSIKVPCGLSTQFVLTFSDGSSYPFSTYN-VRMRDTLVLTMRMLQGKALDD 516

Query: 138  KRKGKA 121
            KRKG+A
Sbjct: 517  KRKGRA 522


>ref|XP_004307300.1| PREDICTED: uncharacterized protein LOC101293522 [Fragaria vesca
            subsp. vesca]
          Length = 493

 Score =  511 bits (1316), Expect = e-142
 Identities = 279/523 (53%), Positives = 351/523 (67%), Gaps = 3/523 (0%)
 Frame = -3

Query: 1683 NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQML 1504
            NRH+ EA    KN++  ++ +H +DQE MELYSRARAQE+EIQ+LR Q+ +A ++E ++L
Sbjct: 25   NRHSSEAHCSPKNLRD-DSDVHHKDQEAMELYSRARAQEEEIQFLRGQVTVACLKELRLL 83

Query: 1503 NEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFT 1324
            NEKY LE+KF++LRMA+DEKQ+E  TSA NEL RRKGD          LK A+DE+++F 
Sbjct: 84   NEKYALEKKFADLRMAIDEKQNEATTSALNELARRKGDLEENLKLTHDLKAADDERYVFM 143

Query: 1323 SSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV 1144
            SSMLG+LAEYG  PHV NASA++NS+KHLHD+LQ KIR SH +                 
Sbjct: 144  SSMLGLLAEYGIWPHVVNASAISNSLKHLHDELQWKIRTSHEQ----------------- 186

Query: 1143 DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQS 964
                                 +GF +Y+   D + ++P      ++  ND    R+L   
Sbjct: 187  ---------------------QGFDRYT---DAQRMEPTAKVQLHM--NDFTDTRNL--- 217

Query: 963  AEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGF---LSGSEQRSTDQFSLPPMHDR 793
                 L N E   +   N D NT    MD  +  + F   ++      T+  S P   D 
Sbjct: 218  ----MLINKENPQQFTANIDSNTTHRNMDGFILHDSFDKDVAYGRAEQTNGTSYPQTPDN 273

Query: 792  VGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGA 613
              S      GPGIE FQIIGDA PG KLLGCG+PVRGTSLCMFQWVRH  DGTR+ IEGA
Sbjct: 274  TSSISQ---GPGIENFQIIGDAVPGGKLLGCGFPVRGTSLCMFQWVRHLQDGTREVIEGA 330

Query: 612  TNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKG 433
            TNP+Y+VTADDVDK IAV+CIPMDDQGRQG+LVR FANDQNKI CD EMQ +IDT+IS+G
Sbjct: 331  TNPEYIVTADDVDKTIAVDCIPMDDQGRQGELVRHFANDQNKIKCDPEMQLEIDTHISRG 390

Query: 432  QAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQF 253
            QA F VL+L+DS+ENWEPATL +RRSG+Q+K    +  VI+EK+S DL IK+P G S QF
Sbjct: 391  QATFIVLLLMDSAENWEPATLFLRRSGYQIKINSTEALVIAEKFSNDLSIKVPCGFSTQF 450

Query: 252  VLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 124
            VLTCS+GSS+PFST + +RMRDTLVLTMR+ QSKALD++RKG+
Sbjct: 451  VLTCSDGSSHPFSTYS-VRMRDTLVLTMRMLQSKALDDRRKGR 492


>gb|ESW06285.1| hypothetical protein PHAVU_010G034800g [Phaseolus vulgaris]
          Length = 538

 Score =  501 bits (1291), Expect = e-139
 Identities = 277/523 (52%), Positives = 352/523 (67%), Gaps = 3/523 (0%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETM---ELYSRARAQEKEIQYLREQIALASIRESQ 1510
            RH  E     +N K  +   H QDQ+     EL SRAR  E+EI  LREQIA A ++E Q
Sbjct: 26   RHKFETQLTQRNFKSNDAHNHIQDQDATQATELNSRARGLEEEILSLREQIAFACMKELQ 85

Query: 1509 MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHI 1330
            +LNEK  LER+FSELRMA+DEK+SE I+SASN+L  RKG           LK  +DE++I
Sbjct: 86   LLNEKCKLERQFSELRMAVDEKESEAISSASNDLAHRKGYLEENLKLAHDLKAVDDERYI 145

Query: 1329 FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 1150
            F SSMLG+LAEYG  P V NA +++  +KHLHDQLQ +IR+SH R+ EL+S++ + + NG
Sbjct: 146  FMSSMLGLLAEYGLWPRVMNAFSISTCVKHLHDQLQWRIRSSHDRIGELSSVLESRADNG 205

Query: 1149 VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 970
                E P   +  S   +  M    FSQ +   + +      + + Y+            
Sbjct: 206  NHVVESPSSENLTSHNHNDFMFQHNFSQQNLIGNEQTHQLTSNIAGYMHPALNPDVNWSI 265

Query: 969  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRV 790
            ++   +Q+P  +  + + P+   +    + D  +ERN         + + +   P  D  
Sbjct: 266  KAFNYQQIPKPDRDVASFPHGSIDKIG-VQDKNMERNFV-------NANMYQPQPELDET 317

Query: 789  GSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGAT 610
             S  SE D PGIE FQI GDA PG KLLGCGYPVRGT LC+FQWVRH  DGTR YIEGAT
Sbjct: 318  ASSVSE-DAPGIENFQISGDAIPGEKLLGCGYPVRGTYLCIFQWVRHLEDGTRHYIEGAT 376

Query: 609  NPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQ 430
            NP+YVVTADDVDKLIAVECIPMDD+GRQG+LV++FANDQNKITCD EM+ +IDT +SKG+
Sbjct: 377  NPEYVVTADDVDKLIAVECIPMDDKGRQGELVKLFANDQNKITCDSEMKHEIDTNLSKGE 436

Query: 429  AAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFV 250
            A FSVL+L DSSENWE ATL +RR+G+Q++    + TV+SEK+SKDL IK+PSGLS QFV
Sbjct: 437  AIFSVLLLTDSSENWERATLYLRRTGYQIRINGTEATVVSEKFSKDLSIKVPSGLSVQFV 496

Query: 249  LTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGKA 121
            LTCS+GSS+P ST + +RMRDTLVLTMR FQSKALDEKRKG+A
Sbjct: 497  LTCSDGSSHPLSTYS-VRMRDTLVLTMRFFQSKALDEKRKGRA 538


>ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332640436|gb|AEE73957.1| uncharacterized protein
            AT3G03560 [Arabidopsis thaliana]
          Length = 521

 Score =  483 bits (1242), Expect = e-133
 Identities = 263/523 (50%), Positives = 356/523 (68%), Gaps = 4/523 (0%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 1501
            RH +E  ++        N    QD E M LY++ R+QE+EI  L+E+IA A +++ Q+LN
Sbjct: 12   RHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLN 71

Query: 1500 EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTS 1321
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD          LKV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMT 131

Query: 1320 SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV- 1144
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ K +A + R+ EL+S++ N      + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQPGTDFIS 191

Query: 1143 -DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQ 967
             DN  P  S   + + S+  G        +  + ++L P  + +R    N  +   SL  
Sbjct: 192  KDNHDPRNSKTQASYGSTDRG------NDYQTNEQLLPPMENVTRNPYHNIMQDTESLRF 245

Query: 966  SAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVG 787
            +    Q+      +   P  + N   P+  + +     +   E+++ +  S+   ++   
Sbjct: 246  N---NQIGGGSQGIFPQPKRE-NFGYPL--SSVAGKEMIQEREEKA-ENSSMFDAYNGNE 298

Query: 786  SFGSE--GDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGA 613
             F S    +GPGI+GFQIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGA
Sbjct: 299  EFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGA 358

Query: 612  TNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKG 433
            T+P+Y+VTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+G
Sbjct: 359  THPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQTEIDTYISRG 418

Query: 432  QAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQF 253
            QA+F+V +L+DSSE+WEPAT++++RS +Q+K    +  VISEKYSK+L I++PSG S QF
Sbjct: 419  QASFNVQLLMDSSESWEPATVVLKRSSYQIKTNTTEAVVISEKYSKELQIRVPSGESTQF 478

Query: 252  VLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 124
            VL   +GSS+P ST N +RMRDTLVLTMR+ QSKALDE+RKG+
Sbjct: 479  VLISYDGSSHPISTLN-VRMRDTLVLTMRMLQSKALDERRKGR 520


>gb|ESW06284.1| hypothetical protein PHAVU_010G034800g [Phaseolus vulgaris]
          Length = 529

 Score =  482 bits (1240), Expect = e-133
 Identities = 267/512 (52%), Positives = 341/512 (66%), Gaps = 3/512 (0%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETM---ELYSRARAQEKEIQYLREQIALASIRESQ 1510
            RH  E     +N K  +   H QDQ+     EL SRAR  E+EI  LREQIA A ++E Q
Sbjct: 26   RHKFETQLTQRNFKSNDAHNHIQDQDATQATELNSRARGLEEEILSLREQIAFACMKELQ 85

Query: 1509 MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHI 1330
            +LNEK  LER+FSELRMA+DEK+SE I+SASN+L  RKG           LK  +DE++I
Sbjct: 86   LLNEKCKLERQFSELRMAVDEKESEAISSASNDLAHRKGYLEENLKLAHDLKAVDDERYI 145

Query: 1329 FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 1150
            F SSMLG+LAEYG  P V NA +++  +KHLHDQLQ +IR+SH R+ EL+S++ + + NG
Sbjct: 146  FMSSMLGLLAEYGLWPRVMNAFSISTCVKHLHDQLQWRIRSSHDRIGELSSVLESRADNG 205

Query: 1149 VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLT 970
                E P   +  S   +  M    FSQ +   + +      + + Y+            
Sbjct: 206  NHVVESPSSENLTSHNHNDFMFQHNFSQQNLIGNEQTHQLTSNIAGYMHPALNPDVNWSI 265

Query: 969  QSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRV 790
            ++   +Q+P  +  + + P+   +    + D  +ERN         + + +   P  D  
Sbjct: 266  KAFNYQQIPKPDRDVASFPHGSIDKIG-VQDKNMERNFV-------NANMYQPQPELDET 317

Query: 789  GSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGAT 610
             S  SE D PGIE FQI GDA PG KLLGCGYPVRGT LC+FQWVRH  DGTR YIEGAT
Sbjct: 318  ASSVSE-DAPGIENFQISGDAIPGEKLLGCGYPVRGTYLCIFQWVRHLEDGTRHYIEGAT 376

Query: 609  NPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQ 430
            NP+YVVTADDVDKLIAVECIPMDD+GRQG+LV++FANDQNKITCD EM+ +IDT +SKG+
Sbjct: 377  NPEYVVTADDVDKLIAVECIPMDDKGRQGELVKLFANDQNKITCDSEMKHEIDTNLSKGE 436

Query: 429  AAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFV 250
            A FSVL+L DSSENWE ATL +RR+G+Q++    + TV+SEK+SKDL IK+PSGLS QFV
Sbjct: 437  AIFSVLLLTDSSENWERATLYLRRTGYQIRINGTEATVVSEKFSKDLSIKVPSGLSVQFV 496

Query: 249  LTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQS 154
            LTCS+GSS+P ST + +RMRDTLVLTMR FQS
Sbjct: 497  LTCSDGSSHPLSTYS-VRMRDTLVLTMRFFQS 527


>ref|XP_004133970.1| PREDICTED: uncharacterized protein LOC101207305 [Cucumis sativus]
          Length = 536

 Score =  481 bits (1239), Expect = e-133
 Identities = 273/550 (49%), Positives = 354/550 (64%), Gaps = 11/550 (2%)
 Frame = -3

Query: 1737 YSGDSSANDQNDVETRRQ--NRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEK 1564
            +S     ND +    R Q   RH  E +  + N++   ++ + QDQE MEL SR +AQE 
Sbjct: 5    HSSLQGLNDDSVQAARSQLKKRHTFERSLGSNNLERAVDVNNHQDQEDMELLSRVKAQEG 64

Query: 1563 EIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXX 1384
            EIQ LR+QI++A ++E + LNEKY LERKFS++RMA+DEKQ+E ITSA NEL  RKGD  
Sbjct: 65   EIQLLRQQISVACLKELRQLNEKYALERKFSDIRMAVDEKQTEAITSAFNELGYRKGDLE 124

Query: 1383 XXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRAS 1204
                    LK  +DE++ + SS+LG+LAEYG  P V NAS LTN++K LHDQLQ KIR S
Sbjct: 125  VNLKLTNELKAVDDERYHYISSLLGLLAEYGIWPQVINASVLTNNVKLLHDQLQRKIRTS 184

Query: 1203 HARLAELNSMIGNNSRNGVV-----DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKI 1039
            + ++ E  S   N    G       + +     SR  Q+        G S+Y      + 
Sbjct: 185  YEKIGERTSPAENQFEGGFPYRKRENTDFKFFESR-YQYQKRESADIGNSRYQLPAKAEP 243

Query: 1038 LDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTAS----PIMDNM 871
            L    D      +N       L+   EM Q  N +     L    R        P+ D+ 
Sbjct: 244  LRTTDDMFISRVQNSIPGPVDLSLRPEMYQPVNYDNSPEPLYYAGREVPGAFTPPVDDDA 303

Query: 870  LERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYP 691
            +E   + +       ++++ P M +          GP IE FQI+G+A PG +LL CGYP
Sbjct: 304  VELQRYTTD------ERYNNPVMIE----------GPSIENFQIVGEATPGSRLLACGYP 347

Query: 690  VRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVR 511
             RGTSLC+FQWV H  DGTRQYIEGATNP+YVV ADDVDKLIAVECIPMDD+G QGDLV+
Sbjct: 348  TRGTSLCIFQWVWHLEDGTRQYIEGATNPEYVVGADDVDKLIAVECIPMDDKGHQGDLVK 407

Query: 510  IFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDER 331
            +FANDQNKI CD +MQ +IDTY+SKGQA F+VL+L+DSSENWEPA++ +RRSG+Q+K   
Sbjct: 408  LFANDQNKIRCDPDMQLEIDTYLSKGQATFNVLLLIDSSENWEPASISLRRSGYQIKMGN 467

Query: 330  KQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSK 151
             +  VI+EKYS++L +KIPSG+S QFVLTCS+GSS PF+T  D+RMRDTLVLTMR+FQSK
Sbjct: 468  TEAVVIAEKYSRELSLKIPSGISTQFVLTCSDGSSLPFNT-YDVRMRDTLVLTMRMFQSK 526

Query: 150  ALDEKRKGKA 121
            A+D++RKGKA
Sbjct: 527  AMDDRRKGKA 536


>ref|XP_006408291.1| hypothetical protein EUTSA_v10022176mg, partial [Eutrema salsugineum]
            gi|557109437|gb|ESQ49744.1| hypothetical protein
            EUTSA_v10022176mg, partial [Eutrema salsugineum]
          Length = 507

 Score =  479 bits (1233), Expect = e-132
 Identities = 263/512 (51%), Positives = 343/512 (66%), Gaps = 2/512 (0%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 1501
            RH +E  +         N    QD E M LYSRAR+QE+EI  L+EQIA A +++ Q+LN
Sbjct: 12   RHEIEKETSASRKLEENNAKLIQDPEEMALYSRARSQEEEIHNLQEQIAAACLKDMQLLN 71

Query: 1500 EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTS 1321
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD          LKV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMT 131

Query: 1320 SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVVD 1141
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ KI+A + R+ EL+S++   S    + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKIKACNDRIRELSSVVETQSGTDFI- 190

Query: 1140 NEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSA 961
                   S+ +  P  S G   +    H ND +I +    P   +  N      +LTQ  
Sbjct: 191  -------SKDNHDPRISKGQASYGSTDHGNDYRINEQLSPPMDNITRNP---YHNLTQET 240

Query: 960  EMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSF 781
            E  +  NN+    +      +   P+  + +     +   E+++       P +     F
Sbjct: 241  ESLRF-NNQIGGGSQQPRRESFGYPL--SSVAGKEMIREREEKAESSSMFDPYNGNE-EF 296

Query: 780  GSE--GDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATN 607
             S    +GPGI+GFQIIG+A PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+
Sbjct: 297  ASHVYEEGPGIDGFQIIGEAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATH 356

Query: 606  PDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQA 427
            P+YVVTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+GQA
Sbjct: 357  PEYVVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQTEIDTYISRGQA 416

Query: 426  AFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVL 247
            +F+V +L+DS+E+WEPAT+I++RS +Q+K    +  VISEKYSK+LLIK+P G S QFVL
Sbjct: 417  SFNVQLLMDSTESWEPATVILKRSSYQIKTNNVEAMVISEKYSKELLIKVPCGFSTQFVL 476

Query: 246  TCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSK 151
               +GSS+P ST N +RMRDTLVLTMR+ QSK
Sbjct: 477  ISYDGSSHPISTLN-VRMRDTLVLTMRMLQSK 507


>ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arabidopsis lyrata subsp.
            lyrata] gi|297330235|gb|EFH60654.1| hypothetical protein
            ARALYDRAFT_477601 [Arabidopsis lyrata subsp. lyrata]
          Length = 519

 Score =  476 bits (1224), Expect = e-131
 Identities = 263/521 (50%), Positives = 348/521 (66%), Gaps = 2/521 (0%)
 Frame = -3

Query: 1680 RHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQMLN 1501
            RH +E  ++        N    QD E M LY++ R+QE+EI  L+E+IA A +++ Q+LN
Sbjct: 12   RHEIEKDTIASRKLEDSNAKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLN 71

Query: 1500 EKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTS 1321
            EKY LERK ++LR+A+DEKQ+E +TSA NEL RRKGD          LKV EDE++IF +
Sbjct: 72   EKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENSKLAHDLKVTEDERYIFMT 131

Query: 1320 SMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV- 1144
            S+LG+LAEYG  P V NA+A+++ IKHLHDQLQ K +A + R+ EL+S++ N      + 
Sbjct: 132  SLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQPGTDFIS 191

Query: 1143 -DNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQ 967
             DN  P  S   + + S+  G        +  + ++L P  + +R    N  +    L  
Sbjct: 192  KDNHDPRNSKSQASYGSTDRG------NDYQTNEQLLPPMENVTRNPYHNVMQDTEGLRF 245

Query: 966  SAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVG 787
            +    Q+      +   P  + N   P+     +        +  S+  F     ++   
Sbjct: 246  N---NQIGGGSQGIFQQPKRE-NFGYPLSSVAGKEMIREREEKAESSSMFDAYNGNEEFA 301

Query: 786  SFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATN 607
            S   E +GPGI+GFQIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+
Sbjct: 302  SHVYE-EGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATH 360

Query: 606  PDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQA 427
            P+YVVTADDVDKLIAVECIPMDDQGRQG+LVR+FANDQNKI CD EMQ +IDTYIS+GQA
Sbjct: 361  PEYVVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDTEMQAEIDTYISRGQA 420

Query: 426  AFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVL 247
            +F+V +L+DSSE+WE AT+I++RS +Q+K    +  VISEKYSK+L IK+P G S QFVL
Sbjct: 421  SFNVQLLMDSSESWETATVILKRSSYQIKTNTTE--VISEKYSKELQIKVPCGFSTQFVL 478

Query: 246  TCSNGSSYPFSTSNDIRMRDTLVLTMRIFQSKALDEKRKGK 124
               +GSS+P ST N +RMRDTLVLTMR+ QSKALDE+RKG+
Sbjct: 479  ISYDGSSHPISTLN-VRMRDTLVLTMRMLQSKALDERRKGR 518


>gb|EPS73084.1| hypothetical protein M569_01668, partial [Genlisea aurea]
          Length = 401

 Score =  471 bits (1212), Expect = e-130
 Identities = 256/448 (57%), Positives = 310/448 (69%), Gaps = 2/448 (0%)
 Frame = -3

Query: 1458 ALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPH 1279
            ALDEKQSEVI SASNEL RRKGD          L   E EKHIFT+S+L ILAE+GALPH
Sbjct: 1    ALDEKQSEVIASASNELARRKGDLEVNLNLLNDLTATEHEKHIFTTSLLEILAEFGALPH 60

Query: 1278 VTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFP 1099
             TNASALTNSIKHLHDQLQL   +S A+LAELNSMI NN+   ++  E PGL   GS  P
Sbjct: 61   ATNASALTNSIKHLHDQLQLSFSSSRAKLAELNSMIENNA---II--EAPGLGPTGSHPP 115

Query: 1098 SSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRT 919
            SSS G++G SQ   Y   + ++P+  P  Y+Q  DP  +R    +  +R++ +       
Sbjct: 116  SSSTGMQGSSQLRSYAANRNMEPSAGPPLYMQVEDP--SRVTLGTIRLREMAS------- 166

Query: 918  LPNTDRNTASPIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFG--SEGDGPGIEGF 745
                                              SL  + DR+  F   +  + P I  F
Sbjct: 167  ----------------------------------SLDMISDRLIKFHITASDEYPWIYNF 192

Query: 744  QIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLI 565
            QI G AKPGC++ GCG P  GT LCMFQWVRH PDGT ++I+GAT P YVVTADDVDKLI
Sbjct: 193  QIDGIAKPGCEITGCGVPKGGTYLCMFQWVRHNPDGTTEFIDGATYPTYVVTADDVDKLI 252

Query: 564  AVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENW 385
            AVECIPMD+ GR G+LVR+FAND  KITCD+EMQE+ID+Y+SKG A F VL++LDSSENW
Sbjct: 253  AVECIPMDEHGRHGNLVRMFANDNKKITCDDEMQEEIDSYVSKGSATFPVLVILDSSENW 312

Query: 384  EPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSN 205
            EPA++++RRSG+QVK E+KQ+ +ISEKYSK+L IKIPSGLSAQFVLTCS+GS YPFS ++
Sbjct: 313  EPASIVLRRSGYQVKVEKKQEPLISEKYSKELSIKIPSGLSAQFVLTCSDGSLYPFSMND 372

Query: 204  DIRMRDTLVLTMRIFQSKALDEKRKGKA 121
            D+RMRDTLVLTMRIFQ KA++EKRKG A
Sbjct: 373  DVRMRDTLVLTMRIFQMKAVNEKRKGMA 400


>ref|XP_006298972.1| hypothetical protein CARUB_v10015106mg [Capsella rubella]
            gi|482567681|gb|EOA31870.1| hypothetical protein
            CARUB_v10015106mg [Capsella rubella]
          Length = 522

 Score =  469 bits (1207), Expect = e-129
 Identities = 260/507 (51%), Positives = 340/507 (67%), Gaps = 10/507 (1%)
 Frame = -3

Query: 1614 QDQETMELYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSE 1435
            QD E M LY++ R+QE+EI  L+EQIA A +++ Q+LNEK  LERK ++LR+A+DEKQ+E
Sbjct: 35   QDPEEMALYAKVRSQEEEIHSLQEQIAAACLKDMQLLNEKCGLERKCADLRVAIDEKQNE 94

Query: 1434 VITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALT 1255
             +T+A NEL RRKGD          LKV EDE++IF +S+LG+LAEYG  P V NA+A++
Sbjct: 95   SVTAALNELARRKGDLEENLKLAHDLKVTEDERYIFMTSLLGLLAEYGVWPRVANATAIS 154

Query: 1254 NSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNGVV--DNEIPGLSSRGSQFPSSSMGV 1081
            + IKHLHDQLQ K +A   R+ EL+S++ N      +  DN  P  S   + + S+  G 
Sbjct: 155  SGIKHLHDQLQWKTKACTDRIRELSSIVENQPGTEFINKDNHDPRNSKSQASYGSTDRG- 213

Query: 1080 RGFSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEM-RQLPNNETLLRTLPNTD 904
                     ND +  +    P   V  N        T+      Q+      +   P  +
Sbjct: 214  ---------NDYRTNEQLLPPMENVMRNPYHNVMQDTEGLRFNNQIGGGSQGIFQQPKRE 264

Query: 903  RNTASPIMDNMLERNGFLSGSE-----QRSTDQFSLPPMHDRVGSFGSE--GDGPGIEGF 745
             N   P+          ++G E     +   +  S+   ++    F S    +GPGI+GF
Sbjct: 265  -NFGYPLSS--------VAGKEMIREREEKAENSSMFDAYNGNEEFASHVYEEGPGIDGF 315

Query: 744  QIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLI 565
            QIIGDA PG K+LGCG+PVRGT+LCMFQWVRH  DGTRQYIEGAT+P+YVVTADDVDKLI
Sbjct: 316  QIIGDAIPGEKVLGCGFPVRGTTLCMFQWVRHLEDGTRQYIEGATHPEYVVTADDVDKLI 375

Query: 564  AVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENW 385
            AVECIPMDDQGRQG+LVR+FANDQNKI+CD EMQ +IDTYIS+GQA+F+V +L+DSSE+W
Sbjct: 376  AVECIPMDDQGRQGELVRLFANDQNKISCDTEMQTEIDTYISRGQASFNVQLLMDSSESW 435

Query: 384  EPATLIMRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSN 205
            EPAT+I++R+ +Q+K    +  VISEKYSK+L IK+P G S QFVL   +GSS+P ST N
Sbjct: 436  EPATVILKRTSYQIKTNNVEALVISEKYSKELQIKVPCGDSTQFVLISYDGSSHPISTLN 495

Query: 204  DIRMRDTLVLTMRIFQSKALDEKRKGK 124
             IRMRDTLVLTMR+ QSKALD++RKG+
Sbjct: 496  -IRMRDTLVLTMRMLQSKALDDRRKGR 521


>ref|XP_004157685.1| PREDICTED: uncharacterized protein LOC101226515 [Cucumis sativus]
          Length = 484

 Score =  466 bits (1200), Expect = e-128
 Identities = 260/502 (51%), Positives = 332/502 (66%), Gaps = 9/502 (1%)
 Frame = -3

Query: 1599 MELYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSA 1420
            MEL SR +AQE EIQ LR+QI++A ++E + LNEKY LERKFS++RMA+DEKQ+E ITSA
Sbjct: 1    MELLSRVKAQEGEIQLLRQQISVACLKELRQLNEKYALERKFSDIRMAVDEKQTEAITSA 60

Query: 1419 SNELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKH 1240
             NEL  RKGD          LK  +DE++ + SS+LG+LAEYG  P V NAS LTN++K 
Sbjct: 61   FNELGYRKGDLEVNLKLTNELKAVDDERYHYISSLLGLLAEYGIWPQVINASVLTNNVKL 120

Query: 1239 LHDQLQLKIRASHARLAELNSMIGNNSRNGVV-----DNEIPGLSSRGSQFPSSSMGVRG 1075
            LHDQLQ KIR S+ ++ E  S   N    G       + +     SR  Q+        G
Sbjct: 121  LHDQLQRKIRTSYEKIGERTSPAENQFEGGFPYRKRENTDFKFFESR-YQYQKRESADIG 179

Query: 1074 FSQYSHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNT 895
             S+Y      + L    D      +N       L+   EM Q  N +     L    R  
Sbjct: 180  NSRYQLPAKAEPLRTTDDMFISRVQNSIPGPVDLSLRPEMYQPVNYDNSPEPLYYAGREV 239

Query: 894  AS----PIMDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDA 727
                  P+ D+ +E   + +       ++++ P M +          GP IE FQI+G+A
Sbjct: 240  PGAFTPPVDDDAVELQRYTTD------ERYNNPVMIE----------GPSIENFQIVGEA 283

Query: 726  KPGCKLLGCGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIP 547
             PG +LL CGYP RGTSLC+FQWV H  DGTRQYIEGATNP+YVV ADDVDKLIAVECIP
Sbjct: 284  TPGSRLLACGYPTRGTSLCIFQWVWHLEDGTRQYIEGATNPEYVVGADDVDKLIAVECIP 343

Query: 546  MDDQGRQGDLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLI 367
            MDD+G QGDLV++FANDQNKI CD +MQ +IDTY+SKGQA F+VL+L+DSSENWEPA++ 
Sbjct: 344  MDDKGHQGDLVKLFANDQNKIRCDPDMQLEIDTYLSKGQATFNVLLLIDSSENWEPASIS 403

Query: 366  MRRSGFQVKDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRD 187
            +RRSG+Q+K    +  VI+EKYS++L +KIPSG+S QFVLTCS+GSS PF+T  D+RMRD
Sbjct: 404  LRRSGYQIKMGNTEAVVIAEKYSRELSLKIPSGISTQFVLTCSDGSSLPFNT-YDVRMRD 462

Query: 186  TLVLTMRIFQSKALDEKRKGKA 121
            TLVLTMR+FQSKA+D++RKGKA
Sbjct: 463  TLVLTMRMFQSKAMDDRRKGKA 484


>gb|EOY10317.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 481

 Score =  444 bits (1143), Expect = e-122
 Identities = 246/459 (53%), Positives = 304/459 (66%), Gaps = 5/459 (1%)
 Frame = -3

Query: 1758 TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 1594
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 1593 LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 1414
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 1413 ELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 1234
            EL RRKGD          LKVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 1233 DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 1054
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 1053 NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 874
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNSQ-EFFFSSDRGGAGRNPDS 310

Query: 873  MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 694
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 693  PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLV 514
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG QG+LV
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQGELV 426

Query: 513  RIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDS 397
            R+FANDQNKI CD +MQ +ID YIS+GQAAFSVL+LL S
Sbjct: 427  RLFANDQNKIKCDPDMQNEIDKYISRGQAAFSVLLLLKS 465


>ref|XP_002524005.1| hypothetical protein RCOM_1516730 [Ricinus communis]
            gi|223536732|gb|EEF38373.1| hypothetical protein
            RCOM_1516730 [Ricinus communis]
          Length = 510

 Score =  431 bits (1107), Expect = e-118
 Identities = 244/506 (48%), Positives = 310/506 (61%)
 Frame = -3

Query: 1722 SANDQNDVETRRQNRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLRE 1543
            S+  +N ++    NR    ++     +KG  N  + +D+E MELYSRAR Q++EIQ LR+
Sbjct: 14   SSTTKNSMQGTNNNRAPTPSSDSLNRLKGDGNFNYFEDREAMELYSRARTQKEEIQILRQ 73

Query: 1542 QIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXX 1363
            QIA A +RE ++LNEKY LERKFS+LRMA+DEKQ+E ITSA NELV RKG+         
Sbjct: 74   QIAAACMRELRLLNEKYILERKFSDLRMAIDEKQNEAITSALNELVSRKGNLEDNLKLTH 133

Query: 1362 XLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAEL 1183
             LKV +DE++IF SSMLG+LAEYG  PHV NAS                           
Sbjct: 134  ELKVVDDERYIFMSSMLGLLAEYGVWPHVMNAST-------------------------- 167

Query: 1182 NSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYVQ 1003
                        + N + GL  +          +    + SH    +I    H  S    
Sbjct: 168  ------------ISNNVKGLYDQ----------LEWKIRTSHDRIREIEVAVHPESESQD 205

Query: 1002 ENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDNMLERNGFLSGSEQRSTD 823
            +++P           M Q+P+   +  +  N       P+ + + ++     G  + + D
Sbjct: 206  KDNPGPGFL------MHQVPHQSKIQDSNNNFPEFPFDPVRERLFDKGIGEVGRGEMTMD 259

Query: 822  QFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGYPVRGTSLCMFQWVRHYP 643
                   HD + S  SE +GPGIEGFQIIGDA PG KLLGCGYPVRGTSLCMFQWVRH  
Sbjct: 260  LPHPSSSHDEIASSVSE-EGPGIEGFQIIGDAVPGGKLLGCGYPVRGTSLCMFQWVRHLE 318

Query: 642  DGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQGDLVRIFANDQNKITCDEEMQ 463
            DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQGRQG+LV+ FANDQNKI CD +MQ
Sbjct: 319  DGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVKRFANDQNKIKCDPDMQ 378

Query: 462  EDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQVKDERKQDTVISEKYSKDLLI 283
              ID YISKG+A FS+ +L D+S+ W+ +TLI+RRSG+Q+K       +I+EKYSK+L I
Sbjct: 379  HAIDMYISKGEATFSIQLLTDASDKWKSSTLILRRSGYQIKTISDDIELIAEKYSKNLSI 438

Query: 282  KIPSGLSAQFVLTCSNGSSYPFSTSN 205
            KIPSGLS QFVL CS+GSS+P +T N
Sbjct: 439  KIPSGLSTQFVLACSSGSSHPLNTYN 464


>gb|EOY10318.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 445

 Score =  380 bits (975), Expect = e-102
 Identities = 214/416 (51%), Positives = 266/416 (63%), Gaps = 5/416 (1%)
 Frame = -3

Query: 1758 TSIQKAMYSGDSSANDQNDVETRRQ-----NRHNLEATSVTKNVKGRENMIHTQDQETME 1594
            T     M S + S +  N+   + Q     NRH  E       +K R       D E   
Sbjct: 14   TWTDNVMSSSEHSVHGVNNNGVQAQSSDFLNRHGSETYLAPSKLKDRS--FDFPDLEAKG 71

Query: 1593 LYSRARAQEKEIQYLREQIALASIRESQMLNEKYTLERKFSELRMALDEKQSEVITSASN 1414
            L+ RA AQ++EIQ+LREQIA+A ++E Q+ NEK  LERKFS+LRMA+DEKQ+E ITSASN
Sbjct: 72   LHLRASAQKEEIQHLREQIAVACVKELQLQNEKCALERKFSDLRMAIDEKQNEAITSASN 131

Query: 1413 ELVRRKGDXXXXXXXXXXLKVAEDEKHIFTSSMLGILAEYGALPHVTNASALTNSIKHLH 1234
            EL RRKGD          LKVAEDE++IF SSMLG+LAEYG LP V NASA+T+S+KHLH
Sbjct: 132  ELARRKGDLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGILPPVVNASAITSSVKHLH 191

Query: 1233 DQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHY 1054
            DQLQ KIR SH R+ EL  ++G ++     +N+ P      +Q P  +    GFS  +HY
Sbjct: 192  DQLQWKIRTSHDRIRELTGIVGTHTGGRSHENDRPISGILNNQIPHRATASHGFSSNNHY 251

Query: 1053 NDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPIMDN 874
             D + L P  +  RY+ +ND      +      +QL N  +      ++DR  A    D+
Sbjct: 252  TDEQHLMPPDNMLRYMPDNDHTAKNLMFNDPGQQQLSNGNS-QEFFFSSDRGGAGRNPDS 310

Query: 873  MLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLGCGY 694
              +R    +G+E  + + FS    HD + S+GSE +GPGIEGFQIIGDA PG KLLGCGY
Sbjct: 311  AFDRGAVRTGAEDVTNNVFS---HHDEMDSYGSE-EGPGIEGFQIIGDATPGEKLLGCGY 366

Query: 693  PVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQ 526
            PVRGT+LCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDDQG Q
Sbjct: 367  PVRGTTLCMFQWVRHLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDQGHQ 422


>ref|XP_003520361.2| PREDICTED: uncharacterized protein LOC100813936 [Glycine max]
          Length = 621

 Score =  377 bits (969), Expect = e-102
 Identities = 206/374 (55%), Positives = 259/374 (69%)
 Frame = -3

Query: 1242 HLHDQLQLKIRASHARLAELNSMIGNNSRNGVVDNEIPGLSSRGSQFPSSSMGVRGFSQY 1063
            HLHDQLQ +IR+SH R+ EL S++ + + NG    E PG  +  S   +  M    F Q 
Sbjct: 258  HLHDQLQWRIRSSHDRMGELTSVLESRADNGNHVVESPGSGNLTSHTHNDFMFQHNFPQQ 317

Query: 1062 SHYNDGKILDPAHDPSRYVQENDPKQARSLTQSAEMRQLPNNETLLRTLPNTDRNTASPI 883
            +   + +   P  + + Y+            ++   +Q  N +  + + P+   +    +
Sbjct: 318  NLIGNEQSHQPMSNVAGYMHPALHSDVNWGLKTFNYQQTSNADRGISSFPHASIDKIG-V 376

Query: 882  MDNMLERNGFLSGSEQRSTDQFSLPPMHDRVGSFGSEGDGPGIEGFQIIGDAKPGCKLLG 703
             D  +ERN F +G+       +  PP  D   S  SE DGPGIE FQ+ GDA PG KLLG
Sbjct: 377  QDKNMERN-FGNGNF------YQHPPDLDETASSVSE-DGPGIENFQVSGDAIPGEKLLG 428

Query: 702  CGYPVRGTSLCMFQWVRHYPDGTRQYIEGATNPDYVVTADDVDKLIAVECIPMDDQGRQG 523
            CGYPVRGTSLCMFQWVRH  DGTRQYIEGATNP+YVVTADDVDKLIAVECIPMDD+GRQG
Sbjct: 429  CGYPVRGTSLCMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGRQG 488

Query: 522  DLVRIFANDQNKITCDEEMQEDIDTYISKGQAAFSVLILLDSSENWEPATLIMRRSGFQV 343
            +LV++FANDQNKITCD EM+ +I T +SKG+A FSVL+L DSSENWE ATL +RRSG+Q+
Sbjct: 489  ELVKLFANDQNKITCDSEMKHEIGTNLSKGEATFSVLLLRDSSENWEQATLFLRRSGYQI 548

Query: 342  KDERKQDTVISEKYSKDLLIKIPSGLSAQFVLTCSNGSSYPFSTSNDIRMRDTLVLTMRI 163
            K    + TV+ EK+SK+L IK+P GLSAQFVLT SNGSS+P ST + +RMRDTLVLTMR+
Sbjct: 549  KINGTEATVVDEKFSKELSIKVPCGLSAQFVLTSSNGSSHPLSTYS-VRMRDTLVLTMRL 607

Query: 162  FQSKALDEKRKGKA 121
            FQSKALD+KRKG+A
Sbjct: 608  FQSKALDDKRKGRA 621



 Score =  194 bits (492), Expect = 2e-46
 Identities = 107/228 (46%), Positives = 146/228 (64%)
 Frame = -3

Query: 1689 RQNRHNLEATSVTKNVKGRENMIHTQDQETMELYSRARAQEKEIQYLREQIALASIRESQ 1510
            R NR+  E     +N K  +   H Q+Q TMELYSRAR QE+EI  LREQI +A ++E Q
Sbjct: 4    RGNRNKYETQLAQRNFKSNDTQNHIQEQNTMELYSRAREQEEEILSLREQIGIACMKELQ 63

Query: 1509 MLNEKYTLERKFSELRMALDEKQSEVITSASNELVRRKGDXXXXXXXXXXLKVAEDEKHI 1330
            +LNEK  LER+FSELRMA+DEKQ+E I+SASN+LV+RKG           LK  +DE++I
Sbjct: 64   LLNEKCKLERQFSELRMAVDEKQNEAISSASNDLVQRKGYLEENLKLAHDLKAVDDERYI 123

Query: 1329 FTSSMLGILAEYGALPHVTNASALTNSIKHLHDQLQLKIRASHARLAELNSMIGNNSRNG 1150
            F SSMLG+LAEYG  P V NAS++++ +KHLHDQLQ +IR+SH R+ EL S++ + + NG
Sbjct: 124  FMSSMLGLLAEYGLWPRVMNASSISSCVKHLHDQLQWRIRSSHDRMGELTSVLESRADNG 183

Query: 1149 VVDNEIPGLSSRGSQFPSSSMGVRGFSQYSHYNDGKILDPAHDPSRYV 1006
                E PG  +  S   +  M    F Q +   + +   P  + + Y+
Sbjct: 184  NHVVESPGSGNLTSHTHNDFMFQHNFPQQNLIGNEQSHQPMSNVAGYM 231


Top