BLASTX nr result

ID: Atropa21_contig00025653 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00025653
         (1082 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346322.1| PREDICTED: QWRF motif-containing protein 2-l...   505   e-140
ref|XP_004230707.1| PREDICTED: uncharacterized protein LOC101268...   505   e-140
ref|XP_002263972.1| PREDICTED: uncharacterized protein LOC100242...   337   5e-90
ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215...   324   3e-86
emb|CAN69354.1| hypothetical protein VITISV_014039 [Vitis vinifera]   322   1e-85
gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]     315   2e-83
ref|XP_004171185.1| PREDICTED: uncharacterized LOC101215899 [Cuc...   311   3e-82
gb|EPS70660.1| hypothetical protein M569_04100, partial [Genlise...   308   2e-81
gb|EMJ00929.1| hypothetical protein PRUPE_ppa002521mg [Prunus pe...   307   5e-81
gb|EOX98447.1| Family of Uncharacterized protein function, putat...   296   1e-77
gb|EOX98446.1| Family of Uncharacterized protein function (DUF56...   296   1e-77
ref|XP_006589620.1| PREDICTED: QWRF motif-containing protein 2-l...   293   8e-77
ref|XP_003536586.1| PREDICTED: QWRF motif-containing protein 2-l...   293   8e-77
ref|XP_002283295.1| PREDICTED: uncharacterized protein LOC100242...   291   3e-76
ref|XP_003555289.1| PREDICTED: QWRF motif-containing protein 2-l...   287   6e-75
ref|XP_004292128.1| PREDICTED: uncharacterized protein LOC101313...   283   6e-74
gb|EOX98448.1| Family of Uncharacterized protein function, putat...   282   2e-73
gb|EOX98449.1| Family of Uncharacterized protein function, putat...   280   5e-73
ref|XP_002527498.1| conserved hypothetical protein [Ricinus comm...   279   1e-72
ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arab...   274   4e-71

>ref|XP_006346322.1| PREDICTED: QWRF motif-containing protein 2-like [Solanum tuberosum]
          Length = 641

 Score =  505 bits (1300), Expect = e-140
 Identities = 283/367 (77%), Positives = 297/367 (80%), Gaps = 7/367 (1%)
 Frame = -1

Query: 1082 LDSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            LDSK++N  EVSAATK           SFQGET+SLPVSKTKAAPPSPNLSSLRKG    
Sbjct: 124  LDSKVNNAAEVSAATKLLVTSTRSLSVSFQGETYSLPVSKTKAAPPSPNLSSLRKGTPER 183

Query: 902  XXXXXXXR---NGADQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNV 732
                   R   +GADQLENSKPVDQHRWPGR+R  QGNLLARSLDCSNG+R+KVIGSGNV
Sbjct: 184  RRTTTPLRGKADGADQLENSKPVDQHRWPGRSR--QGNLLARSLDCSNGDRHKVIGSGNV 241

Query: 731  IRTLQQSMIDERRASFDGRLSLDLGNAEPLKAVEQANNVN----ESSLPSDLTAXXXXXX 564
            IRTLQQSMIDERRASFDGRLSLDLGNAEPLKAVEQA +VN    +S+LPSDLTA      
Sbjct: 242  IRTLQQSMIDERRASFDGRLSLDLGNAEPLKAVEQAQDVNSANNDSTLPSDLTASDTDSV 301

Query: 563  XXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSV 384
                    QECGGS +RIRG VPRGIVVSARFWQETNSRLRRLQDPGS L TSPGSKL  
Sbjct: 302  SSGSTGV-QECGGS-SRIRG-VPRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSKLVA 358

Query: 383  PPKLRKYSSDVPVSSPRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXRVKNVVSTINS 204
            PPKLRKY SD PVSSPRAMSSPIRGG IRSASPSKLI            RV+NVVSTINS
Sbjct: 359  PPKLRKYHSDFPVSSPRAMSSPIRGG-IRSASPSKLIGSSPSRGMPSPSRVRNVVSTINS 417

Query: 203  NFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAE 24
            NF+ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNR+LQWRFVNAT EATLLVQKHSAE
Sbjct: 418  NFIETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRYLQWRFVNATNEATLLVQKHSAE 477

Query: 23   KTLWNAW 3
            KTLWNAW
Sbjct: 478  KTLWNAW 484


>ref|XP_004230707.1| PREDICTED: uncharacterized protein LOC101268323 [Solanum
            lycopersicum]
          Length = 641

 Score =  505 bits (1300), Expect = e-140
 Identities = 283/367 (77%), Positives = 297/367 (80%), Gaps = 7/367 (1%)
 Frame = -1

Query: 1082 LDSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            LDSK+SNV EVSAATK           SFQGET+SLPVSKTKAAPPSPNLSSLRKG    
Sbjct: 124  LDSKVSNVAEVSAATKLLVTSTRSLSVSFQGETYSLPVSKTKAAPPSPNLSSLRKGTPER 183

Query: 902  XXXXXXXR---NGADQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNV 732
                   R   +GADQLENSKPVDQHRWPGR+R  QGN LARSLDCSNG+R+KVIGSGNV
Sbjct: 184  RRTTTPLRGKADGADQLENSKPVDQHRWPGRSR--QGNPLARSLDCSNGDRHKVIGSGNV 241

Query: 731  IRTLQQSMIDERRASFDGRLSLDLGNAEPLKAVEQANNVN----ESSLPSDLTAXXXXXX 564
            IRTLQQSMIDERRASFDGRLSLD GNAEPLKAVEQA +VN    +S+LPSDLTA      
Sbjct: 242  IRTLQQSMIDERRASFDGRLSLDFGNAEPLKAVEQAQDVNSANNDSTLPSDLTASDTDSV 301

Query: 563  XXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSV 384
                    QECGGS +RIRG VPRGIVVSARFWQETNSRLRRLQDPGS L TSPGSK+  
Sbjct: 302  SSGSTGM-QECGGS-SRIRG-VPRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSKMVA 358

Query: 383  PPKLRKYSSDVPVSSPRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXRVKNVVSTINS 204
            PPKLRKY SDVPVSSPRAMSSPIR G IRSASPSKLI            RV+NVVSTINS
Sbjct: 359  PPKLRKYHSDVPVSSPRAMSSPIRAG-IRSASPSKLIGSSPSRGMPSPSRVRNVVSTINS 417

Query: 203  NFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAE 24
            NF+ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNAT+EATLLVQKHSAE
Sbjct: 418  NFIETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATSEATLLVQKHSAE 477

Query: 23   KTLWNAW 3
            KTLWNAW
Sbjct: 478  KTLWNAW 484


>ref|XP_002263972.1| PREDICTED: uncharacterized protein LOC100242868 [Vitis vinifera]
          Length = 639

 Score =  337 bits (864), Expect = 5e-90
 Identities = 206/367 (56%), Positives = 245/367 (66%), Gaps = 13/367 (3%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            + GEVSAA++           SFQGE FSLP+SK KAAP   NL ++RKG          
Sbjct: 120  SAGEVSAASRLLFTSTRSLSVSFQGEAFSLPISKAKAAP---NLGNVRKGTPERRKPTPV 176

Query: 884  XRNGA-DQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNVIRTLQQSM 708
              +GA DQ+ENS+P     WPGR+RSV  N+LARS DCS  +R K IGSG V+ + QQSM
Sbjct: 177  RGSGAVDQVENSRP-----WPGRSRSV--NVLARSFDCSV-DRKKSIGSGIVVGSFQQSM 228

Query: 707  IDE-RRASFDGRLSLDLGNAEPLKAVEQ---ANNVNESSLPSDLTAXXXXXXXXXXXXXV 540
            IDE RRASFDGRLSLDLGNAE LK  +Q    N+ N+SS+P+DLTA             +
Sbjct: 229  IDESRRASFDGRLSLDLGNAELLKVTKQDPDGNSANDSSVPTDLTASDTDSVSSGSTSGL 288

Query: 539  QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPKL---R 369
            QEC G   R  G  PRGIVVSARFWQETNSRLRRLQDPGS L TSPGS+++V  K    +
Sbjct: 289  QECAGVSGRRSG--PRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRMAVAAKFIQSK 346

Query: 368  KYSSDVPVSSPRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXR----VKNVV-STINS 204
            K+ SD P++SPR M SPIRG   R ASPSKL+                 ++N V S ++S
Sbjct: 347  KFPSDNPLASPRTMMSPIRGA-TRPASPSKLMASSMPVSSPIRASSPARLRNAVASPLSS 405

Query: 203  NFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAE 24
            +    PS+LSF+VDVRRGK+GENRIVDAHLLRLLYNRHLQWRFVNA  +A LLVQ+  AE
Sbjct: 406  SSSIAPSILSFSVDVRRGKMGENRIVDAHLLRLLYNRHLQWRFVNARADAALLVQRMRAE 465

Query: 23   KTLWNAW 3
            + LWNAW
Sbjct: 466  RNLWNAW 472


>ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215899 [Cucumis sativus]
          Length = 667

 Score =  324 bits (831), Expect = 3e-86
 Identities = 197/379 (51%), Positives = 252/379 (66%), Gaps = 19/379 (5%)
 Frame = -1

Query: 1082 LDSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            LDS+  N  + SAA K           SFQGE FSLP+SKTKA   +P+LS+ RKG    
Sbjct: 130  LDSRHGNATDSSAAAKLLVTSTRSLSVSFQGEAFSLPISKTKATA-TPSLSNARKGSTPE 188

Query: 902  XXXXXXXRNGAD----QLENSKPVDQHRWPGRAR--SVQGNLLARSLDCSNGERNKV--I 747
                   R+ +D    Q+ENSK +DQHRWP R R  +++GN L+RS DC  GE+ KV  I
Sbjct: 189  RRRATPLRDKSDGSGVQVENSKLLDQHRWPARNRHANLEGNPLSRSFDCG-GEQKKVNGI 247

Query: 746  GSGNVIRTLQQSMIDE-RRASFDGRLSLDLGNAEPLKAVEQ---ANNVNESSLPSDLTAX 579
            GSG V+R LQQ++ D+ RRASFDGRLSLDL ++E +KAV Q   A++VNESS+PSDLT  
Sbjct: 248  GSGMVVRALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVNESSVPSDLTTS 307

Query: 578  XXXXXXXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPG 399
                        VQ+CG S  + R G PRGIVVSARFWQETNSRLRRL DPGS L TSPG
Sbjct: 308  DTDSVSSGSTSGVQDCG-SVAKGRNG-PRGIVVSARFWQETNSRLRRLHDPGSPLSTSPG 365

Query: 398  SKLSVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL----IXXXXXXXXXXX 240
            +++  P K    +++S+D P+SSPR M+SPIRGG  R  SPSKL    +           
Sbjct: 366  ARVGAPSKFSQSKRFSNDGPLSSPRTMASPIRGG-TRPPSPSKLWTSSVSSPSRGISSPS 424

Query: 239  XRVKNVVSTINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATT 60
                 V  ++ SN + TPS+LSF+VD+RRGK+GE+RIVDAH+LRL +NR+LQWRFVNA  
Sbjct: 425  RTRNGVGGSLVSNSISTPSILSFSVDIRRGKMGEDRIVDAHVLRLHHNRYLQWRFVNARA 484

Query: 59   EATLLVQKHSAEKTLWNAW 3
            +AT ++Q+ +AE+ +WNAW
Sbjct: 485  DATFMLQRLNAERNVWNAW 503


>emb|CAN69354.1| hypothetical protein VITISV_014039 [Vitis vinifera]
          Length = 601

 Score =  322 bits (826), Expect = 1e-85
 Identities = 201/360 (55%), Positives = 239/360 (66%), Gaps = 13/360 (3%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            + GEVSAA++           SFQGE FSLP+SK KAAP   NL ++RKG          
Sbjct: 120  SAGEVSAASRLLFTSTRSLSVSFQGEAFSLPISKAKAAP---NLGNVRKGTPERRKPTPV 176

Query: 884  XRNGA-DQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNVIRTLQQSM 708
              +GA DQ+ENS+P     WPGR+RSV  N+LARS DCS  +R K IGSG V+ + QQSM
Sbjct: 177  RGSGAVDQVENSRP-----WPGRSRSV--NVLARSFDCSV-DRKKSIGSGIVVGSFQQSM 228

Query: 707  IDE-RRASFDGRLSLDLGNAEPLKAVEQ---ANNVNESSLPSDLTAXXXXXXXXXXXXXV 540
            IDE RRASFDGRLSLDLGNAE LK  +Q    N+ N+SS+P+DLTA             +
Sbjct: 229  IDESRRASFDGRLSLDLGNAELLKVTKQDPDGNSANDSSVPTDLTASDTDSVSSGSTSGL 288

Query: 539  QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPKL---R 369
            QEC G   R  G  PRGIVVSARFWQETNSRLRRLQDPGS L TSPGS+++V  K    +
Sbjct: 289  QECAGVSGRRSG--PRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRMAVAAKFIQSK 346

Query: 368  KYSSDVPVSSPRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXR----VKNVV-STINS 204
            K+ SD P++SPR M SPIRG   R ASPSKL+                 ++N V S ++S
Sbjct: 347  KFPSDNPLASPRTMMSPIRGA-TRPASPSKLMASSMPVSSPIRASSPARLRNAVASPLSS 405

Query: 203  NFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAE 24
            +    PS+LSF+VDVRRGK+GENRIVDAHLLRLLYNRHLQWRFVNA  +A LLVQ+  AE
Sbjct: 406  SSSIAPSILSFSVDVRRGKMGENRIVDAHLLRLLYNRHLQWRFVNARADAALLVQRMRAE 465


>gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]
          Length = 670

 Score =  315 bits (808), Expect = 2e-83
 Identities = 199/377 (52%), Positives = 237/377 (62%), Gaps = 26/377 (6%)
 Frame = -1

Query: 1055 EVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXRN 876
            EVSAATK           SFQGE FSLP+SKTK   PS      RK              
Sbjct: 137  EVSAATKLLVTSTRSLSVSFQGEAFSLPISKTKPTTPS----GARKATPERRRTTPLRGG 192

Query: 875  GADQLENSKPVDQHRWPGRARSVQGN------LLARSLDC-SNGERNKVIG--SGNVIRT 723
              DQLENSKP DQHRWP R R    N      LL+RS+D  + G+  K+ G  SG V+R 
Sbjct: 193  ERDQLENSKPGDQHRWPARTRQGNSNSSNSNPLLSRSVDFGAGGDGRKLNGFRSGTVVRA 252

Query: 722  LQQSMIDE-RRASFDGRLSLDLGNAEPLKAVEQANNVNESSLPSDLTAXXXXXXXXXXXX 546
            LQQS++DE RR+SFDGRLSLDLG+AE LK V  +NN  ESS PSDLTA            
Sbjct: 253  LQQSLLDETRRSSFDGRLSLDLGSAELLK-VNSSNN--ESSAPSDLTASDTDSVSSGSTS 309

Query: 545  XVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPKL-- 372
             +Q+  G  ++ R G PRGIVVSARFWQETNSRLRRLQDPGS L TSPGS++  P K   
Sbjct: 310  GMQDANGV-SKARTGTPRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRMGAPAKFVQ 368

Query: 371  -RKYSSDV-PVSSPRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXR----------VK 228
             ++YS D+ P+SSPR M+SPIRG   R ASPSKL                        V+
Sbjct: 369  SKRYSGDINPLSSPRTMASPIRGAN-RPASPSKLWTSSSMPSPSRGMSPSRGIASPSRVR 427

Query: 227  N-VVSTINSNFV-ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEA 54
            N V  ++N ++   TPS+LSF+VD+RRGK+GE+RIVDAH+LRLLYNR+LQWRFVNA  +A
Sbjct: 428  NGVAGSMNGSYGGNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFVNARADA 487

Query: 53   TLLVQKHSAEKTLWNAW 3
            T +VQK +AEK LWNAW
Sbjct: 488  TFMVQKLNAEKNLWNAW 504


>ref|XP_004171185.1| PREDICTED: uncharacterized LOC101215899 [Cucumis sativus]
          Length = 514

 Score =  311 bits (797), Expect = 3e-82
 Identities = 194/375 (51%), Positives = 247/375 (65%), Gaps = 19/375 (5%)
 Frame = -1

Query: 1082 LDSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            LDS+  N  + SAA K           SFQGE FSLP+SKTKA   +P+LS+ RKG    
Sbjct: 130  LDSRHGNATDSSAAAKLLVTSTRSLSVSFQGEAFSLPISKTKATA-TPSLSNARKGSTPE 188

Query: 902  XXXXXXXRNGAD----QLENSKPVDQHRWPGRAR--SVQGNLLARSLDCSNGERNKV--I 747
                   R+ +D    Q+ENSK +DQHRWP R R  +++GN L+RS DC  GE+ KV  I
Sbjct: 189  RRRATPLRDKSDGSGVQVENSKLLDQHRWPARNRHANLEGNPLSRSFDCG-GEQKKVNGI 247

Query: 746  GSGNVIRTLQQSMIDE-RRASFDGRLSLDLGNAEPLKAVEQ---ANNVNESSLPSDLTAX 579
            GSG V+R LQQ++ D+ RRASFDGRLSLDL ++E +KAV Q   A++VNESS+PSDLT  
Sbjct: 248  GSGMVVRALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVNESSVPSDLTTS 307

Query: 578  XXXXXXXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPG 399
                        VQ+CG S  + R G PRGIVVSARFWQETNSRLRRL DPGS L TSPG
Sbjct: 308  DTDSVSSGSTSGVQDCG-SVAKGRNG-PRGIVVSARFWQETNSRLRRLHDPGSPLSTSPG 365

Query: 398  SKLSVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL----IXXXXXXXXXXX 240
            +++  P K    +++S+D P+SSPR M+SPIRGG  R  SPSKL    +           
Sbjct: 366  ARVGAPSKFSQSKRFSNDGPLSSPRTMASPIRGG-TRPPSPSKLWTSSVSSPSRGISSPS 424

Query: 239  XRVKNVVSTINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATT 60
                 V  ++ SN + TPS+LSF+VD+RRGK+GE+RIVDAH+LRL +NR+LQWRFVNA  
Sbjct: 425  RTRNGVGGSLVSNSISTPSILSFSVDIRRGKMGEDRIVDAHVLRLHHNRYLQWRFVNARA 484

Query: 59   EATLLVQKHSAEKTL 15
            +AT ++Q+ +AE  L
Sbjct: 485  DATFMLQRLNAEVLL 499


>gb|EPS70660.1| hypothetical protein M569_04100, partial [Genlisea aurea]
          Length = 637

 Score =  308 bits (789), Expect = 2e-81
 Identities = 195/364 (53%), Positives = 230/364 (63%), Gaps = 13/364 (3%)
 Frame = -1

Query: 1055 EVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXR- 879
            EVSAATK           SFQGE FSLP+SKTK APP P+  S+RKG           R 
Sbjct: 133  EVSAATKLLVTSTRSLSVSFQGEAFSLPISKTKVAPP-PSSPSVRKGTPERKRTSTPSRV 191

Query: 878  ---NGADQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNG---ERNKVIGSGNVIRTLQ 717
                G D  +  K  DQHRWPGR R V  N L++SL+ S     +R ++IGSG  IR+LQ
Sbjct: 192  RAEGGGDPADVFKLADQHRWPGRNRLVN-NPLSKSLNYSGAADDKRIELIGSGQSIRSLQ 250

Query: 716  QSMI-DERRASFDGRLSLDLGNAEPLKAVEQAN-NVNESSLPSDLTAXXXXXXXXXXXXX 543
            QSMI DERR SFDGRL LDL +++ LK   +   + N    PSD  +             
Sbjct: 251  QSMIIDERRTSFDGRLCLDLDSSDLLKEFPRGGVDRNGDYNPSDSDSASSGSTTGV---- 306

Query: 542  VQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPKLRKY 363
              + GG  + +     RG+ VSARFWQETNSRLRRLQDP S L +SPGSK+ +PPK++++
Sbjct: 307  -HDSGGGSSLLNDA--RGMAVSARFWQETNSRLRRLQDPVSPLSSSPGSKMIIPPKMKRF 363

Query: 362  SSD-VPVSSPRAMSSPIRGG-GIRSASPSKL--IXXXXXXXXXXXXRVKNVVSTINSNFV 195
            S D   VSSPRAM SP R     R+ASP KL               R++N VSTI +NFV
Sbjct: 364  SMDGSSVSSPRAMLSPSRASVANRAASPGKLSATVGSSPSRGYSPARIRNAVSTICNNFV 423

Query: 194  ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAEKTL 15
            ETPSVLSFAVD+RRGKVGENRIVDAHLLRLLYNRHLQW FVNA TE  LL+QKH+AEK L
Sbjct: 424  ETPSVLSFAVDIRRGKVGENRIVDAHLLRLLYNRHLQWSFVNARTETVLLLQKHTAEKNL 483

Query: 14   WNAW 3
            WNAW
Sbjct: 484  WNAW 487


>gb|EMJ00929.1| hypothetical protein PRUPE_ppa002521mg [Prunus persica]
          Length = 662

 Score =  307 bits (786), Expect = 5e-81
 Identities = 196/377 (51%), Positives = 235/377 (62%), Gaps = 18/377 (4%)
 Frame = -1

Query: 1079 DSKLSNVG-EVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            +++LSN G EVSAAT+           SFQGE FSLP+SKTKAA  SP+ +  RK     
Sbjct: 130  EARLSNAGAEVSAATRLLVTSTRSLSVSFQGEAFSLPISKTKAAA-SPSGAVARKATPER 188

Query: 902  XXXXXXXRNGADQLENSKPVDQHRWPGRARSVQG---NLLARSLDCSNGERN-KVIGSGN 735
                       DQ ENSKP DQ+RWP R R +     N L+RSLDCS+  R    IGSG 
Sbjct: 189  RRSTPVR---GDQAENSKPSDQYRWPARTRQLSSGSNNSLSRSLDCSSETRKLNGIGSGV 245

Query: 734  VIRTLQQSMIDE-RRASFDGRLSLDLGNAEPLKAVEQ---ANNVNESSLPSDLTAXXXXX 567
              R LQQSMID+ RRASFD RLSLDLGNAEPLKA EQ   AN+ N+SS+PSDLTA     
Sbjct: 246  AARALQQSMIDDSRRASFDRRLSLDLGNAEPLKAAEQNPDANSANDSSVPSDLTASDTDS 305

Query: 566  XXXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLS 387
                    V + GG         PRGIVVSARFWQETNSRLRRLQDPGS L TSP S+  
Sbjct: 306  VSSGSTSGVHDAGGVAKSRTA--PRGIVVSARFWQETNSRLRRLQDPGSPLSTSPVSRAG 363

Query: 386  ---VPPKLRKYSSDVPVSS-PRAMSSPIRGGGIRSASPSKLIXXXXXXXXXXXXRVKNVV 219
               +  + +K++ D+P+SS PR ++SP RG   R ASP KL                 V 
Sbjct: 364  SKFIQSQSKKFNGDIPLSSSPRTIASPTRGP-TRPASPGKL-WTSSSMSPSRGYSPSRVR 421

Query: 218  STINSNFV-----ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEA 54
            S++N +         PS+LSF+VD RRGK+GE+RIVDAH+LRLLYNR+LQWRFVNA  +A
Sbjct: 422  SSVNGSLNISYSGPAPSILSFSVDTRRGKMGEDRIVDAHMLRLLYNRYLQWRFVNARADA 481

Query: 53   TLLVQKHSAEKTLWNAW 3
            T +V + +AEK LWNAW
Sbjct: 482  TFMVHRLNAEKNLWNAW 498


>gb|EOX98447.1| Family of Uncharacterized protein function, putative isoform 2
            [Theobroma cacao]
          Length = 571

 Score =  296 bits (758), Expect = 1e-77
 Identities = 197/375 (52%), Positives = 232/375 (61%), Gaps = 21/375 (5%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            N  E+SAATK           SFQGE FSLP+SKTKA   S   +  RK           
Sbjct: 156  NATELSAATKMLITSTRSLSVSFQGEAFSLPISKTKAQVGS---AMTRKATPERRRATPV 212

Query: 884  XRNGADQLENSKPVDQHRWPGRARSVQG--NLLARSLDCSNGERNKVIGSGNVI-RTLQQ 714
              +G    ENSKPVDQHRWPGR R      N L+RSLD S+    K+ GSG ++ ++LQQ
Sbjct: 213  RDHG----ENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS--ERKMFGSGAIVAKSLQQ 266

Query: 713  SM-IDE--RRASFDG--RLSLDLGNA-----EPLKAVEQANNVNESSLPS-DLTAXXXXX 567
            SM +DE  RR SFDG  RLSLDLG++     E  K    AN++NE+S  S DLTA     
Sbjct: 267  SMMLDESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDS 326

Query: 566  XXXXXXXXV-QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
                      QECGGSG       PR IVVSARFWQETNSRLRRLQDPGS L TSPGS++
Sbjct: 327  VSSGSTNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRI 386

Query: 389  SVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL--IXXXXXXXXXXXXRVKN 225
                K    +++SSD  VSSPR M+SPIRGG  R ASPSKL               RV+N
Sbjct: 387  GASAKFSQSKRFSSDGVVSSPRTMASPIRGG-TRPASPSKLWTSATSSPLRGLSPARVRN 445

Query: 224  VVS-TINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATL 48
             V   +  N V TPS+LSF+VD+RRGK+GE+RIVDAH+LRLLYNR+LQWRF NA  +AT 
Sbjct: 446  AVGGQMMGNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADATF 505

Query: 47   LVQKHSAEKTLWNAW 3
            ++QK SAEK LWNAW
Sbjct: 506  MLQKLSAEKNLWNAW 520


>gb|EOX98446.1| Family of Uncharacterized protein function (DUF566), putative isoform
            1 [Theobroma cacao]
          Length = 684

 Score =  296 bits (758), Expect = 1e-77
 Identities = 197/375 (52%), Positives = 232/375 (61%), Gaps = 21/375 (5%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            N  E+SAATK           SFQGE FSLP+SKTKA   S   +  RK           
Sbjct: 156  NATELSAATKMLITSTRSLSVSFQGEAFSLPISKTKAQVGS---AMTRKATPERRRATPV 212

Query: 884  XRNGADQLENSKPVDQHRWPGRARSVQG--NLLARSLDCSNGERNKVIGSGNVI-RTLQQ 714
              +G    ENSKPVDQHRWPGR R      N L+RSLD S+    K+ GSG ++ ++LQQ
Sbjct: 213  RDHG----ENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS--ERKMFGSGAIVAKSLQQ 266

Query: 713  SM-IDE--RRASFDG--RLSLDLGNA-----EPLKAVEQANNVNESSLPS-DLTAXXXXX 567
            SM +DE  RR SFDG  RLSLDLG++     E  K    AN++NE+S  S DLTA     
Sbjct: 267  SMMLDESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDS 326

Query: 566  XXXXXXXXV-QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
                      QECGGSG       PR IVVSARFWQETNSRLRRLQDPGS L TSPGS++
Sbjct: 327  VSSGSTNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRI 386

Query: 389  SVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL--IXXXXXXXXXXXXRVKN 225
                K    +++SSD  VSSPR M+SPIRGG  R ASPSKL               RV+N
Sbjct: 387  GASAKFSQSKRFSSDGVVSSPRTMASPIRGG-TRPASPSKLWTSATSSPLRGLSPARVRN 445

Query: 224  VVS-TINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATL 48
             V   +  N V TPS+LSF+VD+RRGK+GE+RIVDAH+LRLLYNR+LQWRF NA  +AT 
Sbjct: 446  AVGGQMMGNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADATF 505

Query: 47   LVQKHSAEKTLWNAW 3
            ++QK SAEK LWNAW
Sbjct: 506  MLQKLSAEKNLWNAW 520


>ref|XP_006589620.1| PREDICTED: QWRF motif-containing protein 2-like isoform X2 [Glycine
            max]
          Length = 614

 Score =  293 bits (750), Expect = 8e-77
 Identities = 181/348 (52%), Positives = 223/348 (64%), Gaps = 16/348 (4%)
 Frame = -1

Query: 998  FQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXRNGADQLENSKPVDQHRWPGR 819
            FQGE FSLPVSKTKAA  +P   + RK            +      ENS+P DQHRWP R
Sbjct: 115  FQGEAFSLPVSKTKAASATP---TPRKAATPERRRATPVKG-----ENSRPADQHRWPAR 166

Query: 818  ARSVQGNLLARSLDCSNGERNKVIGSGN------VIRTLQQSMI---DERRASFDGR--L 672
             R V    L++S+D  + ++ KV+G+GN      V+R LQQSM+   ++RRASFDG   L
Sbjct: 167  TRHVDH--LSKSVDIIDNKK-KVVGNGNGNGFGKVVRALQQSMVVEGEKRRASFDGLGGL 223

Query: 671  SLDLGNAEPLKAVEQANNVNESSLPSDLTAXXXXXXXXXXXXXVQECGGSGTRIRGGVPR 492
            SLDLG AE LK    ANN N+SSL SDLTA               +  G+    +   PR
Sbjct: 224  SLDLGKAELLKGNINANNHNKSSLASDLTASDTDSVSSGSTSGAHDSSGAAKGTKE--PR 281

Query: 491  GIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPK---LRKYSSDVPVSSPRAMSS 321
            GIVVSARFWQETNSRLRRLQDPGS L TSP S++ VP +   L++Y+SD P+ SPR M+S
Sbjct: 282  GIVVSARFWQETNSRLRRLQDPGSPLSTSPASRIGVPNRNAQLKRYNSDGPMLSPRTMAS 341

Query: 320  PIRGG-GIRSASPSKLIXXXXXXXXXXXXRVKNVV-STINSNFVETPSVLSFAVDVRRGK 147
            P+RG    R ASPSKL             RV++ V S+INS    TPS+LSF+ DVRRGK
Sbjct: 342  PVRGNVNARPASPSKLWAGSSPSRGVSPARVRSTVASSINSGSGNTPSILSFSADVRRGK 401

Query: 146  VGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAEKTLWNAW 3
            +GE+RI DAH LRLLYNR++QWRFVNA  +AT +VQK +AE+ LWNAW
Sbjct: 402  IGEDRIFDAHTLRLLYNRYVQWRFVNARADATFMVQKLNAERHLWNAW 449


>ref|XP_003536586.1| PREDICTED: QWRF motif-containing protein 2-like isoform X1 [Glycine
            max]
          Length = 613

 Score =  293 bits (750), Expect = 8e-77
 Identities = 181/348 (52%), Positives = 223/348 (64%), Gaps = 16/348 (4%)
 Frame = -1

Query: 998  FQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXRNGADQLENSKPVDQHRWPGR 819
            FQGE FSLPVSKTKAA  +P   + RK            +      ENS+P DQHRWP R
Sbjct: 115  FQGEAFSLPVSKTKAASATP---TPRKAATPERRRATPVKG-----ENSRPADQHRWPAR 166

Query: 818  ARSVQGNLLARSLDCSNGERNKVIGSGN------VIRTLQQSMI---DERRASFDGR--L 672
             R V    L++S+D  + ++ KV+G+GN      V+R LQQSM+   ++RRASFDG   L
Sbjct: 167  TRHVDH--LSKSVDIIDNKK-KVVGNGNGNGFGKVVRALQQSMVVEGEKRRASFDGLGGL 223

Query: 671  SLDLGNAEPLKAVEQANNVNESSLPSDLTAXXXXXXXXXXXXXVQECGGSGTRIRGGVPR 492
            SLDLG AE LK    ANN N+SSL SDLTA               +  G+    +   PR
Sbjct: 224  SLDLGKAELLKGNINANNHNKSSLASDLTASDTDSVSSGSTSGAHDSSGAAKGTKE--PR 281

Query: 491  GIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPK---LRKYSSDVPVSSPRAMSS 321
            GIVVSARFWQETNSRLRRLQDPGS L TSP S++ VP +   L++Y+SD P+ SPR M+S
Sbjct: 282  GIVVSARFWQETNSRLRRLQDPGSPLSTSPASRIGVPNRNAQLKRYNSDGPMLSPRTMAS 341

Query: 320  PIRGG-GIRSASPSKLIXXXXXXXXXXXXRVKNVV-STINSNFVETPSVLSFAVDVRRGK 147
            P+RG    R ASPSKL             RV++ V S+INS    TPS+LSF+ DVRRGK
Sbjct: 342  PVRGNVNARPASPSKLWAGSSPSRGVSPARVRSTVASSINSGSGNTPSILSFSADVRRGK 401

Query: 146  VGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAEKTLWNAW 3
            +GE+RI DAH LRLLYNR++QWRFVNA  +AT +VQK +AE+ LWNAW
Sbjct: 402  IGEDRIFDAHTLRLLYNRYVQWRFVNARADATFMVQKLNAERHLWNAW 449


>ref|XP_002283295.1| PREDICTED: uncharacterized protein LOC100242050 [Vitis vinifera]
          Length = 743

 Score =  291 bits (745), Expect = 3e-76
 Identities = 188/374 (50%), Positives = 234/374 (62%), Gaps = 16/374 (4%)
 Frame = -1

Query: 1079 DSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXX 900
            D +  N GEV+ A+K           SFQGE+FSL VSKTK AP     +S+RKG     
Sbjct: 219  DFRPGNAGEVTTASKMLITSARSLSVSFQGESFSLRVSKTKPAP-----ASVRKGTPERR 273

Query: 899  XXXXXXRNGADQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNVIRTL 720
                     ADQ ENSKPVDQHRWPGR+R V  N L RS+DC++ E+ K+ GSG + R+L
Sbjct: 274  KPTPTR---ADQTENSKPVDQHRWPGRSRQV--NSLTRSMDCTD-EKKKLGGSGIMARSL 327

Query: 719  QQSMIDER-RASFDGRLSLDLGNAEPLKAVE--QANNVNESSLPSDLTAXXXXXXXXXXX 549
            QQSMIDER R   DGRL+LD GNAE  KA E   AN+V  S++ SD  A           
Sbjct: 328  QQSMIDERNRTPLDGRLNLDSGNAELGKANELVNANSVVGSTMTSDPAASDTESVSSGST 387

Query: 548  XXVQECGGSGTRIRG-GVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL-SVPPK 375
               QE GG G   +G GVPRGI+V ARFWQET++RLRR  +P S    S G +  +VPPK
Sbjct: 388  SGAQESGGGGGGTQGRGVPRGIMVPARFWQETSNRLRRTPEPSSPQSKSNGLRTPAVPPK 447

Query: 374  L---RKYSSDVPVSSPRAM-----SSPIRGGGIRSASPSKLIXXXXXXXXXXXXR---VK 228
            L   +K  +D P+SSPR +      SP+RG  +R ASPSKL+                V+
Sbjct: 448  LIAPKKLLTDSPMSSPRGILPSRGQSPLRGP-VRPASPSKLVTTSTYSPLRGMPSPTRVR 506

Query: 227  NVVSTINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATL 48
             VV ++N N    PS+LSFA DVRRGKVGENR+VDAHLLRLL+NR+LQWRF+NA  +A+L
Sbjct: 507  AVVGSLNGNLSNNPSILSFAADVRRGKVGENRMVDAHLLRLLHNRYLQWRFINARADASL 566

Query: 47   LVQKHSAEKTLWNA 6
            LVQ+ +AE++L NA
Sbjct: 567  LVQRMNAEQSLCNA 580


>ref|XP_003555289.1| PREDICTED: QWRF motif-containing protein 2-like [Glycine max]
          Length = 625

 Score =  287 bits (734), Expect = 6e-75
 Identities = 183/361 (50%), Positives = 223/361 (61%), Gaps = 29/361 (8%)
 Frame = -1

Query: 998  FQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXRNGADQLENSKPVDQHRWPGR 819
            FQGE FSLPVSKTKAA  +P     RK            +      ENS+PVDQHRWP R
Sbjct: 115  FQGEAFSLPVSKTKAAAATP---PPRKAATPERRRATPVKG-----ENSRPVDQHRWPAR 166

Query: 818  ARSVQGNLLARSLDCSNGERNKVIGSG--NVIRTLQQSMI---DERRASFDGR--LSLDL 660
             R V    L++S+D S  ++ KVIG+G   V+R LQQSM+   ++RRASFDG   LSLDL
Sbjct: 167  TRRVDH--LSKSVDVS--DKKKVIGNGFGKVVRALQQSMVVEGEKRRASFDGLGGLSLDL 222

Query: 659  GNAEPLKAVEQANN-----------------VNESSLPSDLTAXXXXXXXXXXXXXVQEC 531
            G AE LK    +N+                 VN+SSL SDLTA               E 
Sbjct: 223  GKAELLKGNSNSNSNANNHSNNDGGGGGGNLVNKSSLASDLTASDTDSVSSGSTSGAHES 282

Query: 530  GGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKLSVPPK---LRKYS 360
             G+    +   PRGIVVSARFWQETNSRLRRLQDPGS L TSP S++ VP +   L++Y+
Sbjct: 283  SGAAKGTKE--PRGIVVSARFWQETNSRLRRLQDPGSPLSTSPASRIGVPNRNAQLKRYN 340

Query: 359  SDVPVSSPRAMSSPIRGG-GIRSASPSKLIXXXXXXXXXXXXRVKNVV-STINSNFVETP 186
            SD P+ SPR M+SP+RG    R ASPSKL             RV++ V S+INS    TP
Sbjct: 341  SDGPMLSPRTMASPVRGNVNARPASPSKLWAGSSPSRGVSPARVRSTVASSINSGSSNTP 400

Query: 185  SVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLVQKHSAEKTLWNA 6
            S+LSF+ DVRRGK+GE+RI DAH LRLLYNR++QWRFVNA  +AT +VQK +AE+ LWNA
Sbjct: 401  SILSFSADVRRGKIGEDRIFDAHTLRLLYNRYVQWRFVNARADATFMVQKLNAERHLWNA 460

Query: 5    W 3
            W
Sbjct: 461  W 461


>ref|XP_004292128.1| PREDICTED: uncharacterized protein LOC101313278 [Fragaria vesca
            subsp. vesca]
          Length = 653

 Score =  283 bits (725), Expect = 6e-74
 Identities = 193/391 (49%), Positives = 237/391 (60%), Gaps = 32/391 (8%)
 Frame = -1

Query: 1079 DSKLSNVG-EVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXX 903
            + +LSN G EVSAAT+           SFQGE FSLP+SKTKAA  +P     RK     
Sbjct: 112  EQRLSNAGSEVSAATRLLVTSTRSLSVSFQGEAFSLPISKTKAAVGTP----ARKPTPER 167

Query: 902  XXXXXXXRNGADQLENSKPVDQHRWPGRARSVQGNLLARSLDCSNGERNKVIGSGNVIRT 723
                   R G DQ+ENS+P +QHRWPGR+R    +L +RS++C  G  N  +GSG V R 
Sbjct: 168  RRSTPVRREGGDQVENSRPGEQHRWPGRSRQPAVSL-SRSMECG-GSVNNGVGSGVVARA 225

Query: 722  L--QQSMIDER-------RASFDGRLSLDLG----NAEPLKAV----EQANNVNESSLPS 594
            L  QQSM+DE        R+SFDGRLSLDLG    NA+ L+A     + + + NESS+PS
Sbjct: 226  LLLQQSMVDESSRGGSRSRSSFDGRLSLDLGHLGGNADALRASHHNPDASFSANESSVPS 285

Query: 593  DLTAXXXXXXXXXXXXXVQECGGSGTRIRGG--VPRGIVVSARFWQETNSRLRRLQDPGS 420
            DLTA             VQ+  G+  + RGG  VPRGI VSARFWQETNSRLRRLQDPGS
Sbjct: 286  DLTASDTDSVSSGSTSGVQDSNGAA-KSRGGTAVPRGIAVSARFWQETNSRLRRLQDPGS 344

Query: 419  SLLTSPGSKLSVPPKL---RKYSS--DVPVSSPRAMSSPIRGGGIRSASPSKL---IXXX 264
             L TSP S+     K    +K++      VSSPR MSSPIRG   R ASP KL       
Sbjct: 345  PLATSPVSRAGAAGKYIQSKKFNGGDSNAVSSPRTMSSPIRGA-TRPASPGKLWTSSSLS 403

Query: 263  XXXXXXXXXRVKNVVS---TINSNFV-ETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYN 96
                     R +N  S    ++S +   TPS LSF++D RRGK GE+RIVDAH+LRLLYN
Sbjct: 404  PSRGSASPSRARNSFSGPGQLSSGYAASTPSFLSFSIDTRRGKKGEDRIVDAHMLRLLYN 463

Query: 95   RHLQWRFVNATTEATLLVQKHSAEKTLWNAW 3
            R++QWRFVNA  +AT +VQ+ +AE+ LWNAW
Sbjct: 464  RYVQWRFVNARADATYMVQRVNAEENLWNAW 494


>gb|EOX98448.1| Family of Uncharacterized protein function, putative isoform 3,
            partial [Theobroma cacao]
          Length = 590

 Score =  282 bits (721), Expect = 2e-73
 Identities = 192/375 (51%), Positives = 227/375 (60%), Gaps = 21/375 (5%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            N  E+SAATK           SFQGE FSLP+SKTKA   S   +  RK           
Sbjct: 156  NATELSAATKMLITSTRSLSVSFQGEAFSLPISKTKAQVGS---AMTRKATPERRRATPV 212

Query: 884  XRNGADQLENSKPVDQHRWPGRARSVQG--NLLARSLDCSNGERNKVIGSGNVI-RTLQQ 714
              +G    ENSKPVDQHRWPGR R      N L+RSLD S+    K+ GSG ++ ++LQQ
Sbjct: 213  RDHG----ENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS--ERKMFGSGAIVAKSLQQ 266

Query: 713  SM-IDE--RRASFDG--RLSLDLGNA-----EPLKAVEQANNVNESSLPS-DLTAXXXXX 567
            SM +DE  RR SFDG  RLSLDLG++     E  K    AN++NE+S  S DLTA     
Sbjct: 267  SMMLDESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDS 326

Query: 566  XXXXXXXXV-QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
                      QECGGSG       PR IVVSARFWQETNSRLRRLQDPGS L TSPGS++
Sbjct: 327  VSSGSTNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRI 386

Query: 389  SVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL--IXXXXXXXXXXXXRVKN 225
                K    +++SSD  VSSPR M+SPIRGG  R ASPSKL               RV+N
Sbjct: 387  GASAKFSQSKRFSSDGVVSSPRTMASPIRGG-TRPASPSKLWTSATSSPLRGLSPARVRN 445

Query: 224  VVS-TINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATL 48
             V   +  N V TPS+LSF+VD+RRGK+GE+RIVDAH+LRLLYNR+LQWRF NA  +AT 
Sbjct: 446  AVGGQMMGNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADATF 505

Query: 47   LVQKHSAEKTLWNAW 3
            ++QK SAE      W
Sbjct: 506  MLQKLSAEIAYLEEW 520


>gb|EOX98449.1| Family of Uncharacterized protein function, putative isoform 4
            [Theobroma cacao]
          Length = 517

 Score =  280 bits (717), Expect = 5e-73
 Identities = 191/368 (51%), Positives = 226/368 (61%), Gaps = 21/368 (5%)
 Frame = -1

Query: 1064 NVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXX 885
            N  E+SAATK           SFQGE FSLP+SKTKA   S   +  RK           
Sbjct: 156  NATELSAATKMLITSTRSLSVSFQGEAFSLPISKTKAQVGS---AMTRKATPERRRATPV 212

Query: 884  XRNGADQLENSKPVDQHRWPGRARSVQG--NLLARSLDCSNGERNKVIGSGNVI-RTLQQ 714
              +G    ENSKPVDQHRWPGR R      N L+RSLD S+    K+ GSG ++ ++LQQ
Sbjct: 213  RDHG----ENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS--ERKMFGSGAIVAKSLQQ 266

Query: 713  SM-IDE--RRASFDG--RLSLDLGNA-----EPLKAVEQANNVNESSLPS-DLTAXXXXX 567
            SM +DE  RR SFDG  RLSLDLG++     E  K    AN++NE+S  S DLTA     
Sbjct: 267  SMMLDESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDS 326

Query: 566  XXXXXXXXV-QECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
                      QECGGSG       PR IVVSARFWQETNSRLRRLQDPGS L TSPGS++
Sbjct: 327  VSSGSTNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRI 386

Query: 389  SVPPKL---RKYSSDVPVSSPRAMSSPIRGGGIRSASPSKL--IXXXXXXXXXXXXRVKN 225
                K    +++SSD  VSSPR M+SPIRGG  R ASPSKL               RV+N
Sbjct: 387  GASAKFSQSKRFSSDGVVSSPRTMASPIRGG-TRPASPSKLWTSATSSPLRGLSPARVRN 445

Query: 224  VVS-TINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATL 48
             V   +  N V TPS+LSF+VD+RRGK+GE+RIVDAH+LRLLYNR+LQWRF NA  +AT 
Sbjct: 446  AVGGQMMGNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADATF 505

Query: 47   LVQKHSAE 24
            ++QK SAE
Sbjct: 506  MLQKLSAE 513


>ref|XP_002527498.1| conserved hypothetical protein [Ricinus communis]
            gi|223533138|gb|EEF34896.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 634

 Score =  279 bits (714), Expect = 1e-72
 Identities = 181/373 (48%), Positives = 227/373 (60%), Gaps = 14/373 (3%)
 Frame = -1

Query: 1079 DSKLSNVGEVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXX 900
            ++K  NV E+SAAT+           SFQGE FSLP+SK KA   SPN++  RK      
Sbjct: 116  EAKQGNVSEMSAATRMLITSTRSLSVSFQGEAFSLPISKAKAVSSSPNVT--RKVTPERR 173

Query: 899  XXXXXXRNGADQLENSKPVDQHRWPGRARSVQGNL--------LARSLDCS-NGERNKVI 747
                      DQ ENS+P+DQHRWPGR+R   GNL        L+RS DCS  G+  +V+
Sbjct: 174  KSTPVR----DQGENSRPLDQHRWPGRSRG--GNLALNERNPSLSRSFDCSVGGDEKRVM 227

Query: 746  GSGNV-IRTLQQSMIDERRASFDGRLSLDLGNAEPLKAVEQANNVNESSLPSDLTAXXXX 570
            GSG + +++LQQSMI + R     RLSLDLGNA+    V   ++V++S +  DLTA    
Sbjct: 228  GSGFMSVKSLQQSMIVDER-----RLSLDLGNAKRNPDVN--SSVSDSFVTGDLTASDSD 280

Query: 569  XXXXXXXXXVQECGGSGTRIRGGVPRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
                     +Q+ G   +R + G PRGI VSARFWQETNSRLRRLQDPGS L TSP  + 
Sbjct: 281  SVSSGSTSGLQDFGSGISRAKTG-PRGIAVSARFWQETNSRLRRLQDPGSPLSTSPNPRT 339

Query: 389  SVPPKL---RKYSSDVPVSSPRAM-SSPIRGGGIRSASPSKLIXXXXXXXXXXXXRVKNV 222
            S+  K    +++SSD PV+SPR   SSPIRG   R ASPSKL                  
Sbjct: 340  SISSKTIQSKRFSSDAPVASPRTFGSSPIRGA-TRPASPSKLWTHSASSPSRGISSPSRG 398

Query: 221  VSTINSNFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATTEATLLV 42
               ++SN    PS+LSFAVD+RRGK+GE+RI DAH+LRLLYN +LQWRFVNA  +AT  V
Sbjct: 399  -RPMSSNLSSMPSILSFAVDLRRGKMGEDRIGDAHMLRLLYNHYLQWRFVNARADATFFV 457

Query: 41   QKHSAEKTLWNAW 3
            Q+ +AEK LWNAW
Sbjct: 458  QRVNAEKNLWNAW 470


>ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arabidopsis lyrata subsp.
            lyrata] gi|297337382|gb|EFH67799.1| hypothetical protein
            ARALYDRAFT_891910 [Arabidopsis lyrata subsp. lyrata]
          Length = 660

 Score =  274 bits (701), Expect = 4e-71
 Identities = 188/379 (49%), Positives = 229/379 (60%), Gaps = 28/379 (7%)
 Frame = -1

Query: 1055 EVSAATKXXXXXXXXXXXSFQGETFSLPVSKTKAAPPSPNLSSLRKGXXXXXXXXXXXRN 876
            E+SAATK           SFQGE FSLP+SK K A  +P   S RK              
Sbjct: 129  EMSAATKMLITSTRSLSVSFQGEAFSLPISKKKEATTTP--VSHRKSTPERRRSTPVR-- 184

Query: 875  GADQLENSKPVDQHRWPGRAR-----SVQGNLLARSLDCSNGERNKVIGSGNVIRT-LQQ 714
              DQ ENSKPVDQ RWPG +R     SV  N L+RSLDC + +R K +GSG V R+ L  
Sbjct: 185  --DQRENSKPVDQQRWPGASRRGNSESVVPNSLSRSLDCGS-DRGK-LGSGFVGRSMLHN 240

Query: 713  SMIDER-RASFDGRLSLDLGNAEPLKAV-----EQANNVNESSLPSDLTAXXXXXXXXXX 552
            SMIDE  R S +GRLSLDLG  +    +      + NN   SS+  D TA          
Sbjct: 241  SMIDESPRVSINGRLSLDLGGRDEYLEIGDESQRRPNNGLTSSVSCDFTASDTDSVSSGS 300

Query: 551  XXXVQECGGSGTRIRGGV------PRGIVVSARFWQETNSRLRRLQDPGSSLLTSPGSKL 390
               VQECG     + G +      PR I+ SARFWQETNSRLRRLQDPGS L +SPG K 
Sbjct: 301  TNGVQECGSG---VNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKT 357

Query: 389  S-VPPKL---RKYSSD-VPVSSPRAMSSPIRGGGIRSASPSKL---IXXXXXXXXXXXXR 234
            S V  K    +++SSD VP+SSPR M+SP+RG  IRSASPSKL                R
Sbjct: 358  SSVSSKFGLSKRFSSDAVPLSSPRGMASPVRGSAIRSASPSKLWATTTSSPARALSSPSR 417

Query: 233  VKNVVST-INS-NFVETPSVLSFAVDVRRGKVGENRIVDAHLLRLLYNRHLQWRFVNATT 60
            V+N VS  +N+ N   TPS+LSF+ D+RRGK+GE+R++DAHLLRLLYNR+LQWRFVNA  
Sbjct: 418  VRNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRYLQWRFVNARA 477

Query: 59   EATLLVQKHSAEKTLWNAW 3
            ++T++VQ+ +AEK LWNAW
Sbjct: 478  DSTVMVQRLNAEKNLWNAW 496

