BLASTX nr result

ID: Sinomenium22_contig00032020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00032020
         (913 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304...   141   3e-31
ref|XP_007203912.1| hypothetical protein PRUPE_ppa016794mg, part...   139   2e-30
ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617...   138   3e-30
ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citr...   138   3e-30
ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus c...   137   8e-30
ref|XP_002310176.2| hypothetical protein POPTR_0007s11940g [Popu...   118   4e-24
gb|EXB37241.1| hypothetical protein L484_020300 [Morus notabilis]     109   2e-21
ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527...    91   6e-16
ref|XP_007047104.1| Vacuolar protein sorting-associated protein ...    89   2e-15
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...    85   3e-14
emb|CAB62317.1| putative protein [Arabidopsis thaliana]                85   3e-14
ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arab...    84   1e-13
gb|EYU44333.1| hypothetical protein MIMGU_mgv1a000009mg [Mimulus...    81   6e-13
ref|XP_004142023.1| PREDICTED: uncharacterized protein LOC101222...    79   2e-12
ref|XP_007155985.1| hypothetical protein PHAVU_003G249100g [Phas...    77   7e-12
ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutr...    77   7e-12
ref|XP_006856204.1| hypothetical protein AMTR_s00059p00194330 [A...    75   5e-11
ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Caps...    74   1e-10
ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257...    68   4e-09
ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601...    66   2e-08

>ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304881 [Fragaria vesca
            subsp. vesca]
          Length = 3178

 Score =  141 bits (356), Expect = 3e-31
 Identities = 100/308 (32%), Positives = 141/308 (45%), Gaps = 5/308 (1%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTE-SSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGG 735
            R+AR+RAA + QC E SS       +                     +F  +        
Sbjct: 384  RVARNRAASNVQCPEFSSQKSFVTTIFNFLLISLSLLACTWRFLCKIVFLIMHPLVFRKT 443

Query: 734  IANQHVEADRPLEVVSRYSCSHYCFA--FRRICITISPTSAVPNLVYGQAEAPINIYPFS 561
            +AN+   AD  L++VS   C+ +CF+    ++ ITIS  + +   V  + ++ + I    
Sbjct: 444  LANEPKSAD--LDIVSEGPCTQFCFSVLLGKVQITISHRNEIQLFVNKKLKSHLGITYSD 501

Query: 560  LLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXX 381
             LSF + +D + L Y+A    +SL  SCG  KVRSSS    P+ E + K           
Sbjct: 502  SLSFRLSVDALLLKYVADMCEESLLISCGQLKVRSSSLMEAPVKESSSKLSFSSMEAHWK 561

Query: 380  XXXXGDLKVLWSDPAT--KSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKF 207
                    +LW +PA     L+  +  S   + G    FL      ++MW +W  +  KF
Sbjct: 562  ESNDNWKNILWGEPAEILSLLETYETGSADHMEGSCVSFL------KDMWLDWRSECDKF 615

Query: 206  VSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLW 27
               E QY E PFL CE KN ++ P   T D G  K    +GKLN  LGYSSI+S SLLL 
Sbjct: 616  GKSEIQYSETPFLLCEFKNFLIYPDLKTSDSGFLKFFFILGKLNLVLGYSSIVSLSLLLR 675

Query: 26   QMQHTLYW 3
            Q QH LYW
Sbjct: 676  QTQHALYW 683


>ref|XP_007203912.1| hypothetical protein PRUPE_ppa016794mg, partial [Prunus persica]
            gi|462399443|gb|EMJ05111.1| hypothetical protein
            PRUPE_ppa016794mg, partial [Prunus persica]
          Length = 1855

 Score =  139 bits (349), Expect = 2e-30
 Identities = 97/311 (31%), Positives = 143/311 (45%), Gaps = 8/311 (2%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIARHRAA + QC +    +    +H                    +   +I   +   +
Sbjct: 388  RIARHRAASNVQCAKDGLRKSFATIHFNFLLKILFILACIWRVLCKIIHFIIRLLTFRKV 447

Query: 731  -ANQHVEADRPLEVVSRYSCSHYCFAF--RRICITISPTSAVPNLVYGQAEAPINIYPFS 561
             A +  +A+  L++VS   C+ +CF      + ITIS  + +   V  + E+ I      
Sbjct: 448  LAKEPKKAN--LKIVSGGPCTEFCFILILGNVLITISHINEIQLAVNEKLESHIGTSCSD 505

Query: 560  LLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXX 381
             LSF + +D + L Y+  +  +S+  SCG  KVRSSS     + E + K+          
Sbjct: 506  FLSFRLSVDSLLLKYVENTCEQSVLISCGQLKVRSSSLLEATVKESSSKSYFSSMEAHWK 565

Query: 380  XXXXGDLKVLWSDPATKSLDPEKVASDSFIPG-----GNAWFLHLERYLEEMWANWGKKS 216
                    +LW++PA          S+++ PG       A    L+ +L +MW NW    
Sbjct: 566  ESNDDLKNILWAEPAQNF-----PLSETYKPGYADHVEGACLSLLKNFLGDMWLNWNTAC 620

Query: 215  KKFVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASL 36
            K+F   E QY ENPFL CEIKN +  P     D G  K  L++GKLN  LG SSI+S SL
Sbjct: 621  KEFEKSEIQYFENPFLLCEIKNFLTYPDLKNSDSGFLKFFLTLGKLNIVLGCSSILSISL 680

Query: 35   LLWQMQHTLYW 3
            L  Q+QH L+W
Sbjct: 681  LFKQIQHALFW 691


>ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617616 [Citrus sinensis]
          Length = 3197

 Score =  138 bits (347), Expect = 3e-30
 Identities = 102/316 (32%), Positives = 143/316 (45%), Gaps = 13/316 (4%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIAR+RAA++ Q  E S  +  +  H                  + +F  +     +  +
Sbjct: 388  RIARYRAAVNVQRDEDSDKKFSVSSHLKIFSKILPLLACVWKAMYRIFHLIAQLLFLFRL 447

Query: 731  ANQHVEADRPLE--VVSRYSCSHYCFAFR--RICITISPT-SAVPNLVYGQAEAPINIYP 567
            + +  E+   +   +VS YS    CF     ++ IT  P  SA P  V  + E+   I  
Sbjct: 448  STKDPESSVNVRQGIVSEYSYPQRCFCLNLEKLFITFYPEHSAEP--VNQRLESQTGISY 505

Query: 566  FSLLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXX 387
               LSFC+ +D + L+Y    + KS  FSCG  KV SSS    PL   +  + T      
Sbjct: 506  SDFLSFCLSVDALILMYTEDISEKSFLFSCGQLKVTSSSYIRAPLRRSSSMDSTASVKGH 565

Query: 386  XXXXXXGDLK-VLWSDPA-------TKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWAN 231
                   + K VLW +PA       T    P   A  +F P        LE +L EMW N
Sbjct: 566  RRKGRVTNAKIVLWGEPAELFTLSETNKSSPTDHAEGAFDPV-------LEDFLGEMWFN 618

Query: 230  WGKKSKKFVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSI 51
            W +   KF   E +Y ENP+L CE K+ +  P    PD G WK  L++GKLN  L YSS+
Sbjct: 619  WKRFCMKFDESEIEYSENPWLLCETKSFLTYPDLKNPDSGFWKCNLTVGKLNLALEYSSL 678

Query: 50   ISASLLLWQMQHTLYW 3
            +S +LLL Q+QH   W
Sbjct: 679  LSMALLLRQIQHVATW 694


>ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citrus clementina]
            gi|557527785|gb|ESR39035.1| hypothetical protein
            CICLE_v10024678mg [Citrus clementina]
          Length = 3169

 Score =  138 bits (347), Expect = 3e-30
 Identities = 102/316 (32%), Positives = 143/316 (45%), Gaps = 13/316 (4%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIAR+RAA++ Q  E S  +  +  H                  + +F  +     +  +
Sbjct: 388  RIARYRAAVNVQRDEDSDKKFSVSSHLKIFSKILPLLACVWKAMYRIFHLIAQLLFLFRL 447

Query: 731  ANQHVEADRPLE--VVSRYSCSHYCFAFR--RICITISPT-SAVPNLVYGQAEAPINIYP 567
            + +  E+   +   +VS YS    CF     ++ IT  P  SA P  V  + E+   I  
Sbjct: 448  STKDPESSVNVRQGIVSEYSYPQRCFCLNLEKLFITFYPEHSAEP--VNQRLESQTGISY 505

Query: 566  FSLLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXX 387
               LSFC+ +D + L+Y    + KS  FSCG  KV SSS    PL   +  + T      
Sbjct: 506  SDFLSFCLSVDALILMYTEDISEKSFLFSCGQLKVTSSSYIRAPLRRSSSMDSTASVKGH 565

Query: 386  XXXXXXGDLK-VLWSDPA-------TKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWAN 231
                   + K VLW +PA       T    P   A  +F P        LE +L EMW N
Sbjct: 566  RRKGRVTNAKIVLWGEPAELFTLSETNKSSPTDHAEGAFDPV-------LEDFLGEMWFN 618

Query: 230  WGKKSKKFVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSI 51
            W +   KF   E +Y ENP+L CE K+ +  P    PD G WK  L++GKLN  L YSS+
Sbjct: 619  WKRFCMKFDESEIEYSENPWLLCETKSFLTYPDLKNPDSGFWKCNLTVGKLNLALEYSSL 678

Query: 50   ISASLLLWQMQHTLYW 3
            +S +LLL Q+QH   W
Sbjct: 679  LSMALLLRQIQHVATW 694


>ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus communis]
            gi|223538452|gb|EEF40058.1| hypothetical protein
            RCOM_0603630 [Ricinus communis]
          Length = 1720

 Score =  137 bits (344), Expect = 8e-30
 Identities = 95/305 (31%), Positives = 145/305 (47%), Gaps = 2/305 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIAR++A +     E S+ E  ++                      +  S I+ F     
Sbjct: 388  RIARYKATLSIPQGEDSYKEYSVRSQFQVFSKVLSLLVFTWNVIHRVVLSNIHAFLSIVF 447

Query: 731  ANQHVEADRPLEVVSRYSCSHYCFA--FRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
            + Q  + D  L ++S   C  YCF   F ++ IT    + + N++  + E+ I I    +
Sbjct: 448  SRQEPKFDGHLGIISEDHCPQYCFLLNFGKVLITFCSGNTIHNVIK-KLESHIGISLPDI 506

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
             SFC+ LD + L+Y+     +S S SCG  KV++SS      +E + K+ T         
Sbjct: 507  HSFCLSLDALLLVYVDDIFEQSFSLSCGKLKVKTSSVTGDTATEGSSKHHTVKGNRERMT 566

Query: 377  XXXGDLKVLWSDPATKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVSF 198
                   VL  +PA   L  +    ++     +A    L+ +L EMW  W +  KK+   
Sbjct: 567  ANDSKT-VLQGEPAQIFLPLQNSQKNAEGQDESAHGPFLKTFLGEMWLTWRRACKKYDDN 625

Query: 197  EAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQ 18
            E +Y ENP+L CEIKN ++ P    P+ GLWK  L++GKLN  LGY S+IS ++LL QMQ
Sbjct: 626  EIEYSENPWLLCEIKNCLLHPGLKGPNSGLWKCNLTVGKLNITLGYLSMISMAILLEQMQ 685

Query: 17   HTLYW 3
            H L W
Sbjct: 686  HALKW 690


>ref|XP_002310176.2| hypothetical protein POPTR_0007s11940g [Populus trichocarpa]
            gi|550334700|gb|EEE90626.2| hypothetical protein
            POPTR_0007s11940g [Populus trichocarpa]
          Length = 914

 Score =  118 bits (295), Expect = 4e-24
 Identities = 93/304 (30%), Positives = 135/304 (44%), Gaps = 3/304 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIAR+RA  + Q  ++S  E  +                     + +  S+++ F    +
Sbjct: 383  RIARYRAVSNIQNGKNSFKESSMDKQVNVFSKILSVFIVIWNVMYKILLSILHCFFFIIL 442

Query: 731  ANQHVEAD-RPLEVVSRYSCSHYCFA--FRRICITISPTSAVPNLVYGQAEAPINIYPFS 561
              Q  + D  P      YS S YCF   F +I +T S TS   N V  + E+   I    
Sbjct: 443  FFQRPKLDWNPGNNSEDYS-SRYCFLLNFGKILVTFSSTSKHKN-VDERIESHTGISYSD 500

Query: 560  LLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXX 381
            + SF + +  + L Y+     +SLS SCG  KV+SSS     + +++ KN          
Sbjct: 501  IHSFSLSIHMLLLAYVDEVFEQSLSLSCGKLKVKSSSVMETAIVDRSVKNPFSSKKVRRK 560

Query: 380  XXXXGDLKVLWSDPATKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVS 201
                    +L   PA   L  +   +    P       +L+  + EMW  W K S  +  
Sbjct: 561  GSVDKLKTILMGKPAQVFLPSQTSETSVANPAEGTCNPYLQTLMGEMWLAWQKSSAGYKD 620

Query: 200  FEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
             E  Y E P+L CEIKN +MDP    P  G WK  L+ GKLN  LGYSS++S ++LL Q+
Sbjct: 621  NEIAYSETPWLLCEIKNCLMDPNLKRPVSGFWKCSLTAGKLNLALGYSSVLSLAILLGQI 680

Query: 20   QHTL 9
            QH L
Sbjct: 681  QHAL 684


>gb|EXB37241.1| hypothetical protein L484_020300 [Morus notabilis]
          Length = 874

 Score =  109 bits (272), Expect = 2e-21
 Identities = 94/307 (30%), Positives = 136/307 (44%), Gaps = 6/307 (1%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIAR+RAA++ Q   S  S +   V                    + F   + FF     
Sbjct: 388  RIARYRAALNVQSVFSKESYVNTHVKFFWKIFPPLGVIWKLILNLFHFIVRLLFFW---- 443

Query: 731  ANQHVEADRPLEVVSRYSCSHYCFAFR--RICITISPTSAVPNLVYGQAEAPINIYPFS- 561
                      LEVVS     H+ F+    RI + IS    +      + E+ I I PFS 
Sbjct: 444  RKAKAPTGEYLEVVSDDPFQHFGFSLNAGRILVNISHMDEIQLSEIEKLESSIGI-PFSD 502

Query: 560  LLSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXX 381
             +SF + ++ + L Y      +SL  SCG FKV+SSS    PL + + K           
Sbjct: 503  FISFSLSINALLLNYREDICEQSLVVSCGQFKVKSSSLMETPLRQDDSKIFPSHAKGQWE 562

Query: 380  XXXXGDLKVLWSDPATK---SLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKK 210
                    +LW +PA     S   +K  +D+     +++   LE  L EMW+NW K   +
Sbjct: 563  ESNNHLESILWFEPAQTFPLSETSKKSIADNAQGDCDSF---LENCLGEMWSNWAKGCVQ 619

Query: 209  FVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLL 30
            F   + QY ENPFL  E+ +++  P       G WK   ++GKL+  LG SSIIS SLL+
Sbjct: 620  FEKSDIQYSENPFLLLEMTSLLTYPGLKNSYSGFWKCFFTLGKLHLGLGCSSIISISLLI 679

Query: 29   WQMQHTL 9
             Q+Q+ L
Sbjct: 680  RQLQNVL 686


>ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527166 isoform X1 [Glycine
            max]
          Length = 3165

 Score = 90.9 bits (224), Expect = 6e-16
 Identities = 81/309 (26%), Positives = 131/309 (42%), Gaps = 6/309 (1%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            RIARHRAA+     +S +                            +   ++N FS   I
Sbjct: 388  RIARHRAALK----DSINCHEDFVTTNKFFRPFIFLLSFMWKLISTIIHCLVNIFSREKI 443

Query: 731  ANQHVEADRPLEVVSRYSCSHYCFA--FRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
                      LE +    C   CF   F +I IT+S  + +   VY + ++   I   + 
Sbjct: 444  VQDPDIDGCCLESLIEDPCQSCCFVLNFGKIIITVSQINEIDPSVYEKLQSLAGIACSAF 503

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            LS C  +D + LI +     + +  SCG  KV S+    + +SE+    +          
Sbjct: 504  LSICFCIDALLLISVKDIFEQRIFLSCGQMKVESAP---LTMSEEACTMDPLSSAKGNEK 560

Query: 377  XXXGDLK-VLWSDPATKSLDPEKVASDSFIPGGNAWFL---HLERYLEEMWANWGKKSKK 210
                 ++ ++W +PA       K+   S I GG A      H+E ++++   NW +  +K
Sbjct: 561  EGINHMESIMWVEPA-------KIFLLSEIDGGQAEDCCDSHIEIFMKKFSVNWKRICRK 613

Query: 209  FVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLL 30
                E ++ ENP +  +I+    +P    PD G  +  L +GKLN  L +SS+ S SL+L
Sbjct: 614  LNENEIEFSENPCILSKIEISSTNPDPKNPDFGFCECGLMLGKLNLVLTHSSVSSLSLIL 673

Query: 29   WQMQHTLYW 3
             Q+QH LYW
Sbjct: 674  SQIQHALYW 682


>ref|XP_007047104.1| Vacuolar protein sorting-associated protein 13C, putative [Theobroma
            cacao] gi|508699365|gb|EOX91261.1| Vacuolar protein
            sorting-associated protein 13C, putative [Theobroma
            cacao]
          Length = 3155

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 75/231 (32%), Positives = 107/231 (46%), Gaps = 4/231 (1%)
 Frame = -3

Query: 683  YSCSHYCFAFRRICITISPTSAVPNLVYGQAEAPINIYPFSLLSFCVVLDGVFLIYMAGS 504
            YS   +  +  +I IT+S  S V   V  + E+ I I    + SF   +  + L+Y+   
Sbjct: 459  YSRLRFILSVGKIYITLSSMSGVQT-VSEKVESHIGISYSDVFSFRFSIKVLLLMYIEDI 517

Query: 503  TGKSLSFSCGDFKVRSSSSHHVPLSE--KNWKNETXXXXXXXXXXXXGDLKVLWSDPATK 330
              ++LSFSCG  KV+   S      E  KN KN                  +L  +PA  
Sbjct: 518  FEQTLSFSCGKLKVKYFISSVGGAKERVKNLKN------------------ILHGEPAKI 559

Query: 329  SL--DPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVSFEAQYLENPFLFCEI 156
             L  +  K ++ S   GG    L  E ++ EM  NW +  K+F   E +  ENP L  E+
Sbjct: 560  FLLSESNKTSACSHADGGCDPCL--ESFIGEMCLNWRRACKQFEESEIKCPENPRLLFEM 617

Query: 155  KNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQHTLYW 3
            K+ +  P       GLWK  L++GK N  LGY SI+S  +LL Q+QH L W
Sbjct: 618  KSFLRHPDLKKLGSGLWKCNLTVGKFNIVLGYLSILSVVMLLRQIQHALNW 668


>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332645140|gb|AEE78661.1| uncharacterized protein
            AT3G50380 [Arabidopsis thaliana]
          Length = 3072

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 71/301 (23%), Positives = 118/301 (39%), Gaps = 3/301 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            R+AR+RA +++Q  +  + E  L  H                     F S+  F  +  +
Sbjct: 387  RVARYRACLNSQDADDDYDESSLYGHFKYLSKTTWVLAYIWRLISRTFWSIACFLWLNKL 446

Query: 731  ANQHVEADRPLEVVSRYSCS--HYCFAFRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q ++ DR  E  S       H      ++ +T  P   + + +  +          ++
Sbjct: 447  LTQELQTDRNNEDDSECVSLEFHAVVNLGKLSVTCYPEKIISSFMTSKDST--GHVDSNI 504

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            +  C+ +D   ++Y  G   + LS SCG  KV SSS  +     K+ K+ +         
Sbjct: 505  VMLCLSVDEFLVLYTVGCLTQYLSASCGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKH 564

Query: 377  XXXGDLKVLWSDPATK-SLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVS 201
                   +L  DPA + S       SD      +   LHL+  L EMW NW     K   
Sbjct: 565  MREDVKTILDMDPAQQISKTVNNHGSDQ-----HEGMLHLQNLLREMWLNWNSNCMKLDK 619

Query: 200  FEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
                  + P L  +IK+ +        D   WK  + +GKL+    YSS+ S +LL+WQ+
Sbjct: 620  STFTISDKPCLLVDIKSCMAYEVVGNQDSEFWKCSMVLGKLDIVFEYSSLFSLALLIWQI 679

Query: 20   Q 18
            +
Sbjct: 680  E 680


>emb|CAB62317.1| putative protein [Arabidopsis thaliana]
          Length = 3071

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 71/301 (23%), Positives = 118/301 (39%), Gaps = 3/301 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            R+AR+RA +++Q  +  + E  L  H                     F S+  F  +  +
Sbjct: 387  RVARYRACLNSQDADDDYDESSLYGHFKYLSKTTWVLAYIWRLISRTFWSIACFLWLNKL 446

Query: 731  ANQHVEADRPLEVVSRYSCS--HYCFAFRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q ++ DR  E  S       H      ++ +T  P   + + +  +          ++
Sbjct: 447  LTQELQTDRNNEDDSECVSLEFHAVVNLGKLSVTCYPEKIISSFMTSKDST--GHVDSNI 504

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            +  C+ +D   ++Y  G   + LS SCG  KV SSS  +     K+ K+ +         
Sbjct: 505  VMLCLSVDEFLVLYTVGCLTQYLSASCGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKH 564

Query: 377  XXXGDLKVLWSDPATK-SLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVS 201
                   +L  DPA + S       SD      +   LHL+  L EMW NW     K   
Sbjct: 565  MREDVKTILDMDPAQQISKTVNNHGSDQ-----HEGMLHLQNLLREMWLNWNSNCMKLDK 619

Query: 200  FEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
                  + P L  +IK+ +        D   WK  + +GKL+    YSS+ S +LL+WQ+
Sbjct: 620  STFTISDKPCLLVDIKSCMAYEVVGNQDSEFWKCSMVLGKLDIVFEYSSLFSLALLIWQI 679

Query: 20   Q 18
            +
Sbjct: 680  E 680


>ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp.
            lyrata] gi|297323582|gb|EFH54003.1| hypothetical protein
            ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata]
          Length = 3074

 Score = 83.6 bits (205), Expect = 1e-13
 Identities = 73/301 (24%), Positives = 117/301 (38%), Gaps = 3/301 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            R+AR+R  + +Q ++ S+ E  +  H                     F S+  F      
Sbjct: 387  RVARYRTCLQSQNSDESYDESFVYGHFNCLSKTTGVLACIWRLISRTFWSIACFLWSNKY 446

Query: 731  ANQHVEADRPLEVVSRYSCS--HYCFAFRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q ++  R  E  S       H      ++ IT  P   + +L+  +          ++
Sbjct: 447  LTQELQTGRNNEDDSELVSLEFHAVVNLGKVSITFYPEKMISSLLTSKDST--GHMDSNI 504

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            +  C+++D   ++Y  G   + LS SCG  KV SSS  +     K  K+ +         
Sbjct: 505  VILCLLVDEFLVMYTVGCLSQCLSASCGKLKVESSSFKNTSRFMKPTKDPSSSSEGNKKH 564

Query: 377  XXXGDLKVLWSDPATK-SLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVS 201
                   +L  DPA + S       SD      +   LHL+  L EMW NW +   K   
Sbjct: 565  MREDVKTILDMDPAQRISKTVNNHGSDQ-----HEGMLHLQNLLREMWLNWNRNCMKLDK 619

Query: 200  FEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
                  +NP L  +IK+ +        D   WK  + +GKL+  L YSS  S +LL+WQ 
Sbjct: 620  GTFTISDNPCLLVDIKSCMAYEDVGNQDSKFWKCSMVLGKLDIVLEYSSFFSLALLIWQT 679

Query: 20   Q 18
            +
Sbjct: 680  E 680


>gb|EYU44333.1| hypothetical protein MIMGU_mgv1a000009mg [Mimulus guttatus]
          Length = 3157

 Score = 80.9 bits (198), Expect = 6e-13
 Identities = 60/213 (28%), Positives = 100/213 (46%), Gaps = 3/213 (1%)
 Frame = -3

Query: 647  ICITISPTSAVPNLVYGQAEAPINIYPFSLLSFCVVLDGVFLIYMAGSTGKSLSFSCGDF 468
            I + + P +AV +   G+A +   I    LLS    +DG+F+ YMA  + +  +F+ G  
Sbjct: 473  ISVALIPDNAVQSTSRGKAVSDTKISYDDLLSLSFSIDGIFVRYMANISEQCFTFASGCL 532

Query: 467  KVRSSSSHHVPLS---EKNWKNETXXXXXXXXXXXXGDLKVLWSDPATKSLDPEKVASDS 297
            KV S S+     S   E++W+ E                 V+W +PA  +  PE+   D+
Sbjct: 533  KVLSLSTPTAGASGYLEEHWEKEVEKRQI-----------VIWGEPAEITCLPEETC-DA 580

Query: 296  FIPGGNAWFLHLERYLEEMWANWGKKSKKFVSFEAQYLENPFLFCEIKNVIMDPCFLTPD 117
                      +L+R L ++W NW     K        ++ P++ CEI + ++D   ++  
Sbjct: 581  AADIARTSDPYLDRLLGQLWLNWKNTCLKSEEDNMPNVQAPWILCEISSSLIDH-GISDS 639

Query: 116  CGLWKLILSMGKLNFDLGYSSIISASLLLWQMQ 18
            C  +   L +GKLNF+L Y S  S  +LL Q+Q
Sbjct: 640  CSRFNCGLVVGKLNFNLEYCSFASTVVLLSQIQ 672


>ref|XP_004142023.1| PREDICTED: uncharacterized protein LOC101222087 [Cucumis sativus]
          Length = 3608

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 70/301 (23%), Positives = 118/301 (39%), Gaps = 7/301 (2%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            +IAR+RA  + +  +   S +QLK                     ++ + ++   +    
Sbjct: 383  KIARYRAIRNIEDKKEVSSIVQLKFFYQVFSLLSCIWKMLCGIFCFIERCIVKTLT---- 438

Query: 731  ANQHVEADRPLEVVSRYSCSHYCFAFR--RICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q  + D  +++V R S S +CF     ++ ++I P   +    +   ++   I     
Sbjct: 439  --QPHKLDGCVKIVRRDSNSQFCFMLNTGKLLVSIYPPDDIQPPTFENLKSSFGIPSSFS 496

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            LSFC   D + ++YM     +SL  SC  F V        PL      N           
Sbjct: 497  LSFCFSFDSLVVMYMVDLCEQSLLMSCDQFNV-------TPLPSVEASNGGGCSVDLLGS 549

Query: 377  XXXGDLKVLWSDPATKSLDPEKVASDSFIPGGNAWF-----LHLERYLEEMWANWGKKSK 213
                +++   S  +    +P    + SF P             + +YLE MW  W    +
Sbjct: 550  LEGCEMERANSLKSFIRGEP----AQSFFPSNGREIDTGCNQFIVKYLEGMWLRWKSVCR 605

Query: 212  KFVSFEAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLL 33
                    Y +NP+  CEI + +           +WK  L++GKLNF L YSS++SA+LL
Sbjct: 606  NLEEGMIPYSDNPWFLCEISSSMTKSVLENSSTSIWKCNLALGKLNFALQYSSVLSAALL 665

Query: 32   L 30
            L
Sbjct: 666  L 666


>ref|XP_007155985.1| hypothetical protein PHAVU_003G249100g [Phaseolus vulgaris]
            gi|561029339|gb|ESW27979.1| hypothetical protein
            PHAVU_003G249100g [Phaseolus vulgaris]
          Length = 3168

 Score = 77.4 bits (189), Expect = 7e-12
 Identities = 58/235 (24%), Positives = 106/235 (45%), Gaps = 2/235 (0%)
 Frame = -3

Query: 701  LEVVSRYSCSHYCFA--FRRICITISPTSAVPNLVYGQAEAPINIYPFSLLSFCVVLDGV 528
            LE +   +C  YC    F +I +T+S  +     VY + ++P  I   ++LS C  +D +
Sbjct: 457  LESLIEDACQIYCLTINFGKIIMTVSKINNSHPSVYEKLQSPAGIVCSNVLSICFCIDAL 516

Query: 527  FLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXXXXXGDLKVLW 348
             L+ +     + +  SCG  KV S+       ++    NE                 ++W
Sbjct: 517  LLVSVDDIFEQKVFLSCGQMKVESTPP--TMSADACTVNELSSAKGNEIGGVNRRESIMW 574

Query: 347  SDPATKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVSFEAQYLENPFL 168
              PA   L  E  A  +     ++   ++E ++E++  +W +  +K    E +Y ENP L
Sbjct: 575  VAPAKIFLLSEIDAGQT----EDSCDAYIESFMEKLSMSWKRVCRKLNENEIEYSENPCL 630

Query: 167  FCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQHTLYW 3
              +++          P+ G  +  L +GKLN  L +SS+   SL+L +++H +YW
Sbjct: 631  LSKVEISSTCQDHKNPNFGFCECGLMLGKLNLVLSHSSVSLLSLVLGKIEHGIYW 685


>ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutrema salsugineum]
            gi|557106410|gb|ESQ46725.1| hypothetical protein
            EUTSA_v10027614mg [Eutrema salsugineum]
          Length = 3132

 Score = 77.4 bits (189), Expect = 7e-12
 Identities = 69/302 (22%), Positives = 117/302 (38%), Gaps = 2/302 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            R+AR+R  +++Q       E  +  H                     F S+  F S   +
Sbjct: 386  RVARYRTCVNSQNGSDDFDEASIYGHFNFLCKITWVLAYIWRLISQTFWSIACFLSSRKL 445

Query: 731  ANQHVEADRPLEVVSRYSCS--HYCFAFRRICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q ++ DR  E  S       H    F ++ IT  P   + + +  +          ++
Sbjct: 446  LTQELQTDRNNEADSEPVSLEFHAVVNFGKLSITFYPEKMISSFMTSKDSTGHT--DSNV 503

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
            ++ C+ +D   +++  G   +  S SCG  KV SS           +   T         
Sbjct: 504  VTLCLSVDEFLVMHTVGCLTQCSSASCGKLKVMSSGFGKT----SRYMRSTKDPGSSAER 559

Query: 377  XXXGDLKVLWSDPATKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVSF 198
               G +K +      +S+   K A + +    +   LHL+  L EMW+ W     K    
Sbjct: 560  KMRGHVKTILEMDPVQSILLSK-AGNHYGNEQHEGNLHLQNLLREMWSTWNSNCLKLDKS 618

Query: 197  EAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQ 18
              +  +NP L  ++K  +        D GLWK  + +GKL+  L YSS+ S +LL+WQ Q
Sbjct: 619  TFEISDNPCLLVDMKTCMAYQDAGNQDSGLWKCSMVLGKLDIVLEYSSLFSMALLIWQTQ 678

Query: 17   HT 12
             +
Sbjct: 679  QS 680


>ref|XP_006856204.1| hypothetical protein AMTR_s00059p00194330 [Amborella trichopoda]
            gi|548860063|gb|ERN17671.1| hypothetical protein
            AMTR_s00059p00194330 [Amborella trichopoda]
          Length = 3190

 Score = 74.7 bits (182), Expect = 5e-11
 Identities = 69/234 (29%), Positives = 104/234 (44%), Gaps = 10/234 (4%)
 Frame = -3

Query: 680  SCSHYCFAFR--RICITISPTSAVPNLVYGQAEAPINIYPFSLL-SFCVVLDGVFLIYMA 510
            S +  CF     RI I IS  +    L   +    +N  P  LL S   VL+ + L Y  
Sbjct: 422  SKTQQCFTLNIGRIFIRISHENRA-QLTNRRKTDAVNKPPGILLGSVIFVLNSLCLSYDV 480

Query: 509  GSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXXXXXGD----LKVLWSD 342
              +   LS + G F ++ S S  +   E N   +              D     K+LWS 
Sbjct: 481  NDSANFLSLTYGQFDIQFSPSSRMK-KEANQLEKEGNFEGIEFEADVVDGHDFKKILWSM 539

Query: 341  PATKSLDPEKVASDSFIPGG---NAWFLHLERYLEEMWANWGKKSKKFVSFEAQYLENPF 171
            PA +    +K   +S   G    NAW + LE +L EMW++W   +   ++        PF
Sbjct: 540  PAPQV--QQKGKGNSINYGNDFRNAWTMLLENHLSEMWSDWKISTDFCIAKGIPCSREPF 597

Query: 170  LFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQHTL 9
            L  E+K   ++P       G  K+ L+ GKLNFDL +S++ S SLL+ Q+++ L
Sbjct: 598  LILEVKAFAINPYLNGCGSGFLKIGLAAGKLNFDLDHSTMASVSLLVMQLKYAL 651


>ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Capsella rubella]
            gi|482561886|gb|EOA26077.1| hypothetical protein
            CARUB_v10019496mg [Capsella rubella]
          Length = 3074

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 69/300 (23%), Positives = 115/300 (38%), Gaps = 2/300 (0%)
 Frame = -3

Query: 911  RIARHRAAMHAQCTESSHSELQLKVHGXXXXXXXXXXXXXXXXXWYLFKSVINFFSVGGI 732
            R+AR+R  +++Q  +  + E  L  H                     F SV     +  +
Sbjct: 387  RVARYRTCLNSQNVDDIYDESSLYGHFNCLSKITWVLAYIWSLISKTFWSVACCLWLNKL 446

Query: 731  ANQHVEADRPLEVVS-RYSCSHYCFAFR-RICITISPTSAVPNLVYGQAEAPINIYPFSL 558
              Q ++ DR  E  S R S   +   +  ++ +T  P   V        ++ I++     
Sbjct: 447  LTQELQPDRNNEDDSERLSLGFHAVVYLGKLSVTFYPEKMVSKDRPEHMDSNISM----- 501

Query: 557  LSFCVVLDGVFLIYMAGSTGKSLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXX 378
               C+ +D + ++   G   + LS SCG  KV SS   +      + ++ +         
Sbjct: 502  --LCLSVDELLVMSTVGCFTQCLSASCGKLKVESSDLKNTSRFMNSTQDPSSSSEGNKKH 559

Query: 377  XXXGDLKVLWSDPATKSLDPEKVASDSFIPGGNAWFLHLERYLEEMWANWGKKSKKFVSF 198
                   V+  DPA +         D      N   LHL   L EMW NW +   +    
Sbjct: 560  MGEDVRTVVDMDPAQRISKTVSNHGDD----QNEGILHLHNLLREMWLNWNRNCLRLDKS 615

Query: 197  EAQYLENPFLFCEIKNVIMDPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQMQ 18
                 +NP L  +I+N +        D   WK  + +GKL+  L YSS+ S +LL+WQ +
Sbjct: 616  TFTISDNPCLLVDIQNCMAYEHVGNQDSEFWKCSMVLGKLDIVLEYSSLFSMALLIWQTE 675


>ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257436 [Solanum
            lycopersicum]
          Length = 3178

 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 62/219 (28%), Positives = 89/219 (40%), Gaps = 2/219 (0%)
 Frame = -3

Query: 671  HYCFAFRRICITISPTSAVPNLVYGQAEAPI-NIYPFSLLSFCVVLDGVFLIYMAGSTGK 495
            H C       I+ISP + V      +    + + YP  LL+FC+ +D   L      + +
Sbjct: 466  HICLYVGDFSISISPDNEVSPSFSRKLVLDVGHSYP-GLLTFCLSVDFFCLRCSKDVSEQ 524

Query: 494  SLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXXXXXGDLKVLWSDPATKSLDPE 315
              SF+CG  KV SS      L E                        LW +P       E
Sbjct: 525  YFSFACGCLKVVSS------LMEDKANKFNNNFKGRPRKNIHNLQPTLWGEPYHVLYFTE 578

Query: 314  KVASDSFIPGGNAWFLHLERYL-EEMWANWGKKSKKFVSFEAQYLENPFLFCEIKNVIMD 138
               +DS   GG+  F+H +  L E    NW   S  FV  E Q ++NPF+ CEIK  + D
Sbjct: 579  SGGADSHDTGGD--FVHTQNSLIERACLNWRTFSSGFVESEIQNMKNPFILCEIKGFLTD 636

Query: 137  PCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
                    G     + MG+LN  L Y  I+S +++  Q+
Sbjct: 637  RSLKNLTVGYTTCCMVMGRLNLVLEYLVIVSVTVICRQV 675


>ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601421 isoform X2 [Solanum
            tuberosum]
          Length = 2549

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 61/220 (27%), Positives = 88/220 (40%), Gaps = 3/220 (1%)
 Frame = -3

Query: 671  HYCFAFRRICITISPTSAVPNLVYGQAEAPI-NIYPFSLLSFCVVLDGVFLIYMAGSTGK 495
            H C       I+ISP + V      +    + + YP  LL+FC+ +D   L Y    + +
Sbjct: 466  HICLYVGDFSISISPDNEVSPSFSRKLVLDVGHSYP-GLLTFCLSVDFFCLRYSKDVSEQ 524

Query: 494  SLSFSCGDFKVRSSSSHHVPLSEKNWKNETXXXXXXXXXXXXGDLKVLWSDPA-TKSLDP 318
              SF+CG  KV SS      L E                        LW +P        
Sbjct: 525  YFSFACGSLKVVSS------LMEDKANKFNNNFKGRPRKNIHNLQPTLWGEPYHVLHFTE 578

Query: 317  EKVASDSFIPGGNAWFLHLER-YLEEMWANWGKKSKKFVSFEAQYLENPFLFCEIKNVIM 141
               A+     GG+  F+H    ++E    NW   S  FV  E Q +ENPF+ CEIK  + 
Sbjct: 579  SGGANPPHGTGGD--FVHTPNSFVERACMNWRTFSSGFVENEIQNMENPFILCEIKGFLT 636

Query: 140  DPCFLTPDCGLWKLILSMGKLNFDLGYSSIISASLLLWQM 21
            D        G     + MG+LN  L Y  I+S +++  Q+
Sbjct: 637  DKSLKNLTAGYTTCCMVMGRLNLVLEYIVIVSVTVICRQV 676

