BLASTX nr result

ID: Catharanthus23_contig00025095 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00025095
         (1337 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347138.1| PREDICTED: uncharacterized protein LOC102595...   135   4e-29
ref|XP_004232797.1| PREDICTED: uncharacterized protein LOC101253...   120   1e-24
ref|XP_006594627.1| PREDICTED: uncharacterized protein LOC102668...   112   5e-22
ref|XP_004485700.1| PREDICTED: uncharacterized protein LOC101496...   109   3e-21
ref|XP_003530988.2| PREDICTED: tripartite motif-containing prote...   106   2e-20
ref|XP_004504965.1| PREDICTED: uncharacterized protein LOC101502...   101   6e-19
gb|ESW31241.1| hypothetical protein PHAVU_002G221700g [Phaseolus...    98   7e-18
gb|ABD32605.1| conserved hypothetical protein [Medicago truncatula]    98   9e-18
ref|XP_003608245.1| hypothetical protein MTR_4g091220 [Medicago ...    98   9e-18
ref|XP_002271376.2| PREDICTED: uncharacterized protein LOC100241...    96   4e-17
ref|XP_003635001.1| PREDICTED: uncharacterized protein LOC100854...    95   6e-17
ref|XP_002524684.1| hypothetical protein RCOM_1095870 [Ricinus c...    90   2e-15
gb|ESW20241.1| hypothetical protein PHAVU_006G192300g [Phaseolus...    89   3e-15
gb|EXB51826.1| hypothetical protein L484_006399 [Morus notabilis]      86   5e-14
gb|EMJ12980.1| hypothetical protein PRUPE_ppa009687mg [Prunus pe...    76   3e-11
gb|EOY28422.1| Uncharacterized protein TCM_029993 [Theobroma cacao]    75   6e-11
ref|XP_006449210.1| hypothetical protein CICLE_v10016271mg [Citr...    71   9e-10
ref|XP_002316662.2| hypothetical protein POPTR_0011s04240g [Popu...    71   9e-10
ref|NP_193881.1| uncharacterized protein [Arabidopsis thaliana] ...    69   6e-09
ref|XP_002867843.1| hypothetical protein ARALYDRAFT_914525 [Arab...    67   2e-08

>ref|XP_006347138.1| PREDICTED: uncharacterized protein LOC102595206 [Solanum tuberosum]
          Length = 256

 Score =  135 bits (340), Expect = 4e-29
 Identities = 97/278 (34%), Positives = 137/278 (49%)
 Frame = +3

Query: 333  PLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKERQIE 512
            PLFSTI+T  T+I +Y PT+    +                R GA QR +Q+  K+ + E
Sbjct: 6    PLFSTIVTLYTIIFIYFPTIAASPLIISTSVLLLSLL----RLGAAQRISQKKNKKPESE 61

Query: 513  KDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAHDLI 692
              S +  S  STK     +D ++D    +    S     N  +++V + DS  +  H   
Sbjct: 62   ILS-LQISADSTK---CENDSVLDPDPDLDPIDSNVDSSNCEKDSVFEPDSEQRMFH--- 114

Query: 693  IDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXXXXX 872
                    +    S CE  S   P  +LF+ D FVEWN+RAPLEVI              
Sbjct: 115  ------ADESRVDSSCEKDSVLEPESRLFHGDCFVEWNVRAPLEVIYEAYEGEEDGE--- 165

Query: 873  XXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQWXXX 1052
                 ++EEKR+ ++ +IE+Y SLS +YP+TD+D+SS+ D   I +WDSPE++  QW   
Sbjct: 166  -----YTEEKRDEELRVIEKYASLSMYYPETDTDSSSDGDSPVIGNWDSPENVCFQWDDE 220

Query: 1053 XXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASFTSR 1166
                        KR   SEVEE+NLIEIDL PA+F  R
Sbjct: 221  DREELIEIELDCKR--NSEVEEENLIEIDLSPANFPVR 256


>ref|XP_004232797.1| PREDICTED: uncharacterized protein LOC101253150 [Solanum
            lycopersicum]
          Length = 259

 Score =  120 bits (301), Expect = 1e-24
 Identities = 92/279 (32%), Positives = 132/279 (47%), Gaps = 4/279 (1%)
 Frame = +3

Query: 333  PLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKERQIE 512
            PLFSTI+T  T+I +Y PT+    +                R GA QR +Q+  K+ + E
Sbjct: 6    PLFSTIVTLYTIIFIYFPTIAASPLIISTSLLLLSLL----RLGAAQRISQKNNKKPEFE 61

Query: 513  KDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVD--LDSTWQSAHD 686
                I          +  +DL++D    +    S+    N  +++V +  L+   +S+ D
Sbjct: 62   MLC-IQNPPDFGDSIKCENDLVLDPDPDLDPVDSDVDSSNCEKDSVFEPNLEQRNESSVD 120

Query: 687  LIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXXX 866
                           + CE  S   P  + FY D FVEWN+RAPLEVI            
Sbjct: 121  ---------------TSCEKDSVLEPESRRFYGDCFVEWNVRAPLEVIYEAYEGEEEEDG 165

Query: 867  XXXXXXXFSEEKREAQMAIIERYTSLSKFYP--DTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                   ++EEKR+ ++ +IE+Y SLS +YP  DTD+D+SS  D   I +WDSPE++  Q
Sbjct: 166  E------YTEEKRDEELRVIEKYASLSMYYPETDTDTDSSSGGDSPVIGNWDSPENVCFQ 219

Query: 1041 WXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASF 1157
            W               KR   SE EE+NLIEIDL PA+F
Sbjct: 220  WDDEDREELIEIELDCKR--NSEFEEENLIEIDLSPANF 256


>ref|XP_006594627.1| PREDICTED: uncharacterized protein LOC102668885 [Glycine max]
          Length = 277

 Score =  112 bits (279), Expect = 5e-22
 Identities = 85/277 (30%), Positives = 126/277 (45%), Gaps = 2/277 (0%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +S PLFS I+TFC LILLYLP LF++++F               R GA+QRS  E     
Sbjct: 30   SSQPLFSCIVTFCFLILLYLPHLFWKIVFSPVLFLSGVLLLLLLRLGAIQRSQNE----- 84

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
              EK + +                   EP+ +   ++EE R N  E     ++     + 
Sbjct: 85   --EKKNPV-------------------EPEPI---ANEENRGNREEKQGNPIEPVETDSQ 120

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXX 863
            D +   Y+W++     S+  I S         + +SFVEWN++APLEVI           
Sbjct: 121  DHV---YRWIT---SQSQTTIKSQMGFQSSSRFDESFVEWNVKAPLEVIYEGEET----- 169

Query: 864  XXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQW 1043
                      + + E +   I R+ SLS++YP+TDSD+SSE  F    +WDSPE++  +W
Sbjct: 170  ---------EQNQNEKRDEGILRHPSLSRYYPETDSDSSSESGFPATENWDSPENMCFRW 220

Query: 1044 --XXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFP 1148
                            +KR      +E+NLIEID+ P
Sbjct: 221  DEEDREGLIEIALDGCKKREVGFHFDEENLIEIDISP 257


>ref|XP_004485700.1| PREDICTED: uncharacterized protein LOC101496994 [Cicer arietinum]
          Length = 271

 Score =  109 bits (272), Expect = 3e-21
 Identities = 83/294 (28%), Positives = 129/294 (43%), Gaps = 4/294 (1%)
 Frame = +3

Query: 327  SDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKERQ 506
            SDPLFS I+T C LIL+YLP  F +++F               R+GA+QRS  E  KE  
Sbjct: 30   SDPLFSCIVTLCVLILIYLPHSFCKIVFSPVPILSAILLIIVLRFGAIQRSNSEE-KENL 88

Query: 507  IEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAHD 686
            +E +S   E     ++ +                + EE+ +++     +D    W S++ 
Sbjct: 89   VESESVTTEENRENRDEK-------------QGKTGEEVEIDS-----LDQIQRWVSSNS 130

Query: 687  LIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXXX 866
             I   +++   F   S              F  +SFVEWN++APLEVI            
Sbjct: 131  EI--KFEFQMGFESSS--------------FLDESFVEWNVKAPLEVIYEGEET------ 168

Query: 867  XXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQWX 1046
                     +   E  +  I RY SLS++ P++DSD+SSE +F  +  WDSPE++   W 
Sbjct: 169  --------EDISNENLVTGILRYPSLSRYCPESDSDSSSENEFPAMEKWDSPENMCYMWD 220

Query: 1047 XXXXXXXXXXXXXQKRYSES----EVEEDNLIEIDLFPASFTSR*SILRREMTG 1196
                           + +++    + EE+N+IEID+ P          RRE +G
Sbjct: 221  EEDRDGLIEIALDGCKNNDNAFGYQFEEENMIEIDISPTK--------RREFSG 266


>ref|XP_003530988.2| PREDICTED: tripartite motif-containing protein 26-like [Glycine max]
          Length = 297

 Score =  106 bits (265), Expect = 2e-20
 Identities = 85/296 (28%), Positives = 122/296 (41%), Gaps = 18/296 (6%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +SDPLF  I+ FCTL+ LYLP LF ++I                R GA+QR   E     
Sbjct: 37   SSDPLFHCIVAFCTLVFLYLPHLFLKIILSPVPILTAILLLSILRLGAIQRLQHE----- 91

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
                           +E  + H+   ++P     N   + +        ++ D T +S  
Sbjct: 92   --------------RRENHVKHE---EQPIVYEENKGSKYKGEKQSPCTIEPDET-KSVE 133

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXX 863
             +    ++WV      SE E+IS          S  FVEWN++APLEVI           
Sbjct: 134  QV----HQWVH-----SETEVIS----DMGFESSSRFVEWNVKAPLEVIYEEYGEG---- 176

Query: 864  XXXXXXXXFSEEKREAQMAI-IERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                      E   +A   + I RY SLS+FYP++DSD+ SE +F  I DWDSPE +  +
Sbjct: 177  ---------EEAGDDANENVGIMRYPSLSRFYPESDSDSESESEFPAIGDWDSPEDLEFR 227

Query: 1041 WXXXXXXXXXXXXXXQ-----------------KRYSESEVEEDNLIEIDLFPASF 1157
            W              +                 KR  E   +E+NLIEID+ P+ +
Sbjct: 228  WGEEEEEDDGDDDDEEEEDREGLIEIALDGCKRKRNLEFNFDEENLIEIDISPSRY 283


>ref|XP_004504965.1| PREDICTED: uncharacterized protein LOC101502893 [Cicer arietinum]
          Length = 297

 Score =  101 bits (252), Expect = 6e-19
 Identities = 83/284 (29%), Positives = 117/284 (41%), Gaps = 6/284 (2%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +S+PLFS IITFCTL+ LYLP LF +++                R GA+QRS  E  +  
Sbjct: 57   SSNPLFSCIITFCTLVFLYLPHLFSKIVLSPVLILTAILLLTILRVGAIQRSQHEQKENP 116

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
            Q  ++ K  +S + + E               S  +    +V    N+            
Sbjct: 117  QKYREDKQKQSSTCSIE---------------SKETQSLAKVQQWNNS------------ 149

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDS-FVEWNLRAPLEVIXXXXXXXXXX 860
                              C+     + S+  F S S FVEWN+RAPL+VI          
Sbjct: 150  ------------------CDPSEIEVNSEMGFESSSCFVEWNVRAPLDVIYEEYEGEEKG 191

Query: 861  XXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                       +EK E Q   I  Y SLS++YP++DSD+SS+       DWDSPE +  +
Sbjct: 192  NDP--------KEKEENQNMGISNYPSLSRYYPESDSDSSSD-------DWDSPEDMCFR 236

Query: 1041 W-----XXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASF 1157
            W                    KR  E + EE+NLIEID+ P  +
Sbjct: 237  WDEEDREGLIEIALDGSSKVVKREMEFQYEEENLIEIDISPTRY 280


>gb|ESW31241.1| hypothetical protein PHAVU_002G221700g [Phaseolus vulgaris]
          Length = 291

 Score = 98.2 bits (243), Expect = 7e-18
 Identities = 82/294 (27%), Positives = 118/294 (40%), Gaps = 16/294 (5%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +SDPLF  I+ FCTL+ LYLP LF +V+                R GA+Q+S  ++   R
Sbjct: 37   SSDPLFHCIVAFCTLVFLYLPNLFLKVVLSPVLILTSILLLSILRLGAIQKSRHDI---R 93

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
            +I++  +                   + P     N   E R      + ++   T     
Sbjct: 94   EIQRKHE-------------------EPPIIYEENRGTECRGEEQSWSPLEPHET----- 129

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXX 863
            +      +WV    EG         +       S  F+EWN++APL+VI           
Sbjct: 130  ETTEQVQQWVHSETEG---------VSYVGFEPSSCFMEWNVKAPLDVIY---------- 170

Query: 864  XXXXXXXXFSEEKREAQMAI--------IERYTSLSKFYPDTDSDNSSEEDFLGIRDWDS 1019
                      EE  E + A         I RY SLS+FYP++DSD+ SE DF  I DWDS
Sbjct: 171  ----------EEYGEGEEAGDDANENTGIVRYPSLSRFYPESDSDSESESDFPAIGDWDS 220

Query: 1020 PEHINL--------QWXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASF 1157
            PE +          +               +KR  E   EE+NLIEID+ P+ +
Sbjct: 221  PEEVGWGEEEEEEEEEDREGLIEIALDGFRRKRGMEFHFEEENLIEIDISPSRY 274


>gb|ABD32605.1| conserved hypothetical protein [Medicago truncatula]
          Length = 283

 Score = 97.8 bits (242), Expect = 9e-18
 Identities = 83/279 (29%), Positives = 119/279 (42%), Gaps = 1/279 (0%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPT-LFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKE 500
            +S+PLFS I+TFCTLI LYLP  LF +++F               R GA Q+   +    
Sbjct: 42   SSNPLFSCIVTFCTLIFLYLPHHLFSKIVFSPVLILTGILLLTILRLGANQKYHHK---- 97

Query: 501  RQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSA 680
                            KE +  H+ II + +  ++   EE + +       +++S  Q  
Sbjct: 98   ---------------QKETQQKHESIITKEENKATKCGEEKQNSTCPVEPKEIESLEQVH 142

Query: 681  HDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXX 860
            H                SE E+ S     + L  S SF+EWN+RAPLE+I          
Sbjct: 143  H----------------SEREVDS----QKSLESSSSFMEWNVRAPLEIIYEGYEDEEKL 182

Query: 861  XXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                        EK E        Y SLS++YP++DSD+SSE++F     WDSPE    +
Sbjct: 183  DDP--------NEKEENWNMGNSNYPSLSRYYPESDSDSSSEDEFPVKEYWDSPEEWEEE 234

Query: 1041 WXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASF 1157
                            KR  E + EE+NLIEID+ P  +
Sbjct: 235  DREGLIEIALDGSKMMKRDLEFQYEEENLIEIDISPTRY 273


>ref|XP_003608245.1| hypothetical protein MTR_4g091220 [Medicago truncatula]
            gi|355509300|gb|AES90442.1| hypothetical protein
            MTR_4g091220 [Medicago truncatula]
          Length = 326

 Score = 97.8 bits (242), Expect = 9e-18
 Identities = 83/279 (29%), Positives = 119/279 (42%), Gaps = 1/279 (0%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPT-LFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKE 500
            +S+PLFS I+TFCTLI LYLP  LF +++F               R GA Q+   +    
Sbjct: 42   SSNPLFSCIVTFCTLIFLYLPHHLFSKIVFSPVLILTGILLLTILRLGANQKYHHK---- 97

Query: 501  RQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSA 680
                            KE +  H+ II + +  ++   EE + +       +++S  Q  
Sbjct: 98   ---------------QKETQQKHESIITKEENKATKCGEEKQNSTCPVEPKEIESLEQVH 142

Query: 681  HDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXX 860
            H                SE E+ S     + L  S SF+EWN+RAPLE+I          
Sbjct: 143  H----------------SEREVDS----QKSLESSSSFMEWNVRAPLEIIYEGYEDEEKL 182

Query: 861  XXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                        EK E        Y SLS++YP++DSD+SSE++F     WDSPE    +
Sbjct: 183  DDP--------NEKEENWNMGNSNYPSLSRYYPESDSDSSSEDEFPVKEYWDSPEEWEEE 234

Query: 1041 WXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPASF 1157
                            KR  E + EE+NLIEID+ P  +
Sbjct: 235  DREGLIEIALDGSKMMKRDLEFQYEEENLIEIDISPTRY 273


>ref|XP_002271376.2| PREDICTED: uncharacterized protein LOC100241473 [Vitis vinifera]
          Length = 251

 Score = 95.5 bits (236), Expect = 4e-17
 Identities = 84/279 (30%), Positives = 108/279 (38%), Gaps = 1/279 (0%)
 Frame = +3

Query: 318  SLASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLK 497
            SL+SDPLFS I+T   LILLY P +F  ++F               R G  Q        
Sbjct: 32   SLSSDPLFSCIVTLYILILLYFPRIFLGIVFSPVLISTGVLLLTLLRLGVNQ-------- 83

Query: 498  ERQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSE-NAVVDLDSTWQ 674
              Q+E                        EP   SSN  E+  +   E   V      W 
Sbjct: 84   --QVE-----------------------GEP---SSNEPEQPDMRGRELKCVHPQPEKWP 115

Query: 675  SAHDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXX 854
              H          SDF+               K F+ DSFV WN+RAPLEVI        
Sbjct: 116  EIH----------SDFDT--------------KPFFCDSFVGWNVRAPLEVIYEEFEGEE 151

Query: 855  XXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHIN 1034
                             E +   +ERY SLS  YP++DSD SS+ +F     W+SPE+++
Sbjct: 152  DEDLNVT---------EETRFLGLERYPSLSLCYPESDSDTSSDGEFPASNGWNSPENMS 202

Query: 1035 LQWXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPA 1151
             +W               K  +   VEEDNLIEID+ PA
Sbjct: 203  FRWDQDDREGLIEIALDGKLDAMFHVEEDNLIEIDISPA 241


>ref|XP_003635001.1| PREDICTED: uncharacterized protein LOC100854501 [Vitis vinifera]
            gi|302144177|emb|CBI23304.3| unnamed protein product
            [Vitis vinifera]
          Length = 251

 Score = 95.1 bits (235), Expect = 6e-17
 Identities = 84/278 (30%), Positives = 108/278 (38%)
 Frame = +3

Query: 318  SLASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLK 497
            SL+SDPLFS I+T   LILLY P +F  ++F               R G  Q+       
Sbjct: 32   SLSSDPLFSCIVTLYILILLYFPRVFLGIVFSPVLISTGVLLLTLLRLGVNQQ------- 84

Query: 498  ERQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQS 677
                       E   S+ E E   DL   E KCV     +                 W  
Sbjct: 85   ----------VEGEPSSNEPEQ-PDLRGRELKCVHPQPEK-----------------WPE 116

Query: 678  AHDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXX 857
             H          SDF+               K F+ DSFV WN+RAPLEVI         
Sbjct: 117  IH----------SDFDT--------------KPFFCDSFVGWNVRAPLEVIYEEFEGEED 152

Query: 858  XXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINL 1037
                            E +   +ERY SLS  YP++DSD SS+ +F     W+SPE+++ 
Sbjct: 153  EDPNVT---------EETRFLGLERYPSLSLCYPESDSDTSSDGEFPASNGWNSPENMSF 203

Query: 1038 QWXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFPA 1151
            +W               K  +   VEEDNLIEID+ PA
Sbjct: 204  RWDQDDREGLIEIALDGKLDAMFHVEEDNLIEIDISPA 241


>ref|XP_002524684.1| hypothetical protein RCOM_1095870 [Ricinus communis]
            gi|223536045|gb|EEF37703.1| hypothetical protein
            RCOM_1095870 [Ricinus communis]
          Length = 247

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 80/288 (27%), Positives = 115/288 (39%), Gaps = 12/288 (4%)
 Frame = +3

Query: 321  LASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKE 500
            L+++PLFS IIT  TLILLY P  F ++                 R GAVQR     L+ 
Sbjct: 9    LSTNPLFSCIITLYTLILLYFPHPF-KIFLSPVLILTALLLLFLLRLGAVQR-----LQT 62

Query: 501  RQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSA 680
               +K+ K      +T+  EI  + +  E  C S +                        
Sbjct: 63   PDPDKNDKK----ENTESREITENNVSVEENCGSVD------------------------ 94

Query: 681  HDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXX 860
                    KWV    + ++           K  + +SFVEWN+RAPLE+I          
Sbjct: 95   --------KWVETGLDPNQ----------SKPDFEESFVEWNVRAPLEIIYEAYEGEGDV 136

Query: 861  XXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINLQ 1040
                       + K E +   ++RY SLS +YP+TDSD+SS+ DF    +WDSP+ +  +
Sbjct: 137  D---------EDVKNETRSGRLQRYPSLSMYYPETDSDSSSDGDFSVTGEWDSPDSVCFR 187

Query: 1041 WXXXXXXXXXXXXXXQKRYSESE------------VEEDNLIEIDLFP 1148
            W               +   +              VEEDNLIEID+ P
Sbjct: 188  WEDEDRDGLLIEIALDRNNDKKHSGLDSGVDLDFYVEEDNLIEIDITP 235


>gb|ESW20241.1| hypothetical protein PHAVU_006G192300g [Phaseolus vulgaris]
          Length = 271

 Score = 89.4 bits (220), Expect = 3e-15
 Identities = 80/280 (28%), Positives = 118/280 (42%), Gaps = 5/280 (1%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +S PLFS I+TFC LILLYLP LF++++                R GA+QRS  E   E 
Sbjct: 30   SSQPLFSCILTFCFLILLYLPRLFWKIVLSPVLILSGILLLLLLRLGAIQRSQNEE-SEI 88

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
             +E++                          V++  +   R     N V  +++      
Sbjct: 89   PVEREP-------------------------VANKENRGNREEKQGNPVEPVEA------ 117

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXX 863
            D +   Y+WV+     S+ E  S         + +SF+EWN++APLEVI           
Sbjct: 118  DTLDHVYRWVTS---QSQPEFKSQTGFRSSSRFDESFMEWNVKAPLEVIYEGEE------ 168

Query: 864  XXXXXXXXFSEEKREAQMAI-IERYTSLSKFYPDTDSDNSSEEDFLGIRD----WDSPEH 1028
                     +E+ R       I RY SLS++YP+TDSD+SSE  F    +    WD  + 
Sbjct: 169  ---------TEQDRSGDHGEGIFRYPSLSRYYPETDSDSSSESGFPATDNMCFRWDEEDR 219

Query: 1029 INLQWXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDLFP 1148
              L                +K+    + EE+NLIEID+ P
Sbjct: 220  EGL--------IEIALDGCKKKEVGFQFEEENLIEIDISP 251


>gb|EXB51826.1| hypothetical protein L484_006399 [Morus notabilis]
          Length = 248

 Score = 85.5 bits (210), Expect = 5e-14
 Identities = 84/294 (28%), Positives = 118/294 (40%), Gaps = 17/294 (5%)
 Frame = +3

Query: 324  ASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +S+PL S+I+T   LILLY+P  F RV+F               R GA+Q+   E  K  
Sbjct: 10   SSNPLVSSIVTLYALILLYVPHQFLRVLFSPVLIITGILLFSLLRLGAIQKCEDENRKAE 69

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
              EK SK  E+  ST++ +                                         
Sbjct: 70   --EKSSK--ETTYSTQQEQ----------------------------------------- 84

Query: 684  DLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXXXX 863
                +D+ WV  +   ++ E  +    + +  YS  FV WNLRAPLEVI           
Sbjct: 85   ----EDHNWVDLYESETDSEPETGFETNSRFEYS--FVGWNLRAPLEVIYEEEDEDEE-- 136

Query: 864  XXXXXXXXFSEEKREAQMAIIE-RYTSLSKFYPDTD--SDNSSEEDFLGIRDWDSPEHIN 1034
                      EE    +  I+  RY SLS++YP+++  S+NSS+ +F     WDSPE + 
Sbjct: 137  ---------EEEGGGGEHRILSGRYASLSRYYPESEEESENSSDGEFPATGFWDSPESLG 187

Query: 1035 LQW-------XXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDL-------FPAS 1154
             +W                      KR  +  VEE+NLIEIDL       FPAS
Sbjct: 188  FRWEEDDREGLIEIALDGNKKNDRNKRGLDFHVEEENLIEIDLSVARNDEFPAS 241


>gb|EMJ12980.1| hypothetical protein PRUPE_ppa009687mg [Prunus persica]
          Length = 282

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 84/290 (28%), Positives = 118/290 (40%), Gaps = 13/290 (4%)
 Frame = +3

Query: 318  SLASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLK 497
            S+ S PLFS+I+T   LILLY P  F R++F               R GAVQR   +  +
Sbjct: 30   SVTSSPLFSSIVTLYALILLYFPYHFIRIVFSPVPIITGILLLTILRLGAVQRFEGD--E 87

Query: 498  ERQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSEN----------- 644
             R+ E +S + E+                  +C  +N S+E    N EN           
Sbjct: 88   HREKEDNSCLLET------------------ECTKTNESKECNEENKENRGSTSTSIQRT 129

Query: 645  AVVDLDSTWQSAHDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLE 824
             V +   T   A D     Y+  +D    SE E+   P P     + D FVEWNL+APLE
Sbjct: 130  EVQEQIPTSPEAQDHSFFTYQSETD----SESEMGFDPNPC----FEDFFVEWNLKAPLE 181

Query: 825  VIXXXXXXXXXXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGI 1004
            VI                    S+ ++E+Q+  +ERY SLS +          EED  G+
Sbjct: 182  VIYEENEGEEDEMDRNGNDPN-SKPEQESQVQGLERYPSLSMW---------EEEDREGL 231

Query: 1005 RDWDSPEHINLQWXXXXXXXXXXXXXXQKRYSESEV--EEDNLIEIDLFP 1148
             +    E+                    KR  + +V  EE+NLIEID+ P
Sbjct: 232  IEIALEEN-------------------SKRGMDFQVDHEEENLIEIDISP 262


>gb|EOY28422.1| Uncharacterized protein TCM_029993 [Theobroma cacao]
          Length = 244

 Score = 75.1 bits (183), Expect = 6e-11
 Identities = 73/280 (26%), Positives = 112/280 (40%), Gaps = 2/280 (0%)
 Frame = +3

Query: 309  MDFSLASDPLFSTIITFCTLILLYLPTLF-FRVIFXXXXXXXXXXXXXXXRYGAVQRSTQ 485
            + F  ++DPL S +IT   LILLY P  F  ++ F               R GA+QR+  
Sbjct: 31   LSFLSSNDPLLSFVITLYILILLYFPQSFSLKIFFFPVLVLTASLLLSLLRLGAIQRTQT 90

Query: 486  EVLKERQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDS 665
            E  ++R +      AE+ +   +F                 S +E++            S
Sbjct: 91   ETKEKRSL------AEAEAEKTDF-----------------SQQELKW-----------S 116

Query: 666  TWQSAHDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXX 845
            T +   +L++  ++                          ++FVEW++ APLEVI     
Sbjct: 117  TCKKDPELVMQSFE--------------------------ETFVEWDVGAPLEVIYEGHE 150

Query: 846  XXXXXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEE-DFLGIRDWDSP 1022
                             E       +IERY SLS +YP++DSD+SS E D+L I +W S 
Sbjct: 151  GEEEDP----------NENVSNPTRVIERYPSLSLYYPESDSDSSSSETDYLAIGEWVSS 200

Query: 1023 EHINLQWXXXXXXXXXXXXXXQKRYSESEVEEDNLIEIDL 1142
            E +  +W               KR  +   EE+NLIEID+
Sbjct: 201  EKMCYRW-EEEDREGLIEIALDKRDLDFHGEEENLIEIDI 239


>ref|XP_006449210.1| hypothetical protein CICLE_v10016271mg [Citrus clementina]
            gi|557551821|gb|ESR62450.1| hypothetical protein
            CICLE_v10016271mg [Citrus clementina]
          Length = 267

 Score = 71.2 bits (173), Expect = 9e-10
 Identities = 78/289 (26%), Positives = 111/289 (38%), Gaps = 14/289 (4%)
 Frame = +3

Query: 327  SDPLFSTIITFCTLILLYLPTLFF-RVIFXXXXXXXXXXXXXXXRYGAVQRSTQEVLKER 503
            +DPLFS+I+T   LILLY P + F ++I                R GA QR+ +      
Sbjct: 29   ADPLFSSIVTLYILILLYSPHIIFSKIILSPVLIITFTLLLTLLRLGATQRNER------ 82

Query: 504  QIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDSTWQSAH 683
                                            + N+      NNSE A  D ++T  ++ 
Sbjct: 83   --------------------------------AENTESNCTDNNSEAA--DHETTSSTSS 108

Query: 684  DLIIDDYKWVS--DFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXXXXX 857
             +   D+K V+  D N G        P P    F+ +SFVEWN+RAPLEVI         
Sbjct: 109  YIPHQDHKGVTCTDSNFG--------PNP----FFENSFVEWNVRAPLEVIYEAYEGEED 156

Query: 858  XXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEHINL 1037
                        ++        +++Y SLS +YP+TDSD SS+      RD  SPE +  
Sbjct: 157  DEETV-------KDPGNKGPFCVDKYASLSLYYPETDSDTSSDG---CDRDRGSPESVCY 206

Query: 1038 QWXXXXXXXXXXXXXXQKRYSESE-----------VEEDNLIEIDLFPA 1151
            +W                     +            EE+NLIEID+ PA
Sbjct: 207  RWEDEDREGLIEIALDNNNSRNHKRERLDFENFHGEEENNLIEIDISPA 255


>ref|XP_002316662.2| hypothetical protein POPTR_0011s04240g [Populus trichocarpa]
            gi|550327560|gb|EEE97274.2| hypothetical protein
            POPTR_0011s04240g [Populus trichocarpa]
          Length = 274

 Score = 71.2 bits (173), Expect = 9e-10
 Identities = 77/292 (26%), Positives = 106/292 (36%), Gaps = 14/292 (4%)
 Frame = +3

Query: 318  SLASDPLFSTIITFCTLILLYLPTLFFRVIFXXXXXXXXXXXXXXXRYGAVQR---STQE 488
            +L+ +PLFS IIT  TLILLY P    ++                 R GA+QR   S  E
Sbjct: 28   NLSFNPLFSCIITLYTLILLYFPQAL-KLSISPILTITLTLLLFVLRLGAIQRHQLSVTE 86

Query: 489  VLKERQIEKDSKIAESISSTKEFEIIHDLIIDEPKCVSSNSSEEIRVNNSENAVVDLDST 668
              K  QI++D       +S+  F      +    K V+S S++  R +            
Sbjct: 87   SDKAIQIKQDKGTHFGEASSSSF------LTHVDKWVASQSADPGRFD------------ 128

Query: 669  WQSAHDLIIDDYKWVSDFNEGSECEIISAPIPSQKLFYSDSFVEWNLRAPLEVIXXXXXX 848
                                           P   L +  SFVEW++RAPL+VI      
Sbjct: 129  -------------------------------PDPNLDFEVSFVEWDVRAPLKVINEEYEG 157

Query: 849  XXXXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDSDNSSEEDFLGIRDWDSPEH 1028
                             +   +   +ERY SL+  YP+TDSD+ SE  F    +WDS + 
Sbjct: 158  EEGEDPNEKDAG-----QDPTRFGGLERYPSLAMCYPETDSDSDSEGGFSVAGEWDSLDR 212

Query: 1029 INLQWXXXXXXXXXXXXXXQKRYSES-----------EVEEDNLIEIDLFPA 1151
               +W                   E             VEEDNLIEID+ PA
Sbjct: 213  FCFKWEEEDREGLLIEIALDSDNKEDTGPDLDAGLNFHVEEDNLIEIDISPA 264


>ref|NP_193881.1| uncharacterized protein [Arabidopsis thaliana]
            gi|3080394|emb|CAA18714.1| hypothetical protein
            [Arabidopsis thaliana] gi|7268947|emb|CAB81257.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|45752754|gb|AAS76275.1| At4g21500 [Arabidopsis
            thaliana] gi|62320370|dbj|BAD94766.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332659060|gb|AEE84460.1| uncharacterized protein
            AT4G21500 [Arabidopsis thaliana]
          Length = 215

 Score = 68.6 bits (166), Expect = 6e-09
 Identities = 49/136 (36%), Positives = 66/136 (48%), Gaps = 17/136 (12%)
 Frame = +3

Query: 792  FVEWNLRAPLEVIXXXXXXXXXXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDS 971
            FVEWNLRAPLEVI                     EE+   +   +ER+ SLS  YP++DS
Sbjct: 90   FVEWNLRAPLEVIHEAYEDEEEE----------EEEEDPTRFRKMERFPSLSLCYPESDS 139

Query: 972  DN----SSEEDFLGIRDWDSPEHINLQW---------XXXXXXXXXXXXXXQKRYSESEV 1112
            ++    SSE +F  I DW+SPE++  +W                       +K  S++E+
Sbjct: 140  ESDSASSSEFNFPEIGDWNSPENMGFRWEEDEGGIGCEGLIEIKLDHYSHNRKMMSKTEI 199

Query: 1113 ----EEDNLIEIDLFP 1148
                EED LIEIDLFP
Sbjct: 200  DFHAEEDGLIEIDLFP 215


>ref|XP_002867843.1| hypothetical protein ARALYDRAFT_914525 [Arabidopsis lyrata subsp.
            lyrata] gi|297313679|gb|EFH44102.1| hypothetical protein
            ARALYDRAFT_914525 [Arabidopsis lyrata subsp. lyrata]
          Length = 222

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 48/140 (34%), Positives = 61/140 (43%), Gaps = 21/140 (15%)
 Frame = +3

Query: 792  FVEWNLRAPLEVIXXXXXXXXXXXXXXXXXXXFSEEKREAQMAIIERYTSLSKFYPDTDS 971
            FVEWNLRAPLEVI                      EK   +   IERY SLS  YP++DS
Sbjct: 92   FVEWNLRAPLEVIHEAYEEEEEEDP---------NEKDPTRFRKIERYPSLSLCYPESDS 142

Query: 972  DNSSEEDFLGIRDWDSPEHINLQW-----------------XXXXXXXXXXXXXXQKRYS 1100
             +SSE +F  I DW+S E I  +W                                 ++ 
Sbjct: 143  ASSSEFNFPEIGDWNSAEDIGFRWEEEDDDGGIGGEGLIEIKLDEYNHRSHNSKMMNKWK 202

Query: 1101 ESEV----EEDNLIEIDLFP 1148
            ++E+    E+D LIEIDLFP
Sbjct: 203  QTEIDFHGEDDGLIEIDLFP 222


Top