BLASTX nr result

ID: Catharanthus23_contig00000133 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000133
         (1827 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523322.1| conserved hypothetical protein [Ricinus comm...   293   1e-76
ref|XP_002265230.1| PREDICTED: uncharacterized protein LOC100260...   289   2e-75
ref|XP_006444816.1| hypothetical protein CICLE_v10019982mg [Citr...   284   8e-74
ref|XP_006491300.1| PREDICTED: uncharacterized protein LOC102628...   283   1e-73
ref|XP_004306652.1| PREDICTED: uncharacterized protein LOC101307...   274   1e-70
ref|XP_004229996.1| PREDICTED: uncharacterized protein LOC101256...   272   3e-70
emb|CBI40568.3| unnamed protein product [Vitis vinifera]              270   2e-69
ref|XP_006339728.1| PREDICTED: uncharacterized protein LOC102587...   269   3e-69
gb|EMJ19190.1| hypothetical protein PRUPE_ppa005611mg [Prunus pe...   265   5e-68
gb|EXB83880.1| hypothetical protein L484_023487 [Morus notabilis]     247   1e-62
gb|EOX95673.1| Uncharacterized protein isoform 1 [Theobroma cacao]    239   4e-60
ref|XP_004155169.1| PREDICTED: uncharacterized LOC101208119 [Cuc...   229   3e-57
ref|XP_004133806.1| PREDICTED: uncharacterized protein LOC101208...   229   4e-57
ref|XP_003521812.1| PREDICTED: uncharacterized protein LOC100784...   226   2e-56
ref|XP_002302639.2| hypothetical protein POPTR_0002s17390g [Popu...   218   9e-54
ref|XP_006852588.1| hypothetical protein AMTR_s00021p00215510 [A...   202   5e-49
ref|XP_002272343.1| PREDICTED: uncharacterized protein LOC100243...   201   7e-49
gb|EOX95675.1| Uncharacterized protein isoform 3, partial [Theob...   194   1e-46
ref|XP_006444815.1| hypothetical protein CICLE_v10019982mg [Citr...   187   1e-44
ref|NP_001061754.1| Os08g0400300 [Oryza sativa Japonica Group] g...   179   3e-42

>ref|XP_002523322.1| conserved hypothetical protein [Ricinus communis]
            gi|223537410|gb|EEF39038.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 481

 Score =  293 bits (751), Expect = 1e-76
 Identities = 190/481 (39%), Positives = 238/481 (49%), Gaps = 29/481 (6%)
 Frame = +1

Query: 184  VAEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFX 363
            VAEARAAWQRTANRC VQEDAKRAPKLACC S+SSSS KQVD GPTN AE  +     F 
Sbjct: 3    VAEARAAWQRTANRCFVQEDAKRAPKLACCQSSSSSS-KQVDGGPTNAAEMPENSAVGFM 61

Query: 364  XXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSK----- 528
                            WWLQLQP++G+ +G   EQL+   +E++   ++           
Sbjct: 62   PFHRNASYSSLPPDTRWWLQLQPSYGYQKGFTYEQLDKLENEVEILRAEFVNAPSIIDEI 121

Query: 529  PPSSKDGEAFFYESVNAEYFVDSHFGIPAK------SVKNDNGVAL----AKQLLKPLDK 678
             P    G   F  +   E   D HF I A       +VKN     L    A++ ++P D 
Sbjct: 122  RPHDDRGSTRFDGNKKYEPSFDPHFRISADYRNRDPNVKNQEAGVLYDKNAQEFIEPKDT 181

Query: 679  QNCNDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFA 858
            +  N    D D   C   +K++   ++ ESP+ G E+++PWWRT+D+D+LA LVAQ+S  
Sbjct: 182  KE-NSKLMDLDPFECLRPQKSDDYCFDSESPFSGSEKSVPWWRTTDKDDLASLVAQKSVD 240

Query: 859  LLENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQK-------HPVV-----DHNSS 1002
             + NCDLP PQ   + R P     + DH      S D K        P+V       + S
Sbjct: 241  YIANCDLPPPQKLHLRRYPHGRPGASDHDDSIALSLDGKAQSGCISSPLVHAHGCPSSES 300

Query: 1003 MHDQH-TSDDHHFPSVALEPLSDDAASK-AIPKGITGENDSGKAQLLEALRHSQTXXXXX 1176
            MH +H  S + H  S   +P S  A  K  I  G   E D  KAQLLEALRHSQT     
Sbjct: 301  MHGRHRASVEGHLQSGLNKPFSSIATHKEMIEIGQVPEGDPCKAQLLEALRHSQTRAREA 360

Query: 1177 XXXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXX 1356
                        H++KLFFRQASQLFAYKQWF LLQLE+LY Q+KN  +P+S+       
Sbjct: 361  EKVAKQACAEREHIIKLFFRQASQLFAYKQWFHLLQLESLYYQVKNGGQPMST-LFPVAL 419

Query: 1357 XXXXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWML 1536
                 K RKM K+W                DI KY                  WTVGWML
Sbjct: 420  PWMPQKGRKMRKSWQKSTRGKRGKRGRPSHDISKYAVALALGLGLVGAGLLLGWTVGWML 479

Query: 1537 P 1539
            P
Sbjct: 480  P 480


>ref|XP_002265230.1| PREDICTED: uncharacterized protein LOC100260339 [Vitis vinifera]
          Length = 478

 Score =  289 bits (740), Expect = 2e-75
 Identities = 190/480 (39%), Positives = 239/480 (49%), Gaps = 28/480 (5%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACCPS+SSSS KQ DAG  N A+  D P   F  
Sbjct: 4    AEARAVWQRAANRCFVQEDAKRAPKLACCPSSSSSS-KQADAGHANAADGPDHPPVGFMP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKP----- 531
                           WWLQLQPN+G+ +GL  EQLN+  +E++       + +       
Sbjct: 63   LNRTSYSNLPPDTR-WWLQLQPNYGYQKGLTSEQLNALEAEVEMLIDGTASKTSELDGAY 121

Query: 532  PSSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDA----- 696
              ++DG        N E F D      A  V+ D      KQ +  LD +N  D      
Sbjct: 122  AQNEDGSGRVDGGKNTESFFDVDNINFAGCVEKDPD--FGKQEVNALDSKNAQDLEVNNM 179

Query: 697  -----FKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFAL 861
                   + +  G + SK+ + L  + ES W+G E+N PWWRT+D DELA LV Q+S   
Sbjct: 180  WKYYELVETEPIGSSASKQPSELYLDSESSWIGVEKNEPWWRTADTDELASLVVQKSLDH 239

Query: 862  LENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQKHPV-VDHNSSMHDQHTS----- 1023
            +ENCDLP PQ   V  +PFA   S  H G F SS D+K       N ++H + +S     
Sbjct: 240  IENCDLPPPQKMHVRSDPFAPLGSFVHKGNFGSSLDRKAQTGTLSNLTLHLKGSSSLGSA 299

Query: 1024 DDHHFPSV-----ALEPLSDDAASKAIP--KGITGENDSGKAQLLEALRHSQTXXXXXXX 1182
            D   + S      + +P S +   K +   +GIT +ND  KAQLLEALRHSQT       
Sbjct: 300  DGRQWASAEDRHGSDKPFSYNTNHKDLTEMQGIT-DNDPSKAQLLEALRHSQTRAREAEK 358

Query: 1183 XXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXX 1362
                      H++ LF RQASQLFAYKQWF LLQLENLY QIKN++ PIS+         
Sbjct: 359  AAKQAHEEKEHIISLFLRQASQLFAYKQWFHLLQLENLYSQIKNKDHPIST-LFPVTLPW 417

Query: 1363 XXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPT 1542
               K++K  K+W                DI KY                  WT+GWMLPT
Sbjct: 418  TPYKAKKQRKSWQKATKGRRGKRAQPRYDISKYAVAFALGLSLVGAGLLLGWTIGWMLPT 477


>ref|XP_006444816.1| hypothetical protein CICLE_v10019982mg [Citrus clementina]
            gi|567904658|ref|XP_006444817.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
            gi|567904660|ref|XP_006444818.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
            gi|557547078|gb|ESR58056.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
            gi|557547079|gb|ESR58057.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
            gi|557547080|gb|ESR58058.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
          Length = 475

 Score =  284 bits (727), Expect = 8e-74
 Identities = 181/474 (38%), Positives = 232/474 (48%), Gaps = 23/474 (4%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACC S+SSSS KQVDAGP   A+  D P A F  
Sbjct: 4    AEARAVWQRAANRCFVQEDAKRAPKLACCQSSSSSS-KQVDAGPAGVADAPDHPAAGFMP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSD-VTTTSK----P 531
                           WWLQLQPN+G  +GL  EQ+++  +EM+   +  V + SK    P
Sbjct: 63   LNMNHLYSELPSDTRWWLQLQPNYGCQKGLTSEQISAVEAEMEALRAGFVNSPSKFSGDP 122

Query: 532  PSSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNC-------- 687
                 G      S+N +   D  +   +   +N +   + KQ ++ +D +          
Sbjct: 123  SLDSTGGTLVDGSINNDVSHDELYNRVSAVCRNKDP-EVRKQNVEAVDCKTTQEFIELMD 181

Query: 688  ---NDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFA 858
               N  F + D  GC  SK +    ++ ESPW+GG +  PWWRT+D+D+LA LVAQ+S +
Sbjct: 182  IRENYEFIEMDSVGCPSSKTSKEPCFDPESPWIGGGKTEPWWRTTDKDDLASLVAQKSVS 241

Query: 859  LLENCDLPQPQHTCVNREPFA-----DFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTS 1023
             +ENCDLP PQ       P+A     D        +   +    +PVV    S   +  S
Sbjct: 242  YMENCDLPPPQKKHTRAHPYARSRASDLDETSSLHLKYQTDYISNPVVHAQGSPDSRRAS 301

Query: 1024 -DDHHFPSVALEPLSDDAASKAIPK-GITGENDSGKAQLLEALRHSQTXXXXXXXXXXXX 1197
             ++   P  + E      A K I +     E D  KAQLLEALRHSQT            
Sbjct: 302  VEEGQMPFGSSESFGCSTAHKGISETQEVSEGDPCKAQLLEALRHSQTRAREAETAAKEA 361

Query: 1198 XXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKS 1377
                 H++KLFFRQASQLFAY+QWFQ+LQLE LY QIKN ++PIS+            K 
Sbjct: 362  YAEKEHILKLFFRQASQLFAYRQWFQMLQLEALYFQIKNSDQPIST-LFPVALPWVPPKG 420

Query: 1378 RKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
            RK  KNW                D+ KY                  WTVGWMLP
Sbjct: 421  RKTGKNWQKAAKGKRGKQGRPKHDMSKYAFAFAWGLGLVGAGLLLGWTVGWMLP 474


>ref|XP_006491300.1| PREDICTED: uncharacterized protein LOC102628391 isoform X1 [Citrus
            sinensis] gi|568876470|ref|XP_006491301.1| PREDICTED:
            uncharacterized protein LOC102628391 isoform X2 [Citrus
            sinensis]
          Length = 475

 Score =  283 bits (725), Expect = 1e-73
 Identities = 181/474 (38%), Positives = 232/474 (48%), Gaps = 23/474 (4%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACC S+SSSS KQVDAGP   A+  D P A F  
Sbjct: 4    AEARAVWQRAANRCFVQEDAKRAPKLACCQSSSSSS-KQVDAGPAGVADAPDHPAAGFMP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSD-VTTTSK----P 531
                           WWLQLQPN+G  +GL  EQ+++  +EM+   +  V + SK    P
Sbjct: 63   LNMNHLYSELPSDTRWWLQLQPNYGCQKGLTSEQISAVEAEMEALRACFVNSPSKFSGDP 122

Query: 532  PSSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNC-------- 687
                 G      S+N +   D  +   +   +N +   + KQ ++ +D +          
Sbjct: 123  SLDSTGGTLVDGSINNDVSHDELYNRVSAVCRNKDP-EVRKQNVEAVDCKTTQEFIELMD 181

Query: 688  ---NDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFA 858
               N  F + D  GC  SK +    ++ ESPW+GG +  PWWRT+D+D+LA LVAQ+S +
Sbjct: 182  IRENYEFIEMDSVGCPSSKTSKEPCFDPESPWIGGGKTEPWWRTTDKDDLASLVAQKSVS 241

Query: 859  LLENCDLPQPQHTCVNREPFA-----DFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTS 1023
             +ENCDLP PQ       P+A     D        +   +    +PVV    S   +  S
Sbjct: 242  YMENCDLPPPQKKHTRAHPYARSRASDLDETSSLHLKYQTDYISNPVVHAQGSPDSRRAS 301

Query: 1024 -DDHHFPSVALEPLSDDAASKAIPK-GITGENDSGKAQLLEALRHSQTXXXXXXXXXXXX 1197
             ++   P  + E      A K I +     E D  KAQLLEALRHSQT            
Sbjct: 302  VEEGQMPFGSSESFGCSTAHKGISETQEVSEGDPCKAQLLEALRHSQTRAREAETAAKEA 361

Query: 1198 XXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKS 1377
                 H++KLFFRQASQLFAY+QWFQ+LQLE LY QIKN ++PIS+            K 
Sbjct: 362  YAEKEHILKLFFRQASQLFAYRQWFQMLQLEALYFQIKNSDQPIST-LFPVALPWVPPKG 420

Query: 1378 RKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
            RK  KNW                D+ KY                  WTVGWMLP
Sbjct: 421  RKTGKNWQKAAKGKRGKQGRPKHDMSKYAFAFAWGFGLVGAGLLLGWTVGWMLP 474


>ref|XP_004306652.1| PREDICTED: uncharacterized protein LOC101307620 [Fragaria vesca
            subsp. vesca]
          Length = 442

 Score =  274 bits (700), Expect = 1e-70
 Identities = 184/460 (40%), Positives = 224/460 (48%), Gaps = 7/460 (1%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQRTANRC VQEDAKRAPKLA C S SSS+ KQVDAGP    E  D PGAAF  
Sbjct: 4    AEARAVWQRTANRCFVQEDAKRAPKLAYCQS-SSSTTKQVDAGPATATEGLDHPGAAFMP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSSKD 546
                           WWLQ+QPNHG+ + L  EQLN+  ++M+T  +        P+SK+
Sbjct: 63   ISRNRSYSNLPADTRWWLQMQPNHGYQKDLTPEQLNALEADMETLRAGFVK----PTSKN 118

Query: 547  GEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDA-FKDEDGSGC 723
             E            +D H G          G  + K+ +     +N  +  +KD      
Sbjct: 119  SE------------IDQHKGEFTDGDCVKTGYEVQKKDVDAAYGENMQELQYKDMRERYE 166

Query: 724  AVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTCV 903
             +   T S  YE + PW+GG R  PWWRT+DRDELA LVAQ+S   +ENCDLP PQ    
Sbjct: 167  KMGMDTIS--YEPD-PWMGGVRTEPWWRTTDRDELASLVAQKSLDHIENCDLPPPQKLYH 223

Query: 904  NREPFADFCSI-DHAGIFMSSKDQKHPVVD-HNSSMHDQHTSDD----HHFPSVALEPLS 1065
             R P+A    + DH G+  +S D+K       N +   Q  SD           A E  S
Sbjct: 224  KRHPYAAHAGLSDHDGLLGTSLDRKAQANSLSNMTTRAQGFSDTGVTFGKCGEAADEEHS 283

Query: 1066 DDAASKAIPKGITGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVKLFFRQAS 1245
            D +    I      + D  KAQL+EAL HSQT                 H+ KLFF+QAS
Sbjct: 284  DTSLRDLIDLQKLTDGDPTKAQLIEALCHSQTRAREAEKAAKQAYAEKEHIFKLFFKQAS 343

Query: 1246 QLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHXXXXXXXX 1425
            QLFAYKQWFQLLQLE LY+QIKN+++   S            K RK  KNW         
Sbjct: 344  QLFAYKQWFQLLQLETLYVQIKNKDQ-AGSTVLPVILPWMSSKDRKSRKNWRRVPKGKRS 402

Query: 1426 XXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPTF 1545
                   DI KY                  WTVGWMLP+F
Sbjct: 403  RRVDHEYDINKYAVALALGFGLVGAGLLLGWTVGWMLPSF 442


>ref|XP_004229996.1| PREDICTED: uncharacterized protein LOC101256522 isoform 1 [Solanum
            lycopersicum] gi|460368283|ref|XP_004229997.1| PREDICTED:
            uncharacterized protein LOC101256522 isoform 2 [Solanum
            lycopersicum]
          Length = 474

 Score =  272 bits (696), Expect = 3e-70
 Identities = 181/479 (37%), Positives = 231/479 (48%), Gaps = 25/479 (5%)
 Frame = +1

Query: 184  VAEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFX 363
            VAEAR AWQR  NRCLVQEDAKRAPKLACC S S SS KQVD GP N A+ Q+  G  F 
Sbjct: 3    VAEARTAWQRAVNRCLVQEDAKRAPKLACCSSASPSS-KQVDTGPANGADAQNPSGTCFL 61

Query: 364  XXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFH---SDVTTTSKPP 534
                            WWL LQPN+G+ +GLV E ++S  +EM+        +   +K  
Sbjct: 62   PFDRNSSYCDLSPNSRWWLHLQPNYGYQKGLVSELVDSIEAEMENIGPVLDSIPKYNKLC 121

Query: 535  SSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPL-----DKQNCNDAF 699
               + ++   +       +DS     A  V +D GV  +K+L         D  N  D  
Sbjct: 122  DQNEADSICVDKFTVGGSLDSQVTRSASYVNSDLGVG-SKELTDVFTEISKDSPNLEDTG 180

Query: 700  KDEDGS-----GCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALL 864
               + S        V K+ + L ++ E PW+G  +  PWWRT+D +ELALLVAQRS   +
Sbjct: 181  YPNEASKKGLVDLTVGKQIDELSFDTEYPWIGVAKTEPWWRTADTEELALLVAQRSHDFM 240

Query: 865  ENCDLPQPQHTCVNREPFADFCSIDHA---GIFMSSKDQKHPVVDHNSSMHDQHTS---- 1023
            ENCDLPQPQ+  V ++   D  S  +A   G    S  Q++  +    ++  +  S    
Sbjct: 241  ENCDLPQPQNNFVKQDRDVDVDSKIYASSMGPKAGSMRQQNTNIHKRGNLSFERPSQLDA 300

Query: 1024 ----DDHHFPSVALEPLSDDAASKAIPKGITGENDSGKAQLLEALRHSQTXXXXXXXXXX 1191
                  H   S +L+  SD A  K +PK  T  ND  KAQLL+ALRHSQT          
Sbjct: 301  EGKLQLHTCKSSSLKN-SDTAGQKVVPKMSTSGNDESKAQLLKALRHSQTRAREAENAAK 359

Query: 1192 XXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIK-NQNEPISSXXXXXXXXXXX 1368
                   HVV+L FRQASQLFAYKQWFQLLQLEN Y QIK N+  PIS+           
Sbjct: 360  QAFAEKEHVVQLVFRQASQLFAYKQWFQLLQLENFYFQIKSNKKHPISAMLPVMLPRVPK 419

Query: 1369 XKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPTF 1545
               R   K+                 D+ +Y                  WTVGWM+PTF
Sbjct: 420  KSKRPQKKS----ARVKRAKRGRPRYDLSRYAVVFALGLGLVGAGLLLGWTVGWMVPTF 474


>emb|CBI40568.3| unnamed protein product [Vitis vinifera]
          Length = 419

 Score =  270 bits (689), Expect = 2e-69
 Identities = 182/469 (38%), Positives = 226/469 (48%), Gaps = 17/469 (3%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACCPS+SSSS KQ DAG  N A+  D P   F  
Sbjct: 4    AEARAVWQRAANRCFVQEDAKRAPKLACCPSSSSSS-KQADAGHANAADGPDHPPVGFMP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSSKD 546
                           WWLQLQPN+G+ +GL  EQLN+  +E+                  
Sbjct: 63   LNRTSYSNLPPDTR-WWLQLQPNYGYQKGLTSEQLNALEAEV------------------ 103

Query: 547  GEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDAFKDEDGSGCA 726
                       E  +D   G  +K+ + D   A                  ++EDGSG  
Sbjct: 104  -----------EMLID---GTASKTSELDGAYA------------------QNEDGSGRV 131

Query: 727  VSKKTNSLLYEC----ESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQH 894
               K     ++     +S W+G E+N PWWRT+D DELA LV Q+S   +ENCDLP PQ 
Sbjct: 132  DGGKNTESFFDLTTCGKSSWIGVEKNEPWWRTADTDELASLVVQKSLDHIENCDLPPPQK 191

Query: 895  TCVNREPFADFCSIDHAGIFMSSKDQKHPV-VDHNSSMHDQHTS-----DDHHFPSV--- 1047
              V  +PFA   S  H G F SS D+K       N ++H + +S     D   + S    
Sbjct: 192  MHVRSDPFAPLGSFVHKGNFGSSLDRKAQTGTLSNLTLHLKGSSSLGSADGRQWASAEDR 251

Query: 1048 --ALEPLSDDAASKAIP--KGITGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXH 1215
              + +P S +   K +   +GIT +ND  KAQLLEALRHSQT                 H
Sbjct: 252  HGSDKPFSYNTNHKDLTEMQGIT-DNDPSKAQLLEALRHSQTRAREAEKAAKQAHEEKEH 310

Query: 1216 VVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKN 1395
            ++ LF RQASQLFAYKQWF LLQLENLY QIKN++ PIS+            K++K  K+
Sbjct: 311  IISLFLRQASQLFAYKQWFHLLQLENLYSQIKNKDHPIST-LFPVTLPWTPYKAKKQRKS 369

Query: 1396 WHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPT 1542
            W                DI KY                  WT+GWMLPT
Sbjct: 370  WQKATKGRRGKRAQPRYDISKYAVAFALGLSLVGAGLLLGWTIGWMLPT 418


>ref|XP_006339728.1| PREDICTED: uncharacterized protein LOC102587530 isoform X1 [Solanum
            tuberosum] gi|565345288|ref|XP_006339729.1| PREDICTED:
            uncharacterized protein LOC102587530 isoform X2 [Solanum
            tuberosum]
          Length = 470

 Score =  269 bits (688), Expect = 3e-69
 Identities = 185/489 (37%), Positives = 235/489 (48%), Gaps = 35/489 (7%)
 Frame = +1

Query: 184  VAEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFX 363
            VAEAR AWQR  NRCLVQEDAKRAPKLACC S S SS KQVDAGP N A+ Q+  G  F 
Sbjct: 3    VAEARTAWQRAVNRCLVQEDAKRAPKLACCSSASPSS-KQVDAGPANGADAQNPSGTYFL 61

Query: 364  XXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFH---SDVTTTSKPP 534
                            WWL LQPN+G+ +GLV E ++S  +EM+        +   +K  
Sbjct: 62   PFDRNSSYCDLSPNSRWWLHLQPNYGYQKGLVSELVDSIEAEMENIGPVLDSIPKYNKLC 121

Query: 535  SSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPL-----DKQNCNDAF 699
               + ++   +       +DS     A  V ND GV  +K+L         D  N  D  
Sbjct: 122  DQNEADSICVDKFTVGGSLDSQVTRSASYVNNDLGVG-SKELTDVFTEISKDSPNLEDTG 180

Query: 700  KDEDGS-----GCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALL 864
                 S        V K+ + L ++ E PW+G E+  PWWRT+D +ELALLVAQRS   +
Sbjct: 181  YPNKASKKGLVDLTVGKQIDELPFDTEYPWIGVEKTEPWWRTADTEELALLVAQRSHDFM 240

Query: 865  ENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTS------- 1023
            ENCDLPQPQ+  V ++   D  S     I+ SS   K        SMH Q+T+       
Sbjct: 241  ENCDLPQPQNNFVKQDRDVDVDS----KIYASSTGPK------AGSMHQQNTNIYKRGNL 290

Query: 1024 --------------DDHHFPSVALEPLSDDAASKAIPKGITGENDSGKAQLLEALRHSQT 1161
                            H   S +L+  SD  + K +P+  T  +D  KAQLL+ALRHSQT
Sbjct: 291  SFERPSQLDAEGKLQLHTCKSSSLKN-SDTPSQKVVPEMNTSGDDESKAQLLKALRHSQT 349

Query: 1162 XXXXXXXXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIK-NQNEPISSX 1338
                             HVV+L FRQASQLFAYKQWFQLLQLEN Y QIK N+ +PIS+ 
Sbjct: 350  RAREAENAAKQAFAEKEHVVQLVFRQASQLFAYKQWFQLLQLENFYFQIKNNKKQPISA- 408

Query: 1339 XXXXXXXXXXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXW 1518
                       K+++  K                  D+ +Y                  W
Sbjct: 409  ----MLPRVPQKTKRPQKK---SARMKRAKCGCPKYDLSRYAVVFALGLGLVGAGLLLGW 461

Query: 1519 TVGWMLPTF 1545
            TVGWM+PTF
Sbjct: 462  TVGWMVPTF 470


>gb|EMJ19190.1| hypothetical protein PRUPE_ppa005611mg [Prunus persica]
          Length = 451

 Score =  265 bits (677), Expect = 5e-68
 Identities = 172/459 (37%), Positives = 219/459 (47%), Gaps = 8/459 (1%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACC S SSS+ KQVDAGP   AE  D P A F  
Sbjct: 4    AEARAVWQRVANRCFVQEDAKRAPKLACCQS-SSSTTKQVDAGPATAAEGPDHPAAGFVP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSSKD 546
                           WWLQ+QP++G+ +    EQLN+  ++M+T  +    ++  P + +
Sbjct: 63   LNRNPSYSSLPPDARWWLQMQPSYGYQKDFTYEQLNALEADMETLRAGFVKST--PKTSE 120

Query: 547  GEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDAFK--DEDGSG 720
                  E  +A+   +S      K  K D      K + + +  ++  + ++    D   
Sbjct: 121  VRQQKGECTDADGHKNS------KVQKQDVNAQYGKDMKELVQYKDVREKYEIMGMDTID 174

Query: 721  CAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTC 900
               SK+       C+ PW+GG R  PWWRT+DRDELA LVAQ+S   +ENCDLP PQ   
Sbjct: 175  YPFSKQPEEFC--CDYPWIGGGRAEPWWRTTDRDELASLVAQKSLNHVENCDLPPPQKMY 232

Query: 901  VNREPFADFCSIDHAGIFMSSKDQK------HPVVDHNSSMHDQHTSDDHHFPSVALEPL 1062
              R P+AD    DH  I  +S D K        +  H     D   + +    + A E  
Sbjct: 233  HKRHPYADIGCSDHNVILGTSLDGKAQTGGLSDLTSHARCYSDPGITHERK-GNAAEEGH 291

Query: 1063 SDDAASKAIPKGITGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVKLFFRQA 1242
            SD +           E +  KAQL+EAL HSQT                 H+ KLFFRQA
Sbjct: 292  SDKSFWDVTETQQLSEGEPTKAQLMEALCHSQTRAREAEMAAKQAYAEKEHIFKLFFRQA 351

Query: 1243 SQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHXXXXXXX 1422
            SQLFAYKQWFQLLQLE + +QIKN ++P  S            K RK  +NW        
Sbjct: 352  SQLFAYKQWFQLLQLETICIQIKNNDQP-GSAVVPVVLPWMPFKGRKPRRNWRKGPKGKR 410

Query: 1423 XXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                    DI KY                  WTVGWMLP
Sbjct: 411  GRRAEPRHDITKYAVAFALGFSLVGAGLLLGWTVGWMLP 449


>gb|EXB83880.1| hypothetical protein L484_023487 [Morus notabilis]
          Length = 472

 Score =  247 bits (630), Expect = 1e-62
 Identities = 171/482 (35%), Positives = 228/482 (47%), Gaps = 28/482 (5%)
 Frame = +1

Query: 184  VAEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPA-ETQDIPGAAF 360
            VAEARA WQR ANRC VQEDAKRAPKLACC S+S+S  KQV+AG    A +  D P   F
Sbjct: 3    VAEARAVWQRAANRCFVQEDAKRAPKLACCQSSSTS--KQVEAGGHATATDGPDHPAVGF 60

Query: 361  XXXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSS 540
                             WWL +QPN+G  +G   EQ+N+  +E  T ++ V  ++    S
Sbjct: 61   MPTNRCPSYSNLPPDTRWWLHMQPNYGCQKGFTYEQMNALENEEGTKNAGVVNST----S 116

Query: 541  KDGEAFFYES-VNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDAFKDEDGS 717
            +  EA   +   N E FV  H     K+ +      + K+ +K LD ++  +    ED +
Sbjct: 117  RISEAHKRKGDKNNECFVSVHNAAQKKASE------VGKKNVKALDGKDIEELIGLEDST 170

Query: 718  -----------GCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALL 864
                        C+ +K++N + +E E  W+G E++ PWWR +DRDEL  LVAQ+S   +
Sbjct: 171  VSWEIMQVDSIDCSDTKQSNEMCFEPEYSWMGSEKSEPWWRMTDRDELVSLVAQKSLDRV 230

Query: 865  ENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQKHPVVDHNS-----SMHDQHTSDD 1029
             NCDLP PQ T   R P+A     D   I  SS D +      +S     S    ++   
Sbjct: 231  GNCDLPPPQKTSHRRHPYARIGCFDSKEISASSLDWRTQTGSLSSTGTVRSPGFANSGRT 290

Query: 1030 HHFPSVALEPL----SDDAASKAIP-KGITG-----ENDSGKAQLLEALRHSQTXXXXXX 1179
               P    + L    SD+ +S     K +T      E +  KAQL+EAL HSQT      
Sbjct: 291  QEIPGCLTKGLSLYESDETSSYCTSHKNMTEIQQDCEGEFSKAQLMEALCHSQTRAREAE 350

Query: 1180 XXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXX 1359
                       H+V LFFRQAS LFAYKQW QLLQLE LY+Q+ N ++ IS+        
Sbjct: 351  KAAKQAYAEKEHIVTLFFRQASLLFAYKQWLQLLQLETLYIQLNNNDQQISNLFPLIIPW 410

Query: 1360 XXXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                + RK  K+ H               D+ KY                  WTVGWMLP
Sbjct: 411  KSSCEERKPRKSLHKGVKGRGEKRGRPDHDVAKYAVAFALGLSLVGAGLLLGWTVGWMLP 470

Query: 1540 TF 1545
             F
Sbjct: 471  HF 472


>gb|EOX95673.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 396

 Score =  239 bits (609), Expect = 4e-60
 Identities = 172/462 (37%), Positives = 210/462 (45%), Gaps = 11/462 (2%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQRTANRC VQEDAKRAPKLACC S+SSS  KQ D+ P   A   D P   F  
Sbjct: 4    AEARAVWQRTANRCFVQEDAKRAPKLACCQSSSSS--KQADSSPNGAAGACDHPAVGFMP 61

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSSKD 546
                           WWLQLQP++G  +GL  EQL++   E+                  
Sbjct: 62   LNRSPSYSNLPPDMRWWLQLQPSYGPQKGLTSEQLHALEDEV------------------ 103

Query: 547  GEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDAFKDEDGSGCA 726
                  ES+ AE           KS    +GV L          Q+  DA          
Sbjct: 104  ------ESLKAEI----------KSPSKVSGVHL----------QDAQDA---------- 127

Query: 727  VSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTCVN 906
                        ESPWV G +  PWWRT+D+DELA LVAQ+S   +ENCDLP PQ   V 
Sbjct: 128  -----------TESPWVQGGKGEPWWRTTDKDELASLVAQKSSYFIENCDLPPPQKMHVR 176

Query: 907  REPFADFCSIDHAGIFMSS---KDQKHPV---VDHNSSMHDQHTSDDHHFPSVA---LEP 1059
            R   A  CS    G  +SS   K Q  P+   + ++ +  D   +      SV    ++ 
Sbjct: 177  RSSHA--CSGSSDGDEVSSLAWKSQTGPIPRPIVNSRAFTDSVRTHGRLMSSVGEGKVQC 234

Query: 1060 LSDDAASKAIPKGI--TGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVKLFF 1233
             SD + S      +    E+D  KAQLLEAL HSQT                 H++KLFF
Sbjct: 235  ASDTSFSTTKEDTVEQVTESDPTKAQLLEALCHSQTRAREAERAAKQAYAEKEHIIKLFF 294

Query: 1234 RQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHXXXX 1413
            +QASQLFAYKQWFQ+LQLE LY+QIKN  +P+S+             SRK+ K+W     
Sbjct: 295  KQASQLFAYKQWFQMLQLEALYVQIKNNEQPVST-LFPAVLPWTPYNSRKLRKSWQKTGK 353

Query: 1414 XXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                       DI KY                  WTVGWMLP
Sbjct: 354  ARRVKNGQPRPDITKYAVAFALGLSLVGAGLLLGWTVGWMLP 395


>ref|XP_004155169.1| PREDICTED: uncharacterized LOC101208119 [Cucumis sativus]
          Length = 474

 Score =  229 bits (584), Expect = 3e-57
 Identities = 157/475 (33%), Positives = 229/475 (48%), Gaps = 22/475 (4%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPA-ETQDIPGAAFX 363
            AEARAA+QRT NRC VQEDAKRAP+LACC S+SS+S KQVD+GP N A +  D P   F 
Sbjct: 4    AEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTS-KQVDSGPANAAADGPDQPSTGFM 62

Query: 364  XXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLN-----SEVSEMDTFHSDVTTTSK 528
                            WWLQ Q ++GF +    E +N     +E S+  T  S  ++   
Sbjct: 63   PSSRASSYSNLLPDSKWWLQTQSSYGFQKIFTLEHINPLEAGNETSKSGTEKSCTSSDIH 122

Query: 529  PPSSKDG----EAFFYESVNAEYFVDSHFGIPAKSVKNDNGVAL----AKQLLKPLDKQN 684
             P   +     + F   S++ ++ V         ++ N++   L    +++ +  +D + 
Sbjct: 123  RPEGSNTVCGVDDFSRSSLDTDHGVSGLCTKRVTTILNEDIKTLEGTDSQECVGSVDMKA 182

Query: 685  CNDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALL 864
              +  + +  +   VSK  +   ++ +SPW+  E+  PWW  +D+DELA  VAQ+S   +
Sbjct: 183  DFECLEKDSFNSKTVSKNQDEFYFDPDSPWIQEEKAEPWWWITDKDELAYWVAQKSLDHI 242

Query: 865  ENCDLPQPQHTCVN--REPFADFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTSD-DHH 1035
            ENCDLP P+ TC++  R P+A     +H    +S+ +  H     +     +   D    
Sbjct: 243  ENCDLPPPKKTCLSFKRCPYAKKQCYEHNTNLVSTFESTHQNCGLDFCRFGRTQRDLSES 302

Query: 1036 FPSVALEPLSDDAASKAIPKGI-----TGENDSGKAQLLEALRHSQTXXXXXXXXXXXXX 1200
                 L  LS  ++S   P  +     T E+++ KA+L++AL HSQT             
Sbjct: 303  IEQGNLLHLSHKSSSCTNPDNLTKTMQTSEDNTSKAELMDALLHSQTRAREAEIAAKRAY 362

Query: 1201 XXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSR 1380
                H+V+LF RQA+QLFAYKQWFQLLQLE+  LQIKN N+P+S+            K+ 
Sbjct: 363  AEKEHIVELFVRQATQLFAYKQWFQLLQLES--LQIKNSNQPMSN-LFPLVLPWKSYKNM 419

Query: 1381 KMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPTF 1545
              HK W                DI  Y                  WTVGWMLP+F
Sbjct: 420  VSHKRWRRVTGQKRVEQDQRKSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF 474


>ref|XP_004133806.1| PREDICTED: uncharacterized protein LOC101208119 [Cucumis sativus]
          Length = 474

 Score =  229 bits (583), Expect = 4e-57
 Identities = 157/475 (33%), Positives = 229/475 (48%), Gaps = 22/475 (4%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPA-ETQDIPGAAFX 363
            AEARAA+QRT NRC VQEDAKRAP+LACC S+SS+S KQVD+GP N A +  D P   F 
Sbjct: 4    AEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTS-KQVDSGPANAAADGPDQPSTGFM 62

Query: 364  XXXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLN-----SEVSEMDTFHSDVTTTSK 528
                            WWLQ Q ++GF +    E +N     +E S+  T  S  ++   
Sbjct: 63   PSSRASSYSNLLPDSKWWLQTQSSYGFQKIFTLEHINPLEAGNETSKSGTEKSCTSSDIH 122

Query: 529  PPSSKDG----EAFFYESVNAEYFVDSHFGIPAKSVKNDNGVAL----AKQLLKPLDKQN 684
             P   +     + F   S++ ++ V         ++ N++   L    +++ +  +D + 
Sbjct: 123  RPEGSNTVCGVDDFSRSSLDTDHGVSGLCTKRVTTILNEDIKTLEGTDSQECVGSVDMKA 182

Query: 685  CNDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALL 864
              +  + +  +   VSK  +   ++ +SPW+  E+  PWW  +D+DELA  VAQ+S   +
Sbjct: 183  DFECLEKDSFNSKTVSKNQDEFYFDPDSPWIQEEKAEPWWWITDKDELAYWVAQKSLDHI 242

Query: 865  ENCDLPQPQHTCVN--REPFADFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTSD-DHH 1035
            ENCDLP P+ TC++  R P+A     +H    +S+ +  H     +     +   D    
Sbjct: 243  ENCDLPPPKKTCLSFKRCPYAKKQCYEHNTNLVSTFESTHQNCGLDFCRFGRTQRDLSES 302

Query: 1036 FPSVALEPLSDDAASKAIPKGI-----TGENDSGKAQLLEALRHSQTXXXXXXXXXXXXX 1200
                 L  LS  ++S   P  +     T E+++ KA+L++AL HSQT             
Sbjct: 303  IEQGNLLHLSHKSSSCTNPDDLTKTMQTSEDNTSKAELMDALLHSQTRAREAEIAAKRAY 362

Query: 1201 XXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSR 1380
                H+V+LF RQA+QLFAYKQWFQLLQLE+  LQIKN N+P+S+            K+ 
Sbjct: 363  AEKEHIVELFVRQATQLFAYKQWFQLLQLES--LQIKNSNQPMSN-LFPLVLPWKSYKNM 419

Query: 1381 KMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLPTF 1545
              HK W                DI  Y                  WTVGWMLP+F
Sbjct: 420  VSHKRWRRVTGQKRVEQDQRKSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF 474


>ref|XP_003521812.1| PREDICTED: uncharacterized protein LOC100784190 [Glycine max]
          Length = 426

 Score =  226 bits (577), Expect = 2e-56
 Identities = 159/465 (34%), Positives = 206/465 (44%), Gaps = 14/465 (3%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQRTANRC VQEDAKRAPKLACC S+ ++S K VDAGP + A+  D        
Sbjct: 4    AEARALWQRTANRCFVQEDAKRAPKLACCQSSCATS-KSVDAGPASTADESDHTTVNVTH 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTF-HSDVTTTSKPPSSK 543
                           WWL LQPN+G+ +GL  EQLN+   E++T   SD+        SK
Sbjct: 63   FNRKSSISNLSPDSRWWLHLQPNYGYQKGLTYEQLNALEDEVETLLASDL--------SK 114

Query: 544  DGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNCNDAFKDEDGSGC 723
            + E F                               ++L+  ++K    D     D  GC
Sbjct: 115  NSEEF-------------------------------QELMDVMEKHETMDI----DCVGC 139

Query: 724  A-VSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTC 900
            +  SKK N    E +  W+  ++ +PWWRT+DRDELA  V+Q+S   +ENCDLP PQ   
Sbjct: 140  SGSSKKANDFSLESDYSWIESDKALPWWRTTDRDELASFVSQKSLNHIENCDLPPPQKKH 199

Query: 901  VNREPFADFCS--IDHAGIFMSSKDQKHP-VVDHNSSMHDQHT--SDDHHFPSVALEPLS 1065
            +   P A   +  I  A     +K +    +  H     D      +  H  +  L   +
Sbjct: 200  LRGHPCAHVNNDKIKTASYDWEAKSRSFSNLTAHTPGSLDSRLMHKNQGHSANEGLLYFA 259

Query: 1066 DDAASKAIPK-------GITGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVK 1224
             D  S   PK         T + D  KAQL+EAL HSQT                 H+V 
Sbjct: 260  SDKCSSQTPKHEDLKKSQQTFDGDPSKAQLMEALCHSQTRAREAEEAAKKAYAEKEHIVT 319

Query: 1225 LFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHX 1404
            L F+QASQLFAYKQW QLLQLE L +QIK++++PIS+                  +    
Sbjct: 320  LIFKQASQLFAYKQWLQLLQLETLCIQIKSKDQPISTLFPVALPWMSYEGRSSRKRKQKI 379

Query: 1405 XXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                         CDI  Y                  WTVGWMLP
Sbjct: 380  CNAKQGERKANSKCDITTYAVAFALGLSLVGAGLLLGWTVGWMLP 424


>ref|XP_002302639.2| hypothetical protein POPTR_0002s17390g [Populus trichocarpa]
            gi|550345217|gb|EEE81912.2| hypothetical protein
            POPTR_0002s17390g [Populus trichocarpa]
          Length = 429

 Score =  218 bits (554), Expect = 9e-54
 Identities = 154/405 (38%), Positives = 203/405 (50%), Gaps = 29/405 (7%)
 Frame = +1

Query: 412  WWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTT-SKPPSSKDGEA---FFYESVNA 579
            WWLQLQP++G+ + L +EQLN+  +E+++  +++  + SK    K  +    F   S N+
Sbjct: 27   WWLQLQPSYGYQKCLTREQLNALETELESLRTNIVDSPSKNEICKQDDEDNMFLDGSKNS 86

Query: 580  EYFVDSHFGIPAKSVKNDNGVALAKQLLKPL---DKQNCN---DAFKDE-----DGSGCA 726
            E  +DS+  I A  +K D  V   KQ LK L   D Q  N   DA K+      D +G  
Sbjct: 87   ESSLDSYCRISADYMKKDCDVK--KQELKALYDKDFQEFNELKDARKNSKLMEMDLTGWP 144

Query: 727  VSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTCVN 906
             S+K N   ++ ES W+G E+NMPWWR +D+D+LA LVAQ+S   + NCDLP PQ   + 
Sbjct: 145  ESQKDNEHGFDPESSWIGSEKNMPWWRKTDKDDLASLVAQKSLDYIGNCDLPPPQKVHIR 204

Query: 907  REPFADFCSIDHAGIFMSSKDQKHPVVDHNSSM-HDQHTSDDHHFP-----SVALEPL-- 1062
            + P A   S  H     SS D K  +   +S+  H Q        P     S   + L  
Sbjct: 205  KYPCAHSGSFQHDNTLASSLDWKAQIGCISSATGHVQGCPKSEGMPGKQRGSTEGQSLSG 264

Query: 1063 SDDAAS------KAIPKGITGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVK 1224
            SD A S      +A   G   E+D  KAQLLEALRHSQT                 H+VK
Sbjct: 265  SDKACSYAATIKEAAEIGQISESDPCKAQLLEALRHSQTRAREAEQVAKQACAEKEHIVK 324

Query: 1225 LFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHX 1404
            LFF+QASQLFAYKQWFQLLQLE LY Q+KN ++PIS+            K RK+ K+W  
Sbjct: 325  LFFKQASQLFAYKQWFQLLQLETLYYQMKNSDQPISN-LFPVVLPWIPQKGRKLCKSWQK 383

Query: 1405 XXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                          D+GKY                  WTVGW+LP
Sbjct: 384  SSKGKRGKESHPKHDVGKYAVALALGLSLVGAGLLLGWTVGWVLP 428


>ref|XP_006852588.1| hypothetical protein AMTR_s00021p00215510 [Amborella trichopoda]
            gi|548856199|gb|ERN14055.1| hypothetical protein
            AMTR_s00021p00215510 [Amborella trichopoda]
          Length = 473

 Score =  202 bits (513), Expect = 5e-49
 Identities = 149/488 (30%), Positives = 210/488 (43%), Gaps = 35/488 (7%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQRTANR  VQEDAKRAPKLACCPS S S   Q + G  +     D   A    
Sbjct: 4    AEARAVWQRTANRYFVQEDAKRAPKLACCPSPSCSKT-QSETGHGDHGNGPDHSSAIPVP 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTS-----KP 531
                           WWLQLQPN G  +    EQ+ +  +E+D   +   T S     + 
Sbjct: 63   LNWNPTNMNLSPESKWWLQLQPNFGNHKDFTYEQIKALEAELDVIETGHDTPSSKLDDET 122

Query: 532  PSSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVAL-------AKQLLKPLDKQNCN 690
              ++DG    Y+     Y +++ F +    +K+D  + +        KQLLK  ++    
Sbjct: 123  QETEDGHGGLYK--KPHYSLETTFRVSTACLKHDCELRMEELKAVHMKQLLK--NEVEAG 178

Query: 691  DAFKDEDG--------------SGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDEL 828
               K E G              S    S+++  +  +  +PW+  E+  PWW  +D+ EL
Sbjct: 179  GYLKSEFGDYWYGDSKVMDMEPSDLLTSERSEKVSADYGAPWM-CEKTGPWWHITDKHEL 237

Query: 829  ALLVAQRSFALLENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQKHPVVDHNSSMH 1008
              LV Q++   +ENCDLP+P    + + PF+ F S +H  I  +  + K    D   +  
Sbjct: 238  ETLVEQKTSQHVENCDLPRPHPMQIKKGPFSGFESSEHEEIASTLFEHKFSSSDCYPTEL 297

Query: 1009 DQHTSDDHHFPSVALEPLSDDAASKAIPKG---------ITGENDSGKAQLLEALRHSQT 1161
             Q  S           PL D   + +             ++ E+++ KAQLLEAL HSQT
Sbjct: 298  SQFDSASGSLGRTQQGPLHDSMKTFSCENNKKETYEISRLSFESEASKAQLLEALCHSQT 357

Query: 1162 XXXXXXXXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISSXX 1341
                             H++KLFF+QAS LFAYKQW QLLQLE LYLQ+K + + +    
Sbjct: 358  RAREAEKAAQKANSEKEHIIKLFFKQASHLFAYKQWLQLLQLETLYLQLKAKEQLLPVLP 417

Query: 1342 XXXXXXXXXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKYXXXXXXXXXXXXXXXXXXWT 1521
                      + +K  K  H               D                      WT
Sbjct: 418  WKPKEDKQWRQKKKKRKIGHHIY------------DASTLAFAVAVGLSLAGAGLFLGWT 465

Query: 1522 VGWMLPTF 1545
            +GW+LPTF
Sbjct: 466  MGWLLPTF 473


>ref|XP_002272343.1| PREDICTED: uncharacterized protein LOC100243561 [Vitis vinifera]
          Length = 494

 Score =  201 bits (512), Expect = 7e-49
 Identities = 141/410 (34%), Positives = 192/410 (46%), Gaps = 27/410 (6%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AE+RA W+R  NRC + E+A RAP  +  PS+SSSS +Q D  P + A   D P      
Sbjct: 4    AESRAGWKRNTNRCFIHENASRAPNSSSFPSSSSSSKRQSDGRPGDAAHRSDHPSPD-CM 62

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSD-VTTTSKPPSSK 543
                           WWL  QPN G  +G   EQLN+  +E D    + +  T+      
Sbjct: 63   HQNCNPLEDPAPDSKWWLYPQPNFGHQKGFEHEQLNTLENEFDILSYEFINQTAIEGLGA 122

Query: 544  DGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLL----------KPLDKQNCND 693
              E       NA++F+D      A S+K D    ++K  +          K  D +    
Sbjct: 123  QTET----KKNADFFLDRSRKASAASMKEDQFARMSKPKIGLHSNPQDIGKDKDIEELWY 178

Query: 694  AFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFALLENC 873
              +D D     VS+++  L  + ES W+G E+  PWWR +D+D LA +VAQ+S   +ENC
Sbjct: 179  TDEDLDPVNSLVSEQSKKLSSDLESHWMGAEKTEPWWRKADKDTLASMVAQKSVEHIENC 238

Query: 874  DLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQK-----HPVVDHNSSMHDQHTSDDHHF 1038
            DLP+PQ     R   A     D   +   S DQ        + D     H   + D+   
Sbjct: 239  DLPKPQIKHFRRGLSASLEWSDQDWMVAPSLDQMAELGFSNLTDCTWKSHTSASIDEKQS 298

Query: 1039 PSVALE--PLSDDAASKAIPKGITGEN---------DSGKAQLLEALRHSQTXXXXXXXX 1185
               A+E  P   D   +     ITG +         D+ KAQL+EAL HSQT        
Sbjct: 299  SLGAIEYSPNRSDTLFRNNSHSITGTDQEETCHIPEDASKAQLVEALCHSQTRAREAEKA 358

Query: 1186 XXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISS 1335
                     H++KLFF+QASQLFAYKQW QLLQLE L L+ KN+++PISS
Sbjct: 359  AQQAYEEKEHIIKLFFKQASQLFAYKQWLQLLQLETLCLEPKNKDQPISS 408


>gb|EOX95675.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 366

 Score =  194 bits (492), Expect = 1e-46
 Identities = 139/386 (36%), Positives = 188/386 (48%), Gaps = 14/386 (3%)
 Frame = +1

Query: 424  LQPNHGFSRGLVKEQLNSEVSEMDTFHSDVTTTSKPPSSKDGEAFFYESVNAEYFVDSH- 600
            LQP++G  +GL  EQL++   E+++  +++ + SK              V+  +  D+  
Sbjct: 7    LQPSYGPQKGLTSEQLHALEDEVESLKAEIKSPSK--------------VSGVHLQDAQD 52

Query: 601  -FGIPAKSVKNDNGVAL-AKQLLKPLDKQNCNDAFKDEDGSGCAVSKKTNSLLYECESPW 774
              GI   S   D G +L + ++LK       N  F + +   C V KKTN L Y+ ESPW
Sbjct: 53   ATGIDRNS---DKGYSLDSTEILK-------NYEFLEMESVECPVFKKTNDLCYDPESPW 102

Query: 775  VGGERNMPWWRTSDRDELALLVAQRSFALLENCDLPQPQHTCVNREPFADFCSIDHAGIF 954
            V G +  PWWRT+D+DELA LVAQ+S   +ENCDLP PQ   V R   A  CS    G  
Sbjct: 103  VQGGKGEPWWRTTDKDELASLVAQKSSYFIENCDLPPPQKMHVRRSSHA--CSGSSDGDE 160

Query: 955  MSS---KDQKHPV---VDHNSSMHDQHTSDDHHFPSVA---LEPLSDDAASKAIPKGI-- 1101
            +SS   K Q  P+   + ++ +  D   +      SV    ++  SD + S      +  
Sbjct: 161  VSSLAWKSQTGPIPRPIVNSRAFTDSVRTHGRLMSSVGEGKVQCASDTSFSTTKEDTVEQ 220

Query: 1102 TGENDSGKAQLLEALRHSQTXXXXXXXXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLL 1281
              E+D  KAQLLEAL HSQT                 H++KLFF+QASQLFAYKQWFQ+L
Sbjct: 221  VTESDPTKAQLLEALCHSQTRAREAERAAKQAYAEKEHIIKLFFKQASQLFAYKQWFQML 280

Query: 1282 QLENLYLQIKNQNEPISSXXXXXXXXXXXXKSRKMHKNWHXXXXXXXXXXXXXXCDIGKY 1461
            QLE LY+QIKN  +P+S+             SRK+ K+W                DI KY
Sbjct: 281  QLEALYVQIKNNEQPVST-LFPAVLPWTPYNSRKLRKSWQKTGKARRVKNGQPRPDITKY 339

Query: 1462 XXXXXXXXXXXXXXXXXXWTVGWMLP 1539
                              WTVGWMLP
Sbjct: 340  AVAFALGLSLVGAGLLLGWTVGWMLP 365


>ref|XP_006444815.1| hypothetical protein CICLE_v10019982mg [Citrus clementina]
            gi|557547077|gb|ESR58055.1| hypothetical protein
            CICLE_v10019982mg [Citrus clementina]
          Length = 335

 Score =  187 bits (476), Expect = 1e-44
 Identities = 118/320 (36%), Positives = 164/320 (51%), Gaps = 16/320 (5%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARA WQR ANRC VQEDAKRAPKLACC S+SSSS KQVDAGP   A+  D P A F  
Sbjct: 7    AEARAVWQRAANRCFVQEDAKRAPKLACCQSSSSSS-KQVDAGPAGVADAPDHPAAGFMP 65

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQLNSEVSEMDTFHSD-VTTTSK----P 531
                           WWLQLQPN+G  +GL  EQ+++  +EM+   +  V + SK    P
Sbjct: 66   LNMNHLYSELPSDTRWWLQLQPNYGCQKGLTSEQISAVEAEMEALRAGFVNSPSKFSGDP 125

Query: 532  PSSKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLDKQNC-------- 687
                 G      S+N +   D  +   +   +N +   + KQ ++ +D +          
Sbjct: 126  SLDSTGGTLVDGSINNDVSHDELYNRVSAVCRNKDP-EVRKQNVEAVDCKTTQEFIELMD 184

Query: 688  ---NDAFKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRSFA 858
               N  F + D  GC  SK +    ++ ESPW+GG +  PWWRT+D+D+LA LVAQ+S +
Sbjct: 185  IRENYEFIEMDSVGCPSSKTSKEPCFDPESPWIGGGKTEPWWRTTDKDDLASLVAQKSVS 244

Query: 859  LLENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQKHPVVDHNSSMHDQHTSDDHHF 1038
             +ENCDLP PQ       P+A   + D               +D  SS+H ++ +D    
Sbjct: 245  YMENCDLPPPQKKHTRAHPYARSRASD---------------LDETSSLHLKYQTDYISN 289

Query: 1039 PSVALEPLSDDAASKAIPKG 1098
            P V  +  S D+   ++ +G
Sbjct: 290  PVVHAQG-SPDSRRASVEEG 308


>ref|NP_001061754.1| Os08g0400300 [Oryza sativa Japonica Group]
            gi|37572976|dbj|BAC98668.1| unknown protein [Oryza sativa
            Japonica Group] gi|37805969|dbj|BAC99384.1| unknown
            protein [Oryza sativa Japonica Group]
            gi|113623723|dbj|BAF23668.1| Os08g0400300 [Oryza sativa
            Japonica Group] gi|125603330|gb|EAZ42655.1| hypothetical
            protein OsJ_27219 [Oryza sativa Japonica Group]
            gi|215695311|dbj|BAG90502.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 481

 Score =  179 bits (455), Expect = 3e-42
 Identities = 143/420 (34%), Positives = 196/420 (46%), Gaps = 37/420 (8%)
 Frame = +1

Query: 187  AEARAAWQRTANRCLVQEDAKRAPKLACCPSTSSSSVKQVDAGPTNPAETQDIPGAAFXX 366
            AEARAAWQR ANRC+VQED KRAPKLACCP    SS +Q      N   ++D P   F  
Sbjct: 4    AEARAAWQRAANRCIVQEDRKRAPKLACCPP---SSEQQHVKSNGNCRNSEDRPVPNFMP 60

Query: 367  XXXXXXXXXXXXXXXWWLQLQPNHGFSRGLVKEQL---NSEVSEMDTFHSDVTTTSKPPS 537
                           WWLQLQPN G  + L  E L     E+S+ +   S        P 
Sbjct: 61   LSWNPMNSSLPPDIRWWLQLQPNLGGQKNLAGEHLYFLGREISDKEVEDSAQKNIHDEPL 120

Query: 538  SKDGEAFFYESVNAEYFVDSHFGIPAKSVKNDNGVALAKQLLKPLD---------KQNCN 690
                E F       E   +  + +   S+K  +   L  Q LK +          K+N +
Sbjct: 121  FC--EMFDTNPEKIEDVFEPSWMVSTASMKYSSETGL--QDLKNIGGYSQVPSKCKENAS 176

Query: 691  DA------FKDEDGSGCAVSKKTNSLLYECESPWVGGERNMPWWRTSDRDELALLVAQRS 852
            D       F D        SK      ++  +PW GGER+ PWW+ +D +ELALLVA+R+
Sbjct: 177  DCLFNDKEFLDFKNFNPPPSKNPQKDDFDMNAPWKGGERSQPWWQITDENELALLVAERA 236

Query: 853  FALLENCDLPQPQHTCVNREPFADFCSIDHAGIFMSSKDQ------------KHPVVDHN 996
               +ENCDLP+P  T + R    +  S ++ G +  S               +H    ++
Sbjct: 237  MQHIENCDLPRP--TQIVRVQGTESRSHENMGRYRGSSGPAGTMSYPDTGQCEHIECSYS 294

Query: 997  SSMHDQH--TSD---DHHFPSVALEPLSDDAAS-KAIPKGI-TGENDSGKAQLLEALRHS 1155
            ++  D+   TSD        +VA     D +      P+G  T +N + +AQLLEAL HS
Sbjct: 295  TASTDEVDLTSDGVWQQQERNVARSDAQDFSRGINTEPRGKRTYQNPAEQAQLLEALCHS 354

Query: 1156 QTXXXXXXXXXXXXXXXXXHVVKLFFRQASQLFAYKQWFQLLQLENLYLQIKNQNEPISS 1335
            QT                  V+KLFFRQAS LFA KQW ++LQLEN+ LQ+K++   I++
Sbjct: 355  QTRAREAEMAGKKAQSEKDDVIKLFFRQASHLFACKQWLKMLQLENICLQLKHREHQIAT 414

