BLASTX nr result

ID: Catharanthus23_contig00008818

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00008818
         (2236 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362933.1| PREDICTED: uncharacterized protein DDB_G0271...   202   7e-49
emb|CBI20378.3| unnamed protein product [Vitis vinifera]              199   3e-48
ref|XP_004248504.1| PREDICTED: uncharacterized protein LOC101243...   179   6e-42
ref|XP_002281734.1| PREDICTED: uncharacterized protein LOC100247...   171   2e-39
ref|XP_002316934.1| cyclic nucleotide-gated channel [Populus tri...   167   1e-38
ref|XP_002330422.1| predicted protein [Populus trichocarpa] gi|5...   162   8e-37
ref|XP_004294487.1| PREDICTED: uncharacterized protein LOC101291...   158   8e-36
gb|EOY12782.1| Uncharacterized protein isoform 1 [Theobroma caca...   155   7e-35
gb|EXB89637.1| hypothetical protein L484_018738 [Morus notabilis]     144   2e-31
ref|XP_006464670.1| PREDICTED: uncharacterized protein DDB_G0271...   141   1e-30
gb|EOY12784.1| Uncharacterized protein isoform 3 [Theobroma cacao]    137   3e-29
ref|XP_002521829.1| conserved hypothetical protein [Ricinus comm...   118   1e-23
ref|XP_006582257.1| PREDICTED: micronuclear linker histone polyp...    97   4e-17
gb|ESW04927.1| hypothetical protein PHAVU_011G137100g [Phaseolus...    89   8e-15
ref|XP_004167599.1| PREDICTED: uncharacterized LOC101210465 [Cuc...    75   1e-10
ref|XP_004149309.1| PREDICTED: uncharacterized protein LOC101210...    75   1e-10

>ref|XP_006362933.1| PREDICTED: uncharacterized protein DDB_G0271670-like [Solanum
            tuberosum]
          Length = 473

 Score =  202 bits (513), Expect = 7e-49
 Identities = 146/403 (36%), Positives = 199/403 (49%), Gaps = 10/403 (2%)
 Frame = +3

Query: 207  RKRSETDDKHWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFG 386
            +K +E  D HW FLD+IEAP+WVDLTLEC+S Y D DDEWFH+ HPFH+ SSR+L + F 
Sbjct: 29   KKYNEQFD-HWAFLDQIEAPVWVDLTLECKSAYKDMDDEWFHISHPFHQASSRELKSAFS 87

Query: 387  HSAEDHVNLELDIQEKSSPRIPLSVSKSRGKDYR-RNGVQGNQMVMFNDQHPVKTLNQ-- 557
             S E  +NLE D+Q  SSP++P SVS+SRGKD+R R   QG+Q +  + +HPVK L++  
Sbjct: 88   RSGESSINLEHDMQGSSSPKLPPSVSRSRGKDFRSRQWSQGDQTLTLDKKHPVKHLSKGG 147

Query: 558  ----KSSFLSSNSNRMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQ 725
                +     +N  ++T S + +      A +S   ++ SN   +A   D  K   +   
Sbjct: 148  LEADRVVEHKTNHKKLTSSAAIDSDSACQALNSRDKKISSN--SLAVYSD--KTRSISSS 203

Query: 726  KFTXXXXXXXXXXXXXXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVT 905
              +                          Q S EVS    GQT G LS+L+VSLRKSCVT
Sbjct: 204  ITSEHGEECYKQELCVSDSSSTITSEACGQKSFEVSGPVLGQTTGLLSSLRVSLRKSCVT 263

Query: 906  RQASRVEIANGRLSEGQXXXXXXXXXXXXXNLGDGRRIWAARHSKDMTPPDSKNLGTSVA 1085
            RQASR+E+   R SEG+                  R     R SK+ T PDS+N+ T V 
Sbjct: 264  RQASRMEVNVCRQSEGRKSSSSKSSVGSSSIPYKEREGETERESKEKT-PDSRNVTTIVE 322

Query: 1086 CKQNLHEAKLPKAPDLQPHCTISNPKL---RSQSFKSLPINQETSKVKQQKLHANVLVPH 1256
             K   +++K+P    +Q H   S PK+   RS S  S+P      KV    +    LVP 
Sbjct: 323  AKL-ANKSKVP----VQAHNRTSIPKMVTGRSVS-SSVPTETNRPKVHLTNVQRKALVPQ 376

Query: 1257 SVNKHYPPTATXXXXXXXXASSCGQIPRGRKENLAAKVAPRQK 1385
              N                +S C ++    KEN   K+   QK
Sbjct: 377  RANGRVASILVSKPSERIGSSHCRRVVSSGKENAVVKMGMSQK 419


>emb|CBI20378.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  199 bits (507), Expect = 3e-48
 Identities = 150/452 (33%), Positives = 214/452 (47%), Gaps = 12/452 (2%)
 Frame = +3

Query: 219  ETDDKHWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAE 398
            +TDD HW FL+E EAP+W DLTLE ++   D DD+WFH+ HPFH+ SS QL + F  S+E
Sbjct: 5    KTDD-HWAFLEEFEAPMWADLTLEAKTNNQDVDDKWFHISHPFHQFSSHQLKSAFSGSSE 63

Query: 399  DHVNLELDIQEKSSPRIPLSVSKSRGKDYR-RNGVQGNQMVMFNDQHPVKTLNQKSSFLS 575
               NL+ D+   SSP++P SVS+SRGK YR RN  + N     N QHPVK+L+ K+S++ 
Sbjct: 64   GSENLDFDLHGPSSPKLPSSVSRSRGKHYRSRNWGKENGGFSLNKQHPVKSLSGKTSWVD 123

Query: 576  SNSNRMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXX 755
            S S++    K S G  K    S +    +S+ +  + P     +    D K         
Sbjct: 124  SGSSQEIKPKPSCGNLKGTCSSKTSLGCDSSSTRTSIPNYTIPISSFGDSKGRLSSVAIK 183

Query: 756  XXXXXXXXXXXXXXXXPQH-QNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIA 932
                             Q  Q SLEVSS  FG T G LS ++++LRKSC TRQASRVEI 
Sbjct: 184  ASESNSTTSTVTFEGTHQQPQKSLEVSSGPFGHTSGLLSVMRITLRKSCATRQASRVEIN 243

Query: 933  NGRLSEGQXXXXXXXXXXXXXNLGDGRRIWAAR--HSKDMTPPDSKNLGTSVACKQNLHE 1106
              + SEG              N G   +   A    ++D TP     + TS         
Sbjct: 244  KCQQSEGCKSSAGKSSVGSSSNPGYDVKDRTATEIRNRDRTPDSRNVMRTSQTAVNRGRA 303

Query: 1107 AKLPKAPDLQPHCTISNPKLRSQSF--KSLPINQETSKVKQQKLHANVLVPHSVNKHYPP 1280
            +   KA ++      +N +   +    KS   +   SKV  Q ++   LVP  VN+  P 
Sbjct: 304  STTSKASNILVDYRTNNSRKEGKRIVAKSTTKDAVKSKVVCQTINRKGLVPLRVNEQDPL 363

Query: 1281 TATXXXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPR------VPEK 1442
            TA           +  ++  G KEN + K+A  QK S R+      V  +      +  K
Sbjct: 364  TAATKAKSKVGVGASNRLAGGGKENASGKLAVSQKSSGRDIAARDIVRGQTGKKQSISRK 423

Query: 1443 SVRTTSIVPMVKERINDRSKVKKTGTMPEKVY 1538
              +T    P  K +I+ RS+ K +  + +KV+
Sbjct: 424  GDKTGFTGP--KGKISGRSEGKTSMNVHQKVF 453


>ref|XP_004248504.1| PREDICTED: uncharacterized protein LOC101243644 [Solanum
            lycopersicum]
          Length = 463

 Score =  179 bits (453), Expect = 6e-42
 Identities = 136/393 (34%), Positives = 187/393 (47%), Gaps = 9/393 (2%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW FLD+IEAP+WVDLTLEC+S Y D D+EW      FH+ SSR+L + F HS E  +NL
Sbjct: 33   HWAFLDQIEAPVWVDLTLECKSAYKDMDEEW------FHQASSRELKSAFSHSGESSINL 86

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDYR-RNGVQGNQMVMFNDQHPVKTLNQ------KSSFL 572
            E  IQ  SSP++P SVS+SRGKD+R R   QG+Q +  + +H VK L++      K    
Sbjct: 87   EHGIQGSSSPKLPPSVSRSRGKDFRSRQWSQGDQTLTLDKKHHVKHLSKGGLEADKVVEH 146

Query: 573  SSNSNRMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXX 752
             +N  ++T S + +      A  S   ++ SN   +A   D  K   +     +      
Sbjct: 147  KTNKKKLTSSAALDSDSACQALYSRDKKISSN--SLAAYSD--KTRSISSSITSEHGEEC 202

Query: 753  XXXXXXXXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIA 932
                                Q S EVS    GQT G LS+L+VSLRKSCVTRQASR+E+ 
Sbjct: 203  YKQELCVSDSSSTITSEACGQKSFEVSGPILGQTTGLLSSLRVSLRKSCVTRQASRMEVN 262

Query: 933  NGRLSEGQXXXXXXXXXXXXXNLGDGRRIWAARHSKDMTPPDSKNLGTSVACKQNLHEAK 1112
              R  EG+                  R     R SK+ T P+S+N+ T V  KQ  +++K
Sbjct: 263  VCRQPEGRKSSSSKSSVGSSSIPYKEREGETERESKEKT-PESRNVTTIVEAKQ-ANKSK 320

Query: 1113 LPKAPDLQPHCTISNPKLRSQSFKSLPINQETS--KVKQQKLHANVLVPHSVNKHYPPTA 1286
            +P    +Q H   S PK+ +    S  ++ ETS  KV    +    LVP   N       
Sbjct: 321  VP----VQAHNRTSIPKMVTGRTVSSSVSSETSRPKVHPTNVQRKALVPQRANGRVASIL 376

Query: 1287 TXXXXXXXXASSCGQIPRGRKENLAAKVAPRQK 1385
                     +S C ++    KEN   +    QK
Sbjct: 377  VSKPSERIGSSHCRRVVSSGKENDVVRKGISQK 409


>ref|XP_002281734.1| PREDICTED: uncharacterized protein LOC100247040 [Vitis vinifera]
          Length = 445

 Score =  171 bits (432), Expect = 2e-39
 Identities = 143/452 (31%), Positives = 205/452 (45%), Gaps = 12/452 (2%)
 Frame = +3

Query: 219  ETDDKHWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAE 398
            +TDD HW FL+E EAP+W DLTLE ++   D           FH+ SS QL + F  S+E
Sbjct: 5    KTDD-HWAFLEEFEAPMWADLTLEAKTNNQDV----------FHQFSSHQLKSAFSGSSE 53

Query: 399  DHVNLELDIQEKSSPRIPLSVSKSRGKDYR-RNGVQGNQMVMFNDQHPVKTLNQKSSFLS 575
               NL+ D+   SSP++P SVS+SRGK YR RN  + N     N QHPVK+L+ K+S++ 
Sbjct: 54   GSENLDFDLHGPSSPKLPSSVSRSRGKHYRSRNWGKENGGFSLNKQHPVKSLSGKTSWVD 113

Query: 576  SNSNRMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXX 755
            S S++    K S G  K    S +    +S+ +  + P     +    D K         
Sbjct: 114  SGSSQEIKPKPSCGNLKGTCSSKTSLGCDSSSTRTSIPNYTIPISSFGDSKGRLSSVAIK 173

Query: 756  XXXXXXXXXXXXXXXXPQH-QNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIA 932
                             Q  Q SLEVSS  FG T G LS ++++LRKSC TRQASRVEI 
Sbjct: 174  ASESNSTTSTVTFEGTHQQPQKSLEVSSGPFGHTSGLLSVMRITLRKSCATRQASRVEIN 233

Query: 933  NGRLSEGQXXXXXXXXXXXXXNLGDGRRIWAAR--HSKDMTPPDSKNLGTSVACKQNLHE 1106
              + SEG              N G   +   A    ++D TP     + TS         
Sbjct: 234  KCQQSEGCKSSAGKSSVGSSSNPGYDVKDRTATEIRNRDRTPDSRNVMRTSQTAVNRGRA 293

Query: 1107 AKLPKAPDLQPHCTISNPKLRSQSF--KSLPINQETSKVKQQKLHANVLVPHSVNKHYPP 1280
            +   KA ++      +N +   +    KS   +   SKV  Q ++   LVP  VN+  P 
Sbjct: 294  STTSKASNILVDYRTNNSRKEGKRIVAKSTTKDAVKSKVVCQTINRKGLVPLRVNEQDPL 353

Query: 1281 TATXXXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPR------VPEK 1442
            TA           +  ++  G KEN + K+A  QK S R+      V  +      +  K
Sbjct: 354  TAATKAKSKVGVGASNRLAGGGKENASGKLAVSQKSSGRDIAARDIVRGQTGKKQSISRK 413

Query: 1443 SVRTTSIVPMVKERINDRSKVKKTGTMPEKVY 1538
              +T    P  K +I+ RS+ K +  + +KV+
Sbjct: 414  GDKTGFTGP--KGKISGRSEGKTSMNVHQKVF 443


>ref|XP_002316934.1| cyclic nucleotide-gated channel [Populus trichocarpa]
          Length = 565

 Score =  167 bits (424), Expect = 1e-38
 Identities = 124/403 (30%), Positives = 190/403 (47%), Gaps = 8/403 (1%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW FL+EIEAP+WVD T+E +S Y D DD+WFH  HPFH+C+S +L A F HS+E  ++ 
Sbjct: 10   HWAFLEEIEAPMWVDFTIEEKSNYQDVDDKWFHTSHPFHQCTSLRLKAAFAHSSERSMSS 69

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDYRRNGVQGNQM-VMFNDQHPVKTLNQKSSFLSSNSNR 590
            + + +  SSP IP SVS+SRGK Y      G +  +  N +HPVK LN KSS ++S  + 
Sbjct: 70   DFEFKGPSSPNIPSSVSRSRGKHYAGMKWGGGECDLSMNKKHPVKVLNDKSSRVNSEPSD 129

Query: 591  MTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXX 770
                K S    K  + S        + +  A   D++        + +            
Sbjct: 130  EIKPKLSLANSKGTSRSKLSMVSGKSFTRNAKETDLKAKSGQGGSE-SSLNSGMAMVSDS 188

Query: 771  XXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIANGRLSE 950
                          Q ++EV S  F  T G LSA++  LRKS VTR+ASRVEI +    +
Sbjct: 189  NTSTVTFGSDHQARQGNMEVLSRGFDHTSGLLSAVRNGLRKSFVTRKASRVEIKDEN-KQ 247

Query: 951  GQXXXXXXXXXXXXXNLGDGRRIWAA----RHSKDMTPPDSKNLG-TSVACKQNLHEAKL 1115
             +             +L  G  + ++      +K+ T PDS+N+   + A ++   ++ +
Sbjct: 248  LRDRKSSSSKSSVGSSLKPGHDVKSSTITLMRNKEQT-PDSRNVARMTEAARKKKKDSNM 306

Query: 1116 PKAPDLQPHCTISNPK-LRSQSFKSLPINQETSKVKQQKLHANVLVPHSVNKHYPPTATX 1292
             K  D++     ++ K   S   KS P     SKV++Q +    L  H  NK +    T 
Sbjct: 307  SKTSDVRVKEVFNSRKGAISNVSKSAPQEALKSKVQKQTIRVTALAEHRGNKQHSLPGTA 366

Query: 1293 XXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNT-MHVPR 1418
                    S   ++    KEN+  K++  Q  S R T ++VP+
Sbjct: 367  KSKEKVRVSRLNKMVAPGKENVMGKMSLSQNCSRRGTKLNVPQ 409


>ref|XP_002330422.1| predicted protein [Populus trichocarpa]
            gi|566154168|ref|XP_006370339.1| hypothetical protein
            POPTR_0001s41790g [Populus trichocarpa]
            gi|550349518|gb|ERP66908.1| hypothetical protein
            POPTR_0001s41790g [Populus trichocarpa]
          Length = 490

 Score =  162 bits (409), Expect = 8e-37
 Identities = 137/447 (30%), Positives = 193/447 (43%), Gaps = 13/447 (2%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW FL+EIEAP+WVD  +E +S Y D DDEWF   HPFH+CSS QL A F +S E   + 
Sbjct: 10   HWAFLEEIEAPIWVDFLVEAKSNYQDVDDEWFRTSHPFHQCSSGQLKAAFAYSGEKSTSS 69

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDY-RRNGVQGNQMVMFNDQHPVKTLNQKSSFLSSNSN- 587
            + + +   SP IP SVS+SRGK Y  +    G   +  N QHPVK L+ KSS ++S  N 
Sbjct: 70   DFECKGSFSPNIPSSVSRSRGKHYASKKWGGGGHDISMNKQHPVKVLS-KSSRVNSEPND 128

Query: 588  ------RMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXX 749
                   +  SK ++  +  V    S +R        A  C       L    F      
Sbjct: 129  KIKPKLSLVNSKGTSRSKVSVVSGKSFTRNAKETDLEAKSCQGGTESSLNSLVF------ 182

Query: 750  XXXXXXXXXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEI 929
                                 Q +LEVSS  F    G LSA++  LRKS VTR+ASRVEI
Sbjct: 183  --KAAESNTSTVTSERDHQAKQRNLEVSSRGFDHASGLLSAVRNGLRKSFVTRKASRVEI 240

Query: 930  --ANGRLSEGQXXXXXXXXXXXXXNLGDGRRIWAARHSKDMTPPDSKNLG-TSVACKQNL 1100
               N +L + +                D +    A   K+ T PDS+N+   + A ++  
Sbjct: 241  NDENKQLRDRKSSSSKSSWGSSSNPGYDAKSSTLA--FKEQT-PDSRNVARMTEAARKKT 297

Query: 1101 HEAKLPKAPDLQPHCTISNPKLR--SQSFKSLPINQETSKVKQQKLHANVLVPHSVNKHY 1274
             ++ + +A D++    + N +    S   KS  +    SKV+ Q L    L  H  N+ +
Sbjct: 298  KDSDMSRASDVRVKEKVFNSRKGGISNVAKSASLEALKSKVQNQTLRVKALADHRGNELH 357

Query: 1275 PPTATXXXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPRVPEKSVRT 1454
            P   T             ++    KEN+  K +     S R T         VP+K  +T
Sbjct: 358  PLPGTAKAKEKVRVGGINKLVGPGKENVTGKASLSLNCSSRGT------KLNVPQKGDKT 411

Query: 1455 TSIVPMVKERINDRSKVKKTGTMPEKV 1535
                 +V  R N+   +  T    EKV
Sbjct: 412  V----LVDHRGNELHPLPGTAKAKEKV 434


>ref|XP_004294487.1| PREDICTED: uncharacterized protein LOC101291124 [Fragaria vesca
            subsp. vesca]
          Length = 455

 Score =  158 bits (400), Expect = 8e-36
 Identities = 128/424 (30%), Positives = 184/424 (43%), Gaps = 4/424 (0%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW+FL+EIEAP+WVDL  E  S   D DD+WF+  H FH+CSSR+L   F H  E+   L
Sbjct: 6    HWDFLEEIEAPMWVDLESEVNSNKQDGDDDWFYTSHLFHQCSSRELKIAFSH-GEEGTGL 64

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDYRRNGVQG-NQMVMFNDQHPVKTLNQKSSFLSSNSNR 590
              D+   SSP++P SVS+SRGK Y     +G NQ++  + +HPV  L++ SS ++S S  
Sbjct: 65   NFDLLGPSSPKLPSSVSRSRGKHYVSKKWRGDNQVIPIDKRHPVNALSRTSSCVTSESGN 124

Query: 591  MTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXX 770
               +K S    K  + S S    +SN         IR   P C    +            
Sbjct: 125  DMKTKPSYAHLKGTSRSKSSWVSKSN--------SIRNSIPSCADSTSTLTSTDKKADES 176

Query: 771  XXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEI-ANGRLS 947
                        Q + ++  SS+   Q  G LS +K  +RKSCVTRQASRVEI  + R S
Sbjct: 177  NTASTITHDIDQQQRQNMGNSSNPLSQASGLLSLIKTGMRKSCVTRQASRVEITGDTRQS 236

Query: 948  EGQXXXXXXXXXXXXXN-LGDGRRIWAARHSKDMTPPDSKNL-GTSVACKQNLHEAKLPK 1121
             G+             N   D R   +         PD++N+   S+A K  +  +K  K
Sbjct: 237  RGRNSSSGKSSVGSSSNPCYDVRSSTSTSTQYKERTPDNRNMTRISIASKNKVKFSKASK 296

Query: 1122 APDLQPHCTISNPKLRSQSFKSLPINQETSKVKQQKLHANVLVPHSVNKHYPPTATXXXX 1301
                +     SN +    + KS        KV+ Q L    L P  VN++   T+T    
Sbjct: 297  TSTNKIEQGTSNYRTGPNTGKSTYQQAAKLKVQVQNLRRKPLGPVRVNENKLITSTVKSK 356

Query: 1302 XXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPRVPEKSVRTTSIVPMVKE 1481
                     ++     EN        +K+++         T +   K V   +IV   KE
Sbjct: 357  EKPVVVGSCRLAASGIENAKGLATFDKKVNIVKGKAAGSRTQKCNSKGVAAGTIVTGQKE 416

Query: 1482 RIND 1493
              N+
Sbjct: 417  TRNN 420


>gb|EOY12782.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508720886|gb|EOY12783.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 423

 Score =  155 bits (392), Expect = 7e-35
 Identities = 134/447 (29%), Positives = 192/447 (42%), Gaps = 10/447 (2%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW FL+EIEAP+WVDLTLE +    D D +WF   H FH CSSR+L + F  S ED VN 
Sbjct: 9    HWAFLEEIEAPMWVDLTLEAKLNSQDIDGDWFQTSHLFHHCSSRKLKSAFSRSGEDGVNS 68

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDYRRNGVQGN-QMVMFNDQHPVKTLNQKSSFLSSNSNR 590
            ELD+   SSP +P SVS+SRGKDYR    +G+      N+  PVK LN K S L S    
Sbjct: 69   ELDLVGASSPTLPQSVSRSRGKDYRSKKWKGDCHDGSLNNIKPVKVLNGKFSRLDSGYGE 128

Query: 591  MTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXX 770
                K S    K    SSS + + S ++E      +                        
Sbjct: 129  EIKPKLSFVSLK--GASSSKTSLVSEITETNTRSTVTS---------------------- 164

Query: 771  XXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEI-ANGRLS 947
                        Q Q + EVSS  FGQ+ G L +++ SLRKSC+TR ASRVEI A+ R S
Sbjct: 165  ------ESVQQQQQQKTFEVSSRGFGQSSGLLLSVRSSLRKSCITRPASRVEINADRRES 218

Query: 948  EGQXXXXXXXXXXXXXNLG-DGRRIWAARHSKDMTPPDSKNLG-TSVACKQNLHEAKLPK 1121
              +               G D +R   A   +    PDS+N+   + A K  +  + +  
Sbjct: 219  RDRKSSSSKSSVGSSSFSGHDVKRSSIALIKRKEHTPDSRNVARMTEAAKNKVKPSNMCN 278

Query: 1122 APDLQPHCTISNPKLRSQSFKSLPINQETSKVKQQKLHANVLVPHSVNKHYPPTATXXXX 1301
              +++      N +       + P  QE +K K         +   +N+     A     
Sbjct: 279  TSNVRGKEGNRNSRTGGLPTVAKPTCQEATKSKANSQTLRSKLSQPLNEKKSLVAASKAR 338

Query: 1302 XXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPR------VPEKSVRTTSI 1463
                 S   ++    KEN   +++  QK + +       V  R        +   RT   
Sbjct: 339  KKVGVSRIKKVTGAGKENNTGEISLSQKCNGKGDAAGGMVVGRKGTSQSTSQNGGRTGLF 398

Query: 1464 VPMVKERINDRSKVKKTGTMPEKVYFR 1544
            VP  K R+ ++ + K +    ++V+FR
Sbjct: 399  VP--KGRVGNQREGKNSTNSTQRVHFR 423


>gb|EXB89637.1| hypothetical protein L484_018738 [Morus notabilis]
          Length = 514

 Score =  144 bits (362), Expect = 2e-31
 Identities = 129/441 (29%), Positives = 198/441 (44%), Gaps = 17/441 (3%)
 Frame = +3

Query: 225  DDKHWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDH 404
            D+ HW FL++IEAP+WVDLTLE  S   DK          FH CSS QL + F HS +  
Sbjct: 6    DEDHWAFLEDIEAPMWVDLTLEANSNNQDK----------FHHCSSSQLKSTFFHSGDGD 55

Query: 405  VNLELDIQEKSSPRIPLSVSKSRGKDYRRNGVQG-NQMVMFNDQHPVKTLNQKSSFLSSN 581
               + D+   SSP++P SVSKSRGK YR    +G NQ    +  HPVK L  KSS +   
Sbjct: 56   STRDFDLTGLSSPKLPASVSKSRGKQYRIKKWKGENQNFSVDKPHPVKVLTGKSSRVKLG 115

Query: 582  SNRMTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCD---QKFTXXXXXX 752
                   K S    KE + S S    ES+L   A   +    P  C+   +  +      
Sbjct: 116  LRDKKKHKLS-FIPKETSVSKSSVVCESSLKGKA-VSNGSNHPSACEDIGRSMSSEANKT 173

Query: 753  XXXXXXXXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIA 932
                              +   + +VSS +FG T G LSA+K++LRKS +TR A+R+EI 
Sbjct: 174  IDSNPTSTVTYENESGRQKQNKANDVSSKAFGHTNGLLSAMKMALRKSYITRPAARMEIN 233

Query: 933  N-GRLSEGQXXXXXXXXXXXXXNLGDGRRI--WAARHSKDMTPPDSKNLGTSVACKQNLH 1103
            N  R  +G+             N     RI   ++   K++T P+S+N+G      ++  
Sbjct: 234  NDARQIKGRNSTSSKSSVGSSSNPRHDVRISTSSSARPKEIT-PESRNMGRITYVAKSKI 292

Query: 1104 EAKLPKAPDLQ-PHCTISNPKLRSQSFKSLPINQETS--KVKQQKLHANVLVPHSVNKHY 1274
             + + KAP ++    T +N +  SQ   +   +QE +  KV  +  H   LVP  VN+  
Sbjct: 293  SSGIVKAPRIKMEEGTSNNRRGGSQGNPAKSTHQEAARQKVLYRPSHTKALVPSRVNEQD 352

Query: 1275 PPTAT--XXXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMR-----NTMHVPRVTPRV 1433
               A                 +  G KEN+  K++  +K + R     +T+   +   + 
Sbjct: 353  SAVAATKAKKKAGTRVMKSNNLVGGGKENVTGKMSQSEKCNGRGIAQDSTVAATKAKKKA 412

Query: 1434 PEKSVRTTSIVPMVKERINDR 1496
              + +++ ++V   KE +  +
Sbjct: 413  GTRVMKSNNLVGGGKENVTGK 433


>ref|XP_006464670.1| PREDICTED: uncharacterized protein DDB_G0271670-like [Citrus
            sinensis]
          Length = 492

 Score =  141 bits (355), Expect = 1e-30
 Identities = 143/516 (27%), Positives = 210/516 (40%), Gaps = 79/516 (15%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYN--DKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHV 407
            HW FLDEIEAP+WVDLTLE ++ YN  D DDEWFH  H FH+CSSRQ  A F  S E   
Sbjct: 9    HWAFLDEIEAPMWVDLTLE-DATYNSQDVDDEWFHSSHLFHQCSSRQWKAAFCCSGEGSC 67

Query: 408  NLELDIQEKSSPRIPLSVSKSRGKDYRRNGVQG-NQMVMFNDQHPVKTL----------- 551
                ++   SSP++P SVS+SRGKDY     QG N  V  N +H V+ L           
Sbjct: 68   ESNFELLGPSSPKLPSSVSRSRGKDYDSKKWQGENGDVSLNKKHLVEVLRDKSRADVGTV 127

Query: 552  --------------------------------NQKSSFLSSNSNRMTGSKSSNGRRKEVA 635
                                            N K +F S NS+   GS + NG+   + 
Sbjct: 128  KKIKSNAGFVKPKSTSADSKSREEIKPKLSIINSKGTFSSKNSSVSEGSSTQNGKGNSLK 187

Query: 636  GSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXXXXXXXXXXXXXPQHQ 815
               S   +ES+ S      +          + +                          Q
Sbjct: 188  PIFSSRGLESSSSSAVDKENESNALSTVTSESSLRGW----------------------Q 225

Query: 816  NSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIANGR-----------------L 944
            N++EVSS +FG +R  LSA++++LRKSCVTRQASR E  N                   +
Sbjct: 226  NTIEVSSRAFGHSRMLLSAVRITLRKSCVTRQASRAETNNDTKQSMVGINIDRRESRMDI 285

Query: 945  SEGQXXXXXXXXXXXXXNLGD-------GRRIWAARHSKDMTPPDSKNLG-TSVACKQNL 1100
            +  +             ++G         R  + +   K+ T PDS+N+   +VA    +
Sbjct: 286  NVDRRESRDRKSSSSKSSVGSSSVPSDVNRSAFISTRKKEKT-PDSRNVARMTVAPSNQV 344

Query: 1101 HEAKLPKAPDLQPHCTISNPKLRSQSFKSLPINQETSK--VKQQKLHANVLVPHSVNKHY 1274
            + +   K   +Q +    N + ++ S  +    +ET+K  V    L A    P    ++ 
Sbjct: 345  NISNESKVSVVQKNKGNFNSRRKNMSMITKSTYKETAKLNVHSHTLGAKSSQPLREKQNS 404

Query: 1275 PPTATXXXXXXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNT------MHVPRVTPRVP 1436
               AT        + + G      KEN   K++  QK S R            R    +P
Sbjct: 405  VIDATKKRGKKGSSGAAG------KENTMQKMSNNQKCSGRENTAGGVIRAQNRKQQNIP 458

Query: 1437 EKSVRTTSIVPMVKERINDRSKVKKTGTMPEKVYFR 1544
            ++ V  T ++   + +I DRSK K    + + V+ R
Sbjct: 459  QRGV--TRVLAGQQGKICDRSKGKTLVCVDQSVHLR 492


>gb|EOY12784.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 413

 Score =  137 bits (344), Expect = 3e-29
 Identities = 130/447 (29%), Positives = 187/447 (41%), Gaps = 10/447 (2%)
 Frame = +3

Query: 234  HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
            HW FL+EIEAP+WVDLTLE +    D           FH CSSR+L + F  S ED VN 
Sbjct: 9    HWAFLEEIEAPMWVDLTLEAKLNSQDI----------FHHCSSRKLKSAFSRSGEDGVNS 58

Query: 414  ELDIQEKSSPRIPLSVSKSRGKDYRRNGVQGN-QMVMFNDQHPVKTLNQKSSFLSSNSNR 590
            ELD+   SSP +P SVS+SRGKDYR    +G+      N+  PVK LN K S L S    
Sbjct: 59   ELDLVGASSPTLPQSVSRSRGKDYRSKKWKGDCHDGSLNNIKPVKVLNGKFSRLDSGYGE 118

Query: 591  MTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXX 770
                K S    K    SSS + + S ++E      +                        
Sbjct: 119  EIKPKLSFVSLK--GASSSKTSLVSEITETNTRSTVTS---------------------- 154

Query: 771  XXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEI-ANGRLS 947
                        Q Q + EVSS  FGQ+ G L +++ SLRKSC+TR ASRVEI A+ R S
Sbjct: 155  ------ESVQQQQQQKTFEVSSRGFGQSSGLLLSVRSSLRKSCITRPASRVEINADRRES 208

Query: 948  EGQXXXXXXXXXXXXXNLG-DGRRIWAARHSKDMTPPDSKNLG-TSVACKQNLHEAKLPK 1121
              +               G D +R   A   +    PDS+N+   + A K  +  + +  
Sbjct: 209  RDRKSSSSKSSVGSSSFSGHDVKRSSIALIKRKEHTPDSRNVARMTEAAKNKVKPSNMCN 268

Query: 1122 APDLQPHCTISNPKLRSQSFKSLPINQETSKVKQQKLHANVLVPHSVNKHYPPTATXXXX 1301
              +++      N +       + P  QE +K K         +   +N+     A     
Sbjct: 269  TSNVRGKEGNRNSRTGGLPTVAKPTCQEATKSKANSQTLRSKLSQPLNEKKSLVAASKAR 328

Query: 1302 XXXXASSCGQIPRGRKENLAAKVAPRQKLSMRNTMHVPRVTPR------VPEKSVRTTSI 1463
                 S   ++    KEN   +++  QK + +       V  R        +   RT   
Sbjct: 329  KKVGVSRIKKVTGAGKENNTGEISLSQKCNGKGDAAGGMVVGRKGTSQSTSQNGGRTGLF 388

Query: 1464 VPMVKERINDRSKVKKTGTMPEKVYFR 1544
            VP  K R+ ++ + K +    ++V+FR
Sbjct: 389  VP--KGRVGNQREGKNSTNSTQRVHFR 413


>ref|XP_002521829.1| conserved hypothetical protein [Ricinus communis]
           gi|223539042|gb|EEF40639.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 373

 Score =  118 bits (295), Expect = 1e-23
 Identities = 68/146 (46%), Positives = 86/146 (58%), Gaps = 6/146 (4%)
 Frame = +3

Query: 234 HWEFLDEIEAPLWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNL 413
           HW FLDEIEAP+WVDLTLE  S Y D DD WFH  H FH+CSS QL A F +S E   + 
Sbjct: 14  HWAFLDEIEAPMWVDLTLEANSNYTDVDDGWFHTSHLFHQCSSLQLKAAFAYSGEGSASS 73

Query: 414 E-LDIQEKSSPRIPLSVSKSRGKDYRRNGVQGN-QMVMFNDQHPVKTLNQKSSFLSS--- 578
           + +D++  SSP +P SVS+SRGK Y      G       N +HPVK L+ KS+  S+   
Sbjct: 74  DIIDLKRTSSPELPSSVSRSRGKHYASKKWGGKCPDFSLNKKHPVKALSGKSTTESTGFV 133

Query: 579 -NSNRMTGSKSSNGRRKEVAGSSSGS 653
            N  +++    S  + K V  SSS S
Sbjct: 134 GNETKLSFIIQSKLKLKLVWCSSSNS 159


>ref|XP_006582257.1| PREDICTED: micronuclear linker histone polyprotein-like [Glycine
           max]
          Length = 568

 Score = 96.7 bits (239), Expect = 4e-17
 Identities = 84/274 (30%), Positives = 116/274 (42%), Gaps = 18/274 (6%)
 Frame = +3

Query: 156 VSSGNRMAITPKTAFSRRKRSETDDKHWEFLDEIEAPLWVDLTLECESGYNDK-DDEWFH 332
           + +  + AIT K +F            W FL+ IEAP+WVDLTLE +SG  D  DDEWF+
Sbjct: 1   METSKKKAITMKKSFDP----------WAFLEHIEAPMWVDLTLEVKSGCVDTGDDEWFN 50

Query: 333 VIHPFHECSSRQLIAKFGHSAEDHVNLELDIQEKSSPRIPLSVSKSRGKDYRR---NGVQ 503
             HPFH+ S+R+L ++F H        E+      SP +P SVS+SRGK Y      G+ 
Sbjct: 51  TSHPFHQMSARELKSRFSHGE------EILTSGVDSPELPSSVSRSRGKHYNNKKWEGID 104

Query: 504 GNQMVMFNDQHPVKTLNQKSSF-------LSSNSNRMTGSK-------SSNGRRKEVAGS 641
            N ++        +   Q SSF        +SN N+  G K        S G+   +   
Sbjct: 105 LNSLLDKQKGLSRRGFQQGSSFGQEVKPKPNSNVNKPKGGKLGLAFERKSRGKTDSMVNC 164

Query: 642 SSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXXXXXXXXXXXXXPQHQNS 821
           S+     S+  +  G      +     QK+                              
Sbjct: 165 SNPPSSSSSNHKCEGSTARSTITSENTQKYR----------------------------- 195

Query: 822 LEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRV 923
            EVSS  F Q R   S  +VSL KSCVTR+ S +
Sbjct: 196 -EVSSKPFDQKRS-SSIRRVSLGKSCVTRKVSSI 227


>gb|ESW04927.1| hypothetical protein PHAVU_011G137100g [Phaseolus vulgaris]
          Length = 481

 Score = 89.0 bits (219), Expect = 8e-15
 Identities = 77/256 (30%), Positives = 113/256 (44%), Gaps = 20/256 (7%)
 Frame = +3

Query: 219 ETDDKHWEFLDEIEAPLWVDLTLECESGY--NDKDDEWFHVIHPFHECSSRQLIAKFGHS 392
           +T+D +W FL+ IEAP+WVDL +E  SG      DD+WF+  HPFH+ S+R+L +KF   
Sbjct: 13  KTND-NWAFLEHIEAPMWVDLAVEAVSGGVGTGDDDDWFNTSHPFHQMSARELKSKFS-Q 70

Query: 393 AEDHVNLELDIQEKSSPRIPLSVSKSRGKDYRRNGVQGNQMVMFNDQHPVKT-LNQKSSF 569
            E+ +   +D+Q  +SP +P SVS+SRGK Y     +G  +    D+   ++ L Q SSF
Sbjct: 71  GEEILAPGIDLQGVNSPELPSSVSRSRGKHYNNKKWEGIDLNTLLDKQTGRSGLQQCSSF 130

Query: 570 -------LSSNSNR----------MTGSKSSNGRRKEVAGSSSGSRVESNLSEMAGPCDI 698
                  L  N NR          +T    + G+ +     S      S+  +  G    
Sbjct: 131 GQEVKPRLKPNVNRPKRALSGKFGLTFEPDARGKPESKVSCSKPVGSSSSDRKTGGSSAR 190

Query: 699 RKVPPLCDQKFTXXXXXXXXXXXXXXXXXXXXXXXPQHQNSLEVSSHSFGQTRGFLSALK 878
             +     QK+T                              EVSS    Q R   S   
Sbjct: 191 STITSENTQKYT------------------------------EVSSKPCDQKRS-SSIRM 219

Query: 879 VSLRKSCVTRQASRVE 926
           VS  K CVTR+ S+++
Sbjct: 220 VSFGKYCVTRKVSKIQ 235


>ref|XP_004167599.1| PREDICTED: uncharacterized LOC101210465 [Cucumis sativus]
          Length = 312

 Score = 75.1 bits (183), Expect = 1e-10
 Identities = 63/226 (27%), Positives = 90/226 (39%), Gaps = 1/226 (0%)
 Frame = +3

Query: 267 LWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNLELD-IQEKSSP 443
           +WVDL+LE +S   + DD+WF+  H  H+ SS  L   F    ++   L+ + I+  SSP
Sbjct: 1   MWVDLSLEGKSYNQNIDDKWFYTHHQVHQSSSHDLKLVFAQLYDEKKTLDFELIKASSSP 60

Query: 444 RIPLSVSKSRGKDYRRNGVQGNQMVMFNDQHPVKTLNQKSSFLSSNSNRMTGSKSSNGRR 623
            +P SVS+SRGKD+     +GN                   F  +    + GS S     
Sbjct: 61  TLPDSVSRSRGKDFDGRKCKGN----------------CRGFAMNKEVVVIGSSSEG--- 101

Query: 624 KEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXXXXXXXXXXXXX 803
           KE   S + S + S +                                            
Sbjct: 102 KESVDSRTSSTIVSGIGHQ----------------------------------------- 120

Query: 804 PQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIANGR 941
            Q     EV+S S   +   L  ++ SLRKSC TRQASR+E+ N R
Sbjct: 121 -QQHKPTEVTSQSLSSSSKLLLDMRRSLRKSCATRQASRLEVNNCR 165


>ref|XP_004149309.1| PREDICTED: uncharacterized protein LOC101210465 [Cucumis sativus]
          Length = 312

 Score = 75.1 bits (183), Expect = 1e-10
 Identities = 63/226 (27%), Positives = 90/226 (39%), Gaps = 1/226 (0%)
 Frame = +3

Query: 267 LWVDLTLECESGYNDKDDEWFHVIHPFHECSSRQLIAKFGHSAEDHVNLELD-IQEKSSP 443
           +WVDL+LE +S   + DD+WF+  H  H+ SS  L   F    ++   L+ + I+  SSP
Sbjct: 1   MWVDLSLEGKSYNQNIDDKWFYTHHQVHQSSSHDLKLVFAQLYDEKKTLDFELIKASSSP 60

Query: 444 RIPLSVSKSRGKDYRRNGVQGNQMVMFNDQHPVKTLNQKSSFLSSNSNRMTGSKSSNGRR 623
            +P SVS+SRGKD+     +GN                   F  +    + GS S     
Sbjct: 61  TLPDSVSRSRGKDFDGRKCKGN----------------CRGFAMNKEVVVIGSSSEG--- 101

Query: 624 KEVAGSSSGSRVESNLSEMAGPCDIRKVPPLCDQKFTXXXXXXXXXXXXXXXXXXXXXXX 803
           KE   S + S + S +                                            
Sbjct: 102 KESVDSRTSSTIVSGIGHQ----------------------------------------- 120

Query: 804 PQHQNSLEVSSHSFGQTRGFLSALKVSLRKSCVTRQASRVEIANGR 941
            Q     EV+S S   +   L  ++ SLRKSC TRQASR+E+ N R
Sbjct: 121 -QQHKPTEVTSQSLSSSSKLLLDMRRSLRKSCATRQASRLEVNNCR 165

