BLASTX nr result

ID: Perilla23_contig00007406

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00007406
         (1137 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011102097.1| PREDICTED: uncharacterized protein LOC105180...   323   2e-85
ref|XP_011102107.1| PREDICTED: uncharacterized protein LOC105180...   272   4e-70
ref|XP_011070902.1| PREDICTED: uncharacterized protein LOC105156...   218   1e-53
ref|XP_011070897.1| PREDICTED: uncharacterized protein LOC105156...   216   2e-53
ref|XP_012855407.1| PREDICTED: uncharacterized protein LOC105974...   166   4e-38
ref|XP_010091631.1| hypothetical protein L484_026481 [Morus nota...   153   2e-34
ref|XP_012847711.1| PREDICTED: uncharacterized protein LOC105967...   152   4e-34
gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Erythra...   152   4e-34
ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-rela...   152   5e-34
ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-rela...   152   5e-34
ref|XP_012438471.1| PREDICTED: uncharacterized protein LOC105764...   150   1e-33
gb|KHG16888.1| Polyribonucleotide nucleotidyltransferase [Gossyp...   149   3e-33
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   149   5e-33
ref|XP_012478806.1| PREDICTED: uncharacterized protein LOC105794...   147   2e-32
ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794...   147   2e-32
emb|CDP00479.1| unnamed protein product [Coffea canephora]            140   3e-30
gb|KHG21027.1| Formate--tetrahydrofolate ligase [Gossypium arbor...   138   1e-29
ref|XP_010044372.1| PREDICTED: uncharacterized protein LOC104433...   137   1e-29
ref|XP_010044366.1| PREDICTED: uncharacterized protein LOC104433...   137   1e-29
ref|XP_012464097.1| PREDICTED: uncharacterized protein LOC105783...   137   2e-29

>ref|XP_011102097.1| PREDICTED: uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107584|ref|XP_011102098.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107586|ref|XP_011102099.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107588|ref|XP_011102100.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107590|ref|XP_011102101.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107592|ref|XP_011102102.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107594|ref|XP_011102103.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107596|ref|XP_011102104.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum] gi|747107598|ref|XP_011102105.1| PREDICTED:
            uncharacterized protein LOC105180144 isoform X1 [Sesamum
            indicum]
          Length = 600

 Score =  323 bits (827), Expect = 2e-85
 Identities = 184/312 (58%), Positives = 215/312 (68%), Gaps = 11/312 (3%)
 Frame = -2

Query: 1136 DRKLPIEDIDTQSSL-----GLDA-GNKVVQVPDQILYQXXXXXXXXXXXXEVGGGTGED 975
            DR+LPI++  T+S L      LD  GNKV Q PDQI               E   G  ED
Sbjct: 292  DRELPIQEFGTRSFLRSFLNSLDGDGNKVTQPPDQIS-NGKASTKSHAASSEEKQGPKED 350

Query: 974  VQASCLLYNSKVENGSITFNFSSPEGAVPG--NCMTEDVEEQSADSEDV--HKDVNAENL 807
            VQAS LLYNSKVE+GSITFNF+SP   + G  N +TE+V+EQS DS D+  HKD + +NL
Sbjct: 351  VQASSLLYNSKVESGSITFNFNSPAPVLAGITNRLTENVKEQSFDSGDMQEHKDADVDNL 410

Query: 806  PEAMEEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHK 627
            P+  +  VQC  D   D  E  LI   GN DG S T +V  V I++ S  N HE +PE+K
Sbjct: 411  PDGGQ--VQCAGDSTADVQEPSLIIKNGNFDGSSATGHVPPVGIKEGSEENVHEQTPENK 468

Query: 626  QEKSDDVSIDAPVQSPTNESPAPT-NDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETS 450
             + S D+S D  +Q P N+S +   +DV A EPN P HE  +SG+VPVVSQLQ D GETS
Sbjct: 469  DDNSADLSQDCQLQFPNNKSQSSRKDDVQALEPNVPRHEHRNSGNVPVVSQLQYDAGETS 528

Query: 449  FSAASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKH 270
            FSAA +I+YSGPIAFSGSLSHRSDGSTTSG+SFAFPVLQSEWNSSPVRMAKADRRHFRKH
Sbjct: 529  FSAAGLISYSGPIAFSGSLSHRSDGSTTSGRSFAFPVLQSEWNSSPVRMAKADRRHFRKH 588

Query: 269  KGWRSGLLCCRF 234
            KGWR GLLCCRF
Sbjct: 589  KGWRLGLLCCRF 600


>ref|XP_011102107.1| PREDICTED: uncharacterized protein LOC105180144 isoform X2 [Sesamum
            indicum]
          Length = 561

 Score =  272 bits (695), Expect = 4e-70
 Identities = 162/308 (52%), Positives = 186/308 (60%), Gaps = 7/308 (2%)
 Frame = -2

Query: 1136 DRKLPIEDIDTQSSL-----GLDA-GNKVVQVPDQILYQXXXXXXXXXXXXEVGGGTGED 975
            DR+LPI++  T+S L      LD  GNKV Q PDQI               E   G  ED
Sbjct: 292  DRELPIQEFGTRSFLRSFLNSLDGDGNKVTQPPDQIS-NGKASTKSHAASSEEKQGPKED 350

Query: 974  VQASCLLYNSKVENGSITFNFSSPEGAVPGNCMTEDVEEQSADSEDVHKDVNAENLPEAM 795
            VQAS LLYNSK                                    HKD + +NLP+  
Sbjct: 351  VQASSLLYNSKE-----------------------------------HKDADVDNLPDGG 375

Query: 794  EEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKS 615
            +  VQC  D   D  E  LI   GN DG S T +V  V I++ S  N HE +PE+K + S
Sbjct: 376  Q--VQCAGDSTADVQEPSLIIKNGNFDGSSATGHVPPVGIKEGSEENVHEQTPENKDDNS 433

Query: 614  DDVSIDAPVQSPTNESPAPT-NDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETSFSAA 438
             D+S D  +Q P N+S +   +DV A EPN P HE  +SG+VPVVSQLQ D GETSFSAA
Sbjct: 434  ADLSQDCQLQFPNNKSQSSRKDDVQALEPNVPRHEHRNSGNVPVVSQLQYDAGETSFSAA 493

Query: 437  SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWR 258
             +I+YSGPIAFSGSLSHRSDGSTTSG+SFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWR
Sbjct: 494  GLISYSGPIAFSGSLSHRSDGSTTSGRSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWR 553

Query: 257  SGLLCCRF 234
             GLLCCRF
Sbjct: 554  LGLLCCRF 561


>ref|XP_011070902.1| PREDICTED: uncharacterized protein LOC105156465 isoform X2 [Sesamum
            indicum]
          Length = 625

 Score =  218 bits (554), Expect = 1e-53
 Identities = 137/309 (44%), Positives = 174/309 (56%), Gaps = 8/309 (2%)
 Frame = -2

Query: 1136 DRKLPIEDIDTQSSL-----GLDA-GNKVVQVPDQILYQXXXXXXXXXXXXEVGGGTGED 975
            D KLPI++  T+S L      LD   NKV Q+PD+I                   G  ED
Sbjct: 339  DSKLPIQEFGTRSFLRSFLNSLDGESNKVAQLPDEISSSKAVGEAAPP-----AAGPKED 393

Query: 974  VQASCLLYNSKVENGSITFNFSSPEGAVPG--NCMTEDVEEQSADSEDVHKDVNAENLPE 801
            +QAS L YNS+VENGSITFNF+S    V G  N  T+  +EQS DS D  KD N +    
Sbjct: 394  LQASILYYNSEVENGSITFNFNSLAPVVAGVTNGRTDVFKEQSFDSGDCLKDANLDT--- 450

Query: 800  AMEEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQE 621
                        +G  +E       G  +  S TS+  S   +D S  + H+ SP+H+++
Sbjct: 451  -----------SMGKVHELSPAIKHGFPNDISATSHAHSASSKDISNKDVHDHSPDHREK 499

Query: 620  KSDDVSIDAPVQSPTNESPAPTNDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETSFSA 441
              D +S D+ +    N S +   D  A E +   HE  D G+  VV   + +  E+SFSA
Sbjct: 500  DLDGLSTDSQLPFAINSSKS---DGQAVESHVLEHEHKDFGNSSVVGHGKYEQEESSFSA 556

Query: 440  ASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGW 261
            A +IT+SGPI +SGSLS RSDGS  SG+SFAFP+LQSEWNSSPVRM KAD  HFRKHKGW
Sbjct: 557  AGLITFSGPIVYSGSLSVRSDGSAASGRSFAFPILQSEWNSSPVRMGKADGTHFRKHKGW 616

Query: 260  RSGLLCCRF 234
            RS +LCCRF
Sbjct: 617  RSSILCCRF 625


>ref|XP_011070897.1| PREDICTED: uncharacterized protein LOC105156465 isoform X1 [Sesamum
            indicum] gi|747049689|ref|XP_011070898.1| PREDICTED:
            uncharacterized protein LOC105156465 isoform X1 [Sesamum
            indicum] gi|747049691|ref|XP_011070900.1| PREDICTED:
            uncharacterized protein LOC105156465 isoform X1 [Sesamum
            indicum] gi|747049693|ref|XP_011070901.1| PREDICTED:
            uncharacterized protein LOC105156465 isoform X1 [Sesamum
            indicum]
          Length = 626

 Score =  216 bits (551), Expect = 2e-53
 Identities = 136/309 (44%), Positives = 174/309 (56%), Gaps = 8/309 (2%)
 Frame = -2

Query: 1136 DRKLPIEDIDTQSSL-----GLDA-GNKVVQVPDQILYQXXXXXXXXXXXXEVGGGTGED 975
            D KLPI++  T+S L      LD   NKV Q+PD+ +                  G  ED
Sbjct: 339  DSKLPIQEFGTRSFLRSFLNSLDGESNKVAQLPDEKISSSKAVGEAAPP----AAGPKED 394

Query: 974  VQASCLLYNSKVENGSITFNFSSPEGAVPG--NCMTEDVEEQSADSEDVHKDVNAENLPE 801
            +QAS L YNS+VENGSITFNF+S    V G  N  T+  +EQS DS D  KD N +    
Sbjct: 395  LQASILYYNSEVENGSITFNFNSLAPVVAGVTNGRTDVFKEQSFDSGDCLKDANLDT--- 451

Query: 800  AMEEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQE 621
                        +G  +E       G  +  S TS+  S   +D S  + H+ SP+H+++
Sbjct: 452  -----------SMGKVHELSPAIKHGFPNDISATSHAHSASSKDISNKDVHDHSPDHREK 500

Query: 620  KSDDVSIDAPVQSPTNESPAPTNDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETSFSA 441
              D +S D+ +    N S +   D  A E +   HE  D G+  VV   + +  E+SFSA
Sbjct: 501  DLDGLSTDSQLPFAINSSKS---DGQAVESHVLEHEHKDFGNSSVVGHGKYEQEESSFSA 557

Query: 440  ASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGW 261
            A +IT+SGPI +SGSLS RSDGS  SG+SFAFP+LQSEWNSSPVRM KAD  HFRKHKGW
Sbjct: 558  AGLITFSGPIVYSGSLSVRSDGSAASGRSFAFPILQSEWNSSPVRMGKADGTHFRKHKGW 617

Query: 260  RSGLLCCRF 234
            RS +LCCRF
Sbjct: 618  RSSILCCRF 626


>ref|XP_012855407.1| PREDICTED: uncharacterized protein LOC105974797 [Erythranthe
            guttatus] gi|848915246|ref|XP_012855408.1| PREDICTED:
            uncharacterized protein LOC105974797 [Erythranthe
            guttatus]
          Length = 577

 Score =  166 bits (419), Expect = 4e-38
 Identities = 125/318 (39%), Positives = 161/318 (50%), Gaps = 17/318 (5%)
 Frame = -2

Query: 1136 DRKLPIEDIDTQSSL--------GLDAGN----KVVQVPDQILYQXXXXXXXXXXXXEVG 993
            D+ LPI++  T+S L        G D+ N    KV  + DQ +                 
Sbjct: 306  DKTLPIQEFGTRSFLRSFINSFDGDDSTNNNSFKVEHLQDQEISSSE------------A 353

Query: 992  GGTGEDVQASCLLYNSKVENGSITFNFSSPEGA-VPGNC---MTEDVEEQSADSEDVHKD 825
            G  G+ VQAS L + SKVENGSITFNF SP  A  P      +TE++EEQS        +
Sbjct: 354  GPKGDHVQASSLSFKSKVENGSITFNFKSPAPAPAPAGVTKPVTENIEEQSIIGSGNPSN 413

Query: 824  VNAENLPEAMEEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHE 645
                ++ E  EEK   T+D V   ++          D  S T  V  + +E  +     +
Sbjct: 414  ATTTSIGEIEEEKPLKTADDVFVFSD----------DYSSTTKKVDKLEMEIVT----PK 459

Query: 644  GSPEHKQEKSDDVSIDAPVQSPTNESPAPTNDVLASEPNAPNHEGMDSGDVPVVSQLQQD 465
              P   +E S D  + +  +S                 N+ N E    G     + L++D
Sbjct: 460  TIPSVAKESSSDDGVSSSGKSV----------------NSVNSEVTLGGS----NFLKRD 499

Query: 464  LGETSFSAASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKAD-R 288
             GETSFSA  +IT+SGP A+SGS+SHRSDGS TSG+SFAFPVLQ EWNSSP R+ K    
Sbjct: 500  EGETSFSAGGLITFSGPAAYSGSISHRSDGSATSGRSFAFPVLQEEWNSSPERIPKEGVG 559

Query: 287  RHFRKHKGWRSGLLCCRF 234
            R FRKHKGWRSGLLCCRF
Sbjct: 560  RDFRKHKGWRSGLLCCRF 577


>ref|XP_010091631.1| hypothetical protein L484_026481 [Morus notabilis]
            gi|587854872|gb|EXB44897.1| hypothetical protein
            L484_026481 [Morus notabilis]
          Length = 642

 Score =  153 bits (387), Expect = 2e-34
 Identities = 107/274 (39%), Positives = 136/274 (49%), Gaps = 32/274 (11%)
 Frame = -2

Query: 959  LLYNSKVENGSITFNFSS------PEGAVPGNCMTEDVEEQSADSEDVHKDVNAENLPEA 798
            L YNSKVE   ITF+F S       +   P N ++E +E ++  + D   DV      + 
Sbjct: 374  LAYNSKVEKRRITFDFRSLATVPVAKEECPQNGISERLETENISTVD---DVTTNM--QF 428

Query: 797  MEEKVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQ--SVRIEDSSCGNDHEGSPEHKQ 624
            +  +VQ  S  +  T E C  N           S V+  S   +       HE + E   
Sbjct: 429  VSSQVQHDSSPLTGTREDCFQNAVHECGQTQNMSVVEDGSANAQIVPSNAQHEVAREEVP 488

Query: 623  EKSDDVSIDAPVQSPTNESPA-------------------PTNDVLASE-PNAPNHEGMD 504
            +      ++ P  S  N+  +                   P+ D L  E P+ P      
Sbjct: 489  QNGVCTCVETPNTSSVNDDTSGLQKVSSSLQHVTAREEGLPSTDTLCCETPDTPMVVDGI 548

Query: 503  SGDVPVVSQLQQDLGETSFSAAS----MITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVL 336
            SG   V    Q  +GE+SFSAA      I YSGPI +SGS+S RSD STTS +SFAFPVL
Sbjct: 549  SGSQVVSGHFQYGVGESSFSAAGPLSGRINYSGPIPYSGSISLRSDSSTTSTRSFAFPVL 608

Query: 335  QSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
            QSEWNSSPVRMAKADRRHFRKH+GWR G+LCCRF
Sbjct: 609  QSEWNSSPVRMAKADRRHFRKHRGWRQGILCCRF 642


>ref|XP_012847711.1| PREDICTED: uncharacterized protein LOC105967645 [Erythranthe
           guttatus]
          Length = 503

 Score =  152 bits (385), Expect = 4e-34
 Identities = 71/101 (70%), Positives = 83/101 (82%)
 Frame = -2

Query: 536 EPNAPNHEGMDSGDVPVVSQLQQDLGETSFSAASMITYSGPIAFSGSLSHRSDGSTTSGK 357
           + N  N E  +     +  Q++ + GETSF+AAS++TYSGPIA+SGSLS RSDGS  SG+
Sbjct: 403 DENEKNSEQNEGSSAIISRQMKYEEGETSFAAASLVTYSGPIAYSGSLSLRSDGSAASGR 462

Query: 356 SFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           SFAFP+LQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF
Sbjct: 463 SFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 503


>gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Erythranthe guttata]
          Length = 424

 Score =  152 bits (385), Expect = 4e-34
 Identities = 71/101 (70%), Positives = 83/101 (82%)
 Frame = -2

Query: 536 EPNAPNHEGMDSGDVPVVSQLQQDLGETSFSAASMITYSGPIAFSGSLSHRSDGSTTSGK 357
           + N  N E  +     +  Q++ + GETSF+AAS++TYSGPIA+SGSLS RSDGS  SG+
Sbjct: 324 DENEKNSEQNEGSSAIISRQMKYEEGETSFAAASLVTYSGPIAYSGSLSLRSDGSAASGR 383

Query: 356 SFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           SFAFP+LQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF
Sbjct: 384 SFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 424


>ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
           [Theobroma cacao] gi|590698568|ref|XP_007045751.1| 18S
           pre-ribosomal assembly protein gar2-related, putative
           isoform 2 [Theobroma cacao]
           gi|590698571|ref|XP_007045752.1| 18S pre-ribosomal
           assembly protein gar2-related, putative isoform 2
           [Theobroma cacao] gi|508709685|gb|EOY01582.1| 18S
           pre-ribosomal assembly protein gar2-related, putative
           isoform 2 [Theobroma cacao] gi|508709686|gb|EOY01583.1|
           18S pre-ribosomal assembly protein gar2-related,
           putative isoform 2 [Theobroma cacao]
           gi|508709687|gb|EOY01584.1| 18S pre-ribosomal assembly
           protein gar2-related, putative isoform 2 [Theobroma
           cacao]
          Length = 470

 Score =  152 bits (384), Expect = 5e-34
 Identities = 97/240 (40%), Positives = 138/240 (57%), Gaps = 19/240 (7%)
 Frame = -2

Query: 896 AVPGNCMTEDVEEQSADSEDVHKDVNAENLPEAMEEKVQCTSDRVGDTNEQCLIN----- 732
           A+  +C ++ +E+QS  S    + +    L  A+EE          D+NE+ +++     
Sbjct: 243 AMSSDCKSDGIEQQSFQSSSKKEVMVMPPLVSAVEESK--------DSNEEAIVSVPALV 294

Query: 731 ------DEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDDVSIDAPVQSPTNE 570
                 D G  +   ++    S   E +S    +E S ++K E +  ++ +    +PT+ 
Sbjct: 295 SATEELDSGKGEAILISPAQVSTSEESTSSSLVNEVSYDNKLE-TGSITFNLDSSAPTS- 352

Query: 569 SPAPTNDVLASEP----NAPNHEGMDSGDVPVVSQLQQDLGETSFSAASMIT----YSGP 414
           S    +  L SEP    + P  E   + D  + + LQQ +GE+SFSAA ++T    YSGP
Sbjct: 353 SKDECHHNLDSEPLGTGSTPKLEV--AADQSISNNLQQGIGESSFSAAGLVTGLISYSGP 410

Query: 413 IAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           +A+SGSLS RSD STTS +SFAFP+LQSEWN SPVRMAKADRRH+RKHKGWR GLLCCRF
Sbjct: 411 VAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
           [Theobroma cacao] gi|508709684|gb|EOY01581.1| 18S
           pre-ribosomal assembly protein gar2-related, putative
           isoform 1 [Theobroma cacao]
          Length = 527

 Score =  152 bits (384), Expect = 5e-34
 Identities = 97/240 (40%), Positives = 138/240 (57%), Gaps = 19/240 (7%)
 Frame = -2

Query: 896 AVPGNCMTEDVEEQSADSEDVHKDVNAENLPEAMEEKVQCTSDRVGDTNEQCLIN----- 732
           A+  +C ++ +E+QS  S    + +    L  A+EE          D+NE+ +++     
Sbjct: 300 AMSSDCKSDGIEQQSFQSSSKKEVMVMPPLVSAVEESK--------DSNEEAIVSVPALV 351

Query: 731 ------DEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDDVSIDAPVQSPTNE 570
                 D G  +   ++    S   E +S    +E S ++K E +  ++ +    +PT+ 
Sbjct: 352 SATEELDSGKGEAILISPAQVSTSEESTSSSLVNEVSYDNKLE-TGSITFNLDSSAPTS- 409

Query: 569 SPAPTNDVLASEP----NAPNHEGMDSGDVPVVSQLQQDLGETSFSAASMIT----YSGP 414
           S    +  L SEP    + P  E   + D  + + LQQ +GE+SFSAA ++T    YSGP
Sbjct: 410 SKDECHHNLDSEPLGTGSTPKLEV--AADQSISNNLQQGIGESSFSAAGLVTGLISYSGP 467

Query: 413 IAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           +A+SGSLS RSD STTS +SFAFP+LQSEWN SPVRMAKADRRH+RKHKGWR GLLCCRF
Sbjct: 468 VAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>ref|XP_012438471.1| PREDICTED: uncharacterized protein LOC105764449 [Gossypium
           raimondii] gi|823211183|ref|XP_012438472.1| PREDICTED:
           uncharacterized protein LOC105764449 [Gossypium
           raimondii] gi|763783453|gb|KJB50524.1| hypothetical
           protein B456_008G175400 [Gossypium raimondii]
           gi|763783454|gb|KJB50525.1| hypothetical protein
           B456_008G175400 [Gossypium raimondii]
          Length = 397

 Score =  150 bits (380), Expect = 1e-33
 Identities = 110/304 (36%), Positives = 156/304 (51%), Gaps = 50/304 (16%)
 Frame = -2

Query: 995 GGGTGEDVQASC-----LLYNSKVENGSITFNFSSPEGAVPGNCMTEDV---EEQSADS- 843
           G  +G+D+   C     L  ++ +++ S+     S +G +P  C ++D+    E   D+ 
Sbjct: 98  GNQSGKDIDDECGMKKKLDADTCIQDVSLLEESESNKG-IPCQCDSKDLILSREMKEDAV 156

Query: 842 ----EDVHKDVNAENLPEAM--------EEKVQCTSDRVGDTNEQCLINDEGNSDGGSVT 699
               EDV K++    L E +        + ++ C+  R   T +Q   N + +S+     
Sbjct: 157 KMITEDVSKELYTLGLGELLLMSEMSTVKAEIVCSDCRSDGTQQQ---NFQNSSEKEVTV 213

Query: 698 SYVQSVRIEDSSCGNDH------------EGSPEHKQE---------KSDDVSIDAPVQS 582
                  +E+S+ GN+             EGS   K E          + + S  + +  
Sbjct: 214 MPALVSPVEESNNGNEEAILSAPALVSAAEGSEHGKWEATLISPVLASASEESTGSRIVD 273

Query: 581 PTNESPAPTNDV------LASEPNAPNHEGM--DSGDVPVVSQLQQDLGETSFSAASMIT 426
             ++S A T+        L  EP A        D  D  + S LQ+  GE SFSAA +IT
Sbjct: 274 EVSDSSARTSSKDRCCHNLDLEPLASGSTPKLEDPADQLLSSNLQRGYGECSFSAAGLIT 333

Query: 425 YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLL 246
           YSGPIA+SGSLSHRSD STTS +SFAFP+LQSEWNSSPVRMAKA+ RH+RKH+GWR GLL
Sbjct: 334 YSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKAEGRHYRKHRGWRQGLL 393

Query: 245 CCRF 234
           CCRF
Sbjct: 394 CCRF 397


>gb|KHG16888.1| Polyribonucleotide nucleotidyltransferase [Gossypium arboreum]
          Length = 408

 Score =  149 bits (377), Expect = 3e-33
 Identities = 107/304 (35%), Positives = 156/304 (51%), Gaps = 50/304 (16%)
 Frame = -2

Query: 995  GGGTGEDVQASC-----LLYNSKVENGSITFNFSSPEGAVPGNCMTEDV---EEQSADS- 843
            G  +G+D+   C     L  ++ +++ S+     S +G +P  C ++D+    E   D+ 
Sbjct: 109  GNQSGKDIDDKCSTKKKLDADTCIQDVSLLEESESNKG-IPYQCDSKDLILSREMKEDAV 167

Query: 842  ----EDVHKDVNAENLPEAM--------EEKVQCTSDRVGDTNEQCLINDEGNSDGGSVT 699
                EDV K +    L E +        + ++ C+  R   T +Q   N +  S+  +  
Sbjct: 168  KMITEDVSKKLYTLGLGELLLMSEMSTVKAEIVCSDCRSDGTQQQ---NFQNLSEKEATV 224

Query: 698  SYVQSVRIEDSSCGNDHE--------GSPEHKQEKSDDVSIDAPVQSPTNESPAPTNDV- 546
                   +E+S+ GN+           + E  +    + ++ +PV +  +E    +  V 
Sbjct: 225  MPALVSPVEESNNGNEEAILSAPALVSAAEESEHGKWEATLISPVLASASEESTGSRIVD 284

Query: 545  LASEPNAPNH-----------EGMDSGDVPVV---------SQLQQDLGETSFSAASMIT 426
              S+ +A              E + SG  P V         S LQ+  GE SFSAA +IT
Sbjct: 285  EVSDSSAQTSSKDRCCHNLDLEPLASGSTPKVEDPADQLLSSNLQRGYGECSFSAAGLIT 344

Query: 425  YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLL 246
            YSGPIA+SGSLSHRSD STTS +SFAFP+LQSEWNSSPVRMAKA+RRH+RKH+GWR G L
Sbjct: 345  YSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKAERRHYRKHRGWRQGFL 404

Query: 245  CCRF 234
            CCRF
Sbjct: 405  CCRF 408


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
           gi|223546192|gb|EEF47694.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 488

 Score =  149 bits (375), Expect = 5e-33
 Identities = 85/171 (49%), Positives = 101/171 (59%), Gaps = 4/171 (2%)
 Frame = -2

Query: 734 NDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDDVSIDAPVQSPTNESPAPT 555
           +D G+ D   + S   S   E+   G     SP H  +   D++  AP  S   E     
Sbjct: 320 SDHGH-DEVILASLAPSYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEGSQVG 378

Query: 554 NDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETSFSAAS----MITYSGPIAFSGSLSH 387
                   N+  HE     + P   QLQ   GE+SFSAA     +I+YSGPIA+SGSLS 
Sbjct: 379 GSEHLESRNSSRHEDTSITE-PFSGQLQYSHGESSFSAAGPLSGLISYSGPIAYSGSLSL 437

Query: 386 RSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           RSD STTS +SFAFP+LQSEWNSSPVRMAKADRRHFRKH+ WR GLLCCRF
Sbjct: 438 RSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLLCCRF 488


>ref|XP_012478806.1| PREDICTED: uncharacterized protein LOC105794265 isoform X1 [Gossypium
            raimondii] gi|823157856|ref|XP_012478807.1| PREDICTED:
            uncharacterized protein LOC105794265 isoform X1
            [Gossypium raimondii] gi|823157858|ref|XP_012478808.1|
            PREDICTED: uncharacterized protein LOC105794265 isoform
            X1 [Gossypium raimondii] gi|763763266|gb|KJB30520.1|
            hypothetical protein B456_005G147700 [Gossypium
            raimondii] gi|763763269|gb|KJB30523.1| hypothetical
            protein B456_005G147700 [Gossypium raimondii]
          Length = 518

 Score =  147 bits (370), Expect = 2e-32
 Identities = 105/286 (36%), Positives = 144/286 (50%), Gaps = 54/286 (18%)
 Frame = -2

Query: 929  SITFNFSSPEGAVPGNCMTEDVEEQSADSEDVHK----DVNAE--------NLPEAMEEK 786
            S++   + P+  +P  C TED+      ++D  K    DV+ E        ++PE    K
Sbjct: 237  SLSLEENEPKNRIPSQCDTEDLILSRKMTDDTMKMARDDVSKELFTLGELLSMPELSTVK 296

Query: 785  VQ-----CTSDRVGDTNEQCLIN-----------------DEGNSDGGSVTSYVQSVRIE 672
             +     C SD +    +QC  N                 +  NS   ++ S    V + 
Sbjct: 297  PKAMSSNCKSDGI---KQQCFQNSKEKEVMVMPPLVSADKESDNSSKETILSASAPVSVA 353

Query: 671  DSSCGNDHEG---SPEHKQEKSDDVSIDAPVQSPTNESPAPTNDVLASEPNAPNH----E 513
            +       E    SP       ++VS D+ + + +      ++  L S  N   H    E
Sbjct: 354  EEMDSRKEEATMFSPVTSSSLVNEVSDDSKLAARSIAFGFDSS-ALTSSKNEGCHNLDRE 412

Query: 512  GMDSG---------DVPVVSQLQQDLGETSFSAASMIT----YSGPIAFSGSLSHRSDGS 372
             +++G         D P  + LQ   GE+SFSAA ++T    YSGPIA+SGSLSHRSD S
Sbjct: 413  ALETGHTPKLEDIADQPSSNNLQCGNGESSFSAAGLVTGLISYSGPIAYSGSLSHRSDSS 472

Query: 371  TTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
            TTS +SFAFP+LQSEWNSSPVRMAKADRRH+RKH+GWR GLLCCRF
Sbjct: 473  TTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 518


>ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794265 isoform X2 [Gossypium
            raimondii] gi|823157862|ref|XP_012478810.1| PREDICTED:
            uncharacterized protein LOC105794265 isoform X2
            [Gossypium raimondii] gi|763763265|gb|KJB30519.1|
            hypothetical protein B456_005G147700 [Gossypium
            raimondii] gi|763763268|gb|KJB30522.1| hypothetical
            protein B456_005G147700 [Gossypium raimondii]
          Length = 466

 Score =  147 bits (370), Expect = 2e-32
 Identities = 105/286 (36%), Positives = 144/286 (50%), Gaps = 54/286 (18%)
 Frame = -2

Query: 929  SITFNFSSPEGAVPGNCMTEDVEEQSADSEDVHK----DVNAE--------NLPEAMEEK 786
            S++   + P+  +P  C TED+      ++D  K    DV+ E        ++PE    K
Sbjct: 185  SLSLEENEPKNRIPSQCDTEDLILSRKMTDDTMKMARDDVSKELFTLGELLSMPELSTVK 244

Query: 785  VQ-----CTSDRVGDTNEQCLIN-----------------DEGNSDGGSVTSYVQSVRIE 672
             +     C SD +    +QC  N                 +  NS   ++ S    V + 
Sbjct: 245  PKAMSSNCKSDGI---KQQCFQNSKEKEVMVMPPLVSADKESDNSSKETILSASAPVSVA 301

Query: 671  DSSCGNDHEG---SPEHKQEKSDDVSIDAPVQSPTNESPAPTNDVLASEPNAPNH----E 513
            +       E    SP       ++VS D+ + + +      ++  L S  N   H    E
Sbjct: 302  EEMDSRKEEATMFSPVTSSSLVNEVSDDSKLAARSIAFGFDSS-ALTSSKNEGCHNLDRE 360

Query: 512  GMDSG---------DVPVVSQLQQDLGETSFSAASMIT----YSGPIAFSGSLSHRSDGS 372
             +++G         D P  + LQ   GE+SFSAA ++T    YSGPIA+SGSLSHRSD S
Sbjct: 361  ALETGHTPKLEDIADQPSSNNLQCGNGESSFSAAGLVTGLISYSGPIAYSGSLSHRSDSS 420

Query: 371  TTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
            TTS +SFAFP+LQSEWNSSPVRMAKADRRH+RKH+GWR GLLCCRF
Sbjct: 421  TTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 466


>emb|CDP00479.1| unnamed protein product [Coffea canephora]
          Length = 548

 Score =  140 bits (352), Expect = 3e-30
 Identities = 96/249 (38%), Positives = 121/249 (48%), Gaps = 4/249 (1%)
 Frame = -2

Query: 968 ASCLLYNSKVENGSITFNFSSPEGAVPGNCMTEDVEEQSADSEDVHKDVNAENLPEAMEE 789
           A+ L YNSKVE+G+ITF+F SP+ A+                 D H D + EN  E + +
Sbjct: 372 ANNLHYNSKVESGTITFDFKSPKPAI-----------------DSHADESGENSHEEVLK 414

Query: 788 KVQCTSDRVGDTNEQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDD 609
                                                          EG   HKQE   D
Sbjct: 415 S----------------------------------------------EGVLNHKQENLTD 428

Query: 608 VSIDAPVQSPTNESPAPTNDVLASEPNAPNHEGMDSGDVPVVSQLQQDLGETSFSA---- 441
            S  A ++     S    N+    EP A   + +D       SQ+ +  GE+SFS+    
Sbjct: 429 QSA-ALIECG---SSTDKNETTVHEPKAQQQDAVDHP-----SQVHRGGGESSFSSTGPL 479

Query: 440 ASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGW 261
           + +ITYSGPIA+SGS S RSD STTS +SFAFP+LQSEWNSSPVRM KA+RRH RKH+GW
Sbjct: 480 SGLITYSGPIAYSGSTSLRSDSSTTSTRSFAFPILQSEWNSSPVRMTKAERRHIRKHRGW 539

Query: 260 RSGLLCCRF 234
             GL CCRF
Sbjct: 540 IQGLFCCRF 548


>gb|KHG21027.1| Formate--tetrahydrofolate ligase [Gossypium arboreum]
          Length = 505

 Score =  138 bits (347), Expect = 1e-29
 Identities = 88/225 (39%), Positives = 119/225 (52%), Gaps = 16/225 (7%)
 Frame = -2

Query: 860 EQSADSEDVHKDVNAENLPEAMEEKVQCTSDRVGDTNEQCLI----------NDEGNSDG 711
           +  A S D   D N +   E   +K    +  V D+N   L           +D G  + 
Sbjct: 297 KSEAMSPDFKSDRNEQQSFENSSKKEVIVASEVEDSNNLILSAPALASTAEGSDSGKGEA 356

Query: 710 GSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDDVSIDAPVQSPTNESPAPTNDVLASEP 531
             ++    S  +E +S G  +E         +  ++ D+        S APT+   +SEP
Sbjct: 357 TPISPAPASASLEATSSGLVNE---------TGSITFDS-------RSSAPTSGKGSSEP 400

Query: 530 NAPNHEGM--DSGDVPVVSQLQQDLGETSFSAAS----MITYSGPIAFSGSLSHRSDGST 369
                     ++ D P  S LQ   GE+SFSAA     +I+YSGPI +SG+LS RSD ST
Sbjct: 401 LETGRTSKLEETADQPFSSNLQSGNGESSFSAAGPLTGLISYSGPITYSGNLSLRSDSST 460

Query: 368 TSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 234
           TS +SFAFP+LQSEWNSSPVRMAKAD+R +R+H+GWR G LCCRF
Sbjct: 461 TSTRSFAFPILQSEWNSSPVRMAKADQRQYRRHRGWRQGFLCCRF 505


>ref|XP_010044372.1| PREDICTED: uncharacterized protein LOC104433356 isoform X3
           [Eucalyptus grandis]
          Length = 354

 Score =  137 bits (346), Expect = 1e-29
 Identities = 100/262 (38%), Positives = 138/262 (52%), Gaps = 38/262 (14%)
 Frame = -2

Query: 905 PEGAVPGNCMTEDVEEQSADSEDVHKDV----------NAENLPEAMEEKVQCTSDRVGD 756
           P   VP     +DV +  + ++DV  ++          N   LP + E   +  +D    
Sbjct: 100 PANLVPTEITADDVADAKSKAKDVASEILLCNSSDNVCNGIGLPSSKESYSEQMTDAAAL 159

Query: 755 TNEQCLINDEGNSDGGSVTS-YVQSVRIEDSSCGNDH-EGSPEHKQEKSDDVSIDAPVQS 582
           T+   +I  +G+ +  S TS Y  S      S  + H E  P++   K++ VS   PV  
Sbjct: 160 TSASEVI--QGSLEDASATSGYPSSASNSKDSLNHSHIEKVPDNC--KANQVSCP-PVSR 214

Query: 581 PTNES-------------------PAPTN--DVLASEP-NAPNHEGMDSGDVPVVSQLQQ 468
            T ++                   PA T+  ++L +E  N   H+G+       ++    
Sbjct: 215 GTKDARQANKAERRSTPSDSNASAPASTSGEEILTTETRNKMKHDGVSDSQTERLADTFS 274

Query: 467 DLGETSFSAA----SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMA 300
             GE+SFS A    S+ITYSGPIA+SG+LS RSDGSTTS +SFAFPVL +EWNSSPVRMA
Sbjct: 275 --GESSFSMAGPVSSLITYSGPIAYSGNLSLRSDGSTTSTRSFAFPVLHNEWNSSPVRMA 332

Query: 299 KADRRHFRKHKGWRSGLLCCRF 234
           KADRR FR+H+GWR GLLCCRF
Sbjct: 333 KADRRIFRRHRGWRHGLLCCRF 354


>ref|XP_010044366.1| PREDICTED: uncharacterized protein LOC104433356 isoform X1
           [Eucalyptus grandis] gi|702275825|ref|XP_010044369.1|
           PREDICTED: uncharacterized protein LOC104433356 isoform
           X1 [Eucalyptus grandis] gi|702275830|ref|XP_010044370.1|
           PREDICTED: uncharacterized protein LOC104433356 isoform
           X1 [Eucalyptus grandis]
          Length = 409

 Score =  137 bits (346), Expect = 1e-29
 Identities = 100/262 (38%), Positives = 138/262 (52%), Gaps = 38/262 (14%)
 Frame = -2

Query: 905 PEGAVPGNCMTEDVEEQSADSEDVHKDV----------NAENLPEAMEEKVQCTSDRVGD 756
           P   VP     +DV +  + ++DV  ++          N   LP + E   +  +D    
Sbjct: 155 PANLVPTEITADDVADAKSKAKDVASEILLCNSSDNVCNGIGLPSSKESYSEQMTDAAAL 214

Query: 755 TNEQCLINDEGNSDGGSVTS-YVQSVRIEDSSCGNDH-EGSPEHKQEKSDDVSIDAPVQS 582
           T+   +I  +G+ +  S TS Y  S      S  + H E  P++   K++ VS   PV  
Sbjct: 215 TSASEVI--QGSLEDASATSGYPSSASNSKDSLNHSHIEKVPDNC--KANQVSCP-PVSR 269

Query: 581 PTNES-------------------PAPTN--DVLASEP-NAPNHEGMDSGDVPVVSQLQQ 468
            T ++                   PA T+  ++L +E  N   H+G+       ++    
Sbjct: 270 GTKDARQANKAERRSTPSDSNASAPASTSGEEILTTETRNKMKHDGVSDSQTERLADTFS 329

Query: 467 DLGETSFSAA----SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMA 300
             GE+SFS A    S+ITYSGPIA+SG+LS RSDGSTTS +SFAFPVL +EWNSSPVRMA
Sbjct: 330 --GESSFSMAGPVSSLITYSGPIAYSGNLSLRSDGSTTSTRSFAFPVLHNEWNSSPVRMA 387

Query: 299 KADRRHFRKHKGWRSGLLCCRF 234
           KADRR FR+H+GWR GLLCCRF
Sbjct: 388 KADRRIFRRHRGWRHGLLCCRF 409


>ref|XP_012464097.1| PREDICTED: uncharacterized protein LOC105783281 [Gossypium
           raimondii] gi|823262692|ref|XP_012464099.1| PREDICTED:
           uncharacterized protein LOC105783281 [Gossypium
           raimondii] gi|763813583|gb|KJB80435.1| hypothetical
           protein B456_013G097400 [Gossypium raimondii]
           gi|763813584|gb|KJB80436.1| hypothetical protein
           B456_013G097400 [Gossypium raimondii]
          Length = 505

 Score =  137 bits (345), Expect = 2e-29
 Identities = 95/247 (38%), Positives = 130/247 (52%), Gaps = 32/247 (12%)
 Frame = -2

Query: 878 MTEDVEEQSAD--SEDVHKDV----NAENLPEAMEEKVQ-----CTSDRV------GDTN 750
           +T D+++ + +  S D  K++    +  +LPE    K +     C SDR+        + 
Sbjct: 261 VTRDMKDDAMEMMSNDGSKELFTLGDILSLPELATLKSEAMSPDCKSDRIEQQSFENSSK 320

Query: 749 EQCLINDEGNSDGGSVTSYVQSVRIEDSSCGNDHEGSPEHKQEKSDDVSIDAPVQSPTNE 570
           ++ ++          + S    V   + S     E +P      S   S++A      NE
Sbjct: 321 KEVIVASAVEESNNLILSAPALVSTAEGSDIGKGEATPISPAPAS--ASLEATSSGLVNE 378

Query: 569 SPAPTNDVLASEP------NAPNHEGMDS-----GDVPVVSQLQQDLGETSFSAAS---- 435
           + + T D  +S P      N P   G  S      D P  S LQ   GE+SFSAA     
Sbjct: 379 TGSITFDSRSSAPTSGKGSNKPLEAGRTSKLEETADQPFSSNLQSGNGESSFSAAGPLTG 438

Query: 434 MITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRHFRKHKGWRS 255
           +I+YSGPIA+SG+LS RSD STTS +SFAFP+LQSEWNSSPVRMAKADRR +R+H+GWR 
Sbjct: 439 LISYSGPIAYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRQYRRHRGWRQ 498

Query: 254 GLLCCRF 234
           G LCCRF
Sbjct: 499 GFLCCRF 505

