BLASTX nr result

ID: Alisma22_contig00022350

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00022350
         (917 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

CBI24867.3 unnamed protein product, partial [Vitis vinifera]          139   2e-33
OMO95816.1 hypothetical protein COLO4_15658 [Corchorus olitorius]     137   2e-32
OAY74806.1 General transcription factor 3C polypeptide 2 [Ananas...   134   2e-31
XP_018438205.1 PREDICTED: general transcription factor 3C polype...   132   8e-31
XP_020086694.1 uncharacterized protein LOC109709049 [Ananas como...   132   8e-31
XP_018438195.1 PREDICTED: uncharacterized protein LOC108810587 i...   132   1e-30
EOX93902.1 DNA binding protein, putative isoform 2 [Theobroma ca...   132   1e-30
XP_017969461.1 PREDICTED: uncharacterized protein LOC18612763 is...   132   1e-30
EOX93901.1 DNA binding protein, putative isoform 1 [Theobroma ca...   132   1e-30
XP_017969459.1 PREDICTED: uncharacterized protein LOC18612763 is...   132   1e-30
XP_017969458.1 PREDICTED: uncharacterized protein LOC18612763 is...   132   1e-30
XP_017969462.1 PREDICTED: uncharacterized protein LOC18612763 is...   132   1e-30
XP_010906991.1 PREDICTED: uncharacterized protein LOC105033760 i...   130   4e-30
XP_007145328.1 hypothetical protein PHAVU_007G229800g [Phaseolus...   129   1e-29
XP_019709366.1 PREDICTED: uncharacterized protein LOC105033760 i...   128   2e-29
XP_019709363.1 PREDICTED: uncharacterized protein LOC105033760 i...   128   2e-29
XP_019709351.1 PREDICTED: uncharacterized protein LOC105033760 i...   128   2e-29
XP_010906976.1 PREDICTED: uncharacterized protein LOC105033760 i...   128   2e-29
XP_006304420.1 hypothetical protein CARUB_v10010997mg [Capsella ...   127   3e-29
GAU42322.1 hypothetical protein TSUD_25470 [Trifolium subterraneum]   124   6e-29

>CBI24867.3 unnamed protein product, partial [Vitis vinifera]
          Length = 834

 Score =  139 bits (351), Expect = 2e-33
 Identities = 86/221 (38%), Positives = 119/221 (53%), Gaps = 20/221 (9%)
 Frame = +3

Query: 9    SDTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID 185
            +D PVTGK FS  + PG    + S F IWSVQ SR  G+ AYC ADG+ ++F+L    ++
Sbjct: 603  NDVPVTGKPFSGTQQPGLICYSCSPFPIWSVQVSRATGLAAYCSADGTVRQFQLTIKAVE 662

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQA---------------DMKQPKN 320
                 +   FLCGSL    S L ++TPL      V +A               +  Q K 
Sbjct: 663  KDSRNKAPHFLCGSLTEDNSVLTINTPLSTIPFVVKKALNQWGDTPRSIRGISESNQAKR 722

Query: 321  KDTHKAGPNKGTIAAP-KYKTHSKAEGEVHERTE--IMCSLAE-EELVNLQSETCTPLRT 488
             +  K+      +++  K KT SK+  + + + +   +CS  E E L N +         
Sbjct: 723  VNNQKSNDQPLDLSSKRKQKTKSKSSSKKNPKKDQAALCSYEEAENLENKEDRKEEGGNE 782

Query: 489  IESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEIT 611
            IE FPSK +ALH+VRWN N+GSE WLC+GGAAGI+RCQ+IT
Sbjct: 783  IEVFPSKIVALHRVRWNMNKGSEGWLCYGGAAGIVRCQKIT 823


>OMO95816.1 hypothetical protein COLO4_15658 [Corchorus olitorius]
          Length = 1008

 Score =  137 bits (344), Expect = 2e-32
 Identities = 91/246 (36%), Positives = 127/246 (51%), Gaps = 42/246 (17%)
 Frame = +3

Query: 6    LSDTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNI 182
            +SD PVTGK F+  +  G      SSFAIW++Q SR  G+VAYC ADG+   F+L    +
Sbjct: 755  VSDVPVTGKPFTGTKQQGLHLYNCSSFAIWNIQVSRLTGMVAYCGADGTVSHFQLTSKAV 814

Query: 183  DS-MCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQA----------------DMKQ 311
            D      R   F+CGSL+   S + ++TPLP+  L + ++                +  Q
Sbjct: 815  DKDFSRNRAPHFVCGSLIEEESVITINTPLPDIPLTMKKSTSDYGEGPRSMRAFLTETNQ 874

Query: 312  PKNKDTHKA--------------GPNKG-------TIAAPKYKTHSKAEGEVHERTEIMC 428
             KN    KA              G + G       T+AA K K    ++ E +++ +   
Sbjct: 875  AKNAKDKKAKVQTSDKQTLALCYGDDPGVESDSEETLAALKCKKKQNSQSERNKKADNDQ 934

Query: 429  SLA---EEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRC 599
            +LA   EE   N Q E       IE FP+K +A+H+VRWN N+GSE+WLC+GGAAGI+RC
Sbjct: 935  ALAIRIEEATNNTQKEETG--NEIEVFPAKMVAMHRVRWNMNKGSERWLCYGGAAGIVRC 992

Query: 600  QEITWP 617
            QEI  P
Sbjct: 993  QEIKVP 998


>OAY74806.1 General transcription factor 3C polypeptide 2 [Ananas comosus]
          Length = 1068

 Score =  134 bits (336), Expect = 2e-31
 Identities = 85/221 (38%), Positives = 116/221 (52%), Gaps = 21/221 (9%)
 Frame = +3

Query: 9    SDTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID 185
            +D PVTG  F+  +  G      S+FAIWSVQ S+  GVVAYC ADGS  RF+L  N + 
Sbjct: 838  NDIPVTGSPFAGTKCQGLHGFTCSAFAIWSVQVSQTTGVVAYCSADGSVVRFQLRANYVK 897

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPN---HLLPVSQA---------DMKQPKNKDT 329
                 RV  FLCGSL   G  L +++ LPN     +PVS           D  +PK  ++
Sbjct: 898  DPNRNRVPYFLCGSLTEEGHVLNINSSLPNIPLRNVPVSLKKVPGHGYLFDAGRPKEANS 957

Query: 330  HKAGPNKGTI-------AAPKYKTHSKAEGEVHERTEIMCSLAEEELVNLQ-SETCTPLR 485
               G N+  +       A P  +++SK        +E +    E+E  +    E C   +
Sbjct: 958  TDLGDNRAMVKKFNNSQAPPVKRSNSKLRKSGKNESEFINDAKEQEKTSRNFEEGC--FQ 1015

Query: 486  TIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
              +  P K +A+ +VRWN N+GSE+WLC GGAAGIIRCQ+I
Sbjct: 1016 EYDVLPPKIVAMQRVRWNMNKGSERWLCFGGAAGIIRCQKI 1056


>XP_018438205.1 PREDICTED: general transcription factor 3C polypeptide 2 isoform X2
            [Raphanus sativus]
          Length = 687

 Score =  132 bits (331), Expect = 8e-31
 Identities = 83/231 (35%), Positives = 115/231 (49%), Gaps = 32/231 (13%)
 Frame = +3

Query: 12   DTPVTGKVF-SAPRPGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNIDS 188
            D P TG+ + +  + G      SS+ IWS+Q SR  G+ AYC ADGS  RF+L    ++ 
Sbjct: 445  DVPATGRPYPNTKQQGLSVYNCSSYPIWSIQVSRLTGMAAYCTADGSVSRFQLTAKAVEK 504

Query: 189  MCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLL----PVSQADMKQPKNKDTHKAGPNKGT 356
                R   FLCG L +  S+ +VH+P+PN  +    PVS+   KQ   K      PN+ T
Sbjct: 505  DSRNRTPHFLCGRLTMNDSNFIVHSPVPNVPINLKKPVSENGEKQRCLKSLLNESPNRHT 564

Query: 357  IAA----------------PKYKTHSKAEG-------EVHERTEIMCSLA----EEELVN 455
              A                 K  + SKA+        E   R  ++C       EEE   
Sbjct: 565  APAFGEDEDQGLESEPEGSNKKSSKSKAKKGKSSTVEEDENRGALVCVKEDGDEEEEGRR 624

Query: 456  LQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
             +    +    +E  P K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQEI
Sbjct: 625  KEGSNSSSGVKVEQLPPKMVAMHRVRWNVNKGSERWLCYGGAAGIVRCQEI 675


>XP_020086694.1 uncharacterized protein LOC109709049 [Ananas comosus] XP_020086695.1
            uncharacterized protein LOC109709049 [Ananas comosus]
            XP_020086696.1 uncharacterized protein LOC109709049
            [Ananas comosus] XP_020086697.1 uncharacterized protein
            LOC109709049 [Ananas comosus] XP_020086698.1
            uncharacterized protein LOC109709049 [Ananas comosus]
            XP_020086699.1 uncharacterized protein LOC109709049
            [Ananas comosus] XP_020086700.1 uncharacterized protein
            LOC109709049 [Ananas comosus] XP_020086701.1
            uncharacterized protein LOC109709049 [Ananas comosus]
            XP_020086702.1 uncharacterized protein LOC109709049
            [Ananas comosus] XP_020086704.1 uncharacterized protein
            LOC109709049 [Ananas comosus] XP_020086705.1
            uncharacterized protein LOC109709049 [Ananas comosus]
            XP_020086706.1 uncharacterized protein LOC109709049
            [Ananas comosus] XP_020086707.1 uncharacterized protein
            LOC109709049 [Ananas comosus] XP_020086708.1
            uncharacterized protein LOC109709049 [Ananas comosus]
          Length = 1073

 Score =  132 bits (332), Expect = 8e-31
 Identities = 84/221 (38%), Positives = 116/221 (52%), Gaps = 21/221 (9%)
 Frame = +3

Query: 9    SDTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID 185
            +D PVTG  F+  +  G      S+FAIWSVQ S+  GVVAYC ADGS  RF+L  N + 
Sbjct: 843  NDVPVTGSPFAGTKCQGLHGFTCSAFAIWSVQVSQTTGVVAYCSADGSVVRFQLRANYVK 902

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPN---HLLPVSQA---------DMKQPKNKDT 329
                 RV  FLCGSL   G  L +++ LPN     +PVS           D  +PK  ++
Sbjct: 903  DPNRNRVPYFLCGSLTEEGHVLNINSSLPNIPLRNVPVSLKKVPGHGYLFDAGRPKEANS 962

Query: 330  HKAGPNKGTI-------AAPKYKTHSKAEGEVHERTEIMCSLAEEELVNLQ-SETCTPLR 485
               G  +  +       A P  +++SK        +E +    E+E  +    E C+  +
Sbjct: 963  TDLGDTRAMVKKFNNSQAPPVKRSNSKLRKSGKNESEFINDAKEQEKTSRNLEEGCS--Q 1020

Query: 486  TIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
              +  P K +A+ +VRWN N+GSE+WLC GGAAGIIRCQ+I
Sbjct: 1021 EYDVLPPKIVAMQRVRWNMNKGSERWLCFGGAAGIIRCQKI 1061


>XP_018438195.1 PREDICTED: uncharacterized protein LOC108810587 isoform X1 [Raphanus
            sativus]
          Length = 811

 Score =  132 bits (331), Expect = 1e-30
 Identities = 83/231 (35%), Positives = 115/231 (49%), Gaps = 32/231 (13%)
 Frame = +3

Query: 12   DTPVTGKVF-SAPRPGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNIDS 188
            D P TG+ + +  + G      SS+ IWS+Q SR  G+ AYC ADGS  RF+L    ++ 
Sbjct: 569  DVPATGRPYPNTKQQGLSVYNCSSYPIWSIQVSRLTGMAAYCTADGSVSRFQLTAKAVEK 628

Query: 189  MCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLL----PVSQADMKQPKNKDTHKAGPNKGT 356
                R   FLCG L +  S+ +VH+P+PN  +    PVS+   KQ   K      PN+ T
Sbjct: 629  DSRNRTPHFLCGRLTMNDSNFIVHSPVPNVPINLKKPVSENGEKQRCLKSLLNESPNRHT 688

Query: 357  IAA----------------PKYKTHSKAEG-------EVHERTEIMCSLA----EEELVN 455
              A                 K  + SKA+        E   R  ++C       EEE   
Sbjct: 689  APAFGEDEDQGLESEPEGSNKKSSKSKAKKGKSSTVEEDENRGALVCVKEDGDEEEEGRR 748

Query: 456  LQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
             +    +    +E  P K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQEI
Sbjct: 749  KEGSNSSSGVKVEQLPPKMVAMHRVRWNVNKGSERWLCYGGAAGIVRCQEI 799


>EOX93902.1 DNA binding protein, putative isoform 2 [Theobroma cacao]
          Length = 846

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 594  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 653

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 654  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAK 713

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 714  NAKDNKAKVPTPDKQTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 773

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 774  ALAVRINEPANTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 831

Query: 603  EITWP 617
            EI  P
Sbjct: 832  EIIVP 836


>XP_017969461.1 PREDICTED: uncharacterized protein LOC18612763 isoform X3 [Theobroma
            cacao]
          Length = 865

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 613  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 672

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 673  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAK 732

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 733  NAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 792

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 793  ALAVRINEPTNTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 850

Query: 603  EITWP 617
            EI  P
Sbjct: 851  EIIVP 855


>EOX93901.1 DNA binding protein, putative isoform 1 [Theobroma cacao]
          Length = 868

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 616  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 675

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 676  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAK 735

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 736  NAKDNKAKVPTPDKQTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 795

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 796  ALAVRINEPANTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 853

Query: 603  EITWP 617
            EI  P
Sbjct: 854  EIIVP 858


>XP_017969459.1 PREDICTED: uncharacterized protein LOC18612763 isoform X2 [Theobroma
            cacao] XP_017969460.1 PREDICTED: uncharacterized protein
            LOC18612763 isoform X2 [Theobroma cacao]
          Length = 869

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 617  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 676

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 677  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAK 736

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 737  NAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 796

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 797  ALAVRINEPTNTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 854

Query: 603  EITWP 617
            EI  P
Sbjct: 855  EIIVP 859


>XP_017969458.1 PREDICTED: uncharacterized protein LOC18612763 isoform X1 [Theobroma
            cacao]
          Length = 877

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 625  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 684

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 685  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAK 744

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 745  NAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 804

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 805  ALAVRINEPTNTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 862

Query: 603  EITWP 617
            EI  P
Sbjct: 863  EIIVP 867


>XP_017969462.1 PREDICTED: uncharacterized protein LOC18612763 isoform X4 [Theobroma
            cacao]
          Length = 878

 Score =  132 bits (331), Expect = 1e-30
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 43/245 (17%)
 Frame = +3

Query: 12   DTPVTGKVFSAPRP-GTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID- 185
            D PVTGK F+  +  G      SSFAIW+VQ SR  G+VAYC ADG+  RF+L    +D 
Sbjct: 626  DVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDK 685

Query: 186  SMCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQ----------------ADMKQPK 317
                 R   F+CGSL    S ++V+TPLP+  L + +                 +  Q K
Sbjct: 686  DFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAK 745

Query: 318  NKDTHKA---GPNKGTIA----------------------APKYKTHSKAEGEVHERTEI 422
            N   +KA    P+K T+A                        K K  SK++       + 
Sbjct: 746  NAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQ 805

Query: 423  MCSLAEEELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQ 602
              ++   E  N Q E       IE FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQ
Sbjct: 806  ALAVRINEPTNTQKEEAG--NEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQ 863

Query: 603  EITWP 617
            EI  P
Sbjct: 864  EIIVP 868


>XP_010906991.1 PREDICTED: uncharacterized protein LOC105033760 isoform X3 [Elaeis
            guineensis]
          Length = 1346

 Score =  130 bits (327), Expect = 4e-30
 Identities = 81/233 (34%), Positives = 116/233 (49%), Gaps = 30/233 (12%)
 Frame = +3

Query: 12   DTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNIDS 188
            D PVTG+ F+  +  G  S   SSFAIWS Q SR  G+VAYC ADGS  RF+L +     
Sbjct: 1116 DVPVTGRPFAGTKYQGLHSFGCSSFAIWSAQVSRTLGLVAYCSADGSAVRFQLTEAVDKD 1175

Query: 189  MCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLLPVSQADMKQPKNKDT------------- 329
                    FLCGSLM  G  L +++PLP+  +PV      Q K+ D              
Sbjct: 1176 PKRNPKPHFLCGSLMEKGQVLEINSPLPD--VPVPNIPFAQKKSVDDCVDTAPTMQLHGC 1233

Query: 330  -----------HKAGPNKGTIAAPKYKTHSKAEGEVHE-----RTEIMCSLAEEELVNLQ 461
                       H    ++ T+     K+    + + H      +T+    + +  L   +
Sbjct: 1234 FSDVDQAKQTGHAVSGSEETMGNTTSKSRKNEKKKQHASAIAVQTKFHAEIEQGILQRKE 1293

Query: 462  SETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEITWPI 620
            +      +  E+ P K +A+H+VRWN NRGSE+WLC+GGAAGIIRCQ+++ P+
Sbjct: 1294 NRDEGSAQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLPM 1346


>XP_007145328.1 hypothetical protein PHAVU_007G229800g [Phaseolus vulgaris]
            ESW17322.1 hypothetical protein PHAVU_007G229800g
            [Phaseolus vulgaris]
          Length = 1104

 Score =  129 bits (323), Expect = 1e-29
 Identities = 84/249 (33%), Positives = 132/249 (53%), Gaps = 47/249 (18%)
 Frame = +3

Query: 9    SDTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID 185
            +D PVTG+++S  + PG     ++SFAIWSVQ SR  G++A+C ADG+  RF+L   +++
Sbjct: 845  NDLPVTGEIYSGKKQPGLHGSLYASFAIWSVQVSRITGMLAFCGADGTVFRFQLTTKSVE 904

Query: 186  S-MCLQRVRRFLCGSLMVMGSDLLVHTPLPN-------------------HLL----PVS 293
            +     R RRFLCGS+    S+L+++TP+ N                    LL    P  
Sbjct: 905  TDHARNRARRFLCGSVTEENSNLVINTPVSNAPFLCKKLPVKGRCAESFRDLLSKTNPYK 964

Query: 294  QADMKQPK-----------------NKDTHKAGPNKG--TIAAPKY-KTHSKAEGEVHER 413
             A  K P+                 N D  ++G  +   ++  PK  K ++ ++ +  E 
Sbjct: 965  NALNKVPETSSFDFDSQTLAIGADENVDLLESGSEEALYSMKQPKRTKLNNGSKKKPEEN 1024

Query: 414  TEIMCSLAEEELVNLQSET-CTPLRTI-ESFPSKRIALHKVRWNKNRGSEQWLCHGGAAG 587
             +++C   +  L+  +++   +    I E+FP K  ALHKVRWN N+GSE+WLC GGA G
Sbjct: 1025 LDVVCKDGDVPLITTEADNEKSDFGNIPETFPPKMAALHKVRWNMNKGSEKWLCFGGACG 1084

Query: 588  IIRCQEITW 614
            ++RCQEI +
Sbjct: 1085 LVRCQEIVY 1093


>XP_019709366.1 PREDICTED: uncharacterized protein LOC105033760 isoform X6 [Elaeis
            guineensis]
          Length = 1236

 Score =  128 bits (321), Expect = 2e-29
 Identities = 82/239 (34%), Positives = 120/239 (50%), Gaps = 36/239 (15%)
 Frame = +3

Query: 12   DTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFEL-----LK 173
            D PVTG+ F+  +  G  S   SSFAIWS Q SR  G+VAYC ADGS  RF++     L 
Sbjct: 1000 DVPVTGRPFAGTKYQGLHSFGCSSFAIWSAQVSRTLGLVAYCSADGSAVRFQVTELSDLT 1059

Query: 174  NNIDSMCLQRVR-RFLCGSLMVMGSDLLVHTPLPNHLLPVSQADMKQPKNKDT------- 329
              +D    +  +  FLCGSLM  G  L +++PLP+  +PV      Q K+ D        
Sbjct: 1060 EAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPD--VPVPNIPFAQKKSVDDCVDTAPT 1117

Query: 330  -----------------HKAGPNKGTIAAPKYKTHSKAEGEVHE-----RTEIMCSLAEE 443
                             H    ++ T+     K+    + + H      +T+    + + 
Sbjct: 1118 MQLHGCFSDVDQAKQTGHAVSGSEETMGNTTSKSRKNEKKKQHASAIAVQTKFHAEIEQG 1177

Query: 444  ELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEITWPI 620
             L   ++      +  E+ P K +A+H+VRWN NRGSE+WLC+GGAAGIIRCQ+++ P+
Sbjct: 1178 ILQRKENRDEGSAQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLPM 1236


>XP_019709363.1 PREDICTED: uncharacterized protein LOC105033760 isoform X5 [Elaeis
            guineensis]
          Length = 1237

 Score =  128 bits (321), Expect = 2e-29
 Identities = 82/239 (34%), Positives = 120/239 (50%), Gaps = 36/239 (15%)
 Frame = +3

Query: 12   DTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFEL-----LK 173
            D PVTG+ F+  +  G  S   SSFAIWS Q SR  G+VAYC ADGS  RF++     L 
Sbjct: 1001 DVPVTGRPFAGTKYQGLHSFGCSSFAIWSAQVSRTLGLVAYCSADGSAVRFQVTELSDLT 1060

Query: 174  NNIDSMCLQRVR-RFLCGSLMVMGSDLLVHTPLPNHLLPVSQADMKQPKNKDT------- 329
              +D    +  +  FLCGSLM  G  L +++PLP+  +PV      Q K+ D        
Sbjct: 1061 EAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPD--VPVPNIPFAQKKSVDDCVDTAPT 1118

Query: 330  -----------------HKAGPNKGTIAAPKYKTHSKAEGEVHE-----RTEIMCSLAEE 443
                             H    ++ T+     K+    + + H      +T+    + + 
Sbjct: 1119 MQLHGCFSDVDQAKQTGHAVSGSEETMGNTTSKSRKNEKKKQHASAIAVQTKFHAEIEQG 1178

Query: 444  ELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEITWPI 620
             L   ++      +  E+ P K +A+H+VRWN NRGSE+WLC+GGAAGIIRCQ+++ P+
Sbjct: 1179 ILQRKENRDEGSAQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLPM 1237


>XP_019709351.1 PREDICTED: uncharacterized protein LOC105033760 isoform X2 [Elaeis
            guineensis]
          Length = 1351

 Score =  128 bits (321), Expect = 2e-29
 Identities = 82/239 (34%), Positives = 120/239 (50%), Gaps = 36/239 (15%)
 Frame = +3

Query: 12   DTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFEL-----LK 173
            D PVTG+ F+  +  G  S   SSFAIWS Q SR  G+VAYC ADGS  RF++     L 
Sbjct: 1115 DVPVTGRPFAGTKYQGLHSFGCSSFAIWSAQVSRTLGLVAYCSADGSAVRFQVTELSDLT 1174

Query: 174  NNIDSMCLQRVR-RFLCGSLMVMGSDLLVHTPLPNHLLPVSQADMKQPKNKDT------- 329
              +D    +  +  FLCGSLM  G  L +++PLP+  +PV      Q K+ D        
Sbjct: 1175 EAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPD--VPVPNIPFAQKKSVDDCVDTAPT 1232

Query: 330  -----------------HKAGPNKGTIAAPKYKTHSKAEGEVHE-----RTEIMCSLAEE 443
                             H    ++ T+     K+    + + H      +T+    + + 
Sbjct: 1233 MQLHGCFSDVDQAKQTGHAVSGSEETMGNTTSKSRKNEKKKQHASAIAVQTKFHAEIEQG 1292

Query: 444  ELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEITWPI 620
             L   ++      +  E+ P K +A+H+VRWN NRGSE+WLC+GGAAGIIRCQ+++ P+
Sbjct: 1293 ILQRKENRDEGSAQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLPM 1351


>XP_010906976.1 PREDICTED: uncharacterized protein LOC105033760 isoform X1 [Elaeis
            guineensis] XP_010906985.1 PREDICTED: uncharacterized
            protein LOC105033760 isoform X1 [Elaeis guineensis]
            XP_019709348.1 PREDICTED: uncharacterized protein
            LOC105033760 isoform X1 [Elaeis guineensis]
          Length = 1352

 Score =  128 bits (321), Expect = 2e-29
 Identities = 82/239 (34%), Positives = 120/239 (50%), Gaps = 36/239 (15%)
 Frame = +3

Query: 12   DTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFEL-----LK 173
            D PVTG+ F+  +  G  S   SSFAIWS Q SR  G+VAYC ADGS  RF++     L 
Sbjct: 1116 DVPVTGRPFAGTKYQGLHSFGCSSFAIWSAQVSRTLGLVAYCSADGSAVRFQVTELSDLT 1175

Query: 174  NNIDSMCLQRVR-RFLCGSLMVMGSDLLVHTPLPNHLLPVSQADMKQPKNKDT------- 329
              +D    +  +  FLCGSLM  G  L +++PLP+  +PV      Q K+ D        
Sbjct: 1176 EAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPD--VPVPNIPFAQKKSVDDCVDTAPT 1233

Query: 330  -----------------HKAGPNKGTIAAPKYKTHSKAEGEVHE-----RTEIMCSLAEE 443
                             H    ++ T+     K+    + + H      +T+    + + 
Sbjct: 1234 MQLHGCFSDVDQAKQTGHAVSGSEETMGNTTSKSRKNEKKKQHASAIAVQTKFHAEIEQG 1293

Query: 444  ELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEITWPI 620
             L   ++      +  E+ P K +A+H+VRWN NRGSE+WLC+GGAAGIIRCQ+++ P+
Sbjct: 1294 ILQRKENRDEGSAQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLPM 1352


>XP_006304420.1 hypothetical protein CARUB_v10010997mg [Capsella rubella] EOA37318.1
            hypothetical protein CARUB_v10010997mg [Capsella rubella]
          Length = 822

 Score =  127 bits (320), Expect = 3e-29
 Identities = 83/238 (34%), Positives = 115/238 (48%), Gaps = 39/238 (16%)
 Frame = +3

Query: 12   DTPVTGKVF-SAPRPGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNIDS 188
            D P TG  + +  + G      S+F IWS+Q SR  G+ AYC ADGS   F+L    ++ 
Sbjct: 576  DVPATGNPYPNTKQQGLSVYNLSTFPIWSIQVSRLTGIAAYCTADGSIFHFQLTTKAVEK 635

Query: 189  MCLQRVRRFLCGSLMVMGSDLLVHTPLPNHLL----PVSQADMKQPKNKDTHKAGPNK-- 350
                R   FLCG L +  S  +VH+P+P+  +    PV +   KQ   +      PN+  
Sbjct: 636  DTRNRSPHFLCGKLTMKDSTFIVHSPVPDIPIVLKKPVGETGEKQRCLRSLLNESPNRYA 695

Query: 351  ------------------------GTI-AAPKYKTHSKAE--GEVHERTEIMCSLAE--- 440
                                    GT    PK+K        GEV E +  +  ++E   
Sbjct: 696  SNVSDVRPLAFAHEEDQDLEPEFGGTDNKGPKFKAKKGKNNIGEVDENSRALVCVSEDGD 755

Query: 441  --EELVNLQSETCTPLRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
              EE  N  S     ++T E FP K +A+H+VRWN N+GSE+WLC+GGAAGI+RCQEI
Sbjct: 756  EGEERRNKASNGSIGMKT-EGFPPKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEI 812


>GAU42322.1 hypothetical protein TSUD_25470 [Trifolium subterraneum]
          Length = 456

 Score =  124 bits (312), Expect = 6e-29
 Identities = 81/242 (33%), Positives = 118/242 (48%), Gaps = 40/242 (16%)
 Frame = +3

Query: 9   SDTPVTGKVFSAPR-PGTRSLAHSSFAIWSVQASRNNGVVAYCCADGSTKRFELLKNNID 185
           SD PVTG +++  + P      +SS+AIWSVQ SR  G+VAYC ADG+  R++L+   ++
Sbjct: 204 SDLPVTGTIYTGKKQPWLHGSTYSSYAIWSVQVSRITGMVAYCGADGAAIRYQLITKAVE 263

Query: 186 SMCLQRVRRF-LCGSLMVMGSDLLVHTPLPNHLLPVSQAD-------------------- 302
           +        F LCGS+    S ++V+TPL N L P+ +A                     
Sbjct: 264 NEHWHNRLPFALCGSVSEEESTIIVNTPLSNSLFPMKKAQERGRCAESFRDLLAKSRIVP 323

Query: 303 ---MKQPKNKDTHKAGPNKGTIAAPKYKTHSKAEGEVHERTEIMCSLAEEELVNLQSE-- 467
               K P N     A  +  T+        + +  E  +R ++ CS  +++  N      
Sbjct: 324 NQISKTPSNDCQILALNDGDTLGLESISEEALSSQEQPKRPKLSCSRKKKQFDNTVCSDV 383

Query: 468 --TCTP-----------LRTIESFPSKRIALHKVRWNKNRGSEQWLCHGGAAGIIRCQEI 608
             T TP           +   E FP K  ALHKVRWN N+GSE+WLC GGA G++RCQ+I
Sbjct: 384 VSTNTPGVDKEKPDSGSIHEPEVFPPKMAALHKVRWNMNKGSERWLCFGGANGLLRCQKI 443

Query: 609 TW 614
            +
Sbjct: 444 VY 445

