BLASTX nr result

ID: Akebia24_contig00003563 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00003563
         (1613 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260...   447   e-123
emb|CBI16185.3| unnamed protein product [Vitis vinifera]              446   e-122
ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302...   441   e-121
ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Popu...   408   e-111
ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma...   362   2e-97
ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma...   360   8e-97
ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802...   360   1e-96
ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819...   355   4e-95
ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma...   346   2e-92
ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819...   339   2e-90
ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana] ...   335   3e-89
gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]                       335   3e-89
ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251...   335   3e-89
dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]           334   6e-89
ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Caps...   333   1e-88
dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]        322   4e-85
ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arab...   320   2e-84
ref|XP_002534332.1| DNA binding protein, putative [Ricinus commu...   315   3e-83
ref|XP_006480745.1| PREDICTED: uncharacterized protein LOC102621...   313   2e-82
ref|XP_006341068.1| PREDICTED: uncharacterized protein LOC102588...   311   7e-82

>ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
          Length = 486

 Score =  447 bits (1149), Expect = e-123
 Identities = 241/450 (53%), Positives = 297/450 (66%), Gaps = 58/450 (12%)
 Frame = +1

Query: 436  SFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 615
            SFQ+KGFWM KG +G L+DGD   DN SRIEPKR+HQWF D  EP LFPNKKQAV +++S
Sbjct: 37   SFQNKGFWMPKG-AGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSS 95

Query: 616  RQISGVPNVN-LPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 792
            +  SG+ N +  PWEN S+F SV  QF DRLFG E +R ++F  RN   + T       R
Sbjct: 96   KSTSGISNAHGSPWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SR 153

Query: 793  GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQV-------------------- 912
             I+EQFGND+S+ LS+S+ +ED  +CL+YGGIRKVKVNQV                    
Sbjct: 154  DIDEQFGNDSSVGLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIH 213

Query: 913  -------------------------KDSDNAMSM-----------PMSHSFNKRDGTTIS 984
                                     K+ +N   M           PM H +NK D  TIS
Sbjct: 214  SNIPTVQDYDRGSDTSFMSIGAAYYKEDENDKLMGHTYNTGDHDIPMGHPYNKGDANTIS 273

Query: 985  FGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNAN-VLENATPLAIVTN 1161
            FG + +  +  P  R +++Y L   QSS+Q S++  E+EL   NAN  L +A    +   
Sbjct: 274  FGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESERELDASNANGTLSSAQLAKLRPE 331

Query: 1162 KTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCS 1341
               K+K E KMS K  PN+FPSNVR+L+STG+LDGVPVKY+S S EE  G+IKGSGYLC 
Sbjct: 332  SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREELHGIIKGSGYLCG 391

Query: 1342 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTV 1521
            CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LLF+AIQTV
Sbjct: 392  CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 451

Query: 1522 TGHPINQKAFRIWKESYQAATRELERIYGK 1611
            TG PINQK+FRIWKES+QAATREL+RIYGK
Sbjct: 452  TGSPINQKSFRIWKESFQAATRELKRIYGK 481


>emb|CBI16185.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  446 bits (1146), Expect = e-122
 Identities = 231/412 (56%), Positives = 290/412 (70%), Gaps = 32/412 (7%)
 Frame = +1

Query: 472  GSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVN-L 648
            G+G L+DGD   DN SRIEPKR+HQWF D  EP LFPNKKQAV +++S+  SG+ N +  
Sbjct: 4    GAGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSSKSTSGISNAHGS 63

Query: 649  PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASI 828
            PWEN S+F SV  QF DRLFG E +R ++F  RN   + T       R I+EQFGND+S+
Sbjct: 64   PWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SRDIDEQFGNDSSV 121

Query: 829  ALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMS------------------------ 936
             LS+S+ +ED  +CL+YGGIRKVKVNQV++SD++ +                        
Sbjct: 122  GLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIHSNIPTVQDYDRG 181

Query: 937  ------MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEK 1098
                  +PM H +NK D  TISFG + +  +  P  R +++Y L   QSS+Q S++  E+
Sbjct: 182  SDTNHDIPMGHPYNKGDANTISFGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESER 239

Query: 1099 ELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPV 1275
            EL   NAN  L +A    +      K+K E KMS K  PN+FPSNVR+L+STG+LDGVPV
Sbjct: 240  ELDASNANGTLSSAQLAKLRPESASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPV 299

Query: 1276 KYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTI 1455
            KY+S S EE  G+IKGSGYLC CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTI
Sbjct: 300  KYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTI 359

Query: 1456 YGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            Y IVQEL+STP++LLF+AIQTVTG PINQK+FRIWKES+QAATREL+RIYGK
Sbjct: 360  YQIVQELRSTPESLLFDAIQTVTGSPINQKSFRIWKESFQAATRELKRIYGK 411


>ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302631 [Fragaria vesca
            subsp. vesca]
          Length = 469

 Score =  441 bits (1135), Expect = e-121
 Identities = 238/465 (51%), Positives = 296/465 (63%), Gaps = 72/465 (15%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MSFQ+KGFWMAKG +G   DGD    N SRIEPKR+HQWF+D+ EP+LFPNKKQAV   N
Sbjct: 1    MSFQNKGFWMAKG-AGHDNDGDATFGNPSRIEPKRSHQWFVDSAEPQLFPNKKQAVHIPN 59

Query: 613  SRQISGVPNVNLPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 792
            S+    +PN N+ WENPS+FQSV  QF DRLFGS+ + + +F  RN   + + +  I  +
Sbjct: 60   SKLSVEMPNENVSWENPSSFQSVPHQFIDRLFGSDTASSTNFSDRNVSPVGSDDWSIRTK 119

Query: 793  GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAM------------- 933
            GI++QFG+DA + LS+SH +E+   CL Y GIRK+KVNQVKDSD  M             
Sbjct: 120  GIDDQFGSDAPVNLSISHAIENPEVCLGYAGIRKIKVNQVKDSDIDMHASREHGSSREYN 179

Query: 934  -SMP-----------------------------MSHSFN--------------KRDGTTI 981
             ++P                             M H++N              KR+   I
Sbjct: 180  INLPTSQAFDRTHETGFISAGQAYDKEHDNVTLMGHAYNKGAAHVRPLGASYGKREENVI 239

Query: 982  SFGD--------------FQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 1119
            S  D              F +  +MN  GR + NY+ L  QSS+Q SE+  EKEL   NA
Sbjct: 240  SMSDGYSKGNANMISFGGFPDEQDMNTMGRAVANYDQLYHQSSVQTSETAHEKELDTTNA 299

Query: 1120 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 1296
            N ++N   +A        KSK E K + K  PN+FPSNVRSL+STGILDGVPVKY+S + 
Sbjct: 300  NAVDNTASVAKSKPESASKSKPESKPTKKQAPNSFPSNVRSLISTGILDGVPVKYVSMAR 359

Query: 1297 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 1476
            EE RG+IKG+ YLC CQSCN++K LNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL
Sbjct: 360  EELRGIIKGASYLCGCQSCNFTKGLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 419

Query: 1477 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            +STP++LLF+ +QTV G PINQKAF  WKES+QAATREL+RIYGK
Sbjct: 420  RSTPESLLFDTMQTVFGAPINQKAFLSWKESFQAATRELQRIYGK 464


>ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Populus trichocarpa]
            gi|550348073|gb|EEE84695.2| hypothetical protein
            POPTR_0001s24280g [Populus trichocarpa]
          Length = 400

 Score =  408 bits (1049), Expect = e-111
 Identities = 219/424 (51%), Positives = 267/424 (62%), Gaps = 35/424 (8%)
 Frame = +1

Query: 445  SKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQI 624
            +KGFWM+KG      DGD   +N  R+E KR+HQWF+D TEPELFPNKKQAV+  NS   
Sbjct: 2    NKGFWMSKG-----TDGDPAFENPPRLESKRSHQWFIDDTEPELFPNKKQAVQTPNSTTT 56

Query: 625  SGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIE 801
            SG+P+ N P W N S FQSV  QF  RLFG+E +R+++F  RN     T           
Sbjct: 57   SGIPSANSPSWHNTSGFQSVPNQFIHRLFGAETARSVNFAERNLYPAGTVE--------- 107

Query: 802  EQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSF-------- 957
                ++AS A            CLNYGGIRKVK+NQVKD D+ +  P  H F        
Sbjct: 108  ----SNASEA------------CLNYGGIRKVKINQVKDFDSGVHAPKGHGFTIESDSNN 151

Query: 958  -------------------------NKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQ 1062
                                     N  D   +SFG F +  ++ P  R L++Y+    Q
Sbjct: 152  STGQAFQRESQSSFISTGHAFDKEDNSEDTNLLSFGGFDDAHDIIPVDRPLSSYDHSYDQ 211

Query: 1063 SSIQPSESLKEKELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRS 1239
            SS++  E++ EKEL    A  V  N       T    K++ E K + K  PN+FPSNVRS
Sbjct: 212  SSVRTREAVDEKELRTTTAKAVASNTQATKSRTEPVSKNRPELKTTRKEAPNSFPSNVRS 271

Query: 1240 LLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHP 1419
            L+STG+LDGVPVKY+S S EE RG+IKGSGYLC CQSCNYSKVLNAYEFERHAGCKTKHP
Sbjct: 272  LISTGMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHP 331

Query: 1420 NNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELER 1599
            NNHI+F+NGKTIY IVQEL+STP+++LF+ IQTV G PINQK+FRIWKES+QAATREL+R
Sbjct: 332  NNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSFRIWKESFQAATRELQR 391

Query: 1600 IYGK 1611
            IYGK
Sbjct: 392  IYGK 395


>ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508786875|gb|EOY34131.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 467

 Score =  362 bits (930), Expect = 2e-97
 Identities = 215/466 (46%), Positives = 271/466 (58%), Gaps = 73/466 (15%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MSFQ+K FWMAKG +  ++DGD   DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N
Sbjct: 1    MSFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPN 58

Query: 613  SRQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL------------- 705
            ++  SG+ N+N+ PWEN S+FQSV  Q               FT+R              
Sbjct: 59   NKSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKA 118

Query: 706  ----FGSE-------------PSRTIDFGGRNFQSINT-------------------GNL 777
                FG +             P    ++GG     +N                     N 
Sbjct: 119  IEDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNS 178

Query: 778  DIGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMS 936
            D+    IE     + S  +SM H+ +        +G   N G              + + 
Sbjct: 179  DMTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIP 236

Query: 937  MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPN 1116
            + M  ++ K D   +SFG F E  E+ P GR L+++E     SS   SE   EK+L    
Sbjct: 237  ISMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDAST 296

Query: 1117 ANVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWS 1293
            A V+ + T    +      ++K E K S K  PN+FPSNVRSL+STG+LDGVPVKYIS S
Sbjct: 297  AVVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLS 356

Query: 1294 HEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQE 1473
             EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQE
Sbjct: 357  REELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQE 416

Query: 1474 LKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            L+STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK
Sbjct: 417  LRSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGK 462


>ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590589665|ref|XP_007016515.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786876|gb|EOY34132.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786878|gb|EOY34134.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 489

 Score =  360 bits (925), Expect = 8e-97
 Identities = 214/465 (46%), Positives = 270/465 (58%), Gaps = 73/465 (15%)
 Frame = +1

Query: 436  SFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 615
            SFQ+K FWMAKG +  ++DGD   DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N+
Sbjct: 24   SFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNN 81

Query: 616  RQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL-------------- 705
            +  SG+ N+N+ PWEN S+FQSV  Q               FT+R               
Sbjct: 82   KSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAI 141

Query: 706  ---FGSE-------------PSRTIDFGGRNFQSINT-------------------GNLD 780
               FG +             P    ++GG     +N                     N D
Sbjct: 142  EDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSD 201

Query: 781  IGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSM 939
            +    IE     + S  +SM H+ +        +G   N G              + + +
Sbjct: 202  MTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPI 259

Query: 940  PMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 1119
             M  ++ K D   +SFG F E  E+ P GR L+++E     SS   SE   EK+L    A
Sbjct: 260  SMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTA 319

Query: 1120 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 1296
             V+ + T    +      ++K E K S K  PN+FPSNVRSL+STG+LDGVPVKYIS S 
Sbjct: 320  VVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSR 379

Query: 1297 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 1476
            EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL
Sbjct: 380  EELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 439

Query: 1477 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            +STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK
Sbjct: 440  RSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGK 484


>ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
          Length = 463

 Score =  360 bits (923), Expect = 1e-96
 Identities = 214/461 (46%), Positives = 269/461 (58%), Gaps = 68/461 (14%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS Q+KGFWM KG SG + D D   DN ++IEPKR HQWF+DA E + FPNKKQAVE ++
Sbjct: 1    MSLQNKGFWMVKG-SGHINDRDTVFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59

Query: 613  SRQISGVPNVNLP-WENPSNFQSVTGQ--------------FTDR--------------- 702
             +   G  NVN+P WEN  NF SV  Q              FT++               
Sbjct: 60   EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTYVLADDSNVRSKM 119

Query: 703  ---LFGSEPS-------------RTIDFGGRNFQSINT-----------------GNLDI 783
                +G E S               ++FGG     +N                   N D+
Sbjct: 120  VTNQYGDEASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEVDVQALEGHNFGRQSNGDL 179

Query: 784  GRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGG----IRKVKVNQVKDSDNAMSMPMSH 951
             +    E     ASI  +     +     L Y      +R    + VK  D+ +S+  S 
Sbjct: 180  HQAYNREVETRSASIGQAFDKDRDATLMGLTYSRGDAHVRSFGASFVKGDDSIVSI--SE 237

Query: 952  SFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLE 1131
            S+NK D   ISFG F +  ++   GR    Y+ L  QSS+  S +  EKEL   +++ + 
Sbjct: 238  SYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHVSTTAHEKELDVSSSDAVA 297

Query: 1132 NATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFR 1308
            +   +A V ++T  K+K E K + K  PN+FPSNVRSL+STGILDGVPVKY+S S EE R
Sbjct: 298  STLQVAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSREELR 357

Query: 1309 GVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTP 1488
            G+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP
Sbjct: 358  GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTP 417

Query: 1489 QNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            ++LLF+ IQTV G PINQKAFR WKES+QAATREL+RIYGK
Sbjct: 418  ESLLFDTIQTVFGAPINQKAFRNWKESFQAATRELQRIYGK 458


>ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819317 isoform X1 [Glycine
            max]
          Length = 464

 Score =  355 bits (910), Expect = 4e-95
 Identities = 215/469 (45%), Positives = 270/469 (57%), Gaps = 76/469 (16%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS Q+KGFWM KG SG + D +   DN ++IEPKR HQWF+DA E + FPNKKQAVE ++
Sbjct: 1    MSLQNKGFWMVKG-SGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59

Query: 613  SRQISGVPNVNLP-WENPSNFQSV------------------------------------ 681
             +   G  NVN+P WEN  NF SV                                    
Sbjct: 60   EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSK 119

Query: 682  --TGQF-TDRLFGSEPSRTID-------FGG-------------------RNFQSINTGN 774
              T Q+  D  FG   S +I+       FGG                    NF   N GN
Sbjct: 120  MITNQYGDDASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGN 179

Query: 775  L--------DIGRRGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDN 927
            L        +     I + F  D   +L  ++++  D         +R      VK  D+
Sbjct: 180  LHQAYNREVETRSASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDS 232

Query: 928  AMSMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELV 1107
             +S+  S S+NK D   ISFG F +  ++   GR    Y+ L  QSS+  S +  EKEL 
Sbjct: 233  IVSI--SESYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELD 290

Query: 1108 DPNANVLENATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYI 1284
              +++ + +   +A V ++T  K+K E K +    PN+FPSNVRSL+STGILDGVPVKYI
Sbjct: 291  VSSSDAVASTLQVAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYI 350

Query: 1285 SWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGI 1464
            S S EE RG+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY I
Sbjct: 351  SVSREELRGIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQI 410

Query: 1465 VQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            VQEL+STP++LLF+ IQTV G PI+QKAFR WKES+QAATREL+RIYGK
Sbjct: 411  VQELRSTPESLLFDTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGK 459


>ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508786879|gb|EOY34135.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 458

 Score =  346 bits (887), Expect = 2e-92
 Identities = 208/457 (45%), Positives = 263/457 (57%), Gaps = 73/457 (15%)
 Frame = +1

Query: 460  MAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPN 639
            MAKG +  ++DGD   DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N++  SG+ N
Sbjct: 1    MAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNNKSSSGISN 58

Query: 640  VNL-PWENPSNFQSVTGQ---------------FTDRL-----------------FGSE- 717
            +N+ PWEN S+FQSV  Q               FT+R                  FG + 
Sbjct: 59   LNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAIEDHFGEDA 118

Query: 718  ------------PSRTIDFGGRNFQSINT-------------------GNLDIGRRGIEE 804
                        P    ++GG     +N                     N D+    IE 
Sbjct: 119  SVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSDMTT--IEA 176

Query: 805  QFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNK 963
                + S  +SM H+ +        +G   N G              + + + M  ++ K
Sbjct: 177  YDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPISMGDTYGK 236

Query: 964  RDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATP 1143
             D   +SFG F E  E+ P GR L+++E     SS   SE   EK+L    A V+ + T 
Sbjct: 237  EDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTAVVVASTTR 296

Query: 1144 LA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIK 1320
               +      ++K E K S K  PN+FPSNVRSL+STG+LDGVPVKYIS S EE RGVIK
Sbjct: 297  TPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSREELRGVIK 356

Query: 1321 GSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLL 1500
            GSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LL
Sbjct: 357  GSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLL 416

Query: 1501 FEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            F+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK
Sbjct: 417  FDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGK 453


>ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819317 isoform X2 [Glycine
            max]
          Length = 455

 Score =  339 bits (869), Expect = 2e-90
 Identities = 206/456 (45%), Positives = 260/456 (57%), Gaps = 76/456 (16%)
 Frame = +1

Query: 472  GSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP 651
            GSG + D +   DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ +   G  NVN+P
Sbjct: 4    GSGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDADEKSSPGFSNVNIP 63

Query: 652  -WENPSNFQSV--------------------------------------TGQF-TDRLFG 711
             WEN  NF SV                                      T Q+  D  FG
Sbjct: 64   PWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSKMITNQYGDDASFG 123

Query: 712  SEPSRTID-------FGG-------------------RNFQSINTGNL--------DIGR 789
               S +I+       FGG                    NF   N GNL        +   
Sbjct: 124  LSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGNLHQAYNREVETRS 183

Query: 790  RGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 966
              I + F  D   +L  ++++  D         +R      VK  D+ +S+  S S+NK 
Sbjct: 184  ASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDSIVSI--SESYNKE 234

Query: 967  DGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPL 1146
            D   ISFG F +  ++   GR    Y+ L  QSS+  S +  EKEL   +++ + +   +
Sbjct: 235  DTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELDVSSSDAVASTLQV 294

Query: 1147 AIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKG 1323
            A V ++T  K+K E K +    PN+FPSNVRSL+STGILDGVPVKYIS S EE RG+IKG
Sbjct: 295  AKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSREELRGIIKG 354

Query: 1324 SGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLF 1503
            SGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LLF
Sbjct: 355  SGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLF 414

Query: 1504 EAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            + IQTV G PI+QKAFR WKES+QAATREL+RIYGK
Sbjct: 415  DTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGK 450


>ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
            gi|42573736|ref|NP_974964.1| uncharacterized protein
            [Arabidopsis thaliana] gi|332009855|gb|AED97238.1|
            uncharacterized protein AT5G59830 [Arabidopsis thaliana]
            gi|332009856|gb|AED97239.1| uncharacterized protein
            AT5G59830 [Arabidopsis thaliana]
          Length = 425

 Score =  335 bits (860), Expect = 3e-89
 Identities = 192/426 (45%), Positives = 261/426 (61%), Gaps = 33/426 (7%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS++SKGFW+ K      ++ D   D+S+R + KR H WF+D++  E+FPNKKQAV+  +
Sbjct: 1    MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57

Query: 613  SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 786
                 G  NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+     + +    
Sbjct: 58   PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115

Query: 787  RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 921
             + I E +  D S+ LS+S+ +E  G C    G RK+ V++VK++               
Sbjct: 116  NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175

Query: 922  ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 1059
                      +N  S        H +   D   I+FG+  +   +  +  ++ NY+  + 
Sbjct: 176  ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234

Query: 1060 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1233
            Q  I   + + ++E      ++ V+             PK+K E K S K    +FPSNV
Sbjct: 235  QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294

Query: 1234 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1413
            RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK
Sbjct: 295  RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354

Query: 1414 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1593
            HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL
Sbjct: 355  HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414

Query: 1594 ERIYGK 1611
            +RIYGK
Sbjct: 415  QRIYGK 420


>gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
          Length = 425

 Score =  335 bits (860), Expect = 3e-89
 Identities = 192/426 (45%), Positives = 261/426 (61%), Gaps = 33/426 (7%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS++SKGFW+ K      ++ D   D+S+R + KR H WF+D++  E+FPNKKQAV+  +
Sbjct: 1    MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57

Query: 613  SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 786
                 G  NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+     + +    
Sbjct: 58   PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115

Query: 787  RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 921
             + I E +  D S+ LS+S+ +E  G C    G RK+ V++VK++               
Sbjct: 116  NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175

Query: 922  ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 1059
                      +N  S        H +   D   I+FG+  +   +  +  ++ NY+  + 
Sbjct: 176  ESSSIQACSWENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSASNVVGNYQSYV- 234

Query: 1060 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1233
            Q  I   + + ++E      ++ V+             PK+K E K S K    +FPSNV
Sbjct: 235  QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294

Query: 1234 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1413
            RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK
Sbjct: 295  RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354

Query: 1414 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1593
            HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL
Sbjct: 355  HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414

Query: 1594 ERIYGK 1611
            +RIYGK
Sbjct: 415  QRIYGK 420


>ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
          Length = 599

 Score =  335 bits (859), Expect = 3e-89
 Identities = 179/324 (55%), Positives = 223/324 (68%), Gaps = 22/324 (6%)
 Frame = +1

Query: 706  FGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTME-------DLG 864
            +G E +  I  G    Q+ N G+ +I      +  G D +I  SM HT          +G
Sbjct: 277  YGREDNNFISMG----QAYNKGDENIAMSHTYK--GGDNTI--SMGHTFSKGDNNIISMG 328

Query: 865  SCLNYGGIRKVKVNQV--KDSDNAMSM-----------PMSHSFNKRDGTTISFGDFQEG 1005
               N G    + +  +  K  +N +SM            + HS+NK +   ISFG F + 
Sbjct: 329  QTYNKGDDNTISMGHIYNKGDENTISMGHTYKGDNSNLSIGHSYNKGESNIISFGGFHDD 388

Query: 1006 SE-MNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIVTNKT-PKSK 1179
             +  NPSGRL+ +Y+LLMGQ S+Q SE+L EK+LV+ NA+ L +   +    ++T  K K
Sbjct: 389  DDDTNPSGRLVCSYDLLMGQPSVQRSEALNEKKLVESNADALISTAQITASGSETVSKKK 448

Query: 1180 VEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNY 1359
             EQK+S KVPPNNFPSNVRSLLSTG+LDGVPVKYI+WS EE RG+IKGSGYLC CQSCN+
Sbjct: 449  EEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNF 508

Query: 1360 SKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPIN 1539
            SKV+NAYEFERHAGCKTKHPNNHI+F+NGKTIYGIVQELKSTPQN LF+ IQT+TG PIN
Sbjct: 509  SKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPIN 568

Query: 1540 QKAFRIWKESYQAATRELERIYGK 1611
            QK+FR+WKES+ AATREL+RIYGK
Sbjct: 569  QKSFRLWKESFLAATRELQRIYGK 592



 Score =  223 bits (568), Expect = 2e-55
 Identities = 109/186 (58%), Positives = 139/186 (74%), Gaps = 1/186 (0%)
 Frame = +1

Query: 430 KMSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEAS 609
           +MSFQ+KGFWMAKG  GC+ DG+M  DN SRIEPKR+HQWF+D TE ELFPNKKQAVE  
Sbjct: 61  RMSFQNKGFWMAKG-VGCVTDGEMAYDNPSRIEPKRSHQWFMDGTE-ELFPNKKQAVEVP 118

Query: 610 NSRQISGVPNVNL-PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIG 786
           NS    G+ N N+ PW N S F SV+G FT+RLF  E +RT++F  RN  S+  GN+++ 
Sbjct: 119 NSNLFPGLSNPNVSPWANASGFHSVSGHFTERLFDPEAARTVNFDDRNIPSVGAGNMNMA 178

Query: 787 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 966
           R+ IE+ FGN++   LSMSH++ED  S LNYGGIRKVKV+QVKDS+N MS+ M H++ + 
Sbjct: 179 RKVIEDPFGNESLFGLSMSHSLEDPRSGLNYGGIRKVKVSQVKDSENIMSVSMGHTYTRA 238

Query: 967 DGTTIS 984
           D  T+S
Sbjct: 239 DNNTMS 244


>dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
          Length = 425

 Score =  334 bits (857), Expect = 6e-89
 Identities = 191/426 (44%), Positives = 261/426 (61%), Gaps = 33/426 (7%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS++SKGFW+ K      ++ D   D+S+R + KR H WF+D++  E+FPNKKQAV+  +
Sbjct: 1    MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57

Query: 613  SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 786
                 G  NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+     + +    
Sbjct: 58   PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115

Query: 787  RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 921
             + I E +  D S+ LS+S+ +E  G C    G RK+ V++VK++               
Sbjct: 116  NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175

Query: 922  ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 1059
                      +N  S        H +   D   I+FG+  +   +  +  ++ NY+  + 
Sbjct: 176  ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234

Query: 1060 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1233
            Q  I   + + ++E      ++ V+             PK+K E K S K    +FPSNV
Sbjct: 235  QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294

Query: 1234 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1413
            RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK
Sbjct: 295  RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354

Query: 1414 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1593
            HPNNHI+F+NG+TIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL
Sbjct: 355  HPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414

Query: 1594 ERIYGK 1611
            +RIYGK
Sbjct: 415  QRIYGK 420


>ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Capsella rubella]
            gi|482549222|gb|EOA13416.1| hypothetical protein
            CARUB_v10026471mg [Capsella rubella]
          Length = 422

 Score =  333 bits (855), Expect = 1e-88
 Identities = 194/426 (45%), Positives = 259/426 (60%), Gaps = 33/426 (7%)
 Frame = +1

Query: 433  MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 612
            MS++SKGFW+ K      ++ D   D+S+R + KR H WF D++  ++FPNKKQAV+   
Sbjct: 1    MSYESKGFWVLKNNEHT-SEEDSVYDHSTRDDSKRPHPWFADSSRSDMFPNKKQAVQDPV 59

Query: 613  SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 786
                 G  ++ LP WE+ S FQSV+ QF DRL G+E PSR + FG R+      G     
Sbjct: 60   GGL--GKSSLGLPLWESSSVFQSVSNQFMDRLLGAEMPSRPLLFGDRDRTE---GCSHHQ 114

Query: 787  RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHS---- 954
             + I E F  + S+ LS+S+ +E  GSC    GIRK+ V++VK++ +  +    HS    
Sbjct: 115  NKSIAESFMENTSVELSISNGVEVAGSCFGGDGIRKLPVSRVKETMSTHAALDGHSQRKI 174

Query: 955  -------------------------FNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 1059
                                     +   D   I+FG+  +   +  S     NY+  + 
Sbjct: 175  ESSSIQACSRENESSFINFALAGHPYGNEDSHGITFGEINDEHGVGSSSN--GNYQSYV- 231

Query: 1060 QSSIQPSESL--KEKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1233
            Q  I+ S+ +  +E      ++ V+             PK+K E K S K    +FPSNV
Sbjct: 232  QDPIETSDMVYGQETGCSQTSSRVVSEQQMAKPSLETPPKNKAEAKTSKKEASTSFPSNV 291

Query: 1234 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1413
            RSL+STG+LDGVPVKYIS S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK
Sbjct: 292  RSLISTGMLDGVPVKYISLSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 351

Query: 1414 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1593
            HPNNHI+F+NGKTIY IVQEL++T +++LF+ IQTV G PINQKAFRIWKES+QAATREL
Sbjct: 352  HPNNHIYFENGKTIYQIVQELRNTQESMLFDVIQTVFGSPINQKAFRIWKESFQAATREL 411

Query: 1594 ERIYGK 1611
            +RIYGK
Sbjct: 412  QRIYGK 417


>dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
          Length = 415

 Score =  322 bits (824), Expect = 4e-85
 Identities = 184/408 (45%), Positives = 250/408 (61%), Gaps = 33/408 (8%)
 Frame = +1

Query: 487  ADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENP 663
            ++ D   D+S+R + KR H WF+D++  E+FPNKKQAV+  +     G  NV LP WE+ 
Sbjct: 8    SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--DPVVGLGKSNVGLPLWESS 65

Query: 664  SNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSM 840
            S FQSV+ QF DRL G+E P R + FG R+     + +     + I E +  D S+ LS+
Sbjct: 66   SVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ--NKSIAESYMEDTSVELSI 123

Query: 841  SHTMEDLGSCLNYGGIRKVKVNQVKDS-------------------------DNAMSMP- 942
            S+ +E  G C    G RK+ V++VK++                         +N  S   
Sbjct: 124  SNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKIESSSIQACSRENESSYIN 183

Query: 943  ---MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKEL--V 1107
                 H +   D   I+FG+  +   +  +  ++ NY+  + Q  I   + + ++E    
Sbjct: 184  FALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV-QDPIGTLDIVYDQETGSS 242

Query: 1108 DPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYIS 1287
              ++ V+             PK+K E K S K    +FPSNVRSL+STG+LDGVPVKY+S
Sbjct: 243  QTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVS 302

Query: 1288 WSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIV 1467
             S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTKHPNNHI+F+NGKTIY IV
Sbjct: 303  VSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIV 362

Query: 1468 QELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            QEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK
Sbjct: 363  QELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGK 410


>ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
            lyrata] gi|297310488|gb|EFH40912.1| hypothetical protein
            ARALYDRAFT_332253 [Arabidopsis lyrata subsp. lyrata]
          Length = 415

 Score =  320 bits (819), Expect = 2e-84
 Identities = 188/420 (44%), Positives = 249/420 (59%), Gaps = 45/420 (10%)
 Frame = +1

Query: 487  ADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENP 663
            ++ D   D S+R + KR H WF+D++  E+FPNKKQAV+        G  NV LP WE+ 
Sbjct: 8    SEDDSVYDQSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQDPVGGL--GKSNVGLPLWESS 65

Query: 664  SNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSM 840
            S FQSV+ QF DRL G+E P R + FG R+     + +     + I E +  D S+ LS+
Sbjct: 66   SVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQT--KSIAESYMEDTSVELSI 123

Query: 841  SHTMEDLGSCLNYGGIRKVKVNQVK---------DSDNAMSMPMS--------------- 948
            S+ +E  GS     GIRK+ V++VK         D  N   +  S               
Sbjct: 124  SNGVEVAGSSFGGDGIRKLPVSRVKETMSTHVALDGHNQRKIESSSIQACSRENESSFIN 183

Query: 949  -----HSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYE-----------LLMGQ---SSI 1071
                 H +   D   I+FG+  +   +  +  ++ NY+           ++ GQ   SS 
Sbjct: 184  FALAGHPYGNEDSHGITFGEINDEHGVGSTSNVVGNYQSYVQDPIGTLDIVYGQETGSSQ 243

Query: 1072 QPSESLKEKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLST 1251
              S  + E+++  P+   +             PK+K E K S K    +FPSNVRSL+ST
Sbjct: 244  TSSGVVSEQQVAKPSLEPV-------------PKNKAETKSSKKEASTSFPSNVRSLIST 290

Query: 1252 GILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHI 1431
            G+LDGVPV Y+S S EE RGVIKGSGYLC CQ+C ++KVLNAY FERHAGCKTKHPNNHI
Sbjct: 291  GMLDGVPVTYVSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKHPNNHI 350

Query: 1432 FFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            +F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK
Sbjct: 351  YFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGK 410


>ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
            gi|223525478|gb|EEF28050.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 417

 Score =  315 bits (808), Expect = 3e-83
 Identities = 154/278 (55%), Positives = 197/278 (70%), Gaps = 6/278 (2%)
 Frame = +1

Query: 796  IEEQFGNDASIALSMSHTME------DLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSF 957
            I   F ND +  +SM  + +       +G   N G      + Q+ +  +  +  +   +
Sbjct: 135  IHHTFSNDDNHLISMGQSYKPDENTISMGHLFNKGNDSTALMGQIYNKGDNNNFSIVQGY 194

Query: 958  NKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENA 1137
            NK + T ISFG + +  + NPSGRL++ Y++LM QSS+Q SE + E E++  N + L +A
Sbjct: 195  NKGESTIISFGGYDD-DDANPSGRLISTYDMLMAQSSLQSSEVINENEVITSNVDALLSA 253

Query: 1138 TPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVI 1317
            T       +  K K + K   KVP NNFPSNVRSLLSTG+LDGVPVKYI+WS EE RGVI
Sbjct: 254  THTTASGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKYIAWSREELRGVI 313

Query: 1318 KGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNL 1497
            KGSGYLC CQ+CN+SKV+NAYEFERHA CKTKHPNNHI+F+NGKT+YGIVQEL+S PQN+
Sbjct: 314  KGSGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNM 373

Query: 1498 LFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            LFE IQT+TG PINQK+FR+WKES+ AATREL+RIYGK
Sbjct: 374  LFEVIQTITGSPINQKSFRLWKESFLAATRELQRIYGK 411



 Score = 77.4 bits (189), Expect = 2e-11
 Identities = 36/63 (57%), Positives = 49/63 (77%)
 Frame = +1

Query: 781 IGRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFN 960
           +GR+  E+ FGND+S  LSMSHT+ED  S LNYGGIRKVKV+QVK+S+N M + + + +N
Sbjct: 1   MGRKVDEDPFGNDSSFGLSMSHTLEDPRSSLNYGGIRKVKVSQVKESENIMPVSLENDYN 60

Query: 961 KRD 969
           + D
Sbjct: 61  RVD 63


>ref|XP_006480745.1| PREDICTED: uncharacterized protein LOC102621289 isoform X2 [Citrus
            sinensis]
          Length = 543

 Score =  313 bits (801), Expect = 2e-82
 Identities = 170/300 (56%), Positives = 209/300 (69%), Gaps = 20/300 (6%)
 Frame = +1

Query: 772  NLDIGRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQV-------KDSDNA 930
            NL IG+   E    N+ S+ +S S   ED G         KV+ N +       K  D+ 
Sbjct: 243  NLSIGQTYKE----NNDSLPMSHSFGKEDNGMISMGQTYNKVEENAISAGHIYNKGDDST 298

Query: 931  MSM-----------PMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQP 1077
            +SM            +  S+NK + T ISFG + +  + NPSG LL+ Y +++GQSS+  
Sbjct: 299  ISMVNTYNKDTSSLSVGQSYNKGESTIISFGGYDD-DDANPSGTLLSTYGVMIGQSSVNT 357

Query: 1078 SESLKEKELVDPNANVLENATPLAI--VTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLST 1251
            SE+L +K  V  NA+ L ++  + I    NK+ K K + K S KV  NNFPSNVRSLLST
Sbjct: 358  SEALNQKAFVKSNADTLMSSQHVTISGAENKSRK-KEDLKPSKKVTSNNFPSNVRSLLST 416

Query: 1252 GILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHI 1431
            G+LDGVPVKYI+WS EE RGVIKGSGY CSCQSCNYSKV+NAYEFERHAGCKTKHPNNHI
Sbjct: 417  GMLDGVPVKYIAWSREELRGVIKGSGYQCSCQSCNYSKVINAYEFERHAGCKTKHPNNHI 476

Query: 1432 FFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            +F+NGKTIYGIVQEL+STPQN+LFE IQT+TG PINQK+FR+WKES+ AATREL+RIYGK
Sbjct: 477  YFENGKTIYGIVQELRSTPQNMLFEVIQTITGSPINQKSFRLWKESFLAATRELQRIYGK 536



 Score =  178 bits (452), Expect = 5e-42
 Identities = 92/189 (48%), Positives = 126/189 (66%), Gaps = 1/189 (0%)
 Frame = +1

Query: 421 KNVKMSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAV 600
           K   ++ Q++GFWMAKG   CL DG+M  DNSS+++PKR+HQWF++  E ELFPNKKQA+
Sbjct: 7   KQYGITLQNQGFWMAKGAE-CLNDGEMAYDNSSKLDPKRSHQWFMEGPEAELFPNKKQAI 65

Query: 601 EASNSRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNL 777
              +S   SG+ N N+P W + S+F S++G F +RLF S  +R ++   RN  S+    L
Sbjct: 66  GVPSSNLFSGLLNSNVPAWGHTSSFHSISGPFGERLFDSPTTRAVNLDDRNVTSVTAEKL 125

Query: 778 DIGRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSF 957
           D GR+ +   FGND+S  LSMS   ED    L+Y GIRKVKV+QVKDS+N M++ M H +
Sbjct: 126 DPGRKDL---FGNDSSFGLSMSQAQEDHRGGLSY-GIRKVKVSQVKDSENVMAVSMGHGY 181

Query: 958 NKRDGTTIS 984
           +  D  TIS
Sbjct: 182 DTVDNNTIS 190


>ref|XP_006341068.1| PREDICTED: uncharacterized protein LOC102588634 isoform X3 [Solanum
            tuberosum]
          Length = 557

 Score =  311 bits (796), Expect = 7e-82
 Identities = 162/293 (55%), Positives = 204/293 (69%), Gaps = 8/293 (2%)
 Frame = +1

Query: 757  SINTGNLDIGRRGIEEQFGNDASIALSMSHTMEDLGS-------CLNYGGIRKVKVNQVK 915
            S N  N  I    + +QF ND S   S+  T+  +         C +      + V+Q  
Sbjct: 263  SFNDNNTAIS---MGQQFSNDDSNITSVGQTINKMADTNPPMSHCYSKVDDNAISVSQTY 319

Query: 916  DSDNAMSMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKE 1095
                  ++ MS SF   +   ISFG F +  ++N SGRL+ +Y+LLM QSS Q S+ +  
Sbjct: 320  SKVENNNLSMSQSFGNGESNIISFGGFNDDDDINSSGRLICSYDLLMSQSSGQQSDIVTG 379

Query: 1096 KELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNN-FPSNVRSLLSTGILDGVP 1272
            K LV+ NA+ + +A  +A   NK   SK E++ + K PP+N FPSNVRSLLSTG+LDGVP
Sbjct: 380  KRLVESNADTVTSAAQMA--GNKEFISKKEEQKATKKPPSNSFPSNVRSLLSTGMLDGVP 437

Query: 1273 VKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKT 1452
            VKYI+WS EE RG+IKGSGYLC CQSCN+SK +NAYEFERHAGCKTKHPNNHI+F+NGKT
Sbjct: 438  VKYIAWSREELRGIIKGSGYLCGCQSCNFSKAINAYEFERHAGCKTKHPNNHIYFENGKT 497

Query: 1453 IYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 1611
            IYGIVQEL++TPQ+LLFE IQT+TG  INQK+FRIWKES+ AATREL+RIYGK
Sbjct: 498  IYGIVQELRNTPQDLLFEVIQTITGSSINQKSFRIWKESFLAATRELQRIYGK 550



 Score =  160 bits (406), Expect = 1e-36
 Identities = 82/187 (43%), Positives = 116/187 (62%), Gaps = 16/187 (8%)
 Frame = +1

Query: 499  MGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNL-PWENPSNFQ 675
            M  DNSS +EPKR+HQWF+D  EPEL PNKKQA+E  N    SG+ + N+ PW N   F 
Sbjct: 1    MAYDNSSTLEPKRSHQWFMDGIEPELLPNKKQAIEVPNHSSFSGLLSSNIAPWMNTPGFH 60

Query: 676  SVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTME 855
            SV GQ+ +R F ++ +R++ F   +  S+  GN+++ R+ +E+ FG+D+S  LS+SHT+E
Sbjct: 61   SVPGQYAERQFDNDSARSLSFDDNSVPSVGIGNMNMSRKVMEDPFGSDSSFGLSISHTLE 120

Query: 856  DLGSCLNYGGIRKVKVNQVKDSDNAMS--------------MPMSHSFNKRDGTTISFG- 990
            D    LNY GIRKVKV+QVK+++N M               MP  H+F+K +   I+ G 
Sbjct: 121  DHRLGLNYSGIRKVKVSQVKEAENFMPVSMGDIYTRGISNVMPTDHAFSKAEDNCIAMGL 180

Query: 991  DFQEGSE 1011
             F  G E
Sbjct: 181  SFNGGDE 187

