BLASTX nr result

ID: Akebia26_contig00027767 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00027767
         (1256 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258...   183   1e-43
emb|CBI18050.3| unnamed protein product [Vitis vinifera]              183   1e-43
ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citr...   163   1e-37
gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]     160   1e-36
emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]   158   5e-36
ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253...   148   4e-33
emb|CBI15828.3| unnamed protein product [Vitis vinifera]              148   4e-33
ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304...   144   8e-32
ref|XP_007033558.1| NT domain of poly(A) polymerase and terminal...   142   3e-31
ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cuc...   141   5e-31
ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207...   141   5e-31
ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago ...   128   6e-27
ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589...   127   1e-26
ref|XP_004490712.1| PREDICTED: uncharacterized protein LOC101490...   125   5e-26
ref|XP_006371669.1| hypothetical protein POPTR_0019s14930g [Popu...   122   3e-25
ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256...   120   1e-24
ref|XP_002518281.1| nucleic acid binding protein, putative [Rici...   119   2e-24
ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [...   115   3e-23
dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]           115   3e-23
ref|XP_007017068.1| NT domain of poly(A) polymerase and terminal...   111   7e-22

>ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
          Length = 884

 Score =  183 bits (465), Expect = 1e-43
 Identities = 125/329 (37%), Positives = 174/329 (52%), Gaps = 8/329 (2%)
 Frame = +3

Query: 42   LKNHERLATVGTTD---GSSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDG 212
            + NHE L +  + D   G S   C  E L   + +       G+P   NSL DLSGDYD 
Sbjct: 573  VNNHELLNSFVSNDVPPGLSPTACSSEYLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDS 632

Query: 213  YFYDLQFARWFHDFALPGPIPSSP----SQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVN 380
            +F  LQ+  W +D+    P  S P    SQF +   ++    QS  ++ RNIF P++  N
Sbjct: 633  HFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ-QSAHIR-RNIF-PQITAN 689

Query: 381  GVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP 560
            G++P  P FYP NP +++G G  +G EE+P+PRGTGTY P  +H   +   P+  RGRN 
Sbjct: 690  GIIPR-PPFYPLNPPMISGTG--FGVEEMPKPRGTGTYFPNTSH---HLCNPLTSRGRNQ 743

Query: 561  SPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPV 740
            +PV   R       +G  VTP E N  E+++ E S AQ PV    GK G L+  P G PV
Sbjct: 744  APVRSPR------HSGRAVTPHETNFLERSSRELSHAQFPVHQGNGKSGSLDSHPSGSPV 797

Query: 741  AGGFPHANSFVLPPEK-PEFGSRRHLPGTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRP 917
               + +AN  +LP EK  EFG +     +PL E +R     + + Q S+        QRP
Sbjct: 798  GRTYSNANGSLLPSEKVVEFGDQ--ASESPLPENIREPNHGSFLPQNSSLSLSPGGAQRP 855

Query: 918  RPVAGTNPEGIAKQSYQLKDESDFPPLSI 1004
            + +   N + +A Q+Y LKDE DFPPLS+
Sbjct: 856  KSMLSMNDDRVAVQAYHLKDEDDFPPLSV 884


>emb|CBI18050.3| unnamed protein product [Vitis vinifera]
          Length = 824

 Score =  183 bits (465), Expect = 1e-43
 Identities = 125/329 (37%), Positives = 174/329 (52%), Gaps = 8/329 (2%)
 Frame = +3

Query: 42   LKNHERLATVGTTD---GSSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDG 212
            + NHE L +  + D   G S   C  E L   + +       G+P   NSL DLSGDYD 
Sbjct: 513  VNNHELLNSFVSNDVPPGLSPTACSSEYLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDS 572

Query: 213  YFYDLQFARWFHDFALPGPIPSSP----SQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVN 380
            +F  LQ+  W +D+    P  S P    SQF +   ++    QS  ++ RNIF P++  N
Sbjct: 573  HFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ-QSAHIR-RNIF-PQITAN 629

Query: 381  GVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP 560
            G++P  P FYP NP +++G G  +G EE+P+PRGTGTY P  +H   +   P+  RGRN 
Sbjct: 630  GIIPR-PPFYPLNPPMISGTG--FGVEEMPKPRGTGTYFPNTSH---HLCNPLTSRGRNQ 683

Query: 561  SPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPV 740
            +PV   R       +G  VTP E N  E+++ E S AQ PV    GK G L+  P G PV
Sbjct: 684  APVRSPR------HSGRAVTPHETNFLERSSRELSHAQFPVHQGNGKSGSLDSHPSGSPV 737

Query: 741  AGGFPHANSFVLPPEK-PEFGSRRHLPGTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRP 917
               + +AN  +LP EK  EFG +     +PL E +R     + + Q S+        QRP
Sbjct: 738  GRTYSNANGSLLPSEKVVEFGDQ--ASESPLPENIREPNHGSFLPQNSSLSLSPGGAQRP 795

Query: 918  RPVAGTNPEGIAKQSYQLKDESDFPPLSI 1004
            + +   N + +A Q+Y LKDE DFPPLS+
Sbjct: 796  KSMLSMNDDRVAVQAYHLKDEDDFPPLSV 824


>ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citrus clementina]
            gi|568855155|ref|XP_006481174.1| PREDICTED:
            uncharacterized protein LOC102622468 [Citrus sinensis]
            gi|557531615|gb|ESR42798.1| hypothetical protein
            CICLE_v10011044mg [Citrus clementina]
          Length = 882

 Score =  163 bits (413), Expect = 1e-37
 Identities = 120/289 (41%), Positives = 152/289 (52%), Gaps = 6/289 (2%)
 Frame = +3

Query: 153  TVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPGPI-PSSP---SQFWNKPGFNNF 320
            TVGSP   NSL DLSGDY+ +   L   RW+++ AL     P SP   SQF +K   N++
Sbjct: 610  TVGSPRAANSLSDLSGDYESHLISLNHVRWWYEHALNSSYSPMSPQLLSQFQSK---NSW 666

Query: 321  HHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIP 500
                  +  R    P+MN NG V P P FYP  P ++   G+S+G EE+P+ RGTGTY P
Sbjct: 667  DLMQRSLPFRRNIIPQMNANGAV-PRPLFYPMTPPMLP--GASFGMEEMPKHRGTGTYFP 723

Query: 501  IMNHRTPYRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPP 680
              NH   YRDRP+  RGRN +PV   R       NG V+TPPE N+ E ++ EPS A   
Sbjct: 724  NTNH---YRDRPLNLRGRNQAPVRSPR------SNGRVMTPPETNILEGSSREPSPAHIH 774

Query: 681  VLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEK-PEFGSRRHL-PGTPLSEGVRLLV 854
            V  V  K G  E      P     P+AN  V P ++  EFGS  HL  G P  +  R   
Sbjct: 775  VHQVGVKAGLSEPCHSSSPEKKTQPNANGLVHPVDRVVEFGSVGHLYYGPPSLDSNRQPN 834

Query: 855  PTTPVTQGSTSCPPTPALQRPRPVAGTNPEGIAKQSYQLKDESDFPPLS 1001
              + + Q S+    +P   R RP  GT+ +    Q Y LKDE DFPPLS
Sbjct: 835  TCSTIGQDSSVGLSSPRTPRSRPGLGTDQDRTDVQ-YHLKDE-DFPPLS 881


>gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]
          Length = 928

 Score =  160 bits (404), Expect = 1e-36
 Identities = 119/331 (35%), Positives = 171/331 (51%), Gaps = 10/331 (3%)
 Frame = +3

Query: 42   LKNHERLATVGTTDGSS---SIDCPLEDLCLSHREGDLVSTV-GSPGPLNSLLDLSGDYD 209
            +++H+  + VG+    S   SI    ED   S+    + + + G+P P  +  DLSGDY+
Sbjct: 614  VRDHKASSPVGSKQHLSRLSSIALSSEDFYPSYSRYRMSAVLSGAPDPFQTSSDLSGDYE 673

Query: 210  GYFYDLQFARWFHDFALPGPIPSSP---SQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVN 380
             +   L + RW + +AL   +PS P   SQF +K  +     +S+Q++Q ++FS    +N
Sbjct: 674  SHLSSLHYGRWCYKYALAASVPSIPPIISQFQSKKSWEVIR-RSVQLKQ-SVFS---QIN 728

Query: 381  GVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP 560
              V P P FY  NP ++ GG   +  EE+P+PRGTGTY P MNH   YRDRP+  RG+N 
Sbjct: 729  NGVVPQPTFYSMNPPLLPGG-IGFAVEEMPKPRGTGTYFPNMNH---YRDRPMTPRGKNQ 784

Query: 561  SPVVHGRGHFWRPQNGWVVTPPEANLF-EKANHEPSQAQPPVLAVRGKPGPLEFFPGGRP 737
            +PV   R       NG +VT    N F E++ H+ +QAQ       GK G  +  P   P
Sbjct: 785  APVRSPRN------NGRLVTLATENGFPERSGHDNAQAQIFAHKGYGKSGSSDD-PSDSP 837

Query: 738  VAGGFPHANSFVLPPEK-PEFGSRRHLPG-TPLSEGVRLLVPTTPVTQGSTSCPPTPALQ 911
                  + N  +  PE   EFGS  H+P   PL  G         + Q S S   +P  +
Sbjct: 838  RRKVNSNGNGAMHQPEPLVEFGSIAHMPSEAPLLRGSWQTNTGLALIQNSGSSLASPGTE 897

Query: 912  RPRPVAGTNPEGIAKQSYQLKDESDFPPLSI 1004
            + +PV   + + IA QSY LKDE DFPPLS+
Sbjct: 898  KLKPVLSMDKDRIAVQSYALKDEDDFPPLSV 928


>emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
          Length = 1147

 Score =  158 bits (399), Expect = 5e-36
 Identities = 114/316 (36%), Positives = 160/316 (50%), Gaps = 8/316 (2%)
 Frame = +3

Query: 42   LKNHERLATVGTTD---GSSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDG 212
            + NHE L +  + D   G S   C  E L   + +       G+P   NSL DLSGDYD 
Sbjct: 804  VNNHELLNSFVSNDVPPGLSPTACSSEYLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDS 863

Query: 213  YFYDLQFARWFHDFALPGPIPSSP----SQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVN 380
            +F  LQ+  W +D+    P  S P    SQF +   ++    QS  ++ RNIF P++  N
Sbjct: 864  HFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ-QSAHIR-RNIF-PQITAN 920

Query: 381  GVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP 560
            G++P  P FYP NP +++G G  +G EE+P+PRGTGTY P  +H   +   P+  RGRN 
Sbjct: 921  GIIPR-PPFYPMNPPMISGTG--FGVEEMPKPRGTGTYFPNTSH---HLCNPLTSRGRNQ 974

Query: 561  SPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPV 740
            +PV   R       +G  VTP E N  E+++ E S AQ PV    GK G L+  P G PV
Sbjct: 975  APVRSPR------HSGRAVTPHETNFLERSSRELSHAQFPVHQGNGKSGSLDSHPSGSPV 1028

Query: 741  AGGFPHANSFVLPPEK-PEFGSRRHLPGTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRP 917
               + +AN  +LP EK  EFG R     +PL E +R     + + Q S+        QRP
Sbjct: 1029 GRTYSNANGSLLPSEKVVEFGDR--ASESPLPENIREPNHGSFLPQNSSLSLSPGGAQRP 1086

Query: 918  RPVAGTNPEGIAKQSY 965
            + +   N +    + Y
Sbjct: 1087 KSMLSMNDDRFGLRVY 1102


>ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  148 bits (374), Expect = 4e-33
 Identities = 118/317 (37%), Positives = 158/317 (49%), Gaps = 13/317 (4%)
 Frame = +3

Query: 111  EDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPGPI----PS 278
            E+  L+ R  D     GS G L +LLDLSGDYD +   LQ+ +  +  ALP P+    P 
Sbjct: 571  ENTALAFRGRDFACNAGSLGSLETLLDLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPL 630

Query: 279  SPSQF-----WNKPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGG 443
            SPSQ      W+K        Q +Q  Q N+ S +M+ NGV+  G  F   +P   A   
Sbjct: 631  SPSQLQINTPWDKV------RQHLQFTQ-NLHS-QMDSNGVI-LGNHFPVKHP---ARSI 678

Query: 444  SSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTP 623
            +++G E+  +PRGTGTY P M+H  P RDRPV G+ RN +   H + H  + +NG V   
Sbjct: 679  TAFGLEDKQKPRGTGTYFPNMSH-LPNRDRPV-GQRRNQALESHSQLHRRKHRNGLVAAQ 736

Query: 624  PEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEKPEFGS 803
             E NL E+ +HE SQ Q PVL                   G   HAN   LPP++ EFGS
Sbjct: 737  QEMNLIEETSHELSQLQYPVLG-----------------HGKSIHANGSSLPPKRLEFGS 779

Query: 804  RRHL-PGTPLSEGVRLLVPTTPVT---QGSTSCPPTPALQRPRPVAGTNPEGIAKQSYQL 971
               +  G P  +  R   P +  T    G+T+ P    +Q P+PV G   +     SY L
Sbjct: 780  FGTMSSGLPTPD--RCTKPDSSGTLPAWGATASPVGSRMQSPKPVLGNEEKRFEGLSYHL 837

Query: 972  KDESDFPPLSI*MCFKG 1022
            K+E DFPPLS+ M   G
Sbjct: 838  KNEDDFPPLSLKMQVDG 854


>emb|CBI15828.3| unnamed protein product [Vitis vinifera]
          Length = 929

 Score =  148 bits (374), Expect = 4e-33
 Identities = 118/317 (37%), Positives = 158/317 (49%), Gaps = 13/317 (4%)
 Frame = +3

Query: 111  EDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPGPI----PS 278
            E+  L+ R  D     GS G L +LLDLSGDYD +   LQ+ +  +  ALP P+    P 
Sbjct: 646  ENTALAFRGRDFACNAGSLGSLETLLDLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPL 705

Query: 279  SPSQF-----WNKPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGG 443
            SPSQ      W+K        Q +Q  Q N+ S +M+ NGV+  G  F   +P   A   
Sbjct: 706  SPSQLQINTPWDKV------RQHLQFTQ-NLHS-QMDSNGVI-LGNHFPVKHP---ARSI 753

Query: 444  SSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTP 623
            +++G E+  +PRGTGTY P M+H  P RDRPV G+ RN +   H + H  + +NG V   
Sbjct: 754  TAFGLEDKQKPRGTGTYFPNMSH-LPNRDRPV-GQRRNQALESHSQLHRRKHRNGLVAAQ 811

Query: 624  PEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEKPEFGS 803
             E NL E+ +HE SQ Q PVL                   G   HAN   LPP++ EFGS
Sbjct: 812  QEMNLIEETSHELSQLQYPVLG-----------------HGKSIHANGSSLPPKRLEFGS 854

Query: 804  RRHL-PGTPLSEGVRLLVPTTPVT---QGSTSCPPTPALQRPRPVAGTNPEGIAKQSYQL 971
               +  G P  +  R   P +  T    G+T+ P    +Q P+PV G   +     SY L
Sbjct: 855  FGTMSSGLPTPD--RCTKPDSSGTLPAWGATASPVGSRMQSPKPVLGNEEKRFEGLSYHL 912

Query: 972  KDESDFPPLSI*MCFKG 1022
            K+E DFPPLS+ M   G
Sbjct: 913  KNEDDFPPLSLKMQVDG 929


>ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304393 [Fragaria vesca
            subsp. vesca]
          Length = 878

 Score =  144 bits (363), Expect = 8e-32
 Identities = 108/293 (36%), Positives = 151/293 (51%), Gaps = 8/293 (2%)
 Frame = +3

Query: 147  VSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFAL-----PGPIPSSPSQFWNKPGF 311
            +S  G+P   N L DLSGDYD +   L++ R  +++ L     P P PS PSQ+     +
Sbjct: 608  ISITGNPETSNPLSDLSGDYDSHLNSLRYGRSCYEYELIAVHNPMP-PSMPSQYQRSKSW 666

Query: 312  NNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGT 491
            +    QS+Q++Q N F P M+ NGVVP    ++ + P  M   G+ +G EE+ +PRGTGT
Sbjct: 667  D-VSRQSVQLRQ-NAFLP-MSPNGVVPRQAFYHMNQP--MLPNGAGFGMEEMQKPRGTGT 721

Query: 492  YIPIMNHRTPYRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTP-PEANLFEKANHEPSQ 668
            Y P  NH   YRDRP+  RGRN +PV   R       NG+ + P PE N  ++ +H+ SQ
Sbjct: 722  YFPNTNH---YRDRPMTTRGRNQAPVRSPR------NNGYAMIPSPENNFPDRNSHDLSQ 772

Query: 669  AQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEK-PEFGSRRHLPGTPLSEGVR 845
            AQ P+    GK G     P   P    +P+AN  + P ++  EFG   H+P      G  
Sbjct: 773  AQMPLQKGGGKFG-FPDSPTSSPRTKAYPNANGSIHPYDRVTEFGPVEHVPLEAPPSG-- 829

Query: 846  LLVPTTPVTQGSTSCPPTPALQ-RPRPVAGTNPEGIAKQSYQLKDESDFPPLS 1001
                      GS+S   +   Q        T+ + I+ +SY LKDE DFPPLS
Sbjct: 830  -----RQTNSGSSSSQNSSVGQASTNSELSTDQDRISVKSYHLKDEEDFPPLS 877


>ref|XP_007033558.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative [Theobroma
            cacao] gi|508712587|gb|EOY04484.1| NT domain of poly(A)
            polymerase and terminal uridylyl transferase-containing
            protein, putative [Theobroma cacao]
          Length = 890

 Score =  142 bits (358), Expect = 3e-31
 Identities = 112/293 (38%), Positives = 148/293 (50%), Gaps = 8/293 (2%)
 Frame = +3

Query: 150  STVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPG---PIPSSPSQFWNKPGFNNF 320
            S  G    L+S LDL GD+D +   L + RW  D+A      PI    SQ  +   ++  
Sbjct: 616  SVAGGQEALSSFLDLCGDHDSHLRSLSYGRWCFDYAFNASVSPITPLVSQLQSNNSWDVV 675

Query: 321  HHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIP 500
              QS+Q + RN  SP MN NGVVP    +YP NP ++   G  +G EE+P+PRGTGTY P
Sbjct: 676  R-QSVQFR-RNAISP-MNANGVVPR-QVYYPMNPPMLPAAG--FGMEEMPKPRGTGTYFP 729

Query: 501  IMNHRTP-YRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQP 677
              NH T  YRDR +  RGR+   V   R       N   +T PE N  E+++ E +Q Q 
Sbjct: 730  --NHNTNHYRDRSLTARGRSQVQVRSPRN------NSRAITSPETNSPERSSRELAQVQS 781

Query: 678  PVLAVRGKPGP--LEFFPGGRPVAGGFPHANSFVLPPEKP-EFGSRRHLPGTPLS-EGVR 845
            P     GK G   L  F   + +   +P+AN  V  PE+  EFGS   LP  P S E   
Sbjct: 782  PHQG-GGKSGSSDLRHFGSEKVL---YPNANGSVHHPERVVEFGSIGPLPLGPASPESNM 837

Query: 846  LLVPTTPVTQGSTSCPPTPALQRPRPVAGTNPEGIAKQSYQLKDESDFPPLSI 1004
               P +P     ++  P   +QR +   G   + IA +SY LK+E DFPPLSI
Sbjct: 838  QHNPGSPHALNLSASQPPSGMQRSKSTVGVEQDRIAIRSYHLKNEEDFPPLSI 890


>ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
          Length = 816

 Score =  141 bits (356), Expect = 5e-31
 Identities = 120/337 (35%), Positives = 167/337 (49%), Gaps = 17/337 (5%)
 Frame = +3

Query: 42   LKNHERLATVGTTDGSS----SIDCPLEDLCLSHREGD-LVSTVGSPGPLNSLLDLSGDY 206
            + N + +A    T  SS    S+    ED   S R    L S VG P   N+L DL+GDY
Sbjct: 500  VNNDDEVANQSETKQSSPPLHSVSLSSEDFYPSSRGYRFLTSNVGPPEAFNALSDLNGDY 559

Query: 207  DGYFYDLQFARWFHDFALP----GPIPSS-PSQFWNKPGFNNFHHQSMQMQQRNIFSPRM 371
            + +   LQ  RW++++AL      PIP   PSQ+ NK  + +   +S+Q++Q N F+ ++
Sbjct: 560  ESHCNSLQIGRWYYEYALSAAALSPIPPPLPSQYPNKNPW-DIIRRSVQVKQ-NAFA-QI 616

Query: 372  NVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRG 551
            N NG++   P FYP  P  +  GG++   EE+P+PRGTGTY P MNH   YRDRP   RG
Sbjct: 617  NSNGLL-ARPAFYPM-PSPILPGGATLAMEEMPKPRGTGTYFPNMNH---YRDRPASARG 671

Query: 552  RNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGG 731
            RN   V   R       NG  +TP E  + EK+  +  Q   P +   G  G L      
Sbjct: 672  RNQVSVRSPR------NNGRSLTPLETTVAEKSGQDLYQV--PTVNHGGGIGMLS--SSS 721

Query: 732  RPVAGGFPHANSFVLPPEKP-EFGSRRHLPGTPLSEGVRLLVPTTPVT----QGSTSCPP 896
             PV     + N  +  P++  EFGS  HL   P+   V      TP T      S     
Sbjct: 722  SPVRKAHHNGNGAMPRPDRAVEFGSFGHL---PIESSVDCSGEPTPATAHFQNSSALNVS 778

Query: 897  TPALQRPRPVAGTNPE--GIAKQSYQLKDESDFPPLS 1001
            +P +Q+ +    T+ +   +  QSY+LKDE DFPPLS
Sbjct: 779  SPKMQKAKQTLITDQDRLSVHMQSYELKDEEDFPPLS 815


>ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  141 bits (356), Expect = 5e-31
 Identities = 120/337 (35%), Positives = 167/337 (49%), Gaps = 17/337 (5%)
 Frame = +3

Query: 42   LKNHERLATVGTTDGSS----SIDCPLEDLCLSHREGD-LVSTVGSPGPLNSLLDLSGDY 206
            + N + +A    T  SS    S+    ED   S R    L S VG P   N+L DL+GDY
Sbjct: 582  VNNDDEVANQSETKQSSPPLHSVSLSSEDFYPSSRGYRFLTSNVGPPEAFNALSDLNGDY 641

Query: 207  DGYFYDLQFARWFHDFALP----GPIPSS-PSQFWNKPGFNNFHHQSMQMQQRNIFSPRM 371
            + +   LQ  RW++++AL      PIP   PSQ+ NK  + +   +S+Q++Q N F+ ++
Sbjct: 642  ESHCNSLQIGRWYYEYALSAAALSPIPPPLPSQYPNKNPW-DIIRRSVQVKQ-NAFA-QI 698

Query: 372  NVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRG 551
            N NG++   P FYP  P  +  GG++   EE+P+PRGTGTY P MNH   YRDRP   RG
Sbjct: 699  NSNGLL-ARPAFYPM-PSPILPGGATLAMEEMPKPRGTGTYFPNMNH---YRDRPASARG 753

Query: 552  RNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGG 731
            RN   V   R       NG  +TP E  + EK+  +  Q   P +   G  G L      
Sbjct: 754  RNQVSVRSPR------NNGRSLTPLETTVAEKSGQDLYQV--PTVNHGGGIGMLS--SSS 803

Query: 732  RPVAGGFPHANSFVLPPEKP-EFGSRRHLPGTPLSEGVRLLVPTTPVT----QGSTSCPP 896
             PV     + N  +  P++  EFGS  HL   P+   V      TP T      S     
Sbjct: 804  SPVRKAHHNGNGAMPRPDRAVEFGSFGHL---PIESSVDCSGEPTPATAHFQNSSALNVS 860

Query: 897  TPALQRPRPVAGTNPE--GIAKQSYQLKDESDFPPLS 1001
            +P +Q+ +    T+ +   +  QSY+LKDE DFPPLS
Sbjct: 861  SPKMQKAKQTLITDQDRLSVHMQSYELKDEEDFPPLS 897


>ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
            gi|355517357|gb|AES98980.1| hypothetical protein
            MTR_5g075260 [Medicago truncatula]
          Length = 490

 Score =  128 bits (321), Expect = 6e-27
 Identities = 99/291 (34%), Positives = 145/291 (49%), Gaps = 7/291 (2%)
 Frame = +3

Query: 150  STVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFAL-PGPIPSSPS--QFWNKPGFNNF 320
            S  G      SLLDL+GDYD +  +L +    + + + P  +PS P   +F N+   N++
Sbjct: 212  SVSGGTEASKSLLDLAGDYDSHIANLHYGHMCNGYPVSPVVVPSPPRSPKFHNR---NSW 268

Query: 321  HHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIP 500
                  +Q  +   P+ N NGVV  GP +  ++P I     +S+GAEE  +PRGTG Y P
Sbjct: 269  ETVRQCLQMNHSIHPQTNSNGVV--GPLYLVNHPTIPM---ASFGAEEKRKPRGTGAYFP 323

Query: 501  IMNHRTPYRD-RPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQP 677
             M  R P+RD RP+ GRGR  +P  HG    +   NG+ +   E NL  + + EP+    
Sbjct: 324  NMTSR-PFRDNRPMPGRGRGLAPGTHGHLQRYNHSNGFALASQEVNLSVEGSFEPALEVY 382

Query: 678  PVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEKPEFGS-RRHLPGTPLSEGVRLLV 854
            P L+  G+P   E +   +P   G  HAN F    +K E GS    L G+P +E      
Sbjct: 383  PGLS-NGRPRSSETY-FSQPSTWGARHANGFPHSSDKHESGSGSPQLRGSPRTEVSN--H 438

Query: 855  PTTPVTQGSTSCPPTPAL--QRPRPVAGTNPEGIAKQSYQLKDESDFPPLS 1001
            P   ++    S P T     ++   ++  +P+ I  Q Y LK+E DFPPLS
Sbjct: 439  PDQGISTSGVSVPNTEIATEEKSNSLSVADPKRIEVQGYHLKNEDDFPPLS 489


>ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589320 isoform X1 [Solanum
            tuberosum] gi|565359810|ref|XP_006346682.1| PREDICTED:
            uncharacterized protein LOC102589320 isoform X2 [Solanum
            tuberosum]
          Length = 852

 Score =  127 bits (318), Expect = 1e-26
 Identities = 109/302 (36%), Positives = 143/302 (47%), Gaps = 11/302 (3%)
 Frame = +3

Query: 129  HREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFAL--PG-PIPSSPSQFWN 299
            H   DL ST G+   L +L DLSGDYD Y   LQ+  WF+++AL  P  P+P +P   ++
Sbjct: 574  HLNWDLASTSGAELSLKALSDLSGDYDNYLKYLQYGHWFYEYALNIPALPVPQAPPSPYH 633

Query: 300  KPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGA-EEVPRP 476
               ++    Q     + N FS   + NGV+ P   FYP NP +M      Y A EE+P+ 
Sbjct: 634  MK-YSWEAAQQPSYMKTNGFS-HGSTNGVI-PSQAFYPINPMLM--HSMPYAALEEMPKQ 688

Query: 477  RGTGTYIPIMNHRTPYRDRPVQGRGRN----PSPVVHGRGHFWRPQNGWVVTPPEANLFE 644
            RGTGTY P +NH  P+  RP   +GR+     SP  +GR  F            E + FE
Sbjct: 689  RGTGTYFPNLNH-PPHGYRPSIVKGRHQAGLSSPRTNGRATF-----------TEMHTFE 736

Query: 645  KANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPE-KPEFGSRRHLP- 818
            ++ HE  Q++         P       G   + G        VLP E   EFGS   LP 
Sbjct: 737  RSFHEQLQSESSADQSNVHPLSSSHRRGHHSMTG-------MVLPTEGMVEFGSVGVLPL 789

Query: 819  GTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRPRPVAGTNPEGIA-KQSYQLKDESDFPP 995
            GT +SE  R     +  TQ  +   P PA QR   V     + +  K SY LKDE DFPP
Sbjct: 790  GTSISERSRQQRAVSSPTQQCSPVSPIPAFQRSNSVFSKELDRVTLKSSYHLKDEDDFPP 849

Query: 996  LS 1001
            LS
Sbjct: 850  LS 851


>ref|XP_004490712.1| PREDICTED: uncharacterized protein LOC101490873 [Cicer arietinum]
          Length = 811

 Score =  125 bits (313), Expect = 5e-26
 Identities = 104/300 (34%), Positives = 145/300 (48%), Gaps = 10/300 (3%)
 Frame = +3

Query: 129  HREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFAL-PGPIPSSPS--QFWN 299
            H +    S  G      SLLDL+GDYD +  +LQ+ +  + +++ P  +PSSP   +F N
Sbjct: 526  HSDRYNTSASGGTEASKSLLDLAGDYDSHITNLQYGQMCNGYSVSPVVVPSSPRSPKFHN 585

Query: 300  KPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPR 479
            +   N +      +Q  ++  P+ N N VV    Q Y  N   +    +S+GAEE  +PR
Sbjct: 586  R---NPWETVRQCLQMNHVIHPQANSNCVVG---QLYLVNHSALPM--TSFGAEEKRKPR 637

Query: 480  GTGTYIPIMNHRTPYRD-RPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANH 656
            GTG Y P MN R PYRD RP+ GRGR  +P  HG    +   NG  + P E NL  + + 
Sbjct: 638  GTGAYFPNMNSR-PYRDNRPMPGRGRGQAPGTHGHLQRYPRNNGLALAPQELNLPVEGSF 696

Query: 657  EPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEKPEFGS-RRHLPGTPLS 833
            EP+    P L   GK    E +   +P      HAN F    +K E GS    L G P +
Sbjct: 697  EPALEGYPALG-NGKARSSETY-FSQPSTWSSRHANGFPHLSDKHESGSVSPQLRGPPRT 754

Query: 834  EGVRLLVPTTPVTQGSTSCPPTPAL-----QRPRPVAGTNPEGIAKQSYQLKDESDFPPL 998
            E     V   P    STS    P +     +R   ++  +P+ I  Q+Y LK+E DFPPL
Sbjct: 755  E-----VSNHPEPGVSTSRVSVPNMGIMTEERSNSLSVADPKRIEVQAYHLKNEEDFPPL 809


>ref|XP_006371669.1| hypothetical protein POPTR_0019s14930g [Populus trichocarpa]
            gi|550317591|gb|ERP49466.1| hypothetical protein
            POPTR_0019s14930g [Populus trichocarpa]
          Length = 808

 Score =  122 bits (306), Expect = 3e-25
 Identities = 105/323 (32%), Positives = 153/323 (47%), Gaps = 6/323 (1%)
 Frame = +3

Query: 51   HERLATVGTTDGSSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQ 230
            HE +A   +T  + + + P E+L  +  E D     G+  PL SLL L GD++G+   L 
Sbjct: 502  HEGIAPSVSTTPNPADNVP-ENLSTTRVEKDFAGITGNSQPLKSLLGLRGDHNGHLQSLA 560

Query: 231  FARWFHDFALPGPIPSSPSQFWNKPGFNNFH--HQSMQMQQRNIFSPRMNVNGVVPPGPQ 404
            ++++ H  A+  PIP  PS        N +    QS+Q++Q      +MN N +   G Q
Sbjct: 561  YSQYCHMHAVSAPIPPCPSMLPLSENKNRWETVQQSLQLKQNG--HSQMNTNHIF--GTQ 616

Query: 405  FYPSNPHIMAGG--GSSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNPSPVVHG 578
             Y  NP    GG   ++  +EE    RGTGTYIP M++ +   DR   GRGR      HG
Sbjct: 617  LYCVNP----GGPFRAATDSEEKKIRRGTGTYIPNMSYHSSRGDRLSLGRGRTQPQANHG 672

Query: 579  RGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPH 758
            + H +  +NG   T  E NL E   H+ S+A+ P L   GKP PLE      P   G  +
Sbjct: 673  QLHKYTHENGLPTTLQEKNLSEH-GHDLSEAEYPHLG-NGKPVPLEAH-HSYPSVWGSSN 729

Query: 759  ANSFVLPPEKPEFGSR--RHLPGTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRPRPVAG 932
            AN       + + GSR  +H  G P +    L+V + P   G+++  P  +  +   +  
Sbjct: 730  ANGSSRAFVRTDCGSRGLQHPEGPPSTSD--LVVLSCP---GTSATSPVASTAKDLEILE 784

Query: 933  TNPEGIAKQSYQLKDESDFPPLS 1001
               E    Q Y LKD   FPPL+
Sbjct: 785  NEQERALLQQYHLKDNVHFPPLT 807


>ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256025 [Solanum
            lycopersicum]
          Length = 849

 Score =  120 bits (301), Expect = 1e-24
 Identities = 107/302 (35%), Positives = 139/302 (46%), Gaps = 11/302 (3%)
 Frame = +3

Query: 129  HREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPGP---IPSSPSQFWN 299
            H   DL ST G+     +L DLS DYD Y   LQ+  WF++ AL  P   +P +P   ++
Sbjct: 571  HLNLDLASTSGAELSSKALSDLSADYDNYLKHLQYGLWFYEHALNIPALTVPQAPPSPYH 630

Query: 300  KPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGA-EEVPRP 476
               ++    Q       N FS   + NGV+ P   FYP NP +M   G  Y A EE+P+ 
Sbjct: 631  MK-YSWEAAQQPSYMNTNGFS-HGSTNGVI-PSQAFYPINPMLM--HGMPYAALEEMPKQ 685

Query: 477  RGTGTYIPIMNHRTPYRDRPVQGRGRNP----SPVVHGRGHFWRPQNGWVVTPPEANLFE 644
            RGTGTY P +NH  P+  RP   +GR+     SP  +GRG F            E +   
Sbjct: 686  RGTGTYFPNLNH-PPHGYRPSTVKGRHQAGLRSPRTNGRGTF-----------SEMHTLG 733

Query: 645  KANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEKP-EFGSRRHLP- 818
            ++ HE  Q Q    A +    PL       P   G       VLP E+   FGS    P 
Sbjct: 734  RSYHE--QVQSESSADQSNVHPL-----SSPHRRGHHSMTGMVLPTERTVNFGSVGTGPL 786

Query: 819  GTPLSEGVRLLVPTTPVTQGSTSCPPTPALQRPRPVAGTNPEGIA-KQSYQLKDESDFPP 995
            GT +SE  R       +TQ S+   P PA QR   V     + +  K SY LKDE +FPP
Sbjct: 787  GTSISERSRQQRTVPSLTQQSSPVSPVPAFQRSNSVFSKELDRVTLKSSYHLKDEDEFPP 846

Query: 996  LS 1001
            LS
Sbjct: 847  LS 848


>ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
            gi|223542501|gb|EEF44041.1| nucleic acid binding protein,
            putative [Ricinus communis]
          Length = 821

 Score =  119 bits (299), Expect = 2e-24
 Identities = 109/341 (31%), Positives = 161/341 (47%), Gaps = 11/341 (3%)
 Frame = +3

Query: 12   WNDPEETIYFLKN-------HERLATVGTTDGSSSIDCPLEDLCLSHREGDLVSTVGSPG 170
            W++ +E  + + N       HE   ++ +T   S ++   E+L  +  E D  S    P 
Sbjct: 486  WSESKENHFVINNSACSCSNHEGKTSLCSTI-PSLVNNISENLAPTTAERDFASISQIPR 544

Query: 171  PLNSLLDLSGDYDGYFYDLQFARWFHDFALPGPI-PSSPS--QFWNKPGFNNFHHQSMQM 341
               SLLDL+GDYD +   ++F +    FA+  P+ P SP+     NK  +     QS+Q+
Sbjct: 545  SFKSLLDLTGDYDSHLKSVKFGQGCCFFAVSAPVLPCSPTAPHSKNKNPWETVR-QSLQL 603

Query: 342  QQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGTGTYIPIMNHRTP 521
            + RN+ S ++N NG+      F     + +    +++ +EE  + RGTGTYIP M++ + 
Sbjct: 604  K-RNVHS-QINTNGIFGHQQHFL----NHLVPFTTAFSSEEKRKQRGTGTYIPNMSYHSN 657

Query: 522  YRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPSQAQPPVLAVRGK 701
             R+RP   R +N     +G  H     NG   T P  N ++   HE S+A+ P L   GK
Sbjct: 658  -RERPSSERRKNHVTANNGDLHRRTRDNGLAATRPGINSYQHG-HELSEAEYPYLG-NGK 714

Query: 702  PGPLEFFPGGRPVAGGFPHANSFVLPPEKPEFGSRR-HLPGTPLSEGVRLLVPTTPVTQG 878
            P P E     +    G   AN F  P E+ +FG +   L    L E V     +T  T  
Sbjct: 715  PVPSEV-QLSQSFVWGPSSANGFSRPSERIDFGGQELQLQEASLQERVPTQDSSTSSTLV 773

Query: 879  STSCPPTPALQRPRPVAGTNPEGIAKQSYQLKDESDFPPLS 1001
              S P   A +R  PV     E  A +SY LKDE DFPPLS
Sbjct: 774  FPSSPEVTAAERREPVLQNVQERAASESYHLKDEVDFPPLS 814


>ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [Arabidopsis thaliana]
            gi|332645293|gb|AEE78814.1| PAP/OAS1 substrate-binding
            domain superfamily [Arabidopsis thaliana]
          Length = 829

 Score =  115 bits (289), Expect = 3e-23
 Identities = 107/315 (33%), Positives = 141/315 (44%), Gaps = 11/315 (3%)
 Frame = +3

Query: 87   SSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPG 266
            +S++  P ED+ L H  G  VS  G+P   N L DLSGDY+     L+F RW+ D+   G
Sbjct: 551  ASAVPWPQEDMHL-HYSGHCVS--GTP---NMLSDLSGDYESQLNSLRFGRWWFDYVQNG 604

Query: 267  PI-PSSPSQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGG 443
            P+ P SP      P  N++      +  R      +N NGVVP    F+  NP ++ G G
Sbjct: 605  PMSPLSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPR-QVFFHVNPQMIPGPG 663

Query: 444  SSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP----SPVVHGRGHFWRPQNGW 611
              +G EE+P+PRGTGTY P  NH   YRDRP   RGRN     SP  +GR      Q   
Sbjct: 664  --FGIEELPKPRGTGTYFPNANH---YRDRPFSPRGRNSHQARSPRNNGRS---MSQAHS 715

Query: 612  VVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEK- 788
             +  P+ N  E+  H P+Q              L+ FP      G   H      P EK 
Sbjct: 716  EMNFPDRNTRERQLHYPNQTNGS--CDMSHTDSLDSFP---DTNGSTNH------PYEKA 764

Query: 789  PEFGSRRHLPGTPLSEGVRLLVP-----TTPVTQGSTSCPPTPALQRPRPVAGTNPEGIA 953
            P+F     LP       V +L P          +G  + P  P   +PRP +        
Sbjct: 765  PDFRPTEPLP-------VEVLSPPEDSKPRDSIEGHHNRPHRP---KPRPSSTQEERVTP 814

Query: 954  KQSYQLKDESDFPPL 998
             QSY L D+ +FPPL
Sbjct: 815  TQSYHLTDDDEFPPL 829


>dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
          Length = 660

 Score =  115 bits (289), Expect = 3e-23
 Identities = 107/315 (33%), Positives = 141/315 (44%), Gaps = 11/315 (3%)
 Frame = +3

Query: 87   SSSIDCPLEDLCLSHREGDLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPG 266
            +S++  P ED+ L H  G  VS  G+P   N L DLSGDY+     L+F RW+ D+   G
Sbjct: 382  ASAVPWPQEDMHL-HYSGHCVS--GTP---NMLSDLSGDYESQLNSLRFGRWWFDYVQNG 435

Query: 267  PI-PSSPSQFWNKPGFNNFHHQSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGG 443
            P+ P SP      P  N++      +  R      +N NGVVP    F+  NP ++ G G
Sbjct: 436  PMSPLSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPR-QVFFHVNPQMIPGPG 494

Query: 444  SSYGAEEVPRPRGTGTYIPIMNHRTPYRDRPVQGRGRNP----SPVVHGRGHFWRPQNGW 611
              +G EE+P+PRGTGTY P  NH   YRDRP   RGRN     SP  +GR      Q   
Sbjct: 495  --FGIEELPKPRGTGTYFPNANH---YRDRPFSPRGRNSHQARSPRNNGRS---MSQAHS 546

Query: 612  VVTPPEANLFEKANHEPSQAQPPVLAVRGKPGPLEFFPGGRPVAGGFPHANSFVLPPEK- 788
             +  P+ N  E+  H P+Q              L+ FP      G   H      P EK 
Sbjct: 547  EMNFPDRNTRERQLHYPNQTNGS--CDMSHTDSLDSFP---DTNGSTNH------PYEKA 595

Query: 789  PEFGSRRHLPGTPLSEGVRLLVP-----TTPVTQGSTSCPPTPALQRPRPVAGTNPEGIA 953
            P+F     LP       V +L P          +G  + P  P   +PRP +        
Sbjct: 596  PDFRPTEPLP-------VEVLSPPEDSKPRDSIEGHHNRPHRP---KPRPSSTQEERVTP 645

Query: 954  KQSYQLKDESDFPPL 998
             QSY L D+ +FPPL
Sbjct: 646  TQSYHLTDDDEFPPL 660


>ref|XP_007017068.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 1
            [Theobroma cacao] gi|508787431|gb|EOY34687.1| NT domain
            of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 836

 Score =  111 bits (277), Expect = 7e-22
 Identities = 96/298 (32%), Positives = 138/298 (46%), Gaps = 10/298 (3%)
 Frame = +3

Query: 141  DLVSTVGSPGPLNSLLDLSGDYDGYFYDLQFARWFHDFALPGPIPSSPSQFWNKPGFNNF 320
            +L    G    L SLLDL+GDYDG F+ L + ++ H F++  P+          P   N 
Sbjct: 565  ELAGIFGDSESLKSLLDLTGDYDGQFWSLLYGQYCHLFSVSSPV---------SPHLQNE 615

Query: 321  HH-----QSMQMQQRNIFSPRMNVNGVVPPGPQFYPSNPHIMAGGGSSYGAEEVPRPRGT 485
            +H     QS+ ++Q +++S R + NG++  G QF  S P +           E  + RGT
Sbjct: 616  NHWETIEQSIPLKQ-DLYSQR-DSNGIL--GSQFCFSKPPVAVHTALD---SEDKKKRGT 668

Query: 486  GTYIPIMNHRTPYRDRPVQGRGRNPSPVVHGRGHFWRPQNGWVVTPPEANLFEKANHEPS 665
            GTYIP + +R+  R+R   GRG   +   + +   +    G      E  L ++ +HE S
Sbjct: 669  GTYIPSIKYRSN-RERHSSGRGIFQASRAYSQLQRYTNNKGSATVQQEMALSQEGSHELS 727

Query: 666  QAQPPVLAVRGKPGPLEFFPGGR----PVAGGFPHANSFVLPPEKPEF-GSRRHLPGTPL 830
              + P L      GP++F P       P   G   A+    PPE+ E   S   L  T +
Sbjct: 728  PKEYPAL------GPVKFGPPNTHPPYPSVWGLCAASGLNCPPERFESESSSLELQSTNM 781

Query: 831  SEGVRLLVPTTPVTQGSTSCPPTPALQRPRPVAGTNPEGIAKQSYQLKDESDFPPLSI 1004
             E   L     P T GST     PA Q  +PV  +N E  A  SY LK+E DFPPLS+
Sbjct: 782  PEDNAL---PDPCTCGSTPSVMIPAAQSAKPVLESNQESDAGLSYHLKNEHDFPPLSL 836

