BLASTX nr result

ID: Paeonia22_contig00007867 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00007867
         (2267 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Popu...   161   1e-36
ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   152   8e-34
ref|XP_007039310.1| Uncharacterized protein isoform 5 [Theobroma...   147   2e-32
ref|XP_007039306.1| Uncharacterized protein isoform 1 [Theobroma...   145   1e-31
ref|XP_004309511.1| PREDICTED: uncharacterized protein LOC101295...   134   1e-28
ref|XP_007210890.1| hypothetical protein PRUPE_ppa001807mg [Prun...   134   2e-28
ref|XP_002518949.1| conserved hypothetical protein [Ricinus comm...   127   2e-26
gb|EXC02134.1| hypothetical protein L484_024100 [Morus notabilis]     124   2e-25
ref|XP_006377881.1| hypothetical protein POPTR_0011s15260g, part...    95   2e-16
ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778...    85   1e-13
ref|XP_007148023.1| hypothetical protein PHAVU_006G174000g [Phas...    85   1e-13
ref|XP_007148022.1| hypothetical protein PHAVU_006G174000g [Phas...    85   1e-13
ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma...    81   2e-12
ref|XP_007148021.1| hypothetical protein PHAVU_006G174000g [Phas...    77   3e-11
ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun...    77   3e-11
ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma...    73   5e-10
ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma...    73   5e-10
ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma...    73   5e-10
ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma...    73   5e-10
ref|XP_002893751.1| predicted protein [Arabidopsis lyrata subsp....    70   3e-09

>ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Populus trichocarpa]
            gi|550349961|gb|EEE85326.2| hypothetical protein
            POPTR_0001s45660g [Populus trichocarpa]
          Length = 911

 Score =  161 bits (408), Expect = 1e-36
 Identities = 206/774 (26%), Positives = 320/774 (41%), Gaps = 192/774 (24%)
 Frame = +2

Query: 227  LSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSVSLPNESLL 406
            L   ++SDF A+  +    L  Y    SG L+G+ +         KD +G  S+ N+   
Sbjct: 122  LFTESKSDFDAVPVTKSTEL-GYEAHKSGDLRGILHW--------KDKHGGFSMFNDDST 172

Query: 407  KQGV-------PAAEGSQAFL----NSASLCTNGSGVLGRDHQIGSRGMEQPGADSSSSP 553
            KQ +       PA   ++        S SLC   SG+  +DH++  +   +   DS   P
Sbjct: 173  KQALVHMLFIFPAGSPAEGLKLSPETSDSLCGKLSGISLKDHEVRPKRTRE--IDSQCVP 230

Query: 554  VEISNVATLKRPSTLCSTAILQD----VLKLPYPASVATPQVNGSIGG------------ 685
            + +    T    S L S+AILQD    +  LP   S ++   N +  G            
Sbjct: 231  ISLKFSTT----SDLNSSAILQDPQSGINYLPPSVSWSSCDTNIAYFGRSLSQQLDFHAA 286

Query: 686  ---------VMVFPV--SSP------------VLSEDVNFSDGFAVNNNDNSFAYTTFCL 796
                     +   PV  S P            VLSE+++ SDG    + +N   Y    L
Sbjct: 287  KQNVPPSSDINSLPVLVSEPSVASTGYLPFNHVLSENLD-SDGDGGVSKNNFLGYGQASL 345

Query: 797  KVPDFVWNSEGKEFNQDGSLIDTEKEIK-----DHLFVES--SSTKEAQVSNKKDPLFI- 952
            K P  V + + KE   +  L D  KE K      H  +E    +  E Q++    P+ + 
Sbjct: 346  KKPHAVVD-KSKEVFHNKVLTDKGKEGKMGKPVTHKVMEPVPMAKSELQITCPSPPIDLT 404

Query: 953  ----KESELF-----------------VSH------PQNHLTEELHCPERCVS------- 1030
                K  E+F                 V+H      P      ++ CP   +        
Sbjct: 405  LEVDKSKEVFHHKVLADKGKEGKLGKPVTHEVMEPVPMAKSELQITCPSLLIDLTLESLG 464

Query: 1031 ------IESSSEAL-DNNSEVDSPCWKGT-QARHSPFEGSRPVNSELLKHEVEAGKSLNP 1186
                  IE+SS+ + +N+S++DSPCWKG   A  S  E S P N + LK E EA   LNP
Sbjct: 465  IKESDPIENSSKIINENDSDLDSPCWKGKLAAEQSSCEVSVPDNFQHLKSEQEACSYLNP 524

Query: 1187 LAPQFFPRKVKESSDYRGIE-------CCQKSIS------------LDAAEGGPCPSKVI 1309
            LAP FFP   K+  +Y G E         QK+ S              +A  G   S+  
Sbjct: 525  LAPHFFPSSDKQKVNYCGNEGDGNDCFSFQKTASSVVNLVSREQRLQHSATAGSSSSEQS 584

Query: 1310 TEVGALCLDEVYASKKE-PALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNTYGS 1474
            +   A C  +++   KE   L +S +S    SS + +P+V+EDYF S    +TG    G 
Sbjct: 585  SITEAHCYSDMHVPNKEYELLTDSSSSSMHGSSCVVLPSVLEDYFTSSGQLLTGQCVGGF 644

Query: 1475 VTGIKGAAPTGSFSGAVWDNYHPSST---------------------------IDIQVAV 1573
               IK  AP GS S +++ + H   +                           +D Q+ V
Sbjct: 645  GKAIKDTAPNGSTSVSLFASKHVFDSSSCREGVSTDLSETYGGATKPLCSPPRLDFQIVV 704

Query: 1574 NALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETP-PYA-- 1744
              + ++SELL+QNC++D +SL E + +I++ +I+NL  C+R   G+ +   E+  P+   
Sbjct: 705  KTMNELSELLMQNCTNDLDSLNEHEHDIIKRIIHNLTLCIRNRVGEHTLMSESSHPHTSY 764

Query: 1745 ----GTSNFEGTDVMHYVQKTQSESVPH------------------------------GF 1822
                 T   + +++     +T++  V H                              GF
Sbjct: 765  CVRKSTHLNKCSNMELQTTRTKAVMVSHELGHQNKHERQMSSTSFRERFLDSLNARNGGF 824

Query: 1823 DK----SQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRM 1972
            +K    +QV EK  +  +++EE+  PQVL YK LWLEAEA + S+KYK S++ +
Sbjct: 825  NKNEDITQVNEKALEGHYELEEEENPQVLFYKNLWLEAEAALCSMKYKASVLEV 878


>ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera]
            gi|302143995|emb|CBI23100.3| unnamed protein product
            [Vitis vinifera]
          Length = 1167

 Score =  152 bits (383), Expect = 8e-34
 Identities = 196/767 (25%), Positives = 296/767 (38%), Gaps = 163/767 (21%)
 Frame = +2

Query: 194  HLSPEFHCKTPLSVHNQSDFTALSTST------DIPLTDYRRDSSGL------------L 319
            +++P     +PL V N+ ++  LSTS          L DY +  SGL            L
Sbjct: 157  YVAPAIEDNSPLVVLNEPNYDLLSTSHAAHLNGSSSLDDYTQSMSGLEYPSRWCGFWNGL 216

Query: 320  KGLSYGDD---RRSFSCKDNNGSVSLPNESLLKQGVPAAEG-SQAFLNSASLCTNGSGVL 487
              +  G       S   K++N   S    S + QG P AEG S +   S         +L
Sbjct: 217  ADIEQGKKVELDESLCSKESNFVGSSIYRSYINQGDPTAEGVSNSEEGSVLSDRKYVDIL 276

Query: 488  GRDHQIGSRGMEQPGADSSSSPVEISNVATLKRPST--LCSTAILQDVLKLPYPASVATP 661
            GRD+ +GS   +     S   P     V +L  P T  L ST++L +    P+P + +  
Sbjct: 277  GRDNCVGSLSPDHFNNKSFYEPKANPMVVSLDFPRTSFLGSTSVLPET---PHPRAPSLE 333

Query: 662  QVNGS-----------------IGGVMVFPVS----SP--VLSEDVNFSDGFAVN----- 757
             V  S                 I   +  PVS    SP  V+    N      VN     
Sbjct: 334  PVTNSWNYRKPQSALYEKCFRKIDSCVDDPVSKAKSSPAIVIRPPANSPSSLGVNSFSSR 393

Query: 758  ------NNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEA 919
                  N++N   +    ++ P     SEG+E   D S ++   +  DHL +ESSSTK+ 
Sbjct: 394  NMICTDNSENVSGHHLSNMEEPHIPVISEGRELYSDTSQLNGHWQRNDHLSMESSSTKKH 453

Query: 920  QVSNKK------DPLFIKESELFVSH--PQNHLTEELHCPERCVSIESSSEALDN-NSEV 1072
            ++ N +      D L    SEL + H   ++  +   +  E   SI+++SE LD+ N  V
Sbjct: 454  ELLNNEMGVKETDNLLRARSELQIPHLNVEDGFSFSPNSIEAVNSIDNTSETLDHYNPAV 513

Query: 1073 DSPCWKGTQARH-SPFEGSRPVNSELLKHEVEAGKSLNPLAPQFFPR-----------KV 1216
            DSPCWKG+   H SPFE S  ++   L  ++EA    N      FP            K 
Sbjct: 514  DSPCWKGSITSHFSPFEVSEALSPHNLMEQLEALDGFNLQGHHIFPLNSDDAVNVSSLKP 573

Query: 1217 KESSDYRGIECCQKSI------------------SLDAAEGGPCPSKVITEVGALCLDEV 1342
             E+++Y    C +  +                  SLDA + GP   K+ +  G    +++
Sbjct: 574  NENTEYHKNVCGENGLLPSWKRPSVVNHPSREQRSLDAFKTGPYCQKLSSGDGNQSSNDI 633

Query: 1343 YASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS-----------VTGDN--------- 1462
               K++ +L NS  S  +  S     +  E  F S           VTG+N         
Sbjct: 634  IQPKRDHSLLNSSKSDNLELSHTMRQSFEEVKFTSERKLSSGVGVEVTGNNINDVSRDGS 693

Query: 1463 ---TYGSVTGIKGAAPTGSFSGAVWDNYHPSST---IDIQVAVNALYKISELLVQNCSSD 1624
               TY     I  +  +G  +         S +   ID+ + +N +  +S LL+ +CS +
Sbjct: 694  SHETYHLTENISCSPLSGDDASTKLTKQPASESTPKIDVHMLINTVQDLSVLLLSHCSDN 753

Query: 1625 SNSLKEQDQNILQHVINNLYFCMRKSGGQ-----------------RSSSIETPPYAGTS 1753
            + SLKEQD   L+ VI+N   C+ K G +                 +S+S   P     +
Sbjct: 754  AFSLKEQDHETLKRVIDNFDACLTKKGQKIAEQGSSHFLGELPDLNKSASASWPLGKKVA 813

Query: 1754 NFEGTDVMH-----------------------YVQKTQSESVPHGFDKSQVIEKFPKMKH 1864
            +    D  H                       +V     E   +     Q I K      
Sbjct: 814  DANVEDQFHCQSDHKGKRHCSVSGNKDEKLSDFVSLVNDEDTVNDDSTIQAIRKILDKNF 873

Query: 1865 QIEEDMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRMKRR*IKSKAQK 2005
              EE+  PQ LLY+ LWLEAEA + S+ Y+    RMK    K K +K
Sbjct: 874  HDEEETDPQALLYRNLWLEAEAALCSISYRARFDRMKIEMEKFKLRK 920


>ref|XP_007039310.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|590674956|ref|XP_007039311.1| Uncharacterized protein
            isoform 5 [Theobroma cacao] gi|508776555|gb|EOY23811.1|
            Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776556|gb|EOY23812.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 835

 Score =  147 bits (372), Expect = 2e-32
 Identities = 182/739 (24%), Positives = 272/739 (36%), Gaps = 148/739 (20%)
 Frame = +2

Query: 203  PEFHCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSV 382
            P  H    LS H  S    + +S+         +  GL     +        C     S 
Sbjct: 105  PPLHTHFTLSTHQSSQTNFIPSSSSFGNVG---NKGGLQGTAVHQQGTEILRCNRQVASA 161

Query: 383  -SLPNESLLKQGVPAAEGSQAFLNSASLCTNGSGVLGRDHQIGSRGMEQPGADSSSSPVE 559
             SL + + L+QG            S  L   GS V+G+D+QI     E+   +SS  P+ 
Sbjct: 162  GSLSSNNPLEQGTTLEGSKLVSETSFVLRGKGSVVIGKDNQIRPEDKEKIHTESSIFPLA 221

Query: 560  ISNVATLKRPSTLCSTAILQDVLKLPYPASVATPQ---------VNGSIGGVMVFPVSS- 709
             S V  L +    C T        LP+P      Q         +  S+ G  +FP  S 
Sbjct: 222  NSEVNLLMK----CVTKPFSISSDLPFPPRPQDTQSQLLYSAESIACSLFGSTIFPYESC 277

Query: 710  ----------------------------------------PVLSEDVNFSDGFAVNNNDN 769
                                                    PV   +V      AV++ D+
Sbjct: 278  FPHLGSCHAETLVSHAPECFSYSAQICKPSSAGSNPPIVNPVPLVNVASGGSDAVSSRDS 337

Query: 770  SFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLF 949
             F Y    +     V N   K    D  +I+           E     E       +P  
Sbjct: 338  YFDYVLPGMMDTSTVHNPVDKVACHDQVIIEKG---------EKGKIVEPFHDETNNPSI 388

Query: 950  IKESELFVSHPQ--NHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGTQAR------ 1105
              +S+L ++ P     LT E H  +  +  + SS +   +S+VDSPCWKGTQA       
Sbjct: 389  RAKSKLRIACPNVPQDLTLEQHGAKPGIPDDKSSTS-HGDSDVDSPCWKGTQANKSPLSD 447

Query: 1106 -----------HSPFEGSRPVNSELLKHEVEAGKSLNPLAPQFFPRKVKESSDYRGIE-- 1246
                        SPF  S P+ SE  K+E  A  SLNP AP F P   K   D+   E  
Sbjct: 448  SVPANSEDSKGQSPFRVSMPLKSEHSKNEKVARSSLNPQAPVFIPGNSKPKVDHHQKEGH 507

Query: 1247 -----CCQKSISLDAAEGGP------------CPSKVITEVGALCLDEVYASKKEPALNN 1375
                   QKS +LD                  CPS+ I ++G     +V+ SKKE  +  
Sbjct: 508  GDSSLSSQKSAALDVTSSSSEHRSTDSVNAVKCPSERIDDIGIQSSSDVHDSKKECGIPY 567

Query: 1376 SKTSPEIISSQMAI-PNVMEDYFRS----VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYH 1540
                   ++S  +  P + E+Y  S    V G N  GS+ GI  AA  G  S     ++ 
Sbjct: 568  KSFRSSAVNSSCSFQPYLREEYVTSASQLVRGTNVAGSMEGIADAAHNGLDSVEDIAHHG 627

Query: 1541 PSST-------------------------------------IDIQVAVNALYKISELLVQ 1609
            PS++                                     ID+++ +N +  +SELL+Q
Sbjct: 628  PSTSFSFLETETALNSHSTGVGVFSDFTERPQEPSKSTPPKIDVKLMINTMQYLSELLLQ 687

Query: 1610 NCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETPP----------------- 1738
            N S D  SL E + + L  ++NNLY  +R   G  +  +E+                   
Sbjct: 688  NSSFDLGSLSEHEYDKLLTIMNNLYVLIRNKAGLMAVRLESSHPCTLYCRRQPADRHEEM 747

Query: 1739 YAGTSNFEGTDVMHYVQKTQSESVPHGFDKSQVIEKFPKMKHQIEEDMQPQVLLYKKLWL 1918
            Y  ++      +++   ++  E    G D SQVIEK PK+   IE++M  + L Y+ LWL
Sbjct: 748  YKTSAPMLSGRMLYSFYQSNDEGFEKGGDISQVIEKDPKVIPSIEKEMPSEALFYRDLWL 807

Query: 1919 EAEAEMLSVKYKDSLVRMK 1975
            EA+A +   KY+   ++M+
Sbjct: 808  EAKAALNLKKYQAHALQMQ 826


>ref|XP_007039306.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674942|ref|XP_007039307.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590674946|ref|XP_007039308.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590674950|ref|XP_007039309.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776551|gb|EOY23807.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776552|gb|EOY23808.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776553|gb|EOY23809.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776554|gb|EOY23810.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 848

 Score =  145 bits (365), Expect = 1e-31
 Identities = 188/753 (24%), Positives = 277/753 (36%), Gaps = 162/753 (21%)
 Frame = +2

Query: 203  PEFHCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSV 382
            P  H    LS H  S    + +S+         +  GL     +        C     S 
Sbjct: 105  PPLHTHFTLSTHQSSQTNFIPSSSSFGNVG---NKGGLQGTAVHQQGTEILRCNRQVASA 161

Query: 383  -SLPNESLLKQGVPAAEGSQAFLNSASLCTNGSGVLGRDHQIGSRGMEQPGADSSSSPVE 559
             SL + + L+QG            S  L   GS V+G+D+QI     E+   +SS  P+ 
Sbjct: 162  GSLSSNNPLEQGTTLEGSKLVSETSFVLRGKGSVVIGKDNQIRPEDKEKIHTESSIFPLA 221

Query: 560  ISNVATLKRPSTLCSTAILQDVLKLPYPASVATPQ---------VNGSIGGVMVFPVSS- 709
             S V  L +    C T        LP+P      Q         +  S+ G  +FP  S 
Sbjct: 222  NSEVNLLMK----CVTKPFSISSDLPFPPRPQDTQSQLLYSAESIACSLFGSTIFPYESC 277

Query: 710  ----------------------------------------PVLSEDVNFSDGFAVNNNDN 769
                                                    PV   +V      AV++ D+
Sbjct: 278  FPHLGSCHAETLVSHAPECFSYSAQICKPSSAGSNPPIVNPVPLVNVASGGSDAVSSRDS 337

Query: 770  SFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLF 949
             F Y    +     V N   K    D  +I+           E     E       +P  
Sbjct: 338  YFDYVLPGMMDTSTVHNPVDKVACHDQVIIEKG---------EKGKIVEPFHDETNNPSI 388

Query: 950  IKESELFVSHPQ--NHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGTQAR------ 1105
              +S+L ++ P     LT E H  +  +  + SS +   +S+VDSPCWKGTQA       
Sbjct: 389  RAKSKLRIACPNVPQDLTLEQHGAKPGIPDDKSSTS-HGDSDVDSPCWKGTQANKSPLSD 447

Query: 1106 -----------HSPFEGSRPVNSELLKHEVEAGKSLNPLAPQFFPRKVKESSDYRGIE-- 1246
                        SPF  S P+ SE  K+E  A  SLNP AP F P   K   D+   E  
Sbjct: 448  SVPANSEDSKGQSPFRVSMPLKSEHSKNEKVARSSLNPQAPVFIPGNSKPKVDHHQKEGH 507

Query: 1247 -----CCQKSISLDAAEGGP------------CPSKVITEVGALCLDEVYASKKEPALNN 1375
                   QKS +LD                  CPS+ I ++G     +V+ SKKE  +  
Sbjct: 508  GDSSLSSQKSAALDVTSSSSEHRSTDSVNAVKCPSERIDDIGIQSSSDVHDSKKECGIPY 567

Query: 1376 SKTSPEIISSQMAI-PNVMEDYFRS----VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYH 1540
                   ++S  +  P + E+Y  S    V G N  GS+ GI  AA  G  S     ++ 
Sbjct: 568  KSFRSSAVNSSCSFQPYLREEYVTSASQLVRGTNVAGSMEGIADAAHNGLDSVEDIAHHG 627

Query: 1541 PSST-------------------------------------IDIQVAVNALYKISELLVQ 1609
            PS++                                     ID+++ +N +  +SELL+Q
Sbjct: 628  PSTSFSFLETETALNSHSTGVGVFSDFTERPQEPSKSTPPKIDVKLMINTMQYLSELLLQ 687

Query: 1610 NCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETP-PYAGTSNFEGTDVMHYV 1786
            N S D  SL E + + L  ++NNLY  +R   G  +  +E+  P       +  D  H V
Sbjct: 688  NSSFDLGSLSEHEYDKLLTIMNNLYVLIRNKAGLMAVRLESSHPCTLYCRRQPAD-RHEV 746

Query: 1787 QKTQSESVPH--------------------------GFDK----SQVIEKFPKMKHQIEE 1876
            +K + ++V H                          GF+K    SQVIEK PK+   IE+
Sbjct: 747  KKVKDKAVLHDQEMYKTSAPMLSGRMLYSFYQSNDEGFEKGGDISQVIEKDPKVIPSIEK 806

Query: 1877 DMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRMK 1975
            +M  + L Y+ LWLEA+A +   KY+   ++M+
Sbjct: 807  EMPSEALFYRDLWLEAKAALNLKKYQAHALQMQ 839


>ref|XP_004309511.1| PREDICTED: uncharacterized protein LOC101295876 [Fragaria vesca
            subsp. vesca]
          Length = 674

 Score =  134 bits (338), Expect = 1e-28
 Identities = 167/651 (25%), Positives = 263/651 (40%), Gaps = 105/651 (16%)
 Frame = +2

Query: 317  LKGLSYGDDRRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQAFLNSASLCTNGSGV-LGR 493
            L+G S+G    S +C          N    +QG P  +      NS S     S + +G+
Sbjct: 70   LQGSSFGRHEASLAC----------NNYAYEQGKPVKKSKLYDNNSGSARDKCSHLTMGK 119

Query: 494  DHQIGSRGMEQPGADSSSSPVEISNVATLKRP-STLCSTAILQDVLK--LPYPASVA--- 655
            ++   SR   Q  A   S  V  S     + P S  CS ++LQ   +  LPY   VA   
Sbjct: 120  ENPFTSRSTNQVDAGIFSFSVVNSVATPFEFPMSVKCSASMLQSYSQPELPYTTPVAGWN 179

Query: 656  ---------------------------TPQVN-------GSIGGVMVFPVSSPVLSEDVN 733
                                       +P+ N       GS    + F  S  +L ++  
Sbjct: 180  QTNSTMTFGESGLTKSDPCTDNFTVSRSPRDNAFPDVESGSSDTCITFSPSKSILLKNAE 239

Query: 734  FSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTK 913
             + G AV + DNS  Y++  +     +   EGK+ + D S      E          +  
Sbjct: 240  VTGGSAVIHKDNSSKYSSHDIMDLHQLLYGEGKKNDHDKSSSYKGNE---------RTCV 290

Query: 914  EAQVSNKKDPLFIKESELFVSHPQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKG 1093
            EA  S   DPL   +S+  V+  + H    L   +   +I  S++   N+S+VDSPCW+G
Sbjct: 291  EAVSSEGSDPLLTDKSDPQVTLKKPHDKSSLEHQDAEEAISLSTKLDGNDSDVDSPCWRG 350

Query: 1094 TQA-RHSPFEGSRPVNSELLKHEVEAGKSLNPLAPQFFPRKVK----------ESSDYRG 1240
            + A R +P   SR ++S  +++  EA  SLNPLAP FFPR  K          ++ D+  
Sbjct: 351  SLASRQTPLGVSRSLSSHSIENVQEASYSLNPLAPHFFPRPSKAIDNCYANEYDADDFSS 410

Query: 1241 I------------ECCQKSISLDAAEGGPCPSKVITEVGALCLDEVYASKKEPALNNSKT 1384
                            +++IS+D A  G   S  I  +G    + ++ SK+E AL N   
Sbjct: 411  FIKSDSGAVGAVSSFSKENISVDKA--GAKSSLSINGMGTQTSNNIHESKREYALLNKSG 468

Query: 1385 SPEIISSQMAIPNVMEDYFRSVTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSSTIDIQ 1564
            S   +S                            KG +   S            S ID+ 
Sbjct: 469  SDSALS----------------------------KGVSKLLS----------TDSKIDVS 490

Query: 1565 VAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETPPYA 1744
              ++ ++ +S  LVQNCS+D   +   D +++QH+INNL  C++   G +  SI     +
Sbjct: 491  TVLDMMHDLSSFLVQNCSND---VLLDDHDLIQHIINNLRMCIQHRAGGK-CSIPDFTVS 546

Query: 1745 GTSNF--EGTDVMHY------VQKTQSE--SVPHGFD----------KSQVIEKFP---- 1852
            GTSNF  + T+++         Q+T +    VP   D            ++++ FP    
Sbjct: 547  GTSNFPNKSTEIIEVGCSNMGFQETNTGPFDVPLELDYQNLINRLDFTGRMLDSFPSDSN 606

Query: 1853 ----KMKHQIE-------------EDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
                K K  I+             +++ PQ L+YKKLWLEAEA + ++KY+
Sbjct: 607  IGTGKSKDIIQVMGNTGRDNYLTKDEIDPQALVYKKLWLEAEATLRAMKYE 657


>ref|XP_007210890.1| hypothetical protein PRUPE_ppa001807mg [Prunus persica]
            gi|462406625|gb|EMJ12089.1| hypothetical protein
            PRUPE_ppa001807mg [Prunus persica]
          Length = 762

 Score =  134 bits (337), Expect = 2e-28
 Identities = 193/730 (26%), Positives = 301/730 (41%), Gaps = 127/730 (17%)
 Frame = +2

Query: 167  KDHPLQTIYHLSPEF----HCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSY 334
            +D P  T  +   EF    H     S +  SD   + +++   LT+Y   SS       +
Sbjct: 59   EDDPFSTAPYSFLEFVEDSHFPQYPSANAASDLGFMPSASKESLTNYTELSS-------F 111

Query: 335  GDDRRSFSCKDNNGSVSLPNESLLKQG------------------------------VPA 424
            G  + SFS   +N + SL  E+LL+QG                               PA
Sbjct: 112  GHSQASFS---SNKNASLAYETLLEQGPLLSCMEGTLSMRVVRNESYFLWLTCYDVNTPA 168

Query: 425  AEGSQA-FLNSASLCTNGSGV-LGRDHQIGSRGMEQPGADSSSSPVEISNVATLKRP--- 589
             +GS+    NS S+    S + +G ++Q  SR  +Q  A   S     S V T+  P   
Sbjct: 169  VKGSKPNHENSESVHEKCSDLTIGTENQFISRSTDQVDAGFFS----FSAVNTMATPHEF 224

Query: 590  --STLCSTAILQDV--LKLPYPASVATPQ-----------------------------VN 670
              S   ST+ LQD    +LPY A   T                                N
Sbjct: 225  PMSVTSSTSRLQDYSQAQLPYTAPNVTWSHCNSEIALCDSGFTKLDALTAKSTVFHLPTN 284

Query: 671  GSIGGVMVFPVSSPV-------LSEDVNFSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEG 829
             S   V++   +S         LS++V+F   +  NN D+S   +   +K    + +SEG
Sbjct: 285  NSFPAVLLESDTSTTVSPLNLALSKNVDFKGNYPPNNYDSSSKCSPSGIKDLHDLISSEG 344

Query: 830  KEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKES---ELFVSHPQNHLTE 1000
            KE + DGS  D  K  KD   + S     A +    +PL    +   +  + HP      
Sbjct: 345  KEIHHDGSPNDKGKGGKDGKPLSSEGIG-ALLKATSEPLITLTNIPDDFSLKHPG----- 398

Query: 1001 ELHCPERCVSIESSSEALDNNSEVDSPCWKGTQARHSPFEGSRPVNSELLKHEVEAGKSL 1180
                P+  VSI  + +  +N+S++DSPCWKGT A    +  SR ++S+ + +E E   SL
Sbjct: 399  ----PKGAVSISKNLD--ENDSDLDSPCWKGTLASRQ-YGVSRSLSSDFVGNEQEVRNSL 451

Query: 1181 NPLAPQFFPRKVKESSDYRGIECCQKSISLDAAEGGPCPSKVITEVGALCLDEVYASKKE 1360
            NPLAPQFFPR  K   DY   +            G    S   +E  A+          +
Sbjct: 452  NPLAPQFFPRHAKAIVDYHANDYV----------GDDFSSFQKSESSAVNSSSKGHGPVD 501

Query: 1361 PALNNSKTSPEIISSQMAIPNVMEDYFR--SVTGDNTYGSVTGIKGAAPTGSFSGAVWDN 1534
             A + S +S + I +Q +  N + D  R   +  ++  GSV  +    P G     +   
Sbjct: 502  QAGSKSSSSIKGIGTQTS--NDIHDLERVYPLLNNSESGSVLNL----PEG-----LSKL 550

Query: 1535 YHPSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNL-YFCMRKSGGQ 1711
                S +D+   +N ++ +SELLVQ CS+D +SL E  ++++Q++INNL  +     GG+
Sbjct: 551  LSTHSKLDVPTILNMMHDLSELLVQKCSNDLDSLNEH-KHVMQNIINNLCTYIQHGDGGK 609

Query: 1712 RSSS----IETP--PYAGTSNFEGTDVMHYVQKTQSESVPH------------------- 1816
               S      TP  P   T   + +++   V K ++ +VP                    
Sbjct: 610  VPISDITLTGTPYCPVKSTELHKCSNMGFQVTKKKALAVPQEINYQNDREGRKVNSHVFT 669

Query: 1817 -------------GFDKS----QVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSV 1945
                         G +KS    QV+    +  H   E++ PQ L+YKKLWL+AEA + S+
Sbjct: 670  ERMLDSFPSCSGVGTEKSNDIVQVMGNALRDNHLTTEELDPQALVYKKLWLQAEAALCSM 729

Query: 1946 KYKDSLVRMK 1975
            KY+  ++ M+
Sbjct: 730  KYETCVLCMQ 739


>ref|XP_002518949.1| conserved hypothetical protein [Ricinus communis]
            gi|223541936|gb|EEF43482.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 605

 Score =  127 bits (319), Expect = 2e-26
 Identities = 161/621 (25%), Positives = 254/621 (40%), Gaps = 90/621 (14%)
 Frame = +2

Query: 338  DDRRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQ-AFLNSASLCTNGSGVLGRDHQIGSR 514
            DD R   C+D + +  +P      QG   AEG +   L S SL  N  G  G D +    
Sbjct: 41   DDHR---CEDKDSAFFIPYRFSSTQGKLPAEGLKPCVLRSGSLYENFIGTSGIDSE---- 93

Query: 515  GMEQPGADSSSSPVEISNVATLKRPSTLCSTAILQDV----------------------- 625
                   +SS +  +I         S LCST+I  D                        
Sbjct: 94   -------NSSKTTNQIEWCVPFPDTSELCSTSIHGDTQSGLAYQITCSSSDSNISFYDRY 146

Query: 626  -----------LKLPYPASVATP-QVNGSIG-GVMVFPVSSPVLSEDVNFSDGFAVNNND 766
                       LKL   +  ++P QV+G  G G    P++  +L      SDG+   +N 
Sbjct: 147  FSQPLDSHAATLKLSCVSEHSSPVQVSGPSGTGAGYLPLN--LLLHHSMQSDGYGAFSN- 203

Query: 767  NSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDT-EKEIKDHLFVESSSTKEAQVSNKKDP 943
                  T C  V                +L++T  K+    + +E S   E   S   + 
Sbjct: 204  ------TSCNPVI----------IEDRCTLLNTTNKDTLREILLEKSQDAENGKSKTNEV 247

Query: 944  LFIKESELFVSHPQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGT-QARHSPFE 1120
            +   +S      P + +  EL      V ++ + E    NS++DSPCWKGT  A  S  E
Sbjct: 248  I---DSLTMPQVPYSSVPHEL-----TVKLQGAEEG---NSDLDSPCWKGTLAANQSILE 296

Query: 1121 GSRPVNSELLKHEVEAGKSLNPLAPQFFPRKVKESSDYRGIECCQKSISL---------- 1270
             S PVN + L+   E   SL+ LA + F    K++  YR  EC + S S           
Sbjct: 297  DSGPVNGQQLRSGQEELNSLSLLASELFASSDKQNC-YRVNECDEDSSSFFHKTASSAVP 355

Query: 1271 ---------DAAEGGPCPSKVITEVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPN 1423
                     ++   G   S++   + + C ++V    KE A+  +  +  ++ S +  P+
Sbjct: 356  LQPVEQRSANSVTTGSAFSELTNVIWSCCTNDVCLPDKEDAILKNSNNSSMLKSCILEPS 415

Query: 1424 VMEDYFRS----VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSST------------- 1552
             +ED+  S    VTG N  G++ GI+ +   GS S   ++N +  S+             
Sbjct: 416  SVEDHCYSNSQLVTGPNIAGTLRGIRESVQHGS-SRISFENKNVISSSSCRIHIPSDFTE 474

Query: 1553 --------------IDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFC 1690
                          + IQ  VN + ++SELL+ NCS+D +SL E + +I++H+INNL  C
Sbjct: 475  TCQGASRSFSCPPRLHIQKVVNTMNELSELLLHNCSNDLDSLNEHEHDIIEHIINNLTAC 534

Query: 1691 MRKSGGQRSSSIE-TPPYAGTSNFEGTDVMHYVQKTQSESVPHGFDKSQVIEKFPKMKHQ 1867
            +R   G+R+   E T P     + +  D++                K+Q+IEK     H+
Sbjct: 535  IRNRNGRRTLMPEATHPCTSYCHRKSADIL----------------KTQIIEKDMAKDHE 578

Query: 1868 IEEDMQPQVLLYKKLWLEAEA 1930
            I +D+ P+V+LYK L LE  A
Sbjct: 579  I-KDVNPRVMLYKNLLLETRA 598


>gb|EXC02134.1| hypothetical protein L484_024100 [Morus notabilis]
          Length = 753

 Score =  124 bits (310), Expect = 2e-25
 Identities = 142/539 (26%), Positives = 225/539 (41%), Gaps = 100/539 (18%)
 Frame = +2

Query: 659  PQVNGSIGGVMVFPVSSPVLSEDVNFSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEF 838
            P +  S  G   F  SS +L ++V+       +NN +S        +    + N+   E 
Sbjct: 195  PMLGSSANGTD-FTTSSCILPKNVDLPGNSVASNNKSSSGRIISGNRDIHGLPNAYSNEG 253

Query: 839  NQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKESEL--FVSHPQNHLTEELHC 1012
            +QD  L D   EIK+   V   +          DP+ I +SE+   ++   +    E   
Sbjct: 254  HQDKGLGDEGMEIKNAKSVPCKAL---------DPVVIAKSEVRFAINDIFDGSVMERVG 304

Query: 1013 PERCVSIESSSEALDNN-SEVDSPCWKGTQ-ARHSPFEGSRPVNSELLKHEVEAGKSLNP 1186
                +S + SS+ LD + S++DSPCWKG Q +  SP   +   ++  +++E EAG SLNP
Sbjct: 305  TLAAISTKGSSKLLDEDESDLDSPCWKGIQNSTKSPNIVAESSSTHSIRNESEAGTSLNP 364

Query: 1187 LAPQFFPRKVKESSDY------RGI------ECCQKSIS------LDAAEGGPCPSKVIT 1312
             APQFFP   K S DY       G+      EC    +S      +D+ + G        
Sbjct: 365  RAPQFFPSHSKGSIDYLQNNTVGGVPYFGKGECSAFDLSYKETPIVDSYKAGLETRGSTN 424

Query: 1313 EVGALCLDEVYASKKEPA-LNNSKTSPEIISSQMAIPNVMEDYFR----SVTGDNTYGSV 1477
             VG    + V    KE A L +SK+S  +   QM  P +++ +F     SV G +  G  
Sbjct: 425  AVGYQYSNGVNEPGKESAMLKDSKSSSALSPPQMIKPYLVDGFFTSKEVSVKGVDFEGFA 484

Query: 1478 TGIKGAA---------------PTGSFSG------------AVWDNYHPSSTIDIQVAVN 1576
             GI  AA               P  S SG             + ++       ++ V VN
Sbjct: 485  DGIMDAANKNPRNLSALAAEYVPHLSSSGVGALSDCSELLQCLTESLSKCPKTNVAVTVN 544

Query: 1577 ALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETPPYAGTSN 1756
            A+  +S+LLV+NCS+D +SL E +  +++H+INNLY  ++   G+ +  ++   + G+ +
Sbjct: 545  AIRCLSDLLVENCSNDLDSLNEHEHEMIRHIINNLYALIKHRVGEETPILDL-LHTGSLD 603

Query: 1757 FEGTDVMHYVQKTQSESV-----------------PHGFDKS------------------ 1831
            +       Y Q      V                  H + KS                  
Sbjct: 604  YRDKSTATYEQSNMEFQVIPRTKDLVVRQELDSRSDHAWRKSYSHAATRKMKDLVPSPKD 663

Query: 1832 -----------QVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRMK 1975
                        V+    K    I+E++ PQV L   LWLEAE  + S+KY++ ++RMK
Sbjct: 664  VGCSERGNSIVPVLRNALKENQWIDEEIHPQVFL--NLWLEAEGALCSMKYENYILRMK 720


>ref|XP_006377881.1| hypothetical protein POPTR_0011s15260g, partial [Populus trichocarpa]
            gi|550328449|gb|ERP55678.1| hypothetical protein
            POPTR_0011s15260g, partial [Populus trichocarpa]
          Length = 873

 Score = 94.7 bits (234), Expect = 2e-16
 Identities = 87/273 (31%), Positives = 129/273 (47%), Gaps = 32/273 (11%)
 Frame = +2

Query: 857  IDTEKEI-KDHLFVESSSTKEAQVSNKK--DPLFIKESELFV---SHPQNHLTEELHCPE 1018
            +D  KE+  D +  + S  K ++ + ++  +PL +  SEL +   SHP    ++ L   E
Sbjct: 578  VDKRKEVFHDKVLTDKSKGKMSKPATQEVMEPLSMTVSELQITCPSHPIELASKSLGVKE 637

Query: 1019 RCVSIESSSEALDNNSEVDSPCWKGT-QARHSPFEGSRPVNSELLKHEVEAGKSLNPLAP 1195
                  SS    +N+S++DSPCWKG   A  S  E SRP + + LK    A  +LNPLAP
Sbjct: 638  SDPIGNSSEIINENDSDLDSPCWKGKLSANQSTCEVSRPDDFQHLKSARGACSNLNPLAP 697

Query: 1196 QFFPRKVKESSDYRGIEC-------CQK----SISLDAAE----------GGPCPSKVIT 1312
             F P   K+  +YRG EC        QK    ++SL + E                  IT
Sbjct: 698  HFVPSCGKQKVNYRGTECEGDDSLTFQKTESSAVSLFSREHTLQKPGTAGSSSSDRSSIT 757

Query: 1313 EVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNTYGSVT 1480
            E     +D    +K+   L NS TS  + SS +  P++ EDYF S    +TG    GS  
Sbjct: 758  ETHC-SIDNHVRNKEYEPLTNSSTSSMLSSSCLVQPSIPEDYFISNGQLLTGKKVGGSGK 816

Query: 1481 GIKGAAPTGSFSGAVWDNYHPSSTIDIQVAVNA 1579
             IK A   GS S ++  + H +S+   +V V++
Sbjct: 817  DIKDAVSNGSTSVSLLASEHVTSSSSCRVGVSS 849



 Score = 90.5 bits (223), Expect = 3e-15
 Identities = 87/269 (32%), Positives = 126/269 (46%), Gaps = 34/269 (12%)
 Frame = +2

Query: 857  IDTEKEI-KDHLFVESSSTKEAQVSNKK--DPLFIKESELFV---SHPQNHLTEELHCPE 1018
            +D  KE+  D +  + S  K ++ + ++  +PL +  SEL +   SHP    ++ L   E
Sbjct: 181  VDKRKEVFHDEVLTDKSKVKMSKPATQEVMEPLSMTVSELQITCPSHPIELASKSLGVKE 240

Query: 1019 RCVSIESSSEALDNNSEVDSPCWKGT-QARHSPFEGSRPVNSELLKHEVEAGKSLNPLAP 1195
                  SS    +N+S++DSPCWKG   A  S  E SRP + + LK    A  +LNPLAP
Sbjct: 241  SDPIGNSSEIINENDSDLDSPCWKGKLSANQSTCEVSRPDDFQHLKSARGACSNLNPLAP 300

Query: 1196 QFFPRKVKESSDYRGIEC-------CQK----SISLDAAE----------GGPCPSKVIT 1312
             F P   ++  +YRG EC        QK    ++SL + E                  IT
Sbjct: 301  HFVPSCGQQKVNYRGTECEGDDSLTFQKTESSAVSLFSREHTLQKPGTAGSSSSDRSSIT 360

Query: 1313 EVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNTYGSVT 1480
            E      + V   + EP L NS TS  + SS +  P+++EDYF S    +T     GS  
Sbjct: 361  ETHCSIDNHVRNEEYEP-LTNSSTSSMLSSSCVVQPSILEDYFTSNGQLLTRQKVGGSGK 419

Query: 1481 GIKGAAPTGSFSGAVWDNYH--PSSTIDI 1561
             I+ A P GS S ++  + H  P ST  I
Sbjct: 420  VIEDAVPNGSTSVSLLASKHVRPISTRQI 448


>ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778126 [Glycine max]
          Length = 1048

 Score = 85.1 bits (209), Expect = 1e-13
 Identities = 111/419 (26%), Positives = 173/419 (41%), Gaps = 66/419 (15%)
 Frame = +2

Query: 896  ESSSTKEAQVSNKKDPLFIKESELFVSHPQ-NHLTEELHCPERCVSIESSSEALDN-NSE 1069
            E SS+ +A +S+K   + + +     SH   ++L    +  E    ++ S E +D  N  
Sbjct: 347  EPSSSNKAMISDKNVSMNVVDYIFRGSHANVDNLRLRPNATEGANFVQKSFEGVDQCNPA 406

Query: 1070 VDSPCWKGTQA-RHSPFEGSRPVNSELL-KHEVEAGKSLNP-----LAPQFFPRKVKESS 1228
             DSPCWKG  A R S FE S  +  E + K E+  G  +       L  +   +K  E+S
Sbjct: 407  EDSPCWKGASAARFSHFEPSAALPQEYVHKKEISFGSIIQEPQNILLDTENNMKKSGENS 466

Query: 1229 DYRGIECCQKSISLDAAEGGPCPSKVITEV-------GALCLDEVYASK----------- 1354
            +  G +   K ++ + +  G      +T+        G+   D  + SK           
Sbjct: 467  N--GYQTHTKIVNQERSSAGSPRKFSVTKFAPEYFKSGSAVNDGPFQSKPSCGFGLHYLD 524

Query: 1355 ----KEPALNNSK-TSPEIISSQMAIPNV--------MEDYFRSVTGDNTYGSVTGIKGA 1495
                KE  +  +K T     SSQM + +V         +      TGD   G    +   
Sbjct: 525  ITKMKENTVPPAKPTDCASGSSQMGLQHVDLKEFIIFQKQQALVCTGDVDSGC--NVNNC 582

Query: 1496 APTGSFSGAVWDNYHPSSTID------------------IQVAVNALYKISELLVQNCSS 1621
            +   S   A      PSS +D                  +Q+ ++ L  +SELL+ +C +
Sbjct: 583  SEYSSSCSAEHVPPSPSSVVDTTTTPENSARKVSTEKLNVQMLLDTLQNLSELLLYHCLN 642

Query: 1622 DSNSLKEQDQNILQHVINNLYFCMRKSGGQRS-------SSIETPPYAGTS-NFEGTDVM 1777
            D+  LKE+D NIL++VI+NL  C  K+  Q +       +  ET   AG S  F      
Sbjct: 643  DACELKERDCNILKNVISNLNTCALKNAEQIAPAQECFFNQPETSKSAGESREFHQNASF 702

Query: 1778 HYVQKTQSESVPHGFDKSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
               Q T++E          +     +  H  +E  +PQ +LYK LWLEAEA + SV YK
Sbjct: 703  KRPQLTKTEMTKACNMTKDLKRILSENFHDDDEGAEPQTVLYKNLWLEAEAALCSVYYK 761


>ref|XP_007148023.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021246|gb|ESW20017.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 572

 Score = 85.1 bits (209), Expect = 1e-13
 Identities = 99/403 (24%), Positives = 156/403 (38%), Gaps = 78/403 (19%)
 Frame = +2

Query: 980  PQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGTQA-RHSPFEGSRPVNSELLKH 1156
            P   LT ++   +     +SS   ++N+S+VDSPCWKGT+A   +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1157 EVEAGKSLNPLAPQFFPRKVKESSD--------------YRGIECCQKSISLDA------ 1276
              E   SLNPLAPQFFPR      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1277 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1438
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1439 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1582
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1583 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNL-----YFCMRKSGGQRSSSIETP-- 1735
            + +SELLVQ       SN+    D+ ++Q  INNL       C+++    +S+ ++ P  
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTPVDHPSC 447

Query: 1736 ---PYAGTSNFEGTDV--------MH----YVQK---------------TQSESVPHGFD 1825
               P       E T +        +H    Y +K                 S    +   
Sbjct: 448  HNRPLELPKGLEMTSIETLNDPNKLHPQNDYTKKKTVFKMFGQSGKSFFAPSSDKGNEIA 507

Query: 1826 KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
            + QVI +        ++ M P+ LL+  LWL++EAE    KYK
Sbjct: 508  QLQVIRRSLGKTLDFDKHMHPEALLFLNLWLDSEAERCYSKYK 550


>ref|XP_007148022.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021245|gb|ESW20016.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 571

 Score = 85.1 bits (209), Expect = 1e-13
 Identities = 99/403 (24%), Positives = 156/403 (38%), Gaps = 78/403 (19%)
 Frame = +2

Query: 980  PQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGTQA-RHSPFEGSRPVNSELLKH 1156
            P   LT ++   +     +SS   ++N+S+VDSPCWKGT+A   +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1157 EVEAGKSLNPLAPQFFPRKVKESSD--------------YRGIECCQKSISLDA------ 1276
              E   SLNPLAPQFFPR      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1277 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1438
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1439 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1582
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1583 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNL-----YFCMRKSGGQRSSSIETP-- 1735
            + +SELLVQ       SN+    D+ ++Q  INNL       C+++    +S+ ++ P  
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTPVDHPSC 447

Query: 1736 ---PYAGTSNFEGTDV--------MH----YVQK---------------TQSESVPHGFD 1825
               P       E T +        +H    Y +K                 S    +   
Sbjct: 448  HNRPLELPKGLEMTSIETLNDPNKLHPQNDYTKKKTVFKMFGQSGKSFFAPSSDKGNEIA 507

Query: 1826 KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
            + QVI +        ++ M P+ LL+  LWL++EAE    KYK
Sbjct: 508  QLQVIRRSLGKTLDFDKHMHPEALLFLNLWLDSEAERCYSKYK 550


>ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776469|gb|EOY23725.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1059

 Score = 80.9 bits (198), Expect = 2e-12
 Identities = 58/167 (34%), Positives = 83/167 (49%), Gaps = 22/167 (13%)
 Frame = +2

Query: 1541 PSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRS- 1717
            P S   I V V+ +  +SELL+ +CS+++  L+EQD   L+ VINNL  CM K+ GQ + 
Sbjct: 628  PVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETL 687

Query: 1718 -SSIETPPYAGTSNFEGTDVM---------HYVQK----TQSESVPHGFD-------KSQ 1834
             S +      G+      DV+         H+ +K    ++  SV  G D        +Q
Sbjct: 688  LSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGKKDEKCSEFVSVRSGTDIKVKNDKMTQ 747

Query: 1835 VIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRMK 1975
             I+K        +E+  PQVLLYK LWLEAEA + S+ Y      MK
Sbjct: 748  AIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMK 794


>ref|XP_007148021.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021244|gb|ESW20015.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 460

 Score = 77.4 bits (189), Expect = 3e-11
 Identities = 77/294 (26%), Positives = 119/294 (40%), Gaps = 41/294 (13%)
 Frame = +2

Query: 980  PQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCWKGTQA-RHSPFEGSRPVNSELLKH 1156
            P   LT ++   +     +SS   ++N+S+VDSPCWKGT+A   +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1157 EVEAGKSLNPLAPQFFPRKVKESSD--------------YRGIECCQKSISLDA------ 1276
              E   SLNPLAPQFFPR      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1277 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1438
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1439 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1582
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1583 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETPP 1738
            + +SELLVQ       SN+    D+ ++Q  INNL     K   QR  ++++ P
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTP 441


>ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica]
            gi|462417047|gb|EMJ21784.1| hypothetical protein
            PRUPE_ppa000352mg [Prunus persica]
          Length = 1254

 Score = 77.4 bits (189), Expect = 3e-11
 Identities = 61/173 (35%), Positives = 86/173 (49%), Gaps = 23/173 (13%)
 Frame = +2

Query: 1553 IDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIET 1732
            +D+Q+ V+ L  +SELL+ NCS+    LK+ D   L+ VINNL+ C+ K+  + S   E+
Sbjct: 694  VDVQMLVDTLKNLSELLLTNCSNGLCQLKKTDIATLKAVINNLHICISKNVEKWSPMQES 753

Query: 1733 PPY-AGTSNFEGTDVMHY----VQKTQSESVPHGFD--------KSQV-IEKFPKMKHQI 1870
            P +   TS        H+      +  S S P   D        KS + + K  KM   I
Sbjct: 754  PTFQQNTSQCYAELSEHHKVLSADRPLSASAPDIQDQVIGSIHVKSDIDVVKEDKMTQAI 813

Query: 1871 E---------EDMQPQVLLYKKLWLEAEAEMLSVKYKDSLVRMKRR*IKSKAQ 2002
            +         E+  PQVLLYK LWLEAEA + S+ YK    R+K    K KA+
Sbjct: 814  KEILSENFHSEETDPQVLLYKNLWLEAEAVLCSINYKARFNRVKIEMDKCKAE 866



 Score = 65.5 bits (158), Expect = 1e-07
 Identities = 110/397 (27%), Positives = 148/397 (37%), Gaps = 61/397 (15%)
 Frame = +2

Query: 194  HLSPEFHCKTPLSVHNQSDFTALSTSTDIPLTD--------------YRRDSSGLLKGLS 331
            +LSP  H  +PL V +Q  +  LST+   PL                Y     GL  GLS
Sbjct: 143  YLSPTIHGDSPLVVPDQPSYDWLSTTHFAPLDGCSRKDYTQRPPDLKYTAQWGGLWNGLS 202

Query: 332  ------YGDDRRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQAFLNSASLCTN------- 472
                   GD   SF  K  + S S   ++ + Q  P +  S      AS   N       
Sbjct: 203  EWEQGKQGDFDGSFCSKKTDVSGSFLYKNFMNQE-PHSSNSLNSFEEASHGINTLGWEKP 261

Query: 473  ---GSGVLGRDHQIGSRGMEQPGADSSSSPVEISNVAT--LKRPSTLCSTAILQDVLKLP 637
               G+  LG    +G      P   S S    +S V    LK PS+ C T       K P
Sbjct: 262  GGSGNAHLGDKSLVGKNSKFTPSDFSKSVMGSLSVVPEPHLKAPSSQCVTKTSN--CKTP 319

Query: 638  YPASVATPQVNGSIGGVMVFPVSSPV-----------LSED-------VNFSDGFAVNNN 763
            Y  S  T Q++ S+  +     SSP            LSE        +NF    A  ++
Sbjct: 320  YSVSSETQQLDASLDYITSISESSPAFATRTPALGTKLSEPGTGLFRRLNFISDAADTDH 379

Query: 764  DNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKK-- 937
             + ++       +P     SEGK    D S +      KD    ESSS +  ++SN +  
Sbjct: 380  GDYYSSGVQESHLPQI---SEGKVLF-DSSQLGFHLGAKDCFSAESSSARNEELSNNRNI 435

Query: 938  ------DPLFIKESELFVSHPQ-NHLTEELHCPERCVSIESSSEALD-NNSEVDSPCWKG 1093
                  D +F  +  L  SH   +         E   S  SSS+ +D NN  VDSPCWKG
Sbjct: 436  INKDAWDKVFKAKPGLQNSHVGLDGFKMAFKTNETINSFLSSSDNVDPNNPGVDSPCWKG 495

Query: 1094 TQAR-HSPFEGSRPVNSELLKHEVEAGKSLNPLAPQF 1201
                  SPF  S     E +K ++E    LN   P F
Sbjct: 496  VPGSCFSPFGASEDGVPEQIK-KLEDCSGLNIHMPMF 531


>ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508776470|gb|EOY23726.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 827

 Score = 73.2 bits (178), Expect = 5e-10
 Identities = 58/187 (31%), Positives = 82/187 (43%), Gaps = 42/187 (22%)
 Frame = +2

Query: 1541 PSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRS- 1717
            P S   I V V+ +  +SELL+ +CS+++  L+EQD   L+ VINNL  CM K+ GQ + 
Sbjct: 628  PVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETL 687

Query: 1718 ---------------------SSIETPPYAGTSNFEGTDVM---------HYVQKTQS-- 1801
                                 S +      G+      DV+         H+ +K +   
Sbjct: 688  LSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGKKDEKCS 747

Query: 1802 --ESVPHGFD-------KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
               SV  G D        +Q I+K        +E+  PQVLLYK LWLEAEA + S+ Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1955 DSLVRMK 1975
                 MK
Sbjct: 808  ARYNNMK 814


>ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776467|gb|EOY23723.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1068

 Score = 73.2 bits (178), Expect = 5e-10
 Identities = 58/187 (31%), Positives = 82/187 (43%), Gaps = 42/187 (22%)
 Frame = +2

Query: 1541 PSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRS- 1717
            P S   I V V+ +  +SELL+ +CS+++  L+EQD   L+ VINNL  CM K+ GQ + 
Sbjct: 617  PVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETL 676

Query: 1718 ---------------------SSIETPPYAGTSNFEGTDVM---------HYVQKTQS-- 1801
                                 S +      G+      DV+         H+ +K +   
Sbjct: 677  LSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGKKDEKCS 736

Query: 1802 --ESVPHGFD-------KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
               SV  G D        +Q I+K        +E+  PQVLLYK LWLEAEA + S+ Y 
Sbjct: 737  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 796

Query: 1955 DSLVRMK 1975
                 MK
Sbjct: 797  ARYNNMK 803


>ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508776466|gb|EOY23722.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1017

 Score = 73.2 bits (178), Expect = 5e-10
 Identities = 58/187 (31%), Positives = 82/187 (43%), Gaps = 42/187 (22%)
 Frame = +2

Query: 1541 PSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRS- 1717
            P S   I V V+ +  +SELL+ +CS+++  L+EQD   L+ VINNL  CM K+ GQ + 
Sbjct: 628  PVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETL 687

Query: 1718 ---------------------SSIETPPYAGTSNFEGTDVM---------HYVQKTQS-- 1801
                                 S +      G+      DV+         H+ +K +   
Sbjct: 688  LSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGKKDEKCS 747

Query: 1802 --ESVPHGFD-------KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
               SV  G D        +Q I+K        +E+  PQVLLYK LWLEAEA + S+ Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1955 DSLVRMK 1975
                 MK
Sbjct: 808  ARYNNMK 814


>ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674635|ref|XP_007039223.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776465|gb|EOY23721.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776468|gb|EOY23724.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1079

 Score = 73.2 bits (178), Expect = 5e-10
 Identities = 58/187 (31%), Positives = 82/187 (43%), Gaps = 42/187 (22%)
 Frame = +2

Query: 1541 PSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRS- 1717
            P S   I V V+ +  +SELL+ +CS+++  L+EQD   L+ VINNL  CM K+ GQ + 
Sbjct: 628  PVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETL 687

Query: 1718 ---------------------SSIETPPYAGTSNFEGTDVM---------HYVQKTQS-- 1801
                                 S +      G+      DV+         H+ +K +   
Sbjct: 688  LSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGKKDEKCS 747

Query: 1802 --ESVPHGFD-------KSQVIEKFPKMKHQIEEDMQPQVLLYKKLWLEAEAEMLSVKYK 1954
               SV  G D        +Q I+K        +E+  PQVLLYK LWLEAEA + S+ Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1955 DSLVRMK 1975
                 MK
Sbjct: 808  ARYNNMK 814


>ref|XP_002893751.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297339593|gb|EFH70010.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 606

 Score = 70.5 bits (171), Expect = 3e-09
 Identities = 48/180 (26%), Positives = 85/180 (47%), Gaps = 3/180 (1%)
 Frame = +2

Query: 737  SDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEI---KDHLFVESSS 907
            S  F++ +      +  + L +     + +    + D SL +  +++   K+ L +E   
Sbjct: 29   SPSFSLKSEHEDSIWGDYTLDLSFLSSDDQRSGLDDDDSLSNLSRDVETKKEGLVLEEKI 88

Query: 908  TKEAQVSNKKDPLFIKESELFVSHPQNHLTEELHCPERCVSIESSSEALDNNSEVDSPCW 1087
                +V    +P+F K  E+ +  P N +  +      CVS +SS+E+ +++SE DSPCW
Sbjct: 89   ASSGKVLVNPNPIFSKLPEVLIK-PSN-VAGDAKLGLSCVSEKSSTESDEDDSEEDSPCW 146

Query: 1088 KGTQARHSPFEGSRPVNSELLKHEVEAGKSLNPLAPQFFPRKVKESSDYRGIECCQKSIS 1267
             G  +  S   G++ V S     ++     LNPLAPQF P   K+  +  G +C + S S
Sbjct: 147  IGMHSHKSLASGAKAVASRRSTDDLSGFHRLNPLAPQFIPSNSKKKVETDGEKCEENSSS 206

