BLASTX nr result

ID: Anemarrhena21_contig00001576 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00001576
         (2554 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008802020.1| PREDICTED: large proline-rich protein BAG6 i...   574   e-160
ref|XP_008802019.1| PREDICTED: large proline-rich protein BAG6 i...   571   e-160
ref|XP_010910594.1| PREDICTED: uncharacterized protein LOC105036...   570   e-159
ref|XP_008802021.1| PREDICTED: uncharacterized protein LOC103715...   546   e-152
ref|XP_010244866.1| PREDICTED: large proline-rich protein BAG6-l...   470   e-129
ref|XP_010244741.1| PREDICTED: large proline-rich protein BAG6-l...   470   e-129
ref|XP_010244742.1| PREDICTED: uncharacterized protein LOC104588...   462   e-127
ref|XP_010913366.1| PREDICTED: large proline-rich protein BAG6, ...   416   e-117
ref|XP_007013657.1| Ubiquitin-like superfamily protein, putative...   423   e-115
ref|XP_007013655.1| Ubiquitin-like superfamily protein, putative...   421   e-114
ref|XP_007013661.1| Ubiquitin-like superfamily protein, putative...   420   e-114
ref|XP_007013658.1| Ubiquitin-like superfamily protein, putative...   419   e-114
ref|XP_007013659.1| Ubiquitin-like superfamily protein, putative...   411   e-111
ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|...   409   e-111
ref|XP_007013662.1| Ubiquitin-like superfamily protein, putative...   409   e-111
ref|XP_007013656.1| Ubiquitin-like superfamily protein, putative...   409   e-111
ref|XP_003573668.2| PREDICTED: large proline-rich protein BAG6 [...   403   e-109
gb|KJB64610.1| hypothetical protein B456_010G057100 [Gossypium r...   398   e-107
ref|XP_008384827.1| PREDICTED: large proline-rich protein bag6-A...   396   e-107
ref|XP_004955550.1| PREDICTED: large proline-rich protein bag6-A...   395   e-107

>ref|XP_008802020.1| PREDICTED: large proline-rich protein BAG6 isoform X2 [Phoenix
            dactylifera]
          Length = 745

 Score =  574 bits (1480), Expect = e-160
 Identities = 357/777 (45%), Positives = 467/777 (60%), Gaps = 70/777 (9%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            M +  A++V     D+ EDS TTVEIKIKTLDS TYTLRV+KCVPV +LKEQIA+VTGV+
Sbjct: 1    METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTS-SSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDD+LLSAYHVEDGHTLHLVVRQP QS+++  + HVGSEGAS +P  
Sbjct: 61   SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
            +S   IAHNR S VA S VFE+VN+DQGD  +S                          +
Sbjct: 121  NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNPRNDL 180

Query: 1933 REMISGRLARASSDSGL-----SNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  +G   R S D+GL     SN        + D +Q+ ++F S    G   P VIPDS
Sbjct: 181  RE--TG--GRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDS 236

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAA-QVGLPTPA 1592
            LTT++QY+ ++RDDFRREG   N  E  ++A   G+   +   HD  SP   Q GLP+PA
Sbjct: 237  LTTMNQYLGVIRDDFRREGISTNGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSPA 296

Query: 1591 SLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSM 1412
            SLAEI+ STRQLLM+Q G CLSQLA  L D  S+TD L RM++QSSA+  GV++RNLGS+
Sbjct: 297  SLAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGSL 356

Query: 1411 LLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXX 1232
            LLEL RTTMTLRMG+ P EAVVNAGPA FISA+GPN LMVQ VPF P  +          
Sbjct: 357  LLELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF---- 412

Query: 1231 XXGLEGDSPDATFLPRNIEIRIRT-GRAVPVVSTNASEAADSXXXXXQTDPARNSSGAN- 1058
                 G+ P + F+PRN++IR+RT GRAVPV + N  E A +     QTDP RN S AN 
Sbjct: 413  -----GEPPASAFIPRNVDIRVRTGGRAVPVTNANLGEQAGAQPPQEQTDPTRNPSTANS 467

Query: 1057 VQQSFSG--------GDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQ 902
            V Q+FSG        G+S VRVVP+RT VVAVP+G +P  SDS+ S+V + YPLLAR++ 
Sbjct: 468  VNQAFSGISSTTSFAGESGVRVVPIRT-VVAVPTGHSP--SDSSGSAVGVIYPLLARIQH 524

Query: 901  SSASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLP 722
             +++NA++AR              +++  L +S +Q ++                AN+ P
Sbjct: 525  VNSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSAP 568

Query: 721  FVSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANED----------------- 593
             VS ++P  NE  ++QG S V ++SQ +PP  +SE+ TQA+ +                 
Sbjct: 569  VVSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVDE 628

Query: 592  ----------------------------------TIRGTESRDAARAGTEQEAFFSNVLQ 515
                                              T R TE+++A+R G +   FFS++++
Sbjct: 629  WLRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLVR 688

Query: 514  RLIPLISQGR--EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIPSPKRTKRNSD 350
             L+P +SQ      G +S +GS++Q    +L+ SSS +H RDPPE PSPKR +RNS+
Sbjct: 689  ELMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRNSE 745


>ref|XP_008802019.1| PREDICTED: large proline-rich protein BAG6 isoform X1 [Phoenix
            dactylifera]
          Length = 746

 Score =  571 bits (1472), Expect = e-160
 Identities = 357/778 (45%), Positives = 467/778 (60%), Gaps = 71/778 (9%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            M +  A++V     D+ EDS TTVEIKIKTLDS TYTLRV+KCVPV +LKEQIA+VTGV+
Sbjct: 1    METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTS-SSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDD+LLSAYHVEDGHTLHLVVRQP QS+++  + HVGSEGAS +P  
Sbjct: 61   SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
            +S   IAHNR S VA S VFE+VN+DQGD  +S                          +
Sbjct: 121  NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNPRNDL 180

Query: 1933 REMISGRLARASSDSGL-----SNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  +G   R S D+GL     SN        + D +Q+ ++F S    G   P VIPDS
Sbjct: 181  RE--TG--GRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDS 236

Query: 1768 LTTLSQYISLMRDDFRREGFGANVS-EHIDNAEFTGVRFNEGWTHDSHSPAA-QVGLPTP 1595
            LTT++QY+ ++RDDFRREG   N   E  ++A   G+   +   HD  SP   Q GLP+P
Sbjct: 237  LTTMNQYLGVIRDDFRREGISTNAGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSP 296

Query: 1594 ASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGS 1415
            ASLAEI+ STRQLLM+Q G CLSQLA  L D  S+TD L RM++QSSA+  GV++RNLGS
Sbjct: 297  ASLAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGS 356

Query: 1414 MLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXX 1235
            +LLEL RTTMTLRMG+ P EAVVNAGPA FISA+GPN LMVQ VPF P  +         
Sbjct: 357  LLLELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF--- 413

Query: 1234 XXXGLEGDSPDATFLPRNIEIRIRT-GRAVPVVSTNASEAADSXXXXXQTDPARNSSGAN 1058
                  G+ P + F+PRN++IR+RT GRAVPV + N  E A +     QTDP RN S AN
Sbjct: 414  ------GEPPASAFIPRNVDIRVRTGGRAVPVTNANLGEQAGAQPPQEQTDPTRNPSTAN 467

Query: 1057 -VQQSFSG--------GDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVR 905
             V Q+FSG        G+S VRVVP+RT VVAVP+G +P  SDS+ S+V + YPLLAR++
Sbjct: 468  SVNQAFSGISSTTSFAGESGVRVVPIRT-VVAVPTGHSP--SDSSGSAVGVIYPLLARIQ 524

Query: 904  QSSASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTL 725
              +++NA++AR              +++  L +S +Q ++                AN+ 
Sbjct: 525  HVNSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSA 568

Query: 724  PFVSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANED---------------- 593
            P VS ++P  NE  ++QG S V ++SQ +PP  +SE+ TQA+ +                
Sbjct: 569  PVVSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVD 628

Query: 592  -----------------------------------TIRGTESRDAARAGTEQEAFFSNVL 518
                                               T R TE+++A+R G +   FFS+++
Sbjct: 629  EWLRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLV 688

Query: 517  QRLIPLISQGR--EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIPSPKRTKRNSD 350
            + L+P +SQ      G +S +GS++Q    +L+ SSS +H RDPPE PSPKR +RNS+
Sbjct: 689  RELMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRNSE 746


>ref|XP_010910594.1| PREDICTED: uncharacterized protein LOC105036531 [Elaeis guineensis]
          Length = 745

 Score =  570 bits (1470), Expect = e-159
 Identities = 354/774 (45%), Positives = 464/774 (59%), Gaps = 71/774 (9%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MG+  A  +    GD+ EDS TTVEIKIKTLDSQTYTLRV+KCVP+ +LKEQIA+VTGV+
Sbjct: 1    MGTNGARDITTSHGDVTEDSETTVEIKIKTLDSQTYTLRVNKCVPILMLKEQIATVTGVV 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATT-SSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDD+LLSAYHVEDGHTLHLVVRQP QS+++ S+ HVGSEGAS +P  
Sbjct: 61   SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPSTGHVGSEGASANPAA 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
            +S    AHNR S VA S VFE+VN+DQGD  +S                          +
Sbjct: 121  NSSSSTAHNRGSHVARSIVFEAVNIDQGDNRTSHLGRIISSLLSSIGTTNTAFQNPRNDL 180

Query: 1933 REMISGRLARASSDSGLS-----NSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE +     R S D+GLS     N   P    + D +Q  ++F S    G   P VIPDS
Sbjct: 181  RETVG----RTSGDTGLSDAMQSNPNPPTSRVELDSQQGPLRFQSVFPLGSQQPIVIPDS 236

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHD--SHSPAAQVGLPTP 1595
            LTT++QY+ ++RDDFRREG      E  ++A   G+  N+   HD  S  P+ Q GLP+P
Sbjct: 237  LTTMNQYLGVIRDDFRREGLSIYGREQTNDAAAAGMNGNDVQNHDFLSPLPSRQGGLPSP 296

Query: 1594 ASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGS 1415
            ASLAEI+ STRQLLM+Q G CLSQLAR+L D  S+TD L RM++QSSA+  GVL+RNLGS
Sbjct: 297  ASLAEIVLSTRQLLMDQAGGCLSQLARRLDDHVSVTDPLMRMDLQSSAIRSGVLLRNLGS 356

Query: 1414 MLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXX 1235
            +LLEL RTTMTL MG+ P EAVVNAGPA FISA+GPN +MVQ VPF P  +         
Sbjct: 357  LLLELGRTTMTLHMGQTPLEAVVNAGPAVFISASGPNPVMVQPVPFFPGSSFGGAQF--- 413

Query: 1234 XXXGLEGDSPDATFLPRNIEIRIRT-GRAVPVVSTNASEAADSXXXXXQTDPARNSSGAN 1058
                  G+   + F+PRNI+IR+RT GR +PV + N  E A +     QTDP RN S AN
Sbjct: 414  ------GEPLASAFIPRNIDIRVRTGGRTIPVTNANLGEQAGAQQPLEQTDPTRNPSAAN 467

Query: 1057 -VQQSFSG--------GDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVR 905
             V Q+FSG        G+S VRVVP+RT VVAVP+G +P  SDS+ S++ + YPLLAR++
Sbjct: 468  SVNQAFSGIPSSTSFAGESGVRVVPVRT-VVAVPAGHSP--SDSSGSAIGVIYPLLARIQ 524

Query: 904  QSSASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTL 725
              ++ NA++AR               +     Q  +   NL+S    A R  S   A++ 
Sbjct: 525  HVNSGNANNAR----------GSRASNESNQSQPNINLPNLES----AMRNQS--PASSA 568

Query: 724  PFVSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQA------------------- 602
            P VS ++P  NE   +QG S V ++SQ +PP  NSE+ TQA                   
Sbjct: 569  PVVSWMNPSANELPGYQGSSLVSITSQQAPPASNSESNTQAHVGQQVGQGSMSQLLSRVD 628

Query: 601  -------------------NEDTIRG-------------TESRDAARAGTEQEAFFSNVL 518
                               +++++ G             TE+ +A+R G++   FFS+++
Sbjct: 629  EWIRTALFPGEQVQVGGTGHQESVTGSVAVQNQTGTTGNTETHEASRVGSDDGVFFSSLV 688

Query: 517  QRLIPLISQGR--EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIPSPKRTK 362
            ++L+P +S+      G +S +GS++Q    +L+ SSS +H RDPPE PSPKR +
Sbjct: 689  RQLMPFLSEHTTVPGGSASNDGSTAQTASNHLNDSSSSQHHRDPPEAPSPKRPR 742


>ref|XP_008802021.1| PREDICTED: uncharacterized protein LOC103715983 isoform X3 [Phoenix
            dactylifera]
          Length = 721

 Score =  546 bits (1406), Expect = e-152
 Identities = 346/777 (44%), Positives = 454/777 (58%), Gaps = 70/777 (9%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            M +  A++V     D+ EDS TTVEIKIKTLDS TYTLRV+KCVPV +LKEQIA+VTGV+
Sbjct: 1    METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTS-SSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDD+LLSAYHVEDGHTLHLVVRQP QS+++  + HVGSEGAS +P  
Sbjct: 61   SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
            +S   IAHNR S VA S VFE+VN+DQGD  +S                          +
Sbjct: 121  NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNPRNDL 180

Query: 1933 REMISGRLARASSDSGL-----SNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  +G   R S D+GL     SN        + D +Q+ ++F S    G   P VIPDS
Sbjct: 181  RE--TG--GRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDS 236

Query: 1768 LTTLSQYISLMRDDFRREGFGANVS-EHIDNAEFTGVRFNEGWTHDSHSPAA-QVGLPTP 1595
            LTT++QY+ ++RDDFRREG   N   E  ++A   G+   +   HD  SP   Q GLP+P
Sbjct: 237  LTTMNQYLGVIRDDFRREGISTNAGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSP 296

Query: 1594 ASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGS 1415
            ASLAEI+ STRQLLM+Q G CLSQLA  L D  S+TD L RM++QSSA+  GV++RNLGS
Sbjct: 297  ASLAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGS 356

Query: 1414 MLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXX 1235
            +LLEL RTTMTLRMG+ P EAVVNAGPA FISA+GPN LMVQ VPF P  +         
Sbjct: 357  LLLELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF--- 413

Query: 1234 XXXGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGAN- 1058
                  G+ P + F+PRN++IR+R                        TDP RN S AN 
Sbjct: 414  ------GEPPASAFIPRNVDIRVR------------------------TDPTRNPSTANS 443

Query: 1057 VQQSFSG--------GDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQ 902
            V Q+FSG        G+S VRVVP+RT VVAVP+G +P  SDS+ S+V + YPLLAR++ 
Sbjct: 444  VNQAFSGISSTTSFAGESGVRVVPIRT-VVAVPTGHSP--SDSSGSAVGVIYPLLARIQH 500

Query: 901  SSASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLP 722
             +++NA++AR              +++  L +S +Q ++                AN+ P
Sbjct: 501  VNSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSAP 544

Query: 721  FVSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANED----------------- 593
             VS ++P  NE  ++QG S V ++SQ +PP  +SE+ TQA+ +                 
Sbjct: 545  VVSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVDE 604

Query: 592  ----------------------------------TIRGTESRDAARAGTEQEAFFSNVLQ 515
                                              T R TE+++A+R G +   FFS++++
Sbjct: 605  WLRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLVR 664

Query: 514  RLIPLISQGR--EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIPSPKRTKRNSD 350
             L+P +SQ      G +S +GS++Q    +L+ SSS +H RDPPE PSPKR +RNS+
Sbjct: 665  ELMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRNSE 721


>ref|XP_010244866.1| PREDICTED: large proline-rich protein BAG6-like [Nelumbo nucifera]
          Length = 794

 Score =  470 bits (1210), Expect = e-129
 Identities = 307/710 (43%), Positives = 400/710 (56%), Gaps = 17/710 (2%)
 Frame = -1

Query: 2473 RMGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGV 2294
            RMGS DA ++++ G D AE S  TVEIKIKTLDSQTYTLRV+KCVPVP LKEQIA++TGV
Sbjct: 29   RMGSNDANELMISGSDEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATITGV 88

Query: 2293 LSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-PQSATTSSSHVGSEGASDHPG 2117
            LSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP P S+ ++   +G EG+ DHP 
Sbjct: 89   LSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPSSASTMGFIGPEGSPDHPA 148

Query: 2116 TSSGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXX 1940
            +      +H++ +QVAHS V  + N+ DQGD                             
Sbjct: 149  SEPTTNTSHSQGNQVAHSVVLGTFNIADQGD-GVLPDINRIVSAVLGSIFTNIGSGSEGA 207

Query: 1939 XIREMISGRLARAS-----SDSGLSNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIP 1775
              RE  S RL R S     SDS  S    P    Q D      +  +    GP    VIP
Sbjct: 208  NHREPASERLERTSGASVPSDSVRSQPGQPTAGVQSDPLHGAFRLPTPASLGPLQAPVIP 267

Query: 1774 DSLTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPA--AQVGLP 1601
            DSL TLSQYI+ MR +FR          H  N++      NEG  +DS S +   Q GLP
Sbjct: 268  DSLATLSQYINRMRHEFR-----VIARSHSSNSQPASTPGNEGRDYDSASRSNEGQAGLP 322

Query: 1600 TPASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNL 1421
            TPASLAE++ STRQ+L++Q GECL QL RQL DQ +ITD L RM +QS+AM  GV ++NL
Sbjct: 323  TPASLAEVILSTRQMLIDQAGECLYQLTRQLEDQGNITDPLMRMTIQSNAMRSGVFLQNL 382

Query: 1420 GSMLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPN-TXXXXX 1244
            G++LLEL R TM LRMGR P+EAVVNAGP+ FIS +GPN +MVQ +PF P  +       
Sbjct: 383  GALLLELGRATMMLRMGRTPSEAVVNAGPSVFISNSGPNPIMVQPLPFQPGTSFGAVPIG 442

Query: 1243 XXXXXXGLEGDSPDATFLPRNIEIRIR--TGRAVPVVSTNASEAADSXXXXXQTD---PA 1079
                   L G +  + F+PRNI+IRIR  TG +VP  + N  E A        T+   P+
Sbjct: 443  AVHPGSSLVGGTLGSGFIPRNIDIRIRTATGSSVPTANVNQGEQAGVRQPSGPTNSVRPS 502

Query: 1078 RNSSGANVQQSFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQS 899
             ++SGA+   SF+ G+S VRVVP+RT V AVP+ VN   SDS+ SS+ LFYPLLARV+  
Sbjct: 503  GSASGASGSPSFA-GESGVRVVPIRTVVAAVPAPVNRPASDSSGSSIGLFYPLLARVQHV 561

Query: 898  SASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLPF 719
            ++ + +  R+            E  R  + +S +Q +N+  + G +   DS         
Sbjct: 562  TSGHFNSTRDSQVSGDRPPSVPETVRHPIPESVVQNQNISLHIGTSRDADS--------- 612

Query: 718  VSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGTESRDAARAGTEQE 539
                    N    +Q  S    SS+ S P D++ N    N + +    S    + G E  
Sbjct: 613  -------ANVVVQNQQGSLPTSSSRQSQPSDSNNNNNNNNNNNL----SSGRIQNGQESA 661

Query: 538  AFFSNVLQRLIPLISQGRE--HGESSANGSSSQIEGENLSGSSSFEHQRD 395
            A  SN   +L+  I  G +   GE++  G S+    E++  + S  + R+
Sbjct: 662  AHISNRFDQLLRTIFPGEQISVGEANFPGMSTGSATEHVGTAGSTANARE 711


>ref|XP_010244741.1| PREDICTED: large proline-rich protein BAG6-like isoform X1 [Nelumbo
            nucifera]
          Length = 684

 Score =  470 bits (1210), Expect = e-129
 Identities = 315/727 (43%), Positives = 407/727 (55%), Gaps = 23/727 (3%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MG+ DA +VL+ GG  AE S  TVEIKIKTLDSQTYTLRV+KCVPVP LKEQIA+VTGVL
Sbjct: 1    MGTNDANEVLISGGAEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATVTGVL 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-PQSATTSSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP P  + ++   +GSEG  DH  +
Sbjct: 61   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPPSASTMGFMGSEGLHDHSAS 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXX 1937
                  +H + +QVAHS V  + N+ DQG  D ++                         
Sbjct: 121  EPTTNASHGQGNQVAHSVVLGTFNIADQG--DGTLPDINRIVSAVLGSILTNGSNSEGGN 178

Query: 1936 IREMISGRLARASSDSGLSNSTHPQLLP-----QRDRRQADVQFSSTVHSGPPNPTVIPD 1772
             RE  S R+ R    S   +S  PQ        Q D      +F S V  GP  P VIPD
Sbjct: 179  RRESGSERIDRTIGASVPHDSMRPQPSQPAAGVQSDPLHGAFRFPSAVSLGPLQPPVIPD 238

Query: 1771 SLTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDS--HSPAAQVGLPT 1598
            SL TLSQY++ MR +F           + +N +  GV  NEG  HDS  HS A Q GLPT
Sbjct: 239  SLATLSQYLTRMRHEFH-----VIARSYNNNTQPAGVTGNEGQEHDSAPHSSAGQAGLPT 293

Query: 1597 PASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLG 1418
            PASLAE++ STR +L+EQ GECL QL RQL  Q+++TD L RM VQS+AM  GV+++NLG
Sbjct: 294  PASLAEVIHSTRHMLIEQAGECLYQLTRQLEGQANMTDPLMRMTVQSNAMRSGVILQNLG 353

Query: 1417 SMLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPN-TXXXXXX 1241
            ++LLEL R TMTLRMGR P+EAVVNAGP+ FIS +GPN +MVQ +PF P  +        
Sbjct: 354  ALLLELGRATMTLRMGRTPSEAVVNAGPSVFISTSGPNPIMVQPLPFQPGTSFGAVPMGA 413

Query: 1240 XXXXXGLEGDSPDATFLPRNIEIRIR--TGRAVPVVSTNASEAADSXXXXXQTDPARN-S 1070
                  L G +  + F+PRNI+IRIR  TG ++P  + N  E A       QT+PAR   
Sbjct: 414  VHSGSSLVGSTLASGFIPRNIDIRIRTVTGSSIPTANVNQGEQAGVQQPPGQTNPARPVL 473

Query: 1069 SGANVQQSFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSSAS 890
            +GA    SF+ G+S VRVVP+RT V AVP+ VN   SD + SS+ LFYPLLARV+  ++ 
Sbjct: 474  AGAAGAHSFT-GESGVRVVPIRTVVAAVPAPVNRPPSDPSGSSLGLFYPLLARVQHVTSG 532

Query: 889  NASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLPFVSG 710
            + S  R             E +++   +S +Q +N+  N  DA   D + S NT P    
Sbjct: 533  HFSSPRGSQVASERPPSVPETEQRPSPESAVQHQNIGLNH-DASARDVN-STNTTP---- 586

Query: 709  LSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGTESRDAAR-AGTEQEAF 533
                                        N + IT+         ++RDAA   G++   F
Sbjct: 587  ---------------------------QNDQGITR------NAADARDAASGVGSDDGIF 613

Query: 532  FSNVLQRLIPLISQGR---------EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIP 380
             SN+L+++IP+ISQ           E  ++S + +      +    SS+   +R    IP
Sbjct: 614  LSNLLRQVIPVISQITATEQDVILPESIDTSGHRADCGSSTQTTESSSTGNLRRCSDHIP 673

Query: 379  SPKRTKR 359
            +   +KR
Sbjct: 674  TAPNSKR 680


>ref|XP_010244742.1| PREDICTED: uncharacterized protein LOC104588496 isoform X2 [Nelumbo
            nucifera]
          Length = 676

 Score =  462 bits (1190), Expect = e-127
 Identities = 313/725 (43%), Positives = 403/725 (55%), Gaps = 21/725 (2%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MG+ DA +VL+ GG  AE S  TVEIKIKTLDSQTYTLRV+KCVPVP LKEQIA+VTGVL
Sbjct: 1    MGTNDANEVLISGGAEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATVTGVL 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-PQSATTSSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP P  + ++   +GSEG  DH  +
Sbjct: 61   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPPSASTMGFMGSEGLHDHSAS 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXX 1937
                  +H + +QVAHS V  + N+ DQG  D ++                         
Sbjct: 121  EPTTNASHGQGNQVAHSVVLGTFNIADQG--DGTLPDINRIVSAVLGSILTNGSNSEGGN 178

Query: 1936 IREMISGRLARASSDSGLSNSTHPQLLP-----QRDRRQADVQFSSTVHSGPPNPTVIPD 1772
             RE  S R+ R    S   +S  PQ        Q D      +F S V  GP  P VIPD
Sbjct: 179  RRESGSERIDRTIGASVPHDSMRPQPSQPAAGVQSDPLHGAFRFPSAVSLGPLQPPVIPD 238

Query: 1771 SLTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDS--HSPAAQVGLPT 1598
            SL TLSQY++ MR +F           + +N +  GV  NEG  HDS  HS A Q GLPT
Sbjct: 239  SLATLSQYLTRMRHEFH-----VIARSYNNNTQPAGVTGNEGQEHDSAPHSSAGQAGLPT 293

Query: 1597 PASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLG 1418
            PASLAE++ STR +L+EQ GECL QL RQL  Q+++TD L RM VQS+AM  GV+++NLG
Sbjct: 294  PASLAEVIHSTRHMLIEQAGECLYQLTRQLEGQANMTDPLMRMTVQSNAMRSGVILQNLG 353

Query: 1417 SMLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPN-TXXXXXX 1241
            ++LLEL R TMTLRMGR P+EAVVNAGP+ FIS +GPN +MVQ +PF P  +        
Sbjct: 354  ALLLELGRATMTLRMGRTPSEAVVNAGPSVFISTSGPNPIMVQPLPFQPGTSFGAVPMGA 413

Query: 1240 XXXXXGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARN-SSG 1064
                  L G +  + F+PRNI+IRIRT       + N  E A       QT+PAR   +G
Sbjct: 414  VHSGSSLVGSTLASGFIPRNIDIRIRT------ANVNQGEQAGVQQPPGQTNPARPVLAG 467

Query: 1063 ANVQQSFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSSASNA 884
            A    SF+ G+S VRVVP+RT V AVP+ VN   SD + SS+ LFYPLLARV+  ++ + 
Sbjct: 468  AAGAHSFT-GESGVRVVPIRTVVAAVPAPVNRPPSDPSGSSLGLFYPLLARVQHVTSGHF 526

Query: 883  SDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLPFVSGLS 704
            S  R             E +++   +S +Q +N+  N  DA   D + S NT P      
Sbjct: 527  SSPRGSQVASERPPSVPETEQRPSPESAVQHQNIGLNH-DASARDVN-STNTTP------ 578

Query: 703  PPPNESSAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGTESRDAAR-AGTEQEAFFS 527
                                      N + IT+         ++RDAA   G++   F S
Sbjct: 579  -------------------------QNDQGITR------NAADARDAASGVGSDDGIFLS 607

Query: 526  NVLQRLIPLISQGR---------EHGESSANGSSSQIEGENLSGSSSFEHQRDPPEIPSP 374
            N+L+++IP+ISQ           E  ++S + +      +    SS+   +R    IP+ 
Sbjct: 608  NLLRQVIPVISQITATEQDVILPESIDTSGHRADCGSSTQTTESSSTGNLRRCSDHIPTA 667

Query: 373  KRTKR 359
              +KR
Sbjct: 668  PNSKR 672


>ref|XP_010913366.1| PREDICTED: large proline-rich protein BAG6, partial [Elaeis
            guineensis]
          Length = 465

 Score =  416 bits (1068), Expect(2) = e-117
 Identities = 232/406 (57%), Positives = 280/406 (68%), Gaps = 8/406 (1%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MG+  A  +    GD+ EDS TTVEIKIKTLDSQTYTLRV+KCVP+ +LKEQIA+VTGV+
Sbjct: 1    MGTNGARDITTSHGDVTEDSETTVEIKIKTLDSQTYTLRVNKCVPILMLKEQIATVTGVV 60

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATT-SSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDD+LLSAYHVEDGHTLHLVVRQP QS+++ S+ HVGSEGAS +P  
Sbjct: 61   SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPSTGHVGSEGASANPAA 120

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
            +S    AHNR S VA S VFE+VN+DQGD  +S                          +
Sbjct: 121  NSSSSTAHNRGSHVARSIVFEAVNIDQGDNRTSHLGRIISSLLSSIGTTNTAFQNPRNDL 180

Query: 1933 REMISGRLARASSDSGL-----SNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE +     R S D+GL     SN   P    + D +Q  ++F S    G   P VIPDS
Sbjct: 181  RETV----GRTSGDTGLSDAMQSNPNPPTSRVELDSQQGPLRFQSVFPLGSQQPIVIPDS 236

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHD--SHSPAAQVGLPTP 1595
            LTT++QY+ ++RDDFRREG      E  ++A   G+  N+   HD  S  P+ Q GLP+P
Sbjct: 237  LTTMNQYLGVIRDDFRREGHSIYGREQTNDAAAAGMNGNDVQNHDFLSPLPSRQGGLPSP 296

Query: 1594 ASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGS 1415
            ASLAEI+ STRQLLM+Q G CLSQLAR+L D  S+TD L RM++QSSA+  GVL+RNLGS
Sbjct: 297  ASLAEIVLSTRQLLMDQAGGCLSQLARRLDDHVSVTDPLMRMDLQSSAIRSGVLLRNLGS 356

Query: 1414 MLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPF 1277
            +LLEL RTTMTL MG+ P EAVVNAGPA FISA+GPN +MVQ VPF
Sbjct: 357  LLLELGRTTMTLHMGQTPLEAVVNAGPAVFISASGPNPVMVQPVPF 402



 Score = 38.1 bits (87), Expect(2) = e-117
 Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
 Frame = -2

Query: 1209 HLMLHFFLGI*KYESVQAVL-FLLLVPMQAKQLIRSNSNSKQTRQGIPLV 1063
            HL+LH FLG   Y S Q V+ +LL +     +L+ SN  SKQ +Q I L+
Sbjct: 416  HLLLHLFLGTLTYVSAQVVVPYLLPMLTWVNRLVHSNRKSKQIQQEIHLL 465


>ref|XP_007013657.1| Ubiquitin-like superfamily protein, putative isoform 3 [Theobroma
            cacao] gi|508784020|gb|EOY31276.1| Ubiquitin-like
            superfamily protein, putative isoform 3 [Theobroma cacao]
          Length = 730

 Score =  423 bits (1087), Expect = e-115
 Identities = 295/739 (39%), Positives = 392/739 (53%), Gaps = 33/739 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF-------- 740
            + + S  R             + ++  + +S  Q+++ + ++ D    + +         
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNT 586

Query: 739  -SANTLPFVSGLSPPPNES-----SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGT 578
             S +     +G +    +S     S+ Q   A+F   + +    + +     +     GT
Sbjct: 587  RSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGT 646

Query: 577  ESRDAAR--AGTEQEAFFSNVLQRLIPLISQGREHGESS--ANGSSSQIEGENLSGSSSF 410
             S   A   + T+Q  F SN+L +++P + Q     +S+     +++  + E+ S  SS 
Sbjct: 647  SSGAPAAEPSITDQGVFLSNLLHQIMPYVPQQASSQQSTVPTEEANTSTQAEHTSPGSSR 706

Query: 409  EHQRDPPEIPSPKRTKRNS 353
                  P  P+ KR KR S
Sbjct: 707  RPSDSEPNSPNSKRQKRQS 725


>ref|XP_007013655.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|590578981|ref|XP_007013660.1| Ubiquitin-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508784018|gb|EOY31274.1| Ubiquitin-like superfamily
            protein, putative isoform 1 [Theobroma cacao]
            gi|508784023|gb|EOY31279.1| Ubiquitin-like superfamily
            protein, putative isoform 1 [Theobroma cacao]
          Length = 724

 Score =  421 bits (1083), Expect = e-114
 Identities = 294/737 (39%), Positives = 391/737 (53%), Gaps = 33/737 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF-------- 740
            + + S  R             + ++  + +S  Q+++ + ++ D    + +         
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNT 586

Query: 739  -SANTLPFVSGLSPPPNES-----SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGT 578
             S +     +G +    +S     S+ Q   A+F   + +    + +     +     GT
Sbjct: 587  RSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGT 646

Query: 577  ESRDAAR--AGTEQEAFFSNVLQRLIPLISQGREHGESS--ANGSSSQIEGENLSGSSSF 410
             S   A   + T+Q  F SN+L +++P + Q     +S+     +++  + E+ S  SS 
Sbjct: 647  SSGAPAAEPSITDQGVFLSNLLHQIMPYVPQQASSQQSTVPTEEANTSTQAEHTSPGSSR 706

Query: 409  EHQRDPPEIPSPKRTKR 359
                  P  P+ KR KR
Sbjct: 707  RPSDSEPNSPNSKRQKR 723


>ref|XP_007013661.1| Ubiquitin-like superfamily protein, putative isoform 7 [Theobroma
            cacao] gi|508784024|gb|EOY31280.1| Ubiquitin-like
            superfamily protein, putative isoform 7 [Theobroma cacao]
          Length = 725

 Score =  420 bits (1079), Expect = e-114
 Identities = 294/737 (39%), Positives = 390/737 (52%), Gaps = 34/737 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF-------- 740
            + + S  R             + ++  + +S  Q+++ + ++ D    + +         
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNT 586

Query: 739  -SANTLPFVSGLSPPPNES-----SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGT 578
             S +     +G +    +S     S+ Q   A+F   + +    + +     +     GT
Sbjct: 587  RSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGT 646

Query: 577  ESRDAAR--AGTEQEAFFSNVLQRLIPLISQGREHGESSA---NGSSSQIEGENLSGSSS 413
             S   A   + T+Q  F SN+L +++P + Q     +S+      ++S  + E+ S  SS
Sbjct: 647  SSGAPAAEPSITDQGVFLSNLLHQIMPYVPQQASSQQSTVPTEEANTSTQQAEHTSPGSS 706

Query: 412  FEHQRDPPEIPSPKRTK 362
                   P  P+ KR K
Sbjct: 707  RRPSDSEPNSPNSKRQK 723


>ref|XP_007013658.1| Ubiquitin-like superfamily protein, putative isoform 4 [Theobroma
            cacao] gi|508784021|gb|EOY31277.1| Ubiquitin-like
            superfamily protein, putative isoform 4 [Theobroma cacao]
          Length = 724

 Score =  419 bits (1078), Expect = e-114
 Identities = 293/736 (39%), Positives = 390/736 (52%), Gaps = 33/736 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF-------- 740
            + + S  R             + ++  + +S  Q+++ + ++ D    + +         
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNT 586

Query: 739  -SANTLPFVSGLSPPPNES-----SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGT 578
             S +     +G +    +S     S+ Q   A+F   + +    + +     +     GT
Sbjct: 587  RSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGT 646

Query: 577  ESRDAAR--AGTEQEAFFSNVLQRLIPLISQGREHGESS--ANGSSSQIEGENLSGSSSF 410
             S   A   + T+Q  F SN+L +++P + Q     +S+     +++  + E+ S  SS 
Sbjct: 647  SSGAPAAEPSITDQGVFLSNLLHQIMPYVPQQASSQQSTVPTEEANTSTQAEHTSPGSSR 706

Query: 409  EHQRDPPEIPSPKRTK 362
                  P  P+ KR K
Sbjct: 707  RPSDSEPNSPNSKRQK 722


>ref|XP_007013659.1| Ubiquitin-like superfamily protein, putative isoform 5 [Theobroma
            cacao] gi|508784022|gb|EOY31278.1| Ubiquitin-like
            superfamily protein, putative isoform 5 [Theobroma cacao]
          Length = 729

 Score =  411 bits (1056), Expect = e-111
 Identities = 284/699 (40%), Positives = 375/699 (53%), Gaps = 31/699 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF-------- 740
            + + S  R             + ++  + +S  Q+++ + ++ D    + +         
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNT 586

Query: 739  -SANTLPFVSGLSPPPNES-----SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGT 578
             S +     +G +    +S     S+ Q   A+F   + +    + +     +     GT
Sbjct: 587  RSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGT 646

Query: 577  ESRDAAR--AGTEQEAFFSNVLQRLIPLISQGREHGESS 467
             S   A   + T+Q  F SN+L +++P + Q     +S+
Sbjct: 647  SSGAPAAEPSITDQGVFLSNLLHQIMPYVPQQASSQQST 685


>ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|223527781|gb|EEF29882.1|
            scythe/bat3, putative [Ricinus communis]
          Length = 709

 Score =  409 bits (1050), Expect = e-111
 Identities = 293/737 (39%), Positives = 378/737 (51%), Gaps = 34/737 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A K+   G D+AE S TT+EIK+KTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSDGAQKI--PGTDVAEGSETTIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP      SS  + +  A+D P +S
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP---VIPSSDGLSNHSATD-PASS 114

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGD-IDSSVXXXXXXXXXXXXXXXXXXXXXXXXX 1937
            +  G        VA S V E+ ++ DQGD +   +                         
Sbjct: 115  TSRG-------HVAPSVVIETFSMPDQGDGVPPEISRIVSAVLGSFGFPNIGSGGEGVDV 167

Query: 1936 IREMISGRLARASSDSGLSNSTHPQLLPQRDRRQADVQFSSTVHSGPPNPTVIPDSLTTL 1757
             RE    R A AS ++          + Q DR Q+     +TV  G  +P +IPDSLTTL
Sbjct: 168  ARERDQHRSAAASPEAAQLQPEQGSRI-QSDRSQSVFGLPTTVSLGSLHPPIIPDSLTTL 226

Query: 1756 SQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPASLAEI 1577
            SQY+S MR +F             +  E T           S S   Q  LPTPA LAE+
Sbjct: 227  SQYLSHMRREF-------------NTIEATRRDEQRETNSTSRSGTGQERLPTPAYLAEV 273

Query: 1576 LFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSMLLELA 1397
            + S+RQ + EQ  ECL QLARQL +Q+++TDS AR+N+QSSA   GV + NLG+ LLEL 
Sbjct: 274  ITSSRQFINEQVAECLQQLARQLENQANVTDSAARLNIQSSAWRTGVQLHNLGAFLLELG 333

Query: 1396 RTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXXXGLE 1217
            RTTMTLR+G+ P+EAVVNAGPA FIS +GPN LMVQ +PF    +               
Sbjct: 334  RTTMTLRLGQAPSEAVVNAGPAVFISPSGPNPLMVQPLPFQTGASFGALPLGSVQPGSGL 393

Query: 1216 GDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANV------ 1055
             +     FLPR I+I+IR G +    + N  E  D+     Q +P   S G N+      
Sbjct: 394  VNGIGTGFLPRRIDIQIRRGSSTASTNVNREERGDTQQPSGQRNPGTGSGGENLGNQTAS 453

Query: 1054 ---QQSFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSSASNA 884
               + S  GGDS VRVVP+RT V +VP     L SDS+ +S+ LFYPLL R     AS+ 
Sbjct: 454  RATEASSFGGDSGVRVVPIRTMVASVPGQFGRLPSDSSTNSIGLFYPLLGRF-PHVASHV 512

Query: 883  SDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHF--------SANT 728
            S AR             + D+Q + +  +QR N +  + D    +S+         S N 
Sbjct: 513  SGARGSQASGEHHPAGVQRDQQSISEPAVQRVNAEPRTRDGSLPNSNLRQEPSSTRSINI 572

Query: 727  LPFVSGLSPPPNESSAHQGFSAVFVSSQPSPPIDNSENITQ--ANEDTIRGTESRDAARA 554
                +G +    ES        +  +  P   I   +   Q  A E+    T   ++   
Sbjct: 573  NILSAGGTQNSPESERQNSILQLLRNLLPGGEIHVEDAGLQGTATENAGASTAHAESQSG 632

Query: 553  GTEQEAFFSNVLQRLIPLISQ-------------GREHGESSANGSSSQIEGENLSGSSS 413
             T++  F SN+L+ ++P+ISQ              R      A  SS+Q E  N+   SS
Sbjct: 633  VTDEGIFLSNLLREIMPVISQHGVAEPNFVPQEDARASDHQRAQDSSTQAETSNV--GSS 690

Query: 412  FEHQRDPPEIPSPKRTK 362
              H       P+ KR K
Sbjct: 691  RRHSDTEASPPNSKRRK 707


>ref|XP_007013662.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma
            cacao] gi|590578991|ref|XP_007013663.1| Ubiquitin-like
            superfamily protein, putative isoform 8 [Theobroma cacao]
            gi|590578994|ref|XP_007013664.1| Ubiquitin-like
            superfamily protein, putative isoform 8 [Theobroma cacao]
            gi|508784025|gb|EOY31281.1| Ubiquitin-like superfamily
            protein, putative isoform 8 [Theobroma cacao]
            gi|508784026|gb|EOY31282.1| Ubiquitin-like superfamily
            protein, putative isoform 8 [Theobroma cacao]
            gi|508784027|gb|EOY31283.1| Ubiquitin-like superfamily
            protein, putative isoform 8 [Theobroma cacao]
          Length = 575

 Score =  409 bits (1050), Expect = e-111
 Identities = 264/586 (45%), Positives = 334/586 (56%), Gaps = 15/586 (2%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAF 758
            + + S  R             + ++  + +S  Q+++ + ++ D +
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGW 572


>ref|XP_007013656.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|590578998|ref|XP_007013665.1| Ubiquitin-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|508784019|gb|EOY31275.1| Ubiquitin-like superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508784028|gb|EOY31284.1| Ubiquitin-like superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 579

 Score =  409 bits (1050), Expect = e-111
 Identities = 267/596 (44%), Positives = 338/596 (56%), Gaps = 15/596 (2%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV  R  +  E S TT+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTGADKV-PRDSE-TEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTS 2111
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLH+VVRQP   ++  S H  ++ AS   GTS
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSAS---GTS 115

Query: 2110 SGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXI 1934
             G       S+ VA S V E+ NV DQGD                              +
Sbjct: 116  RG------HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDV 169

Query: 1933 REMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPDS 1769
            RE  S RL R S  SG+ +S+  Q     +  Q DR  +     + V  GP  P VIPDS
Sbjct: 170  REHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDS 229

Query: 1768 LTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPAS 1589
            L TLSQY+S +R +F   G          +   TG R +      S+S   Q GLPTPAS
Sbjct: 230  LATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNP---ASNSGTVQEGLPTPAS 286

Query: 1588 LAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSML 1409
            LAE+L +TRQLL+EQ GECL QLARQL DQ ++TDS AR++ QS A   GVL++NLGS+ 
Sbjct: 287  LAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLF 346

Query: 1408 LELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXX 1229
            LEL RTTMT+R+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +           
Sbjct: 347  LELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQP 406

Query: 1228 XGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQ 1049
                 +      LPR I+I+IR G +V   + N  E  D+     Q +P+  S   N   
Sbjct: 407  GSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRST 466

Query: 1048 SFS---------GGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSS 896
              S          G+S VRVVP+RT V AVP+    L SDS+ +SV L+YP L R +  +
Sbjct: 467  QTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIA 526

Query: 895  ASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANT 728
            + + S  R             + ++  + +S  Q+++ + ++ D   G+  F A +
Sbjct: 527  SGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRD---GNMRFIAKS 579


>ref|XP_003573668.2| PREDICTED: large proline-rich protein BAG6 [Brachypodium distachyon]
          Length = 861

 Score =  403 bits (1036), Expect = e-109
 Identities = 304/756 (40%), Positives = 395/756 (52%), Gaps = 41/756 (5%)
 Frame = -1

Query: 2494 IACSQLTRMGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQ 2315
            +A   L  +   D++ V M     AEDS TT+EI IKTLDSQTY LRV+KCVPVP+LKE+
Sbjct: 160  VAIHLLLEIMDCDSSNVQM--SHCAEDSETTIEINIKTLDSQTYNLRVNKCVPVPLLKEK 217

Query: 2314 IASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEG 2135
            IA+VTG+LSEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLVVRQP Q AT  S + G+E 
Sbjct: 218  IATVTGILSEQQRLICRGRVLKDDELLSAYHVEDGHTLHLVVRQPGQPAT--SGNAGNEA 275

Query: 2134 ASDHPGTSSGIGIAHNRSSQVAHSFVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXX 1955
             S +         AH+    VA S V E+VN+DQG   SS+                   
Sbjct: 276  PSANS--------AHHHGPTVARSIVLEAVNLDQGGEFSSLAQILQSLF----------- 316

Query: 1954 XXXXXXIREMISGRLARASSDSGLSNSTHPQL----LPQRDRRQADVQFSSTVHSGPPNP 1787
                       SG    A SD+  S  T P        + D++QA + F      G   P
Sbjct: 317  --------SAASG--GPAPSDTRPSEPTQPSFPNGARVELDQQQASLLFPEAT-PGSSEP 365

Query: 1786 TVIPDSLTTLSQYISLMRDDFRREGFGANVSE--HID--NAEFTGVRFNEGWTHDSHSPA 1619
             VIPDSLTT+SQYI  MRD FRREGF  N     +ID  N +   V   +       S +
Sbjct: 366  NVIPDSLTTISQYIEFMRDSFRREGFNENGQPIGNIDHRNTQSAHVGGTQNQESQPDSSS 425

Query: 1618 AQVGLPTPASLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCG 1439
            AQ GLPT + LAE ++STRQLL++  G  LSQL+ QL D  +++DS  R N+Q  AM  G
Sbjct: 426  AQHGLPTASLLAETMYSTRQLLVDLAGPLLSQLSAQLGDLVNVSDSATRRNLQHGAMRHG 485

Query: 1438 VLMRNLGSMLLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNT 1259
            VL++NLGS+LLEL RTTM LR+   P+EAVVN+GPA FIS +GPN LMV  VPF P   +
Sbjct: 486  VLLQNLGSLLLELGRTTMMLRINPAPSEAVVNSGPAVFISPSGPNPLMVPPVPFFPGARS 545

Query: 1258 XXXXXXXXXXXGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPA 1079
                            S  +   PR  +I  RT  +VPV STN SE   +     +TD  
Sbjct: 546  VQMGPIFSSL-----SSQGSVLHPREADIHARTSGSVPVASTNPSEPVGAQQAQERTDRT 600

Query: 1078 RNSSGANVQQS---FSGG-----DSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYP 923
             N+S  NV+++    +GG      S VR++PLRT VVAVP+G+    S S+   V + YP
Sbjct: 601  GNASHTNVREASARVTGGAPFAVGSGVRLLPLRT-VVAVPAGIRRPPSGSSSGGVGVIYP 659

Query: 922  LLARVRQSSASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLD---------SNS 770
            ++ R++Q   ++ SDAR            +    Q   Q     E  +         SNS
Sbjct: 660  MVTRIQQRVNTSGSDARNGQTPNEPGRSDTHATVQPNPQPSQAHETGNPVFPVDVNVSNS 719

Query: 769  GDAFRGDSHFSANTLPFVSGLSPPPN--ESSAHQGFSAVFVSSQPSPPIDNSENITQANE 596
              A  G  +  +   P +S L        S++  G S   V+SQ   P+ ++E +   N 
Sbjct: 720  SQASPGQQNGQS---PLLSHLMDNFQWIGSASSVGNSRANVTSQ-HVPMSSAEQVDVTN- 774

Query: 595  DTIRGTESRDAARAGTEQEAF-FSNVLQRLIPLISQGREHGESSANGSSSQIEGENLSGS 419
               RG         G   E   F+N LQ+++P ISQ  E+   S  G  S I  +  SGS
Sbjct: 775  ---RGAPEVP----GVSNEGIRFANFLQQIMPFISQ-VENQPQSTPGDGSSIPSQVASGS 826

Query: 418  S-------------SFEHQRDPPEIPSPKRTKRNSD 350
            S             S +H RDP + P+ KR +R SD
Sbjct: 827  SNSARDEPSDSRRNSHDHNRDPVDGPNSKR-QRTSD 861


>gb|KJB64610.1| hypothetical protein B456_010G057100 [Gossypium raimondii]
          Length = 720

 Score =  398 bits (1022), Expect = e-107
 Identities = 287/734 (39%), Positives = 387/734 (52%), Gaps = 31/734 (4%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS  A KV        E S  T+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSTSAHKV--PSDTEMEGSEATIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-PQSATTSSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP P S+  S  H  ++ AS   GT
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPSSDGSPYHSANDPAS---GT 115

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNV-DQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXX 1937
            S G        S  A SFV E+ NV DQGD                              
Sbjct: 116  SRG-------HSNHAPSFVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANMASGNTGSD 168

Query: 1936 IREMISGRLARASSDSGLSNSTHPQ-----LLPQRDRRQADVQFSSTVHSGPPNPTVIPD 1772
             R+  S R  R S  SG+ +S+  Q     +  Q DR  +     + V  G   P VIPD
Sbjct: 169  ARDHGSQRQERTSGGSGMPDSSQAQTELASMTSQSDRAHSAFGLPAAVSLGSMQPPVIPD 228

Query: 1771 SLTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPA 1592
            SL TLSQY+S +R++F   G              TG R +      S+S     GLPTPA
Sbjct: 229  SLATLSQYLSHIRNEFDALGRAGGNDSQTAPMSRTGSRDSNS---ASNSGTVHEGLPTPA 285

Query: 1591 SLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSM 1412
            SLAE+L STRQ+L+EQ GE + QLARQL DQ ++TD  AR+  Q++A+  G L+ NLGS+
Sbjct: 286  SLAEVLLSTRQMLIEQAGESVQQLARQLEDQVNVTDPSARLIAQTNALRTGALLHNLGSL 345

Query: 1411 LLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXX 1232
            LLEL RTTMTLR+G+ P+EAVVNAGPA FIS +GPN LMVQA+PF P  +          
Sbjct: 346  LLELGRTTMTLRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAIPVGTVQ 405

Query: 1231 XXGLEGDSPDATFLPRNIEIRIRTGRAVPVVST-------NASEAADSXXXXXQTDPARN 1073
                  +     F+PR I+I+IR G ++   +T        + ++  S     +   ++ 
Sbjct: 406  PGSGLVNGLGTGFVPRRIDIQIRRGSSMATPNTREEHPPNQSGQSNQSMVSDSENRSSQT 465

Query: 1072 SSGANVQQSFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSSA 893
            +S  +   SF+ G+S VRVVP+RT V AVP+ +  L S+S+ +SV ++YPLL R++  + 
Sbjct: 466  TSRVSDTPSFA-GESGVRVVPIRTMVAAVPAPLGRLPSESSGNSVGVYYPLLGRLQNIAP 524

Query: 892  SNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSANTLPFVS 713
             + S  R            ++ +  R+ +S +Q ++ + ++ D    +++      P   
Sbjct: 525  GHVSGERGPQASGEHPSSGAQPELLRIPESAVQHQSSEESARDGSLPNANSRQQERPNTR 584

Query: 712  GLS--------PPPNESSAHQGFSAVFVSSQ---PSPPIDNSENITQAN-EDTIRG-TES 572
             ++           N+ S  Q  S V    +   P   I   E  +Q    D++RG  E+
Sbjct: 585  SVNISILAAGRTENNQDSERQSPSNVLQFLRTIFPGGEIQVEEASSQGTARDSVRGQAEA 644

Query: 571  RDAARAG----TEQEAFFSNVLQRLIPLISQGREHGESSANGSSSQIEGENLSGSSSFEH 404
             + A A     T Q  F SN+L +++P ISQ      S+   +++    +  S  +S   
Sbjct: 645  SNVAPAAETSITNQGVFLSNLLHQIMPYISQHAGSQRSTPEEATTSAPADLSSTGNSRRP 704

Query: 403  QRDPPEIPSPKRTK 362
                   P+ KR K
Sbjct: 705  NDTEQNPPNSKRQK 718


>ref|XP_008384827.1| PREDICTED: large proline-rich protein bag6-A isoform X2 [Malus
            domestica]
          Length = 725

 Score =  396 bits (1017), Expect = e-107
 Identities = 291/746 (39%), Positives = 378/746 (50%), Gaps = 42/746 (5%)
 Frame = -1

Query: 2470 MGSKDAAKVLMRGGDIAEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVL 2291
            MGS     + M   +  E S  T+EIKIKTLDSQTYTLRVDK +PVP LKEQIASVTGVL
Sbjct: 1    MGSNGTEHIPM--DEQVEGSEATIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 2290 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-PQSATTSSSHVGSEGASDHPGT 2114
            SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLV+RQP P S          EG  DHPGT
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVLRQPIPPSP---------EGLPDHPGT 109

Query: 2113 SSGIGIAHNRSSQVAHSFVFESVNVD-QGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXX 1937
            +     A + S  V    V E+ ++  QGD  +                           
Sbjct: 110  NP----ASSTSRHVGPGVVIETFSMPVQGDGFAPEVNRIISAVLGSIGMPNIAGGSEGIE 165

Query: 1936 IREMISGRLARASSDSGLSNSTHPQLLPQRDRRQADVQFSSTVHS-----GPPNPTVIPD 1772
            +RE  S R  R S   G+ + +         R  +D    + +HS     GP  P VIPD
Sbjct: 166  VREHGSQRPERTSGLGGMFDFSQFHSEQAATRGPSDRSNGTFIHSTSFPLGPHPPLVIPD 225

Query: 1771 SLTTLSQYISLMRDDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSHSPAAQVGLPTPA 1592
            SLTTL+QY+S MR +F   G  A       N      R  E     S S   Q GLPTPA
Sbjct: 226  SLTTLTQYLSHMRCEFEAIGIDAG-----SNQAAATHRTEESSNSSSRSGTRQEGLPTPA 280

Query: 1591 SLAEILFSTRQLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSM 1412
            SLAE++ STRQLL EQ GE L Q A QL +Q ++TD  AR++ Q+SA   G L  NLG+ 
Sbjct: 281  SLAEVMRSTRQLLAEQVGESLLQFASQLDNQVNVTDPAARLSTQASASRNGALFHNLGAF 340

Query: 1411 LLELARTTMTLRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXX 1232
            LLEL RTTMTL+MG+ P++AVVNAGPA FIS TGPN +MVQ +PF    N          
Sbjct: 341  LLELGRTTMTLQMGQAPSDAVVNAGPAVFISPTGPNPIMVQPLPFQSGTNFGAIPMGAVQ 400

Query: 1231 XXGLEGDSPDATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQ 1052
                 G      FLPR I+I+IR G +    +    E  ++     Q +PA +S G +  
Sbjct: 401  PGSGLGSGLGTGFLPRRIDIQIRRGSSATTPNATREENGETHQPSGQRNPATSSGGEDPT 460

Query: 1051 Q---------SFSGGDSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQS 899
                      S   GD AVR+VP+RT V  VP+ ++   SDS+  S  L+YPLL R +  
Sbjct: 461  NQATSRVPGGSAFAGDPAVRIVPMRTMVATVPASLSRQPSDSSGISGGLYYPLLGRFQHV 520

Query: 898  SASNASDAREXXXXXXXXXXXSEVDRQRLHQSPLQRENLDSNSGDAFRGDSHFSAN---- 731
            ++ N S  R               D+Q    +  Q+   DS++ D+ R  S  +      
Sbjct: 521  ASGNVSSERGTPASRDRHPANLHTDQQSSESAAEQQNAADSSAADSARDGSAPNVRRPSI 580

Query: 730  ----TLPFVSGLSPPPNESSAHQGFSAV-------FVSSQPSPPIDNSENITQAN-EDTI 587
                ++  +S    P ++ S  Q  S++       F + +      ++E +   +  D  
Sbjct: 581  SRSVSINILSAGGTPNSQDSERQVPSSILQFIRTLFPNGELHVEDGSAEGVIAGSVPDQA 640

Query: 586  R----GTESRDAARAGTEQEAFFSNVLQRLIPLISQ--GREHGESS----ANGSSSQIEG 437
            R    G  + +A    T++  F SN+L +++P ISQ  G E G SS    A  SS++ E 
Sbjct: 641  RTSSGGVAAXEAEPRATDEGIFLSNLLHQIMPFISQATGGEPGNSSEHRMAQDSSTRAEX 700

Query: 436  ENLSGSSSFEHQRDPPEIPSPKRTKR 359
             N+  S    HQR   E P P  +KR
Sbjct: 701  SNVGSS----HQRSDSE-PDPPTSKR 721


>ref|XP_004955550.1| PREDICTED: large proline-rich protein bag6-A isoform X1 [Setaria
            italica] gi|835935657|ref|XP_012699523.1| PREDICTED:
            large proline-rich protein bag6-A isoform X1 [Setaria
            italica]
          Length = 687

 Score =  395 bits (1015), Expect = e-107
 Identities = 271/719 (37%), Positives = 381/719 (52%), Gaps = 32/719 (4%)
 Frame = -1

Query: 2422 AEDSATTVEIKIKTLDSQTYTLRVDKCVPVPVLKEQIASVTGVLSEQQRLICRGKVLKDD 2243
            AEDS TT+EIKIKTLDSQTY LRV+KCVPVP+LKE+IA+VTG+LSEQQRLICRG+VLKDD
Sbjct: 13   AEDSETTIEIKIKTLDSQTYNLRVNKCVPVPLLKEKIATVTGILSEQQRLICRGRVLKDD 72

Query: 2242 QLLSAYHVEDGHTLHLVVRQPPQSATTSSSHVGSEGASDHPGTSSGIGIAHNRSSQVAHS 2063
            +LLSAYHVEDGHTLHLVVRQP QSA + S+  G+E  + + G          R   VA S
Sbjct: 73   ELLSAYHVEDGHTLHLVVRQPGQSAPSGSA--GTEANASNSG--------RRRGPTVARS 122

Query: 2062 FVFESVNVDQGDIDSSVXXXXXXXXXXXXXXXXXXXXXXXXXIREMISGRLARASSDSGL 1883
             V E+VNVD G  +                              + + G ++  SS +  
Sbjct: 123  VVLEAVNVDPGSSELPAFVAQIL---------------------QSVLGSISAQSSGAPA 161

Query: 1882 SNSTHPQLLPQR----------DRRQADVQFSSTVHSGPPNPTVIPDSLTTLSQYISLMR 1733
            S+ T P    Q           D++Q  + F S    G   P VIPD+LTT+SQYI  MR
Sbjct: 162  SSDTRPSEPTQSSIPNTVRVELDQQQTPLLFQSEPAHGSSQPNVIPDALTTMSQYIDFMR 221

Query: 1732 DDFRREGFGANVSEHIDNAEFTGVRFNEGWTHDSH---SPAAQVGLPTPASLAEILFSTR 1562
            D FRREGF  N     +    T    + G T +        + +GL T + LAE + STR
Sbjct: 222  DSFRREGFNHNGQAEGNVENRTAGSTSVGGTQNQEIQPESTSTLGLHTASLLAETMHSTR 281

Query: 1561 QLLMEQTGECLSQLARQLADQSSITDSLARMNVQSSAMSCGVLMRNLGSMLLELARTTMT 1382
            Q+++EQ G  LSQL+ QL D  ++TD   R ++QSSA   G L++NLGS+LLEL RTTM 
Sbjct: 282  QIIVEQAGALLSQLSAQLGDLQNVTDPATRRDLQSSAFRSGSLLQNLGSLLLELGRTTML 341

Query: 1381 LRMGRIPAEAVVNAGPANFISATGPNALMVQAVPFHPAPNTXXXXXXXXXXXGLEGDSPD 1202
            LR+    +EAVVN+GPA +IS +GPN LMVQ VPF P  +                 S  
Sbjct: 342  LRINPASSEAVVNSGPALYISPSGPNPLMVQPVPFFPGRSVQMGTLFSGL------GSQG 395

Query: 1201 ATFLPRNIEIRIRTGRAVPVVSTNASEAADSXXXXXQTDPARNSSGANVQQSFSGG---- 1034
            +   PR+++I +RTG +VPV STN SE A +     +T  A +++        +GG    
Sbjct: 396  SVLHPRDVDIHVRTGGSVPVASTNPSETAGA--QANRTGEASHANIGEASAGVAGGTPFS 453

Query: 1033 -DSAVRVVPLRTAVVAVPSGVNPLQSDSTVSSVRLFYPLLARVRQSSASNASDAREXXXX 857
             +S VR++PLRT VVA+P+G++   S S+   V + YPL+ RVRQ + +  +D R     
Sbjct: 454  VESGVRLLPLRT-VVAMPAGISRAPSGSSSGGVGIIYPLITRVRQRANTIGNDERNGQSP 512

Query: 856  XXXXXXXSEVDRQRLHQSPLQRE--NLDSNSGDAFRGDSHFSANTLPFVSGLSPPPN-ES 686
                   +  ++Q + QS    E  NL+S +       S  S      +  LS   +   
Sbjct: 513  NEPARSSTHPNQQSIPQSSQAHEAGNLESVADVNVGNGSETSPGQQNGLVTLSQIMDLLG 572

Query: 685  SAHQGFSAVFVSSQPSPPIDNSENITQANEDTIRGTESRDAARAGTEQEAFFSNVLQRLI 506
            S   G +    SS    P  ++E +   N  T +           +E+   F++++++++
Sbjct: 573  SMLPGENVRGNSSSQQAPTSSAEQVDGRNSATTQ-------VSGASEEALHFASMVRQIM 625

Query: 505  PLISQGREHGES-----------SANGSSSQIEGENLSGSSSFEHQRDPPEIPSPKRTK 362
            P ISQ     +S           +A+GS+++   +    +SS +H RD  + P+ KR +
Sbjct: 626  PFISQVETQNQSAPPDTSSTHAQAASGSANRARDDPRDSTSSHQHNRDQIDEPNSKRQR 684

