BLASTX nr result

ID: Cocculus22_contig00003373

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00003373
         (1146 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          472   e-130
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   470   e-130
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   469   e-130
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   469   e-130
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   468   e-129
ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...   468   e-129
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   465   e-128
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   464   e-128
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   462   e-128
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   461   e-127
ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria...   461   e-127
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   461   e-127
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   461   e-127
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  460   e-127
ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   458   e-126
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...   456   e-126
ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [O...   454   e-125
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   452   e-125
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...   452   e-125
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   450   e-124

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  472 bits (1214), Expect = e-130
 Identities = 215/320 (67%), Positives = 249/320 (77%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRK GAVT VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSEQEL+DCD+SYN+G
Sbjct: 121  SVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNG 180

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GG+MDYAF+FV+ NHGIDTEEDYPYQ  DRSCN+ KLKR VVTIDGY D+P +NE+EL
Sbjct: 181  CEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKEL 240

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            LKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGSENGVDYWI+KNSW
Sbjct: 241  LKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 300

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G  WGMDGYMHMQRN+G  +G+CGINMLA                  TRC L ++CG GE
Sbjct: 301  GSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGE 360

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC   + G +CLSWKCCELDSAVCCKD  +CCP +YP+CDT    CL+  GN + ++ 
Sbjct: 361  TCCCVHHIFG-ICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEK 419

Query: 242  FEKKXXXXXXXXXXXLFEAW 183
            F K            L E W
Sbjct: 420  FAKNSSSGKFRSWSSLLEGW 439


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  470 bits (1210), Expect = e-130
 Identities = 211/304 (69%), Positives = 250/304 (82%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQELIDCD+SYN+G
Sbjct: 128  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 187

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK+RVVTID Y  + S+NE+ L
Sbjct: 188  CNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKAL 247

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            ++AVASQPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSW
Sbjct: 248  MEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 307

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            GKSWGMDG+MHMQRN G+ +GVCGINMLA                  T+C+L +YC +GE
Sbjct: 308  GKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 367

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC   L GL C SWKCCEL+SAVCCKD  +CCP +YP+CDT    CL+  GN++ +KP
Sbjct: 368  TCCCARTLFGL-CFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKP 426

Query: 242  FEKK 231
            F KK
Sbjct: 427  FWKK 430


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  469 bits (1208), Expect = e-130
 Identities = 216/321 (67%), Positives = 249/321 (77%), Gaps = 1/321 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQELIDCDRSYNSG
Sbjct: 120  SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA++FV+KNHGIDTE+DYPY+     CN+ KL R +VTIDGY D+P +NE++L
Sbjct: 180  CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+IVGY SENGVDYWI+KNSW
Sbjct: 240  LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM+GYMHMQRN G+  G+CGINMLA                  TRCSL++YC AGE
Sbjct: 300  GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGE 359

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL-RAAGNYSMVK 246
            TCCCG  +LG +CLSWKCC   SAVCC DH YCCP NYPICD+   QCL R  GN +  +
Sbjct: 360  TCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAE 418

Query: 245  PFEKKXXXXXXXXXXXLFEAW 183
              E +             + W
Sbjct: 419  AIEMRGSSWKFGSWSSFIDVW 439


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  469 bits (1207), Expect = e-130
 Identities = 215/321 (66%), Positives = 249/321 (77%), Gaps = 1/321 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQELIDCDRSYNSG
Sbjct: 120  SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA++FV+KNHGIDTE+DYPY+     CN+ KL R +VTIDGY D+P +NE++L
Sbjct: 180  CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+I+GY SENGVDYWI+KNSW
Sbjct: 240  LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSW 299

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM+GYMHMQRN G+  G+CGINMLA                  TRCSL++YC  GE
Sbjct: 300  GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGE 359

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL-RAAGNYSMVK 246
            TCCCG  +LG +CLSWKCC   SAVCC DH YCCP NYPICD+   QCL R  GN +  +
Sbjct: 360  TCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAE 418

Query: 245  PFEKKXXXXXXXXXXXLFEAW 183
              E +             +AW
Sbjct: 419  AIEMRGSSWKFGSWSSFIDAW 439


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  468 bits (1205), Expect = e-129
 Identities = 208/304 (68%), Positives = 249/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            SIDWR KG VT VKDQ SCGACW+FS TGAIEGIN+IVTGSLVSLSEQELI+CD+SYN G
Sbjct: 117  SIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDG 176

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYAF+FV+ NHGIDTEEDYPY+A D +CN++++KRRVVTID Y D+P +NE++L
Sbjct: 177  CGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQL 236

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA+QPVSVG+CGSER FQ+YS GIF+GPC+TSLDHAV+IVGYGSENGVDYWI+KNSW
Sbjct: 237  LQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 296

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G  WGM GYMHMQRN+G+ QGVCGINMLA                  T+C+L++YC AGE
Sbjct: 297  GTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGE 356

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC  +  G +C+SWKCC LDSAVCCKD ++CCPH+YP+CDT    C + AGN + ++ 
Sbjct: 357  TCCCARKFFG-ICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEA 415

Query: 242  FEKK 231
             E K
Sbjct: 416  IEGK 419


>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
            gi|194706024|gb|ACF87096.1| unknown [Zea mays]
            gi|413945958|gb|AFW78607.1| hypothetical protein
            ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  468 bits (1205), Expect = e-129
 Identities = 211/304 (69%), Positives = 247/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWRK GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSLVSLSEQELIDCDRSYNSG
Sbjct: 140  ALDWRKSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSG 199

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA+KFV+KN GIDTEEDYPY+  D +CN+NKLK+RVVTIDGYTD+PS+ E+ L
Sbjct: 200  CGGGLMDYAYKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLL 259

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGS R FQLY  GIF GPC TSLDHAV+IVGYGSE G DYWI+KNSW
Sbjct: 260  LQAVAQQPVSVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSW 319

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM GYMHM RN GD +GVCGINM+A                  T+CSL++YC  G 
Sbjct: 320  GESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKTSPNPPPSPGPGPTKCSLLTYCPEGS 379

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC +R+LG  CLSW CCELD+AVCCKD+ YCCPH+YP+CDT   QCL+A+GN+S ++ 
Sbjct: 380  TCCCSWRVLGF-CLSWSCCELDNAVCCKDNRYCCPHDYPVCDTGRGQCLKASGNFSAIEG 438

Query: 242  FEKK 231
              +K
Sbjct: 439  IRRK 442


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  465 bits (1196), Expect = e-128
 Identities = 207/304 (68%), Positives = 249/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQELIDCD+SYN+G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID Y  + S++E+ L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            ++AVA+QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSW
Sbjct: 241  MEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            GKSWGMDG+MHMQRN  +  GVCGINMLA                  T+C+L +YC +GE
Sbjct: 301  GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC   L GL C SWKCCE++SAVCCKD  +CCPH+YP+CDT    CL+  GN++ +KP
Sbjct: 361  TCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKP 419

Query: 242  FEKK 231
            F KK
Sbjct: 420  FWKK 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  464 bits (1194), Expect = e-128
 Identities = 207/304 (68%), Positives = 248/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQELIDCD+SYN+G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID Y  + S++E+ L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            ++AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSW
Sbjct: 241  MEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            GKSWGMDG+MHMQRN  +  GVCGINMLA                  T+C+L +YC +GE
Sbjct: 301  GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC   L GL C SWKCCE++SAVCCKD  +CCPH+YP+CDT    CL+  GN++ +KP
Sbjct: 361  TCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKP 419

Query: 242  FEKK 231
            F KK
Sbjct: 420  FWKK 423


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  462 bits (1190), Expect = e-128
 Identities = 213/304 (70%), Positives = 247/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQI+TGSL+SLSEQELIDCDRSYNSG
Sbjct: 117  SLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSG 176

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA++FV+ NHGIDTE DYPYQA D SC ++KL+R VVTIDGY DIPS++E +L
Sbjct: 177  CGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKL 236

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGSENGVDYWI+KNSW
Sbjct: 237  LQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 296

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            GKSWGMDGYMHMQRN+G+ +GVCGIN LA                  T+CS+++ C AGE
Sbjct: 297  GKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGE 356

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC  + LGL CLSWKCC L SAVCCKD  +CCP +YPICDT    CL+   N +  + 
Sbjct: 357  TCCCAKKFLGL-CLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEI 415

Query: 242  FEKK 231
             E +
Sbjct: 416  LENR 419


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  461 bits (1187), Expect = e-127
 Identities = 211/324 (65%), Positives = 253/324 (78%), Gaps = 2/324 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQELIDCD+SYN+G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID Y  + S++E+ L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 782  LKAVASQPVSVGLCGSERGFQLYS--TGIFSGPCATSLDHAVVIVGYGSENGVDYWILKN 609
             +AVA+QPVSVG+CGSER FQLYS  +GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KN
Sbjct: 241  REAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKN 300

Query: 608  SWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGA 429
            SWGKSWGMDG+MHMQRN G+ +G+CGINMLA                  T+C+L +YC A
Sbjct: 301  SWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSA 360

Query: 428  GETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMV 249
            GETCCC   L GL C SWKCCE++SAVCC D  +CCPH+YP+CDT    CL+  GN++ +
Sbjct: 361  GETCCCARNLFGL-CFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 419

Query: 248  KPFEKKXXXXXXXXXXXLFEAWNM 177
            KPF KK            FE W M
Sbjct: 420  KPFWKK----DSSNKLGRFEGWVM 439


>ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica]
          Length = 454

 Score =  461 bits (1186), Expect = e-127
 Identities = 209/304 (68%), Positives = 244/304 (80%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWRKKGAVT VK+Q SCGACW+FS TGAIEGIN+I TGSLVSLSEQELIDCDRSYN+G
Sbjct: 134  AVDWRKKGAVTKVKNQGSCGACWSFSATGAIEGINKIKTGSLVSLSEQELIDCDRSYNNG 193

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYAFKFV+KN GIDTE+DYPY+  D +CN+NKLKRRVVTIDGY+D+PS+ E  L
Sbjct: 194  CGGGLMDYAFKFVIKNGGIDTEDDYPYRQADGTCNKNKLKRRVVTIDGYSDVPSNKENLL 253

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHAV+IVGYGSE G DYWI+KNSW
Sbjct: 254  LQAVAQQPVSVGICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSW 313

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+ WGM GYMHM RN G   G+CGINM+                   T+C+L++YC  G 
Sbjct: 314  GERWGMKGYMHMHRNTGASSGICGINMMPSFPTKTSPNPPPSPGPGPTKCNLLTYCPEGS 373

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC +R+LGL CLSW CC LD+A+CCKD+ YCCPH+YPICDT   QCLRA GN+S ++ 
Sbjct: 374  TCCCSWRVLGL-CLSWSCCGLDNAICCKDNRYCCPHDYPICDTVRAQCLRANGNFSGIEG 432

Query: 242  FEKK 231
             +KK
Sbjct: 433  IKKK 436


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  461 bits (1185), Expect = e-127
 Identities = 207/320 (64%), Positives = 250/320 (78%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWR KGAVT VKDQ SCGACW+FS TGAIEGIN+IVTG+LVSLSEQEL+DCDRSYNSG
Sbjct: 118  SMDWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSG 177

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYA++FV+ NHGID EEDYPY   +++CN+ K KRRVVTIDGY  +P++NE+ L
Sbjct: 178  CEGGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLL 237

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGSER FQLYS GIF+GPC++SLDHAV+IVGYGSENGVDYWI+KNSW
Sbjct: 238  LQAVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSW 297

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G  WGM+GY+HM RN+GD +G+CGINMLA                  T+C L +YC AGE
Sbjct: 298  GTRWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGE 357

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC  R+ G +C SWKCCELDSAVCCKD+ +CCP++YP+CDT+  QCL+  GN + ++ 
Sbjct: 358  TCCCTHRIFG-ICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEA 416

Query: 242  FEKKXXXXXXXXXXXLFEAW 183
            FEK+             E W
Sbjct: 417  FEKRHSTRKFSSWRPFVENW 436


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  461 bits (1185), Expect = e-127
 Identities = 206/292 (70%), Positives = 246/292 (84%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            SIDWRKKGAV+ VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSEQEL+DCDRSYN+G
Sbjct: 122  SIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNG 181

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYA++FV++N+GIDTEEDYPYQA +++CN+ KLKR VVTIDGYTD+P +NE+EL
Sbjct: 182  CEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKEL 241

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            LKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGSENGVDYWI+KNSW
Sbjct: 242  LKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 301

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G  WG++GYM+M RN+G+ QG+CGINMLA                  T+C L + CG GE
Sbjct: 302  GTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGE 361

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAA 267
            TCCC  R+ GL C SWKCCELDSAVCCKD ++CCPH+YP+CDT+   CL+ +
Sbjct: 362  TCCCTRRIFGL-CFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLKVS 412


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  460 bits (1183), Expect = e-127
 Identities = 211/290 (72%), Positives = 238/290 (82%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSLVSLSEQELIDCD SYN+G
Sbjct: 118  SLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAG 177

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYA++FV+ NHGIDTEEDYPYQA D+SC + KLKRRVVTIDGYTD+  +N  +L
Sbjct: 178  CDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQL 237

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AV +QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGY SENGVDYWI+KNSW
Sbjct: 238  LQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSW 297

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            GK WGMDGY+HMQRN G+ QGVCGINMLA                  TRCS  + CG GE
Sbjct: 298  GKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGE 357

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 273
            TCCC +R LGL C SWKCC L+SAVCCKD I+CCP +YP+CDTQ   CL+
Sbjct: 358  TCCCSWRFLGL-CFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
            gi|548841210|gb|ERN01273.1| hypothetical protein
            AMTR_s00002p00249780 [Amborella trichopoda]
          Length = 475

 Score =  458 bits (1178), Expect = e-126
 Identities = 204/304 (67%), Positives = 249/304 (81%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWR KGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSL+SLSEQE+IDCD +YNSG
Sbjct: 161  SLDWRDKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLISLSEQEIIDCDTTYNSG 220

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYAFK+V KNHGIDTE+DYPY+ +  SC ++K +R VVTIDG+TDIPS++E+ +
Sbjct: 221  CGGGLMDYAFKWVTKNHGIDTEKDYPYREVQGSCIKDKAERHVVTIDGHTDIPSNSEDLI 280

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSW
Sbjct: 281  LQAVAKQPVSVGICGSERSFQLYSSGIFSGPCSTSLDHAVLIVGYGSKNGVDYWIVKNSW 340

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G SWGMDGYMHM RN+GD QGVCGINM+                    +CSL++YC +G 
Sbjct: 341  GTSWGMDGYMHMLRNSGDSQGVCGINMMPSYPTKSGANPPPSPPPGPVKCSLLTYCPSGN 400

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKP 243
            TCCC +R LG +CLSW CC+LD+AVCCKD  YCCP +YP+C+T T  CL+ +GN++ +  
Sbjct: 401  TCCCTWRFLG-ICLSWSCCDLDNAVCCKDGQYCCPQDYPVCNTATGYCLKGSGNWTEMDG 459

Query: 242  FEKK 231
             +++
Sbjct: 460  LKRR 463


>ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
            gi|241945324|gb|EES18469.1| hypothetical protein
            SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  456 bits (1173), Expect = e-126
 Identities = 208/305 (68%), Positives = 245/305 (80%), Gaps = 1/305 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWR+ GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSLVSLSEQELIDCDRSYNSG
Sbjct: 142  ALDWRENGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSG 201

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA+KFVVKN GIDTEEDYPY+  D +CN+NKLK+R+VTIDGY+D+PS+ E+ L
Sbjct: 202  CGGGLMDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLL 261

Query: 782  LKAVASQPVSVGLCGSERGFQLYS-TGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNS 606
            L+AVA QPVSVG+CGS R FQLYS  GIF GPC TSLDHAV+IVGYGSE G DYWI+KNS
Sbjct: 262  LQAVAQQPVSVGICGSARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNS 321

Query: 605  WGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAG 426
            WG+SWGM GYMHM RN GD +GVCGINM+A                  T+CSL++YC  G
Sbjct: 322  WGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKSSPNPPPSPGPGPTKCSLLTYCPEG 381

Query: 425  ETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVK 246
             TCCC +R+LG  CLSW CCELD+AVCCKD+  CCPH+YP+CDT    CL+A+GN S ++
Sbjct: 382  STCCCSWRILGF-CLSWSCCELDNAVCCKDNKSCCPHDYPVCDTDRGLCLKASGNSSAIE 440

Query: 245  PFEKK 231
               +K
Sbjct: 441  GIRRK 445


>ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [Oryza brachyantha]
          Length = 377

 Score =  454 bits (1167), Expect = e-125
 Identities = 205/305 (67%), Positives = 245/305 (80%), Gaps = 1/305 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWR+ G VT VKDQ SCGACW+FS TGA+EGIN+I TGSL+SLSEQELIDCDRSYN+G
Sbjct: 56   ALDWRQSGVVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNTG 115

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA+KFVVKN GIDTEEDYPY+  D +CN+NKLKRRVVTIDGY D+P++NE+ L
Sbjct: 116  CGGGLMDYAYKFVVKNGGIDTEEDYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDLL 175

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHAV+IVGYGSE G DYWI+KNSW
Sbjct: 176  LQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSW 235

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM GYMHM RN G+  G+CGIN +                   T+CSL++YC  G 
Sbjct: 236  GESWGMKGYMHMHRNTGNSYGICGINQMPSFPTKTSPNPPPSPGPGPTKCSLLTYCPEGS 295

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRA-AGNYSMVK 246
            TCCC +R+LGL CLSW CCELDSA CCKD+ YCCPH+YPICDT +++C +A  GN+S+++
Sbjct: 296  TCCCSWRVLGL-CLSWSCCELDSATCCKDNRYCCPHDYPICDTASRRCFKANNGNFSVME 354

Query: 245  PFEKK 231
               +K
Sbjct: 355  GGSRK 359


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  452 bits (1164), Expect = e-125
 Identities = 204/305 (66%), Positives = 246/305 (80%), Gaps = 1/305 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWR+ GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSL+SLSEQELIDCDRSYNSG
Sbjct: 128  AVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNSG 187

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA+KFVVKN GIDTE DYPY+  D +CN+NKLKRRVVTIDGY D+P++NE+ L
Sbjct: 188  CGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDML 247

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHA++IVGYGSE G DYWI+KNSW
Sbjct: 248  LQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSW 307

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM GYM+M RN G+  GVCGIN +                   T+CSL++YC  G 
Sbjct: 308  GESWGMKGYMYMHRNTGNSNGVCGINQMPSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGS 367

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRA-AGNYSMVK 246
            TCCC +R+LGL CLSW CCELD+AVCCKD+ YCCPH+YP+CDT +++C +A  GN+S+++
Sbjct: 368  TCCCSWRVLGL-CLSWSCCELDNAVCCKDNRYCCPHDYPVCDTASQRCFKANNGNFSVME 426

Query: 245  PFEKK 231
               +K
Sbjct: 427  GGSRK 431


>ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1|
            hypothetical protein [Oryza sativa Japonica Group]
            gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa
            Japonica Group]
          Length = 450

 Score =  452 bits (1164), Expect = e-125
 Identities = 204/305 (66%), Positives = 246/305 (80%), Gaps = 1/305 (0%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            ++DWR+ GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSL+SLSEQELIDCDRSYNSG
Sbjct: 129  AVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNSG 188

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            CGGGLMDYA+KFVVKN GIDTE DYPY+  D +CN+NKLKRRVVTIDGY D+P++NE+ L
Sbjct: 189  CGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDML 248

Query: 782  LKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSW 603
            L+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHA++IVGYGSE G DYWI+KNSW
Sbjct: 249  LQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSW 308

Query: 602  GKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGE 423
            G+SWGM GYM+M RN G+  GVCGIN +                   T+CSL++YC  G 
Sbjct: 309  GESWGMKGYMYMHRNTGNSNGVCGINQMPSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGS 368

Query: 422  TCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRA-AGNYSMVK 246
            TCCC +R+LGL CLSW CCELD+AVCCKD+ YCCPH+YP+CDT +++C +A  GN+S+++
Sbjct: 369  TCCCSWRVLGL-CLSWSCCELDNAVCCKDNRYCCPHDYPVCDTASQRCFKANNGNFSVME 427

Query: 245  PFEKK 231
               +K
Sbjct: 428  GGSRK 432


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  450 bits (1158), Expect = e-124
 Identities = 209/332 (62%), Positives = 248/332 (74%), Gaps = 28/332 (8%)
 Frame = -2

Query: 1142 SIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELIDCDRSYNSG 963
            S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQELIDCD+SYN G
Sbjct: 123  SLDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDG 182

Query: 962  CGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTDIPSSNEEEL 783
            C GGLMDYAF+FV+KN GIDTE+DYPYQ  D +C ++KLK+RVV+ID Y  +  S+E+ L
Sbjct: 183  CNGGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKAL 242

Query: 782  LKAVASQPVSVGLCGSERGFQLYST----------------------------GIFSGPC 687
            L+AVA+QPVSVG+CGSER FQLYS+                            GIFSGPC
Sbjct: 243  LEAVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPC 302

Query: 686  ATSLDHAVVIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXX 507
            +TSLDHAV+IVGYGS+NGVDYWI+KNSWGKSWGMDG+MHMQRN G+ QG+CGINMLA   
Sbjct: 303  STSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYP 362

Query: 506  XXXXXXXXXXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIY 327
                           T+C+L +YC A ETCCC   L GL CLSWKCCE++SAVCCKD  +
Sbjct: 363  IKTHPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGL-CLSWKCCEIESAVCCKDGRH 421

Query: 326  CCPHNYPICDTQTKQCLRAAGNYSMVKPFEKK 231
            CCPH+YP+CDT    CL+  GN++ +KPF KK
Sbjct: 422  CCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 453

