BLASTX nr result

ID: Cnidium21_contig00004565

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00004565
         (3328 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   840   0.0  
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   839   0.0  
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         838   0.0  
gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]             831   0.0  
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   828   0.0  

>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  840 bits (2170), Expect = 0.0
 Identities = 410/732 (56%), Positives = 528/732 (72%), Gaps = 2/732 (0%)
 Frame = -3

Query: 3326 PQHNGVAERKNRTILDMARSMLKTKNMPNSFWAESVACSVYILNRSPTKAVPKSTPGEAW 3147
            PQ NGVAERKNRTIL+MARSMLK+K +P   WAE+VAC+VY+LNRSPTK+V   TP EAW
Sbjct: 625  PQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAW 684

Query: 3146 SGHKPSVKHLKVFGAVAYAHIPAETRTKLDDRAQKMVFIGYKQG--GYKLFDPVTKKVNV 2973
            SG K  V HL+VFG++A+AH+P E R+KLDD+++K +FIGY     GYKL++P TKK  +
Sbjct: 685  SGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTII 744

Query: 2972 SRDVTFAEDEAWNWSENSPEDQKKYVLLEEESEEAIPNEVPTTSHSNLPIPPAADSQRPQ 2793
            SR++ F E+  W+W+ N  ED   +   EE+  E    E P+   +  P  P +      
Sbjct: 745  SRNIVFDEEGEWDWNSNE-EDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEES 803

Query: 2792 RQRKTPERYQGYEVSLDHNVNDDGELIHHAFIADLEPVTLQEAFENPKWEMAMHEELNAI 2613
               +TP R++  +   +   N +   +   F A+ EP+  QEA E   W  AM EE+ +I
Sbjct: 804  SSERTP-RFRSIQELYEVTENQENLTLFCLF-AECEPMDFQEAIEKKTWRNAMDEEIKSI 861

Query: 2612 EKNDTWALTDLPQGHKAIDVKWVFKTKLGADGTVEKYKARLVAKGFEQREGYDYQEVFAP 2433
            +KNDTW LT LP GHK I VKWV+K K  + G VE+YKARLVAKG+ QR G DY EVFAP
Sbjct: 862  QKNDTWELTSLPNGHKTIGVKWVYKAKKNSKGEVERYKARLVAKGYIQRAGIDYDEVFAP 921

Query: 2432 VARMETIRIMVSMAAQNHWKLHQMDVKSAFLNGPLEEMVFVKQPPGFVKEGSEQKVYRLI 2253
            VAR+ET+R+++S+AAQN WK+HQMDVKSAFLNG LEE V+++QP G++ +G E KV RL 
Sbjct: 922  VARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLK 981

Query: 2252 KALYGLKQSPRAWNKRIDAFFIEAGFKRCPSDHGLYVKAGDGGDILMVCLYVDDLVFTSN 2073
            KALYGLKQ+PRAWN RID +F E  F +CP +H LY+K     DIL+ CLYVDDL+FT N
Sbjct: 982  KALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKI-QKEDILIACLYVDDLIFTGN 1040

Query: 2072 NAGLVESFKKSIMKEFEMSDLGLLSYFLGVEVVQRSDGIFICQRKYVADILKKFKMDMCN 1893
            N  + E FKK + KEFEM+D+GL+SY+LG+EV Q  +GIFI Q  Y  ++LKKFKMD  N
Sbjct: 1041 NPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDSN 1100

Query: 1892 PTKTPVEVGTKLTKKGEGPLVDPTLFKQIVGSLRYLTCTRPDISYGVGLISRFMESPNQS 1713
            P  TP+E G KL+KK EG  VDPT FK +VGSLRYLTCTRPDI Y VG++SR+ME P  +
Sbjct: 1101 PVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1160

Query: 1712 HMKVARRIMRYLKGTQDFGLFYDSTDNCALVGYSDSDWAGDLDDRKSTTGSCFTLGSAAC 1533
            H K A+RI+RY+KGT +FGL Y +T +  LVGYSDSDW GD+DDRKST+G  F +G  A 
Sbjct: 1161 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGDTAF 1220

Query: 1532 SWVSRKQPTVALSTCEAEYMAASTSACQALWLASLLSEMHIPLKGNLKIYVDNKSAINLA 1353
            +W+S+KQP V LSTCEAEY+AA++  C A+WL +LL E+ +P +   KI+VDNKSAI LA
Sbjct: 1221 TWMSKKQPIVVLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIALA 1280

Query: 1352 KNPVAHGRSKHIDIKWHFLRELVEQKKIDLVFCKSENQVADIMTKPLKLDAFVKLRSRLG 1173
            KNPV H RSKHID ++H++RE V +K + L + K+ +QVADI TKPLK + F+K+RS LG
Sbjct: 1281 KNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKREDFIKMRSLLG 1340

Query: 1172 ICPIESVLRGNV 1137
            +   +S LRG V
Sbjct: 1341 VA--KSSLRGGV 1350


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  839 bits (2168), Expect = 0.0
 Identities = 408/732 (55%), Positives = 527/732 (71%), Gaps = 2/732 (0%)
 Frame = -3

Query: 3326 PQHNGVAERKNRTILDMARSMLKTKNMPNSFWAESVACSVYILNRSPTKAVPKSTPGEAW 3147
            PQ NGV ERKNRTIL+MARSMLK+K +P   WAE+VAC+VY+LNRSPTK+V   TP EAW
Sbjct: 625  PQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAW 684

Query: 3146 SGHKPSVKHLKVFGAVAYAHIPAETRTKLDDRAQKMVFIGYKQG--GYKLFDPVTKKVNV 2973
            SG KP V HL+VFG++A+AH+P E R+KLDD+++K +FIGY     GYKL++P TKK  +
Sbjct: 685  SGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTII 744

Query: 2972 SRDVTFAEDEAWNWSENSPEDQKKYVLLEEESEEAIPNEVPTTSHSNLPIPPAADSQRPQ 2793
            SR++ F E+  W+W+ N  ED   +   EE+  E    E P+   +  P  P +      
Sbjct: 745  SRNIVFDEEGEWDWNSNE-EDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEES 803

Query: 2792 RQRKTPERYQGYEVSLDHNVNDDGELIHHAFIADLEPVTLQEAFENPKWEMAMHEELNAI 2613
               +TP R++  +   +   N +   +   F A+ EP+  Q+A E   W  AM EE+ +I
Sbjct: 804  SSERTP-RFRSIQELYEVTENQENLTLFCLF-AECEPMDFQKAIEKKTWRNAMDEEIKSI 861

Query: 2612 EKNDTWALTDLPQGHKAIDVKWVFKTKLGADGTVEKYKARLVAKGFEQREGYDYQEVFAP 2433
            +KNDTW LT LP GHKAI VKWV+K K  + G VE+YKARLVAKG+ QR G DY EVFAP
Sbjct: 862  QKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGIDYDEVFAP 921

Query: 2432 VARMETIRIMVSMAAQNHWKLHQMDVKSAFLNGPLEEMVFVKQPPGFVKEGSEQKVYRLI 2253
            VAR+ET+R+++S+AAQN WK+HQMDVKSAFLNG LEE V+++QP G++ +G E KV RL 
Sbjct: 922  VARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLK 981

Query: 2252 KALYGLKQSPRAWNKRIDAFFIEAGFKRCPSDHGLYVKAGDGGDILMVCLYVDDLVFTSN 2073
            K LYGLKQ+PRAWN RID +F E  F +CP +H LY+K     DIL+ CLYVDDL+FT N
Sbjct: 982  KVLYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKI-QKEDILIACLYVDDLIFTGN 1040

Query: 2072 NAGLVESFKKSIMKEFEMSDLGLLSYFLGVEVVQRSDGIFICQRKYVADILKKFKMDMCN 1893
            N  + E FKK + KEFEM+D+GL+SY+LG+EV Q  +GIFI Q  Y  ++LKKFKMD  N
Sbjct: 1041 NPSIFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDSN 1100

Query: 1892 PTKTPVEVGTKLTKKGEGPLVDPTLFKQIVGSLRYLTCTRPDISYGVGLISRFMESPNQS 1713
            P  TP+E G KL+KK EG  VDPT FK +VGSLRYLTCTRPDI Y VG++SR+ME P  +
Sbjct: 1101 PVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1160

Query: 1712 HMKVARRIMRYLKGTQDFGLFYDSTDNCALVGYSDSDWAGDLDDRKSTTGSCFTLGSAAC 1533
            H K A+RI+RY+KGT +FGL Y +T +  LVGYSDSDW GD+DDRKST+G  F +G  A 
Sbjct: 1161 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGDTAF 1220

Query: 1532 SWVSRKQPTVALSTCEAEYMAASTSACQALWLASLLSEMHIPLKGNLKIYVDNKSAINLA 1353
            +W+S+KQP V LSTCEAEY+AA++  C A+WL +LL E+ +P +   KI+VDNKSAI LA
Sbjct: 1221 TWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIALA 1280

Query: 1352 KNPVAHGRSKHIDIKWHFLRELVEQKKIDLVFCKSENQVADIMTKPLKLDAFVKLRSRLG 1173
            KNPV H RSKHID ++H++RE V +K + L + K+ +QVAD  TKPLK + F+K+RS LG
Sbjct: 1281 KNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTKPLKRENFIKMRSLLG 1340

Query: 1172 ICPIESVLRGNV 1137
            +   +S LRG V
Sbjct: 1341 VA--KSSLRGGV 1350


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  838 bits (2164), Expect = 0.0
 Identities = 407/732 (55%), Positives = 527/732 (71%), Gaps = 2/732 (0%)
 Frame = -3

Query: 3326 PQHNGVAERKNRTILDMARSMLKTKNMPNSFWAESVACSVYILNRSPTKAVPKSTPGEAW 3147
            PQ NGV ERKNRTIL+MARSMLK+K +P   WAE+VAC+VY+LNRSPTK+V   TP EAW
Sbjct: 625  PQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAW 684

Query: 3146 SGHKPSVKHLKVFGAVAYAHIPAETRTKLDDRAQKMVFIGYKQG--GYKLFDPVTKKVNV 2973
            SG KP V HL+VFG++A+AH+P E R+KLDD+++K +FIGY     GYKL++P TKK  +
Sbjct: 685  SGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTII 744

Query: 2972 SRDVTFAEDEAWNWSENSPEDQKKYVLLEEESEEAIPNEVPTTSHSNLPIPPAADSQRPQ 2793
            SR++ F E+  W+W+ N  ED   +   EE+  E    E P+   +  P  P +      
Sbjct: 745  SRNIVFDEEGEWDWNSNE-EDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEES 803

Query: 2792 RQRKTPERYQGYEVSLDHNVNDDGELIHHAFIADLEPVTLQEAFENPKWEMAMHEELNAI 2613
               +TP R++  +   +   N +   +   F A+ EP+  Q+A E   W  AM EE+ +I
Sbjct: 804  SSERTP-RFRSIQELYEVTENQENLTLFCLF-AECEPMDFQKAIEKKTWRNAMDEEIKSI 861

Query: 2612 EKNDTWALTDLPQGHKAIDVKWVFKTKLGADGTVEKYKARLVAKGFEQREGYDYQEVFAP 2433
            +KNDTW LT LP GHKAI VKWV+K K  + G VE+YKARLVAKG+ QR G DY EVFAP
Sbjct: 862  QKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGIDYDEVFAP 921

Query: 2432 VARMETIRIMVSMAAQNHWKLHQMDVKSAFLNGPLEEMVFVKQPPGFVKEGSEQKVYRLI 2253
            VAR+ET+R+++S+AAQN WK+HQMDVKSAFLNG LEE V+++QP G++ +G E KV RL 
Sbjct: 922  VARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLK 981

Query: 2252 KALYGLKQSPRAWNKRIDAFFIEAGFKRCPSDHGLYVKAGDGGDILMVCLYVDDLVFTSN 2073
            K LYGLKQ+PRAWN RID +F E  F +CP +H LY+K     DIL+ CLYVDDL+FT N
Sbjct: 982  KVLYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKI-QKEDILIACLYVDDLIFTGN 1040

Query: 2072 NAGLVESFKKSIMKEFEMSDLGLLSYFLGVEVVQRSDGIFICQRKYVADILKKFKMDMCN 1893
            N  + E FKK + KEFEM+D+GL+SY+LG+EV Q  +GIFI Q  Y  ++LKKFK+D  N
Sbjct: 1041 NPSIFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKIDDSN 1100

Query: 1892 PTKTPVEVGTKLTKKGEGPLVDPTLFKQIVGSLRYLTCTRPDISYGVGLISRFMESPNQS 1713
            P  TP+E G KL+KK EG  VDPT FK +VGSLRYLTCTRPDI Y VG++SR+ME P  +
Sbjct: 1101 PVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1160

Query: 1712 HMKVARRIMRYLKGTQDFGLFYDSTDNCALVGYSDSDWAGDLDDRKSTTGSCFTLGSAAC 1533
            H K A+RI+RY+KGT +FGL Y +T +  LVGYSDSDW GD+DDRKST+G  F +G  A 
Sbjct: 1161 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGDTAF 1220

Query: 1532 SWVSRKQPTVALSTCEAEYMAASTSACQALWLASLLSEMHIPLKGNLKIYVDNKSAINLA 1353
            +W+S+KQP V LSTCEAEY+AA++  C A+WL +LL E+ +P +   KI+VDNKSAI LA
Sbjct: 1221 TWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIALA 1280

Query: 1352 KNPVAHGRSKHIDIKWHFLRELVEQKKIDLVFCKSENQVADIMTKPLKLDAFVKLRSRLG 1173
            KNPV H RSKHID ++H++RE V +K + L + K+ +QVAD  TKPLK + F+K+RS LG
Sbjct: 1281 KNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTKPLKRENFIKMRSLLG 1340

Query: 1172 ICPIESVLRGNV 1137
            +   +S LRG V
Sbjct: 1341 VA--KSSLRGGV 1350


>gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]
          Length = 1291

 Score =  831 bits (2146), Expect = 0.0
 Identities = 412/738 (55%), Positives = 527/738 (71%), Gaps = 8/738 (1%)
 Frame = -3

Query: 3326 PQHNGVAERKNRTILDMARSMLKTKNMPNSFWAESVACSVYILNRSPTKAVPKSTPGEAW 3147
            PQ NGVAERKNRTIL+MARSMLK+K +P   WAE+VAC+VY+LNRSPTK+V   TP EAW
Sbjct: 564  PQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAW 623

Query: 3146 SGHKPSVKHLKVFGAVAYAHIPAETRTKLDDRAQKMVFIGYKQG--GYKLFDPVTKKVNV 2973
            SG KP V HL+VFG++A+AH+P E R+KLDD+++K +FIGY     GYKL++P TKK  +
Sbjct: 624  SGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTII 683

Query: 2972 SRDVTFAEDEAWNWSENSPEDQKKYVLLEEESEEAI----PNEVPTTSHSNLPIPPAADS 2805
            SR++ F E+  W+W+ N  ED   +   EE+  E      P+E PTT  ++L      +S
Sbjct: 684  SRNIVFDEEGEWDWNSNE-EDYNFFPHFEEDEPEPTREEPPSEEPTTRPTSLTSSQIEES 742

Query: 2804 --QRPQRQRKTPERYQGYEVSLDHNVNDDGELIHHAFIADLEPVTLQEAFENPKWEMAMH 2631
              +R  R R   E Y+  E        +   L      A+ EP+  QEA E   W  AM 
Sbjct: 743  SSERTPRFRSIQELYEVTE--------NQENLTLFCLFAECEPMDFQEAIEKKTWRNAMD 794

Query: 2630 EELNAIEKNDTWALTDLPQGHKAIDVKWVFKTKLGADGTVEKYKARLVAKGFEQREGYDY 2451
            EE+ +I+KNDTW LT LP GHKAI VKWV+K K  + G VE+YKARLVAKG+ QR G DY
Sbjct: 795  EEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDY 854

Query: 2450 QEVFAPVARMETIRIMVSMAAQNHWKLHQMDVKSAFLNGPLEEMVFVKQPPGFVKEGSEQ 2271
             EVFAPVAR+ET+R+++S+AAQN WK+HQMD K AFLNG  EE V+++QP G++ +G E 
Sbjct: 855  DEVFAPVARLETVRLIISLAAQNKWKIHQMDFKLAFLNGDFEEEVYIEQPQGYIVKGEED 914

Query: 2270 KVYRLIKALYGLKQSPRAWNKRIDAFFIEAGFKRCPSDHGLYVKAGDGGDILMVCLYVDD 2091
            KV RL KALYGLKQ+PRAWN RID +F E  F +CP +H LY+K     DIL+ CLYVDD
Sbjct: 915  KVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKI-QKEDILIACLYVDD 973

Query: 2090 LVFTSNNAGLVESFKKSIMKEFEMSDLGLLSYFLGVEVVQRSDGIFICQRKYVADILKKF 1911
            L+FT NN  + E FKK + KEFEM+D+GL+SY+LG+EV Q  + IFI Q  Y  ++LKKF
Sbjct: 974  LIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNRIFITQEGYAKEVLKKF 1033

Query: 1910 KMDMCNPTKTPVEVGTKLTKKGEGPLVDPTLFKQIVGSLRYLTCTRPDISYGVGLISRFM 1731
            KMD  NP  TP+E G KL+KK EG  VDPT FK +VGSLRYLTCTRPDI Y VG++SR+M
Sbjct: 1034 KMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYM 1093

Query: 1730 ESPNQSHMKVARRIMRYLKGTQDFGLFYDSTDNCALVGYSDSDWAGDLDDRKSTTGSCFT 1551
            E P  +H K A+RI+RY+KGT +FGL Y +T +  LVGYSDSDW  D+DDRKST+G  F 
Sbjct: 1094 EHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGRDVDDRKSTSGFVFY 1153

Query: 1550 LGSAACSWVSRKQPTVALSTCEAEYMAASTSACQALWLASLLSEMHIPLKGNLKIYVDNK 1371
            +G  A +W+S+KQP V LSTCEAEY+AA++  C A+WL +LL E+ +P +   KI+VDNK
Sbjct: 1154 IGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNK 1213

Query: 1370 SAINLAKNPVAHGRSKHIDIKWHFLRELVEQKKIDLVFCKSENQVADIMTKPLKLDAFVK 1191
            SAI LAKNPV H RSKHID ++H++RE V +K + L + K+ +QVADI TKPLK + F+K
Sbjct: 1214 SAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKREDFIK 1273

Query: 1190 LRSRLGICPIESVLRGNV 1137
            +RS LG+   +S LRG V
Sbjct: 1274 MRSLLGVA--KSSLRGGV 1289


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  828 bits (2138), Expect = 0.0
 Identities = 408/732 (55%), Positives = 520/732 (71%), Gaps = 2/732 (0%)
 Frame = -3

Query: 3326 PQHNGVAERKNRTILDMARSMLKTKNMPNSFWAESVACSVYILNRSPTKAVPKSTPGEAW 3147
            PQ NGVAERKNRTIL+MARSMLK+K +P   WAE+VAC+VY+LNRSPTK+V   TP EAW
Sbjct: 625  PQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAW 684

Query: 3146 SGHKPSVKHLKVFGAVAYAHIPAETRTKLDDRAQKMVFIGYKQG--GYKLFDPVTKKVNV 2973
            SG KP V HL+VFG++A+AH+P E R+KLDD+++K +FIGY     GYKL++P TKK  +
Sbjct: 685  SGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTII 744

Query: 2972 SRDVTFAEDEAWNWSENSPEDQKKYVLLEEESEEAIPNEVPTTSHSNLPIPPAADSQRPQ 2793
            SR++ F E+  W+W+ N  ED   +   EE+  E    E P+   +  P  P + SQ  +
Sbjct: 745  SRNIVFDEEGEWDWNSNE-EDYNFFPHFEEDKPEPTREEPPSEEPTTPPTSPTS-SQIEE 802

Query: 2792 RQRKTPERYQGYEVSLDHNVNDDGELIHHAFIADLEPVTLQEAFENPKWEMAMHEELNAI 2613
            +                                  EP+  QEA E   W  AM EE+ +I
Sbjct: 803  K---------------------------------CEPMDFQEAIEKKTWRNAMDEEIKSI 829

Query: 2612 EKNDTWALTDLPQGHKAIDVKWVFKTKLGADGTVEKYKARLVAKGFEQREGYDYQEVFAP 2433
            +KNDTW LT LP GHKAI VKWV+K K  + G VE+YKARLVAKG+ QR G DY EVFAP
Sbjct: 830  QKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFAP 889

Query: 2432 VARMETIRIMVSMAAQNHWKLHQMDVKSAFLNGPLEEMVFVKQPPGFVKEGSEQKVYRLI 2253
            VAR+ET+R+++S+AAQN WK+HQMDVKSAFLNG LEE V+++QP G++ +G E KV RL 
Sbjct: 890  VARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLK 949

Query: 2252 KALYGLKQSPRAWNKRIDAFFIEAGFKRCPSDHGLYVKAGDGGDILMVCLYVDDLVFTSN 2073
            KALYGLKQ+PRAWN RID +F E  F +CP +H LY+K     DIL+ CLYVDDL+FT N
Sbjct: 950  KALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKI-QKEDILIACLYVDDLIFTGN 1008

Query: 2072 NAGLVESFKKSIMKEFEMSDLGLLSYFLGVEVVQRSDGIFICQRKYVADILKKFKMDMCN 1893
            N  + E FKK + KEFEM+D+GL+SY+LG+EV Q  +GIFI Q  Y  ++LKKFKMD  N
Sbjct: 1009 NPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDSN 1068

Query: 1892 PTKTPVEVGTKLTKKGEGPLVDPTLFKQIVGSLRYLTCTRPDISYGVGLISRFMESPNQS 1713
            P  TP+E G KL+KK EG  VDPT FK +VGSLRYLTCTRPDI Y VG++SR+ME P  +
Sbjct: 1069 PVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1128

Query: 1712 HMKVARRIMRYLKGTQDFGLFYDSTDNCALVGYSDSDWAGDLDDRKSTTGSCFTLGSAAC 1533
            H K A+RI+RY+KGT +FGL Y +T +  LVGYSDSDW GD+DDRKST+G  F +G  A 
Sbjct: 1129 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGDTAF 1188

Query: 1532 SWVSRKQPTVALSTCEAEYMAASTSACQALWLASLLSEMHIPLKGNLKIYVDNKSAINLA 1353
            +W+S+KQP V LSTCEAEY+AA++  C A+WL +LL E+ +P +   KI+VDNKSAI LA
Sbjct: 1189 TWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIALA 1248

Query: 1352 KNPVAHGRSKHIDIKWHFLRELVEQKKIDLVFCKSENQVADIMTKPLKLDAFVKLRSRLG 1173
            KNPV H RSKHID ++H++RE V +K + L + K+ +QVADI TKPLK + F+K+RS LG
Sbjct: 1249 KNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKREDFIKMRSLLG 1308

Query: 1172 ICPIESVLRGNV 1137
            +   +S LRG V
Sbjct: 1309 VA--KSSLRGGV 1318

