BLASTX nr result
ID: Achyranthes23_contig00010410
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00010410 (1570 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily p...   383   e-103
gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily p...   383   e-103
gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily p...   383   e-103
gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein...   376   e-101
gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus pe...   373   e-100
ref|XP_002330255.1| predicted protein [Populus trichocarpa]           369   2e-99
ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated prot...   365   3e-98
emb|CBI39598.3| unnamed protein product [Vitis vinifera]              365   3e-98
ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated prot...   364   6e-98
ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated prot...   363   1e-97
gb|EOY11662.1| Tetratricopeptide repeat-like superfamily protein...   358   4e-96
ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citr...   354   7e-95
gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus not...   351   4e-94
ref|XP_006392426.1| hypothetical protein EUTSA_v10023436mg [Eutr...   348   5e-93
ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated prot...   343   9e-92
ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated prot...   341   4e-91
ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat prot...   340   8e-91
ref|XP_006302207.1| hypothetical protein CARUB_v10020218mg [Caps...   339   2e-90
ref|XP_006302206.1| hypothetical protein CARUB_v10020218mg [Caps...
339 2e-90 ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated prot... 338 3e-90 >gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 5 [Theobroma cacao] Length = 389 Score = 383 bits (983), Expect = e-103 Identities = 204/391 (52%), Positives = 267/391 (68%), Gaps = 2/391 (0%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSV-NYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEA 1393 GKS + S +S + NYD+ + +SSSF+ +N+ DAASEK+ GNEYFK+KKF EA Sbjct: 21 GKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEA 80 Query: 1392 IDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKE 1213 IDCYSRSI LSPTAVA+ANRAMAYLKI++++EAEDDCTEALNLDDRYIKAYSRRATARKE Sbjct: 81 IDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKE 140 Query: 1212 LKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEV-EN 1036 L KL++S+ED EFALRLEP ++++KKQ AE KSL EK IL KA+G + +EV ++ Sbjct: 141 LGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS 200 Query: 1035 KAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXX 856 + NG G H S+ ++++ GV + + Sbjct: 201 ETKENGLGMH--SASNSTQRTGVATVQGYQTKK-----------------------NNRT 235 Query: 855 XKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQP 676 K EL A++Q+L NI PN+AY+FE+SW+ SGDR+LQAHLLKVT P Sbjct: 236 RKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSP 295 Query: 675 ERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRA 496 LPQIFKNAL+A +L+DIIKC+ATFF EE +L + YL +LTK+PRFDM+IMCL S ++A Sbjct: 296 SALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKA 355 Query: 495 ELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 +L + WD++F + ATP++ AE L+ L YC Sbjct: 356 DLLKVWDDVFCNEATPIEWAEILDNLRSVYC 386 >gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] Length = 421 Score = 383 bits (983), Expect = e-103 Identities = 204/391 (52%), Positives = 267/391 (68%), Gaps = 2/391 (0%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSV-NYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEA 1393 GKS 
+ S +S + NYD+ + +SSSF+ +N+ DAASEK+ GNEYFK+KKF EA Sbjct: 53 GKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEA 112 Query: 1392 IDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKE 1213 IDCYSRSI LSPTAVA+ANRAMAYLKI++++EAEDDCTEALNLDDRYIKAYSRRATARKE Sbjct: 113 IDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKE 172 Query: 1212 LKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEV-EN 1036 L KL++S+ED EFALRLEP ++++KKQ AE KSL EK IL KA+G + +EV ++ Sbjct: 173 LGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS 232 Query: 1035 KAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXX 856 + NG G H S+ ++++ GV + + Sbjct: 233 ETKENGLGMH--SASNSTQRTGVATVQGYQTKK-----------------------NNRT 267 Query: 855 XKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQP 676 K EL A++Q+L NI PN+AY+FE+SW+ SGDR+LQAHLLKVT P Sbjct: 268 RKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSP 327 Query: 675 ERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRA 496 LPQIFKNAL+A +L+DIIKC+ATFF EE +L + YL +LTK+PRFDM+IMCL S ++A Sbjct: 328 SALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKA 387 Query: 495 ELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 +L + WD++F + ATP++ AE L+ L YC Sbjct: 388 DLLKVWDDVFCNEATPIEWAEILDNLRSVYC 418 >gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 422 Score = 383 bits (983), Expect = e-103 Identities = 204/391 (52%), Positives = 267/391 (68%), Gaps = 2/391 (0%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSV-NYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEA 1393 GKS + S +S + NYD+ + +SSSF+ +N+ DAASEK+ GNEYFK+KKF EA Sbjct: 54 GKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEA 113 Query: 1392 IDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKE 1213 IDCYSRSI LSPTAVA+ANRAMAYLKI++++EAEDDCTEALNLDDRYIKAYSRRATARKE Sbjct: 114 IDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKE 173 Query: 1212 LKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEV-EN 
1036 L KL++S+ED EFALRLEP ++++KKQ AE KSL EK IL KA+G + +EV ++ Sbjct: 174 LGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS 233 Query: 1035 KAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXX 856 + NG G H S+ ++++ GV + + Sbjct: 234 ETKENGLGMH--SASNSTQRTGVATVQGYQTKK-----------------------NNRT 268 Query: 855 XKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQP 676 K EL A++Q+L NI PN+AY+FE+SW+ SGDR+LQAHLLKVT P Sbjct: 269 RKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSP 328 Query: 675 ERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRA 496 LPQIFKNAL+A +L+DIIKC+ATFF EE +L + YL +LTK+PRFDM+IMCL S ++A Sbjct: 329 SALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKA 388 Query: 495 ELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 +L + WD++F + ATP++ AE L+ L YC Sbjct: 389 DLLKVWDDVFCNEATPIEWAEILDNLRSVYC 419 >gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 468 Score = 376 bits (966), Expect = e-101 Identities = 204/415 (49%), Positives = 271/415 (65%), Gaps = 26/415 (6%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSV-NYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEA 1393 GKS + S +S + NYD+ + +SSSF+ +N+ DAASEK+ GNEYFK+KKF EA Sbjct: 53 GKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEA 112 Query: 1392 IDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKE 1213 IDCYSRSI LSPTAVA+ANRAMAYLKI++++EAEDDCTEALNLDDRYIKAYSRRATARKE Sbjct: 113 IDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKE 172 Query: 1212 LKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEV-EN 1036 L KL++S+ED EFALRLEP ++++KKQ AE KSL EK IL KA+G + +EV ++ Sbjct: 173 LGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS 232 Query: 1035 KAVANGHGKHVLSSPETSKKAGVGVLSDHL------------------------KSSVIX 928 + NG G H S+ ++++ GV + + ++++ Sbjct: 233 ETKENGLGMH--SASNSTQRTGVATVQGYQTKVSEYDKQKKPEKGSVTSEGIGDRNTLAG 290 Query: 927 XXXXXXXXXXXXXXXXXXXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYE 748 K EL A++Q+L NI PN+AY+ 
Sbjct: 291 SRKDGTQLDSGIVGLESIKKNNRTRKPELKASVQELASLAATRAMAEAAKNISPPNTAYQ 350 Query: 747 FELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVN 568 FE+SW+ SGDR+LQAHLLKVT P LPQIFKNAL+A +L+DIIKC+ATFF EE +L + Sbjct: 351 FEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATFFREEVDLAIK 410 Query: 567 YLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 YL +LTK+PRFDM+IMCL S ++A+L + WD++F + ATP++ AE L+ L YC Sbjct: 411 YLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNLRSVYC 465 >gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica] Length = 401 Score = 373 bits (957), Expect = e-100 Identities = 196/388 (50%), Positives = 259/388 (66%) Frame = -2 Query: 1566 KSKVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAID 1387 K K L ++ S N D ++ MSSSF+ D++ DAASEK+ GNEYFK+KKF EAID Sbjct: 36 KLKTRDLGTSSGNYDYSRNLDSINTMSSSFISEDSLPDAASEKELGNEYFKQKKFREAID 95 Query: 1386 CYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELK 1207 CYSRSIALSP+AVAYANRAMAY+KI+ ++EAEDDCTEALNLDDRYIKAYSRRATARKEL Sbjct: 96 CYSRSIALSPSAVAYANRAMAYIKIKSFQEAEDDCTEALNLDDRYIKAYSRRATARKELG 155 Query: 1206 KLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENKAV 1027 KL++S+ED EFALRLEPQ++++KKQ E KSL +K IL KA+G + +++V K Sbjct: 156 KLKESIEDAEFALRLEPQNQEIKKQYTEAKSLYDKTILQKASGAQKNSVQEMRKV-GKLD 214 Query: 1026 ANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKT 847 +G+ + + +++ + + DH K + Sbjct: 215 TKVNGQSIQPASSSAQITEMTAVQDHTKRN------------------------NTTRNP 250 Query: 846 ELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERL 667 E+ A++Q+L I PNSAY+FE+SW+GFSGD + Q LLK P L Sbjct: 251 EVKASVQELASRAASRVKAVAAEKIKPPNSAYQFEVSWRGFSGDNARQTSLLKAISPSAL 310 Query: 666 PQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELC 487 PQIFKNALT P+L+DIIKC+ATFFVEE +L VNYL +LT++PRFD +IM L S+D A+L Sbjct: 311 PQIFKNALTVPILLDIIKCVATFFVEEMDLAVNYLENLTRVPRFDTLIMFLSSSDNADLV 370 Query: 486 RTWDEIFLSNATPVQHAESLNKLHPRYC 403 + WDE+F + ATP+++AE L+ L +YC Sbjct: 371 KIWDEVFDNEATPIEYAEKLDNLRTKYC 398 
>ref|XP_002330255.1| predicted protein [Populus trichocarpa] Length = 434 Score = 369 bits (948), Expect = 2e-99 Identities = 201/386 (52%), Positives = 262/386 (67%), Gaps = 13/386 (3%) Frame = -2 Query: 1518 SVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIALSPTAVAYA 1339 S N+ ++ +SSSF ++ VDA +EK+ GNEYFK+KKFNEAI+CYSRSIALSPTAVAYA Sbjct: 73 SRNFGAINRLSSSFTTDEITVDATTEKELGNEYFKQKKFNEAIECYSRSIALSPTAVAYA 132 Query: 1338 NRAMAYLKIRR----YKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLEDVEFA 1171 NRAMAYLKI+R ++EAEDDCTEALNLDDRYIKAYSRRATARKEL KL++S+ED EFA Sbjct: 133 NRAMAYLKIKRQFFLFREAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFA 192 Query: 1170 LRLEPQDEDLKKQRAEVKSLLEKA-------ILAKATGNATIAALVVKE-VENKAVANGH 1015 L+LEP ++++KKQ AEVKSL EKA IL KA+G + ++ ++A NGH Sbjct: 193 LKLEPNNQEIKKQYAEVKSLYEKASDYLMLEILQKASGTLRSSLQGTQQGGRSEASVNGH 252 Query: 1014 GKHVLSSPETSKKAGVGV-LSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKTELV 838 H +S ++K GV D+ K + + EL Sbjct: 253 AVHPVSI--ATQKTGVSASKKDNTKKN------------------------NRTRRQELK 286 Query: 837 ATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQI 658 ++ +L NI PNSAY+FE+SW+GFSGDR+LQAHLLKVT P LPQI Sbjct: 287 TSVIELASQAASRAMAEAAKNITPPNSAYQFEVSWQGFSGDRALQAHLLKVTSPSALPQI 346 Query: 657 FKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTW 478 FKNAL+ P+LIDIIKC+A+FF+++ + V YL +LTK+PRFDM+IMCL S D ++L + W Sbjct: 347 FKNALSVPILIDIIKCVASFFIDDMDFAVKYLENLTKVPRFDMLIMCLSSTDTSDLLKMW 406 Query: 477 DEIFLSNATPVQHAESLNKLHPRYCP 400 D +F S +TP+++AE L+ L +YCP Sbjct: 407 DGVFCSASTPIEYAEILDNLRSKYCP 432 >ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated protein 3-like [Vitis vinifera] Length = 474 Score = 365 bits (937), Expect = 3e-98 Identities = 199/397 (50%), Positives = 255/397 (64%), Gaps = 17/397 (4%) Frame = -2 Query: 1542 NNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIAL 1363 ++T + S N+D +S +SSSFM +++ DAASEK+ GNEYFK++KF EAIDCYSRSIAL Sbjct: 75 SDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFKQRKFKEAIDCYSRSIAL 134 Query: 1362 
SPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLED 1183 PTAVAYANRAMAY+KI+R++EAEDDC EALNLDDRYIKAYSRRATARKEL K +++ ED Sbjct: 135 LPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSRRATARKELGKFKEATED 194 Query: 1182 VEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENKAV-ANGHGKH 1006 EFALRLEPQ++++KKQ AE KSL EK IL KA+G + +++V V N + Sbjct: 195 AEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQGLQKVGKSVVEVNADTQG 254 Query: 1005 VLSSPETSKKAGVGVLSDHLK----------------SSVIXXXXXXXXXXXXXXXXXXX 874 V S +S+ AG + D + Sbjct: 255 VRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSKENGYLENAVQNSGLEDV 314 Query: 873 XXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHL 694 + E+ +++Q+L NI PNSAY+FE+SW+G GD +LQA Sbjct: 315 MSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQFEVSWRGLLGDHALQASY 374 Query: 693 LKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCL 514 LK P LPQIFKNAL+AP+LIDIIKCIATFFV E +L V +L +LTKI RFDMIIMCL Sbjct: 375 LKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKFLDNLTKISRFDMIIMCL 434 Query: 513 HSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 S D+ +L + WDE+F + ATP +A++L KL PRYC Sbjct: 435 SSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 471 >emb|CBI39598.3| unnamed protein product [Vitis vinifera] Length = 1097 Score = 365 bits (937), Expect = 3e-98 Identities = 199/397 (50%), Positives = 255/397 (64%), Gaps = 17/397 (4%) Frame = -2 Query: 1542 NNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIAL 1363 ++T + S N+D +S +SSSFM +++ DAASEK+ GNEYFK++KF EAIDCYSRSIAL Sbjct: 698 SDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFKQRKFKEAIDCYSRSIAL 757 Query: 1362 SPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLED 1183 PTAVAYANRAMAY+KI+R++EAEDDC EALNLDDRYIKAYSRRATARKEL K +++ ED Sbjct: 758 LPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSRRATARKELGKFKEATED 817 Query: 1182 VEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENKAV-ANGHGKH 1006 EFALRLEPQ++++KKQ AE KSL EK IL KA+G + +++V V N + Sbjct: 818 AEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQGLQKVGKSVVEVNADTQG 877 Query: 1005 
VLSSPETSKKAGVGVLSDHLK----------------SSVIXXXXXXXXXXXXXXXXXXX 874 V S +S+ AG + D + Sbjct: 878 VRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSKENGYLENAVQNSGLEDV 937 Query: 873 XXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHL 694 + E+ +++Q+L NI PNSAY+FE+SW+G GD +LQA Sbjct: 938 MSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQFEVSWRGLLGDHALQASY 997 Query: 693 LKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCL 514 LK P LPQIFKNAL+AP+LIDIIKCIATFFV E +L V +L +LTKI RFDMIIMCL Sbjct: 998 LKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKFLDNLTKISRFDMIIMCL 1057 Query: 513 HSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 S D+ +L + WDE+F + ATP +A++L KL PRYC Sbjct: 1058 SSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 1094 >ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated protein 3-like [Fragaria vesca subsp. vesca] Length = 407 Score = 364 bits (934), Expect = 6e-98 Identities = 192/373 (51%), Positives = 249/373 (66%), Gaps = 1/373 (0%) Frame = -2 Query: 1518 SVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIALSPTAVAYA 1339 S NY+ ++ +SSSF D + DAASEKD GNEYFK+KKF EAIDCYSRSIAL+PTAVA+A Sbjct: 58 STNYEPMNTVSSSFTSEDGLPDAASEKDLGNEYFKQKKFKEAIDCYSRSIALTPTAVAFA 117 Query: 1338 NRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLEDVEFALRLE 1159 NRAM+Y+KI+R++EAE+DCTEALNLDDRYIKAYSRRATARKEL KL++S+ED EFALRLE Sbjct: 118 NRAMSYIKIKRFQEAENDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDAEFALRLE 177 Query: 1158 PQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENK-AVANGHGKHVLSSPETS 982 P ++++KKQ AE KSL EK IL K +G I+ ++VE NGH +SS T+ Sbjct: 178 PHNQEIKKQYAEAKSLYEKGILQKVSGAIKISEQDKQKVEKSGTTVNGHSIQPVSS--TT 235 Query: 981 KKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKTELVATIQDLXXXXXX 802 ++ + DH K K ++Q+L Sbjct: 236 QRTETTAVGDHTKK------------------------INTNGKQASKLSVQELASRAAS 271 Query: 801 XXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLID 622 NI P+SAY+FE SW+G SGDR+LQA LLK P LPQIFKNALT +L+D Sbjct: 272 RAKALAAENITPPSSAYQFEASWRGLSGDRALQAKLLKAISPSALPQIFKNALTVHILVD 331 Query: 621 
IIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLSNATPVQ 442 I+KC+ TFF++E +L V+ L +LTK+PRFD +IM L S D+A+L + WDE+F + ATP++ Sbjct: 332 ILKCVTTFFIDEMDLAVSVLENLTKVPRFDTLIMFLSSNDKADLAKIWDEVFYNEATPIE 391 Query: 441 HAESLNKLHPRYC 403 AE L+ L +YC Sbjct: 392 FAEKLDNLRAKYC 404 >ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated protein 3-like [Citrus sinensis] Length = 438 Score = 363 bits (931), Expect = 1e-97 Identities = 196/382 (51%), Positives = 249/382 (65%), Gaps = 10/382 (2%) Frame = -2 Query: 1518 SVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIALSPTAVAYA 1339 S NYD +SH+SSS M ++ DA SEK+ GNE FK+KKF EAIDCYSRSIALSPTAVAYA Sbjct: 62 SRNYDPVSHISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYA 121 Query: 1338 NRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLEDVEFALRLE 1159 NRAMAYLK+RR++EAEDDCTEALNLDDRYIKAYSRRATARKEL KL++S+ED EFALRLE Sbjct: 122 NRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLE 181 Query: 1158 PQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENKAVANGHGKHVLSSPETSK 979 PQ++++KKQ AEVKSL EK + KA+ E K+ +G V + T++ Sbjct: 182 PQNQEIKKQLAEVKSLYEKEVFQKASKTL--------EKYGKSGMKVNGHEVRAVRNTTQ 233 Query: 978 KAGVGVLSD----------HLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKTELVATI 829 K GV + D +L+ K L A++ Sbjct: 234 KTGVAEIQDLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASV 293 Query: 828 QDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKN 649 Q+L NI P SAYEFE+SW+GF+GD +LQA LLK P LPQIFKN Sbjct: 294 QELATRATSRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKN 353 Query: 648 ALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEI 469 AL+A +LIDI+K +ATFF E +L + YL +LT +PRFD++IMCL AD+A+L + WDE Sbjct: 354 ALSASILIDIVKVVATFFTGEVDLAIKYLEYLTMVPRFDLVIMCLSLADKADLRKVWDET 413 Query: 468 FLSNATPVQHAESLNKLHPRYC 403 F + +TP+++AE L+ L +YC Sbjct: 414 FCNESTPIEYAEILDNLRSKYC 435 >gb|EOY11662.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 3, partial [Theobroma cacao] Length = 412 Score = 358 bits (919), Expect = 4e-96 Identities = 195/394 (49%), 
Positives = 258/394 (65%), Gaps = 26/394 (6%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSV-NYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEA 1393 GKS + S +S + NYD+ + +SSSF+ +N+ DAASEK+ GNEYFK+KKF EA Sbjct: 21 GKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEA 80 Query: 1392 IDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKE 1213 IDCYSRSI LSPTAVA+ANRAMAYLKI++++EAEDDCTEALNLDDRYIKAYSRRATARKE Sbjct: 81 IDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKE 140 Query: 1212 LKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEV-EN 1036 L KL++S+ED EFALRLEP ++++KKQ AE KSL EK IL KA+G + +EV ++ Sbjct: 141 LGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS 200 Query: 1035 KAVANGHGKHVLSSPETSKKAGVGVLSDHL------------------------KSSVIX 928 + NG G H S+ ++++ GV + + ++++ Sbjct: 201 ETKENGLGMH--SASNSTQRTGVATVQGYQTKVSEYDKQKKPEKGSVTSEGIGDRNTLAG 258 Query: 927 XXXXXXXXXXXXXXXXXXXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYE 748 K EL A++Q+L NI PN+AY+ Sbjct: 259 SRKDGTQLDSGIVGLESIKKNNRTRKPELKASVQELASLAATRAMAEAAKNISPPNTAYQ 318 Query: 747 FELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVN 568 FE+SW+ SGDR+LQAHLLKVT P LPQIFKNAL+A +L+DIIKC+ATFF EE +L + Sbjct: 319 FEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATFFREEVDLAIK 378 Query: 567 YLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIF 466 YL +LTK+PRFDM+IMCL S ++A+L + WD++F Sbjct: 379 YLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVF 412 >ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] gi|557535662|gb|ESR46780.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] Length = 977 Score = 354 bits (908), Expect = 7e-95 Identities = 194/382 (50%), Positives = 244/382 (63%), Gaps = 10/382 (2%) Frame = -2 Query: 1518 SVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCYSRSIALSPTAVAYA 1339 S NYD +S +SSS M ++ DA SEK+ GNE FK+KKF EAIDCYSRSIALSPTAVAYA Sbjct: 601 SRNYDPVSRISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYA 660 Query: 1338 NRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKLRDSLEDVEFALRLE 1159 
NRAMAYLK+RR++EAEDDCTEALNLDDRYIKAYSRRATARKEL KL++S+ED EFALRLE Sbjct: 661 NRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLE 720 Query: 1158 PQDEDLKKQRAEVKSLLEKAILAKATGNATIAALVVKEVENKAVANGHGKHVLSSPETSK 979 PQ++++KKQ AEVKSL EK + KA+ E K+ +G V + T + Sbjct: 721 PQNQEIKKQLAEVKSLYEKEVFQKASKTL--------EKYGKSGMKVNGHEVRAVRNTIQ 772 Query: 978 KAGVGVLSD----------HLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKTELVATI 829 K GV + D +L+ K L A++ Sbjct: 773 KTGVAEIQDLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASV 832 Query: 828 QDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKN 649 Q+L NI P SAYEFE+SW+GF+GD +LQA LLK P LPQIFKN Sbjct: 833 QELATRATSRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKN 892 Query: 648 ALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEI 469 AL+A +LIDI+K +A FF E +L + YL +LT +PRFD +IMCL AD+A+L + WDE Sbjct: 893 ALSASILIDIVKVVAMFFPGEVDLAIKYLEYLTMVPRFDFVIMCLSLADKADLRKVWDET 952 Query: 468 FLSNATPVQHAESLNKLHPRYC 403 F + TP+++AE L+ L +YC Sbjct: 953 FCNELTPIEYAEILDNLRSKYC 974 >gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus notabilis] Length = 450 Score = 351 bits (901), Expect = 4e-94 Identities = 195/408 (47%), Positives = 258/408 (63%), Gaps = 19/408 (4%) Frame = -2 Query: 1569 GKSKVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAI 1390 GKS +++ S YD ++ +SSS + D+ DAASEK+ GNEYFK+KKF EAI Sbjct: 67 GKSSTFEYLSSSMPYDYSRKYDAINQVSSSSISEDSYTDAASEKELGNEYFKQKKFKEAI 126 Query: 1389 DCYSRSIALSPTAVAYANRAMAYLKIRR-----------------YKEAEDDCTEALNLD 1261 DCYSRSIALS TAVAYANRAMAYLK++R ++EAE DCTEALN+D Sbjct: 127 DCYSRSIALSSTAVAYANRAMAYLKLKRQLLPYLIFFCKSIFLIRFQEAEGDCTEALNMD 186 Query: 1260 DRYIKAYSRRATARKELKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKAT 1081 DRYIKAYSRRATARKEL KL++ +ED EFALRLEP ++++KKQ +E KSL EK IL KA+ Sbjct: 187 DRYIKAYSRRATARKELGKLKECIEDAEFALRLEPNNQEIKKQYSEAKSLCEKVILQKAS 246 Query: 1080 G--NATIAALVVKEVENKAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXX 907 T+ + E ++ V N + V S+ ++K V D+ K + Sbjct: 247 
VALENTVQKMQKAEKKDTKVQNNGIQPVESA---TQKTEAAVAEDYTKIN---------- 293 Query: 906 XXXXXXXXXXXXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKG 727 K E A++Q+L NI +P SAY+FE+SW+G Sbjct: 294 --------------QTAKKQEPKASVQELASRAASRAMNGTAKNIRSPTSAYQFEVSWRG 339 Query: 726 FSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTK 547 SGDR+LQA LLK P LPQIFKN+LT P+L+DI+KCIATFF+EE ++ V +L +LTK Sbjct: 340 LSGDRALQASLLKTVSPGALPQIFKNSLTVPILVDIVKCIATFFIEEMDVTVTFLENLTK 399 Query: 546 IPRFDMIIMCLHSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRYC 403 +PRFD+++MCL S DRA+L + W+E+F ATP++HAE L+ L +YC Sbjct: 400 VPRFDILVMCLTSKDRADLVKIWNEVFCKEATPIEHAEKLDNLRSKYC 447 >ref|XP_006392426.1| hypothetical protein EUTSA_v10023436mg [Eutrema salsugineum] gi|557088932|gb|ESQ29712.1| hypothetical protein EUTSA_v10023436mg [Eutrema salsugineum] Length = 473 Score = 348 bits (892), Expect = 5e-93 Identities = 201/430 (46%), Positives = 268/430 (62%), Gaps = 48/430 (11%) Frame = -2 Query: 1548 LSNNTSE-LRPSVN--------YDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNE 1396 LSN TSE RPS + Y +S +SSSF +++ +DA SEK+QGNEYFK+KKFNE Sbjct: 41 LSNPTSEKFRPSGSGQYDFVKKYGPMSGLSSSFADDESPLDANSEKEQGNEYFKQKKFNE 100 Query: 1395 AIDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARK 1216 AIDCYSRSIALSP AVA+ANRAMAYLKI+RY+EAE DCTEALNLDDRY KAYSRRATARK Sbjct: 101 AIDCYSRSIALSPNAVAFANRAMAYLKIKRYREAEIDCTEALNLDDRYTKAYSRRATARK 160 Query: 1215 ELKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATG--NATIAALV---- 1054 L +++++ED EFALRLEPQ ++L+KQ A++KSLLEK I+ KA+G +T L+ Sbjct: 161 ALGMVKEAMEDAEFALRLEPQSQELQKQYADIKSLLEKEIIEKASGAMQSTAQELLKTAG 220 Query: 1053 ------------VKEVENKAVANG------------HGKHVLSS--PETSKKAG---VGV 961 K V A NG GK ++ S PE K G + Sbjct: 221 LDKKAKIPNTDTKKPVTLVAKTNGDMVRPVSRSNESSGKKLIESVDPEEKHKEGSKKISA 280 Query: 960 LSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKT----ELVATIQDLXXXXXXXXX 793 +++++ + + + EL ++Q+L Sbjct: 281 ITENVDNKKVTPISQSYEKEAKSSGGNVTQPSEQVNQVSTKLELKPSVQELATRAASLAM 340 Query: 792 XXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLIDIIK 613 NI P 
SAYEFE +W+ FSGDR+LQ LLKV P LPQIFKNALT+P+L+DIIK Sbjct: 341 SEVTKNIKAPKSAYEFENTWRSFSGDRALQKQLLKVMTPSSLPQIFKNALTSPVLVDIIK 400 Query: 612 CIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLSNATPVQHAE 433 C+A+ F E+ +L V Y+ +LTK+PRF+M++MCL S ++ EL + W+++F S ATP+++AE Sbjct: 401 CVASIFTEDMDLAVKYIENLTKVPRFNMLVMCLTSTEKNELLKIWEDVFCSKATPMEYAE 460 Query: 432 SLNKLHPRYC 403 L+KL +YC Sbjct: 461 VLDKLRLKYC 470 >ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated protein 3-like [Solanum lycopersicum] Length = 470 Score = 343 bits (881), Expect = 9e-92 Identities = 187/398 (46%), Positives = 249/398 (62%), Gaps = 13/398 (3%) Frame = -2 Query: 1560 KVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCY 1381 K T + N S NY+ +SH+SS + ++ ++A SEK+ GNE FK+KKFNEAIDCY Sbjct: 71 KSTSIRNAAGPYSYSKNYNPISHLSSELISEESNINANSEKELGNECFKQKKFNEAIDCY 130 Query: 1380 SRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKL 1201 SRSIALSPTAV+YANRAMAYLKI+R++EAE+DCTEALNLDDRYIKAYSRR+T+RKEL KL Sbjct: 131 SRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRSTSRKELGKL 190 Query: 1200 RDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAAL------------ 1057 ++S+ED EFAL LEP++ ++KKQ EVK+L EK IL + +G ++A Sbjct: 191 KESIEDAEFALWLEPRNPEIKKQYGEVKALYEKEILKRVSGATDVSAQGPQKSGKTIKIG 250 Query: 1056 -VVKEVENKAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXX 880 V++ V + + + + + G + D Sbjct: 251 PVIQSVSSSSQKVAEVRTIPAKENNRDVLGTAKVEDTHMQISNKDSDASPTVPTLNLAFG 310 Query: 879 XXXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQA 700 K EL ++Q+L NI PNSAY+FE+SW+G SGDR+LQ Sbjct: 311 TAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQT 370 Query: 699 HLLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIM 520 LLKVT P LP+IFKNAL+AP+L+DI++CIATFF+E+ L + YL LTK+PRFDMIIM Sbjct: 371 QLLKVTSPAMLPRIFKNALSAPMLMDIVRCIATFFIEDMNLAIRYLEDLTKVPRFDMIIM 430 Query: 519 CLHSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRY 406 CL SAD++EL + W+EIF A +H+ +L L Y Sbjct: 431 CLSSADKSELLKIWEEIFCKVAE--EHSATLGALRVSY 466 >ref|XP_006339932.1| PREDICTED: 
RNA polymerase II-associated protein 3-like isoform X1 [Solanum tuberosum] Length = 468 Score = 341 bits (875), Expect = 4e-91 Identities = 185/398 (46%), Positives = 247/398 (62%), Gaps = 13/398 (3%) Frame = -2 Query: 1560 KVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCY 1381 K T + + S NY+ +SH+SS + ++ ++A SEK+ GNE FK+KKFNEAIDCY Sbjct: 70 KSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQKKFNEAIDCY 129 Query: 1380 SRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKL 1201 SRSIALSPTAV+YANRAMAYLKI+R++EAE+DCTEALNLDDRYIKAYSRR+T+RKEL KL Sbjct: 130 SRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRSTSRKELGKL 189 Query: 1200 RDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAILAKATGNATIAAL------------ 1057 ++S+ED EFALRLEPQ+ ++KKQ EVK+L EK I + +G ++A Sbjct: 190 KESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATDVSAQRAQKSGKTIKSG 249 Query: 1056 -VVKEVENKAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXX 880 V++ V + + + + G + D Sbjct: 250 PVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPTVPTLNPAFG 309 Query: 879 XXXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQA 700 K EL ++Q+L NI PNSAY+FE+SW+G SGDR+LQ Sbjct: 310 TAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQT 369 Query: 699 HLLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIM 520 LLKVT P LP+IFKNAL+AP+L+DI++C+ATFF+E+ L + YL LTK+PRFDMIIM Sbjct: 370 QLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTKVPRFDMIIM 429 Query: 519 CLHSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRY 406 CL S D++EL + W+EIF A +H+ +L L Y Sbjct: 430 CLSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 465 >ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat protein [Arabidopsis thaliana] gi|53828529|gb|AAU94374.1| At1g56440 [Arabidopsis thaliana] gi|59958350|gb|AAX12885.1| At1g56440 [Arabidopsis thaliana] gi|110743110|dbj|BAE99447.1| hypothetical protein [Arabidopsis thaliana] gi|332195274|gb|AEE33395.1| carboxylate clamp-tetratricopeptide repeat [Arabidopsis thaliana] Length = 476 Score = 340 bits (873), Expect = 8e-91 Identities = 190/427 (44%), 
Positives = 265/427 (62%), Gaps = 40/427 (9%) Frame = -2 Query: 1563 SKVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDC 1384 S T+ + + + + Y + +SSS + ++++D++SEK+QGNE+FK+KKFNEAIDC Sbjct: 48 SSETFRPSGSGKYDFAKKYRSIRDLSSSLI-GESLLDSSSEKEQGNEFFKQKKFNEAIDC 106 Query: 1383 YSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKK 1204 YSRSIALSP AV YANRAMAYLKI+RY+EAE DCTEALNLDDRYIKAYSRRATARKEL Sbjct: 107 YSRSIALSPNAVTYANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSRRATARKELGM 166 Query: 1203 LRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEKAI--------------LAKATG---- 1078 ++++ ED EFALRLEP+ ++LKKQ A++KSLLEK I L K +G Sbjct: 167 IKEAKEDAEFALRLEPESQELKKQYADIKSLLEKEIIEKATGAMQSTAQELLKTSGLDKK 226 Query: 1077 -----------NATIAALVVKEVENKAVANGH--GKHVLSS--PETSKKAG---VGVLSD 952 T+ A +++ + + GK ++ + PE K G + +++ Sbjct: 227 IQKPKTEMTSKPVTLVAKTNRDIVQPVLGSNESSGKKLIENIQPEEKSKEGSMKIPAITE 286 Query: 951 HLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKT----ELVATIQDLXXXXXXXXXXXX 784 L S + + EL ++Q+L Sbjct: 287 ILDSKKVTPGSQSYEKEAKPSDRNGTQPSGPENQVSKQLELKPSVQELAAHAASLAMTEA 346 Query: 783 XSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALTAPLLIDIIKCIA 604 NI TP SAYEFE SW+ FSGD +L++ LLKVT P LPQIFKNALT+P+L+DIIKC+A Sbjct: 347 SKNIKTPKSAYEFENSWRSFSGDSALRSQLLKVTTPSSLPQIFKNALTSPVLVDIIKCVA 406 Query: 603 TFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLSNATPVQHAESLN 424 +FF E+ +L V Y+ +LTK+PRF+M++MCL S ++ EL + W+++F + ATP+++AE L+ Sbjct: 407 SFFTEDMDLAVKYIENLTKVPRFNMLVMCLTSTEKNELLKIWEDVFCNKATPMEYAEVLD 466 Query: 423 KLHPRYC 403 KL RYC Sbjct: 467 KLRSRYC 473 >ref|XP_006302207.1| hypothetical protein CARUB_v10020218mg [Capsella rubella] gi|482570917|gb|EOA35105.1| hypothetical protein CARUB_v10020218mg [Capsella rubella] Length = 478 Score = 339 bits (869), Expect = 2e-90 Identities = 199/439 (45%), Positives = 259/439 (58%), Gaps = 51/439 (11%) Frame = -2 Query: 1566 KSKVTYLSN-NTSELRPS--------VNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFK 1414 K K + SN N+ + +PS NY +S +SSS + ++++D++SEK+QGNE+FK Sbjct: 38 
KQKPSNSSNLNSEKFKPSGSGQYDFVKNYSSISDLSSSLI-GESLLDSSSEKEQGNEFFK 96 Query: 1413 KKKFNEAIDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSR 1234 +KKFNEAIDCYSRS+ALS AVAYANRAMAYLKI+RY+EAE DCTEALNLDDRYIKAYSR Sbjct: 97 QKKFNEAIDCYSRSLALSANAVAYANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSR 156 Query: 1233 RATARKELKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLE----------------- 1105 RATARKEL ++++ ED EFALRLEP E+LKKQ A +KSLLE Sbjct: 157 RATARKELGMIKEAKEDAEFALRLEPASEELKKQYANIKSLLEKEIVEKATGAMQSTAQE 216 Query: 1104 ---------KAILAKATGNATIAALVVKEVENKAVANGHGKHVLSS-------PETSKKA 973 K L K N L VK +KA + K L PE +K Sbjct: 217 LLKTAGLDKKTKLPKTQVNLKPVTLGVKTNSDKARPDSVRKESLGKKLIENVQPEPQEKC 276 Query: 972 GVG-----VLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKT----ELVATIQDL 820 G ++++ S+ + EL ++Q+L Sbjct: 277 KEGSKKTSAITENPDSNQVMPRSQCYGKEEKSSNRNGAQSSGQGNPVSKKLELKPSVQEL 336 Query: 819 XXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALT 640 NI P SAYEFE SW+ FSGDR+LQ LLKV P LPQIFKNALT Sbjct: 337 AAHAASLAMVEVSKNIKAPKSAYEFENSWRSFSGDRALQTQLLKVMSPNSLPQIFKNALT 396 Query: 639 APLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLS 460 +P+L+DIIKC+A FF E+ +L V Y+ +LTK+PRF+M++MCL S ++ EL + WD++F Sbjct: 397 SPVLVDIIKCVAVFFTEDMDLAVKYIENLTKVPRFNMLVMCLTSTEKNELMKIWDDVFCD 456 Query: 459 NATPVQHAESLNKLHPRYC 403 ATP+++AE L+KL RYC Sbjct: 457 KATPMEYAEVLDKLKSRYC 475 >ref|XP_006302206.1| hypothetical protein CARUB_v10020218mg [Capsella rubella] gi|482570916|gb|EOA35104.1| hypothetical protein CARUB_v10020218mg [Capsella rubella] Length = 468 Score = 339 bits (869), Expect = 2e-90 Identities = 199/439 (45%), Positives = 259/439 (58%), Gaps = 51/439 (11%) Frame = -2 Query: 1566 KSKVTYLSN-NTSELRPS--------VNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFK 1414 K K + SN N+ + +PS NY +S +SSS + ++++D++SEK+QGNE+FK Sbjct: 28 KQKPSNSSNLNSEKFKPSGSGQYDFVKNYSSISDLSSSLI-GESLLDSSSEKEQGNEFFK 86 Query: 1413 KKKFNEAIDCYSRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSR 1234 +KKFNEAIDCYSRS+ALS AVAYANRAMAYLKI+RY+EAE DCTEALNLDDRYIKAYSR Sbjct: 87 
QKKFNEAIDCYSRSLALSANAVAYANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSR 146 Query: 1233 RATARKELKKLRDSLEDVEFALRLEPQDEDLKKQRAEVKSLLE----------------- 1105 RATARKEL ++++ ED EFALRLEP E+LKKQ A +KSLLE Sbjct: 147 RATARKELGMIKEAKEDAEFALRLEPASEELKKQYANIKSLLEKEIVEKATGAMQSTAQE 206 Query: 1104 ---------KAILAKATGNATIAALVVKEVENKAVANGHGKHVLSS-------PETSKKA 973 K L K N L VK +KA + K L PE +K Sbjct: 207 LLKTAGLDKKTKLPKTQVNLKPVTLGVKTNSDKARPDSVRKESLGKKLIENVQPEPQEKC 266 Query: 972 GVG-----VLSDHLKSSVIXXXXXXXXXXXXXXXXXXXXXXXXXXKT----ELVATIQDL 820 G ++++ S+ + EL ++Q+L Sbjct: 267 KEGSKKTSAITENPDSNQVMPRSQCYGKEEKSSNRNGAQSSGQGNPVSKKLELKPSVQEL 326 Query: 819 XXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAHLLKVTQPERLPQIFKNALT 640 NI P SAYEFE SW+ FSGDR+LQ LLKV P LPQIFKNALT Sbjct: 327 AAHAASLAMVEVSKNIKAPKSAYEFENSWRSFSGDRALQTQLLKVMSPNSLPQIFKNALT 386 Query: 639 APLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMCLHSADRAELCRTWDEIFLS 460 +P+L+DIIKC+A FF E+ +L V Y+ +LTK+PRF+M++MCL S ++ EL + WD++F Sbjct: 387 SPVLVDIIKCVAVFFTEDMDLAVKYIENLTKVPRFNMLVMCLTSTEKNELMKIWDDVFCD 446 Query: 459 NATPVQHAESLNKLHPRYC 403 ATP+++AE L+KL RYC Sbjct: 447 KATPMEYAEVLDKLKSRYC 465 >ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X2 [Solanum tuberosum] Length = 467 Score = 338 bits (868), Expect = 3e-90 Identities = 185/397 (46%), Positives = 246/397 (61%), Gaps = 12/397 (3%) Frame = -2 Query: 1560 KVTYLSNNTSELRPSVNYDQLSHMSSSFMPNDNIVDAASEKDQGNEYFKKKKFNEAIDCY 1381 K T + + S NY+ +SH+SS + ++ ++A SEK+ GNE FK+KKFNEAIDCY Sbjct: 70 KSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQKKFNEAIDCY 129 Query: 1380 SRSIALSPTAVAYANRAMAYLKIRRYKEAEDDCTEALNLDDRYIKAYSRRATARKELKKL 1201 SRSIALSPTAV+YANRAMAYLKI+R++EAE+DCTEALNLDDRYIKAYSRR+T+RKEL KL Sbjct: 130 SRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRSTSRKELGKL 189 Query: 1200 RDSLEDVEFALRLEPQDEDLKKQRAEVKSLLEK------------AILAKATGNATIAAL 1057 ++S+ED EFALRLEPQ+ ++KKQ EVK+L EK A A+ +G + Sbjct: 190 KESIEDAEFALRLEPQNPEIKKQYGEVKALYEKIRKRVSGATDVSAQRAQKSGKTIKSGP 249 Query: 1056 
VVKEVENKAVANGHGKHVLSSPETSKKAGVGVLSDHLKSSVIXXXXXXXXXXXXXXXXXX 877 V++ V + + + + G + D Sbjct: 250 VIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPTVPTLNPAFGT 309 Query: 876 XXXXXXXXKTELVATIQDLXXXXXXXXXXXXXSNIGTPNSAYEFELSWKGFSGDRSLQAH 697 K EL ++Q+L NI PNSAY+FE+SW+G SGDR+LQ Sbjct: 310 AKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQTQ 369 Query: 696 LLKVTQPERLPQIFKNALTAPLLIDIIKCIATFFVEETELGVNYLLHLTKIPRFDMIIMC 517 LLKVT P LP+IFKNAL+AP+L+DI++C+ATFF+E+ L + YL LTK+PRFDMIIMC Sbjct: 370 LLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTKVPRFDMIIMC 429 Query: 516 LHSADRAELCRTWDEIFLSNATPVQHAESLNKLHPRY 406 L S D++EL + W+EIF A +H+ +L L Y Sbjct: 430 LSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 464
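
Note on reading the hit summary: each row of the "Sequences producing significant alignments" table holds a sequence identifier, a truncated description, a bit score, and an E-value. This BLAST version prints E-values with a mantissa of 1 in shortened form, so "e-103" stands for 1e-103. The sketch below shows one possible way to read such rows in Python. The names parse_evalue and parse_hit_row are hypothetical helpers written for this illustration; they are not part of BLAST or of any library, and the pattern assumes one hit per line.

```python
import re

# One summary row: identifier, free-text description, bit score, E-value.
# Assumption: the row is on a single line and ends with "<bits> <evalue>".
ROW = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert an E-value string to float, expanding the short form 'e-103' to 1e-103."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_row(line):
    """Split one hit-summary row into its four fields (hypothetical helper)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError("not a hit-summary row: %r" % line)
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


if __name__ == "__main__":
    row = parse_hit_row(
        "gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily p...   383   e-103"
    )
    print(row["id"], row["bits"], row["evalue"])
```

The description is matched lazily so that numbers inside it (for example "protein 3") are not mistaken for the bit score; only the last two whitespace-separated fields are treated as score and E-value.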