BLASTX nr result
ID: Cocculus23_contig00019070
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00019070 (1720 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI39598.3| unnamed protein product [Vitis vinifera]               478   e-132
ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated prot...    476   e-131
ref|XP_007031161.1| Tetratricopeptide repeat (TPR)-like superfam...    466   e-128
ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfam...    465   e-128
ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated prot...    464   e-128
ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated prot...    461   e-127
ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily pr...    456   e-125
ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citr...    454   e-125
ref|XP_007205290.1| hypothetical protein PRUPE_ppa006661mg [Prun...    454   e-125
ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated prot...    443   e-121
gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus not...    439   e-120
ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated prot...    431   e-118
ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat prot...    430   e-117
ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated prot...    429   e-117
ref|XP_007145004.1| hypothetical protein PHAVU_007G201600g [Phas...    426   e-116
ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated prot...    426   e-116
ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated prot...    426   e-116
ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784...    418   e-114
ref|XP_004495650.1| PREDICTED: RNA polymerase II-associated prot...
417 e-114 ref|XP_007031162.1| Tetratricopeptide repeat (TPR)-like superfam... 416 e-113 >emb|CBI39598.3| unnamed protein product [Vitis vinifera] Length = 1097 Score = 478 bits (1229), Expect = e-132 Identities = 267/474 (56%), Positives = 326/474 (68%), Gaps = 48/474 (10%) Frame = -3 Query: 1646 SKSMA-RVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEK--------KKT 1494 S SMA R P+KH RDQ LDFQGFL DLQDWELSLKEKDKK KAQ+ E+ K + Sbjct: 621 SVSMATRFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEKDVPTARGNVKHS 680 Query: 1493 EIISEAKGVARKAPSV-------DYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFK 1335 +S + GV+ + +YSR+ +I ISS F EES PDAA+EKE GNEYFK Sbjct: 681 SKLSSSPGVSLRLGQSRSDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFK 740 Query: 1334 QKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSR 1155 Q+KFKEAIDCYSRSIAL PTAVA+ANRAMAY+K+KRF EAE DC EALNLDDRY KAYSR Sbjct: 741 QRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSR 800 Query: 1154 RATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVG 975 RATARKELGK KE+ ED+EFALRLEPQNQE+KKQY + K+LY+KE+L KAS +K S G Sbjct: 801 RATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQG 860 Query: 974 EQSVG--------STTG-KAVSIKEMGSGS----------TNAKRKIGEQELD-----QS 867 Q VG T G +++S G+G N + E E Sbjct: 861 LQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSK 920 Query: 866 QDGQFNQVTQN--------GHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEF 711 ++G QN H ++E+K S+Q+LA RAAS+A EAAKNI AP +AY+F Sbjct: 921 ENGYLENAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQF 980 Query: 710 ELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVEL 531 E+SW+GL GD ALQA LKAI P+ LP++F++AL+AP+LIDI+KCIATFFV E +LAV+ Sbjct: 981 EVSWRGLLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKF 1040 Query: 530 LDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 369 LDN+TK+SRFDMI MCLS DK D+ K W+EVF ++A P A+TL KLR +YC Sbjct: 1041 LDNLTKISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 1094 >ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated protein 3-like [Vitis vinifera] 
Length = 474 Score = 476 bits (1226), Expect = e-131 Identities = 263/468 (56%), Positives = 322/468 (68%), Gaps = 47/468 (10%) Frame = -3 Query: 1631 RVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEK--------KKTEIISEA 1476 R P+KH RDQ LDFQGFL DLQDWELSLKEKDKK KAQ+ E+ K + +S + Sbjct: 4 RFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEKDVPTARGNVKHSSKLSSS 63 Query: 1475 KGVARKAPSV-------DYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKE 1317 GV+ + +YSR+ +I ISS F EES PDAA+EKE GNEYFKQ+KFKE Sbjct: 64 PGVSLRLGQSRSDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFKQRKFKE 123 Query: 1316 AIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARK 1137 AIDCYSRSIAL PTAVA+ANRAMAY+K+KRF EAE DC EALNLDDRY KAYSRRATARK Sbjct: 124 AIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSRRATARK 183 Query: 1136 ELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVG- 960 ELGK KE+ ED+EFALRLEPQNQE+KKQY + K+LY+KE+L KAS +K S G Q VG Sbjct: 184 ELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQGLQKVGK 243 Query: 959 -------STTG-KAVSIKEMGSGS----------TNAKRKIGEQELD-----QSQDGQFN 849 T G +++S G+G N + E E ++G Sbjct: 244 SVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSKENGYLE 303 Query: 848 QVTQN--------GHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 693 QN H ++E+K S+Q+LA RAAS+A EAAKNI AP +AY+FE+SW+G Sbjct: 304 NAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQFEVSWRG 363 Query: 692 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 513 L GD ALQA LKAI P+ LP++F++AL+AP+LIDI+KCIATFFV E +LAV+ LDN+TK Sbjct: 364 LLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKFLDNLTK 423 Query: 512 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 369 +SRFDMI MCLS DK D+ K W+EVF ++A P A+TL KLR +YC Sbjct: 424 ISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 471 >ref|XP_007031161.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] gi|508719766|gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] Length = 421 Score = 466 bits 
(1198), Expect = e-128 Identities = 245/425 (57%), Positives = 307/425 (72%), Gaps = 7/425 (1%) Frame = -3 Query: 1622 NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPS-- 1449 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ T G + S Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLIDSST 62 Query: 1448 -----VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 1284 DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSRSI L Sbjct: 63 TSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSRSIGL 122 Query: 1283 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 1104 SPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED Sbjct: 123 SPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIED 182 Query: 1103 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEM 924 +EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + KE Sbjct: 183 TEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS-----ETKEN 237 Query: 923 GSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAK 744 G G +A + Q Q T+ + K E+K S+Q+LA AA++A EAAK Sbjct: 238 GLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKPELKASVQELASLAATRAMAEAAK 293 Query: 743 NIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATF 564 NI P TAY+FE+SW+ LSGDRALQA LLK PS LP++F++AL+A +L+DI+KC+ATF Sbjct: 294 NISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATF 353 Query: 563 FVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKL 384 F EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ K W++VF +EA PIE AE L L Sbjct: 354 FREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNL 413 Query: 383 RVKYC 369 R YC Sbjct: 414 RSVYC 418 >ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508719764|gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 422 Score = 465 bits (1196), Expect = e-128 Identities = 245/429 (57%), Positives = 311/429 (72%), Gaps = 11/429 (2%) Frame = -3 Query: 1622 
NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKK-----------KTEIISEA 1476 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ K+ +I + Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLKTNEKGRPTGKSSLIDSS 62 Query: 1475 KGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSR 1296 +R+ DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSR Sbjct: 63 TTSSRQ---YDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSR 119 Query: 1295 SIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKE 1116 SI LSPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKE Sbjct: 120 SIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKE 179 Query: 1115 SFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVS 936 S ED+EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + Sbjct: 180 SIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS-----E 234 Query: 935 IKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKT 756 KE G G +A + Q Q T+ + K E+K S+Q+LA AA++A Sbjct: 235 TKENGLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKPELKASVQELASLAATRAMA 290 Query: 755 EAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKC 576 EAAKNI P TAY+FE+SW+ LSGDRALQA LLK PS LP++F++AL+A +L+DI+KC Sbjct: 291 EAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKC 350 Query: 575 IATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAET 396 +ATFF EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ K W++VF +EA PIE AE Sbjct: 351 VATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEI 410 Query: 395 LSKLRVKYC 369 L LR YC Sbjct: 411 LDNLRSVYC 419 >ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated protein 3-like [Fragaria vesca subsp. 
vesca] Length = 407 Score = 464 bits (1195), Expect = e-128 Identities = 248/427 (58%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQS-NEEKKKTEIISEAKGVAR 1461 MAR P+KH RDQ LDFQGFL+DLQDWELSLK+KDKK + Q N+E K+ R Sbjct: 1 MARAPSKHGRDQALDFQGFLSDLQDWELSLKDKDKKMRPQQPNKEAPKS----------R 50 Query: 1460 KAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALS 1281 + YS + + + +SS F +E+ PDAA+EK+ GNEYFKQKKFKEAIDCYSRSIAL+ Sbjct: 51 DFGTSSYSTNYEPMNTVSSSFTSEDGLPDAASEKDLGNEYFKQKKFKEAIDCYSRSIALT 110 Query: 1280 PTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDS 1101 PTAVAFANRAM+Y+K+KRF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED+ Sbjct: 111 PTAVAFANRAMSYIKIKRFQEAENDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDA 170 Query: 1100 EFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSV--GSTTGKAVSIKE 927 EFALRLEP NQE+KKQY + K+LY+K +L K S +K S +Q V TT SI+ Sbjct: 171 EFALRLEPHNQEIKKQYAEAKSLYEKGILQKVSGAIKISEQDKQKVEKSGTTVNGHSIQP 230 Query: 926 MGSGSTNAK-RKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEA 750 + S + + +G+ ++ NG KQ K S+Q+LA RAAS+AK A Sbjct: 231 VSSTTQRTETTAVGDHT---------KKINTNG----KQASKLSVQELASRAASRAKALA 277 Query: 749 AKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIA 570 A+NI P +AY+FE SW+GLSGDRALQA+LLKAI PS LP++F++ALT +L+DI+KC+ Sbjct: 278 AENITPPSSAYQFEASWRGLSGDRALQAKLLKAISPSALPQIFKNALTVHILVDILKCVT 337 Query: 569 TFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLS 390 TFF++E +LAV +L+N+TKV RFD + M LS DKAD+ K W+EVF +EA PIE AE L Sbjct: 338 TFFIDEMDLAVSVLENLTKVPRFDTLIMFLSSNDKADLAKIWDEVFYNEATPIEFAEKLD 397 Query: 389 KLRVKYC 369 LR KYC Sbjct: 398 NLRAKYC 404 >ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated protein 3-like [Citrus sinensis] Length = 438 Score = 461 bits (1187), Expect = e-127 Identities = 248/435 (57%), Positives = 305/435 (70%), Gaps = 18/435 (4%) Frame = -3 Query: 1619 KHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVD- 1443 KHNRDQ LDFQGFLNDLQDW+LSL EKDKK K +++ K + S K + +PS 
+ Sbjct: 3 KHNRDQALDFQGFLNDLQDWDLSLNEKDKKMKHKASS--KDNLVSSSLKSAKKPSPSGNS 60 Query: 1442 YSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAF 1263 YSR+ + ISS EES+PDA +EKE GNE FKQKKFKEAIDCYSRSIALSPTAVA+ Sbjct: 61 YSRNYDPVSHISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAY 120 Query: 1262 ANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRL 1083 ANRAMAYLKL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES EDSEFALRL Sbjct: 121 ANRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRL 180 Query: 1082 EPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMGSG---- 915 EPQNQE+KKQ + K+LY+KE+ KAS+ ++K V +AV +G Sbjct: 181 EPQNQEIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTTQKTGVAEI 240 Query: 914 -------STNAKRKIGEQELDQSQDGQFNQVT------QNGHGITKQEIKPSIQDLALRA 774 T K E + + +DG T + H K + S+Q+LA RA Sbjct: 241 QDLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASVQELATRA 300 Query: 773 ASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLL 594 S+A EAAKNI PK+AYEFE+SW+G +GD ALQARLLKAI P+ LP++F++AL+A +L Sbjct: 301 TSRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASIL 360 Query: 593 IDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVP 414 IDIVK +ATFF E +LA++ L+ +T V RFD++ MCLS+ADKAD+ K W+E F +E+ P Sbjct: 361 IDIVKVVATFFTGEVDLAIKYLEYLTMVPRFDLVIMCLSLADKADLRKVWDETFCNESTP 420 Query: 413 IECAETLSKLRVKYC 369 IE AE L LR KYC Sbjct: 421 IEYAEILDNLRSKYC 435 >ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508719763|gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 468 Score = 456 bits (1174), Expect = e-125 Identities = 248/464 (53%), Positives = 313/464 (67%), Gaps = 46/464 (9%) Frame = -3 Query: 1622 NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPS-- 1449 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ T G + S Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLIDSST 62 Query: 1448 
-----VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 1284 DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSRSI L Sbjct: 63 TSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSRSIGL 122 Query: 1283 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 1104 SPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED Sbjct: 123 SPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIED 182 Query: 1103 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEM 924 +EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + K + M Sbjct: 183 TEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKSETKENGL-GM 241 Query: 923 GSGSTNAKR---------KIGEQELDQSQDGQFNQVTQNGHG------------------ 825 S S + +R + E D+ + + VT G G Sbjct: 242 HSASNSTQRTGVATVQGYQTKVSEYDKQKKPEKGSVTSEGIGDRNTLAGSRKDGTQLDSG 301 Query: 824 ------------ITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGD 681 K E+K S+Q+LA AA++A EAAKNI P TAY+FE+SW+ LSGD Sbjct: 302 IVGLESIKKNNRTRKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGD 361 Query: 680 RALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRF 501 RALQA LLK PS LP++F++AL+A +L+DI+KC+ATFF EE +LA++ L+N+TKV RF Sbjct: 362 RALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRF 421 Query: 500 DMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 369 DM+ MCLS +KAD+ K W++VF +EA PIE AE L LR YC Sbjct: 422 DMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNLRSVYC 465 >ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] gi|557535662|gb|ESR46780.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] Length = 977 Score = 454 bits (1169), Expect = e-125 Identities = 248/434 (57%), Positives = 305/434 (70%), Gaps = 18/434 (4%) Frame = -3 Query: 1616 HNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVD-Y 1440 HNRDQ LDFQGFLNDLQDW+LSL EKDKK K +++ K + S K + +PS + Y Sbjct: 543 HNRDQALDFQGFLNDLQDWDLSLHEKDKKMKHKASS--KDNLVSSSLKSGEKPSPSGNSY 600 Query: 1439 SRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 1260 SR+ + ISS EES+PDA +EKE GNE 
FKQKKFKEAIDCYSRSIALSPTAVA+A Sbjct: 601 SRNYDPVSRISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYA 660 Query: 1259 NRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLE 1080 NRAMAYLKL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES EDSEFALRLE Sbjct: 661 NRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLE 720 Query: 1079 PQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAV--SIKEMG----- 921 PQNQE+KKQ + K+LY+KE+ KAS+ ++K V +AV +I++ G Sbjct: 721 PQNQEIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTIQKTGVAEIQ 780 Query: 920 ----SGSTNAKRKIGEQELDQSQDGQFNQVT------QNGHGITKQEIKPSIQDLALRAA 771 S T K E + + +DG T + H K + S+Q+LA RA Sbjct: 781 DLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASVQELATRAT 840 Query: 770 SQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLI 591 S+A EAAKNI PK+AYEFE+SW+G +GD ALQARLLKAI P+ LP++F++AL+A +LI Sbjct: 841 SRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASILI 900 Query: 590 DIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPI 411 DIVK +A FF E +LA++ L+ +T V RFD + MCLS+ADKAD+ K W+E F +E PI Sbjct: 901 DIVKVVAMFFPGEVDLAIKYLEYLTMVPRFDFVIMCLSLADKADLRKVWDETFCNELTPI 960 Query: 410 ECAETLSKLRVKYC 369 E AE L LR KYC Sbjct: 961 EYAEILDNLRSKYC 974 >ref|XP_007205290.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica] gi|462400932|gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica] Length = 401 Score = 454 bits (1168), Expect = e-125 Identities = 245/424 (57%), Positives = 300/424 (70%), Gaps = 1/424 (0%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQ-SNEEKKKTEIISEAKGVAR 1461 MAR PNKH RDQ LD WELSLK+KDKK + + S++EK KT + + G Sbjct: 1 MARAPNKHGRDQALD----------WELSLKDKDKKMRPKDSHQEKLKTRDLGTSSG--- 47 Query: 1460 KAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALS 1281 + DYSR+L SI +SS F +E+S PDAA+EKE GNEYFKQKKF+EAIDCYSRSIALS Sbjct: 48 ---NYDYSRNLDSINTMSSSFISEDSLPDAASEKELGNEYFKQKKFREAIDCYSRSIALS 104 Query: 1280 PTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDS 1101 
P+AVA+ANRAMAY+K+K F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED+ Sbjct: 105 PSAVAYANRAMAYIKIKSFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDA 164 Query: 1100 EFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMG 921 EFALRLEPQNQE+KKQYT+ K+LYDK +L KAS K S + VG + K G Sbjct: 165 EFALRLEPQNQEIKKQYTEAKSLYDKTILQKASGAQKNSVQEMRKVGK-----LDTKVNG 219 Query: 920 SGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKN 741 A E+ QD T+ + E+K S+Q+LA RAAS+ K AA+ Sbjct: 220 QSIQPASSSAQITEMTAVQDH-----TKRNNTTRNPEVKASVQELASRAASRVKAVAAEK 274 Query: 740 IKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFF 561 IK P +AY+FE+SW+G SGD A Q LLKAI PS LP++F++ALT P+L+DI+KC+ATFF Sbjct: 275 IKPPNSAYQFEVSWRGFSGDNARQTSLLKAISPSALPQIFKNALTVPILLDIIKCVATFF 334 Query: 560 VEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLR 381 VEE +LAV L+N+T+V RFD + M LS +D AD+ K W+EVF +EA PIE AE L LR Sbjct: 335 VEEMDLAVNYLENLTRVPRFDTLIMFLSSSDNADLVKIWDEVFDNEATPIEYAEKLDNLR 394 Query: 380 VKYC 369 KYC Sbjct: 395 TKYC 398 >ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] gi|449517788|ref|XP_004165926.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] Length = 458 Score = 443 bits (1140), Expect = e-121 Identities = 246/460 (53%), Positives = 306/460 (66%), Gaps = 38/460 (8%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARK 1458 MA KH RDQ LDFQGFLNDLQDWE+S K KDKK K Q+ ++K E + K Sbjct: 1 MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEK------EDRRQTEK 54 Query: 1457 APSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSP 1278 A + DY + ++ +S F E S DAA+EKE GNEYFKQKKFKEAIDCYSRSIALSP Sbjct: 55 ASAADYMKQYDAVNRLSRNFQTEGSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSP 114 Query: 1277 TAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSE 1098 TAVAFANRAMAYLK++RF+EAE DCTEALNLDDRY KAYSRRATARKELGK KE+ ED+E Sbjct: 115 TAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAE 174 Query: 1097 
FALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKA-------V 939 FA RLEP NQE+KKQ+ D +A K +L KAS + S ++++ + A V Sbjct: 175 FAQRLEPNNQEIKKQHADLRAFVGKAILEKASGASRSSTKNKKTLKKSDSDAKIQDIPPV 234 Query: 938 SIKEMGSGSTNAKRKIGEQ----------ELDQSQDGQ------FNQVTQNG-------- 831 S +G A+ ++ E L++S+D +V NG Sbjct: 235 SSSTSRTGLLAARERVEENGGGNAVKTSARLEESEDTSSGAEITSKKVATNGFHKDSSSY 294 Query: 830 -------HGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRAL 672 H KQE+K S+ +LA +AAS++ EAAKNI AP TAY+FE+SW+G SGD+AL Sbjct: 295 LSALERDHLPRKQELKASVYELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQAL 354 Query: 671 QARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMI 492 QARLLK I P+ LP++F+DALTAP+LIDIVKC+ATFF+EE LA+ L+N+ V RF ++ Sbjct: 355 QARLLKTISPAKLPQIFKDALTAPILIDIVKCVATFFIEEPALAISFLENLVNVPRFSIL 414 Query: 491 SMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 372 MCLS ++K D+ K W+EVF EAVPIE AE L LR KY Sbjct: 415 MMCLSSSEKFDLLKIWDEVFCDEAVPIEYAEMLDSLRSKY 454 >gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus notabilis] Length = 450 Score = 439 bits (1130), Expect = e-120 Identities = 245/458 (53%), Positives = 308/458 (67%), Gaps = 35/458 (7%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSL--KEKDKKFKAQSNEEK---KKTEIISEAK 1473 MAR P KH RD+ L FQGFLNDLQDWE SL K+KDKK KAQ++++ ++ I EA Sbjct: 1 MARAPTKHGRDEALAFQGFLNDLQDWEFSLEDKDKDKKMKAQASDKGISVSSSKKIGEA- 59 Query: 1472 GVARKAPS-------------VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQ 1332 G RKA DYSR +I +SS +E+S DAA+EKE GNEYFKQ Sbjct: 60 GKDRKAAGKSSTFEYLSSSMPYDYSRKYDAINQVSSSSISEDSYTDAASEKELGNEYFKQ 119 Query: 1331 KKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKR-----------------FEEAEVDC 1203 KKFKEAIDCYSRSIALS TAVA+ANRAMAYLKLKR F+EAE DC Sbjct: 120 KKFKEAIDCYSRSIALSSTAVAYANRAMAYLKLKRQLLPYLIFFCKSIFLIRFQEAEGDC 179 Query: 1202 TEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDK 1023 TEALN+DDRY KAYSRRATARKELGKLKE ED+EFALRLEP NQE+KKQY++ K+L +K Sbjct: 180 TEALNMDDRYIKAYSRRATARKELGKLKECIEDAEFALRLEPNNQEIKKQYSEAKSLCEK 239 Query: 1022 
ELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQV 843 +L KAS ++ + Q + K ++ G + Q+ + + + ++ Sbjct: 240 VILQKASVALENT---VQKMQKAEKKDTKVQNNGIQPVES----ATQKTEAAVAEDYTKI 292 Query: 842 TQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQAR 663 Q KQE K S+Q+LA RAAS+A AKNI++P +AY+FE+SW+GLSGDRALQA Sbjct: 293 NQTAK---KQEPKASVQELASRAASRAMNGTAKNIRSPTSAYQFEVSWRGLSGDRALQAS 349 Query: 662 LLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMC 483 LLK + P LP++F+++LT P+L+DIVKCIATFF+EE ++ V L+N+TKV RFD++ MC Sbjct: 350 LLKTVSPGALPQIFKNSLTVPILVDIVKCIATFFIEEMDVTVTFLENLTKVPRFDILVMC 409 Query: 482 LSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 369 L+ D+AD+ K W EVF EA PIE AE L LR KYC Sbjct: 410 LTSKDRADLVKIWNEVFCKEATPIEHAEKLDNLRSKYC 447 >ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X1 [Solanum tuberosum] Length = 468 Score = 431 bits (1107), Expect = e-118 Identities = 238/467 (50%), Positives = 309/467 (66%), Gaps = 45/467 (9%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 1488 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 1487 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 1329 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 1328 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 1149 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 1148 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQ 969 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+KE+ + S S Q Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATDVSAQRAQ 240 Query: 968 SVGSTTGKAVSIKEMGSGSTNAKR--KIGEQELDQSQDG---------QFNQ-------- 846 G T I+ + S S I +E ++ G Q N Sbjct: 241 KSGKTIKSGPVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPT 300 Query: 845 
---------VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 693 + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+G Sbjct: 301 VPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRG 360 Query: 692 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 513 LSGDR LQ +LLK P+ LP++F++AL+AP+L+DIV+C+ATFF+E+ LA+ L+++TK Sbjct: 361 LSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTK 420 Query: 512 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 372 V RFDMI MCLS DK+++ K WEE+F EA E + TL LRV Y Sbjct: 421 VPRFDMIIMCLSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 465 >ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat protein [Arabidopsis thaliana] gi|53828529|gb|AAU94374.1| At1g56440 [Arabidopsis thaliana] gi|59958350|gb|AAX12885.1| At1g56440 [Arabidopsis thaliana] gi|110743110|dbj|BAE99447.1| hypothetical protein [Arabidopsis thaliana] gi|332195274|gb|AEE33395.1| carboxylate clamp-tetratricopeptide repeat [Arabidopsis thaliana] Length = 476 Score = 430 bits (1105), Expect = e-117 Identities = 233/478 (48%), Positives = 316/478 (66%), Gaps = 55/478 (11%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARK 1458 MAR P+KH RDQ DFQGF NDLQDWELSLK+KDKK K Q + G + Sbjct: 1 MARSPSKHGRDQTQDFQGFFNDLQDWELSLKDKDKKIKQQPANSSNPSSETFRPSGSGK- 59 Query: 1457 APSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSP 1278 D+++ +SI D+SS E S D+++EKE GNE+FKQKKF EAIDCYSRSIALSP Sbjct: 60 ---YDFAKKYRSIRDLSSSLIGE-SLLDSSSEKEQGNEFFKQKKFNEAIDCYSRSIALSP 115 Query: 1277 TAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSE 1098 AV +ANRAMAYLK+KR+ EAEVDCTEALNLDDRY KAYSRRATARKELG +KE+ ED+E Sbjct: 116 NAVTYANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSRRATARKELGMIKEAKEDAE 175 Query: 1097 FALRLEPQNQELKKQYTDTKALYDKELLAKAS--------EMVKKSRVGEQ--------- 969 FALRLEP++QELKKQY D K+L +KE++ KA+ E++K S + ++ Sbjct: 176 FALRLEPESQELKKQYADIKSLLEKEIIEKATGAMQSTAQELLKTSGLDKKIQKPKTEMT 235 Query: 968 ----SVGSTTGKAVSIKEMGSGSTNAKRKIGE-QELDQSQDGQF-----------NQVTQ 837 ++ + T + + +GS ++ K+ I Q 
++S++G +VT Sbjct: 236 SKPVTLVAKTNRDIVQPVLGSNESSGKKLIENIQPEEKSKEGSMKIPAITEILDSKKVTP 295 Query: 836 NGHGITKQ----------------------EIKPSIQDLALRAASQAKTEAAKNIKAPKT 723 K+ E+KPS+Q+LA AAS A TEA+KNIK PK+ Sbjct: 296 GSQSYEKEAKPSDRNGTQPSGPENQVSKQLELKPSVQELAAHAASLAMTEASKNIKTPKS 355 Query: 722 AYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETEL 543 AYEFE SW+ SGD AL+++LLK PS+LP++F++ALT+P+L+DI+KC+A+FF E+ +L Sbjct: 356 AYEFENSWRSFSGDSALRSQLLKVTTPSSLPQIFKNALTSPVLVDIIKCVASFFTEDMDL 415 Query: 542 AVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 369 AV+ ++N+TKV RF+M+ MCL+ +K ++ K WE+VF ++A P+E AE L KLR +YC Sbjct: 416 AVKYIENLTKVPRFNMLVMCLTSTEKNELLKIWEDVFCNKATPMEYAEVLDKLRSRYC 473 >ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated protein 3-like [Solanum lycopersicum] Length = 470 Score = 429 bits (1103), Expect = e-117 Identities = 235/468 (50%), Positives = 309/468 (66%), Gaps = 46/468 (9%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQS------------------N 1512 MARVP+ H+RDQ D QG N+LQDWEL+LK KDKK K+Q+ + Sbjct: 1 MARVPSNHSRDQFQDMQGLFNNLQDWELALKGKDKKMKSQAGGKETLKEDWSRTSEPLTS 60 Query: 1511 EEKKKTEIISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQ 1332 + T+ + ++ + A YS++ I +SS+ +EES+ +A +EKE GNE FKQ Sbjct: 61 PQANGTQQVGKSTSIRNAAGPYSYSKNYNPISHLSSELISEESNINANSEKELGNECFKQ 120 Query: 1331 KKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRR 1152 KKF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR Sbjct: 121 KKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRR 180 Query: 1151 ATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGE 972 +T+RKELGKLKES ED+EFAL LEP+N E+KKQY + KALY+KE+L + S S G Sbjct: 181 STSRKELGKLKESIEDAEFALWLEPRNPEIKKQYGEVKALYEKEILKRVSGATDVSAQGP 240 Query: 971 QSVGSTTGKAVSIKEMGSGS------------TNAKRKIGEQELDQSQDGQFNQ------ 846 Q G T I+ + S S N + +G +++ + N+ Sbjct: 241 QKSGKTIKIGPVIQSVSSSSQKVAEVRTIPAKENNRDVLGTAKVEDTHMQISNKDSDASP 300 Query: 845 ----------VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWK 
696 + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+ Sbjct: 301 TVPTLNLAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWR 360 Query: 695 GLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVT 516 GLSGDR LQ +LLK P+ LP++F++AL+AP+L+DIV+CIATFF+E+ LA+ L+++T Sbjct: 361 GLSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCIATFFIEDMNLAIRYLEDLT 420 Query: 515 KVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 372 KV RFDMI MCLS ADK+++ K WEE+F V E + TL LRV Y Sbjct: 421 KVPRFDMIIMCLSSADKSELLKIWEEIFCK--VAEEHSATLGALRVSY 466 >ref|XP_007145004.1| hypothetical protein PHAVU_007G201600g [Phaseolus vulgaris] gi|561018194|gb|ESW16998.1| hypothetical protein PHAVU_007G201600g [Phaseolus vulgaris] Length = 465 Score = 426 bits (1096), Expect = e-116 Identities = 238/464 (51%), Positives = 306/464 (65%), Gaps = 52/464 (11%) Frame = -3 Query: 1598 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEE--KKKTEIISEAKGV--ARKAPSVDYSRS 1431 +DFQGFLNDLQDWELS K+K + K+Q + K + ++ + GV A KA ++ + R+ Sbjct: 1 MDFQGFLNDLQDWELSRKDKTQTLKSQKENQFTKASSSRLTGSVGVEKASKADAISFDRA 60 Query: 1430 LKSIG--DISS---------KFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 1284 S G D+S F E PDAA+EK+ GNE+FKQKKFKEA DCYSRSIAL Sbjct: 61 RNSQGLYDLSKINDPLNRLHGSFVPEDVPDAASEKDLGNEFFKQKKFKEARDCYSRSIAL 120 Query: 1283 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 1104 SPTAVA+ANRAMA +KL+RF+EAE DCTEAL+LDDRY KAYSRRATARKELGK+KES ED Sbjct: 121 SPTAVAYANRAMANIKLRRFQEAEDDCTEALDLDDRYIKAYSRRATARKELGKIKESMED 180 Query: 1103 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKA------ 942 +EFALRLEP NQE+KKQY D K+LY+K++L KAS ++++ G VG + K Sbjct: 181 AEFALRLEPNNQEIKKQYADAKSLYEKDILHKASGALRRTVQGTNKVGKSDEKVNGGSIH 240 Query: 941 -------------VSIKEMGSGSTNAKRKIGEQELD----------QSQDGQ-------- 855 V+ K++ K + +E+D Q+Q G Sbjct: 241 PISHGAQKSGPAEVNHKKVNEQQVPIKESLVTEEVDSRDTITRKRPQAQGGDDSKKSLSA 300 Query: 854 FNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRA 675 N + Q H I K E K S+Q LA RAAS+A EAAKNI P TAYEFE+SW+ LSGD A Sbjct: 301 
SNSLEQRNHRIIKPEFKASVQQLASRAASRAMAEAAKNITPPTTAYEFEVSWRALSGDLA 360 Query: 674 LQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDM 495 LQARLLKAI P LPK+F++AL++ +L+DI+KC+++FF E+ +L V ++++ KV RFDM Sbjct: 361 LQARLLKAISPRELPKIFKNALSSTILVDIIKCLSSFFTEDMDLVVSYMEHLIKVPRFDM 420 Query: 494 ISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 363 I +CLS +K DI K W+EVF S+A PIE AE L LR K+C G Sbjct: 421 IVLCLSSTNKDDIRKIWDEVFRSKATPIEYAEILDNLRSKFCLG 464 >ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X2 [Solanum tuberosum] Length = 467 Score = 426 bits (1095), Expect = e-116 Identities = 236/467 (50%), Positives = 313/467 (67%), Gaps = 45/467 (9%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 1488 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 1487 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 1329 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 1328 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 1149 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 1148 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDK-------------ELLAK 1008 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+K + K Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKIRKRVSGATDVSAQRAQK 240 Query: 1007 ASEMVKKSRVGEQSVGSTTGKAVSIKEMGSGSTN------AKRKIGEQELDQSQDGQFNQ 846 + + +K V QSV S++ K + + + N AK + +++ Sbjct: 241 SGKTIKSGPV-IQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPT 299 Query: 845 V---------TQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 693 V + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+G Sbjct: 300 VPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRG 359 Query: 692 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 513 LSGDR LQ +LLK P+ 
LP++F++AL+AP+L+DIV+C+ATFF+E+ LA+ L+++TK Sbjct: 360 LSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTK 419 Query: 512 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 372 V RFDMI MCLS DK+++ K WEE+F EA E + TL LRV Y Sbjct: 420 VPRFDMIIMCLSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 464 >ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4 [Solanum tuberosum] Length = 419 Score = 426 bits (1094), Expect = e-116 Identities = 229/439 (52%), Positives = 298/439 (67%), Gaps = 17/439 (3%) Frame = -3 Query: 1637 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 1488 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 1487 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 1329 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 1328 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 1149 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 1148 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQ 969 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+KE K Q Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKENNRDVPGTAKVEDTHMQ 240 Query: 968 SVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQD 789 + + ++ + AK+ H I+KQE++ S+Q+ Sbjct: 241 INNKDSDASPTVPTLNPAFGTAKKT---------------------HKISKQELEESVQE 279 Query: 788 LALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDAL 609 LA RAA AKTEAAKNI AP +AY+FE+SW+GLSGDR LQ +LLK P+ LP++F++AL Sbjct: 280 LAARAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQTQLLKVTSPAMLPRIFKNAL 339 Query: 608 TAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFS 429 +AP+L+DIV+C+ATFF+E+ LA+ L+++TKV RFDMI MCLS DK+++ K WEE+F Sbjct: 340 SAPMLMDIVRCVATFFIEDMNLAIRYLEDLTKVPRFDMIIMCLSSTDKSELLKIWEEIFC 399 Query: 428 SEAVPIECAETLSKLRVKY 372 EA E + TL LRV Y Sbjct: 
400 KEAE--EHSATLGALRVPY 416 >ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784528 isoform X1 [Glycine max] Length = 459 Score = 418 bits (1075), Expect = e-114 Identities = 236/461 (51%), Positives = 301/461 (65%), Gaps = 49/461 (10%) Frame = -3 Query: 1598 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVDYSRSLKSI 1419 +DFQGFLNDLQDWELS K+K + K ++ + + E A K ++ + R+ S Sbjct: 1 MDFQGFLNDLQDWELSRKDKTRAQKENASSSQLTGSVGVEK---ASKGDTISFDRARNSP 57 Query: 1418 GD-----ISSKF------FAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTA 1272 G I+ F F E PDA +EK+ GNE+FKQKKFKEA DCYSRSIALSPTA Sbjct: 58 GQYDLSRINDPFNRVHSSFVPEDVPDAVSEKDLGNEFFKQKKFKEARDCYSRSIALSPTA 117 Query: 1271 VAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFA 1092 VA+ANRAMA +KL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGK+KES +D+ FA Sbjct: 118 VAYANRAMANIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKIKESMDDAAFA 177 Query: 1091 LRLEPQNQELKKQYTDTKALYDKELLAKAS-------EMVKKSRVGEQSV--GSTTGKAV 939 LRLEP NQE+KKQY D K+LY+K++L KAS + +KS+ E+ + GS + Sbjct: 178 LRLEPNNQEIKKQYADAKSLYEKDILQKASGALRSTVQGTQKSQKSEEKINGGSIQPISH 237 Query: 938 SIKEMGSGSTNAKRKIGEQEL---------------------DQSQDGQ--------FNQ 846 S ++ G N +K EQ++ QSQ G N Sbjct: 238 STQKSGLAEVNHHKKDNEQQILVKESLLTEDVDSRETKARSRPQSQGGDGSKEGLSASNS 297 Query: 845 VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQA 666 + Q H ITK E+K S+Q LA RAAS+ EAAKN+ P TAY+FE+SW+ SGD ALQA Sbjct: 298 LEQRNHSITKLEMKASVQQLASRAASRVVAEAAKNVTPPTTAYQFEVSWRAFSGDLALQA 357 Query: 665 RLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISM 486 RLLKAI P LPK+F++AL++ +LI+I+KC+A+FF E+ +L V L+++TKV RFD+I M Sbjct: 358 RLLKAISPHELPKIFKNALSSAILIEIIKCLASFFTEDMDLVVSYLEHLTKVPRFDVIVM 417 Query: 485 CLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 363 CLS +K DI K W+EVFSSEA PIE AE L LR K+ G Sbjct: 418 CLSSTNKDDIRKIWDEVFSSEATPIEYAEILDNLRSKFGLG 458 >ref|XP_004495650.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4 [Cicer arietinum] Length = 454 Score = 417 bits (1073), Expect = e-114 Identities = 
231/455 (50%), Positives = 300/455 (65%), Gaps = 43/455 (9%) Frame = -3 Query: 1598 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKG-------VARKAPSVDY 1440 +DFQGFLNDLQDWE+S K K K K+ + + + ++G + A D+ Sbjct: 1 MDFQGFLNDLQDWEISTKNKAPKTKSHKENSGRSVGVENGSRGDTISFDHAKKSAAQYDF 60 Query: 1439 SRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 1260 SR+ + ++S F A E PDAA+EK+ GNE+FKQKKFKEAIDCYSRSIALSPTAVA+A Sbjct: 61 SRNNDLLSRVTSSF-ASEDVPDAASEKDLGNEFFKQKKFKEAIDCYSRSIALSPTAVAYA 119 Query: 1259 NRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLE 1080 NRAMA +KL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGK KES ED+EFALRLE Sbjct: 120 NRAMARIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKNKESMEDAEFALRLE 179 Query: 1079 PQNQELKKQYTDTKALYDKELLAKASEMVKKS--RVGEQSV---GSTTGKAVSIKEMGSG 915 P NQE+KKQY D K+LY+KE++ K S+ ++ + ++G+ GS++ ++VS SG Sbjct: 180 PNNQEVKKQYADAKSLYEKEIVHKTSKALRNTVQKLGKSETKVNGSSSIQSVSHDTQKSG 239 Query: 914 ST-------------------------NAKRKIGEQELDQSQDGQ------FNQVTQNGH 828 S N K G + Q+ +G N + Q H Sbjct: 240 SAEVHHRTKGNECQIPAIESVLMEEIDNKDTKSGSRTQGQAGNGSKEGYSASNSLEQRNH 299 Query: 827 GITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAI 648 K E+K S+Q LA +AAS+A +AAKNI P TAY+FE+SW+G +GD ALQA LLKA+ Sbjct: 300 RTRKPEMKASVQQLASQAASRAMADAAKNITPPTTAYQFEVSWRGFAGDCALQACLLKAM 359 Query: 647 PPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMAD 468 P LPK+F++AL++ LLI+I+KC+A+FF E+ +L V +DN+TKV RFD+I MCL A Sbjct: 360 SPHELPKIFKNALSSTLLIEIIKCVASFFAEDVDLVVSYMDNLTKVPRFDVIVMCLPSAA 419 Query: 467 KADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 363 K D+ K W EVF SEA P+E AE L LR K+ G Sbjct: 420 KDDLRKIWNEVFCSEATPMEYAEILGSLRSKFYLG 454 >ref|XP_007031162.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 5 [Theobroma cacao] gi|508719767|gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 5 [Theobroma cacao] Length = 389 Score = 416 bits (1068), Expect = e-113 Identities = 219/388 (56%), Positives = 276/388 (71%) Frame = -3 Query: 1532 
KFKAQSNEEKKKTEIISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEH 1353 K + ++NE+ + T S + DY ++ +SS F EE+ PDAA+EKE Sbjct: 8 KEQLKTNEKGRPTGKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKEL 67 Query: 1352 GNEYFKQKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRY 1173 GNEYFKQKKFKEAIDCYSRSI LSPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY Sbjct: 68 GNEYFKQKKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRY 127 Query: 1172 TKAYSRRATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMV 993 KAYSRRATARKELGKLKES ED+EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS ++ Sbjct: 128 IKAYSRRATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVL 187 Query: 992 KKSRVGEQSVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQ 813 +KS Q VG + KE G G +A + Q Q T+ + K Sbjct: 188 RKSMQEAQEVGKS-----ETKENGLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKP 238 Query: 812 EIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTL 633 E+K S+Q+LA AA++A EAAKNI P TAY+FE+SW+ LSGDRALQA LLK PS L Sbjct: 239 ELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSAL 298 Query: 632 PKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIG 453 P++F++AL+A +L+DI+KC+ATFF EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ Sbjct: 299 PQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLL 358 Query: 452 KTWEEVFSSEAVPIECAETLSKLRVKYC 369 K W++VF +EA PIE AE L LR YC Sbjct: 359 KVWDDVFCNEATPIEWAEILDNLRSVYC 386