BLASTX nr result
ID: Cocculus22_contig00011542
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00011542 (1673 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score     E
Sequences producing significant alignments:                       (bits)  Value

ref|XP_002272014.2| PREDICTED: transcription elongation regulato...  364    6e-98
ref|XP_006847887.1| hypothetical protein AMTR_s00029p00102340 [A...  329    2e-87
ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-l...  327    1e-86
ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-l...  327    1e-86
ref|XP_006590814.1| PREDICTED: pre-mRNA-processing protein 40C-l...  327    1e-86
ref|XP_006590813.1| PREDICTED: pre-mRNA-processing protein 40C-l...  327    1e-86
ref|XP_006590812.1| PREDICTED: pre-mRNA-processing protein 40C-l...  327    1e-86
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...  327    1e-86
ref|XP_007131664.1| hypothetical protein PHAVU_011G031500g [Phas...  318    5e-84
ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phas...  318    5e-84
ref|XP_002315059.2| hypothetical protein POPTR_0010s17750g [Popu...  311    6e-82
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...  305    4e-80
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...  303    2e-79
ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prun...  301    4e-79
ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative ...  299    2e-78
ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C-l...  294    7e-77
ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-l...  286    1e-74
gb|EXC33082.1| Transcription elongation regulator 1 [Morus notab...
278 4e-72 ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-l... 275 4e-71 ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-l... 275 4e-71 >ref|XP_002272014.2| PREDICTED: transcription elongation regulator 1-like [Vitis vinifera] gi|297738259|emb|CBI27460.3| unnamed protein product [Vitis vinifera] Length = 1046 Score = 364 bits (935), Expect = 6e-98 Identities = 233/558 (41%), Positives = 288/558 (51%), Gaps = 50/558 (8%) Frame = -1 Query: 1526 GGPIMAHSTPSSTAAGLGP---QPPKSPTGNTSDTLQESARXXXXXXXXXXXXXXXXSYN 1356 GGP TP+ A + + +G S+++QESA+ SY+ Sbjct: 26 GGPSGGPPTPTGAIAPASVATIRTSEGASGTASNSIQESAQGKFVNAPPHVLPGPSFSYS 85 Query: 1355 VVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTT----------------- 1227 +P+ AS +SQQ + V+ SN AS Q PVPG SS++ Sbjct: 86 GIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSSSGPSFSYNIAHKGAGFPG 145 Query: 1226 ---------------GP-----SFSYNINTHNNIDXXXXXXXXXXXXXSGAAAQDAGXXX 1107 GP SFS+N N SGA AQ+AG Sbjct: 146 SQPFQSSTSIASGPRGPTPNAASFSFNGNPQ-----LVQKDQTLKSDNSGAVAQEAGSMS 200 Query: 1106 XXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWMPTAPSVSGPLXXXXXXXXXXXXXX 927 PTT+WMP+ PS P Sbjct: 201 SASHVSQSVPFPCSSSTMSVSSSPKMG---PTTLWMPSNPSFPVP------SGMPVTPGT 251 Query: 926 XXXXGILPFAP-----SVRSTAIDSSSSALQRPIISSTTSLPSHPSGQQLVYPSYPSLPA 762 GI P P +V S ++D SSS + R I + + S+P+ QQ +YPSY SLPA Sbjct: 252 PGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAP-VSSNPAIQQQIYPSYSSLPA 310 Query: 761 MAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVPLLNSQVPAISSVG 582 Q WLQ +GGLPRPPF+PY V P PFPL HG+ P+VPL +SQ P ++ VG Sbjct: 311 TNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVG 370 Query: 581 PPGYA--SASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA---AVTNEDIDAWTAHRT 417 G SA++ A S + ELPPPGID NK +G G A NE +DAWTAH+T Sbjct: 371 TAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKT 430 Query: 416 ETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWALVTTDDGKKYYY 237 +TG VYYYNA+TGESTYEKPS FKGE DK TVQPTPVS EK+ G+DWALVTT+DGKKYYY Sbjct: 431 DTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYY 490 Query: 
236 NDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSASLSAPAVNTGGR 57 N KTK+SSWQIP E+TE+RKK+D +L + N EKG +LSAPAV TGGR Sbjct: 491 NTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGR 550 Query: 56 DATPLRPSAAPGSSSALD 3 DATPLR SA PGS+SALD Sbjct: 551 DATPLRTSAVPGSASALD 568 >ref|XP_006847887.1| hypothetical protein AMTR_s00029p00102340 [Amborella trichopoda] gi|548851192|gb|ERN09468.1| hypothetical protein AMTR_s00029p00102340 [Amborella trichopoda] Length = 808 Score = 329 bits (844), Expect = 2e-87 Identities = 202/491 (41%), Positives = 264/491 (53%), Gaps = 50/491 (10%) Frame = -1 Query: 1325 SSQQSSATPVMKSNQPA---SAATLQPPVPGQSSTT------------------------ 1227 ++ ++A+ M+ +PA SAA+LQPPVPGQSS + Sbjct: 121 TTTSATASNPMQGGKPAGPTSAASLQPPVPGQSSVSVHPNSWDPERPVQNALAQARPPFL 180 Query: 1226 ---GP----SFSYNINTHNNIDXXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXX 1068 GP FS++ N+ + S A AQ+A Sbjct: 181 VRKGPPSTSGFSFSGNSQSVSSEDSQKHQASNSDASAAVAQEA-KTSQPSSSTAQTTPLP 239 Query: 1067 XXXXXXXXXXXXXSNFYPTTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSV 888 N Y T +MP AP GP L + ++ Sbjct: 240 APSSTTSRPVSSSPNTYATPFYMPKAPPFPGPPRLPVTPGTPGPPGIALSAPQLSSSVNI 299 Query: 887 RSTAIDSSSSALQRPIISSTT-------SLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQ 729 R + ID++S A+ RP I+S+ S+P + Q +Y YP+LP + PPPQA W+ Sbjct: 300 RPSVIDTNS-AIMRPNIASSAPGTSNAASVPITQTAQPPIYSPYPTLPGVVPPPQAMWMH 358 Query: 728 SQPIGGLPRPPFLPYSGVLPGPFPLAGHGV-VPPAVPLLNSQVPAISSVGPPG-YASASM 555 +GGL RPPFLPY G PGPFP+ + VPP +SQ P +S +GPPG A Sbjct: 359 PSQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAMPDSSQPPGVSPIGPPGGIPLADH 418 Query: 554 GSGPPAGNSLVQPELPPPGID-------YNKKADGGGAAVTNEDIDAWTAHRTETGAVYY 396 G+G ++ + + PPPGID Y K D AV+NED D WTAH+T+TGAVYY Sbjct: 419 GAGIQV--TISEEQSPPPGIDKEKDTIDYTNKDDN---AVSNEDTDQWTAHKTDTGAVYY 473 Query: 395 YNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVS 216 YNA+TGESTYEKP GFKGEPDK +Q TPVS EK+ G+DWALV T+DGKKYYYN K+K+S Sbjct: 474 YNALTGESTYEKPPGFKGEPDKVILQRTPVSWEKLVGTDWALVATNDGKKYYYNTKSKIS 533 Query: 215 
SWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRP 36 SWQ+P EV ELRKK++ + PVQNAG ++KGS S+SLSAPA+NTGGR+A + Sbjct: 534 SWQVPPEVAELRKKQEADAALKANAPVQNAGISSDKGSVSSSLSAPAINTGGREAMTFKS 593 Query: 35 SAAPGSSSALD 3 + AP SSSALD Sbjct: 594 ATAPVSSSALD 604 >ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] Length = 968 Score = 327 bits (838), Expect = 1e-86 Identities = 192/457 (42%), Positives = 245/457 (53%), Gaps = 4/457 (0%) Frame = -1 Query: 1361 YNVVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNID 1182 Y ++ N + AS SSQQSS P MKSN + +QPP G S PSFSYNI I Sbjct: 43 YGMLQN-VNASGSSQQSSTHPGMKSNSAVNPMVVQPP--GVSLHAAPSFSYNIPQSGAIF 99 Query: 1181 XXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIW 1002 + AQD G N+ P T W Sbjct: 100 SSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDP---NYRPATSW 156 Query: 1001 MPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTS 822 MPTA +S P+ I+ P+ ST DSS +AL RP + T++ Sbjct: 157 MPTA--MSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMP-TSA 213 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 + S P+ Q P YPS+PAMA PPQ WLQ + G+ RPP+L Y PGPFP G Sbjct: 214 IASDPTAPQKGLP-YPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARG 272 Query: 641 VVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA 462 V PAVP+ +SQ P ++ VG G S S G + +Q E+ D KK + Sbjct: 273 VALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDT 332 Query: 461 ----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEK 294 A N+ +DAWTAH+TE G +YYYNAVTGESTY+KP+GFKGE + + QP PVSM Sbjct: 333 VNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMD 392 Query: 293 VAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALA 114 + G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + V N L+ Sbjct: 393 LPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLS 452 Query: 113 EKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 ++GSG +L+APA+NTGGRDA L+PS+ S SALD Sbjct: 453 DRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALD 489 
>ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] Length = 980 Score = 327 bits (838), Expect = 1e-86 Identities = 192/457 (42%), Positives = 245/457 (53%), Gaps = 4/457 (0%) Frame = -1 Query: 1361 YNVVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNID 1182 Y ++ N + AS SSQQSS P MKSN + +QPP G S PSFSYNI I Sbjct: 55 YGMLQN-VNASGSSQQSSTHPGMKSNSAVNPMVVQPP--GVSLHAAPSFSYNIPQSGAIF 111 Query: 1181 XXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIW 1002 + AQD G N+ P T W Sbjct: 112 SSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDP---NYRPATSW 168 Query: 1001 MPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTS 822 MPTA +S P+ I+ P+ ST DSS +AL RP + T++ Sbjct: 169 MPTA--MSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMP-TSA 225 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 + S P+ Q P YPS+PAMA PPQ WLQ + G+ RPP+L Y PGPFP G Sbjct: 226 IASDPTAPQKGLP-YPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARG 284 Query: 641 VVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA 462 V PAVP+ +SQ P ++ VG G S S G + +Q E+ D KK + Sbjct: 285 VALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDT 344 Query: 461 ----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEK 294 A N+ +DAWTAH+TE G +YYYNAVTGESTY+KP+GFKGE + + QP PVSM Sbjct: 345 VNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMD 404 Query: 293 VAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALA 114 + G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + V N L+ Sbjct: 405 LPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLS 464 Query: 113 EKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 ++GSG +L+APA+NTGGRDA L+PS+ S SALD Sbjct: 465 DRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALD 501 >ref|XP_006590814.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X3 [Glycine max] Length = 850 Score = 327 bits (838), Expect = 1e-86 Identities = 192/457 (42%), Positives = 245/457 (53%), Gaps = 4/457 (0%) Frame = -1 Query: 1361 
YNVVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNID 1182 Y ++ N + AS SSQQSS P MKSN + +QPP G S PSFSYNI I Sbjct: 55 YGMLQN-VNASGSSQQSSTHPGMKSNSAVNPMVVQPP--GVSLHAAPSFSYNIPQSGAIF 111 Query: 1181 XXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIW 1002 + AQD G N+ P T W Sbjct: 112 SSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDP---NYRPATSW 168 Query: 1001 MPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTS 822 MPTA +S P+ I+ P+ ST DSS +AL RP + T++ Sbjct: 169 MPTA--MSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMP-TSA 225 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 + S P+ Q P YPS+PAMA PPQ WLQ + G+ RPP+L Y PGPFP G Sbjct: 226 IASDPTAPQKGLP-YPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARG 284 Query: 641 VVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA 462 V PAVP+ +SQ P ++ VG G S S G + +Q E+ D KK + Sbjct: 285 VALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDT 344 Query: 461 ----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEK 294 A N+ +DAWTAH+TE G +YYYNAVTGESTY+KP+GFKGE + + QP PVSM Sbjct: 345 VNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMD 404 Query: 293 VAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALA 114 + G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + V N L+ Sbjct: 405 LPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLS 464 Query: 113 EKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 ++GSG +L+APA+NTGGRDA L+PS+ S SALD Sbjct: 465 DRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALD 501 >ref|XP_006590813.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] Length = 968 Score = 327 bits (838), Expect = 1e-86 Identities = 192/457 (42%), Positives = 245/457 (53%), Gaps = 4/457 (0%) Frame = -1 Query: 1361 YNVVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNID 1182 Y ++ N + AS SSQQSS P MKSN + +QPP G S PSFSYNI I Sbjct: 43 YGMLQN-VNASGSSQQSSTHPGMKSNSAVNPMVVQPP--GVSLHAAPSFSYNIPQSGAIF 99 Query: 1181 XXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIW 1002 
+ AQD G N+ P T W Sbjct: 100 SSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDP---NYRPATSW 156 Query: 1001 MPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTS 822 MPTA +S P+ I+ P+ ST DSS +AL RP + T++ Sbjct: 157 MPTA--MSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMP-TSA 213 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 + S P+ Q P YPS+PAMA PPQ WLQ + G+ RPP+L Y PGPFP G Sbjct: 214 IASDPTAPQKGLP-YPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARG 272 Query: 641 VVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA 462 V PAVP+ +SQ P ++ VG G S S G + +Q E+ D KK + Sbjct: 273 VALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDT 332 Query: 461 ----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEK 294 A N+ +DAWTAH+TE G +YYYNAVTGESTY+KP+GFKGE + + QP PVSM Sbjct: 333 VNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMD 392 Query: 293 VAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALA 114 + G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + V N L+ Sbjct: 393 LPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLS 452 Query: 113 EKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 ++GSG +L+APA+NTGGRDA L+PS+ S SALD Sbjct: 453 DRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALD 489 >ref|XP_006590812.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] Length = 980 Score = 327 bits (838), Expect = 1e-86 Identities = 192/457 (42%), Positives = 245/457 (53%), Gaps = 4/457 (0%) Frame = -1 Query: 1361 YNVVPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNID 1182 Y ++ N + AS SSQQSS P MKSN + +QPP G S PSFSYNI I Sbjct: 55 YGMLQN-VNASGSSQQSSTHPGMKSNSAVNPMVVQPP--GVSLHAAPSFSYNIPQSGAIF 111 Query: 1181 XXXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIW 1002 + AQD G N+ P T W Sbjct: 112 SSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDP---NYRPATSW 168 Query: 1001 MPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTS 822 MPTA +S P+ I+ P+ ST DSS +AL RP + T++ Sbjct: 169 
MPTA--MSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMP-TSA 225 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 + S P+ Q P YPS+PAMA PPQ WLQ + G+ RPP+L Y PGPFP G Sbjct: 226 IASDPTAPQKGLP-YPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARG 284 Query: 641 VVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA 462 V PAVP+ +SQ P ++ VG G S S G + +Q E+ D KK + Sbjct: 285 VALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDT 344 Query: 461 ----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEK 294 A N+ +DAWTAH+TE G +YYYNAVTGESTY+KP+GFKGE + + QP PVSM Sbjct: 345 VNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMD 404 Query: 293 VAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALA 114 + G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + V N L+ Sbjct: 405 LPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLS 464 Query: 113 EKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 ++GSG +L+APA+NTGGRDA L+PS+ S SALD Sbjct: 465 DRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALD 501 >ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao] gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] Length = 816 Score = 327 bits (838), Expect = 1e-86 Identities = 178/346 (51%), Positives = 218/346 (63%), Gaps = 5/346 (1%) Frame = -1 Query: 1025 NFYPTTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVR----STAIDSSSS 858 NF P T WMPT S P+ PSV+ S A+DS SS Sbjct: 8 NFAPVTSWMPTTQSF--PMSTESSGTSGTAGHPG-------LVPSVQMITASAAVDSPSS 58 Query: 857 ALQRPIISSTTSLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSG 678 A+ RP + + S+ + QQ +YP+Y LP+MA PQ W+Q P+GG PRPPF+PY Sbjct: 59 AVPRP----SAPVSSNQAVQQQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPT 114 Query: 677 VLPGPFPLAGHGVVPPAVPLLNSQVPAISSVGPPGYA-SASMGSGPPAGNSLVQPELPPP 501 + PGPFP A G+ PA P +SQ P +S + +A S ++ + + S +Q PP Sbjct: 115 IYPGPFPSASSGMPHPA-PSSDSQPPGVSPLATSPFAPSIAIPANQSSVASGIQTGFPPQ 173 Query: 500 GIDYNKKADGGGAAVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATV 321 GID N+ A NE D WTAH+T+TG 
VYYYNA+TGESTYEKP+GFKGEPDK V Sbjct: 174 GID-NRNVGTRVEAAVNEQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPV 232 Query: 320 QPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTT 141 QPTPVS+E++AG++WALVTT DGKKYYYN KTK+SSWQIP EV ELRKK+D + Sbjct: 233 QPTPVSVEQLAGTEWALVTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAV 292 Query: 140 PVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 PV N +AEKGS SLSAPAV+TGGRDA PLR S PGSSSALD Sbjct: 293 PVPNIDVVAEKGSTPISLSAPAVSTGGRDAMPLRTSVVPGSSSALD 338 >ref|XP_007131664.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] gi|561004664|gb|ESW03658.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] Length = 830 Score = 318 bits (815), Expect = 5e-84 Identities = 200/510 (39%), Positives = 260/510 (50%), Gaps = 6/510 (1%) Frame = -1 Query: 1514 MAHSTPSSTAAGLGPQPPKSPTGNTSDTLQESARXXXXXXXXXXXXXXXXSYNVVPNTLP 1335 ++H P + + P SPT N+S+ +A Y V+ N Sbjct: 7 LSHEAPPPVSGEMS-LPVASPTPNSSNATPSTA--------PAPAPVPPFPYGVLQNA-N 56 Query: 1334 ASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNIDXXXXXXXXX 1155 AS SSQQSSA V+KSN + QPPVPG SS SFSYNI Sbjct: 57 ASGSSQQSSAHNVIKSNSIVNPVVFQPPVPGVSSHAALSFSYNIPPSGAAFPSNQQNTQS 116 Query: 1154 XXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWMPTAPSVSG 975 S + AQD N+ PTT WMPTA S+ Sbjct: 117 SSEISDSVAQDV----TKLSSASSTPHSVPAHTSTPIMPPSDPNYRPTTSWMPTAMSL-- 170 Query: 974 PLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPI--ISSTTSLPSHPSG 801 P+ ++ P+V ST DSSS+AL RP IS+ S P++P Sbjct: 171 PVHPVMPTPGNPGPPGLASSSMISINPAVPSTGTDSSSAALLRPNMPISAIASDPTNP-- 228 Query: 800 QQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVP 621 L YPS+P+MA PPQ WLQ+ + G+ RPP+L Y PGPFP GV PAVP Sbjct: 229 --LKGLPYPSMPSMAAPPQGLWLQTPQMSGVFRPPYLQYPAPFPGPFPFPARGVTLPAVP 286 Query: 620 LLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA----AVT 453 + +SQ ++ V + S G + +Q E+ D KK + A Sbjct: 287 IPDSQPRGVTPVSGGSSTFSPASSNQLRGTTALQTEVISGPADDKKKLNAVIAPNEDTSN 346 Query: 452 NEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWA 273 N+ ++AWTAH+TE G 
+YYYNA+TGESTY+KP+GF GE + + QPTPVSM + G+DW Sbjct: 347 NDQLEAWTAHKTEAGIIYYYNAMTGESTYDKPAGFIGESHQVSAQPTPVSMTDLPGTDWL 406 Query: 272 LVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSA 93 LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG V N L+++GSG Sbjct: 407 LVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDQLMSVPNNNVLSDRGSGMV 466 Query: 92 SLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 +L+APA+NTGGRDA L+PS SSSALD Sbjct: 467 TLNAPAINTGGRDAAALKPSNLQNSSSALD 496 >ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] gi|561004663|gb|ESW03657.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] Length = 977 Score = 318 bits (815), Expect = 5e-84 Identities = 200/510 (39%), Positives = 260/510 (50%), Gaps = 6/510 (1%) Frame = -1 Query: 1514 MAHSTPSSTAAGLGPQPPKSPTGNTSDTLQESARXXXXXXXXXXXXXXXXSYNVVPNTLP 1335 ++H P + + P SPT N+S+ +A Y V+ N Sbjct: 7 LSHEAPPPVSGEMS-LPVASPTPNSSNATPSTA--------PAPAPVPPFPYGVLQNA-N 56 Query: 1334 ASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNIDXXXXXXXXX 1155 AS SSQQSSA V+KSN + QPPVPG SS SFSYNI Sbjct: 57 ASGSSQQSSAHNVIKSNSIVNPVVFQPPVPGVSSHAALSFSYNIPPSGAAFPSNQQNTQS 116 Query: 1154 XXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWMPTAPSVSG 975 S + AQD N+ PTT WMPTA S+ Sbjct: 117 SSEISDSVAQDV----TKLSSASSTPHSVPAHTSTPIMPPSDPNYRPTTSWMPTAMSL-- 170 Query: 974 PLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPI--ISSTTSLPSHPSG 801 P+ ++ P+V ST DSSS+AL RP IS+ S P++P Sbjct: 171 PVHPVMPTPGNPGPPGLASSSMISINPAVPSTGTDSSSAALLRPNMPISAIASDPTNP-- 228 Query: 800 QQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVP 621 L YPS+P+MA PPQ WLQ+ + G+ RPP+L Y PGPFP GV PAVP Sbjct: 229 --LKGLPYPSMPSMAAPPQGLWLQTPQMSGVFRPPYLQYPAPFPGPFPFPARGVTLPAVP 286 Query: 620 LLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA----AVT 453 + +SQ ++ V + S G + +Q E+ D KK + A Sbjct: 287 IPDSQPRGVTPVSGGSSTFSPASSNQLRGTTALQTEVISGPADDKKKLNAVIAPNEDTSN 346 Query: 452 NEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWA 273 N+ ++AWTAH+TE G +YYYNA+TGESTY+KP+GF GE + + QPTPVSM + G+DW Sbjct: 
347 NDQLEAWTAHKTEAGIIYYYNAMTGESTYDKPAGFIGESHQVSAQPTPVSMTDLPGTDWL 406 Query: 272 LVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSA 93 LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG V N L+++GSG Sbjct: 407 LVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDQLMSVPNNNVLSDRGSGMV 466 Query: 92 SLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 +L+APA+NTGGRDA L+PS SSSALD Sbjct: 467 TLNAPAINTGGRDAAALKPSNLQNSSSALD 496 >ref|XP_002315059.2| hypothetical protein POPTR_0010s17750g [Populus trichocarpa] gi|550330031|gb|EEF01230.2| hypothetical protein POPTR_0010s17750g [Populus trichocarpa] Length = 963 Score = 311 bits (797), Expect = 6e-82 Identities = 207/520 (39%), Positives = 257/520 (49%), Gaps = 21/520 (4%) Frame = -1 Query: 1499 PSSTAAGLGPQ--PPKSPTGNTSDTLQESARXXXXXXXXXXXXXXXXSYNVVPNTLPASE 1326 P++T +G+ PP++ GN + + S YNV PN Sbjct: 16 PTATESGVAAATPPPENSAGNNAHSSYSSPAPTFT-------------YNVTPNM----- 57 Query: 1325 SSQQSSATPVMKSNQPASAATLQPPVPGQSST-----------TGPSFSYNINTHNNIDX 1179 S+ + SN P PVPG +S+ TGP F N +++D Sbjct: 58 -----SSGAALNSNPPGQPV----PVPGPASSVGLSFSYKIPQTGPGFPGNQQLQSSVDK 108 Query: 1178 XXXXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWM 999 + A+Q A N PT Sbjct: 109 SPAIAQGSAPSVAPIASQSASFPLHSPSSSYTSLSS---------------NLGPTPSQT 153 Query: 998 PTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVR-STAIDSSSSALQRPIISSTTS 822 P S P G++P AP + S A DS +QRPI+ + Sbjct: 154 PATASFYLP------PGLPRTPGTLAPQGLVPSAPMTQPSVAADSLPLGVQRPIMPT--- 204 Query: 821 LPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHG 642 +PS + QQ YP+YPSLP MA PQA W+ PIGG+PR PFL Y PG FP GHG Sbjct: 205 MPSSNAVQQQTYPTYPSLPVMAASPQALWMHPPPIGGMPRQPFLSYPAAFPGSFPPPGHG 264 Query: 641 VVPPAVPLLNSQVPAISSVGP----PGYASASMGSGPPAGNSLVQPELPPPGIDYNKKAD 474 + P+V L +SQ P + VG P +SAS+ P A +Q ELPPPGID + Sbjct: 265 MPYPSVSLPDSQPPGVVPVGHSYAIPMSSSASVHQLPGAPG--MQTELPPPGIDNHNHLH 322 Query: 473 GGGA---AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVS 303 G A +E AWTAH+T+TG YYYNAVTG STYEKP GFK EP+K VQPTPVS Sbjct: 323 
HSGIRDNAAVSEPSHAWTAHKTDTGVFYYYNAVTGVSTYEKPPGFK-EPEKVPVQPTPVS 381 Query: 302 MEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAG 123 ME +AG+DW L+TT+D KKYYYN+KTK+SSWQIP EVTELRK ++ N V Sbjct: 382 MENLAGTDWVLITTNDSKKYYYNNKTKLSSWQIPSEVTELRKNQEAEVSKGNAMSVSQVN 441 Query: 122 ALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 AL EKGS SLSAPA NTGGRDAT LR + PG+SSALD Sbjct: 442 ALTEKGSAPISLSAPAANTGGRDATALRVLSVPGTSSALD 481 >ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis] Length = 978 Score = 305 bits (781), Expect = 4e-80 Identities = 173/343 (50%), Positives = 207/343 (60%), Gaps = 6/343 (1%) Frame = -1 Query: 1013 TTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAI-DSSSSALQRPII 837 TT WMPT PS S P G+L S+A D SSA RP + Sbjct: 167 TTSWMPTIPSFSTP------PGLFVTPQTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSV 220 Query: 836 SSTTSLPSHPSG--QQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGP 663 T S PS+ Q +YP+YPSLP + PQ LQ +G P PFLPY P P Sbjct: 221 P-TPSAPSNSGSAIQHQIYPTYPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSP 279 Query: 662 FPLAGHGVVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNK 483 FPL HG+ P+V +++Q P +SS+ S S G + E PP G D + Sbjct: 280 FPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDKKE 339 Query: 482 KADGGGAAV---TNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPT 312 + + NE +DAWTAH+T+TG VYYYNAVTGESTYEKP+GFKGEPDK VQPT Sbjct: 340 HVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPT 399 Query: 311 PVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQ 132 P+SME + G+DWALVTT+DGKKYYYN K KVSSWQIP EVTEL+KKED +L + P Sbjct: 400 PISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP-- 457 Query: 131 NAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 N + EKGS + SLS+PAVNTGGRDAT LR S+ PGSSSALD Sbjct: 458 NTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALD 500 >ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] gi|557539684|gb|ESR50728.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] Length = 1015 Score = 303 bits (775), Expect = 
2e-79 Identities = 172/343 (50%), Positives = 207/343 (60%), Gaps = 6/343 (1%) Frame = -1 Query: 1013 TTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAI-DSSSSALQRPII 837 TT WMPT PS S P G+L S+A D SSA RP + Sbjct: 204 TTSWMPTIPSFSTP------PGLFVTPQTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSV 257 Query: 836 SSTTSLPSHPSG--QQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGP 663 T S PS+ Q +YP++PSLP + PQ LQ +G P PFLPY P P Sbjct: 258 P-TPSAPSNSGSAIQHQIYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAYPSP 316 Query: 662 FPLAGHGVVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNK 483 FPL HG+ P+V +++Q P +SS+ S S G + E PP G D + Sbjct: 317 FPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDKKE 376 Query: 482 KADGGGAAV---TNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPT 312 + + NE +DAWTAH+T+TG VYYYNAVTGESTYEKP+GFKGEPDK VQPT Sbjct: 377 HVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPT 436 Query: 311 PVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQ 132 P+SME + G+DWALVTT+DGKKYYYN K KVSSWQIP EVTEL+KKED +L + P Sbjct: 437 PISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP-- 494 Query: 131 NAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 N + EKGS + SLS+PAVNTGGRDAT LR S+ PGSSSALD Sbjct: 495 NTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALD 537 >ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prunus persica] gi|462418875|gb|EMJ23138.1| hypothetical protein PRUPE_ppa001490mg [Prunus persica] Length = 814 Score = 301 bits (772), Expect = 4e-79 Identities = 171/348 (49%), Positives = 205/348 (58%), Gaps = 7/348 (2%) Frame = -1 Query: 1025 NFYPTTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQ- 849 N TT W+PT PS + I F P+ S IDSSS AL+ Sbjct: 8 NMGTTTSWVPTGPSFNLTSGMPGTPGTPGPPGIAHPVQI-SFNPTAPSAPIDSSSVALRP 66 Query: 848 ----RPIISSTTSLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYS 681 P+ SS Q V Y SL +M PPQ WLQS IGG PRPPFLPY Sbjct: 67 SMQIAPVASSAV--------QPQVGAPYLSLSSMGAPPQGVWLQSPQIGGFPRPPFLPYP 118 Query: 680 GVLPGPFPLAGHGVVPPAVPLLNSQVPAISSVG-PPGYASASMGSGPP-AGNSLVQPELP 507 PGPFPL H + 
P+VPL +SQ P + VG +S S SG AG+S +Q ELP Sbjct: 119 AAFPGPFPLPAHVMPLPSVPLPDSQPPGVIPVGNTAAISSPSAASGHQLAGSSGIQIELP 178 Query: 506 PPGIDYNKKADGGGAAVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKA 327 PGI +A NE +DAWTAH+TETG VYYYNA+TGESTY+KP GFK EPDK Sbjct: 179 HPGIGNENRAS------VNEQLDAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKV 232 Query: 326 TVQPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNAN 147 ++QPTPVS ++G+DW LVTT DGKK+Y+N KTKVSSWQIP EV ELRKK+D + Sbjct: 233 SMQPTPVSTVNLSGTDWVLVTTSDGKKFYHNGKTKVSSWQIPNEVIELRKKQDADVPKEH 292 Query: 146 TTPVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 + + EKGS SL+APA+NTGGR+A +PSA G+SSALD Sbjct: 293 PVSIPINNVMTEKGSAPISLTAPAINTGGREAMAFKPSAVQGTSSALD 340 >ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative [Ricinus communis] gi|223545064|gb|EEF46576.1| Pre-mRNA-processing protein PRP40, putative [Ricinus communis] Length = 886 Score = 299 bits (766), Expect = 2e-78 Identities = 158/300 (52%), Positives = 201/300 (67%), Gaps = 9/300 (3%) Frame = -1 Query: 875 IDSSSSALQRPIISSTTSLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPP 696 +DS++S++QRP++ + T S+P QQ Y +YPSLPAMA Q W +GG+PR P Sbjct: 112 VDSATSSVQRPVMPTVTHA-SNPVVQQQSYHTYPSLPAMAASAQGLWFHPPQMGGMPRTP 170 Query: 695 FLPYS-GVLPGPFPLAGHGVVPPAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLV- 522 FLPY V PG +PL HG+ P++ + Q VG PG A+ S +G+ L+ Sbjct: 171 FLPYPPAVFPGSYPLPAHGISRPSISSPDFQPSGAPPVGIPG---ANPPSSAASGHQLMG 227 Query: 521 ----QPELPPPGIDYNKKADGGGA---AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYE 363 Q E+PPPGID + G A T++ +DAWTAH+T+ G VYYYNAVTG STYE Sbjct: 228 TPGMQKEIPPPGIDNRSQIHDFGTKNNAATSDSLDAWTAHKTDAGVVYYYNAVTGVSTYE 287 Query: 362 KPSGFKGEPDKATVQPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTEL 183 KP GFK EP+K +QPTPVSME +AG+DWAL+TT+DGK YYYN+KTK+SSWQIP EVTEL Sbjct: 288 KPPGFKSEPEKVPMQPTPVSMENLAGTDWALITTNDGKNYYYNNKTKLSSWQIPSEVTEL 347 Query: 182 RKKEDGGSLNANTTPVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 +KK++ L V ++ L EKGS SLSAPA+NTGGRDAT LR S A G+SSALD Sbjct: 348 
KKKQE-AELKEQEMSVSSSSVLNEKGSVQISLSAPAINTGGRDATALRASNALGASSALD 406 >ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C-like [Cicer arietinum] Length = 953 Score = 294 bits (753), Expect = 7e-77 Identities = 182/456 (39%), Positives = 242/456 (53%), Gaps = 6/456 (1%) Frame = -1 Query: 1352 VPNTLPASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNIDXXX 1173 V + AS +SQQSS+ MK N + L P P +++T PSFSYN++ + Sbjct: 36 VNQNVNASGNSQQSSSHSGMKPNSGVNPP-LVPGFPPRAAT--PSFSYNVS-QSVAPFTG 91 Query: 1172 XXXXXXXXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWMPT 993 S + AQD N+ PTT+WMPT Sbjct: 92 NQHAQSSTNMSDSIAQDFSKVSSASSNPHPIPAPTSISAMPPPSDP---NYRPTTLWMPT 148 Query: 992 APSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTSLPS 813 AP+ GI+P P+ S+ D SSA+ RP + T + S Sbjct: 149 APT----FPVHTLMPGTPGPPGLAKPGIMPSNPAAPSSNTDFPSSAVPRPNMP-TAPIGS 203 Query: 812 HPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVP 633 P+ P YP +P+M PPQ WLQ + G+ RPPFL Y PGPFP GV Sbjct: 204 DPNASHKGLP-YPPIPSMVAPPQGFWLQPPQMSGVHRPPFLQYPAAFPGPFPFPARGVTL 262 Query: 632 PAVPLLNSQVPAISSVGPPGYASASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGAAVT 453 PAVP+ +SQ P ++ VG G ++ S+ S G S +Q + D K A VT Sbjct: 263 PAVPVPDSQPPGVTPVGAAGISAFSVSSHQLRGTSGLQTVVISAHADDKKL----NATVT 318 Query: 452 ------NEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKV 291 N+ +DAWTAH+TE G VYYYNA+TGESTY+KP+GFKGE + +VQPTPVS+ + Sbjct: 319 HNEDAANDQLDAWTAHKTEAGIVYYYNALTGESTYDKPAGFKGEAHQVSVQPTPVSVVDL 378 Query: 290 AGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAE 111 G+DW LV+T DGKKYYYN++TK S WQIP EV EL+KK+DG + + PV NA L + Sbjct: 379 PGTDWQLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDAAKDHLMPVLNATVLPD 438 Query: 110 KGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 +G G +L+APA+ TGGRDA ++P + S SALD Sbjct: 439 RGFGMVTLNAPAITTGGRDAATVKPFSVQSSPSALD 474 >ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] Length = 930 Score = 286 bits (733), Expect = 1e-74 Identities = 178/449 (39%), Positives = 223/449 (49%), Gaps = 5/449 
(1%) Frame = -1 Query: 1334 ASESSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNNIDXXXXXXXXX 1155 AS SSQ S P + SN + +QPP G SS PSFSYNI I Sbjct: 55 ASGSSQLLSTHPAIISNSAVNPMVVQPP--GVSSHAAPSFSYNIPQSGAIFSSNQQH--- 109 Query: 1154 XXXXSGAAAQDAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNFYPTTIWMPTAPSVSG 975 AQ + N+ P T WMPTA +S Sbjct: 110 --------AQSSTDVSKLSSASSIPHSVPAHTSTSLMPPPSDPNYCPATSWMPTA--LSF 159 Query: 974 PLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQRPIISSTTSLPSHPSGQQ 795 P+ P P + S+AI SS+ Sbjct: 160 PVHPVMPTQGN------------PGPPGLASSAIISSN---------------------- 185 Query: 794 LVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVPLL 615 P+ PS+PA+A PPQ WLQ + G+ RPP+L Y PGPFP GV PAVP+ Sbjct: 186 ---PAAPSIPALAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARGVALPAVPIP 242 Query: 614 NSQVPAISSVGPPGYA-SASMGSGPPAGNSLVQPELPPPGIDYNKKADGGGA----AVTN 450 +SQ P ++ VG G + S S G + +Q E+ D KK + A N Sbjct: 243 DSQPPGVTPVGAAGGTPTPSASSYQLRGTTALQTEVISGSADDKKKLNSVDTLNEDAANN 302 Query: 449 EDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWAL 270 + +DAWTAH+TE G +YYYNAVTGESTY KPSGFKGE + + QPTPVSM + G+DW L Sbjct: 303 DQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGESHQVSAQPTPVSMIDLPGTDWRL 362 Query: 269 VTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSAS 90 V+T DGKKYYYN+ TK S WQIP EV EL+KK+DG + V N L+++GSG + Sbjct: 363 VSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVPNTNVLSDRGSGMVT 422 Query: 89 LSAPAVNTGGRDATPLRPSAAPGSSSALD 3 L+APA+NTGGRDA L+PS SSSALD Sbjct: 423 LNAPAINTGGRDAAALKPSTLQNSSSALD 451 >gb|EXC33082.1| Transcription elongation regulator 1 [Morus notabilis] Length = 829 Score = 278 bits (712), Expect = 4e-72 Identities = 155/303 (51%), Positives = 196/303 (64%), Gaps = 6/303 (1%) Frame = -1 Query: 893 SVRSTAIDSSSSALQRPIISSTT-SLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQP- 720 +V A+D+S + +QRPI+ S ++ S+ + QQ + Y SLP+MA PPQ WLQ P Sbjct: 49 TVGPVAVDTSLT-VQRPIMPSPMGAMASNSAVQQQIGVPYQSLPSMAAPPQGPWLQPSPQ 107 Query: 719 IGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVPLLNSQVPAISSVGPPGYASASMGSGPP 540 +GG+PR P L Y PGPFP G+ PP+VP +SQ P I+ VG + 
Sbjct: 108 MGGVPRLPNLLYHAAFPGPFPSMARGI-PPSVPGPDSQPPGIAPVGNTRLTPTPFAASVQ 166 Query: 539 ---AGNSLVQPELPPPGIDYN-KKADGGGAAVTNEDIDAWTAHRTETGAVYYYNAVTGES 372 AG+S + EL + + +A NE DAWTAH+TE G VYYYN +TGES Sbjct: 167 PVVAGSSGTRMELHTSDEQTHVRDVRSQVSADVNEQSDAWTAHKTEAGVVYYYNTLTGES 226 Query: 371 TYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEV 192 TY+KP GFKGEP+K +VQP PVSM + G+DW LV+T DGKKYYYN+KTKVSSWQIP EV Sbjct: 227 TYDKPPGFKGEPEKVSVQPVPVSMVNLPGTDWVLVSTSDGKKYYYNNKTKVSSWQIPNEV 286 Query: 191 TELRKKEDGGSLNANTTPVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSS 12 TELRKK++ N+T V N LAEKGS +L+APA+NTGGRDA LR ++A GSSS Sbjct: 287 TELRKKQESDIPKENSTSVPNNNVLAEKGSTPINLNAPAINTGGRDAMALRSTSAQGSSS 346 Query: 11 ALD 3 ALD Sbjct: 347 ALD 349 >ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] Length = 854 Score = 275 bits (703), Expect = 4e-71 Identities = 153/346 (44%), Positives = 194/346 (56%), Gaps = 5/346 (1%) Frame = -1 Query: 1025 NFYPTTIWMPTAPSVSGPLXXXXXXXXXXXXXXXXXXGILPFAPSVRSTAIDSSSSALQR 846 N+ P T WMPTA +S P+ P P + S+AI SS+ Sbjct: 69 NYCPATSWMPTA--LSFPVHPVMPTQGN------------PGPPGLASSAIISSN----- 109 Query: 845 PIISSTTSLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQSQPIGGLPRPPFLPYSGVLPG 666 P+ PS+PA+A PPQ WLQ + G+ RPP+L Y PG Sbjct: 110 --------------------PAAPSIPALAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPG 149 Query: 665 PFPLAGHGVVPPAVPLLNSQVPAISSVGPPGYA-SASMGSGPPAGNSLVQPELPPPGIDY 489 PFP GV PAVP+ +SQ P ++ VG G + S S G + +Q E+ D Sbjct: 150 PFPFPARGVALPAVPIPDSQPPGVTPVGAAGGTPTPSASSYQLRGTTALQTEVISGSADD 209 Query: 488 NKKADGGGA----AVTNEDIDAWTAHRTETGAVYYYNAVTGESTYEKPSGFKGEPDKATV 321 KK + A N+ +DAWTAH+TE G +YYYNAVTGESTY KPSGFKGE + + Sbjct: 210 KKKLNSVDTLNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGESHQVSA 269 Query: 320 QPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIPKEVTELRKKEDGGSLNANTT 141 QPTPVSM + G+DW LV+T DGKKYYYN+ TK S WQIP EV EL+KK+DG + Sbjct: 270 QPTPVSMIDLPGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVTKDHLM 329 Query: 140 PVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPGSSSALD 3 V N 
L+++GSG +L+APA+NTGGRDA L+PS SSSALD Sbjct: 330 SVPNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALD 375 >ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X4 [Solanum tuberosum] Length = 1027 Score = 275 bits (703), Expect = 4e-71 Identities = 152/306 (49%), Positives = 193/306 (63%), Gaps = 4/306 (1%) Frame = -1 Query: 908 LPFAPSVRSTAIDSSSSALQRPIISSTTSLPSHPSGQQLVYPSYPSLPAMAPPPQAAWLQ 729 +P + ++ +TA S RP S L ++PS QQ Y Y S + P Q WLQ Sbjct: 263 IPSSSNLTATASPGGPSLPLRPNASPVHVL-ANPSVQQQTYSPYFSPTPITPSHQGPWLQ 321 Query: 728 SQPIGGLPRPPFLPYSGVLPGPFPLAGHGVVPPAVPLLNSQVPAISSVG-PPGYASASMG 552 P+ + RPPF Y PFPL+ G +V L +++ P ++ V PPG + + Sbjct: 322 PPPVTTMLRPPFPSYPAGFAVPFPLSATGAPLSSVTLPDTRPPGVAPVAAPPGVPTTA-- 379 Query: 551 SGPPAGNSLVQPELPPPGIDYNKK---ADGGGAAVTNEDIDAWTAHRTETGAVYYYNAVT 381 P S +QPELPP G+D K AD A T+E ++ WTAHRTETGA+YYYN++T Sbjct: 380 -SQPTHASGLQPELPP-GVDSGKHVNDADTKQGASTSEQLETWTAHRTETGAIYYYNSLT 437 Query: 380 GESTYEKPSGFKGEPDKATVQPTPVSMEKVAGSDWALVTTDDGKKYYYNDKTKVSSWQIP 201 GESTYEKP+GF+GEP K QPTPVS E++AG+DWALV T+DG++YYYN KTK+SSWQIP Sbjct: 438 GESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQRYYYNTKTKLSSWQIP 497 Query: 200 KEVTELRKKEDGGSLNANTTPVQNAGALAEKGSGSASLSAPAVNTGGRDATPLRPSAAPG 21 EVTEL+KK D +L A + + N EKGS SLS PAV+TGGRDAT LRPS PG Sbjct: 498 SEVTELKKKHDADALQAQSPSILNVNESTEKGSAPISLSIPAVSTGGRDATSLRPSLVPG 557 Query: 20 SSSALD 3 SSALD Sbjct: 558 -SSALD 562 Score = 60.5 bits (145), Expect = 2e-06 Identities = 38/106 (35%), Positives = 55/106 (51%) Frame = -1 Query: 1505 STPSSTAAGLGPQPPKSPTGNTSDTLQESARXXXXXXXXXXXXXXXXSYNVVPNTLPASE 1326 S+ +S + +P S + +D+ QE+A+ SY N S Sbjct: 15 SSQTSVMSSATGEPTTSSSTPNADSTQEAAQGKFISPPGYSVCRASFSYM---NANVPSG 71 Query: 1325 SSQQSSATPVMKSNQPASAATLQPPVPGQSSTTGPSFSYNINTHNN 1188 SSQQ S++PV+ S S+A LQPP+PGQS+ G SFSYNI+ +N Sbjct: 72 SSQQPSSSPVIPSTSAGSSALLQPPIPGQSANVGSSFSYNISQTDN 117