BLASTX nr result
ID: Catharanthus23_contig00015979
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00015979
         (1560 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004247752.1| PREDICTED: uncharacterized protein LOC101258...   290   9e-76
ref|XP_006354457.1| PREDICTED: uncharacterized protein LOC102579...   283   1e-73
ref|XP_006354456.1| PREDICTED: uncharacterized protein LOC102579...   283   1e-73
emb|CBI23686.3| unnamed protein product [Vitis vinifera]              265   5e-68
ref|XP_006489672.1| PREDICTED: splicing factor U2af large subuni...   254   9e-65
ref|XP_006489671.1| PREDICTED: splicing factor U2af large subuni...   254   9e-65
gb|EXB46745.1| Splicing factor U2AF 50 kDa subunit [Morus notabi...   253   2e-64
gb|ESW17866.1| hypothetical protein PHAVU_007G275200g [Phaseolus...   251   5e-64
ref|XP_006588544.1| PREDICTED: uncharacterized protein LOC100810...   250   1e-63
ref|XP_006420295.1| hypothetical protein CICLE_v10004248mg [Citr...   250   1e-63
gb|ABK96758.1| unknown [Populus trichocarpa x Populus deltoides]      246   3e-62
ref|XP_004497972.1| PREDICTED: serine/arginine repetitive matrix...   236   3e-59
ref|XP_004296390.1| PREDICTED: splicing factor U2AF 50 kDa subun...   234   6e-59
ref|XP_002281833.2| PREDICTED: uncharacterized protein LOC100266...   234   8e-59
ref|XP_004497970.1| PREDICTED: serine/arginine repetitive matrix...   230   1e-57
gb|EMJ25476.1| hypothetical protein PRUPE_ppa019989mg, partial [...   226   2e-56
ref|XP_006857448.1| hypothetical protein AMTR_s00067p00176230 [A...   222   3e-55
ref|XP_002528813.1| splicing factor u2af large subunit, putative...   207   1e-50
ref|XP_002465895.1| hypothetical protein SORBIDRAFT_01g047730 [S...   206   2e-50
gb|EOY06129.1| Splicing factor U2AF 50 kDa subunit, putative [Th...   198   5e-48

>ref|XP_004247752.1| PREDICTED: uncharacterized protein LOC101258490
           [Solanum lycopersicum]
          Length = 903

 Score =  290 bits (743), Expect = 9e-76
 Identities = 190/478 (39%), Positives = 270/478 (56%), Gaps = 48/478 (10%)
 Frame = +1

Query: 1    TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180
            TGVP KS  A   + +TVEDSS+KIF+GGIS+ ISSEML++IA+ FGPLKA+H   NSD+
Sbjct: 440  TGVPQKSVAAADRIDNTVEDSSYKIFVGGISRTISSEMLMEIAKAFGPLKAYHFRMNSDL 499

Query: 181  DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360
            +  CAFLEY DH VTLKACAGLNGMKLGG+VLT V+A+PD + L    + P Y IP+HAK
Sbjct: 500  NEPCAFLEYVDHSVTLKACAGLNGMKLGGKVLTVVRAVPDTALLDKDENTPLYRIPQHAK 559

Query: 361  PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCS-- 534
            PLLEK TE+LK+KNV++A                     C+RFG +K++NVVKQ  CS
Sbjct: 560  PLLEKHTEVLKLKNVVDANVLSFLSEAELEELLEDIRLECARFGAIKSINVVKQSQCSLI 619

Query: 535  --------STTESFASMDAGDAA-----VDDEGKQEVTVG------TTAHELENLDESKP 657
                    S+T + ++MD G+       +      E+ VG      +  HELE +  S
Sbjct: 620  SDPAAMDTSSTLNDSNMDFGEECDKNDPITRSDDHELEVGGPHFPSSDHHELE-VGGSHI 678

Query: 658  PSSTMEAVE------------DNCNSDVKPGRCSPLS---SSTDPDDFSKANVDN----- 777
            P+S    +E               NSD +  RC+      S +  DD  KA  D+
Sbjct: 679  PNSDDHELEVGRPHFPNSDEPMETNSDKEAERCADSKTHISESSQDDSQKAGDDDALAGG 738

Query: 778  GHSDDKLLANIITDETC-----ESNIEDKDTSIKDAXXXXXXXXXXXXREKSPDAS-IDH 939
             HSDD+    +I D++      +S++  ++T  ++             ++++ + S ++H
Sbjct: 739  SHSDDRPSEELIKDDSSDPLPDDSSVSAQETIFQENLEVTRTGMVSERKDENANPSPLEH 798

Query: 940  L-ISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNPGNKKDSDINEKAEDK 1116
            L I+ND          S +++ +K E+ + + +D  R S    +  +K++ D  E+ E K
Sbjct: 799  LEINND----------SPVKEAIKSEEDNGNVDD--RPSEPEFS--SKEELDAPEELEKK 844

Query: 1117 GEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTVEYISPNLYRKRFPK 1290
             E+  + +VF PG VLVEF+RAEA C AAHCLH R FDDRIVTVEY+  +LY+ +F K
Sbjct: 845  EEIP-ITEVFDPGCVLVEFRRAEAACTAAHCLHGRLFDDRIVTVEYVPLDLYQTKFAK 901

>ref|XP_006354457.1| PREDICTED: uncharacterized protein LOC102579232 isoform
           X2 [Solanum tuberosum]
          Length = 1061

Score = 283 bits (725), Expect = 1e-73 Identities = 188/475 (39%), Positives = 264/475 (55%), Gaps = 47/475 (9%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TGVP KS A + DTVEDSS+KIF+GGIS+ ISSEML++IA+ FGPLKA+H NSD+ Sbjct: 606 TGVPQKSVAAADRIDDTVEDSSYKIFVGGISRTISSEMLMEIAKAFGPLKAYHFRMNSDL 665 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAFLEY DH VTLKACAGLNGMKLGG+VLT VQA+PD + L + P Y IP+HAK Sbjct: 666 NEPCAFLEYVDHSVTLKACAGLNGMKLGGKVLTVVQAVPDTALLDKDENTPLYRIPQHAK 725 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLLEK TE+LK+KNV++A C+RFG+VK++NVVKQ CS T Sbjct: 726 PLLEKHTEVLKLKNVVDANVLNFLSEAELEELLEDIRLECARFGSVKSINVVKQSQCSLT 785 Query: 541 TESFASMDAGDAAVDD-----EG-----------KQEVTVG------TTAHELE------ 636 ++ A+MD D EG E+ VG + HELE Sbjct: 786 SDP-AAMDTSSTLNDSNMEFGEGCDRNDPITRSDDYELEVGGPHFPNSDHHELEVGGSHI 844 Query: 637 ------NLDESKP--PSSTMEAVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDN----- 777 L+ +P P+S E +E N + + S T D KA D+ Sbjct: 845 PNSDDHELEVGRPHFPNSD-EPMETNSDEEAD---SKTHISETSQGDSQKAGDDDALAGG 900 Query: 778 GHSDDKLLANIITDETC-----ESNIEDKDTSIKDAXXXXXXXXXXXXREKSPDAS-IDH 939 HSDD+ +I D++ +S++ ++T+ ++ ++++ + S ++H Sbjct: 901 SHSDDRPSEELIKDDSSDPLPDDSSVSAQETNFQENFEVTHTGMVSERKDENANPSPLEH 960 Query: 940 LISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNPGNKKDSDINEKAEDKG 1119 L N++ S +++ +K S E++ AS +K++ D E+ E K Sbjct: 961 LEINNE---------SPVKEAIK-----SEEDNGNADGASEPEFSSKEELDAPEELEKKE 1006 Query: 1120 EVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTVEYISPNLYRKRF 1284 E+ ++ + F PG VLVEF+RAEA +AAHCLH R FDDRIVTVEY+ +LY+ +F Sbjct: 1007 EI-SITEAFDPGCVLVEFRRAEAASMAAHCLHGRLFDDRIVTVEYVPLDLYQTKF 1060 >ref|XP_006354456.1| PREDICTED: uncharacterized protein LOC102579232 isoform X1 [Solanum tuberosum] Length = 1105 Score = 283 bits (725), Expect = 1e-73 Identities = 188/475 (39%), Positives = 264/475 (55%), Gaps = 47/475 (9%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TGVP KS A 
+ DTVEDSS+KIF+GGIS+ ISSEML++IA+ FGPLKA+H NSD+ Sbjct: 650 TGVPQKSVAAADRIDDTVEDSSYKIFVGGISRTISSEMLMEIAKAFGPLKAYHFRMNSDL 709 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAFLEY DH VTLKACAGLNGMKLGG+VLT VQA+PD + L + P Y IP+HAK Sbjct: 710 NEPCAFLEYVDHSVTLKACAGLNGMKLGGKVLTVVQAVPDTALLDKDENTPLYRIPQHAK 769 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLLEK TE+LK+KNV++A C+RFG+VK++NVVKQ CS T Sbjct: 770 PLLEKHTEVLKLKNVVDANVLNFLSEAELEELLEDIRLECARFGSVKSINVVKQSQCSLT 829 Query: 541 TESFASMDAGDAAVDD-----EG-----------KQEVTVG------TTAHELE------ 636 ++ A+MD D EG E+ VG + HELE Sbjct: 830 SDP-AAMDTSSTLNDSNMEFGEGCDRNDPITRSDDYELEVGGPHFPNSDHHELEVGGSHI 888 Query: 637 ------NLDESKP--PSSTMEAVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDN----- 777 L+ +P P+S E +E N + + S T D KA D+ Sbjct: 889 PNSDDHELEVGRPHFPNSD-EPMETNSDEEAD---SKTHISETSQGDSQKAGDDDALAGG 944 Query: 778 GHSDDKLLANIITDETC-----ESNIEDKDTSIKDAXXXXXXXXXXXXREKSPDAS-IDH 939 HSDD+ +I D++ +S++ ++T+ ++ ++++ + S ++H Sbjct: 945 SHSDDRPSEELIKDDSSDPLPDDSSVSAQETNFQENFEVTHTGMVSERKDENANPSPLEH 1004 Query: 940 LISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNPGNKKDSDINEKAEDKG 1119 L N++ S +++ +K S E++ AS +K++ D E+ E K Sbjct: 1005 LEINNE---------SPVKEAIK-----SEEDNGNADGASEPEFSSKEELDAPEELEKKE 1050 Query: 1120 EVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTVEYISPNLYRKRF 1284 E+ ++ + F PG VLVEF+RAEA +AAHCLH R FDDRIVTVEY+ +LY+ +F Sbjct: 1051 EI-SITEAFDPGCVLVEFRRAEAASMAAHCLHGRLFDDRIVTVEYVPLDLYQTKF 1104 >emb|CBI23686.3| unnamed protein product [Vitis vinifera] Length = 882 Score = 265 bits (676), Expect = 5e-68 Identities = 170/438 (38%), Positives = 231/438 (52%), Gaps = 8/438 (1%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TGV +K A +SD V+DS HKIFIGGIS+ +SS+ML++IA FGPLKA+ + N D+ Sbjct: 491 TGVQEKLVAAPDAISDIVKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDL 550 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 CAFLEY D VTLKACAGLNGMKLGGQVLT VQA+P+A ++ + + P Y 
IPEHAK Sbjct: 551 GEPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNALAMENTGNLPFYGIPEHAK 610 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCS-S 537 PLLE+PT++LK+KNV+ C+RFGTVK+VN+VK N S Sbjct: 611 PLLERPTQVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVS 670 Query: 538 TTESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGR 717 T E + A D+ G N D + Sbjct: 671 TLEVY-------EAADNTG------------------------------SNLGCDGNSMK 693 Query: 718 CSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXXX 897 L TD + N SDDK L ++I +E CE + D +T++K+ Sbjct: 694 AETLGGGTDNGSIDEVVERNSISDDKSLTDLIKNELCEPSHIDSNTAVKE-------PGC 746 Query: 898 XXXREKSPDASIDHLISNDKVVDNSTTVASGI--EDTMKIEKGSSSEEDITR----TSAS 1059 + P D L + V+ A+ + ED + K + EE+ R TSA Sbjct: 747 PDGSDDIPRGLPDQLNNMKHEVELRNDKAADVIQEDFIIKNKLMTVEEETNRKLLGTSAE 806 Query: 1060 A-LNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDR 1236 +PG K SD K + + + +L +F+ G VLVE+ R EA C+AAHCLH R+FDDR Sbjct: 807 LDSSPGIK--SDFTGKNDSEKGLCDLDDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDR 864 Query: 1237 IVTVEYISPNLYRKRFPK 1290 +V V Y++ +LYR +FP+ Sbjct: 865 VVVVGYVALDLYRMKFPR 882 >ref|XP_006489672.1| PREDICTED: splicing factor U2af large subunit B-like isoform X2 [Citrus sinensis] Length = 965 Score = 254 bits (648), Expect = 9e-65 Identities = 172/445 (38%), Positives = 238/445 (53%), Gaps = 15/445 (3%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 +G +KS + ++S V+DS HKIFIGGIS+ +SS+M+++I FGPLKA+H E N D Sbjct: 551 SGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH 610 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAF+EY D LVT KA AGLNG+K+GGQVLTAVQA+ D S + + + P + IP+HA Sbjct: 611 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 670 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL+KPTE+LK+KNV C+RFGTVK+VNVVK G+ + Sbjct: 671 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGTVKSVNVVKYGDSNIF 730 Query: 541 
T-------ESFASMDAGDAAVDDE--GKQEVTVGTTAH------ELENLDESKPPSSTME 675 T E+ AS G +DE KQE T H ELE L++SK ME Sbjct: 731 TIQACEGNENTASAGVGQNLTNDETNEKQERLEEVTDHKSIKNNELEILNDSK---EVME 787 Query: 676 AVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDT 855 A E N +VK R + S +P + + D + A+ T E + + Sbjct: 788 AGEVN---NVKDNRPASGSMGDEPSQLCELDTDMA---VEYQAHDSTSEIVSQGVPTQVN 841 Query: 856 SIKDAXXXXXXXXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEE 1035 ++KD P A D + N ++ S ++ + +E+ + + E Sbjct: 842 TLKD----------------EPCAHDDKVTCNIQLEHMGEENKSSAKEDLNLEEVNGNSE 885 Query: 1036 DITRTSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLH 1215 T A N + S + E +++ + N G +F+PG V VE++RAEA C+AAH LH Sbjct: 886 AFT----GASNEMGMQSSAV-ENGDNENQDPNQGHIFEPGCVFVEYRRAEASCMAAHSLH 940 Query: 1216 RRHFDDRIVTVEYISPNLYRKRFPK 1290 RR FDDRIV VEYI NLYR RF K Sbjct: 941 RRLFDDRIVAVEYIPLNLYRARFSK 965 >ref|XP_006489671.1| PREDICTED: splicing factor U2af large subunit B-like isoform X1 [Citrus sinensis] Length = 967 Score = 254 bits (648), Expect = 9e-65 Identities = 172/445 (38%), Positives = 238/445 (53%), Gaps = 15/445 (3%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 +G +KS + ++S V+DS HKIFIGGIS+ +SS+M+++I FGPLKA+H E N D Sbjct: 553 SGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH 612 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAF+EY D LVT KA AGLNG+K+GGQVLTAVQA+ D S + + + P + IP+HA Sbjct: 613 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 672 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL+KPTE+LK+KNV C+RFGTVK+VNVVK G+ + Sbjct: 673 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGTVKSVNVVKYGDSNIF 732 Query: 541 T-------ESFASMDAGDAAVDDE--GKQEVTVGTTAH------ELENLDESKPPSSTME 675 T E+ AS G +DE KQE T H ELE L++SK ME Sbjct: 733 TIQACEGNENTASAGVGQNLTNDETNEKQERLEEVTDHKSIKNNELEILNDSK---EVME 789 Query: 676 AVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDT 855 A E N +VK R + S +P + + D + 
A+ T E + + Sbjct: 790 AGEVN---NVKDNRPASGSMGDEPSQLCELDTDMA---VEYQAHDSTSEIVSQGVPTQVN 843 Query: 856 SIKDAXXXXXXXXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEE 1035 ++KD P A D + N ++ S ++ + +E+ + + E Sbjct: 844 TLKD----------------EPCAHDDKVTCNIQLEHMGEENKSSAKEDLNLEEVNGNSE 887 Query: 1036 DITRTSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLH 1215 T A N + S + E +++ + N G +F+PG V VE++RAEA C+AAH LH Sbjct: 888 AFT----GASNEMGMQSSAV-ENGDNENQDPNQGHIFEPGCVFVEYRRAEASCMAAHSLH 942 Query: 1216 RRHFDDRIVTVEYISPNLYRKRFPK 1290 RR FDDRIV VEYI NLYR RF K Sbjct: 943 RRLFDDRIVAVEYIPLNLYRARFSK 967 >gb|EXB46745.1| Splicing factor U2AF 50 kDa subunit [Morus notabilis] Length = 931 Score = 253 bits (645), Expect = 2e-64 Identities = 168/440 (38%), Positives = 233/440 (52%), Gaps = 10/440 (2%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TG +KS DA T+SD V+DS +KIFIGGISK +SS+ML++I FGPLKA+H E N ++ Sbjct: 523 TGDLEKSTDAVDTISDVVKDSPNKIFIGGISKALSSKMLMEIVSAFGPLKAYHFEVNDEL 582 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAFLEY D + KACAGLNGMKLGG+VLT +QA+ A SLG+ + Y IPEHAK Sbjct: 583 NDPCAFLEYVDQSIAPKACAGLNGMKLGGKVLTVIQAIRGAESLGNSAESSLYKIPEHAK 642 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL++PT++LK+KN+ C RFG VK+VNVVKQ N T Sbjct: 643 PLLKQPTQVLKLKNMFNLVGFSSLSEPEVEEVIEDVRLECVRFGNVKSVNVVKQSNSQIT 702 Query: 541 TESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRC 720 + ++ A + G G A + EN C + +P Sbjct: 703 SSGICELN-NRAQTGEFGPNLGCEGNNA-KTENF--------------GGCTNG-EPSGI 745 Query: 721 SPLSSSTDPDDFSKANV--DNGHSDDKLLANIIT-DETCESNIEDKDTSIKDAXXXXXXX 891 + L + + + V D+G +D++ L NII D++C++ D + + Sbjct: 746 AALEFVKNDQELKENEVPKDSG-TDNRQLDNIIAEDKSCQTGQLTSDENEPNIIPEELPT 804 Query: 892 XXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNP 1071 RE S +DKV + T DT +EK + E++ TR + Sbjct: 805 QLNSPREVSEQL-------DDKVGSATPT------DTHGMEKKITGEDNSTRGDTDSKKQ 851 Query: 1072 
GNKKDSD------INEKA-EDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFD 1230 G ++ D N+K +D E +LG +F+ G VLVEF R EA C AAHCLH R FD Sbjct: 852 GTVEEFDGFMETESNDKVMDDSKEQFDLGSIFEVGCVLVEFGRTEAACTAAHCLHGRLFD 911 Query: 1231 DRIVTVEYISPNLYRKRFPK 1290 DRIV+VEY++ + Y+ RFPK Sbjct: 912 DRIVSVEYVALDHYKTRFPK 931 >gb|ESW17866.1| hypothetical protein PHAVU_007G275200g [Phaseolus vulgaris] Length = 972 Score = 251 bits (642), Expect = 5e-64 Identities = 175/448 (39%), Positives = 225/448 (50%), Gaps = 18/448 (4%) Frame = +1 Query: 1 TGVPDKSADANFTL-SDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSD 177 TG P++S D T+ SD V DS HKIFIGGIS L+SSEML++IA FG LKA+H E N+ Sbjct: 571 TGEPERSMDDTVTIISDVVIDSPHKIFIGGISNLLSSEMLMEIASAFGSLKAYHFETNAS 630 Query: 178 IDASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHA 357 DASCAFLEY+DH V++KACAG+NG+KLGG+VLT VQA+PDASS + SY IPEHA Sbjct: 631 -DASCAFLEYSDHSVSIKACAGMNGLKLGGEVLTVVQAMPDASSPSENAGESSYGIPEHA 689 Query: 358 KPLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSS 537 KPLL KPT++L+IKNV C+RFGT+K++NVV+ + Sbjct: 690 KPLLRKPTQVLEIKNVFAVESISSLSDMTVEEILDDVRFECARFGTIKSINVVRHSS--- 746 Query: 538 TTESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGR 717 E + T E E ++E VE D Sbjct: 747 ---------------------EKNLATKLEECEVINE----------VESEVFQDTNCIT 775 Query: 718 CSPLSSSTDPDDFSKANVDNG--HSDDKLLANIITDETCESNIEDK-------------- 849 S SS +D K+ NG DDK L D+ N + K Sbjct: 776 NSIKSSFSDKATDLKSEATNGVNFHDDKELEEYKVDDGTGINTDKKAELFDIKSCLEHPV 835 Query: 850 -DTSIKDAXXXXXXXXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSS 1026 DT+++D + D +DKVV N V IE Sbjct: 836 NDTAVEDVGGKSIPCSIIQASPVQQETPDDVPTLHDKVVANDIDV--------DIENKIV 887 Query: 1027 SEEDITRTSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAH 1206 + ++ + SA G + D +K D + + G VF+PGSVLVE+ RAEACC AAH Sbjct: 888 GDNMDSKGTVSAFQEGCSELVD-PQKGNDAKD--DNGHVFEPGSVLVEYGRAEACCSAAH 944 Query: 1207 CLHRRHFDDRIVTVEYISPNLYRKRFPK 1290 LH R FD R+VTVEY+S +LYR RF K Sbjct: 945 SLHGRLFDGRMVTVEYVSQSLYRARFTK 972 >ref|XP_006588544.1| PREDICTED: uncharacterized 
protein LOC100810537 [Glycine max] Length = 985 Score = 250 bits (638), Expect = 1e-63 Identities = 169/436 (38%), Positives = 233/436 (53%), Gaps = 6/436 (1%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TG P +S D ++SD V DS HKIFIGGIS +SSEML++IA FG LKA+H E + Sbjct: 583 TGEPARSVDVAVSISDVVIDSPHKIFIGGISNHLSSEMLMEIAGVFGSLKAYHFETKVN- 641 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAFLEY DH VT+KACAGLNGMKLGG+VLT +QA+PDAS L + + SY +PEHAK Sbjct: 642 NGPCAFLEYVDHSVTIKACAGLNGMKLGGEVLTVLQAMPDASPLENAGESLSYGVPEHAK 701 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL KPT++L+I NV A C+RFGT+K++NVVK S+ Sbjct: 702 PLLRKPTQVLEINNVFAADTILSLSDMAIEEILDDVRLECARFGTIKSINVVKH----SS 757 Query: 541 TESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRC 720 E+ A+ ++ +EV+ T N + S +T E ++ Sbjct: 758 GENLATKLEECKVINKVDAKEVSQDTNC-ITNNTESSFSDKATYPDFEGTNGMEIH---- 812 Query: 721 SPLSSSTDPDDFSKANVDNGHS--DDKLLANIITDETCESNIEDKDT-SIKDAXXXXXXX 891 D ++ + VD G DK A + ++C +++D + D Sbjct: 813 -------DNNEMEEVKVDEGSCVYVDK-NAEVFDYKSCREHVDDSAVEDVGDKGIPCSII 864 Query: 892 XXXXXREKSPDAS---IDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASA 1062 ++ +P+ D +++ND V+ + S +DT+ + SE DI SA Sbjct: 865 QECPDQQDTPNDGPEFYDKMVANDIDVNIENNMES--KDTVCAFQEGFSEWDI---SAEL 919 Query: 1063 LNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIV 1242 ++P D++ ED G VFKPGSVLVE+ RAEACC AAH LH R FD RIV Sbjct: 920 VSPQKSIDTE-----ED-----IYGHVFKPGSVLVEYGRAEACCSAAHSLHGRFFDGRIV 969 Query: 1243 TVEYISPNLYRKRFPK 1290 TV Y++ +LYR RF K Sbjct: 970 TVGYVALSLYRSRFTK 985 >ref|XP_006420295.1| hypothetical protein CICLE_v10004248mg [Citrus clementina] gi|557522168|gb|ESR33535.1| hypothetical protein CICLE_v10004248mg [Citrus clementina] Length = 967 Score = 250 bits (638), Expect = 1e-63 Identities = 169/445 (37%), Positives = 239/445 (53%), Gaps = 15/445 (3%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 +G +KS 
+ ++S V+DS HKIFIGGIS+ +SS+M+++I FGPLKA+H E N D Sbjct: 553 SGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH 612 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + CAF+EY D LVT KA AGLNG+K+GG++LTAVQA+ D S + + + P + IP+HA Sbjct: 613 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGRLLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 672 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL+KPTE+LK+KNV C+RFGTVK+VNVVK G+ + + Sbjct: 673 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGTVKSVNVVKYGDSNIS 732 Query: 541 T-------ESFASMDAGDAAVDDEGK------QEVT--VGTTAHELENLDESKPPSSTME 675 T E+ AS G +DE +EVT +ELE L++SK ME Sbjct: 733 TIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSK---EVME 789 Query: 676 AVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDT 855 A E N +VK R + + +P + + D + A T E + + Sbjct: 790 AGEVN---NVKDNRPASGTMGDEPSQLCELDTDMA---VEYQARDSTSEIVSQGVPTQVN 843 Query: 856 SIKDAXXXXXXXXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEE 1035 ++KD SP A D + N ++ S S ++ + +E+ + + E Sbjct: 844 TLKD----------------SPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSE 887 Query: 1036 DITRTSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLH 1215 T A N + S + E +++ + N G +F+PG V VE+ RAEA C+AAH LH Sbjct: 888 AFT----GASNEMGMQSSAV-ENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLH 942 Query: 1216 RRHFDDRIVTVEYISPNLYRKRFPK 1290 RR FDDRIV VEYI NLYR RF K Sbjct: 943 RRLFDDRIVAVEYIPLNLYRARFSK 967 >gb|ABK96758.1| unknown [Populus trichocarpa x Populus deltoides] Length = 787 Score = 246 bits (627), Expect = 3e-62 Identities = 160/441 (36%), Positives = 225/441 (51%), Gaps = 11/441 (2%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TG +KSA A + D V+DS HKIFIGGISK++SS+ML++IA FGPLKA+ E+ D Sbjct: 384 TGELEKSAAAIDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENRKDP 443 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 D AFLEYAD VT KACAGLNGMKLGGQV+TA+QA+P+ASS G + I +HAK Sbjct: 444 
DEPFAFLEYADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQHAK 503 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 LLEKPTE+LK+KNV ++ C+RFG+VK++NV+K + + Sbjct: 504 ALLEKPTEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARFGSVKSINVIKYAAITIS 563 Query: 541 TESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRC 720 T S + D V E Q + D + P + + D Sbjct: 564 TSK--SCEFNDDTVSAEATQSL----------GCDGTNPKTRNIRGSID----------- 600 Query: 721 SPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXXXX 900 K N DDK ++++ DE C+ D D +++D Sbjct: 601 ------------QKFMEGNSIGDDKPASDVMEDEPCQPGQVDSDMAVQDLACKSSSDSQE 648 Query: 901 XXREKSPDASIDHLISNDKV--VDNSTTVASGIEDTMKIEKGSSS---------EEDITR 1047 ++ S D+++D + + ++ VD +G ED E G + EE Sbjct: 649 PPQDVS-DSNVDKVTDDIEIEEVDAENKSTAG-EDLNLKEVGDNKLMAGEELNLEEVSGD 706 Query: 1048 TSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHF 1227 + +N + + EK + K + +LG +F+ G V VEF+R E C+AAHCLH R F Sbjct: 707 VEKAFVNDSMEMKPNSIEKGDCKEQDCSLGLIFERGCVFVEFRRTEGACMAAHCLHGRLF 766 Query: 1228 DDRIVTVEYISPNLYRKRFPK 1290 DDR V VEY+ ++Y RFPK Sbjct: 767 DDRAVVVEYVPLDIYLARFPK 787 >ref|XP_004497972.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X3 [Cicer arietinum] Length = 1127 Score = 236 bits (601), Expect = 3e-59 Identities = 164/439 (37%), Positives = 226/439 (51%), Gaps = 9/439 (2%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 T P++S + T+SD V +S +KIFIGGIS +SSEML++IA FG LKA+H E Sbjct: 733 TDEPERSVEVAVTISDDVVNSPNKIFIGGISNHVSSEMLMEIAGVFGSLKAYHFEATVS- 791 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 + SCAF+EY DH VT+KACAGLNGMKLGG+VLT VQA+PDA + + PSY IPEHA+ Sbjct: 792 NGSCAFVEYVDHAVTIKACAGLNGMKLGGEVLTVVQAMPDAPPVENDGKPPSYGIPEHAE 851 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVV---KQGNC 531 PLL +PT++L+IKNV C+RFGTVK++NV K+ N Sbjct: 852 PLLGEPTQVLEIKNVFTGESISSLSDMGIEEILEDVRLECARFGTVKSINVARHRKEKNL 911 Query: 532 
SSTTESF-ASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVK 708 ++ E +D+ +A++D T N + S +T E ED N + Sbjct: 912 ATELEEVKKKVDSDEASLD-----------THPVANNAEYSFSEEATKELDEDKNNDGI- 959 Query: 709 PGRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXX 888 NVD + ++ AN +E S+ D ++ Sbjct: 960 -----------------SVNVD---KNAEVFANTACEEHLVSDATVTDAGNEEGMPSSII 999 Query: 889 XXXXXXREKSPDAS--IDHLISNDKVVDNSTTVASGIEDTMKI---EKGSSSEEDITRTS 1053 R+ D D +++ND VD V +E + ++G + + TS Sbjct: 1000 HGYPDHRDTPNDDQELHDDMVANDTDVD-IKIVGGNMESKNNVCPFQEGIFECDTSSDTS 1058 Query: 1054 ASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDD 1233 + + PG +NE ED VF+PGSVLVE+ R EAC AAHCLHRR FD Sbjct: 1059 SKLVGPG----KGVNE--EDNA----YDHVFEPGSVLVEYARTEACRSAAHCLHRRLFDG 1108 Query: 1234 RIVTVEYISPNLYRKRFPK 1290 R+VTV+YI+ +LYR RF K Sbjct: 1109 RMVTVQYIALSLYRARFSK 1127 >ref|XP_004296390.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Fragaria vesca subsp. vesca] Length = 542 Score = 234 bits (598), Expect = 6e-59 Identities = 156/432 (36%), Positives = 224/432 (51%), Gaps = 7/432 (1%) Frame = +1 Query: 10 PDKS---ADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 P+KS AD T+SD V DS +KIFIGGISK++SSEML++I FGPLKA+H E N ++ Sbjct: 138 PEKSVAAADGIVTISDVVNDSPNKIFIGGISKVLSSEMLLEIVSVFGPLKAYHFEANEEL 197 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 AFLEY D VTLKACAGLNG+KLGG+V+T VQA+ SS + + Y IPEHAK Sbjct: 198 TEPYAFLEYVDQSVTLKACAGLNGIKLGGRVITVVQAIRSGSSSVNSGNASVYEIPEHAK 257 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL++P+ ILK++NV C+RFG VK+V +VK N Sbjct: 258 PLLKQPSHILKLRNVFNLENMSSLSEQEIEEVLEDVRLECARFGMVKSVKIVKHANNHVV 317 Query: 541 TESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRC 720 T V+ G+ + + SK + + ++++ + DVK Sbjct: 318 TTGACEAVNN---VESGGQWQ-------------NYSKEKGAKTDTLDEHIDKDVKVTSG 361 Query: 721 SPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXXXX 900 L+ D+ ++N DK +++ D++C+ DKDT I+ + Sbjct: 362 
VKLTGELKEDEVPESNC---LGFDKPADDLVEDKSCQIGQLDKDTEIQGSDDLSNQDSEE 418 Query: 901 XXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNPGNK 1080 P++ D NDK T+ + I+++M E +++ T N G + Sbjct: 419 L--TNLPNSKEDASECNDK-----TSEVTRIQNSMPEEVDGENQDTFAGT---VDNVGAE 468 Query: 1081 KD----SDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTV 1248 D S+ E+ K + G +F+PGSV VEF R EA +AAHCLH R F+DRIVTV Sbjct: 469 TDSILESETKEQHNGKESDFDPGSIFEPGSVFVEFGRTEASWMAAHCLHGRVFEDRIVTV 528 Query: 1249 EYISPNLYRKRF 1284 EY++ + YR F Sbjct: 529 EYVASDHYRAHF 540 >ref|XP_002281833.2| PREDICTED: uncharacterized protein LOC100266510 [Vitis vinifera] Length = 895 Score = 234 bits (597), Expect = 8e-59 Identities = 161/432 (37%), Positives = 226/432 (52%), Gaps = 18/432 (4%) Frame = +1 Query: 49 TVEDSSHKIFIGGISKLIS-------SEMLIDIARTFGPLKAFHVEHNSDIDASCAFLEY 207 T ED+S + GIS S + L++IA FGPLKA+ + N D+ CAFLEY Sbjct: 496 TPEDASAALSFDGISFSGSILKIRRPKDFLMEIAAAFGPLKAYRFQVNEDLGEPCAFLEY 555 Query: 208 ADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAKPLLEKPTEI 387 D VTLKACAGLNGMKLGGQVLT VQA+P+A ++ + + P Y IPEHAKPLLE+PT++ Sbjct: 556 VDQSVTLKACAGLNGMKLGGQVLTVVQAIPNALAMENTGNLPFYGIPEHAKPLLERPTQV 615 Query: 388 LKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCS-STTESFASMD 564 LK+KNV+ C+RFGTVK+VN+VK N ST E + + D Sbjct: 616 LKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVSTLEVYEAAD 675 Query: 565 AGDAAVDDEG---KQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRCSPLSS 735 + + +G K E G T + ++ KPP +DVK Sbjct: 676 NTGSNLGCDGNSMKAETLGGGTDNGSSDISGIKPP------------TDVK--------- 714 Query: 736 STDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXXXXXXREK 915 D + + N SDDK L ++I +E CE + D +T++K+ + Sbjct: 715 --DLKEVDEVVERNSISDDKSLTDLIKNELCEPSHIDSNTAVKE-------PGCPDGSDD 765 Query: 916 SPDASIDHLISNDKVVDNSTTVASGI--EDTMKIEKGSSSEEDITR----TSASA-LNPG 1074 P D L + V+ A+ + ED + K + EE+ R TSA +PG Sbjct: 766 IPRGLPDQLNNMKHEVELRNDKAADVIQEDFIIKNKLMTVEEETNRKLLGTSAELDSSPG 825 Query: 1075 NKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTVEY 1254 K SD K + + + 
+L +F+ G VLVE+ R EA C+AAHCLH R+FDDR+V V Y Sbjct: 826 IK--SDFTGKNDSEKGLCDLDDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDRVVVVGY 883 Query: 1255 ISPNLYRKRFPK 1290 ++ +LYR +FP+ Sbjct: 884 VALDLYRMKFPR 895 >ref|XP_004497970.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X1 [Cicer arietinum] gi|502123016|ref|XP_004497971.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X2 [Cicer arietinum] Length = 1130 Score = 230 bits (587), Expect = 1e-57 Identities = 160/438 (36%), Positives = 220/438 (50%), Gaps = 8/438 (1%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 T P++S + T+SD V +S +KIFIGGIS +SSEML++IA FG LKA+H E Sbjct: 733 TDEPERSVEVAVTISDDVVNSPNKIFIGGISNHVSSEMLMEIAGVFGSLKAYHFEATVS- 791 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSL---GDVTDCPSYAIPE 351 + SCAF+EY DH VT+KACAGLNGMKLGG+VLT VQA+PDA + + PSY IPE Sbjct: 792 NGSCAFVEYVDHAVTIKACAGLNGMKLGGEVLTVVQAMPDAPPVIFQENDGKPPSYGIPE 851 Query: 352 HAKPLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNC 531 HA+PLL +PT++L+IKNV C+RFGTVK++NV + Sbjct: 852 HAEPLLGEPTQVLEIKNVFTGESISSLSDMGIEEILEDVRLECARFGTVKSINVARH--- 908 Query: 532 SSTTESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKP 711 ++E + T E++ +S S V +N Sbjct: 909 ---------------------RKEKNLATELEEVKKKVDSDEASLDTHPVANNAEYSFSE 947 Query: 712 GRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXX 891 L + D S NVD + ++ AN +E S+ D ++ Sbjct: 948 EATKELDEDKNNDGIS-VNVDK---NAEVFANTACEEHLVSDATVTDAGNEEGMPSSIIH 1003 Query: 892 XXXXXREKSPDASI--DHLISNDKVVDNSTTVASGIEDTMKI---EKGSSSEEDITRTSA 1056 R+ D D +++ND VD V +E + ++G + + TS+ Sbjct: 1004 GYPDHRDTPNDDQELHDDMVANDTDVDIKI-VGGNMESKNNVCPFQEGIFECDTSSDTSS 1062 Query: 1057 SALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDR 1236 + PG +NE ED VF+PGSVLVE+ R EAC AAHCLHRR FD R Sbjct: 1063 KLVGPGK----GVNE--EDNA----YDHVFEPGSVLVEYARTEACRSAAHCLHRRLFDGR 1112 Query: 1237 IVTVEYISPNLYRKRFPK 1290 +VTV+YI+ +LYR RF K Sbjct: 1113 MVTVQYIALSLYRARFSK 1130 
>gb|EMJ25476.1| hypothetical protein PRUPE_ppa019989mg, partial [Prunus persica] Length = 400 Score = 226 bits (576), Expect = 2e-56 Identities = 164/423 (38%), Positives = 215/423 (50%), Gaps = 19/423 (4%) Frame = +1 Query: 73 IFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDIDASCAFLEYADHLVTLKACAGLNG 252 IFIGGISK +SSEML+++ FGPLKA+H E N +++ AFLEY D VTLKACAGLNG Sbjct: 1 IFIGGISKSLSSEMLMELISVFGPLKAYHFEVNKELNEPHAFLEYVDQSVTLKACAGLNG 60 Query: 253 MKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAKPLLEKPTEILKIKNVLEAXXXXXX 432 MKLGG+VLTAVQA+ DASSL + + + IPE+AKPLL++P+++LK++NVL Sbjct: 61 MKLGGRVLTAVQAIHDASSLENSGNASLHEIPEYAKPLLKQPSQVLKLRNVLNLEHISLL 120 Query: 433 XXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSSTTESFASMDAGDAAVDDEGKQEVTV 612 C+RFGTVK+V VVK N TT F ++D ++ + Sbjct: 121 SEPEVEEVLEDVRLECARFGTVKSVKVVKHCNNYVTTGVFEAVDDAESGGYQNILEFEQK 180 Query: 613 GTTAHELENLDESK---PPSSTMEAVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDNGH 783 G LE ++K PS+ E ED +V G C FS +D+ Sbjct: 181 GAKTDTLEEHIDNKFVEFPSNAKEVKED----EVTKGSC-----------FSVTALDDEP 225 Query: 784 SDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXXXXXXRE--KSPDASIDHLISNDK 957 +DD + +++C+ D IK + + + DAS K Sbjct: 226 TDD-----FVEEKSCKIGQFGDDIEIKGSENPSNRVPEQLHNQLNSTKDAS--------K 272 Query: 958 VVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASA--LNPGNKKD------SDINEKAE- 1110 D T A I D K + EE T+ A L KD SD NEK E Sbjct: 273 CFDVKATEAIEINDLSLENKLMAEEEGSTQEEADGEKLRSFAGKDCSLGTESDANEKIEI 332 Query: 1111 -----DKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHFDDRIVTVEYISPNLYR 1275 K +LG +F+PG V VEF R EA +AAHCLH R F+DRIVTVEYIS + YR Sbjct: 333 KEQNHGKEHDYDLGSIFEPGCVFVEFGRIEASLMAAHCLHGRVFEDRIVTVEYISLDHYR 392 Query: 1276 KRF 1284 F Sbjct: 393 AHF 395 >ref|XP_006857448.1| hypothetical protein AMTR_s00067p00176230 [Amborella trichopoda] gi|548861541|gb|ERN18915.1| hypothetical protein AMTR_s00067p00176230 [Amborella trichopoda] Length = 928 Score = 222 bits (566), Expect = 3e-55 Identities = 158/439 (35%), Positives = 223/439 (50%), Gaps = 9/439 (2%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 T P 
+ DA +SD V+DS HKIFIGGI K +SS+ L +I FG LKA+H E N + Sbjct: 514 TEKPVATVDA---VSDIVKDSPHKIFIGGIPKSLSSDKLQEIVSVFGHLKAYHFEVNRES 570 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 SCAFLEY D +TLKACAGLNGMKLGG VLT VQA PD S+ PSY IP+HAK Sbjct: 571 GGSCAFLEYTDQSITLKACAGLNGMKLGGCVLTVVQAFPDVSAEEISKGPPSYGIPQHAK 630 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL++PT+ILK+KNV C+RFGTVK+VN+++ S + Sbjct: 631 PLLKEPTQILKLKNVFN---MDDLSESEIEESLEDIRIECTRFGTVKSVNIIR---LSKS 684 Query: 541 TESFASMDAGDAAVDDEG-KQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGR 717 +E +M D G KQ+ T +E LD S + A +D+ + K Sbjct: 685 SEEAPNMTITTGNNDSPGPKQDPT-----QIMEKLDSVN--SDILGAKQDSLHELEKSDP 737 Query: 718 CSPLSSSTDPDDFSKANV-DNGHSDDKLLANIITDET--CESNIEDKDTSIKDAXXXXXX 888 + +D D + + + G+S++ + I ++T E +DKD + Sbjct: 738 VNCDMQMSDQDPIQEIEIWEPGYSENVEIVASIDEKTRDLEMITDDKDEHLLKNKEDESG 797 Query: 889 XXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALN 1068 + D + D L + + N+ E T + + E+ + + Sbjct: 798 TSNCEQTTLAGDDASDQLPCSLSLQYNNAH-----EPTFSLSQQDRVSEEFQKKCEA--- 849 Query: 1069 PGNKK--DSDINEKAEDKGEVANLGK---VFKPGSVLVEFKRAEACCIAAHCLHRRHFDD 1233 PG+ K D D+ +D+ + N F+PG VLVE+ R EA C+AAHCLH R + D Sbjct: 850 PGSMKLEDFDMGSSGDDQKTMINPSSDFDAFQPGCVLVEYSRKEAACLAAHCLHGRLYGD 909 Query: 1234 RIVTVEYISPNLYRKRFPK 1290 V VEY++ +LYR RFP+ Sbjct: 910 HRVAVEYVAYDLYRARFPR 928 >ref|XP_002528813.1| splicing factor u2af large subunit, putative [Ricinus communis] gi|223531725|gb|EEF33547.1| splicing factor u2af large subunit, putative [Ricinus communis] Length = 844 Score = 207 bits (527), Expect = 1e-50 Identities = 140/396 (35%), Positives = 202/396 (51%), Gaps = 2/396 (0%) Frame = +1 Query: 109 EMLIDIARTFGPLKAFHVEHNSDIDASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQ 288 + +++IA TFGPLKA+H E+ D++ CAF+EYAD VT +ACAGLNGMKLGGQV++AVQ Sbjct: 486 DFIMEIASTFGPLKAYHFENIDDVNGPCAFVEYADQSVTFRACAGLNGMKLGGQVISAVQ 545 Query: 289 ALPDASSLGDVTDCPSYAIPEHAKPLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXX 468 +P+AS+L P Y +PE 
AKPLL+KPT++LK+KN+ + Sbjct: 546 VIPNASTLEIDGKQPFYGVPEQAKPLLDKPTQVLKLKNLFDPETLPSLSRIEIEEVLEDV 605 Query: 469 XXXCSRFGTVKAVNVVKQGNCSSTTESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDE 648 C+RFGTVK+VNVV+ G T M+ +D G Q +NL Sbjct: 606 RLECARFGTVKSVNVVRNGPIPIFTSEACKMNED---MDSAGPQ-----------QNLGG 651 Query: 649 SKPPSSTMEAVEDNCNSDVKPGRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETC 828 + + T + + D + V+ + D DD K NG DDK +++ DE+ Sbjct: 652 DETNAETEKTIGDIHHEPVE---------ANDTDD-DKPVEGNGVEDDKPADDLMEDESS 701 Query: 829 ESNIEDKDTSIKD--AXXXXXXXXXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDT 1002 + D + ++++ ++ S D S D L + KV D+ + E Sbjct: 702 QLGQFDSNMAVENLSGDGVPEPQEPIPIQQTSKDES-DCL--HGKVTDDVQMKDTIAEHK 758 Query: 1003 MKIEKGSSSEEDITRTSASALNPGNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRA 1182 + I++ +E T A +SD K + + +L +F P V VEF R Sbjct: 759 LPIQQ--ELKESFTNDHA--------VESDATGKGDHEEHNCDLSYIFYPSCVFVEFGRT 808 Query: 1183 EACCIAAHCLHRRHFDDRIVTVEYISPNLYRKRFPK 1290 EA CIAAHCLH R +D R VTV YI ++YR RFPK Sbjct: 809 EASCIAAHCLHGRLYDGRTVTVGYIPLDVYRSRFPK 844 >ref|XP_002465895.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor] gi|241919749|gb|EER92893.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor] Length = 969 Score = 206 bits (525), Expect = 2e-50 Identities = 145/441 (32%), Positives = 214/441 (48%), Gaps = 14/441 (3%) Frame = +1 Query: 10 PDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDIDAS 189 P K A+ +SD V DS HKIFI GI+ +ISSEML++I FGPL A+ NS++ Sbjct: 564 PKKPAEETALISDVVADSPHKIFIAGIAGVISSEMLMEIVSAFGPLAAYRFLFNSELGGP 623 Query: 190 CAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAKPLL 369 CAFLEYAD +T KACAGLNGM LGG VLTAV P+ P Y IPE+AK LL Sbjct: 624 CAFLEYADRSITSKACAGLNGMMLGGCVLTAVHVFPNPPVEAANEASPFYGIPENAKSLL 683 Query: 370 EKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSSTTES 549 ++PT++L++KN E C+RFG VK+V+VV+ Sbjct: 684 KEPTKVLQLKNTFEREEYMLLSKSELEETLEDVRVECTRFGAVKSVHVVE---------- 733 Query: 550 FASMDAGDAAVDDEGKQEVTVGTTAH-ELENL-----DESKPPSSTMEAVEDNCNSDVKP 711 AG + ++ E+ + T + EN+ + S P + +++ + + 
S+ K Sbjct: 734 ---YPAGGGSAAEDNTVELKIECTEFADTENIAKAVSEYSVPINQSIDVLNHSEASETKD 790 Query: 712 GRCSPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKDAXXXXXXX 891 P+ S D H D L +N E C++ + D+D + + Sbjct: 791 --VDPIPESQD------------HKDKHLPSNAALCE-CKAPVADEDAELDETQSRAALP 835 Query: 892 XXXXXREKSPDASIDHLISNDKVVDNSTTVASGIEDTMKIEKGSSSEEDITRTSASALNP 1071 +A++D + + V + + D +EK S ++ T + S P Sbjct: 836 TSQHAEVGHTEAAVD-----ENKHTGAGEVTATVMDDDAVEK--SHQDPRTSETCSPAEP 888 Query: 1072 GNKKD-----SDINEKAEDK---GEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHRRHF 1227 +K + D+ E +K E ++ G VF+PGSVLVEF R EA CIAAH LH R F Sbjct: 889 TDKVEKPGSADDVTENRPEKVPAVETSDTGFVFEPGSVLVEFMRKEAACIAAHSLHGRRF 948 Query: 1228 DDRIVTVEYISPNLYRKRFPK 1290 +R V Y +LY +++P+ Sbjct: 949 GNRTVHAGYAPYDLYLQKYPR 969 >gb|EOY06129.1| Splicing factor U2AF 50 kDa subunit, putative [Theobroma cacao] Length = 1032 Score = 198 bits (504), Expect = 5e-48 Identities = 122/289 (42%), Positives = 163/289 (56%) Frame = +1 Query: 1 TGVPDKSADANFTLSDTVEDSSHKIFIGGISKLISSEMLIDIARTFGPLKAFHVEHNSDI 180 TG +KS +A +SD V+DS HKIFIGGISK IS EML++IA FGPLKA+H E N D+ Sbjct: 519 TGELEKS-EAVTKVSDFVKDSHHKIFIGGISKAISCEMLVEIANAFGPLKAYHFEINEDL 577 Query: 181 DASCAFLEYADHLVTLKACAGLNGMKLGGQVLTAVQALPDASSLGDVTDCPSYAIPEHAK 360 A LEY D VTLKACAGLNGMKLGGQV+TAVQA+P+ SSLG+ D S+ IP+HA+ Sbjct: 578 GDQYAILEYVDESVTLKACAGLNGMKLGGQVITAVQAVPNGSSLGNGGDRQSFVIPQHAR 637 Query: 361 PLLEKPTEILKIKNVLEAXXXXXXXXXXXXXXXXXXXXXCSRFGTVKAVNVVKQGNCSST 540 PLL+KPT++LK+K+ L C+RFGT+K+VN+VK N Sbjct: 638 PLLQKPTQVLKLKS-LFPEDFSSLSEAEAEEVLEDVRLECARFGTIKSVNIVKHAN---- 692 Query: 541 TESFASMDAGDAAVDDEGKQEVTVGTTAHELENLDESKPPSSTMEAVEDNCNSDVKPGRC 720 A + GD +DD ++ LEN DE + TME V D G Sbjct: 693 ----AIIATGDKKIDDNTRET----GARRNLEN-DEINVQTETMEEVTD--------GNS 735 Query: 721 SPLSSSTDPDDFSKANVDNGHSDDKLLANIITDETCESNIEDKDTSIKD 867 + P D + + +D+K L ++ +E+C + D + +D Sbjct: 736 GGTAQVKFPSDAHEEKAGDSINDEKPLCKLVDNESCRQGEFEGDINKED 784 Score = 95.9 bits (237), Expect = 4e-17 Identities = 68/204 (33%), Positives = 100/204 (49%), Gaps = 
17/204 (8%) Frame = +1 Query: 730 SSSTDPDDFSKANVDNGHS-----DDKLLANIITDETCESNIEDKDTSIKDAXXXXXXXX 894 +S + D +S N DN S D+ L AN ESN+E+ + + + Sbjct: 838 ASKEESDYYSDRNADNIKSVAINVDEILAAN-------ESNLEEVNGKLPEGCPNAEVAI 890 Query: 895 XXXXREKSP---DASIDHLISNDKVVDNSTTVASGIE-DTMKIEKGSSSEEDITRTSASA 1062 + P I + ++ VA ++ + + +EK +ED+ Sbjct: 891 EDPASKSVPISISQEIPRMPRTEEQDSQFDKVADNVQIEVINVEKKLVPKEDLELKEVDG 950 Query: 1063 LNP--------GNKKDSDINEKAEDKGEVANLGKVFKPGSVLVEFKRAEACCIAAHCLHR 1218 P G K +SD E+AE+K NL ++F+PG V VE++R EA C+AAHC+H Sbjct: 951 KLPEAVDGSAGGVKIESDTIEQAENKEN--NLQQIFEPGCVFVEYRRIEASCMAAHCIHG 1008 Query: 1219 RHFDDRIVTVEYISPNLYRKRFPK 1290 R FDDRIVTVEYI P+LYR +FPK Sbjct: 1009 RLFDDRIVTVEYIDPDLYRLKFPK 1032