BLASTX nr result
ID: Catharanthus23_contig00010147
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00010147 (2142 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004234688.1| PREDICTED: uncharacterized protein LOC101255... 414 e-113 ref|XP_006346854.1| PREDICTED: uncharacterized protein LOC102582... 412 e-112 emb|CBI16864.3| unnamed protein product [Vitis vinifera] 402 e-109 ref|XP_002279557.2| PREDICTED: uncharacterized protein LOC100247... 401 e-109 ref|XP_006482308.1| PREDICTED: uncharacterized protein LOC102626... 396 e-107 gb|EMJ18348.1| hypothetical protein PRUPE_ppa003054mg [Prunus pe... 393 e-106 ref|XP_006430828.1| hypothetical protein CICLE_v10013582mg, part... 388 e-105 ref|XP_002525972.1| nucleic acid binding protein, putative [Rici... 385 e-104 ref|XP_006373469.1| hypothetical protein POPTR_0017s14060g [Popu... 380 e-102 gb|EOY04289.1| Proline-rich spliceosome-associated family protei... 378 e-102 ref|XP_004304149.1| PREDICTED: uncharacterized protein LOC101295... 369 3e-99 gb|EXB72259.1| Zinc finger CCHC domain-containing protein 8 [Mor... 367 1e-98 ref|XP_006593391.1| PREDICTED: uncharacterized protein LOC100527... 364 9e-98 gb|ADN34281.1| nucleic acid binding protein [Cucumis melo subsp.... 362 3e-97 ref|XP_004169819.1| PREDICTED: uncharacterized protein LOC101230... 360 1e-96 ref|XP_004141493.1| PREDICTED: uncharacterized protein LOC101212... 360 1e-96 ref|XP_002305958.1| proline-rich spliceosome-associated family p... 359 3e-96 ref|XP_006603953.1| PREDICTED: uncharacterized protein LOC100805... 356 2e-95 ref|XP_004514436.1| PREDICTED: uncharacterized protein LOC101500... 354 8e-95 ref|XP_002329267.1| predicted protein [Populus trichocarpa] 354 1e-94 >ref|XP_004234688.1| PREDICTED: uncharacterized protein LOC101255771 [Solanum lycopersicum] Length = 530 Score = 414 bits (1065), Expect = e-113 Identities = 219/452 (48%), Positives = 288/452 (63%), Gaps = 18/452 (3%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLE--------VNVNESHGS 1807 M TED +LPAS + G +N E+ +G N + + E +++ GS Sbjct: 1 MGTEDPNNLPASDNLERGIENIEVGANGDTSKTTNFEVSESNEPLRESDSDMDLESDPGS 60 Query: 1806 ST--DIVNEDNQLLLDADNKSENQDKLEVI-----ADAGLME--DGNGLCNHAEDAFQDS 1654 D+ +Q+ ++ +++ ++ A+ GL+ D N N ED S Sbjct: 61 QVGVDLTGTPSQVCVELAETVGITEEVTMVDSVIHAENGLLSLPDANYSSNQTEDQDHVS 120 Query: 1653 CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 1474 + KRPR D +QPSV V+Y SLTR+S++ LE LLQQWS+WHA++CSSA Sbjct: 121 TQEIGGVKCLSGVKRPRATLDVEQPSVHVVYDSLTRESRKMLEGLLQQWSEWHAKHCSSA 180 Query: 1473 NDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXX 1294 DS + SGEETYFPAL VG +KPSA+++W++ + S E I DGNS+PLYDRGY Sbjct: 181 QDSRELLESGEETYFPALHVGLEKPSAVTYWVDKQASNNKSEFIPLDGNSIPLYDRGYSF 240 Query: 1293 XXXXXXXXXXLERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNA 1114 +ER E + SRCFNCGSY H+LK+CPKPRDN AVN+ARKQHK RRNQ+A Sbjct: 241 ALTATDSSTNVERGIEMVDSSRCFNCGSYGHALKECPKPRDNAAVNSARKQHKSRRNQSA 300 Query: 1113 AARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD 934 ++RNPTRYYQ++P GKYDGL+PG L++ETRKLLGLGELDPPPW+NRMR++GYPPGYLE D Sbjct: 301 SSRNPTRYYQDSPRGKYDGLRPGALDSETRKLLGLGELDPPPWINRMRQMGYPPGYLEDD 360 Query: 933 -DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERWAASG 757 DQPSGITIF S P+K +V+FPGVNAPIPE+ADE RW A+ Sbjct: 361 EDQPSGITIFADEKNKEETEEGEILDKSLPNLPRKMTVDFPGVNAPIPEHADERRWEAAP 420 Query: 756 SSDLYMSRNHSYSRYNRPSEPISRGRHHEQQR 661 SS Y SR+HS++RYN + ++RG +HEQ+R Sbjct: 421 SSSRY-SRSHSHNRYNHAQDYVNRGHYHEQRR 451 >ref|XP_006346854.1| PREDICTED: uncharacterized protein LOC102582187 [Solanum tuberosum] Length = 530 Score = 412 bits (1059), Expect = e-112 Identities = 215/452 (47%), Positives = 285/452 (63%), Gaps = 18/452 (3%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSG----------CQPSEQNCQSLDVLEVNVNESH 1813 M TED + PAS + G +N E+ +G + E +S +++ + Sbjct: 1 MGTEDPNNCPASDNLERGIENSEVGANGDTSKPTNFVVSESKEPQQESDSDMDLESDPGS 60 Query: 1812 GSSTDIVNEDNQLLLDADNKSENQDKLEVI-----ADAGLME--DGNGLCNHAEDAFQDS 1654 D+ +Q+ ++ E +++ + A+ GL+ D N N +D S Sbjct: 61 QVGVDLTGTPSQVGVELAETVEITEEVTTLDSVVHAENGLLSLPDRNNSSNQTKDQDHVS 120 Query: 1653 CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 1474 + KRPR D +QPSV V+Y SLTR+S+ LE LLQQWS+WHA++CSSA Sbjct: 121 TQEIGGVKCLSGVKRPRATLDVEQPSVHVVYDSLTRESRNMLEGLLQQWSEWHAKHCSSA 180 Query: 1473 NDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXX 1294 +DS + SGEETYFPAL VG + PSA+++W++ + S E I DGNS+PLYDRGY Sbjct: 181 HDSRELLESGEETYFPALHVGLENPSAVTYWVDKQASNNKSEFIPLDGNSIPLYDRGYSF 240 Query: 1293 XXXXXXXXXXLERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNA 1114 +ER E + SRCFNCGSY H+LK+CPKPRDN AVN+ARKQHK RRNQ+A Sbjct: 241 ALTATDSSTNVERGMEMVDSSRCFNCGSYGHALKECPKPRDNAAVNSARKQHKSRRNQSA 300 Query: 1113 AARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD 934 ++RNPTRYYQ++P GKYDGL+PG L++ETRKLLGLGELDPPPW+NRMR++GYPPGYLE D Sbjct: 301 SSRNPTRYYQDSPRGKYDGLRPGALDSETRKLLGLGELDPPPWINRMRQMGYPPGYLEED 360 Query: 933 -DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERWAASG 757 DQPSGITIF S P+K SV+FPGVNAPIPE+ADE RW A+ Sbjct: 361 EDQPSGITIFADEKNKEETEEGEILDKSFPNPPRKMSVDFPGVNAPIPEHADERRWEAAP 420 Query: 756 SSDLYMSRNHSYSRYNRPSEPISRGRHHEQQR 661 SS Y SR++S++RYN + ++RG +HEQ+R Sbjct: 421 SSSRY-SRSYSHNRYNHAQDYVNRGHYHEQRR 451 >emb|CBI16864.3| unnamed protein product [Vitis vinifera] Length = 1165 Score = 402 bits (1032), Expect = e-109 Identities = 230/465 (49%), Positives = 292/465 (62%), Gaps = 26/465 (5%) Frame = -1 Query: 1980 PERML---EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSL--DVLE--VNVN 1822 P+++L +M TE++I+ PA S G ++ EL++S +P E + S +V E +N+ Sbjct: 582 PQKLLLDSDMGTEELINPPAPSGSVCGSEDNELHNSNPEPGEADSSSSNSEVKEDKLNIE 641 Query: 1821 ESHGSSTDIVNEDNQL----LLD---ADNKSENQDKLEV----IADAGLMEDGNGL---- 1687 + D D++L +LD D + +Q +EV + + +G+ Sbjct: 642 SLMQNKVDFEKVDSRLTPGVVLDKDLVDKQLTSQGSVEVTETIVVTKLINSSSSGVPTEN 701 Query: 1686 -CNHAEDAFQDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQ 1510 C A D S+SG KR RL DEQQPSV VIY SLTRDSKRKLEELLQQ Sbjct: 702 GCLTAPDEGPIGNHMIDGTSISGV-KRARLTIDEQQPSVHVIYNSLTRDSKRKLEELLQQ 760 Query: 1509 WSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFD 1333 WS+WHA+ SS++D + SGE+TYFPAL VG +K SA+SFW++N+T Q+KE I D Sbjct: 761 WSEWHAKYVSSSHDPKGQLDSGEKTYFPALHVGLNKSSAVSFWVDNQTRKQQDKEFISLD 820 Query: 1332 GNSVPLYDRGYXXXXXXXXXXXXLERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNN 1153 G+SVPLYDRG+ E E + SRCFNCGSY+HS+K+CPKPRDNVAVNN Sbjct: 821 GDSVPLYDRGFALGLVSEDGQSKPEGALEIIDASRCFNCGSYNHSMKECPKPRDNVAVNN 880 Query: 1152 ARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRM 973 ARKQHK RRNQN +RNPTRYYQN+PGG+YDGL+PG L ETR+LLGLGELDPPPWLNRM Sbjct: 881 ARKQHKSRRNQNPGSRNPTRYYQNSPGGRYDGLRPGALGVETRELLGLGELDPPPWLNRM 940 Query: 972 REIGYPPGYL--EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAP 799 RE+GYPPGYL E ++QPSGITI+ + + +K SVEFPG+NAP Sbjct: 941 REMGYPPGYLDPEEEEQPSGITIYADEEVKDEQEDGEILETEYLEPQRKMSVEFPGINAP 1000 Query: 798 IPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 664 IP+NADE RWAA S + NHSY EP SR HEQ+ Sbjct: 1001 IPKNADERRWAA--GSRPHRRLNHSY-------EPSSRRNSHEQR 1036 >ref|XP_002279557.2| PREDICTED: uncharacterized protein LOC100247996 [Vitis vinifera] Length = 575 Score = 401 bits (1030), Expect = e-109 Identities = 230/458 (50%), Positives = 287/458 (62%), Gaps = 23/458 (5%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSL--DVLE--VNVNESHGSSTDI 1795 M TE++I+ PA S G ++ EL++S +P E + S +V E +N+ + D Sbjct: 1 MGTEELINPPAPSGSVCGSEDNELHNSNPEPGEADSSSSNSEVKEDKLNIESLMQNKVDF 60 Query: 1794 VNEDNQL----LLD---ADNKSENQDKLEV----IADAGLMEDGNGL-----CNHAEDAF 1663 D++L +LD D + +Q +EV + + +G+ C A D Sbjct: 61 EKVDSRLTPGVVLDKDLVDKQLTSQGSVEVTETIVVTKLINSSSSGVPTENGCLTAPDEG 120 Query: 1662 QDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNC 1483 S+SG KR RL DEQQPSV VIY SLTRDSKRKLEELLQQWS+WHA+ Sbjct: 121 PIGNHMIDGTSISGV-KRARLTIDEQQPSVHVIYNSLTRDSKRKLEELLQQWSEWHAKYV 179 Query: 1482 SSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDR 1306 SS++D + SGE+TYFPAL VG +K SA+SFW++N+T Q+KE I DG+SVPLYDR Sbjct: 180 SSSHDPKGQLDSGEKTYFPALHVGLNKSSAVSFWVDNQTRKQQDKEFISLDGDSVPLYDR 239 Query: 1305 GYXXXXXXXXXXXXLERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1126 G+ E E + SRCFNCGSY+HS+K+CPKPRDNVAVNNARKQHK RR Sbjct: 240 GFALGLVSEDGQSKPEGALEIIDASRCFNCGSYNHSMKECPKPRDNVAVNNARKQHKSRR 299 Query: 1125 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 946 NQN +RNPTRYYQN+PGG+YDGL+PG L ETR+LLGLGELDPPPWLNRMRE+GYPPGY Sbjct: 300 NQNPGSRNPTRYYQNSPGGRYDGLRPGALGVETRELLGLGELDPPPWLNRMREMGYPPGY 359 Query: 945 L--EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEER 772 L E ++QPSGITI+ + + +K SVEFPG+NAPIP+NADE R Sbjct: 360 LDPEEEEQPSGITIYADEEVKDEQEDGEILETEYLEPQRKMSVEFPGINAPIPKNADERR 419 Query: 771 WAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 WAA S + NHSY EP SR HE QRW Sbjct: 420 WAA--GSRPHRRLNHSY-------EPSSRRNSHE-QRW 447 >ref|XP_006482308.1| PREDICTED: uncharacterized protein LOC102626617 [Citrus sinensis] Length = 553 Score = 396 bits (1018), Expect = e-107 Identities = 230/458 (50%), Positives = 284/458 (62%), Gaps = 23/458 (5%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNED 1783 ME EDVI L ASS +N E+ +P + + Q D E ++S+G S ++ NE Sbjct: 1 MEAEDVIDLLASSPSGCEEENNEMPGRDGEPGKSDFQPNDS-EKKEDDSNGESMEL-NEL 58 Query: 1782 NQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF--------QDSCPQTR---- 1639 N + D E + +V+ D+ + +G AE Q+ C + Sbjct: 59 NVEIEDGQLIEEGEVGKDVVDDSNVNVEGTTTVELAETIVESDSRIHVQNGCLEVGNRSP 118 Query: 1638 -------ANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCS 1480 +S SG KR R+ DE+QPSV VIY SLTR SK+KLEELLQQWS+W AQ S Sbjct: 119 NHNRMKDVSSTSGV-KRARMTLDEEQPSVHVIYNSLTRASKQKLEELLQQWSEWQAQFGS 177 Query: 1479 SANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSVPLYDRG 1303 S+ND G+ GE+T+FPA++VG K A+SFW++N+T Q NK I D +S PLYDRG Sbjct: 178 SSNDPNEGIEFGEQTFFPAIRVGKAKGPAVSFWIDNQTRNQQNKNFIPSDSHSTPLYDRG 237 Query: 1302 YXXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1126 Y LE E +D SRCFNCGSYSHSLK+CPKPRD AVNNARKQHK +R Sbjct: 238 YALGLTSGDGSSNLEGGLEIIDDASRCFNCGSYSHSLKECPKPRDKDAVNNARKQHKSKR 297 Query: 1125 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 946 NQN+A+RNP RYYQN+ GGKYDGL+PG L+AETR+LLGLGELDPPPWL+RMRE+GYPPGY Sbjct: 298 NQNSASRNPMRYYQNSAGGKYDGLRPGALDAETRQLLGLGELDPPPWLHRMRELGYPPGY 357 Query: 945 L--EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEER 772 L E DDQPSGITI+ + +K + EFPG+NAPIPENADE Sbjct: 358 LDSEDDDQPSGITIYADREIKEGQEDGEIIETGRPASKRKMTAEFPGINAPIPENADERL 417 Query: 771 WAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 WAA SS SR+ S+ R N SE ISRGR+HE QRW Sbjct: 418 WAARPSSS-DSSRDRSHHRLNHHSESISRGRYHE-QRW 453 >gb|EMJ18348.1| hypothetical protein PRUPE_ppa003054mg [Prunus persica] Length = 608 Score = 393 bits (1009), Expect = e-106 Identities = 243/509 (47%), Positives = 293/509 (57%), Gaps = 74/509 (14%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLD-------------------- 1843 ME+ED I LP S N+E N+S C E N Q + Sbjct: 1 MESEDFIGLPPSGDSLCRNVNDEPNNSNCDSKEVNSQPTNSEDREDKPKSENLGNDSDAQ 60 Query: 1842 ------VLEVNV-NESHGSSTDIVNEDNQLLLDADNKSENQDKLEVIADAGLMEDGNGLC 1684 V E N+ NE GS +D+ ED L A N+S++ D E I G +DG+ C Sbjct: 61 REVSHCVPEENLENELVGSGSDMEIEDISNL-PALNRSDSAD--EEIKIKG-NKDGDAHC 116 Query: 1683 ----NHAEDAF-----------------------------------QDSCP----QTRAN 1633 NH D F QD+ P +T Sbjct: 117 LQQANHNNDLFDESSLLSVAQSETVTVAQESNVFCSKVHKNGCLPVQDASPFGTHKTGGT 176 Query: 1632 SLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGM 1453 ++SG KR R+ DE+QPSV+V YKSLTR SK KLEELLQQWS+WHAQ S+ D + Sbjct: 177 TISGV-KRARITVDERQPSVRVTYKSLTRASKHKLEELLQQWSEWHAQYVPSSQDPIEVV 235 Query: 1452 VSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXX 1276 SGE+T+FPAL VG +K SA+SFWM+N+T ++KE D N VPLYDRGY Sbjct: 236 ESGEDTFFPALHVGTEKTSAVSFWMDNQTRKAESKESTPLDSNYVPLYDRGYALGLTLAG 295 Query: 1275 XXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNP 1099 LE E +D SRCFNCGSY+HSLKDCPKPR++VAVNNARKQ K +RNQNA +RN Sbjct: 296 GSSNLEGGLEIIDDASRCFNCGSYNHSLKDCPKPRNHVAVNNARKQLKFKRNQNANSRNS 355 Query: 1098 TRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQP 925 TRYYQN+P GKYDGL+PG L+AETRKLLG+GELDPPPWLNRMREIGYPPGYL+ D DQP Sbjct: 356 TRYYQNSPAGKYDGLRPGALDAETRKLLGIGELDPPPWLNRMREIGYPPGYLDPDDEDQP 415 Query: 924 SGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERWAASGSSDL 745 SGI I+ + + +K +VEFPG+N PIPE+ADE W A G S Sbjct: 416 SGIIIYADEEIKGEQEDGEIIETDYPEPQRKMTVEFPGLNGPIPEDADERLW-APGPSFS 474 Query: 744 YMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 SRN SYSR N SEP+SRG HH +QRW Sbjct: 475 DHSRNRSYSRSNHYSEPVSRG-HHREQRW 502 >ref|XP_006430828.1| hypothetical protein CICLE_v10013582mg, partial [Citrus clementina] gi|557532885|gb|ESR44068.1| hypothetical protein CICLE_v10013582mg, partial [Citrus clementina] Length = 1076 Score = 388 bits (996), Expect = e-105 Identities = 227/460 (49%), Positives = 283/460 (61%), Gaps = 23/460 (5%) Frame = -1 Query: 1968 LEMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVN 1789 L ME EDVI L ASS +N E+ D +P + + Q D E ++S+G S ++ N Sbjct: 522 LYMEAEDVIDLLASSPSGCEEENNEMPDRDGEPGKSDFQPNDS-EKKEDDSNGESMEL-N 579 Query: 1788 EDNQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF--------QDSCPQTR-- 1639 E N + D E + +V+ D+ + +G AE Q+ C + Sbjct: 580 ELNVEIEDGQLIEEGEVGKDVVDDSNVNVEGTTTVELAETIVESDSRIHVQNGCLEVGNR 639 Query: 1638 ---------ANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQN 1486 +S+SG KR R+ DE+QPSV VIY SLTR SK+KLEELLQQWS+W AQ Sbjct: 640 SPNHNRMKDVSSISGV-KRARMTLDEEQPSVHVIYNSLTRASKQKLEELLQQWSEWQAQF 698 Query: 1485 CSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSVPLYD 1309 SS+ND G+ GE+T+FPA++VG K A+ +++ + Q NK I D +S PLYD Sbjct: 699 GSSSNDPNEGIEFGEQTFFPAIRVGKAKGPAVVIFLDRQPKQQQNKNFIPSDSHSTPLYD 758 Query: 1308 RGYXXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKV 1132 RGY LE E +D SRCFNCGSYSHSLK+CPKPRD AVNNARKQHK Sbjct: 759 RGYALGLTSGDGSSNLEGGLEIIDDASRCFNCGSYSHSLKECPKPRDKDAVNNARKQHKS 818 Query: 1131 RRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPP 952 +RNQN+A+RNP RYYQN+ GGKYDGL+PG L+AETR+LLGLGELDPPPWL+RMRE+GYPP Sbjct: 819 KRNQNSASRNPMRYYQNSAGGKYDGLRPGALDAETRQLLGLGELDPPPWLHRMRELGYPP 878 Query: 951 GYL--EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADE 778 GYL E DDQPSGITI+ + +K + EFPG+NAPIPENADE Sbjct: 879 GYLDSEDDDQPSGITIYADGEIKEGQEDGEIIETGRPASKRKMTTEFPGINAPIPENADE 938 Query: 777 ERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 WAA SS SR+ S+ R N SE ISRGR+HE QRW Sbjct: 939 RLWAARPSSS-DSSRDRSHHRLNHHSESISRGRYHE-QRW 976 >ref|XP_002525972.1| nucleic acid binding protein, putative [Ricinus communis] gi|223534704|gb|EEF36396.1| nucleic acid binding protein, putative [Ricinus communis] Length = 693 Score = 385 bits (989), Expect = e-104 Identities = 232/534 (43%), Positives = 294/534 (55%), Gaps = 98/534 (18%) Frame = -1 Query: 1971 MLEMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQ-------------------S 1849 +L METED+ISLP S++ +G +N EL+ P E Q Sbjct: 36 ILSMETEDMISLPDSTNSGDGIENNELDQPESGPGEAESQPSNYEAEEGMIDGHNMGLNE 95 Query: 1848 LDV-----------LEVNVNE------SHGSSTDIVNEDN--------------QLLLDA 1762 +D+ LE+N N+ + GS +NE+N + L+DA Sbjct: 96 VDIGNKTETSDPEKLELNQNDFGAEECTKGSKDSELNEENVKTEECSAVQENLGENLVDA 155 Query: 1761 DNKSENQDKLEVIADAGLMEDGNGLCNHAEDA---------------------------- 1666 + + D+ + + G++ + C D Sbjct: 156 VTEEDTIDRDYLFLNQGVVREEGAQCLVETDVDMDLVDSPVMQVNIEVAEAVAVSGNLSS 215 Query: 1665 ------FQDSCPQTRANSLS----------GAFKRPRLAEDEQQPSVQVIYKSLTRDSKR 1534 Q+SC T+ SL KR R+A +EQQPSV V Y SLTR SKR Sbjct: 216 FGFRLNAQNSCLDTQNESLIQNHMMKGGHVSGVKRARIAYNEQQPSVHVTYNSLTRASKR 275 Query: 1533 KLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ- 1357 KLEELLQQWS+WH Q SS+ D + SGEETYFPAL VG +K SA+SFW+EN+T Q Sbjct: 276 KLEELLQQWSEWHVQRGSSSQDLNEVLESGEETYFPALCVGTEKSSAVSFWIENQTKKQL 335 Query: 1356 NKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLERVRERA-EDSRCFNCGSYSHSLKDCPK 1180 N +LI D +SVPLYDRG+ +E E E +RCFNCGSYSH+LK+CPK Sbjct: 336 NNDLISSDSDSVPLYDRGFAIGLTSTDGPSNVEGGLEIVNEAARCFNCGSYSHALKECPK 395 Query: 1179 PRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGEL 1000 PR+N AVNNARKQHK +RNQNA +RN TRYYQ++ GGKY+GLKPG+L+AETR+LLGLGEL Sbjct: 396 PRNNAAVNNARKQHKSKRNQNAGSRNGTRYYQSSSGGKYEGLKPGSLDAETRRLLGLGEL 455 Query: 999 DPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKS 826 DPPPWLNRMRE+GYPPGYL+ D DQPSGI IF + P+K + Sbjct: 456 DPPPWLNRMRELGYPPGYLDPDDEDQPSGIIIFADGDIKDEQEDGEIIETENPDPPRKMA 515 Query: 825 VEFPGVNAPIPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 664 VEFPG+NAPIPENADE W +G S RN + + + SE ISR HHEQ+ Sbjct: 516 VEFPGINAPIPENADERLW-ETGPSSYNSFRNRPFRKSDHSSETISRWHHHEQR 568 >ref|XP_006373469.1| hypothetical protein POPTR_0017s14060g [Populus trichocarpa] gi|550320291|gb|ERP51266.1| hypothetical protein POPTR_0017s14060g [Populus trichocarpa] Length = 615 Score = 380 bits (977), Expect = e-102 Identities = 225/501 (44%), Positives = 290/501 (57%), Gaps = 66/501 (13%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQN------------------------- 1858 MET+D+I LP S +N+EL+ S PSE Sbjct: 1 METDDMIGLPGSIDFGYKNENDELSKSDFGPSESRSQPCSNDGKESKDDEEGLGLCEGVV 60 Query: 1857 ----------CQSLDVLEVNVNESHGSSTDIVNEDNQLL-------------LDADNKSE 1747 C L+V + E+ +++V E+ + +D Sbjct: 61 GNEEGIVDPGCSGLNVGDTGTEEAATDQSNLVLEERDIGSKGVQFAVETEADMDLVVSPV 120 Query: 1746 NQDKLEVIADAGLME---DGNGLCNHAEDAFQDSCPQTRANSLS----------GAFKRP 1606 Q L+V+ DA ++ D + + + ED F D T+ NSL KR Sbjct: 121 RQVNLDVV-DAVIVSKKPDISSIIGNVEDCFLD----TQNNSLVQQGKVDGSHISGVKRK 175 Query: 1605 RLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFP 1426 R+A DEQQPSV V+Y SLTR K+KLEELLQQWS+WHAQ +S++DS + SGE+TYFP Sbjct: 176 RMAYDEQQPSVHVMYNSLTRSGKQKLEELLQQWSEWHAQQ-NSSHDSDEMLQSGEDTYFP 234 Query: 1425 ALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLERVR 1249 AL+VG +K SA+SFW+EN+ Q+ +LI N VPLYDRGY +E Sbjct: 235 ALRVGMEKSSAVSFWIENQARKQQDNDLILQHSNFVPLYDRGYVLGLTSADGPINVEGGL 294 Query: 1248 ERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPG 1072 E + + RCFNCG+Y+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ++ G Sbjct: 295 EIVDAAARCFNCGAYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQSSSG 354 Query: 1071 GKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXX 898 GKYDGLKPG+L+ ETR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D DQPSGITIF Sbjct: 355 GKYDGLKPGSLDTETRQLLGLGELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFDDG 414 Query: 897 XXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERW-AASGSSDLYMSRNHSY 721 H + P+K SVEFPG+NAPIPENA++ W SSD + R+ S Sbjct: 415 DVEEEQEDGEIMETDHPEPPRKMSVEFPGINAPIPENANQRFWEVGPSSSDPF--RHRSR 472 Query: 720 SRYNRPSEPISRGRHHEQQRW 658 R N SE R HHEQ+++ Sbjct: 473 HRSNHSSEATGRWHHHEQRQY 493 >gb|EOY04289.1| Proline-rich spliceosome-associated family protein / zinc knuckle family protein, putative isoform 1 [Theobroma cacao] gi|508712393|gb|EOY04290.1| Proline-rich spliceosome-associated family protein / zinc knuckle family protein, putative isoform 1 [Theobroma cacao] Length = 595 Score = 378 bits (971), Expect = e-102 Identities = 225/477 (47%), Positives = 287/477 (60%), Gaps = 44/477 (9%) Frame = -1 Query: 1962 METEDVISLPASS--SPANGGDNEELNDSGCQPSEQ--NCQSLD------VLEVNVNESH 1813 ME +D+I+LPASS S + G+ +L+D CQ Q N ++ D LEVN Sbjct: 1 MEGQDIINLPASSNSSGSESGELRDLDDGPCQVGSQPNNAETKDGEGKVESLEVNEGVIK 60 Query: 1812 GSSTDIVNE---DNQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF------- 1663 +D++ E DN L+ D+ S+ Q E+ + E GL A A+ Sbjct: 61 NPQSDLIVETEVDNTLV---DDSSDMQISDEITETVRVKETLEGLSFGAHSAYFTADEKM 117 Query: 1662 ---QDSCP--QTRANSLSGA--------------FKRPRLAEDEQQPSVQVIYKSLTRDS 1540 S P + R ++ +G+ KRPR+ D+QQPSV ++Y LTR S Sbjct: 118 DGLSSSVPTKKRRLDAQNGSPIQNDMMDGIPISGVKRPRMTFDDQQPSVHIVYNFLTRAS 177 Query: 1539 KRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSG 1360 K+KLEELLQ+WS+W A++ + + D + SGEETYFPAL+VGA+KPS +SFW++N+T Sbjct: 178 KQKLEELLQKWSEWQAEHGTLSPDENELIESGEETYFPALRVGAEKPSTVSFWIDNQTRN 237 Query: 1359 -QNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDC 1186 ++ E+I D N VPLYDRGY LE E +D SRCFNCGSYSHSLK C Sbjct: 238 PRDTEIITLDSNIVPLYDRGYAMCLTSADGSSNLEGGLEIKDDASRCFNCGSYSHSLKQC 297 Query: 1185 PKPRDNVAVNNARKQH-KVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGL 1009 PKPRDN+AVN ARKQH K +RNQN +RN RYYQ++ GGKYD LKPG L A+TR+LLGL Sbjct: 298 PKPRDNLAVNAARKQHYKSKRNQNTGSRNAIRYYQSSQGGKYDDLKPGVLSADTRQLLGL 357 Query: 1008 GELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPK 835 GE DPPPWLNRMREIGYP GYL D DQPSGITI+ H + K Sbjct: 358 GEFDPPPWLNRMREIGYPTGYLAPDDEDQPSGITIYADGETNEEQEDGEITEVVHAEPEK 417 Query: 834 KKSVEFPGVNAPIPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 664 K +VEFPG+NAPIP ADE+ W A GSS SR+ S+ R + SEP SRG HHE++ Sbjct: 418 KMTVEFPGINAPIPVEADEKLW-APGSSSSESSRSRSHRRLHHSSEPGSRGHHHERR 473 >ref|XP_004304149.1| PREDICTED: uncharacterized protein LOC101295545 [Fragaria vesca subsp. vesca] Length = 553 Score = 369 bits (947), Expect = 3e-99 Identities = 219/461 (47%), Positives = 280/461 (60%), Gaps = 26/461 (5%) Frame = -1 Query: 1962 METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV-------NESHGSS 1804 ME+ED I+LP S +G ++ ELND E++ + +D +N N SS Sbjct: 1 MESEDFIALPDSGD--SGFEDGELND------EKDAKEVDAQPINSEDKEDKPNSERESS 52 Query: 1803 TDIVNED--------NQLL-LDADNKSENQDKLEVIADAGLMEDG--NGLCNHAEDAFQD 1657 + N+L+ D+D + E+ + L + +G E N + +++ Sbjct: 53 AQREESECIPEASPANELVDNDSDMEIEDINNLPALTSSGPKEGDVQNVNIDLHSTLYEN 112 Query: 1656 SCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSS 1477 +A KR R DEQQ SV+V Y LTR SK KLEELLQQWS+WHA++ SS Sbjct: 113 GHLAVQAKR---GVKRARTTVDEQQASVRVTYSHLTRASKHKLEELLQQWSEWHAKHVSS 169 Query: 1476 ANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGY 1300 + D+ + SGEET FPAL VG ++ S +SFWM+N+T + QN E + D N PLYDRGY Sbjct: 170 SQDTPQVLESGEETLFPALHVGTERTSGVSFWMDNQTGTAQNMESLPLDSNYAPLYDRGY 229 Query: 1299 XXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1123 E E +D SRCFNCGSY+H+L++CPKPRD+VAVN ARKQ K+++N Sbjct: 230 ALGLTVAGSSTNQEGGLEIIDDASRCFNCGSYNHALRECPKPRDHVAVNKARKQLKIKKN 289 Query: 1122 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 943 Q +RN TRYYQN+P GKYDGL+PG LEAETRKLLGLGELDPPPWLNRMREIGYPPGYL Sbjct: 290 QTPNSRNSTRYYQNSPAGKYDGLRPGALEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 349 Query: 942 EVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAP---KKKSVEFPGVNAPIPENADE 778 +VD DQPSGI I+ + P +K +V FPG+NAPIPENADE Sbjct: 350 DVDDEDQPSGIIIYGVEETKGEQEDGEIIETDLPEPPEPRRKMTVGFPGMNAPIPENADE 409 Query: 777 ERWAASGS-SDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 RW S S SD SRNHS++R N EP+SRG H+ +QRW Sbjct: 410 RRWTPSPSVSD--PSRNHSHNRPNHYYEPVSRG-HYREQRW 447 >gb|EXB72259.1| Zinc finger CCHC domain-containing protein 8 [Morus notabilis] Length = 660 Score = 367 bits (941), Expect = 1e-98 Identities = 217/466 (46%), Positives = 273/466 (58%), Gaps = 30/466 (6%) Frame = -1 Query: 1965 EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV---NESHGSSTD- 1798 +ME ED+ +P ++ G +N ++ P + QS+ + + ++ +E+ S+ D Sbjct: 80 DMEIEDLNGVPVLTAAGYGLENNGIDSFSNDPRQAGSQSVTLADKDMKTTSENLVSNMDG 139 Query: 1797 IVNEDNQLLLDA-------DNKSENQDKLEVIADAGLMEDGNGL--CNHAEDAFQDS--- 1654 ED L A DN S Q + + A + E C A QD Sbjct: 140 AQREDGTWKLKAIQEKDLADNSSLLQVNVNLTDTAAVAEASKTTFGCEIGRVAVQDKISI 199 Query: 1653 --------CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQW 1498 C N KR R+ +EQQPSV V + LTR SK KLEEL+QQWS+W Sbjct: 200 RTKKREGYCILCLVNYTISGVKRSRVMFEEQQPSVCVKFNFLTRSSKYKLEELMQQWSEW 259 Query: 1497 HAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSV 1321 AQ+ SS+ D + SGEETYF AL +G +K S++ FW++ +T Q N EL D NSV Sbjct: 260 QAQHHSSSQDPPEALESGEETYFSALHIGLEKASSVPFWIDKQTGKQQNNELSPLDCNSV 319 Query: 1320 PLYDRGYXXXXXXXXXXXXLERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARK 1144 PLYDRG+ +E E ED+ RCFNCGSY+H+LK+CPKPRDNVAVNNARK Sbjct: 320 PLYDRGFALGLTSDGGSSNVEGGLEIVEDAVRCFNCGSYNHALKECPKPRDNVAVNNARK 379 Query: 1143 QHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREI 964 Q K +RNQN ++RNPTRYYQN+P GKYDGLKPGTL+ ETRKLLGL ELDPPPWL RMREI Sbjct: 380 QLKSKRNQNPSSRNPTRYYQNSPAGKYDGLKPGTLDPETRKLLGLRELDPPPWLGRMREI 439 Query: 963 GYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNK-APKKKSVEFPGVNAPIP 793 GYPPGYL+ D DQPSGITI+ + N+ P+K +VEFPG+N PIP Sbjct: 440 GYPPGYLDPDEEDQPSGITIYADGEGNKAEQEDGEIIEADNREPPRKMTVEFPGINGPIP 499 Query: 792 ENADEERW-AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 ENAD W AA SSD+Y RN R N SEP R HH ++RW Sbjct: 500 ENADRRIWTAAPASSDIY--RNRLLRRSNHSSEPTGRS-HHREERW 542 >ref|XP_006593391.1| PREDICTED: uncharacterized protein LOC100527170 isoform X1 [Glycine max] gi|571495821|ref|XP_006593392.1| PREDICTED: uncharacterized protein LOC100527170 isoform X2 [Glycine max] Length = 517 Score = 364 bits (934), Expect = 9e-98 Identities = 200/400 (50%), Positives = 263/400 (65%), Gaps = 10/400 (2%) Frame = -1 Query: 1833 VNVNESHGSSTDIVNEDNQLLLDADNKSENQDKLEVIAD---AGLMEDGNGLCNHAEDAF 1663 +N +ES +D + E ++L D ++ KL V+ + G++ + NG C ED Sbjct: 7 MNPSESSNLGSDSL-EKEKILEDETEDLQDGLKLSVVTEEMSGGVLAE-NG-CISLEDGS 63 Query: 1662 QDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNC 1483 +T S+SGA KR R+ DE QPSV Y SLTR S++KL+ELLQQWS+WHA++ Sbjct: 64 LKRSIETVETSVSGA-KRARITVDEDQPSVHFTYNSLTRASRQKLQELLQQWSEWHAKHV 122 Query: 1482 SSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDR 1306 S+ND+ + SGEET+FPAL VG +K SA+SFWMEN+T +NK+ I NSVPLYDR Sbjct: 123 LSSNDASEVLESGEETFFPALHVGLEKTSAVSFWMENQTRKDKNKDFIPLADNSVPLYDR 182 Query: 1305 GYXXXXXXXXXXXXLERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVR 1129 GY ++ E + + RCFNCGSY+HSL++CP+PRDN AVNNAR +HK R Sbjct: 183 GYTLGLTSADGSSNVDGGLEIIDAAARCFNCGSYNHSLRECPRPRDNTAVNNARNKHKSR 242 Query: 1128 RNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPG 949 RNQN+++RNPTRYYQN+P GKYDGL+PG L+ TR+LLGLGELDPPPWLNRMRE+GYPPG Sbjct: 243 RNQNSSSRNPTRYYQNSPAGKYDGLRPGALDDATRQLLGLGELDPPPWLNRMRELGYPPG 302 Query: 948 YLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEE 775 YL+VD DQPSGITI+ + +K +KK+V+FPG+NAPIP+NADE Sbjct: 303 YLDVDDEDQPSGITIYTDREIADQEDGEIMEADA-SKPKRKKTVKFPGINAPIPDNADER 361 Query: 774 RW---AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 664 W A SSD+ + + R N ++ SRG H E + Sbjct: 362 LWGTRAGPSSSDISRNLSLPQHRSNYSTDYGSRGYHREHR 401 >gb|ADN34281.1| nucleic acid binding protein [Cucumis melo subsp. melo] Length = 610 Score = 362 bits (930), Expect = 3e-97 Identities = 212/450 (47%), Positives = 270/450 (60%), Gaps = 14/450 (3%) Frame = -1 Query: 1965 EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV----NESHGSSTD 1798 +ME ED+ +LP S + +N E+ S + N ++L N NE H D Sbjct: 81 DMEIEDLNNLPDFSKTRSRSENSEIL-SKAEDLPVNSADGNILPSNEPLQQNELHTRYED 139 Query: 1797 IVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQTRANSLS 1624 + + ++Q DN S ++ ++ G+ D N L + A + Sbjct: 140 VCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNSGAPMENGSATSHHHGGPRI 199 Query: 1623 GAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGS-- 1459 KRPR+A DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ S + D Sbjct: 200 SGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLSRDDKDTE 259 Query: 1458 GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXXXXXXX 1279 + SGEET+FPAL VG K SA++FWM+N+ S Q + + D NSVPLYDRG+ Sbjct: 260 NLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPIDDNSVPLYDRGFTLGLTSA 319 Query: 1278 XXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARN 1102 +E ++ +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K + N+A+RN Sbjct: 320 NDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQHNSASRN 377 Query: 1101 PTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL--EVDDQ 928 TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL E +DQ Sbjct: 378 STRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPEDEDQ 437 Query: 927 PSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERWAASGSSD 748 PSGITI+ + K KK SVEFPG+NAPIPENADE WA SS Sbjct: 438 PSGITIY-ADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPIPENADERLWAPEPSSS 496 Query: 747 LYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 + RN S R N E +RG H QQRW Sbjct: 497 -GLPRNRSNQRLNHYPEYDTRGNDHHQQRW 525 >ref|XP_004169819.1| PREDICTED: uncharacterized protein LOC101230973 [Cucumis sativus] Length = 610 Score = 360 bits (925), Expect = 1e-96 Identities = 212/457 (46%), Positives = 269/457 (58%), Gaps = 21/457 (4%) Frame = -1 Query: 1965 EMETEDVISLPASSSPANGGDNEEL-----------NDSGCQPSEQNCQSLDVLEVNVNE 1819 +ME ED+ +LP S + +N E+ D PS + Q NE Sbjct: 81 DMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGNILPSSEPLQQ--------NE 132 Query: 1818 SHGSSTDIVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQ 1645 H D+ + +++ DN S + ++ G+ D N L + A + Sbjct: 133 FHTRYEDVCHVESKNFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNSGAPMENGSATSH 192 Query: 1644 TRANSLSGAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 1474 KRPR+A DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ S + Sbjct: 193 HHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLS 252 Query: 1473 NDSGS--GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGY 1300 D + SGEET+FPAL VG K SA++FWM+N+ S Q + + D NSVPLYDRG+ Sbjct: 253 CDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQNFVPIDDNSVPLYDRGF 312 Query: 1299 XXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1123 +E ++ +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K + Sbjct: 313 TLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQ 370 Query: 1122 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 943 N+A+RN TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL Sbjct: 371 HNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYL 430 Query: 942 --EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERW 769 E +DQPSGITI+ + K KKKSVEFPG+NAPIPENADE W Sbjct: 431 DPEDEDQPSGITIY-ADEKTDEQEDGEITEAEYRKPRKKKSVEFPGINAPIPENADERLW 489 Query: 768 AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 A S+ +SRN S R N E +RG H QQRW Sbjct: 490 APEPSNS-GLSRNRSNQRLNHYPEYDTRGNDHHQQRW 525 >ref|XP_004141493.1| PREDICTED: uncharacterized protein LOC101212144 [Cucumis sativus] Length = 610 Score = 360 bits (925), Expect = 1e-96 Identities = 212/457 (46%), Positives = 268/457 (58%), Gaps = 21/457 (4%) Frame = -1 Query: 1965 EMETEDVISLPASSSPANGGDNEEL-----------NDSGCQPSEQNCQSLDVLEVNVNE 1819 +ME ED+ +LP S + +N E+ D PS + Q NE Sbjct: 81 DMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGNILPSSELLQQ--------NE 132 Query: 1818 SHGSSTDIVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQ 1645 H D+ + +++ DN S + ++ G+ D N L + A + Sbjct: 133 LHTRYEDVCHVESKKFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNSGAPMENGSATSH 192 Query: 1644 TRANSLSGAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 1474 KRPR+A DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ S + Sbjct: 193 HHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLS 252 Query: 1473 NDSGS--GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGY 1300 D + SGEET+FPAL VG K SA++FWM+N+ S Q + + D NSVPLYDRG+ Sbjct: 253 CDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQNFVPIDDNSVPLYDRGF 312 Query: 1299 XXXXXXXXXXXXLERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1123 E ++ +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K + Sbjct: 313 TLGLTSANDSSNAEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQ 370 Query: 1122 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 943 N+A+RN TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL Sbjct: 371 HNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYL 430 Query: 942 --EVDDQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERW 769 E +DQPSGITI+ + K KKKSVEFPG+NAPIPENADE W Sbjct: 431 DPEDEDQPSGITIY-ADEKTDEQEDGEITEAEYRKPRKKKSVEFPGINAPIPENADERLW 489 Query: 768 AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 658 A S+ +SRN S R N E +RG H QQRW Sbjct: 490 APEPSNS-GLSRNRSNQRLNHYPEYDTRGNDHHQQRW 525 >ref|XP_002305958.1| proline-rich spliceosome-associated family protein [Populus trichocarpa] gi|222848922|gb|EEE86469.1| proline-rich spliceosome-associated family protein [Populus trichocarpa] Length = 531 Score = 359 bits (921), Expect = 3e-96 Identities = 202/431 (46%), Positives = 268/431 (62%), Gaps = 16/431 (3%) Frame = -1 Query: 1902 NEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNEDNQLLLDADNKSENQDKLEVI 1723 + +L + + S+ + +S+++ E V E N+ + D + +N + +E+ Sbjct: 19 HSQLGSNETKESKDDEESVELNEGAVGNDERMKNGESVELNEGAVGNDERMKNGESVELN 78 Query: 1722 ADAGLMEDGN----------GLCNHAEDAFQDSCPQTRANSLSGAFKRPRLAEDEQQPSV 1573 A +G G+ + E +AN +SG KR R+ +E+QPSV Sbjct: 79 EGAVGNNEGTKNGEGFELNVGVIGNDEVTVDPGYSALKAN-VSGV-KRKRITYNEEQPSV 136 Query: 1572 QVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSA 1393 V+Y SLTR SK+KLEELLQQWS+WHAQ SS++DS + SGE+TYFPAL++G K SA Sbjct: 137 HVMYNSLTRASKKKLEELLQQWSEWHAQQNSSSHDSDEMLQSGEDTYFPALRIGMVKSSA 196 Query: 1392 MSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLERVRERAEDS-RCFN 1219 ++FW+EN+T Q+ +I N VPLYDRGY +ER E D+ RC+N Sbjct: 197 VTFWIENQTRKQQDNAIIPLQSNYVPLYDRGYALGLTSADGPINIERGLEIVGDAARCYN 256 Query: 1218 CGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTL 1039 C SY+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ++ GGKYDGLKPG+L Sbjct: 257 CASYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQSSSGGKYDGLKPGSL 316 Query: 1038 EAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXX 865 + ET+KLLGLGELDPPPWLNRM+E+GYPPGYL+ D DQPSGITIF Sbjct: 317 DTETQKLLGLGELDPPPWLNRMQELGYPPGYLDPDDEDQPSGITIFADGDVNEEQEDGEI 376 Query: 864 XXXSHNKAPKKK-SVEFPGVNAPIPENADEERW-AASGSSDLYMSRNHSYSRYNRPSEPI 691 P++K SVEFPG+NA IPENAD+ W SSD + R+ S R SE Sbjct: 377 TETDPPPEPQRKMSVEFPGINAAIPENADQRLWEVGPTSSDPW--RHRSQHRLKYSSEAT 434 Query: 690 SRGRHHEQQRW 658 R HHEQ+++ Sbjct: 435 GRWHHHEQRQY 445 >ref|XP_006603953.1| PREDICTED: uncharacterized protein LOC100805423 isoform X1 [Glycine max] gi|571554248|ref|XP_006603954.1| PREDICTED: uncharacterized protein LOC100805423 isoform X2 [Glycine max] Length = 519 Score = 356 bits (914), Expect = 2e-95 Identities = 199/399 (49%), Positives = 264/399 (66%), Gaps = 12/399 (3%) Frame = -1 Query: 1824 NESHGSSTDIVNEDNQLLLDADNKSENQD--KLEVIAD---AGLMEDGNGLCNHAEDAFQ 1660 +E+ +D + ++N L D K + QD KL+V+ + GL+ + NG C ED Sbjct: 12 SENSNLGSDSLEKENIL---EDEKEDLQDGLKLKVVTEEVSGGLLAE-NG-CISLEDGSL 66 Query: 1659 DSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCS 1480 +T S+SGA KR R+ DE QPSV Y SLTR S++KL+ELLQ+WS WHA++ S Sbjct: 67 KRSLETVGTSVSGA-KRARITVDEYQPSVHFTYNSLTRASRQKLQELLQKWSAWHAKHVS 125 Query: 1479 SANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRG 1303 S++D+ + SGEET+FPAL VG +K SA+SFWMEN+T + +NK+ I N+VPLYDRG Sbjct: 126 SSSDASEVLESGEETFFPALHVGLEKTSAVSFWMENQTRNDKNKDFIPLADNTVPLYDRG 185 Query: 1302 YXXXXXXXXXXXXLERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1126 Y ++ E + + RCFNCGSY+HSL++CP+PRDN+AVNNAR + K RR Sbjct: 186 YALGLTSADGSSNVDGGLEIIDAAARCFNCGSYNHSLRECPRPRDNIAVNNARDKLKSRR 245 Query: 1125 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 946 NQN+++R+PTRYYQN+P GKYDGL+PG+L+ TRKLLGL ELDPPPWLNRMRE+GYPPGY Sbjct: 246 NQNSSSRHPTRYYQNSPAGKYDGLRPGSLDDATRKLLGLRELDPPPWLNRMRELGYPPGY 305 Query: 945 LEVD--DQPSGITIFXXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEER 772 L+VD DQPSGITIF + +K +KK+V+FPG+NAPIPE ADE Sbjct: 306 LDVDNEDQPSGITIFTDSEIADQEDGEIMEANA-SKPKRKKTVKFPGINAPIPEKADERL 364 Query: 771 W---AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 664 W A SSD+ + + R N ++ SRG H E + Sbjct: 365 WGTRAGPSSSDISRNLSLPQHRSNYSTDYGSRGYHREHR 403 >ref|XP_004514436.1| PREDICTED: uncharacterized protein LOC101500938 isoform X1 [Cicer arietinum] gi|502168650|ref|XP_004514437.1| PREDICTED: uncharacterized protein LOC101500938 isoform X2 [Cicer arietinum] Length = 532 Score = 354 bits (909), Expect = 8e-95 Identities = 216/442 (48%), Positives = 275/442 (62%), Gaps = 10/442 (2%) Frame = -1 Query: 1959 ETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNEDN 1780 E E + + +S +G + E +D + S+ SLD +++ V +S T IVN D Sbjct: 3 EEEHMNEVLKNSVGISGSEAENKSDKNMEISD----SLDEVKM-VEKSSLLETSIVNTDL 57 Query: 1779 QLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQTRANSLSGAFKRPRL 1600 QL + LE+ + E+ +++ S R + KR R+ Sbjct: 58 QLEVG----------LELTDTVSISEEEGVRGTVHDESLNGSIEIDRRGT-----KRARI 102 Query: 1599 A-EDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPA 1423 +DE QPSV IYKSLTR SK+KLEELLQQWS WHA++ SS+ND + SGEET+FPA Sbjct: 103 TVDDENQPSVHFIYKSLTRASKKKLEELLQQWSHWHAKHVSSSNDPSEVLESGEETFFPA 162 Query: 1422 LQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLERVRE 1246 L VG + SA+SFWMEN+T + NK +I DG+SVPLYDRGY + E Sbjct: 163 LCVGHETTSAVSFWMENQTVNDTNKYVIPIDGDSVPLYDRGYALGLTSSSNNA--DGGLE 220 Query: 1245 RAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGG 1069 +D SRCFNCGSY+H+L++CP+PRDNVAVNNARKQ K RRNQN+++R+PTRYYQ++P G Sbjct: 221 IIDDPSRCFNCGSYNHALRECPRPRDNVAVNNARKQLKSRRNQNSSSRHPTRYYQSSPAG 280 Query: 1068 KYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXX 895 KYDGLKPG L+ TR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D D+PSGITIF Sbjct: 281 KYDGLKPGALDDATRQLLGLGELDPPPWLNRMRELGYPPGYLDADDEDEPSGITIF--TD 338 Query: 894 XXXXXXXXXXXXXSHNKAPKKK-SVEFPGVNAPIPENADEERWAA----SGSSDLYMSRN 730 + + PK+K SVEFPG+NAPIPE ADE WAA SSD+ S+N Sbjct: 339 KDMEEQEDGEIVGADSSQPKRKMSVEFPGINAPIPEKADERLWAARVGPPSSSDI--SKN 396 Query: 729 HSYSRYNRPSEPISRGRHHEQQ 664 S R S SRG H EQ+ Sbjct: 397 WS---QQRSSSYGSRGHHREQR 415 >ref|XP_002329267.1| predicted protein [Populus trichocarpa] Length = 289 Score = 354 bits (908), Expect = 1e-94 Identities = 175/286 (61%), Positives = 213/286 (74%), Gaps = 4/286 (1%) Frame = -1 Query: 1614 KRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEET 1435 KR R+A DEQQPSV V+Y SLTR K+KLEELLQQWS+WHAQ +S++DS + SGE+T Sbjct: 4 KRKRMAYDEQQPSVHVMYNSLTRSGKQKLEELLQQWSEWHAQQ-NSSHDSNEMLQSGEDT 62 Query: 1434 YFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXLE 1258 YFPAL+VG +K SA+SFW+EN+ Q+ +LI N VPLYDRGY +E Sbjct: 63 YFPALRVGMEKSSAVSFWIENQARKQQDNDLILQHSNFVPLYDRGYVLGLTSADGPINVE 122 Query: 1257 RVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQN 1081 E + + RCFNCG+Y+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ+ Sbjct: 123 GGLEIVDAAARCFNCGAYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQS 182 Query: 1080 TPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIF 907 + GGKYDGLKPG+L+ ETR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D DQPSGITIF Sbjct: 183 SSGGKYDGLKPGSLDTETRQLLGLGELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIF 242 Query: 906 XXXXXXXXXXXXXXXXXSHNKAPKKKSVEFPGVNAPIPENADEERW 769 H + P+K SVEFPG+NAPIPENA++ W Sbjct: 243 DDGDVEEEQEDGEIMETDHPEPPRKMSVEFPGINAPIPENANQRFW 288