BLASTX nr result
ID: Paeonia23_contig00003722
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00003722
         (2284 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   689   0.0
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   629   e-177
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   625   e-176
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   623   e-175
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   614   e-173
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   607   e-171
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   600   e-169
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   570   e-160
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   563   e-157
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   551   e-154
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   532   e-148
ref|XP_002312652.1| RNA recognition motif-containing family prot...   521   e-145
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   493   e-136
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   467   e-129
ref|XP_002315647.1| RNA recognition motif-containing family prot...   455   e-125
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   390   e-105
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   389   e-105
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   383   e-103
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr... 
378 e-102 ref|XP_002889992.1| RNA recognition motif-containing protein [Ar... 327 1e-86 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 689 bits (1777), Expect = 0.0 Identities = 384/663 (57%), Positives = 419/663 (63%), Gaps = 1/663 (0%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMHR 1972 MAEEQLDYEDEE AIS GEGFLQMHR Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 1971 SEAPVRTGIGNGG-LQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAAHFPDQRVGPL 1795 SEAP +G+ GG QA KTDV LEAG SQGL IPGVS EGKYSN HF +++ GP+ Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNP-HFHEKKEGPM 119 Query: 1794 AAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPGKIV 1615 A KGPE+GS ++ DG VSQKGRV EMT D QVRNLGFQGST IP K+G + +++ GKI Sbjct: 120 AVKGPEMGSTSHLDGPSVSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIA 179 Query: 1614 SESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVENGP 1435 +ES P+LNS GGPR PQ+ N+ VNENQIRP+V+NG Sbjct: 180 NESTPVLNSGTGGPRAVPQMLSNQMGMNVNV-----------NRPMVNENQIRPAVDNGA 228 Query: 1434 TMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASACKE 1255 TMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYD AA+ACKE Sbjct: 229 TMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKE 288 Query: 1254 GMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGGMNY 1075 GMNGY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q +Q QGRRPMNDGVGRGGGMN Sbjct: 289 GMNGYIFNGRACVVAFASPQTLKQMGASYMNKTQAQ---SQSQGRRPMNDGVGRGGGMNM 345 Query: 1074 SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXXXXXX 895 GAVG KNMV Sbjct: 346 Q-GGDAGRNYGRGGWGRGGQGILNRGPGGGGPMRGRGGAVGAKNMV--GNTAGVGASGGG 402 Query: 894 XXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTM 715 G+MHPQ MMG+GFDPTYM GM+PSFPAVNTM Sbjct: 403 YGQGLAGPTFGGPAGGLMHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTM 462 Query: 714 
GLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESS 535 GLAGVAPHVNPAFFGR MD HHAGMWTDTSMGGWGG+EHGRRTRESS Sbjct: 463 GLAGVAPHVNPAFFGRGMAANGMGMMGATGMDGHHAGMWTDTSMGGWGGEEHGRRTRESS 522 Query: 534 YGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXXXXXXXXXXXXX 355 YGGDDGAS+YGYGE +HEK RSN A REKERGSER+WSGNS Sbjct: 523 YGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDWSGNSERRHRDEREQDWERSDKD 582 Query: 354 XRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRR 175 RYREEKD YRDHR RERD NED+WDRGQ R + +EDHRSRSRD DYGKRR Sbjct: 583 HRYREEKDGYRDHRQRERDFNNEDDWDRGQSSSRSRSRSRAVADEDHRSRSRDGDYGKRR 642 Query: 174 RLP 166 RLP Sbjct: 643 RLP 645 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 629 bits (1623), Expect = e-177 Identities = 355/670 (52%), Positives = 410/670 (61%), Gaps = 5/670 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+D+ DEE AI GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSN-AAHFPDQR 1807 + RSEAP++ G +G+ GL+AQ+ + E +EAGGSQGL+IPGVS +GK+ N +A +P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P A PE+ S +Y G+ +SQKG V E T D QV+NLGFQG T +K G+D + +P Sbjct: 121 EQP-AVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 KI ++ A LNS GGP+G P + N +NENQ++P + Sbjct: 180 
QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNV-------------NHPVMNENQVQPPI 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENGPTMLFVGELHWWTTD+ELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEFYDP +A+ Sbjct: 227 ENGPTMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 CKEGMNGY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q Q Q QGRRP N+G+GRGG Sbjct: 287 VCKEGMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQA-QPQGRRP-NEGLGRGG 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 +NY G VG KNMV Sbjct: 345 NLNYQ---SGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAGVGNGA 401 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 GMMHPQ MMGAGFDPTYM GM+PSFPA Sbjct: 402 NGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPA 461 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VNTMGLAGVAPHVNPAFFGR MD HAGMWTD SMGGWGGDEHGRRT Sbjct: 462 VNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRT 521 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 RESSYGG+DGASEYGYG+A+HEKG RS+ A REKER SEREWSGNS Sbjct: 522 RESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDR 580 Query: 375 XXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRD 196 RYREEKDSYR+HRHRERDL +D+WDRGQ MPEE+HRSRSRD Sbjct: 581 SEREHREHRYREEKDSYREHRHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRD 640 Query: 195 VDYGKRRRLP 166 VDYGK+RRLP Sbjct: 641 VDYGKKRRLP 650 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 625 bits (1613), Expect = e-176 Identities = 360/671 (53%), Positives = 407/671 (60%), Gaps = 6/671 (0%) Frame = -2 Query: 2160 
MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+D+ DEE AI GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQR 1807 + RSEAP + G +G+ GLQAQK + E EAGGSQGL+IPGVS +GK+ N A +P+Q Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQD 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P ++ PE+GS +Y G +SQKGRV E T D QV+N+GFQG + HK G+D + +P Sbjct: 121 GQPAVSR-PEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVP 179 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 KI + A LNS GGP+GAP + N ++ENQ+RP + Sbjct: 180 QKIANVPAQSLNSGTGGPQGAPHVPPNQMGLNV-------------NHPMISENQVRPPI 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENGPTMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDP +A+ Sbjct: 227 ENGPTMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACKEGM+GY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q Q Q QGRRP NDG+GRGG Sbjct: 287 ACKEGMDGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQA-QPQGRRP-NDGLGRGG 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMV-XXXXXXXXX 910 MNY G VG KNMV Sbjct: 345 NMNYQ---SGDAGRNYGRGGWGRGGQGVVNRSGVGGPMRGRGGVGVKNMVGSSAGVGNGA 401 Query: 909 XXXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFP 730 GMMHPQ MMGAGFDPTYM GM+PSFP Sbjct: 402 NGGAAYGQGPAGPPFGGPAGGMMHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGMLPSFP 461 Query: 729 AVNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRR 550 AVNT+GLAGVAPHVNPAFFGR MD H GMWTDTSMGGWGGDEHGRR Sbjct: 462 AVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGGPGMDGPHVGMWTDTSMGGWGGDEHGRR 521 Query: 549 TRESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXX 379 TRESSYGG+DGASEYGYG+A+HEKG RS+ A REKER S+REWSGNS Sbjct: 522 TRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSDREWSGNSDRRHRDEKERDWD 580 Query: 378 
XXXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSR 199 RYREEKDSYR+HRHRERDL +D+ DRGQ MPEE RSRSR Sbjct: 581 RSEREHREHRYREEKDSYREHRHRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSR 640 Query: 198 DVDYGKRRRLP 166 DVDYGKRRRLP Sbjct: 641 DVDYGKRRRLP 651 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 623 bits (1606), Expect = e-175 Identities = 358/668 (53%), Positives = 395/668 (59%), Gaps = 6/668 (0%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMHR 1972 MAEEQ+DYEDEE AIS EGFLQMHR Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 1971 SEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQRVGP 1798 SEAP+ G +GNGGLQAQKTDV E+ ++AG SQ IPGVS +GKYS+A A FP+Q+ P Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQGQP 120 Query: 1797 LAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPGKI 1618 AK PELGS Y GST +P G DS+++ GK Sbjct: 121 PVAKEPELGSTGY---------------------------GSTTMPPNVGGDSSDITGKT 153 Query: 1617 VSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVENG 1438 ES P +NS GP G Q+ N+ NENQIRP VENG Sbjct: 154 ALESVPSMNSGTAGPTGVTQMPTNQISIKVNA-----------NRPMFNENQIRPPVENG 202 Query: 1437 PTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASACK 1258 TMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF+DP AA+ACK Sbjct: 203 STMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACK 262 Query: 1257 EGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGGMN 1078 EGM+GYLFNGRACVVAFASPQTLKQMGASY++KSQ Q Q +QQ GRRPMN+GVGRGGG+N Sbjct: 263 EGMDGYLFNGRACVVAFASPQTLKQMGASYLSKSQGQTQ-SQQPGRRPMNEGVGRGGGVN 321 Query: 1077 YSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXXXXX 898 Y GA+G KNM Sbjct: 322 YQTGDTGGRNFGRGGWGRGGQGVANRGPGGGGPMRGRGGAMGAKNMA-GNPAGVGTGANG 380 Query: 897 XXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNT 718 GMM+PQ MMGAGFDPTYM GM+ 
SFPAVNT Sbjct: 381 GYGQGLAGPGFGGPVGGMMNPQGMMGAGFDPTYMGRGGGYGGFPGPAFPGMLSSFPAVNT 440 Query: 717 MGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRES 538 MGLAGVAPHVNPAFFGR MD HHAGMW D SMGGWGGDEHGRRTRES Sbjct: 441 MGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMDGHHAGMWNDPSMGGWGGDEHGRRTRES 500 Query: 537 SYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS----XXXXXXXXXXXXX 370 SYGGDDGASEYGYGEA+HEKG RSNA RE+ERGSER+WSGNS Sbjct: 501 SYGGDDGASEYGYGEANHEKGGRSNAPSRERERGSERDWSGNSERRHRDEREQDWDRSER 560 Query: 369 XXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVD 190 RY+EEKDSYRDHR RERD+ ED+WDRGQ + MPE+DHRSRSRDVD Sbjct: 561 GEHREHRYKEEKDSYRDHRQRERDVGYEDDWDRGQSSSRPRSRSKAMPEDDHRSRSRDVD 620 Query: 189 YGKRRRLP 166 YGKRRRLP Sbjct: 621 YGKRRRLP 628 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 614 bits (1583), Expect = e-173 Identities = 348/667 (52%), Positives = 404/667 (60%), Gaps = 5/667 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+D+ DEE AI GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSN-AAHFPDQR 1807 + RSEAP++ G +G+ GL+AQ+ + E +EAGGSQGL+IPGVS +GK+ N +A +P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P A PE+ S +Y G+ +SQKG V E T D QV+NLGFQG T +K G+D + +P Sbjct: 121 EQP-AVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 KI ++ A LNS GGP+G P + N +NENQ++P + Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNV-------------NHPVMNENQVQPPI 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENGPTMLFVGELHWWTTD+ELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEFYDP +A+ Sbjct: 227 
ENGPTMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 CKEGMNGY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q Q Q QGRRP N+G+GRGG Sbjct: 287 VCKEGMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQA-QPQGRRP-NEGLGRGG 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 +NY G VG KNMV Sbjct: 345 NLNYQ---SGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAGVGNGA 401 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 GMMHPQ MMGAGFDPTYM GM+PSFPA Sbjct: 402 NGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPA 461 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VNTMGLAGVAPHVNPAFFGR MD HAGMWTD SMGGWGGDEHGRRT Sbjct: 462 VNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRT 521 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 RESSYGG+DGASEYGYG+A+HEKG RS+ A REKER SEREWSGNS Sbjct: 522 RESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDR 580 Query: 375 XXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRD 196 RYREEKDSYR+HRHRERDL +D+WDRGQ MPEE+HRSRSRD Sbjct: 581 SEREHREHRYREEKDSYREHRHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRD 640 Query: 195 VDYGKRR 175 V Y + + Sbjct: 641 VGYREEK 647 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 607 bits (1566), Expect = e-171 Identities = 350/666 (52%), Positives = 398/666 (59%), Gaps = 4/666 (0%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMHR 1972 MA+EQ+DYEDEE AI E FLQMHR Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEMGEDDEYDDLYNDVNIG-ENFLQMHR 59 Query: 1971 SEAP-VRTGIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAAHFPDQRVGPL 1795 SEAP +GNGG Q + ++ L +E+GGSQGL+IPGV+ E KYS HFP+Q V Sbjct: 60 SEAPPAPPSVGNGGFQPRNSNDLR--VESGGSQGLNIPGVAVESKYSTGTHFPEQNV--- 114 Query: 1794 
AAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPGKIV 1615 KGPE+GSV Y DG+ ++QK RV EMT D Q RN+GFQGST P GVD ++M KI Sbjct: 115 --KGPEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKIS 172 Query: 1614 SESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVENGP 1435 ++ P+ N+ G PR PQ+ N+ NENQIRP +ENG Sbjct: 173 NDPTPVPNA--GVPRVIPQLPASQMNMNMDT-----------NRSATNENQIRPPLENGS 219 Query: 1434 TMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASACKE 1255 TML+VGELHWWTTD+ELE+VLSQYG VKEIKFFDERASGKSKGYCQVEFYD AA+ACKE Sbjct: 220 TMLYVGELHWWTTDAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKE 279 Query: 1254 GMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGGMNY 1075 GMNG+LFNGRACVVAFAS QTLKQMGASYMNK+Q QPQ +Q QGRRPMNDG GRGG MNY Sbjct: 280 GMNGHLFNGRACVVAFASQQTLKQMGASYMNKNQGQPQ-SQNQGRRPMNDGAGRGGNMNY 338 Query: 1074 SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXXXXXX 895 G++G KN+V Sbjct: 339 Q-GGDAGRNFGRGGWGRGGQGILNRGPGGGGRMGGRGGSMGAKNIVGGAGGVGSGANGGG 397 Query: 894 XXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTM 715 M+ PQ+MM AGFDPTYM GM+PSFPAVN M Sbjct: 398 YGQGLAGPAFGGPAGAMLPPQSMMRAGFDPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAM 457 Query: 714 GLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESS 535 GLAGVAPHVNPAFFGR MD +AGMW+DTSMGGW G+E GRRTRESS Sbjct: 458 GLAGVAPHVNPAFFGRGMAPNGMGMMGPSGMDGPNAGMWSDTSMGGW-GEEPGRRTRESS 516 Query: 534 YGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXXXXXX 364 YGGDDGASEYGYGE +HEKG RS+AA REKER SER+WSGNS Sbjct: 517 YGGDDGASEYGYGEVNHEKGARSSAASREKERASERDWSGNSDRRHRDDREHDWDRSERE 576 Query: 363 XXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYG 184 RYREEK+SYRDHR RERD ED+WDRGQ R +PEED+RSRSRD DYG Sbjct: 577 HKEHRYREEKESYRDHRQRERDSGYEDDWDRGQSSSRSRSRSRAVPEEDYRSRSRDADYG 636 Query: 183 KRRRLP 166 KRRRLP Sbjct: 637 KRRRLP 642 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 600 bits (1547), 
Expect = e-169 Identities = 354/715 (49%), Positives = 407/715 (56%), Gaps = 50/715 (6%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+D+ DEE AI GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSN-AAHFPDQR 1807 + RSEAP++ G +G+ GL+AQ+ + E +EAGGSQGL+IPGVS +GK+ N +A +P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P A PE+ S +Y G+ +SQKG V E T D QV+NLGFQG T +K G+D + +P Sbjct: 121 EQP-AVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 KI ++ A LNS GGP+G P + N +NENQ++P + Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNV-------------NHPVMNENQVQPPI 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENGPTMLFVGELHWWTTD+ELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEFYDP +A+ Sbjct: 227 ENGPTMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 CKEGMNGY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q Q Q QGRRP N+G+GRGG Sbjct: 287 VCKEGMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQA-QPQGRRP-NEGLGRGG 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 +NY G VG KNMV Sbjct: 345 NLNYQ---SGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAGVGNGA 401 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 GMMHPQ MMGAGFDPTYM GM+PSFPA Sbjct: 402 NGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPA 461 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VNTMGLAGVAPHVNPAFFGR MD HAGMWTD SMGGWGGDEHGRRT Sbjct: 462 VNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRT 521 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 RESSYGG+DGASEYGYG+A+HEKG RS+ A REKER SEREWSGNS Sbjct: 522 
RESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDR 580 Query: 375 XXXXXXXXRYREEK---------------------------------------------D 331 RYREEK D Sbjct: 581 SEREHREHRYREEKDSYREHRHREREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKD 640 Query: 330 SYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRRRLP 166 SYR+HRHRERDL +D+ DRGQ MPEE RSRSRDVDYGKRRRLP Sbjct: 641 SYREHRHRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLP 695 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 570 bits (1470), Expect = e-160 Identities = 333/670 (49%), Positives = 379/670 (56%), Gaps = 5/670 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MDPM EEQ+DYE+EE AI GEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVR-TGIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAAHFPDQRV 1804 MHR E P+ G+GNGGLQAQK +V E ++ G SQ + PG S EGKYS+ P+Q+ Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSSV---PEQKD 117 Query: 1803 GPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPG 1624 P + PE+ S QKGRV EMT D QVRN+GFQG+ + DS+++ G Sbjct: 118 QPPVSVVPEMAS----------QKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTG 167 Query: 1623 KIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVE 1444 KI + P +NS GP Q+ N+ VNENQIRP VE Sbjct: 168 KIANGPIPSMNSGSNGPPAVQQMPANQMNMKINV-----------NRPMVNENQIRPPVE 216 Query: 1443 NGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASA 1264 NG LFVGELHWWTTD+ELE VLSQ+GR+KEIKFFDERASGKSKGYCQV+FYDP AASA Sbjct: 217 NGSATLFVGELHWWTTDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASA 276 Query: 1263 CKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGG 1084 CKEGM+GY+FNGRACVVAFAS QTLKQMG SY+NKSQ Q Q Q QGRRPMNDG GRGG Sbjct: 277 CKEGMDGYVFNGRACVVAFASSQTLKQMGDSYVNKSQGQVQ-TQPQGRRPMNDGAGRGGN 335 Query: 1083 MNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXXX 904 MN+ GA+G +NMV Sbjct: 336 
MNFQ-GGDTGRNFGRGNNWGRGGQGVLNRGPGGGGPGRGRGAMGARNMVGNNAGVGTGAN 394 Query: 903 XXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAV 724 GMM+ MMG GFDPTYM GM+P FP V Sbjct: 395 GGGYGQGLGGPGFGGPVGGMMNAPGMMGPGFDPTYMGRGGGYGGFPGPGFPGMLPQFPGV 454 Query: 723 NTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTR 544 N MGLAGVAPHVNPAFFGR M+ HHA MW D SM GW G+E RRTR Sbjct: 455 NAMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMEGHHAPMWNDPSMAGWTGEEQDRRTR 514 Query: 543 ESSYGGDDGASEYG-YGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 ESSYGGDDG SEYG YGEA+HEK VRS+AAPRE+ER SEREW+G S Sbjct: 515 ESSYGGDDGGSEYGNYGEANHEKPVRSSAAPRERERESEREWTGTSERRHRDEREQDWDR 574 Query: 375 XXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRD 196 RY+EEKDSYRDHR RERD+ ED+ DRG + MPE+DHRSRSRD Sbjct: 575 SEREHREPRYKEEKDSYRDHRRRERDVAYEDDRDRGHSSSRPRSRSKAMPEDDHRSRSRD 634 Query: 195 VDYGKRRRLP 166 VDYGKRRRLP Sbjct: 635 VDYGKRRRLP 644 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 563 bits (1450), Expect = e-157 Identities = 323/621 (52%), Positives = 374/621 (60%), Gaps = 5/621 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+D+ DEE AI GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1980 MHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSN-AAHFPDQR 1807 + RSEAP++ G +G+ GL+AQ+ + E +EAGGSQGL+IPGVS +GK+ N +A +P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P A PE+ S +Y G+ +SQKG V E T D QV+NLGFQG T +K G+D + +P Sbjct: 121 EQP-AVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 KI ++ A LNS GGP+G P + N +NENQ++P + Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNV-------------NHPVMNENQVQPPI 226 Query: 1446 
ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENGPTMLFVGELHWWTTD+ELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEFYDP +A+ Sbjct: 227 ENGPTMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 CKEGMNGY+FNGRACVVAFASPQTLKQMGASYMNK+Q Q Q Q QGRRP N+G+GRGG Sbjct: 287 VCKEGMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQA-QPQGRRP-NEGLGRGG 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 +NY G VG KNMV Sbjct: 345 NLNYQ---SGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAGVGNGA 401 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 GMMHPQ MMGAGFDPTYM GM+PSFPA Sbjct: 402 NGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPA 461 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VNTMGLAGVAPHVNPAFFGR MD HAGMWTD SMGGWGGDEHGRRT Sbjct: 462 VNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRT 521 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 RESSYGG+DGASEYGYG+A+HEKG RS+ A REKER SEREWSGNS Sbjct: 522 RESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDR 580 Query: 375 XXXXXXXXRYREEKDSYRDHR 313 RYREEKDSYR+HR Sbjct: 581 SEREHREHRYREEKDSYREHR 601 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 551 bits (1419), Expect = e-154 Identities = 335/670 (50%), Positives = 382/670 (57%), Gaps = 8/670 (1%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXA-ISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMH 1975 MAE+ +D+EDEE IS GEGFLQ+ Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 1974 RSEAP---VRTGIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQR 1807 RSEAP G+GNG LQAQK + E E GGSQ +IPGVSAEG++S+A + FP Q+ Sbjct: 61 RSEAPSLPAAAGVGNG-LQAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQ 119 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 G K E GS+ Y DGA SQKGR+ GFQGS + H GVDS+++P Sbjct: 120 
DGLKVDKKSEAGSMVYPDGASGSQKGRIVA----------GFQGSKPMLHSVGVDSSDIP 169 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 GK+V+E NS GPRG + VNENQIRPS+ Sbjct: 170 GKMVNEPIQAPNSGGAGPRGILPMQGNQTTVNANVSHPI-----------VNENQIRPSI 218 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENG TMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVE+YD AA Sbjct: 219 ENGSTMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAV 278 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACKEGM+G++FNGRACVVAFASPQTLKQMGA+YM+K+QVQ Q +Q QGRRP+NDGVGRGG Sbjct: 279 ACKEGMHGHVFNGRACVVAFASPQTLKQMGAAYMSKNQVQNQ-SQPQGRRPINDGVGRGG 337 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 N+ GA+G KNMV Sbjct: 338 NPNFQ-SGDGGRNFGRGGWGRGGQGAPNRGPGSGGPMRGRGGAMGAKNMV----GNNAGV 392 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 GMM+PQ MMG GFDPTYM GM+PSFPA Sbjct: 393 GGGGYGQGLAGPPFGGPAGGMMNPQGMMGTGFDPTYMGRGVGYGGFAGPAFPGMLPSFPA 452 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VNTMG A VAPHVNPAFFGR MD H GMW D S+GGWGG+EHGRRT Sbjct: 453 VNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGSSLMDGHQGGMWNDPSIGGWGGEEHGRRT 512 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXX 376 RESSYGGDDGASEYGYG+ +HEKG R ERGSER+WSGNS Sbjct: 513 RESSYGGDDGASEYGYGDTNHEKGGR--------ERGSERDWSGNSERRNHEERDQDWDR 564 Query: 375 XXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRD 196 RYRE KD RD+R +ER+L ED+WDRGQ RV+ E+ HRSRSRD Sbjct: 565 SQKEQKEHRYREGKDGSRDYRPKERELDYEDDWDRGQSSSRLRSRSRVVQEDHHRSRSRD 624 Query: 195 VDYGKRRRLP 166 VDYGKRRRLP Sbjct: 625 VDYGKRRRLP 634 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 532 bits (1371), Expect = e-148 
Identities = 315/667 (47%), Positives = 369/667 (55%), Gaps = 2/667 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MDP A+EQLDY DEE I GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 1980 MHRSEAPVRT-GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQR 1807 + RSE PV + GNG QAQK S GS+ IPG++ EGKY+ FP Q+ Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQK 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 P+ + E + D A ++ + MT + Q N G+QGS +P K G D MP Sbjct: 121 GEPVVERETERPA----DAAQKARPSAIT-MTLNSQAGNSGYQGSMPMPQKIGADPMAMP 175 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 K SE+ PL+NS GPR P + N ++E RPS+ Sbjct: 176 EKNASEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNM---------NNPVISETPFRPSL 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENG TMLFVGELHWWTTD+ELESVL+QYG VKEIKFFDERASGKSKGYCQVEF+DP +A+ Sbjct: 227 ENGNTMLFVGELHWWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAA 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACKEGMNGY FNGRACVVAFA+PQT+KQMG+SY NK+Q Q Q +Q QGRRPMN+GVGR G Sbjct: 287 ACKEGMNGYNFNGRACVVAFATPQTIKQMGSSYANKTQNQVQ-SQPQGRRPMNEGVGR-G 344 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 G NY+ GA+G KNM+ Sbjct: 345 GPNYT---PGDAGRNFGRGSWGRGGPGMPNRGPGGGPVRGRGAMGSKNMM--VNPGAGNG 399 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 G+MHPQ MMG GFDP++M GM+P F A Sbjct: 400 AGGAFGQGLAGPAFGGPPAGLMHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQA 459 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VN MGL GVAPHVNPAFFGR MD H GMWTDTS GGWGG+EHGRRT Sbjct: 460 VNPMGLPGVAPHVNPAFFGRGMAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRT 519 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXXXXXXXXX 367 RESSYGG+D ASEYGYGE SH+KG RS+A REKERGSER+WSGNS Sbjct: 520 
RESSYGGEDNASEYGYGEVSHDKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDR 579 Query: 366 XXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDY 187 RYREE+D YRD+R +ER+ E+++DRGQ R EEDHRSRSRD +Y Sbjct: 580 HDKEHRYREERDGYRDYRQKERESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNY 639 Query: 186 GKRRRLP 166 GKRRR P Sbjct: 640 GKRRRAP 646 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 521 bits (1343), Expect = e-145 Identities = 309/614 (50%), Positives = 354/614 (57%), Gaps = 4/614 (0%) Frame = -2 Query: 1995 EGFLQMHRSEAPVRTG-IGNGGLQAQKTDVLESTLEAGGSQGLSIPG--VSAEGKYSNA- 1828 E FLQMH SEAP +GNGG Q + ES +E GGSQ L+I G + EG YSNA Sbjct: 42 ENFLQMHGSEAPAPPATVGNGGFQTRNAH--ESRIETGGSQALAITGGGPAVEGIYSNAK 99 Query: 1827 AHFPDQRVGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSG 1648 AHFP+Q+ +A + ++G V DG+ V+QKGRV EM+ D QVRN+GFQ ST +P G Sbjct: 100 AHFPEQKQVAVAVEAQDVGPV---DGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIG 156 Query: 1647 VDSTNMPGKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNE 1468 VD ++M K E PL + GPRGAPQ+ N+ VNE Sbjct: 157 VDPSDMSRKNAIEPEPLPITGSAGPRGAPQMQVNQMHMSADV-----------NRPVVNE 205 Query: 1467 NQIRPSVENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF 1288 NQ+RP +ENG T L+VGELHWWTTD+ELES SQ+GRVKEIKFFDERASGKSKGYCQV+F Sbjct: 206 NQVRPPIENGSTTLYVGELHWWTTDAELESFASQFGRVKEIKFFDERASGKSKGYCQVDF 265 Query: 1287 YDPGAASACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMN 1108 Y+ AA+ACKEGMNG++FNGR CVVAFASPQTLKQMGASYMNK+Q QPQ Q QGR MN Sbjct: 266 YEAAAAAACKEGMNGHVFNGRPCVVAFASPQTLKQMGASYMNKTQGQPQ-TQSQGRGSMN 324 Query: 1107 DGVGRGGGMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXX 928 DG GRGG N+ GA+GPKNM Sbjct: 325 DGAGRGGNANFQ---SGDGGRNYGRGAWGRGGQGILNRGPGGGPMRGRGAMGPKNMAGNV 381 Query: 927 XXXXXXXXXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXG 748 GMM PQ MMGAGFDP YM G Sbjct: 382 
AGVGSGANGGGYGQGLAGPAFGGPAGGMMPPQGMMGAGFDPLYMGRGGGYGGFAGPGFPG 441 Query: 747 MIPSFPAVNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGG 568 M+PSFPAVN+MGLAGVAPHVNPAFF R MD + GMW Sbjct: 442 MLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVSSGMDGPNPGMW---------- 491 Query: 567 DEHGRRTRESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXX 388 ESSY GD+GASEYGYGE +HEKG RS+ A REKERGSER+WSGNS Sbjct: 492 --------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKERGSERDWSGNSDRRHRDE 543 Query: 387 XXXXXXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRS 208 RY+EEKDSYR HR RERD ED+ DRG R PEED+RS Sbjct: 544 REQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRS 603 Query: 207 RSRDVDYGKRRRLP 166 R+RDVDYGKRRRLP Sbjct: 604 RTRDVDYGKRRRLP 617 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 493 bits (1269), Expect = e-136 Identities = 295/614 (48%), Positives = 337/614 (54%), Gaps = 4/614 (0%) Frame = -2 Query: 1995 EGFLQMHRSEAPVRTGI-GNGGLQAQKTDVLESTLEAGGSQGLSIPG--VSAEGKYSNA- 1828 E FLQMH SEAP GNGG Q + ES +E GGSQ L+ G V+ EGKYSNA Sbjct: 42 ENFLQMHGSEAPAPPATAGNGGFQTRNAH--ESRVETGGSQVLATSGAGVAVEGKYSNAG 99 Query: 1827 AHFPDQRVGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSG 1648 AHFP+Q+ + + ++GS+ Y DG+ V+Q +GS Sbjct: 100 AHFPEQKQAGIGVEANDVGSIGYGDGSSVAQ------------------KGSA------- 134 Query: 1647 VDSTNMPGKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNE 1468 GPRG PQ+ N+ VNE Sbjct: 135 -----------------------GPRGVPQMQVNQMNMNADV-----------NRPVVNE 160 Query: 1467 NQIRPSVENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF 1288 NQ+RP +ENGPT L+VGELHWWTTD+ELESV SQYGRVKEIKFFDERASGKSKGYCQV+F Sbjct: 161 NQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDF 220 Query: 1287 YDPGAASACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMN 1108 Y+ AA+ACKEGMN ++FNGR CVVAFAS QTLKQMGASYM+K+Q QPQ Q QGR MN Sbjct: 221 
YEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQGQPQ-PQSQGRGSMN 279 Query: 1107 DGVGRGGGMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXX 928 DG+GRGG NY +GPKNM Sbjct: 280 DGMGRGGNANYQSGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRG---GMGPKNMAGNV 336 Query: 927 XXXXXXXXXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXG 748 GMMH Q MMGAGFDP YM G Sbjct: 337 AGVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPG 396 Query: 747 MIPSFPAVNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGG 568 M+PSFPAVN+MGLAGVAPHVNPAFF R M+ + G W DTSMGGWG Sbjct: 397 MLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMASSGMEGPNPGKWPDTSMGGWG- 455 Query: 567 DEHGRRTRESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXX 388 +E GRRTRESSY GD+GASEYGYGE +HEKG RS+ A REKER SER+WSGNS Sbjct: 456 EEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGNSDRRHRDE 515 Query: 387 XXXXXXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRS 208 +YREEKD+YR HR RERD ED+ DRG R PEED+RS Sbjct: 516 REQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRS 575 Query: 207 RSRDVDYGKRRRLP 166 RSRDVDYGKRRR P Sbjct: 576 RSRDVDYGKRRRPP 589 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 467 bits (1202), Expect = e-129 Identities = 290/667 (43%), Positives = 345/667 (51%), Gaps = 2/667 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MDP+ +EQLDY DEE AI GEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 1980 MHRSEAPVRTGIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYS-NAAHFPDQRV 1804 M RSEAP + +GN K + EA SQ ++ V EG Y+ N DQ+ Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKN 120 Query: 1803 GPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPG 1624 A GP SQ+ R+ E+ Q +LG+QGS ++ HK+ D N Sbjct: 121 NLTAVGGP-------AQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSE 173 Query: 1623 KIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV- 1447 IV E A L+ + G +G PQ N+ +E IRPS Sbjct: 174 
NIVGEPASLVYPNTGSSKGVPQAPSNLMNSNANVNVNV-------NRSMDDEYLIRPSGG 226 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENG M++VGELHWWTTD+E+ESVL QYGRVKEIKFFDERASGKSKGYCQVEFYDP AA+ Sbjct: 227 ENGNPMIYVGELHWWTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAAT 286 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACK+GM G++FNGRACVV +A+PQT KQMGASY NK+Q Q Q +Q QGR PMNDG GRG Sbjct: 287 ACKDGMQGHIFNGRACVVTYANPQTSKQMGASY-NKNQGQSQ-SQLQGRNPMNDGAGRG- 343 Query: 1086 GMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXXXXXXXXX 907 N + G +G KNM+ Sbjct: 344 --NGTNYPSGDAGRNFGRGGGWGRGNQAPNRGPGAGPIRGRGGMGNKNMIGNAPGAGGGG 401 Query: 906 XXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPA 727 MMHPQ MMG GFD +M GM+P F Sbjct: 402 AYGQGLNGPGFGGPPG----MMHPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGMLPPFQG 457 Query: 726 VNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRT 547 VN+MGL GVAPHVNPAFFGR M H+GMW D +MGGWGG+EHGR Sbjct: 458 VNSMGLPGVAPHVNPAFFGRGMNPNGMGMMGNPGMVGPHSGMWNDPNMGGWGGEEHGR-- 515 Query: 546 RESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXXXXXXXXX 367 ESSYGG+D ASEYGYGE SH+K VRS+AAPREKER SERE+ Sbjct: 516 -ESSYGGEDNASEYGYGEGSHDKSVRSSAAPREKERTSEREYPERK---HREERENDGER 571 Query: 366 XXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDY 187 +YREEKD YR+HRH+ER+ +D+WDRGQ V EEDHRSRSRD DY Sbjct: 572 NDRDSKYREEKDRYREHRHKERESGYDDDWDRGQSSRSRSRSGAVQ-EEDHRSRSRDADY 630 Query: 186 GKRRRLP 166 GKRRR+P Sbjct: 631 GKRRRMP 637 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 455 bits (1170), Expect = e-125 Identities = 283/614 (46%), Positives = 325/614 (52%), Gaps = 4/614 (0%) Frame = -2 Query: 1995 EGFLQMHRSEAPVRTGI-GNGGLQAQKTDVLESTLEAGGSQGLSIPG--VSAEGKYSNA- 1828 E FLQMH SEAP GNGG Q + ES +E GGSQ L+ G V+ EGKYSNA Sbjct: 42 ENFLQMHGSEAPAPPATAGNGGFQTRNAH--ESRVETGGSQVLATSGAGVAVEGKYSNAG 
99 Query: 1827 AHFPDQRVGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSG 1648 AHFP+Q+ + + ++GS+ Y DG+ V+Q +GS Sbjct: 100 AHFPEQKQAGIGVEANDVGSIGYGDGSSVAQ------------------KGSA------- 134 Query: 1647 VDSTNMPGKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNE 1468 GPRG PQ+ N+ VNE Sbjct: 135 -----------------------GPRGVPQMQVNQMNMNADV-----------NRPVVNE 160 Query: 1467 NQIRPSVENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF 1288 NQ+RP +ENGPT L+VGELHWWTTD+ELESV SQYGRVKEIKFFDERASGKSKGYCQV+F Sbjct: 161 NQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDF 220 Query: 1287 YDPGAASACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMN 1108 Y+ AA+ACKEGMN ++FNGR CVVAFAS QTLKQMGASYM+K+Q QPQ Q QGR MN Sbjct: 221 YEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQGQPQ-PQSQGRGSMN 279 Query: 1107 DGVGRGGGMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVXXX 928 DG+GRGG NY G +GPKNM Sbjct: 280 DGMGRGGNANYQ---SGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRGGMGPKNMAGNV 336 Query: 927 XXXXXXXXXXXXXXXXXXXXXXXXXXGMMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXG 748 GMMH Q MMGAGFDP YM G Sbjct: 337 AGVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPG 396 Query: 747 MIPSFPAVNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGG 568 M+PSFPAVN+MGLAGVAPHVNPAFF R + GM + M G Sbjct: 397 MLPSFPAVNSMGLAGVAPHVNPAFFARGMA-------------PNGMGMMASSGMEG--- 440 Query: 567 DEHGRRTRESSYGGDDGASEYGYGEASHEKGVRSNAAPREKERGSEREWSGNSXXXXXXX 388 +ESSY GD+GASEYGYGE +HEKG RS+ A REKER SER+WSGNS Sbjct: 441 ---PNPGKESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGNSDRRHRDE 497 Query: 387 XXXXXXXXXXXXRYREEKDSYRDHRHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRS 208 +YREEKD+YR HR RERD ED+ DRG R PEED+RS Sbjct: 498 REQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRS 557 Query: 207 RSRDVDYGKRRRLP 166 RSRDVDYGKRRR P Sbjct: 558 RSRDVDYGKRRRPP 571 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 390 bits (1001), Expect = e-105 Identities = 212/364 (58%), 
Positives = 245/364 (67%), Gaps = 2/364 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+DYE+EE AI G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 1980 MHRSEAPVRT-GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAA-HFPDQR 1807 + EAP + G+GNG LQ +KTDV E ++AG SQG ++PGVS EGKY+NA HFP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 +A P +GS NY DGA VSQKG V E T D VRN+GFQGST P ++GVD +NMP Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMP 180 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 G++ +E AP+LN GP+GA N N+ VNENQIRP + Sbjct: 181 GRVANEPAPVLNPGAAGPQGA------------LIPANQMGVNINVNRAMVNENQIRPPL 228 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENG TMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF+D AA+ Sbjct: 229 ENGGTMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAA 288 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACK+GMNG++FNGR CVVAFASPQTLKQMGASYMNK+Q QPQ +Q QGRRPMNDG GRGG Sbjct: 289 ACKDGMNGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQ-SQTQGRRPMNDGGGRGG 347 Query: 1086 GMNY 1075 MNY Sbjct: 348 NMNY 351 Score = 267 bits (682), Expect = 2e-68 Identities = 139/230 (60%), Positives = 152/230 (66%), Gaps = 3/230 (1%) Frame = -2 Query: 846 MMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTMGLAGVAPHVNPAFFGR 667 MMHPQNMMG GFDPTYM GM+PSFPAVN MGLAGVAPHVNPAFF R Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 666 XXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEAS 487 MD H GMWTD+SMGGW G+EHGRRTRESSYGGDDGAS+YGYGEA+ Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEAN 546 Query: 486 HEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXXXXXXXXXXRYREEKDSYRDH 316 HEKG RS AA REK+RGSER+WSGN+ R+REEKDSYRD Sbjct: 547 
HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606 Query: 315 RHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRRRLP 166 R R+RD +D WDRG R +P+EDHRSRSRDVDYGKRRRLP Sbjct: 607 RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLP 656 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 389 bits (1000), Expect = e-105 Identities = 212/364 (58%), Positives = 245/364 (67%), Gaps = 2/364 (0%) Frame = -2 Query: 2160 MDPMAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQ 1981 MD MAEEQ+DYE+EE AI G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 1980 MHRSEAPVRT-GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAA-HFPDQR 1807 + EAP + G+GNG LQ +KTDV E ++AG SQG ++PGVS EGKY+NA HFP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 1806 VGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMP 1627 +A P +GS NY DGA VSQKG V E T D VRN+GFQGST P ++GVD +NMP Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMP 180 Query: 1626 GKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSV 1447 G++ +E AP+LN GP+GA N N+ VNENQIRP + Sbjct: 181 GRVANEPAPVLNPGAAGPQGA------------LIPANQMGVNINVNRAMVNENQIRPPL 228 Query: 1446 ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAAS 1267 ENG TMLFVGELHWWTTD+ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF+D AA+ Sbjct: 229 ENGGTMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAA 288 Query: 1266 ACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGG 1087 ACK+GMNG++FNGR CVVAFASPQTLKQMGASYMNK+Q QPQ +Q QGRRPMNDG GRGG Sbjct: 289 ACKDGMNGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQ-SQTQGRRPMNDGGGRGG 347 Query: 1086 GMNY 1075 MNY Sbjct: 348 NMNY 351 Score = 267 bits (683), Expect = 1e-68 Identities = 139/230 (60%), Positives = 152/230 (66%), Gaps = 3/230 (1%) Frame = -2 Query: 846 MMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTMGLAGVAPHVNPAFFGR 667 
MMHPQNMMG GFDPTYM GM+PSFPAVN MGLAGVAPHVNPAFF R Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 666 XXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEAS 487 MD H GMWTD+SMGGW G+EHGRRTRESSYGGDDGAS+YGYGEA+ Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEAN 546 Query: 486 HEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXXXXXXXXXXRYREEKDSYRDH 316 HEKG RS AA REK+RGSER+WSGN+ R+REEKDSYRD Sbjct: 547 HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606 Query: 315 RHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRRRLP 166 R R+RD +D WDRG R +P+EDHRSRSRDVDYGKRRRLP Sbjct: 607 RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLP 656 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 383 bits (984), Expect = e-103 Identities = 207/361 (57%), Positives = 242/361 (67%), Gaps = 2/361 (0%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMHR 1972 MAEEQ+DYE++E AI G+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 1971 SEAPVRT-GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQRVGP 1798 EAP + G+GNG LQ +KTDV E ++ GGSQG +IPGVS EGKY+NA +HFP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQ 120 Query: 1797 LAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPGKI 1618 +A P +GS NY DGA VSQKG V E T D VRN+GFQGST P ++GVD +NMPG++ Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRV 180 Query: 1617 VSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVENG 1438 +E AP+LN GP+GA N N++ VNENQIRP +ENG Sbjct: 181 ANEPAPVLNPGAAGPQGA------------LIPANQMGVNANVNRVMVNENQIRPPLENG 228 Query: 1437 PTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASACK 1258 TMLFVGELHWWTTD+ELESVLSQYGR KEIKFFDERASGKSKGYCQVEF+D AA+ACK Sbjct: 229 GTMLFVGELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACK 288 Query: 1257 
EGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGGMN 1078 +GMNG++FNGR CVVAFASPQTLKQMGASYMNK+Q QPQ +Q QG RPMNDG GRGG N Sbjct: 289 DGMNGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQ-SQNQGSRPMNDGGGRGGNTN 347 Query: 1077 Y 1075 Y Sbjct: 348 Y 348 Score = 269 bits (688), Expect = 4e-69 Identities = 139/230 (60%), Positives = 152/230 (66%), Gaps = 3/230 (1%) Frame = -2 Query: 846 MMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTMGLAGVAPHVNPAFFGR 667 MMHPQNMMG GFDPTYM GM+PSFPAVN MGLAGVAPHVNPAFF R Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 666 XXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEAS 487 MD H GMWTD+SMGGW G+EHGRRTRESSYGGDDGAS+YGYGEA+ Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEAN 543 Query: 486 HEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXXXXXXXXXXRYREEKDSYRDH 316 HEKG RS AA REK+RGSER+WSGN+ R+REEKDSYRD Sbjct: 544 HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603 Query: 315 RHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRRRLP 166 R R+RD +D WDRGQ +P+EDHRSRSRDVDYGKRRRLP Sbjct: 604 RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLP 653 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 378 bits (971), Expect = e-102 Identities = 206/361 (57%), Positives = 240/361 (66%), Gaps = 2/361 (0%) Frame = -2 Query: 2151 MAEEQLDYEDEEXXXXXXXXXXXXXAISXXXXXXXXXXXXXXXXXXXXXXXGEGFLQMHR 1972 MAEEQ+DYE++E AI G+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 1971 SEAPVRT-GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNA-AHFPDQRVGP 1798 EAP + G+GNG LQ +KTDV E ++ GGSQG +IPGVS EGKY+NA + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQ 120 Query: 1797 
LAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIPHKSGVDSTNMPGKI 1618 +A P +GS NY DGA VSQKG V E T D VRN+GFQGST P ++GVD +NMPG+ Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRA 180 Query: 1617 VSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVNENQIRPSVENG 1438 +E AP+LN GP+GA N N++ VNENQIRP +ENG Sbjct: 181 ANEPAPVLNPGAAGPQGA------------LIPANQMGVNANVNRVMVNENQIRPPLENG 228 Query: 1437 PTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPGAASACK 1258 TMLFVGELHWWTTD+ELESVLSQYGR KEIKFFDERASGKSKGYCQVEF+D AA+ACK Sbjct: 229 GTMLFVGELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACK 288 Query: 1257 EGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRPMNDGVGRGGGMN 1078 +GMNG++FNGR CVVAFASPQTLKQMGASYMNK+Q QPQ +Q QG RPMNDG GRGG N Sbjct: 289 DGMNGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQ-SQNQGSRPMNDGGGRGGNTN 347 Query: 1077 Y 1075 Y Sbjct: 348 Y 348 Score = 269 bits (688), Expect = 4e-69 Identities = 139/230 (60%), Positives = 151/230 (65%), Gaps = 3/230 (1%) Frame = -2 Query: 846 MMHPQNMMGAGFDPTYMXXXXXXXXXXXXXXXGMIPSFPAVNTMGLAGVAPHVNPAFFGR 667 MMHPQNMMG GFDPTYM GM+PSFPAVN MGLAGVAPHVNPAFF R Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 666 XXXXXXXXXXXXXXMDAHHAGMWTDTSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEAS 487 MD H GMWTD+SMGGW G+EHGRRTRESSYGGDDGAS+YGYGEAS Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEAS 543 Query: 486 HEKGVRSNAAPREKERGSEREWSGNS---XXXXXXXXXXXXXXXXXXXRYREEKDSYRDH 316 HEKG RS A REK+RGSER+WSGN+ R+REEKDSYRD Sbjct: 544 HEKGARSTTASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603 Query: 315 RHRERDLVNEDEWDRGQXXXXXXXXXRVMPEEDHRSRSRDVDYGKRRRLP 166 R R+RD +D WDRGQ +P+EDHRSRSRDVDYGKRRRLP Sbjct: 604 RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLP 653 >ref|XP_002889992.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297335834|gb|EFH66251.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 573 Score = 327 bits (838), Expect = 1e-86 Identities = 239/618 (38%), Positives = 298/618 (48%), Gaps = 10/618 (1%) Frame = -2 Query: 1995 EGFLQMHRSEAPVRT--GIGNGGLQAQKTDVLESTLEAGGSQGLSIPGVSAEGKYSNAAH 1822 E F Q H P G GN LQAQ + V A G+ + G + EGKY N Sbjct: 47 ESFFQAHNQPQPPAQVGGTGNASLQAQTSHVA-----AEPRMGI-VSGGTVEGKYRNDG- 99 Query: 1821 FPDQRVGPLAAKGPELGSVNYTDGALVSQKGRVNEMTTDGQVRNLGFQGSTLIP---HKS 1651 G GP+ S Y + KG + D Q +G QGST + H Sbjct: 100 ------GHNGISGPDTRSDVYPQASSFGAKG----LNIDIQSNKIGQQGSTSVVLNNHGF 149 Query: 1650 GVDSTNMPGKIVSESAPLLNSDPGGPRGAPQISXXXXXXXXXXXXXXXXXXXNDNQIRVN 1471 ++ N+P P+ N P+GA QI + + +N Sbjct: 150 SGNAVNVP------ELPVHNPYGAPPQGAQQIPVSQMSV--------------NPNVMMN 189 Query: 1470 ENQIRPSV-ENGPTMLFVGELHWWTTDSELESVLSQYGRVKEIKFFDERASGKSKGYCQV 1294 ++ +P V +NG TMLFVGELHWWTTD+E+ESVLSQYGRVKEIKFFDER SGKSKGYCQV Sbjct: 190 KSPTQPFVVDNGNTMLFVGELHWWTTDAEIESVLSQYGRVKEIKFFDERVSGKSKGYCQV 249 Query: 1293 EFYDPGAASACKEGMNGYLFNGRACVVAFASPQTLKQMGASYMNKSQVQPQINQQQGRRP 1114 EFYD AA++CKEGMNGY+FNG+ACVVAFASP+TLKQMGA++ ++Q Q NQ Q RRP Sbjct: 250 EFYDSAAAASCKEGMNGYIFNGKACVVAFASPETLKQMGANFTGRNQGQ---NQIQNRRP 306 Query: 1113 MNDGVGRGGGMNYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAVGPKNMVX 934 +N+G+GR G N + GP NM Sbjct: 307 LNEGMGR--GNNNNNMNTQNGDGGRNYGRGGFARGGQGMSNRGGPWGGGMRGRGPNNMAS 364 Query: 933 XXXXXXXXXXXXXXXXXXXXXXXXXXXXGMMHPQNMMGAG-FDPTYMXXXXXXXXXXXXX 757 MMHPQ MMGAG FDPT+M Sbjct: 365 GSGTGPYGPGLAGPAFGG-----------MMHPQGMMGAGGFDPTFMGRGGGFGGYSGIA 413 Query: 756 XXGMIPSFPAVNTMGLAGVAPHVNPAFFGRXXXXXXXXXXXXXXMDAHHAGMWTDTSMGG 577 GM S+P VN MG+ GVAPHVNPAFFG H A MW++ + G Sbjct: 414 YPGMPHSYPGVNAMGMVGVAPHVNPAFFG----TGMGTMGSAGMNGVHAAAMWSEANGG- 468 Query: 576 WGGDEHGRRTRESSYGGDDGASEY-GYGEASHEKGVRSNAAPREKERG-SEREWSGNSXX 403 GG++G SEY GY + + EK + + R+KER +ER+WS NS Sbjct: 469 ---------------GGEEGGSEYGGYEDETQEKEEKPS---RDKERATTERDWSENS-- 508 Query: 402 XXXXXXXXXXXXXXXXXRYREEKDSYRDHR-HRERDLVNEDEWDRGQXXXXXXXXXRVMP 226 +REEKDS+R+++ R+RD DE+DRGQ R M Sbjct: 509 
-----------GDRRHKSHREEKDSHREYKQQRDRD---SDEFDRGQSSVKSRSRSR-MS 553

Query: 225 EEDHRSRSRDVDYGKRRR 172
           E+DHRSRSRD DYGKRRR
Sbjct: 554 EEDHRSRSRDADYGKRRR 571