BLASTX nr result
ID: Rauwolfia21_contig00001096
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rauwolfia21_contig00001096 (2260 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma c... 613 e-172 ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309... 583 e-164 gb|EXB82464.1| Cleavage and polyadenylation specificity factor s... 583 e-163 ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec... 415 e-113 ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268... 372 e-100 ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec... 367 1e-98 ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr... 367 2e-98 gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus pe... 364 1e-97 gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma c... 363 2e-97 gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Th... 363 2e-97 gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma c... 363 2e-97 gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma c... 363 2e-97 ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu... 359 2e-96 ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr... 359 3e-96 ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec... 358 6e-96 ref|XP_002312652.1| RNA recognition motif-containing family prot... 321 8e-85 gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise... 311 1e-81 ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu... 285 5e-74 ref|XP_002315647.1| RNA recognition motif-containing family prot... 285 5e-74 ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A... 272 5e-70 >gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 613 bits (1581), Expect = e-172 Identities = 341/664 (51%), Positives = 403/664 (60%), Gaps = 10/664 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQIDFG+E+YGG+QKMQY G GAI ALA++EM+ VGEGFL + Sbjct: 2 DAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+EA P + G QA K PE R EA Q NIPG + + K+ + R+PEQ Sbjct: 62 QRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQDG 121 Query: 1741 GLAAN---IGP---PPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGK 1580 A + +G P T K + E T +Q NMG+QG + KVG D SG Sbjct: 122 QPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDP---SG- 177 Query: 1579 VPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTML 1400 VP + SLN G G +G P +P +QM +NVN P+I+E+Q+RP +ENG TML Sbjct: 178 VPQKIANVPAQSLNSGTGGPQGAPHVPPNQM----GLNVNHPMISENQVRPPIENGPTML 233 Query: 1399 FVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMN 1220 FVGELHWWTTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVEFY+PA+AAACKEGM+ Sbjct: 234 FVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMD 293 Query: 1219 GYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSYPXXX 1040 GY+FNGRACVVAFA+PQT+KQM A+YMNK Q Q+Q+QPQGRRP NDG GRG +Y Sbjct: 294 GYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNY--QS 350 Query: 1039 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKNMIXXXXXXXXXXXXXXXXXXX 860 KNM+ Sbjct: 351 GDAGRNYGRGGWGRGGQGVVNRSGVGGPMRGRGGVGVKNMVGSSAGVGNGANGGAAYGQG 410 Query: 859 XXXXXXXXXXXGLMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAG 680 G+MHPQ MMG GFDPTYMGRG YGGF GP FPGM+P FPAVN +GLAG Sbjct: 411 PAGPPFGGPAGGMMHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAG 470 Query: 679 VAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGE 500 VAPHVNPAFF GPH GMWTD+SMGGWGG+EHGRRTRESSYGGE Sbjct: 471 VAPHVNPAFFGRGMAPNGMGMMGGPGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGE 530 Query: 499 DNASEYGYGEASHDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXX 329 D ASEYGYG+A+H+KG RSSGASREKER +R+WSG+S Sbjct: 531 DGASEYGYGDANHEKG-RSSGASREKERVSDREWSGNSDRRHRDEKERDWDRSEREHREH 589 Query: 328 XXXXEKDGYRDYHHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRR 152 EKD YR++ H+ERD Y++D DRGQ AMPEE RSRSRD DYGKRRR Sbjct: 590 RYREEKDSYREHRHRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRR 649 Query: 151 VPSD 140 +PS+ Sbjct: 650 LPSE 653 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 583 bits (1504), Expect = e-164 Identities = 322/659 (48%), Positives = 385/659 (58%), Gaps = 5/659 (0%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 DPM +EQID+ EE+YGG+QK+QY GAI ALA++E + VGEGFL M Sbjct: 2 DPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQM 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 R E LPP N G QA K N PE R + QE PG + E KY S PEQK Sbjct: 62 HRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSS----VPEQKD 117 Query: 1741 GLAANIGPPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAE 1562 ++ P A K + E TH +Q NMG+QG+ M V DS ++GK+ Sbjct: 118 QPPVSVVPEM---ASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIA---- 170 Query: 1561 SATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELH 1382 + +PS+N G V QMP +QM + INVNRP++NE+Q+RP VENG+ LFVGELH Sbjct: 171 NGPIPSMNSGSNGPPAVQQMPANQM--NMKINVNRPMVNENQIRPPVENGSATLFVGELH 228 Query: 1381 WWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNG 1202 WWTTD ELE VL+Q+G++KEIKFFDERASGKSKGYCQV+FY+PAAA+ACKEGM+GYVFNG Sbjct: 229 WWTTDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNG 288 Query: 1201 RACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSYPXXXXXXXXX 1022 RACVVAFA+ QT+KQM +Y+NK+Q Q Q+QPQGRRPMNDGAGRG ++ Sbjct: 289 RACVVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGGDTGRNFG 348 Query: 1021 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKNMIXXXXXXXXXXXXXXXXXXXXXXXXX 842 A+NM+ Sbjct: 349 RGNNWGRGGQGVLNRGPGGGGPGRGRGAMGARNMVGNNAGVGTGANGGGYGQGLGGPGFG 408 Query: 841 XXXXXGLMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVN 662 + P MMGPGFDPTYMGRG GYGGF GP FPGM+P FP VN MGLAGVAPHVN Sbjct: 409 GPVGGMMNAP-GMMGPGFDPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVN 467 Query: 661 PAFFXXXXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEY 482 PAFF G HA MW D SM GW GEE RRTRESSYGG+D SEY Sbjct: 468 PAFFGRGMATNGMGMMGSSGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEY 527 Query: 481 G-YGEASHDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXE 314 G YGEA+H+K VRSS A RE+ER ER+W+G+S E Sbjct: 528 GNYGEANHEKPVRSSAAPRERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEE 587 Query: 313 KDGYRDYHHKERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 KD YRD+ +ERD YE+D DRG +AMPE+ HRSRSRD DYGKRRR+PS+ Sbjct: 588 KDSYRDHRRRERDVAYEDDRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 583 bits (1503), Expect = e-163 Identities = 328/665 (49%), Positives = 395/665 (59%), Gaps = 13/665 (1%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHG-GGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQ 1919 MA++ IDF +E+YGG+QK QY G GGAISALA++E++ VGEGFL +Q Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 1918 RNEA-SLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 R+EA SLP + G QA K NFPE R E Q+PNIPG + E ++ S+G +FP Q+ Sbjct: 61 RSEAPSLPAAAGVGNGLQAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD 120 Query: 1741 GLAANIGPPPTTDALHKAQA-----PETTHSSQAGNM--GYQGSVAMPQKVGPDSLAMSG 1583 GL + K++A P+ SQ G + G+QGS M VG DS Sbjct: 121 GLKVD----------KKSEAGSMVYPDGASGSQKGRIVAGFQGSKPMLHSVGVDS----S 166 Query: 1582 KVPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTM 1403 +PG + + + N G RG+ M G+Q T N NV+ PI+NE+Q+RP++ENG+TM Sbjct: 167 DIPGKMVNEPIQAPNSGGAGPRGILPMQGNQTT--VNANVSHPIVNENQIRPSIENGSTM 224 Query: 1402 LFVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGM 1223 LFVGELHWWTTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVE+Y+ AAA ACKEGM Sbjct: 225 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGM 284 Query: 1222 NGYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSYPXX 1043 +G+VFNGRACVVAFA+PQT+KQM A YM+K QVQ QSQPQGRRP+NDG GRG ++ Sbjct: 285 HGHVFNGRACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSG 344 Query: 1042 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKNMIXXXXXXXXXXXXXXXXXX 863 AKNM+ Sbjct: 345 DGGRNFGRGGWGRGGQGAPNRGPGSGGPMRGRGGAMGAKNMV-----GNNAGVGGGGYGQ 399 Query: 862 XXXXXXXXXXXXGLMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLA 683 G+M+PQ MMG GFDPTYMGRG GYGGF+GPAFPGM+P FPAVN MG A Sbjct: 400 GLAGPPFGGPAGGMMNPQGMMGTGFDPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFA 459 Query: 682 GVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGG 503 VAPHVNPAFF G GMW D S+GGWGGEEHGRRTRESSYGG Sbjct: 460 AVAPHVNPAFFGRGMTNNGMGMVGSSLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGG 519 Query: 502 EDNASEYGYGEASHDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXX 332 +D ASEYGYG+ +H+KG R ERG ERDWSG+S Sbjct: 520 DDGASEYGYGDTNHEKGGR--------ERGSERDWSGNSERRNHEERDQDWDRSQKEQKE 571 Query: 331 XXXXXEKDGYRDYHHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRR 155 KDG RDY KER+ YE+DWDRGQ R + E+ HRSRSRD DYGKRR Sbjct: 572 HRYREGKDGSRDYRPKERELDYEDDWDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRR 631 Query: 154 RVPSD 140 R+PS+ Sbjct: 632 RLPSE 636 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 415 bits (1066), Expect = e-113 Identities = 214/352 (60%), Positives = 253/352 (71%), Gaps = 2/352 (0%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 DP ADEQ+D+G+E+YGGS KMQYHG G I ALAEDEM+ +GEGFL + Sbjct: 2 DPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+E +P V A N FQA K +FP SRA +E IPG ATE KY + ++FP+QK Sbjct: 62 QRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQKG 121 Query: 1741 GLAANIGPPPTTDALHKAQ--APETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGS 1568 DA KA+ A T +SQAGN GYQGS+ MPQK+G D +AM K Sbjct: 122 EPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNASE 181 Query: 1567 AESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGE 1388 A + + S+ PGP R VP MP +Q+ SS N+N+N P+I+E+ RP++ENGNTMLFVGE Sbjct: 182 A-TPLMNSVVPGP---RVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGE 237 Query: 1387 LHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVF 1208 LHWWTTD ELESVLTQYG VKEIKFFDERASGKSKGYCQVEF++PA+AAACKEGMNGY F Sbjct: 238 LHWWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNF 297 Query: 1207 NGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 NGRACVVAFATPQTIKQM ++Y NKTQ Q QSQPQGRRPMN+G GRG G +Y Sbjct: 298 NGRACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGRG-GPNY 348 Score = 298 bits (764), Expect = 5e-78 Identities = 149/229 (65%), Positives = 159/229 (69%), Gaps = 1/229 (0%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 LMHPQ MMGPGFDP++MGRGAGYGGFSGPAFPGM+PPF AVNPMGL GVAPHVNPAFF Sbjct: 420 LMHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGR 479 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPH GMWTD+S GGWGGEEHGRRTRESSYGGEDNASEYGYGE S Sbjct: 480 GMAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVS 539 Query: 463 HDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGYRDYHHK 284 HDKG RSS SREKERG ERDWSG+S E+DGYRDY K Sbjct: 540 HDKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQK 599 Query: 283 ERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ER+ YE D+DRGQ RA EE HRSRSRD +YGKRRR PS+ Sbjct: 600 ERESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 372 bits (954), Expect = e-100 Identities = 199/352 (56%), Positives = 241/352 (68%), Gaps = 7/352 (1%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQR 1916 MA+EQ+D+ +E+YGG+QKM + GGGAISALA+DE++ VGEGFL M R Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 1915 NEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKSGL 1736 +EA P + FQA K + P + EA Q IPG + E KY S F E+K G Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKY--SNPHFHEKKEGP 118 Query: 1735 AANIGPPPTTDA-------LHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKV 1577 A GP + + K + E TH +Q N+G+QGS +PQK G + + GK+ Sbjct: 119 MAVKGPEMGSTSHLDGPSVSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKI 178 Query: 1576 PGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLF 1397 + P LN G G R VPQM +QM N+NVNRP++NE+Q+RPAV+NG TMLF Sbjct: 179 ANEST----PVLNSGTGGPRAVPQMLSNQM--GMNVNVNRPMVNENQIRPAVDNGATMLF 232 Query: 1396 VGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNG 1217 VGELHWWTTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVEFY+ +AAAACKEGMNG Sbjct: 233 VGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNG 292 Query: 1216 YVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNG 1061 Y+FNGRACVVAFA+PQT+KQM A+YMNKT QAQSQ QGRRPMNDG GRG G Sbjct: 293 YIFNGRACVVAFASPQTLKQMGASYMNKT--QAQSQSQGRRPMNDGVGRGGG 342 Score = 282 bits (722), Expect = 4e-73 Identities = 142/229 (62%), Positives = 157/229 (68%), Gaps = 1/229 (0%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 LMHPQ MMG GFDPTYMGRG YGGFSG AFPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 419 LMHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGR 478 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 G HAGMWTD+SMGGWGGEEHGRRTRESSYGG+D AS+YGYGE + Sbjct: 479 GMAANGMGMMGATGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVN 538 Query: 463 HDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGYRDYHHK 284 H+K RS+ ASREKERG ERDWSG+S EKDGYRD+ + Sbjct: 539 HEKVGRSNTASREKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQR 598 Query: 283 ERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ERD E+DWDRGQ RA+ +E HRSRSRD DYGKRRR+PS+ Sbjct: 599 ERDFNNEDDWDRGQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 367 bits (942), Expect = 1e-98 Identities = 192/357 (53%), Positives = 234/357 (65%), Gaps = 7/357 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQID+ EE+YGG+QKMQY GGGAI ALA++E++ VG+G L Sbjct: 2 DSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQF 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 Q+ EA P N Q K + PE + +A + Q N+PG + E KY ++G FP Q Sbjct: 62 QQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQND 121 Query: 1741 GLAA----NIGP---PPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSG 1583 A N+G P K ETTH + NMG+QGS + P + G D M Sbjct: 122 VQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNM-- 179 Query: 1582 KVPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTM 1403 PG + P LNPG +G +P +QM NINVNR ++NE+Q+RP +ENG TM Sbjct: 180 --PGRVANEPAPVLNPGAAGPQGA-LIPANQM--GVNINVNRAMVNENQIRPPLENGGTM 234 Query: 1402 LFVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGM 1223 LFVGELHWWTTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVEF++ AAAAACK+GM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 1222 NGYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 NG+VFNGR CVVAFA+PQT+KQM A+YMNK Q Q QSQ QGRRPMNDG GRG +Y Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNY 351 Score = 275 bits (702), Expect = 8e-71 Identities = 139/232 (59%), Positives = 158/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYMGRG GYGGFSGP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPH GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGYGEA+ Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEAN 546 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RS+ ASREK+RG ERDWSG++ EKD YRD Sbjct: 547 HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606 Query: 292 HHKERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ++RD Y+++WDRG RA+P+E HRSRSRD DYGKRRR+PS+ Sbjct: 607 RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 367 bits (941), Expect = 2e-98 Identities = 192/357 (53%), Positives = 234/357 (65%), Gaps = 7/357 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQID+ EE+YGG+QKMQY GGGAI ALA++E++ VG+G L Sbjct: 2 DSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQF 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 Q+ EA P N Q K + PE + +A + Q N+PG + E KY ++G FP Q Sbjct: 62 QQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQND 121 Query: 1741 GLAA----NIGP---PPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSG 1583 A N+G P K ETTH + NMG+QGS + P + G D M Sbjct: 122 VQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNM-- 179 Query: 1582 KVPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTM 1403 PG + P LNPG +G +P +QM NINVNR ++NE+Q+RP +ENG TM Sbjct: 180 --PGRVANEPAPVLNPGAAGPQGA-LIPANQM--GVNINVNRAMVNENQIRPPLENGGTM 234 Query: 1402 LFVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGM 1223 LFVGELHWWTTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVEF++ AAAAACK+GM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 1222 NGYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 NG+VFNGR CVVAFA+PQT+KQM A+YMNK Q Q QSQ QGRRPMNDG GRG +Y Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNY 351 Score = 275 bits (703), Expect = 6e-71 Identities = 139/232 (59%), Positives = 158/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYMGRG GYGGFSGP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPH GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGYGEA+ Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEAN 546 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RS+ ASREK+RG ERDWSG++ EKD YRD Sbjct: 547 HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606 Query: 292 HHKERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ++RD Y+++WDRG RA+P+E HRSRSRD DYGKRRR+PS+ Sbjct: 607 RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 364 bits (934), Expect = 1e-97 Identities = 195/348 (56%), Positives = 242/348 (69%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQR 1916 MA+EQID+ +E+YGG+QK+QY G GAISALA++E + V EGFL M R Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 1915 NEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKSGL 1736 +EA LPP N G QA K + E+R +A + QE IPG + + KY S+ +FPEQ+ Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ--- 117 Query: 1735 AANIGPPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAESA 1556 G PP A+ PE G+ GY GS MP VG DS ++GK + ES Sbjct: 118 ----GQPPV------AKEPEL------GSTGY-GSTTMPPNVGGDSSDITGKT--ALES- 157 Query: 1555 TLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELHWW 1376 +PS+N G GV QMP +Q+ S +N NRP+ NE+Q+RP VENG+TMLFVGELHWW Sbjct: 158 -VPSMNSGTAGPTGVTQMPTNQI--SIKVNANRPMFNENQIRPPVENGSTMLFVGELHWW 214 Query: 1375 TTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNGRA 1196 TTD ELESVL+QYG+VKEIKFFDERASGKSKGYCQVEF++PAAA ACKEGM+GY+FNGRA Sbjct: 215 TTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRA 274 Query: 1195 CVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 CVVAFA+PQT+KQM A+Y++K+Q Q QSQ GRRPMN+G GRG G +Y Sbjct: 275 CVVAFASPQTLKQMGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNY 322 Score = 281 bits (719), Expect = 9e-73 Identities = 141/233 (60%), Positives = 158/233 (67%), Gaps = 5/233 (2%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +M+PQ MMG GFDPTYMGRG GYGGF GPAFPGM+ FPAVN MGLAGVAPHVNPAFF Sbjct: 398 MMNPQGMMGAGFDPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGR 457 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 G HAGMW D SMGGWGG+EHGRRTRESSYGG+D ASEYGYGEA+ Sbjct: 458 GMATNGMGMMGSSGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEAN 517 Query: 463 HDKGVRSSGASREKERGQERDWSGSS----XXXXXXXXXXXXXXXXXXXXXXXEKDGYRD 296 H+KG RS+ SRE+ERG ERDWSG+S EKD YRD Sbjct: 518 HEKGGRSNAPSRERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRD 577 Query: 295 YHHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 + +ERD GYE+DWDRGQ +AMPE+ HRSRSRD DYGKRRR+PS+ Sbjct: 578 HRQRERDVGYEDDWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630 >gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 363 bits (931), Expect = 2e-97 Identities = 191/356 (53%), Positives = 241/356 (67%), Gaps = 6/356 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQIDFG+E+YGG QKMQY G GAI ALA++EM+ VGEGFL + Sbjct: 2 DAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+EA L P + G +A + PE R EA Q NIPG + + K+ + R+PE++ Sbjct: 62 QRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEE 121 Query: 1741 GLAAN-----IGPPPTTDAL-HKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGK 1580 A N G P+ ++ K E TH Q N+G+QG + KVG D SG Sbjct: 122 QPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDP---SG- 177 Query: 1579 VPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTML 1400 VP + SLN G G +G P +P +QM + NVN P++NE+Q++P +ENG TML Sbjct: 178 VPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGT----NVNHPVMNENQVQPPIENGPTML 233 Query: 1399 FVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMN 1220 FVGELHWWTTD ELESVL+QYG++KEIKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMN Sbjct: 234 FVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMN 293 Query: 1219 GYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 GY+FNGRACVVAFA+PQT+KQM A+YMNK Q Q+Q+QPQGRRP N+G GRG +Y Sbjct: 294 GYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 227 bits (578), Expect = 2e-56 Identities = 107/146 (73%), Positives = 116/146 (79%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYM RG GYGGF GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPHAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGYG+A+ Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541 Query: 463 HDKGVRSSGASREKERGQERDWSGSS 386 H+KG RSSGASREKER ER+WSG+S Sbjct: 542 HEKG-RSSGASREKERVSEREWSGNS 566 >gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 363 bits (931), Expect = 2e-97 Identities = 191/356 (53%), Positives = 241/356 (67%), Gaps = 6/356 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQIDFG+E+YGG QKMQY G GAI ALA++EM+ VGEGFL + Sbjct: 2 DAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+EA L P + G +A + PE R EA Q NIPG + + K+ + R+PE++ Sbjct: 62 QRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEE 121 Query: 1741 GLAAN-----IGPPPTTDAL-HKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGK 1580 A N G P+ ++ K E TH Q N+G+QG + KVG D SG Sbjct: 122 QPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDP---SG- 177 Query: 1579 VPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTML 1400 VP + SLN G G +G P +P +QM + NVN P++NE+Q++P +ENG TML Sbjct: 178 VPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGT----NVNHPVMNENQVQPPIENGPTML 233 Query: 1399 FVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMN 1220 FVGELHWWTTD ELESVL+QYG++KEIKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMN Sbjct: 234 FVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMN 293 Query: 1219 GYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 GY+FNGRACVVAFA+PQT+KQM A+YMNK Q Q+Q+QPQGRRP N+G GRG +Y Sbjct: 294 GYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 268 bits (686), Expect = 6e-69 Identities = 136/227 (59%), Positives = 152/227 (66%), Gaps = 4/227 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYM RG GYGGF GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPHAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGYG+A+ Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RSSGASREKER ER+WSG+S EKD YR++ Sbjct: 542 HEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREH 600 Query: 292 HHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRR 155 H+ERD Y++DWDRGQ AMPEE HRSRSRD Y + + Sbjct: 601 RHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647 >gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 363 bits (931), Expect = 2e-97 Identities = 191/356 (53%), Positives = 241/356 (67%), Gaps = 6/356 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQIDFG+E+YGG QKMQY G GAI ALA++EM+ VGEGFL + Sbjct: 2 DAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+EA L P + G +A + PE R EA Q NIPG + + K+ + R+PE++ Sbjct: 62 QRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEE 121 Query: 1741 GLAAN-----IGPPPTTDAL-HKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGK 1580 A N G P+ ++ K E TH Q N+G+QG + KVG D SG Sbjct: 122 QPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDP---SG- 177 Query: 1579 VPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTML 1400 VP + SLN G G +G P +P +QM + NVN P++NE+Q++P +ENG TML Sbjct: 178 VPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGT----NVNHPVMNENQVQPPIENGPTML 233 Query: 1399 FVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMN 1220 FVGELHWWTTD ELESVL+QYG++KEIKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMN Sbjct: 234 FVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMN 293 Query: 1219 GYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 GY+FNGRACVVAFA+PQT+KQM A+YMNK Q Q+Q+QPQGRRP N+G GRG +Y Sbjct: 294 GYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 256 bits (655), Expect = 2e-65 Identities = 142/277 (51%), Positives = 158/277 (57%), Gaps = 49/277 (17%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYM RG GYGGF GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPHAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGYG+A+ Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RSSGASREKER ER+WSG+S EKD YR++ Sbjct: 542 HEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREH 600 Query: 292 HH---------------------------------------------KERDRGYENDWDR 248 H +ERD Y++D DR Sbjct: 601 RHREREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDDLDR 660 Query: 247 GQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 GQ AMPEE RSRSRD DYGKRRR+PS+ Sbjct: 661 GQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 697 >gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 363 bits (931), Expect = 2e-97 Identities = 191/356 (53%), Positives = 241/356 (67%), Gaps = 6/356 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 D MA+EQIDFG+E+YGG QKMQY G GAI ALA++EM+ VGEGFL + Sbjct: 2 DAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQL 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKS 1742 QR+EA L P + G +A + PE R EA Q NIPG + + K+ + R+PE++ Sbjct: 62 QRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEE 121 Query: 1741 GLAAN-----IGPPPTTDAL-HKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGK 1580 A N G P+ ++ K E TH Q N+G+QG + KVG D SG Sbjct: 122 QPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDP---SG- 177 Query: 1579 VPGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTML 1400 VP + SLN G G +G P +P +QM + NVN P++NE+Q++P +ENG TML Sbjct: 178 VPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGT----NVNHPVMNENQVQPPIENGPTML 233 Query: 1399 FVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMN 1220 FVGELHWWTTD ELESVL+QYG++KEIKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMN Sbjct: 234 FVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMN 293 Query: 1219 GYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 GY+FNGRACVVAFA+PQT+KQM A+YMNK Q Q+Q+QPQGRRP N+G GRG +Y Sbjct: 294 GYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 285 bits (729), Expect = 6e-74 Identities = 143/232 (61%), Positives = 160/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYM RG GYGGF GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPHAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGYG+A+ Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RSSGASREKER ER+WSG+S EKD YR++ Sbjct: 542 HEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREH 600 Query: 292 HHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 H+ERD Y++DWDRGQ AMPEE HRSRSRD DYGK+RR+PS+ Sbjct: 601 RHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 359 bits (922), Expect = 2e-96 Identities = 194/350 (55%), Positives = 236/350 (67%), Gaps = 2/350 (0%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQR 1916 MADEQID+ +E+YGG+QK+QY G GAI ALAE+EM +GE FL M R Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHR 59 Query: 1915 NEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQ--KS 1742 +EA P S N GFQ N + R E+ Q NIPG A E+KY S+G FPEQ K Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN--DLRVESGGSQGLNIPGVAVESKY-STGTHFPEQNVKG 116 Query: 1741 GLAANIGPPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAE 1562 ++G P + K + E T+ SQA NMG+QGS + P +G D M+ K+ Sbjct: 117 PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISND-- 174 Query: 1561 SATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELH 1382 P+ P G R +PQ+P QM + N++ NR NE+Q+RP +ENG+TML+VGELH Sbjct: 175 ----PTPVPNAGVPRVIPQLPASQM--NMNMDTNRSATNENQIRPPLENGSTMLYVGELH 228 Query: 1381 WWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNG 1202 WWTTD ELE+VL+QYG VKEIKFFDERASGKSKGYCQVEFY+ AAAAACKEGMNG++FNG Sbjct: 229 WWTTDAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNG 288 Query: 1201 RACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 RACVVAFA+ QT+KQM A+YMNK Q Q QSQ QGRRPMNDGAGRG +Y Sbjct: 289 RACVVAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNY 338 Score = 272 bits (696), Expect = 4e-70 Identities = 140/232 (60%), Positives = 160/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 ++ PQ+MM GFDPTYMGRGAGYGGF+GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 414 MLPPQSMMRAGFDPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGR 473 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GP+AGMW+D+SMGGW GEE GRRTRESSYGG+D ASEYGYGE + Sbjct: 474 GMAPNGMGMMGPSGMDGPNAGMWSDTSMGGW-GEEPGRRTRESSYGGDDGASEYGYGEVN 532 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RSS ASREKER ERDWSG+S EK+ YRD+ Sbjct: 533 HEKGARSSAASREKERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDH 592 Query: 292 HHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 +ERD GYE+DWDRGQ RA+PEE +RSRSRDADYGKRRR+PS+ Sbjct: 593 RQRERDSGYEDDWDRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 359 bits (921), Expect = 3e-96 Identities = 190/355 (53%), Positives = 230/355 (64%), Gaps = 7/355 (1%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQR 1916 MA+EQID+ E++YGG+QKMQY GGGAI ALA++E++ VG+G L Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 1915 NEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKSGL 1736 EA P N Q K + PE R + Q NIPG + E KY ++G FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQ 120 Query: 1735 AA----NIGP---PPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKV 1577 A N+G P K ETTH + NMG+QGS + P + G D M Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNM---- 176 Query: 1576 PGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLF 1397 PG A + P LNPG +G +P +QM N NVNR ++NE+Q+RP +ENG TMLF Sbjct: 177 PGRAANEPAPVLNPGAAGPQGA-LIPANQM--GVNANVNRVMVNENQIRPPLENGGTMLF 233 Query: 1396 VGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNG 1217 VGELHWWTTD ELESVL+QYG+ KEIKFFDERASGKSKGYCQVEF++ AAAAACK+GMNG Sbjct: 234 VGELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNG 293 Query: 1216 YVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 +VFNGR CVVAFA+PQT+KQM A+YMNK Q Q QSQ QG RPMNDG GRG T+Y Sbjct: 294 HVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNY 348 Score = 277 bits (709), Expect = 1e-71 Identities = 140/232 (60%), Positives = 158/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYMGRG GYGGFSGP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPH GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGYGEAS Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEAS 543 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RS+ ASREK+RG ERDWSG++ EKD YRD Sbjct: 544 HEKGARSTTASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603 Query: 292 HHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ++RD Y+++WDRGQ A+P+E HRSRSRD DYGKRRR+PS+ Sbjct: 604 RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 358 bits (919), Expect = 6e-96 Identities = 189/355 (53%), Positives = 229/355 (64%), Gaps = 7/355 (1%) Frame = -1 Query: 2095 MADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQR 1916 MA+EQID+ E++YGG+QKMQY GGGAI ALA++E++ VG+G L Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 1915 NEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQKSGL 1736 EA P N Q K + PE R + Q NIPG + E KY ++G FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQ 120 Query: 1735 AA----NIGP---PPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKV 1577 A N+G P K ETTH + NMG+QGS + P + G D M Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNM---- 176 Query: 1576 PGSAESATLPSLNPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLF 1397 PG + P LNPG +G +P +QM N NVNR ++NE+Q+RP +ENG TMLF Sbjct: 177 PGRVANEPAPVLNPGAAGPQGA-LIPANQM--GVNANVNRVMVNENQIRPPLENGGTMLF 233 Query: 1396 VGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNG 1217 VGELHWWTTD ELESVL+QYG+ KEIKFFDERASGKSKGYCQVEF++ AAAAACK+GMNG Sbjct: 234 VGELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNG 293 Query: 1216 YVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 +VFNGR CVVAFA+PQT+KQM A+YMNK Q Q QSQ QG RPMNDG GRG T+Y Sbjct: 294 HVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNY 348 Score = 276 bits (707), Expect = 2e-71 Identities = 139/232 (59%), Positives = 158/232 (68%), Gaps = 4/232 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MHPQ MMG GFDPTYMGRG GYGGFSGP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GPH GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGYGEA+ Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEAN 543 Query: 463 HDKGVRSSGASREKERGQERDWSGSS---XXXXXXXXXXXXXXXXXXXXXXXEKDGYRDY 293 H+KG RS+ ASREK+RG ERDWSG++ EKD YRD Sbjct: 544 HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603 Query: 292 HHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ++RD Y+++WDRGQ A+P+E HRSRSRD DYGKRRR+PS+ Sbjct: 604 RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 321 bits (823), Expect = 8e-85 Identities = 182/343 (53%), Positives = 217/343 (63%), Gaps = 6/343 (1%) Frame = -1 Query: 2062 DYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQRNEASLPPVSAS 1883 DY +KMQY G GAI ALAE+EM VGE FL M +EA PP + Sbjct: 2 DYEEEEKMQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATVG 60 Query: 1882 NMGFQAPKGNFPESRAEATMLQEPNIPGG--ATEAKYGSSGLRFPEQKSGLAA----NIG 1721 N GFQ N ESR E Q I GG A E Y ++ FPEQK A ++G Sbjct: 61 NGGFQTR--NAHESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEAQDVG 118 Query: 1720 PPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAESATLPSL 1541 P + K + E +H Q NMG+Q S +P +G D MS K + E LP Sbjct: 119 PVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRK--NAIEPEPLPIT 176 Query: 1540 NPGPGSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELHWWTTDVE 1361 G RG PQM +QM SA+ VNRP++NE+Q+RP +ENG+T L+VGELHWWTTD E Sbjct: 177 --GSAGPRGAPQMQVNQMHMSAD--VNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232 Query: 1360 LESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNGRACVVAF 1181 LES +Q+G+VKEIKFFDERASGKSKGYCQV+FYE AAAAACKEGMNG+VFNGR CVVAF Sbjct: 233 LESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAF 292 Query: 1180 ATPQTIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 A+PQT+KQM A+YMNKTQ Q Q+Q QGR MNDGAGRG ++ Sbjct: 293 ASPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANF 335 Score = 227 bits (578), Expect = 2e-56 Identities = 121/229 (52%), Positives = 137/229 (59%), Gaps = 1/229 (0%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +M PQ MMG GFDP YMGRG GYGGF+GP FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 409 MMPPQGMMGAGFDPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 468 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GP+ GMW ESSY G++ ASEYGYGE + Sbjct: 469 GMAPNGMGMMVSSGMDGPNPGMW------------------ESSYDGDEGASEYGYGEGN 510 Query: 463 HDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGYRDYHHK 284 H+KG RSSGASREKERG ERDWSG+S EKD YR + + Sbjct: 511 HEKGARSSGASREKERGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQR 570 Query: 283 ERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ERD GYE+D DRG RA PEE +RSR+RD DYGKRRR+PS+ Sbjct: 571 ERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 619 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 311 bits (796), Expect = 1e-81 Identities = 182/354 (51%), Positives = 219/354 (61%), Gaps = 4/354 (1%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXV-GEGFLP 1925 +PM EQ DFGEE+YGG QKMQY+ GGAI ALA++EMI GE F+ Sbjct: 2 EPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFMQ 61 Query: 1924 MQRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPGGATEAKYGSSGLRFPEQK 1745 +QR ++ +PP A N + G+ E+ +E N A +G L+FPEQK Sbjct: 62 VQRPDSQIPPFKAENRVNPSGTGD------ESIPSEEANASKYAGNRAFGPGALQFPEQK 115 Query: 1744 SGLAANIGPPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSA 1565 +GL T D +T +SQ GYQGSVA P D + K G Sbjct: 116 AGLNTTEETSVTVDR------SQTVRNSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDP 168 Query: 1564 ESATLPSLNPGPG-SSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGE 1388 S +NP G S+G +P + M +AN N RP+ +E + ENGNTML+VGE Sbjct: 169 SS-----INPNVGVGSKGA--VPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGE 221 Query: 1387 LHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVF 1208 LHWWTTD E+ESVL QYGKVKEIKFFDERASGKSKGYCQVEF++PAAA ACKEGMNGYVF Sbjct: 222 LHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVF 281 Query: 1207 NGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRR-PMND-GAGRGNGTSY 1052 NGRACVVAFATPQTIKQM A+YMN+ Q Q Q+Q GR MND GAGRG GT++ Sbjct: 282 NGRACVVAFATPQTIKQMGASYMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNF 335 Score = 140 bits (353), Expect = 2e-30 Identities = 69/109 (63%), Positives = 79/109 (72%), Gaps = 2/109 (1%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGG-FSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFX 647 +MHPQ MMGPGFD +MGRGAGYGG F+GPAFPGM+PPFPAVN +GL GVAPHVNPAFF Sbjct: 401 MMHPQGMMGPGFDLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFG 460 Query: 646 XXXXXXXXXXXXXXXXXGPHAGMWTDSSM-GGWGGEEHGRRTRESSYGG 503 GP++G+W D+S+ GGWGGEE GR ESSYGG Sbjct: 461 RGMAPNGMGMMGPSGMGGPYSGLWNDASVGGGWGGEEQGRGP-ESSYGG 508 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 285 bits (730), Expect = 5e-74 Identities = 170/339 (50%), Positives = 205/339 (60%), Gaps = 2/339 (0%) Frame = -1 Query: 2062 DYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQRNEASLPPVSAS 1883 D+ +KMQY G GAI ALAE+E+ VGE FL M +EA PP +A Sbjct: 2 DFEEEEKMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATAG 60 Query: 1882 NMGFQAPKGNFPESRAEA--TMLQEPNIPGGATEAKYGSSGLRFPEQKSGLAANIGPPPT 1709 N GFQ N ESR E + + + G A E KY ++G FPEQK A IG Sbjct: 61 NGGFQTR--NAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQ---AGIG---- 111 Query: 1708 TDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAESATLPSLNPGP 1529 ++ G++GY ++ QK GSA GP Sbjct: 112 ------------VEANDVGSIGYGDGSSVAQK-------------GSA----------GP 136 Query: 1528 GSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELHWWTTDVELESV 1349 RGVPQM +QM + N +VNRP++NE+Q+RP +ENG T L+VGELHWWTTD ELESV Sbjct: 137 ---RGVPQMQVNQM--NMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 1348 LTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNGRACVVAFATPQ 1169 +QYG+VKEIKFFDERASGKSKGYCQV+FYE AAAAACKEGMN +VFNGR CVVAFA+ Q Sbjct: 192 ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1168 TIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 T+KQM A+YM+KTQ Q Q Q QGR MNDG GRG +Y Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANY 290 Score = 253 bits (646), Expect = 3e-64 Identities = 133/229 (58%), Positives = 147/229 (64%), Gaps = 1/229 (0%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MH Q MMG GFDP YMGRG GYGGF G FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 364 MMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 423 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GP+ G W D+SMGGW GEE GRRTRESSY G++ ASEYGYGE + Sbjct: 424 GMAPNGMGMMASSGMEGPNPGKWPDTSMGGW-GEEPGRRTRESSYDGDEGASEYGYGEGN 482 Query: 463 HDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGYRDYHHK 284 H+KG RSSGASREKER ERDWSG+S EKD YR + + Sbjct: 483 HEKGARSSGASREKERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQR 542 Query: 283 ERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ERD GYE+D DRG RA PEE +RSRSRD DYGKRRR PS+ Sbjct: 543 ERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 285 bits (730), Expect = 5e-74 Identities = 170/339 (50%), Positives = 205/339 (60%), Gaps = 2/339 (0%) Frame = -1 Query: 2062 DYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPMQRNEASLPPVSAS 1883 D+ +KMQY G GAI ALAE+E+ VGE FL M +EA PP +A Sbjct: 2 DFEEEEKMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATAG 60 Query: 1882 NMGFQAPKGNFPESRAEA--TMLQEPNIPGGATEAKYGSSGLRFPEQKSGLAANIGPPPT 1709 N GFQ N ESR E + + + G A E KY ++G FPEQK A IG Sbjct: 61 NGGFQTR--NAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQ---AGIG---- 111 Query: 1708 TDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSLAMSGKVPGSAESATLPSLNPGP 1529 ++ G++GY ++ QK GSA GP Sbjct: 112 ------------VEANDVGSIGYGDGSSVAQK-------------GSA----------GP 136 Query: 1528 GSSRGVPQMPGDQMTSSANINVNRPIINESQMRPAVENGNTMLFVGELHWWTTDVELESV 1349 RGVPQM +QM + N +VNRP++NE+Q+RP +ENG T L+VGELHWWTTD ELESV Sbjct: 137 ---RGVPQMQVNQM--NMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 1348 LTQYGKVKEIKFFDERASGKSKGYCQVEFYEPAAAAACKEGMNGYVFNGRACVVAFATPQ 1169 +QYG+VKEIKFFDERASGKSKGYCQV+FYE AAAAACKEGMN +VFNGR CVVAFA+ Q Sbjct: 192 ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1168 TIKQMAANYMNKTQVQAQSQPQGRRPMNDGAGRGNGTSY 1052 T+KQM A+YM+KTQ Q Q Q QGR MNDG GRG +Y Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANY 290 Score = 214 bits (545), Expect = 1e-52 Identities = 120/229 (52%), Positives = 133/229 (58%), Gaps = 1/229 (0%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTYMGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAFFXX 644 +MH Q MMG GFDP YMGRG GYGGF G FPGM+P FPAVN MGLAGVAPHVNPAFF Sbjct: 364 MMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 423 Query: 643 XXXXXXXXXXXXXXXXGPHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGYGEAS 464 GM S M G +ESSY G++ ASEYGYGE + Sbjct: 424 GMAPNG-------------MGMMASSGMEG------PNPGKESSYDGDEGASEYGYGEGN 464 Query: 463 HDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGYRDYHHK 284 H+KG RSSGASREKER ERDWSG+S EKD YR + + Sbjct: 465 HEKGARSSGASREKERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQR 524 Query: 283 ERDRGYENDWDRG-QPXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 ERD GYE+D DRG RA PEE +RSRSRD DYGKRRR PS+ Sbjct: 525 ERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 573 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 272 bits (695), Expect = 5e-70 Identities = 170/374 (45%), Positives = 216/374 (57%), Gaps = 24/374 (6%) Frame = -1 Query: 2101 DPMADEQIDFGEEDYGGSQKMQYHGGGAISALAEDEMINXXXXXXXXXXXXXVGEGFLPM 1922 DPMA+EQ+D+ +EDYG +QKM + GGAISALA++E++ VG+GF+ Sbjct: 2 DPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQS 61 Query: 1921 QRNEASLPPVSASNMGFQAPKGNFPESRAEATMLQEPNIPG------GATEAKY-GSSGL 1763 +++ + S N G QAPK E NIPG G +AK G S L Sbjct: 62 LQHQEPVQYESMGN-GVQAPK-------EEPISTPPVNIPGVGHEEKGEKDAKLSGFSDL 113 Query: 1762 ----RFPEQKSGLAANIGPPPTTDALHKAQAPETTHSSQAGNMGYQGSVAMPQKVGPDSL 1595 F EQ S A + K + E Q G++ + A P K + Sbjct: 114 DQKKAFQEQASNQLAG------ASSGLKIRVSEPVSEPQPQASGFRNAPAPPAKGSGFNT 167 Query: 1594 AMS---GKVPGSAESATLPSLNPGPGSSRGV-PQMPGDQMT------SSANINVNRPIIN 1445 A + K S +P + PGPG G P ++M + A I+ + + Sbjct: 168 AGAMDANKQLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVIDTSARFGS 227 Query: 1444 ESQMRPAV---ENGNTMLFVGELHWWTTDVELESVLTQYGKVKEIKFFDERASGKSKGYC 1274 E+ R + E+GNTMLFVGEL WWTTD ELESVL+QYG+VK++KFFDERASGKSKGYC Sbjct: 228 ENSNRLSHGGGESGNTMLFVGELQWWTTDAELESVLSQYGRVKDLKFFDERASGKSKGYC 287 Query: 1273 QVEFYEPAAAAACKEGMNGYVFNGRACVVAFATPQTIKQMAANYMNKTQVQAQSQPQGRR 1094 QVEFY+PAAAAACKE MNG+VFNGRACVVAFA+ T+KQ+ NY+NKTQ QAQ+Q QGRR Sbjct: 288 QVEFYDPAAAAACKESMNGHVFNGRACVVAFASQHTLKQLTTNYLNKTQAQAQAQSQGRR 347 Query: 1093 PMNDGAGRGNGTSY 1052 PMNDG GR G SY Sbjct: 348 PMNDGGGRAGGPSY 361 Score = 226 bits (576), Expect = 3e-56 Identities = 123/235 (52%), Positives = 141/235 (60%), Gaps = 7/235 (2%) Frame = -1 Query: 823 LMHPQAMMGPGFDPTY---MGRGAGYGGFSGPAFPGMIPPFPAVNPMGLAGVAPHVNPAF 653 L+HPQ MMG GFDPTY +GRG+GYGGFSGP FPGM+P F + +GL GVAPHVNPAF Sbjct: 429 LLHPQGMMGSGFDPTYGAHLGRGSGYGGFSGPHFPGMLPSFSPMGTVGLPGVAPHVNPAF 488 Query: 652 FXXXXXXXXXXXXXXXXXXGPHAGMWTDSSMG---GWGGEEHGRRTRESSYGGEDNASEY 482 F G H GMW DSSMG GWG EEHGRRTRESSY G+D AS+Y Sbjct: 489 FGRGVSANGMGMMGSGAMDGHHGGMWGDSSMGGGVGWGNEEHGRRTRESSY-GDDGASDY 547 Query: 481 GYGEASHDKGVRSSGASREKERGQERDWSGSSXXXXXXXXXXXXXXXXXXXXXXXEKDGY 302 GYG+ H++G S REK+RG ERDWS EKDGY Sbjct: 548 GYGDGGHERGGGRSNPGREKDRGSERDWSSG---PERRHRDDRDSDWDRDPRYKDEKDGY 604 Query: 301 RDYHHKERDRGYENDWDRGQ-PXXXXXXXRAMPEEVHRSRSRDADYGKRRRVPSD 140 D+ +ERD E+DWDRG+ R M EE RSRS+D DYGKRRRVPS+ Sbjct: 605 SDHRQRERDWDNEDDWDRGRTSSRSRSKSRMMQEEDQRSRSKDVDYGKRRRVPSE 659