BLASTX nr result
ID: Catharanthus22_contig00001979
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00001979 (1550 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...  350  7e-94
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...  328  3e-87
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...  318  4e-84
gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma c...  317  7e-84
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...  317  7e-84
gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus pe...  315  3e-83
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...  314  6e-83
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...  314  7e-83
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...  312  2e-82
gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma c...  309  2e-81
gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Th...  309  2e-81
gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma c...  309  2e-81
gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma c...  309  2e-81
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...  308  4e-81
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...  308  5e-81
ref|XP_002312652.1| RNA recognition motif-containing family prot...  292  3e-76
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...  251  5e-64
ref|XP_002315647.1| RNA recognition motif-containing family prot...
251 5e-64 gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise... 247 9e-63 emb|CBI16834.3| unnamed protein product [Vitis vinifera] 238 5e-60 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 350 bits (899), Expect = 7e-94 Identities = 183/326 (56%), Positives = 216/326 (66%), Gaps = 2/326 (0%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 G I ALAEDEM+ GEGFLQ+QR++ + N FQ K SFP Sbjct: 28 GTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQLQRSEVPVPSVDAGNGNFQAQKDSFPA 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQ--APEMT 354 SRA EA +PG TE KY + ++FP+QK D +QKA+ A MT Sbjct: 88 SRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQKGEPVVERETERPADAAQKARPSAITMT 147 Query: 355 HGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQ 534 SQAGN YQGS+ MPQK+G D + +P S P +N R +P MP NQ Sbjct: 148 LNSQAGNSGYQGSMPMPQKIGADPM----AMPEKNASEATPLMNSVVPGPRVVPHMPTNQ 203 Query: 535 TTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFF 714 SS N+N+N P+I+E RP++ENGN MLFVGELHWWTTDAELESVLTQYG VKEIKFF Sbjct: 204 LNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHWWTTDAELESVLTQYGNVKEIKFF 263 Query: 715 DERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKT 894 DERASGKSKGYCQVEF++P +AAACKEGMNGY+FNGRACVVAFATPQ+IKQM ++Y NKT Sbjct: 264 DERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGRACVVAFATPQTIKQMGSSYANKT 323 Query: 895 QVQAQSQPQGRIPMSDAAGRGNGTNY 972 Q Q QSQPQGR PM++ GRG G NY Sbjct: 324 QNQVQSQPQGRRPMNEGVGRG-GPNY 348 Score = 163 bits (413), Expect = 2e-37 Identities = 76/116 (65%), Positives = 82/116 (70%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 +MHPQ MMGPGFDP++M M+PPF AVNPMGL GVAPHVNPAFFGR Sbjct: 420 LMHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGR 479 Query: 1381 
GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N H GMWTD+S GGWGGEEHGRRTRESSYGGEDNASEYGY Sbjct: 480 GMAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGY 535 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 328 bits (842), Expect = 3e-87 Identities = 185/331 (55%), Positives = 218/331 (65%), Gaps = 8/331 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMG-FQPPKTSFP 177 GAISALA+DE++ GEGFLQM R++A P+G G FQ KT P Sbjct: 25 GAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHRSEAP-APSGVMAGGPFQAHKTDVP 83 Query: 178 ESRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSG----LAPNVGPPPTTD---VSQKA 336 + EA +PG E KY S F E+K G P +G D VSQK Sbjct: 84 PQKLEAGTSQGLIIPGVSIEGKY--SNPHFHEKKEGPMAVKGPEMGSTSHLDGPSVSQKG 141 Query: 337 QAPEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIP 516 + EMTH +Q N+ +QGS +PQK G + D K+ + + P LN G G R +P Sbjct: 142 RVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIA----NESTPVLNSGTGGPRAVP 197 Query: 517 QMPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKV 696 QM NQ N+NVNRP++NENQIRPAV+NG MLFVGELHWWTTDAELESVL+QYG+V Sbjct: 198 QMLSNQM--GMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVLSQYGRV 255 Query: 697 KEIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAA 876 KEIKFFDERASGKSKGYCQVEFY+ +AAAACKEGMNGY FNGRACVVAFA+PQ++KQM A Sbjct: 256 KEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRACVVAFASPQTLKQMGA 315 Query: 877 NYMNKTQVQAQSQPQGRIPMSDAAGRGNGTN 969 +YMNKT QAQSQ QGR PM+D GRG G N Sbjct: 316 SYMNKT--QAQSQSQGRRPMNDGVGRGGGMN 344 Score = 163 bits (413), Expect = 2e-37 Identities = 76/116 (65%), Positives = 82/116 (70%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 +MHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 419 LMHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGR 478 Query: 1381 
GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N HAGMWTD+SMGGWGGEEHGRRTRESSYGG+D AS+YGY Sbjct: 479 GMAANGMGMMGATGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGY 534 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 318 bits (815), Expect = 4e-84 Identities = 174/332 (52%), Positives = 218/332 (65%), Gaps = 8/332 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQA-SLTPTGGSNMGFQPPKTSFP 177 GAISALA++E++ GEGFLQ+QR++A SL G G Q K +FP Sbjct: 26 GAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQRSEAPSLPAAAGVGNGLQAQKRNFP 85 Query: 178 ESRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQAPEMTH 357 E R E + ++PG E ++ S+G +FP Q+ GL V +K++A M + Sbjct: 86 EPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQDGLK----------VDKKSEAGSMVY 135 Query: 358 -----GSQAGNME--YQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIP 516 GSQ G + +QGS M VG DS D +PG + + + N G RGI Sbjct: 136 PDGASGSQKGRIVAGFQGSKPMLHSVGVDSSD----IPGKMVNEPIQAPNSGGAGPRGIL 191 Query: 517 QMPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKV 696 M GNQTT N NV+ PI+NENQIRP++ENG+ MLFVGELHWWTTDAELESVL+QYG+V Sbjct: 192 PMQGNQTT--VNANVSHPIVNENQIRPSIENGSTMLFVGELHWWTTDAELESVLSQYGRV 249 Query: 697 KEIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAA 876 KEIKFFDERASGKSKGYCQVE+Y+ AA ACKEGM+G+ FNGRACVVAFA+PQ++KQM A Sbjct: 250 KEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNGRACVVAFASPQTLKQMGA 309 Query: 877 NYMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 YM+K QVQ QSQPQGR P++D GRG N+ Sbjct: 310 AYMSKNQVQNQSQPQGRRPINDGVGRGGNPNF 341 Score = 148 bits (374), Expect = 5e-33 Identities = 70/116 (60%), Positives = 75/116 (64%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MM+PQ MMG GFDPTYM M+P FPAVN MG A VAPHVNPAFFGR Sbjct: 413 MMNPQGMMGTGFDPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGR 472 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GM +N GMW D S+GGWGGEEHGRRTRESSYGG+D ASEYGY Sbjct: 473 
GMTNNGMGMVGSSLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGY 528 >gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 317 bits (813), Expect = 7e-84 Identities = 171/330 (51%), Positives = 212/330 (64%), Gaps = 6/330 (1%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++EM+ GEGFLQ+QR++A P G + G Q K PE Sbjct: 28 GAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAPPQPGGMGSTGLQAQKNEAPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLA---PNVGP---PPTTDVSQKAQA 342 R EA ++PG + K+++ R+PEQ A P +G P T +SQK + Sbjct: 88 PRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQDGQPAVSRPEMGSGSYPSGTSISQKGRV 147 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQM 522 E T +Q NM +QG KVG D +K+ + SLN G G +G P + Sbjct: 148 MEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQKIA----NVPAQSLNSGTGGPQGAPHV 203 Query: 523 PGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKE 702 P NQ +NVN P+I+ENQ+RP +ENG MLFVGELHWWTTDAELESVL+QYG+VKE Sbjct: 204 PPNQM----GLNVNHPMISENQVRPPIENGPTMLFVGELHWWTTDAELESVLSQYGRVKE 259 Query: 703 IKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANY 882 IKFFDERASGKSKGYCQVEFY+P +AAACKEGM+GY FNGRACVVAFA+PQ++KQM A+Y Sbjct: 260 IKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNGRACVVAFASPQTLKQMGASY 319 Query: 883 MNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 MNK Q Q+Q+QPQGR P +D GRG NY Sbjct: 320 MNKNQGQSQAQPQGRRP-NDGLGRGGNMNY 348 Score = 162 bits (409), Expect = 5e-37 Identities = 76/116 (65%), Positives = 80/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN +GLAGVAPHVNPAFFGR Sbjct: 423 MMHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGR 482 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N H GMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGY Sbjct: 483 GMAPNGMGMMGGPGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGY 538 >ref|XP_004310003.1| PREDICTED: 
uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 317 bits (813), Expect = 7e-84 Identities = 170/324 (52%), Positives = 208/324 (64%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++E + GEGFLQM R + L P G N G Q K + PE Sbjct: 28 GAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQMHRPEPPLPPAGVGNGGLQAQKNNVPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQAPEMTHG 360 R + E PG E KY S PEQK +V P SQK + EMTH Sbjct: 88 QRVQGGASQEVKNPGFSVEGKYSS----VPEQKDQPPVSVVPEMA---SQKGRVMEMTHD 140 Query: 361 SQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQTT 540 +Q NM +QG+ M V DS D K+ + +PS+N G+ + QMP NQ Sbjct: 141 AQVRNMGFQGAATMQSNVVADSSDLTGKIA----NGPIPSMNSGSNGPPAVQQMPANQM- 195 Query: 541 SSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFFDE 720 + INVNRP++NENQIRP VENG+A LFVGELHWWTTDAELE VL+Q+G++KEIKFFDE Sbjct: 196 -NMKINVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEGVLSQFGRIKEIKFFDE 254 Query: 721 RASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKTQV 900 RASGKSKGYCQV+FY+P AA+ACKEGM+GY FNGRACVVAFA+ Q++KQM +Y+NK+Q Sbjct: 255 RASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAFASSQTLKQMGDSYVNKSQG 314 Query: 901 QAQSQPQGRIPMSDAAGRGNGTNY 972 Q Q+QPQGR PM+D AGRG N+ Sbjct: 315 QVQTQPQGRRPMNDGAGRGGNMNF 338 Score = 137 bits (346), Expect = 1e-29 Identities = 67/115 (58%), Positives = 71/115 (61%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MM+ MMGPGFDPTYM M+P FP VN MGLAGVAPHVNPAFFGR Sbjct: 414 MMNAPGMMGPGFDPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGR 473 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYG 1545 GMA+N HA MW D SM GW GEE RRTRESSYGG+D SEYG Sbjct: 474 GMATNGMGMMGSSGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYG 528 >gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 315 bits (808), Expect = 3e-83 Identities = 172/324 (53%), Positives = 211/324 (65%) Frame = +1 Query: 1 
GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAISALA++E + EGFLQM R++A L P G N G Q KT E Sbjct: 25 GAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHRSEAPLPPGGVGNGGLQAQKTDVTE 84 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQAPEMTHG 360 +R +A + E+ +PG + KY S+ +FPEQ+ G PP A+ PE+ Sbjct: 85 TRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ-------GQPPV------AKEPEL--- 128 Query: 361 SQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQTT 540 G+ Y GS MP VG DS D + G ++PS+N G G+ QMP NQ Sbjct: 129 ---GSTGY-GSTTMPPNVGGDSSD----ITGKTALESVPSMNSGTAGPTGVTQMPTNQI- 179 Query: 541 SSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFFDE 720 S +N NRP+ NENQIRP VENG+ MLFVGELHWWTTDAELESVL+QYG+VKEIKFFDE Sbjct: 180 -SIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDE 238 Query: 721 RASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKTQV 900 RASGKSKGYCQVEF++P AA ACKEGM+GY FNGRACVVAFA+PQ++KQM A+Y++K+Q Sbjct: 239 RASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAFASPQTLKQMGASYLSKSQG 298 Query: 901 QAQSQPQGRIPMSDAAGRGNGTNY 972 Q QSQ GR PM++ GRG G NY Sbjct: 299 QTQSQQPGRRPMNEGVGRGGGVNY 322 Score = 155 bits (393), Expect = 3e-35 Identities = 74/116 (63%), Positives = 79/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MM+PQ MMG GFDPTYM M+ FPAVN MGLAGVAPHVNPAFFGR Sbjct: 398 MMNPQGMMGAGFDPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGR 457 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N HAGMW D SMGGWGG+EHGRRTRESSYGG+D ASEYGY Sbjct: 458 GMATNGMGMMGSSGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGY 513 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 314 bits (805), Expect = 6e-83 Identities = 170/331 (51%), Positives = 207/331 (62%), Gaps = 7/331 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++E++ G+G LQ Q+ +A G N Q KT PE Sbjct: 28 
GAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAPPPSAGVGNGRLQVKKTDVPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLA----PNVGP---PPTTDVSQKAQ 339 + +A + ++VPG E KY ++G FP Q PN+G P VSQK Sbjct: 88 QQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGS 147 Query: 340 APEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQ 519 E TH + NM +QGS P + G D + +PG + P LNPGA +G Sbjct: 148 VQETTHDAHVRNMGFQGSTSGPSRTGVDPSN----MPGRVANEPAPVLNPGAAGPQGA-L 202 Query: 520 MPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVK 699 +P NQ NINVNR ++NENQIRP +ENG MLFVGELHWWTTDAELESVL+QYG+VK Sbjct: 203 IPANQM--GVNINVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRVK 260 Query: 700 EIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAAN 879 EIKFFDERASGKSKGYCQVEF++ AAAACK+GMNG+ FNGR CVVAFA+PQ++KQM A+ Sbjct: 261 EIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGAS 320 Query: 880 YMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 YMNK Q Q QSQ QGR PM+D GRG NY Sbjct: 321 YMNKNQGQPQSQTQGRRPMNDGGGRGGNMNY 351 Score = 152 bits (384), Expect = 4e-34 Identities = 75/116 (64%), Positives = 79/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N H GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGY Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGY 542 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 314 bits (804), Expect = 7e-83 Identities = 170/331 (51%), Positives = 207/331 (62%), Gaps = 7/331 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++E++ G+G LQ Q+ +A G N Q KT PE Sbjct: 28 GAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAPPPSAGVGNGRLQVKKTDVPE 87 Query: 181 
SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLA----PNVGP---PPTTDVSQKAQ 339 + +A + ++VPG E KY ++G FP Q PN+G P VSQK Sbjct: 88 QQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGS 147 Query: 340 APEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQ 519 E TH + NM +QGS P + G D + +PG + P LNPGA +G Sbjct: 148 VQETTHDAHVRNMGFQGSTSGPPRTGVDPSN----MPGRVANEPAPVLNPGAAGPQGA-L 202 Query: 520 MPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVK 699 +P NQ NINVNR ++NENQIRP +ENG MLFVGELHWWTTDAELESVL+QYG+VK Sbjct: 203 IPANQM--GVNINVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRVK 260 Query: 700 EIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAAN 879 EIKFFDERASGKSKGYCQVEF++ AAAACK+GMNG+ FNGR CVVAFA+PQ++KQM A+ Sbjct: 261 EIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGAS 320 Query: 880 YMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 YMNK Q Q QSQ QGR PM+D GRG NY Sbjct: 321 YMNKNQGQPQSQTQGRRPMNDGGGRGGNMNY 351 Score = 152 bits (385), Expect = 3e-34 Identities = 75/116 (64%), Positives = 79/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 428 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 486 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N H GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGY Sbjct: 487 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGY 542 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 312 bits (800), Expect = 2e-82 Identities = 174/326 (53%), Positives = 212/326 (65%), Gaps = 2/326 (0%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALAE+EM GE FLQM R++A P N GFQP ++ + Sbjct: 25 GAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHRSEAPPAPPSVGNGGFQPRNSN--D 81 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQ--KSGLAPNVGPPPTTDVSQKAQAPEMT 354 R E+ ++PG E+KY S+G FPEQ K +VG P + ++QK + EMT 
Sbjct: 82 LRVESGGSQGLNIPGVAVESKY-STGTHFPEQNVKGPEIGSVGYPDGSSIAQKTRVMEMT 140 Query: 355 HGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQ 534 + SQA NM +QGS P +G D D K+ P+ P AG R IPQ+P +Q Sbjct: 141 NDSQARNMGFQGSTSGPSNIGVDPSDMNNKISND------PTPVPNAGVPRVIPQLPASQ 194 Query: 535 TTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFF 714 + N++ NR NENQIRP +ENG+ ML+VGELHWWTTDAELE+VL+QYG VKEIKFF Sbjct: 195 M--NMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENVLSQYGMVKEIKFF 252 Query: 715 DERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKT 894 DERASGKSKGYCQVEFY+ AAAACKEGMNG+ FNGRACVVAFA+ Q++KQM A+YMNK Sbjct: 253 DERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAFASQQTLKQMGASYMNKN 312 Query: 895 QVQAQSQPQGRIPMSDAAGRGNGTNY 972 Q Q QSQ QGR PM+D AGRG NY Sbjct: 313 QGQPQSQNQGRRPMNDGAGRGGNMNY 338 Score = 142 bits (357), Expect = 5e-31 Identities = 71/116 (61%), Positives = 78/116 (67%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 M+ PQ+MM GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 414 MLPPQSMMRAGFDPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGR 473 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N +AGMW+D+SMGGW GEE GRRTRESSYGG+D ASEYGY Sbjct: 474 GMAPNGMGMMGPSGMDGPNAGMWSDTSMGGW-GEEPGRRTRESSYGGDDGASEYGY 528 >gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 309 bits (791), Expect = 2e-81 Identities = 163/330 (49%), Positives = 209/330 (63%), Gaps = 6/330 (1%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++EM+ GEGFLQ+QR++A L P G + G + + PE Sbjct: 28 GAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAPLQPGGLGSTGLKAQRNEAPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGP------PPTTDVSQKAQA 342 R EA ++PG + K+ + R+PE++ A N P + +SQK Sbjct: 88 PRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEEQPAVNRPEMVSGSYPSGSSISQKGSV 147 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQM 522 E TH Q N+ +QG KVG D +K+ SLN G G +G P + Sbjct: 
148 TEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQ----SLNSGTGGPQGPPHV 203 Query: 523 PGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKE 702 P NQ + NVN P++NENQ++P +ENG MLFVGELHWWTTDAELESVL+QYG++KE Sbjct: 204 PPNQMGT----NVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKE 259 Query: 703 IKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANY 882 IKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMNGY FNGRACVVAFA+PQ++KQM A+Y Sbjct: 260 IKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGASY 319 Query: 883 MNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 MNK Q Q+Q+QPQGR P ++ GRG NY Sbjct: 320 MNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 164 bits (416), Expect = 7e-38 Identities = 78/116 (67%), Positives = 81/116 (69%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N HAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGY Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGY 537 >gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 309 bits (791), Expect = 2e-81 Identities = 163/330 (49%), Positives = 209/330 (63%), Gaps = 6/330 (1%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++EM+ GEGFLQ+QR++A L P G + G + + PE Sbjct: 28 GAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAPLQPGGLGSTGLKAQRNEAPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGP------PPTTDVSQKAQA 342 R EA ++PG + K+ + R+PE++ A N P + +SQK Sbjct: 88 PRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEEQPAVNRPEMVSGSYPSGSSISQKGSV 147 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQM 522 E TH Q N+ +QG KVG D +K+ SLN G G +G P + Sbjct: 148 TEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQ----SLNSGTGGPQGPPHV 203 Query: 523 PGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKE 702 P NQ + NVN P++NENQ++P +ENG 
MLFVGELHWWTTDAELESVL+QYG++KE Sbjct: 204 PPNQMGT----NVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKE 259 Query: 703 IKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANY 882 IKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMNGY FNGRACVVAFA+PQ++KQM A+Y Sbjct: 260 IKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGASY 319 Query: 883 MNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 MNK Q Q+Q+QPQGR P ++ GRG NY Sbjct: 320 MNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 164 bits (416), Expect = 7e-38 Identities = 78/116 (67%), Positives = 81/116 (69%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N HAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGY Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGY 537 >gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 309 bits (791), Expect = 2e-81 Identities = 163/330 (49%), Positives = 209/330 (63%), Gaps = 6/330 (1%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++EM+ GEGFLQ+QR++A L P G + G + + PE Sbjct: 28 GAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAPLQPGGLGSTGLKAQRNEAPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGP------PPTTDVSQKAQA 342 R EA ++PG + K+ + R+PE++ A N P + +SQK Sbjct: 88 PRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEEQPAVNRPEMVSGSYPSGSSISQKGSV 147 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQM 522 E TH Q N+ +QG KVG D +K+ SLN G G +G P + Sbjct: 148 TEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQ----SLNSGTGGPQGPPHV 203 Query: 523 PGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKE 702 P NQ + NVN P++NENQ++P +ENG MLFVGELHWWTTDAELESVL+QYG++KE Sbjct: 204 PPNQMGT----NVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKE 259 Query: 703 IKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANY 882 
IKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMNGY FNGRACVVAFA+PQ++KQM A+Y Sbjct: 260 IKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGASY 319 Query: 883 MNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 MNK Q Q+Q+QPQGR P ++ GRG NY Sbjct: 320 MNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 164 bits (416), Expect = 7e-38 Identities = 78/116 (67%), Positives = 81/116 (69%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N HAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGY Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGY 537 >gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 309 bits (791), Expect = 2e-81 Identities = 163/330 (49%), Positives = 209/330 (63%), Gaps = 6/330 (1%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++EM+ GEGFLQ+QR++A L P G + G + + PE Sbjct: 28 GAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAPLQPGGLGSTGLKAQRNEAPE 87 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGP------PPTTDVSQKAQA 342 R EA ++PG + K+ + R+PE++ A N P + +SQK Sbjct: 88 PRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKEEQPAVNRPEMVSGSYPSGSSISQKGSV 147 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQM 522 E TH Q N+ +QG KVG D +K+ SLN G G +G P + Sbjct: 148 TEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQ----SLNSGTGGPQGPPHV 203 Query: 523 PGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKE 702 P NQ + NVN P++NENQ++P +ENG MLFVGELHWWTTDAELESVL+QYG++KE Sbjct: 204 PPNQMGT----NVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKE 259 Query: 703 IKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANY 882 
IKFFDE+ASGKSKGYCQVEFY+P++AA CKEGMNGY FNGRACVVAFA+PQ++KQM A+Y Sbjct: 260 IKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGASY 319 Query: 883 MNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 MNK Q Q+Q+QPQGR P ++ GRG NY Sbjct: 320 MNKNQGQSQAQPQGRRP-NEGLGRGGNLNY 348 Score = 164 bits (416), Expect = 7e-38 Identities = 78/116 (67%), Positives = 81/116 (69%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 422 MMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGR 481 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N HAGMWTD+SMGGWGG+EHGRRTRESSYGGED ASEYGY Sbjct: 482 GMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGY 537 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 308 bits (789), Expect = 4e-81 Identities = 167/331 (50%), Positives = 203/331 (61%), Gaps = 7/331 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++E++ G+G LQ Q+ +A G N Q KT PE Sbjct: 25 GAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQPEAPPPSAGVGNGRLQVKKTDVPE 84 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLA----PNVGP---PPTTDVSQKAQ 339 R + +++PG E KY ++G FP Q PN+G P VSQK Sbjct: 85 QRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGS 144 Query: 340 APEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQ 519 E TH + NM +QGS P + G D + +PG + P LNPGA +G Sbjct: 145 VQETTHDAHVRNMGFQGSTSGPSRTGVDPSN----MPGRAANEPAPVLNPGAAGPQGA-L 199 Query: 520 MPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVK 699 +P NQ N NVNR ++NENQIRP +ENG MLFVGELHWWTTDAELESVL+QYG+ K Sbjct: 200 
IPANQM--GVNANVNRVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRAK 257 Query: 700 EIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAAN 879 EIKFFDERASGKSKGYCQVEF++ AAAACK+GMNG+ FNGR CVVAFA+PQ++KQM A+ Sbjct: 258 EIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGAS 317 Query: 880 YMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 YMNK Q Q QSQ QG PM+D GRG TNY Sbjct: 318 YMNKNQGQPQSQNQGSRPMNDGGGRGGNTNY 348 Score = 152 bits (385), Expect = 3e-34 Identities = 75/116 (64%), Positives = 79/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N H GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGY Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGY 539 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 308 bits (788), Expect = 5e-81 Identities = 167/331 (50%), Positives = 203/331 (61%), Gaps = 7/331 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALA++E++ G+G LQ Q+ +A G N Q KT PE Sbjct: 25 GAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAPPPSAGVGNGRLQVKKTDVPE 84 Query: 181 SRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLA----PNVGP---PPTTDVSQKAQ 339 R + +++PG E KY ++G FP Q PN+G P VSQK Sbjct: 85 QRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGS 144 Query: 340 APEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQ 519 E TH + NM +QGS P + G D + +PG + P LNPGA +G Sbjct: 145 VQETTHDAHVRNMGFQGSTSGPSRTGVDPSN----MPGRVANEPAPVLNPGAAGPQGA-L 199 Query: 520 MPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVK 699 +P NQ N NVNR ++NENQIRP +ENG MLFVGELHWWTTDAELESVL+QYG+ K Sbjct: 200 IPANQM--GVNANVNRVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRAK 257 Query: 700 
EIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAAN 879 EIKFFDERASGKSKGYCQVEF++ AAAACK+GMNG+ FNGR CVVAFA+PQ++KQM A+ Sbjct: 258 EIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGAS 317 Query: 880 YMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 YMNK Q Q QSQ QG PM+D GRG TNY Sbjct: 318 YMNKNQGQPQSQNQGSRPMNDGGGRGGNTNY 348 Score = 152 bits (384), Expect = 4e-34 Identities = 75/116 (64%), Positives = 79/116 (68%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 425 MMHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNR 483 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N H GMWTDSSMGGW GEEHGRRTRESSYGG+D AS+YGY Sbjct: 484 GMAANGMGMMGSSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGY 539 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 292 bits (747), Expect = 3e-76 Identities = 170/332 (51%), Positives = 205/332 (61%), Gaps = 8/332 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALAE+EM GE FLQM ++A P N GFQ E Sbjct: 15 GAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATVGNGGFQTRNAH--E 71 Query: 181 SRAEATMLHEAHVPGG--VTEAKYVSSGLRFPEQKSGL----APNVGPPPTTDVSQKAQA 342 SR E + GG E Y ++ FPEQK A +VGP + V+QK + Sbjct: 72 SRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEAQDVGPVDGSSVAQKGRV 131 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSS--RGIP 516 EM+H Q NM +Q S +P +G D D RK +A P P GS+ RG P Sbjct: 132 IEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRK------NAIEPEPLPITGSAGPRGAP 185 Query: 517 QMPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKV 696 QM NQ SA+ VNRP++NENQ+RP +ENG+ L+VGELHWWTTDAELES +Q+G+V Sbjct: 186 QMQVNQMHMSAD--VNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAELESFASQFGRV 243 Query: 697 KEIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAA 876 
KEIKFFDERASGKSKGYCQV+FYE AAAACKEGMNG+ FNGR CVVAFA+PQ++KQM A Sbjct: 244 KEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAFASPQTLKQMGA 303 Query: 877 NYMNKTQVQAQSQPQGRIPMSDAAGRGNGTNY 972 +YMNKTQ Q Q+Q QGR M+D AGRG N+ Sbjct: 304 SYMNKTQGQPQTQSQGRGSMNDGAGRGGNANF 335 Score = 97.8 bits (242), Expect = 1e-17 Identities = 54/116 (46%), Positives = 58/116 (50%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MM PQ MMG GFDP YM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 409 MMPPQGMMGAGFDPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 468 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N + GMW ESSY G++ ASEYGY Sbjct: 469 GMAPNGMGMMVSSGMDGPNPGMW------------------ESSYDGDEGASEYGY 506 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 251 bits (642), Expect = 5e-64 Identities = 149/326 (45%), Positives = 180/326 (55%), Gaps = 2/326 (0%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALAE+E+ GE FLQM ++A P N GFQ E Sbjct: 15 GAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATAGNGGFQTRNAH--E 71 Query: 181 SRAEA--TMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQAPEMT 354 SR E + + G E KY ++G FPEQK QA Sbjct: 72 SRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQK-------------------QAGIGV 112 Query: 355 HGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQ 534 + G++ Y + QK G+ RG+PQM NQ Sbjct: 113 EANDVGSIGYGDGSSVAQK--------------------------GSAGPRGVPQMQVNQ 146 Query: 535 TTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFF 714 + N +VNRP++NENQ+RP +ENG L+VGELHWWTTDAELESV +QYG+VKEIKFF Sbjct: 147 M--NMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFF 204 Query: 715 DERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKT 894 DERASGKSKGYCQV+FYE AAAACKEGMN + FNGR CVVAFA+ Q++KQM A+YM+KT Sbjct: 205 
DERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKT 264 Query: 895 QVQAQSQPQGRIPMSDAAGRGNGTNY 972 Q Q Q Q QGR M+D GRG NY Sbjct: 265 QGQPQPQSQGRGSMNDGMGRGGNANY 290 Score = 132 bits (332), Expect = 4e-28 Identities = 67/116 (57%), Positives = 72/116 (62%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMH Q MMG GFDP YM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 364 MMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 423 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N + G W D+SMGGW GEE GRRTRESSY G++ ASEYGY Sbjct: 424 GMAPNGMGMMASSGMEGPNPGKWPDTSMGGW-GEEPGRRTRESSYDGDEGASEYGY 478 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 251 bits (642), Expect = 5e-64 Identities = 149/326 (45%), Positives = 180/326 (55%), Gaps = 2/326 (0%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMGFQPPKTSFPE 180 GAI ALAE+E+ GE FLQM ++A P N GFQ E Sbjct: 15 GAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATAGNGGFQTRNAH--E 71 Query: 181 SRAEA--TMLHEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQAPEMT 354 SR E + + G E KY ++G FPEQK QA Sbjct: 72 SRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQK-------------------QAGIGV 112 Query: 355 HGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIPQMPGNQ 534 + G++ Y + QK G+ RG+PQM NQ Sbjct: 113 EANDVGSIGYGDGSSVAQK--------------------------GSAGPRGVPQMQVNQ 146 Query: 535 TTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVKEIKFF 714 + N +VNRP++NENQ+RP +ENG L+VGELHWWTTDAELESV +QYG+VKEIKFF Sbjct: 147 M--NMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFF 204 Query: 715 DERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAANYMNKT 894 DERASGKSKGYCQV+FYE AAAACKEGMN + FNGR CVVAFA+ Q++KQM A+YM+KT Sbjct: 205 DERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKT 264 Query: 895 QVQAQSQPQGRIPMSDAAGRGNGTNY 972 Q Q Q Q 
QGR M+D GRG NY Sbjct: 265 QGQPQPQSQGRGSMNDGMGRGGNANY 290 Score = 96.7 bits (239), Expect = 2e-17 Identities = 56/116 (48%), Positives = 60/116 (51%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 MMH Q MMG GFDP YM M+P FPAVN MGLAGVAPHVNPAFF R Sbjct: 364 MMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFAR 423 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA N GM S M G +ESSY G++ ASEYGY Sbjct: 424 GMAPNG-------------MGMMASSGMEG------PNPGKESSYDGDEGASEYGY 460 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 247 bits (631), Expect = 9e-63 Identities = 160/333 (48%), Positives = 191/333 (57%), Gaps = 9/333 (2%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXX-GEGFLQMQRNQASLTPTGGSNMGFQPPKT--- 168 GAI ALA++EMI GE F+Q+QR + + P N P T Sbjct: 28 GAIPALADEEMIGEEDDEYDDLYNDVNVGESFMQVQRPDSQIPPFKAENR-VNPSGTGDE 86 Query: 169 SFPESRAEATML--HEAHVPGGVTEAKYVSSGLRFPEQKSGLAPNVGPPPTTDVSQKAQA 342 S P A A+ + A PG L+FPEQK+GL T D SQ + Sbjct: 87 SIPSEEANASKYAGNRAFGPGA----------LQFPEQKAGLNTTEETSVTVDRSQTVR- 135 Query: 343 PEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAG-SSRGIPQ 519 SQ YQGS+ P D + + K G S +NP G S+G Sbjct: 136 -----NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDPSS-----INPNVGVGSKGA-- 182 Query: 520 MPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKVK 699 +P N +AN N RP+ +E + ENGN ML+VGELHWWTTDAE+ESVL QYGKVK Sbjct: 183 VPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWTTDAEIESVLIQYGKVK 242 Query: 700 EIKFFDERASGKSKGYCQVEFYEPTAAAACKEGMNGYHFNGRACVVAFATPQSIKQMAAN 879 EIKFFDERASGKSKGYCQVEF++P AA ACKEGMNGY FNGRACVVAFATPQ+IKQM A+ Sbjct: 243 EIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRACVVAFATPQTIKQMGAS 302 Query: 880 YMNKTQVQAQSQPQGR-IPMSD-AAGRGNGTNY 972 YMN+ Q Q Q+Q GR M+D AGRG GTN+ Sbjct: 303 YMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNF 335 Score = 112 bits (281), Expect = 3e-22 Identities = 59/109 (54%), Positives = 67/109 (61%), Gaps = 2/109 (1%) Frame = +1 Query: 1201 
MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXX-MIPPFPAVNPMGLAGVAPHVNPAFFG 1377 MMHPQ MMGPGFD +M M+PPFPAVN +GL GVAPHVNPAFFG Sbjct: 401 MMHPQGMMGPGFDLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFG 460 Query: 1378 RGMASNXXXXXXXXXXXXXHAGMWTDSSM-GGWGGEEHGRRTRESSYGG 1521 RGMA N ++G+W D+S+ GGWGGEE GR ESSYGG Sbjct: 461 RGMAPNGMGMMGPSGMGGPYSGLWNDASVGGGWGGEEQGRGP-ESSYGG 508 >emb|CBI16834.3| unnamed protein product [Vitis vinifera] Length = 491 Score = 238 bits (607), Expect = 5e-60 Identities = 142/277 (51%), Positives = 171/277 (61%), Gaps = 10/277 (3%) Frame = +1 Query: 1 GAISALAEDEMINXXXXXXXXXXXXXXGEGFLQMQRNQASLTPTGGSNMG-FQPPKTSFP 177 GAISALA+DE++ GEGFLQM R++A P+G G FQ KT P Sbjct: 7 GAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHRSEAP-APSGVMAGGPFQAHKTDVP 65 Query: 178 ESRAEATMLHEAHVPGGVTEAKYVSSGLRFPEQKSG----LAPNVGPPPTTD---VSQKA 336 + EA +PG E KY S F E+K G P +G D VSQK Sbjct: 66 PQKLEAGTSQGLIIPGVSIEGKY--SNPHFHEKKEGPMAVKGPEMGSTSHLDGPSVSQKG 123 Query: 337 QAPEMTHGSQAGNMEYQGSLGMPQKVGPDSLDTLRKVPGPGESATLPSLNPGAGSSRGIP 516 + EMTH +Q N+ +QGS +PQK G + D K+ + + P LN G G R +P Sbjct: 124 RVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIA----NESTPVLNSGTGGPRAVP 179 Query: 517 QMPGNQTTSSANINVNRPIINENQIRPAVENGNAMLFVGELHWWTTDAELESVLTQYGKV 696 QM NQ N+NVNRP++NENQIRPAV+NG MLFVGELHWWTTDAELESVL+QYG+V Sbjct: 180 QMLSNQM--GMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVLSQYGRV 237 Query: 697 KEIKFFDERASGKSKGYCQVEFYEPTAAAAC--KEGM 801 KEIKFFDERASGKSKGYCQVEFY+ +AAAA KEG+ Sbjct: 238 KEIKFFDERASGKSKGYCQVEFYDASAAAAFSGKEGI 274 Score = 124 bits (311), Expect = 1e-25 Identities = 66/116 (56%), Positives = 71/116 (61%) Frame = +1 Query: 1201 MMHPQAMMGPGFDPTYMXXXXXXXXXXXXXXXXMIPPFPAVNPMGLAGVAPHVNPAFFGR 1380 +MHPQ MMG GFDPTYM M+P FPAVN MGLAGVAPHVNPAFFGR Sbjct: 294 LMHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGR 353 Query: 1381 GMASNXXXXXXXXXXXXXHAGMWTDSSMGGWGGEEHGRRTRESSYGGEDNASEYGY 1548 GMA+N GM G GEEHGRRTRESSYGG+D AS+YGY Sbjct: 354 GMAAN---------------GM-------GMMGEEHGRRTRESSYGGDDGASDYGY 387