BLASTX nr result
ID: Akebia23_contig00008069
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00008069
         (1716 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002284674.1| PREDICTED: uncharacterized protein LOC100247...   346   2e-92
ref|XP_002308629.2| hypothetical protein POPTR_0006s26160g [Popu...   290   2e-75
ref|XP_007227718.1| hypothetical protein PRUPE_ppa006815mg [Prun...   277   1e-71
ref|XP_002324258.2| hypothetical protein POPTR_0018s00980g [Popu...   267   1e-68
ref|XP_002516598.1| conserved hypothetical protein [Ricinus comm...   266   2e-68
ref|XP_006356629.1| PREDICTED: uncharacterized protein LOC102585...   264   1e-67
ref|XP_006450586.1| hypothetical protein CICLE_v10008522mg [Citr...   262   3e-67
ref|XP_006476169.1| PREDICTED: uncharacterized protein LOC102623...   261   5e-67
ref|XP_004245248.1| PREDICTED: uncharacterized protein LOC101254...   258   6e-66
ref|XP_004291408.1| PREDICTED: uncharacterized protein LOC101296...   257   1e-65
ref|XP_002299638.1| hypothetical protein POPTR_0001s17990g [Popu...   233   2e-58
ref|XP_002528195.1| conserved hypothetical protein [Ricinus comm...   229   3e-57
ref|XP_007024096.1| Pre-mRNA cleavage complex 2 protein Pcf11, p...   224   9e-56
ref|XP_007161279.1| hypothetical protein PHAVU_001G056900g [Phas...   224   1e-55
ref|XP_007024099.1| Pre-mRNA cleavage complex 2 protein Pcf11, p...   223   3e-55
ref|XP_007024097.1| Pre-mRNA cleavage complex 2 protein Pcf11, p...   223   3e-55
gb|EXB60468.1| hypothetical protein L484_014922 [Morus notabilis]     220   1e-54
ref|XP_006596129.1| PREDICTED: uncharacterized protein LOC100789...   216   3e-53
ref|XP_007024098.1| Pre-mRNA cleavage complex 2 protein Pcf11, p...   215   6e-53
ref|XP_007024100.1| Pre-mRNA cleavage complex 2 protein Pcf11, p...   213   2e-52

>ref|XP_002284674.1| PREDICTED: uncharacterized protein LOC100247517 [Vitis vinifera]
          Length = 411

 Score =  346 bits (888), Expect = 2e-92
 Identities = 199/407 (48%), Positives = 246/407 (60%), Gaps = 45/407 (11%)
 Frame = +1

Query: 277  MLRKRSRSVQKDQNKGH-LMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXX 453
            MLRKRSRS QKDQ+ GH  M D+VSE  FQS+V+GQK+K +SFFSVPG+FVGL
Sbjct: 1    MLRKRSRSFQKDQHMGHPTMADAVSELYFQSDVMGQKHKGNSFFSVPGLFVGLNYKGLSD 60

Query: 454  XXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETKL---- 621
                      LD+RVFSNLG+PFR  RS  +G  KSWDC KVGL I+DSL+D  KL
Sbjct: 61   SDSVRSPTSPLDFRVFSNLGSPFRSPRSSQDGQHKSWDCSKVGLSIIDSLDDGGKLSGKV 120

Query: 622  -GISETRNILFGSHRKINIPSLD-------------------------------ESDVAF 705
             G SE++ ILFG   +I  P+                                 +SDV F
Sbjct: 121  LGSSESKTILFGPQMRIKTPNSPSHINFFDGSKSLPKNYASFPHTQIKSRPQKRDSDVVF 180

Query: 706  RTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSP--LM 879
               E  LEP  F +I S SL+S +S S  T LT    N S   +     TT+ SSP  ++
Sbjct: 181  EIEETPLEPEAFGRIRSCSLDSSRSFSSLTNLTKRQSNLSSGNLCPGNMTTQVSSPPQIL 240

Query: 880  GGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCD 1059
            GG+ + D FL M+ NS+P S+GSG GL GSLSASEIELSEDYTCVISHGPNP+TTHI+ D
Sbjct: 241  GGNPNPDNFLPMKLNSIPASVGSGQGLIGSLSASEIELSEDYTCVISHGPNPKTTHIYGD 300

Query: 1060 CILECHTNELENCTKRN----GSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYM 1227
            CILECH+N+L N  K +    GSP +++  + S  Y ++DFLS C  CK K+  GKDIYM
Sbjct: 301  CILECHSNDLANHNKNDEHKIGSPLIVECSDNSTPYPSNDFLSICYSCKKKLEEGKDIYM 360

Query: 1228 HRGEKSFC--GCHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTG 1362
            +RGEK+FC   C S+EIL DEE+EK         P S   E++F TG
Sbjct: 361  YRGEKAFCSLNCRSQEILIDEEMEKTTDDSSEKSPVSKCGEDLFETG 407

>ref|XP_002308629.2| hypothetical protein POPTR_0006s26160g [Populus trichocarpa]
 gi|550337113|gb|EEE92152.2| hypothetical protein POPTR_0006s26160g [Populus trichocarpa]
          Length = 411

 Score =  290 bits (741), Expect = 2e-75
 Identities = 176/408 (43%), Positives = 223/408 (54%), Gaps = 44/408 (10%)
 Frame = +1

Query: 277
MLRKRSRSVQKDQNKGHL-MPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXX 453 MLRKR+RS+QKDQ G L M DS SES+FQS+ +G +K +SFF+VPG+FVG Sbjct: 1 MLRKRTRSLQKDQQMGQLTMSDSGSESHFQSDNMGHNHKANSFFTVPGLFVGSSLKGLSD 60 Query: 454 XXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETK----- 618 LD+R+FSN+GNP + RS G +KSWDC KVGL IVDSL+D+ K Sbjct: 61 CDSVRSPTSPLDFRMFSNIGNPSKSPRSSHGGQRKSWDCNKVGLSIVDSLDDDGKGSGKV 120 Query: 619 LGISETRNILFGSHRKINIPSLD-------------------------------ESDVAF 705 L SE++NILFG + P+ SDV F Sbjct: 121 LRSSESKNILFGPRVRSKTPNFQSRTDSFQAPKSLPRNFAIFPRTLTKSPLLKGSSDVLF 180 Query: 706 RTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSP-LMG 882 GE + F KI S SL+S +S S + L N S D TTR P L G Sbjct: 181 EIGEDPSDSEPFGKIRSCSLDSCRSFSSLSRLAGQNSKASSGNFCLDNVTTRGECPQLFG 240 Query: 883 GSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDC 1062 GS + + F P+S+ SG+G GSLSASEIELSEDYTCVISHGPNP+TTHI+ DC Sbjct: 241 GSPNSNNFSNTNLTFTPMSVSSGNGFIGSLSASEIELSEDYTCVISHGPNPKTTHIYGDC 300 Query: 1063 ILECHTNELENCTKRN----GSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMH 1230 ILEC +N+L N K G P+ + + + ++ FLSFC +C K+ GKDIY++ Sbjct: 301 ILECQSNDLSNFGKNEAKEIGLPQAVTCSKIPGSFPSEVFLSFCYYCNKKLDEGKDIYIY 360 Query: 1231 RGEKSFC--GCHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTV 1368 RGEK+FC C S EI+ DEELE P S E +F TG + Sbjct: 361 RGEKAFCSLSCRSEEIMIDEELENTTHKSSECVPMSGEGEGLFETGII 408 >ref|XP_007227718.1| hypothetical protein PRUPE_ppa006815mg [Prunus persica] gi|462424654|gb|EMJ28917.1| hypothetical protein PRUPE_ppa006815mg [Prunus persica] Length = 394 Score = 277 bits (708), Expect = 1e-71 Identities = 181/408 (44%), Positives = 227/408 (55%), Gaps = 44/408 (10%) Frame = +1 Query: 277 MLRKRSRSVQKDQNK-GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXX 453 MLRKRSRS+QKDQ++ GHL ++++ S+VLG K++SFFSVPG+FVGL Sbjct: 1 MLRKRSRSIQKDQHQMGHL---PIADAG--SDVLGHNPKSNSFFSVPGLFVGLSSKGLID 55 Query: 454 XXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETKLG--- 624 LD+RVFSNLGNPFR RS +G Q+SW KVGL I+DS +D+ K Sbjct: 56 
SDSVRSPTSPLDFRVFSNLGNPFRSPRSNSDGQQRSWGSSKVGLSIIDSFDDDVKFSGKV 115 Query: 625 --ISETRNILFGSHRKINIPSLDE-------------------------------SDVAF 705 SE++NILFG +I P SDV F Sbjct: 116 PRSSESKNILFGPGMRIKTPDSQSNTNSFASPKSLPKNYAVFPHSKIKSPLEKGSSDVLF 175 Query: 706 RTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPLMGG 885 GE EP F KI S SL+S ++ S +GL+ NPN + TT+ P +GG Sbjct: 176 EIGESPTEPESFGKIRSCSLDSGRAFSTLSGLSNLNPNSTSGNFCMGSLTTQ---PFIGG 232 Query: 886 SDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCI 1065 S + L Q N+ SIGS +GL GSLSASEIELSEDYTCVISHG NP+ THIF DCI Sbjct: 233 SPN----LATQMNTG--SIGSSNGLVGSLSASEIELSEDYTCVISHGANPKKTHIFGDCI 286 Query: 1066 LECHTNELENCTKRNGSPRVIKSPEGSAL-----YATDDFLSFCCFCKNKMVGGKDIYMH 1230 L CH+N+L N K G P G++L Y +++FLSFC +C K+ GKDIY++ Sbjct: 287 LGCHSNDLSNFGKNEGKEIGFARP-GTSLGNFVQYPSNNFLSFCYYCNKKLEEGKDIYIY 345 Query: 1231 RGEKSFC--GCHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTV 1368 RGEK+FC C S EIL DEELEK S EE+F TG + Sbjct: 346 RGEKAFCSLSCRSEEILIDEELEKCNDQSSEKPLESD--EELFETGII 391 >ref|XP_002324258.2| hypothetical protein POPTR_0018s00980g [Populus trichocarpa] gi|550317758|gb|EEF02823.2| hypothetical protein POPTR_0018s00980g [Populus trichocarpa] Length = 415 Score = 267 bits (683), Expect = 1e-68 Identities = 173/418 (41%), Positives = 221/418 (52%), Gaps = 56/418 (13%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHL-MPDSVSESNFQSEV-LGQKYKNSSFFSVPGIFVGLXXXXXX 450 MLRKR+RS++KDQ G L M DS SES FQ + +G +K +SFF+VPG+FVGL Sbjct: 1 MLRKRTRSLKKDQQTGQLTMSDSGSESYFQPDNNMGHSHKANSFFTVPGLFVGLSHKGLS 60 Query: 451 XXXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETK---- 618 LD R+FSN+GNP + RS G QKSWDC KVGL I+DSL+D+ Sbjct: 61 DCDSVRSPTSPLDSRMFSNIGNPHKSLRSSHGGQQKSWDCNKVGLSILDSLDDDDDDDDG 120 Query: 619 ------LGISETRNILFGSHRKINIPSL-------------------------------D 687 L SE++NILFG + + D Sbjct: 121 KGYGKVLQSSESKNILFGPRVRSKTANFQSHTDPFQAPKSLPRNFAIFPRTLTKSPLQKD 180 Query: 688 ESDVAFRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNP-----NFSLQYIRSDKK 852 SDV F GE E F +I S SL+S +S S + L N NFSL I 
Sbjct: 181 SSDVLFEIGEGPFESETFGRIRSCSLDSCRSFSSMSRLAGQNLKASSLNFSLHNI----- 235 Query: 853 TTRSSSP--LMGGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHG 1026 TT+ P L+GGS + + F P+S SG+G SLSASEIELSEDYTCVISHG Sbjct: 236 TTQVDCPPQLLGGSSNTNNFSNTNLTYTPMSASSGNGFISSLSASEIELSEDYTCVISHG 295 Query: 1027 PNPRTTHIFCDCILECHTNELENCTKRN----GSPRVIKSPEGSALYATDDFLSFCCFCK 1194 PNP+TTHI+ CILECH+N+ N K G + + + + ++DFLSFC +C Sbjct: 296 PNPKTTHIYGGCILECHSNDFSNFGKNKEKEIGLAQAATCSKIPSSFPSEDFLSFCYYCN 355 Query: 1195 NKMVGGKDIYMHRGEKSFC--GCHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTG 1362 K+ GKDIY++RGEK+FC C S EI+ DEELE P S+ + +F TG Sbjct: 356 KKLDEGKDIYIYRGEKAFCSLSCRSEEIMIDEELENTTSKSAVDVPTSSSWKGLFETG 413 >ref|XP_002516598.1| conserved hypothetical protein [Ricinus communis] gi|223544418|gb|EEF45939.1| conserved hypothetical protein [Ricinus communis] Length = 435 Score = 266 bits (681), Expect = 2e-68 Identities = 178/415 (42%), Positives = 222/415 (53%), Gaps = 48/415 (11%) Frame = +1 Query: 262 RF*EIMLRKRSRSVQKDQNKGHL-MPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXX 438 RF +MLRKR+RS+QKDQ G L M DS S+ N QS+ LG +K +SFF+VPG+FVGL Sbjct: 22 RFLGVMLRKRTRSLQKDQQMGPLTMSDSGSQFNSQSDCLGYNHKRTSFFNVPGLFVGLSP 81 Query: 439 XXXXXXXXXXXXXXXLDYRVFSNLGNP-FRFSRSCPNGPQKSWDCGKVGLGIVDSLNDE- 612 LD R+FSNLGN +R RS NG QKSWDC KVGL IV+SL+DE Sbjct: 82 KGMSDCDSVRSPTSPLDLRLFSNLGNSSYRSPRSSQNGHQKSWDCSKVGLSIVNSLDDED 141 Query: 613 --TK-----LGISETRNILFGSHRKINIPSLDE--------------------------- 690 TK L SE++NILFG +I P+ Sbjct: 142 DDTKVSGKVLRSSESKNILFGQKVRIKTPTFQVNANSFEAPKSLPRNFAILPHSYTKSSL 201 Query: 691 ----SDVAFRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTT 858 S V F GE EP F KI S SL+S KS S + L N N + T Sbjct: 202 QKGCSKVIFEIGEAPTEPEHFGKIRSCSLDSCKSFSTLSRLANRNSNVICGNFPLNNVAT 261 Query: 859 RSSSPLM---GGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGP 1029 +SSPL G + L M N P GS G GSLSASEIELSEDYTCVISHGP Sbjct: 262 GTSSPLQFSGGSPPQSNNSLHMDLNLPPA--GSTSGFVGSLSASEIELSEDYTCVISHGP 319 Query: 1030 NPRTTHIFCDCILECHTNELENCTKRNGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVG 
1209 N + THI+ DC+LEC++NE K P+ I S + + ++DFL+FC +C ++ G Sbjct: 320 NAKKTHIYGDCVLECYSNE----GKEIRMPQAITSSIIPSPFPSNDFLNFCYYCNRRLDG 375 Query: 1210 GKDIYMHRGEKSFC--GCHSREILADEELEKP--MXXXXXXXPRSTYCEEIFSTG 1362 GKDIY++RGEK+FC C S EI+ DEE+EK P+ EE++ G Sbjct: 376 GKDIYIYRGEKAFCSLSCRSEEIMIDEEMEKTTNKTCDEPEPPKCDNGEELYENG 430 >ref|XP_006356629.1| PREDICTED: uncharacterized protein LOC102585748 [Solanum tuberosum] Length = 407 Score = 264 bits (674), Expect = 1e-67 Identities = 171/415 (41%), Positives = 219/415 (52%), Gaps = 48/415 (11%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXX 456 ML+KR+RS QK GHLM D +S+S FQS+VL +K+K++SFF+VPG+FVGL Sbjct: 1 MLKKRTRSHQKVHTMGHLMSDGISDSYFQSDVLVRKHKSNSFFNVPGVFVGLNPKGSESD 60 Query: 457 XXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETKLG---- 624 LD+RVFSNLGNPFR S S G K+W C KVGLGIVDSL+DE K Sbjct: 61 SVRSPTSP-LDFRVFSNLGNPFRSSTSEGAGANKTWGCTKVGLGIVDSLDDEMKQSGKVF 119 Query: 625 -ISETRNILFGSHRKINI--------PSLDE-------------------------SDVA 702 S+++NILFG+ +I SL+E SDV Sbjct: 120 RSSDSKNILFGTQMRIKTHDFQSCVDDSLEEPKSLPKNISIFPHTLSKSSNLRKGSSDVV 179 Query: 703 FRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNF----SLQYIRSDKKTTRSSS 870 F G+ E S SL+S +S S L F ++ + S K R S Sbjct: 180 FGIGDALSEHELSRNFRSCSLDSGRSSSRFASLANRTVAFGSENAINPVVSHTKCVRGCS 239 Query: 871 PLMGGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHI 1050 L + G + + +P +GS L GS+SAS+IELSEDYTCV + GPN + THI Sbjct: 240 KLGNPAG------GAKLSPIPTPVGSNTSLVGSISASDIELSEDYTCVRTRGPNAKVTHI 293 Query: 1051 FCDCILECHTNEL----ENCTKRNGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKD 1218 FCDCILECH NEL +N ++ P V S E + + DFL FC CK K + GKD Sbjct: 294 FCDCILECHNNELPNFCKNANEKTVLPEVTDSSEVLTSFPSSDFLRFCSSCK-KRLDGKD 352 Query: 1219 IYMHRGEKSFCG--CHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVIGT 1377 IYM+RGEK+FC C S IL DEE+EK + + +E+F TG I T Sbjct: 353 IYMYRGEKAFCSLDCRSEAILIDEEMEKKVNNHSESTIKPNSRDEVFDTGLFIVT 407 >ref|XP_006450586.1| hypothetical protein CICLE_v10008522mg [Citrus clementina] gi|557553812|gb|ESR63826.1| 
hypothetical protein CICLE_v10008522mg [Citrus clementina] Length = 399 Score = 262 bits (670), Expect = 3e-67 Identities = 180/420 (42%), Positives = 228/420 (54%), Gaps = 42/420 (10%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHLM-PDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXX 453 MLRKR+RSV+K+Q HL P+SV+ES F SE L K +S F+VPG+FVGL Sbjct: 1 MLRKRTRSVEKEQQMSHLKTPESVAESFFNSENL----KGNSLFNVPGLFVGLSPKGLSD 56 Query: 454 XXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDE----TKL 621 LD+R FSNLGN FR +S KSWD KVGL I+DSL ++ +K+ Sbjct: 57 TDSVRSPTSPLDFRAFSNLGNSFRSPKSAHYEQHKSWDTSKVGLSIIDSLRNDMKPSSKV 116 Query: 622 GISETRNILFGSHRKI-------NIPSLD------------------------ESDVAFR 708 SE++NI+FG +I NI S D SDV Sbjct: 117 LRSESKNIIFGPQMRIKTPNSQTNINSFDAPKSLPKNYAIFPCTQIKSLLQTGNSDVVLE 176 Query: 709 TGEIQLEPNQ-FHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPLM-G 882 GE E ++ F K S SL+S +S G T S + +K + SSPLM G Sbjct: 177 IGETPFEEHEPFGKTRSCSLDSCRSFPVLAGFTDCGSIMSSENFGFEKLACQESSPLMVG 236 Query: 883 GSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDC 1062 GS + F + N + SIGSG+G T SLSASEIELSEDYT V+SHGPNPRTTHI+ DC Sbjct: 237 GSPRSNNFSDSKVNLMSTSIGSGNGFTESLSASEIELSEDYTRVVSHGPNPRTTHIYGDC 296 Query: 1063 ILECHTNELENCTKR--NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRG 1236 ILEC TN+ + K GS V+ + Y +DDFLSFCC C NK + GKDIY++RG Sbjct: 297 ILECRTNDQSDDYKNEAEGSDGVMII---TTQYPSDDFLSFCCSC-NKKLEGKDIYIYRG 352 Query: 1237 EKSFCG--CHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVIGT*R*CGCFYLAT 1410 EK+FC C S+EIL DEE+EK + P+S C E+ T CF++ T Sbjct: 353 EKAFCSADCRSQEILIDEEMEKDI--NSESSPKSDDCGELSET-----------CFFITT 399 >ref|XP_006476169.1| PREDICTED: uncharacterized protein LOC102623549 [Citrus sinensis] Length = 399 Score = 261 bits (668), Expect = 5e-67 Identities = 179/420 (42%), Positives = 228/420 (54%), Gaps = 42/420 (10%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHLM-PDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXX 453 MLRKR+RSV+K+Q HL P+SV+ES F SE L +S F+VPG+FVGL Sbjct: 1 MLRKRTRSVEKEQQMSHLKTPESVAESFFNSENL----TGNSLFNVPGLFVGLSPKGLSD 56 Query: 454 
XXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDE----TKL 621 LD+R FSNLGN FR +S KSWD KVGL I+DSL ++ +K+ Sbjct: 57 TDSVRSPTSPLDFRAFSNLGNSFRSPKSAHYEQHKSWDTSKVGLSIIDSLRNDMKPSSKV 116 Query: 622 GISETRNILFGSHRKI-------NIPSLD------------------------ESDVAFR 708 SE++NI+FG +I NI S D SDV Sbjct: 117 LRSESKNIIFGPQMRIKTPNSQTNINSFDAPKSLPKNYAIFPCTQIKSLLQKGNSDVVLE 176 Query: 709 TGEIQLEPNQ-FHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPLM-G 882 GE E ++ F K S SL+S +S G T S + +K + SSPLM G Sbjct: 177 IGETPFEEHEPFGKTRSCSLDSCRSFPALAGFTDCGSIMSSENFGFEKLACQESSPLMVG 236 Query: 883 GSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDC 1062 GS + FL + N + SIGSG+G T SLSASEIELSEDYT V+SHGPNPRTTHI+ DC Sbjct: 237 GSPRSNNFLDSKVNLMSTSIGSGNGFTESLSASEIELSEDYTRVVSHGPNPRTTHIYGDC 296 Query: 1063 ILECHTNELENCTKR--NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRG 1236 ILEC TN+ + K GS V+ + Y +DDFLSFCC C NK + GKDIY++RG Sbjct: 297 ILECRTNDQSDDYKNEAEGSDGVMII---TTQYPSDDFLSFCCSC-NKKLEGKDIYIYRG 352 Query: 1237 EKSFCG--CHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVIGT*R*CGCFYLAT 1410 EK+FC C ++EIL DEE+EK + P+S C E+ T CF++ T Sbjct: 353 EKAFCSADCRAQEILIDEEMEKDI--NSESSPKSDDCGELSET-----------CFFITT 399 >ref|XP_004245248.1| PREDICTED: uncharacterized protein LOC101254717 [Solanum lycopersicum] Length = 406 Score = 258 bits (659), Expect = 6e-66 Identities = 168/415 (40%), Positives = 216/415 (52%), Gaps = 48/415 (11%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXX 456 ML+KR+RS QK Q GHLM D +S+S FQ +V +K+KN+SFF+VPG+FVG Sbjct: 1 MLKKRTRSHQKVQTMGHLMSDGISDSYFQPDVFVRKHKNNSFFNVPGVFVGFNPKGSESD 60 Query: 457 XXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----L 621 LD+RVFSNLGNPFR S S G K+W C KVGLGIVDSL+DE K Sbjct: 61 SVRSPTSP-LDFRVFSNLGNPFRSSTSEGAGANKTWGCTKVGLGIVDSLDDEMKHSGKVF 119 Query: 622 GISETRNILFGSHRKINI--------PSLDE-------------------------SDVA 702 S+++NILFG+ +I SL+E SDV Sbjct: 120 RSSDSKNILFGTQMRIKAHDFQSCVDDSLEEPKSLPKNISIFPHTLSKSSNLRKGSSDVV 179 Query: 703 
FRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTY----YNPNFSLQYIRSDKKTTRSSS 870 F G+ E S SL+S +S S L ++ + S K R S Sbjct: 180 FGIGDALSEHEYSRNFRSCSLDSGRSSSRFASLANRTVAVGSENAINPVVSQTKCVRGCS 239 Query: 871 PLMGGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHI 1050 L + G + + +P +GS L GS+SAS+I+LSEDYTCV + GPN + THI Sbjct: 240 KLGNPAG------GAKLSPIPTPVGSNTSLVGSISASDIQLSEDYTCVRTRGPNAKVTHI 293 Query: 1051 FCDCILECHTNEL----ENCTKRNGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKD 1218 FCDCILECH NEL +N ++ P V S E + + DFL FC CK K+ GKD Sbjct: 294 FCDCILECHNNELPNFCKNANEKTVLPEVTDSSEVLTSFPSSDFLRFCSSCKKKL-DGKD 352 Query: 1219 IYMHRGEKSFCG--CHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVIGT 1377 IYM+RGEK+FC C S IL DEE+EK + + +E+F TG I T Sbjct: 353 IYMYRGEKAFCSLDCRSEAILIDEEMEK-VNNDSESSIKPNSRDEVFDTGLFIAT 406 >ref|XP_004291408.1| PREDICTED: uncharacterized protein LOC101296169 [Fragaria vesca subsp. vesca] Length = 403 Score = 257 bits (657), Expect = 1e-65 Identities = 164/385 (42%), Positives = 219/385 (56%), Gaps = 46/385 (11%) Frame = +1 Query: 277 MLRKRSRSVQKDQNK---GHL-MPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXX 444 MLRKR+RS QKDQ++ GHL + ++ SES+F+S+VLG K++ FF++PG+FVGL Sbjct: 1 MLRKRTRSTQKDQDQHQMGHLPISNTGSESHFRSDVLGPNPKSNPFFTIPGLFVGLGPIG 60 Query: 445 XXXXXXXXXXXXXLDYRVFSNLGNPFRFSRSCPNGPQKSWDCGKVGLGIVDSLNDETKLG 624 LD+RVFSNLG+PFR RS +G ++SW KVGL I+DS +D+ K Sbjct: 61 LTDSDSIRSPTSPLDFRVFSNLGSPFRSPRSPLDGHKRSWGSSKVGLSIIDSFDDDVKCS 120 Query: 625 -----ISETRNILFGS------------------------------HRKINIPSLDES-D 696 SE++NILFG H K+ P + S D Sbjct: 121 GKVPRSSESKNILFGPGMRIKTRDSRSNTNSIGSPRSLPKNYAIFPHSKVKSPLQESSSD 180 Query: 697 VAFRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPL 876 V F GE EP F KI S S +S ++ S +GL+ NPN + + + ++ Sbjct: 181 VVFEIGETPSEPESFGKIRSCSFDSARTFSTLSGLSKLNPNSTRNFCLENV----TNPQF 236 Query: 877 MGGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFC 1056 +GGS + + + S GSG+ GSLSASEIELSEDYTCVISHG NP+TTHIF Sbjct: 237 IGGSPNSATLMNVG------STGSGNEFVGSLSASEIELSEDYTCVISHGANPKTTHIFG 290 Query: 1057 
DCILECHTNEL----ENCTKRNGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIY 1224 DCIL CH+ +L EN K GSP++ S Y +++FLSFC +C ++ GKDIY Sbjct: 291 DCIL-CHSEDLSKSFENEKKGIGSPQLATSLGSFVQYPSNNFLSFCHYCNKELEEGKDIY 349 Query: 1225 MHRGEKSFC--GCHSREILADEELE 1293 ++RGEK+FC C S EIL DEELE Sbjct: 350 IYRGEKAFCSLSCRSVEILNDEELE 374 >ref|XP_002299638.1| hypothetical protein POPTR_0001s17990g [Populus trichocarpa] gi|222846896|gb|EEE84443.1| hypothetical protein POPTR_0001s17990g [Populus trichocarpa] Length = 374 Score = 233 bits (594), Expect = 2e-58 Identities = 153/380 (40%), Positives = 193/380 (50%), Gaps = 39/380 (10%) Frame = +1 Query: 331 MPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVFSNL 510 M DS +E+N Q + ++ SSFF++PG FVG LD+ F+NL Sbjct: 1 MADSDTETNSQPDTFSLRHLRSSFFNIPGFFVGCGYRGSQDFDSVRSPQSPLDFSFFTNL 60 Query: 511 GNPFRFSRSCPNGP----QKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFG--- 654 NPF S P P QK WDC KVGLGIV L DETK L + + I+F Sbjct: 61 SNPF--SNRSPRLPCQNVQKKWDCNKVGLGIVHLLVDETKPTGEVLDSDKRKTIIFAPQV 118 Query: 655 -------------------SHRKINIPSLDESDVAFRTGEIQLEPNQFHKIGSSSLNSDK 777 S K + P L +SD AF + + LE F SSS+ Sbjct: 119 KTFSSVKSNSLPRNYTISLSRTKTSSPRLGKSDGAFGSEGVLLETKPFE---SSSV---- 171 Query: 778 SESHPTGLTYYNPNFSLQYIRSDKKTTRSSS-PL-MGGSDDVDIFLGMQTNSLPISIGSG 951 GL PN S Q S+ TT + S PL + + L ++ NSLPI++GSG Sbjct: 172 -----IGLATSKPNLSSQKFYSENITTSTRSFPLEICDCSQTNKSLVIKPNSLPITVGSG 226 Query: 952 DGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKRNGS----P 1119 G GSLSA EIELSEDYTC+ISHGPNP+TTH+F D ILECH+NEL N K P Sbjct: 227 QGYVGSLSAREIELSEDYTCIISHGPNPKTTHVFGDYILECHSNELSNFDKTENPGIKLP 286 Query: 1120 RVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSREILADEELE 1293 + K P+ + D+F SFC CK K+ +DIYM+RGEK FC CHS E A+ E E Sbjct: 287 QEAKHPKHPTPFPPDEFFSFCYSCKKKLEKAEDIYMYRGEKVFCSFDCHSEETFAERETE 346 Query: 1294 KPMXXXXXXXPRSTYCEEIF 1353 K P S+Y E++F Sbjct: 347 KTCNKSSKSSPGSSYHEDVF 366 >ref|XP_002528195.1| conserved hypothetical protein [Ricinus communis] gi|223532407|gb|EEF34202.1| conserved hypothetical protein [Ricinus 
communis] Length = 374 Score = 229 bits (584), Expect = 3e-57 Identities = 146/369 (39%), Positives = 195/369 (52%), Gaps = 26/369 (7%) Frame = +1 Query: 331 MPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVFSNL 510 M DS ES+ QS+ LG K+ +SSFF+ PG FVG LD+ S+L Sbjct: 1 MADSALESHCQSDALGLKHISSSFFNFPGFFVGFGSRGSSESDSVRSPTSPLDFSFLSSL 60 Query: 511 GNPFRFSRSCPNGP-----QKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGSH 660 NPF S P P QK+W+ KVGLGI++ L DETK L + +NI+FGS Sbjct: 61 SNPF--SLKSPRSPSQNDHQKNWNSSKVGLGIINLLADETKPPGVVLNSPKRKNIIFGSQ 118 Query: 661 RK----INIPSLDESDVAFRTGEIQLEPNQFHKIGSSSLNSDKS---------ESHPTGL 801 K + SL + + + Q K S ++ ++ S P L Sbjct: 119 VKTGYSVRSNSLPRDYMLLLLPKTKTLNRQLGKSNSEAVFGVEAVQLECKPFENSSPITL 178 Query: 802 TYYNPNFSLQYIRSDKKTTRSSSPLMG-GSDDVDIFLGMQTNSLPISIGSGDGLTGSLSA 978 + +P S ++ ++ TT +S G D LG +++SLP+ IGS G GSLSA Sbjct: 179 SPKSPLISKKFCSENRTTTITSLSFFDDGGTPTDDSLGTKSSSLPVPIGSSKGYVGSLSA 238 Query: 979 SEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKRNGSPRVIKSPEGSALYA 1158 +IELSEDYTC+IS+GPNP+TTHIF DCILECHTNEL N + P+ SP Sbjct: 239 RDIELSEDYTCIISYGPNPKTTHIFGDCILECHTNELSNFDMGSELPQETNSP-----LP 293 Query: 1159 TDDFLSFCCFCKNKMVGGKDIYMHRGEKSFC--GCHSREILADEELEKPMXXXXXXXPRS 1332 +D+FLSFC CK K+ DIYM+RGEK+FC CHS EI ++E EK S Sbjct: 294 SDEFLSFCYTCKKKLETRDDIYMYRGEKAFCSFNCHSEEIFGEDETEKTYDNSPKSSSMS 353 Query: 1333 TYCEEIFST 1359 +Y E++F T Sbjct: 354 SYHEDLFLT 362 >ref|XP_007024096.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 2 [Theobroma cacao] gi|508779462|gb|EOY26718.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 2 [Theobroma cacao] Length = 394 Score = 224 bits (571), Expect = 9e-56 Identities = 157/394 (39%), Positives = 201/394 (51%), Gaps = 44/394 (11%) Frame = +1 Query: 322 GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVF 501 G++M D SES FQS+ LG ++ +SS F++PG VG LD RVF Sbjct: 3 GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62 Query: 502 SNLGNPFRF---SRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGS 657 +N NPF S +G 
QK WDC K+GLGIV+ L DE K L + +NI+FG Sbjct: 63 ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122 Query: 658 HRKINIPSLDESDVAFRTGEIQ----------------LEPNQFHKIGSSSL-------- 765 K PS F ++ +PN G SSL Sbjct: 123 QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNT--NSGGSSLVFGNEEVP 180 Query: 766 ---NSDKSESHPTGL-TYYNPNFSLQYIRSDKKTT--RSSSPLMGGSDDVDIFLGMQTNS 927 SD S P+ + + N N S + S+ TT SSS +G + VD L + +S Sbjct: 181 LEPKSDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLSKPSS 240 Query: 928 LPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKR 1107 LPI +G GSLSA EIELSEDYTC+ISHGPNP+TTHIF DCILECH EL N K+ Sbjct: 241 LPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTNFDKK 297 Query: 1108 ----NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSRE 1269 ++ KSPE S Y +D+FLSFC C+ K+ +DIYM+RGEK+FC C S E Sbjct: 298 AEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDCRSEE 357 Query: 1270 ILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVI 1371 I A EE+EK P + E++F G I Sbjct: 358 IFA-EEMEKTCNNSFNGSPEQSDDEDLFLMGMPI 390 >ref|XP_007161279.1| hypothetical protein PHAVU_001G056900g [Phaseolus vulgaris] gi|593796484|ref|XP_007161280.1| hypothetical protein PHAVU_001G056900g [Phaseolus vulgaris] gi|561034743|gb|ESW33273.1| hypothetical protein PHAVU_001G056900g [Phaseolus vulgaris] gi|561034744|gb|ESW33274.1| hypothetical protein PHAVU_001G056900g [Phaseolus vulgaris] Length = 399 Score = 224 bits (570), Expect = 1e-55 Identities = 151/388 (38%), Positives = 199/388 (51%), Gaps = 48/388 (12%) Frame = +1 Query: 277 MLRKRSRSVQKDQNKGHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXX 456 MLRKR+RS+QKDQ++ M ++SE+NF+S LG K++S F+ P +FVG+ Sbjct: 1 MLRKRTRSIQKDQHQVCKM--TISEANFESHALGSNAKSNSIFNAPLLFVGMGPKGLLDS 58 Query: 457 XXXXXXXXXLDYRVFSNLGNPFRFSRSCPN-GPQKSWDCGKVGLGIVDSLNDETK----- 618 LD FSNL NPFR S N G Q+SWDC KVGL I+DSL + +K Sbjct: 59 DSVKSPTSPLDVSFFSNLSNPFRTPSSLSNEGQQRSWDCAKVGLSIIDSLEECSKFSQKI 118 Query: 619 LGISETRNILF------------------GSHRKINIPS---------------LDESDV 699 L SE++ ++ ++P DES V Sbjct: 119 
LQASESKKTTLCPQIITKAPNCKPYMDMESAYASKSLPKGSCRIHCAQNGYIFPKDESTV 178 Query: 700 AFRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPLM 879 F GE + F K S SL+S + +GLT PN K + Sbjct: 179 LFEIGEAPPQHESFEKAVSVSLDSCSPIRNLSGLTC--PNIDSDPENLALKHKCCPPHFI 236 Query: 880 GGS-DDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFC 1056 GGS D+ I L NS P++ S + SLSASEIELSEDYTCVISHGPNP+TTHIFC Sbjct: 237 GGSHDNTQILLPAALNSNPVAAVSSNEFIKSLSASEIELSEDYTCVISHGPNPKTTHIFC 296 Query: 1057 DCILECHTNELENCTKRNGSPRVIKSPEGSALYA------TDDFLSFCCFCKNKMVGGKD 1218 D ILE H + + K + + +++YA ++DFLSFC C K+ GKD Sbjct: 297 DFILETHATDFKKHNKNGEEGKELSLFSVNSMYAPNHFPSSEDFLSFCHHCNKKLEEGKD 356 Query: 1219 IYMHRGEKSFC--GCHSREILADEELEK 1296 IY++RGEK+FC C + EI+ DEELEK Sbjct: 357 IYIYRGEKAFCSLSCRAIEIMIDEELEK 384 >ref|XP_007024099.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 5 [Theobroma cacao] gi|508779465|gb|EOY26721.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 5 [Theobroma cacao] Length = 403 Score = 223 bits (567), Expect = 3e-55 Identities = 155/388 (39%), Positives = 199/388 (51%), Gaps = 44/388 (11%) Frame = +1 Query: 322 GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVF 501 G++M D SES FQS+ LG ++ +SS F++PG VG LD RVF Sbjct: 3 GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62 Query: 502 SNLGNPFRF---SRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGS 657 +N NPF S +G QK WDC K+GLGIV+ L DE K L + +NI+FG Sbjct: 63 ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122 Query: 658 HRKINIPSLDESDVAFRTGEIQ----------------LEPNQFHKIGSSSL-------- 765 K PS F ++ +PN G SSL Sbjct: 123 QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNT--NSGGSSLVFGNEEVP 180 Query: 766 ---NSDKSESHPTGL-TYYNPNFSLQYIRSDKKTT--RSSSPLMGGSDDVDIFLGMQTNS 927 SD S P+ + + N N S + S+ TT SSS +G + VD L + +S Sbjct: 181 LEPKSDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLSKPSS 240 Query: 928 LPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKR 1107 LPI +G GSLSA EIELSEDYTC+ISHGPNP+TTHIF DCILECH EL N 
K+ Sbjct: 241 LPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTNFDKK 297 Query: 1108 ----NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSRE 1269 ++ KSPE S Y +D+FLSFC C+ K+ +DIYM+RGEK+FC C S E Sbjct: 298 AEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDCRSEE 357 Query: 1270 ILADEELEKPMXXXXXXXPRSTYCEEIF 1353 I A EE+EK P + E++F Sbjct: 358 IFA-EEMEKTCNNSFNGSPEQSDDEDLF 384 >ref|XP_007024097.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 3 [Theobroma cacao] gi|508779463|gb|EOY26719.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 3 [Theobroma cacao] Length = 404 Score = 223 bits (567), Expect = 3e-55 Identities = 155/388 (39%), Positives = 199/388 (51%), Gaps = 44/388 (11%) Frame = +1 Query: 322 GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVF 501 G++M D SES FQS+ LG ++ +SS F++PG VG LD RVF Sbjct: 3 GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62 Query: 502 SNLGNPFRF---SRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGS 657 +N NPF S +G QK WDC K+GLGIV+ L DE K L + +NI+FG Sbjct: 63 ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122 Query: 658 HRKINIPSLDESDVAFRTGEIQ----------------LEPNQFHKIGSSSL-------- 765 K PS F ++ +PN G SSL Sbjct: 123 QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNT--NSGGSSLVFGNEEVP 180 Query: 766 ---NSDKSESHPTGL-TYYNPNFSLQYIRSDKKTT--RSSSPLMGGSDDVDIFLGMQTNS 927 SD S P+ + + N N S + S+ TT SSS +G + VD L + +S Sbjct: 181 LEPKSDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLSKPSS 240 Query: 928 LPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKR 1107 LPI +G GSLSA EIELSEDYTC+ISHGPNP+TTHIF DCILECH EL N K+ Sbjct: 241 LPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTNFDKK 297 Query: 1108 ----NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSRE 1269 ++ KSPE S Y +D+FLSFC C+ K+ +DIYM+RGEK+FC C S E Sbjct: 298 AEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDCRSEE 357 Query: 1270 ILADEELEKPMXXXXXXXPRSTYCEEIF 1353 I A EE+EK P + E++F Sbjct: 358 
IFA-EEMEKTCNNSFNGSPEQSDDEDLF 384 >gb|EXB60468.1| hypothetical protein L484_014922 [Morus notabilis] Length = 431 Score = 220 bits (561), Expect = 1e-54 Identities = 175/438 (39%), Positives = 232/438 (52%), Gaps = 71/438 (16%) Frame = +1 Query: 277 MLRKRSRSVQKDQNK-GHL-MPDSVSES-NFQSEVLGQKYKNSSFFSVPGIFVGL---XX 438 MLRKR+RS+QKDQ++ GH + +S SES F S++L + FS G+ VGL Sbjct: 1 MLRKRTRSIQKDQHQMGHQPITNSGSESFFFHSDILNNNNPKRNSFS--GLLVGLSPKGL 58 Query: 439 XXXXXXXXXXXXXXXLDYRVFSNLGNP-FRFSR----SCPNGPQKSW-DCGKVGL-GIVD 597 LD+++FS+LGNP FR S+ S NG Q+SW KVGL I+D Sbjct: 59 ATSTDCDSVRSPTSPLDFKLFSSLGNPFFRSSKATRSSHENGQQRSWGGSTKVGLISIID 118 Query: 598 SLNDETK-----LGISETRNILFG-------------------------------SHRKI 669 SL+D+ K L SE++NILFG H Sbjct: 119 SLDDDIKFPGKVLRSSESKNILFGPKFRVKTSTSGQANTNSFESPKSLPKNYAIFPHSSK 178 Query: 670 NIPSLDE--SDVAFRTGEIQLE-PNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIR 840 P L++ SDV F GE LE P+ +I S SL+S ++ S+ T + NF L+ Sbjct: 179 TKPPLEKGSSDVLFEIGESPLEPPDSLGQIRSCSLDSCRTMSNSPIST--SMNFCLE--- 233 Query: 841 SDKKTTRSSSP-LMGGSDDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVI 1017 ++ T SSSP GGS + + G + +++P+S+GSG+G GSLSASEIELSEDYTCVI Sbjct: 234 NNVTTQVSSSPQFFGGSPNSNRISGTKLSTIPVSLGSGNGFIGSLSASEIELSEDYTCVI 293 Query: 1018 SHGPNPRTTHIFCDCILECHTNELENCTKRNGSPRVI-------KSPEGSALYATDDFLS 1176 SHGPNP+TTHIF DCILE + +L N + + I K+ SA Y ++ FLS Sbjct: 294 SHGPNPKTTHIFGDCILETESCDLSNFAAKADDNKEIGFSQPIGKNTRISAPYPSNYFLS 353 Query: 1177 FCCFCKNKMVGGKDIYMHRGEKSFC--GCHSREILADEELEKPMXXXXXXXPRSTYCE-- 1344 FC C K+ GKDIY++RGEK+FC C S EIL DEELEK P S + Sbjct: 354 FCYSCNKKLEDGKDIYIYRGEKAFCSLSCRSLEILMDEELEKSNDKDPENPPNSHDVDHD 413 Query: 1345 -------EIFSTGTVIGT 1377 E+F TG + T Sbjct: 414 DDDDDGKELFETGLIAAT 431 >ref|XP_006596129.1| PREDICTED: uncharacterized protein LOC100789230 isoform X1 [Glycine max] Length = 425 Score = 216 bits (550), Expect = 3e-53 Identities = 152/415 (36%), Positives = 197/415 (47%), Gaps = 46/415 (11%) Frame = +1 Query: 271 EIMLRKRSRSVQKDQNKGHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXX 450 
EIMLRKR+RS+QKDQ+ H ++S++N +S LG K++S F+ P +FVG+ Sbjct: 18 EIMLRKRTRSIQKDQH--HTGQMAISDTNSESHALGGNGKSNSIFNAPLLFVGMGHKGLL 75 Query: 451 XXXXXXXXXXXLDYRVFSNLGNPFRFSRSCPN-GPQKSWDCGKVGLGIVDSLNDETK--- 618 LD+ SNL NPFR S N GP +SWDC KVGL I+DSL + +K Sbjct: 76 DCDSVKSPTSPLDFGFLSNLSNPFRTPSSLSNEGPHRSWDCAKVGLSIIDSLEECSKFSW 135 Query: 619 --LGISETRNILFGSHRKINIPSLD-------------------------------ESDV 699 L SE++ P ES V Sbjct: 136 KILQASESKKTSLCPQMITKAPKCKSYMDSTQASKSLPKDFCKIPCTQNGSIVPKGESTV 195 Query: 700 AFRTGEIQLEPNQFHKIGSSSLNSDKSESHPTGLTYYNPNFSLQYIRSDKKTTRSSSPLM 879 F GE LE F K S SL+S + +GLT NF K S + Sbjct: 196 LFEIGETPLEHEFFGKAVSFSLDSYSPTKYLSGLT--GSNFDTDSENFALKQMCSPPHFI 253 Query: 880 GGS-DDVDIFLGMQTNSLPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFC 1056 GGS ++ I L + NS P++ + SLSA EIE SEDYTCVISHGPN +TTHIFC Sbjct: 254 GGSQNNTKILLPSELNSNPVAAVYSNEFIESLSACEIENSEDYTCVISHGPNAKTTHIFC 313 Query: 1057 DCILECHTNELENCTKRNG--------SPRVIKSPEGSALYATDDFLSFCCFCKNKMVGG 1212 CILE H N+ E K S ++ +P Y + DFLS C C K+ G Sbjct: 314 GCILETHANDSERHYKAEEEGKGLSLFSVNILHTPN---QYPSHDFLSVCYHCNKKLEEG 370 Query: 1213 KDIYMHRGEKSFCGCHSREILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVIGT 1377 KDIY++RGEKSFC REI + ++ P+ + E+F TGT I T Sbjct: 371 KDIYIYRGEKSFCSLSCREIEIMMDEQEKSNSSPENSPKCGFGGEVFETGTPIAT 425 >ref|XP_007024098.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 4 [Theobroma cacao] gi|508779464|gb|EOY26720.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 4 [Theobroma cacao] Length = 392 Score = 215 bits (547), Expect = 6e-53 Identities = 155/394 (39%), Positives = 199/394 (50%), Gaps = 44/394 (11%) Frame = +1 Query: 322 GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVF 501 G++M D SES FQS+ LG ++ +SS F++PG VG LD RVF Sbjct: 3 GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62 Query: 502 SNLGNPFRF---SRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGS 657 +N NPF S +G QK WDC K+GLGIV+ L DE K L + +NI+FG Sbjct: 63 ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122 Query: 658 
HRKINIPSLDESDVAFRTGEIQ----------------LEPNQFHKIGSSSL-------- 765 K PS F ++ +PN G SSL Sbjct: 123 QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNT--NSGGSSLVFGNEEVP 180 Query: 766 ---NSDKSESHPTGL-TYYNPNFSLQYIRSDKKTT--RSSSPLMGGSDDVDIFLGMQTNS 927 SD S P+ + + N N S + S+ TT SSS +G + VD L + +S Sbjct: 181 LEPKSDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLSKPSS 240 Query: 928 LPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKR 1107 LPI +G GSLSA EIELSEDYTC+ISHGPNP+TTHIF DCILECH EL N K+ Sbjct: 241 LPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTNFDKK 297 Query: 1108 ----NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSRE 1269 ++ KSPE S Y +D+FLSFC C+ K+ +DIY+ GEK+FC C S E Sbjct: 298 AEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYI--GEKAFCSFDCRSEE 355 Query: 1270 ILADEELEKPMXXXXXXXPRSTYCEEIFSTGTVI 1371 I A EE+EK P + E++F G I Sbjct: 356 IFA-EEMEKTCNNSFNGSPEQSDDEDLFLMGMPI 388 >ref|XP_007024100.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 6 [Theobroma cacao] gi|508779466|gb|EOY26722.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 6 [Theobroma cacao] Length = 401 Score = 213 bits (543), Expect = 2e-52 Identities = 153/388 (39%), Positives = 197/388 (50%), Gaps = 44/388 (11%) Frame = +1 Query: 322 GHLMPDSVSESNFQSEVLGQKYKNSSFFSVPGIFVGLXXXXXXXXXXXXXXXXXLDYRVF 501 G++M D SES FQS+ LG ++ +SS F++PG VG LD RVF Sbjct: 3 GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62 Query: 502 SNLGNPFRF---SRSCPNGPQKSWDCGKVGLGIVDSLNDETK-----LGISETRNILFGS 657 +N NPF S +G QK WDC K+GLGIV+ L DE K L + +NI+FG Sbjct: 63 ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122 Query: 658 HRKINIPSLDESDVAFRTGEIQ----------------LEPNQFHKIGSSSL-------- 765 K PS F ++ +PN G SSL Sbjct: 123 QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNT--NSGGSSLVFGNEEVP 180 Query: 766 ---NSDKSESHPTGL-TYYNPNFSLQYIRSDKKTT--RSSSPLMGGSDDVDIFLGMQTNS 927 SD S P+ + + N N S + S+ TT SSS +G + VD L + +S Sbjct: 181 LEPKSDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLSKPSS 240 
Query: 928 LPISIGSGDGLTGSLSASEIELSEDYTCVISHGPNPRTTHIFCDCILECHTNELENCTKR 1107 LPI +G GSLSA EIELSEDYTC+ISHGPNP+TTHIF DCILECH EL N K+ Sbjct: 241 LPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTNFDKK 297 Query: 1108 ----NGSPRVIKSPEGSALYATDDFLSFCCFCKNKMVGGKDIYMHRGEKSFCG--CHSRE 1269 ++ KSPE S Y +D+FLSFC C+ K+ +DIY+ GEK+FC C S E Sbjct: 298 AEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYI--GEKAFCSFDCRSEE 355 Query: 1270 ILADEELEKPMXXXXXXXPRSTYCEEIF 1353 I A EE+EK P + E++F Sbjct: 356 IFA-EEMEKTCNNSFNGSPEQSDDEDLF 382