BLASTX nr result
ID: Akebia22_contig00010802
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00010802 (7960 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:                          Score (bits)  E Value

ref|XP_002278494.1| PREDICTED: uncharacterized protein LOC100259...  443  e-121
ref|XP_004139818.1| PREDICTED: uncharacterized protein LOC101210...  398  e-107
ref|XP_007221792.1| hypothetical protein PRUPE_ppa003948mg [Prun...  367  5e-98
ref|XP_004300997.1| PREDICTED: uncharacterized protein LOC101304...  353  6e-94
emb|CBI26253.3| unnamed protein product [Vitis vinifera]             348  3e-92
ref|XP_004496723.1| PREDICTED: uncharacterized protein LOC101489...  321  3e-84
ref|XP_004247355.1| PREDICTED: uncharacterized protein LOC101252...  320  5e-84
ref|XP_007034759.1| Uncharacterized protein TCM_020625 [Theobrom...  320  7e-84
ref|XP_006360795.1| PREDICTED: uncharacterized protein LOC102579...  313  7e-82
ref|XP_007143259.1| hypothetical protein PHAVU_007G057400g [Phas...  305  2e-79
ref|XP_002517137.1| conserved hypothetical protein [Ricinus comm...  305  3e-79
ref|XP_003556049.1| PREDICTED: uncharacterized protein LOC100805...  304  4e-79
ref|XP_006826283.1| hypothetical protein AMTR_s00004p00052660 [A...  298  4e-77
ref|XP_007143258.1| hypothetical protein PHAVU_007G057400g [Phas...  291  3e-75
ref|XP_006406164.1| hypothetical protein EUTSA_v10020450mg [Eutr...  289  2e-74
gb|EXB41290.1| hypothetical protein L484_004460 [Morus notabilis]    287  7e-74
emb|CAN69769.1| hypothetical protein VITISV_022064 [Vitis vinifera]  284  4e-73
ref|XP_006406165.1| hypothetical protein EUTSA_v10020450mg [Eutr...  283  7e-73
ref|XP_006489387.1| PREDICTED: uncharacterized protein LOC102629...
281 3e-72 ref|XP_003535649.1| PREDICTED: protein SUPPRESSOR OF GENE SILENC... 279 2e-71 >ref|XP_002278494.1| PREDICTED: uncharacterized protein LOC100259596 [Vitis vinifera] Length = 582 Score = 443 bits (1140), Expect = e-121 Identities = 263/490 (53%), Positives = 308/490 (62%), Gaps = 27/490 (5%) Frame = +1 Query: 3634 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 3801 MAGGNPK +RK+RWESG+NP D KSG N+ S P P+++PK Sbjct: 1 MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57 Query: 3802 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 3975 + SG S KP +D L DP YGFH Sbjct: 58 ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPAPQYGFH 108 Query: 3976 MLDRRTIALADGSVRSYFALPPDYQDFPPHG-RPFDPSERFFPFGHGGREPEPGGMGFGF 4152 ML+RRTI LADGSVRSYFAL PDYQDFPP R DP+ RF P G G PEP G G G Sbjct: 109 MLERRTIVLADGSVRSYFALSPDYQDFPPPPPRAMDPAGRFLPMGPG-HGPEPVGPGLG- 166 Query: 4153 DKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDER 4332 FP G +SPEGFR +RD+ + RG DYWNSLGLDGR EGS+KRKY + DER Sbjct: 167 --RFPLTGPMSPEGFRGERDDPYSRGRH-QDYWNSLGLDGRG--HPEGSMKRKYSEEDER 221 Query: 4333 DVR---------DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHP 4485 D R DEFARQRQQLL Y DR+ YL+G SSPF R Sbjct: 222 DRREDRDRRDGNDEFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-V 277 Query: 4486 MDSARRIDEIRSSKHMRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNE 4638 MD R DE+RSSK+MR+GG Y G +V LKH +VDQ ALKKAF++FVK +NE Sbjct: 278 MDPIRG-DELRSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINE 336 Query: 4639 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHS--ADLHVDHLGLHKAL 4812 + +Q++ YLEDGK G L+CLACGR+SK+FPD+H L+MH YNS+S A+L VDHLGLHKAL Sbjct: 337 SASQRRLYLEDGKQGPLRCLACGRSSKDFPDMHALVMHTYNSNSDNANLLVDHLGLHKAL 396 Query: 4813 CVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGN 4992 CVL+GWNY+ PDNSK YQ +SADEAAAN DDLIMWPP VIIHNT SG+ KDGRMEG+GN Sbjct: 397 CVLLGWNYSMPPDNSKTYQFLSADEAAANQDDLIMWPPTVIIHNTVSGKGKDGRMEGLGN 456 Query: 4993 KVMDNKLKGI 5022 K MDNKL+ + Sbjct: 457 KAMDNKLRDL 466 >ref|XP_004139818.1| 
PREDICTED: uncharacterized protein LOC101210911 [Cucumis sativus] gi|449492576|ref|XP_004159037.1| PREDICTED: uncharacterized LOC101210911 [Cucumis sativus] Length = 564 Score = 398 bits (1023), Expect = e-107 Identities = 228/478 (47%), Positives = 282/478 (58%), Gaps = 15/478 (3%) Frame = +1 Query: 3634 MAGG---NPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQS 3804 MAGG N +RK+RWES +N P + SKP+ PSS Sbjct: 1 MAGGSNTNKSSQKPSSSSAAASHRKSRWESSSNNPPSLPKSDSKSSKPHHPSSK-SGISP 59 Query: 3805 QGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLD 3984 + P P P L P++S YGFHML+ Sbjct: 60 NSTHPKHPTDKPLNPTPASAPLPSPGLP---LPFPDLSALGPPPPPS------YGFHMLE 110 Query: 3985 RRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDKHF 4164 RRTI LADGSVRSYFALP DY +F P R D + RF P G E GG FD F Sbjct: 111 RRTIVLADGSVRSYFALPLDYHEFTPPARSMDLAARFLPMGAAASGHEYGG----FDHRF 166 Query: 4165 PPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRD 4344 PPGG +SP+ FR R+E FGRG P D+WNS G D R + + S+KRK+ D E+D +D Sbjct: 167 PPGGPMSPDEFRGAREEQFGRGR-PQDHWNSRGTDERGGPA-DSSMKRKFNDDSEKDRKD 224 Query: 4345 E---FARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 4515 E +R++QQLLH R ++L+GT+ D R ++ Sbjct: 225 EKDDLSRRQQQLLHNGNPNGFLTGSGER----RGDFLAGTS----------DPYGRTEDT 270 Query: 4516 RSSKHMRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLE 4668 R SK+MR GG Y G V+ K+ +VDQ AL+KAFL FVK++NEN QKKNYLE Sbjct: 271 RFSKYMRAGGSYENEGLRLGNGNSVAPKYLEVDQSALRKAFLHFVKTINENANQKKNYLE 330 Query: 4669 DGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVP 4848 DGK+G LQCLAC R+S++FPD+HGLIMH YNS SAD VDHLGLHKALCVLMGWNY+K P Sbjct: 331 DGKHGRLQCLACARSSRDFPDMHGLIMHTYNSESADSQVDHLGLHKALCVLMGWNYSKPP 390 Query: 4849 DNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 DNS+ Y+ +SADEAAAN +DLIMWPPLVIIHNT +G+SKDGRMEG+GNK MD+K++ + Sbjct: 391 DNSRGYRFLSADEAAANQEDLIMWPPLVIIHNTITGKSKDGRMEGLGNKAMDSKIRDL 448 >ref|XP_007221792.1| hypothetical protein PRUPE_ppa003948mg [Prunus persica] gi|462418728|gb|EMJ22991.1| hypothetical 
protein PRUPE_ppa003948mg [Prunus persica] Length = 539 Score = 367 bits (942), Expect = 5e-98 Identities = 220/465 (47%), Positives = 262/465 (56%), Gaps = 21/465 (4%) Frame = +1 Query: 3691 NRKTRWESGNNPQPDHKSGTN----AESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXX 3858 NRK+RWES NP + T ++ KP KP+S P + S PS P P Sbjct: 22 NRKSRWESSPNPAAAATAITTKNNPSDPKPAKPNSGPSPKPGATSTPSHPKHPPSAPSPG 81 Query: 3859 XXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALP 4038 P V YGFHML+RRT LADGSVRSYFALP Sbjct: 82 PAPFPFPDPAAFGPPPPPV----------------YGFHMLERRTFVLADGSVRSYFALP 125 Query: 4039 PDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEA 4218 PDYQ+FPP P DPS RF PFG GG PPG Sbjct: 126 PDYQEFPP---PMDPSGRFLPFGPGG----------------PPGP-------------- 152 Query: 4219 FGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG-DERDVRDEFARQRQQLLHYXXXXX 4395 GP DYWNSLGLDGR P EG KRKY + D+RD EF +R Q + + Sbjct: 153 -----GP-DYWNSLGLDGRGPA--EGPAKRKYAEEEDQRDKAGEFGMRRPQFMQHANPNG 204 Query: 4396 XXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHMRVGGDY-------- 4551 R +L+ TSSPF R+ D R +E R++K+MR+GG Sbjct: 205 FPVGPG-----SRGEFLA-ETSSPFRRE-AADQGRGGEEARANKYMRIGGGGYESAGFRL 257 Query: 4552 --------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG 4707 G +V KH VDQ ALKKAFL +VK ++EN Q+K YLEDGK G L CLAC Sbjct: 258 GGGGGGGGGENVVHKHVQVDQSALKKAFLNYVKLIHENTQQRKIYLEDGKNGRLHCLACA 317 Query: 4708 RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADE 4887 R+SK+FPD+H LIMH+YNS +ADL VDHLGLHKALCVLMGW+Y K PDNSKAYQ +SA+E Sbjct: 318 RSSKDFPDMHSLIMHSYNSDNADLRVDHLGLHKALCVLMGWDYLKPPDNSKAYQFLSAEE 377 Query: 4888 AAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 AAAN DDLIMWPP+VIIHNT +G+SKDGRMEG+GNK MD+ ++ + Sbjct: 378 AAANVDDLIMWPPVVIIHNTVTGKSKDGRMEGLGNKAMDSIIRDL 422 >ref|XP_004300997.1| PREDICTED: uncharacterized protein LOC101304679 [Fragaria vesca subsp. 
vesca] Length = 529 Score = 353 bits (907), Expect = 6e-94 Identities = 225/501 (44%), Positives = 266/501 (53%), Gaps = 38/501 (7%) Frame = +1 Query: 3634 MAGGN-PKGXXXXXXXXXXX--NRKTRWESG------------NNPQPDHKSGTNAESKP 3768 MAGGN PKG NRK+RWES N PD K T KP Sbjct: 1 MAGGNHPKGPPHKPSSSSSAASNRKSRWESSPSTNNKNNQNHRNKNPPDPKPATGPSPKP 60 Query: 3769 NKPSS--NPKDEQ--SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXX 3936 K +S NPK S G+ P P DP + P V Sbjct: 61 GKTASPANPKHPPAPSPGAAPPFPFPDPSSFGPPP---------------PPV------- 98 Query: 3937 XXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGG 4116 YGFH L+RRTI LADG+VRSYFALPPDYQDFPP DPS RF P Sbjct: 99 ---------YGFHNLERRTIVLADGTVRSYFALPPDYQDFPPPH--MDPSGRFLP----- 142 Query: 4117 REPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEG 4296 FG GG DYWNSLG+DGR + Sbjct: 143 ----------------------------------FGPGGPAPDYWNSLGIDGRGGPQEGS 168 Query: 4297 SLKRKYGDGDE-RDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFH 4473 S+KRK+G+ +E RD +E A++RQQL+ N SSPF Sbjct: 169 SMKRKFGEEEEHRDKGEELAKRRQQLVQLG----------------NPNGFPAGPSSPFR 212 Query: 4474 RDHPMDSARRIDEIRSSKHMRVGGDY---------------GVDVSLKHPDVDQQALKKA 4608 R+ S R D+ R+SK MR GG + G +V K+ VDQ ALKKA Sbjct: 213 REMGAQS-RSGDDPRASKFMRTGGGFENVGFRQSGGSGGGGGDNVGHKYLQVDQAALKKA 271 Query: 4609 FLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG---RASKEFPDVHGLIMHAYNSHSADL 4779 FL F K +NEN AQKK Y+EDGK G L CLACG R++K+FPD+H LIMH+YN+ +AD+ Sbjct: 272 FLYFAKVINENGAQKKIYIEDGKQGRLNCLACGTTGRSAKDFPDMHSLIMHSYNTDNADI 331 Query: 4780 HVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGR 4959 VDHLGLHKALCVLMGWNY K PDNSKAYQ +SADEAAAN DDLIMWPP+VIIHNT +G+ Sbjct: 332 RVDHLGLHKALCVLMGWNYLKPPDNSKAYQFLSADEAAANQDDLIMWPPMVIIHNTLTGK 391 Query: 4960 SKDGRMEGMGNKVMDNKLKGI 5022 SKDGRMEG+GNK MD+ ++ + Sbjct: 392 SKDGRMEGLGNKAMDSYIRAL 412 >emb|CBI26253.3| unnamed protein product [Vitis vinifera] Length = 507 Score = 348 bits (892), Expect = 3e-92 Identities = 222/480 (46%), Positives = 261/480 (54%), Gaps = 17/480 (3%) Frame = +1 Query: 3634 
MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 3801 MAGGNPK +RK+RWESG+NP D KSG N+ S P P+++PK Sbjct: 1 MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57 Query: 3802 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 3975 + SG S KP +D L DP YGFH Sbjct: 58 ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPAPQYGFH 108 Query: 3976 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFD 4155 ML+RRTI LADGSVRSYFAL PDYQDFPP P P M Sbjct: 109 MLERRTIVLADGSVRSYFALSPDYQDFPP--------------------PPPRAMD---- 144 Query: 4156 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 4335 P GR P G G GP Sbjct: 145 ----PAGRFLP----------MGPGHGP-------------------------------- 158 Query: 4336 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 4515 + FARQRQQLL Y DR+ YL+G SSPF R MD R DE+ Sbjct: 159 --EPFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-VMDPIRG-DEL 211 Query: 4516 RSSKHMRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLE 4668 RSSK+MR+GG Y G +V LKH +VDQ ALKKAF++FVK +NE+ +Q++ YLE Sbjct: 212 RSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINESASQRRLYLE 271 Query: 4669 DGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHS--ADLHVDHLGLHKALCVLMGWNYAK 4842 DGK G L+CLACGR+SK+FPD+H L+MH YNS+S A+L VDHLGLHKALCVL+GWNY+ Sbjct: 272 DGKQGPLRCLACGRSSKDFPDMHALVMHTYNSNSDNANLLVDHLGLHKALCVLLGWNYSM 331 Query: 4843 VPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 PDNSK YQ +SADEAAAN DDLIMWPP VIIHNT SG+ KDGRMEG+GNK MDNKL+ + Sbjct: 332 PPDNSKTYQFLSADEAAANQDDLIMWPPTVIIHNTVSGKGKDGRMEGLGNKAMDNKLRDL 391 >ref|XP_004496723.1| PREDICTED: uncharacterized protein LOC101489729 [Cicer arietinum] Length = 491 Score = 321 bits (823), Expect = 3e-84 Identities = 208/476 (43%), Positives = 255/476 (53%), Gaps = 15/476 (3%) Frame = +1 Query: 3634 MAGGN-PKGXXXXXXXXXXXNRKTRWESGNNPQPDH-KSGTNAESKPN--KPSSNPKDEQ 3801 MAGGN PK +RKTRWES + P + KS ++ +SKPN P+SNP + Sbjct: 1 MAGGNHPKSSSSS-------HRKTRWESNTSATPTNTKSPSDPKSKPNHNNPNSNPNQKP 53 Query: 3802 
SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHML 3981 + P + +D +P YGFHML Sbjct: 54 NPNPSPKQHPNDHPALIPFQ------------FPEPG-----------PPPPPAYGFHML 90 Query: 3982 DRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDKH 4161 +RRTI LADGSVRSYFALPPDYQDF P RP D F+ Sbjct: 91 ERRTIILADGSVRSYFALPPDYQDFAPPPRPLDR----------------------FNMR 128 Query: 4162 FPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVR 4341 FPP R DY N + + S KRKYG+ + R Sbjct: 129 FPPVVRHP-------------------DYQNPM---------EASSAKRKYGE----EGR 156 Query: 4342 DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTT-----SSPFHRDHPMDSARRI 4506 DEFARQR+QLL AN + G S P RD M+S Sbjct: 157 DEFARQREQLLRNANGF--------------ANRVPGGEFPVGPSGPLKRDM-MESI--- 198 Query: 4507 DEIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGS 4686 ++R SKH RV G V+ + +H V Q ALKKAFL+FV+ +N+N KK++LEDGK G Sbjct: 199 -DLRPSKHSRVDGVGSVNNNARHVQVAQDALKKAFLQFVRLINDNTLLKKSFLEDGKQGR 257 Query: 4687 LQCLACG------RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVP 4848 LQC+ACG R++K+F D+H LIMH YNS +ADL HLGLHKALCVLMGWNY+K P Sbjct: 258 LQCVACGSAGGSNRSAKDFSDMHALIMHTYNSDNADLSAGHLGLHKALCVLMGWNYSKPP 317 Query: 4849 DNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK 5016 DNSKAYQ +SADEA AN DDLIMWPPLVI+HNTN+G+S+DGRMEG+GNK MDNK++ Sbjct: 318 DNSKAYQFLSADEAEANQDDLIMWPPLVIVHNTNTGKSRDGRMEGLGNKWMDNKIR 373 >ref|XP_004247355.1| PREDICTED: uncharacterized protein LOC101252627 [Solanum lycopersicum] Length = 512 Score = 320 bits (821), Expect = 5e-84 Identities = 202/473 (42%), Positives = 248/473 (52%), Gaps = 10/473 (2%) Frame = +1 Query: 3634 MAGGNPKGXXXXXXXXXXXNRKTRWES--GNNPQPDHKS---GTNAESKPNKPSSNPKDE 3798 MAGGNP +RK+RWES G P D K+ G A S P S P + Sbjct: 1 MAGGNPPKPSSNKPAPSASHRKSRWESTTGKKPSSDPKTSVAGAGAASGSGDPKSKPSPK 60 Query: 3799 QSQGSGPS----KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX- 3963 + P+ KP+S P DPN Sbjct: 61 PTNPIQPTTPNPKPISKPSPKP-----------------DPNAHFGLPPFPFRDPPPPPL 103 Query: 3964 YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMG 4143 
YGFHML+RRTI LADGSVRSYFALP DYQDFP RP G G Sbjct: 104 YGFHMLERRTIVLADGSVRSYFALPHDYQDFPAFPRP----------------DFRGPPG 147 Query: 4144 FGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG 4323 GF++ FP +GF R+R+ D+WN LG++G +G++KRK+GD Sbjct: 148 LGFERQFPD------DGFMRNRNP---------DHWNPLGVEGGRV--GDGAMKRKFGD- 189 Query: 4324 DERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARR 4503 + +D R RQQ+L + G++SS R M+ Sbjct: 190 ---EGKDGLDRLRQQVLEHGNAGPVPP---------------GSSSSYMGRGEEMN---- 227 Query: 4504 IDEIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYG 4683 R K+MR GG G KH +VDQ ALKK+FL VK + + K++YL DGK G Sbjct: 228 ----RPPKYMRSGGFEGRASRTKHNEVDQSALKKSFLPMVKLIFDTANVKRSYLADGKQG 283 Query: 4684 SLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKA 4863 LQCLAC R SK+FPD+H LIMHAYN SAD VDHL HKALCVLMGWNY PD+SK+ Sbjct: 284 RLQCLACNRTSKDFPDMHSLIMHAYNPDSADSLVDHLAFHKALCVLMGWNYLTPPDHSKS 343 Query: 4864 YQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 YQ++SADEA AN DDL++WPPLVIIHNT +G+ DGRMEG+GNK MD+ LKGI Sbjct: 344 YQMLSADEATANRDDLVLWPPLVIIHNTITGKRDDGRMEGLGNKAMDSYLKGI 396 >ref|XP_007034759.1| Uncharacterized protein TCM_020625 [Theobroma cacao] gi|508713788|gb|EOY05685.1| Uncharacterized protein TCM_020625 [Theobroma cacao] Length = 496 Score = 320 bits (820), Expect = 7e-84 Identities = 214/481 (44%), Positives = 259/481 (53%), Gaps = 18/481 (3%) Frame = +1 Query: 3634 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGS 3813 MAG NP +RK+RWES + S PNK S+ K + S + Sbjct: 1 MAGPNPP---KQPSSSSNNHRKSRWESSS-------------SIPNKNPSSTKPKPSPKT 44 Query: 3814 GPS-KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX----YGFHM 3978 GPS P + K+ SDPN + YGFHM Sbjct: 45 GPSPSPATQNKSQ-----------------SDPNPALPPIPFPDPAALGPPPPPAYGFHM 87 Query: 3979 LDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDK 4158 L+RRTI L DGSVRSYFALP DYQ+FP RP Sbjct: 88 LERRTIVLYDGSVRSYFALPSDYQEFPT--RPL--------------------------- 118 Query: 4159 
HFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV 4338 PP P GFR +R DYWN P G KRKYG+ +E+D+ Sbjct: 119 LVPPDFGSPPLGFRDNR-----------DYWNG-------PGEGPGLFKRKYGE-EEKDL 159 Query: 4339 R----DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRI 4506 R +EFARQR DR L+G TSSPF R Sbjct: 160 REEKKEEFARQRH------GHPNAKVYSSGPGWPDR---LAG-TSSPF----------RN 199 Query: 4507 DEIRSSKHMRVGGDY---GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGK 4677 +E+R++K+MRVGG + + + KH +VDQ ALKKAFL FVK++ EN AQKKNYLEDGK Sbjct: 200 EEMRAAKYMRVGGGFENNNLGFNNKHLEVDQNALKKAFLHFVKAVFENAAQKKNYLEDGK 259 Query: 4678 YGSLQCLACG------RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYA 4839 G LQCLACG R+SK+FPD+HGLIMH Y S +ADL VDHLGLHKALCVLMGWNY+ Sbjct: 260 QGRLQCLACGRFDDKFRSSKDFPDMHGLIMHTYYSDNADLRVDHLGLHKALCVLMGWNYS 319 Query: 4840 KVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKG 5019 K PDNSK Y+ + ADEAAAN +DLIMWPP+VI+HNT +G+SKDGRMEG+GNK MD+KL+ Sbjct: 320 KPPDNSKVYRFLPADEAAANQEDLIMWPPVVIVHNTITGKSKDGRMEGLGNKAMDSKLRD 379 Query: 5020 I 5022 + Sbjct: 380 L 380 >ref|XP_006360795.1| PREDICTED: uncharacterized protein LOC102579696 [Solanum tuberosum] Length = 513 Score = 313 bits (803), Expect = 7e-82 Identities = 198/474 (41%), Positives = 249/474 (52%), Gaps = 11/474 (2%) Frame = +1 Query: 3634 MAGGNP---KGXXXXXXXXXXXNRKTRWES--GNNPQPDHKSGT-NAESKPNKPSSNPKD 3795 MAGGNP +RK+RWES G P D K+ A S P S P Sbjct: 1 MAGGNPPKPSSSKPAPSSASASHRKSRWESTTGKKPSSDPKTSVAGAASGSGDPKSKPSP 60 Query: 3796 EQSQGSGPS----KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX 3963 + + + P+ KP+ +P DPN Sbjct: 61 KTTNPNHPTTPNPKPIKNPSPKP-----------------DPNAHFGLPPFPFRDPPPPP 103 Query: 3964 -YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGM 4140 YGFHML+RRTI LADGSVRSYFALP DYQDFP RP G Sbjct: 104 LYGFHMLERRTIVLADGSVRSYFALPHDYQDFPAFTRP----------------DFRGPP 147 Query: 4141 GFGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGD 4320 G GF++ FP +GF R+R+ D+WN +G++G +G++KRK+GD Sbjct: 148 GLGFERQFPD------DGFMRNRNP---------DHWNPIGVEGGRV--GDGAMKRKFGD 190 
Query: 4321 GDERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSAR 4500 + +D R RQQ+L + G++S R M+ Sbjct: 191 ----EGKDGLDRLRQQVLEHGNAGPVPP---------------GSSSLYMGRGEEMN--- 228 Query: 4501 RIDEIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKY 4680 R +K+MR GG G KH +VDQ ALKK+FL VK + + K++YL DGK Sbjct: 229 -----RPAKYMRSGGFEGSASRTKHNEVDQSALKKSFLLMVKLIFDTANVKRSYLADGKQ 283 Query: 4681 GSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSK 4860 G LQCLAC R SK+FPD+H LIMHAYNS SAD VDHL HKALCVLMGW+Y PD+SK Sbjct: 284 GRLQCLACNRTSKDFPDMHSLIMHAYNSESADSLVDHLAFHKALCVLMGWSYLTPPDHSK 343 Query: 4861 AYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 +YQ++SADEA AN DDL++WPPLVIIHNT +G+ DGRMEG+GNK MD+ LKGI Sbjct: 344 SYQMLSADEATANRDDLVLWPPLVIIHNTITGKRDDGRMEGLGNKAMDSYLKGI 397 >ref|XP_007143259.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris] gi|561016449|gb|ESW15253.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris] Length = 478 Score = 305 bits (781), Expect = 2e-79 Identities = 198/455 (43%), Positives = 239/455 (52%), Gaps = 9/455 (1%) Frame = +1 Query: 3691 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXXXXXX 3870 +RK+RWE N+ P K +N P PS +P PS P+ P Sbjct: 27 HRKSRWEP-NSSSPKPKPNSNPNPSPKHPSDHPSLLPFPFPDPS-PLGPPPPPA------ 78 Query: 3871 XXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQ 4050 YGFHML+RRTI LADGSVRSYFALP DYQ Sbjct: 79 -------------------------------YGFHMLERRTIVLADGSVRSYFALPLDYQ 107 Query: 4051 DFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEAFGRG 4230 DF P RP D F FPP LSP FR Sbjct: 108 DFAP--RPLD-----------------------FLHRFPPP--LSPGRFRL--------- 131 Query: 4231 GGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRDEFARQRQQLLHYXXXXXXXXXX 4410 P G+ KRKYGD D RD+ ARQR+QLL Sbjct: 132 ----------------PDFPPGASKRKYGDDDGS--RDDLARQREQLLR----------- 162 Query: 4411 XXXXXXDRANYLSGTTSSPFHRDHPMDSARRID-----EIRSSKHMRVGGDYGVDVSLKH 4575 AN LS + F + + + E+R SKH R G + S +H Sbjct: 163 
-------NANGLSRISGGEFSAGPSGGTPLKRELVDPPEMRPSKHSRHDG---ANFS-RH 211 Query: 4576 PDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG--RASKEFPDVHGLIM 4749 VDQ ALK+AF+ F K +N+N++QK++YLEDGK G L CLACG R++K+FPD+H LIM Sbjct: 212 SQVDQDALKRAFVNFAKLINDNVSQKRSYLEDGKQGRLHCLACGTGRSAKDFPDMHSLIM 271 Query: 4750 HAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPL 4929 H YNS +AD VDHLGLHKALCVLMGWNY+K PDNSKAYQ +S+DEAAAN DDLIMWPPL Sbjct: 272 HTYNSDNADSQVDHLGLHKALCVLMGWNYSKPPDNSKAYQFLSSDEAAANQDDLIMWPPL 331 Query: 4930 VIIHNTNSGRSKDGRMEGMGNKVMDNKLK--GILG 5028 VIIHNTN+G+++DGRMEG+GNK MDNK++ G +G Sbjct: 332 VIIHNTNTGKNRDGRMEGLGNKTMDNKIRELGFMG 366 >ref|XP_002517137.1| conserved hypothetical protein [Ricinus communis] gi|223543772|gb|EEF45300.1| conserved hypothetical protein [Ricinus communis] Length = 505 Score = 305 bits (780), Expect = 3e-79 Identities = 187/447 (41%), Positives = 236/447 (52%), Gaps = 3/447 (0%) Frame = +1 Query: 3691 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXXXXXX 3870 +RK+RWES + P S +N ++K +SNP + + + P T Sbjct: 20 HRKSRWESSSTNNPTSDSKSNHQTKQPPSNSNPSPKPLTNNNNNTNNRTPATPSNSSLPP 79 Query: 3871 XXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQ 4050 + P YGFHML+RRTIALADGSVRSYFALPPDYQ Sbjct: 80 GSTLPFHDLAPPP-------PPVPPPPPPPTYGFHMLERRTIALADGSVRSYFALPPDYQ 132 Query: 4051 DFPPHGRPFDPSERFFPFGHGGREPE-PGGMGFGFDKHFPPGGRLSPEGFR-RDRDEAFG 4224 DFP P RF P G P+ PGG FPP +SP+G RD ++ Sbjct: 133 DFP-----LRPPLRFPPLGPN---PDFPGG------PRFPP---MSPQGLGFRDHNQN-- 173 Query: 4225 RGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRDEFARQRQQLLHYXXXXXXXX 4404 KRK+G G E F+R Sbjct: 174 --------------------------KRKFGGGGE------FSRYGNN----------NN 191 Query: 4405 XXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHMRVG-GDYGVDVSLKHPD 4581 D+ + T+SSPF R D+ R++KHMR G D ++ + KHP+ Sbjct: 192 ITNGSYHPDQLMAGTSTSSSPFRRSFG-------DDFRAAKHMRFGDNDLNINNN-KHPE 243 Query: 4582 VDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYN 4761 VD L KAFL F K +NE A +K YLE+GK G L CL CGR+SK+FPD H L+MH YN Sbjct: 244 
VDHIKLNKAFLHFTKLINETEADRKRYLENGKQGRLMCLVCGRSSKDFPDTHALVMHTYN 303 Query: 4762 SHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIH 4941 S +ADL VDHLGLHKALC+LMGWNY+K PDN+K YQL+ AD AA N DDL+MWPP+VIIH Sbjct: 304 SDNADLRVDHLGLHKALCILMGWNYSKPPDNAKVYQLLPADVAATNQDDLVMWPPMVIIH 363 Query: 4942 NTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 NT +G+ KDGR+EG+GNK MDNK++ + Sbjct: 364 NTVTGKGKDGRIEGLGNKAMDNKIRDL 390 >ref|XP_003556049.1| PREDICTED: uncharacterized protein LOC100805242 [Glycine max] Length = 475 Score = 304 bits (779), Expect = 4e-79 Identities = 204/470 (43%), Positives = 239/470 (50%), Gaps = 9/470 (1%) Frame = +1 Query: 3634 MAGGN-PKGXXXXXXXXXXXNRKTRWESGNNPQP----DHKSGTNAESKPNKPSSNPKDE 3798 M GGN PK +RK+RWE ++ D KS T KPN P+SNP Sbjct: 1 MVGGNHPKSSHHKKPPPSASHRKSRWEPNSSSSAKSPADPKSSTAPSPKPN-PNSNPNPS 59 Query: 3799 QSQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHM 3978 P P DP L P YGFHM Sbjct: 60 PKHLPFPF-PFPDPAPAP---------------LGTP--------------PPPAYGFHM 89 Query: 3979 LDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDK 4158 L+RRTI LADGSVRSYFALP DYQDF P RP D RF P P Sbjct: 90 LERRTIVLADGSVRSYFALPSDYQDFAP--RPLDLPPRF---------PPP--------- 129 Query: 4159 HFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV 4338 LSP FR DY ++ + + KRKYGD D Sbjct: 130 -------LSPGRFRLP------------DYSHA---------AAAAAAKRKYGD-DNGGP 160 Query: 4339 RDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIR 4518 RD+ ARQR+QLL AN LS S E+R Sbjct: 161 RDDLARQREQLLR------------------NANGLSREQFS-----------AGPSELR 191 Query: 4519 SSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCL 4698 SKH R+ G S +H VDQ ALKKAF F K ++EN +QK+ YLEDGK G L CL Sbjct: 192 PSKHSRLDGSN----STRHSQVDQDALKKAFCNFAKLISENASQKRTYLEDGKQGRLHCL 247 Query: 4699 AC----GRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAY 4866 C GR++K+FPD+H LIMH YN +AD +DHLGLHKALCVLM WNY+K PDNSKAY Sbjct: 248 VCGTGTGRSAKDFPDMHALIMHTYNPDNADSRIDHLGLHKALCVLMRWNYSKPPDNSKAY 307 Query: 4867 
QLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK 5016 Q + ADEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK+MDNK++ Sbjct: 308 QFLPADEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKMMDNKIR 357 >ref|XP_006826283.1| hypothetical protein AMTR_s00004p00052660 [Amborella trichopoda] gi|548830597|gb|ERM93520.1| hypothetical protein AMTR_s00004p00052660 [Amborella trichopoda] Length = 575 Score = 298 bits (762), Expect = 4e-77 Identities = 208/486 (42%), Positives = 250/486 (51%), Gaps = 44/486 (9%) Frame = +1 Query: 3691 NRKTRWESGNNP----QPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXX 3858 +RK+RW++ +P Q D K E + PS PK + P PV +P Sbjct: 17 HRKSRWDNSKSPADGPQSDRKKAPAREEEG--PSPKPKPNLNANPNPPPPVPEPSFPVPP 74 Query: 3859 XXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALP 4038 ++PN+ YGFHML+RRTI LADGSVRSYFALP Sbjct: 75 T-------------NEPNIG---------------YGFHMLERRTIVLADGSVRSYFALP 106 Query: 4039 PDYQ-DFPP---HGRPFDPS----ERFFPFGHGGREP------EPGGMGFGFDKHFPPGG 4176 PD DFP H P D + ER P G +P MG FD H P Sbjct: 107 PDPNPDFPNLDLHRFPPDRATLGLERRGPIEPEGFDPGFPRRESNLSMGRAFDFHGPLEN 166 Query: 4177 -RLSPEGFRRDRDEAFGRGG--GPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV--- 4338 R PE FR + G GP + L G E S+KRKY + + R++ Sbjct: 167 LRGPPENFRGPPENLRGPENLRGPPE-----NLHG----PHENSIKRKYVEEEGRELGFS 217 Query: 4339 -----------RDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYL--SGTTSS----- 4464 DE +R R QLL Y + + L SG S Sbjct: 218 REAGPFPGHLQSDELSRHRHQLLQYGNPNPMFDGFQASRLPESGSPLPESGRVSEDMRSL 277 Query: 4465 --PFHRDHPMDSARRIDEIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNE 4638 P + D + SA+ S+K+ R V + PDV+Q AL+KAFLRFVK+LNE Sbjct: 278 KLPRYDDKRVGSAKA-----SAKNAR---PCEAVVLKRLPDVNQDALQKAFLRFVKTLNE 329 Query: 4639 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCV 4818 N +QKKNYLEDGK GSL CL CGR SKEF DVH LIMHAY+ + D+ DHL HKALCV Sbjct: 330 NPSQKKNYLEDGKSGSLHCLVCGRNSKEFSDVHSLIMHAYHMQNVDVRTDHLAFHKALCV 389 Query: 4819 LMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKV 4998 LMGWNYAKVP+NSKAYQ S DEA AN +D I+WPP+VIIHNTN GR KDGR+EGMGNK Sbjct: 390 
LMGWNYAKVPENSKAYQTFSTDEATANKEDHIIWPPIVIIHNTNYGRRKDGRIEGMGNKE 449 Query: 4999 MDNKLK 5016 MD KLK Sbjct: 450 MDTKLK 455 >ref|XP_007143258.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris] gi|561016448|gb|ESW15252.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris] Length = 396 Score = 291 bits (745), Expect = 3e-75 Identities = 177/360 (49%), Positives = 212/360 (58%), Gaps = 9/360 (2%) Frame = +1 Query: 3976 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFD 4155 ML+RRTI LADGSVRSYFALP DYQDF P RP D F Sbjct: 1 MLERRTIVLADGSVRSYFALPLDYQDFAP--RPLD-----------------------FL 35 Query: 4156 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 4335 FPP LSP FR P G+ KRKYGD D Sbjct: 36 HRFPPP--LSPGRFRL-------------------------PDFPPGASKRKYGDDDGS- 67 Query: 4336 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRID-- 4509 RD+ ARQR+QLL AN LS + F + + + Sbjct: 68 -RDDLARQREQLLR------------------NANGLSRISGGEFSAGPSGGTPLKRELV 108 Query: 4510 ---EIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKY 4680 E+R SKH R G + S +H VDQ ALK+AF+ F K +N+N++QK++YLEDGK Sbjct: 109 DPPEMRPSKHSRHDG---ANFS-RHSQVDQDALKRAFVNFAKLINDNVSQKRSYLEDGKQ 164 Query: 4681 GSLQCLACG--RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDN 4854 G L CLACG R++K+FPD+H LIMH YNS +AD VDHLGLHKALCVLMGWNY+K PDN Sbjct: 165 GRLHCLACGTGRSAKDFPDMHSLIMHTYNSDNADSQVDHLGLHKALCVLMGWNYSKPPDN 224 Query: 4855 SKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK--GILG 5028 SKAYQ +S+DEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK MDNK++ G +G Sbjct: 225 SKAYQFLSSDEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKTMDNKIRELGFMG 284 >ref|XP_006406164.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum] gi|557107310|gb|ESQ47617.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum] Length = 545 Score = 289 bits (739), Expect = 2e-74 Identities = 185/459 (40%), Positives = 233/459 (50%), Gaps = 17/459 (3%) Frame = +1 Query: 3691 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSN---------PKDEQSQGSGPSKPVSDPK 3843 +RK+RW S 
NN K+ N + NKP + PK S P+ S P Sbjct: 30 DRKSRWASSNNDGGSSKNNINNNNNSNKPMTGGQKVADNKLPKPNPSPKLAPTPSQSYPN 89 Query: 3844 TXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRS 4023 S + YGFHML+RRTI L DGSVRS Sbjct: 90 HPNPAGPSSRPAPGSAFPASQ--FAFPDSSAALGAPPAPTYGFHMLERRTIVLVDGSVRS 147 Query: 4024 YFALPPDYQDFPP-HGRPFDPSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFR 4200 YFALPP+Y+DFPP R DP+ F MG F + FPP PE FR Sbjct: 148 YFALPPNYRDFPPSQSRLADPAANRF-------------MGPEFSR-FPP---FHPEEFR 190 Query: 4201 RDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYG-----DGDERDVRDEFARQRQ 4365 R W+ EGS+KRK+ D ERD R E RQR Sbjct: 191 DQRQ-----------LWDR----------PEGSMKRKFPGEEEIDRRERDERGEMLRQRH 229 Query: 4366 QLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHMRVGG 4545 Q +HY L TSSPF RD D+ R++KHMR+G Sbjct: 230 QFMHYGNPNDQS--------------LMARTSSPFTRDVGEDA-------RAAKHMRIGS 268 Query: 4546 DYGVD--VSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGRASK 4719 + +L + VDQ ALKK+FL +VK + E+ ++KKNYLE+G G LQCL CGR+ K Sbjct: 269 SRHENGGQALNYLQVDQVALKKSFLGYVKRIYEDPSEKKNYLENGSTGPLQCLVCGRSPK 328 Query: 4720 EFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAAN 4899 + D HGL+MH Y A V HLGLHKALCVLMGWN++K PDNSKAYQ + A+ AA N Sbjct: 329 DVQDTHGLVMHTYYYDDASSRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPAEVAAIN 388 Query: 4900 NDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK 5016 D LI+WPP +I+HNT++G+ KDGRMEG+G+K MDN+++ Sbjct: 389 QDQLIIWPPHIIVHNTSTGKGKDGRMEGLGSKRMDNRIR 427 >gb|EXB41290.1| hypothetical protein L484_004460 [Morus notabilis] Length = 523 Score = 287 bits (734), Expect = 7e-74 Identities = 168/353 (47%), Positives = 209/353 (59%) Frame = +1 Query: 3964 YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMG 4143 YGFHML+RRTI LADGSVRSYFALPPDYQDFPP P+ RFFP Sbjct: 98 YGFHMLERRTIVLADGSVRSYFALPPDYQDFPP------PAARFFP-------------- 137 Query: 4144 FGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG 4323 GG +SP G R +D YWNSLGLDG KRK+ D Sbjct: 138 ---------GGPVSPVGPNRHQD-----------YWNSLGLDG--------PAKRKFPDE 169 Query: 4324 
DERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARR 4503 ++ D R RA+ + T + ++ ++ Sbjct: 170 EDTDQR------------------------RYGEDSRASKYTRTVGGFDNGNNNNNNNVG 205 Query: 4504 IDEIRSSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYG 4683 + + S GGDY + KH DVDQ LKKAFLRFVK LNEN ++K Y E+GK Sbjct: 206 LRQGSGSG----GGDY--NPGHKHLDVDQIELKKAFLRFVKILNENAKERKIYFENGK-- 257 Query: 4684 SLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKA 4863 LQC+ACGR+SK+FPD LI H+YN + DL VDHLGLHKALCVLMGWNY++ PDNS+A Sbjct: 258 RLQCVACGRSSKDFPDTPSLITHSYNYDNDDLRVDHLGLHKALCVLMGWNYSRPPDNSRA 317 Query: 4864 YQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 YQ +SADEAAAN DDLI+WPP+VIIHNT +G++K+GRMEG+GNK+MD +++ + Sbjct: 318 YQFLSADEAAANQDDLILWPPMVIIHNTLTGKNKEGRMEGLGNKLMDARIRDL 370 >emb|CAN69769.1| hypothetical protein VITISV_022064 [Vitis vinifera] Length = 400 Score = 284 bits (727), Expect = 4e-73 Identities = 189/397 (47%), Positives = 223/397 (56%), Gaps = 25/397 (6%) Frame = +1 Query: 3634 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 3801 MAGGNPK +RK+RWESG+NP D KSG N+ S P P+++PK Sbjct: 1 MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57 Query: 3802 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 3975 + SG S KP +D L DP YGFH Sbjct: 58 ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPTPQYGFH 108 Query: 3976 MLDRRTIALADGSVRSYFALPPDYQDFPPHG-RPFDPSERFFPFGHGGREPEPGGMGFGF 4152 ML+RRTI LADGSVRSYFAL PDYQDFPP R DP+ RF P G G PEP G G G Sbjct: 109 MLERRTIVLADGSVRSYFALSPDYQDFPPPPPRAMDPAGRFLPMGPG-HGPEPVGPGLG- 166 Query: 4153 DKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDER 4332 FP G +SPEGFR +RD+ + RG DYWNSLGLDGR EGS+KRKY + DER Sbjct: 167 --RFPXTGPMSPEGFRGERDDPYSRGRH-QDYWNSLGLDGRG--HPEGSMKRKYSEEDER 221 Query: 4333 DVR---------DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHP 4485 D R DEFARQRQQLL Y DR+ YL+G SSPF R Sbjct: 222 DRREDRDRRDGNDEFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-V 277 Query: 4486 
MDSARRIDEIRSSKHMRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNE 4638 MD R DE+RSSK+MR+GG Y G +V LKH +VDQ ALKKAF++FVK +NE Sbjct: 278 MDPIRG-DELRSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINE 336 Query: 4639 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIM 4749 + +Q++ YLEDGK G L+CLACGR K P V L++ Sbjct: 337 SASQRRLYLEDGKQGPLRCLACGRFGKNGPLVPSLLL 373 >ref|XP_006406165.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum] gi|557107311|gb|ESQ47618.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum] Length = 548 Score = 283 bits (725), Expect = 7e-73 Identities = 185/462 (40%), Positives = 233/462 (50%), Gaps = 20/462 (4%) Frame = +1 Query: 3691 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSN---------PKDEQSQGSGPSKPVSDPK 3843 +RK+RW S NN K+ N + NKP + PK S P+ S P Sbjct: 30 DRKSRWASSNNDGGSSKNNINNNNNSNKPMTGGQKVADNKLPKPNPSPKLAPTPSQSYPN 89 Query: 3844 TXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRS 4023 S + YGFHML+RRTI L DGSVRS Sbjct: 90 HPNPAGPSSRPAPGSAFPASQ--FAFPDSSAALGAPPAPTYGFHMLERRTIVLVDGSVRS 147 Query: 4024 YFALPPDYQDFPP-HGRPFDPSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFR 4200 YFALPP+Y+DFPP R DP+ F MG F + FPP PE FR Sbjct: 148 YFALPPNYRDFPPSQSRLADPAANRF-------------MGPEFSR-FPP---FHPEEFR 190 Query: 4201 RDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYG-----DGDERDVRDEFARQRQ 4365 R W+ EGS+KRK+ D ERD R E RQR Sbjct: 191 DQRQ-----------LWDR----------PEGSMKRKFPGEEEIDRRERDERGEMLRQRH 229 Query: 4366 QLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHMRVGG 4545 Q +HY L TSSPF RD D+ R++KHMR+G Sbjct: 230 QFMHYGNPNDQS--------------LMARTSSPFTRDVGEDA-------RAAKHMRIGS 268 Query: 4546 DYGVD--VSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGR--- 4710 + +L + VDQ ALKK+FL +VK + E+ ++KKNYLE+G G LQCL CGR Sbjct: 269 SRHENGGQALNYLQVDQVALKKSFLGYVKRIYEDPSEKKNYLENGSTGPLQCLVCGRFDR 328 Query: 4711 ASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEA 4890 + K+ D HGL+MH Y A V HLGLHKALCVLMGWN++K PDNSKAYQ + A+ A Sbjct: 329 SPKDVQDTHGLVMHTYYYDDASSRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPAEVA 388 Query: 4891 
AANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK 5016 A N D LI+WPP +I+HNT++G+ KDGRMEG+G+K MDN+++ Sbjct: 389 AINQDQLIIWPPHIIVHNTSTGKGKDGRMEGLGSKRMDNRIR 430 >ref|XP_006489387.1| PREDICTED: uncharacterized protein LOC102629231 [Citrus sinensis] Length = 470 Score = 281 bits (720), Expect = 3e-72 Identities = 190/472 (40%), Positives = 232/472 (49%), Gaps = 9/472 (1%) Frame = +1 Query: 3634 MAGGN-PKGXXXXXXXXXXXN--RKTRWESGNNPQPDHKSGTNAES-KPNKPSSNPKDEQ 3801 MAGGN PK + RK+RWES NP D K + P +P S P Sbjct: 1 MAGGNHPKSSSHKPPPSSALSSYRKSRWESPKNPPSDQKPKPSPNKHSPAQPKSLPAPTH 60 Query: 3802 SQGS--GPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 3975 S GP P S+P YGFH Sbjct: 61 PSFSSHGPPLPYSEPPPPPPA-----------------------------------YGFH 85 Query: 3976 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFD 4155 ML+RRTI LADGSVRSYFALPPDY P H P F P Sbjct: 86 MLERRTIVLADGSVRSYFALPPDYDFTPRHNSLLRPEFHFSP------------------ 127 Query: 4156 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 4335 GFR D R+ I+ G +KRK+G +E++ Sbjct: 128 ---------EAAGFR----------------------DRREYINGPGPMKRKFGVDEEKE 156 Query: 4336 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 4515 ++ +R DR L GT+ H D +E Sbjct: 157 LQHLMSRANSS-------------------RDR---LVGTSG---HFD---------EET 182 Query: 4516 RSSKHMRVG-GDYGVDV-SLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYL-EDGKYGS 4686 R++K+MR G G V K+ +VD LKK FL FVK +NEN+A +K+YL EDGK G Sbjct: 183 RAAKYMRTTPGAVGPSVVKHKYDEVDHAMLKKVFLHFVKVINENVALRKSYLVEDGKQGR 242 Query: 4687 LQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAY 4866 LQC+AC R+SK+F D+HGLIMH YNS +ADL VDHLGLHKALCVLMGWNY+K PDNSKAY Sbjct: 243 LQCIACRRSSKDFSDMHGLIMHTYNSDNADLRVDHLGLHKALCVLMGWNYSKPPDNSKAY 302 Query: 4867 QLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKGI 5022 + + DEAAAN DDLIMWPP+VIIHNT +G+ KDGRMEG+GNK MD ++ + Sbjct: 303 KFLPPDEAAANQDDLIMWPPVVIIHNTLTGKGKDGRMEGLGNKAMDKTIRDL 354 >ref|XP_003535649.1| PREDICTED: protein SUPPRESSOR OF GENE SILENCING 3-like [Glycine max] Length = 460 Score = 279 
bits (713), Expect = 2e-71 Identities = 192/468 (41%), Positives = 227/468 (48%), Gaps = 7/468 (1%) Frame = +1 Query: 3634 MAGGN-PKGXXXXXXXXXXXNRKTRWE---SGNNPQPDHKSGTNAESKPNKPSSNPKDEQ 3801 MAGGN PK +RK+RWE S N D KS ++ P KP SN Sbjct: 1 MAGGNHPKSSHHNKPPPSASHRKSRWEPNSSSANSPADPKSKSSTAPSP-KPKSNTNPNP 59 Query: 3802 SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHML 3981 S P P DP L P YGFHML Sbjct: 60 SPKHLPF-PFPDPAP-----------------LGPP--------------PPPAYGFHML 87 Query: 3982 DRRTIALADGSVRSYFALPPDYQDFPPHGRPFDPSERFFPFGHGGREPEPGGMGFGFDKH 4161 +RRTI LADGSVRSYFALPPDYQDF P RP D RF Sbjct: 88 ERRTIVLADGSVRSYFALPPDYQDFAP--RPLDLPPRFC--------------------- 124 Query: 4162 FPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVR 4341 P + R+ D+ GGP D Sbjct: 125 LPDYSYTAAAAKRKYGDD----DGGPRD-------------------------------- 148 Query: 4342 DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI-R 4518 + ARQR+QLL AN +S S D R+D + Sbjct: 149 -DLARQREQLLR------------------NANGISREQFSAGPSDLRPSKHSRLDGLSN 189 Query: 4519 SSKHMRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCL 4698 S++H +V DQ ALKK+F F K +NEN++QK+ LEDGK G L CL Sbjct: 190 STRHSQV---------------DQDALKKSFCNFSKLINENVSQKRTCLEDGKQGRLHCL 234 Query: 4699 AC--GRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQL 4872 AC GR++K+FPD+H LIMH YN +AD VDHLGLHKALCVLMGWNY+K PDNSKAYQ Sbjct: 235 ACGTGRSAKDFPDMHALIMHTYNPDNADSRVDHLGLHKALCVLMGWNYSKPPDNSKAYQF 294 Query: 4873 VSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLK 5016 + ADEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK MDNK++ Sbjct: 295 LPADEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKTMDNKIR 342