BLASTX nr result
ID: Rehmannia22_contig00025637
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00025637 (1400 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score  E
Sequences producing significant alignments:                        (bits) Value

gb|EPS68138.1| hypothetical protein M569_06638 [Genlisea aurea]      225  4e-56
ref|XP_006338569.1| PREDICTED: uncharacterized protein LOC102594...  186  3e-44
ref|XP_002274197.1| PREDICTED: uncharacterized protein LOC100267...  177  1e-41
ref|XP_004232302.1| PREDICTED: uncharacterized protein LOC101252...  173  1e-40
ref|XP_004232301.1| PREDICTED: uncharacterized protein LOC101252...  173  1e-40
ref|XP_006599108.1| PREDICTED: uncharacterized protein LOC102666...  134  7e-29
gb|ESW12953.1| hypothetical protein PHAVU_008G155500g [Phaseolus...  126  3e-26
ref|XP_006604053.1| PREDICTED: uncharacterized protein LOC102668...  125  4e-26
gb|EMJ18867.1| hypothetical protein PRUPE_ppa1027165mg [Prunus p...  124  1e-25
gb|ESW23014.1| hypothetical protein PHAVU_004G012100g [Phaseolus...  119  4e-24
ref|XP_004489260.1| PREDICTED: histone-lysine N-methyltransferas...  116  2e-23
ref|XP_002524654.1| conserved hypothetical protein [Ricinus comm...  108  4e-21
ref|XP_006580692.1| PREDICTED: uncharacterized protein LOC102666...  105  6e-20
emb|CBI34908.3| unnamed protein product [Vitis vinifera]             100  2e-18
ref|XP_006603798.1| PREDICTED: uncharacterized protein LOC102667...   94  2e-16
ref|XP_003618211.1| DNA mismatch repair protein Msh6 [Medicago t...   83  2e-13
ref|XP_004489261.1| PREDICTED: histone-lysine N-methyltransferas...   80  2e-12
gb|EOY02415.1| Tudor/PWWP/MBT superfamily protein, putative isof...   80  3e-12
ref|XP_006446521.1| hypothetical protein CICLE_v10014124mg [Citr...
79 4e-12 ref|XP_006395397.1| hypothetical protein EUTSA_v10005709mg [Eutr... 79 6e-12 >gb|EPS68138.1| hypothetical protein M569_06638 [Genlisea aurea] Length = 759 Score = 225 bits (573), Expect = 4e-56 Identities = 169/485 (34%), Positives = 237/485 (48%), Gaps = 39/485 (8%) Frame = +1 Query: 4 PGEVPPLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNSVEP-----KNPK 168 P EVP LGP ++DW SP S D+IYHRRKQKSV +L+ E+++V+ K P Sbjct: 304 PIEVPILGPSDEDW-PSPSAKLQDSATDRIYHRRKQKSVADLLRESDNVKSSRQRNKAPP 362 Query: 169 RAKVKDEKDPSRKRKVVNAEKKGKPKEIKVDFTENNS----------------GEAKEEP 300 AK ++ KD + V + K K ++I+ +EN +EE Sbjct: 363 AAKQRN-KDTRKSSSSVKSLKSVKKRKIESSKSENGELADIKTVKEDTVTPAESNLREES 421 Query: 301 EKVFT-PRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDR-----------SAS 444 E V T PRERK SKYLS PY P W++G D S+S Sbjct: 422 ESVSTTPRERKVSKYLSYPYIIPEWKIGYTNFKLGSEASKTPKKDHLPIQEEPELEASSS 481 Query: 445 PPICKLVDKASHEEMP-DVHLKATSRA-----VEENDKKMTFSVSDVDVQVDEMLSEVQF 606 +V+ +S +E P D ++ + A ++ ND KM+F VSDV + DE+L ++ Sbjct: 482 QQKLVVVNTSSDKEHPFDDTIEQSKSASSGSRLDHNDVKMSFPVSDVTLSPDELLLGIKN 541 Query: 607 AAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXXXXXLKIQEN 786 AA+DPLYLS +G+ D V+ F SA RSS Y+HG+DY K + Sbjct: 542 AALDPLYLSKEGTLDEVWGFASAFRSSMYIHGSDYPKNSKGRK------------RKSIG 589 Query: 787 DDVAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEEIVRLFGKFGSL 966 ++KPK E SKEEIV++F ++GSL Sbjct: 590 THADEKKPKKSSCEV-----------KMEGKSSIVLSFAPRFPVPSKEEIVKVFSEYGSL 638 Query: 967 NENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGSQSQPKIP 1146 N +T+V+ DS+S ++VY + DA+ AFKSS +V L+ S+ P Sbjct: 639 NIEQTSVMKDSYSAQVVYMDEGDAKSAFKSS--------SDVKLRLKGCSSTK------P 684 Query: 1147 SPQNRDSEKPTDEDFMSYVRGISRKLEITTAILENYHPKFSTEEKSSLKEEMKHLMENVE 1326 S P D M+ V+ I RK +IT AIL NYH KFS EEK LK+E+KH+ME+VE Sbjct: 685 HETPTSSAIPESSDLMADVQAIKRKFDITAAILANYHTKFSGEEKRGLKDELKHVMESVE 744 Query: 1327 MVSEK 1341 +VS+K Sbjct: 745 VVSDK 749 >ref|XP_006338569.1| PREDICTED: uncharacterized protein LOC102594150 [Solanum tuberosum] Length = 833 Score = 186 bits (471), 
Expect = 3e-44 Identities = 161/501 (32%), Positives = 220/501 (43%), Gaps = 58/501 (11%) Frame = +1 Query: 4 PGEVPPLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGEN-------------- 141 P EVP GP E++ +S + DKIY +RKQKSV ELMGEN Sbjct: 350 PIEVPIQGPSEEEIPNSGSSKFPMTACDKIYQKRKQKSVAELMGENAKPKGKKTTEDDST 409 Query: 142 -NSVEPKNPKR-------------AKVKDEKDPSRKRKVVNAEKKGKPKEIKVDFTENNS 279 +SVE KR +K DEK R K K K++ V E + Sbjct: 410 PSSVETSEKKRKKSGEKAKGHTGSSKSVDEKIGKRVSKKSGDSDLVKTKKLSVSIPERDE 469 Query: 280 GEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDRS------- 438 +++ RERKKSKYLSPPYT+P W G D S Sbjct: 470 LGDQQDMNAGPLSRERKKSKYLSPPYTSPKWNAGKSSFKRDLEIESQKFSDISKIGERMT 529 Query: 439 -ASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAV 615 A+ + D +E D L +SR + + K TF ++ VDE+LSEVQ A+ Sbjct: 530 KAARLLLSSPDANGNEAFKD-DLDKSSRIRKRSPK--TFDTMAINSSVDEVLSEVQSTAL 586 Query: 616 DPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXXXXXLKIQENDDV 795 +PL L GS + F+S R+S Y G++YK + + ET + + + Sbjct: 587 NPLLLR-NGSLEKARGFISTFRNSVYFDGSNYKQYHQVET-------GKKRKSVGSRNVI 638 Query: 796 AQEKPKSPDS---------------------EAPXXXXXXXXXXXXXXXXXXXXXXXXXX 912 +Q KSPDS P Sbjct: 639 SQSDSKSPDSVPSKKRKTNHAKSEVTKLKKESGPSSQGKEDEDDGGETSSVILLVTFLTG 698 Query: 913 XXXSKE-EIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVEN 1089 E EI+R++ KFG LNE ET VL DS+SVRIVY++ SDA AFK SV +SPFG N Sbjct: 699 FSLPSEDEIIRIYNKFGELNEEETKVLCDSNSVRIVYRRGSDAAQAFKESVRQSPFGAAN 758 Query: 1090 VNYLLQHFSAGSQSQPKIPSPQNRDSEKPTDEDFMSYVRGISRKLEITTAILENYHPKFS 1269 VN+ L S S+S+ + S + R + S V+ I +KL+ ++IL K + Sbjct: 759 VNFTL---SYSSKSESPLSSLKARKGK--------SQVQLIKQKLKGMSSILGKCKGKIT 807 Query: 1270 TEEKSSLKEEMKHLMENVEMV 1332 +EEKS L+ E+K L+E V V Sbjct: 808 SEEKSELENEIKGLLEKVSAV 828 >ref|XP_002274197.1| PREDICTED: uncharacterized protein LOC100267992 [Vitis vinifera] Length = 976 Score = 177 bits (448), Expect = 1e-41 Identities = 155/557 (27%), Positives = 239/557 (42%), Gaps = 109/557 (19%) Frame = +1 Query: 4 PGEVPPLGPREDDWLSSPGVSS-----------AKSRDDKIYHRRKQKSVNELMGENNSV 
150 P EVP GP EDDWLS P S A +DK+Y RRKQKS+ E+M N V Sbjct: 424 PVEVPIQGPCEDDWLSMPVSPSFGKTSRTLLHKATGSEDKLYQRRKQKSMAEIMRGNGDV 483 Query: 151 EPKNPKRAKVKDE---------KDPSRKRK--------VVN---AEKKGKPKEIKVDFT- 267 EPKN + K++ + R++K VVN A +G+ K+ ++ + Sbjct: 484 EPKNEETDMGKEDINSVKLATASEKKRRKKGGNEAESHVVNSNLASPRGRRKKSRLSGSP 543 Query: 268 -------------------------------------ENNSGEAKEEPEKVFTPRERKKS 336 EN+ G EE E+ RERKKS Sbjct: 544 VTSEDRALSVESDGSEGKRESENSPVSRERKKKGLSVENDGGRLPEESEQTSVSRERKKS 603 Query: 337 KYLSPPYTNP------TWRMGXXXXXXXXXXXXXXXIDRSA--------SPPICKLVDKA 474 KYL PPYTN + MG +RS+ SP I K + Sbjct: 604 KYLCPPYTNVIRMHRNSGSMGDSKTEFLEVSNVAGKGERSSRAAGQSVGSPTILKCSSET 663 Query: 475 SHEEMPDVHLKATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDM 654 +++ + + ++ + ++ + + E+LS ++ AA++P YL S D Sbjct: 664 TYQNKD-----SKEHQTPKQNRNKVIDLKEIRISLQEVLSGIRSAALNPFYLRENKSVDK 718 Query: 655 VYAFVSARRSSTYLHGADYKIF------QKPETVXXXXXXXXXXLKIQENDDVAQ----E 804 + F+SA RS+ Y G++YK+F +K + LK +++ Q Sbjct: 719 ISGFLSAFRSAIYHDGSNYKMFNKHGPGRKRKRQESEPGSSREDLKQNDHNSSKQARRSR 778 Query: 805 KPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX----------SKEEIVRLFGK 954 K ++ + + P SK++++++F K Sbjct: 779 KNETAEPDGPELKQAAAGKSDTKTKHKDKDKKVESATLLLSFGPGISLPSKDDLIKIFSK 838 Query: 955 FGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGS--Q 1128 FG+LNE+ET +L DS R+V+ + SDAE AF S SPFG E V Y L++ S+ + + Sbjct: 839 FGTLNESETEILYDSFCARVVFSRSSDAEEAFNGSQKASPFGAEQVTYRLRYPSSSTSRR 898 Query: 1129 SQPKIPSPQNRDSEK----PTDEDFMSYVRGISRKLEITTAILENYHPKFSTEEKSSLKE 1296 + K P N+ + K P+ S + I +KLE+ T +LE K S E KS+L+ Sbjct: 899 TPDKKHHPPNKKAGKAPANPSAGGEKSQLNFIKQKLEMMTCMLEKSSGKMSGEMKSNLEG 958 Query: 1297 EMKHLMENVEMVSEKSS 1347 EMK L+E V ++E SS Sbjct: 959 EMKGLLEKVSTMAETSS 975 >ref|XP_004232302.1| PREDICTED: uncharacterized protein LOC101252451 isoform 2 [Solanum lycopersicum] Length = 809 Score = 173 bits (439), Expect = 1e-40 Identities = 156/498 (31%), Positives = 218/498 (43%), Gaps = 54/498 (10%) Frame = +1 Query: 4 
PGEVPPLGPREDDWLSSPGVSSAK---SRDDKIYHRRKQKSVNELMGEN----------- 141 P EVP GP E+ P S+K + DKIY +RKQKSV ELMGEN Sbjct: 328 PIEVPIQGPSEE----IPNSGSSKFPMTACDKIYQKRKQKSVAELMGENAKPKGKKTTED 383 Query: 142 ----NSVEPKNPKRAKVK-------------DEKDPSRKRKVVNAEKKGKPKEIKVDFTE 270 +SVE KR K DEK R K K K++ V E Sbjct: 384 DSTPSSVETSEKKRKKSGEKAKGQTGSSMSVDEKIGKRVNKKSGDSDLVKTKKLSVSIPE 443 Query: 271 NNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDRSASPP 450 ++ +++ + RERKKSKYLSPPYT+P W G D S Sbjct: 444 SDEVGNQQDNAGPLS-RERKKSKYLSPPYTSPKWNAGKSSFKRELAIESQKFSDNS---K 499 Query: 451 ICKLVDKASHEEMPDVHLKATSRAVEENDK--------KMTFSVSDVDVQVDEMLSEVQF 606 I + + KA+ + ++ DK TF ++ VDE+LSEVQ Sbjct: 500 IGERMTKAARLLLSSPDSNGKEAFKDDVDKSSGINKRSSRTFDTVAINSSVDEVLSEVQS 559 Query: 607 AAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXXXXXLKIQEN 786 A++PL L GS + F+S R+S Y G++YK + + ET L Q + Sbjct: 560 TALNPLLLR-NGSLEKARGFISTFRNSLYYDGSNYKQYHQMETGKKRKSAGSGNLISQSD 618 Query: 787 ----DDVAQEKPKS-----------PDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 921 D + +K K+ D Sbjct: 619 TESPDSIPSKKRKTNYAKSEVTKLKKDYGPSSQGKEDEDDGREASSVILLVAFLTGFSLP 678 Query: 922 SKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYL 1101 ++EI+R++ KFG LNE ET VL DS+SVRIVY+ +DA AFK SV +SPFG NVN+ Sbjct: 679 PEDEIIRIYNKFGELNEEETEVLRDSNSVRIVYRHGADAAQAFKESVRQSPFGAANVNFT 738 Query: 1102 LQHFSAGSQSQPKIPSPQNRDSEKPTDEDFMSYVRGISRKLEITTAILENYHPKFSTEEK 1281 L S S+S+ + S + R + S V+ I +KL+ +IL+ K ++ EK Sbjct: 739 L---SYSSKSESPLSSLKARKGK--------SQVQLIKQKLKGMASILDKCKGKITSAEK 787 Query: 1282 SSLKEEMKHLMENVEMVS 1335 S L+ E+K L+E V V+ Sbjct: 788 SELENEIKGLVEKVSAVT 805 >ref|XP_004232301.1| PREDICTED: uncharacterized protein LOC101252451 isoform 1 [Solanum lycopersicum] Length = 835 Score = 173 bits (439), Expect = 1e-40 Identities = 156/498 (31%), Positives = 218/498 (43%), Gaps = 54/498 (10%) Frame = +1 Query: 4 PGEVPPLGPREDDWLSSPGVSSAK---SRDDKIYHRRKQKSVNELMGEN----------- 141 P EVP GP E+ P S+K + DKIY +RKQKSV ELMGEN Sbjct: 354 
PIEVPIQGPSEE----IPNSGSSKFPMTACDKIYQKRKQKSVAELMGENAKPKGKKTTED 409 Query: 142 ----NSVEPKNPKRAKVK-------------DEKDPSRKRKVVNAEKKGKPKEIKVDFTE 270 +SVE KR K DEK R K K K++ V E Sbjct: 410 DSTPSSVETSEKKRKKSGEKAKGQTGSSMSVDEKIGKRVNKKSGDSDLVKTKKLSVSIPE 469 Query: 271 NNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDRSASPP 450 ++ +++ + RERKKSKYLSPPYT+P W G D S Sbjct: 470 SDEVGNQQDNAGPLS-RERKKSKYLSPPYTSPKWNAGKSSFKRELAIESQKFSDNS---K 525 Query: 451 ICKLVDKASHEEMPDVHLKATSRAVEENDK--------KMTFSVSDVDVQVDEMLSEVQF 606 I + + KA+ + ++ DK TF ++ VDE+LSEVQ Sbjct: 526 IGERMTKAARLLLSSPDSNGKEAFKDDVDKSSGINKRSSRTFDTVAINSSVDEVLSEVQS 585 Query: 607 AAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXXXXXLKIQEN 786 A++PL L GS + F+S R+S Y G++YK + + ET L Q + Sbjct: 586 TALNPLLLR-NGSLEKARGFISTFRNSLYYDGSNYKQYHQMETGKKRKSAGSGNLISQSD 644 Query: 787 ----DDVAQEKPKS-----------PDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 921 D + +K K+ D Sbjct: 645 TESPDSIPSKKRKTNYAKSEVTKLKKDYGPSSQGKEDEDDGREASSVILLVAFLTGFSLP 704 Query: 922 SKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYL 1101 ++EI+R++ KFG LNE ET VL DS+SVRIVY+ +DA AFK SV +SPFG NVN+ Sbjct: 705 PEDEIIRIYNKFGELNEEETEVLRDSNSVRIVYRHGADAAQAFKESVRQSPFGAANVNFT 764 Query: 1102 LQHFSAGSQSQPKIPSPQNRDSEKPTDEDFMSYVRGISRKLEITTAILENYHPKFSTEEK 1281 L S S+S+ + S + R + S V+ I +KL+ +IL+ K ++ EK Sbjct: 765 L---SYSSKSESPLSSLKARKGK--------SQVQLIKQKLKGMASILDKCKGKITSAEK 813 Query: 1282 SSLKEEMKHLMENVEMVS 1335 S L+ E+K L+E V V+ Sbjct: 814 SELENEIKGLVEKVSAVT 831 >ref|XP_006599108.1| PREDICTED: uncharacterized protein LOC102666492 isoform X1 [Glycine max] gi|571526483|ref|XP_006599109.1| PREDICTED: uncharacterized protein LOC102666492 isoform X2 [Glycine max] gi|571526487|ref|XP_006599110.1| PREDICTED: uncharacterized protein LOC102666492 isoform X3 [Glycine max] Length = 937 Score = 134 bits (338), Expect = 7e-29 Identities = 145/552 (26%), Positives = 232/552 (42%), Gaps = 106/552 (19%) Frame = +1 Query: 10 EVPPLGPREDDWLSSPGVSSAKSRD---------DKIYHRRKQKSVNELMGENNSVEPKN 162 E P GP 
E+D+ + P S KS + +++ HR KQKS+ E+MGE+ V KN Sbjct: 394 EAPAHGPFEEDYSTMP--MSPKSGELSHSHGISGNRLNHRIKQKSIAEIMGEDKDVNTKN 451 Query: 163 PK---------RAKVKDEKDP--------------SRKRKVVNAEKKG------------ 237 + R K K +D + R V AE G Sbjct: 452 QEGDATEKVTVRKKRKGSEDTMASKSVQMRKALFSNTDRNVAGAENDGGCWGKEDGDNGT 511 Query: 238 ----KPKEIKVDFTENNSGEAKE---------EPEKVFTPRERKKSKYLSPPYTNPT--- 369 K K+ +++SG KE + EK RE+KKSKYLSPP+T P Sbjct: 512 LAQLKKKKKAFGIGKSSSGSKKETDLEGKFKGKNEKGSLSREKKKSKYLSPPFTIPAREQ 571 Query: 370 --WRMGXXXXXXXXXXXXXXXIDRSA-----SPPICKLVDKASHEEMPDVHLK------A 510 + + R++ SP KL D+A E + +K + Sbjct: 572 RKGEIETESPKVSGKDQESEPLTRASDQLLKSPVPLKLNDEAFQENVSKELVKEQDLPDS 631 Query: 511 TSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSST 690 ++ E D+ T + + V + E+LSEV++AA++P S S + + F+ RSS Sbjct: 632 SNYRTPEYDENKTIDTTKIQVPLGEVLSEVRYAAINPQTPSNTNSLERIVDFIFIYRSSL 691 Query: 691 YLHGADYKIFQ--KPETVXXXXXXXXXXLKIQE---------NDDVAQEK---------- 807 + G+ YKI++ KP L+ + ND +++ Sbjct: 692 FRQGSYYKIYKKHKPSKKRKKPESDLGILRKDQIQSDHISAINDSEPKKRRIKKETALGL 751 Query: 808 PKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEEIVRLFGKFGSLNENETNV 987 PK S A SK +++ L+GKFG+LNE+ET + Sbjct: 752 PKEKLSAAAKIGKKGTDKNASGAALFVSFEPGSSLP--SKSDLITLYGKFGALNESETAM 809 Query: 988 LTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGSQSQ---PKIPSPQN 1158 ++ R+ + K S+AE A S +PF ++ L++ SAGS+S+ PK S + Sbjct: 810 FASDYTARVFFLKASNAEKALSHSQNLNPFDSSGASFRLEYLSAGSKSEKSKPKASSTKK 869 Query: 1159 RDS--EKP-------TDEDFMSYVRGISRKLEITTAILENYHPKFSTEEKSSLKEEMKHL 1311 +D KP T+ ++Y++ +KL+ T++LE K + K+ L+ EMK L Sbjct: 870 KDKTPAKPSASLSPGTEASKLNYIK---QKLQCLTSMLEASDAKL-PDIKAKLESEMKRL 925 Query: 1312 MENVEMVSEKSS 1347 +E+V + E SS Sbjct: 926 LEDVNKMVESSS 937 >gb|ESW12953.1| hypothetical protein PHAVU_008G155500g [Phaseolus vulgaris] gi|561014093|gb|ESW12954.1| hypothetical protein PHAVU_008G155500g [Phaseolus vulgaris] Length = 931 Score = 126 bits (316), Expect = 3e-26 Identities = 145/557 (26%), Positives = 229/557 (41%), Gaps = 111/557 (19%) Frame = +1 Query: 10 
EVPPLGPREDDWLSSPGVSSAKS---------RDDKIYHRRKQKSVNELMGENNSVEPKN 162 EVP GP E+D+ + P S KS +++ HR KQKS+ E+MGE+ KN Sbjct: 388 EVPVHGPFEEDYSTMP--VSPKSGGLNLSHGISGNRLNHRIKQKSIAEIMGEDKDFSAKN 445 Query: 163 PK---------RAKVKDEKD-----PSRKRKVVN-------------------------- 222 R K K +D P +KRK + Sbjct: 446 KVGDATEKVTVRKKRKGSEDTMVSNPVQKRKELFPNTYRNKAGAENDGYSCGKENSDNGA 505 Query: 223 -AEKKGKPKEIKVDFTENNS-------GEAKEEPEKVFTPRERKKSKYLSPPYTNPT--W 372 A+ K K K + + S G+A+ EK RERKKSKYLSPP+T PT Sbjct: 506 LAQLKKKKKVFGIGKASSASKKETDQEGKAQGNSEKGSLSRERKKSKYLSPPFTIPTRDQ 565 Query: 373 RMGXXXXXXXXXXXXXXXIDRSASPPICKLVDKASHEEMP--------------DVHLK- 507 R G S P+ + DK +P ++ ++ Sbjct: 566 RKGEIEIESPKVSGKD-----QVSEPMTRASDKLLESPVPWKLNGDPFQEKFSKELSIEH 620 Query: 508 ----ATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSA 675 +++ + D+ T + + V + E+L EV+ AA++P + S + V F+ Sbjct: 621 DFPDSSNHQTSKYDEDKTIDTTKIQVPLGEVLREVRCAAINPQTPTDTISLERVAEFIFI 680 Query: 676 RRSSTYLHGADYKIFQK---------PET-VXXXXXXXXXXLKIQENDDVAQEK------ 807 R+S + G++YK+++K PE+ V I + D +K Sbjct: 681 YRNSIFRQGSNYKVYKKLKPGKKRKKPESDVGMLGKDQIQSDHISAHKDSEPKKRRRKNE 740 Query: 808 -----PKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEEIVRLFGKFGSLNE 972 PK S P SK +++ L+ KFG+LNE Sbjct: 741 TTSGLPKEKQSATPKAGKKGTNKNASGATLFASFEPGSSLP--SKSDLITLYSKFGTLNE 798 Query: 973 NETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGSQSQ---PKI 1143 +ET + + ++ ++ + K SDAE A S +PFG + LQ+ S+GS+S+ K Sbjct: 799 SETAMFSSDYAAQVFFLKASDAEKALSDSQNMNPFGSSKATFRLQYLSSGSKSEKSISKT 858 Query: 1144 PSPQNRD--------SEKPTDEDF-MSYVRGISRKLEITTAILENYHPKFSTEEKSSLKE 1296 SP+ +D S P E + ++Y++ +KL+ T ILE K S++ K L+ Sbjct: 859 SSPKKKDKTPAKPSTSLSPGSEAYKLNYIK---QKLQGLTLILEASDAK-SSDIKKKLES 914 Query: 1297 EMKHLMENVEMVSEKSS 1347 EMK L+E+V + E SS Sbjct: 915 EMKGLLEDVNKMVESSS 931 >ref|XP_006604053.1| PREDICTED: uncharacterized protein LOC102668257 isoform X1 [Glycine max] gi|571554991|ref|XP_006604054.1| PREDICTED: uncharacterized protein LOC102668257 isoform X2 [Glycine max] Length = 927 Score = 125 bits (314), 
Expect = 4e-26 Identities = 135/539 (25%), Positives = 212/539 (39%), Gaps = 96/539 (17%) Frame = +1 Query: 19 PLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNSVEPKNPK---------R 171 P+ P+ + S G+S +++ HR KQKS+ E+MGE+ KN + R Sbjct: 400 PMSPKSGELSHSHGISG-----NRLNHRIKQKSIAEIMGEDKDANTKNKQGDATEKVSVR 454 Query: 172 AKVKDEKDPSRKRKVVN--------------------------------AEKKGKPKEIK 255 K K +D + V A+ K K K Sbjct: 455 KKRKGSEDTMASKSVQKRKGLFLNTDRNAAGAENDGGSWGKEDGDNGTLAQLKKKKKSFG 514 Query: 256 VDFTENNS-------GEAKEEPEKVFTPRERKKSKYLSPPYTNPT--WRMGXXXXXXXXX 408 + T + S G+AK + K RERKKSKYLSPP+ P R G Sbjct: 515 IGNTSSGSKKETDHEGKAKVKNGKGSLSRERKKSKYLSPPFAIPAREQRKGERETESPKV 574 Query: 409 XXXXXX---IDRSA-----SPPICKLVDKASHE----------EMPDVHLKATSRAVEEN 534 + R++ SP KL D+ E ++PD +++ E Sbjct: 575 SGKDQQSEPLTRASDQLLKSPVPLKLNDEPFQENVSKELVIDQDLPD----SSNYRTPEY 630 Query: 535 DKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYK 714 D+ T + + V E+LSEV +AA++P S + + F+ RSS Y G+ YK Sbjct: 631 DENKTIDTTKIQVPSGEVLSEVCYAAINPQTPMNINSLERIVDFIFIYRSSLYRQGSYYK 690 Query: 715 IFQKPETVXXXXXXXXXXLKIQENDDVAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXX 894 I++K + L I D + +K + + P Sbjct: 691 IYKKHKP-SKKGKKPESDLGILRKDQIQSDKKSANNDSEPKKRRKNETTSSLPKEKQSAA 749 Query: 895 XXXXXXXXXSK-------------------EEIVRLFGKFGSLNENETNVLTDSHSVRIV 1017 K ++ L+GKFG+LNE+ET++L+ + R+ Sbjct: 750 AKTGKKGIDKKASGASLFISFGPGSSLPSNSDLTTLYGKFGALNESETSMLSSDCTARVF 809 Query: 1018 YKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGSQSQ-PKIPSPQNRDSEKPTDEDFM 1194 + K SDAE A S +PFG ++ L++ SAGS+S+ K + + +K + Sbjct: 810 FLKASDAEKALSHSQNMNPFGSSEASFRLEYLSAGSKSEKSKFKASSTKKKDKTPAKPSA 869 Query: 1195 SYVRG--------ISRKLEITTAILENYHPKFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 S G I KL+ T++LE K + K+ L+ EMK L+E+V + E SS Sbjct: 870 SLSPGGEASKLNYIKEKLQGLTSMLEASDAKL-PDIKTKLESEMKQLLEDVNRMVESSS 927 >gb|EMJ18867.1| hypothetical protein PRUPE_ppa1027165mg [Prunus persica] Length = 944 Score = 124 bits (310), Expect = 1e-25 Identities = 137/573 (23%), Positives = 220/573 (38%), Gaps = 127/573 (22%) Frame = +1 
Query: 10 EVPPLGPREDDWLSSPGV------------SSAKSRDDKIYHRRKQKSVNELMGENNSV- 150 EVP GP ED WLSSPG SS K +D+ Y RRKQKS+ +LMG ++ + Sbjct: 379 EVPVQGPFED-WLSSPGGAKTGQTDQTFSRSSPKILEDRQYQRRKQKSIADLMGGDDDIQ 437 Query: 151 -------------------EPKNPKRAKVKDE---------------------------- 189 E K K ++ DE Sbjct: 438 AKTKDGGIMANEGAVSEKPEQKKRKGSESHDESNLSSDVVKRKLRLSKSPTSTLTKKILS 497 Query: 190 --------KDPSRKRKVVNAEKK----------GKPKEIKVDFTENNSGEA--------- 288 K+ K ++ KK GK KE D + GE Sbjct: 498 VENDCSGSKEEGNKGRLSRRRKKDESFGMDSDDGKMKEETGDSPLSRDGELRSGGLQSDM 557 Query: 289 KEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDRSASPPICKLVD 468 K++ + RERKKSKYLSPP+TN + A+ + Sbjct: 558 KDQIDNRPLSRERKKSKYLSPPFTNLNMVKRMRDIEIESEVSNENQLGERATSNLIGSPH 617 Query: 469 KAS--HEEMPDVHLKATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKG 642 + E++ H S D++ + + ++SE++ AA++P Y + Sbjct: 618 MLNCCTEKLKKKHTTELSPKAPAEDEEKSIDPLKANASASLVISELRSAALNPSYPIKRK 677 Query: 643 SFDMVYAFVSARRSSTYLHGADYKIFQ---------------------KPETVXXXXXXX 759 SF++ F++ R S Y +G++Y++++ + +T Sbjct: 678 SFEIFRDFMAIFRDSIYRNGSNYELYKNRQPHRKRKNLISEPGSLGKDQSQTAENLRDSE 737 Query: 760 XXXLKIQENDD--VAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEE 933 KI+++ D + + +PD + +K + Sbjct: 738 SGHKKIKKSSDKPIGKHATGTPDLKT-----RRKKRDEKASPASLFVTFGPGSSLPTKAD 792 Query: 934 IVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHF 1113 +++++ KFG LNE ET + ++ R+ + + SDAE AF S +SPFG NVN+ L + Sbjct: 793 LIKIYSKFGELNEMETEMFYNNFCARVSFLRISDAEEAFNHSQNDSPFGASNVNFRLHNL 852 Query: 1114 SAGSQ--------SQPKIPS-------PQNRDSEKPTDEDFMSYVRGISRKLEITTAILE 1248 S S+ + P S P +S+ P D + S + I KLE T++L+ Sbjct: 853 STASKVRELSEISNSPPAKSRGKTRSQPVGTNSQPPVDGE-ASQLDFIRHKLEKLTSMLD 911 Query: 1249 NYHPKFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 N K S KS L+ E+K L+E V + E SS Sbjct: 912 NSDGKVSAVTKSKLESEIKELLETVSTMVESSS 944 >gb|ESW23014.1| hypothetical protein PHAVU_004G012100g [Phaseolus vulgaris] Length = 1282 Score = 119 bits (297), Expect = 4e-24 Identities = 130/492 (26%), Positives = 202/492 (41%), Gaps = 47/492 (9%) Frame 
= +1 Query: 10 EVPPLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNSVEPKNPKRAKVKDE 189 E P GP D+ SPG+ S H RKQKS+ E+M E+ V + R Sbjct: 813 EAPTQGPF-DELGHSPGLPGNISN-----HVRKQKSIAEIMREDKDVHTAS--REVEATG 864 Query: 190 KDPSRKRKVVNAEKKGKP----KEIKVDFTEN----------NSG------------EAK 291 + RKRK A + KP KE+ +D E+ NSG E Sbjct: 865 SNGRRKRKGSEAGVRSKPVQKKKELLLDTDEDVSSAEHCAEENSGSIGSWLQSKEKKEVL 924 Query: 292 EEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXIDRSASPPICKLVDK 471 +E K RERKKSKYLSPP+T P ++ S Sbjct: 925 DEGRKGSLSRERKKSKYLSPPFTTPI-------RGQREESIEAESLEVSRKVKASHTSSV 977 Query: 472 ASHEEMPDVHL----KATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIK 639 A+ + P V++ +++ EE+D K + V+E+L +V AA+ P Sbjct: 978 AAVLQYPPVYMGRLFDSSNYQTEEDDGKKVIDPKKIQAPVEEVLFQVLNAAISPQIRREG 1037 Query: 640 GSFDMVYAFVSARRSSTYLHGADYKIFQK---------PETVXXXXXXXXXXLKIQENDD 792 S D F A RSS Y G+ +++K PE+ + Q+ + Sbjct: 1038 TSLDQFVDFTFAFRSSLYSEGSLRDLYEKNQPGRKRKRPESEEEDGMLKDAQISSQKQNS 1097 Query: 793 VAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEEIVRLFGKFGSLNE 972 +++K K S S+ E++ ++ KFG+LNE Sbjct: 1098 GSKKKRKETAS-------GKKGGDENAQGAVLVVSFWQGTSTPSRSELISVYSKFGALNE 1150 Query: 973 NETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYLLQHFSAGSQSQPKIPSP 1152 ET+V ++++ R+ + + SDAE A S +PFG + + LQ+ S GS+S+ Sbjct: 1151 AETDVFNNNYTARVSFLRTSDAENALNHSKNNNPFGSADATFQLQYHSEGSKSEEHGERS 1210 Query: 1153 QNRDSEKPTDEDFMSYVRG--------ISRKLEITTAILENYHPKFSTEEKSSLKEEMKH 1308 +N+ T +S +G I +KL+ T ILE K S + + L+ E+K Sbjct: 1211 KNKPLVAATSPS-VSVSQGGEASRLIFIQKKLQGLTLILEASGDK-SPDLMAELQREVKA 1268 Query: 1309 LMENVEMVSEKS 1344 L+E+V + E S Sbjct: 1269 LLEDVNQMVEAS 1280 >ref|XP_004489260.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like isoform X1 [Cicer arietinum] Length = 967 Score = 116 bits (291), Expect = 2e-23 Identities = 130/540 (24%), Positives = 218/540 (40%), Gaps = 97/540 (17%) Frame = +1 Query: 19 PLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNS--VEPKNPK-------- 168 PL P+ + SP +S ++S RRKQKS+ ++M E+ V KN + Sbjct: 449 
PLSPKSGEPCHSPEISGSRSN-----RRRKQKSIADIMWEDKDKDVHTKNKEEDASDEVL 503 Query: 169 -------RAKVKDEKD-----PSRKRKV------------------------------VN 222 R K KD +D P RKRK +N Sbjct: 504 DAIASRGRKKRKDSEDVATSKPVRKRKEFVIDTDGNSAGSGKEGRGDKKNSDKVKSLHLN 563 Query: 223 AEKKGKPKEIKVDFT---ENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXX 393 +K+ E V+ + EN+ G++KEE EK F RERKKSKYLSPP+T + Sbjct: 564 KKKEAFGNESVVNGSKEEENDEGKSKEENEKGFLSRERKKSKYLSPPFTTSIREL----- 618 Query: 394 XXXXXXXXXXXIDRSASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKMTFSVSDVDV 573 R +SP + K + + L +S ++D++ V V Sbjct: 619 VKGSKGTKARDAVRLSSP-----ISKCNSVAFLESKLSDSSNHQTQDDEEKAIDPEKVKV 673 Query: 574 QVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIF---------QK 726 ++LS+++ A+ P SFD F+ RSS Y G+ YK + +K Sbjct: 674 SSAKILSKLRSVAISPQISREGASFDRFVDFILVMRSSLYREGSLYKAYKKVLPGRKRKK 733 Query: 727 PETVXXXXXXXXXXLKIQENDDVAQEKPKSP--------------DSEAPXXXXXXXXXX 864 PE+ ++D V+ ++ +P + A Sbjct: 734 PESKSELEMLGKDQ---NQSDHVSPDEDSAPIKRRKEKKTTSVQKSTRASETKTGEKGTD 790 Query: 865 XXXXXXXXXXXXXXXXXXXSKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAEL 1044 SK +++ ++ KFG+LNE ET++ +++ R+ + + DAE Sbjct: 791 EKSSAAVLFVSFWPGSTLPSKSDLITMYSKFGALNELETDMFRTNYTARVSFLRTHDAEK 850 Query: 1045 AFKSSVTESPFGVENVNYLLQHFSA---------GSQSQPKIPSPQNRDSEKPT------ 1179 A S ++PF V + LQ+ S+ +S+ K + SE PT Sbjct: 851 ALNHSQNKNPFESSEVTFQLQYASSDGSKSVGEHSERSKSKASQYNKQKSETPTTPSVSP 910 Query: 1180 ----DEDFMSYVRGISRKLEITTAILENYHPKFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 ++ +S+++G KL+ ++LE+ K S E K+ L+ +K L+E+V ++E +S Sbjct: 911 SQGSEKTKLSFIKG---KLQGLVSMLESSDEK-SPEFKTKLEINVKSLLEDVNKMAESTS 966 >ref|XP_002524654.1| conserved hypothetical protein [Ricinus communis] gi|223536015|gb|EEF37673.1| conserved hypothetical protein [Ricinus communis] Length = 1072 Score = 108 bits (271), Expect = 4e-21 Identities = 116/460 (25%), Positives = 191/460 (41%), Gaps = 54/460 (11%) Frame = +1 Query: 100 RRKQKSVNELMG----ENNSVEPKNPKRAKVKDEKDPSRKRKVVNAEKKGKPKEIKVDFT 267 R KQ+SV + E S K+ VKDE+ E PK KV Sbjct: 589 
RMKQESVKTPLSRARKEKGSSHAKDAGSIGVKDEE---------MRENTVSPK--KVIGG 637 Query: 268 ENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTW-----RMGXXXXXXXXXXXXXXXID 432 +++G+A+E+ +K RERK+SKYLSPPYTN + + Sbjct: 638 PSDNGKAEEQIQKGALLRERKRSKYLSPPYTNLNKVAKKNEVEAESVKVSSEAQLAEPLT 697 Query: 433 RSAS-----PPICK----LVDKASHEEMPDVHLKATSRAVE--ENDKKMTFSVSDVDVQV 579 ++AS PPI K K + +E VH + + + D+ + Sbjct: 698 KAASHVIGSPPILKPSGEKFQKRTPKEPGVVHETSDGSGPQTPKQDQNKIIDPMIIKAPA 757 Query: 580 DEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXX 759 +E+LS+++ AA++PLYL S D+V FVSA R+S+Y + D + + Sbjct: 758 NEVLSKMRSAALNPLYLKETNSVDVVGEFVSAFRNSSYCNMTDSEYSELHSGRKRKSQKS 817 Query: 760 XXXLKIQENDDVAQEKPKSPDSE--------------------APXXXXXXXXXXXXXXX 879 ++E + + Q P + A Sbjct: 818 EPGSLVKEQNRIDQSSPDQKSHQTKTKKNKAKVDKPKVKQAASARDMKTKNKEPNGETPG 877 Query: 880 XXXXXXXXXXXXXXSKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSS 1059 +K ++++++ K+G+LNENET + ++ R+++ K S+AE AF S Sbjct: 878 AALYVTFGPGSSLPTKNDLIQIYRKYGALNENETEMFYANYCARVLFLKTSEAEEAFNDS 937 Query: 1060 VTESPFGVENVNYLLQHFSAGSQSQP--KIPSPQNRD------------SEKPTDEDFMS 1197 SPF NV + L++ SA ++++ IPS + S + +S Sbjct: 938 QLSSPFKAANVTFRLRYLSAETKTRELRDIPSKKRASLAKEGAKTPGAPSASQSSGGNLS 997 Query: 1198 YVRGISRKLEITTAILENYHPKFSTEEKSSLKEEMKHLME 1317 + I +KLE+ T++LE K S KS L+ E+K L+E Sbjct: 998 ELNFIKQKLEMITSLLETSIGKISPNTKSILEGEIKVLLE 1037 >ref|XP_006580692.1| PREDICTED: uncharacterized protein LOC102666447 [Glycine max] Length = 1053 Score = 105 bits (261), Expect = 6e-20 Identities = 126/506 (24%), Positives = 206/506 (40%), Gaps = 61/506 (12%) Frame = +1 Query: 10 EVPPLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNSVEPKNPKRAKVKDE 189 E P GP D+ SPG+S + S RKQKS+ E+MGE+ V N + + Sbjct: 564 EAPTQGPF-DELGHSPGLSGSISNPV-----RKQKSIAEIMGEDKDVHTANRELDATVEM 617 Query: 190 -----KDPSRKRKVVNAEKKGKPKEIKVDFT-----------------ENNS-------- 279 + +KRK KP + K++ E NS Sbjct: 618 VNAIGSNVGKKRKGSEDGMASKPVQKKMELLLDADGDVSCAKNDGNGDEGNSDVGSLLQS 677 Query: 280 ---------GEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXXXID 432 G+++E 
EK RERK+SKYLSPP+T PT + Sbjct: 678 KEKKEAFDEGKSEERNEKGNLSRERKRSKYLSPPFTIPT-----RGQREVYLEPESLKVS 732 Query: 433 RSASPPICKLVDKASHEEMPDVH---LKATSRAVEENDKKMTFSVSDVDVQVDEMLSEVQ 603 R A + D A +P +S E+D K + + V E+LS+V Sbjct: 733 RKAKVSQRRAGD-AGLSSLPVYKGRFFDGSSYQTREDDGKNIVDPNKIQAPVAEVLSQVL 791 Query: 604 FAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQK---------PETVXXXXXX 756 AA+ PL S D F A RSS Y G+ +++++K PE+ Sbjct: 792 NAAISPLIRREGTSLDQFVDFTYAFRSSLYCQGSLHEVYEKNQPGRKRKKPES---EEDE 848 Query: 757 XXXXLKIQENDDVAQEKPKSPDSEA-PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEE 933 L + ++ ++ K S + S+ + Sbjct: 849 MLKGLNLSADEHISSLKQNSGQKKRRKETASGKKGTDKNAAGAVLFVSFWPGSSMPSRSD 908 Query: 934 IVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFG-VENVNYLLQH 1110 +V ++ KFG+LNE ET++ +++ R+ + + SDAE A+ S +PFG +V + LQ+ Sbjct: 909 LVSVYSKFGALNEAETDMFCTNYTARVSFLRTSDAEKAYNHSQNNNPFGSPTDVTFQLQY 968 Query: 1111 FSAGSQSQPKIPSPQNRDSEKPTDEDFMSYVRG--------ISRKLEITTAILENYHPKF 1266 S GS+S + ++++ P +++ +G I +KL+ T +LE K Sbjct: 969 SSDGSKSGQQ--GERSKNKSLPAATAPVAFSQGTEASKLIFIQQKLQGMTLMLEASGGK- 1025 Query: 1267 STEEKSSLKEEMKHLMENVEMVSEKS 1344 S + + ++ EMK L+E+V + E S Sbjct: 1026 SPDMMAKVESEMKALLEDVNKMVEAS 1051 >emb|CBI34908.3| unnamed protein product [Vitis vinifera] Length = 533 Score = 100 bits (248), Expect = 2e-18 Identities = 90/367 (24%), Positives = 144/367 (39%), Gaps = 33/367 (8%) Frame = +1 Query: 106 KQKSVNELMGENNSVEPKNPKRAKVKDE---------KDPSRKRKVVNAEKKGKPKEIKV 258 KQKS+ E+M N VEPKN + K++ + R++K N + + + V Sbjct: 204 KQKSMAEIMRGNGDVEPKNEETDMGKEDINSVKLATASEKKRRKKGGNEAESHVDRALSV 263 Query: 259 DF------------------------TENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNP 366 + EN+ G EE E+ RERKKSKYL PPYTN Sbjct: 264 ESDGSEGKRESENSPVSRERKKKGLSVENDGGRLPEESEQTSVSRERKKSKYLCPPYTN- 322 Query: 367 TWRMGXXXXXXXXXXXXXXXIDRSASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKM 546 + ++ + S ++N K+ Sbjct: 323 ----------------------------VIRMHRNSGSMGDSKTEFLEVSNTPKQNRNKV 354 Query: 547 TFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQK 726 + ++ + + E+LS ++ AA++P YL S 
D + F+SA R+ + +K Sbjct: 355 -IDLKEIRISLQEVLSGIRSAALNPFYLRENKSVDKISGFLSAFRT---------RRSRK 404 Query: 727 PETVXXXXXXXXXXLKIQENDDVAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXX 906 ET ++D + K K E+ Sbjct: 405 NETAEPDGPELKQAAA-GKSDTKTKHKDKDKKVESATLLLSFGPGISLP----------- 452 Query: 907 XXXXXSKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVE 1086 SK++++++F KFG+LNE+ET +L DS R+V+ + SDAE AF S SPFG E Sbjct: 453 -----SKDDLIKIFSKFGTLNESETEILYDSFCARVVFSRSSDAEEAFNGSQKASPFGAE 507 Query: 1087 NVNYLLQ 1107 + L+ Sbjct: 508 QMKSNLE 514 >ref|XP_006603798.1| PREDICTED: uncharacterized protein LOC102667448 isoform X1 [Glycine max] gi|571553268|ref|XP_006603799.1| PREDICTED: uncharacterized protein LOC102667448 isoform X2 [Glycine max] gi|571553271|ref|XP_006603800.1| PREDICTED: uncharacterized protein LOC102667448 isoform X3 [Glycine max] Length = 1097 Score = 93.6 bits (231), Expect = 2e-16 Identities = 124/513 (24%), Positives = 205/513 (39%), Gaps = 68/513 (13%) Frame = +1 Query: 10 EVPPLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNSVEPKNPKRA----- 174 E P GP D+ SPG+S + S RKQKS+ E+MGE+ V + Sbjct: 601 EAPTQGPF-DELGHSPGLSGSTSNPV-----RKQKSIAEIMGEDKDVHTAANREVDATVE 654 Query: 175 -----------KVKDEKD------PSRKRK---------VVNAEKKGKPKE--------- 249 K K +D P +KR+ V++A+ GK E Sbjct: 655 MVNAIGLNVGKKRKGSEDNGMALKPVQKRRELLVDTDGDVLSAKNDGKGDEENSSIGSLL 714 Query: 250 --IKVDFTENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXXXXX 423 I+ + G+++E K RERK+SKYLSPP+T P Sbjct: 715 QSIEKKTEAFDEGKSEERNGKGNLSRERKRSKYLSPPFTIPI-----RGQREVYIEPESL 769 Query: 424 XIDRSASPPICKLVDKASHEEMPDVHLKATSRAVE------ENDKKMTFSVSDVDVQVDE 585 + R A K+ +++ + P R+ + ++D + + V E Sbjct: 770 KVSRKA-----KVSQRSAGADGPSSLPVYKGRSFDSSNYQTQDDGETIIDPKKIQAPVKE 824 Query: 586 MLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPET------VXXX 747 +LS+V AA PL S D F A RSS Y G+ ++++K + + Sbjct: 825 VLSQVLDAATSPLIRREGTSLDQFVDFTYAFRSSLYSQGSLCELYKKNQPGRKRKMLESE 884 Query: 748 XXXXXXXLKIQENDDVAQEKPKS-PDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS 924 L + ++ ++ K S P S Sbjct: 885 
EDGMLKELNLSADEHLSSLKQNSGPKKRRKETASGKKGNDENAAGAVLFVSFWPGSSMPS 944 Query: 925 KEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFG-VENVNYL 1101 + ++V ++ KFG+LNE ET++ +++ R+ + + SDAE A+ S +PFG +V + Sbjct: 945 RSDLVSVYSKFGALNEAETDMFRTNYTARVSFLRTSDAEKAYNHSQNNNPFGSPTDVTFQ 1004 Query: 1102 LQHFSAGSQSQPKIPSPQNRDSEKPTDEDF---MSYVRG--------ISRKLEITTAILE 1248 LQ+ S GS+S + R + KP +++ +G I +KL+ T +LE Sbjct: 1005 LQYSSDGSKS--GVQQQGERSNNKPLPAAATAPVAFSQGTEASKLIFIQQKLQGMTLMLE 1062 Query: 1249 NYH-PKFSTEEKSSLKEEMKHLMENVEMVSEKS 1344 S + + L+ EMK L+E+V + E S Sbjct: 1063 EASGGGKSPDMMAKLESEMKALLEDVNKMVEAS 1095 >ref|XP_003618211.1| DNA mismatch repair protein Msh6 [Medicago truncatula] gi|355493226|gb|AES74429.1| DNA mismatch repair protein Msh6 [Medicago truncatula] Length = 882 Score = 83.2 bits (204), Expect = 2e-13 Identities = 85/297 (28%), Positives = 121/297 (40%), Gaps = 56/297 (18%) Frame = +1 Query: 10 EVPPLGPREDDWLSSPGV--SSAKSRDDKIYHRRKQKSVNELMGENNSVEPKNPKRAKVK 183 E P GP E+D+ + P S + SR ++ RRKQKS+ ++MGE+ V+ K+ + Sbjct: 460 EAPSQGPFEEDYSTLPLTPKSGSGSRPNR---RRKQKSIADIMGEDKDVDAKDKEWDASD 516 Query: 184 DE-----KDPSRKRKVVNAE--------------------------------------KK 234 DE K RK+K N + K Sbjct: 517 DEVLVAIKSRGRKKKKDNGDAGTSEPVQKTKELLGGTDTETVRGGKESCEDKENSDGGKS 576 Query: 235 GKPKEIKVDFT----------ENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGX 384 + E K F EN+ G+ KE EK F RERKKSKYLSPP+T Sbjct: 577 QQSDEEKEAFGNDNNSDGSRGENDEGKPKEPNEKGFLSRERKKSKYLSPPFTT------- 629 Query: 385 XXXXXXXXXXXXXXIDRSASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKMTFSVSD 564 + S P+ V K + E + + +D+K T Sbjct: 630 ---------SIRDFVKGRGSGPLSPRVSKYNSEAFQEFEFSVSLNHQTPDDEKETLDPEK 680 Query: 565 VDVQVDEMLSEVQFAAVDPLYLSIKG-SFDMVYAFVSARRSSTYLHGADYKIFQKPE 732 V V E+LS+++ AAV P +S KG S D + FVS RSS Y G+ +K + + E Sbjct: 681 VKVPSVEILSKIRDAAVSP-QISRKGTSSDRLVDFVSVMRSSLYREGSLHKEYNEAE 736 >ref|XP_004489261.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like isoform X2 [Cicer arietinum] Length = 864 Score = 80.5 bits (197), Expect = 2e-12 Identities = 
120/509 (23%), Positives = 194/509 (38%), Gaps = 66/509 (12%) Frame = +1 Query: 19 PLGPREDDWLSSPGVSSAKSRDDKIYHRRKQKSVNELMGENNS--VEPKNPK-------- 168 PL P+ + SP +S ++S RRKQKS+ ++M E+ V KN + Sbjct: 434 PLSPKSGEPCHSPEISGSRSN-----RRRKQKSIADIMWEDKDKDVHTKNKEEDASDEVL 488 Query: 169 -------RAKVKDEKD-----PSRKRKV------------------------------VN 222 R K KD +D P RKRK +N Sbjct: 489 DAIASRGRKKRKDSEDVATSKPVRKRKEFVIDTDGNSAGSGKEGRGDKKNSDKVKSLHLN 548 Query: 223 AEKKGKPKEIKVDFT---ENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXX 393 +K+ E V+ + EN+ G++KEE EK F RERKKSKYLSPP+T + Sbjct: 549 KKKEAFGNESVVNGSKEEENDEGKSKEENEKGFLSRERKKSKYLSPPFTTSIREL----- 603 Query: 394 XXXXXXXXXXXIDRSASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKMTFSVSDVDV 573 R +SP + K + + L +S ++D++ V V Sbjct: 604 VKGSKGTKARDAVRLSSP-----ISKCNSVAFLESKLSDSSNHQTQDDEEKAIDPEKVKV 658 Query: 574 QVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXX 753 ++LS+++ A+ P SFD F+ RSS Y G+ YK ++K Sbjct: 659 SSAKILSKLRSVAISPQISREGASFDRFVDFILVMRSSLYREGSLYKAYKKV-------- 710 Query: 754 XXXXXLKIQENDDVAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEE 933 + K K P+ SK E Sbjct: 711 -------------LPGRKRKKPE---------------------------------SKSE 724 Query: 934 IVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSS-VTESPFGVENVNYLLQH 1110 + L G +V D S I +K+ KS+ +E+ G + + + Sbjct: 725 LEML----GKDQNQSDHVSPDEDSAPIKRRKEKKTTSVQKSTRASETKTGEKGTDE--KS 778 Query: 1111 FSAGSQSQPKIPSPQNRDSEKPT----------DEDFMSYVRGISRKLEITTAILENYHP 1260 +A +S+ K + SE PT ++ +S+++G KL+ ++LE+ Sbjct: 779 SAAVERSKSKASQYNKQKSETPTTPSVSPSQGSEKTKLSFIKG---KLQGLVSMLESSDE 835 Query: 1261 KFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 K S E K+ L+ +K L+E+V ++E +S Sbjct: 836 K-SPEFKTKLEINVKSLLEDVNKMAESTS 863 >gb|EOY02415.1| Tudor/PWWP/MBT superfamily protein, putative isoform 1 [Theobroma cacao] gi|508710519|gb|EOY02416.1| Tudor/PWWP/MBT superfamily protein, putative isoform 1 [Theobroma cacao] Length = 1013 Score = 79.7 bits (195), Expect = 3e-12 Identities = 47/163 (28%), Positives = 84/163 (51%), Gaps = 21/163 (12%) Frame = +1 
Query: 922 SKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYL 1101 +K++++R++ ++G+LN +T++ ++ R+V+ + S+A+ AF SS SPFG NV++ Sbjct: 851 TKDDLIRIYSRYGALNVEDTDMFYNNFCARVVFIRSSEAKQAFNSSQYASPFGASNVSFR 910 Query: 1102 LQHFSAGS---------------------QSQPKIPSPQNRDSEKPTDEDFMSYVRGISR 1218 L+ A S S+ + S ++ D D S + I Sbjct: 911 LRIHPAASAHDHREKPSAKPSPLAKERAKSSKKSLASQKSADQASQNSADQASQLNFIRH 970 Query: 1219 KLEITTAILENYHPKFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 KLE+ T++LE K S+E KS + E+K L+E V + + SS Sbjct: 971 KLEMLTSMLEKSDEKMSSEIKSKVHSEIKGLLEKVNTMVKSSS 1013 >ref|XP_006446521.1| hypothetical protein CICLE_v10014124mg [Citrus clementina] gi|557549132|gb|ESR59761.1| hypothetical protein CICLE_v10014124mg [Citrus clementina] Length = 1025 Score = 79.3 bits (194), Expect = 4e-12 Identities = 74/259 (28%), Positives = 117/259 (45%), Gaps = 36/259 (13%) Frame = +1 Query: 64 SSAKSRDDKIYHRRKQKSVNELMGENN-------------SVEPKNPKR----AKVKDEK 192 S AK + K++ R++K N++ N SVE +R AK + EK Sbjct: 482 SKAKRKTRKVFSSREEKKKNKVSHTKNDDGNKEETNASPVSVEKTTVQRDDGEAKEQVEK 541 Query: 193 D-PSRKRKVVNAEK-KGKPKEIKVDFTENNSGEAKEEPEKVFTPRERKKSKYLSPPYTNP 366 SR+RK N E+ P ++ + + GEAKE+ EK F RERK+SKYLSPPYT+ Sbjct: 542 SFLSRERKRSNREETNASPMSVERKTVQRDDGEAKEQVEKSFLSRERKRSKYLSPPYTSI 601 Query: 367 TWRMGXXXXXXXXXXXXXXXI-------------DRSASPPIC--KLVDKASHEEMPDVH 501 R + +S + +C ++V K + + H Sbjct: 602 NKRQTKKDIEEFLKVSYEAQVAEQMTKAAGNLIGSKSPANLMCSDEVVRKKDAKNVGAEH 661 Query: 502 LKATSRAVE--ENDKKMTFSVSDVDVQVDEMLSEVQFAAVDPLYLSIKGSFDMVYAFVSA 675 K+ S E + D++ V +++S ++ AV+ L + S D+V FVS Sbjct: 662 EKSDSSNPEKMKPDQRTVIDTMKVKASAKDVISGIRSTAVNLDSLK-EDSLDVVEGFVSV 720 Query: 676 RRSSTYLHGADYKIFQKPE 732 RSS Y +G++YKI+ K + Sbjct: 721 FRSSVYSNGSNYKIYNKSQ 739 Score = 78.6 bits (192), Expect = 6e-12 Identities = 49/155 (31%), Positives = 83/155 (53%), Gaps = 13/155 (8%) Frame = +1 Query: 922 SKEEIVRLFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFGVENVNYL 1101 SK+++++ + KFGSLN+ ET + ++H R+V+ + DAE A KSS SPF N + Sbjct: 870 
SKKDLIKFYSKFGSLNKEETEMFYNNHCARVVFLRSYDAEEALKSSQLASPFEASNCKFE 929 Query: 1102 LQHFSAGSQSQPKIPSPQNRDS----------EKPTDEDFM---SYVRGISRKLEITTAI 1242 L++ S+ S+ Q + R S ++P + + S + +KLE+ +++ Sbjct: 930 LRNSSSTSKVQKRKEISNARSSPAKEGGKALKKEPGSKSSIAEASSFNYVKQKLEMVSSV 989 Query: 1243 LENYHPKFSTEEKSSLKEEMKHLMENVEMVSEKSS 1347 L + K + E KS L+ E+K L+E V V SS Sbjct: 990 LADSDGKMTPELKSKLEHEVKDLLEKVNTVVGSSS 1024 >ref|XP_006395397.1| hypothetical protein EUTSA_v10005709mg [Eutrema salsugineum] gi|557092036|gb|ESQ32683.1| hypothetical protein EUTSA_v10005709mg [Eutrema salsugineum] Length = 728 Score = 78.6 bits (192), Expect = 6e-12 Identities = 94/453 (20%), Positives = 174/453 (38%), Gaps = 11/453 (2%) Frame = +1 Query: 73 KSRDDKIYHRRKQKSVNELMGEN-NSVEPKNPKRAKVKDEKDPSRKRKVVNAEKKGKPKE 249 ++RD K + RK+K+ L E+ + VE + E+ ++ + K Sbjct: 346 ETRDAKSFSSRKRKNKRGLEDEDEDGVEKREESNDSNHLEECEKKEDSGMETPMASLCKR 405 Query: 250 IKVDFTE-----NNSGEAKEEPEKVFTPRERKKSKYLSPPYTNPTWRMGXXXXXXXXXXX 414 ++ D + N SGE + K RERKKSKYLSP Y Sbjct: 406 LRSDVSSSVERNNGSGETTVQTGK----RERKKSKYLSPEY------------------- 442 Query: 415 XXXXIDRSASPPICKLVDKASHEEMPDVHLKATSRAVEENDKKMTFSVSDVDVQVDEMLS 594 M D + + +N + ++ +E+L+ Sbjct: 443 ------------------------MTDFGWGSRKSKMTKNAAEKALDFVSLEAAPEEVLN 478 Query: 595 EVQFAAVDPLYLS-IKGSFDMVYAFVSARRSSTYLHGADYKIFQKPETVXXXXXXXXXXL 771 ++ A+D Y+ S DM+ FVS RS TY GA++K Sbjct: 479 LIRSVALDTQYVEDYNSSCDMITEFVSIYRSFTYHDGANHK------------------R 520 Query: 772 KIQEND---DVAQEKPKSPDSEAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKEEIVR 942 I E D ++ Q+ ++ SK++++R Sbjct: 521 NISEEDKQPELKQQVVDEKETVKKMNKRELEEQFSSVELVIKSGFGPSGATLPSKDDLIR 580 Query: 943 LFGKFGSLNENETNVLTDSHSVRIVYKKDSDAELAFKSSVTESPFG-VENVNYLLQHFSA 1119 + KFG+L++N + + ++ R+ + SD E AF+ S+ + PF V + LQ Sbjct: 581 TYEKFGALDKNRSCMFDNNSCARVYFLNVSDGEEAFRKSLKKCPFATTSTVTFKLQF--- 637 Query: 1120 GSQSQPKIPSPQNRDSEKPTDEDFMSYVRGISRKLEITTAILENYHPKFSTEEKSSLKEE 1299 P SP+N + +K + + + + +KLE A+L+ + + E K L++E Sbjct: 638 -----PSSDSPENTNEKKEAE---IMEIECLKKKLEEMRALLDQSQGEVTQELKMKLEDE 
689

Query: 1300 MKHLMENVEMVSEKSSLKEEMKHLMENVEMVSE 1398
             ++ ++ V  ++    +++ +    + V  VSE
Sbjct: 690  SRNFLDKVRKMNVCLLVEDFLAFRRQKVLKVSE 722