BLASTX nr result
ID: Mentha28_contig00021793
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00021793
         (2272 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU29658.1| hypothetical protein MIMGU_mgv1a006859mg [Mimulus...   248   1e-62
ref|XP_006349838.1| PREDICTED: uncharacterized protein LOC102584...   110   4e-21
ref|XP_002272609.1| PREDICTED: uncharacterized protein LOC100258...   110   4e-21
ref|XP_007035794.1| Uncharacterized protein isoform 1 [Theobroma...   101   2e-18
emb|CAN76673.1| hypothetical protein VITISV_011790 [Vitis vinifera]   100   2e-18
ref|XP_004253153.1| PREDICTED: uncharacterized protein LOC101259...    97   3e-17
ref|XP_006597704.1| PREDICTED: uncharacterized protein LOC100816...    91   2e-15
ref|XP_006597703.1| PREDICTED: uncharacterized protein LOC100816...    91   2e-15
ref|XP_006597702.1| PREDICTED: uncharacterized protein LOC100816...    91   2e-15
ref|XP_006841433.1| hypothetical protein AMTR_s00003p00049560 [A...    74   2e-10
ref|XP_004295083.1| PREDICTED: uncharacterized protein LOC101308...    74   4e-10
ref|XP_006586892.1| PREDICTED: uncharacterized protein LOC100816...    72   1e-09
ref|XP_006586891.1| PREDICTED: uncharacterized protein LOC100816...    72   1e-09
ref|XP_003609773.1| Pre-mRNA polyadenylation factor fip1 [Medica...    70   3e-09
ref|XP_006857169.1| hypothetical protein AMTR_s00065p00171490 [A...    70   4e-09
gb|EXB71059.1| hypothetical protein L484_004194 [Morus notabilis]      68   2e-08
ref|XP_007147362.1| hypothetical protein PHAVU_006G117800g [Phas...    67   3e-08
ref|XP_006651314.1| PREDICTED: uncharacterized protein LOC102703...
67 4e-08 gb|EXB82160.1| hypothetical protein L484_005444 [Morus notabilis] 66 8e-08 ref|XP_006378540.1| hypothetical protein POPTR_0010s15520g [Popu... 65 1e-07 >gb|EYU29658.1| hypothetical protein MIMGU_mgv1a006859mg [Mimulus guttatus] Length = 428 Score = 248 bits (632), Expect = 1e-62 Identities = 186/536 (34%), Positives = 258/536 (48%), Gaps = 9/536 (1%) Frame = -2 Query: 1761 MDSEALGYNR-YHNQRRHPFHGDIEGVRNFSPKY-SSAVDQSGSQFMHR-EGVHLRRTRQ 1591 MD EA N YH QRRH H ++E R F P Y SSAV+ S +QF ++ + VH RRT++ Sbjct: 1 MDYEATEDNNWYHKQRRHVVHSNMEVSRKFLPNYHSSAVNMSDTQFRNKGDEVHFRRTKR 60 Query: 1590 DFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDRRIQPDRVNCRYNQLVAHDRREVDSS 1411 +LSPL D+++ + RR D + RY Q + RRE + S Sbjct: 61 HYLSPLRDYNN-----------------DAKEKSRRFIRDYPDHRYGQNIDRQRRETERS 103 Query: 1410 GRGKRRHRSP-VSREDLCYIDTEVNERKNIKHQPFPFKSSEEPYASDRGVFLGAPGPKFG 1234 G RR +P +S ++L Y + E N R+ +K + PF S Sbjct: 104 VSGNRRRDNPHISSDNLWYKEGEDNGRRCVKQRHLPFYSHLA------------------ 145 Query: 1233 VARRNMRCSWKEMCIESDQYGTDVTTFFTRESLRYHPPEDFHVRRRDFPPSSNTNITRES 1054 + + + Y T+ T ES++ Sbjct: 146 ----------ENQHVRNKHYLQSTDTYITNESIK-------------------------- 169 Query: 1053 MKGNQYQNHFVRRRHNQHSEVLLPREDEYKSWKQDNIVFGSEEPSHNVKRMSKNDEADDR 874 +HFVRRRH +E L RE+ YKSW+QDN +F SE PS++ + SKND DR Sbjct: 170 -------DHFVRRRH--QTEALHSREEVYKSWQQDNTIFHSERPSYHYPKKSKNDRLGDR 220 Query: 873 PAFGHVTKVNKRERGRKNSEISREEDISDHFDGCHETPKLNSHEQTHSSHKESVD-WLVV 697 AFG V + FDGC + + + Q H ++ SVD LVV Sbjct: 221 HAFGRVAE----------------------FDGCLKFIEADKCVQMHRKYQYSVDSRLVV 258 Query: 696 VGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVDLDGSEPEETV---TNEVK 526 V +K T S RR +E G+D ++DL N+ P NL DL + E+ T+E K Sbjct: 259 VDKKRTTPQSSRRASEDGDDFNCHKNDLTESNANQNPGNLEDLGDFKLEKAASISTDERK 318 Query: 525 -PDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEETKNKDASQMNATSNNNGVVEKLDDE 349 ++LS KN K+SEN N L++EEGQI EE+ +A++ VVE L DE Sbjct: 319 VKTTNLSDKNWQDKFSENPKNECLDVEEGQIIGEESNGHTVK--SASNGTAAVVESLGDE 376 Query: 348 KIKEIMVKMERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRW 181 KI+EIM 
KMERRRERFKE IT+S+D K+S+L ++ E +L+RPARKRRW Sbjct: 377 KIQEIMAKMERRRERFKEQITLSRDSAKSSNLASET-----AFEGKLERPARKRRW 427 >ref|XP_006349838.1| PREDICTED: uncharacterized protein LOC102584286 [Solanum tuberosum] Length = 1130 Score = 110 bits (274), Expect = 4e-21 Identities = 176/702 (25%), Positives = 283/702 (40%), Gaps = 49/702 (6%) Frame = -2 Query: 2139 TGDGRYHERWIDSTQEHSNH----PKRSNYNRP----DEDSSYATNAKHLYNRHVNHGKH 1984 +GD +Y R S Q H P R + P DEDS + ++A+ LY R ++ Sbjct: 488 SGDPKYFTRGRRSVQRELLHDRRRPGRMSGTIPAHLKDEDS-HKSDARILYER-----RN 541 Query: 1983 RDMVNLKYNDSCVPYYSHSERIMAY---------SDGRLHDHHFGPAFWKDQYWDIP-NY 1834 ++ + D + SH ++ + GR D+ +F K+ + Sbjct: 542 STVIRYRQRDRRYAFDSHEREDTSHFKRAEPVYSNAGRFSDYPCRDSFTKNPEMEHQLRC 601 Query: 1833 RYQPGHPDGHNVSERQNLSDKKGSMDSEALGYNRYHNQRRH--------PFHGDIEGVRN 1678 +Y G +V + + + D E L +R H RR PFH E + Sbjct: 602 KYDKNWSGGRSVKRKLDPLELSIYTDDELLERDRPHYGRRLTVQDMDTVPFH---ESEQW 658 Query: 1677 FSPKYSSAVDQSGSQFMHR--EGVHLRRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGAR 1504 F S + D++ SQ M + + +R R D L ++ + R RP Sbjct: 659 FDKYISYSDDENPSQRMRKIDQLPSKKRVRTDDLVTECNYIYDIMEETDNRYRPY----- 713 Query: 1503 DNDHDRRIQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPV-SREDLCYIDTEVNERKN 1327 N D I D Y+ + + RRE+ S RGKRR SP S D+C++D + E + Sbjct: 714 -NHRDTNILEDG----YHVNLTYFRREIKSPSRGKRRDVSPCKSSNDICFMDLKDEEGRF 768 Query: 1326 IKHQPFPFKSSEEPYASD-RGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFF 1150 ++P F+ E S R P + G+ +C G ++T Sbjct: 769 DGYRPPSFRLYRESCTSSRRWQSPELPRGRHGIFSGTRKCDG----------GANLTNSI 818 Query: 1149 TRESLRYHPPEDFHVRRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPREDE 970 + +P GN Q++F RRR Q SE + EDE Sbjct: 819 GSDQTSKYP-------------------------GN--QDNFKRRRGGQQSEGMQWVEDE 851 Query: 969 YKSWKQDNIVFGSEEPSHNVKRMSKN---DEADDRPAFGHVTK-VNKRERGRKNSEISRE 802 S Q NI F +E S++ +R S + + D+ V K ++ R ++ ++ RE Sbjct: 852 NSSRYQQNI-FDAERTSYSFRRSSSDRRFNSFDNNHGPNPVEKLLDDRHVEQEKYKLIRE 910 Query: 801 EDISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGR 622 + + F + ++H + ++SVD ++V S R ++AG T + R Sbjct: 911 
GNNASQFGQGSKVFHKDNHWRRFPRGRDSVDTGLIVEN----GESSGRCSKAGGVTSFDR 966 Query: 621 SDLVGLTVNKEPNNLVDLDG-SEP---EETVTNEVKPDSSLSIKNQPSKYSENKLNLSLE 454 + E L +DG S+P + T V D + K + +S+ SL+ Sbjct: 967 YSHLDSDSYVE---LKPIDGTSKPHFRKTLRTRNVTTDPKENDKGRLDIFSDANQEESLD 1023 Query: 453 IEEGQINNEETK-----------NKDASQMNATSNNNGVVEKLDDEKIKEIMVKMERRRE 307 IEEGQI E + S+M + + V + ++ +I EIM KME+R E Sbjct: 1024 IEEGQIIEEMNEKIIKKRITCSGKSQISEMKNFAYDKNVEGQDNNPRILEIMAKMEKRGE 1083 Query: 306 RFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRW 181 RFK+PI + D + S L+DS + E RPARKRRW Sbjct: 1084 RFKQPIALKSDTKNVSKPLVDSFALS--TEPMQPRPARKRRW 1123 >ref|XP_002272609.1| PREDICTED: uncharacterized protein LOC100258583 [Vitis vinifera] gi|296083247|emb|CBI22883.3| unnamed protein product [Vitis vinifera] Length = 1300 Score = 110 bits (274), Expect = 4e-21 Identities = 118/452 (26%), Positives = 198/452 (43%), Gaps = 23/452 (5%) Frame = -2 Query: 1458 RYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNERKNIKHQPFPFKSSEEPYA 1279 +Y + V R+V+ GR KR + + I E +++ HQ S EP+ Sbjct: 890 KYGRHVPSTGRKVNLYGRRKRYEDGHLDLDSSWSIGVEDEYGRHVDHQSLSSWSYREPHT 949 Query: 1278 SD--RGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRESLRYHPPEDFHV 1105 ++ V + G RR + + ESD +G D + T++S+ P+D Sbjct: 950 ANGRNDVNDSRLTERHGRDRRQI---CPQGYRESDWFGNDNDAYNTKDSII--GPDD--- 1001 Query: 1104 RRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPREDEYKSWKQDNIVFGSEE 925 Q RRR + E L E E S D ++ +EE Sbjct: 1002 -----------------------QVQIGRRRSRRQYEALHWTEKELISSHLDENLY-NEE 1037 Query: 924 PSHNVKRMSKNDEADDRPAFGHVTKV--NKRERGRKNSEISREEDISDHFDGCHETPKLN 751 S + +R S + + HV + NK+ + ++ I RE D D Sbjct: 1038 ASLSYERTSGHTRIHTKYGSAHVGMLVHNKKSQQQRYKRI-REGRSDDFIDRSSNVLGQG 1096 Query: 750 SHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVD 571 +HEQ + SVD +V G+ S R +EA ++ R + + ++++ L D Sbjct: 1097 NHEQAVLRSRASVDLIVGEGK------SSGRRSEARSAVHHDRFENMDWKIDEDQGILKD 1150 Query: 570 LDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEE------TKNKD 409 ++G + + + ++K +S+ + + K+ + + +L+IEEGQI EE + KD Sbjct: 1151 
VNGPQRGKIIQPDLKSESNWNNEKCLDKFLVTEHDEALDIEEGQIIPEEMNEDDSVETKD 1210 Query: 408 ASQMNATSNN-------------NGVVEKLDDEKIKEIMVKMERRRERFKEPITMSKDGE 268 AS+ S N N VV + D+++I + + KME+R+ERFK+PIT+ K+ + Sbjct: 1211 ASESITPSRNVKRRLGNANAANGNKVVAECDNQRILQTLAKMEKRQERFKKPITLKKEPD 1270 Query: 267 KTSSLLLDSNVETEVAEARLQRPARKRRWLGT 172 K +D V E+AE QRP RKRRW G+ Sbjct: 1271 KIPKPQVDPIV--EMAETMQQRPLRKRRWNGS 1300 >ref|XP_007035794.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508714823|gb|EOY06720.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1247 Score = 101 bits (251), Expect = 2e-18 Identities = 138/530 (26%), Positives = 225/530 (42%), Gaps = 34/530 (6%) Frame = -2 Query: 1668 KYSSAVDQSGSQFMHR-EGVHLRRTRQDFLSPL-HDHDDRFVGGKYGRTRPSSGGARDND 1495 +YSSA + Q+ +G+ LR+ PL + H++ + KYGR+ P + RD Sbjct: 775 RYSSASKERDIQWRRGYDGLQLRKKTDHDDCPLDYKHENERLKEKYGRSIPFTRCERD-- 832 Query: 1494 HDRRIQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNERKNIKHQ 1315 ++P Y + + RRE SGR K R+ P Y + + Sbjct: 833 ---MVEP------YERWLPPIRREFKVSGR-KGRYVDPA------YFPLD---------R 867 Query: 1314 PFPFKSSEE-PYASDRGVFLGAPG-PKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRE 1141 P+P +S E + R + L P RR W+ + ++ F ++ Sbjct: 868 PWPMESEEYLRHTYCRSLALETDREPSVPNGRR-----WRNTLLSRNE------AFDSKF 916 Query: 1140 SLRYHPPEDFHVRRRDFPPSS-------NTNITRESMKGNQYQNHFVRRRHNQHSEVLLP 982 RYH + D + N GNQ Q+ RR H+Q V+ Sbjct: 917 IKRYHRHQRIVCHEEDGDNGRCGCYDYVDDNEDGILQNGNQVQSW--RRGHSQRGRVV-- 972 Query: 981 REDEYKSWKQDNIVFG----SEEPSHNVKRMSKNDEADDRP-AFGHVTKVNKRERGRKNS 817 W +D ++ ++ S + ++ SK+D R + +N Sbjct: 973 ------HWTKDKLLGNDRLLAQWVSFSCQKTSKHDLIHARHGSLRDEMLINDLMLEHHGY 1026 Query: 816 EISREEDISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGED 637 E+ E ++ CHE + +Q ++SVD +V G+ SS+R + G Sbjct: 1027 EMITEGSNAN----CHEGNSIIRQKQKVLKDRDSVDLIVGEGK-----SSVRHL-DGGSL 1076 Query: 636 TYYGRSDLVGLTVNKEPNNLVDLDGSEPEETVTNEVK-PDSSLSIKNQPSKYSENKLNLS 460 GR + +GL E +L D++ S V ++ D S +I+ Q K+S + N Sbjct: 1077 ICNGRLEKIGLEFPMEQKSLRDVNDSCGGNRVKTDISNTDGSRTIEKQLDKFSVAECNQD 
1136 Query: 459 LEIEEGQ-INNEETKNKDASQMNAT----------------SNNNGVVEKLDDEKIKEIM 331 L+IEEGQ I E++ N + ++ T S+ N V + D+++I E + Sbjct: 1137 LDIEEGQTICEEQSINLEKENVSETMVQRSKVKMRTLHVDSSDGNRAVGEYDNKRIVETL 1196 Query: 330 VKMERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRW 181 KME+RRERFK+PIT+ + +KTS +D V+T E + QRPARKRRW Sbjct: 1197 AKMEKRRERFKDPITIKMEPDKTSEPQVDLVVDTN--EIKHQRPARKRRW 1244 >emb|CAN76673.1| hypothetical protein VITISV_011790 [Vitis vinifera] Length = 1338 Score = 100 bits (250), Expect = 2e-18 Identities = 120/485 (24%), Positives = 202/485 (41%), Gaps = 56/485 (11%) Frame = -2 Query: 1458 RYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNERKNIKHQPFPFKSSEEPYA 1279 +Y + V R+V+ GR KR + + I E +++ HQ S EP+ Sbjct: 890 KYGRHVPSTGRKVNLYGRRKRYEDGHLDLDSSWSIGVEDEYGRHVDHQSLSSWSYREPHT 949 Query: 1278 SD--RGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRESLRYHPPEDFHV 1105 ++ V + G RR + + ESD +G D + T++S+ P+D Sbjct: 950 ANGRNDVNDSRLTERHGRDRRQI---CPQGYRESDWFGNDNDAYNTKDSII--GPDD--- 1001 Query: 1104 RRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPREDEYKSWKQDNIVFGSEE 925 Q RRR + E L E E S D ++ +EE Sbjct: 1002 -----------------------QVQIGRRRSRRQYEALHWTEKELISSHLDENLY-NEE 1037 Query: 924 PSHNVKRMSKNDEADDRPAFGHVTKV--NKRERGRKNSEISREEDISDHFDGCHETPKLN 751 S + +R S + + HV + NK+ + ++ I RE D D Sbjct: 1038 ASLSYERTSGHTRIHTKYGSAHVGMLVHNKKSQQQRYKRI-REGRSDDFIDRSSNVLGQG 1096 Query: 750 SHEQTHSSHKESVDWLVVVGRKC---------------------------------TLQS 670 +HEQ + SVD +V G KC + ++ Sbjct: 1097 NHEQXVLRSRASVDLIVGEG-KCVASAFMAGSKAEYSQNVSHKIESFALAPTKDLLSFEN 1155 Query: 669 SIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVDLDGSEPEETVTNEVKPDSSLSIKNQPS 490 S R +EA ++ R + + ++++ L D++G + + + ++K +S+ + + Sbjct: 1156 SSGRRSEARSAVHHDRFENMDWKIDEDQGILKDVNGPQRGKIIQPDLKSESNWNNEKCLD 1215 Query: 489 KYSENKLNLSLEIEEGQINNEE------TKNKDASQMNATSNN-------------NGVV 367 K+ + + +L+IEEGQI EE + KDAS+ S N N VV Sbjct: 1216 KFLVTEHDEALDIEEGQIIPEEMNXDDSVETKDASESITPSRNVKRRLGNANAANGNKVV 1275 Query: 366 EKLDDEKIKEIMVKMERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKR 187 + D+++I + 
+ KME+R+ERFK+PIT+ K+ +K +D V E+AE QRP RKR Sbjct: 1276 AECDNQRILQTLAKMEKRQERFKKPITLKKEPDKIPKPQVDPIV--EMAETMQQRPLRKR 1333 Query: 186 RWLGT 172 RW G+ Sbjct: 1334 RWNGS 1338 >ref|XP_004253153.1| PREDICTED: uncharacterized protein LOC101259137 [Solanum lycopersicum] Length = 1130 Score = 97.1 bits (240), Expect = 3e-17 Identities = 182/743 (24%), Positives = 292/743 (39%), Gaps = 51/743 (6%) Frame = -2 Query: 2256 EKSHIRSRYSSPSLRRESQEPIAQKYCPPKDSERCGVRGTGDGRYHERWIDSTQEHSNH- 2080 EKSH + E +E Y P ++ + +GD +Y + S Q H Sbjct: 449 EKSHDHHTRLISNAESELREKGTTDYQPISRTDHNRTK-SGDFKYFTQGRRSVQRDLLHD 507 Query: 2079 ---PKRSNYNRP----DEDSSYATNAKHLYNRHVNHGKHRDMVNLKYNDSCVPYYSHSER 1921 P R P DEDS + ++A+ LY R ++ ++ + D + SH Sbjct: 508 RRRPGRMGETIPAHLKDEDS-HKSDARILYER-----RNSSVIRHRQRDRRYAFDSHERE 561 Query: 1920 IMAY---------SDGRLHDHHFGPAFWKDQYWDIP-NYRYQPGHPDGHNVSERQNLSDK 1771 ++ + GR D+ +F K+ + RY G +V + + + Sbjct: 562 DTSHFKRAEPFYSNAGRFSDYPCRGSFTKNPQMEYQLRCRYDKNWSGGRSVKRKLDHLEL 621 Query: 1770 KGSMDSEALGYNRYHNQRRHPFHGDIEGV-----RNFSPKYSSAVDQSGSQFMHREGVHL 1606 D + L +R H R D+E + + KY S D R+ L Sbjct: 622 STYTDDKLLERDRPHYGGRLTVQ-DMENISFHESEQWIDKYISYSDDENPSQRIRKIDQL 680 Query: 1605 ---RRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDRRIQPDRVNCRYNQLVAH 1435 +R R D L ++ + R RP N D I D Y+ + + Sbjct: 681 PKKKRVRTDDLVTECNYIYDIMEETDNRYRPY------NHRDTDILEDG----YDVNLTY 730 Query: 1434 DRREVDSSGRGKRRHRSPV-SREDLCYIDTEVNERKNIKHQPFPFKSSEEPYASDRGVFL 1258 RRE+ S RG+RR SP S D+C++D L Sbjct: 731 FRREIKSPSRGQRRDISPCKSSNDICFMD------------------------------L 760 Query: 1257 GAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRESLRYHPPEDFHVRRRD---FP 1087 G +F R + C ++E C S ++ + E R R+ D F Sbjct: 761 KDMGGRFDGYRPSSFCLYRESCTSSRRWQS-------LELPRGRNRIFSGTRKCDGGQFA 813 Query: 1086 PSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPREDEYKSWKQDNIVFGSEEPSHNVK 907 +N+ +++K Q+ F RRR + SE + EDE S Q+N VF +E S++ + Sbjct: 814 SLTNSIGANQTIKYPANQDIFKRRRGGRQSEGMQWVEDENNSGYQEN-VFDAERTSYSFR 872 Query: 906 RMSKNDEA---DDRPAFGHVTKV-NKRERGRKNSEISREEDISDHFDGCHETPKLNSHEQ 739 R S + D+ 
V K+ + R ++ ++ RE + ++ F + ++H + Sbjct: 873 RTSSDKRFKSFDNNHGPNPVEKLLDDRHVEQEKYKLIREGNNANQFGQGSKVFHKDNHWR 932 Query: 738 THSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVDLDGS 559 ++SVD ++V S R ++AG T + R G + L +DG+ Sbjct: 933 RFPRGRDSVDTDLIVENG----ESSGRCSKAGGVTSFDR---YGHLDSDCYLKLKPVDGT 985 Query: 558 EP----EETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEET---------- 421 E T V D + K + + +S+ SL+IEEGQI E Sbjct: 986 SKLHFRETLRTRNVTTDPKENDKERLAIFSDANQEESLDIEEGQIIEEMNEKIVKKRITY 1045 Query: 420 --KNKDASQMNATSNNNGVVEKLDDEKIKEIMVKMERRRERFKEPITMSKDGEKTSSLLL 247 K++ N + N VE KI EI+ KME+R ERFK+PI + D + S+ L+ Sbjct: 1046 SGKSEIGEMKNFATGKN--VEGQGSPKILEIIAKMEKRGERFKQPIALKSDTKNISTPLV 1103 Query: 246 DS-NVETEVAEARLQRPARKRRW 181 DS V TE + RPARKRRW Sbjct: 1104 DSFAVSTEPMQ---PRPARKRRW 1123 >ref|XP_006597704.1| PREDICTED: uncharacterized protein LOC100816009 isoform X3 [Glycine max] Length = 1101 Score = 91.3 bits (225), Expect = 2e-15 Identities = 156/709 (22%), Positives = 265/709 (37%), Gaps = 57/709 (8%) Frame = -2 Query: 2130 GRYHERWIDSTQEHSNHPKRSNYNRPDEDS-----SYATNAKHLYNRHVNHGKHRDMVNL 1966 G++ + W + + + H N + +++ S A N L +R V++G+H+D + + Sbjct: 493 GQFRKEWRNQSGGYEPHSYDMNKHTENDNDVSILKSSARNLSLLAHRPVDYGRHKDQLQV 552 Query: 1965 KYNDSCVPYYSHSERIMAYSDGRLHDHHFGPAFWKDQYWDIPNYRYQPGHPDGHNVSERQ 1786 + SH R ++ + +++G D+ + ++R + H D + E Sbjct: 553 --------FGSHKRRDLSCNRETKQSYYYGGEKVIDE---LVSWRSKYYHEDRESFRENT 601 Query: 1785 NLSDKK-------------GSMDSEALGYNRYHNQRRHPFHG----DIEGVRNFSPKYSS 1657 N D+K G DSE + YH H R F PK+SS Sbjct: 602 NRYDRKNGDVGDYFFEPGPGFADSEDRDRDWYHLGCGHSSDDLCPCSYRESRQFPPKHSS 661 Query: 1656 AVDQ---SGSQFMHREGVHLRRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDR 1486 D+ + + M + + R DF + + F+ Y + + D++R Sbjct: 662 FPDKERYTPRKRMDEKSLIERNCIDDF----DECEFEFLNKSYRMSTVAEREQEFLDNNR 717 Query: 1485 RIQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNER--------- 1333 Q + + RR V RG+R + P+ +LC EV + Sbjct: 718 EEQ-------FPHIYRDWRRSVR---RGRRFDKPPLVLNNLCSGTMEVEDNCQKYTHFRT 767 Query: 1332 
KNIKHQPFPFKSSEEPYASDRGVF--LGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVT 1159 N KH+ + S + YA V LG G + AR N +W D T Sbjct: 768 SNFKHRRQSYTDSVKNYAYGSRVNGNLGGSG-RDKHARDNRDSNWS----------CDYT 816 Query: 1158 TFFTRESLRYHPPEDFHVRRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPR 979 E R P +++ R PS Sbjct: 817 DTAEDEDFRICPVKEYQFYRS---PS---------------------------------- 839 Query: 978 EDEYKSWKQDNIVFGSEEPSHNVKRMSKNDEADDRPAFGHVTKVNKRERGRKNSEISREE 799 ++ +W +D I+F E +H +K ++DD P H + KR+ + Sbjct: 840 --KFLNWTEDEIIFMRHE-THATSLFTKV-QSDDLPLQQHQLSMPKRDNEK--------- 886 Query: 798 DISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRS 619 +F G + + Q ++SVD + G+ S + + GR Sbjct: 887 ----YFKGSSKIMCRSKGGQAVLRCRKSVDLIHGEGKSQVRSSRV---------SCNGRL 933 Query: 618 DLVGLTVNKEPNNL-VDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEG 442 + V + K+ V D S + K +S+L K + S +IEEG Sbjct: 934 ENVNQGIAKKRKRASVGFDESNKNTFKFDSPKYESNLKSKKWVQNLQDQAQKESSDIEEG 993 Query: 441 QINNEE-------TKNKDASQMNATSNN-------------NGVVEKLDDEKIKEIMVKM 322 QI EE +DAS+ A +++ + + D ++I + + KM Sbjct: 994 QIVAEEPYMEKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKM 1053 Query: 321 ERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 E+RRERFK+P+TM K+ E++ L DS V+T E + RP RKRRW+G Sbjct: 1054 EKRRERFKQPMTMKKEAEESLKLNNDSIVDT--GEMKQHRPTRKRRWVG 1100 >ref|XP_006597703.1| PREDICTED: uncharacterized protein LOC100816009 isoform X2 [Glycine max] Length = 1101 Score = 91.3 bits (225), Expect = 2e-15 Identities = 156/709 (22%), Positives = 265/709 (37%), Gaps = 57/709 (8%) Frame = -2 Query: 2130 GRYHERWIDSTQEHSNHPKRSNYNRPDEDS-----SYATNAKHLYNRHVNHGKHRDMVNL 1966 G++ + W + + + H N + +++ S A N L +R V++G+H+D + + Sbjct: 493 GQFRKEWRNQSGGYEPHSYDMNKHTENDNDVSILKSSARNLSLLAHRPVDYGRHKDQLQV 552 Query: 1965 KYNDSCVPYYSHSERIMAYSDGRLHDHHFGPAFWKDQYWDIPNYRYQPGHPDGHNVSERQ 1786 + SH R ++ + +++G D+ + ++R + H D + E Sbjct: 553 --------FGSHKRRDLSCNRETKQSYYYGGEKVIDE---LVSWRSKYYHEDRESFRENT 601 Query: 1785 NLSDKK-------------GSMDSEALGYNRYHNQRRHPFHG----DIEGVRNFSPKYSS 1657 N D+K G DSE + YH H R F 
PK+SS Sbjct: 602 NRYDRKNGDVGDYFFEPGPGFADSEDRDRDWYHLGCGHSSDDLCPCSYRESRQFPPKHSS 661 Query: 1656 AVDQ---SGSQFMHREGVHLRRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDR 1486 D+ + + M + + R DF + + F+ Y + + D++R Sbjct: 662 FPDKERYTPRKRMDEKSLIERNCIDDF----DECEFEFLNKSYRMSTVAEREQEFLDNNR 717 Query: 1485 RIQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNER--------- 1333 Q + + RR V RG+R + P+ +LC EV + Sbjct: 718 EEQ-------FPHIYRDWRRSVR---RGRRFDKPPLVLNNLCSGTMEVEDNCQKYTHFRT 767 Query: 1332 KNIKHQPFPFKSSEEPYASDRGVF--LGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVT 1159 N KH+ + S + YA V LG G + AR N +W D T Sbjct: 768 SNFKHRRQSYTDSVKNYAYGSRVNGNLGGSG-RDKHARDNRDSNWS----------CDYT 816 Query: 1158 TFFTRESLRYHPPEDFHVRRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPR 979 E R P +++ R PS Sbjct: 817 DTAEDEDFRICPVKEYQFYRS---PS---------------------------------- 839 Query: 978 EDEYKSWKQDNIVFGSEEPSHNVKRMSKNDEADDRPAFGHVTKVNKRERGRKNSEISREE 799 ++ +W +D I+F E +H +K ++DD P H + KR+ + Sbjct: 840 --KFLNWTEDEIIFMRHE-THATSLFTKV-QSDDLPLQQHQLSMPKRDNEK--------- 886 Query: 798 DISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRS 619 +F G + + Q ++SVD + G+ S + + GR Sbjct: 887 ----YFKGSSKIMCRSKGGQAVLRCRKSVDLIHGEGKSQVRSSRV---------SCNGRL 933 Query: 618 DLVGLTVNKEPNNL-VDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEG 442 + V + K+ V D S + K +S+L K + S +IEEG Sbjct: 934 ENVNQGIAKKRKRASVGFDESNKNTFKFDSPKYESNLKSKKWVQNLQDQAQKESSDIEEG 993 Query: 441 QINNEE-------TKNKDASQMNATSNN-------------NGVVEKLDDEKIKEIMVKM 322 QI EE +DAS+ A +++ + + D ++I + + KM Sbjct: 994 QIVAEEPYMEKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKM 1053 Query: 321 ERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 E+RRERFK+P+TM K+ E++ L DS V+T E + RP RKRRW+G Sbjct: 1054 EKRRERFKQPMTMKKEAEESLKLNNDSIVDT--GEMKQHRPTRKRRWVG 1100 >ref|XP_006597702.1| PREDICTED: uncharacterized protein LOC100816009 isoform X1 [Glycine max] Length = 1104 Score = 91.3 bits (225), Expect = 2e-15 Identities = 156/709 (22%), Positives = 265/709 (37%), Gaps = 57/709 (8%) Frame = -2 Query: 2130 
GRYHERWIDSTQEHSNHPKRSNYNRPDEDS-----SYATNAKHLYNRHVNHGKHRDMVNL 1966 G++ + W + + + H N + +++ S A N L +R V++G+H+D + + Sbjct: 496 GQFRKEWRNQSGGYEPHSYDMNKHTENDNDVSILKSSARNLSLLAHRPVDYGRHKDQLQV 555 Query: 1965 KYNDSCVPYYSHSERIMAYSDGRLHDHHFGPAFWKDQYWDIPNYRYQPGHPDGHNVSERQ 1786 + SH R ++ + +++G D+ + ++R + H D + E Sbjct: 556 --------FGSHKRRDLSCNRETKQSYYYGGEKVIDE---LVSWRSKYYHEDRESFRENT 604 Query: 1785 NLSDKK-------------GSMDSEALGYNRYHNQRRHPFHG----DIEGVRNFSPKYSS 1657 N D+K G DSE + YH H R F PK+SS Sbjct: 605 NRYDRKNGDVGDYFFEPGPGFADSEDRDRDWYHLGCGHSSDDLCPCSYRESRQFPPKHSS 664 Query: 1656 AVDQ---SGSQFMHREGVHLRRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDR 1486 D+ + + M + + R DF + + F+ Y + + D++R Sbjct: 665 FPDKERYTPRKRMDEKSLIERNCIDDF----DECEFEFLNKSYRMSTVAEREQEFLDNNR 720 Query: 1485 RIQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEVNER--------- 1333 Q + + RR V RG+R + P+ +LC EV + Sbjct: 721 EEQ-------FPHIYRDWRRSVR---RGRRFDKPPLVLNNLCSGTMEVEDNCQKYTHFRT 770 Query: 1332 KNIKHQPFPFKSSEEPYASDRGVF--LGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVT 1159 N KH+ + S + YA V LG G + AR N +W D T Sbjct: 771 SNFKHRRQSYTDSVKNYAYGSRVNGNLGGSG-RDKHARDNRDSNWS----------CDYT 819 Query: 1158 TFFTRESLRYHPPEDFHVRRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPR 979 E R P +++ R PS Sbjct: 820 DTAEDEDFRICPVKEYQFYRS---PS---------------------------------- 842 Query: 978 EDEYKSWKQDNIVFGSEEPSHNVKRMSKNDEADDRPAFGHVTKVNKRERGRKNSEISREE 799 ++ +W +D I+F E +H +K ++DD P H + KR+ + Sbjct: 843 --KFLNWTEDEIIFMRHE-THATSLFTKV-QSDDLPLQQHQLSMPKRDNEK--------- 889 Query: 798 DISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRS 619 +F G + + Q ++SVD + G+ S + + GR Sbjct: 890 ----YFKGSSKIMCRSKGGQAVLRCRKSVDLIHGEGKSQVRSSRV---------SCNGRL 936 Query: 618 DLVGLTVNKEPNNL-VDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEG 442 + V + K+ V D S + K +S+L K + S +IEEG Sbjct: 937 ENVNQGIAKKRKRASVGFDESNKNTFKFDSPKYESNLKSKKWVQNLQDQAQKESSDIEEG 996 Query: 441 QINNEE-------TKNKDASQMNATSNN-------------NGVVEKLDDEKIKEIMVKM 322 QI EE +DAS+ A +++ + + D ++I + + KM Sbjct: 997 
QIVAEEPYMEKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKM 1056 Query: 321 ERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 E+RRERFK+P+TM K+ E++ L DS V+T E + RP RKRRW+G Sbjct: 1057 EKRRERFKQPMTMKKEAEESLKLNNDSIVDT--GEMKQHRPTRKRRWVG 1103 >ref|XP_006841433.1| hypothetical protein AMTR_s00003p00049560 [Amborella trichopoda] gi|548843454|gb|ERN03108.1| hypothetical protein AMTR_s00003p00049560 [Amborella trichopoda] Length = 1203 Score = 74.3 bits (181), Expect = 2e-10 Identities = 57/177 (32%), Positives = 89/177 (50%), Gaps = 9/177 (5%) Frame = -2 Query: 678 LQSSIRRTTEAGEDTYYGRSD-----LVGLTVNKEPNNLVDLDGSEPEET----VTNEVK 526 + S I R + +++ SD +T NKE + ++ EE VT VK Sbjct: 1043 INSKIERVSHRNKESSSDHSDDKWLDKFPITQNKEDGSGQQKKDAKVEEPKKIEVTKTVK 1102 Query: 525 PDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEETKNKDASQMNATSNNNGVVEKLDDEK 346 +S + PS + + + S+ NE+ K A+ +NN +V K+++E+ Sbjct: 1103 --KKVSKRTTPSSIIKERFSGSM--------NEKAHQKGAN------DNNKMVTKINNER 1146 Query: 345 IKEIMVKMERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 I E M KME+R+ERFKEPI +K+ EK S+ +++ E E + QRP RKRRW G Sbjct: 1147 ILETMAKMEKRKERFKEPIVSNKEPEKISN-APSVSIQVEETEVKGQRPQRKRRWCG 1202 >ref|XP_004295083.1| PREDICTED: uncharacterized protein LOC101308556 [Fragaria vesca subsp. 
vesca] Length = 408 Score = 73.6 bits (179), Expect = 4e-10 Identities = 95/346 (27%), Positives = 157/346 (45%), Gaps = 34/346 (9%) Frame = -2 Query: 1110 HVRRRDFPPSSNTNITRESMKG-----NQYQNHFVR-RRHNQHSEVLLPREDEYKSWKQD 949 HVR+ D ++ + + G N Y N +R RR N SEV+ ED++ Sbjct: 61 HVRKIDVEEANEIDWFDDHYDGYEIEDNVYANDHLRWRRSNWGSEVMHWTEDQFTVRHHA 120 Query: 948 NIVFGSEEPSHNVKRMSKNDEADDRPAFGHVT---KVNKRERGRKNSEISREEDISDHFD 778 + ++ SE+ S + ++ ++++ + +G ++ + + + ++ ++ R+E I +F Sbjct: 121 DKLY-SEKASCSYRKYVRHEKFHAK--YGPLSDGMRYDNMQPEQRRLKMPRKE-IGANFV 176 Query: 777 GCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTV 598 HEQ+ + S+D L V RK + R ++A + GR + +G + Sbjct: 177 NRSVKMYRGKHEQSVRC-RNSMD-LAVRERKI-----LTRCSKARNLMHNGRPENMGAEI 229 Query: 597 NKEPNNLVDLDGSEPEETVTNEVKPDSSLSIKNQPSK-----YSENKLNLSLEIEEGQIN 433 E E E+ VK ++ I NQ +K + N L+IEEGQI Sbjct: 230 GGEWMTSGISQACESEKA--RAVKITQNI-IWNQNNKKGHDIFPVTAQNADLDIEEGQIV 286 Query: 432 NEET------KNKDASQMNA--------------TSNNNGVVEKLDDEKIKEIMVKMERR 313 +E + K AS S N VVE D ++I + M KME+R Sbjct: 287 TQEQNTTHPLQRKHASDYTEPADSLIKGVFDSRNASKGNKVVEGYDKQRILQTMAKMEQR 346 Query: 312 RERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 ERFKEPIT+ K+ +K +D VET A+ + RPARKR+W G Sbjct: 347 GERFKEPITLKKEPDKQLMPEVDPTVET--ADEKQHRPARKRQWGG 390 >ref|XP_006586892.1| PREDICTED: uncharacterized protein LOC100816396 isoform X2 [Glycine max] Length = 1094 Score = 71.6 bits (174), Expect = 1e-09 Identities = 53/144 (36%), Positives = 74/144 (51%), Gaps = 18/144 (12%) Frame = -2 Query: 552 EETVTNEVKPDSSLSIKNQPSK-----YSENKLNLSLEIEEGQINNEETKNKDASQMNAT 388 +E+ N K D+ NQ SK + S EIEEGQ EE ++AS+ A Sbjct: 952 DESNKNASKFDTPKHKSNQESKKWVQDLQDQAQKESSEIEEGQFVAEEPYMEEASEGPAV 1011 Query: 387 ---------SNNNGVVEKL----DDEKIKEIMVKMERRRERFKEPITMSKDGEKTSSLLL 247 S N E+ D ++I + + KME+RRERFK+P+TM K+ E++ L Sbjct: 1012 TDGVNKKRMSQNENSSEQCIGGYDSQRILDSLAKMEKRRERFKQPMTMKKEAEESLKLND 1071 Query: 246 DSNVETEVAEARLQRPARKRRWLG 175 DS V+ E + RPARKRRW+G Sbjct: 1072 DSIVDK--GEMKQHRPARKRRWVG 1093 
>ref|XP_006586891.1| PREDICTED: uncharacterized protein LOC100816396 isoform X1 [Glycine max] Length = 1097 Score = 71.6 bits (174), Expect = 1e-09 Identities = 53/144 (36%), Positives = 74/144 (51%), Gaps = 18/144 (12%) Frame = -2 Query: 552 EETVTNEVKPDSSLSIKNQPSK-----YSENKLNLSLEIEEGQINNEETKNKDASQMNAT 388 +E+ N K D+ NQ SK + S EIEEGQ EE ++AS+ A Sbjct: 955 DESNKNASKFDTPKHKSNQESKKWVQDLQDQAQKESSEIEEGQFVAEEPYMEEASEGPAV 1014 Query: 387 ---------SNNNGVVEKL----DDEKIKEIMVKMERRRERFKEPITMSKDGEKTSSLLL 247 S N E+ D ++I + + KME+RRERFK+P+TM K+ E++ L Sbjct: 1015 TDGVNKKRMSQNENSSEQCIGGYDSQRILDSLAKMEKRRERFKQPMTMKKEAEESLKLND 1074 Query: 246 DSNVETEVAEARLQRPARKRRWLG 175 DS V+ E + RPARKRRW+G Sbjct: 1075 DSIVDK--GEMKQHRPARKRRWVG 1096 >ref|XP_003609773.1| Pre-mRNA polyadenylation factor fip1 [Medicago truncatula] gi|355510828|gb|AES91970.1| Pre-mRNA polyadenylation factor fip1 [Medicago truncatula] Length = 1110 Score = 70.5 bits (171), Expect = 3e-09 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 9/103 (8%) Frame = -2 Query: 456 EIEEGQINNEETKNKDASQMNATSNNNGVVEKLDDEKIKEIMVKMERRRERFKEPITMSK 277 ++ EG E K K + N N+ ++ LD +KI + + KME+RRERFK+PI M+K Sbjct: 1011 DVSEGATLAENVKKKISQNGN---NSEPQIDNLDSQKILDTLAKMEKRRERFKQPIGMNK 1067 Query: 276 D---------GEKTSSLLLDSNVETEVAEARLQRPARKRRWLG 175 + E SL L++N ++ E + QRP RKRRW G Sbjct: 1068 EAVKQPISLNNEVVKSLKLNTNSAVDIGEMKQQRPVRKRRWNG 1110 >ref|XP_006857169.1| hypothetical protein AMTR_s00065p00171490 [Amborella trichopoda] gi|548861252|gb|ERN18636.1| hypothetical protein AMTR_s00065p00171490 [Amborella trichopoda] Length = 1406 Score = 70.1 bits (170), Expect = 4e-09 Identities = 107/461 (23%), Positives = 186/461 (40%), Gaps = 18/461 (3%) Frame = -2 Query: 1506 RDNDHDRRIQPDRVNCRYNQLVAHD--RREVDSSGRGKRRHRSPVSREDLCYIDTEVNER 1333 +D+ D R + DR R H +RE DSS R + R ED ++E Sbjct: 991 KDDSLDHRRREDRARSRDRPEDHHSFRQRERDSSWRQRER-------EDHHRGESEGRSA 1043 Query: 1332 KNIKHQPFPFKSSEEPYASDRGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTF 1153 + + + S+ + ++G R ++ K M + D 
+ D Sbjct: 1044 QLSREREDARGSARSDRTMEERAWVGGS--------RAIKDGSKSMGSDKDHHLKDKRRH 1095 Query: 1152 FTRE-SLRYHPPEDFHVRRRDFPPSSNTNITRESMKGNQYQNHFVRRRHNQHSEVLLPRE 976 ++ +R ED RRR S+ +RES N+ +N F R + +E Sbjct: 1096 SEQQPKIRDRIEEDTSTRRRGREESA---YSRESHPINEERN-FRREKSTTQNE------ 1145 Query: 975 DEYKSWKQDNIVFGSEEPSHNVKRMSKNDEADDRPAFGHVTKVNKRERGRKNSEISREE- 799 + ++ N +++ +++ D + + R +N +++R + Sbjct: 1146 ------SESQRMYKDRSKESNTRKIKESERVDQNDLASVASNKHDRAVSHRNEKVARRDV 1199 Query: 798 ---DISDHFDGCHETPKLNSHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYY 628 S+ F G E P+ +H + S+ K+S D V + E + Sbjct: 1200 PYQATSNAFTGRGE-PRDRNHPRYSSTSKKSSDHDSHVRQSAKPPKPSEEGVSDDESSRR 1258 Query: 627 GRSDLVGLTVNKE------PNNLVDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLN 466 GRS L T +K+ P + + SEPE+ + V L +++ EN+ Sbjct: 1259 GRSKLERWTSHKDREGNPQPKATRESESSEPEK-IEALVFDQEDLEREDEQDVKRENEKL 1317 Query: 465 LSLEIEEGQINNEETKNKDASQMNATSNNNGVVEKLD---DEKIKEIMVKMERRRERFKE 295 SL EE I E M TSN++ +V D +++ E + K+++R ERFK Sbjct: 1318 QSLGEEENSIGFE---------MKGTSNDDWLVVDADRNGEDRHLETVEKLKKRSERFKL 1368 Query: 294 PITMSKDGEKTSSLLLDSNV--ETEVAEARLQRPARKRRWL 178 P+ GEK SS ++S ++E E + +RPARKRRW+ Sbjct: 1369 PMP----GEKESSRRVESEAASQSEHVEIKQERPARKRRWV 1405 >gb|EXB71059.1| hypothetical protein L484_004194 [Morus notabilis] Length = 1179 Score = 67.8 bits (164), Expect = 2e-08 Identities = 156/737 (21%), Positives = 286/737 (38%), Gaps = 74/737 (10%) Frame = -2 Query: 2169 KDSERCGVRGTGDGRYHERWIDSTQEHSNHPKRSNYNRPDEDSSYATNAKHLYNR---HV 1999 +D C G+ ++ R +DS H +R N D D+S +A+ +Y++ Sbjct: 511 RDYSNCKSPIQGERKHQTRSVDS------HAQRK-INIYDNDTSPGLDAEDMYDKGRLSA 563 Query: 1998 NHGKHRD-MVNLKYND-SCVPYYSHSERIMAYSDGRLHDHHFGPAFWKDQYWDIPNYRYQ 1825 ++G+ ++ M ++ + D + YY S++ Y DH + NYR + Sbjct: 564 DYGRWKENMEDVNFTDREDLTYYEKSKQSHYYGSREFADH---------THTARKNYRNR 614 Query: 1824 -PGHPDGHNVSERQNLSDKKGSM--DSEALGYNRYHNQRRHPFHGDIEGVRNFSPKYSSA 1654 +G + QN +K+G + D GY RY RR P GD+ V + + S Sbjct: 615 GQDFHEGRDPYVVQNC-EKRGYLCEDDRREGY-RY---RRGPLSGDMPPVYKETEQLVSR 669 Query: 1653 
VDQSGSQFMHREGVHLRRTRQDFLSPLHDHDDRFVGGKYGRT---RPSSGGARDNDHDRR 1483 + Q R + F+ P ++H +F + T R + + + +R Sbjct: 670 YSATSEQIDFRS--KRKNNGLQFMKP-NNHSSQFPDYELDGTDIMREKNARSVSLVNWKR 726 Query: 1482 IQPDRVNCRYNQLVAHDRREVDSSGRGKRRHRSPVSREDLCYIDTEV-----NERKNIKH 1318 D ++ Y + V R+EV +S + + E + E ++ N+ H Sbjct: 727 ---DTLDESYERQVPKRRKEVKNSAWKRCNDAFSLELEGAWSRELEDEYWRNSDVHNLSH 783 Query: 1317 QPFPFKSSEEPYASDRGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRES 1138 + +S EE + G SW IE + +G +R+S Sbjct: 784 HSYR-ESDEERWTELEG-------------------SWSRK-IEDEYWGNTDVHHLSRQS 822 Query: 1137 LRYHPPEDFHVRRRDFPPSSNTNITR------------ESMKGNQYQNH---------FV 1021 H D PP + +++R E + +N+ F+ Sbjct: 823 ---HRESDGGRWTDPMPPRNGASLSRFVERYRRQLPAGEGKESGWLENYNDLHKFEDGFI 879 Query: 1020 RRRHNQHSEVLLPREDEYKSWKQDNIVFGSEEPS--HNVKRMSKNDEADDRPAFGHVTKV 847 R + H R + WK + + + EEP+ H ++++ + R +G + Sbjct: 880 YRDNKVHF-----RRERRCGWKSEVLPWMEEEPTIRHRYEKLNFKKSSFLRKNYGRHRR- 933 Query: 846 NKRERGRKNSEISREEDISD-----------HFDGCHETPKL--NSHEQTHSSHKESVDW 706 N+ G + + ++ +D + G + + K+ +EQ ++S++ Sbjct: 934 NQSTHGSLHDAMHIDDMQADKHGYRMIKDGSYSRGIYRSQKMFRAKNEQAFLRCRDSLNL 993 Query: 705 LVVVGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVDLDGSEPEETVTNEVK 526 V G+ S RR T+ + S L G + D++ S E V + + Sbjct: 994 FVGGGKL-----SRRRPTDRNLSCH---SRLEGTYIE-------DVNESSQYEAVQSNL- 1037 Query: 525 PDSSLSIKNQP--SKYSENKLNLSLEIEEGQINNEE-------------------TKNKD 409 P L++ N+ ++ N +IEEGQI EE + K Sbjct: 1038 PKVGLNLSNEDFHDQFPLAARNEDFDIEEGQIVTEEFYRDPLERPHDSVSAARTESVKKR 1097 Query: 408 ASQMNATSNNNGVVEKLDDEKIKEIMVKMERRRERFKEPITMSKDGEKTSSL-LLDSNVE 232 + + S+ + + DD+ I E + KMERRRERFKEPI + ++ +K + ++ + Sbjct: 1098 MLEYDLASHGSKTGGQCDDQWILETLAKMERRRERFKEPIALKREQDKCAKPDIVPAPTI 1157 Query: 231 TEVAEARLQRPARKRRW 181 E AE + RPARKR+W Sbjct: 1158 VETAETKQHRPARKRQW 1174 >ref|XP_007147362.1| hypothetical protein PHAVU_006G117800g [Phaseolus vulgaris] gi|561020585|gb|ESW19356.1| hypothetical protein PHAVU_006G117800g [Phaseolus vulgaris] Length = 1101 Score = 67.4 bits (163), Expect = 3e-08 
Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 21/160 (13%) Frame = -2 Query: 594 KEPNNLVDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEETKN 415 K + V D S + + K + +L K + + +IEEGQI ++ K+ Sbjct: 941 KRRRDSVGFDESNKRASKFDASKYEGNLGCKKWIKNLQDQGQKENSDIEEGQIVTQKWKS 1000 Query: 414 ---------KDASQ--------MNATSNNNGVVEKL----DDEKIKEIMVKMERRRERFK 298 +DAS+ S N G ++ D ++I + + KME+RRERFK Sbjct: 1001 SIEEASVARRDASKGPVVTDSVKKRMSPNEGSSDQCIGGYDSQRILDSLAKMEKRRERFK 1060 Query: 297 EPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWL 178 +PITM K+ E++ L DS++ + +E + RP RKRRW+ Sbjct: 1061 QPITMKKEAEESLKLNSDSSI-VDTSEMKQHRPVRKRRWV 1099 >ref|XP_006651314.1| PREDICTED: uncharacterized protein LOC102703384 [Oryza brachyantha] Length = 1066 Score = 67.0 bits (162), Expect = 4e-08 Identities = 75/292 (25%), Positives = 121/292 (41%), Gaps = 7/292 (2%) Frame = -2 Query: 1035 QNHFVRRRHNQHSEVLLPREDEYKSWKQ-DNIVFGSEEPSHNVKRMSKNDEADDRPAFGH 859 + +V HN E+ + + DNI ++ H + + +D D H Sbjct: 813 KKRYVAEMHNYTKEIDVEAMCSLNDMRNNDNIRNIYDKKRHEIMNLQPSDA--DNLLLIH 870 Query: 858 VTKVNKRERGRKNSEISREEDISDHFDGCHETPKLNSHEQTHSSHKESVDWLVV------ 697 KR+ R+ EI RE + +GC L + HSS +SV V Sbjct: 871 ----RKRKFNRQGIEIRRE--VESDSEGC-----LPADSDLHSSKLKSVHQKVRKPRSYR 919 Query: 696 VGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVDLDGSEPEETVTNEVKPDS 517 + R L+ SI++ + +++N+E + E E + + + Sbjct: 920 ISRNQILEKSIQQKQQH-------------VSINQECEEI------EEGELIEQDHHDTA 960 Query: 516 SLSIKNQPSKYSENKLNLSLEIEEGQINNEETKNKDASQMNATSNNNGVVEKLDDEKIKE 337 S S NQ SK + + +G + N +K+ D S NG + DD+ I E Sbjct: 961 SRSKFNQRSKVVLRSVIEASSAGQGGMVNATSKDADCS--------NGATRECDDKHILE 1012 Query: 336 IMVKMERRRERFKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRW 181 +M KM++RRERFKEPI K+ ++ LL + V + + RPARKR W Sbjct: 1013 VMKKMQKRRERFKEPIAPQKEEDEHGKELLAATY--SVDDMKNPRPARKRLW 1062 >gb|EXB82160.1| hypothetical protein L484_005444 [Morus notabilis] Length = 1337 Score = 65.9 bits (159), Expect = 8e-08 Identities = 136/619 (21%), Positives = 232/619 (37%), Gaps = 31/619 (5%) Frame = -2 Query: 1941 
YYSHSERIMAYSDGRLHDHHFGPAFWKDQYWDIPNYRYQPGHPDGHN------VSERQNL 1780 YY + E + +H H F + + D P+ +Q D HN + ++ Sbjct: 792 YYPYKE----FDPSSVHLHMRSDGFERRKERDNPDGAWQRRDDDSHNRRIRTEETRKRER 847 Query: 1779 SDKKGSM-----------DSEALGYNRYH-NQRRHPFHGDIEGVRNFSPKYSSAVDQSGS 1636 D+ GS D + L ++R + H H D + V P+Y D Sbjct: 848 GDEVGSRHRSKVRESDRSDKDELIHSRKQMDNGSHRAHYDKDVV----PRYRGRDDNLKG 903 Query: 1635 QFMHREGVHLRRTRQDFLSPLHDHDDRFVGGKYGRTRPSSGGARDNDHDRRIQPDRVNCR 1456 ++ H + H +R + D+ + + G R+N + R+ + D V Sbjct: 904 RYEHMDDYHSKRKK----------DEEHLRRDHANKEEMMHGQRENTNRRKRERDEVL-- 951 Query: 1455 YNQLVAHDRREVDSSGR---GKRRHRSPVSREDLCYIDTEVNERKNIKHQPFPFKSSEEP 1285 D+R+ D R G H S V +D ++ E +ER+ + + K E Sbjct: 952 -------DQRKRDGQQRLRDGLDDHHS-VRHKDESWLQRERSERQREREEWQRLKQPHED 1003 Query: 1284 YASDRGVFLGAPGPKFGVARRNMRCSWKEMCIESDQYGTDVTTFFTRESLRYHPPEDFHV 1105 R G + G R + W D+ + +E++R+ P Sbjct: 1004 NKPKRERDEGRSVTRGG--RSSEDKGWVGHPKIMDESKGPDKEYQYKETIRHGEPSKRRD 1061 Query: 1104 RRRDFPPSSNTNITRESM--KGNQYQNHFVRRRHNQHSEVLLPREDEYKSWKQDNIVFGS 931 R D S+ + RE +GNQ N R R + S R D + D++ Sbjct: 1062 RTED---ESSRHGGREDAYARGNQVSNGERRSRLERPSV----RNDRSVN-ASDDLKVQD 1113 Query: 930 EEPSHNVKRMSKNDEADDRPAFGHVTKVNKRERGRKNSEISREEDISDHFDGCHETPKLN 751 ++ N KR ++ E D +K N+ + G +++E + I F G + P + Sbjct: 1114 KKHKENAKR-NRESEGGDYITLAS-SKRNQEDHGGQSNETVLKGSIEKGF-GERDNPAQH 1170 Query: 750 SHEQTHSSHKESVDWLVVVGRKCTLQSSIRRTTEAGEDTYYGRSDLVGLTVNKEPNNLVD 571 + S D Q +RR GRS L T +KE + + Sbjct: 1171 QSSRKQKEEASSDDE----------QQDLRR----------GRSKLERWTSHKERDFSIK 1210 Query: 570 LDGSEPEETVTNEVKPDSSLS---IKNQPSKYSENKLNLSLEIEEGQINNEETKNKDASQ 400 S ++ + SL I ++PSK +E I + + KD + Sbjct: 1211 SKSSSTQKCKEMDGNNSGSLEGRKISDEPSK----------PVETVDIQHSLAEEKDCTD 1260 Query: 399 MNATSNNNGVVEKLDDEKIKEIMVKMERRRERFKEPITMSKDG---EKTSSLLLDSNVET 229 + A +G LDD + + + K+++R ERFK P+ KD +K S L S Sbjct: 1261 LEA---KDGDTRLLDDRHL-DTVEKLKKRSERFKLPMPSDKDALAVKKLESEALPSAKSG 1316 Query: 228 EVAEARL--QRPARKRRWL 178 +A++ + +RPARKRRW+ Sbjct: 1317 SLADSEIKQERPARKRRWI 1335 
>ref|XP_006378540.1| hypothetical protein POPTR_0010s15520g [Populus trichocarpa] gi|550329875|gb|ERP56337.1| hypothetical protein POPTR_0010s15520g [Populus trichocarpa] Length = 194 Score = 65.5 bits (158), Expect = 1e-07 Identities = 53/164 (32%), Positives = 78/164 (47%), Gaps = 23/164 (14%) Frame = -2 Query: 594 KEPNNLVDLDGSEPEETVTNEVKPDSSLSIKNQPSKYSENKLNLSLEIEEGQINNEET-- 421 KEP + D +E + + +V + K + N L IE+GQI EE+ Sbjct: 38 KEP--MCSKDFNESQTGIQTDVLETGGDDKEKWIGKSQVTEHNEKLNIEDGQIMAEESSM 95 Query: 420 -------------------KNKDASQMNATSNN--NGVVEKLDDEKIKEIMVKMERRRER 304 KN++ NA+S N +G V D ++I + + KME+RRER Sbjct: 96 ESKLAKKCAFKSVVPTCNAKNRNFLCENASSRNKNDGAV---DSKRILDTIAKMEKRRER 152 Query: 303 FKEPITMSKDGEKTSSLLLDSNVETEVAEARLQRPARKRRWLGT 172 FK+PI K+ +KTS ++ ++T A RPARKRRW GT Sbjct: 153 FKDPIAQKKELDKTSEPQVEVIIDT--VPANQDRPARKRRWGGT 194