BLASTX nr result
ID: Rheum21_contig00010757
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00010757
         (1201 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EMJ05877.1| hypothetical protein PRUPE_ppa000310mg [Prunus pe...   329   1e-87
gb|EXB38890.1| hypothetical protein L484_027325 [Morus notabilis]     323   7e-86
ref|XP_006475505.1| PREDICTED: uncharacterized protein LOC102623...   318   4e-84
ref|XP_006475502.1| PREDICTED: uncharacterized protein LOC102623...   312   2e-82
ref|XP_004287588.1| PREDICTED: uncharacterized protein LOC101306...   311   4e-82
gb|EOY30368.1| Serine/arginine repetitive matrix protein 2 isofo...   307   5e-81
ref|XP_006451534.1| hypothetical protein CICLE_v10007265mg [Citr...   306   1e-80
gb|EOY30366.1| Serine/arginine repetitive matrix protein 2 isofo...   306   1e-80
gb|EOY30365.1| Serine/arginine repetitive matrix protein 2 isofo...   306   1e-80
ref|XP_006381653.1| hypothetical protein POPTR_0006s14860g [Popu...   304   5e-80
ref|XP_004144119.1| PREDICTED: uncharacterized protein LOC101208...   289   2e-75
ref|XP_002514096.1| hypothetical protein RCOM_1046470 [Ricinus c...   287   5e-75
ref|XP_002279178.2| PREDICTED: uncharacterized protein LOC100257...   286   1e-74
gb|ESW26618.1| hypothetical protein PHAVU_003G134300g [Phaseolus...   285   3e-74
emb|CBI27872.3| unnamed protein product [Vitis vinifera]              284   4e-74
ref|XP_006597829.1| PREDICTED: uncharacterized protein LOC100812...   283   8e-74
ref|XP_006587024.1| PREDICTED: uncharacterized protein LOC100803...   283   1e-73
ref|XP_006600451.1| PREDICTED: uncharacterized protein LOC100805...   281   5e-73
ref|XP_006597826.1| PREDICTED: uncharacterized protein LOC100812...
278 2e-72 ref|XP_006593923.1| PREDICTED: uncharacterized protein LOC100775... 277 5e-72 >gb|EMJ05877.1| hypothetical protein PRUPE_ppa000310mg [Prunus persica] Length = 1297 Score = 329 bits (844), Expect = 1e-87 Identities = 197/414 (47%), Positives = 255/414 (61%), Gaps = 14/414 (3%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R+D+NS SP+SS K+ S R P+ G++PK+SP + T +DW+IS Sbjct: 380 VNHRAVNKASVRDDFNSASPTSSTKINASVRAPRSGSGVVPKLSPVVHRATVANDWDISH 439 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CT+KP A G N+KR V W GQRPQKI R ARR+NF+PIVS+NEET ++ Sbjct: 440 CTSKPPAAVGANNRKRMASARSSSPPVAQWAGQRPQKISRTARRSNFVPIVSSNEETPTM 499 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXX 680 D SD +D G AK P S +QVKLK+E SESEESGVAEI Sbjct: 500 DSASDITGSDIGMGFAKRLPGSSPQQVKLKAEPLSSAALSESEESGVAEI--KSRDKGKK 557 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K KL++ E+ GD ++RQGR R FTS+RSLMP TV+K Sbjct: 558 TDEIDEKAGQNVQKVSPLVLPSRKNKLVTGEDLGDGVRRQGRTGRGFTSTRSLMPMTVEK 617 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 + N AKQ R+SR+ DK+ESK GRPP R+LSDRKAYT + +A++AAADF DDG Sbjct: 618 IGNVGTAKQLRSSRLGFDKSESKAGRPPTRRLSDRKAYTRQKHTAINAAADFLVGSDDGH 677 Query: 331 QELLAAAKAVVDSSQ--LSLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQV-VPTSA 161 +ELLAAA AVV+S++ S FWR ME F F+S+AD +LKQQ + TQ VP+S Sbjct: 678 EELLAAANAVVNSARSFSSSFWRQMEPFFGFLSDADTAYLKQQGNIESNVMTQAQVPSSI 737 Query: 160 HANGT-GNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 + T N L + G E L+ A PLCQRL+AA+I + Sbjct: 738 DCSATVTNGLRLIGCEPKSGE----FRPEHLVPGAGDRVAIPLCQRLLAAVILE 787 >gb|EXB38890.1| hypothetical protein L484_027325 [Morus notabilis] Length = 1303 Score = 323 bits (829), Expect = 7e-86 Identities = 195/414 (47%), Positives = 252/414 (60%), Gaps = 14/414 (3%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLRT+NK+ R+D NS SP S+AK+ S R P+ 
G LPK SP +PT +DWEIS Sbjct: 386 VNLRTVNKANGRDDLNSASPISNAKVNASVRAPRSGTGGLPKSSPVVHRPTVSNDWEISH 445 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP + G N+KR VTHW GQRPQKI R ARR+NF+PIVS+N+ET ++ Sbjct: 446 CTNKPPSGIGANNRKRMASTRSSSPPVTHWAGQRPQKISRTARRSNFVPIVSSNDETPAM 505 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXX 680 D SD ND G K S +QVKLK + SESEESG E Sbjct: 506 DSPSDVTGNDIGSGFTKRMSGGSPQQVKLKGDPLSAAALSESEESGAVE--TKSRDKVKK 563 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 +AD +AG +V+K S VL + K KL+S E+ GD ++RQGR R F+S+RSLMP TV+K Sbjct: 564 SDEADEKAGQSVQKVSSLVLSSRKNKLVSGEDLGDGVRRQGRTGRGFSSTRSLMPMTVEK 623 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 + AKQ R++R+ DK ESK GRPP RKLSDRKAYT + +A++AAADF +DG Sbjct: 624 IGVVGTAKQLRSARLGFDKTESKAGRPPTRKLSDRKAYTRQKHTAINAAADFLVGSEDGN 683 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +ELLAAA AV++ ++ S FW+ ME F FIS+ADI +LKQQ+ + F T + T Sbjct: 684 EELLAAANAVINPVRVCSSPFWKQMEPFFGFISDADISYLKQQENLEF---TALTSTQVP 740 Query: 157 ANGTGNSL--NELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 +NG G + N E G E+L+ H+ LCQRLIAALI++ Sbjct: 741 SNGDGGNTVSNGFGSTECESRNG-EFLLEQLVQGTGDHNEISLCQRLIAALISE 793 >ref|XP_006475505.1| PREDICTED: uncharacterized protein LOC102623432 isoform X4 [Citrus sinensis] Length = 1287 Score = 318 bits (814), Expect = 4e-84 Identities = 188/420 (44%), Positives = 252/420 (60%), Gaps = 20/420 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK+ R+++NS SP+S+ KM S R P+ G+ PK+SP + + +DWE+S Sbjct: 379 VNLRAVNKTNVRDEFNSASPTSNTKMTASVRGPRSGSGVAPKLSPVVHRAAAPNDWEVSH 438 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C NKP+A GP N+KRT V HW GQRPQKI R ARRTN +PIVSNN+ET ++ Sbjct: 439 CMNKPTASVGPNNRKRTMSARSSSPPVAHWAGQRPQKISRTARRTNIVPIVSNNDETAAL 498 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLK-----SEQFSESEESGVAEIXXXXXXXXXX 680 D 
SD A ++ G K + S +QVKLK S SESEESGV I Sbjct: 499 DSSSDVAGSEIGGGFGKRLSSNSPQQVKLKGDSLSSAALSESEESGVPSI--KSKDKGRK 556 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKK-LISEERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K K + ++ GD ++RQGR RSF S+R+L+P TV+K Sbjct: 557 SDEIDEKAGQNVQKVSTLVLPSRKNKPVYGDDLGDGVRRQGRTGRSFASARALLPMTVEK 616 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N AKQ R++R+ DK ESK GRPP RKLSDRKAY + + + AAADF DDG Sbjct: 617 LGNAGTAKQLRSARLGFDKIESKAGRPPTRKLSDRKAYKRQKPTTISAAADFIVGSDDGH 676 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVR--------FVSDT 182 +ELLAAA AV++S+ S FWR ME +F FIS+ DI +LK Q+ ++ F+SDT Sbjct: 677 EELLAAANAVINSAHTLSSSFWRQMEPLFGFISDGDIAYLKLQENLQSIVPSTTPFLSDT 736 Query: 181 QVVPTSAHANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 ++ + G ++ V G E+L+ ++ PL QRLIAALIT+ Sbjct: 737 DACFSTPNGYGLIKQERDVGPVTGAGR------VEQLVPSPRGYNAVPLYQRLIAALITE 790 >ref|XP_006475502.1| PREDICTED: uncharacterized protein LOC102623432 isoform X1 [Citrus sinensis] gi|568843196|ref|XP_006475503.1| PREDICTED: uncharacterized protein LOC102623432 isoform X2 [Citrus sinensis] gi|568843198|ref|XP_006475504.1| PREDICTED: uncharacterized protein LOC102623432 isoform X3 [Citrus sinensis] Length = 1290 Score = 312 bits (800), Expect = 2e-82 Identities = 188/423 (44%), Positives = 252/423 (59%), Gaps = 23/423 (5%) Frame = -1 Query: 1201 VNLRTINK---SGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWE 1031 VNLR +NK + R+++NS SP+S+ KM S R P+ G+ PK+SP + + +DWE Sbjct: 379 VNLRAVNKYAMTNVRDEFNSASPTSNTKMTASVRGPRSGSGVAPKLSPVVHRAAAPNDWE 438 Query: 1030 ISQCTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEET 851 +S C NKP+A GP N+KRT V HW GQRPQKI R ARRTN +PIVSNN+ET Sbjct: 439 VSHCMNKPTASVGPNNRKRTMSARSSSPPVAHWAGQRPQKISRTARRTNIVPIVSNNDET 498 Query: 850 MSVDGMSD-ADNDNGFMCAKDFPNISAKQVKLK-----SEQFSESEESGVAEIXXXXXXX 689 ++D SD A ++ G K + S +QVKLK S SESEESGV I Sbjct: 499 AALDSSSDVAGSEIGGGFGKRLSSNSPQQVKLKGDSLSSAALSESEESGVPSI--KSKDK 556 Query: 688 
XXXXXKADGRAGHNVKKASIAVLPAHKKK-LISEERGDNLKRQGRASRSFTSSRSLMPTT 512 + D +AG NV+K S VLP+ K K + ++ GD ++RQGR RSF S+R+L+P T Sbjct: 557 GRKSDEIDEKAGQNVQKVSTLVLPSRKNKPVYGDDLGDGVRRQGRTGRSFASARALLPMT 616 Query: 511 VDKLCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GD 341 V+KL N AKQ R++R+ DK ESK GRPP RKLSDRKAY + + + AAADF D Sbjct: 617 VEKLGNAGTAKQLRSARLGFDKIESKAGRPPTRKLSDRKAYKRQKPTTISAAADFIVGSD 676 Query: 340 DGQQELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVR--------FV 191 DG +ELLAAA AV++S+ S FWR ME +F FIS+ DI +LK Q+ ++ F+ Sbjct: 677 DGHEELLAAANAVINSAHTLSSSFWRQMEPLFGFISDGDIAYLKLQENLQSIVPSTTPFL 736 Query: 190 SDTQVVPTSAHANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAAL 11 SDT ++ + G ++ V G E+L+ ++ PL QRLIAAL Sbjct: 737 SDTDACFSTPNGYGLIKQERDVGPVTGAGR------VEQLVPSPRGYNAVPLYQRLIAAL 790 Query: 10 ITD 2 IT+ Sbjct: 791 ITE 793 >ref|XP_004287588.1| PREDICTED: uncharacterized protein LOC101306665 [Fragaria vesca subsp. vesca] Length = 1290 Score = 311 bits (796), Expect = 4e-82 Identities = 189/414 (45%), Positives = 245/414 (59%), Gaps = 15/414 (3%) Frame = -1 Query: 1198 NLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQC 1019 N R +NKS R+D+NS SP+SS KM S R P+ + PK+SP + T +DWEISQC Sbjct: 382 NQRVVNKSNARDDFNSASPTSSTKMNASVRAPRSGSAVTPKLSPVVHRATVPNDWEISQC 441 Query: 1018 TNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSVD 839 TNKP A GP N+KR V W GQRPQK+ R ARR+NF PIVS+NEET +D Sbjct: 442 TNKPPAVVGPNNRKRMTSARSSSPPVAQWAGQRPQKMSRTARRSNFNPIVSSNEETPVID 501 Query: 838 GMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXXX 677 SD +D G A+ P S +QVKLK E SESEESG AE+ Sbjct: 502 SASDMTGSDIGQGFARRLPGSSPQQVKLKGEPLSSAALSESEESGAAEV--KSRDKGKKS 559 Query: 676 XKADGRAGHN--VKKASIAVLPAHKKK-LISEERGDNLKRQGRASRSFTSSRSLMPTTVD 506 + D + G N ++K VLP+ K+K E+ GD ++RQGR R F S+RS++P TV+ Sbjct: 560 DEIDEKPGQNIQIQKVPSLVLPSRKQKSAAGEDLGDGVRRQGRTGRGFASTRSIVPMTVE 619 Query: 505 KLCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDG 335 K+ N AKQ R+SR+ +DK+ESK GRPP R+LSDRKAYT + +A++ AADF DDG 
Sbjct: 620 KMGNVGTAKQLRSSRLGVDKSESKAGRPPTRRLSDRKAYTRQKHTAINPAADFLVGSDDG 679 Query: 334 QQELLAAAKAVVDSSQ--LSLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSA 161 +EL+ AAKA VDS++ S FW ME FRF+S+ADI +LK E + + VP S Sbjct: 680 HEELMTAAKAAVDSARSCSSSFWMKMEPFFRFVSDADINYLKGNIESSVTTPAE-VPCSL 738 Query: 160 HANGTGN-SLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 N T + L F G +E+ + HS PLCQRLIAALI++ Sbjct: 739 DGNLTVHYGLGSNEFEPRSGE----FRSEQSVPGTGDHSEIPLCQRLIAALISE 788 >gb|EOY30368.1| Serine/arginine repetitive matrix protein 2 isoform 4 [Theobroma cacao] Length = 1144 Score = 307 bits (787), Expect = 5e-81 Identities = 185/419 (44%), Positives = 249/419 (59%), Gaps = 19/419 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK R+++NS SP+SS KM S R P+ G+ PK+SP + T+ +DWE+S Sbjct: 245 VNLRAVNKMSVRDEFNSASPTSSTKMNASIRGPRSGSGVAPKLSPVVHRATASNDWELSH 304 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP G N+KRT V HW GQRPQK R ARRTN +PIVS+N+ET S+ Sbjct: 305 CTNKPPTAGGANNRKRTTSARSSSPPVAHWAGQRPQKSSRTARRTNLVPIVSSNDETPSL 364 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXX 680 D +SD A N+ G A+ + S +QVKLK + SESEES AEI Sbjct: 365 DTVSDMAGNEIGSGFARRLSSSSPQQVKLKGDALSTAALSESEESAAAEI--KSKEKVKK 422 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K KL++ E+ GD ++RQGR R TS+RS+MP TV+K Sbjct: 423 SDEMDEKAGQNVQKVSTLVLPSRKTKLMTGEDIGDGVRRQGRTGRGVTSTRSVMPMTVEK 482 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 N AKQ R++R+ LDKAESK GRPP RKL+DRKAY + +A++AAAD +DG Sbjct: 483 FGNVGTAKQLRSARLGLDKAESKAGRPPTRKLTDRKAYARQKHAAINAAADLLVSSEDGH 542 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +EL+AA A+V + + FWR ME FIS+ DI +LKQQ ++ P + Sbjct: 543 EELVAAVNALVSFAHAFPNSFWRQMEPFLGFISDVDIAYLKQQQGNCELTKLASTPVPSI 602 Query: 157 ANGTGNSLNELAFVENMGHTGL--MHATEELISEAAV-----HSTFPLCQRLIAALITD 2 +G N +E G+ + +T EL+S+ V ++ PLCQR IAALI + 
Sbjct: 603 IDGCSIISNGCELLEQGRDAGIDAVTSTVELLSQQLVLETRDNNVIPLCQRFIAALIPE 661 >ref|XP_006451534.1| hypothetical protein CICLE_v10007265mg [Citrus clementina] gi|557554760|gb|ESR64774.1| hypothetical protein CICLE_v10007265mg [Citrus clementina] Length = 1255 Score = 306 bits (784), Expect = 1e-80 Identities = 185/412 (44%), Positives = 242/412 (58%), Gaps = 12/412 (2%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK+ R+++NS SP+S+ KM S R P+ G+ PK+SP + + +DWE+S Sbjct: 379 VNLRAVNKTNVRDEFNSASPTSNTKMTASVRGPRSGSGVAPKLSPVVHRAAAPNDWEVSH 438 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C NKP+A GP N+KRT V HW GQRPQKI R ARRTN +PIVSNN+ET ++ Sbjct: 439 CMNKPTASVGPNNRKRTMSARSSSPPVAHWAGQRPQKISRTARRTNIVPIVSNNDETAAL 498 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLK-----SEQFSESEESGVAEIXXXXXXXXXX 680 D SD A ++ G K + S +QVKLK S SESEESGV I Sbjct: 499 DSSSDVAGSEIGGGFGKRLSSNSPQQVKLKGDSLSSAALSESEESGVPSI--KSKDKGRK 556 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKK-LISEERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K K + ++ GD ++RQGR RSF S+R+L+P TV+K Sbjct: 557 SDEIDEKAGQNVQKVSTLVLPSRKNKPVYGDDLGDGVRRQGRTGRSFASARALLPMTVEK 616 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N AKQ R++R+ DK ESK GRPP RKLSDRKAY + + + AAADF DDG Sbjct: 617 LGNAGTAKQLRSARLGFDKIESKAGRPPTRKLSDRKAYKRQKPTTISAAADFIVGSDDGH 676 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +ELLAAA AV++S+ S FWR ME +F FIS+ DI +LK Q+ V P + Sbjct: 677 EELLAAANAVINSAHTLSSSFWRQMEPLFGFISDGDIAYLKLQER-------DVGPVT-- 727 Query: 157 ANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 G G E+L+ ++ PL QRLIAALIT+ Sbjct: 728 --GAGR-------------------VEQLVPSPRGYNAVPLYQRLIAALITE 758 >gb|EOY30366.1| Serine/arginine repetitive matrix protein 2 isoform 2 [Theobroma cacao] gi|508783111|gb|EOY30367.1| Serine/arginine repetitive matrix protein 2 isoform 2 [Theobroma cacao] Length = 1282 Score = 306 bits (784), Expect = 1e-80 
Identities = 185/419 (44%), Positives = 249/419 (59%), Gaps = 19/419 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK R+++NS SP+SS KM S R P+ G+ PK+SP + T+ +DWE+S Sbjct: 384 VNLRAVNKMSVRDEFNSASPTSSTKMNASIRGPRSGSGVAPKLSPVVHRATASNDWELSH 443 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP G N+KRT V HW GQRPQK R ARRTN +PIVS+N+ET S+ Sbjct: 444 CTNKPPTAGGANNRKRTTSARSSSPPVAHWAGQRPQKSSRTARRTNLVPIVSSNDETPSL 503 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXX 680 D +SD A N+ G A+ + S +QVKLK + SESEES AEI Sbjct: 504 DTVSDMAGNEIGSGFARRLSSSSPQQVKLKGDALSTAALSESEESAAAEI--KSKEKVKK 561 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K KL++ E+ GD ++RQGR R TS+RS+MP TV+K Sbjct: 562 SDEMDEKAGQNVQKVSTLVLPSRKTKLMTGEDIGDGVRRQGRTGRGVTSTRSVMPMTVEK 621 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 N AKQ R++R+ LDKAESK GRPP RKL+DRKAY + +A++AAAD +DG Sbjct: 622 FGNVGTAKQLRSARLGLDKAESKAGRPPTRKLTDRKAYARQKHAAINAAADLLVSSEDGH 681 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +EL+AA A+V + + FWR ME FIS+ DI +LKQQ ++ P + Sbjct: 682 EELVAAVNALVSFAHAFPNSFWRQMEPFLGFISDVDIAYLKQQGNCE-LTKLASTPVPSI 740 Query: 157 ANGTGNSLNELAFVENMGHTGL--MHATEELISEAAV-----HSTFPLCQRLIAALITD 2 +G N +E G+ + +T EL+S+ V ++ PLCQR IAALI + Sbjct: 741 IDGCSIISNGCELLEQGRDAGIDAVTSTVELLSQQLVLETRDNNVIPLCQRFIAALIPE 799 >gb|EOY30365.1| Serine/arginine repetitive matrix protein 2 isoform 1 [Theobroma cacao] Length = 1327 Score = 306 bits (784), Expect = 1e-80 Identities = 185/419 (44%), Positives = 249/419 (59%), Gaps = 19/419 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK R+++NS SP+SS KM S R P+ G+ PK+SP + T+ +DWE+S Sbjct: 384 VNLRAVNKMSVRDEFNSASPTSSTKMNASIRGPRSGSGVAPKLSPVVHRATASNDWELSH 443 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP G 
N+KRT V HW GQRPQK R ARRTN +PIVS+N+ET S+ Sbjct: 444 CTNKPPTAGGANNRKRTTSARSSSPPVAHWAGQRPQKSSRTARRTNLVPIVSSNDETPSL 503 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSE-----QFSESEESGVAEIXXXXXXXXXX 680 D +SD A N+ G A+ + S +QVKLK + SESEES AEI Sbjct: 504 DTVSDMAGNEIGSGFARRLSSSSPQQVKLKGDALSTAALSESEESAAAEI--KSKEKVKK 561 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 + D +AG NV+K S VLP+ K KL++ E+ GD ++RQGR R TS+RS+MP TV+K Sbjct: 562 SDEMDEKAGQNVQKVSTLVLPSRKTKLMTGEDIGDGVRRQGRTGRGVTSTRSVMPMTVEK 621 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 N AKQ R++R+ LDKAESK GRPP RKL+DRKAY + +A++AAAD +DG Sbjct: 622 FGNVGTAKQLRSARLGLDKAESKAGRPPTRKLTDRKAYARQKHAAINAAADLLVSSEDGH 681 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +EL+AA A+V + + FWR ME FIS+ DI +LKQQ ++ P + Sbjct: 682 EELVAAVNALVSFAHAFPNSFWRQMEPFLGFISDVDIAYLKQQGNCE-LTKLASTPVPSI 740 Query: 157 ANGTGNSLNELAFVENMGHTGL--MHATEELISEAAV-----HSTFPLCQRLIAALITD 2 +G N +E G+ + +T EL+S+ V ++ PLCQR IAALI + Sbjct: 741 IDGCSIISNGCELLEQGRDAGIDAVTSTVELLSQQLVLETRDNNVIPLCQRFIAALIPE 799 >ref|XP_006381653.1| hypothetical protein POPTR_0006s14860g [Populus trichocarpa] gi|550336366|gb|ERP59450.1| hypothetical protein POPTR_0006s14860g [Populus trichocarpa] Length = 1117 Score = 304 bits (778), Expect = 5e-80 Identities = 190/418 (45%), Positives = 246/418 (58%), Gaps = 18/418 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN+R + K+ R +D+NS SP+SSAKM PS R P+ GI+PK+SP + T+ +DWE+S Sbjct: 190 VNIRAVTKAVR-DDFNSASPTSSAKMNPSIRAPRSGSGIMPKLSPVVHRATAPNDWELSH 248 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP A G N+KRT V HW GQRPQKI R ARRTN +PIV NN+E+ ++ Sbjct: 249 CTNKPPAV-GANNRKRTASARSSSPPVAHWAGQRPQKIYRTARRTNLVPIV-NNDESPTL 306 Query: 841 DGMSDAD-NDNGFMCAKDFPNISAKQVKLKSEQFS-----ESEESGVAEIXXXXXXXXXX 680 D +SD N+ G A+ S +QVKLK + S ESEESG E+ Sbjct: 307 
DSVSDVSGNEIGVGFARRLSGNSPQQVKLKGDTLSSAVLSESEESGATEVKSKDKSRKSD 366 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG NV+K S LP+ K KL+S E+ GD ++RQGR R FTS+RSL+PT V+K Sbjct: 367 EI--DEKAGQNVQKISPLGLPSRKNKLVSGEDIGDGVRRQGRTGRGFTSTRSLVPTAVEK 424 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N AKQ R++R+ DK ESK GRPP RKLSDRKAYT + + ++A ADF +DG Sbjct: 425 LGNVGTAKQLRSARLGFDKNESKTGRPPTRKLSDRKAYTRQKNTTVNATADFLVGSEDGH 484 Query: 331 QELLAAAKAVVDS--SQLSLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +ELLAAA AV++ + LS FWR ME F FIS+ DI LKQQ + F + + P + Sbjct: 485 EELLAAASAVINPGLALLSSFWRQMETFFGFISDVDIAHLKQQGSIVFTAPS-ATPVHSD 543 Query: 157 ANGTGNSLNELAFVENMGHTGLMHATEELISE------AAVHSTFPLCQRLIAALITD 2 AN N E+ L A E SE V PL Q L+AAL ++ Sbjct: 544 ANNYSTVPNGYGLFEHDREVELELAAETRTSELLPDQLMPVDREIPLSQLLLAALTSE 601 >ref|XP_004144119.1| PREDICTED: uncharacterized protein LOC101208478 [Cucumis sativus] Length = 1288 Score = 289 bits (739), Expect = 2e-75 Identities = 173/411 (42%), Positives = 236/411 (57%), Gaps = 11/411 (2%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NKS R+D+ S SP+S+AK+ PS R P+ + GI PK SP + + +DW++S Sbjct: 377 VNLRGVNKSNVRDDFVSTSPTSNAKVNPSVRAPRSSSGIAPKFSPVVHRAIASNDWDMSN 436 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNKP +P G +N+KR V+HW QRPQKI R+ARRTN PIVS+N++ + Sbjct: 437 CTNKPISPVGVSNRKRMISMRSSSPPVSHWASQRPQKISRSARRTNLGPIVSSNDDN-PL 495 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D SD ND G + S +QVK+K E SESEESG AEI Sbjct: 496 DSTSDVVGNDTGLGFGRRMSGSSPQQVKIKGEPLSSAAQSESEESGAAEI--KSREKTRK 553 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLISEERGDNLKRQGRASRSFTSSRSLMPTTVDKL 500 D ++ V+K VLP K K + E+ GD ++RQGR R+F S+RSLMP TV+K+ Sbjct: 554 SEDLDDKSEQGVQKVPALVLPTRKNKSVDEDIGDGVRRQGRTGRAFPSTRSLMPMTVEKI 613 Query: 499 CNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQQ 329 AKQ R++R+ DK ESK GRPP RK +DRKAY + SA++ 
DF D G + Sbjct: 614 DAVGTAKQLRSARLGFDKVESKAGRPPTRKFTDRKAYKRQKHSAINVGTDFLVGSDHGHE 673 Query: 328 ELLAAAKAVVDSSQ--LSLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAHA 155 ELLAAA AV + + S FWR ME FRF+SEADI L++Q ++ + + + A Sbjct: 674 ELLAAANAVTNPGRTFFSPFWRQMEQFFRFVSEADITHLRKQGDLEGAASGPKIVSDKDA 733 Query: 154 NGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALITD 2 S + +EN + E +I E+ H+ PL QRL+A+LI + Sbjct: 734 YNI--SHDNFEHIENEASEVPL---EHIIQESKDHTVIPLYQRLLASLIPE 779 >ref|XP_002514096.1| hypothetical protein RCOM_1046470 [Ricinus communis] gi|223546552|gb|EEF48050.1| hypothetical protein RCOM_1046470 [Ricinus communis] Length = 1291 Score = 287 bits (735), Expect = 5e-75 Identities = 185/414 (44%), Positives = 241/414 (58%), Gaps = 14/414 (3%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR ++K+ R+D+NS SP+SS KM S R P+ GI PK+SP + T+ ++WE+S Sbjct: 380 VNLRAVHKANVRDDFNSASPTSSTKMNTSTRGPRSGSGIAPKLSPVVHRATAPNEWELSH 439 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C+NKP A G N+KRT V HW GQRPQKI RAARRTN +PIV NN+E+ ++ Sbjct: 440 CSNKPPAV-GVNNRKRTASTRSSSPPVAHWAGQRPQKISRAARRTNLIPIVPNNDESPAL 498 Query: 841 DGMSDADNDN-GFMCAKDFPNISAKQVKLKSEQ-----FSESEESGVAEIXXXXXXXXXX 680 D +SD G AK S +QVKLKSE SESEESG EI Sbjct: 499 DTVSDVSGSELGLGFAKRLTGNSPQQVKLKSEPASSAALSESEESGAPEIKSKDKGKRSD 558 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG NV K S L + K KL++ E+ GD ++RQGR R T+ RSLMP +V+K Sbjct: 559 EI--DEKAGLNVLKVSTLGLQSRKNKLVTGEDLGDGVRRQGRTGRGSTT-RSLMPMSVEK 615 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 + N AKQ R++R+ DK ESK GRPP RKLSDRKAY + + ++AAADF DDG Sbjct: 616 VGNVGTAKQLRSARLGFDKNESKTGRPPTRKLSDRKAYKRQKHTMVNAAADFLVGSDDGH 675 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +EL AAA AV++ + FWR ME F FIS+ADI LKQQ V + + S+ Sbjct: 676 EELTAAASAVINPVHACPNPFWRQMESFFGFISDADIACLKQQGNVESTAPSP-AQVSSE 734 Query: 157 
ANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHST--FPLCQRLIAALITD 2 N N +E+ GL TE+ +SE V L Q+LIAA+I++ Sbjct: 735 INICSTVPNGYGLIEHEEEMGL--TTEKRLSEQLVPGARDISLYQKLIAAIISE 786 >ref|XP_002279178.2| PREDICTED: uncharacterized protein LOC100257683 [Vitis vinifera] Length = 1297 Score = 286 bits (732), Expect = 1e-74 Identities = 180/414 (43%), Positives = 235/414 (56%), Gaps = 14/414 (3%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK+ RED++SPSP+S+ KM SAR P+ G+LPK + T+ +DWE S Sbjct: 384 VNLRAVNKANAREDFSSPSPTSNMKMNASARAPRSGSGLLPKAFSIVHRATALNDWEPSH 443 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNK S G N+KRT V W GQRPQKI R RRTN +PIVS+N+ET + Sbjct: 444 CTNKLSPAVGANNRKRTPSTRSSSPPVAQWAGQRPQKISRTGRRTNLVPIVSSNDETPVL 503 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D +SD A N+NG A+ + S +QVKL+ + F SESEESG A+I Sbjct: 504 DSVSDVAGNENGLGSARRLSSNSPQQVKLRGDHFSSATLSESEESGAADI--KSRDKSKK 561 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLISEE-RGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG + VLP+ K +LISEE GD ++RQGR R F SSRSL+P Sbjct: 562 SDDIDEKAGQTL------VLPSRKNRLISEEDLGDGVRRQGRTGRGFPSSRSLVP----- 610 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADFGDDGQQEL 323 AKQ R++++ +K ESK GRPP RKLSDRKAYT + +A++AAADF +DG +EL Sbjct: 611 -----MAKQLRSAKLGYNKTESKDGRPPTRKLSDRKAYTRQKHTAINAAADFINDGHEEL 665 Query: 322 LAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAHANG 149 LAAA AV++ + FWR ME F F+S+ADI +LKQQ + P +G Sbjct: 666 LAAANAVINPIHAFSNSFWRQMEPFFGFLSDADIAYLKQQGNLE-----STTPVPLDVDG 720 Query: 148 TGNSLNELAFVENMGHTGLMHATEELISEAAVHST-----FPLCQRLIAALITD 2 N +E+ G T +L T PLCQRLI ALI++ Sbjct: 721 YNTVANGFGLLEHERDVGTGTETIKLSPGLLTPGTRADDPIPLCQRLITALISE 774 >gb|ESW26618.1| hypothetical protein PHAVU_003G134300g [Phaseolus vulgaris] Length = 1296 Score = 285 bits (728), Expect = 3e-74 Identities = 177/417 (42%), Positives = 242/417 (58%), Gaps = 17/417 (4%) Frame = -1 Query: 1201 
VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R+++NS SP++SAKM + R P+ G+ PK+SP + +DWE+S Sbjct: 389 VNFRAVNKATARDEFNSASPTTSAKMNTAVRAPRSGSGVAPKLSPVVHRAAVPNDWELSH 448 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C KP P+ N+KR V W QRPQK R ARRTNF+ IVSNN+E ++ Sbjct: 449 CATKP--PAAGNNRKRVASARSSSPPVVPW--QRPQKSSRTARRTNFMSIVSNNDEAPAL 504 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSEQ-----FSESEESGVAEIXXXXXXXXXX 680 D SD A ND G ++ S++Q+KLK++ SESEESGVA+ Sbjct: 505 DTASDVAGNDLGLGFSRRLAGSSSQQIKLKADPSTSAALSESEESGVADTKPKEKGRKPE 564 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLISEERGDNLKRQGRASRSFTSSRSLMPTTVDKL 500 D ++G NV+K S VLP K KL+SEE GD ++RQGR RS T++RSLMP T +KL Sbjct: 565 EI--DQKSGQNVQKVSNLVLPTRKNKLVSEEHGDGVRRQGRTGRSLTATRSLMPMTSEKL 622 Query: 499 CNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQQ 329 N AKQ R++R+ DK ESK GRPP RKLSDRKAY + A++AAADF +DG + Sbjct: 623 GNIGTAKQLRSARLGSDKNESKAGRPPSRKLSDRKAYARQK-PAINAAADFFVGSEDGHE 681 Query: 328 ELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAHA 155 ELLAA K +++S+ S FWR ME F I+E D+ + KQ+ + + ++PT Sbjct: 682 ELLAAVKGLINSAHTFSSPFWRQMEPFFSLITEEDVAYWKQKVN---LESSVLMPTPIRL 738 Query: 154 NGTGNSLNE---LAFVENMGHTGLMHA---TEELISEAAVHSTFPLCQRLIAALITD 2 +G +N A + G +A TE+L H+ PLC RLIAALI++ Sbjct: 739 DGCETIVNGYGLTACERDSGSDAQWNAGIITEQLQLSKGDHNMIPLCHRLIAALISE 795 >emb|CBI27872.3| unnamed protein product [Vitis vinifera] Length = 1304 Score = 284 bits (727), Expect = 4e-74 Identities = 181/417 (43%), Positives = 235/417 (56%), Gaps = 17/417 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VNLR +NK+ RED++SPSP+S+ KM SAR P+ G+LPK + T+ +DWE S Sbjct: 388 VNLRAVNKANAREDFSSPSPTSNMKMNASARAPRSGSGLLPKAFSIVHRATALNDWEPSH 447 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 CTNK S G N+KRT V W GQRPQKI R RRTN +PIVS+N+ET + Sbjct: 448 CTNKLSPAVGANNRKRTPSTRSSSPPVAQWAGQRPQKISRTGRRTNLVPIVSSNDETPVL 507 Query: 841 
DGMSD-ADNDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D +SD A N+NG A+ + S +QVKL+ + F SESEESG A+I Sbjct: 508 DSVSDVAGNENGLGSARRLSSNSPQQVKLRGDHFSSATLSESEESGAADI--KSRDKSKK 565 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLISEE-RGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG + VLP+ K +LISEE GD ++RQGR R F SSRSL+P Sbjct: 566 SDDIDEKAGQTL------VLPSRKNRLISEEDLGDGVRRQGRTGRGFPSSRSLVP----- 614 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 AKQ R++++ +K ESK GRPP RKLSDRKAYT + +A++AAADF DDG Sbjct: 615 -----MAKQLRSAKLGYNKTESKDGRPPTRKLSDRKAYTRQKHTAINAAADFIIGSDDGH 669 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +ELLAAA AV++ + FWR ME F F+S+ADI +LKQQ + P Sbjct: 670 EELLAAANAVINPIHAFSNSFWRQMEPFFGFLSDADIAYLKQQGNLE-----STTPVPLD 724 Query: 157 ANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHST-----FPLCQRLIAALITD 2 +G N +E+ G T +L T PLCQRLI ALI++ Sbjct: 725 VDGYNTVANGFGLLEHERDVGTGTETIKLSPGLLTPGTRADDPIPLCQRLITALISE 781 >ref|XP_006597829.1| PREDICTED: uncharacterized protein LOC100812435 isoform X4 [Glycine max] Length = 1292 Score = 283 bits (725), Expect = 8e-74 Identities = 181/421 (42%), Positives = 241/421 (57%), Gaps = 21/421 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R++YNS SP+SSAKM R P+ G+ PK SP + + +DWE S Sbjct: 382 VNFRAVNKATVRDEYNSVSPNSSAKMNTPIRAPRSGSGVGPKSSPGVHRASFPNDWEPSH 441 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C KP A G N+KR V HW QRPQK R ARRTNF+P VS+N+++ ++ Sbjct: 442 CMTKPPASVGTNNRKRVASARSSSPPVVHW--QRPQKSSRTARRTNFVPNVSSNDDSPAL 499 Query: 841 DGMSDAD-NDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D +SD ND G + S +Q+KLK + SESEESGVAEI Sbjct: 500 DSVSDVTGNDLGLGFVRRLAGNSPQQIKLKGDSLTSATLSESEESGVAEIKPKEKGRKPE 559 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG NV+K S VLP K KL+S EE GD ++RQGR R+F S+RS P T +K Sbjct: 560 EI--DQKAGQNVQKVSNLVLPTRKNKLVSGEEHGDGVRRQGRTGRNFPSARSPTPVTSEK 617 Query: 502 
LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N KQ R+SR+ L+K+ES+ GRPP RKLSDRKAY + SA+ A+ADF +DG Sbjct: 618 LGNIGTVKQLRSSRLGLEKSESRAGRPPTRKLSDRKAYARQKHSAISASADFLVGSEDGH 677 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPT--- 167 +ELLAA K V++S++ S FWR ME F +SE D+ + KQ+ + + ++PT Sbjct: 678 EELLAAVKGVINSARAFSSQFWRQMEPFFGLMSEEDLAYWKQKIN---LEPSGLMPTPVP 734 Query: 166 ------SAHANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAALIT 5 A ANG G + +E F E TG E+L + P CQRLI+ALI+ Sbjct: 735 SYIDDCEAVANGFGLTGSERDF-EPGDQTGAGIVAEQLQLAKGDSNGIPFCQRLISALIS 793 Query: 4 D 2 + Sbjct: 794 E 794 >ref|XP_006587024.1| PREDICTED: uncharacterized protein LOC100803232 isoform X1 [Glycine max] Length = 1293 Score = 283 bits (724), Expect = 1e-73 Identities = 180/423 (42%), Positives = 243/423 (57%), Gaps = 23/423 (5%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R++YNS SP+SSAKM + R P+ G+ PK+SP + + +D E SQ Sbjct: 380 VNFRAVNKATVRDEYNSASPNSSAKMNTTIRAPRTGSGVAPKLSPGVHRASVPNDCEPSQ 439 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C KP A G N+KR V HW QRPQK R ARRTNF+PIVS+N+++ ++ Sbjct: 440 CMTKPPASVGTNNRKRVASARSSSPPVVHW--QRPQKSSRTARRTNFVPIVSSNDDSPAL 497 Query: 841 DGMSDA-DNDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D +SD DND G + S +Q+KLK + SESEESGVAEI Sbjct: 498 DSVSDVTDNDLGLGFVRRLAGNSPQQIKLKGDSLTSATLSESEESGVAEIKPKEKGRKPE 557 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG NV+K VLP K KL+S EE GD ++RQGR R+F ++RS P T +K Sbjct: 558 EI--DQQAGKNVQKVFNLVLPTRKNKLVSGEEHGDGVQRQGRTGRNFPAARSPTPVTSEK 615 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N KQ R+SR+ L+K+ES+ GRPP RKLSDRKAY + SA+ A+ADF +DG Sbjct: 616 LGNIGTVKQLRSSRLGLEKSESRAGRPPTRKLSDRKAYARQKHSAISASADFLVGSEDGH 675 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVR-----------FV 191 +ELLAA K V++S++ S FWR +E F I+E DIG+ KQ+ + ++ Sbjct: 676 
EELLAAVKGVINSARAFSSQFWRQIEPFFGLINEEDIGYWKQKINLESSGLMPSPVPSYI 735 Query: 190 SDTQVVPTSAHANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQRLIAAL 11 D + V ANG G + +E F E G E+L + LCQRLI+AL Sbjct: 736 DDCKAV-----ANGFGLTGSERDF-EPGDQMGAAIVAEQLQLAKGDSNGISLCQRLISAL 789 Query: 10 ITD 2 I++ Sbjct: 790 ISE 792 >ref|XP_006600451.1| PREDICTED: uncharacterized protein LOC100805358 isoform X1 [Glycine max] Length = 1293 Score = 281 bits (718), Expect = 5e-73 Identities = 180/420 (42%), Positives = 241/420 (57%), Gaps = 20/420 (4%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R+++NS SP+S AKM + R P+ G+ PK+SP + +DWE+S Sbjct: 384 VNFRAVNKATARDEFNSASPTSGAKMNTAIRAPRSGSGVAPKLSPVVHRAGVSNDWELSH 443 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 + KP A G +N+KR V W QRPQK R ARRTNF+PIVSN++E ++ Sbjct: 444 SSPKPPAAGGTSNRKRVASARSSSPPVVPW--QRPQKSSRTARRTNFMPIVSNSDEAPAL 501 Query: 841 DGMSD-ADNDNGFMCAKDFPNISAKQVKLK-----SEQFSESEESGVAEIXXXXXXXXXX 680 D SD A ND G A+ S +Q+KLK S SESEESGVA++ Sbjct: 502 DTASDVAGNDLGLGFARRLAGSSPQQIKLKGDPSSSAALSESEESGVADVKPKEKGRKAE 561 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D ++G NV+K S VLP K KL+S EE GD ++RQGR R+ ++RS++P T +K Sbjct: 562 EI--DQKSGQNVQKVSNMVLPTRKNKLVSGEEHGDGVRRQGRTGRNLAATRSMIPMTSEK 619 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N AKQ R++R+ DK ESK GRPP RKLSDRKAY + A++AAADF +DG Sbjct: 620 LGNIGTAKQLRSARLGSDKNESKAGRPPSRKLSDRKAYARQK-PAINAAADFFVESEDGH 678 Query: 331 QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158 +ELLAA K V++S+ S FWR ME F I+E DI + KQ +V S T + PT Sbjct: 679 EELLAAVKGVINSAHAFSSPFWRQMEPFFSLITEEDIAYWKQ--KVNLESST-LTPTPIP 735 Query: 157 ANGTG-----NSLNELAFVENMGHTGLMHA---TEELISEAAVHSTFPLCQRLIAALITD 2 +N G N + + G +A E+L H+ PLCQRLIAALI++ Sbjct: 736 SNIDGVETIVNGYGLMGCERDAGFDAQWNAGIVAEQLQLSKGDHNVIPLCQRLIAALISE 795 >ref|XP_006597826.1| PREDICTED: uncharacterized protein LOC100812435 isoform X1 
[Glycine max] gi|571519354|ref|XP_006597827.1| PREDICTED: uncharacterized protein LOC100812435 isoform X2 [Glycine max] Length = 1300 Score = 278 bits (712), Expect = 2e-72 Identities = 181/429 (42%), Positives = 240/429 (55%), Gaps = 29/429 (6%) Frame = -1 Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022 VN R +NK+ R++YNS SP+SSAKM R P+ G+ PK SP + + +DWE S Sbjct: 382 VNFRAVNKATVRDEYNSVSPNSSAKMNTPIRAPRSGSGVGPKSSPGVHRASFPNDWEPSH 441 Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842 C KP A G N+KR V HW QRPQK R ARRTNF+P VS+N+++ ++ Sbjct: 442 CMTKPPASVGTNNRKRVASARSSSPPVVHW--QRPQKSSRTARRTNFVPNVSSNDDSPAL 499 Query: 841 DGMSDAD-NDNGFMCAKDFPNISAKQVKLKSEQF-----SESEESGVAEIXXXXXXXXXX 680 D +SD ND G + S +Q+KLK + SESEESGVAEI Sbjct: 500 DSVSDVTGNDLGLGFVRRLAGNSPQQIKLKGDSLTSATLSESEESGVAEIKPKEKGRKPE 559 Query: 679 XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503 D +AG NV+K S VLP K KL+S EE GD ++RQGR R+F S+RS P T +K Sbjct: 560 EI--DQKAGQNVQKVSNLVLPTRKNKLVSGEEHGDGVRRQGRTGRNFPSARSPTPVTSEK 617 Query: 502 LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332 L N KQ R+SR+ L+K+ES+ GRPP RKLSDRKAY + SA+ A+ADF +DG Sbjct: 618 LGNIGTVKQLRSSRLGLEKSESRAGRPPTRKLSDRKAYARQKHSAISASADFLVGSEDGH 677 Query: 331 QELLAAAKAVVDS----------SQLSLFWRHMELMFRFISEADIGFLKQQDEVRFVSDT 182 +ELLAA K V++S + S FWR ME F +SE D+ + KQ+ + + Sbjct: 678 EELLAAVKGVINSVLYFLITAARAFSSQFWRQMEPFFGLMSEEDLAYWKQKIN---LEPS 734 Query: 181 QVVPT---------SAHANGTGNSLNELAFVENMGHTGLMHATEELISEAAVHSTFPLCQ 29 ++PT A ANG G + +E F E TG E+L + P CQ Sbjct: 735 GLMPTPVPSYIDDCEAVANGFGLTGSERDF-EPGDQTGAGIVAEQLQLAKGDSNGIPFCQ 793 Query: 28 RLIAALITD 2 RLI+ALI++ Sbjct: 794 RLISALISE 802 >ref|XP_006593923.1| PREDICTED: uncharacterized protein LOC100775655 isoform X1 [Glycine max] gi|571497496|ref|XP_006593924.1| PREDICTED: uncharacterized protein LOC100775655 isoform X2 [Glycine max] gi|571497498|ref|XP_006593925.1| PREDICTED: uncharacterized protein LOC100775655 isoform X3 
 [Glycine max]
 gi|571497500|ref|XP_006593926.1| PREDICTED: uncharacterized protein LOC100775655 isoform X4 [Glycine max]
 gi|571497502|ref|XP_006593927.1| PREDICTED: uncharacterized protein LOC100775655 isoform X5 [Glycine max]
 gi|571497505|ref|XP_006593928.1| PREDICTED: uncharacterized protein LOC100775655 isoform X6 [Glycine max]
 gi|571497507|ref|XP_006593929.1| PREDICTED: uncharacterized protein LOC100775655 isoform X7 [Glycine max]
 gi|571497509|ref|XP_006593930.1| PREDICTED: uncharacterized protein LOC100775655 isoform X8 [Glycine max]
 gi|571497511|ref|XP_006593931.1| PREDICTED: uncharacterized protein LOC100775655 isoform X9 [Glycine max]
 gi|571497514|ref|XP_006593932.1| PREDICTED: uncharacterized protein LOC100775655 isoform X10 [Glycine max]
          Length = 1295

 Score = 277 bits (709), Expect = 5e-72
 Identities = 179/420 (42%), Positives = 238/420 (56%), Gaps = 20/420 (4%)
 Frame = -1

Query: 1201 VNLRTINKSGRREDYNSPSPSSSAKMIPSARPPQLNKGILPKMSPSSQQPTSCSDWEISQ 1022
            VN R +NK+ R+++NS SP+SSAK+ + R P+ G+ PK+SP + +DWE+S
Sbjct: 384  VNFRAVNKATARDEFNSASPTSSAKINTAIRAPRSGSGVAPKLSPVVHRAGVSNDWELSH 443

Query: 1021 CTNKPSAPSGPTNKKRTXXXXXXXXSVTHWTGQRPQKILRAARRTNFLPIVSNNEETMSV 842
            T KP A G N+KR V W QRPQK R ARRTNF+PIV N++E ++
Sbjct: 444  STTKPPAAGGTNNRKRVASARSSSPPVVPW--QRPQKSSRTARRTNFMPIVPNSDEASAL 501

Query: 841  DGMSD-ADNDNGFMCAKDFPNISAKQVKLK-----SEQFSESEESGVAEIXXXXXXXXXX 680
            D SD A ND G A+ S +Q+K K S SESEESGVA++
Sbjct: 502  DTASDVAGNDLGLGFARRLAGSSPQQIKQKGDPSSSAALSESEESGVADVKPKEKGRKAE 561

Query: 679  XXKADGRAGHNVKKASIAVLPAHKKKLIS-EERGDNLKRQGRASRSFTSSRSLMPTTVDK 503
            D ++G NV+K S VLP K KL+S EE GD ++RQGR RS ++RS++P T +K
Sbjct: 562  EI--DQKSGQNVQKVSNMVLPTRKNKLVSGEEHGDGVRRQGRTGRSLAATRSMIPMTSEK 619

Query: 502  LCNTAAAKQPRTSRMILDKAESKVGRPPMRKLSDRKAYTNNRLSALHAAADF---GDDGQ 332
            L N AKQ R++R+ DK ESK GRPP RKLSDRKAY + A++AAADF +DG
Sbjct: 620  LGNIGTAKQLRSARLGSDKNESKAGRPPSRKLSDRKAYARQK-PAINAAADFFVGSEDGH 678

Query: 331  QELLAAAKAVVDSSQL--SLFWRHMELMFRFISEADIGFLKQQDEVRFVSDTQVVPTSAH 158
            +ELLAA K V++S+ S FWR ME F I+E DI + KQ +V S T + PT
Sbjct: 679  EELLAAVKGVINSAHAFSSPFWRQMEPFFSLITEEDITYWKQ--KVNLESST-LTPTPVP 735

Query: 157  ANGTG-----NSLNELAFVENMGHTGLMHA---TEELISEAAVHSTFPLCQRLIAALITD 2
            +N G N + + G +A E+ H+ PLCQRLIAALI++
Sbjct: 736  SNIDGCETIVNGYGLMGCERDAGFDAQWNAGIVAEQSQLSKGDHNVIPLCQRLIAALISE 795