BLASTX nr result
ID: Rehmannia25_contig00006599
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia25_contig00006599 (1137 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum ... 249 1e-63 ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251... 248 2e-63 ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum ... 238 3e-60 gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlise... 213 9e-53 emb|CBI32170.3| unnamed protein product [Vitis vinifera] 205 2e-50 ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citr... 191 5e-46 gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, ... 186 2e-44 ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211... 184 4e-44 ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citr... 184 8e-44 gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, ... 181 5e-43 ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|5... 174 8e-41 ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Caps... 171 4e-40 ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arab... 167 6e-39 ref|NP_180843.2| proline-rich uncharacterized protein [Arabidops... 164 6e-38 gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] 163 1e-37 ref|XP_004171685.1| PREDICTED: uncharacterized protein LOC101226... 162 2e-37 ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308... 161 4e-37 ref|XP_006410417.1| hypothetical protein EUTSA_v10016920mg [Eutr... 156 1e-35 ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779... 155 3e-35 gb|AAB91981.1| hypothetical protein [Arabidopsis thaliana] gi|20... 154 8e-35 >ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum tuberosum] Length = 344 Score = 249 bits (637), Expect = 1e-63 Identities = 136/248 (54%), Positives = 171/248 (68%), Gaps = 9/248 (3%) Frame = +1 Query: 253 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 420 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 94 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPTFGLNQMDPGLGQSTGVRPSHLQHALLG 153 Query: 421 XXXXXXXXXXXXX--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 591 V+PG VKG PV SS H K S+SD NG ++ RDR +DDTFA Sbjct: 154 SSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIASTQPSLSDCNGFREKRDRSKDDTFA 213 Query: 592 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAAQVVDSPDK-- 765 IIRDRKVRIS++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 214 IIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDAESPVKKE 273 Query: 766 GAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMVE 945 G +VE+LS KELLQRH+KRAK++RSRLREERL+RI+RYK RLALLLPPMVE Sbjct: 274 GDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLALLLPPMVE 333 Query: 946 QHFKSESA 969 Q F+++ A Sbjct: 334 QQFRNDPA 341 >ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251026 [Solanum lycopersicum] Length = 342 Score = 248 bits (634), Expect = 2e-63 Identities = 135/248 (54%), Positives = 170/248 (68%), Gaps = 9/248 (3%) Frame = +1 Query: 253 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 420 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 92 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPVFGVNQMDPGSGQSAGVRPSHLQHALLG 151 Query: 421 XXXXXXXXXXXXX--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 591 V+PG VKG PV SS H K S+SD NG +D RDR +D+TFA Sbjct: 152 SSPTVNSAGPAASSGVLPGAVKGFPVVSSSHNKIASTQPSLSDCNGFRDKRDRSKDETFA 211 Query: 592 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAAQVVDSPDK-- 765 IIRDRKVRI ++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 212 IIRDRKVRICDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDAESPVKKE 271 Query: 766 GAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMVE 945 G +VE+LS KELLQRH+KRAK++RSRLREERL+RI+RYK RLALLLPPMVE Sbjct: 272 GDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLALLLPPMVE 331 Query: 946 QHFKSESA 969 Q F+++ A Sbjct: 332 QQFRNDPA 339 >ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum tuberosum] Length = 366 Score = 238 bits (608), Expect = 3e-60 Identities = 135/270 (50%), Positives = 170/270 (62%), Gaps = 31/270 (11%) Frame = +1 Query: 253 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 420 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 94 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPTFGLNQMDPGLGQSTGVRPSHLQHALLG 153 Query: 421 XXXXXXXXXXXXX--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 591 V+PG VKG PV SS H K S+SD NG ++ RDR +DDTFA Sbjct: 154 SSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIASTQPSLSDCNGFREKRDRSKDDTFA 213 Query: 592 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAAQVVDSPDKG- 768 IIRDRKVRIS++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 214 IIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDAESPVKKE 273 Query: 769 -----------------------AXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLRE 879 +VE+LS KELLQRH+KRAK++RSRLRE Sbjct: 274 GDKEEEEEDCSFSSMLILKRVNFPIPINFKAGESVEHLSPKELLQRHVKRAKRIRSRLRE 333 Query: 880 ERLQRISRYKNRLALLLPPMVEQHFKSESA 969 ERL+RI+RYK RLALLLPPMVEQ F+++ A Sbjct: 334 ERLRRIARYKTRLALLLPPMVEQQFRNDPA 363 >gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlisea aurea] Length = 302 Score = 213 bits (543), Expect = 9e-53 Identities = 129/250 (51%), Positives = 155/250 (62%), Gaps = 4/250 (1%) Frame = +1 Query: 193 QLAPRXXXXXXXXXXQDPSQLLYPVASSGRGFLARPLHMPAAGPSPRPPYVFPYLDPGQG 372 QLAPR QDPSQ + SSG G ++RPL A P+ RPPY P L Sbjct: 66 QLAPRTPHS------QDPSQ----IGSSGGGIVSRPLS--AGRPTQRPPYGSPCLL---- 109 Query: 373 NPGFIRPNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNG 552 + G RPN+L HV+L MPGV +GIP +S H K S + D+NG Sbjct: 110 DQGLARPNNLNHVILGPMRGSSADTSGAGAMPGVAQGIPFPTSSHSKVHPHSILVGDSNG 169 Query: 553 HK-DLRDR-RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRP 726 H DLR R RDD A+IRDRKVR+SE+ASLY+LCRSWL+NGVP + QPQY+D VKSLPRP Sbjct: 170 HTTDLRGRHRDDVVALIRDRKVRLSENASLYALCRSWLRNGVPADMQPQYVDVVKSLPRP 229 Query: 727 LPVAAQVVDSPDK--GAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRIS 900 V+ Q DSP+K + +V LS KELLQRHIKRAKK+RS+L E R +RI Sbjct: 230 SHVSGQTADSPEKNEASSEVETEDEDSVGQLSEKELLQRHIKRAKKIRSKLNERRFKRID 289 Query: 901 RYKNRLALLL 930 RYK+RLALLL Sbjct: 290 RYKSRLALLL 299 >emb|CBI32170.3| unnamed protein product [Vitis vinifera] Length = 342 Score = 205 bits (522), Expect = 2e-50 Identities = 125/259 (48%), Positives = 154/259 (59%), Gaps = 24/259 (9%) Frame = +1 Query: 244 PSQLLYPVASSGRGFLARPLHM------------PAAGPSPRPPYVFPYLDPGQGNP-GF 384 P +LYPVASSGRGF+ +PL P A PR Q P GF Sbjct: 85 PQGILYPVASSGRGFIPKPLRPQSSDHNTVTVANPGAAFPPRSAATAAAAFSHQARPFGF 144 Query: 385 --------IRPNHLPHVLLXXXXXXXXXXXXXXVMPGV--VKGIPVSSSHHPKAGLPSSS 534 + +PH+L +PG +KGIPVS+ HPK S Sbjct: 145 PQSDLNYPVHSMRMPHLL--------PSHVGVTAVPGSAPIKGIPVSA--HPKVAPSPPS 194 Query: 535 ISDNNGHKDLRDR-RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVK 711 +SD NG+KD RDR RDDTF +RDRKVRIS+ AS+Y+LCRSWL+NG EETQPQ+ D++K Sbjct: 195 VSDCNGYKDSRDRNRDDTFVTVRDRKVRISDGASIYALCRSWLRNGFSEETQPQHYDSMK 254 Query: 712 SLPRPLPVAAQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQ 891 SLPRPLP+ + P K +VENL ++LLQRHIKRAKKVR+RLRE+RL+ Sbjct: 255 SLPRPLPIPVTDPNLPKKKEDDEEEEDEGSVENLLPQDLLQRHIKRAKKVRARLREQRLK 314 Query: 892 RISRYKNRLALLLPPMVEQ 948 RI+RYK RLALLLPP VE+ Sbjct: 315 RIARYKTRLALLLPPPVER 333 >ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|557541222|gb|ESR52266.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 303 Score = 191 bits (485), Expect = 5e-46 Identities = 112/252 (44%), Positives = 151/252 (59%), Gaps = 20/252 (7%) Frame = +1 Query: 253 LLYPVASSGRGFLARPLHMPA--------AGPSPRPPYVFPYLDPGQGNPGF-IRPNHLP 405 ++YPVASSGRGF+ +P+ G PRP + PY P N + +H Sbjct: 43 VVYPVASSGRGFIPKPMRPSDQTVTVANHGGYPPRPNQLPPYPRPHLDNHHHPVLHHHQH 102 Query: 406 HVLLXXXXXXXXXXXXXXVM--PGVVKGIPVSSSHHPKAGLPSSSIS-----DNNGH-KD 561 H ++ + P ++G+PVSS H A S+S+S D+NG+ K Sbjct: 103 HHMIRPPPLNNQQHQHPQISSNPSPIRGVPVSSGHLKVAPSSSASLSPVIPPDSNGYNKH 162 Query: 562 LRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVA- 738 LRD D+TF I+RDRKVRI+E ASLY+LCRSWL+NG PEETQPQ+ D VKSLPRPLP+ Sbjct: 163 LRDNSDETFTIVRDRKVRITEGASLYALCRSWLRNGSPEETQPQHADGVKSLPRPLPMPR 222 Query: 739 --AQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKN 912 A + + NV+ LS ++LL+RH++RAK++R+RL ER +RI RYK Sbjct: 223 ADANIAKEKESEEDEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYKT 282 Query: 913 RLALLLPPMVEQ 948 RL+LLLPP+VEQ Sbjct: 283 RLSLLLPPLVEQ 294 >gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 276 Score = 186 bits (471), Expect = 2e-44 Identities = 115/254 (45%), Positives = 153/254 (60%), Gaps = 15/254 (5%) Frame = +1 Query: 253 LLYPVASSGRGFL-----ARPLHMPAAGPSPRPPYVFPYLDPGQGNPGFIRPN----HLP 405 ++YPVASSGRGFL RPL P P P + + +P +P P+ H P Sbjct: 49 VMYPVASSGRGFLPTNHPCRPLLPYHHHPHPHPHH---FANPRPPSPSLSLPHPTHFHPP 105 Query: 406 HVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDD 582 L S S HPK SS+S+ NG+K++RDR +DD Sbjct: 106 LKAL-------------------------SLSLHPKVAPSPSSLSETNGYKNVRDRTKDD 140 Query: 583 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAA-----QV 747 + +RDRKVRI++ AS+Y+LCRSWL+NG P+ETQPQY D KSLP+PLP+ + Sbjct: 141 SLVNVRDRKVRITDGASVYALCRSWLRNGFPDETQPQYGDVSKSLPQPLPIPVTDNLLKD 200 Query: 748 VDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALL 927 + ++ +VENLSA++LL+RHI RAKKVRSRLR+ERL+RI+RYK RLALL Sbjct: 201 TEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALL 260 Query: 928 LPPMVEQHFKSESA 969 LPP+VEQ F+S++A Sbjct: 261 LPPLVEQ-FRSDAA 273 >ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211370 [Cucumis sativus] Length = 376 Score = 184 bits (468), Expect = 4e-44 Identities = 120/261 (45%), Positives = 153/261 (58%), Gaps = 24/261 (9%) Frame = +1 Query: 238 QDPSQ-LLYPVASSGRGFLARPLH-MPA---------AGPSPRPPYVFPYLDPGQGNPGF 384 QD SQ +LYPVASSGRGF+ R + +PA G RP FP+ G +P Sbjct: 121 QDASQAILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIG--SPHL 178 Query: 385 IRPNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDL 564 +H H+ + G +K P SS PKA P +I ++NG K++ Sbjct: 179 DSMSHPMHMTRPPNLQQQLIPFSGSSISGSIKCAPNSSD--PKA-FPPQTICESNGCKEM 235 Query: 565 RDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAA- 741 R R DDT ++RDRKVRI++ ASLY+LCRSWL+NG EE+QPQY +SLPRPLP+A Sbjct: 236 RVR-DDTLCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGSFFRSLPRPLPIAVA 294 Query: 742 ------------QVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREER 885 + VD DK ++E+LS +ELL+RH++RAKKVRSRLREER Sbjct: 295 GAAPLQKKEVVKEEVDEKDK--------DEGSIEHLSTQELLKRHVRRAKKVRSRLREER 346 Query: 886 LQRISRYKNRLALLLPPMVEQ 948 LQRI RYK RLALLLPP +EQ Sbjct: 347 LQRIERYKTRLALLLPPPIEQ 367 >ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|568858552|ref|XP_006482814.1| PREDICTED: RNA-binding protein 33-like [Citrus sinensis] gi|557541223|gb|ESR52267.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 297 Score = 184 bits (466), Expect = 8e-44 Identities = 109/251 (43%), Positives = 147/251 (58%), Gaps = 19/251 (7%) Frame = +1 Query: 253 LLYPVASSGRGFLARPLHMPA--------AGPSPRPPYVFPYLDPGQGNPGF-IRPNHLP 405 ++YPVASSGRGF+ +P+ G PRP + PY P N + +H Sbjct: 43 VVYPVASSGRGFIPKPMRPSDQTVTVANHGGYPPRPNQLPPYPRPHLDNHHHPVLHHHQH 102 Query: 406 HVLLXXXXXXXXXXXXXXVM--PGVVKGIPVSSSHHPKAGLPSSSIS-----DNNGHKDL 564 H ++ + P ++G+PVSS H A S+S+S D+NG Sbjct: 103 HHMIRPPPLNNQQHQHPQISSNPSPIRGVPVSSGHLKVAPSSSASLSPVIPPDSNG---- 158 Query: 565 RDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVA-- 738 D D+TF I+RDRKVRI+E ASLY+LCRSWL+NG PEETQPQ+ D VKSLPRPLP+ Sbjct: 159 -DNSDETFTIVRDRKVRITEGASLYALCRSWLRNGSPEETQPQHADGVKSLPRPLPMPRA 217 Query: 739 -AQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNR 915 A + + NV+ LS ++LL+RH++RAK++R+RL ER +RI RYK R Sbjct: 218 DANIAKEKESEEDEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYKTR 277 Query: 916 LALLLPPMVEQ 948 L+LLLPP+VEQ Sbjct: 278 LSLLLPPLVEQ 288 >gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 277 Score = 181 bits (459), Expect = 5e-43 Identities = 115/255 (45%), Positives = 153/255 (60%), Gaps = 16/255 (6%) Frame = +1 Query: 253 LLYPVASSGRGFL-----ARPLHMPAAGPSPRPPYVFPYLDPGQGNPGFIRPN----HLP 405 ++YPVASSGRGFL RPL P P P + + +P +P P+ H P Sbjct: 49 VMYPVASSGRGFLPTNHPCRPLLPYHHHPHPHPHH---FANPRPPSPSLSLPHPTHFHPP 105 Query: 406 HVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDD 582 L S S HPK SS+S+ NG+K++RDR +DD Sbjct: 106 LKAL-------------------------SLSLHPKVAPSPSSLSETNGYKNVRDRTKDD 140 Query: 583 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQ-PQYLDTVKSLPRPLPVAA-----Q 744 + +RDRKVRI++ AS+Y+LCRSWL+NG P+ETQ PQY D KSLP+PLP+ + Sbjct: 141 SLVNVRDRKVRITDGASVYALCRSWLRNGFPDETQQPQYGDVSKSLPQPLPIPVTDNLLK 200 Query: 745 VVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLAL 924 + ++ +VENLSA++LL+RHI RAKKVRSRLR+ERL+RI+RYK RLAL Sbjct: 201 DTEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLAL 260 Query: 925 LLPPMVEQHFKSESA 969 LLPP+VEQ F+S++A Sbjct: 261 LLPPLVEQ-FRSDAA 274 >ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|566150610|ref|XP_006369465.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550348014|gb|ERP66034.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 340 Score = 174 bits (440), Expect = 8e-41 Identities = 117/273 (42%), Positives = 156/273 (57%), Gaps = 32/273 (11%) Frame = +1 Query: 253 LLYPVASSGRGFLARPL--------------HMPAAGPSPRP--PYVFPYLDPGQGNPGF 384 +LYPVASSGRGF+ RP+ H AG + RP P + +P Sbjct: 77 VLYPVASSGRGFIPRPVRPHQDQTPANQGAYHPRGAGVAYRPHTPTTVVGSPSSRSHPNP 136 Query: 385 IRPNHLPHV-------LLXXXXXXXXXXXXXXVMPGV--------VKGIPVSSSHHPKAG 519 + L H+ L+ V G+ +KGIPV+ + Sbjct: 137 QQLGDLHHLHNVQQQHLMMSRQHPTHLQHHNYVGFGLGVGSVAAPIKGIPVTG----QLK 192 Query: 520 LPSSSISDNNGHKDLRDR-RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQY 696 + S +SD+NG+K+LRDR RDD ++RDRKVRIS+ A LY+LCRSWL+NG PEE++ Y Sbjct: 193 VAPSPVSDSNGYKNLRDRSRDDNLMVVRDRKVRISDGAPLYALCRSWLRNGFPEESEVHY 252 Query: 697 LDTVKSLPRPLPVAAQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLR 876 D+VK LPRPL + + +K V+NLSA ELL+RHIK AKKVR+RLR Sbjct: 253 GDSVKPLPRPLLPKEESEEEVEKEKKDEEP-----VDNLSAAELLKRHIKHAKKVRARLR 307 Query: 877 EERLQRISRYKNRLALLLPPMVEQHFKSESADE 975 EERL+RI+RYK+RLALLLPP VEQ F++++ E Sbjct: 308 EERLKRIARYKSRLALLLPPQVEQ-FRNDTPAE 339 >ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] gi|482563243|gb|EOA27433.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] Length = 339 Score = 171 bits (434), Expect = 4e-40 Identities = 110/250 (44%), Positives = 138/250 (55%), Gaps = 14/250 (5%) Frame = +1 Query: 241 DPSQLLYPVASSGRGFLARPLHM-------PAAGPS---PRPPYVFPYLDPGQGNPGFIR 390 DPS L+YP SSGRGF RP P A P PRP Y + + G Sbjct: 98 DPSTLIYPFGSSGRGFPTRPARQNSNSVADPVASPGGHPPRPVYAYHHGQFG-------- 149 Query: 391 PNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRD 570 ++L + + PG +KG+P P+A +SI DN GHK R Sbjct: 150 -SNLDPMFQFMRAAHPQNQQSPQLGPGHMKGVP--HFLQPRATPSPTSILDNVGHKKARS 206 Query: 571 RRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPV----A 738 R DD ++R RKVRI+E ASLYSLCRSWL+NG E QPQ DT+ LP+PLPV Sbjct: 207 R-DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIQPQRSDTLTCLPKPLPVDMTET 265 Query: 739 AQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRL 918 + DS ++ +V+ LS +LL+RH+ RAKKVRSRLRE+RL+RI+RYK RL Sbjct: 266 SLPKDSVEEPNPEEDKEDEESVKELSTSDLLKRHVDRAKKVRSRLREDRLKRIARYKARL 325 Query: 919 ALLLPPMVEQ 948 ALLLPP EQ Sbjct: 326 ALLLPPFGEQ 335 >ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] gi|297327086|gb|EFH57506.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] Length = 334 Score = 167 bits (424), Expect = 6e-39 Identities = 111/251 (44%), Positives = 138/251 (54%), Gaps = 15/251 (5%) Frame = +1 Query: 241 DPSQLLYPVASSGRGFLARPLHM-------PAAGPSPRPPY-VFPYLDPGQGNPGFIRPN 396 DPS L+YP SSGRGF RP P P PP V+ Y GQ N Sbjct: 91 DPSSLIYPFGSSGRGFPTRPGRQNSNSVADPVGSPGGYPPRPVYGYHQHGQ-----FGSN 145 Query: 397 HLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRR 576 P + + G +KG+P P+ +SI DN+GHK R R Sbjct: 146 LDPVLQQLMRAAHLQNQQSPQLGSGHMKGVP--HFLQPRVTPSPTSILDNSGHKKARSR- 202 Query: 577 DDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPV------- 735 DD ++R RKVRI+E ASLYSLCRSWL+NG E +PQ DT+ LP+PLPV Sbjct: 203 DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRSDTMTCLPKPLPVDMTETSL 262 Query: 736 AAQVVDSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNR 915 +VV+ P++ +V++LS +LL+RHI RAKKVRSRLREERL+RI+RYK R Sbjct: 263 PKEVVEEPNR---EEDKEDEESVKHLSESDLLKRHIDRAKKVRSRLREERLKRIARYKAR 319 Query: 916 LALLLPPMVEQ 948 LALLLPP EQ Sbjct: 320 LALLLPPFGEQ 330 >ref|NP_180843.2| proline-rich uncharacterized protein [Arabidopsis thaliana] gi|26450185|dbj|BAC42211.1| unknown protein [Arabidopsis thaliana] gi|28827576|gb|AAO50632.1| unknown protein [Arabidopsis thaliana] gi|330253655|gb|AEC08749.1| proline-rich uncharacterized protein [Arabidopsis thaliana] Length = 337 Score = 164 bits (415), Expect = 6e-38 Identities = 110/259 (42%), Positives = 140/259 (54%), Gaps = 23/259 (8%) Frame = +1 Query: 241 DPSQLLYPVASSGRGFLARPLHM-------PAAGPSPR------PPYVFPY------LDP 363 DPS L+YP SSGRGF RP+ P PSP P Y + + LDP Sbjct: 93 DPSSLIYPFGSSGRGFPTRPVRQNSNSVADPVGSPSPGGYTPRGPVYGYHHGQFVSNLDP 152 Query: 364 GQGNPGFIRPNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISD 543 F+R H + + G +KG+P P+A +SI D Sbjct: 153 MNQ---FMRAAHPQN------------QQSPQLGSGHMKGVP--HFLQPRATPSPTSILD 195 Query: 544 NNGHKDLRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPR 723 N+GHK R R DD ++R RKVRI+E ASLYSLCRSWL+NG E +PQ +D + LP+ Sbjct: 196 NSGHKKARSR-DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRIDMMTCLPK 254 Query: 724 PLPVAAQVVDSP----DKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQ 891 PLPV P ++ +V++LS +LL+RHI RAKKVR+RLREERL+ Sbjct: 255 PLPVDKTETSLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLK 314 Query: 892 RISRYKNRLALLLPPMVEQ 948 RI+RYK RLALLLPP EQ Sbjct: 315 RIARYKARLALLLPPFGEQ 333 >gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] Length = 454 Score = 163 bits (413), Expect = 1e-37 Identities = 102/236 (43%), Positives = 134/236 (56%), Gaps = 11/236 (4%) Frame = +1 Query: 244 PSQLLYPVASSGRGFLARPLHM---PAAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVL 414 P + YPV SSGRGF++ P PAAG P NP RP + + Sbjct: 77 PQGIPYPVVSSGRGFISLPKSSSSSPAAGADQTVTVASP-------NPSGYRPRPAANYV 129 Query: 415 ---LXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDD 582 + ++ G VKG+PVS PK PS S+ D NG+KD+RD+ RDD Sbjct: 130 VRPIQHIHHYHHHQQQPHLVAGPVKGVPVSIQLQPKVP-PSPSVPDCNGYKDMRDKVRDD 188 Query: 583 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAAQVVDSPD 762 + I+RDRKVRI+EDASLY+LC+SWL+NG EE+Q QY D V SLPRPLP+ + Sbjct: 189 SLTIVRDRKVRITEDASLYALCQSWLRNGFSEESQKQYGDAVMSLPRPLPIPMATNNEQK 248 Query: 763 KGA----XXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRL 918 K +V+NLSA++L +RH+KRAKKVR+RLRE R +RI+R + L Sbjct: 249 KEGEEDDNDGDEEDEESVKNLSAEDLFKRHLKRAKKVRARLREVRQKRIARVVSAL 304 >ref|XP_004171685.1| PREDICTED: uncharacterized protein LOC101226490 [Cucumis sativus] Length = 196 Score = 162 bits (410), Expect = 2e-37 Identities = 93/173 (53%), Positives = 117/173 (67%), Gaps = 13/173 (7%) Frame = +1 Query: 469 GVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRRDDTFAIIRDRKVRISEDASLYSLC 648 G +K P SS PKA P +I ++NG K++R R DDT ++RDRKVRI++ ASLY+LC Sbjct: 27 GSIKCAPNSSD--PKA-FPPQTICESNGCKEMRVR-DDTLCVVRDRKVRITDGASLYALC 82 Query: 649 RSWLKNGVPEETQPQYLDTVKSLPRPLPVAA-------------QVVDSPDKGAXXXXXX 789 RSWL+NG EE+QPQY +SLPRPLP+A + VD DK Sbjct: 83 RSWLRNGSQEESQPQYGSFFRSLPRPLPIAVAGAAPLQKKEVVKEEVDEKDK-------- 134 Query: 790 XXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMVEQ 948 ++E+LS +ELL+RH++RAKKVRSRLREERLQRI RYK RLALLLPP +EQ Sbjct: 135 DEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQ 187 >ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308794 [Fragaria vesca subsp. vesca] Length = 254 Score = 161 bits (408), Expect = 4e-37 Identities = 103/256 (40%), Positives = 150/256 (58%), Gaps = 13/256 (5%) Frame = +1 Query: 241 DPSQLLYPVASSGRGFLARPLHMPAAGPSPRPP-YVFPYLDPGQGNPGFI-----RPNHL 402 DP+ A++GR PL A P+P PP +++ Q +PG + RP + Sbjct: 4 DPNHTANAAAAAGR-----PLRPIAPAPTPPPPAHMYTVPMRAQSSPGALVYPSARPPYP 58 Query: 403 PHVLLXXXXXXXXXXXXXXVMPGVVKGI---PVSSSHHPKAGLPSSSISDNNGHKDLRDR 573 P + P + + P+ P SS+ D+NG +D Sbjct: 59 PPLNFHPHPHPYPPHLHPSPPPPAYQSLLPPPIKDLRFSGLVAPPSSVPDSNGIRD--KG 116 Query: 574 RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPR--PLPVAAQV 747 RDDT +I+DRKVRI++ ASLY LCRSWL+NG EE+QP+Y D +SLP+ P+P+A+ + Sbjct: 117 RDDTQFLIQDRKVRITDGASLYVLCRSWLRNGTSEESQPRYGDATRSLPKPSPIPMASAI 176 Query: 748 VDSPDKG--AXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLA 921 + D+G +VE++S ++LL+RHIKRA+KVR+RLREERL+RI+RYK+RLA Sbjct: 177 PPNKDEGDKKEDNEDKVEESVEHVSPEDLLKRHIKRARKVRARLREERLRRIARYKSRLA 236 Query: 922 LLLPPMVEQHFKSESA 969 LLLPP+VEQ F+++ A Sbjct: 237 LLLPPLVEQ-FRNDLA 251 >ref|XP_006410417.1| hypothetical protein EUTSA_v10016920mg [Eutrema salsugineum] gi|557111586|gb|ESQ51870.1| hypothetical protein EUTSA_v10016920mg [Eutrema salsugineum] Length = 328 Score = 156 bits (395), Expect = 1e-35 Identities = 104/251 (41%), Positives = 138/251 (54%), Gaps = 14/251 (5%) Frame = +1 Query: 238 QDPSQLLYPVASSGRGFLARPLHMPAA----------GPSPRPPYVFPYLDPGQGNPGFI 387 QDPS L+YP SSGRG RP ++ G PRP YV+ + GQ Sbjct: 87 QDPSGLVYPYPSSGRGLPTRPGRQNSSSVADPMGSPGGYPPRPAYVYHH---GQSR---- 139 Query: 388 RPNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLR 567 ++L ++ + G + G+P P+ P +SI DN+G K+ R Sbjct: 140 --SNLDPMIQFMRTAHPQIQQSPHLGSGYMIGVP--HFLQPRVAYPPTSILDNSGRKNAR 195 Query: 568 DRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPRPLPVAAQV 747 R D+ ++R RKVRI+E ASLYSLCRSWL+NG E Q Q DTV LP+PLPV Sbjct: 196 SR-DEVLVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIQ-QRSDTVTYLPKPLPVDMME 253 Query: 748 V----DSPDKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNR 915 +S ++ +V+ LS +LL+RH+ RAKKVR+RLRE+RL+RI+RYK R Sbjct: 254 TSLSRESVEEAHREEDNEDEESVKQLSDSDLLKRHVDRAKKVRARLREDRLKRIARYKAR 313 Query: 916 LALLLPPMVEQ 948 LALLLPP EQ Sbjct: 314 LALLLPPFGEQ 324 >ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779268 isoform X1 [Glycine max] Length = 274 Score = 155 bits (392), Expect = 3e-35 Identities = 81/153 (52%), Positives = 111/153 (72%), Gaps = 7/153 (4%) Frame = +1 Query: 511 KAGLPSSSISDNNGHKDLRDRR---DDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEE 681 K S+++D NG KD R +DTF ++RDRKVR+++DASLY+LCRSWL+NG+ EE Sbjct: 113 KKAAARSAVADVNGGKDTNTREKSSEDTFIVVRDRKVRVTDDASLYALCRSWLRNGINEE 172 Query: 682 TQPQYLDTVKSLPRPLP---VAAQVVD-SPDKGAXXXXXXXXXNVENLSAKELLQRHIKR 849 +QPQ D +K+LP+PLP VA+ + + D+ +VE+LS ++LL+RHIKR Sbjct: 173 SQPQQKDVIKALPKPLPASMVASYLSNKKEDEKDEDEKEENEQSVEHLSPQDLLKRHIKR 232 Query: 850 AKKVRSRLREERLQRISRYKNRLALLLPPMVEQ 948 AK VR+RLREERLQRI+RY++RL LLLPP +EQ Sbjct: 233 AKNVRARLREERLQRITRYRSRLRLLLPPAIEQ 265 >gb|AAB91981.1| hypothetical protein [Arabidopsis thaliana] gi|20197061|gb|AAM14901.1| hypothetical protein [Arabidopsis thaliana] Length = 346 Score = 154 bits (388), Expect = 8e-35 Identities = 107/259 (41%), Positives = 138/259 (53%), Gaps = 23/259 (8%) Frame = +1 Query: 241 DPSQLLYPVASSGRGFLARPLHM-------PAAGPSPR------PPYVFPY------LDP 363 DPS L+YP SSGRGF RP+ P PSP P Y + + LDP Sbjct: 93 DPSSLIYPFGSSGRGFPTRPVRQNSNSVADPVGSPSPGGYTPRGPVYGYHHGQFVSNLDP 152 Query: 364 GQGNPGFIRPNHLPHVLLXXXXXXXXXXXXXXVMPGVVKGIPVSSSHHPKAGLPSSSISD 543 F+R H + + P +V VS + + +A +SI D Sbjct: 153 MNQ---FMRAAHPQNQQSPQLGSGHMKGVPHFLQPRLVL---VSENVYVEATPSPTSILD 206 Query: 544 NNGHKDLRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQPQYLDTVKSLPR 723 N+GHK R R DD ++R RKVRI+E ASLYSLCRSWL+NG E + +D + LP+ Sbjct: 207 NSGHKKARSR-DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKR--IDMMTCLPK 263 Query: 724 PLPVAAQVVDSP----DKGAXXXXXXXXXNVENLSAKELLQRHIKRAKKVRSRLREERLQ 891 PLPV P ++ +V++LS +LL+RHI RAKKVR+RLREERL+ Sbjct: 264 PLPVDKTETSLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLK 323 Query: 892 RISRYKNRLALLLPPMVEQ 948 RI+RYK RLALLLPP EQ Sbjct: 324 RIARYKARLALLLPPFGEQ 342