BLASTX nr result
ID: Rehmannia24_contig00004241
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia24_contig00004241 (963 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum ... 245 2e-62 ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251... 244 4e-62 ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum ... 234 4e-59 gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlise... 203 9e-50 emb|CBI32170.3| unnamed protein product [Vitis vinifera] 199 2e-48 gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, ... 187 4e-45 ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citr... 186 9e-45 gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, ... 181 4e-43 ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citr... 179 1e-42 ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211... 177 5e-42 ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|5... 169 1e-39 ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Caps... 162 2e-37 gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] 158 3e-36 ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arab... 158 3e-36 ref|XP_004171685.1| PREDICTED: uncharacterized protein LOC101226... 157 4e-36 ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308... 155 3e-35 ref|NP_180843.2| proline-rich uncharacterized protein [Arabidops... 154 4e-35 ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779... 150 5e-34 ref|XP_006410417.1| hypothetical protein EUTSA_v10016920mg [Eutr... 150 9e-34 gb|AAB91981.1| hypothetical protein [Arabidopsis thaliana] gi|20... 149 1e-33 >ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum tuberosum] Length = 344 Score = 245 bits (626), Expect = 2e-62 Identities = 137/249 (55%), Positives = 172/249 (69%), Gaps = 9/249 (3%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 743 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 94 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPTFGLNQMDPGLGQSTGVRPSHLQHALLG 153 Query: 742 XXXXXXXXXXXXG--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 572 V+PG VKG PV SS H K S+SD NG ++ RDR +DDTFA Sbjct: 154 SSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIASTQPSLSDCNGFREKRDRSKDDTFA 213 Query: 571 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVVDSPDK- 395 IIRDRKVRIS++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 214 IIRDRKVRISDNASLYTLCRSWLRNGLPDDTQS-QYMDGVRSLPRPLALAPQDAESPVKK 272 Query: 394 -GAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMV 218 G E+VE+LS KELLQRH+KRAK++RSRLREERL+RI+RYK RLALLLPPMV Sbjct: 273 EGDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLALLLPPMV 332 Query: 217 EQHFKSESA 191 EQ F+++ A Sbjct: 333 EQQFRNDPA 341 >ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251026 [Solanum lycopersicum] Length = 342 Score = 244 bits (623), Expect = 4e-62 Identities = 136/249 (54%), Positives = 171/249 (68%), Gaps = 9/249 (3%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 743 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 92 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPVFGVNQMDPGSGQSAGVRPSHLQHALLG 151 Query: 742 XXXXXXXXXXXXG--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 572 V+PG VKG PV SS H K S+SD NG +D RDR +D+TFA Sbjct: 152 SSPTVNSAGPAASSGVLPGAVKGFPVVSSSHNKIASTQPSLSDCNGFRDKRDRSKDETFA 211 Query: 571 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVVDSPDK- 395 IIRDRKVRI ++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 212 IIRDRKVRICDNASLYTLCRSWLRNGLPDDTQS-QYMDGVRSLPRPLALAPQDAESPVKK 270 Query: 394 -GAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMV 218 G E+VE+LS KELLQRH+KRAK++RSRLREERL+RI+RYK RLALLLPPMV Sbjct: 271 EGDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLALLLPPMV 330 Query: 217 EQHFKSESA 191 EQ F+++ A Sbjct: 331 EQQFRNDPA 339 >ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum tuberosum] Length = 366 Score = 234 bits (597), Expect = 4e-59 Identities = 136/271 (50%), Positives = 171/271 (63%), Gaps = 31/271 (11%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHMP----AAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLX 743 +LYPVASSGRGFL++P + P + RP + +DPG G +RP+HL H LL Sbjct: 94 ILYPVASSGRGFLSKPSNYPNRPVVSHLGSRPTFGLNQMDPGLGQSTGVRPSHLQHALLG 153 Query: 742 XXXXXXXXXXXXG--VMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFA 572 V+PG VKG PV SS H K S+SD NG ++ RDR +DDTFA Sbjct: 154 SSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIASTQPSLSDCNGFREKRDRSKDDTFA 213 Query: 571 IIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVVDSPDKG 392 IIRDRKVRIS++ASLY+LCRSWL+NG+P++TQ QY+D V+SLPRPL +A Q +SP K Sbjct: 214 IIRDRKVRISDNASLYTLCRSWLRNGLPDDTQS-QYMDGVRSLPRPLALAPQDAESPVKK 272 Query: 391 ------------------------AXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLR 284 E+VE+LS KELLQRH+KRAK++RSRLR Sbjct: 273 EGDKEEEEEDCSFSSMLILKRVNFPIPINFKAGESVEHLSPKELLQRHVKRAKRIRSRLR 332 Query: 283 EERLQRISRYKNRLALLLPPMVEQHFKSESA 191 EERL+RI+RYK RLALLLPPMVEQ F+++ A Sbjct: 333 EERLRRIARYKTRLALLLPPMVEQQFRNDPA 363 >gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlisea aurea] Length = 302 Score = 203 bits (516), Expect = 9e-50 Identities = 120/227 (52%), Positives = 147/227 (64%), Gaps = 4/227 (1%) Frame = -3 Query: 898 VASSGRGFLARPLHMPAAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVLLXXXXXXXXX 719 + SSG G ++RPL A P+ RPPY P L + G RPN+L HV+L Sbjct: 80 IGSSGGGIVSRPLS--AGRPTQRPPYGSPCLL----DQGLARPNNLNHVILGPMRGSSAD 133 Query: 718 XXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHK-DLRDR-RDDTFAIIRDRKVRI 545 G MPGV +GIP +S H K S + D+NGH DLR R RDD A+IRDRKVR+ Sbjct: 134 TSGAGAMPGVAQGIPFPTSSHSKVHPHSILVGDSNGHTTDLRGRHRDDVVALIRDRKVRL 193 Query: 544 SEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVVDSPDK--GAXXXXXX 371 SE+ASLY+LCRSWL+NGVP + QPQY+D VKSLPRP V+ Q DSP+K + Sbjct: 194 SENASLYALCRSWLRNGVPAD-MQPQYVDVVKSLPRPSHVSGQTADSPEKNEASSEVETE 252 Query: 370 XXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLL 230 ++V LS KELLQRHIKRAKK+RS+L E R +RI RYK+RLALLL Sbjct: 253 DEDSVGQLSEKELLQRHIKRAKKIRSKLNERRFKRIDRYKSRLALLL 299 >emb|CBI32170.3| unnamed protein product [Vitis vinifera] Length = 342 Score = 199 bits (505), Expect = 2e-48 Identities = 124/257 (48%), Positives = 153/257 (59%), Gaps = 24/257 (9%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHM------------PAAGPSPRPPYVFPYLDPGQGNP-GF--- 779 +LYPVASSGRGF+ +PL P A PR Q P GF Sbjct: 88 ILYPVASSGRGFIPKPLRPQSSDHNTVTVANPGAAFPPRSAATAAAAFSHQARPFGFPQS 147 Query: 778 -----IRPNHLPHVLLXXXXXXXXXXXXXGVMPGV--VKGIPVSSSHHPKAGLPSSSISD 620 + +PH+L +PG +KGIPVS+ HPK S+SD Sbjct: 148 DLNYPVHSMRMPHLL--------PSHVGVTAVPGSAPIKGIPVSA--HPKVAPSPPSVSD 197 Query: 619 NNGHKDLRDR-RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSL 443 NG+KD RDR RDDTF +RDRKVRIS+ AS+Y+LCRSWL+NG EETQ PQ+ D++KSL Sbjct: 198 CNGYKDSRDRNRDDTFVTVRDRKVRISDGASIYALCRSWLRNGFSEETQ-PQHYDSMKSL 256 Query: 442 PRPLPVAAQVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRI 263 PRPLP+ + P K +VENL ++LLQRHIKRAKKVR+RLRE+RL+RI Sbjct: 257 PRPLPIPVTDPNLPKKKEDDEEEEDEGSVENLLPQDLLQRHIKRAKKVRARLREQRLKRI 316 Query: 262 SRYKNRLALLLPPMVEQ 212 +RYK RLALLLPP VE+ Sbjct: 317 ARYKTRLALLLPPPVER 333 >gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 277 Score = 187 bits (476), Expect = 4e-45 Identities = 116/255 (45%), Positives = 155/255 (60%), Gaps = 15/255 (5%) Frame = -3 Query: 910 LLYPVASSGRGFL-----ARPLHMPAAGPSPRPPYVFPYLDPGQGNPGFIRPN----HLP 758 ++YPVASSGRGFL RPL P P P + + +P +P P+ H P Sbjct: 49 VMYPVASSGRGFLPTNHPCRPLLPYHHHPHPHPHH---FANPRPPSPSLSLPHPTHFHPP 105 Query: 757 HVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDD 581 L S S HPK SS+S+ NG+K++RDR +DD Sbjct: 106 LKAL-------------------------SLSLHPKVAPSPSSLSETNGYKNVRDRTKDD 140 Query: 580 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAA-----Q 416 + +RDRKVRI++ AS+Y+LCRSWL+NG P+ETQQPQY D KSLP+PLP+ + Sbjct: 141 SLVNVRDRKVRITDGASVYALCRSWLRNGFPDETQQPQYGDVSKSLPQPLPIPVTDNLLK 200 Query: 415 VVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLAL 236 + ++ ++VENLSA++LL+RHI RAKKVRSRLR+ERL+RI+RYK RLAL Sbjct: 201 DTEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLAL 260 Query: 235 LLPPMVEQHFKSESA 191 LLPP+VEQ F+S++A Sbjct: 261 LLPPLVEQ-FRSDAA 274 >ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|557541222|gb|ESR52266.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 303 Score = 186 bits (473), Expect = 9e-45 Identities = 113/253 (44%), Positives = 152/253 (60%), Gaps = 20/253 (7%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHMPA--------AGPSPRPPYVFPYLDPGQGNPGF-IRPNHLP 758 ++YPVASSGRGF+ +P+ G PRP + PY P N + +H Sbjct: 43 VVYPVASSGRGFIPKPMRPSDQTVTVANHGGYPPRPNQLPPYPRPHLDNHHHPVLHHHQH 102 Query: 757 HVLLXXXXXXXXXXXXXGVM--PGVVKGIPVSSSHHPKAGLPSSSIS-----DNNGH-KD 602 H ++ + P ++G+PVSS H A S+S+S D+NG+ K Sbjct: 103 HHMIRPPPLNNQQHQHPQISSNPSPIRGVPVSSGHLKVAPSSSASLSPVIPPDSNGYNKH 162 Query: 601 LRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVA 422 LRD D+TF I+RDRKVRI+E ASLY+LCRSWL+NG PEET QPQ+ D VKSLPRPLP+ Sbjct: 163 LRDNSDETFTIVRDRKVRITEGASLYALCRSWLRNGSPEET-QPQHADGVKSLPRPLPMP 221 Query: 421 ---AQVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYK 251 A + + ENV+ LS ++LL+RH++RAK++R+RL ER +RI RYK Sbjct: 222 RADANIAKEKESEEDEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYK 281 Query: 250 NRLALLLPPMVEQ 212 RL+LLLPP+VEQ Sbjct: 282 TRLSLLLPPLVEQ 294 >gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 276 Score = 181 bits (459), Expect = 4e-43 Identities = 115/255 (45%), Positives = 154/255 (60%), Gaps = 15/255 (5%) Frame = -3 Query: 910 LLYPVASSGRGFL-----ARPLHMPAAGPSPRPPYVFPYLDPGQGNPGFIRPN----HLP 758 ++YPVASSGRGFL RPL P P P + + +P +P P+ H P Sbjct: 49 VMYPVASSGRGFLPTNHPCRPLLPYHHHPHPHPHH---FANPRPPSPSLSLPHPTHFHPP 105 Query: 757 HVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDD 581 L S S HPK SS+S+ NG+K++RDR +DD Sbjct: 106 LKAL-------------------------SLSLHPKVAPSPSSLSETNGYKNVRDRTKDD 140 Query: 580 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAA-----Q 416 + +RDRKVRI++ AS+Y+LCRSWL+NG P+ETQ PQY D KSLP+PLP+ + Sbjct: 141 SLVNVRDRKVRITDGASVYALCRSWLRNGFPDETQ-PQYGDVSKSLPQPLPIPVTDNLLK 199 Query: 415 VVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLAL 236 + ++ ++VENLSA++LL+RHI RAKKVRSRLR+ERL+RI+RYK RLAL Sbjct: 200 DTEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLAL 259 Query: 235 LLPPMVEQHFKSESA 191 LLPP+VEQ F+S++A Sbjct: 260 LLPPLVEQ-FRSDAA 273 >ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|568858552|ref|XP_006482814.1| PREDICTED: RNA-binding protein 33-like [Citrus sinensis] gi|557541223|gb|ESR52267.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 297 Score = 179 bits (454), Expect = 1e-42 Identities = 110/252 (43%), Positives = 148/252 (58%), Gaps = 19/252 (7%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLHMPA--------AGPSPRPPYVFPYLDPGQGNPGF-IRPNHLP 758 ++YPVASSGRGF+ +P+ G PRP + PY P N + +H Sbjct: 43 VVYPVASSGRGFIPKPMRPSDQTVTVANHGGYPPRPNQLPPYPRPHLDNHHHPVLHHHQH 102 Query: 757 HVLLXXXXXXXXXXXXXGVM--PGVVKGIPVSSSHHPKAGLPSSSIS-----DNNGHKDL 599 H ++ + P ++G+PVSS H A S+S+S D+NG Sbjct: 103 HHMIRPPPLNNQQHQHPQISSNPSPIRGVPVSSGHLKVAPSSSASLSPVIPPDSNG---- 158 Query: 598 RDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVA- 422 D D+TF I+RDRKVRI+E ASLY+LCRSWL+NG PEETQ PQ+ D VKSLPRPLP+ Sbjct: 159 -DNSDETFTIVRDRKVRITEGASLYALCRSWLRNGSPEETQ-PQHADGVKSLPRPLPMPR 216 Query: 421 --AQVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKN 248 A + + ENV+ LS ++LL+RH++RAK++R+RL ER +RI RYK Sbjct: 217 ADANIAKEKESEEDEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYKT 276 Query: 247 RLALLLPPMVEQ 212 RL+LLLPP+VEQ Sbjct: 277 RLSLLLPPLVEQ 288 >ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211370 [Cucumis sativus] Length = 376 Score = 177 bits (449), Expect = 5e-42 Identities = 116/256 (45%), Positives = 149/256 (58%), Gaps = 23/256 (8%) Frame = -3 Query: 910 LLYPVASSGRGFLARPLH-MPA---------AGPSPRPPYVFPYLDPGQGNPGFIRPNHL 761 +LYPVASSGRGF+ R + +PA G RP FP+ G +P +H Sbjct: 127 ILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIG--SPHLDSMSHP 184 Query: 760 PHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRRDD 581 H+ + G +K P SS PKA P +I ++NG K++R R DD Sbjct: 185 MHMTRPPNLQQQLIPFSGSSISGSIKCAPNSSD--PKA-FPPQTICESNGCKEMRVR-DD 240 Query: 580 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAA------ 419 T ++RDRKVRI++ ASLY+LCRSWL+NG EE+Q PQY +SLPRPLP+A Sbjct: 241 TLCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ-PQYGSFFRSLPRPLPIAVAGAAPL 299 Query: 418 -------QVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRIS 260 + VD DK ++E+LS +ELL+RH++RAKKVRSRLREERLQRI Sbjct: 300 QKKEVVKEEVDEKDKDEG--------SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIE 351 Query: 259 RYKNRLALLLPPMVEQ 212 RYK RLALLLPP +EQ Sbjct: 352 RYKTRLALLLPPPIEQ 367 >ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|566150610|ref|XP_006369465.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550348014|gb|ERP66034.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 340 Score = 169 bits (428), Expect = 1e-39 Identities = 117/274 (42%), Positives = 156/274 (56%), Gaps = 32/274 (11%) Frame = -3 Query: 910 LLYPVASSGRGFLARPL--------------HMPAAGPSPRP--PYVFPYLDPGQGNPGF 779 +LYPVASSGRGF+ RP+ H AG + RP P + +P Sbjct: 77 VLYPVASSGRGFIPRPVRPHQDQTPANQGAYHPRGAGVAYRPHTPTTVVGSPSSRSHPNP 136 Query: 778 IRPNHLPHV-------LLXXXXXXXXXXXXXGVMPGV--------VKGIPVSSSHHPKAG 644 + L H+ L+ V G+ +KGIPV+ + Sbjct: 137 QQLGDLHHLHNVQQQHLMMSRQHPTHLQHHNYVGFGLGVGSVAAPIKGIPVTG----QLK 192 Query: 643 LPSSSISDNNGHKDLRDR-RDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQ 467 + S +SD+NG+K+LRDR RDD ++RDRKVRIS+ A LY+LCRSWL+NG PEE++ Sbjct: 193 VAPSPVSDSNGYKNLRDRSRDDNLMVVRDRKVRISDGAPLYALCRSWLRNGFPEESEV-H 251 Query: 466 YLDTVKSLPRPLPVAAQVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRL 287 Y D+VK LPRPL + + +K V+NLSA ELL+RHIK AKKVR+RL Sbjct: 252 YGDSVKPLPRPLLPKEESEEEVEKEKKDEEP-----VDNLSAAELLKRHIKHAKKVRARL 306 Query: 286 REERLQRISRYKNRLALLLPPMVEQHFKSESADE 185 REERL+RI+RYK+RLALLLPP VEQ F++++ E Sbjct: 307 REERLKRIARYKSRLALLLPPQVEQ-FRNDTPAE 339 >ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] gi|482563243|gb|EOA27433.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] Length = 339 Score = 162 bits (409), Expect = 2e-37 Identities = 109/249 (43%), Positives = 137/249 (55%), Gaps = 14/249 (5%) Frame = -3 Query: 916 SQLLYPVASSGRGFLARPLHM-------PAAGPS---PRPPYVFPYLDPGQGNPGFIRPN 767 S L+YP SSGRGF RP P A P PRP Y + + G + Sbjct: 100 STLIYPFGSSGRGFPTRPARQNSNSVADPVASPGGHPPRPVYAYHHGQFG---------S 150 Query: 766 HLPHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRR 587 +L + + PG +KG+P P+A +SI DN GHK R R Sbjct: 151 NLDPMFQFMRAAHPQNQQSPQLGPGHMKGVP--HFLQPRATPSPTSILDNVGHKKARSR- 207 Query: 586 DDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPV----AA 419 DD ++R RKVRI+E ASLYSLCRSWL+NG E Q PQ DT+ LP+PLPV + Sbjct: 208 DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIQ-PQRSDTLTCLPKPLPVDMTETS 266 Query: 418 QVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLA 239 DS ++ E+V+ LS +LL+RH+ RAKKVRSRLRE+RL+RI+RYK RLA Sbjct: 267 LPKDSVEEPNPEEDKEDEESVKELSTSDLLKRHVDRAKKVRSRLREDRLKRIARYKARLA 326 Query: 238 LLLPPMVEQ 212 LLLPP EQ Sbjct: 327 LLLPPFGEQ 335 >gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] Length = 454 Score = 158 bits (399), Expect = 3e-36 Identities = 102/232 (43%), Positives = 134/232 (57%), Gaps = 11/232 (4%) Frame = -3 Query: 904 YPVASSGRGFLARPLHM---PAAGPSPRPPYVFPYLDPGQGNPGFIRPNHLPHVL---LX 743 YPV SSGRGF++ P PAAG P NP RP + + + Sbjct: 82 YPVVSSGRGFISLPKSSSSSPAAGADQTVTVASP-------NPSGYRPRPAANYVVRPIQ 134 Query: 742 XXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDR-RDDTFAII 566 ++ G VKG+PVS PK PS S+ D NG+KD+RD+ RDD+ I+ Sbjct: 135 HIHHYHHHQQQPHLVAGPVKGVPVSIQLQPKVP-PSPSVPDCNGYKDMRDKVRDDSLTIV 193 Query: 565 RDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVVDSPDKGA- 389 RDRKVRI+EDASLY+LC+SWL+NG EE+Q+ QY D V SLPRPLP+ + K Sbjct: 194 RDRKVRITEDASLYALCQSWLRNGFSEESQK-QYGDAVMSLPRPLPIPMATNNEQKKEGE 252 Query: 388 ---XXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRL 242 E+V+NLSA++L +RH+KRAKKVR+RLRE R +RI+R + L Sbjct: 253 EDDNDGDEEDEESVKNLSAEDLFKRHLKRAKKVRARLREVRQKRIARVVSAL 304 >ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] gi|297327086|gb|EFH57506.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] Length = 334 Score = 158 bits (399), Expect = 3e-36 Identities = 110/250 (44%), Positives = 137/250 (54%), Gaps = 15/250 (6%) Frame = -3 Query: 916 SQLLYPVASSGRGFLARPLHM-------PAAGPSPRPPY-VFPYLDPGQGNPGFIRPNHL 761 S L+YP SSGRGF RP P P PP V+ Y GQ N Sbjct: 93 SSLIYPFGSSGRGFPTRPGRQNSNSVADPVGSPGGYPPRPVYGYHQHGQ-----FGSNLD 147 Query: 760 PHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRRDD 581 P + + G +KG+P P+ +SI DN+GHK R R DD Sbjct: 148 PVLQQLMRAAHLQNQQSPQLGSGHMKGVP--HFLQPRVTPSPTSILDNSGHKKARSR-DD 204 Query: 580 TFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPV-------A 422 ++R RKVRI+E ASLYSLCRSWL+NG E + PQ DT+ LP+PLPV Sbjct: 205 ALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIK-PQRSDTMTCLPKPLPVDMTETSLP 263 Query: 421 AQVVDSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRL 242 +VV+ P++ E+V++LS +LL+RHI RAKKVRSRLREERL+RI+RYK RL Sbjct: 264 KEVVEEPNR---EEDKEDEESVKHLSESDLLKRHIDRAKKVRSRLREERLKRIARYKARL 320 Query: 241 ALLLPPMVEQ 212 ALLLPP EQ Sbjct: 321 ALLLPPFGEQ 330 >ref|XP_004171685.1| PREDICTED: uncharacterized protein LOC101226490 [Cucumis sativus] Length = 196 Score = 157 bits (398), Expect = 4e-36 Identities = 93/174 (53%), Positives = 117/174 (67%), Gaps = 13/174 (7%) Frame = -3 Query: 694 GVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRRDDTFAIIRDRKVRISEDASLYSLC 515 G +K P SS PKA P +I ++NG K++R R DDT ++RDRKVRI++ ASLY+LC Sbjct: 27 GSIKCAPNSSD--PKA-FPPQTICESNGCKEMRVR-DDTLCVVRDRKVRITDGASLYALC 82 Query: 514 RSWLKNGVPEETQQPQYLDTVKSLPRPLPVAA-------------QVVDSPDKGAXXXXX 374 RSWL+NG EE+Q PQY +SLPRPLP+A + VD DK Sbjct: 83 RSWLRNGSQEESQ-PQYGSFFRSLPRPLPIAVAGAAPLQKKEVVKEEVDEKDKDEG---- 137 Query: 373 XXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMVEQ 212 ++E+LS +ELL+RH++RAKKVRSRLREERLQRI RYK RLALLLPP +EQ Sbjct: 138 ----SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQ 187 >ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308794 [Fragaria vesca subsp. vesca] Length = 254 Score = 155 bits (391), Expect = 3e-35 Identities = 100/239 (41%), Positives = 144/239 (60%), Gaps = 13/239 (5%) Frame = -3 Query: 868 RPLHMPAAGPSPRPP-YVFPYLDPGQGNPGFI-----RPNHLPHVLLXXXXXXXXXXXXX 707 RPL A P+P PP +++ Q +PG + RP + P + Sbjct: 17 RPLRPIAPAPTPPPPAHMYTVPMRAQSSPGALVYPSARPPYPPPLNFHPHPHPYPPHLHP 76 Query: 706 GVMPGVVKGI---PVSSSHHPKAGLPSSSISDNNGHKDLRDRRDDTFAIIRDRKVRISED 536 P + + P+ P SS+ D+NG +D RDDT +I+DRKVRI++ Sbjct: 77 SPPPPAYQSLLPPPIKDLRFSGLVAPPSSVPDSNGIRD--KGRDDTQFLIQDRKVRITDG 134 Query: 535 ASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRP--LPVAAQVVDSPDKG--AXXXXXXX 368 ASLY LCRSWL+NG EE+Q P+Y D +SLP+P +P+A+ + + D+G Sbjct: 135 ASLYVLCRSWLRNGTSEESQ-PRYGDATRSLPKPSPIPMASAIPPNKDEGDKKEDNEDKV 193 Query: 367 XENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLALLLPPMVEQHFKSESA 191 E+VE++S ++LL+RHIKRA+KVR+RLREERL+RI+RYK+RLALLLPP+VEQ F+++ A Sbjct: 194 EESVEHVSPEDLLKRHIKRARKVRARLREERLRRIARYKSRLALLLPPLVEQ-FRNDLA 251 >ref|NP_180843.2| proline-rich uncharacterized protein [Arabidopsis thaliana] gi|26450185|dbj|BAC42211.1| unknown protein [Arabidopsis thaliana] gi|28827576|gb|AAO50632.1| unknown protein [Arabidopsis thaliana] gi|330253655|gb|AEC08749.1| proline-rich uncharacterized protein [Arabidopsis thaliana] Length = 337 Score = 154 bits (390), Expect = 4e-35 Identities = 109/258 (42%), Positives = 139/258 (53%), Gaps = 23/258 (8%) Frame = -3 Query: 916 SQLLYPVASSGRGFLARPLHM-------PAAGPSPR------PPYVFPY------LDPGQ 794 S L+YP SSGRGF RP+ P PSP P Y + + LDP Sbjct: 95 SSLIYPFGSSGRGFPTRPVRQNSNSVADPVGSPSPGGYTPRGPVYGYHHGQFVSNLDPMN 154 Query: 793 GNPGFIRPNHLPHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNN 614 F+R H + + G +KG+P P+A +SI DN+ Sbjct: 155 Q---FMRAAHPQN------------QQSPQLGSGHMKGVP--HFLQPRATPSPTSILDNS 197 Query: 613 GHKDLRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRP 434 GHK R R DD ++R RKVRI+E ASLYSLCRSWL+NG E + PQ +D + LP+P Sbjct: 198 GHKKARSR-DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIK-PQRIDMMTCLPKP 255 Query: 433 LPVAAQVVDSP----DKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQR 266 LPV P ++ E+V++LS +LL+RHI RAKKVR+RLREERL+R Sbjct: 256 LPVDKTETSLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLKR 315 Query: 265 ISRYKNRLALLLPPMVEQ 212 I+RYK RLALLLPP EQ Sbjct: 316 IARYKARLALLLPPFGEQ 333 >ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779268 isoform X1 [Glycine max] Length = 274 Score = 150 bits (380), Expect = 5e-34 Identities = 81/154 (52%), Positives = 112/154 (72%), Gaps = 7/154 (4%) Frame = -3 Query: 652 KAGLPSSSISDNNGHKDLRDRR---DDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEE 482 K S+++D NG KD R +DTF ++RDRKVR+++DASLY+LCRSWL+NG+ EE Sbjct: 113 KKAAARSAVADVNGGKDTNTREKSSEDTFIVVRDRKVRVTDDASLYALCRSWLRNGINEE 172 Query: 481 TQQPQYLDTVKSLPRPLP---VAAQVVDSP-DKGAXXXXXXXXENVENLSAKELLQRHIK 314 +Q PQ D +K+LP+PLP VA+ + + D+ ++VE+LS ++LL+RHIK Sbjct: 173 SQ-PQQKDVIKALPKPLPASMVASYLSNKKEDEKDEDEKEENEQSVEHLSPQDLLKRHIK 231 Query: 313 RAKKVRSRLREERLQRISRYKNRLALLLPPMVEQ 212 RAK VR+RLREERLQRI+RY++RL LLLPP +EQ Sbjct: 232 RAKNVRARLREERLQRITRYRSRLRLLLPPAIEQ 265 >ref|XP_006410417.1| hypothetical protein EUTSA_v10016920mg [Eutrema salsugineum] gi|557111586|gb|ESQ51870.1| hypothetical protein EUTSA_v10016920mg [Eutrema salsugineum] Length = 328 Score = 150 bits (378), Expect = 9e-34 Identities = 102/249 (40%), Positives = 136/249 (54%), Gaps = 14/249 (5%) Frame = -3 Query: 916 SQLLYPVASSGRGFLARPLHMPAA----------GPSPRPPYVFPYLDPGQGNPGFIRPN 767 S L+YP SSGRG RP ++ G PRP YV+ + GQ + Sbjct: 90 SGLVYPYPSSGRGLPTRPGRQNSSSVADPMGSPGGYPPRPAYVYHH---GQSR------S 140 Query: 766 HLPHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNNGHKDLRDRR 587 +L ++ + G + G+P P+ P +SI DN+G K+ R R Sbjct: 141 NLDPMIQFMRTAHPQIQQSPHLGSGYMIGVP--HFLQPRVAYPPTSILDNSGRKNARSR- 197 Query: 586 DDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRPLPVAAQVV- 410 D+ ++R RKVRI+E ASLYSLCRSWL+NG E QQ DTV LP+PLPV Sbjct: 198 DEVLVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIQQRS--DTVTYLPKPLPVDMMETS 255 Query: 409 ---DSPDKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQRISRYKNRLA 239 +S ++ E+V+ LS +LL+RH+ RAKKVR+RLRE+RL+RI+RYK RLA Sbjct: 256 LSRESVEEAHREEDNEDEESVKQLSDSDLLKRHVDRAKKVRARLREDRLKRIARYKARLA 315 Query: 238 LLLPPMVEQ 212 LLLPP EQ Sbjct: 316 LLLPPFGEQ 324 >gb|AAB91981.1| hypothetical protein [Arabidopsis thaliana] gi|20197061|gb|AAM14901.1| hypothetical protein [Arabidopsis thaliana] Length = 346 Score = 149 bits (377), Expect = 1e-33 Identities = 106/258 (41%), Positives = 138/258 (53%), Gaps = 23/258 (8%) Frame = -3 Query: 916 SQLLYPVASSGRGFLARPLHM-------PAAGPSPR------PPYVFPY------LDPGQ 794 S L+YP SSGRGF RP+ P PSP P Y + + LDP Sbjct: 95 SSLIYPFGSSGRGFPTRPVRQNSNSVADPVGSPSPGGYTPRGPVYGYHHGQFVSNLDPMN 154 Query: 793 GNPGFIRPNHLPHVLLXXXXXXXXXXXXXGVMPGVVKGIPVSSSHHPKAGLPSSSISDNN 614 F+R H + + P +V VS + + +A +SI DN+ Sbjct: 155 Q---FMRAAHPQNQQSPQLGSGHMKGVPHFLQPRLVL---VSENVYVEATPSPTSILDNS 208 Query: 613 GHKDLRDRRDDTFAIIRDRKVRISEDASLYSLCRSWLKNGVPEETQQPQYLDTVKSLPRP 434 GHK R R DD ++R RKVRI+E ASLYSLCRSWL+NG E ++ +D + LP+P Sbjct: 209 GHKKARSR-DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKR---IDMMTCLPKP 264 Query: 433 LPVAAQVVDSP----DKGAXXXXXXXXENVENLSAKELLQRHIKRAKKVRSRLREERLQR 266 LPV P ++ E+V++LS +LL+RHI RAKKVR+RLREERL+R Sbjct: 265 LPVDKTETSLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLKR 324 Query: 265 ISRYKNRLALLLPPMVEQ 212 I+RYK RLALLLPP EQ Sbjct: 325 IARYKARLALLLPPFGEQ 342