BLASTX nr result
ID: Jatropha_contig00039274
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00039274 (615 letters)

Database: NCBI-nr (updated 2014/02/11)
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus t...   201   1e-49
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...   196   5e-48
gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus cl...   192   6e-47
gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus pe...   163   4e-38
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...   156   4e-36
ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820...   154   2e-35
ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809...   154   2e-35
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...   152   7e-35
emb|CBI26022.3| unnamed protein product [Vitis vinifera]              152   7e-35
emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]   149   4e-34
gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus...   148   1e-33
gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema s...   136   4e-30
ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798...   131   1e-28
ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arab...   129   8e-28
ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Caps...   128   1e-27
ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511...   127   2e-27

>gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa]
          Length = 547

 Score = 201 bits (512), Expect = 1e-49
 Identities = 126/205 (61%), Positives = 152/205 (74%), Gaps = 12/205 (5%)
 Frame = +1

Query: 37  HSTTPSRFRLNSKTKEAA----NGSL--SPAQEARAKSVPPGVKKDSKLKRSLM-LNKQK 195
           HSTTPSR R+N KT + A NGS SPA + RAKSVPP VKKD+K+++SL+ NK K
Sbjct: 4   HSTTPSRHRVNFKTPKPAEVANNGSPVPSPANKTRAKSVPPDVKKDTKVRKSLVGNNKPK 63

Query: 196 SGEQLLGSQNGVKVVARSVNRPVVEQFAKPRR----LPQLDSSARRKEEGDKE-LQERLQ 360
           SGE ++GSQ+ V VV RSVNRP EQFA+PRR L +++S R +EE K+ L E+L+
Sbjct: 64  SGELVVGSQD-VTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYKKGLHEKLE 122

Query: 361 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 540
           L+E LI DLQS+VLALK ELDKA N ELELQNKKL +DLAAAEAKV+A + R Q S G
Sbjct: 123 LSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSALNTRHQ-SVG 181

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + + P FKDIQKLIA KLENS VKK
Sbjct: 182 EHQRPRFKDIQKLIAIKLENSPVKK 206

>ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis]
 gi|223541653|gb|EEF43202.1| conserved hypothetical protein [Ricinus communis]
          Length = 532

 Score = 196 bits (497), Expect = 5e-48
 Identities = 125/202 (61%), Positives = 148/202 (73%), Gaps = 7/202 (3%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLN-KQKSGEQ 207
           MS TTPSRFRLNSK + PA++ RA+SVPP KKD+KL+RS+++N K KS ++
Sbjct: 1   MSQPTTPSRFRLNSKAPKPE----PPAKKERAQSVPPDFKKDTKLRRSVLVNTKPKSRDE 56

Query: 208 LLGSQNGVKVVAR---SVNRPVVEQFAKPRRLPQLDSSARRKEEGDK-ELQERLQLNEIL 375
           LLGSQ V V SVNRPV EQF+KPR SAR+ EE K EL ER++LN+ L
Sbjct: 57  LLGSQMEVARVVSPSLSVNRPVHEQFSKPRT----QRSARKIEEDTKKELLERIELNDNL 112

Query: 376 IKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--HIRDQESNGKQK 549
           I+DL+SQVL+LKAELDKAQS N+ELE QNKKL QDLA+AEAKVAA + ES G +
Sbjct: 113 IQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNNTPLPESIGGYQ 172

Query: 550 SPIFKDIQKLIANKLENSTVKK 615
           SP FKDIQKLIANKLENSTVKK
Sbjct: 173 SPKFKDIQKLIANKLENSTVKK 194

>gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus clementina]
          Length = 561

 Score = 192 bits (488), Expect = 6e-47
 Identities = 122/218 (55%), Positives = 153/218 (70%), Gaps = 20/218 (9%)
 Frame = +1

Query: 16  AKPTAMSHST---TPSRFRLNSKTKEAA------NG-SLSPAQEARAKSVPPGVKKD--S 159
           +K MSHST T SR R NSKT+E+ NG SLSP +ARAKSVPP VK + S
Sbjct: 8   SKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTNNIS 67

Query: 160 KLKRSLMLNKQKSGEQLLGSQNG--VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG 333
           K +R+L+LNK KS E +GS VKV RS+NRPVVEQFA+PRR +D++ + E+G
Sbjct: 68  KSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKIEDG 127

Query: 334 -----DKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEA 498
           KE +E+L+L+E L+KDLQS+V ALKAE KAQS N ELE QNKKL +DL AAEA
Sbjct: 128 LMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEA 187

Query: 499 KVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLENSTV 609
           K+A+ R+Q E+ G+ +SP FKD+QKLIANKLE+S V
Sbjct: 188 KIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIV 225

>gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica]
          Length = 552

 Score = 163 bits (412), Expect = 4e-38
 Identities = 103/205 (50%), Positives = 131/205 (63%), Gaps = 10/205 (4%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKSGEQL 210
           MS T PS R ++ +K + S P+ RAKS+ +RSL+LNK KSGE +
Sbjct: 20  MSQPTPPSYLRASASSKAKESPSPRPS---RAKSI----------RRSLLLNKPKSGELV 66

Query: 211 LGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG----DKELQERLQL 363
           LGSQ K V R NR V EQFA+PR D +++R EE ++ELQERL +
Sbjct: 67  LGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNRELQERLDM 126

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
           +E L + Q++VLALKAELDKAQ N EL+ QNK L + LAAAEAK+AAF R+Q E+NG
Sbjct: 127 SESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTREQRETNG 186

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKD+QKLIANKLE VKK
Sbjct: 187 EYQSPKFKDLQKLIANKLERPVVKK 211

>ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca subsp. vesca]
          Length = 560

 Score = 156 bits (395), Expect = 4e-36
 Identities = 105/208 (50%), Positives = 135/208 (64%), Gaps = 15/208 (7%)
 Frame = +1

Query: 37  HSTTPSRFRLNSKTKEAANGSLSPAQE-ARAKSVPPGVKKDS---KLKRSLMLNKQKSGE 204
           HST S+ R +SK KE S SP Q +RAKSV P V S ++R+L+ NK KSGE
Sbjct: 15  HSTNMSQLRASSKAKE----SQSPTQRPSRAKSVTPDVNHSSDSRSVRRALLQNKPKSGE 70

Query: 205 QLLGSQNG-----VKVVARSVNRPVVEQFAKPRRL-PQLDSSARRKEEGD----KELQER 354
           +LGSQ KVV S VVEQFAKPRR P ++++ +R E+ KE+QE+
Sbjct: 71  LVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPVVEANCKRNEDDPHRNMKEMQEK 130

Query: 355 LQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-E 531
           ++++E +I LQ++VL LK ELDK N EL+ +NKKL+++L AAEAK+AA Q E
Sbjct: 131 IEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKKLSENLTAAEAKIAALTTPQQRE 190

Query: 532 SNGKQKSPIFKDIQKLIANKLENSTVKK 615
           SNG Q SP FKD+QKLIANKLE S VKK
Sbjct: 191 SNGYQ-SPKFKDLQKLIANKLECSVVKK 217

>ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820086 [Glycine max]
          Length = 576

 Score = 154 bits (388), Expect = 2e-35
 Identities = 101/212 (47%), Positives = 135/212 (63%), Gaps = 17/212 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNK 189
           + +S TPSR RL SK +E N + RAKSV P +K +S++K+ L+LNK
Sbjct: 27  IQNSLTPSRLRLPSKYREPPKTPPEVVNNGMVSTPLRRAKSVTPELKHNSRIKKGLVLNK 86

Query: 190 QKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----K 339
           K E++LG+ Q G KVV+R V VEQF++PR + R KE+ D K
Sbjct: 87  AKPNEEVLGTTQRGREVEEAKVVSRFVRPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKK 146

Query: 340 ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI 519
           EL E+L+ +E LIK+LQS+VLALKAEL+K + N ELE N+KL +DLAAAEAKV +
Sbjct: 147 ELMEKLEASESLIKNLQSEVLALKAELEKVKGLNVELESNNRKLTEDLAAAEAKVVSLS- 205

Query: 520 RDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           +++ NG+ +SP FK IQKLIA+KLE S VKK
Sbjct: 206 GNEKPNGEHQSPKFKLIQKLIADKLERSIVKK 237

>ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809254 [Glycine max]
          Length = 562

 Score = 154 bits (388), Expect = 2e-35
 Identities = 101/213 (47%), Positives = 133/213 (62%), Gaps = 18/213 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE--------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLN 186
           + +S TPSR RL SK +E N + RAKSV P +K +S++KR L+LN
Sbjct: 10  LQNSLTPSRLRLPSKYREPPKTPPEVVVNNVVVSTPSRRAKSVTPELKHNSRIKRGLVLN 69

Query: 187 KQKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD---- 336
           K K E+++G+ Q G KVVAR V VVEQFA+PR + R KE+ D
Sbjct: 70  KAKPNEEVVGTTQRGREAEETKVVARFVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSK 129

Query: 337 KELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFH 516
           KEL E+L+ +E LIK+LQS+V ALKAEL+K + ELE N+KL +DLAAAE KV +
Sbjct: 130 KELMEKLEASESLIKNLQSEVQALKAELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLG 189

Query: 517 IRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           +++ NG+ +SP FK IQKLIA+KLE S VKK
Sbjct: 190 -GNEKPNGEHQSPKFKHIQKLIADKLERSIVKK 221

>ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera]
          Length = 551

 Score = 152 bits (384), Expect = 7e-35
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S ++ + S SPA RA+S P + K +RSL+LNK KS
Sbjct: 19  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 78

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+ LGSQ VKV+ RS NRPVV+Q A P+ S ++ KELQE+L L
Sbjct: 79  GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 133

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
           + LI +LQS+VL LKAELDKAQS N EL+ N KL +DLAAA AK+ A R Q ES
Sbjct: 134 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 193

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIANKLE+ +K+
Sbjct: 194 EYQSPKFKDIQKLIANKLEHPKIKQ 218

>emb|CBI26022.3| unnamed protein product [Vitis vinifera]
          Length = 572

 Score = 152 bits (384), Expect = 7e-35
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S ++ + S SPA RA+S P + K +RSL+LNK KS
Sbjct: 40  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 99

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+ LGSQ VKV+ RS NRPVV+Q A P+ S ++ KELQE+L L
Sbjct: 100 GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 154

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
           + LI +LQS+VL LKAELDKAQS N EL+ N KL +DLAAA AK+ A R Q ES
Sbjct: 155 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 214

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIANKLE+ +K+
Sbjct: 215 EYQSPKFKDIQKLIANKLEHPKIKQ 239

>emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]
          Length = 348

 Score = 149 bits (377), Expect = 4e-34
 Identities = 97/205 (47%), Positives = 127/205 (61%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S ++ + S SPA RA+S P + K +RSL+LNK KS
Sbjct: 40  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 99

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+ LGSQ VKV+ RS NRPVV+Q A P+ S ++ KELQE+L L
Sbjct: 100 GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 154

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
           + LI +LQS+VL LKAELDKAQS N EL+ N KL +DLAAA AK+ A R Q ES
Sbjct: 155 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 214

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIA KLE+ +K+
Sbjct: 215 EYQSPKFKDIQKLIAXKLEHPKIKQ 239

>gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
 gi|561011661|gb|ESW10568.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
          Length = 584

 Score = 148 bits (373), Expect = 1e-33
 Identities = 99/213 (46%), Positives = 131/213 (61%), Gaps = 18/213 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNK 189
           + S TPSR RL SK +E NG +S RAKSV P +K S++KR L+LNK
Sbjct: 28  LQSSLTPSRLRLPSKYREPPRTPPEVVNGVVSTPTR-RAKSVTPELKHASRIKRGLVLNK 86

Query: 190 QKSGEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----KE 342
           K E+++G+ G K V R + VEQFA PR + R KEE D KE
Sbjct: 87  AKPNEEVVGTHRGREAVEPKAVPRFMRPHAVEQFASPRSAVGDFAMKRDKEEPDGKSKKE 146

Query: 343 LQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--H 516
           L E+L+++E LI++LQS+VLALKAEL+K + N ELE N+KL +D+AAAE+KV +
Sbjct: 147 LMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVELESHNRKLTKDIAAAESKVMSLGGS 206

Query: 517 IRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           + +E G+ +SP FK IQKLIA+KLE S VKK
Sbjct: 207 EKMKEPIGEHQSPKFKHIQKLIADKLERSRVKK 239

>gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5, partial [Theobroma cacao]
          Length = 458

 Score = 141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q +VV VV+QFA+PRRL +++ +K E +
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
           EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A RD +ESNG +S FKDIQ+ IANKLE+ + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224

>gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 4 [Theobroma cacao]
          Length = 565

 Score = 141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q +VV VV+QFA+PRRL +++ +K E +
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
           EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A RD +ESNG +S FKDIQ+ IANKLE+ + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224

>gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao]
          Length = 561

 Score = 141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q +VV VV+QFA+PRRL +++ +K E +
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
           EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A RD +ESNG +S FKDIQ+ IANKLE+ + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224

>gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao]
          Length = 564

 Score = 141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q +VV VV+QFA+PRRL +++ +K E +
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
           EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A RD +ESNG +S FKDIQ+ IANKLE+ + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224

>gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum]
          Length = 554

 Score = 136 bits (343), Expect = 4e-30
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 14/206 (6%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDS-----KLKRSLMLNKQKSGE 204
           STTPSR R AAN S RA+ G K S K +RS++L + KSGE
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQD-DNGKPKSSGHDPGKNRRSILLKRAKSGE 59

Query: 205 Q---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDSSARRKEEGDK-----ELQERL 357
           + +L Q ARSVNRP VVEQF PRR S EE +K EL+E+L
Sbjct: 60  EETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEEMAAEEDEKRKKMEELEEKL 114

Query: 358 QLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESN 537
           NE LIKDLQ+QV +LKAEL++A+SSN ELEL+N+KL+QDL +AEAK+++ D+ +
Sbjct: 115 VANESLIKDLQAQVSSLKAELEEARSSNAELELKNRKLSQDLVSAEAKISSLSSNDKPAK 174

Query: 538 GKQKSPIFKDIQKLIANKLENSTVKK 615
           Q + FKDIQK+IA+KLE S VKK
Sbjct: 175 EHQNTR-FKDIQKIIASKLEQSKVKK 199

>ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798183 [Glycine max]
          Length = 565

 Score = 131 bits (330), Expect = 1e-28
 Identities = 89/196 (45%), Positives = 123/196 (62%), Gaps = 7/196 (3%)
 Frame = +1

Query: 49  PSRFRLNSKTKEAANGSLS--PAQEARAKSVPPGVKKDSKLKRSLMLNKQKSGEQLLGSQ 222
           P R R +SK ++ ++ RA+SVPP +K S+ KR +++NK K E++LGSQ
Sbjct: 33  PPRLRASSKAPKSPPEVVNRESISSTRAESVPPDLKNVSRAKRGVVVNKPKLNEEVLGSQ 92

Query: 223 NG----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQ 390
           + +VAR R V F R+ DS ++K+E LQE+L+++E LIK LQ
Sbjct: 93  KAEEGKIVIVARPRRR--VGDFGS-RKSEDDDSHGKKKKE---LLQEKLEVSENLIKSLQ 146

Query: 391 SQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI-RDQESNGKQKSPIFKD 567
           S+VLAL+ ELD+ +S N ELE QN KL Q+LAAAEAK++ I + + G+ +SP FKD
Sbjct: 147 SEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKPIGEHRSPKFKD 206

Query: 568 IQKLIANKLENSTVKK 615
           IQKLIA KLE S VKK
Sbjct: 207 IQKLIAEKLERSRVKK 222

>ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata]
 gi|297337263|gb|EFH67680.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score = 129 bits (323), Expect = 8e-28
 Identities = 98/218 (44%), Positives = 128/218 (58%), Gaps = 26/218 (11%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPGVKKD-SKLKRSLML 183
           STTPSR R AAN S + RA KS VK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGHDVKNDPAKNRRSILL 60

Query: 184 NKQKSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARR 321
           + K GE+ +L Q ARSVNRP VVEQF PRR + +
Sbjct: 61  KRAKYGEEETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKTEESVMATAVVAEDE 115

Query: 322 KEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAK 501
           K + +EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL+NKKL+QDLA+AEAK
Sbjct: 116 KRKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNAELELKNKKLSQDLASAEAK 175

Query: 502 VAAFHIRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           +++ D+ + Q + FKDIQ+LIA+KLE S VKK
Sbjct: 176 ISSLSSNDKPAKEHQNTR-FKDIQRLIASKLEQSKVKK 212

>ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Capsella rubella]
 gi|482576206|gb|EOA40393.1| hypothetical protein CARUB_v10009119mg [Capsella rubella]
          Length = 450

 Score = 128 bits (322), Expect = 1e-27
 Identities = 98/217 (45%), Positives = 128/217 (58%), Gaps = 25/217 (11%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPGVKKD-SKLKRSLML 183
           STTPSR R AAN S RA K VK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGLAGGKPKHSNHDVKNDPAKNRRSILL 60

Query: 184 NKQKSGEQLLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARRKEE 330
           K KSG++ + V ARSVNRP VVEQF PRR + +++A E+
Sbjct: 61  RKAKSGDEETTAVL-VPQRARSVNRPAVVEQFGCPRRPISRKIEETVMSTAEAAAEEDEK 119

Query: 331 GDK--ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKV 504
           + EL+E+L +NE LIKDLQ QVL LK EL++A+SSN ELEL N+KL+QDLA+AEAK+
Sbjct: 120 RKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARSSNVELELNNRKLSQDLASAEAKI 179

Query: 505 AAFHIRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           ++ D+ + Q S FKDIQ+LIA+KLE S V+K
Sbjct: 180 SSLSSNDKPAKEHQNSR-FKDIQRLIASKLEQSKVRK 215

>ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511271 [Cicer arietinum]
          Length = 933

 Score = 127 bits (319), Expect = 2e-27
 Identities = 91/212 (42%), Positives = 132/212 (62%), Gaps = 16/212 (7%)
 Frame = +1

Query: 28  AMSHSTTPSRFRL---NSKTKEA-------ANGSLSPAQEARAKSVPPGVKKDSKLKRSL 177
           ++ +TT +R R+ +SK KE+ N + + RAKSVPP +K +SK KR +
Sbjct: 67  SIPQTTTTTRIRVVKGSSKIKESPKTPPEIVNNNRASISSTRAKSVPPDLKNNSKAKRGI 126

Query: 178 MLNKQ--KSGEQL-LGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQ 348
           ++ + KS E++ SQ G K + + VV +PRR D +++ KE+
Sbjct: 127 VVMNKLVKSNEEVECSSQKGTKEAEEA--KIVV---VRPRRRRTNDDPDEKEK---KEMV 178

Query: 349 ERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF---HI 519
           E+L++++ LIK+L+S+V ALKAELDK ++ N ELE QN KL Q+LAAAEAK+AA +
Sbjct: 179 EKLEMSDNLIKNLESEVKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNS 238

Query: 520 RDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           R +E G+ +SP FKDIQKLIA+KLE S VKK
Sbjct: 239 RKKELIGEHQSPKFKDIQKLIADKLEMSKVKK 270