BLASTX nr result
ID: Jatropha_contig00016832
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00016832
         (691 letters)

Database: NCBI-nr (updated 2014/02/11)
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus t...  211   2e-52
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...  204   2e-50
gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus cl...  195   1e-47
gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus pe...  166   6e-39
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...  164   2e-38
emb|CBI26022.3| unnamed protein product [Vitis vinifera]             158   2e-36
ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809...  157   2e-36
ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820...  156   5e-36
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...  155   8e-36
emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]  155   1e-35
gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus...  151   2e-34
gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, ...  144   3e-32
gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, ...  144   3e-32
gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, ...  144   3e-32
gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, ...  144   3e-32
gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema s...  135   9e-30
ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798...  133   4e-29
ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Caps...
131 2e-28 ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511... 130 4e-28 ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arab... 130 4e-28 >gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa] Length = 547 Score = 211 bits (537), Expect = 2e-52 Identities = 130/212 (61%), Positives = 158/212 (74%), Gaps = 12/212 (5%) Frame = +1 Query: 88 HSTTPSRFRLNSKTKEAA----NGSL--SPAQEARAKSVPPDVKKDSKLKRSLM-LNKQR 246 HSTTPSR R+N KT + A NGS SPA + RAKSVPPDVKKD+K+++SL+ NK + Sbjct: 4 HSTTPSRHRVNFKTPKPAEVANNGSPVPSPANKTRAKSVPPDVKKDTKVRKSLVGNNKPK 63 Query: 247 SGEQLLGSQNGVKVVARSVNRPVVEQFAKPRR----LPQLDSSARRKEEGDKE-LQERLQ 411 SGE ++GSQ+ V VV RSVNRP EQFA+PRR L +++S R +EE K+ L E+L+ Sbjct: 64 SGELVVGSQD-VTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYKKGLHEKLE 122 Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591 L+E LI DLQS+VLALK ELDKA N ELELQNKKL +DLAAAEAKV+A + R Q S G Sbjct: 123 LSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSALNTRHQ-SVG 181 Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAINGPT 687 + + P FKDIQKLIA KLE VKKEAINGP+ Sbjct: 182 EHQRPRFKDIQKLIAIKLENSPVKKEAINGPS 213 >ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis] gi|223541653|gb|EEF43202.1| conserved hypothetical protein [Ricinus communis] Length = 532 Score = 204 bits (520), Expect = 2e-50 Identities = 128/209 (61%), Positives = 154/209 (73%), Gaps = 7/209 (3%) Frame = +1 Query: 82 MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN-KQRSGEQ 258 MS TTPSRFRLNSK + PA++ RA+SVPPD KKD+KL+RS+++N K +S ++ Sbjct: 1 MSQPTTPSRFRLNSKAPKPE----PPAKKERAQSVPPDFKKDTKLRRSVLVNTKPKSRDE 56 Query: 259 LLGSQNGVKVVAR---SVNRPVVEQFAKPRRLPQLDSSARRKEEGDK-ELQERLQLNEIL 426 LLGSQ V V SVNRPV EQF+KPR SAR+ EE K EL ER++LN+ L Sbjct: 57 LLGSQMEVARVVSPSLSVNRPVHEQFSKPRT----QRSARKIEEDTKKELLERIELNDNL 112 Query: 427 IKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--HIRDQESNGKQK 600 I+DL+SQVL+LKAELDKAQS N+ELE QNKKL QDLA+AEAKVAA + ES G + Sbjct: 113 
IQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNNTPLPESIGGYQ 172 Query: 601 SPIFKDIQKLIANKLEKFTVKKEAINGPT 687 SP FKDIQKLIANKLE TVKK+A+NGPT Sbjct: 173 SPKFKDIQKLIANKLENSTVKKDAMNGPT 201 >gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus clementina] Length = 561 Score = 195 bits (495), Expect = 1e-47 Identities = 124/228 (54%), Positives = 158/228 (69%), Gaps = 20/228 (8%) Frame = +1 Query: 67 AKPTAMSHST---TPSRFRLNSKTKEAA------NG-SLSPAQEARAKSVPPDVKKD--S 210 +K MSHST T SR R NSKT+E+ NG SLSP +ARAKSVPPDVK + S Sbjct: 8 SKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTNNIS 67 Query: 211 KLKRSLMLNKQRSGEQLLGSQNG--VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG 384 K +R+L+LNK +S E +GS VKV RS+NRPVVEQFA+PRR +D++ + E+G Sbjct: 68 KSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKIEDG 127 Query: 385 -----DKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEA 549 KE +E+L+L+E L+KDLQS+V ALKAE KAQS N ELE QNKKL +DL AAEA Sbjct: 128 LMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEA 187 Query: 550 KVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAINGPTI 690 K+A+ R+Q E+ G+ +SP FKD+QKLIANKLE V +AI+ +I Sbjct: 188 KIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSI 235 >gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica] Length = 552 Score = 166 bits (420), Expect = 6e-39 Identities = 104/208 (50%), Positives = 135/208 (64%), Gaps = 10/208 (4%) Frame = +1 Query: 82 MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQRSGEQL 261 MS T PS R ++ +K + S P+ RAKS+ +RSL+LNK +SGE + Sbjct: 20 MSQPTPPSYLRASASSKAKESPSPRPS---RAKSI----------RRSLLLNKPKSGELV 66 Query: 262 LGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG----DKELQERLQL 414 LGSQ K V R NR V EQFA+PR D +++R EE ++ELQERL + Sbjct: 67 LGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNRELQERLDM 126 Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591 +E L + Q++VLALKAELDKAQ N EL+ QNK L + LAAAEAK+AAF R+Q E+NG Sbjct: 127 SESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTREQRETNG 186 Query: 592 
KQKSPIFKDIQKLIANKLEKFTVKKEAI 675 + +SP FKD+QKLIANKLE+ VKKEA+ Sbjct: 187 EYQSPKFKDLQKLIANKLERPVVKKEAV 214 >ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca subsp. vesca] Length = 560 Score = 164 bits (416), Expect = 2e-38 Identities = 108/215 (50%), Positives = 141/215 (65%), Gaps = 15/215 (6%) Frame = +1 Query: 88 HSTTPSRFRLNSKTKEAANGSLSPAQE-ARAKSVPPDVKKDS---KLKRSLMLNKQRSGE 255 HST S+ R +SK KE S SP Q +RAKSV PDV S ++R+L+ NK +SGE Sbjct: 15 HSTNMSQLRASSKAKE----SQSPTQRPSRAKSVTPDVNHSSDSRSVRRALLQNKPKSGE 70 Query: 256 QLLGSQNG-----VKVVARSVNRPVVEQFAKPRRL-PQLDSSARRKEEGD----KELQER 405 +LGSQ KVV S VVEQFAKPRR P ++++ +R E+ KE+QE+ Sbjct: 71 LVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPVVEANCKRNEDDPHRNMKEMQEK 130 Query: 406 LQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-E 582 ++++E +I LQ++VL LK ELDK N EL+ +NKKL+++L AAEAK+AA Q E Sbjct: 131 IEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKKLSENLTAAEAKIAALTTPQQRE 190 Query: 583 SNGKQKSPIFKDIQKLIANKLEKFTVKKEAINGPT 687 SNG Q SP FKD+QKLIANKLE VKKEA+N P+ Sbjct: 191 SNGYQ-SPKFKDLQKLIANKLECSVVKKEALNEPS 224 >emb|CBI26022.3| unnamed protein product [Vitis vinifera] Length = 572 Score = 158 bits (399), Expect = 2e-36 Identities = 110/237 (46%), Positives = 138/237 (58%), Gaps = 16/237 (6%) Frame = +1 Query: 16 KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165 + IDA + N P K T SH PS +S + + NG S SPA Sbjct: 12 RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71 Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330 RA+S P ++ K +RSL+LNK +SG+ LGSQ VKV+ RS NRPVV+Q A Sbjct: 72 RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131 Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510 P+ S ++ KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+ Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186 Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAIN 678 N KL +DLAAA AK+ A R Q ES + +SP FKDIQKLIANKLE +K+EA N Sbjct: 187 
NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKFKDIQKLIANKLEHPKIKQEASN 243 >ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809254 [Glycine max] Length = 562 Score = 157 bits (398), Expect = 2e-36 Identities = 102/216 (47%), Positives = 137/216 (63%), Gaps = 18/216 (8%) Frame = +1 Query: 82 MSHSTTPSRFRLNSKTKE--------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN 237 + +S TPSR RL SK +E N + RAKSV P++K +S++KR L+LN Sbjct: 10 LQNSLTPSRLRLPSKYREPPKTPPEVVVNNVVVSTPSRRAKSVTPELKHNSRIKRGLVLN 69 Query: 238 KQRSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD---- 387 K + E+++G+ Q G KVVAR V VVEQFA+PR + R KE+ D Sbjct: 70 KAKPNEEVVGTTQRGREAEETKVVARFVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSK 129 Query: 388 KELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFH 567 KEL E+L+ +E LIK+LQS+V ALKAEL+K + ELE N+KL +DLAAAE KV + Sbjct: 130 KELMEKLEASESLIKNLQSEVQALKAELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLG 189 Query: 568 IRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 +++ NG+ +SP FK IQKLIA+KLE+ VKKEAI Sbjct: 190 -GNEKPNGEHQSPKFKHIQKLIADKLERSIVKKEAI 224 >ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820086 [Glycine max] Length = 576 Score = 156 bits (395), Expect = 5e-36 Identities = 101/215 (46%), Positives = 139/215 (64%), Gaps = 17/215 (7%) Frame = +1 Query: 82 MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240 + +S TPSR RL SK +E N + RAKSV P++K +S++K+ L+LNK Sbjct: 27 IQNSLTPSRLRLPSKYREPPKTPPEVVNNGMVSTPLRRAKSVTPELKHNSRIKKGLVLNK 86 Query: 241 QRSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----K 390 + E++LG+ Q G KVV+R V VEQF++PR + R KE+ D K Sbjct: 87 AKPNEEVLGTTQRGREVEEAKVVSRFVRPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKK 146 Query: 391 ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI 570 EL E+L+ +E LIK+LQS+VLALKAEL+K + N ELE N+KL +DLAAAEAKV + Sbjct: 147 ELMEKLEASESLIKNLQSEVLALKAELEKVKGLNVELESNNRKLTEDLAAAEAKVVSLS- 205 Query: 571 RDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 +++ NG+ +SP FK IQKLIA+KLE+ VKKE+I Sbjct: 206 GNEKPNGEHQSPKFKLIQKLIADKLERSIVKKESI 240 >ref|XP_002275219.2| 
PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera] Length = 551 Score = 155 bits (393), Expect = 8e-36 Identities = 100/209 (47%), Positives = 131/209 (62%), Gaps = 6/209 (2%) Frame = +1 Query: 70 KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQRS 249 +P++ S S++ S ++ + S SPA RA+S P ++ K +RSL+LNK +S Sbjct: 19 RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 78 Query: 250 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 414 G+ LGSQ VKV+ RS NRPVV+Q A P+ S ++ KELQE+L L Sbjct: 79 GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 133 Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591 + LI +LQS+VL LKAELDKAQS N EL+ N KL +DLAAA AK+ A R Q ES Sbjct: 134 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 193 Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAIN 678 + +SP FKDIQKLIANKLE +K+EA N Sbjct: 194 EYQSPKFKDIQKLIANKLEHPKIKQEASN 222 >emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera] Length = 348 Score = 155 bits (392), Expect = 1e-35 Identities = 109/237 (45%), Positives = 137/237 (57%), Gaps = 16/237 (6%) Frame = +1 Query: 16 KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165 + IDA + N P K T SH PS +S + + NG S SPA Sbjct: 12 RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71 Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330 RA+S P ++ K +RSL+LNK +SG+ LGSQ VKV+ RS NRPVV+Q A Sbjct: 72 RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131 Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510 P+ S ++ KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+ Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186 Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAIN 678 N KL +DLAAA AK+ A R Q ES + +SP FKDIQKLIA KLE +K+EA N Sbjct: 187 NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKFKDIQKLIAXKLEHPKIKQEASN 243 >gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris] 
gi|561011661|gb|ESW10568.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris] Length = 584 Score = 151 bits (381), Expect = 2e-34 Identities = 99/216 (45%), Positives = 135/216 (62%), Gaps = 18/216 (8%) Frame = +1 Query: 82 MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240 + S TPSR RL SK +E NG +S RAKSV P++K S++KR L+LNK Sbjct: 28 LQSSLTPSRLRLPSKYREPPRTPPEVVNGVVSTPTR-RAKSVTPELKHASRIKRGLVLNK 86 Query: 241 QRSGEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----KE 393 + E+++G+ G K V R + VEQFA PR + R KEE D KE Sbjct: 87 AKPNEEVVGTHRGREAVEPKAVPRFMRPHAVEQFASPRSAVGDFAMKRDKEEPDGKSKKE 146 Query: 394 LQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--H 567 L E+L+++E LI++LQS+VLALKAEL+K + N ELE N+KL +D+AAAE+KV + Sbjct: 147 LMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVELESHNRKLTKDIAAAESKVMSLGGS 206 Query: 568 IRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 + +E G+ +SP FK IQKLIA+KLE+ VKKEA+ Sbjct: 207 EKMKEPIGEHQSPKFKHIQKLIADKLERSRVKKEAL 242 >gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5, partial [Theobroma cacao] Length = 458 Score = 144 bits (362), Expect = 3e-32 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%) Frame = +1 Query: 58 KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225 KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S Sbjct: 19 KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73 Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390 L+LNK +SG+Q +VV VV+QFA+PRRL +++ +K E + Sbjct: 74 LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123 Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558 EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183 Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 A RD +ESNG +S FKDIQ+ IANKLE + +EAI Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227 >gb|EOY06942.1| Hydroxyproline-rich 
glycoprotein family protein, putative isoform 4 [Theobroma cacao] Length = 565 Score = 144 bits (362), Expect = 3e-32 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%) Frame = +1 Query: 58 KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225 KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S Sbjct: 19 KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73 Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390 L+LNK +SG+Q +VV VV+QFA+PRRL +++ +K E + Sbjct: 74 LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123 Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558 EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183 Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 A RD +ESNG +S FKDIQ+ IANKLE + +EAI Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227 >gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 561 Score = 144 bits (362), Expect = 3e-32 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%) Frame = +1 Query: 58 KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225 KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S Sbjct: 19 KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73 Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390 L+LNK +SG+Q +VV VV+QFA+PRRL +++ +K E + Sbjct: 74 LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123 Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558 EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183 Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 A RD +ESNG +S FKDIQ+ IANKLE + +EAI Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227 >gb|EOY06939.1| Hydroxyproline-rich glycoprotein family 
protein, putative isoform 1 [Theobroma cacao] Length = 564 Score = 144 bits (362), Expect = 3e-32 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%) Frame = +1 Query: 58 KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225 KPAA K T MSH STTPSR R+NSK S EAR ++ P VK +K +S Sbjct: 19 KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73 Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390 L+LNK +SG+Q +VV VV+QFA+PRRL +++ +K E + Sbjct: 74 LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123 Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558 EL+E+L +E L+KDL++QVL LKAELD A+S N ELE N+KL +DL AAEAK+A Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183 Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 A RD +ESNG +S FKDIQ+ IANKLE + +EAI Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227 >gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum] Length = 554 Score = 135 bits (341), Expect = 9e-30 Identities = 96/208 (46%), Positives = 126/208 (60%), Gaps = 13/208 (6%) Frame = +1 Query: 91 STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDS----KLKRSLMLNKQRSGEQ 258 STTPSR R AAN S RA+ K K +RS++L + +SGE+ Sbjct: 8 STTPSRVR-------AANSHYSVISRPRAQDDNGKPKSSGHDPGKNRRSILLKRAKSGEE 60 Query: 259 ---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDSSARRKEEGDK-----ELQERLQ 411 +L Q ARSVNRP VVEQF PRR S EE +K EL+E+L Sbjct: 61 ETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEEMAAEEDEKRKKMEELEEKLV 115 Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591 NE LIKDLQ+QV +LKAEL++A+SSN ELEL+N+KL+QDL +AEAK+++ D+ + Sbjct: 116 ANESLIKDLQAQVSSLKAELEEARSSNAELELKNRKLSQDLVSAEAKISSLSSNDKPAKE 175 Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAI 675 Q + FKDIQK+IA+KLE+ VKKE + Sbjct: 176 HQNTR-FKDIQKIIASKLEQSKVKKELV 202 >ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798183 [Glycine max] Length = 565 Score = 133 bits (335), Expect = 4e-29 Identities = 89/197 
(45%), Positives = 125/197 (63%), Gaps = 7/197 (3%) Frame = +1 Query: 100 PSRFRLNSKTKEAANGSLS--PAQEARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQ 273 P R R +SK ++ ++ RA+SVPPD+K S+ KR +++NK + E++LGSQ Sbjct: 33 PPRLRASSKAPKSPPEVVNRESISSTRAESVPPDLKNVSRAKRGVVVNKPKLNEEVLGSQ 92 Query: 274 NG----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQ 441 + +VAR R V F R+ DS ++K+E LQE+L+++E LIK LQ Sbjct: 93 KAEEGKIVIVARPRRR--VGDFGS-RKSEDDDSHGKKKKE---LLQEKLEVSENLIKSLQ 146 Query: 442 SQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI-RDQESNGKQKSPIFKD 618 S+VLAL+ ELD+ +S N ELE QN KL Q+LAAAEAK++ I + + G+ +SP FKD Sbjct: 147 SEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKPIGEHRSPKFKD 206 Query: 619 IQKLIANKLEKFTVKKE 669 IQKLIA KLE+ VKKE Sbjct: 207 IQKLIAEKLERSRVKKE 223 >ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Capsella rubella] gi|482576206|gb|EOA40393.1| hypothetical protein CARUB_v10009119mg [Capsella rubella] Length = 450 Score = 131 bits (329), Expect = 2e-28 Identities = 98/220 (44%), Positives = 131/220 (59%), Gaps = 25/220 (11%) Frame = +1 Query: 91 STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234 STTPSR R AAN S RA K DVK D +K +RS++L Sbjct: 8 STTPSRVR-------AANSHYSVISRPRAQDDNGLAGGKPKHSNHDVKNDPAKNRRSILL 60 Query: 235 NKQRSGEQLLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARRKEE 381 K +SG++ + V ARSVNRP VVEQF PRR + +++A E+ Sbjct: 61 RKAKSGDEETTAVL-VPQRARSVNRPAVVEQFGCPRRPISRKIEETVMSTAEAAAEEDEK 119 Query: 382 GDK--ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKV 555 + EL+E+L +NE LIKDLQ QVL LK EL++A+SSN ELEL N+KL+QDLA+AEAK+ Sbjct: 120 RKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARSSNVELELNNRKLSQDLASAEAKI 179 Query: 556 AAFHIRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675 ++ D+ + Q S FKDIQ+LIA+KLE+ V+KE + Sbjct: 180 SSLSSNDKPAKEHQNSR-FKDIQRLIASKLEQSKVRKEVV 218 >ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511271 [Cicer arietinum] Length = 933 Score = 130 bits (327), Expect = 4e-28 Identities = 92/214 (42%), Positives = 134/214 (62%), Gaps = 16/214 (7%) Frame = 
+1 Query: 79 AMSHSTTPSRFRL---NSKTKEA-------ANGSLSPAQEARAKSVPPDVKKDSKLKRSL 228 ++ +TT +R R+ +SK KE+ N + + RAKSVPPD+K +SK KR + Sbjct: 67 SIPQTTTTTRIRVVKGSSKIKESPKTPPEIVNNNRASISSTRAKSVPPDLKNNSKAKRGI 126 Query: 229 MLNKQ--RSGEQL-LGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQ 399 ++ + +S E++ SQ G K + + VV +PRR D +++ KE+ Sbjct: 127 VVMNKLVKSNEEVECSSQKGTKEAEEA--KIVV---VRPRRRRTNDDPDEKEK---KEMV 178 Query: 400 ERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF---HI 570 E+L++++ LIK+L+S+V ALKAELDK ++ N ELE QN KL Q+LAAAEAK+AA + Sbjct: 179 EKLEMSDNLIKNLESEVKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNS 238 Query: 571 RDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEA 672 R +E G+ +SP FKDIQKLIA+KLE VKKEA Sbjct: 239 RKKELIGEHQSPKFKDIQKLIADKLEMSKVKKEA 272 >ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata] gi|297337263|gb|EFH67680.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata] Length = 567 Score = 130 bits (327), Expect = 4e-28 Identities = 98/219 (44%), Positives = 130/219 (59%), Gaps = 26/219 (11%) Frame = +1 Query: 91 STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234 STTPSR R AAN S + RA KS DVK D +K +RS++L Sbjct: 8 STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGHDVKNDPAKNRRSILL 60 Query: 235 NKQRSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARR 372 + + GE+ +L Q ARSVNRP VVEQF PRR + + Sbjct: 61 KRAKYGEEETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKTEESVMATAVVAEDE 115 Query: 373 KEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAK 552 K + +EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL+NKKL+QDLA+AEAK Sbjct: 116 KRKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNAELELKNKKLSQDLASAEAK 175 Query: 553 VAAFHIRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKE 669 +++ D+ + Q + FKDIQ+LIA+KLE+ VKKE Sbjct: 176 ISSLSSNDKPAKEHQNTR-FKDIQRLIASKLEQSKVKKE 213
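
The per-hit statistics in the alignments above (bit score, E-value, identities, positives, gaps) follow a fixed textual pattern, so they can be pulled out with a regular expression. The following is a minimal Python sketch, not part of the BLAST distribution: the function name `parse_stats` and the dictionary keys are hypothetical, and the pattern assumes the "Score = ... Expect = ... Identities = ..." wording exactly as it appears in this report, with the E-value written in a form Python's `float()` accepts.

```python
import re

# Pattern for one hit's statistics text, as worded in this report.
# The Gaps field is optional because ungapped alignments omit it.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>\d+(?:\.\d+)?) bits \((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+) \(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+ \(\d+%\))?"
)


def parse_stats(text):
    """Return the first hit's statistics as a dict, or None if absent.

    Hypothetical helper for illustration only.
    """
    m = STATS.search(text)
    if m is None:
        return None
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),  # assumes a float()-parsable E-value
        "identities": int(m["ident"]),
        "positives": int(m["pos"]),
        "gaps": int(m["gaps"] or 0),
        "align_len": int(m["length"]),
    }


# Statistics text of the top hit (gb|EEF00505.2|), copied from the report.
sample = (
    "Score = 211 bits (537), Expect = 2e-52 Identities = 130/212 (61%), "
    "Positives = 158/212 (74%), Gaps = 12/212 (5%) Frame = +1"
)
hit = parse_stats(sample)
# Percent identity recomputed from the counts rather than the rounded field.
percent_identity = 100 * hit["identities"] / hit["align_len"]
print(hit["bits"], hit["evalue"], round(percent_identity, 1))
```

Recomputing the percentage from the raw counts avoids relying on the integer-rounded value printed in the report. For production parsing, a maintained BLAST parser is likely a safer choice than a hand-written pattern.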