BLASTX nr result

ID: Jatropha_contig00016832

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00016832
         (691 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus t...   211   2e-52
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...   204   2e-50
gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus cl...   195   1e-47
gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus pe...   166   6e-39
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...   164   2e-38
emb|CBI26022.3| unnamed protein product [Vitis vinifera]              158   2e-36
ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809...   157   2e-36
ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820...   156   5e-36
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...   155   8e-36
emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]   155   1e-35
gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus...   151   2e-34
gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, ...   144   3e-32
gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, ...   144   3e-32
gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, ...   144   3e-32
gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, ...   144   3e-32
gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema s...   135   9e-30
ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798...   133   4e-29
ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Caps...   131   2e-28
ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511...   130   4e-28
ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arab...   130   4e-28

>gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa]
          Length = 547

 Score =  211 bits (537), Expect = 2e-52
 Identities = 130/212 (61%), Positives = 158/212 (74%), Gaps = 12/212 (5%)
 Frame = +1

Query: 88  HSTTPSRFRLNSKTKEAA----NGSL--SPAQEARAKSVPPDVKKDSKLKRSLM-LNKQR 246
           HSTTPSR R+N KT + A    NGS   SPA + RAKSVPPDVKKD+K+++SL+  NK +
Sbjct: 4   HSTTPSRHRVNFKTPKPAEVANNGSPVPSPANKTRAKSVPPDVKKDTKVRKSLVGNNKPK 63

Query: 247 SGEQLLGSQNGVKVVARSVNRPVVEQFAKPRR----LPQLDSSARRKEEGDKE-LQERLQ 411
           SGE ++GSQ+ V VV RSVNRP  EQFA+PRR    L  +++S R +EE  K+ L E+L+
Sbjct: 64  SGELVVGSQD-VTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYKKGLHEKLE 122

Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591
           L+E LI DLQS+VLALK ELDKA   N ELELQNKKL +DLAAAEAKV+A + R Q S G
Sbjct: 123 LSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSALNTRHQ-SVG 181

Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAINGPT 687
           + + P FKDIQKLIA KLE   VKKEAINGP+
Sbjct: 182 EHQRPRFKDIQKLIAIKLENSPVKKEAINGPS 213


>ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis]
           gi|223541653|gb|EEF43202.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 532

 Score =  204 bits (520), Expect = 2e-50
 Identities = 128/209 (61%), Positives = 154/209 (73%), Gaps = 7/209 (3%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN-KQRSGEQ 258
           MS  TTPSRFRLNSK  +       PA++ RA+SVPPD KKD+KL+RS+++N K +S ++
Sbjct: 1   MSQPTTPSRFRLNSKAPKPE----PPAKKERAQSVPPDFKKDTKLRRSVLVNTKPKSRDE 56

Query: 259 LLGSQNGVKVVAR---SVNRPVVEQFAKPRRLPQLDSSARRKEEGDK-ELQERLQLNEIL 426
           LLGSQ  V  V     SVNRPV EQF+KPR       SAR+ EE  K EL ER++LN+ L
Sbjct: 57  LLGSQMEVARVVSPSLSVNRPVHEQFSKPRT----QRSARKIEEDTKKELLERIELNDNL 112

Query: 427 IKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--HIRDQESNGKQK 600
           I+DL+SQVL+LKAELDKAQS N+ELE QNKKL QDLA+AEAKVAA   +    ES G  +
Sbjct: 113 IQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNNTPLPESIGGYQ 172

Query: 601 SPIFKDIQKLIANKLEKFTVKKEAINGPT 687
           SP FKDIQKLIANKLE  TVKK+A+NGPT
Sbjct: 173 SPKFKDIQKLIANKLENSTVKKDAMNGPT 201


>gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus clementina]
          Length = 561

 Score =  195 bits (495), Expect = 1e-47
 Identities = 124/228 (54%), Positives = 158/228 (69%), Gaps = 20/228 (8%)
 Frame = +1

Query: 67  AKPTAMSHST---TPSRFRLNSKTKEAA------NG-SLSPAQEARAKSVPPDVKKD--S 210
           +K   MSHST   T SR R NSKT+E+       NG SLSP  +ARAKSVPPDVK +  S
Sbjct: 8   SKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTNNIS 67

Query: 211 KLKRSLMLNKQRSGEQLLGSQNG--VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG 384
           K +R+L+LNK +S E  +GS     VKV  RS+NRPVVEQFA+PRR   +D++  + E+G
Sbjct: 68  KSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKIEDG 127

Query: 385 -----DKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEA 549
                 KE +E+L+L+E L+KDLQS+V ALKAE  KAQS N ELE QNKKL +DL AAEA
Sbjct: 128 LMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEA 187

Query: 550 KVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAINGPTI 690
           K+A+   R+Q E+ G+ +SP FKD+QKLIANKLE   V  +AI+  +I
Sbjct: 188 KIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSI 235


>gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica]
          Length = 552

 Score =  166 bits (420), Expect = 6e-39
 Identities = 104/208 (50%), Positives = 135/208 (64%), Gaps = 10/208 (4%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQRSGEQL 261
           MS  T PS  R ++ +K   + S  P+   RAKS+          +RSL+LNK +SGE +
Sbjct: 20  MSQPTPPSYLRASASSKAKESPSPRPS---RAKSI----------RRSLLLNKPKSGELV 66

Query: 262 LGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG----DKELQERLQL 414
           LGSQ        K V R  NR V EQFA+PR     D +++R EE     ++ELQERL +
Sbjct: 67  LGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNRELQERLDM 126

Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591
           +E L  + Q++VLALKAELDKAQ  N EL+ QNK L + LAAAEAK+AAF  R+Q E+NG
Sbjct: 127 SESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTREQRETNG 186

Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           + +SP FKD+QKLIANKLE+  VKKEA+
Sbjct: 187 EYQSPKFKDLQKLIANKLERPVVKKEAV 214


>ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca
           subsp. vesca]
          Length = 560

 Score =  164 bits (416), Expect = 2e-38
 Identities = 108/215 (50%), Positives = 141/215 (65%), Gaps = 15/215 (6%)
 Frame = +1

Query: 88  HSTTPSRFRLNSKTKEAANGSLSPAQE-ARAKSVPPDVKKDS---KLKRSLMLNKQRSGE 255
           HST  S+ R +SK KE    S SP Q  +RAKSV PDV   S    ++R+L+ NK +SGE
Sbjct: 15  HSTNMSQLRASSKAKE----SQSPTQRPSRAKSVTPDVNHSSDSRSVRRALLQNKPKSGE 70

Query: 256 QLLGSQNG-----VKVVARSVNRPVVEQFAKPRRL-PQLDSSARRKEEGD----KELQER 405
            +LGSQ        KVV  S    VVEQFAKPRR  P ++++ +R E+      KE+QE+
Sbjct: 71  LVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPVVEANCKRNEDDPHRNMKEMQEK 130

Query: 406 LQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-E 582
           ++++E +I  LQ++VL LK ELDK    N EL+ +NKKL+++L AAEAK+AA     Q E
Sbjct: 131 IEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKKLSENLTAAEAKIAALTTPQQRE 190

Query: 583 SNGKQKSPIFKDIQKLIANKLEKFTVKKEAINGPT 687
           SNG Q SP FKD+QKLIANKLE   VKKEA+N P+
Sbjct: 191 SNGYQ-SPKFKDLQKLIANKLECSVVKKEALNEPS 224


>emb|CBI26022.3| unnamed protein product [Vitis vinifera]
          Length = 572

 Score =  158 bits (399), Expect = 2e-36
 Identities = 110/237 (46%), Positives = 138/237 (58%), Gaps = 16/237 (6%)
 Frame = +1

Query: 16  KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165
           + IDA +      N P   K T  SH   PS    +S +  +        NG  S SPA 
Sbjct: 12  RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71

Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330
             RA+S P ++    K +RSL+LNK +SG+  LGSQ       VKV+ RS NRPVV+Q A
Sbjct: 72  RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131

Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510
                P+  S     ++  KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+  
Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186

Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAIN 678
           N KL +DLAAA AK+ A   R Q ES  + +SP FKDIQKLIANKLE   +K+EA N
Sbjct: 187 NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKFKDIQKLIANKLEHPKIKQEASN 243


>ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809254 [Glycine max]
          Length = 562

 Score =  157 bits (398), Expect = 2e-36
 Identities = 102/216 (47%), Positives = 137/216 (63%), Gaps = 18/216 (8%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE--------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN 237
           + +S TPSR RL SK +E          N  +      RAKSV P++K +S++KR L+LN
Sbjct: 10  LQNSLTPSRLRLPSKYREPPKTPPEVVVNNVVVSTPSRRAKSVTPELKHNSRIKRGLVLN 69

Query: 238 KQRSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD---- 387
           K +  E+++G+ Q G      KVVAR V   VVEQFA+PR      +  R KE+ D    
Sbjct: 70  KAKPNEEVVGTTQRGREAEETKVVARFVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSK 129

Query: 388 KELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFH 567
           KEL E+L+ +E LIK+LQS+V ALKAEL+K +    ELE  N+KL +DLAAAE KV +  
Sbjct: 130 KELMEKLEASESLIKNLQSEVQALKAELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLG 189

Query: 568 IRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
             +++ NG+ +SP FK IQKLIA+KLE+  VKKEAI
Sbjct: 190 -GNEKPNGEHQSPKFKHIQKLIADKLERSIVKKEAI 224


>ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820086 [Glycine max]
          Length = 576

 Score =  156 bits (395), Expect = 5e-36
 Identities = 101/215 (46%), Positives = 139/215 (64%), Gaps = 17/215 (7%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240
           + +S TPSR RL SK +E         N  +      RAKSV P++K +S++K+ L+LNK
Sbjct: 27  IQNSLTPSRLRLPSKYREPPKTPPEVVNNGMVSTPLRRAKSVTPELKHNSRIKKGLVLNK 86

Query: 241 QRSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----K 390
            +  E++LG+ Q G      KVV+R V    VEQF++PR      +  R KE+ D    K
Sbjct: 87  AKPNEEVLGTTQRGREVEEAKVVSRFVRPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKK 146

Query: 391 ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI 570
           EL E+L+ +E LIK+LQS+VLALKAEL+K +  N ELE  N+KL +DLAAAEAKV +   
Sbjct: 147 ELMEKLEASESLIKNLQSEVLALKAELEKVKGLNVELESNNRKLTEDLAAAEAKVVSLS- 205

Query: 571 RDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
            +++ NG+ +SP FK IQKLIA+KLE+  VKKE+I
Sbjct: 206 GNEKPNGEHQSPKFKLIQKLIADKLERSIVKKESI 240


>ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera]
          Length = 551

 Score =  155 bits (393), Expect = 8e-36
 Identities = 100/209 (47%), Positives = 131/209 (62%), Gaps = 6/209 (2%)
 Frame = +1

Query: 70  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQRS 249
           +P++ S S++ S  ++ +        S SPA   RA+S P ++    K +RSL+LNK +S
Sbjct: 19  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 78

Query: 250 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 414
           G+  LGSQ       VKV+ RS NRPVV+Q A     P+  S     ++  KELQE+L L
Sbjct: 79  GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 133

Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591
            + LI +LQS+VL LKAELDKAQS N EL+  N KL +DLAAA AK+ A   R Q ES  
Sbjct: 134 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 193

Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAIN 678
           + +SP FKDIQKLIANKLE   +K+EA N
Sbjct: 194 EYQSPKFKDIQKLIANKLEHPKIKQEASN 222


>emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]
          Length = 348

 Score =  155 bits (392), Expect = 1e-35
 Identities = 109/237 (45%), Positives = 137/237 (57%), Gaps = 16/237 (6%)
 Frame = +1

Query: 16  KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165
           + IDA +      N P   K T  SH   PS    +S +  +        NG  S SPA 
Sbjct: 12  RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71

Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330
             RA+S P ++    K +RSL+LNK +SG+  LGSQ       VKV+ RS NRPVV+Q A
Sbjct: 72  RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131

Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510
                P+  S     ++  KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+  
Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186

Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLEKFTVKKEAIN 678
           N KL +DLAAA AK+ A   R Q ES  + +SP FKDIQKLIA KLE   +K+EA N
Sbjct: 187 NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKFKDIQKLIAXKLEHPKIKQEASN 243


>gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
           gi|561011661|gb|ESW10568.1| hypothetical protein
           PHAVU_009G220500g [Phaseolus vulgaris]
          Length = 584

 Score =  151 bits (381), Expect = 2e-34
 Identities = 99/216 (45%), Positives = 135/216 (62%), Gaps = 18/216 (8%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240
           +  S TPSR RL SK +E         NG +S     RAKSV P++K  S++KR L+LNK
Sbjct: 28  LQSSLTPSRLRLPSKYREPPRTPPEVVNGVVSTPTR-RAKSVTPELKHASRIKRGLVLNK 86

Query: 241 QRSGEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----KE 393
            +  E+++G+  G      K V R +    VEQFA PR      +  R KEE D    KE
Sbjct: 87  AKPNEEVVGTHRGREAVEPKAVPRFMRPHAVEQFASPRSAVGDFAMKRDKEEPDGKSKKE 146

Query: 394 LQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--H 567
           L E+L+++E LI++LQS+VLALKAEL+K +  N ELE  N+KL +D+AAAE+KV +    
Sbjct: 147 LMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVELESHNRKLTKDIAAAESKVMSLGGS 206

Query: 568 IRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
            + +E  G+ +SP FK IQKLIA+KLE+  VKKEA+
Sbjct: 207 EKMKEPIGEHQSPKFKHIQKLIADKLERSRVKKEAL 242


>gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, putative isoform
           5, partial [Theobroma cacao]
          Length = 458

 Score =  144 bits (362), Expect = 3e-32
 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK +SG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           A   RD     +ESNG  +S  FKDIQ+ IANKLE   + +EAI
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227


>gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 4
           [Theobroma cacao]
          Length = 565

 Score =  144 bits (362), Expect = 3e-32
 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK +SG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           A   RD     +ESNG  +S  FKDIQ+ IANKLE   + +EAI
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227


>gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao]
          Length = 561

 Score =  144 bits (362), Expect = 3e-32
 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK +SG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           A   RD     +ESNG  +S  FKDIQ+ IANKLE   + +EAI
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227


>gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao]
          Length = 564

 Score =  144 bits (362), Expect = 3e-32
 Identities = 105/224 (46%), Positives = 136/224 (60%), Gaps = 18/224 (8%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQRSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK +SG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           A   RD     +ESNG  +S  FKDIQ+ IANKLE   + +EAI
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITREAI 227


>gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum]
          Length = 554

 Score =  135 bits (341), Expect = 9e-30
 Identities = 96/208 (46%), Positives = 126/208 (60%), Gaps = 13/208 (6%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDS----KLKRSLMLNKQRSGEQ 258
           STTPSR R       AAN   S     RA+      K       K +RS++L + +SGE+
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGKPKSSGHDPGKNRRSILLKRAKSGEE 60

Query: 259 ---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDSSARRKEEGDK-----ELQERLQ 411
              +L  Q      ARSVNRP VVEQF  PRR     S     EE +K     EL+E+L 
Sbjct: 61  ETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEEMAAEEDEKRKKMEELEEKLV 115

Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591
            NE LIKDLQ+QV +LKAEL++A+SSN ELEL+N+KL+QDL +AEAK+++    D+ +  
Sbjct: 116 ANESLIKDLQAQVSSLKAELEEARSSNAELELKNRKLSQDLVSAEAKISSLSSNDKPAKE 175

Query: 592 KQKSPIFKDIQKLIANKLEKFTVKKEAI 675
            Q +  FKDIQK+IA+KLE+  VKKE +
Sbjct: 176 HQNTR-FKDIQKIIASKLEQSKVKKELV 202


>ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798183 [Glycine max]
          Length = 565

 Score =  133 bits (335), Expect = 4e-29
 Identities = 89/197 (45%), Positives = 125/197 (63%), Gaps = 7/197 (3%)
 Frame = +1

Query: 100 PSRFRLNSKTKEAANGSLS--PAQEARAKSVPPDVKKDSKLKRSLMLNKQRSGEQLLGSQ 273
           P R R +SK  ++    ++       RA+SVPPD+K  S+ KR +++NK +  E++LGSQ
Sbjct: 33  PPRLRASSKAPKSPPEVVNRESISSTRAESVPPDLKNVSRAKRGVVVNKPKLNEEVLGSQ 92

Query: 274 NG----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQ 441
                 + +VAR   R  V  F   R+    DS  ++K+E    LQE+L+++E LIK LQ
Sbjct: 93  KAEEGKIVIVARPRRR--VGDFGS-RKSEDDDSHGKKKKE---LLQEKLEVSENLIKSLQ 146

Query: 442 SQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI-RDQESNGKQKSPIFKD 618
           S+VLAL+ ELD+ +S N ELE QN KL Q+LAAAEAK++   I  + +  G+ +SP FKD
Sbjct: 147 SEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKPIGEHRSPKFKD 206

Query: 619 IQKLIANKLEKFTVKKE 669
           IQKLIA KLE+  VKKE
Sbjct: 207 IQKLIAEKLERSRVKKE 223


>ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Capsella rubella]
           gi|482576206|gb|EOA40393.1| hypothetical protein
           CARUB_v10009119mg [Capsella rubella]
          Length = 450

 Score =  131 bits (329), Expect = 2e-28
 Identities = 98/220 (44%), Positives = 131/220 (59%), Gaps = 25/220 (11%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234
           STTPSR R       AAN   S     RA           K    DVK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGLAGGKPKHSNHDVKNDPAKNRRSILL 60

Query: 235 NKQRSGEQLLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARRKEE 381
            K +SG++   +   V   ARSVNRP VVEQF  PRR          +   +++A   E+
Sbjct: 61  RKAKSGDEETTAVL-VPQRARSVNRPAVVEQFGCPRRPISRKIEETVMSTAEAAAEEDEK 119

Query: 382 GDK--ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKV 555
             +  EL+E+L +NE LIKDLQ QVL LK EL++A+SSN ELEL N+KL+QDLA+AEAK+
Sbjct: 120 RKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARSSNVELELNNRKLSQDLASAEAKI 179

Query: 556 AAFHIRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEAI 675
           ++    D+ +   Q S  FKDIQ+LIA+KLE+  V+KE +
Sbjct: 180 SSLSSNDKPAKEHQNSR-FKDIQRLIASKLEQSKVRKEVV 218


>ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511271 [Cicer arietinum]
          Length = 933

 Score =  130 bits (327), Expect = 4e-28
 Identities = 92/214 (42%), Positives = 134/214 (62%), Gaps = 16/214 (7%)
 Frame = +1

Query: 79  AMSHSTTPSRFRL---NSKTKEA-------ANGSLSPAQEARAKSVPPDVKKDSKLKRSL 228
           ++  +TT +R R+   +SK KE+        N + +     RAKSVPPD+K +SK KR +
Sbjct: 67  SIPQTTTTTRIRVVKGSSKIKESPKTPPEIVNNNRASISSTRAKSVPPDLKNNSKAKRGI 126

Query: 229 MLNKQ--RSGEQL-LGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQ 399
           ++  +  +S E++   SQ G K    +  + VV    +PRR    D    +++   KE+ 
Sbjct: 127 VVMNKLVKSNEEVECSSQKGTKEAEEA--KIVV---VRPRRRRTNDDPDEKEK---KEMV 178

Query: 400 ERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF---HI 570
           E+L++++ LIK+L+S+V ALKAELDK ++ N ELE QN KL Q+LAAAEAK+AA    + 
Sbjct: 179 EKLEMSDNLIKNLESEVKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNS 238

Query: 571 RDQESNGKQKSPIFKDIQKLIANKLEKFTVKKEA 672
           R +E  G+ +SP FKDIQKLIA+KLE   VKKEA
Sbjct: 239 RKKELIGEHQSPKFKDIQKLIADKLEMSKVKKEA 272


>ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp.
           lyrata] gi|297337263|gb|EFH67680.1| hypothetical protein
           ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score =  130 bits (327), Expect = 4e-28
 Identities = 98/219 (44%), Positives = 130/219 (59%), Gaps = 26/219 (11%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234
           STTPSR R       AAN   S   + RA           KS   DVK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGHDVKNDPAKNRRSILL 60

Query: 235 NKQRSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARR 372
            + + GE+   +L  Q      ARSVNRP VVEQF  PRR          +     +   
Sbjct: 61  KRAKYGEEETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKTEESVMATAVVAEDE 115

Query: 373 KEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAK 552
           K +  +EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL+NKKL+QDLA+AEAK
Sbjct: 116 KRKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNAELELKNKKLSQDLASAEAK 175

Query: 553 VAAFHIRDQESNGKQKSPIFKDIQKLIANKLEKFTVKKE 669
           +++    D+ +   Q +  FKDIQ+LIA+KLE+  VKKE
Sbjct: 176 ISSLSSNDKPAKEHQNTR-FKDIQRLIASKLEQSKVKKE 213

