BLASTX nr result

ID: Jatropha_contig00039274

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00039274
         (615 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus t...   201   1e-49
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...   196   5e-48
gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus cl...   192   6e-47
gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus pe...   163   4e-38
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...   156   4e-36
ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820...   154   2e-35
ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809...   154   2e-35
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...   152   7e-35
emb|CBI26022.3| unnamed protein product [Vitis vinifera]              152   7e-35
emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]   149   4e-34
gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus...   148   1e-33
gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, ...   141   1e-31
gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema s...   136   4e-30
ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798...   131   1e-28
ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arab...   129   8e-28
ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Caps...   128   1e-27
ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511...   127   2e-27

>gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa]
          Length = 547

 Score =  201 bits (512), Expect = 1e-49
 Identities = 126/205 (61%), Positives = 152/205 (74%), Gaps = 12/205 (5%)
 Frame = +1

Query: 37  HSTTPSRFRLNSKTKEAA----NGSL--SPAQEARAKSVPPGVKKDSKLKRSLM-LNKQK 195
           HSTTPSR R+N KT + A    NGS   SPA + RAKSVPP VKKD+K+++SL+  NK K
Sbjct: 4   HSTTPSRHRVNFKTPKPAEVANNGSPVPSPANKTRAKSVPPDVKKDTKVRKSLVGNNKPK 63

Query: 196 SGEQLLGSQNGVKVVARSVNRPVVEQFAKPRR----LPQLDSSARRKEEGDKE-LQERLQ 360
           SGE ++GSQ+ V VV RSVNRP  EQFA+PRR    L  +++S R +EE  K+ L E+L+
Sbjct: 64  SGELVVGSQD-VTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYKKGLHEKLE 122

Query: 361 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 540
           L+E LI DLQS+VLALK ELDKA   N ELELQNKKL +DLAAAEAKV+A + R Q S G
Sbjct: 123 LSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSALNTRHQ-SVG 181

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + + P FKDIQKLIA KLENS VKK
Sbjct: 182 EHQRPRFKDIQKLIAIKLENSPVKK 206


>ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis]
           gi|223541653|gb|EEF43202.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 532

 Score =  196 bits (497), Expect = 5e-48
 Identities = 125/202 (61%), Positives = 148/202 (73%), Gaps = 7/202 (3%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLN-KQKSGEQ 207
           MS  TTPSRFRLNSK  +       PA++ RA+SVPP  KKD+KL+RS+++N K KS ++
Sbjct: 1   MSQPTTPSRFRLNSKAPKPE----PPAKKERAQSVPPDFKKDTKLRRSVLVNTKPKSRDE 56

Query: 208 LLGSQNGVKVVAR---SVNRPVVEQFAKPRRLPQLDSSARRKEEGDK-ELQERLQLNEIL 375
           LLGSQ  V  V     SVNRPV EQF+KPR       SAR+ EE  K EL ER++LN+ L
Sbjct: 57  LLGSQMEVARVVSPSLSVNRPVHEQFSKPRT----QRSARKIEEDTKKELLERIELNDNL 112

Query: 376 IKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--HIRDQESNGKQK 549
           I+DL+SQVL+LKAELDKAQS N+ELE QNKKL QDLA+AEAKVAA   +    ES G  +
Sbjct: 113 IQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNNTPLPESIGGYQ 172

Query: 550 SPIFKDIQKLIANKLENSTVKK 615
           SP FKDIQKLIANKLENSTVKK
Sbjct: 173 SPKFKDIQKLIANKLENSTVKK 194


>gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus clementina]
          Length = 561

 Score =  192 bits (488), Expect = 6e-47
 Identities = 122/218 (55%), Positives = 153/218 (70%), Gaps = 20/218 (9%)
 Frame = +1

Query: 16  AKPTAMSHST---TPSRFRLNSKTKEAA------NG-SLSPAQEARAKSVPPGVKKD--S 159
           +K   MSHST   T SR R NSKT+E+       NG SLSP  +ARAKSVPP VK +  S
Sbjct: 8   SKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTNNIS 67

Query: 160 KLKRSLMLNKQKSGEQLLGSQNG--VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG 333
           K +R+L+LNK KS E  +GS     VKV  RS+NRPVVEQFA+PRR   +D++  + E+G
Sbjct: 68  KSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKIEDG 127

Query: 334 -----DKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEA 498
                 KE +E+L+L+E L+KDLQS+V ALKAE  KAQS N ELE QNKKL +DL AAEA
Sbjct: 128 LMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEA 187

Query: 499 KVAAFHIRDQ-ESNGKQKSPIFKDIQKLIANKLENSTV 609
           K+A+   R+Q E+ G+ +SP FKD+QKLIANKLE+S V
Sbjct: 188 KIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIV 225


>gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica]
          Length = 552

 Score =  163 bits (412), Expect = 4e-38
 Identities = 103/205 (50%), Positives = 131/205 (63%), Gaps = 10/205 (4%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKSGEQL 210
           MS  T PS  R ++ +K   + S  P+   RAKS+          +RSL+LNK KSGE +
Sbjct: 20  MSQPTPPSYLRASASSKAKESPSPRPS---RAKSI----------RRSLLLNKPKSGELV 66

Query: 211 LGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG----DKELQERLQL 363
           LGSQ        K V R  NR V EQFA+PR     D +++R EE     ++ELQERL +
Sbjct: 67  LGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNRELQERLDM 126

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
           +E L  + Q++VLALKAELDKAQ  N EL+ QNK L + LAAAEAK+AAF  R+Q E+NG
Sbjct: 127 SESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTREQRETNG 186

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKD+QKLIANKLE   VKK
Sbjct: 187 EYQSPKFKDLQKLIANKLERPVVKK 211


>ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca
           subsp. vesca]
          Length = 560

 Score =  156 bits (395), Expect = 4e-36
 Identities = 105/208 (50%), Positives = 135/208 (64%), Gaps = 15/208 (7%)
 Frame = +1

Query: 37  HSTTPSRFRLNSKTKEAANGSLSPAQE-ARAKSVPPGVKKDS---KLKRSLMLNKQKSGE 204
           HST  S+ R +SK KE    S SP Q  +RAKSV P V   S    ++R+L+ NK KSGE
Sbjct: 15  HSTNMSQLRASSKAKE----SQSPTQRPSRAKSVTPDVNHSSDSRSVRRALLQNKPKSGE 70

Query: 205 QLLGSQNG-----VKVVARSVNRPVVEQFAKPRRL-PQLDSSARRKEEGD----KELQER 354
            +LGSQ        KVV  S    VVEQFAKPRR  P ++++ +R E+      KE+QE+
Sbjct: 71  LVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPVVEANCKRNEDDPHRNMKEMQEK 130

Query: 355 LQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-E 531
           ++++E +I  LQ++VL LK ELDK    N EL+ +NKKL+++L AAEAK+AA     Q E
Sbjct: 131 IEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKKLSENLTAAEAKIAALTTPQQRE 190

Query: 532 SNGKQKSPIFKDIQKLIANKLENSTVKK 615
           SNG Q SP FKD+QKLIANKLE S VKK
Sbjct: 191 SNGYQ-SPKFKDLQKLIANKLECSVVKK 217


>ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820086 [Glycine max]
          Length = 576

 Score =  154 bits (388), Expect = 2e-35
 Identities = 101/212 (47%), Positives = 135/212 (63%), Gaps = 17/212 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNK 189
           + +S TPSR RL SK +E         N  +      RAKSV P +K +S++K+ L+LNK
Sbjct: 27  IQNSLTPSRLRLPSKYREPPKTPPEVVNNGMVSTPLRRAKSVTPELKHNSRIKKGLVLNK 86

Query: 190 QKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----K 339
            K  E++LG+ Q G      KVV+R V    VEQF++PR      +  R KE+ D    K
Sbjct: 87  AKPNEEVLGTTQRGREVEEAKVVSRFVRPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKK 146

Query: 340 ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI 519
           EL E+L+ +E LIK+LQS+VLALKAEL+K +  N ELE  N+KL +DLAAAEAKV +   
Sbjct: 147 ELMEKLEASESLIKNLQSEVLALKAELEKVKGLNVELESNNRKLTEDLAAAEAKVVSLS- 205

Query: 520 RDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
            +++ NG+ +SP FK IQKLIA+KLE S VKK
Sbjct: 206 GNEKPNGEHQSPKFKLIQKLIADKLERSIVKK 237


>ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809254 [Glycine max]
          Length = 562

 Score =  154 bits (388), Expect = 2e-35
 Identities = 101/213 (47%), Positives = 133/213 (62%), Gaps = 18/213 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE--------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLN 186
           + +S TPSR RL SK +E          N  +      RAKSV P +K +S++KR L+LN
Sbjct: 10  LQNSLTPSRLRLPSKYREPPKTPPEVVVNNVVVSTPSRRAKSVTPELKHNSRIKRGLVLN 69

Query: 187 KQKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD---- 336
           K K  E+++G+ Q G      KVVAR V   VVEQFA+PR      +  R KE+ D    
Sbjct: 70  KAKPNEEVVGTTQRGREAEETKVVARFVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSK 129

Query: 337 KELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFH 516
           KEL E+L+ +E LIK+LQS+V ALKAEL+K +    ELE  N+KL +DLAAAE KV +  
Sbjct: 130 KELMEKLEASESLIKNLQSEVQALKAELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLG 189

Query: 517 IRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
             +++ NG+ +SP FK IQKLIA+KLE S VKK
Sbjct: 190 -GNEKPNGEHQSPKFKHIQKLIADKLERSIVKK 221


>ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera]
          Length = 551

 Score =  152 bits (384), Expect = 7e-35
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S  ++ +        S SPA   RA+S P  +    K +RSL+LNK KS
Sbjct: 19  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 78

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+  LGSQ       VKV+ RS NRPVV+Q A     P+  S     ++  KELQE+L L
Sbjct: 79  GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 133

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
            + LI +LQS+VL LKAELDKAQS N EL+  N KL +DLAAA AK+ A   R Q ES  
Sbjct: 134 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 193

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIANKLE+  +K+
Sbjct: 194 EYQSPKFKDIQKLIANKLEHPKIKQ 218


>emb|CBI26022.3| unnamed protein product [Vitis vinifera]
          Length = 572

 Score =  152 bits (384), Expect = 7e-35
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S  ++ +        S SPA   RA+S P  +    K +RSL+LNK KS
Sbjct: 40  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 99

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+  LGSQ       VKV+ RS NRPVV+Q A     P+  S     ++  KELQE+L L
Sbjct: 100 GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 154

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
            + LI +LQS+VL LKAELDKAQS N EL+  N KL +DLAAA AK+ A   R Q ES  
Sbjct: 155 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 214

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIANKLE+  +K+
Sbjct: 215 EYQSPKFKDIQKLIANKLEHPKIKQ 239


>emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]
          Length = 348

 Score =  149 bits (377), Expect = 4e-34
 Identities = 97/205 (47%), Positives = 127/205 (61%), Gaps = 6/205 (2%)
 Frame = +1

Query: 19  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNKQKS 198
           +P++ S S++ S  ++ +        S SPA   RA+S P  +    K +RSL+LNK KS
Sbjct: 40  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 99

Query: 199 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 363
           G+  LGSQ       VKV+ RS NRPVV+Q A     P+  S     ++  KELQE+L L
Sbjct: 100 GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 154

Query: 364 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 540
            + LI +LQS+VL LKAELDKAQS N EL+  N KL +DLAAA AK+ A   R Q ES  
Sbjct: 155 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 214

Query: 541 KQKSPIFKDIQKLIANKLENSTVKK 615
           + +SP FKDIQKLIA KLE+  +K+
Sbjct: 215 EYQSPKFKDIQKLIAXKLEHPKIKQ 239


>gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
           gi|561011661|gb|ESW10568.1| hypothetical protein
           PHAVU_009G220500g [Phaseolus vulgaris]
          Length = 584

 Score =  148 bits (373), Expect = 1e-33
 Identities = 99/213 (46%), Positives = 131/213 (61%), Gaps = 18/213 (8%)
 Frame = +1

Query: 31  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPGVKKDSKLKRSLMLNK 189
           +  S TPSR RL SK +E         NG +S     RAKSV P +K  S++KR L+LNK
Sbjct: 28  LQSSLTPSRLRLPSKYREPPRTPPEVVNGVVSTPTR-RAKSVTPELKHASRIKRGLVLNK 86

Query: 190 QKSGEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----KE 342
            K  E+++G+  G      K V R +    VEQFA PR      +  R KEE D    KE
Sbjct: 87  AKPNEEVVGTHRGREAVEPKAVPRFMRPHAVEQFASPRSAVGDFAMKRDKEEPDGKSKKE 146

Query: 343 LQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--H 516
           L E+L+++E LI++LQS+VLALKAEL+K +  N ELE  N+KL +D+AAAE+KV +    
Sbjct: 147 LMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVELESHNRKLTKDIAAAESKVMSLGGS 206

Query: 517 IRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
            + +E  G+ +SP FK IQKLIA+KLE S VKK
Sbjct: 207 EKMKEPIGEHQSPKFKHIQKLIADKLERSRVKK 239


>gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, putative isoform
           5, partial [Theobroma cacao]
          Length = 458

 Score =  141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A   RD     +ESNG  +S  FKDIQ+ IANKLE+  + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224


>gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 4
           [Theobroma cacao]
          Length = 565

 Score =  141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A   RD     +ESNG  +S  FKDIQ+ IANKLE+  + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224


>gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao]
          Length = 561

 Score =  141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A   RD     +ESNG  +S  FKDIQ+ IANKLE+  + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224


>gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao]
          Length = 564

 Score =  141 bits (356), Expect = 1e-31
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 18/221 (8%)
 Frame = +1

Query: 7   KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDSK-LKRS 174
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 175 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 339
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 340 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 507
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 508 AFHIRD-----QESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           A   RD     +ESNG  +S  FKDIQ+ IANKLE+  + +
Sbjct: 184 ALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKITR 224


>gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum]
          Length = 554

 Score =  136 bits (343), Expect = 4e-30
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 14/206 (6%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPGVKKDS-----KLKRSLMLNKQKSGE 204
           STTPSR R       AAN   S     RA+    G  K S     K +RS++L + KSGE
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQD-DNGKPKSSGHDPGKNRRSILLKRAKSGE 59

Query: 205 Q---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDSSARRKEEGDK-----ELQERL 357
           +   +L  Q      ARSVNRP VVEQF  PRR     S     EE +K     EL+E+L
Sbjct: 60  EETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEEMAAEEDEKRKKMEELEEKL 114

Query: 358 QLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESN 537
             NE LIKDLQ+QV +LKAEL++A+SSN ELEL+N+KL+QDL +AEAK+++    D+ + 
Sbjct: 115 VANESLIKDLQAQVSSLKAELEEARSSNAELELKNRKLSQDLVSAEAKISSLSSNDKPAK 174

Query: 538 GKQKSPIFKDIQKLIANKLENSTVKK 615
             Q +  FKDIQK+IA+KLE S VKK
Sbjct: 175 EHQNTR-FKDIQKIIASKLEQSKVKK 199


>ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798183 [Glycine max]
          Length = 565

 Score =  131 bits (330), Expect = 1e-28
 Identities = 89/196 (45%), Positives = 123/196 (62%), Gaps = 7/196 (3%)
 Frame = +1

Query: 49  PSRFRLNSKTKEAANGSLS--PAQEARAKSVPPGVKKDSKLKRSLMLNKQKSGEQLLGSQ 222
           P R R +SK  ++    ++       RA+SVPP +K  S+ KR +++NK K  E++LGSQ
Sbjct: 33  PPRLRASSKAPKSPPEVVNRESISSTRAESVPPDLKNVSRAKRGVVVNKPKLNEEVLGSQ 92

Query: 223 NG----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQ 390
                 + +VAR   R  V  F   R+    DS  ++K+E    LQE+L+++E LIK LQ
Sbjct: 93  KAEEGKIVIVARPRRR--VGDFGS-RKSEDDDSHGKKKKE---LLQEKLEVSENLIKSLQ 146

Query: 391 SQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI-RDQESNGKQKSPIFKD 567
           S+VLAL+ ELD+ +S N ELE QN KL Q+LAAAEAK++   I  + +  G+ +SP FKD
Sbjct: 147 SEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKPIGEHRSPKFKD 206

Query: 568 IQKLIANKLENSTVKK 615
           IQKLIA KLE S VKK
Sbjct: 207 IQKLIAEKLERSRVKK 222


>ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp.
           lyrata] gi|297337263|gb|EFH67680.1| hypothetical protein
           ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score =  129 bits (323), Expect = 8e-28
 Identities = 98/218 (44%), Positives = 128/218 (58%), Gaps = 26/218 (11%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPGVKKD-SKLKRSLML 183
           STTPSR R       AAN   S   + RA           KS    VK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGHDVKNDPAKNRRSILL 60

Query: 184 NKQKSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARR 321
            + K GE+   +L  Q      ARSVNRP VVEQF  PRR          +     +   
Sbjct: 61  KRAKYGEEETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKTEESVMATAVVAEDE 115

Query: 322 KEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAK 501
           K +  +EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL+NKKL+QDLA+AEAK
Sbjct: 116 KRKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNAELELKNKKLSQDLASAEAK 175

Query: 502 VAAFHIRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           +++    D+ +   Q +  FKDIQ+LIA+KLE S VKK
Sbjct: 176 ISSLSSNDKPAKEHQNTR-FKDIQRLIASKLEQSKVKK 212


>ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Capsella rubella]
           gi|482576206|gb|EOA40393.1| hypothetical protein
           CARUB_v10009119mg [Capsella rubella]
          Length = 450

 Score =  128 bits (322), Expect = 1e-27
 Identities = 98/217 (45%), Positives = 128/217 (58%), Gaps = 25/217 (11%)
 Frame = +1

Query: 40  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPGVKKD-SKLKRSLML 183
           STTPSR R       AAN   S     RA           K     VK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGLAGGKPKHSNHDVKNDPAKNRRSILL 60

Query: 184 NKQKSGEQLLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARRKEE 330
            K KSG++   +   V   ARSVNRP VVEQF  PRR          +   +++A   E+
Sbjct: 61  RKAKSGDEETTAVL-VPQRARSVNRPAVVEQFGCPRRPISRKIEETVMSTAEAAAEEDEK 119

Query: 331 GDK--ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKV 504
             +  EL+E+L +NE LIKDLQ QVL LK EL++A+SSN ELEL N+KL+QDLA+AEAK+
Sbjct: 120 RKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARSSNVELELNNRKLSQDLASAEAKI 179

Query: 505 AAFHIRDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           ++    D+ +   Q S  FKDIQ+LIA+KLE S V+K
Sbjct: 180 SSLSSNDKPAKEHQNSR-FKDIQRLIASKLEQSKVRK 215


>ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511271 [Cicer arietinum]
          Length = 933

 Score =  127 bits (319), Expect = 2e-27
 Identities = 91/212 (42%), Positives = 132/212 (62%), Gaps = 16/212 (7%)
 Frame = +1

Query: 28  AMSHSTTPSRFRL---NSKTKEA-------ANGSLSPAQEARAKSVPPGVKKDSKLKRSL 177
           ++  +TT +R R+   +SK KE+        N + +     RAKSVPP +K +SK KR +
Sbjct: 67  SIPQTTTTTRIRVVKGSSKIKESPKTPPEIVNNNRASISSTRAKSVPPDLKNNSKAKRGI 126

Query: 178 MLNKQ--KSGEQL-LGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQ 348
           ++  +  KS E++   SQ G K    +  + VV    +PRR    D    +++   KE+ 
Sbjct: 127 VVMNKLVKSNEEVECSSQKGTKEAEEA--KIVV---VRPRRRRTNDDPDEKEK---KEMV 178

Query: 349 ERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF---HI 519
           E+L++++ LIK+L+S+V ALKAELDK ++ N ELE QN KL Q+LAAAEAK+AA    + 
Sbjct: 179 EKLEMSDNLIKNLESEVKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNS 238

Query: 520 RDQESNGKQKSPIFKDIQKLIANKLENSTVKK 615
           R +E  G+ +SP FKDIQKLIA+KLE S VKK
Sbjct: 239 RKKELIGEHQSPKFKDIQKLIADKLEMSKVKK 270

