BLASTX nr result

ID: Rehmannia23_contig00028136 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00028136
         (869 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   143   9e-32
gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g...   121   4e-25
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   113   8e-23
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   108   3e-21
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   106   1e-20
gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g...   105   2e-20
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   102   1e-19
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   102   2e-19
gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g...   101   4e-19
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   100   9e-19
gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]    98   3e-18
gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich g...    98   5e-18
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]    98   5e-18
gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich g...    96   2e-17
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...    96   2e-17
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...    95   3e-17
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]      95   4e-17
gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g...    94   6e-17
emb|CBI22611.3| unnamed protein product [Vitis vinifera]               94   6e-17
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...    94   8e-17

>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  143 bits (360), Expect = 9e-32
 Identities = 80/199 (40%), Positives = 121/199 (60%), Gaps = 4/199 (2%)
 Frame = +2

Query: 65  QGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXX 244
           Q YPLAPS I+PRSD E+ATNN FQS +Q +KKK    L                     
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNN-FQSNNQRRKKK----LRSTFLLTIFLTGIILLFCFTF 59

Query: 245 XXXRTPRVRLDDITVTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGN-TNMGQF 421
              ++P++R+++I +T+D   G + F+ +V +RNRNF RY +DS+L TI +   T +G+F
Sbjct: 60  LRIKSPKIRIENIRITNDGD-GRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRF 118

Query: 422 VIHDARARARSTRRIYVIADLRVPGA---NSSVLDLNIEARLRGKVRLIRVIRRNRSADM 592
           VI D   R RST+ IYV+ +  +P      S +L +  EA++RGKV++ RV R  ++ D+
Sbjct: 119 VIPDGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDL 178

Query: 593 NCTMRINLSTNVVQNLRCE 649
           +CTM INL+ + +Q+L C+
Sbjct: 179 SCTMSINLTISAIQDLDCQ 197


>gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 215

 Score =  121 bits (303), Expect = 4e-25
 Identities = 78/222 (35%), Positives = 125/222 (56%), Gaps = 18/222 (8%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAEK QQ    +PLAP+   PRSDEE A+    QS++ +K+KKR+K              
Sbjct: 1   MAEKDQQV---HPLAPANGHPRSDEESAS---LQSKE-LKRKKRIKYAVYIAAFAVFQTV 53

Query: 218 XXXXXXXXXXXXRTPRVRLDDITV-------TSDASTGNVRFTGRVSVRNRNFGRYEFDS 376
                       + P+VR+  +TV       T  A++ N+RF  +V+V+N NFG Y+FD+
Sbjct: 54  VILIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDN 113

Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGA-----------NSSVLDLN 523
           +  +       +G+ +I  ARARARST+++ V  ++                +SSVL LN
Sbjct: 114 ATMSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLN 173

Query: 524 IEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
            +A+L+GKV L++V+++ +S +MNCT+  N+ST  +Q+L+C+
Sbjct: 174 SQAKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  113 bits (283), Expect = 8e-23
 Identities = 71/209 (33%), Positives = 111/209 (53%), Gaps = 5/209 (2%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MA+K+QQ    YPLAPS    RSD E        S+D++K+KKR+KC             
Sbjct: 1   MADKHQQV---YPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 51

Query: 218 XXXXXXXXXXXXRTPRVRLDDIT----VTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLA 385
                       +TP+VRLD  +    VTS  ++ +  F  ++ V+N N+G Y+FD  + 
Sbjct: 52  VGAVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVV 111

Query: 386 TIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS-VLDLNIEARLRGKVRLIR 562
           T +   T +G F +   +A  R T++I     L     NSS  L L  EA+L GKV L+ 
Sbjct: 112 TFKYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMF 171

Query: 563 VIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           ++++ +SA MNCT++I++S   V+++ C+
Sbjct: 172 IMKKKKSASMNCTIQIDVSGQTVKSVVCK 200


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  108 bits (269), Expect = 3e-21
 Identities = 71/217 (32%), Positives = 107/217 (49%), Gaps = 13/217 (5%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAEK Q+  Q YPLA      RSD E        S+D++K+KKR+KC             
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 54

Query: 218 XXXXXXXXXXXXRTPRVRLDDIT---VTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLAT 388
                       +TP+VRL   T   VTS  ++ +  F  ++ V+N N+G Y+FD  + T
Sbjct: 55  IGAVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVT 114

Query: 389 IRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS----------VLDLNIEARL 538
                  +G  V+   +A  R T++I V   L      SS          VL L  EA+L
Sbjct: 115 FMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKL 174

Query: 539 RGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
            GKV L+ ++++ +SA MNCT++I++S   V++L C+
Sbjct: 175 TGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  106 bits (264), Expect = 1e-20
 Identities = 74/219 (33%), Positives = 113/219 (51%), Gaps = 15/219 (6%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAEK  Q    YPLAP+    RSD E     +  S+D++K++KR K              
Sbjct: 1   MAEKTNQ---AYPLAPANGYTRSDGE-----SLVSEDELKRQKRRKLFMYIGIFIVVQII 52

Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTG-----NVRFTGRVSVRNRNFGRYEFDSSL 382
                       +TP+VRL  I V S  S       +  FT ++ V+N N+G Y+FD+S 
Sbjct: 53  VMTVFGLTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDAST 112

Query: 383 ATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR---VPGA-------NSSVLDLNIEA 532
           AT       +GQ  I  ++AR RST++I V   L    +P +       NS +L L  +A
Sbjct: 113 ATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQA 172

Query: 533 RLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           +L GKV L+ ++++ +SA M+CT+  +LST  V++L+C+
Sbjct: 173 KLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 259

 Score =  105 bits (263), Expect = 2e-20
 Identities = 59/187 (31%), Positives = 101/187 (54%), Gaps = 17/187 (9%)
 Frame = +2

Query: 137 QSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRL-----DDITVTSDA 301
           +   ++K+KKRMKCL                        + P+ R+     DD+T  + +
Sbjct: 11  EQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS 70

Query: 302 STGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHD--ARARARSTRRIYVI 475
            + N++F  +V+V+N NFG Y+F++S  T     + +G+ ++    ARARARST+++ V 
Sbjct: 71  PSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVT 130

Query: 476 ADLRVPGA----------NSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTN 625
            DL   G           NS  L L  ++ L GKV L++VI++ +S +MNCTM +NL+  
Sbjct: 131 MDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQK 190

Query: 626 VVQNLRC 646
           +V++++C
Sbjct: 191 LVRDIKC 197


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  102 bits (255), Expect = 1e-19
 Identities = 67/211 (31%), Positives = 102/211 (48%), Gaps = 16/211 (7%)
 Frame = +2

Query: 65  QGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXX 244
           Q YPLAPS    RSD E        S+D++K+KKR+KC                      
Sbjct: 7   QSYPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTI 60

Query: 245 XXXRTPRVRLD-----DITVTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 409
              +TP+VRL      D T +  A + +  F  ++ V+N N+G Y+FD  + T       
Sbjct: 61  MKVKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMP 120

Query: 410 MGQFVIHDARARARSTRRIYVIADLRVPGANSS-----------VLDLNIEARLRGKVRL 556
           +G  V+   +A  R T++I V   L      SS           VL L  EA+L GKV L
Sbjct: 121 VGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVEL 180

Query: 557 IRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           + ++++ +SA MNCT++I++S   V++L C+
Sbjct: 181 MLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  102 bits (253), Expect = 2e-19
 Identities = 68/219 (31%), Positives = 109/219 (49%), Gaps = 15/219 (6%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAEK  Q    YPLAP+    RSD E     +  S+D++K++KR++              
Sbjct: 1   MAEKTHQ---AYPLAPANGYTRSDGE-----SLVSKDELKRRKRIRLFTYIGIFIVFQII 52

Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTG-----NVRFTGRVSVRNRNFGRYEFDSSL 382
                       +TP+VRL +I V    S       +  FT ++ V+N N+G Y+FD+S 
Sbjct: 53  VMTVFGLTVMKVKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDAST 112

Query: 383 ATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGA----------NSSVLDLNIEA 532
            T       +GQ  +   +A  RST+++ V   L   G           NS VL LN +A
Sbjct: 113 VTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQA 172

Query: 533 RLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           +L GKV L+ ++++ +S+ M+C +  +LST  V++L+C+
Sbjct: 173 KLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 201

 Score =  101 bits (251), Expect = 4e-19
 Identities = 58/200 (29%), Positives = 102/200 (51%), Gaps = 18/200 (9%)
 Frame = +2

Query: 104 SDEEYATNN-NFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDD 280
           +++ Y   N + +S  ++K+KKRMK                          + P+ R+  
Sbjct: 2   AEQNYQQKNIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRS 61

Query: 281 ITVTSDASTG-------NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDAR 439
           ITV   A T        N++F   V+V+N NFG ++FD++  +   G   +G+  +   R
Sbjct: 62  ITVEDIAYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGR 121

Query: 440 ARARSTRRIYVIADLR---VPG-------ANSSVLDLNIEARLRGKVRLIRVIRRNRSAD 589
           A+ARST+++ V  DL    +P         +S  L L    +L GKV L+++I++ +SA 
Sbjct: 122 AKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQ 181

Query: 590 MNCTMRINLSTNVVQNLRCE 649
           MNCTM +NL++  +Q+++C+
Sbjct: 182 MNCTMTVNLASRAIQDIKCQ 201


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  100 bits (248), Expect = 9e-19
 Identities = 64/216 (29%), Positives = 110/216 (50%), Gaps = 13/216 (6%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAE+ Q+     P A    + RSD E   ++   S  +++KKKR+KCL            
Sbjct: 1   MAERNQEAYPFAPYANGQAMARSDAE---SSRAHSDHELRKKKRIKCLIYIAVFAVFQII 57

Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTGN-------VRFTGRVSVRNRNFGRYEFDS 376
                       ++P+ R+  ITV  D +T N       + F   VSV+N NFGRY++D 
Sbjct: 58  VITVFALTVMKIKSPKFRIKSITV-QDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQ 116

Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGAN------SSVLDLNIEARL 538
           +  +     T +G  V+  A AR ++TR+  V   ++   +N      +  + L+  +++
Sbjct: 117 TSISFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKI 176

Query: 539 RGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646
            GKV L+ +I++ +SA+M CTM ++LS+  VQ+++C
Sbjct: 177 NGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212


>gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
          Length = 214

 Score = 98.2 bits (243), Expect = 3e-18
 Identities = 65/210 (30%), Positives = 106/210 (50%), Gaps = 17/210 (8%)
 Frame = +2

Query: 71  YPLAPSTIV-PRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXX 247
           YPL P+     RSDEE    ++     ++KKKKRMKCL                      
Sbjct: 9   YPLVPAANGHERSDEESVAAHS----KELKKKKRMKCLLYIVLFAVFQTGIILLFALTVM 64

Query: 248 XXRTPRVRLDD-----ITVTSDASTG-NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 409
             R P+ R+         V ++AS   +++   + +V+N NFG ++++  L T     T 
Sbjct: 65  RIRNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTP 124

Query: 410 MGQFVIHDARARARSTRRIYVIADLR---VPGAN-------SSVLDLNIEARLRGKVRLI 559
           +G+  I  ARARARST+++ V+ +L    +P  N       + VL L   ++L GK+ L+
Sbjct: 125 VGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLM 184

Query: 560 RVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           +VI++ +S  MNCTM + + T  V+N+ C+
Sbjct: 185 KVIKKKKSTQMNCTMDVAIDTRTVRNIICK 214


>gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 213

 Score = 97.8 bits (242), Expect = 5e-18
 Identities = 67/215 (31%), Positives = 101/215 (46%), Gaps = 16/215 (7%)
 Frame = +2

Query: 53  QQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXX 232
           Q+  Q  PLAP    PRSD E+       SQ   +K+K  KCL                 
Sbjct: 2   QEDPQAKPLAPVEYYPRSDMEFGGIKPTASQ---RKEKSSKCLVYVLVGMVIQGAVLLIF 58

Query: 233 XXXXXXXRTPRVRLDDITV------TSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIR 394
                  RTP V +  +TV       S A + N+     V+V N NFG ++F+++  T+ 
Sbjct: 59  ASIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVW 118

Query: 395 SGNTNMGQFVIHDARARARSTRRIYVIAD---LRVPGA-------NSSVLDLNIEARLRG 544
            G+  +G+  I   RA+AR+T R+ V  D   L +P         +S +L+LN   +L G
Sbjct: 119 CGSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSG 178

Query: 545 KVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           KV ++  ++R R  +MNC M +NL+    Q+  CE
Sbjct: 179 KVSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPCE 213


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score = 97.8 bits (242), Expect = 5e-18
 Identities = 58/179 (32%), Positives = 98/179 (54%), Gaps = 13/179 (7%)
 Frame = +2

Query: 152 MKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDDITVTSDASTGN---VRF 322
           +++KK +KCL                        R P+VR+  I+V +   + N   +  
Sbjct: 8   VRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMDL 67

Query: 323 TGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYV---IADLRVP 493
             RV+V+N NFG ++FD+S ATI    T +G+  I  ARAR+RST+R  +   I+  +V 
Sbjct: 68  KARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVN 127

Query: 494 G-------ANSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
                    NS VL+L+  A+L GK+ L ++ ++ +SA+M+CTM ++ +T+ ++NL C+
Sbjct: 128 NHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186


>gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 188

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 57/184 (30%), Positives = 93/184 (50%), Gaps = 16/184 (8%)
 Frame = +2

Query: 146 DQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDDITV------TSDAST 307
           ++ K+ + MKC                         +TP  RL  +TV       S    
Sbjct: 5   EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64

Query: 308 GNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR 487
            N+R    ++V+N+NFG + FD++ A +  G+  +G   I  +RARAR T+R+ V  D+ 
Sbjct: 65  FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124

Query: 488 VPGAN----------SSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQN 637
               +          S  L L   ARLRGKV L++++++ ++A+MNCTM +NL+++ VQ+
Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184

Query: 638 LRCE 649
           L CE
Sbjct: 185 LDCE 188


>ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis]
           gi|223547534|gb|EEF49029.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 217

 Score = 95.5 bits (236), Expect = 2e-17
 Identities = 68/226 (30%), Positives = 111/226 (49%), Gaps = 22/226 (9%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVP----RSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXX 205
           MAEK Q        AP+ +V     RSDEE  T    Q+++ ++KKKRMKC+        
Sbjct: 1   MAEKEQ--------APTPLVADGQTRSDEESGTAGTAQTKE-LRKKKRMKCIAFVVAFTI 51

Query: 206 XXXXXXXXXXXXXXXXRTPRVRL------DDITVTSDASTG--NVRFTGRVSVRNRNFGR 361
                           + P+ R+      D   V +DA+    N+    +  V+N NFG 
Sbjct: 52  FQTGIILLFVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGH 111

Query: 362 YEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVP----------GANSSV 511
           +++++S  T     T +G   +  ARARARSTR+   I  LR              +S  
Sbjct: 112 FKYETSTVTFEYRGTVVGLVNVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISSGK 171

Query: 512 LDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           + L+  +RL G++ L++VI++ +SA+MNCTM +++ T  +Q++ C+
Sbjct: 172 IPLSSSSRLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVCK 217


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score = 95.1 bits (235), Expect = 3e-17
 Identities = 71/223 (31%), Positives = 110/223 (49%), Gaps = 19/223 (8%)
 Frame = +2

Query: 38  MAEKYQQQAQG-YPLAPST-IVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXX 211
           MAE  +  A   YPL PS     RSD+E A +    S ++++ KKRM+CL          
Sbjct: 1   MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAP-PSAEELRHKKRMRCLLYVSIFAVFQ 59

Query: 212 XXXXXXXXXXXXXXRTP--RVRLDDITVTSDASTGNVRFTGRVSV----RNRNFGRYEFD 373
                         ++P  RVR   IT     S  N  F   + V    +N NFG +E++
Sbjct: 60  VVVITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYE 119

Query: 374 SSLATIRSGNTNMGQFVIHDARARARSTRRIYVIA-DLRVPG--ANS--------SVLDL 520
             +      +  +GQ  + + R RARSTR++ V + DL   G  ANS         ++ +
Sbjct: 120 DGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPI 179

Query: 521 NIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
            I ++L GK+ L+++I++ +SA MNCTM + L+T  VQN+ C+
Sbjct: 180 TISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score = 94.7 bits (234), Expect = 4e-17
 Identities = 65/220 (29%), Positives = 113/220 (51%), Gaps = 17/220 (7%)
 Frame = +2

Query: 38  MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217
           MAE+YQQ    YPLAP+   PRSDEE   ++N  +++ +K++KR+K              
Sbjct: 1   MAERYQQV---YPLAPANGHPRSDEE---SSNLDAKE-LKRRKRIKLAIYAFIFTASQII 53

Query: 218 XXXXXXXXXXXXRTPRVRLDDI-------TVTSDASTGNVRFTGRVSVRNRNFGRYEFDS 376
                       ++P++RL D        T +    + ++ FT ++ V+N N+G Y+FD+
Sbjct: 54  VTLVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDN 113

Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS----------VLDLNI 526
           + A        +GQ VI   +A  RST+++ V   L      ++          +L L  
Sbjct: 114 TTAAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRC 173

Query: 527 EARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646
            A++ GKV+L+ ++++ +SA+MNCT+ I++    V NL+C
Sbjct: 174 TAKMTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao]
          Length = 185

 Score = 94.0 bits (232), Expect = 6e-17
 Identities = 55/149 (36%), Positives = 88/149 (59%), Gaps = 17/149 (11%)
 Frame = +2

Query: 254 RTPRVRLDDITVTSDASTGN-------VRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNM 412
           + P+VR   +TV  + STGN       +R   +V+V+N NFG +++++S   I  G   +
Sbjct: 38  KNPKVRFGAVTV-ENFSTGNSSSPFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPV 96

Query: 413 GQFVIHDARARARSTRRIYVIADLRVPGAN----------SSVLDLNIEARLRGKVRLIR 562
           G+  I  ARARAR T++  V  D+     +          S VL L+ EA+L GKV L++
Sbjct: 97  GEATIVKARARARQTKKFDVTIDISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMK 156

Query: 563 VIRRNRSADMNCTMRINLSTNVVQNLRCE 649
           VI++ +S++M+CTM IN+ T  VQ+L+C+
Sbjct: 157 VIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score = 94.0 bits (232), Expect = 6e-17
 Identities = 65/232 (28%), Positives = 114/232 (49%), Gaps = 17/232 (7%)
 Frame = +2

Query: 2   FLSSNKQKIYKTMAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCL 181
           FL+ ++ K  K MA+K QQ    +P+ P+    ++D E          +++++ K  + +
Sbjct: 78  FLNFSRAKA-KKMAQKKQQV---HPIEPTGGPAKTDVE---------SEELRRMKCTRYI 124

Query: 182 XXXXXXXXXXXXXXXXXXXXXXXXRTPRVR-----LDDITVTSDASTG--NVRFTGRVSV 340
                                   R+P+ R     ++++  TSD ++   N+RF  +V+V
Sbjct: 125 AYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAV 184

Query: 341 RNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR----------V 490
           +N NFG ++F +S  T+     ++G   I  ARARARST+++ V  D+            
Sbjct: 185 KNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLA 244

Query: 491 PGANSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646
              NS  L L  + +L GKV L++V ++ +S  MNCT++INL   V+Q  +C
Sbjct: 245 SDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score = 93.6 bits (231), Expect = 8e-17
 Identities = 63/222 (28%), Positives = 108/222 (48%), Gaps = 18/222 (8%)
 Frame = +2

Query: 38  MAEKYQQ---QAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXX 208
           MAEK++      +GY         + DE+  T   FQS++++K++KR+K           
Sbjct: 1   MAEKFKHALASVKGY-------ATKKDEQLPT---FQSEEELKRQKRIKLFTYIGIFIGF 50

Query: 209 XXXXXXXXXXXXXXXRTPRVRL-----DDITVTSDASTGNVRFTGRVSVRNRNFGRYEFD 373
                          +TP+VRL      ++     + + +  F  ++ ++N N+G Y+FD
Sbjct: 51  QIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIRIKNTNWGPYKFD 110

Query: 374 SSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR---VPGAN-------SSVLDLN 523
           +  AT       +GQ     ++A  RST++I     L    +P  +       S VL L 
Sbjct: 111 AGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLT 170

Query: 524 IEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649
            EA+L GKV L+ ++++ +SA MNCTM+++LST  +Q L C+
Sbjct: 171 SEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


Top