BLASTX nr result

ID: Rehmannia25_contig00009967 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00009967
         (1781 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004242132.1| PREDICTED: uncharacterized protein LOC101245...   124   1e-25
ref|XP_006347331.1| PREDICTED: myb-like protein X-like [Solanum ...   124   2e-25
gb|EOY10007.1| Uncharacterized protein isoform 1 [Theobroma caca...    99   6e-18
ref|NP_197218.2| uncharacterized protein [Arabidopsis thaliana] ...    92   7e-16
gb|EPS71228.1| hypothetical protein M569_03533 [Genlisea aurea]        91   2e-15
ref|XP_002884360.1| hypothetical protein ARALYDRAFT_477561 [Arab...    91   2e-15
ref|XP_002520009.1| conserved hypothetical protein [Ricinus comm...    89   5e-15
ref|XP_002274897.2| PREDICTED: uncharacterized protein LOC100259...    89   6e-15
ref|XP_006297493.1| hypothetical protein CARUB_v10013512mg [Caps...    87   3e-14
ref|XP_006400246.1| hypothetical protein EUTSA_v10013127mg [Eutr...    86   5e-14
ref|XP_006371235.1| hypothetical protein POPTR_0019s06950g [Popu...    85   9e-14
ref|XP_002873814.1| hypothetical protein ARALYDRAFT_488576 [Arab...    85   9e-14
ref|XP_002331774.1| predicted protein [Populus trichocarpa]            85   9e-14
ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana] ...    85   1e-13
gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thali...    84   2e-13
emb|CBI26558.3| unnamed protein product [Vitis vinifera]               83   4e-13
gb|ESW03530.1| hypothetical protein PHAVU_011G021300g [Phaseolus...    77   2e-11
ref|XP_006287470.1| hypothetical protein CARUB_v10000682mg [Caps...    77   2e-11
ref|XP_003538933.1| PREDICTED: dentin sialophosphoprotein-like [...    77   2e-11
gb|EXB75013.1| hypothetical protein L484_012137 [Morus notabilis]      74   2e-10

>ref|XP_004242132.1| PREDICTED: uncharacterized protein LOC101245265 [Solanum
           lycopersicum]
          Length = 704

 Score =  124 bits (312), Expect = 1e-25
 Identities = 96/275 (34%), Positives = 136/275 (49%), Gaps = 19/275 (6%)
 Frame = +1

Query: 1   ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVP 180
           ALCKKNKIPANITNVAMA+AL +LE V+GIEE+++  +S  A SS ES   SE  +    
Sbjct: 13  ALCKKNKIPANITNVAMADALQSLEFVDGIEEVLKTCESDVANSSMESPGKSEALASV-- 70

Query: 181 PMAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKG 360
           P  GR T +    K + E++            T V+D +++K D +ETPAL     R++ 
Sbjct: 71  PRTGRRTTQRKTIKHDSETMQTTTRSHCRTRGTVVRDIDEAKKDMLETPAL--PTTRRRA 128

Query: 361 QMASACWKMDSQLKEC---------AEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYS 513
              S   K++S +KEC          EE+KKDV  TPA   +T           V++V  
Sbjct: 129 ATTSVRVKLESAMKECEPKEEIVDQVEEEKKDVPKTPAA-ALTSQRKEVKAKSSVRQV-- 185

Query: 514 TYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGE-----NEEMKDAN-----ELS 663
            YSTRRSVRLA              S     + ++++ E     N E+  A+     +  
Sbjct: 186 -YSTRRSVRLAGKPTQESSTQEDEKSGTLTFDAVSEETEESLEVNSELHSAHKSEILDKK 244

Query: 664 GITDVDATTMEEKFENEDKSVVVSNQKQDLSIGEE 768
           GI    + +++ K E++  SV  SN      IG E
Sbjct: 245 GIDLKSSESLDMKNESDTLSVQNSNTLVQNKIGME 279



 Score = 62.4 bits (150), Expect = 6e-07
 Identities = 43/101 (42%), Positives = 59/101 (58%)
 Frame = +1

Query: 1300 TPSKHSASKASMTLKRMTGFSDNKENIGNGSKLFVMEDVKMAKNTVEDNLHELSVRKLSK 1479
            TP ++S++  S+    +T   DNKEN+            K  K T  DNL  LS+RKL+K
Sbjct: 621  TPVENSSAVTSIGQMLVT---DNKENL---------VCTKENKGTAGDNLQNLSLRKLTK 668

Query: 1480 MLKEKLEITKKSSKNENGNEVLSRPALQALPENRLVDETQN 1602
            MLK+ L I    SKN +G E ++R ALQ +PENRL+ E +N
Sbjct: 669  MLKD-LNI----SKNPSGKEAVTRSALQKVPENRLISENEN 704


>ref|XP_006347331.1| PREDICTED: myb-like protein X-like [Solanum tuberosum]
          Length = 683

 Score =  124 bits (310), Expect = 2e-25
 Identities = 122/434 (28%), Positives = 188/434 (43%), Gaps = 9/434 (2%)
 Frame = +1

Query: 1    ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVP 180
            ALCKKNKIPAN+TNVAMA+AL +LE V+GIEE+++  +S  A  S ES   SE  +    
Sbjct: 13   ALCKKNKIPANLTNVAMADALQSLEFVDGIEEVLKTCESDVANPSMESPGKSEALASV-- 70

Query: 181  PMAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKG 360
            P  GR T +    K + E++            T V+D +++K D +ETPAL     R++ 
Sbjct: 71   PRTGRRTTQRKTIKHDSETLQTTTRSHCRTRGTVVRDVDEAKNDMLETPAL--PTTRRRA 128

Query: 361  QMASACWKMDSQLKEC---------AEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYS 513
               S   K++S +KEC          EE+KKDV  TPA   +T           V++V  
Sbjct: 129  ATTSVRAKLESAMKECEPKVEIVDPVEEEKKDVPKTPAA-ALTSQRKEVKAKSSVRQV-- 185

Query: 514  TYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTM 693
             YSTRRSVRLA                             E     +E SG    DA + 
Sbjct: 186  -YSTRRSVRLAGKPM------------------------QESSTQEDEKSGTLTFDAVS- 219

Query: 694  EEKFENEDKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHD 873
                E  D+S+ V++  Q     EE+                   +   K  ++   S  
Sbjct: 220  ----EETDESLEVNSDLQSAHKSEELDKNG-------------IDLKSSKSLDMKNKSDT 262

Query: 874  VCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESKDNFAVGESKDVPDFNVALD 1053
            V + +    V+N   ++  E+G   Q  +A  +E V+L++K      E        VAL 
Sbjct: 263  VSVQDSNTLVQN---KIDMEDGV--QQDSANDLEVVVLDTKAKEGSEE--------VALG 309

Query: 1054 KLTELTLQQEATKDDGVAKADSDFIDHVANPLQQNKNEPDEIEILQLEENKKAFDPHADF 1233
               + + ++   + + VA+A  +      N  Q   ++P     +  + N++  +PH D 
Sbjct: 310  CNNDGSGEEPMEESEIVAEAKEEI--DFQNKSQNLGDDPKSNSDITKQPNRQD-EPHGDT 366

Query: 1234 SEHIPTSDVEMSTH 1275
            S+ +  +D E   H
Sbjct: 367  SDFMAENDGEEEGH 380


>gb|EOY10007.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508718111|gb|EOY10008.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 698

 Score = 99.0 bits (245), Expect = 6e-18
 Identities = 112/433 (25%), Positives = 167/433 (38%), Gaps = 41/433 (9%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCKKNKIPANITNVAMA+AL ALEIVEG++E M  SQS     +  S +        +P 
Sbjct: 14   LCKKNKIPANITNVAMADALKALEIVEGLDEFMNQSQSPEKTMNKSSQE--------IPS 65

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQ 363
               R++ R    K+E +S            +T   D E+   +  ETP +    +R+  +
Sbjct: 66   TVTRTSTRRKPTKEEPQSSQTTTRTRRITRRTMELDEENKNVNVPETPVVATTTSRRAQR 125

Query: 364  MASACWKMDSQLKECAEEDKKDVLMTPATM------GVTXXXXXXXXXXXVKKVYSTYST 525
                         E  E+ K D+L TPA        GV               V   Y T
Sbjct: 126  TE----------PEVEEQKKSDLLETPALQSNRRRAGVGSTRRKVEAQKDEGSVQQGYGT 175

Query: 526  RRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVD--ATTMEE 699
            RRSVRL                +  K + + +D E EE K  N  SG T  +  A  +  
Sbjct: 176  RRSVRLLEKCMEGLSLKESGRMEPVKIDEMVED-EIEENK--NRQSGATSEENLARNLSV 232

Query: 700  KFENE-DKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRD--------- 849
              E E D    V N + D ++  E                      +EK D         
Sbjct: 233  SLEGERDLKDDVHNAENDCTVISEAGSQEPDNLDCLLALDTKDASPDEKTDESDAYLAEG 292

Query: 850  ----------NVPLGSHDVCISEIEITVKNTTEELKTEE--GAKSQDKT--------AEF 969
                       +    +D  + E    + N++EEL  E   G+ + + T        AE+
Sbjct: 293  ADKLADMSDGTIDPKGYDDAVPEDSYEIDNSSEELVAENNGGSHADENTEVLDHASSAEY 352

Query: 970  VE--SVILESKDNFAVGESKDVPDFNVALDKLTELTLQQEATKDDGVAKADSDFIDHVAN 1143
            VE    ++  +    VG+  D+   N  L KL E     +A      +     F+D +  
Sbjct: 353  VEPKEAVIGEECQKLVGKDCDINVVNDDLAKLPEAEEYDDAKASQNASAIPEGFMDSLEK 412

Query: 1144 -PLQQNKNEPDEI 1179
               ++++++PD+I
Sbjct: 413  LGNEESEDDPDQI 425


>ref|NP_197218.2| uncharacterized protein [Arabidopsis thaliana]
            gi|22655304|gb|AAM98242.1| putative protein [Arabidopsis
            thaliana] gi|133778842|gb|ABO38761.1| At5g17160
            [Arabidopsis thaliana] gi|332005006|gb|AED92389.1|
            uncharacterized protein AT5G17160 [Arabidopsis thaliana]
          Length = 569

 Score = 92.0 bits (227), Expect = 7e-16
 Identities = 145/614 (23%), Positives = 235/614 (38%), Gaps = 84/614 (13%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCK+NKIPAN+TN+AMA+AL +LEIV+G++E M  S+S    S T         +   P 
Sbjct: 14   LCKRNKIPANMTNIAMADALKSLEIVDGLDEYMNQSESSAPHSPTS-------VAKLPPS 66

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQ 363
             A R+TRR    K E +  + +              +   ++ +      M Q N  K  
Sbjct: 67   TATRTTRRKTTTKAEPQPSSQLV-------------SRSCRSTSKSLAGDMDQENINK-- 111

Query: 364  MASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSVRL 543
                   +  ++K    + + +VL TPA                 + V S YSTRRS RL
Sbjct: 112  ------NVAQEMKTSNVKFEANVLKTPAAGSTRKTSAATSCTKKDELVQSVYSTRRSTRL 165

Query: 544  AXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTME-EKFENEDK 720
                           + L  K   T D +  + +D  +     + + T  E E     D 
Sbjct: 166  ----------LEKCMADLSLKTKETVDNKPAKNEDTEQKVSAQEKNLTGSEGEVIPGRDL 215

Query: 721  SVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHDVCISEIEIT 900
            SV +    ++L    +   G               + ++E+ + V     +   S +++ 
Sbjct: 216  SVSMEQVWENLKNDSDKIAGD---LEVIVVMDANTEANKEEMNEVTADKKESENSLVQVD 272

Query: 901  VKNTTEELKTEEGAKSQDKTAE--------------FVESVILESKDNFAVGESKDVPDF 1038
             +  T +   EEG K  D   E               +ES I E+ ++    ESK+V   
Sbjct: 273  KEEETLQAICEEGPKKNDNDQEIGDLVIYVDVSDIPLLESAITETHND--DNESKNVLAI 330

Query: 1039 NVALDKL-TELTLQQ-EATKDDGVAKADSDFID-HVANPLQQNKNEPDEIEIL------- 1188
            + ++D+  TE  +Q+ +A  +  V + DSD  D      +Q+N +EP++I          
Sbjct: 331  DRSVDQQETEHAIQENDAEPETKVNQTDSDAGDSKTKQAIQENDSEPEKINNFDEETMVD 390

Query: 1189 --------QLEENKKAFDPHADFSE---------------------------HIPTSDVE 1263
                    + EEN    D     SE                             P S   
Sbjct: 391  QTDSDSETEPEENHSGVDSDGTISEADSNQAVVGSDIADEEMTLSGSEGSAATAPNSPPR 450

Query: 1264 MSTHLIVKPTCLTPSKHSA--------SKASMTLKR--MTGFSDNKENIGNGSKLFVM-- 1407
            +    ++K T ++P    +        SK++  LK   +   ++NKE   N  ++ +M  
Sbjct: 451  LEEAKVIKTTLVSPFAVESISTQFPRPSKSTTPLKNSPLKLVNENKE---NNMEMMMMNV 507

Query: 1408 -----------EDVKMAKNTV-EDNLHELSVRKLSKMLKEKLEITKKSSKNENGNEVLSR 1551
                       E  K  K T+ E+NL   S+R+L KM+K   E++ K+S         +R
Sbjct: 508  NNNENGESKGEEGKKKKKVTIDEENLKNTSIRQLEKMVK---ELSIKTS---------NR 555

Query: 1552 PALQALPENRLVDE 1593
             ALQ LP N    E
Sbjct: 556  TALQVLPGNNKTAE 569


>gb|EPS71228.1| hypothetical protein M569_03533 [Genlisea aurea]
          Length = 603

 Score = 90.9 bits (224), Expect = 2e-15
 Identities = 141/599 (23%), Positives = 240/599 (40%), Gaps = 77/599 (12%)
 Frame = +1

Query: 1    ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVP 180
            +LCK+N IPAN+TNVAMA+AL+ LE VEGIE ++  ++ G ++S  ES   S+  SP+V 
Sbjct: 13   SLCKRNGIPANMTNVAMADALSDLEKVEGIEGVLSSAEYGDSESVNESSQRSDFTSPYV- 71

Query: 181  PMAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAE--DSKADAIETPA------LM 336
                RS R  NV ++   +  PM        K+   D +  D K +  +T A      L 
Sbjct: 72   ---CRSNRARNVTERGSTAAKPMRSSRRTTKKSLTMDTKGLDEKENTSKTTAVRRSARLA 128

Query: 337  AQANRKKGQMASACWKMD-SQLKECAEEDKKDVL--MTPATMGVTXXXXXXXXXXXVKKV 507
            ++  +   +  +   K+D   +K C   + K+ L  +  A   VT           +K  
Sbjct: 129  SKYGKSSREEGTDTLKLDRPAVKYCEIVNLKESLDSVDDAISSVTDVDTLETNCDVIKGE 188

Query: 508  YSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDA--NELSGITDVD 681
                S RR                      F++E    +G++ +MK    +++   T  D
Sbjct: 189  TEVVSERRQELSIDKEAAVDTVPEVEIP--FEEE----NGDDCKMKGEGFDDIQIETQND 242

Query: 682  ATTMEEKFENEDKSVVVSNQKQ-----------DLSIGEEIK-LGSSAXXXXXXXXXXXA 825
               +++  ++E+K V V + K+           DL+   E K  G  +            
Sbjct: 243  YERVDDLTKHEEKDVGVESNKEGAEKDSASLEMDLTKQHEEKDAGVESNKEGAEKDSASL 302

Query: 826  KMDEEKRDNVPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESKDNF 1005
            +MD  K    P    DV +S +   ++ +  E+  EEGA+     A     +     D  
Sbjct: 303  EMDHTK----PEEKKDVSVSPV---LRVSGVEI-DEEGAEKNSSAASPEMDLTAVENDPS 354

Query: 1006 AVGESKDVPDFNVALDKLTELT---LQQEATKDDGVAKAD--SDFIDHVANPLQQNKNEP 1170
              G    + DF  A D +TE+    +Q E   +  V   +  + F     N  + + +E 
Sbjct: 355  EKGHELSM-DFKEAKDVITEIAGSYIQHEKCSEADVFNNNNTASFSSDEVNVKEADHDER 413

Query: 1171 DEIEILQLEENKKAFDPHADFSEHIPTSDVEMSTHLIVKPTCLTPSKHSASKASMTLKRM 1350
              I+  + +++     PH   S  + TSD + S H + +     PS    S +SM   + 
Sbjct: 414  SSIDADEFQDSSNEAPPH---SVVLLTSDDDDSVHGVSEEKEEEPSSFCISSSSMPAVQK 470

Query: 1351 T----------GFSDNKENIGNGS--------------------------KLFVM----- 1407
            T          G   +K ++  G+                           +FV      
Sbjct: 471  TNKCGGGGGGGGARWSKHHVKGGNLNLTGSATFDSCLPIPKDLLAPERRKSVFVKTNHHR 530

Query: 1408 --EDVKMAKNTVEDNLHELSVRKLSKMLKEKLEITK----KSSKNENGNEVLSRPALQA 1566
              E+ +  KN  +D L  +S+ KL K LKE+L  TK    ++ K+ +   VL +P +++
Sbjct: 531  NDENKENNKNNDDDVLKAMSIGKLKKKLKEQLSTTKSIGPENKKSSSDEAVLLQPTIRS 589


>ref|XP_002884360.1| hypothetical protein ARALYDRAFT_477561 [Arabidopsis lyrata subsp.
            lyrata] gi|297330200|gb|EFH60619.1| hypothetical protein
            ARALYDRAFT_477561 [Arabidopsis lyrata subsp. lyrata]
          Length = 521

 Score = 90.9 bits (224), Expect = 2e-15
 Identities = 131/565 (23%), Positives = 220/565 (38%), Gaps = 54/565 (9%)
 Frame = +1

Query: 7    CKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPPM 186
            CK+NKIPAN+TN+AMA+AL  LEIVEG++E M PS+  +  S   ++       P     
Sbjct: 15   CKRNKIPANMTNIAMADALRDLEIVEGMDEFMDPSRDQSPTSVARNL-------PSAART 67

Query: 187  AGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQV--KDAEDSKADAIETPA---------- 330
            A R+TRR    K E +S   +        K+     D E+   + ++ P+          
Sbjct: 68   AARTTRR-KTTKDETQSSELVTRSCYVVSKSLAGEMDQENKNMNMLQNPSVPQSLAVKLD 126

Query: 331  ---LMAQANRKKGQMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVK 501
               +M +AN  K   A +     ++  + A   KKD                       +
Sbjct: 127  VTDMMPEANVSKTPAARS-----TRRAQAAASSKKD-----------------------E 158

Query: 502  KVYSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFK-----KELLTKDGENEEMKDANELSG 666
             V   YSTRRSVRL               ++  +      +   K  EN E  +   +  
Sbjct: 159  SVQRVYSTRRSVRLLEESMADLSLKTNVPAKKHEDSPAGSKFQEKSDENSENTEKGGVMS 218

Query: 667  ITDVDATTMEEKFENEDKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKR 846
            + D++  ++E++++           K D  I  +I  G               +M+    
Sbjct: 219  VRDLN-DSLEKEWD---------GSKNDPDI--DILYGDLGDITFFDASTSKEQMNGTDS 266

Query: 847  DNVPLGSHDVCISEIEITVKN------------TTEELKTEEGAKSQDKTAEFVESVILE 990
              V      V ++E E + ++            TT  L   + A+S+    +       E
Sbjct: 267  STVSASDSFVLVNEQETSKEDGFVVVDHAASTTTTNTLACYKEAESEQMRIDSES----E 322

Query: 991  SKDNFAVGESKDVPDFNVALDKLTELTLQQEATKDDGVAKAD----------SDFIDHVA 1140
            S++     +  +  DF VA+D   E   + E    D V+K D          S  +D   
Sbjct: 323  SEETEYETDPWEGDDFGVAVDTNQE-AFESEVRASDNVSKVDSVTTVLIADESKELDFSP 381

Query: 1141 NPLQQNKNEPD-------EIEILQLEENKKAFDPHADF-SEHIPTSDVEMSTHLIVKPTC 1296
            +PL + + E D       EI  ++LEEN  A +   D  SE  P   ++  T      + 
Sbjct: 382  SPLAEEELEVDSDEWSDYEIGEVELEENSCASEESIDIESEEAP--GLDKKTPASSSSSS 439

Query: 1297 LTPSKHSASKASMTLKRMTGFSDNKE----NIGNGSKLFVMEDVKMAKNTVEDNLHELSV 1464
            L  ++   S +    + +     +KE    N   G     ++  K  K T+++ L ++S+
Sbjct: 440  LAGNETKTSLSPFEAESILESDKDKEMAVNNNEEGKAEAEVKKTKKKKKTIDEELRDVSM 499

Query: 1465 RKLSKMLKEKLEITKKSSKNENGNE 1539
            R+L+KM+K   E+  KS +   G E
Sbjct: 500  RQLTKMVK---ELAIKSKQQHKGPE 521


>ref|XP_002520009.1| conserved hypothetical protein [Ricinus communis]
           gi|223540773|gb|EEF42333.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 737

 Score = 89.4 bits (220), Expect = 5e-15
 Identities = 76/258 (29%), Positives = 122/258 (47%), Gaps = 16/258 (6%)
 Frame = +1

Query: 1   ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVP 180
           ALCKKNKIPAN+TNVAMA+AL ALE V+G++E++   +S   QS        +  +P   
Sbjct: 13  ALCKKNKIPANMTNVAMADALKALEKVDGLDEVINAPRSDPQQSP------EKTGNPEPR 66

Query: 181 PMAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDA--EDSKADAIETPALMAQANRK 354
            +   STRR  +  +   S  P          +  ++A  E++  + +ETPA+    +R+
Sbjct: 67  TVCRTSTRRKPINVEPESSQLPTRTRRTTKKTSAAEEAEQENNNENLLETPAV--STSRR 124

Query: 355 KGQMASACWKMDSQLKECAE-------EDKKDVLMTPATMGVTXXXXXXXXXXXV--KKV 507
           +   ASA  K+D+QL E  E       E+K DV  TPA                +  K V
Sbjct: 125 RVTAASARRKIDTQLMESVEDEKAAVGEEKSDVPETPAIRSSRSKAPVVSTKKKIEEKSV 184

Query: 508 YSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDAT 687
              Y TR SVRL                ++ K E L ++ ++ E +      G +++D +
Sbjct: 185 QRVYGTRHSVRLLEKSLADLSVKEKRTVEVVKIEGLCEETDHVEQQKGVP-GGDSEIDES 243

Query: 688 -----TMEEKFENEDKSV 726
                 ++ +F+ E+K++
Sbjct: 244 LENEGELKHEFQEENKTI 261



 Score = 63.5 bits (153), Expect = 3e-07
 Identities = 79/322 (24%), Positives = 133/322 (41%), Gaps = 8/322 (2%)
 Frame = +1

Query: 637  EMKDANELSGITDVDATTMEEKFENEDKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXX 816
            +M   N  + I  ++    +E  EN D  VV ++    +   E I + ++A         
Sbjct: 430  DMVTENSETVIAALEPEIEKEMIENRDSLVVQASDDSAMET-EHISIVNAATEVSVEVVD 488

Query: 817  XXAKMDEEKRDNVPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESK 996
                   E    V +   D+     E +  N+ E+ +  + A  +D   + +E    E  
Sbjct: 489  LLNPKVSEVEGQVCVEVMDLSAVVGESSEMNSMEDKQHLDAASEEDSDGDDIE----EES 544

Query: 997  DNFAVGESKDVPDFNVALDKLTELTLQQEATKDDG-----VAKADSDFIDHVANP-LQQN 1158
            D +   E+  + D NV   K + +  Q+ ++  D        K  S F   +A+  +   
Sbjct: 545  DGY---ETDSICDSNVTEAKESAMIAQEFSSSSDSDNTPRSVKQKSPFCSLIADSEVPAE 601

Query: 1159 KNEPDEIEILQLEENKKAFDPHADFSEHIPTSDVEMSTHLIV--KPTCLTPSKHSASKAS 1332
            +   D I+ L     K            I +S   ++T  +   +PT LTP K S++K  
Sbjct: 602  ECAHDSIQTLDKSPYKPLVSGDTSTGS-IVSSPFAINTIQVQFPRPTALTPKK-SSTKKQ 659

Query: 1333 MTLKRMTGFSDNKENIGNGSKLFVMEDVKMAKNTVEDNLHELSVRKLSKMLKEKLEITKK 1512
             T++++     NKENI N  +     + K  K   ++N    S+ KL K  KE L I K 
Sbjct: 660  ATIQKIILADINKENIDNSGRKV---EPKKNKTKKQNNYEGFSLNKLRKEFKE-LRIAKN 715

Query: 1513 SSKNENGNEVLSRPALQALPEN 1578
            ++   N +EV +R ALQ LPEN
Sbjct: 716  NNGGRNVSEVETRSALQILPEN 737


>ref|XP_002274897.2| PREDICTED: uncharacterized protein LOC100259588 [Vitis vinifera]
          Length = 569

 Score = 89.0 bits (219), Expect = 6e-15
 Identities = 108/410 (26%), Positives = 176/410 (42%), Gaps = 17/410 (4%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCKKNKIPAN+TNVAMA+AL AL+ V+G+EE++ PS+S   QS     +  E+ SP +P 
Sbjct: 14   LCKKNKIPANMTNVAMADALKALQNVDGLEELLNPSESQNPQSP----EKPEIGSPEIPR 69

Query: 184  MAGR-STRRTNV-AKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKK 357
               R STRR  + A +E ES   +        + + ++    K++  +TPAL   ++RK+
Sbjct: 70   TVCRTSTRRRPIKAAEEPESSQTLTRTHRGTRRIK-EEVNQEKSEVPQTPAL--PSSRKR 126

Query: 358  GQMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSV 537
               ASA  K  +++++                                 V   YSTRRS 
Sbjct: 127  PPAASARQKTVTRVEQ-------------------------------SSVQRVYSTRRSA 155

Query: 538  RLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDAN------ELSGITDVDATTMEE 699
            RL+              S++  K+    DG+ EE K A+      + S ITD      + 
Sbjct: 156  RLS-EKLARTEPMEVEFSKVMTKDF---DGDEEENKGADSQTISEDNSKITDDSEVISKS 211

Query: 700  KFE-NEDKSVVVSNQKQD--------LSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDN 852
                N+ K+ V  N  +         L + EE                  +++D EK D 
Sbjct: 212  VLSGNDSKAEVEENTGESAKPDNSDFLEVSEEKDEAHDEQENTEAELQKNSEVDCEKMD- 270

Query: 853  VPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESKDNFAVGESKDVP 1032
                  +  +SEI  T  + ++E   ++       T   V SVI+ +  N   G   +  
Sbjct: 271  ----ESNKLLSEILKTSVHLSDESTVKKVLLVDHPTGTDVSSVIMSNTKNLNEGLKLENE 326

Query: 1033 DFNVALDKLTELTLQQEATKDDGVAKADSDFIDHVANPLQQNKNEPDEIE 1182
              +   D   +LT   +A+ DD    +++  + +++N   +N NE  ++E
Sbjct: 327  QQHGESDLELDLTAPPQASVDDPSCDSETREL-NLSN--TKNLNEDSKLE 373


>ref|XP_006297493.1| hypothetical protein CARUB_v10013512mg [Capsella rubella]
            gi|482566202|gb|EOA30391.1| hypothetical protein
            CARUB_v10013512mg [Capsella rubella]
          Length = 501

 Score = 86.7 bits (213), Expect = 3e-14
 Identities = 125/534 (23%), Positives = 212/534 (39%), Gaps = 27/534 (5%)
 Frame = +1

Query: 7    CKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPPM 186
            CK+NKIPAN+TN+AMA+AL ALEIVEG+++ M  S    A++                  
Sbjct: 15   CKRNKIPANMTNIAMADALKALEIVEGVDDFMNQSPMSVART------------------ 56

Query: 187  AGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQM 366
            A R+TR+   +K + +S + +        K+   + E    D   T  L+   N    + 
Sbjct: 57   AARTTRK-KASKDDTQSSDLVTRSCYVASKSLAGEMEQENRD---TNMLL---NPPVSRT 109

Query: 367  ASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSVRLA 546
            ++A   +  +L    +  + +V  TPA +  T            + V   YSTRRSVRL 
Sbjct: 110  SAAATSVAVKLDVTDDMPEVNVSKTPA-VRTTRRAQAAASCKKDESVQRVYSTRRSVRLL 168

Query: 547  XXXXXXXXXXXXXXSQLFKKELLTKD----GENEEMKDANELSGITDVDATTMEEKFENE 714
                          ++  K E+  K+    G   + K       + +V+  ++ +  ++ 
Sbjct: 169  EESMADLSLKSNVPAK--KHEVAEKEDSPAGSKYQAKSDENSENVKEVEVVSVRDLNDSL 226

Query: 715  DKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHDVCISEIE 894
            +K    S    DL    ++  G               +++      +      V + E E
Sbjct: 227  EKEWDGSKNDPDL----DVLYGDLGDITFFDASTSKEQLNGADSSTLSASDSFVLVQEQE 282

Query: 895  ITVKN------------TTEELKTEEGAKSQDKTAEFVESVILESKDNFAVGESKDVPDF 1038
             + +N            TT  L   + ++S+    +  ES   E++D    GE  D  D 
Sbjct: 283  TSEENDFVVVDHAASTTTTNTLVCNKESESEQMKID-SESESEETEDETDPGEGDDFGDT 341

Query: 1039 NVALDKLTELTLQQEATKDDGVAKADSDFIDHVA-NPLQQNKNEPDEIEI--LQLEENKK 1209
            N       +   +   +    V+K DS     +A   L+ + +E  + EI  ++LEEN  
Sbjct: 342  N-------QEAFESRVSASANVSKVDSVTSVVIAEEELEVDSDEWSDYEIGEVELEENSY 394

Query: 1210 AFDPHADF-SEHIPTSDVEMSTHLIVKPTCLTP-SKHSASKASMTLKRMTGFSD------ 1365
              +   D  SE  P SD +            TP S  S+S+A   LK      D      
Sbjct: 395  GSEESIDIESEEAPVSDKK------------TPASSSSSSEAESILKSDKNKEDAVEMDL 442

Query: 1366 NKENIGNGSKLFVMEDVKMAKNTVEDNLHELSVRKLSKMLKEKLEITKKSSKNE 1527
            N    G G      +  K  K   E++  ++S+R+L+KM++E L I  K  +++
Sbjct: 443  NNNGDGEGEAEAKKKTKKKKKIVAEESFKDVSMRQLTKMVRE-LAIKNKQQQHK 495


>ref|XP_006400246.1| hypothetical protein EUTSA_v10013127mg [Eutrema salsugineum]
            gi|557101336|gb|ESQ41699.1| hypothetical protein
            EUTSA_v10013127mg [Eutrema salsugineum]
          Length = 565

 Score = 85.9 bits (211), Expect = 5e-14
 Identities = 139/590 (23%), Positives = 222/590 (37%), Gaps = 65/590 (11%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCK+NKIPAN+TN+AMA+AL AL+IVEG+ E M  S S   QS T     S    P  P 
Sbjct: 14   LCKRNKIPANMTNLAMADALKALDIVEGLNEYMNQSDSNVLQSPT-----SVAKQP--PS 66

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAE-------DSKADAIETPALMAQ 342
             A R+TRR    K E +S + +           +   E        + +   +T  + +Q
Sbjct: 67   TATRTTRRKTAIKAEPQSSSQLGNRSCHMTSKSLAITEMDQETINKNVSQQPDTNIVKSQ 126

Query: 343  ANRKKGQMASACWK--MDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYST 516
             N  K   A +  K    +      +E KKDVL                       V S 
Sbjct: 127  DNVAKTPAARSTRKALAATSCSSKVQESKKDVL-----------------------VQSV 163

Query: 517  YSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEM--------KDANELSGIT 672
            YSTRRS RL               S   K E   K+ E E+          D+ E S  T
Sbjct: 164  YSTRRSTRLLEKCMADLSLKTKETSVNDKPE---KNEETEQKVSAQEKIPADSEERSEDT 220

Query: 673  DV-DATTMEEKFENEDKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRD 849
            +V     +    E E K +   + K    + + + LG +              M +EK  
Sbjct: 221  EVIPGRDLSASMEKEWKMLKNDSDKVTGGLEKYVDLGDTDAKNETNNEKMNEVMIDEKES 280

Query: 850  N---VPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESKDNFAVGES 1020
                V +   +      +   + + ++ + E   +  +   +  ++ +LE  +     ++
Sbjct: 281  EDSLVQVDKLEEASQADKAICEGSLKKNENEPEIRDVEVHVDLGDNPVLEYANTDTNNDN 340

Query: 1021 KDVPD---FNVALDKLTELTLQQEATKDDGV-----AKADSDFIDHVANPLQQNKNEPDE 1176
            K+  +   FN  L    + T+Q+   + + +      + D D  D    P + N +   +
Sbjct: 341  KEWKNDQAFNSLLQADYQETIQEIGPEPEKINSFDEDQTDGDGGDSETEPEEDNSDIDSD 400

Query: 1177 IEILQLEENKKAFD-----PHADFSEHIPTSDVEMSTHLIVKPTCLTP---SKHSASKAS 1332
              I   +  +            +FSE   ++   +S HL+++   +     S H+A   S
Sbjct: 401  GNISDADSTQAVLGSDTAVEEMNFSESEGSAAAPISPHLLLEEATVKTAPLSPHAAEPIS 460

Query: 1333 MTLKRMTGFS---------------DNKENIGNGSKLFVMEDV----------KMAKNTV 1437
            +   R    +               DNK    N  K+  + D           K  K  V
Sbjct: 461  VQFPRPNKSTTTTPPKKSAMKLVNVDNKNKENNMEKMMNVNDSDNGEWKGAANKKKKEKV 520

Query: 1438 E---DNLHELSVRKLSKMLKEKLEITKKSSKNENGNEVLSRPALQALPEN 1578
            E   + L ++S+R+L KM+K   E++ KSS N N        ALQ LP N
Sbjct: 521  EIDAEKLKDVSMRQLVKMVK---ELSIKSSNNRN--------ALQILPGN 559


>ref|XP_006371235.1| hypothetical protein POPTR_0019s06950g [Populus trichocarpa]
           gi|550316935|gb|ERP49032.1| hypothetical protein
           POPTR_0019s06950g [Populus trichocarpa]
          Length = 683

 Score = 85.1 bits (209), Expect = 9e-14
 Identities = 79/242 (32%), Positives = 106/242 (43%), Gaps = 9/242 (3%)
 Frame = +1

Query: 4   LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
           LCKKNKIPAN+TN+AMA+AL  L+ VEG EE     +    QS  +++      SP VP 
Sbjct: 14  LCKKNKIPANMTNIAMADALKVLDKVEGREEFTNVPEPDPQQSPEKAIS----GSPEVPQ 69

Query: 184 MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQV--KDAEDSKADAIETPALMAQANRKK 357
            + R+  R    + E ES  P+         T V   D E+  A+  ETP ++A+  R +
Sbjct: 70  TSVRTLTRRKPLRIEPESSKPLTRTRCTTRGTVVGEGDQENKTANLSETPIMLAR--RIR 127

Query: 358 GQMASACWKMDSQLKECAE-EDKKDVLMTPATMG------VTXXXXXXXXXXXVKKVYST 516
              ASA  KM+S+  E  E ++K +V  TPA                       K V   
Sbjct: 128 TSTASARHKMESKSMESVENQEKNNVPKTPAARSSRRRAPAVSARGKLEAQNEEKSVQRV 187

Query: 517 YSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTME 696
           YSTR SVRL                +  K + L  + E+ E KD       T  D  T  
Sbjct: 188 YSTRHSVRLLEKGMEGLGLKEKERVRPLKMDGLCWEIEDVETKDE------TGDDLLTKS 241

Query: 697 EK 702
           EK
Sbjct: 242 EK 243


>ref|XP_002873814.1| hypothetical protein ARALYDRAFT_488576 [Arabidopsis lyrata subsp.
            lyrata] gi|297319651|gb|EFH50073.1| hypothetical protein
            ARALYDRAFT_488576 [Arabidopsis lyrata subsp. lyrata]
          Length = 568

 Score = 85.1 bits (209), Expect = 9e-14
 Identities = 147/617 (23%), Positives = 233/617 (37%), Gaps = 87/617 (14%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCK+NKIPAN+TN+AMA+AL +LEIV+G++E M  S+S   QS T         +   P 
Sbjct: 14   LCKRNKIPANMTNLAMADALKSLEIVDGLDEYMNQSESNAQQSPTS-------VAKLPPN 66

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQ 363
             A R+TRR                               +KAD   +  L++++ R   +
Sbjct: 67   TAARTTRRKTT----------------------------TKADPQPSSQLVSRSCRATSK 98

Query: 364  MASACWKMDSQLKECAEEDK-------KDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYS 522
              +    +++  K  A+E K        +V  TPA                 + V S YS
Sbjct: 99   SLAGEMDLENVNKNVAQEPKTNTVRFEANVPKTPAARSTRKASAATSCSKKDELVQSVYS 158

Query: 523  TRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTMEEK 702
            TRRS RL               S   K+ L  K  +NE+ +    +S      A +  E 
Sbjct: 159  TRRSTRL-------LEKCMADLSLKTKETLDNKPAKNEDTE--QNVSAKEKNPAGSEGEV 209

Query: 703  FENEDKSVVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHDVCI 882
                D SV +        + E +K  +              + ++EK + V     +   
Sbjct: 210  IPGRDLSVSME------QVWENLKNDTDQVVGDLAVMDANTETNKEKMNEVLADEKESEN 263

Query: 883  SEIEITVKNTTEELKTEEGAKSQDKTAEF--------VESVILES---KDNFAVGESKDV 1029
            S ++   +  T     E G K  D   E         ++  +LES   + +    ESK+V
Sbjct: 264  SLVQADKQEETLHAICEAGPKKNDNDQEIEDLEIYVDLDIPVLESGNTETHNDDNESKNV 323

Query: 1030 PDFNVALDKL-TELTLQQ-EATKDDGVAKADSDFID-HVANPLQQNKNEPDEIEILQLEE 1200
              F+  +D+  TE  +Q+ ++  +  V + DSD  D      +Q+N +EP++I     + 
Sbjct: 324  LTFDNPVDQQETEHAIQENDSEPETKVDQTDSDAGDSKPKQAIQENDSEPEKINNFDEDT 383

Query: 1201 NKKAFDPHADFSEHIP--------------------------TSDVEMSTH--------- 1275
                 D  A  SE  P                          T+D EM+           
Sbjct: 384  MVDQTDSDAGDSETEPDEEHSGVDSDGTISEAESNQAVLGSETADEEMTLSESEGSTATA 443

Query: 1276 ----------LIVKPTCLTP----------SKHSASKASMTLKRMTGFSDNKENIGNGSK 1395
                       ++K T ++P           + S S   +    +   ++NKEN      
Sbjct: 444  PNSPPLLEEAKVIKTTPVSPFAAEPISVQFPRPSKSTTPLKNSALKLVNENKENNMEVMM 503

Query: 1396 LFV----------MEDVKMAKNTVEDNLHEL-SVRKLSKMLKEKLEITKKSSKNENGNEV 1542
            + V           E  K  K T+++ + E+ SVR+L KM+K   E++ KSS        
Sbjct: 504  MNVNNNENGESKGEEGKKKKKVTIDEEILEVASVRQLRKMVK---ELSIKSS-------- 552

Query: 1543 LSRPALQALPENRLVDE 1593
             +R ALQ LPEN    E
Sbjct: 553  -NRTALQILPENNQTAE 568


>ref|XP_002331774.1| predicted protein [Populus trichocarpa]
          Length = 683

 Score = 85.1 bits (209), Expect = 9e-14
 Identities = 79/242 (32%), Positives = 106/242 (43%), Gaps = 9/242 (3%)
 Frame = +1

Query: 4   LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
           LCKKNKIPAN+TN+AMA+AL  L+ VEG EE     +    QS  +++      SP VP 
Sbjct: 14  LCKKNKIPANMTNIAMADALKVLDKVEGREEFTNVPEPDPQQSPEKAIS----GSPEVPQ 69

Query: 184 MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQV--KDAEDSKADAIETPALMAQANRKK 357
            + R+  R    + E ES  P+         T V   D E+  A+  ETP ++A+  R +
Sbjct: 70  TSVRTLTRRKPLRIEPESSKPLTRTRCTTRGTVVGEGDQENKTANLSETPIMLAR--RIR 127

Query: 358 GQMASACWKMDSQLKECAE-EDKKDVLMTPATMG------VTXXXXXXXXXXXVKKVYST 516
              ASA  KM+S+  E  E ++K +V  TPA                       K V   
Sbjct: 128 TSTASARHKMESKSMESVENQEKNNVPKTPAARSSRRRAPAVSARGKLEAQNEEKSVQRV 187

Query: 517 YSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTME 696
           YSTR SVRL                +  K + L  + E+ E KD       T  D  T  
Sbjct: 188 YSTRHSVRLLEKGMEGLGLKEKERVRPLKMDGLCWEIEDVETKDE------TGDDLLTKS 241

Query: 697 EK 702
           EK
Sbjct: 242 EK 243


>ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana]
            gi|6714423|gb|AAF26111.1|AC012328_14 hypothetical protein
            [Arabidopsis thaliana] gi|61742693|gb|AAX55167.1|
            hypothetical protein At3g03130 [Arabidopsis thaliana]
            gi|332640384|gb|AEE73905.1| uncharacterized protein
            AT3G03130 [Arabidopsis thaliana]
          Length = 520

 Score = 84.7 bits (208), Expect = 1e-13
 Identities = 133/567 (23%), Positives = 221/567 (38%), Gaps = 56/567 (9%)
 Frame = +1

Query: 7    CKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPPM 186
            CK+NKIPAN+TN+AMA+AL  LEIVEG++E M PS+  +  S   ++       P     
Sbjct: 15   CKRNKIPANMTNIAMADALRDLEIVEGMDEFMDPSRDQSPTSVARNL-------PSAART 67

Query: 187  AGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQV--KDAEDSKADAIETPA---------- 330
            A R+TRR +  K E +S   +        K+     D E+   + ++ P+          
Sbjct: 68   AARTTRRKS-TKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNMLQNPSVPQSRAVKLD 126

Query: 331  ---LMAQANRKKGQMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVK 501
               +M +AN  K   A +     ++  + A   KKD                       +
Sbjct: 127  VNDIMPEANVSKTPAARS-----TRRAQAAASSKKD-----------------------E 158

Query: 502  KVYSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVD 681
             V   YSTRRSVRL                   K  +  K  ++E+    ++    +D  
Sbjct: 159  SVQRVYSTRRSVRLLEESMADLS---------LKTNVPVK--KHEDSPAGSKFQAKSD-- 205

Query: 682  ATTMEEKFENEDKSVVVSNQKQDLSIGEE-----------IKLGSSAXXXXXXXXXXXAK 828
                 E  EN DK  V+S +  + S+ +E           I  G                
Sbjct: 206  -----ENSENTDKGGVMSGRDLNDSLEKEWDGSKNDPDLDILYGDLGDITFFDASTSKEH 260

Query: 829  MDEEKRDNVPLGSHDVCISEIE-------ITVKNTTEELKTEEGAKSQDKTAEFVE-SVI 984
            ++      V      V ++E E       + V + T    T   A +++   E ++    
Sbjct: 261  LNRTDSSTVSASDSFVLVNEHETSQEDGFVVVDHATSTTTTNTLACNKESEPEQMKIDSE 320

Query: 985  LESKDNFAVGESKDVPDFNVALDKLTELTLQQEATKDDGVAKAD----------SDFIDH 1134
             ES++     +  +  DF VA+    E   + + +  D V+K D          S  +D 
Sbjct: 321  SESEETEYETDPWEGDDFGVAVHTNQE-AFESKVSASDNVSKVDSVATVLIADESKELDF 379

Query: 1135 VANPLQQNKNEPD-------EIEILQLEENKKAFDPHADF-SEHIPTSDVEMSTHLIVKP 1290
             ++PL   + E D       EI  ++LEEN    +   +  SE  P SD    T      
Sbjct: 380  SSSPLAVEELEEDSDEWSDYEIGEVELEENSCGSEESIEIESEEAPVSD--KKTPASSSS 437

Query: 1291 TCLTPSKHSASKASMTLKRMTGFSDNKE----NIGNGSKLFVMEDVKMAKNTVEDNLHEL 1458
            + L  ++   S +    + +    ++KE    N G G     ++  K  K T+++ L ++
Sbjct: 438  SSLAGNETRTSLSPFEAESILESEEDKEMAVNNNGEGKAEAEVKKTK-KKKTIDEELKDV 496

Query: 1459 SVRKLSKMLKEKLEITKKSSKNENGNE 1539
            S+R+L+KM+K   E+  KS +   G E
Sbjct: 497  SMRQLTKMVK---ELAIKSKQQHKGPE 520


>gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thaliana]
          Length = 520

 Score = 84.0 bits (206), Expect = 2e-13
 Identities = 133/567 (23%), Positives = 220/567 (38%), Gaps = 56/567 (9%)
 Frame = +1

Query: 7    CKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPPM 186
            CK+NKIPAN+TN+AMA+AL  LEIVEG++E M PS+  +  S   ++       P     
Sbjct: 15   CKRNKIPANMTNIAMADALRDLEIVEGMDEFMDPSRDQSPTSVARNL-------PSAART 67

Query: 187  AGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQV--KDAEDSKADAIETPA---------- 330
            A R+TRR +  K E +S   +        K+     D E+   + ++ P+          
Sbjct: 68   AARTTRRKS-TKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNMLQNPSVPQSRAVKLD 126

Query: 331  ---LMAQANRKKGQMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVK 501
               +M +AN  K   A       ++  + A   KKD                       +
Sbjct: 127  VNDIMPEANVSKTPAA-----RXTRRAQAAASSKKD-----------------------E 158

Query: 502  KVYSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVD 681
             V   YSTRRSVRL                   K  +  K  ++E+    ++    +D  
Sbjct: 159  SVQRVYSTRRSVRLLEESMADLS---------LKTNVPVK--KHEDSPAGSKFQAKSD-- 205

Query: 682  ATTMEEKFENEDKSVVVSNQKQDLSIGEE-----------IKLGSSAXXXXXXXXXXXAK 828
                 E  EN DK  V+S +  + S+ +E           I  G                
Sbjct: 206  -----ENSENTDKGGVMSGRDLNDSLEKEWDGSKNDPDLDILYGDLGDITFFDASTSKEH 260

Query: 829  MDEEKRDNVPLGSHDVCISEIE-------ITVKNTTEELKTEEGAKSQDKTAEFVE-SVI 984
            ++      V      V ++E E       + V + T    T   A +++   E ++    
Sbjct: 261  LNRTDSSTVSASDSFVLVNEHETSQEDGFVVVDHATSTTTTNTLACNKESEPEQMKIDSE 320

Query: 985  LESKDNFAVGESKDVPDFNVALDKLTELTLQQEATKDDGVAKAD----------SDFIDH 1134
             ES++     +  +  DF VA+    E   + + +  D V+K D          S  +D 
Sbjct: 321  SESEETEYETDPWEGDDFGVAVHTNQE-AFESKVSASDNVSKVDSVATVLIADESKELDF 379

Query: 1135 VANPLQQNKNEPD-------EIEILQLEENKKAFDPHADF-SEHIPTSDVEMSTHLIVKP 1290
             ++PL   + E D       EI  ++LEEN    +   +  SE  P SD    T      
Sbjct: 380  SSSPLAVEELEEDSDEWSDYEIGEVELEENSCGSEESIEIESEEAPVSD--KKTPASSSS 437

Query: 1291 TCLTPSKHSASKASMTLKRMTGFSDNKE----NIGNGSKLFVMEDVKMAKNTVEDNLHEL 1458
            + L  ++   S +    + +    ++KE    N G G     ++  K  K T+++ L ++
Sbjct: 438  SSLAGNETRTSLSPFEAESILESEEDKEMAVNNNGEGKAEAEVKKTK-KKKTIDEELKDV 496

Query: 1459 SVRKLSKMLKEKLEITKKSSKNENGNE 1539
            S+R+L+KM+K   E+  KS +   G E
Sbjct: 497  SMRQLTKMVK---ELAIKSKQQHKGPE 520


>emb|CBI26558.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score = 82.8 bits (203), Expect = 4e-13
 Identities = 77/266 (28%), Positives = 124/266 (46%), Gaps = 5/266 (1%)
 Frame = +1

Query: 4   LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
           LCKKNKIPAN+TNVAMA+AL AL+ V+G+EE++ PS+S   QS     +  E+ SP +P 
Sbjct: 14  LCKKNKIPANMTNVAMADALKALQNVDGLEELLNPSESQNPQSP----EKPEIGSPEIPR 69

Query: 184 MAGR-STRRTNV-AKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKK 357
              R STRR  + A +E ES   +        + + ++    K++  +TPAL   ++RK+
Sbjct: 70  TVCRTSTRRRPIKAAEEPESSQTLTRTHRGTRRIK-EEVNQEKSEVPQTPAL--PSSRKR 126

Query: 358 GQMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSV 537
              ASA  K  +++++                                 V   YSTRRS 
Sbjct: 127 PPAASARQKTVTRVEQ-------------------------------SSVQRVYSTRRSA 155

Query: 538 RLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTMEEKFENED 717
           RL+               ++   +++TKD + +E ++    S     D + + +  E   
Sbjct: 156 RLS------EKLARTEPMEVEFSKVMTKDFDGDEEENKGADSQTISEDNSKITDDSEVIS 209

Query: 718 KSVVVSNQKQ---DLSIGEEIKLGSS 786
           KSV+  N  +   + + GE  K  +S
Sbjct: 210 KSVLSGNDSKAEVEENTGESAKPDNS 235


>gb|ESW03530.1| hypothetical protein PHAVU_011G021300g [Phaseolus vulgaris]
          Length = 740

 Score = 77.0 bits (188), Expect = 2e-11
 Identities = 96/391 (24%), Positives = 151/391 (38%), Gaps = 23/391 (5%)
 Frame = +1

Query: 1    ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVP 180
            ALCKKNKIPANITNVAMA+ALAAL+ VEG++EI+   ++     S +       AS    
Sbjct: 13   ALCKKNKIPANITNVAMADALAALDQVEGLDEILNSIEADVGTPSVQCRTAGRAAS---Q 69

Query: 181  PMAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKG 360
              A R+    + AK    S  P+         + V + E+  A+    P +     R++ 
Sbjct: 70   RKAARAEAEDSTAKVS-ASARPL-RGARGGVASGVMEQENKDAN---VPPVTPAVGRRRA 124

Query: 361  QMASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVK-------KVYSTY 519
               S   K + ++ E  E DK D   T A   V             K        V  TY
Sbjct: 125  TAVSTRRKKEVEMVE-QEGDKNDAPKTLAAASVGGRRTTSRSVCTTKIETPGGASVQRTY 183

Query: 520  STRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTMEE 699
            STRRSVRL               +   K +    D  ++E+ + +  +G    D+   E+
Sbjct: 184  STRRSVRLLENGLSKMNLIDTEDTGFDKID--DDDDVSQELSNVSHKAG----DSCDTEQ 237

Query: 700  KFENEDKSVVVSNQKQDLSI----------------GEEIKLGSSAXXXXXXXXXXXAKM 831
                +  S VVS   Q+  +                G ++KL S              + 
Sbjct: 238  GSSLQMDSSVVSENTQEFEVCSSEHNTEYECQSHVSGSDVKLVSVTENNAVVQPHALDEA 297

Query: 832  DEEKRDNVPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVESVILESKDNFAV 1011
            + EK + + +G               T      E G++      E  +S  LE+++   +
Sbjct: 298  EPEKINCLEMG---------------TEPNASDEAGSEPLPDLEETCDSSELETENKDCL 342

Query: 1012 GESKDVPDFNVALDKLTELTLQQEATKDDGV 1104
            G  ++      + D   E+T  ++A+ D  V
Sbjct: 343  GAYQESFPVEASTDASVEVTGLEKASTDASV 373


>ref|XP_006287470.1| hypothetical protein CARUB_v10000682mg [Capsella rubella]
            gi|482556176|gb|EOA20368.1| hypothetical protein
            CARUB_v10000682mg [Capsella rubella]
          Length = 533

 Score = 77.0 bits (188), Expect = 2e-11
 Identities = 137/578 (23%), Positives = 213/578 (36%), Gaps = 48/578 (8%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCK+NKIPAN+TNVAMA+AL ALEIVEG++E M    S + QS T     S    P  P 
Sbjct: 14   LCKRNKIPANMTNVAMADALDALEIVEGLDEYM----SQSVQSPT-----SVAKQP--PN 62

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQ 363
             A R+TRR    K E +S             T +  +   ++ +      M Q N  K  
Sbjct: 63   TATRATRRKTTVKAEPQS-------------TSLMVSRSCRSTSKSLAGEMDQENLNK-- 107

Query: 364  MASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSVRL 543
                   +  + K    + + +V  TPA                   V S YSTRRS RL
Sbjct: 108  ------NVAQEPKTSTVKFEANVPKTPAARSTRKASVPTSCAKKDDLVQSVYSTRRSTRL 161

Query: 544  AXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTMEEKFENEDKS 723
                             L   E L K  +NE  +   ++S      A +  E     D S
Sbjct: 162  LEKCMADL--------SLKTGETLDKPAKNEVTE--QKISVQEKNPAGSEAEVIPGRDLS 211

Query: 724  VVVSNQKQDLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHDVCISEIEITV 903
            V +    ++L    +  +G               ++++E+   V     +   S +++  
Sbjct: 212  VSMEQVWENLKNVSDQVVGDLEVYVDIAAMDANTEVNKEEMSEVRADEKECENSLVKVDK 271

Query: 904  KNTT------EELKTEEGAKSQDKTAEFVESVILES--------KDNFAVGESKDVPDFN 1041
            +  T      ++  +++G    +   +F    +LE         K+  A+G S D  +  
Sbjct: 272  QEETLQAMCKKKNDSDQGIGDLELDVDFGAIPVLEHANTDDNEFKNVLALGSSVDAQETE 331

Query: 1042 VAL----------------DKLTELTLQQEATKDDGVAKADSDFIDHVANPLQQNKNEPD 1173
             A+                D +      +   +D+    +D    +  +N      +  D
Sbjct: 332  QAIQENGAEPEKINKFDEEDTMVNAGDSETEAEDNCGVDSDGTISEADSNQAVLGSDIAD 391

Query: 1174 EIEILQLEENKKAFDPHAD---FSEHIPTSDVEMSTHLIVKPTCLTPSKHSASKASMTLK 1344
            E   L   E   A  P++        + T+ +       +      PSK +    +  LK
Sbjct: 392  EEMTLSESEGSVASAPNSPPLLEKAEVKTTPLSPFAEESISVQFPRPSKSTTPSKNSALK 451

Query: 1345 RMTGFSDNKE------------NIGNGSKLFVMEDVKMAKNTV---EDNLHELSVRKLSK 1479
             + G  +NKE            N  NG K    E+ K  K  V   E+NL   S+R+L++
Sbjct: 452  LVDG--ENKENNMEIMMRVDVNNNENGEK--KGEEAKKKKKKVRIDEENLKNKSIRQLTE 507

Query: 1480 MLKEKLEITKKSSKNENGNEVLSRPALQALPENRLVDE 1593
            +LK    +T K SK         R ALQ LP N    E
Sbjct: 508  ILK---NLTIKDSK---------RTALQILPGNNQTAE 533


>ref|XP_003538933.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max]
          Length = 722

 Score = 77.0 bits (188), Expect = 2e-11
 Identities = 95/386 (24%), Positives = 147/386 (38%), Gaps = 35/386 (9%)
 Frame = +1

Query: 1    ALCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQS--GT--------AQSSTESMD 150
            ALCKKNKIPANITNVAMA+ALAAL  VEG+++   PS+   GT         ++ST+   
Sbjct: 13   ALCKKNKIPANITNVAMADALAALNQVEGLDDFFNPSEGDVGTPSVNHRTVVRTSTQRKA 72

Query: 151  MSEVASPFVPPMAGRSTRRTNVAKQEIESVN------PMXXXXXXXXKTQVKDAEDSKAD 312
              E A          STRR  VA++ +E  N      P+         T V      + +
Sbjct: 73   AIEEAEGL---KVKTSTRRVRVAEEVVEQENKDANAPPITPAASRRRATAVSTRRKKEVE 129

Query: 313  AIE----------TPALMAQANRKKGQMASACWKMDSQLKECAEEDKKDVLMTPATMGVT 462
             +E          TPA +A  +R++    S C                  + TP   G  
Sbjct: 130  MVEEDAGVQGNPKTPAAVAPVSRRRATSRSVCTTK---------------IETPGAHGT- 173

Query: 463  XXXXXXXXXXXVKKVYSTYSTRRSVRLAXXXXXXXXXXXXXXSQLFKKELLTKDG----- 627
                            S Y+TRRSVRL                 L K  LL  +      
Sbjct: 174  ----------------SVYNTRRSVRL-------------LEKDLSKMSLLDTEDTTGLV 204

Query: 628  --ENEEMKDANELSGITDVDATTMEEKFENEDKSVVVSNQKQDLSI-GEEIKLGSSAXXX 798
              + +  +D++ +S   + D++  E+    + +S VVS   ++L +   E          
Sbjct: 205  KIDGDVSQDSSNVSHQLEEDSSGNEKGDSLQMESTVVSGDTRELEVCSLEKNTEYECQSR 264

Query: 799  XXXXXXXXAKMDEEKRDNVPLGSHDVCISEIEITVKNTTEELKTEEGAKSQDKTAEFVES 978
                      + E      P G ++    ++             E G++S     E  +S
Sbjct: 265  DLDSDVKLVSVTEIDMLVEPHGPNEAGSEKVNCLELEAEPNASDEAGSESLPVLEESYDS 324

Query: 979  VILESKDNFAVGESKDV-PDFNVALD 1053
              LE+++NF +  S+D  P+  +  D
Sbjct: 325  SELETQNNFPLEASEDAFPEVTIGQD 350


>gb|EXB75013.1| hypothetical protein L484_012137 [Morus notabilis]
          Length = 791

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 103/440 (23%), Positives = 167/440 (37%), Gaps = 23/440 (5%)
 Frame = +1

Query: 4    LCKKNKIPANITNVAMAEALAALEIVEGIEEIMQPSQSGTAQSSTESMDMSEVASPFVPP 183
            LCKKNKIPAN+TNVAMA++LA+L+ VEG++E +  S+S + Q    ++    + SP  P 
Sbjct: 14   LCKKNKIPANLTNVAMADSLASLQHVEGLDEFLNESKSESQQFPEGAL----IGSPDAPR 69

Query: 184  MAGRSTRRTNVAKQEIESVNPMXXXXXXXXKTQVKDAEDSKADAIETPALMAQANRKKGQ 363
             + R++ R      E ES   +        +  V++ +  + + +  P   A  + ++G+
Sbjct: 70   TSCRTSTRRKPISDEPESSQILTRTCRGTRRGVVEEMDQERTEVV--PKTPAARSSRRGR 127

Query: 364  MASACWKMDSQLKECAEEDKKDVLMTPATMGVTXXXXXXXXXXXVKKVYSTYSTRRSVRL 543
             ASA  K +SQ  E                                 V    STRRSVRL
Sbjct: 128  PASARQKTESQKDE-------------------------------SSVQRACSTRRSVRL 156

Query: 544  AXXXXXXXXXXXXXXSQLFKKELLTKDGENEEMKDANELSGITDVDATTMEEKFENEDKS 723
                            +  +K  L KD + + MK       I D+D++            
Sbjct: 157  --------------LEKTMEKLSLVKDKKIQPMK-------IDDIDSSVTMSGTNGSSSE 195

Query: 724  VVVSNQKQ-DLSIGEEIKLGSSAXXXXXXXXXXXAKMDEEKRDNVPLGSHDVCISEIEIT 900
            V    +K  DL +   +K   S              + E++ D+V         SE+   
Sbjct: 196  VCSGKEKTVDLEVSSVLK---SEGSPEIQIDLNNNNVQEKREDHVSELEESKSKSELMDL 252

Query: 901  VKNTTEELKTEE---------------------GAKSQDKTAEFVESVILESKDNFAVGE 1017
            V+ + E +   E                      + S+D  AE      L S+D  A   
Sbjct: 253  VEKSVENMDVIEETFGDKEINSVQLANFPYETQNSHSEDSKAE----QDLGSEDPLAAEV 308

Query: 1018 SKDVPDFNVALDKLTELTLQQEATKDDGVAKADSDF-IDHVANPLQQNKNEPDEIEILQL 1194
              DV    +A D   + +L+ E  +   V ++      +  ++ +Q +K+E    E   +
Sbjct: 309  LDDVSVNIIAQDIAPKSSLRLEENEFSNVKESVEPIPFNTSSSCVQADKSETLSSEASAV 368

Query: 1195 EENKKAFDPHADFSEHIPTS 1254
             E K       D   HI TS
Sbjct: 369  PETKSWTIKSPDGQSHILTS 388

