BLASTX nr result

ID: Rehmannia22_contig00038416

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00038416
         (858 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAB41200.1| unnamed protein product [Nicotiana tabacum]           189   1e-45
ref|NP_565136.1| uncharacterized protein [Arabidopsis thaliana] ...   181   3e-43
ref|XP_002277121.1| PREDICTED: uncharacterized protein LOC100240...   180   5e-43
emb|CAN67638.1| hypothetical protein VITISV_044257 [Vitis vinifera]   179   9e-43
gb|AAK68828.1| Unknown protein [Arabidopsis thaliana]                 179   1e-42
ref|XP_006301649.1| hypothetical protein CARUB_v10022093mg [Caps...   178   2e-42
ref|XP_002887649.1| hypothetical protein ARALYDRAFT_476818 [Arab...   177   3e-42
ref|XP_006342203.1| PREDICTED: uncharacterized protein LOC102591...   174   5e-41
ref|XP_004238475.1| PREDICTED: uncharacterized protein LOC101247...   172   2e-40
ref|XP_002510751.1| conserved hypothetical protein [Ricinus comm...   172   2e-40
gb|AAM62953.1| unknown [Arabidopsis thaliana]                         172   2e-40
ref|NP_564129.1| uncharacterized protein [Arabidopsis thaliana] ...   171   4e-40
ref|XP_006416338.1| hypothetical protein EUTSA_v10008766mg [Eutr...   169   9e-40
ref|XP_002890422.1| hypothetical protein ARALYDRAFT_889558 [Arab...   167   4e-39
ref|XP_006390172.1| hypothetical protein EUTSA_v10019130mg [Eutr...   166   8e-39
ref|XP_002301889.1| hypothetical protein POPTR_0002s00420g [Popu...   165   2e-38
gb|EOY15857.1| Poly polymerase 1, putative [Theobroma cacao]          164   3e-38
ref|XP_006304055.1| hypothetical protein CARUB_v10009886mg [Caps...   164   5e-38
ref|XP_002307010.1| hypothetical protein POPTR_0005s28050g [Popu...   163   6e-38
gb|EMJ12210.1| hypothetical protein PRUPE_ppa020532mg [Prunus pe...   159   2e-36

>dbj|BAB41200.1| unnamed protein product [Nicotiana tabacum]
          Length = 205

 Score =  189 bits (479), Expect = 1e-45
 Identities = 118/208 (56%), Positives = 146/208 (70%), Gaps = 11/208 (5%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQK-SSANVVSVDGDLRTYSLPITVSQVLQSLNSSPDSFFLCNSDR 551
           MG CLSS S   +  QK SSA V+S +G+LR Y++PI VSQVLQS  SS  SF +CNSDR
Sbjct: 1   MGACLSSSSVIDAKDQKPSSAYVISTNGELRQYTVPINVSQVLQSEMSSEASF-ICNSDR 59

Query: 550 LYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINASNRRR 371
           LYF DFIP L ++ +L+  QIYFVLPT+KLQYRL+AS+MAALAVKAS AL+ +N +NRR 
Sbjct: 60  LYFDDFIPRLDSEYQLQPGQIYFVLPTSKLQYRLSASEMAALAVKASAALEDLNKNNRRH 119

Query: 370 KRK---------TRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKSSSAGLGVS-RSGSV 221
            +K         +RISP+L+  E + + N T +        +  K+ S GLGVS RS SV
Sbjct: 120 SKKFIRKNKKSNSRISPMLLQVEDESRDNYTQS-------QSNYKAPSMGLGVSMRSASV 172

Query: 220 RKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           RKL R SSRRAK+AVRSFR KL+TI EG
Sbjct: 173 RKLQRISSRRAKMAVRSFR-KLMTIQEG 199


>ref|NP_565136.1| uncharacterized protein [Arabidopsis thaliana]
           gi|12323985|gb|AAG51956.1|AC015450_17 unknown protein;
           83277-83927 [Arabidopsis thaliana]
           gi|21592540|gb|AAM64489.1| unknown [Arabidopsis
           thaliana] gi|23197634|gb|AAN15344.1| Unknown protein
           [Arabidopsis thaliana] gi|332197742|gb|AEE35863.1|
           uncharacterized protein AT1G76600 [Arabidopsis thaliana]
          Length = 216

 Score =  181 bits (459), Expect = 3e-43
 Identities = 102/213 (47%), Positives = 139/213 (65%), Gaps = 16/213 (7%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQS------LNSSPDSFFL 566
           MG+C+S    +Y +   ++A +V+++GDLR Y +P+  SQVL+S       +SS  S+FL
Sbjct: 1   MGLCVSVNRNEY-VSSSTTAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSSSYFL 59

Query: 565 CNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINA 386
           CNSD LY+ DFIP++++D+ L++ QIYFVLP +K QYRL+ASDMAALAVKASVA++    
Sbjct: 60  CNSDSLYYDDFIPAIESDEILQANQIYFVLPISKRQYRLSASDMAALAVKASVAIEKAAG 119

Query: 385 SNRRRKRKTRISPVLVSDEQDPQSNQTVNH----------IMTSSVNNATKSSSAGLGVS 236
              RR+R  RISPV+  ++ +      VN+          +    + N T       G S
Sbjct: 120 KKNRRRRSGRISPVVTLNQANDNRIAAVNNRIGGEATNMMMQKGKLPNRTTPFKDTNGYS 179

Query: 235 RSGSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           RSGSVRKL RY+S RAKLAVRSFR +L TIYEG
Sbjct: 180 RSGSVRKLKRYTSGRAKLAVRSFRLRLSTIYEG 212


>ref|XP_002277121.1| PREDICTED: uncharacterized protein LOC100240987 [Vitis vinifera]
           gi|297745627|emb|CBI40792.3| unnamed protein product
           [Vitis vinifera]
          Length = 191

 Score =  180 bits (457), Expect = 5e-43
 Identities = 104/199 (52%), Positives = 134/199 (67%), Gaps = 2/199 (1%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQSLNSSPDSFFLCNSDRL 548
           MG C+SS     +    ++A ++S+ G+LR YS P+TVSQVL    SS  S F+CNSD L
Sbjct: 1   MGGCVSSVGSYSNSTSTATAKLISLHGELREYSAPVTVSQVLH-FESSSSSCFVCNSDSL 59

Query: 547 YFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVAL-DHINASNRRR 371
           Y+ D+IP++ ADDEL   QIYF+L  TKLQYRL AS+MAALAVKAS+AL +H   +  RR
Sbjct: 60  YYDDYIPAMNADDELLPGQIYFLLSATKLQYRLTASEMAALAVKASIALQNHSKKAGHRR 119

Query: 370 KRKTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKS-SSAGLGVSRSGSVRKLTRYSSR 194
             K+RISPVL  +++          +M + V    KS     LGVSR GS+RKL R+SSR
Sbjct: 120 NNKSRISPVLEVNQK----------VMEAEVGGVKKSFEKPALGVSRHGSLRKLQRHSSR 169

Query: 193 RAKLAVRSFRNKLITIYEG 137
           RA++AVRSFR +L TIYEG
Sbjct: 170 RARMAVRSFRLRLTTIYEG 188


>emb|CAN67638.1| hypothetical protein VITISV_044257 [Vitis vinifera]
          Length = 191

 Score =  179 bits (455), Expect = 9e-43
 Identities = 104/199 (52%), Positives = 134/199 (67%), Gaps = 2/199 (1%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQSLNSSPDSFFLCNSDRL 548
           MG C+SS     +    ++A ++S+ G+LR YS P+TVSQVL    SS  S F+CNSD L
Sbjct: 1   MGGCVSSVGSYSNSTSTATAKLISLHGELREYSAPVTVSQVLH-FESSSSSCFVCNSDSL 59

Query: 547 YFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVAL-DHINASNRRR 371
           Y+ D+IP++ ADDEL   QIYF+L   KLQYRL+AS+MAALAVKAS+AL +H   +  RR
Sbjct: 60  YYDDYIPAMNADDELLPGQIYFLLSAXKLQYRLSASEMAALAVKASIALQNHSKKAGHRR 119

Query: 370 KRKTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKS-SSAGLGVSRSGSVRKLTRYSSR 194
             K+RISPVL  +++          +M + V    KS     LGVSR GSVRKL R+SSR
Sbjct: 120 NNKSRISPVLEVNQK----------VMEAEVGGVKKSFEKPALGVSRHGSVRKLQRHSSR 169

Query: 193 RAKLAVRSFRNKLITIYEG 137
           RA++AVRSFR +L TIYEG
Sbjct: 170 RARMAVRSFRLRLTTIYEG 188


>gb|AAK68828.1| Unknown protein [Arabidopsis thaliana]
          Length = 216

 Score =  179 bits (453), Expect = 1e-42
 Identities = 101/213 (47%), Positives = 138/213 (64%), Gaps = 16/213 (7%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQS------LNSSPDSFFL 566
           MG+C+S    +Y +   ++A +V+++GDLR Y +P+  SQVL+S       +SS  S+FL
Sbjct: 1   MGLCVSVNRNEY-VSSSTTAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSSSYFL 59

Query: 565 CNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINA 386
           CNSD LY+ DFIP++++D+ L++ QIYFVLP +K QYRL+ASDMAALAV ASVA++    
Sbjct: 60  CNSDSLYYDDFIPAIESDEILQANQIYFVLPISKRQYRLSASDMAALAVXASVAIEKAAG 119

Query: 385 SNRRRKRKTRISPVLVSDEQDPQSNQTVNH----------IMTSSVNNATKSSSAGLGVS 236
              RR+R  RISPV+  ++ +      VN+          +    + N T       G S
Sbjct: 120 KKNRRRRSGRISPVVTLNQANDNRIAAVNNRIGGEATNMMMQKGKLPNRTTPFKDTNGYS 179

Query: 235 RSGSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           RSGSVRKL RY+S RAKLAVRSFR +L TIYEG
Sbjct: 180 RSGSVRKLKRYTSGRAKLAVRSFRLRLSTIYEG 212


>ref|XP_006301649.1| hypothetical protein CARUB_v10022093mg [Capsella rubella]
           gi|482570359|gb|EOA34547.1| hypothetical protein
           CARUB_v10022093mg [Capsella rubella]
          Length = 244

 Score =  178 bits (452), Expect = 2e-42
 Identities = 102/206 (49%), Positives = 138/206 (66%), Gaps = 6/206 (2%)
 Frame = -3

Query: 736 K*HMGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQS-----LNSSPDSF 572
           K  MG+C+S    D      ++A +V+++GDLR Y +P+  SQVL+S      +S   SF
Sbjct: 36  KQQMGLCVSVNRNDCD-SSSTTAKIVTINGDLREYDVPVLASQVLESESTASSSSRSSSF 94

Query: 571 FLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHI 392
           FLCNSD L + DFIP++ +D+ L++ QIYFVLP +K QYRL+ASDMAALAVKASVA++  
Sbjct: 95  FLCNSDSLSYDDFIPAIGSDEILQADQIYFVLPISKRQYRLSASDMAALAVKASVAMEKA 154

Query: 391 NASNRRRKRKTRISPVLVSDEQDP-QSNQTVNHIMTSSVNNATKSSSAGLGVSRSGSVRK 215
              N RR++  RISPV+  ++ +   + +T+       ++N T    AG G SRS SVRK
Sbjct: 155 AGKNNRRRKSGRISPVVTLNQPNRIGAGETMAMGKGKQLDNKTAPFKAGNGYSRSDSVRK 214

Query: 214 LTRYSSRRAKLAVRSFRNKLITIYEG 137
           L RY+S RAKLAVRSFR +L TIYEG
Sbjct: 215 LKRYTSGRAKLAVRSFRLRLATIYEG 240


>ref|XP_002887649.1| hypothetical protein ARALYDRAFT_476818 [Arabidopsis lyrata subsp.
           lyrata] gi|297333490|gb|EFH63908.1| hypothetical protein
           ARALYDRAFT_476818 [Arabidopsis lyrata subsp. lyrata]
          Length = 217

 Score =  177 bits (450), Expect = 3e-42
 Identities = 102/214 (47%), Positives = 139/214 (64%), Gaps = 17/214 (7%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQSLNSSPDS--------F 572
           MG+C+S    +Y     S+A +V+++GDLR Y +P+  SQVL+S ++S  S        +
Sbjct: 1   MGLCVSVNRNEYD-SSPSTAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSRSSSY 59

Query: 571 FLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALD-H 395
           FLCNSD LY+ DFIP++++D+ L++ QIYFVLP +K QYRL+ASDMAALAVKASVA++  
Sbjct: 60  FLCNSDSLYYDDFIPAIESDEILQADQIYFVLPISKRQYRLSASDMAALAVKASVAIEKS 119

Query: 394 INASNRRRKRKTRISPVLVSDEQDPQSNQTVNH--------IMTSSVNNATKSSSAGLGV 239
               NRRR+   RISPV+  ++ +      +N+        +    + N T       G 
Sbjct: 120 AGKKNRRRRSSGRISPVVTLNQPNDNRIAAMNNRIGGEATILQKGKLPNRTTPFKDTTGY 179

Query: 238 SRSGSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           SRSGSVRKL RY+S RAKLAVRSFR +L TIYEG
Sbjct: 180 SRSGSVRKLKRYTSGRAKLAVRSFRLRLSTIYEG 213


>ref|XP_006342203.1| PREDICTED: uncharacterized protein LOC102591626 [Solanum tuberosum]
          Length = 196

 Score =  174 bits (440), Expect = 5e-41
 Identities = 114/207 (55%), Positives = 139/207 (67%), Gaps = 10/207 (4%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQS-LNSSPDSFFLCNSDR 551
           MG CLSS S      Q +S  V+SV+G+LR YS+PI VSQVLQS ++S   S FLC+SDR
Sbjct: 1   MGACLSSSSTIVDQKQLTSY-VISVNGELRQYSVPINVSQVLQSDISSDHASSFLCSSDR 59

Query: 550 LYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINASNRRR 371
           LYF DFIPSL +  +L   QIYF+LP +KL+YRL+ASDMAALAVKAS AL+ +N +  + 
Sbjct: 60  LYFDDFIPSLDSQYQLLPGQIYFMLPASKLKYRLSASDMAALAVKASTALEDLNNNKNKN 119

Query: 370 K--------RKTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKSSSAGLGVS-RSGSVR 218
           K         K RISP+L+  E+D  + QT N             S+ GLG+S RS SVR
Sbjct: 120 KSRKSRKIMSKKRISPMLLQVEEDDYT-QTNN------------KSTVGLGISMRSASVR 166

Query: 217 KLTRYSSRRAKLAVRSFRNKLITIYEG 137
           KL R SSRRAK+AVRSFR KLITI EG
Sbjct: 167 KLQRLSSRRAKMAVRSFR-KLITIPEG 192


>ref|XP_004238475.1| PREDICTED: uncharacterized protein LOC101247521 [Solanum
           lycopersicum]
          Length = 187

 Score =  172 bits (435), Expect = 2e-40
 Identities = 111/202 (54%), Positives = 134/202 (66%), Gaps = 5/202 (2%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQSLNSSPDSFFLCNSDRL 548
           MG CLSS S      Q +S  V+S+ G+LR YS+PI VSQVLQS  SS    FLC+SDRL
Sbjct: 1   MGACLSSSSNIVDQKQLTSY-VISIGGELRQYSVPINVSQVLQSDISST---FLCSSDRL 56

Query: 547 YFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINASNRRRK 368
           YF  FIPSL +  +L   QIYF+LP +KL+Y L ASDMAALA+KAS AL+ +N +  R+ 
Sbjct: 57  YFDHFIPSLDSQYQLLPNQIYFMLPASKLKYPLTASDMAALALKASTALEELNKNKSRKS 116

Query: 367 R----KTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKSSSAGLGVS-RSGSVRKLTRY 203
           R    K RISP+L+  EQ+               NN  KS++ GLG+S RS SVRKL R 
Sbjct: 117 RKITSKNRISPMLLPLEQE--------------TNNYYKSTTVGLGISMRSASVRKLQRL 162

Query: 202 SSRRAKLAVRSFRNKLITIYEG 137
           SSRRAK+AVRSFR KLITI EG
Sbjct: 163 SSRRAKMAVRSFR-KLITIQEG 183


>ref|XP_002510751.1| conserved hypothetical protein [Ricinus communis]
           gi|223551452|gb|EEF52938.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 210

 Score =  172 bits (435), Expect = 2e-40
 Identities = 103/207 (49%), Positives = 130/207 (62%), Gaps = 10/207 (4%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ-----SLNSSPDSFFLC 563
           MG C S      S     +ANVVS++G LR Y++P+  SQVL      S +SS  SFFLC
Sbjct: 1   MGSCFSCSVFSESDLLPPAANVVSINGTLRQYNVPVIASQVLDAEAASSSSSSSTSFFLC 60

Query: 562 NSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINAS 383
           NSD L + D IP+L +D +L + Q+YF+LP +KLQ RL A DMAALAVKASVAL + + +
Sbjct: 61  NSDFLSYDDLIPALDSDAQLYANQLYFILPKSKLQNRLTAPDMAALAVKASVALQNASKN 120

Query: 382 NRRRKRKTRISPVLVSDEQDPQS---NQTVNHIMTSSVNNATKSSS--AGLGVSRSGSVR 218
              R++K RISPVL+ ++   Q    N T             K      G+G SRSGSVR
Sbjct: 121 EAHRRKKARISPVLLVNQSSSQRHLLNPTSGDAYPRKTFQKAKGEQPPVGMGFSRSGSVR 180

Query: 217 KLTRYSSRRAKLAVRSFRNKLITIYEG 137
           +L RY+SRRAKLAVRSFR +L TIYEG
Sbjct: 181 RLHRYTSRRAKLAVRSFRLRLTTIYEG 207


>gb|AAM62953.1| unknown [Arabidopsis thaliana]
          Length = 210

 Score =  172 bits (435), Expect = 2e-40
 Identities = 96/209 (45%), Positives = 135/209 (64%), Gaps = 12/209 (5%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ---------SLNSSPDS 575
           MG+C+S +  D +     +  +V+V+GDLR Y++P+  SQVL+         S +S P S
Sbjct: 1   MGICVSFRREDSN--SSPTVKIVTVNGDLREYNVPVIASQVLEAESAAAYSSSSSSRPSS 58

Query: 574 FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDH 395
           +F+C+SD LY+ DFIP++++++ L++ QIYFVLP +K Q RL ASDMAALAVKASVA+ +
Sbjct: 59  YFICDSDSLYYDDFIPAIKSEEPLQADQIYFVLPISKRQSRLTASDMAALAVKASVAIQN 118

Query: 394 INASNRRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSSVN---NATKSSSAGLGVSRSGS 224
                 RR++K RISPV++    +   N   +           + T    A  G++RSGS
Sbjct: 119 SVKKESRRRKKVRISPVMMLTGSNDSVNGNTSETTVKKGRPFVSKTAPFKASSGINRSGS 178

Query: 223 VRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           VR L RY+S+RAKLAVRSFR KL TIYEG
Sbjct: 179 VRNLRRYTSKRAKLAVRSFRLKLSTIYEG 207


>ref|NP_564129.1| uncharacterized protein [Arabidopsis thaliana]
           gi|32815937|gb|AAP88353.1| At1g21010 [Arabidopsis
           thaliana] gi|110743853|dbj|BAE99761.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|332191933|gb|AEE30054.1| uncharacterized protein
           AT1G21010 [Arabidopsis thaliana]
          Length = 210

 Score =  171 bits (432), Expect = 4e-40
 Identities = 96/209 (45%), Positives = 135/209 (64%), Gaps = 12/209 (5%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ---------SLNSSPDS 575
           MG+C+S +  D +     +  +V+V+GDLR Y++P+  SQVL+         S +S P S
Sbjct: 1   MGICVSFRREDSN--SSPTVKIVTVNGDLREYNVPVIASQVLEAESAAAYSSSSSSRPSS 58

Query: 574 FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDH 395
           +F+C+SD LY+ DFIP++++++ L++ QIYFVLP +K Q RL ASDMAALAVKASVA+ +
Sbjct: 59  YFICDSDSLYYDDFIPAIKSEEPLQADQIYFVLPISKRQSRLTASDMAALAVKASVAIQN 118

Query: 394 INASNRRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSSVN---NATKSSSAGLGVSRSGS 224
                 RR++K RISPV++    +   N   +           + T    A  G++RSGS
Sbjct: 119 SVKKESRRRKKVRISPVMMLTGSNDSVNGNGSETTVKKGRPFVSKTAPVKASSGINRSGS 178

Query: 223 VRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           VR L RY+S+RAKLAVRSFR KL TIYEG
Sbjct: 179 VRNLRRYTSKRAKLAVRSFRLKLSTIYEG 207


>ref|XP_006416338.1| hypothetical protein EUTSA_v10008766mg [Eutrema salsugineum]
           gi|557094109|gb|ESQ34691.1| hypothetical protein
           EUTSA_v10008766mg [Eutrema salsugineum]
          Length = 212

 Score =  169 bits (429), Expect = 9e-40
 Identities = 98/211 (46%), Positives = 133/211 (63%), Gaps = 14/211 (6%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ--------SLNSSPDSF 572
           MG C+S    D +     +  +V+V+GDLR Y++P+  SQVL+        S +S P S+
Sbjct: 1   MGNCVSFNRRDSN--SSPTVKIVTVNGDLREYNVPVLASQVLEAESAAASSSSSSRPSSY 58

Query: 571 FLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHI 392
           F+C+SD LY+ DFIP++++   L++ QIYFVLP +K Q RL ASDMAALAVKASVA+ + 
Sbjct: 59  FICDSDSLYYDDFIPAIESGSSLQAEQIYFVLPISKRQNRLTASDMAALAVKASVAIQNS 118

Query: 391 NASNRRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSSVN------NATKSSSAGLGVSRS 230
                RR++K RISPV++  + +            S+V       N T    A  G++RS
Sbjct: 119 LGKEPRRRKKGRISPVMMLTQPNDSVEAVNGKASESTVRKGGLSMNKTAPFKASSGLNRS 178

Query: 229 GSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           GSVR L RY+S+RAKLAVRSFR KL TIYEG
Sbjct: 179 GSVRNLRRYTSKRAKLAVRSFRLKLSTIYEG 209


>ref|XP_002890422.1| hypothetical protein ARALYDRAFT_889558 [Arabidopsis lyrata subsp.
           lyrata] gi|297336264|gb|EFH66681.1| hypothetical protein
           ARALYDRAFT_889558 [Arabidopsis lyrata subsp. lyrata]
          Length = 207

 Score =  167 bits (423), Expect = 4e-39
 Identities = 95/206 (46%), Positives = 133/206 (64%), Gaps = 9/206 (4%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ------SLNSSPDSFFL 566
           MG+C+S    D +     +  +V+V+GDLR Y++P+  SQVL+      S +S   S+F+
Sbjct: 1   MGICVSFHRKDSN--SSPTVKIVTVNGDLREYNVPVIASQVLEAESAAASSSSRSSSYFI 58

Query: 565 CNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHINA 386
           C+SD LY+ DFIP++++++ L++ QIYFVLP +K Q RL ASDMAALAVKASVA+ +   
Sbjct: 59  CDSDSLYYDDFIPAIKSEEPLQADQIYFVLPISKRQNRLTASDMAALAVKASVAIQNSVK 118

Query: 385 SNRRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSS---VNNATKSSSAGLGVSRSGSVRK 215
              RR++K RISPV++    +   N   +         + + T    A  G +RSGSVR 
Sbjct: 119 KESRRRKKVRISPVMMLTGSNDSLNGNGSETTVKKGRPLVSKTAPFKASSGYNRSGSVRN 178

Query: 214 LTRYSSRRAKLAVRSFRNKLITIYEG 137
           L RY+S+RAKLAVRSFR KL TIYEG
Sbjct: 179 LRRYTSKRAKLAVRSFRLKLSTIYEG 204


>ref|XP_006390172.1| hypothetical protein EUTSA_v10019130mg [Eutrema salsugineum]
           gi|557086606|gb|ESQ27458.1| hypothetical protein
           EUTSA_v10019130mg [Eutrema salsugineum]
          Length = 213

 Score =  166 bits (421), Expect = 8e-39
 Identities = 97/213 (45%), Positives = 135/213 (63%), Gaps = 16/213 (7%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ---------SLNSSPDS 575
           MG+C+S   G        +A +V+V+GDLR Y++P+   QVL+         S +S P S
Sbjct: 1   MGLCVSV--GRSECDSSPTAKIVTVNGDLREYNVPVLAYQVLEAESMASSSSSSSSRPSS 58

Query: 574 FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDH 395
           +FLCNSD LY+ DFIP++++++ L++ QIYFVLP +K QYRL ASDMAALAVKASVA++ 
Sbjct: 59  YFLCNSDSLYYDDFIPAIESEEILQAGQIYFVLPISKRQYRLTASDMAALAVKASVAIEK 118

Query: 394 INASNRRRKRKTRISPVLVSDEQDPQ-------SNQTVNHIMTSSVNNATKSSSAGLGVS 236
                 RR+ + RISPV++ ++ + +       S +T         N  T   ++  G S
Sbjct: 119 AAGKKNRRRNRGRISPVMMLNQPNDRIVAVNGLSGETTMMQKGKLSNKTTPFKTSTSGYS 178

Query: 235 RSGSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           RSGSVRKL RY+S RAKLA RS + +L TIYEG
Sbjct: 179 RSGSVRKLRRYASGRAKLAARSIK-RLSTIYEG 210


>ref|XP_002301889.1| hypothetical protein POPTR_0002s00420g [Populus trichocarpa]
           gi|222843615|gb|EEE81162.1| hypothetical protein
           POPTR_0002s00420g [Populus trichocarpa]
          Length = 214

 Score =  165 bits (417), Expect = 2e-38
 Identities = 102/210 (48%), Positives = 133/210 (63%), Gaps = 13/210 (6%)
 Frame = -3

Query: 727 MGVCLSSQS-GDYSLHQK---SSANVVSVDGDLRTYSLPITVSQVLQS-------LNSSP 581
           MG C+SS   G +  H++    +A V+S+ GDLR Y LP  VSQVL+S        +SS 
Sbjct: 1   MGACVSSSYLGHHESHEQLRPKTAKVISIHGDLREYYLPAFVSQVLRSEIASSSSSSSSS 60

Query: 580 DSFFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVAL 401
            S+FLCNSD L + ++IP L +D  L + +IYFVLP +KLQ RLA+SDMAALAVKAS+AL
Sbjct: 61  SSWFLCNSDHLSYDEYIPVLASDVPLHADEIYFVLPNSKLQRRLASSDMAALAVKASLAL 120

Query: 400 DHIN-ASNRRRKRKTRISPV-LVSDEQDPQSNQTVNHIMTSSVNNATKSSSAGLGVSRSG 227
            + +     RR +K RISPV LVS + D Q +  +             + S  +G SRSG
Sbjct: 121 QNSSKKGGSRRGKKARISPVLLVSPDHDHQQHNVIYQKRKHEPQVQRAADSVAIGFSRSG 180

Query: 226 SVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           S R   +Y+SRRAKLAVRSF+ +L TIYEG
Sbjct: 181 SDRSFKKYTSRRAKLAVRSFKLRLTTIYEG 210


>gb|EOY15857.1| Poly polymerase 1, putative [Theobroma cacao]
          Length = 205

 Score =  164 bits (416), Expect = 3e-38
 Identities = 102/209 (48%), Positives = 135/209 (64%), Gaps = 12/209 (5%)
 Frame = -3

Query: 727 MGVC--LSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ--------SLNSSPD 578
           MG C  L+S++         +A VVS++GDL  Y++P+ VSQVLQ        S +S P 
Sbjct: 1   MGTCFSLNSRTALSESAAPPTAQVVSLNGDLHKYNIPVLVSQVLQAEAAASSSSSSSLPS 60

Query: 577 S-FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVAL 401
           S  FLCNSDRLY+ D+IP+L  D +L++ QIYFVLP +KLQ RL + DMAALAVKASVA+
Sbjct: 61  SAIFLCNSDRLYYDDYIPALDIDHQLQANQIYFVLPNSKLQQRLTSKDMAALAVKASVAI 120

Query: 400 DHINASNRRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKS-SSAGLGVSRSGS 224
            + + +   R++K RISPVL+  +  P  ++            A KS +     +SRS S
Sbjct: 121 QNSSKNESHRRKKARISPVLLVAQSLPVVDK-------DDAPTAPKSFAEPRPRLSRSAS 173

Query: 223 VRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           +RKL RY+SRRAKLAVRSFR +L TIYEG
Sbjct: 174 IRKLHRYTSRRAKLAVRSFRLRLSTIYEG 202


>ref|XP_006304055.1| hypothetical protein CARUB_v10009886mg [Capsella rubella]
           gi|482572766|gb|EOA36953.1| hypothetical protein
           CARUB_v10009886mg [Capsella rubella]
          Length = 296

 Score =  164 bits (414), Expect = 5e-38
 Identities = 96/211 (45%), Positives = 130/211 (61%), Gaps = 14/211 (6%)
 Frame = -3

Query: 727 MGVCLSSQSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ---------SLNSSPDS 575
           MG+C+S    D +     +  +V+V+GDLR Y++P+  SQVL+         S +S   S
Sbjct: 85  MGICVSFNRKDSNA--SPTVKIVTVNGDLREYNVPVVASQVLEAESAAASSSSSSSRSSS 142

Query: 574 FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDH 395
           +F+C+SD LY+ DFIP+++++  L++ QIYFVLP +K Q RL ASDMAALAVKASVA+ +
Sbjct: 143 YFICDSDSLYYDDFIPAIESETPLQADQIYFVLPVSKRQNRLTASDMAALAVKASVAIQN 202

Query: 394 INASNRRRKRKTRISPVLVSDEQDPQ-----SNQTVNHIMTSSVNNATKSSSAGLGVSRS 230
                 RR++K RISPV++    +       S  TV           T    A  G +RS
Sbjct: 203 SAGKESRRRKKVRISPVMMLTGSNDSVNGNGSESTVKKGRPFVSGYKTAPFKASSGYNRS 262

Query: 229 GSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           GSV  L RY+S+RAKLAVRSFR KL TIYEG
Sbjct: 263 GSVSNLRRYTSKRAKLAVRSFRLKLSTIYEG 293


>ref|XP_002307010.1| hypothetical protein POPTR_0005s28050g [Populus trichocarpa]
           gi|222856459|gb|EEE94006.1| hypothetical protein
           POPTR_0005s28050g [Populus trichocarpa]
          Length = 211

 Score =  163 bits (413), Expect = 6e-38
 Identities = 100/208 (48%), Positives = 130/208 (62%), Gaps = 11/208 (5%)
 Frame = -3

Query: 727 MGVCLSSQ--SGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ------SLNSSPDSF 572
           MG C SS     D    +  +A V+S+ GDLR Y LP  VSQVLQ      S +SS  S+
Sbjct: 1   MGGCFSSSFLGEDSEQVRPQTAKVISIHGDLREYYLPAFVSQVLQAEIASSSSSSSSSSW 60

Query: 571 FLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDHI 392
           FLCNSD L + ++IP+L +D  L + QIYFVLP +KLQ+RL +SDMAALAVKAS+AL + 
Sbjct: 61  FLCNSDLLLYDEYIPALDSDVPLHADQIYFVLPKSKLQHRLTSSDMAALAVKASLALQNS 120

Query: 391 NASNRRRKRKTRISPVLVSDEQDPQSNQTVNHI---MTSSVNNATKSSSAGLGVSRSGSV 221
           +  + RR +K RISPVL+ +       Q V  +            +  +  +  SRSGSV
Sbjct: 121 SNKDPRRGKKARISPVLLVNPDHEHQGQNVVKVSFDRKPKPQVQQRQPANPIVFSRSGSV 180

Query: 220 RKLTRYSSRRAKLAVRSFRNKLITIYEG 137
           RK  +Y+SRRAKLAVRSF+ +L TIYEG
Sbjct: 181 RKFQKYTSRRAKLAVRSFKLRLTTIYEG 208


>gb|EMJ12210.1| hypothetical protein PRUPE_ppa020532mg [Prunus persica]
          Length = 222

 Score =  159 bits (401), Expect = 2e-36
 Identities = 98/221 (44%), Positives = 137/221 (61%), Gaps = 24/221 (10%)
 Frame = -3

Query: 727 MGVCLSS----QSGDYSLHQKSSANVVSVDGDLRTYSLPITVSQVLQ-----SLNSSPDS 575
           MG C+SS       +  L    +A V+S++G LR Y +P+ VSQVL+     S +SS  S
Sbjct: 1   MGGCVSSVIRSSHNEELLLNSPTAKVISINGSLREYPVPVIVSQVLEAGQTASSSSSSSS 60

Query: 574 FFLCNSDRLYFGDFIPSLQADDELESAQIYFVLPTTKLQYRLAASDMAALAVKASVALDH 395
            FLCNSDRLY+ ++IP L ++DELE+ QIYF+LP +KL++RL+A+DMAALAV+AS+A   
Sbjct: 61  SFLCNSDRLYYDNYIPVLDSEDELEADQIYFILPRSKLEHRLSATDMAALAVRASLAFQD 120

Query: 394 INASN--------------RRRKRKTRISPVLVSDEQDPQSNQTVNHIMTSSVNNATKSS 257
            ++S+              RR  +K R+SPVL++           N I     ++A K  
Sbjct: 121 ASSSSSSYKTKEKKKDLHPRRNYKKARVSPVLINYANSDMDRDDFNEITIG--DSAYKGQ 178

Query: 256 -SAGLGVSRSGSVRKLTRYSSRRAKLAVRSFRNKLITIYEG 137
            S    +SRS SV+KL RY+S+RAK+AVRSFR +L TI EG
Sbjct: 179 MSQKQQISRSKSVKKLQRYTSKRAKMAVRSFRLRLTTIDEG 219

