BLASTX nr result

ID: Rehmannia22_contig00025256

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00025256
         (1418 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267033.2| PREDICTED: uncharacterized protein LOC100244...   134   1e-28
ref|XP_006357694.1| PREDICTED: uncharacterized protein LOC102585...    80   2e-12
ref|XP_004243880.1| PREDICTED: uncharacterized protein LOC101266...    77   1e-11
gb|EOX93580.1| Uncharacterized protein isoform 4, partial [Theob...    68   8e-09
gb|EOX93579.1| Uncharacterized protein isoform 3 [Theobroma cacao]     68   8e-09
gb|EOX93578.1| Uncharacterized protein isoform 2 [Theobroma cacao]     68   8e-09
gb|EOX93577.1| Uncharacterized protein isoform 1 [Theobroma cacao]     68   8e-09
ref|XP_006447849.1| hypothetical protein CICLE_v10014049mg [Citr...    64   2e-07

>ref|XP_002267033.2| PREDICTED: uncharacterized protein LOC100244248 [Vitis vinifera]
          Length = 1505

 Score =  134 bits (336), Expect = 1e-28
 Identities = 139/505 (27%), Positives = 229/505 (45%), Gaps = 48/505 (9%)
 Frame = +3

Query: 3    NSHDVLNEDQSAARIEFGEANGAHETFPSPSSRKFSSNTLGAAASSRDNYKTVRSP---- 170
            + H +   +Q A  + FGE           S  K   + L +   S+ + K V+S     
Sbjct: 352  SDHQIRTPNQPAKAMTFGEMKEFTRDAHGLSRSKVEFSALNSGTLSKLDNKIVQSDFLMQ 411

Query: 171  ----------------------YQLSKVGQNGSD---SPLVASTSS-PAKRQQTLLTTA- 269
                                  Y+ S + +N      SPL  S     AKRQQ  L  A 
Sbjct: 412  HECGHLSSTEGWVKESSPKDGTYKNSNIDRNVDQRYRSPLAGSVHLLSAKRQQQFLDNAN 471

Query: 270  SPSKHWELASPLPQLGSLLGNESVEHQVTETSIQKSISKLELLERSAFSSAFSAKIDNST 449
            SP   W +     Q GS L N+ + H  +E+SIQKSISKL++LE S++ S     I  S 
Sbjct: 472  SPRNSWAVTPSKNQHGSFLSNDHMRHGDSESSIQKSISKLKILEASSYGSLRDG-IGGSK 530

Query: 450  VKTLDFLK-SPNFDSFLEKNRHISRINFVE---DSFTEEKSDSVHQSKERSYAFSTDHAR 617
            +++LD+L  +P  +  LE+N    +   ++   DS  EE   SV              A+
Sbjct: 531  LRSLDYLSATPPLNVILEENSKDPQHKHLDVPIDSL-EEHLGSV--------------AQ 575

Query: 618  AETLNHISGENNL---DEHLGLVEIGNPLNELSAKSISRDQLKYG-GTASSPSKITLSGN 785
             + +     E NL   DE  G ++ G  L+ +S   +  D+       A SPS+ T SG+
Sbjct: 576  KDGILTPKNEGNLSQTDETTGFLKDGKSLHHMSMGILQMDETTTPMAVALSPSQFTWSGH 635

Query: 786  NLMRNLFTSKHSNEDALMTE-TESLLAEI---ASGEGGKAIITSKFVSSPDRMLEKKLSA 953
             ++++ FT++ + +  L++  T S L +I    + E   +    +FVSSP + LEKKL A
Sbjct: 636  KVLQHNFTTEDTRDGTLVSSGTNSPLGKIILDCAREKKTSSTPDQFVSSPMKRLEKKLLA 695

Query: 954  SPGPQSTKSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHR-GSVSKGMKGELS 1130
            SP  Q + S+DL  Q Q  + +    +D +    F   + S ATA++  S     +G+ S
Sbjct: 696  SPEYQGSLSRDLKQQDQHNKFSFGSGQDGSTIENFAITSHSSATANKLDSPHLERRGQSS 755

Query: 1131 SPFVEVNRLKNLIEIKSTDNREADIYNGKEIFGTTDNVSTPAKEKKSQVMYS----KNLD 1298
            +PF+E+   K   ++   +++E ++++ +   G   +  TP+++  +    S    KNL 
Sbjct: 756  TPFIEIKHSKEFSQVTRMNDKEINLHDLQNESGALMDFETPSRDMDTLNHQSPSPEKNLQ 815

Query: 1299 VGNLTRRDMPWVEENLPGGESRAVS 1373
             G  + R    ++  LPGG  +A S
Sbjct: 816  TGEESTR----LKNELPGGGIKASS 836


>ref|XP_006357694.1| PREDICTED: uncharacterized protein LOC102585412 [Solanum tuberosum]
          Length = 1364

 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 97/351 (27%), Positives = 153/351 (43%), Gaps = 33/351 (9%)
 Frame = +3

Query: 204  DSPLVASTSSPAKRQQTLLTTAS-------PSKHWELASP-------------------- 302
            +SPLV   SS    Q+ ++   +       P +H++ A P                    
Sbjct: 416  ESPLVDLVSSSPATQRLIVMERTLDKFSIPPEEHFQTAFPGRSHFLFEQWVKQNSTAVSS 475

Query: 303  LPQLGSLLGNESVEHQVTETSIQKSISKLELLERSAFSSAFSAKIDNSTVKTLDFLKSPN 482
               L S L NE      +  S+QKSISKLE L+ SAFSS+   KI +  V+ L+F K+P 
Sbjct: 476  PEDLISFLSNEKRGPWTSSASLQKSISKLERLKASAFSSSRGDKIPHMGVRALEFPKTPP 535

Query: 483  FDSFLEKNRHISRINFVEDSFT---EEKSDSVHQSKERSYAFSTDHARAETLNHISGENN 653
             DS L+K     R+  ++ + T   E+ S S  +  ER  +     + ++ L+       
Sbjct: 536  LDSILKKRNLDMRVKCLDAAMTCTEEQFSGSTMKEGERKTSI-PGGSWSKALSSSEDVIQ 594

Query: 654  LDEHLGLVEIGNPLNELSAKSISRDQLKYGGTASSPSKITLSGNNLMRNLFTSKHSNED- 830
             ++  G  + G  LN L A  +  DQL      SS S+ +LSG     ++ T     +  
Sbjct: 595  CEQSFGPEKPGKSLNHLDAGILPMDQLLKPADPSSSSRFSLSGKK--NDMVTPNDLRQKI 652

Query: 831  ALMTETESLLAEIASGEGGKAIITSKFVSSPDRMLEKKLSASPGPQSTKSKDLVLQGQ-L 1007
            +L++ T+S L +  SG      I  K V +P++ L+ K +     QS+  K+  L  + L
Sbjct: 653  SLISRTDSPLVDY-SGREEVIAIAQKLVFTPEKSLQSKWTEH---QSSPFKESKLDDEHL 708

Query: 1008 KQIADLDSRDSTPERKFTDVNISKATAHRGSVSKGMKGELS-SPFVEVNRL 1157
            K    + +  S      TD     ATA     S  +  E S SP VE +++
Sbjct: 709  KSFGPVKNASSI--SNVTDGPSVTATAGNWYSSSTLTEEQSGSPVVEGSKV 757


>ref|XP_004243880.1| PREDICTED: uncharacterized protein LOC101266239 [Solanum
            lycopersicum]
          Length = 1367

 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 83/303 (27%), Positives = 133/303 (43%), Gaps = 4/303 (1%)
 Frame = +3

Query: 87   SPSSRKFSSNTLGAAASSRDNYKTVRSPYQLSKVGQNGSDSPLVASTSSPAKRQQTLLTT 266
            SP++++     +G+ +  + N   V SP  L     N    P  +S S            
Sbjct: 423  SPATQRLI--VMGSPSPVKQNSTAVSSPKDLISFLSNEKRGPWTSSAS----------LQ 470

Query: 267  ASPSKHWELASPLPQLGSLLGNESVEHQVTETSIQKSISKLELLERSAFSSAFSAKIDNS 446
             S SK   L  P+    S L N       +  S+QKSISKLE L+ SAFSS    KI + 
Sbjct: 471  KSISKLPFLEDPI----SFLSNGKRGPWTSSASLQKSISKLERLKASAFSSFGGDKIPHM 526

Query: 447  TVKTLDFLKSPNFDSFLEK---NRHISRINFVEDSFTEEKSDSVHQSKERSYAFSTDHAR 617
             V+ L+F K+P  DS L+K   +  + R++       E+ S S  +  ER   F+   + 
Sbjct: 527  GVRALEFPKTPPLDSILKKRNLDMGVKRLDAAMTCSEEQISGSTMKEGERK-TFTPGGSW 585

Query: 618  AETLNHISGENNLDEHLGLVEIGNPLNELSAKSISRDQLKYGGTASSPSKITLSGNNLMR 797
            ++ L+        ++  G  +    LN+L A  +  DQL      SS S+ +LSG     
Sbjct: 586  SKALSSSEDVIQCEQSFGPEKPEKSLNQLEAGILPMDQLLKPADPSSSSRFSLSGKK--N 643

Query: 798  NLFTSKHSNED-ALMTETESLLAEIASGEGGKAIITSKFVSSPDRMLEKKLSASPGPQST 974
            ++ T     +  +L++ T+S L +  SG      I  K V +P++ L+ K +        
Sbjct: 644  DMVTPNDLRQKISLISRTDSPLVDY-SGREEVIAIAQKLVFTPEKSLDSKCTEHQSSPFK 702

Query: 975  KSK 983
            +SK
Sbjct: 703  ESK 705


>gb|EOX93580.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 1232

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 91/347 (26%), Positives = 149/347 (42%), Gaps = 4/347 (1%)
 Frame = +3

Query: 207  SPLVASTSS-PAKRQQTLLTTASPSKHWELASPLPQL-GSLLGNESVEHQVTETSIQKSI 380
            SPL  S  S  AKRQQ LL T +  +     +P P+  GS+L   SV+   +  SI KS 
Sbjct: 444  SPLAGSIYSISAKRQQILLDTTNSPRRALFVTPSPRHPGSILSKGSVKQGGSVPSILKSN 503

Query: 381  SKLELLERSAFSSAFSAKIDNSTVKTLDFLKS--PNFDSFLEKNRHISRINFVEDSFTEE 554
            SKL++LE S  +SAF+  I  S ++  + L S    F++ +E+           +SF  +
Sbjct: 504  SKLKILEPSPCASAFNDGIVKSKLRLSESLSSRASPFNTIMEE---------PSESFQCQ 554

Query: 555  KSDSVHQSKERSYAFSTDHARAETLNHISGENNLDEHLGLVEIGNPLNELSAKSISRDQL 734
            ++++   + E   +   D  + +   H +G          ++ G        K  +    
Sbjct: 555  QANAPIINLEEQLS-GVDLKKGKV--HCNGLGTPKNISSFIQDGGTSGLGKDKEYNDKST 611

Query: 735  KYGGTASSPSKITLSGNNLMRNLFTSKHSNEDALMTETESLLAEIASGEGGKAIITSKFV 914
            +   T +SPSK T SG  +  +  TS    +  L+  T   ++E     G    + S  V
Sbjct: 612  ERMATFTSPSKFTHSGKKMGHHTLTSVELLDGTLVASTFG-ISEDKRDTGTVYKLVSPLV 670

Query: 915  SSPDRMLEKKLSASPGPQSTKSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHR 1094
               DR+   +LS++   Q T S +L LQ Q      +  R+          N    TA  
Sbjct: 671  ---DRL--NQLSSATKNQGTLSGNLKLQHQDNSTTIVSGRECNLVETVPISNYLTPTAEN 725

Query: 1095 GSVSKGMKGELSSPFVEVNRLKNLIEIKSTDNREADIYNGKEIFGTT 1235
             + S        SP V++N LK+   ++  D RE+   NG ++  T+
Sbjct: 726  RTQS-------GSPLVKINSLKDFCLVRKVDERES---NGLDLQNTS 762


>gb|EOX93579.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 1266

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 91/347 (26%), Positives = 149/347 (42%), Gaps = 4/347 (1%)
 Frame = +3

Query: 207  SPLVASTSS-PAKRQQTLLTTASPSKHWELASPLPQL-GSLLGNESVEHQVTETSIQKSI 380
            SPL  S  S  AKRQQ LL T +  +     +P P+  GS+L   SV+   +  SI KS 
Sbjct: 444  SPLAGSIYSISAKRQQILLDTTNSPRRALFVTPSPRHPGSILSKGSVKQGGSVPSILKSN 503

Query: 381  SKLELLERSAFSSAFSAKIDNSTVKTLDFLKS--PNFDSFLEKNRHISRINFVEDSFTEE 554
            SKL++LE S  +SAF+  I  S ++  + L S    F++ +E+           +SF  +
Sbjct: 504  SKLKILEPSPCASAFNDGIVKSKLRLSESLSSRASPFNTIMEE---------PSESFQCQ 554

Query: 555  KSDSVHQSKERSYAFSTDHARAETLNHISGENNLDEHLGLVEIGNPLNELSAKSISRDQL 734
            ++++   + E   +   D  + +   H +G          ++ G        K  +    
Sbjct: 555  QANAPIINLEEQLS-GVDLKKGKV--HCNGLGTPKNISSFIQDGGTSGLGKDKEYNDKST 611

Query: 735  KYGGTASSPSKITLSGNNLMRNLFTSKHSNEDALMTETESLLAEIASGEGGKAIITSKFV 914
            +   T +SPSK T SG  +  +  TS    +  L+  T   ++E     G    + S  V
Sbjct: 612  ERMATFTSPSKFTHSGKKMGHHTLTSVELLDGTLVASTFG-ISEDKRDTGTVYKLVSPLV 670

Query: 915  SSPDRMLEKKLSASPGPQSTKSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHR 1094
               DR+   +LS++   Q T S +L LQ Q      +  R+          N    TA  
Sbjct: 671  ---DRL--NQLSSATKNQGTLSGNLKLQHQDNSTTIVSGRECNLVETVPISNYLTPTAEN 725

Query: 1095 GSVSKGMKGELSSPFVEVNRLKNLIEIKSTDNREADIYNGKEIFGTT 1235
             + S        SP V++N LK+   ++  D RE+   NG ++  T+
Sbjct: 726  RTQS-------GSPLVKINSLKDFCLVRKVDERES---NGLDLQNTS 762


>gb|EOX93578.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1194

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 91/347 (26%), Positives = 149/347 (42%), Gaps = 4/347 (1%)
 Frame = +3

Query: 207  SPLVASTSS-PAKRQQTLLTTASPSKHWELASPLPQL-GSLLGNESVEHQVTETSIQKSI 380
            SPL  S  S  AKRQQ LL T +  +     +P P+  GS+L   SV+   +  SI KS 
Sbjct: 444  SPLAGSIYSISAKRQQILLDTTNSPRRALFVTPSPRHPGSILSKGSVKQGGSVPSILKSN 503

Query: 381  SKLELLERSAFSSAFSAKIDNSTVKTLDFLKS--PNFDSFLEKNRHISRINFVEDSFTEE 554
            SKL++LE S  +SAF+  I  S ++  + L S    F++ +E+           +SF  +
Sbjct: 504  SKLKILEPSPCASAFNDGIVKSKLRLSESLSSRASPFNTIMEE---------PSESFQCQ 554

Query: 555  KSDSVHQSKERSYAFSTDHARAETLNHISGENNLDEHLGLVEIGNPLNELSAKSISRDQL 734
            ++++   + E   +   D  + +   H +G          ++ G        K  +    
Sbjct: 555  QANAPIINLEEQLS-GVDLKKGKV--HCNGLGTPKNISSFIQDGGTSGLGKDKEYNDKST 611

Query: 735  KYGGTASSPSKITLSGNNLMRNLFTSKHSNEDALMTETESLLAEIASGEGGKAIITSKFV 914
            +   T +SPSK T SG  +  +  TS    +  L+  T   ++E     G    + S  V
Sbjct: 612  ERMATFTSPSKFTHSGKKMGHHTLTSVELLDGTLVASTFG-ISEDKRDTGTVYKLVSPLV 670

Query: 915  SSPDRMLEKKLSASPGPQSTKSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHR 1094
               DR+   +LS++   Q T S +L LQ Q      +  R+          N    TA  
Sbjct: 671  ---DRL--NQLSSATKNQGTLSGNLKLQHQDNSTTIVSGRECNLVETVPISNYLTPTAEN 725

Query: 1095 GSVSKGMKGELSSPFVEVNRLKNLIEIKSTDNREADIYNGKEIFGTT 1235
             + S        SP V++N LK+   ++  D RE+   NG ++  T+
Sbjct: 726  RTQS-------GSPLVKINSLKDFCLVRKVDERES---NGLDLQNTS 762


>gb|EOX93577.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 1375

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 91/347 (26%), Positives = 149/347 (42%), Gaps = 4/347 (1%)
 Frame = +3

Query: 207  SPLVASTSS-PAKRQQTLLTTASPSKHWELASPLPQL-GSLLGNESVEHQVTETSIQKSI 380
            SPL  S  S  AKRQQ LL T +  +     +P P+  GS+L   SV+   +  SI KS 
Sbjct: 444  SPLAGSIYSISAKRQQILLDTTNSPRRALFVTPSPRHPGSILSKGSVKQGGSVPSILKSN 503

Query: 381  SKLELLERSAFSSAFSAKIDNSTVKTLDFLKS--PNFDSFLEKNRHISRINFVEDSFTEE 554
            SKL++LE S  +SAF+  I  S ++  + L S    F++ +E+           +SF  +
Sbjct: 504  SKLKILEPSPCASAFNDGIVKSKLRLSESLSSRASPFNTIMEE---------PSESFQCQ 554

Query: 555  KSDSVHQSKERSYAFSTDHARAETLNHISGENNLDEHLGLVEIGNPLNELSAKSISRDQL 734
            ++++   + E   +   D  + +   H +G          ++ G        K  +    
Sbjct: 555  QANAPIINLEEQLS-GVDLKKGKV--HCNGLGTPKNISSFIQDGGTSGLGKDKEYNDKST 611

Query: 735  KYGGTASSPSKITLSGNNLMRNLFTSKHSNEDALMTETESLLAEIASGEGGKAIITSKFV 914
            +   T +SPSK T SG  +  +  TS    +  L+  T   ++E     G    + S  V
Sbjct: 612  ERMATFTSPSKFTHSGKKMGHHTLTSVELLDGTLVASTFG-ISEDKRDTGTVYKLVSPLV 670

Query: 915  SSPDRMLEKKLSASPGPQSTKSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHR 1094
               DR+   +LS++   Q T S +L LQ Q      +  R+          N    TA  
Sbjct: 671  ---DRL--NQLSSATKNQGTLSGNLKLQHQDNSTTIVSGRECNLVETVPISNYLTPTAEN 725

Query: 1095 GSVSKGMKGELSSPFVEVNRLKNLIEIKSTDNREADIYNGKEIFGTT 1235
             + S        SP V++N LK+   ++  D RE+   NG ++  T+
Sbjct: 726  RTQS-------GSPLVKINSLKDFCLVRKVDERES---NGLDLQNTS 762


>ref|XP_006447849.1| hypothetical protein CICLE_v10014049mg [Citrus clementina]
            gi|557550460|gb|ESR61089.1| hypothetical protein
            CICLE_v10014049mg [Citrus clementina]
          Length = 1371

 Score = 63.9 bits (154), Expect = 2e-07
 Identities = 97/452 (21%), Positives = 191/452 (42%), Gaps = 58/452 (12%)
 Frame = +3

Query: 237  AKRQQTLL-TTASPSKHWELASPLPQLGSLLGNESVEHQVTETSIQKSISKLELLERSAF 413
            AKR+Q  L TT +PS          +  S    E++E     ++IQKS  K+++   SA 
Sbjct: 455  AKRKQIFLDTTITPSPK--------KTSSFFSKENIEVGEKVSTIQKSHLKVKISSPSAH 506

Query: 414  SSAFSAKIDNSTVKTLDFLKSPN--FDSFLEKNRHISRINFVEDSFTE-EKSDSVHQSKE 584
            +S    +I+ S  +  ++L S     ++F+E+     +   V+ S    EK       K 
Sbjct: 507  TSVLREEIEKSKRRLSEYLSSSASPVNNFVEETGRNLQFQHVDASVMNLEKHFLSADRKN 566

Query: 585  RSYAFSTDH----------------------ARAETLNHI-------------------- 638
              +A +T+                       +  E+L+H+                    
Sbjct: 567  VEHAITTNMNGGGGGTPKNFGSLNQNKTGILSGGESLDHLFSPILSEDKQTEVTDGDGDG 626

Query: 639  SGEN--NLDEH-LGLVEIGNPLNELSAKSISRD-QLKYGGTASSPSKITLSGNNLMRNLF 806
            + EN  +L+++ +G+++ G  L+ +     S D Q +    A+SP+++T+ G   +++L 
Sbjct: 627  TPENVGSLNQNKIGVIKGGESLDHVFNPIPSEDKQTEVTAAAASPARLTMPGK--IQHLL 684

Query: 807  TSKHSNEDALMTETESLLAEIASGEGGKAIITSK----FVSSPDRMLEKKLSASPGPQST 974
             S +  +  L        AE  + +  K +  +     F+S P + L++KLS+      +
Sbjct: 685  MSNNPMQGPLAVSVSDTSAEEITLDLKKDLKVTNDFDTFMSPPMKNLDQKLSSPAETHGS 744

Query: 975  KSKDLVLQGQLKQIADLDSRDSTPERKFTDVNISKATAHRGSVSKGMKGELSSPFVEVNR 1154
             S +L    Q + + D     ++ E   +  +++    +  S++  ++    SP +E+NR
Sbjct: 745  VSGNLKHDVQSRSLVDSGLDGNSIEYATSGNHLTGTVNNLDSLAVELRTNSYSPLIEINR 804

Query: 1155 LKNLIEIKSTDNRE---ADIYNGKEIFGTTDNVSTPAKEKKSQV-MYSKNLDVGNLTRRD 1322
            L +  ++K  D+R+   + +    E       +S      K Q+    KNL + N    D
Sbjct: 805  LTDFTKVKRVDDRDIYTSALLKASETVKKFQTLSGDMNLMKFQLPTPDKNLQIAN----D 860

Query: 1323 MPWVEENLPGGESRAVSDGPVSPSGCRNLDEP 1418
                +  LPG + +A +  P SP+  R ++EP
Sbjct: 861  PSLTKGELPGEKIKASTCVPTSPNILRTINEP 892

