BLASTX nr result

ID: Atractylodes22_contig00009610

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00009610
         (2347 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   218   4e-54
ref|NP_171828.1| uncharacterized protein [Arabidopsis thaliana] ...   208   6e-51
ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arab...   207   1e-50
ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789...   207   1e-50
ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820...   202   4e-49

>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1|
            predicted protein [Populus trichocarpa]
          Length = 549

 Score =  218 bits (556), Expect = 4e-54
 Identities = 183/525 (34%), Positives = 241/525 (45%), Gaps = 14/525 (2%)
 Frame = +3

Query: 159  MGFNSVYRALQDVFPQVDSRILRAVAIEHSKXXXXXXXXXXXXXIPRLSEQSSTVISSNK 338
            MGF++VY+ L DVFPQVD+RIL+AVAIEHSK             IP LS  S+      +
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDADIAAEVVLSEVIPSLSRHSAAPSPPCE 60

Query: 339  GKSPLSFTEDGGKELVATMDPVDVISCLSEHSTEAGPSSIDGSLPGAMEDADSFGQPNSL 518
              SP S   DG  E                H   +   S+  S PG + + D  G+    
Sbjct: 61   DTSP-SLPLDGQTEQEEETG--------LRHRQVSLVKSVRSSEPGLIAEEDD-GK---- 106

Query: 519  TVVAPVSLGPLGGSSFHVANAETDTVCDGAPLLLEKFGDKSNINPDNASPDLICESGSYD 698
                 ++ G   G S H  N +                D+  + P  A+ D     G  +
Sbjct: 107  ---TELTSGVNDGDSTHQENRQ----------------DQPIVVPSGANADTNQLQGHIE 147

Query: 699  NHDQGCVKTGGENAAASLSTSVASGAEVSGSIQEEAGLALDVTTPATNNKYGHLHV--DT 872
               +   +TG  +   SL  SV S     G I EE     ++T    +    H  +  D 
Sbjct: 148  TEQE--EETGLRHRQVSLVKSVRSSE--PGLIAEEDDGKTELTGGVNDGDSTHQEIRQDQ 203

Query: 873  EMII---APASSQSVGGICQTD----LAVPQELD-----PSNSSKSLSEKDAFAIVNPED 1016
             +++   A A +  + G  ++D    L  PQ  +      S+ +  L   D    VN E+
Sbjct: 204  PVVVPSGANADTNQLQGHIESDELILLGKPQHQEGISQPGSSQTLILVSNDLLLGVNAEN 263

Query: 1017 ESIMTSMLTGSNQYCSTELLNDIIEDAGNNKKFLVSAMDSVVNLMKEVXXXXXXXXXXXX 1196
                      S QY   ELL +I+E A +NKK L SAM+SV+N+MKEV            
Sbjct: 264  --------MNSKQYRQIELLEEIVEAAKDNKKTLFSAMESVMNMMKEVELQEISAEQAKE 315

Query: 1197 XXVRGCSDILAKVDELKQALRRAKEANDMHAGEVYAEKAILATELKELQLRLFNLSDERN 1376
               RG  DIL +V++LKQ L  AKEANDMHAGEVY EKAILATE++ELQ RL +LSDER+
Sbjct: 316  EAARGGLDILVEVEKLKQMLVHAKEANDMHAGEVYGEKAILATEVRELQARLLSLSDERD 375

Query: 1377 QSLGILDEMXXXXXXXXXXXXXXXXXXXXXXXXXXQSAREALAYQESQMXXXXXXXXXXX 1556
             +L ILDEM                          ++AR ALA QE  M           
Sbjct: 376  NALAILDEMRQTLESRLAAAEELRKTAELEKLEKEETARNALAEQEIIMEKVVQESKILQ 435

Query: 1557 XXXXXXXXXQEFLMDRGRAIDILQGEISVKCQDVKLLKEKFDKGI 1691
                     QEFLMDRG  +D LQGEISV CQDV+LLKE+FD+ +
Sbjct: 436  KEAEENAKLQEFLMDRGCVVDTLQGEISVICQDVRLLKERFDERV 480


>ref|NP_171828.1| uncharacterized protein [Arabidopsis thaliana]
            gi|334182264|ref|NP_001184898.1| uncharacterized protein
            [Arabidopsis thaliana] gi|3850585|gb|AAC72125.1| ESTs
            gb|H36966, gb|R65511, gb|T42324 and gb|T20569 come from
            this gene [Arabidopsis thaliana]
            gi|332189433|gb|AEE27554.1| uncharacterized protein
            [Arabidopsis thaliana] gi|332189434|gb|AEE27555.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 571

 Score =  208 bits (529), Expect = 6e-51
 Identities = 175/522 (33%), Positives = 239/522 (45%), Gaps = 11/522 (2%)
 Frame = +3

Query: 159  MGFNSVYRALQDVFPQVDSRILRAVAIEHSKXXXXXXXXXXXXXIPRLSEQSSTVISSNK 338
            MGF SVYR+L ++FPQ+D+RILRAVAIEH K             IP  S       + + 
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLFHNFTQSS 60

Query: 339  GKSPLSFTE---DGGKELVATMDPVDVISCLSEHSTEAGPSSIDGSLPGAME-------- 485
             KS  S +E   + G E VA+     + +  S+ ST +  SS + +LP  +         
Sbjct: 61   YKSSGSISEREVEHGLEDVASRCRPFLGASGSKASTSSSSSSSE-TLPLVVTRDHNTRAL 119

Query: 486  DADSFGQPNSLTVVAPVSLGPLGGSSFHVANAETDTVCDGAPLLLEKFGDKSNINPDNAS 665
              D     N LT + P              N + D VC       E    K     +N +
Sbjct: 120  STDLVSNMNELTTLQP--------------NVDPD-VCHKDLESEEIQSVKKARGKENGN 164

Query: 666  PDLICESGSYDNHDQGCVKTGGENAAASLSTSVASGAEVSGSIQEEAGLALDVTTPATNN 845
             DL        ++ +  +    ++ A+ +S       +++ +  E+ G   D+T     N
Sbjct: 165  YDLFGRCFDVTSNAKIGLDVPEDDIASVVSLFSLDNVKLASNFWEDLGF--DITWNQAEN 222

Query: 846  KYGHLHVDTEMIIAPASSQSVGGICQTDLAVPQELDPSNSSKSLSEKDAFAIVNPEDESI 1025
                L   T       + Q   G C         L    S++SL  ++        D  I
Sbjct: 223  AVSKLVDSTPGDTMTTTQQ---GSCFEVGHGSTNLVDETSNRSLFSENG-------DTEI 272

Query: 1026 MTSMLTGSNQYCSTELLNDIIEDAGNNKKFLVSAMDSVVNLMKEVXXXXXXXXXXXXXXV 1205
              +  T S   CS + L DIIEDA +NKK L++ M++V N+M+EV               
Sbjct: 273  GDAFST-STHVCSVDQLEDIIEDAKSNKKNLLTEMETVTNIMREVELKEKDAEKSKEEAA 331

Query: 1206 RGCSDILAKVDELKQALRRAKEANDMHAGEVYAEKAILATELKELQLRLFNLSDERNQSL 1385
            RG  D L KV+ELK+ L  AKEANDMHAGEVY EK+ILATE+KEL+ RL NLS+ERN+SL
Sbjct: 332  RGGLDTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEVKELENRLLNLSEERNKSL 391

Query: 1386 GILDEMXXXXXXXXXXXXXXXXXXXXXXXXXXQSAREALAYQESQMXXXXXXXXXXXXXX 1565
             ILDEM                           SA +ALA QE+ M              
Sbjct: 392  AILDEMRGSLEIRLAAALELKKTAEKEKKDKEDSALKALAEQEANMEKVVQESKLLQQEA 451

Query: 1566 XXXXXXQEFLMDRGRAIDILQGEISVKCQDVKLLKEKFDKGI 1691
                  ++FLMDRG+ +D LQGEISV CQDVKLLKEKF+  +
Sbjct: 452  EENSKLRDFLMDRGQIVDTLQGEISVICQDVKLLKEKFENRV 493


>ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arabidopsis lyrata subsp.
            lyrata] gi|297335293|gb|EFH65710.1| hypothetical protein
            ARALYDRAFT_470307 [Arabidopsis lyrata subsp. lyrata]
          Length = 573

 Score =  207 bits (527), Expect = 1e-50
 Identities = 173/518 (33%), Positives = 244/518 (47%), Gaps = 7/518 (1%)
 Frame = +3

Query: 159  MGFNSVYRALQDVFPQVDSRILRAVAIEHSKXXXXXXXXXXXXXIPRLSEQSSTVISSNK 338
            MGF SVYR+L ++FPQ+D+RILRAVAIEH K             IP  S   S  ++ + 
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLSHNLTQSS 60

Query: 339  GKSPLSFT----EDGGKELVATMDPVDVISCLSEHSTEAGPSSIDGSLP-GAMEDADSFG 503
             KS  S +    E G +++V+   P    S     ++ +  SS   +LP   + D ++  
Sbjct: 61   NKSSGSISDREVERGLEDVVSRCRPFLGASGSKPSTSSSCSSSSSETLPLVVVRDHNTRA 120

Query: 504  QPNSLT--VVAPVSLGPLGGSSFHVANAETDTVCDGAPLLLEKFGDKSNINPDNASPDLI 677
                L   +  P +L P  G      + E++ V       L+K   K + N D       
Sbjct: 121  LSTDLVSNMNEPTNLQPNVGLDVCHKDLESEEVQS-----LKKARGKEHGNYDFFGRCFD 175

Query: 678  CESGSYDNHDQGCVKTGGENAAASLSTSVASGAEVSGSIQEEAGLALDVTTPATNNKYGH 857
             +S    N   G +    ++ A+ +S       +++    E+  L   +T     N    
Sbjct: 176  VKS----NAKLGLL-VPEDDIASVVSAISLDNIKLTSDFWED--LCFGMTWNQAENAVSK 228

Query: 858  LHVDTEMIIAPASSQSVGGICQTDLAVPQELDPSNSSKSLSEKDAFAIVNPEDESIMTSM 1037
            L   T       + Q   G C        E+D S S+  + E    ++V+   ++ +   
Sbjct: 229  LVDSTPGDTTTTTQQ---GSCF-------EVD-SGSTNLVDETSNRSLVSENGDTEIGDT 277

Query: 1038 LTGSNQYCSTELLNDIIEDAGNNKKFLVSAMDSVVNLMKEVXXXXXXXXXXXXXXVRGCS 1217
             + S   CS + L +IIEDA +NKK L++ M++V NLM+EV               RG  
Sbjct: 278  FSTSTHVCSVDHLEEIIEDAKSNKKTLLTEMETVTNLMREVELQEKDAEKSKEEAARGGL 337

Query: 1218 DILAKVDELKQALRRAKEANDMHAGEVYAEKAILATELKELQLRLFNLSDERNQSLGILD 1397
            D L KV+ELK+ L  AKEANDMHAGEVY EK+ILATE+KEL+ RL NLS+ERN+SL ILD
Sbjct: 338  DTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEVKELENRLLNLSEERNKSLTILD 397

Query: 1398 EMXXXXXXXXXXXXXXXXXXXXXXXXXXQSAREALAYQESQMXXXXXXXXXXXXXXXXXX 1577
            EM                           SA +AL  QE+ M                  
Sbjct: 398  EMRGSLEIRLATALEMKKTAEQEKKNKEDSALQALVEQEANMEKVVQESKLLQQEAEENS 457

Query: 1578 XXQEFLMDRGRAIDILQGEISVKCQDVKLLKEKFDKGI 1691
              +EFLMDRG+ +D LQGEISV CQDVKLLKEKF+  +
Sbjct: 458  KLREFLMDRGQIVDSLQGEISVICQDVKLLKEKFENRV 495


>ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789476 [Glycine max]
          Length = 603

 Score =  207 bits (526), Expect = 1e-50
 Identities = 168/533 (31%), Positives = 239/533 (44%), Gaps = 22/533 (4%)
 Frame = +3

Query: 159  MGFNSVYRALQDVFPQVDSRILRAVAIEHSKXXXXXXXXXXXXXIPRLSEQSSTVISSNK 338
            MGFNSVYR+LQ++FPQVD R+LRAVAIEH K             IP +S++    I    
Sbjct: 1    MGFNSVYRSLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVIAEVIPFMSKKLPAAIPPQH 60

Query: 339  GKSPLSF-----TEDGGKELV--ATMDPVDVISCLSEHS--TEAGPSSIDGSLPGAMEDA 491
                 S      +E+ G  L     +D V V    + HS   E   ++    +P   E  
Sbjct: 61   NNYVASLNVEVESEEEGNRLRHRQLVDDVTVGPSSAPHSISVEVIKTADYSFVPDLNEAL 120

Query: 492  DSFGQPNSLT--VVAPVSLGPLGGSSFHVANAETDTVCDGAPLLLEKFGDKSNINPDNAS 665
            D     N  T   +    +  L        N   +T+ + A  +   F  + N N +   
Sbjct: 121  DKSTMSNDGTDKFLEMNDIKELDIYQNAEDNFSGETLNEIAQEMSNGFSQEDNENFERRF 180

Query: 666  PDLICES-----------GSYDNHDQGCVKTGGENAAASLSTSVASGAEVSGSIQEEAGL 812
             D+ CE+             ++N  +      G+       ++     EV  S+ ++   
Sbjct: 181  VDVDCENLISSGICQEMEPKHNNLSKEAASNNGDGNRIGNDSNEMGWLEVVSSLVDD--- 237

Query: 813  ALDVTTPATNNKYGHLHVDTEMIIAPASSQSVGGICQTDLAVPQELDPSNSSKSLSEKDA 992
              D TT     +     ++ E   AP      G       ++  EL   +SS   +  D 
Sbjct: 238  -YDATTSHRLEECETYLIELETSEAPKVCHVQGDALNYKDSLQSELVAGSSSTGDNTSDV 296

Query: 993  FAIVNPEDESIMTSMLTGSNQYCSTELLNDIIEDAGNNKKFLVSAMDSVVNLMKEVXXXX 1172
                  ED+    +  +  +  C  +LL +II++A  NKK L S+M+S++NLM+EV    
Sbjct: 297  ------EDDIGAKNAGSQYSHVCRIDLLEEIIDEAKTNKKMLFSSMESLINLMREVELQE 350

Query: 1173 XXXXXXXXXXVRGCSDILAKVDELKQALRRAKEANDMHAGEVYAEKAILATELKELQLRL 1352
                        G S+ILA+++E K  + +A EANDMH+GEVY EKAIL TELKELQ RL
Sbjct: 351  KAAEQANMEAATGGSNILARIEEYKTMVVQANEANDMHSGEVYGEKAILTTELKELQSRL 410

Query: 1353 FNLSDERNQSLGILDEMXXXXXXXXXXXXXXXXXXXXXXXXXXQSAREALAYQESQMXXX 1532
              LSDER++SL ILDE+                          +SAR+AL  QE  +   
Sbjct: 411  LGLSDERDRSLAILDEIRHILEVRLAAAEELRKAAEQLKLEKEESARKALVEQERLVEKV 470

Query: 1533 XXXXXXXXXXXXXXXXXQEFLMDRGRAIDILQGEISVKCQDVKLLKEKFDKGI 1691
                             QEFL+DRGR +D+LQGEISV CQD+KLLKEKFD  +
Sbjct: 471  VHESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANL 523


>ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820331 [Glycine max]
          Length = 546

 Score =  202 bits (513), Expect = 4e-49
 Identities = 180/533 (33%), Positives = 237/533 (44%), Gaps = 22/533 (4%)
 Frame = +3

Query: 159  MGFNSVYRALQDVFPQVDSRILRAVAIEHSKXXXXXXXXXXXXXIPRLSEQSSTVISSNK 338
            MGFNSVYR LQ++FPQVD R+LRAVAIEH K             IP +S           
Sbjct: 1    MGFNSVYRNLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVLAEVIPFMS----------- 49

Query: 339  GKSPLSFTEDGGKELVATMDPVDVISCLSEHSTEAGPSSIDGSLPGAMEDADSFGQPNSL 518
                        K+L A + P        +H+    P  ++      +E  +   +    
Sbjct: 50   ------------KKLPAAIPP--------QHNDHGAPLDVE------VESEEEGNRLRHC 83

Query: 519  TVVAPVSLGPLGGSSFHVANAETDTVCDGAPLLLEKFGDKSNINPDNASPDLICESGSYD 698
              V  V++GP    S +  N++ DT         EKF             D I E   + 
Sbjct: 84   QRVDDVNVGPSSTLS-NGCNSKDDT---------EKF----------LGMDDIKELDIFQ 123

Query: 699  NHDQGCVKTGGENAAASLSTSVASGAEVSGSIQEEAGLALDVTTPATNNKYGHLHVDTEM 878
            N +   +       A  +S         +G IQEE            N +   +  D E 
Sbjct: 124  NAEDNFIGETLNEIAQEMS---------NGFIQEEDN---------ENFERQPVDFDCEN 165

Query: 879  IIAPASSQSVGGI-----CQTDL-----AVPQEL-----DPSNSSKSL-SEKDAFAIV-- 1004
            +I+ A    V        C+T L     +  QE+     D  NS  SL SE DA +    
Sbjct: 166  LISSADDYDVTPSHRLEECETYLIELESSEAQEVCHVQGDTLNSKDSLQSELDAGSSTAG 225

Query: 1005 -NPEDESIMTSMLTGSNQYCST---ELLNDIIEDAGNNKKFLVSAMDSVVNLMKEVXXXX 1172
             N  D        +  +QY      +LL +II++A  NKK L S+M+S++NLM+EV    
Sbjct: 226  GNTSDVENDNGAKSAGSQYSQVSRIDLLEEIIDEAKTNKKTLFSSMESLINLMREVEVQE 285

Query: 1173 XXXXXXXXXXVRGCSDILAKVDELKQALRRAKEANDMHAGEVYAEKAILATELKELQLRL 1352
                        G S+ILA+++E K  L +AKEANDMHAGEVY EKAILATELKELQ RL
Sbjct: 286  KAAEQANMEAATGGSNILARIEEYKTMLVQAKEANDMHAGEVYGEKAILATELKELQSRL 345

Query: 1353 FNLSDERNQSLGILDEMXXXXXXXXXXXXXXXXXXXXXXXXXXQSAREALAYQESQMXXX 1532
              LSDER++SL ILDEM                          +SAR+AL  QE  +   
Sbjct: 346  LGLSDERDKSLAILDEMRHILEERLAAAEESRKAAEQQKLEKEESARKALVEQERLVEMV 405

Query: 1533 XXXXXXXXXXXXXXXXXQEFLMDRGRAIDILQGEISVKCQDVKLLKEKFDKGI 1691
                             QEFL+DRGR +D+LQGEISV CQD+KLLKEKFD  +
Sbjct: 406  VHESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANL 458

