BLASTX nr result

ID: Atractylodes22_contig00019442

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00019442
         (1891 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]   238   2e-97
ref|XP_002520708.1| conserved hypothetical protein [Ricinus comm...   216   4e-91
ref|XP_003547032.1| PREDICTED: uncharacterized protein LOC547668...   208   7e-81
emb|CAA09794.1| NDX1 homeobox protein [Glycine max]                   208   7e-81
ref|XP_003542016.1| PREDICTED: uncharacterized protein LOC100781...   205   3e-80

>emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]
          Length = 1134

 Score =  238 bits (606), Expect(2) = 2e-97
 Identities = 150/402 (37%), Positives = 219/402 (54%), Gaps = 45/402 (11%)
 Frame = +1

Query: 589  DDVQLLREFIRRLEAQITPQQPNIHQVK------------------EIHSRGILSLPAPQ 714
            DD    R F + +++ ITP +    +++                  E  S G  S P  +
Sbjct: 637  DDCFSCRVFFKEIQSLITPTELEESKLEGSMSWDKFSRLDIGEHHQEAQSTGGCSSPLLR 696

Query: 715  RLSPEGHDRSGNQDEGVSVNLAMDQLKQL---NLKKNGENQPNNQRTNVPESSAAKHVEE 885
            + +P+  +RS N  EG S N  + ++ Q    N+ +  +    ++R +  ++   + + +
Sbjct: 697  KAAPDVTNRSANLKEGTSENSTLQEVDQFFGRNMDQADDVMRQDRRKD--KNKLGRALRD 754

Query: 886  GDRNAQNIEINGSDSSTMQLKNSVDRTNNVE--------------AVPEEEMVRSMQSEE 1023
            G+++ QN+E +GSDSS+ + KNS D+ +N E               V E+E V  + SEE
Sbjct: 755  GEKDVQNVETSGSDSSSTRGKNSTDQIDNSEFPKSNEHIKASGSGGVQEDEKVEIIPSEE 814

Query: 1024 KQLKRRKRNIMNNTQIAMIENALRDKPDMQRKAASVELWAEKLSAHGSHVTASQLKNWXX 1203
            KQ ++RKR IMN+TQ+ +IE AL D+PDMQR AA ++ WA+KLS HG  +TASQLKNW  
Sbjct: 815  KQRRKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNW-L 873

Query: 1204 XXXXXXXXXXXXDVRVPSGGDSVFTDKQGGSGTDPVSDSPKSPANKLFDTFPSAPKGTHQ 1383
                        DVRV S  DS F DKQ GSG   + DSP+SP    F    +A  GTHQ
Sbjct: 874  NNRKARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAP-STARGGTHQ 932

Query: 1384 KDARGAVTR----------NERSEPVPKDSFKREHGQYVMLTDGKGEEIGKGCIHLAKGI 1533
                G+V+R           E  +  P +  +RE GQYV+L DG+G++IGKG +H  +G 
Sbjct: 933  SAIGGSVSRAGADNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGKVHQVQGK 992

Query: 1534 WFGINLEELGLCVVDVTYLKVDASTQLLHPCDATGTTFAQAE 1659
            W+G NLEE   CVVDV  LK +  ++L HP + TGT+F +AE
Sbjct: 993  WYGKNLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAE 1034



 Score =  146 bits (369), Expect(2) = 2e-97
 Identities = 93/204 (45%), Positives = 116/204 (56%), Gaps = 23/204 (11%)
 Frame = +2

Query: 2    LDLAKSTVSEVLEVVKMMFC-DLNGVSASSFKGHPRGILQLNAMRLTEILSDDSNFQSTI 178
            LDLAKS   EVLE++K  F  D   +S  S K HP G+LQLNAMRL +I SDDSNF+S I
Sbjct: 422  LDLAKSIALEVLELLKTAFGGDQKYLSGGSEKTHPTGLLQLNAMRLADIFSDDSNFRSFI 481

Query: 179  AFNL-----------TEALTTVFLLPQGEFLSSWCSSAREPLEEEVTLDYDSLSASGRVL 325
                           TE L  +F LP GEFLSSWCSS     EE+ +L+YD   A+G VL
Sbjct: 482  TVYFVYDHAICISFQTEVLAAIFSLPHGEFLSSWCSSDLPVREEDASLEYDPFVAAGWVL 541

Query: 326  GXXXXXXXXXXXXXXX-----RAPQTPYAHQRISLLVKIVANLSCFVPDLCKEKE-GLFL 487
                                    Q PYAHQR SLLVK++ANL CFVP++C+E+E  LFL
Sbjct: 542  DSFSSPDLLNLMSSESTFIQNNMSQAPYAHQRTSLLVKVIANLHCFVPNICEEQEKDLFL 601

Query: 488  NLFLQCL-----RVAHSGGAERAS 544
            +  L+CL     R + S  A++A+
Sbjct: 602  HKCLECLQMERPRFSFSSDAQKAA 625


>ref|XP_002520708.1| conserved hypothetical protein [Ricinus communis]
            gi|223540093|gb|EEF41670.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 957

 Score =  216 bits (551), Expect(2) = 4e-91
 Identities = 147/428 (34%), Positives = 212/428 (49%), Gaps = 46/428 (10%)
 Frame = +1

Query: 544  LFVHAKSLVPGYLNEDDVQLLREFIRRLEAQITPQQPNIHQVKEIHSRGILSLPAPQRLS 723
            L  HA+SL+P +LNE+DVQLLR F  +L++ I       +QV+EI     +SL    +L 
Sbjct: 525  LLSHAESLIPNFLNEEDVQLLRVFFNQLQSLINTADFEQNQVQEIKFERSISLEKFCKLD 584

Query: 724  PEGHDRSGNQDEGVSVNLAMDQLKQLNLKKNGENQPNNQRTNVPESSAA---KHVEEGD- 891
               H +      G S  L+  +L   N+  N + + +     + E   +   +H++ GD 
Sbjct: 585  INEHQQEAQSTGGYSSALSKKELSNRNISSNRKEEISENSAFLEEEQLSFRNEHMKYGDD 644

Query: 892  ---------------------RNAQNIEINGSDSSTMQLKNSVD--------------RT 966
                                 R+ QNIE +GSD+S+ + KN                 + 
Sbjct: 645  AMREEKDKSGGTASTIKREIDRDFQNIETSGSDTSSTRGKNFAGQLGNSDFPKSSEHKKE 704

Query: 967  NNVEAVPEEEMVRSMQSEEKQLKRRKRNIMNNTQIAMIENALRDKPDMQRKAASVELWAE 1146
            N ++ V E E V ++Q EEKQ ++RKR IMN  Q+++IE AL D+PDM R AAS++ WA+
Sbjct: 705  NGLQGVQEGEKVETIQFEEKQPRKRKRTIMNEYQMSLIEEALVDEPDMHRNAASLQSWAD 764

Query: 1147 KLSAHGSHVTASQLKNW-XXXXXXXXXXXXXXDVRVPSGGDSVFTDKQGGSGTDPVSDSP 1323
            KLS HGS VT+SQLKNW               DVR P   D   ++KQ         DS 
Sbjct: 765  KLSLHGSEVTSSQLKNWLNNRKARLARAGAGKDVRTPMEVDHALSEKQSVPALRHSHDSS 824

Query: 1324 KSPANKLFDTFPSAPKGTHQKDARGAVTRNERSEPV------PKDSFKREHGQYVMLTDG 1485
            +S          + P G     AR     N              +  + + GQYV+L D 
Sbjct: 825  ESHGE------VNVPAGARLSTARIGSAENAEISLAQFFGIDAAELVQCKPGQYVVLVDK 878

Query: 1486 KGEEIGKGCIHLAKGIWFGINLEELGLCVVDVTYLKVDASTQLLHPCDATGTTFAQAEMI 1665
            +G+EIGKG ++  +G W+G +LEE   CVVDVT LK +   +L +P +ATGT+F++AE  
Sbjct: 879  QGDEIGKGKVYQVQGKWYGKSLEESETCVVDVTELKAERWVRLPYPSEATGTSFSEAETK 938

Query: 1666 LDRKRVLW 1689
            L   RVLW
Sbjct: 939  LGVMRVLW 946



 Score =  147 bits (370), Expect(2) = 4e-91
 Identities = 84/177 (47%), Positives = 109/177 (61%), Gaps = 7/177 (3%)
 Frame = +2

Query: 5   DLAKSTVSEVLEVVKMMFC-DLNGVSASSFKGHPRGILQLNAMRLTEILSDDSNFQSTIA 181
           DLAKS   EVLE++K     D   ++ASS +  P G+L+LNAMRL +I SDDSNF+S I 
Sbjct: 321 DLAKSVALEVLELLKAALSKDPKHLTASSERTFPMGLLRLNAMRLADIFSDDSNFRSYIT 380

Query: 182 FNLTEALTTVFLLPQGEFLSSWCSSAREPLEEEVTLDYDSLSASGRVLG-----XXXXXX 346
              T+ LT +F LP GEFLS WCSS     EE+ TL++D   A+G VL            
Sbjct: 381 TCFTKVLTAIFSLPHGEFLSIWCSSELPLREEDATLEFDIFIAAGWVLDTISSLNLSNAL 440

Query: 347 XXXXXXXXXRAPQTPYAHQRISLLVKIVANLSCFVPDLCKEKE-GLFLNLFLQCLRV 514
                      PQ  YAHQR SL VK++ANL CFVP++C+E+E  LFL+ FL+C+R+
Sbjct: 441 NSEITLIPSNMPQATYAHQRTSLFVKVIANLHCFVPNICEEQERNLFLHKFLECMRM 497


>ref|XP_003547032.1| PREDICTED: uncharacterized protein LOC547668 [Glycine max]
          Length = 1080

 Score =  208 bits (529), Expect(2) = 7e-81
 Identities = 152/427 (35%), Positives = 213/427 (49%), Gaps = 45/427 (10%)
 Frame = +1

Query: 544  LFVHAKSLVPGYLNEDDVQLLREFIRRLEAQITPQQPNIHQVKEIHSRGILSLP------ 705
            L  HA+SL+P +LN +DVQLLR F   L++  T      +QV++      LS        
Sbjct: 659  LLSHAESLIPNFLNVEDVQLLRVFFGELQSLFTSTGFGENQVQDSKFDESLSWDKLSKFN 718

Query: 706  -------------APQRLSPEGH----DRSGNQDEGVSVNLAMDQLKQLNLKKNGENQP- 831
                          P  L+ + H     + GN  EG+S N A   + Q N +    NQ  
Sbjct: 719  MNEHYQEAQSAGGCPPSLTGKEHASLNKKGGNFKEGMSENSAFPDMDQHNTRAEETNQGK 778

Query: 832  --NNQRT----NVPESSAAKHVEEGDRNAQNIEINGSDSSTMQLKNSVDRTNNVEAVPEE 993
              N Q       +P  +A+    E D++AQN+E +GSDSS+ + KN VD  +N E     
Sbjct: 779  GLNKQNQVDDKGIPGKTASGGAREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSN 838

Query: 994  EMVRSMQSEEK---------QLKRRKRNIMNNTQIAMIENALRDKPDMQRKAASVELWAE 1146
            E ++    EE          Q ++RKR IMN+ Q+ +IE AL+D+PDMQR AAS++ WA+
Sbjct: 839  ERLKRTAVEENPEDEKIELSQRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWAD 898

Query: 1147 KLSAHGSHVTASQLKNWXXXXXXXXXXXXXXDVRVPSGGDSVFTDKQGGSGTDPVS---D 1317
            KLS HGS VT+SQLKNW              DV+  +G D+   +KQ G    PV    D
Sbjct: 899  KLSGHGSEVTSSQLKNW-LNNRKARLARTARDVKAAAGDDNPVPEKQRG----PVPGSYD 953

Query: 1318 SPKSPANKLFDTFPSAPKGTHQKDARGAVTRNERSEPVPKDSFKREH---GQYVMLTDGK 1488
            SP SP +            +H         ++E +  V   S +  H   GQ V+L   +
Sbjct: 954  SPGSPGDV-----------SHVARIASGDNKSELARFVDIGSPEFGHCNAGQNVVLVGVR 1002

Query: 1489 GEEIGKGCIHLAKGIWFGINLEELGLCVVDVTYLKVDASTQLLHPCDATGTTFAQAEMIL 1668
            G+EIG+G +    G W+G +LEEL   VVD++ LK D   +L +P +ATG TFA+AE  L
Sbjct: 1003 GDEIGRGKVFQVHGKWYGKSLEELSAHVVDISELKADKGMRLPYPSEATGNTFAEAETKL 1062

Query: 1669 DRKRVLW 1689
               RVLW
Sbjct: 1063 GVMRVLW 1069



 Score =  121 bits (303), Expect(2) = 7e-81
 Identities = 70/176 (39%), Positives = 99/176 (56%), Gaps = 5/176 (2%)
 Frame = +2

Query: 2   LDLAKSTVSEVLEVVKMMFCDLNGVSASSFKGHPRGILQLNAMRLTEILSDDSNFQSTIA 181
           LDLAKS   EV +++K  F    G   ++ +  P G +QLNAMRL +I SDDSNF+S + 
Sbjct: 457 LDLAKSVALEVFDLLKKAFGRDPG-HLTADRSFPMGFVQLNAMRLADIFSDDSNFRSYMI 515

Query: 182 FNLTEALTTVFLLPQGEFLSSWCSSAREPLEEEVTLDYDSLSASGRVLG----XXXXXXX 349
              T+ LT +  L  G+FLS WCSS     EE+ +++YD  +A G +L            
Sbjct: 516 LCFTKVLTAIISLSHGDFLSCWCSSNLSETEEDASIEYDIFAAVGWILDNTSPDVRNATN 575

Query: 350 XXXXXXXXRAPQTPYAHQRISLLVKIVANLSCFVPDLCKEKE-GLFLNLFLQCLRV 514
                     P+  YAH R SL VK  ANL CFVP++C+E+E  LF+   ++CL++
Sbjct: 576 LEFNLIPNSMPKASYAHHRTSLFVKFFANLHCFVPNICEEQERNLFVLKVMECLQM 631


>emb|CAA09794.1| NDX1 homeobox protein [Glycine max]
          Length = 626

 Score =  208 bits (529), Expect(2) = 7e-81
 Identities = 152/427 (35%), Positives = 213/427 (49%), Gaps = 45/427 (10%)
 Frame = +1

Query: 544  LFVHAKSLVPGYLNEDDVQLLREFIRRLEAQITPQQPNIHQVKEIHSRGILSLP------ 705
            L  HA+SL+P +LN +DVQLLR F   L++  T      +QV++      LS        
Sbjct: 205  LLSHAESLIPNFLNVEDVQLLRVFFGELQSLFTSTGFGENQVQDSKFDESLSWDKLSKFN 264

Query: 706  -------------APQRLSPEGH----DRSGNQDEGVSVNLAMDQLKQLNLKKNGENQP- 831
                          P  L+ + H     + GN  EG+S N A   + Q N +    NQ  
Sbjct: 265  MNEHYQEAQSAGGCPPSLTGKEHASLNKKGGNFKEGMSENSAFPDMDQHNTRAEETNQGK 324

Query: 832  --NNQRT----NVPESSAAKHVEEGDRNAQNIEINGSDSSTMQLKNSVDRTNNVEAVPEE 993
              N Q       +P  +A+    E D++AQN+E +GSDSS+ + KN VD  +N E     
Sbjct: 325  GLNKQNQVDDKGIPGKTASGGAREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSN 384

Query: 994  EMVRSMQSEEK---------QLKRRKRNIMNNTQIAMIENALRDKPDMQRKAASVELWAE 1146
            E ++    EE          Q ++RKR IMN+ Q+ +IE AL+D+PDMQR AAS++ WA+
Sbjct: 385  ERLKRTAVEENPEDEKIELSQRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWAD 444

Query: 1147 KLSAHGSHVTASQLKNWXXXXXXXXXXXXXXDVRVPSGGDSVFTDKQGGSGTDPVS---D 1317
            KLS HGS VT+SQLKNW              DV+  +G D+   +KQ G    PV    D
Sbjct: 445  KLSGHGSEVTSSQLKNW-LNNRKARLARTARDVKAAAGDDNPVPEKQRG----PVPGSYD 499

Query: 1318 SPKSPANKLFDTFPSAPKGTHQKDARGAVTRNERSEPVPKDSFKREH---GQYVMLTDGK 1488
            SP SP +            +H         ++E +  V   S +  H   GQ V+L   +
Sbjct: 500  SPGSPGDV-----------SHVARIASGDNKSELARFVDIGSPEFGHCNAGQNVVLVGVR 548

Query: 1489 GEEIGKGCIHLAKGIWFGINLEELGLCVVDVTYLKVDASTQLLHPCDATGTTFAQAEMIL 1668
            G+EIG+G +    G W+G +LEEL   VVD++ LK D   +L +P +ATG TFA+AE  L
Sbjct: 549  GDEIGRGKVFQVHGKWYGKSLEELSAHVVDISELKADKGMRLPYPSEATGNTFAEAETKL 608

Query: 1669 DRKRVLW 1689
               RVLW
Sbjct: 609  GVMRVLW 615



 Score =  121 bits (303), Expect(2) = 7e-81
 Identities = 70/176 (39%), Positives = 99/176 (56%), Gaps = 5/176 (2%)
 Frame = +2

Query: 2   LDLAKSTVSEVLEVVKMMFCDLNGVSASSFKGHPRGILQLNAMRLTEILSDDSNFQSTIA 181
           LDLAKS   EV +++K  F    G   ++ +  P G +QLNAMRL +I SDDSNF+S + 
Sbjct: 3   LDLAKSVALEVFDLLKKAFGRDPG-HLTADRSFPMGFVQLNAMRLADIFSDDSNFRSYMI 61

Query: 182 FNLTEALTTVFLLPQGEFLSSWCSSAREPLEEEVTLDYDSLSASGRVLG----XXXXXXX 349
              T+ LT +  L  G+FLS WCSS     EE+ +++YD  +A G +L            
Sbjct: 62  LCFTKVLTAIISLSHGDFLSCWCSSNLSETEEDASIEYDIFAAVGWILDNTSPDVRNATN 121

Query: 350 XXXXXXXXRAPQTPYAHQRISLLVKIVANLSCFVPDLCKEKE-GLFLNLFLQCLRV 514
                     P+  YAH R SL VK  ANL CFVP++C+E+E  LF+   ++CL++
Sbjct: 122 LEFNLIPNSMPKASYAHHRTSLFVKFFANLHCFVPNICEEQERNLFVLKVMECLQM 177


>ref|XP_003542016.1| PREDICTED: uncharacterized protein LOC100781915 [Glycine max]
          Length = 945

 Score =  205 bits (522), Expect(2) = 3e-80
 Identities = 149/425 (35%), Positives = 211/425 (49%), Gaps = 43/425 (10%)
 Frame = +1

Query: 544  LFVHAKSLVPGYLNEDDVQLLREFIRRLEAQITPQQPNIHQVK----------------- 672
            L  HA+SL+P +LN +DVQLLR F   L++  T      +QV+                 
Sbjct: 520  LLSHAESLIPNFLNVEDVQLLRVFFGELQSLFTSTGFGENQVQDSKFEESLYWDKLSKFN 579

Query: 673  --EIHSRGILSLPAPQRLSPEGH----DRSGNQDEGVSVNLAMDQLKQLNLKKNGENQPN 834
              E + +   +   P  L+ + H     + GN  EG+S N A   + Q N +    NQ  
Sbjct: 580  RNEHYQKAQSAGGCPSSLTGKEHADLNKKGGNFKEGMSENSAFPDMDQHNTRAEDTNQGK 639

Query: 835  --NQRTNVPESSAAKHVEEG-----DRNAQNIEINGSDSSTMQLKNSVDRTNNVEAVPEE 993
              N+   V +   A     G     D++AQN+E +GSDSS+ + KN VD  +N E     
Sbjct: 640  GLNRLNQVDDKGIAGKTASGGAREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSN 699

Query: 994  EMVRSMQSEEK---------QLKRRKRNIMNNTQIAMIENALRDKPDMQRKAASVELWAE 1146
            E ++    EE          Q ++RKR IMN+ Q+ +IE AL+D+PDMQR AAS++ WA+
Sbjct: 700  ERLKRTAVEENPEDEKIELSQRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWAD 759

Query: 1147 KLSAHGSHVTASQLKNWXXXXXXXXXXXXXXDVRVPSGGDSVFTDKQGGSGTDPVS---D 1317
            KLS HGS VT+SQLKNW              DV+  +G D+   DKQ G    PV    D
Sbjct: 760  KLSGHGSEVTSSQLKNW-LNNRKARLARTARDVKAAAGDDNPVPDKQRG----PVPGSYD 814

Query: 1318 SPKSPANKLFDTFPSAPKGTHQKDARGAVTRNERSEPVPKDSFKR-EHGQYVMLTDGKGE 1494
            SP SP +           G ++ +   A+    R   +    F     GQYV+L   + +
Sbjct: 815  SPGSPGD--VSHVARIASGDNKSEPSLALA---RFVDIGSPEFGHCNAGQYVVLVGVRQD 869

Query: 1495 EIGKGCIHLAKGIWFGINLEELGLCVVDVTYLKVDASTQLLHPCDATGTTFAQAEMILDR 1674
            EIG+G +    G W+G +L+EL   VVD++ LK D   +L +P +ATG TFA+AE  L  
Sbjct: 870  EIGRGKVFQVHGKWYGKSLDELSAHVVDISELKADKGMRLPYPSEATGNTFAEAETKLGV 929

Query: 1675 KRVLW 1689
             RVLW
Sbjct: 930  MRVLW 934



 Score =  122 bits (305), Expect(2) = 3e-80
 Identities = 71/176 (40%), Positives = 100/176 (56%), Gaps = 5/176 (2%)
 Frame = +2

Query: 2   LDLAKSTVSEVLEVVKMMFCDLNGVSASSFKGHPRGILQLNAMRLTEILSDDSNFQSTIA 181
           LDLAKS   EV +++K  F    G   ++ +  P G +QLNAMRL +I SDDSNF+S + 
Sbjct: 318 LDLAKSVALEVFDLLKKTFGRDPG-HLTADRSFPMGFVQLNAMRLADIFSDDSNFRSYMI 376

Query: 182 FNLTEALTTVFLLPQGEFLSSWCSSAREPLEEEVTLDYDSLSASGRVLG----XXXXXXX 349
              T+ LT +  L  G+FLS WCSS    +EE+ +L+YD  +A G +L            
Sbjct: 377 LCFTKVLTAIISLSHGDFLSCWCSSNLLKMEEDASLEYDIFAAVGWILDYTSLDVRNATN 436

Query: 350 XXXXXXXXRAPQTPYAHQRISLLVKIVANLSCFVPDLCKEKE-GLFLNLFLQCLRV 514
                     P+  YAH R SL VK  ANL CFVP++C+E+E  LF+   ++CL++
Sbjct: 437 LEFNLIPNSMPKASYAHHRTSLFVKFFANLHCFVPNICEEQERNLFVLKVMECLQM 492

