BLASTX nr result

ID: Atractylodes22_contig00011295 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00011295
         (1453 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266577.2| PREDICTED: ZF-HD homeobox protein At4g24660-...   226   9e-57
emb|CAN72985.1| hypothetical protein VITISV_009036 [Vitis vinifera]   221   5e-55
emb|CBI17508.3| unnamed protein product [Vitis vinifera]              218   3e-54
ref|XP_002327798.1| predicted protein [Populus trichocarpa] gi|2...   216   1e-53
ref|XP_003517228.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   216   2e-53

>ref|XP_002266577.2| PREDICTED: ZF-HD homeobox protein At4g24660-like [Vitis vinifera]
          Length = 345

 Score =  226 bits (577), Expect = 9e-57
 Identities = 134/298 (44%), Positives = 168/298 (56%), Gaps = 38/298 (12%)
 Frame = +3

Query: 465  NETPDQRRLPLPQGNHYHGFKLQKQQLEAQTPRKRTPNPDPDLDXXXXXXXXXXXXXXXX 644
            ++T D R L      H+H F LQ+Q    +      P+PDPD                  
Sbjct: 56   SQTLDHRHL------HHHQFNLQQQTQHGEVG---DPDPDPDPVSATIAVSGATATPITG 106

Query: 645  XXDTVV--------------IRYRECLKNHAANMGGHVLDGCGEFMPSGEDDTPEALRCG 782
              +  V              IRYRECLKNHAA+MGGHV DGCGEFMPSGE+ T EAL+C 
Sbjct: 107  GSNPKVAAAPPHPPPQSAASIRYRECLKNHAASMGGHVFDGCGEFMPSGEEGTLEALKCA 166

Query: 783  ACECHRSFHRREVEGKSIQSG--YYTHHNHMGSSHNP------PPRAATV-------XXX 917
            AC+CHR+FHR+E++G+S  +   YYT + +  SS         PP  A +          
Sbjct: 167  ACDCHRNFHRKEIDGESQPTANCYYTCNPNTNSSRRNTIAPQLPPSHAPLPHLHQHHKYS 226

Query: 918  XXXXXXXXXXXXXXMMMAF---GGTPDESSSEDQNMFRTNANLMAQ------TSKKRFRT 1070
                          MMMAF   GG P ESSSED NMF++N  +  Q       SKKRFRT
Sbjct: 227  HGLSGSPLMSPIPPMMMAFGGGGGAPAESSSEDLNMFQSNVGMHLQPQPAFALSKKRFRT 286

Query: 1071 KFTEEQKEKMHDLAERIGWKIQKQYEQEILQVCNEVGLKRKVFKVWMHNNKQATKKNQ 1244
            KF++EQK+KM + AE++GWKIQKQ EQE+ Q C++VG+KR+VFKVWMHNNKQA KK Q
Sbjct: 287  KFSQEQKDKMQEFAEKLGWKIQKQEEQEVQQFCSDVGVKRQVFKVWMHNNKQAMKKKQ 344


>emb|CAN72985.1| hypothetical protein VITISV_009036 [Vitis vinifera]
          Length = 250

 Score =  221 bits (562), Expect = 5e-55
 Identities = 117/218 (53%), Positives = 144/218 (66%), Gaps = 24/218 (11%)
 Frame = +3

Query: 663  IRYRECLKNHAANMGGHVLDGCGEFMPSGEDDTPEALRCGACECHRSFHRREVEGKSIQS 842
            IRYRECLKNHAA+MGGHV DGCGEFMPSGE+ T EAL+C AC+CHR+FHR+E++G+S  +
Sbjct: 32   IRYRECLKNHAASMGGHVFDGCGEFMPSGEEGTLEALKCAACDCHRNFHRKEIDGESQPT 91

Query: 843  G--YYTHHNHMGSSHNP------PPRAATV-------XXXXXXXXXXXXXXXXXMMMAF- 974
               YYT + +  SS         PP  A +                        MMMAF 
Sbjct: 92   ANCYYTCNPNTNSSRRNTIAPQLPPSHAPLPHLHQXHKYSHGLSGSPLMSPIPPMMMAFG 151

Query: 975  --GGTPDESSSEDQNMFRTNANLMAQ------TSKKRFRTKFTEEQKEKMHDLAERIGWK 1130
              GG P ESSSED NMF++N  +  Q       SKKRFRTKF++EQK+KM + AE++GWK
Sbjct: 152  GGGGAPAESSSEDLNMFQSNVGMHLQPQPAFALSKKRFRTKFSQEQKDKMQEFAEKLGWK 211

Query: 1131 IQKQYEQEILQVCNEVGLKRKVFKVWMHNNKQATKKNQ 1244
            IQ Q EQE+ Q C++VG+KR+VFKVWMHNNKQA KK Q
Sbjct: 212  IQXQEEQEVQQFCSDVGVKRQVFKVWMHNNKQAMKKKQ 249


>emb|CBI17508.3| unnamed protein product [Vitis vinifera]
          Length = 410

 Score =  218 bits (555), Expect = 3e-54
 Identities = 123/273 (45%), Positives = 156/273 (57%), Gaps = 28/273 (10%)
 Frame = +3

Query: 510  HYHGFKLQKQQLEAQTPRKRTPNPDPDLDXXXXXXXXXXXXXXXXXXDTVV--------- 662
            H+H F LQ+Q    +      P+PDPD                    +  V         
Sbjct: 27   HHHQFNLQQQTQHGEVG---DPDPDPDPVSATIAVSGATATPITGGSNPKVAAAPPHPPP 83

Query: 663  -----IRYRECLKNHAANMGGHVLDGCGEFMPSGEDDTPEALRCGACECHRSFHRREVEG 827
                 IRYRECLKNHAA+MGGHV DGCGEFMPSGE+ T EAL+C AC+CHR+FHR+E++G
Sbjct: 84   QSAASIRYRECLKNHAASMGGHVFDGCGEFMPSGEEGTLEALKCAACDCHRNFHRKEIDG 143

Query: 828  KSIQSG--YYTHHNHMGSSHNP------PPRAATVXXXXXXXXXXXXXXXXXMMMAFGGT 983
            +S  +   YYT + +  SS         PP  A +                     +   
Sbjct: 144  ESQPTANCYYTCNPNTNSSRRNTIAPQLPPSHAPLPHLHQHH-------------KYSHA 190

Query: 984  PDESSSEDQNMFRTNANLMAQ------TSKKRFRTKFTEEQKEKMHDLAERIGWKIQKQY 1145
            P ESSSED NMF++N  +  Q       SKKRFRTKF++EQK+KM + AE++GWKIQKQ 
Sbjct: 191  PAESSSEDLNMFQSNVGMHLQPQPAFALSKKRFRTKFSQEQKDKMQEFAEKLGWKIQKQE 250

Query: 1146 EQEILQVCNEVGLKRKVFKVWMHNNKQATKKNQ 1244
            EQE+ Q C++VG+KR+VFKVWMHNNKQA KK Q
Sbjct: 251  EQEVQQFCSDVGVKRQVFKVWMHNNKQAMKKKQ 283


>ref|XP_002327798.1| predicted protein [Populus trichocarpa] gi|222836883|gb|EEE75276.1|
            predicted protein [Populus trichocarpa]
          Length = 341

 Score =  216 bits (550), Expect = 1e-53
 Identities = 133/328 (40%), Positives = 172/328 (52%), Gaps = 45/328 (13%)
 Frame = +3

Query: 396  SYNNTPQESSTTTAPLISSPTTRNETPDQRRLPLPQGNHYHGFKLQKQQLEAQ---TPRK 566
            S+N      S++  P  S+PT R+       LP      +     Q+QQ + Q    P+ 
Sbjct: 15   SFNPPNNRDSSSRIP--SAPTRRDHRHTDTVLPHTLDLEHQSLYQQQQQQQQQKQLNPQH 72

Query: 567  RTPNPDPDLDXXXXXXXXXXXXXXXXXXDTVV---------------------IRYRECL 683
            +   P  DLD                  +T                       IRYRECL
Sbjct: 73   QACKPTRDLDLTPDPTQATTPVATTSATNTAPTPSRSISRSPPPPPTSASSASIRYRECL 132

Query: 684  KNHAANMGGHVLDGCGEFMPSGEDDTPEALRCGACECHRSFHRREVEGKS---IQSGYYT 854
            KNHAA+MGGHVLDGCGEFMP GE+ TPE  +C ACECHRSFHRRE++G       S  Y 
Sbjct: 133  KNHAASMGGHVLDGCGEFMPGGEEGTPETFKCAACECHRSFHRREIDGAPQCVANSTCYK 192

Query: 855  HHN----------HMGSSHNPPPRAA--TVXXXXXXXXXXXXXXXXXMMMAF--GGTPDE 992
            + N           + +SH PP  A+                     MMM+F  GG   E
Sbjct: 193  NSNGKRNILPLPQQLVTSHAPPQSASLHPHQRYHHGTLSTYTTPIAPMMMSFGGGGAAAE 252

Query: 993  SSSEDQNMFRTN----ANLMAQTSKKRFRTKFTEEQKEKMHDLAERIGWKIQKQYEQEIL 1160
            SSSED NM++++    ++     SKKRFRT+F+EEQK+KM + AE++GW+IQKQ EQE+ 
Sbjct: 253  SSSEDLNMYQSDLQGQSSAQPLISKKRFRTRFSEEQKDKMMEFAEKLGWRIQKQDEQEVQ 312

Query: 1161 QVCNEVGLKRKVFKVWMHNNKQATKKNQ 1244
            Q C++VG+KRKVFKVWMHNNKQ+ KK Q
Sbjct: 313  QFCSQVGVKRKVFKVWMHNNKQSMKKKQ 340


>ref|XP_003517228.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Glycine max]
          Length = 317

 Score =  216 bits (549), Expect = 2e-53
 Identities = 127/313 (40%), Positives = 170/313 (54%), Gaps = 20/313 (6%)
 Frame = +3

Query: 369  METRERNSVSYNNTPQESSTTTAPLISSPTT--RNETPDQRRLPLPQGNHYHGFKLQKQQ 542
            +E     ++ YN  P   S++++  +SSPT   R+ +    +      N      L    
Sbjct: 11   IEIPTTTTLGYNLLPNRDSSSSSSKLSSPTVGERSSSDHDHQTHTLIFNETPHHNLYPPP 70

Query: 543  LEAQTPRKRTPNPDPDLDXXXXXXXXXXXXXXXXXXDTVVIRYRECLKNHAANMGGHVLD 722
                 P+ + P  DPDL                    T  IRYRECL+NHAA+MG HV+D
Sbjct: 71   PSLAPPQPQRPTLDPDLSTPIAPTSNPPRT------STPSIRYRECLRNHAASMGSHVVD 124

Query: 723  GCGEFMPSGEDDTPEALRCGACECHRSFHRREVEGK-SIQS----------GYYTHHNHM 869
            GCGEFM SGE+ TPE+LRC ACECHR+FHR+EVEG+   QS           YYT + H 
Sbjct: 125  GCGEFMASGEEGTPESLRCAACECHRNFHRKEVEGELQPQSLPQQHVPNYHSYYT-NKHN 183

Query: 870  GSSHNPPPRAATVXXXXXXXXXXXXXXXXXMMMAFGGTPDESSSED-------QNMFRTN 1028
            G  H P P ++++                 +MMAFGG P ESSSED       Q   +  
Sbjct: 184  GHFHYPTPSSSSLHHRLVATTTATPSLVPPVMMAFGG-PAESSSEDLINNTGAQLSVQQQ 242

Query: 1029 ANLMAQTSKKRFRTKFTEEQKEKMHDLAERIGWKIQKQYEQEILQVCNEVGLKRKVFKVW 1208
            A L   ++KKRFRTKF++ QK++M + A++I WKIQK  EQE+   C +VG+KR+VFKVW
Sbjct: 243  APLTHSSNKKRFRTKFSQHQKDRMMEFADKIDWKIQKHNEQEVQHFCTQVGVKRQVFKVW 302

Query: 1209 MHNNKQATKKNQQ 1247
            MHNNKQ +   +Q
Sbjct: 303  MHNNKQTSSSKKQ 315
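

Note on the per-hit statistics lines: the percentages printed after Identities, Positives and Gaps in the five hits above all agree with the ratio rounded down to a whole number (for example, 134/298 is about 44.97% and is reported as 44%). The sketch below is a minimal, hypothetical helper, not part of BLAST. The names STAT_RE, parse_stats and truncated_percent are illustrative assumptions, and the regular expression assumes the statistics line is laid out exactly as in this report.

```python
import re

# Matches the per-hit statistics fields of a legacy BLAST text report, e.g.
#  Identities = 134/298 (44%), Positives = 168/298 (56%), Gaps = 38/298 (12%)
# Assumption: the layout is exactly the one seen in this report.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STAT_RE.findall(line)
    }


def truncated_percent(count, total):
    """Integer percentage rounded down, using integer division."""
    return count * 100 // total


# Statistics line of the first hit (XP_002266577.2), copied from the report.
line = " Identities = 134/298 (44%), Positives = 168/298 (56%), Gaps = 38/298 (12%)"
stats = parse_stats(line)

for name, (count, total, reported) in stats.items():
    # Compare the percentage printed in the report with the recomputed one.
    print(name, f"{count}/{total}", "reported:", reported,
          "recomputed:", truncated_percent(count, total))
```

Rounding down rather than to the nearest integer is an observation about the values in this report only; other BLAST versions or output formats may behave differently.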