BLASTX nr result

ID: Atractylodes21_contig00020815 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00020815
         (1599 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAD10638.1| PBF68 protein [Nicotiana tabacum]                     332   2e-88
ref|XP_004155250.1| PREDICTED: uncharacterized LOC101205072 [Cuc...   330   7e-88
ref|XP_003553790.1| PREDICTED: uncharacterized protein LOC100800...   330   7e-88
ref|XP_004134299.1| PREDICTED: uncharacterized protein LOC101205...   329   1e-87
ref|XP_002301412.1| predicted protein [Populus trichocarpa] gi|2...   318   3e-84

>emb|CAD10638.1| PBF68 protein [Nicotiana tabacum]
          Length = 594

 Score =  332 bits (851), Expect = 2e-88
 Identities = 203/530 (38%), Positives = 281/530 (53%), Gaps = 78/530 (14%)
 Frame = -1

Query: 1560 EPHQGKLEETVIRILKAANLEIATELSVRKEAEKLLGVDLSDLASKRLVRRILESFLLSY 1381
            E  + K+ E V+ ILK A++E ATE SVR    + LG ++ ++  K+ +R ++ESFLLS 
Sbjct: 7    EHKRRKIREVVLDILKTADIETATEYSVRTTVAQQLGTEILNIQEKQFIRHVIESFLLST 66

Query: 1380 --SPXXXXXXXXXXXXXVQTGKFPVDDRRIA---------------------CDDGGGRV 1270
              +P                  F  +++  A                      ++   R 
Sbjct: 67   VENPTLDNNRRISTAEKGVNTDFVAEEQLSADHPPTQHQEADGSLPNGNLVDSNENNCRT 126

Query: 1269 ICELPGMRRVSIKKFRGTKLVSIREYYQKEGKVFPSGRGITLNPEQWSAFRSSFPDIEAA 1090
            IC+L   R V I    G   V+IR++Y+K+GK+ PS RGI L+ +QWS+FRSSFP I  A
Sbjct: 127  ICKLSDKRSVGILDIHGKPFVAIRDFYEKDGKLVPSSRGINLSVQQWSSFRSSFPAIVEA 186

Query: 1089 ITKMQAGIRG-------------EGMGKKQTDAEAS-NPSTSLASEPRG----------- 985
            I  M+  IR              +G  + QT+   S N      S  R            
Sbjct: 187  IATMELKIRSTTCENQTAADVAAQGREQIQTNISQSVNHQEGKLSADRNENGDDVSNSAI 246

Query: 984  ---------LEKLQIEAGTSICPP------------------DPLIPIATIRFTGRNYYC 886
                     +E+ Q EAG S   P                    L+P+ TIR  G+NYYC
Sbjct: 247  ITNSQVQMPIERQQTEAGISNSAPCFAPQGQIQQSSRTTSLAHSLVPVKTIRLDGKNYYC 306

Query: 885  WKRQMEFFLKQLKVSYVLVESCPKIPVSSEASFEEISQSKSRAQKWMNDDYICRHSILNS 706
            WK Q EFFLKQL ++YVL E CP               +    QKW++DDY+C H+ILNS
Sbjct: 307  WKHQAEFFLKQLNIAYVLSEPCPN--------------TLENRQKWVDDDYLCCHNILNS 352

Query: 705  LSDQLFDRFSVKTLNARELWDELKSLYADDFGTQRSHVNSYIQFQMVDGISILEQVQELH 526
            LSD+LF+ +S K  +A+ELW+EL+S Y +DFGT+ S VN Y+QF MVDGISILEQVQELH
Sbjct: 353  LSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKYLQFLMVDGISILEQVQELH 412

Query: 525  RIAGIITTSGIHIDENFHVSVIISKLPPSWKHVRAKLMQEEYLPLDKLIYRLKDEEDSRS 346
            +IA  +  SGI IDENFH+S II+KLPPSWK  R +LM E    LD L++ L+ E+D R+
Sbjct: 413  KIADSLMASGIWIDENFHISAIIAKLPPSWKDCRTRLMHENVPSLDMLMHHLRVEDDCRN 472

Query: 345  --QREKGGRNIGPKKRDM-RGLCFGCHKEGHKRQDCPLTRSNQRDQNHNG 205
              + +K  + +G +K+D+ +  C+ C KEGH  + C   R+ Q  +  NG
Sbjct: 473  RYRNDKHEKRVGARKKDLSKKQCYNCGKEGHISKYC-TERNYQGCEKSNG 521


>ref|XP_004155250.1| PREDICTED: uncharacterized LOC101205072 [Cucumis sativus]
          Length = 468

 Score =  330 bits (846), Expect = 7e-88
 Identities = 197/475 (41%), Positives = 268/475 (56%), Gaps = 36/475 (7%)
 Frame = -1

Query: 1545 KLEETVIRILKAANLEIATELSVRKEAEKLLGVDLSDLASKRLVRRILESFLLSYSPXXX 1366
            ++EE VI +LK +++E  TE  VR + E+ LG+DLS+   K LVR ++ESFLLS S    
Sbjct: 8    RIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLLSMSERVC 67

Query: 1365 XXXXXXXXXXVQTGKFPVDDRRIA---CDDGGGRVICELPGMRRVSIKKFRGTKLVSIRE 1195
                      V+     V+ + +     +D G  +IC L   R V+I KF+G  +VS+R+
Sbjct: 68   MWKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGAPMVSVRQ 127

Query: 1194 YYQKEGKVFPSGRGITLNPEQWSAFRSSFPDIEAAITKMQAGIRGEGMGKKQTDAEASNP 1015
            YY+K+GK  P+ +GI++  EQWS F+S+ P I  AI +M+   R E   +K  DA  SNP
Sbjct: 128  YYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEK-IDA-FSNP 185

Query: 1014 STSLASEPRGLEKLQIEAGTSICPPDPLIPIATIRFTGRNYYCWKRQMEFFLKQLKVSYV 835
            +T + S                    P  PI TIRF G+NY  W  QME  L+ LK++YV
Sbjct: 186  TTRVTS--------------------PKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYV 225

Query: 834  LVESCPKIPVSSEASFEEISQSKSRAQKWMNDDYICRHSILNSLSDQLFDRFSVKTLNAR 655
            L   CP   +  E+S    +QSK+  QKWM DD++CR +ILNSLSD+LF+ +S KT++A 
Sbjct: 226  LSNQCPTAVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSAS 285

Query: 654  ELWDELKSLY-ADDFGTQRSHVNSYIQFQMVDGISILEQVQELHRIAGIITTSGIHIDEN 478
            ELW ELK LY  ++FGT+RS V  Y++F+MV+  SILEQV+EL+ IA  I +SG  IDE+
Sbjct: 286  ELWKELKLLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDED 345

Query: 477  FHVSVIISKLPPSWKHVRAKLMQEEYLPLDKLIYRLKDEEDSRSQREK--GGRNIGPKKR 304
            FHVS IISKLP SWK+V   LM E+YLPL KL  RL+ EE  R+Q+     G +  P  R
Sbjct: 346  FHVSAIISKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPR 405

Query: 303  DMRG------------------------------LCFGCHKEGHKRQDCPLTRSN 229
                                              LC  C KEGH   +CP  + N
Sbjct: 406  GQHHAANHPSKMGDPKPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVN 460


>ref|XP_003553790.1| PREDICTED: uncharacterized protein LOC100800963 [Glycine max]
          Length = 515

 Score =  330 bits (846), Expect = 7e-88
 Identities = 197/488 (40%), Positives = 284/488 (58%), Gaps = 31/488 (6%)
 Frame = -1

Query: 1569 MEIEPHQGKLEETVIRILKAANLEIATELSVRKEAEKLLGVDLSDLASKRLVRRILESFL 1390
            ME E  + K+EE V+ ILK +N+E ATE ++R  A + LG+DLSD  S+  VR ++ES+L
Sbjct: 1    METETRR-KVEEMVLDILKKSNIEEATEFTIRVAASERLGIDLSDSPSRHFVRTVVESYL 59

Query: 1389 LSYSPXXXXXXXXXXXXXVQTGKFPVDDRR---IACD----DGGGRVICELPGMRRVSIK 1231
            LS +                  K   D ++   +A      D   RVIC+L   R +++K
Sbjct: 60   LSVAANEISKDAEKKENEDIAAKNDDDVKKGDVVAVPKLKRDDPERVICQLSSRRNLAVK 119

Query: 1230 KFRGTKLVSIREYYQKEGKVFPSGRGITLNPEQWSAFRSSFPDIEAAITKMQAGIRGEGM 1051
             F+GT LVSIRE+Y K+GK+ P  +GI+L+ EQWS F+ S P IE AI KM+  IR E  
Sbjct: 120  HFKGTTLVSIREFYMKDGKLLPGSKGISLSSEQWSTFKKSVPAIEEAIKKMEGRIRLEPN 179

Query: 1050 GKKQTDAEASNPSTSLASEPRGLE------------------KLQIEAGTSICPP---DP 934
            GK+  DA  SN +  +A EP G +                  K   +A  S+      +P
Sbjct: 180  GKQNGDA--SNSAVDVALEPNGKQNGDASNSVVDVAPLEPHGKQNGDASNSVVDVAALEP 237

Query: 933  LIPIATIRFTGRNYYCWKRQMEFFLKQLKVSYVLVESCPKIPVSSEASFEEISQSKSRAQ 754
            ++PI  IR  G+N+  W RQME  LKQLKV YVL E CP   +   A  E+I+ +K+  +
Sbjct: 238  VVPIEVIRLDGKNFQSWARQMELLLKQLKVDYVLDEPCPNPTLGESAKAEDIATAKAAER 297

Query: 753  KWMNDDYICRHSILNSLSDQLFDRFSVKTLNARELWDELKSLYA-DDFGTQRSHVNSYIQ 577
            +W+NDD  C  +IL+ LSD L++ ++ + L+A++LW+ELK +Y  ++FGT+R HV  Y++
Sbjct: 298  RWLNDDLTCHRNILSHLSDPLYNLYANRKLSAKDLWEELKLVYLYEEFGTKRYHVKKYLE 357

Query: 576  FQMVDGISILEQVQELHRIAGIITTSGIHIDENFHVSVIISKLPPSWKHVRAKLMQEEYL 397
            FQMV+  +++EQ++EL+ +A  I  +G+ ID+NFHVS IISKLPPSWK    KLM+EEYL
Sbjct: 358  FQMVEEKAVIEQIRELNGMADSIAAAGMFIDDNFHVSAIISKLPPSWKDFCIKLMREEYL 417

Query: 396  PLDKLIYRLKDEEDSRSQREKGGRNIGPKKRDMRGLCFGCHKEGHKRQDC-PLTRSNQRD 220
            P  KL+ R++ EE+ R     G + +      M G     +  GH+R D  PL     R 
Sbjct: 418  PYRKLMERIQIEEEYR----YGVKRVVEHSNSMEGY-HQAYNGGHRRADYKPLGMCRNRS 472

Query: 219  Q-NHNGVP 199
            + N   VP
Sbjct: 473  EINARSVP 480


>ref|XP_004134299.1| PREDICTED: uncharacterized protein LOC101205072 [Cucumis sativus]
          Length = 468

 Score =  329 bits (844), Expect = 1e-87
 Identities = 195/475 (41%), Positives = 266/475 (56%), Gaps = 36/475 (7%)
 Frame = -1

Query: 1545 KLEETVIRILKAANLEIATELSVRKEAEKLLGVDLSDLASKRLVRRILESFLLSYSPXXX 1366
            ++EE VI +LK +++E  TE  VR + E+ LG+DLS+   K LVR ++ESFLLS S    
Sbjct: 8    RIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLLSMSERVC 67

Query: 1365 XXXXXXXXXXVQTGKFPVDDRRIA---CDDGGGRVICELPGMRRVSIKKFRGTKLVSIRE 1195
                      V+     V+ + +     +D G  +IC L   R V+I KF+G  +VS+R+
Sbjct: 68   MGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGAPMVSVRQ 127

Query: 1194 YYQKEGKVFPSGRGITLNPEQWSAFRSSFPDIEAAITKMQAGIRGEGMGKKQTDAEASNP 1015
            YY+K+GK  P+ +GI++  EQWS F+S+ P I  AI +M+   R E   +K      SNP
Sbjct: 128  YYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKI--GAFSNP 185

Query: 1014 STSLASEPRGLEKLQIEAGTSICPPDPLIPIATIRFTGRNYYCWKRQMEFFLKQLKVSYV 835
            +T + S                    P  PI TIRF G+NY  W  QME  L+ LK++YV
Sbjct: 186  TTRVTS--------------------PKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYV 225

Query: 834  LVESCPKIPVSSEASFEEISQSKSRAQKWMNDDYICRHSILNSLSDQLFDRFSVKTLNAR 655
            L   CP   +  E+S    +QSK+  QKWM DD++CR +ILNSLSD+LF+ +S KT++A 
Sbjct: 226  LSNQCPTAVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSAS 285

Query: 654  ELWDELKSLY-ADDFGTQRSHVNSYIQFQMVDGISILEQVQELHRIAGIITTSGIHIDEN 478
            ELW ELK LY  ++FGT+RS V  Y++F+MV+  SILEQV+EL+ IA  I +SG  IDE+
Sbjct: 286  ELWKELKLLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDED 345

Query: 477  FHVSVIISKLPPSWKHVRAKLMQEEYLPLDKLIYRLKDEEDSRSQREK--GGRNIGPKKR 304
            FHVS IISKLP SWK+V   LM E+YLPL KL  RL+ EE  R+Q+     G +  P  R
Sbjct: 346  FHVSAIISKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPR 405

Query: 303  DMRG------------------------------LCFGCHKEGHKRQDCPLTRSN 229
                                              LC  C KEGH   +CP  + N
Sbjct: 406  GQHHAANHPSKMGDPKPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVN 460


>ref|XP_002301412.1| predicted protein [Populus trichocarpa] gi|222843138|gb|EEE80685.1|
            predicted protein [Populus trichocarpa]
          Length = 459

 Score =  318 bits (815), Expect = 3e-84
 Identities = 194/468 (41%), Positives = 269/468 (57%), Gaps = 34/468 (7%)
 Frame = -1

Query: 1545 KLEETVIRILKAANLEIATELSVRKEAEKLLGVDLSDLASKRLVRRILESFLLSYSPXXX 1366
            K++ETVI ILK A+++  TE  VR  A + L  DLS +  K+ +R ++ESFLLS      
Sbjct: 2    KIQETVIDILKHASMDEITEFKVRATATERLDFDLSHIEHKKFIRGVIESFLLSTMDEEG 61

Query: 1365 XXXXXXXXXXVQTG-----KFPVDDRRIACDDGGGRVICELPGMRRVSIKKFRGTKLVSI 1201
                       +       +  +  + +  D  G RVIC+L   R V+I++F+G   VSI
Sbjct: 62   KEANGNVREDTKEALQEEHEEVLTKKEVGTD--GNRVICKLSERRSVTIQEFKGKSFVSI 119

Query: 1200 REYYQKEGKVFPSGRGITLNPEQWSAFRSSFPDIEAAITKMQAGIRGEGMGKKQTDAEAS 1021
            R++YQK+G + PS  GI L  EQW+A + + P IE AI KMQ+     G+  +Q + + S
Sbjct: 120  RDFYQKDGNLLPSKIGICLTSEQWTAIKQNVPAIEEAIAKMQSI---SGLDVEQ-NGQIS 175

Query: 1020 NPSTSLASEPRGLEKLQIEAGTSICPPDPLIPIATIRFTGRNYYCWKRQMEFFLKQLKVS 841
             P     S+   LE  +IE                 RF G+NY  W  QMEFFLKQLK+ 
Sbjct: 176  KPVADSISQELPLEISRIEVS---------------RFDGKNYQFWAPQMEFFLKQLKIV 220

Query: 840  YVLVESCPKIPVSSEASFEEISQSKSRAQKWMNDDYICRHSILNSLSDQLFDRFSVKTLN 661
            YVL    P I  S  AS EEI+Q+K+  QKW NDD++CR +ILNSLSD ++ +++ K   
Sbjct: 221  YVLTVPRPSIATSPPASAEEIAQAKATEQKWCNDDHLCRLNILNSLSDSIYYKYAKKIKT 280

Query: 660  ARELWDELKSLYA-DDFGTQRSHVNSYIQFQMVDGISILEQVQELHRIAGIITTSGIHID 484
            A+ELW++LK +Y  ++FGT+RS V  YI+FQMVD  SI +Q+QEL+ IA  I  +G+ ID
Sbjct: 281  AKELWEDLKLVYLYEEFGTKRSQVKKYIEFQMVDEKSIFDQLQELNGIADAIVAAGMFID 340

Query: 483  ENFHVSVIISKLPPSWKHVRAKLMQEEYLPLDKLIYRLKDEEDSRSQREKG--------- 331
            ENFHVS +ISKLPPSWK    KLM EEYLP   L+ R++ EE+SR+Q + G         
Sbjct: 341  ENFHVSTVISKLPPSWKDFCMKLMHEEYLPFWILMDRVRAEEESRNQDKLGEPSNHVHSH 400

Query: 330  -GRNIGPKKRDMR--GL----------------CFGCHKEGHKRQDCP 244
              + +GP+ RDM+  GL                C+ C K+GH  + CP
Sbjct: 401  HPKYLGPRIRDMKKPGLHWKRRDIEVDNNKSLTCYFCGKKGHISKHCP 448

