BLASTX nr result

ID: Atractylodes21_contig00013787 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00013787
         (1602 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278478.1| PREDICTED: uncharacterized protein LOC100255...   342   2e-91
ref|XP_002308026.1| predicted protein [Populus trichocarpa] gi|2...   334   4e-89
ref|XP_002529503.1| conserved hypothetical protein [Ricinus comm...   307   4e-81
ref|XP_003518124.1| PREDICTED: uncharacterized protein LOC100781...   306   1e-80
ref|NP_191299.1| uncharacterized protein [Arabidopsis thaliana] ...   305   3e-80

>ref|XP_002278478.1| PREDICTED: uncharacterized protein LOC100255467 [Vitis vinifera]
          Length = 485

 Score =  342 bits (877), Expect = 2e-91
 Identities = 217/472 (45%), Positives = 291/472 (61%), Gaps = 24/472 (5%)
 Frame = -1

Query: 1431 FFLVFFPQEHH--HQQQGLINPSSPFKSPTINAFLRRTNSSHILAKAQSTISICAXXXXX 1258
            F LVFFP++    ++++ L + SS   S ++ +   R +SS+I  +AQSTISICA     
Sbjct: 18   FLLVFFPEDDPIINKKKNLFSSSSSSSSSSLKS-TTRIHSSNIFTRAQSTISICALIVFI 76

Query: 1257 XXXXXXXXXFEPNNNFKSS------RRHLSQTHPTFHFKP-KSNSPALQRLGTLYSRGTK 1099
                     FEP     SS      RR LS+          KS+S ALQ +G LY RGT+
Sbjct: 77   TLLLFTLSTFEPTTIAASSPHSIASRRWLSEKSTASKSNSLKSSSFALQGMGCLYRRGTR 136

Query: 1098 PMNDLLLCHVSDSVTATELKTFLRAFHRSGLLAKSDLLFIFPSITTPESSDNVIQEENQL 919
             MNDL++ HV++ VT  + + FLRA +RS + AK+D++ IF S +       V+QEE + 
Sbjct: 137  AMNDLVVGHVTEDVTQDQFRMFLRALYRSSISAKTDVVLIFSSSSASSELGPVVQEETES 196

Query: 918  FLKLIRRYKAELDNGSNFSNFPASFDVTQFVXXXXXXXXXXXPIWGRKIKSGF-DXXXXX 742
            F +L+RRY  EL++ S  S    SFDVTQF+           P+WGR+I+S + D     
Sbjct: 197  FSRLVRRY-GELNSTSEESGV-TSFDVTQFLKFGKKEKERREPLWGRRIRSNYSDPNSEE 254

Query: 741  XXGTEFTRMSHGSVVGFGVGELDPENSLSGFLDHVPISLRRWASYPMILGRLRRNFKHVM 562
                E TR+ +GSVVGF   ELDPENSL+GFLDHVP+SLRRWA YPMILGR+RRNFKH+M
Sbjct: 255  GEEAEPTRLRYGSVVGFEAAELDPENSLAGFLDHVPMSLRRWACYPMILGRVRRNFKHMM 314

Query: 561  LVDVKEILLLGDPLSRVRNRSPESVFLSSTPPP-TKHNRKTTDK--NHQKAINPAVIMGG 391
            LVDVK  +LLGDPL RVR++S ESVFL + P   T   RK +DK  NHQK +N  +IMGG
Sbjct: 315  LVDVKSFVLLGDPLVRVRSKSTESVFLWTNPETLTPKGRKNSDKTQNHQKLVNSEIIMGG 374

Query: 390  VRGVRRLSAAMLTEIVRATTHKK-KTPVTESDLLSRLVTNEFVQKSIRLVXXXXXXXXXX 214
             RGVRRLS AML EIVRA   +K K+P +ES LL++LV+N  + K + L           
Sbjct: 375  TRGVRRLSNAMLIEIVRAAMQRKGKSPFSESVLLNQLVSNGVLSKGVDLTISNESVPDSN 434

Query: 213  XXSGVSIANHT--------AVMRRGNSNIDID--VMKHICSFSIESSFYREC 88
              +GV+ +N T        +V++RGNSN D++  +MK ICS +++SS Y +C
Sbjct: 435  SLAGVN-SNSTSSLSLSKYSVVQRGNSNRDLNQIIMKDICSSTLDSSVYSDC 485


>ref|XP_002308026.1| predicted protein [Populus trichocarpa] gi|222854002|gb|EEE91549.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  334 bits (856), Expect = 4e-89
 Identities = 214/490 (43%), Positives = 283/490 (57%), Gaps = 42/490 (8%)
 Frame = -1

Query: 1431 FFLVFFPQEHHHQQQG-LINPSSPFKS-------------PTINA--FLRRTNSSH-ILA 1303
            F LVFFP++           P+ PF S             P  N+   +RRTNS++ I+ 
Sbjct: 4    FLLVFFPEDTSSSSSSPTATPTVPFSSSSSSPTPPKTTSSPNCNSSKIIRRTNSNNPIIT 63

Query: 1302 KAQSTISICAXXXXXXXXXXXXXXFEPNNNF--------KSSRRHLSQTHPTFHFKPKSN 1147
            K QSTISICA              FEP   +        K+ RR LSQ         K+ 
Sbjct: 64   KTQSTISICALLLLLTLLLFTLSTFEPTIPYPSTTISINKTPRRFLSQKPQNKLKTAKAR 123

Query: 1146 SP------ALQRLGTLYSRGTKPMNDLLLCHVSDSVTATELKTFLRAFHRSGLLAKSDLL 985
            S       ALQ +G LY RGT+ M+DL++ HV +     E + FLR  HRSGL A++D++
Sbjct: 124  SEKFRSLFALQGMGKLYRRGTRAMSDLVVAHVVEETNEAEFRLFLRVLHRSGLTARADVV 183

Query: 984  FIFPSITTPESSDNVIQEENQLFLKLIRRYKAELDNGSNFSNFPASFDVTQFVXXXXXXX 805
            F+FPS       +++IQEEN  FLKL+  YK +L+  S+ S   +SFDV+QF+       
Sbjct: 184  FVFPSSLFASRFESLIQEENDSFLKLVNYYK-KLNGTSHDSVSASSFDVSQFLKSEKKQV 242

Query: 804  XXXXPIWGRKIKSGFDXXXXXXXGTE--FTRMSHGSVVGFGVGELDPENSLSGFLDHVPI 631
                 +WG++I+   D         E   T   +GSVVGF   ELDPENSL+GFLDH+P+
Sbjct: 243  GEP--LWGKRIRVNGDGNFSESGEGEGELTWFRYGSVVGFEASELDPENSLAGFLDHLPM 300

Query: 630  SLRRWASYPMILGRLRRNFKHVMLVDVKEILLLGDPLSRVRNRSPESVFLSSTPP--PTK 457
            SLRRWA YPM+LGR+RRNFKHVMLVDVK ++L  DPL RVRNRSPESV++ +      +K
Sbjct: 301  SLRRWACYPMLLGRVRRNFKHVMLVDVKNLVLFSDPLGRVRNRSPESVYIRTKQESGSSK 360

Query: 456  HNRKTTDK-NHQKAINPAVIMGGVRGVRRLSAAMLTEIVR-ATTHKKKTPVTESDLLSRL 283
            HNRK ++K      +N A++MGG RG+RRLS AMLTEI R A  HKKK+ VTES +LS+L
Sbjct: 361  HNRKISEKAQSHSQVNSAILMGGARGIRRLSIAMLTEIARVAMQHKKKSSVTESGILSQL 420

Query: 282  VTNEFVQKSIRLVXXXXXXXXXXXXSGVSIA---NHTAVMRRGNSNIDID--VMKHICSF 118
            V N  V K+I L+            +G + +   N++ + R GNSN DI+  +MK ICS 
Sbjct: 421  VGNVHVLKNIDLITSTESIPGMSSLTGSNSSLWNNYSIIQRGGNSNHDINSIIMKQICSR 480

Query: 117  SIESSFYREC 88
              ESS YR+C
Sbjct: 481  EAESSAYRDC 490


>ref|XP_002529503.1| conserved hypothetical protein [Ricinus communis]
            gi|223531019|gb|EEF32872.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  307 bits (787), Expect = 4e-81
 Identities = 206/499 (41%), Positives = 281/499 (56%), Gaps = 51/499 (10%)
 Frame = -1

Query: 1431 FFLVFFPQEHHHQQQGLINP------SSPFKSPTINAF--LRRTNSSH-ILAKAQSTISI 1279
            F LVF P+++        +       +SP  S + N+   ++RTNS++ IL K QSTISI
Sbjct: 20   FLLVFLPEDNSATATTAADAIVDKANTSPSPSSSYNSTKTIKRTNSNNPILTKTQSTISI 79

Query: 1278 CAXXXXXXXXXXXXXXFEPN-----NNFKSSRRHLSQTHPTFHFK------PKSN----- 1147
            CA              FEP+     N  K+ RR L +  P+   K       KSN     
Sbjct: 80   CALLLFLSLLLFTLSTFEPSIPNSANTLKTPRRLLPEKFPSNPIKIHQPIMTKSNFSWFS 139

Query: 1146 ----------------SPALQRLGTLYSRGTKPMNDLLLCHVSDSVTATELKTFLRAFHR 1015
                            S ALQ +G LY RGTK MNDL++ HV++     E + FLR  HR
Sbjct: 140  KIWLSRNGNKRSELLSSFALQGMGKLYLRGTKAMNDLVVGHVAEDTNEDEFRVFLRLLHR 199

Query: 1014 SGLLAKSDLLFIFPSITTPESSDNVIQEENQLFLKLIRRYKAELDNGSNFSNFPAS---F 844
            SG+ AK+DL+FIF S       + +I+EEN  FLKL++ YK       +  +  AS   F
Sbjct: 200  SGVTAKADLVFIFSSSLLVSRFEGLIREENDSFLKLVQYYKQLNSTSPDPVSVAASGLRF 259

Query: 843  DVTQFVXXXXXXXXXXXPIWGRKIKSGFDXXXXXXXGTEFTRMSHGSVVGFGVGELDPEN 664
            DVTQFV            +WG++I+   +        +E TR+S+GSVVGF   ELDPEN
Sbjct: 260  DVTQFVKHGKKEMAEP--LWGKRIRVN-NYNESEETESESTRLSYGSVVGFETSELDPEN 316

Query: 663  SLSGFLDHVPISLRRWASYPMILGRLRRNFKHVMLVDVKEILLLGDPLSRVRNRSPESVF 484
            SL+GFLDHVP+SL+RWA YPM+LGR+RRNFKHVMLVDVK+++LL DPL RVRNRSPESV 
Sbjct: 317  SLNGFLDHVPMSLKRWACYPMLLGRVRRNFKHVMLVDVKKLVLLSDPLGRVRNRSPESVH 376

Query: 483  LSS---TPPPTKHNRKTTDKNHQKA-INPAVIMGGVRGVRRLSAAMLTEIVRAT-THKKK 319
            +S+   +   TKH ++ +DK    + +N  ++MGG+RG+RRLS+AMLTEIVRA   +KKK
Sbjct: 377  ISTKLESSSSTKHVKRNSDKTQSHSQVNSGILMGGIRGIRRLSSAMLTEIVRAAMQNKKK 436

Query: 318  TPVTESDLLSRLVTNEFVQKSIRLVXXXXXXXXXXXXSGVSIA--NHTAVMRRGNSNIDI 145
              VTES +LS+LV+N  + K+I L+            +  + A  +H  +++RGN     
Sbjct: 437  ISVTESKILSQLVSNGHILKNIDLIMAAESIPEMSSLTEANSARWDHHKIIQRGN----- 491

Query: 144  DVMKHICSFSIESSFYREC 88
                       +SS YR+C
Sbjct: 492  -----------DSSVYRDC 499


>ref|XP_003518124.1| PREDICTED: uncharacterized protein LOC100781101 [Glycine max]
          Length = 482

 Score =  306 bits (784), Expect = 1e-80
 Identities = 190/489 (38%), Positives = 276/489 (56%), Gaps = 18/489 (3%)
 Frame = -1

Query: 1509 MGSTAKLSKRTTAXXXXXXXXXXXXGFFLVFFPQEHHHQQQGLIN----PSSPFKSPTIN 1342
            MGS  K     T             GF LVFFP+E+++    +      P+    SP+ +
Sbjct: 1    MGSFPKAKPNNTHTDTNTNTHNRVMGFLLVFFPEENNNTTPTIATKSTKPNLVSSSPSPH 60

Query: 1341 AFLRRTNSSH--ILAKAQSTISICAXXXXXXXXXXXXXXFEPNNNFKSS-------RRHL 1189
            +  R ++SS+  +L+KAQSTISIC               FEP ++   S       R++ 
Sbjct: 61   SLKRISSSSNNALLSKAQSTISICFLLLFTTLLLFTLSTFEPTHHKPRSIPPKSKLRKNT 120

Query: 1188 SQTHPTFHFKPKSNSPALQRLGTLYSRGTKPMNDLLLCHVSDSVTATELKTFLRAFHRSG 1009
            S +   F       S ALQR+GTLY RGT+ MND+++CHV +  T  E + FLR  HRSG
Sbjct: 121  SSSDVAFP------STALQRMGTLYRRGTRAMNDIVVCHVPEDTTHDEFRIFLRLLHRSG 174

Query: 1008 LLAKSDLLFIFPSITTPESSDNVIQEENQLFLKLIRRYKAELDNGSNFSNFPASFDVTQF 829
            L +KSD++FIF S ++  +  +++ EEN  FL LI  + A+L++        ++FD T+F
Sbjct: 175  LTSKSDVVFIFVSASSSTTFAHIVHEENTSFLSLINLH-AQLNSTQWAKPSESNFDATRF 233

Query: 828  VXXXXXXXXXXXPIWGRKIKSGFDXXXXXXXGTEFTRMSHGSVVGFGVGELDPENSLSGF 649
            +            +WG+KI++ +          E +R S+GSV+ F   ELDPENSL+GF
Sbjct: 234  LKAPKKGEP----LWGKKIRTNYSGSEG-----ELSRASYGSVLSFDANELDPENSLAGF 284

Query: 648  LDHVPISLRRWASYPMILGRLRRNFKHVMLVDVKEILLLGDPLSRVRNRSPESVFLSSTP 469
            LD VP+SLRRWA YPM+LGR+RRNFKHV LVDVK +++  DPL RVRN+SPESVF+    
Sbjct: 285  LDRVPLSLRRWACYPMLLGRVRRNFKHVTLVDVKNVVIFNDPLGRVRNQSPESVFVY--- 341

Query: 468  PPTKHNRKTTDKNHQKAINPAVIMGGVRGVRRLSAAMLTEIVRAT--THKKKTPVTESDL 295
            P  KH R +        +N  ++MGG RG+RR+S AML  IVRA    HK+K  V++S +
Sbjct: 342  PNAKHGRNSERTQSHHPVNSVILMGGARGIRRVSHAMLVAIVRAAMQPHKRKNSVSDSAI 401

Query: 294  LSRLVTNEFVQKSIRLVXXXXXXXXXXXXSGVSIANHTAVMRRGNSN---IDIDVMKHIC 124
            LS+LV N+F  +++ L+            +G +  +  A+++RG  N   ++  V K IC
Sbjct: 402  LSQLVRNKFALRNVHLIVSGESIPEASSLAGSTSFSDCAIIQRGTGNYYDLNSIVKKQIC 461

Query: 123  SFSIESSFY 97
            S  ++S  Y
Sbjct: 462  SSVMDSFVY 470


>ref|NP_191299.1| uncharacterized protein [Arabidopsis thaliana]
            gi|6735319|emb|CAB68146.1| putative protein [Arabidopsis
            thaliana] gi|332646130|gb|AEE79651.1| uncharacterized
            protein [Arabidopsis thaliana]
          Length = 474

 Score =  305 bits (780), Expect = 3e-80
 Identities = 200/474 (42%), Positives = 268/474 (56%), Gaps = 28/474 (5%)
 Frame = -1

Query: 1425 LVFFPQEHHHQQQGLINPSSPFKSPTINAFLRRTNSSHILAKAQSTISICAXXXXXXXXX 1246
            LVFFP  H++      +PSS   SP      R  +S  +L+KAQSTISIC          
Sbjct: 20   LVFFPDHHNNNDD---SPSSSSSSPATTTLFRSRSSRLLLSKAQSTISICILLLFLTLFL 76

Query: 1245 XXXXXFEPNNNF---KSSRRHLS-----QTHPTFHFKPKSNSPALQRLGTLYSRGTKPMN 1090
                 FEP++ F    SSR H           +   + + N  ALQ +GTL+ RGTK M+
Sbjct: 77   FTLSTFEPSSGFPAVSSSRPHRRFLLNRDISASSESRRRYNRFALQGMGTLFLRGTKSMH 136

Query: 1089 DLLLCHVSDSVTATELKTFLRAFHRSGLLAKSDLLFIFPSITTPESSDNVIQEENQLFLK 910
            DL++ H+S   T  +L+ F+R  HRSG+ +KSD++ +F S T       +I+EEN  FLK
Sbjct: 137  DLIVVHISSDTTEDDLRLFMRLIHRSGVTSKSDVVLLFNSGTR---FTEMIEEENDSFLK 193

Query: 909  LIRRYKAELDNGSNFSNFPASFDVTQFVXXXXXXXXXXXPIWGRKI-KSGFDXXXXXXXG 733
            L+  ++    N SN  +    F++T+F+            IWG+K  ++ ++        
Sbjct: 194  LVDVHR----NSSNQIDSVWGFNLTKFMKKQSKSSSSEP-IWGKKTHRANYNDTSSLNNS 248

Query: 732  TEFTRM-SHGSVVGFGVGELDPENSLSGFLDHVPISLRRWASYPMILGRLRRNFKHVMLV 556
            TE T + +HGSVVGF V ELDPENSLSGF+DHVPISLRRWA YPM+LGR+RRNFKHVMLV
Sbjct: 249  TESTELLTHGSVVGFDVTELDPENSLSGFMDHVPISLRRWACYPMLLGRVRRNFKHVMLV 308

Query: 555  DVKEILLLGDPLSRVRNRSPESVFLSSTPPPTKHNRKTTDKNHQKAINPAVIMGGVRGVR 376
            D K  L LGDPL+R+RNRS ESV   S     KH   ++       +NPA+++GG +G+R
Sbjct: 309  DAKTSLFLGDPLTRIRNRSLESVLFFS-----KH---SSSSKKSSEVNPAILIGGAKGIR 360

Query: 375  RLSAAMLTEIVRAT---THKKKTPVTESDLLSRLVTNEFVQKSIRLVXXXXXXXXXXXXS 205
            RLS++M TEIVRAT    HKKK  VTES +LS+LV N  + K+  +V            +
Sbjct: 361  RLSSSMHTEIVRATIQQQHKKKNSVTESVVLSQLVGNVHMTKNFEVVTSESVVPEASSLA 420

Query: 204  GV--------SIANHTAVMRRG---NSNIDID----VMKHICSFSIESSFYREC 88
             +        SI NH  + R G   NSN  ID    +MK ICS  ++SS Y  C
Sbjct: 421  ELRTRNSAASSIKNHDIIQRGGGNSNSNHIIDIMAIIMKRICSCELDSSVYNYC 474


Top