BLASTX nr result

ID: Scutellaria23_contig00021271 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00021271
         (1647 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002317274.1| predicted protein [Populus trichocarpa] gi|2...   412   e-112
ref|XP_003543103.1| PREDICTED: uncharacterized protein LOC100790...   408   e-111
ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252...   401   e-109
ref|XP_002518567.1| conserved hypothetical protein [Ricinus comm...   398   e-108
ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arab...   394   e-107

>ref|XP_002317274.1| predicted protein [Populus trichocarpa] gi|222860339|gb|EEE97886.1|
            predicted protein [Populus trichocarpa]
          Length = 473

 Score =  412 bits (1058), Expect = e-112
 Identities = 226/406 (55%), Positives = 277/406 (68%), Gaps = 23/406 (5%)
 Frame = +2

Query: 224  VLSFVXXXXXXXXXXXXXXLYIPILTIVAGLDLKMASSYSAFMVTGGSIANVGRQMFAKL 403
            VL F+              LYIPILTIVA LDLK ASS+SAFMVTGGS+ANV   MF + 
Sbjct: 70   VLCFIAASVSSAGGIGGGGLYIPILTIVASLDLKTASSFSAFMVTGGSVANVMCNMFTRS 129

Query: 404  RGGDGKPLIDYDIALLSEPCMLLGVSCGVICNLVLPEWLITMVFALFLAFCTFKTCRSGL 583
                G+ L+DYDIA+LSEPCMLLGVS GVICNLV PEWL+T++FA+FLA  TFKTC++G+
Sbjct: 130  AKFGGQTLVDYDIAILSEPCMLLGVSVGVICNLVFPEWLVTILFAVFLACSTFKTCQNGV 189

Query: 584  FFWKSESEV--RSENG-----------------EAGESTKMPLLGSEMVDGEGKLDIPLM 706
            F WK ESE   R+E+G                 E   S K PLLG E+      L  P M
Sbjct: 190  FHWKLESEEVNRNESGNLENGLVEYETSTKESEEVISSVKEPLLGVELTSSV--LRFPWM 247

Query: 707  KLGMLVLIWFSFFLLYLLRGNRYGQGIINIEPCRAGYWTISSIQIPLAIIFTAWILHCRK 886
            KLG+L +IWFSF +LYLLRGNRYG+GII +E C  GYW +SS+QIPLAI+FTAWIL+ ++
Sbjct: 248  KLGILFIIWFSFSILYLLRGNRYGEGIIPMESCGFGYWVVSSLQIPLAIMFTAWILYRKE 307

Query: 887  SSRNTAATKRES--GGETISW--SCGDRFLPXXXXXXXXXXXXXXXXXXXXISPLLLQIG 1054
            S ++    ++ S  G E ++   +      P                    ISPLLL +G
Sbjct: 308  SCQHQTINQQLSVKGMEDLTGGGTSNKLIFPVMALLAGMLGGVFGIGGGMLISPLLLHVG 367

Query: 1055 IHPEVTAATCSFMVLFSSTMSAIQYLLLGMEHVYGAVIYAAVCFAASLVGLTLVQRAIMK 1234
            I PE+TAATCSFMV FSS+MSA+QYLLLGMEHV  A+I + +CF ASL+GL +VQRAI+K
Sbjct: 368  IAPEITAATCSFMVFFSSSMSALQYLLLGMEHVDTAIILSVICFVASLLGLLVVQRAIVK 427

Query: 1235 HGRASLIVFSVGTVMALSTVLITSFGAVDVWKDYTSGNYMGFKKPC 1372
            +GRAS+IVFSV TVMALSTVL+TSFGA++VW+DY SG  MGFK PC
Sbjct: 428  YGRASMIVFSVSTVMALSTVLMTSFGALNVWRDYNSGRNMGFKLPC 473


>ref|XP_003543103.1| PREDICTED: uncharacterized protein LOC100790958 [Glycine max]
          Length = 466

 Score =  408 bits (1048), Expect = e-111
 Identities = 221/405 (54%), Positives = 266/405 (65%), Gaps = 22/405 (5%)
 Frame = +2

Query: 224  VLSFVXXXXXXXXXXXXXXLYIPILTIVAGLDLKMASSYSAFMVTGGSIANVGRQMFAKL 403
            VL F+              L++PIL+IVAGLDLK ASS SAFMVTGGSIANV   M    
Sbjct: 63   VLCFIASAISSAGGIGGGGLFVPILSIVAGLDLKTASSLSAFMVTGGSIANVMCNMCITS 122

Query: 404  RGGDGKPLIDYDIALLSEPCMLLGVSCGVICNLVLPEWLITMVFALFLAFCTFKTCRSGL 583
                GK LIDYDIAL SEPCMLLGVS GVICNLV PEWLIT++FA+FLA+ T KTC+SGL
Sbjct: 123  PKFGGKSLIDYDIALSSEPCMLLGVSLGVICNLVFPEWLITVLFAIFLAWSTSKTCKSGL 182

Query: 584  FFWKSESEVRSENGEAGESTKMPLLGSEMVD----------------------GEGKLDI 697
             FWK+ESEV  +NG   E  +  LL +E ++                      G  K+ I
Sbjct: 183  LFWKAESEVIRKNGLINEELEKGLLENETIEQRKVYIENNEPKSIEVSLLAPQGNSKVRI 242

Query: 698  PLMKLGMLVLIWFSFFLLYLLRGNRYGQGIINIEPCRAGYWTISSIQIPLAIIFTAWILH 877
            P  KL +L+LIWFSFF +YLLRGNRYG+GII +EPC  GYW +SS+Q+PLA++FTAWI+ 
Sbjct: 243  PWFKLAVLLLIWFSFFSVYLLRGNRYGEGIIPMEPCGVGYWILSSVQVPLAVVFTAWIVF 302

Query: 878  CRKSSRNTAATKRESGGETISWSCGDRFLPXXXXXXXXXXXXXXXXXXXXISPLLLQIGI 1057
             ++S R+     +  G  T          P                    ISPLLLQ+G+
Sbjct: 303  RKESLRDRTLIPKVPG-LTKKRPSNILVFPLMALLAGILGGVFGIGGGMLISPLLLQVGV 361

Query: 1058 HPEVTAATCSFMVLFSSTMSAIQYLLLGMEHVYGAVIYAAVCFAASLVGLTLVQRAIMKH 1237
             PEVTAATCSFMVLFS+TMS +QYLLLGMEHV  A++ A +CF ASL+GL +VQRAI K+
Sbjct: 362  TPEVTAATCSFMVLFSATMSGLQYLLLGMEHVQAALVLAIMCFVASLLGLLVVQRAIRKY 421

Query: 1238 GRASLIVFSVGTVMALSTVLITSFGAVDVWKDYTSGNYMGFKKPC 1372
            GRAS+IVFSV  VM +S VL+TSFGA+ VW DY SG YMGFK PC
Sbjct: 422  GRASIIVFSVSIVMFISNVLMTSFGAIKVWTDYESGEYMGFKLPC 466


>ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252710 [Vitis vinifera]
            gi|297741863|emb|CBI33227.3| unnamed protein product
            [Vitis vinifera]
          Length = 458

 Score =  401 bits (1030), Expect = e-109
 Identities = 232/469 (49%), Positives = 287/469 (61%), Gaps = 24/469 (5%)
 Frame = +2

Query: 38   HKFLDFIFLVILSTIFSVSHAQPSLSISEIFVS---FNSTHPWRSTPPXXXXXXXXXXXX 208
            H  +    L+ L  +FS+SHA+ S  IS++       N++H W +               
Sbjct: 4    HTLMRRAVLLPLFILFSLSHAEQSQPISDVPKMEELTNTSHQWSNLQKVFQEIQLKFSPP 63

Query: 209  XXXTAVLSFVXXXXXXXXXXXXXXLYIPILTIVAGLDLKMASSYSAFMVTGGSIANVGRQ 388
                AVL F+              L++PIL IV GLDLK AS++SAFMV GGS AN+   
Sbjct: 64   IVRAAVLCFIAASISSAGGIGGGGLFVPILAIVGGLDLKTASTFSAFMVAGGSTANILCT 123

Query: 389  MFAKLRGGDGKPLIDYDIALLSEPCMLLGVSCGVICNLVLPEWLITMVFALFLAFCTFKT 568
            MF     G GK +ID+DIALLSEPC+LLGVS GV+CN+V PEWLIT++F +FL++ T KT
Sbjct: 124  MFINCIHG-GKSVIDFDIALLSEPCLLLGVSIGVVCNIVFPEWLITILFVVFLSWTTSKT 182

Query: 569  CRSGLFFWKSESEVRSENG-----------------EAGESTKMPLLGSEMVDGEGKLDI 697
            CR G+  WK ESEV   NG                 E  +S K PL+G        K+ I
Sbjct: 183  CRKGVVSWKLESEVIRRNGFGELENGVRRDESNGENEVIKSLKEPLMGEVE---NFKISI 239

Query: 698  PLMKLGMLVLIWFSFFLLYLLRGNRYGQGIINIEPCRAGYWTISSIQIPLAIIFTAWILH 877
            P  K G LV+IW SFFLLY+LRG+R GQ II +EPC  GYW +SS+Q PLAI FTAWILH
Sbjct: 240  PWTKFGALVVIWLSFFLLYILRGDRDGQSIIPMEPCGEGYWILSSLQFPLAITFTAWILH 299

Query: 878  CRKSSRNTAATKRESGGETISWSCGDR----FLPXXXXXXXXXXXXXXXXXXXXISPLLL 1045
             R++S      ++E  G+T     G++      P                    ISPLLL
Sbjct: 300  RRETSN-----QQEILGQT-----GEKPPNLIFPIMALLAGILGGVFGIGGGMLISPLLL 349

Query: 1046 QIGIHPEVTAATCSFMVLFSSTMSAIQYLLLGMEHVYGAVIYAAVCFAASLVGLTLVQRA 1225
             IGI PEVTAATCS MV FSSTMS+ QYLL+GMEH   A+I+A +CF AS++G+ +VQRA
Sbjct: 350  HIGIPPEVTAATCSVMVFFSSTMSSFQYLLIGMEHKEVALIFAIICFFASILGVVVVQRA 409

Query: 1226 IMKHGRASLIVFSVGTVMALSTVLITSFGAVDVWKDYTSGNYMGFKKPC 1372
            I K+GRASLIVFSV TVMALSTVLITSFGA+DVW+DY  G YMGFK PC
Sbjct: 410  IEKYGRASLIVFSVSTVMALSTVLITSFGAIDVWRDYARGEYMGFKLPC 458


>ref|XP_002518567.1| conserved hypothetical protein [Ricinus communis]
            gi|223542412|gb|EEF43954.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 463

 Score =  398 bits (1022), Expect = e-108
 Identities = 228/462 (49%), Positives = 278/462 (60%), Gaps = 20/462 (4%)
 Frame = +2

Query: 47   LDFIFLVILSTIFSVSHAQPSLSISEIFVS---FNSTHPWRSTPPXXXXXXXXXXXXXXX 217
            L F   + +   F  S+A+ + SI+    +    N T  WR+                  
Sbjct: 7    LAFTLSLTVLISFYRSNAEQTQSIARFLETDQYLNETGHWRNYLIQSQEPMLKLASPMVL 66

Query: 218  TAVLSFVXXXXXXXXXXXXXXLYIPILTIVAGLDLKMASSYSAFMVTGGSIANVGRQMFA 397
            + VL F+              L++PILTIVAGLDLK ASS+SAFMVTGGSIANV   +F+
Sbjct: 67   SGVLCFIAASISSAGGIGGGGLFVPILTIVAGLDLKTASSFSAFMVTGGSIANVLCNLFS 126

Query: 398  KLRGGDGKPLIDYDIALLSEPCMLLGVSCGVICNLVLPEWLITMVFALFLAFCTFKTCRS 577
               GG  K LIDYDIALLSEPCMLLGVS GVICNL+ PEWLIT++F LFL + TFKTC++
Sbjct: 127  PKFGG--KALIDYDIALLSEPCMLLGVSVGVICNLIFPEWLITVLFVLFLVWSTFKTCKN 184

Query: 578  GLFFWKSESEVRSENGEAG-----------------ESTKMPLLGSEMVDGEGKLDIPLM 706
             +  W  ESE    NG                    +  K PL+G EM   E ++     
Sbjct: 185  AVAHWNLESEEVKRNGHGNLENGRVKDRSSIGNEEIKIIKEPLMGIEM---ENRMSFTWE 241

Query: 707  KLGMLVLIWFSFFLLYLLRGNRYGQGIINIEPCRAGYWTISSIQIPLAIIFTAWILHCRK 886
            KLG+LVLIW SF  LYLLRGNRYG+GI  ++PC  GYW +SS+QIPLAIIFTAWIL  ++
Sbjct: 242  KLGVLVLIWLSFSFLYLLRGNRYGEGIAPLKPCGVGYWVVSSLQIPLAIIFTAWILLKKR 301

Query: 887  SSRNTAATKRESGGETISWSCGDRFLPXXXXXXXXXXXXXXXXXXXXISPLLLQIGIHPE 1066
              +N  A  ++        +      P                    ISPLLL +GI PE
Sbjct: 302  HYQNQTANLQDIDDSMEGRAPNKLTFPIMALLAGILGGVFGIGGGMLISPLLLHVGIPPE 361

Query: 1067 VTAATCSFMVLFSSTMSAIQYLLLGMEHVYGAVIYAAVCFAASLVGLTLVQRAIMKHGRA 1246
            VTAATCSFMV FSSTMSA QYLL GMEH   A+++A++CF ASLVGL +VQR I  +GRA
Sbjct: 362  VTAATCSFMVFFSSTMSAFQYLLSGMEHTDTALMFASICFVASLVGLLVVQRIIQDYGRA 421

Query: 1247 SLIVFSVGTVMALSTVLITSFGAVDVWKDYTSGNYMGFKKPC 1372
            S+IVFSV  VMALSTVLITSFG +DVW++Y SG  MGFK PC
Sbjct: 422  SIIVFSVSIVMALSTVLITSFGTIDVWRNYESGTNMGFKLPC 463


>ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
            lyrata] gi|297332353|gb|EFH62771.1| hypothetical protein
            ARALYDRAFT_475155 [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  394 bits (1012), Expect = e-107
 Identities = 227/462 (49%), Positives = 281/462 (60%), Gaps = 15/462 (3%)
 Frame = +2

Query: 32   MKHKFLDFIFLVILSTIF---SVSHAQPSLSISEIFVSFNSTHPWRSTPPXXXXXXXXXX 202
            M++ F+  +F  I   IF   S++  +PS+ +S +    N T  +               
Sbjct: 1    MRNNFVPIVFSFITFIIFLTPSIAEQEPSI-LSPVDHFLNKTSSYLKFSTKFNQPKIELT 59

Query: 203  XXXXXTAVLSFVXXXXXXXXXXXXXXLYIPILTIVAGLDLKMASSYSAFMVTGGSIANVG 382
                   +LSF+              LY+PI+TIVAGLDLK ASS+SAFMVTGGSIANVG
Sbjct: 60   TSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGGSIANVG 119

Query: 383  RQMFAKLRGGDGKPLIDYDIALLSEPCMLLGVSCGVICNLVLPEWLITMVFALFLAFCTF 562
              +F +     GK LID+D+ALL EPCMLLGVS GVICNLV P WLIT +FA+FLA+ T 
Sbjct: 120  CNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAVFLAWSTL 179

Query: 563  KTCRSGLFFWKSESE---VRSEN--GEAGESTKMPLLGSEMV-DGEGKLDIPLMKLGMLV 724
            KT  +GL++W+ ESE   +R  N  GE  E  K+  L   ++ D E     P +KLG+LV
Sbjct: 180  KTFGNGLYYWRLESEMVKIRESNRIGEDDEEDKIESLKLPLLEDYERPKRFPWIKLGVLV 239

Query: 725  LIWFSFFLLYLLRGNRYGQGIINIEPCRAGYWTISSIQIPLAIIFTAWILHC------RK 886
            +IW S+F +YLLRGN+YG+GII+IEPC   YW ISS QIPL + FT WI         + 
Sbjct: 240  IIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQP 299

Query: 887  SSRNTAATKRESGGETISWSCGDRFLPXXXXXXXXXXXXXXXXXXXXISPLLLQIGIHPE 1066
            S  N +    E               P                    ISPLLLQ+GI PE
Sbjct: 300  SDYNVSIKDVEDLRSNDGARSNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPE 359

Query: 1067 VTAATCSFMVLFSSTMSAIQYLLLGMEHVYGAVIYAAVCFAASLVGLTLVQRAIMKHGRA 1246
            VTAATCSFMVLFSSTMSAIQYLLLGMEH   A I+A +CF ASLVGL +VQ+ I ++GRA
Sbjct: 360  VTAATCSFMVLFSSTMSAIQYLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRA 419

Query: 1247 SLIVFSVGTVMALSTVLITSFGAVDVWKDYTSGNYMGFKKPC 1372
            S+IVFSV  VMALS VL+TS+GA+DVW DY +G YMGFK PC
Sbjct: 420  SIIVFSVCIVMALSIVLMTSYGALDVWNDYVAGRYMGFKLPC 461

