BLASTX nr result

ID: Lithospermum22_contig00008068 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00008068
         (1653 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002324349.1| predicted protein [Populus trichocarpa] gi|2...   432   e-118
dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein ...   429   e-118
dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (...   427   e-117
ref|NP_196638.2| aspartyl protease family protein [Arabidopsis t...   410   e-112
ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2...   407   e-111

>ref|XP_002324349.1| predicted protein [Populus trichocarpa] gi|222865783|gb|EEF02914.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  432 bits (1111), Expect = e-118
 Identities = 248/488 (50%), Positives = 327/488 (67%), Gaps = 10/488 (2%)
 Frame = +3

Query: 48   FFLLCWL-KPQGHYALNQNMNHQPPLYHTISVSSLISTNDKSSCNNIVPTSTSNGHSKRK 224
            F  LC L   +  YAL      +    H+I VSSL+ +   +SC    P++    ++  K
Sbjct: 20   FLCLCLLFSLEKGYALEGRKVAESHHSHSIEVSSLLPS---ASCK---PSTKVLSNNDNK 73

Query: 225  ASLRVAHKYGACSSSPQGGNKAETNANLAH-EILSHDQARVESIKARLE--KFNSNKDLS 395
            ASL+V HK+G CS   Q     E +A   H EIL  DQ+RV+SI +RL   K +  KD+ 
Sbjct: 74   ASLKVVHKHGPCSKLSQD----EASAAPTHTEILLQDQSRVKSIHSRLSNSKTSGGKDVK 129

Query: 396  DTKTTTLPAHRGTDLRTLNYVVEVGLGTPAKQLSLVFDTGSDITWTQCQPCAKSCYQQQQ 575
             T +TT+PA  G+ + + NY+V VGLGTP K LSL+FDTGSDITWTQCQPCA+SCY+Q++
Sbjct: 130  VTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKE 189

Query: 576  PIFDPSKSTSFSNIXXXXXXXXXXXXATGNSPRCANNSTCVYEINYGDNSFTVGIFGKEK 755
             IFDPS+STS++NI            ATGN+P CA+ S CVY I YGD+SF+VG FG EK
Sbjct: 190  QIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCAS-SACVYGIQYGDSSFSVGFFGTEK 248

Query: 756  LTLSGGELLENIPFGCGQNNVGLFGATAGLIGLGRDPLSIVSQTAQKYGKVFSYCLPTTK 935
            LTL+  +   NI FGCGQNN GLFG +AGL+GLGRD LS+VSQTAQKY K+FSYCLP++ 
Sbjct: 249  LTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSS- 307

Query: 936  STSGGYLSFGRSGLNANLQYTQLST--SDDPYYIIQMTAITVGGSPVPISATDLKSDGES 1109
            S+S G+L+FG S  + N ++T LST  +   +Y +  T I+VGG  + ISA+   + G +
Sbjct: 308  SSSTGFLTFGGSA-SKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAG-A 365

Query: 1110 IIDSGTVITRLPASIYKPMRDAFKKQMARYKMAQPISIYDTCYDFSNEKEINVPIISFTF 1289
            IIDSGTVITRLP + Y  +R +F+  M++Y M + +SI DTCYDFS+   I+VP I F+F
Sbjct: 366  IIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSF 425

Query: 1290 GGSNVKVDIPSDGVFYQVNGGVSQVCLAFAEDS----VNIFGNSQQQTLEVVYDVAGGKI 1457
              S ++VDI + G+ Y     +SQVCLAFA +S    V IFGN QQ+TLEV YD + GK+
Sbjct: 426  -SSGIEVDIDATGILYA--SSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKV 482

Query: 1458 GFAPNGCT 1481
            GFAP GC+
Sbjct: 483  GFAPGGCS 490


>dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  429 bits (1104), Expect = e-118
 Identities = 242/478 (50%), Positives = 312/478 (65%), Gaps = 25/478 (5%)
 Frame = +3

Query: 123  YHTISVSSLISTNDKSSCNNIVPTSTSNGHSKRKASLRVAHKYGACSSSPQGGNKAETNA 302
            +HT+ +SSL+ +   SSCN       +    +R ASL V ++ G C+   Q G KA T  
Sbjct: 45   FHTLQLSSLLPS---SSCN------PATKGKRRGASLEVVNRQGPCTLLNQKGAKAPTLT 95

Query: 303  NLAHEILSHDQARVESIKARL-------------EKFNSNKDLSDTKTTTLPAHRGTDLR 443
                EIL+HDQARV+SI+AR+             +  N  K + D+K   LPA  G  L 
Sbjct: 96   ----EILAHDQARVDSIQARITDQSYDLFKKKDKKSSNKKKSVKDSKAN-LPAQSGLPLG 150

Query: 444  TLNYVVEVGLGTPAKQLSLVFDTGSDITWTQCQPCAKSCYQQQQPIFDPSKSTSFSNIXX 623
            T NY+V VGLGTP K LSL+FDTGSD+TWTQCQPC KSCY QQQPIFDPS S ++SNI  
Sbjct: 151  TGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISC 210

Query: 624  XXXXXXXXXXATGNSPRCANNSTCVYEINYGDNSFTVGIFGKEKLTLSGGELLENIPFGC 803
                      ATGNSP C++ S CVY I YGD+SFT+G F K+KLTL+  ++ +   FGC
Sbjct: 211  TSAACSSLKSATGNSPGCSS-SNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDGFMFGC 269

Query: 804  GQNNVGLFGATAGLIGLGRDPLSIVSQTAQKYGKVFSYCLPTTKSTSGGYLSFGR-SGLN 980
            GQNN GLFG TAGLIGLGRDPLSIV QTAQK+GK FSYCLPT++  S G+L+FG  +G+ 
Sbjct: 270  GQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG-SNGHLTFGNGNGVK 328

Query: 981  AN------LQYTQLSTSD-DPYYIIQMTAITVGGSPVPISATDLKSDGESIIDSGTVITR 1139
            A+      + +T  ++S    YY I +  I+VGG  + IS    ++ G +IIDSGTVITR
Sbjct: 329  ASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAG-TIIDSGTVITR 387

Query: 1140 LPASIYKPMRDAFKKQMARYKMAQPISIYDTCYDFSNEKEINVPIISFTFGGSNVKVDIP 1319
            LP++ Y  ++ AFK+ M++Y  A  +S+ DTCYD SN   I++P ISF F G N  V++ 
Sbjct: 388  LPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNG-NANVELD 446

Query: 1320 SDGVFYQVNGGVSQVCLAFA----EDSVNIFGNSQQQTLEVVYDVAGGKIGFAPNGCT 1481
             +G+   +  G SQVCLAFA    +DS+ IFGN QQQTLEVVYDVAGG++GF   GC+
Sbjct: 447  PNGIL--ITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
            sylvestris]
          Length = 502

 Score =  427 bits (1098), Expect = e-117
 Identities = 245/514 (47%), Positives = 324/514 (63%), Gaps = 25/514 (4%)
 Frame = +3

Query: 15   YSNFTSIVVTLFFLLCWLKPQGHYALNQNMNHQPPLYHTISVSSLISTNDKSSCNNIVPT 194
            +S+FT +++ L F +     +  +AL      +   +HT+ ++SL+ +   SSCN     
Sbjct: 15   FSSFTFLLILLSFPV-----EKSHALEAKETIESH-FHTLQLTSLLPS---SSCN----- 60

Query: 195  STSNGHSKRKASLRVAHKYGACSSSPQGGNKAETNANLAHEILSHDQARVESIKARL--- 365
             T+    +R ASL V ++ G C+   Q G KA T      EIL+HDQARV+SI+AR+   
Sbjct: 61   -TATKGKRRGASLEVVNRQGPCTQLNQKGAKAPTLT----EILAHDQARVDSIQARVTDQ 115

Query: 366  ----------EKFNSNKDLSDTKTTTLPAHRGTDLRTLNYVVEVGLGTPAKQLSLVFDTG 515
                      +  N  K + D+K   LPA  G  L T NY+V VGLGTP K LSL+FDTG
Sbjct: 116  SYDLFKKKDKKSSNKKKSVKDSKAN-LPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTG 174

Query: 516  SDITWTQCQPCAKSCYQQQQPIFDPSKSTSFSNIXXXXXXXXXXXXATGNSPRCANNSTC 695
            SD+TWTQCQPC KSCY QQQPIFDPS S ++SNI            ATGNSP C++ S C
Sbjct: 175  SDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSS-SNC 233

Query: 696  VYEINYGDNSFTVGIFGKEKLTLSGGELLENIPFGCGQNNVGLFGATAGLIGLGRDPLSI 875
            VY I YGD+SFTVG F K+ LTL+  ++ +   FGCGQNN GLFG TAGLIGLGRDPLSI
Sbjct: 234  VYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSI 293

Query: 876  VSQTAQKYGKVFSYCLPTTKSTSGGYLSFGR-------SGLNANLQYTQLSTSDD-PYYI 1031
            V QTAQK+GK FSYCLPT++  S G+L+FG          +   + +T  ++S    +Y 
Sbjct: 294  VQQTAQKFGKYFSYCLPTSRG-SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYF 352

Query: 1032 IQMTAITVGGSPVPISATDLKSDGESIIDSGTVITRLPASIYKPMRDAFKKQMARYKMAQ 1211
            I +  I+VGG  + IS    ++ G +IIDSGTVITRLP+++Y  ++  FK+ M++Y  A 
Sbjct: 353  IDVLGISVGGKALSISPMLFQNAG-TIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAP 411

Query: 1212 PISIYDTCYDFSNEKEINVPIISFTFGGSNVKVDIPSDGVFYQVNGGVSQVCLAFA---- 1379
             +S+ DTCYD SN   I++P ISF F G N  VD+  +G+   +  G SQVCLAFA    
Sbjct: 412  ALSLLDTCYDLSNYTSISIPKISFNFNG-NANVDLEPNGIL--ITNGASQVCLAFAGNGD 468

Query: 1380 EDSVNIFGNSQQQTLEVVYDVAGGKIGFAPNGCT 1481
            +D++ IFGN QQQTLEVVYDVAGG++GF   GC+
Sbjct: 469  DDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40
            [Arabidopsis thaliana] gi|24111269|gb|AAN46758.1|
            At5g10770/T30N20_40 [Arabidopsis thaliana]
            gi|332004211|gb|AED91594.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 474

 Score =  410 bits (1054), Expect = e-112
 Identities = 222/458 (48%), Positives = 305/458 (66%), Gaps = 6/458 (1%)
 Frame = +3

Query: 126  HTISVSSLISTNDKSSCNNIVPTSTSNGHSKRKASLRVAHKYGACSSSPQGGNKAETNAN 305
            HTI VSSL+ ++  SSC      ST+      K+SL V H++G CS    G  KA +  +
Sbjct: 34   HTIQVSSLLPSSS-SSCVLSPRASTT------KSSLHVTHRHGTCSRLNNG--KATSPDH 84

Query: 306  LAHEILSHDQARVESIKARLEKFNSNKDLSDTKTTTLPAHRGTDLRTLNYVVEVGLGTPA 485
            +  EIL  DQARV SI ++L K  +   +S++K+T LPA  G+ L + NY+V VGLGTP 
Sbjct: 85   V--EILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPK 142

Query: 486  KQLSLVFDTGSDITWTQCQPCAKSCYQQQQPIFDPSKSTSFSNIXXXXXXXXXXXXATGN 665
              LSL+FDTGSD+TWTQCQPC ++CY Q++PIF+PSKSTS+ N+            ATGN
Sbjct: 143  NDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGN 202

Query: 666  SPRCANNSTCVYEINYGDNSFTVGIFGKEKLTLSGGELLENIPFGCGQNNVGLFGATAGL 845
            +  C + S C+Y I YGD SF+VG   KEK TL+  ++ + + FGCG+NN GLF   AGL
Sbjct: 203  AGSC-SASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGL 261

Query: 846  IGLGRDPLSIVSQTAQKYGKVFSYCLPTTKSTSGGYLSFGRSGLNANLQYTQLSTSDD-- 1019
            +GLGRD LS  SQTA  Y K+FSYCLP++ S + G+L+FG +G++ ++++T +ST  D  
Sbjct: 262  LGLGRDKLSFPSQTATAYNKIFSYCLPSSASYT-GHLTFGSAGISRSVKFTPISTITDGT 320

Query: 1020 PYYIIQMTAITVGGSPVPISATDLKSDGESIIDSGTVITRLPASIYKPMRDAFKKQMARY 1199
             +Y + + AITVGG  +PI +T   + G ++IDSGTVITRLP   Y  +R +FK +M++Y
Sbjct: 321  SFYGLNIVAITVGGQKLPIPSTVFSTPG-ALIDSGTVITRLPPKAYAALRSSFKAKMSKY 379

Query: 1200 KMAQPISIYDTCYDFSNEKEINVPIISFTFGGSNVKVDIPSDGVFYQVNGGVSQVCLAFA 1379
                 +SI DTC+D S  K + +P ++F+F G  V V++ S G+FY     +SQVCLAFA
Sbjct: 380  PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAV-VELGSKGIFYVFK--ISQVCLAFA 436

Query: 1380 ----EDSVNIFGNSQQQTLEVVYDVAGGKIGFAPNGCT 1481
                + +  IFGN QQQTLEVVYD AGG++GFAPNGC+
Sbjct: 437  GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474


>ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  407 bits (1047), Expect = e-111
 Identities = 223/459 (48%), Positives = 298/459 (64%), Gaps = 8/459 (1%)
 Frame = +3

Query: 126  HTISVSSLISTNDKSSCNNIVPTSTSNGHSKRKASLRVAHKYGACSSSPQGGNKAETNAN 305
            H + +SSL+ +   SSC      S+S    K KASL V HK+G CS       KA++   
Sbjct: 46   HLVHLSSLLPS---SSC------SSSTKGPKTKASLEVVHKHGPCSQLNDHDGKAKSTTP 96

Query: 306  LAHEILSHDQARVESIKARLEK-FNSNKDLSDTKTTTLPAHRGTDLRTLNYVVEVGLGTP 482
             + +IL+ D+ RV+ I +RL K    +  + +  + TLPA  G+ + + NY V VGLGTP
Sbjct: 97   HS-DILNQDKERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGSGNYFVVVGLGTP 155

Query: 483  AKQLSLVFDTGSDITWTQCQPCAKSCYQQQQPIFDPSKSTSFSNIXXXXXXXXXXXXATG 662
             + LSL+FDTGSD+TWTQC+PCA+SCY+QQ  IFDPSKSTS+SNI            ATG
Sbjct: 156  KRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATG 215

Query: 663  NSPRC-ANNSTCVYEINYGDNSFTVGIFGKEKLTLSGGELLENIPFGCGQNNVGLFGATA 839
            N P C A+   C+Y I YGD+SF+VG F +E+LT++  ++++N  FGCGQNN GLFG +A
Sbjct: 216  NDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSA 275

Query: 840  GLIGLGRDPLSIVSQTAQKYGKVFSYCLPTTKSTSGGYLSFGRSGLNANLQYTQLST--S 1013
            GLIGLGR P+S V QTA KY K+FSYCLP+T S+S G+LSFG +     L+YT  ST   
Sbjct: 276  GLIGLGRHPISFVQQTAAKYRKIFSYCLPST-SSSTGHLSFGPAATGRYLKYTPFSTISR 334

Query: 1014 DDPYYIIQMTAITVGGSPVPISATDLKSDGESIIDSGTVITRLPASIYKPMRDAFKKQMA 1193
               +Y + +TAI VGG  +P+S++   S G +IIDSGTVITRLP + Y  +R AF++ M+
Sbjct: 335  GSSFYGLDITAIAVGGVKLPVSSSTF-STGGAIIDSGTVITRLPPTAYGALRSAFRQGMS 393

Query: 1194 RYKMAQPISIYDTCYDFSNEKEINVPIISFTFGGSNVKVDIPSDGVFYQVNGGVSQVCLA 1373
            +Y  A  +SI DTCYD S  K  ++P I F+F G  V V +P  G+ +  +    QVCLA
Sbjct: 394  KYPSAGELSILDTCYDLSGYKVFSIPTIEFSFAG-GVTVKLPPQGILFVAS--TKQVCLA 450

Query: 1374 FA----EDSVNIFGNSQQQTLEVVYDVAGGKIGFAPNGC 1478
            FA    +  V I+GN QQ+T+EVVYDV GG+IGF   GC
Sbjct: 451  FAANGDDSDVTIYGNVQQRTIEVVYDVGGGRIGFGAGGC 489


Top