BLASTX nr result

ID: Lithospermum22_contig00007896 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00007896
         (1476 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabid...   205   3e-50
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...   203   8e-50
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   202   2e-49
gb|AAD29058.1| putative non-LTR retroelement reverse transcripta...   201   4e-49

>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  206 bits (523), Expect = 2e-50
 Identities = 133/494 (26%), Positives = 228/494 (46%), Gaps = 22/494 (4%)
 Frame = +1

Query: 1    ILSCITNS*FSVIVNGESAGFFKSTKGIRQGDPISPSLFILAEDYLLRGLHKLMMEHPSL 180
            I+ CI++   S+I NGE    F  ++GIRQGDP+SP +F+L  + L   +   + +    
Sbjct: 610  IMDCISSPAISLIWNGEVTQSFSPSRGIRQGDPLSPYIFVLCMERLSMLISDRIRDGSWK 669

Query: 181  SYYTKCNNKIPCLAFADDCLIFYNGTKSSLSKITQLLDHYQAASGQVLNKAKSTCILSSK 360
                  +  +  + +ADD  +F   +  +   I  +L+ +   SG  +N +KS  I   K
Sbjct: 670  PIKISSDLGVSHIFYADDVFLFGQASVRNGGVIQNVLEEFGNISGLRVNMSKSLAIFPPK 729

Query: 361  LTPNRRKKIQHYTCFNKQVIPFNYLGIPIYKGKKQVILFDALLDRIKKTISSWEHRFLSY 540
            + P RR+ +  +           YLG  I   K +   +D LL+++K  I+ W+ ++L+ 
Sbjct: 730  MNPQRRRMLADFLTMKGSTSFGKYLGCNILPNKLRRGDYDGLLEKVKSAINGWQAKYLNM 789

Query: 541  GGKIVLIQAVLSSLPLYYLQVLKMPEQIKTRIGIIFNKFLWGDR------PGCSWKKLCS 702
             G+  LI++V+SS P+Y +Q   +P  +   I     KFLW            SW ++CS
Sbjct: 790  AGRCTLIKSVVSSFPVYGMQSSLLPVSVMNEIEKDCRKFLWNKMDKSHYLARMSWDRICS 849

Query: 703  PF-EGGLNFRSLDDLYAASMMK-AWYRMREGNSIWSKFMLSKYCRIRHPSVAKVRPHQSK 876
            P  +GGL FR L +   A M K  W  +++   +W + + ++Y        A  + H S 
Sbjct: 850  PTGKGGLGFRRLHNWNLAFMAKLGWMIIKDETKLWVRILKARYWERGSFLSAVGKNHHSP 909

Query: 877  LWKNITKFREITDKHIIWSLGKG-NCDLWLESWLPSGPLNTSVKSGV-------MVHHML 1032
            +W++I K RE+ +K ++  +G G +  LW   W+  GPL   + S +        V +++
Sbjct: 910  IWRDIVKGRELLEKGLVRRIGNGRSTSLWYHWWVGGGPLVDVMGSNIPEFMSHWQVSNII 969

Query: 1033 TDNKWDPNKXXXXXXXXXXXXXXTIPIHKDS--QDTILWKLTNNGNFDFKSAWQIARTPK 1206
               +WD  K               IP+   S  +D   W    NG F  KSA+ +    +
Sbjct: 970  KRGRWDTKKISHLLPPDILKQIKEIPLASMSEVEDDFTWNFEKNGTFSVKSAYYLINRRE 1029

Query: 1207 SKS---TLFNVIWHSSIPKKKSFLTWRHIHNWIPMESIQQQRGTSLASKCVCCSE-TESI 1374
             ++     +  +W  +IP K   L W  IHN +P      +R  +   +CV C    E +
Sbjct: 1030 EETGGKGSWRGLWRKNIPFKYKLLIWNGIHNILPTALFLAKRIHNFNPQCVACDHPIEDM 1089

Query: 1375 DHVFFKNTIAYNIW 1416
             H+F    +A ++W
Sbjct: 1090 IHLFRDCCVASSVW 1103


>emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
            gi|7267666|emb|CAB78094.1| RNA-directed DNA
            polymerase-like protein [Arabidopsis thaliana]
          Length = 1274

 Score =  205 bits (521), Expect = 3e-50
 Identities = 140/514 (27%), Positives = 243/514 (47%), Gaps = 25/514 (4%)
 Frame = +1

Query: 1    ILSCITNS*FSVIVNGESAGFFKSTKGIRQGDPISPSLFILAEDYLLRGLHKLMMEHPSL 180
            ++ C+    +S ++NG   G    ++G+RQGDP+SP LFIL  + +L GL +   E   +
Sbjct: 547  VMQCVCTVSYSFLINGSPQGSVVPSRGLRQGDPLSPYLFILCTE-VLSGLCRKAQEKGVM 605

Query: 181  S--YYTKCNNKIPCLAFADDCLIFYNGTKSSLSKITQLLDHYQAASGQVLNKAKSTCILS 354
                  + + ++  L FADD + F     +    ++ +L  Y+ ASGQ +N AKS    S
Sbjct: 606  VGIRVARGSPQVNHLLFADDTMFFCKTNPTCCGALSNILKKYELASGQSINLAKSAITFS 665

Query: 355  SKLTPNRRKKIQHYTCFNKQVIPFNYLGIPIYKGKKQVILFDALLDRIKKTISSWEHRFL 534
            SK   + +++++     + +     YLG+P + G+++  +F +++DRI++   SW  RFL
Sbjct: 666  SKTPQDIKRRVKLSLRIDNEGGIGKYLGLPEHFGRRKRDIFSSIVDRIRQRSHSWSIRFL 725

Query: 535  SYGGKIVLIQAVLSSLPLYYLQVLKMPEQIKTRIGIIFNKFLWGDRPG------CSWKKL 696
            S  GK +L++AVLSS+P Y +   K+P  +  +I  +  +F W  +P        SW KL
Sbjct: 726  SSAGKQILLKAVLSSMPSYAMMCFKLPASLCKQIQSVLTRFWWDSKPDKRKMAWVSWDKL 785

Query: 697  CSPF-EGGLNFRSLDDLYAASMMKAWYRMREGNSIWSKFMLSKYCRIRHPSVAKVRP-HQ 870
              P  EGGL FR ++         +W  ++E +S+ S+ +L KYC           P   
Sbjct: 786  TLPINEGGLGFREIE------AKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFA 839

Query: 871  SKLWKNITKFREITDKHIIWSLGKG-NCDLWLESWL-PSGPLN-----TSVKSGVMVHHM 1029
            S  W+ I   R++  K + WS+G+G + ++W E+WL PS P       T     + VH +
Sbjct: 840  SHGWRGILAGRDLLRKGLGWSIGQGDSINVWTEAWLSPSSPQTPIGPPTETNKDLSVHDL 899

Query: 1030 LTDN--KWDPNKXXXXXXXXXXXXXXTIPIHKDSQDTILWKLTNNGNFDFKSAWQIARTP 1203
            +  +   W+                         QD+++W    +G +  K+ + +A+  
Sbjct: 900  ICHDVKSWNVEAIRKHLPQYEDQIRKITINALPLQDSLVWLPVKSGEYTTKTGYALAKLN 959

Query: 1204 KSKSTLFNVIWHSSI------PKKKSFLTWRHIHNWIPMESIQQQRGTSLASKCVCCSET 1365
               ++  +  W  +I      PK K FL W+ +   +P+     +R       C  C +T
Sbjct: 960  SFPASQLDFNWQKNIWKIHTSPKVKHFL-WKAMKGALPVGEALSRRNIEAEVTCKRCGQT 1018

Query: 1366 ESIDHVFFKNTIAYNIWRLFAEIFGASNEEIHNT 1467
            ES  H+      A  +W L   +F  S E  H++
Sbjct: 1019 ESSLHLMLLCPYAKKVWELAPVLFNPS-EATHSS 1051


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score =  203 bits (517), Expect = 8e-50
 Identities = 138/491 (28%), Positives = 240/491 (48%), Gaps = 35/491 (7%)
 Frame = +1

Query: 1    ILSCITNS*FSVIVNGESAGFFKSTKGIRQGDPISPSLFILAEDYLLRGLHKLMM--EHP 174
            +++C+T++ FSV+VNG+ +  F  ++G+RQGDP+SP LF++  +    GL  L+   E  
Sbjct: 615  VMNCVTSARFSVLVNGQPSRNFFPSRGLRQGDPLSPFLFVVCAE----GLSTLLRDAEEK 670

Query: 175  SLSYYTKCNNKIPCLA---FADDCLIFYNGTKSSLSKITQLLDHYQAASGQVLNKAKSTC 345
             + +  K  +++  ++   FADD L+F   T+  +  +  +L  Y+AASGQ LN  KS  
Sbjct: 671  KVIHGVKIGHRVSPISHLFFADDSLLFIRATEEEVENVMDILSTYEAASGQKLNMEKSEM 730

Query: 346  ILSSKLTPNRRKKIQHYTCFNKQVIPFNYLGIPIYKGKKQVILFDALLDRIKKTISSWEH 525
              S  L P++   +Q    F        YLG+P + G  +  +F A+ DR+ K +  W+ 
Sbjct: 731  SYSRNLEPDKINTLQMKLAFKTVEGHEKYLGLPTFIGSSKKRVFQAIQDRVWKKLKGWKG 790

Query: 526  RFLSYGGKIVLIQAVLSSLPLYYLQVLKMPEQIKTRIGIIFNKFLWGDR------PGCSW 687
            ++LS  G+ VLI+AV  ++P Y +Q   +P+ I   I  +   F WG +         +W
Sbjct: 791  KYLSQAGREVLIKAVAQAIPTYAMQCFVIPKSIIDGIEKMCRNFFWGQKEEERRVAWVAW 850

Query: 688  KKLCSP-FEGGLNFRSLDDLYAASMMK-AWYRMREGNSIWSKFMLSKYCRIRHPSVAKVR 861
            +KL  P  EGGL  R+ D    A + K AW  + + +S+ ++ +  KY    +   A+V 
Sbjct: 851  EKLFLPKKEGGLGIRNFDVFNRALLAKQAWRILTKPDSLMARVIKGKYFPRSNFLEARVS 910

Query: 862  PHQSKLWKNITKFREITDKHIIWSLGKG-NCDLWLESWLPS-GPLNTSVKSGV------- 1014
            P+ S   K+I   R +  K +   +G G +  +W + W+PS    + +   GV       
Sbjct: 911  PNMSFTCKSILSARAVIQKGMCRVIGDGRDTTIWGDPWVPSLERYSIAATEGVSEDDGPQ 970

Query: 1015 MVHHMLTDNKWDPNKXXXXXXXXXXXXXXTIPIH-KDSQDTILWKLTNNGNFDFKSAW-- 1185
             V  ++++++W+                  IP+  +   D  +W ++ NG F  +SA+  
Sbjct: 971  KVCELISNDRWNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYH 1030

Query: 1186 ----------QIARTPKSKSTLFNVIWHSSIPKKKSFLTWRHIHNWIPMESIQQQRGTSL 1335
                        +R P  K  L+  IW + IP K    +W+ IHN + + +  ++RG ++
Sbjct: 1031 ELLEDRKTGPSTSRGPNLK--LWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNI 1088

Query: 1336 ASKCVCCSETE 1368
               C  C E E
Sbjct: 1089 DGACPRCGEKE 1099


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score =  202 bits (513), Expect = 2e-49
 Identities = 144/505 (28%), Positives = 237/505 (46%), Gaps = 31/505 (6%)
 Frame = +1

Query: 1    ILSCITNS*FSVIVNGESAGFFKSTKGIRQGDPISPSLFILAEDYLLRGLHKLMMEHPSL 180
            I+ CIT   + V++NG+  G     +G+RQGDP+SP LFIL  + L+  + K   E  +L
Sbjct: 390  IMWCITTVQYKVLINGQPKGLIIPERGLRQGDPLSPYLFILCTEVLIANIRKA--ERQNL 447

Query: 181  SYYTKCNNKIPC---LAFADDCLIFYNGTKSSLSKITQLLDHYQAASGQVLNKAKSTCIL 351
                K     P    L FADD L F    K     I ++L  Y++ SGQ +N +KS+   
Sbjct: 448  ITGIKVATPSPAVSHLLFADDSLFFCKANKEQCGIILEILKQYESVSGQQINFSKSSIQF 507

Query: 352  SSKLTPNRRKKIQHYTCFNKQVIPFNYLGIPIYKGKKQVILFDALLDRIKKTISSWEHRF 531
              K+  + +  I+     +      +YLG+P   G  +  +F  + DR++  I+ W  +F
Sbjct: 508  GHKVEDSIKADIKLILGIHNLGGMGSYLGLPESLGGSKTKVFSFVRDRLQSRINGWSAKF 567

Query: 532  LSYGGKIVLIQAVLSSLPLYYLQVLKMPEQIKTRIGIIFNKFLW---GDRPG---CSWKK 693
            LS GGK V+I++V ++LP Y +   ++P+ I +++     KF W   GD  G    +W K
Sbjct: 568  LSKGGKEVMIKSVAATLPRYVMSCFRLPKAITSKLTSAVAKFWWSSNGDSRGMHWMAWDK 627

Query: 694  LC-SPFEGGLNFRSLDDLYAASMMK-AWYRMREGNSIWSKFMLSKYCRIRHPSVAKVRPH 867
            LC S  +GGL FR++DD  +A + K  W  +   +S+++K    +Y R  +P  +     
Sbjct: 628  LCSSKSDGGLGFRNVDDFNSALLAKQLWRLITAPDSLFAKVFKGRYFRKSNPLDSIKSYS 687

Query: 868  QSKLWKNITKFREITDKHIIWSLGKG-NCDLWLESWLPSGPLNTSVKSGVMVHHML---- 1032
             S  W+++   R +  K +I  +G G +  +W + W+P+     +   G +V   L    
Sbjct: 688  PSYGWRSMISARSLVYKGLIKRVGSGASISVWNDPWIPAQFPRPAKYGGSIVDPSLKVKS 747

Query: 1033 ----TDNKWDPNKXXXXXXXXXXXXXXTIPI-HKDSQDTILWKLTNNGNFDFKSAWQIAR 1197
                  N W+ +                +PI + + +DT+ W  T  GN+  KS +  AR
Sbjct: 748  LIDSRSNFWNIDLLKELFDPEDVPLISALPIGNPNMEDTLGWHFTKAGNYTVKSGYHTAR 807

Query: 1198 TPKSK---------STLFNVIWHSSIPKKKSFLTWRHIHNWIPMESIQQQRGTSLASKCV 1350
               ++         +TL   IW    P K     W+ +   +P+    ++RG      CV
Sbjct: 808  LDLNEGTTLIGPDLTTLKAYIWKVQCPPKLRHFLWQILSGCVPVSENLRKRGILCDKGCV 867

Query: 1351 CC-SETESIDHVFFKNTIAYNIWRL 1422
             C +  ESI+H  F+   A  IW L
Sbjct: 868  SCGASEESINHTLFQCHPARQIWAL 892


>gb|AAD29058.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1229

 Score =  201 bits (511), Expect = 4e-49
 Identities = 138/498 (27%), Positives = 232/498 (46%), Gaps = 24/498 (4%)
 Frame = +1

Query: 1    ILSCITNS*FSVIVNGESAGFFKSTKGIRQGDPISPSLFILAEDYLLRGLHKLMMEHPSL 180
            +L C+T+  +S ++NG   G    T+G+RQGDP+SP LFIL  + +L GL         L
Sbjct: 501  VLECVTSVSYSFLINGTPQGKVVPTRGLRQGDPLSPCLFILCTE-VLSGLCTRAQRLRQL 559

Query: 181  SYYTKCNN--KIPCLAFADDCLIFYNGTKSSLSKITQLLDHYQAASGQVLNKAKSTCILS 354
                   N  ++  L FADD + F      S +K++++L  Y  ASGQ +N  KS+   S
Sbjct: 560  PGVRVSINGPRVNHLLFADDTMFFSKSDPESCNKLSEILSRYGKASGQSINFHKSSVTFS 619

Query: 355  SKLTPNRRKKIQHYTCFNKQVIPFNYLGIPIYKGKKQVILFDALLDRIKKTISSWEHRFL 534
            SK   + + +++      K+     YLG+P + G+++  +F A++D+I++   SW  RFL
Sbjct: 620  SKTPRSVKGQVKRILKIRKEGGTGKYLGLPEHFGRRKRDIFGAIIDKIRQKSHSWASRFL 679

Query: 535  SYGGKIVLIQAVLSSLPLYYLQVLKMPEQIKTRIGIIFNKFLWGDRPG------CSWKKL 696
            S  GK V+++AVL+S+PLY +   K+P  +  +I  +  +F W  +P        +W KL
Sbjct: 680  SQAGKQVMLKAVLASMPLYSMSCFKLPSALCRKIQSLLTRFWWDTKPDVRKTSWVAWSKL 739

Query: 697  CSPFE-GGLNFRSLDDLYAASMMK-AWYRMREGNSIWSKFMLSKYCRIRHPSVAKVRPHQ 870
             +P   GGL FR ++    + + K  W  +    S+ S+ +L KYC        K+    
Sbjct: 740  TNPKNAGGLGFRDIERCNDSLLAKLGWRLLNSPESLLSRILLGKYCHSSSFMECKLPSQP 799

Query: 871  SKLWKNITKFREITDKHIIWSLGKG-NCDLWLESWL----PSGPLNTSVK--SGVMVHHM 1029
            S  W++I   REI  + + W +  G    +W + WL    P  P+  +++    + V  +
Sbjct: 800  SHGWRSIIAGREILKEGLGWLITNGEKVSIWNDPWLSISKPLVPIGPALREHQDLRVSAL 859

Query: 1030 LTDN--KWDPNKXXXXXXXXXXXXXXTIPIHKDSQDTILWKLTNNGNFDFKSAWQIART- 1200
            +  N  +WD NK                       D + W    +G +  +S + IA   
Sbjct: 860  INQNTLQWDWNKIAVILPNYENLIKQLPAPSSRGVDKLAWLPVKSGQYTSRSGYGIASVA 919

Query: 1201 ----PKSKSTLFNVIWHSSIPKKKSFLTWRHIHNWIPMESIQQQRGTSLASKCVCCSETE 1368
                P+++    + +W      K   L W+     +P+     +R  S ++ C  C   E
Sbjct: 920  SIPIPQTQFNWQSNLWKLQTLPKIKHLMWKAAMEALPVGIQLVRRHISPSAACHRCGAPE 979

Query: 1369 SIDHVFFKNTIAYNIWRL 1422
            S  H+FF    A  +W L
Sbjct: 980  STTHLFFHCEFAAQVWEL 997

