BLASTX nr result

ID: Lithospermum22_contig00014418

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00014418
         (1688 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   116   3e-48
gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa ...    94   4e-45
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    89   3e-40
gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indi...    91   8e-37
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...    86   2e-36

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  116 bits (291), Expect(3) = 3e-48
 Identities = 63/188 (33%), Positives = 86/188 (45%), Gaps = 2/188 (1%)
 Frame = -2

Query: 1033 LDPCWNNLWGMKIPPHVKSFMWRCLNNNLPTKDRLRRRGIRVESACVFCKNGREDLQHLF 854
            LDP +  +W + +PP +  F+WRC+NN L     L  R +  E +CV C +  E + HL 
Sbjct: 1047 LDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREKSCVRCPSHGETVNHLL 1106

Query: 853  LHCPFACRLWFATPLNICTTGGPWTN--FREWWVYMMTQFKSMECPENFDLLAWILWFIW 680
              CPFA   W  +PL     GG W    FR     +       E  ++  L+ WILW +W
Sbjct: 1107 FKCPFARLTWAISPLP-APPGGEWAESLFRNMHHVLSVHKSQPEESDHHALIPWILWRLW 1165

Query: 679  KFRNGLVFGEANLCEEAIWQEGYNLYENYKNVYARSPQLLAPSVVIDDMECWQRPSRGWL 500
            K RN LVF         +  +     + + N   + PQ    S   D    WQ PS GW+
Sbjct: 1166 KNRNDLVFKGREFTAPQVILKATEDMDAWNN--RKEPQPQVTSSTRDRCVKWQPPSHGWV 1223

Query: 499  KINTDACW 476
            K NTD  W
Sbjct: 1224 KCNTDGAW 1231



 Score = 75.9 bits (185), Expect(3) = 3e-48
 Identities = 47/158 (29%), Positives = 77/158 (48%), Gaps = 13/158 (8%)
 Frame = -3

Query: 1560 LLAKQGWRVATQLASLLFKLLKGRYFRCSSFLKAKLGTNPSYGWRNLMEGRKVLQKGIRW 1381
            LL KQ WR+ T+  SL+ K+ K RYF  S  L A LG+ PS+ W+++ E + ++++GIR 
Sbjct: 865  LLGKQLWRMITEKDSLMAKVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLIKQGIRA 924

Query: 1380 RVGDRRSINMWTEPRVSRKTDFKLRGAQDNGFRWVLQLIRNGI-------------WDRR 1240
             +G+  +IN+WT+P +  K     +  + +    V Q   N I             W+  
Sbjct: 925  VIGNGETINVWTDPWIGAKPAKAAQAVKRS--HLVSQYAANSIHVVKDLLLPDGRDWNWN 982

Query: 1239 RVEDLLGSDKAVEILSIPLSRCGVCDKLIWHYTKSRTY 1126
             V  L   +    IL++        D+  W Y++S  Y
Sbjct: 983  LVSLLFPDNTQENILALRPGGKETRDRFTWEYSRSGHY 1020



 Score = 48.9 bits (115), Expect(3) = 3e-48
 Identities = 22/41 (53%), Positives = 26/41 (63%)
 Frame = -1

Query: 1682 MAKFVWANGKGDRGIHWKT*DKLDEDKANGGLGFKDLECIN 1560
            MA+F W N K  RG+HWK    L   KA GGLGFK++E  N
Sbjct: 822  MAEFWWKNKKEGRGLHWKAWCHLSRPKAVGGLGFKEIEAFN 862


>gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1747

 Score = 94.4 bits (233), Expect(5) = 4e-45
 Identities = 51/147 (34%), Positives = 81/147 (55%), Gaps = 2/147 (1%)
 Frame = -3

Query: 1560 LLAKQGWRVATQLASLLFKLLKGRYFRCSSFLKAKLGTNPSYGWRNLMEGRKVLQKGIRW 1381
            LL KQGWR+     SL+ ++LK +YF    F++A+LG++PSY WR+ + GR++L+KG+RW
Sbjct: 1152 LLGKQGWRLMMYPDSLVARMLKAKYFPWDDFMEAELGSSPSYLWRSFLWGRELLRKGVRW 1211

Query: 1380 RVGDRRSINMWTEPRVSRKTDFK--LRGAQDNGFRWVLQLIRNGIWDRRRVEDLLGSDKA 1207
            R+GD + + ++ +P V     F+  LR       R    L  NG W+   +      D+ 
Sbjct: 1212 RIGDGKEVRVFIDPWVPGLPSFRPILRQGAPLFLRVSDLLHNNGGWNMEALNYWFTDDEC 1271

Query: 1206 VEILSIPLSRCGVCDKLIWHYTKSRTY 1126
              I SI +      D  +W+Y K+  Y
Sbjct: 1272 EAISSITVGATRRPDVYMWNYCKNGRY 1298



 Score = 75.5 bits (184), Expect(5) = 4e-45
 Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 5/190 (2%)
 Frame = -2

Query: 1039 IILDP--CWNNLWGMKIPPHVKSFMWRCLNNNLPTKDRLRRRGIRVESACVFCKNGREDL 866
            I+L P   W +LW +K+PP +  F+WRC    +P  + L  + I   ++C  C+ GRE  
Sbjct: 1318 IVLAPRNFWKHLWKLKLPPKINHFLWRCSMGFIPCMEVLLWKHIAHSASCFRCQQGRESP 1377

Query: 865  QHLFLHCPFACRLWFATPLNICTTGGPWTNFREWWVYMMTQFKSMECPENFDLLAWILWF 686
             H    C     ++         + G + +F    ++++    S    E   L A +LW 
Sbjct: 1378 VHATWGCSCCVAVFERAGFYSKLSSGQFPSF----IHLLHHAFSTLDKEELQLFAVLLWL 1433

Query: 685  IWKFRNGLVFGEANLCEEAIWQEGYNLYENYKNVY---ARSPQLLAPSVVIDDMECWQRP 515
             W  RN      A +  + I++ G    + +K      A         VV   +  WQ P
Sbjct: 1434 NWHERNNCYHKGAVVPSDIIYENGVKFLKCFKEALGCRAGVEVKAVEEVVPGSLRRWQAP 1493

Query: 514  SRGWLKINTD 485
            S G LK+N D
Sbjct: 1494 SSGQLKVNCD 1503



 Score = 39.7 bits (91), Expect(5) = 4e-45
 Identities = 18/43 (41%), Positives = 26/43 (60%)
 Frame = -1

Query: 1688 RTMAKFVWANGKGDRGIHWKT*DKLDEDKANGGLGFKDLECIN 1560
            + +A+F W   +G +GIHW+    L   K +GGLGF+DL   N
Sbjct: 1108 KCVARFWWGK-EGGKGIHWRRWSDLCFSKKDGGLGFRDLSLFN 1149



 Score = 36.6 bits (83), Expect(5) = 4e-45
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
 Frame = -3

Query: 348  LQQIELESDSKSLIQILNKHQQAPAEVEIVVGDILYLANLLEVKF*F-TKRVFNNVAHTV 172
            L+ I +ESD    I +LN  ++  A    +V DI     L+ +   +  +R  N  AH +
Sbjct: 1560 LRNIMVESDCLEAIHLLNSKERCLAPEGGLVEDIQNTMALVNISSIYHVRREGNTAAHAI 1619

Query: 171  AHWENRDQRGTTWLHNSPHW 112
            A +  R+     WL + P W
Sbjct: 1620 AKFVARNNGRYVWLEDGPDW 1639



 Score = 24.6 bits (52), Expect(5) = 4e-45
 Identities = 11/36 (30%), Positives = 20/36 (55%)
 Frame = -1

Query: 443  GARNTQLPHSTAAVVAEAMTIRDGLEFAIENHCNKL 336
            G +N Q  H  +++VAE + I+ GL+  +E     +
Sbjct: 1530 GGKNFQ--HPVSSLVAELLAIKVGLDLVVERRLRNI 1563


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 89.0 bits (219), Expect(4) = 3e-40
 Identities = 53/181 (29%), Positives = 82/181 (45%), Gaps = 1/181 (0%)
 Frame = -2

Query: 1021 WNNLWGMKIPPHVKSFMWRCLNNNLPTKDRLRRRGIRVESACVFCKNGREDLQHLFLHCP 842
            W  +W  KIPP VK F W+ ++N L     +R+RG+ ++ AC  C    E  +HL   C 
Sbjct: 1051 WQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKEETTEHLIWGCD 1110

Query: 841  FACRLWFATPLNICTTGGPWTNFREWWVYMMTQFKSMECPENFDLLAWILWFIWKFRNGL 662
             + R W+ +PL I T      +FR W   ++   K  E    + L   I W IW  RN  
Sbjct: 1111 ESSRAWYISPLRIHTGNIEAGSFRIWVESLLDTHKDTEW---WALFWMICWNIWLGRNKW 1167

Query: 661  VFGEANLCEEAIWQEGYNLYENYKNVYARSPQLLAPSVVIDDME-CWQRPSRGWLKINTD 485
            VF +  L  + + +        ++   A +    +P   ++  E  W  P  G +K+N D
Sbjct: 1168 VFEKKKLAFQEVVERAVRGVMEFEEECAHT----SPVETLNTHENGWSVPPVGMVKLNVD 1223

Query: 484  A 482
            A
Sbjct: 1224 A 1224



 Score = 81.6 bits (200), Expect(4) = 3e-40
 Identities = 45/146 (30%), Positives = 83/146 (56%), Gaps = 4/146 (2%)
 Frame = -3

Query: 1560 LLAKQGWRVATQLASLLFKLLKGRYFRCSSFLKAKLGTNPSYGWRNLMEGRKVLQKGIRW 1381
            LLAKQ WR+ T+  SL+ +++KG+YF  S+FL+A++  N S+  ++++  R V+QKG+  
Sbjct: 874  LLAKQAWRILTKPDSLMARVIKGKYFPRSNFLEARVSPNMSFTCKSILSARAVIQKGMCR 933

Query: 1380 RVGDRRSINMWTEPRVSRKTDFKLRG----AQDNGFRWVLQLIRNGIWDRRRVEDLLGSD 1213
             +GD R   +W +P V     + +      ++D+G + V +LI N  W+   +  L    
Sbjct: 934  VIGDGRDTTIWGDPWVPSLERYSIAATEGVSEDDGPQKVCELISNDRWNVELLNTLFQPW 993

Query: 1212 KAVEILSIPLSRCGVCDKLIWHYTKS 1135
            ++  I  IP++     D+ +W  +K+
Sbjct: 994  ESTAIQRIPVALQKKPDQWMWMMSKN 1019



 Score = 34.3 bits (77), Expect(4) = 3e-40
 Identities = 32/110 (29%), Positives = 38/110 (34%), Gaps = 10/110 (9%)
 Frame = -3

Query: 417  LYCCCSCGGDDDPRRP*IC---------Y*EPLQQIELESDSKSLIQILNKHQQAPAEVE 265
            L  CC     +DP     C         Y    + + +E D K L   L           
Sbjct: 1247 LATCCGGWAMEDPAMAEACSLRYGLKVAYEAGFRNLVVEMDCKKLFLQLRGKASDVTPFG 1306

Query: 264  IVVGDILYLANLLE-VKF*FTKRVFNNVAHTVAHWENRDQRGTTWLHNSP 118
             VV DILYLA+    V F   KR  N VAH +A           WL   P
Sbjct: 1307 RVVDDILYLASKCSNVVFEHVKRHCNKVAHLLAQMCKNAMEKRVWLEEYP 1356



 Score = 29.3 bits (64), Expect(4) = 3e-40
 Identities = 12/38 (31%), Positives = 19/38 (50%)
 Frame = -1

Query: 1673 FVWANGKGDRGIHWKT*DKLDEDKANGGLGFKDLECIN 1560
            F W   + +R + W   +KL   K  GGLG ++ +  N
Sbjct: 834  FFWGQKEEERRVAWVAWEKLFLPKKEGGLGIRNFDVFN 871


>gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indica Group]
          Length = 1300

 Score = 90.9 bits (224), Expect(3) = 8e-37
 Identities = 52/145 (35%), Positives = 78/145 (53%), Gaps = 4/145 (2%)
 Frame = -3

Query: 1560 LLAKQGWRVATQLASLLFKLLKGRYFRCSSFLKAKLGTNPSYGWRNLMEGRKVLQKGIRW 1381
            LLA+Q WR+     SL  ++LK +YF   S +      N S GWR +  G ++L++GI W
Sbjct: 787  LLARQSWRILEFPESLCARVLKAKYFPNGSLIDTSFSGNASPGWRGIEYGLELLKQGIIW 846

Query: 1380 RVGDRRSINMWTEPRVSRKTDFKLRGAQDNG---FRWVLQLI-RNGIWDRRRVEDLLGSD 1213
            RVG+ R+I +W +P + R  DF  R     G    +WV +L+ +NG WD  +++ +    
Sbjct: 847  RVGNGRTIRIWRDPWIPR--DFSRRPITHKGTSRVKWVSELLDQNGEWDSHKIQQIFLPI 904

Query: 1212 KAVEILSIPLSRCGVCDKLIWHYTK 1138
               +ILSI  SR    D + WH  K
Sbjct: 905  DVEKILSIHTSRFHENDFVAWHSDK 929



 Score = 71.6 bits (174), Expect(3) = 8e-37
 Identities = 56/212 (26%), Positives = 84/212 (39%), Gaps = 21/212 (9%)
 Frame = -2

Query: 1048 SGGIILDPCWNNLWGMKIPPHVKSFMWRCLNNNLPTKDRLRRRGIRVESACVFCKNGRED 869
            S G  L   WN LW   +P  V+ F+WR  +N+L T    +++ +   S C  C    ED
Sbjct: 954  SSGQELSKAWNQLWSCHVPQKVRIFIWRAASNSLATMVNKKKKRLEHCSMCSICGTEEED 1013

Query: 868  LQHLFLHCPFACRLWFATPLNICTT---GGPWTNFREWWVYMMTQFKSMECPENFDLLAW 698
            + H    CP A  LW         T      WT     W++ +++  S    E    L  
Sbjct: 1014 VAHALCRCPHAKYLWEVMRRAKAITVQADRNWTGAD--WIFDISERIS---KEERPTLLM 1068

Query: 697  ILWFIWKFRNGLVFGEANLCEEAIWQEGYNLY------------------ENYKNVYARS 572
            +LW IW  RN +  G+A +  E + Q   + Y                  ++     A  
Sbjct: 1069 MLWRIWYVRNEITHGKAAVPAE-VSQRFISSYITSLLEIRQFPDANLCKGKHVIRCAAAG 1127

Query: 571  PQLLAPSVVIDDMECWQRPSRGWLKINTDACW 476
             Q+  P V    +  W RP  GW+K+N D  +
Sbjct: 1128 AQVNHPRVNSVPVR-WVRPQAGWMKLNVDGSY 1158



 Score = 40.0 bits (92), Expect(3) = 8e-37
 Identities = 16/43 (37%), Positives = 22/43 (51%)
 Frame = -1

Query: 1688 RTMAKFVWANGKGDRGIHWKT*DKLDEDKANGGLGFKDLECIN 1560
            + +  F W  G G R  HW+  D L + K  GG+GF+D    N
Sbjct: 742  KAVRNFWWGAGDGKRRTHWRAWDSLTKPKQCGGMGFRDFRLFN 784


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score = 86.3 bits (212), Expect(3) = 2e-36
 Identities = 52/149 (34%), Positives = 82/149 (55%), Gaps = 4/149 (2%)
 Frame = -3

Query: 1560 LLAKQGWRVATQLASLLFKLLKGRYFRCSSFLKAKLGTNPSYGWRNLMEGRKVLQKGIRW 1381
            LLAKQ WR+ T   SL  K+ KGRYFR S+ L +    +PSYGWR+++  R ++ KG+  
Sbjct: 649  LLAKQLWRLITAPDSLFAKVFKGRYFRKSNPLDSIKSYSPSYGWRSMISARSLVYKGLIK 708

Query: 1380 RVGDRRSINMWTEPRVSRK--TDFKLRGAQDNGFRWVLQLI--RNGIWDRRRVEDLLGSD 1213
            RVG   SI++W +P +  +     K  G+  +    V  LI  R+  W+   +++L   +
Sbjct: 709  RVGSGASISVWNDPWIPAQFPRPAKYGGSIVDPSLKVKSLIDSRSNFWNIDLLKELFDPE 768

Query: 1212 KAVEILSIPLSRCGVCDKLIWHYTKSRTY 1126
                I ++P+    + D L WH+TK+  Y
Sbjct: 769  DVPLISALPIGNPNMEDTLGWHFTKAGNY 797



 Score = 70.1 bits (170), Expect(3) = 2e-36
 Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
 Frame = -2

Query: 1012 LWGMKIPPHVKSFMWRCLNNNLPTKDRLRRRGIRVESACVFCKNGREDLQHLFLHCPFAC 833
            +W ++ PP ++ F+W+ L+  +P  + LR+RGI  +  CV C    E + H    C  A 
Sbjct: 828  IWKVQCPPKLRHFLWQILSGCVPVSENLRKRGILCDKGCVSCGASEESINHTLFQCHPAR 887

Query: 832  RLW----FATPLNICTTGGPWTNFREWWVYMMTQFKSMECPENFDLLAWILWFIWKFRNG 665
            ++W      T   I  +   +TN    +  + +   S   P       WI+W+IWK RN 
Sbjct: 888  QIWALSQIPTAPGIFPSNSIFTNLDHLFWRIPSGVDSAPYP-------WIIWYIWKARNE 940

Query: 664  LVF 656
             VF
Sbjct: 941  KVF 943



 Score = 45.1 bits (105), Expect(3) = 2e-36
 Identities = 18/41 (43%), Positives = 28/41 (68%)
 Frame = -1

Query: 1682 MAKFVWANGKGDRGIHWKT*DKLDEDKANGGLGFKDLECIN 1560
            +AKF W++    RG+HW   DKL   K++GGLGF++++  N
Sbjct: 606  VAKFWWSSNGDSRGMHWMAWDKLCSSKSDGGLGFRNVDDFN 646

