BLASTX nr result

ID: Coptis24_contig00023273

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00023273
         (1440 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   167   5e-39
gb|EEC84982.1| hypothetical protein OsI_32248 [Oryza sativa Indi...   151   5e-34
gb|AAF69169.1|AC007915_21 F27F5.21 [Arabidopsis thaliana]             147   7e-33
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   144   4e-32
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   143   1e-31

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  167 bits (424), Expect = 5e-39
 Identities = 133/431 (30%), Positives = 212/431 (49%), Gaps = 20/431 (4%)
 Frame = -1

Query: 1407 NHTNITLIPKIPNLNQVKNFRHISLCNSISWRIISTILANRLKPSLVKIISPLHSAFVKN 1228
            N TNI LIPKI    ++ +FR ISLCN I +++I  ++ANRLK  L  +IS   +AFVK 
Sbjct: 485  NKTNICLIPKILKAEKMTDFRPISLCNVI-YKVIGKLMANRLKKILPSLISETQAAFVKG 543

Query: 1227 RQITDNAKRFCKKLKKM----KGIHGFLAIKLDMPRAFDHIK*PFLFKNLKQLVSVKIGL 1060
            R I+DN     + L  +    K    F+AIK D+ +A+D ++ PFL K ++       GL
Sbjct: 544  RLISDNILIAHELLHALSSNNKCSEEFIAIKTDISKAYDRVEWPFLEKAMR-------GL 596

Query: 1059 MFA*LLHPFLLIFN----------LNGSPFGKFSASRGI*AKVTLYPPTFLFIICTKFLS 910
             FA   H   LI            +NG+P G+   SRG+     L P  +LF+ICT+ L 
Sbjct: 597  GFAD--HWIRLIMECVKSVRYQVLINGTPHGEIIPSRGLRQGDPLSP--YLFVICTEMLV 652

Query: 909  RNIAHLQSTIQVHGVCISRNAAPITHLMLADDCFIFLRAN---LIEAHNILVFYSV*ITI 739
            + +   +   Q+ G+ ++R A PI+HL+ ADD   + + N   L +   I+  YS+    
Sbjct: 653  KMLQSAEQKNQITGLKVARGAPPISHLLFADDSMFYCKVNDEALGQIIRIIEEYSLASGQ 712

Query: 738  LICQLRN*LS*IFF--NF*S*LAC*T*KIDYRILRIPFSNSNNTYIGVSFNPVKPNQIGT 565
             +  L+   S I+F  +      C    +  R L I        Y+G+     + +++ T
Sbjct: 713  RVNYLK---SSIYFGKHISEERRC----LVKRKLGIEREGGEGVYLGLP-ESFQGSKVAT 764

Query: 564  SKILKS-VTYKINSWRNKLLFKAGRGTLIKSVVCAIPT*NMQTDLFTKTFCHDLDILSSN 388
               LK  +  K+  W++  L   G+  L+K+V  A+PT  M      KT C  ++ + + 
Sbjct: 765  LSYLKDRLGKKVLGWQSNFLSPGGKEILLKAVAMALPTYTMSCFKIPKTICQQIESVMAE 824

Query: 387  FWWGTTNHNDNNKIYL*NWTRCCDKKSNGGCGFKHTYAQSMTLISKFC*KLISERNLFWV 208
            FWW   N  +   ++   W      K+ GG GFK   A ++ L+ K   ++I+E++    
Sbjct: 825  FWW--KNKKEGRGLHWKAWCHLSRPKAVGGLGFKEIEAFNIALLGKQLWRMITEKDSLMA 882

Query: 207  KLLKCKYFPNS 175
            K+ K +YF  S
Sbjct: 883  KVFKSRYFSKS 893


>gb|EEC84982.1| hypothetical protein OsI_32248 [Oryza sativa Indica Group]
            gi|222630794|gb|EEE62926.1| hypothetical protein
            OsJ_17731 [Oryza sativa Japonica Group]
          Length = 893

 Score =  151 bits (381), Expect = 5e-34
 Identities = 121/419 (28%), Positives = 205/419 (48%), Gaps = 9/419 (2%)
 Frame = -1

Query: 1440 RLSY*FFASFWNHTNITLIPKIPNLNQVKNFRHISLCNSISWRIISTILANRLKPSLVKI 1261
            +LS   +   WN T +TLIPK+ +  ++K+ R ISLC ++ +++ S +L+NRLK  L  I
Sbjct: 149  KLSNEHYGEGWNDTVVTLIPKVQSPERLKDLRPISLC-TVVYKLASKVLSNRLKLILPDI 207

Query: 1260 ISPLHSAFVKNRQITDNAKRFCKKL----KKMKGIHGFLAIKLDMPRAFDHIK*PFLFKN 1093
            ISP  SAFV  R ITDN     +       K  G  G+ A+KLDM +A+D ++  FL K 
Sbjct: 208  ISPNQSAFVPQRLITDNVLLAYEMTHFMQTKRTGREGYAALKLDMSKAYDRVEWSFLEKM 267

Query: 1092 LKQLVSVKIGL-MFA*LLHPFLLIFNLNGSPFGKFSASRGI*AKVTLYPPTFLFIICTKF 916
            + +L   +  + +    +        +NG    +   SRG+     + P  +LF+IC + 
Sbjct: 268  MVRLGFAEGWVKLIMRCVSTVTYRIKVNGDLTDQIIPSRGLRQGDPISP--YLFLICAEG 325

Query: 915  LSRNIAHLQSTIQVHGVCISRNAAPITHLMLADDCFIFLRANLIEA---HNILVFYSV*I 745
             S  +   +    + GV + + A  ++HL+ ADD  +  + N   A    N+L  Y    
Sbjct: 326  FSSLLYAAEERGDLSGVKVCQQAPSVSHLLFADDSLLLFKVNERSAQCLQNVLNLYESCS 385

Query: 744  TILICQLRN*LS*IFFNF*S*LAC*T*KIDYRILRIPFSNSNNTYIGVSFNPVKPNQIGT 565
              ++ + +   S I F+  +  A    K+   IL I     N  Y+G+     +      
Sbjct: 386  GQIVNKDK---SSIMFSKNTSQA--DRKMVMEILDISTEARNEKYLGLPVYMGRSRAKTF 440

Query: 564  SKILKSVTYKINSWRNKLLFKAGRGTLIKSVVCAIPT*NMQTDLFTKTFCHDLDILSSNF 385
            + + + V  KI  W+ KLL KAG+  LIK+V  AIPT  M     TKT C ++  +   +
Sbjct: 441  AYLKERVWKKIQGWKEKLLSKAGKDILIKAVAQAIPTFAMSCFDLTKTLCDEISAIICRY 500

Query: 384  WWGTTNHNDNNKIYL*NWTRCCDKKSNGGCGFKHTYAQSMTLISKFC*KLIS-ERNLFW 211
            +W  +     NK++  +W   C +K  GG G++  +  ++ ++++     +S  R++FW
Sbjct: 501  FW--SQQETENKMHWLSWDLLCRRKKKGGLGYRDLHLFNLAMLARQGDWDVSLVRDIFW 557


>gb|AAF69169.1|AC007915_21 F27F5.21 [Arabidopsis thaliana]
          Length = 1023

 Score =  147 bits (371), Expect = 7e-33
 Identities = 126/431 (29%), Positives = 200/431 (46%), Gaps = 20/431 (4%)
 Frame = -1

Query: 1407 NHTNITLIPKIPNLNQVKNFRHISLCNSISWRIISTILANRLKPSLVKIISPLHSAFVKN 1228
            N TNI LIPK     ++   R ISLCN + ++IIS IL  RLK  L  +IS  HSAFV+ 
Sbjct: 335  NTTNICLIPKTERPTRMTELRPISLCN-VGYKIISKILCQRLKKILPSLISETHSAFVEG 393

Query: 1227 RQITDN---AKRFCKKLKKMKGIHG-FLAIKLDMPRAFDHIK*PFLFKNLKQLVSVK--- 1069
            R I+DN   A+     L+      G F+AIK DM +A+D ++  F+ + LK++   +   
Sbjct: 394  RLISDNILIAQEMFHGLRTNPSCKGKFMAIKTDMSKAYDRVEWNFIEELLKKMGFCEKWI 453

Query: 1068 IGLMFA*LLHPFLLIFNLNGSPFGKFSASRGI*AKVTLYPPTFLFIICTKFLSRNIAHLQ 889
              +M+      + ++  LNG P G     RG+     L P  +LFI+CT+ L  NI   +
Sbjct: 454  SWIMWCITTVQYRVL--LNGQPRGLIVPKRGLRQGDPLSP--YLFILCTEVLIANIRKAE 509

Query: 888  STIQVHGVCISRNAAPITHLMLADDCFIFLRANLIEAHNILVFYSV*ITILICQLRN*LS 709
                + G+ ++  +  ++HL+ A+D   F +AN  +   IL        +    +    S
Sbjct: 510  QEKLITGIKVATASPSVSHLLFANDSLFFCKANKEQCGVILGILKQYEAVSGQMINFSKS 569

Query: 708  *IFFNF*S*LAC*T*KIDYRI-------LRIPFSNSNNTYIGVSFNPVKPNQIGTSK--- 559
             I F           K+   I       L I       +Y+G+      P  +G SK   
Sbjct: 570  SIQFGH---------KVGDDIKAEIKSALGIHSIGGMGSYLGL------PESLGGSKTRI 614

Query: 558  ---ILKSVTYKINSWRNKLLFKAGRGTLIKSVVCAIPT*NMQTDLFTKTFCHDLDILSSN 388
               +   +  +IN W  K L K G+  +IKSVV  +PT  M      KT    L    + 
Sbjct: 615  FSFVRDRLQTRINGWSAKFLSKGGKEVMIKSVVAVLPTYVMSCFRLPKTITSKLTSAVAK 674

Query: 387  FWWGTTNHNDNNKIYL*NWTRCCDKKSNGGCGFKHTYAQSMTLISKFC*KLISERNLFWV 208
            FWW  ++  ++  ++   W + C  K++GG GF+     +  L++K   +LI+  +  + 
Sbjct: 675  FWW--SSDGESRGMHWMAWNKLCSSKADGGLGFRSVNDFNSALLAKQLWRLITVPDSLFA 732

Query: 207  KLLKCKYFPNS 175
            K+ K +YF  S
Sbjct: 733  KVFKGRYFRKS 743


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score =  144 bits (364), Expect = 4e-32
 Identities = 122/428 (28%), Positives = 199/428 (46%), Gaps = 14/428 (3%)
 Frame = -1

Query: 1407 NHTNITLIPKIPNLNQVKNFRHISLCNSISWRIISTILANRLKPSLVKIISPLHSAFVKN 1228
            N T I LIPKI +   + ++R ISLC + S++IIS IL  RLK  L  +IS   +AFV  
Sbjct: 847  NQTQICLIPKIIDPKHMSDYRPISLCTA-SYKIISKILIKRLKQCLGDVISDSQAAFVPG 905

Query: 1227 RQITDN---AKRFCKKLKKMKGIH-GFLAIKLDMPRAFDHIK*PFLFKNLKQLVS----V 1072
            + I+DN   A      LK  +    G++A+K D+ +A+D ++  FL K + QL      V
Sbjct: 906  QNISDNVLVAHELLHSLKSRRECQSGYVAVKTDISKAYDRVEWNFLEKVMIQLGFAPRWV 965

Query: 1071 KIGLMFA*LLHPFLLIFNLNGSPFGKFSASRGI*AKVTLYPPTFLFIICTKFLSRNIAHL 892
            K  +     +   +LI   NGSP+GK   SRGI     L P  +LF+ C + LS  +   
Sbjct: 966  KWIMTCVTSVSYEVLI---NGSPYGKIFPSRGIRQGDPLSP--YLFLFCAEVLSNMLRKA 1020

Query: 891  QSTIQVHGVCISRNAAPITHLMLADDCFIFLRANLIEAHNILVFYSV*ITILICQLRN*L 712
            +   Q+HG+ I+++   I+HL+ ADD   F RA+      + + +         ++    
Sbjct: 1021 EVNKQIHGMKITKDCLAISHLLFADDSLFFCRASNQNIEQLALIFKKYEEASGQKINYAK 1080

Query: 711  S*IFFNF*S*LAC*T*KIDYRILRIPFSNSNNTYIGVSFNPVKPNQIGTSK------ILK 550
            S I F     +     +  +R+L I        Y+G+      P Q+G  K      I+ 
Sbjct: 1081 SSIIFG--QKIPTMRRQRLHRLLGIDNVRGGGKYLGL------PEQLGRRKVELFEYIVT 1132

Query: 549  SVTYKINSWRNKLLFKAGRGTLIKSVVCAIPT*NMQTDLFTKTFCHDLDILSSNFWWGTT 370
             V  +   W    L  AG+  +IK++  A+P  +M   L     C++++ L + FWWG  
Sbjct: 1133 KVKERTEGWAYNYLSPAGKEIVIKAIAMALPVYSMNCFLLPTLICNEINSLITAFWWG-- 1190

Query: 369  NHNDNNKIYL*NWTRCCDKKSNGGCGFKHTYAQSMTLISKFC*KLISERNLFWVKLLKCK 190
                              K++ G  GFK  +  +  L++K   ++++       +L K  
Sbjct: 1191 ------------------KENEGDLGFKDLHQFNRALLAKQAWRILTNPQSLLARLYKGL 1232

Query: 189  YFPNSYFL 166
            Y+PN+ +L
Sbjct: 1233 YYPNTTYL 1240


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score =  143 bits (360), Expect = 1e-31
 Identities = 123/425 (28%), Positives = 199/425 (46%), Gaps = 14/425 (3%)
 Frame = -1

Query: 1407 NHTNITLIPKIPNLNQVKNFRHISLCNSISWRIISTILANRLKPSLVKIISPLHSAFVKN 1228
            N TNI LIPK     ++   R ISLCN + +++IS IL  RLK  L  +IS   SAFV  
Sbjct: 269  NTTNICLIPKTERPTRMTELRPISLCN-VGYKVISKILCQRLKTVLPNLISETQSAFVDG 327

Query: 1227 RQITDN---AKRFCKKLKKMKGIHG-FLAIKLDMPRAFDHIK*PFLFKNLKQLVSVK--- 1069
            R I+DN   A+     L+        F+AIK DM +A+D ++  F+   L+++   +   
Sbjct: 328  RLISDNILIAQEMFHGLRTNSSCKDKFMAIKTDMSKAYDQVEWNFIEALLRKMGFCEKWI 387

Query: 1068 IGLMFA*LLHPFLLIFNLNGSPFGKFSASRGI*AKVTLYPPTFLFIICTKFLSRNIAHLQ 889
              +M+      + ++  +NG P G     RG+     L P  +LFI+CT+ L  NI   +
Sbjct: 388  SWIMWCITTVQYKVL--INGQPKGLIIPERGLRQGDPLSP--YLFILCTEVLIANIRKAE 443

Query: 888  STIQVHGVCISRNAAPITHLMLADDCFIFLRANLIEAHNILVFYSV*ITILICQLRN*LS 709
                + G+ ++  +  ++HL+ ADD   F +AN  +   IL       ++   Q+    S
Sbjct: 444  RQNLITGIKVATPSPAVSHLLFADDSLFFCKANKEQCGIILEILKQYESVSGQQINFSKS 503

Query: 708  *IFFNF*S*LAC*T*KIDYR-ILRIPFSNSNNTYIGVSFNPVKPNQIGTSK------ILK 550
             I F         + K D + IL I       +Y+G+      P  +G SK      +  
Sbjct: 504  SIQFGH---KVEDSIKADIKLILGIHNLGGMGSYLGL------PESLGGSKTKVFSFVRD 554

Query: 549  SVTYKINSWRNKLLFKAGRGTLIKSVVCAIPT*NMQTDLFTKTFCHDLDILSSNFWWGTT 370
             +  +IN W  K L K G+  +IKSV   +P   M      K     L    + FWW  +
Sbjct: 555  RLQSRINGWSAKFLSKGGKEVMIKSVAATLPRYVMSCFRLPKAITSKLTSAVAKFWW--S 612

Query: 369  NHNDNNKIYL*NWTRCCDKKSNGGCGFKHTYAQSMTLISKFC*KLISERNLFWVKLLKCK 190
            ++ D+  ++   W + C  KS+GG GF++    +  L++K   +LI+  +  + K+ K +
Sbjct: 613  SNGDSRGMHWMAWDKLCSSKSDGGLGFRNVDDFNSALLAKQLWRLITAPDSLFAKVFKGR 672

Query: 189  YFPNS 175
            YF  S
Sbjct: 673  YFRKS 677

