BLASTX nr result

ID: Coptis21_contig00022735

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00022735
         (2280 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   242   3e-61
ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   229   3e-57
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   224   8e-56
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   218   7e-54
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               214   6e-53

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  242 bits (618), Expect = 3e-61
 Identities = 149/430 (34%), Positives = 210/430 (48%), Gaps = 10/430 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THLSFADD+++ S G + SI  I  +   F + +GL ISL KST+   G+S      +AD
Sbjct: 699  THLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVAD 758

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
                      V+YLG PL   RL   DCLPL+E +  RI +W +R LS+AGRL LI SVL
Sbjct: 759  RFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVL 818

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
             S   +W   F LP+  + ++ K+C+ FLWSG  +     K S  ++   KDE GL +  
Sbjct: 819  WSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRS 878

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMK-APKDCSWVWRGILEHRK 1563
            L   N      LV+K+ S  NSLW  WV  H ++N  FW +K      SW+W+ +L++R+
Sbjct: 879  LKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYRE 938

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGRW-- 1395
             A   ++  + NG  T  W+D W     LL     R ++ LG      V E  +N R   
Sbjct: 939  VAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRR 998

Query: 1394 --NDIVLSLPECDLKTKILRTDIYDLMNKDQVVW---SLTHSGKFSARSAYIAXXXXXXX 1230
              ND+   + +   K+   RT+      +D+V+W   S      FS R  +         
Sbjct: 999  HRNDVYNVIEDALKKSWDTRTE-----TEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSAR 1053

Query: 1229 XXXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHLF 1050
                   W     PK+SFC+W    G L T DR++N      + C  C  T  E   HLF
Sbjct: 1054 VPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTL-ETRDHLF 1112

Query: 1049 FECSYTKRVW 1020
            F CS+T  +W
Sbjct: 1113 FTCSFTSVIW 1122


>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  229 bits (583), Expect = 3e-57
 Identities = 133/402 (33%), Positives = 204/402 (50%), Gaps = 4/402 (0%)
 Frame = -2

Query: 2213 IKTILATFTEATGLGISLNKSTILTGGMSIVESQALADXXXXXXXXXXVKYLGFPLFASR 2034
            I+T+L  F + +GL  + NKS I   G+   E + +            +KYLG PL +SR
Sbjct: 8    IRTVLTKFQDLSGLYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSR 67

Query: 2033 LGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVLSSFHTYWSRTFVLPKTVLDKVS 1854
            L    C  L++ ITS++ +W  R LS+AGR+QLI+SVL S   YW+  F+LP  V+  V 
Sbjct: 68   LKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVE 127

Query: 1853 KICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMIDLHTWNVAAYCGLVFKLAS-REN 1677
            +I   FLWSG  +     K + + +   K E GL +  +  WN  A    ++ L +  + 
Sbjct: 128  QIMKSFLWSGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDG 187

Query: 1676 SLWGNWVWTHSIKNKHFWTMKAPKDCSWVWRGILEHRKTAMKFTRNVIANGDDTFIWHDL 1497
            S+W  W+ ++ ++ ++FWT+K P++CSW W  IL+ R  A    + +I +G  T +W D 
Sbjct: 188  SIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDN 247

Query: 1496 WCTETPLLWDNNARQMLQLG--EEAKVSELISNGRW-NDIVLSLPECDLKTKILRTDIYD 1326
            W   +PL      R +   G  + AKV+ LI N  W      ++    +   I       
Sbjct: 248  WHPHSPLADSYGERFIYDSGMAKNAKVNVLIQNSEWKTPTTQAIGWHPIIEAIPSNSNPK 307

Query: 1325 LMNKDQVVWSLTHSGKFSARSAYIAXXXXXXXXXXXXXXWGKLVIPKHSFCTWQLFSGSL 1146
            +  KD++VW  + + +FS + A+                W K  +P+HSF  W      L
Sbjct: 308  MGQKDELVWLDSPNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKL 367

Query: 1145 STQDRLVNKGIINHSKCSLCGTTARENSKHLFFECSYTKRVW 1020
            +TQD+L   GI   ++CSLC     E+  HLFFECSYTK +W
Sbjct: 368  TTQDKLHRFGIHGPNRCSLC-LRNNEDHNHLFFECSYTKAIW 408


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  224 bits (571), Expect = 8e-56
 Identities = 151/459 (32%), Positives = 214/459 (46%), Gaps = 7/459 (1%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THL FADD+++FS G   SI+    I   F   + L ISL KSTI   G+S     ++  
Sbjct: 846  THLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQ 905

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
                      VKYLG PL   R+   D LPL+E I +RI++W NR LS AGRLQLI SVL
Sbjct: 906  QFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVL 965

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
            SS   +W   F LPK  L ++ K+ + FLWSGP L     K +   +   K+E GL +  
Sbjct: 966  SSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKP 1025

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L   N  +   L++++ S  +SLW  WV  H I+ + FW++K      SW+WR IL+ R 
Sbjct: 1026 LKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRD 1085

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGRWND 1389
             A  F R  + +G  T  WHD WC    L     +R  + LG    A V+E+++  R   
Sbjct: 1086 KARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKR 1145

Query: 1388 IVLS-LPECDLKTKILRTDIYDLMNKDQVVWSL---THSGKFSARSAYIAXXXXXXXXXX 1221
                 L +   + ++ R D     + D+ +W     T    FS+   +            
Sbjct: 1146 HRADFLNQIKSQIELARQD--RSTDGDRSLWKQKEDTFKSSFSSSKTWQQIRSISLRCDW 1203

Query: 1220 XXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHLFFEC 1041
                W     PK+SF TW  F   L+T D++          C  CG    E   HLFF C
Sbjct: 1204 YRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEEL-ETRDHLFFSC 1262

Query: 1040 SYTKRVWSGVKAKIGMGFRVADSNNEWHCLTMSSIVSCR 924
             Y+  VW  +   +  G  + +    W+ +T   + S R
Sbjct: 1263 PYSSHVWFSLTKGLLNGRNILN----WNLITPHLLDSSR 1297


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  218 bits (554), Expect = 7e-54
 Identities = 141/438 (32%), Positives = 205/438 (46%), Gaps = 11/438 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THL FADD++I + G V S+  I  ++  F + +GL I++ K+T+ T G+S      +  
Sbjct: 104  THLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMIS 163

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
                      V+YLG PL   RL  +D  PL E I +RI  W +R LS AGRL LI SVL
Sbjct: 164  RYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVL 223

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
             S   +W   F LP   L +++ IC+ FLWSGP L +   K S + +   K E GL +  
Sbjct: 224  WSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRS 283

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L   NV +   L++++ S ++SLW  W   + +K + FW++       SW+W+ +L++R+
Sbjct: 284  LTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYRE 343

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGR--- 1398
            TA  F+R  + NG  T  W D W     L+     R  + LG      V+E  SN R   
Sbjct: 344  TAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRK 403

Query: 1397 -----WNDIVLSLPECDLKTKILRTDIYDLMNKDQVVWSLTHSGKFSARSAYIAXXXXXX 1233
                  NDI  +L +      +LR D      K  V         FS +  +        
Sbjct: 404  HRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDV-----FKTSFSTKDTWNQVRKKSN 458

Query: 1232 XXXXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHL 1053
                    W     PK+ FCTW      LST  R+      +  KC+ C T+  E   HL
Sbjct: 459  EVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSI-ETRDHL 517

Query: 1052 FFECSYTKRVWSGVKAKI 999
            FF CSY   +W+ +   +
Sbjct: 518  FFSCSYASAIWTAIAKNV 535


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  214 bits (546), Expect = 6e-53
 Identities = 140/434 (32%), Positives = 206/434 (47%), Gaps = 11/434 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THLSFADD+++ S G   SI  I  +   F + +GL ISL KST+   G+S +  Q +A 
Sbjct: 346  THLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAA 405

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
                      V+YLG PL   RL   D  PL+E I  RI+ W  R  S AGR  LI SVL
Sbjct: 406  KFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVL 465

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
             S   +W   F LP+  + ++ K+C+ FLWSG  ++    K S +++   K E GL + +
Sbjct: 466  WSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRN 525

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L   N  +   LV+++ S  NSLW  WV  + I+ K  W++K      SW+WR IL+ R 
Sbjct: 526  LKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRD 585

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSEL---ISNGR 1398
             A  F+R  + NG+    W+D W     L+     +  + LG   EA V++     S  R
Sbjct: 586  VAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRR 645

Query: 1397 WNDIVLSLPECDLKTKILRTDIYDLMNKDQVVW---SLTHSGKFSARSAYIAXXXXXXXX 1227
                +L+    +++  +    I+    +D V+W   +      FS R  +          
Sbjct: 646  HRTSLLN----EIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTV 701

Query: 1226 XXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLV--NKGIINHSKCSLCGTTARENSKHL 1053
                  W +   PK++ CTW      L T DR++  N        C LC T   +  +HL
Sbjct: 702  SWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLC-TNNSKTLEHL 760

Query: 1052 FFECSYTKRVWSGV 1011
            FF CSY   VW+ +
Sbjct: 761  FFSCSYASTVWAAL 774

