BLASTX nr result
ID: Coptis21_contig00022735
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00022735
         (2280 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   242   3e-61
ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   229   3e-57
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   224   8e-56
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   218   7e-54
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               214   6e-53

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1223

 Score = 242 bits (618), Expect = 3e-61
 Identities = 149/430 (34%), Positives = 210/430 (48%), Gaps = 10/430 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THLSFADD+++ S G + SI I + F + +GL ISL KST+ G+S +AD
Sbjct: 699  THLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVAD 758

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
            V+YLG PL RL DCLPL+E + RI +W +R LS+AGRL LI SVL
Sbjct: 759  RFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVL 818

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
            S +W F LP+ + ++ K+C+ FLWSG + K S ++ KDE GL +
Sbjct: 819  WSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRS 878

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMK-APKDCSWVWRGILEHRK 1563
            L N LV+K+ S NSLW WV H ++N FW +K SW+W+ +L++R+
Sbjct: 879  LKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYRE 938

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGRW-- 1395
            A ++ + NG T W+D W LL R ++ LG V E +N R
Sbjct: 939  VAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRR 998

Query: 1394 --NDIVLSLPECDLKTKILRTDIYDLMNKDQVVW---SLTHSGKFSARSAYIAXXXXXXX 1230
            ND+ + + K+ RT+ +D+V+W S FS R +
Sbjct: 999  HRNDVYNVIEDALKKSWDTRTE-----TEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSAR 1053

Query: 1229 XXXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHLF 1050
            W PK+SFC+W G L T DR++N + C C T E HLF
Sbjct: 1054 VPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTL-ETRDHLF 1112

Query: 1049 FECSYTKRVW 1020
            F CS+T +W
Sbjct: 1113 FTCSFTSVIW 1122

>ref|XP_002331075.1| predicted protein [Populus trichocarpa]
 gi|222873039|gb|EEF10170.1| predicted protein [Populus trichocarpa]
          Length = 517

 Score = 229 bits (583), Expect = 3e-57
 Identities = 133/402 (33%), Positives = 204/402 (50%), Gaps = 4/402 (0%)
 Frame = -2

Query: 2213 IKTILATFTEATGLGISLNKSTILTGGMSIVESQALADXXXXXXXXXXVKYLGFPLFASR 2034
            I+T+L F + +GL + NKS I G+ E + + +KYLG PL +SR
Sbjct: 8    IRTVLTKFQDLSGLYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSR 67

Query: 2033 LGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVLSSFHTYWSRTFVLPKTVLDKVS 1854
            L C L++ ITS++ +W R LS+AGR+QLI+SVL S YW+ F+LP V+ V
Sbjct: 68   LKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVE 127

Query: 1853 KICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMIDLHTWNVAAYCGLVFKLAS-REN 1677
            +I FLWSG + K + + + K E GL + + WN A ++ L + +
Sbjct: 128  QIMKSFLWSGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDG 187

Query: 1676 SLWGNWVWTHSIKNKHFWTMKAPKDCSWVWRGILEHRKTAMKFTRNVIANGDDTFIWHDL 1497
            S+W W+ ++ ++ ++FWT+K P++CSW W IL+ R A + +I +G T +W D
Sbjct: 188  SIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDN 247

Query: 1496 WCTETPLLWDNNARQMLQLG--EEAKVSELISNGRW-NDIVLSLPECDLKTKILRTDIYD 1326
            W +PL R + G + AKV+ LI N W ++ + I
Sbjct: 248  WHPHSPLADSYGERFIYDSGMAKNAKVNVLIQNSEWKTPTTQAIGWHPIIEAIPSNSNPK 307

Query: 1325 LMNKDQVVWSLTHSGKFSARSAYIAXXXXXXXXXXXXXXWGKLVIPKHSFCTWQLFSGSL 1146
            + KD++VW + + +FS + A+ W K +P+HSF W L
Sbjct: 308  MGQKDELVWLDSPNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKL 367

Query: 1145 STQDRLVNKGIINHSKCSLCGTTARENSKHLFFECSYTKRVW 1020
            +TQD+L GI ++CSLC E+ HLFFECSYTK +W
Sbjct: 368  TTQDKLHRFGIHGPNRCSLC-LRNNEDHNHLFFECSYTKAIW 408

>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1352

 Score = 224 bits (571), Expect = 8e-56
 Identities = 151/459 (32%), Positives = 214/459 (46%), Gaps = 7/459 (1%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THL FADD+++FS G SI+ I F + L ISL KSTI G+S ++
Sbjct: 846  THLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQ 905

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
            VKYLG PL R+ D LPL+E I +RI++W NR LS AGRLQLI SVL
Sbjct: 906  QFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVL 965

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
            SS +W F LPK L ++ K+ + FLWSGP L K + + K+E GL +
Sbjct: 966  SSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKP 1025

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L N + L++++ S +SLW WV H I+ + FW++K SW+WR IL+ R
Sbjct: 1026 LKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRD 1085

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGRWND 1389
            A F R + +G T WHD WC L +R + LG A V+E+++ R
Sbjct: 1086 KARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKR 1145

Query: 1388 IVLS-LPECDLKTKILRTDIYDLMNKDQVVWSL---THSGKFSARSAYIAXXXXXXXXXX 1221
            L + + ++ R D + D+ +W T FS+ +
Sbjct: 1146 HRADFLNQIKSQIELARQD--RSTDGDRSLWKQKEDTFKSSFSSSKTWQQIRSISLRCDW 1203

Query: 1220 XXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHLFFEC 1041
            W PK+SF TW F L+T D++ C CG E HLFF C
Sbjct: 1204 YRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEEL-ETRDHLFFSC 1262

Query: 1040 SYTKRVWSGVKAKIGMGFRVADSNNEWHCLTMSSIVSCR 924
            Y+ VW + + G + + W+ +T + S R
Sbjct: 1263 PYSSHVWFSLTKGLLNGRNILN----WNLITPHLLDSSR 1297

>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score = 218 bits (554), Expect = 7e-54
 Identities = 141/438 (32%), Positives = 205/438 (46%), Gaps = 11/438 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THL FADD++I + G V S+ I ++ F + +GL I++ K+T+ T G+S +
Sbjct: 104  THLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMIS 163

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
            V+YLG PL RL +D PL E I +RI W +R LS AGRL LI SVL
Sbjct: 164  RYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVL 223

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
            S +W F LP L +++ IC+ FLWSGP L + K S + + K E GL +
Sbjct: 224  WSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRS 283

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L NV + L++++ S ++SLW W + +K + FW++ SW+W+ +L++R+
Sbjct: 284  LTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYRE 343

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSELISNGR--- 1398
            TA F+R + NG T W D W L+ R + LG V+E SN R
Sbjct: 344  TAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRK 403

Query: 1397 -----WNDIVLSLPECDLKTKILRTDIYDLMNKDQVVWSLTHSGKFSARSAYIAXXXXXX 1233
            NDI +L + +LR D K V FS + +
Sbjct: 404  HRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDV-----FKTSFSTKDTWNQVRKKSN 458

Query: 1232 XXXXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLVNKGIINHSKCSLCGTTARENSKHL 1053
            W PK+ FCTW LST R+ + KC+ C T+ E HL
Sbjct: 459  EVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSI-ETRDHL 517

Query: 1052 FFECSYTKRVWSGVKAKI 999
            FF CSY +W+ + +
Sbjct: 518  FFSCSYASAIWTAIAKNV 535

>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score = 214 bits (546), Expect = 6e-53
 Identities = 140/434 (32%), Positives = 206/434 (47%), Gaps = 11/434 (2%)
 Frame = -2

Query: 2279 THLSFADDVLIFSKGDVDSIRAIKTILATFTEATGLGISLNKSTILTGGMSIVESQALAD 2100
            THLSFADD+++ S G SI I + F + +GL ISL KST+ G+S + Q +A
Sbjct: 346  THLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAA 405

Query: 2099 XXXXXXXXXXVKYLGFPLFASRLGVKDCLPLIESITSRISNWNNRVLSHAGRLQLIHSVL 1920
            V+YLG PL RL D PL+E I RI+ W R S AGR LI SVL
Sbjct: 406  KFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVL 465

Query: 1919 SSFHTYWSRTFVLPKTVLDKVSKICNRFLWSGPSLTKCLHKASHELLRYSKDEVGLNMID 1740
            S +W F LP+ + ++ K+C+ FLWSG ++ K S +++ K E GL + +
Sbjct: 466  WSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRN 525

Query: 1739 LHTWNVAAYCGLVFKLASRENSLWGNWVWTHSIKNKHFWTMKAPKDC-SWVWRGILEHRK 1563
            L N + LV+++ S NSLW WV + I+ K W++K SW+WR IL+ R
Sbjct: 526  LKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRD 585

Query: 1562 TAMKFTRNVIANGDDTFIWHDLWCTETPLLWDNNARQMLQLG--EEAKVSEL---ISNGR 1398
            A F+R + NG+ W+D W L+ + + LG EA V++ S R
Sbjct: 586  VAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRR 645

Query: 1397 WNDIVLSLPECDLKTKILRTDIYDLMNKDQVVW---SLTHSGKFSARSAYIAXXXXXXXX 1227
            +L+ +++ + I+ +D V+W + FS R +
Sbjct: 646  HRTSLLN----EIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTV 701

Query: 1226 XXXXXXWGKLVIPKHSFCTWQLFSGSLSTQDRLV--NKGIINHSKCSLCGTTARENSKHL 1053
            W + PK++ CTW L T DR++ N C LC T + +HL
Sbjct: 702  SWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLC-TNNSKTLEHL 760

Query: 1052 FFECSYTKRVWSGV 1011
            FF CSY VW+ +
Sbjct: 761  FFSCSYASTVWAAL 774