BLASTX nr result
ID: Coptis25_contig00022902
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00022902
         (1732 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002267329.1| PREDICTED: uncharacterized protein LOC100263...   357   4e-96
ref|XP_002325302.1| predicted protein [Populus trichocarpa] gi|2...   289   2e-75
ref|NP_001242104.1| uncharacterized protein LOC100809786 [Glycin...   266   1e-68
ref|XP_003552484.1| PREDICTED: uncharacterized protein LOC100810...   265   4e-68
ref|NP_197743.1| smr (Small MutS Related) domain-containing prot...   261   5e-67

>ref|XP_002267329.1| PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera]
          Length = 435

 Score = 357 bits (917), Expect = 4e-96
 Identities = 212/441 (48%), Positives = 272/441 (61%), Gaps = 13/441 (2%)
 Frame = +3

Query: 186  LSSAKGRSLGWAAFDLKQRGKQGREPESYTDPYPPIQSMSTDVPLPEXXXXXXXXXXXXX 365
            +SSA G+S GWAAFDLKQR KQG EPE +PYPPI S T + P
Sbjct: 1    MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLR-PCRNSASNGCSGRSF 59

Query: 366  XXXXQPSIDFP-VHWGTGNHSPTQTIGGHSSRNFGNEAEIGNHVNPVIA--KLKDLYCWA 536
            PS++FP + P Q GG+S + ++ N VIA KLK+LY WA
Sbjct: 60   SSLLVPSVNFPTLEENKDCKKPMQ--GGNSGNK--QQTKVAEVSNLVIAFNKLKELYSWA 115

Query: 537  DEGLIEDILGAVNNDFDQASTLLKSMVLSRSSEET------ENHGAQRADFEKTT-QAVK 695
            D LIEDI+ AV+ND D+ASTLL +MV + S EE E + +E QA
Sbjct: 116  DNSLIEDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSGNPYENCKLQADN 175

Query: 696  GARINDVTNIADLKSALDECVFETWNEMTVEDVRTDSKLYDSTAQ--LISRRVIAAPVEP 869
            G + + T +++L S + + + + + +T E + L+D A LI R+ + P+EP
Sbjct: 176  GVFLGNGTVLSELSSTIGDLLIDNNKGLTDECGSSGKNLFDDAADMTLILGRMKSIPIEP 235

Query: 870  EWEEDDAYLSHRKDAIRAIRLASQHSRAASNAFMRGDHFSAQKLSMQARKEWXXXXXXXX 1049
            EWEEDD YLSHRKDAIR +R ASQHSRAA+NAF+RGDH SA++ S++A+ EW
Sbjct: 236  EWEEDDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAERLNS 295

Query: 1050 XXXXXILRIRNSKNGVWKLDLHGLHASEAVHFLQGHLWNIEMQIPLNRSVSPNRSLNPTS 1229
            IL IRNS N +WKLDLHGLHA+EAV LQ HLW IE Q+P NRSVSPNR+
Sbjct: 296  KAANEILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKTKV- 354

Query: 1230 GDIHSPS-EATTCLGTNMADKEQGISWHRPRALQVVTGTGNHSRGQATLPMAVKSFLVEN 1406
            G + SPS E+ +C+ DK+ +S RP +LQV+TG GNHSRGQA LP AV+SFL E+
Sbjct: 355  GILRSPSLESFSCVDNEELDKQWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSFLNEH 414

Query: 1407 GYRFDEARPGVIDVRPKYRYK 1469
            GYRF+EARPGVI VRPK+R++
Sbjct: 415  GYRFEEARPGVIAVRPKFRHR 435

>ref|XP_002325302.1| predicted protein [Populus trichocarpa]
 gi|222862177|gb|EEE99683.1| predicted protein [Populus trichocarpa]
          Length = 429

 Score = 289 bits (739), Expect = 2e-75
 Identities = 187/440 (42%), Positives = 251/440 (57%), Gaps = 14/440 (3%)
 Frame = +3

Query: 192  SAKGRSLGWAAFDLKQRGKQGREPESYTDPYPPIQSMSTDVPLPEXXXXXXXXXXXXXXX 371
            S + +S GWAAFDLKQR K G DP+P I + L
Sbjct: 5    SRRVKSSGWAAFDLKQRQKDGEVDGK--DPFPAIGDLPVTGGLRRNDDVGGLSSKSFSSV 62

Query: 372  XXQP-SIDFPVHWGTGNHSPTQTI----GGHSSRNFGNEAEIGNHVNPVIAKLKDLYCWA 536
            P S FP ++ T + G+ + E + G V + +LK+++ WA
Sbjct: 63   LQPPASAGFPALKTQNVNNLTAKVADFSAGYRVSDKVIEEKNGGSVLLDLQRLKEIHGWA 122

Query: 537  DEGLIEDILGAVNNDFDQASTLLKSMVLSRSSEETENHGAQ-RADFEKTTQAVKGARIND 713
            D LIED++ +V+ND ++A LL MV + +E E GA+ + F K+
Sbjct: 123  DFSLIEDVMVSVDNDAEKACVLLNGMVSNADFDEDE--GAKFNSGFNKSL---------- 170

Query: 714  VTNIADLKSALDECVFET--WNEMTVEDVRTD---SKLYDSTA--QLISRRVIAAPVEPE 872
            +IADL S L++ + + N+ ++R D S D+ A +LI + + PVEPE
Sbjct: 171  ADDIADLSSTLEDALKDNDHNNDNNSIELREDVGVSSSVDAAANMKLILGHLKSIPVEPE 230

Query: 873  WEEDDAYLSHRKDAIRAIRLASQHSRAASNAFMRGDHFSAQKLSMQARKEWXXXXXXXXX 1052
            WEEDD YLSHRK+A+R +RLASQHSRAA+NAF+R DHFSAQ+ S++AR++W
Sbjct: 231  WEEDDVYLSHRKNALRMMRLASQHSRAATNAFLRRDHFSAQQHSLRAREKWSAAEQLNAK 290

Query: 1053 XXXXILRIRNSKNGVWKLDLHGLHASEAVHFLQGHLWNIEMQIPLNRSVSPNRSLNPTSG 1232
            IL IRNS N WKLDLHGLHA+EA LQ HL IE +P NRS+SP R + +G
Sbjct: 291  AAKEILSIRNSDNDPWKLDLHGLHAAEAGQALQEHLLKIETLVPNNRSISPCR-IKTKNG 349

Query: 1233 DIH-SPSEATTCLGTNMADKEQGISWHRPRALQVVTGTGNHSRGQATLPMAVKSFLVENG 1409
            +H SP +A + + DK+Q RP +LQV+TG GNHSRGQA LP AVKSFL +NG
Sbjct: 350  ILHSSPFDAFSTVDAENLDKQQATFRQRPTSLQVITGVGNHSRGQAALPTAVKSFLNDNG 409

Query: 1410 YRFDEARPGVIDVRPKYRYK 1469
            YRFDE RPGVI VRPK+R++
Sbjct: 410  YRFDETRPGVITVRPKFRHR 429

>ref|NP_001242104.1| uncharacterized protein LOC100809786 [Glycine max]
 gi|255639453|gb|ACU20021.1| unknown [Glycine max]
          Length = 427

 Score = 266 bits (681), Expect = 1e-68
 Identities = 178/454 (39%), Positives = 247/454 (54%), Gaps = 17/454 (3%)
 Frame = +3

Query: 153  MTISAIPSRSKLSSAKGRSLGWAAFDLKQRGKQGREPESYTDPYPPIQSMSTDVPLPEXX 332
            MT SA S K+S A+G+S GW AFDLKQR E E DP+P I + V
Sbjct: 1    MTASARSSLKKMSWARGQSSGWTAFDLKQRKNNNFESEDDEDPFPAIGTTDPMV------ 54

Query: 333  XXXXXXXXXXXXXXXQPSIDFPVHWGTGNHSPTQTIGGHSSRNFGNEAEIGNHVNPVIAK 512
            P+ +FP + G +S +G S + A VN I K
Sbjct: 55   -GKNHVPAKPFSSVLLPTRNFPP-FKEGGNSKKAMVGSDSDGKYCG-ATAQEDVNLAIKK 111

Query: 513  LKDLYCWADEGLIEDILGAVNNDFDQASTLLKSMVLSRSSEETENHGAQRADF------- 671
            L++ + WA+ LI+DI AVNN+ D+A+ LL++M + + EE++ R+
Sbjct: 112  LREQHLWAEHSLIDDIFSAVNNNIDKATALLETMDPAANFEESKVSSNPRSTTSDDTPCK 171

Query: 672  EKTTQAVKGARIND--------VTNIADLKSALDECVFETWNEMT-VEDVRTDSKLYDST 824
            +KT ++ ++ D V N+ D L++ + +++ V+ +R KL +S
Sbjct: 172  DKTDDSLTSEKVEDDIPFDSNLVDNLQDNDKDLEDRNAPSGQKLSDVDYLRCKMKLLNSI 231

Query: 825  AQLISRRVIAAPVEPEWEEDDAYLSHRKDAIRAIRLASQHSRAASNAFMRGDHFSAQKLS 1004
            PVEPEWE+DD Y+S+RKDA+R +R AS+HSRAAS+AF+RGDHFSAQ S
Sbjct: 232  -----------PVEPEWEDDDIYISNRKDALRTMRSASRHSRAASSAFLRGDHFSAQHHS 280

Query: 1005 MQARKEWXXXXXXXXXXXXXILRIRNSKNGVWKLDLHGLHASEAVHFLQGHLWNIEMQIP 1184
            M+AR E IL +RN++N +WKLDLHGLHA+EA+ LQ HL+ IE Q
Sbjct: 281  MKARAERHTAEELNSDAAKKILSVRNNENDIWKLDLHGLHATEAIQALQEHLYRIESQ-G 339

Query: 1185 LNRSVSPNRSLNPTSGDIHSPSEATTCLGT-NMADKEQGISWHRPRALQVVTGTGNHSRG 1361
            ++S + TS + + LG+ N D+E + RP AL V+TG GNHSRG
Sbjct: 340  FSKS-------SATSNGVKENGLGHSTLGSLNFMDREAPLRL-RPLALHVITGVGNHSRG 391

Query: 1362 QATLPMAVKSFLVENGYRFDEARPGVIDVRPKYR 1463
            QA LP AV+SFL EN YRF+E RPGVI V PK+R
Sbjct: 392  QAALPTAVRSFLNENRYRFEEMRPGVITVWPKFR 425

>ref|XP_003552484.1| PREDICTED: uncharacterized protein LOC100810197 [Glycine max]
          Length = 432

 Score = 265 bits (676), Expect = 4e-68
 Identities = 182/458 (39%), Positives = 252/458 (55%), Gaps = 21/458 (4%)
 Frame = +3

Query: 153  MTISAIPSRSKLSSAKGRSLGWAAFDLKQRGKQGREPESYTDPYPPIQSMSTDVPLPEXX 332
            M SA S K+S AKG+S GW AFDLKQR + E E DP+P I TD + +
Sbjct: 1    MAASAHSSLKKMSWAKGQSSGWTAFDLKQRKNKDFESEVDDDPFPAIGP--TDPIIKKNH 58

Query: 333  XXXXXXXXXXXXXXXQPSIDFPVHWGTGNHSPTQTIGGHSSRNFGNEAEIGNHVNPVIAK 512
            P+ +FP GN S +G S + A VN I K
Sbjct: 59   VPAKPFSSVLL-----PTKNFPPLNEDGN-SKKAMLGSDSDGKYCG-ATTQEDVNLAIKK 111

Query: 513  LKDLYCWADEGLIEDILGAVNNDFDQASTLLKSMVLSRSSEETE---NHGAQRAD----F 671
            L++ + WA+ LI+DI AVNN+ D+A++LL++M + + EE++ N + +D
Sbjct: 112  LREQHLWAEHSLIDDIFTAVNNNIDKATSLLETMAPAVNFEESKVSINPRSTTSDDTPCM 171

Query: 672  EKTTQAVKGARIND--------VTNIADLKSALDECVFETWNEMT-VEDVRTDSKLYDST 824
            +KT ++ ++ D V N+ D L++ + +++ V+ +R KL +S
Sbjct: 172  DKTDDSLTSEKVEDDIPFDYNLVDNLQDNDKDLEDRNAPSGQKLSGVDYLRCKMKLLNSV 231

Query: 825  AQLISRRVIAAPVEPEWEEDDAYLSHRKDAIRAIRLASQHSRAASNAFMRGDHFSAQKLS 1004
            PVEPEWE+DD Y+S+RKDA+R +RLAS+HS+AAS+AF+RGDHFSAQ S
Sbjct: 232  -----------PVEPEWEDDDIYISNRKDALRTMRLASRHSKAASSAFLRGDHFSAQHHS 280

Query: 1005 MQARKEWXXXXXXXXXXXXXILRIRNSKNGVWKLDLHGLHASEAVHFLQGHLWNIEMQIP 1184
            M+AR EW IL IRN++N +W+LDLHGLHA+EA+ LQ HL+ IE Q
Sbjct: 281  MKARAEWHTAEELNSDAAKKILSIRNNENDIWRLDLHGLHATEAIQALQEHLYRIECQ-G 339

Query: 1185 LNRSVSPNRSLNPTSGDIHSPSEATTCLGT-NMADKE----QGISWHRPRALQVVTGTGN 1349
            ++S + TS + + LG+ N D+E Q RP AL V+TG GN
Sbjct: 340  FSKS-------SATSNGVKENGLGHSTLGSFNFMDREKLDTQAPLRLRPLALHVITGIGN 392

Query: 1350 HSRGQATLPMAVKSFLVENGYRFDEARPGVIDVRPKYR 1463
            HSRG A LP AV+SFL EN YRF+E RPGVI V PK+R
Sbjct: 393  HSRGLAALPAAVRSFLNENRYRFEEMRPGVITVWPKFR 430

>ref|NP_197743.1| smr (Small MutS Related) domain-containing protein [Arabidopsis thaliana]
 gi|8809708|dbj|BAA97249.1| unnamed protein product [Arabidopsis thaliana]
 gi|22531192|gb|AAM97100.1| unknown protein [Arabidopsis thaliana]
 gi|23198016|gb|AAN15535.1| unknown protein [Arabidopsis thaliana]
 gi|332005795|gb|AED93178.1| smr (Small MutS Related) domain-containing protein [Arabidopsis thaliana]
          Length = 435

 Score = 261 bits (666), Expect = 5e-67
 Identities = 172/443 (38%), Positives = 233/443 (52%), Gaps = 16/443 (3%)
 Frame = +3

Query: 186  LSSAKGRSLGWAAFDLKQRGKQGREPESYTDPYPPIQ-SMSTDVPLPEXXXXXXXXXXXX 362
            +S KG+S GW AFDLKQR KQG E E DP+PP+ S++ +
Sbjct: 1    MSWMKGKSSGWTAFDLKQRQKQGLESEVEGDPFPPVSTSVNASFGVRGRLRRNHEPSEKS 60

Query: 363  XXXXXQPSIDFPVHWGTGNHSPTQTIGGHSSRNFGNEAEIGNHVNPVIAKLKDLYCWADE 542
            P FP Q GG R + N + KLK++ WAD+
Sbjct: 61   FSSVLLPPSRFPA-LTENKDCGNQERGGCCRRKPDTLSLPVNSHDLAFTKLKEMNSWADD 119

Query: 543  GLIEDILGAVNNDFDQASTLLKSMVLSRSSEE----------TENHGAQRADFEKT-TQA 689
            LI D+L + +DF+ A LK MV S +E ++N ++ FEKT T +
Sbjct: 120  NLIRDVLLSTEDDFEMALAFLKGMVSSGKEDEEPTSKIEGYSSDNRRSEYRTFEKTVTSS 179

Query: 690  VKGARINDVTNIA--DLKSALDECVFETWNEMTVEDVRTDSKLYDSTAQL--ISRRVIAA 857
            VK A + + DL+++ D F + + + K D ++L I +R+ +
Sbjct: 180  VKMAARSTFEDAGKYDLENS-DGSSF-------LVNASDNEKFPDDISELDSIIQRLQSI 231

Query: 858  PVEPEWEEDDAYLSHRKDAIRAIRLASQHSRAASNAFMRGDHFSAQKLSMQARKEWXXXX 1037
            P+EPEWEEDD YLSHRKDA++ +R AS HSRAA NAF R DH SA++ S +AR++W
Sbjct: 232  PIEPEWEEDDLYLSHRKDALKVMRSASNHSRAAQNAFQRYDHASAKQHSDKAREDWLAAE 291

Query: 1038 XXXXXXXXXILRIRNSKNGVWKLDLHGLHASEAVHFLQGHLWNIEMQIPLNRSVSPNRSL 1217
            I+ I N N +WKLDLHGLHA+EAV LQ L IE +NRSVSPNR
Sbjct: 292  KLNAEAAKKIIGITNKDNDIWKLDLHGLHATEAVQALQERLQMIEGHFTVNRSVSPNRGR 351

Query: 1218 NPTSGDIHSPSEATTCLGTNMADKEQGISWHRPRALQVVTGTGNHSRGQATLPMAVKSFL 1397
            + + + E L ++ S +LQV+TG G HSRGQA+LP+AVK+F
Sbjct: 352  SKNAALRSASQEPFGRLDEEGMHCQRTSSRELRNSLQVITGIGKHSRGQASLPLAVKTFF 411

Query: 1398 VENGYRFDEARPGVIDVRPKYRY 1466
            +N YRFDE RPGVI VRPK+R+
Sbjct: 412  EDNRYRFDETRPGVITVRPKFRH 434