BLASTX nr result
ID: Coptis23_contig00036874
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis23_contig00036874 (375 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002278762.1| PREDICTED: pentatricopeptide repeat-containi... 185 4e-45 ref|XP_002868248.1| pentatricopeptide repeat-containing protein ... 182 3e-44 ref|XP_002315764.1| predicted protein [Populus trichocarpa] gi|2... 181 6e-44 ref|NP_193218.1| pentatricopeptide repeat-containing protein [Ar... 174 5e-42 gb|EEE60540.1| hypothetical protein OsJ_13880 [Oryza sativa Japo... 171 4e-41 >ref|XP_002278762.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Vitis vinifera] gi|297737070|emb|CBI26271.3| unnamed protein product [Vitis vinifera] Length = 727 Score = 185 bits (469), Expect = 4e-45 Identities = 82/124 (66%), Positives = 103/124 (83%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 NVI+WT +I+ FAMHGDAG+ALR F QM + IEPN +TFV VLYACSHAGLV++G +F Sbjct: 420 NVISWTCMISAFAMHGDAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIF 479 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 M N++NI PK HYGCMVDL GRAN LRE+L+LV++MP PNV++WGSL++ACRVHG+ Sbjct: 480 YSMINEHNITPKHVHYGCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGE 539 Query: 13 VELG 2 +ELG Sbjct: 540 IELG 543 Score = 68.9 bits (167), Expect = 4e-10 Identities = 37/122 (30%), Positives = 67/122 (54%), Gaps = 2/122 (1%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVD--KGLH 200 +++ W+++I+G+A AL LF++M + GI+P+ VT +SV+ AC+H G +D K +H Sbjct: 319 DLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHLGALDQAKWIH 378 Query: 199 MFALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVH 20 +F LP ++++ + L + + MP NV+ W ++SA +H Sbjct: 379 LFVDKNGFGGALPIN---NALIEMYAKCGSLERARRIFDKMP-RKNVISWTCMISAFAMH 434 Query: 19 GD 14 GD Sbjct: 435 GD 436 >ref|XP_002868248.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314084|gb|EFH44507.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 725 Score = 182 bits (461), Expect = 3e-44 Identities = 80/124 (64%), Positives = 103/124 (83%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 NV++W+S+I FAMHG+A +L LF QM + +EPN VTFV VLY CSH+GLV++G +F Sbjct: 412 NVVSWSSMINAFAMHGEASDSLSLFAQMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIF 471 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 A MT++YNI PK EHYGCMVDL GRAN LRE+L++++SMP PNVV+WGSL+SACRVHG+ Sbjct: 472 ASMTDEYNITPKIEHYGCMVDLFGRANLLREALEVIESMPMAPNVVIWGSLMSACRVHGE 531 Query: 13 VELG 2 +ELG Sbjct: 532 LELG 535 Score = 70.5 bits (171), Expect = 1e-10 Identities = 37/122 (30%), Positives = 71/122 (58%), Gaps = 2/122 (1%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKG--LH 200 +++ WT++I+ +A ALR+F++M GI+P+ VT +SV+ AC + G +DK +H Sbjct: 311 DLVCWTTMISAYAESDHPQEALRVFEEMCCSGIKPDVVTMLSVISACVNLGTLDKAKWVH 370 Query: 199 MFALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVH 20 + + ++LP ++++ + L + D+ + MP T NVV W S+++A +H Sbjct: 371 RYTHLNGLESVLPID---NALINMYAKCGGLDAARDVFEKMP-TRNVVSWSSMINAFAMH 426 Query: 19 GD 14 G+ Sbjct: 427 GE 428 >ref|XP_002315764.1| predicted protein [Populus trichocarpa] gi|222864804|gb|EEF01935.1| predicted protein [Populus trichocarpa] Length = 452 Score = 181 bits (459), Expect = 6e-44 Identities = 81/124 (65%), Positives = 102/124 (82%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 NVI+WTS+I FA+HGDA AL+ F QM + I+PN VTFV VLYACSHAGLV++G F Sbjct: 145 NVISWTSMINAFAIHGDASNALKFFYQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTF 204 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 A MTN++NI PK EHYGCMVDL GRAN LR++L+LV++MP PNVV+WGSL++AC++HG+ Sbjct: 205 ASMTNEHNITPKHEHYGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGE 264 Query: 13 VELG 2 ELG Sbjct: 265 NELG 268 Score = 66.2 bits (160), Expect = 3e-09 Identities = 36/122 (29%), Positives = 67/122 (54%), Gaps = 2/122 (1%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVD--KGLH 200 +++ W+++I+G+A AL LF +M GI+P+ VT +SV+ AC+ G++D K +H Sbjct: 44 DLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLDRAKWIH 103 Query: 199 MFALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVH 20 M+ LP ++D+ + L + + + M + NV+ W S+++A +H Sbjct: 104 MYVDKNGLGGALPVN---NALIDMYAKCGNLGAARGVFEKMQ-SRNVISWTSMINAFAIH 159 Query: 19 GD 14 GD Sbjct: 160 GD 161 >ref|NP_193218.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75274931|sp|O23337.1|PP311_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14820 gi|2244839|emb|CAB10261.1| hypothetical protein [Arabidopsis thaliana] gi|7268228|emb|CAB78524.1| hypothetical protein [Arabidopsis thaliana] gi|332658106|gb|AEE83506.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 722 Score = 174 bits (442), Expect = 5e-42 Identities = 76/124 (61%), Positives = 101/124 (81%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 NV++W+S+I +MHG+A AL LF +M + +EPN VTFV VLY CSH+GLV++G +F Sbjct: 409 NVVSWSSMINALSMHGEASDALSLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIF 468 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 A MT++YNI PK EHYGCMVDL GRAN LRE+L++++SMP NVV+WGSL+SACR+HG+ Sbjct: 469 ASMTDEYNITPKLEHYGCMVDLFGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGE 528 Query: 13 VELG 2 +ELG Sbjct: 529 LELG 532 Score = 59.7 bits (143), Expect = 2e-07 Identities = 31/120 (25%), Positives = 68/120 (56%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 +++ WT++I+ + ALR+F++M GI+P+ V+ SV+ AC++ G++DK + Sbjct: 308 DLVCWTTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVH 367 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 + + + + + ++++ + L + D+ + MP NVV W S+++A +HG+ Sbjct: 368 SCI-HVNGLESELSINNALINMYAKCGGLDATRDVFEKMP-RRNVVSWSSMINALSMHGE 425 >gb|EEE60540.1| hypothetical protein OsJ_13880 [Oryza sativa Japonica Group] Length = 594 Score = 171 bits (434), Expect = 4e-41 Identities = 77/124 (62%), Positives = 98/124 (79%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKGLHMF 194 NV+TWTSIIT AMHGD +AL LF+ M ++GI+PN VTF+ +LYAC HAGLV++G +F Sbjct: 367 NVVTWTSIITASAMHGDGRSALTLFENMKSEGIQPNGVTFLGLLYACCHAGLVEEGRLLF 426 Query: 193 ALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVHGD 14 +M +Y I P EHYGCMVDLLGRA L ++ DL++SM PNVV+WGSLL+ACR+HGD Sbjct: 427 KIMVQQYRIEPMHEHYGCMVDLLGRAKLLGQAADLIQSMHLRPNVVIWGSLLAACRMHGD 486 Query: 13 VELG 2 +ELG Sbjct: 487 LELG 490 Score = 74.7 bits (182), Expect = 7e-12 Identities = 39/122 (31%), Positives = 69/122 (56%), Gaps = 2/122 (1%) Frame = -3 Query: 373 NVITWTSIITGFAMHGDAGTALRLFDQMNADGIEPNSVTFVSVLYACSHAGLVDKG--LH 200 +V++W+++I G+A AL LF M G++P+ +T +SV+ AC++ G ++K +H Sbjct: 266 DVVSWSAMIAGYAESSKPMEALNLFHDMQRSGVKPDEITMLSVISACANVGALEKARCIH 325 Query: 199 MFALMTNKYNILPKQEHYGCMVDLLGRANFLRESLDLVKSMPFTPNVVVWGSLLSACRVH 20 F + ILP ++D+ + L +LD+ +MP NVV W S+++A +H Sbjct: 326 SFVENHSMCKILPIG---NALIDMFSKCGSLTLALDVFNAMP-QKNVVTWTSIITASAMH 381 Query: 19 GD 14 GD Sbjct: 382 GD 383