BLASTX nr result

ID: Coptis24_contig00010730 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00010730
         (1753 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containi...   366   1e-98
emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]   366   1e-98
ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|2...   331   4e-88
ref|XP_002880144.1| pentatricopeptide repeat-containing protein ...   322   3e-85
ref|NP_182015.1| Pentatricopeptide repeat-containing protein [Ar...   315   3e-83

>ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880
            [Vitis vinifera] gi|297734603|emb|CBI16654.3| unnamed
            protein product [Vitis vinifera]
          Length = 577

 Score =  366 bits (939), Expect = 1e-98
 Identities = 172/315 (54%), Positives = 227/315 (72%)
 Frame = -3

Query: 1751 VFSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVH 1572
            +FSW+ MISGY +  +  +A+KLF  M   + + P+EVT+VSVL       ALD G WVH
Sbjct: 247  LFSWNAMISGYRQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVH 306

Query: 1571 TYIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAH 1392
             ++ R  +  + N+G AL+DMYAKCG +  +  VF  +  +  ++WNALI   A+NG A 
Sbjct: 307  RFVRRKKLDRATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETASWNALINAFAINGRAK 366

Query: 1391 AALEVFTEMLRLETKPDRITFLGVLMACCHGGLVEEGRYHFKSMLKDYGIQPELKHYGCM 1212
             AL +F EM      P+ IT +GVL AC H GLVEEG+  FK+M +++G+ P+++HYGCM
Sbjct: 367  EALGLFMEMNHKGFMPNEITMIGVLSACNHSGLVEEGKRWFKAM-EEFGLTPKIEHYGCM 425

Query: 1211 VDILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWND 1032
            VD+LGRAG ++EAE+LM +MPYE N I+LSSFLFACG   DV RAE+ +++A  +E WND
Sbjct: 426  VDLLGRAGCLQEAEKLMESMPYEANGIILSSFLFACGYSKDVARAERVLKEAIKMEAWND 485

Query: 1031 GNYIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWE 852
            GNYIM+RNLYA EKRWKE +E+KGLMR+   KKE GCS IEV++ VWEFVAGDR HP+WE
Sbjct: 486  GNYIMLRNLYANEKRWKEADEVKGLMRRNGVKKEAGCSAIEVDSRVWEFVAGDRVHPKWE 545

Query: 851  GMHLVLRQLWLHMKG 807
             +H VL QLW+HMKG
Sbjct: 546  AIHSVLGQLWVHMKG 560



 Score = 58.9 bits (141), Expect = 4e-06
 Identities = 34/133 (25%), Positives = 67/133 (50%)
 Frame = -3

Query: 1748 FSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHT 1569
            F  + MI  Y    +  ++  L++++   +   P+  T   +   C    A+ +G+ +H+
Sbjct: 53   FLCNSMIKAYVGMRQYSESFALYRDLRRNTSFTPDSFTFSVLAKSCALNMAIWEGQEIHS 112

Query: 1568 YIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHA 1389
            ++   G  L      ALVDMYAK G ++ A ++F ++  R+  +W ALI G   +G    
Sbjct: 113  HVVAVGFCLDLYAATALVDMYAKFGKMDCARKLFDEMIDRSQVSWTALIGGYVRSGDMDN 172

Query: 1388 ALEVFTEMLRLET 1350
            A ++F +M+  ++
Sbjct: 173  AGKLFDQMIEKDS 185


>emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]
          Length = 751

 Score =  366 bits (939), Expect = 1e-98
 Identities = 172/315 (54%), Positives = 227/315 (72%)
 Frame = -3

Query: 1751 VFSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVH 1572
            +FSW+ MISGY +  +  +A+KLF  M   + + P+EVT+VSVL       ALD G WVH
Sbjct: 421  LFSWNAMISGYXQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVH 480

Query: 1571 TYIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAH 1392
             ++ R  +  + N+G AL+DMYAKCG +  +  VF  +  +  ++WNALI   A+NG A 
Sbjct: 481  RFVRRKKLDRATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETASWNALINAFAINGRAK 540

Query: 1391 AALEVFTEMLRLETKPDRITFLGVLMACCHGGLVEEGRYHFKSMLKDYGIQPELKHYGCM 1212
             AL +F EM      P+ IT +GVL AC H GLVEEG+  FK+M +++G+ P+++HYGCM
Sbjct: 541  EALGLFMEMNHKGFMPNEITMIGVLSACNHSGLVEEGKRWFKAM-EEFGLTPKIEHYGCM 599

Query: 1211 VDILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWND 1032
            VD+LGRAG ++EAE+LM +MPYE N I+LSSFLFACG   DV RAE+ +++A  +E WND
Sbjct: 600  VDLLGRAGCLQEAEKLMESMPYEANGIILSSFLFACGYSKDVARAERVLKEAIKMEAWND 659

Query: 1031 GNYIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWE 852
            GNYIM+RNLYA EKRWKE +E+KGLMR+   KKE GCS IEV++ VWEFVAGDR HP+WE
Sbjct: 660  GNYIMLRNLYANEKRWKEADEVKGLMRRNGVKKEAGCSAIEVDSRVWEFVAGDRVHPKWE 719

Query: 851  GMHLVLRQLWLHMKG 807
             +H VL QLW+HMKG
Sbjct: 720  AIHSVLGQLWVHMKG 734



 Score = 58.9 bits (141), Expect = 4e-06
 Identities = 34/133 (25%), Positives = 67/133 (50%)
 Frame = -3

Query: 1748 FSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHT 1569
            F  + MI  Y    +  ++  L++++   +   P+  T   +   C    A+ +G+ +H+
Sbjct: 227  FLCNSMIKAYVGMRQYSESFALYRDLRRNTSFTPDSFTFSVLAKSCALNMAIWEGQEIHS 286

Query: 1568 YIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHA 1389
            ++   G  L      ALVDMYAK G ++ A ++F ++  R+  +W ALI G   +G    
Sbjct: 287  HVVAVGFCLDLYAATALVDMYAKFGKMDCARKLFDEMIDRSQVSWTALIGGYVRSGDMDN 346

Query: 1388 ALEVFTEMLRLET 1350
            A ++F +M+  ++
Sbjct: 347  AGKLFDQMIEKDS 359


>ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|222835186|gb|EEE73621.1|
            predicted protein [Populus trichocarpa]
          Length = 581

 Score =  331 bits (848), Expect = 4e-88
 Identities = 158/315 (50%), Positives = 226/315 (71%)
 Frame = -3

Query: 1745 SWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHTY 1566
            SW+ MI GY +  +  +A+KLF+ +   +   PNEVT+VS+L       AL+ G+WVH +
Sbjct: 263  SWNAMIGGYCQNKQPHEALKLFRELQSSTVFEPNEVTVVSILPAIATLGALELGEWVHRF 322

Query: 1565 IERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHAA 1386
            ++R  +  + N+  +LVDMY KCG +  A +VFS++  +  +TWNALI G A+NG+A  A
Sbjct: 323  VQRKKLDAAVNVCTSLVDMYLKCGEISKARKVFSEIPKKETATWNALINGFAMNGLASEA 382

Query: 1385 LEVFTEMLRLETKPDRITFLGVLMACCHGGLVEEGRYHFKSMLKDYGIQPELKHYGCMVD 1206
            LE F+EM +   KP+ IT  GVL AC HGGLVEEG+  FK+M++  G+ P+++HYGC+VD
Sbjct: 383  LEAFSEMQQEGIKPNDITMTGVLSACSHGGLVEEGKGQFKAMIES-GLSPKIEHYGCLVD 441

Query: 1205 ILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWNDGN 1026
            +LGRAG ++EAE L+ +MP+E N I+LSSF FACG   DV RA++ + +A ++EP N+G 
Sbjct: 442  LLGRAGCLDEAENLIKSMPFEANGIILSSFSFACGFSNDVTRAQRVLNQAVNMEPGNNGI 501

Query: 1025 YIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWEGM 846
            Y+MMRNLYA E+RWK+V+EI GLMR+  AKKEVG S IEV++ V EF++G  AHPQ + +
Sbjct: 502  YVMMRNLYAMEERWKDVKEINGLMRRRGAKKEVGSSAIEVDSRVSEFISGGIAHPQLDVI 561

Query: 845  HLVLRQLWLHMKGAV 801
              V+ QLW+HM+ +V
Sbjct: 562  ESVIGQLWIHMRDSV 576


>ref|XP_002880144.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325983|gb|EFH56403.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 555

 Score =  322 bits (824), Expect = 3e-85
 Identities = 150/312 (48%), Positives = 222/312 (71%)
 Frame = -3

Query: 1745 SWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHTY 1566
            SW+ MI GY +  + ++A++LFQ M   + + P++VT++SVL       AL  G+W H +
Sbjct: 240  SWNTMIGGYCQNKQPQEAIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHCF 299

Query: 1565 IERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHAA 1386
            ++R  +     +  A++DMY+KCG +E A  +F ++  + V++WNA+I G A+NG AHAA
Sbjct: 300  VQRKNLDKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNAHAA 359

Query: 1385 LEVFTEMLRLETKPDRITFLGVLMACCHGGLVEEGRYHFKSMLKDYGIQPELKHYGCMVD 1206
            L++F  M + E KPD IT L V+ AC HGGLVEEGR  F+ M++ +G+  +++HYGCMVD
Sbjct: 360  LDLFLTMAK-EEKPDEITMLAVISACNHGGLVEEGRKWFQ-MMRKFGLNAKIEHYGCMVD 417

Query: 1205 ILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWNDGN 1026
            +LGRAG +++AE L+ NMP++ N I+LSSFL ACG   D+ RAE+ ++KA ++EP NDGN
Sbjct: 418  LLGRAGNLKQAEHLITNMPFKPNGIILSSFLSACGQYKDIERAERILKKAVELEPQNDGN 477

Query: 1025 YIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWEGM 846
            Y+++RNLYA +KRW +   +K +MRK  AKKEVGCS+IE+N  V EF++GD  HP  + +
Sbjct: 478  YVLLRNLYAADKRWDDFGMVKNMMRKNEAKKEVGCSLIEINYIVSEFISGDTTHPHRQSI 537

Query: 845  HLVLRQLWLHMK 810
            HLVL +L +HMK
Sbjct: 538  HLVLEKLLVHMK 549



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 35/128 (27%), Positives = 60/128 (46%)
 Frame = -3

Query: 1748 FSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHT 1569
            F  + MI  Y       D+   ++++  E+ + P+  T  ++   C     + QG  +H+
Sbjct: 43   FLCNSMIKAYLETRHYNDSFAFYRDLRKETCLAPDNFTFTTMTKSCTLSMCVYQGLQLHS 102

Query: 1568 YIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHA 1389
             I R+G      +   +VDMYAK G +  A  VF ++  R+  +W ALI G    G    
Sbjct: 103  QIWRSGFCADMYVSTGVVDMYAKFGKMGCARNVFDEMPQRSEVSWTALICGYVRFGELDL 162

Query: 1388 ALEVFTEM 1365
            A ++F +M
Sbjct: 163  ASKLFDQM 170


>ref|NP_182015.1| Pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546766|sp|Q1PEU4.2|PP201_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g44880 gi|2344896|gb|AAC31836.1| hypothetical protein
            [Arabidopsis thaliana] gi|330255385|gb|AEC10479.1|
            Pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 555

 Score =  315 bits (806), Expect = 3e-83
 Identities = 149/311 (47%), Positives = 218/311 (70%)
 Frame = -3

Query: 1745 SWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHTY 1566
            SW+ MI GY +  + ++ ++LFQ M   + + P++VT++SVL       AL  G+W H +
Sbjct: 240  SWNTMIGGYCQNKQPQEGIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHCF 299

Query: 1565 IERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHAA 1386
            ++R  +     +  A++DMY+KCG +E A  +F ++  + V++WNA+I G A+NG A AA
Sbjct: 300  VQRKKLDKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNARAA 359

Query: 1385 LEVFTEMLRLETKPDRITFLGVLMACCHGGLVEEGRYHFKSMLKDYGIQPELKHYGCMVD 1206
            L++F  M+ +E KPD IT L V+ AC HGGLVEEGR  F  M ++ G+  +++HYGCMVD
Sbjct: 360  LDLFVTMM-IEEKPDEITMLAVITACNHGGLVEEGRKWFHVM-REMGLNAKIEHYGCMVD 417

Query: 1205 ILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWNDGN 1026
            +LGRAG ++EAE+L+ NMP+E N I+LSSFL ACG   D+ RAE+ ++KA ++EP NDGN
Sbjct: 418  LLGRAGSLKEAEDLITNMPFEPNGIILSSFLSACGQYKDIERAERILKKAVELEPQNDGN 477

Query: 1025 YIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWEGM 846
            Y+++RNLYA +KRW +   +K +MRK +AKKEVGCS+IE+N  V EF++GD  HP    +
Sbjct: 478  YVLLRNLYAADKRWDDFGMVKNVMRKNQAKKEVGCSLIEINYIVSEFISGDTTHPHRRSI 537

Query: 845  HLVLRQLWLHM 813
            HLVL  L +HM
Sbjct: 538  HLVLGDLLMHM 548



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 36/128 (28%), Positives = 61/128 (47%)
 Frame = -3

Query: 1748 FSWSCMISGYARRNRSEDAMKLFQNMVDESRVRPNEVTLVSVLSVCGHFTALDQGKWVHT 1569
            F  + MI  Y    +  D+  L++++  E+   P+  T  ++   C     + QG  +H+
Sbjct: 43   FLSNSMIKAYLETRQYPDSFALYRDLRKETCFAPDNFTFTTLTKSCSLSMCVYQGLQLHS 102

Query: 1568 YIERNGMRLSDNLGAALVDMYAKCGCVETAYEVFSKLDHRNVSTWNALITGLAVNGIAHA 1389
             I R G      +   +VDMYAK G +  A   F ++ HR+  +W ALI+G    G    
Sbjct: 103  QIWRFGFCADMYVSTGVVDMYAKFGKMGCARNAFDEMPHRSEVSWTALISGYIRCGELDL 162

Query: 1388 ALEVFTEM 1365
            A ++F +M
Sbjct: 163  ASKLFDQM 170

