BLASTX nr result

ID: Coptis21_contig00009145 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00009145
         (2008 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   680   0.0  
ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2...   649   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   642   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   625   e-176
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   620   e-175

>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  680 bits (1755), Expect = 0.0
 Identities = 350/515 (67%), Positives = 406/515 (78%)
 Frame = +3

Query: 60   LKSPRVFNFLM*TQTFKTMATRSFVPRYNHHLITCKFNRSSFNTIMKYYSCPRVEIKTSV 239
            L SP    F + + +F     R  VP+++   +    +R++  TI  + + PR  +    
Sbjct: 11   LMSPTELGFTL-SSSFSIQRPRLIVPKFSRSFLGEYCSRAT--TICNHQN-PRFVVPKR- 65

Query: 240  LKIPSLLFVKKKNEFRSFGATELDRFLTSDEKDEMGEAFFEAIEELERMVREPADVLEEM 419
                      K  EFR F + ELD+FLTSD++DEM E FFEAIEELERM REP+DVLEEM
Sbjct: 66   ---------DKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEM 116

Query: 420  NNKLSSRELQLVLVYFSQEGRDSWCALEVFEWLQKENKVDKETMELMVSIMCGWVRKLIE 599
            N++LS+RELQLVLVYFSQEGRDSWCALEVFEWL+KEN+VDKETMELMVSIMC WV+KLIE
Sbjct: 117  NDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIE 176

Query: 600  GEHSXXXXXXXXXXXXXXXXKPSFSMIEKVISLYWEMGKKESGVLFVKDVLSRGIAYTVD 779
            GEH                 KP FSMIEKVISLYWEM +KE  VLFVK+VL R IAY+ D
Sbjct: 177  GEHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSED 236

Query: 780  DEENNKGGPTGYLAWKMMVDGNYLGAVKLVIDFRESGLKPEVYSYLIAMTAIVKELNEFS 959
            D + +KGGPTGYLAWKMM +GNY GAVKLVI  RESGLKPEVYSYLIAMTA+VKELNEF+
Sbjct: 237  DGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFA 296

Query: 960  KAFRKLKGFVKAGLIPELDAENVWLIENYQSDLLSDGVRLSQWVIEEGSSLNSAVVHERL 1139
            KA RKLKGF K+GLI ELDAENV LIE YQSDLL+DGVRLS WVI+EG S    VV+ERL
Sbjct: 297  KALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERL 356

Query: 1140 LAMYICAGEGLKAEQQLWEMKLIGKEPERELYDIVLAICASQNEANSVSRLLTGLEVTSS 1319
            LAMYICAG GL+AE+QLWEMKL+GKE +RELYDIVLAICAS+ EA+++SRLLTG+EVTSS
Sbjct: 357  LAMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSS 416

Query: 1320 IRRKKTLSWLLRGYVKGGHFQDASKTIIKMLDIGLHPEYLDRAAVLQGLRKAIQDTGSVE 1499
            IRRKKTLSWLLRGY+KG HF DAS+TIIKMLD+GL PEYLDRAAVLQGLR  IQ TG+VE
Sbjct: 417  IRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVE 476

Query: 1500 PYLSLCKYLSDADLIGPCLVYFYVDRYNLWVIKMV 1604
             YL LCK+LSDA+LIGPCLVY Y+ +Y LW++K +
Sbjct: 477  TYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511


>ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1|
            predicted protein [Populus trichocarpa]
          Length = 500

 Score =  649 bits (1675), Expect = 0.0
 Identities = 319/456 (69%), Positives = 378/456 (82%), Gaps = 2/456 (0%)
 Frame = +3

Query: 243  KIPSLLFVK--KKNEFRSFGATELDRFLTSDEKDEMGEAFFEAIEELERMVREPADVLEE 416
            K P+ +  K  K  EFR F + ELD+++TSD+++EMGE FFEAIEELERM REP+D+LEE
Sbjct: 45   KRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEE 104

Query: 417  MNNKLSSRELQLVLVYFSQEGRDSWCALEVFEWLQKENKVDKETMELMVSIMCGWVRKLI 596
            MN++LS+RELQLVLVYFSQEGRDSWCALEVFEWL+KEN+VDKETMELMVSIMC WV+KLI
Sbjct: 105  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLI 164

Query: 597  EGEHSXXXXXXXXXXXXXXXXKPSFSMIEKVISLYWEMGKKESGVLFVKDVLSRGIAYTV 776
            EGE                  KPSFSMIEKVISLYW+MGKKE  V FVK+VL RGIAY+ 
Sbjct: 165  EGEQDVGDVVDLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSG 224

Query: 777  DDEENNKGGPTGYLAWKMMVDGNYLGAVKLVIDFRESGLKPEVYSYLIAMTAIVKELNEF 956
            DD E  KGGPTGYL WKMMVDGNY  AVKLVI  RESGLKPE+Y+YLIAMTA+VKELNEF
Sbjct: 225  DDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEF 284

Query: 957  SKAFRKLKGFVKAGLIPELDAENVWLIENYQSDLLSDGVRLSQWVIEEGSSLNSAVVHER 1136
            SKA RKLKG+ ++G++ ELDAENV L+E YQSDLL+DGV LS WVI+EGS     VVHER
Sbjct: 285  SKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHER 344

Query: 1137 LLAMYICAGEGLKAEQQLWEMKLIGKEPERELYDIVLAICASQNEANSVSRLLTGLEVTS 1316
            LLAMYICAG GL AE+QLWEMKL+GKE + +LYDIVLAICASQ EA++V+RLLT +EV S
Sbjct: 345  LLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVAS 404

Query: 1317 SIRRKKTLSWLLRGYVKGGHFQDASKTIIKMLDIGLHPEYLDRAAVLQGLRKAIQDTGSV 1496
            S+R+KK+LSWLLRGY+KGGH+ +A++T+IKMLD+GL P+YLDR AV+QGLRK IQ  G+V
Sbjct: 405  SMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNV 464

Query: 1497 EPYLSLCKYLSDADLIGPCLVYFYVDRYNLWVIKMV 1604
            E YL LCK LSD +LIGP LVY Y+ +Y LW++K++
Sbjct: 465  ESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMKLL 500


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  642 bits (1657), Expect = 0.0
 Identities = 317/445 (71%), Positives = 369/445 (82%)
 Frame = +3

Query: 270  KKNEFRSFGATELDRFLTSDEKDEMGEAFFEAIEELERMVREPADVLEEMNNKLSSRELQ 449
            +  EFR   + ELD+++ SD+++EM E FFEAIEELERM REP+DVLEEMN+KLS+RELQ
Sbjct: 55   RNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTREPSDVLEEMNDKLSARELQ 114

Query: 450  LVLVYFSQEGRDSWCALEVFEWLQKENKVDKETMELMVSIMCGWVRKLIEGEHSXXXXXX 629
            LVLVYFSQEGRDSWCALEVFEWL+KEN+VDKETMELMVSIMC W++KLIEGEH       
Sbjct: 115  LVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWIKKLIEGEHEIGDVVD 174

Query: 630  XXXXXXXXXXKPSFSMIEKVISLYWEMGKKESGVLFVKDVLSRGIAYTVDDEENNKGGPT 809
                      KPSFSMIEKVISLYWE+G+KE  V FVK+VL R +AY  DD E  KGGPT
Sbjct: 175  LLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQKGGPT 234

Query: 810  GYLAWKMMVDGNYLGAVKLVIDFRESGLKPEVYSYLIAMTAIVKELNEFSKAFRKLKGFV 989
            GYLAWKMMVDGNY  AVKLVI FRESGLKPEVYSYLIAMTA+VKELNEF+KA RKLKGF 
Sbjct: 235  GYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFA 294

Query: 990  KAGLIPELDAENVWLIENYQSDLLSDGVRLSQWVIEEGSSLNSAVVHERLLAMYICAGEG 1169
            K+GLI ELDAEN  LIE YQSDL++DGV LS WVI+EGS     VVHERLLAMYICAG G
Sbjct: 295  KSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYICAGRG 354

Query: 1170 LKAEQQLWEMKLIGKEPERELYDIVLAICASQNEANSVSRLLTGLEVTSSIRRKKTLSWL 1349
            L AE+QLWEMKL+GK  + +LYDIVLAICASQ EA++VSRLLT +EVTSS+++KKTLSWL
Sbjct: 355  LDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKKTLSWL 414

Query: 1350 LRGYVKGGHFQDASKTIIKMLDIGLHPEYLDRAAVLQGLRKAIQDTGSVEPYLSLCKYLS 1529
            LRGY+KGG + +A++ ++KMLD+GL P+YLDR AVLQGLRK IQ  G+VE YL+LCK LS
Sbjct: 415  LRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNLCKRLS 474

Query: 1530 DADLIGPCLVYFYVDRYNLWVIKMV 1604
            D +LIGP LVY Y+ +Y LW++KM+
Sbjct: 475  DENLIGPSLVYLYIKKYKLWIMKML 499


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  625 bits (1612), Expect = e-176
 Identities = 306/472 (64%), Positives = 383/472 (81%), Gaps = 2/472 (0%)
 Frame = +3

Query: 195  MKYYSCPRVEIKTSVLKIPSLLFVKKKN--EFRSFGATELDRFLTSDEKDEMGEAFFEAI 368
            +K+Y    +  ++   K PS +  K  +   FR+  + E+D+++TS+  DEM + FFEAI
Sbjct: 41   LKFYG--GLSARSCKFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSN--DEMSDGFFEAI 96

Query: 369  EELERMVREPADVLEEMNNKLSSRELQLVLVYFSQEGRDSWCALEVFEWLQKENKVDKET 548
            EELERM REP+DVLEEMN++LS+RELQLVLVYFSQ+GRDSWCALEVF+WL+KEN+VDKET
Sbjct: 97   EELERMTREPSDVLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKET 156

Query: 549  MELMVSIMCGWVRKLIEGEHSXXXXXXXXXXXXXXXXKPSFSMIEKVISLYWEMGKKESG 728
            MELMV+IMCGWV+KLI+ +H                 +P FSMIEKVISLYWEMG+KE  
Sbjct: 157  MELMVAIMCGWVKKLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGA 216

Query: 729  VLFVKDVLSRGIAYTVDDEENNKGGPTGYLAWKMMVDGNYLGAVKLVIDFRESGLKPEVY 908
            VLFV++VL RGI Y  +DEE +KGGPTGYLAWKMM +G+Y  AV+LVI FRESGLKPE+Y
Sbjct: 217  VLFVEEVLRRGIPYVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIY 276

Query: 909  SYLIAMTAIVKELNEFSKAFRKLKGFVKAGLIPELDAENVWLIENYQSDLLSDGVRLSQW 1088
            SYL+AMTA+VKELNEF+KA RKLKGF +AGL+ ELD E+V L E YQSD L+DGVRLS W
Sbjct: 277  SYLVAMTAVVKELNEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNW 336

Query: 1089 VIEEGSSLNSAVVHERLLAMYICAGEGLKAEQQLWEMKLIGKEPERELYDIVLAICASQN 1268
            VI++GS     +VHERLLAMYICAG G++AE+QLWEMKL+GKE + +LYDIVLAICASQ 
Sbjct: 337  VIQDGSPSLHGIVHERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQK 396

Query: 1269 EANSVSRLLTGLEVTSSIRRKKTLSWLLRGYVKGGHFQDASKTIIKMLDIGLHPEYLDRA 1448
            E+N+ +RLLT LEV SS ++KK+LSWLLRGY+KGGHF +A++TI+KML++G +PEYLDRA
Sbjct: 397  ESNATARLLTRLEVVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRA 456

Query: 1449 AVLQGLRKAIQDTGSVEPYLSLCKYLSDADLIGPCLVYFYVDRYNLWVIKMV 1604
            AVLQGLRK IQ  G+++ Y+ LCK LSDA+LIGPCLV+ Y+ +Y LWV+KM+
Sbjct: 457  AVLQGLRKRIQQYGNLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 508


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  620 bits (1598), Expect = e-175
 Identities = 306/483 (63%), Positives = 379/483 (78%), Gaps = 2/483 (0%)
 Frame = +3

Query: 162  CKFNRSSFNTIMKYYSCPRVEIKTSVLKIPSLLFVKKKNEFRSFGATELDRFLTSD-EKD 338
            C ++   ++ ++   SC          K PS +       FR+  + ELD+++TSD E+D
Sbjct: 36   CGYSLKFYDGVLSARSCK--------FKNPSFVKQGSIRGFRALKSVELDQYVTSDDEED 87

Query: 339  EMGEAFFEAIEELERMVREPADVLEEMNNKLSSRELQLVLVYFSQEGRDSWCALEVFEWL 518
            EM + FFEAIEELERM REP+DVLEEMN++LS+RELQLVLVYFSQ+GRDSWCALEVF+WL
Sbjct: 88   EMSDGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWL 147

Query: 519  QKENKVDKETMELMVSIMCGWVRKLIEGEHSXXXXXXXXXXXXXXXX-KPSFSMIEKVIS 695
            +KEN+VDKETMELMV+IMCGWV+KLI+  H                  +P FSMIEKVIS
Sbjct: 148  RKENRVDKETMELMVAIMCGWVKKLIQEHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVIS 207

Query: 696  LYWEMGKKESGVLFVKDVLSRGIAYTVDDEENNKGGPTGYLAWKMMVDGNYLGAVKLVID 875
            LYWEMG+KE  VLFV++VL RGI Y  +DEE +KGGPTGYLAWKMM +G+Y  AV+LVI 
Sbjct: 208  LYWEMGEKEGAVLFVEEVLRRGIPYLEEDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIH 267

Query: 876  FRESGLKPEVYSYLIAMTAIVKELNEFSKAFRKLKGFVKAGLIPELDAENVWLIENYQSD 1055
            F ESGLKPEVYSYL+AMTA+VKELNE +KA RKLK F + GL+ ELD E+V L E YQSD
Sbjct: 268  FTESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSFARTGLVAELDLEDVELTEKYQSD 327

Query: 1056 LLSDGVRLSQWVIEEGSSLNSAVVHERLLAMYICAGEGLKAEQQLWEMKLIGKEPERELY 1235
            LL DGVRLS W I++GS     ++HERLLAMYICAG G++AE+QLWEMKL+GKE + +LY
Sbjct: 328  LLGDGVRLSNWAIQDGSPSLHGIIHERLLAMYICAGHGIEAEKQLWEMKLVGKEADGDLY 387

Query: 1236 DIVLAICASQNEANSVSRLLTGLEVTSSIRRKKTLSWLLRGYVKGGHFQDASKTIIKMLD 1415
            DIVLAICASQ E+N+ +RLLT LEV SS ++KK+LSWLLRGY+KGGHF +A++TI+KMLD
Sbjct: 388  DIVLAICASQKESNATARLLTRLEVASSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLD 447

Query: 1416 IGLHPEYLDRAAVLQGLRKAIQDTGSVEPYLSLCKYLSDADLIGPCLVYFYVDRYNLWVI 1595
            +G +PEYLDRAAVLQGLRK IQ  G+++ Y+ LCK LSDA+LIGPCLV+ Y+ +Y LWV+
Sbjct: 448  LGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVV 507

Query: 1596 KMV 1604
            KM+
Sbjct: 508  KML 510


Top