BLASTX nr result

ID: Coptis21_contig00016154

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00016154
         (1326 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282464.1| PREDICTED: pentatricopeptide repeat-containi...   526   e-147
emb|CAN80315.1| hypothetical protein VITISV_020760 [Vitis vinifera]   520   e-145
ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   503   e-140
ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-140
ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containi...   501   e-139

>ref|XP_002282464.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580
            [Vitis vinifera]
          Length = 807

 Score =  526 bits (1356), Expect = e-147
 Identities = 254/400 (63%), Positives = 317/400 (79%), Gaps = 4/400 (1%)
 Frame = +3

Query: 3    QYVGTIEELDGFCKENKVKDAVEVLTVLENNGITVDLPRYLLLMQICGEIGVLQYAKTVH 182
            QY GT+EE+D FCK+ KVK+A+EVL +LE     VDLPRYL LM+ CGE   LQ AK VH
Sbjct: 408  QYSGTLEEVDDFCKDGKVKEAIEVLGLLEKQHTPVDLPRYLRLMKACGEAKALQEAKAVH 467

Query: 183  DHLIRSLGHVNLSVHNKVLEMYSSCGSMSDAFDVFEKMNERNLTSWDTMITGLAKNGCGE 362
            + LI+S+  + +S +N++LEMYS CGSM DA+ VF+KM ERNLTSWDTMIT  AKN  GE
Sbjct: 468  ESLIKSVSPLKVSTYNRILEMYSKCGSMDDAYAVFKKMPERNLTSWDTMITWFAKNDLGE 527

Query: 363  DAIDLFTRFKGMGLTPDGQIFVGVFLACGVLFDIDEGMLHFESMSKVYGIVPTMEHYLSV 542
            +AIDLF +FK  GL PDGQ+F+GVF+AC VL D+ EGMLHF SMSK YGIVP+M+HY S+
Sbjct: 528  EAIDLFIQFKESGLKPDGQMFIGVFMACSVLGDVIEGMLHFNSMSKDYGIVPSMKHYASM 587

Query: 543  VNMLGSTGCLEEAMEFIEKMPFDPNVEVWESLMNLCRIHGNIELGDHCAKLVNFLDPSHL 722
            V+MLG++G L+EA+EF+EKMP +P+V+VWE+LMN+CR+ GN+E+GD CA+LV  L+PS L
Sbjct: 588  VDMLGNSGYLDEALEFVEKMPLEPSVDVWETLMNICRVQGNMEIGDRCAELVEHLEPSRL 647

Query: 723  TDLSKKGLLPA-XXXXXXXXXXXXXRSDDLRDMRSKVQEYRAGDTCHPEN---YSLLKGM 890
            T+ SK GL+P                S +L ++RS+V EYRAGDT HPEN   Y+ L+G+
Sbjct: 648  TEQSKAGLVPVKASDLEKEKEKKKLASQNLLEVRSRVHEYRAGDTSHPENDKIYAKLRGL 707

Query: 891  SAQMKEAGYVPELRQVLHDVDDESKEEALLSHSERLAVAHGLVSSAARMPIRVIKNLRVC 1070
             AQMKEAGYVPE R VLHD+D E KEEALL+HSERLAVA+GL+SS AR PIRVIKNLRVC
Sbjct: 708  KAQMKEAGYVPETRFVLHDIDQEGKEEALLAHSERLAVAYGLLSSPARSPIRVIKNLRVC 767

Query: 1071 VDCHKALKIMAKIVGRQFIMRDTKRFHHFQDGLCSCKDFW 1190
             DCH ALKI++K+VGR+ I+RD KRFHHF+DGLCSC+D+W
Sbjct: 768  GDCHTALKIISKLVGRELIIRDAKRFHHFKDGLCSCRDYW 807


>emb|CAN80315.1| hypothetical protein VITISV_020760 [Vitis vinifera]
          Length = 1148

 Score =  520 bits (1338), Expect = e-145
 Identities = 252/399 (63%), Positives = 315/399 (78%), Gaps = 4/399 (1%)
 Frame = +3

Query: 3    QYVGTIEELDGFCKENKVKDAVEVLTVLENNGITVDLPRYLLLMQICGEIGVLQYAKTVH 182
            QY GT+EE+D FCK+ KVK+A+EVL +LE     VDLPRYL LM+ CGE   LQ AK VH
Sbjct: 408  QYSGTLEEVDDFCKDGKVKEAIEVLGLLEKQHTPVDLPRYLRLMKACGEAKALQEAKAVH 467

Query: 183  DHLIRSLGHVNLSVHNKVLEMYSSCGSMSDAFDVFEKMNERNLTSWDTMITGLAKNGCGE 362
            + LI+S+  + +S +N++LEMYS CGSM DA+ VF+KM ERNLTSWDTMIT  AKN  GE
Sbjct: 468  ESLIKSVSPLKVSTYNRILEMYSKCGSMDDAYAVFKKMPERNLTSWDTMITWFAKNDLGE 527

Query: 363  DAIDLFTRFKGMGLTPDGQIFVGVFLACGVLFDIDEGMLHFESMSKVYGIVPTMEHYLSV 542
            +AIDLF +FK  GL PD Q+F+GVF+AC VL D+ EGMLHF SMSK YGIVP+M+HY S+
Sbjct: 528  EAIDLFIQFKESGLKPDXQMFIGVFMACSVLGDVIEGMLHFNSMSKDYGIVPSMKHYASM 587

Query: 543  VNMLGSTGCLEEAMEFIEKMPFDPNVEVWESLMNLCRIHGNIELGDHCAKLVNFLDPSHL 722
            V+MLG++G L+EA+EF+EKMP +P+V+VWE+LMN+CR+ GN+E+GD CA+LV  L+PS L
Sbjct: 588  VDMLGNSGYLDEALEFVEKMPLEPSVDVWETLMNICRVQGNMEIGDRCAELVEHLEPSRL 647

Query: 723  TDLSKKGLLPA-XXXXXXXXXXXXXRSDDLRDMRSKVQEYRAGDTCHPEN---YSLLKGM 890
            T+ SK GL+P                S +L ++RS+V EYRAGDT HPEN   Y+ L+G+
Sbjct: 648  TEQSKAGLVPVKASDLEKEKEKKKLASQNLLEVRSRVHEYRAGDTSHPENDKIYAKLRGL 707

Query: 891  SAQMKEAGYVPELRQVLHDVDDESKEEALLSHSERLAVAHGLVSSAARMPIRVIKNLRVC 1070
             AQMKEAGYVPE R VLHD+D E KEEALL+HSERLAVA+GL+SS AR PIRVIKNLRVC
Sbjct: 708  KAQMKEAGYVPETRFVLHDIDQEGKEEALLAHSERLAVAYGLLSSPARSPIRVIKNLRVC 767

Query: 1071 VDCHKALKIMAKIVGRQFIMRDTKRFHHFQDGLCSCKDF 1187
             DCH ALKI++K+VGR+ I+RD KRFHHF+DGLCSC+D+
Sbjct: 768  GDCHTALKIISKLVGRELIIRDAKRFHHFKDGLCSCRDY 806


>ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g25580-like [Cucumis sativus]
          Length = 731

 Score =  503 bits (1294), Expect = e-140
 Identities = 246/397 (61%), Positives = 306/397 (77%), Gaps = 4/397 (1%)
 Frame = +3

Query: 12   GTIEELDGFCKENKVKDAVEVLTVLENNGITVDLPRYLLLMQICGEIGVLQYAKTVHDHL 191
            G +E+LD FCKE K+K+AV++L VLE   I VDL RYL LM  CGE   L+ AK V +++
Sbjct: 335  GPLEKLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYV 394

Query: 192  IRSLGHVNLSVHNKVLEMYSSCGSMSDAFDVFEKMNERNLTSWDTMITGLAKNGCGEDAI 371
            I+S  HV +S +NK+LEMYS CGSM DA+ +F KM  RN+TSWDTMIT LAKNG GEDAI
Sbjct: 395  IKSQTHVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAI 454

Query: 372  DLFTRFKGMGLTPDGQIFVGVFLACGVLFDIDEGMLHFESMSKVYGIVPTMEHYLSVVNM 551
            DLF  FK  GL PDG++F+GVF AC VL D DEGMLHFESM+K YGI P+M HY+S+V+M
Sbjct: 455  DLFYEFKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDM 514

Query: 552  LGSTGCLEEAMEFIEKMPFDPNVEVWESLMNLCRIHGNIELGDHCAKLVNFLDPSHLTDL 731
            LGS G ++EA+EFIEKMP +P V++WE++MN+ R HG +ELGD C +LV  LD S L + 
Sbjct: 515  LGSIGFVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLMELGDRCFELVEHLDSSRLNEQ 574

Query: 732  SKKGLLPAXXXXXXXXXXXXXRSD-DLRDMRSKVQEYRAGDTCHPEN---YSLLKGMSAQ 899
            SK GLLP               ++ +L ++RS+V EYRAGDT HPEN   Y+LL+G+  Q
Sbjct: 575  SKAGLLPVKASDLXKREREEKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQ 634

Query: 900  MKEAGYVPELRQVLHDVDDESKEEALLSHSERLAVAHGLVSSAARMPIRVIKNLRVCVDC 1079
            MKEAGY+PE R VLHD+D E+K +ALL HSERLAVA+GL+SS+AR PIRVIKNLRVC DC
Sbjct: 635  MKEAGYIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDC 694

Query: 1080 HKALKIMAKIVGRQFIMRDTKRFHHFQDGLCSCKDFW 1190
            H ALKI++KIVGR+ I+RD KRFHHF+DGLCSC+D+W
Sbjct: 695  HSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 731


>ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like
            [Cucumis sativus]
          Length = 671

 Score =  503 bits (1294), Expect = e-140
 Identities = 245/397 (61%), Positives = 305/397 (76%), Gaps = 4/397 (1%)
 Frame = +3

Query: 12   GTIEELDGFCKENKVKDAVEVLTVLENNGITVDLPRYLLLMQICGEIGVLQYAKTVHDHL 191
            G +E+LD FCKE K+K+AV++L VLE   I VDL RYL LM  CGE   L+ AK V +++
Sbjct: 275  GPLEKLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYV 334

Query: 192  IRSLGHVNLSVHNKVLEMYSSCGSMSDAFDVFEKMNERNLTSWDTMITGLAKNGCGEDAI 371
            I+S  HV +S +NK+LEMYS CGSM DA+ +F KM  RN+TSWDTMIT LAKNG GEDAI
Sbjct: 335  IKSQTHVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAI 394

Query: 372  DLFTRFKGMGLTPDGQIFVGVFLACGVLFDIDEGMLHFESMSKVYGIVPTMEHYLSVVNM 551
            DLF  FK  GL PDG++F+GVF AC VL D DEGMLHFESM+K YGI P+M HY+S+V+M
Sbjct: 395  DLFYEFKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDM 454

Query: 552  LGSTGCLEEAMEFIEKMPFDPNVEVWESLMNLCRIHGNIELGDHCAKLVNFLDPSHLTDL 731
            LGS G ++EA+EFIEKMP +P V++WE++MN+ R HG +ELGD C +LV  LD S L + 
Sbjct: 455  LGSIGFVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLMELGDRCFELVEHLDSSRLNEQ 514

Query: 732  SKKGLLPAXXXXXXXXXXXXXRSD-DLRDMRSKVQEYRAGDTCHPEN---YSLLKGMSAQ 899
            SK GLLP               ++ +L ++RS+V EYRAGDT HPEN   Y+LL+G+  Q
Sbjct: 515  SKAGLLPVKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQ 574

Query: 900  MKEAGYVPELRQVLHDVDDESKEEALLSHSERLAVAHGLVSSAARMPIRVIKNLRVCVDC 1079
            MKEAGY+PE R VLHD+D E+K +ALL HSERLAVA+GL+SS+AR PIRVIKNLRVC DC
Sbjct: 575  MKEAGYIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDC 634

Query: 1080 HKALKIMAKIVGRQFIMRDTKRFHHFQDGLCSCKDFW 1190
            H ALKI++KIVGR+ I+RD KRFHHF+DGLCSC+D+W
Sbjct: 635  HSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 671


>ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like
            [Glycine max]
          Length = 664

 Score =  501 bits (1289), Expect = e-139
 Identities = 244/399 (61%), Positives = 307/399 (76%), Gaps = 4/399 (1%)
 Frame = +3

Query: 6    YVGTIEELDGFCKENKVKDAVEVLTVLENNGITVDLPRYLLLMQICGEIGVLQYAKTVHD 185
            Y GT+EELD FC E  VK+AVEVL +LE   I VDLPRYL LM  CGE   L+ AK VH 
Sbjct: 266  YRGTLEELDNFCIEGNVKEAVEVLELLEKLDIPVDLPRYLQLMHQCGENKSLEEAKNVHR 325

Query: 186  HLIRSLGHVNLSVHNKVLEMYSSCGSMSDAFDVFEKMNERNLTSWDTMITGLAKNGCGED 365
            H ++ L  + +S +N++LEMY  CGS+ DA ++F  M ERNLT+WDTMIT LAKNG  ED
Sbjct: 326  HALQHLSPLQVSTYNRILEMYLECGSVDDALNIFNNMPERNLTTWDTMITQLAKNGFAED 385

Query: 366  AIDLFTRFKGMGLTPDGQIFVGVFLACGVLFDIDEGMLHFESMSKVYGIVPTMEHYLSVV 545
            +IDLFT+FK +GL PDGQ+F+GV  ACG+L DIDEGM HFESM+K YGIVP+M H++SVV
Sbjct: 386  SIDLFTQFKNLGLKPDGQMFIGVLFACGMLGDIDEGMQHFESMNKDYGIVPSMTHFVSVV 445

Query: 546  NMLGSTGCLEEAMEFIEKMPFDPNVEVWESLMNLCRIHGNIELGDHCAKLVNFLDPSHLT 725
            +M+GS G L+EA EFIEKMP  P+ ++WE+LMNLCR+HGN  LGD CA+LV  LD S L 
Sbjct: 446  DMIGSIGHLDEAFEFIEKMPMKPSADIWETLMNLCRVHGNTGLGDCCAELVEQLDSSCLN 505

Query: 726  DLSKKGLLPAXXXXXXXXXXXXXRSD-DLRDMRSKVQEYRAGDTCHPEN---YSLLKGMS 893
            + SK GL+P               ++ +L ++RS+V+EYRAGDT HPE+   Y+LL+G+ 
Sbjct: 506  EQSKAGLVPVKASDLTKEKEKRTLTNKNLLEVRSRVREYRAGDTFHPESDKIYALLRGLK 565

Query: 894  AQMKEAGYVPELRQVLHDVDDESKEEALLSHSERLAVAHGLVSSAARMPIRVIKNLRVCV 1073
            +QMKEAGYVPE + VLHD+D E KEEALL+HSERLA+A+GL++S AR P+RVIKNLRVC 
Sbjct: 566  SQMKEAGYVPETKFVLHDIDQEGKEEALLAHSERLAIAYGLLNSPARAPMRVIKNLRVCG 625

Query: 1074 DCHKALKIMAKIVGRQFIMRDTKRFHHFQDGLCSCKDFW 1190
            DCH ALKI++K+VGR+ I+RD KRFHHF DGLCSC+D+W
Sbjct: 626  DCHTALKIISKLVGRELIIRDAKRFHHFNDGLCSCRDYW 664

