BLASTX nr result

ID: Coptis21_contig00023341 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00023341
         (446 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI37903.3| unnamed protein product [Vitis vinifera]              187   1e-45
ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containi...   187   1e-45
ref|XP_002528283.1| pentatricopeptide repeat-containing protein,...   182   2e-44
ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar...   162   3e-38
ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containi...   161   5e-38

>emb|CBI37903.3| unnamed protein product [Vitis vinifera]
          Length = 516

 Score =  187 bits (474), Expect = 1e-45
 Identities = 90/148 (60%), Positives = 112/148 (75%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+G+G +AL+LF++MM+  +RPDQFTF             KHGKQIH+YL+R  F+
Sbjct: 297 SGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLKHGKQIHAYLLRINFQ 356

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVS+LIDMYSKCGSLG+ R VFD M +KLD+VLWNT++S LAQHG G+E I +   
Sbjct: 357 PNTIVVSALIDMYSKCGSLGIGRKVFDLMGNKLDVVLWNTIISALAQHGCGEEAIQMLDD 416

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M R G KP++IT VVILNACSHSGLV +
Sbjct: 417 MVRSGAKPDKITFVVILNACSHSGLVQQ 444



 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 42/170 (24%), Positives = 73/170 (42%), Gaps = 31/170 (18%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           + + G   EAL+ +SE    GI+ + F+F                +Q+H  ++   F  N
Sbjct: 167 HAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVCVKLKEVGLTRQVHGQILVAGFLSN 226

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTM--------------------- 303
            ++ SS++D Y KCG +G  R +FD+M  + D++ W TM                     
Sbjct: 227 VVLSSSVLDAYVKCGLMGDARKLFDEMSAR-DVLAWTTMVSGYAKWGDMKSANELFVEMP 285

Query: 304 ----------LSTLAQHGLGDECIHLFHHMQRLGTKPNRITLVVILNACS 423
                     +S  A++G+G + + LF  M     +P++ T    L AC+
Sbjct: 286 EKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACA 335


>ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containing protein At2g21090
           [Vitis vinifera]
          Length = 612

 Score =  187 bits (474), Expect = 1e-45
 Identities = 90/148 (60%), Positives = 112/148 (75%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+G+G +AL+LF++MM+  +RPDQFTF             KHGKQIH+YL+R  F+
Sbjct: 297 SGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLKHGKQIHAYLLRINFQ 356

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVS+LIDMYSKCGSLG+ R VFD M +KLD+VLWNT++S LAQHG G+E I +   
Sbjct: 357 PNTIVVSALIDMYSKCGSLGIGRKVFDLMGNKLDVVLWNTIISALAQHGCGEEAIQMLDD 416

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M R G KP++IT VVILNACSHSGLV +
Sbjct: 417 MVRSGAKPDKITFVVILNACSHSGLVQQ 444



 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 42/170 (24%), Positives = 73/170 (42%), Gaps = 31/170 (18%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           + + G   EAL+ +SE    GI+ + F+F                +Q+H  ++   F  N
Sbjct: 167 HAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVCVKLKEVGLTRQVHGQILVAGFLSN 226

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTM--------------------- 303
            ++ SS++D Y KCG +G  R +FD+M  + D++ W TM                     
Sbjct: 227 VVLSSSVLDAYVKCGLMGDARKLFDEMSAR-DVLAWTTMVSGYAKWGDMKSANELFVEMP 285

Query: 304 ----------LSTLAQHGLGDECIHLFHHMQRLGTKPNRITLVVILNACS 423
                     +S  A++G+G + + LF  M     +P++ T    L AC+
Sbjct: 286 EKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACA 335


>ref|XP_002528283.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223532320|gb|EEF34121.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 602

 Score =  182 bits (462), Expect = 2e-44
 Identities = 92/148 (62%), Positives = 106/148 (71%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           +GY R  LG +AL+LF++MM   IRPDQFTF              HGKQIH YLIR   +
Sbjct: 288 AGYARHDLGHKALELFTKMMALNIRPDQFTFSSCLCASASIASLNHGKQIHGYLIRTNIR 347

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVSSLIDMYSKCG L V R VFD M DK D+VLWNT++S+LAQHG G E I +F  
Sbjct: 348 PNTIVVSSLIDMYSKCGCLEVGRLVFDLMGDKWDVVLWNTIISSLAQHGRGQEAIQMFDD 407

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M RLG KP+RITL+V+LNACSHSGLV E
Sbjct: 408 MVRLGMKPDRITLIVLLNACSHSGLVQE 435



 Score = 64.3 bits (155), Expect = 1e-08
 Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 31/161 (19%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           Y +SG   +AL+ + E+   GI  ++++F             +  KQ H  ++   F  N
Sbjct: 158 YAKSGFCNDALRFYRELRRLGIGYNEYSFAGLLNICVKVKELELSKQAHGQVLVAGFLSN 217

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQ--------------- 321
            ++ SS++D Y+KC  +G  R +FD+M  + D++ W TM+S  AQ               
Sbjct: 218 LVISSSVLDAYAKCSEMGDARRLFDEMIIR-DVLAWTTMVSGYAQWGDVEAARELFDLMP 276

Query: 322 ----------------HGLGDECIHLFHHMQRLGTKPNRIT 396
                           H LG + + LF  M  L  +P++ T
Sbjct: 277 EKNPVAWTSLIAGYARHDLGHKALELFTKMMALNIRPDQFT 317


>ref|NP_179705.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206523|sp|Q9SKQ4.1|PP167_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g21090 gi|4803934|gb|AAD29807.1| unknown protein
           [Arabidopsis thaliana] gi|330252028|gb|AEC07122.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 597

 Score =  162 bits (410), Expect = 3e-38
 Identities = 81/148 (54%), Positives = 99/148 (66%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           +GYVR G G  AL LF +M+  G++P+QFTF             +HGK+IH Y+IR   +
Sbjct: 284 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 343

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PNAIV+SSLIDMYSK GSL     VF   DDK D V WNTM+S LAQHGLG + + +   
Sbjct: 344 PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDD 403

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M +   +PNR TLVVILNACSHSGLV+E
Sbjct: 404 MIKFRVQPNRTTLVVILNACSHSGLVEE 431



 Score = 60.1 bits (144), Expect = 2e-07
 Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 31/162 (19%)
 Frame = +1

Query: 4   GYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKP 183
           GY + G   EAL  + E    GI+ ++F+F             +  +Q H  ++   F  
Sbjct: 153 GYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLS 212

Query: 184 NAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLA--------------- 318
           N ++  S+ID Y+KCG +   +  FD+M  K DI +W T++S  A               
Sbjct: 213 NVVLSCSIIDAYAKCGQMESAKRCFDEMTVK-DIHIWTTLISGYAKLGDMEAAEKLFCEM 271

Query: 319 ----------------QHGLGDECIHLFHHMQRLGTKPNRIT 396
                           + G G+  + LF  M  LG KP + T
Sbjct: 272 PEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFT 313


>ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090-like [Cucumis sativus]
          Length = 611

 Score =  161 bits (408), Expect = 5e-38
 Identities = 81/148 (54%), Positives = 101/148 (68%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+ LG EAL  F++MM  GI P+Q+TF             KHGKQ+H YLIR  F+
Sbjct: 300 SGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTYFR 359

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
            N IVVSSLIDMYSKCG L     VF  M +K D+V+WNTM+S LAQ+G G++ + +F+ 
Sbjct: 360 CNTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDVVVWNTMISALAQNGHGEKAMQMFND 419

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M   G KP+RIT +VIL+ACSHSGLV E
Sbjct: 420 MVESGLKPDRITFIVILSACSHSGLVQE 447



 Score = 61.2 bits (147), Expect = 8e-08
 Identities = 36/119 (30%), Positives = 59/119 (49%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           Y + G   EA+ L+ +     +  + F+F             +  KQ+H  ++   F  N
Sbjct: 170 YAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAKQVHGQVLVAGFLSN 229

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHHM 363
            ++ SS++D YSKCG +   R++FD+M  K DI  W T++S  A+ G  +    LFH M
Sbjct: 230 LVLSSSIVDAYSKCGEMRCARTLFDEMLVK-DIHAWTTIVSGYAKWGDMNSASELFHQM 287


Top