BLASTX nr result
ID: Coptis21_contig00023341
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00023341
         (446 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI37903.3| unnamed protein product [Vitis vinifera]              187    1e-45
ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containi...   187    1e-45
ref|XP_002528283.1| pentatricopeptide repeat-containing protein,...   182    2e-44
ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar...   162    3e-38
ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containi...   161    5e-38

>emb|CBI37903.3| unnamed protein product [Vitis vinifera]
          Length = 516

 Score = 187 bits (474), Expect = 1e-45
 Identities = 90/148 (60%), Positives = 112/148 (75%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+G+G +AL+LF++MM+  +RPDQFTF             KHGKQIH+YL+R  F+
Sbjct: 297 SGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLKHGKQIHAYLLRINFQ 356

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVS+LIDMYSKCGSLG+ R VFD M +KLD+VLWNT++S LAQHG G+E I +
Sbjct: 357 PNTIVVSALIDMYSKCGSLGIGRKVFDLMGNKLDVVLWNTIISALAQHGCGEEAIQMLDD 416

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M R G KP++IT VVILNACSHSGLV +
Sbjct: 417 MVRSGAKPDKITFVVILNACSHSGLVQQ 444

 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 42/170 (24%), Positives = 73/170 (42%), Gaps = 31/170 (18%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           + + G   EAL+ +SE    GI+ + F+F                +Q+H  ++   F  N
Sbjct: 167 HAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVCVKLKEVGLTRQVHGQILVAGFLSN 226

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTM--------------------- 303
            ++ SS++D Y KCG +G  R +FD+M  + D++ W TM
Sbjct: 227 VVLSSSVLDAYVKCGLMGDARKLFDEMSAR-DVLAWTTMVSGYAKWGDMKSANELFVEMP 285

Query: 304 ----------LSTLAQHGLGDECIHLFHHMQRLGTKPNRITLVVILNACS 423
                     +S  A++G+G + + LF  M     +P++ T    L AC+
Sbjct: 286 EKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACA 335

>ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Vitis vinifera]
          Length = 612

 Score = 187 bits (474), Expect = 1e-45
 Identities = 90/148 (60%), Positives = 112/148 (75%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+G+G +AL+LF++MM+  +RPDQFTF             KHGKQIH+YL+R  F+
Sbjct: 297 SGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLKHGKQIHAYLLRINFQ 356

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVS+LIDMYSKCGSLG+ R VFD M +KLD+VLWNT++S LAQHG G+E I +
Sbjct: 357 PNTIVVSALIDMYSKCGSLGIGRKVFDLMGNKLDVVLWNTIISALAQHGCGEEAIQMLDD 416

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M R G KP++IT VVILNACSHSGLV +
Sbjct: 417 MVRSGAKPDKITFVVILNACSHSGLVQQ 444

 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 42/170 (24%), Positives = 73/170 (42%), Gaps = 31/170 (18%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           + + G   EAL+ +SE    GI+ + F+F                +Q+H  ++   F  N
Sbjct: 167 HAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVCVKLKEVGLTRQVHGQILVAGFLSN 226

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTM--------------------- 303
            ++ SS++D Y KCG +G  R +FD+M  + D++ W TM
Sbjct: 227 VVLSSSVLDAYVKCGLMGDARKLFDEMSAR-DVLAWTTMVSGYAKWGDMKSANELFVEMP 285

Query: 304 ----------LSTLAQHGLGDECIHLFHHMQRLGTKPNRITLVVILNACS 423
                     +S  A++G+G + + LF  M     +P++ T    L AC+
Sbjct: 286 EKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACA 335

>ref|XP_002528283.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223532320|gb|EEF34121.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 602

 Score = 182 bits (462), Expect = 2e-44
 Identities = 92/148 (62%), Positives = 106/148 (71%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           +GY R  LG +AL+LF++MM   IRPDQFTF              HGKQIH YLIR   +
Sbjct: 288 AGYARHDLGHKALELFTKMMALNIRPDQFTFSSCLCASASIASLNHGKQIHGYLIRTNIR 347

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PN IVVSSLIDMYSKCG L V R VFD M DK D+VLWNT++S+LAQHG G E I +F
Sbjct: 348 PNTIVVSSLIDMYSKCGCLEVGRLVFDLMGDKWDVVLWNTIISSLAQHGRGQEAIQMFDD 407

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M RLG KP+RITL+V+LNACSHSGLV E
Sbjct: 408 MVRLGMKPDRITLIVLLNACSHSGLVQE 435

 Score = 64.3 bits (155), Expect = 1e-08
 Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 31/161 (19%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           Y +SG   +AL+ + E+   GI  ++++F             +  KQ H  ++   F  N
Sbjct: 158 YAKSGFCNDALRFYRELRRLGIGYNEYSFAGLLNICVKVKELELSKQAHGQVLVAGFLSN 217

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQ--------------- 321
            ++ SS++D Y+KC  +G  R +FD+M  + D++ W TM++S AQ
Sbjct: 218 LVISSSVLDAYAKCSEMGDARRLFDEMIIR-DVLAWTTMVSGYAQWGDVEAARELFDLMP 276

Query: 322 ----------------HGLGDECIHLFHHMQRLGTKPNRIT 396
                           H LG + + LF  M  L  +P++ T
Sbjct: 277 EKNPVAWTSLIAGYARHDLGHKALELFTKMMALNIRPDQFT 317

>ref|NP_179705.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206523|sp|Q9SKQ4.1|PP167_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g21090
 gi|4803934|gb|AAD29807.1| unknown protein [Arabidopsis thaliana]
 gi|330252028|gb|AEC07122.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 597

 Score = 162 bits (410), Expect = 3e-38
 Identities = 81/148 (54%), Positives = 99/148 (66%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           +GYVR G G  AL LF +M+  G++P+QFTF             +HGK+IH Y+IR   +
Sbjct: 284 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 343

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
           PNAIV+SSLIDMYSK GSL     VF   DDK D V WNTM+S LAQHGLG + + +
Sbjct: 344 PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDD 403

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M +   +PNR TLVVILNACSHSGLV+E
Sbjct: 404 MIKFRVQPNRTTLVVILNACSHSGLVEE 431

 Score = 60.1 bits (144), Expect = 2e-07
 Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 31/162 (19%)
 Frame = +1

Query: 4   GYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKP 183
           GY + G   EAL  + E    GI+ ++F+F                +Q H  ++   F
Sbjct: 153 GYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLS 212

Query: 184 NAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLA--------------- 318
           N ++  S+ID Y+KCG +   +  FD+M  K DI +W T +S  A
Sbjct: 213 NVVLSCSIIDAYAKCGQMESAKRCFDEMTVK-DIHIWTTLISGYAKLGDMEAAEKLFCEM 271

Query: 319 ----------------QHGLGDECIHLFHHMQRLGTKPNRIT 396
                           + G G+  + LF  M  LG KP + T
Sbjct: 272 PEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFT 313

>ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containing protein At2g21090-like [Cucumis sativus]
          Length = 611

 Score = 161 bits (408), Expect = 5e-38
 Identities = 81/148 (54%), Positives = 101/148 (68%)
 Frame = +1

Query: 1   SGYVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFK 180
           SGY R+ LG EAL  F++MM  GI P+Q+TF             KHGKQ+H YLIR  F+
Sbjct: 300 SGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTYFR 359

Query: 181 PNAIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHH 360
            N IVVSSLIDMYSKCG L     VF  M +K D+V+WNTM+S LAQ+G G++ + +F+
Sbjct: 360 CNTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDVVVWNTMISALAQNGHGEKAMQMFND 419

Query: 361 MQRLGTKPNRITLVVILNACSHSGLVDE 444
           M   G KP+RIT +VIL+ACSHSGLV E
Sbjct: 420 MVESGLKPDRITFIVILSACSHSGLVQE 447

 Score = 61.2 bits (147), Expect = 8e-08
 Identities = 36/119 (30%), Positives = 59/119 (49%)
 Frame = +1

Query: 7   YVRSGLGPEALKLFSEMMIRGIRPDQFTFXXXXXXXXXXXXXKHGKQIHSYLIRRVFKPN 186
           Y + G   EA+ L+ +        + F+F                KQ+H  ++   F  N
Sbjct: 170 YAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAKQVHGQVLVAGFLSN 229

Query: 187 AIVVSSLIDMYSKCGSLGVCRSVFDQMDDKLDIVLWNTMLSTLAQHGLGDECIHLFHHM 363
            ++ SS++D YSKCG +   R++FD+M  K DI  W T++S  A+ G  +    LFH M
Sbjct: 230 LVLSSSIVDAYSKCGEMRCARTLFDEMLVK-DIHAWTTIVSGYAKWGDMNSASELFHQM 287
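
The hit table and the percentages printed for each alignment can be checked with a short script. What follows is a minimal Python sketch and not part of the BLAST output: ROW_RE, parse_summary_row and percent are hypothetical names introduced here, and the rows are pasted in by hand from the hit table of this report. The truncating integer division in percent is an assumption, chosen because it reproduces the percentages shown in this report (for example 90/148 is printed as 60%).

```python
import re

# Rows copied by hand from the "Sequences producing significant alignments"
# table of this report.
SUMMARY_ROWS = [
    "emb|CBI37903.3| unnamed protein product [Vitis vinifera] 187 1e-45",
    "ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containi... 187 1e-45",
    "ref|XP_002528283.1| pentatricopeptide repeat-containing protein,... 182 2e-44",
    "ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar... 162 3e-38",
    "ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containi... 161 5e-38",
]

# Hypothetical pattern for one row:
# "<db>|<accession>| <description> <bit score> <E value>".
ROW_RE = re.compile(
    r"^(?P<db>[a-z]+)\|(?P<accession>[^|\s]+)\|\s+"
    r"(?P<description>.+?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_summary_row(row):
    """Split one hit-table row into database tag, accession, description, score, E value."""
    match = ROW_RE.match(row)
    if match is None:
        raise ValueError("unrecognised summary row: %r" % row)
    return {
        "db": match.group("db"),
        "accession": match.group("accession"),
        "description": match.group("description"),
        "bits": float(match.group("bits")),
        "evalue": float(match.group("evalue")),
    }


def percent(part, total):
    """Integer percentage, truncated (assumption: matches the values in this report)."""
    return part * 100 // total


hits = [parse_summary_row(row) for row in SUMMARY_ROWS]
for hit in hits:
    print(hit["accession"], hit["bits"], hit["evalue"])

# Identities and Positives of the best HSP: 90/148 and 112/148.
print(percent(90, 148), percent(112, 148))
```

Keeping the rows in a list keeps the sketch self-contained; reading them from a saved report file would work the same way once the table lines are isolated.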