BLASTX nr result
ID: Coptis23_contig00027124
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00027124 (604 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]   273   2e-71
ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat...   273   2e-71
ref|NP_177580.1| pentatricopeptide repeat-containing protein [Ar...   223   3e-56
ref|XP_002887539.1| hypothetical protein ARALYDRAFT_339633 [Arab...   216   2e-54
ref|XP_003522318.1| PREDICTED: LOW QUALITY PROTEIN: putative pen...   206   3e-51

>emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]
Length = 1060

Score = 273 bits (698), Expect = 2e-71
Identities = 133/217 (61%), Positives = 168/217 (77%), Gaps = 16/217 (7%)
Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
           GEWIH Y+R RG + DLCLNN+LINMY KCG+IG ARR FDG ++DVT+WTSMIVG+A
Sbjct: 768 GEWIHAYIRH-RGLDTDLCLNNSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHA 826

Query: 423 ALHGEAGEALRLFDDMK--------NNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGW 268
           LHG+A EAL+LF +MK N ++G ++ LV+PN+VTF+GVLMACSHAG VEEG
Sbjct: 827 ALHGQAEEALQLFTEMKETNKRARKNKRNGEXESSLVLPNDVTFMGVLMACSHAGLVEEGK 886

Query: 267 RHLDSMSKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSL 88
           +H SM + + L+PRISH+GCMVDLLCRAG L EAY I+ MP++PNAVVWRTL GACSL
Sbjct: 887 QHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAYEFILKMPVRPNAVVWRTLLGACSL 946

Query: 87  QG--------NIELGAKVRRRLLQLDPSYAGDDVTMS 1
           QG NI++ ++ RR+LL+L+PS+ GD+V MS
Sbjct: 947 QGDSNGNGNSNIKIXSEARRQLLELEPSHVGDNVIMS 983

>ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat-containing protein At1g74400-like [Vitis vinifera]
Length = 482

Score = 273 bits (697), Expect = 2e-71
Identities = 133/217 (61%), Positives = 168/217 (77%), Gaps = 16/217 (7%)
Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
           GEWIH Y+R RG + DLCLNN+LINMY KCG+IG ARR FDG ++DVT+WTSMIVG+A
Sbjct: 190 GEWIHAYIRH-RGLDTDLCLNNSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHA 248

Query: 423 ALHGEAGEALRLFDDMK--------NNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGW 268
           LHG+A EAL+LF +MK N ++G ++ LV+PN+VTF+GVLMACSHAG VEEG
Sbjct: 249 ALHGQAEEALQLFTEMKETNKRARKNKRNGEHESSLVLPNDVTFMGVLMACSHAGLVEEGK 308

Query: 267 RHLDSMSKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSL 88
           +H SM + + L+PRISH+GCMVDLLCRAG L EAY I+ MP++PNAVVWRTL GACSL
Sbjct: 309 QHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAYEFILKMPVRPNAVVWRTLLGACSL 368

Query: 87  QG--------NIELGAKVRRRLLQLDPSYAGDDVTMS 1
           QG NI++ ++ RR+LL+L+PS+ GD+V MS
Sbjct: 369 QGDSNGNGNSNIKIYSEARRQLLELEPSHVGDNVIMS 405

Score = 58.9 bits (141), Expect = 7e-07
Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 6/162 (3%)
Frame = -2

Query: 567 GFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYALHGEAGEALRLF 388
           GFE + L +LI+MY G++ A FD + +++ SWTS+I Y + +AL+LF
Sbjct: 100 GFEPIIFLQTSLISMYSATGNVADAHNMFDEIPSKNLISWTSVISAYVDNQRPNKALQLF 159

Query: 387 DDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEG-WRHLDSMSKKHGLKPRISHY 211
           M+ + V P+ VT L AC+ G ++ G W H + + GL +
Sbjct: 160 RQMQMDD--------VQPDIVTVTVALSACADLGALDMGEWIH--AYIRHRGLDTDLCLN 209

Query: 210 GCMVDLLCRAGHLNEAYALI-----MNMPLQPNAVVWRTLHG 100
           ++++ + G + A L ++ + +V LHG
Sbjct: 210 NSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHALHG 251

>ref|NP_177580.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75169846|sp|Q9CA73.1|PP119_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At1g74400
 gi|12324820|gb|AAG52382.1|AC011765_34 hypothetical protein; 20273-21661 [Arabidopsis thaliana]
 gi|332197466|gb|AEE35587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 462

Score = 223 bits (567), Expect = 3e-56
Identities = 107/194 (55%), Positives = 142/194 (73%)
Frame = -2

Query: 582 VRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYALHGEAGE 403
           ++RKR DL L N+L+NMYVK G+ +AR+ FD R+DVT++TSMI GYAL+G+A E
Sbjct: 194 IKRKRRLAMDLTLRNSLLNMYVKSGETEKARKLFDESMRKDVTTYTSMIFGYALNGQAQE 253

Query: 402 ALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMSKKHGLKPR 223
           +L LF MK ++ ++ PN+VTFIGVLMACSH+G VEEG RH SM + LKPR
Sbjct: 254 SLELFKKMKTIDQS--QDTVITPNDVTFIGVLMACSHSGLVEEGKRHFKSMIMDYNLKPR 311

Query: 222 ISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELGAKVRRRLL 43
           +H+GCMVDL CR+GHL +A+ I MP++PN V+WRTL GACSL GN+ELG +V+RR+
Sbjct: 312 EAHFGCMVDLFCRSGHLKDAHEFINQMPIKPNTVIWRTLLGACSLHGNVELGEEVQRRIF 371

Query: 42  QLDPSYAGDDVTMS 1
           +LD + GD V +S
Sbjct: 372 ELDRDHVGDYVALS 385

Score = 56.6 bits (135), Expect = 3e-06
Identities = 45/194 (23%), Positives = 91/194 (46%), Gaps = 2/194 (1%)
Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGL-SRRDVTSWTSMIVGY 427
           G IH VR K GF A + + +L+ Y GD+ AR+ FD ++++ WT+MI Y
Sbjct: 84  GRQIHALVR-KLGFNAVIQIQTSLVGFYSSVGDVDYARQVFDETPEKQNIVLWTAMISAY 142

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWR-HLDSM 250
           + + EA+ LF M+ K + + V L AC+ G V+ G + S+
Sbjct: 143 TENENSVEAIELFKRMEAEK--------IELDGVIVTVALSACADLGAVQMGEEIYSRSI 194

Query: 249 SKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIEL 70
           +K L ++ ++++ ++G +A L + ++ + + ++ +L G +
Sbjct: 195 KRKRRLAMDLTLRNSLLNMYVKSGETEKARKL-FDESMRKDVTTYTSMIFGYALNGQAQE 253

Query: 69  GAKVRRRLLQLDPS 28
           ++ +++ +D S
Sbjct: 254 SLELFKKMKTIDQS 267

>ref|XP_002887539.1| hypothetical protein ARALYDRAFT_339633 [Arabidopsis lyrata subsp. lyrata]
 gi|297333380|gb|EFH63798.1| hypothetical protein ARALYDRAFT_339633 [Arabidopsis lyrata subsp.
 lyrata]
Length = 665

Score = 216 bits (550), Expect = 2e-54
Identities = 107/202 (52%), Positives = 146/202 (72%), Gaps = 1/202 (0%)
Frame = -2

Query: 603 GEWIHGY-VRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGY 427
           GE I+ ++RKR DL L N+L+NMYVK G+I +AR+ FD R+DVT++T MI GY
Sbjct: 186 GEQIYSRSIKRKRRLAMDLTLRNSLLNMYVKSGEIEKARKLFDETMRKDVTTYTCMIFGY 245

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMS 247
           AL+GEA E+L LF MK ++ ++ PN+VTFIGVLMACSH+G VEEG ++ SM
Sbjct: 246 ALNGEAQESLELFKKMKMIDQS--QDTVITPNDVTFIGVLMACSHSGLVEEGKQYFKSMI 303

Query: 246 KKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELG 67
           + LKPR +H+GCMVDL CR+GHL +A+ I MP++PNAV+WRTL GAC L GN+ELG
Sbjct: 304 VDYNLKPRDAHFGCMVDLFCRSGHLKDAHEFIKQMPIKPNAVIWRTLLGACILHGNVELG 363

Query: 66  AKVRRRLLQLDPSYAGDDVTMS 1
           +V++R+ +L+ + GD V +S
Sbjct: 364 EEVQKRIFKLERDHVGDYVALS 385

Score = 63.5 bits (153), Expect = 3e-08
Identities = 46/194 (23%), Positives = 95/194 (48%), Gaps = 2/194 (1%)
Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGL-SRRDVTSWTSMIVGY 427
           G IH VR K GF A + + +L+ Y GD+ AR+ FD ++++ WT+MI Y
Sbjct: 84  GRQIHALVR-KLGFNAVIQIQTSLVGFYSSAGDLDDARQVFDETPEKQNIVLWTAMISAY 142

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWR-HLDSM 250
           + + + EA++LF M+ K + +EV L AC+ G V+ G + + S+
Sbjct: 143 SENENSVEAIKLFKRMEEEK--------IELDEVIVTAALSACADLGAVQMGEQIYSRSI 194

Query: 249 SKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIEL 70
           +K L ++ ++++ ++G + +A L + ++ + + + +L G +
Sbjct: 195 KRKRRLAMDLTLRNSLLNMYVKSGEIEKARKL-FDETMRKDVTTYTCMIFGYALNGEAQE 253

Query: 69  GAKVRRRLLQLDPS 28
           ++ +++ +D S
Sbjct: 254 SLELFKKMKMIDQS 267

>ref|XP_003522318.1| PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g74400-like [Glycine max]
Length = 333

Score = 206 bits (523), Expect = 3e-51
Identities = 105/202 (51%), Positives = 145/202 (71%), Gaps = 1/202 (0%)
Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
           G+ +H + K G++ + L L+ Y + G++ R+ FDG+ +DVT+W SMIVG+A
Sbjct: 88  GKQLHALII-KFGYQTIVQLQITLLKTYAQRGNL--PRKVFDGMXNKDVTTWISMIVGHA 144

Query: 423 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMSK 244
           +HG+A EAL+LF +M++ D ++ PN+VTFIGVLMACSHAG VEEG H SMS+
Sbjct: 145 VHGQAREALQLFSEMRDKDDC-----VLTPNDVTFIGVLMACSHAGMVEEGKLHFRSMSE 199

Query: 243 KHGLKPRISHYGCMVDLLCR-AGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELG 67
           +G++PR +H+GCMVDLLCR GHL ++Y IM MP+ PNAVVWRTL GACS++G +EL
Sbjct: 200 VYGIEPREAHFGCMVDLLCRVGGHLRDSYDFIMEMPVPPNAVVWRTLLGACSVRGELELA 259

Query: 66  AKVRRRLLQLDPSYAGDDVTMS 1
           A+VR++LL+LD Y D V MS
Sbjct: 260 AEVRQKLLKLDLGYVVDSVAMS 281