BLASTX nr result

ID: Coptis23_contig00027124 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00027124
         (604 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]   273   2e-71
ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat...   273   2e-71
ref|NP_177580.1| pentatricopeptide repeat-containing protein [Ar...   223   3e-56
ref|XP_002887539.1| hypothetical protein ARALYDRAFT_339633 [Arab...   216   2e-54
ref|XP_003522318.1| PREDICTED: LOW QUALITY PROTEIN: putative pen...   206   3e-51

>emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]
          Length = 1060

 Score =  273 bits (698), Expect = 2e-71
 Identities = 133/217 (61%), Positives = 168/217 (77%), Gaps = 16/217 (7%)
 Frame = -2

Query: 603  GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
            GEWIH Y+R  RG + DLCLNN+LINMY KCG+IG ARR FDG  ++DVT+WTSMIVG+A
Sbjct: 768  GEWIHAYIRH-RGLDTDLCLNNSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHA 826

Query: 423  LHGEAGEALRLFDDMK--------NNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGW 268
            LHG+A EAL+LF +MK        N ++G  ++ LV+PN+VTF+GVLMACSHAG VEEG 
Sbjct: 827  LHGQAEEALQLFTEMKETNKRARKNKRNGEXESSLVLPNDVTFMGVLMACSHAGLVEEGK 886

Query: 267  RHLDSMSKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSL 88
            +H  SM + + L+PRISH+GCMVDLLCRAG L EAY  I+ MP++PNAVVWRTL GACSL
Sbjct: 887  QHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAYEFILKMPVRPNAVVWRTLLGACSL 946

Query: 87   QG--------NIELGAKVRRRLLQLDPSYAGDDVTMS 1
            QG        NI++ ++ RR+LL+L+PS+ GD+V MS
Sbjct: 947  QGDSNGNGNSNIKIXSEARRQLLELEPSHVGDNVIMS 983


>ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g74400-like [Vitis vinifera]
          Length = 482

 Score =  273 bits (697), Expect = 2e-71
 Identities = 133/217 (61%), Positives = 168/217 (77%), Gaps = 16/217 (7%)
 Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
           GEWIH Y+R  RG + DLCLNN+LINMY KCG+IG ARR FDG  ++DVT+WTSMIVG+A
Sbjct: 190 GEWIHAYIRH-RGLDTDLCLNNSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHA 248

Query: 423 LHGEAGEALRLFDDMK--------NNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGW 268
           LHG+A EAL+LF +MK        N ++G  ++ LV+PN+VTF+GVLMACSHAG VEEG 
Sbjct: 249 LHGQAEEALQLFTEMKETNKRARKNKRNGEHESSLVLPNDVTFMGVLMACSHAGLVEEGK 308

Query: 267 RHLDSMSKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSL 88
           +H  SM + + L+PRISH+GCMVDLLCRAG L EAY  I+ MP++PNAVVWRTL GACSL
Sbjct: 309 QHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAYEFILKMPVRPNAVVWRTLLGACSL 368

Query: 87  QG--------NIELGAKVRRRLLQLDPSYAGDDVTMS 1
           QG        NI++ ++ RR+LL+L+PS+ GD+V MS
Sbjct: 369 QGDSNGNGNSNIKIYSEARRQLLELEPSHVGDNVIMS 405



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 6/162 (3%)
 Frame = -2

Query: 567 GFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYALHGEAGEALRLF 388
           GFE  + L  +LI+MY   G++  A   FD +  +++ SWTS+I  Y  +    +AL+LF
Sbjct: 100 GFEPIIFLQTSLISMYSATGNVADAHNMFDEIPSKNLISWTSVISAYVDNQRPNKALQLF 159

Query: 387 DDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEG-WRHLDSMSKKHGLKPRISHY 211
             M+ +         V P+ VT    L AC+  G ++ G W H  +  +  GL   +   
Sbjct: 160 RQMQMDD--------VQPDIVTVTVALSACADLGALDMGEWIH--AYIRHRGLDTDLCLN 209

Query: 210 GCMVDLLCRAGHLNEAYALI-----MNMPLQPNAVVWRTLHG 100
             ++++  + G +  A  L       ++    + +V   LHG
Sbjct: 210 NSLINMYSKCGEIGTARRLFDGTQKKDVTTWTSMIVGHALHG 251


>ref|NP_177580.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75169846|sp|Q9CA73.1|PP119_ARATH RecName:
           Full=Putative pentatricopeptide repeat-containing
           protein At1g74400 gi|12324820|gb|AAG52382.1|AC011765_34
           hypothetical protein; 20273-21661 [Arabidopsis thaliana]
           gi|332197466|gb|AEE35587.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 462

 Score =  223 bits (567), Expect = 3e-56
 Identities = 107/194 (55%), Positives = 142/194 (73%)
 Frame = -2

Query: 582 VRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYALHGEAGE 403
           ++RKR    DL L N+L+NMYVK G+  +AR+ FD   R+DVT++TSMI GYAL+G+A E
Sbjct: 194 IKRKRRLAMDLTLRNSLLNMYVKSGETEKARKLFDESMRKDVTTYTSMIFGYALNGQAQE 253

Query: 402 ALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMSKKHGLKPR 223
           +L LF  MK       ++ ++ PN+VTFIGVLMACSH+G VEEG RH  SM   + LKPR
Sbjct: 254 SLELFKKMKTIDQS--QDTVITPNDVTFIGVLMACSHSGLVEEGKRHFKSMIMDYNLKPR 311

Query: 222 ISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELGAKVRRRLL 43
            +H+GCMVDL CR+GHL +A+  I  MP++PN V+WRTL GACSL GN+ELG +V+RR+ 
Sbjct: 312 EAHFGCMVDLFCRSGHLKDAHEFINQMPIKPNTVIWRTLLGACSLHGNVELGEEVQRRIF 371

Query: 42  QLDPSYAGDDVTMS 1
           +LD  + GD V +S
Sbjct: 372 ELDRDHVGDYVALS 385



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 45/194 (23%), Positives = 91/194 (46%), Gaps = 2/194 (1%)
 Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGL-SRRDVTSWTSMIVGY 427
           G  IH  VR K GF A + +  +L+  Y   GD+  AR+ FD    ++++  WT+MI  Y
Sbjct: 84  GRQIHALVR-KLGFNAVIQIQTSLVGFYSSVGDVDYARQVFDETPEKQNIVLWTAMISAY 142

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWR-HLDSM 250
             +  + EA+ LF  M+  K        +  + V     L AC+  G V+ G   +  S+
Sbjct: 143 TENENSVEAIELFKRMEAEK--------IELDGVIVTVALSACADLGAVQMGEEIYSRSI 194

Query: 249 SKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIEL 70
            +K  L   ++    ++++  ++G   +A  L  +  ++ +   + ++    +L G  + 
Sbjct: 195 KRKRRLAMDLTLRNSLLNMYVKSGETEKARKL-FDESMRKDVTTYTSMIFGYALNGQAQE 253

Query: 69  GAKVRRRLLQLDPS 28
             ++ +++  +D S
Sbjct: 254 SLELFKKMKTIDQS 267


>ref|XP_002887539.1| hypothetical protein ARALYDRAFT_339633 [Arabidopsis lyrata subsp.
           lyrata] gi|297333380|gb|EFH63798.1| hypothetical protein
           ARALYDRAFT_339633 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  216 bits (550), Expect = 2e-54
 Identities = 107/202 (52%), Positives = 146/202 (72%), Gaps = 1/202 (0%)
 Frame = -2

Query: 603 GEWIHGY-VRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGY 427
           GE I+   ++RKR    DL L N+L+NMYVK G+I +AR+ FD   R+DVT++T MI GY
Sbjct: 186 GEQIYSRSIKRKRRLAMDLTLRNSLLNMYVKSGEIEKARKLFDETMRKDVTTYTCMIFGY 245

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMS 247
           AL+GEA E+L LF  MK       ++ ++ PN+VTFIGVLMACSH+G VEEG ++  SM 
Sbjct: 246 ALNGEAQESLELFKKMKMIDQS--QDTVITPNDVTFIGVLMACSHSGLVEEGKQYFKSMI 303

Query: 246 KKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELG 67
             + LKPR +H+GCMVDL CR+GHL +A+  I  MP++PNAV+WRTL GAC L GN+ELG
Sbjct: 304 VDYNLKPRDAHFGCMVDLFCRSGHLKDAHEFIKQMPIKPNAVIWRTLLGACILHGNVELG 363

Query: 66  AKVRRRLLQLDPSYAGDDVTMS 1
            +V++R+ +L+  + GD V +S
Sbjct: 364 EEVQKRIFKLERDHVGDYVALS 385



 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 46/194 (23%), Positives = 95/194 (48%), Gaps = 2/194 (1%)
 Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGL-SRRDVTSWTSMIVGY 427
           G  IH  VR K GF A + +  +L+  Y   GD+  AR+ FD    ++++  WT+MI  Y
Sbjct: 84  GRQIHALVR-KLGFNAVIQIQTSLVGFYSSAGDLDDARQVFDETPEKQNIVLWTAMISAY 142

Query: 426 ALHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWR-HLDSM 250
           + +  + EA++LF  M+  K        +  +EV     L AC+  G V+ G + +  S+
Sbjct: 143 SENENSVEAIKLFKRMEEEK--------IELDEVIVTAALSACADLGAVQMGEQIYSRSI 194

Query: 249 SKKHGLKPRISHYGCMVDLLCRAGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIEL 70
            +K  L   ++    ++++  ++G + +A  L  +  ++ +   +  +    +L G  + 
Sbjct: 195 KRKRRLAMDLTLRNSLLNMYVKSGEIEKARKL-FDETMRKDVTTYTCMIFGYALNGEAQE 253

Query: 69  GAKVRRRLLQLDPS 28
             ++ +++  +D S
Sbjct: 254 SLELFKKMKMIDQS 267


>ref|XP_003522318.1| PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide
           repeat-containing protein At1g74400-like [Glycine max]
          Length = 333

 Score =  206 bits (523), Expect = 3e-51
 Identities = 105/202 (51%), Positives = 145/202 (71%), Gaps = 1/202 (0%)
 Frame = -2

Query: 603 GEWIHGYVRRKRGFEADLCLNNALINMYVKCGDIGRARRTFDGLSRRDVTSWTSMIVGYA 424
           G+ +H  +  K G++  + L   L+  Y + G++   R+ FDG+  +DVT+W SMIVG+A
Sbjct: 88  GKQLHALII-KFGYQTIVQLQITLLKTYAQRGNL--PRKVFDGMXNKDVTTWISMIVGHA 144

Query: 423 LHGEAGEALRLFDDMKNNKDGNRKNGLVIPNEVTFIGVLMACSHAGRVEEGWRHLDSMSK 244
           +HG+A EAL+LF +M++  D      ++ PN+VTFIGVLMACSHAG VEEG  H  SMS+
Sbjct: 145 VHGQAREALQLFSEMRDKDDC-----VLTPNDVTFIGVLMACSHAGMVEEGKLHFRSMSE 199

Query: 243 KHGLKPRISHYGCMVDLLCR-AGHLNEAYALIMNMPLQPNAVVWRTLHGACSLQGNIELG 67
            +G++PR +H+GCMVDLLCR  GHL ++Y  IM MP+ PNAVVWRTL GACS++G +EL 
Sbjct: 200 VYGIEPREAHFGCMVDLLCRVGGHLRDSYDFIMEMPVPPNAVVWRTLLGACSVRGELELA 259

Query: 66  AKVRRRLLQLDPSYAGDDVTMS 1
           A+VR++LL+LD  Y  D V MS
Sbjct: 260 AEVRQKLLKLDLGYVVDSVAMS 281


Top