BLASTX nr result

ID: Coptis24_contig00012508 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00012508
         (2143 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI19618.3| unnamed protein product [Vitis vinifera]              410   e-112
ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containi...   347   1e-92
ref|XP_003620888.1| Pentatricopeptide repeat-containing protein ...   333   1e-88
ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycin...   332   2e-88
ref|XP_002303271.1| predicted protein [Populus trichocarpa] gi|2...   313   2e-82

>emb|CBI19618.3| unnamed protein product [Vitis vinifera]
          Length = 576

 Score =  410 bits (1053), Expect = e-112
 Identities = 207/390 (53%), Positives = 278/390 (71%), Gaps = 2/390 (0%)
 Frame = +3

Query: 786  RKKIPVAVSRVSQQEIPHMREKVRISSDLDSFCREGRVQEALELMEDIEKQGLVVDPNAI 965
            RKK  V VSRVSQ+E+ +++E+     DL   CR+G V+ AL +++++E+ G+ V    +
Sbjct: 196  RKKYLVVVSRVSQREVVNVQEE-----DLKRLCRQGNVEAALHVIDEMERNGVTVSALGL 250

Query: 966  ISLLQACINLKLLESGRRVHTYIMR-SSSRGINKFNKLVEMYCKLGSTKDALGVFEEMSW 1142
              LLQ CI+LKLLE G+R H  +MR SS+  +  FNKL+EMY  LG T+ A  VFEEM  
Sbjct: 251  AELLQVCIDLKLLEVGKRAHELVMRLSSNPSVIVFNKLLEMYFDLGDTRSACRVFEEMRG 310

Query: 1143 KNLDSWNKMLAGLAENEQGKVVLEMFTEMNRSGVRPDQFTFSAVLMACGSLGAVKEGISY 1322
            + LDSWN+M+ GL +N +G+  L +F+++ + G+ PD  TF  VL AC  LGAV+EG+++
Sbjct: 311  RTLDSWNRMILGLVKNGEGEEALAIFSKLKKDGIEPDGSTFIGVLSACECLGAVEEGLAH 370

Query: 1323 FESMGKDFGISPSMEHYISIVDLLGKSGNVSEAKELVRKMPMKPSPLVQETLRRYSSIGL 1502
            F SM  D+GI+PSMEH+  IVDL G+   ++EAKE +  MP++PS ++ +TL++Y    L
Sbjct: 371  FNSMSTDYGITPSMEHFAIIVDLFGRLQKIAEAKEFIASMPLEPSSMIWQTLQKY----L 426

Query: 1503 KSV-LDVLGNHKNASDLKTSDIXXXXXXXXXXXXSVNPNKAEAYEKARSLNEEMKSAGYV 1679
            K+  +D        S LK S              + +P K++AYEK RSL++ +K AGYV
Sbjct: 427  KTERVDEPAPLTTGSGLKLSHKKRVKSNFVSKQKNASPEKSKAYEKLRSLHKGVKEAGYV 486

Query: 1680 PDTRYVLQDIDREAKEKALMYHSERLAIAYGLISTPPGTTLRIMKNLRICGDCHNAVKII 1859
             DTRYVL D+D+EAKEK+L+YHSERLAIAYGLISTPPGTTLRI+KNLRICGDCHN +KI+
Sbjct: 487  SDTRYVLHDLDQEAKEKSLLYHSERLAIAYGLISTPPGTTLRIIKNLRICGDCHNFIKIL 546

Query: 1860 SKIVEREIIVRDNKRFHHFKDGKCSCGDYW 1949
            S I +REIIVRDNKRFHHF+DGKCSCGDYW
Sbjct: 547  SNIEKREIIVRDNKRFHHFRDGKCSCGDYW 576


>ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Glycine max]
          Length = 452

 Score =  347 bits (889), Expect = 1e-92
 Identities = 189/381 (49%), Positives = 246/381 (64%), Gaps = 17/381 (4%)
 Frame = +3

Query: 858  ISSDLDSFCREGRVQEALELMEDIEKQGLVVDPNAIISLLQACINLKLLESGRRVHTYIM 1037
            ++ DL S C EG + + LELM     QG V D    ++LL  C + + LESG+RVH ++ 
Sbjct: 77   LNVDLVSLCEEGNLDQVLELMG----QGAVADYRVYLALLNLCEHTRSLESGKRVHEFLR 132

Query: 1038 RSS-SRGINKFNKLVEMYCKLGSTKDALGVFEEMSWKNLDSWNKMLAGLAENEQGKVVLE 1214
            RS+  R +   N+L+ MYCK GS KDA  VF+++  +N+ SW+ M+ G A N  G   L 
Sbjct: 133  RSTFRRDVELSNRLIGMYCKCGSVKDARRVFDQIPERNISSWHLMIGGYAANGLGCDGLL 192

Query: 1215 MFTEMNRSGVRPDQFTFSAVLMACGSLGAVKEGISYFESMGKDFGISPSMEHYISIVDLL 1394
            +F +M ++GV PD  TF  VL AC    AV+EG  +FESM K+ GI PSMEHY+ ++++L
Sbjct: 193  VFQQMKQAGVPPDGETFELVLAACAQAEAVEEGFLHFESM-KEHGIVPSMEHYLEVINIL 251

Query: 1395 GKSGNVSEAKELVRKMPMKPSPLVQETLRRYSSIG--------LKSVLDVLGNHKNASDL 1550
            G +G ++EA+E + K+P++      E+LR ++            + VL  L   K  +D 
Sbjct: 252  GNTGQLNEAEEFIEKIPIELGVEAWESLRNFAQKHGDLDLEDHAEEVLTCLDPSKAVADK 311

Query: 1551 -------KTSDIXXXXXXXXXXXXSVN-PNKAEAYEKARSLNEEMKSAGYVPDTRYVLQD 1706
                   K SD+              + P K EA+EK   L+ +M+ AGYVPDTRYVL D
Sbjct: 312  LPPPPRKKQSDMNMLEEKNRVTEYRYSIPYKEEAHEKLGGLSGQMREAGYVPDTRYVLHD 371

Query: 1707 IDREAKEKALMYHSERLAIAYGLISTPPGTTLRIMKNLRICGDCHNAVKIISKIVEREII 1886
            ID E KEKAL YHSERLAIAYGLISTPP TTLRI+KNLRICGDCHNA+KI+SKIV RE+I
Sbjct: 372  IDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIKIMSKIVGRELI 431

Query: 1887 VRDNKRFHHFKDGKCSCGDYW 1949
            VRDNKRFHHFKDGKCSCGDYW
Sbjct: 432  VRDNKRFHHFKDGKCSCGDYW 452


>ref|XP_003620888.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355495903|gb|AES77106.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 446

 Score =  333 bits (853), Expect = 1e-88
 Identities = 195/414 (47%), Positives = 249/414 (60%), Gaps = 23/414 (5%)
 Frame = +3

Query: 777  FSVRKKIPVAVSRVSQQ------EIPHMREKVRISSDLDSFCREGRVQEALELMEDIEKQ 938
            F+ RK  P+ +   S Q      + PH  + V  +     F +EG V + LELM     Q
Sbjct: 42   FTPRKTQPLRMGNPSIQPKLNHHQAPHQHKNVNFAH----FLQEGNVNQVLELMG----Q 93

Query: 939  GLVVDPNAIISLLQACINLKLLESGRRVHTYIMRSSSRG-INKFNKLVEMYCKLGSTKDA 1115
            G   D +  +SLL+ C +LK LE G+RVH ++ RS   G +   N+L+ +Y K GS KDA
Sbjct: 94   GAFADYSDFLSLLKLCEDLKSLELGKRVHEFLRRSKFGGNVELCNRLIGLYVKCGSVKDA 153

Query: 1116 LGVFEEMSWKNLDSWNKMLAGLAENEQGKVVLEMFTEMNRSGVRPDQFTFSAVLMACGSL 1295
              VF++M  +N+ S N M+ G   N  G   L +F +M + GV PD+ TF+ VL  C  +
Sbjct: 154  RKVFDKMPDRNVGSLNLMIGGYNVNGLGIDGLLVFKQMRQQGVVPDEETFALVLAVCALV 213

Query: 1296 GAVKEGISYFESMGKDFGISPSMEHYISIVDLLGKSGNVSEAKELVRKMPMKPSPLVQET 1475
              V+EG+  FESM K++GI P MEHY+ +V++ G +G + EA E +  MP++    + ET
Sbjct: 214  DGVEEGLMQFESM-KEYGIVPGMEHYLGVVNIFGCAGRLDEAHEFIENMPIEAGVELWET 272

Query: 1476 LRRYSSIG--------LKSVLDVLGNHKNASDL-------KTSDIXXXXXXXXXXXXSVN 1610
            LR ++ I            +L VL   K A+D        K S I              N
Sbjct: 273  LRNFARIHGDLEREDCADELLTVLDPSKAAADKVPLPQRKKQSAINMLEEKNRVSEYRCN 332

Query: 1611 -PNKAEAYEKARSLNEEMKSAGYVPDTRYVLQDIDREAKEKALMYHSERLAIAYGLISTP 1787
             P K E   K R L  +M+ AGYVPDTRYVL DID E KEKAL YHSERLAIAYGLISTP
Sbjct: 333  MPYKEEGDVKLRGLTGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTP 392

Query: 1788 PGTTLRIMKNLRICGDCHNAVKIISKIVEREIIVRDNKRFHHFKDGKCSCGDYW 1949
            P TTLRI+KNLRICGDCHNA+KI+SKIV RE+IVRDNKRFHHFKDGKCSCGDYW
Sbjct: 393  PRTTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 446


>ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycine max]
            gi|255636013|gb|ACU18351.1| unknown [Glycine max]
          Length = 449

 Score =  332 bits (852), Expect = 2e-88
 Identities = 181/381 (47%), Positives = 242/381 (63%), Gaps = 17/381 (4%)
 Frame = +3

Query: 858  ISSDLDSFCREGRVQEALELMEDIEKQGLVVDPNAIISLLQACINLKLLESGRRVHTYIM 1037
            ++ DL + C EG + + LELM     QG V D    ++LL  C + + LESG+RVH  + 
Sbjct: 74   LNVDLVALCEEGNLDQVLELMG----QGAVADYRVYLALLNLCEHTRSLESGKRVHEILR 129

Query: 1038 RSSSRG-INKFNKLVEMYCKLGSTKDALGVFEEMSWKNLDSWNKMLAGLAENEQGKVVLE 1214
            RS+ RG +   N+L+ MYCK GS K+A  VF++M  +N+ +W+ M+ G   N  G   L 
Sbjct: 130  RSAFRGDVELSNRLIGMYCKCGSVKNARRVFDQMLDRNMATWHLMIGGYTSNGLGCDGLL 189

Query: 1215 MFTEMNRSGVRPDQFTFSAVLMACGSLGAVKEGISYFESMGKDFGISPSMEHYISIVDLL 1394
            +F +M ++ + PD  TF  VL AC    AV+EG  +FESM K++GI PSMEHY+ +++++
Sbjct: 190  VFQQMKQAELPPDGETFELVLAACSQAEAVEEGFLHFESM-KEYGIVPSMEHYLEVINIM 248

Query: 1395 GKSGNVSEAKELVRKMPMKPSPLVQETLRRYSSIG--------LKSVLDVLGNHKNASDL 1550
            G +G + EA+E +  +P++      E+LR+++ I          + +L      K  +D 
Sbjct: 249  GNAGQLKEAEEFIENVPIELGVEAWESLRKFARIHGDLDLEDCAEELLTRFDPSKAIADK 308

Query: 1551 -------KTSDIXXXXXXXXXXXXSVN-PNKAEAYEKARSLNEEMKSAGYVPDTRYVLQD 1706
                   K SD+              + P K E  EK   L+ +M+ AGYVPDTRYVL D
Sbjct: 309  LPTPPRKKQSDVNMLEEKNRATEYRYSIPYKEEDNEKLGGLSGQMREAGYVPDTRYVLHD 368

Query: 1707 IDREAKEKALMYHSERLAIAYGLISTPPGTTLRIMKNLRICGDCHNAVKIISKIVEREII 1886
            ID E KEKAL YHSERLAIAYGLISTPP TTLRI+KNLRICGDCHNA+KI+SKIV RE+I
Sbjct: 369  IDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIKIMSKIVGRELI 428

Query: 1887 VRDNKRFHHFKDGKCSCGDYW 1949
            VRDNKRFHHFKDGKCSCGDYW
Sbjct: 429  VRDNKRFHHFKDGKCSCGDYW 449


>ref|XP_002303271.1| predicted protein [Populus trichocarpa] gi|222840703|gb|EEE78250.1|
            predicted protein [Populus trichocarpa]
          Length = 334

 Score =  313 bits (801), Expect = 2e-82
 Identities = 166/330 (50%), Positives = 218/330 (66%), Gaps = 15/330 (4%)
 Frame = +3

Query: 1005 ESGRRVHTYIMRSSSRGINKFNK-LVEMYCKLGSTKDALGVFEEMSWKNLDSWNKMLAGL 1181
            E  ++VH Y ++S+ RG  K N  +++MY K GS  DA  VF+ M  +N+DSW+ M+   
Sbjct: 6    EDAKKVHDYFLQSTFRGDVKLNNNVIKMYGKCGSMADARRVFDHMPERNMDSWHLMINEY 65

Query: 1182 AENEQGKVVLEMFTEMNRSGVRPDQFTFSAVLMACGSLGAVKEGISYFESMGKDFGISPS 1361
            A N+ G   LE+F +M + G+ P   TF AVL AC S  AV+EG  YFE M ++FGISP+
Sbjct: 66   ANNDLGDEGLELFEQMKKLGLEPTGETFHAVLSACASAEAVEEGFLYFEEMSREFGISPT 125

Query: 1362 MEHYISIVDLLGKSGNVSEAKELVRKMPMKPSPLVQETLRRYS----SIGLKSVLDVLGN 1529
            +EHY+SI+D+LGKS  ++EA E + K+P +P+  + E LR+Y+     I L+   + L  
Sbjct: 126  LEHYLSIIDVLGKSAYLNEAVEYIEKLPFEPTVEIWEALRKYARSHGDIDLEDHAEELIV 185

Query: 1530 HKNASDLKTSDIXXXXXXXXXXXXSV----------NPNKAEAYEKARSLNEEMKSAGYV 1679
              ++S    + I             +          NP   +  EK + L E MK+ GYV
Sbjct: 186  SLDSSKAVANKIPTPPPKKYNLISMLEGKNRVAEFRNPTFYKDDEKLKELRE-MKTGGYV 244

Query: 1680 PDTRYVLQDIDREAKEKALMYHSERLAIAYGLISTPPGTTLRIMKNLRICGDCHNAVKII 1859
            PDTRYVL DID+EAKE+AL+YHSERLAIAYGLISTP    LRI+KNLR+CGDCHNA+KI+
Sbjct: 245  PDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARMPLRIIKNLRVCGDCHNAIKIM 304

Query: 1860 SKIVEREIIVRDNKRFHHFKDGKCSCGDYW 1949
            SKIV RE+IVRDNKRFHHFKDGKCSCGDYW
Sbjct: 305  SKIVGRELIVRDNKRFHHFKDGKCSCGDYW 334


Top