BLASTX nr result

ID: Coptis21_contig00003208 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00003208
         (1791 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004149525.1| PREDICTED: UPF0586 protein C9orf41 homolog [...   624   e-176
ref|XP_003543999.1| PREDICTED: UPF0586 protein C9orf41 homolog [...   621   e-175
ref|XP_003556852.1| PREDICTED: uncharacterized protein LOC100791...   617   e-174
ref|XP_002879368.1| hypothetical protein ARALYDRAFT_902264 [Arab...   602   e-169
ref|NP_850185.1| N2227-like domain-containing protein [Arabidops...   600   e-169

>ref|XP_004149525.1| PREDICTED: UPF0586 protein C9orf41 homolog [Cucumis sativus]
          Length = 492

 Score =  624 bits (1610), Expect = e-176
 Identities = 317/479 (66%), Positives = 360/479 (75%), Gaps = 30/479 (6%)
 Frame = -2

Query: 1691 EEQADNQHTHSKLEEAMEVKSLRRIISAYLNYSDAAEEDVKRYERSYRNLPPAHKAILSH 1512
            E++ + Q    KLEEA+EVKSLRRI+SAYLNY +A+EEDVKRYERS+  LPPAHKA+LSH
Sbjct: 6    EDEDEEQTRQRKLEEALEVKSLRRIVSAYLNYPEASEEDVKRYERSFSKLPPAHKALLSH 65

Query: 1511 YPFKCEQLRRCISVNSFFIQNMLEAFEPPLDMSQDTVIYEHEELEYVPKN--CFSGERN- 1341
            +P K E+LRRCIS NS+FI NML+AFEPPLDMSQDT   +    ++   +  C  GERN 
Sbjct: 66   FPLKFERLRRCISTNSYFIFNMLQAFEPPLDMSQDTDCCDGSYPDHAHDDQFCCRGERNA 125

Query: 1340 ----------FCSGQSASTSGRTCFSEGDQT---KGYKESQSSAPNDVEA---VNNQAGL 1209
                       CSG+  STSGR C  E  Q    +G  +S  ++  + E    VN+   L
Sbjct: 126  NGNLCSRESNVCSGEPTSTSGRMCSLESKQICCPEGASDSPKASTINQEVENGVNHDQHL 185

Query: 1208 GS-----------ISESKGNVNLDPGDWLDPKFQLKVPLVDVDKVRCIIRNIVRDWASEG 1062
                          S+  GN      +WLDP  QL VPLVDVDKVRCIIRNIVRDWA EG
Sbjct: 186  EEKEVTDKHSGHCASDCNGNDCSSSHEWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAEEG 245

Query: 1061 GKERDQCYTPILEELDRLFPNRSKDSPPRCLVPGAGLGRLTLEISCLGFISQGNEFSYYM 882
             KER+QCY PILEEL  LFP+R K+SPP CLVPGAGLGRL LEISCLGFISQGNEFSYYM
Sbjct: 246  QKEREQCYKPILEELHSLFPDRKKESPPACLVPGAGLGRLALEISCLGFISQGNEFSYYM 305

Query: 881  MICSMFILNHTETVGEWTVHPWIHSNCNSLSDNDQLRAVSFPDIHPASAGITEGFSMCGG 702
            MICS FILNHT+ VGEWT++PWIHSN NSLSD+DQLR VS PDIHPASAGITEGFSMCGG
Sbjct: 306  MICSSFILNHTQKVGEWTIYPWIHSNSNSLSDSDQLRPVSIPDIHPASAGITEGFSMCGG 365

Query: 701  DFVEVYSDSNQEGAWDAVVTCFFLDTAHNIVEYIEIISRILKDGGVWINLGPLLYHFADA 522
            DFVEVYSD +Q G WDAVVTCFF+DTAHNI+EYIE+IS+ILKDGGVWINLGPLLYHFAD 
Sbjct: 366  DFVEVYSDPSQVGLWDAVVTCFFIDTAHNIIEYIEVISKILKDGGVWINLGPLLYHFADM 425

Query: 521  YGTEDEMSIELSLEDVMRVAFDYGFHLEHEKIVETTYTANPQAMMQNRYNAAFWTMTKK 345
            YG EDEMSIE SLEDV ++   YGF  E E+ VETTYT NP++MMQNRY AAFWTM KK
Sbjct: 426  YGQEDEMSIEPSLEDVKKIILHYGFVFEKERTVETTYTTNPRSMMQNRYYAAFWTMRKK 484


>ref|XP_003543999.1| PREDICTED: UPF0586 protein C9orf41 homolog [Glycine max]
          Length = 456

 Score =  621 bits (1601), Expect = e-175
 Identities = 313/453 (69%), Positives = 358/453 (79%), Gaps = 4/453 (0%)
 Frame = -2

Query: 1691 EEQADNQHTHSKLEEAMEVKSLRRIISAYLNYSDAAEEDVKRYERSYRNLPPAHKAILSH 1512
            EE  ++Q    KLEEA+E++SLRRIISAYLNY DAAEEDV+R ERSYR LPP+HKA+LS 
Sbjct: 2    EEAEEDQRRRLKLEEALEIQSLRRIISAYLNYPDAAEEDVRRNERSYRKLPPSHKALLSQ 61

Query: 1511 YPFKCEQLRRCISVNSFFIQNMLEAFEPPLDMSQDTVIYEHEELEYVPKNCFSGER-NFC 1335
            YP K ++LR CIS+N+ FI +ML+AFEPPLDMSQD    E    E   K+    E  + C
Sbjct: 62   YPQKFQRLRWCISMNTHFIFSMLQAFEPPLDMSQDADFSEDPHPESAQKDHLVSEGISAC 121

Query: 1334 SGQSASTSGRTCFSEGDQTKGYKESQSSAPNDVEAVNNQAGLGS-ISESKGNV--NLDPG 1164
            S +SA          G +++    + + +P  +     +   GS I++SKGNV       
Sbjct: 122  SCESAP-------EVGIESRHQSNTGNHSPRLIHTKETREYCGSPIADSKGNVPDTSSQQ 174

Query: 1163 DWLDPKFQLKVPLVDVDKVRCIIRNIVRDWASEGGKERDQCYTPILEELDRLFPNRSKDS 984
             WL P  +L VPLVD DKVRCIIRNIVRDWA+EG KERDQCY PILEEL+ LFPNRSK+S
Sbjct: 175  QWLAPSLKLNVPLVDADKVRCIIRNIVRDWAAEGKKERDQCYNPILEELNMLFPNRSKES 234

Query: 983  PPRCLVPGAGLGRLTLEISCLGFISQGNEFSYYMMICSMFILNHTETVGEWTVHPWIHSN 804
            PP CLVPGAGLGRL LEISCLGFISQGNEFSYYMMICS FILNH++T GEWT++PWIHSN
Sbjct: 235  PPACLVPGAGLGRLALEISCLGFISQGNEFSYYMMICSSFILNHSQTAGEWTIYPWIHSN 294

Query: 803  CNSLSDNDQLRAVSFPDIHPASAGITEGFSMCGGDFVEVYSDSNQEGAWDAVVTCFFLDT 624
            CNSLSD+DQLR VS PDIHPASAGITEGFSMCGGDFVEVYSDS+Q GAWDAVVTCFF+DT
Sbjct: 295  CNSLSDSDQLRPVSIPDIHPASAGITEGFSMCGGDFVEVYSDSSQIGAWDAVVTCFFIDT 354

Query: 623  AHNIVEYIEIISRILKDGGVWINLGPLLYHFADAYGTEDEMSIELSLEDVMRVAFDYGFH 444
            AHNIVEYIEIIS+ILKDGGVWINLGPLLYHFAD YG +DEMSIELSLEDV RVAF YGF 
Sbjct: 355  AHNIVEYIEIISKILKDGGVWINLGPLLYHFADMYGQDDEMSIELSLEDVKRVAFHYGFE 414

Query: 443  LEHEKIVETTYTANPQAMMQNRYNAAFWTMTKK 345
             E+E+ +ETTYTAN ++MMQNRY AAFWTM KK
Sbjct: 415  FENERTIETTYTANSRSMMQNRYFAAFWTMRKK 447


>ref|XP_003556852.1| PREDICTED: uncharacterized protein LOC100791662 [Glycine max]
          Length = 627

 Score =  617 bits (1591), Expect = e-174
 Identities = 313/485 (64%), Positives = 364/485 (75%), Gaps = 28/485 (5%)
 Frame = -2

Query: 1715 NRRKKMPNEEQADNQHTHSKLEEAMEVKSLRRIISAYLNYSDAAEEDVKRYERSYRNLPP 1536
            +R +++ +E + + Q    KLEEA+E++SLRRIISAYLNY DAAEEDV+RYERSYR LPP
Sbjct: 134  HRLRRVMDEAEEEQQRRRLKLEEALEIQSLRRIISAYLNYPDAAEEDVRRYERSYRKLPP 193

Query: 1535 AHKAILSHYPFKCEQLRRCISVNSFFIQNMLEAFEPPLDMSQDTVIYEHEELEYVPKNCF 1356
            +HKA+LSHY  K ++LR CIS+N+ FI  ML+AFEPPLDMSQD    E    E   K+  
Sbjct: 194  SHKALLSHYSRKFQRLRWCISMNTHFIFGMLQAFEPPLDMSQDVDFSEDPHPESTQKDHL 253

Query: 1355 SGER-NFCSGQSAST---------------SGRTCFSEGDQTKGYK---------ESQSS 1251
              E  + CS +S                     TC S+       +          + S 
Sbjct: 254  VSEGISACSCESVPVRITCSVSDQHRCVEGGNHTCISQAQMHSNEEVDIESCHQSNTGSH 313

Query: 1250 APNDVEAVNNQAGLGS-ISESKGNVNLDPGD--WLDPKFQLKVPLVDVDKVRCIIRNIVR 1080
            +P+ +         GS I++S GNV +      WLDP  +L VPLVDVDKVRCIIRNIVR
Sbjct: 314  SPSMIHPKETSEYCGSPIADSNGNVPVTSSQQQWLDPSLKLNVPLVDVDKVRCIIRNIVR 373

Query: 1079 DWASEGGKERDQCYTPILEELDRLFPNRSKDSPPRCLVPGAGLGRLTLEISCLGFISQGN 900
            DWA+EG  ERDQCY+PIL+EL+ LFPNRSKDSPP CLVPGAGLGRL LEISCLGFISQGN
Sbjct: 374  DWAAEGKNERDQCYSPILDELNMLFPNRSKDSPPACLVPGAGLGRLALEISCLGFISQGN 433

Query: 899  EFSYYMMICSMFILNHTETVGEWTVHPWIHSNCNSLSDNDQLRAVSFPDIHPASAGITEG 720
            EFSYYMMICS FILNH++T GEWT++PWIHSNCNSLSD+DQLR VS PD+HPASAGITEG
Sbjct: 434  EFSYYMMICSSFILNHSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDMHPASAGITEG 493

Query: 719  FSMCGGDFVEVYSDSNQEGAWDAVVTCFFLDTAHNIVEYIEIISRILKDGGVWINLGPLL 540
            FSMCGGDFVEVYSDS+Q GAWDAVVTCFF+DTAHNIVEYIEIIS+ILK+GGVWINLGPLL
Sbjct: 494  FSMCGGDFVEVYSDSSQVGAWDAVVTCFFIDTAHNIVEYIEIISKILKEGGVWINLGPLL 553

Query: 539  YHFADAYGTEDEMSIELSLEDVMRVAFDYGFHLEHEKIVETTYTANPQAMMQNRYNAAFW 360
            YHFAD YG +DEMSIELSLEDV RVA  YGF LE E+ +ETTYTAN ++MMQNRY +AFW
Sbjct: 554  YHFADMYGQDDEMSIELSLEDVKRVALHYGFELEKERTIETTYTANSRSMMQNRYFSAFW 613

Query: 359  TMTKK 345
            TM KK
Sbjct: 614  TMRKK 618


>ref|XP_002879368.1| hypothetical protein ARALYDRAFT_902264 [Arabidopsis lyrata subsp.
            lyrata] gi|297325207|gb|EFH55627.1| hypothetical protein
            ARALYDRAFT_902264 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  602 bits (1552), Expect = e-169
 Identities = 300/467 (64%), Positives = 354/467 (75%), Gaps = 19/467 (4%)
 Frame = -2

Query: 1691 EEQADNQHTHSKLEEAMEVKSLRRIISAYLNYSDAAEEDVKRYERSYRNLPPAHKAILSH 1512
            EE+ +      KLEEA+E KSLRRIISAYLNY +A+EED+KR+ERSYR L P+HKA++SH
Sbjct: 36   EEEEEKIRRQKKLEEALEAKSLRRIISAYLNYPEASEEDLKRWERSYRKLSPSHKALVSH 95

Query: 1511 YPFKCEQLRRCISVNSFFIQNMLEAFEPPLDMSQDTVIYEHEELEYVPKNCFS-GERNFC 1335
            YP K ++LRRCIS NS+FI NML+AFEPP+D+SQ+    E   LE  P   ++  ER+  
Sbjct: 96   YPIKFQRLRRCISANSYFIFNMLQAFEPPIDLSQELDGCEDSNLECAPHERYTLDERHDS 155

Query: 1334 SGQSASTSGRTCFSEGDQTK------GYKESQSSAPNDVEAVNNQAGL-----------G 1206
            S Q A T+  T   E    +        +E Q    +D  + ++ A             G
Sbjct: 156  SCQPALTNSCTYKEESKHIREPITGVSIEELQRKEAHDHSSKDDSADARITNKTCECDGG 215

Query: 1205 SISESKGNVNLDPGDWLDPKFQLKVPLVDVDKVRCIIRNIVRDWASEGGKERDQCYTPIL 1026
             ++   G+V+    DWLD   Q  VPLVDVDKVRCIIRNIVRDWA+EG +ERDQCY PIL
Sbjct: 216  QLNHDHGSVSFSSHDWLDSSLQTHVPLVDVDKVRCIIRNIVRDWAAEGQRERDQCYKPIL 275

Query: 1025 EELDRLFPNRSKDS-PPRCLVPGAGLGRLTLEISCLGFISQGNEFSYYMMICSMFILNHT 849
            EELD LFP+RSK+S PP CLVPGAGLGRL LEISCLGFISQGNEFSYYMMICS FILN++
Sbjct: 276  EELDSLFPDRSKESTPPACLVPGAGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYS 335

Query: 848  ETVGEWTVHPWIHSNCNSLSDNDQLRAVSFPDIHPASAGITEGFSMCGGDFVEVYSDSNQ 669
            +  GEWT++PWIHSNCNSLSDNDQLR ++ PDIHPASAGITEGFSMCGGDFVEVY++S+ 
Sbjct: 336  QVPGEWTIYPWIHSNCNSLSDNDQLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSH 395

Query: 668  EGAWDAVVTCFFLDTAHNIVEYIEIISRILKDGGVWINLGPLLYHFADAYGTEDEMSIEL 489
             G WDAVVTCFF+DTAHN++EYIE IS+ILKDGGVWINLGPLLYHFAD YG E+EMSIEL
Sbjct: 396  AGMWDAVVTCFFIDTAHNVIEYIETISKILKDGGVWINLGPLLYHFADTYGHENEMSIEL 455

Query: 488  SLEDVMRVAFDYGFHLEHEKIVETTYTANPQAMMQNRYNAAFWTMTK 348
            SLEDV RVA  YGF +E E+ +ETTYT NP+AMMQNRY  AFWTM K
Sbjct: 456  SLEDVKRVASHYGFVIEKERTIETTYTTNPRAMMQNRYYTAFWTMRK 502


>ref|NP_850185.1| N2227-like domain-containing protein [Arabidopsis thaliana]
            gi|20259498|gb|AAM13869.1| unknown protein [Arabidopsis
            thaliana] gi|22136766|gb|AAM91702.1| unknown protein
            [Arabidopsis thaliana] gi|330253550|gb|AEC08644.1|
            N2227-like domain-containing protein [Arabidopsis
            thaliana]
          Length = 504

 Score =  600 bits (1547), Expect = e-169
 Identities = 299/473 (63%), Positives = 358/473 (75%), Gaps = 19/473 (4%)
 Frame = -2

Query: 1706 KKMPNEEQADNQHTHSKLEEAMEVKSLRRIISAYLNYSDAAEEDVKRYERSYRNLPPAHK 1527
            +++ + E+ +      KLEEA+E KSLRRIISAYLNY +A+EED+KR+ERSYR L PAHK
Sbjct: 26   RELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYLNYPEASEEDLKRWERSYRKLSPAHK 85

Query: 1526 AILSHYPFKCEQLRRCISVNSFFIQNMLEAFEPPLDMSQDTVIYEHEELEYVPKNCFS-G 1350
            A++ HYP K ++LRRCIS NS+FI NML+AFEPP+D+SQ+    E   L+  P   ++  
Sbjct: 86   ALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPIDLSQELDGCEDSNLDCAPHERYTLD 145

Query: 1349 ERNFCSGQSASTSGRTCFSEGDQTKG-----------YKESQSSAPNDVEA---VNNQA- 1215
            ER+  S Q A T+  T   E    +             KE+   +P D  A   +N++  
Sbjct: 146  ERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEELQRKEAHDHSPKDDSADTRINDKTC 205

Query: 1214 --GLGSISESKGNVNLDPGDWLDPKFQLKVPLVDVDKVRCIIRNIVRDWASEGGKERDQC 1041
                G ++   G+V+    DWLD   Q  VPLVDVDKVRCIIRNIVRDWA+EG +ERDQC
Sbjct: 206  DCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDVDKVRCIIRNIVRDWAAEGQRERDQC 265

Query: 1040 YTPILEELDRLFPNRSKDS-PPRCLVPGAGLGRLTLEISCLGFISQGNEFSYYMMICSMF 864
            Y PILEELD LFP+R K+S PP CLVPGAGLGRL LEISCLGFISQGNEFSYYMMICS F
Sbjct: 266  YKPILEELDSLFPDRLKESTPPACLVPGAGLGRLALEISCLGFISQGNEFSYYMMICSSF 325

Query: 863  ILNHTETVGEWTVHPWIHSNCNSLSDNDQLRAVSFPDIHPASAGITEGFSMCGGDFVEVY 684
            ILN+T+  GEWT++PWIHSNCNSLSDNDQLR ++ PDIHPASAGITEGFSMCGGDFVEVY
Sbjct: 326  ILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAIPDIHPASAGITEGFSMCGGDFVEVY 385

Query: 683  SDSNQEGAWDAVVTCFFLDTAHNIVEYIEIISRILKDGGVWINLGPLLYHFADAYGTEDE 504
            ++S+  G WDAVVTCFF+DTAHN++EYI+ IS+ILKDGGVWINLGPLLYHFAD YG E+E
Sbjct: 386  NESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKILKDGGVWINLGPLLYHFADTYGHENE 445

Query: 503  MSIELSLEDVMRVAFDYGFHLEHEKIVETTYTANPQAMMQNRYNAAFWTMTKK 345
            MSIELSLEDV RVA  +GF +E E+ +ETTYT NP+AMMQNRY  AFWTM KK
Sbjct: 446  MSIELSLEDVKRVASHFGFVIEKERTIETTYTTNPRAMMQNRYYTAFWTMRKK 498


Top