BLASTX nr result

ID: Coptis21_contig00008871 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00008871
         (2107 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003517802.1| PREDICTED: uncharacterized protein LOC100779...   343   8e-92
ref|XP_003520084.1| PREDICTED: uncharacterized protein LOC100789...   342   3e-91
ref|XP_004135450.1| PREDICTED: uncharacterized protein LOC101203...   337   6e-90
gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group] g...   278   4e-72
ref|XP_003613854.1| hypothetical protein MTR_5g041750 [Medicago ...   270   1e-69

>ref|XP_003517802.1| PREDICTED: uncharacterized protein LOC100779481 [Glycine max]
          Length = 422

 Score =  343 bits (881), Expect = 8e-92
 Identities = 191/382 (50%), Positives = 233/382 (60%), Gaps = 7/382 (1%)
 Frame = -3

Query: 1607 AKTGSLCCVAARPHVSTTASGEWSMGPHEPYWRTNTSFSPPLSRRWERRFQSDGLPYGSR 1428
            AKTGSLCCVA+RPH S   S +WSMGP+EPYWRTN+SFSPP +R W+ RFQS+GL YG  
Sbjct: 28   AKTGSLCCVASRPHESNAGSRDWSMGPNEPYWRTNSSFSPPPTR-WDFRFQSEGLSYGVN 86

Query: 1427 SGLQLYEXXXXXXXXXXXXSMRDDRFPN--DAASDGAGSYFSSPSDSFQTQQWTPSPMTE 1254
             G+QLY              +R +   +   +ASDG G + SSPSD  Q  QWTP  + E
Sbjct: 87   DGVQLYGSSTSENDKESRGWVRGNHLYDLHYSASDGTGIFLSSPSDLSQGPQWTPPAIQE 146

Query: 1253 GIISDYASGALRAEHAS-GPLVFTPGMEGTSGAPYNAGSISSRSDGSEYEAVFKTXXXXX 1077
              I +Y +   +  H S G + FTP  EGTS   Y  GS SS+S+ SE E+  K+     
Sbjct: 147  ISIDNYETSTRKDSHPSVGRVSFTPNKEGTSVNHYCGGSTSSQSESSESESTTKSHLSSE 206

Query: 1076 XXXXXXXXS--KPIHPISFLNQTPERDVSGTITTGNYLNRQALNEAGSSTPRRETLRWXX 903
                       KPIHP+SF + T  RD      T          E  +STP R+  RW  
Sbjct: 207  RNFANLRSFMSKPIHPMSFNDLTTTRDAFDPAVTD-------FTEFDTSTPLRDGQRWSS 259

Query: 902  XXS-MDFTDVSEQLDSDSAVPSYNLSEGSVKCGLCDRLLSQRSPWSSRRIVRSGDMPVTG 726
              S  +F DV+E  + ++   S+ LS+G  KCGLC+R LSQRSPWSSRRIVRSGDMP  G
Sbjct: 260  ASSSQEFADVTESFELETPGRSHFLSDG-FKCGLCERFLSQRSPWSSRRIVRSGDMPTIG 318

Query: 725  ILSCRHVFHADCLEQTTPKIQKHDPPCPVCARL-ENALEQPTSSRLRNGLPRLKQVGEDG 549
            +L C H FHA+CLEQ TPK +K DPPCPVC +L EN+ +Q +  RLR G PRLK   +DG
Sbjct: 319  VLPCCHAFHAECLEQATPKTRKSDPPCPVCVKLEENSPDQRSHLRLRTGFPRLKSSRDDG 378

Query: 548  PSRSWSCGQVGDCVEGALHASP 483
            PSR W C QVGDCVEGALHA P
Sbjct: 379  PSRPWGCVQVGDCVEGALHAPP 400


>ref|XP_003520084.1| PREDICTED: uncharacterized protein LOC100789831 [Glycine max]
          Length = 508

 Score =  342 bits (876), Expect = 3e-91
 Identities = 201/445 (45%), Positives = 250/445 (56%), Gaps = 9/445 (2%)
 Frame = -3

Query: 1607 AKTGSLCCVAARPHVSTTASGEWSMGPHEPYWRTNTSFSPPLSRRWERRFQSDGLPYGSR 1428
            AKTGSLCCVA+RPH S   S +WSMGP+EPYWRTN+S+SPP +R W+ RFQS+GLPY   
Sbjct: 71   AKTGSLCCVASRPHESNAGSRDWSMGPNEPYWRTNSSYSPPPTR-WDFRFQSEGLPYDVN 129

Query: 1427 SGLQLYEXXXXXXXXXXXXSMRDDRFPN--DAASDGAGSYFSSPSDSFQTQQWTPSPMTE 1254
             G+QLY              +R +   +   +ASD  G + SSPSD  Q  QWTP  + E
Sbjct: 130  DGVQLYGSSTSSIDKESRGWVRGNHLYDLHYSASDDTGIFLSSPSDLSQGPQWTPPAIQE 189

Query: 1253 GIISDYASGALRAEHASGPLV-FTPGMEGTSGAPYNAGSISSRSDGSEYEAVFKTXXXXX 1077
              I +Y +   +  H S   V FTP  EGTS  P + GS SS+S+ SE E+  K+     
Sbjct: 190  ISIDNYETSTRKDSHPSVDRVSFTPNKEGTSVNPNSGGSTSSQSESSESESTAKSRLSSQ 249

Query: 1076 XXXXXXXXS--KPIHPISFLNQTPERDVSGTITTGNYLNRQALNEAGSSTPRRETLRWXX 903
                       KPIHP+SF + T  RD      T          E  +STP R+  RW  
Sbjct: 250  RNFSNLRSFMSKPIHPMSFNDLTTTRDAFDPAVTD-------FTEFDTSTPLRDGHRWSS 302

Query: 902  XXS-MDFTDVSEQLDSDSAVPSYNLSEGSVKCGLCDRLLSQRSPWSSRRIVRSGDMPVTG 726
              S  +F D++E  + ++   S+ LS+G  +CGLC+R L+QRSPWSSRRIVRSGDMP  G
Sbjct: 303  ASSSQEFADITESFELETPGRSHFLSDG-FRCGLCERFLTQRSPWSSRRIVRSGDMPTIG 361

Query: 725  ILSCRHVFHADCLEQTTPKIQKHDPPCPVCARL--ENALEQPTSSRLRNGLPRLKQVGED 552
            +L C H FHA+CLEQTTPK QK DPPCPVC +L  EN+ +Q    RLR G PRLK   +D
Sbjct: 362  VLPCCHAFHAECLEQTTPKTQKSDPPCPVCVKLEEENSPDQRGHLRLRTGFPRLKSSRDD 421

Query: 551  GPSRSWSCGQVGDCVEGALHASPXXXXXXXXXXXXXXXXSFKGISSKELP-DXXXXXXXX 375
            GPSR W C QVGDCVEGALHA P                S KG   KE P          
Sbjct: 422  GPSRPWGCVQVGDCVEGALHAPPRNTMLLLNRNRIKKNLSLKGNIGKEFPGKMRKNGTFS 481

Query: 374  XXXXXXXSVEQEAFECSRLTSGPTM 300
                   S + EA   S+ T+GP++
Sbjct: 482  SHLFSGSSADGEAVGSSKATAGPSV 506


>ref|XP_004135450.1| PREDICTED: uncharacterized protein LOC101203618 [Cucumis sativus]
            gi|449532609|ref|XP_004173273.1| PREDICTED:
            uncharacterized LOC101203618 [Cucumis sativus]
          Length = 436

 Score =  337 bits (865), Expect = 6e-90
 Identities = 207/444 (46%), Positives = 253/444 (56%), Gaps = 11/444 (2%)
 Frame = -3

Query: 1598 GSLCCVAARPHVSTTASGEWSMGPHEPYWRTNTSFSPPLSRRWERRFQSDGLPYGSRSGL 1419
            GSLCCVAARPH S  AS +WS+GPHEP+W TNTSFSPP SR W+ +FQS+GLP+G    +
Sbjct: 2    GSLCCVAARPHGSNAASRDWSLGPHEPFWHTNTSFSPPPSR-WDIQFQSEGLPHGWHDAV 60

Query: 1418 QLYEXXXXXXXXXXXXSMRDDR--FPNDAASDGAGSYFSSPSDSFQTQQWTPSPMTEGII 1245
            QLY              +R +   + +++ASDGAG + SSPSD  Q  QWTP  + E  I
Sbjct: 61   QLYGSSTSSNSKESRSWIRGNNHLYTHNSASDGAGLFLSSPSDISQGPQWTPPAIQEINI 120

Query: 1244 SDYASGALRAEHASGPLVFTPGMEGTSGAPYNAGSISSRSDGSEYEAVFK--TXXXXXXX 1071
              Y + A + + +     F P  EG S  P +  S  S+SD SE E   K  +       
Sbjct: 121  DGYET-ATKRDPSLRTFSFWPAAEGNSENPDSGSSTFSQSDSSETEPTVKLRSSSNWNFT 179

Query: 1070 XXXXXXSKPIHPISFLNQTPERDVSGTITTGNYLNRQALNEAGSSTPRRETLRWXXXXS- 894
                  SKPIHP++   QT   +   +   G         E  SSTP+R+  RW    S 
Sbjct: 180  SRRSFMSKPIHPLAIPMQTSSGEAFESTNLG-------FAEFDSSTPQRDNQRWSSASSS 232

Query: 893  MDFTDVSEQLDSDSAVPSYNLSEGSVKCGLCDRLLSQRSPWSSRRIVRSGDMPVTGILSC 714
            +DF DVSE L+SD    S   S+ S +CGLC+R LSQRSPWSSRRIVRS DMPV G+LSC
Sbjct: 233  IDFADVSEPLESDFYFKSSCRSD-SFRCGLCERFLSQRSPWSSRRIVRSTDMPVAGVLSC 291

Query: 713  RHVFHADCLEQTTPKIQKHDPPCPVCARLEN--ALEQPTSSRLR--NGLPRLK-QVGEDG 549
            RHVFHA+CL+QTTPK  K DPPCP+C + EN  + EQ T+SRLR  N LPR +    EDG
Sbjct: 292  RHVFHAECLDQTTPKTCKSDPPCPLCLKHENDRSPEQRTNSRLRNANSLPRPRPSTSEDG 351

Query: 548  PSRSWSCGQVGDCVEGALHASPXXXXXXXXXXXXXXXXSFKGISSKELP-DXXXXXXXXX 372
            PSR W C QVGDCVEGALHA P                SFKG SSKE P           
Sbjct: 352  PSRPWGCAQVGDCVEGALHA-PPRNSMLFVNRNRSKNLSFKGNSSKEFPGKLRKSGSYSS 410

Query: 371  XXXXXXSVEQEAFECSRLTSGPTM 300
                    +QE   CSR ++GP+M
Sbjct: 411  RLVSARPFDQEFVGCSRTSAGPSM 434


>gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group]
            gi|125531860|gb|EAY78425.1| hypothetical protein
            OsI_33515 [Oryza sativa Indica Group]
          Length = 433

 Score =  278 bits (711), Expect = 4e-72
 Identities = 174/400 (43%), Positives = 223/400 (55%), Gaps = 32/400 (8%)
 Frame = -3

Query: 1598 GSLCCVAARPHVSTTASGEWS-MGPHEPYWRTNTSFSPPLSRRWERRFQSDGLPYGSR-- 1428
            GSLCCVA+RPH ++TAS EWS +G  +P WRTN  FSPPLSRRWE R  S+GL YGS+  
Sbjct: 2    GSLCCVASRPHGASTASREWSSIGRSDPLWRTNAGFSPPLSRRWEYRINSEGLSYGSQGD 61

Query: 1427 SGLQLY--EXXXXXXXXXXXXSMRDDRFPND---AASDGAGSYFSSPSDSFQTQQWTPSP 1263
            SG   +                 R D  P+    + S+GA SYF+SP  +FQ        
Sbjct: 62   SGAAAHYGSSLSSNSKEPSRSWERSDVPPDHHRYSTSEGAISYFNSPDVTFQNHHIMLPM 121

Query: 1262 MTEGIISDYASGALRAEHASGPLVFTPGMEGTSGAPYNAGSISSRSDGSEYEAVFKTXXX 1083
            + +  I +Y    +      G L+ +   EG SG   + GS SSRSDGSEY+ V K+   
Sbjct: 122  LQDSGIDEYMR--VSVAEPIGALLLS---EGISGQQNSGGSTSSRSDGSEYDIVPKSYSS 176

Query: 1082 XXXXXXXXXXS--KPIHPISFLNQTPERDVSGTITTGNYLNRQA---------------- 957
                         KPIHP+SF    PE  + G  T     N                   
Sbjct: 177  TPRNFPSRRSFLSKPIHPLSF----PEHALEGQETDSPVANASTSSPMPSEFKAIGEIRP 232

Query: 956  ---LNEAGSSTPRRETLRWXXXXSMDFTDVSEQLDSDSAVP--SYNLSEGSVKCGLCDRL 792
               ++ A +S    E+  W    SMD TD+SE+ D++ + P  S N+ + + +C LC+RL
Sbjct: 233  SGLMDYAYASGSHGESANWSAASSMDLTDLSERHDAERSGPLRSNNIMDRT-RCDLCERL 291

Query: 791  LSQRSPWSSRRIVRSGDMPVTGILSCRHVFHADCLEQTTPKIQKHDPPCPVCARLENA-L 615
            LS+RSPW SRRIVR+GD+PV G+L C HV+HA+CLE+TTPK QKHDPPCP C RL     
Sbjct: 292  LSKRSPWGSRRIVRTGDLPVAGVLPCCHVYHAECLERTTPKGQKHDPPCPACDRLSGKDT 351

Query: 614  EQPTSSRLRNGLPRLKQVGEDGPSRSWSCGQVGDCVEGAL 495
            EQ +  RLRNG PRL+ +GE GPSR WSC Q GDCV GA+
Sbjct: 352  EQWSICRLRNGFPRLRSLGE-GPSRVWSCAQAGDCVAGAV 390


>ref|XP_003613854.1| hypothetical protein MTR_5g041750 [Medicago truncatula]
            gi|355515189|gb|AES96812.1| hypothetical protein
            MTR_5g041750 [Medicago truncatula]
          Length = 447

 Score =  270 bits (690), Expect = 1e-69
 Identities = 159/368 (43%), Positives = 203/368 (55%), Gaps = 18/368 (4%)
 Frame = -3

Query: 1616 QNIAKTGSLCCVAARPHVSTTASGEWSMGPHEPYWRTNTSFSPPLSRRWERRFQSDGLPY 1437
            + IAKTGSLCCVA+RPH S+  S EWS+GPHEPYWRTNTS+SPP SR W+ RFQS+GLPY
Sbjct: 31   EKIAKTGSLCCVASRPHGSSADSREWSLGPHEPYWRTNTSYSPPPSR-WDFRFQSEGLPY 89

Query: 1436 GSRSGLQLYEXXXXXXXXXXXXSMRDDRFPND---AASDGAGSYFSSP--SDSFQTQQWT 1272
                G QLY+            +        D   + +DG G + SSP  SD  Q  QW 
Sbjct: 90   SLSDGGQLYDGSSTSSNGKESRTWVRGNHLYDLHYSVADGTGIFVSSPCPSDLSQGPQWM 149

Query: 1271 PSPMTEGIISDYASGALRAEHAS-GPLVFTPGMEGTSGAPYNAGSISSRSDGSEYEAVFK 1095
            P  + E    DY +   +  H S G + FTP  EGTS  PYN GS SS S+ SE E+   
Sbjct: 150  PPAIQEISFDDYTAVTRKDFHPSLGRISFTPTKEGTSQNPYNRGSTSSESESSESESTTN 209

Query: 1094 TXXXXXXXXXXXXXS--KPIHPISFLNQTPERDV--------SGTITTGNYLNRQALNEA 945
            +                KPIHP+SF + T  RD         +G  T+    + Q  + A
Sbjct: 210  SQLSFQRNFSNHRSFISKPIHPLSFPDLTTARDAFDHAVSDYTGFDTSNRLRDSQRSSNA 269

Query: 944  GSSTPRRETLRWXXXXSMDFTDVSEQLDSDSAVPSYNLSEGSVKCGLCDRLLSQRSPWSS 765
             SS               D  D++E  D ++    +  S+   +C LC++ +SQRSPWSS
Sbjct: 270  SSS--------------QDSADITESFDLETPAHLHTQSD-EFRCSLCEKFMSQRSPWSS 314

Query: 764  RRIVRSGDMPVTGILSCRHVFHADCLEQTTPKIQKHDPPCPVCARLEN--ALEQPTSSRL 591
            RRIVRSGDMP  G+L CRHVFHA+CL+Q TPK +K +PPCPVC +LE   + +Q    RL
Sbjct: 315  RRIVRSGDMPAAGVLPCRHVFHAECLDQATPKTRKIEPPCPVCVKLEEQYSPDQRGVVRL 374

Query: 590  RNGLPRLK 567
            RN  P+ K
Sbjct: 375  RNSFPKFK 382

