BLASTX nr result

ID: Glycyrrhiza23_contig00019403 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00019403
         (1466 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003536818.1| PREDICTED: uncharacterized protein LOC100797...   448   e-123
ref|XP_003556587.1| PREDICTED: uncharacterized protein LOC100779...   442   e-121
ref|XP_002512665.1| conserved hypothetical protein [Ricinus comm...   375   e-101
ref|XP_003551802.1| PREDICTED: uncharacterized protein LOC100788...   374   e-101
ref|XP_003531912.1| PREDICTED: uncharacterized protein LOC100804...   371   e-100

>ref|XP_003536818.1| PREDICTED: uncharacterized protein LOC100797767 [Glycine max]
          Length = 325

 Score =  448 bits (1153), Expect = e-123
 Identities = 246/322 (76%), Positives = 251/322 (77%), Gaps = 2/322 (0%)
 Frame = -2

Query: 1228 KAPRLPRWTRQEILVLIQGKRDAENKFRRGRNAGLPFGSGQVEPKWASVSSYCRKHGVNR 1049
            KAPRLPRWTRQEILVLIQGKRDAENKFRRGR AGLPFGSGQVEPKWASVSSYCRKHGVNR
Sbjct: 24   KAPRLPRWTRQEILVLIQGKRDAENKFRRGRTAGLPFGSGQVEPKWASVSSYCRKHGVNR 83

Query: 1048 GPVQCRKRWSNLAGDYKKIKEWESHIREETESFWVMRNDLRRERKLPGFFDKEVYDILDS 869
            GPVQCRKRWSNLAGDYKKIKEWES IREETESFWVMRNDLRRERKLPGFFDKEVYDILDS
Sbjct: 84   GPVQCRKRWSNLAGDYKKIKEWESQIREETESFWVMRNDLRRERKLPGFFDKEVYDILDS 143

Query: 868  GXXXXXXXXXXXXXXXXXXXXXXXXXXLMPAGAKGTEEEPHLLYDSNRSAA--GEDGLFS 695
                                        +PA     E  PH LYDSNRSA   GEDGLFS
Sbjct: 144  -----PAALALALSSSSPPPPTTTKTITLPA----EEPLPH-LYDSNRSAPGDGEDGLFS 193

Query: 694  DFEPEEVEVDASPIPEKKDPPNPKDIPAPAIPISEKQYQPLLRGCQAQGVTNEKQPTSNP 515
            DFE +EV   +     KK+    KDIPAP IPISEK YQPLLR CQA+ VTNEKQ TSNP
Sbjct: 194  DFEQDEVAASS-----KKN----KDIPAP-IPISEKLYQPLLRRCQAEDVTNEKQSTSNP 243

Query: 514  EMGSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLEVQNINFQLDREQRKDH 335
            EMGSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLE QNINFQLDREQRKDH
Sbjct: 244  EMGSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLEAQNINFQLDREQRKDH 303

Query: 334  ASNXXXXXXXXXXXLGRIADKL 269
            ASN           LGRIADKL
Sbjct: 304  ASNLVAVLDKLADALGRIADKL 325


>ref|XP_003556587.1| PREDICTED: uncharacterized protein LOC100779050 [Glycine max]
          Length = 319

 Score =  442 bits (1136), Expect = e-121
 Identities = 236/320 (73%), Positives = 245/320 (76%)
 Frame = -2

Query: 1228 KAPRLPRWTRQEILVLIQGKRDAENKFRRGRNAGLPFGSGQVEPKWASVSSYCRKHGVNR 1049
            KAPRLPRWTRQEILVLIQGKRDAENKFRRGR AGL FGSGQVEPKWASVSSYCRKHGVNR
Sbjct: 24   KAPRLPRWTRQEILVLIQGKRDAENKFRRGRTAGLAFGSGQVEPKWASVSSYCRKHGVNR 83

Query: 1048 GPVQCRKRWSNLAGDYKKIKEWESHIREETESFWVMRNDLRRERKLPGFFDKEVYDILDS 869
            GPVQCRKRWSNLAGDYKKIKEWES IR+ETESFWVMRNDLRRERKL GFFDKEVYDILDS
Sbjct: 84   GPVQCRKRWSNLAGDYKKIKEWESQIRDETESFWVMRNDLRRERKLAGFFDKEVYDILDS 143

Query: 868  GXXXXXXXXXXXXXXXXXXXXXXXXXXLMPAGAKGTEEEPHLLYDSNRSAAGEDGLFSDF 689
            G                                    ++P  LYDSNRSA GEDGLFSDF
Sbjct: 144  GSGPTTLALSLSSSPPPTTTIT----------TTTVPDDPPHLYDSNRSAPGEDGLFSDF 193

Query: 688  EPEEVEVDASPIPEKKDPPNPKDIPAPAIPISEKQYQPLLRGCQAQGVTNEKQPTSNPEM 509
            E +          E+K+PP+ KDIPAP IPISEKQY   LR CQA+GVTNEK  TSNPEM
Sbjct: 194  EQD----------EEKNPPSNKDIPAP-IPISEKQY---LRRCQAEGVTNEKLSTSNPEM 239

Query: 508  GSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLEVQNINFQLDREQRKDHAS 329
            GSTSQGERKRKRL TDGEEETLQYQLIDVLERNGKMLSAQLE QNINFQLDREQRKDHAS
Sbjct: 240  GSTSQGERKRKRLTTDGEEETLQYQLIDVLERNGKMLSAQLEAQNINFQLDREQRKDHAS 299

Query: 328  NXXXXXXXXXXXLGRIADKL 269
            N           LG+IADKL
Sbjct: 300  NLVAVLDKLADALGKIADKL 319


>ref|XP_002512665.1| conserved hypothetical protein [Ricinus communis]
            gi|223548626|gb|EEF50117.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 347

 Score =  375 bits (964), Expect = e-101
 Identities = 205/330 (62%), Positives = 228/330 (69%), Gaps = 10/330 (3%)
 Frame = -2

Query: 1228 KAPRLPRWTRQEILVLIQGKRDAENKFRRGRNAGLPFGSGQVEPKWASVSSYCRKHGVNR 1049
            K PRLPRWTRQEILVLIQGK+ AEN+ RRGR AG+ FGSGQVEPKWASVSSYC++HGVNR
Sbjct: 38   KTPRLPRWTRQEILVLIQGKKVAENRVRRGRTAGMAFGSGQVEPKWASVSSYCKRHGVNR 97

Query: 1048 GPVQCRKRWSNLAGDYKKIKEWESHIREETESFWVMRNDLRRERKLPGFFDKEVYDILDS 869
            GPVQCRKRWSNLAGDYKKIKEWE+HIREETESFWVMRNDLRRERKLPGFFD+EV+DILD 
Sbjct: 98   GPVQCRKRWSNLAGDYKKIKEWENHIREETESFWVMRNDLRRERKLPGFFDREVFDILDG 157

Query: 868  GXXXXXXXXXXXXXXXXXXXXXXXXXXLMPAGAKGTEEEPHLLYDSNRSAAGEDGLFSDF 689
                                        +        E+   ++DS R+AA EDGLFSDF
Sbjct: 158  A----------------GGVSAAPATPGLALALAPATEDSEAVFDSGRTAAAEDGLFSDF 201

Query: 688  EPEEVEVDASPIPEKKDPPNPKDIPAPA-------IPISEKQYQPLLRGCQAQGVTNEKQ 530
            E E    DA   PEK+       I A A       +PISEKQYQP +R  Q+QG TNEKQ
Sbjct: 202  EQE----DAGGSPEKEAVKEAPPIKAAATGGIAAPVPISEKQYQPAVRTDQSQGATNEKQ 257

Query: 529  PTSNPEMGSTSQGERKRKRL-ATDGEEE--TLQYQLIDVLERNGKMLSAQLEVQNINFQL 359
            P SNPEMGS     RKRKR   TDG+EE  TLQ QLI VLERNG+ML+AQLE QN NFQL
Sbjct: 258  PPSNPEMGSGLHESRKRKRFGTTDGDEETTTLQNQLIGVLERNGEMLTAQLEAQNTNFQL 317

Query: 358  DREQRKDHASNXXXXXXXXXXXLGRIADKL 269
            DREQRKD A++           LG+IADKL
Sbjct: 318  DREQRKDQANSLVAVLNKLADALGKIADKL 347


>ref|XP_003551802.1| PREDICTED: uncharacterized protein LOC100788594 [Glycine max]
          Length = 329

 Score =  374 bits (959), Expect = e-101
 Identities = 204/323 (63%), Positives = 228/323 (70%), Gaps = 4/323 (1%)
 Frame = -2

Query: 1225 APRLPRWTRQEILVLIQGKRDAENKFRRGRNAGLPFGSGQVEPKWASVSSYCRKHGVNRG 1046
            A RLPRWTRQEILVLIQGK DAE++FR GR +G  FGSG  EPKWA VSSYC+KHGVNR 
Sbjct: 34   AARLPRWTRQEILVLIQGKSDAESRFRPGRGSGSAFGSG--EPKWALVSSYCKKHGVNRE 91

Query: 1045 PVQCRKRWSNLAGDYKKIKEWESHIREETESFWVMRNDLRRERKLPGFFDKEVYDILDSG 866
            PVQCRKRWSNLAGDYKKIKEWES +R+E ESFW+MRNDLRRERKLPG+FD+EVY+ILDS 
Sbjct: 92   PVQCRKRWSNLAGDYKKIKEWESQVRDEAESFWLMRNDLRRERKLPGYFDREVYNILDS- 150

Query: 865  XXXXXXXXXXXXXXXXXXXXXXXXXXLMPAGAKGTEEEPHLLYDSNRSAAGEDGLFSDFE 686
                                       +   A   +EE H LYDSNR    EDGLFSD E
Sbjct: 151  ------------PSSTAAAAAAETPVAVAEAASAGDEEVH-LYDSNRRVGSEDGLFSDSE 197

Query: 685  PEEVEVDASPIPEKKDPPNPKDIPAPAIPISEKQYQPLLRGCQA----QGVTNEKQPTSN 518
             +EV + A+           KD+PAP +P+SEKQYQP L GC+     QG TN K+ T N
Sbjct: 198  KDEVLLLAA----------AKDVPAP-VPLSEKQYQPHLHGCEGEGNPQGTTNGKRATPN 246

Query: 517  PEMGSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLEVQNINFQLDREQRKD 338
            PEMGSTSQGERKRKRLATDGEEETLQ QLIDVLE+NGKML  QLE QN+NFQLDR+Q+KD
Sbjct: 247  PEMGSTSQGERKRKRLATDGEEETLQSQLIDVLEKNGKMLHDQLEAQNLNFQLDRQQQKD 306

Query: 337  HASNXXXXXXXXXXXLGRIADKL 269
             ASN           LGRIADKL
Sbjct: 307  TASNIVAVLDKLADALGRIADKL 329


>ref|XP_003531912.1| PREDICTED: uncharacterized protein LOC100804601 [Glycine max]
          Length = 326

 Score =  371 bits (952), Expect = e-100
 Identities = 204/323 (63%), Positives = 226/323 (69%), Gaps = 4/323 (1%)
 Frame = -2

Query: 1225 APRLPRWTRQEILVLIQGKRDAENKFRRGRNAGLPFGSGQVEPKWASVSSYCRKHGVNRG 1046
            A RLPRWTRQEILVLIQGK DAE++FR GR +G  FGS   EPKWA VSSYC+KHGVNR 
Sbjct: 35   AARLPRWTRQEILVLIQGKSDAESRFRPGRGSGSAFGSS--EPKWALVSSYCKKHGVNRE 92

Query: 1045 PVQCRKRWSNLAGDYKKIKEWESHIREETESFWVMRNDLRRERKLPGFFDKEVYDILDSG 866
            PVQCRKRWSNLAGDYKKIKEWES +R+ETESFW+MRNDLRRERKLPG+FD+EVY+ILDS 
Sbjct: 93   PVQCRKRWSNLAGDYKKIKEWESQVRDETESFWLMRNDLRRERKLPGYFDREVYNILDS- 151

Query: 865  XXXXXXXXXXXXXXXXXXXXXXXXXXLMPAGAKGTEEEPHLLYDSNRSAAGEDGLFSDFE 686
                                        P      EEE H LYDSNR    EDGLFSD E
Sbjct: 152  ----------------PSSTAAAAAAETPVPVATVEEEVH-LYDSNRRVGSEDGLFSDSE 194

Query: 685  PEEVEVDASPIPEKKDPPNPKDIPAPAIPISEKQYQPLLRGCQA----QGVTNEKQPTSN 518
             +EV + A+           KD+PAP +PISEKQYQP L+GC+     QG TNEK+   N
Sbjct: 195  KDEVLLLAT----------TKDVPAP-VPISEKQYQPHLQGCEGEGNPQGTTNEKRANPN 243

Query: 517  PEMGSTSQGERKRKRLATDGEEETLQYQLIDVLERNGKMLSAQLEVQNINFQLDREQRKD 338
            PEMGSTSQGERKRK LATDGEEETLQ QLIDVLE+NGKML  QLE Q +NFQLDR+Q+KD
Sbjct: 244  PEMGSTSQGERKRKWLATDGEEETLQSQLIDVLEKNGKMLRDQLEAQKLNFQLDRQQQKD 303

Query: 337  HASNXXXXXXXXXXXLGRIADKL 269
             ASN           LGRIADKL
Sbjct: 304  TASNIVAVLDKLADALGRIADKL 326


Top