BLASTX nr result

ID: Glycyrrhiza23_contig00016473

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00016473
         (1660 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355...   165   3e-38
ref|XP_003635931.1| Cellular nucleic acid-binding protein [Medic...   162   2e-37
ref|XP_003622194.1| Cellular nucleic acid-binding protein-like p...   156   2e-35
emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]   155   4e-35
ref|XP_003539358.1| PREDICTED: uncharacterized protein LOC100797...   151   4e-34

>ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355506807|gb|AES87949.1| Pol
            polyprotein [Medicago truncatula]
          Length = 745

 Score =  165 bits (418), Expect = 3e-38
 Identities = 103/311 (33%), Positives = 156/311 (50%), Gaps = 24/311 (7%)
 Frame = +3

Query: 648  QMFFKFYKLNPPTFSGGSNAMAAQYWLEAIEKIYQVVQCTEEQKVTFATHMLSEEAENWW 827
            +M   F K NPPTF G  +   AQ WL+ IE+I++V+QCTE+QKV F TH L+EEA++WW
Sbjct: 35   RMLETFMKKNPPTFKGRCDPDGAQTWLKEIERIFRVMQCTEDQKVRFGTHQLAEEADDWW 94

Query: 828  KGERAHLVAVGTPQDWNHFKEAFLNKYFPTTLKKQKDREFMQLKQGEMTVAEYVNKFEEL 1007
                  L   G    W  F+  FL +YFP  ++ +K+ EF++LKQG M+V EY  KF EL
Sbjct: 95   VALLPTLGQEGAVVTWAVFRREFLRRYFPEDVRGKKEIEFLELKQGNMSVTEYAAKFVEL 154

Query: 1008 ARYSSHVRYAADEEWKIDQFKWGLRADIRNCLAQITFTSYATLVHQSYVAEESLNSMFE- 1184
            +++  H      E  +  +F+ GLR DI+  +       +  LV+   + EE   + ++ 
Sbjct: 155  SKFYPHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLRVFQDLVNSCRIYEEDTKAHYKV 214

Query: 1185 --EKQLRWQKKKDEGKSSQHSKVKGNL-----NKGKQAQEV-----------ATVTP--- 1301
              E++ + Q+ + +  S+   K K  +      K K A E+           +   P   
Sbjct: 215  VNERKGKGQQSRPKPYSAPADKGKQKMVDVRRPKKKDAAEIVYFNCGEKGHKSNACPEEI 274

Query: 1302 RGCPNCGKFHKGV--CMTGQDICFYCRQSGHVQKNCPKLKQGRASGPNDPTQGRVFALSA 1475
            + C  CGK    V  C     +CF C   GH+   C + K+        PT GRVFAL+ 
Sbjct: 275  KKCVRCGKKGHVVADCNRTDIVCFNCNGEGHISSQCTQPKRA-------PTTGRVFALTG 327

Query: 1476 KKAKGVDNLIK 1508
             + +  D LI+
Sbjct: 328  TQTESEDRLIR 338


>ref|XP_003635931.1| Cellular nucleic acid-binding protein [Medicago truncatula]
            gi|355501866|gb|AES83069.1| Cellular nucleic acid-binding
            protein [Medicago truncatula]
          Length = 558

 Score =  162 bits (411), Expect = 2e-37
 Identities = 96/262 (36%), Positives = 146/262 (55%), Gaps = 5/262 (1%)
 Frame = +3

Query: 639  DPNQMFFKFYKLNPPTFSGGSNAMAAQYWLEAIEKIYQVVQCTEEQKVTFATHMLSEEAE 818
            D  +M   F + +PPTF    +   AQ WL+ +E++++V+QC+E QKV F  HML+EEAE
Sbjct: 74   DGVRMLETFLRNHPPTFKERYDPDGAQNWLKEVERVFRVMQCSEVQKVRFGAHMLAEEAE 133

Query: 819  NWWKGERAHLVAVGTPQDWNHFKEAFLNKYFPTTLKKQKDREFMQLKQGEMTVAEYVNKF 998
            +WW      L   G    W  F+  FLN+YFP  ++ +K+ EF++LKQG+M+V EYV KF
Sbjct: 134  DWWVSLLPILEQDGVAVTWAVFRREFLNRYFPEDVRGKKEIEFLELKQGDMSVTEYVAKF 193

Query: 999  EELARYSSHVRYAADEEWKIDQFKWGLRADIRNCLAQITFTSYATLVHQSYVAEESLNSM 1178
             ELA++  H  Y A E  K  +FK GLRADI+  +      ++  LV    + EE   + 
Sbjct: 194  VELAKFYPH--YTA-EFSKCIKFKNGLRADIKRAIGYQKIRNFYDLVSSCRIYEEDTKAH 250

Query: 1179 FE---EKQLRWQKKKDEGKSSQHSKVKGNLNKGKQAQEVATVTPRGCPNCG-KFHK-GVC 1343
            ++   E++ + Q+ + +  S+  +KVK  LN  ++ +     T   C  CG K HK  VC
Sbjct: 251  YKVMSERRGKGQQSRPKPYSAPANKVKQRLNDERRPRRRDAPTEIVCFKCGEKGHKSNVC 310

Query: 1344 MTGQDICFYCRQSGHVQKNCPK 1409
               +  CF C + GH   +C +
Sbjct: 311  DRDEKKCFRCGKKGHTLADCKR 332


>ref|XP_003622194.1| Cellular nucleic acid-binding protein-like protein, partial [Medicago
            truncatula] gi|355497209|gb|AES78412.1| Cellular nucleic
            acid-binding protein-like protein, partial [Medicago
            truncatula]
          Length = 509

 Score =  156 bits (394), Expect = 2e-35
 Identities = 95/303 (31%), Positives = 153/303 (50%), Gaps = 25/303 (8%)
 Frame = +3

Query: 675  NPPTFSGGSNAMAAQYWLEAIEKIYQVVQCTEEQKVTFATHMLSEEAENWWKGERAHLVA 854
            +PPTF G  +   AQ WL+ IE++++V+QCTE QKV F THML+EEA++WW      L  
Sbjct: 62   HPPTFKGRYDLDGAQTWLKEIERVFRVMQCTEVQKVRFGTHMLAEEADDWWISLLPVLKQ 121

Query: 855  VGTPQDWNHFKEAFLNKYFPTTLKKQKDREFMQLKQGEMTVAEYVNKFEELARYSSHVRY 1034
             G    W  F+  FL++YF   ++ +K+ EF++LKQG M+V EY  KF ELA++  H   
Sbjct: 122  DGAVVTWAVFRREFLDRYFLEDVRGKKEIEFLELKQGNMSVTEYAAKFVELAKFYPHYTA 181

Query: 1035 AADEEWKIDQFKWGLRADIRNCLAQITFTSYATLVHQSYVAEESLNSMFE---EKQLRWQ 1205
               +  K  +F+ GLRA+I+  +      +++ LV    + EE   + ++   E++++ Q
Sbjct: 182  ETAKFSKCIKFENGLRAEIKRAIGYQKIRTFSDLVSSCRIYEEDTKAHYKIVNERKVKGQ 241

Query: 1206 KKKDEGKSSQHSKVKGNLNKGKQAQEVATVTPRGCPNCG-KFHK---------------- 1334
            +   +  S+   K K  +   ++ ++        C  CG K HK                
Sbjct: 242  QSCPKPYSAPADKGKQRMVDERRPRKKDAHVEIVCYTCGEKGHKSNACPRDVKRCFCCGK 301

Query: 1335 -----GVCMTGQDICFYCRQSGHVQKNCPKLKQGRASGPNDPTQGRVFALSAKKAKGVDN 1499
                   C     +CF C + GH+   C K K+ +       T GRVFAL+  + +  D+
Sbjct: 302  KGHTLAECKHDDIVCFNCNEEGHIGSQCKKPKKAQ-------TTGRVFALTGTQTESEDH 354

Query: 1500 LIK 1508
            LI+
Sbjct: 355  LIR 357


>emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]
          Length = 360

 Score =  155 bits (391), Expect = 4e-35
 Identities = 92/265 (34%), Positives = 140/265 (52%), Gaps = 2/265 (0%)
 Frame = +3

Query: 654  FFKFYKLNPPTFSGGSNAMAAQYWLEAIEKIYQVVQCTEEQKVTFATHMLSEEAENWWKG 833
            F  F KL PP FSG ++   A+ W+  +EK + V+ C+EEQK ++A  ML +E ++WW+ 
Sbjct: 100  FDDFKKLGPPYFSGATDPTEAEAWILKMEKFFGVIDCSEEQKASYAAFMLDKETDHWWRM 159

Query: 834  ERAHLVAVGTPQDWNHFKEAFLNKYFPTTLKKQKDREFMQLKQGEMTVAEYVNKFEELAR 1013
             R  L   G P  W  F+EAF  KYFP ++++QK  EF++L+QG+MTVA+Y  KF EL+R
Sbjct: 160  TRRLLEDQG-PITWRQFREAFYKKYFPDSVRRQKVGEFIRLEQGDMTVAQYEAKFTELSR 218

Query: 1014 YSSHVRYAADEEWKIDQFKWGLRADIRNCLAQITFTSYATLVHQSYVAEESLNSMFEEKQ 1193
            +S  +   A EE K  +F+  L+  ++N  + +    Y              +   E+++
Sbjct: 219  FSPQL--IATEEEKALKFQDXLKPYLKNKXSILXLGXY--------------SEYREQQR 262

Query: 1194 LRWQKKKDEGKSSQHSKVKG-NLNKGKQAQEVATVTPRGCPNCGKFHKG-VCMTGQDICF 1367
             R +     G   Q     G N NKGK AQ +       CP CGK H G  C      CF
Sbjct: 263  KRNRSDGAHGNQXQRRSTSGRNQNKGKAAQNL----DGACPTCGKKHGGRPCYRETGACF 318

Query: 1368 YCRQSGHVQKNCPKLKQGRASGPND 1442
             C + GH+ ++CP+ ++     P +
Sbjct: 319  GCGKQGHLIRDCPENRKFITGKPKE 343


>ref|XP_003539358.1| PREDICTED: uncharacterized protein LOC100797981 [Glycine max]
          Length = 970

 Score =  151 bits (382), Expect = 4e-34
 Identities = 110/334 (32%), Positives = 162/334 (48%), Gaps = 39/334 (11%)
 Frame = +3

Query: 648  QMFFKFYKLNPPTFSGGSNAMAAQYWLEAIEKIYQVVQCTEEQKVTFATHMLSEEAENWW 827
            Q    F K +PP FSG  +   A+ WL   EKI++ + C EE KV +AT ML  EAENWW
Sbjct: 35   QGLMAFRKNHPPKFSGDYDPEGARLWLAETEKIFEAMGCLEEHKVPYATFMLQGEAENWW 94

Query: 828  KGERAHLVAVGTPQDWNHFKEAFLNKYFPTTLKKQKDREFMQLKQGEMTVAEYVNKFEEL 1007
            K  R    A      WN FK  FL  YFP  L+K+K REF+ LKQG M+V EY  KF EL
Sbjct: 95   KFVRPTFAAPRGVIPWNVFKGKFLENYFPRDLRKRKAREFLDLKQGNMSVGEYTTKFNEL 154

Query: 1008 ARYSSHVRYAADEEWKIDQFKWGLRADIRNCLAQITFTSYATLVHQSYVAEESLNSMFEE 1187
             +Y    + A +EE    QF+ GLR +I+  ++ +  T +  LV +  + E+ +    +E
Sbjct: 155  LQYWPQYQDARNEEDLCAQFENGLRLEIQQEVSYMQITDFNQLVTKCRIFEDKM----KE 210

Query: 1188 KQLR---WQKKKDEGKSSQHSKVKG-NLNKGKQAQEVATVT-PRG----CPNCGKFH-KG 1337
            +Q R     ++    + + + ++K  + NKGKQ   ++ ++  RG    C  CG  H + 
Sbjct: 211  RQARGFGGPQRSHPFRGNSNKRMKPYSRNKGKQPMAMSNMSQSRGTGVQCFQCGGPHLRR 270

Query: 1338 VC---MTGQDICFYCRQSGHVQKNC--------------------PKLKQGRASGPNDPT 1448
             C      Q+  + C + GH  + C                      ++    S  N+ +
Sbjct: 271  NCPQLQQAQEKRYICGKVGHYARECRVTGRPTVTVNSNTVNRGPTNSIRSDNVSNNNNTS 330

Query: 1449 QGR------VFALSAKKAKGVDNLIKDKGKAKEK 1532
             GR      VFA+S  +A   D+LI+D    KEK
Sbjct: 331  GGRPKVPSWVFAMSGSEAAASDDLIQD---CKEK 361

