BLASTX nr result

ID: Glycyrrhiza23_contig00020547 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00020547
         (1682 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003551053.1| PREDICTED: uncharacterized protein LOC100794...   317   8e-84
ref|XP_003538823.1| PREDICTED: uncharacterized protein LOC100817...   293   1e-76
ref|XP_003517284.1| PREDICTED: uncharacterized protein LOC100785...   283   1e-73
ref|XP_002522328.1| DNA binding protein, putative [Ricinus commu...   244   7e-62
ref|XP_002306455.1| predicted protein [Populus trichocarpa] gi|2...   243   1e-61

>ref|XP_003551053.1| PREDICTED: uncharacterized protein LOC100794220 [Glycine max]
          Length = 344

 Score =  317 bits (811), Expect = 8e-84
 Identities = 204/387 (52%), Positives = 242/387 (62%), Gaps = 13/387 (3%)
 Frame = +1

Query: 211  MELSLDLSLAFIVPRSTVSELLGEASQSKDDDGSQKMTTRHEDLVKRLEDEMRKIEAFKR 390
            ME+SLDLSLAF VPR TV E+LG+ ++SKD  GS++M T  EDLVKRLEDE +KIEAFK 
Sbjct: 1    MEVSLDLSLAF-VPRRTVCEILGDIAKSKD--GSRRMATI-EDLVKRLEDEKKKIEAFK- 55

Query: 391  ELPLCMTLVNDAITRLKEERDKCTRVQDEPVVEESLPVVKLGTSCGGNGSMMTIGKGSSD 570
                        I++LKEE     R++DE VVEE + ++K  T+   NGS+M +G  SSD
Sbjct: 56   -----------PISKLKEEIKGGVRMKDEAVVEELMKLMK--TNSEANGSLMIVGNESSD 102

Query: 571  KKNWMSSAQLWNAETESRNAEDDRCVPHNPIHQPPRKETNNSGGPFLAFIGNSGVSKTVM 750
             KNWM+S QLWN ET+ RN E D  VP NPI Q  + +TN S            VSKTVM
Sbjct: 103  TKNWMNSVQLWNVETKQRNEEGDLFVPSNPIEQ--KNDTNKS------------VSKTVM 148

Query: 751  MREREDKEVSEVPSLGLMTLPASDLNR---------GSSR---SLVEIKGXXXXXXXXXX 894
               +++K++S+VPSLGLM+    +LN          GSS    S VEIKG          
Sbjct: 149  ---KDNKKMSQVPSLGLMSPAVLELNHRKTESGYGHGSSMIITSSVEIKGHHQSQQPQQN 205

Query: 895  XXXXX-CWSPELHRRFVDALQQLGGPQVATPKQIRELMQVEGLTNDEVKSHLQKYRLHFR 1071
                  CWSP+LHRRFVDALQQLGGPQVATPKQIRELMQV GLTNDEVKSHLQKYRLHF+
Sbjct: 206  PRKQRRCWSPDLHRRFVDALQQLGGPQVATPKQIRELMQVVGLTNDEVKSHLQKYRLHFK 265

Query: 1072 RIPISPNGQANNGGILMAQDECGDKXXXXXXKLKGTRSSQSGSPQGPLFLRXXXXXXXXX 1251
            R   S  G AN+G   MAQD+CGD               +SGSPQGPLFL          
Sbjct: 266  RPQGSSIGHANSGLCKMAQDKCGD--------------DKSGSPQGPLFL--GGSGKGLS 309

Query: 1252 XXXRNSMDTEEEDEQSDCHSWKGVLHH 1332
               RNSMDT E DE+SDC +WKG +HH
Sbjct: 310  SSGRNSMDT-EGDEESDCRNWKGGIHH 335


>ref|XP_003538823.1| PREDICTED: uncharacterized protein LOC100817326 [Glycine max]
          Length = 342

 Score =  293 bits (749), Expect = 1e-76
 Identities = 193/396 (48%), Positives = 228/396 (57%), Gaps = 17/396 (4%)
 Frame = +1

Query: 211  MELSLDLSLAFIVPRSTVSELLGEASQSKDDDGSQKMTTRHEDLVKRLEDEMRKIEAFKR 390
            MELSLDLSL F VP+  +S    + S ++D     K+ T  +  V+RLE+E++K+EAFKR
Sbjct: 1    MELSLDLSLGF-VPKP-LSLFFADVSANRD-----KVATL-DGFVQRLEEELKKVEAFKR 52

Query: 391  ELPLCMTLVNDAITRLKEERDKCTRVQDEPVVEESLPVVKLGTSCGGNGSMMTIGKGSSD 570
            ELPLC+ L+NDAI RLKEE+ KC+ +QD P          L TS GGN +       SS+
Sbjct: 53   ELPLCILLLNDAIARLKEEKVKCSGMQDPP----------LKTSSGGNKNE------SSE 96

Query: 571  KKNWMSSAQLWNAE-TESRNAEDDRCVPHNPIHQPPRKETNNSGGPFLAFIGNSGVSKTV 747
            K NWMSSAQLW+ + T+SRN EDDR VP NPI+                  GNS V    
Sbjct: 97   KMNWMSSAQLWSTQKTKSRNEEDDRSVPANPIN------------------GNSCVL--- 135

Query: 748  MMREREDKEVSEVPSLGLMTLPAS-----------DLNRGSSRSLVEIKGXXXXXXXXXX 894
                  +KE S+VP  GLM   +            D++ GSS   VE++           
Sbjct: 136  ------EKEGSQVPRFGLMARASELSHSNSKSVGGDISSGSSLLRVEVQSQPQPPQHMQQ 189

Query: 895  XXXXX--CWSPELHRRFVDALQQLGGPQVATPKQIRELMQVEGLTNDEVKSHLQKYRLHF 1068
                   CWSPELHRRFVDALQQLGG QVATPKQIRELMQVEGLTNDEVKSHLQKYRLH 
Sbjct: 190  NPRKQRRCWSPELHRRFVDALQQLGGAQVATPKQIRELMQVEGLTNDEVKSHLQKYRLHV 249

Query: 1069 RRIPISPNGQANNGGILMAQDECGDKXXXXXXKLKGTRSSQSGSPQG---PLFLRXXXXX 1239
            RR P+S  GQA+NG   M+QDE GDK        KG   SQSGSPQG   PL L      
Sbjct: 250  RRFPVSSTGQADNGS-WMSQDESGDKS-------KGNNMSQSGSPQGPLTPLILGGGGGG 301

Query: 1240 XXXXXXXRNSMDTEEEDEQSDCHSWKGVLHHPPLEA 1347
                         + EDEQSDC +WKG LHH  LEA
Sbjct: 302  SAKGLSSPGQNSVDGEDEQSDCRNWKGGLHHHQLEA 337


>ref|XP_003517284.1| PREDICTED: uncharacterized protein LOC100785723 [Glycine max]
          Length = 343

 Score =  283 bits (724), Expect = 1e-73
 Identities = 191/396 (48%), Positives = 230/396 (58%), Gaps = 18/396 (4%)
 Frame = +1

Query: 211  MELSLDLSLAFIVPRSTVSELLGEASQSKDDDGSQKMTTRHEDLVKRLEDEMRKIEAFKR 390
            ME SLDL L F VP+  +S   G+ S ++D     K+ T  +  V+RLE+E+ K+EAFKR
Sbjct: 1    MEPSLDLRLGF-VPKP-LSLFFGDVSGNRDK--CDKVVTL-DGFVQRLEEELTKVEAFKR 55

Query: 391  ELPLCMTLVNDAITRLKEERDKCTRVQDEPVVEESLPVVKLGTSCGGNGSMMTIGKGSSD 570
            ELPLC+ L+NDAI RLKEE+ KC+ +QD P          L TS GGN +       +S+
Sbjct: 56   ELPLCILLLNDAIARLKEEKVKCSGMQDPP----------LKTSSGGNENE------NSE 99

Query: 571  KKNWMSSAQLWNAE-TESRNAEDDRCVPHNPIHQPPRKETNNSGGPFLAFIGNSGVSKTV 747
            KKNWMSSAQLW+ + ++SRN EDDR VP N I+                  GNS V    
Sbjct: 100  KKNWMSSAQLWSTQKSKSRNEEDDRSVPANSIN------------------GNSCVP--- 138

Query: 748  MMREREDKEVSEVPSLGLMTLPASDLNRGSSRSL--------------VEIKGXXXXXXX 885
                  +KE S+VPS GLM   AS+L+  +S+S+              V+ +        
Sbjct: 139  ------EKEGSQVPSFGLMAR-ASELSHSNSKSVGGDTSSGSSLLRVEVQSQPQPPQHMQ 191

Query: 886  XXXXXXXXCWSPELHRRFVDALQQLGGPQVATPKQIRELMQVEGLTNDEVKSHLQKYRLH 1065
                    CWSPELHRRFVDALQQLGG QVATPKQIRELMQVEGLTNDEVKSHLQKYRLH
Sbjct: 192  QNPRKQRRCWSPELHRRFVDALQQLGGAQVATPKQIRELMQVEGLTNDEVKSHLQKYRLH 251

Query: 1066 FRRIPISPNGQANNGGILMAQDECGDKXXXXXXKLKGTRSSQSGSPQG---PLFLRXXXX 1236
             RR P+   GQ +NG   M QDECGDK        KG   SQSGSPQG   PL L     
Sbjct: 252  VRRFPVFSIGQVDNGS-WMTQDECGDKS-------KG-NMSQSGSPQGPLTPLLLGGAGS 302

Query: 1237 XXXXXXXXRNSMDTEEEDEQSDCHSWKGVLHHPPLE 1344
                    RNS+D E+E + SDC +WKG LHH  LE
Sbjct: 303  AKGLSSPGRNSVDAEDE-QSSDCRNWKGGLHHQQLE 337


>ref|XP_002522328.1| DNA binding protein, putative [Ricinus communis]
            gi|223538406|gb|EEF40012.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 370

 Score =  244 bits (622), Expect = 7e-62
 Identities = 178/410 (43%), Positives = 222/410 (54%), Gaps = 29/410 (7%)
 Frame = +1

Query: 211  MELSLDLSLAFIVPRSTVSELLGEASQSKDDDGSQKMTTRHEDLVKRLEDEMRKIEAFKR 390
            MELSLDLSL + VP+ T+SE L E S+ KD   S    ++ +D V+RLEDEMRKI+AFKR
Sbjct: 1    MELSLDLSLVY-VPK-TISEYLKEVSKVKD---SSLKLSKLDDYVQRLEDEMRKIDAFKR 55

Query: 391  ELPLCMTLVNDAITRLKEERDKCTRVQDEPVVEESLPVVKLGTSCGGNGSMMTIGKGSSD 570
            ELPLCM L+NDAI RLKEE  +C  +++   V+        G SC        +     D
Sbjct: 56   ELPLCMLLLNDAIVRLKEEAMQCKELEEVVSVK--------GNSC--RNEERELENDMID 105

Query: 571  KKNWMSSAQLWN------------------AETESRNAEDD-RCVPHNPIHQPPRKETNN 693
            KKNWMSS QLWN                  +ET+ R+ EDD R    NP      +   +
Sbjct: 106  KKNWMSSVQLWNNNNHTNNNNFDSENQESKSETKQRSEEDDDRSTCENPTQLCNHR---S 162

Query: 694  SGGPFLAFIGNSGVSKTVMMREREDKEV-SEVPSLGLMTLPASDLNRGSSRSLV------ 852
             GG F+ F   SG  K      +E+KEV S+V  L LMT P S+L    SR+L+      
Sbjct: 163  KGGGFMPFKSTSGFEK------KEEKEVVSQVTGLSLMT-PVSELG---SRNLMSKTNGT 212

Query: 853  ---EIKGXXXXXXXXXXXXXXXCWSPELHRRFVDALQQLGGPQVATPKQIRELMQVEGLT 1023
               +I+                CWSPELHRRF+DAL QLGG QVATPKQIRELMQV+GLT
Sbjct: 213  DQFKIQNKPQQQQQQPYKKQRRCWSPELHRRFIDALHQLGGSQVATPKQIRELMQVDGLT 272

Query: 1024 NDEVKSHLQKYRLHFRRIPISPNGQANNGGILMAQDECGDKXXXXXXKLKGTRSSQSGSP 1203
            NDEVKSHLQKYRLH R++P S   QAN   + MAQ+   D             +S+S SP
Sbjct: 273  NDEVKSHLQKYRLHIRKLPASSAAQAN--ALWMAQNGHKDDS-------SKQSNSKSSSP 323

Query: 1204 QGPLFLRXXXXXXXXXXXXRNSMDTEEEDEQSDCHSWKGVLHHPPLEAGV 1353
            QGP  L              +SM+  E+D++S  +SW G  H    E  V
Sbjct: 324  QGP--LHGCGSAKGMSSTGGDSMEV-EDDDRSVSNSWNGRQHKQAGEVDV 370


>ref|XP_002306455.1| predicted protein [Populus trichocarpa] gi|222855904|gb|EEE93451.1|
            predicted protein [Populus trichocarpa]
          Length = 333

 Score =  243 bits (620), Expect = 1e-61
 Identities = 173/392 (44%), Positives = 213/392 (54%), Gaps = 17/392 (4%)
 Frame = +1

Query: 211  MELSLDLSLAFIVPRSTVSELLGEASQSKDDDGSQKMTTRHEDLVKRLEDEMRKIEAFKR 390
            MELSLDLSL + VP++ +SE L E S  KD  GSQK+    +D VKRLEDE RKI+AFKR
Sbjct: 1    MELSLDLSLVY-VPKA-ISECLKEVSMVKD--GSQKLPNP-DDYVKRLEDERRKIDAFKR 55

Query: 391  ELPLCMTLVNDAITRLKEERDKCTRVQDEPVVEESLPVVKLGTSCGGNGSMMTIGKGSSD 570
            ELPLCM L+N+AI RLKEE  +C  +        +L  +K  ++  GN           D
Sbjct: 56   ELPLCMLLLNEAIIRLKEEAMQCKELN-------ALVPLKGDSNEDGN-----------D 97

Query: 571  KKNWMSSAQLWN---------------AETESRNAEDD-RCVPHNPIHQPPRKETNNSGG 702
            KK WMSS QLWN               +E + R  EDD R    NPI         N GG
Sbjct: 98   KKKWMSSVQLWNTNNNINLDCKNQDTRSEPKQRGEEDDDRSTCENPIQLGNH---GNKGG 154

Query: 703  PFLAFIGNSGVSKTVMMREREDKEV-SEVPSLGLMTLPASDLNRGSSRSLVEIKGXXXXX 879
             F+ F   SG  ++   +++E+KEV S+V  L LM        R   R            
Sbjct: 155  AFVPFKALSGFERS---KKKEEKEVVSQVTGLSLMKQQRQHAYRKQRR------------ 199

Query: 880  XXXXXXXXXXCWSPELHRRFVDALQQLGGPQVATPKQIRELMQVEGLTNDEVKSHLQKYR 1059
                      CWSPELHR FVDALQQLGG QVATPKQIRELMQV+GLTNDEVKSHLQKYR
Sbjct: 200  ----------CWSPELHRCFVDALQQLGGYQVATPKQIRELMQVDGLTNDEVKSHLQKYR 249

Query: 1060 LHFRRIPISPNGQANNGGILMAQDECGDKXXXXXXKLKGTRSSQSGSPQGPLFLRXXXXX 1239
            LH R++P S    AN+  +  +QD+C D              S+S SP+ P  L      
Sbjct: 250  LHLRKVPASSATPAND--LWKSQDQCEDPVMH--------NISESNSPKAP--LHGSSSA 297

Query: 1240 XXXXXXXRNSMDTEEEDEQSDCHSWKGVLHHP 1335
                    +SM+  E+D++S+ HSW GVLHHP
Sbjct: 298  KAASNSGGDSMEA-EDDDKSESHSWNGVLHHP 328


Top