BLASTX nr result

ID: Glycyrrhiza23_contig00003252 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00003252
         (1756 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAE71297.1| hypothetical protein [Trifolium pratense]             338   3e-90
ref|XP_003552493.1| PREDICTED: uncharacterized protein LOC100814...   338   4e-90
ref|XP_003521066.1| PREDICTED: uncharacterized protein LOC100791...   337   6e-90
ref|XP_003529036.1| PREDICTED: uncharacterized protein LOC100805...   337   8e-90
dbj|BAE71280.1| putative nuclear antigen homolog [Trifolium prat...   336   1e-89

>dbj|BAE71297.1| hypothetical protein [Trifolium pratense]
          Length = 371

 Score =  338 bits (867), Expect = 3e-90
 Identities = 210/384 (54%), Positives = 231/384 (60%), Gaps = 6/384 (1%)
 Frame = +3

Query: 234  MNPFDLLGDDAEDPSQQLIAAEQLKAAAPAVAKKGQEQQGKATGRGGAQAQQLSKTAQLP 413
            +NPFDLLGDDAEDPSQ LIAAEQLKAAA    KKG EQ  +A  +         K AQLP
Sbjct: 4    INPFDLLGDDAEDPSQ-LIAAEQLKAAA--APKKGTEQGKQAAPK---------KAAQLP 51

Query: 414  SKPLPPSQAVREARNEAPXXXXXXXXXXXXXXXXXXXXXXXXXXDFSSDENSFPSSGAPA 593
            SKPLPPSQAVREARNE                            DFS+D+NS P+   PA
Sbjct: 52   SKPLPPSQAVREARNEPSRGGRGGGRGFGRGRGGGGSGGGGYGRDFSNDDNSLPAP--PA 109

Query: 594  NQGAFEEGDAGNPSEXXXXXXXXXXXXXXXXXX---FSNGEAGEEGRPRRAFERRSGTGR 764
            NQG+FE GD+GNPSE                     FSNGEAGEEGRPRR F+R SG+GR
Sbjct: 110  NQGSFE-GDSGNPSERRGYGAPRAPYRVGGERRRGGFSNGEAGEEGRPRRTFDRHSGSGR 168

Query: 765  GSEVKRDGAGRGNWGTETDEIAQVTEEVANETQKNLSDEKPAGEEDA-ADGNKXXXXXXX 941
            G   KR+GAGRGNWGT++DEIAQVTEEV+NE +KN+++EKPAG EDA A+GNK       
Sbjct: 169  GGGFKREGAGRGNWGTQSDEIAQVTEEVSNEPEKNVAEEKPAGGEDATAEGNKDAPANEA 228

Query: 942  XXXXXXXXXXXXXXYEKVLEEKRKALQALKTEVRKVDTKEFESMQPLSNKKDGHEIFIKL 1121
                          YEKVLEEKRKALQALKTE RKVDTKEFESM+PLS KK+  EIF KL
Sbjct: 229  EEKEPEDKEMTLEEYEKVLEEKRKALQALKTEGRKVDTKEFESMKPLSCKKENDEIFAKL 288

Query: 1122 GSDKDKRKDAFXXXXXXXXXXXINEFLKPA--EXXXXXXXXXXXXXXXXXXXXXXXXXXX 1295
            GSDKDKRKDAF           INEFLKPA  E                           
Sbjct: 289  GSDKDKRKDAF-EKEKAKKALSINEFLKPAEGEKYYNPGGRGGRGGRGGRGGSRGGGYGG 347

Query: 1296 XXXXNVSAPSIEDPGQFPTLGGGK 1367
                NV APSIEDPGQFPTLGGGK
Sbjct: 348  NAYSNVPAPSIEDPGQFPTLGGGK 371


>ref|XP_003552493.1| PREDICTED: uncharacterized protein LOC100814825 [Glycine max]
          Length = 370

 Score =  338 bits (866), Expect = 4e-90
 Identities = 201/378 (53%), Positives = 222/378 (58%), Gaps = 4/378 (1%)
 Frame = +3

Query: 237  NPFDLLGDDAEDPSQQLIAAEQLKAAAPAVAKKGQEQQGKATGRGGAQAQQLSKTAQLPS 416
            NPFDLLGDDAEDPSQ LIAAEQLKAAA A A     ++  A         Q +K AQLP+
Sbjct: 5    NPFDLLGDDAEDPSQ-LIAAEQLKAAAAASAATAAPKKAPA---------QQNKPAQLPT 54

Query: 417  KPLPPSQAVREARNEAPXXXXXXXXXXXXXXXXXXXXXXXXXXDFSSDENSFPSSGAPAN 596
            KP PP+QAVR+ARNE                            D S+DENSFP+S AP N
Sbjct: 55   KPPPPAQAVRDARNEP----VRGGRGGGRGGGRGFGRGRGFSRDSSNDENSFPTSRAPYN 110

Query: 597  QGAFEEGDAGNPSEXXXXXXXXXXXXXXXXXXFSNGEAGE--EGRPRRAFERRSGTGRGS 770
            QG FEEGDAG  SE                  FSNGE GE  +GRPRRAF+RRSGTGRG+
Sbjct: 111  QGPFEEGDAGKSSERRSYGGPRVPYRGGRRGGFSNGETGEGEDGRPRRAFDRRSGTGRGN 170

Query: 771  EVKRDGAGRGNWGTETDEIAQVTEEVANETQKNLSDEKPAGEEDAADGNKXXXXXXXXXX 950
            E KR+G+GRGNWGT+TDE+AQVT+EV NET+KNL DEKPA EED ADGNK          
Sbjct: 171  EFKREGSGRGNWGTQTDELAQVTDEVVNETEKNLGDEKPAVEEDVADGNKDSPTNETEEK 230

Query: 951  XXXXXXXXXXXYEKVLEEKRKALQALKTEVRKVDTKEFESMQPLSNKKDGHEIFIKLGSD 1130
                       YEKVLEE+RKA QALKTE RKVDTKEFESMQ LS+KKD H+IFIKLGSD
Sbjct: 231  EPEDKEMTLEEYEKVLEERRKAFQALKTEERKVDTKEFESMQALSSKKDNHDIFIKLGSD 290

Query: 1131 KDKRKDAFXXXXXXXXXXXINEFLKPA--EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1304
            KDKRK+AF           I EFLKPA  E                              
Sbjct: 291  KDKRKEAFEKEEKSKKSVSITEFLKPAEGEAYYNPGRGRGRGRGSRGGGGGGGVYRGSST 350

Query: 1305 XNVSAPSIEDPGQFPTLG 1358
             N  APSIEDPG FP LG
Sbjct: 351  NNAPAPSIEDPGHFPNLG 368


>ref|XP_003521066.1| PREDICTED: uncharacterized protein LOC100791259 [Glycine max]
          Length = 368

 Score =  337 bits (864), Expect = 6e-90
 Identities = 202/380 (53%), Positives = 221/380 (58%), Gaps = 5/380 (1%)
 Frame = +3

Query: 237  NPFDLLGDDAEDPSQQLIAAEQLKAAAPAVAKKGQEQQGKATGRGGAQAQQLSKTAQLPS 416
            NPFDLLGDDAEDPSQ LIAAEQLKAAA A     ++ Q K   R GA AQ       LPS
Sbjct: 5    NPFDLLGDDAEDPSQ-LIAAEQLKAAAAAATAPPKKDQAKPGARSGAPAQ-------LPS 56

Query: 417  KPLPPSQAVREARNEAPXXXXXXXXXXXXXXXXXXXXXXXXXXDFSSDENSFPSSGAPAN 596
            KPLPPSQAVREA+NE                            DFS+D+NS     APAN
Sbjct: 57   KPLPPSQAVREAKNETSYGGRGGGRGGGRGFGRGRGGGGGFGRDFSNDDNS----SAPAN 112

Query: 597  QGAFEEGDAGNPSEXXXXXXXXXXXXXXXXXX-----FSNGEAGEEGRPRRAFERRSGTG 761
            QG+FE GD+GNPSE                       FSNGE G+EGRPRRAFER SGTG
Sbjct: 113  QGSFE-GDSGNPSERRGYGGPRGPYRGGGSGRGRRGGFSNGETGDEGRPRRAFERHSGTG 171

Query: 762  RGSEVKRDGAGRGNWGTETDEIAQVTEEVANETQKNLSDEKPAGEEDAADGNKXXXXXXX 941
            RG+E KR+G+GRGNWGT+ D+IA+VTEEV NET+K L+DEKP GEEDAA+GNK       
Sbjct: 172  RGNEFKREGSGRGNWGTQNDDIAEVTEEVVNETEKVLADEKPVGEEDAAEGNKDSPANEN 231

Query: 942  XXXXXXXXXXXXXXYEKVLEEKRKALQALKTEVRKVDTKEFESMQPLSNKKDGHEIFIKL 1121
                          YEKVLEEKRKALQA KTE RKVD KEF SMQPLSNKK+  EIFIKL
Sbjct: 232  EEKEPEDKEMTLEEYEKVLEEKRKALQAQKTEARKVDIKEFASMQPLSNKKENDEIFIKL 291

Query: 1122 GSDKDKRKDAFXXXXXXXXXXXINEFLKPAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1301
            GSDKDKRKDA            I EFLKPAE                             
Sbjct: 292  GSDKDKRKDALEKEEKSKKSVNITEFLKPAE----GERYYPGGRGRGRGRGSRGGYSGNA 347

Query: 1302 XXNVSAPSIEDPGQFPTLGG 1361
              N  APSIEDPG FP+LGG
Sbjct: 348  YSNAPAPSIEDPGHFPSLGG 367


>ref|XP_003529036.1| PREDICTED: uncharacterized protein LOC100805110 [Glycine max]
          Length = 368

 Score =  337 bits (863), Expect = 8e-90
 Identities = 204/380 (53%), Positives = 220/380 (57%), Gaps = 5/380 (1%)
 Frame = +3

Query: 237  NPFDLLGDDAEDPSQQLIAAEQLKAAAPAVAKKGQEQQGKATGRGGAQAQQLSKTAQLPS 416
            NPFDLLGDDAEDPSQ LIAAEQLKAAA A     ++ Q K   RGGA AQ       LPS
Sbjct: 5    NPFDLLGDDAEDPSQ-LIAAEQLKAAAAAATAPPKKDQAKPGARGGAPAQ-------LPS 56

Query: 417  KPLPPSQAVREARNEAPXXXXXXXXXXXXXXXXXXXXXXXXXXDFSSDENSFPSSGAPAN 596
            KPLPPSQAVREA+NE                            DFS+D+NS     APAN
Sbjct: 57   KPLPPSQAVREAKNETSYGSRGGGRGGGRGFGRGRGGGGGFGRDFSNDDNS----SAPAN 112

Query: 597  QGAFEEGDAGNPSEXXXXXXXXXXXXXXXXXX-----FSNGEAGEEGRPRRAFERRSGTG 761
            QG+FE GD+GN SE                       F+NGE GEEGRPRRAFE  SGTG
Sbjct: 113  QGSFE-GDSGNHSERRGYGGPRGPYRGGGGGRGRRGGFTNGEVGEEGRPRRAFEHHSGTG 171

Query: 762  RGSEVKRDGAGRGNWGTETDEIAQVTEEVANETQKNLSDEKPAGEEDAADGNKXXXXXXX 941
            RG+E KRDG+GRGNWGT+ D+IA VTEEV  ET+KN +DEKPAGEEDA +GNK       
Sbjct: 172  RGNEFKRDGSGRGNWGTQNDDIAVVTEEVVYETEKNFADEKPAGEEDAPEGNKDCPANEN 231

Query: 942  XXXXXXXXXXXXXXYEKVLEEKRKALQALKTEVRKVDTKEFESMQPLSNKKDGHEIFIKL 1121
                          YEKVLEEKRKALQA KTEVRKVD KEF SMQPLSNKK+  EIFIKL
Sbjct: 232  EEKEPEDKEMTLEEYEKVLEEKRKALQAQKTEVRKVDIKEFASMQPLSNKKENDEIFIKL 291

Query: 1122 GSDKDKRKDAFXXXXXXXXXXXINEFLKPAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1301
            GSDKDKRKDA            I EFLKPAE                             
Sbjct: 292  GSDKDKRKDALEKEEKSKKSVNITEFLKPAE----GERYYSGGRGRGRGRGSRGGYSGNA 347

Query: 1302 XXNVSAPSIEDPGQFPTLGG 1361
              NV APSIEDPGQFPTL G
Sbjct: 348  YSNVPAPSIEDPGQFPTLSG 367


>dbj|BAE71280.1| putative nuclear antigen homolog [Trifolium pratense]
          Length = 371

 Score =  336 bits (861), Expect = 1e-89
 Identities = 200/375 (53%), Positives = 217/375 (57%)
 Frame = +3

Query: 234  MNPFDLLGDDAEDPSQQLIAAEQLKAAAPAVAKKGQEQQGKATGRGGAQAQQLSKTAQLP 413
            +NPFDLL DDAEDPS  LIAAE LKAAA  V K   ++QG    RGGAQ    +K AQLP
Sbjct: 4    INPFDLLDDDAEDPSL-LIAAELLKAAAAPVKKPADKEQGGGKQRGGAQ----TKPAQLP 58

Query: 414  SKPLPPSQAVREARNEAPXXXXXXXXXXXXXXXXXXXXXXXXXXDFSSDENSFPSSGAPA 593
            SKP PP+QAVRE+RNE                            D+S+ ENSFP SGAP 
Sbjct: 59   SKPTPPAQAVRESRNEGGRGGRGFSGRGGGRGFGGGRGGRGFGRDYSNGENSFPGSGAPE 118

Query: 594  NQGAFEEGDAGNPSEXXXXXXXXXXXXXXXXXXFSNGEAGEEGRPRRAFERRSGTGRGSE 773
            N G  EEGD    SE                  FSNGEAGEEGRPRR FER SGTGRG+E
Sbjct: 119  NHGPIEEGDKF--SERRNYGGPRPPYRGGRRGGFSNGEAGEEGRPRRTFERHSGTGRGNE 176

Query: 774  VKRDGAGRGNWGTETDEIAQVTEEVANETQKNLSDEKPAGEEDAADGNKXXXXXXXXXXX 953
             KR+GAGRGNWGTETDEIAQVTEE   E +KN+ DEKPA E DAA+GNK           
Sbjct: 177  FKREGAGRGNWGTETDEIAQVTEEAVIEGEKNIGDEKPAVENDAAEGNKDSAANEAEEKE 236

Query: 954  XXXXXXXXXXYEKVLEEKRKALQALKTEVRKVDTKEFESMQPLSNKKDGHEIFIKLGSDK 1133
                      Y+KVLEEKRKALQ +KTE RKVDTKEFE+MQ LS KKD  EIF KLGSDK
Sbjct: 237  AEDKEMTLEEYQKVLEEKRKALQVVKTEERKVDTKEFETMQALSCKKDNFEIFAKLGSDK 296

Query: 1134 DKRKDAFXXXXXXXXXXXINEFLKPAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNV 1313
            DKRK+AF           INEFLKPAE                               NV
Sbjct: 297  DKRKEAFDKEEKAKKSVSINEFLKPAE--GDAYHRGRGGRGREARGGGGGYRGGNLNRNV 354

Query: 1314 SAPSIEDPGQFPTLG 1358
             APSIEDPG FPTLG
Sbjct: 355  RAPSIEDPGHFPTLG 369

