BLASTX nr result

ID: Glycyrrhiza23_contig00013313 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00013313
         (1672 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003530891.1| PREDICTED: nuclear pore complex protein Nup1...   759   0.0  
ref|XP_003525230.1| PREDICTED: nuclear pore complex protein Nup1...   751   0.0  
ref|XP_003630944.1| Nuclear pore complex protein Nup155 [Medicag...   750   0.0  
ref|XP_002526002.1| protein with unknown function [Ricinus commu...   632   e-178
emb|CBI26833.3| unnamed protein product [Vitis vinifera]              629   e-178

>ref|XP_003530891.1| PREDICTED: nuclear pore complex protein Nup155-like [Glycine max]
          Length = 1486

 Score =  759 bits (1959), Expect = 0.0
 Identities = 397/485 (81%), Positives = 417/485 (85%), Gaps = 5/485 (1%)
 Frame = +2

Query: 2    LHKEFGSPIRSAASQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDXXXXXXXX 181
            L +EFG+PI+S ASQSALDPASRKKYI QIVQLGVQSPDRIFHEYLYQAMID        
Sbjct: 1002 LQREFGTPIKSTASQSALDPASRKKYICQIVQLGVQSPDRIFHEYLYQAMIDLGLENELL 1061

Query: 182  XXXXPDLLPFLQSAGRKPIHEVRAVTATTSPMGQSGAPMSSNQVKYYELLARYYVLKRQH 361
                PDLLPFLQSAGR  IHEVRAVTATTSP+GQSGAPMSSNQVKYYELLARYYVLKRQH
Sbjct: 1062 EYGGPDLLPFLQSAGRNSIHEVRAVTATTSPVGQSGAPMSSNQVKYYELLARYYVLKRQH 1121

Query: 362  MXXXXXXXXXXXXXSIDGVPTLEQRCQYLSNAVLQAKNATSSDGLVGSTRSSIDSGLLDL 541
            M             S DGVPTLEQRCQYLSNAVLQAKNAT+SDGLVGS R SIDSG LDL
Sbjct: 1122 MLAAHALLRLAERRSTDGVPTLEQRCQYLSNAVLQAKNATNSDGLVGSGRISIDSGFLDL 1181

Query: 542  IEGKLAVLRFQIKIKEELEAMASRSEVLHSTSNSIENGLVPEGSSTVDANFANATREKAK 721
            +EGKLAVL FQIKIKEELE+MASRS+VL  TS S ENG+VPEGSST DANFANATREKAK
Sbjct: 1182 LEGKLAVLWFQIKIKEELESMASRSDVLPGTSESAENGVVPEGSSTADANFANATREKAK 1241

Query: 722  ELSSDVKSITQLYNEYAVPFELWEICLEMLYFANYSGDNDSSIVRETWARLKDQAISRGG 901
            EL+SDVKSITQLYNEYAVPF LWEICLEMLYFANYSGD DSSIVRETWARL DQAISRGG
Sbjct: 1242 ELASDVKSITQLYNEYAVPFGLWEICLEMLYFANYSGDTDSSIVRETWARLMDQAISRGG 1301

Query: 902  IAEACSVIKRVGPRLYPGDGAILPLDIICLQLEKAGLERLNSGVESVGDEDVARALVSAC 1081
            IAEACSV+KRVGPR+YPGDGA+LPLDIICL LEKAGLERLNSGVE+VGDEDVARALVSAC
Sbjct: 1302 IAEACSVLKRVGPRIYPGDGAVLPLDIICLHLEKAGLERLNSGVEAVGDEDVARALVSAC 1361

Query: 1082 KGAAEPVLNAYDQLLSNGAILTSPNXXXXXXXXXXXXXXEWAMSVYSQRMGTSAS----- 1246
            KGAAEPVLNAYDQLLSNGAIL SP+              EWAMSVYSQRMG+S++     
Sbjct: 1362 KGAAEPVLNAYDQLLSNGAILPSPSVRLRMLRSVLVVLREWAMSVYSQRMGSSSATGHSL 1421

Query: 1247 ILGGGFSLERTVASQGIRDKITSAANRYMTEVRRLALPQSQTELVYRGFRELEESLISPH 1426
            ILGGGFS ERT+ASQGIRDKITSAANRYMTEVRRLALPQ+QTE VYRGFRELEES IS H
Sbjct: 1422 ILGGGFSTERTIASQGIRDKITSAANRYMTEVRRLALPQNQTEHVYRGFRELEESFISQH 1481

Query: 1427 SFDRF 1441
            SFDRF
Sbjct: 1482 SFDRF 1486


>ref|XP_003525230.1| PREDICTED: nuclear pore complex protein Nup155-like [Glycine max]
          Length = 1485

 Score =  751 bits (1939), Expect = 0.0
 Identities = 393/484 (81%), Positives = 417/484 (86%), Gaps = 4/484 (0%)
 Frame = +2

Query: 2    LHKEFGSPIRSAASQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDXXXXXXXX 181
            L +EFG+PIRS ASQSALDPASRKKYI QIVQLGVQSPDRIFHEYLYQAMID        
Sbjct: 1002 LQREFGTPIRSTASQSALDPASRKKYICQIVQLGVQSPDRIFHEYLYQAMIDLGLENELL 1061

Query: 182  XXXXPDLLPFLQSAGRKPIHEVRAVTATTSPMGQSGAPMSSNQVKYYELLARYYVLKRQH 361
                PDLLPFLQSAGR  +HEVRAVTAT SP+GQSGAPMSSNQVKYYELLARYYVLKRQH
Sbjct: 1062 EYGGPDLLPFLQSAGRNSLHEVRAVTATISPVGQSGAPMSSNQVKYYELLARYYVLKRQH 1121

Query: 362  MXXXXXXXXXXXXXSIDGVPTLEQRCQYLSNAVLQAKNATSSDGLVGSTRSSIDSGLLDL 541
            M             SIDGVPTLE RCQYLSNAVLQAKNAT+SDGLVGS RSSIDSG LDL
Sbjct: 1122 MLAAHALLRLAERRSIDGVPTLELRCQYLSNAVLQAKNATNSDGLVGSGRSSIDSGFLDL 1181

Query: 542  IEGKLAVLRFQIKIKEELEAMASRSEVLHSTSNSIENGLVPEGSSTVDANFANATREKAK 721
            +EGKLAVLRFQIKIKEELE++ASRS+VL +T +S ENG+VPEGSST DANFANATREKAK
Sbjct: 1182 LEGKLAVLRFQIKIKEELESVASRSDVLPATPDSAENGVVPEGSSTADANFANATREKAK 1241

Query: 722  ELSSDVKSITQLYNEYAVPFELWEICLEMLYFANYSGDNDSSIVRETWARLKDQAISRGG 901
            EL+SDVKSITQLYNEYAVPF LWEICLEMLYFAN+S D DSSIVRETWARL DQAISRGG
Sbjct: 1242 ELASDVKSITQLYNEYAVPFGLWEICLEMLYFANFSSDTDSSIVRETWARLIDQAISRGG 1301

Query: 902  IAEACSVIKRVGPRLYPGDGAILPLDIICLQLEKAGLERLNSGVESVGDEDVARALVSAC 1081
            IAEACSV+KRVGPR+YPGDGA+LPLDIICL LEKAGLERLNSGVE+VGDEDVARALVSAC
Sbjct: 1302 IAEACSVLKRVGPRIYPGDGAVLPLDIICLHLEKAGLERLNSGVEAVGDEDVARALVSAC 1361

Query: 1082 KGAAEPVLNAYDQLLSNGAILTSPNXXXXXXXXXXXXXXEWAMSVYSQRMGTSAS----I 1249
            KGAAEPVLNAYDQLLSNGAIL S +              EWAMSVYSQRMG+SA+    I
Sbjct: 1362 KGAAEPVLNAYDQLLSNGAILPSASVRLRMLRSVLVVLREWAMSVYSQRMGSSAAGHSLI 1421

Query: 1250 LGGGFSLERTVASQGIRDKITSAANRYMTEVRRLALPQSQTELVYRGFRELEESLISPHS 1429
            LGGGFS ERT+ASQGIRDKITSAANRYMTE+RRLALPQ+QTE VYRGFRELEES IS HS
Sbjct: 1422 LGGGFSSERTIASQGIRDKITSAANRYMTELRRLALPQNQTEHVYRGFRELEESFISQHS 1481

Query: 1430 FDRF 1441
            FDRF
Sbjct: 1482 FDRF 1485


>ref|XP_003630944.1| Nuclear pore complex protein Nup155 [Medicago truncatula]
            gi|355524966|gb|AET05420.1| Nuclear pore complex protein
            Nup155 [Medicago truncatula]
          Length = 1484

 Score =  750 bits (1937), Expect = 0.0
 Identities = 395/482 (81%), Positives = 414/482 (85%), Gaps = 4/482 (0%)
 Frame = +2

Query: 8    KEFGSPIRSAASQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDXXXXXXXXXX 187
            KEFGSPI SA SQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMID          
Sbjct: 1004 KEFGSPIGSA-SQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDLGLENELLEY 1062

Query: 188  XXPDLLPFLQSAGRKPIHEVRAVTATTSPMGQSGAPMSSNQVKYYELLARYYVLKRQHMX 367
              PDLLPFL+SAGR PIHEVRAVTATTSPMGQSGAPMSSNQVKY+ELLARYYVLKRQHM 
Sbjct: 1063 GGPDLLPFLKSAGRTPIHEVRAVTATTSPMGQSGAPMSSNQVKYFELLARYYVLKRQHML 1122

Query: 368  XXXXXXXXXXXXSIDGVPTLEQRCQYLSNAVLQAKNATSSDGLVGSTRSSIDSGLLDLIE 547
                        S DGVPTLEQRCQYLSNAVLQAKNAT+SDGLV STRSS D+GLLD++E
Sbjct: 1123 AAHALLRLAGRPSTDGVPTLEQRCQYLSNAVLQAKNATNSDGLVSSTRSSSDTGLLDMLE 1182

Query: 548  GKLAVLRFQIKIKEELEAMASRSEVLHSTSNSIENGLVPEGSSTVDANFANATREKAKEL 727
            GKLAVLRFQIKIKEELE MAS SEVLHSTSNS+ENGLV + S TVDANFANATREKAKEL
Sbjct: 1183 GKLAVLRFQIKIKEELEHMASSSEVLHSTSNSVENGLVSDASPTVDANFANATREKAKEL 1242

Query: 728  SSDVKSITQLYNEYAVPFELWEICLEMLYFANYSGDNDSSIVRETWARLKDQAISRGGIA 907
            SSD+KSITQLYNEYAVPF+LWE CLEMLYFANYSGD+DSSIVRETWARL DQAIS GGIA
Sbjct: 1243 SSDLKSITQLYNEYAVPFKLWETCLEMLYFANYSGDSDSSIVRETWARLIDQAISGGGIA 1302

Query: 908  EACSVIKRVGPRLYPGDGAILPLDIICLQLEKAGLERLNSGVESVGDEDVARALVSACKG 1087
            EACSV+KR+GPRLYPGDG +  LDIICL LEKA LERLN+GVESVGDEDVARALVSACKG
Sbjct: 1303 EACSVLKRLGPRLYPGDGTVFQLDIICLHLEKAALERLNTGVESVGDEDVARALVSACKG 1362

Query: 1088 AAEPVLNAYDQLLSNGAILTSPNXXXXXXXXXXXXXXEWAMSVYSQRMGTSAS----ILG 1255
            AAEPVLNAYDQLLSNGAIL SPN              EWAMS+YS RMGT A+    I+G
Sbjct: 1363 AAEPVLNAYDQLLSNGAILPSPNLRLRMLRSVLVVLREWAMSIYSHRMGTGATGSSIIIG 1422

Query: 1256 GGFSLERTVASQGIRDKITSAANRYMTEVRRLALPQSQTELVYRGFRELEESLISPHSFD 1435
            GGFSLERTVASQGIRDKITS ANRYMTEVRRLALPQSQTE VY GF+ELEESLISPHSFD
Sbjct: 1423 GGFSLERTVASQGIRDKITSVANRYMTEVRRLALPQSQTEGVYCGFKELEESLISPHSFD 1482

Query: 1436 RF 1441
            RF
Sbjct: 1483 RF 1484


>ref|XP_002526002.1| protein with unknown function [Ricinus communis]
            gi|223534734|gb|EEF36426.1| protein with unknown function
            [Ricinus communis]
          Length = 1490

 Score =  632 bits (1629), Expect = e-178
 Identities = 337/490 (68%), Positives = 384/490 (78%), Gaps = 10/490 (2%)
 Frame = +2

Query: 2    LHKEFGSPIRSAASQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDXXXXXXXX 181
            L +EFGSP+R +AS++ LD ASR+KYISQIVQLGVQSPDR+FHEYLY+ MID        
Sbjct: 1003 LQREFGSPLRPSASRAVLDQASRRKYISQIVQLGVQSPDRLFHEYLYRTMIDLGLENELL 1062

Query: 182  XXXXPDLLPFLQSAGRKPIHEVRAVTATTSP---MGQSGAPMSSNQVKYYELLARYYVLK 352
                PDL+PFLQ+AGR+ + EVRAVTA TS    +G SGAP+++NQ KY++LLARYYV K
Sbjct: 1063 EYGGPDLVPFLQNAGRETLQEVRAVTAVTSATSSIGHSGAPVTANQAKYFDLLARYYVSK 1122

Query: 353  RQHMXXXXXXXXXXXXXSIDG--VPTLEQRCQYLSNAVLQAKNATSSDGLVGSTRSSIDS 526
            RQHM             S D   VPTLEQR QYLSNAVLQAKNA+ S GLVGS + ++DS
Sbjct: 1123 RQHMLAAHILLRLAERRSTDARDVPTLEQRRQYLSNAVLQAKNASDSGGLVGSMKGALDS 1182

Query: 527  GLLDLIEGKLAVLRFQIKIKEELEAMASRSEVLHSTSNSIENGLVPEGSSTVDANFANAT 706
            GLLDL+EGKL VLRFQIKIK+ELEA+ASR E   S S  ++NG VP+ ++  D  +A   
Sbjct: 1183 GLLDLLEGKLVVLRFQIKIKDELEAIASRLESSSSMSEPVQNGSVPDNNANPD--YAKVA 1240

Query: 707  REKAKELSSDVKSITQLYNEYAVPFELWEICLEMLYFANYSGDNDSSIVRETWARLKDQA 886
            REKAKELS D+KSITQLYNEYAVPFELWEICLEMLYFANY+GD DSSIVRETWARL DQA
Sbjct: 1241 REKAKELSLDLKSITQLYNEYAVPFELWEICLEMLYFANYTGDTDSSIVRETWARLIDQA 1300

Query: 887  ISRGGIAEACSVIKRVGPRLYPGDGAILPLDIICLQLEKAGLERLNSGVESVGDEDVARA 1066
            +SRGGIAEACSV+KRVG  +YPGDGAILPLD +CL LEKA LERL SG E VGDEDVARA
Sbjct: 1301 LSRGGIAEACSVLKRVGSHIYPGDGAILPLDTLCLHLEKAALERLESGAEPVGDEDVARA 1360

Query: 1067 LVSACKGAAEPVLNAYDQLLSNGAILTSPNXXXXXXXXXXXXXXEWAMSVYSQRMGTSAS 1246
            L++ACKGA EPVLNAYDQLLSNGAIL SPN              EWAMSV +QRMGT+ S
Sbjct: 1361 LLAACKGATEPVLNAYDQLLSNGAILPSPNLRLRLLQSLLVVLREWAMSVLAQRMGTTTS 1420

Query: 1247 ----ILGGGFSLER-TVASQGIRDKITSAANRYMTEVRRLALPQSQTELVYRGFRELEES 1411
                ILGG FS E+ TV +QGIRDKITSAANRYMTEV+RL LPQS+TE VYRGFR+LEES
Sbjct: 1421 GASLILGGTFSQEQTTVINQGIRDKITSAANRYMTEVKRLPLPQSKTEAVYRGFRDLEES 1480

Query: 1412 LISPHSFDRF 1441
            LISP SF+RF
Sbjct: 1481 LISPFSFNRF 1490


>emb|CBI26833.3| unnamed protein product [Vitis vinifera]
          Length = 753

 Score =  629 bits (1623), Expect = e-178
 Identities = 339/499 (67%), Positives = 386/499 (77%), Gaps = 10/499 (2%)
 Frame = +2

Query: 8    KEFGSPIRSAASQSALDPASRKKYISQIVQLGVQSPDRIFHEYLYQAMIDXXXXXXXXXX 187
            KEFGSP+R AA +S LD ASR KYI QIVQLGVQS DR+FHEYLY+ MID          
Sbjct: 252  KEFGSPVRPAA-RSTLDQASRDKYIRQIVQLGVQSSDRVFHEYLYRTMIDLGLENELLEY 310

Query: 188  XXPDLLPFLQSAGRKPIHEVRAV---TATTSPMGQSGAPMSSNQVKYYELLARYYVLKRQ 358
              PDL+PFLQ+AGR+ + EVRAV   T+T SP+G  GAP+ SNQ KY++LLARYYVLKRQ
Sbjct: 311  GGPDLVPFLQNAGRESLQEVRAVSSITSTRSPVGLFGAPIPSNQTKYFDLLARYYVLKRQ 370

Query: 359  HMXXXXXXXXXXXXXSIDG--VPTLEQRCQYLSNAVLQAKNATSSDGLVGSTRSSIDSGL 532
            H+             S D   VPTLEQR QYLSNAVLQAKNA++SDGLVGS R + D+GL
Sbjct: 371  HVLAAHVLLRLAERRSTDAGDVPTLEQRRQYLSNAVLQAKNASNSDGLVGSVRGASDNGL 430

Query: 533  LDLIEGKLAVLRFQIKIKEELEAMASRSEVLHSTSNSIENGLVPEGSSTVDANFANATRE 712
            LDL+EGKLAVLRFQIKIK ELEA+ASR E  + TS S+ N    E +   D NFAN  +E
Sbjct: 431  LDLLEGKLAVLRFQIKIKGELEAIASRLESSNVTSESVLNESCSESNLNADTNFANTVQE 490

Query: 713  KAKELSSDVKSITQLYNEYAVPFELWEICLEMLYFANYSGDNDSSIVRETWARLKDQAIS 892
            KA+E+S D+KSITQLYNEYAVPFELWEICLEMLYFANYSGD DSSIVRETWARL DQA+S
Sbjct: 491  KAREISLDLKSITQLYNEYAVPFELWEICLEMLYFANYSGDADSSIVRETWARLIDQALS 550

Query: 893  RGGIAEACSVIKRVGPRLYPGDGAILPLDIICLQLEKAGLERLNSGVESVGDEDVARALV 1072
            +GGIAEACSV+KRVG  +YPGDGA+LPLD +CL LEKA LERL SGVE VGDEDV RAL+
Sbjct: 551  KGGIAEACSVLKRVGSHIYPGDGAVLPLDTLCLHLEKAALERLASGVEPVGDEDVVRALL 610

Query: 1073 SACKGAAEPVLNAYDQLLSNGAILTSPNXXXXXXXXXXXXXXEWAMSVYSQRMGTSAS-- 1246
            +ACKGA EPVLN Y+QLLSNGAIL SPN              EWAMSV++QRMGTSA+  
Sbjct: 611  AACKGATEPVLNTYEQLLSNGAILPSPNLRLRLLRSVLVVLREWAMSVFAQRMGTSATGA 670

Query: 1247 --ILGGGFSLER-TVASQGIRDKITSAANRYMTEVRRLALPQSQTELVYRGFRELEESLI 1417
              ILGG FSLE+ TV +QG+RDKITSAANRYMTEVRRLALPQSQTE VYRGFRELEESLI
Sbjct: 671  SLILGGAFSLEQTTVINQGVRDKITSAANRYMTEVRRLALPQSQTEAVYRGFRELEESLI 730

Query: 1418 SPHSFDRF*SLLQYIMLYQ 1474
            SP SF+ F  +L    +++
Sbjct: 731  SPFSFELFLDVLFVFFVFK 749

