BLASTX nr result

ID: Angelica22_contig00024699

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00024699
         (1253 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269338.1| PREDICTED: CBS domain-containing protein CBS...   383   e-104
ref|XP_002284800.1| PREDICTED: CBS domain-containing protein CBS...   327   5e-87
ref|XP_004141365.1| PREDICTED: CBS domain-containing protein CBS...   321   2e-85
ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBS...   316   7e-84
ref|XP_002528574.1| conserved hypothetical protein [Ricinus comm...   315   1e-83

>ref|XP_002269338.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
          Length = 384

 Score =  383 bits (983), Expect = e-104
 Identities = 200/377 (53%), Positives = 256/377 (67%), Gaps = 19/377 (5%)
 Frame = +2

Query: 59   MAVSILGTEVSDLCLGKPPLRLLPITCTVAESITALKRSGESHVSVWSCESTGG------ 220
            MAVS+LG EVSDLCLGKP LR LP++ TVA+++ ALKRSG++++SVWSC+ T        
Sbjct: 1    MAVSLLGHEVSDLCLGKPALRSLPVSATVADALAALKRSGDAYLSVWSCDHTSKINKSHL 60

Query: 221  -DYICVGKICMVDVICYLCNKDSIVRPLDALQAPVSEILPKVPGIVRHLESNTSLLEAID 397
             D  C+GKICMVDV+C+LC +D++  P DALQ+P+S +LPKVPG+VRHL+ N+ LLEAID
Sbjct: 61   EDCRCIGKICMVDVVCFLCREDNLSCPSDALQSPLSLLLPKVPGLVRHLKPNSRLLEAID 120

Query: 398  YILEGTQNLIIPVQN--NLRKRVLRKPS---------SFCWLTREDVVRYLLNCVGAFSP 544
             +LEG QN++IP+Q+  N RK+++ KPS          FCWLT+EDVVR+LLN +G+FSP
Sbjct: 121  LMLEGAQNIVIPIQSRTNPRKKLVPKPSFNSTLHNGVEFCWLTQEDVVRFLLNSIGSFSP 180

Query: 545  VPTFTIESLNMIEAD-IMTVHYDSPAXXXXXXXXXXXXXXXXXAVIDQDNRLIGEISPLT 721
            +P  TIESLN+I+ + I +V+Y  PA                 AV+DQ+N+L+GEISP T
Sbjct: 181  LPGLTIESLNIIDTENIPSVYYHDPASSALTAISQSLINQTSVAVLDQENKLVGEISPFT 240

Query: 722  LACCNEATAAAILTLSAGDLMAYIDCGSPSEELIQLVKSELQMKKLAGMXXXXXXXXXXX 901
            LACC+E  AAAI TLSAGDLMAYIDCG P E+L+QLVK+ L+ +KL              
Sbjct: 241  LACCDETVAAAIATLSAGDLMAYIDCGGPPEDLVQLVKARLEERKLGAFLDLMDEEFSYS 300

Query: 902  XXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIVCYPWSSLMAVMIQALTHRVNCVWVVQED 1081
                                        EAIVCYPWSSLMAVMIQAL HRV+ VWV++ED
Sbjct: 301  SSSSSDEEFGFGRRGGSGKYSARMARRSEAIVCYPWSSLMAVMIQALAHRVSYVWVIEED 360

Query: 1082 FSLLGNVTLAGILKVFR 1132
            +SL G VT +GI KVFR
Sbjct: 361  WSLAGIVTFSGIFKVFR 377


>ref|XP_002284800.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
            gi|302143038|emb|CBI20333.3| unnamed protein product
            [Vitis vinifera]
          Length = 394

 Score =  327 bits (837), Expect = 5e-87
 Identities = 171/387 (44%), Positives = 239/387 (61%), Gaps = 29/387 (7%)
 Frame = +2

Query: 59   MAVSILGTEVSDLCLGKPPLRLLPITCTVAESITALKRSGESHVSVWSCESTGG---DYI 229
            MA S+L   VSDLCLGKP LR L I+ TV E+++ALK S +S +SVWSC+ +     +  
Sbjct: 1    MAFSLLAHVVSDLCLGKPALRSLSISATVGEALSALKTSEDSFISVWSCDHSANIQDECR 60

Query: 230  CVGKICMVDVICYLCNKDSIVRPLDALQAPVSEILPKVPGIVRHLESNTSLLEAIDYILE 409
            CVGKICMVDV+CYLC  D+++ P  AL++PVS++LP +PG+V H+E ++SLLEAID IL+
Sbjct: 61   CVGKICMVDVVCYLCKDDNLLSPSSALKSPVSDLLPNIPGLVMHVEPHSSLLEAIDLILQ 120

Query: 410  GTQNLIIPVQ----NNLRKRVLRKPSS----------FCWLTREDVVRYLLNCVGAFSPV 547
            G QNL++P++    N+ R+++ +KP +          +CWLT+EDVVRYLL+ +G  SP+
Sbjct: 121  GAQNLVVPIRSSISNSSRRKLYQKPQTSPTTMHKGCEYCWLTQEDVVRYLLSSIGLLSPI 180

Query: 548  PTFTIESLNMIEADIMTVHYDSPAXXXXXXXXXXXXXXXXXAVIDQDNRLIGEISPLTLA 727
                I++L +I+ D++ ++Y SPA                 AV+D++  LIGEISP TLA
Sbjct: 181  AALPIDTLRIIDTDVLAINYHSPASSSLPAILRSLRDQTSVAVVDENGALIGEISPFTLA 240

Query: 728  CCNEATAAAILTLSAGDLMAYIDCGSPSEELIQLVKSELQMKKLAGM------------X 871
            CC+E  AAAI TLS+GDLM+YIDCG P EE+++ VK+ L+ + L GM             
Sbjct: 241  CCDETVAAAITTLSSGDLMSYIDCGGPPEEIVKTVKTRLKQRNLEGMLEEFALDSSSTSS 300

Query: 872  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIVCYPWSSLMAVMIQALTHR 1051
                                                  EAIVC+P SSL+AVMIQA+ HR
Sbjct: 301  LSASSSDEESSPSPKTALYRPGKYSRSSSYSARMVRRAEAIVCHPGSSLVAVMIQAIAHR 360

Query: 1052 VNCVWVVQEDFSLLGNVTLAGILKVFR 1132
            VN VWV+++D  L G VT + +LK+FR
Sbjct: 361  VNYVWVIEDDCCLAGIVTFSSMLKIFR 387


>ref|XP_004141365.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis sativus]
          Length = 401

 Score =  321 bits (823), Expect = 2e-85
 Identities = 178/394 (45%), Positives = 242/394 (61%), Gaps = 36/394 (9%)
 Frame = +2

Query: 59   MAVSILGTEVSDLCLGKPPLRLLPITCTVAESITALKRSGESHVSVWSCE---------- 208
            MAVS+   +VSDLCLGKPPLR L ++ T+A+++ AL+ S +  VSVW C           
Sbjct: 1    MAVSLFSHDVSDLCLGKPPLRPLSLSATLADALLALQFSHDYFVSVWDCRLPKRGCTGAV 60

Query: 209  ---STGGDYIC---VGKICMVDVICYLCNKDSIVRPLDALQAPVSEILPKVPGIVRHLES 370
               + GGD+ C   VGK+CMVDVICYLC +++++ P  ALQA VSEILP++PGIV HLE 
Sbjct: 61   DGGAAGGDFECCRCVGKLCMVDVICYLCKEENLLSPSSALQASVSEILPQIPGIVMHLEP 120

Query: 371  NTSLLEAIDYILEGTQNLIIPVQ----NNLRKRVLRKPSS-------FCWLTREDVVRYL 517
            + SLLEAID +L+G QNL++P++    +N R++ L+  ++       FCWLT+ED++RYL
Sbjct: 121  SASLLEAIDLVLQGAQNLVVPIKTRLGSNSRRKQLKNSTNGIHGGHEFCWLTQEDIIRYL 180

Query: 518  LNCVGAFSPVPTFTIESLNMIEADIMTVHYDSPAXXXXXXXXXXXXXXXXXAVIDQDNRL 697
            L  +G FSP+   +++SL +I  + ++V+Y SPA                 AVID D  L
Sbjct: 181  LGSIGLFSPIAALSLDSLGIICTNALSVNYHSPASSAIGAISHSITNQTSVAVIDGDGIL 240

Query: 698  IGEISPLTLACCNEATAAAILTLSAGDLMAYIDCGSPSEELIQLVKSELQMKKLAGM--- 868
            IGEISP  LA C++A AAAI+TLS+GDLMAYIDCG P E+L+++VK+ L+  KL GM   
Sbjct: 241  IGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKDSKLEGMLEE 300

Query: 869  ------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIVCYPWSSLMAVM 1030
                                                         EAIVC+P SSL+AVM
Sbjct: 301  FTNSPSSIGSPSFTSSSSDEEFSPSPSSRRYRRSSSYSARITRRAEAIVCHPRSSLVAVM 360

Query: 1031 IQALTHRVNCVWVVQEDFSLLGNVTLAGILKVFR 1132
            IQA+THRVN VWV+++D SL+G VT   +LKVFR
Sbjct: 361  IQAITHRVNYVWVIEDDCSLIGMVTFLDMLKVFR 394


>ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
          Length = 389

 Score =  316 bits (810), Expect = 7e-84
 Identities = 172/383 (44%), Positives = 226/383 (59%), Gaps = 25/383 (6%)
 Frame = +2

Query: 59   MAVSILGTEVSDLCLGKPPLRLLPITCTVAESITALKRSG-ESHVSVWSCESTGGDYICV 235
            MAVS L  +VSDLCLGKPPLR L    TVA+++ ALK S  E+HVS+WS      +  CV
Sbjct: 1    MAVSFLARDVSDLCLGKPPLRSLSAAATVADALAALKSSDHETHVSLWSFCENKNEVRCV 60

Query: 236  GKICMVDVICYLCNKDSIVRPLDALQAPVSEILPKVPGIVRHLESNTSLLEAIDYILEGT 415
            GK+CMVDVICYLC +D+++ P  AL+ P+S ILPK   +V HL+ ++SL EAID IL+G 
Sbjct: 61   GKLCMVDVICYLCREDNLLSPSKALKEPLSSILPKDQSLVVHLQPSSSLFEAIDLILQGA 120

Query: 416  QNLIIPVQNNLRKRVLRKPS----------------SFCWLTREDVVRYLLNCVGAFSPV 547
            QNL++P+    R  V R+                   FCWLT+EDV+R+LL  +G F+P+
Sbjct: 121  QNLVVPILPTKRSGVSRRKQQQHQKASSTINSHSSCEFCWLTQEDVIRFLLGSIGVFTPL 180

Query: 548  PTFTIESLNMIEADIMTVHYDSPAXXXXXXXXXXXXXXXXXAVIDQDNRLIGEISPLTLA 727
            P  +I+SL +I +D++ + Y SPA                 A++D D   IGEISP TLA
Sbjct: 181  PALSIDSLGIISSDVLAIDYYSPASSAVGAISKSLTQQTSVAIVDSDGTFIGEISPFTLA 240

Query: 728  CCNEATAAAILTLSAGDLMAYIDCGSPSEELIQLVKSELQMKKLAGM--------XXXXX 883
            CC+E  AAAI TLSAGDLMAYIDCG P E+L++LVK+ L+ K    M             
Sbjct: 241  CCDETVAAAIATLSAGDLMAYIDCGGPPEDLVRLVKARLKEKNFEKMLQEFTILSSCESS 300

Query: 884  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIVCYPWSSLMAVMIQALTHRVNCV 1063
                                              EAIVC+P SSL+AVMIQA+ HRVN +
Sbjct: 301  QSTSSDEELPTRTPARSGRLARSSSYSARMVRKAEAIVCHPKSSLVAVMIQAIAHRVNYL 360

Query: 1064 WVVQEDFSLLGNVTLAGILKVFR 1132
            WV+++D SL+G VT + +LKVFR
Sbjct: 361  WVIEDDCSLVGIVTFSNMLKVFR 383


>ref|XP_002528574.1| conserved hypothetical protein [Ricinus communis]
            gi|223532018|gb|EEF33829.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 408

 Score =  315 bits (808), Expect = 1e-83
 Identities = 176/401 (43%), Positives = 236/401 (58%), Gaps = 43/401 (10%)
 Frame = +2

Query: 59   MAVSILGTEVSDLCLGKPPLRLLPITCTVAESITALKRSGESHVSVWSCE---------- 208
            MAVS+   EVSDLCLGKP LR LP+T TVAE+++ALK S +S +SVW+C+          
Sbjct: 1    MAVSLFAHEVSDLCLGKPALRSLPVTATVAEALSALKNSDDSFLSVWNCDHITKRNSGFN 60

Query: 209  ---STGGDYICVGKICMVDVICYLCNKDSIVRPLDALQAPVSEILPKVPGIVRHLESNTS 379
                   +  CVGK+ +VDVICYLC   ++V P DAL+ PVS +LPK+PG+V H+E ++S
Sbjct: 61   CDREDRDECKCVGKVSIVDVICYLCQDKNLVSPSDALKDPVSVLLPKIPGLVMHVEPSSS 120

Query: 380  LLEAIDYILEGTQNLIIPVQ------NNLRKR------------VLRKPSSFCWLTREDV 505
            L+EAID IL+G QNL++P++      N+ RK+             + K   FCWL +ED+
Sbjct: 121  LVEAIDLILQGAQNLVVPIKTRLSSSNSRRKQQQKLSATSTGLTTIHKGREFCWLAQEDI 180

Query: 506  VRYLLNCVGAFSPVPTFTIESLNMIEADIMTVHYDSPAXXXXXXXXXXXXXXXXXAVIDQ 685
            +R+ L+ +G FSPVP  +I+SL +I  DI+T+ Y+SPA                 AV+D 
Sbjct: 181  IRFFLSSIGLFSPVPALSIDSLGIITTDIITIDYNSPASATLGAINRALATQTSVAVVDG 240

Query: 686  DNR-LIGEISPLTLACCNEATAAAILTLSAGDLMAYIDCGSPSEELIQLVKSELQMKKLA 862
            D   LIGE+SP TLACC+E  AAAI TLS+GDLMAYIDCG P E+L+++V + L+ + L 
Sbjct: 241  DEGILIGELSPFTLACCDETVAAAITTLSSGDLMAYIDCGGPPEDLVRVVMARLKHRGLE 300

Query: 863  GM-----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIVCYPW 1009
             M                                                  EAIVC+P 
Sbjct: 301  AMLQEFTNSTTSLVSFSTLSSSSSDEESTTTLHRSGKYSRSKSYSARMVRRAEAIVCHPK 360

Query: 1010 SSLMAVMIQALTHRVNCVWVVQEDFSLLGNVTLAGILKVFR 1132
            SSL+AVMIQA+ HRVN VWV++ED SL+G VT   +LKVFR
Sbjct: 361  SSLVAVMIQAIAHRVNYVWVIEEDCSLVGIVTFCNMLKVFR 401

