BLASTX nr result

ID: Cephaelis21_contig00015642 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00015642
         (1507 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002284800.1| PREDICTED: CBS domain-containing protein CBS...   467   e-129
ref|XP_002528574.1| conserved hypothetical protein [Ricinus comm...   462   e-127
ref|XP_002329144.1| predicted protein [Populus trichocarpa] gi|2...   459   e-127
ref|XP_002317509.1| predicted protein [Populus trichocarpa] gi|2...   450   e-124
ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBS...   426   e-117

>ref|XP_002284800.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
            gi|302143038|emb|CBI20333.3| unnamed protein product
            [Vitis vinifera]
          Length = 394

 Score =  467 bits (1202), Expect = e-129
 Identities = 247/398 (62%), Positives = 297/398 (74%), Gaps = 1/398 (0%)
 Frame = -3

Query: 1406 MAAGLLVHEVADLCLGKPALKXXXXXXXXXXXXXXXXXSEENCISVWSCDHHSKNSFDVN 1227
            MA  LL H V+DLCLGKPAL+                 SE++ ISVWSCDH S N  D  
Sbjct: 1    MAFSLLAHVVSDLCLGKPALRSLSISATVGEALSALKTSEDSFISVWSCDH-SANIQD-- 57

Query: 1226 GGDCVCIGKICIVDVICYLCREENLASPRLALKSPVSALLSNVPGLVRHVEPSSSIVEAI 1047
              +C C+GKIC+VDV+CYLC+++NL SP  ALKSPVS LL N+PGLV HVEP SS++EAI
Sbjct: 58   --ECRCVGKICMVDVVCYLCKDDNLLSPSSALKSPVSDLLPNIPGLVMHVEPHSSLLEAI 115

Query: 1046 DLILQGAQNLVVPXXXXXXXXXXXXXXXXXXSTTPTIHNGREFCWLTQEDVIRFLLNSIG 867
            DLILQGAQNLVVP                  ++  T+H G E+CWLTQEDV+R+LL+SIG
Sbjct: 116  DLILQGAQNLVVPIRSSISNSSRRKLYQKPQTSPTTMHKGCEYCWLTQEDVVRYLLSSIG 175

Query: 866  LFSPIPTLSVERLGLISTDFLTIQYDSKACSAIGAILDSLTNQTSVAVVDDDGILIGEIS 687
            L SPI  L ++ L +I TD L I Y S A S++ AIL SL +QTSVAVVD++G LIGEIS
Sbjct: 176  LLSPIAALPIDTLRIIDTDVLAINYHSPASSSLPAILRSLRDQTSVAVVDENGALIGEIS 235

Query: 686  PFTLACCDETVAAAITTLSAGDLMAYIDCGGPPEDIVRVVVARLKENNLKGMLEEFTIDA 507
            PFTLACCDETVAAAITTLS+GDLM+YIDCGGPPE+IV+ V  RLK+ NL+GMLEEF +D+
Sbjct: 236  PFTLACCDETVAAAITTLSSGDLMSYIDCGGPPEEIVKTVKTRLKQRNLEGMLEEFALDS 295

Query: 506  SNIQVNXXXXSDEE-FPSPKTTLSMSGRYNRSSSYSARMVRRAEAIVCNPGSSLVAVMMQ 330
            S+        SDEE  PSPKT L   G+Y+RSSSYSARMVRRAEAIVC+PGSSLVAVM+Q
Sbjct: 296  SSTSSLSASSSDEESSPSPKTALYRPGKYSRSSSYSARMVRRAEAIVCHPGSSLVAVMIQ 355

Query: 329  AIAHRVNYVWVIEEDGSVVGIVTFSNILEVVRDYLDSM 216
            AIAHRVNYVWVIE+D  + GIVTFS++L++ R++L SM
Sbjct: 356  AIAHRVNYVWVIEDDCCLAGIVTFSSMLKIFREHLQSM 393


>ref|XP_002528574.1| conserved hypothetical protein [Ricinus communis]
            gi|223532018|gb|EEF33829.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 408

 Score =  462 bits (1188), Expect = e-127
 Identities = 245/407 (60%), Positives = 295/407 (72%), Gaps = 10/407 (2%)
 Frame = -3

Query: 1406 MAAGLLVHEVADLCLGKPALKXXXXXXXXXXXXXXXXXSEENCISVWSCDHHSKNSFDVN 1227
            MA  L  HEV+DLCLGKPAL+                 S+++ +SVW+CDH +K +   N
Sbjct: 1    MAVSLFAHEVSDLCLGKPALRSLPVTATVAEALSALKNSDDSFLSVWNCDHITKRNSGFN 60

Query: 1226 -----GGDCVCIGKICIVDVICYLCREENLASPRLALKSPVSALLSNVPGLVRHVEPSSS 1062
                   +C C+GK+ IVDVICYLC+++NL SP  ALK PVS LL  +PGLV HVEPSSS
Sbjct: 61   CDREDRDECKCVGKVSIVDVICYLCQDKNLVSPSDALKDPVSVLLPKIPGLVMHVEPSSS 120

Query: 1061 IVEAIDLILQGAQNLVVPXXXXXXXXXXXXXXXXXXSTTPT----IHNGREFCWLTQEDV 894
            +VEAIDLILQGAQNLVVP                  S T T    IH GREFCWL QED+
Sbjct: 121  LVEAIDLILQGAQNLVVPIKTRLSSSNSRRKQQQKLSATSTGLTTIHKGREFCWLAQEDI 180

Query: 893  IRFLLNSIGLFSPIPTLSVERLGLISTDFLTIQYDSKACSAIGAILDSLTNQTSVAVVD- 717
            IRF L+SIGLFSP+P LS++ LG+I+TD +TI Y+S A + +GAI  +L  QTSVAVVD 
Sbjct: 181  IRFFLSSIGLFSPVPALSIDSLGIITTDIITIDYNSPASATLGAINRALATQTSVAVVDG 240

Query: 716  DDGILIGEISPFTLACCDETVAAAITTLSAGDLMAYIDCGGPPEDIVRVVVARLKENNLK 537
            D+GILIGE+SPFTLACCDETVAAAITTLS+GDLMAYIDCGGPPED+VRVV+ARLK   L+
Sbjct: 241  DEGILIGELSPFTLACCDETVAAAITTLSSGDLMAYIDCGGPPEDLVRVVMARLKHRGLE 300

Query: 536  GMLEEFTIDASNIQVNXXXXSDEEFPSPKTTLSMSGRYNRSSSYSARMVRRAEAIVCNPG 357
             ML+EFT   +++       S        TTL  SG+Y+RS SYSARMVRRAEAIVC+P 
Sbjct: 301  AMLQEFTNSTTSLVSFSTLSSSSSDEESTTTLHRSGKYSRSKSYSARMVRRAEAIVCHPK 360

Query: 356  SSLVAVMMQAIAHRVNYVWVIEEDGSVVGIVTFSNILEVVRDYLDSM 216
            SSLVAVM+QAIAHRVNYVWVIEED S+VGIVTF N+L+V R++L++M
Sbjct: 361  SSLVAVMIQAIAHRVNYVWVIEEDCSLVGIVTFCNMLKVFREHLEAM 407


>ref|XP_002329144.1| predicted protein [Populus trichocarpa] gi|222869813|gb|EEF06944.1|
            predicted protein [Populus trichocarpa]
          Length = 411

 Score =  459 bits (1182), Expect = e-127
 Identities = 242/412 (58%), Positives = 299/412 (72%), Gaps = 15/412 (3%)
 Frame = -3

Query: 1406 MAAGLLVHEVADLCLGKPALKXXXXXXXXXXXXXXXXXSEENCISVWSCDHHSKNSFDVN 1227
            MA  LL HEV+DLCLGKPAL+                 S++N ISVW+CDH +K + D  
Sbjct: 1    MAVSLLAHEVSDLCLGKPALRSLALTTTIAEALFALKNSDDNFISVWNCDHAAKTNNDYK 60

Query: 1226 GG---------DCVCIGKICIVDVICYLCREENLASPRLALKSPVSALLSNVPGLVRHVE 1074
            G          +C C+GK+ +VDV+CYLC++ENL  P  ALK+PVS LL  +PG+V HVE
Sbjct: 61   GNCEEEGCDVCECKCVGKVSMVDVVCYLCKDENLLFPSDALKAPVSVLLPEIPGMVVHVE 120

Query: 1073 PSSSIVEAIDLILQGAQNLVVPXXXXXXXXXXXXXXXXXXSTTPTIHNGREFCWLTQEDV 894
            P+SS++EAIDLILQGA+NLVVP                   T+PTIHNGREFCWLTQED+
Sbjct: 121  PTSSLLEAIDLILQGAKNLVVPIKTRYSTRRKQHQKLSI--TSPTIHNGREFCWLTQEDI 178

Query: 893  IRFLLNSIGLFSPIPTLSVERLGLISTDFLTIQYDSKACSAIGAILDSLTNQTSVAVVDD 714
            IRF L SIGLF+P+P LS++ LG+IST+FLTI Y S A S + AI  SL ++TSVAV+D 
Sbjct: 179  IRFFLGSIGLFAPLPALSIDTLGIISTEFLTIDYHSPAISELEAISRSLADETSVAVIDS 238

Query: 713  DGILIGEISPFTLACCDETVAAAITTLSAGDLMAYIDCGGPPEDIVRVVVARLKENNLKG 534
            DGILIGE+SPFTLACCD++VAAAITTLS+GDLMAYIDCGGPPED+V+VV+ RLKE  L+ 
Sbjct: 239  DGILIGELSPFTLACCDDSVAAAITTLSSGDLMAYIDCGGPPEDLVKVVMERLKERGLEA 298

Query: 533  MLEEFTIDA-----SNIQVNXXXXSDEEFPS-PKTTLSMSGRYNRSSSYSARMVRRAEAI 372
            ML+EFT  +     S+ Q       +E   S P +TL  SG+Y+RS SYSARMVRRAEAI
Sbjct: 299  MLQEFTNSSCYSTTSSCQSQSSSSDEESASSTPVSTLHRSGKYSRSMSYSARMVRRAEAI 358

Query: 371  VCNPGSSLVAVMMQAIAHRVNYVWVIEEDGSVVGIVTFSNILEVVRDYLDSM 216
            VC+P SSLVAVM+QAIAHRVNYVWVIE+D S+VGIV F ++L+V R+ L+ M
Sbjct: 359  VCHPKSSLVAVMIQAIAHRVNYVWVIEDDCSLVGIVRFYDMLKVFRESLEDM 410


>ref|XP_002317509.1| predicted protein [Populus trichocarpa] gi|222860574|gb|EEE98121.1|
            predicted protein [Populus trichocarpa]
          Length = 401

 Score =  450 bits (1157), Expect = e-124
 Identities = 229/406 (56%), Positives = 291/406 (71%), Gaps = 9/406 (2%)
 Frame = -3

Query: 1406 MAAGLLVHEVADLCLGKPALKXXXXXXXXXXXXXXXXXSEENCISVWSCDHHSKNSFDVN 1227
            MA  LL  E++DLCLGKPAL+                 S++N +SVWSC+H +K + D  
Sbjct: 1    MAVSLLAREISDLCLGKPALRSLSLTTTITEVLFALKNSDDNFLSVWSCEHTAKTNKDYR 60

Query: 1226 G---------GDCVCIGKICIVDVICYLCREENLASPRLALKSPVSALLSNVPGLVRHVE 1074
            G         G+C C+GK+ +VDVICYLC++ENL SP  ALK+PVS LL  +PG+V HVE
Sbjct: 61   GNCEEDGCDVGECKCVGKVSMVDVICYLCKDENLLSPSDALKAPVSVLLPEIPGMVVHVE 120

Query: 1073 PSSSIVEAIDLILQGAQNLVVPXXXXXXXXXXXXXXXXXXSTTPTIHNGREFCWLTQEDV 894
            P+SS+++AIDLILQGA+NLVVP                   T+PTIHNGREFCWLTQED+
Sbjct: 121  PTSSLLDAIDLILQGAKNLVVPIKTRYSSSSRRKQHQKLSITSPTIHNGREFCWLTQEDI 180

Query: 893  IRFLLNSIGLFSPIPTLSVERLGLISTDFLTIQYDSKACSAIGAILDSLTNQTSVAVVDD 714
            IRF L SIGLF+P+P LS++ LG+ISTD+LTI Y S A S + AI  SL ++ SVA++D 
Sbjct: 181  IRFFLGSIGLFAPLPALSIDTLGIISTDYLTIDYHSPAISELEAISGSLADENSVAIIDS 240

Query: 713  DGILIGEISPFTLACCDETVAAAITTLSAGDLMAYIDCGGPPEDIVRVVVARLKENNLKG 534
            DGILIGE+SPFTLACCDE+VAAAITTLS+GDLMAYIDCGGPP+D+V +V+ RLK   L+ 
Sbjct: 241  DGILIGELSPFTLACCDESVAAAITTLSSGDLMAYIDCGGPPDDLVNLVMTRLKGRGLEA 300

Query: 533  MLEEFTIDASNIQVNXXXXSDEEFPSPKTTLSMSGRYNRSSSYSARMVRRAEAIVCNPGS 354
            ML+EFT  +     +          +P + L   G+Y+RS SYSARMVRRAEAIVC+P S
Sbjct: 301  MLQEFTNSSCYSTTSSWS------STPFSALQRPGKYSRSMSYSARMVRRAEAIVCHPKS 354

Query: 353  SLVAVMMQAIAHRVNYVWVIEEDGSVVGIVTFSNILEVVRDYLDSM 216
            SLVAVM+QAIAHR+NYVWVIE+D S+VGIV F ++L+V R+ ++ M
Sbjct: 355  SLVAVMIQAIAHRLNYVWVIEDDCSLVGIVRFCDVLKVFRESIEDM 400


>ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
          Length = 389

 Score =  426 bits (1096), Expect = e-117
 Identities = 235/400 (58%), Positives = 285/400 (71%), Gaps = 3/400 (0%)
 Frame = -3

Query: 1406 MAAGLLVHEVADLCLGKPALKXXXXXXXXXXXXXXXXXSE-ENCISVWSCDHHSKNSFDV 1230
            MA   L  +V+DLCLGKP L+                 S+ E  +S+WS        F  
Sbjct: 1    MAVSFLARDVSDLCLGKPPLRSLSAAATVADALAALKSSDHETHVSLWS--------FCE 52

Query: 1229 NGGDCVCIGKICIVDVICYLCREENLASPRLALKSPVSALLSNVPGLVRHVEPSSSIVEA 1050
            N  +  C+GK+C+VDVICYLCRE+NL SP  ALK P+S++L     LV H++PSSS+ EA
Sbjct: 53   NKNEVRCVGKLCMVDVICYLCREDNLLSPSKALKEPLSSILPKDQSLVVHLQPSSSLFEA 112

Query: 1049 IDLILQGAQNLVVPXXXXXXXXXXXXXXXXXXSTTPTI--HNGREFCWLTQEDVIRFLLN 876
            IDLILQGAQNLVVP                    + TI  H+  EFCWLTQEDVIRFLL 
Sbjct: 113  IDLILQGAQNLVVPILPTKRSGVSRRKQQQHQKASSTINSHSSCEFCWLTQEDVIRFLLG 172

Query: 875  SIGLFSPIPTLSVERLGLISTDFLTIQYDSKACSAIGAILDSLTNQTSVAVVDDDGILIG 696
            SIG+F+P+P LS++ LG+IS+D L I Y S A SA+GAI  SLT QTSVA+VD DG  IG
Sbjct: 173  SIGVFTPLPALSIDSLGIISSDVLAIDYYSPASSAVGAISKSLTQQTSVAIVDSDGTFIG 232

Query: 695  EISPFTLACCDETVAAAITTLSAGDLMAYIDCGGPPEDIVRVVVARLKENNLKGMLEEFT 516
            EISPFTLACCDETVAAAI TLSAGDLMAYIDCGGPPED+VR+V ARLKE N + ML+EFT
Sbjct: 233  EISPFTLACCDETVAAAIATLSAGDLMAYIDCGGPPEDLVRLVKARLKEKNFEKMLQEFT 292

Query: 515  IDASNIQVNXXXXSDEEFPSPKTTLSMSGRYNRSSSYSARMVRRAEAIVCNPGSSLVAVM 336
            I  S+ + +    SDEE P+   T + SGR  RSSSYSARMVR+AEAIVC+P SSLVAVM
Sbjct: 293  I-LSSCESSQSTSSDEELPT--RTPARSGRLARSSSYSARMVRKAEAIVCHPKSSLVAVM 349

Query: 335  MQAIAHRVNYVWVIEEDGSVVGIVTFSNILEVVRDYLDSM 216
            +QAIAHRVNY+WVIE+D S+VGIVTFSN+L+V R++L+++
Sbjct: 350  IQAIAHRVNYLWVIEDDCSLVGIVTFSNMLKVFREHLETI 389