BLASTX nr result

ID: Catharanthus23_contig00005128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005128
         (2490 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350399.1| PREDICTED: CBS domain-containing protein CBS...   572   e-160
ref|XP_002279210.1| PREDICTED: CBS domain-containing protein CBS...   569   e-159
ref|XP_004237371.1| PREDICTED: CBS domain-containing protein CBS...   565   e-158
ref|XP_002512622.1| conserved hypothetical protein [Ricinus comm...   551   e-154
gb|ESW11733.1| hypothetical protein PHAVU_008G055300g [Phaseolus...   550   e-153
gb|EMJ08669.1| hypothetical protein PRUPE_ppa006223mg [Prunus pe...   549   e-153
ref|XP_003534522.1| PREDICTED: CBS domain-containing protein CBS...   548   e-153
ref|XP_006480403.1| PREDICTED: CBS domain-containing protein CBS...   546   e-152
ref|XP_003552437.1| PREDICTED: CBS domain-containing protein CBS...   543   e-151
ref|XP_002325465.1| CBS domain-containing family protein [Populu...   535   e-149
ref|XP_004492827.1| PREDICTED: CBS domain-containing protein CBS...   534   e-149
ref|XP_004302150.1| PREDICTED: CBS domain-containing protein CBS...   531   e-148
gb|EOY09904.1| Cystathionine beta-synthase family protein isofor...   526   e-146
gb|AFK42288.1| unknown [Medicago truncatula]                          521   e-145
ref|XP_004152831.1| PREDICTED: CBS domain-containing protein CBS...   520   e-145
ref|XP_002329005.1| predicted protein [Populus trichocarpa] gi|5...   520   e-145
gb|EXC29924.1| CBS domain-containing protein CBSX6 [Morus notabi...   512   e-142
ref|XP_003624141.1| hypothetical protein MTR_7g079680 [Medicago ...   511   e-142
ref|XP_006391564.1| hypothetical protein EUTSA_v10018605mg [Eutr...   493   e-136
ref|XP_002888360.1| hypothetical protein ARALYDRAFT_475589 [Arab...   491   e-136

>ref|XP_006350399.1| PREDICTED: CBS domain-containing protein CBSX6-like [Solanum
            tuberosum]
          Length = 422

 Score =  572 bits (1473), Expect = e-160
 Identities = 292/416 (70%), Positives = 340/416 (81%), Gaps = 1/416 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            AAIRAIGESTE GIPVWK RS K + ENAE
Sbjct: 1    MASVFLYHVVGDLTVGKPELVEFTETQTVEAAIRAIGESTECGIPVWKTRSQKGLIENAE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +R++RFVGILNSLDIVAFLA++ECLADQ+KAMKTPVSEVV+P++SLLKE+DPATRLIDAL
Sbjct: 61   MRRKRFVGILNSLDIVAFLAREECLADQEKAMKTPVSEVVLPDNSLLKELDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASINWPTSSSATII 1476
            EMMKQGVKRLLVPKS+ W+G+SKRFS+L+NGKWLKNID S   + AA+ N P++SS   I
Sbjct: 121  EMMKQGVKRLLVPKSVVWRGMSKRFSILYNGKWLKNIDTS---NPAANANRPSTSSPVPI 177

Query: 1477 RDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQD 1656
            RDKFCCLSRED+IRF+IGCLGALAP+PL+ I SLG INPNY  IEAS+PAID+T+KLP D
Sbjct: 178  RDKFCCLSREDIIRFIIGCLGALAPIPLSSIYSLGIINPNYCSIEASRPAIDATQKLPSD 237

Query: 1657 PCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSVP 1836
            P AVAV++PM+D   KIIG+ISA KLWKCDY            GQFVMGVEDNI+S S+P
Sbjct: 238  PPAVAVIDPMADDYNKIIGEISATKLWKCDYLAAAWALANFSAGQFVMGVEDNISSSSLP 297

Query: 1837 DLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCKH 2016
            D +V+ +    N AN  GG  RPKKFSSRSIGF SN ++ +L   R+MYRGRSAPLTCK 
Sbjct: 298  DFAVNPMVTNANTANSRGGIVRPKKFSSRSIGFVSNPTNSSLSVSRSMYRGRSAPLTCKE 357

Query: 2017 TSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQ-PTTVPEAQT 2181
            TSSLAAVMAQ+LSHRATHVWVT+AENE++L+G+VG+ DILAAVTR  PTT PE  +
Sbjct: 358  TSSLAAVMAQMLSHRATHVWVTDAENEDHLVGVVGYADILAAVTRPLPTTNPETHS 413


>ref|XP_002279210.1| PREDICTED: CBS domain-containing protein CBSX6 [Vitis vinifera]
            gi|297735292|emb|CBI17654.3| unnamed protein product
            [Vitis vinifera]
          Length = 427

 Score =  569 bits (1467), Expect = e-159
 Identities = 292/416 (70%), Positives = 336/416 (80%), Gaps = 1/416 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            +AIR IGES EG IP+WK+RS   V E +E
Sbjct: 1    MASVFLYHVVGDLTVGKPELVEFPETETVESAIRVIGESAEGSIPIWKRRSHVGVMEKSE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNSLDIVAFLA+D CL DQ+KAMKTPVSEVVVPN+SLL+EVDPATRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLARDACLVDQEKAMKTPVSEVVVPNNSLLREVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSS-AASINWPTSSSATI 1473
            EMMK G+KRLLVPKS+ WKG+SKRFS+L+NGKWLKN+D S ++++ A + N P+SSS   
Sbjct: 121  EMMKHGLKRLLVPKSVVWKGMSKRFSILYNGKWLKNLDASSSSTNLAPNANRPSSSSTPS 180

Query: 1474 IRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQ 1653
             R+KFCCLSREDVIRF+IGCLGALAPLPL+ ISSLGAI+PNY  IEAS PAI  T+KLPQ
Sbjct: 181  SRNKFCCLSREDVIRFVIGCLGALAPLPLSSISSLGAISPNYYSIEASFPAIQVTQKLPQ 240

Query: 1654 DPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSV 1833
            DP AVAVVE   DGQ+KIIG+ISACKLWKCDY            GQFVMGVEDN+TS S+
Sbjct: 241  DPSAVAVVESTPDGQYKIIGEISACKLWKCDYLAAAWALANLSAGQFVMGVEDNVTSRSL 300

Query: 1834 PDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCK 2013
            P  SV+   GE N+ANGG    R +KFSSRSIGFFSN +SP+ GA R+MYRGRSAPLTCK
Sbjct: 301  PHFSVNLTGGENNMANGGAS-TRQRKFSSRSIGFFSNPASPSFGASRSMYRGRSAPLTCK 359

Query: 2014 HTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEAQT 2181
             TSSLAAVMAQ+LSHRATHVWVTEAE+E+ L+G+VG+ DILAAV +QP  V  + T
Sbjct: 360  VTSSLAAVMAQMLSHRATHVWVTEAESEDILVGVVGYADILAAVIKQPAPVIPSTT 415


>ref|XP_004237371.1| PREDICTED: CBS domain-containing protein CBSX6-like [Solanum
            lycopersicum]
          Length = 420

 Score =  565 bits (1457), Expect = e-158
 Identities = 288/416 (69%), Positives = 339/416 (81%), Gaps = 1/416 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            AAI+AIGESTE GIPVWK RS K + ENAE
Sbjct: 1    MASVFLYHVVGDLTVGKPELVEFTETQTVEAAIKAIGESTECGIPVWKTRSQKGMIENAE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +R++RFVGILNSLDIVAFLA++ECLADQ+KAMKTPVSEVV+P++SLLKE+DPATRLIDAL
Sbjct: 61   MRRKRFVGILNSLDIVAFLAREECLADQEKAMKTPVSEVVLPDNSLLKELDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASINWPTSSSATII 1476
            EMMKQGVKRLLVPKS+ W+G+SKRFS+L+NGKWLKNID S   + AA+ N P++SS   I
Sbjct: 121  EMMKQGVKRLLVPKSVVWRGMSKRFSILYNGKWLKNIDTS---NPAANANRPSTSSPVPI 177

Query: 1477 RDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQD 1656
            RDKFCCLSRED+IRF+IGCLGALAP+PL+ I SLG INPNY  IEAS+PAID+T+KLP D
Sbjct: 178  RDKFCCLSREDIIRFIIGCLGALAPIPLSSIYSLGIINPNYCSIEASRPAIDATQKLPSD 237

Query: 1657 PCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSVP 1836
            P AVAV++PM+D   KIIG+ISA KLWKCDY            GQFVMGVEDNI+  S+P
Sbjct: 238  PPAVAVIDPMADDYNKIIGEISATKLWKCDYLAAAWALANFSAGQFVMGVEDNISPSSLP 297

Query: 1837 DLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCKH 2016
            D  V+S+    N AN  G   RPKKFSSRSIGF SN ++ +L   R+MYRGRSAPLTCK 
Sbjct: 298  DFPVNSMVTNANTANSRGSIVRPKKFSSRSIGFVSNPTNSSLSVSRSMYRGRSAPLTCKE 357

Query: 2017 TSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQ-PTTVPEAQT 2181
            TSSLAAVMAQ+LSHRATHVWVT+A+N+++L+G+VG+ DILAAVTR  PTT PE+ +
Sbjct: 358  TSSLAAVMAQMLSHRATHVWVTDADNDDHLVGVVGYADILAAVTRPLPTTNPESHS 413


>ref|XP_002512622.1| conserved hypothetical protein [Ricinus communis]
            gi|223548583|gb|EEF50074.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 432

 Score =  551 bits (1421), Expect = e-154
 Identities = 287/418 (68%), Positives = 333/418 (79%), Gaps = 9/418 (2%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVWK+RS  ++ EN E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFCETETVESAIRAIGESTECGIPVWKRRSHVNMIENNE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNSLDIVAFLAK +CL DQ+KAMKTPVSEVVVP++SLLK+VDPATRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLAKAQCLEDQEKAMKTPVSEVVVPDNSLLKQVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPS---------GANSSAASINW 1449
            EMMKQGVKRLLV K   WKG+SKRFS+L+NGKWLKN+D S          +N+  ++I+ 
Sbjct: 121  EMMKQGVKRLLVSKGTVWKGMSKRFSILYNGKWLKNVDSSNSSNNLMGNSSNNLMSNISR 180

Query: 1450 PTSSSATIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAI 1629
            P+SSS TI RDKFCCLSREDVIRF+IGCLGALAPLPL+ ISSLGAIN NY  +EAS  AI
Sbjct: 181  PSSSSTTISRDKFCCLSREDVIRFIIGCLGALAPLPLSSISSLGAINFNYYSVEASLSAI 240

Query: 1630 DSTKKLPQDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVE 1809
            ++T+KLP+DPCAVAVVEPM DG  KIIG+ISA +LWKCDY            GQFVMGVE
Sbjct: 241  EATQKLPKDPCAVAVVEPMPDGHSKIIGEISASRLWKCDYLAAAWALANLSSGQFVMGVE 300

Query: 1810 DNITSMSVPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRG 1989
            DN+T+ S+PD +V+S   + N ANGGG   RPKKFSS+SIGF  N  S + G  R+MYRG
Sbjct: 301  DNLTARSLPDFTVNSTVSDNNTANGGGS-TRPKKFSSKSIGF--NPGSSSFGIGRSMYRG 357

Query: 1990 RSAPLTCKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTT 2163
            RSAPLTCK TSSLAAVMAQ+LSHRATHVWVTEA+N+E L+G+VG+ DIL AVT+ P +
Sbjct: 358  RSAPLTCKITSSLAAVMAQMLSHRATHVWVTEADNDEVLVGVVGYADILFAVTKPPAS 415


>gb|ESW11733.1| hypothetical protein PHAVU_008G055300g [Phaseolus vulgaris]
          Length = 423

 Score =  550 bits (1417), Expect = e-153
 Identities = 280/412 (67%), Positives = 332/412 (80%), Gaps = 2/412 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVF+YHVVGDLTVGKP            +AIRAIGES EG IP+WKKRS   + EN+E
Sbjct: 1    MASVFVYHVVGDLTVGKPELVEFHESETVESAIRAIGESPEGSIPIWKKRSHVGI-ENSE 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGIL+S DIVAFLAK +CL DQDKA+K PVSEVVVPN+SLL+ VDPATRLIDAL
Sbjct: 60   MRQQRFVGILSSFDIVAFLAKSQCLEDQDKALKIPVSEVVVPNNSLLRLVDPATRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDP--SGANSSAASINWPTSSSAT 1470
            +MMKQGVKRLLVPKS+ WKG+SKRFSV++ GKWLKN D   + +N+   ++NW  S+S T
Sbjct: 120  DMMKQGVKRLLVPKSVAWKGMSKRFSVIYYGKWLKNNDSPSNSSNNLPPNLNWSPSTSGT 179

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
            +IRDK+CCLSREDV+RF+IGCLGALAPLPLT I +LGAIN +YSYIE+S PAI +TKKLP
Sbjct: 180  VIRDKYCCLSREDVLRFIIGCLGALAPLPLTSIVALGAINADYSYIESSTPAIAATKKLP 239

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
            QDP AVAV E   DGQ KI+G+ISACKLWKCDY            GQFVMGVEDN++S S
Sbjct: 240  QDPSAVAVTENTPDGQCKILGEISACKLWKCDYLAAAWALANLSAGQFVMGVEDNVSSRS 299

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTC 2010
            +P+ SV S TG ++L NG     +P+KFSSRSIGFFSN+ S + G+ R+MYRGRSAPLTC
Sbjct: 300  LPEFSVDSPTGNSDLVNGS---RKPRKFSSRSIGFFSNSGSHSFGS-RSMYRGRSAPLTC 355

Query: 2011 KHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTV 2166
            K TSSLAAV+AQ+LSHRATHVWVTE EN+E L+G+VG+ DILAAVT+ PT +
Sbjct: 356  KLTSSLAAVLAQMLSHRATHVWVTEDENDEVLVGVVGYTDILAAVTKSPTAM 407


>gb|EMJ08669.1| hypothetical protein PRUPE_ppa006223mg [Prunus persica]
          Length = 421

 Score =  549 bits (1414), Expect = e-153
 Identities = 281/413 (68%), Positives = 334/413 (80%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFL+HVVGDLTVGKP            AAI+AIGES E GIPVWKK+S   + EN E
Sbjct: 1    MASVFLFHVVGDLTVGKPEMVELCETETMEAAIKAIGESMECGIPVWKKKSHVGMVENDE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNSLDIVAF AK ECL D DKA+KTPVS+VVVPN+SLL++VDPATRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFFAKSECLEDHDKALKTPVSDVVVPNNSLLRQVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASINWPTSSSATII 1476
            EMMK GVKRLLV KS+ WKG+SKRFSV+++GKWLKN+D SG+++S A+ N P+SSSAT  
Sbjct: 121  EMMKHGVKRLLVRKSVVWKGMSKRFSVIYSGKWLKNMDTSGSSNSLAA-NRPSSSSATST 179

Query: 1477 RDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQD 1656
            RDKFCCLSREDVIRFLIGCLGALAP+PL+ IS+LGAIN NY ++EAS  AI++T KLP+D
Sbjct: 180  RDKFCCLSREDVIRFLIGCLGALAPIPLSSISTLGAINTNYQFVEASSAAIEATHKLPED 239

Query: 1657 PCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSVP 1836
            P AVAVVE   + Q+KIIG+ISA KLWKCDY            GQFVMGVEDN +S S+P
Sbjct: 240  PSAVAVVEHTPEDQYKIIGEISASKLWKCDYLAAAWALANLSAGQFVMGVEDNASSRSLP 299

Query: 1837 DLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCKH 2016
            D+SV+ + G  N+ANGGG   +PKKFSSRSIGF  + +S +LG  R+MYRGRSAPLTCK 
Sbjct: 300  DISVNQIAGNNNVANGGGS-TKPKKFSSRSIGF--SPASASLGVSRSMYRGRSAPLTCKV 356

Query: 2017 TSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEA 2175
            TSSLAAVMAQ+LSHRATHVWVTE E+++ L+G+VG+ DI+AAVT+QP  +  A
Sbjct: 357  TSSLAAVMAQMLSHRATHVWVTEDESDDILVGVVGYADIMAAVTKQPAPITPA 409


>ref|XP_003534522.1| PREDICTED: CBS domain-containing protein CBSX6-like [Glycine max]
          Length = 425

 Score =  548 bits (1412), Expect = e-153
 Identities = 281/418 (67%), Positives = 335/418 (80%), Gaps = 2/418 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVF+YHVVGDLTVGKP            +AIRAIGE  EG IP+WKKRS   + EN++
Sbjct: 1    MASVFVYHVVGDLTVGKPELAEFHESETVESAIRAIGECHEGTIPIWKKRSQLGI-ENSD 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGIL+S DIVAFLAK +CL DQDKA+KTPVSEVVV N+SLL+ VDPATRLIDAL
Sbjct: 60   MRQQRFVGILSSFDIVAFLAKSQCLEDQDKALKTPVSEVVVHNNSLLRVVDPATRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSG--ANSSAASINWPTSSSAT 1470
            +MMKQGVKRLLVPKS+ WKG+SKRFSV++ GKWLKN +  G  +N+   ++N   S+S T
Sbjct: 120  DMMKQGVKRLLVPKSVAWKGMSKRFSVIYYGKWLKNSESPGNSSNNLPLNMNRSPSTSIT 179

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
             IRD++CCLSREDV+RF+IGCLGALAPLPLT I+SLGAIN NY+YIE+S PAI++T+KLP
Sbjct: 180  PIRDRYCCLSREDVLRFIIGCLGALAPLPLTSIASLGAINSNYNYIESSTPAIEATQKLP 239

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
            QDP AVAV+E  SDGQ KIIG+ISACKLWKCDY            GQFVMGVEDN+T  S
Sbjct: 240  QDPSAVAVIESTSDGQCKIIGEISACKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPRS 299

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTC 2010
            +P  S+   +GE +LANGGG   +P+KFSSRS+GFFSNT+S + G+ R+MYRGRSAPLTC
Sbjct: 300  LPQFSLDLASGENDLANGGGS-RKPRKFSSRSVGFFSNTASHSFGS-RSMYRGRSAPLTC 357

Query: 2011 KHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEAQTS 2184
            K TSSLAAV+AQ+LSHRATHVWVTE EN+E L+G+VG+ DILAAVT+ PT    A  S
Sbjct: 358  KITSSLAAVLAQMLSHRATHVWVTEDENDEVLVGVVGYADILAAVTKPPTAFIPANRS 415


>ref|XP_006480403.1| PREDICTED: CBS domain-containing protein CBSX6-like [Citrus sinensis]
          Length = 433

 Score =  546 bits (1408), Expect = e-152
 Identities = 281/421 (66%), Positives = 328/421 (77%), Gaps = 8/421 (1%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVF+YHVVGDLTVGKP            AAI+AIGESTE GIPVWKK++   + EN E
Sbjct: 1    MASVFIYHVVGDLTVGKPELAEFYETETVEAAIKAIGESTECGIPVWKKKTHVGIIENGE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNS DIVAFLAK +CL DQDKAMKTPVS+V+VPN+SLLK+VDP TRLIDAL
Sbjct: 61   MRQQRFVGILNSFDIVAFLAKSDCLEDQDKAMKTPVSQVIVPNNSLLKQVDPGTRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSS--AASINWPTSSSAT 1470
            EMMKQGV+RLLVPKS+ WKG+SKRFS+L+NGKWLKN+D S ++S+   A+ N P+SSS T
Sbjct: 121  EMMKQGVRRLLVPKSVVWKGMSKRFSILYNGKWLKNMDASNSSSNNLIANANRPSSSSTT 180

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
             +RDKFCCLSREDVIRFLIGCLGALAPLPL+ ISSLG INPNYS IEAS PAI++T K P
Sbjct: 181  SVRDKFCCLSREDVIRFLIGCLGALAPLPLSSISSLGVINPNYSSIEASVPAIEATLKPP 240

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
             DP A+AV+EP S+ Q+KIIG+ISA KLWKCDY            GQFVMGVEDN+T  S
Sbjct: 241  GDPSAIAVLEPTSEDQYKIIGEISASKLWKCDYLAAAWALANLSAGQFVMGVEDNVTPRS 300

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGF------FSNTSSPNLGAFRNMYRGR 1992
             PD S +S   E N  NG G   RP+KF SRSIGF       + + SP+ G  R+MYRGR
Sbjct: 301  FPDYSANSTLRENNTVNGVGS-TRPRKFCSRSIGFNPSSPCLAASRSPSFGTGRSMYRGR 359

Query: 1993 SAPLTCKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPE 2172
            S PLTCK TSSLAAVMAQ+LSHRATHVWVTE E+++ L+G+VG+ DIL AVT+QP  +  
Sbjct: 360  STPLTCKITSSLAAVMAQMLSHRATHVWVTEDESDDVLVGVVGYADILVAVTKQPAALTP 419

Query: 2173 A 2175
            A
Sbjct: 420  A 420


>ref|XP_003552437.1| PREDICTED: CBS domain-containing protein CBSX6-like [Glycine max]
          Length = 425

 Score =  543 bits (1400), Expect = e-151
 Identities = 280/418 (66%), Positives = 333/418 (79%), Gaps = 2/418 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVF+YHVVGDLTVGKP            +AIRAIGES EG IP+WKKRS   + EN++
Sbjct: 1    MASVFVYHVVGDLTVGKPELAEFHESETVESAIRAIGESPEGSIPIWKKRSQLGI-ENSD 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGIL+S DIVAFLAK  CL DQDKA+KTPVSEVVV N+SLL+ VDPATRLIDAL
Sbjct: 60   MRQQRFVGILSSFDIVAFLAKSRCLEDQDKALKTPVSEVVVHNNSLLRVVDPATRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSG--ANSSAASINWPTSSSAT 1470
            +MMKQGVKRLLVPKSI WKG+SKRFSV++ GKWLKN +  G  +N+   S+N   S+S T
Sbjct: 120  DMMKQGVKRLLVPKSIAWKGMSKRFSVIYYGKWLKNSESPGNSSNNLPLSMNRSPSTSVT 179

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
             I DK+CCLSREDV+RF+IGCLGALAPLPLT I++L AIN NY+YIE+S PAI++T+KLP
Sbjct: 180  PIPDKYCCLSREDVLRFIIGCLGALAPLPLTSIAALEAINSNYNYIESSTPAIEATQKLP 239

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
            QDP AVAV+E  SDGQ KIIG+ISACKLWKCDY            GQFVMGVEDN+T  S
Sbjct: 240  QDPSAVAVIESASDGQCKIIGEISACKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPRS 299

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTC 2010
            +P+ S+ S +G+ +L N GG   +P+KFSSRS+GFFSN++S N  + R+MYRGRSAPLTC
Sbjct: 300  LPEFSLDSPSGDIDLVNSGGS-RKPRKFSSRSVGFFSNSASHNFSS-RSMYRGRSAPLTC 357

Query: 2011 KHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEAQTS 2184
            K TSSLAAV+AQ+LSHRATHVWVTE EN+E L+G+VG+ DILAAVT+ PTT   A  S
Sbjct: 358  KITSSLAAVLAQMLSHRATHVWVTEDENDEVLVGVVGYADILAAVTKPPTTFIPANRS 415


>ref|XP_002325465.1| CBS domain-containing family protein [Populus trichocarpa]
            gi|222862340|gb|EEE99846.1| CBS domain-containing family
            protein [Populus trichocarpa]
          Length = 422

 Score =  535 bits (1379), Expect = e-149
 Identities = 277/413 (67%), Positives = 329/413 (79%), Gaps = 3/413 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            M SVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVWK++S   + EN+E
Sbjct: 1    MPSVFLYHVVGDLTVGKPEMVEFYETETVESAIRAIGESTECGIPVWKRKSHVGMIENSE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
             R QRFVGILNSLDIVAFLA  ECL D+DKA+KTPVS+VVVPN+SLLK+VDPATRLIDAL
Sbjct: 61   TRLQRFVGILNSLDIVAFLASTECLEDRDKAIKTPVSQVVVPNTSLLKQVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSS---AASINWPTSSSA 1467
            EMMKQGV+RL+VPKS+GWKG+SKRFS+L+NGKWLKN D S ++S+     + N P+SSS 
Sbjct: 121  EMMKQGVRRLIVPKSMGWKGMSKRFSILYNGKWLKNADTSNSSSNNNLTINPNRPSSSSG 180

Query: 1468 TIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKL 1647
            T  RDKFCCLSREDVIRFLIGCLGALAPLPL+ ISSLGAIN NY+ +EAS PAI++T+KL
Sbjct: 181  TSNRDKFCCLSREDVIRFLIGCLGALAPLPLSSISSLGAINTNYNSLEASLPAIEATRKL 240

Query: 1648 PQDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSM 1827
            P+DP A+AVVEP+ +GQ KIIG+ISA +LWKCDY            GQFVMGVEDN+TS 
Sbjct: 241  PEDPSAIAVVEPIPNGQCKIIGEISASRLWKCDYLAAAWALANLSAGQFVMGVEDNVTSR 300

Query: 1828 SVPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLT 2007
            S+PD +V+S   + N A+G G   R +KFSSRSIGF    S   +G  R++YRGRSAPLT
Sbjct: 301  SLPDFAVNSAADDDNTAHGAGS-TRLRKFSSRSIGFNPGNS---IGIGRSVYRGRSAPLT 356

Query: 2008 CKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTV 2166
            CK TSSLAAVMAQ+LSHRATHVWV E  +++ L+G+VG+ DILAAVT+QP +V
Sbjct: 357  CKITSSLAAVMAQMLSHRATHVWVIEDHSDDILVGVVGYADILAAVTKQPASV 409


>ref|XP_004492827.1| PREDICTED: CBS domain-containing protein CBSX6-like [Cicer arietinum]
          Length = 425

 Score =  534 bits (1376), Expect = e-149
 Identities = 278/418 (66%), Positives = 334/418 (79%), Gaps = 2/418 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASV ++HVVGDLTVGKP            +AIRAI ES EG IP+WKKRS   V +N++
Sbjct: 1    MASVLVHHVVGDLTVGKPELVEFHVTETVESAIRAIAESPEGSIPIWKKRSH-GVIDNSD 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQ RFVGIL+S DIVAFLAK + L DQDKA+K PVSEVV+PN+SLL+ VDP TRLIDAL
Sbjct: 60   MRQMRFVGILSSFDIVAFLAKTQFLEDQDKALKMPVSEVVLPNNSLLRLVDPGTRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDP--SGANSSAASINWPTSSSAT 1470
            +MMKQGVKRLLVPKSI WKG+SKRFSV++  KW KN +   S +N+  A+++   SSS+T
Sbjct: 120  DMMKQGVKRLLVPKSILWKGMSKRFSVIYYDKWNKNHESPTSSSNNLLANLS-RNSSSST 178

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
             IRDK+CCLSREDV+RF+IGCLGALAPLPLT IS+LGAINPNYSYIE+S PA+++T+K+P
Sbjct: 179  SIRDKYCCLSREDVLRFIIGCLGALAPLPLTSISTLGAINPNYSYIESSTPALEATQKVP 238

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
            QDP AVAV+E   DGQ KIIG+ISAC+LWKCDY            GQFVMGVEDN+TS S
Sbjct: 239  QDPSAVAVIESTPDGQCKIIGEISACRLWKCDYLGAAWALANLSAGQFVMGVEDNVTSRS 298

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTC 2010
            +P+L V+S TG+ NL NGGG   +PKKFSSRSIGFF+N++S N G+ R+MYRGRSAPLTC
Sbjct: 299  LPELCVNSKTGDNNLVNGGGS-RKPKKFSSRSIGFFNNSASHNFGS-RSMYRGRSAPLTC 356

Query: 2011 KHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEAQTS 2184
            K TSSLAAVMAQ+LSHRATHVWVTE EN++ L+G+VG+ DILAAVT+ PT    A  S
Sbjct: 357  KMTSSLAAVMAQMLSHRATHVWVTEDENDDVLVGVVGYADILAAVTKPPTIFIAANKS 414


>ref|XP_004302150.1| PREDICTED: CBS domain-containing protein CBSX6-like [Fragaria vesca
            subsp. vesca]
          Length = 426

 Score =  531 bits (1369), Expect = e-148
 Identities = 271/411 (65%), Positives = 326/411 (79%), Gaps = 1/411 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            AAI+AIG+STE GIPVWK++    +  N E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFCETESVEAAIKAIGDSTECGIPVWKRKPHAGIETN-E 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNSLDIVAF AK EC+ D DKA+ TPV+EVV PN+SLL++VDP TRLIDAL
Sbjct: 60   MRQQRFVGILNSLDIVAFFAKKECMEDHDKALNTPVAEVVAPNNSLLRQVDPGTRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGA-NSSAASINWPTSSSATI 1473
            EMMK GV+RLLV KS+ W+G+SKRFS+L+NGKWLKN D SG+ N+  AS N P+SSS T 
Sbjct: 120  EMMKHGVRRLLVRKSVVWQGMSKRFSILYNGKWLKNADTSGSSNNLGASSNRPSSSSTTS 179

Query: 1474 IRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQ 1653
             RDKFCCLSREDVIRFLIGCLGALAPLPL+ ISS+GAINPNY  IEA+ PAI++TKKLP+
Sbjct: 180  ARDKFCCLSREDVIRFLIGCLGALAPLPLSSISSVGAINPNYKSIEATSPAIEATKKLPE 239

Query: 1654 DPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSV 1833
            DP A+AV+E + DGQ+KIIG+ISA KLWKCD+            GQFVMGVEDN++S S+
Sbjct: 240  DPSAIAVIEQLQDGQYKIIGEISASKLWKCDHLAAAWALANLSGGQFVMGVEDNMSSRSL 299

Query: 1834 PDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCK 2013
              ++V+   G  +LANGG    RPKKFSS+SIGF  + ++P+  A R MYRGRSAPLTCK
Sbjct: 300  KVIAVNQAAGNNDLANGGES-TRPKKFSSKSIGF--DPANPSFAASRTMYRGRSAPLTCK 356

Query: 2014 HTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTV 2166
             TSSLAAVMAQ+LSHRATHVWVTE E+++ L+G+VG+ DI+ AVT+QP +V
Sbjct: 357  VTSSLAAVMAQMLSHRATHVWVTEDESDDVLVGVVGYADIMVAVTKQPASV 407


>gb|EOY09904.1| Cystathionine beta-synthase family protein isoform 1 [Theobroma
            cacao]
          Length = 424

 Score =  526 bits (1355), Expect = e-146
 Identities = 274/411 (66%), Positives = 320/411 (77%), Gaps = 1/411 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVWK+RS   + E  E
Sbjct: 1    MASVFLYHVVGDLTVGKPELVEFSETETVESAIRAIGESTECGIPVWKRRSHVGMIEKNE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGIL SLDIV+FLA+ +CL DQDKAMK  VS+VVVPN++LLK VDP TRLIDAL
Sbjct: 61   MRQQRFVGILTSLDIVSFLARTQCLEDQDKAMKAQVSDVVVPNNALLKIVDPGTRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASINWPTSSSA-TI 1473
            EMMKQGV+RLLVPKS  WKG+SKRFS+L+NGKWLKNI+   +N+   + N P+SSS  T 
Sbjct: 121  EMMKQGVRRLLVPKSKVWKGMSKRFSILYNGKWLKNIENGSSNNLITNANPPSSSSTPTY 180

Query: 1474 IRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQ 1653
            +RDKFCCLSRED+IRFLIGCLGALAP+PL+ ISSLGAIN NYS IEAS PA+++T+K P 
Sbjct: 181  MRDKFCCLSREDIIRFLIGCLGALAPVPLSSISSLGAINLNYSSIEASLPALEATQKHPG 240

Query: 1654 DPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSV 1833
            DP AVAVVE   DG  KI+G+ISA KLWKCDY            GQFVMGVEDN++S  +
Sbjct: 241  DPSAVAVVEVTPDGHHKILGEISASKLWKCDYLAAAWALANLSAGQFVMGVEDNVSSRLL 300

Query: 1834 PDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCK 2013
            PD SV+S   +  + NG G   RP+KFSSRSIGF  N  SP+ G  R+MYRGRSAPLTCK
Sbjct: 301  PDFSVNSAVQDNKIVNGVGS-TRPRKFSSRSIGF--NPVSPSFGVGRSMYRGRSAPLTCK 357

Query: 2014 HTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTV 2166
             TSSLAAVMAQ+LSHRATHVWVTE EN++ L+G+VG+ DIL AVT+QP  +
Sbjct: 358  TTSSLAAVMAQMLSHRATHVWVTEDENDDILVGVVGYADILVAVTKQPAAM 408


>gb|AFK42288.1| unknown [Medicago truncatula]
          Length = 422

 Score =  521 bits (1343), Expect = e-145
 Identities = 269/408 (65%), Positives = 324/408 (79%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASV +YHVVGDLTVGKP            +AIRAI ES EG IPVWKKRS + V EN++
Sbjct: 1    MASVLVYHVVGDLTVGKPELVEFHETETVESAIRAIAESPEGSIPVWKKRS-QGVIENSD 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQ RFVGIL+S D+V FLAK  CL DQDKA+KTPVSE VV N+ LLK VDP TRLIDAL
Sbjct: 60   MRQTRFVGILSSFDVVGFLAKSSCLEDQDKALKTPVSEFVVRNNYLLKLVDPGTRLIDAL 119

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASINWPTSSSATII 1476
            +MMKQGVKRLLVPKSI WKG+SKRFSV+++GKWLKN +   ++++  S+N   ++SA+I 
Sbjct: 120  DMMKQGVKRLLVPKSIVWKGMSKRFSVIYHGKWLKNPESPSSSNNNLSVNLNGNTSASI- 178

Query: 1477 RDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLPQD 1656
            RDK+CCLSREDV+RF+IGCLGALAP+PLT I++LGAINPNYSYIE+S PA++ST+K+ QD
Sbjct: 179  RDKYCCLSREDVLRFIIGCLGALAPIPLTSIAALGAINPNYSYIESSTPALESTQKVLQD 238

Query: 1657 PCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMSVP 1836
            P AVAV+E MSDGQ KIIG+ISA KLWKCDY            GQFVMGVEDN+T  S P
Sbjct: 239  PSAVAVIESMSDGQCKIIGEISAIKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPGSPP 298

Query: 1837 DLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCKH 2016
            DL ++    + +L NGGGG  + KKFSSRSIGFFSN+ S + G+ R+M+RGRS PLTCK 
Sbjct: 299  DLCINP-GADNDLVNGGGGSRKLKKFSSRSIGFFSNSPSNSFGS-RSMFRGRSTPLTCKM 356

Query: 2017 TSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPT 2160
            TSSLAAVMAQ+LSHRATHVWVTE EN++ L+G+VG+ DIL AVT+ PT
Sbjct: 357  TSSLAAVMAQMLSHRATHVWVTEDENDDVLVGVVGYADILGAVTKPPT 404


>ref|XP_004152831.1| PREDICTED: CBS domain-containing protein CBSX6-like [Cucumis sativus]
            gi|449477694|ref|XP_004155096.1| PREDICTED: CBS
            domain-containing protein CBSX6-like isoform 1 [Cucumis
            sativus] gi|449477697|ref|XP_004155097.1| PREDICTED: CBS
            domain-containing protein CBSX6-like isoform 2 [Cucumis
            sativus]
          Length = 425

 Score =  520 bits (1340), Expect = e-145
 Identities = 273/418 (65%), Positives = 322/418 (77%), Gaps = 2/418 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP             AIR IGESTE G+P+WK+++   + ENAE
Sbjct: 1    MASVFLYHVVGDLTVGKPEMTEFYETETIETAIRVIGESTECGVPIWKRKTHVGIIENAE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            ++QQRFVGIL+SLDIVAFLA+ E L DQ++AMK PVSE VVPN SLL++VDPATRLIDAL
Sbjct: 61   MKQQRFVGILSSLDIVAFLARSENLEDQERAMKAPVSEAVVPNYSLLRQVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSSAASI--NWPTSSSAT 1470
            EMMKQGV+RLL+ KS+ WKG+SKRFS+L+NGKWLKNID  G +S+  ++  N P+SSS +
Sbjct: 121  EMMKQGVRRLLIRKSVVWKGMSKRFSILYNGKWLKNIDTPGNSSNNLNLNPNRPSSSSTS 180

Query: 1471 IIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKLP 1650
               DKFCCLSREDVIRFLIGCLGALAPLPL+ IS+L AINPNY  I+AS PAID + KLP
Sbjct: 181  TSHDKFCCLSREDVIRFLIGCLGALAPLPLSSISTLEAINPNYCSIDASTPAIDISHKLP 240

Query: 1651 QDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSMS 1830
             DP AVAVVE + D Q++IIG+ISA KLWKC+Y            GQFVMGVEDN+TS  
Sbjct: 241  DDPVAVAVVENIHDNQYRIIGEISASKLWKCNYLAAAWALANLSAGQFVMGVEDNMTSRM 300

Query: 1831 VPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLTC 2010
            VPDLS +    E + ANGGG   R +KFSSRSIGF  N  S      R+MYRGRSAPLTC
Sbjct: 301  VPDLSTNGNVDENDSANGGGA-TRARKFSSRSIGF--NPLSRAFRINRSMYRGRSAPLTC 357

Query: 2011 KHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTVPEAQTS 2184
            K TSSLAAVMAQ+LSHRA+HVWVTE EN++ L+G+VG+ DILAAVT+QPT+   A  S
Sbjct: 358  KVTSSLAAVMAQMLSHRASHVWVTEDENDDILVGVVGYADILAAVTKQPTSFIPANRS 415


>ref|XP_002329005.1| predicted protein [Populus trichocarpa]
            gi|566200185|ref|XP_002319814.2| CBS domain-containing
            family protein [Populus trichocarpa]
            gi|550325290|gb|EEE95737.2| CBS domain-containing family
            protein [Populus trichocarpa]
          Length = 424

 Score =  520 bits (1340), Expect = e-145
 Identities = 277/413 (67%), Positives = 326/413 (78%), Gaps = 3/413 (0%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVWK++S  S+ E +E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFYETETVESAIRAIGESTECGIPVWKRKSHVSMIETSE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLIDAL 1296
            +RQQRFVGILNSLDIVAFLA  ECL DQDKA+KT VS+VVVPN+SLLK+VDPATRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLASTECLEDQDKAIKTSVSQVVVPNASLLKQVDPATRLIDAL 120

Query: 1297 EMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSS---AASINWPTSSSA 1467
            EMMKQGV+RLLVPKS+ WKG+SKRFS L+NGKWLKN D S  +S+     + N P+SSS 
Sbjct: 121  EMMKQGVRRLLVPKSMVWKGMSKRFSFLYNGKWLKNADASNNSSNNNLTINTNRPSSSSG 180

Query: 1468 TIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKKL 1647
            T  R+KFCCLSREDVIRFLIGCLGALAPLPL+ ISSLG INPNY+ +EAS PA ++T+KL
Sbjct: 181  TSNRNKFCCLSREDVIRFLIGCLGALAPLPLSSISSLGVINPNYTSVEASLPAFEATRKL 240

Query: 1648 PQDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNITSM 1827
              DP  VAVVEP+ DGQ KIIG+ISA +LWKCDY            GQFVMGVEDN T+ 
Sbjct: 241  HGDPSEVAVVEPIPDGQCKIIGEISASRLWKCDYLAAAWALANLSAGQFVMGVEDNETAR 300

Query: 1828 SVPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAPLT 2007
            S+ D +V+S  G+ + ANG G   R ++FSSRSIG F+  SS  +G  R+MYRGRSAPLT
Sbjct: 301  SLLDFAVNSAVGDESTANGIGS-TRLREFSSRSIG-FNPGSSIRMG--RSMYRGRSAPLT 356

Query: 2008 CKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPTTV 2166
            CK TSSLAAVMAQ+LSHRATHVWV E ++++ L+G+VG+ DILAAVT+QP +V
Sbjct: 357  CKITSSLAAVMAQMLSHRATHVWVIEDDSDDILVGVVGYADILAAVTKQPASV 409


>gb|EXC29924.1| CBS domain-containing protein CBSX6 [Morus notabilis]
          Length = 453

 Score =  512 bits (1319), Expect = e-142
 Identities = 275/449 (61%), Positives = 326/449 (72%), Gaps = 33/449 (7%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASVFLYHVVGDLTVGKP            +AIRAIGESTEGGI VWKKRS   V E AE
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFCETETVESAIRAIGESTEGGIAVWKKRSVVGVIEKAE 60

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATR----- 1281
            +RQQRFVGILNSLDIVAFLA+ ECL +QDKA+KTP+S+VVVPN+SLL++VDPATR     
Sbjct: 61   MRQQRFVGILNSLDIVAFLARAECLENQDKALKTPISDVVVPNNSLLRQVDPATRFESSA 120

Query: 1282 --------------------------LIDALEMMKQGVKRLLVPKSIGWKGVSKRFSVLF 1383
                                      LIDALEMM+QGVKRLLV KS+ WKG+SKRFS+L+
Sbjct: 121  CLPHMNTCLHQQSRACEQQYLIKDLKLIDALEMMRQGVKRLLVRKSVVWKGMSKRFSILY 180

Query: 1384 NGKWLKNIDPSGANSS--AASINWPTSSSATIIRDKFCCLSREDVIRFLIGCLGALAPLP 1557
            NGKWLKNID SG++S+    + NW +SS+ T  RDKFCCLSREDV+RF+IGCLGALAPLP
Sbjct: 181  NGKWLKNIDASGSSSNNLLPNPNWLSSSTTTSSRDKFCCLSREDVLRFIIGCLGALAPLP 240

Query: 1558 LTPISSLGAINPNYSYIEASQPAIDSTKKLPQDPCAVAVVEPMSDGQFKIIGDISACKLW 1737
            L+ IS+LGAINPNY  IEAS PAI+   KLP+DP A+AVVE     + KIIG+ISA KLW
Sbjct: 241  LSSISTLGAINPNYYSIEASSPAIEVALKLPEDPSAIAVVERTVRDRCKIIGEISASKLW 300

Query: 1738 KCDYFXXXXXXXXXXXGQFVMGVEDNITSMSVPDLSVSSLTGETNLANGGGGPARPKKFS 1917
            KCD+            GQFVMGVEDN++S S+PD  ++   G  NLANG G   R ++FS
Sbjct: 301  KCDHLAAAWALANLSAGQFVMGVEDNVSSRSLPDFCLNPAAGNNNLANGSGS-TRSRRFS 359

Query: 1918 SRSIGFFSNTSSPNLGAFRNMYRGRSAPLTCKHTSSLAAVMAQLLSHRATHVWVTEAENE 2097
            SRSIGF     +P   +F +MYRGRSAPLTCK TSSLAAVMAQ+LSHRATHVWV E  ++
Sbjct: 360  SRSIGF-----NPGNSSFGSMYRGRSAPLTCKVTSSLAAVMAQMLSHRATHVWVIEDGSD 414

Query: 2098 ENLIGIVGFVDILAAVTRQPTTVPEAQTS 2184
            + L+G+VG+ DILAAVT+QP ++  A  S
Sbjct: 415  DVLVGVVGYSDILAAVTKQPASIAAANRS 443


>ref|XP_003624141.1| hypothetical protein MTR_7g079680 [Medicago truncatula]
            gi|355499156|gb|AES80359.1| hypothetical protein
            MTR_7g079680 [Medicago truncatula]
          Length = 437

 Score =  511 bits (1317), Expect = e-142
 Identities = 269/423 (63%), Positives = 324/423 (76%), Gaps = 15/423 (3%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVPENAE 1116
            MASV +YHVVGDLTVGKP            +AIRAI ES EG IPVWKKRS + V EN++
Sbjct: 1    MASVLVYHVVGDLTVGKPELVEFHETETVESAIRAIAESPEGSIPVWKKRS-QGVIENSD 59

Query: 1117 IRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPAT------ 1278
            +RQ RFVGIL+S D+V FLAK  CL DQDKA+KTPVSE VV N+ LLK VDP T      
Sbjct: 60   MRQTRFVGILSSFDVVGFLAKSSCLEDQDKALKTPVSEFVVRNNYLLKLVDPGTSLESVA 119

Query: 1279 ---------RLIDALEMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANSS 1431
                     RLIDAL+MMKQGVKRLLVPKSI WKG+SKRFSV+++GKWLKN +   ++++
Sbjct: 120  TVHFLRTEQRLIDALDMMKQGVKRLLVPKSIVWKGMSKRFSVIYHGKWLKNPESPSSSNN 179

Query: 1432 AASINWPTSSSATIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIE 1611
              S+N   ++SA+I RDK+CCLSREDV+RF+IGCLGALAP+PLT I++LGAINPNYSYIE
Sbjct: 180  NLSVNLNGNTSASI-RDKYCCLSREDVLRFIIGCLGALAPIPLTSIAALGAINPNYSYIE 238

Query: 1612 ASQPAIDSTKKLPQDPCAVAVVEPMSDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQ 1791
            +S PA++ST+K+ QDP AVAV+E MSDGQ KIIG+ISA KLWKCDY            GQ
Sbjct: 239  SSTPALESTQKVLQDPSAVAVIESMSDGQCKIIGEISAIKLWKCDYLSAAWALANLSAGQ 298

Query: 1792 FVMGVEDNITSMSVPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAF 1971
            FVMGVEDN+T  S PDL ++    + +L NGGGG  + KKFSSRSIGFFSN+ S + G+ 
Sbjct: 299  FVMGVEDNVTPGSPPDLCINP-GADNDLVNGGGGSRKLKKFSSRSIGFFSNSPSNSFGS- 356

Query: 1972 RNMYRGRSAPLTCKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTR 2151
            R+M+RGRS PLTCK TSSLAAVMAQ+LSHRATHVWVTE EN++ L+G+VG+ DIL AVT+
Sbjct: 357  RSMFRGRSTPLTCKMTSSLAAVMAQMLSHRATHVWVTEDENDDVLVGVVGYADILGAVTK 416

Query: 2152 QPT 2160
             PT
Sbjct: 417  PPT 419


>ref|XP_006391564.1| hypothetical protein EUTSA_v10018605mg [Eutrema salsugineum]
            gi|557087998|gb|ESQ28850.1| hypothetical protein
            EUTSA_v10018605mg [Eutrema salsugineum]
          Length = 425

 Score =  493 bits (1270), Expect = e-136
 Identities = 269/414 (64%), Positives = 322/414 (77%), Gaps = 6/414 (1%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVP---E 1107
            MASVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVW+KRS+ S+P   E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFYETESVESAIRAIGESTECGIPVWRKRSTPSIPGFVE 60

Query: 1108 NAEIRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLI 1287
            N+E+RQ RFVGILNSLDIVAFLAK ECL  ++KAMK PVSEVV PN++LLK+VDP TRLI
Sbjct: 61   NSEMRQHRFVGILNSLDIVAFLAKSECL-QEEKAMKIPVSEVVCPNNTLLKQVDPGTRLI 119

Query: 1288 DALEMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANS-SAASINWPTSSS 1464
            DALEMMKQGV+RLLVPKS+ W+G+SKRFS+L+NGKWLKN + S ++S +A S N P +S 
Sbjct: 120  DALEMMKQGVRRLLVPKSVVWRGMSKRFSILYNGKWLKNSENSSSSSLTADSTNRPATSK 179

Query: 1465 ATIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTKK 1644
            A+  RDKFCCLSREDVIRFLIG LGALAPLPLT ISSL  IN NY++IEAS PAI++T++
Sbjct: 180  ASS-RDKFCCLSREDVIRFLIGVLGALAPLPLTSISSLDIINLNYNFIEASCPAIEATRR 238

Query: 1645 LPQDPCAVAVVEPM-SDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNIT 1821
             P DP A+AV+E   ++ QFKIIG+ISA KLWKCDY            GQFVMGVEDN++
Sbjct: 239  PPCDPSAIAVLEQTENEQQFKIIGEISASKLWKCDYLAAAWALANLYAGQFVMGVEDNMS 298

Query: 1822 SMSVPDLSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRSAP 2001
            S S  D   +S  GE N    G    + KKFSSRSIG F+ TS   L   R+MYRGRSAP
Sbjct: 299  SRSFSDFLQTSFVGEQN----GTSTTKAKKFSSRSIG-FNPTSPTRLSIGRSMYRGRSAP 353

Query: 2002 LTCKHTSSLAAVMAQLLSHRATHVWVTEAENEEN-LIGIVGFVDILAAVTRQPT 2160
            LTCK +SSLAAVMAQ+LSHRATHVWVTEA+++++ L+G+VG+ +IL AVT+QP+
Sbjct: 354  LTCKTSSSLAAVMAQMLSHRATHVWVTEADSDDDVLVGVVGYGEILTAVTKQPS 407


>ref|XP_002888360.1| hypothetical protein ARALYDRAFT_475589 [Arabidopsis lyrata subsp.
            lyrata] gi|297334201|gb|EFH64619.1| hypothetical protein
            ARALYDRAFT_475589 [Arabidopsis lyrata subsp. lyrata]
          Length = 424

 Score =  491 bits (1263), Expect = e-136
 Identities = 269/415 (64%), Positives = 322/415 (77%), Gaps = 7/415 (1%)
 Frame = +1

Query: 937  MASVFLYHVVGDLTVGKPXXXXXXXXXXXXAAIRAIGESTEGGIPVWKKRSSKSVP---E 1107
            MASVFLYHVVGDLTVGKP            +AIRAIGESTE GIPVW+KRS+ ++P   E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFYETETVESAIRAIGESTECGIPVWRKRSTPNLPGFVE 60

Query: 1108 NAEIRQQRFVGILNSLDIVAFLAKDECLADQDKAMKTPVSEVVVPNSSLLKEVDPATRLI 1287
            N+E+RQQRFVGILNSLDIVAFLAK ECL  ++KAMK PVSEVV P+++LLK+VDP TRLI
Sbjct: 61   NSEMRQQRFVGILNSLDIVAFLAKSECL-QEEKAMKIPVSEVVSPDNTLLKQVDPGTRLI 119

Query: 1288 DALEMMKQGVKRLLVPKSIGWKGVSKRFSVLFNGKWLKNIDPSGANS--SAASINWPTSS 1461
            DALEMMKQGV+RLLVPKS+ W+G+SKRFS+L+NGKWLKN + S ++S  +A S N PT +
Sbjct: 120  DALEMMKQGVRRLLVPKSVVWRGMSKRFSILYNGKWLKNSENSSSSSGLAADSTNRPT-T 178

Query: 1462 SATIIRDKFCCLSREDVIRFLIGCLGALAPLPLTPISSLGAINPNYSYIEASQPAIDSTK 1641
            S T  RDKFCCLSREDVIRFLIG LGALAPLPLT IS+LG IN NY++IEA  PAI++T+
Sbjct: 179  SMTSCRDKFCCLSREDVIRFLIGVLGALAPLPLTSISTLGIINQNYNFIEAYLPAIEATR 238

Query: 1642 KLPQDPCAVAVVEPM-SDGQFKIIGDISACKLWKCDYFXXXXXXXXXXXGQFVMGVEDNI 1818
            + P DP A+AV+E   ++ QFKIIG+ISA KLWKCDY            GQFVMGVEDN+
Sbjct: 239  RPPCDPSAIAVLEQTENEQQFKIIGEISASKLWKCDYLAAAWALANLYAGQFVMGVEDNM 298

Query: 1819 TSMSVPD-LSVSSLTGETNLANGGGGPARPKKFSSRSIGFFSNTSSPNLGAFRNMYRGRS 1995
            +S S  D L  S   GE N      G  + KKFSSRSIG F+ TS   L   R+MYRGRS
Sbjct: 299  SSRSFSDFLQTSFPGGEQN------GTTKAKKFSSRSIG-FNPTSPTRLSIGRSMYRGRS 351

Query: 1996 APLTCKHTSSLAAVMAQLLSHRATHVWVTEAENEENLIGIVGFVDILAAVTRQPT 2160
            APLTCK +SSLAAVMAQ+LSHRATHVWVTEA++++ L+G+VG+ +IL AVT+QP+
Sbjct: 352  APLTCKTSSSLAAVMAQMLSHRATHVWVTEADSDDVLVGVVGYGEILTAVTKQPS 406


Top