BLASTX nr result

ID: Zingiber23_contig00009725 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00009725
         (1454 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004982715.1| PREDICTED: CBS domain-containing protein CBS...   374   e-101
ref|NP_001146653.1| uncharacterized protein LOC100280253 [Zea ma...   371   e-100
gb|AAM93687.1| unknown protein [Oryza sativa Japonica Group] gi|...   370   e-100
ref|XP_003574134.1| PREDICTED: CBS domain-containing protein CBS...   362   2e-97
gb|ACG41914.1| hypothetical protein [Zea mays]                        355   3e-95
ref|NP_001141815.1| uncharacterized protein LOC100273954 [Zea ma...   354   6e-95
gb|EXB82787.1| CBS domain-containing protein CBSX5 [Morus notabi...   314   7e-83
ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBS...   311   6e-82
ref|XP_006467823.1| PREDICTED: CBS domain-containing protein CBS...   309   2e-81
gb|EXB76307.1| CBS domain-containing protein CBSX5 [Morus notabi...   308   3e-81
ref|XP_002269338.1| PREDICTED: CBS domain-containing protein CBS...   308   5e-81
ref|XP_003540455.1| PREDICTED: CBS domain-containing protein CBS...   307   6e-81
ref|XP_002528574.1| conserved hypothetical protein [Ricinus comm...   305   2e-80
ref|XP_006449313.1| hypothetical protein CICLE_v10015412mg [Citr...   305   3e-80
gb|ESW21594.1| hypothetical protein PHAVU_005G083300g [Phaseolus...   305   4e-80
ref|XP_006662481.1| PREDICTED: CBS domain-containing protein CBS...   303   1e-79
gb|EOY28290.1| CBS domain-containing protein [Theobroma cacao]        303   1e-79
ref|XP_006451922.1| hypothetical protein CICLE_v10008537mg [Citr...   302   3e-79
gb|EOY12875.1| Cystathionine beta-synthase family protein isofor...   301   4e-79
ref|XP_002519040.1| conserved hypothetical protein [Ricinus comm...   300   8e-79

>ref|XP_004982715.1| PREDICTED: CBS domain-containing protein CBSX5-like [Setaria italica]
          Length = 374

 Score =  374 bits (959), Expect = e-101
 Identities = 225/383 (58%), Positives = 271/383 (70%), Gaps = 4/383 (1%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV L++NEVSDLCIGKPAVRSLP SAA  G++ AALRR      A  V        RAV
Sbjct: 1    MAVSLLANEVSDLCIGKPAVRSLPLSAAA-GELAAALRRVARSGAAACVA--VTGPARAV 57

Query: 1192 SGKLCVADVICFLCSDGN-LASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALDLIL 1019
            +G++ +AD++CFLC++   LA PA AL +P+SALLPK GGG VRRV+P+ SILEALD IL
Sbjct: 58   AGRVGLADLLCFLCTEPEALARPAAALAKPVSALLPKDGGGEVRRVDPRSSILEALDAIL 117

Query: 1018 DGAQSLVVPIRPVGRKKNIHGGAGGAVADFCWLTHEDFVRFFLNSIAHFSPIPTLSIDAL 839
             GAQ L VP+R  GRKK + GGA  A ADFCWLT ED VR+FLNSI  F      S+ +L
Sbjct: 118  SGAQVLAVPLRAGGRKKQLIGGA--AAADFCWLTQEDLVRYFLNSIGLFYHAAARSVSSL 175

Query: 838  GLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDETIS 659
            GLVR+ D  +V   E  L+  PL+R A++  TAVAVVTEDG L+GEISPA L ACDET  
Sbjct: 176  GLVRT-DFLSVRPGESALSAAPLIRHAVATETAVAVVTEDGHLVGEISPALLAACDET-- 232

Query: 658  VAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXXXXXX 479
             AA IATL+V DLM++IDYFGSP + ++R +K+ LK+KGL  ML+L+EDE          
Sbjct: 233  AAAAIATLSVADLMAYIDYFGSPPEHILRAVKAGLKDKGLDAMLDLIEDETLSSFSSLSA 292

Query: 478  XXXSEDELGGKQLRRLRSRCFSMGRR-AEDPEVCHPGSSLVAAMVQALAHRVSYLWVV-D 305
               S++E G   LRR  S   S GRR AE+P VC P SSLVA MVQALAHRVSY+WV+ +
Sbjct: 293  SSSSDEETGRPLLRRPSSG--SYGRRSAEEPVVCSPASSLVAVMVQALAHRVSYVWVLEE 350

Query: 304  EDDYSLMGIVTFADMLRVFREQL 236
            EDD  L GIVTFAD+LRVFREQL
Sbjct: 351  EDDCRLAGIVTFADVLRVFREQL 373


>ref|NP_001146653.1| uncharacterized protein LOC100280253 [Zea mays]
            gi|219888199|gb|ACL54474.1| unknown [Zea mays]
          Length = 375

 Score =  371 bits (953), Expect = e-100
 Identities = 230/388 (59%), Positives = 271/388 (69%), Gaps = 8/388 (2%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRR---GDAPHLAVLVVDRAAPEK 1202
            MAV  ++NEVSDLCIGKPAVRSLP SAA  GD+ AALRR     AP   V V   A    
Sbjct: 1    MAVSFLANEVSDLCIGKPAVRSLPLSAAA-GDLAAALRRVARSGAPS-CVAVTGPA---- 54

Query: 1201 RAVSGKLCVADVICFLCSDGN-LASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALD 1028
            RAV G++ +ADV+CFLC+D   LA PAV   +P+SALLPK G G VRRV+P+ SILEALD
Sbjct: 55   RAVVGRVGLADVLCFLCTDPEALARPAVVFSKPVSALLPKDGAGEVRRVDPRSSILEALD 114

Query: 1027 LILDGAQSLVVPIRPVGRKKNIHGGAGGAVA-DFCWLTHEDFVRFFLNSIAHFSPIPTLS 851
             +L GAQ L VP+R   RKK + GG G A A DFCWLT ED VR+FLNSI  F  +   S
Sbjct: 115  AVLSGAQVLAVPLRAGWRKKQLGGGGGSAAAGDFCWLTQEDLVRYFLNSIGLFYHVAARS 174

Query: 850  IDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACD 671
            + +LGLVR+ D  +V   E  L+ +PL+RRA++  TAVAVVTEDG LLGEISPA L ACD
Sbjct: 175  VSSLGLVRT-DFLSVRPGEAALSAVPLIRRAVATETAVAVVTEDGHLLGEISPALLAACD 233

Query: 670  ETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXX 491
            ET   AA IATL+  DLM+++DYFGSP + + R IK+ LK+KGL  ML L+EDE      
Sbjct: 234  ET--AAAAIATLSAADLMAYVDYFGSPPEHISRAIKAGLKDKGLDAMLALVEDE--TLSS 289

Query: 490  XXXXXXXSEDELGGKQLRRLRSRCFSMGRR-AEDPEVCHPGSSLVAAMVQALAHRVSYLW 314
                   S++E G  QLRR  S   S GRR AE+P VC P SSLVA MVQALAHRVSY+W
Sbjct: 290  FSSASSSSDEEAGRTQLRRPSSG--SYGRRSAEEPVVCSPASSLVAVMVQALAHRVSYVW 347

Query: 313  VVDED-DYSLMGIVTFADMLRVFREQLL 233
            V+DED D  L GIVTFAD+LRVFREQLL
Sbjct: 348  VLDEDSDCRLAGIVTFADVLRVFREQLL 375


>gb|AAM93687.1| unknown protein [Oryza sativa Japonica Group]
            gi|31432888|gb|AAP54464.1| CBS domain-containing protein,
            putative, expressed [Oryza sativa Japonica Group]
            gi|125532527|gb|EAY79092.1| hypothetical protein
            OsI_34199 [Oryza sativa Indica Group]
          Length = 385

 Score =  370 bits (951), Expect = e-100
 Identities = 220/388 (56%), Positives = 270/388 (69%), Gaps = 9/388 (2%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRG--DAPHLAVLVVDRAAPEKR 1199
            MAVRL++NEVSDLCIGKPAVRSLP SAA  GD+ AALRRG   A   A   V    P  R
Sbjct: 1    MAVRLLANEVSDLCIGKPAVRSLPLSAAA-GDLAAALRRGPQQAAGGAAACVAVVGPG-R 58

Query: 1198 AVSGKLCVADVICFLCS-DGNLASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALDL 1025
            AV+G+L +ADV+CFLC+  G LA P  AL +P SALLPK G G VRRV+P+ S+LEALD 
Sbjct: 59   AVAGRLGLADVLCFLCAAPGALAHPTAALSKPASALLPKDGAGEVRRVDPRASVLEALDA 118

Query: 1024 ILDGAQSLVVPIRPVGRKKNIHGGAGGAVA-DFCWLTHEDFVRFFLNSIAHFSPIPTLSI 848
            +L GAQ L VP+R  GR+K + GG GG    D+CWLT ED VR+FLNSI+ FS +   S+
Sbjct: 119  VLSGAQVLAVPLRSGGRRKQLGGGGGGGGGGDYCWLTQEDLVRYFLNSISLFSHVAGRSV 178

Query: 847  DALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDE 668
             +LGLVR+ D  TV   E  L+ +PL+RRA++  TAVAVV + G L+GEISPA L +CDE
Sbjct: 179  SSLGLVRADDLLTVRPHEAALSAVPLLRRAIATETAVAVVDDGGHLVGEISPALLASCDE 238

Query: 667  TISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDEL--CXXX 494
            T   AA IATL+V DLM+++DYFG+P + ++R +K+ LK KGL  MLEL+E+E       
Sbjct: 239  T--AAAAIATLSVADLMAYVDYFGAPPEHILRAVKAGLKSKGLDAMLELVENEAVSSFAF 296

Query: 493  XXXXXXXXSEDELGGKQLRRLRSRCFSMGRRA-EDPEVCHPGSSLVAAMVQALAHRVSYL 317
                    S+DE  G+  R  R    S GRR+ E+P VC P SSLVA M+QALAHR SYL
Sbjct: 297  SSSSTSSSSDDEAHGRAARLRRPSSGSYGRRSTEEPVVCSPASSLVAVMMQALAHRASYL 356

Query: 316  WVVDE-DDYSLMGIVTFADMLRVFREQL 236
            WV+DE DD  L GIVTFAD+L VFREQL
Sbjct: 357  WVLDEDDDCRLAGIVTFADVLTVFREQL 384


>ref|XP_003574134.1| PREDICTED: CBS domain-containing protein CBSX5-like [Brachypodium
            distachyon]
          Length = 377

 Score =  362 bits (930), Expect = 2e-97
 Identities = 216/384 (56%), Positives = 263/384 (68%), Gaps = 5/384 (1%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV L++N+VSDLCIGKPAVR   P +A  GD+ AA+R+G  P  A  V       + AV
Sbjct: 1    MAVSLLANDVSDLCIGKPAVRRSLPLSAAAGDLAAAVRKG--PRAAAAVCAAVTGPRGAV 58

Query: 1192 SGKLCVADVICFLCSDGN-LASPAVALERPMSALLPKGG-GLVRRVEPQFSILEALDLIL 1019
             G+  +ADV+C LC+  + LA PA AL +P+SALLPK G G VRRV+P+ S+LEALD +L
Sbjct: 59   VGRAGLADVLCLLCASPDALARPAAALAKPVSALLPKDGEGEVRRVDPRSSVLEALDAVL 118

Query: 1018 DGAQSLVVPIRPVG-RKKNIHGGAGGAVADFCWLTHEDFVRFFLNSIAHFSPIPTLSIDA 842
            +GAQ L VP+R  G RKK + G A G   DFCWLT ED VR+FLNSI+ F  +   S+ +
Sbjct: 119  NGAQVLAVPLRSGGGRKKQLGGVAAGVAGDFCWLTQEDLVRYFLNSISLFYHVAARSVSS 178

Query: 841  LGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDETI 662
            LGLV SAD  +V  DE  L+ +PL+R +++  TAVAVV+ DG L+GEIS A L ACDET 
Sbjct: 179  LGLV-SADYLSVRPDEAALSAVPLIRASIAAETAVAVVSADGHLVGEISTAHLAACDET- 236

Query: 661  SVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXXXXX 482
              AA IATL+  DLM++IDYFGSP + ++R IK+ LK KGL  MLELMEDE         
Sbjct: 237  -AAAAIATLSAADLMAYIDYFGSPPEHILRSIKAGLKAKGLDAMLELMEDETMTSFSFSS 295

Query: 481  XXXXSEDELGGKQLRRLRSRCFSMGRRA-EDPEVCHPGSSLVAAMVQALAHRVSYLWVVD 305
                 ED  G   LRR  S  F  GRR+ E+P VC P SSLVA MVQALAHRVSYLWV+D
Sbjct: 296  SSSSDED-TGRAHLRRPSSGSF--GRRSTEEPVVCSPASSLVAVMVQALAHRVSYLWVLD 352

Query: 304  E-DDYSLMGIVTFADMLRVFREQL 236
            E DD  L GIVTFAD+LRVFREQL
Sbjct: 353  EDDDCRLAGIVTFADVLRVFREQL 376


>gb|ACG41914.1| hypothetical protein [Zea mays]
          Length = 374

 Score =  355 bits (910), Expect = 3e-95
 Identities = 215/384 (55%), Positives = 261/384 (67%), Gaps = 4/384 (1%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV  ++NEVSDLCIGKPAVRSLP SAAT GD+ AALRR      A  V        RAV
Sbjct: 1    MAVNFLANEVSDLCIGKPAVRSLPLSAAT-GDLAAALRRVSRSGAAACVA--VTGPARAV 57

Query: 1192 SGKLCVADVICFLCSDGN-LASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALDLIL 1019
             G++ +ADV+CFLC+D   LA PA    +P+SALLPK G G VRRV+P+ SILEALD +L
Sbjct: 58   VGRVGLADVLCFLCTDPEALARPAAVFAKPVSALLPKDGAGEVRRVDPRSSILEALDAVL 117

Query: 1018 DGAQSLVVPIRPVGRKKNIHGGAGGAVADFCWLTHEDFVRFFLNSIAHFSPIPTLSIDAL 839
             GAQ L VP+R  GRKK + G A     DFCWLT ED VR+FLN I     +   S+ +L
Sbjct: 118  SGAQVLAVPLRAGGRKKQLVGAADDG--DFCWLTQEDLVRYFLNYICLVYNVAARSVSSL 175

Query: 838  GLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDETIS 659
            GLVR AD  +V   E  L+ +PL+RRA++  TAVAVV EDG L+GEISPA L ACDET  
Sbjct: 176  GLVR-ADFLSVRPGEASLSAVPLIRRAVATETAVAVVAEDGHLVGEISPALLAACDET-- 232

Query: 658  VAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXXXXXX 479
             AA IATL+  DLM++ID++ SP + ++R +K+ LK+KGL  +L L+EDE          
Sbjct: 233  AAAAIATLSAADLMAYIDHYVSPPEHILRAVKAGLKDKGLDALLALVEDETLSSFSSLSA 292

Query: 478  XXXSEDELGGKQLRRLRSRCFSMGRRAED-PEVCHPGSSLVAAMVQALAHRVSYLWVVDE 302
               S++E G  QLRR  S   S GRRA D P VC P SSLVA +VQALAHRVSY+WV+DE
Sbjct: 293  SSSSDEEAGRAQLRRPSSG--SYGRRAADEPVVCSPASSLVAVLVQALAHRVSYVWVLDE 350

Query: 301  D-DYSLMGIVTFADMLRVFREQLL 233
            D D  L GIV FAD+LRVFR+QLL
Sbjct: 351  DNDCRLAGIVRFADVLRVFRDQLL 374


>ref|NP_001141815.1| uncharacterized protein LOC100273954 [Zea mays]
            gi|194706032|gb|ACF87100.1| unknown [Zea mays]
            gi|413933920|gb|AFW68471.1| hypothetical protein
            ZEAMMB73_518907 [Zea mays]
          Length = 374

 Score =  354 bits (908), Expect = 6e-95
 Identities = 218/385 (56%), Positives = 261/385 (67%), Gaps = 5/385 (1%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV  ++NE+SDLCIGKPAVRSLP SAAT GD+ AALRR      A  V        RAV
Sbjct: 1    MAVNFLANELSDLCIGKPAVRSLPLSAAT-GDLAAALRRVSRSGAAACVA--VTGPARAV 57

Query: 1192 SGKLCVADVICFLCSDGN-LASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALDLIL 1019
             G++  ADV+C LC+D   LA PA    +P+SALLPK G G VRRV+P+ SILEALD IL
Sbjct: 58   VGRVGPADVLCLLCTDPEALARPAAVFSKPVSALLPKDGAGEVRRVDPRSSILEALDAIL 117

Query: 1018 DGAQSLVVPIRPVGRKKNIHGGAGGAVADFCWLTHEDFVRFFLNSIAHFSPIPTLSIDAL 839
             GAQ L VP+R  GRKK + G A G   DFCWLT ED VR+FLN I     +   S+ +L
Sbjct: 118  SGAQVLAVPLRAGGRKKQLVGAADG---DFCWLTQEDLVRYFLNYICLVYNVAARSVSSL 174

Query: 838  GLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDETIS 659
            GLVR AD  +V   E  L+ +PL+RRA++  TAVAVV EDG L+GEISPA L ACDET  
Sbjct: 175  GLVR-ADFLSVRPGEAALSAVPLIRRAVATETAVAVVAEDGHLVGEISPALLAACDET-- 231

Query: 658  VAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXXXXXX 479
             AA IATL+  DLM++ID++ SP + ++R +K+ LK+KGL  +L L+EDE          
Sbjct: 232  AAAAIATLSAADLMAYIDHYVSPPEHILRAVKAGLKDKGLDALLALVEDETLSSFSSLSS 291

Query: 478  XXXSEDELGGK-QLRRLRSRCFSMGRRAED-PEVCHPGSSLVAAMVQALAHRVSYLWVVD 305
               S DE  G+ QLRR  S   S GRRA D P VC P SSLVA +VQALAHRVSY+WV+D
Sbjct: 292  ASSSSDEEAGRAQLRRPSSG--SYGRRAADEPVVCSPASSLVAVLVQALAHRVSYVWVLD 349

Query: 304  ED-DYSLMGIVTFADMLRVFREQLL 233
            ED D  L GIVTFAD+LRVFREQLL
Sbjct: 350  EDNDCRLAGIVTFADVLRVFREQLL 374


>gb|EXB82787.1| CBS domain-containing protein CBSX5 [Morus notabilis]
          Length = 389

 Score =  314 bits (804), Expect = 7e-83
 Identities = 191/395 (48%), Positives = 250/395 (63%), Gaps = 16/395 (4%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV L++ EVSDLC+GKPA+R+L  +A TVG+ L+AL+R    +L+V   D ++      
Sbjct: 1    MAVNLLAREVSDLCLGKPALRALLVTA-TVGEALSALKRLGQTYLSVWSCDHSSKIGTGG 59

Query: 1192 S-------GKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSILEA 1034
            S       GK+CVAD ICFLC + NL SPA AL+  +  L+PK  GLVR +EP  S+LEA
Sbjct: 60   SAGNCRCVGKVCVADAICFLCKEENLKSPATALQASVLVLIPKVPGLVRHLEPNASLLEA 119

Query: 1033 LDLILDGAQSLVVPIRPVGRKKNIH---------GGAGGAVADFCWLTHEDFVRFFLNSI 881
            +DLIL+GAQ+LV+PI+      N             A     ++CW+T ED +R+ LN I
Sbjct: 120  VDLILEGAQNLVIPIQTRSSSSNSRKNLLQKPSFNSARHDNREYCWITQEDIIRYLLNCI 179

Query: 880  AHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGE 701
              FSPIP   I+AL L+ S     V +D+P  + LPL+ +AL   T+VAVV  D +L+GE
Sbjct: 180  GLFSPIPASPINALNLIDSKHILAVNYDDPAASTLPLISQALVCQTSVAVVDTDSKLIGE 239

Query: 700  ISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLEL 521
            ISP +L +CDET  VAA IATL+ GDLM++ID  G P D LV+L+K RL+E+    +L+L
Sbjct: 240  ISPFTLNSCDET--VAAAIATLSAGDLMAYIDCGGPPED-LVQLVKDRLEERNCGALLDL 296

Query: 520  MEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFSMGRRAEDPEVCHPGSSLVAAMVQA 341
            ME++             S DE  G    R       M RR+E   VCHP SSLVA MVQA
Sbjct: 297  MEEDYSTISSSSSLCSSSSDEESG----RFGGNSARMVRRSE-AIVCHPWSSLVAVMVQA 351

Query: 340  LAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            LAHR+SY+WVV E D +LMGIVTFA ML+VFRE+L
Sbjct: 352  LAHRLSYMWVV-EADGTLMGIVTFAGMLKVFRERL 385


>ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
          Length = 389

 Score =  311 bits (796), Expect = 6e-82
 Identities = 188/394 (47%), Positives = 251/394 (63%), Gaps = 15/394 (3%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGD-APHLAVLVVDRAAPEKRA 1196
            MAV  ++ +VSDLC+GKP +RSL  +AATV D LAAL+  D   H+++        E R 
Sbjct: 1    MAVSFLARDVSDLCLGKPPLRSLS-AAATVADALAALKSSDHETHVSLWSFCENKNEVRC 59

Query: 1195 VSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSILEALDLILD 1016
            V GKLC+ DVIC+LC + NL SP+ AL+ P+S++LPK   LV  ++P  S+ EA+DLIL 
Sbjct: 60   V-GKLCMVDVICYLCREDNLLSPSKALKEPLSSILPKDQSLVVHLQPSSSLFEAIDLILQ 118

Query: 1015 GAQSLVVPIRPV------GRKKNIHGGAGGAV-----ADFCWLTHEDFVRFFLNSIAHFS 869
            GAQ+LVVPI P        RK+  H  A   +      +FCWLT ED +RF L SI  F+
Sbjct: 119  GAQNLVVPILPTKRSGVSRRKQQQHQKASSTINSHSSCEFCWLTQEDVIRFLLGSIGVFT 178

Query: 868  PIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPA 689
            P+P LSID+LG++ S+D   + +  P  + +  + ++L+  T+VA+V  DG  +GEISP 
Sbjct: 179  PLPALSIDSLGII-SSDVLAIDYYSPASSAVGAISKSLTQQTSVAIVDSDGTFIGEISPF 237

Query: 688  SLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQML-ELMED 512
            +L  CDET  VAA IATL+ GDLM++ID  G P D LVRL+K+RLKEK  ++ML E    
Sbjct: 238  TLACCDET--VAAAIATLSAGDLMAYIDCGGPPED-LVRLVKARLKEKNFEKMLQEFTIL 294

Query: 511  ELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFS--MGRRAEDPEVCHPGSSLVAAMVQAL 338
              C                  +  R  RS  +S  M R+AE   VCHP SSLVA M+QA+
Sbjct: 295  SSCESSQSTSSDEELPTRTPARSGRLARSSSYSARMVRKAE-AIVCHPKSSLVAVMIQAI 353

Query: 337  AHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            AHRV+YLWV+ EDD SL+GIVTF++ML+VFRE L
Sbjct: 354  AHRVNYLWVI-EDDCSLVGIVTFSNMLKVFREHL 386


>ref|XP_006467823.1| PREDICTED: CBS domain-containing protein CBSX5-like [Citrus sinensis]
          Length = 412

 Score =  309 bits (791), Expect = 2e-81
 Identities = 179/400 (44%), Positives = 256/400 (64%), Gaps = 21/400 (5%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAVRL+S  VSDLCIGKPA+RSL  S++TV D L+AL+R +  +++V   D +AP+++A 
Sbjct: 1    MAVRLLSVGVSDLCIGKPALRSLSVSSSTVADALSALKRLNESYISVWSCDHSAPKRKAT 60

Query: 1192 S----------------GKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRV 1061
            +                GK+C+ D+I FLC + NL +P  AL+ P+S LLP+  G++R +
Sbjct: 61   TADIDDHHQDSAACRCIGKVCMVDIISFLCKEENLLNPESALQDPVSVLLPEASGVIRHL 120

Query: 1060 EPQFSILEALDLILDGAQSLVVPIRPVGRKKNIHGGAGGAV---ADFCWLTHEDFVRFFL 890
            EP  S+LEA+DL+L G Q+LV+P+ P G K              +++CWLT ED +R+FL
Sbjct: 121  EPSASLLEAVDLLLGGVQNLVIPL-PAGTKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFL 179

Query: 889  NSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRL 710
            N I   SP P   I++L +V  A  F + +DEP    +PL+ ++    T+VA+V E+GRL
Sbjct: 180  NCIGLLSPTPNQPINSLNIVDDAGIFAIQYDEPAAFAIPLIAQSHINQTSVALVDEEGRL 239

Query: 709  LGEISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQM 530
            +G+ISP S  +CDET  VAA + TL+ GDLM+++D  G P   LVRL+K RL+EK +   
Sbjct: 240  VGDISPFSFNSCDET--VAAAMVTLSAGDLMAYMD-CGRPPKDLVRLVKQRLEEKSMVGF 296

Query: 529  LELMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFS--MGRRAEDPEVCHPGSSLVA 356
            LELMED+L            S+DE      + +RSR +S  +  R+E   +CHP SSL+A
Sbjct: 297  LELMEDDLEISSGSCPDSSSSDDESSTGSAQSVRSRGYSARVVHRSE-AILCHPWSSLMA 355

Query: 355  AMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
             ++QALA RVSY+WVV E+D +L+GIVTF  MLRV R++L
Sbjct: 356  VIMQALARRVSYVWVV-EEDCTLVGIVTFTGMLRVIRDRL 394


>gb|EXB76307.1| CBS domain-containing protein CBSX5 [Morus notabilis]
          Length = 401

 Score =  308 bits (790), Expect = 3e-81
 Identities = 195/412 (47%), Positives = 252/412 (61%), Gaps = 31/412 (7%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVD--RAAPEKR 1199
            MAV L+++++SDLC+ KPA+ SLP SA TV D LAAL+  D P L+V      R +PE  
Sbjct: 1    MAVSLLAHDISDLCLAKPALTSLPISA-TVADALAALKTSDDPFLSVWDCHHHRHSPESA 59

Query: 1198 AVS-------GKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSIL 1040
            A         GK+C+ DVICFLC D NL SP+ AL  P+S LLPK   LV  +EP  S+L
Sbjct: 60   AAGEWHCRCVGKVCMVDVICFLCRDDNLLSPSTALNTPVSDLLPKIPTLVTHLEPSSSLL 119

Query: 1039 EALDLILDGAQSLVVPIRP----VGRKKN------------IHGGAGGAVADFCWLTHED 908
            EA+DLIL GAQ+LVVPI+       R+K             +H G      +FCWLT ED
Sbjct: 120  EAIDLILQGAQNLVVPIKRRHSISSRRKQQQQPKISTATTTLHNGR-----EFCWLTQED 174

Query: 907  FVRFFLNSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVV 728
             VRF L+SI  FSPIP LSID+LG++ +    +  +  P +A L L+ R+L+  T+VAVV
Sbjct: 175  VVRFLLSSIGLFSPIPALSIDSLGIITTDGVLSTDYHSPAIAALGLISRSLADQTSVAVV 234

Query: 727  TEDGRLLGEISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKE 548
              DG L+GEISP +L +CDET   AA IATL+ GDLM++ID  G P D LVR++  RLKE
Sbjct: 235  DGDGVLIGEISPLTLASCDET--AAAAIATLSAGDLMAYIDGSGPPED-LVRVVNRRLKE 291

Query: 547  KGLKQMLE----LMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFS--MGRRAEDPE 386
            + L+ +++    L                     L     R  RS  +S  M RRAE   
Sbjct: 292  RNLEGIVDDFIVLSSQSSSSSSDEESSLASPTTTLARSAGRYSRSASYSARMARRAE-AI 350

Query: 385  VCHPGSSLVAAMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQLLP 230
            VCHP SSLVA M+QA+AHRV+Y+WVVD DD  L+GIVTF+ +L+VFRE  +P
Sbjct: 351  VCHPKSSLVAVMIQAIAHRVNYVWVVD-DDSCLVGIVTFSAILKVFREYTMP 401


>ref|XP_002269338.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
          Length = 384

 Score =  308 bits (788), Expect = 5e-81
 Identities = 181/396 (45%), Positives = 255/396 (64%), Gaps = 17/396 (4%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRA- 1196
            MAV L+ +EVSDLC+GKPA+RSLP SA TV D LAAL+R    +L+V   D  +   ++ 
Sbjct: 1    MAVSLLGHEVSDLCLGKPALRSLPVSA-TVADALAALKRSGDAYLSVWSCDHTSKINKSH 59

Query: 1195 -----VSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSILEAL 1031
                   GK+C+ DV+CFLC + NL+ P+ AL+ P+S LLPK  GLVR ++P   +LEA+
Sbjct: 60   LEDCRCIGKICMVDVVCFLCREDNLSCPSDALQSPLSLLLPKVPGLVRHLKPNSRLLEAI 119

Query: 1030 DLILDGAQSLVVPI--RPVGRKK---------NIHGGAGGAVADFCWLTHEDFVRFFLNS 884
            DL+L+GAQ++V+PI  R   RKK          +H G      +FCWLT ED VRF LNS
Sbjct: 120  DLMLEGAQNIVIPIQSRTNPRKKLVPKPSFNSTLHNGV-----EFCWLTQEDVVRFLLNS 174

Query: 883  IAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLG 704
            I  FSP+P L+I++L ++ + +  +V + +P  + L  + ++L   T+VAV+ ++ +L+G
Sbjct: 175  IGSFSPLPGLTIESLNIIDTENIPSVYYHDPASSALTAISQSLINQTSVAVLDQENKLVG 234

Query: 703  EISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLE 524
            EISP +L  CDET  VAA IATL+ GDLM++ID  G P D LV+L+K+RL+E+ L   L+
Sbjct: 235  EISPFTLACCDET--VAAAIATLSAGDLMAYIDCGGPPED-LVQLVKARLEERKLGAFLD 291

Query: 523  LMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFSMGRRAEDPEVCHPGSSLVAAMVQ 344
            LM++E                  GG      R     M RR+E   VC+P SSL+A M+Q
Sbjct: 292  LMDEEFSYSSSSSSDEEFGFGRRGGSGKYSAR-----MARRSE-AIVCYPWSSLMAVMIQ 345

Query: 343  ALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            ALAHRVSY+WV+ E+D+SL GIVTF+ + +VFR+ L
Sbjct: 346  ALAHRVSYVWVI-EEDWSLAGIVTFSGIFKVFRQHL 380


>ref|XP_003540455.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
          Length = 390

 Score =  307 bits (787), Expect = 6e-81
 Identities = 184/397 (46%), Positives = 251/397 (63%), Gaps = 18/397 (4%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAP-HLAVLVVDRAAPEKRA 1196
            MAV  ++ +VSDLC+GKP +RSL  +AATV D L AL+  D   H++V   +    E   
Sbjct: 1    MAVSFLARDVSDLCLGKPPLRSLS-AAATVADALDALKSSDGEIHVSVWSFEN---EVGR 56

Query: 1195 VSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSILEALDLILD 1016
              GKLC+ DVIC+LC + NL SP+ +L+ P+S++LPK   LV  ++P  S+LEA+DLIL 
Sbjct: 57   CVGKLCMVDVICYLCREDNLLSPSKSLKEPLSSILPKDHNLVVHLQPSSSLLEAIDLILQ 116

Query: 1015 GAQSLVVPIRP-----VGRKKNIHGGAGGAV-----ADFCWLTHEDFVRFFLNSIAHFSP 866
            GAQ+ VVPI P     V R+K  H  A   +      +FCWLT ED +RF L SI  F+P
Sbjct: 117  GAQNFVVPILPTKRSGVSRRKQQHQKASSTINSHSSCEFCWLTQEDVIRFLLGSIGVFTP 176

Query: 865  IPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPAS 686
            +P LSID+LG+V S+D   + +  P  + +  + ++L+  T+VA+V  DG  +GEISP +
Sbjct: 177  LPALSIDSLGIV-SSDVLAIDYYSPASSTVGAISKSLAQQTSVAIVDSDGTFIGEISPFT 235

Query: 685  LCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLE------ 524
            L  CDET  VAA +ATL+ GDLM++ID  G P D LVR++K+RLKEK L++ML+      
Sbjct: 236  LACCDET--VAAAMATLSAGDLMAYIDCGGPPED-LVRVVKARLKEKNLEKMLQEFTILS 292

Query: 523  -LMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFSMGRRAEDPEVCHPGSSLVAAMV 347
                 +L            +       +L R  S    M R+AE   VCHP SSLVA M+
Sbjct: 293  SCESSQLASSSSSSDEESTTRTPARSGRLARSSSYSARMVRKAE-AIVCHPKSSLVAVMI 351

Query: 346  QALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            QA+AHRV+YLWV+ EDD SL+GIVTF++ML+VFRE L
Sbjct: 352  QAIAHRVNYLWVI-EDDCSLVGIVTFSNMLKVFREHL 387


>ref|XP_002528574.1| conserved hypothetical protein [Ricinus communis]
            gi|223532018|gb|EEF33829.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 408

 Score =  305 bits (782), Expect = 2e-80
 Identities = 195/416 (46%), Positives = 258/416 (62%), Gaps = 37/416 (8%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAV------------L 1229
            MAV L ++EVSDLC+GKPA+RSLP +A TV + L+AL+  D   L+V             
Sbjct: 1    MAVSLFAHEVSDLCLGKPALRSLPVTA-TVAEALSALKNSDDSFLSVWNCDHITKRNSGF 59

Query: 1228 VVDRAAPEKRAVSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQF 1049
              DR   ++    GK+ + DVIC+LC D NL SP+ AL+ P+S LLPK  GLV  VEP  
Sbjct: 60   NCDREDRDECKCVGKVSIVDVICYLCQDKNLVSPSDALKDPVSVLLPKIPGLVMHVEPSS 119

Query: 1048 SILEALDLILDGAQSLVVPIR-----PVGRKK-------------NIHGGAGGAVADFCW 923
            S++EA+DLIL GAQ+LVVPI+        R+K              IH G      +FCW
Sbjct: 120  SLVEAIDLILQGAQNLVVPIKTRLSSSNSRRKQQQKLSATSTGLTTIHKG-----REFCW 174

Query: 922  LTHEDFVRFFLNSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHT 743
            L  ED +RFFL+SI  FSP+P LSID+LG++ + D  T+ ++ P  A L  + RAL+  T
Sbjct: 175  LAQEDIIRFFLSSIGLFSPVPALSIDSLGII-TTDIITIDYNSPASATLGAINRALATQT 233

Query: 742  AVAVVT-EDGRLLGEISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLI 566
            +VAVV  ++G L+GE+SP +L  CDET  VAA I TL+ GDLM++ID  G P D LVR++
Sbjct: 234  SVAVVDGDEGILIGELSPFTLACCDET--VAAAITTLSSGDLMAYIDCGGPPED-LVRVV 290

Query: 565  KSRLKEKGLKQMLELMEDELCXXXXXXXXXXXSEDELGGKQLRR----LRSRCFS--MGR 404
             +RLK +GL+ ML+   +              S DE     L R     RS+ +S  M R
Sbjct: 291  MARLKHRGLEAMLQEFTNSTTSLVSFSTLSSSSSDEESTTTLHRSGKYSRSKSYSARMVR 350

Query: 403  RAEDPEVCHPGSSLVAAMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            RAE   VCHP SSLVA M+QA+AHRV+Y+WV+ E+D SL+GIVTF +ML+VFRE L
Sbjct: 351  RAE-AIVCHPKSSLVAVMIQAIAHRVNYVWVI-EEDCSLVGIVTFCNMLKVFREHL 404


>ref|XP_006449313.1| hypothetical protein CICLE_v10015412mg [Citrus clementina]
            gi|557551924|gb|ESR62553.1| hypothetical protein
            CICLE_v10015412mg [Citrus clementina]
          Length = 412

 Score =  305 bits (781), Expect = 3e-80
 Identities = 179/400 (44%), Positives = 254/400 (63%), Gaps = 21/400 (5%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAVRL+S  VSDLCIGKPA+RSL  S++TV D L+AL+R +  +L+V   D +A +++A 
Sbjct: 1    MAVRLLSVGVSDLCIGKPALRSLSVSSSTVADALSALKRLNESYLSVWSCDHSARKRKAA 60

Query: 1192 S----------------GKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRV 1061
            +                GK+C+ D+I FLC + NL +P  AL+ P+S LLP+  G+VR +
Sbjct: 61   AADIDDDHQDSAACRCIGKVCMVDIISFLCKEENLWNPESALQAPVSVLLPEASGVVRHL 120

Query: 1060 EPQFSILEALDLILDGAQSLVVPIRPVGRKKNIHGGAGGAV---ADFCWLTHEDFVRFFL 890
            EP  S+LEA+DL+L G Q+LV+P+ P G K              +++CWLT ED +R+FL
Sbjct: 121  EPSASLLEAVDLLLGGVQNLVIPL-PAGTKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFL 179

Query: 889  NSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRL 710
            N I   SP P   I++L +V  A  F + +DEP  + +PL+ ++    T+VA+V E+GRL
Sbjct: 180  NCIGLLSPTPNQPINSLNIVDDAGIFAIQYDEPAASAIPLIAQSHISQTSVALVDEEGRL 239

Query: 709  LGEISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQM 530
            +G+ISP S  +CDE   VAA + TL+ GDLM+++D  G P   LVRL+K RL+EK +   
Sbjct: 240  VGDISPFSFNSCDE--KVAAAMVTLSAGDLMAYMD-CGRPPKDLVRLVKQRLEEKSMVGF 296

Query: 529  LELMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSRCFS--MGRRAEDPEVCHPGSSLVA 356
            LELMED+L            S+DE      R +RS  +S  +  R+E   +CHP SSL+A
Sbjct: 297  LELMEDDLEISSGSCSNSSSSDDESSTGSARSVRSGGYSARVVHRSE-AIICHPWSSLMA 355

Query: 355  AMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
             ++QALA RVSY+WVV E+D +L+GIVTF  MLRV R++L
Sbjct: 356  VIMQALARRVSYVWVV-EEDCTLVGIVTFTGMLRVIRDRL 394


>gb|ESW21594.1| hypothetical protein PHAVU_005G083300g [Phaseolus vulgaris]
          Length = 393

 Score =  305 bits (780), Expect = 4e-80
 Identities = 184/397 (46%), Positives = 254/397 (63%), Gaps = 18/397 (4%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV  ++ +VSDLC+GKP +RSL  +AATV D LAAL+  D   ++V +      E+   
Sbjct: 1    MAVSFLARDVSDLCLGKPPLRSLS-TAATVADALAALKNTDESFISVWLCCEHEKEQELC 59

Query: 1192 S--GKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSILEALDLIL 1019
               GK+C+ DVIC+L  + NL SP+ AL +P+S +LPK   LV  ++P  S+LEA+DLIL
Sbjct: 60   RCVGKVCMVDVICYLSKEDNLLSPSSALNQPISVILPKDCSLVVHLQPSSSLLEAIDLIL 119

Query: 1018 DGAQSLVVPIRPVGR----KKNIHGGAGGAV------ADFCWLTHEDFVRFFLNSIAHFS 869
             GAQ+LVVPI P  R    ++  H  A   V       +FCWLT ED +RF L SI  F+
Sbjct: 120  QGAQNLVVPILPTKRSGVSRRKQHLKASSTVINSHNGGEFCWLTQEDVIRFLLGSIGVFT 179

Query: 868  PIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEISPA 689
            P+P LS+D+LG++ S+D   + +  P  + +  + ++L+  T+VA+V  DG  +GEISP 
Sbjct: 180  PLPALSLDSLGII-SSDVLAIDYFSPASSAVGAISKSLAQQTSVAIVDSDGTFIGEISPF 238

Query: 688  SLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELMEDE 509
            +L  CDET  VAA IATL+ GDLM++ID  G P D LVR++K+RLKEK L++ML+     
Sbjct: 239  TLACCDET--VAAAIATLSAGDLMAYIDCGGPPED-LVRVVKARLKEKSLEKMLQEYTVL 295

Query: 508  LCXXXXXXXXXXXSEDELGGKQLRR----LRSRCFS--MGRRAEDPEVCHPGSSLVAAMV 347
                         S++E   + + R     RS  +S  M R+AE   VCHP SSL+A M+
Sbjct: 296  SSCESLHSAFSSSSDEESPTRTMTRSGRYSRSSSYSARMVRKAE-AIVCHPKSSLIAVMI 354

Query: 346  QALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            QA+AHRV+YLWV+ EDD SL+GIVTF+DML+VFRE L
Sbjct: 355  QAIAHRVNYLWVI-EDDCSLVGIVTFSDMLKVFREHL 390


>ref|XP_006662481.1| PREDICTED: CBS domain-containing protein CBSX5-like [Oryza
            brachyantha]
          Length = 321

 Score =  303 bits (776), Expect = 1e-79
 Identities = 170/304 (55%), Positives = 213/304 (70%), Gaps = 3/304 (0%)
 Frame = -2

Query: 1138 LASPAVALERPMSALLPK-GGGLVRRVEPQFSILEALDLILDGAQSLVVPIRPVGRKKNI 962
            ++ PA AL +P+SALLPK G G VRRV+P+ S+LEA+D +L GAQ L VP+R  GR+K +
Sbjct: 20   VSHPAAALSKPVSALLPKDGAGEVRRVDPRASVLEAIDAVLSGAQVLAVPLRSGGRRKQL 79

Query: 961  HGGAGGAVADFCWLTHEDFVRFFLNSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLA 782
             GG GG   DFCWLT ED VR+FLNSI+ FS +   SI +LGLVR+ D  +V   E  L+
Sbjct: 80   GGGGGGGGGDFCWLTQEDLVRYFLNSISLFSHVAGRSISSLGLVRADDVLSVRPHEAALS 139

Query: 781  LLPLVRRALSGHTAVAVVTEDGRLLGEISPASLCACDETISVAAGIATLTVGDLMSFIDY 602
             +PL+RRA++  TAVAVV + G L+GEISPA L +CDET   AA +ATL+V DLM++IDY
Sbjct: 140  AVPLLRRAIATETAVAVVDDYGHLVGEISPALLASCDET--AAAAVATLSVADLMAYIDY 197

Query: 601  FGSPSDSLVRLIKSRLKEKGLKQMLELMEDELCXXXXXXXXXXXSEDELGGKQLRRLRSR 422
            FGSP + + R IK+ LK KGL  MLEL+E+E             S+DE  G+  +  R  
Sbjct: 198  FGSPPEHISRAIKAGLKSKGLDAMLELVENE-AVSSFAFSSSSSSDDEAHGRAAKLRRPS 256

Query: 421  CFSMGRRA-EDPEVCHPGSSLVAAMVQALAHRVSYLWVVDE-DDYSLMGIVTFADMLRVF 248
              S GRR+ E+P VC P SSLVA M+QALAHR SYLWV+DE DD  L GIVTF D+LRVF
Sbjct: 257  SGSYGRRSTEEPVVCSPASSLVAVMMQALAHRASYLWVLDEDDDCRLAGIVTFVDVLRVF 316

Query: 247  REQL 236
            REQL
Sbjct: 317  REQL 320


>gb|EOY28290.1| CBS domain-containing protein [Theobroma cacao]
          Length = 397

 Score =  303 bits (776), Expect = 1e-79
 Identities = 181/399 (45%), Positives = 254/399 (63%), Gaps = 20/399 (5%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDR-------- 1217
            MAV L+  EVSDLC+GKPA+RSL  SA TVG  L+ L+R    +++V   D         
Sbjct: 1    MAVSLLEREVSDLCLGKPALRSLSISA-TVGHALSVLKRFGDNYISVWNCDHRHLPDADK 59

Query: 1216 --AAPEKRAVSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRRVEPQFSI 1043
              A  E+    GK+C+ D+ICFLC + NL++P  AL+ P+S L+PK  GL+R +EP  S+
Sbjct: 60   TDAGFEECRCVGKVCMVDIICFLCKEENLSNPGTALQAPVSVLIPKVPGLIRHLEPNASL 119

Query: 1042 LEALDLILDGAQSLVVPIR---PVGRKKNIHGGAGGAV----ADFCWLTHEDFVRFFLNS 884
            +EA+DLIL+GAQ+LV+P+       RKK +      +      ++CWLT ED +R+ LNS
Sbjct: 120  VEAMDLILEGAQNLVIPLESGTTNSRKKLLQITLSNSTLHNNREYCWLTQEDIIRYLLNS 179

Query: 883  IAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLG 704
            I  FSP P   I++L ++ + +   V +D+P    LP + ++L   T+VA+V  DG+L+G
Sbjct: 180  IGLFSPTPVNPINSLNIIDTQNILAVHYDDPASLALPFIAQSLEMQTSVAIVDTDGKLIG 239

Query: 703  EISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLE 524
            EISP +L +C E   VAA IATL+ GDLM++ID  G P D L++L+K RL+E+ L+Q LE
Sbjct: 240  EISPFTLNSCGE--DVAAAIATLSAGDLMAYIDCGGRPED-LIQLVKERLQERNLEQALE 296

Query: 523  LMEDE---LCXXXXXXXXXXXSEDELGGKQLRRLRSRCFSMGRRAEDPEVCHPGSSLVAA 353
            LME++                S++E G  +  RL      + RR+E   VC+P SSLVA 
Sbjct: 297  LMEEDSGISSGASFSSSYSSSSDEEFGVGRGGRLGGYSARLVRRSE-AIVCYPWSSLVAV 355

Query: 352  MVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            M+QALAHRVSY+WVV EDD +L GIVTFA M++VFRE+L
Sbjct: 356  MIQALAHRVSYVWVV-EDDGTLAGIVTFAGMMKVFRERL 393


>ref|XP_006451922.1| hypothetical protein CICLE_v10008537mg [Citrus clementina]
            gi|568820423|ref|XP_006464720.1| PREDICTED: CBS
            domain-containing protein CBSX5-like [Citrus sinensis]
            gi|557555148|gb|ESR65162.1| hypothetical protein
            CICLE_v10008537mg [Citrus clementina]
          Length = 397

 Score =  302 bits (773), Expect = 3e-79
 Identities = 187/400 (46%), Positives = 255/400 (63%), Gaps = 21/400 (5%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAPEKRAV 1193
            MAV L+++EVSDLC+GKPA+R+L  SAA + D L+AL+  D   ++V   +  +  ++  
Sbjct: 1    MAVSLLADEVSDLCLGKPALRALSISAA-IADALSALKNSDESFISVWDCNCCSNHRKPG 59

Query: 1192 S-------GKLCVADVICFLCSDGNLASPAVALERPMSALLPKG-GGLVRRVEPQFSILE 1037
                    GK+C+ DVIC+LC D N  SP++AL++P+S LLP+    LV  VEP  S+LE
Sbjct: 60   DACECQCVGKVCMVDVICYLCKDSNSLSPSLALKQPVSVLLPQLLPPLVMHVEPSCSLLE 119

Query: 1036 ALDLILDGAQSLVVPIR---PVGRKKNIHGGAGGAV----ADFCWLTHEDFVRFFLNSIA 878
            A+DL+L GAQ+LVVPI+    + RK+     +         +FCWLT ED +RF L+SI+
Sbjct: 120  AMDLMLGGAQNLVVPIKNRLSIKRKQQQKLSSSSLTNHNGREFCWLTQEDIIRFILSSIS 179

Query: 877  HFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLGEI 698
             FSPIP LSID+LG++ S D   V +  P    L  + R+L   T+VAVV  DG L+GEI
Sbjct: 180  LFSPIPALSIDSLGII-STDVVAVDYHSPASLALEAISRSLFDQTSVAVVDSDGFLIGEI 238

Query: 697  SPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLELM 518
            SP++L  CDET  VAA I TL+ GDLM++ID  G P D LVR++K RLK+KGL+ MLE  
Sbjct: 239  SPSTLGCCDET--VAAAITTLSAGDLMAYIDCGGPPED-LVRVVKERLKDKGLEGMLEHF 295

Query: 517  EDELCXXXXXXXXXXXSEDELGGKQLRR----LRSRCFS--MGRRAEDPEVCHPGSSLVA 356
            +                E+     +L R     RS  +S  M RRAE   VCHP SSL+A
Sbjct: 296  DMSSSLMPYLSTSSSSDEESTATSKLTRSGKHSRSMSYSARMVRRAE-AIVCHPTSSLMA 354

Query: 355  AMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
             M+QA+AHRV+Y+WV+ EDD +L GIVTF+D+L+VFR+ L
Sbjct: 355  VMIQAIAHRVTYVWVI-EDDCTLTGIVTFSDLLKVFRKHL 393


>gb|EOY12875.1| Cystathionine beta-synthase family protein isoform 1 [Theobroma
            cacao]
          Length = 406

 Score =  301 bits (772), Expect = 4e-79
 Identities = 192/415 (46%), Positives = 251/415 (60%), Gaps = 36/415 (8%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRAAP----- 1208
            MAVRL+S+E+SDLC+GKPA+RSL  ++ T+ D +  L+  D   ++V   +  A      
Sbjct: 1    MAVRLLSHELSDLCLGKPALRSLSITS-TIADAVEVLKTSDENFVSVWSCNHKAKTASGF 59

Query: 1207 ------------EKRAVSGKLCVADVICFLCSDGNLASPAVALERPMSALLPKGGGLVRR 1064
                        E R V GK+C+ DVIC+LC D NL SP+VAL+ P+S LLPK   LV  
Sbjct: 60   ESAARFSDDDDDECRCV-GKVCMVDVICYLCKDENLVSPSVALKEPVSVLLPKIPDLVMH 118

Query: 1063 VEPQFSILEALDLILDGAQSLVVPIRPVGRKK-----------NIHGGAGGAVADFCWLT 917
            VEP  S+LEA+DLIL GAQ+LVVPI+     K            IH G      +FCWLT
Sbjct: 119  VEPSCSLLEAVDLILQGAQNLVVPIKTKLSNKRKQQQKPSPTVTIHKG-----REFCWLT 173

Query: 916  HEDFVRFFLNSIAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAV 737
             ED +RF L+SI  FSPIP  SID+LG++ + D  T+ +  P  A    + RAL   T+V
Sbjct: 174  QEDVIRFLLSSIGLFSPIPAFSIDSLGII-NPDILTIEYHSPASAATGAISRALVDQTSV 232

Query: 736  AVVTEDGRLLGEISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSR 557
            AVV  +G L+GEISP +L  CDET  VAA + TL+ GDLM++ID  G P D LVR++ +R
Sbjct: 233  AVVDSEGTLIGEISPFTLACCDET--VAAALKTLSSGDLMAYIDCGGPPED-LVRVVTAR 289

Query: 556  LKEKGLKQMLELMEDELCXXXXXXXXXXXSEDELGG------KQLRRLRSRCFS--MGRR 401
            LKE+ L  MLE     +             E+ +        +  R  RS  +S  M RR
Sbjct: 290  LKERNLNGMLEHFTMSMSSGGFSSASSSSDEESMTAPVSPLPRSGRHSRSMSYSARMVRR 349

Query: 400  AEDPEVCHPGSSLVAAMVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            AE   VCHP SSLVA M+QA+AHRV+Y+WV+ EDD SL+GIVTF+D+L+VFRE L
Sbjct: 350  AE-AIVCHPKSSLVAVMIQAIAHRVNYVWVI-EDDCSLVGIVTFSDILKVFREHL 402


>ref|XP_002519040.1| conserved hypothetical protein [Ricinus communis]
            gi|223541703|gb|EEF43251.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 396

 Score =  300 bits (769), Expect = 8e-79
 Identities = 177/399 (44%), Positives = 258/399 (64%), Gaps = 20/399 (5%)
 Frame = -2

Query: 1372 MAVRLVSNEVSDLCIGKPAVRSLPPSAATVGDVLAALRRGDAPHLAVLVVDRA------- 1214
            MAV L++ EVSDLC+GKPA+RSL  SA TV + L+ L+R D P+L+V   D         
Sbjct: 1    MAVSLLAREVSDLCLGKPALRSLSVSA-TVAEALSGLKRSDDPYLSVWSCDHNNKKNYIN 59

Query: 1213 ----APEKRAVSGKLCVADVICFLCSDGNLASPAVALERPMSALL-PKGGGLVRRVEPQF 1049
                + E R + GK+C+ D+I FLC + NL +   AL+ P+S++L  K  GLVR +EP  
Sbjct: 60   NNNNSNECRCI-GKICMVDIISFLCKEENLKNLPRALQEPLSSVLVSKVYGLVRPLEPHA 118

Query: 1048 SILEALDLILDGAQSLVVPIR-PVGRKKNIHGGAGGAVA----DFCWLTHEDFVRFFLNS 884
            S+LEA+DLIL+GAQ+LV+P+  P  RKK IH  +  +      ++CWLT ED +R+ LN 
Sbjct: 119  SLLEAIDLILEGAQNLVIPVHSPFTRKKLIHRTSSYSTLHNNREYCWLTQEDIIRYLLNC 178

Query: 883  IAHFSPIPTLSIDALGLVRSADAFTVCHDEPGLALLPLVRRALSGHTAVAVVTEDGRLLG 704
            I  FSPIP  ++++L ++ +     VC+DEP  + LPL+ ++L   T+VA++  +G+L+G
Sbjct: 179  IGLFSPIPNHTVESLNIIDTESILAVCYDEPASSALPLISQSLVKQTSVAILDIEGKLIG 238

Query: 703  EISPASLCACDETISVAAGIATLTVGDLMSFIDYFGSPSDSLVRLIKSRLKEKGLKQMLE 524
            EISP +L +CDE   VAA IATL+ G+LM+++D  G P + L+RL+K RL+E+ L+ +LE
Sbjct: 239  EISPYTLNSCDEL--VAAAIATLSAGELMAYVD-CGDPPEDLIRLVKERLEERNLEIVLE 295

Query: 523  LMEDE---LCXXXXXXXXXXXSEDELGGKQLRRLRSRCFSMGRRAEDPEVCHPGSSLVAA 353
            LME+E                S++E G  +    R     + RR  D  VC P SSLVA 
Sbjct: 296  LMEEESGISSSSSSFSSFSSSSDEEFGLGKSGSFRGHSTRVARRT-DAIVCFPWSSLVAV 354

Query: 352  MVQALAHRVSYLWVVDEDDYSLMGIVTFADMLRVFREQL 236
            M+QA++HR SY+WVV+ED  +L+G+VTF  ML+VFRE++
Sbjct: 355  MIQAISHRASYVWVVEEDG-TLVGVVTFTGMLKVFRERV 392

