BLASTX nr result

ID: Rehmannia23_contig00010558 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00010558
         (1405 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006365887.1| PREDICTED: uncharacterized protein LOC102604...   418   e-114
ref|XP_004232753.1| PREDICTED: uncharacterized protein LOC101265...   415   e-113
ref|XP_004234468.1| PREDICTED: uncharacterized protein LOC101251...   414   e-113
ref|XP_006343279.1| PREDICTED: uncharacterized protein LOC102586...   413   e-112
ref|XP_002317274.1| hypothetical protein POPTR_0011s03350g [Popu...   404   e-110
ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252...   390   e-106
gb|EMJ12793.1| hypothetical protein PRUPE_ppa005423mg [Prunus pe...   386   e-104
ref|XP_006467958.1| PREDICTED: uncharacterized protein LOC102621...   385   e-104
ref|XP_006300468.1| hypothetical protein CARUB_v10020270mg [Caps...   384   e-104
gb|EOY28541.1| Sulfite exporter TauE/SafE family protein, putati...   383   e-103
ref|XP_006449135.1| hypothetical protein CICLE_v10015151mg [Citr...   382   e-103
gb|EXB51986.1| hypothetical protein L484_019763 [Morus notabilis]     380   e-103
emb|CAA10487.1| hypothetical protein [Arabidopsis thaliana]           380   e-102
ref|NP_176367.1| Sulfite exporter TauE/SafE family protein [Arab...   380   e-102
ref|XP_002518567.1| conserved hypothetical protein [Ricinus comm...   379   e-102
ref|XP_004485660.1| PREDICTED: uncharacterized protein LOC101511...   375   e-101
ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arab...   375   e-101
ref|XP_006594653.1| PREDICTED: uncharacterized protein LOC100790...   374   e-101
ref|XP_003531000.1| PREDICTED: uncharacterized protein LOC100806...   372   e-100
ref|NP_001242649.1| uncharacterized protein LOC100803518 precurs...   372   e-100

>ref|XP_006365887.1| PREDICTED: uncharacterized protein LOC102604949 [Solanum tuberosum]
          Length = 462

 Score =  418 bits (1074), Expect = e-114
 Identities = 223/373 (59%), Positives = 259/373 (69%), Gaps = 15/373 (4%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFAKR---GAKPLIDFDIALLSEPCMLLGV 360
            TI+AG+DLK ASS+SAFMVTGGSIANV    F K    G K LIDFDIALLSEPCMLLGV
Sbjct: 96   TIIAGVDLKTASSFSAFMVTGGSIANVVTNMFVKSTKYGGKILIDFDIALLSEPCMLLGV 155

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN---- 528
            S GVI N V PEWLI+I FA+FLG CTFKTCKSG  +WKLESE     NG  ++EN    
Sbjct: 156  SIGVICNRVLPEWLITILFALFLGFCTFKTCKSGFFYWKLESE---LENGKQELENGLLK 212

Query: 529  EETCGESTKTPLLRGEMAEGKLG----IPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIH 696
             E+C E  +  L   E  EGK G    IPW+++GMLV+ WFSFF LYLLRGN+YGQGII 
Sbjct: 213  NESCDEDDEALL---EKKEGKGGVVSNIPWMKMGMLVMFWFSFFFLYLLRGNQYGQGIIP 269

Query: 697  IEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDETRIHS----LVFPIM 864
            +EACG GYW            FT+WIL++R+S+     ++  G     H     L+FPIM
Sbjct: 270  MEACGVGYWIISSVQFPMAIIFTSWILYNRESQQNMPSKKPEGTSETKHGPSGKLIFPIM 329

Query: 865  AXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHI 1044
            A                  SPLL+Q+GI PEVTAATCSFMV FSSTMSA+QYL LGMEH+
Sbjct: 330  ALLAGVLGGVFGIGGGMLISPLLIQVGITPEVTAATCSFMVFFSSTMSAVQYLFLGMEHV 389

Query: 1045 YGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRD 1224
              AL FA +C +AS++GL +VQRAI  HGRASLIVFSVG VMALST+LMTSFGA D+WRD
Sbjct: 390  DDALIFAVVCCIASIIGLVVVQRAIEHHGRASLIVFSVGIVMALSTLLMTSFGAVDIWRD 449

Query: 1225 YTSGKSMGFKKPC 1263
            Y SG  MGFK+PC
Sbjct: 450  YISGNYMGFKQPC 462


>ref|XP_004232753.1| PREDICTED: uncharacterized protein LOC101265296 [Solanum
            lycopersicum]
          Length = 459

 Score =  415 bits (1067), Expect = e-113
 Identities = 221/370 (59%), Positives = 258/370 (69%), Gaps = 12/370 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFAKR---GAKPLIDFDIALLSEPCMLLGV 360
            TI+AG+DLK ASS+SAFMVTGGSIANV    F K    G K LIDFDIALLSEPCMLLGV
Sbjct: 96   TIIAGVDLKTASSFSAFMVTGGSIANVVTNMFVKSPKYGGKILIDFDIALLSEPCMLLGV 155

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRR----NGCLKMEN 528
            S GVI N V PEWLI+I FA+FL  CTFKTCKSG  +WKLESE    +    NG LK   
Sbjct: 156  SIGVICNRVLPEWLITILFALFLCFCTFKTCKSGFFYWKLESESEKGKKELENGLLK--- 212

Query: 529  EETCGESTKTPLLRGEMAEGKLG-IPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEA 705
             E+C E  +  L   E  EG++  IPW+++G+LV+ WFSFF LYLLRGN+YGQGII +EA
Sbjct: 213  NESCDEDDEALL---EKKEGRISNIPWMKMGILVMFWFSFFFLYLLRGNQYGQGIIPMEA 269

Query: 706  CGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDETRIHS----LVFPIMAXX 873
            CG GYW            FT+WIL++R S+     ++  G     H     L+FPIMA  
Sbjct: 270  CGVGYWIISSVQFPLAIIFTSWILYNRGSQQNMPSKKPEGTSETKHGPSGKLIFPIMALL 329

Query: 874  XXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGA 1053
                            SPLL+Q+GI PEVTAATCSFMV FSSTMSA+QYL LGMEH+  A
Sbjct: 330  AGVLGGVFGIGGGMLISPLLIQVGITPEVTAATCSFMVFFSSTMSAVQYLFLGMEHVDDA 389

Query: 1054 LTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTS 1233
            L FA +C +AS++GL +VQRAI  HGRASL+VFSVGTVMALSTVLMTSFGA D+WRDYTS
Sbjct: 390  LIFAVVCCIASIIGLLVVQRAIEHHGRASLMVFSVGTVMALSTVLMTSFGAVDIWRDYTS 449

Query: 1234 GKSMGFKKPC 1263
            G  MGFK+PC
Sbjct: 450  GNYMGFKQPC 459


>ref|XP_004234468.1| PREDICTED: uncharacterized protein LOC101251260 [Solanum
            lycopersicum]
          Length = 455

 Score =  414 bits (1065), Expect = e-113
 Identities = 221/365 (60%), Positives = 264/365 (72%), Gaps = 8/365 (2%)
 Frame = +1

Query: 193  IVAGLDLKIASSYSAFMVTGGSIANVACQFF---AKRGAKPLIDFDIALLSEPCMLLGVS 363
            IVAG+DLK ASS+SAFMVTGGSIANV C  F   AK G K L+DFDIALLS+PC+LLGVS
Sbjct: 101  IVAGVDLKTASSFSAFMVTGGSIANVVCSMFLPSAKHGGKILVDFDIALLSQPCILLGVS 160

Query: 364  CGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETCG 543
             GVI NLVFPEWLI+I FAIFL  CTFKT KSG+ +WK+ESE +MR     K EN E   
Sbjct: 161  IGVICNLVFPEWLITILFAIFLAWCTFKTFKSGIYYWKIESEEVMR-----KKENIE--- 212

Query: 544  ESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAGYW 723
               + PLL+ E  +G   IPW+++G+L++IWFSFF LYLLRGNRYGQGIIH++ACG  YW
Sbjct: 213  -EIEGPLLQKE-EKGIKNIPWMKMGVLIIIWFSFFSLYLLRGNRYGQGIIHMKACGVVYW 270

Query: 724  XXXXXXXXXXXXFTTWILHSRKS-KNTAAIQQENGDETRIHS----LVFPIMAXXXXXXX 888
                        FT+WI ++R++ +N  + +QE   ET+ +      +FP+MA       
Sbjct: 271  IISSVQFPLAIIFTSWIFYNRENHQNLPSKKQEITCETKNNGPSRMFIFPLMALLAGVLG 330

Query: 889  XXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALTFAA 1068
                       SPLL+Q+GI PE+TAATCSFMV FSSTMSA+QYL LGMEH+  AL FA 
Sbjct: 331  GVFGIGGGMLISPLLIQVGITPEITAATCSFMVFFSSTMSAVQYLFLGMEHVNTALIFAI 390

Query: 1069 ICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGKSMG 1248
            IC +ASL+GL +VQRAI  HGRASLIVFSVG VMALSTVL+TSFGA DVW+DYTSGK MG
Sbjct: 391  ICLIASLIGLVVVQRAIEHHGRASLIVFSVGIVMALSTVLITSFGAFDVWKDYTSGKYMG 450

Query: 1249 FKKPC 1263
            FK PC
Sbjct: 451  FKPPC 455


>ref|XP_006343279.1| PREDICTED: uncharacterized protein LOC102586118 [Solanum tuberosum]
          Length = 469

 Score =  413 bits (1061), Expect = e-112
 Identities = 219/365 (60%), Positives = 262/365 (71%), Gaps = 8/365 (2%)
 Frame = +1

Query: 193  IVAGLDLKIASSYSAFMVTGGSIANVACQFF---AKRGAKPLIDFDIALLSEPCMLLGVS 363
            IVAG+DLK ASS+SAFMVTGGSIANV C  F   AK G K L+DFDIALLS+PC+LLGVS
Sbjct: 115  IVAGVDLKTASSFSAFMVTGGSIANVVCSLFLPSAKHGGKILVDFDIALLSQPCILLGVS 174

Query: 364  CGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETCG 543
             GV+ NLVFPEWLI+I FAIFL  CT KT +SG+ +WK+ESE +MRR      EN E   
Sbjct: 175  IGVVCNLVFPEWLITILFAIFLAWCTLKTFRSGIYYWKIESEEVMRRK-----ENFE--- 226

Query: 544  ESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAGYW 723
               + PLL  E  +G   IPW+++G+L++IWFSFF LYLLRGNRYGQGIIH++ACG  YW
Sbjct: 227  -EIEGPLLEKE-EKGIKNIPWMKMGVLIMIWFSFFFLYLLRGNRYGQGIIHMKACGVVYW 284

Query: 724  XXXXXXXXXXXXFTTWILHSRKS-KNTAAIQQENGDETRIHS----LVFPIMAXXXXXXX 888
                        FT+WIL+ R++ +N  + +QE   ET+ +      +FP+MA       
Sbjct: 285  IISSVQFPLAIIFTSWILYKRENHQNLPSKKQEITCETKNNGPSRMFIFPLMALLAGVLG 344

Query: 889  XXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALTFAA 1068
                       SPLL+Q+GI PE+TAATCSFMV FSSTMSA+QYL LGMEH+  AL FA 
Sbjct: 345  GVFGIGGGMLISPLLIQVGITPEITAATCSFMVFFSSTMSAVQYLFLGMEHVNTALIFAI 404

Query: 1069 ICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGKSMG 1248
            +C +ASL+GL +VQRAI  HGRASLIVFSVG VMALSTVL+TSFGA DVWRDYTSGK MG
Sbjct: 405  VCLIASLIGLVVVQRAIEHHGRASLIVFSVGIVMALSTVLITSFGAVDVWRDYTSGKYMG 464

Query: 1249 FKKPC 1263
            FK PC
Sbjct: 465  FKPPC 469


>ref|XP_002317274.1| hypothetical protein POPTR_0011s03350g [Populus trichocarpa]
            gi|222860339|gb|EEE97886.1| hypothetical protein
            POPTR_0011s03350g [Populus trichocarpa]
          Length = 473

 Score =  404 bits (1039), Expect = e-110
 Identities = 216/380 (56%), Positives = 260/380 (68%), Gaps = 22/380 (5%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFF---AKRGAKPLIDFDIALLSEPCMLLGV 360
            TIVA LDLK ASS+SAFMVTGGS+ANV C  F   AK G + L+D+DIA+LSEPCMLLGV
Sbjct: 95   TIVASLDLKTASSFSAFMVTGGSVANVMCNMFTRSAKFGGQTLVDYDIAILSEPCMLLGV 154

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRR------NGCLKM 522
            S GVI NLVFPEWL++I FA+FL   TFKTC++GV  WKLESE + R       NG ++ 
Sbjct: 155  SVGVICNLVFPEWLVTILFAVFLACSTFKTCQNGVFHWKLESEEVNRNESGNLENGLVEY 214

Query: 523  ENEETCGE----STKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGI 690
            E      E    S K PLL  E+    L  PW++LG+L +IWFSF +LYLLRGNRYG+GI
Sbjct: 215  ETSTKESEEVISSVKEPLLGVELTSSVLRFPWMKLGILFIIWFSFSILYLLRGNRYGEGI 274

Query: 691  IHIEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQE---------NGDETRIH 843
            I +E+CG GYW            FT WIL+ ++S     I Q+          G  T  +
Sbjct: 275  IPMESCGFGYWVVSSLQIPLAIMFTAWILYRKESCQHQTINQQLSVKGMEDLTGGGTS-N 333

Query: 844  SLVFPIMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYL 1023
             L+FP+MA                  SPLLL +GI PE+TAATCSFMV FSS+MSA+QYL
Sbjct: 334  KLIFPVMALLAGMLGGVFGIGGGMLISPLLLHVGIAPEITAATCSFMVFFSSSMSALQYL 393

Query: 1024 LLGMEHIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFG 1203
            LLGMEH+  A+  + ICFVASL+GL +VQRAIVK+GRAS+IVFSV TVMALSTVLMTSFG
Sbjct: 394  LLGMEHVDTAIILSVICFVASLLGLLVVQRAIVKYGRASMIVFSVSTVMALSTVLMTSFG 453

Query: 1204 AADVWRDYTSGKSMGFKKPC 1263
            A +VWRDY SG++MGFK PC
Sbjct: 454  ALNVWRDYNSGRNMGFKLPC 473


>ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252710 [Vitis vinifera]
            gi|297741863|emb|CBI33227.3| unnamed protein product
            [Vitis vinifera]
          Length = 458

 Score =  390 bits (1002), Expect = e-106
 Identities = 212/368 (57%), Positives = 253/368 (68%), Gaps = 11/368 (2%)
 Frame = +1

Query: 193  IVAGLDLKIASSYSAFMVTGGSIANVACQFFAK--RGAKPLIDFDIALLSEPCMLLGVSC 366
            IV GLDLK AS++SAFMV GGS AN+ C  F     G K +IDFDIALLSEPC+LLGVS 
Sbjct: 95   IVGGLDLKTASTFSAFMVAGGSTANILCTMFINCIHGGKSVIDFDIALLSEPCLLLGVSI 154

Query: 367  GVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN----EE 534
            GV+ N+VFPEWLI+I F +FL   T KTC+ GV+ WKLESE ++RRNG  ++EN    +E
Sbjct: 155  GVVCNIVFPEWLITILFVVFLSWTTSKTCRKGVVSWKLESE-VIRRNGFGELENGVRRDE 213

Query: 535  TCGE-----STKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHI 699
            + GE     S K PL+ GE+   K+ IPW + G LVVIW SFF+LY+LRG+R GQ II +
Sbjct: 214  SNGENEVIKSLKEPLM-GEVENFKISIPWTKFGALVVIWLSFFLLYILRGDRDGQSIIPM 272

Query: 700  EACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDETRIHSLVFPIMAXXXX 879
            E CG GYW            FT WILH R++ N   I  + G++    +L+FPIMA    
Sbjct: 273  EPCGEGYWILSSLQFPLAITFTAWILHRRETSNQQEILGQTGEKPP--NLIFPIMALLAG 330

Query: 880  XXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALT 1059
                          SPLLL IGI PEVTAATCS MV FSSTMS+ QYLL+GMEH   AL 
Sbjct: 331  ILGGVFGIGGGMLISPLLLHIGIPPEVTAATCSVMVFFSSTMSSFQYLLIGMEHKEVALI 390

Query: 1060 FAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGK 1239
            FA ICF AS++G+ +VQRAI K+GRASLIVFSV TVMALSTVL+TSFGA DVWRDY  G+
Sbjct: 391  FAIICFFASILGVVVVQRAIEKYGRASLIVFSVSTVMALSTVLITSFGAIDVWRDYARGE 450

Query: 1240 SMGFKKPC 1263
             MGFK PC
Sbjct: 451  YMGFKLPC 458


>gb|EMJ12793.1| hypothetical protein PRUPE_ppa005423mg [Prunus persica]
          Length = 462

 Score =  386 bits (992), Expect = e-104
 Identities = 209/367 (56%), Positives = 252/367 (68%), Gaps = 9/367 (2%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVA---CQFFAKRGAKPLIDFDIALLSEPCMLLGV 360
            T+VAGLDL+ ASS SAFMVTGGS+ANV    C+  AK G K +ID+DIALLSEPCMLLGV
Sbjct: 97   TLVAGLDLRTASSLSAFMVTGGSVANVIYNLCKGSAKFGGKNVIDYDIALLSEPCMLLGV 156

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN---E 531
            S GVI NLVFPEWLI+I FA+FL   T  +CK+G+ +WK+ESE +MR N C  + N   +
Sbjct: 157  SVGVICNLVFPEWLITILFALFLAWSTSMSCKNGLAYWKMESEELMR-NDCENLGNGLND 215

Query: 532  ETCGESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACG 711
            ET G       L G   +  L +PW ++G+LV++W SF ++YL RGNRYGQGI  IE CG
Sbjct: 216  ETEGVKGIAEPLLGTKGKCILRLPWTKMGVLVLVWCSFCIIYLFRGNRYGQGITPIEPCG 275

Query: 712  AGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDE---TRIHSLVFPIMAXXXXX 882
             GYW            +T WIL  +++     + Q+N ++    R   L+FP+MA     
Sbjct: 276  IGYWVLSSVQIPLAIIYTAWILCRKENLQHHTLNQKNIEDLPKVRPSKLIFPLMALLAGI 335

Query: 883  XXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALTF 1062
                         SPLL+Q+GI PEVTAATCSFMV FSS+MSA QYLLLGMEH   AL F
Sbjct: 336  LGGVFGIGGGMLISPLLVQVGIAPEVTAATCSFMVFFSSSMSAFQYLLLGMEHADTALVF 395

Query: 1063 AAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGKS 1242
            A +CFVASL+GL ++QRAI  +GRASLIVFSV TVMALSTVLMTSFGA DVWRDY SGK 
Sbjct: 396  AIMCFVASLLGLVVLQRAIKVYGRASLIVFSVSTVMALSTVLMTSFGALDVWRDYVSGKY 455

Query: 1243 MGFKKPC 1263
            MGFK PC
Sbjct: 456  MGFKLPC 462


>ref|XP_006467958.1| PREDICTED: uncharacterized protein LOC102621436 [Citrus sinensis]
          Length = 465

 Score =  385 bits (990), Expect = e-104
 Identities = 212/371 (57%), Positives = 250/371 (67%), Gaps = 13/371 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFAKRGAKPLIDFDIALLSEPCMLLGVSCG 369
            TI AGL+L+ A+S+SAFMVTGGSIANV C      G K  ID+DIALLSEPCMLLGVS G
Sbjct: 96   TISAGLELRTATSFSAFMVTGGSIANVMCNMLGTIGGKSFIDYDIALLSEPCMLLGVSIG 155

Query: 370  VIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN----EET 537
            VI NLVFPEWL+++ FAI L   TFKTC +G  FWKLESE + R + C K+EN    +E 
Sbjct: 156  VICNLVFPEWLVTVLFAILLAWSTFKTCTNGFSFWKLESENLKRDHTCGKIENGIVKDEN 215

Query: 538  CG-----ESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIE 702
            C      +S + PLL  + +  +L  PW++LG+LV++WF F VLYL RGNR GQGII ++
Sbjct: 216  CDGSEGVKSYEEPLLSVDESN-QLSFPWMKLGVLVLVWFCFSVLYLFRGNRDGQGIITMK 274

Query: 703  ACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQEN-GDETR---IHSLVFPIMAX 870
             CG GYW            FT WIL  ++S    A  Q+  GD TR    + L+FP+MA 
Sbjct: 275  PCGVGYWILSSLQIPLAIAFTAWILCRKESTQYHAPNQQGIGDLTRRGTSNKLIFPLMAL 334

Query: 871  XXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYG 1050
                             SPLLLQIG  PEVTAATCSFMV FSSTMSA QYLLLGME    
Sbjct: 335  LAGILGGLFGIGGGMLISPLLLQIGTAPEVTAATCSFMVFFSSTMSAFQYLLLGMEQSGT 394

Query: 1051 ALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYT 1230
            AL FA +CFVASL+GL +VQ+AI + GRASLIVFSV  VMALSTVL+TSF A D+WRDYT
Sbjct: 395  ALIFAIVCFVASLLGLLVVQKAIQEFGRASLIVFSVSIVMALSTVLITSFEALDIWRDYT 454

Query: 1231 SGKSMGFKKPC 1263
            SG  MGFK PC
Sbjct: 455  SGNYMGFKFPC 465


>ref|XP_006300468.1| hypothetical protein CARUB_v10020270mg [Capsella rubella]
            gi|482569178|gb|EOA33366.1| hypothetical protein
            CARUB_v10020270mg [Capsella rubella]
          Length = 457

 Score =  384 bits (986), Expect = e-104
 Identities = 211/368 (57%), Positives = 247/368 (67%), Gaps = 10/368 (2%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLK ASS+SAFMVTGGSIANV C  F    K G K LIDFD+ALL EPCMLLGV
Sbjct: 93   TIVAGLDLKTASSFSAFMVTGGSIANVGCNLFLRNPKSGGKTLIDFDLALLLEPCMLLGV 152

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETC 540
            S GVI NLVFP WLI+I FA+FL   T KT  +GV +W+LESE    R    + + EE  
Sbjct: 153  SIGVICNLVFPNWLITILFAVFLAWSTLKTFGNGVYYWRLESEMAKIRESSQEEDKEEEK 212

Query: 541  GESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAGY 720
             ES K PLL  E  +     PW++LG+LV+IW S+F +YLLRGN+YG+GII IE CG  Y
Sbjct: 213  VESLKLPLLDYERPKR---FPWMKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGITY 269

Query: 721  WXXXXXXXXXXXXFTTWILHSR--KSKNTAAIQQE-----NGDETRIHSLVFPIMAXXXX 879
            W            FT WI  S   +S+ ++A  ++       D  + +  +FP+MA    
Sbjct: 270  WLLSSTQIPLTLFFTLWICFSDNVQSQQSSACAKDVEDLRPNDGAQSNKCMFPVMALLAG 329

Query: 880  XXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALT 1059
                          SPLLLQ+GI PEVTAATCSFMVLFSSTMSAIQYLLLGMEH   A  
Sbjct: 330  VLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGTASI 389

Query: 1060 FAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGK 1239
            FA ICFVASLVGL +VQ+ I ++GRAS+IVFSVG VMALS VLMTS+GA DVW DY SG+
Sbjct: 390  FAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVWNDYVSGR 449

Query: 1240 SMGFKKPC 1263
             MGFK PC
Sbjct: 450  YMGFKLPC 457


>gb|EOY28541.1| Sulfite exporter TauE/SafE family protein, putative [Theobroma cacao]
          Length = 456

 Score =  383 bits (983), Expect = e-103
 Identities = 208/365 (56%), Positives = 249/365 (68%), Gaps = 7/365 (1%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQF---FAKRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLK AS++SAFMV GGS ANV        +K G K LID+D+ALLSEPCMLLGV
Sbjct: 98   TIVAGLDLKTASTFSAFMVAGGSTANVIYNLRTTSSKFGGKTLIDYDVALLSEPCMLLGV 157

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETC 540
            S GV+ NLVFPEWLI+I FA FL   TFKTC++G+  WK+ESE    RN C K+EN  TC
Sbjct: 158  SVGVVCNLVFPEWLITILFATFLAWSTFKTCRNGIGLWKMESEHQETRNRCEKVENG-TC 216

Query: 541  GESTKTPLLRGEMA--EGKLG--IPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEAC 708
            GES +   L   +   EGK+    PW +L +LV++WFSFFV+YLLRGNRYGQG++ ++ C
Sbjct: 217  GESGEINDLEEPLVSTEGKVKSRFPWKKLVVLVMVWFSFFVIYLLRGNRYGQGVMPMKPC 276

Query: 709  GAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDETRIHSLVFPIMAXXXXXXX 888
            G GYW            FT WIL     K   AI  +  ++  ++ L+FP MA       
Sbjct: 277  GVGYWTLSLFQMPLAIAFTAWIL-----KRKEAIACQGPNKQGVNKLIFPFMALLAGGLG 331

Query: 889  XXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYGALTFAA 1068
                       SPLLL +G+ PEVTAATCSFMV FSSTMSA QYLLLGM+    AL F+ 
Sbjct: 332  GVFGIGGGMLISPLLLHVGVAPEVTAATCSFMVFFSSTMSAFQYLLLGMKQTGTALIFSV 391

Query: 1069 ICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYTSGKSMG 1248
            ICFVASL+GL +VQ+AI + GRASLIVFSVG VMALS +LMTSFGA +VW DYTSG  MG
Sbjct: 392  ICFVASLLGLVVVQKAIRELGRASLIVFSVGIVMALSAILMTSFGALNVWDDYTSGSYMG 451

Query: 1249 FKKPC 1263
            FK+PC
Sbjct: 452  FKQPC 456


>ref|XP_006449135.1| hypothetical protein CICLE_v10015151mg [Citrus clementina]
            gi|557551746|gb|ESR62375.1| hypothetical protein
            CICLE_v10015151mg [Citrus clementina]
          Length = 465

 Score =  382 bits (982), Expect = e-103
 Identities = 209/371 (56%), Positives = 251/371 (67%), Gaps = 13/371 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFAKRGAKPLIDFDIALLSEPCMLLGVSCG 369
            TI+AGL+L+ A+S+SAFMVTGGSIANV C      G K  ID+DIALLSEPCMLLGVS G
Sbjct: 96   TILAGLELRTATSFSAFMVTGGSIANVMCNMLGTIGGKSFIDYDIALLSEPCMLLGVSIG 155

Query: 370  VIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN----EET 537
            VI NLVFPEWL+++ FAIFL   TFKTC +G   WKLESE + R + C K+EN    ++ 
Sbjct: 156  VICNLVFPEWLVTVLFAIFLAWSTFKTCTNGFSLWKLESENLKRDHTCGKIENGIVKDDN 215

Query: 538  CG-----ESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIE 702
            C      +S + PLL  + +  +L  PW++LG+LV++WF F VLYL RGNR GQGII ++
Sbjct: 216  CDGSEGVKSYEEPLLSVDESN-QLSFPWMKLGVLVLVWFCFSVLYLFRGNRDGQGIITMK 274

Query: 703  ACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQEN-GDETR---IHSLVFPIMAX 870
             CG GYW            FT WIL  ++S    A  Q+  G+ TR    + L+FP+MA 
Sbjct: 275  PCGVGYWILSSLQIPLAIAFTAWILCRKESTQYHAPNQQGIGNLTRRGTSNKLIFPLMAL 334

Query: 871  XXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYG 1050
                             SPLLLQIG  P+VTAATCSFMV FSSTMSA QYLLLGME    
Sbjct: 335  LAGILGGLFGIGGGMLISPLLLQIGTAPKVTAATCSFMVFFSSTMSAFQYLLLGMEQSGT 394

Query: 1051 ALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYT 1230
            AL FA +CFVASL+GL +VQ+AI + GRASLIVFSV  VMALSTVL+TSF A D+WRDYT
Sbjct: 395  ALIFAIVCFVASLLGLLVVQKAIQEFGRASLIVFSVSIVMALSTVLITSFEALDIWRDYT 454

Query: 1231 SGKSMGFKKPC 1263
            SG  MGFK PC
Sbjct: 455  SGNYMGFKLPC 465


>gb|EXB51986.1| hypothetical protein L484_019763 [Morus notabilis]
          Length = 465

 Score =  380 bits (976), Expect = e-103
 Identities = 205/371 (55%), Positives = 252/371 (67%), Gaps = 13/371 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TI AG+DLK ASS+SAFMVTG SIANV C   +   +   K LIDFDIALLS+P MLLGV
Sbjct: 95   TITAGVDLKTASSFSAFMVTGASIANVVCNLASAIPRFAGKSLIDFDIALLSQPSMLLGV 154

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN---E 531
            S GVI N++FPEWL++I FA+FL   T KT ++GV+ WKLE+E    R GC ++EN   E
Sbjct: 155  SVGVICNVMFPEWLVTILFAVFLAWSTSKTWRNGVMCWKLETEVKRLRKGCGELENGLSE 214

Query: 532  ETCGESTKTPLLRGE--MAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEA 705
            +    S K PLL  +  + + +  +PW++LG++V+IW SF  +YLLRGNRYG+GI  IE 
Sbjct: 215  DVKLSSIKEPLLGSDHQLRKCQYRLPWMKLGVVVLIWCSFCFVYLLRGNRYGEGITPIEP 274

Query: 706  CGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDET-----RIHSLVFPIMAX 870
            CG GYW            +T WIL  + S     + Q+N  E      +   L+FP+MA 
Sbjct: 275  CGVGYWLLSSFQIPLAIVYTAWILCRKDSSQNQNLNQKNVMENLTQVGQSKKLIFPLMAL 334

Query: 871  XXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIYG 1050
                             SPLLLQ+G+ PEVTAATCSFMV FSSTMSA QYL LGMEH   
Sbjct: 335  LAGMLGGVFGIGGGMLISPLLLQVGVAPEVTAATCSFMVFFSSTMSAFQYLFLGMEHTDV 394

Query: 1051 ALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDYT 1230
            AL FAA+CFVASL+GL +VQ+AI ++GRASLIVFSVG VMALSTVLMT+FGA DVWRDY 
Sbjct: 395  ALIFAAVCFVASLLGLVVVQKAIQEYGRASLIVFSVGIVMALSTVLMTTFGALDVWRDYI 454

Query: 1231 SGKSMGFKKPC 1263
            SG+ MGFK+PC
Sbjct: 455  SGEYMGFKRPC 465


>emb|CAA10487.1| hypothetical protein [Arabidopsis thaliana]
          Length = 389

 Score =  380 bits (975), Expect = e-102
 Identities = 210/375 (56%), Positives = 247/375 (65%), Gaps = 17/375 (4%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLK ASS+SAFMVTGGSIANV C  F    K G K LIDFD+ALL EPCMLLGV
Sbjct: 20   TIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGV 79

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGM-MRRNGCLKMENEET 537
            S GVI NLVFP WLI+  FA+FL   T KT  +G+ +W+LESE + +R +  ++ ++EE 
Sbjct: 80   SIGVICNLVFPNWLITSLFAVFLAWSTLKTFGNGLYYWRLESEMVKIRESNRIEEDDEED 139

Query: 538  CGESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAG 717
              ES K PLL       +   PW++LG+LV+IW S+F +YLLRGN+YG+GII IE CG  
Sbjct: 140  KIESLKLPLLEDYQRPKRF--PWIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNA 197

Query: 718  YWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQEN-------------GDETRIHSLVFP 858
            YW            FT WI  S    N  + QQ +              D  R +  +FP
Sbjct: 198  YWLISSSQIPLTLFFTLWICFS---DNVQSQQQSDYHVSVKDVEDLRSNDGARSNKCMFP 254

Query: 859  IMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGME 1038
            +MA                  SPLLLQ+GI PEVTAATCSFMVLFSSTMSAIQYLLLGME
Sbjct: 255  VMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGME 314

Query: 1039 HIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVW 1218
            H   A  FA ICFVASLVGL +VQ+ I ++GRAS+IVFSVG VMALS VLMTS+GA DVW
Sbjct: 315  HTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVW 374

Query: 1219 RDYTSGKSMGFKKPC 1263
             DY SG+ MGFK PC
Sbjct: 375  NDYVSGRYMGFKLPC 389


>ref|NP_176367.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
            gi|13272465|gb|AAK17171.1|AF325103_1 unknown protein
            [Arabidopsis thaliana] gi|4508075|gb|AAD21419.1| Unknown
            protein [Arabidopsis thaliana]
            gi|111074464|gb|ABH04605.1| At1g61740 [Arabidopsis
            thaliana] gi|332195760|gb|AEE33881.1| Sulfite exporter
            TauE/SafE family protein [Arabidopsis thaliana]
          Length = 458

 Score =  380 bits (975), Expect = e-102
 Identities = 210/375 (56%), Positives = 247/375 (65%), Gaps = 17/375 (4%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLK ASS+SAFMVTGGSIANV C  F    K G K LIDFD+ALL EPCMLLGV
Sbjct: 89   TIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGV 148

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGM-MRRNGCLKMENEET 537
            S GVI NLVFP WLI+  FA+FL   T KT  +G+ +W+LESE + +R +  ++ ++EE 
Sbjct: 149  SIGVICNLVFPNWLITSLFAVFLAWSTLKTFGNGLYYWRLESEMVKIRESNRIEEDDEED 208

Query: 538  CGESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAG 717
              ES K PLL       +   PW++LG+LV+IW S+F +YLLRGN+YG+GII IE CG  
Sbjct: 209  KIESLKLPLLEDYQRPKRF--PWIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNA 266

Query: 718  YWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQEN-------------GDETRIHSLVFP 858
            YW            FT WI  S    N  + QQ +              D  R +  +FP
Sbjct: 267  YWLISSSQIPLTLFFTLWICFS---DNVQSQQQSDYHVSVKDVEDLRSNDGARSNKCMFP 323

Query: 859  IMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGME 1038
            +MA                  SPLLLQ+GI PEVTAATCSFMVLFSSTMSAIQYLLLGME
Sbjct: 324  VMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGME 383

Query: 1039 HIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVW 1218
            H   A  FA ICFVASLVGL +VQ+ I ++GRAS+IVFSVG VMALS VLMTS+GA DVW
Sbjct: 384  HTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVW 443

Query: 1219 RDYTSGKSMGFKKPC 1263
             DY SG+ MGFK PC
Sbjct: 444  NDYVSGRYMGFKLPC 458


>ref|XP_002518567.1| conserved hypothetical protein [Ricinus communis]
            gi|223542412|gb|EEF43954.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 463

 Score =  379 bits (972), Expect = e-102
 Identities = 214/372 (57%), Positives = 249/372 (66%), Gaps = 14/372 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA-KRGAKPLIDFDIALLSEPCMLLGVSC 366
            TIVAGLDLK ASS+SAFMVTGGSIANV C  F+ K G K LID+DIALLSEPCMLLGVS 
Sbjct: 94   TIVAGLDLKTASSFSAFMVTGGSIANVLCNLFSPKFGGKALIDYDIALLSEPCMLLGVSV 153

Query: 367  GVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETCGE 546
            GVI NL+FPEWLI++ F +FL   TFKTCK+ V  W LESE + +RNG   +EN      
Sbjct: 154  GVICNLIFPEWLITVLFVLFLVWSTFKTCKNAVAHWNLESEEV-KRNGHGNLENGRVKDR 212

Query: 547  ST---------KTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHI 699
            S+         K PL+  EM E ++   W +LG+LV+IW SF  LYLLRGNRYG+GI  +
Sbjct: 213  SSIGNEEIKIIKEPLMGIEM-ENRMSFTWEKLGVLVLIWLSFSFLYLLRGNRYGEGIAPL 271

Query: 700  EACGAGYWXXXXXXXXXXXXFTTWILHSRK--SKNTAAIQQ-ENGDETRI-HSLVFPIMA 867
            + CG GYW            FT WIL  ++     TA +Q  ++  E R  + L FPIMA
Sbjct: 272  KPCGVGYWVVSSLQIPLAIIFTAWILLKKRHYQNQTANLQDIDDSMEGRAPNKLTFPIMA 331

Query: 868  XXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIY 1047
                              SPLLL +GI PEVTAATCSFMV FSSTMSA QYLL GMEH  
Sbjct: 332  LLAGILGGVFGIGGGMLISPLLLHVGIPPEVTAATCSFMVFFSSTMSAFQYLLSGMEHTD 391

Query: 1048 GALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDY 1227
             AL FA+ICFVASLVGL +VQR I  +GRAS+IVFSV  VMALSTVL+TSFG  DVWR+Y
Sbjct: 392  TALMFASICFVASLVGLLVVQRIIQDYGRASIIVFSVSIVMALSTVLITSFGTIDVWRNY 451

Query: 1228 TSGKSMGFKKPC 1263
             SG +MGFK PC
Sbjct: 452  ESGTNMGFKLPC 463


>ref|XP_004485660.1| PREDICTED: uncharacterized protein LOC101511691 [Cicer arietinum]
          Length = 468

 Score =  375 bits (964), Expect = e-101
 Identities = 213/378 (56%), Positives = 250/378 (66%), Gaps = 20/378 (5%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLKIASS SAFMVTGGS+ANV C  F    K G K LID+DIAL SEPCMLLGV
Sbjct: 92   TIVAGLDLKIASSLSAFMVTGGSVANVICYMFTTSPKFGGKSLIDYDIALSSEPCMLLGV 151

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLK--MEN-- 528
            S GVI NLVFPEWLI++ FAIFL   T KTCKSGV+FW +ESE M R+NG  K  +EN  
Sbjct: 152  SIGVICNLVFPEWLITLMFAIFLAWSTSKTCKSGVMFWNIESEEM-RQNGLEKGLLENGT 210

Query: 529  --EETCG--------ESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRY 678
              EE+ G        +  K  ++  +     + IPWL+L  L+++WFSFF +YLLRGN Y
Sbjct: 211  SEEESKGLVRMLKENDGPKIIVMVPKENSKLMSIPWLKLLALLLVWFSFFSIYLLRGNGY 270

Query: 679  GQGIIHIEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGDETR---IHSL 849
            GQ II IE CG GYW            FT W++  ++S     +  E   + R      L
Sbjct: 271  GQRIIPIEPCGVGYWIISSVQVPLAVVFTAWMVFRKESLQDPTLIPEVQCQNRNSPSKKL 330

Query: 850  VFPIMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLL 1029
            VFP+MA                  SPLLLQ+GI PEVTAATCSFMV FS+TMSA+QYLLL
Sbjct: 331  VFPLMALLAGILGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVFFSATMSALQYLLL 390

Query: 1030 GMEHIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAA 1209
            GMEH+  AL  A +CFVASL+GL +VQ+AI K+GR SLIVFSV  VM+LS VLMTSFGA 
Sbjct: 391  GMEHVQIALILAIMCFVASLLGLLVVQKAIQKYGRPSLIVFSVSIVMSLSVVLMTSFGAI 450

Query: 1210 DVWRDYTSGKSMGFKKPC 1263
             +W DY SGK MGFK PC
Sbjct: 451  KIWGDYKSGKYMGFKPPC 468


>ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
            lyrata] gi|297332353|gb|EFH62771.1| hypothetical protein
            ARALYDRAFT_475155 [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  375 bits (964), Expect = e-101
 Identities = 209/372 (56%), Positives = 247/372 (66%), Gaps = 14/372 (3%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVAGLDLK ASS+SAFMVTGGSIANV C  F    K G K LIDFD+ALL EPCMLLGV
Sbjct: 92   TIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGV 151

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGM-MRRNGCLKMENEET 537
            S GVI NLVFP WLI+  FA+FL   T KT  +G+ +W+LESE + +R +  +  ++EE 
Sbjct: 152  SIGVICNLVFPNWLITSLFAVFLAWSTLKTFGNGLYYWRLESEMVKIRESNRIGEDDEED 211

Query: 538  CGESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRGNRYGQGIIHIEACGAG 717
              ES K PLL  E  E     PW++LG+LV+IW S+F +YLLRGN+YG+GII IE CG  
Sbjct: 212  KIESLKLPLL--EDYERPKRFPWIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNA 269

Query: 718  YWXXXXXXXXXXXXFTTWILHS------RKSKNTAAIQQ----ENGDETRIHSLVFPIMA 867
            YW            FT WI  S      + S    +I+      + D  R +  +FP+MA
Sbjct: 270  YWLISSSQIPLTLFFTLWICFSDNVQSQQPSDYNVSIKDVEDLRSNDGARSNKCMFPVMA 329

Query: 868  XXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQYLLLGMEHIY 1047
                              SPLLLQ+GI PEVTAATCSFMVLFSSTMSAIQYLLLGMEH  
Sbjct: 330  LLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTG 389

Query: 1048 GALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTSFGAADVWRDY 1227
             A  FA ICFVASLVGL +VQ+ I ++GRAS+IVFSV  VMALS VLMTS+GA DVW DY
Sbjct: 390  TASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVCIVMALSIVLMTSYGALDVWNDY 449

Query: 1228 TSGKSMGFKKPC 1263
             +G+ MGFK PC
Sbjct: 450  VAGRYMGFKLPC 461


>ref|XP_006594653.1| PREDICTED: uncharacterized protein LOC100790958 isoform X1 [Glycine
            max]
          Length = 471

 Score =  374 bits (959), Expect = e-101
 Identities = 210/386 (54%), Positives = 250/386 (64%), Gaps = 28/386 (7%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            +IVAGLDLK ASS SAFMVTGGSIANV C       K G K LID+DIAL SEPCMLLGV
Sbjct: 88   SIVAGLDLKTASSLSAFMVTGGSIANVMCNMCITSPKFGGKSLIDYDIALSSEPCMLLGV 147

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMENEETC 540
            S GVI NLVFPEWLI++ FAIFL   T KTCKSG+LFWK ESE ++R+NG +  E E+  
Sbjct: 148  SLGVICNLVFPEWLITVLFAIFLAWSTSKTCKSGLLFWKAESE-VIRKNGLINEELEKGL 206

Query: 541  GE-----------------STKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLRG 669
             E                 S +  LL  +    K+ IPW +L +L++IWFSFF +YLLRG
Sbjct: 207  LENETIEQRKVYIENNEPKSIEVSLLAPQ-GNSKVRIPWFKLAVLLLIWFSFFSVYLLRG 265

Query: 670  NRYGQGIIHIEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENGD------- 828
            NRYG+GII +E CG GYW            FT WI+  ++S     +  +  D       
Sbjct: 266  NRYGEGIIPMEPCGVGYWILSSVQVPLAVVFTAWIVFRKESLRDRTLIPKLPDMKVPGLT 325

Query: 829  -ETRIHSLVFPIMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTM 1005
             +   + LVFP+MA                  SPLLLQ+G+ PEVTAATCSFMVLFS+TM
Sbjct: 326  KKRPSNILVFPLMALLAGILGGVFGIGGGMLISPLLLQVGVTPEVTAATCSFMVLFSATM 385

Query: 1006 SAIQYLLLGMEHIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTV 1185
            S +QYLLLGMEH+  AL  A +CFVASL+GL +VQRAI K+GRAS+IVFSV  VM +S V
Sbjct: 386  SGLQYLLLGMEHVQAALVLAIMCFVASLLGLLVVQRAIRKYGRASIIVFSVSIVMFISNV 445

Query: 1186 LMTSFGAADVWRDYTSGKSMGFKKPC 1263
            LMTSFGA  VW DY SG+ MGFK PC
Sbjct: 446  LMTSFGAIKVWTDYESGEYMGFKLPC 471


>ref|XP_003531000.1| PREDICTED: uncharacterized protein LOC100806202 [Glycine max]
          Length = 473

 Score =  372 bits (956), Expect = e-100
 Identities = 207/382 (54%), Positives = 247/382 (64%), Gaps = 24/382 (6%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVA LDLK ASS SAFMVTGGSIANV C   A   K G K LID+DIALLSEPCMLLGV
Sbjct: 92   TIVASLDLKTASSLSAFMVTGGSIANVMCNLRATNPKLGGKSLIDYDIALLSEPCMLLGV 151

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKMEN---- 528
            S GVI NLVFPEWLI++ FA+FL   T KTC SGV+FWK+ESE   + +G   +E     
Sbjct: 152  SVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVVFWKIESEERRKNDGFEGLEKGLLE 211

Query: 529  EETCGESTKTPLLRGEMAEGK--------------LGIPWLRLGMLVVIWFSFFVLYLLR 666
            +E+  E  +   +  E  + K              + IPWL+L +L+++WFSFF LYLLR
Sbjct: 212  DESSEEREEGVQVEKEKEKVKSIEEQVMVPEENIRVRIPWLKLVVLLLVWFSFFSLYLLR 271

Query: 667  GNRYGQGIIHIEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQENG---DETR 837
            GN+YGQ II +E CG GYW            FT WI++ ++S     + QE+        
Sbjct: 272  GNKYGQSIIPMEPCGVGYWIISSAQVPLALFFTAWIVYRKESHQDQNLMQEDSCLSSNGP 331

Query: 838  IHSLVFPIMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQ 1017
             + L+FP+MA                  SPLLL +GI PEVTAATCSFMV FSSTMSA+Q
Sbjct: 332  SNKLIFPMMALLAGILGGVFGIGGGMLISPLLLHVGIAPEVTAATCSFMVFFSSTMSALQ 391

Query: 1018 YLLLGMEHIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTS 1197
            YLLLGM+HI  AL  A ICFVASL+GL +VQ+AI  +GR SLIVFSV  VM LS VLMTS
Sbjct: 392  YLLLGMDHIETALILALICFVASLIGLLVVQKAIQSYGRPSLIVFSVSIVMTLSIVLMTS 451

Query: 1198 FGAADVWRDYTSGKSMGFKKPC 1263
            FGA   W+DYTSG+ MGFK PC
Sbjct: 452  FGAIRTWKDYTSGRYMGFKLPC 473


>ref|NP_001242649.1| uncharacterized protein LOC100803518 precursor [Glycine max]
            gi|255636709|gb|ACU18690.1| unknown [Glycine max]
          Length = 473

 Score =  372 bits (955), Expect = e-100
 Identities = 205/382 (53%), Positives = 243/382 (63%), Gaps = 24/382 (6%)
 Frame = +1

Query: 190  TIVAGLDLKIASSYSAFMVTGGSIANVACQFFA---KRGAKPLIDFDIALLSEPCMLLGV 360
            TIVA LDLK ASS SAFMVTGGSIANV C   A   K G K LID+DIALLSEPCMLLGV
Sbjct: 92   TIVACLDLKTASSLSAFMVTGGSIANVLCNLCATSPKFGGKSLIDYDIALLSEPCMLLGV 151

Query: 361  SCGVIFNLVFPEWLISIFFAIFLGLCTFKTCKSGVLFWKLESEGMMRRNGCLKME----- 525
            S GVI NLVFPEWLI++ FA+FL   T KTC SGVLFWK+ESE   + +G  ++E     
Sbjct: 152  SVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVLFWKIESEERRKNDGFERLEKGLLE 211

Query: 526  -------------NEETCGESTKTPLLRGEMAEGKLGIPWLRLGMLVVIWFSFFVLYLLR 666
                         N E  G  +    +       ++ IPWL+L +L+++W SFF LYLLR
Sbjct: 212  DGSSEEREERVQVNNEKAGMKSIEEQVMVPEENIRMRIPWLKLVVLLLVWLSFFSLYLLR 271

Query: 667  GNRYGQGIIHIEACGAGYWXXXXXXXXXXXXFTTWILHSRKSKNTAAIQQEN---GDETR 837
            GN+YGQ II +E CG GYW            FT WI++ ++S     + QE+        
Sbjct: 272  GNKYGQSIIPMEPCGVGYWILSSAQVPLALFFTAWIVYRKESHQDQNLMQEDPCLSSNGP 331

Query: 838  IHSLVFPIMAXXXXXXXXXXXXXXXXXXSPLLLQIGIEPEVTAATCSFMVLFSSTMSAIQ 1017
             + L+FP+MA                  SPLLL +GI PEVTAATCSFMV FSSTMSA+Q
Sbjct: 332  SNKLIFPMMALLAGILGGVFGIGGGMLISPLLLHVGIAPEVTAATCSFMVFFSSTMSALQ 391

Query: 1018 YLLLGMEHIYGALTFAAICFVASLVGLTLVQRAIVKHGRASLIVFSVGTVMALSTVLMTS 1197
            YLLLGM+HI  AL  A ICFVASL+GL +VQRA+  +GR SLIVFSV  VM LS VLMTS
Sbjct: 392  YLLLGMDHIETALILALICFVASLIGLLVVQRAVQSYGRPSLIVFSVSIVMTLSIVLMTS 451

Query: 1198 FGAADVWRDYTSGKSMGFKKPC 1263
            FG    W+DYTSG+ MGFK PC
Sbjct: 452  FGVIRTWKDYTSGRYMGFKLPC 473


Top