BLASTX nr result

ID: Rheum21_contig00009520 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00009520
         (2136 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ17871.1| hypothetical protein PRUPE_ppb003710mg [Prunus pe...   273   2e-70
ref|XP_002283268.2| PREDICTED: uncharacterized protein LOC100253...   271   6e-70
ref|XP_006482290.1| PREDICTED: uncharacterized protein LOC102621...   263   2e-67
gb|EOY04244.1| Uncharacterized protein isoform 2 [Theobroma cacao]    260   2e-66
ref|XP_006363153.1| PREDICTED: uncharacterized protein LOC102587...   257   1e-65
ref|XP_006430814.1| hypothetical protein CICLE_v10011716mg [Citr...   257   1e-65
ref|XP_004232375.1| PREDICTED: uncharacterized protein LOC101248...   257   2e-65
ref|XP_006482289.1| PREDICTED: uncharacterized protein LOC102621...   254   8e-65
gb|EOY04247.1| Uncharacterized protein isoform 5 [Theobroma cacao]    252   5e-64
gb|EOY04243.1| Uncharacterized protein isoform 1 [Theobroma cacao]    249   3e-63
ref|XP_002305950.2| hypothetical protein POPTR_0004s10220g [Popu...   248   6e-63
gb|ABK95828.1| unknown [Populus trichocarpa]                          248   8e-63
gb|EXB97178.1| hypothetical protein L484_008668 [Morus notabilis]     245   6e-62
ref|XP_002329273.1| predicted protein [Populus trichocarpa]           238   8e-60
ref|XP_006373454.1| hypothetical protein POPTR_0017s13920g [Popu...   225   5e-56
gb|EOY04245.1| Uncharacterized protein isoform 3 [Theobroma caca...   219   3e-54
ref|XP_004156925.1| PREDICTED: uncharacterized LOC101211683 [Cuc...   216   4e-53
ref|XP_006593724.1| PREDICTED: uncharacterized protein LOC100805...   211   1e-51
gb|ESW23618.1| hypothetical protein PHAVU_004G062800g, partial [...   196   4e-47
ref|XP_002882236.1| hypothetical protein ARALYDRAFT_340395 [Arab...   179   4e-42

>gb|EMJ17871.1| hypothetical protein PRUPE_ppb003710mg [Prunus persica]
          Length = 503

 Score =  273 bits (698), Expect = 2e-70
 Identities = 180/433 (41%), Positives = 259/433 (59%), Gaps = 10/433 (2%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            AP SS+  F           +  LF+ +GP  GGS+VL+RF +  K+   FV A+V C Q
Sbjct: 90   APPSSSSTFLLLQNPNPNPNTRVLFIVSGPYRGGSQVLLRFYILHKQKQ-FVRAQVVCTQ 148

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            + L+FD+K G VLVD  HG+ + L+GSVN+FAMYS S++K+WVF VKS   D  D N G 
Sbjct: 149  KELQFDQKLG-VLVDAHHGVSIKLAGSVNFFAMYSVSSSKIWVFAVKSIDNDDNDDNDGM 207

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
            V   ++L++CA+I+C   V S+S+S GFLI+GE NGVRVF LR LVKG V          
Sbjct: 208  V---VKLMRCAVIECCKLVWSISISFGFLILGEDNGVRVFNLRQLVKGRV---------- 254

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMP 933
                 ++ + L    + + RN  LPNG++ G +  +D+        K          G  
Sbjct: 255  -----RKAKLLNSSSKTEGRNLCLPNGVI-GDHAHSDLGD------KGNKYGGGKFHGTS 302

Query: 932  RLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKA 753
             +  NG +  + D    S+ K +S +L QDS E G  F+ F  K+ E+ ++ +  +P KA
Sbjct: 303  EIPCNGDLCGKNDRNYVSA-KQRSVKLRQDSPEEGVCFVTFKGKEFET-SKSTRMIPAKA 360

Query: 752  ISIHAFMKNKFLVADSDGNVHILCASVSGP-------SVMKQLSNIKEVQHIAVLPDLSE 594
            ISI A   NKFL+ DS+G + IL   +S P       S +++L +I +VQ +AVLPD++ 
Sbjct: 361  ISIEALSPNKFLILDSNGALRIL--HISSPVLGSNITSYLRELPHIMKVQKLAVLPDIAS 418

Query: 593  SSQTVWLSDGYHSVHVMAASDTNTCSNVN-KSDSED--MRLSVVQTIFVSENIRDVQPLS 423
             +Q+VW SDG++SVH+M ASD +   N N ++DSE+  + +SVV TIF SE I+D+ PL+
Sbjct: 419  RTQSVWASDGFNSVHMMLASDMDNAGNENDRNDSEEKLIHISVVLTIFASEKIQDLIPLA 478

Query: 422  ANAILILGQDNLY 384
            ANAILILGQ N++
Sbjct: 479  ANAILILGQGNMW 491


>ref|XP_002283268.2| PREDICTED: uncharacterized protein LOC100253163 [Vitis vinifera]
          Length = 466

 Score =  271 bits (694), Expect = 6e-70
 Identities = 178/414 (42%), Positives = 241/414 (58%), Gaps = 9/414 (2%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQRGLEFDRKSGGVLVDCSHGLKVV 1404
            LFV A P   G+ V++RF V  K    F  A V C QR L+FD K G VL + +HG+ V 
Sbjct: 105  LFVVAAPHRAGAAVILRFYVLQKTQL-FTKAEVLCTQRDLQFDPKLG-VLFNANHGVSVK 162

Query: 1403 LSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTSMS 1224
            L GS+N FAMYS S +K+WVF VK A +D+ D       V L+L KCA+IDC +PV S+S
Sbjct: 163  LGGSINIFAMYSVSNSKIWVFSVKMAGDDRDDG------VVLKLRKCAVIDCGVPVFSIS 216

Query: 1223 VSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRNSN 1044
            VS  FLI+GE NGVRVF LR LVKG +           R   +E++NL           N
Sbjct: 217  VSGEFLILGEENGVRVFQLRPLVKGWI-----------RKEQRESKNL-----------N 254

Query: 1043 LPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVKPK 864
             PNG  G ++   + N                      +  NG +  + D     SVK +
Sbjct: 255  FPNGC-GSKSAGVEANME--------------------IACNGDLEGRTD-LHRVSVKRR 292

Query: 863  SRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLPKKAISIHAFMKNKFLVADSDGNVHI 687
            S R  QDS+E  A F+ F  K+   + +   P +P KA+SI A    KFL+ DSDG+VH+
Sbjct: 293  SVRFRQDSSEGSACFVAFKGKEVGHLKSMMPPLIPVKAVSIQALSAKKFLILDSDGDVHL 352

Query: 686  LCASVS--GPSV---MKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDTNT 522
            LC S+   G  +   M+Q +N  +VQ +AVLPD S   +TVW+SDG++SVH+M  SDT+T
Sbjct: 353  LCLSIYHLGSEITCHMRQFTNTMKVQKLAVLPDTSTRGRTVWISDGFYSVHMMTVSDTDT 412

Query: 521  CSNV-NKSDSED--MRLSVVQTIFVSENIRDVQPLSANAILILGQDNLYAYAIS 369
             +N  +++DSE+   ++SV Q IF SE I+D+ PL+ANA+LILGQ +L+AYAIS
Sbjct: 413  SANEDDENDSEEKLKQISVTQAIFASERIQDIIPLAANALLILGQGSLFAYAIS 466


>ref|XP_006482290.1| PREDICTED: uncharacterized protein LOC102621692 isoform X2 [Citrus
            sinensis] gi|568857474|ref|XP_006482291.1| PREDICTED:
            uncharacterized protein LOC102621692 isoform X3 [Citrus
            sinensis] gi|568857476|ref|XP_006482292.1| PREDICTED:
            uncharacterized protein LOC102621692 isoform X4 [Citrus
            sinensis]
          Length = 449

 Score =  263 bits (673), Expect = 2e-67
 Identities = 175/439 (39%), Positives = 251/439 (57%), Gaps = 11/439 (2%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PS S  F   +            F+A GP     ++++R  V  +    +  A+V C Q
Sbjct: 69   SPSYSFTFLLLNHTPNPNPSPRVAFIAVGPHRSEPKLVLRLYVLKRNNF-YGKAQVFCKQ 127

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +G+ FD K G VL+D +HG+ + L GSVN+FAM+S S++K+WVFGV     D  D    G
Sbjct: 128  KGVSFDEKLG-VLLDITHGVGLKLVGSVNFFAMHSLSSSKIWVFGVMLMDGDGDD----G 182

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
            VRVNL  ++CA+I+C  PV S+S+S GF+I+GE NGVRV  LR LVKG V          
Sbjct: 183  VRVNL--MRCAVIECCKPVWSLSLSFGFMILGEDNGVRVLNLRSLVKGKVK--------- 231

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVG--GQNGRNDINKNELNQVKVKVIAHYNAVG 939
                             K++NS+LPNG++G  G +G  +                     
Sbjct: 232  -----------------KIKNSSLPNGIIGDYGFDGPTE--------------------- 253

Query: 938  MPRLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLP 762
              R+  NG +  +ID   + SVK +S +  QDS+E GA FL F  K+ E + + K P + 
Sbjct: 254  --RIACNGYLDEKID-KHSVSVKQRSVKYKQDSDEGGACFLAFRMKEVEGLKSTKMPLMS 310

Query: 761  KKAISIHAFMKNKFLVADSDGNVHILCAS--VSGPSV---MKQLSNIKEVQHIAVLPDLS 597
             KAISI A    KFL+ DS GN+H+L  S  V+G ++   ++QL ++  VQ +AV PD+S
Sbjct: 311  LKAISIQAVSLKKFLILDSSGNLHMLHLSSPVAGSNIIGHIRQLPHVMNVQKLAVHPDIS 370

Query: 596  ESSQTVWLSDGYHSVHVMAASDTNTCSNVN-KSDSED--MRLSVVQTIFVSENIRDVQPL 426
              +QT+W++DGYHSV+VM ASD +   N N +++SE+   + SV++ IFV E I+D+ PL
Sbjct: 371  LRTQTIWITDGYHSVNVMVASDMDAADNENGRNESEENLTQCSVIEAIFVGEKIQDLVPL 430

Query: 425  SANAILILGQDNLYAYAIS 369
            +AN +LILGQ NLYAYA S
Sbjct: 431  AANGLLILGQGNLYAYANS 449


>gb|EOY04244.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 445

 Score =  260 bits (664), Expect = 2e-66
 Identities = 166/413 (40%), Positives = 243/413 (58%), Gaps = 8/413 (1%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRF-LVQSKEAAGFVPARVGCG-QRGLEFDRKSGGVLVDCSHGLK 1410
            LF+  GP  GGS+VL+RF L ++ ++  F  A+V    Q+G+EFD K G VL+D SHGLK
Sbjct: 82   LFIVGGPYKGGSKVLLRFFLFRNDDSKVFEKAKVVVSNQKGIEFDDKVG-VLIDVSHGLK 140

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            V+++GSVN+FA YSAS++KVW+FGVK    D+GD       V  +L+KCA+IDC  PV S
Sbjct: 141  VMIAGSVNFFAFYSASSSKVWIFGVKLVGNDEGDDG-----VVFKLMKCAVIDCTKPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            MSVSS  L++GE NGVRV+ LR LVKG   R                         +++ 
Sbjct: 196  MSVSSECLVLGEENGVRVWNLRELVKGKKIR-------------------------RVKY 230

Query: 1049 SNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVK 870
            S L NG++G  +G      +                    +V NG +  +I+     SVK
Sbjct: 231  SGLSNGVIGDSDGFGGGGSSSSG-----------------IVCNGYLNEKIE-KHCVSVK 272

Query: 869  PKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLPKKAISIHAFMKNKFLVADSDGNV 693
             +S +  Q+S E GA F+ F  K+ + + + K P++  KAISI      KFL+ +S G++
Sbjct: 273  QRSGKYRQESAEEGACFVAFEQKEVKGLKSTKVPFMSMKAISIQPLSPKKFLILNSIGDL 332

Query: 692  ---HILCASVSGPSV--MKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
               H+L  +V       M+QL ++ +VQ +AVLPD+S   QTVW+SDG+H+VH+M  +  
Sbjct: 333  SVLHVLNTAVGSNITCHMRQLPHVLKVQKLAVLPDISSRRQTVWISDGHHTVHMMDITSA 392

Query: 527  NTCSNVNKSDSEDMRLSVVQTIFVSENIRDVQPLSANAILILGQDNLYAYAIS 369
               ++  +SD + +R+SV Q IF SE I+D+ P++AN+I+ILG+ +LY YAIS
Sbjct: 393  VNENDERESDEKLLRISVSQAIFSSEKIQDMIPMAANSIMILGRGSLYTYAIS 445


>ref|XP_006363153.1| PREDICTED: uncharacterized protein LOC102587994 [Solanum tuberosum]
          Length = 469

 Score =  257 bits (657), Expect = 1e-65
 Identities = 169/418 (40%), Positives = 239/418 (57%), Gaps = 11/418 (2%)
 Frame = -2

Query: 1589 LTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQRGLEFDRKSGGVLVDCSHGLK 1410
            +TLF+ + P  GGS VL RF + +     F PA+V C     +FD    GV+   SHG+ 
Sbjct: 87   ITLFLISSPISGGSAVLFRFYILNSARKSFTPAKVVCNHSDFKFDESKLGVVFGVSHGVS 146

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            V L   VN FA+YS S  KVWVF VK         + GG    L+L+K A+IDC LPV S
Sbjct: 147  VKLVADVNVFALYSISNGKVWVFAVK---------HLGG--EELKLMKYAVIDCSLPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            +SVS G LI+GE NGVRVFPLR LVKG V +     K  L  G  E + +E      ++ 
Sbjct: 196  ISVSFGVLILGEDNGVRVFPLRPLVKGRVKKERGANKKSLN-GGLEKDKME------IKK 248

Query: 1049 SNLPNGLVGGQNGR-NDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSV 873
              L NG++ G N   +  + ++L ++K                 NGV+  +++     S 
Sbjct: 249  LPLRNGMIHGINAEISFADGSKLMELK--------------FPSNGVLDERVENR-TESA 293

Query: 872  KPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKAISIHAFMKNKFLVADSDGNV 693
            K +S RL QDS E  A F+ F +KD+   + K P    KAI I A    +FL+ DS+GN+
Sbjct: 294  KLRSVRLRQDSREGIANFVAFKNKDDNFESIKIPVKSAKAIGIQALSSTRFLILDSEGNL 353

Query: 692  HI--LCASVSG---PSVMKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
            H+  L  SV G   P  MKQL++  +V+ + VLPD S  +QTVW+SD  H+VH++A +D 
Sbjct: 354  HLLFLATSVHGSETPYSMKQLTHNMKVRKLTVLPDSSTRAQTVWISDALHTVHMIAVTDM 413

Query: 527  NTCSNVNKSDSED-----MRLSVVQTIFVSENIRDVQPLSANAILILGQDNLYAYAIS 369
            +  ++VN++D +D     ++ SVVQ IF SE ++++  LSAN IL+LGQ +++AYAIS
Sbjct: 414  D--ASVNQTDCKDPAEKLVQTSVVQAIFSSEKVQEIAALSANTILLLGQGSMFAYAIS 469


>ref|XP_006430814.1| hypothetical protein CICLE_v10011716mg [Citrus clementina]
            gi|557532871|gb|ESR44054.1| hypothetical protein
            CICLE_v10011716mg [Citrus clementina]
          Length = 448

 Score =  257 bits (657), Expect = 1e-65
 Identities = 170/434 (39%), Positives = 248/434 (57%), Gaps = 11/434 (2%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PS S  F   +            F+A GP     ++++R  V  +    +  A+V C Q
Sbjct: 69   SPSYSFTFLLLNHTPNPNPSPRVAFIAVGPHRSEPKLVLRLYVLKRNNF-YGKAQVFCKQ 127

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +G+ FD K G VL+D +HGL + L GSVN+FAMYS S++K+WVFGVK    D  D    G
Sbjct: 128  KGVSFDEKLG-VLLDINHGLGLKLVGSVNFFAMYSLSSSKIWVFGVKLMDGDGDD----G 182

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
            VRV  +L++CA+I+C  PV S+S+S GF+I+GE NGVRV  LR LVKG V          
Sbjct: 183  VRV--KLMRCAVIECCKPVWSLSLSFGFMILGEDNGVRVLNLRSLVKGKVK--------- 231

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVG--GQNGRNDINKNELNQVKVKVIAHYNAVG 939
                             K++NS+LPNG++G  G +G  +                     
Sbjct: 232  -----------------KIKNSSLPNGIIGDYGFDGPTE--------------------- 253

Query: 938  MPRLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLP 762
              R+  NG +  +ID   + SVK +S +  QDS+E GA FL F  K+ E + + K P + 
Sbjct: 254  --RIACNGYLDEKID-KHSVSVKQRSVKYKQDSDEGGACFLAFRMKEVEGLKSTKMPLMS 310

Query: 761  KKAISIHAFMKNKFLVADSDGNVHILCAS--VSGPSV---MKQLSNIKEVQHIAVLPDLS 597
             KAISI A    KFL+ DS GN+H+L  S  V+G ++   ++QL ++  VQ +AV PD+S
Sbjct: 311  LKAISIQAVSLKKFLILDSSGNLHMLHLSSPVAGSNIIGHIRQLPHVMNVQKLAVHPDIS 370

Query: 596  ESSQTVWLSDGYHSVHVMAASDTNTCSNVN-KSDSED--MRLSVVQTIFVSENIRDVQPL 426
              +QT+W++DGYHSV+VM +SD +   N N +++SE+   + SV++ IFV E I+D+ PL
Sbjct: 371  LRTQTIWITDGYHSVNVMVSSDMDAADNENGRNESEENLTQCSVIEAIFVGEKIQDLVPL 430

Query: 425  SANAILILGQDNLY 384
            +AN +LILGQ N++
Sbjct: 431  AANGLLILGQGNIW 444


>ref|XP_004232375.1| PREDICTED: uncharacterized protein LOC101248829 [Solanum
            lycopersicum]
          Length = 466

 Score =  257 bits (656), Expect = 2e-65
 Identities = 168/418 (40%), Positives = 240/418 (57%), Gaps = 11/418 (2%)
 Frame = -2

Query: 1589 LTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQRGLEFDRKSGGVLVDCSHGLK 1410
            +TLF+ + P +GGS VL RF + +     F PA+V C     +FD    GV+   SHG+ 
Sbjct: 87   ITLFLISSPIYGGSAVLFRFYILNSARKSFTPAKVVCNHTDFKFDESKFGVVFGVSHGVS 146

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            + L   VN FA+YS S ++VWVF VK         + GG    L+L+K A+IDC LPV S
Sbjct: 147  LKLVADVNVFALYSISNSRVWVFAVK---------HLGG--EELKLMKYAVIDCSLPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            +SVS G LI+GE NGVRVFPLR LVKG V +     K  L  G  E + +E      ++ 
Sbjct: 196  ISVSFGVLILGEDNGVRVFPLRPLVKGRVKKERATNKKSLN-GGLEKDKME------IKK 248

Query: 1049 SNLPNGLVGGQNGR-NDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSV 873
              L NG++ G N   +  + ++L ++K                 NG++  + +     S 
Sbjct: 249  LPLRNGMIHGMNAEISAADGSKLMELK--------------FTSNGMVENRTE-----SA 289

Query: 872  KPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKAISIHAFMKNKFLVADSDGNV 693
            K +S RL QDS E  A F+ F +KD+   + K P    KAI I A    +FL+ DS+GN+
Sbjct: 290  KLRSVRLRQDSREGIANFVAFKNKDDNFESIKIPVKSAKAIGIQALSSTRFLILDSEGNL 349

Query: 692  HIL--CASVSG---PSVMKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
            H+L    SV G   P  MKQL++  +V+ + VLPD S  +QTVW +D  H+VH++A +D 
Sbjct: 350  HLLFPATSVHGSETPYSMKQLTHNMKVRKLTVLPDSSTRTQTVWTTDALHTVHMIAVTDM 409

Query: 527  NTCSNVNKSDSED-----MRLSVVQTIFVSENIRDVQPLSANAILILGQDNLYAYAIS 369
            +  S+VNK+DS+D     ++ SVVQ IF SE ++++  LSAN IL+LGQ +++AYAIS
Sbjct: 410  D-ASSVNKTDSKDPAEKLVQTSVVQAIFSSEKVQEIAALSANTILLLGQGSMFAYAIS 466


>ref|XP_006482289.1| PREDICTED: uncharacterized protein LOC102621692 isoform X1 [Citrus
            sinensis]
          Length = 458

 Score =  254 bits (650), Expect = 8e-65
 Identities = 169/434 (38%), Positives = 247/434 (56%), Gaps = 11/434 (2%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PS S  F   +            F+A GP     ++++R  V  +    +  A+V C Q
Sbjct: 69   SPSYSFTFLLLNHTPNPNPSPRVAFIAVGPHRSEPKLVLRLYVLKRNNF-YGKAQVFCKQ 127

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +G+ FD K G VL+D +HG+ + L GSVN+FAM+S S++K+WVFGV     D  D    G
Sbjct: 128  KGVSFDEKLG-VLLDITHGVGLKLVGSVNFFAMHSLSSSKIWVFGVMLMDGDGDD----G 182

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
            VRVNL  ++CA+I+C  PV S+S+S GF+I+GE NGVRV  LR LVKG V          
Sbjct: 183  VRVNL--MRCAVIECCKPVWSLSLSFGFMILGEDNGVRVLNLRSLVKGKVK--------- 231

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVG--GQNGRNDINKNELNQVKVKVIAHYNAVG 939
                             K++NS+LPNG++G  G +G  +                     
Sbjct: 232  -----------------KIKNSSLPNGIIGDYGFDGPTE--------------------- 253

Query: 938  MPRLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLP 762
              R+  NG +  +ID   + SVK +S +  QDS+E GA FL F  K+ E + + K P + 
Sbjct: 254  --RIACNGYLDEKID-KHSVSVKQRSVKYKQDSDEGGACFLAFRMKEVEGLKSTKMPLMS 310

Query: 761  KKAISIHAFMKNKFLVADSDGNVHILCAS--VSGPSV---MKQLSNIKEVQHIAVLPDLS 597
             KAISI A    KFL+ DS GN+H+L  S  V+G ++   ++QL ++  VQ +AV PD+S
Sbjct: 311  LKAISIQAVSLKKFLILDSSGNLHMLHLSSPVAGSNIIGHIRQLPHVMNVQKLAVHPDIS 370

Query: 596  ESSQTVWLSDGYHSVHVMAASDTNTCSNVN-KSDSED--MRLSVVQTIFVSENIRDVQPL 426
              +QT+W++DGYHSV+VM ASD +   N N +++SE+   + SV++ IFV E I+D+ PL
Sbjct: 371  LRTQTIWITDGYHSVNVMVASDMDAADNENGRNESEENLTQCSVIEAIFVGEKIQDLVPL 430

Query: 425  SANAILILGQDNLY 384
            +AN +LILGQ N++
Sbjct: 431  AANGLLILGQGNIW 444


>gb|EOY04247.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 458

 Score =  252 bits (643), Expect = 5e-64
 Identities = 162/407 (39%), Positives = 238/407 (58%), Gaps = 8/407 (1%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRF-LVQSKEAAGFVPARVGCG-QRGLEFDRKSGGVLVDCSHGLK 1410
            LF+  GP  GGS+VL+RF L ++ ++  F  A+V    Q+G+EFD K G VL+D SHGLK
Sbjct: 82   LFIVGGPYKGGSKVLLRFFLFRNDDSKVFEKAKVVVSNQKGIEFDDKVG-VLIDVSHGLK 140

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            V+++GSVN+FA YSAS++KVW+FGVK    D+GD       V  +L+KCA+IDC  PV S
Sbjct: 141  VMIAGSVNFFAFYSASSSKVWIFGVKLVGNDEGDDG-----VVFKLMKCAVIDCTKPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            MSVSS  L++GE NGVRV+ LR LVKG   R                         +++ 
Sbjct: 196  MSVSSECLVLGEENGVRVWNLRELVKGKKIR-------------------------RVKY 230

Query: 1049 SNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVK 870
            S L NG++G  +G      +                    +V NG +  +I+     SVK
Sbjct: 231  SGLSNGVIGDSDGFGGGGSSSSG-----------------IVCNGYLNEKIE-KHCVSVK 272

Query: 869  PKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLPKKAISIHAFMKNKFLVADSDGNV 693
             +S +  Q+S E GA F+ F  K+ + + + K P++  KAISI      KFL+ +S G++
Sbjct: 273  QRSGKYRQESAEEGACFVAFEQKEVKGLKSTKVPFMSMKAISIQPLSPKKFLILNSIGDL 332

Query: 692  ---HILCASVSGPSV--MKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
               H+L  +V       M+QL ++ +VQ +AVLPD+S   QTVW+SDG+H+VH+M  +  
Sbjct: 333  SVLHVLNTAVGSNITCHMRQLPHVLKVQKLAVLPDISSRRQTVWISDGHHTVHMMDITSA 392

Query: 527  NTCSNVNKSDSEDMRLSVVQTIFVSENIRDVQPLSANAILILGQDNL 387
               ++  +SD + +R+SV Q IF SE I+D+ P++AN+I+ILG+ NL
Sbjct: 393  VNENDERESDEKLLRISVSQAIFSSEKIQDMIPMAANSIMILGRGNL 439


>gb|EOY04243.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 480

 Score =  249 bits (636), Expect = 3e-63
 Identities = 160/405 (39%), Positives = 237/405 (58%), Gaps = 8/405 (1%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRF-LVQSKEAAGFVPARVGCG-QRGLEFDRKSGGVLVDCSHGLK 1410
            LF+  GP  GGS+VL+RF L ++ ++  F  A+V    Q+G+EFD K G VL+D SHGLK
Sbjct: 82   LFIVGGPYKGGSKVLLRFFLFRNDDSKVFEKAKVVVSNQKGIEFDDKVG-VLIDVSHGLK 140

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            V+++GSVN+FA YSAS++KVW+FGVK    D+GD       V  +L+KCA+IDC  PV S
Sbjct: 141  VMIAGSVNFFAFYSASSSKVWIFGVKLVGNDEGDDG-----VVFKLMKCAVIDCTKPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            MSVSS  L++GE NGVRV+ LR LVKG   R                         +++ 
Sbjct: 196  MSVSSECLVLGEENGVRVWNLRELVKGKKIR-------------------------RVKY 230

Query: 1049 SNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVK 870
            S L NG++G  +G      +                    +V NG +  +I+     SVK
Sbjct: 231  SGLSNGVIGDSDGFGGGGSSSSG-----------------IVCNGYLNEKIE-KHCVSVK 272

Query: 869  PKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLPKKAISIHAFMKNKFLVADSDGNV 693
             +S +  Q+S E GA F+ F  K+ + + + K P++  KAISI      KFL+ +S G++
Sbjct: 273  QRSGKYRQESAEEGACFVAFEQKEVKGLKSTKVPFMSMKAISIQPLSPKKFLILNSIGDL 332

Query: 692  ---HILCASVSGPSV--MKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
               H+L  +V       M+QL ++ +VQ +AVLPD+S   QTVW+SDG+H+VH+M  +  
Sbjct: 333  SVLHVLNTAVGSNITCHMRQLPHVLKVQKLAVLPDISSRRQTVWISDGHHTVHMMDITSA 392

Query: 527  NTCSNVNKSDSEDMRLSVVQTIFVSENIRDVQPLSANAILILGQD 393
               ++  +SD + +R+SV Q IF SE I+D+ P++AN+I+ILG++
Sbjct: 393  VNENDERESDEKLLRISVSQAIFSSEKIQDMIPMAANSIMILGRE 437


>ref|XP_002305950.2| hypothetical protein POPTR_0004s10220g [Populus trichocarpa]
            gi|550340727|gb|EEE86461.2| hypothetical protein
            POPTR_0004s10220g [Populus trichocarpa]
          Length = 442

 Score =  248 bits (634), Expect = 6e-63
 Identities = 171/435 (39%), Positives = 239/435 (54%), Gaps = 8/435 (1%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PSSS+ F               LF+ AGP  GGS++L+RF V   ++  + P +V C Q
Sbjct: 63   SPSSSSSFLLIHQDPIPK----VLFLVAGPYKGGSQILLRFHVLQNDSFFYKP-QVVCNQ 117

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +GL FD K G VL+D +HG+ + + GS+N+F ++S S+ KVWVF VK  + D GD     
Sbjct: 118  KGLAFDSKLG-VLLDINHGVSIKIVGSINFFVLHSVSSKKVWVFAVK--IIDDGDGEM-- 172

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
                L+L++CA+I+C +PV S+SVSSG LI+GE NGVRVF LR LVK  V +        
Sbjct: 173  ----LKLMRCAVIECSVPVWSISVSSGVLILGEDNGVRVFNLRQLVKWKVKKVK------ 222

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMP 933
               G   N  L+      L++SN       G    N ++ +  N                
Sbjct: 223  ---GFDSNGKLD---RKGLKSSN-------GDGEDNGVSSSSGNAC-------------- 255

Query: 932  RLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKA 753
                NG +  + D     SVK +S R SQDS E GA F+ F  K E +   K   L  KA
Sbjct: 256  ----NGALDGKTD-KHCVSVKQRSVRCSQDSGEGGACFVAF--KREATEGMKPTTL--KA 306

Query: 752  ISIHAFMKNKFLVADSDGNVHILCAS--VSGPSV---MKQLSNIKEVQHIAVLPDLSESS 588
            +SI A    KF++ DS G++HILC S  V GP+V   M++L +  +VQ +AV PD S   
Sbjct: 307  VSIQALPPKKFVILDSTGDLHILCLSAPVVGPNVIAHMRRLPHSMKVQKLAVFPDFSSKM 366

Query: 587  QTVWLSDGYHSVHVMAASDTNTCSNVNKSD---SEDMRLSVVQTIFVSENIRDVQPLSAN 417
            QT W+SDG+HSVH +  S+ +   N N  D    + +R++V+Q I  +E I+D+ PL AN
Sbjct: 367  QTFWVSDGFHSVHTITLSNMDAAVNTNDGDVTQEKLIRITVIQAILSAEKIQDLIPLGAN 426

Query: 416  AILILGQDNLYAYAI 372
             ILILGQ N+Y+Y I
Sbjct: 427  GILILGQGNIYSYTI 441


>gb|ABK95828.1| unknown [Populus trichocarpa]
          Length = 442

 Score =  248 bits (633), Expect = 8e-63
 Identities = 172/435 (39%), Positives = 238/435 (54%), Gaps = 8/435 (1%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PSSS+ F               LF+ AGP  GGS++L+RF V   ++  + P +V C Q
Sbjct: 63   SPSSSSSFLLIHQDPIPK----VLFLVAGPYKGGSQILLRFHVLQNDSFFYKP-QVVCNQ 117

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +GL FD K G VL+D +HG+ + + GS+N+F ++S S+ KVWVF VK  + D GD     
Sbjct: 118  KGLAFDSKLG-VLLDINHGVSIKIVGSINFFVLHSVSSKKVWVFAVK--IIDDGDGEM-- 172

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
                L+L++CA+I+C +PV S+SVSSG LI+GE NGVRVF LR LVK  V +        
Sbjct: 173  ----LKLMRCAVIECSVPVWSISVSSGVLILGEDNGVRVFNLRQLVKWKVKKVK------ 222

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMP 933
               G   N  L+      L++SN       G    N ++ +  N                
Sbjct: 223  ---GFDSNGKLD---RKGLKSSN-------GDGEDNGVSSSSGNAC-------------- 255

Query: 932  RLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKA 753
                NG +  + D     SVK +S R SQDS E GA F+ F  K E +   K   L  KA
Sbjct: 256  ----NGALDGKTD-KHCVSVKQRSVRCSQDSGEGGACFVAF--KREATEGMKPTTL--KA 306

Query: 752  ISIHAFMKNKFLVADSDGNVHILCAS--VSGPSVM---KQLSNIKEVQHIAVLPDLSESS 588
            +SI A    KF++ DS G++HILC S  V GP+VM   +QL +  +VQ +AV PD S   
Sbjct: 307  VSIQALPPKKFVILDSIGDLHILCLSAPVVGPNVMAHMRQLPHSMKVQKLAVFPDFSSKM 366

Query: 587  QTVWLSDGYHSVHVMAASDTNTCSNVNKSD---SEDMRLSVVQTIFVSENIRDVQPLSAN 417
            QT W+SDG HSVH +  S+ +   N N  D    + +R++V+Q I  +E I+D+ PL AN
Sbjct: 367  QTFWVSDGLHSVHTITLSNMDAAVNTNNGDVTQEKLIRITVIQAILSAEKIQDLIPLGAN 426

Query: 416  AILILGQDNLYAYAI 372
             ILILGQ N+Y+Y I
Sbjct: 427  GILILGQGNIYSYTI 441


>gb|EXB97178.1| hypothetical protein L484_008668 [Morus notabilis]
          Length = 600

 Score =  245 bits (625), Expect = 6e-62
 Identities = 173/420 (41%), Positives = 244/420 (58%), Gaps = 24/420 (5%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRF-LVQSKEAAGFVPARVGCGQRGLEFDRKSGGVLVDCSHGLKV 1407
            LFVA+GP  GGSR+L+RF ++Q K+   F  ARV C Q+  +F  + G VLVD  HG+ V
Sbjct: 87   LFVASGPHAGGSRILLRFYILQGKKL--FHKARVVCNQKDFQFVERFG-VLVDSVHGVSV 143

Query: 1406 VLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTSM 1227
             L+GSVN+FAMYS S +K W+F VK  V+D+           ++L++CA+I+C  PV S+
Sbjct: 144  KLAGSVNFFAMYSVSGSKAWIFAVK-LVDDEV----------VKLMRCAVIECSKPVFSI 192

Query: 1226 SVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRNS 1047
            ++S G LI+GE  GVRVF LR LVKG                +K+ +NL+   +   R S
Sbjct: 193  TLSFGVLILGEEWGVRVFNLRQLVKGR---------------AKKVKNLQPNSKSDGRKS 237

Query: 1046 NLPNGLVGGQ-----------NGRNDINKNELNQVKVKVIAHY-NAVGMPRLVPNGVMGA 903
             LPNG++G              G +   K  +     +    Y +      LV + ++  
Sbjct: 238  RLPNGVIGADVLGDLKDYVHSEGGDRCGKCVIEGSSERTCNCYLDGKSNRHLVSDNIVNF 297

Query: 902  Q--IDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPK-KAISIHAFM 732
                +     +VK ++ RL QDS+E GA FL F+ KD E  A KS  +   KAISI A  
Sbjct: 298  AHVANQVVEHAVKQRAVRLRQDSSEAGACFLAFSGKDVE--ASKSRVITSVKAISIQALS 355

Query: 731  KNKFLVADSDGNVHILC--ASVSGPSV---MKQLSNIKEVQHIAVLPDLSESSQTVWLSD 567
              KFL+ DS GN+H+LC    V+G  +   ++QL  +  VQ +AVL D S  +QTVWLSD
Sbjct: 356  PKKFLILDSAGNLHLLCWFNRVTGSDMTPHIRQLPQVTNVQKLAVLADSSIRTQTVWLSD 415

Query: 566  GYHSVHVMAASD-TNTCSNVNKSDSED--MRLSVVQTIFVSENIRDVQPLSANAILILGQ 396
            G+HS+HV+AASD     S  +++++E+  M++SV+Q IF SE I DV PL++NAILILGQ
Sbjct: 416  GHHSLHVVAASDIVAAVSENDRTENEEKLMQISVIQAIFASEKIEDVIPLASNAILILGQ 475


>ref|XP_002329273.1| predicted protein [Populus trichocarpa]
          Length = 434

 Score =  238 bits (607), Expect = 8e-60
 Identities = 162/427 (37%), Positives = 230/427 (53%), Gaps = 8/427 (1%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PSSS+ F               LF+ A P  GGS++L+RF +  K+   F   +V C Q
Sbjct: 62   SPSSSSSFLLIHQDPIPK----VLFLVASPYKGGSQILLRFYLLQKDNI-FCKPQVVCNQ 116

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +G+ FD K G VL+D +HG+ + + GSVN+F ++S S+ KVWVF VK  + D GD     
Sbjct: 117  KGIAFDSKLG-VLLDINHGVSIKIVGSVNFFVLHSVSSKKVWVFAVK--LIDDGDGEM-- 171

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
                ++L++CA+I+C +PV S+SVSSG L++GE NGVRVF LR LVKG V          
Sbjct: 172  ----VKLMRCAVIECSVPVWSISVSSGVLVLGEDNGVRVFNLRQLVKGRV---------- 217

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMP 933
                 K  +++    +   +   LPNG+VG        + N                   
Sbjct: 218  -----KNVKDISSNGKSDGKGFKLPNGVVGDDYFHGSSSGNGC----------------- 255

Query: 932  RLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKA 753
                NGV+  + D     SVK +S R  QDS E GA F+ F  ++ E +  K+     KA
Sbjct: 256  ----NGVLDMKTD-KQYVSVKLRSVRCRQDSGEGGACFVAFKREEVEVLKPKT----SKA 306

Query: 752  ISIHAFMKNKFLVADSDGNVHILC--ASVSGPSV---MKQLSNIKEVQHIAVLPDLSESS 588
            +SI A    KF++ DS G++HILC  A V G +    M++L +  +VQ +AVLPD+S   
Sbjct: 307  VSIQALSHKKFVILDSMGDLHILCLSAPVIGSNFMAHMRRLPHSMKVQKLAVLPDISLKM 366

Query: 587  QTVWLSDGYHSVHVMAASDTNTCSNVNKSDSED---MRLSVVQTIFVSENIRDVQPLSAN 417
            QT W+SDG HSVH +  SD     N N  D      ++++V+Q IF +E I+D+ PL AN
Sbjct: 367  QTFWVSDGLHSVHTITLSDMGAAVNSNNEDETQEKLIQITVIQAIFSAEKIQDLIPLGAN 426

Query: 416  AILILGQ 396
             ILILGQ
Sbjct: 427  GILILGQ 433


>ref|XP_006373454.1| hypothetical protein POPTR_0017s13920g [Populus trichocarpa]
            gi|550320276|gb|ERP51251.1| hypothetical protein
            POPTR_0017s13920g [Populus trichocarpa]
          Length = 427

 Score =  225 bits (574), Expect = 5e-56
 Identities = 155/420 (36%), Positives = 223/420 (53%), Gaps = 8/420 (1%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +PSSS+ F               LF+ A P  GG ++L+RF +  K+   F   +V C Q
Sbjct: 62   SPSSSSSFLLIHQDPIPK----VLFLVASPYKGGYQILLRFYLLQKDNI-FCKPQVVCNQ 116

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            +G+ FD K G VL+D +HG+ + + GSVN+F ++S S+ KVWVF VK  + D GD     
Sbjct: 117  KGIAFDSKLG-VLLDINHGVSIKIVGSVNFFVLHSVSSKKVWVFAVK--LIDDGDGEM-- 171

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
                ++L++CA+I+C +PV S+SVSSG L++GE NGVRVF LR LVKG V          
Sbjct: 172  ----VKLMRCAVIECSVPVWSISVSSGVLVLGEDNGVRVFNLRQLVKGRV---------- 217

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMP 933
                 K  +++    +   +   LPNG+VG        + N                   
Sbjct: 218  -----KNVKDISSNGKSDGKGLKLPNGVVGDDYFHGSSSGNGC----------------- 255

Query: 932  RLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKA 753
                NGV+  + D     SVK +S R  QDS E GA F+ F  ++ E +  K+     KA
Sbjct: 256  ----NGVLDMKTD-KQYVSVKLRSVRCRQDSGEGGACFVAFKREEVEVLKPKT----SKA 306

Query: 752  ISIHAFMKNKFLVADSDGNVHILC--ASVSGPSV---MKQLSNIKEVQHIAVLPDLSESS 588
            +SI A    KF++ DS G++HILC  A V G +    M++L +  +VQ +AVLPD+S   
Sbjct: 307  VSIQALSHKKFVILDSMGDLHILCLSAPVIGSNFMAHMRRLPHSMKVQKLAVLPDISLKM 366

Query: 587  QTVWLSDGYHSVHVMAASDTNTCSNVNKSDSED---MRLSVVQTIFVSENIRDVQPLSAN 417
            QT W+SDG HSVH +  SD     N N  D      ++++V+Q IF +E I+D+ PL AN
Sbjct: 367  QTFWVSDGLHSVHTITLSDMGAAVNSNNEDETQEKLIQITVIQAIFSAEKIQDLIPLGAN 426


>gb|EOY04245.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508712349|gb|EOY04246.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 469

 Score =  219 bits (559), Expect = 3e-54
 Identities = 146/378 (38%), Positives = 217/378 (57%), Gaps = 9/378 (2%)
 Frame = -2

Query: 1583 LFVAAGPSHGGSRVLIRF-LVQSKEAAGFVPARVGCG-QRGLEFDRKSGGVLVDCSHGLK 1410
            LF+  GP  GGS+VL+RF L ++ ++  F  A+V    Q+G+EFD K G VL+D SHGLK
Sbjct: 82   LFIVGGPYKGGSKVLLRFFLFRNDDSKVFEKAKVVVSNQKGIEFDDKVG-VLIDVSHGLK 140

Query: 1409 VVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTS 1230
            V+++GSVN+FA YSAS++KVW+FGVK    D+GD       V  +L+KCA+IDC  PV S
Sbjct: 141  VMIAGSVNFFAFYSASSSKVWIFGVKLVGNDEGDDG-----VVFKLMKCAVIDCTKPVFS 195

Query: 1229 MSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRN 1050
            MSVSS  L++GE NGVRV+ LR LVKG   R                         +++ 
Sbjct: 196  MSVSSECLVLGEENGVRVWNLRELVKGKKIR-------------------------RVKY 230

Query: 1049 SNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVK 870
            S L NG++G  +G      +                    +V NG +  +I+     SVK
Sbjct: 231  SGLSNGVIGDSDGFGGGGSSSSG-----------------IVCNGYLNEKIE-KHCVSVK 272

Query: 869  PKSRRLSQDSNEWGAIFLPFNSKDEESV-ARKSPYLPKKAISIHAFMKNKFLVADSDGN- 696
             +S +  Q+S E GA F+ F  K+ + + + K P++  KAISI      KFL+ +S G+ 
Sbjct: 273  QRSGKYRQESAEEGACFVAFEQKEVKGLKSTKVPFMSMKAISIQPLSPKKFLILNSIGDL 332

Query: 695  --VHILCASVSGPSV---MKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASD 531
              +H+L  +V G ++   M+QL ++ +VQ +AVLPD+S   QTVW+SDG+H+VH+M  + 
Sbjct: 333  SVLHVLNTAV-GSNITCHMRQLPHVLKVQKLAVLPDISSRRQTVWISDGHHTVHMMDITS 391

Query: 530  TNTCSNVNKSDSEDMRLS 477
                ++  +SD + +R+S
Sbjct: 392  AVNENDERESDEKLLRIS 409


>ref|XP_004156925.1| PREDICTED: uncharacterized LOC101211683 [Cucumis sativus]
          Length = 524

 Score =  216 bits (549), Expect = 4e-53
 Identities = 161/450 (35%), Positives = 237/450 (52%), Gaps = 28/450 (6%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPARVGCGQ 1473
            +P SSA F             + LFV +GP  GGS++L+RF V       F  A V C Q
Sbjct: 65   SPCSSAAFVALQNSNSNSDTKV-LFVVSGPHKGGSQILLRFYVLEGSKL-FRRAPVVCTQ 122

Query: 1472 RGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGG 1293
            + L  D K G VLV+  HG+ V L+GSVN+FAMYS S+ K+WVF VK      GD + G 
Sbjct: 123  KDLRSDDKLG-VLVNFRHGISVRLAGSVNFFAMYSVSSMKIWVFAVKMV----GDGDDG- 176

Query: 1292 VRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLP 1113
              + L+L++CA+IDC  P+ S+++S GFL++GE NG+RV  LR  V+G  GR        
Sbjct: 177  --IGLKLMRCAVIDCCKPIWSLNISFGFLLLGEDNGIRVVNLRPFVRGR-GRKVRNL--- 230

Query: 1112 LRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNEL----NQVKVKVIAHYNA 945
                   N N     + +++ S LP+  V G +G ND+N   L    N   ++     +A
Sbjct: 231  -------NANTSSNAKREVQKSFLPHVDVCGTSGGNDLNGGSLVVSSNGFNLQASRSEDA 283

Query: 944  VGMPRLVPNGVMGAQID-----GTP----------ASSVKPKSRRLSQDSNEWGAIFLPF 810
                 L  NG +  ++D     G P           S V+P+  +L QDS+E G  F+  
Sbjct: 284  ---GSLACNGCLDGKLDKISSSGFPYMARNWVLKVPSFVRPRCIKLRQDSSE-GLYFVAL 339

Query: 809  NSKDEESVARKSPYLPKKAISIHAFMKNKFLVADSDGNVHILCASVSGPSV-----MKQL 645
              +  E + + +  +  KAISI A    K L+ DS G++H+L  + +         ++ L
Sbjct: 340  KGRGNEGL-KSAKMMSLKAISIQALSPKKILILDSVGDLHLLHIANTANGFDFSCNIRPL 398

Query: 644  SNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDTNTCSNVNK-SDSEDM---RLS 477
             ++ + Q +   PD    +QTVWLSDG HSVH+M   D ++    N  ++SE++   R+S
Sbjct: 399  PHLMKAQMLTSFPDTIIRNQTVWLSDGNHSVHIMVIPDVDSVVPENMGNESEEVLMKRIS 458

Query: 476  VVQTIFVSENIRDVQPLSANAILILGQDNL 387
            V+Q IF  E I+D+  L+ANA+LILGQ  L
Sbjct: 459  VMQAIFAGEKIQDITSLAANAVLILGQGTL 488


>ref|XP_006593724.1| PREDICTED: uncharacterized protein LOC100805793 isoform X1 [Glycine
            max] gi|571496875|ref|XP_006593725.1| PREDICTED:
            uncharacterized protein LOC100805793 isoform X2 [Glycine
            max]
          Length = 448

 Score =  211 bits (536), Expect = 1e-51
 Identities = 157/443 (35%), Positives = 232/443 (52%), Gaps = 15/443 (3%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXS--LTLFVAAGPSHGGSRVLIR-FLVQSKEAAGFVPAR-V 1485
            +PSSS+ F                 LF+ + P   G  +L+R + ++  E   F     V
Sbjct: 70   SPSSSSTFLLLQNHTNPTSSVGPTVLFIVSSPHRTG--ILLRLYRLRRLETPSFSRVTDV 127

Query: 1484 GCGQRGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDA 1305
             C  + L F+   G V+++  HG  V L+GSVNYFA+++ S+ KVWVF VK   +D G  
Sbjct: 128  LCSHKDLRFEPNLG-VVLNAKHGASVRLAGSVNYFALHALSSNKVWVFAVKD--DDDG-- 182

Query: 1304 NFGGVRVNLRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXX 1125
                    LRL++CA+I+C  PV S++V+ GFLI+GE NGVRVF LR LVKG  G+    
Sbjct: 183  -------GLRLMRCAVIECTRPVFSVNVAFGFLILGEENGVRVFGLRRLVKGRSGK---- 231

Query: 1124 XKLPLRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNA 945
                 R+G+ +          +LRN        GG  G                      
Sbjct: 232  -----RVGNSK----------QLRNG-------GGGRG---------------------- 247

Query: 944  VGMPRLVPNGVMGAQIDG-TPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPY 768
             G+  +  NG +  +++    A++VK  + +L  D+ + G+ F+     + ++ +     
Sbjct: 248  AGLEAVNCNGDLKGKMERYVVATAVKQTNVKLKHDNRDGGSCFVTLKVNEVKTKSPTKVS 307

Query: 767  LPKKAISIHAFMKNKFLVADSDGNVHILCASVSGPSV-----MKQLSNIKEVQHIAVLPD 603
            +  KAISI A  +  FL+ DS G++H+L  S SG  V     + QL +I +V+ +AVLPD
Sbjct: 308  MSIKAISIQAVSQRMFLILDSHGDLHLLSLSNSGIGVDITGNVLQLPHIMKVRSLAVLPD 367

Query: 602  LSESSQTVWLSDGYHSVHVMAASDTNTCSNVNKSDSED-----MRLSVVQTIFVSENIRD 438
            LS  SQT+W+SDG HSVH+  A D      +N++D  D     M L V++ +F SE I+D
Sbjct: 368  LSTMSQTIWISDGCHSVHMFTAMDIENA--LNEADGNDCNEKLMHLPVIRVLFSSEKIQD 425

Query: 437  VQPLSANAILILGQDNLYAYAIS 369
            +  LSAN+ILILGQ +LYAYAIS
Sbjct: 426  IISLSANSILILGQGSLYAYAIS 448


>gb|ESW23618.1| hypothetical protein PHAVU_004G062800g, partial [Phaseolus vulgaris]
          Length = 442

 Score =  196 bits (497), Expect = 4e-47
 Identities = 150/429 (34%), Positives = 223/429 (51%), Gaps = 10/429 (2%)
 Frame = -2

Query: 1652 APSSSACFFRFDXXXXXXXXSLTLFVAAGPSHGGSRVLIRFLVQSKEAAGFVPA-RVGCG 1476
            +PSSS+ F               +F+ + P    SR+L+R L + ++ + F    RV C 
Sbjct: 70   SPSSSSTFLLLQQHPSAAPA--VIFLVSSPYR--SRILLR-LYRLRDPSSFERVTRVLCL 124

Query: 1475 QRGLEFDRKSGGVLVDCSHGLKVVLSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFG 1296
             + L F +   GV++D  HG  V L+ SVNYFA+++ S+ KVWVF VK   +D G  N  
Sbjct: 125  HKDLCF-QPGLGVILDAKHGAAVRLAASVNYFALHALSSNKVWVFAVK---DDGGGGNDD 180

Query: 1295 GVRVN-LRLVKCALIDCKLPVTSMSVSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXK 1119
            G     +RL++CA+I+C  PV S+SV+ GFLI+GE NGVRVF LR LVKG  G       
Sbjct: 181  GSGSGGVRLMRCAVIECARPVFSLSVAFGFLILGEENGVRVFGLRRLVKGKSGNK----- 235

Query: 1118 LPLRLGSKENENLEGIVEIKLRNSNLPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVG 939
               R+G+ +          +LRN       VG + G                       G
Sbjct: 236  ---RVGNSK----------QLRNG------VGVRGG-----------------------G 253

Query: 938  MPRLVPNGVMGAQIDGTPASSVKPKSRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPK 759
            +     NG +  +++    ++VK    +   D  + G+ F+     +  + +     +  
Sbjct: 254  LEVANCNGDLEGKMERHGVAAVKQTHVKSKLDDRDGGSCFVVLKGNEVNTNSVTKVSMSI 313

Query: 758  KAISIHAFMKNKFLVADSDGNVHILCASVSGPSV-----MKQLSNIKEVQHIAVLPDLSE 594
            KAISI A  +  FL+ DS G++H+L  S SG  V     ++ L    +V+ I+VLPDLS 
Sbjct: 314  KAISIQAVSQRMFLILDSHGDLHLLSLSNSGVGVDITGNVRPLPRTMKVKSISVLPDLSA 373

Query: 593  SSQTVWLSDGYHSVHVMAASD-TNTCSNVNKSDSED--MRLSVVQTIFVSENIRDVQPLS 423
             SQT+W+SDGYHSVH+  A D  N  + V+ +D  +  +RL VV+ +F SE I+D+  LS
Sbjct: 374  MSQTIWISDGYHSVHMFTAMDIENALNEVDGNDCNEKLLRLPVVRVLFSSEKIQDIISLS 433

Query: 422  ANAILILGQ 396
            AN++LILGQ
Sbjct: 434  ANSVLILGQ 442


>ref|XP_002882236.1| hypothetical protein ARALYDRAFT_340395 [Arabidopsis lyrata subsp.
            lyrata] gi|297328076|gb|EFH58495.1| hypothetical protein
            ARALYDRAFT_340395 [Arabidopsis lyrata subsp. lyrata]
          Length = 487

 Score =  179 bits (454), Expect = 4e-42
 Identities = 141/416 (33%), Positives = 208/416 (50%), Gaps = 22/416 (5%)
 Frame = -2

Query: 1580 FVAAGPSHGGSRVLIRFL-VQSKEAAGFVPARVGCGQRGLEFDRKSGGVLVDCSHGLKVV 1404
            F+ AGP  GGSR+L+RF  ++  +  GFV A+V C Q+G+EFD+K G VL++ SHG+ V 
Sbjct: 98   FIVAGPYRGGSRLLLRFYGLREGKNKGFVRAKVICDQKGIEFDQKVG-VLLNLSHGVSVK 156

Query: 1403 LSGSVNYFAMYSASAAKVWVFGVKSAVEDQGDANFGGVRVNLRLVKCALIDCKLPVTSMS 1224
            + GS NYF+MYS S++K+ +FG+K   +     +   V V  +LV+C  I+C  PV S+ 
Sbjct: 157  IVGSTNYFSMYSVSSSKILIFGLKVVTDGSNCGDDDAVVV--KLVRCGEIECVRPVWSIG 214

Query: 1223 VSSGFLIMGELNGVRVFPLRLLVKGSVGRXXXXXKLPLRLGSKENENLEGIVEIKLRNSN 1044
            + SG LI+GE +GVRV  LR +VKG            L+ G K+N         +LRN +
Sbjct: 215  IFSGLLILGEDDGVRVLNLREIVKGR-----------LKKGRKDNG--------RLRNGH 255

Query: 1043 LPNGLVGGQNGRNDINKNELNQVKVKVIAHYNAVGMPRLVPNGVMGAQIDGTPASSVKPK 864
            +                       V+V    NAV     V  G++  +  G+        
Sbjct: 256  I-----------------------VEVKKKENAVH----VNKGLLSKRRQGS-------S 281

Query: 863  SRRLSQDSNEWGAIFLPFNSKDEESVARKSPYLPKKAISIHAFMKNKFLVADSDGNVHIL 684
              R+   S +  A  +  + K E  V      +  +AISI A    +FL+ DS G +H+L
Sbjct: 282  ETRMCFVSFQKNAAAVGADLKSETCVV-----MSLRAISIQALSIKRFLILDSAGYIHVL 336

Query: 683  CASVSG--------PSVMKQLSNIKEVQHIAVLPDLSESSQTVWLSDGYHSVHVMAASDT 528
               VSG           M+QL    +VQ +A+LP++S  +++ W+SDG +SVH +  SD 
Sbjct: 337  --HVSGRHSLGSNFTCDMQQLPRFMDVQKLALLPEISVGTKSFWISDGDYSVHRVTISDE 394

Query: 527  NTCSNVNKSDSEDMRL-------------SVVQTIFVSENIRDVQPLSANAILILG 399
             T S   K   ED ++             +V  TIF  E I+D+ PL  N  LILG
Sbjct: 395  ETTS---KEKDEDKKIREERPPIQSSDYGAVTHTIFSPEKIQDLVPLGGNGALILG 447

