BLASTX nr result

ID: Mentha24_contig00023296 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00023296
         (1219 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37877.1| hypothetical protein MIMGU_mgv1a000071mg [Mimulus...   384   e-104
ref|XP_006487400.1| PREDICTED: uncharacterized protein LOC102615...   295   3e-77
ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248...   286   1e-74
ref|XP_006367335.1| PREDICTED: uncharacterized protein LOC102601...   281   5e-73
ref|XP_002305483.2| hypothetical protein POPTR_0004s17490g [Popu...   280   7e-73
ref|XP_004248871.1| PREDICTED: uncharacterized protein LOC101247...   274   6e-71
ref|XP_006423585.1| hypothetical protein CICLE_v10030126mg, part...   273   1e-70
gb|EXB36837.1| hypothetical protein L484_003222 [Morus notabilis]     271   5e-70
ref|XP_007041937.1| Urb2/Npa2, putative isoform 4 [Theobroma cac...   258   5e-66
ref|XP_007041935.1| Urb2/Npa2, putative isoform 2 [Theobroma cac...   258   5e-66
ref|XP_007041934.1| Urb2/Npa2, putative isoform 1 [Theobroma cac...   258   5e-66
ref|XP_002529253.1| conserved hypothetical protein [Ricinus comm...   253   1e-64
ref|XP_007200948.1| hypothetical protein PRUPE_ppa000049mg [Prun...   246   2e-62
ref|XP_006282534.1| hypothetical protein CARUB_v10003970mg [Caps...   245   2e-62
ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana] ...   238   4e-60
emb|CBI37935.3| unnamed protein product [Vitis vinifera]              238   4e-60
emb|CAB43850.1| hypothetical protein [Arabidopsis thaliana] gi|7...   238   4e-60
ref|XP_002869394.1| hypothetical protein ARALYDRAFT_913468 [Arab...   237   9e-60
ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208...   236   1e-59
ref|XP_006487402.1| PREDICTED: uncharacterized protein LOC102615...   230   1e-57

>gb|EYU37877.1| hypothetical protein MIMGU_mgv1a000071mg [Mimulus guttatus]
          Length = 1929

 Score =  384 bits (987), Expect = e-104
 Identities = 221/412 (53%), Positives = 279/412 (67%), Gaps = 6/412 (1%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            +ACFQGLLWG+AS       +D+ S     S   K+M RI S V   M+F    +K  F+
Sbjct: 1331 IACFQGLLWGLAST------LDNKSFRMKLSNNTKMMTRINSSVHSCMNFISFLIKASFL 1384

Query: 1038 EDNSTLDM--SAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLK 865
            ED  +  M  S   D L  R              E +SC +  +LEAFL++V  QKL LK
Sbjct: 1385 EDQPSGKMVSSGTKDVLMKRN------------LEEQSCPAISDLEAFLSQVQHQKLCLK 1432

Query: 864  KSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEF 685
            KS+LMQ FRGENAEA++FL QLF+A S ++RLN+QI   S  WSL  +VVDI++F+LLEF
Sbjct: 1433 KSLLMQIFRGENAEASFFLGQLFMACSVVVRLNMQIDLTSIPWSLFAIVVDIAQFLLLEF 1492

Query: 684  SRT-ELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQG 508
            SR+ E+P+QFA FWLDG V+FLEELG+YFP  +PSLS+DFY K+  LH + IGKCISLQ 
Sbjct: 1493 SRSEEMPHQFAFFWLDGAVKFLEELGSYFPRFDPSLSRDFYSKMTGLHLKVIGKCISLQK 1552

Query: 507  KEAKLASQERGSLTKMAGQVQSQFSRERNRLAELLEKLRMSFNTYI-KRSSEFHLLSIIE 331
            KEAKL +Q +           S  S E NRL E  E+LR+SF  Y+ K+SSE HLLS+I 
Sbjct: 1553 KEAKLDNQGK-----------SCISLETNRLDEFKERLRISFRKYMEKKSSELHLLSVIV 1601

Query: 330  SVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIE-VLTGPRR-SNVIKK 157
            +VERALVGE +G M NYEIVCGS NGGEVS+ VA  I CLDSI+E +LTG +     IK+
Sbjct: 1602 AVERALVGEQKGVMANYEIVCGSSNGGEVSSFVAGGIDCLDSILELLLTGSKHLEGTIKE 1661

Query: 156  HLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            H+++LVA LFN+ILHLQGP IF  Y  SI+  + P+SG+V LMC+E+LTKIS
Sbjct: 1662 HIQSLVACLFNVILHLQGPTIFYDYVESIKAYERPNSGSVVLMCVEILTKIS 1713


>ref|XP_006487400.1| PREDICTED: uncharacterized protein LOC102615643 isoform X1 [Citrus
            sinensis] gi|568868198|ref|XP_006487401.1| PREDICTED:
            uncharacterized protein LOC102615643 isoform X2 [Citrus
            sinensis]
          Length = 2093

 Score =  295 bits (754), Expect = 3e-77
 Identities = 185/439 (42%), Positives = 262/439 (59%), Gaps = 33/439 (7%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CF G+LWG+AS      A + + +++S  +    +++I   ++V+ DF    ++ + +
Sbjct: 1430 VSCFNGILWGLASVVNHINA-EKSDKVKSLWWKSIHISKINHSINVFSDFIGTVLRILVV 1488

Query: 1038 ED-------------NSTLDMSAHSDE----LRARTPS---DIDTDPTIC--------EA 943
            ED             NS   M   SD+    L ART S   DID D +          + 
Sbjct: 1489 EDDQPPGSSGEVSFENSNSKMERMSDKQHQILGARTCSASFDIDDDDSAIAGLGNNQSQL 1548

Query: 942  ETKSCLSFPELEAFLTEVPDQKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNL 763
            E  +C +    E  L E+      LK+  L    +G N EAA  LRQL +A SAILRLNL
Sbjct: 1549 EDVNCPANSLTEGDLIELQ----CLKRHFLGGLLKGANPEAANLLRQLLVAASAILRLNL 1604

Query: 762  QIKFNSSSWSLLPVVVDISEFVLLEFSRTE-LPNQFALFWLDGVVRFLEELGNYFPHINP 586
            QI     + SLLP+ V IS+F+LL+ + T  +P  F   WLDGV+R+LEELG++FP  NP
Sbjct: 1605 QISGTPFASSLLPISVGISKFLLLQLADTVGVPQPFTFVWLDGVLRYLEELGSHFPLTNP 1664

Query: 585  SLSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM----AGQVQSQFSRERNR 418
            +L+++ Y +LI LH RAIGKCI+LQGK+A LAS ER S TK+     G  +   S   + 
Sbjct: 1665 TLTRNMYAELIELHLRAIGKCINLQGKKATLASHERESSTKILDESVGLSEVSLSHGPHW 1724

Query: 417  LAELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSA 238
            L E   +LRMSF   I++ S+ HLLS ++++ERALVG  EG+   Y+I  GS +GG+VS+
Sbjct: 1725 LDEFKSRLRMSFKVLIQKPSDLHLLSAVQAIERALVGVQEGNTMIYQISTGSGDGGKVSS 1784

Query: 237  VVASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSD 58
             VA+ I CLD IIE   G +R NV+K+H++NL+A LFNII+HLQ P IF   + S    +
Sbjct: 1785 TVAAGIDCLDLIIEYAQGRKRLNVVKRHIQNLIAALFNIIVHLQSPIIFYEKQISCGREN 1844

Query: 57   GPDSGAVTLMCIELLTKIS 1
             PD G+V LMCIE+LT++S
Sbjct: 1845 IPDPGSVILMCIEVLTRVS 1863


>ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248664 [Vitis vinifera]
          Length = 2129

 Score =  286 bits (733), Expect = 1e-74
 Identities = 178/447 (39%), Positives = 259/447 (57%), Gaps = 41/447 (9%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVF- 1042
            ++CFQG +WG+ASA       + +  M+   +  +  +++  C++V+ DF I+F   +F 
Sbjct: 1454 VSCFQGFMWGLASAMNHIDVKECDDEMKLLKWKNEPFSKLNLCINVFTDF-IDFSLCMFL 1512

Query: 1041 IEDNSTL---------------------------DMSAHSDELRARTPS-------DIDT 964
            IED+                              D+S  + + +++T         D D+
Sbjct: 1513 IEDDQQPEGLGGAQNLSGLDQKNDCSLEPYGGENDISCANKQQKSKTARSSGSLHIDNDS 1572

Query: 963  DPTICEAETKSCLSFPELEAFLTEVPDQKLL-LKKSVLMQAFRGENAEAAYFLRQLFIAF 787
            + T  +       S      FL++V   +L  L + +L    +G+N EAA+FLR+LFIA 
Sbjct: 1573 ENTGGQEMRLQLDSAVCATNFLSDVDLFELRRLNRPLLRSLLKGDNPEAAFFLRELFIAS 1632

Query: 786  SAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFSR-TELPNQFALFWLDGVVRFLEELG 610
            SAILRLNLQI     S   +P+   IS+ +LLE +   ++P   +L WLDGV+++LEELG
Sbjct: 1633 SAILRLNLQINCIPLSSCFVPIFNGISQLLLLELANMADVPQPISLVWLDGVLKYLEELG 1692

Query: 609  NYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM----AGQVQS 442
            N FP  NP+L +D Y KLI LH +AIGKCISLQGK A LAS +  S TK      G   +
Sbjct: 1693 NQFPLTNPTLYRDVYAKLIDLHLKAIGKCISLQGKRATLASHDAESSTKTLDSHVGLSDA 1752

Query: 441  QFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGS 262
              S       E   +LRMSF  +IK+ SE HLLS I+++ERALVG  EG M  Y++  GS
Sbjct: 1753 SLSHGPYCFDEFKSRLRMSFKVFIKKPSELHLLSAIQALERALVGVQEGCMVIYDVNTGS 1812

Query: 261  LNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRY 82
             +GG+VS++ A+ I CLD ++E ++G +R +V+K+HL++L+AGLFNI+LHLQ P IF R 
Sbjct: 1813 AHGGKVSSITAAGIDCLDLVLEFVSGRKRLSVVKRHLKSLIAGLFNIVLHLQSPFIFYRK 1872

Query: 81   EHSIRNSDGPDSGAVTLMCIELLTKIS 1
                +    PD G+V LMCIE+LT+IS
Sbjct: 1873 LIHNKGQTDPDPGSVILMCIEVLTRIS 1899


>ref|XP_006367335.1| PREDICTED: uncharacterized protein LOC102601821 [Solanum tuberosum]
          Length = 2086

 Score =  281 bits (718), Expect = 5e-73
 Identities = 170/437 (38%), Positives = 254/437 (58%), Gaps = 31/437 (7%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CFQG L G+ SA        S++ +ES+S+ +K+    K C++   D   + +  +F+
Sbjct: 1426 VSCFQGFLCGLVSAMDSLDIKRSSTLIESTSHNLKM----KPCIETCADLLNSILHLLFL 1481

Query: 1038 ED-------------------NSTLDMSAHSDELRARTPSDIDTDP----TICEAETKSC 928
            E                    N  L    +     A  P+++  +     +    ++  C
Sbjct: 1482 EGDQCPQGLSSTHTAIETECCNELLAAGTYQSRDSADEPNNVKKEEHYSGSADSVQSNDC 1541

Query: 927  LS----FPELEAFLTEVPDQKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQ 760
             +    F  +E+ L  V  ++  L+KS+L    +GEN EAA+ L+ +F A SAIL+ +L 
Sbjct: 1542 KNDLQKFGGIESLLANVDFEQQYLRKSLLQGLSKGENLEAAFCLKHIFGASSAILKFSLH 1601

Query: 759  IKFNSSSWSLLPVVVDISEFVLLEFSR-TELPNQFALFWLDGVVRFLEELGNYFPHINPS 583
             K  S   +LLP+++ +S  +L +F+  +    QF+  WLDGV +F+ ELG  FP +NP 
Sbjct: 1602 TKSTSLPKNLLPILIRVSHVLLSDFANHSGSLEQFSFIWLDGVAKFIGELGKIFPLLNPL 1661

Query: 582  LSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM-AGQVQSQFSRER--NRLA 412
             S+D +VK I LH RA+GKCISLQGKEA LAS+E  S TKM +G  +   S     N L 
Sbjct: 1662 SSRDLFVKQIELHLRAMGKCISLQGKEAALASREIESSTKMLSGLPEHDLSNSHWLNHLD 1721

Query: 411  ELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVV 232
            EL  +LRMSF  ++ R+SE HLLS I+++ERALVG  E  + NYE+  GS +G +VSA V
Sbjct: 1722 ELKSRLRMSFANFVSRASELHLLSAIQAIERALVGVQEHCIINYEVTTGSSHGAKVSAYV 1781

Query: 231  ASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGP 52
            A+ I CLD I+E ++G ++  V+K+H++NLV+ L N++LHLQGP IF R     ++   P
Sbjct: 1782 AAGIDCLDVILESVSGRKKLAVVKRHIQNLVSSLLNVVLHLQGPKIFFRNHKFRKDFTEP 1841

Query: 51   DSGAVTLMCIELLTKIS 1
            D G+V LMCI +LTKIS
Sbjct: 1842 DPGSVCLMCISVLTKIS 1858


>ref|XP_002305483.2| hypothetical protein POPTR_0004s17490g [Populus trichocarpa]
            gi|550341234|gb|EEE85994.2| hypothetical protein
            POPTR_0004s17490g [Populus trichocarpa]
          Length = 2070

 Score =  280 bits (717), Expect = 7e-73
 Identities = 158/415 (38%), Positives = 254/415 (61%), Gaps = 9/415 (2%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CF G +WG+ASA   + A DS+ + +   +  +++++I  C++ + DF       +F+
Sbjct: 1427 VSCFSGFMWGLASALDHSNATDSDYKAKLLRWKCEVISKISHCINAFADFICFSFHMLFV 1486

Query: 1038 EDN------STLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQK 877
            +D+      S       SD+  +   S      T+ +  ++S  +   +   L+++   +
Sbjct: 1487 KDDLQPNHLSATGNFVKSDDRDSSLVSGDSWKVTVNKHGSQS-ENVTSIAGILSKLDSYE 1545

Query: 876  LL-LKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEF 700
             L L K  L     G++ +AA  +RQL IA SAI++LNL+ K      SL+P    IS+ 
Sbjct: 1546 CLPLNKEWLQSFLEGDHPKAAVLIRQLLIAASAIVKLNLETKCTPLLSSLVPSFTGISQV 1605

Query: 699  VLLEFSR-TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKC 523
            +LL+ +  TE+P  F+  WLDGV+++L+ELG++FP  NP+ +++ + KL+ LH +A+GKC
Sbjct: 1606 LLLKLADGTEVPKPFSFVWLDGVLKYLQELGSHFPITNPTSTRNVFSKLLELHLKALGKC 1665

Query: 522  ISLQGKEAKLASQERG-SLTKMAGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHL 346
            ISLQGKEA L S ++  S   +   + S        L E   +LRMSF + I++ SE HL
Sbjct: 1666 ISLQGKEATLTSHDKELSTNTLHSHIGSASLSHPYYLDEFKARLRMSFKSLIRKPSELHL 1725

Query: 345  LSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNV 166
            LS I+++ERALVG +EG    YEI  G+++GG+VS+ VA+ I CLD ++E ++G +R NV
Sbjct: 1726 LSAIQAIERALVGVYEGCPIIYEITTGNVDGGKVSSTVAAGIDCLDLVLEYVSGRKRLNV 1785

Query: 165  IKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            +K+++++LVA LFNIILH+Q P IF R        +GPD GAV LMC+E+LT++S
Sbjct: 1786 VKRNIQSLVAALFNIILHVQSPLIFYRIAMDSERYNGPDPGAVILMCVEVLTRVS 1840


>ref|XP_004248871.1| PREDICTED: uncharacterized protein LOC101247970 [Solanum
            lycopersicum]
          Length = 2051

 Score =  274 bits (700), Expect = 6e-71
 Identities = 172/437 (39%), Positives = 256/437 (58%), Gaps = 31/437 (7%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CFQG L G+ SA        S++ +ES+   +K+    K C++   +   + +  +F+
Sbjct: 1391 ISCFQGFLCGLVSAMDSLDIKSSSTFIESTICNLKM----KPCIETCANLLYSILHLLFL 1446

Query: 1038 EDN----------STLDMSAHSDELRARTPSDIDTDP----------------TICEAET 937
            E +          +T++    ++ L A T    D+                  ++   ++
Sbjct: 1447 EGDQCPQGLSSTHTTIETECCNELLAAGTYQSRDSADEANNVNKEEHYSGSADSLQSNDS 1506

Query: 936  KSCLS-FPELEAFLTEVPDQKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQ 760
            K+ L  F  +E+ L  V  ++  L+KS+L     GEN EAA+ L+ +F A SAIL+ +L 
Sbjct: 1507 KNDLQKFGGIESLLANVDFEQQYLRKSLLQALSIGENLEAAFCLKHIFGASSAILKFSLH 1566

Query: 759  IKFNSSSWSLLPVVVDISEFVLLEFSR-TELPNQFALFWLDGVVRFLEELGNYFPHINPS 583
             K  S   +LLP+++ +S  +L +F+  +    QF+  WLDGV +F+ ELG  FP +NP 
Sbjct: 1567 TKSTSLPKNLLPLLIRVSHVLLSDFANHSGSLEQFSFIWLDGVAKFIGELGKVFPLLNPL 1626

Query: 582  LSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM-AGQVQSQFSRER--NRLA 412
             S+D +VK I LH RA+GKCISLQGKEA LAS+E  S TKM +G  +   S     N L 
Sbjct: 1627 SSRDLFVKHIELHLRAMGKCISLQGKEATLASREIESSTKMLSGLPEHDLSNSHWLNHLD 1686

Query: 411  ELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVV 232
            EL  +LRMSF  ++ R+SE HLLS I+++ERALVG  E  + NYEI  GS +G +VSA V
Sbjct: 1687 ELKSRLRMSFANFVSRASELHLLSAIQAIERALVGVQEHCIINYEITTGSSHGAQVSAYV 1746

Query: 231  ASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGP 52
            A+ I CLD I+E ++G ++  VIK+H++NLV+ L N+ILHLQGP +F R     ++   P
Sbjct: 1747 AAGIDCLDLILESVSGRKKVAVIKRHIQNLVSSLLNVILHLQGPKMFFRNHKFRKDFAEP 1806

Query: 51   DSGAVTLMCIELLTKIS 1
            D G+V LMCI +LTKIS
Sbjct: 1807 DPGSVCLMCISVLTKIS 1823


>ref|XP_006423585.1| hypothetical protein CICLE_v10030126mg, partial [Citrus clementina]
            gi|557525519|gb|ESR36825.1| hypothetical protein
            CICLE_v10030126mg, partial [Citrus clementina]
          Length = 2119

 Score =  273 bits (698), Expect = 1e-70
 Identities = 180/439 (41%), Positives = 252/439 (57%), Gaps = 33/439 (7%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CF G+LWG+AS      A + + +++S  +    +++I   ++V+ DF    ++ + +
Sbjct: 1492 VSCFNGILWGLASVVNHINA-EKSDKVKSIWWKSIHISKINLSINVFSDFIGTVLRILVV 1550

Query: 1038 ED-------------NSTLDMSAHSDE----LRARTPS---DIDTDPTIC--------EA 943
            ED             NS   M   SD+    L ART S   DID D +          + 
Sbjct: 1551 EDDQPPGSSGEVSFENSNSKMERMSDKQHQILGARTCSASFDIDDDDSAIAGLGNNQSQL 1610

Query: 942  ETKSCLSFPELEAFLTEVPDQKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNL 763
            E  +C +    E  L E+      LK+  L    +G N EAA  LRQL +A SAILRLNL
Sbjct: 1611 EDVNCPANSLTEGDLIELQ----CLKRHFLGGLLKGANPEAANLLRQLLVAASAILRLNL 1666

Query: 762  QIKFNSSSWSLLPVVVDISEFVLLEFSRTE-LPNQFALFWLDGVVRFLEELGNYFPHINP 586
            QI     + SLLP+ V IS+F+LL+ + T  +P  F   WLDGV+R+LEELG++FP  NP
Sbjct: 1667 QISGTPFASSLLPISVGISKFLLLQLADTVGVPQPFTFVWLDGVLRYLEELGSHFPLTNP 1726

Query: 585  SLSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM----AGQVQSQFSRERNR 418
            +L+++ Y +LI LH RAIGKCI+LQGK+A LAS ER S TK+     G  +   S   + 
Sbjct: 1727 TLTRNMYAELIELHLRAIGKCINLQGKKATLASHERESSTKILDESVGLSKVSLSHGPHW 1786

Query: 417  LAELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSA 238
            L E   +LRMSF   I++ S+ HLLS ++++ERALVG  EG+   Y+I  GS +GG+VS+
Sbjct: 1787 LDEFKSRLRMSFKVLIQKPSDLHLLSAVQAIERALVGVQEGNTMIYQISTGSGDGGKVSS 1846

Query: 237  VVASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSD 58
             VA+ I CLD IIE   G            NL+A LFNII+HLQ P IF   + S    +
Sbjct: 1847 TVAAGIDCLDLIIEYAQG-----------NNLIAALFNIIVHLQSPIIFYEKQISCEREN 1895

Query: 57   GPDSGAVTLMCIELLTKIS 1
             PD G+V LMCIE+LT++S
Sbjct: 1896 IPDPGSVILMCIEVLTRVS 1914


>gb|EXB36837.1| hypothetical protein L484_003222 [Morus notabilis]
          Length = 2053

 Score =  271 bits (692), Expect = 5e-70
 Identities = 168/435 (38%), Positives = 247/435 (56%), Gaps = 29/435 (6%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CF G LWG+AS         S+ ++  S +  K    I  C++V+ +F+   +  + +
Sbjct: 1406 ISCFSGFLWGLASVMKQTDVRSSDHKVILSWWKEKSNTEINLCINVFEEFSSLLLGVMLL 1465

Query: 1038 EDNSTLDMSAHSDEL-RARTPSDIDT--------DPTICEAET-------------KSCL 925
             D      +  +  L  A   +DI          D   C A +             K   
Sbjct: 1466 GDAQCFQKADKNKYLVGAEQEADISCGKQQGGTGDGLTCSASSDSHDDFGTEGVAKKGIQ 1525

Query: 924  SFPELEA--FLTEVPD-QKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIK 754
            S   + A  FLT +     L L K  L     G+  EAA+ LRQL I+ SAILRLNL +K
Sbjct: 1526 SVGSISAVDFLTAIDSLDHLPLNKPFLRNLLEGDCPEAAFLLRQLLISSSAILRLNLHVK 1585

Query: 753  FNSSSWSLLPVVVDISEFVLLEFSRTELPNQFALFWLDGVVRFLEELGNYFPHINPSLSK 574
                S +L  +   IS+ +L E     +P   +  WLDGVV++LEELGN+FP  +P+LS+
Sbjct: 1586 SAHLSANLTQMFTGISQILLSELVDKNVPQPLSFVWLDGVVKYLEELGNHFPVTDPTLSR 1645

Query: 573  DFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM----AGQVQSQFSRERNRLAEL 406
            + YVK++ L  R +GKCI+LQGK A LAS E  + TK+     G  Q     +   + E 
Sbjct: 1646 NLYVKMVELQLRTLGKCIALQGKRATLASHETEASTKLLYGHLGLSQESLPCKPCGVDEF 1705

Query: 405  LEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVAS 226
              ++R+SF  +IK+ SE HLLS ++++ERALVG  E S  +Y+I  GS NGG+VS++VA+
Sbjct: 1706 KSRVRLSFTEFIKKPSELHLLSAVQAIERALVGMRERSTVSYDIQTGSPNGGKVSSIVAA 1765

Query: 225  AIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDS 46
            A+ CLD ++E ++G +R +V+K+H+++L+AG+FNIILHLQ P IF  YE  I +S  PD 
Sbjct: 1766 ALDCLDLVLEFVSGRKRLSVVKRHIQSLIAGVFNIILHLQSPLIF--YERLIGDSI-PDP 1822

Query: 45   GAVTLMCIELLTKIS 1
            GAV LMC+E+L +IS
Sbjct: 1823 GAVILMCVEVLIRIS 1837


>ref|XP_007041937.1| Urb2/Npa2, putative isoform 4 [Theobroma cacao]
            gi|508705872|gb|EOX97768.1| Urb2/Npa2, putative isoform 4
            [Theobroma cacao]
          Length = 1533

 Score =  258 bits (658), Expect = 5e-66
 Identities = 167/414 (40%), Positives = 242/414 (58%), Gaps = 8/414 (1%)
 Frame = -2

Query: 1218 LACFQGLLWGIASA--SGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTV 1045
            ++CF G LWG+ASA   GD ++ + N++     +  + ++++  C++V++DF I+ V  +
Sbjct: 1056 ISCFGGFLWGLASALNQGDEKSGEVNAKY--LRWKCEPLSKLNICINVFLDF-ISEVFHM 1112

Query: 1044 FIEDNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPE-LEAFLTEVPDQKLLL 868
            F+ DN     S +                   +AE+   L +   L  F T++ +   L 
Sbjct: 1113 FL-DNDQQSRSYY-------------------DAESSQKLDYSRHLLVFETDLVELHYL- 1151

Query: 867  KKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLE 688
             K  L    +G++ + A  LR L I  SAI RLNL+I   S S  ++P+ + IS+ +LLE
Sbjct: 1152 NKHFLQGLLKGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLE 1211

Query: 687  FSRT-ELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQ 511
             + + E+P  F   WLDG V++LEELG++FP  +P+L+ + Y KLI L  RAIGKCISLQ
Sbjct: 1212 LANSGEIPPPFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQ 1271

Query: 510  GKEAKLASQERGSLTKM----AGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLL 343
            GK A L S ER S TK+     G  +S  S   + L E   +LRMSF  +IK  SE  LL
Sbjct: 1272 GKRATLESHERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLL 1331

Query: 342  SIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVI 163
            S ++++ERALVG   G    Y+I  GS NGG VS+ VA+ I CLD I+E  +G R   V+
Sbjct: 1332 SAMQAIERALVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVV 1391

Query: 162  KKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            K+H+++LVA LFNIILHLQ P IF     S      PD+G+V LMC E+LT+++
Sbjct: 1392 KRHIQSLVAALFNIILHLQSPLIFYGKFVSNEGDRNPDAGSVVLMCAEVLTRVA 1445


>ref|XP_007041935.1| Urb2/Npa2, putative isoform 2 [Theobroma cacao]
            gi|508705870|gb|EOX97766.1| Urb2/Npa2, putative isoform 2
            [Theobroma cacao]
          Length = 2065

 Score =  258 bits (658), Expect = 5e-66
 Identities = 167/414 (40%), Positives = 242/414 (58%), Gaps = 8/414 (1%)
 Frame = -2

Query: 1218 LACFQGLLWGIASA--SGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTV 1045
            ++CF G LWG+ASA   GD ++ + N++     +  + ++++  C++V++DF I+ V  +
Sbjct: 1446 ISCFGGFLWGLASALNQGDEKSGEVNAKY--LRWKCEPLSKLNICINVFLDF-ISEVFHM 1502

Query: 1044 FIEDNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPE-LEAFLTEVPDQKLLL 868
            F+ DN     S +                   +AE+   L +   L  F T++ +   L 
Sbjct: 1503 FL-DNDQQSRSYY-------------------DAESSQKLDYSRHLLVFETDLVELHYL- 1541

Query: 867  KKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLE 688
             K  L    +G++ + A  LR L I  SAI RLNL+I   S S  ++P+ + IS+ +LLE
Sbjct: 1542 NKHFLQGLLKGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLE 1601

Query: 687  FSRT-ELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQ 511
             + + E+P  F   WLDG V++LEELG++FP  +P+L+ + Y KLI L  RAIGKCISLQ
Sbjct: 1602 LANSGEIPPPFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQ 1661

Query: 510  GKEAKLASQERGSLTKM----AGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLL 343
            GK A L S ER S TK+     G  +S  S   + L E   +LRMSF  +IK  SE  LL
Sbjct: 1662 GKRATLESHERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLL 1721

Query: 342  SIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVI 163
            S ++++ERALVG   G    Y+I  GS NGG VS+ VA+ I CLD I+E  +G R   V+
Sbjct: 1722 SAMQAIERALVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVV 1781

Query: 162  KKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            K+H+++LVA LFNIILHLQ P IF     S      PD+G+V LMC E+LT+++
Sbjct: 1782 KRHIQSLVAALFNIILHLQSPLIFYGKFVSNEGDRNPDAGSVVLMCAEVLTRVA 1835


>ref|XP_007041934.1| Urb2/Npa2, putative isoform 1 [Theobroma cacao]
            gi|508705869|gb|EOX97765.1| Urb2/Npa2, putative isoform 1
            [Theobroma cacao]
          Length = 2090

 Score =  258 bits (658), Expect = 5e-66
 Identities = 167/414 (40%), Positives = 242/414 (58%), Gaps = 8/414 (1%)
 Frame = -2

Query: 1218 LACFQGLLWGIASA--SGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTV 1045
            ++CF G LWG+ASA   GD ++ + N++     +  + ++++  C++V++DF I+ V  +
Sbjct: 1470 ISCFGGFLWGLASALNQGDEKSGEVNAKY--LRWKCEPLSKLNICINVFLDF-ISEVFHM 1526

Query: 1044 FIEDNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPE-LEAFLTEVPDQKLLL 868
            F+ DN     S +                   +AE+   L +   L  F T++ +   L 
Sbjct: 1527 FL-DNDQQSRSYY-------------------DAESSQKLDYSRHLLVFETDLVELHYL- 1565

Query: 867  KKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLE 688
             K  L    +G++ + A  LR L I  SAI RLNL+I   S S  ++P+ + IS+ +LLE
Sbjct: 1566 NKHFLQGLLKGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLE 1625

Query: 687  FSRT-ELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQ 511
             + + E+P  F   WLDG V++LEELG++FP  +P+L+ + Y KLI L  RAIGKCISLQ
Sbjct: 1626 LANSGEIPPPFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQ 1685

Query: 510  GKEAKLASQERGSLTKM----AGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLL 343
            GK A L S ER S TK+     G  +S  S   + L E   +LRMSF  +IK  SE  LL
Sbjct: 1686 GKRATLESHERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLL 1745

Query: 342  SIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVI 163
            S ++++ERALVG   G    Y+I  GS NGG VS+ VA+ I CLD I+E  +G R   V+
Sbjct: 1746 SAMQAIERALVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVV 1805

Query: 162  KKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            K+H+++LVA LFNIILHLQ P IF     S      PD+G+V LMC E+LT+++
Sbjct: 1806 KRHIQSLVAALFNIILHLQSPLIFYGKFVSNEGDRNPDAGSVVLMCAEVLTRVA 1859


>ref|XP_002529253.1| conserved hypothetical protein [Ricinus communis]
            gi|223531289|gb|EEF33131.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 2057

 Score =  253 bits (646), Expect = 1e-64
 Identities = 157/408 (38%), Positives = 235/408 (57%), Gaps = 7/408 (1%)
 Frame = -2

Query: 1203 GLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIEDNST 1024
            G LWG++SA      +DS+ ++E      +  ++I  C++V+ DF    +   F+ED+  
Sbjct: 1446 GFLWGVSSALNHTNKIDSD-KVEILKLNFEPSSQIGLCINVFTDFISFILHKYFVEDDRQ 1504

Query: 1023 LDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKKSVLMQA 844
               S   D      PSD  ++  + + +   C S                 L    L   
Sbjct: 1505 RGSSF--DVQNVEQPSD-RSNCVLSQLDNYKCES-----------------LNNYFLQSL 1544

Query: 843  FRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFSR-TELP 667
              G++ EAA  +RQL IA SA+L+LNLQ    +S  SL+P    IS  +LL+ +  +E+P
Sbjct: 1545 LDGDHPEAAILIRQLLIASSALLKLNLQTNCTTSLSSLVPSFFGISHVLLLKLADVSEVP 1604

Query: 666  NQFALFWLDGVVRFLEELGNYFPH-INPSLSKDFYVKLIVLHQRAIGKCISLQGKEAKLA 490
              F+L WLDGV+++L+ELG++FP  ++ + +   Y +L+ LH  A+GKCI+LQGKEA LA
Sbjct: 1605 QPFSLIWLDGVLKYLQELGSHFPSKVDSTSTVSVYTRLVELHLNALGKCITLQGKEATLA 1664

Query: 489  SQERGSLTKMA----GQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIESVE 322
            S E  S +K+     G  +S FS     L E   +LRMS    I +S E H+   I+++E
Sbjct: 1665 SHEMESSSKILSNNKGSSESSFSHTSFFLDEFKARLRMSLKVLISKSIELHMFPAIQAIE 1724

Query: 321  RALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHLRNL 142
            RALVG  EG    YEI  G+ +GG+VS+ VA+ I CLD ++E ++G R+S+V++ H++ L
Sbjct: 1725 RALVGVQEGCTMIYEIKTGTADGGKVSSTVAAGIDCLDLVLEYISGGRQSSVVRGHIQKL 1784

Query: 141  VAGLFNIILHLQGPAIF-CRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            VA LFNII+HLQ   +F  R   S+ N  GPD GAV LMC+E++T+IS
Sbjct: 1785 VAALFNIIVHLQSSLVFYVRPTGSVHN--GPDPGAVILMCVEVVTRIS 1830


>ref|XP_007200948.1| hypothetical protein PRUPE_ppa000049mg [Prunus persica]
            gi|462396348|gb|EMJ02147.1| hypothetical protein
            PRUPE_ppa000049mg [Prunus persica]
          Length = 2128

 Score =  246 bits (627), Expect = 2e-62
 Identities = 159/450 (35%), Positives = 251/450 (55%), Gaps = 44/450 (9%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++C  G LWG+A       +  S+ ++ SS   ++ ++ +  C+DV+ +F    +  +  
Sbjct: 1452 ISCISGFLWGLACFVNHTDSRSSDHKVNSSRQKLEPISELHLCIDVFAEFCSLLLPMLVC 1511

Query: 1038 E---------DNSTLDMSA-HSDELRARTPSDIDTDPTICEAETKSCLSFP---ELEAF- 901
            +         D+  L  S  ++D L     +D++TD    E   +S  +     ++ A+ 
Sbjct: 1512 DSSQQSRTLCDSQNLQKSDFNADLLGVPEGTDVETDIAGVELHDESGAAMTASSDIHAYS 1571

Query: 900  ---------------------LTEVPDQKLL--LKKSVLMQAFRGENAEAAYFLRQLFIA 790
                                 L ++ D  +L  L + +L +   G+   AA+ LRQL IA
Sbjct: 1572 GSGSVRRRRLHLEGANCAASALNDI-DSFILQSLNRPLLRRLLNGDYPGAAFLLRQLLIA 1630

Query: 789  FSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS-RTELPNQFALFWLDGVVRFLEEL 613
             SAILRL+L +     S SL+     I++ +LLE +    +P  F    LDGV+++LEE+
Sbjct: 1631 SSAILRLSLHMNSPPLSSSLVHTFTSITQVLLLESTDMNHVPCFFYFVCLDGVLKYLEEI 1690

Query: 612  GNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKMAGQV----Q 445
             N+FP  NP+LS+  Y K++ L  RA+GKCI+LQGK A L S E  S TKM        +
Sbjct: 1691 ANHFPLTNPTLSRSLYDKMVQLQLRALGKCITLQGKRATLVSHETESSTKMLHSPMEFSE 1750

Query: 444  SQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCG 265
            +  S     L EL  +LR SF  +IK+ SE HLLS ++++ERALVG  +G   +Y+I  G
Sbjct: 1751 ASLSGRPYLLDELKARLRSSFTVFIKKPSELHLLSAVQAIERALVGVRDGCTMSYDIHTG 1810

Query: 264  SLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHLRNLVAGLFNIILHLQGPAIFCR 85
            S++GG+VS+VVA+ I CLD I+E ++G +R NV+K+H+++ ++ LFN+IL+LQ P IF  
Sbjct: 1811 SVDGGKVSSVVAAGIDCLDLILEHVSGRKRLNVVKRHIQSFISSLFNVILNLQSPVIF-- 1868

Query: 84   YEHSIRN--SDGPDSGAVTLMCIELLTKIS 1
            YE SI+N     PD G + LMC+++L +IS
Sbjct: 1869 YERSIQNKGDTDPDPGTIILMCVDVLARIS 1898


>ref|XP_006282534.1| hypothetical protein CARUB_v10003970mg [Capsella rubella]
            gi|482551239|gb|EOA15432.1| hypothetical protein
            CARUB_v10003970mg [Capsella rubella]
          Length = 1963

 Score =  245 bits (626), Expect = 2e-62
 Identities = 152/410 (37%), Positives = 233/410 (56%), Gaps = 5/410 (1%)
 Frame = -2

Query: 1215 ACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIE 1036
            ACF GLLWG+ASA    R +  N +     +  +  +++   + V  +F   F + +F  
Sbjct: 1349 ACFSGLLWGLASAV-SQRDMHKNHQNTKLKWKSEQFSKLSCIIHVLSNFFEVFAQGLFFS 1407

Query: 1035 DNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKKSV 856
             +   ++  + +  R    ++   D  +C                + +  D    +KK +
Sbjct: 1408 GDRQREIQTNINWTRLFDGTEGSID-LMC--------------GDVVDTSD----VKKEI 1448

Query: 855  LMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS-R 679
            +    +G+ +E    LR L IA +AILRLNLQI   + S + + V+ +IS  +L EF+  
Sbjct: 1449 IESMMKGDTSEKVLALRHLLIASAAILRLNLQIDGITFSPTFVSVLTNISNDLLSEFADM 1508

Query: 678  TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEA 499
            +E+P +F+  WLDG V+ LEELG+ F   NPSL++D Y KLI LH + IGKCISLQGKEA
Sbjct: 1509 SEVPFEFSFIWLDGAVKVLEELGSQFCLSNPSLNRDLYSKLIELHLKVIGKCISLQGKEA 1568

Query: 498  KLASQERGSLTKM--AGQV--QSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIE 331
             L S E G  T    A QV  +   S   + L EL ++LRMSF  +I  SSE HLLS+++
Sbjct: 1569 TLESHETGFGTNAIHAKQVLLEKNQSHRLHWLDELKQRLRMSFKVFIHSSSELHLLSVVQ 1628

Query: 330  SVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHL 151
            ++ER+LVG WE     Y I  G+ +GG +    A+ + CLD I+E  TG +R NV+K+H+
Sbjct: 1629 AIERSLVGVWEVCPAIYCIQTGNRDGGRIPETAAAGLDCLDLILEHATGRKRLNVVKRHI 1688

Query: 150  RNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            + L++ +F I+ H+Q P IF  + +++  S  PD+G V LMC+E+L +I+
Sbjct: 1689 QGLISAVFGIMAHMQSPFIF--FTNTVVGSSSPDAGPVILMCVEVLIRIA 1736


>ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332660326|gb|AEE85726.1| uncharacterized protein
            AT4G30150 [Arabidopsis thaliana]
          Length = 2009

 Score =  238 bits (607), Expect = 4e-60
 Identities = 149/413 (36%), Positives = 235/413 (56%), Gaps = 8/413 (1%)
 Frame = -2

Query: 1215 ACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIE 1036
            +CF GLLWG+ASA   NR +  N +     +  +  +++   + V  +F   F + +F+ 
Sbjct: 1392 SCFSGLLWGLASAVS-NRDMQKNHQNAKLRWKSEQFSKLSRIIHVLSNFFEVFAQCLFLS 1450

Query: 1035 DNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKKSV 856
             +   ++  + +  R    ++  ++  +C                + E  D    +KK +
Sbjct: 1451 GDVQREIQTNINWTRLLDGTE-GSNGLVC--------------GDVVETSD----VKKKI 1491

Query: 855  LMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS-R 679
            +    +G+++E    L+ L IA +AILRLNLQI   + S + + V+ +IS  +L  F+  
Sbjct: 1492 IESLIKGDSSEVVLALKHLLIASAAILRLNLQIDGITFSPTFVSVLTNISNDLLSVFADM 1551

Query: 678  TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEA 499
            +E P +F+  WLDG V+ +EELG+ F   NP+L+ D Y KLI LH + IGKCISLQGKEA
Sbjct: 1552 SEAPLEFSFIWLDGAVKVVEELGSQFCLSNPTLNIDLYSKLIELHLKVIGKCISLQGKEA 1611

Query: 498  KLASQERGSLTKMAGQ----VQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIE 331
             L S E G  T          + + S   + L EL ++LRMSF  +I  SSE HLLS ++
Sbjct: 1612 TLESHETGFGTNAIHAKLVLTEKKRSHRLHWLDELKQRLRMSFKVFIHSSSELHLLSGVQ 1671

Query: 330  SVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHL 151
            ++ERALVG WE     Y I  G+ +GG +S  VA+ + CLD I+E  TG +R NV+K+H+
Sbjct: 1672 AIERALVGVWEVCPAIYCIQTGNRDGGRISETVAAGLDCLDLILEHATGRKRLNVVKRHI 1731

Query: 150  RNLVAGLFNIILHLQGPAIFCRYEHSI---RNSDGPDSGAVTLMCIELLTKIS 1
            + L++ +F I+ H+Q P IF  + +++   + S+ PDSGAV LMC+ +L +I+
Sbjct: 1732 QGLMSAVFGIMAHMQSPFIF--FSNAVVGNQGSNSPDSGAVILMCVGVLIRIA 1782


>emb|CBI37935.3| unnamed protein product [Vitis vinifera]
          Length = 1831

 Score =  238 bits (607), Expect = 4e-60
 Identities = 153/408 (37%), Positives = 220/408 (53%), Gaps = 2/408 (0%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVF- 1042
            ++CFQG +WG+ASA       + +  M+   +  +  +++  C++V+ DF I+F   +F 
Sbjct: 1264 VSCFQGFMWGLASAMNHIDVKECDDEMKLLKWKNEPFSKLNLCINVFTDF-IDFSLCMFL 1322

Query: 1041 IEDNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKK 862
            IED+   +      E+R      +  D  +C     S +   EL             L +
Sbjct: 1323 IEDDQQPEGLG---EMR------LQLDSAVCATNFLSDVDLFELRR-----------LNR 1362

Query: 861  SVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS 682
             +L    +G+N EAA+FLR+LFIA SAILRLNLQI     S   +P+   IS+ +LLE +
Sbjct: 1363 PLLRSLLKGDNPEAAFFLRELFIASSAILRLNLQINCIPLSSCFVPIFNGISQLLLLELA 1422

Query: 681  R-TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGK 505
               ++P   +L WLDGV+++LEELGN FP  NP+L +D Y KLI LH +AIGKCISLQGK
Sbjct: 1423 NMADVPQPISLVWLDGVLKYLEELGNQFPLTNPTLYRDVYAKLIDLHLKAIGKCISLQGK 1482

Query: 504  EAKLASQERGSLTKMAGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIESV 325
             A LAS +  S TK                                       L I E  
Sbjct: 1483 RATLASHDAESSTK--------------------------------------TLDIQE-- 1502

Query: 324  ERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHLRN 145
                     G M  Y++  GS +GG+VS++ A+ I CLD ++E ++G +R +V+K+HL++
Sbjct: 1503 ---------GCMVIYDVNTGSAHGGKVSSITAAGIDCLDLVLEFVSGRKRLSVVKRHLKS 1553

Query: 144  LVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            L+AGLFNI+LHLQ P IF R     +    PD G+V LMCIE+LT+IS
Sbjct: 1554 LIAGLFNIVLHLQSPFIFYRKLIHNKGQTDPDPGSVILMCIEVLTRIS 1601


>emb|CAB43850.1| hypothetical protein [Arabidopsis thaliana]
            gi|7269915|emb|CAB81008.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 1966

 Score =  238 bits (607), Expect = 4e-60
 Identities = 149/413 (36%), Positives = 235/413 (56%), Gaps = 8/413 (1%)
 Frame = -2

Query: 1215 ACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIE 1036
            +CF GLLWG+ASA   NR +  N +     +  +  +++   + V  +F   F + +F+ 
Sbjct: 1392 SCFSGLLWGLASAVS-NRDMQKNHQNAKLRWKSEQFSKLSRIIHVLSNFFEVFAQCLFLS 1450

Query: 1035 DNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKKSV 856
             +   ++  + +  R    ++  ++  +C                + E  D    +KK +
Sbjct: 1451 GDVQREIQTNINWTRLLDGTE-GSNGLVC--------------GDVVETSD----VKKKI 1491

Query: 855  LMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS-R 679
            +    +G+++E    L+ L IA +AILRLNLQI   + S + + V+ +IS  +L  F+  
Sbjct: 1492 IESLIKGDSSEVVLALKHLLIASAAILRLNLQIDGITFSPTFVSVLTNISNDLLSVFADM 1551

Query: 678  TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEA 499
            +E P +F+  WLDG V+ +EELG+ F   NP+L+ D Y KLI LH + IGKCISLQGKEA
Sbjct: 1552 SEAPLEFSFIWLDGAVKVVEELGSQFCLSNPTLNIDLYSKLIELHLKVIGKCISLQGKEA 1611

Query: 498  KLASQERGSLTKMAGQ----VQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSIIE 331
             L S E G  T          + + S   + L EL ++LRMSF  +I  SSE HLLS ++
Sbjct: 1612 TLESHETGFGTNAIHAKLVLTEKKRSHRLHWLDELKQRLRMSFKVFIHSSSELHLLSGVQ 1671

Query: 330  SVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKHL 151
            ++ERALVG WE     Y I  G+ +GG +S  VA+ + CLD I+E  TG +R NV+K+H+
Sbjct: 1672 AIERALVGVWEVCPAIYCIQTGNRDGGRISETVAAGLDCLDLILEHATGRKRLNVVKRHI 1731

Query: 150  RNLVAGLFNIILHLQGPAIFCRYEHSI---RNSDGPDSGAVTLMCIELLTKIS 1
            + L++ +F I+ H+Q P IF  + +++   + S+ PDSGAV LMC+ +L +I+
Sbjct: 1732 QGLMSAVFGIMAHMQSPFIF--FSNAVVGNQGSNSPDSGAVILMCVGVLIRIA 1782


>ref|XP_002869394.1| hypothetical protein ARALYDRAFT_913468 [Arabidopsis lyrata subsp.
            lyrata] gi|297315230|gb|EFH45653.1| hypothetical protein
            ARALYDRAFT_913468 [Arabidopsis lyrata subsp. lyrata]
          Length = 1967

 Score =  237 bits (604), Expect = 9e-60
 Identities = 153/414 (36%), Positives = 239/414 (57%), Gaps = 9/414 (2%)
 Frame = -2

Query: 1215 ACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIE 1036
            +C  GLLWG+ASA   +R +  N +     +  +  + + S + V  +F   F + +F+ 
Sbjct: 1350 SCVSGLLWGLASAVS-HRDMQKNHQNAKLRWKSEQFSNLSSIIHVLSNFFEVFAQCLFL- 1407

Query: 1035 DNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKLLLKKSV 856
                      S +++    ++I+    +  AE  + L   ++     E  D    +KK +
Sbjct: 1408 ----------SGDVQQEIQTNINWTRLLDGAEGSNGLVCGDV----VETND----VKKKI 1449

Query: 855  LMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLEFS-R 679
            +    +G+++E    LR L IA +AILRLNLQI   + S + + V+ +IS  +L  F+  
Sbjct: 1450 IESLIKGDSSEVVLALRHLLIASAAILRLNLQIDGIAFSPTFVSVLSNISNDLLSVFADM 1509

Query: 678  TELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQGKEA 499
            +E   +F+  WLDG V+ +EELG+ F   NP+L+ D Y KLI LH + IGKCISLQGKEA
Sbjct: 1510 SEASLEFSFIWLDGAVKVVEELGSQFCLSNPTLNIDLYSKLIELHLKVIGKCISLQGKEA 1569

Query: 498  KLASQERGSLT-----KMAGQVQSQFSRERNRLAELLEKLRMSFNTYIKRSSEFHLLSII 334
             L S E G  T     K+    ++Q S   + L EL ++LRMSF  +I+ SSE HLLS +
Sbjct: 1570 TLESHETGFGTNAIHAKLVLSAKNQ-SHRLHWLDELKQRLRMSFKVFIQSSSELHLLSGV 1628

Query: 333  ESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNVIKKH 154
            +++ERALVG WE     Y I  G+ +GG +S  VA+ + CLD I+E  TG +R NV+K+H
Sbjct: 1629 QAIERALVGVWEVCPAIYSIQTGNRDGGRISETVAAGLDCLDLILEHATGRKRLNVVKRH 1688

Query: 153  LRNLVAGLFNIILHLQGPAIFCRYEHSI---RNSDGPDSGAVTLMCIELLTKIS 1
            ++ L++ +F I+ H+Q P IF  + +++   + S  PDSG+V LMC+E+L +I+
Sbjct: 1689 IQGLLSAVFGIMAHMQSPFIF--FTNAVVGNQGSSSPDSGSVILMCVEVLIRIA 1740


>ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208263 [Cucumis sativus]
          Length = 1981

 Score =  236 bits (602), Expect = 1e-59
 Identities = 145/415 (34%), Positives = 229/415 (55%), Gaps = 10/415 (2%)
 Frame = -2

Query: 1215 ACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFIE 1036
            +C  G LWG+AS          N  M S     +  + + +C++   +  +  +  +F++
Sbjct: 1343 SCLNGFLWGLASVDDHTDLRKGNHHMRSMKLKREYSSELNNCMNAISEL-LGLILEMFLD 1401

Query: 1035 DNSTLDMSAHSDELRARTPSDIDTDPTICEAETKSCLSFPELEAFLTEVPDQKL----LL 868
             +S L  +    +      S    D +   ++ +  L      +F + + D K     LL
Sbjct: 1402 RDSQLPKNLCDYQAFQDLESSYCDDDSENVSKKRKRLKLENKSSFASILNDAKSIEMQLL 1461

Query: 867  KKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNLQIKFNSSSWSLLPVVVDISEFVLLE 688
             +  L    +G   E  + L+QLF+A S ILRL+ Q      S S + +++ IS F+LLE
Sbjct: 1462 NQPFLRGLLQGSYPEVNFALKQLFLAASRILRLHKQYDTTPLSSSSMTILIGISRFLLLE 1521

Query: 687  F-SRTELPNQFALFWLDGVVRFLEELGNYFPHINPSLSKDFYVKLIVLHQRAIGKCISLQ 511
            F    ++P  F L   DGV+++LEELG+ F   +P  S++ Y +LI LH +A+GKCI LQ
Sbjct: 1522 FVDMVDVPQPFLLACFDGVLKYLEELGHLFRFADPVQSRNLYSELINLHLQAVGKCICLQ 1581

Query: 510  GKEAKLASQERGSLTKMAGQVQSQFSRERNR-----LAELLEKLRMSFNTYIKRSSEFHL 346
            GK A LAS E  S TK    +   F +E +      + E    LRMSF  +I+ ++E HL
Sbjct: 1582 GKRATLASHETESTTKT---LDGGFFKESSFPGVYCMDEFKASLRMSFKVFIREATELHL 1638

Query: 345  LSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSAVVASAIVCLDSIIEVLTGPRRSNV 166
            LS ++++ERALVG  EG  T Y +  GS +GG+ S++VA+ + CLD ++E+ +G +   V
Sbjct: 1639 LSAVQAIERALVGVQEGCTTIYGLYSGSEDGGKCSSIVAAGVECLDLVLEIFSGRKCMGV 1698

Query: 165  IKKHLRNLVAGLFNIILHLQGPAIFCRYEHSIRNSDGPDSGAVTLMCIELLTKIS 1
            IK+H+ +L AGL +I+LHLQ P IF R   ++++   PD G+V LM IE+LT++S
Sbjct: 1699 IKRHIESLTAGLLSIVLHLQSPQIFYRM-IAMKDRSDPDPGSVILMSIEVLTRVS 1752


>ref|XP_006487402.1| PREDICTED: uncharacterized protein LOC102615643 isoform X3 [Citrus
            sinensis]
          Length = 1811

 Score =  230 bits (586), Expect = 1e-57
 Identities = 153/374 (40%), Positives = 217/374 (58%), Gaps = 33/374 (8%)
 Frame = -2

Query: 1218 LACFQGLLWGIASASGDNRAVDSNSRMESSSYYVKLMARIKSCVDVYMDFAINFVKTVFI 1039
            ++CF G+LWG+AS      A + + +++S  +    +++I   ++V+ DF    ++ + +
Sbjct: 1430 VSCFNGILWGLASVVNHINA-EKSDKVKSLWWKSIHISKINHSINVFSDFIGTVLRILVV 1488

Query: 1038 ED-------------NSTLDMSAHSDE----LRARTPS---DIDTDPTIC--------EA 943
            ED             NS   M   SD+    L ART S   DID D +          + 
Sbjct: 1489 EDDQPPGSSGEVSFENSNSKMERMSDKQHQILGARTCSASFDIDDDDSAIAGLGNNQSQL 1548

Query: 942  ETKSCLSFPELEAFLTEVPDQKLLLKKSVLMQAFRGENAEAAYFLRQLFIAFSAILRLNL 763
            E  +C +    E  L E+      LK+  L    +G N EAA  LRQL +A SAILRLNL
Sbjct: 1549 EDVNCPANSLTEGDLIELQ----CLKRHFLGGLLKGANPEAANLLRQLLVAASAILRLNL 1604

Query: 762  QIKFNSSSWSLLPVVVDISEFVLLEFSRTE-LPNQFALFWLDGVVRFLEELGNYFPHINP 586
            QI     + SLLP+ V IS+F+LL+ + T  +P  F   WLDGV+R+LEELG++FP  NP
Sbjct: 1605 QISGTPFASSLLPISVGISKFLLLQLADTVGVPQPFTFVWLDGVLRYLEELGSHFPLTNP 1664

Query: 585  SLSKDFYVKLIVLHQRAIGKCISLQGKEAKLASQERGSLTKM----AGQVQSQFSRERNR 418
            +L+++ Y +LI LH RAIGKCI+LQGK+A LAS ER S TK+     G  +  FS   + 
Sbjct: 1665 TLTRNMYAELIELHLRAIGKCINLQGKKATLASHERESSTKILDESVGLSEVSFSHGPHW 1724

Query: 417  LAELLEKLRMSFNTYIKRSSEFHLLSIIESVERALVGEWEGSMTNYEIVCGSLNGGEVSA 238
            L +   +LRMSF   I++ S  HLLS ++++ERALVG  EG+ T Y+I  GS +GG+VS+
Sbjct: 1725 LDDFKSRLRMSFKVLIQKPSYLHLLSAVQAIERALVGVQEGNTTIYQISTGSGDGGKVSS 1784

Query: 237  VVASAIVCLDSIIE 196
             VA+ I CLD IIE
Sbjct: 1785 TVAAGIDCLDLIIE 1798

