BLASTX nr result

ID: Mentha24_contig00013217 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00013217
         (1610 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU45327.1| hypothetical protein MIMGU_mgv1a001518mg [Mimulus...   284   9e-74
ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252...   199   3e-48
ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592...   199   4e-48
ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592...   196   2e-47
ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr...   155   5e-35
ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628...   153   2e-34
ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   149   4e-33
ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr...   147   1e-32
ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr...   141   1e-30
ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301...   134   9e-29
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...   125   4e-26
ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu...   122   4e-25
ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c...   122   4e-25
ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma...   121   1e-24
ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma...   119   3e-24
ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma...   119   3e-24
gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]     117   2e-23
ref|XP_007136359.1| hypothetical protein PHAVU_009G038600g [Phas...   116   3e-23
ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma...   114   2e-22
ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun...   112   6e-22

>gb|EYU45327.1| hypothetical protein MIMGU_mgv1a001518mg [Mimulus guttatus]
          Length = 804

 Score =  284 bits (726), Expect = 9e-74
 Identities = 177/368 (48%), Positives = 230/368 (62%), Gaps = 12/368 (3%)
 Frame = +1

Query: 64   LSGGPNTMMMKEPNLMSNLTSVFDMKVSDTKHLFAE---GCIVNDVSEGAAVAVHAAEKV 234
            +SG    M       ++NLTSVF M V DT  L  E   G   NDVSE  AVAVHAAE+V
Sbjct: 327  ISGDDPNMPRIGSGTLNNLTSVFHMNVLDTSQLIGEEGSGTSQNDVSEAGAVAVHAAEEV 386

Query: 235  LASPASQDDVTEHTMVQSPKLDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVM 414
            LASPASQ+D TE      PKL+V  I+K+MH+LS LL +H+SSD CSL  E+ ETL+  M
Sbjct: 387  LASPASQEDATE----PDPKLNVPKIIKTMHNLSALLLFHLSSDTCSLDEESSETLKHTM 442

Query: 415  SNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKCEALNSCTSPN 594
            SNL + L +K  +A  TN  E K+                 IS + +   EA N     +
Sbjct: 443  SNLGSSLCEKLNRA--TNHPEPKNHVGDTSDKLGESREVFTISGNHNMANEAANPHIKLD 500

Query: 595  YLHMHKGGRDFSVPGKKE---PMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQ 765
            Y  +H+G R +S+PGKK+   P+ S LRDDL IT DDDMAKAIKKVL++NF ++E+M SQ
Sbjct: 501  YHQVHEGERTYSLPGKKDDKSPVFSPLRDDLDITSDDDMAKAIKKVLDENFHLNEDMDSQ 560

Query: 766  ALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLKAHKVDGDIERMKPELCISP---- 933
            ALLFKSLWL+AEAKLCS++YKARF+RMK  M+E KLKA + + +I +M  ++ IS     
Sbjct: 561  ALLFKSLWLDAEAKLCSITYKARFDRMKILMDETKLKAQQENENIAQMLSKVSISKPTLQ 620

Query: 934  --DPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNIL 1107
                +   A +VE SV+ARFNILKSR            + Q+E+VD +H   + A +NIL
Sbjct: 621  NISSLPEHAEDVETSVMARFNILKSR-EDNPKPLIIEKEQQNELVDGEHEGTIMARFNIL 679

Query: 1108 KSREENPS 1131
            KSR+E+ S
Sbjct: 680  KSRKESCS 687


>ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252062 [Solanum
            lycopersicum]
          Length = 1175

 Score =  199 bits (506), Expect = 3e-48
 Identities = 142/411 (34%), Positives = 206/411 (50%), Gaps = 15/411 (3%)
 Frame = +1

Query: 154  KHLFAEGCI-----VNDVSEGAAVAVHAAEKVLASPASQDDVTE---HTMVQSPKLDVQS 309
            KH   EG +     +ND  EG  VA+ AAE VL SPASQ+D  +   + M  SPKLDVQ+
Sbjct: 616  KHNLPEGYMHTGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYQMGSSPKLDVQT 675

Query: 310  IVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDX 489
            +V ++H+LSELL+     + C L  ++ +TL+  ++NL  C  KK    + T  + V + 
Sbjct: 676  LVHAIHNLSELLKSQCLPNACLLEGQDYDTLKSAITNLGACTVKK----IETKDTMVTEH 731

Query: 490  XXXXXXXXXXXCGAGMISRDPHTKCE-ALNSCTSPNYLHMHKGGRDFSVPGKKEPMVSSL 666
                          G  + +P    E A +SC   N        ++     +  P+++S 
Sbjct: 732  DTFERLKESHRSYMGTETGNPQFMEEVARDSCGLDNQPMPEDKSKNNGKKTENSPLLTSA 791

Query: 667  RDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERM 846
             DDL  + ++ + +AIKKVL +NF  DE MQ QALLFK+LWLEAEAKLCS+SYK+RF+RM
Sbjct: 792  -DDLGDSNEEQVVQAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRM 850

Query: 847  KAQMEEIKLKA-----HKVDGDIERMKPELCISPDPITMSA-PNVEASVLARFNILKSRX 1008
            K +ME+ +          V  + +        S  P T S   +V+ S++ RFNIL  R 
Sbjct: 851  KIEMEKHRFSQDLNLNSSVAPEAKNDSASKISSQSPSTSSKNVHVDYSLMERFNILNRRE 910

Query: 1009 XXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHAD 1188
                       +  S  V S   D V    NIL+ +  N SS                 D
Sbjct: 911  EKLNSSFFMKEENDSVKVGSDSEDSVTMKLNILRKQGNNFSSSFMQEKKASDIVSSDTED 970

Query: 1189 SVTARYNILKSREQNPSPVNAEEQHQSEIVEGKLADSLMTRINVLRSREEN 1341
            SV  R+NIL+ RE+N       E+   +++     DS+  R+N+LR RE+N
Sbjct: 971  SVMERFNILRRREENLKSSFMGEKKDQDVIANDAEDSVKVRLNILRQREDN 1021


>ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592566 isoform X2 [Solanum
            tuberosum]
          Length = 1166

 Score =  199 bits (505), Expect = 4e-48
 Identities = 148/415 (35%), Positives = 209/415 (50%), Gaps = 7/415 (1%)
 Frame = +1

Query: 172  GCIVNDVSEGAAVAVHAAEKVLASPASQDDVTE---HTMVQSPKLDVQSIVKSMHSLSEL 342
            G  +ND  EG  VA+ AAE VL SPASQ+D  +   + M  SPKLDVQ++V ++H+LSEL
Sbjct: 626  GLSLNDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKLDVQTLVHAIHNLSEL 685

Query: 343  LRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXX 522
            L+    ++ C L  ++++TL+  ++NL  C +KK    + T  + V              
Sbjct: 686  LKSQCLANACLLEGQDIDTLKSAITNLGACTAKK----IETKDTMVSQHDTFEKFEESRR 741

Query: 523  CGAGMISRDPHTKCE-ALNSCTSPNYLHMHKGGRDFSVPGKKEPMVSSLR--DDLHITGD 693
               G  +  P    E A +SC   N        ++    GKK    + L   DDL  + +
Sbjct: 742  SFMGTETGHPQFMEEVAWDSCGLDNQPTPEDKSKN---NGKKTENSALLTPADDLGDSNE 798

Query: 694  DDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKL 873
            + + +AIKKVL +NF  DE MQ QALLFK+LWLEAEAKLCS+SYK+RF+RMK +ME  K 
Sbjct: 799  EQVVQAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEME--KH 856

Query: 874  KAHKVDGDIERMKPELCISPDPITMS-APNVEASVLARFNILKSRXXXXXXXXXXXXKYQ 1050
            +  +V  + E        +  P T S + +++ SV+ RFNIL +R            +  
Sbjct: 857  RFSQVAPEAENDSASKITTQSPSTSSKSVHIDDSVMERFNIL-NRREEKLSSSFMKEEND 915

Query: 1051 SEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSREQ 1230
            S  V S   D V    NIL+ +  N SS                 DSV  R+NIL+ RE 
Sbjct: 916  SVKVGSDSEDSVTMRLNILRKQGNNSSSSFMQEKKASDIVSSDTEDSVMERFNILRRRED 975

Query: 1231 NPSPVNAEEQHQSEIVEGKLADSLMTRINVLRSREENSKLISVDDGKLNSYFESE 1395
            N       E+   ++V     DS+  R+N+LR RE+N          LNS F  E
Sbjct: 976  NLKSSFMGEKKDQDVVANDAEDSVKVRLNILRQREDN----------LNSSFTEE 1020


>ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592566 isoform X1 [Solanum
            tuberosum]
          Length = 1173

 Score =  196 bits (499), Expect = 2e-47
 Identities = 149/421 (35%), Positives = 210/421 (49%), Gaps = 13/421 (3%)
 Frame = +1

Query: 172  GCIVNDVSEGAAVAVHAAEKVLASPASQDDVTE---HTMVQSPKLDVQSIVKSMHSLSEL 342
            G  +ND  EG  VA+ AAE VL SPASQ+D  +   + M  SPKLDVQ++V ++H+LSEL
Sbjct: 626  GLSLNDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKLDVQTLVHAIHNLSEL 685

Query: 343  LRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXX 522
            L+    ++ C L  ++++TL+  ++NL  C +KK    + T  + V              
Sbjct: 686  LKSQCLANACLLEGQDIDTLKSAITNLGACTAKK----IETKDTMVSQHDTFEKFEESRR 741

Query: 523  CGAGMISRDPHTKCE-ALNSCTSPNYLHMHKGGRDFSVPGKKEPMVSSLR--DDLHITGD 693
               G  +  P    E A +SC   N        ++    GKK    + L   DDL  + +
Sbjct: 742  SFMGTETGHPQFMEEVAWDSCGLDNQPTPEDKSKN---NGKKTENSALLTPADDLGDSNE 798

Query: 694  DDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQME---- 861
            + + +AIKKVL +NF  DE MQ QALLFK+LWLEAEAKLCS+SYK+RF+RMK +ME    
Sbjct: 799  EQVVQAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEKHRF 858

Query: 862  --EIKLKAHKVDGDIERMKPELCISPDPITMS-APNVEASVLARFNILKSRXXXXXXXXX 1032
              E+ L +  V  + E        +  P T S + +++ SV+ RFNIL +R         
Sbjct: 859  SQELNLNS-SVAPEAENDSASKITTQSPSTSSKSVHIDDSVMERFNIL-NRREEKLSSSF 916

Query: 1033 XXXKYQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNI 1212
               +  S  V S   D V    NIL+ +  N SS                 DSV  R+NI
Sbjct: 917  MKEENDSVKVGSDSEDSVTMRLNILRKQGNNSSSSFMQEKKASDIVSSDTEDSVMERFNI 976

Query: 1213 LKSREQNPSPVNAEEQHQSEIVEGKLADSLMTRINVLRSREENSKLISVDDGKLNSYFES 1392
            L+ RE N       E+   ++V     DS+  R+N+LR RE+N          LNS F  
Sbjct: 977  LRRREDNLKSSFMGEKKDQDVVANDAEDSVKVRLNILRQREDN----------LNSSFTE 1026

Query: 1393 E 1395
            E
Sbjct: 1027 E 1027


>ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543530|gb|ESR54508.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1041

 Score =  155 bits (392), Expect = 5e-35
 Identities = 139/482 (28%), Positives = 223/482 (46%), Gaps = 35/482 (7%)
 Frame = +1

Query: 172  GCIVNDVSEGAA--VAVHAAEKVLASPASQDDVTE-----HTMVQSPKLDVQSIVKSMHS 330
            G  +N  SEG +  V +HA E VL+SP+S + V       H    +P++ V++++ +MH+
Sbjct: 579  GLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHN 638

Query: 331  LSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXX 510
            LSELL +H S+D+C L   + E L+LV++NL+ C+SK+        +S +          
Sbjct: 639  LSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSSEFIRE 698

Query: 511  XXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHK-------GGR------DFSVPG---- 639
                     +S    TK  A +    PNY H+ +        G+      DF+  G    
Sbjct: 699  FPELHEGVTVSSPKETKA-AFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAE 757

Query: 640  --KKEPMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLC 813
              K + M    +DD     DD+M +AIKKVL  NF  +E+ + Q LL+++LWLEAEA LC
Sbjct: 758  RVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALC 817

Query: 814  SMSYKARFERMKAQMEEIKLKAHKVDGDIERMK----PELCISPDPITMSAPNVEASVLA 981
            S++YKARF RMK ++E  KL   KV+    ++K     ++ +   PI   + + +  V+A
Sbjct: 818  SINYKARFNRMKIELENCKLLKAKVNKLPPQVKDDSTQDVSVHDFPIANISSHPD-DVVA 876

Query: 982  RFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXX 1161
            R  ILK +            + +S       AD V       ++ +  P+S         
Sbjct: 877  RSQILKCQ------------ESESHANQRPTADEVDNFLFEARNDQTPPTSTCSLSNATS 924

Query: 1162 XXXFGKHADSVTARYNILKSREQNPSPVNAEEQHQSEIVEGKLADSLMTRINVLRSREEN 1341
                     SV AR++ILK+R +N S  N  +Q   + V  KL ++  + +N       N
Sbjct: 925  TSKADDVEASVIARFHILKNRIENSSCSNMGDQILPQ-VAFKLFENGTSDVNTGPELHRN 983

Query: 1342 SK-----LISVDDGKLNSYFESEPQVEYGGSVTNNPSIHLLTXXXXXXEWEHVLKEDFIL 1506
            S       ++V +  LN      P++   G+      +          +WEHV KE+   
Sbjct: 984  SSNHMQDKLTVKEFHLNDAVIQSPRLNKLGN-----QLPASCYDSSSLDWEHVSKEELPA 1038

Query: 1507 KN 1512
            +N
Sbjct: 1039 QN 1040


>ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis]
          Length = 1065

 Score =  153 bits (386), Expect = 2e-34
 Identities = 143/505 (28%), Positives = 226/505 (44%), Gaps = 58/505 (11%)
 Frame = +1

Query: 172  GCIVNDVSEGAA--VAVHAAEKVLASPASQDDVTE-----HTMVQSPKLDVQSIVKSMHS 330
            G  +N  SEG +  V +HA E VL+SP+S + V       H    +P++ V++++ SMH+
Sbjct: 580  GLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISSMHN 639

Query: 331  LSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXX 510
            LSELL +H S+D+C L   + E L+LV++NL+ C+SK+        +S +          
Sbjct: 640  LSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSSEFIRE 699

Query: 511  XXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHK-------GGR------DFSVPG---- 639
                     +S    TK  A +    PNY H+ +        G+      DF+  G    
Sbjct: 700  FPELHEGVTVSSPQETKA-AFSVLNQPNYQHVQEQRSPDIAAGKKIEKCSDFTSQGGHAE 758

Query: 640  --KKEPMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLC 813
              K + M    +DD     DD+M +AIKKVL  NF  +E+ + Q LL+++LWLEAEA LC
Sbjct: 759  RVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVKEEDEKLQVLLYRNLWLEAEAALC 818

Query: 814  SMSYKARFERMKAQMEEIK-LKAHKVDGDIERMK--PELCISPD---------------- 936
            +++YKARF RMK ++E  K LKA  +  +   ++   +   SPD                
Sbjct: 819  AINYKARFNRMKIELENCKLLKAKDLSENTSELEKLSQTTFSPDLHAVNKLPPQVKDDTT 878

Query: 937  --------PITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAA 1092
                    PI  S+ + +  V+ARF ILK +            + +S       AD V  
Sbjct: 879  QDVSVRDFPIANSSSHPD-DVVARFQILKCQ------------ESKSHANQKPTADEVDN 925

Query: 1093 SYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSREQNPSPVNAEEQHQSE 1272
                 ++ +  P+S                  SV AR++ILK+R +N S  N  +Q   +
Sbjct: 926  FLFEARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNRIENSSCSNMGDQILPQ 985

Query: 1273 IVEGKLADSLMTRINVLRSREENSKL-----ISVDDGKLNSYFESEPQVEYGGSVTNNPS 1437
             V  KL ++  + +N       NS       ++V +  LN      P++   G+      
Sbjct: 986  -VAFKLFENGTSDVNTGPELHRNSSTHMQDKLTVKEFHLNDAVIQSPRLNKLGN-----Q 1039

Query: 1438 IHLLTXXXXXXEWEHVLKEDFILKN 1512
            +          +WEHV KE+   +N
Sbjct: 1040 LPASCYDSSSLDWEHVSKEELPAQN 1064


>ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera]
            gi|302143995|emb|CBI23100.3| unnamed protein product
            [Vitis vinifera]
          Length = 1167

 Score =  149 bits (376), Expect = 4e-33
 Identities = 154/547 (28%), Positives = 233/547 (42%), Gaps = 61/547 (11%)
 Frame = +1

Query: 40   STRSKGVELSGGPNTMMMKEPNLMSNLTSVFDMKVSDTKHLFAEGCIVNDVSEGAAV--A 213
            S++S  +ELS   +TM          +    + K+S    +   G  +NDVS   +    
Sbjct: 645  SSKSDNLELS---HTMRQS----FEEVKFTSERKLSSGVGVEVTGNNINDVSRDGSSHET 697

Query: 214  VHAAEKVLASPASQDDVTEHTMVQ-----SPKLDVQSIVKSMHSLSELLRYHISSDLCSL 378
             H  E +  SP S DD +     Q     +PK+DV  ++ ++  LS LL  H S +  SL
Sbjct: 698  YHLTENISCSPLSGDDASTKLTKQPASESTPKIDVHMLINTVQDLSVLLLSHCSDNAFSL 757

Query: 379  GIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHT 558
              ++ ETL+ V+ N + CL+KK  +      S                   G    D + 
Sbjct: 758  KEQDHETLKRVIDNFDACLTKKGQKIAEQGSSHFLGELPDLNKSASASWPLGKKVADANV 817

Query: 559  KCEALNSCTSPNYLHMHKGGRDFSVPGKKEPMVS---SLRDDLHITGDDDMAKAIKKVLE 729
              E    C S      HKG R  SV G K+  +S   SL +D     DD   +AI+K+L+
Sbjct: 818  --EDQFHCQSD-----HKGKRHCSVSGNKDEKLSDFVSLVNDEDTVNDDSTIQAIRKILD 870

Query: 730  QNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLK----------- 876
            +NF  +E    QALL+++LWLEAEA LCS+SY+ARF+RMK +ME+ KL+           
Sbjct: 871  KNFHDEEETDPQALLYRNLWLEAEAALCSISYRARFDRMKIEMEKFKLRKTEDLLKNTID 930

Query: 877  -----AHKVDGDI-----------ERMKPELCI--SPDPITMSAPNVEASVLARFNILKS 1002
                 + KV  DI           E   P++ I  SP+  TMS     A V+ RF+ILK 
Sbjct: 931  VEKQSSSKVSSDISMVDKFEREAQENPVPDITIEDSPNVTTMSH---AADVVDRFHILKR 987

Query: 1003 RXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKH 1182
            R              QS     K +  + +  N+  + +++ S                 
Sbjct: 988  RYENSDSLNSKDVGKQS---SCKVSHDMNSDDNLAPAAKDDHSPNIST---------STQ 1035

Query: 1183 ADSVTARYNILKSREQNPSPVNAEEQHQSEIVEGKLADS------LMTRI-NVLRSREEN 1341
            +D V AR+ ILK R    +P+NAE Q   E V+ + A        +  R+ +V    +  
Sbjct: 1036 SDDVMARFRILKCRADKSNPMNAERQQPPEEVDLEFAGKGSHWMFIKDRVEDVTLGPDLQ 1095

Query: 1342 SKLISVDDGKLNSY---FESEPQVEYGGSVTNNPSIHLLT------------XXXXXXEW 1476
              + +    + +SY   F+ E   E+     ++P I L                    +W
Sbjct: 1096 VHIANHTKDRFDSYLDDFDCEIVKEFHEHAMDDPVIQLPRSNRLQNQLPAGFSDGSSADW 1155

Query: 1477 EHVLKED 1497
            EHVLKE+
Sbjct: 1156 EHVLKEE 1162


>ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543533|gb|ESR54511.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1064

 Score =  147 bits (372), Expect = 1e-32
 Identities = 141/505 (27%), Positives = 223/505 (44%), Gaps = 58/505 (11%)
 Frame = +1

Query: 172  GCIVNDVSEGAA--VAVHAAEKVLASPASQDDVTE-----HTMVQSPKLDVQSIVKSMHS 330
            G  +N  SEG +  V +HA E VL+SP+S + V       H    +P++ V++++ +MH+
Sbjct: 579  GLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHN 638

Query: 331  LSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXX 510
            LSELL +H S+D+C L   + E L+LV++NL+ C+SK+        +S +          
Sbjct: 639  LSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSSEFIRE 698

Query: 511  XXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHK-------GGR------DFSVPG---- 639
                     +S    TK  A +    PNY H+ +        G+      DF+  G    
Sbjct: 699  FPELHEGVTVSSPKETKA-AFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAE 757

Query: 640  --KKEPMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLC 813
              K + M    +DD     DD+M +AIKKVL  NF  +E+ + Q LL+++LWLEAEA LC
Sbjct: 758  RVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALC 817

Query: 814  SMSYKARFERMKAQMEEIKLKAHK----VDGDIERMKPELCISPD--PITMSAPNVE--- 966
            S++YKARF RMK ++E  KL   K       ++E++  +   SPD   +    P V+   
Sbjct: 818  SINYKARFNRMKIELENCKLLKAKDFSENTSELEKLS-QTTFSPDLHAVNKLPPQVKDDS 876

Query: 967  ------------------ASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAA 1092
                                V+AR  ILK +            + +S       AD V  
Sbjct: 877  TQDVSVHDFPIANISSHPDDVVARSQILKCQ------------ESESHANQRPTADEVDN 924

Query: 1093 SYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSREQNPSPVNAEEQHQSE 1272
                 ++ +  P+S                  SV AR++ILK+R +N S  N  +Q   +
Sbjct: 925  FLFEARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNRIENSSCSNMGDQILPQ 984

Query: 1273 IVEGKLADSLMTRINVLRSREENSK-----LISVDDGKLNSYFESEPQVEYGGSVTNNPS 1437
             V  KL ++  + +N       NS       ++V +  LN      P++   G+      
Sbjct: 985  -VAFKLFENGTSDVNTGPELHRNSSNHMQDKLTVKEFHLNDAVIQSPRLNKLGN-----Q 1038

Query: 1438 IHLLTXXXXXXEWEHVLKEDFILKN 1512
            +          +WEHV KE+   +N
Sbjct: 1039 LPASCYDSSSLDWEHVSKEELPAQN 1063


>ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543534|gb|ESR54512.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 842

 Score =  141 bits (355), Expect = 1e-30
 Identities = 92/260 (35%), Positives = 139/260 (53%), Gaps = 26/260 (10%)
 Frame = +1

Query: 172  GCIVNDVSEGAA--VAVHAAEKVLASPASQDDVTE-----HTMVQSPKLDVQSIVKSMHS 330
            G  +N  SEG +  V +HA E VL+SP+S + V       H    +P++ V++++ +MH+
Sbjct: 579  GLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHN 638

Query: 331  LSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXX 510
            LSELL +H S+D+C L   + E L+LV++NL+ C+SK+        +S +          
Sbjct: 639  LSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSSEFIRE 698

Query: 511  XXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHK-------GGR------DFSVPG---- 639
                     +S    TK  A +    PNY H+ +        G+      DF+  G    
Sbjct: 699  FPELHEGVTVSSPKETKA-AFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAE 757

Query: 640  --KKEPMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLC 813
              K + M    +DD     DD+M +AIKKVL  NF  +E+ + Q LL+++LWLEAEA LC
Sbjct: 758  RVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALC 817

Query: 814  SMSYKARFERMKAQMEEIKL 873
            S++YKARF RMK ++E  KL
Sbjct: 818  SINYKARFNRMKIELENCKL 837


>ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca
            subsp. vesca]
          Length = 1218

 Score =  134 bits (338), Expect = 9e-29
 Identities = 113/435 (25%), Positives = 195/435 (44%), Gaps = 18/435 (4%)
 Frame = +1

Query: 181  VNDVSEGAAVAVHAAEKVLASPASQDDVTEHTMVQSPK----LDVQSIVKSMHSLSELLR 348
            +ND  E  +      E    SP+ +D  T+ T     +    +D+Q +V  M+SLSE+L 
Sbjct: 649  INDTLECGSSHTSPIENTFCSPSVEDADTKLTTSYGEESNMNMDIQMLVNKMNSLSEVLL 708

Query: 349  YHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCG 528
             + S+  C L  ++++ L+ V++NLN+C+ K D   L+  +S                  
Sbjct: 709  VNCSNSSCQLKKKDIDALKAVINNLNSCILKHDEDFLSMPESPPIQQSTIKYIEELCKPN 768

Query: 529  AGMISRDPHTKCEALNSCTSPNYLH-MHKGGRDFSVPGKKEPMVSSL--RDDLHITGDDD 699
              +    P        S   P +L  + K     ++    + ++SS+  + D+     ++
Sbjct: 769  KALSPDMPQLTKIFAPSIQDPLHLQGVQKVKNHDNLVKNDDEVISSVSAKSDIDFVKQEE 828

Query: 700  MAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLKA 879
            M + IKK+L +NF  D+    Q LL+K+LWLEAEA +CS +YKARF R+K +ME+ K   
Sbjct: 829  MTQDIKKILSENFHTDDT-HPQTLLYKNLWLEAEAVICSTNYKARFNRLKTEMEKCKADQ 887

Query: 880  HK-----VDGDIERMKPELCISPDPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXK 1044
             K         + + + E+C++ +P+      V+ S L + N+ +S              
Sbjct: 888  SKDVFEHTADMMTQSRSEVCVNSNPVEKLTSEVQGSPLPKLNLQESPTL----------- 936

Query: 1045 YQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSR 1224
                   ++  D V A +++L++R EN SS                 D V     +    
Sbjct: 937  -------TQGDDNVMARFHVLRNRIENLSSVNATFGDESSSTLSLVPDKVD---EVAPEA 986

Query: 1225 EQNPSPVNAEEQHQSEIVEGKLAD---SLMTRINVLRSREENSKLIS---VDDGKLNSYF 1386
            +  PSP  + +   +  + G   D   S+M R +++R R ENSK IS   V+D   +S  
Sbjct: 987  DARPSPRISLQDSPTSSITGLSNDYEASVMARFHIIRDRVENSKFISDANVED-TASSKV 1045

Query: 1387 ESEPQVEYGGSVTNN 1431
              E + E G   T++
Sbjct: 1046 SREHEAEEGACETSD 1060


>ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa]
            gi|550321678|gb|EEF06077.2| hypothetical protein
            POPTR_0015s00600g [Populus trichocarpa]
          Length = 1236

 Score =  125 bits (315), Expect = 4e-26
 Identities = 109/396 (27%), Positives = 181/396 (45%), Gaps = 17/396 (4%)
 Frame = +1

Query: 208  VAVHAAEKVLASPASQDDV-TEHTMVQSP----KLDVQSIVKSMHSLSELLRYHISSDLC 372
            V  HA E+VL SP S +    +HT  Q      K+  +++V +MH+L+ELL ++ S+D C
Sbjct: 631  VPFHAIEQVLCSPPSSEHAPAQHTQSQGEESLSKMHARTLVDTMHNLAELLLFYSSNDTC 690

Query: 373  SLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDP 552
             L  E+ + L+ V++NL+ C+SK   + ++T +S +                 G +    
Sbjct: 691  ELKDEDFDVLKDVINNLDICISKNLERKISTQESLIPQQATSQFHGKLSDLYKGQLEFQ- 749

Query: 553  HTKCEALNSCTSPNYLHMHKGGRDFSVPGKKEPMVS--SLRDDLHITGDDDMAKAIKKVL 726
            H + E  +   S                 +KE + +  S R       DD+M +AIKKVL
Sbjct: 750  HFEDEEEHKIASDK---------------RKEKLSNWASTRCAADTVKDDNMTQAIKKVL 794

Query: 727  EQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLKAHKVDGDIER 906
             +NF I+E  +SQ LL+++LWLEAEA LCS++Y ARF RMK +ME    K H    + + 
Sbjct: 795  AKNFPIEEESESQILLYRNLWLEAEASLCSVNYMARFNRMKIEME----KGHSQKANEKS 850

Query: 907  MKPELCISPDPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYV 1086
            M  E         +S P V + +L   +                    S +  + H+D V
Sbjct: 851  MVLE--------NLSRPKVSSDILPADD-------KGSPVQDVSFLDSSILSRNSHSDDV 895

Query: 1087 AASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNIL-----KSREQNPSPVNA 1251
             A ++ILKSR ++ +S                +  V+   N++      +++     V+ 
Sbjct: 896  MARFHILKSRVDDSNSMSTSAVEKL------SSSKVSPDLNLVDKLACDTKDSTKPNVSI 949

Query: 1252 EEQHQSEIVE-----GKLADSLMTRINVLRSREENS 1344
            ++ H S            AD ++ R ++L+ R +NS
Sbjct: 950  QDSHMSGTSSNADDVSSHADDVIARFHILKCRVDNS 985


>ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa]
            gi|550326088|gb|EEE96055.2| hypothetical protein
            POPTR_0012s00720g [Populus trichocarpa]
          Length = 1227

 Score =  122 bits (307), Expect = 4e-25
 Identities = 124/451 (27%), Positives = 188/451 (41%), Gaps = 67/451 (14%)
 Frame = +1

Query: 208  VAVHAAEKVLASPASQDDV-TEHTMVQ----SPKLDVQSIVKSMHSLSELLRYHISSDLC 372
            V  HA E VL SP S +    +HT  Q    S K+  +++V +MH+LSELL ++ S+D C
Sbjct: 630  VPYHAIEHVLCSPPSSEHAPAQHTQSQVGESSSKMHARTLVDTMHNLSELLLFYSSNDTC 689

Query: 373  SLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDP 552
             L  E+ + L  V++NL+  +SK   +  +T +S +                    S+ P
Sbjct: 690  ELKDEDFDVLNDVINNLDIFISKNSERKNSTQESLIPRRAT---------------SQSP 734

Query: 553  HTKCEALNSCTSPNYLHMHKGGRDFSVPGKKEPMVS--SLRDDLHITGDDDMAKAIKKVL 726
                E         +    K  +  S   +KE + +  S+R       DD++ +AIKKVL
Sbjct: 735  GKLSELYKGQLEFQHFEDEKECKIVS-DERKEKLSNFVSMRGATDTVKDDNVTQAIKKVL 793

Query: 727  EQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEE-------------- 864
             QNF I E  +SQ LL+K+LWLEAEA LC ++   RF R+K ++E+              
Sbjct: 794  AQNFPIKEESESQILLYKNLWLEAEASLCVVNCMDRFNRLKIEIEKGSSQKVNEFSSAAP 853

Query: 865  ---------IKLKAHKVDGDIERMKPE---LCISPDPITMSAPNVEASVLARFNILKSRX 1008
                       L   KV  DI   + E   +   PD   +S  +    V+ARF+I+KSR 
Sbjct: 854  VVPENSMIMENLLGPKVSSDILPAEDEGSPVHNVPDSSILSRNSHSDDVMARFHIIKSRV 913

Query: 1009 XXXXXXXXXXXKYQSEIV--DSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKH 1182
                          S  V  D    D  A         +   SS               H
Sbjct: 914  DDSNSLNTSAMDLSSPKVSPDLNKVDKFA--------HDTKDSSKSHISFQDSIRGASSH 965

Query: 1183 ADSVTARYNILKSREQNPSPVN------------AEEQHQSEIV---------------- 1278
            AD+V  R++ILK R +N S VN            + +Q+Q + +                
Sbjct: 966  ADNVMDRFHILKCRVENSSSVNTATGGILASSMVSPDQNQVDKLAHDTKDSIMSYTIQDS 1025

Query: 1279 ----EGKLADSLMTRINVLRSREENSKLISV 1359
                    AD +MTR  +L  R++NS  +++
Sbjct: 1026 PMSGRSSHADDVMTRFCILNGRDDNSNSVTI 1056


>ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis]
            gi|223539484|gb|EEF41073.1| hypothetical protein
            RCOM_0756330 [Ricinus communis]
          Length = 1125

 Score =  122 bits (307), Expect = 4e-25
 Identities = 114/411 (27%), Positives = 182/411 (44%), Gaps = 33/411 (8%)
 Frame = +1

Query: 208  VAVHAAEKVLASPASQDDVTEHTM-----VQSPKLDVQSIVKSMHSLSELLRYHISSDLC 372
            V  HA E VL+SP S D  +         V + K  +++++ +M +LSELL +H+S+DLC
Sbjct: 637  VPFHAVEHVLSSPPSADSASIKLTKACGGVSTQKTYIRTVIDTMQNLSELLIFHLSNDLC 696

Query: 373  SLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXCGAG------ 534
             L  ++   L+ ++SNL  C+ K   +  +T +S + +               G      
Sbjct: 697  DLKEDDSNALKGMISNLELCMLKNVERMTSTQESIIPERDGAQLSGKSSKLQKGTNGNGF 756

Query: 535  MISRDPHTKCEALNSCTSPNYLHMHKGGRDFSVPGKKEPMVSS---LRDDLHITGDDDMA 705
            +ISR      + L    S  Y H+       S  GK +  +SS   +R    +   D M 
Sbjct: 757  LISRS-----DPLEFQYSVKYQHVQDEHNISS--GKNDETLSSYVSVRAAADMLKRDKMT 809

Query: 706  KAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLKAHK 885
            +AIK  L +NF  +E  + Q LL+K+LWLEAEA LC  S  ARF R+K++ME       K
Sbjct: 810  QAIKNALTENFHGEEETEPQVLLYKNLWLEAEASLCYASCMARFNRIKSEME-------K 862

Query: 886  VDGDIERMKPELCISPDPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVD 1065
             D +     PE C+  + ++ S  N+ +      N+L S             +  S +  
Sbjct: 863  CDSEKANGSPENCMVEEKLSKS--NIRSDPCTG-NVLASNTKGSPLPDTSIPE-SSILCT 918

Query: 1066 SKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSREQNPSPV 1245
            S HAD V A Y+ILK R ++ ++                 D +    + L S + +P P 
Sbjct: 919  SSHADDVTARYHILKYRVDSTNAVNT-----------SSLDKMLGSADKLSSSQFSPCPN 967

Query: 1246 NAE----EQHQSEIVEGKLADSL---------------MTRINVLRSREEN 1341
            N E    E+   +  +  + DSL               M R ++L+ R++N
Sbjct: 968  NVEKGVCEEKDGQKPDISIQDSLVSNTTSHLNDVEASVMARFHILKCRDDN 1018


>ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776469|gb|EOY23725.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1059

 Score =  121 bits (303), Expect = 1e-24
 Identities = 145/541 (26%), Positives = 218/541 (40%), Gaps = 71/541 (13%)
 Frame = +1

Query: 103  NLMSNLTSVFDMKVSDTKHLFAEGCIVNDVSE--GAAVAVHAAEKVLASPASQDDV-TEH 273
            NL  + T V D+++            +NDVS    + V+ HA + +  +P+S +DV T+H
Sbjct: 572  NLCRSETGVADLEMK-----------INDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKH 620

Query: 274  TMVQSPK----LDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSK 441
            T     +      +  +V +M +LSELL YH S++ C L  ++V++LE V++NL+TC+SK
Sbjct: 621  TKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSK 680

Query: 442  KDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKC-EALNSCTSPNYLHMHKGG 618
               Q   T  SE+                 G  +  P     + L+  T     H     
Sbjct: 681  NIGQE--TLLSELHK---------------GTSTGSPQVAAIDVLSQHTQVKRKHF---- 719

Query: 619  RDFSVPGKKEPMVS---SLRDDLHI-TGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSL 786
                  GKK+   S   S+R    I   +D M +AIKKVL +NF   E    Q LL+K+L
Sbjct: 720  ------GKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNL 773

Query: 787  WLEAEAKLCSMSYKARFERMKAQMEEIKLKAHKVDGDIERMKPELCISPDPITMSAPNVE 966
            WLEAEA LCS++Y AR+  MK ++E+ KL   K   D+    P+     D I+ S  + +
Sbjct: 774  WLEAEAALCSINYMARYNNMKIEIEKCKLDTEK---DLSEDTPD----EDKISRSKLSAD 826

Query: 967  ASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENP-SSXXX 1143
                 +   +                  S    S HAD V A +++LK R  N  S    
Sbjct: 827  LDTNKKLTAIAESAPTLDVSNQNFPIASS----SNHADDVTARFHVLKHRLNNSYSVHTR 882

Query: 1144 XXXXXXXXXFGKHADSVTARYNILK-----SREQNPSPVNAEEQHQSEIVEGKLADSLMT 1308
                         +D+V      +K     S +   SPV     H  ++       S+MT
Sbjct: 883  DADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDV-----EASIMT 937

Query: 1309 RINVLRSR--------EENSKLI--------------------SVDDGKLNSYFESEPQ- 1401
            R+++L+SR        E   K +                    + DDG L    ES  Q 
Sbjct: 938  RLHILKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQN 997

Query: 1402 --VEYGGSVTNNPSIHLLT----------------------XXXXXXEWEHVLKEDFILK 1509
              V+Y G  +     HL                              +WEHVLKE+   +
Sbjct: 998  QVVDYAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELSGQ 1057

Query: 1510 N 1512
            N
Sbjct: 1058 N 1058


>ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776467|gb|EOY23723.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1068

 Score =  119 bits (299), Expect = 3e-24
 Identities = 140/537 (26%), Positives = 216/537 (40%), Gaps = 67/537 (12%)
 Frame = +1

Query: 103  NLMSNLTSVFDMKVSDTKHLFAEGCIVNDVSE--GAAVAVHAAEKVLASPASQDDV-TEH 273
            NL  + T V D+++            +NDVS    + V+ HA + +  +P+S +DV T+H
Sbjct: 561  NLCRSETGVADLEMK-----------INDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKH 609

Query: 274  TMVQSPK----LDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSK 441
            T     +      +  +V +M +LSELL YH S++ C L  ++V++LE V++NL+TC+SK
Sbjct: 610  TKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSK 669

Query: 442  KDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHKGGR 621
               Q   T  SE+                   + +   T    + +    +  H     +
Sbjct: 670  NIGQE--TLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-HTQVKRK 726

Query: 622  DFSVPGKKEPMVSSLRDDLHI-TGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEA 798
             F    +K     S+R    I   +D M +AIKKVL +NF   E    Q LL+K+LWLEA
Sbjct: 727  HFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 786

Query: 799  EAKLCSMSYKARFERMKAQMEEIKLKAHKVDGDIERMKPELCISPDPITMSAPNVEASVL 978
            EA LCS++Y AR+  MK ++E+ KL   K   D+    P+     D I+ S  + +    
Sbjct: 787  EAALCSINYMARYNNMKIEIEKCKLDTEK---DLSEDTPD----EDKISRSKLSADLDTN 839

Query: 979  ARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENP-SSXXXXXXX 1155
             +   +                  S    S HAD V A +++LK R  N  S        
Sbjct: 840  KKLTAIAESAPTLDVSNQNFPIASS----SNHADDVTARFHVLKHRLNNSYSVHTRDADE 895

Query: 1156 XXXXXFGKHADSVTARYNILK-----SREQNPSPVNAEEQHQSEIVEGKLADSLMTRINV 1320
                     +D+V      +K     S +   SPV     H  ++       S+MTR+++
Sbjct: 896  LSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDV-----EASIMTRLHI 950

Query: 1321 LRSR--------EENSKLI--------------------SVDDGKLNSYFESEPQ---VE 1407
            L+SR        E   K +                    + DDG L    ES  Q   V+
Sbjct: 951  LKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQNQVVD 1010

Query: 1408 YGGSVTNNPSIHLLT----------------------XXXXXXEWEHVLKEDFILKN 1512
            Y G  +     HL                              +WEHVLKE+   +N
Sbjct: 1011 YAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELSGQN 1067


>ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674635|ref|XP_007039223.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776465|gb|EOY23721.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776468|gb|EOY23724.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1079

 Score =  119 bits (299), Expect = 3e-24
 Identities = 140/537 (26%), Positives = 216/537 (40%), Gaps = 67/537 (12%)
 Frame = +1

Query: 103  NLMSNLTSVFDMKVSDTKHLFAEGCIVNDVSE--GAAVAVHAAEKVLASPASQDDV-TEH 273
            NL  + T V D+++            +NDVS    + V+ HA + +  +P+S +DV T+H
Sbjct: 572  NLCRSETGVADLEMK-----------INDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKH 620

Query: 274  TMVQSPK----LDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSK 441
            T     +      +  +V +M +LSELL YH S++ C L  ++V++LE V++NL+TC+SK
Sbjct: 621  TKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSK 680

Query: 442  KDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHKGGR 621
               Q   T  SE+                   + +   T    + +    +  H     +
Sbjct: 681  NIGQE--TLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-HTQVKRK 737

Query: 622  DFSVPGKKEPMVSSLRDDLHI-TGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEA 798
             F    +K     S+R    I   +D M +AIKKVL +NF   E    Q LL+K+LWLEA
Sbjct: 738  HFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 797

Query: 799  EAKLCSMSYKARFERMKAQMEEIKLKAHKVDGDIERMKPELCISPDPITMSAPNVEASVL 978
            EA LCS++Y AR+  MK ++E+ KL   K   D+    P+     D I+ S  + +    
Sbjct: 798  EAALCSINYMARYNNMKIEIEKCKLDTEK---DLSEDTPD----EDKISRSKLSADLDTN 850

Query: 979  ARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYNILKSREENP-SSXXXXXXX 1155
             +   +                  S    S HAD V A +++LK R  N  S        
Sbjct: 851  KKLTAIAESAPTLDVSNQNFPIASS----SNHADDVTARFHVLKHRLNNSYSVHTRDADE 906

Query: 1156 XXXXXFGKHADSVTARYNILK-----SREQNPSPVNAEEQHQSEIVEGKLADSLMTRINV 1320
                     +D+V      +K     S +   SPV     H  ++       S+MTR+++
Sbjct: 907  LSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDV-----EASIMTRLHI 961

Query: 1321 LRSR--------EENSKLI--------------------SVDDGKLNSYFESEPQ---VE 1407
            L+SR        E   K +                    + DDG L    ES  Q   V+
Sbjct: 962  LKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQNQVVD 1021

Query: 1408 YGGSVTNNPSIHLLT----------------------XXXXXXEWEHVLKEDFILKN 1512
            Y G  +     HL                              +WEHVLKE+   +N
Sbjct: 1022 YAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELSGQN 1078


>gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]
          Length = 1159

 Score =  117 bits (292), Expect = 2e-23
 Identities = 106/371 (28%), Positives = 168/371 (45%), Gaps = 33/371 (8%)
 Frame = +1

Query: 286  SPKLDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALAT 465
            SP +DV  +V ++ +LSELL +H +S    L  +++ET++ ++ NL+ C SK   + ++T
Sbjct: 663  SPTIDVPVLVSTIRNLSELLLFHCTSGSYQLKQKDLETIQSMIDNLSVCASKNSEKTVST 722

Query: 466  NKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHKGGRDFSVPGKK 645
              S  +                    +   T    L+     N   +HKG + +    + 
Sbjct: 723  QDSTSEKYTSDYLGDKNHKGFTLNKLQVTKTAGPILDLLADQN---VHKGNKYYVAGKEN 779

Query: 646  EPMVSSL--RDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSM 819
            + ++ S+  R D+ I  +D   +A+KKVL  NF+ +E    QALL+K+LWLEAEA LCSM
Sbjct: 780  DELLDSVSVRADVDIVDEDKAIQALKKVLTDNFDYEEEASPQALLYKNLWLEAEAALCSM 839

Query: 820  SYKARFERMKAQMEEIKL-KAHKVDGDI----------ERMKPEL----CISP------- 933
            S KARF R+K +ME  KL K+    G+             + P+L     +SP       
Sbjct: 840  SCKARFNRVKLEMENPKLPKSKDAHGNTITTEMDKVSRSEVSPDLNGANTLSPKAKGCAT 899

Query: 934  ----DPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVAASYN 1101
                +   +S    +  V+ RF IL+ R               S    S H++ V     
Sbjct: 900  TKSQESSVLSTNAEDDDVMDRFQILRCRAKKSNYGIVADKDKPSSPKVSPHSNKVGKI-- 957

Query: 1102 ILKSREENPSS-----XXXXXXXXXXXXFGKHADSVTARYNILKSREQNPSPVNAEEQHQ 1266
            + ++ EE  SS                    +  SV AR++ILKSR  N SP++ + Q  
Sbjct: 958  LPEANEETGSSKPDIRRQASSNSSTDKPSNDYEASVMARFHILKSRGDNCSPLSTQGQ-L 1016

Query: 1267 SEIVEGKLADS 1299
            +E V+G    S
Sbjct: 1017 AENVDGSTIGS 1027


>ref|XP_007136359.1| hypothetical protein PHAVU_009G038600g [Phaseolus vulgaris]
            gi|561009446|gb|ESW08353.1| hypothetical protein
            PHAVU_009G038600g [Phaseolus vulgaris]
          Length = 1123

 Score =  116 bits (291), Expect = 3e-23
 Identities = 111/442 (25%), Positives = 186/442 (42%), Gaps = 38/442 (8%)
 Frame = +1

Query: 280  VQSPKLDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSKK--DVQ 453
            V + KL+VQ +V +M +LSELL YH  +D+C L   +   L+ V+SNLNTC  K     Q
Sbjct: 695  VTTEKLNVQILVNTMQNLSELLLYHCKNDVCVLKERDCNALKDVISNLNTCALKSAAPAQ 754

Query: 454  ALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKC-EALNSCTSPNYLHMHKGGRDFS 630
                N+ E  +                   R P TK    ++   +P     +   R   
Sbjct: 755  ECLFNQPETFNCARELQEFHQN----ASFKRLPSTKIGPEISKVENPLVAEANLHFRSAK 810

Query: 631  VPGKKEPMVSSLRDDLHITGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKL 810
               K    +SS R+   +T   D+ K +K+ L +NF  DE    Q  L+K+LWLEAEA+L
Sbjct: 811  PLWKLSDSISSRRETTEMTKTGDITKDLKRTLNENFHDDEGADPQTALYKNLWLEAEAEL 870

Query: 811  CSMSYKARFERMKAQMEEIKLKAHKVDGDIE-RMKPEL-------------------CIS 930
            CS+ YKAR+ ++K +M+    K  +++ + +  + P L                   C++
Sbjct: 871  CSVYYKARYNQIKIEMDNHSYKEREMENESKSEVVPTLSQNQSSETKVHNYPNRGSSCLN 930

Query: 931  -------PDPITMSAPNVEASVLARFNILKSRXXXXXXXXXXXXKYQSEIVDSKHADYVA 1089
                   P+  T    N E+SV+AR+ +LK+R            +   ++ D        
Sbjct: 931  CFTDVNKPNSATTPGRNDESSVMARYQVLKARVVDLSCIDTTNPEEPLDMADKSSPGESD 990

Query: 1090 ASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNILKSREQNPSPVNAEEQHQS 1269
              Y +    +++P                    SV AR++ILKSR +  S ++ E +   
Sbjct: 991  KQYAV-NFCQDSPFPEKN----------STDEASVVARFHILKSRREGSSSISLEGKQLD 1039

Query: 1270 --EIVEGKLADSLMTRINVLRSRE--ENSKLISVDD----GKLNSYFESEPQVEYGGSVT 1425
              E  +  + D+ + +I+  +  +  ENS ++ +       K   + + E   E     T
Sbjct: 1040 GVESADKDMDDTTIAKISEGKGLDVHENSAMVHLGSYIAMDKQEFHQDLEDSQEIQPCRT 1099

Query: 1426 NNPSIHLLTXXXXXXEWEHVLK 1491
            +   +          +WEHV K
Sbjct: 1100 SEFQLPNYYSDGFSSDWEHVEK 1121


>ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508776466|gb|EOY23722.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1017

 Score =  114 bits (284), Expect = 2e-22
 Identities = 105/366 (28%), Positives = 165/366 (45%), Gaps = 45/366 (12%)
 Frame = +1

Query: 103  NLMSNLTSVFDMKVSDTKHLFAEGCIVNDVSE--GAAVAVHAAEKVLASPASQDDV-TEH 273
            NL  + T V D+++            +NDVS    + V+ HA + +  +P+S +DV T+H
Sbjct: 572  NLCRSETGVADLEMK-----------INDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKH 620

Query: 274  TMVQSPK----LDVQSIVKSMHSLSELLRYHISSDLCSLGIENVETLELVMSNLNTCLSK 441
            T     +      +  +V +M +LSELL YH S++ C L  ++V++LE V++NL+TC+SK
Sbjct: 621  TKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSK 680

Query: 442  KDVQALATNKSEVKDXXXXXXXXXXXXCGAGMISRDPHTKCEALNSCTSPNYLHMHKGGR 621
               Q   T  SE+                   + +   T    + +    +  H     +
Sbjct: 681  NIGQE--TLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-HTQVKRK 737

Query: 622  DFSVPGKKEPMVSSLRDDLHI-TGDDDMAKAIKKVLEQNFEIDENMQSQALLFKSLWLEA 798
             F    +K     S+R    I   +D M +AIKKVL +NF   E    Q LL+K+LWLEA
Sbjct: 738  HFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 797

Query: 799  EAKLCSMSYKARFERMKAQMEEIKLKAH-----------KVDGDIERM-KPELCISPDPI 942
            EA LCS++Y AR+  MK ++E+ KL              K+  D + +   +L +  D +
Sbjct: 798  EAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRDADELSSSKLSLDSDAV 857

Query: 943  ----------------TMSAP---------NVEASVLARFNILKSRXXXXXXXXXXXXKY 1047
                            T  +P         +VEAS++ R +ILKSR            K 
Sbjct: 858  DKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKP 917

Query: 1048 QSEIVD 1065
              E+VD
Sbjct: 918  LPEVVD 923


>ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica]
            gi|462417047|gb|EMJ21784.1| hypothetical protein
            PRUPE_ppa000352mg [Prunus persica]
          Length = 1254

 Score =  112 bits (279), Expect = 6e-22
 Identities = 119/426 (27%), Positives = 181/426 (42%), Gaps = 19/426 (4%)
 Frame = +1

Query: 184  NDVSE--GAAVAVHAAEKVLASPASQDDVTEHTMVQSP----KLDVQSIVKSMHSLSELL 345
            ND  E   + V  H  E VL S A +D  T+ +         K+DVQ +V ++ +LSELL
Sbjct: 652  NDTMEYGSSHVPSHVVENVLCSSA-EDAATKLSKSNGEESMLKVDVQMLVDTLKNLSELL 710

Query: 346  RYHISSDLCSLGIENVETLELVMSNLNTCLSKKDVQALATNKSEVKDXXXXXXXXXXXXC 525
              + S+ LC L   ++ TL+ V++NL+ C+SK   +     +S                C
Sbjct: 711  LTNCSNGLCQLKKTDIATLKAVINNLHICISKNVEKWSPMQESPT-------FQQNTSQC 763

Query: 526  GAGMISRDPHTKCEALNSCTSPNYLHMHKGGRDFSVPGKKEPMVSSL--RDDLHITGDDD 699
             A +     H K  + +   S             S P  ++ ++ S+  + D+ +  +D 
Sbjct: 764  YAEL---SEHHKVLSADRPLSA------------SAPDIQDQVIGSIHVKSDIDVVKEDK 808

Query: 700  MAKAIKKVLEQNFEIDENMQSQALLFKSLWLEAEAKLCSMSYKARFERMKAQMEEIKLKA 879
            M +AIK++L +NF  +E    Q LL+K+LWLEAEA LCS++YKARF R+K +M++ K + 
Sbjct: 809  MTQAIKEILSENFHSEET-DPQVLLYKNLWLEAEAVLCSINYKARFNRVKIEMDKCKAEN 867

Query: 880  HK--VDGDIERMKPELC-ISPD-----PITMSAPNVEASVLARFNILKSRXXXXXXXXXX 1035
             K   +   + MK     +SPD     P+T  A     S +    IL             
Sbjct: 868  SKDVFEYTADMMKQSKSEVSPDSNPVNPLTPEAQGCPTSNVPDLPILSQE---------- 917

Query: 1036 XXKYQSEIVDSKHADYVAASYNILKSREENPSSXXXXXXXXXXXXFGKHADSVTARYNIL 1215
                          D V A ++IL+ R EN +S                   V     I 
Sbjct: 918  --------------DEVLARFDILRGRVENTNSINASNAAELSSKASPEPSKVE---RIA 960

Query: 1216 KSREQNPSPVNAEEQHQSEIVEGKLAD---SLMTRINVLRSREENSKLISVDDGKLNSYF 1386
                  PSP  + +        G   D   S+M R ++LR R E SK IS     +N   
Sbjct: 961  PEANGTPSPGISIQDSSISSTIGVTDDYEASVMARFHILRDRVEKSKFISA----VNMEE 1016

Query: 1387 ESEPQV 1404
             S P+V
Sbjct: 1017 PSSPKV 1022

