BLASTX nr result

ID: Catharanthus23_contig00023067 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00023067
         (1348 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343429.1| PREDICTED: uncharacterized protein LOC102582...   346   1e-92
ref|XP_002268828.1| PREDICTED: uncharacterized protein LOC100260...   346   1e-92
emb|CBI17457.3| unnamed protein product [Vitis vinifera]              342   2e-91
ref|XP_004289121.1| PREDICTED: uncharacterized protein LOC101309...   336   2e-89
gb|EMJ05399.1| hypothetical protein PRUPE_ppa003533mg [Prunus pe...   336   2e-89
gb|EOX90985.1| Uncharacterized protein isoform 1 [Theobroma cacao]    330   6e-88
ref|XP_004234605.1| PREDICTED: uncharacterized protein LOC101246...   330   8e-88
ref|XP_002521520.1| conserved hypothetical protein [Ricinus comm...   330   8e-88
gb|EOX90986.1| Uncharacterized protein isoform 2 [Theobroma cacao]    326   2e-86
ref|XP_006380773.1| hypothetical protein POPTR_0007s13390g [Popu...   324   6e-86
ref|XP_002310271.2| hypothetical protein POPTR_0007s13390g [Popu...   322   3e-85
ref|XP_002334039.1| predicted protein [Populus trichocarpa]           320   7e-85
gb|EPS62368.1| hypothetical protein M569_12423, partial [Genlise...   320   1e-84
ref|XP_004169392.1| PREDICTED: uncharacterized LOC101204887 [Cuc...   318   4e-84
ref|XP_004149986.1| PREDICTED: uncharacterized protein LOC101204...   318   4e-84
ref|XP_006466852.1| PREDICTED: uncharacterized protein LOC102618...   312   2e-82
ref|XP_006425615.1| hypothetical protein CICLE_v10025250mg [Citr...   310   1e-81
gb|ESW28079.1| hypothetical protein PHAVU_003G257200g [Phaseolus...   308   4e-81
ref|XP_002876990.1| At3g26750 [Arabidopsis lyrata subsp. lyrata]...   303   8e-80
ref|NP_189310.2| uncharacterized protein [Arabidopsis thaliana] ...   303   1e-79

>ref|XP_006343429.1| PREDICTED: uncharacterized protein LOC102582264 [Solanum tuberosum]
          Length = 544

 Score =  346 bits (888), Expect = 1e-92
 Identities = 194/447 (43%), Positives = 265/447 (59%), Gaps = 12/447 (2%)
 Frame = +3

Query: 42   NPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAETSL 221
            N   WRFTWEAQSH  T++L LFN  I+    C+++ V L +E+S L V F   +   S+
Sbjct: 17   NSSKWRFTWEAQSHTSTLRLILFNSNIK---SCTEITVNLSVEKSLLTVCFVEGD---SV 70

Query: 222  RVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVT--------EFGSTSDDFRPL 377
            RVP+P +LID ++PVH +  DDH+EVKL LLLPVDHP+++        E    SD F P 
Sbjct: 71   RVPVPRVLIDPEAPVHCRVFDDHVEVKLSLLLPVDHPLISGLDLSEPEEEKPDSDTFFPF 130

Query: 378  SADSDQKKLSALEEVYFYCRSCSTKLTRPLRSFKELPSVNWQDVADNWXXXXXXXXXXVG 557
            S + + KKLSA+EEV+FYCRSCSTKLT+ +R F E+PSV+WQDVADNW          + 
Sbjct: 131  SVNYEIKKLSAMEEVHFYCRSCSTKLTKGIRLFNEMPSVDWQDVADNWFGTCCCSFGGIS 190

Query: 558  EKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVNVLTQN 737
            E+LV  +AKSY+C  GVCL+   +V+ICK+DL+GCE   +     D T +          
Sbjct: 191  EQLVMQFAKSYSCTTGVCLITGASVIICKEDLVGCE---FPELKGDQTYD---------- 237

Query: 738  TPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGVSL- 914
               +++   K   L+                C E+++   K  ++ V  + D  S   + 
Sbjct: 238  ---SQVNSAKVTSLRP---------------CPEEENNGVKPNNEVVKMMIDGDSSTCIP 279

Query: 915  -QLQDAEKQVSLVTCSTNGEAKFGNCDTGCC--RPSAGSSNEQKPGMDIELLAHQKILLD 1085
             +L+D +K  SL   S+       + +TGCC    S   S E++  ++ ELL  QKI L 
Sbjct: 280  SKLKDEDKMKSLAGISSEANCDTKSHNTGCCTNNLSESFSKEKEYELNTELLDKQKIFLK 339

Query: 1086 GFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLP 1265
            G + + FM R SNLSKDV+W+EFLCP CS L+GAYPC  D + LD GVRL+K  ISTCLP
Sbjct: 340  GCLGDAFMLRHSNLSKDVKWIEFLCPKCSSLIGAYPCSSDKAPLDDGVRLYKFNISTCLP 399

Query: 1266 GFDSNNCFRRYSWERMFSIQLLESAKE 1346
                N+ FR Y+ ERMFS QLLE+A++
Sbjct: 400  VRGLNDLFREYTLERMFSRQLLEAAQD 426


>ref|XP_002268828.1| PREDICTED: uncharacterized protein LOC100260906 [Vitis vinifera]
          Length = 612

 Score =  346 bits (888), Expect = 1e-92
 Identities = 212/492 (43%), Positives = 279/492 (56%), Gaps = 52/492 (10%)
 Frame = +3

Query: 27   LNPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE 206
            L   +NPR WRFTWEAQSHIPT++L+LF+   +P  QC +LKV L  E+S LLVS+  +E
Sbjct: 5    LGTSENPRKWRFTWEAQSHIPTLRLFLFDQGTKPCIQCKNLKVDLNFERSLLLVSWFEEE 64

Query: 207  AETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGS-------TSDD 365
             E S RVP+P +L+D +SP+ F+A +DHIEVKL+LLLPVDH IV+ F S       TS  
Sbjct: 65   TEISFRVPVPRVLVDIESPISFRAMEDHIEVKLVLLLPVDHHIVSNFNSILNMSEATSQL 124

Query: 366  FRPLSA-DSDQKKLSALEEVYFYCRSCSTKLT-RPLRSFKELPSVNWQDVADNWXXXXXX 539
            F   S   SD K LS+   V+FYC+SCST LT +PL SF E+PS+NW++VADNW      
Sbjct: 125  FSMDSVFGSDIKSLSSRGGVHFYCKSCSTNLTKKPLSSFAEMPSINWREVADNWFGACCC 184

Query: 540  XXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASV 719
                + EKLV  YA SY+C    CLL+ T+V++CKDDL+G E         D  QN  S 
Sbjct: 185  SFGGISEKLVARYANSYSCGEESCLLDATSVILCKDDLVGFE-----FPDRDGDQNYESE 239

Query: 720  NVLTQNTPVNRIEQ------GKTVCLKSGSAGEQGVNGK--ESHICSE---DDDLNKKLE 866
               T++  +N   Q      G+ VC          ++GK    HI  E   D    K +E
Sbjct: 240  PDCTEDDCINEDMQDAGGNHGRCVCPTVKKEKMSDLSGKLNSLHIQKEPFVDSPGYKIIE 299

Query: 867  HD----------AVDKISDRLSGV-------SLQLQDAEKQV--------------SLVT 953
             +           V   S+ ++          + + + +K+V              S   
Sbjct: 300  KEITVPSLVGTVPVSYFSENVASAPGCCADNRIHVLNHDKEVCTPDTVSYFSENVPSAPG 359

Query: 954  CSTNGEAKFGNCDTGCCRPSAGS-SNEQKPGMDIELLAHQKILLDGFIWNGFMARTSNLS 1130
            C  +      N D   C P     S EQK     E+LA++K  L+GF+ N FMAR+ NLS
Sbjct: 360  CCADNRIHVLNHDKEVCMPDTSEISKEQKVTKASEVLANKKSFLNGFLGNIFMARSYNLS 419

Query: 1131 KDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDSNNCFRRYSWER 1310
            KDV+W++F CP CS LLGAYPC    + LDGGVRLFKC ISTCLP  +S + FR+Y+ ER
Sbjct: 420  KDVEWIKFACPQCSSLLGAYPCADGYAPLDGGVRLFKCYISTCLPVCESGDLFRKYTLER 479

Query: 1311 MFSIQLLESAKE 1346
            MF+ QLLESAK+
Sbjct: 480  MFTSQLLESAKD 491


>emb|CBI17457.3| unnamed protein product [Vitis vinifera]
          Length = 769

 Score =  342 bits (877), Expect = 2e-91
 Identities = 193/445 (43%), Positives = 264/445 (59%), Gaps = 5/445 (1%)
 Frame = +3

Query: 27   LNPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE 206
            L   +NPR WRFTWEAQSHIPT++L+LF+   +P  QC +LKV L  E+S LLVS+  +E
Sbjct: 262  LGTSENPRKWRFTWEAQSHIPTLRLFLFDQGTKPCIQCKNLKVDLNFERSLLLVSWFEEE 321

Query: 207  AETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGS----TSDDFRP 374
             E S RVP+P +L+D +SP+ F+A +DHIEVKL+LLLPVDH IV+ F S    +    + 
Sbjct: 322  TEISFRVPVPRVLVDIESPISFRAMEDHIEVKLVLLLPVDHHIVSNFNSILNMSEATSQL 381

Query: 375  LSADSDQKKLSALEEVYFYCRSCSTKLT-RPLRSFKELPSVNWQDVADNWXXXXXXXXXX 551
             S DSD K LS+   V+FYC+SCST LT +PL SF E+PS+NW++VADNW          
Sbjct: 382  FSMDSDIKSLSSRGGVHFYCKSCSTNLTKKPLSSFAEMPSINWREVADNWFGACCCSFGG 441

Query: 552  VGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVNVLT 731
            + EKLV  YA SY+C    CLL+ T+V++CKDDL+G E         D  QN  S    T
Sbjct: 442  ISEKLVARYANSYSCGEESCLLDATSVILCKDDLVGFE-----FPDRDGDQNYESEPDCT 496

Query: 732  QNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGVS 911
            ++  +N   Q         + G  G       +C         ++ + +  +S +L+ + 
Sbjct: 497  EDDCINEDMQ--------DAGGNHG-----RCVC-------PTVKKEKMSDLSGKLNSLH 536

Query: 912  LQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIELLAHQKILLDGF 1091
            +Q +  E++V+  +                                 E+LA++K  L+GF
Sbjct: 537  IQKEPFEQKVTKAS---------------------------------EVLANKKSFLNGF 563

Query: 1092 IWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPGF 1271
            + N FMAR+ NLSKDV+W++F CP CS LLGAYPC    + LDGGVRLFKC ISTCLP  
Sbjct: 564  LGNIFMARSYNLSKDVEWIKFACPQCSSLLGAYPCADGYAPLDGGVRLFKCYISTCLPVC 623

Query: 1272 DSNNCFRRYSWERMFSIQLLESAKE 1346
            +S + FR+Y+ ERMF+ QLLESAK+
Sbjct: 624  ESGDLFRKYTLERMFTSQLLESAKD 648


>ref|XP_004289121.1| PREDICTED: uncharacterized protein LOC101309100 [Fragaria vesca
            subsp. vesca]
          Length = 571

 Score =  336 bits (861), Expect = 2e-89
 Identities = 199/462 (43%), Positives = 270/462 (58%), Gaps = 22/462 (4%)
 Frame = +3

Query: 27   LNPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIR----PATQCSDLKVKLIIEQSSLLVSF 194
            L   ++   WRFTWE+QSHIPT++L+LFN        P+TQC +L V L +  S LL+++
Sbjct: 5    LETAESSTEWRFTWESQSHIPTLRLFLFNSSTNSSTNPSTQCHNLTVHLSLPHSLLLLTW 64

Query: 195  TN--DEAETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST---- 356
            T   D A  SL VP+P +L+D++SPV FKA DDHIEVKL LLLPVDH IV  F S     
Sbjct: 65   TAAADAAVVSLCVPMPRVLLDDESPVSFKALDDHIEVKLALLLPVDHSIVLNFESLLSLG 124

Query: 357  --------SDDFRPLSADSDQKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDV 509
                     +  +PLS DSD K LSA  EV+FYCRSCS KLT  PL +F E+PSVNW++V
Sbjct: 125  EMRLDEDEDNGLKPLSVDSDVKSLSARGEVHFYCRSCSYKLTASPLSTFVEMPSVNWREV 184

Query: 510  ADNWXXXXXXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCE---SLNWN 680
            ADNW          V EKLV  YA SYT   GVCL+ +T + +CKDDL+GCE   S    
Sbjct: 185  ADNWFGNCCCSFGEVSEKLVAGYANSYTSKMGVCLVSSTNITLCKDDLVGCEFPDSGVCE 244

Query: 681  HRHNDFTQNLASVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKK 860
             R N+   +       ++  P + +   +   +      E+ V+       +ED+D ++ 
Sbjct: 245  RRDNESDASGECGPKESELAPGSNVRCNE---IPKSENKEKYVDASCKSDATEDEDKSEG 301

Query: 861  LEHDAVDKISDRLSGVSLQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKP 1040
            + H   +  SD     S + +D +          + E+   N D  CC      S+ ++ 
Sbjct: 302  VPHRCSE--SDCSVSESRKERDGD--------CDDMESHAQNYDGECCSHHLSKSSSEQ- 350

Query: 1041 GMDIELLAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLD 1220
              + E+L + K LL+G++ N FM R+SNLS DV+WVEF CPHCS LLGAYPC +   ++D
Sbjct: 351  --NTEILKNHKSLLNGYLENIFMVRSSNLSVDVEWVEFFCPHCSSLLGAYPCDNGGRLID 408

Query: 1221 GGVRLFKCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            GGVRLFKC I+T LP   + + FR+Y+ ERMF+ QLLE AK+
Sbjct: 409  GGVRLFKCNIATSLPVGGTRDVFRKYTLERMFANQLLECAKD 450


>gb|EMJ05399.1| hypothetical protein PRUPE_ppa003533mg [Prunus persica]
          Length = 567

 Score =  336 bits (861), Expect = 2e-89
 Identities = 192/456 (42%), Positives = 259/456 (56%), Gaps = 20/456 (4%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAETS 218
            +N R WRFTWEAQSHIPT++L+LF+   +P+ +C  L V +   +S +LVS+T D A+ S
Sbjct: 9    ENLRKWRFTWEAQSHIPTLRLFLFDSYTKPSIKCEKLSVLIRPSESLVLVSWTED-AQVS 67

Query: 219  LRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST-----------SDD 365
            L VP+P +L+D DSPV F A DDHIEVKL+LLLPVDHPIV  F S             D 
Sbjct: 68   LSVPMPRVLVDADSPVSFSALDDHIEVKLVLLLPVDHPIVLSFDSILSLNEEKEIAFEDA 127

Query: 366  FRPLSADSDQKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXX 542
             +PL   S+ K+LS+   V+FYCR+CS KLT  PL  F E+PSVNW++VADNW       
Sbjct: 128  SKPLPLASEVKRLSSSGGVHFYCRNCSFKLTASPLSHFVEMPSVNWREVADNWFGACCCS 187

Query: 543  XXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNW-NHRHNDFTQNLASV 719
               + EKLV  YA SY C  GVCLL +T + +CK+DL+G E  +W  H   D   + +  
Sbjct: 188  FGGISEKLVARYANSYACAKGVCLLNSTNITLCKEDLVGFEFPDWGEHPRYDSESDGSGE 247

Query: 720  NVLTQNTPVNRIEQGKTVCLKS--GSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISD 893
            N  ++    +++  G  +        A  +     E   C              V K   
Sbjct: 248  NGFSE----SKLNSGSNLACNEIPRFAEVRDTYFAEDFKCE-------------VTKDES 290

Query: 894  RLSGVSLQLQDAEKQVSLVT----CSTNGEAKFGNCDTGC-CRPSAGSSNEQKPGMDIEL 1058
               G   +  ++E  V + +    C+  G       + GC    S  S  + KP   IE+
Sbjct: 291  NSEGTPHRCSESEYSVKMASTPGCCNYTGSHVQNYDEEGCRLHLSEISLEDPKPAKSIEI 350

Query: 1059 LAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLF 1238
            L + K  L+GF+ N FM R+SN+S DV+W+EF CP CS LLGAYPC   N+++DGGVRLF
Sbjct: 351  LKNHKSFLNGFLENIFMVRSSNISVDVEWIEFFCPQCSSLLGAYPCDDGNALVDGGVRLF 410

Query: 1239 KCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            KC +ST LP     N FR+Y+ E+MF+ QLLE AK+
Sbjct: 411  KCNVSTSLPVGGPTNLFRKYTLEKMFANQLLECAKD 446


>gb|EOX90985.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 564

 Score =  330 bits (847), Expect = 6e-88
 Identities = 194/463 (41%), Positives = 273/463 (58%), Gaps = 24/463 (5%)
 Frame = +3

Query: 30   NPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEA 209
            NP +NPR WRFTWEAQSH P ++L+LF+   +P+ QC  LKV L + QS +LVS+  +E 
Sbjct: 5    NP-ENPRKWRFTWEAQSHSPNLRLFLFDSQTKPSVQCKKLKVHLNLFQSQVLVSWLKEEK 63

Query: 210  E--TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST--------- 356
            E   ++RVP+P +LID++SPV F+A DDHIEVKL+LLLPV HPIV+ F S          
Sbjct: 64   EEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSRFDSVLNSSENGDD 123

Query: 357  ---SDDFRPLSADSDQKKLSALEE-VYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNW 521
                D   PL  D+D K LS++EE V+FYCR+CS +LT  PLR+F E+PS++W++VADNW
Sbjct: 124  ALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTENPLRNFVEMPSIDWREVADNW 183

Query: 522  XXXXXXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFT 701
                      + EK+VT +A SY C  GVCLL  TAV++ KDDL+ C+  N    H   +
Sbjct: 184  FGACCCSFGGISEKMVTRFANSYKCAKGVCLLSFTAVVLSKDDLVACKLYNRTQEHQPGS 243

Query: 702  QNLASVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVD 881
             + +S  VL++    +R E    +C            GK S +  ++D + K +     +
Sbjct: 244  -DFSSDCVLSEEMLSSR-ESTNDLC------------GKLSSMHLKNDSVTKNVLVAKEE 289

Query: 882  KISDRLSGVSLQLQDAEKQVSLVTCSTNGEAKFGN-CDTG-------CCRPSAGSSNEQK 1037
                +L         +E + S++ C  + E    N  D G        C     +S    
Sbjct: 290  ANGHKLFSALPVPDVSENETSVLGCCVHTENHIRNHVDEGGQHDVSETCLVDQNTS---- 345

Query: 1038 PGMDIELLAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSML 1217
                 +LLA+QK+ L+G + N FMA++ NLS D++W+EF+CP+C  LLGAYP  +  + +
Sbjct: 346  -----KLLANQKLFLNGSLGNAFMAKSYNLSMDIEWMEFVCPNCLSLLGAYPFDNGGAPI 400

Query: 1218 DGGVRLFKCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            DGGVRLFKC ISTC       + FR+YS ERMF+ QLLE+AK+
Sbjct: 401  DGGVRLFKCYISTCTSAGGLGDMFRKYSLERMFTNQLLENAKD 443


>ref|XP_004234605.1| PREDICTED: uncharacterized protein LOC101246799 [Solanum
            lycopersicum]
          Length = 553

 Score =  330 bits (846), Expect = 8e-88
 Identities = 191/456 (41%), Positives = 266/456 (58%), Gaps = 16/456 (3%)
 Frame = +3

Query: 27   LNPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE 206
            +NP  N   WRF WEAQSH  T++L LF+  I+P   C+++ V L +E+S L V F   +
Sbjct: 21   VNP--NCSKWRFKWEAQSHTSTLRLILFSSNIKP---CTEITVNLSVEKSLLTVCFVEGD 75

Query: 207  AETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVT----------EFGST 356
               ++RVP+P +LID ++PVH +  DDH+E+KL LLLPVDHP+++          E    
Sbjct: 76   ---TIRVPVPRVLIDPEAPVHCRVFDDHVEIKLALLLPVDHPLISGLDLSEPEPEEEKLD 132

Query: 357  SDDFRPLSADSDQKKLSALEEVYFYCRSCSTKLTRPLRSFKELPSVNWQDVADNWXXXXX 536
            SD   P S + + KKLSA+EEV+FYC+SCSTKLT+ +R F E+PSV+WQDVADNW     
Sbjct: 133  SDTCFPFSVNYEIKKLSAMEEVHFYCKSCSTKLTKGIRLFNEMPSVDWQDVADNWFGTCC 192

Query: 537  XXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLAS 716
                 + E+LV  +AKSY+C  GVCL+   +V+I K+DL+ CE                 
Sbjct: 193  CSFGGISEQLVMKFAKSYSCTTGVCLITGASVIIFKEDLVVCE----------------- 235

Query: 717  VNVLTQN-TPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKI-- 887
              VL ++ T  +++   K   L+                C E+ + N    H+ V K+  
Sbjct: 236  FPVLKRDQTYDSQLNSAKMTSLRP---------------CPEEKN-NGVKPHNVVVKMMI 279

Query: 888  -SDRLSGVSLQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGS--SNEQKPGMDIEL 1058
              D  + +  +L+D +K  SL   S+       N +TGCC  +     S +++  M+ EL
Sbjct: 280  NGDSSTCIHSKLKDEDKMKSLAGISSEANCDIKNHNTGCCSNNLSERFSKDREYEMNTEL 339

Query: 1059 LAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLF 1238
            L  QKI L G + + FM R SNLSKDV+W+EFLCP CS L+GAYPC  D + LD GVRL+
Sbjct: 340  LDKQKIFLKGCLGDAFMLRHSNLSKDVKWIEFLCPKCSSLIGAYPCSSDKAPLDDGVRLY 399

Query: 1239 KCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            K  ISTCLP    N+ FR Y+ ERMFS QLLE+A++
Sbjct: 400  KFNISTCLPVVGLNDLFREYTLERMFSRQLLEAAQD 435


>ref|XP_002521520.1| conserved hypothetical protein [Ricinus communis]
            gi|223539198|gb|EEF40791.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 532

 Score =  330 bits (846), Expect = 8e-88
 Identities = 192/446 (43%), Positives = 258/446 (57%), Gaps = 10/446 (2%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEA-ET 215
            +NPR WRFTWEAQSH PT+KL LF+   +P+ QC++L+V L + QS LL+++ ++   E 
Sbjct: 4    ENPRKWRFTWEAQSHSPTLKLLLFDSQTKPSIQCNNLRVNLNLSQSHLLLTWIDENTDEF 63

Query: 216  SLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVT--------EFGSTSDDFR 371
            SL+VP+P +LID D P  F+A DDHIE KL+LLLPVDHPI T        E     D  +
Sbjct: 64   SLKVPIPKVLIDPDCPFSFRALDDHIEAKLVLLLPVDHPIFTNLSLVDDGESNDVLDCLK 123

Query: 372  PLSADSDQKKLSALEEVYFYCRSCSTKLT-RPLRSFKELPSVNWQDVADNWXXXXXXXXX 548
            PL  DSD K LS++EEV+FYCRSCST+LT  PLR F ELPSVNW++ ADNW         
Sbjct: 124  PLYMDSDLKSLSSMEEVHFYCRSCSTRLTANPLRHFVELPSVNWRETADNWFGACCCSFG 183

Query: 549  XVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVNVL 728
             + EKLV  YA +YTC  GVCLL   AV +CKDDL+ C+ ++ +    +           
Sbjct: 184  GISEKLVNRYADAYTCAKGVCLLSPPAVTLCKDDLVECKFVDCDGIQRN----------- 232

Query: 729  TQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGV 908
                  +R E    + L   S   +G N K    C      NK    D  + ++ R  G 
Sbjct: 233  -----ESRKEYSGPIGLSEESMLYRGSNPKIDASCD-----NKNDTLDLSENVASR-PGC 281

Query: 909  SLQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIELLAHQKILLDG 1088
               +  A   V + T   +  +  G             + + K     E +A+++ +L+G
Sbjct: 282  CNSMHHALDDVEVSTHEVHQPSLLGQI-----------TRKAK-----ENMANRRSILNG 325

Query: 1089 FIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPG 1268
            F+   FMAR+ NLS DVQW +F+CP CS LLGAYPC  D+  LD GVRLFKC +ST LP 
Sbjct: 326  FLGGVFMARSYNLSMDVQWKQFVCPQCSTLLGAYPCADDDVPLDDGVRLFKCYLSTSLPV 385

Query: 1269 FDSNNCFRRYSWERMFSIQLLESAKE 1346
              S + FR+Y+ E+MF+ QL+ESAK+
Sbjct: 386  GGSADLFRKYNLEKMFTNQLVESAKD 411


>gb|EOX90986.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 565

 Score =  326 bits (835), Expect = 2e-86
 Identities = 194/464 (41%), Positives = 273/464 (58%), Gaps = 25/464 (5%)
 Frame = +3

Query: 30   NPYQNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEA 209
            NP +NPR WRFTWEAQSH P ++L+LF+   +P+ QC  LKV L + QS +LVS+  +E 
Sbjct: 5    NP-ENPRKWRFTWEAQSHSPNLRLFLFDSQTKPSVQCKKLKVHLNLFQSQVLVSWLKEEK 63

Query: 210  E--TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST--------- 356
            E   ++RVP+P +LID++SPV F+A DDHIEVKL+LLLPV HPIV+ F S          
Sbjct: 64   EEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSRFDSVLNSSENGDD 123

Query: 357  ---SDDFRPLSADSDQKKLSALEE-VYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNW 521
                D   PL  D+D K LS++EE V+FYCR+CS +LT  PLR+F E+PS++W++VADNW
Sbjct: 124  ALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTENPLRNFVEMPSIDWREVADNW 183

Query: 522  XXXXXXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFT 701
                      + EK+VT +A SY C  GVCLL  TAV++ KDDL+ C+  N    H   +
Sbjct: 184  FGACCCSFGGISEKMVTRFANSYKCAKGVCLLSFTAVVLSKDDLVACKLYNRTQEHQPGS 243

Query: 702  QNLASVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVD 881
             + +S  VL++    +R E    +C            GK S +  ++D + K +     +
Sbjct: 244  -DFSSDCVLSEEMLSSR-ESTNDLC------------GKLSSMHLKNDSVTKNVLVAKEE 289

Query: 882  KISDRLSGVSLQLQDAEKQVSLVTCSTNGEAKFGN-CDTG-------CCRPSAGSSNEQK 1037
                +L         +E + S++ C  + E    N  D G        C     +S    
Sbjct: 290  ANGHKLFSALPVPDVSENETSVLGCCVHTENHIRNHVDEGGQHDVSETCLVDQNTS---- 345

Query: 1038 PGMDIELLAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSML 1217
                 +LLA+QK+ L+G + N FMA++ NLS D++W+EF+CP+C  LLGAYP  +  + +
Sbjct: 346  -----KLLANQKLFLNGSLGNAFMAKSYNLSMDIEWMEFVCPNCLSLLGAYPFDNGGAPI 400

Query: 1218 DGGVRLFKCQISTCLPGFDSNNCF-RRYSWERMFSIQLLESAKE 1346
            DGGVRLFKC ISTC       + F R+YS ERMF+ QLLE+AK+
Sbjct: 401  DGGVRLFKCYISTCTSAGGLGDMFSRKYSLERMFTNQLLENAKD 444


>ref|XP_006380773.1| hypothetical protein POPTR_0007s13390g [Populus trichocarpa]
            gi|550334794|gb|ERP58570.1| hypothetical protein
            POPTR_0007s13390g [Populus trichocarpa]
          Length = 560

 Score =  324 bits (830), Expect = 6e-86
 Identities = 187/443 (42%), Positives = 263/443 (59%), Gaps = 7/443 (1%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE--AE 212
            +NP+ WRFTWE QS  P +KL+LFN   +P+     L+ +L + +S LLV+FT +E  +E
Sbjct: 4    KNPKKWRFTWETQSQSPNLKLFLFNSQSKPSVH--HLQAQLNLSKSHLLVTFTENEETSE 61

Query: 213  TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSDDFRPLSADSD 392
             S+RVP+P +LID +SPV+ KA DDHIEVKL+LLLPVDH +V+ F     D   LS D +
Sbjct: 62   VSIRVPIPRVLIDPESPVNAKASDDHIEVKLVLLLPVDHHLVSTF-----DLLNLSDDEN 116

Query: 393  QKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXXXXXVGEKLV 569
             K LS++E V+FYCRSCS +LTR PL+ F E+PSVNW ++ADNW            EKLV
Sbjct: 117  LKSLSSMEGVHFYCRSCSNRLTRSPLKQFVEMPSVNWPEMADNWFGGCCCSFGGASEKLV 176

Query: 570  TSYAKSYTCVPGVCLLENTAVLICKDDLIGCE-SLNWNHRHNDFTQNLASVNVLTQNTPV 746
              YA +Y C  GVC+L +TAV +C DDL GC+ S  +  +     Q      +  +    
Sbjct: 177  NRYAHAYACPMGVCMLNSTAVTLCSDDLAGCKFSEKYRIQTCKPEQESGDEGLSEEAMRD 236

Query: 747  NRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGVS-LQLQ 923
               E G+     S      GVNGK    CS+ ++  + ++    ++ ++  + +S L   
Sbjct: 237  FETESGRGTRCDSQCGVIHGVNGKSGSSCSKLENHGENVKFKVAEEKTNSSTLLSALPAS 296

Query: 924  D-AEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNE-QKPGMDIELLAHQKILLDGFIW 1097
            D +EK      C  +        D G      G S E QK   D+EL  +Q+  L+GF+ 
Sbjct: 297  DLSEKVAPGPGCCDSVHHTQDYTDEGGIHDVCGPSLEDQKTTKDMELRINQRSFLNGFLG 356

Query: 1098 NGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDS 1277
            + F AR+ NLS D++W +F+CP CS L+GAYPC + +  +D GVRLFKC IST LP  + 
Sbjct: 357  DAFTARSYNLSTDIEWKQFVCPQCSSLIGAYPCANGDMPVDDGVRLFKCYISTSLPVGEQ 416

Query: 1278 NNCFRRYSWERMFSIQLLESAKE 1346
             + FR+Y+ ERMF+ QL+ESAK+
Sbjct: 417  ADLFRKYTLERMFTSQLVESAKD 439


>ref|XP_002310271.2| hypothetical protein POPTR_0007s13390g [Populus trichocarpa]
            gi|550334795|gb|EEE90721.2| hypothetical protein
            POPTR_0007s13390g [Populus trichocarpa]
          Length = 525

 Score =  322 bits (824), Expect = 3e-85
 Identities = 191/451 (42%), Positives = 256/451 (56%), Gaps = 15/451 (3%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE--AE 212
            +NP+ WRFTWE QS  P +KL+LFN   +P+     L+ +L + +S LLV+FT +E  +E
Sbjct: 4    KNPKKWRFTWETQSQSPNLKLFLFNSQSKPSVH--HLQAQLNLSKSHLLVTFTENEETSE 61

Query: 213  TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFG--STSDD------- 365
             S+RVP+P +LID +SPV+ KA DDHIEVKL+LLLPVDH +V+ F   + SDD       
Sbjct: 62   VSIRVPIPRVLIDPESPVNAKASDDHIEVKLVLLLPVDHHLVSTFDLLNLSDDESERNED 121

Query: 366  ---FRPLSADSDQKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXX 533
                 P   DSD K LS++E V+FYCRSCS +LTR PL+ F E+PSVNW ++ADNW    
Sbjct: 122  LDLLNPFIMDSDLKSLSSMEGVHFYCRSCSNRLTRSPLKQFVEMPSVNWPEMADNWFGGC 181

Query: 534  XXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLA 713
                    EKLV  YA +Y C  GVC+L +TAV +C DDL GC S        DF     
Sbjct: 182  CCSFGGASEKLVNRYAHAYACPMGVCMLNSTAVTLCSDDLAGCLS---EEAMRDF----- 233

Query: 714  SVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISD 893
                          E G+     S      GVNGK    CS       KLE+        
Sbjct: 234  ------------ETESGRGTRCDSQCGVIHGVNGKSGSSCS-------KLENH------- 267

Query: 894  RLSGVSLQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIELLAHQK 1073
               G +++ + AE++       TN     G     C      S  +QK   D+EL  +Q+
Sbjct: 268  ---GENVKFKVAEEK-------TNNYTDEGGIHDVC----GPSLEDQKTTKDMELRINQR 313

Query: 1074 ILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQIS 1253
              L+GF+ + F AR+ NLS D++W +F+CP CS L+GAYPC + +  +D GVRLFKC IS
Sbjct: 314  SFLNGFLGDAFTARSYNLSTDIEWKQFVCPQCSSLIGAYPCANGDMPVDDGVRLFKCYIS 373

Query: 1254 TCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            T LP  +  + FR+Y+ ERMF+ QL+ESAK+
Sbjct: 374  TSLPVGEQADLFRKYTLERMFTSQLVESAKD 404


>ref|XP_002334039.1| predicted protein [Populus trichocarpa]
          Length = 554

 Score =  320 bits (821), Expect = 7e-85
 Identities = 186/440 (42%), Positives = 260/440 (59%), Gaps = 7/440 (1%)
 Frame = +3

Query: 48   RNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDE--AETSL 221
            + WRFTWE QS  P +KL+LFN   +P+     L+ +L + +S LLV+FT +E  +E S+
Sbjct: 1    KKWRFTWETQSQSPNLKLFLFNSQSKPSVH--HLQAQLNLSKSHLLVTFTENEETSEVSI 58

Query: 222  RVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSDDFRPLSADSDQKK 401
            RVP+P +LID +SPV+ KA DDHIEVKL+LLLPVDHP+V+ F     D   LS D + K 
Sbjct: 59   RVPIPRVLIDPESPVNAKASDDHIEVKLVLLLPVDHPLVSTF-----DLLNLSDDENLKS 113

Query: 402  LSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXXXXXVGEKLVTSY 578
            LS++E V+FYCRSCS +LTR PL+ F E+PSVNW ++ADNW            EKLV  Y
Sbjct: 114  LSSMEGVHFYCRSCSNRLTRSPLKQFVEMPSVNWPEMADNWFGGCCCSFGGASEKLVNRY 173

Query: 579  AKSYTCVPGVCLLENTAVLICKDDLIGCE-SLNWNHRHNDFTQNLASVNVLTQNTPVNRI 755
            A +Y C  GVC+L +TAV +C DDL GC+ S  +  +     Q      +  +       
Sbjct: 174  AHAYACPMGVCMLNSTAVTLCSDDLAGCKFSEKYRIQTCKPEQESGDEGLSEEAMRDFET 233

Query: 756  EQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGVS-LQLQD-A 929
            E G+     S      GVNGK    CS+ ++  + ++    ++ ++  + +S L   D +
Sbjct: 234  ESGRGTRCDSQCGVIHGVNGKSGSSCSKLENHGENVKFKVAEEKTNSSTLLSALPASDLS 293

Query: 930  EKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNE-QKPGMDIELLAHQKILLDGFIWNGF 1106
            EK      C  +        D G      G S E QK   D+EL  +Q+  L+GF+ + F
Sbjct: 294  EKVAPGPGCCDSVHHTQDYTDEGGIHDVCGPSLEDQKTTKDMELRINQRSFLNGFLGDAF 353

Query: 1107 MARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDSNNC 1286
             AR+ NLS D++W +F+CP CS L+GAYPC + +  +D GVRLFKC IST LP     + 
Sbjct: 354  TARSYNLSTDIEWKQFVCPQCSSLIGAYPCANGDMPVDDGVRLFKCYISTSLPVGGPADL 413

Query: 1287 FRRYSWERMFSIQLLESAKE 1346
            FR+Y+ ERMF+ QL+ESAK+
Sbjct: 414  FRKYTLERMFTSQLVESAKD 433


>gb|EPS62368.1| hypothetical protein M569_12423, partial [Genlisea aurea]
          Length = 500

 Score =  320 bits (819), Expect = 1e-84
 Identities = 188/441 (42%), Positives = 257/441 (58%), Gaps = 10/441 (2%)
 Frame = +3

Query: 54   WRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAE--TSLRV 227
            W+FTWE+QSH   ++L LFN  +   +   DL+V L++E+S LLVSF  +  E  T LRV
Sbjct: 1    WKFTWESQSHASVIRLLLFNSDVGLHSVVRDLEVILLLEESVLLVSFMENGTEKVTQLRV 60

Query: 228  PLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTS----DDFRPLSADSDQ 395
             +P +LID DSP HF A DDHIEVKL+LLLPVDHP+V++  S      D FRP +ADSD 
Sbjct: 61   QIPRVLIDPDSPPHFNAFDDHIEVKLVLLLPVDHPLVSQVDSILNLEIDRFRP-TADSDL 119

Query: 396  KKLSALEEVYFYCRSCSTKLTRPLRSFKELPSVNWQDVADNWXXXXXXXXXXVGEKLVTS 575
            K+LS +EEV+FYCR CS+KLT  LR F E+PS NW+DVADNW            EKLV S
Sbjct: 120  KRLSCIEEVHFYCRHCSSKLTNGLRCFYEMPSANWRDVADNWFGNCCCSFGGASEKLVAS 179

Query: 576  YAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLA-SVNVLTQNTPVNR 752
            YAKS+   PGV  L  +++L+CK DL+GC+  +   +    ++ L+ S  +      +NR
Sbjct: 180  YAKSHVFAPGVGFLSASSLLLCKSDLLGCKLRDVKVKPKKDSKELSCSRKISKDGDCLNR 239

Query: 753  IEQGKTVCLKSGSAGEQGVNGKESH--ICSEDDDLNKKLEHDAVDKISDRLSGVSLQLQD 926
            ++              + V+G  S   IC  +D LN +                    +D
Sbjct: 240  VD-------------AESVSGTLSTEVICGSEDPLNGE--------------------RD 266

Query: 927  AEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIE-LLAHQKILLDGFIWNG 1103
            +E       C  N    F   +          SN Q  G D E +L ++++ LDG++ +G
Sbjct: 267  SE-------CCNNLHCDFETVE--------ALSNLQITGADDEKILENERVFLDGYLGDG 311

Query: 1104 FMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDSNN 1283
            FM ++S +S ++QW E+LC  CS L+GAYPC  DN+ LDGGVRLFK  ISTC+P  +S +
Sbjct: 312  FMLKSSAVSNEIQWSEYLCSQCSSLIGAYPCIDDNTPLDGGVRLFKYNISTCVPYGESED 371

Query: 1284 CFRRYSWERMFSIQLLESAKE 1346
             FR YS ER F+ QLLESA++
Sbjct: 372  VFRDYSLERAFASQLLESAED 392


>ref|XP_004169392.1| PREDICTED: uncharacterized LOC101204887 [Cucumis sativus]
          Length = 513

 Score =  318 bits (814), Expect = 4e-84
 Identities = 190/456 (41%), Positives = 258/456 (56%), Gaps = 20/456 (4%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAETS 218
            +NPR WRFTWEAQSHIP ++L LF+ I  P+ QC +LKV+L ++QS + V++  D  + S
Sbjct: 9    ENPRKWRFTWEAQSHIPILRLLLFDSITNPSLQCRNLKVQLNLQQSVVCVAWLQD-LDMS 67

Query: 219  LRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSD-----------D 365
            +RVP+P +L+D DSP+ F+A +DHIEVKL+LLLPVDHPI+  F +  D            
Sbjct: 68   IRVPMPPVLVDADSPLSFRAFEDHIEVKLVLLLPVDHPIILNFDNVLDFSQEQGTSHSKA 127

Query: 366  FRPLSADSDQKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXX 542
             +PLS DSDQ  LS    V+FYCR+CS +L++ PLR F E+PSVNW++VADNW       
Sbjct: 128  SKPLSMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCS 187

Query: 543  XXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNL---- 710
               + EKLV  Y  SY C  GVCLL  T + + KDDLIG    +     N+ TQ L    
Sbjct: 188  FGGISEKLVNRYTNSYRCEKGVCLLTLTTITLSKDDLIGHVFPD-----NEGTQQLKDES 242

Query: 711  --ASVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDK 884
              A  + LT+    +      T  +KS     + +N K          L   +E    +K
Sbjct: 243  DFADGDCLTEAKEESPCNHTSTEKVKS-----KQINNKS---------LYANMEGSVAEK 288

Query: 885  ISDRLSGVSLQLQDAEKQVSLVTCSTNGEAK-FGNCDTGCCRPSAGS-SNEQKPGMDIEL 1058
             SD +        D+     +  C  + E+    + D  C   + G+  ++ KP   +++
Sbjct: 289  ASDEV--------DSPIVTPIPDCCHHEESNVLHHLDKDCMHHTCGTIKSDPKPVNAVDI 340

Query: 1059 LAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLF 1238
               Q+  L+GF+ N FMAR SNLS D +W EF CP CS L+GAYP  +     DGGVR F
Sbjct: 341  SDDQRSFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPWRNGCGPTDGGVRFF 400

Query: 1239 KCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            KC +STCL   +S N  R Y+ ERMF+ QLLESA E
Sbjct: 401  KCYVSTCLAA-ESGNLLREYTLERMFANQLLESAHE 435


>ref|XP_004149986.1| PREDICTED: uncharacterized protein LOC101204887 [Cucumis sativus]
          Length = 556

 Score =  318 bits (814), Expect = 4e-84
 Identities = 190/456 (41%), Positives = 258/456 (56%), Gaps = 20/456 (4%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAETS 218
            +NPR WRFTWEAQSHIP ++L LF+ I  P+ QC +LKV+L ++QS + V++  D  + S
Sbjct: 9    ENPRKWRFTWEAQSHIPILRLLLFDSITNPSLQCRNLKVQLNLQQSVVCVAWLQD-LDMS 67

Query: 219  LRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSD-----------D 365
            +RVP+P +L+D DSP+ F+A +DHIEVKL+LLLPVDHPI+  F +  D            
Sbjct: 68   IRVPMPPVLVDADSPLSFRAFEDHIEVKLVLLLPVDHPIILNFDNVLDFSQEQGTSHSKA 127

Query: 366  FRPLSADSDQKKLSALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXX 542
             +PLS DSDQ  LS    V+FYCR+CS +L++ PLR F E+PSVNW++VADNW       
Sbjct: 128  SKPLSMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCS 187

Query: 543  XXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNL---- 710
               + EKLV  Y  SY C  GVCLL  T + + KDDLIG    +     N+ TQ L    
Sbjct: 188  FGGISEKLVNRYTNSYRCEKGVCLLTLTTITLSKDDLIGHVFPD-----NEGTQQLKDES 242

Query: 711  --ASVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDK 884
              A  + LT+    +      T  +KS     + +N K          L   +E    +K
Sbjct: 243  DFADGDCLTEAKEESPCNHTSTEKVKS-----KQINNKS---------LYANMEGSVAEK 288

Query: 885  ISDRLSGVSLQLQDAEKQVSLVTCSTNGEAK-FGNCDTGCCRPSAGS-SNEQKPGMDIEL 1058
             SD +        D+     +  C  + E+    + D  C   + G+  ++ KP   +++
Sbjct: 289  ASDEV--------DSPIVTPIPDCCHHEESNVLHHLDKDCMHHTCGTIKSDPKPVNAVDI 340

Query: 1059 LAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLF 1238
               Q+  L+GF+ N FMAR SNLS D +W EF CP CS L+GAYP  +     DGGVR F
Sbjct: 341  SDDQRSFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPWRNGCGPTDGGVRFF 400

Query: 1239 KCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            KC +STCL   +S N  R Y+ ERMF+ QLLESA E
Sbjct: 401  KCYVSTCLAA-ESGNLLREYTLERMFANQLLESAHE 435


>ref|XP_006466852.1| PREDICTED: uncharacterized protein LOC102618002 [Citrus sinensis]
          Length = 577

 Score =  312 bits (800), Expect = 2e-82
 Identities = 201/477 (42%), Positives = 272/477 (57%), Gaps = 41/477 (8%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPAT--QCSDLKVKLIIEQSSLLVSFTNDE-- 206
            +NPR WRFTWEAQSH PT+KL++F+      T  QC +L+V L + +  + +++   +  
Sbjct: 4    KNPRKWRFTWEAQSHSPTLKLFVFDSHTHTNTSNQCQNLEVNLNLPKHHVSITWVQQQDL 63

Query: 207  -AETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST--------S 359
             ++ SL VP+P +LID++SPV ++A  DHIEVKL+LLLPVDHPIV    S         S
Sbjct: 64   QSQISLLVPIPKVLIDSESPVSYRALADHIEVKLVLLLPVDHPIVASSDSVLRLTQIADS 123

Query: 360  DDFRPLSADSDQKKLSALEE-VYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXX 533
                PL  DSD KKLS  +E V+FYCR+CSTKLT+ P+R+F E+PSVNWQ+ ADNW    
Sbjct: 124  HLSTPLLMDSDIKKLSLKKEGVHFYCRNCSTKLTKTPIRNFVEMPSVNWQEAADNWFGAC 183

Query: 534  XXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLA 713
                  + EKLVT YA  YTCV  +CLL  T V + KDDL+GC          +F + + 
Sbjct: 184  CCSFGGISEKLVTRYANCYTCVTSMCLLNCTTVTLFKDDLVGC----------NFPERVG 233

Query: 714  SVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKIS- 890
                  Q    + +E      +  GSA + G+  +++  C   DD N+ L HD   K+S 
Sbjct: 234  GGKC--QAGLASTVED-----VTCGSALDPGIIYRKTAGC---DDQNE-LVHDFSGKLSF 282

Query: 891  ----DRLSGVSLQLQDAEKQVS----LVTCSTNGEAKFGNCDTGCC-------------- 1004
                D    V+ Q    EK+ S      T   +G ++      GCC              
Sbjct: 283  MGPKDENFIVTQQFDVNEKETSGDSLFCTLPVSGSSENVASAPGCCVHATQEIQDHVGEG 342

Query: 1005 -RP--SAGSSNEQKPGMDIELLAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSC 1175
             +P  S  SS  QK   +    A Q+   +GF+ N FMAR+ NLSKD++W+EFLCP C  
Sbjct: 343  CKPTLSESSSLYQKTERNT---ADQRSFRNGFLGNIFMARSYNLSKDIEWIEFLCPCCLT 399

Query: 1176 LLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
             LGAYP    ++ +DGGVRLFKC ISTCLP   S++ FR+Y+ ERMF+ QLLESAK+
Sbjct: 400  FLGAYPSSTSDAPIDGGVRLFKCYISTCLPISGSDDKFRKYTLERMFTNQLLESAKD 456


>ref|XP_006425615.1| hypothetical protein CICLE_v10025250mg [Citrus clementina]
            gi|557527605|gb|ESR38855.1| hypothetical protein
            CICLE_v10025250mg [Citrus clementina]
          Length = 577

 Score =  310 bits (793), Expect = 1e-81
 Identities = 200/477 (41%), Positives = 270/477 (56%), Gaps = 41/477 (8%)
 Frame = +3

Query: 39   QNPRNWRFTWEAQSHIPTVKLYLFNPIIRPAT--QCSDLKVKLIIEQSSLLVSFTNDEA- 209
            +NPR WRFTWEAQSH PT+KL++F+      T  QC +L+V L + +  + +++   +  
Sbjct: 4    KNPRKWRFTWEAQSHSPTLKLFVFDSHTHTNTSNQCQNLEVNLNLPKHHVSITWVQQQDL 63

Query: 210  --ETSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGST--------S 359
              + SL VP+P +LID++SP  ++A  DHIEVKL+LLLPVDHPIV    S         S
Sbjct: 64   QNQISLLVPIPKVLIDSESPASYRALADHIEVKLVLLLPVDHPIVASSDSVLRLTQIADS 123

Query: 360  DDFRPLSADSDQKKLSALEE-VYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXX 533
                PL  DSD KKLS  +E V+FYCR+CSTKLT+ P+R+F E+PSVNWQ+ ADNW    
Sbjct: 124  HLSTPLLMDSDIKKLSLKKEGVHFYCRNCSTKLTKTPIRNFVEMPSVNWQEAADNWFGAC 183

Query: 534  XXXXXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLA 713
                  + EKLVT YA  YTCV  +CLL  T V + KDDL+GC          +F + + 
Sbjct: 184  CCSFGGISEKLVTRYANCYTCVTSMCLLNCTTVTLFKDDLVGC----------NFPERVG 233

Query: 714  SVNVLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKIS- 890
                  Q    + +E      +  GSA + G+  +++  C   DD N+ L HD   K+S 
Sbjct: 234  GGKC--QAGLASTVED-----VTCGSALDPGIIYRKTAGC---DDQNE-LVHDFSGKLSF 282

Query: 891  ----DRLSGVSLQLQDAEKQVS----LVTCSTNGEAKFGNCDTGCC-------------- 1004
                D    V+ Q    EK+ S      T   +G ++      GCC              
Sbjct: 283  MGPKDENFIVTQQFDVNEKETSGDSLFCTLPVSGSSENVASAPGCCVHATQEIQDHVGEG 342

Query: 1005 -RP--SAGSSNEQKPGMDIELLAHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSC 1175
             +P  S  SS  QK   +    A Q+   +GF+ N FMAR+ NLSKD++W+EFLCP C  
Sbjct: 343  CKPTLSESSSLYQKTERNT---ADQRSFRNGFLGNIFMARSYNLSKDIEWIEFLCPCCLT 399

Query: 1176 LLGAYPCFHDNSMLDGGVRLFKCQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
             LGAYP    ++ +DGGVRLFKC ISTCLP   S++ FR+Y+ ERMF+ QLLESAK+
Sbjct: 400  FLGAYPSSTSDAPIDGGVRLFKCYISTCLPISGSDDKFRKYTLERMFTNQLLESAKD 456


>gb|ESW28079.1| hypothetical protein PHAVU_003G257200g [Phaseolus vulgaris]
          Length = 571

 Score =  308 bits (788), Expect = 4e-81
 Identities = 185/455 (40%), Positives = 260/455 (57%), Gaps = 24/455 (5%)
 Frame = +3

Query: 54   WRFTWEAQSHIPTVKLYLF--NPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAETSLRV 227
            WR+TWEAQSH+PT++L LF  +  + P+ QC DL V L    S L ++ ++     SLRV
Sbjct: 10   WRYTWEAQSHVPTLRLMLFPNDKTLNPSLQCHDLAVNLHSSHSFLTLTTSS----LSLRV 65

Query: 228  PLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFG-STSDDFRPLSADSDQKKL 404
            PLP +L+D DSPV F+   DHIEVKLLLLLPVDHPI++    STS    PL ++SD  KL
Sbjct: 66   PLPAVLVDADSPVTFRPLSDHIEVKLLLLLPVDHPILSSLHPSTSPLPDPLISESDVSKL 125

Query: 405  SALEEVYFYCRSCSTKLTR-PLRSFKELPSVNWQDVADNWXXXXXXXXXXVGEKLVTSYA 581
            S+  EV FYCR+C+ KLTR PLR+F E+PSVNW++VADNW          + EK+V  Y 
Sbjct: 126  SSAGEVDFYCRTCTFKLTRIPLRNFVEMPSVNWREVADNWFGACCCSFGGISEKMVMRYV 185

Query: 582  KSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVNVLTQNTPVNRIEQ 761
             SYTCVPGVCLL + +V +CKDDL+      +N       Q   SV    ++  + ++ +
Sbjct: 186  SSYTCVPGVCLLSSASVTLCKDDLV-----EYNFPEGCGKQECTSVAENPRDDAIGKVSR 240

Query: 762  GKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKL--EHDAVDKISDRLSGVSLQLQDAEK 935
                C  +        + + +  CS+D  +       +  VD   ++LS ++L+ + A+ 
Sbjct: 241  N---CELNDERTSTCSDDERTSTCSDDGGVTLAFGSNYRFVDSEDEKLS-MNLRCEVAKS 296

Query: 936  QVSLVTCSTNGEAKFGNCDTG----CC-------------RPSAGSSN-EQKPGMDIELL 1061
            +      S +     G  D      CC               S G++  E  P   +E+L
Sbjct: 297  KPDCGHFSDSHPDSNGTKDVSETPSCCARMTNNLGDEHSEHHSCGTAGREGMPTETLEIL 356

Query: 1062 AHQKILLDGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPCFHDNSMLDGGVRLFK 1241
             +QK LL+GF+ + FMAR SNL+KD+ W EF CP C  L+GAYPC   ++ +D GVRLFK
Sbjct: 357  GNQKSLLNGFLEDIFMARLSNLTKDIDWREFTCPQCRSLIGAYPCCEGHTPVDEGVRLFK 416

Query: 1242 CQISTCLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
            C ISTCLP     + F +YS  +MF+ +L+E A +
Sbjct: 417  CYISTCLPVGGPEDMFSKYSMGKMFANRLVECASD 451


>ref|XP_002876990.1| At3g26750 [Arabidopsis lyrata subsp. lyrata]
            gi|297322828|gb|EFH53249.1| At3g26750 [Arabidopsis lyrata
            subsp. lyrata]
          Length = 511

 Score =  303 bits (777), Expect = 8e-80
 Identities = 182/450 (40%), Positives = 247/450 (54%), Gaps = 14/450 (3%)
 Frame = +3

Query: 39   QNP---RNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEA 209
            QNP   R WR+TWEAQSH P ++L+LF+    P   C  L V +I+E+S LLV++ N+EA
Sbjct: 3    QNPKTQRTWRYTWEAQSHSPNLRLFLFDSTTNPKIHCKILNVSIILEKSQLLVTWINEEA 62

Query: 210  E-----TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSDD--- 365
                   SL VP+P +L+D +S V+FKA +DHIEV+L+LLLPVDHP+V++F   +D    
Sbjct: 63   TRKEEIVSLHVPIPKVLLDTESHVNFKALEDHIEVRLVLLLPVDHPLVSDFNLVTDSREK 122

Query: 366  FRPLSADSDQKKLSALEEVYFYCRSCSTKLTRP-LRSFKELPSVNWQDVADNWXXXXXXX 542
             +PL    D K LS +  V+FYCR+CS +LT+  L  F E+PS+NW++ ADNW       
Sbjct: 123  SKPLVMGYDLKTLSLMGGVHFYCRNCSNRLTKKELFDFSEMPSINWRESADNWFGTCCCS 182

Query: 543  XXXVGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVN 722
               + EK+V  Y  SYTC  G+CLL  T VL+ KDDL+ C   +     ++F  +LA   
Sbjct: 183  FGGISEKMVVKYTNSYTCSSGLCLLSATTVLLSKDDLVECSFSDKGGIVDEFDSSLALSC 242

Query: 723  VLTQNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLS 902
             L    P +R  +G           E   NG ES +C + DD                  
Sbjct: 243  DLGVVEPGSRSSEGN---------AESHENGSES-VCGQADD------------------ 274

Query: 903  GVSLQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIELLAHQKILL 1082
                         S++ C  N          GCC      SNE    ++ +L   +K LL
Sbjct: 275  -------------SILRCIGNESL------PGCCVHDLPDSNESFQ-LEQKLTLDKKFLL 314

Query: 1083 DGFIWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYPC--FHDNSMLDGGVRLFKCQIST 1256
            DGF+ + FMA+ SN+SK+V W+EF CP C   LGAYP      +  +DGGVRLFKC IST
Sbjct: 315  DGFLEDVFMAKASNVSKNVDWIEFACPKCYSPLGAYPSGGGSKDKPIDGGVRLFKCYIST 374

Query: 1257 CLPGFDSNNCFRRYSWERMFSIQLLESAKE 1346
                 +S N FR+Y+ ERMF+ QL+E AKE
Sbjct: 375  SSVTGESLNVFRKYTLERMFTNQLVECAKE 404


>ref|NP_189310.2| uncharacterized protein [Arabidopsis thaliana]
            gi|9279664|dbj|BAB01221.1| unnamed protein product
            [Arabidopsis thaliana] gi|332643690|gb|AEE77211.1|
            uncharacterized protein AT3G26750 [Arabidopsis thaliana]
          Length = 526

 Score =  303 bits (776), Expect = 1e-79
 Identities = 176/447 (39%), Positives = 243/447 (54%), Gaps = 14/447 (3%)
 Frame = +3

Query: 48   RNWRFTWEAQSHIPTVKLYLFNPIIRPATQCSDLKVKLIIEQSSLLVSFTNDEAE----- 212
            R WR+TWEAQSH P ++L+LF+    P   C  L V  I+ +S LLV++ N+E E     
Sbjct: 20   RTWRYTWEAQSHSPNLRLFLFDSKTNPKIHCKSLNVSTIVGKSQLLVTWINEEDEEAASK 79

Query: 213  ---TSLRVPLPNILIDNDSPVHFKAHDDHIEVKLLLLLPVDHPIVTEFGSTSDDFR---P 374
                SL VP+P +L+D +SPV+FKA DDHIEV+L+LLLPVDHP+V++F   +D      P
Sbjct: 80   EEIVSLLVPIPRVLLDTESPVNFKALDDHIEVRLVLLLPVDHPLVSDFNLVTDSREKSAP 139

Query: 375  LSADSDQKKLSALEEVYFYCRSCSTKLTRP-LRSFKELPSVNWQDVADNWXXXXXXXXXX 551
            L    D K LS +  V+FYCRSCS +LT+  L  F E+PS+NW++ ADNW          
Sbjct: 140  LVMGYDLKTLSLMGGVHFYCRSCSNRLTKKELLDFSEMPSINWRESADNWFGTCCCSFGG 199

Query: 552  VGEKLVTSYAKSYTCVPGVCLLENTAVLICKDDLIGCESLNWNHRHNDFTQNLASVNVLT 731
            + EK+V  Y  SYTC  G+CLL  T VL+ KDDL+ C          +F  +LA    + 
Sbjct: 200  ISEKMVVKYTNSYTCSSGLCLLSATTVLLSKDDLVECILSEKGGTEVEFESSLALSCDVG 259

Query: 732  QNTPVNRIEQGKTVCLKSGSAGEQGVNGKESHICSEDDDLNKKLEHDAVDKISDRLSGVS 911
               P +RI +G     +SG            ++C + D+   +     +DK S       
Sbjct: 260  VVEPGSRISEGNAESHESGG----------ENVCGQVDESKTR----CIDKAS------- 298

Query: 912  LQLQDAEKQVSLVTCSTNGEAKFGNCDTGCCRPSAGSSNEQKPGMDIELLAHQKILLDGF 1091
                                        GCC   +  SNE     + +L   +K LLDGF
Sbjct: 299  --------------------------LPGCCVHDSPDSNESVQLEEKKLTLDKKFLLDGF 332

Query: 1092 IWNGFMARTSNLSKDVQWVEFLCPHCSCLLGAYP--CFHDNSMLDGGVRLFKCQISTCLP 1265
            + + FMA+ SN+SK+V+W+EF CP CS  LGAYP     +   +DGGVRLFKC IST   
Sbjct: 333  LEDVFMAKASNVSKNVEWIEFACPECSSPLGAYPSGVGSNGKPIDGGVRLFKCYISTSST 392

Query: 1266 GFDSNNCFRRYSWERMFSIQLLESAKE 1346
              +S++ FR+Y+ ERMF+ QL+E +KE
Sbjct: 393  TGESSDVFRKYTLERMFTNQLVECSKE 419

