BLASTX nr result

ID: Cinnamomum23_contig00011824 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00011824
         (1411 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010268082.1| PREDICTED: uncharacterized protein LOC104605...   211   8e-52
ref|XP_010268079.1| PREDICTED: uncharacterized protein LOC104605...   211   8e-52
ref|XP_007026080.1| Homeodomain-like superfamily protein, putati...   192   5e-46
ref|XP_007026078.1| Homeodomain-like superfamily protein, putati...   192   5e-46
ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Popu...   190   3e-45
ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citr...   190   3e-45
ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624...   189   6e-45
ref|XP_011047989.1| PREDICTED: uncharacterized protein LOC105142...   187   1e-44
ref|XP_007026079.1| Homeodomain-like superfamily protein, putati...   185   8e-44
gb|KHG10856.1| 30S ribosomal S5, chloroplastic [Gossypium arboreum]   184   1e-43
ref|XP_010655394.1| PREDICTED: uncharacterized protein LOC100247...   182   7e-43
ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247...   182   7e-43
ref|XP_002518479.1| conserved hypothetical protein [Ricinus comm...   182   7e-43
ref|XP_012454018.1| PREDICTED: uncharacterized protein LOC105776...   180   2e-42
gb|KJB69277.1| hypothetical protein B456_011G014000, partial [Go...   180   2e-42
ref|XP_012091341.1| PREDICTED: uncharacterized protein LOC105649...   174   2e-40
ref|XP_012091339.1| PREDICTED: uncharacterized protein LOC105649...   174   2e-40
ref|XP_012091340.1| PREDICTED: uncharacterized protein LOC105649...   174   2e-40
gb|KDO45255.1| hypothetical protein CISIN_1g000732mg [Citrus sin...   172   7e-40
ref|XP_010105693.1| hypothetical protein L484_011305 [Morus nota...   169   6e-39

>ref|XP_010268082.1| PREDICTED: uncharacterized protein LOC104605144 isoform X2 [Nelumbo
            nucifera]
          Length = 1481

 Score =  211 bits (538), Expect = 8e-52
 Identities = 177/479 (36%), Positives = 231/479 (48%), Gaps = 32/479 (6%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE GAE DLQMHPLLFQ+PEDG  PY  +                 LQTNL+LL K    
Sbjct: 999  EEKGAEPDLQMHPLLFQAPEDGSFPYYPLKCGTASSAFAFLPQNQ-LQTNLNLLCKPHP- 1056

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP------------LQGPCVQ 1068
               VD  + +LRSKE   + C IDFHPLL+  DNIN   +              QG   Q
Sbjct: 1057 NPQVDSINKSLRSKETSLSSC-IDFHPLLRKTDNINDSVDASSTTNFSINLTSFQGNSAQ 1115

Query: 1067 IPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRS 888
              NPS+  L  P V   Q AT T  TS +EKANELDL+IHLSS++    +G R   EHRS
Sbjct: 1116 SQNPSDCVLIDPQVRCCQLATGTVPTSSFEKANELDLEIHLSSSSR---IGCRGLTEHRS 1172

Query: 887  NGSSIRAREDGTI------EENQQEESRTKADSST------HVMDAHELALSSIGISRLT 744
             G  I A + G +         Q  +  T A  S       H +    +   S  I+  T
Sbjct: 1173 KGQQISALDCGPMVGKVSSPSYQSSKHYTAASVSNKQCNKEHALGTRAMVQESRNINIYT 1232

Query: 743  EDNLDEQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEGEGLGYRELDNLQNKELPS 567
            EDN  +QS P IVMEQEELSDSD+++G+ V+FECEEMADSEGE   + +  N+QNK++  
Sbjct: 1233 EDNTGDQSLPEIVMEQEELSDSDDEIGENVQFECEEMADSEGEETDHEQFLNIQNKDVLP 1292

Query: 566  VALEEERTSTDANNGSESILMRCDPKKNIL-VGDSNTYSQKLVLTAKSNNI-GSSTQSNS 393
            VA+E+    T A +  +  L  C P+       +S+T S KL  T K  +I G   QS S
Sbjct: 1293 VAVEDV-ARTAACDDQQCELRICGPQAIACDATESSTASCKLGFTKKCKDIRGRVLQSTS 1351

Query: 392  GSCTLGPSSSTLKRERRNRGHRET-----SNVVQSRPVRSSKRKPNVDRVAVHSQEVPQL 228
                LG  +S    E    G+ +T      N + SRP RSS++     +     Q     
Sbjct: 1352 D--PLGYLNSPRPSEESRNGNDQTGKSCLENGLPSRPKRSSRKMMPYSKAGTAEQH---- 1405

Query: 227  VLNPTTAECSTTIASTKKPKKRGCRSNPEGVEMGNYKHVSSENMVDHPDDCGMRDGQKI 51
                     +T  A T K +KR  R+        +   V   + +D+  DC   D  +I
Sbjct: 1406 ------GTGTTGGAPTSKARKRKVRN-------ASITGVPGCSNIDNLVDCHSCDSPRI 1451


>ref|XP_010268079.1| PREDICTED: uncharacterized protein LOC104605144 isoform X1 [Nelumbo
            nucifera] gi|720038747|ref|XP_010268080.1| PREDICTED:
            uncharacterized protein LOC104605144 isoform X1 [Nelumbo
            nucifera] gi|720038750|ref|XP_010268081.1| PREDICTED:
            uncharacterized protein LOC104605144 isoform X1 [Nelumbo
            nucifera]
          Length = 1512

 Score =  211 bits (538), Expect = 8e-52
 Identities = 177/479 (36%), Positives = 231/479 (48%), Gaps = 32/479 (6%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE GAE DLQMHPLLFQ+PEDG  PY  +                 LQTNL+LL K    
Sbjct: 1030 EEKGAEPDLQMHPLLFQAPEDGSFPYYPLKCGTASSAFAFLPQNQ-LQTNLNLLCKPHP- 1087

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP------------LQGPCVQ 1068
               VD  + +LRSKE   + C IDFHPLL+  DNIN   +              QG   Q
Sbjct: 1088 NPQVDSINKSLRSKETSLSSC-IDFHPLLRKTDNINDSVDASSTTNFSINLTSFQGNSAQ 1146

Query: 1067 IPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRS 888
              NPS+  L  P V   Q AT T  TS +EKANELDL+IHLSS++    +G R   EHRS
Sbjct: 1147 SQNPSDCVLIDPQVRCCQLATGTVPTSSFEKANELDLEIHLSSSSR---IGCRGLTEHRS 1203

Query: 887  NGSSIRAREDGTI------EENQQEESRTKADSST------HVMDAHELALSSIGISRLT 744
             G  I A + G +         Q  +  T A  S       H +    +   S  I+  T
Sbjct: 1204 KGQQISALDCGPMVGKVSSPSYQSSKHYTAASVSNKQCNKEHALGTRAMVQESRNINIYT 1263

Query: 743  EDNLDEQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEGEGLGYRELDNLQNKELPS 567
            EDN  +QS P IVMEQEELSDSD+++G+ V+FECEEMADSEGE   + +  N+QNK++  
Sbjct: 1264 EDNTGDQSLPEIVMEQEELSDSDDEIGENVQFECEEMADSEGEETDHEQFLNIQNKDVLP 1323

Query: 566  VALEEERTSTDANNGSESILMRCDPKKNIL-VGDSNTYSQKLVLTAKSNNI-GSSTQSNS 393
            VA+E+    T A +  +  L  C P+       +S+T S KL  T K  +I G   QS S
Sbjct: 1324 VAVEDV-ARTAACDDQQCELRICGPQAIACDATESSTASCKLGFTKKCKDIRGRVLQSTS 1382

Query: 392  GSCTLGPSSSTLKRERRNRGHRET-----SNVVQSRPVRSSKRKPNVDRVAVHSQEVPQL 228
                LG  +S    E    G+ +T      N + SRP RSS++     +     Q     
Sbjct: 1383 D--PLGYLNSPRPSEESRNGNDQTGKSCLENGLPSRPKRSSRKMMPYSKAGTAEQH---- 1436

Query: 227  VLNPTTAECSTTIASTKKPKKRGCRSNPEGVEMGNYKHVSSENMVDHPDDCGMRDGQKI 51
                     +T  A T K +KR  R+        +   V   + +D+  DC   D  +I
Sbjct: 1437 ------GTGTTGGAPTSKARKRKVRN-------ASITGVPGCSNIDNLVDCHSCDSPRI 1482


>ref|XP_007026080.1| Homeodomain-like superfamily protein, putative isoform 3 [Theobroma
            cacao] gi|508781446|gb|EOY28702.1| Homeodomain-like
            superfamily protein, putative isoform 3 [Theobroma cacao]
          Length = 1402

 Score =  192 bits (488), Expect = 5e-46
 Identities = 155/455 (34%), Positives = 219/455 (48%), Gaps = 24/455 (5%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    QQ 
Sbjct: 963  EERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQT 1022

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCD----------PEPLQGPCVQIP 1062
              +V+    +L+ K++ S  C IDFHPLLQ  D+ N +             L G  V   
Sbjct: 1023 NHSVESLTRSLKMKDSVSISCGIDFHPLLQRTDDTNSELVTECSTASLSVNLDGKSVAPC 1082

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE--VMGRRHRFEHRS 888
            NPSN+     +     +AT +  +SP EKANELDL+IHLSS +++E   +       H++
Sbjct: 1083 NPSNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKN 1142

Query: 887  NGSSIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGI 708
            +  S+   ++       +    T +  +  V  A    + S    R  +D  D QS+  I
Sbjct: 1143 SAVSLLNSQNAA-----ETRDTTHSSGNKFVSGARASTIPSKTTGRYMDDTSD-QSHLEI 1196

Query: 707  VMEQEELSDSDEDVGD-VEFECEEMADSEGEGLGYRELDNLQNKELPSVALEEERTSTDA 531
            VMEQEELSDSDE+  + VEFECEEMADSEGEG G  ++  +Q+KE       +  T  D 
Sbjct: 1197 VMEQEELSDSDEEFEEHVEFECEEMADSEGEGSGCEQVSEMQDKEAEGSTTRKTVTDEDF 1256

Query: 530  NNGSESILMRCDPKKNILVGDSNTYS-QKLVLTAKSNNIGSSTQSNSGSCTLGPSSSTLK 354
            NN  + +  RC+ + NI V +  T    KL LT    +  SS  S   S +   S S  K
Sbjct: 1257 NNQQQELSTRCNSQGNICVPEKGTPPFLKLGLTCPRKDASSSWLSLDSSASGRTSRSKPK 1316

Query: 353  RERRNRGHRETSNVVQS----RPVR---SSKRKPNVDRVAVHSQEVPQLVLNPTTAECST 195
             E         +  + S    RP++    S RK  V   A+   E  QL L P       
Sbjct: 1317 NEVSTISKGPPTKTLASYRLNRPLKHATPSTRKVTVQEHAIDMAE--QLSLGP------L 1368

Query: 194  TIASTKKPKKRGCRSNP---EGVEMGNYKHVSSEN 99
            ++ + +KP+KR  R+N     G  +GN K+ + ++
Sbjct: 1369 SVPTLRKPRKR--RANTIANTGSSLGNPKNDAKDS 1401


>ref|XP_007026078.1| Homeodomain-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508781444|gb|EOY28700.1| Homeodomain-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 1463

 Score =  192 bits (488), Expect = 5e-46
 Identities = 155/455 (34%), Positives = 219/455 (48%), Gaps = 24/455 (5%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    QQ 
Sbjct: 1024 EERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQT 1083

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCD----------PEPLQGPCVQIP 1062
              +V+    +L+ K++ S  C IDFHPLLQ  D+ N +             L G  V   
Sbjct: 1084 NHSVESLTRSLKMKDSVSISCGIDFHPLLQRTDDTNSELVTECSTASLSVNLDGKSVAPC 1143

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE--VMGRRHRFEHRS 888
            NPSN+     +     +AT +  +SP EKANELDL+IHLSS +++E   +       H++
Sbjct: 1144 NPSNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKN 1203

Query: 887  NGSSIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGI 708
            +  S+   ++       +    T +  +  V  A    + S    R  +D  D QS+  I
Sbjct: 1204 SAVSLLNSQNAA-----ETRDTTHSSGNKFVSGARASTIPSKTTGRYMDDTSD-QSHLEI 1257

Query: 707  VMEQEELSDSDEDVGD-VEFECEEMADSEGEGLGYRELDNLQNKELPSVALEEERTSTDA 531
            VMEQEELSDSDE+  + VEFECEEMADSEGEG G  ++  +Q+KE       +  T  D 
Sbjct: 1258 VMEQEELSDSDEEFEEHVEFECEEMADSEGEGSGCEQVSEMQDKEAEGSTTRKTVTDEDF 1317

Query: 530  NNGSESILMRCDPKKNILVGDSNTYS-QKLVLTAKSNNIGSSTQSNSGSCTLGPSSSTLK 354
            NN  + +  RC+ + NI V +  T    KL LT    +  SS  S   S +   S S  K
Sbjct: 1318 NNQQQELSTRCNSQGNICVPEKGTPPFLKLGLTCPRKDASSSWLSLDSSASGRTSRSKPK 1377

Query: 353  RERRNRGHRETSNVVQS----RPVR---SSKRKPNVDRVAVHSQEVPQLVLNPTTAECST 195
             E         +  + S    RP++    S RK  V   A+   E  QL L P       
Sbjct: 1378 NEVSTISKGPPTKTLASYRLNRPLKHATPSTRKVTVQEHAIDMAE--QLSLGP------L 1429

Query: 194  TIASTKKPKKRGCRSNP---EGVEMGNYKHVSSEN 99
            ++ + +KP+KR  R+N     G  +GN K+ + ++
Sbjct: 1430 SVPTLRKPRKR--RANTIANTGSSLGNPKNDAKDS 1462


>ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Populus trichocarpa]
            gi|550312453|gb|ERP48538.1| hypothetical protein
            POPTR_0021s00740g [Populus trichocarpa]
          Length = 1441

 Score =  190 bits (482), Expect = 3e-45
 Identities = 154/453 (33%), Positives = 220/453 (48%), Gaps = 36/453 (7%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            + EE G +SDLQMHPLLFQ+PE GCLPY  ++           F G   Q NL L     
Sbjct: 972  TAEERGTDSDLQMHPLLFQAPEGGCLPYLPLSCSSGTSSSFSFFSGNQPQLNLSLFHNPL 1031

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNIN-------CDPEP---LQGPCVQ 1068
            QA   VD F+ + +SK++ S  C+IDFHPLLQ  D  N        +P     L G   Q
Sbjct: 1032 QANHVVDGFNKSSKSKDSTSASCSIDFHPLLQRTDEENNNLVMACSNPNQFVCLSGESAQ 1091

Query: 1067 IPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRH----RF 900
              N   +  +   V +   A +   +S  EKAN+LDLDIHLSS +++EV  R        
Sbjct: 1092 FQNHFGAVQNKSFVNNIPIAVDPKHSSSNEKANDLDLDIHLSSNSAKEVSERSRDVGANN 1151

Query: 899  EHRSNGS---SIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLD 729
            + RS  S   S R  E   I   + + +      S  V  A    + S  +S    D + 
Sbjct: 1152 QPRSTTSEPKSGRRMETCKINSPRDQHNEHPTVHSNLVSGADASPVQSNNVSTCNMDVVG 1211

Query: 728  EQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALE 555
            +QS+P IVMEQEELSDSDE++ + V+FECEEMADS+G EG G   +  +Q+K+  S A+E
Sbjct: 1212 DQSHPEIVMEQEELSDSDEEIEENVDFECEEMADSDGEEGAGCEPVAEVQDKDAQSFAME 1271

Query: 554  EERTSTD------------ANNGSESILMRCDPKKNI---LVGDSNTYSQKLVLTAKS-- 426
            E   + D             + G  SIL +  P  N+    +G   T S  L L +++  
Sbjct: 1272 EVTNAEDYGDQQWKLRSPVHSRGKPSILRKGSPLLNLSLTSLGKETTSSSWLSLDSRAAV 1331

Query: 425  NNIGSSTQSNSGSCTLGPSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHS 246
            ++    T    G+    P++  L   R NR  ++T+      P+   + + NV  +A   
Sbjct: 1332 DSPRMKTLHEKGAINDSPAAKNLSPCRPNRLCKKTT------PITKVETQKNVSDMA--- 1382

Query: 245  QEVPQLVLNPTTAECSTTIASTKKPKKRGCRSN 147
                QL L P        +++ +KP+KR CR+N
Sbjct: 1383 ---QQLSLGP------LAVSTLRKPRKRMCRTN 1406


>ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citrus clementina]
            gi|557530393|gb|ESR41576.1| hypothetical protein
            CICLE_v10010907mg [Citrus clementina]
          Length = 1424

 Score =  190 bits (482), Expect = 3e-45
 Identities = 146/436 (33%), Positives = 217/436 (49%), Gaps = 21/436 (4%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G E DLQMHPLLFQ+PEDG LPY  +N           F G   Q NL L    +Q 
Sbjct: 992  EERGTEPDLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNPRQL 1051

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSAD--NINCDPEP--------LQGPCVQIP 1062
              A+  F+ +L++KE+ S  C IDFHPLL+  +  N N    P         +    Q  
Sbjct: 1052 SHALSCFNKSLKTKESTSGSCVIDFHPLLKRTEVANNNLVTTPSNARISVGSERKSDQHK 1111

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NP ++  S   V +  +A N+  +S  EK+NELDL+IHLSS++++E  +G R    H   
Sbjct: 1112 NPFDALQSKTSVSNGPFAANSVPSSINEKSNELDLEIHLSSSSAKERALGNREMAPHNLM 1171

Query: 884  GSSIRARE-DGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGI 708
             S   A   D T+ +N         ++ + V      ++ + G      D++ + S+P I
Sbjct: 1172 QSMTVANSGDKTVTQNNDNLHYQYGENYSQVASNGHFSVQTTG----NIDDIGDHSHPEI 1227

Query: 707  VMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEEERTSTD 534
            VMEQEELSDSDE++ + VEFECEEM DSEG EG G  ++  +Q KE+PS+  E+   +TD
Sbjct: 1228 VMEQEELSDSDEEIEEHVEFECEEMTDSEGEEGSGCEQITEMQEKEVPSLMTEK---ATD 1284

Query: 533  ANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNS-----GSCTLGPS 369
             ++  +   +R      +    ++       L     N+G  T S+S      S    P 
Sbjct: 1285 GDSDDQQHELR--SSHGLCSAPASRKGSSPFLKLGLTNLGKDTASSSWLSLNSSAPGNPI 1342

Query: 368  SSTLKRERRNRGHRETSNVVQSRPVRSSKR-KPNVDRVAVHSQEVPQLVLNPTTAECS-T 195
             +  K    +      + ++ SRP+RS K+  P+  +VA       Q+     T + S +
Sbjct: 1343 CTKSKNSEDSISGGPAAKIMASRPIRSCKKVSPSSKKVAT------QMHATDMTEQLSLS 1396

Query: 194  TIASTKKPKKRGCRSN 147
            ++A     KKRGCR+N
Sbjct: 1397 SLAVQTVRKKRGCRTN 1412


>ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624036 isoform X1 [Citrus
            sinensis] gi|568853408|ref|XP_006480351.1| PREDICTED:
            uncharacterized protein LOC102624036 isoform X2 [Citrus
            sinensis]
          Length = 1424

 Score =  189 bits (479), Expect = 6e-45
 Identities = 145/436 (33%), Positives = 217/436 (49%), Gaps = 21/436 (4%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G + DLQMHPLLFQ+PEDG LPY  +N           F G   Q NL L    +Q 
Sbjct: 992  EERGTQPDLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNPRQL 1051

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSAD--NINCDPEP--------LQGPCVQIP 1062
              A+  F+ +L++KE+ S  C IDFHPLL+  +  N N    P         +    Q  
Sbjct: 1052 SHALSCFNKSLKTKESTSGSCVIDFHPLLKRTEVANNNLVTTPSNARISVGSERKSDQHK 1111

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NP ++  S   V +  +A N+  +S  EK+NELDL+IHLSS++++E  +G R    H   
Sbjct: 1112 NPFDALQSKTSVSNGPFAANSVPSSINEKSNELDLEIHLSSSSAKERALGNREMAPHNLM 1171

Query: 884  GSSIRARE-DGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGI 708
             S   A   D T+ +N         ++ + V      ++ + G      D++ + S+P I
Sbjct: 1172 QSMTVANSGDKTVTQNNDNLHYQYGENYSQVASNGHFSVQTTG----NIDDIGDHSHPEI 1227

Query: 707  VMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEEERTSTD 534
            VMEQEELSDSDE++ + VEFECEEM DSEG EG G  ++  +Q KE+PS+  E+   +TD
Sbjct: 1228 VMEQEELSDSDEEIEEHVEFECEEMTDSEGEEGSGCEQITEMQEKEVPSLMTEK---ATD 1284

Query: 533  ANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNS-----GSCTLGPS 369
             ++  +   +R      +    ++       L     N+G  T S+S      S    P 
Sbjct: 1285 GDSDDQQHELR--SSHGLCSAPASRKGSSPFLKLGLTNLGKDTASSSWLSLNSSAPGNPI 1342

Query: 368  SSTLKRERRNRGHRETSNVVQSRPVRSSKR-KPNVDRVAVHSQEVPQLVLNPTTAECS-T 195
             +  K    +      + ++ SRP+RS K+  P+  +VA       Q+     T + S +
Sbjct: 1343 CTKSKNSEDSISGGPAAKIMASRPIRSCKKVSPSSKKVAT------QMHATDMTEQLSLS 1396

Query: 194  TIASTKKPKKRGCRSN 147
            ++A     KKRGCR+N
Sbjct: 1397 SLAVQTVRKKRGCRTN 1412


>ref|XP_011047989.1| PREDICTED: uncharacterized protein LOC105142175 [Populus euphratica]
          Length = 1427

 Score =  187 bits (476), Expect = 1e-44
 Identities = 151/453 (33%), Positives = 219/453 (48%), Gaps = 36/453 (7%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            + EE G +SDLQMHPLLFQ+PE GCLPY  ++           F G   Q NL L     
Sbjct: 957  AAEERGTDSDLQMHPLLFQAPEGGCLPYYPLSCSSGTSSSFSFFSGNQPQLNLSLFHNPL 1016

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNIN-------CDPEP---LQGPCVQ 1068
            QA   VD F+ + +SK++ S  C+IDFHPLLQ  D  N        +P     L G   Q
Sbjct: 1017 QANHVVDGFNKSSKSKDSTSASCSIDFHPLLQRTDEENNNLVMACSNPNQFVCLSGESAQ 1076

Query: 1067 IPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRH----RF 900
              N   +  +   V     A +  L+S  EKAN+LDLDIHLSS +++EV  R        
Sbjct: 1077 FQNHFGAVQNKSFVNHIPIAVDPKLSSSNEKANDLDLDIHLSSNSAKEVSERSRDVGANT 1136

Query: 899  EHRSNGS---SIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLD 729
            + RS  S   S R  E   I   + + +      S  V       + S  +S    D++ 
Sbjct: 1137 QPRSTTSEPKSGRRMETCKINSPRDKHNEHPTVHSNLVSGVDASPVQSNNVSTCNMDDVG 1196

Query: 728  EQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALE 555
            +QS+P IVMEQEELSDSDE++ + V+FECEEMADS+G EG     +  +Q+K+  + ++E
Sbjct: 1197 DQSHPEIVMEQEELSDSDEEIEENVDFECEEMADSDGEEGAACEPVAEVQDKDAQNFSME 1256

Query: 554  EERTSTD------------ANNGSESILMRCDPKKNI---LVGDSNTYSQKLVLTAKS-- 426
            E   + D             + G  SIL +  P  N+    +G   T S  L L +++  
Sbjct: 1257 EVTNAEDNGDQQWKLRSPVHSRGKPSILRKGSPLLNLSLTSLGKETTSSSWLSLDSRAAV 1316

Query: 425  NNIGSSTQSNSGSCTLGPSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHS 246
            ++    T    G+    P++  L   R NR  ++T+      P+   + + NV  +A   
Sbjct: 1317 DSPRMKTLHEKGAINDSPAAKNLSPCRPNRLCKKTTT-----PITKVETQKNVSDMA--- 1368

Query: 245  QEVPQLVLNPTTAECSTTIASTKKPKKRGCRSN 147
                QL L P        +++ +KP+KR CR+N
Sbjct: 1369 ---QQLSLGP------LAVSTLRKPRKRMCRTN 1392


>ref|XP_007026079.1| Homeodomain-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508781445|gb|EOY28701.1| Homeodomain-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 1374

 Score =  185 bits (469), Expect = 8e-44
 Identities = 150/445 (33%), Positives = 213/445 (47%), Gaps = 14/445 (3%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    QQ 
Sbjct: 963  EERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQT 1022

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEPLQGPCVQIPNPSNSSLSAP 1032
              +V+    +L+ K++ S  C IDFHPLLQ  D+                  +NS L   
Sbjct: 1023 NHSVESLTRSLKMKDSVSISCGIDFHPLLQRTDD------------------TNSELMKS 1064

Query: 1031 MVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE--VMGRRHRFEHRSNGSSIRARED 858
            +     +AT +  +SP EKANELDL+IHLSS +++E   +       H+++  S+   ++
Sbjct: 1065 VAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKNSAVSLLNSQN 1124

Query: 857  GTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGIVMEQEELSDS 678
                   +    T +  +  V  A    + S    R  +D  D QS+  IVMEQEELSDS
Sbjct: 1125 AA-----ETRDTTHSSGNKFVSGARASTIPSKTTGRYMDDTSD-QSHLEIVMEQEELSDS 1178

Query: 677  DEDVGD-VEFECEEMADSEGEGLGYRELDNLQNKELPSVALEEERTSTDANNGSESILMR 501
            DE+  + VEFECEEMADSEGEG G  ++  +Q+KE       +  T  D NN  + +  R
Sbjct: 1179 DEEFEEHVEFECEEMADSEGEGSGCEQVSEMQDKEAEGSTTRKTVTDEDFNNQQQELSTR 1238

Query: 500  CDPKKNILVGDSNTYS-QKLVLTAKSNNIGSSTQSNSGSCTLGPSSSTLKRERRNRGHRE 324
            C+ + NI V +  T    KL LT    +  SS  S   S +   S S  K E        
Sbjct: 1239 CNSQGNICVPEKGTPPFLKLGLTCPRKDASSSWLSLDSSASGRTSRSKPKNEVSTISKGP 1298

Query: 323  TSNVVQS----RPVR---SSKRKPNVDRVAVHSQEVPQLVLNPTTAECSTTIASTKKPKK 165
             +  + S    RP++    S RK  V   A+   E  QL L P       ++ + +KP+K
Sbjct: 1299 PTKTLASYRLNRPLKHATPSTRKVTVQEHAIDMAE--QLSLGP------LSVPTLRKPRK 1350

Query: 164  RGCRSNP---EGVEMGNYKHVSSEN 99
            R  R+N     G  +GN K+ + ++
Sbjct: 1351 R--RANTIANTGSSLGNPKNDAKDS 1373


>gb|KHG10856.1| 30S ribosomal S5, chloroplastic [Gossypium arboreum]
          Length = 756

 Score =  184 bits (467), Expect = 1e-43
 Identities = 158/457 (34%), Positives = 211/457 (46%), Gaps = 30/457 (6%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            S  +    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    Q
Sbjct: 322  SVAKESTRTDLQMHPLLFQAPEDGQVPYYPLNCGAGASSSFSLFSGNQPQLNLSLFYNPQ 381

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCD---PEPLQGPCVQI------ 1065
            QA           + KE+ S    IDFHPLLQ  D  N +      +  P V +      
Sbjct: 382  QAK----------KMKESVSGSYGIDFHPLLQRTDETNSELITSGSIASPSVGLDGKSAA 431

Query: 1064 PNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRSN 885
            PNPSN+    P+V    +A  +  +SP EKANELDL+IHLSS++++E             
Sbjct: 432  PNPSNAVQMRPVVHYSPFAARSRPSSPNEKANELDLEIHLSSSSAKENAALCRGVTAHPT 491

Query: 884  GSSIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGIV 705
             SS+R +      E Q       +  +  V       +SS  I R  +D  D QS+P IV
Sbjct: 492  NSSVRLQNSHNATETQ---DTFHSSGNKFVSGGCASTISSKFIGRYIDDGSD-QSHPEIV 547

Query: 704  MEQEELSDSDEDVGD-VEFECEEMADSEGEG-LGYRELDNLQNKELPSVALEEERTSTDA 531
            MEQEELSDSDEDV + VEFECEEMADSEGEG  G  ++  +Q+K+       E     D 
Sbjct: 548  MEQEELSDSDEDVEEHVEFECEEMADSEGEGDSGCEQVSEMQDKDAQGSVTREIVMDEDC 607

Query: 530  N--------NGSESILMRCDP--------KKNILVGDSNTYSQKLVLTAKSNNIGSSTQS 399
            N        +G++S    CDP        K        +  S  L L A ++   S  + 
Sbjct: 608  NDQQWELSIHGNKSQNNVCDPESRSPSFLKAGSTCPKKDKSSSWLSLDASASGRTSRAKP 667

Query: 398  NSGSCTLGPSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHSQEVPQLVLN 219
             + + T+   + T    + +  HR T    Q+ P   S RK  +   AV   E  QL L 
Sbjct: 668  KNEASTMSKCTPT----KTSASHRTTRPSKQATP---STRKVTLQEHAVDMAE--QLSLG 718

Query: 218  PTTAECSTTIASTKKPKKRGCRSNP---EGVEMGNYK 117
            P +A  S      +KP+KR CR+N     G  +GN K
Sbjct: 719  PLSAPTS------RKPRKRTCRANKITNVGTSLGNSK 749


>ref|XP_010655394.1| PREDICTED: uncharacterized protein LOC100247051 isoform X2 [Vitis
            vinifera]
          Length = 1487

 Score =  182 bits (461), Expect = 7e-43
 Identities = 151/448 (33%), Positives = 223/448 (49%), Gaps = 32/448 (7%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G ESDL MHPLLFQ+ EDG LPY   N           F G   Q NL L     QA
Sbjct: 1023 EERGIESDLHMHPLLFQASEDGRLPYYPFNCSHGPSNSFSFFSGNQSQVNLSLFHNPHQA 1082

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNI-------------NCDPEPLQGPCV 1071
               V+ F+ +L+SKE+  + C IDFHPLLQ +D+I             + D E  +G   
Sbjct: 1083 NPKVNSFYKSLKSKESTPS-CGIDFHPLLQRSDDIDNDLVTSRPTGQLSFDLESFRGKRA 1141

Query: 1070 QIPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEH 894
            Q+ N  ++ L+ P V      + T  +      NELDL+IHLSS +  E V+G  +  E+
Sbjct: 1142 QLQNSFDAVLTEPRVNSAPPRSGTKPSCLDGIENELDLEIHLSSTSKTEKVVGSTNVTEN 1201

Query: 893  R--------SNGSSIRAREDGTIEENQQEESRTKADSSTHV-----MDAHELALSSIGIS 753
                     ++G+++ A ++ + + +QQ + R    S   V       A  L L S  I 
Sbjct: 1202 NQRKSASTLNSGTAVEA-QNSSSQYHQQSDHRPSVSSPLEVRGKLISGACALVLPSNDIL 1260

Query: 752  RLTEDNLDEQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNK 579
                DN+ +QS P IVMEQEELSDSDE++G+ VEFECEEMADSEG E     ++ +LQ+K
Sbjct: 1261 ----DNIGDQSLPEIVMEQEELSDSDEEIGEHVEFECEEMADSEGEESSDSEQIVDLQDK 1316

Query: 578  ELPSVALEEERTSTDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIG-SSTQ 402
             +P V +E+     D +N         +P+ N  +   +T   +L  T +  +   SS+ 
Sbjct: 1317 VVPIVEMEKLVPDVDFDNEQCEPRRIDNPQSNDCITKDSTSPVRLGSTGQERDTRCSSSW 1376

Query: 401  SNSGSCTLG--PSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHSQEVPQL 228
             +  SC  G  P +     +  N    +  N    RP RSS++   + +  V +Q+ P +
Sbjct: 1377 LSLNSCPPGCPPQAKAHCIQSSNEEGPDMKNQEPPRPNRSSRKTTPIPKY-VAAQKQP-M 1434

Query: 227  VLNPTTAECSTTIASTKKPKKRGCRSNP 144
             + P   + S  +   +KP+KR  R++P
Sbjct: 1435 NMPPQLGQDSLAVIPVRKPRKRSGRTHP 1462


>ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247051 isoform X1 [Vitis
            vinifera] gi|731404334|ref|XP_010655393.1| PREDICTED:
            uncharacterized protein LOC100247051 isoform X1 [Vitis
            vinifera]
          Length = 1514

 Score =  182 bits (461), Expect = 7e-43
 Identities = 151/448 (33%), Positives = 223/448 (49%), Gaps = 32/448 (7%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G ESDL MHPLLFQ+ EDG LPY   N           F G   Q NL L     QA
Sbjct: 1050 EERGIESDLHMHPLLFQASEDGRLPYYPFNCSHGPSNSFSFFSGNQSQVNLSLFHNPHQA 1109

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNI-------------NCDPEPLQGPCV 1071
               V+ F+ +L+SKE+  + C IDFHPLLQ +D+I             + D E  +G   
Sbjct: 1110 NPKVNSFYKSLKSKESTPS-CGIDFHPLLQRSDDIDNDLVTSRPTGQLSFDLESFRGKRA 1168

Query: 1070 QIPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEH 894
            Q+ N  ++ L+ P V      + T  +      NELDL+IHLSS +  E V+G  +  E+
Sbjct: 1169 QLQNSFDAVLTEPRVNSAPPRSGTKPSCLDGIENELDLEIHLSSTSKTEKVVGSTNVTEN 1228

Query: 893  R--------SNGSSIRAREDGTIEENQQEESRTKADSSTHV-----MDAHELALSSIGIS 753
                     ++G+++ A ++ + + +QQ + R    S   V       A  L L S  I 
Sbjct: 1229 NQRKSASTLNSGTAVEA-QNSSSQYHQQSDHRPSVSSPLEVRGKLISGACALVLPSNDIL 1287

Query: 752  RLTEDNLDEQSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNK 579
                DN+ +QS P IVMEQEELSDSDE++G+ VEFECEEMADSEG E     ++ +LQ+K
Sbjct: 1288 ----DNIGDQSLPEIVMEQEELSDSDEEIGEHVEFECEEMADSEGEESSDSEQIVDLQDK 1343

Query: 578  ELPSVALEEERTSTDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIG-SSTQ 402
             +P V +E+     D +N         +P+ N  +   +T   +L  T +  +   SS+ 
Sbjct: 1344 VVPIVEMEKLVPDVDFDNEQCEPRRIDNPQSNDCITKDSTSPVRLGSTGQERDTRCSSSW 1403

Query: 401  SNSGSCTLG--PSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHSQEVPQL 228
             +  SC  G  P +     +  N    +  N    RP RSS++   + +  V +Q+ P +
Sbjct: 1404 LSLNSCPPGCPPQAKAHCIQSSNEEGPDMKNQEPPRPNRSSRKTTPIPKY-VAAQKQP-M 1461

Query: 227  VLNPTTAECSTTIASTKKPKKRGCRSNP 144
             + P   + S  +   +KP+KR  R++P
Sbjct: 1462 NMPPQLGQDSLAVIPVRKPRKRSGRTHP 1489


>ref|XP_002518479.1| conserved hypothetical protein [Ricinus communis]
            gi|223542324|gb|EEF43866.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1399

 Score =  182 bits (461), Expect = 7e-43
 Identities = 154/452 (34%), Positives = 207/452 (45%), Gaps = 21/452 (4%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            + EE G ESDLQMHPLLFQSPEDG L Y  ++           F     Q NL L   S+
Sbjct: 965  AAEERGTESDLQMHPLLFQSPEDGRLSYYPLSCSTGASSSFTFFSANQPQLNLSLFHSSR 1024

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP----------LQGPCVQ 1068
             A   VD F+ + ++ E+ S  C IDFHPLLQ A+  N D             L G   Q
Sbjct: 1025 PANHTVDCFNKSSKTGESTSASCGIDFHPLLQRAEEENIDFATSCSIAHQYVCLGGKSAQ 1084

Query: 1067 IPNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRS 888
              NP  +  +   V      T +   S  EKANELDL+IHLSS ++ E            
Sbjct: 1085 PQNPLGAVQTKSPVNSGPSTTGSKPPSSIEKANELDLEIHLSSMSAVE------------ 1132

Query: 887  NGSSIRAREDGTIEENQQEESRTKADSSTHVMD----AHELALSSIGISRLTEDNLDEQS 720
                 + R    +  + Q E  T A +S + +D    A  +A+ S   +R   ++  +Q+
Sbjct: 1133 -----KTRGSRDVGASNQLEPSTSAPNSGNTIDKDKSADAIAVQSNNDARCDMEDKGDQA 1187

Query: 719  NPGIVMEQEELSDSDEDVGD-VEFECEEMADSEGEG-LGYRELDNLQNKELPSVALEEER 546
             P IVMEQEELSDSDE+  + VEFECEEMADS+GE  LG   +  +Q+KE PS+A+EE  
Sbjct: 1188 PPEIVMEQEELSDSDEETEEHVEFECEEMADSDGEEVLGCEPIAEVQDKEFPSIAMEEVT 1247

Query: 545  TSTDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNSGSC-TLGPS 369
            T  D  N          P  N       +   KL L +   +  +S+     SC ++ P 
Sbjct: 1248 TDADYGNKQCEWSSPVHPTGNTSTPRKGSTFLKLNLKSLGRDATNSSWLTLDSCASVDPP 1307

Query: 368  SSTLKRERRNRG-HRETSNVVQSRPVRSSKRKPNVDRVAVHSQEV---PQLVLNPTTAEC 201
            S   K E    G      N+   R  RS K+  +    A     V    QL L       
Sbjct: 1308 SRKAKHEECILGVCPVVKNLASGRSNRSCKKLTSTKSGATEKDVVDMAQQLSLG------ 1361

Query: 200  STTIASTKKPKKRGCRSNPEGVEMGNYKHVSS 105
               +++ KKP+KR  R+N  G+  G     SS
Sbjct: 1362 LLAVSTLKKPRKRASRTN-TGLSTGRINETSS 1392


>ref|XP_012454018.1| PREDICTED: uncharacterized protein LOC105776090 [Gossypium raimondii]
          Length = 1452

 Score =  180 bits (457), Expect = 2e-42
 Identities = 158/457 (34%), Positives = 209/457 (45%), Gaps = 30/457 (6%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            S  +    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    Q
Sbjct: 1018 SVAKESTRTDLQMHPLLFQAPEDGQVPYYPLNCGAGASSSFSLFSGNQPQLNLSLFYNPQ 1077

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCD---PEPLQGPCVQI------ 1065
            QA           + KE+ S    IDFHPLLQ  D  N +      +  P V +      
Sbjct: 1078 QAK----------KMKESVSASYGIDFHPLLQRTDETNNELITSGSIASPSVGLDGKSAA 1127

Query: 1064 PNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRSN 885
            PNPSN+    P+V    +A  +  +SP EKANELDL+IHLSS++++E             
Sbjct: 1128 PNPSNAVQMRPVVHYSPFAARSRPSSPNEKANELDLEIHLSSSSAKENAALSRGVTPHPT 1187

Query: 884  GSSIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGIV 705
             SS+R        E Q       +  +  V       +SS  I R  +D  D QS+P IV
Sbjct: 1188 NSSVRLLNSHNATETQ---DTFHSSGNKFVSGGCASTISSKVIGRYIDDGSD-QSHPEIV 1243

Query: 704  MEQEELSDSDEDVGD-VEFECEEMADSEGEG-LGYRELDNLQNKELPSVALEEERTSTDA 531
            MEQEELSDSDEDV + VEFECEEMADSEGEG  G  ++  +Q+K+       E     D 
Sbjct: 1244 MEQEELSDSDEDVEEHVEFECEEMADSEGEGDSGCEQVSEMQDKDAQGSVTREIVMDEDC 1303

Query: 530  N--------NGSESILMRCDP--------KKNILVGDSNTYSQKLVLTAKSNNIGSSTQS 399
            N        +G +S    CDP        K        +  S  L L A ++   S  + 
Sbjct: 1304 NDQQWELSIHGYKSQNNVCDPESRSPSFLKTGSTCPKKDKSSSWLSLDASASGRTSRAKP 1363

Query: 398  NSGSCTLGPSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHSQEVPQLVLN 219
             + + T+   + T    + +  HR T    Q+ P   S RK  +   AV   E  QL L 
Sbjct: 1364 KNEASTISKCTPT----KTSASHRTTRPSKQATP---STRKVALQEHAVDMAE--QLSLG 1414

Query: 218  PTTAECSTTIASTKKPKKRGCRSNP---EGVEMGNYK 117
            P +A  S      +KP+KR CR+N     G  +GN K
Sbjct: 1415 PLSAPTS------RKPRKRTCRANKITNVGTSLGNSK 1445


>gb|KJB69277.1| hypothetical protein B456_011G014000, partial [Gossypium raimondii]
          Length = 1469

 Score =  180 bits (457), Expect = 2e-42
 Identities = 158/457 (34%), Positives = 209/457 (45%), Gaps = 30/457 (6%)
 Frame = -3

Query: 1397 STEESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQ 1218
            S  +    +DLQMHPLLFQ+PEDG +PY  +N           F G   Q NL L    Q
Sbjct: 1035 SVAKESTRTDLQMHPLLFQAPEDGQVPYYPLNCGAGASSSFSLFSGNQPQLNLSLFYNPQ 1094

Query: 1217 QAGGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCD---PEPLQGPCVQI------ 1065
            QA           + KE+ S    IDFHPLLQ  D  N +      +  P V +      
Sbjct: 1095 QAK----------KMKESVSASYGIDFHPLLQRTDETNNELITSGSIASPSVGLDGKSAA 1144

Query: 1064 PNPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRSN 885
            PNPSN+    P+V    +A  +  +SP EKANELDL+IHLSS++++E             
Sbjct: 1145 PNPSNAVQMRPVVHYSPFAARSRPSSPNEKANELDLEIHLSSSSAKENAALSRGVTPHPT 1204

Query: 884  GSSIRAREDGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGIV 705
             SS+R        E Q       +  +  V       +SS  I R  +D  D QS+P IV
Sbjct: 1205 NSSVRLLNSHNATETQ---DTFHSSGNKFVSGGCASTISSKVIGRYIDDGSD-QSHPEIV 1260

Query: 704  MEQEELSDSDEDVGD-VEFECEEMADSEGEG-LGYRELDNLQNKELPSVALEEERTSTDA 531
            MEQEELSDSDEDV + VEFECEEMADSEGEG  G  ++  +Q+K+       E     D 
Sbjct: 1261 MEQEELSDSDEDVEEHVEFECEEMADSEGEGDSGCEQVSEMQDKDAQGSVTREIVMDEDC 1320

Query: 530  N--------NGSESILMRCDP--------KKNILVGDSNTYSQKLVLTAKSNNIGSSTQS 399
            N        +G +S    CDP        K        +  S  L L A ++   S  + 
Sbjct: 1321 NDQQWELSIHGYKSQNNVCDPESRSPSFLKTGSTCPKKDKSSSWLSLDASASGRTSRAKP 1380

Query: 398  NSGSCTLGPSSSTLKRERRNRGHRETSNVVQSRPVRSSKRKPNVDRVAVHSQEVPQLVLN 219
             + + T+   + T    + +  HR T    Q+ P   S RK  +   AV   E  QL L 
Sbjct: 1381 KNEASTISKCTPT----KTSASHRTTRPSKQATP---STRKVALQEHAVDMAE--QLSLG 1431

Query: 218  PTTAECSTTIASTKKPKKRGCRSNP---EGVEMGNYK 117
            P +A  S      +KP+KR CR+N     G  +GN K
Sbjct: 1432 PLSAPTS------RKPRKRTCRANKITNVGTSLGNSK 1462


>ref|XP_012091341.1| PREDICTED: uncharacterized protein LOC105649330 isoform X3 [Jatropha
            curcas]
          Length = 1429

 Score =  174 bits (440), Expect = 2e-40
 Identities = 152/444 (34%), Positives = 207/444 (46%), Gaps = 29/444 (6%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G +SDLQMHPLLFQ+PEDGCL Y   +           F G   Q NL L     QA
Sbjct: 985  EERGNDSDLQMHPLLFQAPEDGCLSYYPPSCSTATPSSFAFFAGNQPQLNLSLFHAPHQA 1044

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP----------LQGPCVQIP 1062
                D  + + ++KE+ S  C IDFHPLLQ     + +             L G   Q  
Sbjct: 1045 NQISDCLNKSSKTKESISASCGIDFHPLLQRTGEESSELATACSNTHQFVCLGGKSAQFQ 1104

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NPS+  +   + ++   AT +  + P EK+NELDL+IHLSS +++E   G R    +   
Sbjct: 1105 NPSD-VVQTKLPVNSPSATASKPSGPNEKSNELDLEIHLSSTSTKEKTKGTRDSASNYQP 1163

Query: 884  GSSIRARED-GTIEENQ------QEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDE 726
               I A     TIE+++      Q         S  V     LA+ S        D++ +
Sbjct: 1164 KLMISAPNPVNTIEKHKPNNPCHQHGENCSTVQSNLVSCGDALAVPSNSDRICNMDDVGD 1223

Query: 725  QSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEE 552
            QS+P I+MEQEELSDSDE+  + VEFE EEMADS+G EGLG   +  + +KE+   A EE
Sbjct: 1224 QSHPEIIMEQEELSDSDEETEEHVEFEREEMADSDGEEGLGGELVTEVPDKEITCSATEE 1283

Query: 551  ERT---STDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNSGSC- 384
              T   ST   +G+ SI  +  P              KL LT+      SS      SC 
Sbjct: 1284 VTTEWKSTIHTDGNSSIPGKASP------------FLKLSLTSMRKESSSSAWLTLDSCA 1331

Query: 383  TLGPSSSTLKRERRNRGHRETS-NVVQSRPVRSSKRKPNVDRVAVHSQEV----PQLVLN 219
             + P     K E    G    +  ++  RP RS K+     R  V  ++V     QL L 
Sbjct: 1332 AVDPPRINAKYEECTIGACPVAKKLISGRPNRSCKKTTQSMRTVVTEKDVMDMAQQLSLG 1391

Query: 218  PTTAECSTTIASTKKPKKRGCRSN 147
            P        +++ KKP+KR CR+N
Sbjct: 1392 P------LAVSTLKKPRKRACRTN 1409


>ref|XP_012091339.1| PREDICTED: uncharacterized protein LOC105649330 isoform X1 [Jatropha
            curcas]
          Length = 1435

 Score =  174 bits (440), Expect = 2e-40
 Identities = 152/444 (34%), Positives = 207/444 (46%), Gaps = 29/444 (6%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G +SDLQMHPLLFQ+PEDGCL Y   +           F G   Q NL L     QA
Sbjct: 991  EERGNDSDLQMHPLLFQAPEDGCLSYYPPSCSTATPSSFAFFAGNQPQLNLSLFHAPHQA 1050

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP----------LQGPCVQIP 1062
                D  + + ++KE+ S  C IDFHPLLQ     + +             L G   Q  
Sbjct: 1051 NQISDCLNKSSKTKESISASCGIDFHPLLQRTGEESSELATACSNTHQFVCLGGKSAQFQ 1110

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NPS+  +   + ++   AT +  + P EK+NELDL+IHLSS +++E   G R    +   
Sbjct: 1111 NPSD-VVQTKLPVNSPSATASKPSGPNEKSNELDLEIHLSSTSTKEKTKGTRDSASNYQP 1169

Query: 884  GSSIRARED-GTIEENQ------QEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDE 726
               I A     TIE+++      Q         S  V     LA+ S        D++ +
Sbjct: 1170 KLMISAPNPVNTIEKHKPNNPCHQHGENCSTVQSNLVSCGDALAVPSNSDRICNMDDVGD 1229

Query: 725  QSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEE 552
            QS+P I+MEQEELSDSDE+  + VEFE EEMADS+G EGLG   +  + +KE+   A EE
Sbjct: 1230 QSHPEIIMEQEELSDSDEETEEHVEFEREEMADSDGEEGLGGELVTEVPDKEITCSATEE 1289

Query: 551  ERT---STDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNSGSC- 384
              T   ST   +G+ SI  +  P              KL LT+      SS      SC 
Sbjct: 1290 VTTEWKSTIHTDGNSSIPGKASP------------FLKLSLTSMRKESSSSAWLTLDSCA 1337

Query: 383  TLGPSSSTLKRERRNRGHRETS-NVVQSRPVRSSKRKPNVDRVAVHSQEV----PQLVLN 219
             + P     K E    G    +  ++  RP RS K+     R  V  ++V     QL L 
Sbjct: 1338 AVDPPRINAKYEECTIGACPVAKKLISGRPNRSCKKTTQSMRTVVTEKDVMDMAQQLSLG 1397

Query: 218  PTTAECSTTIASTKKPKKRGCRSN 147
            P        +++ KKP+KR CR+N
Sbjct: 1398 P------LAVSTLKKPRKRACRTN 1415


>ref|XP_012091340.1| PREDICTED: uncharacterized protein LOC105649330 isoform X2 [Jatropha
            curcas] gi|643703680|gb|KDP20744.1| hypothetical protein
            JCGZ_21215 [Jatropha curcas]
          Length = 1433

 Score =  174 bits (440), Expect = 2e-40
 Identities = 152/444 (34%), Positives = 207/444 (46%), Gaps = 29/444 (6%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G +SDLQMHPLLFQ+PEDGCL Y   +           F G   Q NL L     QA
Sbjct: 989  EERGNDSDLQMHPLLFQAPEDGCLSYYPPSCSTATPSSFAFFAGNQPQLNLSLFHAPHQA 1048

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEP----------LQGPCVQIP 1062
                D  + + ++KE+ S  C IDFHPLLQ     + +             L G   Q  
Sbjct: 1049 NQISDCLNKSSKTKESISASCGIDFHPLLQRTGEESSELATACSNTHQFVCLGGKSAQFQ 1108

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NPS+  +   + ++   AT +  + P EK+NELDL+IHLSS +++E   G R    +   
Sbjct: 1109 NPSD-VVQTKLPVNSPSATASKPSGPNEKSNELDLEIHLSSTSTKEKTKGTRDSASNYQP 1167

Query: 884  GSSIRARED-GTIEENQ------QEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDE 726
               I A     TIE+++      Q         S  V     LA+ S        D++ +
Sbjct: 1168 KLMISAPNPVNTIEKHKPNNPCHQHGENCSTVQSNLVSCGDALAVPSNSDRICNMDDVGD 1227

Query: 725  QSNPGIVMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEE 552
            QS+P I+MEQEELSDSDE+  + VEFE EEMADS+G EGLG   +  + +KE+   A EE
Sbjct: 1228 QSHPEIIMEQEELSDSDEETEEHVEFEREEMADSDGEEGLGGELVTEVPDKEITCSATEE 1287

Query: 551  ERT---STDANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNSGSC- 384
              T   ST   +G+ SI  +  P              KL LT+      SS      SC 
Sbjct: 1288 VTTEWKSTIHTDGNSSIPGKASP------------FLKLSLTSMRKESSSSAWLTLDSCA 1335

Query: 383  TLGPSSSTLKRERRNRGHRETS-NVVQSRPVRSSKRKPNVDRVAVHSQEV----PQLVLN 219
             + P     K E    G    +  ++  RP RS K+     R  V  ++V     QL L 
Sbjct: 1336 AVDPPRINAKYEECTIGACPVAKKLISGRPNRSCKKTTQSMRTVVTEKDVMDMAQQLSLG 1395

Query: 218  PTTAECSTTIASTKKPKKRGCRSN 147
            P        +++ KKP+KR CR+N
Sbjct: 1396 P------LAVSTLKKPRKRACRTN 1413


>gb|KDO45255.1| hypothetical protein CISIN_1g000732mg [Citrus sinensis]
          Length = 1325

 Score =  172 bits (435), Expect = 7e-40
 Identities = 124/347 (35%), Positives = 179/347 (51%), Gaps = 14/347 (4%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            EE G + DLQMHPLLFQ+PEDG LPY  +N           F G   Q NL L    +Q 
Sbjct: 958  EERGTQPDLQMHPLLFQAPEDGRLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNPRQL 1017

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSAD--NINCDPEPLQGP-CV-------QIP 1062
              A+  F+ +L++KE+ S  C IDFHPLL+  +  N N    P     CV       Q  
Sbjct: 1018 SHALSCFNKSLKTKESTSGSCVIDFHPLLKRTEVANNNLVTTPSNARICVGSERKSDQHK 1077

Query: 1061 NPSNSSLSAPMVIDRQWATNTTLTSPYEKANELDLDIHLSSAASRE-VMGRRHRFEHRSN 885
            NP ++  S   V +  +A N+  +S  EK+NELDL+IHLSS++++E  +G R    H   
Sbjct: 1078 NPFDALQSKTSVSNGPFAVNSVPSSINEKSNELDLEIHLSSSSAKERALGNREMAPHNLM 1137

Query: 884  GSSIRARE-DGTIEENQQEESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGI 708
             S   A   D T  +N         ++ + V      ++ + G      D++ + S+P I
Sbjct: 1138 QSMTVANSGDKTETQNNDSLHYQYGENCSQVASNGHFSIQTTG----NIDDIGDHSHPEI 1193

Query: 707  VMEQEELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEEERTSTD 534
            VMEQEELSDSDE++ + VEFECEEM DSEG EG G  ++  +Q KE+PS+  E+   +TD
Sbjct: 1194 VMEQEELSDSDEEIEEHVEFECEEMTDSEGEEGSGCEQITEMQEKEVPSLVTEK---ATD 1250

Query: 533  ANNGSESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNS 393
             ++  +   +R      +    ++       L     N+G  T S+S
Sbjct: 1251 GDSDDQQHELR--SSHGLCGAPASRKGSSPFLKLGLTNLGKDTASSS 1295


>ref|XP_010105693.1| hypothetical protein L484_011305 [Morus notabilis]
            gi|587918207|gb|EXC05724.1| hypothetical protein
            L484_011305 [Morus notabilis]
          Length = 1423

 Score =  169 bits (427), Expect = 6e-39
 Identities = 146/431 (33%), Positives = 208/431 (48%), Gaps = 21/431 (4%)
 Frame = -3

Query: 1391 EESGAESDLQMHPLLFQSPEDGCLPYSLMNYXXXXXXXXXXFLGTPLQTNLDLLSKSQQA 1212
            ++   +SDLQMHPLLFQ+PEDG LPY  +N           F G   Q +L LL   +Q 
Sbjct: 998  DDGNIDSDLQMHPLLFQAPEDGRLPYYPLNCSPSNSSSFSFFSGNQPQLHLSLLHNPRQE 1057

Query: 1211 GGAVDHFHPALRSKEAPSNLCTIDFHPLLQSADNINCDPEPLQGPCVQIPNPSNSSLSAP 1032
               V  F  +L+ K++ S+   IDFHPLLQ  D ++ D   +Q   +   +P  +S    
Sbjct: 1058 -NLVGSFTKSLQLKDSTSSSYGIDFHPLLQRTDYVHGDLIDVQTESLVNADPHTTSKFV- 1115

Query: 1031 MVIDRQWATNTTLTSPYEKANELDLDIHLSSAASREVMGRRHRFEHRSNGSSIRAREDGT 852
                             EKANELDL+IH+SSA+ +E    R+   H    S+  A     
Sbjct: 1116 -----------------EKANELDLEIHISSASRKEGSWNRNETAHNPVRSATNAPNSEF 1158

Query: 851  IEENQQ-------EESRTKADSSTHVMDAHELALSSIGISRLTEDNLDEQSNPGIVMEQE 693
              + Q            + ++ S  V   H   L    I R  +D + +QS+P IVMEQE
Sbjct: 1159 TSKTQNSNRSLYLHNESSPSNISRPVSGGHSSVLPGDNIGRYVDD-MGDQSHPEIVMEQE 1217

Query: 692  ELSDSDEDVGD-VEFECEEMADSEG-EGLGYRELDNLQNKELPSVALEEERTSTDANNGS 519
            ELSDSDE+  + VEFECEEM DSEG EG G  +++ LQ +E  S A+E+  T+ D ++ +
Sbjct: 1218 ELSDSDEENEETVEFECEEMTDSEGDEGSGCEQINELQTEERCSQAMEKLNTA-DCDDKT 1276

Query: 518  ESILMRCDPKKNILVGDSNTYSQKLVLTAKSNNIGSSTQSNSGSCTLGPSSS------TL 357
                 +   + N+ +   N  S +L LT++    G    SNS   +L  S +        
Sbjct: 1277 CESRTKIHYQDNVPISGKNIPSLELGLTSR----GKDDASNSSWLSLDSSGAHHCLAHLK 1332

Query: 356  KRERRN---RGHRETSNVVQSRPVRSSKRKP-NVDRVAVHSQ--EVPQLVLNPTTAECST 195
            K ER N     +  T ++  SRP RSSK+K  ++D V    Q  +  QL L P       
Sbjct: 1333 KSERENTAISANPVTKSLASSRPSRSSKKKNLSMDDVVEQRQNFDGKQLSLAP------L 1386

Query: 194  TIASTKKPKKR 162
             I   +KP+KR
Sbjct: 1387 RIPILRKPRKR 1397

