BLASTX nr result

ID: Forsythia21_contig00005617 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00005617
         (1931 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094389.1| PREDICTED: uncharacterized protein LOC105174...   568   e-159
ref|XP_011077886.1| PREDICTED: uncharacterized protein LOC105161...   563   e-157
emb|CDP15389.1| unnamed protein product [Coffea canephora]            533   e-148
ref|XP_009760140.1| PREDICTED: uncharacterized protein LOC104212...   532   e-148
ref|XP_009602665.1| PREDICTED: uncharacterized protein LOC104097...   530   e-147
ref|XP_012066462.1| PREDICTED: uncharacterized protein LOC105629...   528   e-147
ref|XP_012066461.1| PREDICTED: uncharacterized protein LOC105629...   528   e-147
ref|XP_012840131.1| PREDICTED: uncharacterized protein LOC105960...   528   e-147
gb|KHG28971.1| Lysine--tRNA ligase [Gossypium arboreum]               522   e-145
gb|KHG10982.1| Lysine--tRNA ligase [Gossypium arboreum]               522   e-145
ref|XP_002533549.1| calmodulin binding protein, putative [Ricinu...   520   e-144
ref|XP_010049678.1| PREDICTED: uncharacterized protein LOC104438...   518   e-144
gb|KCW89220.1| hypothetical protein EUGRSUZ_A01525 [Eucalyptus g...   518   e-144
ref|XP_009611163.1| PREDICTED: uncharacterized protein LOC104104...   518   e-144
ref|XP_010110598.1| hypothetical protein L484_001201 [Morus nota...   516   e-143
ref|XP_012467025.1| PREDICTED: uncharacterized protein LOC105785...   515   e-143
gb|KHF97284.1| Pre-mRNA-splicing factor CWC22 [Gossypium arboreum]    514   e-143
ref|XP_009798181.1| PREDICTED: uncharacterized protein LOC104244...   512   e-142
gb|KJB15075.1| hypothetical protein B456_002G158900 [Gossypium r...   510   e-141
ref|XP_006339127.1| PREDICTED: uncharacterized protein LOC102601...   509   e-141

>ref|XP_011094389.1| PREDICTED: uncharacterized protein LOC105174097 [Sesamum indicum]
          Length = 507

 Score =  568 bits (1464), Expect = e-159
 Identities = 311/520 (59%), Positives = 360/520 (69%), Gaps = 17/520 (3%)
 Frame = +2

Query: 125  MEVQTHALATFDTSPFSRTPVNHSEICNAQVQSSEMSDYSSEAPEQHEVESTDSGRRECE 304
            MEV+ H L+TFD +  + T  +       + +   +    S +P+Q           E E
Sbjct: 1    MEVEAHGLSTFDLNSGNLTAFS------LRNRPEFLVPNMSGSPKQPS---------ELE 45

Query: 305  DVLG-KIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRTRRMLADSAVV 481
             V G ++A    EWE               V CP  AA K+QKVYRSYRTRRMLADSAVV
Sbjct: 46   KVTGDQMAEHGGEWEATAAPTSPAAAL---VGCPSNAAMKLQKVYRSYRTRRMLADSAVV 102

Query: 482  AEELWCVA----------ISFVDFLKPDTAASRWNRVILNASKVGKGLSMDAKAQLLAFQ 631
            AEELW  A          ISF +FLKP+TAASRWNR+ LNASKVGKGLS DAKAQ LAFQ
Sbjct: 103  AEELWWQALDFARLNHSTISFFNFLKPETAASRWNRIGLNASKVGKGLSKDAKAQKLAFQ 162

Query: 632  HWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECPRSKLRQQCIK 811
            HWIEAIDPRHRYGHSLH YY+EWCK+NAGQ FFYWLDLG+GKE DL+ECPRSKLRQQCIK
Sbjct: 163  HWIEAIDPRHRYGHSLHFYYEEWCKTNAGQPFFYWLDLGDGKEVDLKECPRSKLRQQCIK 222

Query: 812  YLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLYTGEKKKGLFH 991
            YLGP+EREQYEYV+VDG+ILHK TG  L T+ GSP  KWIFV+STSK+LY GEKKKGLFH
Sbjct: 223  YLGPKEREQYEYVLVDGRILHKVTGYPLDTNKGSPGSKWIFVMSTSKRLYIGEKKKGLFH 282

Query: 992  HSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKENGVNVDEVEIR 1171
            HSSF            +  DGVLKCIS YSGHYKP D+ L SFL  L+ENGVN+D+VE  
Sbjct: 283  HSSFLAGGATLAAGRLIAEDGVLKCISAYSGHYKPTDERLGSFLLFLEENGVNLDDVEKH 342

Query: 1172 KANEDSESHDNGKLAKD------LSISDSVQPDLPSEEEKDPSLETIEASRPEIKIEYKR 1333
            KAN+D E+++NGK++ D      LSISDSVQ DLP+EEE + S E  E S+PE K EYKR
Sbjct: 343  KANDDYENNENGKISMDAAVSKVLSISDSVQCDLPNEEE-ELSSEAPEVSKPETKNEYKR 401

Query: 1334 TLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCVADYPAELR 1513
            TLSGGLQSP+A+V + +I+QRINSKKAAKSYQLGHQLS +WSTG G RIGC+ADYP ELR
Sbjct: 402  TLSGGLQSPKADVPRISIMQRINSKKAAKSYQLGHQLSVQWSTGVGPRIGCIADYPLELR 461

Query: 1514 LQAFEXXXXXXXXXXXXXXXXXXXLLSSPTAFPPYLSNGD 1633
            LQA E                    L SP+  PP +SN D
Sbjct: 462  LQALELTKLSPRTCPPLSTPGRSAALVSPSC-PPNISNDD 500


>ref|XP_011077886.1| PREDICTED: uncharacterized protein LOC105161772 [Sesamum indicum]
          Length = 491

 Score =  563 bits (1452), Expect = e-157
 Identities = 299/430 (69%), Positives = 327/430 (76%), Gaps = 17/430 (3%)
 Frame = +2

Query: 404  GTAATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRW 553
            G AA KVQKVYRSYRTRRMLADSAVVAEELW             ISF +FLKP+TA SRW
Sbjct: 64   GAAAMKVQKVYRSYRTRRMLADSAVVAEELWWQVLDFAQLNQSTISFFNFLKPETAVSRW 123

Query: 554  NRVILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFY 733
            NRV LNASKVGKGLS DAKAQ LAFQHWIEAIDPRHRYGH+LH YY+EWCK+N+GQ FFY
Sbjct: 124  NRVGLNASKVGKGLSKDAKAQKLAFQHWIEAIDPRHRYGHNLHFYYEEWCKTNSGQPFFY 183

Query: 734  WLDLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGS 913
            WLDLG+GKE DL+ECPRSKLR+QCIKYLGPQERE YE+VVVDGKILHK TG+ L T  G 
Sbjct: 184  WLDLGDGKEVDLKECPRSKLRRQCIKYLGPQEREHYEFVVVDGKILHKVTGRPLDTTKGL 243

Query: 914  PELKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYK 1093
            P  KWIFV+STSKKLYTGEKKKGLFHHSSF            M  DGVLKCIS YSGHYK
Sbjct: 244  PGSKWIFVMSTSKKLYTGEKKKGLFHHSSFLAGGATLAAGRLMAEDGVLKCISAYSGHYK 303

Query: 1094 PNDDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSI------SDSVQPDL 1255
            P DDSLDSFLS LKENGVN+ EVEI+K NED +++++GKLA+D S+      SDS   DL
Sbjct: 304  PTDDSLDSFLSFLKENGVNLVEVEIQKVNEDYDANESGKLAEDGSMSEVPPASDSSPCDL 363

Query: 1256 PSEEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLG 1435
            PSE+E +PS E  E S+P+   EYKRTLSGGLQSPRA+V K  ILQRINSKKAAKSYQLG
Sbjct: 364  PSEKE-EPSSEKPETSQPKTITEYKRTLSGGLQSPRADVPKTVILQRINSKKAAKSYQLG 422

Query: 1436 HQLSRKWSTGAGARIGCVADYPAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTAFPP 1615
            H+LS KWSTGAG RIGCVADYP ELRLQA E                    L SP A P 
Sbjct: 423  HKLSIKWSTGAGPRIGCVADYPVELRLQALEFTNVSPRACAPVSTPRPITTLDSP-ASPS 481

Query: 1616 YLSN-GDTTS 1642
            +LSN G TTS
Sbjct: 482  HLSNEGLTTS 491


>emb|CDP15389.1| unnamed protein product [Coffea canephora]
          Length = 526

 Score =  533 bits (1374), Expect = e-148
 Identities = 286/490 (58%), Positives = 340/490 (69%), Gaps = 22/490 (4%)
 Frame = +2

Query: 125  MEVQTHALATFDTS------PFSRTPVNHS-EICNAQ-VQSSEMSDYSSE--APEQHEVE 274
            MEV+ HAL++FD +      PF+ T      E+ N+Q  +SS++  YS    AP + ++ 
Sbjct: 1    MEVEAHALSSFDLNSSPDNFPFTTTAAREPPELRNSQPFRSSKLLGYSESDHAPVREDLG 60

Query: 275  STDSGRRECEDVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRTR 454
            S  +   +C+DV G  A                        CP  AA K+QKVYRSYRTR
Sbjct: 61   SPVAAGGKCQDVGGNSASP-----VANAFAGEEPLPEKMGRCPSRAAMKLQKVYRSYRTR 115

Query: 455  RMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNRVILNASKVGKGLSMD 604
            RMLAD AVVAEELW  A          ISF +F KP++AASRWNRV LNASKVGKGLS D
Sbjct: 116  RMLADCAVVAEELWWQALDFARLNHSTISFFNFSKPESAASRWNRVSLNASKVGKGLSKD 175

Query: 605  AKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECPR 784
            AKAQ LAFQHWIEAIDPRHRYGHSLHLYY+EWCK +AGQ FF+WLDLG+GKE DL  CPR
Sbjct: 176  AKAQKLAFQHWIEAIDPRHRYGHSLHLYYEEWCKGDAGQPFFFWLDLGDGKEVDLTACPR 235

Query: 785  SKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLYT 964
            SKLR+Q IKYLGP+ERE YEY+V +GKI HK +GK L T   SP  KWIFV+STSK+LY 
Sbjct: 236  SKLRKQHIKYLGPKEREHYEYLVTEGKIWHKQSGKPLDTTEASPGAKWIFVMSTSKRLYA 295

Query: 965  GEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKENG 1144
            GEKKKG FHHSSF            +V+DG L+ ISPYSGHY+P DDS D+FL  LK +G
Sbjct: 296  GEKKKGTFHHSSFLAGGVTLAAGRLVVIDGTLQSISPYSGHYRPTDDSFDTFLDFLKAHG 355

Query: 1145 VNVDEVEIRKANEDSESHDNGKLAKDLSISD--SVQPDLPSEEEKDPSLETIEASRPEIK 1318
            VN+DEVE+ KANED E+ +  K  KD S S+  +V     +    D ++ETIEA + EI 
Sbjct: 356  VNLDEVEMNKANEDYENFEEAKTTKDESTSEVSTVSDSCETNAPDDGAVETIEAPKLEIN 415

Query: 1319 IEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCVADY 1498
            + Y+RTLSGGLQSP+AEV K +ILQRINSKKAA+SYQLG QLS KWSTGAG RIGC+ADY
Sbjct: 416  VNYQRTLSGGLQSPKAEVPKTSILQRINSKKAARSYQLGKQLSLKWSTGAGPRIGCIADY 475

Query: 1499 PAELRLQAFE 1528
            P ELR+QA E
Sbjct: 476  PIELRIQALE 485


>ref|XP_009760140.1| PREDICTED: uncharacterized protein LOC104212542 [Nicotiana
            sylvestris]
          Length = 472

 Score =  532 bits (1370), Expect = e-148
 Identities = 274/427 (64%), Positives = 310/427 (72%), Gaps = 17/427 (3%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVAI----------SFVDFLKPDTAASRWNR 559
            AA  VQKVYRSYRTRRMLADSA+VAEELW  AI          SF +F  P+TAASRWNR
Sbjct: 44   AAMTVQKVYRSYRTRRMLADSALVAEELWWQAIDYARLNHSTISFFNFPTPETAASRWNR 103

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            + LNASKVGKGLS D KAQ LAFQHWIEAIDPRHRYGHSLH+YY+EWCK++AGQ FFYWL
Sbjct: 104  ITLNASKVGKGLSRDGKAQKLAFQHWIEAIDPRHRYGHSLHMYYEEWCKTDAGQPFFYWL 163

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            DLG+GK+ DL+ECPRSKL++QCIKYLGPQERE YEY+V +GKI+HK TG  L T  G PE
Sbjct: 164  DLGDGKKVDLKECPRSKLQKQCIKYLGPQEREHYEYIVAEGKIVHKQTGNHLDTTNGLPE 223

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
             KWIFV+STSKKLY GEKK+G+FHHSSF             V DG+LK IS YSGHY+PN
Sbjct: 224  AKWIFVMSTSKKLYAGEKKRGIFHHSSFLAGGATLAAGRLGVTDGILKSISAYSGHYRPN 283

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISD-------SVQPDLP 1258
            DDSLDS LS LKENGVN+DEV+IRK  ED ES+  G+  +  S+S+        +Q D P
Sbjct: 284  DDSLDSLLSFLKENGVNLDEVKIRKVKEDDESNVEGQSYESHSVSELSTVSYSPLQVDDP 343

Query: 1259 SEEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGH 1438
             EEEKD S E + + R +    YKRTLSGGL+SPRAEV   AILQRINSKK+AKSYQLGH
Sbjct: 344  MEEEKDLSSELMRSPRAQETSSYKRTLSGGLESPRAEVPTTAILQRINSKKSAKSYQLGH 403

Query: 1439 QLSRKWSTGAGARIGCVADYPAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTAFPPY 1618
            QLS  WSTGAG RIGC+ADYPAELR QA E                    + SPT   P 
Sbjct: 404  QLSLAWSTGAGPRIGCIADYPAELRQQALELTNLSPRPCSASSTPRPIDDIVSPTCPSPV 463

Query: 1619 LSNGDTT 1639
            L NGD T
Sbjct: 464  LCNGDVT 470


>ref|XP_009602665.1| PREDICTED: uncharacterized protein LOC104097754 [Nicotiana
            tomentosiformis]
          Length = 485

 Score =  530 bits (1366), Expect = e-147
 Identities = 294/527 (55%), Positives = 347/527 (65%), Gaps = 22/527 (4%)
 Frame = +2

Query: 125  MEVQTHALATFDTSPFSRTPVN-HSEICNAQVQSSEMSDYSSEAPEQHEVESTDSGRREC 301
            MEV+THAL++FD       P+N HS   N+ +                +  S D+  R+C
Sbjct: 1    MEVETHALSSFD-------PMNSHSPPINSIL----------------DTPSVDNNGRKC 37

Query: 302  E----DVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRTRRMLAD 469
            +     V G I VD+   +                     AA  VQKVYRSYRTRRMLAD
Sbjct: 38   QWFDSVVGGGIPVDKTTTQ---------------------AAMTVQKVYRSYRTRRMLAD 76

Query: 470  SAVVAEELWCVAI----------SFVDFLKPDTAASRWNRVILNASKVGKGLSMDAKAQL 619
            SA+VAEELW  AI          SF +F  P+TAASRWNR+ LNASKVGKGLS D KAQ 
Sbjct: 77   SALVAEELWWQAIDYARLNHSTISFFNFPTPETAASRWNRITLNASKVGKGLSRDGKAQK 136

Query: 620  LAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECPRSKLRQ 799
            LAFQHWIEAIDPRHRYGHSLH+YY+EWCK++AGQ FF+WLDLG+GK+ DL+ECPRSKL++
Sbjct: 137  LAFQHWIEAIDPRHRYGHSLHMYYEEWCKTDAGQPFFFWLDLGDGKKVDLKECPRSKLQK 196

Query: 800  QCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLYTGEKKK 979
            QCIKYLGPQERE YEY+V +GKI+HK TG  L T  G PE KWIFV+STSK LY GEKK+
Sbjct: 197  QCIKYLGPQEREHYEYIVAEGKIVHKQTGNYLDTTNGLPEAKWIFVMSTSKNLYAGEKKR 256

Query: 980  GLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKENGVNVDE 1159
            G+FHHSSF             V DG+LK IS YSGHY+PNDDSLDS LS LKENGVN+DE
Sbjct: 257  GIFHHSSFLAGGATLAAGRLGVTDGILKSISAYSGHYRPNDDSLDSLLSFLKENGVNLDE 316

Query: 1160 VEIRKANEDSESH------DNGKLAKDLSISDS-VQPDLPSEEEKDPSLETIEASRPEIK 1318
            V+IRK  ED ES+      ++  +++  +ISDS +Q D P EEEKD S + + ++R +  
Sbjct: 317  VKIRKVKEDDESNVEALSSESHSVSELSTISDSPLQVDDPKEEEKDLSSDLMRSTRVQET 376

Query: 1319 IEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCVADY 1498
              YKRTLSGGL+SPR EV   AILQRINSKK+AKSYQLGHQLS  WSTGAG RIGC+ADY
Sbjct: 377  SSYKRTLSGGLESPRTEVPTTAILQRINSKKSAKSYQLGHQLSLAWSTGAGPRIGCIADY 436

Query: 1499 PAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTAFPPYLSNGDTT 1639
            PAELR QA E                    + SPT   P L NGD T
Sbjct: 437  PAELRQQALELTNLSPRPCSASSIPRPIDDIVSPTCPSPVLCNGDVT 483


>ref|XP_012066462.1| PREDICTED: uncharacterized protein LOC105629472 isoform X2 [Jatropha
            curcas]
          Length = 487

 Score =  528 bits (1361), Expect = e-147
 Identities = 273/436 (62%), Positives = 318/436 (72%), Gaps = 25/436 (5%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCV----------AISFVDFLKPDTAASRWNR 559
            AA K+QKVYRSYRTRR LADSAVVAEELW             ISF +FLKP+TAASRWNR
Sbjct: 51   AALKLQKVYRSYRTRRRLADSAVVAEELWWQLIDFARLNHSTISFFNFLKPETAASRWNR 110

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            + LNASKVGKGLS DAKAQ LAFQHWIEAIDPRHRYGH+LH YY+EWCK+N+GQ FFYWL
Sbjct: 111  ISLNASKVGKGLSKDAKAQKLAFQHWIEAIDPRHRYGHNLHFYYEEWCKTNSGQPFFYWL 170

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GKE DLEECPRSKLR+QCIKYLGPQERE YEY+VV+GKI+HK TG  L T  GS  
Sbjct: 171  DIGDGKELDLEECPRSKLRKQCIKYLGPQEREHYEYIVVEGKIIHKQTGNLLDTSEGSKG 230

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
             KWIFV+ST K+LY GEKKKG+FHHSSF            +  DG+LK ISPYSGHY+P 
Sbjct: 231  AKWIFVMSTFKRLYAGEKKKGIFHHSSFLAGGATLAAGRLVAEDGILKSISPYSGHYRPT 290

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKD---LSISDSVQP---DLPS 1261
            +DS DSFLS L+++GVN+DEV+I KA+EDS+++D+GK       L +  ++ P   ++P 
Sbjct: 291  EDSFDSFLSFLQDDGVNLDEVQINKASEDSDNYDDGKFNASEMRLEVPSTLGPTKLEMP- 349

Query: 1262 EEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQ 1441
             EEKD + +  + +  E + EYKRTLSGGLQSPRA+V K AILQRINSKKAAKSYQLGHQ
Sbjct: 350  VEEKDSTSQPSKVAESESRGEYKRTLSGGLQSPRADVPKNAILQRINSKKAAKSYQLGHQ 409

Query: 1442 LSRKWSTGAGARIGCVADYPAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTA----- 1606
            LSRKWSTGAG RIGCVADYP ELR+QA E                    L+SPTA     
Sbjct: 410  LSRKWSTGAGPRIGCVADYPVELRVQALEFVNLSPRTPPTPSYHRRMAGLTSPTASLASP 469

Query: 1607 ----FPPYLSNGDTTS 1642
                    +SNGD  S
Sbjct: 470  TAEPTTSDISNGDVAS 485


>ref|XP_012066461.1| PREDICTED: uncharacterized protein LOC105629472 isoform X1 [Jatropha
            curcas] gi|643736397|gb|KDP42716.1| hypothetical protein
            JCGZ_23656 [Jatropha curcas]
          Length = 517

 Score =  528 bits (1361), Expect = e-147
 Identities = 273/436 (62%), Positives = 318/436 (72%), Gaps = 25/436 (5%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCV----------AISFVDFLKPDTAASRWNR 559
            AA K+QKVYRSYRTRR LADSAVVAEELW             ISF +FLKP+TAASRWNR
Sbjct: 81   AALKLQKVYRSYRTRRRLADSAVVAEELWWQLIDFARLNHSTISFFNFLKPETAASRWNR 140

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            + LNASKVGKGLS DAKAQ LAFQHWIEAIDPRHRYGH+LH YY+EWCK+N+GQ FFYWL
Sbjct: 141  ISLNASKVGKGLSKDAKAQKLAFQHWIEAIDPRHRYGHNLHFYYEEWCKTNSGQPFFYWL 200

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GKE DLEECPRSKLR+QCIKYLGPQERE YEY+VV+GKI+HK TG  L T  GS  
Sbjct: 201  DIGDGKELDLEECPRSKLRKQCIKYLGPQEREHYEYIVVEGKIIHKQTGNLLDTSEGSKG 260

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
             KWIFV+ST K+LY GEKKKG+FHHSSF            +  DG+LK ISPYSGHY+P 
Sbjct: 261  AKWIFVMSTFKRLYAGEKKKGIFHHSSFLAGGATLAAGRLVAEDGILKSISPYSGHYRPT 320

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKD---LSISDSVQP---DLPS 1261
            +DS DSFLS L+++GVN+DEV+I KA+EDS+++D+GK       L +  ++ P   ++P 
Sbjct: 321  EDSFDSFLSFLQDDGVNLDEVQINKASEDSDNYDDGKFNASEMRLEVPSTLGPTKLEMP- 379

Query: 1262 EEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQ 1441
             EEKD + +  + +  E + EYKRTLSGGLQSPRA+V K AILQRINSKKAAKSYQLGHQ
Sbjct: 380  VEEKDSTSQPSKVAESESRGEYKRTLSGGLQSPRADVPKNAILQRINSKKAAKSYQLGHQ 439

Query: 1442 LSRKWSTGAGARIGCVADYPAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTA----- 1606
            LSRKWSTGAG RIGCVADYP ELR+QA E                    L+SPTA     
Sbjct: 440  LSRKWSTGAGPRIGCVADYPVELRVQALEFVNLSPRTPPTPSYHRRMAGLTSPTASLASP 499

Query: 1607 ----FPPYLSNGDTTS 1642
                    +SNGD  S
Sbjct: 500  TAEPTTSDISNGDVAS 515


>ref|XP_012840131.1| PREDICTED: uncharacterized protein LOC105960489 [Erythranthe
            guttatus] gi|604330017|gb|EYU35150.1| hypothetical
            protein MIMGU_mgv1a005330mg [Erythranthe guttata]
          Length = 488

 Score =  528 bits (1360), Expect = e-147
 Identities = 291/519 (56%), Positives = 349/519 (67%), Gaps = 14/519 (2%)
 Frame = +2

Query: 125  MEVQTHALATFDTSPFSRTPVNHSEICNAQVQSSEMSDYSSEAPEQHEVESTDSGRRECE 304
            MEV++H L+ FD           + +       SE+S  S EAP   +   TD   R+  
Sbjct: 1    MEVESHILSNFDLK---------NSLSATAFSFSEISGSSEEAPPLPK--ETDQTARKLN 49

Query: 305  DVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRTRRMLADSAVVA 484
            D     A                        C  TAA K+QKVYRSYRTRRMLAD+AVVA
Sbjct: 50   DAAAPPA-----------------------RCTSTAAMKLQKVYRSYRTRRMLADTAVVA 86

Query: 485  EELWCVA----------ISFVDFLKPDTAASRWNRVILNASKVGKGLSMDAKAQLLAFQH 634
            EELW  A          ISF ++ KP++AASRWNRV LNASKVGKGLS DAKAQ LAFQH
Sbjct: 87   EELWWQALDFARLNHSTISFFNYSKPESAASRWNRVRLNASKVGKGLSKDAKAQKLAFQH 146

Query: 635  WIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECPRSKLRQQCIKY 814
            WIEAIDPRHRYGHSLHLYY+EWC++++GQ FFYWLDLG+GKE DL+ECPR+KLR+QCIKY
Sbjct: 147  WIEAIDPRHRYGHSLHLYYEEWCQASSGQPFFYWLDLGDGKEVDLKECPRAKLRKQCIKY 206

Query: 815  LGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLYTGEKKKGLFHH 994
            LGP+ERE YE+VVV GKIL K TG+ L T+ GS   KWIFV+STS +LY+GEKKKGLFHH
Sbjct: 207  LGPKEREHYEFVVVGGKILRKLTGEPLDTNKGSAGSKWIFVMSTSNRLYSGEKKKGLFHH 266

Query: 995  SSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKENGVNVDEVEIRK 1174
            SSF            +  DGVLK IS YSGHYKP DDSLD+F   L+ENGVN+D+VEI+K
Sbjct: 267  SSFLAGGATLAAGRLVADDGVLKSISAYSGHYKPTDDSLDTFKLFLQENGVNLDQVEIQK 326

Query: 1175 ANEDSESHDNGKLAKDLSISD-SVQPD-LPSEEEKDPSLETIEASRPEIKIEYKRTLSGG 1348
            A +D ++ ++G L+ + +IS+ S  PD +P+EEE D  L  +EAS+P  KIEY+RTLSGG
Sbjct: 327  AKDDYDNSESGNLSVEEAISEVSPLPDSIPNEEEADSHL--LEASKPATKIEYRRTLSGG 384

Query: 1349 LQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCVADYPAELRLQAFE 1528
            LQSP+++V K +ILQRINSKKAAKSYQLG QLS KWSTGAG RIGC+ADYP ELR QA E
Sbjct: 385  LQSPKSDVPKTSILQRINSKKAAKSYQLGDQLSSKWSTGAGPRIGCIADYPLELRFQALE 444

Query: 1529 XXXXXXXXXXXXXXXXXXXL--LSSPTAFPPYLSNGDTT 1639
                               L  L SP++ P  + N D T
Sbjct: 445  FTNLSPRSFLRPPLFTPRKLAALVSPSSHPSNVGNYDDT 483


>gb|KHG28971.1| Lysine--tRNA ligase [Gossypium arboreum]
          Length = 494

 Score =  522 bits (1344), Expect = e-145
 Identities = 263/388 (67%), Positives = 303/388 (78%), Gaps = 15/388 (3%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNR 559
            AA KVQKVYRSYRTRR LADSAVVAEELW +A          ISF ++LKP+TAAS WNR
Sbjct: 92   AAVKVQKVYRSYRTRRRLADSAVVAEELWWLALNYARLNHSTISFFNYLKPETAASWWNR 151

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            V LNASKVGKGLS+DAKAQ LAFQHWIEAIDPRHRYGH+LH+YY EWCK++AGQ FFYWL
Sbjct: 152  VGLNASKVGKGLSIDAKAQKLAFQHWIEAIDPRHRYGHNLHIYYDEWCKTDAGQPFFYWL 211

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GK+ DLEEC RSKLR+QCIKYLGPQERE YEY+VV+GKI+HK T   L T  GS E
Sbjct: 212  DIGDGKDVDLEECSRSKLRKQCIKYLGPQEREHYEYIVVEGKIIHKQTRNVLDTFQGSKE 271

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
            +KWIFV+STSKKLY GEKKKG+FHHSSF            +V  G+LK IS YSGHY+P 
Sbjct: 272  VKWIFVMSTSKKLYAGEKKKGMFHHSSFLAGGVTLAAGRLVVEKGILKSISAYSGHYRPT 331

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISDSVQPDLPSE----- 1264
            DDSL+SFLS LK NGVN+D+VEIR++ +DS+S+D+G  +  +S        +P+E     
Sbjct: 332  DDSLNSFLSFLKGNGVNLDKVEIRRSTDDSDSYDDGNSSSSVSTFGFSASSVPTEPKINK 391

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            EEK+ SLE+ +  +PE    Y+RTLSGGLQSPR EV K AILQRINSKKA KSYQLGHQL
Sbjct: 392  EEKNLSLESYDTKQPETTNTYERTLSGGLQSPRTEVPKTAILQRINSKKATKSYQLGHQL 451

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 452  SLKWSTGAGPRIGCVADYPLELRQQALE 479


>gb|KHG10982.1| Lysine--tRNA ligase [Gossypium arboreum]
          Length = 494

 Score =  522 bits (1344), Expect = e-145
 Identities = 263/388 (67%), Positives = 303/388 (78%), Gaps = 15/388 (3%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNR 559
            AA KVQKVYRSYRTRR LADSAVVAEELW +A          ISF ++LKP+TAAS WNR
Sbjct: 92   AAVKVQKVYRSYRTRRRLADSAVVAEELWWLALNYARLNHSTISFFNYLKPETAASWWNR 151

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            V LNASKVGKGLS+DAKAQ LAFQHWIEAIDPRHRYGH+LH+YY EWCK++AGQ FFYWL
Sbjct: 152  VGLNASKVGKGLSIDAKAQKLAFQHWIEAIDPRHRYGHNLHIYYDEWCKTDAGQPFFYWL 211

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GK+ DLEEC RSKLR+QCIKYLGPQERE YEY+VV+GKI+HK T   L T  GS E
Sbjct: 212  DIGDGKDVDLEECSRSKLRKQCIKYLGPQEREHYEYIVVEGKIIHKQTRNVLDTFQGSKE 271

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
            +KWIFV+STSKKLY GEKKKG+FHHSSF            +V  G+LK IS YSGHY+P 
Sbjct: 272  VKWIFVMSTSKKLYAGEKKKGMFHHSSFLAGGVTLAAGRLVVEKGILKSISAYSGHYRPT 331

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISDSVQPDLPSE----- 1264
            DDSL+SFLS LK NGVN+D+VEIR++ +DS+S+D+G  +  +S        +P+E     
Sbjct: 332  DDSLNSFLSFLKGNGVNLDKVEIRRSTDDSDSYDDGNSSSSVSTFGFSASSVPTEPKINK 391

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            EEK+ SLE+ +  +PE    Y+RTLSGGLQSPR EV K AILQRINSKKA KSYQLGHQL
Sbjct: 392  EEKNLSLESYDTKQPETTNTYERTLSGGLQSPRTEVPKTAILQRINSKKATKSYQLGHQL 451

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 452  SLKWSTGAGPRIGCVADYPLELRQQALE 479


>ref|XP_002533549.1| calmodulin binding protein, putative [Ricinus communis]
            gi|223526585|gb|EEF28839.1| calmodulin binding protein,
            putative [Ricinus communis]
          Length = 476

 Score =  520 bits (1340), Expect = e-144
 Identities = 266/428 (62%), Positives = 311/428 (72%), Gaps = 16/428 (3%)
 Frame = +2

Query: 407  TAATKVQKVYRSYRTRRMLADSAVVAEELWCVAI----------SFVDFLKPDTAASRWN 556
            TAA K+QKVYRSYRTRR LADSAVVAEELW  AI          SF +F+KP+TA SRWN
Sbjct: 49   TAAVKLQKVYRSYRTRRRLADSAVVAEELWWQAIDYARLNHSTISFFNFMKPETAVSRWN 108

Query: 557  RVILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYW 736
            R+ LNASKVGKGLS DAKAQ LAFQHWIEAIDPRHRYGHSLHLYY+EWC++N+GQ FFYW
Sbjct: 109  RISLNASKVGKGLSKDAKAQKLAFQHWIEAIDPRHRYGHSLHLYYEEWCRTNSGQPFFYW 168

Query: 737  LDLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSP 916
            LD+G+GKE DLE+CPRSKLR QCIKYLGP+ER  YEY+V +G+I+ K TG  L T +GS 
Sbjct: 169  LDIGDGKELDLEDCPRSKLRHQCIKYLGPKERGYYEYIVFEGRIVQKYTGNLLDTSSGSK 228

Query: 917  ELKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKP 1096
              KWIFV+ST K+LY GEKKKG FHHSSF            +  +G+LK ISPYSGHY+P
Sbjct: 229  GAKWIFVMSTFKRLYAGEKKKGKFHHSSFLAGGATLAAGRLVAENGILKSISPYSGHYRP 288

Query: 1097 NDDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISDSVQPDLPSE---- 1264
             DDS DSFLS+LK+NGVN+DEV+I KA+EDS+ +D+GK +    I++++    P E    
Sbjct: 289  TDDSFDSFLSLLKDNGVNLDEVQINKASEDSDIYDDGKFSGSKMINETLSKSKPPELELP 348

Query: 1265 -EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQ 1441
             E+KD + E  E  + E +  YKRTLSGGLQSPRAEV +  ILQRINSKKA KSYQLGHQ
Sbjct: 349  NEQKDATSEPAEVKQTENEGIYKRTLSGGLQSPRAEVPRTVILQRINSKKAGKSYQLGHQ 408

Query: 1442 LSRKWSTGAGARIGCVADYPAELRLQAFEXXXXXXXXXXXXXXXXXXXLLSSPTAFP-PY 1618
            LS KWSTGAG RIGCVADYP E+RLQA E                    L+SPT  P   
Sbjct: 409  LSLKWSTGAGPRIGCVADYPVEVRLQALEFVNLSPRSPPTPSYYRRVAGLASPTTQPISD 468

Query: 1619 LSNGDTTS 1642
             +NGD  S
Sbjct: 469  AANGDGNS 476


>ref|XP_010049678.1| PREDICTED: uncharacterized protein LOC104438270 [Eucalyptus grandis]
          Length = 560

 Score =  518 bits (1334), Expect = e-144
 Identities = 261/388 (67%), Positives = 303/388 (78%), Gaps = 14/388 (3%)
 Frame = +2

Query: 407  TAATKVQKVYRSYRTRRMLADSAVVAEELWCVAISFVD--------FLKPDTAASRWNRV 562
            +AA KVQKVYRSYRTRR LADSAVVAEELW  A+ F          F  P++AASRW+RV
Sbjct: 143  SAAVKVQKVYRSYRTRRRLADSAVVAEELWWQALDFARLNRSTISFFNFPESAASRWSRV 202

Query: 563  ILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLD 742
             L ASKVGKGLS DAKAQ+LAFQHWIEAIDPRHRYGHSLHLYYKEWCK+++GQ FFYWLD
Sbjct: 203  GLIASKVGKGLSKDAKAQILAFQHWIEAIDPRHRYGHSLHLYYKEWCKADSGQPFFYWLD 262

Query: 743  LGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPEL 922
            +G+GKE DL+ECPRSKLRQQ IKYLGPQERE YEYVV++GKI+HK TG+ LHT  GS + 
Sbjct: 263  IGDGKELDLKECPRSKLRQQGIKYLGPQEREHYEYVVINGKIVHKQTGELLHTKKGSEDA 322

Query: 923  KWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPND 1102
            KWIFV+S SK+LYTGEKKKG+FHHSSF                GVLK IS YSGHY+P +
Sbjct: 323  KWIFVMSPSKRLYTGEKKKGVFHHSSFLAGGVTIAAGRIAAEHGVLKSISAYSGHYRPTE 382

Query: 1103 DSLDSFLSILKENGVNVDEVEIRKANEDSESHDN------GKLAKDLSISDSVQPDLPSE 1264
            D LD+F+S LKENGVN+DEVEIRKA+EDSES+++      G+L +     +  QP++P E
Sbjct: 383  DRLDNFISFLKENGVNLDEVEIRKASEDSESYEDNKPSARGRLVETSGTPELSQPEIPKE 442

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            E ++   E+ E++ P+ KI YKRTLSGGLQSPR EV K  IL+RINSK AAKSYQLG+QL
Sbjct: 443  EMENVCSESTESTIPQEKINYKRTLSGGLQSPRTEVPKTKILERINSKSAAKSYQLGNQL 502

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 503  SLKWSTGAGPRIGCVADYPVELRQQALE 530


>gb|KCW89220.1| hypothetical protein EUGRSUZ_A01525 [Eucalyptus grandis]
          Length = 572

 Score =  518 bits (1334), Expect = e-144
 Identities = 261/388 (67%), Positives = 303/388 (78%), Gaps = 14/388 (3%)
 Frame = +2

Query: 407  TAATKVQKVYRSYRTRRMLADSAVVAEELWCVAISFVD--------FLKPDTAASRWNRV 562
            +AA KVQKVYRSYRTRR LADSAVVAEELW  A+ F          F  P++AASRW+RV
Sbjct: 155  SAAVKVQKVYRSYRTRRRLADSAVVAEELWWQALDFARLNRSTISFFNFPESAASRWSRV 214

Query: 563  ILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLD 742
             L ASKVGKGLS DAKAQ+LAFQHWIEAIDPRHRYGHSLHLYYKEWCK+++GQ FFYWLD
Sbjct: 215  GLIASKVGKGLSKDAKAQILAFQHWIEAIDPRHRYGHSLHLYYKEWCKADSGQPFFYWLD 274

Query: 743  LGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPEL 922
            +G+GKE DL+ECPRSKLRQQ IKYLGPQERE YEYVV++GKI+HK TG+ LHT  GS + 
Sbjct: 275  IGDGKELDLKECPRSKLRQQGIKYLGPQEREHYEYVVINGKIVHKQTGELLHTKKGSEDA 334

Query: 923  KWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPND 1102
            KWIFV+S SK+LYTGEKKKG+FHHSSF                GVLK IS YSGHY+P +
Sbjct: 335  KWIFVMSPSKRLYTGEKKKGVFHHSSFLAGGVTIAAGRIAAEHGVLKSISAYSGHYRPTE 394

Query: 1103 DSLDSFLSILKENGVNVDEVEIRKANEDSESHDN------GKLAKDLSISDSVQPDLPSE 1264
            D LD+F+S LKENGVN+DEVEIRKA+EDSES+++      G+L +     +  QP++P E
Sbjct: 395  DRLDNFISFLKENGVNLDEVEIRKASEDSESYEDNKPSARGRLVETSGTPELSQPEIPKE 454

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            E ++   E+ E++ P+ KI YKRTLSGGLQSPR EV K  IL+RINSK AAKSYQLG+QL
Sbjct: 455  EMENVCSESTESTIPQEKINYKRTLSGGLQSPRTEVPKTKILERINSKSAAKSYQLGNQL 514

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 515  SLKWSTGAGPRIGCVADYPVELRQQALE 542


>ref|XP_009611163.1| PREDICTED: uncharacterized protein LOC104104723 isoform X1 [Nicotiana
            tomentosiformis]
          Length = 521

 Score =  518 bits (1333), Expect = e-144
 Identities = 279/493 (56%), Positives = 338/493 (68%), Gaps = 25/493 (5%)
 Frame = +2

Query: 125  MEVQTHALATFDTS-----PFS------RTPVNHSEICNAQVQSSEMSDYSSEAPEQHEV 271
            MEV+T+A + FD +     PFS      ++    SE+     +SS+MS     +P     
Sbjct: 1    MEVETNAGSNFDLNNSQPHPFSYRKLLDKSACELSELHIPDFRSSDMSLEGFGSPVAAAG 60

Query: 272  ESTDSGRRECEDVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRT 451
               DSG              RRE++                  P  AA ++QKVYRSYRT
Sbjct: 61   VECDSGMVLVPASPAVSENGRREYDIADTAPKS----------PSNAAKRLQKVYRSYRT 110

Query: 452  RRMLADSAVVAEELWCVAI----------SFVDFLKPDTAASRWNRVILNASKVGKGLSM 601
            RRMLADSAVVAEELW  AI          SF ++LKP+TAASRWNRV LNASKVGKGLS 
Sbjct: 111  RRMLADSAVVAEELWWQAIDYARLNHSTISFFNYLKPETAASRWNRVGLNASKVGKGLSK 170

Query: 602  DAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECP 781
            DA+AQ+LAFQHWIEAIDPRHRYGH+LH+YY+EWC ++AGQ FF+WLDLG+G++ D++ECP
Sbjct: 171  DAEAQILAFQHWIEAIDPRHRYGHNLHIYYEEWCNTDAGQPFFFWLDLGDGRKVDIKECP 230

Query: 782  RSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLY 961
            R+KL++QCIKYLGPQERE YEY++  G+ILHK TG  L T  G P  KWIFV+STSK+L+
Sbjct: 231  RAKLQKQCIKYLGPQEREHYEYIIAGGQILHKLTGILLDTTKGPPGAKWIFVMSTSKRLF 290

Query: 962  TGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKEN 1141
             GEKKKG+FHHSSF            +V DG +K ISPYSGHY+P DDSLDSFLSILKEN
Sbjct: 291  AGEKKKGMFHHSSFLAGGATLAAGRLVVEDGTVKSISPYSGHYRPTDDSLDSFLSILKEN 350

Query: 1142 GVNVDEVEIRKANEDSESHDNGKLAKD----LSISDSVQPDLPSEEEKDPSLETIEASRP 1309
            GV VDEV+I+KANED ++ ++ K  ++    LS      P    EEEKD S E++ A + 
Sbjct: 351  GVKVDEVKIKKANEDYDNSEDVKSIENRPSKLSTPSDSPPQAVKEEEKDCSAESMGAPQA 410

Query: 1310 EIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCV 1489
            E    Y+RTLSGGLQSPR +V K AILQRI+SKK+ KSYQLGHQLSR WSTGAG RIGC+
Sbjct: 411  ESTNSYQRTLSGGLQSPRTDVPKTAILQRISSKKSTKSYQLGHQLSRVWSTGAGPRIGCI 470

Query: 1490 ADYPAELRLQAFE 1528
            ADYPAELR QA E
Sbjct: 471  ADYPAELRWQALE 483


>ref|XP_010110598.1| hypothetical protein L484_001201 [Morus notabilis]
            gi|587940315|gb|EXC26929.1| hypothetical protein
            L484_001201 [Morus notabilis]
          Length = 511

 Score =  516 bits (1329), Expect = e-143
 Identities = 288/523 (55%), Positives = 340/523 (65%), Gaps = 19/523 (3%)
 Frame = +2

Query: 131  VQTHALATFDTSPFSRTPVNHSEICNAQVQSSEMSDYSSEAPEQHEVESTDSGRRECE-- 304
            VQ   L+TFD    S+ P  +  + +  + SS +    S + +  E ES  SG R+    
Sbjct: 4    VQAQRLSTFDLQ--SKPPFPYP-LGDMSISSSNLHSLDSRSSDVSESESPASGDRQLPSF 60

Query: 305  DVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSYRTRRMLADSAVVA 484
            DV G +  D    +                     AA K+QK YR YRTRR LADSA+VA
Sbjct: 61   DVAGSLTGDHTPAKETIES--------------SAAAVKLQKAYRGYRTRRRLADSAIVA 106

Query: 485  EELWCVA----------ISFVDFLKPDTAASRWNRVILNASKVGKGLSMDAKAQLLAFQH 634
            EELW  A          ISF +F KP+TA SRWNR+ LNASKVGKGLS DAKAQ LAFQH
Sbjct: 107  EELWWQALDFARLNHSTISFFNFSKPETATSRWNRISLNASKVGKGLSKDAKAQKLAFQH 166

Query: 635  WIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEECPRSKLRQQCIKY 814
            WIEAIDPRHRYGHSLH YY+EWCK++AGQ FFYWLDLG+GKE DL+ECPRSKL+QQCIKY
Sbjct: 167  WIEAIDPRHRYGHSLHKYYEEWCKADAGQPFFYWLDLGDGKELDLKECPRSKLKQQCIKY 226

Query: 815  LGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKKLYTGEKKKGLFHH 994
            LGP+ERE YEY++ DGKI+HK T   L T   S   KWIFV+ST KKLY GEKKKG FHH
Sbjct: 227  LGPKEREHYEYLLEDGKIIHKQTRDLLDTKKRSQGAKWIFVMSTFKKLYAGEKKKGTFHH 286

Query: 995  SSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILKENGVNVDEVEIRK 1174
            SSF               +G LK IS YSGHY+P+DD LDSFLS LKENGV ++EVEIRK
Sbjct: 287  SSFLAGGATLAAGRLEAENGTLKSISAYSGHYRPSDDRLDSFLSFLKENGVKLNEVEIRK 346

Query: 1175 ANEDSESHDNGKLAKDL----SISDSVQPDLPS--EEEKDPSLETIEASRPEIKIEYKRT 1336
            ANED +S+D+ K  +      S+++S  P+L +  E++K  S ET E  +   K  YKRT
Sbjct: 347  ANEDYDSYDDSKFNRAATSLKSLANSGPPELKTHVEQQKTSSSETTELPQTGTKTNYKRT 406

Query: 1337 LSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIGCVADYPAELRL 1516
            LSGGLQSPRAEV K  IL+RINSKKAAKSYQLG+QLS KWSTGAG RIGCVADYP E+R+
Sbjct: 407  LSGGLQSPRAEVPKNKILERINSKKAAKSYQLGNQLSLKWSTGAGPRIGCVADYPVEVRM 466

Query: 1517 QAFEXXXXXXXXXXXXXXXXXXXLLSSPTAF-PPYLSNGDTTS 1642
            QA E                    L+SPT+   P  +NGD TS
Sbjct: 467  QALEFVNLSPRTPPTPLSFRRLAGLASPTSHSTPDPNNGDGTS 509


>ref|XP_012467025.1| PREDICTED: uncharacterized protein LOC105785475 isoform X1 [Gossypium
            raimondii] gi|763747634|gb|KJB15073.1| hypothetical
            protein B456_002G158900 [Gossypium raimondii]
          Length = 537

 Score =  515 bits (1326), Expect = e-143
 Identities = 259/388 (66%), Positives = 303/388 (78%), Gaps = 15/388 (3%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNR 559
            AA KVQKVYRSYRTRR LADSAVVAEELW +A          ISF ++LKP+TAASRWNR
Sbjct: 92   AAVKVQKVYRSYRTRRRLADSAVVAEELWWLALNYARLNHSTISFFNYLKPETAASRWNR 151

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            V LNASKVGKGLS+DAKAQ LAFQHWIEAIDPRHRYGH+LH+YY EWCK++AGQ FFYWL
Sbjct: 152  VRLNASKVGKGLSIDAKAQKLAFQHWIEAIDPRHRYGHNLHIYYDEWCKTDAGQPFFYWL 211

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GK+ DLEEC RS L++Q IKYLGPQERE YEY+VV+GKI+HK T   L T  GS E
Sbjct: 212  DIGDGKDVDLEECSRSNLQKQLIKYLGPQEREHYEYIVVEGKIIHKQTRNVLDTFQGSKE 271

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
            +KWIFV+STSKKLY G+KKKG+FHHSSF            +V  G+LK IS YSGHY+P 
Sbjct: 272  VKWIFVMSTSKKLYAGKKKKGMFHHSSFLAGGATLAAGRLVVEKGILKSISAYSGHYRPT 331

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISDSVQPDLPSE----- 1264
            +DSL+SFLS LKENGVN+D+VEIR + +DS+S+D+GK +  +S        +P+E     
Sbjct: 332  NDSLNSFLSFLKENGVNLDKVEIRHSTDDSDSYDDGKSSSSVSTFGFSASSVPTEPKINN 391

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            EEK+ SLE+ +  +PE    Y+RTLSGGL+SPR EV K AILQRINSKKA +SYQLGHQL
Sbjct: 392  EEKNLSLESYDTKQPETTNTYERTLSGGLKSPRTEVPKTAILQRINSKKATESYQLGHQL 451

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 452  SLKWSTGAGPRIGCVADYPLELRQQALE 479


>gb|KHF97284.1| Pre-mRNA-splicing factor CWC22 [Gossypium arboreum]
          Length = 519

 Score =  514 bits (1325), Expect = e-143
 Identities = 256/389 (65%), Positives = 298/389 (76%), Gaps = 12/389 (3%)
 Frame = +2

Query: 398  CPGTAATKVQKVYRSYRTRRMLADSAVVAEELWCV---------AISFVDFLKPDTAASR 550
            C   AA KVQKVYRSY TRR +ADSAVVAEE W V          ISF ++ KP++ ASR
Sbjct: 87   CHSNAAVKVQKVYRSYWTRRRIADSAVVAEEWWRVLDYARLNHSTISFFNYSKPESVASR 146

Query: 551  WNRVILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFF 730
            WNRV+LNASKVGKGL  DAKAQ LAF+HWIEAIDPRHRYGH+LH+YY EWCKS+AGQ FF
Sbjct: 147  WNRVVLNASKVGKGLFKDAKAQKLAFRHWIEAIDPRHRYGHNLHIYYDEWCKSDAGQPFF 206

Query: 731  YWLDLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTG 910
            YWLD+GEGKE DL+ECPRSKLRQQCIKYLGPQERE YEY+V +GKI+HK TG  LHT  G
Sbjct: 207  YWLDIGEGKEIDLQECPRSKLRQQCIKYLGPQEREHYEYIVFEGKIIHKQTGNVLHTIKG 266

Query: 911  SPELKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHY 1090
            S   KWIFVVSTSKKLY GEKKKG+FHHSSF            +   G+LK IS YSGHY
Sbjct: 267  SEGRKWIFVVSTSKKLYAGEKKKGMFHHSSFLAGGATFAAGRLVAEQGILKSISAYSGHY 326

Query: 1091 KPNDDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKD---LSISDSVQPDLPS 1261
            +P DDSLD+FLS LKE GVN+ EVEIR+A++DS+++ N K       + +S  ++P++ S
Sbjct: 327  RPTDDSLDNFLSFLKEKGVNLSEVEIRRASDDSDNYGNDKSGSGGTAVEVSVPIEPEINS 386

Query: 1262 EEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQ 1441
             EEK+ SL++ E ++      YKRTLSGGLQSP  EV K+AILQRINSKKA KSYQLGH+
Sbjct: 387  -EEKNVSLQSSETNQTGASYTYKRTLSGGLQSPTVEVPKKAILQRINSKKAVKSYQLGHR 445

Query: 1442 LSRKWSTGAGARIGCVADYPAELRLQAFE 1528
            LS KWSTGAG RIGC+ADYP ELR +A E
Sbjct: 446  LSLKWSTGAGPRIGCIADYPVELRQEALE 474


>ref|XP_009798181.1| PREDICTED: uncharacterized protein LOC104244456 isoform X1 [Nicotiana
            sylvestris]
          Length = 529

 Score =  512 bits (1318), Expect = e-142
 Identities = 279/495 (56%), Positives = 337/495 (68%), Gaps = 27/495 (5%)
 Frame = +2

Query: 125  MEVQTHALATFDTS------PFSRTPVNHSEICNAQ------VQSSEMSDYSSEAPEQHE 268
            MEV+T A + FD +      PFS   +     C          +SS+MS  SS APE   
Sbjct: 1    MEVETDAGSNFDLNNSQPHHPFSYRKLLDMSACELSELHIPDFRSSDMST-SSPAPELEG 59

Query: 269  VESTDSGRR-ECEDVLGKIAVDRREWEXXXXXXXXXXXXXXXVTCPGTAATKVQKVYRSY 445
              S  +    +C++ +  +       E                  P  AA ++QKVYRSY
Sbjct: 60   FGSPVAAAGVDCDNGMVLVPASPAVSENGRGEYDITDTAPRS---PSNAAKRLQKVYRSY 116

Query: 446  RTRRMLADSAVVAEELWCVAI----------SFVDFLKPDTAASRWNRVILNASKVGKGL 595
            RTRRMLADSAVVAEELW  AI          SF ++LKP+ AASRWNRV LNASKVGKGL
Sbjct: 117  RTRRMLADSAVVAEELWWQAIDYARLNHSTISFFNYLKPEMAASRWNRVGLNASKVGKGL 176

Query: 596  SMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWLDLGEGKEFDLEE 775
            S DA+AQ+LAF+HWIEAIDPRHRYGH+LH+YY+EW K++AGQ FF+WLDLGEG++ DL+E
Sbjct: 177  SKDAEAQILAFRHWIEAIDPRHRYGHNLHIYYEEWYKTDAGQPFFFWLDLGEGRKVDLKE 236

Query: 776  CPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPELKWIFVVSTSKK 955
            CPR+KL++QCIKYLGPQERE YEY++  GKILHK TG  L T  G P  KWIFV+STSK+
Sbjct: 237  CPRAKLQKQCIKYLGPQERELYEYMIAGGKILHKLTGNLLDTTKGPPGAKWIFVMSTSKR 296

Query: 956  LYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPNDDSLDSFLSILK 1135
            LY GEKKKG+FHHSSF            +V +G++K ISPYSGHY+P DDSLDSFLSILK
Sbjct: 297  LYAGEKKKGMFHHSSFLAGGATLAAGRLVVENGIVKSISPYSGHYRPTDDSLDSFLSILK 356

Query: 1136 ENGVNVDEVEIRKANEDSESHDNGKLAKD----LSISDSVQPDLPSEEEKDPSLETIEAS 1303
            ENGV  DEV+I+KANED ++ ++ K  ++    LS      P    EEEKD S+E++ A 
Sbjct: 357  ENGVKADEVKIKKANEDYDNSEDVKSVENRPSKLSTPSDSPPQAVKEEEKDCSVESMGAP 416

Query: 1304 RPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQLSRKWSTGAGARIG 1483
              E    Y+RTLSGGLQSPR +V K AILQRI+SKK+ KSYQLGHQLSR WSTGAG RIG
Sbjct: 417  HAESTNSYQRTLSGGLQSPRRDVPKTAILQRISSKKSTKSYQLGHQLSRVWSTGAGPRIG 476

Query: 1484 CVADYPAELRLQAFE 1528
            C+ADYPAELR QA E
Sbjct: 477  CIADYPAELRWQALE 491


>gb|KJB15075.1| hypothetical protein B456_002G158900 [Gossypium raimondii]
          Length = 536

 Score =  510 bits (1313), Expect = e-141
 Identities = 259/388 (66%), Positives = 302/388 (77%), Gaps = 15/388 (3%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNR 559
            AA KVQKVYRSYRTRR LADSAVVAEELW +A          ISF ++LKP+TAASRWNR
Sbjct: 92   AAVKVQKVYRSYRTRRRLADSAVVAEELWWLALNYARLNHSTISFFNYLKPETAASRWNR 151

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            V LNASKVGKGLS+DAKAQ LAFQHWIEAIDPRHRYGH+LH+YY EWCK++AGQ FFYWL
Sbjct: 152  VRLNASKVGKGLSIDAKAQKLAFQHWIEAIDPRHRYGHNLHIYYDEWCKTDAGQPFFYWL 211

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            D+G+GK+ DLEEC RS L++Q IKYLGPQERE YEY+VV+GKI+HK T   L T  GS E
Sbjct: 212  DIGDGKDVDLEECSRSNLQKQLIKYLGPQEREHYEYIVVEGKIIHKQTRNVLDTFQGSKE 271

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
            +KWIFV+STSKKLY G KKKG+FHHSSF            +V  G+LK IS YSGHY+P 
Sbjct: 272  VKWIFVMSTSKKLYAG-KKKGMFHHSSFLAGGATLAAGRLVVEKGILKSISAYSGHYRPT 330

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNGKLAKDLSISDSVQPDLPSE----- 1264
            +DSL+SFLS LKENGVN+D+VEIR + +DS+S+D+GK +  +S        +P+E     
Sbjct: 331  NDSLNSFLSFLKENGVNLDKVEIRHSTDDSDSYDDGKSSSSVSTFGFSASSVPTEPKINN 390

Query: 1265 EEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQL 1444
            EEK+ SLE+ +  +PE    Y+RTLSGGL+SPR EV K AILQRINSKKA +SYQLGHQL
Sbjct: 391  EEKNLSLESYDTKQPETTNTYERTLSGGLKSPRTEVPKTAILQRINSKKATESYQLGHQL 450

Query: 1445 SRKWSTGAGARIGCVADYPAELRLQAFE 1528
            S KWSTGAG RIGCVADYP ELR QA E
Sbjct: 451  SLKWSTGAGPRIGCVADYPLELRQQALE 478


>ref|XP_006339127.1| PREDICTED: uncharacterized protein LOC102601309 [Solanum tuberosum]
          Length = 466

 Score =  509 bits (1312), Expect = e-141
 Identities = 259/389 (66%), Positives = 298/389 (76%), Gaps = 16/389 (4%)
 Frame = +2

Query: 410  AATKVQKVYRSYRTRRMLADSAVVAEELWCVA----------ISFVDFLKPDTAASRWNR 559
            AA  VQKVYRSYRTRRMLADSA+VAEELW  A          ISF +  +P+TA SRWNR
Sbjct: 55   AAMTVQKVYRSYRTRRMLADSALVAEELWWQALDYARLNHSTISFFNVPQPETAVSRWNR 114

Query: 560  VILNASKVGKGLSMDAKAQLLAFQHWIEAIDPRHRYGHSLHLYYKEWCKSNAGQAFFYWL 739
            + LNASKVGKGLS D KAQ LAFQHWIEAIDPRHRYGH+LH+YYKEWCK++AGQ FF+WL
Sbjct: 115  ITLNASKVGKGLSRDGKAQKLAFQHWIEAIDPRHRYGHNLHMYYKEWCKTDAGQPFFFWL 174

Query: 740  DLGEGKEFDLEECPRSKLRQQCIKYLGPQEREQYEYVVVDGKILHKTTGKTLHTDTGSPE 919
            DLG+GK+ +L+ECPRSKL++Q IKYLGPQERE YEYVV +GKILH  TG  L T  G P 
Sbjct: 175  DLGDGKKVELKECPRSKLQKQSIKYLGPQEREHYEYVVAEGKILHSLTGNHLDTTNGLPG 234

Query: 920  LKWIFVVSTSKKLYTGEKKKGLFHHSSFXXXXXXXXXXXXMVVDGVLKCISPYSGHYKPN 1099
             KWIFV+STSK+LY GEKKKG+FHHSSF            +V DG+LK IS YSGHY+P 
Sbjct: 235  AKWIFVMSTSKRLYAGEKKKGIFHHSSFLAGGATLAAGRLVVKDGILKSISAYSGHYRPT 294

Query: 1100 DDSLDSFLSILKENGVNVDEVEIRKANEDSESHDNG-----KLAKDLS-ISDSVQPDLPS 1261
            DDSLDSFLS LKENGVN++E EI+K  +D ES++ G     + A DLS IS+S+Q DL  
Sbjct: 295  DDSLDSFLSFLKENGVNLEETEIKKPRDDDESYEEGNSSESRSASDLSTISESLQVDLQK 354

Query: 1262 EEEKDPSLETIEASRPEIKIEYKRTLSGGLQSPRAEVSKEAILQRINSKKAAKSYQLGHQ 1441
            EEEKD S + + + + E    YKRTLSGGL+SPRAEV K AILQRINSKK +KSYQLGHQ
Sbjct: 355  EEEKDFSSKLVISPQAEKASNYKRTLSGGLESPRAEVPKTAILQRINSKKLSKSYQLGHQ 414

Query: 1442 LSRKWSTGAGARIGCVADYPAELRLQAFE 1528
            LS  WSTGAG RIGC+ DYP ELR QA E
Sbjct: 415  LSLAWSTGAGPRIGCINDYPVELREQALE 443

