BLASTX nr result

ID: Phellodendron21_contig00000692 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00000692
         (1916 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006434539.1 hypothetical protein CICLE_v10000894mg [Citrus cl...   809   0.0  
KDO83830.1 hypothetical protein CISIN_1g010232mg [Citrus sinensis]    806   0.0  
XP_006473143.1 PREDICTED: uncharacterized protein LOC102610673 [...   803   0.0  
KDO83831.1 hypothetical protein CISIN_1g010232mg [Citrus sinensis]    711   0.0  
GAV68150.1 S1 domain-containing protein [Cephalotus follicularis]     676   0.0  
OAY58518.1 hypothetical protein MANES_02G184000 [Manihot esculenta]   667   0.0  
KJB28949.1 hypothetical protein B456_005G076800 [Gossypium raimo...   659   0.0  
XP_012482380.1 PREDICTED: uncharacterized protein LOC105797016 i...   660   0.0  
XP_017647911.1 PREDICTED: 30S ribosomal protein S1 homolog A [Go...   658   0.0  
XP_018848096.1 PREDICTED: uncharacterized protein LOC109011380 [...   655   0.0  
XP_015579628.1 PREDICTED: uncharacterized protein LOC8281436 iso...   653   0.0  
EOY16872.1 Nucleic acid-binding proteins superfamily isoform 1 [...   650   0.0  
XP_015579629.1 PREDICTED: uncharacterized protein LOC8281436 iso...   650   0.0  
XP_007019649.2 PREDICTED: 30S ribosomal protein S1 homolog A iso...   649   0.0  
EOY16874.1 Nucleic acid-binding proteins superfamily isoform 3, ...   649   0.0  
XP_016693813.1 PREDICTED: LOW QUALITY PROTEIN: 30S ribosomal pro...   646   0.0  
EOY16873.1 Nucleic acid-binding proteins superfamily isoform 2 [...   644   0.0  
XP_007019648.2 PREDICTED: 30S ribosomal protein S1 homolog A iso...   643   0.0  
OMO98749.1 hypothetical protein COLO4_13726 [Corchorus olitorius]     643   0.0  
OMO89013.1 hypothetical protein CCACVL1_08061 [Corchorus capsula...   642   0.0  

>XP_006434539.1 hypothetical protein CICLE_v10000894mg [Citrus clementina] ESR47779.1
            hypothetical protein CICLE_v10000894mg [Citrus
            clementina]
          Length = 514

 Score =  809 bits (2089), Expect = 0.0
 Identities = 420/515 (81%), Positives = 441/515 (85%), Gaps = 2/515 (0%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAASFRFLNSTQIVF 1658
            MQTL+QPCKSFTF LLNS  PL+T SFVQNGNTKYPL + R  HSFAASFRFL ST IVF
Sbjct: 1    MQTLVQPCKSFTFTLLNSAPPLSTCSFVQNGNTKYPLPQTRNFHSFAASFRFLRSTHIVF 60

Query: 1657 CSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVSKVDKESEKPDEEE 1478
            CS+ D FDD SS Q PENVEN+G+          KP+LVPISNGV S+VDK+SEKPDEEE
Sbjct: 61   CSQKDVFDDLSSAQFPENVENEGLEGNEELELLNKPNLVPISNGVASEVDKKSEKPDEEE 120

Query: 1477 ALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYYEPKPGDFVVGV 1298
            ALAPFLKFFKPRD                E+I VDDKV E +KVSVEYYEPKPGDFV+GV
Sbjct: 121  ALAPFLKFFKPRDSAEEVEEEGSEVGVSRESIGVDDKVGE-DKVSVEYYEPKPGDFVIGV 179

Query: 1297 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFMVRGKMGIVKDD 1118
            VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEM+FLLCDLK DAEEFMVRGKMGIVKDD
Sbjct: 180  VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMDFLLCDLKKDAEEFMVRGKMGIVKDD 239

Query: 1117 DA--MSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 944
            DA  MSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP
Sbjct: 240  DAIAMSGGSGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 299

Query: 943  IEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQIARINEANND 764
            IEVK TEWNTGGLLTRIEGLRAFLPKAELL RVNNFTELKEKVGR +YVQI RINE  ND
Sbjct: 300  IEVKFTEWNTGGLLTRIEGLRAFLPKAELLSRVNNFTELKEKVGRRMYVQITRINEDTND 359

Query: 763  LILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNMSRTRVTSVGD 584
            LILSEREAWA LNLREGTLLEGTVKKI+PYGAQIRIGD+NRSGLLHISNMSRTRVTSV D
Sbjct: 360  LILSEREAWATLNLREGTLLEGTVKKIYPYGAQIRIGDSNRSGLLHISNMSRTRVTSVSD 419

Query: 583  XXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMAKKYRQKLPAV 404
                         KSMFPDKISLSIADLESEPGLFVS+KERVFSEAEEMAKKYRQKLPAV
Sbjct: 420  LLNEGERVKVLVVKSMFPDKISLSIADLESEPGLFVSDKERVFSEAEEMAKKYRQKLPAV 479

Query: 403  SVSPMSEAPPNDALPFDSEASMCANWKWFKFEKDT 299
            SVSP SE+ P D LPFDSEASMCANWKWF+FE+D+
Sbjct: 480  SVSPKSESLPTDTLPFDSEASMCANWKWFRFEQDS 514


>KDO83830.1 hypothetical protein CISIN_1g010232mg [Citrus sinensis]
          Length = 514

 Score =  806 bits (2081), Expect = 0.0
 Identities = 419/515 (81%), Positives = 441/515 (85%), Gaps = 2/515 (0%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAASFRFLNSTQIVF 1658
            MQT +Q CKSFTF LLNS  PL+T SFVQNGN KYPLQ++RK HSFAASFRFL ST IVF
Sbjct: 1    MQTPVQACKSFTFTLLNSAPPLSTCSFVQNGNAKYPLQQSRKFHSFAASFRFLRSTHIVF 60

Query: 1657 CSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVSKVDKESEKPDEEE 1478
            CS+ D FDD SS Q PENVEN+G+          KP+LVPISNGV S+VDK+SEKPDEEE
Sbjct: 61   CSQKDVFDDLSSAQFPENVENEGLEGNEELELLNKPNLVPISNGVASEVDKKSEKPDEEE 120

Query: 1477 ALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYYEPKPGDFVVGV 1298
            ALAPFLKFFKPRD                E+IDVDDKV E +KVSVEYYEPKPGDFV+GV
Sbjct: 121  ALAPFLKFFKPRDSAEEVEEEGSEVGVSRESIDVDDKVGE-DKVSVEYYEPKPGDFVIGV 179

Query: 1297 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFMVRGKMGIVKDD 1118
            VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEM+FLLCDLK DAEEFMVRGKMGIVKDD
Sbjct: 180  VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMDFLLCDLKKDAEEFMVRGKMGIVKDD 239

Query: 1117 DA--MSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 944
            DA  MSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP
Sbjct: 240  DAIAMSGGSGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 299

Query: 943  IEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQIARINEANND 764
            IEVK TEWNTGGLLTRIEGLRAFLPKAELL RVNNFTELKEKVGR +YVQI RINE  ND
Sbjct: 300  IEVKFTEWNTGGLLTRIEGLRAFLPKAELLSRVNNFTELKEKVGRRMYVQITRINEDTND 359

Query: 763  LILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNMSRTRVTSVGD 584
            LILSEREAWA LNLREGTLLEGTVKKI+PYGAQIRIGD+NRSGLLHISNMSRTRVTSV D
Sbjct: 360  LILSEREAWATLNLREGTLLEGTVKKIYPYGAQIRIGDSNRSGLLHISNMSRTRVTSVSD 419

Query: 583  XXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMAKKYRQKLPAV 404
                         KSMFPDKISLSIADLESEPGLFVS+KERVFSEAEEMAKKYRQKLPAV
Sbjct: 420  LLNEGERVKVLVVKSMFPDKISLSIADLESEPGLFVSDKERVFSEAEEMAKKYRQKLPAV 479

Query: 403  SVSPMSEAPPNDALPFDSEASMCANWKWFKFEKDT 299
            SVSP SE+ P D  PFDSEASMCANWKWF+FE+D+
Sbjct: 480  SVSPKSESLPTDTPPFDSEASMCANWKWFRFEQDS 514


>XP_006473143.1 PREDICTED: uncharacterized protein LOC102610673 [Citrus sinensis]
          Length = 514

 Score =  803 bits (2073), Expect = 0.0
 Identities = 417/515 (80%), Positives = 440/515 (85%), Gaps = 2/515 (0%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAASFRFLNSTQIVF 1658
            MQT +Q CKSFTF LLNS  PL+T SFVQNGN KYPLQ++RK HSFAASFRFL ST IVF
Sbjct: 1    MQTPVQACKSFTFTLLNSAPPLSTCSFVQNGNAKYPLQQSRKFHSFAASFRFLRSTHIVF 60

Query: 1657 CSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVSKVDKESEKPDEEE 1478
            CS+ D FDD SSTQ PENVEN+G           KP+LVPISNG+ S+VDK+ EKPDEEE
Sbjct: 61   CSQKDVFDDLSSTQFPENVENEGFEGNEELELLNKPNLVPISNGIASEVDKKFEKPDEEE 120

Query: 1477 ALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYYEPKPGDFVVGV 1298
            ALAPFLKFFKPRD                E+IDVDDKVEE +KVSVEYYEPKPGDFV+GV
Sbjct: 121  ALAPFLKFFKPRDSAEEVEEEESEVGVSRESIDVDDKVEE-DKVSVEYYEPKPGDFVIGV 179

Query: 1297 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFMVRGKMGIVKDD 1118
            VVSGNENKLDVNV ADLLGTMLTKEVLPLYDKEM+FLLCDLK DAEEFMVRGKMGIVKDD
Sbjct: 180  VVSGNENKLDVNVAADLLGTMLTKEVLPLYDKEMDFLLCDLKKDAEEFMVRGKMGIVKDD 239

Query: 1117 DA--MSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 944
            DA  MSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQL+EP
Sbjct: 240  DAIAMSGGSGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLHEP 299

Query: 943  IEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQIARINEANND 764
            IEVK TEWNTGGLLTRIEGLRAFLPKAELL RVNNFTELKEKVGR +YVQI RINE  ND
Sbjct: 300  IEVKFTEWNTGGLLTRIEGLRAFLPKAELLSRVNNFTELKEKVGRRMYVQITRINEDTND 359

Query: 763  LILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNMSRTRVTSVGD 584
            LILSEREAWA LNLREGTLLEGTVKKI+PYGAQIRIGD+NRSGLLHISNMSRTRVTSV D
Sbjct: 360  LILSEREAWATLNLREGTLLEGTVKKIYPYGAQIRIGDSNRSGLLHISNMSRTRVTSVSD 419

Query: 583  XXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMAKKYRQKLPAV 404
                         KSMFPDKISLSIADLESEPGLFVS+KERVFSEAEEMAKKYRQKLPAV
Sbjct: 420  LLNEGERVKVLVVKSMFPDKISLSIADLESEPGLFVSDKERVFSEAEEMAKKYRQKLPAV 479

Query: 403  SVSPMSEAPPNDALPFDSEASMCANWKWFKFEKDT 299
            SVSP SE+ P D  PFDSEASMCANWKWF+FE+D+
Sbjct: 480  SVSPKSESLPTDTPPFDSEASMCANWKWFRFEQDS 514


>KDO83831.1 hypothetical protein CISIN_1g010232mg [Citrus sinensis]
          Length = 483

 Score =  711 bits (1834), Expect = 0.0
 Identities = 373/460 (81%), Positives = 391/460 (85%), Gaps = 2/460 (0%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAASFRFLNSTQIVF 1658
            MQT +Q CKSFTF LLNS  PL+T SFVQNGN KYPLQ++RK HSFAASFRFL ST IVF
Sbjct: 1    MQTPVQACKSFTFTLLNSAPPLSTCSFVQNGNAKYPLQQSRKFHSFAASFRFLRSTHIVF 60

Query: 1657 CSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVSKVDKESEKPDEEE 1478
            CS+ D FDD SS Q PENVEN+G+          KP+LVPISNGV S+VDK+SEKPDEEE
Sbjct: 61   CSQKDVFDDLSSAQFPENVENEGLEGNEELELLNKPNLVPISNGVASEVDKKSEKPDEEE 120

Query: 1477 ALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYYEPKPGDFVVGV 1298
            ALAPFLKFFKPRD                E+IDVDDKV E +KVSVEYYEPKPGDFV+GV
Sbjct: 121  ALAPFLKFFKPRDSAEEVEEEGSEVGVSRESIDVDDKVGE-DKVSVEYYEPKPGDFVIGV 179

Query: 1297 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFMVRGKMGIVKDD 1118
            VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEM+FLLCDLK DAEEFMVRGKMGIVKDD
Sbjct: 180  VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMDFLLCDLKKDAEEFMVRGKMGIVKDD 239

Query: 1117 DA--MSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 944
            DA  MSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP
Sbjct: 240  DAIAMSGGSGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRVRQIKQLNEP 299

Query: 943  IEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQIARINEANND 764
            IEVK TEWNTGGLLTRIEGLRAFLPKAELL RVNNFTELKEKVGR +YVQI RINE  ND
Sbjct: 300  IEVKFTEWNTGGLLTRIEGLRAFLPKAELLSRVNNFTELKEKVGRRMYVQITRINEDTND 359

Query: 763  LILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNMSRTRVTSVGD 584
            LILSEREAWA LNLREGTLLEGTVKKI+PYGAQIRIGD+NRSGLLHISNMSRTRVTSV D
Sbjct: 360  LILSEREAWATLNLREGTLLEGTVKKIYPYGAQIRIGDSNRSGLLHISNMSRTRVTSVSD 419

Query: 583  XXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKE 464
                         KSMFPDKISLSIADLESEPGLFVS+KE
Sbjct: 420  LLNEGERVKVLVVKSMFPDKISLSIADLESEPGLFVSDKE 459


>GAV68150.1 S1 domain-containing protein [Cephalotus follicularis]
          Length = 538

 Score =  676 bits (1745), Expect = 0.0
 Identities = 360/535 (67%), Positives = 411/535 (76%), Gaps = 23/535 (4%)
 Frame = -2

Query: 1837 MQTLLQPCKSFT---FNLLNSTTPL---NTFSFVQNGN-------TKYPLQKARKSHSFA 1697
            MQTLLQPCKS +   ++  +S +PL   + F+ + N          K PLQK    H++A
Sbjct: 1    MQTLLQPCKSLSILKYSSSSSPSPLPSLHNFTLICNDAIHNKAYLNKCPLQKRDHFHAYA 60

Query: 1696 A----------SFRFLNSTQIVFCSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPD 1547
                        FRF  ST +VFCS+ND FDDFSSTQLP+ +END            KP 
Sbjct: 61   TVIYPKLVYAKRFRFWRSTPLVFCSQNDVFDDFSSTQLPKKLENDRNQDAEELELLSKPS 120

Query: 1546 LVPISNGVVSKVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDK 1367
             VPI+NG   KVD+ES+KPDE+E LAPFL+FFKPR+                EN + D K
Sbjct: 121  PVPINNGSALKVDEESQKPDEDEVLAPFLRFFKPRNSFEEVEEEGSGLEVNKEN-NGDSK 179

Query: 1366 VEECNKVSVEYYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFL 1187
             EE  KVSVE YEPKPGD VVGVVVSGNENKLDVNVGAD LGTMLTKEVLPLYDKEME+L
Sbjct: 180  KEESKKVSVECYEPKPGDCVVGVVVSGNENKLDVNVGADFLGTMLTKEVLPLYDKEMEYL 239

Query: 1186 LCDLKTDAEEFMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLST 1007
            LCDLK DAEEFMV+GKMGIVKDD+A+SGG  PG PVVETGTVLFAEVLGRTLSGRPLLS+
Sbjct: 240  LCDLKRDAEEFMVQGKMGIVKDDNALSGGPAPGAPVVETGTVLFAEVLGRTLSGRPLLSS 299

Query: 1006 RRLFRKMAWHRVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTEL 827
            RRLFR+M+WHRVRQIK+L+EPIEVK+TEWNTGGLLTRIEGLRAFLPK EL+KRVN FTEL
Sbjct: 300  RRLFRQMSWHRVRQIKELDEPIEVKMTEWNTGGLLTRIEGLRAFLPKVELMKRVNTFTEL 359

Query: 826  KEKVGRHIYVQIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDT 647
            KE VGR +YVQI R+NE NNDL+LSEREAW  L+L+ GTLLEGTVKKIFPYGAQIRI +T
Sbjct: 360  KENVGRRMYVQITRVNEDNNDLMLSEREAWERLHLQVGTLLEGTVKKIFPYGAQIRISET 419

Query: 646  NRSGLLHISNMSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNK 467
            NRSGLLHISN+SR R+TSV D             KS+FPDKISLSIADLESEPGLF+SNK
Sbjct: 420  NRSGLLHISNISRNRITSVSDLLKVDEKVKVLVVKSVFPDKISLSIADLESEPGLFISNK 479

Query: 466  ERVFSEAEEMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            E+VFSEAEEMAKKYRQKLPA+S    S +PP++ALP+++EAS+ ANWKWFKFE D
Sbjct: 480  EKVFSEAEEMAKKYRQKLPAISAPKKSTSPPSNALPYENEASLYANWKWFKFEMD 534


>OAY58518.1 hypothetical protein MANES_02G184000 [Manihot esculenta]
          Length = 523

 Score =  667 bits (1720), Expect = 0.0
 Identities = 352/525 (67%), Positives = 403/525 (76%), Gaps = 13/525 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPL--NTFSFVQNGNTK-----------YPLQKARKSHSFA 1697
            MQTLLQPCKS    L NS+ PL  N      NG              +P+  + K+ SF+
Sbjct: 1    MQTLLQPCKSVP--LFNSSLPLGINNCLLKYNGPIHTTCLKFSSFHLFPVTGSLKALSFS 58

Query: 1696 ASFRFLNSTQIVFCSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVS 1517
             +F    ST I FC +ND FDDFSSTQ+PE  +N  +          KP  V I+NGV  
Sbjct: 59   ENFPLWRSTHISFCCQNDAFDDFSSTQIPEGAQNYRIQENEELELLNKPSPVAINNGVGL 118

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
            +V+KESE  ++EEALAPFLKFFKPRD                   +++++ +E  KV V+
Sbjct: 119  EVEKESETNNKEEALAPFLKFFKPRDSLEEVNEEEDDSSVVEGKSNLNNEDKEAKKVKVD 178

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGDFVVGVVVSGNENKLD+NVGADLLGTMLTKEVLPLYDKEME+LLCD+  DAE 
Sbjct: 179  YYEPKPGDFVVGVVVSGNENKLDLNVGADLLGTMLTKEVLPLYDKEMEYLLCDMDKDAER 238

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FMVRGK GIVKD+ A+SGG GPGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWH
Sbjct: 239  FMVRGKTGIVKDEAAVSGGGGPGRPVVETGTILFAEVLGRTLSGRPLLSTRRLFRRIAWH 298

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIK+LNEPIEVKITEWNTGGLLTRIEGLRAFLPKAEL+ RVNNF ELKE VGR I V
Sbjct: 299  RVRQIKELNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELMNRVNNFKELKENVGRRINV 358

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
             I RINEANNDLILSEREAW +LNL+EGTLLEGTVKKIF YGAQ+RIG+TNRSGLLHISN
Sbjct: 359  LITRINEANNDLILSEREAWEMLNLKEGTLLEGTVKKIFSYGAQVRIGETNRSGLLHISN 418

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            +SR+R+T+V +             KSMFPDKISLSIADLESEPGLF+SNKERVF+EAEEM
Sbjct: 419  ISRSRITAVNELLKVDEKVKALVVKSMFPDKISLSIADLESEPGLFISNKERVFAEAEEM 478

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQKLPAV  +  SE P N+A PF+SEA+M ANWKWF+FE++
Sbjct: 479  AKKYRQKLPAVLTTRKSETPLNNAFPFESEATMYANWKWFRFERE 523


>KJB28949.1 hypothetical protein B456_005G076800 [Gossypium raimondii] KJB28950.1
            hypothetical protein B456_005G076800 [Gossypium
            raimondii] KJB28952.1 hypothetical protein
            B456_005G076800 [Gossypium raimondii]
          Length = 514

 Score =  659 bits (1701), Expect = 0.0
 Identities = 346/524 (66%), Positives = 401/524 (76%), Gaps = 12/524 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-----------KSHSFAAS 1691
            MQTLLQPCKS +F  LN ++    F    NG  K+     R           K+ SF+  
Sbjct: 1    MQTLLQPCKSLSF--LNFSSQYFAF----NGAPKWQYSVKRTCYSITAAVTPKALSFSRK 54

Query: 1690 FRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVSK 1514
            + FL STQIV CS+ND FD+FSSTQ PE  END G+          KP   P++NG VS 
Sbjct: 55   YMFLRSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVSD 114

Query: 1513 VDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEY 1334
            VDKESEKPD+EE L PFLKFF+P +                   D ++K++E  KV VEY
Sbjct: 115  VDKESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELE--------DSEEKIDEVKKVGVEY 166

Query: 1333 YEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEF 1154
            YEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CDL+ +AEEF
Sbjct: 167  YEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEEF 226

Query: 1153 MVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHR 974
            MV GKMGIVKDDDAMSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTR+LFR++AWHR
Sbjct: 227  MVYGKMGIVKDDDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWHR 286

Query: 973  VRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQ 794
            VRQIK LNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELK  VGR ++V+
Sbjct: 287  VRQIKHLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKGYVGRRMHVK 346

Query: 793  IARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNM 614
            + RINEANNDLILSEREAW +++LR+GTL+EGTV KI PYGAQ+RI D+NRSGLLHISNM
Sbjct: 347  VTRINEANNDLILSEREAWEMMHLRDGTLVEGTVVKILPYGAQVRIADSNRSGLLHISNM 406

Query: 613  SRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMA 434
            S++R+TSV +             KS+FPDKISLS A+LESEPGLF+ NKERVFSEAEEMA
Sbjct: 407  SKSRITSVAELLKEDEKVKVLVVKSLFPDKISLSTAELESEPGLFILNKERVFSEAEEMA 466

Query: 433  KKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            KKYRQ LPAVS    +E  P DAL F++E S+ ANWKWFKFE++
Sbjct: 467  KKYRQSLPAVSAPRNTEPLPADALSFENEESLYANWKWFKFERE 510


>XP_012482380.1 PREDICTED: uncharacterized protein LOC105797016 isoform X1 [Gossypium
            raimondii]
          Length = 550

 Score =  660 bits (1703), Expect = 0.0
 Identities = 346/525 (65%), Positives = 402/525 (76%), Gaps = 12/525 (2%)
 Frame = -2

Query: 1840 KMQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-----------KSHSFAA 1694
            +MQTLLQPCKS +F  LN ++    F    NG  K+     R           K+ SF+ 
Sbjct: 36   RMQTLLQPCKSLSF--LNFSSQYFAF----NGAPKWQYSVKRTCYSITAAVTPKALSFSR 89

Query: 1693 SFRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVS 1517
             + FL STQIV CS+ND FD+FSSTQ PE  END G+          KP   P++NG VS
Sbjct: 90   KYMFLRSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVS 149

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
             VDKESEKPD+EE L PFLKFF+P +                   D ++K++E  KV VE
Sbjct: 150  DVDKESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELE--------DSEEKIDEVKKVGVE 201

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CDL+ +AEE
Sbjct: 202  YYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEE 261

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FMV GKMGIVKDDDAMSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTR+LFR++AWH
Sbjct: 262  FMVYGKMGIVKDDDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWH 321

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIK LNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELK  VGR ++V
Sbjct: 322  RVRQIKHLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKGYVGRRMHV 381

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
            ++ RINEANNDLILSEREAW +++LR+GTL+EGTV KI PYGAQ+RI D+NRSGLLHISN
Sbjct: 382  KVTRINEANNDLILSEREAWEMMHLRDGTLVEGTVVKILPYGAQVRIADSNRSGLLHISN 441

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            MS++R+TSV +             KS+FPDKISLS A+LESEPGLF+ NKERVFSEAEEM
Sbjct: 442  MSKSRITSVAELLKEDEKVKVLVVKSLFPDKISLSTAELESEPGLFILNKERVFSEAEEM 501

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQ LPAVS    +E  P DAL F++E S+ ANWKWFKFE++
Sbjct: 502  AKKYRQSLPAVSAPRNTEPLPADALSFENEESLYANWKWFKFERE 546


>XP_017647911.1 PREDICTED: 30S ribosomal protein S1 homolog A [Gossypium arboreum]
            KHG28109.1 30S ribosomal S1, chloroplastic [Gossypium
            arboreum]
          Length = 550

 Score =  658 bits (1698), Expect = 0.0
 Identities = 346/525 (65%), Positives = 400/525 (76%), Gaps = 12/525 (2%)
 Frame = -2

Query: 1840 KMQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-----------KSHSFAA 1694
            +MQTLLQPCKS +F  LN ++    F    NG  K+     R           K+ SF  
Sbjct: 36   RMQTLLQPCKSLSF--LNFSSQYFAF----NGAPKWQYSVKRTCNSITAAGTPKALSFPR 89

Query: 1693 SFRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVS 1517
             + FL STQIV CS+ND FD+FSSTQLPE  END G+          KP   P++NG VS
Sbjct: 90   KYTFLRSTQIVLCSQNDTFDEFSSTQLPERFENDSGIEENEELELLNKPSPAPVNNGFVS 149

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
             VDKESEKPD+EE L PFLKFF+P +                   D ++K++E  KV VE
Sbjct: 150  DVDKESEKPDKEEVLEPFLKFFRPSEPLQVEGRGELE--------DSEEKIDEVKKVGVE 201

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CDL+  AEE
Sbjct: 202  YYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLMCDLENKAEE 261

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FM  GKMGIVKDDDAMSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTR+LFR++AWH
Sbjct: 262  FMFYGKMGIVKDDDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWH 321

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIK LNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELK  VGR ++V
Sbjct: 322  RVRQIKHLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKGYVGRRMHV 381

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
            +++RINEANNDLILSEREAW +++LR+GTL+EGTV KI PYGAQ+RI D+NRSGLLHISN
Sbjct: 382  KVSRINEANNDLILSEREAWEMMHLRDGTLVEGTVVKILPYGAQVRIADSNRSGLLHISN 441

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            MS+TR+TSV +             KS+FPDKISLS A+LESEPGLF+ NKERVFSEAEEM
Sbjct: 442  MSKTRITSVAELLKEDEKVKVLVVKSLFPDKISLSTAELESEPGLFILNKERVFSEAEEM 501

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQ LPAV     +E  P DAL F++E S+ ANWKWFKFE++
Sbjct: 502  AKKYRQSLPAVYAPRNTEPLPADALSFENEESLYANWKWFKFERE 546


>XP_018848096.1 PREDICTED: uncharacterized protein LOC109011380 [Juglans regia]
          Length = 531

 Score =  655 bits (1691), Expect = 0.0
 Identities = 353/531 (66%), Positives = 395/531 (74%), Gaps = 19/531 (3%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQN--------GNTKYPLQKARKSHSFAA---- 1694
            M  LLQPCKS +    N + P+N+F+ V N           K+P  K  K HSF+     
Sbjct: 1    MTILLQPCKSLSSP--NYSLPVNSFTAVCNTVQNKAYPSIPKWPALKPSKFHSFSTLGPL 58

Query: 1693 -SFRFLN------STQIVFCSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPI 1535
             +  F N       T++ FCS+N+ FD FSSTQ+P+  E+             KP  +PI
Sbjct: 59   QNLGFTNHSPSRRGTRVAFCSRNEVFDGFSSTQVPQKPESKDGKGNEELELLNKPSPIPI 118

Query: 1534 SNGVVSKVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEEC 1355
             NGV S++DKE EKP +EEAL PFLKFFK RD                   DVDDK EE 
Sbjct: 119  DNGVSSELDKECEKPGKEEALEPFLKFFKGRDDEAEVKEEAGVLEE---KFDVDDKYEEA 175

Query: 1354 NKVSVEYYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDL 1175
            NKV VEYYEPKPGDFVVGVVVSGNENKLD+NVGADLLGTMLTKEVLPLYDKEME+LLCD+
Sbjct: 176  NKVGVEYYEPKPGDFVVGVVVSGNENKLDLNVGADLLGTMLTKEVLPLYDKEMEYLLCDM 235

Query: 1174 KTDAEEFMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLF 995
              DA EFMVRGKMGIVK+++A+  G  PGRPVVETGTVLFAEVLGRTLSGRPLLSTRR F
Sbjct: 236  DRDANEFMVRGKMGIVKNNEALGRGSAPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRFF 295

Query: 994  RKMAWHRVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKV 815
            R+++WHRVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELL RVNNF+ELKE V
Sbjct: 296  RRISWHRVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLNRVNNFSELKENV 355

Query: 814  GRHIYVQIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSG 635
            GR IYVQI RI+E  NDLILSEREAW  L L EGTLLEGTVKKIFPYGAQIRIG+TNRSG
Sbjct: 356  GRQIYVQITRIDETKNDLILSEREAWEKLYLSEGTLLEGTVKKIFPYGAQIRIGETNRSG 415

Query: 634  LLHISNMSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVF 455
            LLHISN++R R+TSV D             KSMFPDKISLSIADLESEPGLFVSNKE+VF
Sbjct: 416  LLHISNITRGRITSVSDLLAVDEKVKVLVAKSMFPDKISLSIADLESEPGLFVSNKEKVF 475

Query: 454  SEAEEMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            +EA  MAKKYR+KLPA      SE  P +ALPFD+EASM ANWKWFKFE++
Sbjct: 476  AEAAVMAKKYREKLPAGLAPYKSELSPTNALPFDNEASMYANWKWFKFERE 526


>XP_015579628.1 PREDICTED: uncharacterized protein LOC8281436 isoform X1 [Ricinus
            communis]
          Length = 523

 Score =  653 bits (1684), Expect = 0.0
 Identities = 345/525 (65%), Positives = 399/525 (76%), Gaps = 13/525 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-------------KSHSFA 1697
            MQTL QPCKS +F   N + PLNT +F+ + N+    +                KS SF+
Sbjct: 1    MQTLFQPCKSLSFQ--NISLPLNTNNFIVSCNSSVHAKCFHSSSFQLFSSIGLLKSLSFS 58

Query: 1696 ASFRFLNSTQIVFCSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVS 1517
             +     STQI FCS+ND FD+ SSTQ+PE  E D +          KP  V I+NG+  
Sbjct: 59   KNSPLWRSTQISFCSQNDTFDNLSSTQVPEEQEEDRIRENEELELLNKPSPVVINNGLSI 118

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
            +VDKESE  D++EALAPFLKFF+PRD                 N + +++ +E  KV+V+
Sbjct: 119  EVDKESETIDKDEALAPFLKFFEPRDSLEEIKEEGKELGVIEGNSNGNNEDKEAKKVNVD 178

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGDFVVGVVVSGNE+KLDVNVGADLLGTMLTKEVLPLYDKEME+LLCD++ DAE 
Sbjct: 179  YYEPKPGDFVVGVVVSGNESKLDVNVGADLLGTMLTKEVLPLYDKEMEYLLCDMEKDAER 238

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FMVRGK+GI+KD+ AMSGG G GRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWH
Sbjct: 239  FMVRGKIGIIKDEAAMSGGPGLGRPVVETGTILFAEVLGRTLSGRPLLSTRRLFRRIAWH 298

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIK+LNEPIEV+ITEWNTGGLLTRIEGLRAFLPKAEL+ RV NF ELKE V R I V
Sbjct: 299  RVRQIKELNEPIEVRITEWNTGGLLTRIEGLRAFLPKAELMNRVKNFKELKENVSRRINV 358

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
             I RINE NN+LILSEREAW +LNLREGTLLEG V+KIFPYGAQ+RIG+TNRSGLLHISN
Sbjct: 359  LITRINEDNNELILSEREAWEMLNLREGTLLEGNVRKIFPYGAQVRIGETNRSGLLHISN 418

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            ++R+RVT+V D             KSMFPDKISLSIADLESEPGLFVSNKE+VF+EAEEM
Sbjct: 419  ITRSRVTAVSDLLKVDERVKVLVVKSMFPDKISLSIADLESEPGLFVSNKEKVFAEAEEM 478

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQKLPAV  +  S  P +  L FD EA+M ANWKWFKFE+D
Sbjct: 479  AKKYRQKLPAVLATRKSATPLSSTLTFDDEATMYANWKWFKFERD 523


>EOY16872.1 Nucleic acid-binding proteins superfamily isoform 1 [Theobroma cacao]
          Length = 512

 Score =  650 bits (1678), Expect = 0.0
 Identities = 355/525 (67%), Positives = 398/525 (75%), Gaps = 13/525 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYP--LQKARKSHSF-------AASFR 1685
            MQTLLQPCKSF F  LNS T      F  NG  K    ++    S+SF       A SF 
Sbjct: 1    MQTLLQPCKSFPF--LNSLTQY----FSLNGAPKCQCSVKTTPSSYSFTTTGTPGALSFS 54

Query: 1684 ---FLNSTQIVFCSKNDFFDDFSSTQLPENVENDG-VXXXXXXXXXXKPDLVPISNGVVS 1517
               FL ST+IVFCS+ND FD+FSSTQLPE++END  +          KP  VP++NG  +
Sbjct: 55   KITFLRSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAA 114

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
             V    EKPD++EAL PFLKFF+P +                     ++K  E  KV VE
Sbjct: 115  DV----EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVS-------EEKSNEFKKVGVE 163

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLKT+AEE
Sbjct: 164  YYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEE 223

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FM  GKMGIVKDDDAMSGG  PGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWH
Sbjct: 224  FMGYGKMGIVKDDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWH 283

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIKQLNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELKE VG  +YV
Sbjct: 284  RVRQIKQLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKEYVGCRMYV 343

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
            +I RINEANNDLI+SEREAW +L+LR+GTLLEG V KI PYGAQ+RIGD+NRSGLLHISN
Sbjct: 344  KITRINEANNDLIMSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGDSNRSGLLHISN 403

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            MS+TR+TSV +             KS+FPDKISLS ADLESEPGLF+SNKERVFSEAEEM
Sbjct: 404  MSKTRITSVAELLKEGEKIKVLVVKSLFPDKISLSTADLESEPGLFISNKERVFSEAEEM 463

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQ LPAVS     E  P DALPFD+E S+  NWKWFKFE++
Sbjct: 464  AKKYRQNLPAVSAPRNVEPLPTDALPFDNEESLYVNWKWFKFERE 508


>XP_015579629.1 PREDICTED: uncharacterized protein LOC8281436 isoform X2 [Ricinus
            communis]
          Length = 523

 Score =  650 bits (1678), Expect = 0.0
 Identities = 344/525 (65%), Positives = 398/525 (75%), Gaps = 13/525 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-------------KSHSFA 1697
            MQTL QPCKS +F   N + PLNT +F+ + N+    +                KS SF+
Sbjct: 1    MQTLFQPCKSLSFQ--NISLPLNTNNFIVSCNSSVHAKCFHSSSFQLFSSIGLLKSLSFS 58

Query: 1696 ASFRFLNSTQIVFCSKNDFFDDFSSTQLPENVENDGVXXXXXXXXXXKPDLVPISNGVVS 1517
             +     STQI FCS+ND FD+ SSTQ+PE  E D +          KP  V I+NG+  
Sbjct: 59   KNSPLWRSTQISFCSQNDTFDNLSSTQVPEEQEEDRIRENEELELLNKPSPVVINNGLSI 118

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
            +VDKESE  D++EALAPFLKFF+PRD                 N + +++ +E  KV+V+
Sbjct: 119  EVDKESETIDKDEALAPFLKFFEPRDSLEEIKEEGKELGVIEGNSNGNNEDKEAKKVNVD 178

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGDFVVGVVVSGNE+KLDVNVGADLLGTMLTKEVLPLYDKEME+LLCD++ DAE 
Sbjct: 179  YYEPKPGDFVVGVVVSGNESKLDVNVGADLLGTMLTKEVLPLYDKEMEYLLCDMEKDAER 238

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FMVRGK+GI+KD+ AMSGG G GRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWH
Sbjct: 239  FMVRGKIGIIKDEAAMSGGPGLGRPVVETGTILFAEVLGRTLSGRPLLSTRRLFRRIAWH 298

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RV QIK+LNEPIEV+ITEWNTGGLLTRIEGLRAFLPKAEL+ RV NF ELKE V R I V
Sbjct: 299  RVSQIKELNEPIEVRITEWNTGGLLTRIEGLRAFLPKAELMNRVKNFKELKENVSRRINV 358

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
             I RINE NN+LILSEREAW +LNLREGTLLEG V+KIFPYGAQ+RIG+TNRSGLLHISN
Sbjct: 359  LITRINEDNNELILSEREAWEMLNLREGTLLEGNVRKIFPYGAQVRIGETNRSGLLHISN 418

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEM 437
            ++R+RVT+V D             KSMFPDKISLSIADLESEPGLFVSNKE+VF+EAEEM
Sbjct: 419  ITRSRVTAVSDLLKVDERVKVLVVKSMFPDKISLSIADLESEPGLFVSNKEKVFAEAEEM 478

Query: 436  AKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            AKKYRQKLPAV  +  S  P +  L FD EA+M ANWKWFKFE+D
Sbjct: 479  AKKYRQKLPAVLATRKSATPLSSTLTFDDEATMYANWKWFKFERD 523


>XP_007019649.2 PREDICTED: 30S ribosomal protein S1 homolog A isoform X2 [Theobroma
            cacao]
          Length = 512

 Score =  649 bits (1675), Expect = 0.0
 Identities = 353/523 (67%), Positives = 396/523 (75%), Gaps = 11/523 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSF-------AASFR-- 1685
            MQTLLQPCKSF F  LNS T    FS       K  ++    S+SF       A SF   
Sbjct: 1    MQTLLQPCKSFPF--LNSLTQY--FSLNGAPKCKCSVKTTPSSYSFTTTGTPGALSFSKI 56

Query: 1684 -FLNSTQIVFCSKNDFFDDFSSTQLPENVENDG-VXXXXXXXXXXKPDLVPISNGVVSKV 1511
             FL ST+IVFCS+ND FD+FSSTQLPE++END  +          KP  VP++NG  + V
Sbjct: 57   TFLRSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAADV 116

Query: 1510 DKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYY 1331
                EKPD++EAL PFLKFF+P +                     ++K  E  KV VEYY
Sbjct: 117  ----EKPDKDEALEPFLKFFRPGESLEVEEGGGELGVS-------EEKSNEFKKVGVEYY 165

Query: 1330 EPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFM 1151
            EPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLKT+ EEFM
Sbjct: 166  EPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNVEEFM 225

Query: 1150 VRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRV 971
              GKMGIVKDDDAMSGG  PGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWHRV
Sbjct: 226  GYGKMGIVKDDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHRV 285

Query: 970  RQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQI 791
            RQIKQL+EPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELKE VG  +YV+I
Sbjct: 286  RQIKQLDEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKEYVGCRMYVKI 345

Query: 790  ARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNMS 611
             RINEANNDLILSEREAW +L+LR+GTLLEG V KI PYGAQ+RIGD+NRSGLLHISNMS
Sbjct: 346  TRINEANNDLILSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGDSNRSGLLHISNMS 405

Query: 610  RTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMAK 431
            +TR+TSV +             KS+FPDKISLS ADLESEPGLF+SNKERVFSEAEEMAK
Sbjct: 406  KTRITSVAELLKEGEKIKVLVVKSLFPDKISLSTADLESEPGLFISNKERVFSEAEEMAK 465

Query: 430  KYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            KYRQ LPAVS     E  P DALPFD+E S+  NWKWFKFE++
Sbjct: 466  KYRQNLPAVSAPRNVEPLPTDALPFDNEESLYVNWKWFKFERE 508


>EOY16874.1 Nucleic acid-binding proteins superfamily isoform 3, partial
            [Theobroma cacao]
          Length = 511

 Score =  649 bits (1673), Expect = 0.0
 Identities = 354/524 (67%), Positives = 397/524 (75%), Gaps = 13/524 (2%)
 Frame = -2

Query: 1834 QTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYP--LQKARKSHSF-------AASFR- 1685
            QTLLQPCKSF F  LNS T      F  NG  K    ++    S+SF       A SF  
Sbjct: 1    QTLLQPCKSFPF--LNSLTQY----FSLNGAPKCQCSVKTTPSSYSFTTTGTPGALSFSK 54

Query: 1684 --FLNSTQIVFCSKNDFFDDFSSTQLPENVENDG-VXXXXXXXXXXKPDLVPISNGVVSK 1514
              FL ST+IVFCS+ND FD+FSSTQLPE++END  +          KP  VP++NG  + 
Sbjct: 55   ITFLRSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAAD 114

Query: 1513 VDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEY 1334
            V    EKPD++EAL PFLKFF+P +                     ++K  E  KV VEY
Sbjct: 115  V----EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVS-------EEKSNEFKKVGVEY 163

Query: 1333 YEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEF 1154
            YEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLKT+AEEF
Sbjct: 164  YEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEEF 223

Query: 1153 MVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHR 974
            M  GKMGIVKDDDAMSGG  PGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWHR
Sbjct: 224  MGYGKMGIVKDDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHR 283

Query: 973  VRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQ 794
            VRQIKQLNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELKE VG  +YV+
Sbjct: 284  VRQIKQLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKEYVGCRMYVK 343

Query: 793  IARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNM 614
            I RINEANNDLI+SEREAW +L+LR+GTLLEG V KI PYGAQ+RIGD+NRSGLLHISNM
Sbjct: 344  ITRINEANNDLIMSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGDSNRSGLLHISNM 403

Query: 613  SRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMA 434
            S+TR+TSV +             KS+FPDKISLS ADLESEPGLF+SNKERVFSEAEEMA
Sbjct: 404  SKTRITSVAELLKEGEKIKVLVVKSLFPDKISLSTADLESEPGLFISNKERVFSEAEEMA 463

Query: 433  KKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            KKYRQ LPAVS     E  P DALPFD+E S+  NWKWFKFE++
Sbjct: 464  KKYRQNLPAVSAPRNVEPLPTDALPFDNEESLYVNWKWFKFERE 507


>XP_016693813.1 PREDICTED: LOW QUALITY PROTEIN: 30S ribosomal protein S1 homolog A
            [Gossypium hirsutum]
          Length = 552

 Score =  646 bits (1667), Expect = 0.0
 Identities = 342/527 (64%), Positives = 398/527 (75%), Gaps = 14/527 (2%)
 Frame = -2

Query: 1840 KMQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKAR-----------KSHSFAA 1694
            +MQTLLQPCKS +F  LN ++    F    NG  K+     R           K+ SF+ 
Sbjct: 36   RMQTLLQPCKSLSF--LNFSSQYFAF----NGAPKWQYSVKRTCYSITAAGTPKALSFSR 89

Query: 1693 SFRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVS 1517
             + FL STQIV CS+ND FD+FSSTQ PE  END G+          KP   P++NG VS
Sbjct: 90   KYMFLRSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVS 149

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
             VDKESEKPD+EE L PFLKFF+P +                   D ++K++E  KV VE
Sbjct: 150  DVDKESEKPDKEEVLEPFLKFFRPSEPLEVEEGGELE--------DSEEKIDEVKKVGVE 201

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CDL+  AEE
Sbjct: 202  YYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENKAEE 261

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FMV GKMGIVKDDDAMSGG GPGRPVVETGTVLFAEVLGRTLSGRPLLSTR+LFR++AWH
Sbjct: 262  FMVYGKMGIVKDDDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWH 321

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYV 797
            RVRQIK LNEP+E K  EWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELK  VGR ++V
Sbjct: 322  RVRQIKHLNEPVEGKFREWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKGYVGRRMHV 381

Query: 796  QIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISN 617
            ++ RINEANNDLILSEREAW +++LR+GTL+EGTV KI PYGAQ+RI D+NRSGLLHISN
Sbjct: 382  KVTRINEANNDLILSEREAWEMMHLRDGTLVEGTVVKILPYGAQVRIADSNRSGLLHISN 441

Query: 616  MSRTRVTSVGDXXXXXXXXXXXXXKSMFP--DKISLSIADLESEPGLFVSNKERVFSEAE 443
            MS++R+TSV +             KS+FP   KISLS A+LESEPGLF+ NKERVFSEAE
Sbjct: 442  MSKSRITSVAELLKEDEKVKVLVVKSLFPXXXKISLSTAELESEPGLFILNKERVFSEAE 501

Query: 442  EMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            EMAKKYRQ LPAVS    +E  P DAL F++E S+ ANWKWFKFE++
Sbjct: 502  EMAKKYRQSLPAVSAPRNTEPLPADALSFENEESLYANWKWFKFERE 548


>EOY16873.1 Nucleic acid-binding proteins superfamily isoform 2 [Theobroma cacao]
          Length = 521

 Score =  644 bits (1661), Expect = 0.0
 Identities = 355/534 (66%), Positives = 399/534 (74%), Gaps = 22/534 (4%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYP--LQKARKSHSF-------AASFR 1685
            MQTLLQPCKSF F  LNS T      F  NG  K    ++    S+SF       A SF 
Sbjct: 1    MQTLLQPCKSFPF--LNSLTQY----FSLNGAPKCQCSVKTTPSSYSFTTTGTPGALSFS 54

Query: 1684 ---FLNSTQIVFCSKNDFFDDFSSTQLPENVENDG-VXXXXXXXXXXKPDLVPISNGVVS 1517
               FL ST+IVFCS+ND FD+FSSTQLPE++END  +          KP  VP++NG  +
Sbjct: 55   KITFLRSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAA 114

Query: 1516 KVDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVE 1337
             V    EKPD++EAL PFLKFF+P +                     ++K  E  KV VE
Sbjct: 115  DV----EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVS-------EEKSNEFKKVGVE 163

Query: 1336 YYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEE 1157
            YYEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLKT+AEE
Sbjct: 164  YYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEE 223

Query: 1156 FMVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWH 977
            FM  GKMGIVKDDDAMSGG  PGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWH
Sbjct: 224  FMGYGKMGIVKDDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWH 283

Query: 976  RVRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKE-------- 821
            RVRQIKQLNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELKE        
Sbjct: 284  RVRQIKQLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKEYTFFGSFM 343

Query: 820  -KVGRHIYVQIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTN 644
             +VG  +YV+I RINEANNDLI+SEREAW +L+LR+GTLLEG V KI PYGAQ+RIGD+N
Sbjct: 344  LQVGCRMYVKITRINEANNDLIMSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGDSN 403

Query: 643  RSGLLHISNMSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKE 464
            RSGLLHISNMS+TR+TSV +             KS+FPDKISLS ADLESEPGLF+SNKE
Sbjct: 404  RSGLLHISNMSKTRITSVAELLKEGEKIKVLVVKSLFPDKISLSTADLESEPGLFISNKE 463

Query: 463  RVFSEAEEMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            RVFSEAEEMAKKYRQ LPAVS     E  P DALPFD+E S+  NWKWFKFE++
Sbjct: 464  RVFSEAEEMAKKYRQNLPAVSAPRNVEPLPTDALPFDNEESLYVNWKWFKFERE 517


>XP_007019648.2 PREDICTED: 30S ribosomal protein S1 homolog A isoform X1 [Theobroma
            cacao]
          Length = 521

 Score =  643 bits (1658), Expect = 0.0
 Identities = 353/532 (66%), Positives = 397/532 (74%), Gaps = 20/532 (3%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSF-------AASFR-- 1685
            MQTLLQPCKSF F  LNS T    FS       K  ++    S+SF       A SF   
Sbjct: 1    MQTLLQPCKSFPF--LNSLTQY--FSLNGAPKCKCSVKTTPSSYSFTTTGTPGALSFSKI 56

Query: 1684 -FLNSTQIVFCSKNDFFDDFSSTQLPENVENDG-VXXXXXXXXXXKPDLVPISNGVVSKV 1511
             FL ST+IVFCS+ND FD+FSSTQLPE++END  +          KP  VP++NG  + V
Sbjct: 57   TFLRSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAADV 116

Query: 1510 DKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEYY 1331
                EKPD++EAL PFLKFF+P +                     ++K  E  KV VEYY
Sbjct: 117  ----EKPDKDEALEPFLKFFRPGESLEVEEGGGELGVS-------EEKSNEFKKVGVEYY 165

Query: 1330 EPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEFM 1151
            EPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLKT+ EEFM
Sbjct: 166  EPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNVEEFM 225

Query: 1150 VRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHRV 971
              GKMGIVKDDDAMSGG  PGRPVVETGT+LFAEVLGRTLSGRPLLSTRRLFR++AWHRV
Sbjct: 226  GYGKMGIVKDDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHRV 285

Query: 970  RQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKE---------K 818
            RQIKQL+EPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF+ELKE         +
Sbjct: 286  RQIKQLDEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSELKEYTFFGSFMLQ 345

Query: 817  VGRHIYVQIARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRS 638
            VG  +YV+I RINEANNDLILSEREAW +L+LR+GTLLEG V KI PYGAQ+RIGD+NRS
Sbjct: 346  VGCRMYVKITRINEANNDLILSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGDSNRS 405

Query: 637  GLLHISNMSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERV 458
            GLLHISNMS+TR+TSV +             KS+FPDKISLS ADLESEPGLF+SNKERV
Sbjct: 406  GLLHISNMSKTRITSVAELLKEGEKIKVLVVKSLFPDKISLSTADLESEPGLFISNKERV 465

Query: 457  FSEAEEMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            FSEAEEMAKKYRQ LPAVS     E  P DALPFD+E S+  NWKWFKFE++
Sbjct: 466  FSEAEEMAKKYRQNLPAVSAPRNVEPLPTDALPFDNEESLYVNWKWFKFERE 517


>OMO98749.1 hypothetical protein COLO4_13726 [Corchorus olitorius]
          Length = 536

 Score =  643 bits (1658), Expect = 0.0
 Identities = 347/540 (64%), Positives = 400/540 (74%), Gaps = 28/540 (5%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAAS----------- 1691
            MQTLLQPCKSF F  LNS++    F+F      +  ++  +  +SFA S           
Sbjct: 1    MQTLLQPCKSFWF--LNSSSQY--FAFSGAPKCQCSVRATQSCYSFAISGTPRAWSISRK 56

Query: 1690 FRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVSK 1514
            F FL STQIV+CS+ND FD+FSST+LPE   ND G+          KP  VP+SNG  S 
Sbjct: 57   FTFLRSTQIVYCSQNDTFDEFSSTKLPERFGNDSGIRENEELELLNKPSPVPMSNGFASD 116

Query: 1513 VDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEY 1334
            +DKESEKPD+EEAL PFLKFF+P +                     +++ +E  KV VEY
Sbjct: 117  IDKESEKPDKEEALEPFLKFFRPSESLEEEEEGEGEGELGVS----EERSDEVTKVGVEY 172

Query: 1333 YEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEF 1154
            YEPKPGD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEM++L CDLK+DAEEF
Sbjct: 173  YEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMKYLSCDLKSDAEEF 232

Query: 1153 MVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHR 974
            MV GK+GIVKDDDAM GG  PGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFR++AWHR
Sbjct: 233  MVYGKIGIVKDDDAMGGGPVPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRRIAWHR 292

Query: 973  VRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQ 794
            VRQIKQLNEPIEVK TEWNTGGLLTRIEGLRAFLPKAEL+KRVNNF++LKE VGR +YV+
Sbjct: 293  VRQIKQLNEPIEVKFTEWNTGGLLTRIEGLRAFLPKAELMKRVNNFSDLKEYVGRRMYVK 352

Query: 793  IARINEANNDLILSEREAWA----------------ILNLREGTLLEGTVKKIFPYGAQI 662
            I RINEANNDLILSEREAWA                +L+LR+GTLLEG V KI PYGAQ+
Sbjct: 353  ITRINEANNDLILSEREAWANNESRHLLMFLKMLQEMLHLRDGTLLEGIVVKILPYGAQV 412

Query: 661  RIGDTNRSGLLHISNMSRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGL 482
            RIG++NRSGLLHISN+S+ R+ SV +             KS+FPDKISLS A+LESEPGL
Sbjct: 413  RIGNSNRSGLLHISNLSKARINSVAELLKEGERIKVLVVKSLFPDKISLSTAELESEPGL 472

Query: 481  FVSNKERVFSEAEEMAKKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            F SNKERVFSEAEEMA+KYRQ LP +  +P +  P  DALPF +E S+ ANWKWFKFE +
Sbjct: 473  FTSNKERVFSEAEEMARKYRQNLPTMP-TPRNAEPLTDALPFHNEESLYANWKWFKFESE 531


>OMO89013.1 hypothetical protein CCACVL1_08061 [Corchorus capsularis]
          Length = 519

 Score =  642 bits (1656), Expect = 0.0
 Identities = 344/524 (65%), Positives = 396/524 (75%), Gaps = 12/524 (2%)
 Frame = -2

Query: 1837 MQTLLQPCKSFTFNLLNSTTPLNTFSFVQNGNTKYPLQKARKSHSFAAS----------- 1691
            MQTLLQPCKSF F  LNS++    F+F      +  ++  +   SFA S           
Sbjct: 1    MQTLLQPCKSFWF--LNSSSQY--FAFNGAPKCQCSVRATQSCSSFAVSGTPRALSISRK 56

Query: 1690 FRFLNSTQIVFCSKNDFFDDFSSTQLPENVEND-GVXXXXXXXXXXKPDLVPISNGVVSK 1514
            F FL STQIVFCS+ND FD+FSST+ PE + ND G+          KP  VP+SNG  S 
Sbjct: 57   FTFLRSTQIVFCSQNDTFDEFSSTKSPERLGNDSGIRENEELELLNKPSPVPMSNGFASD 116

Query: 1513 VDKESEKPDEEEALAPFLKFFKPRDXXXXXXXXXXXXXXXXENIDVDDKVEECNKVSVEY 1334
            + KESEKPD+EEAL PFLKFF+P +                     +++ +E  KV VEY
Sbjct: 117  IVKESEKPDKEEALEPFLKFFRPSESLEEGEGEGEGELGVS-----EERGDEVTKVGVEY 171

Query: 1333 YEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEFLLCDLKTDAEEF 1154
            YEPK GD VVGVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CDLK+DAEEF
Sbjct: 172  YEPKAGDLVVGVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKSDAEEF 231

Query: 1153 MVRGKMGIVKDDDAMSGGLGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRKMAWHR 974
            MV GK+GIVKDDDAM GG  PGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFR++AWHR
Sbjct: 232  MVYGKIGIVKDDDAMGGGPVPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRRIAWHR 291

Query: 973  VRQIKQLNEPIEVKITEWNTGGLLTRIEGLRAFLPKAELLKRVNNFTELKEKVGRHIYVQ 794
            VRQIKQLNEPIEVK TEWNTGGLLTR+EGLRAFLPKAEL+KRVNNF++LKE VGR ++V+
Sbjct: 292  VRQIKQLNEPIEVKFTEWNTGGLLTRLEGLRAFLPKAELMKRVNNFSDLKEYVGRRMHVK 351

Query: 793  IARINEANNDLILSEREAWAILNLREGTLLEGTVKKIFPYGAQIRIGDTNRSGLLHISNM 614
            I RINEANNDLILSEREAW +L+LR+GTLLEG V KI PYGAQ+RIG++NRSGLLHISN+
Sbjct: 352  ITRINEANNDLILSEREAWEMLHLRDGTLLEGIVVKILPYGAQVRIGNSNRSGLLHISNL 411

Query: 613  SRTRVTSVGDXXXXXXXXXXXXXKSMFPDKISLSIADLESEPGLFVSNKERVFSEAEEMA 434
            S+ R+ SV +             KS+FPDKISLS A+LESEPGLF SNKERVFSEAEEMA
Sbjct: 412  SKARINSVAELLKEGERIKVLVVKSLFPDKISLSTAELESEPGLFTSNKERVFSEAEEMA 471

Query: 433  KKYRQKLPAVSVSPMSEAPPNDALPFDSEASMCANWKWFKFEKD 302
            +KYRQ L A   +P +  P  DALPFD+E S+ ANWKWFKFE +
Sbjct: 472  RKYRQNL-ATMPTPRNAEPLTDALPFDNEESLYANWKWFKFESE 514

