BLASTX nr result

ID: Aconitum23_contig00003631 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00003631
         (2337 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010266797.1| PREDICTED: putative uncharacterized protein ...  1013   0.0  
ref|XP_010254674.1| PREDICTED: putative uncharacterized protein ...   989   0.0  
ref|XP_002522599.1| ATP-dependent RNA helicase, putative [Ricinu...   940   0.0  
ref|XP_012069167.1| PREDICTED: putative uncharacterized protein ...   933   0.0  
ref|XP_007047849.1| Helicase domain-containing protein / IBR dom...   923   0.0  
gb|KHG13119.1| hypothetical protein F383_07330 [Gossypium arboreum]   920   0.0  
ref|XP_012455164.1| PREDICTED: putative uncharacterized protein ...   915   0.0  
gb|AGL44347.1| helicase/plant I subfamily protein [Glycine max]       911   0.0  
ref|XP_003552808.1| PREDICTED: putative uncharacterized protein ...   911   0.0  
gb|KHN31399.1| Hypothetical protein glysoja_023053 [Glycine soja]     910   0.0  
ref|XP_010926340.1| PREDICTED: putative uncharacterized protein ...   903   0.0  
gb|KRH31335.1| hypothetical protein GLYMA_11G242300 [Glycine max]     902   0.0  
gb|KRH31334.1| hypothetical protein GLYMA_11G242300 [Glycine max]     902   0.0  
ref|XP_003537562.1| PREDICTED: putative uncharacterized protein ...   902   0.0  
ref|XP_006465847.1| PREDICTED: putative uncharacterized protein ...   902   0.0  
ref|XP_006426318.1| hypothetical protein CICLE_v10024688mg [Citr...   902   0.0  
ref|XP_012469827.1| PREDICTED: putative uncharacterized protein ...   900   0.0  
ref|XP_008782178.1| PREDICTED: LOW QUALITY PROTEIN: putative unc...   898   0.0  
ref|XP_002307067.1| helicase domain-containing family protein [P...   898   0.0  
gb|KHG18071.1| hypothetical protein F383_23000 [Gossypium arboreum]   894   0.0  

>ref|XP_010266797.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Nelumbo nucifera]
          Length = 1748

 Score = 1013 bits (2620), Expect = 0.0
 Identities = 488/738 (66%), Positives = 597/738 (80%), Gaps = 2/738 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGIS 2156
            LYHGG G +PS AL G+G  IRHLEL K+YLTV+V HS+  +++DKELLM FE  VSGIS
Sbjct: 1010 LYHGGSGVSPSFALFGSGAMIRHLELEKRYLTVDVYHSDSSSINDKELLMFFEEHVSGIS 1069

Query: 2155 SFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD-R 1979
             + K+P  GQ+GED+EKWG+I FLTPEAAE+AV+E+ND+E+ GSLLKV PS  +   D R
Sbjct: 1070 GYLKYPAFGQDGEDTEKWGRIGFLTPEAAEKAVAELNDVEYCGSLLKVSPSRTSFATDHR 1129

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDCV 1802
             F+FPAVRA++ WPR+YSKG+A V+CARQD + ++++ S L I GR +RCE S K  D V
Sbjct: 1130 MFSFPAVRAKISWPRRYSKGFAIVRCARQDANFIVNECSNLLIGGRFVRCENSRKYMDSV 1189

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+ KE+SE E+L +LR  T R+I D+ LVRG+A+NN    AACEEAL KEIA F+PS
Sbjct: 1190 VIHGLHKEVSESEILDVLRNATHRRILDVFLVRGDAVNNLSS-AACEEALLKEIASFMPS 1248

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
              PL N   VQVFPPEP+DY++KAVITFDGRL+LEAA AL+HIQGK L GC SWQKIQCQ
Sbjct: 1249 NIPLSNCCRVQVFPPEPKDYLMKAVITFDGRLHLEAAKALQHIQGKALNGCFSWQKIQCQ 1308

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSSV CP  VY ++K +LDSL + F+ +NG    LERNENGSYRVKISANATKTVA+
Sbjct: 1309 QMFHSSVSCPAAVYFVIKTELDSLLKRFEQRNGVYCNLERNENGSYRVKISANATKTVAE 1368

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +R+PLE++++GK +   +LT ++L+LLFSRDGI L+KS+QQETGT+ILYD+Q +NV+IFG
Sbjct: 1369 LRKPLEQLMKGKTINDASLTQSVLQLLFSRDGIMLIKSLQQETGTHILYDRQNMNVRIFG 1428

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
            P++K+  AE +LV++LL L+E+KQLEIHLR  DLPH+LMKEVV KFG DLHGLKEKVP  
Sbjct: 1429 PEDKIAVAERRLVQSLLTLHENKQLEIHLRSGDLPHDLMKEVVGKFGSDLHGLKEKVPGV 1488

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            EL LNTRRH +   G KE K+KVE II + A +L  SGL  + P GE  C ICLC+++DC
Sbjct: 1489 ELTLNTRRHVIYVRGKKELKKKVEEIIYETASTLRRSGLG-IRPSGEDTCSICLCEVEDC 1547

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            ++LEAC H FCR CL++QC+SAIKSH+GFPLCC + GC  PIL+ADLR LL  +KL++LF
Sbjct: 1548 FQLEACAHGFCRLCLVDQCESAIKSHDGFPLCCAYEGCQTPILLADLRCLLSSDKLEELF 1607

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVASSGG YRFCPSPDCPAVY+VA PGT   PF CGAC+VETCTRCHLE+HPY+
Sbjct: 1608 RASLGAFVASSGGTYRFCPSPDCPAVYKVADPGTAGGPFSCGACYVETCTRCHLEYHPYV 1667

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERYK FK+DPDSSLKEWCKGKE VK CPVCGYTIEKVDGCNHIECKCG HICWVCL+ 
Sbjct: 1668 SCERYKMFKEDPDSSLKEWCKGKEHVKHCPVCGYTIEKVDGCNHIECKCGRHICWVCLES 1727

Query: 181  FSTSEDCYDHLRTTHLAI 128
            F +S+DCY HLR+ HLAI
Sbjct: 1728 FHSSDDCYGHLRSVHLAI 1745


>ref|XP_010254674.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Nelumbo nucifera]
          Length = 1728

 Score =  989 bits (2558), Expect = 0.0
 Identities = 475/738 (64%), Positives = 588/738 (79%), Gaps = 2/738 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGIS 2156
            LY GG G +PS AL G G  IRHLEL K+ LTV+V HS+  A++DKELLM  E  VSGIS
Sbjct: 987  LYRGGSGISPSFALFGCGAMIRHLELEKRCLTVDVYHSDASAINDKELLMFLEDHVSGIS 1046

Query: 2155 SFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD-R 1979
             +HK+ G GQEGE +EKWG+ITFLTPE AE+AV+E++ +E+ GSLLK+ PS  +   D R
Sbjct: 1047 GYHKYAGIGQEGEGTEKWGRITFLTPEDAEKAVAELSGVEYCGSLLKISPSRTSFAVDHR 1106

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDCV 1802
             F+FPAVRA++ WPR+YS+G+A V+CA+QD D ++ D S L I GR + CEIS K  DCV
Sbjct: 1107 MFSFPAVRAKIFWPRRYSRGFAVVRCAKQDVDFIVDDCSDLLIGGRYVHCEISNKYMDCV 1166

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKE+SE E+  +LRT T  +I D+ L+RG+A+ +   Y ACEEAL +EIAPF+PS
Sbjct: 1167 VISGLDKEVSESEIFDVLRTATHGRILDVFLLRGDAVESLS-YTACEEALLREIAPFMPS 1225

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
              PL ++  VQVFPPEP+D ++KAVITFDGRL+LEAA AL+HIQGK L GC SWQKIQ Q
Sbjct: 1226 NIPLSSSCQVQVFPPEPKDCLMKAVITFDGRLHLEAAKALQHIQGKALNGCFSWQKIQSQ 1285

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSSV CP  VY ++KRQLDSL  SFKH+ G +  LE+NENGSYRVKISANATKTVA+
Sbjct: 1286 QMFHSSVSCPATVYFVIKRQLDSLLSSFKHRKGATCNLEKNENGSYRVKISANATKTVAE 1345

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +R+PLE++++GK +    L+P+IL+LL SRDGI L+KS+Q+ET T+ILYD+Q +NVKIFG
Sbjct: 1346 LRKPLEQLMKGKTINDATLSPSILQLLLSRDGIMLIKSLQRETETHILYDRQNMNVKIFG 1405

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
             ++K+  AE +LV++LL L+E+KQLEIHLR  DLPH+LMKEVV+KFGPDLHGLKEKVP  
Sbjct: 1406 SEDKIAVAEQRLVQSLLTLHENKQLEIHLRSGDLPHDLMKEVVRKFGPDLHGLKEKVPGV 1465

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            EL LNTRRH +   G K+ KQKVE II + A  L   GL   +  GE  C ICLC+++DC
Sbjct: 1466 ELTLNTRRHVISVKGKKDLKQKVEEIIYETALPLRSGGLGQQL-SGEDTCSICLCEVEDC 1524

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            ++LEAC H FCR CL++QC+SAIKSH+GFPL CT+ GC  PIL+ADLR+LL  EKL++LF
Sbjct: 1525 FQLEACAHRFCRLCLVDQCESAIKSHDGFPLLCTYEGCKAPILIADLRHLLSSEKLEELF 1584

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVASSGG YRFCPSPDCPAVY+VA+PGT    F CGACHVETCTRCHLE+HPY+
Sbjct: 1585 RASLGAFVASSGGTYRFCPSPDCPAVYKVAEPGTSGGLFSCGACHVETCTRCHLEYHPYV 1644

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CE YK FK+DPD SLKEW KGKEQVK CP+CGYTIEKVDGCNHI C+CG HICWVCL+ 
Sbjct: 1645 SCEMYKMFKEDPDLSLKEWAKGKEQVKQCPICGYTIEKVDGCNHIACRCGVHICWVCLES 1704

Query: 181  FSTSEDCYDHLRTTHLAI 128
            F++S+DCY HLR+ HLAI
Sbjct: 1705 FNSSDDCYGHLRSVHLAI 1722


>ref|XP_002522599.1| ATP-dependent RNA helicase, putative [Ricinus communis]
            gi|223538075|gb|EEF39686.1| ATP-dependent RNA helicase,
            putative [Ricinus communis]
          Length = 1588

 Score =  940 bits (2430), Expect = 0.0
 Identities = 441/739 (59%), Positives = 575/739 (77%), Gaps = 2/739 (0%)
 Frame = -3

Query: 2335 LYHGG-PGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGI 2159
            LYHGG  G +P +AL GAG EIRHLEL  K+L+++V  S+  +L+DK +L  FE+SVSG+
Sbjct: 853  LYHGGRAGASPPVALFGAGAEIRHLELENKFLSIDVFLSDESSLNDKVILTFFEKSVSGV 912

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
               HK  G+  + +  EKWG++TFLTPEAA +A+ E N    SGS+LK+ P+   S G +
Sbjct: 913  CGVHKFAGSRLDADHVEKWGRLTFLTPEAARKAL-EFNGFNLSGSILKLSPASAAS-GHK 970

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDCV 1802
              +F AV+A++ WPR+YSKGYA V+C R +   V+ D   L I GR + CE+S K+ DC+
Sbjct: 971  VSSFAAVKAKVTWPRRYSKGYAIVRCERNEAAFVVQDCFNLLIGGRLVYCELSTKDIDCI 1030

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+D++ SE E+L +L+  T+R+I D+ L+RG+ +NNPPL  ACEEA+ KEIAPF+P+
Sbjct: 1031 VIKGLDRDTSEQEILEVLQMATNRRILDVFLIRGDTVNNPPL-GACEEAILKEIAPFMPN 1089

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            + PL N  HVQVFPPEP+D  +KA ITFDGRL+LEAA AL+HIQGKV+AGC SWQKI CQ
Sbjct: 1090 QTPLSNYCHVQVFPPEPKDTFMKAWITFDGRLHLEAAKALQHIQGKVIAGCFSWQKIWCQ 1149

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSSV CP PV+P ++RQL+SL + F H+ G  Y LERNENGSYRVK+SANATKTVA+
Sbjct: 1150 RVFHSSVSCPAPVFPFIERQLNSLLKRFTHRPGVHYSLERNENGSYRVKVSANATKTVAE 1209

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE+++ GK V  G LTP +L+LLFSRDG  L+K++QQE GTY+L+D+Q L+V+I+G
Sbjct: 1210 LRRPLEQLMNGKKVDQGRLTPAVLQLLFSRDGRFLMKTLQQEMGTYVLFDRQNLSVRIYG 1269

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
            P+ K+  AE KL+R+LL L++ KQL+I LRG  +PH+LMK+VV+KFGPDLHGLKEK PDA
Sbjct: 1270 PENKVALAEEKLIRSLLALHDKKQLDIPLRGGVMPHDLMKKVVEKFGPDLHGLKEKFPDA 1329

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
               LN +RH + FHG ++ + +VE II DFAR+L  +G ++       +CPICLC+++DC
Sbjct: 1330 VFTLNAKRHIISFHGKEDLRLRVENIIHDFARALNVNGSAEQPDLEATSCPICLCEVEDC 1389

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LEAC H FCRSCL++Q +SA++  +GFP+ C   GCG  I + DL++LL  +KL+DLF
Sbjct: 1390 YQLEACAHKFCRSCLVDQLESAMRGRDGFPVSCAREGCGVAIWLTDLKSLLPCDKLEDLF 1449

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RAS+ AFVASSGG YRFCPSPDCP+VYRVA  GT   P+VCGAC+ ETCTRCHLE+HPY+
Sbjct: 1450 RASVGAFVASSGGTYRFCPSPDCPSVYRVADTGTFGGPYVCGACYTETCTRCHLEYHPYV 1509

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERYK+FKDDPD SLK+WC+GK+ VK CPVCGY IEKVDGCNHIEC+CG HICWVC + 
Sbjct: 1510 SCERYKEFKDDPDLSLKDWCRGKDHVKSCPVCGYIIEKVDGCNHIECRCGKHICWVCSEF 1569

Query: 181  FSTSEDCYDHLRTTHLAII 125
            FS+S+DCY HLRT HLAII
Sbjct: 1570 FSSSDDCYGHLRTIHLAII 1588


>ref|XP_012069167.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Jatropha curcas] gi|802577766|ref|XP_012069168.1|
            PREDICTED: putative uncharacterized protein At4g01020,
            chloroplastic [Jatropha curcas]
            gi|643734089|gb|KDP40932.1| hypothetical protein
            JCGZ_24931 [Jatropha curcas]
          Length = 1736

 Score =  933 bits (2412), Expect = 0.0
 Identities = 441/740 (59%), Positives = 578/740 (78%), Gaps = 3/740 (0%)
 Frame = -3

Query: 2335 LYHGG-PGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGI 2159
            LYHGG  G +P +AL GAG EIRHLEL  +YL+V+V  SN   LDDK+LL  FE+SV G+
Sbjct: 999  LYHGGRAGVSPPVALFGAGAEIRHLELESRYLSVDVFLSNANGLDDKDLLKFFEKSVHGV 1058

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRG-D 1982
             SFH++ G+GQ G++ EKWG++TFLTPEAA +A+ E ND E SGSLLK+ P+  +  G +
Sbjct: 1059 CSFHRYAGSGQVGDEMEKWGRVTFLTPEAARKAL-EFNDFELSGSLLKLSPARSSVGGSN 1117

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDC 1805
            +  +F A++A++ WPR+ S+G+A V+C R D   V+ D   L I GR + CE+S K+ +C
Sbjct: 1118 KLSSFAALKAKVTWPRRNSRGHAVVRCERNDAKFVVQDCFNLLIGGRLVFCELSTKDINC 1177

Query: 1804 VLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIP 1625
            V+I G+D++ SE E+L +L+  T R+I D+ L+RG+A++NPPL +ACEEAL KEIAPF+P
Sbjct: 1178 VIIRGLDRDTSEQEILEVLQMSTKRRILDVFLIRGDAVDNPPL-SACEEALLKEIAPFMP 1236

Query: 1624 SKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQC 1445
            ++ PL N  HVQVFPP+P+D  +KA ITFDGRL+LEAA AL+HIQGKVLAGC SWQK++C
Sbjct: 1237 NQGPLSNYCHVQVFPPQPKDTYMKAYITFDGRLHLEAAKALQHIQGKVLAGCFSWQKLRC 1296

Query: 1444 QHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVA 1265
            Q +FHSSV CP  VY  ++RQL+SL + FK++ G    LERNENGSYRVKISANATKTVA
Sbjct: 1297 QQVFHSSVSCPASVYAFIERQLNSLLKRFKNRPGVCCSLERNENGSYRVKISANATKTVA 1356

Query: 1264 DMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIF 1085
            ++RRPLE+++ GK VTHG+LTP++L+LLFSR+G  L+KS+QQE GTYIL+D+  L+V+IF
Sbjct: 1357 ELRRPLEQLMNGKTVTHGSLTPSVLQLLFSREGKFLMKSLQQEMGTYILFDRHNLSVRIF 1416

Query: 1084 GPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPD 905
            GP+ +L  AE KLV++LL LN++KQ++I LRG  +PH+LMK+VV+KFGPDL GLK + PD
Sbjct: 1417 GPENRLALAEQKLVKSLLALNDNKQIDIRLRGRAMPHDLMKKVVEKFGPDLCGLKAQFPD 1476

Query: 904  AELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDD 725
               MLNTR H + F G ++ + +VEA I+DFARSL   G S    +G  +CPICLC+++D
Sbjct: 1477 TAFMLNTRHHVISFFGKEDLRLRVEATINDFARSLSVGGASKQPVDGPTSCPICLCEIED 1536

Query: 724  CYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDL 545
            CY+LE C H FCR+CL++Q +SA++ H+GFP+ C   GC   IL+ DL++LL  EKL+DL
Sbjct: 1537 CYQLEGCGHKFCRTCLVDQLESAMRGHDGFPIRCAQEGCRLHILLTDLKSLLPCEKLEDL 1596

Query: 544  FRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPY 365
            F+ASL AFVASSGG YRFCPSPDCP+VYRV+  G    PF CGAC+ ETCT+CHLE+HPY
Sbjct: 1597 FKASLGAFVASSGGTYRFCPSPDCPSVYRVSTTGMVGAPFACGACYAETCTKCHLEYHPY 1656

Query: 364  ITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLD 185
            ++CERYK+FK+DPD SL +W KGKE VK CP CG  IEKVDGCNHIEC+CG HICWVC +
Sbjct: 1657 VSCERYKEFKEDPDLSLVDWRKGKEHVKSCPECGSIIEKVDGCNHIECRCGKHICWVCSE 1716

Query: 184  CFSTSEDCYDHLRTTHLAII 125
             F++S+DCY HLR+ HLAII
Sbjct: 1717 SFNSSDDCYGHLRSIHLAII 1736


>ref|XP_007047849.1| Helicase domain-containing protein / IBR domain-containing protein /
            zinc finger protein-related, putative isoform 1
            [Theobroma cacao] gi|508700110|gb|EOX92006.1| Helicase
            domain-containing protein / IBR domain-containing protein
            / zinc finger protein-related, putative isoform 1
            [Theobroma cacao]
          Length = 1758

 Score =  923 bits (2385), Expect = 0.0
 Identities = 440/740 (59%), Positives = 569/740 (76%), Gaps = 3/740 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            L+HG  G +PS+AL GAG EI+HLE+ K+ LT++V HSNV  L+DK LLM+FE+  +G I
Sbjct: 1025 LFHG-QGASPSMALFGAGAEIKHLEVDKRCLTLDVFHSNVNDLEDKGLLMLFEKYSNGSI 1083

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD- 1982
             S HK   +G E +D EKWGKITFL P+AA +A +E++ ++F+GS LKV PS  +   D 
Sbjct: 1084 CSVHKSQASGHESDDKEKWGKITFLNPDAARKA-AELDGVDFAGSALKVLPSRTSFGADH 1142

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDC 1805
            + F+FPAV+A++CWPR+ SKG+  VKC   D   +I DFS L I G+N+RCE+S K+ D 
Sbjct: 1143 KMFSFPAVKAKVCWPRRPSKGFGIVKCDLLDIGFIIDDFSSLVIGGKNVRCEVSRKSVDA 1202

Query: 1804 VLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIP 1625
            ++I GIDKELSE E+   L+T T RKI D  LVRG+A+ NP   +ACEEAL +EI+PF+P
Sbjct: 1203 IVIYGIDKELSEAEVWDELQTATKRKIHDFFLVRGDAVENPTC-SACEEALHREISPFMP 1261

Query: 1624 SKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQC 1445
             +NP  N   VQVF PEP++  +KA+ITFDGRL+LEAA ALE ++GKVL GC SWQKI+C
Sbjct: 1262 KRNPHANCCWVQVFQPEPKESFMKALITFDGRLHLEAAKALEQLEGKVLPGCLSWQKIRC 1321

Query: 1444 QHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVA 1265
            Q +FHSS+ C   VY ++++QLDSL  SF+H  G    LE N NGSYRV+ISANATKTVA
Sbjct: 1322 QQLFHSSISCSSSVYAVIRKQLDSLLASFRHLKGAGCYLEANGNGSYRVRISANATKTVA 1381

Query: 1264 DMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIF 1085
            ++RRP+E+++ GK V H +LTP+IL+ LFSRDGI+ ++S+QQETGTYI +D+  LN++IF
Sbjct: 1382 ELRRPVEELMNGKTVKHASLTPSILQHLFSRDGINQMRSLQQETGTYIFFDRHSLNIRIF 1441

Query: 1084 GPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPD 905
            G  +    A+ KL+++LLL +E KQLE+ LRG  LP +LMKEVVKKFGPDLHGLKEK+P 
Sbjct: 1442 GSPDNAAVAQQKLIQSLLLYHESKQLEVKLRGRGLPPDLMKEVVKKFGPDLHGLKEKIPG 1501

Query: 904  AELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDD 725
            AE  L+TR H +   GDKE K+KVE I+ +   +     L++   + E  CPICLC+++D
Sbjct: 1502 AEFALSTRHHVISIRGDKEMKRKVEEIVLEIVET--GKHLAER-SDSEVTCPICLCEVED 1558

Query: 724  CYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDL 545
             Y+LE C H FCR CL+EQC+SAIK+ + FP+CC + GC  PIL+ DL++LL  EKL++L
Sbjct: 1559 GYQLEGCSHFFCRLCLVEQCESAIKNLDSFPICCAYQGCKAPILLTDLKSLLSTEKLEEL 1618

Query: 544  FRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPY 365
            FRASL AFVASS G YRFCPSPDCP+VYRVA P T  EPFVCGAC+ ETC +CHLE+HPY
Sbjct: 1619 FRASLGAFVASSRGTYRFCPSPDCPSVYRVADPETFGEPFVCGACYAETCIKCHLEYHPY 1678

Query: 364  ITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLD 185
            ++CE+YK+FK+DPDSSLKEWCKGKEQVK CPVCGYT+EK+DGCNH+ECKCG H+CWVCL+
Sbjct: 1679 LSCEKYKEFKEDPDSSLKEWCKGKEQVKTCPVCGYTVEKIDGCNHVECKCGRHVCWVCLE 1738

Query: 184  CFSTSEDCYDHLRTTHLAII 125
             FS+S+DCY HLR  H+AII
Sbjct: 1739 FFSSSDDCYGHLRAVHMAII 1758


>gb|KHG13119.1| hypothetical protein F383_07330 [Gossypium arboreum]
          Length = 1760

 Score =  920 bits (2379), Expect = 0.0
 Identities = 446/741 (60%), Positives = 566/741 (76%), Gaps = 4/741 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFER-SVSGI 2159
            L+HG   + PS+AL GAG EI+HLE+ K+YL V+V HSN+ A+DDKELLM FE+ S  GI
Sbjct: 1026 LFHGRSAS-PSMALFGAGAEIKHLEVDKRYLAVDVFHSNLNAIDDKELLMFFEKHSNGGI 1084

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD- 1982
             S HK   NGQE +D EKWGKI FLTP+AA +A SE++ ++FSGS LKV PS  +  GD 
Sbjct: 1085 CSAHKSQANGQEIDDKEKWGKIIFLTPDAARKA-SELDGVDFSGSALKVLPSQTSFGGDH 1143

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSG-LEIQGRNIRCEISKN-ED 1808
            + F+FP V+A+L WPR+ SKG   VKC R D   ++ DFS  L I G+ + CE+S+  +D
Sbjct: 1144 KMFSFPPVKAKLSWPRRLSKGIGIVKCDRLDVQNILYDFSSRLVIAGKYVNCEVSRKCDD 1203

Query: 1807 CVLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFI 1628
             VLI GIDKELSE E+  IL + T+R+I D  LVRG+A+ NP    ACEEAL +EI+PF+
Sbjct: 1204 SVLIYGIDKELSEAEVRDILHSATEREIHDFFLVRGDAVENPTC-GACEEALWREISPFM 1262

Query: 1627 PSKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQ 1448
            P  NP  N   VQVF PEP++  +KA+ITFDGRL+LEAA ALE ++GKVL GC SWQKI+
Sbjct: 1263 PKGNPYTNCCWVQVFEPEPKETFMKALITFDGRLHLEAAKALEQLEGKVLPGCLSWQKIR 1322

Query: 1447 CQHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTV 1268
            CQ +FHSS+ C   VY ++K+QLDSL  SF+H  G    LE NENGS RV+ISANATKTV
Sbjct: 1323 CQQLFHSSISCSSFVYAVIKKQLDSLLASFRHVKGADCFLETNENGSCRVRISANATKTV 1382

Query: 1267 ADMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKI 1088
            A++RRPLE+++ G+ V H +LTP+IL+ L SRDGI+L++S+Q+ET TYIL+++  LN++I
Sbjct: 1383 AELRRPLEELMNGRTVKHASLTPSILQHLISRDGINLMRSLQRETRTYILFNRHSLNIRI 1442

Query: 1087 FGPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVP 908
            FG ++    A+ KL+++LL  +E KQLE+ LRG  LP ++MKEVVKKFGPDLHGLKEK+P
Sbjct: 1443 FGSRDDAAVAQQKLMQSLLSYHESKQLEVRLRGRGLPPDMMKEVVKKFGPDLHGLKEKIP 1502

Query: 907  DAELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLD 728
             AE  LNTR H +   G+KE KQKVE I+   A +  D  +     + E +CPICLC+++
Sbjct: 1503 GAEFTLNTRHHIISICGNKEMKQKVEEIVLQIAEAGRDLAVRS---DSEVSCPICLCEVE 1559

Query: 727  DCYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDD 548
            D Y+LE C H FCRSCLL+QC+SAIK+ + FPLCC   GC  PIL+ DL++LL  EKL++
Sbjct: 1560 DGYRLEGCSHFFCRSCLLKQCESAIKNLDSFPLCCAQQGCKAPILLTDLKSLLSTEKLEE 1619

Query: 547  LFRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHP 368
            LFRASL AFV SSGG YRFCPSPDCP+VYRVA P T  EPFVCGAC+ ETCTRCHLE+HP
Sbjct: 1620 LFRASLGAFVVSSGGAYRFCPSPDCPSVYRVAGPETFGEPFVCGACYAETCTRCHLEYHP 1679

Query: 367  YITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCL 188
            Y++CE+Y++FK+DPD SLKEWCKGKEQVK CPVCGYTIEK+DGCNH+ECKCG H+CWVCL
Sbjct: 1680 YLSCEKYREFKEDPDLSLKEWCKGKEQVKTCPVCGYTIEKIDGCNHVECKCGRHVCWVCL 1739

Query: 187  DCFSTSEDCYDHLRTTHLAII 125
            + FS+S+DCY HLR  H+AII
Sbjct: 1740 EFFSSSDDCYGHLRAVHMAII 1760


>ref|XP_012455164.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Gossypium raimondii] gi|763804280|gb|KJB71218.1|
            hypothetical protein B456_011G111000 [Gossypium
            raimondii]
          Length = 1760

 Score =  915 bits (2366), Expect = 0.0
 Identities = 441/741 (59%), Positives = 563/741 (75%), Gaps = 4/741 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFER-SVSGI 2159
            L+HG   + P +AL GAG EI+HLE+ K+YL V+V HSN+ A+DDKELLM FE+ S  GI
Sbjct: 1026 LFHGRSAS-PCMALFGAGAEIKHLEVDKRYLAVDVFHSNLNAIDDKELLMFFEKHSNGGI 1084

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD- 1982
             S HK   NGQE +D EKWGKI FLTP+AA +A +E++ +EFSGS LKV PS  +  GD 
Sbjct: 1085 CSVHKSQANGQEIDDKEKWGKIMFLTPDAARKA-AELDGVEFSGSALKVLPSQTSFGGDH 1143

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSG-LEIQGRNIRCEISKN-ED 1808
            + F+FP V+A+L WPR+ SKG   V+C R D   ++ DFS  L I G+ + C +S+  +D
Sbjct: 1144 KMFSFPPVKAKLSWPRRLSKGIGIVRCDRLDVPDILYDFSSRLVIAGKYVNCGVSRKCDD 1203

Query: 1807 CVLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFI 1628
             V+I GIDKELSE E+   L + T+R+I D  +VRG+A+ NP    ACEEAL +EI+PF+
Sbjct: 1204 SVVIYGIDKELSEAEIWDTLHSATEREIHDFFIVRGDAVKNPTC-GACEEALWREISPFM 1262

Query: 1627 PSKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQ 1448
            P  NP  N   VQVF PEP++  +KA+ITFDGRL+LEAA ALE ++GKVL GC SWQKI+
Sbjct: 1263 PKGNPYTNCCWVQVFEPEPKETFMKALITFDGRLHLEAAKALEQLEGKVLPGCLSWQKIR 1322

Query: 1447 CQHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTV 1268
            CQ +FHSS+ C   VY ++K+QLDSL  SF+H  G    LE NENGS RV+ISANATKTV
Sbjct: 1323 CQQLFHSSISCSSSVYAVIKKQLDSLLASFRHVKGADCFLETNENGSCRVRISANATKTV 1382

Query: 1267 ADMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKI 1088
            A++RRP+E+++ G+ V H +LTP+IL+ LFSRDGI+L++S+Q+ET TYIL+D+  LN++I
Sbjct: 1383 AELRRPVEELMNGRTVKHASLTPSILQHLFSRDGINLMRSLQRETRTYILFDRHSLNIRI 1442

Query: 1087 FGPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVP 908
            FG  +    A+ KL+++LL  +E KQLE+ LRG  LP ++MKEVVKKFGPDLHGLKEK+P
Sbjct: 1443 FGLPDDAAVAQQKLMQSLLSYHESKQLEVRLRGRGLPPDMMKEVVKKFGPDLHGLKEKIP 1502

Query: 907  DAELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLD 728
             AE  LNTR H +   G+KE KQKVE I+   A +  D  +     + E +CPICLC+++
Sbjct: 1503 GAEFTLNTRHHIISICGNKEMKQKVEEIVLQIAEAGRDLAVRS---DSEVSCPICLCEVE 1559

Query: 727  DCYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDD 548
            D Y+LE C H FCRSCL+EQC+SAIK+ + FPLCC   GC  PIL+ DL++LL  EKL++
Sbjct: 1560 DGYRLEGCSHFFCRSCLVEQCESAIKNLDSFPLCCAQQGCKAPILLTDLKSLLSTEKLEE 1619

Query: 547  LFRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHP 368
            LFRASL AFV SSGG YRFCPSPDCP+VYRVA P T  EPFVCGAC+ ETCTRCHLE+HP
Sbjct: 1620 LFRASLGAFVVSSGGAYRFCPSPDCPSVYRVAGPETVGEPFVCGACYAETCTRCHLEYHP 1679

Query: 367  YITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCL 188
            Y++CE+Y++FK+DPD SLKEWCKGKEQVK CPVCGYTIEK+DGCNH+ECKCG H+CWVCL
Sbjct: 1680 YLSCEKYREFKEDPDMSLKEWCKGKEQVKTCPVCGYTIEKIDGCNHVECKCGRHVCWVCL 1739

Query: 187  DCFSTSEDCYDHLRTTHLAII 125
            + FS+S+DCY HLR  H+AII
Sbjct: 1740 EFFSSSDDCYGHLRAVHMAII 1760


>gb|AGL44347.1| helicase/plant I subfamily protein [Glycine max]
          Length = 1562

 Score =  911 bits (2354), Expect = 0.0
 Identities = 437/738 (59%), Positives = 563/738 (76%), Gaps = 2/738 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DDKELLM FE++ SG I
Sbjct: 834  LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDKELLMFFEKNTSGCI 892

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++ ED +KWG+ITF++P+   RA +E++  EF GS LKV PS     GD+
Sbjct: 893  CAVHKFTGNTRD-EDRDKWGRITFMSPDIVRRA-AELDGREFCGSSLKVVPSQLG--GDK 948

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D D ++ DF  L + GR +RCE+ K   D V
Sbjct: 949  TFSFPAVKARISWPRRLSRGFAIVKCDIKDVDYILRDFYNLAVGGRYVRCEVGKKSMDSV 1008

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E+  +LRT T R+I D  LVRGEA+ NPP  +A EEAL KEI PF+P 
Sbjct: 1009 VINGLDKELSEAEISDVLRTATTRRILDFFLVRGEAVGNPPC-SALEEALLKEIYPFLPK 1067

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP  +   VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 1068 RNPHISPCRVQVFAPEPKDAFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 1127

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L+R  NGS+RVKI+ANAT+TVA+
Sbjct: 1128 QLFHSSLTFPTPVYRVIKEQLDEVLASFRNLKGLECNLDRTFNGSHRVKITANATRTVAE 1187

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP +L+L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 1188 VRRPLEELLRGKTIEHDSLTPAVLQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 1247

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
                +  A+ K++++LL L+E+KQLEIHLRG DLP +LMK+++K FGPDLHGLKE+VP  
Sbjct: 1248 SPNMVALAQEKVIQSLLSLHEEKQLEIHLRGRDLPPDLMKQMIKNFGPDLHGLKERVPGV 1307

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LN RRH ++ HG KE K +VE I+ + ARS     L +    G P+CPICLC+++D 
Sbjct: 1308 DLTLNIRRHIIILHGSKELKPRVEEIVFEIARS--SHHLVERFGNG-PSCPICLCEVEDG 1364

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAIK+   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1365 YRLEGCGHLFCRMCLVEQFESAIKNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1424

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA PG+  EPFVC AC+ ETCTRCHLE+HPY+
Sbjct: 1425 RASLGAFVATSGGTYRFCPSPDCPSIYRVADPGSAGEPFVCRACYSETCTRCHLEYHPYL 1484

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERYK+FK+DPDSSL EWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1485 SCERYKEFKEDPDSSLIEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1544

Query: 181  FSTSEDCYDHLRTTHLAI 128
            FSTS DCYDHLRT HL I
Sbjct: 1545 FSTSNDCYDHLRTIHLTI 1562


>ref|XP_003552808.1| PREDICTED: putative uncharacterized protein At4g01020,
            chloroplastic-like [Glycine max]
            gi|947048015|gb|KRG97543.1| hypothetical protein
            GLYMA_18G014800 [Glycine max]
          Length = 1729

 Score =  911 bits (2354), Expect = 0.0
 Identities = 437/738 (59%), Positives = 563/738 (76%), Gaps = 2/738 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DDKELLM FE++ SG I
Sbjct: 1001 LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDKELLMFFEKNTSGCI 1059

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++ ED +KWG+ITF++P+   RA +E++  EF GS LKV PS     GD+
Sbjct: 1060 CAVHKFTGNTRD-EDRDKWGRITFMSPDIVRRA-AELDGREFCGSSLKVVPSQLG--GDK 1115

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D D ++ DF  L + GR +RCE+ K   D V
Sbjct: 1116 TFSFPAVKARISWPRRLSRGFAIVKCDIKDVDYILRDFYNLAVGGRYVRCEVGKKSMDSV 1175

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E+  +LRT T R+I D  LVRGEA+ NPP  +A EEAL KEI PF+P 
Sbjct: 1176 VINGLDKELSEAEISDVLRTATTRRILDFFLVRGEAVGNPPC-SALEEALLKEIYPFLPK 1234

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP  +   VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 1235 RNPHISPCRVQVFAPEPKDAFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 1294

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L+R  NGS+RVKI+ANAT+TVA+
Sbjct: 1295 QLFHSSLTFPTPVYRVIKEQLDEVLASFRNLKGLECNLDRTFNGSHRVKITANATRTVAE 1354

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP +L+L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 1355 VRRPLEELLRGKTIEHDSLTPAVLQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 1414

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
                +  A+ K++++LL L+E+KQLEIHLRG DLP +LMK+++K FGPDLHGLKE+VP  
Sbjct: 1415 SPNMVALAQEKVIQSLLSLHEEKQLEIHLRGRDLPPDLMKQMIKNFGPDLHGLKERVPGV 1474

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LN RRH ++ HG KE K +VE I+ + ARS     L +    G P+CPICLC+++D 
Sbjct: 1475 DLTLNIRRHIIILHGSKELKPRVEEIVFEIARS--SHHLVERFGNG-PSCPICLCEVEDG 1531

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAIK+   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1532 YRLEGCGHLFCRMCLVEQFESAIKNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1591

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA PG+  EPFVC AC+ ETCTRCHLE+HPY+
Sbjct: 1592 RASLGAFVATSGGTYRFCPSPDCPSIYRVADPGSAGEPFVCRACYSETCTRCHLEYHPYL 1651

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERYK+FK+DPDSSL EWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1652 SCERYKEFKEDPDSSLIEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1711

Query: 181  FSTSEDCYDHLRTTHLAI 128
            FSTS DCYDHLRT HL I
Sbjct: 1712 FSTSNDCYDHLRTIHLTI 1729


>gb|KHN31399.1| Hypothetical protein glysoja_023053 [Glycine soja]
          Length = 1707

 Score =  910 bits (2351), Expect = 0.0
 Identities = 436/738 (59%), Positives = 563/738 (76%), Gaps = 2/738 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DD+ELLM FE++ SG I
Sbjct: 979  LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDRELLMFFEKNTSGCI 1037

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++ ED +KWG+ITF++P+   RA +E++  EF GS LKV PS     GD+
Sbjct: 1038 CAVHKFTGNTRD-EDRDKWGRITFMSPDIVRRA-AELDGREFCGSSLKVVPSQLG--GDK 1093

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D D ++ DF  L + GR +RCE+ K   D V
Sbjct: 1094 TFSFPAVKARISWPRRLSRGFAIVKCDIKDVDYILRDFYNLAVGGRYVRCEVGKKSMDSV 1153

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E+  +LRT T R+I D  LVRGEA+ NPP  +A EEAL KEI PF+P 
Sbjct: 1154 VINGLDKELSEAEISDVLRTATTRRILDFFLVRGEAVGNPPC-SALEEALLKEIYPFLPK 1212

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP  +   VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 1213 RNPHISPCRVQVFAPEPKDAFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 1272

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L+R  NGS+RVKI+ANAT+TVA+
Sbjct: 1273 QLFHSSLTFPTPVYRVIKEQLDEVLASFRNLKGLECNLDRTFNGSHRVKITANATRTVAE 1332

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP +L+L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 1333 VRRPLEELLRGKTIEHDSLTPAVLQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 1392

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
                +  A+ K++++LL L+E+KQLEIHLRG DLP +LMK+++K FGPDLHGLKE+VP  
Sbjct: 1393 SPNMVALAQEKVIQSLLSLHEEKQLEIHLRGRDLPPDLMKQMIKNFGPDLHGLKERVPGV 1452

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LN RRH ++ HG KE K +VE I+ + ARS     L +    G P+CPICLC+++D 
Sbjct: 1453 DLTLNIRRHIIILHGSKELKPRVEEIVFEIARS--SHHLVERFGNG-PSCPICLCEVEDG 1509

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAIK+   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1510 YRLEGCGHLFCRMCLVEQFESAIKNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1569

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA PG+  EPFVC AC+ ETCTRCHLE+HPY+
Sbjct: 1570 RASLGAFVATSGGTYRFCPSPDCPSIYRVADPGSAGEPFVCRACYSETCTRCHLEYHPYL 1629

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERYK+FK+DPDSSL EWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1630 SCERYKEFKEDPDSSLIEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1689

Query: 181  FSTSEDCYDHLRTTHLAI 128
            FSTS DCYDHLRT HL I
Sbjct: 1690 FSTSNDCYDHLRTIHLTI 1707


>ref|XP_010926340.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Elaeis guineensis]
          Length = 1736

 Score =  903 bits (2334), Expect = 0.0
 Identities = 426/734 (58%), Positives = 569/734 (77%), Gaps = 1/734 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGIS 2156
            L+ G PG++  +AL G+G EI+HLEL K++LTVE+SH N  A+DDKE+L+M ++ VSGI+
Sbjct: 1007 LFPGRPGSSLPVALFGSGAEIKHLELEKRHLTVEISHPNAHAVDDKEVLLMVDQCVSGIA 1066

Query: 2155 SFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDRY 1976
            ++HK+ GNG EG D  KWGKITFL+P AAE AV+++N++EF GSLLK  P    +  ++ 
Sbjct: 1067 NYHKYAGNGPEGTD--KWGKITFLSPAAAENAVAKLNEVEFHGSLLKALP--VRAVDNKL 1122

Query: 1975 FTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDCVL 1799
              F AVRAR+CWPR+ SKG A + CA  + + ++ D   L + GR + C++S K ++CV 
Sbjct: 1123 LPFSAVRARVCWPRRPSKGAALITCAGGEAEFIVRDCFALVVGGRYVNCQVSTKYKNCVF 1182

Query: 1798 IVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPSK 1619
            + G+ +++SE EL     + T+RKI D+HL+RGE I NPP  A C EAL +EI+ F+P K
Sbjct: 1183 VTGLPRDVSETELYDAFLSSTERKILDIHLLRGEPIPNPP-GATCREALVREISAFMPKK 1241

Query: 1618 NPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQH 1439
            N   ++F ++VF PEP+DYM+KA+ITFDG L+LEAA AL+HIQGKVL GC SWQKI+C+H
Sbjct: 1242 NFRDHSFQIEVFNPEPKDYMMKAIITFDGGLHLEAAKALDHIQGKVLPGCLSWQKIRCEH 1301

Query: 1438 IFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVADM 1259
            +FHS + CP  VY ++K+QLDSL +SF+ Q G SY LE+N+NGS RVKISANATKT+AD+
Sbjct: 1302 VFHSHLSCPARVYFVIKKQLDSLLESFQQQKGVSYNLEKNDNGSCRVKISANATKTIADL 1361

Query: 1258 RRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFGP 1079
            RRPLE++++GK ++H +LTPT+L+LLFSRDG++LLK+V++++GTYILYD+Q LNVK+FGP
Sbjct: 1362 RRPLEQLMKGKTISHPSLTPTVLQLLFSRDGVALLKAVERKSGTYILYDRQNLNVKVFGP 1421

Query: 1078 QEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDAE 899
             +++  AE  LV++LL L+ED+QLEI LRG ++P NLMKEVV++FGPDL GLKE VP AE
Sbjct: 1422 PKEVAAAEQNLVQSLLSLHEDRQLEIRLRGRNIPPNLMKEVVQRFGPDLQGLKEMVPGAE 1481

Query: 898  LMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDCY 719
            L LNTR H +   G+ E K++VE +I + A S+ D       P G  +CPICLC+L++ Y
Sbjct: 1482 LTLNTRSHIINVRGNNELKRRVEEVISEVALSV-DHAWMIKQPSG-TSCPICLCELEEPY 1539

Query: 718  KLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLFR 539
            +LEAC H FCRSCL++Q +S I+S + FP+ CT  GC + IL+ DLR+LL  EK+++LFR
Sbjct: 1540 RLEACGHDFCRSCLVDQLESTIRSRDSFPIGCTKEGCNELILLVDLRSLLPSEKMEELFR 1599

Query: 538  ASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYIT 359
            ASL AFVAS GG YRFCPSPDCP+VY+VA     A  FVCGAC VETCT+CHLE+HP+I+
Sbjct: 1600 ASLGAFVASRGGAYRFCPSPDCPSVYQVAPKDAEAGHFVCGACSVETCTKCHLEYHPFIS 1659

Query: 358  CERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDCF 179
            CERYK++K+DPD SL EW KGKE +K CP CGYTIEK+DGCNHIECKCG HICWVCL+ F
Sbjct: 1660 CERYKEYKEDPDLSLVEWRKGKEYIKDCPACGYTIEKIDGCNHIECKCGRHICWVCLEFF 1719

Query: 178  STSEDCYDHLRTTH 137
             +S++CY HLR+ H
Sbjct: 1720 RSSDECYGHLRSEH 1733


>gb|KRH31335.1| hypothetical protein GLYMA_11G242300 [Glycine max]
          Length = 1256

 Score =  902 bits (2332), Expect = 0.0
 Identities = 433/739 (58%), Positives = 562/739 (76%), Gaps = 2/739 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DD+ELLM FE++ SG I
Sbjct: 527  LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDRELLMFFEKNTSGCI 585

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++G D +KWG+I F++P+   RA +E++  EF GS LK+ PS      D+
Sbjct: 586  CAVHKFTGNMRDG-DRDKWGRIIFMSPDVVRRA-AELDGQEFCGSSLKIVPSQLG--WDK 641

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D + ++ DF  L + GR +RCEI K   D V
Sbjct: 642  TFSFPAVKARISWPRRLSRGFAIVKCDIKDVNYILRDFYNLAVGGRYVRCEIGKKSIDSV 701

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E++ +LRT T R+I D  LVRG+A  NPP  +A EEAL KEI PF+P 
Sbjct: 702  VINGLDKELSEAEIVDVLRTATSRRILDFFLVRGDAAGNPPC-SALEEALLKEIYPFLPK 760

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP      VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 761  RNPHIIPCRVQVFAPEPKDSFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 820

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L R  NGS+RVKI+ANAT+TVA+
Sbjct: 821  QLFHSSIIFPTPVYHVIKEQLDEVLASFRNLKGLECNLGRTVNGSHRVKITANATRTVAE 880

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP + +L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 881  VRRPLEELLRGKTIEHDSLTPVVFQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 940

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
               K+  A+ K++++LL L+E+KQLEIHLRG+DLP +LMK+++K FGPDL GLKE+VP  
Sbjct: 941  SPNKVALAQEKVIQSLLSLHEEKQLEIHLRGMDLPPDLMKQMIKNFGPDLRGLKERVPGV 1000

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LNTRRH V+ HG KE K +VE II + ARS     L +    G P+CPICLC+++D 
Sbjct: 1001 DLTLNTRRHIVILHGSKELKPRVEEIIFEIARS--SHHLVERFENG-PSCPICLCEVEDG 1057

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAI +   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1058 YRLEGCGHLFCRLCLVEQFESAINNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1117

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA P +  EPFVCG+C+ ETCTRCHLE+HPY+
Sbjct: 1118 RASLGAFVATSGGAYRFCPSPDCPSIYRVADPESAGEPFVCGSCYSETCTRCHLEYHPYL 1177

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERY++FK+DPDSSLKEWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1178 SCERYQEFKEDPDSSLKEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1237

Query: 181  FSTSEDCYDHLRTTHLAII 125
            FSTS DCY+HLRT HLAII
Sbjct: 1238 FSTSNDCYNHLRTIHLAII 1256


>gb|KRH31334.1| hypothetical protein GLYMA_11G242300 [Glycine max]
          Length = 1699

 Score =  902 bits (2332), Expect = 0.0
 Identities = 433/739 (58%), Positives = 562/739 (76%), Gaps = 2/739 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DD+ELLM FE++ SG I
Sbjct: 970  LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDRELLMFFEKNTSGCI 1028

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++G D +KWG+I F++P+   RA +E++  EF GS LK+ PS      D+
Sbjct: 1029 CAVHKFTGNMRDG-DRDKWGRIIFMSPDVVRRA-AELDGQEFCGSSLKIVPSQLG--WDK 1084

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D + ++ DF  L + GR +RCEI K   D V
Sbjct: 1085 TFSFPAVKARISWPRRLSRGFAIVKCDIKDVNYILRDFYNLAVGGRYVRCEIGKKSIDSV 1144

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E++ +LRT T R+I D  LVRG+A  NPP  +A EEAL KEI PF+P 
Sbjct: 1145 VINGLDKELSEAEIVDVLRTATSRRILDFFLVRGDAAGNPPC-SALEEALLKEIYPFLPK 1203

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP      VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 1204 RNPHIIPCRVQVFAPEPKDSFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 1263

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L R  NGS+RVKI+ANAT+TVA+
Sbjct: 1264 QLFHSSIIFPTPVYHVIKEQLDEVLASFRNLKGLECNLGRTVNGSHRVKITANATRTVAE 1323

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP + +L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 1324 VRRPLEELLRGKTIEHDSLTPVVFQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 1383

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
               K+  A+ K++++LL L+E+KQLEIHLRG+DLP +LMK+++K FGPDL GLKE+VP  
Sbjct: 1384 SPNKVALAQEKVIQSLLSLHEEKQLEIHLRGMDLPPDLMKQMIKNFGPDLRGLKERVPGV 1443

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LNTRRH V+ HG KE K +VE II + ARS     L +    G P+CPICLC+++D 
Sbjct: 1444 DLTLNTRRHIVILHGSKELKPRVEEIIFEIARS--SHHLVERFENG-PSCPICLCEVEDG 1500

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAI +   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1501 YRLEGCGHLFCRLCLVEQFESAINNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1560

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA P +  EPFVCG+C+ ETCTRCHLE+HPY+
Sbjct: 1561 RASLGAFVATSGGAYRFCPSPDCPSIYRVADPESAGEPFVCGSCYSETCTRCHLEYHPYL 1620

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERY++FK+DPDSSLKEWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1621 SCERYQEFKEDPDSSLKEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1680

Query: 181  FSTSEDCYDHLRTTHLAII 125
            FSTS DCY+HLRT HLAII
Sbjct: 1681 FSTSNDCYNHLRTIHLAII 1699


>ref|XP_003537562.1| PREDICTED: putative uncharacterized protein At4g01020,
            chloroplastic-like [Glycine max]
            gi|947082544|gb|KRH31333.1| hypothetical protein
            GLYMA_11G242300 [Glycine max]
          Length = 1736

 Score =  902 bits (2332), Expect = 0.0
 Identities = 433/739 (58%), Positives = 562/739 (76%), Gaps = 2/739 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG  G +P +AL G+G EI+HLEL K+ L+V+V H N+  +DD+ELLM FE++ SG I
Sbjct: 1007 LYHGS-GFSPPVALFGSGAEIKHLELEKRSLSVDVCHPNINEIDDRELLMFFEKNTSGCI 1065

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDR 1979
             + HK  GN ++G D +KWG+I F++P+   RA +E++  EF GS LK+ PS      D+
Sbjct: 1066 CAVHKFTGNMRDG-DRDKWGRIIFMSPDVVRRA-AELDGQEFCGSSLKIVPSQLG--WDK 1121

Query: 1978 YFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCV 1802
             F+FPAV+AR+ WPR+ S+G+A VKC  +D + ++ DF  L + GR +RCEI K   D V
Sbjct: 1122 TFSFPAVKARISWPRRLSRGFAIVKCDIKDVNYILRDFYNLAVGGRYVRCEIGKKSIDSV 1181

Query: 1801 LIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPS 1622
            +I G+DKELSE E++ +LRT T R+I D  LVRG+A  NPP  +A EEAL KEI PF+P 
Sbjct: 1182 VINGLDKELSEAEIVDVLRTATSRRILDFFLVRGDAAGNPPC-SALEEALLKEIYPFLPK 1240

Query: 1621 KNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQ 1442
            +NP      VQVF PEP+D  ++A+ITFDGRL+LEAA ALE I+GKVL GC SWQKI+CQ
Sbjct: 1241 RNPHIIPCRVQVFAPEPKDSFMRALITFDGRLHLEAAKALEQIEGKVLPGCLSWQKIKCQ 1300

Query: 1441 HIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVAD 1262
             +FHSS+  P PVY ++K QLD +  SF++  G    L R  NGS+RVKI+ANAT+TVA+
Sbjct: 1301 QLFHSSIIFPTPVYHVIKEQLDEVLASFRNLKGLECNLGRTVNGSHRVKITANATRTVAE 1360

Query: 1261 MRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFG 1082
            +RRPLE++L+GK + H +LTP + +L+ SRDG SL  S+QQETGTYIL+D+  LN+++FG
Sbjct: 1361 VRRPLEELLRGKTIEHDSLTPVVFQLMLSRDGFSLKNSLQQETGTYILFDRHNLNLRVFG 1420

Query: 1081 PQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDA 902
               K+  A+ K++++LL L+E+KQLEIHLRG+DLP +LMK+++K FGPDL GLKE+VP  
Sbjct: 1421 SPNKVALAQEKVIQSLLSLHEEKQLEIHLRGMDLPPDLMKQMIKNFGPDLRGLKERVPGV 1480

Query: 901  ELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDC 722
            +L LNTRRH V+ HG KE K +VE II + ARS     L +    G P+CPICLC+++D 
Sbjct: 1481 DLTLNTRRHIVILHGSKELKPRVEEIIFEIARS--SHHLVERFENG-PSCPICLCEVEDG 1537

Query: 721  YKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLF 542
            Y+LE C H FCR CL+EQ +SAI +   FP+CCTH  CG PIL+ DLR+LL  +KL+DLF
Sbjct: 1538 YRLEGCGHLFCRLCLVEQFESAINNQGTFPVCCTHRDCGDPILLTDLRSLLFGDKLEDLF 1597

Query: 541  RASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYI 362
            RASL AFVA+SGG YRFCPSPDCP++YRVA P +  EPFVCG+C+ ETCTRCHLE+HPY+
Sbjct: 1598 RASLGAFVATSGGAYRFCPSPDCPSIYRVADPESAGEPFVCGSCYSETCTRCHLEYHPYL 1657

Query: 361  TCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDC 182
            +CERY++FK+DPDSSLKEWC+GKEQVKCC  CGY IEKVDGCNH+ECKCG H+CWVCL+ 
Sbjct: 1658 SCERYQEFKEDPDSSLKEWCRGKEQVKCCSACGYVIEKVDGCNHVECKCGKHVCWVCLEF 1717

Query: 181  FSTSEDCYDHLRTTHLAII 125
            FSTS DCY+HLRT HLAII
Sbjct: 1718 FSTSNDCYNHLRTIHLAII 1736


>ref|XP_006465847.1| PREDICTED: putative uncharacterized protein At4g01020,
            chloroplastic-like [Citrus sinensis]
            gi|568823753|ref|XP_006466273.1| PREDICTED: putative
            uncharacterized protein At4g01020, chloroplastic-like
            [Citrus sinensis] gi|568885200|ref|XP_006495187.1|
            PREDICTED: putative uncharacterized protein At4g01020,
            chloroplastic-like [Citrus sinensis]
          Length = 1730

 Score =  902 bits (2330), Expect = 0.0
 Identities = 433/736 (58%), Positives = 564/736 (76%), Gaps = 3/736 (0%)
 Frame = -3

Query: 2323 GPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-ISSFH 2147
            G G +PS+AL GAG EI+HLEL +++LTV+V HSN   LDDKELLM  E++ SG I S H
Sbjct: 1001 GAGVSPSVALFGAGAEIKHLELERRFLTVDVYHSNANILDDKELLMFLEKNASGSICSIH 1060

Query: 2146 KHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD-RYFT 1970
            K    GQ+ ++ +KWG++TFLTP+ A +A +E+N +E++GSLLKV PS     GD + +T
Sbjct: 1061 KF-AVGQDSDEKDKWGRVTFLTPDTAGKA-TELNGVEYNGSLLKVVPSRATLGGDNKMYT 1118

Query: 1969 FPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCVLIV 1793
            FPAV+A++ WPR+ SKG+A VKC   D + ++ DF  L I GR +RCEI +   D V+I 
Sbjct: 1119 FPAVKAKVYWPRRLSKGFAVVKCDATDVEFLVKDFFDLAIGGRYVRCEIGRRSMDSVVIS 1178

Query: 1792 GIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPSKNP 1613
            G+DKELSE E+L  LR  T R+I DL LVRG+A+  P  + A EEAL +EI+ F+P +N 
Sbjct: 1179 GLDKELSEDEILGELRKVTTRRIRDLFLVRGDAVECPQ-FDAFEEALLREISRFMPKRNS 1237

Query: 1612 LGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQHIF 1433
              N   VQVFPPEP+D  +KA ITFDGRL+LEAA ALE ++GKVL GC  WQK++CQ +F
Sbjct: 1238 HANCCRVQVFPPEPKDAFMKAFITFDGRLHLEAAKALEQLEGKVLPGCGPWQKMKCQQLF 1297

Query: 1432 HSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVADMRR 1253
            HSS+ CP  VY ++K +L+SL  +    NG   ++ERN NGSYRV+IS+NATKTVAD+RR
Sbjct: 1298 HSSLSCPASVYSVIKEELNSLLATLNRVNGAECVVERNYNGSYRVRISSNATKTVADLRR 1357

Query: 1252 PLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFGPQE 1073
            P+E +++G+ V H +LTPTIL+ LF+RDGI+L KS+QQET T+IL+D+  L+VKIFG  +
Sbjct: 1358 PVEVLMRGRTVNHASLTPTILQHLFTRDGINLRKSLQQETRTFILFDRHTLSVKIFGAPD 1417

Query: 1072 KLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDAELM 893
             +  A+ KL+++LL  +E KQLEIHLRG  LP +LMKEVV++FGPDL GLKEKVP AE  
Sbjct: 1418 NIAEAQQKLIQSLLTYHESKQLEIHLRGGVLPPDLMKEVVRRFGPDLQGLKEKVPGAEFS 1477

Query: 892  LNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDCYKL 713
            LNTRRH +  HGD+E KQKVE II + A++    G ++ +   E +CPICLC+L++ Y+L
Sbjct: 1478 LNTRRHVISVHGDRELKQKVEEIIYEIAQT--SDGSAERL-HSEASCPICLCELEESYRL 1534

Query: 712  EACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLFRAS 533
            E C H FCRSCL+EQC+SAIK+ + FP+ C H+GC   IL+ DLR+LL  EKL++LFRAS
Sbjct: 1535 EGCTHLFCRSCLVEQCESAIKNMDSFPIRCAHSGCKALILLTDLRSLLSNEKLEELFRAS 1594

Query: 532  LSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYITCE 353
            L A+VASSGG YRFCPSPDCP+VYRVA+PGT  EPF CGAC+ ETCT CHLE HPY++CE
Sbjct: 1595 LGAYVASSGGTYRFCPSPDCPSVYRVAEPGTAGEPFFCGACYAETCTMCHLEHHPYLSCE 1654

Query: 352  RYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDCFST 173
            +Y++FK+DPDSSLKEWCKGKE VK CP+CGYTIEK++GCNHIEC+CG HICWVCLD F++
Sbjct: 1655 KYREFKEDPDSSLKEWCKGKEHVKTCPICGYTIEKIEGCNHIECRCGRHICWVCLDIFNS 1714

Query: 172  SEDCYDHLRTTHLAII 125
            + DCY HLR+ H++ I
Sbjct: 1715 ANDCYGHLRSKHMSFI 1730


>ref|XP_006426318.1| hypothetical protein CICLE_v10024688mg [Citrus clementina]
            gi|557528308|gb|ESR39558.1| hypothetical protein
            CICLE_v10024688mg [Citrus clementina]
          Length = 1730

 Score =  902 bits (2330), Expect = 0.0
 Identities = 432/736 (58%), Positives = 564/736 (76%), Gaps = 3/736 (0%)
 Frame = -3

Query: 2323 GPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-ISSFH 2147
            G G +PS+AL GAG EI+HLEL +++LTV+V HSN   LDDKELLM  E++ SG I S H
Sbjct: 1001 GAGVSPSVALFGAGAEIKHLELERRFLTVDVYHSNANILDDKELLMFLEKNASGSICSIH 1060

Query: 2146 KHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD-RYFT 1970
            K    GQ+ ++ +KWG++TFLTP+ A +A +E+N +E++GSLLKV PS     GD + +T
Sbjct: 1061 KF-AVGQDSDEKDKWGRVTFLTPDTAGKA-TELNGVEYNGSLLKVVPSRATLGGDNKMYT 1118

Query: 1969 FPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNE-DCVLIV 1793
            FPAV+A++ WPR+ SKG+A VKC   D + ++ DF  L I GR +RCEI +   D V+I 
Sbjct: 1119 FPAVKAKVYWPRRLSKGFAVVKCDATDVEFLVKDFFDLAIGGRYVRCEIGRRSMDAVVIS 1178

Query: 1792 GIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPSKNP 1613
            G+DKELSE E+L  LR  T R+I DL LVRG+A+  P  + A EEAL +EI+ F+P +N 
Sbjct: 1179 GLDKELSEDEILGELRKVTTRRIRDLFLVRGDAVECPQ-FDAFEEALLREISRFMPKRNS 1237

Query: 1612 LGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQHIF 1433
              N   VQVFPPEP+D  +KA ITFDGRL+LEAA ALE ++GKVL GC  WQK++CQ +F
Sbjct: 1238 HANCCRVQVFPPEPKDAFMKAFITFDGRLHLEAAKALEQLEGKVLPGCGPWQKMKCQQLF 1297

Query: 1432 HSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVADMRR 1253
            HSS+ CP  VY ++K +L+SL  +    NG   ++ERN NGSYRV+IS+NATKTVAD+RR
Sbjct: 1298 HSSLSCPASVYSVIKEELNSLLATLNRVNGAECVVERNYNGSYRVRISSNATKTVADLRR 1357

Query: 1252 PLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFGPQE 1073
            P+E++++G+ V H +LTPTIL+ LF+RDGI+L KS+QQET T+IL+D+  L+VKIFG  +
Sbjct: 1358 PVEELMRGRTVNHASLTPTILQHLFTRDGINLRKSLQQETRTFILFDRHTLSVKIFGALD 1417

Query: 1072 KLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDAELM 893
             +  A+ KL+++LL  +E KQLEIHLRG  LP +LMKEVV++FGPDL GLKEKVP AE  
Sbjct: 1418 NIAEAQQKLIQSLLTYHESKQLEIHLRGGVLPPDLMKEVVRRFGPDLQGLKEKVPGAEFS 1477

Query: 892  LNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDCYKL 713
            LNTRRH +  HGD+E KQKVE II++ A++    G ++ +   E +CPICLC+L++ Y L
Sbjct: 1478 LNTRRHVISVHGDRELKQKVEEIINEIAQT--SDGSAERL-HSEASCPICLCELEESYTL 1534

Query: 712  EACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLFRAS 533
            E C H FCRSCL+EQC+SAIK+ + FP+ C H+GC   IL+ DLR+LL  EK ++LFRAS
Sbjct: 1535 EGCTHLFCRSCLVEQCESAIKNMDSFPIRCAHSGCKALILLTDLRSLLSNEKFEELFRAS 1594

Query: 532  LSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYITCE 353
            L A+VASSGG YRFCPSPDCP+VYRVA+PGT  EPF CGAC+ ETCT CHLE HPY++CE
Sbjct: 1595 LGAYVASSGGTYRFCPSPDCPSVYRVAEPGTAGEPFFCGACYAETCTMCHLEHHPYLSCE 1654

Query: 352  RYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDCFST 173
            +Y++FK+DPDSSLKEWCKGKE VK CP+CGYTIEK++GCNHIEC+CG HICWVCLD F++
Sbjct: 1655 KYREFKEDPDSSLKEWCKGKEHVKTCPICGYTIEKIEGCNHIECRCGRHICWVCLDIFNS 1714

Query: 172  SEDCYDHLRTTHLAII 125
            + DCY HLR+ H++ I
Sbjct: 1715 ANDCYGHLRSKHMSFI 1730


>ref|XP_012469827.1| PREDICTED: putative uncharacterized protein At4g01020, chloroplastic
            [Gossypium raimondii] gi|763750851|gb|KJB18239.1|
            hypothetical protein B456_003G041600 [Gossypium
            raimondii]
          Length = 1750

 Score =  900 bits (2326), Expect = 0.0
 Identities = 434/740 (58%), Positives = 554/740 (74%), Gaps = 3/740 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG P  + S+AL GAG EI+HLE+ K+ LT++V HSNV  LDDKELL  FER  +G I
Sbjct: 1017 LYHG-PNASSSIALFGAGAEIKHLEVEKRCLTIDVFHSNVNTLDDKELLKFFERYSNGSI 1075

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD- 1982
             S HK   NGQE +D EKWGKITFLTP+AA++A +E++ ++F+GS LKV PS  +  GD 
Sbjct: 1076 CSVHKSQANGQESDDREKWGKITFLTPDAAQKA-AELDGVDFAGSALKVLPSRTSFGGDH 1134

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDC 1805
            +  +FPAV+A++ WPR+ SKG+ FVKC   D   VI D   L +  + IRC++S K+ D 
Sbjct: 1135 KMISFPAVKAKVYWPRRESKGFGFVKCDLLDVGFVIDDLDNLVVGSKTIRCDVSSKSNDA 1194

Query: 1804 VLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIP 1625
            +LI GIDKELSE E+   L+  T+RKI D  LVRG+A+ NP    ACE+AL +EI+ F+P
Sbjct: 1195 ILIRGIDKELSEAEIWDTLQGATNRKIHDFFLVRGDAVENPSC-GACEKALHREISHFMP 1253

Query: 1624 SKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQC 1445
             +NP  N   VQVF PEP++  +KA+ITFDGRL+LEAA ALEH++GKVL  C SWQKI C
Sbjct: 1254 KRNPHTNCCWVQVFQPEPKETFMKALITFDGRLHLEAAKALEHLEGKVLRRCLSWQKITC 1313

Query: 1444 QHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVA 1265
            Q +FHS + C   VY ++K+QLDSL  SFK   G    +E N NGSYRV+ISANATKTVA
Sbjct: 1314 QRLFHSYISCSSFVYAVIKKQLDSLLASFKRVKGAGCSIEANGNGSYRVRISANATKTVA 1373

Query: 1264 DMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIF 1085
            +MRRPLE+++ G+ + H  LTP+IL+ LFSRDGI L++S+Q+ET TYI +D+  L V+IF
Sbjct: 1374 EMRRPLEELMNGRTIKHAGLTPSILQHLFSRDGIHLMRSLQRETRTYISFDRHSLGVRIF 1433

Query: 1084 GPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPD 905
            G  +    AE K++++LL  +E KQLE+ LRG  LP +LMKEVVKKFGPDLHGLKEK+P 
Sbjct: 1434 GSPDAAAVAEQKMIQSLLSYHESKQLEVCLRGPGLPPDLMKEVVKKFGPDLHGLKEKIPG 1493

Query: 904  AELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDD 725
            +E  L++R H +  HGDKE+K+KVE I+ D A +  D        + +  CPICLC+++D
Sbjct: 1494 SEFTLDSRHHVISIHGDKETKRKVELIVLDIAETGEDLAKKS---DCDTTCPICLCEVED 1550

Query: 724  CYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDL 545
             Y LE C H FCR CL+EQC+SAI++ + FP+CC H GC  PIL+ DL++LL+ E L+ L
Sbjct: 1551 GYWLEGCSHFFCRPCLVEQCESAIRNLDSFPICCAHQGCNVPILLTDLKSLLLSEMLEQL 1610

Query: 544  FRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPY 365
            FRASL AFVASS G YRFCPSPDCP+VYRVA P TP E FVCGAC+ ETCTRCH E+HPY
Sbjct: 1611 FRASLGAFVASSKGTYRFCPSPDCPSVYRVADPETPGELFVCGACYTETCTRCHGEYHPY 1670

Query: 364  ITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLD 185
            ++CE+Y++FK+DPD SLKEWCKGKEQVK CPVCGYTIEK+DGCNHIECKCG H+CW CL+
Sbjct: 1671 LSCEKYREFKEDPDISLKEWCKGKEQVKTCPVCGYTIEKIDGCNHIECKCGRHVCWACLE 1730

Query: 184  CFSTSEDCYDHLRTTHLAII 125
             F+ S+DCY+HLR  H+AII
Sbjct: 1731 VFTCSDDCYNHLRAVHMAII 1750


>ref|XP_008782178.1| PREDICTED: LOW QUALITY PROTEIN: putative uncharacterized protein
            At4g01020, chloroplastic [Phoenix dactylifera]
          Length = 1736

 Score =  898 bits (2321), Expect = 0.0
 Identities = 424/734 (57%), Positives = 564/734 (76%), Gaps = 1/734 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSGIS 2156
            L+ G PG++P +AL G+G EI+HLEL K++LTVE+SH N  A+DDKE+L+M ++ VSGI+
Sbjct: 1007 LFPGRPGSSPPVALFGSGAEIKHLELDKRHLTVEISHPNAHAIDDKEVLLMVDQCVSGIA 1066

Query: 2155 SFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGDRY 1976
            ++HK+ GNGQEG D  KWGKITFL+P AAE AV+++N++EF GSLLK  P    +  ++ 
Sbjct: 1067 NYHKYAGNGQEGTD--KWGKITFLSPGAAENAVAKLNEVEFHGSLLKAVP--VRAVDNKM 1122

Query: 1975 FTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDCVL 1799
              F AVRAR+CWPR+ SKG A + CAR + + ++ D   L + GR + C++S K ++CV 
Sbjct: 1123 HPFSAVRARVCWPRRPSKGVALITCARGEAELIVRDCFALVVGGRYVNCQVSTKYKNCVF 1182

Query: 1798 IVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPSK 1619
            + G+ +++S+PEL     T T RKI D+HL+RGEAI NPP  A C EAL +EI+ F+P K
Sbjct: 1183 VTGLPRDVSKPELYDAFLTSTKRKILDIHLLRGEAIPNPP-GATCAEALVREISAFMPKK 1241

Query: 1618 NPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQH 1439
            N   ++F V+VF PEP+DYM+KA+ITFDG L+LEAA AL+HI+GKVL GC SWQ IQC+H
Sbjct: 1242 NFRDHSFQVEVFNPEPKDYMMKALITFDGSLHLEAAKALDHIEGKVLPGCLSWQTIQCEH 1301

Query: 1438 IFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVADM 1259
            +FHS + CP  VY ++K+QLDSL +SF+ Q G SY LE+N+NGS RVKISANATKT+AD+
Sbjct: 1302 VFHSHLSCPARVYFVIKKQLDSLLESFQRQKGVSYSLEKNDNGSCRVKISANATKTIADL 1361

Query: 1258 RRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFGP 1079
            RRPLE++++GK V+H +LTPT+L+LL SRDG++LLK+V++++GT+ILYD+Q LNVK+FGP
Sbjct: 1362 RRPLEQLMKGKTVSHPSLTPTVLQLLLSRDGMALLKAVERKSGTHILYDRQNLNVKVFGP 1421

Query: 1078 QEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDAE 899
             +++  AE  LV++LL L+ED+QLEI LRG +LP  LMKEVV++FGPDL GLKE VP AE
Sbjct: 1422 PKEVAAAEQNLVQSLLSLHEDRQLEIRLRGRNLPPXLMKEVVQRFGPDLQGLKEMVPGAE 1481

Query: 898  LMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDCY 719
            L LNTR H +   G    KQKVE +I + A S+    +++     E +CPICLC+L + Y
Sbjct: 1482 LTLNTRSHIIGVQGHNSLKQKVEEVISEVALSVDHGWMAE--QPLETSCPICLCELWEPY 1539

Query: 718  KLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLFR 539
            +LEAC H FCRSCL++Q +S I+S + FP+CCT  GC K IL+ DLR+LL  +++++LFR
Sbjct: 1540 RLEACGHDFCRSCLVDQLESTIRSRDSFPICCTKEGCNKLILLVDLRSLLPSQRMEELFR 1599

Query: 538  ASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYIT 359
            ASL AFVAS  G YRFCPSPDCP+VY+VA        F CGAC VETCT+CHLE+HP+I+
Sbjct: 1600 ASLGAFVASRSGSYRFCPSPDCPSVYQVATQDARGGHFACGACLVETCTKCHLEYHPFIS 1659

Query: 358  CERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDCF 179
            C RYK++K DPD SL EW KGKE +K CP CGYT+EKVDGC+HIECKCG HICWVCL+ F
Sbjct: 1660 CGRYKEYKKDPDLSLVEWRKGKENIKDCPACGYTVEKVDGCDHIECKCGRHICWVCLEFF 1719

Query: 178  STSEDCYDHLRTTH 137
             +S++CY HLR+ H
Sbjct: 1720 KSSDECYSHLRSEH 1733


>ref|XP_002307067.1| helicase domain-containing family protein [Populus trichocarpa]
            gi|222856516|gb|EEE94063.1| helicase domain-containing
            family protein [Populus trichocarpa]
          Length = 1743

 Score =  898 bits (2320), Expect = 0.0
 Identities = 423/724 (58%), Positives = 551/724 (76%), Gaps = 2/724 (0%)
 Frame = -3

Query: 2302 LALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-ISSFHKHPGNGQ 2126
            +AL GAG EI++LEL K+ LTV V  SN   +DDKE+LM  E   SG + S HK  G+GQ
Sbjct: 1021 MALFGAGAEIKYLELEKRCLTVNVFFSNANTIDDKEVLMFLEEYTSGTVCSVHKSVGSGQ 1080

Query: 2125 EGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD-RYFTFPAVRAR 1949
            EG++ EKWG+ITFL+P++A +A +++N++EF GS LKV PS     G+ + F+FPAV+A+
Sbjct: 1081 EGDEKEKWGQITFLSPDSARKA-AQLNEVEFKGSKLKVVPSQTIIGGNHKMFSFPAVKAK 1139

Query: 1948 LCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEISKNEDCVLIVGIDKELSE 1769
            + WPRK SKG A VKC   D D +I DFS LEI GR +RC   +  D +++ G  KELSE
Sbjct: 1140 IVWPRKVSKGLAIVKCYVHDVDFMICDFSNLEIGGRYVRCSAGRCVDSIVVSGFSKELSE 1199

Query: 1768 PELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIPSKNPLGNNFHVQ 1589
             ++L  LR+ T+R+I D  +VRG+A+ NPPL  ACE+AL +EI+PF+P +NP  +   VQ
Sbjct: 1200 ADILRALRSATNRRILDFFIVRGDAVENPPL-GACEKALLREISPFMPKRNPQTSCCRVQ 1258

Query: 1588 VFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQCQHIFHSSVFCPG 1409
            VFPPE +D  +KA ITFDGRL+LEAA ALEH++GKVL GC SWQKI+C+ +FHS + C  
Sbjct: 1259 VFPPELKDAFMKAFITFDGRLHLEAARALEHMEGKVLPGCHSWQKIKCEQMFHSLISCSA 1318

Query: 1408 PVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVADMRRPLEKILQG 1229
             +Y  +K+QLDSL  SF    G    L+RNENGSYRVKISANATKTVA++RRPLE++++G
Sbjct: 1319 SIYVAIKKQLDSLLASFSRVKGAECSLDRNENGSYRVKISANATKTVAELRRPLEELMRG 1378

Query: 1228 KNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIFGPQEKLGTAEFK 1049
            + + H +LTPTIL+ LFS  GI+L+KS+Q+ETGTYI +D++  N+KIFG  +K+  A+ K
Sbjct: 1379 QTINHPSLTPTILQHLFSGQGINLMKSIQRETGTYIHFDRRNFNLKIFGRPDKIAPAQQK 1438

Query: 1048 LVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPDAELMLNTRRHAV 869
             ++ LL  +E KQLEIHLRG DLP +LMKEVVK+FGPDLHGLKEKVP A+L L+TR H +
Sbjct: 1439 FIQLLLANHESKQLEIHLRGGDLPPDLMKEVVKRFGPDLHGLKEKVPGADLTLSTRHHVI 1498

Query: 868  MFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDDCYKLEACCHSFC 689
              HGDKE KQ VE II + A+   DS       +G  ACP+CLC+++D Y+LE+C H FC
Sbjct: 1499 SVHGDKELKQNVEEIIFEMAQMGYDSAER---LDGGDACPVCLCEVEDAYRLESCGHLFC 1555

Query: 688  RSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDLFRASLSAFVASS 509
            R CL+EQ +SA+K+ + FP+CC H  C  PIL+ DLR+LL  +KL++LFRASL +FVASS
Sbjct: 1556 RMCLVEQLESALKNLDSFPICCAHGSCRAPILLTDLRSLLSSDKLEELFRASLGSFVASS 1615

Query: 508  GGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPYITCERYKDFKDD 329
            GG YRFCPSPDCP+VYRVA P T  +PFVCGAC  ETCTRCHL++HPY++C++Y +FK+D
Sbjct: 1616 GGTYRFCPSPDCPSVYRVADPVTGGDPFVCGACFAETCTRCHLDYHPYLSCKKYMEFKED 1675

Query: 328  PDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLDCFSTSEDCYDHL 149
            PD SLK+WCKGKE VK CPVCGYTIEK +GCNH+ECKCG H+CWVCL+ ++ SEDCY+HL
Sbjct: 1676 PDLSLKDWCKGKENVKSCPVCGYTIEKGEGCNHVECKCGGHVCWVCLESYNNSEDCYNHL 1735

Query: 148  RTTH 137
            R+ H
Sbjct: 1736 RSMH 1739


>gb|KHG18071.1| hypothetical protein F383_23000 [Gossypium arboreum]
          Length = 1316

 Score =  894 bits (2311), Expect = 0.0
 Identities = 432/740 (58%), Positives = 551/740 (74%), Gaps = 3/740 (0%)
 Frame = -3

Query: 2335 LYHGGPGNTPSLALIGAGGEIRHLELAKKYLTVEVSHSNVRALDDKELLMMFERSVSG-I 2159
            LYHG P  + S+AL GAG EI+HLE+ K+ LT++V HSNV  LDDKELL  FER  +G I
Sbjct: 583  LYHG-PNASSSIALFGAGAEIKHLEVGKRCLTIDVFHSNVNTLDDKELLKFFERYSNGSI 641

Query: 2158 SSFHKHPGNGQEGEDSEKWGKITFLTPEAAERAVSEMNDIEFSGSLLKVCPSWFNSRGD- 1982
             S HK   NGQE +D EKWGKITFLTP+AA++A +E++ ++F+GS LKV PS     GD 
Sbjct: 642  CSVHKCQANGQESDDKEKWGKITFLTPDAAQKA-AELDGVDFTGSALKVLPSRTPFGGDH 700

Query: 1981 RYFTFPAVRARLCWPRKYSKGYAFVKCARQDFDAVISDFSGLEIQGRNIRCEIS-KNEDC 1805
            +  +FPAV+A++  PR+ SKG+ FVKC   D   +I D   L +  + I C++S K++D 
Sbjct: 701  KMISFPAVKAKVYLPRRQSKGFGFVKCDLLDAGFIIDDLDNLVVGSKTIHCDVSSKSDDA 760

Query: 1804 VLIVGIDKELSEPELLHILRTYTDRKIFDLHLVRGEAINNPPLYAACEEALAKEIAPFIP 1625
            +LI GIDKELSE E+   L   T+RKI D  LVRG+A+ NP    ACEEAL +EI+ F+P
Sbjct: 761  ILIRGIDKELSEAEIWDTLLGATNRKIHDFFLVRGDAVENPSC-GACEEALHREISHFMP 819

Query: 1624 SKNPLGNNFHVQVFPPEPQDYMVKAVITFDGRLYLEAAIALEHIQGKVLAGCRSWQKIQC 1445
             + P  N   VQVF PEP +  +KA+ITFDGRL+LEAA ALEH++GKVL GC SWQKI+C
Sbjct: 820  KRYPHTNCCWVQVFQPEPNETFMKALITFDGRLHLEAAKALEHLEGKVLRGCLSWQKIRC 879

Query: 1444 QHIFHSSVFCPGPVYPLVKRQLDSLFQSFKHQNGGSYILERNENGSYRVKISANATKTVA 1265
            Q +FHS + C   VY ++K+QLDSL  SFK   G  + +E N NGSYRV+ISANATKTVA
Sbjct: 880  QRLFHSYISCSSFVYAVIKKQLDSLLASFKRVKGAGWSIEANGNGSYRVRISANATKTVA 939

Query: 1264 DMRRPLEKILQGKNVTHGNLTPTILKLLFSRDGISLLKSVQQETGTYILYDKQKLNVKIF 1085
            +MRRPLE+++ G+ + H  LTP+IL+ LFSRDGI L++S+Q+ET TYI +D+  L V+IF
Sbjct: 940  EMRRPLEELMNGRTIKHAGLTPSILQHLFSRDGIHLMRSLQRETRTYISFDRHSLGVRIF 999

Query: 1084 GPQEKLGTAEFKLVRALLLLNEDKQLEIHLRGIDLPHNLMKEVVKKFGPDLHGLKEKVPD 905
            G  +     E KL+++LL  +E KQLE+ LRG  LP +LMKEVVKKFGPDLHGLKEK+P 
Sbjct: 1000 GSPDAAAVVEQKLIQSLLSYHESKQLEVCLRGPGLPPDLMKEVVKKFGPDLHGLKEKIPG 1059

Query: 904  AELMLNTRRHAVMFHGDKESKQKVEAIIDDFARSLGDSGLSDVVPEGEPACPICLCDLDD 725
            +E  L++R H +  HGDKE+K+KVE I+ D A +  D        +G+  CPICLC+++D
Sbjct: 1060 SEFTLDSRHHVISIHGDKETKRKVELIVLDIAETGEDLAKKS---DGDTTCPICLCEVED 1116

Query: 724  CYKLEACCHSFCRSCLLEQCDSAIKSHEGFPLCCTHAGCGKPILVADLRNLLMPEKLDDL 545
             Y LE C H FCR CL+EQC+SAI++ + FP+CC H GC  PIL+ DL++LL+ E L+ L
Sbjct: 1117 GYWLEGCSHFFCRPCLVEQCESAIRNLDSFPICCAHQGCNVPILLTDLKSLLLSEMLEQL 1176

Query: 544  FRASLSAFVASSGGIYRFCPSPDCPAVYRVAQPGTPAEPFVCGACHVETCTRCHLEFHPY 365
            FRASL AFVASS G YRFCPSPDCP+VYRVA P TP E FVCGAC+ ETCTRCH E+HPY
Sbjct: 1177 FRASLGAFVASSKGTYRFCPSPDCPSVYRVADPETPGELFVCGACYTETCTRCHGEYHPY 1236

Query: 364  ITCERYKDFKDDPDSSLKEWCKGKEQVKCCPVCGYTIEKVDGCNHIECKCGNHICWVCLD 185
            ++CE+Y++FK+DPD SLKEWCKGKEQVK CPVCGYTIEK+DGCNHIECKC  H+CW CL+
Sbjct: 1237 LSCEKYREFKEDPDLSLKEWCKGKEQVKTCPVCGYTIEKIDGCNHIECKCRRHVCWACLE 1296

Query: 184  CFSTSEDCYDHLRTTHLAII 125
             F+ S+DCY+HLR  H+AII
Sbjct: 1297 VFTCSDDCYNHLRAVHMAII 1316

