BLASTX nr result

ID: Sinomenium21_contig00020715 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00020715
         (1376 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271257.2| PREDICTED: uncharacterized protein LOC100243...   458   e-126
ref|XP_002313643.2| peptidase M50 family protein [Populus tricho...   434   e-119
emb|CBI17094.3| unnamed protein product [Vitis vinifera]              434   e-119
ref|XP_007015973.1| Chromodomain-helicase-DNA-binding protein Mi...   432   e-118
ref|XP_007015971.1| Chromodomain-helicase-DNA-binding protein Mi...   432   e-118
ref|XP_006384678.1| hypothetical protein POPTR_0004s20090g [Popu...   431   e-118
ref|XP_007015972.1| Chromodomain-helicase-DNA-binding protein Mi...   427   e-117
ref|XP_002527438.1| DNA binding protein, putative [Ricinus commu...   419   e-114
ref|XP_004295644.1| PREDICTED: uncharacterized protein LOC101310...   410   e-112
ref|XP_002875697.1| hypothetical protein ARALYDRAFT_905616 [Arab...   403   e-110
ref|XP_006292624.1| hypothetical protein CARUB_v10018866mg [Caps...   402   e-109
ref|XP_006424355.1| hypothetical protein CICLE_v10027677mg [Citr...   398   e-108
ref|XP_003539182.1| PREDICTED: uncharacterized protein LOC100796...   398   e-108
ref|XP_002872016.1| PHD finger family protein [Arabidopsis lyrat...   397   e-108
ref|XP_006484965.1| PREDICTED: uncharacterized protein LOC102614...   397   e-108
ref|XP_006484963.1| PREDICTED: uncharacterized protein LOC102614...   397   e-108
ref|NP_197668.2| PHD finger family protein [Arabidopsis thaliana...   394   e-107
ref|XP_003540783.1| PREDICTED: uncharacterized protein LOC100808...   390   e-106
ref|XP_006349073.1| PREDICTED: uncharacterized protein LOC102589...   389   e-105
ref|XP_006289692.1| hypothetical protein CARUB_v10003253mg [Caps...   388   e-105

>ref|XP_002271257.2| PREDICTED: uncharacterized protein LOC100243147 [Vitis vinifera]
          Length = 1582

 Score =  458 bits (1178), Expect = e-126
 Identities = 245/460 (53%), Positives = 306/460 (66%), Gaps = 2/460 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            ST   +K LLLELE ++R+IA SGDWVKLVD+W +E+S  +S T ++G   KR  GRR++
Sbjct: 804  STYSVIKALLLELEENIRIIALSGDWVKLVDNWLVEASVTQSATSAIGSTQKRGPGRRSK 863

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            + S +S++A     D  ++  WWRGGKLSK +FQ+  LP S VKKAARQGGSRKIPGICY
Sbjct: 864  RLSGVSEVADDRCLD--KDFTWWRGGKLSKHIFQRGILPRSAVKKAARQGGSRKIPGICY 921

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
            AE S+IPKRSR+  WR+AVEMSK+A QLALQVRYLD ++RW DLVRP+Q+  D KG ETE
Sbjct: 922  AEVSEIPKRSRQVIWRAAVEMSKNASQLALQVRYLDLHIRWGDLVRPEQNIQDVKGPETE 981

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            ASAFRNA ICDKK+ E+KIRY + FGNQKHLPSRV+KNIIEVEQ +D  +K+WF E  IP
Sbjct: 982  ASAFRNAFICDKKIVENKIRYGVAFGNQKHLPSRVMKNIIEVEQIQDGNDKYWFYEMRIP 1041

Query: 656  LFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASCP 477
            L+LIKEYEE  E +     +  +  SKLQR QLKASRRDIFSYLM + D +DKCSCASC 
Sbjct: 1042 LYLIKEYEESVETLLPSDKQPSNVLSKLQRLQLKASRRDIFSYLMRKRDNLDKCSCASCQ 1101

Query: 476  KDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPHI 303
             DVLL  AVKC  C GYCH++CTISST+   ++V  +ITC QCY        EN+     
Sbjct: 1102 LDVLLGSAVKCGACQGYCHEDCTISSTIQSTEEVEFLITCKQCYHAKTPTQNENSNDSPT 1161

Query: 302  NQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRKA 123
            +      ++ +    A K   Q  Y Q L  V   E+ S M+  A  +  +   RR    
Sbjct: 1162 SPLPLLGREYQNTATAPKGSRQKDYSQPLAYVRAPENCSNMQQTAAGSSLATKSRR---- 1217

Query: 122  LADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFRM 3
                             K  S+G+IWKKKN E+SG +FR+
Sbjct: 1218 -----------------KPCSWGLIWKKKNVEDSGIDFRL 1240


>ref|XP_002313643.2| peptidase M50 family protein [Populus trichocarpa]
            gi|550331774|gb|EEE87598.2| peptidase M50 family protein
            [Populus trichocarpa]
          Length = 1604

 Score =  434 bits (1117), Expect = e-119
 Identities = 231/461 (50%), Positives = 310/461 (67%), Gaps = 4/461 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRP-SGRRN 1200
            S+  A+K  LLELE ++R+IA SGDWVK +DDW +ESS   S+   +G A +R  +G+R+
Sbjct: 803  SSYSAIKQPLLELEENIRLIALSGDWVKAMDDWLVESSVTHSSASIIGTAQRRGVNGKRH 862

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RK S + D+AA    D  ++  WWRGG L KLV  KA LP S+VK+AARQGGSRKI GI 
Sbjct: 863  RKHSGVIDVAADGCHD--KSFVWWRGGTLLKLVSNKAILPQSMVKRAARQGGSRKISGIH 920

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y +  +I  RSR+  WR+AVE SK+A QLALQVRYLD +VRWSDLVRP+Q+  D KG+ET
Sbjct: 921  YTDDLEILNRSRQLIWRAAVERSKNASQLALQVRYLDYHVRWSDLVRPEQNLQDGKGSET 980

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EAS FRNA ICDKK +E  IRY + FGNQKHLPSR++KNIIE+E+ ED K+K+WFSE ++
Sbjct: 981  EASFFRNAVICDKKFEEKTIRYGIAFGNQKHLPSRIMKNIIEIEKTEDGKDKYWFSELHV 1040

Query: 659  PLFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            PL+LIKE+EE  + +P   NK  +  S LQRRQL+ASRRD+FSYL ++ DK+DKCSCASC
Sbjct: 1041 PLYLIKEFEESVDVIPPSSNKPSNELSVLQRRQLRASRRDMFSYLAFKRDKLDKCSCASC 1100

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTV--DMKDDVVITCNQCYWTSDVPLTENNYKPH 306
              DVL+R+ V CS C GYCH++CT+SS +  + +    +TC +CY    V  +E + K  
Sbjct: 1101 QCDVLIRNTVTCSSCQGYCHQDCTVSSRIYTNKEAQFSVTCKRCYSARAVIFSEKSNK-S 1159

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
            +    P  Q+   AV   KD     ++Q L SV   ES S +K    +T +S+   +   
Sbjct: 1160 LTSPFP-LQERHTAVTVTKDTGIKIHNQPLVSVRTQESCSEVKQ---NTSASSKATKPES 1215

Query: 125  ALADPPGGSTPDTAVK-RGKVTSYGIIWKKKNREESGANFR 6
               D    S+   A K   +  ++G++W+KKN E++G +FR
Sbjct: 1216 RTQDSCSTSSSGKATKTESRSRNWGVVWRKKNNEDTGIDFR 1256


>emb|CBI17094.3| unnamed protein product [Vitis vinifera]
          Length = 1382

 Score =  434 bits (1116), Expect = e-119
 Identities = 245/513 (47%), Positives = 306/513 (59%), Gaps = 55/513 (10%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            ST   +K LLLELE ++R+IA SGDWVKLVD+W +E+S  +S T ++G   KR  GRR++
Sbjct: 575  STYSVIKALLLELEENIRIIALSGDWVKLVDNWLVEASVTQSATSAIGSTQKRGPGRRSK 634

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            + S +S++A     D  ++  WWRGGKLSK +FQ+  LP S VKKAARQGGSRKIPGICY
Sbjct: 635  RLSGVSEVADDRCLD--KDFTWWRGGKLSKHIFQRGILPRSAVKKAARQGGSRKIPGICY 692

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
            AE S+IPKRSR+  WR+AVEMSK+A QLALQVRYLD ++RW DLVRP+Q+  D KG ETE
Sbjct: 693  AEVSEIPKRSRQVIWRAAVEMSKNASQLALQVRYLDLHIRWGDLVRPEQNIQDVKGPETE 752

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            ASAFRNA ICDKK+ E+KIRY + FGNQKHLPSRV+KNIIEVEQ +D  +K+WF E  IP
Sbjct: 753  ASAFRNAFICDKKIVENKIRYGVAFGNQKHLPSRVMKNIIEVEQIQDGNDKYWFYEMRIP 812

Query: 656  LFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASCP 477
            L+LIKEYEE  E +     +  +  SKLQR QLKASRRDIFSYLM + D +DKCSCASC 
Sbjct: 813  LYLIKEYEESVETLLPSDKQPSNVLSKLQRLQLKASRRDIFSYLMRKRDNLDKCSCASCQ 872

Query: 476  KDVLLRDAVKCSEC---------------------------------------------- 435
             DVLL  AVKC  C                                              
Sbjct: 873  LDVLLGSAVKCGACQAVIQLSKLKKIQLMLKLREVSNIYPLILPITIIQKAVAVLSYKVF 932

Query: 434  -------GGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPHINQSIPQT 282
                    GYCH++CTISST+   ++V  +ITC QCY        EN+     +      
Sbjct: 933  YSFIVLLSGYCHEDCTISSTIQSTEEVEFLITCKQCYHAKTPTQNENSNDSPTSPLPLLG 992

Query: 281  QKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRKALADPPGG 102
            ++ +    A K   Q  Y Q L  V   E+ S M+  A  +  +   RR           
Sbjct: 993  REYQNTATAPKGSRQKDYSQPLAYVRAPENCSNMQQTAAGSSLATKSRR----------- 1041

Query: 101  STPDTAVKRGKVTSYGIIWKKKNREESGANFRM 3
                      K  S+G+IWKKKN E+SG +FR+
Sbjct: 1042 ----------KPCSWGLIWKKKNVEDSGIDFRL 1064


>ref|XP_007015973.1| Chromodomain-helicase-DNA-binding protein Mi-2, putative isoform 3
            [Theobroma cacao] gi|508786336|gb|EOY33592.1|
            Chromodomain-helicase-DNA-binding protein Mi-2, putative
            isoform 3 [Theobroma cacao]
          Length = 1149

 Score =  432 bits (1111), Expect = e-118
 Identities = 232/460 (50%), Positives = 305/460 (66%), Gaps = 3/460 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            STC A+K LLLELE ++ VIA   DW+KL+DDW ++SS  +S + +VG   KR P GRR 
Sbjct: 344  STCSAIKALLLELEENISVIALLVDWIKLMDDWLVDSSVIQSTSSTVGLPQKRGPGGRRR 403

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RKQS  S++ A   +D  ++ +WWRGGKLS  +FQKA LP S+V+KAA+QGG RKI GI 
Sbjct: 404  RKQSVASEVTADDCDD--KSFDWWRGGKLSTHIFQKAILPGSMVRKAAQQGGVRKISGIN 461

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y + S+IPKRSR+  WR+AVE SK+A QLALQVRYLD +VRW+DLVRP+ +  D KG ET
Sbjct: 462  YVDDSEIPKRSRQLIWRAAVERSKNAAQLALQVRYLDLHVRWNDLVRPEHNIPDGKGTET 521

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EAS FRNA ICDKK  E+KI+Y + FGNQKHLPSRV+KNII+++Q ED KEK+WF  T+I
Sbjct: 522  EASVFRNAIICDKKSVENKIQYGVAFGNQKHLPSRVMKNIIDIDQTEDRKEKYWFLITHI 581

Query: 659  PLFLIKEYEEKAEKVPL-QLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PL+LIKEYEEK   V L  + KA    S+LQRRQLKASRR+IF+YL  + DK++KC CAS
Sbjct: 582  PLYLIKEYEEKMSNVGLPSVKKASSELSELQRRQLKASRRNIFAYLTSKRDKLEKCYCAS 641

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCTISS-TVDMKDDVVITCNQCYWTSDVPLTENNYKPH 306
            C  DVLLR+AVKC  C GYCH++CT+SS  ++ K + +I C QCY    +   E + K  
Sbjct: 642  CQMDVLLRNAVKCGTCQGYCHQDCTLSSMRMNGKVECLIICKQCYHAKVLGQNEISTKSP 701

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
            I     Q +    A    K +      Q +  +    S               ++R + +
Sbjct: 702  IIPLPLQGRDCLSAPAVTKGMQVKSSAQPIKPLVSIRS------------KENSVRIQER 749

Query: 125  ALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
            +       S    A KR K+ ++G+IW+KKN +E+G +FR
Sbjct: 750  SSDTKQSASLSGLATKRSKLCNWGVIWRKKNSDETGIDFR 789


>ref|XP_007015971.1| Chromodomain-helicase-DNA-binding protein Mi-2, putative isoform 1
            [Theobroma cacao] gi|508786334|gb|EOY33590.1|
            Chromodomain-helicase-DNA-binding protein Mi-2, putative
            isoform 1 [Theobroma cacao]
          Length = 1726

 Score =  432 bits (1111), Expect = e-118
 Identities = 232/460 (50%), Positives = 305/460 (66%), Gaps = 3/460 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            STC A+K LLLELE ++ VIA   DW+KL+DDW ++SS  +S + +VG   KR P GRR 
Sbjct: 921  STCSAIKALLLELEENISVIALLVDWIKLMDDWLVDSSVIQSTSSTVGLPQKRGPGGRRR 980

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RKQS  S++ A   +D  ++ +WWRGGKLS  +FQKA LP S+V+KAA+QGG RKI GI 
Sbjct: 981  RKQSVASEVTADDCDD--KSFDWWRGGKLSTHIFQKAILPGSMVRKAAQQGGVRKISGIN 1038

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y + S+IPKRSR+  WR+AVE SK+A QLALQVRYLD +VRW+DLVRP+ +  D KG ET
Sbjct: 1039 YVDDSEIPKRSRQLIWRAAVERSKNAAQLALQVRYLDLHVRWNDLVRPEHNIPDGKGTET 1098

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EAS FRNA ICDKK  E+KI+Y + FGNQKHLPSRV+KNII+++Q ED KEK+WF  T+I
Sbjct: 1099 EASVFRNAIICDKKSVENKIQYGVAFGNQKHLPSRVMKNIIDIDQTEDRKEKYWFLITHI 1158

Query: 659  PLFLIKEYEEKAEKVPL-QLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PL+LIKEYEEK   V L  + KA    S+LQRRQLKASRR+IF+YL  + DK++KC CAS
Sbjct: 1159 PLYLIKEYEEKMSNVGLPSVKKASSELSELQRRQLKASRRNIFAYLTSKRDKLEKCYCAS 1218

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCTISS-TVDMKDDVVITCNQCYWTSDVPLTENNYKPH 306
            C  DVLLR+AVKC  C GYCH++CT+SS  ++ K + +I C QCY    +   E + K  
Sbjct: 1219 CQMDVLLRNAVKCGTCQGYCHQDCTLSSMRMNGKVECLIICKQCYHAKVLGQNEISTKSP 1278

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
            I     Q +    A    K +      Q +  +    S               ++R + +
Sbjct: 1279 IIPLPLQGRDCLSAPAVTKGMQVKSSAQPIKPLVSIRS------------KENSVRIQER 1326

Query: 125  ALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
            +       S    A KR K+ ++G+IW+KKN +E+G +FR
Sbjct: 1327 SSDTKQSASLSGLATKRSKLCNWGVIWRKKNSDETGIDFR 1366


>ref|XP_006384678.1| hypothetical protein POPTR_0004s20090g [Populus trichocarpa]
            gi|550341446|gb|ERP62475.1| hypothetical protein
            POPTR_0004s20090g [Populus trichocarpa]
          Length = 1708

 Score =  431 bits (1107), Expect = e-118
 Identities = 234/465 (50%), Positives = 305/465 (65%), Gaps = 8/465 (1%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPA-MKRPSGRRN 1200
            ST  A+K  LLELE + R++A SGDWVK +DDW +ES   +S+  S+G A  +R +G+R+
Sbjct: 899  STYSAIKQPLLELEENTRLVALSGDWVKAMDDWLVESPMTQSSAISIGTAHRRRVNGKRH 958

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            +K S ++D  A    D  ++  WWRGGKL KLVF KA LP S+V++AARQGGSRKI GI 
Sbjct: 959  KKHSGVTDTTADGCHD--KSFVWWRGGKLLKLVFNKAILPQSMVRRAARQGGSRKISGIH 1016

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y +  +IP RSR+  WR+AVE S +A QLALQVRYLD +VRWSDLVRP+Q+  D KG+ET
Sbjct: 1017 YTDDLEIPNRSRQLVWRAAVERSNNASQLALQVRYLDFHVRWSDLVRPEQNLQDGKGSET 1076

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            E+S FRNA ICDKK++E K RY + FGNQKHLPSR++KNIIE+EQ+E+ K+K+WFSE ++
Sbjct: 1077 ESSVFRNAVICDKKIEEKKTRYGIAFGNQKHLPSRIMKNIIEIEQSENGKDKYWFSEMHV 1136

Query: 659  PLFLIKEYEEKA-EKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PL+LIKE+EE   E VP    K  +  S LQRRQLK SRRDIFSYL  + DK+D CSCAS
Sbjct: 1137 PLYLIKEFEESLDEVVPPSAKKPSNELSVLQRRQLKDSRRDIFSYLASKRDKLDSCSCAS 1196

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKP 309
            C  DVL+RD V CS C GYCH+ CT+SS +   ++    I C +CY    V   E   + 
Sbjct: 1197 CQYDVLIRDTVTCSSCQGYCHQACTVSSRIYTNEEAQFSIICKRCYSARAVIYDEKRNES 1256

Query: 308  HINQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
              +    Q Q+   AV   K      ++Q   SV   ES S +K  A ST S AT  + R
Sbjct: 1257 LTSPLPLQWQEHHNAVTVMKSTRIKLHNQPFMSVRTQESCSEVKQ-ATSTSSKATKTKSR 1315

Query: 128  KALADPPGGSTPDTAVKRGKVTS----YGIIWKKKNREESGANFR 6
              ++         ++ K  K  S    +GIIW+KKN E++G +FR
Sbjct: 1316 TQVSGSEVKQAISSSRKATKTESRSRNWGIIWRKKNNEDTGIDFR 1360


>ref|XP_007015972.1| Chromodomain-helicase-DNA-binding protein Mi-2, putative isoform 2
            [Theobroma cacao] gi|508786335|gb|EOY33591.1|
            Chromodomain-helicase-DNA-binding protein Mi-2, putative
            isoform 2 [Theobroma cacao]
          Length = 1727

 Score =  427 bits (1099), Expect = e-117
 Identities = 232/461 (50%), Positives = 305/461 (66%), Gaps = 4/461 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            STC A+K LLLELE ++ VIA   DW+KL+DDW ++SS  +S + +VG   KR P GRR 
Sbjct: 921  STCSAIKALLLELEENISVIALLVDWIKLMDDWLVDSSVIQSTSSTVGLPQKRGPGGRRR 980

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RKQS  S++ A   +D  ++ +WWRGGKLS  +FQKA LP S+V+KAA+QGG RKI GI 
Sbjct: 981  RKQSVASEVTADDCDD--KSFDWWRGGKLSTHIFQKAILPGSMVRKAAQQGGVRKISGIN 1038

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y + S+IPKRSR+  WR+AVE SK+A QLALQVRYLD +VRW+DLVRP+ +  D KG ET
Sbjct: 1039 YVDDSEIPKRSRQLIWRAAVERSKNAAQLALQVRYLDLHVRWNDLVRPEHNIPDGKGTET 1098

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EAS FRNA ICDKK  E+KI+Y + FGNQKHLPSRV+KNII+++Q ED KEK+WF  T+I
Sbjct: 1099 EASVFRNAIICDKKSVENKIQYGVAFGNQKHLPSRVMKNIIDIDQTEDRKEKYWFLITHI 1158

Query: 659  PLFLIKEYEEKAEKVPL-QLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PL+LIKEYEEK   V L  + KA    S+LQRRQLKASRR+IF+YL  + DK++KC CAS
Sbjct: 1159 PLYLIKEYEEKMSNVGLPSVKKASSELSELQRRQLKASRRNIFAYLTSKRDKLEKCYCAS 1218

Query: 482  CPKDVLL-RDAVKCSECGGYCHKNCTISS-TVDMKDDVVITCNQCYWTSDVPLTENNYKP 309
            C  DVLL R+AVKC  C GYCH++CT+SS  ++ K + +I C QCY    +   E + K 
Sbjct: 1219 CQMDVLLSRNAVKCGTCQGYCHQDCTLSSMRMNGKVECLIICKQCYHAKVLGQNEISTKS 1278

Query: 308  HINQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
             I     Q +    A    K +      Q +  +    S               ++R + 
Sbjct: 1279 PIIPLPLQGRDCLSAPAVTKGMQVKSSAQPIKPLVSIRS------------KENSVRIQE 1326

Query: 128  KALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
            ++       S    A KR K+ ++G+IW+KKN +E+G +FR
Sbjct: 1327 RSSDTKQSASLSGLATKRSKLCNWGVIWRKKNSDETGIDFR 1367


>ref|XP_002527438.1| DNA binding protein, putative [Ricinus communis]
            gi|223533173|gb|EEF34930.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 1723

 Score =  419 bits (1076), Expect = e-114
 Identities = 231/485 (47%), Positives = 313/485 (64%), Gaps = 28/485 (5%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            S+C A+   LLELE ++R IAF GDW K +D   ++S   +    + G   +  P G+R+
Sbjct: 786  SSCSAIMGPLLELEENIRTIAFLGDWTKAMDVLLVDSPMIQIAASNGGITQRSGPGGKRH 845

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RKQS + D  A  ++D  ++  WWRG K  KLVFQ+A LP  +VK+AARQGGS+KI G+ 
Sbjct: 846  RKQSGVPDFRANSNDD--KSFVWWRGEKQLKLVFQQAILPRLVVKRAARQGGSKKIMGVF 903

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y +  ++PKRSR+  WR+AVE SK+A QLALQVRYLD +VRW+DLVRP+Q+  D KG+ET
Sbjct: 904  YVDDPELPKRSRQMVWRAAVERSKNASQLALQVRYLDLHVRWTDLVRPEQNNQDGKGSET 963

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EAS FRNA ICDKK++++KI Y + FGNQKHLPSR++KNIIE+EQ+ D KEK+WFSET++
Sbjct: 964  EASVFRNAIICDKKIEKNKICYGVAFGNQKHLPSRIMKNIIEIEQSVDGKEKYWFSETHV 1023

Query: 659  PLFLIKEYEEKAEKVPL-QLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PLFLIKE+EE+ ++V L    K+L+  S+LQR+QLK SRRDIF YL ++ DK+++CSCAS
Sbjct: 1024 PLFLIKEFEERVDQVALPSAKKSLNELSELQRKQLKYSRRDIFLYLTFKRDKLERCSCAS 1083

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKP 309
            C  DVL+R+ VKCS C GYCHK+CTISSTV    +V  +ITC QC     V +  N+ +P
Sbjct: 1084 CQHDVLIRNTVKCSACQGYCHKDCTISSTVYRNAEVEFLITCKQCCNAKAVVVHGNDNEP 1143

Query: 308  HINQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHS----------VMKSPAPST 159
             I     Q ++S   + A K     G   +L    K  +H            ++ P    
Sbjct: 1144 PIFHLPLQGRESHDVLTAPK-----GTRIKLRYNAKPVAHENDNGTPSTPLSLQGPESQN 1198

Query: 158  DSSATMRRKRKALADPPG-------------GSTPDTAVK-RGKVTSYGIIWKKKNREES 21
              +A    + K    PP               STP  A K R K+ ++G+IWKKKN E++
Sbjct: 1199 MLTAAKGTRVKFHIQPPSVRAQNSSPEMKQDTSTPSLATKTRSKICNWGVIWKKKNTEDA 1258

Query: 20   GANFR 6
            G +FR
Sbjct: 1259 GTDFR 1263


>ref|XP_004295644.1| PREDICTED: uncharacterized protein LOC101310205 [Fragaria vesca
            subsp. vesca]
          Length = 1676

 Score =  410 bits (1053), Expect = e-112
 Identities = 222/462 (48%), Positives = 301/462 (65%), Gaps = 4/462 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            STC  +K LLL+LE ++R IA SG+W+KLVDD  +ESS  +  T + G + +R P  RR 
Sbjct: 886  STCSLIKVLLLKLEENIRTIALSGEWIKLVDDVLVESSMIQGPTCTAGTSQRRGPYFRRG 945

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            RKQSA+ ++  +  E + ++  WW+GGKLSK++FQ+A LP S+VKKAARQGGSRKI G+ 
Sbjct: 946  RKQSAIQEV--IDDECNDKSFVWWQGGKLSKIIFQRAILPCSLVKKAARQGGSRKIFGVS 1003

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            YA+G DIPKRSR+  WR+AVE+SK   QLA+QVRYLD ++RWSDLVRP+Q+  D K AE 
Sbjct: 1004 YADGPDIPKRSRQSVWRAAVELSKKGSQLAVQVRYLDYHLRWSDLVRPEQNLLDGKAAEA 1063

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EASAFRNA+ICDKK+ ++ I Y + FG+QKHLP+RV+K+IIE EQN+D   KFWF E+ I
Sbjct: 1064 EASAFRNASICDKKMLKNNIVYGVAFGSQKHLPNRVMKSIIETEQNQDGTNKFWFLESRI 1123

Query: 659  PLFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            PL+LIKEYEE   KVP+   +  +  +KLQRRQ  A RRDIF YL  + D +D   C+ C
Sbjct: 1124 PLYLIKEYEESVAKVPMPSVQEPNLLNKLQRRQRNAIRRDIFYYLECKRDNLDLIICSLC 1183

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPH 306
              ++L+R+AVKCS C GYCH+ CTISSTV   ++V  +ITC QCY    V   +  +K  
Sbjct: 1184 QLEILVRNAVKCSSCQGYCHEACTISSTVSTNEEVEFLITCKQCYHMK-VLAEKQKFKEF 1242

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYH-QQLDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
                +P  +K     +      +  YH Q + S+   E  S +K    +TDS    +++R
Sbjct: 1243 PTNPLPLQKKEYHTPLTVTTAGRPKYHNQSVTSIKVQEPRSEIKQ--ATTDSGLATKKRR 1300

Query: 128  KALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFRM 3
                                + S+G+IWKKK   E+G +FR+
Sbjct: 1301 -------------------PICSWGVIWKKKT-PETGTDFRI 1322


>ref|XP_002875697.1| hypothetical protein ARALYDRAFT_905616 [Arabidopsis lyrata subsp.
            lyrata] gi|297321535|gb|EFH51956.1| hypothetical protein
            ARALYDRAFT_905616 [Arabidopsis lyrata subsp. lyrata]
          Length = 1570

 Score =  403 bits (1036), Expect = e-110
 Identities = 217/463 (46%), Positives = 293/463 (63%), Gaps = 6/463 (1%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            STC ALK LLLELE ++  IA S DW KL+DDW +E S F+S   +VG   KR  GRR  
Sbjct: 794  STCKALKALLLELEENICSIALSSDWFKLMDDWLVEHSIFQSAPVTVGVTQKRGPGRR-- 851

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            KQ   +++ A  S+DD  +  WWRGGKLSK++  KA L    ++KAA QGGS+KIPG  Y
Sbjct: 852  KQRTQAEVTAEGSDDD--SFTWWRGGKLSKVILLKAVLSQPAIRKAAWQGGSQKIPGFNY 909

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
             + S IP+RSR+  W++AVE SK+  QLALQVRYLD  +RWS+LVRP+Q+  D KG ET+
Sbjct: 910  GDASYIPRRSRRSIWKAAVESSKNISQLALQVRYLDMNLRWSELVRPEQNLQDVKGPETD 969

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
             + FRNA ICDKK+ ++K+ Y + FGNQKHLPSRV+KN+IEVE+ +D  EK+WF E  +P
Sbjct: 970  VAIFRNARICDKKLSDNKVSYGVFFGNQKHLPSRVMKNVIEVEKTQDGNEKYWFQEARVP 1029

Query: 656  LFLIKEYEEKAEKV--PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            L+LIKE+EE   +V  P    K  +  SKLQR+QLKASR DIFSY+  R DK++KCSCAS
Sbjct: 1030 LYLIKEFEESLHRVQMPSSTKKPSNKLSKLQRKQLKASRMDIFSYIASRRDKMEKCSCAS 1089

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCT-ISSTVDMKDDVVITCNQCYWTSDVPLTENNYKPH 306
            C  DVLLRD   CS C G+CHK CT +S   + K +V++TC +CY   +      N++  
Sbjct: 1090 CDHDVLLRDTTTCSSCQGFCHKECTWMSQHTNGKVEVLVTCKRCYLAKNRVPANINHRQS 1149

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
                +    + + AV     +      QQ++   +     V+K   PS    +   R+  
Sbjct: 1150 TTPQLTINGRHQNAVTPVIKI--KPPSQQINGRPQNAVTPVIKIKPPSQQLPSQKPRENT 1207

Query: 125  ALADPPGGSTPDTAVK---RGKVTSYGIIWKKKNREESGANFR 6
            +        TP++ VK   + K  S G+IW+KKN E++G +FR
Sbjct: 1208 SGVKQ---ITPESTVKSKSKQKTLSCGVIWRKKNVEDTGVDFR 1247


>ref|XP_006292624.1| hypothetical protein CARUB_v10018866mg [Capsella rubella]
            gi|482561331|gb|EOA25522.1| hypothetical protein
            CARUB_v10018866mg [Capsella rubella]
          Length = 1563

 Score =  402 bits (1034), Expect = e-109
 Identities = 221/466 (47%), Positives = 298/466 (63%), Gaps = 9/466 (1%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            +TC ALK LLLELE ++  IA SGDW KL+DDW IE S F+S   +VG   KR  GR+ +
Sbjct: 787  TTCKALKGLLLELEENICGIALSGDWFKLMDDWWIEHSIFQSAPVTVGVTQKRGPGRKRQ 846

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            K  A  ++ A   +DD  +  WWRGGKLSK++  KA L     +KAA QGGS+KIPG  Y
Sbjct: 847  KNQA--EVTAEGYDDD--SFTWWRGGKLSKVILLKAVLSQLATRKAAWQGGSKKIPGFSY 902

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
             + S IP+RSR+  W++AVE SK+  QLALQVRYLD  +RWS+LVRP+Q+  D KG ET+
Sbjct: 903  GDASYIPRRSRRSNWKAAVESSKNISQLALQVRYLDMNLRWSELVRPEQNLQDVKGPETD 962

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
             + FRNA ICDK++ ++K+ Y + FGNQKHLPSRV+KN+IEVE+ +D  EK+WF ET +P
Sbjct: 963  VAIFRNARICDKRLSDNKVSYGVFFGNQKHLPSRVMKNVIEVEKTQDGNEKYWFQETRVP 1022

Query: 656  LFLIKEYEEKAEKV--PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            L+LIK++EE   +V  P    K  +  SKLQR+QLKASR DIFSY+  R DK++KCSCAS
Sbjct: 1023 LYLIKDFEESLHRVQMPSLTKKPSNKLSKLQRKQLKASRMDIFSYIASRRDKMEKCSCAS 1082

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCT-ISSTVDMKDDVVITCNQCYWTSDVPLTENNYKPH 306
            C  DVLLRD   C+ C G+CHK CT +S   + K +V++TC +CY+      T  +   H
Sbjct: 1083 CDHDVLLRDTTTCNACQGFCHKECTWMSQHTNGKVEVLVTCKKCYFAKTRFQTNIS---H 1139

Query: 305  INQSIPQ-TQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
            I  +IPQ T  S+        +      QQ++   +     V+K   PS    +     +
Sbjct: 1140 IQSTIPQLTINSRHQNAITPVIKIKPPSQQINGRPQNSVTPVIKIKPPSQQLLS-----Q 1194

Query: 128  KALADPPGGS--TPDTAV---KRGKVTSYGIIWKKKNREESGANFR 6
            K L +  G    TPD++V    + K  S G+IW+KKN E++G +FR
Sbjct: 1195 KPLENTSGVKQFTPDSSVTSKSKQKTLSCGVIWRKKNLEDTGVDFR 1240


>ref|XP_006424355.1| hypothetical protein CICLE_v10027677mg [Citrus clementina]
            gi|557526289|gb|ESR37595.1| hypothetical protein
            CICLE_v10027677mg [Citrus clementina]
          Length = 1691

 Score =  398 bits (1022), Expect = e-108
 Identities = 217/460 (47%), Positives = 300/460 (65%), Gaps = 4/460 (0%)
 Frame = -1

Query: 1373 TCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRP-SGRRNR 1197
            T  ++K LLLELE ++  IA SGDWVKL+DDW  +SS  +S + +     KR  SG+R R
Sbjct: 889  TLNSMKALLLELEENICHIALSGDWVKLMDDWLGDSSVIQSASCNFVTTQKRGLSGKRGR 948

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            K S +S++ A    D  ++ +WW+GGK +KL+ +KA LPH+I++ AAR+GG RKI G+ Y
Sbjct: 949  KHSVISEVTADDCND--QSFSWWQGGKSTKLISKKAILPHTIIRNAARRGGLRKISGVNY 1006

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
               +++PKRSR+  WR+AVE SK+  QLALQVRY+D +VRWS+LVRP+Q+  D KG ETE
Sbjct: 1007 T--AEMPKRSRQLVWRAAVERSKTVSQLALQVRYIDLHVRWSELVRPEQNLQDGKGPETE 1064

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            A AFRNA ICDKK+ E+KIRY + FG  +HLPSRV+KNII++E ++D KEK+WF ET +P
Sbjct: 1065 AFAFRNAIICDKKIVENKIRYGVAFGIHRHLPSRVMKNIIDIELSQDGKEKYWFPETCLP 1124

Query: 656  LFLIKEYEEKAEKV-PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            LFLIKEYEE+ + V      K  +  S+ Q++QLKASR+D+FSYL+ R DK++KC+CASC
Sbjct: 1125 LFLIKEYEERVDMVIAPSSKKPSNELSEFQKKQLKASRKDLFSYLVCRRDKIEKCACASC 1184

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPH 306
              DVLL +AVKC  C GYCH+ CT SS++ M   V  +I CN+CY    +  +E   +  
Sbjct: 1185 QLDVLLGNAVKCGTCQGYCHEGCT-SSSMHMNSGVEPMIVCNRCYLPRALATSEIRSESP 1243

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
             +      Q+   AV   K     G++Q L S+   ES    +S    +DSS   + + +
Sbjct: 1244 TSPLPLHRQEYHTAVKVSKGTRPKGFNQALASIRTQES---SESKQTVSDSSTVTKTRNR 1300

Query: 125  ALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
             L                   S+GIIW+KKN E++GA+FR
Sbjct: 1301 TL-------------------SWGIIWRKKNIEDAGADFR 1321


>ref|XP_003539182.1| PREDICTED: uncharacterized protein LOC100796377 [Glycine max]
          Length = 1612

 Score =  398 bits (1022), Expect = e-108
 Identities = 219/471 (46%), Positives = 290/471 (61%), Gaps = 15/471 (3%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRN 1200
            +T  A+K LLL+LE ++R IAF GDWVKL+DDW +E S  +S T ++G A KR PSGRR 
Sbjct: 796  TTFSAIKPLLLKLEENIRTIAFCGDWVKLMDDWLVEFSMVQSATSTLGTAQKRAPSGRRY 855

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            +K+SA  +  A    ++F    WWRGGK +K +FQKA LP S+V+KAARQGGSRKI GI 
Sbjct: 856  KKRSANDEATAEGCPENFV---WWRGGKFTKFIFQKAVLPKSMVRKAARQGGSRKISGIF 912

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            YA+ S+IPKRSR+  WR AV+MS++A QLALQVRYLD Y+RWSDL+RP+Q+  D KG ET
Sbjct: 913  YADSSEIPKRSRQLVWRVAVQMSRNASQLALQVRYLDFYLRWSDLIRPEQNIQDGKGQET 972

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EASAFRNA ICD K+ E K  Y + FG+QKHLPSRV+KN+ ++EQ+ + KEK+WF ET I
Sbjct: 973  EASAFRNANICDNKLVEGKSCYGIAFGSQKHLPSRVMKNVFQIEQDPERKEKYWFFETRI 1032

Query: 659  PLFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            PL+LIKEYEE    +P        A   L RR+LKA  +DIF YL  + D +D  SC+ C
Sbjct: 1033 PLYLIKEYEEGNGNMPCNEEHLNTASELLYRRRLKAICKDIFLYLTCKRDNLDVVSCSVC 1092

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDVVI-TCNQCYWTSDVPLTENNYKPHI 303
               +L+RDA KC+ C GYCH+ C+  STV   + V + TC QCY    +   ENN +   
Sbjct: 1093 QMGLLIRDAHKCNACQGYCHEGCSTRSTVSANEVVYLTTCKQCYHARLLAQKENNNESPT 1152

Query: 302  NQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRKA 123
            +  + Q +++       K      + Q L S     ++  MK   P T    T  +  + 
Sbjct: 1153 SPLLLQGRENNSGTFL-KGSRPKSHDQVLKSSRTKANNPSMKQVTPVTALKGTKAKYYEQ 1211

Query: 122  LADPPG-------------GSTPDTAVKRGKVTSYGIIWKKKNREESGANF 9
                PG                  T  K  K  S+G+IW+KKN E++  +F
Sbjct: 1212 EPTSPGTKDNNHFDMPQVASEATSTGKKPRKNCSWGLIWQKKNNEDTDNDF 1262


>ref|XP_002872016.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297317853|gb|EFH48275.1| PHD finger family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1562

 Score =  397 bits (1021), Expect = e-108
 Identities = 213/459 (46%), Positives = 290/459 (63%), Gaps = 2/459 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            STC A+K LLLELE ++  IA S DW+K +DDW IE S F+S   +VG   KR  G+R +
Sbjct: 789  STCKAMKSLLLELEENICSIALSSDWLKQIDDWLIEHSIFQSAPDTVGATQKRRPGKRKQ 848

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            +  A  +I A  S+DD  +  WWRGGKLSK++  KA +    ++KAA QGG +K+P   Y
Sbjct: 849  RNQA--EITAQGSDDD--SFTWWRGGKLSKVILLKAVVSKPKIRKAAWQGGMKKLPEFNY 904

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
             +GS IPKRSR+  W++AVE SK+  QLALQVRYLD  +RWS+LVRP+Q+  D KG ETE
Sbjct: 905  GDGSYIPKRSRRSIWKAAVESSKNISQLALQVRYLDMNIRWSELVRPEQNVQDVKGPETE 964

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            A+ FRNA+ICDKK+ ++K+RY + FGNQKHLPSRV+KN+IEVE+ ED  EK+WF E  +P
Sbjct: 965  AAIFRNASICDKKIIDNKVRYGVVFGNQKHLPSRVMKNVIEVEKTEDRDEKYWFHEARVP 1024

Query: 656  LFLIKEYEEKAEKVPLQ-LNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            L+LIKEYEE   +V +  + K     SKLQ+RQLKASR +IFSYL  R D  +KCSCASC
Sbjct: 1025 LYLIKEYEESLHRVNIPFIKKPSRKISKLQKRQLKASRANIFSYLASRRDNTEKCSCASC 1084

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISST-VDMKDDVVITCNQCYWTSDVPLTENNYKPHI 303
              DV LRD+  CS C G+CHK CT+S+     + ++++TC +CY      L   N++P  
Sbjct: 1085 HLDVFLRDSTTCSTCQGFCHKECTMSTQHTTGQVEILVTCKRCYLARARSLININHRPPT 1144

Query: 302  NQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRKA 123
              ++    + + AV +         +QQL S    ++ S +K   P  D +   + K K 
Sbjct: 1145 TPTVLINGQVQNAVTSVTKTQIKPLNQQLPSPKIRDNASGVKQITP--DFNLAPKSKHKT 1202

Query: 122  LADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
            L                   S+G+IW+KKN  ++G +FR
Sbjct: 1203 L-------------------SWGVIWRKKNLADTGVSFR 1222


>ref|XP_006484965.1| PREDICTED: uncharacterized protein LOC102614180 isoform X3 [Citrus
            sinensis]
          Length = 1665

 Score =  397 bits (1019), Expect = e-108
 Identities = 217/460 (47%), Positives = 299/460 (65%), Gaps = 4/460 (0%)
 Frame = -1

Query: 1373 TCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRP-SGRRNR 1197
            T  ++K LLLELE ++  IA SGDWVK +DDW  +SS  +S + +     KR  SG+R R
Sbjct: 863  TLNSIKALLLELEENICHIALSGDWVKSMDDWLGDSSVIQSASCNFVTTQKRGLSGKRGR 922

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            K S +S++ A    D  ++ +WW+GGK +KL+ +KA LPH+I++ AAR+GG RKI G+ Y
Sbjct: 923  KHSVISEVTADDCND--QSFSWWQGGKSTKLISKKAILPHTIIRNAARRGGLRKISGVNY 980

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
               +++PKRSR+  WR+AVE SK+  QLALQVRY+D +VRWS+LVRP+Q+  D KG ETE
Sbjct: 981  T--AEMPKRSRQLVWRAAVERSKTVSQLALQVRYIDLHVRWSELVRPEQNLQDGKGPETE 1038

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            A AFRNA ICDKK+ E+KIRY + FG  +HLPSRV+KNII++E ++D KEK+WF ET +P
Sbjct: 1039 AFAFRNAIICDKKIVENKIRYGVAFGIHRHLPSRVMKNIIDIELSQDGKEKYWFPETCLP 1098

Query: 656  LFLIKEYEEKAEKV-PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            LFLIKEYEE  + V      K L+  S+ Q++QLKASR+D+FSYL+ R DK++KC+CASC
Sbjct: 1099 LFLIKEYEESVDMVIAPSSKKPLNELSEFQKKQLKASRKDLFSYLVCRRDKIEKCACASC 1158

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPH 306
              DVLL +AVKC  C GYCH+ CT SS++ M   V  +I CN+CY    +  +E   +  
Sbjct: 1159 QIDVLLGNAVKCGTCQGYCHEGCT-SSSMHMNSGVEPMIVCNRCYLPRALATSEIRSESP 1217

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
             +      Q+   AV   K     G++Q L S+   ES    +S    +DSS   + + +
Sbjct: 1218 TSPLPLHRQEYHTAVKVSKGTRPKGFNQALASIRTQES---SESKQTVSDSSTVTKTRNR 1274

Query: 125  ALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
             L                   S+GIIW+KKN E++GA+FR
Sbjct: 1275 TL-------------------SWGIIWRKKNIEDAGADFR 1295


>ref|XP_006484963.1| PREDICTED: uncharacterized protein LOC102614180 isoform X1 [Citrus
            sinensis] gi|568863025|ref|XP_006484964.1| PREDICTED:
            uncharacterized protein LOC102614180 isoform X2 [Citrus
            sinensis]
          Length = 1717

 Score =  397 bits (1019), Expect = e-108
 Identities = 217/460 (47%), Positives = 299/460 (65%), Gaps = 4/460 (0%)
 Frame = -1

Query: 1373 TCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRP-SGRRNR 1197
            T  ++K LLLELE ++  IA SGDWVK +DDW  +SS  +S + +     KR  SG+R R
Sbjct: 915  TLNSIKALLLELEENICHIALSGDWVKSMDDWLGDSSVIQSASCNFVTTQKRGLSGKRGR 974

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            K S +S++ A    D  ++ +WW+GGK +KL+ +KA LPH+I++ AAR+GG RKI G+ Y
Sbjct: 975  KHSVISEVTADDCND--QSFSWWQGGKSTKLISKKAILPHTIIRNAARRGGLRKISGVNY 1032

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
               +++PKRSR+  WR+AVE SK+  QLALQVRY+D +VRWS+LVRP+Q+  D KG ETE
Sbjct: 1033 T--AEMPKRSRQLVWRAAVERSKTVSQLALQVRYIDLHVRWSELVRPEQNLQDGKGPETE 1090

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            A AFRNA ICDKK+ E+KIRY + FG  +HLPSRV+KNII++E ++D KEK+WF ET +P
Sbjct: 1091 AFAFRNAIICDKKIVENKIRYGVAFGIHRHLPSRVMKNIIDIELSQDGKEKYWFPETCLP 1150

Query: 656  LFLIKEYEEKAEKV-PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASC 480
            LFLIKEYEE  + V      K L+  S+ Q++QLKASR+D+FSYL+ R DK++KC+CASC
Sbjct: 1151 LFLIKEYEESVDMVIAPSSKKPLNELSEFQKKQLKASRKDLFSYLVCRRDKIEKCACASC 1210

Query: 479  PKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDV--VITCNQCYWTSDVPLTENNYKPH 306
              DVLL +AVKC  C GYCH+ CT SS++ M   V  +I CN+CY    +  +E   +  
Sbjct: 1211 QIDVLLGNAVKCGTCQGYCHEGCT-SSSMHMNSGVEPMIVCNRCYLPRALATSEIRSESP 1269

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKRK 126
             +      Q+   AV   K     G++Q L S+   ES    +S    +DSS   + + +
Sbjct: 1270 TSPLPLHRQEYHTAVKVSKGTRPKGFNQALASIRTQES---SESKQTVSDSSTVTKTRNR 1326

Query: 125  ALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
             L                   S+GIIW+KKN E++GA+FR
Sbjct: 1327 TL-------------------SWGIIWRKKNIEDAGADFR 1347


>ref|NP_197668.2| PHD finger family protein [Arabidopsis thaliana]
            gi|332005688|gb|AED93071.1| PHD finger family protein
            [Arabidopsis thaliana]
          Length = 1566

 Score =  394 bits (1011), Expect = e-107
 Identities = 215/461 (46%), Positives = 289/461 (62%), Gaps = 4/461 (0%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            STC A+K LLLELE ++  IA S DW+KL+DDW IE S F+S   +VG   KR  GRR +
Sbjct: 792  STCKAMKALLLELEENICSIALSSDWLKLMDDWLIELSIFQSAPVTVGATQKRRPGRRKQ 851

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            +  A  +  A  S+DD  +  WWRGGKLSK++  KA L    +KKAA QGG++K P   Y
Sbjct: 852  RNQA--ENTAQGSDDD--SFTWWRGGKLSKIILLKAVLSKPKIKKAAWQGGTKKFPEFNY 907

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
             +GS IPKRSR+  W++AVE SK+  QLALQVRYLD  +RWS+LVRP+Q+  D KG ETE
Sbjct: 908  GDGSYIPKRSRRSIWKAAVESSKNISQLALQVRYLDMNIRWSELVRPEQNVQDVKGPETE 967

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            A+ FRNA+IC KK+ ++K+RY + FGNQKHLPSRV+KN+IEVE++ED  EK+WF E  +P
Sbjct: 968  ATIFRNASICVKKIIDNKVRYGVVFGNQKHLPSRVMKNVIEVEKSEDRNEKYWFHEARVP 1027

Query: 656  LFLIKEYEEKAEKV---PLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCA 486
            L+LIKEYEE   +V   P  + K     SKLQ+RQLKASR +IFSYL  R D  +KCSCA
Sbjct: 1028 LYLIKEYEESLHRVVHIPF-IKKPSRKISKLQKRQLKASRANIFSYLASRRDNTEKCSCA 1086

Query: 485  SCPKDVLLRDAVKCSECGGYCHKNCTISST-VDMKDDVVITCNQCYWTSDVPLTENNYKP 309
            SC  DV LRD++ CS C G+CHK CT+SS     + ++++TC +CY          N++ 
Sbjct: 1087 SCHLDVFLRDSITCSTCQGFCHKECTMSSQHTTGQLEILVTCKRCYLARARSQININHRQ 1146

Query: 308  HINQSIPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
                S+    + + A  ++        +QQL S    ++ S +K   P  D +   + K 
Sbjct: 1147 PTTPSVLINGQLQNAATSNTKTQIKRLNQQLPSSKTGDNASGVKQITP--DFNLAPKSKH 1204

Query: 128  KALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
            K L                   S+G+IW+KKN  ++G +FR
Sbjct: 1205 KTL-------------------SWGVIWRKKNLADTGVSFR 1226


>ref|XP_003540783.1| PREDICTED: uncharacterized protein LOC100808261 [Glycine max]
          Length = 1644

 Score =  390 bits (1003), Expect = e-106
 Identities = 221/488 (45%), Positives = 300/488 (61%), Gaps = 33/488 (6%)
 Frame = -1

Query: 1373 TCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKR-PSGRRNR 1197
            T  A+K LLL+LE ++R I F GDWVKL+DDW +E S  +S + ++G A KR PSGRR +
Sbjct: 803  TFSAIKPLLLKLEENIRTIVFCGDWVKLMDDWLVEFSMVQSASSTLGTAQKRAPSGRRYK 862

Query: 1196 KQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGICY 1017
            K+ A  +  A    ++F    WWRGGK +K +FQKA LP S+V+KAARQGGSRKI GI Y
Sbjct: 863  KRLANDEATADGCPENFV---WWRGGKFTKFIFQKAVLPKSMVRKAARQGGSRKISGIFY 919

Query: 1016 AEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAETE 837
            A+GS+IPKRSR+  WR AV+MS++A QLALQVRYLD Y+RWSDL+RP+Q+  D KG ETE
Sbjct: 920  ADGSEIPKRSRQLVWRVAVQMSRNASQLALQVRYLDFYLRWSDLIRPEQNIQDGKGQETE 979

Query: 836  ASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYIP 657
            ASAFRNA ICD K+ E K  Y + FG+QKHLPSRV+KN+++VEQ+ + KEK+WF ET IP
Sbjct: 980  ASAFRNANICDNKLVEGKSCYGIAFGSQKHLPSRVMKNVVQVEQDPEGKEKYWFFETRIP 1039

Query: 656  LFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCASCP 477
            L+LIKEYEE    +P        A   L RR+LKA  +DIF YL  + D +D  SC+ C 
Sbjct: 1040 LYLIKEYEEGNGNMPCNEEHLNTASELLHRRRLKAICKDIFFYLTCKRDNLDVVSCSVCQ 1099

Query: 476  KDVLLRDAVKCSECGGYCHKNCTISSTVDMKD-DVVITCNQCYW---------TSDVPLT 327
              VL+RDA KC+ C GYCH+ C+  STV   + + + TC QCY          T++ P +
Sbjct: 1100 MGVLIRDAHKCNACQGYCHEGCSTRSTVSANEVEYLTTCKQCYHARLLAQKENTNESPTS 1159

Query: 326  -------ENNYKPHINQSIP----QTQKSKMAVVAHKDVWQ-----------NGYHQQLD 213
                   ENN    +N S P    Q  KS      + +V Q             Y++Q  
Sbjct: 1160 PLLLQGRENNSGTFLNGSRPKSHDQVLKSSRTKANNPNVKQVTPVTALKGTKAKYYEQEP 1219

Query: 212  SVGKTESHSVMKSPAPSTDSSATMRRKRKALADPPGGSTPDTAVKRGKVTSYGIIWKKKN 33
            +  +T+ ++   +P  +++++ T ++ RK                     S+GIIW+KKN
Sbjct: 1220 TSTRTKDNNHFGTPQVASEATLTGKKPRKN-------------------CSWGIIWQKKN 1260

Query: 32   REESGANF 9
             E++  +F
Sbjct: 1261 NEDTDNDF 1268


>ref|XP_006349073.1| PREDICTED: uncharacterized protein LOC102589022 [Solanum tuberosum]
          Length = 1705

 Score =  389 bits (998), Expect = e-105
 Identities = 224/463 (48%), Positives = 290/463 (62%), Gaps = 6/463 (1%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMKRPSGRRNR 1197
            S C  +K LLLE E ++R++AFS DW KLVD    ESS   S     G   KR  GRR R
Sbjct: 889  SGCSLIKSLLLEFEENIRLVAFSMDWTKLVDSGPSESSVTHSAAGVAGSTQKRKPGRRGR 948

Query: 1196 K-QSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            K  +A+ +  A  S+D   +  WWRGG +SK +FQK TLP  +VKKAA QGG RKIPGI 
Sbjct: 949  KPMAAIVEATADESQDIPTDFTWWRGGLISKFIFQKGTLPRRMVKKAALQGGVRKIPGIY 1008

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            YAEGS+  KR+R+  WR+AV+M K+  QLALQVRYLD +VRWSDLVRP+QS  D KG ET
Sbjct: 1009 YAEGSETAKRNRQLVWRAAVDMCKTTSQLALQVRYLDMHVRWSDLVRPEQSIQDGKGPET 1068

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EASAFRNA ICDK+V E++IRY + FGNQKHLPSRV+K+++EVEQ +D KEK+WFSE  I
Sbjct: 1069 EASAFRNAYICDKRVVENEIRYGVAFGNQKHLPSRVMKSVVEVEQTQDGKEKYWFSELRI 1128

Query: 659  PLFLIKEYEEKAEKVPLQLNKALHAFSKLQRRQLK---ASRRDIFSYLMYRGDKVDKCSC 489
            PL+LIKEYEEK  K     NK   AF  +Q++ L+   A  +DIFSYL+ + D  DK  C
Sbjct: 1129 PLYLIKEYEEKMGKDLPSANKPTSAF--MQKKPLRAPWAPCKDIFSYLVQKRDGNDKYCC 1186

Query: 488  ASCPKDVLLRDAVKCSECGGYCHKNCTISSTVDMKDDVVITCNQCYWTSDVPLTENNYKP 309
            ASC  DVL R+AVKC+ C G CH+ CT+SSTVD  +    TC QC           N   
Sbjct: 1187 ASCQTDVLFRNAVKCNTCQGLCHERCTVSSTVDATN----TCKQC-----------NQNR 1231

Query: 308  HINQS--IPQTQKSKMAVVAHKDVWQNGYHQQLDSVGKTESHSVMKSPAPSTDSSATMRR 135
             ++Q+  I ++ KS + +       Q  Y  +  S  +  + S    P      SA++  
Sbjct: 1232 ALSQAKCIDESPKSPLLL-------QGKYFPKPISANEGVNVSNFNRP------SASIAT 1278

Query: 134  KRKALADPPGGSTPDTAVKRGKVTSYGIIWKKKNREESGANFR 6
             + + A   G S+  TA  +    + G+IWKKK+ E++G +FR
Sbjct: 1279 LKHSSAMKHGNSSNSTAKTKRNSRNLGVIWKKKS-EDTGTDFR 1320


>ref|XP_006289692.1| hypothetical protein CARUB_v10003253mg [Capsella rubella]
            gi|482558398|gb|EOA22590.1| hypothetical protein
            CARUB_v10003253mg [Capsella rubella]
          Length = 1591

 Score =  388 bits (996), Expect = e-105
 Identities = 215/464 (46%), Positives = 292/464 (62%), Gaps = 7/464 (1%)
 Frame = -1

Query: 1376 STCIALKFLLLELEAHVRVIAFSGDWVKLVDDWSIESSAFKSNTYSVGPAMK-RPSGRRN 1200
            STC A+K LLLELE ++  IA S DW+KLVD+W IE S F+S   +V    K RP  RR 
Sbjct: 802  STCKAIKALLLELEENICSIALSSDWLKLVDEWLIEHSIFQSAPVTVAGTQKHRPGKRRQ 861

Query: 1199 RKQSALSDIAAVPSEDDFRNVNWWRGGKLSKLVFQKATLPHSIVKKAARQGGSRKIPGIC 1020
            R Q+   +I A  S+DD  +  WWRGGK+SK++  KA L     +KAA QGG +K P   
Sbjct: 862  RNQA---EITAQGSDDD--SFTWWRGGKISKVILLKAVLLKPKRRKAASQGGVKKFPEFS 916

Query: 1019 YAEGSDIPKRSRKFTWRSAVEMSKSAPQLALQVRYLDSYVRWSDLVRPDQSFYDAKGAET 840
            Y++GS IPKRSR+  W++AVE SK+  QLALQVRYLD  +RWS+LVRP+Q+  D KG ET
Sbjct: 917  YSDGSYIPKRSRRSMWKAAVESSKNISQLALQVRYLDLNIRWSELVRPEQNVQDVKGPET 976

Query: 839  EASAFRNATICDKKVQEDKIRYCLDFGNQKHLPSRVLKNIIEVEQNEDEKEKFWFSETYI 660
            EA+ FRNA+ICDKK+ ++K+RY + FGNQKHLPSRV+KNI EVE+ ED KEK+WF E  +
Sbjct: 977  EATVFRNASICDKKIIDNKVRYGVVFGNQKHLPSRVMKNITEVEKTEDGKEKYWFLEARV 1036

Query: 659  PLFLIKEYEEKAEKVPLQ-LNKALHAFSKLQRRQLKASRRDIFSYLMYRGDKVDKCSCAS 483
            PL+LIKEYEE   +V +  + K     SKLQ+ QLKASR +IFSYL  R D  +KCSCAS
Sbjct: 1037 PLYLIKEYEESLHRVHIPFIKKPSRKISKLQKMQLKASRANIFSYLASRRDSTEKCSCAS 1096

Query: 482  CPKDVLLRDAVKCSECGGYCHKNCTISST-VDMKDDVVITCNQCYWTSDVPLTENNYKPH 306
            C  +V +RDA  CS C GYCHK CT+S+     + ++++TC +CY      L   N++  
Sbjct: 1097 CHLNVFVRDATTCSTCQGYCHKECTMSTPHTSGRVELLVTCKRCYLARARSLININHRHP 1156

Query: 305  INQSIPQTQKSKMAVVAHKDVWQNGYHQQ-LDSVGKTESHSVMKSPAPSTDSSATMRRKR 129
               ++   +  + AV           +Q  +  V KT+   + +  A S   + T   K+
Sbjct: 1157 TTAAVLVNRPHQNAVTQVTKTQGKSLNQHAVTPVTKTQIKPLNQQLASSDIRNTTSGVKQ 1216

Query: 128  KALADPPGGSTPDTAV---KRGKVTSYGIIWKKKNREESGANFR 6
                      TPD+ +    + K  S+G+IW+KK+ E++G +FR
Sbjct: 1217 ---------ITPDSNLAPKSKHKTLSWGVIWRKKSLEDTGVSFR 1251

