BLASTX nr result

ID: Rehmannia24_contig00009503 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00009503
         (2449 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...   568   e-159
gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...   525   e-146
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...   523   e-145
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]             523   e-145
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...   513   e-142
ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [...   502   e-139
gb|AAD48963.1|AF147263_5 contains similarity to transposases [Ar...   496   e-137
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...   478   e-132
gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana]            473   e-130
ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Caps...   472   e-130
gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p...   444   e-121
pir||H85073 probable transposon protein [imported] - Arabidopsis...   427   e-116
ref|NP_001060325.2| Os07g0624100 [Oryza sativa Japonica Group] g...   404   e-109
ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, part...   394   e-106
gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]        394   e-106
ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [S...   386   e-104
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...   380   e-102
gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [...   379   e-102
emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera]   374   e-101
gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus pe...   373   e-100

>ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella]
            gi|482560944|gb|EOA25135.1| hypothetical protein
            CARUB_v10018444mg, partial [Capsella rubella]
          Length = 547

 Score =  568 bits (1465), Expect = e-159
 Identities = 287/542 (52%), Positives = 376/542 (69%), Gaps = 4/542 (0%)
 Frame = -3

Query: 1955 KIRSKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXX 1776
            ++ ++KI+  + REL+   II HDLPFSFVEY  +R  +KY+NP    ISRNT V+D+  
Sbjct: 2    RLAARKIDHSVVRELITLVIICHDLPFSFVEYPRVRELLKYLNPEYKTISRNTAVADVLK 61

Query: 1775 XXXXXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMP 1596
                        LA + NRICLT DVW + + EGYICLT H+VD++WKL SKIL F AMP
Sbjct: 62   FHGIRKEQMKQELAGVGNRICLTCDVWRSISIEGYICLTAHYVDDSWKLKSKILSFCAMP 121

Query: 1595 PPHSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFF 1416
            PPHSG ELA K+ + L++WGI++KIFSLTLDNASSND MQ IL++QLS +  L C+GEFF
Sbjct: 122  PPHSGFELAKKVLSCLEDWGIEKKIFSLTLDNASSNDNMQSILRDQLSSRHGLLCDGEFF 181

Query: 1415 HIRCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGL 1236
            HIRCSAH+LNLIVQ GLK +   LHKIRE+VK++K SEGR   F+ECV  VG I    GL
Sbjct: 182  HIRCSAHVLNLIVQVGLKFVESPLHKIRETVKWIKWSEGRKDLFKECVIDVG-IKYTAGL 240

Query: 1235 RLDVSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDT 1056
            ++DVSTRWNSTYLML S IKY++AFS L+  +RNYKFCPS +EW +AEKI  FLEPFYD 
Sbjct: 241  KMDVSTRWNSTYLMLGSVIKYRRAFSLLERAERNYKFCPSDEEWNKAEKIYTFLEPFYDI 300

Query: 1055 TNLISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLA 876
            T L SG+SYPT+NLYF Q+WKIE  L    ++ D+ + +M   M+ KFDKYW +YS +L+
Sbjct: 301  TKLFSGTSYPTANLYFAQIWKIECLLNSYSNDGDMELQNMANEMRTKFDKYWEEYSIILS 360

Query: 875  FGAILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXX 696
             GAILDPR+K  +L+Y + K+  DP   +  + +VK KL++LF+ Y +            
Sbjct: 361  IGAILDPRMKVEILTYCFDKL--DPSTTKAKVEVVKQKLNLLFDQYKS------------ 406

Query: 695  XXSTIHCSTQSGEGDKSKG----KRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYY 528
                    T +     S+G     +   +FKAY+ +T+   GKS+L +YLE+ +LE ++Y
Sbjct: 407  ------TPTSTNVSSSSRGTDFIAKTHSDFKAYEKRTILEEGKSKLAVYLEDDRLEMTFY 460

Query: 527  EDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQ 348
            ED+DVL++WKN   R+  LA +A DVL+IPIT+VA+ES+FSIGA VL KYRS  LP  V+
Sbjct: 461  EDMDVLEWWKNQTQRYGELARMACDVLSIPITSVAAESSFSIGAHVLNKYRSRLLPRHVE 520

Query: 347  TL 342
             L
Sbjct: 521  AL 522


>gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica]
          Length = 696

 Score =  525 bits (1351), Expect = e-146
 Identities = 287/658 (43%), Positives = 399/658 (60%), Gaps = 19/658 (2%)
 Frame = -3

Query: 2231 NQSINLDEGDTLENTKSNQGLGKPKEFSDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXX 2052
            N  ++ D  +      +  G  + K  S VW  F    + ++  QRA             
Sbjct: 22   NNVVDSDPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDS 81

Query: 2051 XXXGTSTLRRHIPTCKMLSFHDVGQMIVDH-EGKI--RSKKINPKISRELLAAAIIKHDL 1881
                T  L+RHI +C      D+GQ+++   +G I  RS K +P   RELL  AII HDL
Sbjct: 82   RYG-TGNLKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDL 140

Query: 1880 PFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSD 1701
            PF FVEY GIR    Y+   +  +SRNT  +D+              L ++  R+CLTSD
Sbjct: 141  PFQFVEYAGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSD 200

Query: 1700 VWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKI 1521
            +WT+ T++GY+CLT HF+D NWKL  +IL F  MPPPH+GV L  KI+  L +WG+++K+
Sbjct: 201  LWTSITTDGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKL 260

Query: 1520 FSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALH 1341
            FS+TLDNASSND   E+LK QL+++D+L  NG+FFHIRC AHILNLIVQ+GLK I+ ++ 
Sbjct: 261  FSMTLDNASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVG 320

Query: 1340 KIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAF 1161
            KIRES+KYV+GS+GR +KF  C + V +++   GLR DV TRWNST+LM+DSA+ Y++AF
Sbjct: 321  KIRESIKYVRGSQGRKQKFLNCDARV-SLECKRGLRQDVPTRWNSTFLMIDSALYYQRAF 379

Query: 1160 SSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVK 981
              LQL+D NYK   S DEW + EK+ +FL+ FYD T L SG+ YPT+NLYF QV+ +E  
Sbjct: 380  LHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDT 439

Query: 980  LKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDP 801
            L++   + D F+  M  +M EKFDKYW +YS +LA   ILDPR K   + + Y ++    
Sbjct: 440  LRKAKVDSDSFMKSMATQMMEKFDKYWKEYSLILAIAVILDPRYKIQFVEFCYKRLYG-- 497

Query: 800  VKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEG----------- 654
                E M+ V+  L  LF+LY                  I+ S++S  G           
Sbjct: 498  -YNSEEMTKVRDMLFSLFDLY----------------FRIYSSSESVSGTSSASNGARSH 540

Query: 653  -DKSKGKRMFDEFKAYDS----QTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQ 489
             D    K   D  K +D+    +  T+A K+QL LYL+EPK++      L+VL +WK +Q
Sbjct: 541  VDDMVSKECLDVMKEFDNFESEEFTTSAQKTQLQLYLDEPKIDRK--TKLNVLDFWKVNQ 598

Query: 488  HRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHG 315
             R+P L+++ARD+L+IPI+TVASESAFS+G RVL +YRS   PE V+ L+C R+W+ G
Sbjct: 599  FRYPELSILARDLLSIPISTVASESAFSVGGRVLDQYRSALKPENVEALVCTRDWIFG 656


>gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana]
          Length = 577

 Score =  523 bits (1348), Expect = e-145
 Identities = 274/545 (50%), Positives = 361/545 (66%)
 Frame = -3

Query: 1946 SKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXX 1767
            S+K++  + RE++A A+++H+LP+SFVEY+ IR    Y NPS+   SRNT   D+     
Sbjct: 19   SRKVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPSIEFWSRNTAAFDVYKIYE 78

Query: 1766 XXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPH 1587
                     LA I  RICLT+D+W A T E YICLT H+VD +  L +KIL F A PPPH
Sbjct: 79   REKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFCAFPPPH 138

Query: 1586 SGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIR 1407
            SGV +A K+   LK+WGI++K+F+LT+DNAS+ND MQ ILK +L  Q  L C+GEFFH+R
Sbjct: 139  SGVAIAMKLSELLKDWGIEKKVFTLTVDNASANDTMQSILKRKL--QKDLVCSGEFFHVR 196

Query: 1406 CSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLD 1227
            CSAHILNLIVQ+GL+ I+ AL KIRE+VKYVKGSE R   F+ C+ T+G I T   L LD
Sbjct: 197  CSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIG-IQTEANLVLD 255

Query: 1226 VSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNL 1047
            VSTRWNSTY ML  AI++K    SL   DR YK  PS  EW+RAE IC+ L+PF + T L
Sbjct: 256  VSTRWNSTYHMLSRAIQFKDVLRSLAEVDRGYKSFPSAVEWERAELICDLLKPFAEITKL 315

Query: 1046 ISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGA 867
            ISGSSYPT+N+YFMQVW I+  L ++  + D  I +M + M EK+DKYW  +S +LA  A
Sbjct: 316  ISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRVIREMVEDMTEKYDKYWEDFSDILAMAA 375

Query: 866  ILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXS 687
            +LDPR+KFS L Y Y+ +  +P+  +E ++ V+ K+  LF  Y                +
Sbjct: 376  VLDPRLKFSALEYCYNIL--NPLTSKENLTHVRDKMVQLFGAYKR--------------T 419

Query: 686  TIHCSTQSGEGDKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQ 507
            T + +  + +  +      +D F +Y SQ     GKS LD+YLEEP L+   + D+DV+ 
Sbjct: 420  TCNVAASTSQSSRKDIPFGYDGFYSYFSQR-NGTGKSPLDMYLEEPVLDMVSFRDMDVIA 478

Query: 506  YWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARN 327
            YWKN+  RF  L+ +A D+L+IPITTVASESAFSIG+RVL KYRSC LP  VQ L+C RN
Sbjct: 479  YWKNNVSRFKELSSMACDILSIPITTVASESAFSIGSRVLNKYRSCLLPTNVQALLCTRN 538

Query: 326  WLHGY 312
            W  G+
Sbjct: 539  WFRGF 543


>gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]
          Length = 745

 Score =  523 bits (1347), Expect = e-145
 Identities = 281/590 (47%), Positives = 379/590 (64%), Gaps = 1/590 (0%)
 Frame = -3

Query: 2039 TSTLRRHIPTCKMLSFHDVGQMIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVEY 1860
            T+T+ RH+ +C+                +I S+K++  + RE++A A+++H+LP+SFVEY
Sbjct: 181  TNTMNRHMRSCEKTP---------GSTPRI-SRKVDMMVFREMIAVALVQHNLPYSFVEY 230

Query: 1859 DGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTS 1680
            + IR    Y+NPS+   SRNT  SD+              LA I  RICLT+D+W A T 
Sbjct: 231  ERIREAFTYVNPSIEFWSRNTAASDVYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTV 290

Query: 1679 EGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDN 1500
            E YICLT H+VD +  L +KIL F A PPPHSGV +A K+   LK+WGI++K+F+LT+DN
Sbjct: 291  ESYICLTAHYVDVDGVLKTKILSFCAFPPPHSGVAIAMKLSELLKDWGIEKKVFTLTVDN 350

Query: 1499 ASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVK 1320
            AS+ND MQ ILK +L  Q  L C+GEFFH+RCSAHILNLIVQ+GL+ I+ AL KIRE+VK
Sbjct: 351  ASANDTMQSILKRKL--QKHLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVK 408

Query: 1319 YVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLND 1140
            YVKGSE R   F+ C+ T+G I T   L LDVSTRWNSTY ML  AI++K    SL   D
Sbjct: 409  YVKGSETRENLFQNCMDTIG-IQTEASLVLDVSTRWNSTYHMLSRAIQFKDVLHSLAEVD 467

Query: 1139 RNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSN 960
            R YK  PS  EW+RAE IC+ L+PF + T LISGSSYPT+N+YFMQVW I+  L ++  +
Sbjct: 468  RGYKSFPSAVEWERAELICDLLKPFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDS 527

Query: 959  EDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETM 780
             D  I +M + M EK+DKYW  +S +LA  A+LDPR+KFS L Y Y+ +  +P+  +E +
Sbjct: 528  HDRAIREMVEDMTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNIL--NPLTSKENL 585

Query: 779  SIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQ 600
            + V+ K+  LF  Y                +T + +  + +  +      +D F +Y SQ
Sbjct: 586  THVRDKMVQLFGAYKR--------------TTCNVAASTSQSSRKDIPFGYDGFYSYFSQ 631

Query: 599  TVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVAS 420
                 GKS LD+YLEEP L+   + D+DV+ YWKN+  RF  L+ +A D+L+I ITTVAS
Sbjct: 632  R-NGTGKSPLDMYLEEPVLDMVSFRDMDVIAYWKNNVSRFKELSSMACDILSISITTVAS 690

Query: 419  ESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGYA-IDNEESASTKST 273
            ES FSIG+RVL KYRSC LP  VQ L+C RNW  G+  ++ +E    + T
Sbjct: 691  ESTFSIGSRVLNKYRSCLLPTNVQALLCTRNWFRGFQDVETDEIQGQEDT 740


>gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica]
          Length = 697

 Score =  513 bits (1322), Expect = e-142
 Identities = 282/658 (42%), Positives = 395/658 (60%), Gaps = 19/658 (2%)
 Frame = -3

Query: 2231 NQSINLDEGDTLENTKSNQGLGKPKEFSDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXX 2052
            N  ++ D  +      +  G  + K  S VW  F    + ++  QRA             
Sbjct: 23   NNVVDSDPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDS 82

Query: 2051 XXXGTSTLRRHIPTCKMLSFHDVGQMIVDH-EGKI--RSKKINPKISRELLAAAIIKHDL 1881
                T  L+RHI +C      D+GQ+++   +G I  RS K +P   RELL  AII HDL
Sbjct: 83   RYG-TRNLKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIITHDL 141

Query: 1880 PFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSD 1701
            PF FVEY GIR    Y+   +  +SRNT  +D+              L ++  R+CL SD
Sbjct: 142  PFQFVEYSGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILDSVPGRVCLASD 201

Query: 1700 VWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKI 1521
            +WT+ T++GY+CLT HF+D NWKL  +IL F  MPPPH+GV L  KI+  L +WG+++K+
Sbjct: 202  LWTSITTDGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVTLCEKIYKLLTDWGVEKKL 261

Query: 1520 FSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALH 1341
            FS+TLDNASSND   E+LK Q +++D+L  NG+FF+IRC AHILNLIVQ+GLK I+ ++ 
Sbjct: 262  FSMTLDNASSNDTFVELLKGQPNLKDALLMNGKFFYIRCCAHILNLIVQDGLKHIDDSVG 321

Query: 1340 KIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAF 1161
            KIRES+KYV+GS+GR +KF  C + V +++   GLR DV TRWNST+LM+DSA+ Y++AF
Sbjct: 322  KIRESIKYVRGSQGRKQKFLNCAAQV-SLECKRGLRQDVPTRWNSTFLMIDSALYYQRAF 380

Query: 1160 SSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVK 981
              LQL+D NYK   S DEW + EK+ +FL+ FYD T L SG+ YPT+NLYF QV+ +E  
Sbjct: 381  LHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDT 440

Query: 980  LKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDP 801
            L++   + D F+  M  +M E FDKYW +YS + A   ILDPR K   + + Y ++    
Sbjct: 441  LRKAKVDSDSFMKSMATQMMEMFDKYWKEYSLIPAIAVILDPRYKIQFVEFCYKRLYG-- 498

Query: 800  VKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEG----------- 654
                E M+ V+  L  LF+LY                  I+ S++S  G           
Sbjct: 499  -YNSEEMTKVRDMLFSLFDLY----------------FQIYSSSESVSGTSSASNGARSH 541

Query: 653  -DKSKGKRMFDEFKAYDS----QTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQ 489
             D    K   D  K +D+    +  T+A K+QL LYL+EPK++      L+VL +WK +Q
Sbjct: 542  VDDMVSKECLDVMKEFDNFESEEFTTSAQKTQLQLYLDEPKIDRK--TKLNVLDFWKVNQ 599

Query: 488  HRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHG 315
             R+P L+++ARD+L+IPI+TVASESAFS+G RVL +YRS   PE V+ L+C R+W+ G
Sbjct: 600  FRYPELSILARDLLSIPISTVASESAFSVGGRVLDQYRSALKPENVEALVCTRDWIFG 657


>ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula]
            gi|355504225|gb|AES85428.1| hypothetical protein
            MTR_126s0001, partial [Medicago truncatula]
          Length = 555

 Score =  502 bits (1292), Expect = e-139
 Identities = 243/535 (45%), Positives = 352/535 (65%)
 Frame = -3

Query: 1916 ELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXL 1737
            E+ A+ I+ HDLPF F E +G+R + +++NP++P   RN + + +              L
Sbjct: 21   EICASTILAHDLPFHFFELEGMRKYSEFLNPNIPIPPRNVIEAYVSHLYTKEKPKLKQQL 80

Query: 1736 ANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIF 1557
              I NRI L+ D+W + T+E YICLT HFVD NWKLNSK++ F  + PP SG E+  ++ 
Sbjct: 81   TTIPNRISLSFDLWESNTTETYICLTAHFVDANWKLNSKVINFRLVYPPTSG-EICERMV 139

Query: 1556 AFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIV 1377
              L +WGI++KIFSLT+D++S N+ +QE LK QL +Q+ L C+GEFFH+ C A +LN IV
Sbjct: 140  ELLNDWGIEKKIFSLTIDDSSENEILQEQLKTQLVLQNGLLCDGEFFHVNCFARVLNQIV 199

Query: 1376 QEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYL 1197
            +E LK ++  +HKIRES+ +V+ S+ R  KF+EC   VG +D+++ L LD+S   +STY+
Sbjct: 200  EEALKLVSCGVHKIRESIMFVRHSKSRREKFKECFEKVGGVDSSVHLHLDISMSLSSTYM 259

Query: 1196 MLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSN 1017
            +L+ A+KY+ AF S  L D +Y  CPS +EWKR EKIC FL PF +T N+I+ +++PTSN
Sbjct: 260  LLERALKYRCAFESFHLYDDSYDLCPSAEEWKRVEKICAFLLPFCETANMINSTTHPTSN 319

Query: 1016 LYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSM 837
            LYF+QVWK++  L ++L +ED  I  M +RM  KF+KYW +YS VLA GA+LDPR+KF+ 
Sbjct: 320  LYFLQVWKVQCVLVDSLGDEDEDIKKMAERMMSKFEKYWDEYSVVLALGAVLDPRMKFTT 379

Query: 836  LSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGE 657
            L+Y YSK+  D   C+  +  VK KL MLFE ++ +                  S    +
Sbjct: 380  LAYCYSKL--DASTCERKLQQVKRKLCMLFEKHSGNSTTAGVQRTIKENQDQSSSMPLQK 437

Query: 656  GDKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFP 477
              KS    +FDE K +  Q VT  GKSQLD+YL+E  L+F  Y ++DVLQ+WK++  RFP
Sbjct: 438  KLKSLSHGLFDELKVHHQQLVTKTGKSQLDVYLDESVLDFRCYAEMDVLQWWKSNNDRFP 497

Query: 476  TLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGY 312
             L+++A D+L++PI  VAS+S F +G+RV  KY+   LP  V+  IC R+WL+ +
Sbjct: 498  DLSILACDLLSVPIAAVASDSEFCMGSRVFNKYKDRMLPMNVEARICTRSWLYNF 552


>gb|AAD48963.1|AF147263_5 contains similarity to transposases [Arabidopsis thaliana]
            gi|7267311|emb|CAB81093.1| AT4g05510 [Arabidopsis
            thaliana]
          Length = 604

 Score =  496 bits (1278), Expect = e-137
 Identities = 287/642 (44%), Positives = 372/642 (57%), Gaps = 5/642 (0%)
 Frame = -3

Query: 2207 GDTLENTKSNQGLGKPKEF-----SDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXXXXX 2043
            G T  +T  ++ L     F     SD+W+YF  +    DG  +                 
Sbjct: 14   GQTSADTSQSKSLVSASRFKRSRTSDMWDYFTLEDEN-DG--KIAYCKKCLKPYPILPTT 70

Query: 2042 GTSTLRRHIPTCKMLSFHDVGQMIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVE 1863
            GTS L RH   C M    DVG+         ++ KI+ K+ RE  +  II+HDLPF  VE
Sbjct: 71   GTSNLIRHHRKCSMGL--DVGR---------KTTKIDHKVVREKFSRVIIRHDLPFLCVE 119

Query: 1862 YDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACT 1683
            Y+ +R ++ Y+NP   C +RNT  +D+              L  I +RICLTSD WT+  
Sbjct: 120  YEELRDFISYMNPDYKCYTRNTAAADVVKTWEKEKQILKSELERIPSRICLTSDCWTSLG 179

Query: 1682 SEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLD 1503
             +GYI LT H+VD  W LNSKIL F  M PPH+G  LA+KI   LKEWGI++K+F+LTLD
Sbjct: 180  GDGYIVLTAHYVDTRWILNSKILSFSDMLPPHTGDALASKIHECLKEWGIEKKVFTLTLD 239

Query: 1502 NASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESV 1323
            NA++N+ MQE+L ++L + ++L C GEFFH+RC AH+LN IVQ GL  I+ AL KIRE+V
Sbjct: 240  NATANNSMQEVLIDRLKLDNNLMCKGEFFHVRCCAHVLNRIVQNGLDVISDALSKIRETV 299

Query: 1322 KYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLN 1143
            KYVKGS  R     ECV   G     + L LDV TRWNSTYLML  A+KY++A +  ++ 
Sbjct: 300  KYVKGSTSRRLALAECVEGKG----EVLLSLDVQTRWNSTYLMLHKALKYQRALNRFKIV 355

Query: 1142 DRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLS 963
            D+NYK CPS +EWKRA+ I E L PFY  TNL+SG SY TSNLYF  VWKI+  L+    
Sbjct: 356  DKNYKNCPSSEEWKRAKTIHEILMPFYKITNLMSGRSYSTSNLYFGHVWKIQCLLE---- 411

Query: 962  NEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQET 783
                        M+ KFDKYW +YS +LA  A+LDPR+KF +L   Y   E DP   QE 
Sbjct: 412  ------------MRLKFDKYWKEYSVILAMRAVLDPRMKFKLLKRCYD--ELDPTTSQEK 457

Query: 782  MSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDS 603
            +  ++ K+  LF                            GE  K+      D F   D 
Sbjct: 458  IDFLETKITELF----------------------------GEYRKAFPVTPVDLFDLDDV 489

Query: 602  QTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVA 423
              V   GKS LD+YLE+PKLE   + +L+VLQYWK ++ RF  LA +A DVL+IPIT+VA
Sbjct: 490  PEV-EEGKSALDMYLEDPKLEMKNHPNLNVLQYWKENRLRFGALAYMAMDVLSIPITSVA 548

Query: 422  SESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGYAIDNE 297
            SES+FSIG+ VL KYRS  LP  VQ L+C R+WL+G+  D E
Sbjct: 549  SESSFSIGSHVLNKYRSRLLPTNVQALLCTRSWLYGFVSDEE 590


>gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana]
          Length = 659

 Score =  478 bits (1231), Expect = e-132
 Identities = 264/644 (40%), Positives = 375/644 (58%), Gaps = 11/644 (1%)
 Frame = -3

Query: 2165 KPKEFSDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXXXXXGTSTLRRHIPTCKMLSFHD 1986
            + K+ +  W+ F   G+ +DG +RA                 TST+ RH+  C       
Sbjct: 29   RKKQRALCWDEFTSVGIEEDGKERARCHHCGIKLVVEKSYG-TSTMNRHLTLCP------ 81

Query: 1985 VGQMIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCIS 1806
                  +        K + K+ RE+ +  II HD+PF +VEY+ +R   K++NP    I 
Sbjct: 82   ------ERPQPETRPKYDHKVDREMTSEIIIYHDMPFRYVEYEKVRARDKFLNPDCKPIC 135

Query: 1805 RNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTA-CTSEGYICLTGHFVDENWKL 1629
            R T   D+               A    ++CLT+D+W++  T  GYIC+T H++DE+W+L
Sbjct: 136  RQTAALDVFKRFEIEKAKLIDVFAKHNGQVCLTADLWSSRSTVTGYICVTSHYIDESWRL 195

Query: 1628 NSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSI 1449
            N+KIL F  + PPH+G E+A K++  LKEWG+++KI ++TLDNAS+N  MQ ILK +L  
Sbjct: 196  NNKILAFCDLKPPHNGEEIAKKVYDCLKEWGLEKKILTITLDNASANTSMQTILKHRLQS 255

Query: 1448 QDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVS 1269
             + L C G F H+RC AHILNLIVQ GL+  +  L  I ESVK+VK SE R   F  C+ 
Sbjct: 256  GNGLLCGGNFLHVRCCAHILNLIVQAGLELASGLLENITESVKFVKASESRKDSFATCLE 315

Query: 1268 TVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEK 1089
             VG I +  GL LDVSTRWNSTY ML  A+K++KAF+ L L +R Y   P+ +E  R EK
Sbjct: 316  CVG-IKSGAGLSLDVSTRWNSTYEMLARALKFRKAFAILNLYERGYCSLPTEEECDRGEK 374

Query: 1088 ICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFD 909
            IC+ L+PF   T   SG  YPT+N+YF+QVWKIE+ L +  + +DV + +M K+M++KF 
Sbjct: 375  ICDLLKPFNTITTYFSGVKYPTANIYFIQVWKIELLLMKYANCDDVDVREMAKKMQKKFA 434

Query: 908  KYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYAND 729
            KYW++YS +LA GA LDPR+K  +L   Y+KV  DPV  +  + IV+  L +L+E Y   
Sbjct: 435  KYWNEYSVILAMGAALDPRLKLQILRSAYNKV--DPVTAEGKVDIVRNNLILLYEEYKTK 492

Query: 728  IKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLE-E 552
                          T H        +      +F+   +  S   + + KS L++YL+ E
Sbjct: 493  ---SASSSNSSTTLTPHELLNESPLEADVNDDLFELESSLIS--ASKSTKSTLEIYLDDE 547

Query: 551  PKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRS 372
            P+LE   + D+++L +WK +QHR+  LA +A D+L+IPITTVASESAFS+G RVL  +R+
Sbjct: 548  PRLEMKTFSDMEILSFWKENQHRYGDLASMASDLLSIPITTVASESAFSVGGRVLNPFRN 607

Query: 371  CTLPEKVQTLICARNWLHGYA---------IDNEESASTKSTFS 267
              LP+ VQ LIC RNWL GYA            E++ +TK T S
Sbjct: 608  RLLPQNVQALICTRNWLLGYADLEGDIEELFAEEDNDATKMTSS 651


>gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana]
          Length = 633

 Score =  473 bits (1218), Expect = e-130
 Identities = 264/576 (45%), Positives = 349/576 (60%)
 Frame = -3

Query: 2039 TSTLRRHIPTCKMLSFHDVGQMIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVEY 1860
            T+T+ RH+ +C+                +I S+K++  + RE++A A+++H+LP+SFVEY
Sbjct: 91   TNTMNRHMRSCEKTP---------GSTPRI-SRKVDMMVFREMIAVALVQHNLPYSFVEY 140

Query: 1859 DGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTS 1680
            + IR    Y NPS+   SRNT  SD+              LA I  RICLT+D+W A T 
Sbjct: 141  ERIREAFTYANPSIEFWSRNTAASDVYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTV 200

Query: 1679 EGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDN 1500
            E YICLT H+VD +  L +KIL F A PPPHSGV +A K+   LK+WGI++KIF+LT+DN
Sbjct: 201  ESYICLTAHYVDVDGVLKTKILSFSAFPPPHSGVAIAMKLSELLKDWGIEKKIFTLTVDN 260

Query: 1499 ASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVK 1320
            AS+ND MQ ILK +L  Q  L C+GEFFH+RCSAHILNLIVQ+GL+ I+ AL KIRE+VK
Sbjct: 261  ASANDTMQSILKRKL--QKDLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVK 318

Query: 1319 YVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLND 1140
            YVKGSE R   F+ C+ T+G I T   L LDVSTRWNSTY ML  AI++K    SL   D
Sbjct: 319  YVKGSETRENLFQNCMDTIG-IQTEASLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVD 377

Query: 1139 RNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSN 960
            R YK  PS  EW+RAE IC+ L+PF + T LIS                           
Sbjct: 378  RVYKSFPSAVEWERAELICDLLKPFAEITKLISD-------------------------- 411

Query: 959  EDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETM 780
                       M EK+DKYW  +S +LA  A+LDPR+KFS L Y Y+ +  +P+  +E +
Sbjct: 412  -----------MTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNIL--NPLTSKENL 458

Query: 779  SIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQ 600
            + V+ K+  LF  Y                +T + +  + +  +      +D F +Y SQ
Sbjct: 459  THVRDKMVQLFGAYKR--------------TTCNVAASTSQSSRKDIPFGYDGFYSYFSQ 504

Query: 599  TVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVAS 420
                 GKS LD+YLEEP L+   ++D+DV+ YWKN+  RF  L+ +A D+L+IPITTVAS
Sbjct: 505  R-NGTGKSPLDMYLEEPVLDMVSFKDMDVIAYWKNNVSRFKELSSMACDILSIPITTVAS 563

Query: 419  ESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGY 312
            ESAFSIG+RVL KYRSC LP  VQ L+C RNW  G+
Sbjct: 564  ESAFSIGSRVLNKYRSCLLPTNVQALLCTRNWFRGF 599


>ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Capsella rubella]
            gi|482549037|gb|EOA13231.1| hypothetical protein
            CARUB_v10026257mg [Capsella rubella]
          Length = 508

 Score =  472 bits (1215), Expect = e-130
 Identities = 261/553 (47%), Positives = 340/553 (61%), Gaps = 3/553 (0%)
 Frame = -3

Query: 1976 MIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNT 1797
            M++D + K+R+KKI+ KI RE  +  +I+HDLPFS VEY+ +R ++KY+NP     +RNT
Sbjct: 1    MMLDADMKLRAKKIDQKIVREKFSRVLIRHDLPFSAVEYEELRDFLKYMNPDYISYTRNT 60

Query: 1796 LVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKI 1617
              SD+              L NI +RICLTSD WTA + EGYI L  H+VDE   LN+KI
Sbjct: 61   AASDVIKTWKTEKEKLKLELENIPSRICLTSDCWTAVSGEGYISLMAHYVDEKGLLNNKI 120

Query: 1616 LCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSL 1437
            L F  + PPH+G  LA KI   L++WGI++K+F+LTLDNA++ND MQ+ILKE+L++  +L
Sbjct: 121  LSFCDILPPHTGEALATKIHECLRDWGIEKKVFTLTLDNATANDTMQDILKERLNLDHNL 180

Query: 1436 FCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGN 1257
             C GEFFH+RC AHILNLIVQ+GLK I  AL KIR+SVKYVK ++ R   FE C      
Sbjct: 181  LCEGEFFHVRCCAHILNLIVQDGLKVIGGALSKIRDSVKYVKATKARGIAFETC------ 234

Query: 1256 IDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEKICEF 1077
                                          AF  L++ D++YK CPS D+W +A+ I E 
Sbjct: 235  ------------------------------AFKRLKVVDKSYKHCPSNDDWCKAKNILEI 264

Query: 1076 LEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWS 897
            L+PFY  T L+ G SY TSNLYF+ VWKIE  LKEN  + D  I DM  RM+ KF KYW 
Sbjct: 265  LKPFYKITVLMLGRSYSTSNLYFVNVWKIECLLKENERHSDKDIRDMAGRMRIKFKKYWD 324

Query: 896  QYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXX 717
            QYS  LA GA+LDPR+KF +L   Y   E DP  C+E +  ++ KL +LF+ Y       
Sbjct: 325  QYSVSLAMGAVLDPRMKFKLLKRCYE--ELDPSTCKEKLDHIEEKLRLLFDDY------- 375

Query: 716  XXXXXXXXXSTIHCSTQSGEGDK---SKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPK 546
                     +T   ST + E +K    K   + D F   D   VT  GKS LD+YL E K
Sbjct: 376  LLKYPTTASTTNASSTNAREINKQGRDKSDMLDDLFDLDDMPEVTEEGKSVLDIYLSETK 435

Query: 545  LEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCT 366
            LE   +  + VLQYWK++ HRF  L+ +A D+L+IPITTVASES+FSIG+ VL KYRS  
Sbjct: 436  LEMKNHPKMCVLQYWKDNIHRFGALSYMAYDILSIPITTVASESSFSIGSHVLNKYRSRL 495

Query: 365  LPEKVQTLICARN 327
            LP+ VQ L+C R+
Sbjct: 496  LPKHVQALLCTRS 508


>gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1
            [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc
            finger,hAT family dimerization domain, putative isoform 1
            [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc
            finger,hAT family dimerization domain, putative isoform 1
            [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc
            finger,hAT family dimerization domain, putative isoform 1
            [Theobroma cacao]
          Length = 678

 Score =  444 bits (1141), Expect = e-121
 Identities = 239/579 (41%), Positives = 354/579 (61%), Gaps = 7/579 (1%)
 Frame = -3

Query: 2030 LRRHIPTCKMLSFHDVGQMIVDHEGK---IRSKKINPKISRELLAAAIIKHDLPFSFVEY 1860
            L+R+   C      ++GQMI  ++      RS  ++P+  REL+  AI  H+LP SFVEY
Sbjct: 81   LKRYSENCVGGDTREIGQMISSNQHGSTLTRSSNLDPEKFRELVIGAIFMHNLPLSFVEY 140

Query: 1859 DGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTS 1680
             G R    Y++  V  ISRNTL + +              L     RI LT D+W + T+
Sbjct: 141  RGSRALSSYLHEDVTLISRNTLKAYMIKMHRAERSKIKCLLEETPGRINLTFDLWNSITT 200

Query: 1679 EGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDN 1500
            + YICL  HFVD+NW L  ++L F  MPPP++ V L  K++A L EWGI+ K+FS+TLDN
Sbjct: 201  DTYICLIAHFVDKNWVLQKRVLNFSFMPPPYNCVALIEKVYALLAEWGIESKLFSVTLDN 260

Query: 1499 ASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVK 1320
              +++   E+LK+ L+++ +    G+FFH+RC A +LNLIVQ+ LK ++  + K+RESVK
Sbjct: 261  VLASNAFVELLKKNLNVRKTFLVGGKFFHLRCFAQVLNLIVQDSLKEVDCVVQKVRESVK 320

Query: 1319 YVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLND 1140
            YVKGS+ R +KF ECV T+  ++   GLR DVST+WNST+LML  A+ ++KAFS L++ D
Sbjct: 321  YVKGSQVRKQKFLECV-TLMKLNAKGGLRQDVSTKWNSTFLMLKRALYFRKAFSHLEIRD 379

Query: 1139 RNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSN 960
             NY++CPS DEW+R EK+ + L  FYD T + S + YPT+NL+F  ++     L+E++S 
Sbjct: 380  SNYRYCPSEDEWERVEKLYKLLAVFYDVTCVFSRTKYPTANLFFPSMFIAHSTLQEHMSG 439

Query: 959  EDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETM 780
            +DV++ +M  +M  KF KYWS +S +LA   ILDPR K   + + Y K+  +        
Sbjct: 440  QDVYMKNMSTQMLVKFVKYWSDFSLILAIAVILDPRYKIHFVEWSYGKLYGN------DS 493

Query: 779  SIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQ 600
            +  K   D LF LY N+             +T      S E   ++GKR  D F+ +DS 
Sbjct: 494  TQFKNVRDWLFSLY-NEYAVKASPTPSSFNNT------SDEHTLTEGKR--DFFEEFDSY 544

Query: 599  TVTNAG----KSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPIT 432
                 G    KSQL+ YL EP +E +  ++L++LQ+WK +Q+R+P LA +ARDVL+IPI+
Sbjct: 545  ATVKFGAATQKSQLEWYLSEPMVERT--KELNILQFWKENQYRYPELAAMARDVLSIPIS 602

Query: 431  TVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHG 315
              ASE AFS+G ++L ++RS   P+ ++  +C ++WL G
Sbjct: 603  ATASEFAFSVGGKILDQHRSSLKPDILEATVCCKDWLFG 641


>pir||H85073 probable transposon protein [imported] - Arabidopsis thaliana
            gi|5032279|gb|AAD38227.1|AF147264_10 may be a pseudogene
            [Arabidopsis thaliana] gi|7267351|emb|CAB81124.1|
            putative transposon protein [Arabidopsis thaliana]
          Length = 483

 Score =  427 bits (1097), Expect = e-116
 Identities = 241/555 (43%), Positives = 324/555 (58%), Gaps = 1/555 (0%)
 Frame = -3

Query: 1973 IVDHEGKIRSKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTL 1794
            +V+   K +++KI+  + REL+A  II+HDLPFS+VEY+ +R   KY+N  V   SRNT 
Sbjct: 1    MVNAVAKFQARKIDQSVFRELVAKTIIQHDLPFSYVEYERVRETWKYLNADVKFFSRNTA 60

Query: 1793 VSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKIL 1614
             +DI              LA +  RI L +D+W+A T EGY+CLT H++D NWKLN+KIL
Sbjct: 61   AADIYKFYEIETDKLKRELAQLPGRISLITDLWSALTHEGYMCLTAHYIDRNWKLNNKIL 120

Query: 1613 CFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLF 1434
                                         K+FS+T+DNA +ND MQEI+K QL ++D L 
Sbjct: 121  -----------------------------KVFSITVDNAGNNDTMQEIVKSQLVLRDDLL 151

Query: 1433 CNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNI 1254
            C GEFFH+RC+ HILN+IVQ GLK I   L KIRES+KYVKGSE R   F +C+  VG I
Sbjct: 152  CKGEFFHVRCATHILNIIVQIGLKGIGDTLEKIRESIKYVKGSEHREILFAKCMENVG-I 210

Query: 1253 DTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLND-RNYKFCPSIDEWKRAEKICEF 1077
            +   GL LDV+ RWNST+ MLD A+KY+ AF +L++ D +NYKF P+  EW R +++ +F
Sbjct: 211  NLKAGLLLDVANRWNSTFKMLDRALKYRAAFGNLKVIDAKNYKFHPTDAEWHRLQQMSDF 270

Query: 1076 LEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWS 897
            LE F   TNLISGS YPTSNLYFMQVWK +  L  N SN+D  I +M   MKE+FDKYW+
Sbjct: 271  LESFDQITNLISGSIYPTSNLYFMQVWKFQNWLTVNESNQDEVIRNMIVLMKERFDKYWA 330

Query: 896  QYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXX 717
            + S + A   + DPR+K ++  Y ++K+  D    ++ M  ++A+L  LFE+Y N     
Sbjct: 331  EVSNIFAIATVFDPRLKLTLADYCFAKL--DISTREKGMKHLRAQLRKLFEVYEN----- 383

Query: 716  XXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEF 537
                     + +  +T+S E      +     F  YD                       
Sbjct: 384  -------KSNAVSPTTESREDVTPDDETAKGNFSNYD----------------------- 413

Query: 536  SYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPE 357
                         N+  RF  LA +A D+L+IPITTVASES+FSIG RVL+KYR+  LP 
Sbjct: 414  ------------VNNGPRFGKLASMACDILSIPITTVASESSFSIGTRVLSKYRNRLLPR 461

Query: 356  KVQTLICARNWLHGY 312
             VQ LIC+RNWL G+
Sbjct: 462  NVQALICSRNWLKGF 476


>ref|NP_001060325.2| Os07g0624100 [Oryza sativa Japonica Group]
            gi|255677983|dbj|BAF22239.2| Os07g0624100 [Oryza sativa
            Japonica Group]
          Length = 762

 Score =  404 bits (1037), Expect = e-109
 Identities = 217/616 (35%), Positives = 353/616 (57%), Gaps = 6/616 (0%)
 Frame = -3

Query: 2039 TSTLRRHIPTCK--MLSFHDVGQM----IVDHEGKIRSKKINPKISRELLAAAIIKHDLP 1878
            TS+LR+H+  CK  + +   VG +    +  +  ++++   +P++SR+ L   I+ H+LP
Sbjct: 159  TSSLRKHLTRCKKRISALKIVGNLDFTLMSPNSVRLKNWSFDPEVSRKELMRMIVLHELP 218

Query: 1877 FSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDV 1698
            F FVEYDG R++   +NP    ISR T+ +D                     R  LT+D+
Sbjct: 219  FQFVEYDGFRSFAASLNPYFKIISRTTIRNDCIAAFKEQKLAMKDMFKGANCRFSLTADM 278

Query: 1697 WTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIF 1518
            WT+  + GY+C+T HF+D +W++  +I+ F  +  PH+GV++   + + +++W I  KIF
Sbjct: 279  WTSNQTMGYMCVTCHFIDTDWRVQKRIIKFFGVKTPHTGVQMFNAMLSCIQDWNIADKIF 338

Query: 1517 SLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHK 1338
            S+TLDNAS+ND M ++LK  L  + ++   G+  H RC AH++NLI ++GLK I+  +  
Sbjct: 339  SVTLDNASANDSMAKLLKCNLKAKKTIPAGGKLLHNRCVAHVINLIAKDGLKVIDSIVCN 398

Query: 1337 IRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFS 1158
            IRESVKY+  S  R  KFEE ++  G I   +   +DV T WNSTYLML++A  + +A++
Sbjct: 399  IRESVKYMDNSPSRKEKFEEIIAQEG-ITCELHPTVDVCTHWNSTYLMLNAAFPFMRAYA 457

Query: 1157 SLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKL 978
            SL + ++NYK+ PS D+W+RA  +   L+  YD T ++SGS YPTSNLYF ++WKI++ L
Sbjct: 458  SLVVQEKNYKYAPSPDQWERATIVSGILKVLYDATMVVSGSLYPTSNLYFHEMWKIKLVL 517

Query: 977  KENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPV 798
             +  SN D  ++ M K+MK+KFDKYW +    L    I DPR KF  + +   +   +  
Sbjct: 518  DKERSNNDTEVASMVKKMKDKFDKYWLKSYKYLCIPVIFDPRFKFKFVEFRLGQAFGENA 577

Query: 797  KCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEF 618
            K  E +  VK +++MLF+ Y++ +K             +  S      D          +
Sbjct: 578  K--ERIDKVKKRMNMLFKEYSDKLKDSNANPLRQAEHVMSISENDPMAD----------W 625

Query: 617  KAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIP 438
              + S+ ++    ++LD+YL+E  ++  +    D+L +WK ++ ++PTLA IA+DV+A P
Sbjct: 626  VQHISEQLSEQVDTELDIYLKENPIQ-EFGNKFDILNWWKTNRSKYPTLACIAQDVVAWP 684

Query: 437  ITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGYAIDNEESASTKSTFSCES 258
             +TVASESAFS  +RV++ +R     + V+ LIC ++W    A  N   +S       ++
Sbjct: 685  ASTVASESAFSTRSRVISDFRCSLTMDSVEALICLQDWFRASAGPNINVSSVNEINYSDN 744

Query: 257  SNVLDIIDEEDSEGAG 210
               LD+ D  D +  G
Sbjct: 745  FVNLDLEDSMDGQDGG 760


>ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, partial [Capsella rubella]
            gi|482548132|gb|EOA12330.1| hypothetical protein
            CARUB_v10007925mg, partial [Capsella rubella]
          Length = 539

 Score =  394 bits (1012), Expect = e-106
 Identities = 216/454 (47%), Positives = 283/454 (62%), Gaps = 14/454 (3%)
 Frame = -3

Query: 2225 SINLDEGDTLE-NTKSNQGLGKPKE--------FSDVWNYFL---KKGVGQDGVQRAXXX 2082
            SIN+D+ D  + + K  +G GK  E        +++ W +F    KK    + V+RA   
Sbjct: 91   SINIDDDDDDDADVKGEKGKGKKPEEEPKKKRQYANCWEHFTVIKKKNNKGEIVERAQCN 150

Query: 2081 XXXXXXXXXXXXXGTSTLRRHIPTCKML-SFHDVGQMIVDHEGKIRSKKINPKISRELLA 1905
                         GT +  RH+ TCK+L S  DV +M+++ E K+++KKI+  + RE++A
Sbjct: 151  HCKHDYAYHSHKNGTKSYNRHMETCKVLISKVDVSKMMLNAEAKLQAKKIDHMVFREMVA 210

Query: 1904 AAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANIT 1725
              II+HDLPF++VEY+               ISRNT  +D+              LAN+ 
Sbjct: 211  KCIIQHDLPFAYVEYERF-------------ISRNTAAADVYKFYENEADNLKRELANLP 257

Query: 1724 NRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLK 1545
             RI  TSD+WTA T EGY+CLT H+VD NWKLN+KI+ F A  PPHSG+ +A KI    +
Sbjct: 258  GRISFTSDLWTAITQEGYMCLTAHYVDRNWKLNNKIIAFFAFAPPHSGMHIAMKILEKWE 317

Query: 1544 EWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGL 1365
            +WG+ +K+FS+T DNASSND  QEILK QL + ++L C GE+FH+RC+AHILN+IVQ GL
Sbjct: 318  DWGVQKKVFSITFDNASSNDSSQEILKSQLVLHNNLLCGGEYFHVRCAAHILNIIVQIGL 377

Query: 1364 KAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDS 1185
              I   LHKIRES+KYV+ S  R   F +CV   G I    GL LDV TRWNSTY MLD 
Sbjct: 378  DEIVDTLHKIRESIKYVRASRKREMLFAKCVEAFG-IKMKAGLILDVKTRWNSTYKMLDR 436

Query: 1184 AIKYKKAFSSLQLND-RNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYF 1008
            A+KY+ AF + ++ D RNY F P+ DEW R + ICEFLEPF   TNLISGS+YPT NLYF
Sbjct: 437  ALKYRAAFGNFKVIDGRNYNFHPTEDEWHRLKLICEFLEPFDHITNLISGSTYPTFNLYF 496

Query: 1007 MQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDK 906
            MQVWKI   L  N  N+D  I +M   M+E+FDK
Sbjct: 497  MQVWKINEWLISNSENQDEVIRNMIVPMRERFDK 530


>gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]
          Length = 682

 Score =  394 bits (1012), Expect = e-106
 Identities = 245/641 (38%), Positives = 342/641 (53%), Gaps = 7/641 (1%)
 Frame = -3

Query: 2243 STSLNQSINLDEGDTLENTKSNQGLGKPKEFSDVWNYF--LKKGVGQDGVQRAXXXXXXX 2070
            ST  +Q+ N+        T++       K  S VW ++      +  DG+ RA       
Sbjct: 33   STPSSQNDNIPAPSVSSETRNR------KWTSPVWQHYKLFDASLFPDGIARAICKYCDG 86

Query: 2069 XXXXXXXXXGTSTLRRHIPTCKMLSFHDVGQMIVDHEGKIRSKKINPKISRELLAAAIIK 1890
                     GTS  +RH  TC       V  +  D       KK++P + +E +A A+I+
Sbjct: 87   GPTLAYSGNGTSNFKRHTETCPKRPLLGVAHLTSDGSF---IKKMDPLVYKERVALAVIR 143

Query: 1889 HDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICL 1710
            H  PFS+ EYDG R   + +N S   ISRNTL +                L+N+  +ICL
Sbjct: 144  HAFPFSYAEYDGNRWLHEGLNESYKPISRNTLRNYCMKIHKREKQILKESLSNLPGKICL 203

Query: 1709 TSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGID 1530
            T+D+WTA    GYI LT H++D  W L+SKIL F  + PPH    L   I+A LKEW I 
Sbjct: 204  TTDMWTAFVGMGYISLTAHYIDSEWNLHSKILNFCHLEPPHDAPSLHDSIYAKLKEWDIR 263

Query: 1529 RKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINL 1350
             KIF++TLDNA  ND MQ++L   LS+   + C+GE+FH+RC+AHILNLIVQ+GLK I+ 
Sbjct: 264  SKIFTITLDNARCNDNMQDLLMNSLSLHSPILCDGEYFHVRCAAHILNLIVQDGLKVIDS 323

Query: 1349 ALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYK 1170
             + K+R  V ++ GSE R+ KF+   S +G +DT+  L LD  TRWNSTY ML+ A+ Y+
Sbjct: 324  GVRKLRMVVAHIVGSERRLIKFKGNASALG-VDTSKKLCLDCVTRWNSTYNMLERAMIYR 382

Query: 1169 KAFSSLQ-----LNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFM 1005
              F +++       D ++   PS  EW R  KI E L+PF   T LISG  YPT+NLYF 
Sbjct: 383  NVFPTMRGPEMKKFDPHFPEPPSEAEWIRIVKIVELLKPFDHITTLISGRKYPTANLYFK 442

Query: 1004 QVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYF 825
             VWKI+  L       D  + DM   M+ KFDKYW  YS +L+F AILDPR K   + Y 
Sbjct: 443  SVWKIQYLLTRYAKCNDTHLKDMADLMRIKFDKYWENYSMILSFAAILDPRYKLPFIKYC 502

Query: 824  YSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKS 645
            + K+  DP   +    +VK   D  ++LY   +K             I         D+ 
Sbjct: 503  FHKL--DPESAELKTKVVK---DKFYKLYEEYVKYSPHVLKETSVQMIP--------DEL 549

Query: 644  KGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLAL 465
             G      F  +D   V   G S LD YL++ +L+ +   ++DVL++WK ++ ++  LA 
Sbjct: 550  PG------FANFDGGAVIG-GLSYLDTYLDDARLDHTL--NIDVLKWWKENESKYLVLAE 600

Query: 464  IARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTL 342
            +A D+L I I TVASESAF + +RVL K+R+  L   V  L
Sbjct: 601  MAIDILTIQINTVASESAFRMESRVLMKWRTTLLLITVDAL 641


>ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [Sorghum bicolor]
            gi|241931317|gb|EES04462.1| hypothetical protein
            SORBIDRAFT_04g002725 [Sorghum bicolor]
          Length = 604

 Score =  386 bits (991), Expect = e-104
 Identities = 204/577 (35%), Positives = 321/577 (55%), Gaps = 4/577 (0%)
 Frame = -3

Query: 2039 TSTLRRHIPTCK-MLSFHDVG---QMIVDHEGKIRSKKINPKISRELLAAAIIKHDLPFS 1872
            TS +RRH+  C+  L  HD+    Q +      + + + +PK++R  L   I+ H+LPFS
Sbjct: 42   TSHMRRHLENCEPRLKMHDLVEKLQSVSTESAVLTNWRFDPKLTRCELVRLIVLHELPFS 101

Query: 1871 FVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWT 1692
            FVEYDG R +   +NP    +SR T+  +I                N   R  LT+D+WT
Sbjct: 102  FVEYDGFRRYSASLNPLAETVSRTTIKENILEAYKNHRTALKEMFENCNFRFSLTADLWT 161

Query: 1691 ACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSL 1512
            +  + GY+C+T H++D++WK+  +I+ F  +  PH G  L   +   ++ + I+ K+FS+
Sbjct: 162  SNQNIGYMCVTCHYIDDDWKVQKRIIKFCVVKTPHDGFNLYTSMLRTIRFYNIEDKLFSI 221

Query: 1511 TLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIR 1332
            TLDNA+SN+ M +ILK  L   D L C+G+ FH+RC+AH++NLIV++GL+AI+  ++ IR
Sbjct: 222  TLDNATSNNTMMDILKANLLKMDLLHCDGDLFHVRCAAHVINLIVKDGLQAIDGVINNIR 281

Query: 1331 ESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSL 1152
            ESVKY++GS+ R  KFE+ +  +G I      ++DV+ RWNSTY M+ SA+ +K AF  L
Sbjct: 282  ESVKYIRGSQSRKEKFEDIIEELG-IRCRSAPQIDVANRWNSTYDMIQSAMPFKDAFLEL 340

Query: 1151 QLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKE 972
            ++ D NY +CPS  +W+RA  +C+ L+ F   T ++SGS+YPTSNLYF Q+W +   L+E
Sbjct: 341  KVKDSNYTYCPSSQDWQRANAVCKLLKVFKKATKVVSGSTYPTSNLYFHQIWSVRQVLEE 400

Query: 971  NLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKC 792
               + +  I+ M   M+ KFDKYW           +LDPR KF  + +   +        
Sbjct: 401  EAFSPNETIAAMVLEMQAKFDKYWMISYLTNCVPVVLDPRFKFGFIEFRLKQAFGQHGSV 460

Query: 791  QETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKA 612
                 + +A +  LF  YA  +             + H  T   +         + ++  
Sbjct: 461  HHLDKVDQA-IRGLFNAYATQM-----------GGSSHVETHGDDMTSVDKGHSWSDWSE 508

Query: 611  YDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPIT 432
            + S    N   S+ D YL +        +  D+L +WK H  ++PTLA +ARD+LA+  +
Sbjct: 509  HIS-AKRNHANSEYDRYLRDDLFPCD-DDSFDILNWWKMHASKYPTLAAMARDILAVTAS 566

Query: 431  TVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWL 321
            TV SESAFS G R++  +R+      V+ L+C ++WL
Sbjct: 567  TVPSESAFSTGGRIINDHRTRLAGSTVEALLCFQDWL 603


>ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda]
            gi|548861481|gb|ERN18855.1| hypothetical protein
            AMTR_s00067p00136180 [Amborella trichopoda]
          Length = 685

 Score =  380 bits (977), Expect = e-102
 Identities = 228/621 (36%), Positives = 347/621 (55%), Gaps = 6/621 (0%)
 Frame = -3

Query: 2165 KPKEFSDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXXXXXGTSTLRRHIPTC-KMLSFH 1989
            K K  S VW+ F +K   +DG  +A                 TS L+RH+  C K +   
Sbjct: 62   KRKTISSVWDEF-EKVRSEDGSVKAACKHCHRNLVGSSAHG-TSHLKRHLGRCAKRVHIG 119

Query: 1988 DVGQMIVDHEGKIRSKKINPKI----SRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPS 1821
               Q++V    K  +  +N K     SR  LA  I+ H+ P S VE+   RT+++ + P 
Sbjct: 120  SGQQLVVTCIKKGEASSVNFKFDQGRSRYDLAKMILLHEYPSSMVEHTTFRTFVRNLQPL 179

Query: 1820 VPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDE 1641
               +S +T+ SDI              L  I +RI L++++W++C +  Y+CL  H++D+
Sbjct: 180  FSMVSPSTIESDIIEIYKKEKKKLYEELEKIPSRISLSANIWSSCQNLEYLCLIAHYIDD 239

Query: 1640 NWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKE 1461
             W L  +IL F  +P   +G  +A  +   L +W +D+K+FS+TL++AS ND     L+ 
Sbjct: 240  AWVLQKQILSFVNLPS-RTGGAIAEVLLDLLSQWNVDKKLFSITLNSASYNDVAASSLRS 298

Query: 1460 QLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFE 1281
            +LS   SL   G+ FH+ C +H++NL+VQ+GL+ I   L KIRES+KYVK S  R  +F 
Sbjct: 299  RLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQERFN 358

Query: 1280 ECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWK 1101
            E ++ +G I +   + LDV TRWNSTY MLD  ++ ++AFS     D      PS DEW+
Sbjct: 359  EIINQLG-IQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSCFAQCDSMCNMVPSEDEWE 417

Query: 1100 RAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMK 921
            R ++IC+ L+ FYD TN   GS YPT+NLYF +V+++ ++L E   + +  IS M  +MK
Sbjct: 418  RVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIKMK 477

Query: 920  EKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFEL 741
            EKFDKYW   + VLA   ++DPR K   + Y YS++  +  +    M + +   D+  E 
Sbjct: 478  EKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRM-VRQGVYDLCNEY 536

Query: 740  YANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEFKAYDSQTVTN-AGKSQLDL 564
             + +               +  ST SG G  + GK    EF+ +  ++ +N A KS+LD 
Sbjct: 537  ESKE----PLASNSESSLAVSASTSSG-GVDTHGKLWAMEFEKFVRESSSNQARKSELDR 591

Query: 563  YLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLT 384
            YLEEP   F    D ++  +W+ +  RFPTL+ +ARD+L IP++TV S+S F IG +VL 
Sbjct: 592  YLEEP--IFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVSTVTSDSTFDIGGQVLD 649

Query: 383  KYRSCTLPEKVQTLICARNWL 321
            +YRS  LPE +Q L+CA++WL
Sbjct: 650  QYRSSLLPETIQALMCAQDWL 670


>gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica]
          Length = 478

 Score =  379 bits (973), Expect = e-102
 Identities = 222/545 (40%), Positives = 304/545 (55%)
 Frame = -3

Query: 1949 RSKKINPKISRELLAAAIIKHDLPFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXX 1770
            RS K +P   RELL  AII HDLPF FVEY GIR                          
Sbjct: 14   RSSKFDPIKFRELLVMAIIMHDLPFQFVEYAGIRQT------------------------ 49

Query: 1769 XXXXXXXXXXLANITNRICLTSDVWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPP 1590
                                     T+ T++GY+CLT +F+D NWKL  +IL F  MPP 
Sbjct: 50   -------------------------TSITTDGYLCLTVYFIDVNWKLQKRILNFSFMPPL 84

Query: 1589 HSGVELAAKIFAFLKEWGIDRKIFSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHI 1410
            H+GV L  KI+  L  WG+++K+FSLTLDNASSND   E+LK QL+++D+L  NG+FFH+
Sbjct: 85   HTGVALCEKIYRLLTNWGVEKKLFSLTLDNASSNDTFVELLKGQLNLKDALLMNGKFFHV 144

Query: 1409 RCSAHILNLIVQEGLKAINLALHKIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRL 1230
            RC AHILNLIVQ+GLK I+  + KIRES+KYV+GS+G  +KF +C + V +++   GLR 
Sbjct: 145  RCCAHILNLIVQDGLKHIDDYVGKIRESIKYVRGSQGTKQKFLDCAAQV-SLECKRGLRQ 203

Query: 1229 DVSTRWNSTYLMLDSAIKYKKAFSSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTN 1050
            DV TRWNST+LM++SA+ Y++AF  LQL+D NYK   S DEW + EK+ +FL+ FYD T 
Sbjct: 204  DVPTRWNSTFLMINSALYYQRAFLHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTC 263

Query: 1049 LISGSSYPTSNLYFMQVWKIEVKLKENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFG 870
            L  G+ YPT+NLYF QV+ +E  LK+                     KYW +YS +LA  
Sbjct: 264  LFFGTKYPTANLYFPQVFVVEDTLKK--------------------AKYWKEYSLILAIA 303

Query: 869  AILDPRVKFSMLSYFYSKVESDPVKCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXX 690
             ILDPR K   + + Y ++     K    M+ V+  L  LF+LY                
Sbjct: 304  VILDPRYKIQFVKFCYKRLYGYNSK---EMTKVRDMLFSLFDLYVR-------------- 346

Query: 689  STIHCSTQSGEGDKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVL 510
              I+ S++S  G  S                V+   +S +D       +EF  +E     
Sbjct: 347  --IYTSSESVSGTSS----------------VSIGARSHVD------DMEFDNFE----- 377

Query: 509  QYWKNHQHRFPTLALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICAR 330
                 +Q R+P L+++ RD+L+IPI+TVASESAFS+G R+L +YRS   P+ V+ L+C R
Sbjct: 378  ----MNQFRYPELSILVRDLLSIPISTVASESAFSVGGRMLDQYRSALKPKNVEVLVCTR 433

Query: 329  NWLHG 315
            +W+ G
Sbjct: 434  DWIFG 438


>emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera]
          Length = 1266

 Score =  374 bits (961), Expect = e-101
 Identities = 222/579 (38%), Positives = 320/579 (55%), Gaps = 6/579 (1%)
 Frame = -3

Query: 2039 TSTLRRHIPTCKMLSFHDVGQMIVDHEGKIRSK------KINPKISRELLAAAIIKHDLP 1878
            T  L  H+  C      D+ Q  +  E K   K        +  ISRE LA AII H+ P
Sbjct: 151  TKHLHVHLDRCIKRRNVDIKQQFLAIERKGYGKVQIGGFTFDQDISREKLARAIILHEYP 210

Query: 1877 FSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSDV 1698
             S V++ G R +   + P    +SRNT+  DI              L  +  R+ +T+D+
Sbjct: 211  LSIVDHAGFRDFASSLQPLFKMVSRNTIKDDIMKIYEFEKGKMSSYLEKLETRMAITTDM 270

Query: 1697 WTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKIF 1518
            WT+   +GY+ +T H++DE+W L+  I+ F  +PPPH+   L+  +  FL +W +DRK+ 
Sbjct: 271  WTSNQKKGYMAITVHYIDESWLLHHHIVRFVYVPPPHTKEVLSDVLLDFLLDWNMDRKLS 330

Query: 1517 SLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALHK 1338
            ++T+DN SSND M +IL E+LS   SL  NG+ FH+RC+AH+LNLIV+EGL  I + + K
Sbjct: 331  TITVDNCSSNDGMIDILSEKLSSSGSLLLNGKIFHMRCAAHVLNLIVKEGLDVIRVEIEK 390

Query: 1337 IRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAFS 1158
            IRESV Y   +  R+ KFE+    +  +  N  L LD  TRWNSTYLML  AI YK  F 
Sbjct: 391  IRESVAYWSATPSRVEKFEDAARQL-RLPCNKKLCLDCKTRWNSTYLMLSIAITYKDVFP 449

Query: 1157 SLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVKL 978
             L+  ++ Y   PS +EW  A +ICE L+ FY+ T L SG +YPT+N +F++V +I+  L
Sbjct: 450  RLKQREKLYTTVPSEEEWNLAREICERLKLFYNITKLFSGRNYPTANTFFIKVCEIKEAL 509

Query: 977  KENLSNEDVFISDMCKRMKEKFDKYWSQYSTVLAFGAILDPRVKFSMLSYFYSKVESDPV 798
             + L   +  +S M   M EKFDKYWS    V+A   +LDPR K  +L +++  +     
Sbjct: 510  YDWLICSNEVVSTMASSMLEKFDKYWSGCHIVMAIAVVLDPRYKMKILEFYFPIMYGSEA 569

Query: 797  KCQETMSIVKAKLDMLFELYANDIKXXXXXXXXXXXSTIHCSTQSGEGDKSKGKRMFDEF 618
               E   I +   D+L E Y +  K           S  +    + +      K  FD F
Sbjct: 570  S-SEIGKIRQLCYDLLSE-YQSKSKMGQQTSSHGASSVSNLFELTYDEQDPLSK--FDLF 625

Query: 617  KAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPTLALIARDVLAIP 438
                S +     KS+LD YLEE  L      D DVL +WK +  ++PTL +I RD+ AIP
Sbjct: 626  --VHSTSEEGHAKSELDYYLEETVLP--RISDFDVLSWWKTNGIKYPTLQMIVRDIYAIP 681

Query: 437  ITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWL 321
            ++TVASESAFS G R+++K+RS   P  ++ L+CA++WL
Sbjct: 682  VSTVASESAFSTGGRMVSKHRSRLHPNTLEALMCAQSWL 720


>gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus persica]
          Length = 567

 Score =  373 bits (957), Expect = e-100
 Identities = 193/423 (45%), Positives = 270/423 (63%), Gaps = 3/423 (0%)
 Frame = -3

Query: 2231 NQSINLDEGDTLENTKSNQGLGKPKEFSDVWNYFLKKGVGQDGVQRAXXXXXXXXXXXXX 2052
            N  ++LD  +      +  G  + K  S VW +F    + ++  QRA             
Sbjct: 22   NNVVDLDPSNNNNAVVTQIGKRRRKLTSAVWTHFEILHIDENNEQRAKCMKCGQKYLFDS 81

Query: 2051 XXXGTSTLRRHIPTCKMLSFHDVGQMIVDH-EGKI--RSKKINPKISRELLAAAIIKHDL 1881
                T  L+RHI +C  +   D+GQ+++   +G I  RS K +P   RELL  AII HDL
Sbjct: 82   RYG-TGNLKRHIESCVKIDTCDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDL 140

Query: 1880 PFSFVEYDGIRTWMKYINPSVPCISRNTLVSDIXXXXXXXXXXXXXXLANITNRICLTSD 1701
            PF FVEY GIR    Y+   +  +SRNT  +D+              L ++  R+CLTSD
Sbjct: 141  PFQFVEYSGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSD 200

Query: 1700 VWTACTSEGYICLTGHFVDENWKLNSKILCFDAMPPPHSGVELAAKIFAFLKEWGIDRKI 1521
            +WT+ T++GY+CLT HF+D NWKL  +IL F  MPPPH+GV L  KI+  L +WG+++K+
Sbjct: 201  LWTSITTDGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKL 260

Query: 1520 FSLTLDNASSNDCMQEILKEQLSIQDSLFCNGEFFHIRCSAHILNLIVQEGLKAINLALH 1341
            FS+TLDNASSND   E+LK QL+++D+L  NG+FFHIRC AHILNLIVQ+GLK I+ ++ 
Sbjct: 261  FSMTLDNASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVG 320

Query: 1340 KIRESVKYVKGSEGRMRKFEECVSTVGNIDTNIGLRLDVSTRWNSTYLMLDSAIKYKKAF 1161
            KIRES+KYV+GS+GR +KF  C + V +++   GLR DV TRWNST+LM+DSA+ Y++AF
Sbjct: 321  KIRESIKYVRGSQGRKQKFLNCAAQV-SLECKRGLRQDVPTRWNSTFLMIDSALHYQRAF 379

Query: 1160 SSLQLNDRNYKFCPSIDEWKRAEKICEFLEPFYDTTNLISGSSYPTSNLYFMQVWKIEVK 981
              LQL+D NYK     +EW + +K+ +FL+ FYD T L  G+ YP +NLYF QV+ +E  
Sbjct: 380  LHLQLSDSNYKHSLPQNEWGKLKKLSKFLKVFYDVTCLFFGTKYPIANLYFPQVFVVEDT 439

Query: 980  LKE 972
            L++
Sbjct: 440  LRK 442



 Score =  105 bits (263), Expect = 7e-20
 Identities = 53/116 (45%), Positives = 78/116 (67%)
 Frame = -3

Query: 653 DKSKGKRMFDEFKAYDSQTVTNAGKSQLDLYLEEPKLEFSYYEDLDVLQYWKNHQHRFPT 474
           D  +  + FD F++   +  T+A K+QL LYL EPK++      L+VL +WK +Q R+P 
Sbjct: 438 DTLRKAKEFDNFES--EEFTTSAQKTQLQLYLNEPKIDRK--TKLNVLNFWKVNQFRYPE 493

Query: 473 LALIARDVLAIPITTVASESAFSIGARVLTKYRSCTLPEKVQTLICARNWLHGYAI 306
           L+++ARD+L+IPI+TVA ESAFS+G RVL +Y S   PE V+ L+C  +W+ G  I
Sbjct: 494 LSILARDLLSIPISTVAYESAFSVGGRVLDQYHSALKPENVEALVCTHDWIFGEGI 549

