BLASTX nr result

ID: Rauwolfia21_contig00031038 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00031038
         (832 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...   176   1e-41
gb|EOY19559.1| T6D22.19, putative [Theobroma cacao]                   171   3e-40
gb|EOX99846.1| T6D22.19, putative [Theobroma cacao]                   170   6e-40
gb|EOX99730.1| Ac-like transposase THELMA13 [Theobroma cacao]         166   1e-38
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...   163   6e-38
ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [...   162   1e-37
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...   157   6e-36
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]             154   3e-35
ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Caps...   154   5e-35
ref|XP_006300176.1| hypothetical protein CARUB_v10016411mg, part...   149   1e-33
pir||H85073 probable transposon protein [imported] - Arabidopsis...   148   3e-33
ref|XP_006279278.1| hypothetical protein CARUB_v100165484mg, par...   132   1e-32
gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]        145   2e-32
gb|AAD12209.1| Ac-like transposase [Arabidopsis thaliana]             106   1e-30
ref|XP_006299532.1| hypothetical protein CARUB_v10015704mg [Caps...   125   2e-30
gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...   137   5e-30
gb|EOY09496.1| Ac-like transposase THELMA13 [Theobroma cacao]         136   8e-30
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...   135   2e-29
gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p...   124   3e-26
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...   121   3e-25

>ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella]
           gi|482560944|gb|EOA25135.1| hypothetical protein
           CARUB_v10018444mg, partial [Capsella rubella]
          Length = 547

 Score =  176 bits (445), Expect = 1e-41
 Identities = 96/218 (44%), Positives = 127/218 (58%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
           AH+LNLI Q GLK     L+KIRE+VK++K S+GR   F+ECV  V I  + GLK+DV  
Sbjct: 187 AHVLNLIVQVGLKFVESPLHKIRETVKWIKWSEGRKDLFKECVIDVGIKYTAGLKMDVST 246

Query: 186 KMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTKLISG 365
           +            K  +                +             LEPFY+ TKL SG
Sbjct: 247 RWNSTYLMLGSVIKYRRAFSLLERAERNYKFCPSDEEWNKAEKIYTFLEPFYDITKLFSG 306

Query: 366 TSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAILD 545
           TSYPT+NLYF  IWKIEC+L    +  D  +++MA  M+ KFDKYW+EYS +L+ GAILD
Sbjct: 307 TSYPTANLYFAQIWKIECLLNSYSNDGDMELQNMANEMRTKFDKYWEEYSIILSIGAILD 366

Query: 546 PRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           PRMK + L YC+  +D ST++AKV+  K KL  L++ Y
Sbjct: 367 PRMKVEILTYCFDKLDPSTTKAKVEVVKQKLNLLFDQY 404


>gb|EOY19559.1| T6D22.19, putative [Theobroma cacao]
          Length = 559

 Score =  171 bits (433), Expect = 3e-40
 Identities = 80/112 (71%), Positives = 92/112 (82%)
 Frame = +3

Query: 327 LEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWK 506
           LEPFYET  LISG+SYPTSNLYFM +WKIE IL ENL  EDE++KDM+ RMK KFDKYWK
Sbjct: 330 LEPFYETINLISGSSYPTSNLYFMQVWKIESILNENLHNEDEVIKDMSQRMKMKFDKYWK 389

Query: 507 EYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENYA 662
           +YS VLAFGAILDPRMK DFL +CY  ID ST   K++N K+KLY+L+E YA
Sbjct: 390 DYSVVLAFGAILDPRMKLDFLRFCYSKIDASTCHEKLENVKTKLYELFEQYA 441



 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 33/59 (55%), Positives = 44/59 (74%)
 Frame = +1

Query: 178 SQTKWNSTHLMLESAIKFKKAFASLQLIDRGYKYCPTEEEWSRGIKMCEFWSHFMRPQN 354
           + T+WNST+LM ESAIK++KAFASLQ +DR YKY P+++EW R + +CEF   F    N
Sbjct: 280 ASTRWNSTYLMFESAIKYQKAFASLQFVDRTYKYNPSDKEWGRAMIICEFLEPFYETIN 338


>gb|EOX99846.1| T6D22.19, putative [Theobroma cacao]
          Length = 247

 Score =  170 bits (430), Expect = 6e-40
 Identities = 80/112 (71%), Positives = 92/112 (82%)
 Frame = +3

Query: 327 LEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWK 506
           LEPFYETT LISG+SYPTSNLYFM +WKIE IL E L  EDE++KDM+ RMK KFDKYWK
Sbjct: 7   LEPFYETTNLISGSSYPTSNLYFMQVWKIESILNEYLHNEDEMIKDMSQRMKMKFDKYWK 66

Query: 507 EYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENYA 662
           +YS VLAFGAILDPRMK DFL +CY  ID ST   K++N K+KLY+L+E YA
Sbjct: 67  DYSVVLAFGAILDPRMKLDFLRFCYSKIDASTCHEKLENMKTKLYELFEQYA 118


>gb|EOX99730.1| Ac-like transposase THELMA13 [Theobroma cacao]
          Length = 244

 Score =  166 bits (419), Expect = 1e-38
 Identities = 78/111 (70%), Positives = 91/111 (81%)
 Frame = +3

Query: 330 EPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKE 509
           +PFYETT LISG+SYPTSNLYFM +WKIE IL  NL  EDEI+KDM+ RMK KFDKYWK+
Sbjct: 121 KPFYETTNLISGSSYPTSNLYFMQVWKIESILNANLHNEDEIIKDMSQRMKMKFDKYWKD 180

Query: 510 YSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENYA 662
           YS VLAF AILDP+MKFDFL +CY  ID ST   K++N K+KLY+L+E YA
Sbjct: 181 YSVVLAFRAILDPKMKFDFLRFCYYKIDASTCHEKLENVKTKLYELFEEYA 231


>gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana]
          Length = 659

 Score =  163 bits (413), Expect = 6e-38
 Identities = 89/219 (40%), Positives = 121/219 (55%)
 Frame = +3

Query: 3   CAHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVP 182
           CAHILNLI Q GL++A   L  I ESVK+VK S+ R   F  C++ V I +  GL LDV 
Sbjct: 271 CAHILNLIVQAGLELASGLLENITESVKFVKASESRKDSFATCLECVGIKSGAGLSLDVS 330

Query: 183 NKMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTKLIS 362
            +         +  K  K       Y      L      + G     +L+PF   T   S
Sbjct: 331 TRWNSTYEMLARALKFRKAFAILNLYERGYCSLPTEEECDRGEKICDLLKPFNTITTYFS 390

Query: 363 GTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAIL 542
           G  YPT+N+YF+ +WKIE +L++  +C+D  V++MA +M++KF KYW EYS +LA GA L
Sbjct: 391 GVKYPTANIYFIQVWKIELLLMKYANCDDVDVREMAKKMQKKFAKYWNEYSVILAMGAAL 450

Query: 543 DPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           DPR+K   L   Y  +D  T+E KVD  ++ L  LYE Y
Sbjct: 451 DPRLKLQILRSAYNKVDPVTAEGKVDIVRNNLILLYEEY 489


>ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula]
           gi|355504225|gb|AES85428.1| hypothetical protein
           MTR_126s0001, partial [Medicago truncatula]
          Length = 555

 Score =  162 bits (410), Expect = 1e-37
 Identities = 95/251 (37%), Positives = 136/251 (54%), Gaps = 4/251 (1%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVD-INASGGLKLDVP 182
           A +LN I +E LK+    ++KIRES+ +V+ S  R +KF+EC ++V  +++S  L LD+ 
Sbjct: 192 ARVLNQIVEEALKLVSCGVHKIRESIMFVRHSKSRREKFKECFEKVGGVDSSVHLHLDIS 251

Query: 183 NKMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTKLIS 362
             +        +  K          Y     +  +    +        L PF ET  +I+
Sbjct: 252 MSLSSTYMLLERALKYRCAFESFHLYDDSYDLCPSAEEWKRVEKICAFLLPFCETANMIN 311

Query: 363 GTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAIL 542
            T++PTSNLYF+ +WK++C+L+++L  EDE +K MA RM  KF+KYW EYS VLA GA+L
Sbjct: 312 STTHPTSNLYFLQVWKVQCVLVDSLGDEDEDIKKMAERMMSKFEKYWDEYSVVLALGAVL 371

Query: 543 DPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENYAXXXXXXXXXXXXXXNQIQSG 722
           DPRMKF  L YCY  +D ST E K+   K KL  L+E ++              NQ QS 
Sbjct: 372 DPRMKFTTLAYCYSKLDASTCERKLQQVKRKLCMLFEKHSGNSTTAGVQRTIKENQDQSS 431

Query: 723 K---SKKFKFL 746
                KK K L
Sbjct: 432 SMPLQKKLKSL 442


>gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana]
          Length = 577

 Score =  157 bits (396), Expect = 6e-36
 Identities = 87/229 (37%), Positives = 130/229 (56%), Gaps = 11/229 (4%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
           AHILNLI Q+GL+V   AL KIRE+VKYVKGS+ R   FQ C+  + I     L LDV  
Sbjct: 199 AHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEANLVLDVST 258

Query: 186 KMEFDTFNA-----------RKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILE 332
           +    T++            R   ++ +G    P+              E       +L+
Sbjct: 259 RWN-STYHMLSRAIQFKDVLRSLAEVDRGYKSFPS----------AVEWERAELICDLLK 307

Query: 333 PFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEY 512
           PF E TKLISG+SYPT+N+YFM +W I+C L ++    D ++++M   M  K+DKYW+++
Sbjct: 308 PFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRVIREMVEDMTEKYDKYWEDF 367

Query: 513 SEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           S++LA  A+LDPR+KF  L YCY  ++  TS+  + + + K+ +L+  Y
Sbjct: 368 SDILAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAY 416


>gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]
          Length = 745

 Score =  154 bits (390), Expect = 3e-35
 Identities = 86/219 (39%), Positives = 126/219 (57%), Gaps = 1/219 (0%)
 Frame = +3

Query: 6    AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
            AHILNLI Q+GL+V   AL KIRE+VKYVKGS+ R   FQ C+  + I     L LDV  
Sbjct: 382  AHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEASLVLDVST 441

Query: 186  KMEFDTFNARKCYKI*KGIC*SPTY**R-L*ILSNGRRME*GH*NV*ILEPFYETTKLIS 362
            +         +  +  K +  S     R      +    E       +L+PF E TKLIS
Sbjct: 442  RWNSTYHMLSRAIQF-KDVLHSLAEVDRGYKSFPSAVEWERAELICDLLKPFAEITKLIS 500

Query: 363  GTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAIL 542
            G+SYPT+N+YFM +W I+C L ++    D  +++M   M  K+DKYW+++S++LA  A+L
Sbjct: 501  GSSYPTANVYFMQVWAIKCWLGDHDDSHDRAIREMVEDMTEKYDKYWEDFSDILAMAAVL 560

Query: 543  DPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
            DPR+KF  L YCY  ++  TS+  + + + K+ +L+  Y
Sbjct: 561  DPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAY 599


>ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Capsella rubella]
           gi|482549037|gb|EOA13231.1| hypothetical protein
           CARUB_v10026257mg [Capsella rubella]
          Length = 508

 Score =  154 bits (388), Expect = 5e-35
 Identities = 95/219 (43%), Positives = 124/219 (56%)
 Frame = +3

Query: 3   CAHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVP 182
           CAHILNLI Q+GLKV   AL KIR+SVKYVK +  R   F+ C       A   LK+ V 
Sbjct: 192 CAHILNLIVQDGLKVIGGALSKIRDSVKYVKATKARGIAFETC-------AFKRLKV-VD 243

Query: 183 NKMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTKLIS 362
              +    N   C    K I                         + IL+PFY+ T L+ 
Sbjct: 244 KSYKHCPSNDDWCKA--KNI-------------------------LEILKPFYKITVLML 276

Query: 363 GTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAIL 542
           G SY TSNLYF+++WKIEC+L EN    D+ ++DMA RM+ KF KYW +YS  LA GA+L
Sbjct: 277 GRSYSTSNLYFVNVWKIECLLKENERHSDKDIRDMAGRMRIKFKKYWDQYSVSLAMGAVL 336

Query: 543 DPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           DPRMKF  L  CY+ +D ST + K+D+ + KL  L+++Y
Sbjct: 337 DPRMKFKLLKRCYEELDPSTCKEKLDHIEEKLRLLFDDY 375


>ref|XP_006300176.1| hypothetical protein CARUB_v10016411mg, partial [Capsella rubella]
           gi|482568885|gb|EOA33074.1| hypothetical protein
           CARUB_v10016411mg, partial [Capsella rubella]
          Length = 362

 Score =  149 bits (376), Expect = 1e-33
 Identities = 84/211 (39%), Positives = 125/211 (59%), Gaps = 2/211 (0%)
 Frame = +3

Query: 33  EGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPNKMEFDTFNA 212
           +GL V   AL KIR+SVKYVKGS+ R   F+ C++ V I +  GL +DV  +    TF  
Sbjct: 6   DGLDVIRGALKKIRDSVKYVKGSETRKNLFRSCMETVGIQSEAGLIIDVSTRWN-STFLM 64

Query: 213 RKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*N--V*ILEPFYETTKLISGTSYPTSN 386
                + K +            L N   ++  +      +L+PF E TK+ISG+SYPTSN
Sbjct: 65  LSRAILFKDV------------LRNLVEVDKSYMKSICNLLKPFAEITKMISGSSYPTSN 112

Query: 387 LYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAILDPRMKFDF 566
           +YFM +W I+C L ++    D I+ ++   M  KF+KYW+E++++LA  A+LDPR+ F F
Sbjct: 113 MYFMPVWAIKCWLRDHEDSSDMIICEIVEDMNEKFNKYWEEFNDILAIAAVLDPRLTFVF 172

Query: 567 LMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           L YCY T+D  TS++ V + +SK+ KL+  Y
Sbjct: 173 LEYCYNTLDPLTSKSNVAHIRSKMAKLFRAY 203


>pir||H85073 probable transposon protein [imported] - Arabidopsis thaliana
           gi|5032279|gb|AAD38227.1|AF147264_10 may be a pseudogene
           [Arabidopsis thaliana] gi|7267351|emb|CAB81124.1|
           putative transposon protein [Arabidopsis thaliana]
          Length = 483

 Score =  148 bits (373), Expect = 3e-33
 Identities = 91/225 (40%), Positives = 124/225 (55%), Gaps = 6/225 (2%)
 Frame = +3

Query: 3   CA-HILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDV 179
           CA HILN+I Q GLK   D L KIRES+KYVKGS+ R   F +C++ V IN   GL LDV
Sbjct: 161 CATHILNIIVQIGLKGIGDTLEKIRESIKYVKGSEHREILFAKCMENVGINLKAGLLLDV 220

Query: 180 PNKME--FDTFNARKCYKI*KG---IC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYE 344
            N+    F   +    Y+   G   +  +  Y          R  +        LE F +
Sbjct: 221 ANRWNSTFKMLDRALKYRAAFGNLKVIDAKNYKFHPTDAEWHRLQQMSD----FLESFDQ 276

Query: 345 TTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVL 524
            T LISG+ YPTSNLYFM +WK +  L  N S +DE++++M   MK +FDKYW E S + 
Sbjct: 277 ITNLISGSIYPTSNLYFMQVWKFQNWLTVNESNQDEVIRNMIVLMKERFDKYWAEVSNIF 336

Query: 525 AFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           A   + DPR+K     YC+  +D ST E  + + +++L KL+E Y
Sbjct: 337 AIATVFDPRLKLTLADYCFAKLDISTREKGMKHLRAQLRKLFEVY 381


>ref|XP_006279278.1| hypothetical protein CARUB_v100165484mg, partial [Capsella rubella]
           gi|482547957|gb|EOA12176.1| hypothetical protein
           CARUB_v100165484mg, partial [Capsella rubella]
          Length = 171

 Score =  132 bits (332), Expect(2) = 1e-32
 Identities = 59/111 (53%), Positives = 79/111 (71%)
 Frame = +3

Query: 327 LEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWK 506
           L+PF E TK+ SG++YPTSNLYF  IW IEC L  +    DE+++ M   MK KFDKYW+
Sbjct: 38  LKPFDEITKMFSGSTYPTSNLYFKQIWNIECWLRRHEFSSDEVIEKMVENMKLKFDKYWE 97

Query: 507 EYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           EYSE+LA GA+LDPRMKF  L +C+  +D STS  K ++ + K+Y L+ +Y
Sbjct: 98  EYSEILAIGAVLDPRMKFVLLEFCFNALDPSTSAQKCEHIRKKMYTLFGSY 148



 Score = 34.7 bits (78), Expect(2) = 1e-32
 Identities = 16/39 (41%), Positives = 23/39 (58%)
 Frame = +1

Query: 223 IKFKKAFASLQLIDRGYKYCPTEEEWSRGIKMCEFWSHF 339
           IK + A  +L+ I+  YK  PT+ +W RG  +CEF   F
Sbjct: 3   IKHQVALQNLRQIEPSYKSFPTDAKWVRGKLICEFLKPF 41


>gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]
          Length = 682

 Score =  145 bits (365), Expect = 2e-32
 Identities = 89/234 (38%), Positives = 123/234 (52%), Gaps = 16/234 (6%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
           AHILNLI Q+GLKV    + K+R  V ++ GS+ R+ KF+     + ++ S  L LD   
Sbjct: 307 AHILNLIVQDGLKVIDSGVRKLRMVVAHIVGSERRLIKFKGNASALGVDTSKKLCLDCVT 366

Query: 186 KMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*N---------------- 317
           +    T+N  +   I + +   PT          G  M+    +                
Sbjct: 367 RWN-STYNMLERAMIYRNVF--PTM--------RGPEMKKFDPHFPEPPSEAEWIRIVKI 415

Query: 318 V*ILEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDK 497
           V +L+PF   T LISG  YPT+NLYF  +WKI+ +L     C D  +KDMA  M+ KFDK
Sbjct: 416 VELLKPFDHITTLISGRKYPTANLYFKSVWKIQYLLTRYAKCNDTHLKDMADLMRIKFDK 475

Query: 498 YWKEYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           YW+ YS +L+F AILDPR K  F+ YC+  +D  ++E K    K K YKLYE Y
Sbjct: 476 YWENYSMILSFAAILDPRYKLPFIKYCFHKLDPESAELKTKVVKDKFYKLYEEY 529


>gb|AAD12209.1| Ac-like transposase [Arabidopsis thaliana]
          Length = 308

 Score =  106 bits (265), Expect(2) = 1e-30
 Identities = 55/111 (49%), Positives = 71/111 (63%)
 Frame = +3

Query: 327 LEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWK 506
           L PF E TKLISG+SYPT++LYFMH+WKIE  L  +   +DEI+ DM   MK KF     
Sbjct: 71  LRPFEEMTKLISGSSYPTASLYFMHVWKIESWLRAHERTDDEIIFDMVESMKLKF----- 125

Query: 507 EYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
                     ILDPR+KF FL YCYK++  ST E+K+++ + K+ KLY  Y
Sbjct: 126 ---------KILDPRLKFAFLRYCYKSLKPSTCESKLEHIRKKMEKLYRFY 167



 Score = 53.9 bits (128), Expect(2) = 1e-30
 Identities = 24/52 (46%), Positives = 35/52 (67%)
 Frame = +1

Query: 184 TKWNSTHLMLESAIKFKKAFASLQLIDRGYKYCPTEEEWSRGIKMCEFWSHF 339
           T+WNST+LML  AI+FK+   +L  ++  YK  P++ EWSRG  +C+F   F
Sbjct: 23  TRWNSTYLMLSKAIQFKEVSRNLSELEPSYKSFPSKLEWSRGELICKFLRPF 74


>ref|XP_006299532.1| hypothetical protein CARUB_v10015704mg [Capsella rubella]
           gi|482568241|gb|EOA32430.1| hypothetical protein
           CARUB_v10015704mg [Capsella rubella]
          Length = 245

 Score =  125 bits (314), Expect(2) = 2e-30
 Identities = 58/112 (51%), Positives = 80/112 (71%)
 Frame = +3

Query: 324 ILEPFYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYW 503
           IL PFY+ T L+S   Y TSNLYF HIWKI+C+L  N    D ++++M   ++ K+DKY 
Sbjct: 40  ILMPFYKITTLMSRRRYSTSNLYFGHIWKIQCLLEVNRDHVDNVIREMVYELRLKYDKYL 99

Query: 504 KEYSEVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           ++Y+ VLA GA+LDPRMKF  L  CY  +D  TS+AK+++ KS+LYKL+E Y
Sbjct: 100 EQYNVVLAMGAVLDPRMKFKLLKRCYDELDLFTSQAKINHLKSELYKLFEEY 151



 Score = 34.7 bits (78), Expect(2) = 2e-30
 Identities = 12/33 (36%), Positives = 22/33 (66%)
 Frame = +1

Query: 208 MLESAIKFKKAFASLQLIDRGYKYCPTEEEWSR 306
           M+E A+K+  A    +++D+ YKY P+ ++W R
Sbjct: 1   MIEKALKYDCALNRFKVVDKKYKYFPSAQDWKR 33


>gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica]
          Length = 696

 Score =  137 bits (345), Expect = 5e-30
 Identities = 85/226 (37%), Positives = 116/226 (51%), Gaps = 7/226 (3%)
 Frame = +3

Query: 3   CAHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVP 182
           CAHILNLI Q+GLK   D++ KIRES+KYV+GS GR QKF  C  +V +    GL+ DVP
Sbjct: 300 CAHILNLIVQDGLKHIDDSVGKIRESIKYVRGSQGRKQKFLNCDARVSLECKRGLRQDVP 359

Query: 183 NKMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTK--- 353
            +               +           L +  +  +          LE   +  K   
Sbjct: 360 TRWNSTFLMIDSALYYQRAFL-------HLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFY 412

Query: 354 ----LISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEV 521
               L SGT YPT+NLYF  ++ +E  L +     D  +K MAT+M  KFDKYWKEYS +
Sbjct: 413 DVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDSDSFMKSMATQMMEKFDKYWKEYSLI 472

Query: 522 LAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
           LA   ILDPR K  F+ +CYK +    SE ++   +  L+ L++ Y
Sbjct: 473 LAIAVILDPRYKIQFVEFCYKRLYGYNSE-EMTKVRDMLFSLFDLY 517


>gb|EOY09496.1| Ac-like transposase THELMA13 [Theobroma cacao]
          Length = 373

 Score =  136 bits (343), Expect = 8e-30
 Identities = 88/226 (38%), Positives = 117/226 (51%), Gaps = 7/226 (3%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
           AHILNLI Q+GLK    A+ K RES+KYVKGS GR QKF ECV  V++NA   LK DVP 
Sbjct: 18  AHILNLIVQDGLKEVDSAIQKGRESIKYVKGSQGRKQKFLECVSLVNLNAKRDLKQDVPT 77

Query: 186 KMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEP-------FYE 344
           +                G          L I  +  +          +E        FYE
Sbjct: 78  RWNSTFLMLESALYFRLGFS-------HLEISDSNFKHSPSRDEWDRIEKLSKFLSVFYE 130

Query: 345 TTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVL 524
            T + SGT YPT++L+F  I+    IL E++S +D  +K+MAT+M  KF KYW ++S +L
Sbjct: 131 ITCVFSGTKYPTADLHFPSIFMARMILEEHMSGDDVYLKNMATQMFVKFKKYWSQFSLIL 190

Query: 525 AFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENYA 662
               I DPR K  F+ + Y  +  S S A+    K  L+ LY+ YA
Sbjct: 191 TIAVIFDPRYKIQFMEWSYTKLYGSNS-AEFKKVKDHLFALYDEYA 235


>gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica]
          Length = 697

 Score =  135 bits (339), Expect = 2e-29
 Identities = 84/226 (37%), Positives = 114/226 (50%), Gaps = 7/226 (3%)
 Frame = +3

Query: 3   CAHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVP 182
           CAHILNLI Q+GLK   D++ KIRES+KYV+GS GR QKF  C  QV +    GL+ DVP
Sbjct: 301 CAHILNLIVQDGLKHIDDSVGKIRESIKYVRGSQGRKQKFLNCAAQVSLECKRGLRQDVP 360

Query: 183 NKMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTK--- 353
            +               +           L +  +  +          LE   +  K   
Sbjct: 361 TRWNSTFLMIDSALYYQRAFL-------HLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFY 413

Query: 354 ----LISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEV 521
               L SGT YPT+NLYF  ++ +E  L +     D  +K MAT+M   FDKYWKEYS +
Sbjct: 414 DVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDSDSFMKSMATQMMEMFDKYWKEYSLI 473

Query: 522 LAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
            A   ILDPR K  F+ +CYK +    SE ++   +  L+ L++ Y
Sbjct: 474 PAIAVILDPRYKIQFVEFCYKRLYGYNSE-EMTKVRDMLFSLFDLY 518


>gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1
           [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc
           finger,hAT family dimerization domain, putative isoform
           1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc
           finger,hAT family dimerization domain, putative isoform
           1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc
           finger,hAT family dimerization domain, putative isoform
           1 [Theobroma cacao]
          Length = 678

 Score =  124 bits (312), Expect = 3e-26
 Identities = 81/221 (36%), Positives = 115/221 (52%), Gaps = 2/221 (0%)
 Frame = +3

Query: 6   AHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVPN 185
           A +LNLI Q+ LK     + K+RESVKYVKGS  R QKF ECV  + +NA GGL+ DV  
Sbjct: 294 AQVLNLIVQDSLKEVDCVVQKVRESVKYVKGSQVRKQKFLECVTLMKLNAKGGLRQDVST 353

Query: 186 KMEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEPFYETTKLISG 365
           K        ++     K                +    E       +L  FY+ T + S 
Sbjct: 354 KWNSTFLMLKRALYFRKAFSHLEIRDSNYRYCPSEDEWERVEKLYKLLAVFYDVTCVFSR 413

Query: 366 TSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYSEVLAFGAILD 545
           T YPT+NL+F  ++     L E++S +D  +K+M+T+M  KF KYW ++S +LA   ILD
Sbjct: 414 TKYPTANLFFPSMFIAHSTLQEHMSGQDVYMKNMSTQMLVKFVKYWSDFSLILAIAVILD 473

Query: 546 PRMKFDFLMYCYKTI--DDSTSEAKVDNSKSKLYKLYENYA 662
           PR K  F+ + Y  +  +DST   +  N +  L+ LY  YA
Sbjct: 474 PRYKIHFVEWSYGKLYGNDST---QFKNVRDWLFSLYNEYA 511


>ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda]
           gi|548861481|gb|ERN18855.1| hypothetical protein
           AMTR_s00067p00136180 [Amborella trichopoda]
          Length = 685

 Score =  121 bits (303), Expect = 3e-25
 Identities = 72/228 (31%), Positives = 116/228 (50%), Gaps = 9/228 (3%)
 Frame = +3

Query: 3   CAHILNLIFQEGLKVAHDALYKIRESVKYVKGSDGRIQKFQECVQQVDINASGGLKLDVP 182
           C+H++NL+ Q+GL+V  + L KIRES+KYVK S  R ++F E + Q+ I +   + LDVP
Sbjct: 318 CSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQERFNEIINQLGIQSKQNIFLDVP 377

Query: 183 NK---------MEFDTFNARKCYKI*KGIC*SPTY**RL*ILSNGRRME*GH*NV*ILEP 335
            +         +  +   A  C+     +C          ++ +    E        L+ 
Sbjct: 378 TRWNSTYHMLDVTLELREAFSCFAQCDSMCN---------MVPSEDEWERVKEICDCLKL 428

Query: 336 FYETTKLISGTSYPTSNLYFMHIWKIECILIENLSCEDEIVKDMATRMKRKFDKYWKEYS 515
           FY+ T    G+ YPT+NLYF  ++++   L+E     ++ +  MA +MK KFDKYWK  +
Sbjct: 429 FYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIKMKEKFDKYWKISN 488

Query: 516 EVLAFGAILDPRMKFDFLMYCYKTIDDSTSEAKVDNSKSKLYKLYENY 659
            VLA   ++DPR K  F+ Y Y  I  + +E  +   +  +Y L   Y
Sbjct: 489 LVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRMVRQGVYDLCNEY 536

