BLASTX nr result

ID: Rehmannia28_contig00042618 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00042618
         (999 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP14369.1| unnamed protein product [Coffea canephora]            155   4e-41
emb|CDP09717.1| unnamed protein product [Coffea canephora]            143   9e-35
emb|CDP14470.1| unnamed protein product [Coffea canephora]            124   7e-28
emb|CDP00431.1| unnamed protein product [Coffea canephora]            102   4e-21
ref|XP_010687479.1| PREDICTED: uncharacterized protein LOC104901...   101   2e-20
ref|XP_010684219.1| PREDICTED: putative ribonuclease H protein A...    93   9e-18
ref|XP_013668888.1| PREDICTED: uncharacterized protein LOC106373...    94   9e-18
ref|XP_008340321.1| PREDICTED: putative ribonuclease H protein A...    94   1e-17
ref|XP_013653962.1| PREDICTED: uncharacterized protein LOC106358...    92   3e-17
ref|XP_008338354.1| PREDICTED: putative ribonuclease H protein A...    92   4e-17
ref|XP_008345396.1| PREDICTED: putative ribonuclease H protein A...    91   5e-17
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...    92   7e-17
ref|XP_008372934.1| PREDICTED: putative ribonuclease H protein A...    91   1e-16
ref|XP_013601782.1| PREDICTED: uncharacterized protein LOC106309...    90   1e-16
ref|XP_010666374.1| PREDICTED: uncharacterized protein LOC104883...    88   2e-16
ref|XP_013588992.1| PREDICTED: uncharacterized protein LOC106297...    89   3e-16
ref|XP_008338363.1| PREDICTED: uncharacterized protein LOC103401...    90   4e-16
ref|XP_013726958.1| PREDICTED: uncharacterized protein LOC106430...    88   1e-15
ref|XP_013589719.1| PREDICTED: uncharacterized protein LOC106298...    88   1e-15
ref|XP_009113011.1| PREDICTED: putative ribonuclease H protein A...    88   1e-15

>emb|CDP14369.1| unnamed protein product [Coffea canephora]
          Length = 347

 Score =  155 bits (392), Expect = 4e-41
 Identities = 97/307 (31%), Positives = 144/307 (46%), Gaps = 4/307 (1%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           +KHFIWKC   ILP N  ++ +    D  C  CGE+ ETIEH LF C+ A  IWK+A +S
Sbjct: 28  LKHFIWKCLQGILPVNAVIRERCSKGDSVCKCCGEYPETIEHLLFFCDNALAIWKVAPVS 87

Query: 181 WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEIDV 360
           W G   L  NF  WW  I +A   E  Q+RI+L+  +LW IWKSRN   F+      + V
Sbjct: 88  WAGLECLRNNFGHWWEEIREARAMESGQERIELTVNVLWQIWKSRNRRQFEDKGMDPMTV 147

Query: 361 VNCAVREWLEMHYSHTCLSAAKASTAAGEVA----NNKEICEEGLHVLAVSSSKICNDSV 528
           VN A REW E   +        + T  G+         E+    ++  A    K   +  
Sbjct: 148 VNKATREWREFQEAQDVDGGNGSHTTNGKEGLGGWREPEVGWVKINSDAAVQQKA--ERA 205

Query: 529 GLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIV 708
           G G +  D   N + AW        S+ +   LA+R A++ A  + +  +    D K+++
Sbjct: 206 GWGMIARDCLGNALGAWAVPDTCCSSAKQEEALALRSAMLMAKQQGWRRVVFESDCKQLI 265

Query: 709 HKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKASSEWL 888
             +N+G         + + + L  N + CC FS+  ++N   S  LA  A      +EW 
Sbjct: 266 DSINSGDGDSDIATILLDIVSLKSNFYKCC-FSFTRRMNNSVSHSLAKLALSLDGPAEWK 324

Query: 889 SHLPSWI 909
              P+W+
Sbjct: 325 VVFPAWL 331


>emb|CDP09717.1| unnamed protein product [Coffea canephora]
          Length = 613

 Score =  143 bits (360), Expect = 9e-35
 Identities = 79/269 (29%), Positives = 135/269 (50%), Gaps = 1/269 (0%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IKHFIW+  + +LP ND + ++    D  C  CGE EE+IEH  F C RA+ +WK+A + 
Sbjct: 313  IKHFIWRSLNGLLPVNDLVFKRIHQGDPICDGCGEQEESIEHLFFQCSRAQEVWKMAPIQ 372

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEIDV 360
            WDG      N   WW  + +A  +   ++ ++L+  ILW IWK RN W F   ++H  + 
Sbjct: 373  WDGLTEQTRNILVWWNSMLEATNRIEGREHVELTVNILWQIWKRRNEWKFNAKRRHPWES 432

Query: 361  VNCAVREWLEMHYS-HTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSVGLG 537
            +  A++EW E   +     S    +    E A   E   + + +   +  +   + VG G
Sbjct: 433  IKKALQEWQEQASAWREEKSTPVEAERDRETAETDEGGRDEMQIRLSTHVQEQTNRVGTG 492

Query: 538  CLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIVHKL 717
             + T++    +SAW     +  S ++ +  AV+ A++ A   ++  I + + + +++  +
Sbjct: 493  IIATNFNHQLVSAWALTDRYAGSHLQTSAEAVKMAIIKARQLQWQKITVHLLSPQLLKMI 552

Query: 718  NNGLAFECANMTIAEDIFLLMNLFDCCKF 804
             NGLA +    T+ +DI  L  LF  C F
Sbjct: 553  TNGLAKDIKMATLTDDINSLRALFQKCSF 581


>emb|CDP14470.1| unnamed protein product [Coffea canephora]
          Length = 660

 Score =  124 bits (310), Expect = 7e-28
 Identities = 85/302 (28%), Positives = 134/302 (44%), Gaps = 7/302 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            +K FIWKC    LP    + RK    D  C  CGE +ET+EH + NC+ ++ +WK A + 
Sbjct: 358  VKIFIWKCITGALPVRAAIFRKTRMGDPVCRLCGEDQETVEHLMLNCQHSQQVWKAAPIQ 417

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQG-VKKHEID 357
            WDG M    +F+ WW  I +A  ++   + I L+  ILW +WK RN   F+         
Sbjct: 418  WDGAMDQKGDFRRWWIRISEARTRQGGMEHIGLTAIILWQLWKERNSKEFENKTSCSPAR 477

Query: 358  VVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKE-----ICEEGLHVLAVS-SSKICN 519
             +  A +EWLE        +  K   +  E   NKE       EEG   L ++ +S+   
Sbjct: 478  TIGKAQKEWLEQEE----FTKGKTRLSIRETTCNKEEQHQGCNEEGTIKLEMAITSQNGQ 533

Query: 520  DSVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAK 699
             S+G+G   T +   +++ W          V    +AV   L  A   ++S I I   ++
Sbjct: 534  TSLGIGVTATKHPSLRLAEWALKERSKGDKVIDEAMAVHLVLCKAFEHQWSRITIQFQSQ 593

Query: 700  EIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKASS 879
            +++ ++           T+ EDI  +  LF  C FS   + + + S  L+  A       
Sbjct: 594  DLMRQIKYNSPSNSRLATLIEDILSMQKLFRLCLFSLANENSIKRSRDLSSHAFGILVDE 653

Query: 880  EW 885
            EW
Sbjct: 654  EW 655


>emb|CDP00431.1| unnamed protein product [Coffea canephora]
          Length = 344

 Score =  102 bits (253), Expect = 4e-21
 Identities = 48/130 (36%), Positives = 70/130 (53%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           IK FIW+C    L   + + R+       C  CG+  ETIEH LF+C +A+ +WKLA + 
Sbjct: 195 IKLFIWRCITNTLTARETIFRRTKQGSPICSRCGDGMETIEHILFHCHQAQKVWKLAPIQ 254

Query: 181 WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEIDV 360
           WDG       FK WW  + QA  +   +  I L+  +LW +WK RN   F+G ++  + +
Sbjct: 255 WDGIQNQTGCFKKWWAALSQATSRTEGRQHIALTANLLWQLWKDRNQMEFEGKEREGLKI 314

Query: 361 VNCAVREWLE 390
           V  A  EW+E
Sbjct: 315 VQKASSEWME 324


>ref|XP_010687479.1| PREDICTED: uncharacterized protein LOC104901585 [Beta vulgaris subsp.
            vulgaris]
          Length = 485

 Score =  101 bits (251), Expect = 2e-20
 Identities = 74/300 (24%), Positives = 128/300 (42%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            I+ F W+     +     L R+G++++  C  CGE +ET+EH L  C  ++ +W  + L 
Sbjct: 179  IRMFGWRAIKNGVAVRANLARRGVEIEKMCPRCGEFDETLEHMLLFCPESRKVWYYSPLR 238

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEIDV 360
             D     +  F +W   +  A  K+   D   L  ++ W IW  +N+W F+G K+  +DV
Sbjct: 239  VDVETVGSGRFHEWVERLAAAQLKD---DWWALFWFLCWNIWLGQNVWTFEGKKREFVDV 295

Query: 361  VNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSVGLGC 540
            V  AVR  LE  Y        K +    +    K    EG + L   ++   ++ VG+G 
Sbjct: 296  VERAVRGVLE--YDKVAQEGDKVAAVEADTVRWK-APREGDYKLNTDAAMFADNQVGMGA 352

Query: 541  LVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIVHKLN 720
            +V D++ + ++A C           A  L  R  L  A    F  + + +D  ++   L+
Sbjct: 353  VVRDFSGDVLAAMCCKMRGGDEVDIAEALCARRGLQIAMEAGFRSLVLEVDNLKLFSYLS 412

Query: 721  NGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKASSEWLSHLP 900
                       I +DI  L++      FS+  +   + + RLA   +       WL  +P
Sbjct: 413  RNKREASPFGFIVQDIQRLVHQCSSVSFSHTRRKGNEVAHRLAKMCNSIDGLKVWLEEVP 472


>ref|XP_010684219.1| PREDICTED: putative ribonuclease H protein At1g65750 [Beta vulgaris
           subsp. vulgaris]
          Length = 365

 Score = 92.8 bits (229), Expect = 9e-18
 Identities = 67/274 (24%), Positives = 121/274 (44%), Gaps = 1/274 (0%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           IK+F W+  H  +     L  +G+  D +C  CGE +ETI+H L  C  A+ IW+L+ L 
Sbjct: 88  IKNFGWRVLHNGIAVKANLLYRGMGEDRRCPVCGEGDETIKHALTLCRDARSIWRLSPLR 147

Query: 181 WDGRMFLNCNFKDWWFFICQAGKKEVFQDR-IQLSTYILWWIWKSRNLWIFQGVKKHEID 357
            + +   + N   W    C+     + Q+   ++   +LW IW  RN WIF+  +    D
Sbjct: 148 MEIQEMEDENVSQW----CEKLAATIVQESWWEIFWNLLWGIWLRRNAWIFKRKQVSVKD 203

Query: 358 VVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSVGLG 537
           +++ A+R  +E   +   +     S   G+   ++    EG   L   ++ +    +G G
Sbjct: 204 LIDKAMRFTIEYQAAKESVKDGLRSEGPGKKKWHRP--REGSLKLNTDAAVLAGGMIGFG 261

Query: 538 CLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIVHKL 717
            +V D       A C   D   +   A  +A+R  L  A    F  I +  D  +++ +L
Sbjct: 262 GVVRDSMGEVRLACCGRMDGRFAPDIAEAMAIREVLAIAGAAGFREIILENDCLKLISQL 321

Query: 718 NNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCK 819
              +    +   I +DI   +  F+   F+++C+
Sbjct: 322 KKHVLENSSFGNIVKDILDYVQAFNVVSFNHICR 355


>ref|XP_013668888.1| PREDICTED: uncharacterized protein LOC106373226 [Brassica napus]
          Length = 779

 Score = 94.4 bits (233), Expect = 9e-18
 Identities = 73/277 (26%), Positives = 126/277 (45%), Gaps = 4/277 (1%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            +K FIWK    ILPT ++L  + I++D +C  CG   E+I H LF+C  A+ +WKL+ L 
Sbjct: 483  VKMFIWKGLKGILPTGERLLERHINVDPRCKRCG-CSESINHLLFHCPFAREVWKLSPLD 541

Query: 181  WDGRMFLNCNFK-DWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEID 357
             +  +    + + DW     Q            L  +ILW IWK+RN ++F+    +  D
Sbjct: 542  GNFEVSGLTDLRADWTAVHAQRCLPPAGVSDTMLVPWILWSIWKARNKFLFENFAGNPAD 601

Query: 358  VVNCAV---REWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSV 528
             ++ A+   REW+         S+  +     E        +  + + + ++  + N   
Sbjct: 602  TLSMAIVAAREWVGAQQEKEVKSSPTSIQGTEE--------QVDMVIRSDAAWSVTNKCA 653

Query: 529  GLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIV 708
            GLG ++   AQ  I        FV S++ A  LA+R A+ F  +     ++   D+  ++
Sbjct: 654  GLGWVMLTQAQ--ILRGHKGASFVSSALTAEALALREAVKFGHDMNLKRVRFESDSAHLI 711

Query: 709  HKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCK 819
              +N    +      I EDI  L+N FD   F ++ +
Sbjct: 712  KAINKREPY-MELYGILEDILRLLNEFDVVVFGWISR 747


>ref|XP_008340321.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus
            domestica]
          Length = 565

 Score = 93.6 bits (231), Expect = 1e-17
 Identities = 75/303 (24%), Positives = 130/303 (42%), Gaps = 8/303 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IK FIW+C +  L     LKR+ + +D  CG C    ET  H  F CE + + W  + L 
Sbjct: 255  IKIFIWRCCNNALAVRRNLKRRHMRVDNVCGVCNAFNETENHLFFRCELSHVFWFCSPLH 314

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRI-QLSTYILWWIWKSRNLWIFQGVKKHEID 357
             +  +    +F + W   C   K  +  D I Q   + LW +WK+RN  +F+G+ +  +D
Sbjct: 315  LNSHVLEGSDFLESWSNFCAQVKDXINADDICQDFXFGLWRLWKNRNDVVFKGIHRQPLD 374

Query: 358  VVNCAVREWLE----MHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKI-CND 522
            ++    +   E    +  +    S     T+     N  +  +     + +++    C D
Sbjct: 375  ILEAWRKSTSEYKDSLARNADDFSXRLPKTSKATALNCTKWQKPRFGTIKINTDAAWCKD 434

Query: 523  --SVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDA 696
               +G+G L  D+A    +A  S   F  S+  A   A+R+AL+   +  F  I I  DA
Sbjct: 435  XMXMGVGWLGRDFAGLLQAAGGSGSGFCHSAAAAEASAIRYALLSXXDHGFXDIIIESDA 494

Query: 697  KEIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKAS 876
              I+  L   +  + +   I  DI +L        F++V +   + +  +A +A +   S
Sbjct: 495  STIIMMLKKEVLVDFSIECILXDIEVLAQKLRSVSFAFVPREGNRAAHSVAKYAFKEGRS 554

Query: 877  SEW 885
              W
Sbjct: 555  FSW 557


>ref|XP_013653962.1| PREDICTED: uncharacterized protein LOC106358691 [Brassica napus]
          Length = 387

 Score = 91.7 bits (226), Expect = 3e-17
 Identities = 69/282 (24%), Positives = 126/282 (44%), Gaps = 10/282 (3%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           I+ F+WK     LP  + L+++G+  ++ C  CG+ +ET  H   +C  A+ +W+LA +S
Sbjct: 95  IQVFLWKVLQGALPLGENLEKRGMRENIACVHCGD-KETAHHLFVSCSFARKLWELAPIS 153

Query: 181 WDGRMFLNCNFKDWWFFICQAGKKEVFQ------DRIQLSTYILWWIWKSRNLWIFQG-- 336
                  N +F D     C    K++            L T+I W +W +RN  IF+   
Sbjct: 154 THAIPSTNTSFAD-----CLTSSKKLICLPPSGIVEANLFTWISWALWTTRNKKIFENRT 208

Query: 337 -VKKHEIDVVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKI 513
              +  + +     REW     +       K ST A      ++  E  + V   ++  +
Sbjct: 209 ISPEKTLSIAISEAREWQGAQITEK--GRDKDSTMATTGCLRQQQVESSIKVFTDAAWNV 266

Query: 514 CNDSVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILID 693
            +   GLG + +D  +   +    F +FV S + A  LA+R ALV A N   + +++  D
Sbjct: 267 SHRGAGLGWVFSDTQRTAFAHGTRFEEFVGSPIMAEALAIRQALVEARNMGINDLQVNSD 326

Query: 694 AKEIVHKLNNGLAFECANMTIAEDIFLLMNLFD-CCKFSYVC 816
           ++ ++  +N           I E + +L ++++  C FS +C
Sbjct: 327 SQVLIRVINK-------KELIKELVGILQDIYELACCFSSIC 361


>ref|XP_008338354.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus
            domestica]
          Length = 435

 Score = 91.7 bits (226), Expect = 4e-17
 Identities = 75/305 (24%), Positives = 132/305 (43%), Gaps = 10/305 (3%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IK+FIW+C +  L     L+ + + +D  CG CG  +E+  H  F C  + L W  +SL 
Sbjct: 111  IKNFIWRCCNNALVVRRNLELRHMSVDNVCGVCGAVDESETHLFFRCHLSHLFWYSSSLQ 170

Query: 181  WDGRMFLNCNF-KDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEID 357
             +       +F   W  F  +   ++  ++ +Q   + LW +WK+RN  +F GV +  ++
Sbjct: 171  LNSLDLAGADFLSTWESFWSRVKGRDHAEEIMQEFAFGLWRLWKNRNEMVFNGVHRQPLE 230

Query: 358  VVNCAVREWLEMHYSHTCLSAAKASTAAGEVAN---NKEIC---EEGLHVLAVSS-SKIC 516
            ++    R   E  +    L       A G   N   ++  C   +    VL V++ +  C
Sbjct: 231  ILESWRRNIAE--FKDATLGQTTGDQAKGRPRNPPLDRGACRWVKLAFGVLKVNTDAPWC 288

Query: 517  NDSV--GLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILI 690
              S+  G+G +  D+A     A  S  +   S+      A+R AL    +  +  + I  
Sbjct: 289  RKSLRTGVGWVCRDFAGLLHGAGGSGAELYHSAAAGEAAAIRXALTACIDYGYDNVIIES 348

Query: 691  DAKEIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGK 870
            DAK I+  + + LA++ +   I  DI +L        FS V + +   +  +A F  +  
Sbjct: 349  DAKVIIQMIRHELAYDFSLECILGDIEVLARRLRSVSFSVVPRESNAAAHSVAKFVFKEG 408

Query: 871  ASSEW 885
               EW
Sbjct: 409  RKFEW 413


>ref|XP_008345396.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus
           domestica]
          Length = 426

 Score = 91.3 bits (225), Expect = 5e-17
 Identities = 71/288 (24%), Positives = 127/288 (44%), Gaps = 17/288 (5%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           IK FIW+C +  L     ++R+ + +D  CG C   +ET  H  F C  +   W  + L 
Sbjct: 102 IKLFIWRCCNNALAVRRNIQRRHMRVDNVCGVCNAVDETENHLFFRCSFSHTFWFCSPLH 161

Query: 181 WDGRMFLNCNFKDWWFFICQAGKKEVFQDRI-QLSTYILWWIWKSRNLWIFQGVKKHEID 357
            +       +F D W    +  K +V  + I Q   + LW +WK+RN  +F+G+ +  +D
Sbjct: 162 LNSHELEGFDFLDSWGKFQEGMKGKVNAEEICQEFAFGLWRLWKNRNDVVFKGLYRQPLD 221

Query: 358 VVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEI------CEEGLH--------VLA 495
           V++   +  +E           +AST+  + A NK++          LH        +  
Sbjct: 222 VMDLWWKNIIEFR---------EASTSLSDDAWNKDVTPFSAAARPSLHWQRPRYGTIKI 272

Query: 496 VSSSKICNDSV--GLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKF 669
            + +  C D++  G+G +  D+A    +A        LS+  A  +A+R+AL      +F
Sbjct: 273 NTDAAWCKDTLRAGVGWVGRDFAGVLQAAGGLGTVLCLSAAAAEAIAIRYALKACITHRF 332

Query: 670 SGIKILIDAKEIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYV 813
           + + I  DAK I+  +   +  +C    +  DI +L        F++V
Sbjct: 333 NHVVIESDAKLIIQMIRKEVTVDCNLDCVLGDIEILAQKLTSVTFAFV 380


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score = 92.0 bits (227), Expect = 7e-17
 Identities = 77/310 (24%), Positives = 128/310 (41%), Gaps = 7/310 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IKHFIW+C    L T  QL+ + I  D  C  C   +ETI H +F C  A+++W+ A+ S
Sbjct: 1397 IKHFIWRCLSGALSTTTQLRNRNIPADPTCQRCCNADETINHIIFTCSYAQVVWRSANFS 1456

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTY-ILWWIWKSRNLWIFQGVKKHEID 357
               R+    N ++    I Q  K +       L  + I+W +WKSRN ++FQ + +    
Sbjct: 1457 GSNRLCFTDNLEENIRLILQGKKNQNLPILNGLMPFWIMWRLWKSRNEYLFQQLDRFPWK 1516

Query: 358  VVNCA---VREWLEMHYSHTCLSAAKASTAAGEVANNKEICE--EGLHVLAVSSSKI-CN 519
            V   A     EW+E   + T +S   A +    ++ +K+     EG       S  +   
Sbjct: 1517 VAQKAEQEATEWVETMVNDTAISHNTAQSNDRPLSRSKQWSSPPEGFLKCNFDSGYVQGR 1576

Query: 520  DSVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAK 699
            D    G ++ D     + + C+      S+++A  L   HAL     R +  +    D  
Sbjct: 1577 DYTSTGWILRDCNGRVLHSGCAKLQQSYSALQAEALGFLHALQMVWIRGYCYVWFEGDNL 1636

Query: 700  EIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKASS 879
            E+ + +N          T+  DI   M         YV +     + +L  +A+   +  
Sbjct: 1637 ELTNLINKTEDHHLLE-TLLYDIRFWMTKLPFSSIGYVNRERNLAADKLTKYANSMSSLY 1695

Query: 880  EWLSHLPSWI 909
            E     P W+
Sbjct: 1696 ETFHVPPRWL 1705


>ref|XP_008372934.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus
            domestica]
          Length = 961

 Score = 91.3 bits (225), Expect = 1e-16
 Identities = 78/303 (25%), Positives = 129/303 (42%), Gaps = 13/303 (4%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIW-----K 165
            +K F+WKC +  L     LKR+ + +   C  C + +ET  H  F CE + + W     +
Sbjct: 637  MKFFVWKCCNNALAVRRNLKRRXLRVXNFCEGCMQPDETENHLFFRCEISHIFWFCSPLQ 696

Query: 166  LASLSWDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKK 345
            + SL  +G  FL      W  F  +   KE   + +Q   + LW +WK+RN  IF+GV  
Sbjct: 697  INSLELEGNDFL----ASWGSFCGRVKGKENEXELMQEFVFGLWHLWKNRNDLIFKGVTH 752

Query: 346  HEIDVVNCAVR---EWLEMHYSHTCLSAAKASTA----AGEVANNKEICEEGLHVLAVSS 504
              +D++   +R   E+ E     T         A    AG     K+     + V   ++
Sbjct: 753  QPLDILGLWMRNLDEYREAGLQRTNDDQPTVGLASPAPAGRTGQWKKPGFGTIKVNTDAA 812

Query: 505  SKICNDSVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKI 684
             +      G+G +  D+A    +A  +      S+  A +LA+R AL F     F  + I
Sbjct: 813  WRKETSCAGVGWVARDFAGLLQAAGGTGGRLCHSADAAEMLAIRSALDFCVRLGFKSVAI 872

Query: 685  LIDAKEIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFA-S 861
              DAK I+  + N    +C+      DI  L    +   F +V + + + +  +A +   
Sbjct: 873  ESDAKNIIQMIRNETTLDCSLECTLGDIATLARGLESVTFDFVSRESNRAAHSVAKYVFL 932

Query: 862  EGK 870
            EGK
Sbjct: 933  EGK 935


>ref|XP_013601782.1| PREDICTED: uncharacterized protein LOC106309283 [Brassica oleracea
           var. oleracea]
          Length = 455

 Score = 90.1 bits (222), Expect = 1e-16
 Identities = 67/251 (26%), Positives = 118/251 (47%), Gaps = 10/251 (3%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           +KHF WK   + +P  ++L+ + +D+D KC  C EH E+I H LF+C+ A+ +W+LA  +
Sbjct: 154 VKHFAWKLLKRAIPVGERLQERHLDVDPKCKRC-EHNESILHLLFHCQYAQEVWQLAPFT 212

Query: 181 WDGRMFLNCNFKDWWFFIC-QAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEID 357
            +       +    W  IC Q            L  +ILW IWKSRN ++F+GV     +
Sbjct: 213 TEMEYRGIIDLMSSWPSICAQKCLPPSGVTSGALFPWILWSIWKSRNRFVFEGVFISPKE 272

Query: 358 VVNCAV---REWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSS---KICN 519
            ++ A+   REW          S +KA         NK++ +    V AV S       +
Sbjct: 273 TLSTAIKLAREW---------CSDSKADPTRPSARINKQMAQGETSVTAVRSDAAWMASS 323

Query: 520 DSVGLGCLVTDYAQNKISAWCSFR---DFVLSSVEATLLAVRHALVFAANRKFSGIKILI 690
           +  GLG  +   + ++      ++   +F+ S + A  LA+R A+          I++  
Sbjct: 324 NLAGLGWTILSESGHEYKKRAEYKKRAEFIPSPLVAEGLAMREAIQSYRRSDLQDIRLES 383

Query: 691 DAKEIVHKLNN 723
           D+ +++  +N+
Sbjct: 384 DSAQLIQCVNS 394


>ref|XP_010666374.1| PREDICTED: uncharacterized protein LOC104883530 [Beta vulgaris
           subsp. vulgaris]
          Length = 283

 Score = 87.8 bits (216), Expect = 2e-16
 Identities = 70/281 (24%), Positives = 123/281 (43%), Gaps = 2/281 (0%)
 Frame = +1

Query: 28  HQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLSWDGRMFLNC 207
           HQ LP  D L R+G D+D     CG   ETI H LF C  A+L+WK++ L  +  + L  
Sbjct: 2   HQGLPMRDNLSRRGCDVDRIFPMCGNGVETILHALFLCPIAQLVWKVSPLRLEVFVGLGE 61

Query: 208 NFKDWWFFICQAGKKEVFQDRIQLSTY--ILWWIWKSRNLWIFQGVKKHEIDVVNCAVRE 381
           +F DW  F         F D +  + +  +LW IW ++N W+F+      ++VV  A+ +
Sbjct: 62  SFMDWVKFFALK-----FTDDLWWALFWSLLWGIWLNKNAWVFENRNLEAVNVVERAISK 116

Query: 382 WLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSVGLGCLVTDYAQ 561
             E + ++  +         GE     +   +G   +   ++      +G G ++ D   
Sbjct: 117 MGEFNAANENVKLDTLVIRGGE--KRWQAPRDGWFQVNSDAAVFEEGKIGCGWVMRDSVG 174

Query: 562 NKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIVHKLNNGLAFEC 741
           + + A     D  ++  EA  +A+RHA+  A            D  +++  L  GL    
Sbjct: 175 DVMVATMCLMDGGMAVDEAEAVALRHAVKIAIEAGLLDFVAGSDNLKLIDHLKKGLKENT 234

Query: 742 ANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASE 864
           +   I  DI  ++ +     F++VC    + + +LA  + E
Sbjct: 235 SFGNIVADILDVVYVCRRVVFNHVCMDGNRVAHKLAHLSRE 275


>ref|XP_013588992.1| PREDICTED: uncharacterized protein LOC106297259 [Brassica oleracea
           var. oleracea]
          Length = 449

 Score = 89.4 bits (220), Expect = 3e-16
 Identities = 67/245 (27%), Positives = 111/245 (45%), Gaps = 4/245 (1%)
 Frame = +1

Query: 1   IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
           +KHF WK   + +P  ++L+ + +D+D KC  C EH E+I H LF+C+ A+ +W+LA  +
Sbjct: 154 VKHFAWKLLKRAIPVGERLQERHLDVDPKCKRC-EHNESILHLLFHCQYAQEVWQLAPFT 212

Query: 181 WDGRMFLNCNFKDWWFFIC-QAGKKEVFQDRIQLSTYILWWIWKSRNLWIFQGVKKHEID 357
            +       +    W  IC Q            L  +ILW IWKSRN ++F+GV     +
Sbjct: 213 TEMEYRGIIDLMSSWPSICAQKCLPPSGVTSGALFPWILWSIWKSRNRFVFEGVFISPKE 272

Query: 358 VVNCAV---REWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICNDSV 528
            ++ A+   REW          S +KA         NK++ +    V AV S      S 
Sbjct: 273 TLSTAIKLAREW---------CSDSKADPTRPSARINKQMAQGETSVTAVRSDAAWMASS 323

Query: 529 GLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKEIV 708
            L  L           +    +F+ S + A  LA+R A+          I++  D+ +++
Sbjct: 324 NLAGLGWTILSESGHEYKKRAEFIPSPLLAEGLAMREAIQSCRRSDLQDIRLESDSAQLI 383

Query: 709 HKLNN 723
             +N+
Sbjct: 384 QCVNS 388


>ref|XP_008338363.1| PREDICTED: uncharacterized protein LOC103401427 [Malus domestica]
          Length = 1877

 Score = 89.7 bits (221), Expect = 4e-16
 Identities = 72/303 (23%), Positives = 128/303 (42%), Gaps = 8/303 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            I+ FIW+C +  L     LKR+ + +D  CG C    ET  H  F CE + + W  + L 
Sbjct: 136  IRIFIWRCCNNALAVRRNLKRRHMRVDNVCGVCIAVNETENHLFFRCEISHVFWFCSPLH 195

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQLS-TYILWWIWKSRNLWIFQGVKKHEID 357
             +  +    +F + W   C   K  +  D I+    + LW +WK+RN  +F+G+ +  +D
Sbjct: 196  LNSHVLEGRDFLESWCNFCDQVKDRIDADDIRHDFAFGLWRLWKNRNDVVFKGIYRQPLD 255

Query: 358  VVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLH-----VLAVSSSKICND 522
            ++    +   E   S        +      +  +  IC +        +   + +  C D
Sbjct: 256  ILEAWKKSTGEYKASLAPDQEDHSLRMPKTIMVSDRICTKWKRPRFGTIKINTDAAWCKD 315

Query: 523  S--VGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDA 696
            +  +G+G L  D+A    +A  S   F  S+  A   A+R AL+   +  F  I I  DA
Sbjct: 316  TLRMGVGWLGRDFAGLLQAAGGSGTGFSHSAAAAEASAIRFALLSCIDHGFDDIIIESDA 375

Query: 697  KEIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLALFASEGKAS 876
              I+  L   +  + +   I +DI +L        F++V +   + +  +A +  +   S
Sbjct: 376  STIIMMLKKEILVDFSIECILDDIEILAQKLRSVSFAFVPREGNRAAHSVAKYVFKEGRS 435

Query: 877  SEW 885
              W
Sbjct: 436  FSW 438


>ref|XP_013726958.1| PREDICTED: uncharacterized protein LOC106430715 [Brassica napus]
          Length = 820

 Score = 88.2 bits (217), Expect = 1e-15
 Identities = 78/301 (25%), Positives = 137/301 (45%), Gaps = 8/301 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IK F+WK  H  LP  +QL  + I +D KC  CG   E+I+H   +C+ AK + K A + 
Sbjct: 533  IKLFVWKTLHGALPVGEQLLARQIQVDGKCKLCG-LSESIDHLFLHCKFAKEVCKSAPVW 591

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQ---LSTYILWWIWKSRNLWIFQGVKKHE 351
                     + +  W  +C   +K +    +    L+ +ILW +WK+RN  IF+      
Sbjct: 592  PSNDYSGTIDLRSEWRSLCT--RKNLPPTGLSSGALAPWILWQLWKARNSLIFKDKGFSA 649

Query: 352  IDVVNCAV---REWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICND 522
             +V++ A+   REW +  +    LS ++      +        E  + V + ++    N+
Sbjct: 650  TEVISMAIAAAREWNDSQHKTPVLSRSQPIRTIPQ--------ERCVLVRSDAAWSEANN 701

Query: 523  SVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKE 702
              GLG +V   +Q++ S++ S R FV S + A  LA+R A+        + ++   D  +
Sbjct: 702  IAGLGWIVK--SQSRTSSFSSPRRFVGSPLVAEGLALREAVTKCKELGLTRVRFESDCDQ 759

Query: 703  IVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLA--LFASEGKAS 876
            ++  L +      A   I  DI  +   F+C  FS++ +     +  LA     +EG  +
Sbjct: 760  LIKALTSDYPM-AALYGIVSDIKSVALSFECISFSWISREKNSEAYSLAKQALVAEGDTN 818

Query: 877  S 879
            S
Sbjct: 819  S 819


>ref|XP_013589719.1| PREDICTED: uncharacterized protein LOC106298188 [Brassica oleracea
            var. oleracea]
          Length = 820

 Score = 88.2 bits (217), Expect = 1e-15
 Identities = 78/301 (25%), Positives = 137/301 (45%), Gaps = 8/301 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IK F+WK  H  LP  +QL  + I +D KC  CG   E+I+H   +C+ AK + K A + 
Sbjct: 533  IKLFVWKTLHGALPVGEQLLARQIQVDGKCKLCG-LSESIDHLFLHCKFAKEVCKSAPVW 591

Query: 181  WDGRMFLNCNFKDWWFFICQAGKKEVFQDRIQ---LSTYILWWIWKSRNLWIFQGVKKHE 351
                     + +  W  +C   +K +    +    L+ +ILW +WK+RN  IF+      
Sbjct: 592  PSNDYSGTIDLRSEWRSLCT--RKNLPPTGLSSGALAPWILWQLWKARNSLIFKDKGFSA 649

Query: 352  IDVVNCAV---REWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICND 522
             +V++ A+   REW +  +    LS ++      +        E  + V + ++    N+
Sbjct: 650  TEVISMAIAAAREWNDSQHKTPVLSRSQPIRTIPQ--------ERCVLVRSDAAWSEANN 701

Query: 523  SVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAKE 702
              GLG +V   +Q++ S++ S R FV S + A  LA+R A+        + ++   D  +
Sbjct: 702  IAGLGWIVK--SQSRTSSFSSPRRFVGSPLVAEGLALREAVTKCKELGLTRVRFESDCDQ 759

Query: 703  IVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYVCKLNYQPSVRLA--LFASEGKAS 876
            ++  L +      A   I  DI  +   F+C  FS++ +     +  LA     +EG  +
Sbjct: 760  LIKALTSDYPM-AALYGIVSDIKSVALSFECISFSWISREKNSEAYSLAKQALVAEGDTN 818

Query: 877  S 879
            S
Sbjct: 819  S 819


>ref|XP_009113011.1| PREDICTED: putative ribonuclease H protein At1g65750 [Brassica rapa]
          Length = 682

 Score = 87.8 bits (216), Expect = 1e-15
 Identities = 76/278 (27%), Positives = 128/278 (46%), Gaps = 7/278 (2%)
 Frame = +1

Query: 1    IKHFIWKCWHQILPTNDQLKRKGIDLDLKCGFCGEHEETIEHFLFNCERAKLIWKLASLS 180
            IK FIWK +H+ LP  +QL+ + I +D KC  CG   ETI+H   +C  AK +W L  + 
Sbjct: 415  IKLFIWKLFHRALPVGEQLRARQIKVDGKCKLCG-LPETIDHLFLHCSFAKRVWALVPV- 472

Query: 181  WDGRMFL-NCNFKDWWFFICQAGKKEVFQDRI---QLSTYILWWIWKSRNLWIF--QGVK 342
            W    F      +  W  IC   ++ +    I    L+ +ILW +WK+RN  +F  +G  
Sbjct: 473  WPSIEFAGTAELRSEW--ICMLTRQALPPTGIATGSLAPWILWQLWKARNSLVFNDKGFN 530

Query: 343  KHE-IDVVNCAVREWLEMHYSHTCLSAAKASTAAGEVANNKEICEEGLHVLAVSSSKICN 519
              E I + + A REW +        S A+  T          + E+ + V + ++     
Sbjct: 531  ATEVITLASAAAREWNQ--------SQAQTLTKQKHPPVRASLPEDCVVVRSDAAWNEAK 582

Query: 520  DSVGLGCLVTDYAQNKISAWCSFRDFVLSSVEATLLAVRHALVFAANRKFSGIKILIDAK 699
               GLG ++   +QN+ S +     FV S + A  +A+R A++   +     ++   D  
Sbjct: 583  KVAGLGWIIK--SQNRASVFSEPARFVGSPLVAEGMALREAVLKCRDLGLPRLRCESDCA 640

Query: 700  EIVHKLNNGLAFECANMTIAEDIFLLMNLFDCCKFSYV 813
            +++  L +  +F      I EDI  +   FD   F+++
Sbjct: 641  QLIKALTSDQSFS-ELYGIVEDIKSVSFSFDFISFAWI 677