BLASTX nr result

ID: Alisma22_contig00003398 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00003398
         (2287 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

JAT53351.1 Retrovirus-related Pol polyprotein LINE-1 [Anthurium ...   196   2e-48
XP_019167697.1 PREDICTED: uncharacterized protein LOC109163410 [...   189   5e-46
JAT50758.1 Retrovirus-related Pol polyprotein LINE-1, partial [A...   188   6e-46
JAT45180.1 Putative ribonuclease H protein At1g65750, partial [A...   181   6e-46
GAV92425.1 zf-RVT domain-containing protein, partial [Cephalotus...   182   6e-46
GAV91933.1 zf-RVT domain-containing protein [Cephalotus follicul...   178   2e-45
JAT65994.1 Putative ribonuclease H protein At1g65750, partial [A...   179   3e-45
XP_017228355.1 PREDICTED: uncharacterized protein LOC108203735 [...   183   3e-44
XP_019158195.1 PREDICTED: uncharacterized protein LOC109154910 [...   181   1e-43
JAU28950.1 Putative ribonuclease H protein, partial [Noccaea cae...   175   2e-43
XP_018435547.1 PREDICTED: uncharacterized protein LOC108807802 [...   180   3e-43
XP_018474345.1 PREDICTED: uncharacterized protein LOC108845678 [...   176   1e-42
GAV92728.1 zf-RVT domain-containing protein, partial [Cephalotus...   168   1e-42
GAV92297.1 hypothetical protein CFOL_v3_35677 [Cephalotus follic...   169   2e-42
XP_019163505.1 PREDICTED: uncharacterized protein LOC109159849 [...   177   2e-42
XP_017227807.1 PREDICTED: uncharacterized protein LOC108203403 [...   174   2e-42
JAU25120.1 LINE-1 retrotransposable element ORF2 protein, partia...   176   3e-42
JAU74353.1 hypothetical protein LE_TR15446_c14_g1_i1_g.48810, pa...   176   5e-42
XP_010530577.2 PREDICTED: uncharacterized protein LOC104807150 i...   176   6e-42
XP_010530576.1 PREDICTED: uncharacterized protein LOC104807150 i...   176   7e-42

>JAT53351.1 Retrovirus-related Pol polyprotein LINE-1 [Anthurium amnicola]
          Length = 1225

 Score =  196 bits (497), Expect = 2e-48
 Identities = 131/467 (28%), Positives = 215/467 (46%), Gaps = 14/467 (2%)
 Frame = -1

Query: 2287 GFSNLIRKGIMTKSLQPIKARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMS 2108
            GFS ++R     + +Q +       +SH++FADDL++FMK     A  + +    FG  S
Sbjct: 666  GFSVMMRDLCDRRKIQ-VPGLNGVSISHILFADDLIIFMKDDLGTAHAVADVIAQFGTYS 724

Query: 2107 GLKLNVCXXXXXXXXXXXVEPIL-EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDR 1931
            GL  N                 L +IL + E  +  ++YLG+PLF   L   HC  L+D+
Sbjct: 725  GLHFNCGKSRVYIGAKVSCRRSLPDILGVSESSLP-VRYLGLPLFSKSLKDVHCQGLVDK 783

Query: 1930 LKKRLDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSE 1751
            +++++ SWK + LS  GRLELIKS L +  L+              I  ++  F W   +
Sbjct: 784  VRRKISSWKNRFLSKAGRLELIKSILSSYSLYWTSAFALPGSIINAIEGLLSRFFWSGGD 843

Query: 1750 GKKNIHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLS 1571
              K++H+++W N+  P +EGGLG+R   E   AC++   WD+   + SLW+ W+ A YLS
Sbjct: 844  MVKSLHMIAWKNICKPKTEGGLGIRGIGEWNRACLLVQLWDILHFRPSLWIDWVYASYLS 903

Query: 1570 KDTFWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWV-NGKLLSDCDE 1394
            K + W  +     SW++  +L  R  L   I + IG  S F     PW+ +G+ + +   
Sbjct: 904  KSSIWEKKRRVYDSWVWKHILDGRNILRQHIHYIIGDGSTFSFLYDPWLPSGQSVFELIG 963

Query: 1393 SFKPITFGIGSEAVMSMIISNGA*SLPKSRV----------VVSILNFIEDNNIKIGIKD 1244
                   GI     + + I  G   LP +            +  + +FI    I +G +D
Sbjct: 964  RDGIQVMGIPFSTRLGLFIQEGQWRLPVATPADFYTHGCAGLRRLWHFILSTQI-LGGED 1022

Query: 1243 SITFL-DKADDSLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYS 1067
             I +  +     ++  W+  R  +    + S +W     + Y    W+  + +L T  Y 
Sbjct: 1023 CIRWKHNNGVWHVRHAWEVIRVVSPIVSWSSMVWTSPTIQKYSITLWQAAVGRLATEVYL 1082

Query: 1066 RRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVWKACLDE-GLIKS 929
            ++ G  +   C+LC    E   HL F C F+K +W   L   G+I+S
Sbjct: 1083 QKRGFHLAGRCILCYHSEEHIDHLFFNCDFSKWIWMQVLKRLGIIRS 1129


>XP_019167697.1 PREDICTED: uncharacterized protein LOC109163410 [Ipomoea nil]
          Length = 1586

 Score =  189 bits (479), Expect = 5e-46
 Identities = 135/454 (29%), Positives = 209/454 (46%), Gaps = 16/454 (3%)
 Frame = -1

Query: 2272 IRKGIMTKSLQPIK-ARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKL 2096
            I++ +  K+ +PI  +R    VSHL FA+DLMLF +AS D  R I N    F   SGL +
Sbjct: 880  IQEKVRVKNWKPITLSRGGTGVSHLFFAEDLMLFAEASEDQGRMIMNCLKKFSGKSGLNI 939

Query: 2095 NVCXXXXXXXXXXXVEPILEILNMKEGDISDL--KYLGIPLFMGKLSKKHCYPLIDRLKK 1922
            NV             E    + N+    +SD    YLGIP+   ++SK     ++D +K+
Sbjct: 940  NVSKSNIFCSPNTNSEIKRGLKNLTGISVSDNLGTYLGIPILHKRVSKHTFGYILDGMKR 999

Query: 1921 RLDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKK 1742
            +L +WK  +LSL GR  L++S L ++ ++              I +I RDFLWG S   K
Sbjct: 1000 KLANWKGNMLSLAGRRTLVQSALSSMPVYTMQVFKLPAGTCNDIDRICRDFLWGDSAQNK 1059

Query: 1741 NIHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDT 1562
             +HLV W+++      GGLG+RKC +  +A + K AW L      LWVK +  KY+    
Sbjct: 1060 KVHLVGWNDICKSKDAGGLGLRKCLDFNNALLAKLAWQLVTSHDKLWVKVMREKYVKNKN 1119

Query: 1561 FWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKP 1382
            F++  +  N SW +  ++  R  +     + IG       W   W   K L+  DE   P
Sbjct: 1120 FFATPMIANASWGWRSIMRGRSIVELGAAWRIGNGLSLNFWSDWWTGDKPLAFMDEVTIP 1179

Query: 1381 ITFGIGSEAVMSMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIKDS-ITFLDKADDSL- 1208
             +    SEA +S  I      LP     V  L+ +  ++I  GI+ + I   ++  DSL 
Sbjct: 1180 DS---QSEAKVSDFI------LPNRTWNVDKLSSLLPHDIIDGIRATPIAVCEQVSDSLY 1230

Query: 1207 -----------KDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRR 1061
                       +  +      + ++   SW+WK +V +      W +   +L T     R
Sbjct: 1231 WPRSPTGAFSVRSAFSHIAGNDEDAMDTSWVWKMRVTERCRLFLWLLMRGRLLTNMERWR 1290

Query: 1060 LGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVWK 959
             G+T  ++C  C D  E   H+  +C FAK  W+
Sbjct: 1291 KGMTEDTLCATCGDGDETMNHVLMECSFAKDCWR 1324


>JAT50758.1 Retrovirus-related Pol polyprotein LINE-1, partial [Anthurium
            amnicola]
          Length = 1070

 Score =  188 bits (477), Expect = 6e-46
 Identities = 115/392 (29%), Positives = 194/392 (49%), Gaps = 6/392 (1%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVE--PIL 2039
            +SH++FADDL++FMK      R I N    F  +SGL LN C            +   I 
Sbjct: 682  ISHILFADDLIIFMKDHIGTVREIANVLKQFSTLSGLHLN-CNKSKVYIGAKISQRRAIH 740

Query: 2038 EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKS 1859
            +IL +K+  +  + YLG+ LF   L   HC  L+D+++KR+ SWK +LLS  GRLELI+S
Sbjct: 741  DILGVKDSSLP-VCYLGLSLFSKSLKANHCQFLVDKIRKRISSWKTRLLSKAGRLELIRS 799

Query: 1858 TLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGV 1679
             L +  ++              I  ++  F W  +E  ++ H+V+W ++  P  EGGLG+
Sbjct: 800  ILSSYSIYWSTAFALPCSIIHAIEALLSKFFWSGNEADRSFHMVAWKSICKPKVEGGLGI 859

Query: 1678 RKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAVR 1499
            R  +E   AC++   WDL +++ SLW  W+ A YL + + W  +     SW++  +L  R
Sbjct: 860  RSIAEWNKACILAQLWDLLQKRPSLWTDWVYASYLPRTSIWLKKKRSYDSWIWKHILDCR 919

Query: 1498 GSLLGLIKFHIGANSQFKVWKAPWV-NGKLLSDCDESFKPITFGIGSEAVMSMIISNGA* 1322
             +L   I + +G  + F  +  PW+ +G+ + D          G+  + ++  ++ +   
Sbjct: 920  DTLKQHIHYVVGDGTSFSFFYDPWLPSGQSIFDLSGDLIRHNLGLPHDILIGHLLQDRCW 979

Query: 1321 SLPKSRV--VVSILNFIEDNNIKIGIKDSITFL-DKADDSLKDVWDATRRKNSESQFLSW 1151
            SLP  R   +V+I   I    I+    DSI +  +    ++++ W+ TR+   +  + S 
Sbjct: 980  SLPPPRTMEMVNIWPIIYSIPIQ-NYSDSICWKHNNGVWNVRNAWEVTRKTRPKVAWCSL 1038

Query: 1150 MWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLG 1055
            +W       Y    W+  L KL T D  ++ G
Sbjct: 1039 VWAHPTIPRYSITLWQAALGKLTTGDNLQKRG 1070


>JAT45180.1 Putative ribonuclease H protein At1g65750, partial [Anthurium
            amnicola]
          Length = 502

 Score =  181 bits (460), Expect = 6e-46
 Identities = 115/370 (31%), Positives = 188/370 (50%), Gaps = 6/370 (1%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVE-PILE 2036
            +SH+IFADDL++FMK   D  R I +    FG  SGL+LN              +  I  
Sbjct: 135  ISHIIFADDLIIFMKDDLDTIRAIADIIQIFGSFSGLQLNCDKSKVYIGAKSSYKRAIPS 194

Query: 2035 ILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKST 1856
            IL + E  +  +KYLG+PLF   L   +C  L+++++KR+ +W++ LLS  GRLELI+S 
Sbjct: 195  ILKVHESSLP-VKYLGLPLFSKALKVVYCQHLVEKVRKRISNWRSNLLSKAGRLELIRSI 253

Query: 1855 LFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGVR 1676
            LF+  ++              I   + +F W  S   K+ H++SW  +  P  EGGLG+R
Sbjct: 254  LFSYSIYWSSAFTLPQLTIHDIEAQLSNFFWSGSGEDKSFHMISWKTICKPKDEGGLGIR 313

Query: 1675 KCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNY-SWLFNKLLAVR 1499
            +  +   ACV+   WD+ ++++SLWV W+ A YL+K + W +++  +Y SW +  +L  R
Sbjct: 314  RIGDWHKACVLAQLWDILQKRKSLWVDWVYASYLAKTSIW-VKIRRSYDSWTWKHILDCR 372

Query: 1498 GSLLGLIKFHIGANSQFKVWKAPWV-NGKLLSDCDESFKPITFGIGSEAVMSMIISNGA* 1322
              L   I++ +G  S F     PW+ NGK + +       +  GI  +  +   I  G  
Sbjct: 373  DILRQHIRYTVGDGSGFSFLYDPWLHNGKSIYELVGDRVRLVMGIPHDTRLLHYIHAGHW 432

Query: 1321 SL--PKSRVVVSILNFIEDNNIKIGIKDSITFL-DKADDSLKDVWDATRRKNSESQFLSW 1151
             L  P S  +++I   I +  I + I+DSI +  +    ++++ W+ATR    +  +   
Sbjct: 433  ILPTPTSTEMLTIWPIIHNTPIAL-IRDSIYWKHNNGVWNVRNAWEATREAGPKVGWYQM 491

Query: 1150 MWKCKVPKNY 1121
             W       Y
Sbjct: 492  AWNSPTIHKY 501


>GAV92425.1 zf-RVT domain-containing protein, partial [Cephalotus follicularis]
          Length = 520

 Score =  182 bits (461), Expect = 6e-46
 Identities = 119/428 (27%), Positives = 193/428 (45%), Gaps = 7/428 (1%)
 Frame = -1

Query: 2206 HLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNV--CXXXXXXXXXXXVEPILEI 2033
            HL +ADDL++F  A      FI    + F  +SGL                   + IL +
Sbjct: 1    HLCYADDLLIFAAADLKTIEFIKQGLECFKNVSGLAAGTDKSSIFFCNTNRRTRDLILRL 60

Query: 2032 LNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKSTL 1853
               ++  +  +KYLG+PL   +L+K+ C PL++++  R +SW +K LS  GRL+LI+STL
Sbjct: 61   TQFRQATLP-VKYLGLPLITSRLTKQDCAPLLEKIMARANSWVSKSLSYAGRLQLIQSTL 119

Query: 1852 FALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGVRK 1673
             ++ +                 + +R FLWG S       LV W  V LP  EGGLG++ 
Sbjct: 120  ASMQVFWASTFLLPATIIKDCERTLRRFLWGGSGNSHKHSLVKWSKVCLPRQEGGLGIKS 179

Query: 1672 CSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAVRGS 1493
                  A ++K  W+L  +  SLWV+W +   + K  FW++      SW + +++ +R +
Sbjct: 180  LKSWNRALLLKQIWNLLTD-HSLWVQWCKVNLIRKHNFWTLSSYGPLSWSWKQIILLRKT 238

Query: 1492 LLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIISNGA*SLP 1313
             L  +++  G   +F +W  PW +G  +     +      G+G   ++  +I+NG    P
Sbjct: 239  ALNHLRYVTGKGDKFSLWFDPWFHGSSIYATYGNRVIYESGMGLSVLVQEVIANGQWCWP 298

Query: 1312 KSRVVVSILNFIEDNNIKIGIKDSI--TFLDKADD--SLKDVWDATRRKNSESQFLSWMW 1145
                + S L  I+   + I I  S    F DK     S+K  W++ R       +   +W
Sbjct: 299  ---TISSELIDIQSRALHIPITSSSDHIFWDKVGSPFSIKSAWESIRASAPLVDWAKLVW 355

Query: 1144 -KCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKM 968
               ++PK+   H W    N L T D    LGI   + C       E   HL F C +   
Sbjct: 356  HSSRIPKHAFCH-WMAIQNALKTMDKLFLLGIVHDTCCKFNCGGNESVEHLFFACPYTNG 414

Query: 967  VWKACLDE 944
            +W   L++
Sbjct: 415  IWTEVLNK 422


>GAV91933.1 zf-RVT domain-containing protein [Cephalotus follicularis]
          Length = 430

 Score =  178 bits (451), Expect = 2e-45
 Identities = 115/416 (27%), Positives = 188/416 (45%), Gaps = 3/416 (0%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLK--LNVCXXXXXXXXXXXVEPIL 2039
            ++HL +ADDL++F  A   +  FI +  D F  +SGL   L               + IL
Sbjct: 17   LNHLCYADDLLIFAAADLQSIGFIKHGMDRFKEVSGLPAGLEKSSIFFCNTKRRTRDHIL 76

Query: 2038 EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKS 1859
             +   ++G +  +KYLG+PL   +L K+ C P+I+++  R++SW  K LS  GRL+LIKS
Sbjct: 77   SMTQFRQG-VLPVKYLGLPLITSRLKKRDCTPIIEKILARVNSWVTKSLSYAGRLQLIKS 135

Query: 1858 TLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGV 1679
            TL ++ +                 +I R FLWG S   +   LV W NV LP  EGGLG+
Sbjct: 136  TLASMQVFWCSTFLLPAAVIKDCERIWRSFLWGNSGSSRKHSLVKWSNVCLPRQEGGLGI 195

Query: 1678 RKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAVR 1499
            +       A ++K  W L  +  SLWV+W +   + K +FW++  S + SW + ++L +R
Sbjct: 196  KSLKAWNQALLLKQIWSLLND-HSLWVQWCKLNLIRKHSFWTLPSSGSLSWSWRQILLLR 254

Query: 1498 GSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIISNGA*S 1319
             + L  + +  G  ++F +W  PW +G  +            G+ +  ++  +I N    
Sbjct: 255  NTALTHLVYVCGKGARFSLWYDPWFHGTSIYAIYGHKVIYDAGLANTELVHSVIENDQWC 314

Query: 1318 LP-KSRVVVSILNFIEDNNIKIGIKDSITFLDKADDSLKDVWDATRRKNSESQFLSWMWK 1142
             P  S  ++ I + ++D  I       +  +       K  W + R    +  +   +W 
Sbjct: 315  WPTTSPDLLDIQSRVQDIPITSSPYCIMWEIVGRPFLTKQAWKSIRTSAPQVVWAKLVWH 374

Query: 1141 CKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFA 974
                  + F  W      L T D      I   + C+    D E+  HL F C FA
Sbjct: 375  PSCIPKHAFCLWLAIQGALKTLDKLLHRVILTSASCVFNCGDNENVDHLYFACPFA 430


>JAT65994.1 Putative ribonuclease H protein At1g65750, partial [Anthurium
            amnicola]
          Length = 507

 Score =  179 bits (455), Expect = 3e-45
 Identities = 103/373 (27%), Positives = 179/373 (47%), Gaps = 3/373 (0%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVEPILEI 2033
            +SH++FADDL++FMK      R I +    FG +SGL LN                ++ +
Sbjct: 135  ISHILFADDLIIFMKDDLGTIRKIADVISQFGEISGLHLNCDKSKVYVGARVSHRRVIPV 194

Query: 2032 LNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKSTL 1853
            +         ++YLG+PLF   L   HC  L+++++K++ SWK +LLS  GRLELI+S L
Sbjct: 195  ILGVNESALPVRYLGLPLFSKSLKASHCQSLVEKVRKKISSWKTRLLSKAGRLELIRSIL 254

Query: 1852 FALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGVRK 1673
             +  ++              I  ++  F W  +   ++ H+V+W  +  P  EGGLG+R 
Sbjct: 255  SSYSIYWTTAFALPCSTIHAIESLLSKFFWSGNVEDRSFHMVAWKVICKPKEEGGLGIRS 314

Query: 1672 CSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAVRGS 1493
              E   AC++   WDL +++ SLW +W+ A YL + + W  +     SW++  +L  R  
Sbjct: 315  IEEWNKACILGQLWDLLQKRPSLWTEWVYASYLHRTSIWQKKRRTYDSWIWKHILDCRNI 374

Query: 1492 LLGLIKFHIGANSQFKVWKAPWV-NGKLLSDCDESFKPITFGIGSEAVMSMIISNGA*SL 1316
            L   I++ +G  + F  +  PW+ +G+ + D          G   +  +++ I N   SL
Sbjct: 375  LKQHIQYVVGNGASFSFFYDPWLPSGQSIFDISGDLIRHALGYPQDICIALFIQNSCWSL 434

Query: 1315 PKSRVVVSILNFIEDNNIKIGIK-DSITF-LDKADDSLKDVWDATRRKNSESQFLSWMWK 1142
            P  R +  I  +    +I I ++ DSI +  +     ++  W+ TR  + +  + S +W 
Sbjct: 435  PPPRTMEMIHLWPVIYSIPIQVQADSICWKYNNGVWKVRHAWEVTRVTSPKVVWSSLVWT 494

Query: 1141 CKVPKNYLFHAWK 1103
                  Y    W+
Sbjct: 495  NPTIPRYSITLWQ 507


>XP_017228355.1 PREDICTED: uncharacterized protein LOC108203735 [Daucus carota subsp.
            sativus]
          Length = 1203

 Score =  183 bits (464), Expect = 3e-44
 Identities = 121/433 (27%), Positives = 206/433 (47%), Gaps = 9/433 (2%)
 Frame = -1

Query: 2224 KC--EPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLN--VCXXXXXXXXXX 2057
            KC  + ++HL+FADDL++F K +  +   +  A + F  +SGL+LN   C          
Sbjct: 682  KCSKQKLTHLVFADDLLIFCKGNLQSFSTVLEAVNLFSSVSGLQLNNSKCTCYFGNVPPN 741

Query: 2056 XVEPILEILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGR 1877
              + ++      EG +  + YLGIPL    L+ + C PLIDR+ ++++ W  K +S  GR
Sbjct: 742  IKQSVIAQSGFLEGQLHVI-YLGIPLITRCLTTRDCQPLIDRISRKIELWTNKFISQPGR 800

Query: 1876 LELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLS 1697
            L+LI S LFA++                +  I   FLW  +      + V+W  +  P  
Sbjct: 801  LQLISSVLFAIHGFWAQFLFLPVQVERKLISIFAKFLWSGNLSGSCFYKVAWKELCYPKW 860

Query: 1696 EGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFN 1517
            EGGLG++       +  +   W +     SLWV+WI +  L    FW+++     SW++ 
Sbjct: 861  EGGLGLKNIRLWNESATLFQLWRIVTRADSLWVRWIYSYELVNKGFWTMKSPAKSSWIWR 920

Query: 1516 KLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMII 1337
            K+L  R   L  IK+  G++S F +W  PW+N   L+   +    +       A++S I 
Sbjct: 921  KILNCRSRALSFIKYRPGSSSSFLLWHDPWLNNSPLTRQFDHSLMVALESQHMALLSSIQ 980

Query: 1336 SNGA*SLPKSRVVVSILNFIEDNNIKIGIK--DSITFLD--KADDSLKDVWDATRRKNSE 1169
             +G+ SL  S    S++  + +  + +  +  D IT+ D   ++ S   ++ +       
Sbjct: 981  VDGSWSLGVSN--YSLVRELREQCVNVTPRAFDRITWDDGGASNVSTSSIYQSLTDHRVG 1038

Query: 1168 SQFLSWMW-KCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLS 992
              +L ++W + ++PK + F AW +   KL T D      + +   CLLC  D E+  HL 
Sbjct: 1039 PAWLPFVWHRFRIPK-HSFTAWLIMKGKLLTKDRMLAFHMNINVTCLLCGVDVENHSHLF 1097

Query: 991  FQCKFAKMVWKAC 953
              C +++MV  AC
Sbjct: 1098 CDCSYSRMVLNAC 1110


>XP_019158195.1 PREDICTED: uncharacterized protein LOC109154910 [Ipomoea nil]
          Length = 1367

 Score =  181 bits (459), Expect = 1e-43
 Identities = 134/471 (28%), Positives = 215/471 (45%), Gaps = 24/471 (5%)
 Frame = -1

Query: 2281 SNLIRKGIMTKSLQPIK-ARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSG 2105
            S LI++ +  K  +PI  AR    VSHL FADDLM+F ++S      I +  + F   SG
Sbjct: 657  SYLIQEKVREKDWKPIHLARGGIGVSHLFFADDLMIFGESSETQMATIMDCLNRFSHWSG 716

Query: 2104 LKLNVCXXXXXXXXXXXVEPILEILNMKEGDISDL-------KYLGIPLFMGKLSKKHCY 1946
            L +N                    +  + GD++++       KYLGIP+   ++SK H  
Sbjct: 717  LNINHTKSLIFCSNNTPNR-----VKRRMGDMANIPITENLGKYLGIPILQKRVSKNHFN 771

Query: 1945 PLIDRLKKRLDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFL 1766
             +I+ +K++L+ WKA+ LSL GR  L++S L  + ++              I K+ RDFL
Sbjct: 772  YIINNMKRKLNQWKAESLSLAGRRVLVQSALATVPVYTMQTGALPVSTCNDIDKLCRDFL 831

Query: 1765 WGKSEGKKNIHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIE 1586
            WG +E K+ IHL++W  V  P ++GGLG+R   +   A + K AW +    + LWVK + 
Sbjct: 832  WGSNEAKRKIHLINWKEVCSPRTQGGLGLRMAKDFNLALLAKLAWQILNNPEKLWVKVMR 891

Query: 1585 AKYLSKDTFWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLS 1406
             KY+  D F++     N SW +  ++  R  +    ++ +G+      W   WV      
Sbjct: 892  QKYIKDDNFFTANTPANASWTWRSIIKGRSIIETGARWRVGSGDSLDFWSDWWV------ 945

Query: 1405 DCDESFKPITFGIGSEAVMSMIISNGA*S--LPKSRV--VVSILNFIEDN------NIKI 1256
                S KPI  G+G+   +   + N   S  + + R   V  +L+F+  +       I I
Sbjct: 946  ----SDKPI--GLGATVNIPNHLCNVKVSEFITQQRTWDVNRLLDFLPPDLVNQIRAIPI 999

Query: 1255 GI----KDSITFLDKADD--SLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHL 1094
             I    +D +T+ DK     S+   +      ++E +   WMWK    +      W +  
Sbjct: 1000 PIDDTTEDKLTWPDKTIGTFSVLSAFKHIAGHSTEEESWDWMWKLNCIEKVKSFLWLLLK 1059

Query: 1093 NKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVWKACLDEG 941
             KL T    R   +T  + C  C D+ E   H+   C FA   W++  D G
Sbjct: 1060 GKLLTGVERRNRNLTTDASCKRCPDEDESTNHIFRTCPFASDCWRSAKDYG 1110


>JAU28950.1 Putative ribonuclease H protein, partial [Noccaea caerulescens]
          Length = 533

 Score =  175 bits (443), Expect = 2e-43
 Identities = 129/448 (28%), Positives = 201/448 (44%), Gaps = 13/448 (2%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVEPILEI 2033
            ++HL FADDL++F++ S  +   +      F  +SGL +N+             E IL  
Sbjct: 5    LTHLSFADDLLIFVEGSNQSVAGVFTVLSQFEKLSGLAVNISKTSMFCSGVP--ETILLE 62

Query: 2032 LNMKEGDISD---LKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIK 1862
            L  +   +S    ++YLG+PL   KLS   C PL+ +++ +L+    + LSL GRL L+ 
Sbjct: 63   LKNRFALVSGSLPIRYLGLPLSSKKLSISDCDPLLSKIRMKLNGRMHRHLSLAGRLRLLS 122

Query: 1861 STLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHL---VSWDNVTLPLSEG 1691
            S +  L +               I  +   FLW    GK +I     VSW  ++ P SEG
Sbjct: 123  SVISGLIMFWTQAFFLPKTVIRKINSLCSSFLW---HGKLDIPTGARVSWSALSFPKSEG 179

Query: 1690 GLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSI-EVSPNYSWLFNK 1514
            GLG+R  S     C +K  W +     S+WV W+  +YLS++  WS+ E S  +SW+F K
Sbjct: 180  GLGIRSISSWNDTCGLKLIWMIFFRAGSIWVAWMRNRYLSRNCLWSLNEDSSTFSWMFRK 239

Query: 1513 LLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIIS 1334
            +L  R   L  +   IG       W  PW     L +   S  P   GI  +A++   IS
Sbjct: 240  ILKTRQKALSFLCIQIGNGEDSFFWWDPWTPFGPLINYLGSQGPTNLGIPLQALVKDYIS 299

Query: 1333 NGA*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADDSL------KDVWDATRRKNS 1172
                 LP +R    +  F   ++I +  + +   + K DD +      K++W   R  N 
Sbjct: 300  GDGWILPPARSDRHVEVFSYISSI-VPSQSNDYPIWKVDDQIRTSFVSKEIWGKIRLVNP 358

Query: 1171 ESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLS 992
            E  + S +W       +   AW   L++ PT +     G+ + S CLLC  + E   HL 
Sbjct: 359  EVPWHSLVWNRVAIPKHSTTAWLFMLDRNPTLNRLVSWGLDIESTCLLCGLEQESRDHLF 418

Query: 991  FQCKFAKMVWKACLDEGLIKSVSNDLSD 908
            F C F+  +W   +    + SV +   D
Sbjct: 419  FVCSFSNHIWLQLMHRLRLSSVPSQWED 446


>XP_018435547.1 PREDICTED: uncharacterized protein LOC108807802 [Raphanus sativus]
          Length = 1159

 Score =  180 bits (456), Expect = 3e-43
 Identities = 121/435 (27%), Positives = 197/435 (45%), Gaps = 14/435 (3%)
 Frame = -1

Query: 2218 EPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVEP-- 2045
            + ++HL FADDL++F+  S  +   + +    F +MSGL +NV             E   
Sbjct: 635  QELTHLCFADDLLIFIDGSESSLEGVFSVLSDFELMSGLAVNVSKTTLFTSGMMEAESQR 694

Query: 2044 ILEILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELI 1865
            I     +   ++  ++YLG PL   KLS   C PL+ ++KK++ SW  + LS+ GRL ++
Sbjct: 695  IANRFGLARSNLP-VRYLGTPLCTKKLSFSDCDPLLLQIKKKMSSWTTRSLSMAGRLTML 753

Query: 1864 KSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGL 1685
             S +  +  +              I  +   FLW  + G      V+W ++  P  EGGL
Sbjct: 754  TSVISGIIGYWSSAFLLPKRVIKAINSLCSSFLWHGTIGISTGAKVAWKDICTPKKEGGL 813

Query: 1684 GVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSI-EVSPNYSWLFNKLL 1508
            G+R     +  C +K  W L     S+WV WI  +YLS   FWS+ E + +YSW+F +LL
Sbjct: 814  GIRNIVTWSDTCALKLIWMLFFRAGSIWVAWIRQRYLSTGPFWSLNEKNCSYSWMFRRLL 873

Query: 1507 AVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIISNG 1328
             +R   L  I+  IG       W  PW     L        P   G+   A++S + +  
Sbjct: 874  KLRSKALEFIRISIGRGDITFFWWDPWTPFGSLYTYLGQDGPTRLGVPLFALVSEVWNGS 933

Query: 1327 A*SLPKSRV--VVSILNFI---------EDNNIKIGIKDSITFLDKADDSLKDVWDATRR 1181
            + SLP +R    + +L+F+         +  N  I      +F+ +       +WD+ R 
Sbjct: 934  SWSLPAARSGRQLELLSFLTTVAPAQGPDIPNWIINGNSHKSFISRL------IWDSIRP 987

Query: 1180 KNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGF 1001
               + ++   +W   +   +    W   LN+ PT D     G+ V ++C+LC +  E   
Sbjct: 988  HVPDKEWAPILWHKGIIPRHATTTWLFILNRNPTLDRLHSWGLEVETICILCGNANESRN 1047

Query: 1000 HLSFQCKFAKMVWKA 956
            HL F C +A  VWK+
Sbjct: 1048 HLFFDCLYATEVWKS 1062


>XP_018474345.1 PREDICTED: uncharacterized protein LOC108845678 [Raphanus sativus]
          Length = 2018

 Score =  176 bits (445), Expect(2) = 1e-42
 Identities = 125/461 (27%), Positives = 213/461 (46%), Gaps = 20/461 (4%)
 Frame = -1

Query: 2269 RKGIMTKSLQPIKARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNV 2090
            R G ++   Q  K R    ++HL FADDL++F+  S ++ + +      F   SGL +++
Sbjct: 1481 RSGYLSYHHQCHKTR----LTHLSFADDLLIFIDGSLESVQRVLQILHEFEKRSGLAVSL 1536

Query: 2089 CXXXXXXXXXXXVEPILEILNMKEG---DISDLKYLGIPLFMGKLSKKHCYPLIDRLKKR 1919
                         E  ++ + +  G       ++YLG+PL   KL+ ++C PL+ ++K+R
Sbjct: 1537 QKSSFFASGVSEQE--IQAIQVSTGMPCGSLPMRYLGVPLCTKKLNLENCQPLLQQIKQR 1594

Query: 1918 LDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKN 1739
            L SW AK LS  GRL LIK+ +  +                 I  +   FLW  +    +
Sbjct: 1595 LSSWSAKALSFAGRLLLIKTVIAGVSTFWCSTFILPKACINKINSLCGVFLWNGNIDGHH 1654

Query: 1738 IHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKD-- 1565
               VSW+ VTL   +GGLGV+   +   AC++K  W L    +S+WV W +   L  D  
Sbjct: 1655 TARVSWETVTLTKDQGGLGVKDLHKWNLACLLKLVWMLFFRPKSVWVCWFKEVILRGDVS 1714

Query: 1564 TFWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVN-GKLLSDCDESF 1388
             +W+++ S NYSWL NK++ VR  +  L+   +G     + W   W   G L +  + S 
Sbjct: 1715 NYWTVKTSTNYSWLVNKMIKVRDQVYPLLHRRLGNGETTRFWFDNWSPLGHLYTLLNASS 1774

Query: 1387 KPITFGIGSEAVMSMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIK-DSITFLDKAD-- 1217
              +  GI   A ++ + +NG  SLP +R          +N + + +   ++T  ++AD  
Sbjct: 1775 SRL--GIPRSATVASLFTNGHWSLPPART---------ENQLALQVHLTTVTLSEEADYY 1823

Query: 1216 -----------DSLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DY 1070
                        ++ +V+   +       +   +W       + F  W V L++ PT D 
Sbjct: 1824 EWMIEGKLRNRYNMGEVYTYLKGPQQTVTWAKIVWFSYGIPRHNFLTWLVLLDRCPTKDR 1883

Query: 1069 SRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVWKACLD 947
              R G+ V  +CLLC   +E+  HL F C ++  +W+  +D
Sbjct: 1884 LIRWGMNVTPLCLLCNTSHENRNHLFFDCVYSATIWRQTMD 1924



 Score = 28.5 bits (62), Expect(2) = 1e-42
 Identities = 11/33 (33%), Positives = 20/33 (60%)
 Frame = -2

Query: 876  IVVFATLYYL*FERNRRMHDDISNSPDFLIKMI 778
            I + A +Y++  ERN+R+H  +   P+ L  +I
Sbjct: 1956 IAMQACIYWIWSERNKRLHHQVFRPPEVLFSLI 1988


>GAV92728.1 zf-RVT domain-containing protein, partial [Cephalotus follicularis]
          Length = 354

 Score =  168 bits (425), Expect = 1e-42
 Identities = 110/361 (30%), Positives = 164/361 (45%), Gaps = 1/361 (0%)
 Frame = -1

Query: 2044 ILEILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELI 1865
            IL  +N KEG +  + YLG+PL   KLS+  C PLI+R+  R  SW +K LS  GRL+LI
Sbjct: 8    ILRRVNFKEGTLP-VTYLGLPLITKKLSRSECAPLIERITARAKSWISKTLSFAGRLQLI 66

Query: 1864 KSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGL 1685
            KSTL  +  +                K++R FLWGK++GK     V W  V  P+ EGGL
Sbjct: 67   KSTLVNMQAYWCSAFLLPGSVVKECVKVLRTFLWGKAKGK-----VKWAEVCKPVEEGGL 121

Query: 1684 GVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLA 1505
            GVR         ++K  W++    QSLW KW +   + +  FW + VS   SW + ++L 
Sbjct: 122  GVRDLKTWNRVLLLKRVWNILM-NQSLWAKWCQVYLMKRTNFWELPVSGQLSWSWRQILR 180

Query: 1504 VRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIISNGA 1325
            +R      I +  G    F +W  PW++G+ +            G+GS+A +  +I    
Sbjct: 181  LRPLAKEHILYVCGNGESFSLWYDPWLHGESVHALYGHRVIYDSGLGSQAKVKDVIWQDT 240

Query: 1324 *SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADDSLKDVWDATRRKNSESQFLSWMW 1145
              L   +  V ++    D++         TF      S    W  TR  +SE  + + +W
Sbjct: 241  CDLIDIQQRVQVVQITMDHDRIYWGARGQTF------STASAWQLTRSPSSEVTWHAIVW 294

Query: 1144 -KCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKM 968
               ++PK + F  W        T D     G+     C+    D E   HL FQC F+  
Sbjct: 295  HPMRIPK-HAFSLWLALRGAHKTKDKLLAAGVIHTDSCIFNCGDGESLAHLFFQCPFSAS 353

Query: 967  V 965
            +
Sbjct: 354  I 354


>GAV92297.1 hypothetical protein CFOL_v3_35677 [Cephalotus follicularis]
          Length = 397

 Score =  169 bits (427), Expect = 2e-42
 Identities = 91/296 (30%), Positives = 150/296 (50%), Gaps = 2/296 (0%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNV--CXXXXXXXXXXXVEPIL 2039
            ++HL +ADDL++F  A   +  FIN+  D F  +SGL                   + IL
Sbjct: 28   LNHLCYADDLLIFAAADLKSIEFINHGLDLFKAVSGLAAGTEKSSIFFWNTKRRMRDHIL 87

Query: 2038 EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKS 1859
             +   ++G +  +KYLG+PL   +L+K+ C PLI+ +  R++SW  K LS  GRL+LIKS
Sbjct: 88   SMTKFRQGALP-VKYLGLPLITSRLTKRDCTPLIENILARVNSWTTKSLSYAGRLQLIKS 146

Query: 1858 TLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGV 1679
            TL ++ +                 +I+R+FLWG S       LV W NV LP  EGGLG+
Sbjct: 147  TLASMQVFWCSTFLLPAEVIKECERILRNFLWGSSGTSGKRSLVKWTNVCLPRQEGGLGI 206

Query: 1678 RKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAVR 1499
            +       A ++K  W++  +  SLWV+W +   + K +FW++  + + SW + ++L +R
Sbjct: 207  KSLKTWNQALLLKQIWNILND-HSLWVQWCKLNLIRKHSFWTLPTTGSMSWSWRQILQLR 265

Query: 1498 GSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMSMIISN 1331
               L  + +  G   +F +W  PW +G  +            G+G+  ++  +I N
Sbjct: 266  NMALTQLVYVSGKGDKFSLWYDPWFHGTSIYTTYGHRVIYDAGLGNNELVQAVIEN 321


>XP_019163505.1 PREDICTED: uncharacterized protein LOC109159849 [Ipomoea nil]
          Length = 1316

 Score =  177 bits (449), Expect = 2e-42
 Identities = 130/448 (29%), Positives = 191/448 (42%), Gaps = 31/448 (6%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNV------CXXXXXXXXXXXV 2051
            +SHL FADDLMLF ++S   AR I N  + F   SGLK+N+      C           +
Sbjct: 630  ISHLFFADDLMLFGESSDRQARTILNCLERFSKASGLKVNLAKSQIFCSPNTTAGVKNII 689

Query: 2050 E-----PILEILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSL 1886
            E     PI E L           YLGIP+   ++SK     ++D+++++L +WKA  LS+
Sbjct: 690  ENRLGIPITENLG---------SYLGIPILQRRVSKDSFTSVVDKMRRKLATWKASSLSM 740

Query: 1885 VGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTL 1706
             GR  L++S+L  +  +              I K+ R+FLWG +E  K IH +SW  +  
Sbjct: 741  AGRRVLVQSSLATIPTYTMQSMALPVSTCKQIDKVCRNFLWGHAENTKKIHTISWSEICK 800

Query: 1705 PLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSIEVSPNYSW 1526
            P   GGLG+RK  +   A + K AW +      LWVK +  KY+    F    V  N SW
Sbjct: 801  PRDLGGLGLRKVGDFNLAFLTKLAWVVLTNPHKLWVKVMREKYVKNRNFLDNNVYTNCSW 860

Query: 1525 LFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVMS 1346
             ++ +L  +  L   I + IG  S    W   W+  K L+       P   G G+  V  
Sbjct: 861  NWSSILKGKNILTEGIAWKIGTGSSVNFWYDNWLGNKSLATSPNIVVP--DGGGNMHVKD 918

Query: 1345 MIISNG--------------------A*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLD 1226
             IISNG                    A  +P +      LN+       + +  + +FL 
Sbjct: 919  FIISNGGWNYAALESLLPGDMVDRIRAIPIPLTNGQQDKLNWPHSGTGLVTVSSAFSFLS 978

Query: 1225 KADDSLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITV 1046
              DDS              +    W+WK K  +      WK+  N L      +R G+T+
Sbjct: 979  GTDDS--------------AASHEWIWKIKAIERVKLFVWKIVKNGLLVNTERKRRGLTI 1024

Query: 1045 VSMCLLCMDDYEDGFHLSFQCKFAKMVW 962
             S C  C  + E   HL  QC+ ++  W
Sbjct: 1025 DSSCPRCGAEEETLDHLFRQCEDSRNCW 1052


>XP_017227807.1 PREDICTED: uncharacterized protein LOC108203403 [Daucus carota subsp.
            sativus]
          Length = 664

 Score =  174 bits (441), Expect = 2e-42
 Identities = 115/412 (27%), Positives = 184/412 (44%), Gaps = 6/412 (1%)
 Frame = -1

Query: 2212 VSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXV--EPIL 2039
            ++H+ FADD+++F      +   +    + F   SG+++N                  I 
Sbjct: 150  ITHISFADDILMFCHGDSVSVDRLLAGLNEFSHCSGMRINSAKSQFFISNVDDGLKHHIR 209

Query: 2038 EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGRLELIKS 1859
                  EG +   KYLG+PL   KLS + C PLI R++ R+DSW    L+  GRL+LIK+
Sbjct: 210  VSTGFSEGSLP-AKYLGLPLISTKLSMRCCLPLIVRVQNRIDSWLNTCLNQAGRLQLIKA 268

Query: 1858 TLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHLVSWDNVTLPLSEGGLGV 1679
             LF L  +              +  +   FLWG S     +  V W +   P  EGGLG+
Sbjct: 269  VLFGLQGYWSAHLLLPKSVLKKLQSLFVKFLWGGSSDNHKVVKVKWSDCCFPKIEGGLGL 328

Query: 1678 RKCSEMTHACVIKNAWDLAR-EKQSLWVKWIEAKYLSKDTFWSIEVSPNYSWLFNKLLAV 1502
                +  +A  + + W + + +  SLW+ W +  YL +  FW+++   N  W   K+L +
Sbjct: 329  YDLCQWNNAVFLFHLWRITQPDNNSLWIMWFKRTYLKRRAFWTMDFPKNAPWCIRKILQL 388

Query: 1501 RGSLLGLIKFHIGANSQFKVWKAPWVNGK-LLSDCDESFKPITFGIGSEAVMSMIISNGA 1325
            R   L  I +H+GA+S F +W  PWVN K LLS C       +     + V S  +SN +
Sbjct: 389  RPLALRFINYHVGASSNFLLWHDPWVNNKPLLSYCHPDVISSSNSRLFDKVAS-FMSNSS 447

Query: 1324 *SLPKSR--VVVSILNFIEDNNIKIGIKDSITFLDKADDSLKDVWDATRRKNSESQFLSW 1151
              LP S    V+ + + I   +I I  +DSIT+++ A   +  +W   R  ++   ++  
Sbjct: 448  WLLPSSNHLDVIQLRSLI--GSIPIHSRDSITWMNSASIKISSIWQCIRSISTTPPWIIG 505

Query: 1150 MWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYEDGFHL 995
            +W         F  W     +L T D   +  +     C+ C    E   HL
Sbjct: 506  VWHQFAIPKCAFTLWLAFKERLLTKDRMIKFNMNTDLACIFCNRAIETHSHL 557


>JAU25120.1 LINE-1 retrotransposable element ORF2 protein, partial [Noccaea
            caerulescens]
          Length = 942

 Score =  176 bits (447), Expect = 3e-42
 Identities = 132/453 (29%), Positives = 203/453 (44%), Gaps = 15/453 (3%)
 Frame = -1

Query: 2221 CEPV--SHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVE 2048
            CE V  +HL FADDL++F++ S  +   +      F  +SGL +N+             E
Sbjct: 409  CEEVQLTHLSFADDLLIFVEGSNQSVAGVFTVLSQFEKLSGLAVNISKTSMFCSGVP--E 466

Query: 2047 PILEILNMKEGDISD---LKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGR 1877
             IL  L  +   +S    ++YLG+PL   KLS   C PL+ +++ +L+    + LSL GR
Sbjct: 467  TILLELKNRFALVSGSLPIRYLGLPLCSKKLSISDCDPLLSKIRMKLNGRMHRHLSLAGR 526

Query: 1876 LELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHL---VSWDNVTL 1706
            L L+ S +  L +               I  +   FLW    GK +I     VSW  ++ 
Sbjct: 527  LRLLSSVISGLIMFWTQAFFLPKTVIRKINSLCSSFLW---HGKLDIPTGARVSWSALSF 583

Query: 1705 PLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSI-EVSPNYS 1529
            P SEGGLG+R  S     C +K  W +     S+WV W+  +YLS++  WS+ E S  +S
Sbjct: 584  PKSEGGLGIRSISSWNDTCGLKMIWMIFFRAGSIWVAWMRNRYLSRNCLWSLNEDSSTFS 643

Query: 1528 WLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVM 1349
            W+F K+L  R   L  +   IG       W  PW     L +   S  P   GI  +A++
Sbjct: 644  WMFRKILKTRQKALSFLCIQIGNGEDSFFWWDPWTPFGPLINYLGSQGPTNLGIPLQALV 703

Query: 1348 SMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADDSL------KDVWDAT 1187
               IS     LP +R    +  F   ++I +  + +   + K DD +      K++W   
Sbjct: 704  KDYISGDGWILPPARSDRHVEVFSYISSI-VPSQSNDYPIWKVDDQIRTSFVSKEIWGKI 762

Query: 1186 RRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYED 1007
            R  N E  + S +W       +   AW   L++ PT +     G+ + S CLLC  + E 
Sbjct: 763  RLVNPEVPWHSLVWNRVAIPKHSTTAWLFMLDRNPTLNRLVSWGLDIESTCLLCGLEQES 822

Query: 1006 GFHLSFQCKFAKMVWKACLDEGLIKSVSNDLSD 908
              HL F C F+  +W   +    + SV +   D
Sbjct: 823  RDHLFFVCSFSNHIWLQLMHRLRLSSVPSQWED 855


>JAU74353.1 hypothetical protein LE_TR15446_c14_g1_i1_g.48810, partial [Noccaea
            caerulescens]
          Length = 1124

 Score =  176 bits (446), Expect = 5e-42
 Identities = 132/453 (29%), Positives = 203/453 (44%), Gaps = 15/453 (3%)
 Frame = -1

Query: 2221 CEPV--SHLIFADDLMLFMKASPDNARFINNASDSFGMMSGLKLNVCXXXXXXXXXXXVE 2048
            CE V  +HL FADDL++F++ S  +   +      F  +SGL +N+             E
Sbjct: 591  CEEVQLTHLSFADDLLIFVEGSNQSVAGVFTVLSQFEKLSGLAVNISKTSMFCSGVP--E 648

Query: 2047 PILEILNMKEGDISD---LKYLGIPLFMGKLSKKHCYPLIDRLKKRLDSWKAKLLSLVGR 1877
             IL  L  +   +S    ++YLG+PL   KLS   C PL+ +++ +L+    + LSL GR
Sbjct: 649  TILLELKNRFALVSGSLPIRYLGLPLSSKKLSISDCDPLLSKIRMKLNGRMHRHLSLAGR 708

Query: 1876 LELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEGKKNIHL---VSWDNVTL 1706
            L L+ S +  L +               I  +   FLW    GK +I     VSW  ++ 
Sbjct: 709  LRLLSSVISGLIMFWTQAFFLPKTVIRKINSLCSSFLW---HGKLDIPTGARVSWSALSF 765

Query: 1705 PLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSKDTFWSI-EVSPNYS 1529
            P SEGGLG+R  S     C +K  W +     S+WV W+  +YLS++  WS+ E S  +S
Sbjct: 766  PKSEGGLGIRSISSWNDTCGLKLIWMIFFRAGSIWVAWMRNRYLSRNCLWSLNEDSSTFS 825

Query: 1528 WLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVNGKLLSDCDESFKPITFGIGSEAVM 1349
            W+F K+L  R   L  +   IG       W  PW     L +   S  P   GI  +A++
Sbjct: 826  WMFRKILKTRQKALSFLCIQIGNGEDSFFWWDPWTPFGPLINYLGSQGPTNLGIPLQALV 885

Query: 1348 SMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADDSL------KDVWDAT 1187
               IS     LP +R    +  F   ++I +  + +   + K DD +      K++W   
Sbjct: 886  KDYISGDGWILPPARSDRHVEVFSYISSI-VPSQSNDYPIWKVDDQIRTSFVSKEIWGKI 944

Query: 1186 RRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT*DYSRRLGITVVSMCLLCMDDYED 1007
            R  N E  + S +W       +   AW   L++ PT +     G+ + S CLLC  + E 
Sbjct: 945  RLVNPEVPWHSLVWNRVAIPKHSTTAWLFMLDRNPTLNRLVSWGLDIESTCLLCGLEQES 1004

Query: 1006 GFHLSFQCKFAKMVWKACLDEGLIKSVSNDLSD 908
              HL F C F+  +W   +    + SV +   D
Sbjct: 1005 RDHLFFVCSFSNHIWLQLMHRLRLSSVPSQWED 1037


>XP_010530577.2 PREDICTED: uncharacterized protein LOC104807150 isoform X6 [Tarenaya
            hassleriana]
          Length = 1517

 Score =  176 bits (446), Expect = 6e-42
 Identities = 119/458 (25%), Positives = 201/458 (43%), Gaps = 18/458 (3%)
 Frame = -1

Query: 2281 SNLIRKGIMTKSLQPIKARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGL 2102
            S ++ + IM   +QP    K   +SHL FADD+++F K    +   +      F  +SGL
Sbjct: 668  SRMLDRSIMEGKIQPHYRCKSPLISHLSFADDIIIFSKGDVQSLMEVKRVLHDFSNLSGL 727

Query: 2101 KLNVCXXXXXXXXXXXVEPIL--EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRL 1928
            ++N              E I     + +  G +  ++YLG+PL   +LSK    PL+ ++
Sbjct: 728  QINPEKSELFLAGCSIDEQIAISTAVGIGMGHLP-VRYLGVPLSPTRLSKTDYLPLLQKV 786

Query: 1927 KKRLDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEG 1748
            K +L +W+ K LS +G+++LI + ++ L                 I  +   FLW  S  
Sbjct: 787  KSKLTNWQTKFLSSMGKIQLITTVIYGLVNSWSMTFLLPKYLLKEIDSLCAAFLWQHSTN 846

Query: 1747 KKNIHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSK 1568
                H ++WD +  P +EGGLG+R+  E      +K  W L  +  SLWV W+++     
Sbjct: 847  STPSHRIAWDAICKPRTEGGLGLRRLDEFNKVFRLKLVWLLFSKAGSLWVAWVKSNIFKD 906

Query: 1567 DTFWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVN-GKLLSDCDES 1391
             ++W ++   N SW   KLL +R      I   +G       W   W++ G L+    E+
Sbjct: 907  KSYWGLKAHQNVSWNLRKLLQMRNQAREFIGVKLGNGRDTSFWYDSWLDIGPLIVFIGET 966

Query: 1390 FKPITFGIGSEAVMSMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADD- 1214
              P    +   A ++  + NG  SLP +R      N I++ ++K+ ++ +    D   D 
Sbjct: 967  -GPGLLRLPKSATVADAVRNGNWSLPPAR-----SNRIQELHLKL-LELNPPTNDAGPDL 1019

Query: 1213 --------------SLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT* 1076
                          S +  W+  R   +  +     W  +    + F  W+V   +LPT 
Sbjct: 1020 PTWRHMGGTRKTFFSSRQTWEQLRSPGTAFEGCGLAWFRQATPRHAFLTWQVLQERLPTT 1079

Query: 1075 DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVW 962
            D     GI   + C+LC  D E   HL F C+F++ +W
Sbjct: 1080 DRLETWGIQAPNRCILCCADLESHPHLFFGCRFSRSIW 1117


>XP_010530576.1 PREDICTED: uncharacterized protein LOC104807150 isoform X5 [Tarenaya
            hassleriana]
          Length = 1565

 Score =  176 bits (446), Expect = 7e-42
 Identities = 119/458 (25%), Positives = 201/458 (43%), Gaps = 18/458 (3%)
 Frame = -1

Query: 2281 SNLIRKGIMTKSLQPIKARKCEPVSHLIFADDLMLFMKASPDNARFINNASDSFGMMSGL 2102
            S ++ + IM   +QP    K   +SHL FADD+++F K    +   +      F  +SGL
Sbjct: 426  SRMLDRSIMEGKIQPHYRCKSPLISHLSFADDIIIFSKGDVQSLMEVKRVLHDFSNLSGL 485

Query: 2101 KLNVCXXXXXXXXXXXVEPIL--EILNMKEGDISDLKYLGIPLFMGKLSKKHCYPLIDRL 1928
            ++N              E I     + +  G +  ++YLG+PL   +LSK    PL+ ++
Sbjct: 486  QINPEKSELFLAGCSIDEQIAISTAVGIGMGHLP-VRYLGVPLSPTRLSKTDYLPLLQKV 544

Query: 1927 KKRLDSWKAKLLSLVGRLELIKSTLFALYLHXXXXXXXXXXXXXXIYKIIRDFLWGKSEG 1748
            K +L +W+ K LS +G+++LI + ++ L                 I  +   FLW  S  
Sbjct: 545  KSKLTNWQTKFLSSMGKIQLITTVIYGLVNSWSMTFLLPKYLLKEIDSLCAAFLWQHSTN 604

Query: 1747 KKNIHLVSWDNVTLPLSEGGLGVRKCSEMTHACVIKNAWDLAREKQSLWVKWIEAKYLSK 1568
                H ++WD +  P +EGGLG+R+  E      +K  W L  +  SLWV W+++     
Sbjct: 605  STPSHRIAWDAICKPRTEGGLGLRRLDEFNKVFRLKLVWLLFSKAGSLWVAWVKSNIFKD 664

Query: 1567 DTFWSIEVSPNYSWLFNKLLAVRGSLLGLIKFHIGANSQFKVWKAPWVN-GKLLSDCDES 1391
             ++W ++   N SW   KLL +R      I   +G       W   W++ G L+    E+
Sbjct: 665  KSYWGLKAHQNVSWNLRKLLQMRNQAREFIGVKLGNGRDTSFWYDSWLDIGPLIVFIGET 724

Query: 1390 FKPITFGIGSEAVMSMIISNGA*SLPKSRVVVSILNFIEDNNIKIGIKDSITFLDKADD- 1214
              P    +   A ++  + NG  SLP +R      N I++ ++K+ ++ +    D   D 
Sbjct: 725  -GPGLLRLPKSATVADAVRNGNWSLPPAR-----SNRIQELHLKL-LELNPPTNDAGPDL 777

Query: 1213 --------------SLKDVWDATRRKNSESQFLSWMWKCKVPKNYLFHAWKVHLNKLPT* 1076
                          S +  W+  R   +  +     W  +    + F  W+V   +LPT 
Sbjct: 778  PTWRHMGGTRKTFFSSRQTWEQLRSPGTAFEGCGLAWFRQATPRHAFLTWQVLQERLPTT 837

Query: 1075 DYSRRLGITVVSMCLLCMDDYEDGFHLSFQCKFAKMVW 962
            D     GI   + C+LC  D E   HL F C+F++ +W
Sbjct: 838  DRLETWGIQAPNRCILCCADLESHPHLFFGCRFSRSIW 875


Top