BLASTX nr result

ID: Rehmannia22_contig00004120 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00004120
         (1354 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citr...   541   e-151
gb|EOY04484.1| NT domain of poly(A) polymerase and terminal urid...   535   e-149
gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus pe...   526   e-146
ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304...   523   e-146
ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207...   519   e-144
ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258...   513   e-143
emb|CBI18050.3| unnamed protein product [Vitis vinifera]              513   e-143
ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589...   508   e-141
gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]     497   e-138
ref|XP_006843704.1| hypothetical protein AMTR_s00007p00209910 [A...   493   e-136
ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [...   492   e-136
ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arab...   491   e-136
ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256...   486   e-135
ref|XP_006403898.1| hypothetical protein EUTSA_v10010169mg [Eutr...   485   e-134
ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253...   471   e-130
ref|XP_006290592.1| hypothetical protein CARUB_v10016681mg [Caps...   470   e-130
ref|XP_006290591.1| hypothetical protein CARUB_v10016681mg [Caps...   470   e-130
gb|EOY34688.1| NT domain of poly(A) polymerase and terminal urid...   469   e-130
gb|EOY34687.1| NT domain of poly(A) polymerase and terminal urid...   469   e-130
ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cuc...   462   e-127

>ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citrus clementina]
            gi|568855155|ref|XP_006481174.1| PREDICTED:
            uncharacterized protein LOC102622468 [Citrus sinensis]
            gi|557531615|gb|ESR42798.1| hypothetical protein
            CICLE_v10011044mg [Citrus clementina]
          Length = 882

 Score =  541 bits (1393), Expect = e-151
 Identities = 266/363 (73%), Positives = 301/363 (82%)
 Frame = +2

Query: 38   SHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGA 217
            S ++SS   N   IG E+W  AE A   II ++QP  VSEERR+ V+DY+QRLIRN LG 
Sbjct: 21   SSSSSSVPSNQTAIGAEYWQRAEEATQAIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGC 80

Query: 218  EVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLV 397
            EVFP+GSVPLKTYLPDGDIDLT FGG NVE+ LAND+ S+LE E++N ++EFVVKD QL+
Sbjct: 81   EVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLI 140

Query: 398  RAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYES 577
            RAEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLEQVDR IGKDHLFKRSIILIKAWCYYES
Sbjct: 141  RAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYES 200

Query: 578  RILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPV 757
            RILGAHHGLISTYALETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW++Y +SLNGPV
Sbjct: 201  RILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPV 260

Query: 758  RLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLK 937
            R+SSLP VVVE PE               CV  FSVPSRG D NSR F  KHLNI DPLK
Sbjct: 261  RISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLK 320

Query: 938  DINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRP 1117
            + NNLGRSVSKGNFYRIRSAF+YGARKL  IL QP +S+T+EL +FFSNT+ RHGSGQRP
Sbjct: 321  ENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRP 380

Query: 1118 DVQ 1126
            DVQ
Sbjct: 381  DVQ 383


>gb|EOY04484.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative [Theobroma
            cacao]
          Length = 890

 Score =  535 bits (1379), Expect = e-149
 Identities = 266/379 (70%), Positives = 306/379 (80%)
 Frame = +2

Query: 8    GAAAELKNVASHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYI 187
            G A+E +   S ++SS+  N   I  E+W  AE A  GII ++QP  VSEERR+ V+DY+
Sbjct: 16   GVASEER---SSSSSSSSSNQAGIAAEYWKKAEEATQGIIAQVQPTVVSEERRKAVIDYV 72

Query: 188  QRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASS 367
            QRLI N LG  VFP+GSVPLKTYLPDGDIDLT FGG N E+ LAND+ S+LE E+ N ++
Sbjct: 73   QRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDVCSVLEREDHNRAA 132

Query: 368  EFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSII 547
            EFVVKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDR IGKDHLFKRSII
Sbjct: 133  EFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKVDRRIGKDHLFKRSII 192

Query: 548  LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWE 727
            LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHS+LDGPLAVL+KFLDYFSKFDW+
Sbjct: 193  LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLDGPLAVLYKFLDYFSKFDWD 252

Query: 728  TYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQL 907
             Y +SLNGP+ +SSLP VVVE PE               CV MFSVPSRG + NSR F  
Sbjct: 253  NYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKECVEMFSVPSRGFETNSRTFPQ 312

Query: 908  KHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNT 1087
            KHLNI DPL++ NNLGRSVSKGNFYRIRSAF+YGARKL +IL Q  +S+ +EL +FFSNT
Sbjct: 313  KHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLGKILSQAEESMADELRKFFSNT 372

Query: 1088 MARHGSGQRPDVQGFDPSL 1144
            + RHGSGQRPDVQ   PSL
Sbjct: 373  LDRHGSGQRPDVQDCIPSL 391


>gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus persica]
          Length = 742

 Score =  526 bits (1354), Expect = e-146
 Identities = 254/354 (71%), Positives = 292/354 (82%)
 Frame = +2

Query: 77   IGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTY 256
            I  E+W  AE A  G+I ++QP  VSE RR+ V+DY+QRLIR CLG EVFP+GSVPLKTY
Sbjct: 48   ISAEYWKKAEEATQGVIAQVQPTDVSERRRKAVIDYVQRLIRGCLGCEVFPFGSVPLKTY 107

Query: 257  LPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQD 436
            LPDGDIDLT FGG NVE+ LAND+ S+LE E +N ++EF+VKDVQL+RAEVKLVKC+VQ+
Sbjct: 108  LPDGDIDLTAFGGINVEEALANDVCSVLEREVQNGTAEFMVKDVQLIRAEVKLVKCLVQN 167

Query: 437  IVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 616
            IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY
Sbjct: 168  IVVDISFNQLGGLCTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 227

Query: 617  ALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMP 796
            ALETLVLYIFHLFH++L+GPLAVL+KFLDYFSKFDW+ Y +SL+GPVR+SSLP ++VE P
Sbjct: 228  ALETLVLYIFHLFHASLNGPLAVLYKFLDYFSKFDWDNYCISLSGPVRISSLPELLVETP 287

Query: 797  EXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGN 976
            E               CV MFSVPSRG + N R F  KH NI DPLKD NNLGRSVSKGN
Sbjct: 288  ENGGNDLLLSNDFLKECVQMFSVPSRGYETNYRTFPPKHFNIVDPLKDNNNLGRSVSKGN 347

Query: 977  FYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDP 1138
            FYRIRSAF+YGARKL RIL Q  D+I +E+ +FF+NT+ RHG GQRPDVQ   P
Sbjct: 348  FYRIRSAFTYGARKLGRILSQTEDNIDDEIRKFFANTLDRHGGGQRPDVQDLVP 401


>ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304393 [Fragaria vesca
            subsp. vesca]
          Length = 878

 Score =  523 bits (1347), Expect = e-146
 Identities = 259/385 (67%), Positives = 308/385 (80%), Gaps = 9/385 (2%)
 Frame = +2

Query: 8    GAAAELKNVASHATS--SADQNPFEIGT-EHWATAERAAHGIIRKIQPNSVSEERRREVV 178
            GA  E +  +S ++S  S+  +   + T E+W  AE A  G+I ++QP  VSE RRR V+
Sbjct: 13   GAVLEDRPTSSSSSSLPSSSSSLLSVSTAEYWRRAEAATQGVIAQVQPTDVSERRRRAVI 72

Query: 179  DYIQRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKN 358
            DY+QRLIR  LG EVFP+GSVPLKTYLPDGDIDLT FGG N+++ LAND+ ++LE E++N
Sbjct: 73   DYVQRLIRGFLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNIDEVLANDVCAVLEREDQN 132

Query: 359  ASSEFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKR 538
             ++EF+VKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKR
Sbjct: 133  MAAEFMVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLIGKDHLFKR 192

Query: 539  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKF 718
            SIILIKAWCYYESRILGAHHGLISTY LETLVL+IFHLFH++L+GPLAVL+KFLDYFSKF
Sbjct: 193  SIILIKAWCYYESRILGAHHGLISTYGLETLVLFIFHLFHASLNGPLAVLYKFLDYFSKF 252

Query: 719  DWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRG 898
            DW+ Y +SLNGPVR+SSLP ++ EMP+               CV  FSVPSRG + N R 
Sbjct: 253  DWDNYCISLNGPVRISSLPELLTEMPDNGGGDLLLSNEFLRSCVDRFSVPSRGYETNYRT 312

Query: 899  FQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFF 1078
            FQ KHLNI DPLK+ NNLGRSVSKGNFYRIRSAF+YGARKL RIL QP ++I +E  +FF
Sbjct: 313  FQPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILSQPEENIDDEFRKFF 372

Query: 1079 SNTMARHGSGQRPDVQ------GFD 1135
            SNT+ RHGSGQRPDVQ      GFD
Sbjct: 373  SNTLDRHGSGQRPDVQDPIPFSGFD 397


>ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  519 bits (1336), Expect = e-144
 Identities = 256/381 (67%), Positives = 302/381 (79%), Gaps = 3/381 (0%)
 Frame = +2

Query: 8    GAAAELKNVASHATSSAD---QNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVV 178
            GA AE K  +S  +S +     NP  IG ++W  AE A   II ++QP  VSE RR+ V+
Sbjct: 13   GAVAEDKPSSSSFSSFSSLLPSNPTPIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVI 72

Query: 179  DYIQRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKN 358
            DY+QRLIR  L  EVFP+GSVPLKTYLPDGDIDLT  GG+NVE+ LA+D+ S+L  E++N
Sbjct: 73   DYVQRLIRGRLRCEVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQN 132

Query: 359  ASSEFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKR 538
             ++EFVVKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE++DR IGKDHLFKR
Sbjct: 133  GAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKR 192

Query: 539  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKF 718
            SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHS L+GPL VL+KFLDYFSKF
Sbjct: 193  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKF 252

Query: 719  DWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRG 898
            DW+ Y +SLNGPVR+SSLP +V E P+               C+  FSVP+RG + NSR 
Sbjct: 253  DWDNYCISLNGPVRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRA 312

Query: 899  FQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFF 1078
            F +KHLNI DPLK+ NNLGRSVSKGNFYRIRSAFSYGARKL  IL  P D++ +E+ +FF
Sbjct: 313  FPIKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFF 372

Query: 1079 SNTMARHGSGQRPDVQGFDPS 1141
            SNT+ RHG GQRPDVQ  DP+
Sbjct: 373  SNTLDRHGGGQRPDVQ--DPA 391


>ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
          Length = 884

 Score =  513 bits (1321), Expect = e-143
 Identities = 255/358 (71%), Positives = 289/358 (80%)
 Frame = +2

Query: 50   SSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFP 229
            S +  NP  IG   WA AE     II ++QP  VSEERR+EVVDY+Q LIR  +G EVFP
Sbjct: 25   SLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFP 84

Query: 230  YGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEV 409
            +GSVPLKTYLPDGDIDLT FGG  VEDTLA ++ S+LE E++N ++EFVVKDVQL+ AEV
Sbjct: 85   FGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEV 144

Query: 410  KLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILG 589
            KLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQ+DR IGKDHLFKRSIILIKAWCYYESRILG
Sbjct: 145  KLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILG 204

Query: 590  AHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSS 769
            AHHGLISTYALETLVLYIF LFHS L+GPLAVL+KFLDYFSKFDW+ Y VSLNGPVR+SS
Sbjct: 205  AHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISS 264

Query: 770  LPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINN 949
            LP ++ E PE               C+  FSVPSRG + NSR F  KH NI DPLK+ NN
Sbjct: 265  LPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNN 324

Query: 950  LGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1123
            LGRSVSKGNFYRIRSAF+YGARKL RILLQP D I+ EL +FF+NT+ RHG GQRPDV
Sbjct: 325  LGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDV 382


>emb|CBI18050.3| unnamed protein product [Vitis vinifera]
          Length = 824

 Score =  513 bits (1321), Expect = e-143
 Identities = 255/358 (71%), Positives = 289/358 (80%)
 Frame = +2

Query: 50   SSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFP 229
            S +  NP  IG   WA AE     II ++QP  VSEERR+EVVDY+Q LIR  +G EVFP
Sbjct: 25   SLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFP 84

Query: 230  YGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEV 409
            +GSVPLKTYLPDGDIDLT FGG  VEDTLA ++ S+LE E++N ++EFVVKDVQL+ AEV
Sbjct: 85   FGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEV 144

Query: 410  KLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILG 589
            KLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQ+DR IGKDHLFKRSIILIKAWCYYESRILG
Sbjct: 145  KLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILG 204

Query: 590  AHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSS 769
            AHHGLISTYALETLVLYIF LFHS L+GPLAVL+KFLDYFSKFDW+ Y VSLNGPVR+SS
Sbjct: 205  AHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISS 264

Query: 770  LPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINN 949
            LP ++ E PE               C+  FSVPSRG + NSR F  KH NI DPLK+ NN
Sbjct: 265  LPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNN 324

Query: 950  LGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1123
            LGRSVSKGNFYRIRSAF+YGARKL RILLQP D I+ EL +FF+NT+ RHG GQRPDV
Sbjct: 325  LGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDV 382


>ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589320 isoform X1 [Solanum
            tuberosum] gi|565359810|ref|XP_006346682.1| PREDICTED:
            uncharacterized protein LOC102589320 isoform X2 [Solanum
            tuberosum]
          Length = 852

 Score =  508 bits (1307), Expect = e-141
 Identities = 255/377 (67%), Positives = 289/377 (76%)
 Frame = +2

Query: 74   EIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKT 253
            +IG E WA AE+    I+R++QP +VSE RRR V++Y+Q L+R  L  EVFPYGSVPLKT
Sbjct: 25   DIGPERWAVAEKVTQNILRRVQPTTVSENRRRSVIEYVQNLVRGSLRCEVFPYGSVPLKT 84

Query: 254  YLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQ 433
            YLPDGDIDLT F G + ED  A+DM S LE E++N  +EF VKDVQL+RAEVKLVKCIVQ
Sbjct: 85   YLPDGDIDLTAFVGKDFEDAFADDMVSTLEAEDRNKDAEFAVKDVQLIRAEVKLVKCIVQ 144

Query: 434  DIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 613
            +IVVD+S NQIGGLCTL FLEQVDR IGKDHLFKRSIILIK WCYYESR+LGAHHGL ST
Sbjct: 145  NIVVDISLNQIGGLCTLGFLEQVDRLIGKDHLFKRSIILIKTWCYYESRLLGAHHGLFST 204

Query: 614  YALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEM 793
            YALETLVLYIFH FH+TLDGPLAVL+KFLDYF KFDW+ Y VSL GPVR+SSLP  VVE+
Sbjct: 205  YALETLVLYIFHFFHTTLDGPLAVLYKFLDYFGKFDWDNYYVSLTGPVRISSLPEYVVEV 264

Query: 794  PEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKG 973
            PE               C+  FSVPS+GGD NSR  Q K+LNI DPLK+ NNLGRSVSKG
Sbjct: 265  PENDGGDVLLSNDFIRYCLERFSVPSKGGDLNSRKIQHKYLNIIDPLKESNNLGRSVSKG 324

Query: 974  NFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICT 1153
            NFYRIRSA +YGARKL  ILLQ  D+I  EL+RFF NTM RH SG+RPDVQ  DPS    
Sbjct: 325  NFYRIRSAINYGARKLESILLQSEDNIVEELYRFFPNTMDRHDSGERPDVQ--DPSNDFC 382

Query: 1154 RPISAVPIPETRPCKIK 1204
                A P P   P +I+
Sbjct: 383  LASPASPAPNFDPSQIE 399


>gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]
          Length = 928

 Score =  497 bits (1279), Expect = e-138
 Identities = 251/391 (64%), Positives = 289/391 (73%), Gaps = 37/391 (9%)
 Frame = +2

Query: 77   IGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTY 256
            IG E+W  AE A  GII ++QP  VS +RRR V+DY+QRLIR  LG EVFP+GSVPLKTY
Sbjct: 29   IGAEYWKRAEEATQGIIAQVQPTVVSGKRRRAVIDYVQRLIRGFLGCEVFPFGSVPLKTY 88

Query: 257  LPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAE---------- 406
            LPDGDIDLT FGG N+E+ LAND+ S+LE EE+N ++EFVVKDVQL+RAE          
Sbjct: 89   LPDGDIDLTAFGGLNIEEALANDVCSVLEREEQNKAAEFVVKDVQLIRAETSDLKVQVLH 148

Query: 407  ---------------------------VKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVD 505
                                       VKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQVD
Sbjct: 149  YSRSDGFEVVEAYFDAHALAGCVVLLLVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVD 208

Query: 506  RHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAV 685
              IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFH FHS+L+GPLAV
Sbjct: 209  VLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHRFHSSLNGPLAV 268

Query: 686  LFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSV 865
            L+KFLDYFS FDW+ Y +SLNGPVR+SSLP ++  +PE               C  MFS 
Sbjct: 269  LYKFLDYFSNFDWDNYCISLNGPVRISSLPEIMAGIPENGGHDLLLTDDFLKGCAEMFSA 328

Query: 866  PSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPA 1045
            PSRG + +SR F  KHLNI DPLK+ NNLGRSVSKGNFYRIRSAF+YGARKL  IL QP 
Sbjct: 329  PSRGYETSSRLFPSKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 388

Query: 1046 DSITNELHRFFSNTMARHGSGQRPDVQGFDP 1138
            ++I +E+ +FFSNT+ RHG GQRPDVQ   P
Sbjct: 389  ENIGDEIRKFFSNTLERHGKGQRPDVQDHLP 419


>ref|XP_006843704.1| hypothetical protein AMTR_s00007p00209910 [Amborella trichopoda]
            gi|548846072|gb|ERN05379.1| hypothetical protein
            AMTR_s00007p00209910 [Amborella trichopoda]
          Length = 904

 Score =  493 bits (1268), Expect = e-136
 Identities = 261/430 (60%), Positives = 309/430 (71%), Gaps = 2/430 (0%)
 Frame = +2

Query: 65   NPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVP 244
            +P  IG + W  AE     II KIQP  VSE+RR+ VVDY+ RLI   LG+ VFP+GSVP
Sbjct: 21   HPRAIGPDRWRRAEDRTCEIISKIQPTIVSEQRRKAVVDYVHRLIHGYLGSVVFPFGSVP 80

Query: 245  LKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKC 424
            LKTYLPDGDIDLT F      DTLAND++S+LE EE+N  +EF VKDVQ + AEVKLVKC
Sbjct: 81   LKTYLPDGDIDLTAFSNFQ-NDTLANDVRSVLEGEEQNKVAEFEVKDVQYIHAEVKLVKC 139

Query: 425  IVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGL 604
            +VQ+IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGL
Sbjct: 140  LVQNIVVDISFNQLGGLCTLCFLEQVDRMIGKDHLFKRSIILIKAWCYYESRILGAHHGL 199

Query: 605  ISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVV 784
            ISTYALETLVLYIFHLFHST +GPL VL++FLDYFSKFDW++Y +SLNGPV +SS P + 
Sbjct: 200  ISTYALETLVLYIFHLFHSTFNGPLEVLYRFLDYFSKFDWDSYCISLNGPVSISSFPELT 259

Query: 785  VEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSV 964
            VE PE               CV  +SVPS+  +   R F LKHLNI DPLK+ NNLGRSV
Sbjct: 260  VETPENDGGELLLSKEFLKDCVDSYSVPSKVSEGTPRSFPLKHLNIIDPLKENNNLGRSV 319

Query: 965  SKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSL 1144
            SKGNFYRIRSAF+YGARKL RILL   ++I +ELH+FF+NT+ RHGSGQRPDVQ     L
Sbjct: 320  SKGNFYRIRSAFTYGARKLGRILLLSEETIPDELHKFFTNTLDRHGSGQRPDVQ----EL 375

Query: 1145 ICTRPISAVPIPETRPCKIKNLYRHFDRSS--VLEHPSVLLDQDLNNLKISTDSSIRQTP 1318
            I     S   +P T   +    Y   DR S   L H S+ L+    +L+  +  S+  + 
Sbjct: 376  I----FSPEGLPLTPDIE---QYNEDDRYSGVSLYHSSLNLEAGYYSLQFDSSLSVESSG 428

Query: 1319 IAKEAVTAVG 1348
            + + A +  G
Sbjct: 429  VEQRAESLGG 438


>ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [Arabidopsis thaliana]
            gi|332645293|gb|AEE78814.1| PAP/OAS1 substrate-binding
            domain superfamily [Arabidopsis thaliana]
          Length = 829

 Score =  492 bits (1266), Expect = e-136
 Identities = 245/363 (67%), Positives = 287/363 (79%)
 Frame = +2

Query: 86   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 265
            E W   E A   II ++ P  VSE+RRR+V+ Y+Q+LIR  LG EV  +GSVPLKTYLPD
Sbjct: 32   ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91

Query: 266  GDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIVV 445
            GDIDLT FGG   E+ LA  + ++LE EE N SS+FVVKDVQL+RAEVKLVKC+VQ+IVV
Sbjct: 92   GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 446  DVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 625
            D+SFNQIGG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152  DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 626  TLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXX 805
            TLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW++Y +SLNGPV LSSLP +VVE PE  
Sbjct: 212  TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENG 271

Query: 806  XXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYR 985
                         C+ M+SVPSRG + N RGFQ KHLNI DPLK+ NNLGRSVSKGNFYR
Sbjct: 272  GEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 986  IRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICTRPIS 1165
            IRSAF+YGARKL ++ LQ  ++I++EL +FFSN + RHGSGQRPDV    P L   R  +
Sbjct: 332  IRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRYNA 391

Query: 1166 AVP 1174
             +P
Sbjct: 392  ILP 394


>ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
            lyrata] gi|297321933|gb|EFH52354.1| hypothetical protein
            ARALYDRAFT_485514 [Arabidopsis lyrata subsp. lyrata]
          Length = 829

 Score =  491 bits (1265), Expect = e-136
 Identities = 243/346 (70%), Positives = 278/346 (80%)
 Frame = +2

Query: 86   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 265
            E W   E A   II ++ P  VSE+RRR+V+ Y+Q+LIR  LG EV  +GSVPLKTYLPD
Sbjct: 32   EFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPD 91

Query: 266  GDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIVV 445
            GDIDLT FGG   E+ LA  + S+LE EE N SS FVVKDVQL+RAEVKLVKC+VQ+IVV
Sbjct: 92   GDIDLTAFGGLYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 446  DVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 625
            D+SFNQIGG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152  DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 626  TLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXX 805
            TLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE  
Sbjct: 212  TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENG 271

Query: 806  XXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYR 985
                         C+ M+SVPSRG + N RGFQ KHLNI DPLK+ NNLGRSVSKGNFYR
Sbjct: 272  GEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 986  IRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1123
            IRSAF+YGARKL +I LQ  ++I +EL +FFSN + RHGSGQRPDV
Sbjct: 332  IRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRHGSGQRPDV 377


>ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256025 [Solanum
            lycopersicum]
          Length = 849

 Score =  486 bits (1252), Expect = e-135
 Identities = 245/371 (66%), Positives = 278/371 (74%), Gaps = 13/371 (3%)
 Frame = +2

Query: 74   EIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKT 253
            +IG + WA AE     I+R++QP +VSE RR+ V++Y+Q LIR  LG EVFPYGSVPLKT
Sbjct: 25   DIGPQRWAVAEEVTQDILRRVQPTTVSENRRQRVIEYVQNLIRGSLGCEVFPYGSVPLKT 84

Query: 254  YLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQ 433
            YLPDGDIDLT F G   ED  A+D+ S LE  ++N  +EF VKDVQL+RAEVKLVKCIVQ
Sbjct: 85   YLPDGDIDLTAFVGKFFEDAFADDLVSTLEAADRNKDAEFSVKDVQLIRAEVKLVKCIVQ 144

Query: 434  DIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 613
            +IVVD+S NQIGGLCTL FLEQVDR IGKDHLFKRSIILIK WCYYESR+LGAHHGL ST
Sbjct: 145  NIVVDISLNQIGGLCTLGFLEQVDRLIGKDHLFKRSIILIKTWCYYESRLLGAHHGLFST 204

Query: 614  YALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEM 793
            YALETLVLYIFH FH+TLDGPL+VL+KFLDYF KFDW+ Y VSL GPV +SSLP  VV +
Sbjct: 205  YALETLVLYIFHFFHTTLDGPLSVLYKFLDYFGKFDWDNYYVSLTGPVHISSLPEYVVGV 264

Query: 794  PEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKG 973
            PE               C+  FSVPS+ GD NSR  Q K+LNI DPLK+ NNLGRSVSKG
Sbjct: 265  PENDGGNLLLSDDFIQYCLERFSVPSKDGDLNSRKIQHKYLNIIDPLKESNNLGRSVSKG 324

Query: 974  NFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQ--------- 1126
            NFYRIRSA +YGARKL  ILLQ  D+I  EL+ FF NTM RH SG+RPDVQ         
Sbjct: 325  NFYRIRSAINYGARKLESILLQSEDNIVEELYSFFPNTMDRHDSGERPDVQNPRNDFCLA 384

Query: 1127 ----GFDPSLI 1147
                 FDPS I
Sbjct: 385  FPAPNFDPSQI 395


>ref|XP_006403898.1| hypothetical protein EUTSA_v10010169mg [Eutrema salsugineum]
            gi|557105017|gb|ESQ45351.1| hypothetical protein
            EUTSA_v10010169mg [Eutrema salsugineum]
          Length = 695

 Score =  485 bits (1249), Expect = e-134
 Identities = 246/368 (66%), Positives = 284/368 (77%)
 Frame = +2

Query: 83   TEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLP 262
            +E W   E A   II ++ P  VSE+RRR+V+DY+QRLI+  LG EV  +GSVPLKTYLP
Sbjct: 31   SEFWKRVEEATREIIEQVHPTLVSEDRRRDVIDYMQRLIKMTLGCEVHSFGSVPLKTYLP 90

Query: 263  DGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIV 442
            DGDIDLT FGG   E+ LA+ + S+LE EE      FVVKDVQL+RAEVKLVKC+VQ+IV
Sbjct: 91   DGDIDLTAFGGPCHEEELAHKVYSVLEREEHIGGGPFVVKDVQLIRAEVKLVKCLVQNIV 150

Query: 443  VDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 622
            VD+SFNQ+GG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYAL
Sbjct: 151  VDISFNQLGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGALHGLISTYAL 210

Query: 623  ETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEX 802
            ETLVLYIFHLFHS+LDGPLAVL+KFLDYFSKFDW+ Y +SL+GPV LSSLP +VVE PE 
Sbjct: 211  ETLVLYIFHLFHSSLDGPLAVLYKFLDYFSKFDWDNYCISLSGPVCLSSLPDIVVETPEN 270

Query: 803  XXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFY 982
                          CV M+SVPSRG D N R F  KHLNI DPLK+ NNLGRSVSKGNFY
Sbjct: 271  GGQDLLLTSEFLKECVEMYSVPSRGFDSNPRLFPSKHLNIVDPLKENNNLGRSVSKGNFY 330

Query: 983  RIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICTRPI 1162
            RIRSAF+YGARKL +I+LQ  + I+ EL +FFSN + RHGSGQRPDV    P +   R  
Sbjct: 331  RIRSAFTYGARKLGQIILQSEEDISFELRKFFSNMLHRHGSGQRPDVLDAGPFVRYNR-Y 389

Query: 1163 SAVPIPET 1186
            SA+  P T
Sbjct: 390  SAISPPST 397


>ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  471 bits (1211), Expect = e-130
 Identities = 233/364 (64%), Positives = 279/364 (76%)
 Frame = +2

Query: 35   ASHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLG 214
            AS + SS+   P  I  + WA AERA   I+ K+QP   S   R+EV+DY+QRLI  CLG
Sbjct: 21   ASRSLSSSPPLPASIAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLG 80

Query: 215  AEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQL 394
             EVFPYGSVPLKTYL DGDIDLT    +NVE+ LA+D+ ++L+ EE+N ++EF VKD+Q 
Sbjct: 81   CEVFPYGSVPLKTYLLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQF 140

Query: 395  VRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYE 574
            + AEVKLVKC+V+DIV+D+SFNQ+GGL TLCFLEQVDR IGKDHLFKRSIILIK+WCYYE
Sbjct: 141  ITAEVKLVKCLVKDIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYE 200

Query: 575  SRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGP 754
            SRILGAHHGLISTYALE LVLYIFHLFH +LDGPLAVL++FLDYFSKFDW+ Y +SLNGP
Sbjct: 201  SRILGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGP 260

Query: 755  VRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPL 934
            V  SSLP +V E+PE               CV MFSVP RG + NSR F LKHLNI DPL
Sbjct: 261  VCKSSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPL 320

Query: 935  KDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQR 1114
            ++ NNLGRSV+KGNFYRIRSAF YG+ KL +IL  P + I +EL  FF++T+ RH S   
Sbjct: 321  RENNNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYM 380

Query: 1115 PDVQ 1126
             ++Q
Sbjct: 381  AEIQ 384


>ref|XP_006290592.1| hypothetical protein CARUB_v10016681mg [Capsella rubella]
            gi|482559299|gb|EOA23490.1| hypothetical protein
            CARUB_v10016681mg [Capsella rubella]
          Length = 851

 Score =  470 bits (1209), Expect = e-130
 Identities = 231/348 (66%), Positives = 278/348 (79%), Gaps = 2/348 (0%)
 Frame = +2

Query: 86   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 265
            E W   E A   II ++ P  V+E+RR+ V+ ++Q+++ + LG EV  +GSVPLKTYLPD
Sbjct: 34   EFWMRVEEATREIIEQVHPTHVAEDRRKNVITFVQKILGHKLGCEVHSFGSVPLKTYLPD 93

Query: 266  GDIDLTTFGG--ANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDI 439
            GDIDLT FG      E+ LA  + ++LE EE++ S++FVVKDVQL+RAEVKLVKC+VQ+I
Sbjct: 94   GDIDLTAFGRFIPEPEEDLAAKVFNVLEREERSGSADFVVKDVQLIRAEVKLVKCLVQNI 153

Query: 440  VVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 619
            VVD+SFNQIGG+CTLCFLE++DR IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYA
Sbjct: 154  VVDISFNQIGGICTLCFLEKIDRLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 213

Query: 620  LETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPE 799
            LETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE
Sbjct: 214  LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVEAPE 273

Query: 800  XXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNF 979
                           C+ M+SVPSRG + N R F  KHLNI DPLK+ NNLGRSVSKGNF
Sbjct: 274  NGGEDLLLTSEFLKECMEMYSVPSRGFETNPRVFPSKHLNIVDPLKENNNLGRSVSKGNF 333

Query: 980  YRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1123
            YRIRSAF+YGARKL +I+ Q  ++I++EL +FFSN + RHGSGQRPDV
Sbjct: 334  YRIRSAFTYGARKLGQIISQSEENISSELRKFFSNMLHRHGSGQRPDV 381


>ref|XP_006290591.1| hypothetical protein CARUB_v10016681mg [Capsella rubella]
            gi|482559298|gb|EOA23489.1| hypothetical protein
            CARUB_v10016681mg [Capsella rubella]
          Length = 827

 Score =  470 bits (1209), Expect = e-130
 Identities = 231/348 (66%), Positives = 278/348 (79%), Gaps = 2/348 (0%)
 Frame = +2

Query: 86   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 265
            E W   E A   II ++ P  V+E+RR+ V+ ++Q+++ + LG EV  +GSVPLKTYLPD
Sbjct: 34   EFWMRVEEATREIIEQVHPTHVAEDRRKNVITFVQKILGHKLGCEVHSFGSVPLKTYLPD 93

Query: 266  GDIDLTTFGG--ANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDI 439
            GDIDLT FG      E+ LA  + ++LE EE++ S++FVVKDVQL+RAEVKLVKC+VQ+I
Sbjct: 94   GDIDLTAFGRFIPEPEEDLAAKVFNVLEREERSGSADFVVKDVQLIRAEVKLVKCLVQNI 153

Query: 440  VVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 619
            VVD+SFNQIGG+CTLCFLE++DR IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYA
Sbjct: 154  VVDISFNQIGGICTLCFLEKIDRLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 213

Query: 620  LETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPE 799
            LETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE
Sbjct: 214  LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVEAPE 273

Query: 800  XXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNF 979
                           C+ M+SVPSRG + N R F  KHLNI DPLK+ NNLGRSVSKGNF
Sbjct: 274  NGGEDLLLTSEFLKECMEMYSVPSRGFETNPRVFPSKHLNIVDPLKENNNLGRSVSKGNF 333

Query: 980  YRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1123
            YRIRSAF+YGARKL +I+ Q  ++I++EL +FFSN + RHGSGQRPDV
Sbjct: 334  YRIRSAFTYGARKLGQIISQSEENISSELRKFFSNMLHRHGSGQRPDV 381


>gb|EOY34688.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 2
            [Theobroma cacao]
          Length = 836

 Score =  469 bits (1208), Expect = e-130
 Identities = 227/353 (64%), Positives = 274/353 (77%)
 Frame = +2

Query: 68   PFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPL 247
            P  I  E W +AE  A  I+  +QP   ++ +R+E+V+Y+QRLI++ LG +VFPYGSVPL
Sbjct: 39   PCSIARESWDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPL 98

Query: 248  KTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCI 427
            KTYLPDGDIDLTT     +EDTL +D+ +IL  EE N  + + VKDV  + AEVKLVKC+
Sbjct: 99   KTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCL 158

Query: 428  VQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 607
            VQDIVVD+SFNQ+GGLCTLCFLEQ+DR +GKDHLFKRSIILIKAWCYYESRILGAHHGLI
Sbjct: 159  VQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLI 218

Query: 608  STYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVV 787
            STYALETLVLYIFHLFHS+L GP+AVL++FLDYFSKFDWE Y +SLNGPV  SSLP +V 
Sbjct: 219  STYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVA 278

Query: 788  EMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVS 967
            E+PE               C+ MFSVPS+G + NSR F LKHLNI DPLK+ NNLGRSV+
Sbjct: 279  EVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVN 338

Query: 968  KGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQ 1126
            +GN+YRIRSAF YGA KL +IL+ P + I +EL +FF+NT+ RHGS     +Q
Sbjct: 339  RGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHGSNHLTGMQ 391


>gb|EOY34687.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 836

 Score =  469 bits (1208), Expect = e-130
 Identities = 227/353 (64%), Positives = 274/353 (77%)
 Frame = +2

Query: 68   PFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPL 247
            P  I  E W +AE  A  I+  +QP   ++ +R+E+V+Y+QRLI++ LG +VFPYGSVPL
Sbjct: 39   PCSIARESWDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPL 98

Query: 248  KTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCI 427
            KTYLPDGDIDLTT     +EDTL +D+ +IL  EE N  + + VKDV  + AEVKLVKC+
Sbjct: 99   KTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCL 158

Query: 428  VQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 607
            VQDIVVD+SFNQ+GGLCTLCFLEQ+DR +GKDHLFKRSIILIKAWCYYESRILGAHHGLI
Sbjct: 159  VQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLI 218

Query: 608  STYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVV 787
            STYALETLVLYIFHLFHS+L GP+AVL++FLDYFSKFDWE Y +SLNGPV  SSLP +V 
Sbjct: 219  STYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVA 278

Query: 788  EMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVS 967
            E+PE               C+ MFSVPS+G + NSR F LKHLNI DPLK+ NNLGRSV+
Sbjct: 279  EVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVN 338

Query: 968  KGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQ 1126
            +GN+YRIRSAF YGA KL +IL+ P + I +EL +FF+NT+ RHGS     +Q
Sbjct: 339  RGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHGSNHLTGMQ 391


>ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
          Length = 816

 Score =  462 bits (1189), Expect = e-127
 Identities = 222/308 (72%), Positives = 259/308 (84%)
 Frame = +2

Query: 218  EVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLANDMKSILEEEEKNASSEFVVKDVQLV 397
            +VFP+GSVPLKTYLPDGDIDLT  GG+NVE+ LA+D+ S+L  E++N ++EFVVKDVQL+
Sbjct: 4    QVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLI 63

Query: 398  RAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYES 577
            RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE++DR IGKDHLFKRSIILIKAWCYYES
Sbjct: 64   RAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYES 123

Query: 578  RILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPV 757
            RILGAHHGLISTYALETLVLYIFHLFHS L+GPL VL+KFLDYFSKFDW+ Y +SLNGPV
Sbjct: 124  RILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPV 183

Query: 758  RLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLK 937
            R+SSLP +V E P+               C+  FSVP+RG + NSR F +KHLNI DPLK
Sbjct: 184  RISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLK 243

Query: 938  DINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRP 1117
            + NNLGRSVSKGNFYRIRSAFSYGARKL  IL  P D++ +E+ +FFSNT+ RHG GQRP
Sbjct: 244  ENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQRP 303

Query: 1118 DVQGFDPS 1141
            DVQ  DP+
Sbjct: 304  DVQ--DPA 309

