BLASTX nr result

ID: Rehmannia24_contig00016569

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00016569
         (1282 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citr...   538   e-150
gb|EOY04484.1| NT domain of poly(A) polymerase and terminal urid...   533   e-149
gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus pe...   523   e-146
ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304...   521   e-145
ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207...   516   e-144
ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258...   511   e-142
emb|CBI18050.3| unnamed protein product [Vitis vinifera]              511   e-142
ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589...   505   e-140
gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]     494   e-137
ref|XP_006843704.1| hypothetical protein AMTR_s00007p00209910 [A...   489   e-136
ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [...   489   e-136
ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arab...   489   e-135
ref|XP_006403898.1| hypothetical protein EUTSA_v10010169mg [Eutr...   484   e-134
ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256...   484   e-134
gb|EOY34688.1| NT domain of poly(A) polymerase and terminal urid...   469   e-130
gb|EOY34687.1| NT domain of poly(A) polymerase and terminal urid...   469   e-130
ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253...   468   e-129
ref|XP_006290592.1| hypothetical protein CARUB_v10016681mg [Caps...   468   e-129
ref|XP_006290591.1| hypothetical protein CARUB_v10016681mg [Caps...   468   e-129
ref|XP_006575451.1| PREDICTED: uncharacterized protein LOC100814...   461   e-127

>ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citrus clementina]
            gi|568855155|ref|XP_006481174.1| PREDICTED:
            uncharacterized protein LOC102622468 [Citrus sinensis]
            gi|557531615|gb|ESR42798.1| hypothetical protein
            CICLE_v10011044mg [Citrus clementina]
          Length = 882

 Score =  538 bits (1387), Expect = e-150
 Identities = 265/363 (73%), Positives = 300/363 (82%)
 Frame = +1

Query: 31   SHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGA 210
            S ++SS   N   IG E+W  AE A   II ++QP  VSEERR+ V+DY+QRLIRN LG 
Sbjct: 21   SSSSSSVPSNQTAIGAEYWQRAEEATQAIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGC 80

Query: 211  EVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLV 390
            EVFP+GSVPLKTYLPDGDIDLT FGG NVE+ L ND+ S+LE E++N ++EFVVKD QL+
Sbjct: 81   EVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLI 140

Query: 391  RAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYES 570
            RAEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLEQVDR IGKDHLFKRSIILIKAWCYYES
Sbjct: 141  RAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYES 200

Query: 571  RILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPV 750
            RILGAHHGLISTYALETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW++Y +SLNGPV
Sbjct: 201  RILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPV 260

Query: 751  RLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLK 930
            R+SSLP VVVE PE               CV  FSVPSRG D NSR F  KHLNI DPLK
Sbjct: 261  RISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLK 320

Query: 931  DINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRP 1110
            + NNLGRSVSKGNFYRIRSAF+YGARKL  IL QP +S+T+EL +FFSNT+ RHGSGQRP
Sbjct: 321  ENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRP 380

Query: 1111 DVQ 1119
            DVQ
Sbjct: 381  DVQ 383


>gb|EOY04484.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative [Theobroma
            cacao]
          Length = 890

 Score =  533 bits (1373), Expect = e-149
 Identities = 265/379 (69%), Positives = 305/379 (80%)
 Frame = +1

Query: 1    GAAAELKNVASHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYI 180
            G A+E +   S ++SS+  N   I  E+W  AE A  GII ++QP  VSEERR+ V+DY+
Sbjct: 16   GVASEER---SSSSSSSSSNQAGIAAEYWKKAEEATQGIIAQVQPTVVSEERRKAVIDYV 72

Query: 181  QRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASS 360
            QRLI N LG  VFP+GSVPLKTYLPDGDIDLT FGG N E+ L ND+ S+LE E+ N ++
Sbjct: 73   QRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDVCSVLEREDHNRAA 132

Query: 361  EFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSII 540
            EFVVKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDR IGKDHLFKRSII
Sbjct: 133  EFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKVDRRIGKDHLFKRSII 192

Query: 541  LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWE 720
            LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHS+LDGPLAVL+KFLDYFSKFDW+
Sbjct: 193  LIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLDGPLAVLYKFLDYFSKFDWD 252

Query: 721  TYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQL 900
             Y +SLNGP+ +SSLP VVVE PE               CV MFSVPSRG + NSR F  
Sbjct: 253  NYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKECVEMFSVPSRGFETNSRTFPQ 312

Query: 901  KHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNT 1080
            KHLNI DPL++ NNLGRSVSKGNFYRIRSAF+YGARKL +IL Q  +S+ +EL +FFSNT
Sbjct: 313  KHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLGKILSQAEESMADELRKFFSNT 372

Query: 1081 MARHGSGQRPDVQGFDPSL 1137
            + RHGSGQRPDVQ   PSL
Sbjct: 373  LDRHGSGQRPDVQDCIPSL 391


>gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus persica]
          Length = 742

 Score =  523 bits (1348), Expect = e-146
 Identities = 253/354 (71%), Positives = 291/354 (82%)
 Frame = +1

Query: 70   IGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTY 249
            I  E+W  AE A  G+I ++QP  VSE RR+ V+DY+QRLIR CLG EVFP+GSVPLKTY
Sbjct: 48   ISAEYWKKAEEATQGVIAQVQPTDVSERRRKAVIDYVQRLIRGCLGCEVFPFGSVPLKTY 107

Query: 250  LPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQD 429
            LPDGDIDLT FGG NVE+ L ND+ S+LE E +N ++EF+VKDVQL+RAEVKLVKC+VQ+
Sbjct: 108  LPDGDIDLTAFGGINVEEALANDVCSVLEREVQNGTAEFMVKDVQLIRAEVKLVKCLVQN 167

Query: 430  IVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 609
            IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY
Sbjct: 168  IVVDISFNQLGGLCTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 227

Query: 610  ALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMP 789
            ALETLVLYIFHLFH++L+GPLAVL+KFLDYFSKFDW+ Y +SL+GPVR+SSLP ++VE P
Sbjct: 228  ALETLVLYIFHLFHASLNGPLAVLYKFLDYFSKFDWDNYCISLSGPVRISSLPELLVETP 287

Query: 790  EXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGN 969
            E               CV MFSVPSRG + N R F  KH NI DPLKD NNLGRSVSKGN
Sbjct: 288  ENGGNDLLLSNDFLKECVQMFSVPSRGYETNYRTFPPKHFNIVDPLKDNNNLGRSVSKGN 347

Query: 970  FYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDP 1131
            FYRIRSAF+YGARKL RIL Q  D+I +E+ +FF+NT+ RHG GQRPDVQ   P
Sbjct: 348  FYRIRSAFTYGARKLGRILSQTEDNIDDEIRKFFANTLDRHGGGQRPDVQDLVP 401


>ref|XP_004302534.1| PREDICTED: uncharacterized protein LOC101304393 [Fragaria vesca
            subsp. vesca]
          Length = 878

 Score =  521 bits (1341), Expect = e-145
 Identities = 258/385 (67%), Positives = 307/385 (79%), Gaps = 9/385 (2%)
 Frame = +1

Query: 1    GAAAELKNVASHATS--SADQNPFEIGT-EHWATAERAAHGIIRKIQPNSVSEERRREVV 171
            GA  E +  +S ++S  S+  +   + T E+W  AE A  G+I ++QP  VSE RRR V+
Sbjct: 13   GAVLEDRPTSSSSSSLPSSSSSLLSVSTAEYWRRAEAATQGVIAQVQPTDVSERRRRAVI 72

Query: 172  DYIQRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKN 351
            DY+QRLIR  LG EVFP+GSVPLKTYLPDGDIDLT FGG N+++ L ND+ ++LE E++N
Sbjct: 73   DYVQRLIRGFLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNIDEVLANDVCAVLEREDQN 132

Query: 352  ASSEFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKR 531
             ++EF+VKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKR
Sbjct: 133  MAAEFMVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLIGKDHLFKR 192

Query: 532  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKF 711
            SIILIKAWCYYESRILGAHHGLISTY LETLVL+IFHLFH++L+GPLAVL+KFLDYFSKF
Sbjct: 193  SIILIKAWCYYESRILGAHHGLISTYGLETLVLFIFHLFHASLNGPLAVLYKFLDYFSKF 252

Query: 712  DWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRG 891
            DW+ Y +SLNGPVR+SSLP ++ EMP+               CV  FSVPSRG + N R 
Sbjct: 253  DWDNYCISLNGPVRISSLPELLTEMPDNGGGDLLLSNEFLRSCVDRFSVPSRGYETNYRT 312

Query: 892  FQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFF 1071
            FQ KHLNI DPLK+ NNLGRSVSKGNFYRIRSAF+YGARKL RIL QP ++I +E  +FF
Sbjct: 313  FQPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILSQPEENIDDEFRKFF 372

Query: 1072 SNTMARHGSGQRPDVQ------GFD 1128
            SNT+ RHGSGQRPDVQ      GFD
Sbjct: 373  SNTLDRHGSGQRPDVQDPIPFSGFD 397


>ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  516 bits (1330), Expect = e-144
 Identities = 255/381 (66%), Positives = 301/381 (79%), Gaps = 3/381 (0%)
 Frame = +1

Query: 1    GAAAELKNVASHATSSAD---QNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVV 171
            GA AE K  +S  +S +     NP  IG ++W  AE A   II ++QP  VSE RR+ V+
Sbjct: 13   GAVAEDKPSSSSFSSFSSLLPSNPTPIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVI 72

Query: 172  DYIQRLIRNCLGAEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKN 351
            DY+QRLIR  L  EVFP+GSVPLKTYLPDGDIDLT  GG+NVE+ L +D+ S+L  E++N
Sbjct: 73   DYVQRLIRGRLRCEVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQN 132

Query: 352  ASSEFVVKDVQLVRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKR 531
             ++EFVVKDVQL+RAEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE++DR IGKDHLFKR
Sbjct: 133  GAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKR 192

Query: 532  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKF 711
            SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHS L+GPL VL+KFLDYFSKF
Sbjct: 193  SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKF 252

Query: 712  DWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRG 891
            DW+ Y +SLNGPVR+SSLP +V E P+               C+  FSVP+RG + NSR 
Sbjct: 253  DWDNYCISLNGPVRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRA 312

Query: 892  FQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFF 1071
            F +KHLNI DPLK+ NNLGRSVSKGNFYRIRSAFSYGARKL  IL  P D++ +E+ +FF
Sbjct: 313  FPIKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFF 372

Query: 1072 SNTMARHGSGQRPDVQGFDPS 1134
            SNT+ RHG GQRPDVQ  DP+
Sbjct: 373  SNTLDRHGGGQRPDVQ--DPA 391


>ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
          Length = 884

 Score =  511 bits (1315), Expect = e-142
 Identities = 254/358 (70%), Positives = 288/358 (80%)
 Frame = +1

Query: 43   SSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFP 222
            S +  NP  IG   WA AE     II ++QP  VSEERR+EVVDY+Q LIR  +G EVFP
Sbjct: 25   SLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFP 84

Query: 223  YGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEV 402
            +GSVPLKTYLPDGDIDLT FGG  VEDTL  ++ S+LE E++N ++EFVVKDVQL+ AEV
Sbjct: 85   FGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEV 144

Query: 403  KLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILG 582
            KLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQ+DR IGKDHLFKRSIILIKAWCYYESRILG
Sbjct: 145  KLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILG 204

Query: 583  AHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSS 762
            AHHGLISTYALETLVLYIF LFHS L+GPLAVL+KFLDYFSKFDW+ Y VSLNGPVR+SS
Sbjct: 205  AHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISS 264

Query: 763  LPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINN 942
            LP ++ E PE               C+  FSVPSRG + NSR F  KH NI DPLK+ NN
Sbjct: 265  LPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNN 324

Query: 943  LGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1116
            LGRSVSKGNFYRIRSAF+YGARKL RILLQP D I+ EL +FF+NT+ RHG GQRPDV
Sbjct: 325  LGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDV 382


>emb|CBI18050.3| unnamed protein product [Vitis vinifera]
          Length = 824

 Score =  511 bits (1315), Expect = e-142
 Identities = 254/358 (70%), Positives = 288/358 (80%)
 Frame = +1

Query: 43   SSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFP 222
            S +  NP  IG   WA AE     II ++QP  VSEERR+EVVDY+Q LIR  +G EVFP
Sbjct: 25   SLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFP 84

Query: 223  YGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEV 402
            +GSVPLKTYLPDGDIDLT FGG  VEDTL  ++ S+LE E++N ++EFVVKDVQL+ AEV
Sbjct: 85   FGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEV 144

Query: 403  KLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILG 582
            KLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQ+DR IGKDHLFKRSIILIKAWCYYESRILG
Sbjct: 145  KLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILG 204

Query: 583  AHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSS 762
            AHHGLISTYALETLVLYIF LFHS L+GPLAVL+KFLDYFSKFDW+ Y VSLNGPVR+SS
Sbjct: 205  AHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISS 264

Query: 763  LPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINN 942
            LP ++ E PE               C+  FSVPSRG + NSR F  KH NI DPLK+ NN
Sbjct: 265  LPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNN 324

Query: 943  LGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1116
            LGRSVSKGNFYRIRSAF+YGARKL RILLQP D I+ EL +FF+NT+ RHG GQRPDV
Sbjct: 325  LGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDV 382


>ref|XP_006346681.1| PREDICTED: uncharacterized protein LOC102589320 isoform X1 [Solanum
            tuberosum] gi|565359810|ref|XP_006346682.1| PREDICTED:
            uncharacterized protein LOC102589320 isoform X2 [Solanum
            tuberosum]
          Length = 852

 Score =  505 bits (1300), Expect = e-140
 Identities = 254/376 (67%), Positives = 287/376 (76%)
 Frame = +1

Query: 67   EIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKT 246
            +IG E WA AE+    I+R++QP +VSE RRR V++Y+Q L+R  L  EVFPYGSVPLKT
Sbjct: 25   DIGPERWAVAEKVTQNILRRVQPTTVSENRRRSVIEYVQNLVRGSLRCEVFPYGSVPLKT 84

Query: 247  YLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQ 426
            YLPDGDIDLT F G + ED   +DM S LE E++N  +EF VKDVQL+RAEVKLVKCIVQ
Sbjct: 85   YLPDGDIDLTAFVGKDFEDAFADDMVSTLEAEDRNKDAEFAVKDVQLIRAEVKLVKCIVQ 144

Query: 427  DIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 606
            +IVVD+S NQIGGLCTL FLEQVDR IGKDHLFKRSIILIK WCYYESR+LGAHHGL ST
Sbjct: 145  NIVVDISLNQIGGLCTLGFLEQVDRLIGKDHLFKRSIILIKTWCYYESRLLGAHHGLFST 204

Query: 607  YALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEM 786
            YALETLVLYIFH FH+TLDGPLAVL+KFLDYF KFDW+ Y VSL GPVR+SSLP  VVE+
Sbjct: 205  YALETLVLYIFHFFHTTLDGPLAVLYKFLDYFGKFDWDNYYVSLTGPVRISSLPEYVVEV 264

Query: 787  PEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKG 966
            PE               C+  FSVPS+GGD NSR  Q K+LNI DPLK+ NNLGRSVSKG
Sbjct: 265  PENDGGDVLLSNDFIRYCLERFSVPSKGGDLNSRKIQHKYLNIIDPLKESNNLGRSVSKG 324

Query: 967  NFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICT 1146
            NFYRIRSA +YGARKL  ILLQ  D+I  EL+RFF NTM RH SG+RPDVQ  DPS    
Sbjct: 325  NFYRIRSAINYGARKLESILLQSEDNIVEELYRFFPNTMDRHDSGERPDVQ--DPSNDFC 382

Query: 1147 RPISAVPIPETRPCKI 1194
                A P P   P +I
Sbjct: 383  LASPASPAPNFDPSQI 398


>gb|EXB42369.1| hypothetical protein L484_021961 [Morus notabilis]
          Length = 928

 Score =  494 bits (1273), Expect = e-137
 Identities = 250/391 (63%), Positives = 288/391 (73%), Gaps = 37/391 (9%)
 Frame = +1

Query: 70   IGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTY 249
            IG E+W  AE A  GII ++QP  VS +RRR V+DY+QRLIR  LG EVFP+GSVPLKTY
Sbjct: 29   IGAEYWKRAEEATQGIIAQVQPTVVSGKRRRAVIDYVQRLIRGFLGCEVFPFGSVPLKTY 88

Query: 250  LPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAE---------- 399
            LPDGDIDLT FGG N+E+ L ND+ S+LE EE+N ++EFVVKDVQL+RAE          
Sbjct: 89   LPDGDIDLTAFGGLNIEEALANDVCSVLEREEQNKAAEFVVKDVQLIRAETSDLKVQVLH 148

Query: 400  ---------------------------VKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVD 498
                                       VKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEQVD
Sbjct: 149  YSRSDGFEVVEAYFDAHALAGCVVLLLVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVD 208

Query: 499  RHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAV 678
              IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFH FHS+L+GPLAV
Sbjct: 209  VLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHRFHSSLNGPLAV 268

Query: 679  LFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSV 858
            L+KFLDYFS FDW+ Y +SLNGPVR+SSLP ++  +PE               C  MFS 
Sbjct: 269  LYKFLDYFSNFDWDNYCISLNGPVRISSLPEIMAGIPENGGHDLLLTDDFLKGCAEMFSA 328

Query: 859  PSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPA 1038
            PSRG + +SR F  KHLNI DPLK+ NNLGRSVSKGNFYRIRSAF+YGARKL  IL QP 
Sbjct: 329  PSRGYETSSRLFPSKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 388

Query: 1039 DSITNELHRFFSNTMARHGSGQRPDVQGFDP 1131
            ++I +E+ +FFSNT+ RHG GQRPDVQ   P
Sbjct: 389  ENIGDEIRKFFSNTLERHGKGQRPDVQDHLP 419


>ref|XP_006843704.1| hypothetical protein AMTR_s00007p00209910 [Amborella trichopoda]
            gi|548846072|gb|ERN05379.1| hypothetical protein
            AMTR_s00007p00209910 [Amborella trichopoda]
          Length = 904

 Score =  489 bits (1260), Expect = e-136
 Identities = 243/354 (68%), Positives = 279/354 (78%)
 Frame = +1

Query: 58   NPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVP 237
            +P  IG + W  AE     II KIQP  VSE+RR+ VVDY+ RLI   LG+ VFP+GSVP
Sbjct: 21   HPRAIGPDRWRRAEDRTCEIISKIQPTIVSEQRRKAVVDYVHRLIHGYLGSVVFPFGSVP 80

Query: 238  LKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKC 417
            LKTYLPDGDIDLT F      DTL ND++S+LE EE+N  +EF VKDVQ + AEVKLVKC
Sbjct: 81   LKTYLPDGDIDLTAFSNFQ-NDTLANDVRSVLEGEEQNKVAEFEVKDVQYIHAEVKLVKC 139

Query: 418  IVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGL 597
            +VQ+IVVD+SFNQ+GGLCTLCFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGL
Sbjct: 140  LVQNIVVDISFNQLGGLCTLCFLEQVDRMIGKDHLFKRSIILIKAWCYYESRILGAHHGL 199

Query: 598  ISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVV 777
            ISTYALETLVLYIFHLFHST +GPL VL++FLDYFSKFDW++Y +SLNGPV +SS P + 
Sbjct: 200  ISTYALETLVLYIFHLFHSTFNGPLEVLYRFLDYFSKFDWDSYCISLNGPVSISSFPELT 259

Query: 778  VEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSV 957
            VE PE               CV  +SVPS+  +   R F LKHLNI DPLK+ NNLGRSV
Sbjct: 260  VETPENDGGELLLSKEFLKDCVDSYSVPSKVSEGTPRSFPLKHLNIIDPLKENNNLGRSV 319

Query: 958  SKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQ 1119
            SKGNFYRIRSAF+YGARKL RILL   ++I +ELH+FF+NT+ RHGSGQRPDVQ
Sbjct: 320  SKGNFYRIRSAFTYGARKLGRILLLSEETIPDELHKFFTNTLDRHGSGQRPDVQ 373


>ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [Arabidopsis thaliana]
            gi|332645293|gb|AEE78814.1| PAP/OAS1 substrate-binding
            domain superfamily [Arabidopsis thaliana]
          Length = 829

 Score =  489 bits (1260), Expect = e-136
 Identities = 244/363 (67%), Positives = 286/363 (78%)
 Frame = +1

Query: 79   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 258
            E W   E A   II ++ P  VSE+RRR+V+ Y+Q+LIR  LG EV  +GSVPLKTYLPD
Sbjct: 32   ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91

Query: 259  GDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIVV 438
            GDIDLT FGG   E+ L   + ++LE EE N SS+FVVKDVQL+RAEVKLVKC+VQ+IVV
Sbjct: 92   GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 439  DVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 618
            D+SFNQIGG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152  DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 619  TLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXX 798
            TLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW++Y +SLNGPV LSSLP +VVE PE  
Sbjct: 212  TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENG 271

Query: 799  XXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYR 978
                         C+ M+SVPSRG + N RGFQ KHLNI DPLK+ NNLGRSVSKGNFYR
Sbjct: 272  GEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 979  IRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICTRPIS 1158
            IRSAF+YGARKL ++ LQ  ++I++EL +FFSN + RHGSGQRPDV    P L   R  +
Sbjct: 332  IRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRYNA 391

Query: 1159 AVP 1167
             +P
Sbjct: 392  ILP 394


>ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
            lyrata] gi|297321933|gb|EFH52354.1| hypothetical protein
            ARALYDRAFT_485514 [Arabidopsis lyrata subsp. lyrata]
          Length = 829

 Score =  489 bits (1259), Expect = e-135
 Identities = 242/346 (69%), Positives = 277/346 (80%)
 Frame = +1

Query: 79   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 258
            E W   E A   II ++ P  VSE+RRR+V+ Y+Q+LIR  LG EV  +GSVPLKTYLPD
Sbjct: 32   EFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPD 91

Query: 259  GDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIVV 438
            GDIDLT FGG   E+ L   + S+LE EE N SS FVVKDVQL+RAEVKLVKC+VQ+IVV
Sbjct: 92   GDIDLTAFGGLYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 439  DVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 618
            D+SFNQIGG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152  DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 619  TLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEXX 798
            TLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE  
Sbjct: 212  TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENG 271

Query: 799  XXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFYR 978
                         C+ M+SVPSRG + N RGFQ KHLNI DPLK+ NNLGRSVSKGNFYR
Sbjct: 272  GEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 979  IRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1116
            IRSAF+YGARKL +I LQ  ++I +EL +FFSN + RHGSGQRPDV
Sbjct: 332  IRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRHGSGQRPDV 377


>ref|XP_006403898.1| hypothetical protein EUTSA_v10010169mg [Eutrema salsugineum]
            gi|557105017|gb|ESQ45351.1| hypothetical protein
            EUTSA_v10010169mg [Eutrema salsugineum]
          Length = 695

 Score =  484 bits (1246), Expect = e-134
 Identities = 248/379 (65%), Positives = 286/379 (75%)
 Frame = +1

Query: 76   TEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLP 255
            +E W   E A   II ++ P  VSE+RRR+V+DY+QRLI+  LG EV  +GSVPLKTYLP
Sbjct: 31   SEFWKRVEEATREIIEQVHPTLVSEDRRRDVIDYMQRLIKMTLGCEVHSFGSVPLKTYLP 90

Query: 256  DGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDIV 435
            DGDIDLT FGG   E+ L + + S+LE EE      FVVKDVQL+RAEVKLVKC+VQ+IV
Sbjct: 91   DGDIDLTAFGGPCHEEELAHKVYSVLEREEHIGGGPFVVKDVQLIRAEVKLVKCLVQNIV 150

Query: 436  VDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 615
            VD+SFNQ+GG+CTLCFLE++D  IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYAL
Sbjct: 151  VDISFNQLGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGALHGLISTYAL 210

Query: 616  ETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPEX 795
            ETLVLYIFHLFHS+LDGPLAVL+KFLDYFSKFDW+ Y +SL+GPV LSSLP +VVE PE 
Sbjct: 211  ETLVLYIFHLFHSSLDGPLAVLYKFLDYFSKFDWDNYCISLSGPVCLSSLPDIVVETPEN 270

Query: 796  XXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNFY 975
                          CV M+SVPSRG D N R F  KHLNI DPLK+ NNLGRSVSKGNFY
Sbjct: 271  GGQDLLLTSEFLKECVEMYSVPSRGFDSNPRLFPSKHLNIVDPLKENNNLGRSVSKGNFY 330

Query: 976  RIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLICTRPI 1155
            RIRSAF+YGARKL +I+LQ  + I+ EL +FFSN + RHGSGQRPDV    P +   R  
Sbjct: 331  RIRSAFTYGARKLGQIILQSEEDISFELRKFFSNMLHRHGSGQRPDVLDAGPFVRYNR-Y 389

Query: 1156 SAVPIPETRPCKINNLHTH 1212
            SA+  P T     NN   H
Sbjct: 390  SAISPPST----ANNFQDH 404


>ref|XP_004246272.1| PREDICTED: uncharacterized protein LOC101256025 [Solanum
            lycopersicum]
          Length = 849

 Score =  484 bits (1246), Expect = e-134
 Identities = 244/371 (65%), Positives = 277/371 (74%), Gaps = 13/371 (3%)
 Frame = +1

Query: 67   EIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKT 246
            +IG + WA AE     I+R++QP +VSE RR+ V++Y+Q LIR  LG EVFPYGSVPLKT
Sbjct: 25   DIGPQRWAVAEEVTQDILRRVQPTTVSENRRQRVIEYVQNLIRGSLGCEVFPYGSVPLKT 84

Query: 247  YLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQ 426
            YLPDGDIDLT F G   ED   +D+ S LE  ++N  +EF VKDVQL+RAEVKLVKCIVQ
Sbjct: 85   YLPDGDIDLTAFVGKFFEDAFADDLVSTLEAADRNKDAEFSVKDVQLIRAEVKLVKCIVQ 144

Query: 427  DIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 606
            +IVVD+S NQIGGLCTL FLEQVDR IGKDHLFKRSIILIK WCYYESR+LGAHHGL ST
Sbjct: 145  NIVVDISLNQIGGLCTLGFLEQVDRLIGKDHLFKRSIILIKTWCYYESRLLGAHHGLFST 204

Query: 607  YALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEM 786
            YALETLVLYIFH FH+TLDGPL+VL+KFLDYF KFDW+ Y VSL GPV +SSLP  VV +
Sbjct: 205  YALETLVLYIFHFFHTTLDGPLSVLYKFLDYFGKFDWDNYYVSLTGPVHISSLPEYVVGV 264

Query: 787  PEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKG 966
            PE               C+  FSVPS+ GD NSR  Q K+LNI DPLK+ NNLGRSVSKG
Sbjct: 265  PENDGGNLLLSDDFIQYCLERFSVPSKDGDLNSRKIQHKYLNIIDPLKESNNLGRSVSKG 324

Query: 967  NFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQ--------- 1119
            NFYRIRSA +YGARKL  ILLQ  D+I  EL+ FF NTM RH SG+RPDVQ         
Sbjct: 325  NFYRIRSAINYGARKLESILLQSEDNIVEELYSFFPNTMDRHDSGERPDVQNPRNDFCLA 384

Query: 1120 ----GFDPSLI 1140
                 FDPS I
Sbjct: 385  FPAPNFDPSQI 395


>gb|EOY34688.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 2
            [Theobroma cacao]
          Length = 836

 Score =  469 bits (1208), Expect = e-130
 Identities = 232/381 (60%), Positives = 281/381 (73%)
 Frame = +1

Query: 61   PFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPL 240
            P  I  E W +AE  A  I+  +QP   ++ +R+E+V+Y+QRLI++ LG +VFPYGSVPL
Sbjct: 39   PCSIARESWDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPL 98

Query: 241  KTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCI 420
            KTYLPDGDIDLTT     +EDTL +D+ +IL  EE N  + + VKDV  + AEVKLVKC+
Sbjct: 99   KTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCL 158

Query: 421  VQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 600
            VQDIVVD+SFNQ+GGLCTLCFLEQ+DR +GKDHLFKRSIILIKAWCYYESRILGAHHGLI
Sbjct: 159  VQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLI 218

Query: 601  STYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVV 780
            STYALETLVLYIFHLFHS+L GP+AVL++FLDYFSKFDWE Y +SLNGPV  SSLP +V 
Sbjct: 219  STYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVA 278

Query: 781  EMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVS 960
            E+PE               C+ MFSVPS+G + NSR F LKHLNI DPLK+ NNLGRSV+
Sbjct: 279  EVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVN 338

Query: 961  KGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLI 1140
            +GN+YRIRSAF YGA KL +IL+ P + I +EL +FF+NT+ RHGS     +Q    +  
Sbjct: 339  RGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHGSNHLTGMQNLPSTSD 398

Query: 1141 CTRPISAVPIPETRPCKINNL 1203
                   +P P    C  N L
Sbjct: 399  ARGYDHVMPSPCASMCSGNYL 419


>gb|EOY34687.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 836

 Score =  469 bits (1208), Expect = e-130
 Identities = 232/381 (60%), Positives = 281/381 (73%)
 Frame = +1

Query: 61   PFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPL 240
            P  I  E W +AE  A  I+  +QP   ++ +R+E+V+Y+QRLI++ LG +VFPYGSVPL
Sbjct: 39   PCSIARESWDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPL 98

Query: 241  KTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCI 420
            KTYLPDGDIDLTT     +EDTL +D+ +IL  EE N  + + VKDV  + AEVKLVKC+
Sbjct: 99   KTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCL 158

Query: 421  VQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 600
            VQDIVVD+SFNQ+GGLCTLCFLEQ+DR +GKDHLFKRSIILIKAWCYYESRILGAHHGLI
Sbjct: 159  VQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLI 218

Query: 601  STYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVV 780
            STYALETLVLYIFHLFHS+L GP+AVL++FLDYFSKFDWE Y +SLNGPV  SSLP +V 
Sbjct: 219  STYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVA 278

Query: 781  EMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVS 960
            E+PE               C+ MFSVPS+G + NSR F LKHLNI DPLK+ NNLGRSV+
Sbjct: 279  EVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVN 338

Query: 961  KGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDVQGFDPSLI 1140
            +GN+YRIRSAF YGA KL +IL+ P + I +EL +FF+NT+ RHGS     +Q    +  
Sbjct: 339  RGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHGSNHLTGMQNLPSTSD 398

Query: 1141 CTRPISAVPIPETRPCKINNL 1203
                   +P P    C  N L
Sbjct: 399  ARGYDHVMPSPCASMCSGNYL 419


>ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  468 bits (1205), Expect = e-129
 Identities = 232/364 (63%), Positives = 278/364 (76%)
 Frame = +1

Query: 28   ASHATSSADQNPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLG 207
            AS + SS+   P  I  + WA AERA   I+ K+QP   S   R+EV+DY+QRLI  CLG
Sbjct: 21   ASRSLSSSPPLPASIAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLG 80

Query: 208  AEVFPYGSVPLKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQL 387
             EVFPYGSVPLKTYL DGDIDLT    +NVE+ L +D+ ++L+ EE+N ++EF VKD+Q 
Sbjct: 81   CEVFPYGSVPLKTYLLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQF 140

Query: 388  VRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYE 567
            + AEVKLVKC+V+DIV+D+SFNQ+GGL TLCFLEQVDR IGKDHLFKRSIILIK+WCYYE
Sbjct: 141  ITAEVKLVKCLVKDIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYE 200

Query: 568  SRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGP 747
            SRILGAHHGLISTYALE LVLYIFHLFH +LDGPLAVL++FLDYFSKFDW+ Y +SLNGP
Sbjct: 201  SRILGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGP 260

Query: 748  VRLSSLPAVVVEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPL 927
            V  SSLP +V E+PE               CV MFSVP RG + NSR F LKHLNI DPL
Sbjct: 261  VCKSSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPL 320

Query: 928  KDINNLGRSVSKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQR 1107
            ++ NNLGRSV+KGNFYRIRSAF YG+ KL +IL  P + I +EL  FF++T+ RH S   
Sbjct: 321  RENNNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYM 380

Query: 1108 PDVQ 1119
             ++Q
Sbjct: 381  AEIQ 384


>ref|XP_006290592.1| hypothetical protein CARUB_v10016681mg [Capsella rubella]
            gi|482559299|gb|EOA23490.1| hypothetical protein
            CARUB_v10016681mg [Capsella rubella]
          Length = 851

 Score =  468 bits (1203), Expect = e-129
 Identities = 230/348 (66%), Positives = 277/348 (79%), Gaps = 2/348 (0%)
 Frame = +1

Query: 79   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 258
            E W   E A   II ++ P  V+E+RR+ V+ ++Q+++ + LG EV  +GSVPLKTYLPD
Sbjct: 34   EFWMRVEEATREIIEQVHPTHVAEDRRKNVITFVQKILGHKLGCEVHSFGSVPLKTYLPD 93

Query: 259  GDIDLTTFGG--ANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDI 432
            GDIDLT FG      E+ L   + ++LE EE++ S++FVVKDVQL+RAEVKLVKC+VQ+I
Sbjct: 94   GDIDLTAFGRFIPEPEEDLAAKVFNVLEREERSGSADFVVKDVQLIRAEVKLVKCLVQNI 153

Query: 433  VVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 612
            VVD+SFNQIGG+CTLCFLE++DR IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYA
Sbjct: 154  VVDISFNQIGGICTLCFLEKIDRLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 213

Query: 613  LETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPE 792
            LETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE
Sbjct: 214  LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVEAPE 273

Query: 793  XXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNF 972
                           C+ M+SVPSRG + N R F  KHLNI DPLK+ NNLGRSVSKGNF
Sbjct: 274  NGGEDLLLTSEFLKECMEMYSVPSRGFETNPRVFPSKHLNIVDPLKENNNLGRSVSKGNF 333

Query: 973  YRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1116
            YRIRSAF+YGARKL +I+ Q  ++I++EL +FFSN + RHGSGQRPDV
Sbjct: 334  YRIRSAFTYGARKLGQIISQSEENISSELRKFFSNMLHRHGSGQRPDV 381


>ref|XP_006290591.1| hypothetical protein CARUB_v10016681mg [Capsella rubella]
            gi|482559298|gb|EOA23489.1| hypothetical protein
            CARUB_v10016681mg [Capsella rubella]
          Length = 827

 Score =  468 bits (1203), Expect = e-129
 Identities = 230/348 (66%), Positives = 277/348 (79%), Gaps = 2/348 (0%)
 Frame = +1

Query: 79   EHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVPLKTYLPD 258
            E W   E A   II ++ P  V+E+RR+ V+ ++Q+++ + LG EV  +GSVPLKTYLPD
Sbjct: 34   EFWMRVEEATREIIEQVHPTHVAEDRRKNVITFVQKILGHKLGCEVHSFGSVPLKTYLPD 93

Query: 259  GDIDLTTFGG--ANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKCIVQDI 432
            GDIDLT FG      E+ L   + ++LE EE++ S++FVVKDVQL+RAEVKLVKC+VQ+I
Sbjct: 94   GDIDLTAFGRFIPEPEEDLAAKVFNVLEREERSGSADFVVKDVQLIRAEVKLVKCLVQNI 153

Query: 433  VVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 612
            VVD+SFNQIGG+CTLCFLE++DR IGKDHLFKRSIILIKAWCYYESRILGA HGLISTYA
Sbjct: 154  VVDISFNQIGGICTLCFLEKIDRLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 213

Query: 613  LETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVVVEMPE 792
            LETLVLYIFHLFHS+L+GPLAVL+KFLDYFSKFDW+ Y +SLNGPV LSSLP +VVE PE
Sbjct: 214  LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVEAPE 273

Query: 793  XXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSVSKGNF 972
                           C+ M+SVPSRG + N R F  KHLNI DPLK+ NNLGRSVSKGNF
Sbjct: 274  NGGEDLLLTSEFLKECMEMYSVPSRGFETNPRVFPSKHLNIVDPLKENNNLGRSVSKGNF 333

Query: 973  YRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQRPDV 1116
            YRIRSAF+YGARKL +I+ Q  ++I++EL +FFSN + RHGSGQRPDV
Sbjct: 334  YRIRSAFTYGARKLGQIISQSEENISSELRKFFSNMLHRHGSGQRPDV 381


>ref|XP_006575451.1| PREDICTED: uncharacterized protein LOC100814626 isoform X3 [Glycine
            max]
          Length = 782

 Score =  461 bits (1185), Expect = e-127
 Identities = 231/349 (66%), Positives = 267/349 (76%)
 Frame = +1

Query: 58   NPFEIGTEHWATAERAAHGIIRKIQPNSVSEERRREVVDYIQRLIRNCLGAEVFPYGSVP 237
            +P  +  + WA AER    I+R+I+P   ++ RRREVVDY+QRLIR     EVFPYGSVP
Sbjct: 32   DPSSVAADAWAAAERNTAEILRRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVP 91

Query: 238  LKTYLPDGDIDLTTFGGANVEDTLDNDMKSILEEEEKNASSEFVVKDVQLVRAEVKLVKC 417
            LKTYLPDGDIDLT     N+ED L +D++++L  EE N ++E+ VKDV+ + AEVKLVKC
Sbjct: 92   LKTYLPDGDIDLTALSCENIEDGLVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKC 151

Query: 418  IVQDIVVDVSFNQIGGLCTLCFLEQVDRHIGKDHLFKRSIILIKAWCYYESRILGAHHGL 597
            IVQDIVVD+SFNQ+GGL TLCFLE+VDR + KDHLFKRSIILIKAWCYYESR+LGAHHGL
Sbjct: 152  IVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGL 211

Query: 598  ISTYALETLVLYIFHLFHSTLDGPLAVLFKFLDYFSKFDWETYSVSLNGPVRLSSLPAVV 777
            ISTYALETLVLYIFH FH +LDGPLAVL++FLDYFSKFDW+ Y VSL GPV  +SLP +V
Sbjct: 212  ISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIV 271

Query: 778  VEMPEXXXXXXXXXXXXXXXCVGMFSVPSRGGDKNSRGFQLKHLNIFDPLKDINNLGRSV 957
             E+PE               CV  FSVPSRG D N R F  KHLNI DPLK+ NNLGRSV
Sbjct: 272  AEVPE-NGGNTLLTEEFIRSCVESFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSV 330

Query: 958  SKGNFYRIRSAFSYGARKLARILLQPADSITNELHRFFSNTMARHGSGQ 1104
            +KGNFYRIRSAF YGARKL  IL  P D I  EL RFF+NT+ RHGS Q
Sbjct: 331  NKGNFYRIRSAFKYGARKLGWILRLPEDRIAEELIRFFANTLERHGSTQ 379

