BLASTX nr result

ID: Cocculus22_contig00014094 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00014094
         (1492 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Popu...   405   e-110
ref|XP_002524216.1| zinc finger protein, putative [Ricinus commu...   395   e-107
ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like ...   395   e-107
ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   394   e-107
ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citr...   393   e-106
ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prun...   381   e-103
ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cac...   376   e-101
ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like...   375   e-101
ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   372   e-100
ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   365   4e-98
gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus...   361   4e-97
ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Caps...   361   5e-97
ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like...   356   2e-95
ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|538...   355   3e-95
dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]           354   6e-95
ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like...   352   2e-94
ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutr...   349   2e-93
ref|XP_006846360.1| hypothetical protein AMTR_s00012p00261420 [A...   348   3e-93
ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phas...   347   8e-93
ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arab...   346   1e-92

>ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa]
            gi|566191879|ref|XP_006378690.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|566191881|ref|XP_006378691.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|566191883|ref|XP_006378692.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330240|gb|EEF02438.2| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330241|gb|ERP56487.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330242|gb|ERP56488.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330243|gb|ERP56489.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
          Length = 493

 Score =  405 bits (1042), Expect = e-110
 Identities = 217/405 (53%), Positives = 283/405 (69%), Gaps = 6/405 (1%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D   R  VI E + VV  VESLRG+ VEPFGSF+S L++RWGDLDIS+ L  GS++ SA
Sbjct: 23   EDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGDLDISIVLSNGSYISSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             K++KQN+L D+ KAL+ RG  +R++FIPNARVP+L FE+A  SISCD+SI+N  GL+KS
Sbjct: 83   GKRRKQNLLEDVLKALRQRGGWQRLQFIPNARVPILKFENA--SISCDVSIDNMQGLMKS 140

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFLFWI+EID+RF DMVLLVKEWAK+ NINNPK G+LNSYSL ++VIFHFQTC PAILPP
Sbjct: 141  KFLFWINEIDRRFRDMVLLVKEWAKTHNINNPKTGSLNSYSLSLLVIFHFQTCVPAILPP 200

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+E YP N+++DLTGV   AER I +ICAANI ++R+   R  NR SLSELFISF  KF 
Sbjct: 201  LKEIYPRNVIDDLTGVRTDAERRIGEICAANISRYRSNKSRAINRNSLSELFISFLTKFY 260

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I+  A E  ICP+T KWE I SN RW  + Y L IEDPFEQPENTAR+V   +L++ISE
Sbjct: 261  DISSKATELGICPFTGKWEEIRSNTRWLPRTYALFIEDPFEQPENTARAVSAANLMKISE 320

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSHTVR- 1076
            A + TH +L + +Q+Q S +  LVR ++  +  A T  + S++  G         H VR 
Sbjct: 321  AIQTTHHRLVTANQNQISFLGMLVRPRI-SRIIAGTPASNSSYTAG---------HQVRI 370

Query: 1077 PMGNP----VHQMRPVTSLNGVSQRQQFSSKVRPTRSDFRSMQLE 1199
            P+G P    VH++R     +     Q  +++ + +RS +   Q +
Sbjct: 371  PVGTPSYTSVHRVRTPVGTSSYMAGQHITTRSQTSRSVYSPSQAQ 415


>ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
            gi|223536493|gb|EEF38140.1| zinc finger protein, putative
            [Ricinus communis]
          Length = 493

 Score =  395 bits (1015), Expect = e-107
 Identities = 194/328 (59%), Positives = 249/328 (75%), Gaps = 1/328 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D A R  +I E K V+  +ESLRGA VEPFGSF+S L++RWGDLDIS+ L  GS++ SA
Sbjct: 23   EDWAVRSKIIEELKDVIASIESLRGATVEPFGSFVSNLFTRWGDLDISIMLANGSYISSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK+KQNVLR+  KAL+ +G  RR++F+PNARVPLL FES  ++ISCD+SI+N  G IKS
Sbjct: 83   AKKRKQNVLREFHKALRQKGGWRRLQFVPNARVPLLKFESGRQNISCDVSIDNLQGQIKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
             FLFW+++ID RF DMVLLVKEWAK+ NINNPK GTLNSYSL ++VIFHFQTC PAILPP
Sbjct: 143  NFLFWLNQIDGRFRDMVLLVKEWAKAHNINNPKTGTLNSYSLSLLVIFHFQTCVPAILPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+E YP N+V+DLTGV  +AE  I++ C ANI ++ +  YR  NR SLSELFISFF KFS
Sbjct: 203  LKEIYPRNVVDDLTGVRTVAEERIKETCNANIARYMSDKYRAVNRSSLSELFISFFAKFS 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I++ A +  IC +T +W  I S  RW  K Y L IEDPFEQPEN AR+V   +L++I+E
Sbjct: 263  GISLKAADLGICTFTGQWLDIRSTMRWLPKTYALFIEDPFEQPENAARAVSAGNLVKIAE 322

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQL 983
            AF+ T+ KL   +Q++ S++ TLVR ++
Sbjct: 323  AFQTTYHKLVLANQNRTSLLGTLVRPEI 350


>ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
            gi|296086183|emb|CBI31624.3| unnamed protein product
            [Vitis vinifera]
          Length = 453

 Score =  395 bits (1014), Expect = e-107
 Identities = 206/384 (53%), Positives = 261/384 (67%), Gaps = 5/384 (1%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D A R  +I +F++ VD VESLRGA VEPFGSF+S LY++WGDLDIS+ELP G+++ SA
Sbjct: 23   EDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNLYTQWGDLDISIELPNGAYISSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             K+ KQ +L  +  AL+++G  R+++FIPNARVP++ FES H +ISCD+SINN  G +KS
Sbjct: 83   GKRHKQTLLGHVLNALRSKGGWRKLQFIPNARVPIIKFESYHPNISCDVSINNLKGQMKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFLFWIS ID RF D+VLLVKEWA++ +INN K GTLNSYSL ++V+FH QTC PAILPP
Sbjct: 143  KFLFWISGIDGRFRDLVLLVKEWARAHDINNSKTGTLNSYSLSLLVVFHLQTCRPAILPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+E YPGN+ +DL GV  + E  IE+  AANI +F+    R  NR SLSELFISF  KF 
Sbjct: 203  LKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRDRSRAPNRSSLSELFISFLAKFV 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I   A E  ICPYT +W  I SN RW  + Y L +EDPFEQPENTAR VR   L RISE
Sbjct: 263  DITSRASEQGICPYTGQWVDIDSNMRWMPRTYELFVEDPFEQPENTARGVRSRQLQRISE 322

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHN----AQLSH 1067
            AF+ TH +L S +QDQ S+I TLVR Q+  QF  +     S+  G            +++
Sbjct: 323  AFQTTHQRLTSANQDQHSLIDTLVRPQI-AQFIRRAPSRNSSAYGRNNSRTYPSVPNVAN 381

Query: 1068 TVRPMGNPVHQMRPVTSLNGVSQR 1139
            +     N     RP +  N  SQR
Sbjct: 382  SPLQFQNDFQNRRPQSRPNTTSQR 405


>ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X1 [Citrus
            sinensis] gi|568866114|ref|XP_006486409.1| PREDICTED:
            poly(A) RNA polymerase GLD2-like isoform X2 [Citrus
            sinensis]
          Length = 445

 Score =  394 bits (1012), Expect = e-107
 Identities = 208/403 (51%), Positives = 276/403 (68%), Gaps = 24/403 (5%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D  TR+ VI++ + VV+ VESLRGA VEPFGSF+S L+SRWGDLDIS+EL  GS + S 
Sbjct: 23   EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISST 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK KQ++L D+ +AL+ +G  RR++F+ +ARVP+L FE+ H++ISCDISI+N  G IKS
Sbjct: 83   GKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFLFWIS+ID RF DMVLLVKEWAK+ +INNPK GT NSYSL ++V+FHFQTC PAILPP
Sbjct: 143  KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YPGN+V+DL GV    ER I +ICA NI +F +  YR  NR SL+ LF+SF +KFS
Sbjct: 203  LKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             +++ A E  ICP+T +WE I SN RW    +PL IEDPFEQPEN+AR+V + +L +IS 
Sbjct: 263  GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKT--------QGNQSAHAGGTRWHNA 1055
            AFE TH +L S +Q + +++++L R  +  QFF ++         G++ A     +  N+
Sbjct: 323  AFEMTHFRLTSTNQTRYALLSSLARPYIL-QFFGESPVRYANYNNGHRRARPQSHKSVNS 381

Query: 1056 QL-----SHTVRPMGNP----------VHQMRPVTSLNGVSQR 1139
             L     SH  R    P           HQ +PV   NG  QR
Sbjct: 382  PLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNGQVQR 424


>ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citrus clementina]
            gi|557537816|gb|ESR48860.1| hypothetical protein
            CICLE_v10031537mg [Citrus clementina]
          Length = 445

 Score =  393 bits (1009), Expect = e-106
 Identities = 207/403 (51%), Positives = 276/403 (68%), Gaps = 24/403 (5%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D  TR+ VI++ + VV+ VESLRGA VEPFGSF+S L+SRWGDLDIS+EL  GS + S 
Sbjct: 23   EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISST 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK KQ++L D+ +AL+ +G  RR++F+ +ARVP+L FE+ H++ISCDISI+N  G IKS
Sbjct: 83   GKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFLFWIS+ID RF DMVLLVKEWAK+ +INNPK GT NSYSL ++V+FHFQTC PAILPP
Sbjct: 143  KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YPGN+V+DL GV    ER I +ICA NI +F +  YR  NR SL+ LF+SF +KFS
Sbjct: 203  LKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             +++ + E  ICP+T +WE I SN RW    +PL IEDPFEQPEN+AR+V + +L +IS 
Sbjct: 263  GLSLKSSELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKT--------QGNQSAHAGGTRWHNA 1055
            AFE TH +L S +Q + +++++L R  +  QFF ++         G++ A     +  N+
Sbjct: 323  AFEMTHFRLTSTNQTRYALLSSLARPYIL-QFFGESPVRYANYNNGHRRARPQSHKSVNS 381

Query: 1056 QL-----SHTVRPMGNP----------VHQMRPVTSLNGVSQR 1139
             L     SH  R    P           HQ +PV   NG  QR
Sbjct: 382  PLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNGQVQR 424


>ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica]
            gi|462416918|gb|EMJ21655.1| hypothetical protein
            PRUPE_ppa005171mg [Prunus persica]
          Length = 474

 Score =  381 bits (979), Expect = e-103
 Identities = 188/327 (57%), Positives = 243/327 (74%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D  TR+ +I+E +  V+ VESLRGA VEPFGSF+S L++RWGDLD+S+E   GSFV   
Sbjct: 23   EDWTTRLQIIDELRGAVESVESLRGATVEPFGSFVSDLFTRWGDLDVSIEFSNGSFVSPY 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KKQKQ +L D+ +A++ +G  RR + IPNARVP+L  ES  +++SCDISI+N    +KS
Sbjct: 83   GKKQKQRLLGDVMRAMRQKGGWRRYQLIPNARVPILKVESNLQNVSCDISIDNLKCQMKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            + LFWISEID RF DMVLL+KEWAK+ NINNPK GT NSYSL ++V+FHFQTC PAI PP
Sbjct: 143  RLLFWISEIDTRFRDMVLLIKEWAKAHNINNPKFGTFNSYSLTLLVVFHFQTCAPAIFPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YPGN+++DL G+    ER IE+ CAANIR+F++ + R  NR SLSELFISF  KFS
Sbjct: 203  LKDIYPGNLIDDLKGLRADTERRIEETCAANIRRFQSYNLRAENRSSLSELFISFLGKFS 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I++ A E  IC YT +W+ I SN RW  + Y L IEDPFEQPEN+AR+V + +L RISE
Sbjct: 263  DISLKASELGICTYTGQWQAIKSNMRWLPQTYALFIEDPFEQPENSARAVSKRELTRISE 322

Query: 903  AFEETHSKLSRSQDQRSMIATLVRVQL 983
             FE +H  L  S +  S++ATLVR Q+
Sbjct: 323  TFEMSHHMLI-SPNHSSLLATLVRPQM 348


>ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cacao]
            gi|508726193|gb|EOY18090.1| Zinc finger protein, putative
            [Theobroma cacao]
          Length = 482

 Score =  376 bits (965), Expect = e-101
 Identities = 203/390 (52%), Positives = 265/390 (67%), Gaps = 4/390 (1%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D  TR  +I+E + VV  +ESLRGA VEPFGS +S L++RWGDLDIS+ELP GS+V SA
Sbjct: 23   EDWVTRQKIIDELREVVQSMESLRGATVEPFGSLVSNLFTRWGDLDISIELPYGSYVSSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK+KQ +L ++++AL+ +   +R++FIP+ARVP+L  ES  ++ISCDISI+N  G IKS
Sbjct: 83   GKKRKQTLLGELQRALKQKDGWQRLQFIPHARVPILKIESRWQNISCDISIDNLQGQIKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFLFW++EID RF +MVLLVKEWA +  INNPK GT NSYSL ++VIFHFQTC PAI PP
Sbjct: 143  KFLFWLNEIDGRFREMVLLVKEWASANGINNPKAGTFNSYSLTLLVIFHFQTCAPAIFPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YP N+V DLTGV   AER I  +C++NI +F  RS R  NR SLSELFISF  KFS
Sbjct: 203  LKDIYPRNVVTDLTGVRADAERRIAQVCSSNIARF--RSGRTVNRSSLSELFISFIAKFS 260

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             IN  A +  IC +T +WE I SN RW  + Y + +EDPFEQPEN +R+V Q  LI+I+E
Sbjct: 261  DINSKASDMGICTFTGQWEYITSNMRWLPRTYAIFVEDPFEQPENASRAVSQKQLIKIAE 320

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNA--QLSHTV 1073
            AFE T   L S +  Q +++ TLV  +   +F  K Q   S+   G  + N   Q+   V
Sbjct: 321  AFETTRCMLISANLTQSTLLPTLVGPK-TSRFIVKQQSVSSSSYNGGHYPNTRPQVHRAV 379

Query: 1074 R-PMGNPVHQMRPVTSLNGVSQRQQFSSKV 1160
              P+    HQ R   S    SQ QQ  +++
Sbjct: 380  HSPLLMQQHQYR--NSRPAASQMQQHQAQM 407


>ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Solanum
            tuberosum] gi|565343469|ref|XP_006338857.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X2 [Solanum
            tuberosum] gi|565343471|ref|XP_006338858.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X3 [Solanum
            tuberosum] gi|565343473|ref|XP_006338859.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X4 [Solanum
            tuberosum]
          Length = 453

 Score =  375 bits (962), Expect = e-101
 Identities = 196/400 (49%), Positives = 267/400 (66%), Gaps = 1/400 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D + R  +I+E ++VV+ +E LRGA VEPFGSF+S L++RWGDLDIS+ELP GS + +A
Sbjct: 23   EDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNLFTRWGDLDISIELPNGSHISAA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK K ++L D+ KAL+ +G  R+++FI NARVP+L F+  + +ISCDISINN  G +KS
Sbjct: 83   GKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKFQGNY-NISCDISINNLSGQMKS 141

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            K L+WI+ ID RF DMVLLVKEWAK+ NIN+ K GTLNSYSL ++V+FHFQTC PAILPP
Sbjct: 142  KILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNSYSLSLLVVFHFQTCVPAILPP 201

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+E YPG++V+DLTGV   AE+ IE+ CA NI +  +   R  NR  LSELFISF  KF 
Sbjct: 202  LKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNKSRAINRSYLSELFISFIAKFC 261

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I+  A    I P+T +WE I+SN RW  K Y + +EDPFEQP N+AR V    L RI E
Sbjct: 262  DISSRASAQGISPFTGQWEDIVSNMRWLPKTYTIFVEDPFEQPLNSARGVSTKQLTRIEE 321

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSHTVRP 1079
            AF  TH  L S +Q++  +I+TLV+  +  +F A+T GNQ+ ++        Q    ++P
Sbjct: 322  AFRSTHFMLCSSNQNENEIISTLVKPHV-SKFVARTSGNQNNYSRNGLRPQLQAQRAIKP 380

Query: 1080 MGNPVHQMRPVTSLNGVSQRQQFSSKVRPTRSDFRSMQLE 1199
                 HQ +   +++   + Q      RP     ++ QL+
Sbjct: 381  PFQAHHQRQAQRAIHPPLRAQHQPQAQRPINPPLQAHQLQ 420


>ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
            gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA
            polymerase GLD2-like [Cucumis sativus]
          Length = 464

 Score =  372 bits (955), Expect = e-100
 Identities = 195/370 (52%), Positives = 252/370 (68%), Gaps = 1/370 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            DD   R  VINE ++VV  +ESLRGA +EPFGSF+S L+SRWGDLD+SV+L  GS+  +A
Sbjct: 22   DDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRWGDLDLSVQLNNGSYTSTA 81

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK+KQ +LRDI+ A +  G   +++ IP+ARVP+L  E    +ISCDISI+N +G IKS
Sbjct: 82   GKKRKQTLLRDIQNASRKNGRWYKLQLIPHARVPILKIEHIQHNISCDISIDNLVGQIKS 141

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            K L W++EID RFHDMVLLVKEWAK+ +INN K GT NSYSL ++VIFHFQTC PAI PP
Sbjct: 142  KILLWVNEIDGRFHDMVLLVKEWAKAHDINNSKQGTFNSYSLSLLVIFHFQTCSPAIFPP 201

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YPGN+V++L GV    E  I   CA NI +F++R+   ANR SLSELF+SF  KFS
Sbjct: 202  LRDIYPGNVVDNLKGVRAEVENEIARTCATNIARFKSRT---ANRSSLSELFVSFLAKFS 258

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I+  A E  ICPYT +W  I SN RW  K Y + +EDPFEQPENTAR++    L+RISE
Sbjct: 259  DISSKASELGICPYTGQWLKIESNMRWLPKTYAIFVEDPFEQPENTARAINARQLMRISE 318

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSHTVRP 1079
            AF  TH +L S  Q++ S++  L R Q+  Q    + G+ SA A      N +    +RP
Sbjct: 319  AFRMTHLRLTSVYQNRSSILNDLARPQI-SQLIINSSGSASAPA-----FNVENYTPIRP 372

Query: 1080 MGNPVHQMRP 1109
              +    M+P
Sbjct: 373  QVHQARVMQP 382


>ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Solanum lycopersicum]
          Length = 453

 Score =  365 bits (936), Expect = 4e-98
 Identities = 192/400 (48%), Positives = 263/400 (65%), Gaps = 1/400 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D + R  +I+E ++VV+ +E LRGA VEPFGSF+S L++RWGD+DIS+ELP G  + +A
Sbjct: 23   EDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNLFTRWGDVDISIELPNGLHISAA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KK K ++L D+ KAL+ +G  R+++FI NARVP+L F+  + +ISCDISINN  G +KS
Sbjct: 83   GKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKFQG-NNNISCDISINNLSGQMKS 141

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            K L+WI+ ID RF DMVLLVKEWAK+ NIN+ K GTLNSYSL ++V+FH QTC PAILPP
Sbjct: 142  KILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNSYSLSLLVVFHLQTCVPAILPP 201

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+E YPG++V+DLTGV   AE+ IE+ CA NI +  +   R  NR SLSELFISF  KF 
Sbjct: 202  LKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNKSRVINRSSLSELFISFIAKFC 261

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I+  A    I P+T +WE I+SN RW  K Y + +EDPFEQP N+AR V    L RI E
Sbjct: 262  NISSRASAQGISPFTGQWEDIVSNMRWLPKTYTIFVEDPFEQPLNSARGVSTKQLTRIEE 321

Query: 903  AFEETHSKL-SRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSHTVRP 1079
            AF  TH  L S + ++  +I+TLV+  +  +F A+  GNQ+ ++        Q    + P
Sbjct: 322  AFRSTHFMLCSSNLNENEVISTLVKPHV-SKFVARISGNQNNYSRNGLRPQLQGQRAIHP 380

Query: 1080 MGNPVHQMRPVTSLNGVSQRQQFSSKVRPTRSDFRSMQLE 1199
                 HQ +   +++   + Q      RP     ++ QL+
Sbjct: 381  PLQAHHQRQAQRAIHPPLRAQHQPQAQRPINPPLQAHQLQ 420


>gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus guttatus]
          Length = 442

 Score =  361 bits (927), Expect = 4e-97
 Identities = 182/323 (56%), Positives = 239/323 (73%), Gaps = 1/323 (0%)
 Frame = +3

Query: 3   DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
           DD + R  +INE +++V  +E+LRGA VEPFGSF S L+++WGDLDIS+EL  G+++ S 
Sbjct: 27  DDWSFRFQMINEIRAIVGSIENLRGATVEPFGSFASNLFTKWGDLDISIELQNGTYISSP 86

Query: 183 RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
            KK KQ+VL+++ KA + +G  R++KFI NARVP+L FE ++ +ISCDISINN  G +KS
Sbjct: 87  GKKHKQSVLQEVLKAFRKKGGFRKLKFIANARVPILKFEGSY-NISCDISINNLSGQMKS 145

Query: 363 KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
           K LFWI+EID RF D+V+LVKEWAK+ +IN+ K GTLNSYSL ++VIFH QT  PAILPP
Sbjct: 146 KILFWINEIDGRFRDLVMLVKEWAKAHHINDSKSGTLNSYSLSLLVIFHLQTLVPAILPP 205

Query: 543 LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
           L+E YPGN+++DLTGV  +AE+NIEDICAANI + R+   R  NR +LS LFISF  KF+
Sbjct: 206 LREIYPGNMIDDLTGVRTVAEKNIEDICAANIHRIRSDRSRLINRSTLSALFISFLTKFA 265

Query: 723 RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
            I   A    ICPY+ + E I +N RW  + Y L +EDPFEQP NTAR+V  N LIRIS+
Sbjct: 266 DICSRASTQGICPYSGQLEDIHTNMRWLPRTYALFVEDPFEQPANTARTVSSNQLIRISQ 325

Query: 903 AFEETHSKL-SRSQDQRSMIATL 968
           A + TH  L + +QD+  +I  L
Sbjct: 326 AIQATHGILVAANQDRTCLIPVL 348


>ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Capsella rubella]
            gi|482564824|gb|EOA29014.1| hypothetical protein
            CARUB_v10025267mg [Capsella rubella]
          Length = 499

 Score =  361 bits (926), Expect = 5e-97
 Identities = 186/366 (50%), Positives = 248/366 (67%), Gaps = 1/366 (0%)
 Frame = +3

Query: 6    DVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSAR 185
            D  TR+ VI + +SVV  VE LRGA V+PFGSF+S L++RWGDLDISV+L  GS +    
Sbjct: 23   DCDTRIGVIEQLRSVVQSVECLRGATVQPFGSFVSNLFTRWGDLDISVDLFSGSSILFTG 82

Query: 186  KKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKSK 365
            KKQKQ +L  + +AL+  G+  +++F+ +ARVP+L  ES H+ ISCDISI+N  GL+KS+
Sbjct: 83   KKQKQKLLGHLLRALRANGLWYKLQFVIHARVPILKVESGHQRISCDISIDNLEGLLKSR 142

Query: 366  FLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPPL 545
            FL WISEID RF D+VLLVKEWAK+ NIN+ K GT NSYSL ++VIFH QTC PAILPPL
Sbjct: 143  FLLWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYSLSLLVIFHLQTCVPAILPPL 202

Query: 546  QEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFSR 725
            +  YP +  +DLTGV   AE +I  I AANI +F+  + ++ NR SLSEL +SFF KFS 
Sbjct: 203  RVIYPKSAADDLTGVRKTAEESIAQITAANIARFKLDTAKSPNRSSLSELLVSFFAKFSD 262

Query: 726  INIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISEA 905
            IN+ A E  +CP+T +WE I SN RW  K Y L +EDPFEQP+N ARSV + +L RI++ 
Sbjct: 263  INVKAQELGVCPFTGRWENISSNSRWLPKTYSLFVEDPFEQPQNAARSVSRRNLDRIAQV 322

Query: 906  FEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSH-TVRPM 1082
            F+ T  +L+   ++ S+I  +   Q++   +     +   HA GT  HN +  H   RP 
Sbjct: 323  FQMTSRRLASDCNRNSIIGVMTGQQIQQSLYRTISLHNQHHANGT--HNVRNLHGQSRPW 380

Query: 1083 GNPVHQ 1100
               + Q
Sbjct: 381  NQQLQQ 386


>ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max]
            gi|571489968|ref|XP_006591355.1| PREDICTED: poly(A) RNA
            polymerase cid11-like isoform X2 [Glycine max]
          Length = 455

 Score =  356 bits (913), Expect = 2e-95
 Identities = 191/406 (47%), Positives = 259/406 (63%), Gaps = 7/406 (1%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D   R  +IN+F+S+V+ VESLRGA VEP+GSF+S L++RWGDLDIS+EL  G  + SA
Sbjct: 23   EDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNLFTRWGDLDISIELSNGLHISSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KKQKQ +L ++ KAL+ +G    ++FI NARVP+L F+S  + +SCDISINN  G +KS
Sbjct: 83   GKKQKQTLLGEVLKALRMKGGGSNLQFISNARVPILKFKSYRQGVSCDISINNLPGQMKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            K L WI++ID RF  MVLLVKEWAK+  INN K GT NSYSL ++VIF+FQTC PAI PP
Sbjct: 143  KILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFNSYSLSLLVIFYFQTCIPAIFPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ YPGN+++DL G+   AE  I + C ANI +F +   R+ NR S++ELF+ F  KF+
Sbjct: 203  LKDIYPGNMIDDLIGIRSDAENLIAETCDANINRFISNRARSINRKSVAELFVDFVGKFA 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
            +++ MA+E  ICPYT KWE I  N  W  K Y + +EDPFEQP+NTARSV    L +I+E
Sbjct: 263  KMDSMAVEMGICPYTGKWEQIEDNMIWLPKTYAIFVEDPFEQPQNTARSVSAGQLKKITE 322

Query: 903  AFEETHSKL-SRSQDQRSMIATL-----VRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLS 1064
             F  TH  L S +Q+Q S+++ L     +R   RP         Q     G  + +    
Sbjct: 323  TFARTHDLLTSTNQNQISLLSNLAPAHVIRCITRPYGGGYIHPTQPQVQRGVAYFHPTQP 382

Query: 1065 HTVRPMGNPVHQMRPVTSLNGVSQRQ-QFSSKVRPTRSDFRSMQLE 1199
               RP    V   R +  L   SQ   Q +S+   + S F + Q++
Sbjct: 383  QVFRPTQPQV--QRAIRPLQPQSQHHFQNASQGTSSNSSFSTGQIQ 426


>ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|53850481|gb|AAU95417.1|
            At2g39740 [Arabidopsis thaliana]
            gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis
            thaliana] gi|330254623|gb|AEC09717.1| HEN1 suppressor 1
            [Arabidopsis thaliana]
          Length = 511

 Score =  355 bits (911), Expect = 3e-95
 Identities = 183/366 (50%), Positives = 244/366 (66%), Gaps = 1/366 (0%)
 Frame = +3

Query: 6    DVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSAR 185
            D  TR+ VI++ + V+  VE LRGA V+PFGSF+S L++RWGDLDISV+L  GS +    
Sbjct: 24   DRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDISVDLFSGSSILFTG 83

Query: 186  KKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKSK 365
            KKQKQ +L  + +AL+  G+  +++F+ +ARVP+L   S H+ ISCDISI+N  GL+KS+
Sbjct: 84   KKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRISCDISIDNLDGLLKSR 143

Query: 366  FLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPPL 545
            FLFWISEID RF D+VLLVKEWAK+ NIN+ K GT NSYSL ++VIFHFQTC PAILPPL
Sbjct: 144  FLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSLLVIFHFQTCVPAILPPL 203

Query: 546  QEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFSR 725
            +  YP + V+DLTGV   AE +I  + AANI +F++   ++ NR SLSEL +SFF KFS 
Sbjct: 204  RVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVNRSSLSELLVSFFAKFSD 263

Query: 726  INIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISEA 905
            IN+ A E  +CP+T +WE I SN  W  K Y L +EDPFEQP N ARSV + +L RI++ 
Sbjct: 264  INVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPVNAARSVSRRNLDRIAQV 323

Query: 906  FEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSH-TVRPM 1082
            F+ T  +L    ++ S+I  L    ++   +         HA G   HN +  H   RP 
Sbjct: 324  FQITSRRLVSECNRNSIIGILTGQHIQESLYRTISLPSQHHANG--MHNVRNLHGQARPQ 381

Query: 1083 GNPVHQ 1100
               + Q
Sbjct: 382  NQQMQQ 387


>dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]
          Length = 511

 Score =  354 bits (908), Expect = 6e-95
 Identities = 183/366 (50%), Positives = 244/366 (66%), Gaps = 1/366 (0%)
 Frame = +3

Query: 6    DVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSAR 185
            D  TR+ VI++ + V+  VE LRGA V+PFGSF+S L++RWGDLDISV+L  GS +    
Sbjct: 24   DRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDISVDLFSGSSILFTG 83

Query: 186  KKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKSK 365
            KKQKQ +L  + +AL+  G+  +++F+ +ARVP+L   S H+ ISCDISI+N  GL+KS+
Sbjct: 84   KKQKQILLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRISCDISIDNLDGLLKSR 143

Query: 366  FLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPPL 545
            FLFWISEID RF D+VLLVKEWAK+ NIN+ K GT NSYSL ++VIFHFQTC PAILPPL
Sbjct: 144  FLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSLLVIFHFQTCVPAILPPL 203

Query: 546  QEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFSR 725
            +  YP + V+DLTGV   AE +I  + AANI +F++   ++ NR SLSEL +SFF KFS 
Sbjct: 204  RVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVNRSSLSELLVSFFAKFSD 263

Query: 726  INIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISEA 905
            IN+ A E  +CP+T +WE I SN  W  K Y L +EDPFEQP N ARSV + +L RI++ 
Sbjct: 264  INVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPVNAARSVSRRNLDRIAQV 323

Query: 906  FEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSH-TVRPM 1082
            F+ T  +L    ++ S+I  L    ++   +         HA G   HN +  H   RP 
Sbjct: 324  FQITSRRLVSECNRNSIIGILTGQHIQESLYRTISLPSQHHANG--MHNVRNLHGQARPQ 381

Query: 1083 GNPVHQ 1100
               + Q
Sbjct: 382  NQQMQQ 387


>ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine
           max] gi|571542766|ref|XP_006601983.1| PREDICTED: poly(A)
           RNA polymerase cid11-like isoform X2 [Glycine max]
           gi|571542770|ref|XP_006601984.1| PREDICTED: poly(A) RNA
           polymerase cid11-like isoform X3 [Glycine max]
           gi|571542774|ref|XP_006601985.1| PREDICTED: poly(A) RNA
           polymerase cid11-like isoform X4 [Glycine max]
          Length = 415

 Score =  352 bits (903), Expect = 2e-94
 Identities = 175/323 (54%), Positives = 230/323 (71%), Gaps = 1/323 (0%)
 Frame = +3

Query: 3   DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
           +D   R  +IN+ +S+V+ VESLRGA VEPFGSF+S L++RWGDLDIS+EL  G  + SA
Sbjct: 23  EDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNLFTRWGDLDISIELSNGLHISSA 82

Query: 183 RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
            KKQKQ  L D+ KAL+ +G    ++FI NARVP+L F+S  + +SCDISINN  G +KS
Sbjct: 83  GKKQKQTFLGDVLKALRMKGGGSNLQFISNARVPILKFKSYRQGVSCDISINNLPGQMKS 142

Query: 363 KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
           K L WI++ID RF  MVLLVKEWAK+  INN K GT NSYSL ++VIF+FQTC PAI PP
Sbjct: 143 KILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFNSYSLSLLVIFYFQTCIPAIFPP 202

Query: 543 LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
           L++ YPGN+V+DL GV   AE  I   C ANI +F +   R+ NR S++ELF+ F  KF+
Sbjct: 203 LKDIYPGNMVDDLIGVRSDAENLIAQTCDANINRFISNRARSINRKSVAELFVEFIGKFA 262

Query: 723 RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
           +++ MA++  ICPY+ KWE I  N  W  K Y + +EDPFEQP+NTARSV    L +I+E
Sbjct: 263 KMDSMAVKMGICPYSGKWEQIEDNMIWLPKTYAIFVEDPFEQPQNTARSVSAGQLKKITE 322

Query: 903 AFEETHSKL-SRSQDQRSMIATL 968
           AF  TH  L S +Q+Q S+++ +
Sbjct: 323 AFARTHDLLTSTNQNQISLLSNM 345


>ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum]
            gi|567214704|ref|XP_006411209.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
            gi|557112377|gb|ESQ52661.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
            gi|557112378|gb|ESQ52662.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
          Length = 493

 Score =  349 bits (895), Expect = 2e-93
 Identities = 183/365 (50%), Positives = 242/365 (66%)
 Frame = +3

Query: 6    DVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSAR 185
            D   R+ VI++ +S +  VESLRGA V+PFGSF+S L++RWGDLDISV+L  GS +    
Sbjct: 24   DWDARMTVIDQLRSALQSVESLRGATVQPFGSFVSNLFTRWGDLDISVDLFSGSSILFTG 83

Query: 186  KKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKSK 365
            KKQKQ  L  + +AL+  G   R++F+ +ARVP+L   S H+ ISCDISI+N  GL+KS+
Sbjct: 84   KKQKQTFLGQLLRALRASGAWYRLQFVAHARVPILKVVSGHQRISCDISIDNLEGLLKSR 143

Query: 366  FLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPPL 545
            FLFWISEID RF D+VLLVKEWAK+ +INNPK GT NSYSL ++VIFH QTC PAILPPL
Sbjct: 144  FLFWISEIDWRFRDLVLLVKEWAKAHDINNPKNGTFNSYSLSLLVIFHLQTCVPAILPPL 203

Query: 546  QEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFSR 725
             + YP + V+DL   + IA+     + AANI +FR+ + R  NR SLSEL +SFF KFS 
Sbjct: 204  GDIYPRSAVDDLKVAACIAQ-----LSAANIARFRSGTSRAVNRSSLSELLVSFFAKFSD 258

Query: 726  INIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISEA 905
            IN+ A E  +CP+T +WE I SN RW  K Y L +EDPFEQPEN ARSV +  L RI++ 
Sbjct: 259  INVKAKELGVCPFTGRWENISSNTRWLPKTYSLFVEDPFEQPENAARSVSRKSLDRIAQV 318

Query: 906  FEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSHTVRPMG 1085
            FE T  +L+   ++ S++  L    +     ++T  +   HA G   +   L    RP  
Sbjct: 319  FEMTSRRLATDCNRNSIVGVLTSPHISQPLCSRTSLHNHHHANGVN-NGHNLHGQSRPWN 377

Query: 1086 NPVHQ 1100
            + + Q
Sbjct: 378  HQMQQ 382


>ref|XP_006846360.1| hypothetical protein AMTR_s00012p00261420 [Amborella trichopoda]
            gi|548849130|gb|ERN08035.1| hypothetical protein
            AMTR_s00012p00261420 [Amborella trichopoda]
          Length = 520

 Score =  348 bits (894), Expect = 3e-93
 Identities = 176/332 (53%), Positives = 232/332 (69%), Gaps = 2/332 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            DD   R  VI++  + +  ++SL+G+ V+PFGS++SKLYSRWGDLDIS+EL     V   
Sbjct: 30   DDQIRRADVISDIATSLTCLQSLKGSSVQPFGSYVSKLYSRWGDLDISIELA----VSDV 85

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             K +K NVL+ +R  LQ  GVA  ++FIP ARVPLL FES    ISCD+S+ N  GL+KS
Sbjct: 86   SKSKKLNVLKQLRDVLQRTGVAHYIQFIPQARVPLLIFESNRHHISCDVSVGNCEGLLKS 145

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            KFL WIS ID RFHD+VLLVKEWAK+  IN+PK G+LNSY+LC+MVIFH QTC P+ILPP
Sbjct: 146  KFLLWISHIDGRFHDIVLLVKEWAKAHKINDPKNGSLNSYALCLMVIFHLQTCSPSILPP 205

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L++ Y GN+V DL GV    +R+IE  C   I + RA+ +  AN+ SL+ELF+SFF+KF+
Sbjct: 206  LRDIYGGNMVEDLKGVGSAYKRDIEHSCNEKIDRLRAQGFNQANKSSLAELFVSFFEKFT 265

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
             I   + + AIC +T +WE + S   W  K YPL+IEDPFEQP N ARSVR  +L+RIS+
Sbjct: 266  DIGTRSSQQAICTFTGRWENLYSKRDWTKKSYPLVIEDPFEQPTNCARSVRAFELLRISD 325

Query: 903  AFEETHSKLSRSQDQR--SMIATLVRVQLRPQ 992
            AF  T   L RS  ++  S   +L ++ +RP+
Sbjct: 326  AFNNTRGDL-RSPFRKICSSRTSLAKLLIRPE 356


>ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris]
            gi|561036925|gb|ESW35455.1| hypothetical protein
            PHAVU_001G236100g [Phaseolus vulgaris]
          Length = 509

 Score =  347 bits (890), Expect = 8e-93
 Identities = 174/361 (48%), Positives = 243/361 (67%), Gaps = 2/361 (0%)
 Frame = +3

Query: 3    DDVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSA 182
            +D   R  ++N+ +S+V+ VESLRGA VEPFGSF+S L++RWGDLDIS+EL  G  + SA
Sbjct: 23   EDWQIRFAILNDLRSIVESVESLRGATVEPFGSFVSNLFTRWGDLDISIELSNGLHISSA 82

Query: 183  RKKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKS 362
             KKQKQ +L ++ KAL+ +G    ++FI +ARVP+L F+S  + +SCDISINN  G +KS
Sbjct: 83   GKKQKQTLLGEVLKALRMKGAGSHLQFISSARVPILKFKSNRQGVSCDISINNLPGQMKS 142

Query: 363  KFLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPP 542
            K L WI++ID RFHDMVLLVKEWAK+  INN K GT NSYSL ++VIFHFQTC PAILPP
Sbjct: 143  KILLWINKIDGRFHDMVLLVKEWAKAHKINNSKTGTFNSYSLSLLVIFHFQTCVPAILPP 202

Query: 543  LQEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFS 722
            L+  YPGN+V+DL G+   AE  I + C A I +  + + R+ N+ S+ +LF+ F +K++
Sbjct: 203  LKYIYPGNMVDDLKGIRADAENLIAETCNAGINRHISNTARSINKKSVPDLFVEFLRKYA 262

Query: 723  RINIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISE 902
            +++  A E  ICPYT +WE I +N  W  K Y + +EDPFEQP+NTARSV    L +IS+
Sbjct: 263  QMDSWASELGICPYTGQWEQIENNTIWLPKTYSIFVEDPFEQPQNTARSVNAGQLKKISD 322

Query: 903  AFEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQ--LSHTVR 1076
             F +T++ LS +    + + T+    L P    K+      +  G+ +H  Q  +   +R
Sbjct: 323  TFSKTYAFLSSNHHNLNSLLTM----LAPPHVVKSITTPIRNYDGSYFHPTQPKVQRAMR 378

Query: 1077 P 1079
            P
Sbjct: 379  P 379


>ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
            lyrata] gi|297325653|gb|EFH56073.1| hypothetical protein
            ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  346 bits (888), Expect = 1e-92
 Identities = 176/354 (49%), Positives = 239/354 (67%)
 Frame = +3

Query: 6    DVATRVLVINEFKSVVDFVESLRGARVEPFGSFISKLYSRWGDLDISVELPCGSFVPSAR 185
            D  TR+ VI++ + V+  VE LRGA V+PFGSF+S L++RWGDLD+SV+L  GS +    
Sbjct: 24   DWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGDLDLSVDLFSGSSILFTG 83

Query: 186  KKQKQNVLRDIRKALQTRGVARRVKFIPNARVPLLTFESAHRSISCDISINNQLGLIKSK 365
            KKQKQ +LR + +AL+  G+  +++F+ +ARVP+L   S H+ I+CDISI+N  GL+KS+
Sbjct: 84   KKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRIACDISIDNLDGLLKSR 143

Query: 366  FLFWISEIDKRFHDMVLLVKEWAKSQNINNPKLGTLNSYSLCMMVIFHFQTCEPAILPPL 545
            FLFWISEID RF D+VLLVKEWAK+ NIN+ K GT NSYSL ++VIFH QTC PAILPPL
Sbjct: 144  FLFWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYSLSLLVIFHLQTCVPAILPPL 203

Query: 546  QEFYPGNIVNDLTGVSGIAERNIEDICAANIRKFRARSYRNANRCSLSELFISFFQKFSR 725
            +  YP + V+DLTGV   AE +I  + AANI +F+  + ++ NR SLSEL +SF+ KFS 
Sbjct: 204  RVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLNTAKSVNRSSLSELLVSFYAKFSD 263

Query: 726  INIMALEHAICPYTAKWEPIMSNYRWRAKEYPLLIEDPFEQPENTARSVRQNDLIRISEA 905
            IN+ A E  +CP+T +WE I SN  W  K Y L +EDPFEQP N ARSV + +L RI++ 
Sbjct: 264  INLKAQELGVCPFTGRWENISSNTTWLPKTYSLFVEDPFEQPVNAARSVSRRNLDRIAQV 323

Query: 906  FEETHSKLSRSQDQRSMIATLVRVQLRPQFFAKTQGNQSAHAGGTRWHNAQLSH 1067
            F+ T  +L    ++ S+I  L    ++         +   HA     HN +  H
Sbjct: 324  FQITSRRLVSDCNRNSIIGVLTGQHIQESLHRTISLHSQQHANS--MHNVRNLH 375


Top