BLASTX nr result

ID: Rauwolfia21_contig00017698 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00017698
         (1764 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004240014.1| PREDICTED: uncharacterized protein LOC101254...   317   1e-83
ref|XP_006355616.1| PREDICTED: DNA-directed RNA polymerase III s...   313   1e-82
ref|XP_002263694.1| PREDICTED: uncharacterized protein LOC100260...   271   9e-70
gb|EOY22840.1| DNA binding protein, putative isoform 1 [Theobrom...   269   2e-69
emb|CAN67263.1| hypothetical protein VITISV_022611 [Vitis vinifera]   268   6e-69
gb|EOY22841.1| DNA binding protein, putative isoform 2 [Theobrom...   256   3e-65
ref|XP_002510979.1| DNA binding protein, putative [Ricinus commu...   253   2e-64
ref|XP_002303490.2| hypothetical protein POPTR_0003s10650g [Popu...   228   5e-57
ref|XP_004307797.1| PREDICTED: uncharacterized protein LOC101300...   224   7e-56
ref|XP_004146914.1| PREDICTED: uncharacterized protein LOC101207...   205   4e-50
ref|XP_004167167.1| PREDICTED: uncharacterized protein LOC101227...   202   4e-49
ref|XP_003534408.1| PREDICTED: DNA-directed RNA polymerase III s...   197   2e-47
gb|ACU24406.1| unknown [Glycine max]                                  194   8e-47
gb|ESW03348.1| hypothetical protein PHAVU_011G006800g [Phaseolus...   192   5e-46
ref|XP_006587708.1| PREDICTED: DNA-directed RNA polymerase III s...   184   1e-43
gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Mor...   178   8e-42
ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citr...   172   3e-40
gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, put...   171   1e-39
ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209...   166   4e-38
gb|EMJ21970.1| hypothetical protein PRUPE_ppa024519mg [Prunus pe...   165   7e-38

>ref|XP_004240014.1| PREDICTED: uncharacterized protein LOC101254492 [Solanum
            lycopersicum]
          Length = 373

 Score =  317 bits (811), Expect = 1e-83
 Identities = 200/435 (45%), Positives = 256/435 (58%), Gaps = 6/435 (1%)
 Frame = +1

Query: 184  DPMDLDLGASSTRRKSKFAPKGPPRREAQ-PPKPKSEL--TESDGDDEINEALLRKVNDH 354
            DP DL L +SSTR K KFAPKGPPRR+ Q P +PK+E    E   D+E  EA+LRKVN+ 
Sbjct: 2    DP-DLPLSSSSTR-KVKFAPKGPPRRKKQNPAQPKNEADGNEDRDDNEAAEAVLRKVNER 59

Query: 355  LTRRLHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLS 534
            LTR+  K EKK++V+VAF HGVAS T  +T G   E T  +++     D+ S D   + S
Sbjct: 60   LTRQKPKTEKKAAVEVAFAHGVASPTSTKTSGKSRELT--VNQDSTLKDNESCDNMDIDS 117

Query: 535  LPSTDVAGG--ILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDA 708
            LP+   + G  + E S N  D L K+K+  YKE W                         
Sbjct: 118  LPTLPSSTGPDLAEMSVNNSDSLLKRKKE-YKEPW------------------------- 151

Query: 709  SRPR*HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENT 888
                              DYHHS YP TLPLRRP++GDPE+L+  EFG+AA   EYDEN 
Sbjct: 152  ------------------DYHHSNYPVTLPLRRPYAGDPEILNEAEFGEAAKNAEYDENN 193

Query: 889  INSASTLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKD 1068
            IN AS LGLL  EK D  ++LF QLP ++   K   S  G++ A S  L G        D
Sbjct: 194  INPASELGLL--EKKDDVQLLFLQLPANLPLSKLQASTGGRDTAVSLTLPGDK-----SD 246

Query: 1069 KAIAGSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLT-SSTGVHASEKACSLE 1245
            KA           SSPM +GKEV G      A  KGKEI+DS T S    + + K CSL+
Sbjct: 247  KAAT--------LSSPMLKGKEVAGSAPRFLASAKGKEISDSSTISRRHNNTTNKVCSLQ 298

Query: 1246 KLSAGCVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKR 1425
            +L AG +GKMLVYKSGA+KLK+G+ILYDVSPG +C F+Q+++ +NT EKQCC +GEL KR
Sbjct: 299  ELPAGSMGKMLVYKSGAIKLKLGDILYDVSPGVECSFSQDVVAINTAEKQCCQLGELGKR 358

Query: 1426 AVVTPEIDSLLDSVI 1470
            AVVTP++D LL++++
Sbjct: 359  AVVTPDVDFLLNNLM 373


>ref|XP_006355616.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Solanum
            tuberosum]
          Length = 368

 Score =  313 bits (803), Expect = 1e-82
 Identities = 203/435 (46%), Positives = 257/435 (59%), Gaps = 6/435 (1%)
 Frame = +1

Query: 184  DPMDLDLGASSTRRKSKFAPKGPPRREAQ-PPKPKSEL--TESDGDDEINEALLRKVNDH 354
            DP DL L +SSTR K KFAPKGPPRR+ Q P +PK+E    E   D+E  EA+LRKVN+ 
Sbjct: 2    DP-DLPLSSSSTR-KVKFAPKGPPRRKKQNPAQPKNEADGNEDRDDNEAAEAILRKVNER 59

Query: 355  LTRRLHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLS 534
            LTR+  K EKK  V+VAFTHGVAS T  +T+G   E T  +++     D+ S D   + S
Sbjct: 60   LTRQKPKTEKK--VEVAFTHGVASPTSTKTFGKSRELT--VNQDSTLKDNESCDNMDIDS 115

Query: 535  LPSTDVAGG--ILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDA 708
            LP+   + G  + E S N  D L K+K+  YKE W                         
Sbjct: 116  LPTLPSSTGPDLAEMSVNNSDSLLKRKKE-YKEPW------------------------- 149

Query: 709  SRPR*HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENT 888
                              DYHHS YP TLPLRRP++GDPE+L+  EFG+AA    YDEN 
Sbjct: 150  ------------------DYHHSNYPVTLPLRRPYAGDPEILNEAEFGEAAKNAVYDENN 191

Query: 889  INSASTLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKD 1068
            IN AS LGLL  EK D  ++LF QLP ++   K   S  G++ A    L G        D
Sbjct: 192  INPASELGLL--EKKDDVQLLFLQLPANLPLSKLQASTGGRDTAVCLTLPGDK-----SD 244

Query: 1069 KAIAGSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLT-SSTGVHASEKACSLE 1245
            KA           SSPM +GKEV G    A AG KGKEIADS T S    + + KACSL+
Sbjct: 245  KAAT--------LSSPMLKGKEVAGS---ALAGAKGKEIADSPTISRRHNNTTNKACSLQ 293

Query: 1246 KLSAGCVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKR 1425
            +L AG +GKMLVYKSGA+KLK+G+ILYDVSPG +C F+Q+++ +NT EKQCC +GEL KR
Sbjct: 294  ELPAGSMGKMLVYKSGAIKLKLGDILYDVSPGVECSFSQDVVAINTAEKQCCQLGELGKR 353

Query: 1426 AVVTPEIDSLLDSVI 1470
            AVVTP++D LL++++
Sbjct: 354  AVVTPDVDFLLNNLM 368


>ref|XP_002263694.1| PREDICTED: uncharacterized protein LOC100260717 [Vitis vinifera]
            gi|297745083|emb|CBI38675.3| unnamed protein product
            [Vitis vinifera]
          Length = 348

 Score =  271 bits (692), Expect = 9e-70
 Identities = 170/431 (39%), Positives = 230/431 (53%), Gaps = 1/431 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRL 369
            MD +  +S + RK +FAPK PPRR+ +   P+  + E + + +  + LLR+VN+ L R+ 
Sbjct: 1    MDHNESSSVSPRKVRFAPKSPPRRKPKTTAPQPVVAEEEDEAKRAQYLLRRVNEKLRRQG 60

Query: 370  HKAEKKSSVQVAFTHGVAS-STPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
             K EK SSVQV F  G A+ S  IRT+G+  +G  + S G     S  D  ++ +S PST
Sbjct: 61   PKVEKTSSVQVVFGPGAATPSDTIRTFGVHRDGNSDKSSGMELKVSTPDHEEIAVSSPST 120

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
                       +A D   + ++R YKE W                               
Sbjct: 121  TKPDETNGYFADATDDSAQIRKR-YKEPW------------------------------- 148

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY HSYYP TLPLR+P SGDPE+LD  EFG+A++ +EYDE TIN AS 
Sbjct: 149  ------------DYVHSYYPTTLPLRKPHSGDPEILDEAEFGEASTNLEYDEKTINPASE 196

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            LGLL  E+ +   M  FQLP ++   KQ  SA GKE                    +  S
Sbjct: 197  LGLL--EESEKGRMFLFQLPANLPLFKQSPSAKGKE-------------------IVENS 235

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            ++  GI +S                   KGK++A S  SS  +  SE +C LE L+ G +
Sbjct: 236  TSLEGIYASA------------------KGKQVARSSLSSKSIGTSEHSCRLEDLAGGHI 277

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
            GKMLVYKSGA+KLK+GEILYDVSPG DC   Q+++ +NTV+K C A+GEL KR +VTP++
Sbjct: 278  GKMLVYKSGAIKLKLGEILYDVSPGLDCTCVQDVVAINTVDKHCYALGELGKRVIVTPDV 337

Query: 1447 DSLLDSVIHLE 1479
            DSLLDS+I L+
Sbjct: 338  DSLLDSMIALD 348


>gb|EOY22840.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
          Length = 359

 Score =  269 bits (688), Expect = 2e-69
 Identities = 182/430 (42%), Positives = 225/430 (52%), Gaps = 1/430 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPR-REAQPPKPKSELTESDGDDEINEALLRKVNDHLTRR 366
            MD D G SS RRK +FAPK P   R  +    KSE+ + DG+    + LL + N++ TR+
Sbjct: 1    MDQD-GPSSGRRKVRFAPKAPQSSRRLKTTVSKSEVNDEDGEAAQAQYLLGRFNENQTRQ 59

Query: 367  LHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
              K EKKSS Q++F  G  SS  +R YG Q  GT   S          +DGQ++ S PS 
Sbjct: 60   RPKVEKKSSAQISFGPGAPSSNLLRAYGSQRGGTSGKSTDSRQRSPDDNDGQIIGSFPSA 119

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
                     S +A++    K +R Y+E WV    +  C    R  AH             
Sbjct: 120  SKEDRTDICSSDAIEASAPKIKREYREPWVKVCSLFVC---LRPSAHLCAI--------- 167

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
              +L   M   QDYHH+YYP TLPLRRP+SGDPELLD  EF +AA   EYDE TIN AS 
Sbjct: 168  PLILLSSMIALQDYHHTYYPITLPLRRPYSGDPELLDQAEFVEAARK-EYDEKTINPASD 226

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            LGL  LE+G+  +M FFQLP ++   K+  S  GKE A +                  GS
Sbjct: 227  LGL--LEEGEKGKMFFFQLPANLPVIKRLASTKGKEKAEN-----------------LGS 267

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            S   G                    A +KG                   C LE+L  G +
Sbjct: 268  SERFG--------------------ALKKG-------------------CQLEELPGGFM 288

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
            GKMLVYKSGAVKLK+GE LYDVSPGSDCIFAQ++  VNT EK CC +GEL KR VVTP+I
Sbjct: 289  GKMLVYKSGAVKLKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELGKRVVVTPDI 348

Query: 1447 DSLLDSVIHL 1476
             S+L+SVI L
Sbjct: 349  SSVLNSVIDL 358


>emb|CAN67263.1| hypothetical protein VITISV_022611 [Vitis vinifera]
          Length = 348

 Score =  268 bits (685), Expect = 6e-69
 Identities = 170/431 (39%), Positives = 229/431 (53%), Gaps = 1/431 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRL 369
            MD +  +S + RK +FAPK PPRR+ +   P+  + E + + +  + LLR+VN+ L R+ 
Sbjct: 1    MDHNESSSVSPRKVRFAPKSPPRRKPKTTAPQPVVAEEEDEAKRAQYLLRRVNEKLRRQG 60

Query: 370  HKAEKKSSVQVAFTHGVAS-STPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
             K EK SSVQV F  G A+ S  IRT+G+  +G  + S G     S  D  ++ +S  ST
Sbjct: 61   PKVEKTSSVQVVFGPGAATPSDTIRTFGVHRDGNSDKSSGMELKVSTPDHEEIAVSSXST 120

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
                       +A D   + ++R YKE W                               
Sbjct: 121  TKPDETNGXFADATDDSAQIRKR-YKEPW------------------------------- 148

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY HSYYP TLPLR+P SGDPE+LD  EFG+A++ +EYDE TIN AS 
Sbjct: 149  ------------DYVHSYYPTTLPLRKPHSGDPEILDEAEFGEASTNLEYDEKTINPASE 196

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            LGLL  E+ +   M  FQLP ++   KQ  SA GKE  G+                   S
Sbjct: 197  LGLL--EESEKGRMFLFQLPANLPLVKQSASAKGKEIVGN-------------------S 235

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            ++  GI +S                   KGK++A S  SS  +  SE +C LE L+ G  
Sbjct: 236  TSLEGIYASA------------------KGKQVARSSLSSKSIGTSEHSCRLEDLAGGHX 277

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
            GKMLVYKSGA+KLK+GEILYDVSPG DC   Q+++ +NTV+K C A+GEL KR +VTP++
Sbjct: 278  GKMLVYKSGAIKLKLGEILYDVSPGLDCTCVQDVVAINTVDKHCYALGELGKRVIVTPDV 337

Query: 1447 DSLLDSVIHLE 1479
            DSLLDS+I L+
Sbjct: 338  DSLLDSMIALD 348


>gb|EOY22841.1| DNA binding protein, putative isoform 2 [Theobroma cacao]
          Length = 328

 Score =  256 bits (653), Expect = 3e-65
 Identities = 174/430 (40%), Positives = 215/430 (50%), Gaps = 1/430 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPR-REAQPPKPKSELTESDGDDEINEALLRKVNDHLTRR 366
            MD D G SS RRK +FAPK P   R  +    KSE+ + DG+    + LL + N++ TR+
Sbjct: 1    MDQD-GPSSGRRKVRFAPKAPQSSRRLKTTVSKSEVNDEDGEAAQAQYLLGRFNENQTRQ 59

Query: 367  LHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
              K EKKSS Q++F  G  SS  +R YG Q  GT   S          +DGQ++ S PS 
Sbjct: 60   RPKVEKKSSAQISFGPGAPSSNLLRAYGSQRGGTSGKSTDSRQRSPDDNDGQIIGSFPSA 119

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
                     S +A++    K +R Y+E W                               
Sbjct: 120  SKEDRTDICSSDAIEASAPKIKREYREPW------------------------------- 148

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DYHH+YYP TLPLRRP+SGDPELLD  EF +AA   EYDE TIN AS 
Sbjct: 149  ------------DYHHTYYPITLPLRRPYSGDPELLDQAEFVEAARK-EYDEKTINPASD 195

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            LGL  LE+G+  +M FFQLP ++   K+  S  GKE A +                  GS
Sbjct: 196  LGL--LEEGEKGKMFFFQLPANLPVIKRLASTKGKEKAEN-----------------LGS 236

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            S   G                    A +KG                   C LE+L  G +
Sbjct: 237  SERFG--------------------ALKKG-------------------CQLEELPGGFM 257

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
            GKMLVYKSGAVKLK+GE LYDVSPGSDCIFAQ++  VNT EK CC +GEL KR VVTP+I
Sbjct: 258  GKMLVYKSGAVKLKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELGKRVVVTPDI 317

Query: 1447 DSLLDSVIHL 1476
             S+L+SVI L
Sbjct: 318  SSVLNSVIDL 327


>ref|XP_002510979.1| DNA binding protein, putative [Ricinus communis]
            gi|223550094|gb|EEF51581.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 328

 Score =  253 bits (645), Expect = 2e-64
 Identities = 164/425 (38%), Positives = 221/425 (52%), Gaps = 3/425 (0%)
 Frame = +1

Query: 211  SSTRRKSKFAPKGPPRREAQPPKPKSELTESDG-DDEINEA--LLRKVNDHLTRRLHKAE 381
            S ++RK KF PK P +R  +   PK+E+   D  +DE  +A  L+RK N++  R+  + E
Sbjct: 7    SPSQRKVKFTPKAPSQRRPRRTVPKTEVNGVDNNEDEAVQAQKLMRKFNENFRRQGPRVE 66

Query: 382  KKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPSTDVAGG 561
            KKS+VQVAF  G  SST IRT+G+     GE     G  DS  DDG++++S  STD    
Sbjct: 67   KKSTVQVAFGPGATSSTSIRTFGVSK---GENPVSSGIKDSTDDDGKIVISSLSTDKEDE 123

Query: 562  ILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*HDSLLG 741
            I+  +   +D L  K ++ Y+E W                                    
Sbjct: 124  IINCASEDIDALPLKIKKDYREPW------------------------------------ 147

Query: 742  FLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSASTLGLLG 921
                   DY  +YYP TLPLRRP+SGDP LLD  EFG+AA  +EYDE+T+N AS L LL 
Sbjct: 148  -------DYDRTYYPTTLPLRRPYSGDPVLLDEAEFGEAARKLEYDESTMNPASDLELL- 199

Query: 922  LEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGSSTPRG 1101
                                          E+  + K+    +PA+              
Sbjct: 200  ------------------------------EECDTEKMIFFQLPAKL------------- 216

Query: 1102 IASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCVGKMLV 1281
                P+ +           SA  KGKE A+    S G +A++K  SL+ LSAG +GKMLV
Sbjct: 217  ----PLVK----------RSASAKGKEKAEGSIPSQGKNAAKKESSLDGLSAGYMGKMLV 262

Query: 1282 YKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEIDSLLD 1461
            Y+SGAVKLK+G+ LYDVS GSDC+FAQ++M +NT  K CC +GEL KRAVVTP++DSLLD
Sbjct: 263  YRSGAVKLKLGDTLYDVSQGSDCMFAQDVMAINTAAKHCCTIGELEKRAVVTPDVDSLLD 322

Query: 1462 SVIHL 1476
            SV++L
Sbjct: 323  SVVNL 327


>ref|XP_002303490.2| hypothetical protein POPTR_0003s10650g [Populus trichocarpa]
            gi|550342916|gb|EEE78469.2| hypothetical protein
            POPTR_0003s10650g [Populus trichocarpa]
          Length = 368

 Score =  228 bits (582), Expect = 5e-57
 Identities = 168/433 (38%), Positives = 220/433 (50%), Gaps = 9/433 (2%)
 Frame = +1

Query: 208  ASSTRRKSKFAPKGPPRREAQPPKPKSELTESDG----DDEINEA--LLRKVNDHLTRRL 369
            +S +R K KF PK  PRR+ +P  PK+E    D     D+E  +A  L+ K N++L R++
Sbjct: 51   SSPSRTKLKFKPK-LPRRQRRPSVPKTEEINDDRRSNEDEEAAQAQMLIHKFNENLRRQV 109

Query: 370  HKAEKKSSVQVAFTHGVASSTP--IRTYGMQ-NEGTGEISKGKGSMDSISDDGQVLLSLP 540
             K EKK  VQVAF  G A S P  IR Y +  +E TG  S   G+ D+  DDG++ +   
Sbjct: 110  PK-EKKPQVQVAFGPG-APSPPLLIRKYNVPVHENTG--SSWSGTEDTRDDDGKIFVPPS 165

Query: 541  STDVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR 720
            +  V G        A++PL  K +R YKE W                             
Sbjct: 166  AARVDG--------AINPLSLKGKRRYKEPW----------------------------- 188

Query: 721  *HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSA 900
                          DYHH YYP TLPLR P+SGDP+LLD  EFG+ A  +EYDE TIN A
Sbjct: 189  --------------DYHHIYYPNTLPLRPPYSGDPKLLDEAEFGEEARNLEYDETTINPA 234

Query: 901  STLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIA 1080
            S LGL  LE+ D   + FFQ+P  + F K+  SA GKE A        ++P+ +K     
Sbjct: 235  SDLGL--LEECDNERLFFFQVPEKLPFLKRSASAKGKERA------DMSMPSESK----- 281

Query: 1081 GSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAG 1260
                                      SA RK                     S E+L  G
Sbjct: 282  --------------------------SAARK--------------------TSFEELPKG 295

Query: 1261 CVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTP 1440
             +GKMLVY+SGA+KLK+G+ LYDVSPGS+C FAQ++M +NT  K CCA+GEL KRAVVTP
Sbjct: 296  YMGKMLVYRSGAIKLKLGDALYDVSPGSECTFAQDVMAINTAGKDCCAIGELGKRAVVTP 355

Query: 1441 EIDSLLDSVIHLE 1479
            +I+  L+SVI+L+
Sbjct: 356  DIEFNLNSVINLD 368


>ref|XP_004307797.1| PREDICTED: uncharacterized protein LOC101300483 [Fragaria vesca
            subsp. vesca]
          Length = 324

 Score =  224 bits (572), Expect = 7e-56
 Identities = 162/436 (37%), Positives = 218/436 (50%), Gaps = 6/436 (1%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRL 369
            MD D G S+ RRK +F P+  PRR    P P +E+ +++ ++   +ALLRK  ++  RR 
Sbjct: 1    MDKD-GPSAPRRKGRFKPRAQPRR----PNPTTEVEDAEKEEREAKALLRKFQENRARRA 55

Query: 370  HKAEKKSS--VQVAFTHGVASSTPIRTYG---MQNEGTGEISKGKGSMDSISDDGQVLLS 534
             KAEKKS+  V+VAF  G  SS+ +RTYG   ++N   G     KG      D  ++L S
Sbjct: 56   PKAEKKSAAAVEVAFGPGAQSSSSLRTYGVPKLENLDQGSSLGVKGY-----DGHKILSS 110

Query: 535  LP-STDVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDAS 711
             P +T  AG       +  D      +  Y E W                          
Sbjct: 111  SPLATGGAGTDAPMDIDTADASISNVKNHYVEIW-------------------------- 144

Query: 712  RPR*HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTI 891
                             DY +S YP +LPLR+P+SGDP++L+ +EF + A A EYDE+TI
Sbjct: 145  -----------------DYENSKYPISLPLRKPYSGDPDILNEKEFVEDA-AKEYDESTI 186

Query: 892  NSASTLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDK 1071
            N AS LGL  LE+    ++LF QLP  +   K+ TSA GKE                   
Sbjct: 187  NCASELGL--LEQNPKEKLLFVQLPPTLPLVKRSTSAKGKEK------------------ 226

Query: 1072 AIAGSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKL 1251
               GSSTP                                    S  V A++K+  LE+L
Sbjct: 227  --VGSSTP------------------------------------SEKVGAAKKSGGLEEL 248

Query: 1252 SAGCVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAV 1431
            S G +GKMLVYKSGAVK K+G+ LYDVSPGSDC+FAQ+I  +NT  ++CC +GEL +R V
Sbjct: 249  SEGYMGKMLVYKSGAVKFKLGDALYDVSPGSDCVFAQDIAAINTAARKCCVLGELGQRVV 308

Query: 1432 VTPEIDSLLDSVIHLE 1479
            VTP++DSLLD+ I LE
Sbjct: 309  VTPDVDSLLDATIELE 324


>ref|XP_004146914.1| PREDICTED: uncharacterized protein LOC101207602 [Cucumis sativus]
          Length = 334

 Score =  205 bits (522), Expect = 4e-50
 Identities = 152/435 (34%), Positives = 206/435 (47%), Gaps = 13/435 (2%)
 Frame = +1

Query: 211  SSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEA--LLRKVNDHLTRRLHKAEK 384
            S  RRK KFAPK   R+   PP P  +  + DG+  + +   LLR+ N++L +R +K EK
Sbjct: 7    SPPRRKVKFAPKSSQRKRPPPP-PVQKTEDEDGEGYVAQTRYLLRRANENLGKRANKVEK 65

Query: 385  KSSVQVAFTHGVAS-STPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLP------- 540
            KSSVQVAF  G  S S+ IRTYG+     G  S+       + +D + +L +        
Sbjct: 66   KSSVQVAFGPGAESTSSSIRTYGVPKVENG--SRKNDIEPEVDEDEEFVLPVARDVNEDG 123

Query: 541  ---STDVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDAS 711
                     GI E+S +A++    K +R YKE W                          
Sbjct: 124  KYFDKKTKDGITESSSSAMET---KTKRDYKEPW-------------------------- 154

Query: 712  RPR*HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTI 891
                             DY +SYYP TLPLR P+SGDPE LD  EFGQ     EYDEN++
Sbjct: 155  -----------------DYQNSYYPTTLPLRMPYSGDPERLDEAEFGQDVMNREYDENSV 197

Query: 892  NSASTLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDK 1071
              A  LGLL  ++   +   FFQL                             PAR    
Sbjct: 198  IPALDLGLL--DENTESTKYFFQL-----------------------------PARLP-- 224

Query: 1072 AIAGSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKL 1251
                                 +P   S A+    GKE   +  SS    +S+    L+KL
Sbjct: 225  ---------------------LPKQSSTAT----GKEKVGNSRSSNSTSSSDLD-DLKKL 258

Query: 1252 SAGCVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAV 1431
            SAGC+GK+L+YKSGA+KL++G+ILYDVS GS+C F Q ++ +NT E QCC +G++  R V
Sbjct: 259  SAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAINTEEGQCCDLGDIGNRVV 318

Query: 1432 VTPEIDSLLDSVIHL 1476
            VTP+I SLL+SV +L
Sbjct: 319  VTPDISSLLNSVTNL 333


>ref|XP_004167167.1| PREDICTED: uncharacterized protein LOC101227599 [Cucumis sativus]
          Length = 322

 Score =  202 bits (514), Expect = 4e-49
 Identities = 151/430 (35%), Positives = 201/430 (46%), Gaps = 8/430 (1%)
 Frame = +1

Query: 211  SSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEA--LLRKVNDHLTRRLHKAEK 384
            S  RRK KFAPK   R+   PP P  +  + DG+  + +   LLR+ N++L +R +K EK
Sbjct: 7    SPPRRKVKFAPKSSQRKRPPPP-PVQKTEDEDGEGYVAQTRYLLRRANENLGKRANKVEK 65

Query: 385  KSSVQVAFTHGVAS-STPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPSTDVAGG 561
            KSSVQVAF  G  S S+ IRTYG+     G  S+       + +D + +L +        
Sbjct: 66   KSSVQVAFGPGAESTSSSIRTYGVPKVENG--SRKNDIEPEVDEDEEFVLPVA------- 116

Query: 562  ILENSGNAVDPLFKKK-----RRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
                  N     F KK     +R YKE W                               
Sbjct: 117  ---RDANEDGKYFDKKPKMETKRDYKEPW------------------------------- 142

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY +SYYP TLPLR P+SGDPELLD  EFGQ     EYDEN++  A  
Sbjct: 143  ------------DYQNSYYPTTLPLRMPYSGDPELLDEAEFGQDVMNREYDENSVIPALD 190

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            LGLL  ++   +   FFQL                             PAR         
Sbjct: 191  LGLL--DENTESTKYFFQL-----------------------------PARLP------- 212

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
                            +P   S A+    GKE   +  SS    +S+    L+KLSAGC+
Sbjct: 213  ----------------LPKQSSTAT----GKEKVGNSRSSNSTSSSDLD-DLKKLSAGCM 251

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
            GK+L+YKSGA+KL++G+ILYDVS GS+C F Q ++ +NT E QCC +G++  R VVTP+I
Sbjct: 252  GKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAINTEEGQCCDLGDIGNRVVVTPDI 311

Query: 1447 DSLLDSVIHL 1476
             SLL+SV +L
Sbjct: 312  SSLLNSVTNL 321


>ref|XP_003534408.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 isoform X1
            [Glycine max]
          Length = 303

 Score =  197 bits (500), Expect = 2e-47
 Identities = 141/423 (33%), Positives = 198/423 (46%), Gaps = 1/423 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDD-EINEALLRKVNDHLTRR 366
            MD D G S  R+     PK  PR       PK+E  +   +D E+  AL R+ +++  +R
Sbjct: 1    MDPDQGTSKARK-----PKFKPRNLKAVRAPKTEADDKRNEDSEVPRALSRRRHENPAKR 55

Query: 367  LHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
              K E+KSSV+VAF+ G +SS  +RTYG              S+DS +  G       + 
Sbjct: 56   EPKVERKSSVEVAFSLGSSSSHSLRTYGTSK-----------SIDSGTSSGSPSKYFANE 104

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
             +     E+  +A +   +K +R YKE W                               
Sbjct: 105  QIRSIATEDQNDASNASARKIKREYKEPW------------------------------- 133

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY +SYYP TLPLR+P SGDPE+LD +EFG+AA++VEYDENT+NSA+ 
Sbjct: 134  ------------DYENSYYPTTLPLRKPNSGDPEILDEKEFGEAATSVEYDENTVNSAAE 181

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            L +L   + +   M FFQ+P        P   D + + G  K+               G+
Sbjct: 182  LEIL---ESEEQRMFFFQIP-------TPLPMDKQSNKGKEKI---------------GT 216

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            ST  G A+   +  +E+P        G  GK                             
Sbjct: 217  STVSGEATKSKNALEELP-------RGYMGK----------------------------- 240

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
              MLVYKSGA+KLK+GE L DVSPGS+C   Q++M VNT +KQCC +GE++KR VV P++
Sbjct: 241  --MLVYKSGAIKLKLGETLLDVSPGSNCRCVQDVMAVNTAQKQCCNLGEISKRVVVVPDL 298

Query: 1447 DSL 1455
            DS+
Sbjct: 299  DSI 301


>gb|ACU24406.1| unknown [Glycine max]
          Length = 303

 Score =  194 bits (494), Expect = 8e-47
 Identities = 139/423 (32%), Positives = 198/423 (46%), Gaps = 1/423 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDD-EINEALLRKVNDHLTRR 366
            MD D G S  R+     PK  PR       PK+E  +   +D E+  AL R+ +++  +R
Sbjct: 1    MDPDQGTSKARK-----PKFKPRNLKAVRAPKTEADDKRNEDSEVPRALSRRRHENPAKR 55

Query: 367  LHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
              K E+KSSV+VAF+ G +SS  +RTYG              S+DS +  G       + 
Sbjct: 56   EPKVERKSSVEVAFSLGSSSSHSLRTYGTSK-----------SIDSGTSSGSPSKYFANE 104

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
             +     E+  +A +   ++ +R Y+E W                               
Sbjct: 105  QIRSIATEDQNDASNASARRIKREYREPW------------------------------- 133

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY +SYYP TLPLR+P SGDPE+LD +EFG+AA++VEYDENT+NSA+ 
Sbjct: 134  ------------DYENSYYPTTLPLRKPNSGDPEILDEKEFGEAATSVEYDENTVNSAAE 181

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            L +L   + +   M FFQ+P        P   D + + G  K+               G+
Sbjct: 182  LEIL---ESEEQRMFFFQIP-------TPLPMDKQSNKGKEKI---------------GT 216

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            ST  G A+   +  +E+P        G  GK                             
Sbjct: 217  STVSGEATKSKNALEELP-------RGYMGK----------------------------- 240

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
              MLVYKSGA+KLK+GE L DVSPGS+C   Q++M VNT +KQCC +GE++KR VV P++
Sbjct: 241  --MLVYKSGAIKLKLGETLLDVSPGSNCRCVQDVMAVNTAQKQCCNLGEISKRVVVVPDL 298

Query: 1447 DSL 1455
            DS+
Sbjct: 299  DSI 301


>gb|ESW03348.1| hypothetical protein PHAVU_011G006800g [Phaseolus vulgaris]
          Length = 318

 Score =  192 bits (487), Expect = 5e-46
 Identities = 140/424 (33%), Positives = 197/424 (46%), Gaps = 2/424 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRL 369
            MD D G+S TR K KF P+ P     +P  PK+E  +   +D     LL +  ++  RR 
Sbjct: 1    MDPDQGSSRTR-KHKFTPRPP-----KPHAPKTEKDDKQDEDSAPARLLSRRYENSARRE 54

Query: 370  HKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPSTD 549
             K E KSSV+VAF+ GV SST +RTYG         + G  S     +  +   S  +T 
Sbjct: 55   PKVETKSSVEVAFSPGV-SSTSLRTYGTSKAVDNGTNSGSPSKSFAKEQIRSRRSSAATG 113

Query: 550  VAG--GILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR* 723
                   ++ + N  +   +K +R YKE W                              
Sbjct: 114  DQNDTSTIDVTDNTTNETARKIKREYKEPW------------------------------ 143

Query: 724  HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAS 903
                         DY +SYYP TLPLR+P SG+PE+LD EEFG+AA++ +YDEN +NSA+
Sbjct: 144  -------------DYTNSYYPITLPLRKPNSGNPEILDEEEFGEAATSSKYDENAVNSAA 190

Query: 904  TLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAG 1083
             L LL  EK +  +M  FQ P +  F                   G N     K+K   G
Sbjct: 191  ELKLL--EKSEQHKMFLFQFPKNFPFN-----------------VGSN-----KEKGQIG 226

Query: 1084 SSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGC 1263
            ++T  G                                          KA +LE+L +G 
Sbjct: 227  ATTVSG------------------------------------------KAGALEELPSGY 244

Query: 1264 VGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPE 1443
            +GKM +YKSGA+KLK+GE L+DVSPG+ C F+Q+++ VN  +KQ C +GE+N + VV PE
Sbjct: 245  MGKMQIYKSGAIKLKLGETLFDVSPGTKCGFSQDVVAVNIAQKQICNLGEVNHKVVVVPE 304

Query: 1444 IDSL 1455
            +DS+
Sbjct: 305  LDSI 308


>ref|XP_006587708.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 isoform X2
            [Glycine max]
          Length = 292

 Score =  184 bits (466), Expect = 1e-43
 Identities = 137/423 (32%), Positives = 192/423 (45%), Gaps = 1/423 (0%)
 Frame = +1

Query: 190  MDLDLGASSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDD-EINEALLRKVNDHLTRR 366
            MD D G S  R+     PK  PR       PK+E  +   +D E+  AL R+ +++  +R
Sbjct: 1    MDPDQGTSKARK-----PKFKPRNLKAVRAPKTEADDKRNEDSEVPRALSRRRHENPAKR 55

Query: 367  LHKAEKKSSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPST 546
              K E+KSSV+VAF+ G +SS  +RTYG              S+DS +  G       + 
Sbjct: 56   EPKVERKSSVEVAFSLGSSSSHSLRTYGTSK-----------SIDSGTSSGSPSKYFANE 104

Query: 547  DVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*H 726
             +     E+  +A +   +K +R YKE W                               
Sbjct: 105  QIRSIATEDQNDASNASARKIKREYKEPW------------------------------- 133

Query: 727  DSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSAST 906
                        DY +SYYP TLPLR+P SGDPE+LD +EFG+AA++VEYDENT+NSA+ 
Sbjct: 134  ------------DYENSYYPTTLPLRKPNSGDPEILDEKEFGEAATSVEYDENTVNSAAE 181

Query: 907  LGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGS 1086
            L +L              +P        P   D + + G  K+               G+
Sbjct: 182  LEIL--------------IP-------TPLPMDKQSNKGKEKI---------------GT 205

Query: 1087 STPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCV 1266
            ST  G A+   +  +E+P        G  GK                             
Sbjct: 206  STVSGEATKSKNALEELP-------RGYMGK----------------------------- 229

Query: 1267 GKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEI 1446
              MLVYKSGA+KLK+GE L DVSPGS+C   Q++M VNT +KQCC +GE++KR VV P++
Sbjct: 230  --MLVYKSGAIKLKLGETLLDVSPGSNCRCVQDVMAVNTAQKQCCNLGEISKRVVVVPDL 287

Query: 1447 DSL 1455
            DS+
Sbjct: 288  DSI 290


>gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Morus notabilis]
          Length = 328

 Score =  178 bits (451), Expect = 8e-42
 Identities = 137/424 (32%), Positives = 191/424 (45%), Gaps = 6/424 (1%)
 Frame = +1

Query: 223  RKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRLHKAEKK-SSVQ 399
            RK +F PK PP R  +       + E+D D      LLR+ N+  TR   K EKK ++ Q
Sbjct: 14   RKRRFMPKAPPSRVPKAEVKAEVVEETDADQA--RVLLRRFNEGSTRAKPKVEKKVAAAQ 71

Query: 400  VAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLL----SLPSTDVAGGIL 567
            VAF +G AS+T IR+YG+   G    S+G  +   +      L     S P  D+   +L
Sbjct: 72   VAFGYGGASNT-IRSYGVPKGGYRN-SQGPPATRMLFTSAAFLSTVNKSFPMHDIKNHVL 129

Query: 568  ENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*HDSLLGFL 747
             +      P   ++ + YKE W                                      
Sbjct: 130  TDGAF---PSGTRQEKEYKEPW-------------------------------------- 148

Query: 748  MQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSASTLGLLGLE 927
                 DY+ SYYP+TLP RRP SG+PE LD EEFG     + YDE +  +A+ LGL  +E
Sbjct: 149  -----DYY-SYYPSTLPFRRPHSGNPEFLDEEEFGADTETINYDETSAKAATELGL--VE 200

Query: 928  KGDAAEMLFFQLPGDI-LFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGSSTPRGI 1104
            +     M+  QLP  + L  +   +A G+E   + K +   V A+A  KA A    P G 
Sbjct: 201  ENPETSMILLQLPPIMPLMKRSANTAAGQE---ATKSSPAPVVAQATHKACALHELPAGF 257

Query: 1105 ASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCVGKMLVY 1284
                                                                 +GKMLVY
Sbjct: 258  -----------------------------------------------------MGKMLVY 264

Query: 1285 KSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEIDSLLDS 1464
            +SGA+KLKIG+ LYDVS G DC+F+Q+++ +NTVEK CCA+GEL KRA +TP++D +L S
Sbjct: 265  RSGAIKLKIGDTLYDVSSGMDCVFSQDVVAINTVEKHCCAVGELKKRAAITPDVDFILQS 324

Query: 1465 VIHL 1476
            +  L
Sbjct: 325  MADL 328


>ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citrus clementina]
            gi|568853572|ref|XP_006480425.1| PREDICTED:
            uncharacterized protein LOC102622464 [Citrus sinensis]
            gi|557530644|gb|ESR41827.1| hypothetical protein
            CICLE_v10012311mg [Citrus clementina]
          Length = 303

 Score =  172 bits (437), Expect = 3e-40
 Identities = 137/424 (32%), Positives = 196/424 (46%), Gaps = 2/424 (0%)
 Frame = +1

Query: 211  SSTRRKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLR-KVNDHLTRRLHKAEKK 387
            S+  RK K+APK PPRR  +  + K+E+ E+    +  + L R   N    +   K EKK
Sbjct: 10   SNATRKIKYAPKAPPRRVPKA-EVKTEMVENADAAQAMDLLQRFNANQGALKGRPKVEKK 68

Query: 388  -SSVQVAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPSTDVAGGI 564
             +  Q+AF  G AS T I++YG+   G            S S  GQ              
Sbjct: 69   VAPSQIAFGQGGAS-TFIKSYGIPKGG------------SSSSRGQ-------------- 101

Query: 565  LENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*HDSLLGF 744
                G+AV+                             GAHA+            + LG 
Sbjct: 102  ----GSAVN----------------------------GGAHAS-----------GTRLGK 118

Query: 745  LMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSASTLGLLGL 924
              Q   DY+ SYYP +LPLRRP+SG PELLD EEFG+A+  + YDE+++N A  LGL+  
Sbjct: 119  EYQEPWDYY-SYYPVSLPLRRPYSGSPELLDEEEFGEASETINYDESSMNPAEELGLM-- 175

Query: 925  EKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGSSTPRGI 1104
            E+     M+F QLP        PT            L  K  PA   ++           
Sbjct: 176  EENLEPNMIFLQLP--------PT------------LPLKKQPATGNER----------- 204

Query: 1105 ASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCVGKMLVY 1284
                                     ++ +S +   G  A EK  SL +L    +GK+LVY
Sbjct: 205  -------------------------QVTESSSKHEGATAKEKTSSLSELPGAFMGKLLVY 239

Query: 1285 KSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEIDSLLDS 1464
            +SGAVKLK+GE +Y+V+PG DC+FAQ+++++NT EK  C  GELNKRA+++P++D +L++
Sbjct: 240  RSGAVKLKLGETVYNVTPGMDCMFAQDVVVINTAEKHFCVAGELNKRAILSPDVDFILNN 299

Query: 1465 VIHL 1476
               L
Sbjct: 300  FADL 303


>gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma
            cacao]
          Length = 294

 Score =  171 bits (432), Expect = 1e-39
 Identities = 138/419 (32%), Positives = 198/419 (47%), Gaps = 1/419 (0%)
 Frame = +1

Query: 223  RKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRRLHKAEKK-SSVQ 399
            RK +FAPK PPR +A   + K+E+ E D D      LL+++N    +   K EKK +S Q
Sbjct: 12   RKMRFAPKAPPR-QAPKLEVKTEVVE-DTDAVQARDLLQRLNQTSAKTKPKVEKKVASSQ 69

Query: 400  VAFTHGVASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPSTDVAGGILENSG 579
            VAF HG AS++ ++ +G        +SKG               S  S +   G++   G
Sbjct: 70   VAFGHGGASAS-MKLFG--------VSKGA--------------SRTSRETLNGVVHTPG 106

Query: 580  NAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR*HDSLLGFLMQNF 759
                    ++ + Y+E W                                          
Sbjct: 107  -------LREEKEYREPW------------------------------------------ 117

Query: 760  QDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSASTLGLLGLEKGDA 939
             DY+ SYYP TLP+RRP+SG+PE LD EEF  A+  + +DEN++  A  LGL  +++   
Sbjct: 118  -DYY-SYYPVTLPMRRPYSGNPEFLDEEEF--ASENITFDENSVEPAVELGL--MDENLE 171

Query: 940  AEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGSSTPRGIASSPM 1119
              M F QLP  +   KQ  +  G E                    +  SS P        
Sbjct: 172  PSMFFLQLPPTLPMIKQSGTTAGLE--------------------VDSSSKP-------- 203

Query: 1120 SQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCVGKMLVYKSGAV 1299
                          A R G     S+  + G+         E+L AG +GKMLV+KSGAV
Sbjct: 204  --------------AARVG-----SVKKTCGL---------EELPAGLMGKMLVHKSGAV 235

Query: 1300 KLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEIDSLLDSVIHL 1476
            KLK+G+ LYDV+PG +C+FAQ+++ VNT EKQCC +GEL+KRAV+TP++DS+L+S+  L
Sbjct: 236  KLKLGDTLYDVTPGLNCVFAQDVVAVNTAEKQCCVVGELDKRAVLTPDVDSVLNSMADL 294


>ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209454 [Cucumis sativus]
            gi|449500539|ref|XP_004161125.1| PREDICTED:
            uncharacterized LOC101209454 [Cucumis sativus]
          Length = 293

 Score =  166 bits (419), Expect = 4e-38
 Identities = 94/233 (40%), Positives = 134/233 (57%)
 Frame = +1

Query: 769  HHSYYPATLPLRRPFSGDPELLDVEEFGQAASAVEYDENTINSASTLGLLGLEKGDAAEM 948
            ++SYYP TLPLRRP+SG+P+ L+ EEFG+A+  + YDENT  +A  LGLL  E+   A++
Sbjct: 116  YYSYYPVTLPLRRPYSGNPDSLNEEEFGEASENLTYDENTTTAAMNLGLL--EENPEADV 173

Query: 949  LFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAIAGSSTPRGIASSPMSQG 1128
            LF QLP  +   KQ +S    ED GS                  G+S+ +  AS P    
Sbjct: 174  LFLQLPPMVPMIKQSSSV---EDMGS------------------GNSSEQNKASQPR--- 209

Query: 1129 KEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSAGCVGKMLVYKSGAVKLK 1308
                                            +K CS+ +L +G +GK+LVY+SGAVKLK
Sbjct: 210  --------------------------------QKTCSMNELPSGSIGKLLVYRSGAVKLK 237

Query: 1309 IGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVTPEIDSLLDSV 1467
            +G+I+YDVS G DC FAQE+  +N   K+CC +GEL+KRA++TP++DS+L ++
Sbjct: 238  LGDIIYDVSSGMDCGFAQEVAAINVEGKRCCIVGELSKRAILTPDVDSMLKNI 290


>gb|EMJ21970.1| hypothetical protein PRUPE_ppa024519mg [Prunus persica]
          Length = 310

 Score =  165 bits (417), Expect = 7e-38
 Identities = 144/430 (33%), Positives = 187/430 (43%), Gaps = 4/430 (0%)
 Frame = +1

Query: 190  MDLDLGASSTR-RKSKFAPKGPPRREAQPPKPKSELTESDGDDEINEALLRKVNDHLTRR 366
            MD D   SS R R  KF PKG  R++ + P P  E+ + D  +E  +A            
Sbjct: 1    MDKDNDGSSGRPRTHKFIPKG--RKKTRAPVPAPEVGDKDDGEEARKA------------ 46

Query: 367  LHKAEKKSSVQVAFTHGV-ASSTPIRTYGMQNEGTGEISKGKGSMDSISDDGQVLLSLPS 543
                    SV+VAF  G   SST IRTYG   +G    S   G  DS  DD Q   +LP 
Sbjct: 47   ---EALFPSVRVAFGPGADTSSTSIRTYGTAKDGNTGKSSSSGLEDS--DDDQSPPTLP- 100

Query: 544  TDVAGGILENSGNAVDPLFKKKRRAYKERWVVTLFISSCWSFTRAGAHAAHRFDASRPR* 723
             +V   +       V+PL   K+R            + CW                    
Sbjct: 101  -EVGKDVSMEDATDVEPLQTVKQR-----------YTECW-------------------- 128

Query: 724  HDSLLGFLMQNFQDYHHSYYPATLPLRRPFSGDPELLDVEEFGQAA--SAVEYDENTINS 897
                         D   +YYP TLPLR+P SGDP  L    F      +  EYDE+TIN 
Sbjct: 129  -------------DSETTYYPTTLPLRKPNSGDPGSLFNGTFNNLNLDAEKEYDESTINH 175

Query: 898  ASTLGLLGLEKGDAAEMLFFQLPGDILFGKQPTSADGKEDAGSRKLTGKNVPARAKDKAI 1077
            A+ LGL  LE+   A M         LF + PT                           
Sbjct: 176  AAELGL--LEEKVEARM---------LFVQLPTIL------------------------- 199

Query: 1078 AGSSTPRGIASSPMSQGKEVPGHFSMASAGRKGKEIADSLTSSTGVHASEKACSLEKLSA 1257
                        P+++           SA  KGKE   S TS     A +K CSLE+L  
Sbjct: 200  ------------PLTK----------RSATAKGKEKVGSSTSLESTGAPKKGCSLEELPG 237

Query: 1258 GCVGKMLVYKSGAVKLKIGEILYDVSPGSDCIFAQEIMIVNTVEKQCCAMGELNKRAVVT 1437
            G +GKMLVYKSGA+KLK+G  L DVSPGSDC+  +++ ++NT EK+CC +G ++ RAVVT
Sbjct: 238  GYMGKMLVYKSGAIKLKLGNTLCDVSPGSDCLCDEDVAVINTEEKRCCVLGGISHRAVVT 297

Query: 1438 PEIDSLLDSV 1467
            P +DSLL+ +
Sbjct: 298  PNVDSLLNVI 307

