BLASTX nr result

ID: Cocculus23_contig00004529 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00004529
         (1170 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270971.2| PREDICTED: uncharacterized protein LOC100241...   384   e-104
ref|XP_006465794.1| PREDICTED: uncharacterized abhydrolase domai...   355   3e-95
ref|XP_006426813.1| hypothetical protein CICLE_v10025233mg [Citr...   352   1e-94
ref|XP_006342882.1| PREDICTED: uncharacterized protein LOC102583...   340   9e-91
ref|XP_002533812.1| conserved hypothetical protein [Ricinus comm...   339   2e-90
ref|XP_004235521.1| PREDICTED: uncharacterized protein LOC101243...   334   5e-89
ref|XP_007024787.1| Muscle M-line assembly protein unc-89, putat...   333   1e-88
ref|XP_007216528.1| hypothetical protein PRUPE_ppa026495mg, part...   333   1e-88
ref|XP_004144449.1| PREDICTED: uncharacterized protein LOC101208...   331   4e-88
ref|XP_004303754.1| PREDICTED: uncharacterized protein LOC101293...   327   5e-87
ref|XP_002303336.2| hypothetical protein POPTR_0003s07100g [Popu...   327   6e-87
ref|XP_004235520.1| PREDICTED: uncharacterized protein LOC101243...   326   1e-86
ref|XP_007024786.1| Muscle M-line assembly protein unc-89, putat...   323   7e-86
ref|NP_564641.2| uncharacterized protein [Arabidopsis thaliana] ...   319   1e-84
ref|XP_006842720.1| hypothetical protein AMTR_s00147p00104660 [A...   319   2e-84
dbj|BAH19946.1| AT1G53800 [Arabidopsis thaliana]                      318   3e-84
gb|EXB64651.1| hypothetical protein L484_017984 [Morus notabilis]     317   6e-84
ref|NP_001031183.1| uncharacterized protein [Arabidopsis thalian...   314   5e-83
ref|XP_006392749.1| hypothetical protein EUTSA_v10011340mg [Eutr...   310   1e-81
ref|XP_002891771.1| endonuclease [Arabidopsis lyrata subsp. lyra...   306   1e-80

>ref|XP_002270971.2| PREDICTED: uncharacterized protein LOC100241217 [Vitis vinifera]
           gi|297742921|emb|CBI35788.3| unnamed protein product
           [Vitis vinifera]
          Length = 586

 Score =  384 bits (987), Expect = e-104
 Identities = 202/310 (65%), Positives = 237/310 (76%), Gaps = 8/310 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLN---VDKFKYAKEKGFSSP--SIQVLRKVNFDQT 752
           MPLLDIAT++PSF+N L TL    L++     KF    EK  +S     Q+ +KV+F+  
Sbjct: 1   MPLLDIATAQPSFQNHLVTLSAQTLIHGKVSGKFASGPEKRLASAWKPFQIPKKVDFNVG 60

Query: 751 QLEMRC-VLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPD--FQSSSEASNEMDE 581
            LE R    L +AV+T+E K  V   D QK   N  LG +   P    QSSSE  +E+DE
Sbjct: 61  HLEARWGKFLIKAVATLEAKCSVQGEDGQKGYNNLPLGVDSRPPGNPIQSSSEEYSELDE 120

Query: 580 REKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEE 401
           RE+LRRMRISKANKGN PWNKG+KHSAETLQRIRE+TKLAMQDPKVK+KL NLGHAQSEE
Sbjct: 121 RERLRRMRISKANKGNTPWNKGKKHSAETLQRIRERTKLAMQDPKVKMKLVNLGHAQSEE 180

Query: 400 TRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQL 221
           TR KIG+GVRMGWQRRREK M+QETC  +WQ+LIAE SRRGYA EEE+QWDSY+ L++QL
Sbjct: 181 TRVKIGVGVRMGWQRRREKRMLQETCYFEWQSLIAEASRRGYAGEEELQWDSYDILDEQL 240

Query: 220 EQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGS 41
           E+EWLES+E+RK M RPKGS+RAPKS EQRRKISEAISAKW+DP YRERVC ALAKYHG 
Sbjct: 241 EREWLESVEERKRMPRPKGSKRAPKSPEQRRKISEAISAKWSDPAYRERVCSALAKYHGI 300

Query: 40  PVGEKRRAQR 11
           P G  R+ +R
Sbjct: 301 PEGAPRKPRR 310


>ref|XP_006465794.1| PREDICTED: uncharacterized abhydrolase domain-containing protein
           DDB_G0269086-like [Citrus sinensis]
          Length = 588

 Score =  355 bits (910), Expect = 3e-95
 Identities = 187/311 (60%), Positives = 231/311 (74%), Gaps = 9/311 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVD----KFKYAKEKG--FSSPSIQVLRKVNFDQ 755
           MPLLDIAT +PS +N L  L+   L +      KF    +K   F+S   Q   K + + 
Sbjct: 1   MPLLDIATGQPSLQNHLGPLRAQTLTHAKVLSCKFTLGDDKRLCFASKPFQFTTKTSINL 60

Query: 754 TQLEMRC-VLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMP--DFQSSSEASNEMD 584
            Q+E R   LL  AV+T+E + LV   D+ KD    +LG +  +P   FQSS   +  +D
Sbjct: 61  GQIETRRGKLLITAVATIEPECLVRKEDRAKDA---VLGVDAGLPAMQFQSSDGETEALD 117

Query: 583 EREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSE 404
           EREKLRRMRISKANKGN PWNKGRKHSAETLQRI+E+T+LAMQ+PKV++KL NLGHAQSE
Sbjct: 118 EREKLRRMRISKANKGNTPWNKGRKHSAETLQRIKERTRLAMQNPKVRMKLVNLGHAQSE 177

Query: 403 ETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQ 224
           ET+ KIGIGVRMGW++RR KLMVQE+C  +WQNLIAE +RRG A EEE+QW SY  L++Q
Sbjct: 178 ETKKKIGIGVRMGWEKRRGKLMVQESCYFEWQNLIAEAARRGLAGEEELQWYSYNILDEQ 237

Query: 223 LEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHG 44
           L +EWLES+E+RK M R KGSRRAPKSAEQR+KI+EAI+AKWADP YRERVC  L+K+HG
Sbjct: 238 LMKEWLESVERRKTMPRTKGSRRAPKSAEQRKKIAEAIAAKWADPEYRERVCAGLSKFHG 297

Query: 43  SPVGEKRRAQR 11
            PVG +R+A+R
Sbjct: 298 VPVGVERKAKR 308


>ref|XP_006426813.1| hypothetical protein CICLE_v10025233mg [Citrus clementina]
           gi|557528803|gb|ESR40053.1| hypothetical protein
           CICLE_v10025233mg [Citrus clementina]
          Length = 588

 Score =  352 bits (904), Expect = 1e-94
 Identities = 184/311 (59%), Positives = 231/311 (74%), Gaps = 9/311 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVD----KFKYAKEKG--FSSPSIQVLRKVNFDQ 755
           MPLLDIAT +PS +N L  L+   L +      KF    +K   F+S   Q   K + + 
Sbjct: 1   MPLLDIATGQPSLQNHLGPLRAQTLTHAKVLSCKFTLGDDKRLCFASKPFQFTTKTSINL 60

Query: 754 TQLEMRC-VLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMP--DFQSSSEASNEMD 584
            Q+E R   LL  AV+T+E + LV   D+ KD    +LG +  +P   FQSS   +  +D
Sbjct: 61  GQIETRRGKLLITAVATIEPECLVRKEDRAKDA---VLGVDAGLPAMQFQSSDGETEALD 117

Query: 583 EREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSE 404
           EREKLRRMRISKANKGN PWNKGRKHSAETLQRI+E+T+LAMQ+PK+++KL NLGHAQSE
Sbjct: 118 EREKLRRMRISKANKGNTPWNKGRKHSAETLQRIKERTRLAMQNPKIRMKLVNLGHAQSE 177

Query: 403 ETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQ 224
           ET+ KIGIGVRMGW++RR KLMVQE+C  +WQNLIAE +RRG A EEE+QW SY  L++Q
Sbjct: 178 ETKKKIGIGVRMGWEKRRGKLMVQESCYFEWQNLIAEAARRGLAGEEELQWYSYNILDEQ 237

Query: 223 LEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHG 44
           L++EWLES+E+RK M R KGS+RAPK AEQR+KI+EAI+AKWADP YRERVC  L+K+HG
Sbjct: 238 LKKEWLESVERRKTMPRTKGSKRAPKPAEQRKKIAEAIAAKWADPEYRERVCAGLSKFHG 297

Query: 43  SPVGEKRRAQR 11
            PVG +R+A+R
Sbjct: 298 VPVGVERKAKR 308


>ref|XP_006342882.1| PREDICTED: uncharacterized protein LOC102583814 isoform X1 [Solanum
           tuberosum]
          Length = 616

 Score =  340 bits (871), Expect = 9e-91
 Identities = 183/311 (58%), Positives = 229/311 (73%), Gaps = 9/311 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLE 743
           MPLLDIAT++P FRN + ++ +HN+    +F Y  E  F S   S ++ +K +  + +L 
Sbjct: 1   MPLLDIATTQPCFRNNVCSISSHNVF-ATRFLYNNEIRFVSTWNSFRIPQKPSNSRIKL- 58

Query: 742 MRCVLLTRAVSTVEH---KYLVLNGDQQKDGR----NFLLGPEPSMPDFQSSSEASNEMD 584
            R  L+ RAV+T+E    K    N +Q   G      +   P  ++ + QS+SE + E++
Sbjct: 59  FRSGLMIRAVATLEKGPTKNTQTNEEQSNFGGVRMGKYAASPTSAVVEQQSASEEA-ELN 117

Query: 583 EREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSE 404
           EREKLRRMRISKANKGN PWNKGRKHS ETLQRIRE+T+LAMQDPKVK+KL NLGHAQSE
Sbjct: 118 EREKLRRMRISKANKGNTPWNKGRKHSPETLQRIRERTRLAMQDPKVKMKLVNLGHAQSE 177

Query: 403 ETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQ 224
           ETR KIG+ VRMGW+RRR  L +QETC  +WQNLIAE SRRG   EEE+QWDSYE L+KQ
Sbjct: 178 ETRLKIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILSKQ 237

Query: 223 LEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHG 44
           LEQEW++S+++RK   R KG++RAPKSAEQRRKISEAI+AKWADP YR RV  AL+KYHG
Sbjct: 238 LEQEWIQSVQERKNKPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKYHG 297

Query: 43  SPVGEKRRAQR 11
            P G +RR +R
Sbjct: 298 IPDGVERRPRR 308


>ref|XP_002533812.1| conserved hypothetical protein [Ricinus communis]
           gi|223526249|gb|EEF28565.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 595

 Score =  339 bits (869), Expect = 2e-90
 Identities = 181/305 (59%), Positives = 224/305 (73%), Gaps = 3/305 (0%)
 Frame = -3

Query: 907 LDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEM-RCV 731
           +DIAT++ S +N LS L+   L+           G     +   +  +F +T     R  
Sbjct: 30  VDIATAQASLQNHLSPLRAQPLIPCKVLPSPFIFGDEKRPLARWKSFHFPKTLNTCERSK 89

Query: 730 LLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQ--SSSEASNEMDEREKLRRMR 557
           L  +AV+T+E K   L+ +++K   + LL    + P  Q  SS   S E+DE EKLRRMR
Sbjct: 90  LQIKAVATLEPK--ALHKEEEKSKESVLLDDNSNAPAVQADSSDADSLELDENEKLRRMR 147

Query: 556 ISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIG 377
           ISKANKGN PWNKGRKHS ET+QRIRE+T+LAMQ+PK+++KLANLGHAQS+ETR KIG+G
Sbjct: 148 ISKANKGNTPWNKGRKHSPETIQRIRERTRLAMQNPKIRMKLANLGHAQSKETRTKIGVG 207

Query: 376 VRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESI 197
           VRM W++RREK  VQETCL +WQNLIAE SRRGYA EEEMQWDSY+ L ++LE EW+ESI
Sbjct: 208 VRMRWKKRREKKNVQETCLFEWQNLIAEASRRGYAGEEEMQWDSYKILTEKLEVEWVESI 267

Query: 196 EQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRA 17
           EQRK M RPKGS+RAPKS EQRRKI+EAI+AKWADP YRERVC AL+KYHG+PVG K R 
Sbjct: 268 EQRKTMPRPKGSKRAPKSPEQRRKIAEAIAAKWADPEYRERVCSALSKYHGTPVGIKPR- 326

Query: 16  QRTRP 2
           +RT+P
Sbjct: 327 RRTQP 331


>ref|XP_004235521.1| PREDICTED: uncharacterized protein LOC101243687 isoform 2 [Solanum
           lycopersicum]
          Length = 617

 Score =  334 bits (856), Expect = 5e-89
 Identities = 181/311 (58%), Positives = 225/311 (72%), Gaps = 9/311 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLE 743
           MPLLDIAT++P FRN + ++ +HN+    +F Y  E  F S   S ++ +K    + +L 
Sbjct: 1   MPLLDIATTQPCFRNNVCSISSHNVF-ATRFLYNNEIRFVSTWNSFRIPQKAPNSRIKL- 58

Query: 742 MRCVLLTRAVSTVEH---KYLVLNGDQQKDGR----NFLLGPEPSMPDFQSSSEASNEMD 584
            R  L+ RA++T+E    K    N +Q   G      +       + + QS SE + E++
Sbjct: 59  FRSGLMIRAIATLEKGPTKNTKTNEEQNNFGDVRMGKYAASSTSVVVEQQSPSEEA-ELN 117

Query: 583 EREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSE 404
           EREKLRRMRISKANKGN PWNKGRKHS ETLQRIRE+T+LAMQDPKVK+KL NLGHAQSE
Sbjct: 118 EREKLRRMRISKANKGNTPWNKGRKHSPETLQRIRERTRLAMQDPKVKMKLVNLGHAQSE 177

Query: 403 ETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQ 224
           ETR KIG+ VRMGW+RRR  L +QETC  +WQNLIAE SRRG   EEE+QWDSYE L+KQ
Sbjct: 178 ETRLKIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILSKQ 237

Query: 223 LEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHG 44
           LEQEW++S+++RK   R KG++RAPKSAEQRRKISEAI+AKWADP YR RV  AL+KYHG
Sbjct: 238 LEQEWIQSVQERKNRPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKYHG 297

Query: 43  SPVGEKRRAQR 11
            P G +RR +R
Sbjct: 298 IPDGVERRPRR 308


>ref|XP_007024787.1| Muscle M-line assembly protein unc-89, putative isoform 2
           [Theobroma cacao] gi|508780153|gb|EOY27409.1| Muscle
           M-line assembly protein unc-89, putative isoform 2
           [Theobroma cacao]
          Length = 582

 Score =  333 bits (853), Expect = 1e-88
 Identities = 184/312 (58%), Positives = 225/312 (72%), Gaps = 10/312 (3%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLE 743
           MPLLDIAT++PS ++ L  L+   L+        + +  S+P  S Q+  ++NF    LE
Sbjct: 1   MPLLDIATAQPSLQSHLVPLRAQTLI--------QGQVLSNPWKSSQLPTRLNFHVGHLE 52

Query: 742 MRCV---LLTRAVSTVEHKYLVLNGDQQKDGRNFL-LGPE--PSMPDFQS--SSEASNEM 587
                  L  RAV+T+E K  V     ++DGRN   LG +  PS    +S  S ++  E 
Sbjct: 53  TLTQGGKLQIRAVATLEPKCSV----PKEDGRNTSQLGRDSSPSSTQLESLKSGDSDEEP 108

Query: 586 DEREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQS 407
           DEREKLRRMRISKANKGN PWNKGRKHSAETLQRIRE T+LAMQ+PKVK+KL NLGHAQS
Sbjct: 109 DEREKLRRMRISKANKGNTPWNKGRKHSAETLQRIREGTRLAMQNPKVKMKLVNLGHAQS 168

Query: 406 EETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNK 227
           +ETR KIGIGVRMGW+RRREKLMVQE C  +W NLIAE SR+GY  EEE+QWDSY+ L  
Sbjct: 169 KETREKIGIGVRMGWERRREKLMVQENCHFEWMNLIAEASRKGYLGEEELQWDSYKILAA 228

Query: 226 QLEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYH 47
           QL ++WLES+E+RK M R KGS+RAPKS EQRRKI+ AI+AKWADP YR+RVC  LAKYH
Sbjct: 229 QLTKDWLESVEERKTMPRTKGSKRAPKSLEQRRKIAAAIAAKWADPEYRKRVCSGLAKYH 288

Query: 46  GSPVGEKRRAQR 11
           G+  G +R+ +R
Sbjct: 289 GTQAGAERKPKR 300


>ref|XP_007216528.1| hypothetical protein PRUPE_ppa026495mg, partial [Prunus persica]
           gi|462412678|gb|EMJ17727.1| hypothetical protein
           PRUPE_ppa026495mg, partial [Prunus persica]
          Length = 647

 Score =  333 bits (853), Expect = 1e-88
 Identities = 174/306 (56%), Positives = 224/306 (73%), Gaps = 9/306 (2%)
 Frame = -3

Query: 901 IATSRPSFRNQLSTLKT----HNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLEM 740
           IA ++P+F+N L TL+     H  +    F +  +K  SS   S  + +K+NF+   + +
Sbjct: 1   IAVAQPAFQNHLGTLRAQTPLHGKVISSPFTFGSDKKLSSAWKSSHIPKKLNFNLGHINI 60

Query: 739 -RCVLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPE--PSMPDFQSSSEASNEMDEREKL 569
            +  LL +AV+T+E K  V N D      N  LG +  P   + +SSSE S E+DERE+L
Sbjct: 61  PKGSLLIKAVATLESKSSVQNEDAHLGYTNSQLGMDSSPWTVEPESSSEDSAELDERERL 120

Query: 568 RRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAK 389
           RR+RISKANKGN PWNKGRKHS ETLQ IRE+T+LAMQ+PKVK+KL NLGHAQS ETR K
Sbjct: 121 RRLRISKANKGNTPWNKGRKHSPETLQLIRERTRLAMQNPKVKMKLVNLGHAQSNETRVK 180

Query: 388 IGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEW 209
           IG+GVR+GWQRRREKL +QE C  +WQNLIA  SR+GY  EEE+QWDSY+  ++ L++E+
Sbjct: 181 IGLGVRIGWQRRREKLSLQENCCFEWQNLIAAASRQGYDGEEELQWDSYKIFDEHLKEEY 240

Query: 208 LESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGE 29
           LES+EQRK+M RPKGS+RAPKS EQRRKIS+AISAKW DP YR+RVC ALAKY+ S  G 
Sbjct: 241 LESVEQRKIMRRPKGSKRAPKSLEQRRKISQAISAKWNDPDYRDRVCSALAKYYDSSYGA 300

Query: 28  KRRAQR 11
           +R+ ++
Sbjct: 301 ERKPRK 306


>ref|XP_004144449.1| PREDICTED: uncharacterized protein LOC101208479 [Cucumis sativus]
           gi|449523814|ref|XP_004168918.1| PREDICTED:
           uncharacterized LOC101208479 [Cucumis sativus]
          Length = 577

 Score =  331 bits (848), Expect = 4e-88
 Identities = 176/300 (58%), Positives = 217/300 (72%), Gaps = 17/300 (5%)
 Frame = -3

Query: 853 THNLLNVDK------FKYAKEKGFSSPSIQVLRKVNFDQTQLEMRCVLLTRAVSTVEHKY 692
           TH+  +V+K      F   KE   S  S+ + ++VN        R   L RAV+T+E K 
Sbjct: 10  THSWFHVNKSLGSLTFGNDKELSSSWKSLIIPKRVNLSVQPEISRRGFLIRAVATLESKP 69

Query: 691 LVLNGDQQK------DGRNFLLGPEPSMPD-----FQSSSEASNEMDEREKLRRMRISKA 545
           L+ +G++        + +N  LG  P  P        SSS  S E+DE+E+LRR RISKA
Sbjct: 70  LLHDGNRDVSMGEPVEFKNSQLGVAPHNPSTSELQLASSSGDSKELDEKERLRRERISKA 129

Query: 544 NKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIGVRMG 365
           NKGN PWNKGRKH+AETL+RI+E+T+LAMQDPKVK+KL  LGHAQSEETR KIG+GVRMG
Sbjct: 130 NKGNTPWNKGRKHTAETLRRIKERTRLAMQDPKVKMKLIKLGHAQSEETRLKIGVGVRMG 189

Query: 364 WQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESIEQRK 185
           WQRRREK ++QETC  +WQNLIAE SR+GY  EEE+QWDSY+ LN++L++EWLES+EQRK
Sbjct: 190 WQRRREKQVLQETCHFEWQNLIAEASRQGYKGEEELQWDSYQILNEELKKEWLESVEQRK 249

Query: 184 LMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRAQRTR 5
              R  GSRRAPKSAEQR+KISE+ISAKWADP YR+RVC ALAKYHG+P G  RR +R R
Sbjct: 250 KTPRVVGSRRAPKSAEQRKKISESISAKWADPDYRDRVCSALAKYHGTPTGVIRRPRRKR 309


>ref|XP_004303754.1| PREDICTED: uncharacterized protein LOC101293611 [Fragaria vesca
           subsp. vesca]
          Length = 667

 Score =  327 bits (839), Expect = 5e-87
 Identities = 173/312 (55%), Positives = 226/312 (72%), Gaps = 7/312 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKT----HNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQ 755
           MP L+IA ++P F+N + TL+     H+ +    F +  +K  SS   S  + ++  F+ 
Sbjct: 1   MPSLEIAVAQPVFQNHICTLRAQTPLHSKVISSPFTFGIDKKLSSSWKSSDIPKRSYFNL 60

Query: 754 TQLEM-RCVLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDER 578
              +  R  L+ +AV+T E K LV N +      +  L  + S    +SSS+ S+EMD++
Sbjct: 61  GNAKTPRSRLVIQAVATFEAKSLVHNENGHMRVNSSELDKDSSPFKPESSSDGSSEMDDK 120

Query: 577 EKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEET 398
           EKLRRMRISKANKGN PWNKGRKHS ETL+ IRE+T+LAMQDPKVK+KL NLGHAQ++ET
Sbjct: 121 EKLRRMRISKANKGNTPWNKGRKHSPETLRLIRERTRLAMQDPKVKMKLVNLGHAQTKET 180

Query: 397 RAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLE 218
           R KIGIGVRMGW++RR+KL +QE+C  +WQNLIAE SRRGY  EEE+QWDSY  L+ +L 
Sbjct: 181 RTKIGIGVRMGWEKRRQKLSLQESCCYEWQNLIAEASRRGYDGEEELQWDSYTILDGKLT 240

Query: 217 QEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSP 38
           +E+LESIEQRK M RPKGS+RAPKS EQRRKIS+AISAKW DP YR+RVC AL+KY+ + 
Sbjct: 241 EEYLESIEQRKTMRRPKGSKRAPKSLEQRRKISQAISAKWNDPEYRDRVCSALSKYYDAS 300

Query: 37  VGEKRRAQRTRP 2
            G +R+ ++  P
Sbjct: 301 YGAERKPRKRSP 312


>ref|XP_002303336.2| hypothetical protein POPTR_0003s07100g [Populus trichocarpa]
           gi|550342603|gb|EEE78315.2| hypothetical protein
           POPTR_0003s07100g [Populus trichocarpa]
          Length = 600

 Score =  327 bits (838), Expect = 6e-87
 Identities = 176/330 (53%), Positives = 225/330 (68%), Gaps = 11/330 (3%)
 Frame = -3

Query: 967 TILLYQ*LNSWMIE*ALMPLLDIATSRPSFRNQLSTLKT----HNLLNVDKFKYAKEKGF 800
           T L+ + +N W          DIAT++ S +N L  L+     H  +    F +  +K  
Sbjct: 8   TFLITEFVNGWF------RWADIATAQASLQNHLGLLRVQTNAHCKVFSSPFAFGDDKRL 61

Query: 799 SS--PSIQVLRKVN-FDQTQLEMRCVLLTRAVSTVEHKYLVLNGDQQK----DGRNFLLG 641
            S   SI+  RK+N F++++L+++      AV+TVE K LV  GD ++    +       
Sbjct: 62  LSLGKSIKFRRKINVFERSKLQIK------AVATVEPKSLVRKGDGKRKTSLENEQLAAN 115

Query: 640 PEPSMPDFQSSSEASNEMDEREKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLA 461
            +      +SS E S  +D++E LRR RIS ANKGN PWNKGRKHS ETLQ+IRE+T+LA
Sbjct: 116 SDTLPAQVESSGEDSMALDDKENLRRKRISNANKGNTPWNKGRKHSPETLQKIRERTRLA 175

Query: 460 MQDPKVKLKLANLGHAQSEETRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRR 281
           MQDPK+K+KLANLGHAQS+ETR KIG GVR+GWQ+RREK MVQE C  +WQNLIAE SRR
Sbjct: 176 MQDPKIKMKLANLGHAQSKETREKIGHGVRLGWQKRREKQMVQEGCYFEWQNLIAEASRR 235

Query: 280 GYADEEEMQWDSYETLNKQLEQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAK 101
           GY  EEE+QWDSY  L +QLE EW+ES++QRK + RPKGS+RAPKS EQRRKISEAI+AK
Sbjct: 236 GYTGEEELQWDSYNILRQQLEDEWVESVQQRKTLPRPKGSKRAPKSLEQRRKISEAIAAK 295

Query: 100 WADPGYRERVCFALAKYHGSPVGEKRRAQR 11
           WADP YRERV   L+KYHG+  G  R+ +R
Sbjct: 296 WADPEYRERVYSGLSKYHGTLAGAARKPRR 325


>ref|XP_004235520.1| PREDICTED: uncharacterized protein LOC101243687 isoform 1 [Solanum
           lycopersicum]
          Length = 618

 Score =  326 bits (836), Expect = 1e-86
 Identities = 177/307 (57%), Positives = 221/307 (71%), Gaps = 9/307 (2%)
 Frame = -3

Query: 904 DIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLEMRCV 731
           DIAT++P FRN + ++ +HN+    +F Y  E  F S   S ++ +K    + +L  R  
Sbjct: 6   DIATTQPCFRNNVCSISSHNVF-ATRFLYNNEIRFVSTWNSFRIPQKAPNSRIKL-FRSG 63

Query: 730 LLTRAVSTVEH---KYLVLNGDQQKDGR----NFLLGPEPSMPDFQSSSEASNEMDEREK 572
           L+ RA++T+E    K    N +Q   G      +       + + QS SE + E++EREK
Sbjct: 64  LMIRAIATLEKGPTKNTKTNEEQNNFGDVRMGKYAASSTSVVVEQQSPSEEA-ELNEREK 122

Query: 571 LRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRA 392
           LRRMRISKANKGN PWNKGRKHS ETLQRIRE+T+LAMQDPKVK+KL NLGHAQSEETR 
Sbjct: 123 LRRMRISKANKGNTPWNKGRKHSPETLQRIRERTRLAMQDPKVKMKLVNLGHAQSEETRL 182

Query: 391 KIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQE 212
           KIG+ VRMGW+RRR  L +QETC  +WQNLIAE SRRG   EEE+QWDSYE L+KQLEQE
Sbjct: 183 KIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILSKQLEQE 242

Query: 211 WLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVG 32
           W++S+++RK   R KG++RAPKSAEQRRKISEAI+AKWADP YR RV  AL+KYHG P G
Sbjct: 243 WIQSVQERKNRPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKYHGIPDG 302

Query: 31  EKRRAQR 11
            +RR +R
Sbjct: 303 VERRPRR 309


>ref|XP_007024786.1| Muscle M-line assembly protein unc-89, putative isoform 1
           [Theobroma cacao] gi|508780152|gb|EOY27408.1| Muscle
           M-line assembly protein unc-89, putative isoform 1
           [Theobroma cacao]
          Length = 611

 Score =  323 bits (829), Expect = 7e-86
 Identities = 180/310 (58%), Positives = 221/310 (71%), Gaps = 10/310 (3%)
 Frame = -3

Query: 910 LLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSP--SIQVLRKVNFDQTQLEMR 737
           L  IAT++PS ++ L  L+   L+        + +  S+P  S Q+  ++NF    LE  
Sbjct: 32  LCHIATAQPSLQSHLVPLRAQTLI--------QGQVLSNPWKSSQLPTRLNFHVGHLETL 83

Query: 736 CV---LLTRAVSTVEHKYLVLNGDQQKDGRNFL-LGPE--PSMPDFQS--SSEASNEMDE 581
                L  RAV+T+E K  V     ++DGRN   LG +  PS    +S  S ++  E DE
Sbjct: 84  TQGGKLQIRAVATLEPKCSV----PKEDGRNTSQLGRDSSPSSTQLESLKSGDSDEEPDE 139

Query: 580 REKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEE 401
           REKLRRMRISKANKGN PWNKGRKHSAETLQRIRE T+LAMQ+PKVK+KL NLGHAQS+E
Sbjct: 140 REKLRRMRISKANKGNTPWNKGRKHSAETLQRIREGTRLAMQNPKVKMKLVNLGHAQSKE 199

Query: 400 TRAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQL 221
           TR KIGIGVRMGW+RRREKLMVQE C  +W NLIAE SR+GY  EEE+QWDSY+ L  QL
Sbjct: 200 TREKIGIGVRMGWERRREKLMVQENCHFEWMNLIAEASRKGYLGEEELQWDSYKILAAQL 259

Query: 220 EQEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGS 41
            ++WLES+E+RK M R KGS+RAPKS EQRRKI+ AI+AKWADP YR+RVC  LAKYHG+
Sbjct: 260 TKDWLESVEERKTMPRTKGSKRAPKSLEQRRKIAAAIAAKWADPEYRKRVCSGLAKYHGT 319

Query: 40  PVGEKRRAQR 11
             G +R+ +R
Sbjct: 320 QAGAERKPKR 329


>ref|NP_564641.2| uncharacterized protein [Arabidopsis thaliana]
           gi|332194882|gb|AEE33003.1| uncharacterized protein
           AT1G53800 [Arabidopsis thaliana]
          Length = 568

 Score =  319 bits (818), Expect = 1e-84
 Identities = 165/304 (54%), Positives = 215/304 (70%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEMR 737
           MP LDIAT +PSF+  L  L   ++++         +   S ++    K     + +   
Sbjct: 1   MPSLDIATIQPSFQAHLVPLGAQSIIHAKSLPNPWRQSCFSKNL----KFYTGHSHVRRG 56

Query: 736 CVLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDEREKLRRMR 557
            VL+T AV+T+E KY      Q+++ R+  L    S     S+ +   ++D+REKLRRMR
Sbjct: 57  KVLIT-AVATLETKYPA----QKENERSSSLSSASSKSSNGSADDGEEQVDDREKLRRMR 111

Query: 556 ISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIG 377
           ISKAN+GN PWNKGRKHS ETLQ+IRE+TK+AMQDPK+K+KLANLGHAQ++ETR KIG G
Sbjct: 112 ISKANRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEG 171

Query: 376 VRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESI 197
           VRM W RR+E+  VQETC  +WQNL+AE +++GY DEEE+QWDSY  L++Q + EWLES+
Sbjct: 172 VRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESV 231

Query: 196 EQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRA 17
           EQRK +   K +RRAPKS EQRR+I+EAI+AKWADP YRERVC  LAKYHG PVG +RR 
Sbjct: 232 EQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRR 291

Query: 16  QRTR 5
           +R R
Sbjct: 292 RRPR 295


>ref|XP_006842720.1| hypothetical protein AMTR_s00147p00104660 [Amborella trichopoda]
           gi|548844821|gb|ERN04395.1| hypothetical protein
           AMTR_s00147p00104660 [Amborella trichopoda]
          Length = 509

 Score =  319 bits (817), Expect = 2e-84
 Identities = 156/239 (65%), Positives = 192/239 (80%)
 Frame = -3

Query: 721 RAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDEREKLRRMRISKAN 542
           +AV+T++  YLV +  +  +  +  L  + +    Q S++ S E+DE E+LRRM+ISKAN
Sbjct: 8   KAVATMDLDYLVQSQSEGGEKNHLSLISDGNCEPNQLSTDDSAELDENERLRRMKISKAN 67

Query: 541 KGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIGVRMGW 362
           KGNVPWNKGRKHS ETL++IRE+TKLAMQDPKVK+KL NLGHAQS+ETR KIG GVR+GW
Sbjct: 68  KGNVPWNKGRKHSPETLRKIRERTKLAMQDPKVKMKLVNLGHAQSKETRVKIGQGVRIGW 127

Query: 361 QRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESIEQRKL 182
           +RRRE+L +QETC LQWQNLI E SR+G   E+E+QWDSYETL+++LE+EW ESIE+R+ 
Sbjct: 128 ERRRERLALQETCCLQWQNLITEASRKGIHGEDELQWDSYETLDRELEKEWQESIERRRS 187

Query: 181 MSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRAQRTR 5
           M RPKG RRAPKS EQRRKISEAISAKWADP YR+RV   L KYHG+PVG  RR+ R R
Sbjct: 188 MPRPKGGRRAPKSPEQRRKISEAISAKWADPEYRDRVFSGLTKYHGTPVGAVRRSPRRR 246


>dbj|BAH19946.1| AT1G53800 [Arabidopsis thaliana]
          Length = 568

 Score =  318 bits (815), Expect = 3e-84
 Identities = 164/304 (53%), Positives = 215/304 (70%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEMR 737
           MP LDIAT +PSF+  L  L   ++++         +   S ++    K     + +   
Sbjct: 1   MPSLDIATIQPSFQAHLVPLGAQSIIHAKSLPNPWRQSCFSKNL----KFYTGHSHVRRG 56

Query: 736 CVLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDEREKLRRMR 557
            VL+T AV+T+E KY      Q+++ R+  L    S     S+ +   ++D+REKLRRMR
Sbjct: 57  RVLIT-AVATLETKYPA----QKENERSSSLSSASSKSSNGSADDGEEQVDDREKLRRMR 111

Query: 556 ISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIG 377
           ISKAN+GN PWNKGRKHS ETLQ+IRE+TK+AMQDPK+K+KLANLGHAQ++ETR KIG G
Sbjct: 112 ISKANRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEG 171

Query: 376 VRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESI 197
           VRM W RR+E+  VQETC  +WQNL+AE +++GY DEEE+QWDSY  L++Q + EWLES+
Sbjct: 172 VRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESV 231

Query: 196 EQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRA 17
           EQRK +   K +RRAP+S EQRR+I+EAI+AKWADP YRERVC  LAKYHG PVG +RR 
Sbjct: 232 EQRKAIKGAKSNRRAPRSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRR 291

Query: 16  QRTR 5
           +R R
Sbjct: 292 RRPR 295


>gb|EXB64651.1| hypothetical protein L484_017984 [Morus notabilis]
          Length = 528

 Score =  317 bits (812), Expect = 6e-84
 Identities = 156/242 (64%), Positives = 193/242 (79%), Gaps = 2/242 (0%)
 Frame = -3

Query: 730 LLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQS--SSEASNEMDEREKLRRMR 557
           LL +AV+T+E K LV NGD+        LG +   P  Q+  SSE  + +D++E+LRRMR
Sbjct: 6   LLIKAVATLEPKRLVNNGDKHSGPNKSQLGMDNRPPTVQTPTSSEGFDNLDDKERLRRMR 65

Query: 556 ISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIG 377
           ISKANKGN PWNKG+KH+ ETL+RIRE T+LAMQDPKVK+KL NLGHAQ+  TR KIG G
Sbjct: 66  ISKANKGNTPWNKGKKHTPETLRRIRENTRLAMQDPKVKMKLVNLGHAQTLATRKKIGAG 125

Query: 376 VRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESI 197
           VRMGWQRRR+KL++QETC  +WQNLIAE SRRG+  E+++QW+SYE LN+QL++ WLES+
Sbjct: 126 VRMGWQRRRKKLLLQETCYFEWQNLIAEASRRGFDGEDKLQWNSYEVLNEQLKEAWLESV 185

Query: 196 EQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRA 17
           E+RK M RPKGS+RAPKSAEQ+RKISEAIS KWAD GYRERV  ALA+YHG   G +R+ 
Sbjct: 186 EKRKSMPRPKGSKRAPKSAEQKRKISEAISRKWADFGYRERVVSALARYHGIEPGTERKP 245

Query: 16  QR 11
           +R
Sbjct: 246 RR 247


>ref|NP_001031183.1| uncharacterized protein [Arabidopsis thaliana]
           gi|222424381|dbj|BAH20146.1| AT1G53800 [Arabidopsis
           thaliana] gi|332194883|gb|AEE33004.1| uncharacterized
           protein AT1G53800 [Arabidopsis thaliana]
          Length = 572

 Score =  314 bits (804), Expect = 5e-83
 Identities = 162/300 (54%), Positives = 212/300 (70%)
 Frame = -3

Query: 904 DIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEMRCVLL 725
           DIAT +PSF+  L  L   ++++         +   S ++    K     + +    VL+
Sbjct: 9   DIATIQPSFQAHLVPLGAQSIIHAKSLPNPWRQSCFSKNL----KFYTGHSHVRRGKVLI 64

Query: 724 TRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDEREKLRRMRISKA 545
           T AV+T+E KY      Q+++ R+  L    S     S+ +   ++D+REKLRRMRISKA
Sbjct: 65  T-AVATLETKYPA----QKENERSSSLSSASSKSSNGSADDGEEQVDDREKLRRMRISKA 119

Query: 544 NKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIGVRMG 365
           N+GN PWNKGRKHS ETLQ+IRE+TK+AMQDPK+K+KLANLGHAQ++ETR KIG GVRM 
Sbjct: 120 NRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEGVRMR 179

Query: 364 WQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESIEQRK 185
           W RR+E+  VQETC  +WQNL+AE +++GY DEEE+QWDSY  L++Q + EWLES+EQRK
Sbjct: 180 WARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESVEQRK 239

Query: 184 LMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRAQRTR 5
            +   K +RRAPKS EQRR+I+EAI+AKWADP YRERVC  LAKYHG PVG +RR +R R
Sbjct: 240 AIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPR 299


>ref|XP_006392749.1| hypothetical protein EUTSA_v10011340mg [Eutrema salsugineum]
           gi|557089327|gb|ESQ30035.1| hypothetical protein
           EUTSA_v10011340mg [Eutrema salsugineum]
          Length = 583

 Score =  310 bits (793), Expect = 1e-81
 Identities = 168/309 (54%), Positives = 216/309 (69%), Gaps = 7/309 (2%)
 Frame = -3

Query: 916 MPLLDIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEMR 737
           MP LDIA  +PS +  L  L   ++++        +K   S +++      F    L +R
Sbjct: 1   MPSLDIAAIQPSLQAHLVPLGAQSIIHAKNLPNPWKKSSFSKNLK------FYTGHLHVR 54

Query: 736 C--VLLTRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMP--DFQSSSE---ASNEMDER 578
              VL+T AV+T+E KY      Q+++ +++ L    S P  D   SSE   A  ++D+R
Sbjct: 55  RGKVLIT-AVATLETKYPA----QKENKQSYSLSSASSKPSEDRNGSSEDGEALEQVDDR 109

Query: 577 EKLRRMRISKANKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEET 398
           EKLRRMRISKAN+GN PWNKGRKHS ETLQ+IRE+TK+AMQDPK+K+KLANLGHAQ++ET
Sbjct: 110 EKLRRMRISKANRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKET 169

Query: 397 RAKIGIGVRMGWQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLE 218
           R KIG GVRM W RR+E+  VQETC  +WQNL+AE ++ GY DEEE+QW+SY  L++Q +
Sbjct: 170 RLKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKEGYTDEEELQWNSYNILDQQNQ 229

Query: 217 QEWLESIEQRKLMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSP 38
            EWLES+EQRK     K +RRAPKS EQRRKI+EAI+AKWADP YRERVC  LAKYHG P
Sbjct: 230 LEWLESVEQRKAARGVKSNRRAPKSPEQRRKIAEAIAAKWADPSYRERVCSGLAKYHGIP 289

Query: 37  VGEKRRAQR 11
            G +RR +R
Sbjct: 290 AGVERRRRR 298


>ref|XP_002891771.1| endonuclease [Arabidopsis lyrata subsp. lyrata]
           gi|297337613|gb|EFH68030.1| endonuclease [Arabidopsis
           lyrata subsp. lyrata]
          Length = 570

 Score =  306 bits (784), Expect = 1e-80
 Identities = 160/300 (53%), Positives = 206/300 (68%)
 Frame = -3

Query: 904 DIATSRPSFRNQLSTLKTHNLLNVDKFKYAKEKGFSSPSIQVLRKVNFDQTQLEMRCVLL 725
           DIAT +PS +  L  L   ++++         +   S ++    K     T +    VL+
Sbjct: 9   DIATIQPSLQAHLVPLGAQSIIHAKTLPNPWRQSCFSKNL----KFYTGHTHVRRGKVLI 64

Query: 724 TRAVSTVEHKYLVLNGDQQKDGRNFLLGPEPSMPDFQSSSEASNEMDEREKLRRMRISKA 545
           T AV+T+E KY     ++Q    +       +       S   +++D+REKLRRMRISKA
Sbjct: 65  T-AVATLETKYPAQKENEQSSSLS------SASSKSSDGSADDDKVDDREKLRRMRISKA 117

Query: 544 NKGNVPWNKGRKHSAETLQRIREKTKLAMQDPKVKLKLANLGHAQSEETRAKIGIGVRMG 365
           N+GN PWNKGRKHS ETLQ+IRE+TK+AMQDPK+KLKLANLGHAQ++ETR KIG GVRM 
Sbjct: 118 NRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKLKLANLGHAQNKETRMKIGEGVRMR 177

Query: 364 WQRRREKLMVQETCLLQWQNLIAEGSRRGYADEEEMQWDSYETLNKQLEQEWLESIEQRK 185
           W RR+E+  VQETC  +WQNL+AE ++ GY DEEE+QWDSY+ L++Q + EWLES+EQRK
Sbjct: 178 WARRKERRKVQETCHFEWQNLLAEAAKEGYRDEEELQWDSYKILDQQNQLEWLESVEQRK 237

Query: 184 LMSRPKGSRRAPKSAEQRRKISEAISAKWADPGYRERVCFALAKYHGSPVGEKRRAQRTR 5
                K +RRAPKS EQRR+I+EAI+AKWADP YRERVC  LAKYHG PVG +RR +R R
Sbjct: 238 AAKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPR 297


Top