BLASTX nr result

ID: Akebia27_contig00003038 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00003038
         (1305 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002319702.2| myb family transcription factor family prote...   298   3e-78
ref|XP_002525443.1| transcription factor, putative [Ricinus comm...   292   3e-76
ref|XP_002325408.2| myb family transcription factor family prote...   287   8e-75
ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248...   287   8e-75
ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prun...   284   5e-74
ref|XP_007030697.1| Homeodomain-like superfamily protein isoform...   280   1e-72
ref|XP_007030696.1| Homeodomain-like superfamily protein isoform...   280   1e-72
ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248...   278   5e-72
ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citr...   246   1e-62
ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citr...   246   1e-62
ref|XP_006443378.1| hypothetical protein CICLE_v10020171mg [Citr...   246   1e-62
ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304...   226   2e-56
ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592...   214   9e-53
ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256...   212   3e-52
ref|XP_006829830.1| hypothetical protein AMTR_s00119p00095480 [A...   207   1e-50
ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592...   205   3e-50
ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phas...   199   2e-48
ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805...   197   8e-48
ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810...   196   1e-47
gb|EXC02650.1| Myb family transcription factor APL [Morus notabi...   196   2e-47

>ref|XP_002319702.2| myb family transcription factor family protein [Populus
           trichocarpa] gi|550325041|gb|EEE95625.2| myb family
           transcription factor family protein [Populus
           trichocarpa]
          Length = 427

 Score =  298 bits (764), Expect = 3e-78
 Identities = 160/253 (63%), Positives = 188/253 (74%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL S F
Sbjct: 175 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSTF 234

Query: 383 PGLKEMPGSCNQQT---QPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             L ++ G C QQT   QP DCS+DSCLTSCEGSQK+QEIHNIGMGLRP ++N  L  KE
Sbjct: 235 SELNDLQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIHNIGMGLRPCNSNALLEPKE 294

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGS-S 730
           I EE  + QTE  W + L  NKMFL+S+  ETE+  F  +RS SD S+ + +QGEKG+ +
Sbjct: 295 IAEEHALQQTELKWGEYLRDNKMFLTSIGHETERRTFSAERSCSDLSIGVGLQGEKGNIN 354

Query: 731 SSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDENDTTS 910
           SS +  R K   EDD F DQTN R   ++ E+EK S G  L+Y T+KLDLN+HDE D  S
Sbjct: 355 SSFAEGRFKGMSEDDSFQDQTNKRAESVKFEDEKMSPGYRLSYFTTKLDLNSHDEIDAAS 414

Query: 911 --RQFDLNGFSWN 943
             +Q DLNGFSWN
Sbjct: 415 SCKQLDLNGFSWN 427


>ref|XP_002525443.1| transcription factor, putative [Ricinus communis]
           gi|223535256|gb|EEF36933.1| transcription factor,
           putative [Ricinus communis]
          Length = 419

 Score =  292 bits (747), Expect = 3e-76
 Identities = 154/252 (61%), Positives = 184/252 (73%), Gaps = 5/252 (1%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGS+GLE AKVQLSEL SKVST+CL SAF
Sbjct: 168 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSIGLEAAKVQLSELVSKVSTQCLNSAF 227

Query: 383 PGLKEMPGSCNQQTQ---PTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C+QQTQ   PTDCS+DSCLTSCEGSQK+QEIHN GMGLRP++ N  L  K+
Sbjct: 228 SELKELQGLCHQQTQTAPPTDCSMDSCLTSCEGSQKEQEIHNTGMGLRPYNGNALLESKD 287

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I E   + QTE  W +DL  NKMFLS +     +  F  +RS+SD SM++ +QGE G++S
Sbjct: 288 ITEGHVLHQTELKWSEDLKDNKMFLSPLGNNAARRNFAAERSTSDLSMTVGLQGENGNAS 347

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDENDTTS- 910
           S S  R K+R++ D F DQTN     ++      S+G  L Y  +KLDLN+H+E D  S 
Sbjct: 348 SFSEGRYKDRNDGDSFPDQTNKSLDSVKLPKGDVSQGYRLPYFATKLDLNSHEEIDAASS 407

Query: 911 -RQFDLNGFSWN 943
            +Q DLNGFSWN
Sbjct: 408 CKQLDLNGFSWN 419


>ref|XP_002325408.2| myb family transcription factor family protein [Populus
           trichocarpa] gi|550316805|gb|EEE99789.2| myb family
           transcription factor family protein [Populus
           trichocarpa]
          Length = 420

 Score =  287 bits (734), Expect = 8e-75
 Identities = 157/253 (62%), Positives = 185/253 (73%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLG+VGLE AKVQLSEL SKVS++CL SAF
Sbjct: 168 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSSKCLNSAF 227

Query: 383 PGLKEMPGSCNQQTQPT---DCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LK++ G C   TQPT   DCS+DSCLTS EGSQK+QEIHN GMGLRP++ N  L  K 
Sbjct: 228 SELKDLQGLCPPLTQPTHPNDCSMDSCLTSIEGSQKEQEIHNTGMGLRPYNGNALLEPKV 287

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGS-S 730
           I  E  + QTE  W +D   NKMFLSS+  +T++  F  +RS S+ S+ + +QGE+G+ S
Sbjct: 288 IAGEHALQQTELKWGEDQRDNKMFLSSMRNDTDRRTFSAERSCSNLSIGVGLQGERGNVS 347

Query: 731 SSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDENDTTS 910
           SS + AR K R EDD F D+TN R   I+ ENEK S G  L+Y  +KLDLN+H E D  S
Sbjct: 348 SSFAEARFKGRSEDDSFQDKTNRRIDAIKLENEKLSPGYRLSYYATKLDLNSHGEIDAAS 407

Query: 911 --RQFDLNGFSWN 943
             RQ DLNGFSWN
Sbjct: 408 GCRQLDLNGFSWN 420


>ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis
           vinifera]
          Length = 418

 Score =  287 bits (734), Expect = 8e-75
 Identities = 158/256 (61%), Positives = 187/256 (73%), Gaps = 9/256 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL SAF
Sbjct: 163 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAF 222

Query: 383 PGLKEMPGSCNQ--QTQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHN-NTHLSQKE 553
             LKE+   C Q  QTQPTDCS+DSCLTSCEGSQ++QEIHN GMGLRP+ N +T L  K+
Sbjct: 223 SELKELQSLCPQQTQTQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKD 282

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGS-S 730
             E   +  T   WC+D   N+ F+SS++R+ E+     +RS+SD SM I +QGEKG+ S
Sbjct: 283 TAEPPGLQHTVLKWCEDTKENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGS 342

Query: 731 SSVSAARPKERDEDDIFLDQTN---GRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDEND 901
           +S S  R K R E D F+D+TN        ++QENEK S G  L    +KLDLNAHDEND
Sbjct: 343 NSYSEGRFKGRAEADNFVDRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDEND 402

Query: 902 TT--SRQFDLNGFSWN 943
            T   +QFDLNGFSWN
Sbjct: 403 VTLSCKQFDLNGFSWN 418


>ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prunus persica]
           gi|462398490|gb|EMJ04158.1| hypothetical protein
           PRUPE_ppa015076mg [Prunus persica]
          Length = 421

 Score =  284 bits (727), Expect = 5e-74
 Identities = 155/253 (61%), Positives = 182/253 (71%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL SAF
Sbjct: 172 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSAF 231

Query: 383 PGLKEMPGSCNQQ---TQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C QQ   TQPTDCS++SCLTSCEGS+KDQEIHN  MGLR  +N   L  + 
Sbjct: 232 TELKELQGLCPQQTQTTQPTDCSMESCLTSCEGSKKDQEIHNSAMGLRANYNGRELLDE- 290

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
             +E  + +TE  WC++L  N M LSS+  +  K MFPV+RSSSD SMSI  QGE+ + +
Sbjct: 291 --KEPMLQKTELKWCEELKENNMLLSSISNDAAKRMFPVERSSSDLSMSIGCQGERWNIN 348

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKG-PGLNYLTSKLDLNAHDENDTTS 910
             S  R K R  D  FLD+TN R    + E EK S+G   + Y  +KLDLN HD+ND  S
Sbjct: 349 GNSEERLKGRSTDVSFLDRTNNRADSAKAETEKVSRGCRSVPYFAAKLDLNTHDDNDAPS 408

Query: 911 --RQFDLNGFSWN 943
             +QFDLNGFSW+
Sbjct: 409 SCKQFDLNGFSWS 421


>ref|XP_007030697.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao]
           gi|590643063|ref|XP_007030698.1| Homeodomain-like
           superfamily protein isoform 2 [Theobroma cacao]
           gi|508719302|gb|EOY11199.1| Homeodomain-like superfamily
           protein isoform 2 [Theobroma cacao]
           gi|508719303|gb|EOY11200.1| Homeodomain-like superfamily
           protein isoform 2 [Theobroma cacao]
          Length = 414

 Score =  280 bits (716), Expect = 1e-72
 Identities = 157/256 (61%), Positives = 186/256 (72%), Gaps = 9/256 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLGSVGLE AKVQLSEL SKVS +CL SAF
Sbjct: 169 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAF 228

Query: 383 PGLKEMPGSCNQQTQ---PTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFH-NNTHLSQK 550
             LK++ G C QQTQ   PTDCS+DSCLTSCEGSQK+QEIHN GM LRP++ +   L Q+
Sbjct: 229 SDLKDLQGLCPQQTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQR 288

Query: 551 EIREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEK--- 721
           EI E+  + QTE    +D+  NKMFLSS+ ++ E+ MF   RSSSD SMS+ +QGEK   
Sbjct: 289 EIAEDPLLPQTELKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNG 348

Query: 722 GSSSSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDEND 901
           G+SSS S A+ K R+EDD FLD+ N R   + +          L Y  +KLDLN H+END
Sbjct: 349 GNSSSFSEAKFKGRNEDDSFLDRGNKRADEVNR----------LPYFATKLDLNVHEEND 398

Query: 902 TTS--RQFDLNGFSWN 943
             S  +QFDLNG SWN
Sbjct: 399 AASSCKQFDLNGLSWN 414


>ref|XP_007030696.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
           gi|508719301|gb|EOY11198.1| Homeodomain-like superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 478

 Score =  280 bits (716), Expect = 1e-72
 Identities = 157/256 (61%), Positives = 186/256 (72%), Gaps = 9/256 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLGSVGLE AKVQLSEL SKVS +CL SAF
Sbjct: 233 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAF 292

Query: 383 PGLKEMPGSCNQQTQ---PTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFH-NNTHLSQK 550
             LK++ G C QQTQ   PTDCS+DSCLTSCEGSQK+QEIHN GM LRP++ +   L Q+
Sbjct: 293 SDLKDLQGLCPQQTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQR 352

Query: 551 EIREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEK--- 721
           EI E+  + QTE    +D+  NKMFLSS+ ++ E+ MF   RSSSD SMS+ +QGEK   
Sbjct: 353 EIAEDPLLPQTELKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNG 412

Query: 722 GSSSSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDEND 901
           G+SSS S A+ K R+EDD FLD+ N R   + +          L Y  +KLDLN H+END
Sbjct: 413 GNSSSFSEAKFKGRNEDDSFLDRGNKRADEVNR----------LPYFATKLDLNVHEEND 462

Query: 902 TTS--RQFDLNGFSWN 943
             S  +QFDLNG SWN
Sbjct: 463 AASSCKQFDLNGLSWN 478


>ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248614 isoform 2 [Vitis
           vinifera]
          Length = 412

 Score =  278 bits (710), Expect = 5e-72
 Identities = 153/256 (59%), Positives = 184/256 (71%), Gaps = 9/256 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           +   L+LRIEAQGKYLQ+VLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL SAF
Sbjct: 157 LHEQLELRIEAQGKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAF 216

Query: 383 PGLKEMPGSCNQ--QTQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHN-NTHLSQKE 553
             LKE+   C Q  QTQPTDCS+DSCLTSCEGSQ++QEIHN GMGLRP+ N +T L  K+
Sbjct: 217 SELKELQSLCPQQTQTQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKD 276

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGS-S 730
             E   +  T   WC+D   N+ F+SS++R+ E+     +RS+SD SM I +QGEKG+ S
Sbjct: 277 TAEPPGLQHTVLKWCEDTKENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGS 336

Query: 731 SSVSAARPKERDEDDIFLDQTN---GRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDEND 901
           +S S  R K R E D F+D+TN        ++QENEK S G  L    +KLDLNAHDEND
Sbjct: 337 NSYSEGRFKGRAEADNFVDRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDEND 396

Query: 902 TT--SRQFDLNGFSWN 943
            T   +QFDLNGFSWN
Sbjct: 397 VTLSCKQFDLNGFSWN 412


>ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
           gi|568850794|ref|XP_006479082.1| PREDICTED:
           uncharacterized protein LOC102612777 isoform X1 [Citrus
           sinensis] gi|568850796|ref|XP_006479083.1| PREDICTED:
           uncharacterized protein LOC102612777 isoform X2 [Citrus
           sinensis] gi|557545642|gb|ESR56620.1| hypothetical
           protein CICLE_v10020171mg [Citrus clementina]
          Length = 401

 Score =  246 bits (629), Expect = 1e-62
 Identities = 146/253 (57%), Positives = 167/253 (66%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG+ GLE AKVQLSEL SKVST+CL S F
Sbjct: 168 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTF 227

Query: 383 PGLKEMPGSCNQQ---TQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C QQ    QPTDCS+DSCLTSCEGSQKDQEIHN G+ LRP+H    L  KE
Sbjct: 228 SDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKE 287

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I EE  + QTE  W KDL  +K FLSS+ ++         R   + S+          S 
Sbjct: 288 IVEEPMLQQTELKWRKDLKESK-FLSSIGKD---------RGPGELSI---------GSG 328

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHD-ENDTTS 910
           S  A R K  +ED+ F DQTN +    + ENE       L   ++KLDLNAHD END  S
Sbjct: 329 SFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVAS 388

Query: 911 --RQFDLNGFSWN 943
             +QFDLNGFSWN
Sbjct: 389 GCKQFDLNGFSWN 401


>ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
           gi|557545641|gb|ESR56619.1| hypothetical protein
           CICLE_v10020171mg [Citrus clementina]
          Length = 441

 Score =  246 bits (629), Expect = 1e-62
 Identities = 146/253 (57%), Positives = 167/253 (66%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG+ GLE AKVQLSEL SKVST+CL S F
Sbjct: 208 VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTF 267

Query: 383 PGLKEMPGSCNQQ---TQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C QQ    QPTDCS+DSCLTSCEGSQKDQEIHN G+ LRP+H    L  KE
Sbjct: 268 SDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKE 327

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I EE  + QTE  W KDL  +K FLSS+ ++         R   + S+          S 
Sbjct: 328 IVEEPMLQQTELKWRKDLKESK-FLSSIGKD---------RGPGELSI---------GSG 368

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHD-ENDTTS 910
           S  A R K  +ED+ F DQTN +    + ENE       L   ++KLDLNAHD END  S
Sbjct: 369 SFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVAS 428

Query: 911 --RQFDLNGFSWN 943
             +QFDLNGFSWN
Sbjct: 429 GCKQFDLNGFSWN 441


>ref|XP_006443378.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
           gi|557545640|gb|ESR56618.1| hypothetical protein
           CICLE_v10020171mg [Citrus clementina]
          Length = 294

 Score =  246 bits (629), Expect = 1e-62
 Identities = 146/253 (57%), Positives = 167/253 (66%), Gaps = 6/253 (2%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG+ GLE AKVQLSEL SKVST+CL S F
Sbjct: 61  VQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTF 120

Query: 383 PGLKEMPGSCNQQ---TQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C QQ    QPTDCS+DSCLTSCEGSQKDQEIHN G+ LRP+H    L  KE
Sbjct: 121 SDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKE 180

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I EE  + QTE  W KDL  +K FLSS+ ++         R   + S+          S 
Sbjct: 181 IVEEPMLQQTELKWRKDLKESK-FLSSIGKD---------RGPGELSI---------GSG 221

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHD-ENDTTS 910
           S  A R K  +ED+ F DQTN +    + ENE       L   ++KLDLNAHD END  S
Sbjct: 222 SFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVAS 281

Query: 911 --RQFDLNGFSWN 943
             +QFDLNGFSWN
Sbjct: 282 GCKQFDLNGFSWN 294


>ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304811 [Fragaria vesca
           subsp. vesca]
          Length = 418

 Score =  226 bits (576), Expect = 2e-56
 Identities = 134/252 (53%), Positives = 168/252 (66%), Gaps = 5/252 (1%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL SAF
Sbjct: 180 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSAF 239

Query: 383 PGLKEMPGSCNQQTQPTDCSIDSCLTSCEGSQKDQEI-HNIGMGLRPFHNNTHLSQKEIR 559
             +KE+ GSC  Q  PTDCS++SCLTS EGS+KDQEI +N  MGLR ++++  L + E  
Sbjct: 240 TEMKEVQGSC-PQNPPTDCSMESCLTSSEGSKKDQEIQNNSRMGLRAYNSSRVLLESE-- 296

Query: 560 EESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGE-KGSSSS 736
                          L  N MF+S++ +  ++ MFP +  S DFSMSI ++ E    S  
Sbjct: 297 ----------KTMLHLKENSMFVSTLTKNADQRMFPSEPRSGDFSMSIGLEREILNGSHC 346

Query: 737 VSAARPKERDEDDIFLDQTNGRRAVIQQENEKK-SKGPGLNYLTSKLDLNAHDENDTTS- 910
            S  R K R+  D FLD  N R   ++ +  +K S+G    Y  +KLDLN+HD+ D +S 
Sbjct: 347 NSEERFKARNTIDSFLDNKNNRADSVKVDQSRKVSQGYSGPYFAAKLDLNSHDDTDASSS 406

Query: 911 -RQFDLNGFSWN 943
            +QFDLN FSW+
Sbjct: 407 CKQFDLNDFSWS 418


>ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592272 isoform X1 [Solanum
           tuberosum] gi|565343634|ref|XP_006338934.1| PREDICTED:
           uncharacterized protein LOC102592272 isoform X2 [Solanum
           tuberosum]
          Length = 416

 Score =  214 bits (544), Expect = 9e-53
 Identities = 124/251 (49%), Positives = 155/251 (61%), Gaps = 5/251 (1%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQN+ +VGLE  KVQLSE  SK S +CL S F
Sbjct: 166 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPF 225

Query: 383 PGLKEMPG---SCNQQTQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
           P +KE+ G      Q TQPTD SIDSCLTS +GS +D  +H+  +GLRPF     +  K+
Sbjct: 226 PDIKELSGFHSQHTQATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFDFTPSIECKD 285

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I  ++R+ QTE  WC +L  N+   S +    EK  F  + + ++ SMSI +Q EK + S
Sbjct: 286 IENDARLQQTELRWCDNLKENRRLFSPMNEGREK-TFTRETNCNNLSMSIGLQDEKLNGS 344

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDENDTTS- 910
              +       E D+ L      R+    +  K S+   L+Y   KLDLN HDE D  S 
Sbjct: 345 MNHSDGSFNGTERDVKLFHQVTNRSESVPQRHKSSQEYKLSYFQPKLDLNMHDETDAASS 404

Query: 911 -RQFDLNGFSW 940
            +QFDLNGFSW
Sbjct: 405 CKQFDLNGFSW 415


>ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256236 [Solanum
           lycopersicum]
          Length = 414

 Score =  212 bits (540), Expect = 3e-52
 Identities = 124/252 (49%), Positives = 157/252 (62%), Gaps = 5/252 (1%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQN+ +VGLE  KVQLSE  SK S +CL S F
Sbjct: 164 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPF 223

Query: 383 PGLKEMPGSCNQQ---TQPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             +KE+ G  +QQ   TQPTD SIDSCLTS +GS +D  +H+  +GLRPF     +  K+
Sbjct: 224 TDIKELSGFHSQQTQATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFGFTPSIECKD 283

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
           I  ++R+ QTE  WC +L  N+   S +    EK  F  + + ++ SMSI +Q EK + S
Sbjct: 284 IENDTRLQQTELRWCDNLKENRRLFSPMNEGREK-TFTRETNCNNLSMSIGLQDEKLNGS 342

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKLDLNAHDENDTTS- 910
              +       E D+ L      R+    +  K S+   L+Y   KLDLN HDE D  S 
Sbjct: 343 MNHSDGNFNGTERDVKLFHQVTNRSESVPQRHKSSQEYKLSYFEPKLDLNMHDETDAASS 402

Query: 911 -RQFDLNGFSWN 943
            +QFDLNGFSW+
Sbjct: 403 CKQFDLNGFSWS 414


>ref|XP_006829830.1| hypothetical protein AMTR_s00119p00095480 [Amborella trichopoda]
           gi|548835411|gb|ERM97246.1| hypothetical protein
           AMTR_s00119p00095480 [Amborella trichopoda]
          Length = 412

 Score =  207 bits (526), Expect = 1e-50
 Identities = 128/255 (50%), Positives = 160/255 (62%), Gaps = 8/255 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETL +QN GS GLE  + Q+SEL S+VS ECL SAF
Sbjct: 163 VQRHLQLRIEAQGKYLQSVLEKAQETLAKQNPGSSGLEATRAQISELVSQVSAECLNSAF 222

Query: 383 PGLKEMPGSCNQQTQPT---DCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
            GL E P   NQQ Q +   DCS+DSCLTSCEG QKDQE  NI +GL  +H+N+ L QK 
Sbjct: 223 SGLTEAPSLNNQQAQKSHLADCSMDSCLTSCEGPQKDQETQNISIGL-GYHSNSLLWQKA 281

Query: 554 IREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSSS 733
            REE RV +   +  + L   K +  S ER+    M  +  S  D   ++R  G+KG+ S
Sbjct: 282 EREEFRVQRPNHSTGESLKDTKHYSPSPERK----MHLLTSSFGDTINNVRAPGDKGACS 337

Query: 734 SVSAARPKERDEDDIFLDQTNGRRAVIQQEN---EKKSKGPGLNYLTSKLDLNAHDENDT 904
           + S AR KER  +    +    R    + +    E+      L+  T+ LDLNAHDEND 
Sbjct: 338 TNSDARRKERGFEGACGEPPRKRSVASRTQALDIEQTEVFDRLSNHTAVLDLNAHDENDA 397

Query: 905 TS--RQFDLNGFSWN 943
           +S  ++FDLNGFSW+
Sbjct: 398 SSECKEFDLNGFSWS 412


>ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592272 isoform X3 [Solanum
           tuberosum]
          Length = 410

 Score =  205 bits (522), Expect = 3e-50
 Identities = 122/264 (46%), Positives = 156/264 (59%), Gaps = 5/264 (1%)
 Frame = +2

Query: 164 SELVVMLTYETN*VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSEL 343
           SE + M       +   L+LRIEAQGKYLQSVLEKAQETLGRQN+ +VGLE  KVQLSE 
Sbjct: 147 SEAIQMQIEVQRRLHEQLELRIEAQGKYLQSVLEKAQETLGRQNMETVGLEAVKVQLSEF 206

Query: 344 ASKVSTECLRSAFPGLKEMPG---SCNQQTQPTDCSIDSCLTSCEGSQKDQEIHNIGMGL 514
            SK S +CL S FP +KE+ G      Q TQPTD SIDSCLTS +GS +D  +H+  +GL
Sbjct: 207 VSKASNQCLNSPFPDIKELSGFHSQHTQATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGL 266

Query: 515 RPFHNNTHLSQKEIREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFS 694
           RPF     +  K+I  ++R+ QTE  WC +L  N+   S +    EK  F  + + ++ S
Sbjct: 267 RPFDFTPSIECKDIENDARLQQTELRWCDNLKENRRLFSPMNEGREK-TFTRETNCNNLS 325

Query: 695 MSIRVQGEKGSSSSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGLNYLTSKL 874
           MSI +Q EK + S   +       E D+ L      R+    +  K S+   L+Y   KL
Sbjct: 326 MSIGLQDEKLNGSMNHSDGSFNGTERDVKLFHQVTNRSESVPQRHKSSQEYKLSYFQPKL 385

Query: 875 DLNAHDENDTTS--RQFDLNGFSW 940
           DLN HDE D  S  +QFDLNGFSW
Sbjct: 386 DLNMHDETDAASSCKQFDLNGFSW 409


>ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phaseolus vulgaris]
           gi|561023334|gb|ESW22064.1| hypothetical protein
           PHAVU_005G123900g [Phaseolus vulgaris]
          Length = 430

 Score =  199 bits (507), Expect = 2e-48
 Identities = 118/257 (45%), Positives = 162/257 (63%), Gaps = 10/257 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQ+HLQLRIEAQGKYLQSVLEKAQ+TLGRQNLG +GLE AKVQLSEL SKVS++CL SAF
Sbjct: 178 VQKHLQLRIEAQGKYLQSVLEKAQDTLGRQNLGIIGLETAKVQLSELVSKVSSQCLNSAF 237

Query: 383 PGLKEMPGSCNQQT---QPTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
             LKE+ G C QQT   QP DCS+DSCLTSC+  QK+Q+I N    LR F+++  + QKE
Sbjct: 238 SELKELQGFCPQQTHTNQPNDCSMDSCLTSCDILQKEQKIQN---SLRQFNSHVFMEQKE 294

Query: 554 IRE-ESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGE-KGS 727
             +  + +  +E  WC D   N  FL+ + +  E+  +  +    + SMSI ++ E +  
Sbjct: 295 STDARNNLRNSELKWCDDGKKN-TFLAPLSKTEERRKYAAETGPGNLSMSIGLERETENR 353

Query: 728 SSSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKG---PGLNYLTSKLDLNAHDEN 898
           SS    +  KE   +  F  +   +   ++  +EK  +    P   ++ ++LDLN H +N
Sbjct: 354 SSMYPESLIKESQSEGEFQHRNRIKTETMKAVDEKVCQDYRMPASYFVATRLDLNNHGDN 413

Query: 899 D--TTSRQFDLNGFSWN 943
           +  TT +Q DLN FSW+
Sbjct: 414 EAATTCKQLDLNRFSWS 430


>ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805237 isoformX1 [Glycine
           max] gi|571492729|ref|XP_006592327.1| PREDICTED:
           uncharacterized protein LOC100805237 isoform X2 [Glycine
           max]
          Length = 405

 Score =  197 bits (501), Expect = 8e-48
 Identities = 129/262 (49%), Positives = 162/262 (61%), Gaps = 15/262 (5%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQR LQLRIEAQGKYLQ+VLEKAQETLGRQNLG+VGLE  K+QLSEL SKVS++CL SAF
Sbjct: 171 VQRLLQLRIEAQGKYLQAVLEKAQETLGRQNLGAVGLEATKLQLSELVSKVSSQCLNSAF 230

Query: 383 PG-LKEMPG-SCNQQTQ-----PTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHL 541
              LKE+ G S +QQTQ       DCS+DSCLTSCEGSQK+QEI N GM LRPF+ +T +
Sbjct: 231 SDRLKEIQGFSPHQQTQTNQPNTNDCSMDSCLTSCEGSQKEQEIQNGGMSLRPFNVHTFM 290

Query: 542 SQKEIREE---SRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQ 712
            +KE+ E    + +  T+  WC  +  N  FL+ +    +K      RS S+ SMSI ++
Sbjct: 291 ERKEVIEGPNLNNLPNTDLNWCDPVKKN-TFLTPLSMHADK------RSPSNLSMSIGLE 343

Query: 713 GEKGSSSSVSAARPKERDEDDIFLDQTNGRRAVIQQENEKKSKGPGL--NYL-TSKLDLN 883
           GE  + S++                    R   ++   +K S+  GL  NY   SKLDL 
Sbjct: 344 GETENGSTI--------------------RTESVKPVADKVSQDYGLPSNYFAASKLDLT 383

Query: 884 AHDEND--TTSRQFDLNGFSWN 943
             D  D  T+ +Q DLNGFSWN
Sbjct: 384 TEDNKDTKTSCKQLDLNGFSWN 405


>ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max]
          Length = 420

 Score =  196 bits (499), Expect = 1e-47
 Identities = 120/257 (46%), Positives = 161/257 (62%), Gaps = 10/257 (3%)
 Frame = +2

Query: 203 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
           VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLG VG+E AKVQLSEL SKVS++CL SAF
Sbjct: 168 VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGVVGIEAAKVQLSELVSKVSSQCLNSAF 227

Query: 383 PGLKEMPGSCNQQTQ---PTDCSIDSCLTSCEGSQKDQEIHNIGMGLRPFHNNTHLSQKE 553
              K++ G   QQTQ   P DCS+DSCLTS + SQK+QEI N   GLR F+++  +  KE
Sbjct: 228 TEPKDLQGFFPQQTQTNPPNDCSMDSCLTSSDRSQKEQEIQN---GLRHFNSHVFMEHKE 284

Query: 554 IRE-ESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSSSDFSMSIRVQGEKGSS 730
             E  + +   E  WC+D   N  FL+ + +  E+  +  + S ++ SMSI ++ E  + 
Sbjct: 285 ATEAPNNLRNPELKWCEDGKKN-TFLAPLSKNEERRNYAAESSPNNLSMSIGLERETENG 343

Query: 731 SSVSAAR-PKERDEDDIFLDQTNGRRAVIQQENEKKSKG---PGLNYLTSKLDLNAHDEN 898
            ++   R   E   D  F  +   +   ++  +EK S+    P   +  ++LDLN H +N
Sbjct: 344 INLYPERLITESQSDGEFQHRNRIKPETLKPVDEKVSQDYRLPASYFAAARLDLNTHGDN 403

Query: 899 D--TTSRQFDLNGFSWN 943
           +  TT +Q DLN FSW+
Sbjct: 404 EAATTCKQLDLNRFSWS 420


>gb|EXC02650.1| Myb family transcription factor APL [Morus notabilis]
          Length = 444

 Score =  196 bits (498), Expect = 2e-47
 Identities = 139/297 (46%), Positives = 166/297 (55%), Gaps = 50/297 (16%)
 Frame = +2

Query: 203  VQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGSVGLEDAKVQLSELASKVSTECLRSAF 382
            VQRHLQLR+EAQGKYLQ+VLEKAQETLGRQNLG+VGLE AKVQLSEL SKVST+CL SAF
Sbjct: 161  VQRHLQLRMEAQGKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLNSAF 220

Query: 383  PGLKEM-------PGSCNQ--------QTQP-TDCSIDSCLTSCEGSQKDQEI-HNIGMG 511
              LKE+       P S NQ        Q QP  DCS+DSCLTSCEGSQKDQEI HN   G
Sbjct: 221  AELKEVQGGLCRQPNSSNQTQGTVLIRQQQPNNDCSMDSCLTSCEGSQKDQEIAHNTSSG 280

Query: 512  ---LRPFHNNTHLSQKEIREESRVGQTEPTWCKDLNANKMFLSSVERETEKMMFPVQRSS 682
               LRP++N    S             EP WC D   + MFLS      E  MFPV+RSS
Sbjct: 281  IVQLRPYNNAATTSTSN-------AFLEPKWCNDAKDSVMFLS------ETRMFPVERSS 327

Query: 683  SDFSMSIRV---QGEKGSSSSVSAARPKER------DEDDIFLDQ----TNGRRAVIQQE 823
            +   +SI +    GEKG  +S  +   + R      D D+ F+D+     + R A    +
Sbjct: 328  NTGGLSIGIGLGGGEKGGDTSTYSQGVRFRGTGSSWDVDEDFVDRNTTTASARSAGFHDK 387

Query: 824  NEKKSKGPGLNYLTSK--LDLNAHD------------ENDTTS---RQFDLNGFSWN 943
                      NY  +   LDLN+HD            EN ++S   +Q DLNGF W+
Sbjct: 388  ASDHQACRVANYFATNKVLDLNSHDHDHHDHDHENQNENHSSSTCYKQIDLNGFGWS 444


Top