BLASTX nr result
ID: Atropa21_contig00020570
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00020570 (1166 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_006338056.1| PREDICTED: uncharacterized protein LOC102605...  328   3e-87
ref|XP_006338055.1| PREDICTED: uncharacterized protein LOC102605...  322   2e-85
ref|XP_004237998.1| PREDICTED: uncharacterized protein LOC101255...  321   3e-85
ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249...  181   5e-43
ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582...  180   1e-42
gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis]    107   9e-21
ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245...   99   3e-18
emb|CBI19274.3| unnamed protein product [Vitis vinifera]              98   7e-18
emb|CAN60243.1| hypothetical protein VITISV_010188 [Vitis vinifera]   98   7e-18
ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB1...   80   2e-12
ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citr...   79   5e-12
ref|XP_002302346.1| myb family transcription factor family prote...   77   2e-11
gb|EMJ23216.1| hypothetical protein PRUPE_ppa002943mg [Prunus pe...   74   9e-11
ref|XP_002514048.1| DNA binding protein, putative [Ricinus commu...   72   4e-10
gb|EMJ25790.1| hypothetical protein PRUPE_ppa1027142mg [Prunus p...   70   2e-09
ref|XP_004163958.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   69   3e-09
ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206...   69   3e-09
gb|EMJ25789.1| hypothetical protein PRUPE_ppa1027142mg [Prunus p...   63   2e-07
gb|EOY15458.1| Homeodomain-like superfamily protein isoform 2 [T...
60 2e-06 gb|EOY15457.1| Homeodomain-like superfamily protein isoform 1 [T... 60 2e-06 >ref|XP_006338056.1| PREDICTED: uncharacterized protein LOC102605794 isoform X2 [Solanum tuberosum] Length = 544 Score = 328 bits (841), Expect = 3e-87 Identities = 195/318 (61%), Positives = 211/318 (66%), Gaps = 11/318 (3%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMVGNNLQLSEAQLAARHAMSLAFGDNLRAARPISTN 182 DFKGDRTASQLSQRWA IRKQH MVGN LSEAQLA RHA+S+AFGDN+RAA PIS N Sbjct: 233 DFKGDRTASQLSQRWATIRKQHVMMVGNGSHLSEAQLATRHAVSMAFGDNVRAACPISPN 292 Query: 183 AXXXXXXXXXXXXRF-AAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDPFXXX 359 A F AAA++AS GPQSKHQQDLV +NPDP Sbjct: 293 AGPNSGSGPSNSSHFAAAANVASAGPQSKHQQDLV-PSKPIIPKIPLPKPAINPDPMVKA 351 Query: 360 XXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHFIRTG 539 SS +AT S VHIMPGGTPA+KSSVPG NGLPSNVHFIRTG Sbjct: 352 AAMAASSRVATHSGAAASLQKAAQSKKGVHIMPGGTPAVKSSVPGSFNGLPSNVHFIRTG 411 Query: 540 LVSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRTNASSGIPSAPT------ 701 LVS P PSN SQ GTQ LQ + RS SP VQP P +PSRTNASSG+ SAP+ Sbjct: 412 LVSCPADPSNTSQSGTQQLQ--APRSVSPAVQPKPT-TVPSRTNASSGVRSAPSSYPTTV 468 Query: 702 ----SKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKDRTFGSGNLLREKVGG 869 SKAA QENQ V SN RSEK +VIRAA+LANTPQQQV KD+TF GNLL KV G Sbjct: 469 LEVKSKAAVSQENQIAVLSNTRSEKTQVIRAASLANTPQQQVPKDQTF--GNLLSGKVDG 526 Query: 870 QTSVLGDTVKELGGESKA 923 QTSVLGDTVK+LGGESKA Sbjct: 527 QTSVLGDTVKKLGGESKA 544 >ref|XP_006338055.1| PREDICTED: uncharacterized protein LOC102605794 isoform X1 [Solanum tuberosum] Length = 550 Score = 322 bits (824), Expect = 2e-85 Identities = 195/324 (60%), Positives = 211/324 (65%), Gaps = 17/324 (5%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMVGNNLQLSEAQLAARHAMSLAFGDNLRAARPISTN 182 DFKGDRTASQLSQRWA IRKQH MVGN LSEAQLA RHA+S+AFGDN+RAA PIS N Sbjct: 233 DFKGDRTASQLSQRWATIRKQHVMMVGNGSHLSEAQLATRHAVSMAFGDNVRAACPISPN 292 Query: 183 ------AXXXXXXXXXXXXRF-AAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNP 341 A F AAA++AS GPQSKHQQDLV +NP Sbjct: 293 
GCGIVSAGPNSGSGPSNSSHFAAAANVASAGPQSKHQQDLV-PSKPIIPKIPLPKPAINP 351 Query: 342 DPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNV 521 DP SS +AT S VHIMPGGTPA+KSSVPG NGLPSNV Sbjct: 352 DPMVKAAAMAASSRVATHSGAAASLQKAAQSKKGVHIMPGGTPAVKSSVPGSFNGLPSNV 411 Query: 522 HFIRTGLVSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRTNASSGIPSAPT 701 HFIRTGLVS P PSN SQ GTQ LQ + RS SP VQP P +PSRTNASSG+ SAP+ Sbjct: 412 HFIRTGLVSCPADPSNTSQSGTQQLQ--APRSVSPAVQPKPT-TVPSRTNASSGVRSAPS 468 Query: 702 ----------SKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKDRTFGSGNLL 851 SKAA QENQ V SN RSEK +VIRAA+LANTPQQQV KD+TF GNLL Sbjct: 469 SYPTTVLEVKSKAAVSQENQIAVLSNTRSEKTQVIRAASLANTPQQQVPKDQTF--GNLL 526 Query: 852 REKVGGQTSVLGDTVKELGGESKA 923 KV GQTSVLGDTVK+LGGESKA Sbjct: 527 SGKVDGQTSVLGDTVKKLGGESKA 550 >ref|XP_004237998.1| PREDICTED: uncharacterized protein LOC101255687 [Solanum lycopersicum] Length = 571 Score = 321 bits (823), Expect = 3e-85 Identities = 194/334 (58%), Positives = 212/334 (63%), Gaps = 16/334 (4%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMVGNNLQLSEAQLAARHAMSLAFGDNLRAARPISTN 182 DFKGDRTASQLSQRWA IRKQH MVGN LSEAQLAARHA+S+AF DN+RAA PIS N Sbjct: 233 DFKGDRTASQLSQRWATIRKQHVMMVGNGSHLSEAQLAARHAVSMAFRDNVRAACPISPN 292 Query: 183 AXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDPFXXXX 362 A FAAAD+AS GPQ KHQQDLV +NPD Sbjct: 293 AGTNSGSGPSNSSHFAAADVASAGPQPKHQQDLV-PSKPIIPKIPLPKPAINPDLMVKTA 351 Query: 363 XXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHFIRTGL 542 SS +AT S VHIMPGGTPA+KSSVPG NGLPSNVHF+RTGL Sbjct: 352 AMAASSRVATHSGTAASLQKAALSKKGVHIMPGGTPAVKSSVPGSFNGLPSNVHFMRTGL 411 Query: 543 VSHPPGPSNASQPGTQHL------QGHSLRSASPPVQPNPVLALPSRTNASSGIPSAPT- 701 VS P GPSNA Q GTQ L Q + RS SP VQP P +PSRTNASSG+ SAP+ Sbjct: 412 VSRPAGPSNAPQSGTQQLHAPRTQQLQAPRSVSPAVQPKPT-TVPSRTNASSGVRSAPSS 470 Query: 702 ---------SKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKDRTFGSGNLLR 854 SKAA QENQ V SN R EK +VI+AA+LANTPQQQV KD+ F G+LL Sbjct: 471 YPTTVLDVKSKAAVSQENQIAVLSNTRGEKTQVIQAASLANTPQQQVPKDQNF--GDLLS 528 Query: 855 
EKVGGQTSVLGDTVKELGGESKASRISGS*KANP 956 KV GQTSVL DTVK+LGGESKASRI K P Sbjct: 529 GKVEGQTSVLCDTVKKLGGESKASRIWVQEKLTP 562 >ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249442 [Solanum lycopersicum] Length = 569 Score = 181 bits (459), Expect = 5e-43 Identities = 127/297 (42%), Positives = 154/297 (51%), Gaps = 24/297 (8%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMVGNNLQLSEAQLAARHAMSLAFGDNLRAARPISTN 182 DFKGDRTASQLSQRWAIIRK+ G MVGN QLSEAQLAARHAMS A PI + Sbjct: 233 DFKGDRTASQLSQRWAIIRKRQGTMVGNGSQLSEAQLAARHAMSHALN------MPIGAS 286 Query: 183 AXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDPFXXXX 362 AD+ASGG QS+HQQD + + D Sbjct: 287 VGPNSGGGSSNSSLPVTADLASGGAQSQHQQDPLSSKPRIVPQKPAPKPTTSSDSMVKVT 346 Query: 363 XXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHFIRTGL 542 + IAT S + +PGG A+KSSV G TNGLPSNVHFIRTGL Sbjct: 347 AVAAGARIATSSNSASQVKLAQPKTPLQ--IPGGGSAVKSSVLGSTNGLPSNVHFIRTGL 404 Query: 543 VSH---PP------GPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRTNASSGIPSA 695 VSH PP GPS+AS+PGTQ HSL+ ASP VQP P+ S+ NA + +P+A Sbjct: 405 VSHSAGPPKAVHSAGPSHASRPGTQQGLSHSLKPASPTVQPKPI-GNSSKPNALA-VPTA 462 Query: 696 PTSKAAA---------IQENQT------VVRSNLRSEKAEVIRAATLANTPQQQVQK 821 PTS A +Q++QT +++ + E + R AN P QVQ+ Sbjct: 463 PTSTPVAELKVNTNQEVQQDQTPPSVNSLIKVSESKEHKKEDRDPVHANAPGVQVQE 519 >ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582625 [Solanum tuberosum] Length = 574 Score = 180 bits (456), Expect = 1e-42 Identities = 127/302 (42%), Positives = 151/302 (50%), Gaps = 29/302 (9%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMVGNNLQLSEAQLAARHAMSLAFGDNLRAARPISTN 182 DFKGDRTASQLSQRWAIIRK+ G MVGN QLSEAQLAARHAMS A PI Sbjct: 233 DFKGDRTASQLSQRWAIIRKRQGTMVGNGSQLSEAQLAARHAMSHALN------MPIGAG 286 Query: 183 AXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDPFXXXX 362 AD+ASGG QS+HQQD + +PD Sbjct: 287 VGPNSGSGPSNSSHPVTADLASGGAQSQHQQDPLSSKPRIVPQKPAPKPTTSPDSMIKVA 346 Query: 363 XXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHFIRTGL 542 + IAT S + +PGG PA+KSSV G 
TNGLPSNVHFIRTGL Sbjct: 347 AVAAGARIATSSNSASQVKLAQPKTPLQ--IPGGGPAVKSSVLGSTNGLPSNVHFIRTGL 404 Query: 543 VSHPPG---------PSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRTNASSGIPSA 695 VSH G PSNAS+PGT + HSL+ ASP VQP P+ S+ NA + ++ Sbjct: 405 VSHSAGPPKVVHSAVPSNASRPGTPQVLSHSLKPASPTVQPKPI-GNSSKPNALAE-RNS 462 Query: 696 PTSKAAA-------------IQENQTVVRSNLRSEKAEVI-------RAATLANTPQQQV 815 PTS A +Q++QT N +KA + R AN+P QV Sbjct: 463 PTSTPVAELKVNTNQEVLQKVQQDQTPPSVNPLIKKASELKEHKKEDRDPVHANSPGVQV 522 Query: 816 QK 821 Q+ Sbjct: 523 QE 524 >gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis] Length = 854 Score = 107 bits (267), Expect = 9e-21 Identities = 107/342 (31%), Positives = 142/342 (41%), Gaps = 33/342 (9%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM----VGNNLQLSEAQLAARHAMSLAFGDNLR--AA 164 DFKGDRTASQLSQRWAIIRK+HGN+ N QLSEAQLAARHAMSLA ++ A Sbjct: 230 DFKGDRTASQLSQRWAIIRKRHGNLNLGSSSNGTQLSEAQLAARHAMSLALNMPVKNLTA 289 Query: 165 RPIS--TNAXXXXXXXXXXXXRFAAADIASGGP-----QSKHQQDLVXXXXXXXXXXXXX 323 IS + A + A+GG Q++ Q++L Sbjct: 290 NTISHAGTTALNNSMGTNSTNKSAGTNAAAGGNSSLQLQNQSQENLASKESPVGSLGPIT 349 Query: 324 XXXV-----------NPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTP 470 + + D + IA+PS +HI P G+ Sbjct: 350 KARIPMKKPLVKSTPSSDAMVRATAVAAGARIASPS-DAASLLKAAQAKNAIHIRPTGSG 408 Query: 471 AIKSSVPGGTNGLPS------NVHFIRTGLVSHPPGPSNASQPGTQHLQGHSLRSASPPV 632 +IKSS+PG GLP+ NVH+IRTGL S P A+ P S++S S PV Sbjct: 409 SIKSSMPG---GLPAPSEAHPNVHYIRTGLASAPVSNYAAATPSVP--CPASVKSISSPV 463 Query: 633 QPNPV---LALPSRTNASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTP 803 Q P +L + + + P + QE +TV E I+ + Sbjct: 464 QQTPTSNGTSLDVSSKQKNYVSCTPAHELPLKQEAKTV----------EEIKVPASGSAA 513 Query: 804 QQQVQKDRTFGSGNLLREKVGGQTSVLGDTVKELGGESKASR 929 +QQ+Q D S N V D EL G S + Sbjct: 514 KQQIQGDGACVSANSQDGLVQDNKVAAPDPDAELKGTSDVGK 555 >ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245507 [Vitis vinifera] Length = 606 Score = 99.0 bits (245), Expect = 3e-18 Identities = 100/323 (30%), Positives = 136/323 (42%), Gaps = 12/323 (3%) 
Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM-VG----NNLQLSEAQLAARHAMSLAFGDNLRAAR 167 DFKGDR+ASQLSQRW IIRK+H N+ VG N QLSEAQLAARHAMSLA + Sbjct: 226 DFKGDRSASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALD---MPVK 282 Query: 168 PISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDP 347 ++T + + S G K + + Sbjct: 283 NLTTTNISQAQQLSQQGPVSTLSQMGSLGSAPKSR---------ATSKKTSAKSTFSSQS 333 Query: 348 FXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHF 527 + IATPS VHIMPGG+ IKSSV GG N LP+N Sbjct: 334 MLKATAVAAGARIATPSA-AASLLKDAQSRNAVHIMPGGSTLIKSSVAGGANPLPAN--- 389 Query: 528 IRTGLVSHPPGPSNASQPGTQHLQGHSL-------RSASPPVQPNPVLALPSRTNASSGI 686 L +HP + P T L +S ++ P P LA PS + S I Sbjct: 390 ---HLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTGSAKPAAPGGQLA-PSPSATSVNI 445 Query: 687 PSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKDRTFGSGNLLREKVG 866 S T+ AA + ++ +E + N P+ +V +D+ S N E+V Sbjct: 446 SSEQTN--AATTSLAVEYPAKQETKTSEETKVPISGNVPKAKVLEDQACVSSNTASEQVQ 503 Query: 867 GQTSVLGDTVKELGGESKASRIS 935 + L +T E+ E+K + +S Sbjct: 504 EDQATLSNT--EVVLENKKAMVS 524 >emb|CBI19274.3| unnamed protein product [Vitis vinifera] Length = 641 Score = 97.8 bits (242), Expect = 7e-18 Identities = 106/340 (31%), Positives = 138/340 (40%), Gaps = 29/340 (8%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM-VG----NNLQLSEAQLAARHAMSLAFG---DNLR 158 DFKGDR+ASQLSQRW IIRK+H N+ VG N QLSEAQLAARHAMSLA NL Sbjct: 226 DFKGDRSASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALDMPVKNLT 285 Query: 159 AARPI---STNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXX 329 + I + NA A Q QQ V Sbjct: 286 TSSSIAGTNPNATSSNSAFPATPAEALPASTNISQAQQLSQQGPVSTLSQMGSLGSAPKS 345 Query: 330 XV-----------NPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAI 476 + + IATPS VHIMPGG+ I Sbjct: 346 RATSKKTSAKSTFSSQSMLKATAVAAGARIATPSA-AASLLKDAQSRNAVHIMPGGSTLI 404 Query: 477 KSSVPGGTNGLPSNVHFIRTGLVSHPPGPSNASQPGTQHLQGHSL-------RSASPPVQ 635 KSSV GG N LP+N L +HP + P T L +S ++ P Sbjct: 405 KSSVAGGANPLPAN------HLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTGSAKPAA 458 Query: 636 
PNPVLALPSRTNASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQV 815 P LA PS + S I S T+ AA + ++ +E + N P+ +V Sbjct: 459 PGGQLA-PSPSATSVNISSEQTN--AATTSLAVEYPAKQETKTSEETKVPISGNVPKAKV 515 Query: 816 QKDRTFGSGNLLREKVGGQTSVLGDTVKELGGESKASRIS 935 +D+ S N E+V + L +T E+ E+K + +S Sbjct: 516 LEDQACVSSNTASEQVQEDQATLSNT--EVVLENKKAMVS 553 >emb|CAN60243.1| hypothetical protein VITISV_010188 [Vitis vinifera] Length = 598 Score = 97.8 bits (242), Expect = 7e-18 Identities = 106/340 (31%), Positives = 138/340 (40%), Gaps = 29/340 (8%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM-VG----NNLQLSEAQLAARHAMSLAFG---DNLR 158 DFKGDR+ASQLSQRW IIRK+H N+ VG N QLSEAQLAARHAMSLA NL Sbjct: 189 DFKGDRSASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALDMPVKNLT 248 Query: 159 AARPI---STNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXX 329 + I + NA A Q QQ V Sbjct: 249 TSSSIAGTNPNATSSNSAFPATPAEALPASTNISQAQQLSQQGPVSTLSQMGSLGSAPKS 308 Query: 330 XV-----------NPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAI 476 + + IATPS VHIMPGG+ I Sbjct: 309 RATSKKTSAKSTFSSQSMLKATAVAAGARIATPSA-AASLLKDAQSRNAVHIMPGGSTLI 367 Query: 477 KSSVPGGTNGLPSNVHFIRTGLVSHPPGPSNASQPGTQHLQGHSL-------RSASPPVQ 635 KSSV GG N LP+N L +HP + P T L +S ++ P Sbjct: 368 KSSVAGGANPLPAN------HLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTGSAKPAA 421 Query: 636 PNPVLALPSRTNASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQV 815 P LA PS + S I S T+ AA + ++ +E + N P+ +V Sbjct: 422 PGGQLA-PSPSATSVNISSEQTN--AATTSLAVEYPAKQETKTSEETKVPISGNVPKAKV 478 Query: 816 QKDRTFGSGNLLREKVGGQTSVLGDTVKELGGESKASRIS 935 +D+ S N E+V + L +T E+ E+K + +S Sbjct: 479 LEDQACVSSNTASEQVQEDQATLSNT--EVVLENKKAMVS 516 >ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Citrus sinensis] Length = 603 Score = 79.7 bits (195), Expect = 2e-12 Identities = 85/292 (29%), Positives = 116/292 (39%), Gaps = 18/292 (6%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMV-GNN---LQLSEAQLAARHAMSLAFG---DNLRA 161 DFK DRTASQLSQRW I+RK+HGN++ G+N QLSEAQLAARHAMSLA N+ A Sbjct: 234 
DFKWDRTASQLSQRWNILRKKHGNVILGSNSSGSQLSEAQLAARHAMSLALDMPVKNITA 293 Query: 162 ARPISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQ---QDLVXXXXXXXXXXXXXXXX 332 + +T A A+ +S QSK Sbjct: 294 SCTNTTAGTTSSATMNNPVPSTANAEASSVANQSKLSPVGSPGSAAKSRVPLKKMPAKSN 353 Query: 333 VNPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNG-- 506 D + I TPS +HIMP G +IKS G + Sbjct: 354 FGADSSIRAAAVAAGARIVTPS-DAASLLKVAQAKKAIHIMPSGVSSIKSPSAGSASAHL 412 Query: 507 -LPSNVHFIRTGL-----VSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRT 668 ++R L S P S+AS PG +++A P VQ N +T Sbjct: 413 EASPTTRYVRPSLPAVPSSSSPAVTSSASHPGL-------VKAALPKVQHNTSC---EQT 462 Query: 669 NASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKD 824 NA +P+ ++ E+ +V + N P +++Q D Sbjct: 463 NAVVSVPATELQLKPEVK----------AGEEIKVSGCSVSGNEPSKEIQLD 504 >ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citrus clementina] gi|557535939|gb|ESR47057.1| hypothetical protein CICLE_v10000622mg [Citrus clementina] Length = 612 Score = 78.6 bits (192), Expect = 5e-12 Identities = 85/292 (29%), Positives = 115/292 (39%), Gaps = 18/292 (6%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNMV-GNN---LQLSEAQLAARHAMSLAFG---DNLRA 161 DFK DRTASQLSQRW I+RK+HGN++ G+N QLSEAQLAARHAMSLA N+ A Sbjct: 234 DFKWDRTASQLSQRWNILRKKHGNVILGSNSSGSQLSEAQLAARHAMSLALDMPVKNITA 293 Query: 162 ARPISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQ---QDLVXXXXXXXXXXXXXXXX 332 + +T A A+ +S QSK Sbjct: 294 SCTNTTAGTTSSATMNNPVPSTANAEASSVANQSKLSPVGSPGSAVKSRVPLKKMPAKSN 353 Query: 333 VNPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTN--- 503 D + I TPS +HIMP G +IKS G + Sbjct: 354 FGADSSIRAAAVAAGARIVTPS-DAASLLKVAQAKKAIHIMPSGVSSIKSPSAGSASVHL 412 Query: 504 GLPSNVHFIRTGL-----VSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRT 668 ++R L S P S+AS PG +++A P VQ N +T Sbjct: 413 EASPTTRYVRPSLPVVPSSSSPAVTSSASHPGL-------VKAALPKVQHNTSC---EQT 462 Query: 669 NASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKD 824 NA +P ++ E+ +V + N P +++Q D Sbjct: 463 NAVVSVPGTELQLKPEVK----------AGEEIKVSGGSVSGNEPSKEIQLD 504 >ref|XP_002302346.1| myb family 
transcription factor family protein [Populus trichocarpa] gi|222844072|gb|EEE81619.1| myb family transcription factor family protein [Populus trichocarpa] Length = 677 Score = 76.6 bits (187), Expect = 2e-11 Identities = 83/321 (25%), Positives = 128/321 (39%), Gaps = 34/321 (10%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM----VGNNLQLSEAQLAARHAMSLAFGDNLRAARP 170 +FKGDRTASQLSQRWAIIRK+HGN+ V + QLSE Q AAR A+ +A + A Sbjct: 237 EFKGDRTASQLSQRWAIIRKRHGNLNVGTVSSAPQLSETQRAARDAVKMALDPHPAAKSL 296 Query: 171 ISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXV----- 335 I+++A AS Q+ ++ V Sbjct: 297 IASSAGTTSTKTPNNCASPTITAEASPAQHQSQQRTMMTKSSSIWPVGPAAKSQVMLAKA 356 Query: 336 ------NPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGG 497 + DP + IAT S VHIMP G+ +IKSS+ GG Sbjct: 357 SEKSILSSDP-VRAAAVAAGARIATQS-DAASLLKAAQAKNAVHIMPTGSSSIKSSMTGG 414 Query: 498 TN---GLPSNVHFIRTGLVSHP-----------PGPSNASQPGTQHLQGHSLRSASPPVQ 635 + + N FI +G+ + P PG A+ P Q + S + Q Sbjct: 415 ISTHLDVNPNTRFISSGMATAPTTTRPPASGPCPGLPKATSPPPQM---KASSSTAQHTQ 471 Query: 636 PNPVLALPSRTNASSGIPS-----APTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANT 800 PV + +++ ++ + + P KA+++ T+ S +E A + Sbjct: 472 STPVTSFNAQSEQTNSVLAKATVLPPQMKASSMTTQNTLSTPITSSTPSEQTNAESSPKQ 531 Query: 801 PQQQVQKDRTFGSGNLLREKV 863 ++ + FGS + +V Sbjct: 532 GIVTIKDTKAFGSQEVANGQV 552 >gb|EMJ23216.1| hypothetical protein PRUPE_ppa002943mg [Prunus persica] Length = 619 Score = 74.3 bits (181), Expect = 9e-11 Identities = 91/348 (26%), Positives = 127/348 (36%), Gaps = 31/348 (8%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHG---NMVGNNL-QLSEAQLAARHAMSLAFGDNLRAARP 170 DFKG+RTA+QLSQRW IRK H N+ GN+ +LSEAQLA RHAMSLA A Sbjct: 238 DFKGERTANQLSQRWKYIRKHHHQDLNVGGNSSNKLSEAQLATRHAMSLALNMPSITANT 297 Query: 171 ISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDL------------VXXXXXXXXXX 314 I T + + + + QQ L Sbjct: 298 IGTAGTNTHSKFGGTNATTNSLPSTAAEEELQSQQGLKPAKPYQMGLLGSTSKSQLTSKK 357 Query: 315 XXXXXXVNPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPG 494 N D + IA+PS VH++P G +I+SS+PG Sbjct: 
358 TLTKPNSNTDGMVRATAVAAGARIASPS-DAASLLKAAQAKNAVHVLPTGGSSIQSSLPG 416 Query: 495 GTNGLPS---NVHFIRTGLVSHPPGP--SNASQPGTQHLQGHSLRSASPPVQPNPVLALP 659 P N+H++ TGL + P S A P H P + ALP Sbjct: 417 SMRTHPEPHPNLHYMHTGLAATPVSTPLSTAVTPSATH--------------PGSLKALP 462 Query: 660 SRTNASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIR--AATLANTPQQQVQKDRT- 830 S P+ T + I++ + S L E ++ A N ++ QKD+ Sbjct: 463 ---QTSQHAPTNSTLLSKQIKDVSCSLDSELGCTPTEQVQDGAVISENGQNEEGQKDKVD 519 Query: 831 -------FGSGNLLREKVGGQTSVLGDTVKELGGESKASRISGS*KAN 953 + + E + G + GD + G S S K N Sbjct: 520 SPDQKAELKNLSTSAENLVGSLDIKGDETDNIAGIGVQSEERQSAKDN 567 >ref|XP_002514048.1| DNA binding protein, putative [Ricinus communis] gi|223547134|gb|EEF48631.1| DNA binding protein, putative [Ricinus communis] Length = 608 Score = 72.0 bits (175), Expect = 4e-10 Identities = 83/292 (28%), Positives = 125/292 (42%), Gaps = 5/292 (1%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM--VGN--NLQLSEAQLAARHAMSLAFGDNLRAARP 170 +F DRTASQLSQRWAIIRK+HGN VGN +QLSE AARHAM+LA ++ Sbjct: 235 EFTWDRTASQLSQRWAIIRKRHGNWNPVGNTSGVQLSEEWRAARHAMNLALDPPVK--NK 292 Query: 171 ISTNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDLVXXXXXXXXXXXXXXXXVNPDPF 350 + N R AA + P + + ++ DP Sbjct: 293 FTNNISGEATPAQHQSQRPFAAKSSPMVPLGSAPKSQI-------AVKRPAKPDLSSDP- 344 Query: 351 XXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPGGTNGLPSNVHFI 530 + IAT S VHIMP G ++KS++PGG + Sbjct: 345 VRATAVAAGARIATQS-DAASLLKAAQAKNAVHIMPTGGSSMKSALPGGASNHSE----- 398 Query: 531 RTGLVSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLALPSRTNASSGIPS-APTSK 707 +HP +N G+ RS P V P+ + P+ ++ IPS + T+K Sbjct: 399 -----AHPNVHTNDLAAGS--------RSTLPVVSPSAI--RPAASSTVQHIPSISDTAK 443 Query: 708 AAAIQENQTVVRSNLRSEKAEVIRAATLANTPQQQVQKDRTFGSGNLLREKV 863 + ++ + + +E A I+ + + +QQV++ SGN L ++V Sbjct: 444 NISAKQFNAELPARKDTETAGAIKILS-EDAKEQQVKEHGACVSGNELSKQV 494 >gb|EMJ25790.1| hypothetical protein PRUPE_ppa1027142mg [Prunus persica] Length = 639 Score = 70.1 bits (170), Expect = 2e-09 Identities = 90/289 (31%), Positives = 118/289 (40%), Gaps = 22/289 (7%) Frame 
= +3 Query: 3 DFKGDRTASQLSQRWAIIRK--QHGNMVGNNL-QLSEAQLAARHAMSLAFGDNLRAARPI 173 DFKGDRTA QLSQRWAII+K Q N+ GN+ +LSEAQLAARH++S+A A+ I Sbjct: 238 DFKGDRTAGQLSQRWAIIKKRNQELNLGGNSSGKLSEAQLAARHSLSVALNMPNLTAKTI 297 Query: 174 STNAXXXXXXXXXXXXRFAAADIASGGPQSKHQQDL-------------VXXXXXXXXXX 314 T + + QQDL Sbjct: 298 GTAGTNAHNKFARKVATSNPVLTTGAKAEPQSQQDLKPTKKPYQMELLGSTTKSQVTSKN 357 Query: 315 XXXXXXVNPDPFXXXXXXXXSSHIATPSTXXXXXXXXXXXXXVVHIMPGGTPAIKSSVPG 494 N D + IA+PS VHIMP + +I+SS+PG Sbjct: 358 TLTKPNCNDDDIVRAIAVAAGARIASPS-DAASLLKAAQAKNAVHIMP-TSGSIQSSLPG 415 Query: 495 G--TNGLPSNVHFIRTGL----VSHPPGPSNASQPGTQHLQGHSLRSASPPVQPNPVLAL 656 G T+ P +RTGL +S PP P++ + P H S ++ P QP P Sbjct: 416 GMSTHSEPHPNLHMRTGLAGITLSTPP-PTDVT-PSAVH--PGSSKALPPMSQPTPT--- 468 Query: 657 PSRTNASSGIPSAPTSKAAAIQENQTVVRSNLRSEKAEVIRAATLANTP 803 + T S I S A + Q V R+E+ VI A L TP Sbjct: 469 -NGTLLSRQIKGVSCSLDAKLPSKQEV-----RTEEGSVI--AELGCTP 509 >ref|XP_004163958.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101223883 [Cucumis sativus] Length = 659 Score = 69.3 bits (168), Expect = 3e-09 Identities = 38/52 (73%), Positives = 42/52 (80%), Gaps = 4/52 (7%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM-VGNN---LQLSEAQLAARHAMSLAFG 146 DF DRTASQLSQRWAII+K+HGN+ VG N QLSE QLAARHAMS+A G Sbjct: 229 DFLSDRTASQLSQRWAIIKKKHGNLNVGVNTAGTQLSEVQLAARHAMSVALG 280 >ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206820 [Cucumis sativus] Length = 659 Score = 69.3 bits (168), Expect = 3e-09 Identities = 38/52 (73%), Positives = 42/52 (80%), Gaps = 4/52 (7%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM-VGNN---LQLSEAQLAARHAMSLAFG 146 DF DRTASQLSQRWAII+K+HGN+ VG N QLSE QLAARHAMS+A G Sbjct: 229 DFLSDRTASQLSQRWAIIKKKHGNLNVGVNTAGTQLSEVQLAARHAMSVALG 280 >gb|EMJ25789.1| hypothetical protein PRUPE_ppa1027142mg [Prunus persica] Length = 339 Score = 63.2 bits (152), Expect = 2e-07 Identities = 37/62 (59%), Positives = 45/62 (72%), Gaps = 3/62 (4%) Frame = +3 Query: 3 
DFKGDRTASQLSQRWAIIRK--QHGNMVGNNL-QLSEAQLAARHAMSLAFGDNLRAARPI 173 DFKGDRTA QLSQRWAII+K Q N+ GN+ +LSEAQLAARH++S+A A+ I Sbjct: 238 DFKGDRTAGQLSQRWAIIKKRNQELNLGGNSSGKLSEAQLAARHSLSVALNMPNLTAKTI 297 Query: 174 ST 179 T Sbjct: 298 GT 299 >gb|EOY15458.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] Length = 606 Score = 60.1 bits (144), Expect = 2e-06 Identities = 37/63 (58%), Positives = 45/63 (71%), Gaps = 7/63 (11%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM--VGNNL--QLSEAQLAARHAMSLAF---GDNLRA 161 DFKGDR+ASQL+QRW II+K+ GN+ GN+ QLSEAQLA R A+SLA NL + Sbjct: 235 DFKGDRSASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMPDKNLTS 294 Query: 162 ARP 170 A P Sbjct: 295 ACP 297 >gb|EOY15457.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao] Length = 674 Score = 60.1 bits (144), Expect = 2e-06 Identities = 37/63 (58%), Positives = 45/63 (71%), Gaps = 7/63 (11%) Frame = +3 Query: 3 DFKGDRTASQLSQRWAIIRKQHGNM--VGNNL--QLSEAQLAARHAMSLAF---GDNLRA 161 DFKGDR+ASQL+QRW II+K+ GN+ GN+ QLSEAQLA R A+SLA NL + Sbjct: 235 DFKGDRSASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMPDKNLTS 294 Query: 162 ARP 170 A P Sbjct: 295 ACP 297