BLASTX nr result
ID: Forsythia23_contig00033614
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia23_contig00033614 (1107 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011100658.1| PREDICTED: uncharacterized protein LOC105178... 336 2e-89 ref|XP_011100655.1| PREDICTED: uncharacterized protein LOC105178... 336 2e-89 ref|XP_012853006.1| PREDICTED: uncharacterized protein LOC105972... 312 3e-82 ref|XP_004253265.1| PREDICTED: uncharacterized protein LOC101263... 279 3e-72 ref|XP_006359028.1| PREDICTED: uncharacterized protein LOC102580... 276 3e-71 emb|CDP01127.1| unnamed protein product [Coffea canephora] 274 7e-71 ref|XP_009627805.1| PREDICTED: uncharacterized protein LOC104118... 271 6e-70 ref|XP_009803652.1| PREDICTED: uncharacterized protein LOC104248... 270 1e-69 ref|XP_002263508.2| PREDICTED: uncharacterized protein LOC100240... 264 1e-67 ref|XP_010263847.1| PREDICTED: uncharacterized protein LOC104602... 263 2e-67 gb|EYU24348.1| hypothetical protein MIMGU_mgv1a0190601mg, partia... 259 2e-66 ref|XP_007019649.1| Nucleic acid-binding proteins superfamily is... 254 6e-65 ref|XP_007019648.1| Nucleic acid-binding proteins superfamily is... 254 6e-65 ref|XP_007019647.1| Nucleic acid-binding proteins superfamily is... 254 6e-65 ref|XP_012482382.1| PREDICTED: uncharacterized protein LOC105797... 254 1e-64 ref|XP_012482380.1| PREDICTED: uncharacterized protein LOC105797... 254 1e-64 gb|KJB28951.1| hypothetical protein B456_005G076800 [Gossypium r... 254 1e-64 gb|KJB28949.1| hypothetical protein B456_005G076800 [Gossypium r... 254 1e-64 gb|KHG28109.1| 30S ribosomal S1, chloroplastic [Gossypium arboreum] 254 1e-64 gb|KMT19650.1| hypothetical protein BVRB_1g010370 [Beta vulgaris... 251 5e-64 >ref|XP_011100658.1| PREDICTED: uncharacterized protein LOC105178804 isoform X2 [Sesamum indicum] Length = 480 Score = 336 bits (862), Expect = 2e-89 Identities = 191/294 (64%), Positives = 211/294 (71%), Gaps = 2/294 (0%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASSKFPLVEIPNLPSISCRGLCFSL 699 MPI I+ P + F S PF PII +TAI FPL + P S SC+ LCFS Sbjct: 1 MPIPIYRPCNTNSS-FTPSAPFTNPII-TTAI-------FPLAKSPKPRSGSCKSLCFSF 51 Query: 698 RLPN-RRTRVPFCSNNGVFEDITDTLLSKKTENE-FEELGLLNKPSPKQVDDASAEEMVE 525 +LP RRTRVP C+ NGVFE+ DTLL K+ ENE EEL LLNKPSPKQV + + EE+ E Sbjct: 52 KLPKFRRTRVPLCAKNGVFEESKDTLLPKRPENEEAEELELLNKPSPKQVSNGALEEIEE 111 Query: 524 IEPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVGV 345 E + DKDE+LE FYKFFR EYYEPKPGDLVVGV Sbjct: 112 NELKTLDKDEILEPFYKFFR-PFEEESDSEGGNLRSAEEENEKISVEYYEPKPGDLVVGV 170 Query: 344 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRND 165 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEM++LLCDVEKD +EF+VRGK+GIV ND Sbjct: 171 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMDYLLCDVEKDADEFLVRGKVGIVSND 230 Query: 164 EAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 EA+S E P KPVVEPGTVLFAEVLGRTL GRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 231 EAISGEPMPGKPVVEPGTVLFAEVLGRTLGGRPLLSTRRLFRRIAWHRVRQIKQ 284 >ref|XP_011100655.1| PREDICTED: uncharacterized protein LOC105178804 isoform X1 [Sesamum indicum] Length = 507 Score = 336 bits (862), Expect = 2e-89 Identities = 191/294 (64%), Positives = 211/294 (71%), Gaps = 2/294 (0%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASSKFPLVEIPNLPSISCRGLCFSL 699 MPI I+ P + F S PF PII +TAI FPL + P S SC+ LCFS Sbjct: 1 MPIPIYRPCNTNSS-FTPSAPFTNPII-TTAI-------FPLAKSPKPRSGSCKSLCFSF 51 Query: 698 RLPN-RRTRVPFCSNNGVFEDITDTLLSKKTENE-FEELGLLNKPSPKQVDDASAEEMVE 525 +LP RRTRVP C+ NGVFE+ DTLL K+ ENE EEL LLNKPSPKQV + + EE+ E Sbjct: 52 KLPKFRRTRVPLCAKNGVFEESKDTLLPKRPENEEAEELELLNKPSPKQVSNGALEEIEE 111 Query: 524 IEPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVGV 345 E + DKDE+LE FYKFFR EYYEPKPGDLVVGV Sbjct: 112 NELKTLDKDEILEPFYKFFR-PFEEESDSEGGNLRSAEEENEKISVEYYEPKPGDLVVGV 170 Query: 344 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRND 165 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEM++LLCDVEKD +EF+VRGK+GIV ND Sbjct: 171 VVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMDYLLCDVEKDADEFLVRGKVGIVSND 230 Query: 164 EAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 EA+S E P KPVVEPGTVLFAEVLGRTL GRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 231 EAISGEPMPGKPVVEPGTVLFAEVLGRTLGGRPLLSTRRLFRRIAWHRVRQIKQ 284 >ref|XP_012853006.1| PREDICTED: uncharacterized protein LOC105972587 [Erythranthe guttatus] gi|604305156|gb|EYU24335.1| hypothetical protein MIMGU_mgv1a004902mg [Erythranthe guttata] Length = 505 Score = 312 bits (799), Expect = 3e-82 Identities = 175/295 (59%), Positives = 198/295 (67%), Gaps = 3/295 (1%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASSKFPLVEIPNLP--SISCRGLCF 705 M ILIH P + F SS PF P I +T FP + P P S SCR Sbjct: 1 MAILIHCPCNSKSTFFNSSPPFTNPAISTT---------FPPAKTPKKPHSSNSCRTSFP 51 Query: 704 SLRLPN-RRTRVPFCSNNGVFEDITDTLLSKKTENEFEELGLLNKPSPKQVDDASAEEMV 528 +L+L RRT + FC+ NGVFE+ DTLLS+K ENE EL LNKPSPKQ + EE Sbjct: 52 ALKLSKFRRTHITFCTKNGVFEEFKDTLLSRKPENEEPEL--LNKPSPKQANPLPVEETE 109 Query: 527 EIEPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVG 348 ++EP KPDKD +LE FYKFFR EYYEPKPGDLVVG Sbjct: 110 QLEPDKPDKDVILEPFYKFFRPLEEEINRNNIEEEEEGDDNIEKVGIEYYEPKPGDLVVG 169 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGAD LGTMLTKEVLPLYDKEM++LLCDVE D +EF VRGK+G+VRN Sbjct: 170 VVVSGNENKLDVNVGADTLGTMLTKEVLPLYDKEMDYLLCDVESDSDEFFVRGKVGLVRN 229 Query: 167 DEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 D A+ E +PVV+PGT+LFAEVLGRTL GRPL+STRRL RRIAWHRVRQIKQ Sbjct: 230 DVAMGGERGHGRPVVDPGTILFAEVLGRTLGGRPLISTRRLFRRIAWHRVRQIKQ 284 >ref|XP_004253265.1| PREDICTED: uncharacterized protein LOC101263198 [Solanum lycopersicum] Length = 513 Score = 279 bits (713), Expect = 3e-72 Identities = 166/299 (55%), Positives = 196/299 (65%), Gaps = 7/299 (2%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASSKFPLVEIPNLPSISCRGLC--- 708 MP+L+ K F +F LP +IY+TAIQ S P +P P S + L Sbjct: 1 MPLLLLP--CKSFSIFNPILPLNTSVIYNTAIQFSGFSLSPKYPLPRTPK-SSKNLSIHW 57 Query: 707 -FSLRLPNRRTRVPFCSNNGVFEDITDTLLSKKTENEFEELGLLNKPSPKQVDDASAEEM 531 + + LP T V FCS N +FE+ T L++ E+E EL L NKP KQ+D+ + Sbjct: 58 NYQINLP---THVLFCSKNEIFEEFRTTQLAELPESE--ELELHNKPYLKQIDNGVVSD- 111 Query: 530 VEIEPQKPDKDEVLETFYKFFR---LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGD 360 VE + +K KDEVLE FYK F+ EYYEPKPGD Sbjct: 112 VEEDQKKVSKDEVLEPFYKLFKPIESNEEESDIEQEEEVHPVVEESKKVSVEYYEPKPGD 171 Query: 359 LVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIG 180 LVVGVVVSGNENKLDV++GADLLGTMLTK+VLPLYDKE+ +LLCD+EKD EEF+VRGK+G Sbjct: 172 LVVGVVVSGNENKLDVSIGADLLGTMLTKDVLPLYDKEIGYLLCDLEKDAEEFLVRGKMG 231 Query: 179 IVRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 I+ D+AVS ES P KPVVEPGTVLFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 232 ILSYDDAVSGESTPGKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 290 >ref|XP_006359028.1| PREDICTED: uncharacterized protein LOC102580008 [Solanum tuberosum] Length = 513 Score = 276 bits (705), Expect = 3e-71 Identities = 165/298 (55%), Positives = 191/298 (64%), Gaps = 6/298 (2%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKA---SSKFPLVEIPNLPSISCRGLC 708 MP+L+ K F +F LP +IY+TA Q A S K+PL P Sbjct: 1 MPLLLLP--CKSFSIFNPILPLNTSVIYNTATQFSAFPLSHKYPLARTPKSSKNLSLHWN 58 Query: 707 FSLRLPNRRTRVPFCSNNGVFEDITDTLLSKKTENEFEELGLLNKPSPKQVDDASAEEMV 528 + + L T V FCS N +FE+ T L + E+E EL L NKP KQ+D+ + V Sbjct: 59 YQISL---HTHVSFCSKNEIFEEFRTTQLDELPESE--ELELHNKPYLKQIDNGVVSD-V 112 Query: 527 EIEPQKPDKDEVLETFYKFFR---LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDL 357 E E +K KDEVLE FYK F+ EYYEPKPGDL Sbjct: 113 EEEQKKVSKDEVLEPFYKLFKPTESNEEESDTEQEEEVHPVVEESKKVSVEYYEPKPGDL 172 Query: 356 VVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGI 177 VVGVVVSGNENKLDV+VGADLLGTMLTK+VLPLYDKEM +LLCD+EKD EEF+VRGK+GI Sbjct: 173 VVGVVVSGNENKLDVSVGADLLGTMLTKDVLPLYDKEMGYLLCDLEKDAEEFLVRGKMGI 232 Query: 176 VRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 V D+A+S ES KP+VEPGTVLFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 233 VSYDDAISGESTSGKPIVEPGTVLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 290 >emb|CDP01127.1| unnamed protein product [Coffea canephora] Length = 479 Score = 274 bits (701), Expect = 7e-71 Identities = 148/231 (64%), Positives = 167/231 (72%), Gaps = 5/231 (2%) Frame = -2 Query: 680 TRVPFCSNNGVFEDITDTLLSKKTENEFEELGLLNKPSPKQVDDASAEEMVEIEPQKPDK 501 TRV FCS N VFED + + EN F+EL LL+KP PKQ+DD S E+ E E +K DK Sbjct: 27 TRVLFCSKNEVFEDFKSAHVRPEIEN-FDELELLDKPFPKQMDDGSVTEIEEEELKKDDK 85 Query: 500 DEVLETFYKFFR-----LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVGVVVS 336 D+VLE FYKFF+ EYYEPKPGDLVVGVVVS Sbjct: 86 DDVLEEFYKFFKPRDELRQEIDVEEGGKGSQTGEGYQNEKVSIEYYEPKPGDLVVGVVVS 145 Query: 335 GNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRNDEAV 156 GNEN+LDVNVGAD+LGTMLTKEVLPLYDKE+ LLC++E D EEFMV GK GIV+NDEAV Sbjct: 146 GNENRLDVNVGADILGTMLTKEVLPLYDKEINDLLCNLENDAEEFMVNGKAGIVKNDEAV 205 Query: 155 SEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 S E+ P +PVV PGT+L+AEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 206 SREAMPGRPVVAPGTLLYAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 256 >ref|XP_009627805.1| PREDICTED: uncharacterized protein LOC104118291 [Nicotiana tomentosiformis] Length = 520 Score = 271 bits (693), Expect = 6e-70 Identities = 164/296 (55%), Positives = 187/296 (63%), Gaps = 14/296 (4%) Frame = -2 Query: 848 KCFPLFISSLPFKIPIIYSTAIQTKASS---KFPLVEIPNLPSISCRGLCFSLRLPNRRT 678 K F + LP +IY+TA + S K+PLV+ P + L T Sbjct: 9 KSFSILNPILPLNTSVIYNTATKFSIFSTPPKYPLVKTPKFSKNLSSHWNSQISL---HT 65 Query: 677 RVPFCSNNGVFEDITDTLLSKKTE-NEFEELGLLNKPSPKQVDDASAEEMVEIEPQKPDK 501 V FCS N + E+ + T L+K E E EEL LLNKP KQ+++ E +E EP+K K Sbjct: 66 HVSFCSKNEILEEFSTTQLAKVPEIEESEELELLNKPYLKQINNGVGAE-IEEEPKKVSK 124 Query: 500 DEVLETFYKFFR---------LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVG 348 DEVLE FYK F+ EYYEPKPGDLVVG Sbjct: 125 DEVLEPFYKLFKPREESLEQESSDTEDTEQEEEEVHSVEAESKKISVEYYEPKPGDLVVG 184 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGADLLGTMLTK+VLPLYDKEM +LLCD+EKD EEF+VRGK+GIV Sbjct: 185 VVVSGNENKLDVNVGADLLGTMLTKDVLPLYDKEMGYLLCDLEKDAEEFLVRGKMGIVSY 244 Query: 167 DEAVS-EESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 DEA+ ES KPVVEPGTVLFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 245 DEAIECGESMSGKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 300 >ref|XP_009803652.1| PREDICTED: uncharacterized protein LOC104248985 [Nicotiana sylvestris] Length = 521 Score = 270 bits (690), Expect = 1e-69 Identities = 167/306 (54%), Positives = 193/306 (63%), Gaps = 14/306 (4%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASS---KFPLVEIPNLPSISCRGLC 708 MPIL+ K F + LP +IY+TA + S K+PLV+ P Sbjct: 2 MPILLLP--CKSFSILNPILPLNTSVIYNTATKFSIFSTPPKYPLVKTPKFSKNFSSHWN 59 Query: 707 FSLRLPNRRTRVPFCSNNGVFEDITDTLLSKKTENE-FEELGLLNKPSPKQVDDASAEEM 531 + L T V FC+ N + E+ + T L+K E+E EEL LLNKP KQ+++ E Sbjct: 60 SQISL---HTHVSFCTKNEILEEFSTTQLAKVPESEESEELELLNKPYLKQINNGVGTE- 115 Query: 530 VEIEPQKPDKDEVLETFYKFFR---------LXXXXXXXXXXXXXXXXXXXXXXXXXEYY 378 +E EP+K KDEVLE FYK F+ EYY Sbjct: 116 IEEEPKKVSKDEVLEPFYKLFKPREESLGQESSDIEDTEQEEEKVHSVEEESKKISVEYY 175 Query: 377 EPKPGDLVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFM 198 EPKPGDLVVGVVVSGNE KLDVNVGADLLGTMLTK+VLPLYDKEM +LLCD+EKD EEF+ Sbjct: 176 EPKPGDLVVGVVVSGNEYKLDVNVGADLLGTMLTKDVLPLYDKEMGYLLCDLEKDAEEFL 235 Query: 197 VRGKIGIVRNDEAV-SEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHR 21 VRGK+GIV DEAV S ES KPVVEPGTVLF+EVLGRTLSGRPLLSTRRL RRIAWHR Sbjct: 236 VRGKMGIVSYDEAVESRESMSGKPVVEPGTVLFSEVLGRTLSGRPLLSTRRLFRRIAWHR 295 Query: 20 VRQIKQ 3 VRQIKQ Sbjct: 296 VRQIKQ 301 >ref|XP_002263508.2| PREDICTED: uncharacterized protein LOC100240915 [Vitis vinifera] Length = 513 Score = 264 bits (674), Expect = 1e-67 Identities = 159/300 (53%), Positives = 187/300 (62%), Gaps = 8/300 (2%) Frame = -2 Query: 878 MPILIHHPVSKCFPLFISSLPFKIPIIYSTAIQTKASSKFPLVEIPNLPSISCRGLCFSL 699 MPI++ HP + F S P I +++ T K PL + + +S + Sbjct: 1 MPIVLAHPSNSLSFRFSSHPPLNSNISFTSFHNTP---KLPLRKPQSKTLVSPKNSPVW- 56 Query: 698 RLPNRRTRVPFCSNNGVFEDITDTLLSKKTE----NEFEELGLLNKPSPKQVDDASAEEM 531 R T++ FCS N +F DI+ T L + E + EEL LL KPSP +++ SA + Sbjct: 57 ----RTTQISFCSPNDIFYDISSTQLPETPEIDGVQDIEELELLGKPSPVPLNNGSASD- 111 Query: 530 VEIEPQKPDKDEVLETFYKFFR----LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPG 363 ++ E +KPDKDE L F KFF+ EYYEPKPG Sbjct: 112 IDSELKKPDKDEALAPFLKFFKPRESSEEANGASGEDGSEISESGSTKLVSVEYYEPKPG 171 Query: 362 DLVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKI 183 D VVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEME+LLCDVEKD EEFMV GK+ Sbjct: 172 DFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEYLLCDVEKDAEEFMVHGKM 231 Query: 182 GIVRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 IVRND+A+S PVVE GTVLFAEVLGRTLSGRPLLSTRR RRIAWHRVRQIKQ Sbjct: 232 SIVRNDDALSRVPMQGSPVVETGTVLFAEVLGRTLSGRPLLSTRRFFRRIAWHRVRQIKQ 291 >ref|XP_010263847.1| PREDICTED: uncharacterized protein LOC104602009 [Nelumbo nucifera] Length = 529 Score = 263 bits (671), Expect = 2e-67 Identities = 163/311 (52%), Positives = 195/311 (62%), Gaps = 25/311 (8%) Frame = -2 Query: 860 HPVSKCFPLFISSLPFKIPIIYSTA---IQTKASS---KFPLVEIPNLPSISCRG----L 711 H KC SS +P+ +S A IQ + +S KF + + P L S+S G L Sbjct: 5 HQPCKCLNFLSSS----VPLNFSAARKIIQNRENSSTPKFVVSKTPTLFSVSVTGSPKNL 60 Query: 710 CFSLRLPN-RRTRVPFCSNNGVFEDITDTLLSKKTENE----FEELGLLNKPSPKQVDDA 546 F + P R+T V CS+N FE T +S K E+E EEL LL KP+ V +A Sbjct: 61 VFLPKSPFCRKTHVFLCSSNDEFETSRGTQVSNKPESERLEEMEELELLGKPALITVRNA 120 Query: 545 SAEEMVEIEPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXE------ 384 S EE EP+KP++DE L F KFF+ E Sbjct: 121 SVEEEKVAEPRKPEEDEALAPFLKFFKARDSLEQGEVSELEVTDEEVSEEEREEKEENKK 180 Query: 383 ----YYEPKPGDLVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEK 216 YYEPKPGD VVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYD+E+ +LLCD+EK Sbjct: 181 VSVEYYEPKPGDFVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDQELPYLLCDMEK 240 Query: 215 DVEEFMVRGKIGIVRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRR 36 D EEFMVRGK+GIV++++A+S E P +PVVE GTVLFAEVLGRTLSGRPLLSTRRL RR Sbjct: 241 DSEEFMVRGKMGIVQDEDAMSGEPVPGRPVVETGTVLFAEVLGRTLSGRPLLSTRRLFRR 300 Query: 35 IAWHRVRQIKQ 3 +AWHRVRQI+Q Sbjct: 301 VAWHRVRQIEQ 311 >gb|EYU24348.1| hypothetical protein MIMGU_mgv1a0190601mg, partial [Erythranthe guttata] Length = 367 Score = 259 bits (663), Expect = 2e-66 Identities = 136/204 (66%), Positives = 151/204 (74%) Frame = -2 Query: 614 KTENEFEELGLLNKPSPKQVDDASAEEMVEIEPQKPDKDEVLETFYKFFRLXXXXXXXXX 435 K ENE EL LNKPSPKQ + EE ++EP KPDK+ +LE FYKFFR Sbjct: 1 KPENEEPEL--LNKPSPKQANPLPVEETEQLEPDKPDKNVILEPFYKFFRPLEEEINRNN 58 Query: 434 XXXXXXXXXXXXXXXXEYYEPKPGDLVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLY 255 EYYEPKPGDLVVGVVVSGNENKLDVNVGAD LGTMLTKEVLPLY Sbjct: 59 IEEEEEGDDNIEKVGIEYYEPKPGDLVVGVVVSGNENKLDVNVGADTLGTMLTKEVLPLY 118 Query: 254 DKEMEHLLCDVEKDVEEFMVRGKIGIVRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLS 75 DKEM++LLCDVE D +EF VRGK+G+VRND A+S E +PVV+PGT+LFAEVLGRTL Sbjct: 119 DKEMDYLLCDVESDSDEFFVRGKVGLVRNDVAMSGERGHGRPVVDPGTILFAEVLGRTLG 178 Query: 74 GRPLLSTRRLVRRIAWHRVRQIKQ 3 GRPL+STRRL RRIAWHRVRQIKQ Sbjct: 179 GRPLISTRRLFRRIAWHRVRQIKQ 202 >ref|XP_007019649.1| Nucleic acid-binding proteins superfamily isoform 3, partial [Theobroma cacao] gi|508724977|gb|EOY16874.1| Nucleic acid-binding proteins superfamily isoform 3, partial [Theobroma cacao] Length = 511 Score = 254 bits (650), Expect = 6e-65 Identities = 142/236 (60%), Positives = 164/236 (69%), Gaps = 8/236 (3%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R TR+ FCS N F++ + T L + EN E EEL LLNKPSP V++ A ++ Sbjct: 59 RSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAADV--- 115 Query: 521 EPQKPDKDEVLETFYKFFR---LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVV 351 +KPDKDE LE F KFFR EYYEPKPGDLVV Sbjct: 116 --EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVSEEKSNEFKKVGVEYYEPKPGDLVV 173 Query: 350 GVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVR 171 GVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CD++ + EEFM GK+GIV+ Sbjct: 174 GVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEEFMGYGKMGIVK 233 Query: 170 NDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 +D+A+S P +PVVE GT+LFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 234 DDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 289 >ref|XP_007019648.1| Nucleic acid-binding proteins superfamily isoform 2 [Theobroma cacao] gi|508724976|gb|EOY16873.1| Nucleic acid-binding proteins superfamily isoform 2 [Theobroma cacao] Length = 521 Score = 254 bits (650), Expect = 6e-65 Identities = 142/236 (60%), Positives = 164/236 (69%), Gaps = 8/236 (3%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R TR+ FCS N F++ + T L + EN E EEL LLNKPSP V++ A ++ Sbjct: 60 RSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAADV--- 116 Query: 521 EPQKPDKDEVLETFYKFFR---LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVV 351 +KPDKDE LE F KFFR EYYEPKPGDLVV Sbjct: 117 --EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVSEEKSNEFKKVGVEYYEPKPGDLVV 174 Query: 350 GVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVR 171 GVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CD++ + EEFM GK+GIV+ Sbjct: 175 GVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEEFMGYGKMGIVK 234 Query: 170 NDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 +D+A+S P +PVVE GT+LFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 235 DDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 290 >ref|XP_007019647.1| Nucleic acid-binding proteins superfamily isoform 1 [Theobroma cacao] gi|508724975|gb|EOY16872.1| Nucleic acid-binding proteins superfamily isoform 1 [Theobroma cacao] Length = 512 Score = 254 bits (650), Expect = 6e-65 Identities = 142/236 (60%), Positives = 164/236 (69%), Gaps = 8/236 (3%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R TR+ FCS N F++ + T L + EN E EEL LLNKPSP V++ A ++ Sbjct: 60 RSTRIVFCSQNDTFDEFSSTQLPESLENDSRIRENEELELLNKPSPVPVNNGFAADV--- 116 Query: 521 EPQKPDKDEVLETFYKFFR---LXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVV 351 +KPDKDE LE F KFFR EYYEPKPGDLVV Sbjct: 117 --EKPDKDEALEPFLKFFRPGESLEIEEGGGELGVSEEKSNEFKKVGVEYYEPKPGDLVV 174 Query: 350 GVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVR 171 GVVVSGNENKLDVNVGAD+LGTMLTKEVLPLYDKEME+L CD++ + EEFM GK+GIV+ Sbjct: 175 GVVVSGNENKLDVNVGADMLGTMLTKEVLPLYDKEMEYLSCDLKTNAEEFMGYGKMGIVK 234 Query: 170 NDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 +D+A+S P +PVVE GT+LFAEVLGRTLSGRPLLSTRRL RRIAWHRVRQIKQ Sbjct: 235 DDDAMSGGPVPGRPVVETGTMLFAEVLGRTLSGRPLLSTRRLFRRIAWHRVRQIKQ 290 >ref|XP_012482382.1| PREDICTED: uncharacterized protein LOC105797016 isoform X2 [Gossypium raimondii] Length = 459 Score = 254 bits (648), Expect = 1e-64 Identities = 138/234 (58%), Positives = 165/234 (70%), Gaps = 7/234 (2%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R T++ CS N F++ + T ++ EN E EEL LLNKPSP V++ + V+ Sbjct: 4 RSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVSD-VDK 62 Query: 521 EPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXE--YYEPKPGDLVVG 348 E +KPDK+EVLE F KFFR YYEPKPGDLVVG Sbjct: 63 ESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELEDSEEKIDEVKKVGVEYYEPKPGDLVVG 122 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CD+E + EEFMV GK+GIV++ Sbjct: 123 VVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEEFMVYGKMGIVKD 182 Query: 167 DEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIK 6 D+A+S P +PVVE GTVLFAEVLGRTLSGRPLLSTR+L RRIAWHRVRQIK Sbjct: 183 DDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWHRVRQIK 236 >ref|XP_012482380.1| PREDICTED: uncharacterized protein LOC105797016 isoform X1 [Gossypium raimondii] Length = 550 Score = 254 bits (648), Expect = 1e-64 Identities = 138/234 (58%), Positives = 165/234 (70%), Gaps = 7/234 (2%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R T++ CS N F++ + T ++ EN E EEL LLNKPSP V++ + V+ Sbjct: 95 RSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVSD-VDK 153 Query: 521 EPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXE--YYEPKPGDLVVG 348 E +KPDK+EVLE F KFFR YYEPKPGDLVVG Sbjct: 154 ESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELEDSEEKIDEVKKVGVEYYEPKPGDLVVG 213 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CD+E + EEFMV GK+GIV++ Sbjct: 214 VVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEEFMVYGKMGIVKD 273 Query: 167 DEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIK 6 D+A+S P +PVVE GTVLFAEVLGRTLSGRPLLSTR+L RRIAWHRVRQIK Sbjct: 274 DDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWHRVRQIK 327 >gb|KJB28951.1| hypothetical protein B456_005G076800 [Gossypium raimondii] Length = 383 Score = 254 bits (648), Expect = 1e-64 Identities = 138/234 (58%), Positives = 165/234 (70%), Gaps = 7/234 (2%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R T++ CS N F++ + T ++ EN E EEL LLNKPSP V++ + V+ Sbjct: 59 RSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVSD-VDK 117 Query: 521 EPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXE--YYEPKPGDLVVG 348 E +KPDK+EVLE F KFFR YYEPKPGDLVVG Sbjct: 118 ESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELEDSEEKIDEVKKVGVEYYEPKPGDLVVG 177 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CD+E + EEFMV GK+GIV++ Sbjct: 178 VVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEEFMVYGKMGIVKD 237 Query: 167 DEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIK 6 D+A+S P +PVVE GTVLFAEVLGRTLSGRPLLSTR+L RRIAWHRVRQIK Sbjct: 238 DDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWHRVRQIK 291 >gb|KJB28949.1| hypothetical protein B456_005G076800 [Gossypium raimondii] gi|763761696|gb|KJB28950.1| hypothetical protein B456_005G076800 [Gossypium raimondii] gi|763761698|gb|KJB28952.1| hypothetical protein B456_005G076800 [Gossypium raimondii] Length = 514 Score = 254 bits (648), Expect = 1e-64 Identities = 138/234 (58%), Positives = 165/234 (70%), Gaps = 7/234 (2%) Frame = -2 Query: 686 RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGLLNKPSPKQVDDASAEEMVEI 522 R T++ CS N F++ + T ++ EN E EEL LLNKPSP V++ + V+ Sbjct: 59 RSTQIVLCSQNDTFDEFSSTQFPERFENDSGIEENEELELLNKPSPAPVNNGFVSD-VDK 117 Query: 521 EPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXE--YYEPKPGDLVVG 348 E +KPDK+EVLE F KFFR YYEPKPGDLVVG Sbjct: 118 ESEKPDKEEVLEPFLKFFRPSEPLEVEEGSELEDSEEKIDEVKKVGVEYYEPKPGDLVVG 177 Query: 347 VVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRN 168 VVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+CD+E + EEFMV GK+GIV++ Sbjct: 178 VVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLVCDLENNAEEFMVYGKMGIVKD 237 Query: 167 DEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIK 6 D+A+S P +PVVE GTVLFAEVLGRTLSGRPLLSTR+L RRIAWHRVRQIK Sbjct: 238 DDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQLFRRIAWHRVRQIK 291 >gb|KHG28109.1| 30S ribosomal S1, chloroplastic [Gossypium arboreum] Length = 550 Score = 254 bits (648), Expect = 1e-64 Identities = 143/254 (56%), Positives = 171/254 (67%), Gaps = 12/254 (4%) Frame = -2 Query: 731 SISCRGLCFSLRLPN-----RRTRVPFCSNNGVFEDITDTLLSKKTEN-----EFEELGL 582 SI+ G +L P R T++ CS N F++ + T L ++ EN E EEL L Sbjct: 75 SITAAGTPKALSFPRKYTFLRSTQIVLCSQNDTFDEFSSTQLPERFENDSGIEENEELEL 134 Query: 581 LNKPSPKQVDDASAEEMVEIEPQKPDKDEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXX 402 LNKPSP V++ + V+ E +KPDK+EVLE F KFFR Sbjct: 135 LNKPSPAPVNNGFVSD-VDKESEKPDKEEVLEPFLKFFRPSEPLQVEGRGELEDSEEKID 193 Query: 401 XXXXXE--YYEPKPGDLVVGVVVSGNENKLDVNVGADLLGTMLTKEVLPLYDKEMEHLLC 228 YYEPKPGDLVVGVVVSGNENKLDVNVGAD+LGTMLTK+VLPLYDKEM++L+C Sbjct: 194 EVKKVGVEYYEPKPGDLVVGVVVSGNENKLDVNVGADMLGTMLTKDVLPLYDKEMDYLMC 253 Query: 227 DVEKDVEEFMVRGKIGIVRNDEAVSEESKPRKPVVEPGTVLFAEVLGRTLSGRPLLSTRR 48 D+E EEFM GK+GIV++D+A+S P +PVVE GTVLFAEVLGRTLSGRPLLSTR+ Sbjct: 254 DLENKAEEFMFYGKMGIVKDDDAMSGGPGPGRPVVETGTVLFAEVLGRTLSGRPLLSTRQ 313 Query: 47 LVRRIAWHRVRQIK 6 L RRIAWHRVRQIK Sbjct: 314 LFRRIAWHRVRQIK 327 >gb|KMT19650.1| hypothetical protein BVRB_1g010370 [Beta vulgaris subsp. vulgaris] Length = 494 Score = 251 bits (642), Expect = 5e-64 Identities = 133/226 (58%), Positives = 159/226 (70%) Frame = -2 Query: 680 TRVPFCSNNGVFEDITDTLLSKKTENEFEELGLLNKPSPKQVDDASAEEMVEIEPQKPDK 501 T+V FC + ED T+ L ++N EEL L NKPSP + +++ VE EP+KPDK Sbjct: 47 TQVAFCYAEDISEDSTNNQLPANSQNGTEELELRNKPSPMMPVNGASDTKVESEPKKPDK 106 Query: 500 DEVLETFYKFFRLXXXXXXXXXXXXXXXXXXXXXXXXXEYYEPKPGDLVVGVVVSGNENK 321 +E LE F KFF+ EYYEPKPGD VVGVV+SGNENK Sbjct: 107 EEALEPFLKFFK-SKDLEEEVVEQNGIERVKESEKVVVEYYEPKPGDFVVGVVISGNENK 165 Query: 320 LDVNVGADLLGTMLTKEVLPLYDKEMEHLLCDVEKDVEEFMVRGKIGIVRNDEAVSEESK 141 LDVNVGADLLGTMLTKEVLPLYDKEM+++ CD++K+ EEFM GK+GIVRN++A S + Sbjct: 166 LDVNVGADLLGTMLTKEVLPLYDKEMDYMSCDLDKNPEEFMSNGKMGIVRNEDAGSRQPV 225 Query: 140 PRKPVVEPGTVLFAEVLGRTLSGRPLLSTRRLVRRIAWHRVRQIKQ 3 +PVV+ GTVLFAEVLGRTLSGRPLLSTRR R+IAWHRVRQIKQ Sbjct: 226 SGRPVVDAGTVLFAEVLGRTLSGRPLLSTRRFFRQIAWHRVRQIKQ 271