BLASTX nr result
ID: Forsythia23_contig00031174
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00031174 (831 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_011081728.1| PREDICTED: uncharacterized protein LOC105164...   300   6e-79
emb|CDP08201.1| unnamed protein product [Coffea canephora]   276   2e-71
ref|XP_012857911.1| PREDICTED: uncharacterized protein LOC105977...   270   1e-69
ref|XP_012857910.1| PREDICTED: uncharacterized protein LOC105977...   270   1e-69
gb|EYU20397.1| hypothetical protein MIMGU_mgv1a018711mg, partial...   270   1e-69
ref|XP_010648306.1| PREDICTED: uncharacterized protein LOC100254...   256   1e-65
ref|XP_010648305.1| PREDICTED: uncharacterized protein LOC100254...   256   1e-65
ref|XP_010648304.1| PREDICTED: uncharacterized protein LOC100254...   256   1e-65
ref|XP_010648303.1| PREDICTED: uncharacterized protein LOC100254...   256   1e-65
emb|CBI20600.3| unnamed protein product [Vitis vinifera]   256   1e-65
emb|CAN74834.1| hypothetical protein VITISV_023323 [Vitis vinifera]   256   1e-65
ref|XP_006371866.1| hypothetical protein POPTR_0018s04800g [Popu...   247   8e-63
ref|XP_006371865.1| hypothetical protein POPTR_0018s04800g [Popu...   247   8e-63
ref|XP_002324750.2| hypothetical protein POPTR_0018s04800g [Popu...   247   8e-63
ref|XP_007012208.1| Tetratricopeptide repeat-like superfamily pr...   241   4e-61
ref|XP_007012207.1| Tetratricopeptide repeat-like superfamily pr...   241   4e-61
ref|XP_007012206.1| Tetratricopeptide repeat-like superfamily pr...   241   4e-61
ref|XP_007012205.1| Tetratricopeptide repeat-like superfamily pr...   241   4e-61
ref|XP_007012204.1| Tetratricopeptide repeat-like superfamily pr...
241 4e-61 ref|XP_002516492.1| conserved hypothetical protein [Ricinus comm... 239 2e-60 >ref|XP_011081728.1| PREDICTED: uncharacterized protein LOC105164707 [Sesamum indicum] Length = 2041 Score = 300 bits (769), Expect = 6e-79 Identities = 159/276 (57%), Positives = 193/276 (69%), Gaps = 3/276 (1%) Frame = -1 Query: 819 YRSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEE 640 Y S +R++I+L S+ V K CM AGA M +NCNS N+ KE FEE Sbjct: 280 YISRDIRLTIKLPSSAPKSTGAVGAKGFTCMPAGAGMPFANCNSANE-----KEGTAFEE 334 Query: 639 QPQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSV 469 QPQERRSSRL RSRKPGKEESD +K+ KVVKQFL PYLV CN D+ + Sbjct: 335 QPQERRSSRLERLRSRKPGKEESDLPPSKELAKVVKQFLVPYLVDGPGTINCNQDSDPAF 394 Query: 468 YCAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEK 289 +C E +A+S +++ TDVI+FVQ TS NFGAYHM HLLLE+IANR I +Q+S ++ILDLEK Sbjct: 395 HCVEVLANSPESESTDVIEFVQNTSNNFGAYHMCHLLLEKIANRTILHQNSIARILDLEK 454 Query: 288 LTRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHM 109 TRH G +RTPECSLFL ELYYD G++S +T+ FMSE SYHLCKIIESVALEYPFH+ Sbjct: 455 ETRHWGHERTPECSLFLSELYYDMGLQSFETSTTCSFMSEASYHLCKIIESVALEYPFHI 514 Query: 108 NGMQGKENCSSDDSSEQNHQLPIDSTSLLRNNYPFW 1 M K NCS D S N +LP+D++SLLR N+ FW Sbjct: 515 TAMDEKNNCSIIDVSGHNKKLPMDNSSLLRGNHCFW 550 >emb|CDP08201.1| unnamed protein product [Coffea canephora] Length = 2057 Score = 276 bits (705), Expect = 2e-71 Identities = 145/276 (52%), Positives = 188/276 (68%), Gaps = 3/276 (1%) Frame = -1 Query: 819 YRSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEE 640 Y+S V +SI+L +SG+ M T+E K + T+ M +NCN + KE + EE Sbjct: 303 YKSGDVSLSIRLPHTSGSGMETLESKGSMLTTSSEDMPFANCNFEKNSHTKEKEANVSEE 362 Query: 639 QPQERRSSR---LRSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSV 469 QPQERRSSR LRSRKPGKE+SDF T +D KV+ QFL P++ G + DA S Sbjct: 363 QPQERRSSRIERLRSRKPGKEDSDFGTTRDLAKVIVQFLRPFIAGGGGSDDYTTDASTSS 422 Query: 468 YCAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEK 289 CAE V S D++ TDVI+FV+KTSEN+GAYHMSHL+LEEIA+R I +QDSN+K LDLEK 
Sbjct: 423 DCAEIVTRSQDSESTDVIRFVEKTSENYGAYHMSHLILEEIASRCIFFQDSNAKFLDLEK 482 Query: 288 LTRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHM 109 LTR G +RTPECSLFL ELYYDFG+RS D++ E+MSE SYH+CK+IE VALE P Sbjct: 483 LTRQWGKERTPECSLFLAELYYDFGLRSPDSST-SEYMSEASYHICKVIECVALECPLQS 541 Query: 108 NGMQGKENCSSDDSSEQNHQLPIDSTSLLRNNYPFW 1 + +N SS +S ++ +D++ L N++PFW Sbjct: 542 LAVASHDNLSSRESLSDPCKIAVDNSHPLSNDFPFW 577 >ref|XP_012857911.1| PREDICTED: uncharacterized protein LOC105977172 isoform X2 [Erythranthe guttatus] Length = 1939 Score = 270 bits (689), Expect = 1e-69 Identities = 150/276 (54%), Positives = 186/276 (67%), Gaps = 3/276 (1%) Frame = -1 Query: 819 YRSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEE 640 Y S VR++IQL S+ + +++ + C G N NS N+ KE IFEE Sbjct: 260 YISGDVRLTIQLPPSATKLTGSIDTERSTC---GGGTPFGNGNSINE-----KEGTIFEE 311 Query: 639 QPQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSV 469 QPQERRSSRL RSRKPGKEES+F++NKD KVVKQFL P+L+ H++ S Sbjct: 312 QPQERRSSRLERLRSRKPGKEESEFSSNKDLAKVVKQFLVPHLLDGTGAIHSKHNSDPSC 371 Query: 468 YCAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEK 289 AE A+SLD++ DVI+FVQ TS NFGAYHM HLLLE+IAN I Y D+ KILDLEK Sbjct: 372 N-AEVTANSLDSEPIDVIEFVQNTSNNFGAYHMGHLLLEKIANSSILYHDNIGKILDLEK 430 Query: 288 LTRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHM 109 RH G +RTPECSLFL ELYYD G+RS +T+ + F SE SYHLCK+IESVAL YPFH+ Sbjct: 431 NIRHWGKERTPECSLFLSELYYDMGLRSSETSIVCSFTSEASYHLCKVIESVALGYPFHI 490 Query: 108 NGMQGKENCSSDDSSEQNHQLPIDSTSLLRNNYPFW 1 +GM G+ D SE N Q +D++SLLR+N+ FW Sbjct: 491 SGMDGEIKFPMADVSEDNQQGQMDNSSLLRSNHRFW 526 >ref|XP_012857910.1| PREDICTED: uncharacterized protein LOC105977172 isoform X1 [Erythranthe guttatus] Length = 1957 Score = 270 bits (689), Expect = 1e-69 Identities = 150/276 (54%), Positives = 186/276 (67%), Gaps = 3/276 (1%) Frame = -1 Query: 819 YRSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEE 640 Y S VR++IQL S+ + +++ + C G N NS N+ KE IFEE Sbjct: 278 
YISGDVRLTIQLPPSATKLTGSIDTERSTC---GGGTPFGNGNSINE-----KEGTIFEE 329 Query: 639 QPQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSV 469 QPQERRSSRL RSRKPGKEES+F++NKD KVVKQFL P+L+ H++ S Sbjct: 330 QPQERRSSRLERLRSRKPGKEESEFSSNKDLAKVVKQFLVPHLLDGTGAIHSKHNSDPSC 389 Query: 468 YCAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEK 289 AE A+SLD++ DVI+FVQ TS NFGAYHM HLLLE+IAN I Y D+ KILDLEK Sbjct: 390 N-AEVTANSLDSEPIDVIEFVQNTSNNFGAYHMGHLLLEKIANSSILYHDNIGKILDLEK 448 Query: 288 LTRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHM 109 RH G +RTPECSLFL ELYYD G+RS +T+ + F SE SYHLCK+IESVAL YPFH+ Sbjct: 449 NIRHWGKERTPECSLFLSELYYDMGLRSSETSIVCSFTSEASYHLCKVIESVALGYPFHI 508 Query: 108 NGMQGKENCSSDDSSEQNHQLPIDSTSLLRNNYPFW 1 +GM G+ D SE N Q +D++SLLR+N+ FW Sbjct: 509 SGMDGEIKFPMADVSEDNQQGQMDNSSLLRSNHRFW 544 >gb|EYU20397.1| hypothetical protein MIMGU_mgv1a018711mg, partial [Erythranthe guttata] Length = 1954 Score = 270 bits (689), Expect = 1e-69 Identities = 150/276 (54%), Positives = 186/276 (67%), Gaps = 3/276 (1%) Frame = -1 Query: 819 YRSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEE 640 Y S VR++IQL S+ + +++ + C G N NS N+ KE IFEE Sbjct: 281 YISGDVRLTIQLPPSATKLTGSIDTERSTC---GGGTPFGNGNSINE-----KEGTIFEE 332 Query: 639 QPQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSV 469 QPQERRSSRL RSRKPGKEES+F++NKD KVVKQFL P+L+ H++ S Sbjct: 333 QPQERRSSRLERLRSRKPGKEESEFSSNKDLAKVVKQFLVPHLLDGTGAIHSKHNSDPSC 392 Query: 468 YCAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEK 289 AE A+SLD++ DVI+FVQ TS NFGAYHM HLLLE+IAN I Y D+ KILDLEK Sbjct: 393 N-AEVTANSLDSEPIDVIEFVQNTSNNFGAYHMGHLLLEKIANSSILYHDNIGKILDLEK 451 Query: 288 LTRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHM 109 RH G +RTPECSLFL ELYYD G+RS +T+ + F SE SYHLCK+IESVAL YPFH+ Sbjct: 452 NIRHWGKERTPECSLFLSELYYDMGLRSSETSIVCSFTSEASYHLCKVIESVALGYPFHI 511 Query: 108 NGMQGKENCSSDDSSEQNHQLPIDSTSLLRNNYPFW 1 +GM G+ D SE N Q +D++SLLR+N+ FW Sbjct: 512 SGMDGEIKFPMADVSEDNQQGQMDNSSLLRSNHRFW 547 
>ref|XP_010648306.1| PREDICTED: uncharacterized protein LOC100254195 isoform X4 [Vitis vinifera] Length = 1278 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 283 NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 342 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 343 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 402 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 403 SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 462 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 463 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 521 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 522 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 562 >ref|XP_010648305.1| PREDICTED: uncharacterized protein LOC100254195 isoform X3 [Vitis vinifera] Length = 1590 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 283 NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 342 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 343 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 402 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 403 
SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 462 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 463 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 521 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 522 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 562 >ref|XP_010648304.1| PREDICTED: uncharacterized protein LOC100254195 isoform X2 [Vitis vinifera] Length = 1851 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 154 NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 213 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 214 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 273 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 274 SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 333 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 334 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 392 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 393 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 433 >ref|XP_010648303.1| PREDICTED: uncharacterized protein LOC100254195 isoform X1 [Vitis vinifera] Length = 1980 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 283 
NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 342 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 343 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 402 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 403 SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 462 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 463 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 521 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 522 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 562 >emb|CBI20600.3| unnamed protein product [Vitis vinifera] Length = 1970 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 308 NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 367 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 368 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 427 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 428 SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 487 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 488 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 546 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 547 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 587 >emb|CAN74834.1| 
hypothetical protein VITISV_023323 [Vitis vinifera] Length = 1610 Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1 Query: 807 HVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQE 628 ++R+SI L S+ N++ ERK G +M + +C S KE FEEQPQE Sbjct: 302 NIRLSIHLPSSAENIVPPGERKGLKFNPVGENMCLGDCKSERASTLKEKEANAFEEQPQE 361 Query: 627 RRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 RRS+RL RSRKP KEE DFA+ KD K V QFLEP++VG + +H A S C E Sbjct: 362 RRSTRLERLRSRKPEKEEVDFASGKDLPKAVIQFLEPFIVGGPGLRNSDHSASSSASCPE 421 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+ + + +DV KFV++TS+N+GA+HM HLLLEE+ANR + YQD K L+LEKLTRH Sbjct: 422 SQANLSENECSDVAKFVKETSKNYGAHHMGHLLLEEVANRDLLYQDYFIKFLELEKLTRH 481 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 GLDRTPECSLFL ELYYD G S + +++ ++M +V+YHLCKIIESVALEYPFH +G+ Sbjct: 482 GGLDRTPECSLFLAELYYDLG-SSSEASSLSDYMEDVTYHLCKIIESVALEYPFHSSGVA 540 Query: 96 GKENCSSDDSSEQNHQLPIDS---------TSLLRNNYPFW 1 G NCS DS + ++ +D+ +S L N FW Sbjct: 541 GNANCSLTDSGQGAGRISLDNSVSQNSLLDSSFLSNKQFFW 581 >ref|XP_006371866.1| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] gi|550318055|gb|ERP49663.1| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] Length = 1967 Score = 247 bits (630), Expect = 8e-63 Identities = 136/281 (48%), Positives = 180/281 (64%), Gaps = 9/281 (3%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS +R++I + + +M +VE+K + + SMS +CNS ++ I +EQ Sbjct: 265 RSGDIRLTINMPSNMEIIMESVEKKGSKSIPSVQSMSFVDCNSERASSVKERDPNIIDEQ 324 Query: 636 PQERRSSRLRSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 P ERRS+RLRSRKPGKEE DF T KD KVV Q +EP++V N ++ + SV C + Sbjct: 325 PHERRSTRLRSRKPGKEELDFDTRKDLAKVVVQLIEPFIVKN---EDSDLVGSCSVPCFD 381 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+SLDT+H DV FV++TS+N+GAYHM HLLLE A+RG+ YQD+ K L+LE+LTRH Sbjct: 382 
Q-ANSLDTEHNDVADFVRETSKNYGAYHMGHLLLEHAASRGLKYQDAFVKFLELERLTRH 440 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 G DRTPEC LFL ELYYD G + + M E++SE SYHLCKIIESVAL+YPFH+ + Sbjct: 441 WGRDRTPECCLFLAELYYDLGSLPSNVSKMSEYLSEASYHLCKIIESVALDYPFHLTHVS 500 Query: 96 GKENCSSDDSSEQNHQLPIDST---------SLLRNNYPFW 1 G N SSD S + + + + T SLL N FW Sbjct: 501 GNINFSSDKSFQDSDETLKEGTGGWDSLLNISLLDNKSSFW 541 >ref|XP_006371865.1| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] gi|550318054|gb|ERP49662.1| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] Length = 1976 Score = 247 bits (630), Expect = 8e-63 Identities = 136/281 (48%), Positives = 180/281 (64%), Gaps = 9/281 (3%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS +R++I + + +M +VE+K + + SMS +CNS ++ I +EQ Sbjct: 265 RSGDIRLTINMPSNMEIIMESVEKKGSKSIPSVQSMSFVDCNSERASSVKERDPNIIDEQ 324 Query: 636 PQERRSSRLRSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 P ERRS+RLRSRKPGKEE DF T KD KVV Q +EP++V N ++ + SV C + Sbjct: 325 PHERRSTRLRSRKPGKEELDFDTRKDLAKVVVQLIEPFIVKN---EDSDLVGSCSVPCFD 381 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+SLDT+H DV FV++TS+N+GAYHM HLLLE A+RG+ YQD+ K L+LE+LTRH Sbjct: 382 Q-ANSLDTEHNDVADFVRETSKNYGAYHMGHLLLEHAASRGLKYQDAFVKFLELERLTRH 440 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 G DRTPEC LFL ELYYD G + + M E++SE SYHLCKIIESVAL+YPFH+ + Sbjct: 441 WGRDRTPECCLFLAELYYDLGSLPSNVSKMSEYLSEASYHLCKIIESVALDYPFHLTHVS 500 Query: 96 GKENCSSDDSSEQNHQLPIDST---------SLLRNNYPFW 1 G N SSD S + + + + T SLL N FW Sbjct: 501 GNINFSSDKSFQDSDETLKEGTGGWDSLLNISLLDNKSSFW 541 >ref|XP_002324750.2| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] gi|550318053|gb|EEF03315.2| hypothetical protein POPTR_0018s04800g [Populus trichocarpa] Length = 1974 Score = 247 bits (630), Expect = 8e-63 Identities = 136/281 (48%), Positives = 180/281 (64%), Gaps = 9/281 (3%) Frame = -1 Query: 816 
RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS +R++I + + +M +VE+K + + SMS +CNS ++ I +EQ Sbjct: 265 RSGDIRLTINMPSNMEIIMESVEKKGSKSIPSVQSMSFVDCNSERASSVKERDPNIIDEQ 324 Query: 636 PQERRSSRLRSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAE 457 P ERRS+RLRSRKPGKEE DF T KD KVV Q +EP++V N ++ + SV C + Sbjct: 325 PHERRSTRLRSRKPGKEELDFDTRKDLAKVVVQLIEPFIVKN---EDSDLVGSCSVPCFD 381 Query: 456 GVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRH 277 A+SLDT+H DV FV++TS+N+GAYHM HLLLE A+RG+ YQD+ K L+LE+LTRH Sbjct: 382 Q-ANSLDTEHNDVADFVRETSKNYGAYHMGHLLLEHAASRGLKYQDAFVKFLELERLTRH 440 Query: 276 RGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQ 97 G DRTPEC LFL ELYYD G + + M E++SE SYHLCKIIESVAL+YPFH+ + Sbjct: 441 WGRDRTPECCLFLAELYYDLGSLPSNVSKMSEYLSEASYHLCKIIESVALDYPFHLTHVS 500 Query: 96 GKENCSSDDSSEQNHQLPIDST---------SLLRNNYPFW 1 G N SSD S + + + + T SLL N FW Sbjct: 501 GNINFSSDKSFQDSDETLKEGTGGWDSLLNISLLDNKSSFW 541 >ref|XP_007012208.1| Tetratricopeptide repeat-like superfamily protein isoform 5 [Theobroma cacao] gi|508782571|gb|EOY29827.1| Tetratricopeptide repeat-like superfamily protein isoform 5 [Theobroma cacao] Length = 1659 Score = 241 bits (615), Expect = 4e-61 Identities = 137/284 (48%), Positives = 177/284 (62%), Gaps = 12/284 (4%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS VR+ I + S VM VE+K P ++G S+ S+C++ KE EEQ Sbjct: 280 RSGDVRLRILIPPGSEIVMEPVEKKVPTSASSGESIPPSDCDTERASNLKEKESNFLEEQ 339 Query: 636 PQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVY 466 PQERRS+RL RSRKPGKEE DFA +KD K+V QFLEP+++ E K+ + S+ Sbjct: 340 PQERRSTRLERLRSRKPGKEEIDFAADKDLAKIVLQFLEPFVISRPEGKDSDDVVNCSMS 399 Query: 465 CAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKL 286 A+ A SLD + DV FV++TS+N+GAYH+ HLLLE N+ + + D++ K L+LEKL Sbjct: 400 YADQ-AYSLDMECQDVANFVKETSKNYGAYHLGHLLLEHATNKSLVHPDAHVKFLELEKL 458 Query: 285 TRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMN 106 TRH G DRTPECSLFL ELYYD G +++ + 
EF+SE SYHLCKIIESVAL++PFHM Sbjct: 459 TRHWGQDRTPECSLFLAELYYDIGSSPSNSSNLSEFLSEASYHLCKIIESVALDHPFHMT 518 Query: 105 GMQGKENCSS------DDSSEQNHQLPIDS---TSLLRNNYPFW 1 G ENCSS D N+ S + L N PFW Sbjct: 519 SSFGNENCSSFKNFLGTDGISPNNSFCESSHLDSFLSSNKSPFW 562 >ref|XP_007012207.1| Tetratricopeptide repeat-like superfamily protein isoform 4 [Theobroma cacao] gi|508782570|gb|EOY29826.1| Tetratricopeptide repeat-like superfamily protein isoform 4 [Theobroma cacao] Length = 1858 Score = 241 bits (615), Expect = 4e-61 Identities = 137/284 (48%), Positives = 177/284 (62%), Gaps = 12/284 (4%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS VR+ I + S VM VE+K P ++G S+ S+C++ KE EEQ Sbjct: 151 RSGDVRLRILIPPGSEIVMEPVEKKVPTSASSGESIPPSDCDTERASNLKEKESNFLEEQ 210 Query: 636 PQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVY 466 PQERRS+RL RSRKPGKEE DFA +KD K+V QFLEP+++ E K+ + S+ Sbjct: 211 PQERRSTRLERLRSRKPGKEEIDFAADKDLAKIVLQFLEPFVISRPEGKDSDDVVNCSMS 270 Query: 465 CAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKL 286 A+ A SLD + DV FV++TS+N+GAYH+ HLLLE N+ + + D++ K L+LEKL Sbjct: 271 YADQ-AYSLDMECQDVANFVKETSKNYGAYHLGHLLLEHATNKSLVHPDAHVKFLELEKL 329 Query: 285 TRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMN 106 TRH G DRTPECSLFL ELYYD G +++ + EF+SE SYHLCKIIESVAL++PFHM Sbjct: 330 TRHWGQDRTPECSLFLAELYYDIGSSPSNSSNLSEFLSEASYHLCKIIESVALDHPFHMT 389 Query: 105 GMQGKENCSS------DDSSEQNHQLPIDS---TSLLRNNYPFW 1 G ENCSS D N+ S + L N PFW Sbjct: 390 SSFGNENCSSFKNFLGTDGISPNNSFCESSHLDSFLSSNKSPFW 433 >ref|XP_007012206.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial [Theobroma cacao] gi|590573754|ref|XP_007012209.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial [Theobroma cacao] gi|590573758|ref|XP_007012210.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial [Theobroma cacao] gi|508782569|gb|EOY29825.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial 
[Theobroma cacao] gi|508782572|gb|EOY29828.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial [Theobroma cacao] gi|508782573|gb|EOY29829.1| Tetratricopeptide repeat-like superfamily protein isoform 3, partial [Theobroma cacao] Length = 1521 Score = 241 bits (615), Expect = 4e-61 Identities = 137/284 (48%), Positives = 177/284 (62%), Gaps = 12/284 (4%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS VR+ I + S VM VE+K P ++G S+ S+C++ KE EEQ Sbjct: 280 RSGDVRLRILIPPGSEIVMEPVEKKVPTSASSGESIPPSDCDTERASNLKEKESNFLEEQ 339 Query: 636 PQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVY 466 PQERRS+RL RSRKPGKEE DFA +KD K+V QFLEP+++ E K+ + S+ Sbjct: 340 PQERRSTRLERLRSRKPGKEEIDFAADKDLAKIVLQFLEPFVISRPEGKDSDDVVNCSMS 399 Query: 465 CAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKL 286 A+ A SLD + DV FV++TS+N+GAYH+ HLLLE N+ + + D++ K L+LEKL Sbjct: 400 YADQ-AYSLDMECQDVANFVKETSKNYGAYHLGHLLLEHATNKSLVHPDAHVKFLELEKL 458 Query: 285 TRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMN 106 TRH G DRTPECSLFL ELYYD G +++ + EF+SE SYHLCKIIESVAL++PFHM Sbjct: 459 TRHWGQDRTPECSLFLAELYYDIGSSPSNSSNLSEFLSEASYHLCKIIESVALDHPFHMT 518 Query: 105 GMQGKENCSS------DDSSEQNHQLPIDS---TSLLRNNYPFW 1 G ENCSS D N+ S + L N PFW Sbjct: 519 SSFGNENCSSFKNFLGTDGISPNNSFCESSHLDSFLSSNKSPFW 562 >ref|XP_007012205.1| Tetratricopeptide repeat-like superfamily protein isoform 2 [Theobroma cacao] gi|508782568|gb|EOY29824.1| Tetratricopeptide repeat-like superfamily protein isoform 2 [Theobroma cacao] Length = 1541 Score = 241 bits (615), Expect = 4e-61 Identities = 137/284 (48%), Positives = 177/284 (62%), Gaps = 12/284 (4%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS VR+ I + S VM VE+K P ++G S+ S+C++ KE EEQ Sbjct: 280 RSGDVRLRILIPPGSEIVMEPVEKKVPTSASSGESIPPSDCDTERASNLKEKESNFLEEQ 339 Query: 636 PQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVY 466 PQERRS+RL RSRKPGKEE DFA +KD K+V QFLEP+++ E 
K+ + S+ Sbjct: 340 PQERRSTRLERLRSRKPGKEEIDFAADKDLAKIVLQFLEPFVISRPEGKDSDDVVNCSMS 399 Query: 465 CAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKL 286 A+ A SLD + DV FV++TS+N+GAYH+ HLLLE N+ + + D++ K L+LEKL Sbjct: 400 YADQ-AYSLDMECQDVANFVKETSKNYGAYHLGHLLLEHATNKSLVHPDAHVKFLELEKL 458 Query: 285 TRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMN 106 TRH G DRTPECSLFL ELYYD G +++ + EF+SE SYHLCKIIESVAL++PFHM Sbjct: 459 TRHWGQDRTPECSLFLAELYYDIGSSPSNSSNLSEFLSEASYHLCKIIESVALDHPFHMT 518 Query: 105 GMQGKENCSS------DDSSEQNHQLPIDS---TSLLRNNYPFW 1 G ENCSS D N+ S + L N PFW Sbjct: 519 SSFGNENCSSFKNFLGTDGISPNNSFCESSHLDSFLSSNKSPFW 562 >ref|XP_007012204.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] gi|508782567|gb|EOY29823.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] Length = 1986 Score = 241 bits (615), Expect = 4e-61 Identities = 137/284 (48%), Positives = 177/284 (62%), Gaps = 12/284 (4%) Frame = -1 Query: 816 RSWHVRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQ 637 RS VR+ I + S VM VE+K P ++G S+ S+C++ KE EEQ Sbjct: 280 RSGDVRLRILIPPGSEIVMEPVEKKVPTSASSGESIPPSDCDTERASNLKEKESNFLEEQ 339 Query: 636 PQERRSSRL---RSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVY 466 PQERRS+RL RSRKPGKEE DFA +KD K+V QFLEP+++ E K+ + S+ Sbjct: 340 PQERRSTRLERLRSRKPGKEEIDFAADKDLAKIVLQFLEPFVISRPEGKDSDDVVNCSMS 399 Query: 465 CAEGVASSLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKL 286 A+ A SLD + DV FV++TS+N+GAYH+ HLLLE N+ + + D++ K L+LEKL Sbjct: 400 YADQ-AYSLDMECQDVANFVKETSKNYGAYHLGHLLLEHATNKSLVHPDAHVKFLELEKL 458 Query: 285 TRHRGLDRTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMN 106 TRH G DRTPECSLFL ELYYD G +++ + EF+SE SYHLCKIIESVAL++PFHM Sbjct: 459 TRHWGQDRTPECSLFLAELYYDIGSSPSNSSNLSEFLSEASYHLCKIIESVALDHPFHMT 518 Query: 105 GMQGKENCSS------DDSSEQNHQLPIDS---TSLLRNNYPFW 1 G ENCSS D N+ S + L N PFW Sbjct: 519 SSFGNENCSSFKNFLGTDGISPNNSFCESSHLDSFLSSNKSPFW 562 >ref|XP_002516492.1| conserved hypothetical protein 
[Ricinus communis] gi|223544312|gb|EEF45833.1| conserved hypothetical protein [Ricinus communis] Length = 1906 Score = 239 bits (610), Expect = 2e-60 Identities = 133/277 (48%), Positives = 171/277 (61%), Gaps = 9/277 (3%) Frame = -1 Query: 804 VRISIQLNLSSGNVMSTVERKEPICMTAGASMSVSNCNSGNDGINIFKEEAIFEEQPQER 625 VR+++ VM + E K P +++ S+ V +CN+ +E EEQP ER Sbjct: 259 VRLTMHFPSHKNIVMGSTEDKGPNPLSS-ESLLVGDCNAERASFTKEREANTSEEQPHER 317 Query: 624 RSSRLRSRKPGKEESDFATNKDFVKVVKQFLEPYLVGNAEPKECNHDAPFSVYCAEGVAS 445 RS+RLRSRKPGKEE DFA +KD K+V Q LEP++V K+ A SV C G + Sbjct: 318 RSTRLRSRKPGKEELDFAASKDLAKIVLQLLEPFVVSGLTSKDSGQAAGHSVSCP-GQVN 376 Query: 444 SLDTQHTDVIKFVQKTSENFGAYHMSHLLLEEIANRGISYQDSNSKILDLEKLTRHRGLD 265 SLD++H DV F+ +TS+N+GAYHM HLLLE A G+ YQD+ K L+LEKLTRH G D Sbjct: 377 SLDSEHDDVSAFLGETSKNYGAYHMGHLLLEHAATGGLGYQDTFIKFLELEKLTRHWGQD 436 Query: 264 RTPECSLFLGELYYDFGIRSLDTAAMREFMSEVSYHLCKIIESVALEYPFHMNGMQGKEN 85 RTPEC LFL ELYY+ G + + + EFMSE SYHLCKIIESVAL+YPF N G + Sbjct: 437 RTPECCLFLAELYYELGSLPSNASKLPEFMSEASYHLCKIIESVALDYPFSSNQFSGSAS 496 Query: 84 CSS-----DD----SSEQNHQLPIDSTSLLRNNYPFW 1 CSS DD S + + Q ++ L+ N PFW Sbjct: 497 CSSLKSFQDDNEIFSKDSSCQDSFFNSPLVINKIPFW 533
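
Note on reading the per-hit statistics: each alignment above carries a header of the form "Score = ... bits (...), Expect = ... Identities = a/b (..%), Positives = ... Gaps = ... Frame = ...". The sketch below is one possible way to pull those numbers out of a flattened report like this one. It is not part of BLAST; the names HSP_RE, parse_hsp_stats and SAMPLE are hypothetical, and the regular expression assumes the legacy BLAST 2.2.x text layout exactly as it appears in this report.

```python
import re

# Assumption: header layout of legacy BLASTX text output as seen in this report.
# The Gaps field is optional because BLAST omits it for ungapped alignments.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsp_stats(report):
    """Return one dict of alignment statistics per header found in `report`."""
    hsps = []
    for m in HSP_RE.finditer(report):
        evalue_text = m.group("evalue")
        # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
        if evalue_text.lower().startswith("e"):
            evalue_text = "1" + evalue_text
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue_text),
            "identities": ident,
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),
            "align_len": length,
            "pct_identity": 100.0 * ident / length,
            "frame": int(m.group("frame")),
        })
    return hsps


# Two headers copied from the report above (Sesamum indicum and Vitis vinifera hits).
SAMPLE = (
    "Score = 300 bits (769), Expect = 6e-79 Identities = 159/276 (57%), "
    "Positives = 193/276 (69%), Gaps = 3/276 (1%) Frame = -1 "
    "Score = 256 bits (655), Expect = 1e-65 Identities = 139/281 (49%), "
    "Positives = 182/281 (64%), Gaps = 12/281 (4%) Frame = -1"
)

if __name__ == "__main__":
    for hsp in parse_hsp_stats(SAMPLE):
        print(hsp["bits"], hsp["evalue"], round(hsp["pct_identity"], 1), hsp["frame"])
```

The regular expression only reads the statistics header, so it does not depend on the Query/Sbjct lines, whose column alignment is lost in this flattened copy.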