BLASTX nr result
ID: Forsythia21_contig00049623
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00049623
         (907 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_009614410.1| PREDICTED: myb-like protein X [Nicotiana tom...   181   6e-43
ref|XP_009783129.1| PREDICTED: uncharacterized protein LOC104231...   173   1e-40
ref|XP_009781432.1| PREDICTED: uncharacterized protein LOC104230...   169   2e-39
ref|XP_009626985.1| PREDICTED: uncharacterized protein LOC104117...   167   9e-39
ref|XP_006339485.1| PREDICTED: uncharacterized protein LOC102596...   161   5e-37
ref|XP_011091426.1| PREDICTED: uncharacterized protein LOC105171...   159   2e-36
ref|XP_011035418.1| PREDICTED: uncharacterized protein LOC105133...   150   1e-33
ref|XP_004229855.1| PREDICTED: uncharacterized protein LOC101264...   149   2e-33
ref|XP_002320096.2| hypothetical protein POPTR_0014s07340g [Popu...   144   1e-31
ref|XP_011023450.1| PREDICTED: uncharacterized protein LOC105124...   135   4e-29
ref|XP_007051946.1| Uncharacterized protein isoform 2 [Theobroma...   135   4e-29
ref|XP_007051945.1| Uncharacterized protein isoform 1 [Theobroma...   135   5e-29
emb|CDP08894.1| unnamed protein product [Coffea canephora]            134   1e-28
ref|XP_002302558.2| hypothetical protein POPTR_0002s15410g [Popu...   133   2e-28
ref|XP_004306853.2| PREDICTED: uncharacterized protein LOC101297...   117   1e-23
ref|XP_010278517.1| PREDICTED: uncharacterized protein LOC104612...   114   7e-23
ref|XP_012843647.1| PREDICTED: uncharacterized protein LOC105963...   114   7e-23
ref|XP_010255548.1| PREDICTED: uncharacterized protein LOC104596...
107 8e-21 emb|CBI38843.3| unnamed protein product [Vitis vinifera] 106 2e-20 ref|XP_003633933.1| PREDICTED: uncharacterized protein LOC100854... 106 2e-20 >ref|XP_009614410.1| PREDICTED: myb-like protein X [Nicotiana tomentosiformis] Length = 456 Score = 181 bits (459), Expect = 6e-43 Identities = 131/305 (42%), Positives = 171/305 (56%), Gaps = 49/305 (16%) Frame = -3 Query: 776 SCEHNPKMLKDFL-RDDSYSSNP----------CKA----------------SPVLLRSR 678 S E PK+LKDFL +DD YS +P CK+ S LLRSR Sbjct: 10 SIERRPKLLKDFLLQDDPYSCSPNDFGSHPRKLCKSKISNFHGSRIKSNKASSHQLLRSR 69 Query: 677 SRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRCENFQ------E 516 S +AA TISAI+KVIN+VKF PF+SVKSP + PR +SRKLSRR+ +R +N + + Sbjct: 70 SSRAATATISAINKVINIVKFLPFASVKSPSIFPRSISRKLSRRTNHR-DNIKQHSSNHD 128 Query: 515 VSVTVKVKDILRWRSFRELVEEKSP-----RCIXXXXXXXXXXXXXXXXXSRH------D 369 VSV VKVKDILRW+SFR+LV+EKS RC S+ D Sbjct: 129 VSVKVKVKDILRWKSFRDLVDEKSTPYSPNRCTTTTTTTNSTTTTSTTTSSKRTSWCDSD 188 Query: 368 FTEEILPFRCSENSVFLH--KNGV-EIDEKCLPKETV-GQPTNDTSGTTRNLKGELSFEE 201 FT E LP EN FL ++G+ ++ + +ETV G T R+ K EL F+E Sbjct: 189 FTAEDLPSWWGENGEFLGELEDGMKKVGRNNIFEETVGGYSMGTTRAIKRDRKEELCFDE 248 Query: 200 SEQHSPVSVLDSLFQEED-ESISPFHHQSFAKIERRRCRLMQMIQEFSSLFEGEKPVFNQ 24 +EQHSPVSVL+S FQE+D E I+ H++ A +++R MQ IQ+F SL EG + Sbjct: 249 NEQHSPVSVLESPFQEDDEEGIAFSFHRNLANLDKRTSMFMQRIQQFESLAEGNTSFEEE 308 Query: 23 DEEGE 9 +EE E Sbjct: 309 EEEEE 313 >ref|XP_009783129.1| PREDICTED: uncharacterized protein LOC104231773 [Nicotiana sylvestris] Length = 394 Score = 173 bits (439), Expect = 1e-40 Identities = 130/292 (44%), Positives = 161/292 (55%), Gaps = 20/292 (6%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDD-SYSSNPCKA------------SPVL 690 MASS NP FS E PK+LKDFL+DD S+ S KA S L Sbjct: 1 MASSCPNPHAK------FSFESRPKLLKDFLQDDISFQSQSQKAYKSTSQRFFNKNSSQL 54 Query: 689 LRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRCENF---Q 519 RSRS +AA TISAIHKVIN+VKF PF+ VKSP +LPR +SRKLSRR+ EN+ Sbjct: 55 
HRSRSSRAASATISAIHKVINIVKFLPFAYVKSPSILPRSISRKLSRRNHKETENYIMNH 114 Query: 518 EVSVT--VKVKDILRWRSFRELVEEKSPRCIXXXXXXXXXXXXXXXXXSRHDFTEEILPF 345 EVSVT VKVKDILRW+S ++LVEEKS DFT E L Sbjct: 115 EVSVTVKVKVKDILRWKSSKDLVEEKS----------TPLDYAYSPFRCDGDFTAENLSS 164 Query: 344 RCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNLK-GELSFEESEQHSPVSVLD 168 C E+ D+K L +E VG+ + TR +K E+EQHSPVSVL+ Sbjct: 165 WCGESDEC-------YDKKNLLEEGVGR---YCARETRGIKLDPEEHYENEQHSPVSVLE 214 Query: 167 SLFQE-EDESISPFHHQSFAKIERRRCRLMQMIQEFSSLFEGEKPVFNQDEE 15 S F+E +DE +S +H++ I+RR+C L + IQ F SL EG +DEE Sbjct: 215 SPFREDQDEYVS--YHRTLVDIDRRKCMLRERIQAFESLEEGNTSYNEEDEE 264 >ref|XP_009781432.1| PREDICTED: uncharacterized protein LOC104230352 [Nicotiana sylvestris] Length = 454 Score = 169 bits (428), Expect = 2e-39 Identities = 131/308 (42%), Positives = 169/308 (54%), Gaps = 52/308 (16%) Frame = -3 Query: 776 SCEHNPKMLKDFL-RDDSYS-------SNP---CKA----------------SPVLLRSR 678 S E PK+LKDFL +DD YS S+P CK+ S LLRSR Sbjct: 11 SIERRPKLLKDFLLQDDPYSCSFNDIGSHPRKLCKSTISNFHGSRIRSNKASSHQLLRSR 70 Query: 677 SRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNR------CENFQE 516 S +AA TISAI+KVIN+VKF PF+SVKSP + P +SRKLS R+ +R + + Sbjct: 71 SSRAATATISAINKVINIVKFLPFASVKSPSIFPLSISRKLSTRTNHRDNIIKQHSSNHD 130 Query: 515 VSVTVKVKDILRWRSFRELVEEKSP---------RC-IXXXXXXXXXXXXXXXXXSRHDF 366 VSV VKVKDILRW+SFR+LV+EKS RC DF Sbjct: 131 VSVKVKVKDILRWKSFRDLVDEKSTPLDSSYSPNRCTTTTTNSTTTTISSKRTSWCDSDF 190 Query: 365 TEEILPFRCSENSVFLH--KNGV-EIDEKCLPKETVGQPTNDT-SGTTRNLKGELSFEES 198 T E LP EN FL ++G+ ++ K + +ETVG T R+ K EL F+E+ Sbjct: 191 TAEDLPSWWGENGEFLGELEDGMKKVGRKNIFEETVGGYGMGTIRAIKRDCKEELCFDEN 250 Query: 197 EQHSPVSVLDSLFQEED-ESISPFHHQSFAKIERRRCRLMQMIQEFSSLFEG----EKPV 33 EQHSPVSVL+S FQE+D E I+ H++ A +++R MQ IQ+F SL EG E+ Sbjct: 251 EQHSPVSVLESPFQEDDEEGIAFSFHRNLANLDKRTSTFMQKIQQFESLAEGNTSFEEEQ 310 Query: 32 FNQDEEGE 9 Q+EE E Sbjct: 311 QQQEEEEE 318 >ref|XP_009626985.1| PREDICTED: uncharacterized protein LOC104117616 [Nicotiana tomentosiformis] Length = 400 Score = 167 
bits (423), Expect = 9e-39 Identities = 126/297 (42%), Positives = 156/297 (52%), Gaps = 25/297 (8%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDDS--YSSN------PCKA--------- 702 MASS +P FS E PK+LKDFL+D+S YSSN P KA Sbjct: 1 MASSCPSPHAK------FSFECRPKLLKDFLQDESPSYSSNISFQSHPQKAYKSTSQRFL 54 Query: 701 ---SPVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRC 531 S L RSRS +AA TISAIHKVIN+VKF PF+ VKSP +LPR +SRKLSRR+ Sbjct: 55 NKNSSQLHRSRSSRAASATISAIHKVINIVKFLPFTYVKSPSILPRIISRKLSRRNHKET 114 Query: 530 ENF---QEVSVT--VKVKDILRWRSFRELVEEKSPRCIXXXXXXXXXXXXXXXXXSRHDF 366 EN+ EVSVT VKVKDILRW+S ++LVEEKS + DF Sbjct: 115 ENYILNHEVSVTVKVKVKDILRWKSSKDLVEEKS------------TPLDYAHSPFKCDF 162 Query: 365 TEEILPFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNLKGELSFEESEQHS 186 T E L C E K + L + G +T G + K E+EQHS Sbjct: 163 TAENLSSWCGEIDECYGKRNL------LEEGVGGCCVTETRGIKLDPK---EHYENEQHS 213 Query: 185 PVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQMIQEFSSLFEGEKPVFNQDEE 15 PVSVL+S F+E+ + F H++ +RR+C L + IQ F SL EG +DEE Sbjct: 214 PVSVLESPFREDQDEYFSF-HRTLVDTDRRKCMLRERIQAFESLEEGNTSFNEEDEE 269 >ref|XP_006339485.1| PREDICTED: uncharacterized protein LOC102596041 [Solanum tuberosum] Length = 400 Score = 161 bits (408), Expect = 5e-37 Identities = 128/307 (41%), Positives = 163/307 (53%), Gaps = 32/307 (10%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFL-RDDSYSSNPCKASPV------------- 693 MASS +P S E PK+LK+FL +DD YSSN + P Sbjct: 1 MASSSPSPLAR------LSLERRPKLLKEFLLQDDPYSSNDFGSYPKTCIHGSRIIRSNK 54 Query: 692 -----LLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRS---RN 537 LLRSRS +AA TISAI+KVIN+VKF PF+SVKSP + PR +SRKLSRR+ ++ Sbjct: 55 GSSHQLLRSRSSRAATATISAINKVINIVKFLPFTSVKSPSIFPRSISRKLSRRNNYKKS 114 Query: 536 RCENF-QEVSVTVKVKDILRWRSFRELVEEKSPRCIXXXXXXXXXXXXXXXXXSR----- 375 + N Q+VSV VKVKDILRW+SFR+L E+ +P R Sbjct: 115 QKHNVDQDVSVKVKVKDILRWKSFRDLEEKSTPLDSSYSPYRCGTITTTTTSSKRTSWCD 174 Query: 374 HDFTEEILPFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNLKGELSFEESE 195 DFT E LP EN FL + K +VG +T+ + N K EL F+E+E Sbjct: 175 
SDFTAEDLPSWWGENGEFLGR-----------KNSVGGYCMETTKSIIN-KEELCFDENE 222 Query: 194 QHSPVSVLDSLFQE-EDESISPFHHQSFAKIERRRCRLMQMIQEFSSLFE---GEKPVFN 27 Q+SPVS+L+S FQE +DE I F Q +R+ LMQ IQ+F SL E K V Sbjct: 223 QYSPVSILESPFQEDDDEGIMAFSFQ------KRKSMLMQRIQQFESLAEENINSKGVEE 276 Query: 26 QDEEGEI 6 E+ EI Sbjct: 277 LKEDEEI 283 >ref|XP_011091426.1| PREDICTED: uncharacterized protein LOC105171871 [Sesamum indicum] Length = 384 Score = 159 bits (403), Expect = 2e-36 Identities = 112/241 (46%), Positives = 136/241 (56%), Gaps = 24/241 (9%) Frame = -3 Query: 776 SCEHNPKMLKDFLRDDSY-----------------SSNPCKASPVLLRSRSRKAAETTIS 648 SCE P MLKDFLRDD++ SS+P K S VL+RSRS+KAA TTIS Sbjct: 5 SCETRPMMLKDFLRDDAHTSYNGFPINPRNSRMLISSHPDKLSGVLIRSRSKKAAATTIS 64 Query: 647 AIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRCENFQEV-SVTVKVKDILRWRS 471 AIHKVINVVK F F SV+SPLVLPR +SRK S +SR+R + ++ + VKVKDILRW+S Sbjct: 65 AIHKVINVVKLFQFGSVRSPLVLPRSISRK-SPKSRDRNVDISDLPEIKVKVKDILRWKS 123 Query: 470 FRELVEEKS-----PRCIXXXXXXXXXXXXXXXXXSRHDFTEEILPFRCSENSVFLHKNG 306 FR+LVE+ P DFT E LP E FL K Sbjct: 124 FRDLVEDDPVPLDVPSSPGRSTTGTTTSCSQRSSWCESDFTAEELPPWGGETEQFLGK-- 181 Query: 305 VEIDEKCLPKETVGQPTNDTSGTTRNLKGELS-FEESEQHSPVSVLDSLFQEEDESISPF 129 KC +GQP + R +KG+ S FEE EQ SPVSVLDS F++ +E +SP Sbjct: 182 -----KCF----LGQPAD------RKVKGDWSMFEEMEQQSPVSVLDSPFRQVEEFLSPS 226 Query: 128 H 126 H Sbjct: 227 H 227 >ref|XP_011035418.1| PREDICTED: uncharacterized protein LOC105133229 [Populus euphratica] Length = 471 Score = 150 bits (378), Expect = 1e-33 Identities = 112/300 (37%), Positives = 149/300 (49%), Gaps = 50/300 (16%) Frame = -3 Query: 803 TNSRFQSAFSCEHNPKMLKDFLRDD-----------SYSSNPCK---------------- 705 TN + ++ FS EH PK+LKDFL DD S+S PC Sbjct: 11 TNFKQRNHFSIEHRPKLLKDFLIDDDSNSCSSSGFRSFSRKPCDPTMKTLIEIDLSNPKG 70 Query: 704 --------ASPVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSR 549 AS LLRSRS+ AA TTISA VIN V+ F++VKSP +LPR +SRKLS+ Sbjct: 71 IANSSNNIASYKLLRSRSKAAASTTISAFQAVINAVRNIHFTAVKSPSILPRSLSRKLSK 130 Query: 548 
RSRNRCENFQEVSVTVKVKDILRWRSFRELVEEKS---------PRCIXXXXXXXXXXXX 396 + R EN EV +TV +KDI+RWRSFR++VEEKS CI Sbjct: 131 KKRQNKEN--EVKITVTIKDIIRWRSFRDIVEEKSLASDLPSSPYHCITTTTASTSTSPR 188 Query: 395 XXXXXSRHDFTEEILP------FRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTT 234 DFT + LP +C E +E+ +K P VG+ + +T T Sbjct: 189 SGSSWCDSDFTSDYLPPWNGTLDKCGEKE-------MEVGQKNSP--CVGEDSLETITNT 239 Query: 233 RNLKGELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQMIQEFSSL 54 + G EE HSPVSV + F+E+++S S F QS A +ER R ++M+ I+ F L Sbjct: 240 K--VGPEEDEEERLHSPVSVTEFEFEEDEDSSSSF-EQSLATVERTREKIMEKIRRFEGL 296 >ref|XP_004229855.1| PREDICTED: uncharacterized protein LOC101264556 [Solanum lycopersicum] Length = 407 Score = 149 bits (377), Expect = 2e-33 Identities = 117/294 (39%), Positives = 155/294 (52%), Gaps = 33/294 (11%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFL-RDDSYSSNPCKASP-------------- 696 MASS +P S E PK+LK+FL +DD YSSN + P Sbjct: 1 MASSSPSPLAR------LSLERRPKLLKEFLLQDDPYSSNDFGSYPNKYIHGSTIIRSNK 54 Query: 695 ----VLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRS---RN 537 LLRSRS +AA TISAI+KVI++VKF PF+SVKSP + PR +SRKLSR++ ++ Sbjct: 55 GSSHQLLRSRSSRAATATISAINKVISIVKFLPFTSVKSPSIFPRNISRKLSRKNNYKKS 114 Query: 536 RCENF-QEVSVTVKVKDILRWRSFRELVEEKSPRC----------IXXXXXXXXXXXXXX 390 + N Q+VSV VKVKDILRW+SFR+L EEKS Sbjct: 115 QKHNVDQDVSVKVKVKDILRWKSFRDLAEEKSTPLDSSYSPYRYGTITAMTTTTITTGKR 174 Query: 389 XXXSRHDFTEEILPFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNLKGELS 210 D T E LP EN L + K +VG +T+ + N K EL Sbjct: 175 TSWCDSDSTAEDLPSWWGENGELLGR-----------KNSVGGYCMETTKSIIN-KEELC 222 Query: 209 FEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQMIQEFSSLFE 48 F+E+EQHSPVS+L+S FQE+D+ S +F+ ++R+ L+ IQ+F SL E Sbjct: 223 FDENEQHSPVSILESPFQEDDDEGS----MAFS-FQKRKSMLLHRIQQFESLAE 271 >ref|XP_002320096.2| hypothetical protein POPTR_0014s07340g [Populus trichocarpa] gi|550323705|gb|EEE98411.2| hypothetical protein POPTR_0014s07340g [Populus trichocarpa] Length = 353 Score = 144 bits (362), Expect = 1e-31 Identities = 107/300 (35%), Positives 
= 145/300 (48%), Gaps = 50/300 (16%) Frame = -3 Query: 803 TNSRFQSAFSCEHNPKMLKDFLRDD-----------SYSSNPCK---------------- 705 TN + ++ S EH P++LKDFL DD S+S PC Sbjct: 11 TNFKQRNHVSIEHRPQLLKDFLIDDDSNLCSSSGFRSFSRKPCDSTMKTLIEIDLRNPKR 70 Query: 704 --------ASPVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSR 549 AS LLRSRS+ AA TTISA VIN VK F++VK P +LPR +SRKLS+ Sbjct: 71 IANSSNNIASYKLLRSRSKAAASTTISAFQAVINAVKNIHFTAVKPPSILPRSLSRKLSK 130 Query: 548 RSRNRCENFQEVSVTVKVKDILRWRSFRELVEEKS---------PRCIXXXXXXXXXXXX 396 + EN EV +TV +KDI+RWRSFR++VEEKS CI Sbjct: 131 KKSQNKEN--EVEITVTIKDIIRWRSFRDIVEEKSLPSDLPSSPYHCITTTTGSTSTTPR 188 Query: 395 XXXXXSRHDFTEEILP------FRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTT 234 DFT + LP C E + + K + C+ ++++ TN G Sbjct: 189 SGSSWCDSDFTSDYLPPWNGNFDECGEKEIEVGKE----NSPCVGEDSLELITNTKVGPE 244 Query: 233 RNLKGELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQMIQEFSSL 54 + EE HSPVSV + F+E+++S S F QS A +ER R ++M+ I+ F SL Sbjct: 245 ED-------EEERLHSPVSVTEFEFEEDEDSSSSF-EQSLATVERTREKIMEKIRRFESL 296 >ref|XP_011023450.1| PREDICTED: uncharacterized protein LOC105124929 [Populus euphratica] Length = 472 Score = 135 bits (340), Expect = 4e-29 Identities = 105/307 (34%), Positives = 152/307 (49%), Gaps = 51/307 (16%) Frame = -3 Query: 821 SLNNPQTNSRF--QSAFSCEHNPKMLKDFLRDDSYSSN----------PCK--------- 705 S N +N+ + ++ FS E PKMLK+FL DDSYS + PC Sbjct: 3 SFYNSYSNTSYSQRNHFSVERRPKMLKEFLIDDSYSCSSWGFESFSRKPCDSTMKTLIEI 62 Query: 704 ---------------ASPVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRF 570 AS LL+SRS+ A TTISA ++N VK F +VKSP +LPR Sbjct: 63 DIYNPRNVANPSNNIASYKLLKSRSKAVASTTISAFQAMMNAVKNVHFIAVKSPSLLPRS 122 Query: 569 VSRKLSRRSRNRCENFQEVSVTVKVKDILRWRSFRELVEEKSP---------RCIXXXXX 417 +SR+LS++ EN EV +T+ VKDI+RW+SFR++VEEK+P C Sbjct: 123 LSRRLSKKKCQNKEN--EVKMTILVKDIIRWKSFRDIVEEKAPPSDLPSSPHHCTTTTTR 180 Query: 416 XXXXXXXXXXXXSRHDFTEEILPF------RCSENSVFLHKNGVEIDEKCLPKETVGQPT 255 DF + LP C EN V K + C+ ++++ Q T Sbjct: 181 STSTTPRSGSSWCDSDFNSDYLPSWNGNFDECVENEVGAGKKFL----PCVGEDSL-QAT 235 Query: 254 
NDTSGTTRNLKGELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQM 75 + T+ G EE +QH+PVSV++ F+E++ES S F HQS A ++R R ++M+ Sbjct: 236 TEARIYTK--VGPKEDEEEQQHTPVSVIEFHFEEDEESSSSF-HQSLATLDRTREKIMEK 292 Query: 74 IQEFSSL 54 I+ S+ Sbjct: 293 IRRSESV 299 >ref|XP_007051946.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508704207|gb|EOX96103.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 447 Score = 135 bits (340), Expect = 4e-29 Identities = 115/307 (37%), Positives = 151/307 (49%), Gaps = 48/307 (15%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDDSYS--SNPCKASP------------- 696 MASS T SR + F E P+MLKDFL DDS S SN K+ P Sbjct: 1 MASS-----TISRQRKHFPLERRPRMLKDFLLDDSNSCSSNGFKSFPRKTCQSIRNLIET 55 Query: 695 -------------VLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKL 555 L RSRS KAA TTIS +I V+ F+SVKSP +LPR +SRKL Sbjct: 56 DLNSSHAKPSYAQQLQRSRS-KAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKL 114 Query: 554 SRRSRNRCENFQEVSVTVKVKDILRWRSFRELVEEKSP---------RCIXXXXXXXXXX 402 S+++ + E TV+VKDI+RW+S R+LVEEK P C Sbjct: 115 SKKN---SQKETETRTTVRVKDIIRWKSSRDLVEEKFPPADFASSPHHCTTRSTTTTTTT 171 Query: 401 XXXXXXXSRH-------DFTEEILPFRCSENSVFLHKNGVEIDEKCLP---KETVGQPTN 252 S + DFT E LP S H++ V++ +K LP K+ + T Sbjct: 172 GSKSTPCSSNSSSWCDSDFTSEYLP------SEEYHESEVDVGKKFLPCVGKDPMETTTG 225 Query: 251 DTSGTTRNLKG-ELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQM 75 + T KG + + EE EQHSP+SVLD ++E+DE ++S A +ER+R +LMQ Sbjct: 226 LAANTAVGPKGRKHASEEKEQHSPLSVLDFEYEEDDEESLSSFNRSLATMERKRQKLMQN 285 Query: 74 IQEFSSL 54 IQ F SL Sbjct: 286 IQRFESL 292 >ref|XP_007051945.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508704206|gb|EOX96102.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 448 Score = 135 bits (339), Expect = 5e-29 Identities = 115/310 (37%), Positives = 149/310 (48%), Gaps = 51/310 (16%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDDSYS--SNPCKASP------------- 696 MASS T SR + F E P+MLKDFL DDS S SN K+ P Sbjct: 1 
MASS-----TISRQRKHFPLERRPRMLKDFLLDDSNSCSSNGFKSFPRKTCQSIRNLIET 55 Query: 695 -------------VLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKL 555 L RSRS KAA TTIS +I V+ F+SVKSP +LPR +SRKL Sbjct: 56 DLNSSHAKPSYAQQLQRSRS-KAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKL 114 Query: 554 SRRSRNRCENFQEVSVTVKVKDILRWRSFRELVEEKSP---------RCIXXXXXXXXXX 402 S+++ + E TV+VKDI+RW+S R+LVEEK P C Sbjct: 115 SKKN---SQKETETRTTVRVKDIIRWKSSRDLVEEKFPPADFASSPHHCTTRSTTTTTTT 171 Query: 401 XXXXXXXSRH-------DFTEEILPFRCSENSVFLHKNGVEIDEKCLP-------KETVG 264 S + DFT E LP S H++ V++ +K LP + T G Sbjct: 172 GSKSTPCSSNSSSWCDSDFTSEYLP------SEEYHESEVDVGKKFLPCVGKDPMETTTG 225 Query: 263 QPTNDTSGTTRNLKGELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRL 84 N G + K + EE EQHSP+SVLD ++E+DE ++S A +ER+R +L Sbjct: 226 LAANTAVGPKQGRKH--ASEEKEQHSPLSVLDFEYEEDDEESLSSFNRSLATMERKRQKL 283 Query: 83 MQMIQEFSSL 54 MQ IQ F SL Sbjct: 284 MQNIQRFESL 293 >emb|CDP08894.1| unnamed protein product [Coffea canephora] Length = 377 Score = 134 bits (336), Expect = 1e-28 Identities = 101/268 (37%), Positives = 138/268 (51%), Gaps = 25/268 (9%) Frame = -3 Query: 785 SAFSCEHNPKMLKDFLRDDSYSS-----------NPCKASPVLLRSRSRKAAETTISAIH 639 +++SCE P++LKDFL DDS S + ++ LLRSRS KA +ISAIH Sbjct: 10 ASYSCERRPRLLKDFLTDDSAPSCSSNGRFMNSFHKNSSNTQLLRSRS-KATSISISAIH 68 Query: 638 K----VINVVKFFPFSS-VKSPLVLPRFVSRKLSRRSRNRCENF-------QEVSV-TVK 498 K VIN +KF PF+S VKS +LPR +SRKLSR ++ +EVSV T K Sbjct: 69 KASEMVINAIKFLPFASMVKSHSILPRSISRKLSRSRDHKDSKLAPVVPGAEEVSVATPK 128 Query: 497 VKDILRWRSFRELVEEKSPRCIXXXXXXXXXXXXXXXXXSRHDFTEEILP-FRCSENSVF 321 +KDILRW+SFR++VEE S DFT E LP + +N+ F Sbjct: 129 IKDILRWKSFRDVVEELS------------------TSWYERDFTAEDLPSWGGDQNTEF 170 Query: 320 LHKNGVEIDEKCLPKETVGQPTNDTSGTTRNLKGELSFEESEQHSPVSVLDSLFQEEDES 141 + N + ++G+ S +E+E HSPVSVLD+ ED Sbjct: 171 MDHN------------------------RKKMEGQFSIDENELHSPVSVLDNSPFREDGG 206 Query: 140 ISPFHHQSFAKIERRRCRLMQMIQEFSS 57 F ++S +ERR+C LMQ I+EF + Sbjct: 207 FVSFFNRSIDTMERRKCILMQRIEEFEN 234 >ref|XP_002302558.2| hypothetical protein 
POPTR_0002s15410g [Populus trichocarpa] gi|550345081|gb|EEE81831.2| hypothetical protein POPTR_0002s15410g [Populus trichocarpa] Length = 473 Score = 133 bits (334), Expect = 2e-28 Identities = 109/308 (35%), Positives = 152/308 (49%), Gaps = 48/308 (15%) Frame = -3 Query: 833 SMASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDDSYS---------------------- 720 S +S +N N R + FS E PKMLK+FL DDSYS Sbjct: 3 SFYNSYSNTNYNQR--NHFSIERTPKMLKEFLIDDSYSCSSRGFKSFSRKPSDSTMKTLI 60 Query: 719 ----------SNPCK--ASPVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLP 576 +NP AS LL+SRS+ AA TTISA ++N VK F ++KSP +LP Sbjct: 61 EIDIYNPRNVANPSNNIASYKLLKSRSKAAASTTISAFQAMMNAVKNVHFIAIKSPSLLP 120 Query: 575 RFVSRKLSRRSRNRCENFQEVSVTVKVKDILRWRSFRELVE-EKSP---------RCIXX 426 R +SR+LS++ EN EV +T+ VKDI+RW+SFR++VE +K+P C Sbjct: 121 RSLSRRLSKKKCQNKEN--EVKMTITVKDIIRWKSFRDIVEDDKAPPSDLPPSPHHCTTT 178 Query: 425 XXXXXXXXXXXXXXXSRHDFTEEILPFRCSENSVF--LHKNGVEIDEKCLPKETVGQPTN 252 DF + LP S N F +N V +K LP VG+ + Sbjct: 179 TTRSTSTTPRSGSSWCDSDFNSDYLP---SWNGNFDECVENEVGAGKKFLP--CVGEDSL 233 Query: 251 D--TSGTTRNLKGELSFEESEQHSPVSVLDSLFQEEDESISPFHHQSFAKIERRRCRLMQ 78 + T T G E+ +QHSPVSV++ F+E++ES S F HQS A + R R ++M+ Sbjct: 234 EATTEARTYTKVGPKEDEDEQQHSPVSVIEFHFEEDEESSSSF-HQSLATLNRTREKIME 292 Query: 77 MIQEFSSL 54 I+ S+ Sbjct: 293 KIRRSESV 300 >ref|XP_004306853.2| PREDICTED: uncharacterized protein LOC101297873 [Fragaria vesca subsp. 
vesca] Length = 443 Score = 117 bits (292), Expect = 1e-23 Identities = 108/330 (32%), Positives = 158/330 (47%), Gaps = 57/330 (17%) Frame = -3 Query: 827 ASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDDSYSSNP-------------CKAS---- 699 +SS + N + F E P MLKDFL ++S S + CKAS Sbjct: 3 SSSASCSYQNVHQKKPFPIERRPTMLKDFLNENSNSCSSSGFKSFPRKPELDCKASNPNP 62 Query: 698 -----PVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSR-N 537 L RSRS KAA TTISA ++N VK F++VK+P +LPR +SR+LS+RS + Sbjct: 63 TATITSKLQRSRS-KAASTTISAFQSIMNAVKNIQFTAVKTPSLLPRSLSRRLSKRSSWS 121 Query: 536 RCENFQ-EVSVTVKVKDILRWRSFRELVEEKSPRCI-------------------XXXXX 417 R ++ Q +V ++VKVKDILRW SFR +E+ P+ + Sbjct: 122 RKQSLQTQVQISVKVKDILRWTSFR---DERLPQSLPWDFASSPHHCTTATTVTDTTTTT 178 Query: 416 XXXXXXXXXXXXSRHDFTEEILPFRCSENSVFLHKNGVEIDEKCLPKETVGQPTND-TSG 240 DFT E L C + E+ +K P VG+ + + T+G Sbjct: 179 TTCSNSSNGSSWCDSDFTAEFLQSPCDGEN--------EMGKKYSP--CVGRVSMEATAG 228 Query: 239 TTRNLKGE-----LSFEESEQHSPVSVLDSLF-QEEDESISPFHHQSFAKIERRRCRLMQ 78 R + + LS +E EQHSPVSVL+ F ++E+E+ S QS A +ER + LMQ Sbjct: 229 PARCSELDPKVEVLSCDEDEQHSPVSVLNFQFGEDEEETFSTTFDQSLANVERTKVMLMQ 288 Query: 77 MIQEFSSL-------FEGEKPVFNQDEEGE 9 +++F L E+ ++ ++E GE Sbjct: 289 RLKQFEGLANLDNSWLSPEEGLYYEEEAGE 318 >ref|XP_010278517.1| PREDICTED: uncharacterized protein LOC104612690 [Nelumbo nucifera] gi|720072868|ref|XP_010278518.1| PREDICTED: uncharacterized protein LOC104612690 [Nelumbo nucifera] gi|720072871|ref|XP_010278519.1| PREDICTED: uncharacterized protein LOC104612690 [Nelumbo nucifera] gi|720072874|ref|XP_010278521.1| PREDICTED: uncharacterized protein LOC104612690 [Nelumbo nucifera] gi|720072877|ref|XP_010278522.1| PREDICTED: uncharacterized protein LOC104612690 [Nelumbo nucifera] Length = 507 Score = 114 bits (286), Expect = 7e-23 Identities = 113/344 (32%), Positives = 159/344 (46%), Gaps = 78/344 (22%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDD--SYSSNPCKASPV------------ 693 MAS + P + S + F + P MLKDFLRDD S SSN ++ P Sbjct: 1 
MASLRSGPISESIRRKPFLIDKRPPMLKDFLRDDLNSCSSNGFQSYPRRSCCTTVRNLLD 60 Query: 692 -------------LLRSRSRKAAETTISAIHK----VINVVKFFPFSSV-KSPL------ 585 L+RSRS KAA TTISA+HK V+ VK+FPFS + KSP Sbjct: 61 IDFKSRESSQRRRLVRSRS-KAASTTISALHKASEAVLAAVKYFPFSPINKSPSTQPQNR 119 Query: 584 ----VLPRFVSRKLSRRSRNRCENFQ-EVSVTVKVKDILRWRSFRELVEEK--------- 447 +LPR +SR+L R + + + E+ VTV+VKDILRW+SFR+L+E+ Sbjct: 120 PKVGILPRSISRRLRRSFWRKTDKEEHEIIVTVRVKDILRWKSFRDLIEDSEKPSDFSPT 179 Query: 446 ------SPRCIXXXXXXXXXXXXXXXXXSR-----------HDFTEEILPFRCSENSVFL 318 P C+ S DFT + L S +S L Sbjct: 180 PLNEKAKPLCLSSSPLPRTTTTTTTTTTSSSITSSSNSWADSDFTSDYLQ-SSSVSSECL 238 Query: 317 HKNGVEIDEKCLPKE------TVGQPTNDTSGT-TRNLKGELSFEESEQHSPVSVLDSLF 159 + E ++CL E G+ + +T+ T + + K +EE EQ SPVSVLD F Sbjct: 239 GETDGEDSKRCLSSEKQTFNNKAGEGSMETTTTYSVDAKSACQYEEKEQFSPVSVLDFPF 298 Query: 158 QEEDESISPF--HHQSFAKIERRRCRLMQMIQEFSSLFEGEKPV 33 +EEDE +QS + ++ + +LMQ I+ F +F +PV Sbjct: 299 EEEDEDTETHSSSNQSLSNMDWAKQKLMQKIRRF-EIFAQLEPV 341 >ref|XP_012843647.1| PREDICTED: uncharacterized protein LOC105963748 [Erythranthe guttatus] gi|848886814|ref|XP_012843648.1| PREDICTED: uncharacterized protein LOC105963748 [Erythranthe guttatus] gi|848886816|ref|XP_012843649.1| PREDICTED: uncharacterized protein LOC105963748 [Erythranthe guttatus] gi|604321470|gb|EYU32046.1| hypothetical protein MIMGU_mgv1a020770mg [Erythranthe guttata] Length = 378 Score = 114 bits (286), Expect = 7e-23 Identities = 103/269 (38%), Positives = 129/269 (47%), Gaps = 53/269 (19%) Frame = -3 Query: 791 FQSAFSCEHNPKMLKDFL-RDD---------SYSSN----------PCKASP-------- 696 FQ + E+ P MLKD+L RDD S SSN PCK++ Sbjct: 6 FQESAKLENRPMMLKDYLIRDDCDHNSCSSSSSSSNGGFQMYPRRKPCKSTAAAVPSKKS 65 Query: 695 -------VLLRSRSRKAAETT--ISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRS 543 VLLRS SRKAA T +AIHKVINVVK F F+S KSPL+LPR +S+K S+ Sbjct: 66 TNKPGHIVLLRSWSRKAAAATRTSAAIHKVINVVKLFHFASAKSPLLLPRKMSKK-SKED 124 Query: 542 RNRC---ENFQEVSVTVKVKDILRWRSFRELVEE-------------KSPRCIXXXXXXX 411 +N + EV V VKVKDILRWRSFR++ EE +S Sbjct: 
125 KNGAVLGDVTPEVKVKVKVKDILRWRSFRDVAEEDNSSPWDFPSSPSRSTTTATATATTT 184 Query: 410 XXXXXXXXXXSRHDFTEEILPFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTR 231 DFT E E+ ++ +NG + +KC T G T+ T Sbjct: 185 TTSSSKRSSWCDSDFTAE-------ESQLWGGENGECLGKKCF---TAG---FTTTTLTS 231 Query: 230 NLKGELSFEESEQHSPVSVLDSLFQEEDE 144 G+ S EE EQ SP+SVLDS FQE +E Sbjct: 232 PKVGDFSIEEYEQKSPISVLDSPFQEVEE 260 >ref|XP_010255548.1| PREDICTED: uncharacterized protein LOC104596185 isoform X3 [Nelumbo nucifera] Length = 498 Score = 107 bits (268), Expect = 8e-21 Identities = 101/320 (31%), Positives = 144/320 (45%), Gaps = 67/320 (20%) Frame = -3 Query: 830 MASSLNNPQTNSRFQSAFSCEHNPKMLKDFLRDD--SYSSNPCKASP------------- 696 MAS + P S+ + F E P MLKDFL +D S SS+ ++ P Sbjct: 1 MASLSSGPAIESKRRKPFLVEQRPLMLKDFLSEDFNSCSSSGFQSFPRRSCFTTVGNLLE 60 Query: 695 ------------VLLRSRSRKAAETTISAIHK----VINVVKFFPFSSVKSPL------- 585 L R RS K+A TTISA+HK V+ +K+FPFS KSP Sbjct: 61 MDRKSRASNQRRRLFRRRS-KSASTTISALHKASEAVLTAIKYFPFSPTKSPSPPQNNRP 119 Query: 584 ---VLPRFVSRKL--SRRSRNRCENFQEVSVTVKVKDILRWRSFRELVEEK--------- 447 +LPR +SR+L S +N E+ E+ VT +VKDI+RWRSFR+L+EEK Sbjct: 120 KLGILPRSLSRRLRGSFWKKNDKED-HEIKVTTRVKDIVRWRSFRDLIEEKPKPLDFSSS 178 Query: 446 -----SPRCIXXXXXXXXXXXXXXXXXSRHDFTEEILPFRCSENSVFLHKNGVEIDEKCL 282 + S DFT + L S +S + +N E ++C Sbjct: 179 LRSRSTTTTTNTTTTTSSRTTSNSSSWSDSDFTSDYLQ-SSSVSSEYSGENEGEEGKRCS 237 Query: 281 PK----------ETVGQPTNDTSGTTRNLKGELSFEESEQHSPVSVLDSLFQEEDESISP 132 + + TN + T +K E + E EQ SPVSVL+ ++E++E+ + Sbjct: 238 SEPDEEKLSNKASELSVETNAYALQTNKVKSESPYREKEQFSPVSVLNFPYEEDEETENS 297 Query: 131 FHHQSFAKIERRRCRLMQMI 72 HQS IER + + Q I Sbjct: 298 SFHQSLPDIERAKQKKKQKI 317 >emb|CBI38843.3| unnamed protein product [Vitis vinifera] Length = 507 Score = 106 bits (265), Expect = 2e-20 Identities = 100/296 (33%), Positives = 132/296 (44%), Gaps = 36/296 (12%) Frame = -3 Query: 800 NSRFQSAFSCEHNPKMLKDFLRDDS-------YSSNPCKAS------------------- 699 NS + F E P+MLKDFL DDS + S P KAS Sbjct: 48 
NSMQRRHFQIERRPRMLKDFLIDDSNSCSSNGFKSFPRKASHCTVRNLLECDGGERGSNS 107 Query: 698 ---PVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRCE 528 LRSRS AA TTISA K +V ++VKSP L R + R+ RR N E Sbjct: 108 NNKSTFLRSRSTAAA-TTISAFQKASEIV----INAVKSPSFLQRSLKRRFWRR--NSTE 160 Query: 527 NFQEVSVTVKVKDILRWRSFRELVEEKS-PRCIXXXXXXXXXXXXXXXXXSRHDFTEEIL 351 ++VKDI+R RSFR+++EEKS P DFT + + Sbjct: 161 ERGMTVTVIRVKDIVRMRSFRDVMEEKSAPSDSTGSITTTATTSSNGSSWCGSDFTADYV 220 Query: 350 PFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNL-----KGELSFEESEQHS 186 + + N V++ EK LP G N T+ TT ++ KGEL EE EQHS Sbjct: 221 QSWSGNSEEYSGANAVKVGEKYLPGVGDGN-ANTTTRTTMSIDAVGPKGELRCEEKEQHS 279 Query: 185 PVSVLDSLFQEEDESISPF-HHQSFAKIERRRCRLMQMIQEFSSLFEGEKPVFNQD 21 PVSVLD F E+ E S F + + E + +L+ ++ S+ E K N D Sbjct: 280 PVSVLDCPFTEDAEPFSFFIRDEEEEEAEEKAMQLLDQVKAAGSV-EFSKANLNDD 334 >ref|XP_003633933.1| PREDICTED: uncharacterized protein LOC100854402 [Vitis vinifera] Length = 393 Score = 106 bits (265), Expect = 2e-20 Identities = 100/296 (33%), Positives = 132/296 (44%), Gaps = 36/296 (12%) Frame = -3 Query: 800 NSRFQSAFSCEHNPKMLKDFLRDDS-------YSSNPCKAS------------------- 699 NS + F E P+MLKDFL DDS + S P KAS Sbjct: 13 NSMQRRHFQIERRPRMLKDFLIDDSNSCSSNGFKSFPRKASHCTVRNLLECDGGERGSNS 72 Query: 698 ---PVLLRSRSRKAAETTISAIHKVINVVKFFPFSSVKSPLVLPRFVSRKLSRRSRNRCE 528 LRSRS AA TTISA K +V ++VKSP L R + R+ RR N E Sbjct: 73 NNKSTFLRSRSTAAA-TTISAFQKASEIV----INAVKSPSFLQRSLKRRFWRR--NSTE 125 Query: 527 NFQEVSVTVKVKDILRWRSFRELVEEKS-PRCIXXXXXXXXXXXXXXXXXSRHDFTEEIL 351 ++VKDI+R RSFR+++EEKS P DFT + + Sbjct: 126 ERGMTVTVIRVKDIVRMRSFRDVMEEKSAPSDSTGSITTTATTSSNGSSWCGSDFTADYV 185 Query: 350 PFRCSENSVFLHKNGVEIDEKCLPKETVGQPTNDTSGTTRNL-----KGELSFEESEQHS 186 + + N V++ EK LP G N T+ TT ++ KGEL EE EQHS Sbjct: 186 QSWSGNSEEYSGANAVKVGEKYLPGVGDGN-ANTTTRTTMSIDAVGPKGELRCEEKEQHS 244 Query: 185 PVSVLDSLFQEEDESISPF-HHQSFAKIERRRCRLMQMIQEFSSLFEGEKPVFNQD 21 PVSVLD F E+ E S F + + E + +L+ ++ S+ E K N D Sbjct: 245 PVSVLDCPFTEDAEPFSFFIRDEEEEEAEEKAMQLLDQVKAAGSV-EFSKANLNDD 299