BLASTX nr result
ID: Forsythia22_contig00031339
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00031339 (1707 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011075524.1| PREDICTED: uncharacterized protein LOC105159...   640   e-180
ref|XP_009773384.1| PREDICTED: uncharacterized protein LOC104223...   568   e-159
emb|CDP20239.1| unnamed protein product [Coffea canephora]            568   e-159
ref|XP_009603669.1| PREDICTED: uncharacterized protein LOC104098...   565   e-158
ref|XP_002522472.1| conserved hypothetical protein [Ricinus comm...   563   e-157
ref|XP_011027873.1| PREDICTED: uncharacterized protein LOC105128...   555   e-155
ref|XP_002266397.1| PREDICTED: uncharacterized protein LOC100264...   551   e-154
ref|XP_007046020.1| Arabinanase/levansucrase/invertase, putative...   547   e-153
ref|XP_010265210.1| PREDICTED: uncharacterized protein LOC104603...   544   e-152
ref|XP_006483118.1| PREDICTED: uncharacterized protein LOC102631...   543   e-151
gb|KDO82999.1| hypothetical protein CISIN_1g012700mg [Citrus sin...   541   e-151
ref|XP_012464964.1| PREDICTED: uncharacterized protein LOC105783...   540   e-150
emb|CAN72785.1| hypothetical protein VITISV_039508 [Vitis vinifera]   531   e-148
ref|XP_010108700.1| hypothetical protein L484_015688 [Morus nota...   530   e-147
gb|KHN01978.1| hypothetical protein glysoja_034500 [Glycine soja]     529   e-147
ref|XP_003520089.1| PREDICTED: uncharacterized protein LOC100794...   527   e-147
ref|XP_008221583.1| PREDICTED: uncharacterized protein LOC103321...   525   e-146
ref|XP_007224592.1| hypothetical protein PRUPE_ppa1027170mg [Pru...   524   e-146
gb|KCW86570.1| hypothetical protein EUGRSUZ_B03205 [Eucalyptus g...
522 e-145 ref|XP_004298413.1| PREDICTED: uncharacterized protein LOC101299... 522 e-145 >ref|XP_011075524.1| PREDICTED: uncharacterized protein LOC105159987 [Sesamum indicum] Length = 462 Score = 640 bits (1650), Expect = e-180 Identities = 308/451 (68%), Positives = 351/451 (77%), Gaps = 3/451 (0%) Frame = -1 Query: 1491 WPQTRPQIFNLNFST---NPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQK 1321 WPQ P+ S+ NP K++SL +T+CSTKPN N+ + EP SR Q Sbjct: 16 WPQMVPKFLKFTSSSTLLNPRCKSISLFVTHCSTKPNTNKRATDENQSNTEPLPSSRIQ- 74 Query: 1320 QAKPTXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRS 1141 PT +FD+G++NSWDSLEIGSPVVKRYLSD+EERWYMWYHGRS Sbjct: 75 ---PTSGQQVGSSFSLSRGL-IFDVGSKNSWDSLEIGSPVVKRYLSDEEERWYMWYHGRS 130 Query: 1140 GENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMS 961 E+P F+SIGLAVSSNGIHWERGGGAV+S ADVGLVM+ SKDWWAFDT SIRP E++IMS Sbjct: 131 SESPDFDSIGLAVSSNGIHWERGGGAVRSNADVGLVMSSSKDWWAFDTHSIRPCEVMIMS 190 Query: 960 STKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSL 781 S KVR+N+AVYWLYYTGFSSEN++ D SLE + KNPER H+ +N S ILKSL Sbjct: 191 SAKVRANSAVYWLYYTGFSSENIKCLDNSLEFNFKNPERVHLDVENGGLNGDSSKILKSL 250 Query: 780 PGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDA 601 PGLAMSQDGRHWARIEGEHHSGAL DVGSDGEWDSLFIASPQVV+HG+GDLRMYYHSFD Sbjct: 251 PGLAMSQDGRHWARIEGEHHSGALLDVGSDGEWDSLFIASPQVVFHGTGDLRMYYHSFDV 310 Query: 600 ENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVA 421 ENGHFA+G+ARSRDGMRWVKLGKIL GG GAFDECGIVN ++ RNKK+G YLMVYE V Sbjct: 311 ENGHFAIGVARSRDGMRWVKLGKILSGGTRGAFDECGIVNARILRNKKDGEYLMVYEGVD 370 Query: 420 SDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEWRLYY 241 DG+RSIG AVSSDGL+ WKRVQD P K+SE+ GWDC+GVGSPCLVQMDG+ +EWRLYY Sbjct: 371 GDGKRSIGVAVSSDGLQHWKRVQDSPVFKQSEDGGWDCDGVGSPCLVQMDGDTEEWRLYY 430 Query: 240 XXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 GLA+SQG+E+NNFQRWTGFH Sbjct: 431 RGIGKGGRSGIGLAVSQGSEVNNFQRWTGFH 461 >ref|XP_009773384.1| PREDICTED: uncharacterized protein LOC104223620 [Nicotiana sylvestris] Length = 462 Score = 568 bits (1465), 
Expect = e-159 Identities = 287/453 (63%), Positives = 334/453 (73%), Gaps = 3/453 (0%) Frame = -1 Query: 1497 PIWPQTRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQ 1318 P W Q++P+ L + KN+ L L S KPN N+ENA D+N+ +PSS + Q Sbjct: 28 PTWFQSKPKSLQLR-----TIKNIGLFLAKSSAKPNANQENAADKNMLNDPSS----RIQ 78 Query: 1317 AKPTXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSG 1138 +PT LVFDLG ++SWDS EIGSPVVKRYLSD+EERWYMWY+GRS Sbjct: 79 PQPTSNQPLSSSTSSFSRGLVFDLGQKDSWDSTEIGSPVVKRYLSDEEERWYMWYYGRSN 138 Query: 1137 ENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSS 958 G ESIGLAVSSNG+HWERG A K DVGLVMNC +DWW FDTQSIRP E+VIMSS Sbjct: 139 ---GKESIGLAVSSNGVHWERGEMAAKMSDDVGLVMNCGEDWWGFDTQSIRPCEVVIMSS 195 Query: 957 TKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLP 778 KVR+N++VYWLYYTGFSSE +EF D SL+ SL+NPE + G+ K I KSLP Sbjct: 196 AKVRANSSVYWLYYTGFSSEKIEFLDNSLDFSLENPETLYSDGE-------KGKIFKSLP 248 Query: 777 GLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAE 598 GLAMSQDGRHWARIEGEHHSGALFDVG +GEWDSLFIASP+VV+ SGDLRMYYHS+D E Sbjct: 249 GLAMSQDGRHWARIEGEHHSGALFDVGIEGEWDSLFIASPKVVFRSSGDLRMYYHSYDVE 308 Query: 597 NGHFAVGIARSRDGMRWVKLGKILGGGAN-GAFDECGIVNPKVTRNKKNGTYLMVYECVA 421 G+FA+GIARSRDG++W+KLGKI+GGG GAFDE G++NP V RN+K+G YLMVYE V Sbjct: 309 KGNFAIGIARSRDGIKWLKLGKIIGGGGKIGAFDELGVLNPHVVRNRKDGKYLMVYEGVD 368 Query: 420 SDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELD--EWRL 247 S+G RSIG A+SSDGLK WKRVQ+ P LKK EE WD EGVGSP LVQMDG+ EWRL Sbjct: 369 SNGRRSIGMAISSDGLKGWKRVQENPVLKKCEEQRWDSEGVGSPYLVQMDGDDQDHEWRL 428 Query: 246 YYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 YY G+A+SQGNE +FQRWTGFH Sbjct: 429 YYRGVGKNGRTGIGMAVSQGNEFQSFQRWTGFH 461 >emb|CDP20239.1| unnamed protein product [Coffea canephora] Length = 475 Score = 568 bits (1463), Expect = e-159 Identities = 281/461 (60%), Positives = 336/461 (72%), Gaps = 11/461 (2%) Frame = -1 Query: 1497 PIWPQTRPQI---------FNLNFSTNPS--NKNVSLLLTNCSTKPNVNRENAIDENLTV 1351 P W QT+ ++ N S +P NKN L L +CS KPN +R+NA D+NL++ Sbjct: 35 
PTWSQTKSKLPDFYASSSFLNHLKSKSPQLKNKNSKLFLLHCSAKPNTDRKNATDDNLSL 94 Query: 1350 EPSSISRTQKQAKPTXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEE 1171 S S +QA + VFDLG ++SWDS EIGSPVVKR++ D+EE Sbjct: 95 RSGSNSTIPEQAAASPTNEGLSSFSRGL---VFDLGLKDSWDSAEIGSPVVKRFIGDEEE 151 Query: 1170 RWYMWYHGRSGENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQS 991 RWYMWY GRS G +SIGLAVSSNGIHWERG G +KS +DVG+VMNCS DWWAFDTQ Sbjct: 152 RWYMWYCGRSN---GKDSIGLAVSSNGIHWERGNGPIKSSSDVGMVMNCSDDWWAFDTQG 208 Query: 990 IRPGEIVIMSSTKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEI 811 IRP EIVIMSS KVR NN++YWLYYTGF+ E +E D S+ L + +R Sbjct: 209 IRPSEIVIMSSAKVRVNNSLYWLYYTGFNDEKIEPLDNSVAFKLSDRKR----------- 257 Query: 810 EGKSMILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGD 631 M +SLPGLAMSQDGRHWARIEGEH+SGALFDVGSDGEWDSLFIASP+VVYHG+GD Sbjct: 258 ----MYYRSLPGLAMSQDGRHWARIEGEHYSGALFDVGSDGEWDSLFIASPKVVYHGAGD 313 Query: 630 LRMYYHSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNG 451 +RMYYHSFDAE GHFAVGIARSRDG++WVKLGKI+GGG NG FDE G++N V +N+K+G Sbjct: 314 VRMYYHSFDAEKGHFAVGIARSRDGIKWVKLGKIMGGGGNGMFDELGVMNAHVVKNRKDG 373 Query: 450 TYLMVYECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMD 271 Y+M YE VA+DG++S+G AVSSDGLK+W++ QD PALK+SEE+GWD EGVGSPCLVQMD Sbjct: 374 KYVMAYEGVAADGKKSVGLAVSSDGLKEWRKFQDGPALKQSEEDGWDWEGVGSPCLVQMD 433 Query: 270 GELDEWRLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 G+ DEWRLYY GLA+S+G E +FQRWTGFH Sbjct: 434 GDADEWRLYYKGTGKGGKTGIGLAVSEGIEFASFQRWTGFH 474 >ref|XP_009603669.1| PREDICTED: uncharacterized protein LOC104098592 isoform X1 [Nicotiana tomentosiformis] gi|697189237|ref|XP_009603670.1| PREDICTED: uncharacterized protein LOC104098592 isoform X2 [Nicotiana tomentosiformis] gi|697189239|ref|XP_009603671.1| PREDICTED: uncharacterized protein LOC104098592 isoform X3 [Nicotiana tomentosiformis] gi|697189241|ref|XP_009603672.1| PREDICTED: uncharacterized protein LOC104098592 isoform X4 [Nicotiana tomentosiformis] Length = 462 Score = 565 bits (1457), Expect = e-158 Identities = 287/453 (63%), 
Positives = 335/453 (73%), Gaps = 3/453 (0%) Frame = -1 Query: 1497 PIWPQTRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQ 1318 P W Q++P+ L + KN+ L L S KPN N+ENA D+++ +PSS R Q Q Sbjct: 28 PTWFQSKPKYLQLR-----ATKNIGLFLVKSSAKPNANQENAADKDMLNDPSS--RIQPQ 80 Query: 1317 AKPTXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSG 1138 + T LVFDLG ++SWDS EIGSPVVKRYLSD+EERWYMWY+GRS Sbjct: 81 S--TSNQPLSSSASSFSRGLVFDLGQKDSWDSTEIGSPVVKRYLSDEEERWYMWYYGRSN 138 Query: 1137 ENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSS 958 G ESIGLAVSSNG+HWERG A K DVGLVMNC +DWW FDTQSIRP E+VIMSS Sbjct: 139 ---GKESIGLAVSSNGVHWERGEMAAKMSDDVGLVMNCGEDWWGFDTQSIRPCEVVIMSS 195 Query: 957 TKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLP 778 KVR+N++VYWLYYTGFSSE ++F D SL+ SL+NPER + G+ K I KSLP Sbjct: 196 AKVRANSSVYWLYYTGFSSEKIDFLDNSLDFSLENPERLYSDGE-------KGKIFKSLP 248 Query: 777 GLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAE 598 GLAMSQDGRHWARIEGEHHSGALFDVG +GEWDSLFIASP+VV+H SGDLRMYYHS+D E Sbjct: 249 GLAMSQDGRHWARIEGEHHSGALFDVGIEGEWDSLFIASPKVVFHTSGDLRMYYHSYDVE 308 Query: 597 NGHFAVGIARSRDGMRWVKLGKILGGGAN-GAFDECGIVNPKVTRNKKNGTYLMVYECVA 421 G+FA+GIARSRDGM+W+KLGKI+GGG GAFDE G++NP V RN+K+G YLMVYE V Sbjct: 309 KGNFAIGIARSRDGMKWLKLGKIIGGGGKIGAFDELGVLNPHVVRNRKDGKYLMVYEGVD 368 Query: 420 SDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELD--EWRL 247 S+G RSIG A+SSDGLK WKRVQ+ LKK EE WD EGVGSP LVQMDG+ EWRL Sbjct: 369 SNGSRSIGMAISSDGLKGWKRVQENAVLKKYEEERWDSEGVGSPYLVQMDGDDQDHEWRL 428 Query: 246 YYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 YY G+A+SQGNE +F+RWTGFH Sbjct: 429 YYRGIGKNGRTGIGMAVSQGNEFQSFRRWTGFH 461 >ref|XP_002522472.1| conserved hypothetical protein [Ricinus communis] gi|223538357|gb|EEF39964.1| conserved hypothetical protein [Ricinus communis] Length = 484 Score = 563 bits (1452), Expect = e-157 Identities = 279/454 (61%), Positives = 344/454 (75%), Gaps = 5/454 (1%) Frame = -1 Query: 1497 PIWPQTRPQIFNLNFSTNPSNKNVSL-LLTNCSTKPNVNRENAIDENLTVEPSSISRTQK 1321 P+WP T+P N+++NP 
++N + LT CSTKP+ N N +N ++ + S +Q Sbjct: 36 PLWPPTKPN----NYASNPISRNHTFPSLTCCSTKPDTNTNNGTTQNSSIGSNPSSNSQN 91 Query: 1320 QAKP--TXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHG 1147 A P + LVFDLG +SWDS EIGSPVVKR+LSD+EERWYMWYHG Sbjct: 92 LAAPISSNSLSSSFPSPSSSTGLVFDLGPIDSWDSKEIGSPVVKRFLSDEEERWYMWYHG 151 Query: 1146 RSGE-NPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIV 970 S E N G +SIGLAVSSNGIHWERG AVKS DVGLVMNC +DWWAFDT SIRP E+V Sbjct: 152 NSSEKNSGLDSIGLAVSSNGIHWERGIEAVKSSGDVGLVMNCCQDWWAFDTISIRPSEVV 211 Query: 969 IMSSTKVRSNNAVYWLYYTGFSSENVEF-SDKSLEISLKNPERAHIVGDNSNEIEGKSMI 793 +MSS KVR++NAVYWLYY+GFSSE V+F +D SL+ +++NPE+ +NS++ G++ I Sbjct: 212 VMSSNKVRASNAVYWLYYSGFSSEKVDFVNDDSLDFNVENPEKFCFGNENSDD--GRN-I 268 Query: 792 LKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYH 613 KSLPGLA+SQDGRHWARIEGEHHSGALFDVGS+ EWDSLFIASPQVV+HG+GDLRMYYH Sbjct: 269 FKSLPGLAISQDGRHWARIEGEHHSGALFDVGSECEWDSLFIASPQVVFHGNGDLRMYYH 328 Query: 612 SFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVY 433 SFD ENG F +GIARSRDG++WVKLGKI+GGG +G+FDE G++N V ++KK+G Y+M Y Sbjct: 329 SFDMENGQFGIGIARSRDGIKWVKLGKIMGGGKSGSFDEFGVMNASVVKSKKDGKYVMAY 388 Query: 432 ECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEW 253 E VASDG+RSIG AVS DGLK W+R QD LK SE++GWD GVGSPCLVQM+G++DEW Sbjct: 389 EGVASDGKRSIGLAVSPDGLKDWRRFQDGEVLKPSEKDGWDNRGVGSPCLVQMEGDVDEW 448 Query: 252 RLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGF 151 RLYY G+A GN++++F RWTGF Sbjct: 449 RLYYRGVSNEGRTGIGMAFCVGNDVSSFTRWTGF 482 >ref|XP_011027873.1| PREDICTED: uncharacterized protein LOC105128058 [Populus euphratica] Length = 604 Score = 555 bits (1431), Expect = e-155 Identities = 276/456 (60%), Positives = 331/456 (72%), Gaps = 8/456 (1%) Frame = -1 Query: 1491 WP-QTRPQIFNLNFSTNPSNK-NVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQ 1318 WP T P++ +L NP + N L LT CSTKP+ N D+N T E +S Q Sbjct: 149 WPASTNPKVLHLYVPKNPVQRINTLLSLTRCSTKPDTNTNKETDQNSTPESNSNPEPQYP 208 Query: 1317 AKPTXXXXXXXXXXXXXXXL----VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYH 1150 P L VFDLG NSWD 
EIGSPVVKR+LSD+EERWYMWYH Sbjct: 209 LTPISSNDPVPSNSLPSQSLSRGLVFDLGPSNSWDGKEIGSPVVKRFLSDEEERWYMWYH 268 Query: 1149 GRSGENPGF-ESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEI 973 G S +N G +SIGLAVSSNGIHWERG G V S DVG VM C +DWWAFDT SIRPGE+ Sbjct: 269 GNSSQNSGSADSIGLAVSSNGIHWERGVGPVSSSVDVGSVMKCGQDWWAFDTMSIRPGEV 328 Query: 972 VIMSSTKVRSNNAVYWLYYTGFSSENVEFSDK-SLEISLKNPERAHIVGDNSNEIEGKSM 796 V+MSS+KVR+++A YWLYY+GFSSE V+++D SLE SL+NPER + N+ ++ K Sbjct: 329 VVMSSSKVRASSAFYWLYYSGFSSEKVDYTDDDSLEFSLENPERFCLDNVNNGNVD-KGK 387 Query: 795 ILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYY 616 I KSLPGLAMSQDGRHWARIEGEHHSGALFDVGS+ EWDSLFIA P+VV+HG+ DLRMYY Sbjct: 388 IFKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSEREWDSLFIAGPRVVFHGNSDLRMYY 447 Query: 615 HSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMV 436 HSFD E+G F +GIARSRDG+ W+KLGKI+GGG +FDE G++N V RNKK+GTYLM Sbjct: 448 HSFDVESGQFGIGIARSRDGINWMKLGKIIGGGKISSFDEFGVINACVVRNKKDGTYLMA 507 Query: 435 YECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDE 256 YE V + G+RSIG AVS DGL+ W+R QD L+ S ++GWD +GVGSPCLVQMDGE+DE Sbjct: 508 YEGVTAGGKRSIGLAVSPDGLRDWRRFQDEAVLESSVKDGWDNKGVGSPCLVQMDGEVDE 567 Query: 255 WRLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 WRLYY G+AISQGN++++F+RWTGFH Sbjct: 568 WRLYYRGAGNEGRTGIGMAISQGNDVSSFRRWTGFH 603 >ref|XP_002266397.1| PREDICTED: uncharacterized protein LOC100264211 [Vitis vinifera] Length = 491 Score = 551 bits (1419), Expect = e-154 Identities = 275/469 (58%), Positives = 333/469 (71%), Gaps = 19/469 (4%) Frame = -1 Query: 1497 PIWPQTRPQIFNLNFST----------NPSNKNVSLLLTNCSTKPNVNRENAIDENLTVE 1348 P W T +F L S+ +P+ +N +L LT CST+P+ D+N TV Sbjct: 30 PAWCSTPANMFPLYASSTNFFAILPTPHPNPRNCALYLTRCSTRPDTT-----DKNSTVG 84 Query: 1347 PSSISRT----QKQAKPTXXXXXXXXXXXXXXXL---VFDLGTRNSWDSLEIGSPVVKRY 1189 PSS S + Q A P VFDLG NSWDS +IGSPVVKR+ Sbjct: 85 PSSNSNSNSKPQDSAAPASNESLSSAAAAASSSSRGLVFDLGPSNSWDSAQIGSPVVKRF 144 Query: 1188 LSDDEERWYMWYHGRSGENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWW 1009 LSDDEERWYMWYHG S EN 
+SIGLAVSSNG+HWERGGG V+SG DVGLVMNC KDWW Sbjct: 145 LSDDEERWYMWYHGASNENSASDSIGLAVSSNGVHWERGGGPVRSGGDVGLVMNCGKDWW 204 Query: 1008 AFDTQSIRPGEIVIMSSTKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVG 829 AFDT SIRP ++VIMSS +VR ++AVYWLYYTG+SSE V F D SLE+ L+NPERA G Sbjct: 205 AFDTMSIRPSDVVIMSSNRVRGSSAVYWLYYTGYSSEKVVFLDDSLELYLENPERA---G 261 Query: 828 DNSNEIEGKSMILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVV 649 + E G I KSLPGLA+SQDGRHWARIEGEHH+GALFDVG + EWDS++IASPQVV Sbjct: 262 AENGENGGIGKIFKSLPGLAISQDGRHWARIEGEHHTGALFDVGLENEWDSMYIASPQVV 321 Query: 648 YHGSGDLRMYYHSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVT 469 +HG+GDLRMYYHSFD ENG FA+GIARS+DG+RWVKLGKI+GGG +G+FDE G+V V Sbjct: 322 FHGNGDLRMYYHSFDVENGQFAIGIARSKDGIRWVKLGKIMGGGISGSFDESGVVKACVV 381 Query: 468 RNKKNGTYLMVYECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSP 289 +N+++G Y+M YE V +G RSIG AVS DGLK+W+R QD L +E++GWD +GVGSP Sbjct: 382 KNRRDGKYVMAYEGVDGNGRRSIGLAVSPDGLKEWRRSQDEAVLMPAEDDGWDNKGVGSP 441 Query: 288 CLVQMDGELD--EWRLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 CLVQMDG+ D EWRLYY G+A+ +G++ F++WTGFH Sbjct: 442 CLVQMDGDGDGGEWRLYYRGIGQGGRTGIGMAVCEGSDRRRFRKWTGFH 490 >ref|XP_007046020.1| Arabinanase/levansucrase/invertase, putative [Theobroma cacao] gi|508709955|gb|EOY01852.1| Arabinanase/levansucrase/invertase, putative [Theobroma cacao] Length = 482 Score = 547 bits (1410), Expect = e-153 Identities = 270/454 (59%), Positives = 329/454 (72%), Gaps = 7/454 (1%) Frame = -1 Query: 1491 WPQTRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVE----PSSISRTQ 1324 WPQ + + L ++ NP+ + SL LT CSTKPN + N D+N T E P + + T+ Sbjct: 33 WPQNKLNMLTL-YAPNPTTRFSSLSLTRCSTKPNTDTNNETDQNSTFEANPNPDNENPTR 91 Query: 1323 ---KQAKPTXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWY 1153 +A P+ V DLGT +SWD EIGSPVVKR+LSD+EERWYMWY Sbjct: 92 HVSNEAVPSSSTPSSSLSRGL----VLDLGTVDSWDCREIGSPVVKRFLSDEEERWYMWY 147 Query: 1152 HGRSGENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEI 973 HG S PG +SIGLAVSSNG+HWERG GAVKS ADVGLVMNC DWWAFDT+SI PGE+ Sbjct: 148 
HGVSNGKPGSDSIGLAVSSNGVHWERGKGAVKSSADVGLVMNCGNDWWAFDTKSIMPGEV 207 Query: 972 VIMSSTKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMI 793 VIMSS KVR+++AVYWLYYTG+SSE V+ S +++NPER + S+ I GK I Sbjct: 208 VIMSSAKVRASSAVYWLYYTGYSSEQVDILGNSSGFNVQNPERFCVDVSRSSGI-GK--I 264 Query: 792 LKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYH 613 +SLPGLA+SQDGRHWARIEGEHHSGALFDVGS+G+WDSLFIA+PQVV+HG GDLRMYYH Sbjct: 265 FRSLPGLAISQDGRHWARIEGEHHSGALFDVGSEGDWDSLFIAAPQVVFHGYGDLRMYYH 324 Query: 612 SFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVY 433 SFD +NG + +GIARSRDGM+W+KLGKI+GGG FDE G NP V +NKK+G Y+M Y Sbjct: 325 SFDVKNGEYCIGIARSRDGMKWIKLGKIMGGGKRSCFDELGATNPCVVKNKKDGEYIMAY 384 Query: 432 ECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEW 253 E V +DG R+IG AVS DGLK W R++D LK ++GWD EG+GSPCLV MDG++DEW Sbjct: 385 EGVDADGLRNIGLAVSPDGLKDWTRLRDEAVLKPGTDDGWDNEGIGSPCLVGMDGDVDEW 444 Query: 252 RLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGF 151 RLYY G+A+S G+E+ F+RWTGF Sbjct: 445 RLYYRGIGNGGRSGIGMAVSDGSEITRFRRWTGF 478 >ref|XP_010265210.1| PREDICTED: uncharacterized protein LOC104603010 [Nelumbo nucifera] Length = 591 Score = 544 bits (1401), Expect = e-152 Identities = 267/434 (61%), Positives = 323/434 (74%), Gaps = 1/434 (0%) Frame = -1 Query: 1446 NPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTXXXXXXXXXXXXX 1267 +P +N LT CS K ++ +N D N T++ SS S TQ+ + PT Sbjct: 135 SPIPRNGFPYLTRCSRKLDIGNDNTNDRNPTIDTSSTSTTQQPSTPTQIQTPTSSSSSTG 194 Query: 1266 XXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENPGFESIGLAVSSNGI 1087 VFDLG+ + WDS E+GS V+KRYLSDD ERWYMWYHG S +NP SIGLAVS NGI Sbjct: 195 L--VFDLGSNSCWDSREVGSLVLKRYLSDDAERWYMWYHGSSDDNPTSGSIGLAVSGNGI 252 Query: 1086 HWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRSNNAVYWLYYTGF 907 HWERG G V+S D G+VMNCS DWWAFDT IRP E+VIMSSTKVR +NAVYWLYYTGF Sbjct: 253 HWERGTGHVRSSTDAGMVMNCSNDWWAFDTACIRPSEVVIMSSTKVRGSNAVYWLYYTGF 312 Query: 906 SSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMSQDGRHWARIEGE 727 +SE V+FS + I+++NPER + N NE + + ILKSLPGLA+SQDGRHWARIEGE 
Sbjct: 313 NSEKVDFS-VAPGITVENPERVY---KNDNE-DTQGSILKSLPGLAISQDGRHWARIEGE 367 Query: 726 HHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFAVGIARSRDGMRW 547 HHSGALFDVGS EWDSLF+A+P+VV+H +GDLRMYYHSFDA GHFAVGIARSRDG+RW Sbjct: 368 HHSGALFDVGSGVEWDSLFVATPRVVFHSNGDLRMYYHSFDAGCGHFAVGIARSRDGIRW 427 Query: 546 VKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERSIGFAVSSDGLKQ 367 VKLGKI+GGG +G+FDECG++N V RN+++G YLM YE +A+DG+RSIG AVS DGLK Sbjct: 428 VKLGKIMGGGLDGSFDECGVINAHVVRNRRDGGYLMAYEGIAADGQRSIGLAVSPDGLKD 487 Query: 366 WKRVQDFPALKKS-EENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXXXXXXXGLAISQ 190 W+R + LK S +E+GWD +GVGSPCLVQ++G DEWRLYY G+A+S Sbjct: 488 WRRCGEDAVLKPSADEDGWDNKGVGSPCLVQLEGSPDEWRLYYRGVGKGGRTGIGMAVSD 547 Query: 189 GNELNNFQRWTGFH 148 G+E NF+RWTGFH Sbjct: 548 GSEARNFKRWTGFH 561 >ref|XP_006483118.1| PREDICTED: uncharacterized protein LOC102631485 [Citrus sinensis] Length = 493 Score = 543 bits (1398), Expect = e-151 Identities = 270/454 (59%), Positives = 324/454 (71%), Gaps = 10/454 (2%) Frame = -1 Query: 1482 TRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTX 1303 ++P+ NL P N+ LT+CSTKP+ N N D++ T+E +S S++ + P+ Sbjct: 38 SKPKKPNLLVVYAPRVNNLLSFLTHCSTKPDTNTNNETDQDSTIEHNSNSKSNQGNAPSS 97 Query: 1302 XXXXXXXXXXXXXXL-------VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGR 1144 V DLG+ NSWDS EIGSPVVKR+L DDEERWYMWYHG Sbjct: 98 SNSDEALGASLSPSNSSSSRGLVLDLGSTNSWDSGEIGSPVVKRFLGDDEERWYMWYHGN 157 Query: 1143 SGENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIM 964 SGE PG +S+GLA+SSNGIHWERG G V++ DVGLVMNC KDWWAFDT SIRP E+ IM Sbjct: 158 SGEKPGSDSVGLAISSNGIHWERGNGPVRTSNDVGLVMNCGKDWWAFDTLSIRPSEVAIM 217 Query: 963 SSTKVRSNNAVYWLYYTGFSSENVEFSD-KSLEISLKNPERAHIVGDNSNEIEGKSMILK 787 SS KVR+++AVYWLYYTG+SSE + F D SLE +L+NPER + S E K I K Sbjct: 218 SSNKVRASSAVYWLYYTGYSSEKMNFLDYDSLEFNLENPERFQVGNLLSGENGLKRKINK 277 Query: 786 SLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSF 607 SLPGLA+SQDGRHWARIEGEHHSGALFDVGSD +WDSLFIA+PQVV+HG+GDLRMYYHSF Sbjct: 278 
SLPGLAISQDGRHWARIEGEHHSGALFDVGSDEDWDSLFIAAPQVVFHGNGDLRMYYHSF 337 Query: 606 DAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYEC 427 D E G F +GIARSRDG++WVKLGKI+GGG G+FDE G+ N V RNKK+G YLM YE Sbjct: 338 DVEKGEFGIGIARSRDGIKWVKLGKIMGGGIRGSFDEFGVKNACVARNKKDGKYLMAYEG 397 Query: 426 VASDGERSIGFAVSSDGLKQWKRVQDFPALKK--SEENGWDCEGVGSPCLVQMDGELDEW 253 V +DG SIG AVS+ GLK W+R QD LK E+GWD +G+GSP LVQMDG+ DEW Sbjct: 398 VGADGSSSIGLAVSTGGLKGWRRFQDNTMLKAEVEAEDGWDNKGIGSPYLVQMDGDSDEW 457 Query: 252 RLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGF 151 RLYY GLA+S+G+++ F RWTGF Sbjct: 458 RLYYRGIGNGGRTGIGLAVSEGSDVRKFTRWTGF 491 >gb|KDO82999.1| hypothetical protein CISIN_1g012700mg [Citrus sinensis] Length = 458 Score = 541 bits (1394), Expect = e-151 Identities = 267/441 (60%), Positives = 318/441 (72%), Gaps = 10/441 (2%) Frame = -1 Query: 1443 PSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTXXXXXXXXXXXXXX 1264 P N+ LT+CSTKP+ N N D++ T+E +S S++ + P+ Sbjct: 16 PRVNNLLSFLTHCSTKPDTNTNNETDQDSTIEHNSNSKSNQGNAPSSSNSDEALGASLSP 75 Query: 1263 XL-------VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENPGFESIGLA 1105 V DLG+ NSWDS EIGSPVVKR+L DDEERWYMWYHG SGE PG +S+GLA Sbjct: 76 SNSSSSRGLVLDLGSTNSWDSGEIGSPVVKRFLGDDEERWYMWYHGNSGEKPGSDSVGLA 135 Query: 1104 VSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRSNNAVYW 925 +SSNGIHWERG G V++ DVGLVMNC KDWWAFDT SIRP E+ IMSS KVR+++AVYW Sbjct: 136 ISSNGIHWERGNGPVRTSNDVGLVMNCGKDWWAFDTLSIRPSEVAIMSSNKVRASSAVYW 195 Query: 924 LYYTGFSSENVEFSD-KSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMSQDGRH 748 LYYTG+SSE + F D SLE +L+NPER + S E K I KSLPGLA+SQDGRH Sbjct: 196 LYYTGYSSEKMNFLDYDSLEFNLENPERFQVGNLLSGENGLKRKINKSLPGLAISQDGRH 255 Query: 747 WARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFAVGIAR 568 WARIEGEHHSGALFDVGSD +WDSLFIA+PQVV+HG+GDLRMYYHSFD E G F +GIAR Sbjct: 256 WARIEGEHHSGALFDVGSDEDWDSLFIAAPQVVFHGNGDLRMYYHSFDVEKGEFGIGIAR 315 Query: 567 SRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERSIGFAV 388 SRDG++WVKLGKI+GGG G+FDE G+ N V RNKK+G YLM YE V +DG SIG AV Sbjct: 316 
SRDGIKWVKLGKIMGGGIRGSFDEFGVKNACVARNKKDGKYLMAYEGVGADGSSSIGLAV 375 Query: 387 SSDGLKQWKRVQDFPALKK--SEENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXXXXX 214 S+ GLK W+R QD LK E+GWD +G+GSP LVQMDG+ DEWRLYY Sbjct: 376 STGGLKGWRRFQDNTMLKAEVEAEDGWDNKGIGSPYLVQMDGDSDEWRLYYRGIGNGGRT 435 Query: 213 XXGLAISQGNELNNFQRWTGF 151 GLA+S+G+++ F RWTGF Sbjct: 436 GIGLAVSEGSDVRKFTRWTGF 456 >ref|XP_012464964.1| PREDICTED: uncharacterized protein LOC105783834 [Gossypium raimondii] gi|763813391|gb|KJB80243.1| hypothetical protein B456_013G088500 [Gossypium raimondii] Length = 474 Score = 540 bits (1390), Expect = e-150 Identities = 258/451 (57%), Positives = 325/451 (72%), Gaps = 4/451 (0%) Frame = -1 Query: 1491 WPQTRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAK 1312 W QT+ + ++ NP+ + S+ L CSTKPN + N D+N T EP+ T+ + Sbjct: 32 WSQTKLNMLTF-YAPNPNTRFSSICLPRCSTKPNTDTNNETDQNPTFEPNPSLTTENPSS 90 Query: 1311 PTXXXXXXXXXXXXXXXL---VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRS 1141 V DLG SWD +IGSPVVKR+LSD+EERWYMWYHG S Sbjct: 91 AVSDEVIPSSSNPPSSLSRGLVLDLGPVGSWDCTDIGSPVVKRFLSDEEERWYMWYHGVS 150 Query: 1140 GENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMS 961 ++ G +SIGLAVSSNG+HWERG GAVKS ADVGLVM+C DWWAFDTQSIRPGE+VIMS Sbjct: 151 TDSQGSDSIGLAVSSNGVHWERGKGAVKSSADVGLVMSCGNDWWAFDTQSIRPGEVVIMS 210 Query: 960 STKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSL 781 S KVR+++AVYWLYYTG+S+E V+ S SL ++NPE N+ +L+SL Sbjct: 211 SAKVRASSAVYWLYYTGYSNEKVDISADSLGFKVQNPE---------NQSSQTGEVLRSL 261 Query: 780 PGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDA 601 PGLA+SQDGRHWARIEGEHHSGALFDVGS+G+WDSLFI+SPQVV+HG+GDLRMYYHSFD Sbjct: 262 PGLAISQDGRHWARIEGEHHSGALFDVGSEGDWDSLFISSPQVVFHGNGDLRMYYHSFDV 321 Query: 600 ENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVA 421 NG F++G+ARSRDGM+W+KLGKI+GGG G FDE G +NP V +NKK+ Y+M YE V Sbjct: 322 GNGVFSIGMARSRDGMKWIKLGKIMGGGPKGCFDELGAMNPYVVKNKKDRNYVMAYEGVG 381 Query: 420 SDGERSIGFAVSSDGLKQWKRVQDFPALK-KSEENGWDCEGVGSPCLVQMDGELDEWRLY 244 +DG RSIG A+S++GLK W+RV+D LK + 
E+GWD +G+GSPCLV+MDG++DEWRLY Sbjct: 382 ADGRRSIGLAMSAEGLKDWRRVEDEAVLKLATMEDGWDSKGIGSPCLVEMDGDVDEWRLY 441 Query: 243 YXXXXXXXXXXXGLAISQGNELNNFQRWTGF 151 Y G+A+S G+++ F+RW GF Sbjct: 442 YRGIGNSGRCGIGMAVSDGSDITRFRRWKGF 472 >emb|CAN72785.1| hypothetical protein VITISV_039508 [Vitis vinifera] Length = 531 Score = 531 bits (1369), Expect = e-148 Identities = 266/438 (60%), Positives = 317/438 (72%), Gaps = 19/438 (4%) Frame = -1 Query: 1497 PIWPQTRPQIFNLNFST----------NPSNKNVSLLLTNCSTKPNVNRENAIDENLTVE 1348 P W T +F L S+ +P+ +N +L LT CST+P+ D+N TV Sbjct: 30 PAWCSTPANMFPLYASSTNFFAILPTPHPNPRNCALYLTRCSTRPDTT-----DKNSTVG 84 Query: 1347 PSSISRT----QKQAKPTXXXXXXXXXXXXXXXL---VFDLGTRNSWDSLEIGSPVVKRY 1189 PSS S + Q A P VFDLG NSWDS +IGSPVVKR+ Sbjct: 85 PSSDSNSNSKPQDSAAPASNESLSSAAAAASSSSRGLVFDLGPSNSWDSAQIGSPVVKRF 144 Query: 1188 LSDDEERWYMWYHGRSGENPGFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWW 1009 LSDDEERWYMWYHG S EN +SIGLAVSSNG+HWERGGG V+SG DVGLVMNC KDWW Sbjct: 145 LSDDEERWYMWYHGASNENSASDSIGLAVSSNGVHWERGGGPVRSGGDVGLVMNCGKDWW 204 Query: 1008 AFDTQSIRPGEIVIMSSTKVRSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVG 829 AFDT SIRP ++VIMSS +VR ++AVYWLYYTG+SSE V F D SLE+ L+NPERA G Sbjct: 205 AFDTMSIRPSDVVIMSSNRVRGSSAVYWLYYTGYSSEKVVFLDDSLELYLENPERA---G 261 Query: 828 DNSNEIEGKSMILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVV 649 + E G I KSLPGLA+SQDGRHWARIEGEHH+GALFDVG + EWDS++IASPQVV Sbjct: 262 AENGENGGIGKIFKSLPGLAISQDGRHWARIEGEHHTGALFDVGLENEWDSMYIASPQVV 321 Query: 648 YHGSGDLRMYYHSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVT 469 +HG+GDLRMYYHSFD ENG FA+GIARS+DG+RWVKLGKI+GGG +G+FDE G+V V Sbjct: 322 FHGNGDLRMYYHSFDVENGQFAIGIARSKDGIRWVKLGKIMGGGISGSFDESGVVKACVV 381 Query: 468 RNKKNGTYLMVYECVASDGERSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSP 289 +N+++G Y+M YE V +G RSIG AVS DGLK+W+R QD L +E++GWD +GVGSP Sbjct: 382 KNRRDGKYVMAYEGVDGNGRRSIGLAVSPDGLKEWRRSQDEAVLMPAEDDGWDNKGVGSP 441 Query: 288 CLVQMDGELD--EWRLYY 241 CLVQMDG+ D EWRLYY Sbjct: 442 CLVQMDGDGDGGEWRLYY 459 >ref|XP_010108700.1| hypothetical protein 
L484_015688 [Morus notabilis] gi|587933010|gb|EXC20011.1| hypothetical protein L484_015688 [Morus notabilis] Length = 481 Score = 530 bits (1366), Expect = e-147 Identities = 267/443 (60%), Positives = 317/443 (71%), Gaps = 14/443 (3%) Frame = -1 Query: 1434 KNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTXXXXXXXXXXXXXXXL- 1258 +N SL L+ CSTKP+ N +N +++ T + S K KP Sbjct: 40 RNNSLSLSCCSTKPDTNTDNVGNQDPTFDIGLNSEQPKSPKPNSSDQLSSLSPSSASSSG 99 Query: 1257 --VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSG------ENPGFESIGLAV 1102 VFDLG NSWDS EIGSPVVKR+LSD+EERWYMWYHGRS ENP +S+GLAV Sbjct: 100 GLVFDLGIENSWDSAEIGSPVVKRFLSDEEERWYMWYHGRSSRSKNDSENPCLDSVGLAV 159 Query: 1101 SSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRSNNAVYWL 922 SSNG+HWERG G V++ DVG VM+C KDWWAFDT SIRP ++VIMSS+KVR ++AVYW+ Sbjct: 160 SSNGVHWERGVGPVQASRDVGFVMSCGKDWWAFDTLSIRPSKVVIMSSSKVRVSSAVYWM 219 Query: 921 YYTGFSSE--NVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMSQDGRH 748 YYTGFSSE +++ SD+S + SL+NPER GD I KSLPGLA+SQDGR+ Sbjct: 220 YYTGFSSEEIDIDISDESFKFSLENPER--FFGDFEGGSTSSGKIHKSLPGLAISQDGRY 277 Query: 747 WARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFAVGIAR 568 WARIEGEHHSGALFDVG++ EWDSLFIASPQVV+HG+GDLRMYYHSFD NG F +G+AR Sbjct: 278 WARIEGEHHSGALFDVGAEKEWDSLFIASPQVVFHGNGDLRMYYHSFDVGNGEFCIGMAR 337 Query: 567 SRDGMRWVKLGKILGGGAN--GAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERSIGF 394 SRDG+RWVKLGKI+GG N GAFDE G +N V RN+K+G YLM YE V+ +GERSIG Sbjct: 338 SRDGIRWVKLGKIIGGEKNTSGAFDEFGALNANVVRNRKDGKYLMAYEGVSCNGERSIGL 397 Query: 393 AVSSDGLKQWKRVQDFPALKKSE-ENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXXXX 217 A+S DGLK W + +D P LK SE +NGWD GVGSPCLVQMDGE DEWRLYY Sbjct: 398 AMSQDGLKNWTKFRDGPVLKASEAQNGWDNRGVGSPCLVQMDGEEDEWRLYYRGVGNEGR 457 Query: 216 XXXGLAISQGNELNNFQRWTGFH 148 G+A S G++ F RWTGFH Sbjct: 458 TGIGMAASHGSDFGRFTRWTGFH 480 >gb|KHN01978.1| hypothetical protein glysoja_034500 [Glycine soja] Length = 490 Score = 529 bits (1362), Expect = e-147 Identities = 261/433 (60%), Positives = 313/433 (72%), Gaps = 13/433 (3%) Frame = -1 Query: 1407 
CSTKPNV--NRENAIDENLTVE-PSSISRTQK----------QAKPTXXXXXXXXXXXXX 1267 CSTKP+ N E N E P+SIS +Q +A + Sbjct: 57 CSTKPDTSANSETQHTNNPNNEQPNSISNSQNAPQSSDSSSSEAFSSSPPPLGSSHSSSS 116 Query: 1266 XXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENPGFESIGLAVSSNGI 1087 LV DLG NSWDS +IGSPVVKR+LSD+EERWYMWYHGR+ P + IGLAVS NG+ Sbjct: 117 RGLVLDLGPSNSWDSADIGSPVVKRFLSDEEERWYMWYHGRAKGYPSSDLIGLAVSKNGV 176 Query: 1086 HWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRSNNAVYWLYYTGF 907 HWERGGG +S +DVG V++C KDWW FDT IRP E+VIMSS++VR+++AVYWLYYTGF Sbjct: 177 HWERGGGPARSSSDVGFVISCGKDWWGFDTGGIRPSEMVIMSSSRVRASSAVYWLYYTGF 236 Query: 906 SSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMSQDGRHWARIEGE 727 SE +EFSD SLE S++NP+ G + GK +LKSLPGLA+SQDGRHWARIEGE Sbjct: 237 VSERMEFSDHSLEFSVENPDGMINDGVSCGNGNGKGKVLKSLPGLAISQDGRHWARIEGE 296 Query: 726 HHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFAVGIARSRDGMRW 547 HHSGAL DVGS+ EWDSLFI+SPQVV+HG+GDLRMYYHSFD E GHF VGIARSRDG+RW Sbjct: 297 HHSGALIDVGSEKEWDSLFISSPQVVFHGNGDLRMYYHSFDVERGHFGVGIARSRDGIRW 356 Query: 546 VKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERSIGFAVSSDGLKQ 367 VKLGKI+GGG G+FDE G++NP VTRN+ G Y+M YE VA+DG RSIG AVS DGLK+ Sbjct: 357 VKLGKIMGGGKVGSFDEFGVMNPCVTRNRSGGNYVMTYEGVAADGRRSIGLAVSPDGLKE 416 Query: 366 WKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXXXXXXXGLAISQG 187 W R+QD LK S++ WD + VGSPCLV+MD E DEWRLYY G+AIS+G Sbjct: 417 WARLQDEAILKPSDQGCWDDKDVGSPCLVEMDTEGDEWRLYYRGVGNGGRVGIGMAISEG 476 Query: 186 NELNNFQRWTGFH 148 ++ +F+RWTGFH Sbjct: 477 RDIGSFRRWTGFH 489 >ref|XP_003520089.1| PREDICTED: uncharacterized protein LOC100794036 [Glycine max] Length = 497 Score = 527 bits (1358), Expect = e-147 Identities = 261/433 (60%), Positives = 312/433 (72%), Gaps = 13/433 (3%) Frame = -1 Query: 1407 CSTKPNV--NRENAIDENLTVE-PSSISRTQK----------QAKPTXXXXXXXXXXXXX 1267 CSTKP+ N E N E P+SIS +Q +A + Sbjct: 64 CSTKPDTSANSETQHTNNPNNEQPNSISNSQNAPQSSDSSSSEAFSSSPPPLGSSHSSSS 123 Query: 1266 XXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENPGFESIGLAVSSNGI 1087 LV DLG NSWDS 
+IGSPVVKR+LSD+EERWYMWYHGR+ P + IGLAVS NG+ Sbjct: 124 RGLVLDLGPSNSWDSADIGSPVVKRFLSDEEERWYMWYHGRAKGYPSSDLIGLAVSKNGV 183 Query: 1086 HWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRSNNAVYWLYYTGF 907 HWERGGG +S +DVG V++C KDWW FDT IRP E+VIMSS++VR+++AVYWLYYTGF Sbjct: 184 HWERGGGPARSSSDVGFVISCGKDWWGFDTGGIRPSEMVIMSSSRVRASSAVYWLYYTGF 243 Query: 906 SSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMSQDGRHWARIEGE 727 SE +EFSD SLE S++NP+ G + GK +LKSLPGLA+SQDGRHWARIEGE Sbjct: 244 VSERMEFSDHSLEFSVENPDGMINDGVSCGNGNGKGKVLKSLPGLAISQDGRHWARIEGE 303 Query: 726 HHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFAVGIARSRDGMRW 547 HHSGAL DVGS+ EWDSLFI+SPQVV+HG+GDLRMYYHSFD E GHF VGIARSRDG+RW Sbjct: 304 HHSGALIDVGSEKEWDSLFISSPQVVFHGNGDLRMYYHSFDVERGHFGVGIARSRDGIRW 363 Query: 546 VKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERSIGFAVSSDGLKQ 367 VKLGKI+GGG G+FDE G++NP VTRN+ G Y+M YE VA+DG RSIG AVS DGLK+ Sbjct: 364 VKLGKIMGGGKVGSFDEFGVMNPCVTRNRSGGNYVMTYEGVAADGRRSIGLAVSPDGLKE 423 Query: 366 WKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXXXXXXXGLAISQG 187 W R QD LK S++ WD + VGSPCLV+MD E DEWRLYY G+AIS+G Sbjct: 424 WARRQDEAILKPSDQGCWDDKDVGSPCLVEMDTEGDEWRLYYRGVGNGGRVGIGMAISEG 483 Query: 186 NELNNFQRWTGFH 148 ++ +F+RWTGFH Sbjct: 484 RDIGSFRRWTGFH 496 >ref|XP_008221583.1| PREDICTED: uncharacterized protein LOC103321547 [Prunus mume] Length = 497 Score = 525 bits (1351), Expect = e-146 Identities = 266/458 (58%), Positives = 330/458 (72%), Gaps = 11/458 (2%) Frame = -1 Query: 1488 PQTRPQIFNLNF-STNPSNKNVSLLLTNCSTKPNVNR-ENAIDENLTVEPSSISRTQKQA 1315 P T+P + L ++NP ++ +SL+ T CS KP+ + +N ++N TVEP+ S + Sbjct: 40 PHTKPNVHALCLPNSNPRSRAISLI-TRCSIKPDTDTTDNEKEQNSTVEPNLNSEPTNPS 98 Query: 1314 KP-----TXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYH 1150 P LV LG NSWDS E+GSPVVKR+L D+EERWYMWY+ Sbjct: 99 TPFSNDALSSPISTSFSSSNTKGLVLGLGFENSWDSAEVGSPVVKRFLGDEEERWYMWYY 158 Query: 1149 GRSGENP--GFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGE 976 G+S NP G +SIGLAVSSNG+HWERG G V+S DVG V+NC KDWWAFDTQSIRP E Sbjct: 159 
GKSSSNPNPGSDSIGLAVSSNGVHWERGVGQVQSSQDVGAVINCGKDWWAFDTQSIRPSE 218 Query: 975 IVIMSSTKVRSNNAVYWLYYTGFSSENVE-FSDKSLEISLKNPERAHIVGDNSNEIEGKS 799 +V+MSS+KVR+++AVYWLYYTG+S+E E S+ S EI+L+NPER + G S++ G Sbjct: 219 VVVMSSSKVRASSAVYWLYYTGYSAEEAENISNHSQEINLENPERFLLDGLISDKNGGIG 278 Query: 798 MILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMY 619 I KSLPGLA+SQDGRHWARIEGEHHSGALFDVG GEWDS FIA+P VV+H SGDLRMY Sbjct: 279 KIFKSLPGLAISQDGRHWARIEGEHHSGALFDVGLQGEWDSSFIAAPHVVFHESGDLRMY 338 Query: 618 YHSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLM 439 YHSFD E G++++G+ARSRDG++WVKLGKI+GGG +G FDE G +NP V RN+K+G YLM Sbjct: 339 YHSFDLEMGNYSIGMARSRDGIKWVKLGKIIGGGRSGYFDELGAMNPCVVRNRKDGEYLM 398 Query: 438 VYECVASDGERSIGFAVSSDGLKQWKRVQDFP-ALKKSEENGWDCEGVGSPCLVQMDGEL 262 YE V DG RSIG AVS DGLK W R++D LK SE+ GWD +GVGSPCLVQMDGE Sbjct: 399 AYEGVGGDGGRSIGLAVSPDGLKDWTRLKDDEVVLKASEDCGWDNKGVGSPCLVQMDGEE 458 Query: 261 DEWRLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 DEWRLYY G+A+S+G+++ F+R GFH Sbjct: 459 DEWRLYYRGVGIEGRTGIGMAVSEGSDVTRFRRCAGFH 496 >ref|XP_007224592.1| hypothetical protein PRUPE_ppa1027170mg [Prunus persica] gi|462421528|gb|EMJ25791.1| hypothetical protein PRUPE_ppa1027170mg [Prunus persica] Length = 497 Score = 524 bits (1350), Expect = e-146 Identities = 266/458 (58%), Positives = 329/458 (71%), Gaps = 11/458 (2%) Frame = -1 Query: 1488 PQTRPQIFNLNF-STNPSNKNVSLLLTNCSTKPNVNR-ENAIDENLTVEPSSISRTQKQA 1315 P T+P + L ++NP ++ +SL+ T CS KP+ + +N ++N TVEP+ S + Sbjct: 40 PHTKPNVHALCLPNSNPRSRAISLI-TRCSIKPDTDTTDNEKEQNSTVEPNLNSEPTNPS 98 Query: 1314 KP-----TXXXXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYH 1150 P LV LG NSWDS E+GSPVVKR+L D+EERWYMWY+ Sbjct: 99 TPFSNDALSSPISTSFSSSNTKGLVLGLGFENSWDSAEVGSPVVKRFLGDEEERWYMWYY 158 Query: 1149 GRSGENP--GFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGE 976 G+S NP G +SIGLAVSSNG+HWERG G V+S DVG V+NC KDWW FDTQSIRP E Sbjct: 159 GKSSSNPNPGSDSIGLAVSSNGVHWERGVGQVQSSQDVGAVINCGKDWWVFDTQSIRPSE 218 Query: 975 
IVIMSSTKVRSNNAVYWLYYTGFSSENVE-FSDKSLEISLKNPERAHIVGDNSNEIEGKS 799 +V+MSS+KVR+++AVYWLYYTG+S+E E S+ S EI+L+NPER + G S++ G Sbjct: 219 VVVMSSSKVRASSAVYWLYYTGYSAEEAENISNHSQEINLENPERFLLDGLISDKNGGIG 278 Query: 798 MILKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMY 619 I KSLPGLA+SQDGRHWARIEGEHHSGALFDVG GEWDS FIA+P VV+H SGDLRMY Sbjct: 279 KIFKSLPGLAISQDGRHWARIEGEHHSGALFDVGLQGEWDSSFIAAPHVVFHESGDLRMY 338 Query: 618 YHSFDAENGHFAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLM 439 YHSFD E G++++G+ARSRDG++WVKLGKI+GGG +G FDE G +NP V RN+K+G YLM Sbjct: 339 YHSFDLEMGNYSIGMARSRDGIKWVKLGKIIGGGRSGYFDELGAMNPCVVRNRKDGEYLM 398 Query: 438 VYECVASDGERSIGFAVSSDGLKQWKRVQDFP-ALKKSEENGWDCEGVGSPCLVQMDGEL 262 YE V DG RSIG AVS DGLK W R++D LK SE+ GWD +GVGSPCLVQMDGE Sbjct: 399 AYEGVGGDGGRSIGLAVSPDGLKDWTRLKDDEVVLKASEDCGWDNKGVGSPCLVQMDGEE 458 Query: 261 DEWRLYYXXXXXXXXXXXGLAISQGNELNNFQRWTGFH 148 DEWRLYY G+A+SQG+++ F+R GFH Sbjct: 459 DEWRLYYRGVGIEGRTGIGMAVSQGSDVTRFRRCAGFH 496 >gb|KCW86570.1| hypothetical protein EUGRSUZ_B03205 [Eucalyptus grandis] Length = 455 Score = 522 bits (1345), Expect = e-145 Identities = 262/445 (58%), Positives = 317/445 (71%), Gaps = 1/445 (0%) Frame = -1 Query: 1479 RPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTXX 1300 +P + L+ T S + SL+ T CSTKPN + N D+N + E + + + + T Sbjct: 31 KPNLLTLHKITFDSRRTPSLV-TRCSTKPNADITNGTDKNPSFESDNYASSSASSSTTLL 89 Query: 1299 XXXXXXXXXXXXXLVFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENPGFE 1120 V DLG NSWDS E+GSPVVKR+LSD+EERWYMWYHG S ++P + Sbjct: 90 CSRGL---------VLDLGPANSWDSAEVGSPVVKRFLSDEEERWYMWYHGSSDQDPSSD 140 Query: 1119 SIGLAVSSNGIHWERGGGAVKSGADV-GLVMNCSKDWWAFDTQSIRPGEIVIMSSTKVRS 943 +IGLAVSSNGIHWERG GA + +V G+V+NCSKDWWAFDT SIRP E+VIMSS VR+ Sbjct: 141 AIGLAVSSNGIHWERGRGASVTSTNVAGVVLNCSKDWWAFDTMSIRPSEVVIMSSNIVRA 200 Query: 942 NNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLAMS 763 + AVYWLYYTG +SENV++ D S E+ +KNPER K + KSLPGLAMS Sbjct: 201 SGAVYWLYYTGHTSENVKYFDDSFELDVKNPERF---------CSRKGEVFKSLPGLAMS 251 Query: 762 
QDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGHFA 583 QDGRHWAR+EGEHHSGALFDVGS+ EWD LFI+SPQVV+H +GDLRMYYHSFDAE G + Sbjct: 252 QDGRHWARLEGEHHSGALFDVGSENEWDFLFISSPQVVFHANGDLRMYYHSFDAEKGEYC 311 Query: 582 VGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGERS 403 +G+ARSRDG++W+KLGKIL GG G FDE G VN +V +NKK+G YLMVYE V GERS Sbjct: 312 IGMARSRDGIKWLKLGKIL-GGRKGCFDEGGAVNARVLKNKKDGQYLMVYEGVGRHGERS 370 Query: 402 IGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEWRLYYXXXXXX 223 IG AVSSDGLK W+R+ + L +S + GWD EGVGSPCLVQ+DGE EWRLYY Sbjct: 371 IGAAVSSDGLKDWRRLGEEAILGRS-DGGWDGEGVGSPCLVQLDGEASEWRLYYRGVGSG 429 Query: 222 XXXXXGLAISQGNELNNFQRWTGFH 148 GLAIS G++L+ FQRWTGFH Sbjct: 430 GRTGIGLAISDGSDLSGFQRWTGFH 454 >ref|XP_004298413.1| PREDICTED: uncharacterized protein LOC101299860 [Fragaria vesca subsp. vesca] Length = 466 Score = 522 bits (1345), Expect = e-145 Identities = 252/447 (56%), Positives = 313/447 (70%), Gaps = 2/447 (0%) Frame = -1 Query: 1482 TRPQIFNLNFSTNPSNKNVSLLLTNCSTKPNVNRENAIDENLTVEPSSISRTQKQAKPTX 1303 T+ +L+ ++ + ++ ++ L+ CSTKP+ + +N D+N TVEP+ S+ + PT Sbjct: 32 TKSNSHSLHLQSSNTRRSRAISLSRCSTKPDTDTDNQKDQNSTVEPNLNSKPLNPSPPTS 91 Query: 1302 XXXXXXXXXXXXXXL--VFDLGTRNSWDSLEIGSPVVKRYLSDDEERWYMWYHGRSGENP 1129 VFDLG +SWD +GSPVVKR+L D+EERWYMWYHGRS NP Sbjct: 92 IDQLSTPASSFPCSKGLVFDLGVESSWDGAGVGSPVVKRFLGDEEERWYMWYHGRSDSNP 151 Query: 1128 GFESIGLAVSSNGIHWERGGGAVKSGADVGLVMNCSKDWWAFDTQSIRPGEIVIMSSTKV 949 G +SIGLAVSSNG+HW RG GAV+S DVGLVM+ KDWWAFDT SIRP E+V+MSS+KV Sbjct: 152 GSDSIGLAVSSNGVHWNRGRGAVQSSQDVGLVMSSGKDWWAFDTLSIRPSEVVVMSSSKV 211 Query: 948 RSNNAVYWLYYTGFSSENVEFSDKSLEISLKNPERAHIVGDNSNEIEGKSMILKSLPGLA 769 R+++AVYWLYYTG+S E E S+ + I L+NPER+ + KSLPGLA Sbjct: 212 RASSAVYWLYYTGYSPEKAEISE--VPIGLENPERS-------------GEVFKSLPGLA 256 Query: 768 MSQDGRHWARIEGEHHSGALFDVGSDGEWDSLFIASPQVVYHGSGDLRMYYHSFDAENGH 589 +SQDGRHWARIEGEHHSGALFDVG + EWDS FIA VV+H GDLRMYYHSFD E+GH Sbjct: 257 ISQDGRHWARIEGEHHSGALFDVGLEKEWDSSFIAGSHVVFHKRGDLRMYYHSFDLESGH 316 Query: 588 
FAVGIARSRDGMRWVKLGKILGGGANGAFDECGIVNPKVTRNKKNGTYLMVYECVASDGE 409 + +GIARSRDGM+W+K+GKI+GGG NG FDE G +NP V R + G YLM YE V +G Sbjct: 317 YGIGIARSRDGMKWIKMGKIIGGGRNGGFDELGAMNPCVVRKRGGGEYLMAYEGVDGNGG 376 Query: 408 RSIGFAVSSDGLKQWKRVQDFPALKKSEENGWDCEGVGSPCLVQMDGELDEWRLYYXXXX 229 RSIG A+S DGLK+W R D LK SE +GWD +GVGSPCLVQMDGE DEWRLYY Sbjct: 377 RSIGLAISRDGLKEWTRCGDAVVLKSSEGSGWDSKGVGSPCLVQMDGEEDEWRLYYRGVG 436 Query: 228 XXXXXXXGLAISQGNELNNFQRWTGFH 148 G+A+S+G++ F+RW GFH Sbjct: 437 NGERTGIGMAVSEGSDYRRFRRWEGFH 463