BLASTX nr result
ID: Catharanthus22_contig00004553
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00004553 (2540 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004250874.1| PREDICTED: outer envelope protein 80, chloro... 962 0.0 ref|XP_006354253.1| PREDICTED: outer envelope protein 80, chloro... 961 0.0 ref|XP_004249210.1| PREDICTED: outer envelope protein 80, chloro... 954 0.0 ref|XP_006351245.1| PREDICTED: outer envelope protein 80, chloro... 954 0.0 ref|XP_006484493.1| PREDICTED: outer envelope protein 80, chloro... 927 0.0 ref|XP_002513472.1| sorting and assembly machinery (sam50) prote... 912 0.0 ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arab... 904 0.0 gb|EOY32604.1| Outer envelope protein of 80 kDa isoform 2 [Theob... 902 0.0 ref|XP_006287138.1| hypothetical protein CARUB_v10000309mg [Caps... 902 0.0 ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa,... 901 0.0 ref|NP_568378.1| outer envelope protein 80 [Arabidopsis thaliana... 897 0.0 ref|XP_006437641.1| hypothetical protein CICLE_v10030987mg [Citr... 890 0.0 ref|XP_006400523.1| hypothetical protein EUTSA_v10012770mg [Eutr... 890 0.0 ref|XP_002304768.1| hypothetical protein POPTR_0003s20390g [Popu... 888 0.0 ref|XP_003542049.2| PREDICTED: outer envelope protein 80, chloro... 884 0.0 gb|EOY32603.1| Outer envelope protein of 80 kDa isoform 1 [Theob... 884 0.0 gb|ESW22375.1| hypothetical protein PHAVU_005G148500g [Phaseolus... 883 0.0 ref|XP_003547118.1| PREDICTED: outer envelope protein 80, chloro... 881 0.0 gb|EMJ09540.1| hypothetical protein PRUPE_ppa002070mg [Prunus pe... 880 0.0 ref|XP_004296333.1| PREDICTED: outer envelope protein 80, chloro... 877 0.0 >ref|XP_004250874.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Solanum lycopersicum] Length = 698 Score = 962 bits (2487), Expect = 0.0 Identities = 474/576 (82%), Positives = 511/576 (88%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VRNKDGEELERKDLESEAL+ALKA RPNSALTVREVQEDVHRI+ASGYF Sbjct: 126 NEERVLISEVLVRNKDGEELERKDLESEALNALKACRPNSALTVREVQEDVHRIVASGYF 185 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA LP+RFIED+FRDGYGKI+NI+ LD Sbjct: 186 CSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGASVLPARFIEDSFRDGYGKIVNIKRLD 245 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDRTGEPTVGKTKP 1019 E+ISSINGWYMERGLFG VSG+E+LSGGM+RL+VSEAEVNNI IRFLD+TGEPTVGKT+P Sbjct: 246 EIISSINGWYMERGLFGAVSGIEMLSGGMIRLEVSEAEVNNITIRFLDKTGEPTVGKTRP 305 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGDTGKVDL +N+VERK Sbjct: 306 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLVMNVVERKS 365 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PLAGLIGSCAIYHKNLFG+NQKLNLSLERGQIDS+FRINYTDP Sbjct: 366 GGGISAGGGISSGITGGPLAGLIGSCAIYHKNLFGRNQKLNLSLERGQIDSIFRINYTDP 425 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIM+QNSRTPGTLVH N P SLTIGRVTAGIEYSRPFRPKWNGTAG Sbjct: 426 WIEGDDKRTSRSIMIQNSRTPGTLVH-NHP-GGSLTIGRVTAGIEYSRPFRPKWNGTAGI 483 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQ AGARDDKGNP+IRD+YSSPLTASGNTHDDMLLAK+ETVYTGSGDP SS+F FNMD Sbjct: 484 IFQRAGARDDKGNPIIRDYYSSPLTASGNTHDDMLLAKLETVYTGSGDP-GSSVFVFNMD 542 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PVW EWLVFNRVNARARKG+V+GP L S SGGHVVGNFPPHEAF +GGTNSVRGY Sbjct: 543 QGLPVWSEWLVFNRVNARARKGLVLGPMRLLLSFSGGHVVGNFPPHEAFVLGGTNSVRGY 602 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE EISFPL GP+EG +FADYGTDLGSGP+VPGDPAGARLKPGSGYG Sbjct: 603 EEGTVGSGRSYAVGCGEISFPLMGPLEGAVFADYGTDLGSGPSVPGDPAGARLKPGSGYG 662 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 G+GIRV+SPLGPLRLEYA NDQ+TGRFHFGVGLRN Sbjct: 663 CGVGIRVESPLGPLRLEYAFNDQRTGRFHFGVGLRN 698 >ref|XP_006354253.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Solanum tuberosum] Length = 698 Score = 961 bits (2484), Expect = 0.0 Identities = 473/576 (82%), Positives = 512/576 (88%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VRNKDGEELERKDLESEAL+ALKA RPNSALTVREVQEDVHRI+ASGYF Sbjct: 126 NEERVLISEVLVRNKDGEELERKDLESEALNALKACRPNSALTVREVQEDVHRIVASGYF 185 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVF+VEPNQ+F GLVCEGA+ LP+RFIED+FRDGYGKI+NI+ LD Sbjct: 186 CSCMPVAVDTRDGIRLVFKVEPNQEFHGLVCEGANVLPARFIEDSFRDGYGKIVNIKRLD 245 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDRTGEPTVGKTKP 1019 E+ISSINGWYMERGLFG VSG+E+LSGGM+RL+VSEAEVNNI IRFLDRTGEPTVGKT+P Sbjct: 246 EIISSINGWYMERGLFGAVSGIEMLSGGMIRLEVSEAEVNNITIRFLDRTGEPTVGKTRP 305 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGDTGKVDL +N+VERK Sbjct: 306 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLVMNVVERKS 365 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PLAGLIGSCAIYHKNLFG+NQKLNLSLERGQIDS+FRINYTDP Sbjct: 366 GAGISAGGGISSGITSGPLAGLIGSCAIYHKNLFGRNQKLNLSLERGQIDSIFRINYTDP 425 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRS+M+QNSRTPG+LVH N P SLTIGRVTAGIEYSRPFRPKWNGTAG Sbjct: 426 WIEGDDKRTSRSMMIQNSRTPGSLVH-NHP-GGSLTIGRVTAGIEYSRPFRPKWNGTAGI 483 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQ AGARDDKGNP+IRD+YSSPLTASGNTHDDMLLAK+ETVYTGSGDP SS+F FNMD Sbjct: 484 IFQRAGARDDKGNPIIRDYYSSPLTASGNTHDDMLLAKLETVYTGSGDP-GSSVFVFNMD 542 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PVW EWLVFNRVNARARKG+V+GP L S SGGHVVGNFPPHEAF +GGTNSVRGY Sbjct: 543 QGLPVWSEWLVFNRVNARARKGLVLGPMRLLLSFSGGHVVGNFPPHEAFVLGGTNSVRGY 602 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE EISFPL GP+EG +FADYGTDLGSGP+VPGDPAGARLKPGSGYG Sbjct: 603 EEGTVGSGRSYAVGCGEISFPLMGPLEGAVFADYGTDLGSGPSVPGDPAGARLKPGSGYG 662 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 G+GIRVDSPLGPLRLEYA NDQ+TGRFHFGVGLRN Sbjct: 663 CGVGIRVDSPLGPLRLEYAFNDQRTGRFHFGVGLRN 698 >ref|XP_004249210.1| PREDICTED: outer envelope protein 80, chloroplastic-like isoform 1 [Solanum lycopersicum] Length = 702 Score = 954 bits (2467), Expect = 0.0 Identities = 469/576 (81%), Positives = 513/576 (89%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VR+KDGEELERKDLE+E L+ALKA RPNSALTV+EVQEDVHRIIASGYF Sbjct: 130 NEERVLISEVLVRSKDGEELERKDLENEVLNALKACRPNSALTVQEVQEDVHRIIASGYF 189 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ LP++FIED+FRDGYGKI+NI+ +D Sbjct: 190 CSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANVLPAKFIEDSFRDGYGKIVNIKRID 249 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDRTGEPTVGKTKP 1019 E+ISSINGWYMERGLFG VSGVE+LSGGM+RL+VSEAEVNNIAIRFLD+TGEPTVGKT+P Sbjct: 250 EIISSINGWYMERGLFGAVSGVEMLSGGMIRLEVSEAEVNNIAIRFLDKTGEPTVGKTRP 309 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDV+T+L MGIMEDVSIIPQP+GDTGKVDL +N+VERK Sbjct: 310 ETILRQLTTKKGQVYSMLQGKRDVETVLAMGIMEDVSIIPQPSGDTGKVDLVMNVVERKS 369 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PLAGLIGSCAIYHKNLFG+NQKLNLSLERGQ+DS+FRINYTDP Sbjct: 370 GAGISAGGGISSGITSGPLAGLIGSCAIYHKNLFGRNQKLNLSLERGQVDSVFRINYTDP 429 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIM+QNSRTPGTLVH NQPD SLTIGRVTAGIEYSRPFRPKWNGTAG Sbjct: 430 WIEGDDKRTSRSIMIQNSRTPGTLVH-NQPD-GSLTIGRVTAGIEYSRPFRPKWNGTAGI 487 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQ AGARDDKG+P+IRD+YSSPLTASGNTHDDMLLAK+ETVYTGSGDP SS+F FNMD Sbjct: 488 IFQRAGARDDKGSPIIRDYYSSPLTASGNTHDDMLLAKLETVYTGSGDP-GSSVFVFNMD 546 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PVW +WLVFNRVNARARKG+ +GP L S SGGHVVGNFPPHEAFAIGGTNSVRGY Sbjct: 547 QGLPVWSDWLVFNRVNARARKGLALGPMHLLLSFSGGHVVGNFPPHEAFAIGGTNSVRGY 606 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE EISFPLTGPVEG +FADYG+DLGSGP+VPGDPAG R KPGSGYG Sbjct: 607 EEGAVGSSRSYVVGCGEISFPLTGPVEGAVFADYGSDLGSGPSVPGDPAGPRRKPGSGYG 666 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 G+GIRVDSPLGPLRLEYA NDQ+TGRFHFGVGLRN Sbjct: 667 CGVGIRVDSPLGPLRLEYAFNDQRTGRFHFGVGLRN 702 >ref|XP_006351245.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Solanum tuberosum] Length = 702 Score = 954 bits (2466), Expect = 0.0 Identities = 470/576 (81%), Positives = 510/576 (88%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VR+KDGEELERKDLESE L+ALKA RPNSALTV+EVQEDVHRIIASGYF Sbjct: 130 NEERVLISEVLVRSKDGEELERKDLESEVLNALKACRPNSALTVQEVQEDVHRIIASGYF 189 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ LP+RFIED+FRDGYGKI+NI+ +D Sbjct: 190 CSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANVLPARFIEDSFRDGYGKIVNIKRID 249 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDRTGEPTVGKTKP 1019 E+ISSINGWYMERGLFG VS VEILSGGM+RL++SEAEVNNIAIRFLD+TGEPTVGKT+P Sbjct: 250 EIISSINGWYMERGLFGAVSSVEILSGGMIRLEISEAEVNNIAIRFLDKTGEPTVGKTRP 309 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGDTGKVDL +N+VERK Sbjct: 310 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLVMNVVERKS 369 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PL GLIGSCAIYHKNLFG+NQKLNLSLERGQ+DS+FRINYTDP Sbjct: 370 GGGISAGGGISSGITSGPLTGLIGSCAIYHKNLFGRNQKLNLSLERGQVDSVFRINYTDP 429 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIM+QNSRTPGTLVH NQPD SLTIGRVTAGIEYSRPFRPKWNGTAG Sbjct: 430 WIEGDDKRTSRSIMIQNSRTPGTLVH-NQPD-GSLTIGRVTAGIEYSRPFRPKWNGTAGI 487 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQ AGARDDKG+P+IRD+YSSPLTASGNTHDDMLLAK+ETVYTGSGDP SS+F FNMD Sbjct: 488 IFQRAGARDDKGSPIIRDYYSSPLTASGNTHDDMLLAKLETVYTGSGDP-GSSVFVFNMD 546 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PVW +WLVFNRVNARARKG+ +GP L S SGGHVVGNFPPHEAFAIGGTNSVRGY Sbjct: 547 QGLPVWSDWLVFNRVNARARKGLALGPMHLLLSFSGGHVVGNFPPHEAFAIGGTNSVRGY 606 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE EISFPL GPVEG +FADYG+DLGSGP+VPGDPAG R KPGSGYG Sbjct: 607 EEGAVGSSRSYVVGCGEISFPLMGPVEGAVFADYGSDLGSGPSVPGDPAGPRRKPGSGYG 666 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 G+GIRVDSPLGPLRLEYA NDQ+TGRFHFGVGLRN Sbjct: 667 CGVGIRVDSPLGPLRLEYAFNDQRTGRFHFGVGLRN 702 >ref|XP_006484493.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Citrus sinensis] Length = 707 Score = 927 bits (2396), Expect = 0.0 Identities = 459/577 (79%), Positives = 505/577 (87%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 D+ERVLISEV VRNKDGEELERKDLE+EAL ALKA R NSALTVREVQEDVHRII SGYF Sbjct: 133 DEERVLISEVLVRNKDGEELERKDLETEALTALKACRANSALTVREVQEDVHRIIDSGYF 192 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ LP++F+EDAFRDGYGK++NIR LD Sbjct: 193 CSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANVLPTKFVEDAFRDGYGKVVNIRRLD 252 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 EVI+SINGWYMERGLFGMVSGVEILSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT+ Sbjct: 253 EVITSINGWYMERGLFGMVSGVEILSGGIIRLQVAEAEVNNISIRFLDRKTGEPTKGKTR 312 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQLTTKKGQVYSM QGKRDV+T+LTMGIMEDVSIIPQPAGDTGKVDL +N+VER Sbjct: 313 PETILRQLTTKKGQVYSMLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVER- 371 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL+GLIGS A H+N+FG+NQKLN+SLERGQIDS+FRINYTD Sbjct: 372 PSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNVFGRNQKLNISLERGQIDSIFRINYTD 431 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWIEGDDKRTSR+IMVQNSRTPGT VHGNQPDNSSLTIGRVTAG+E+SRP RPKW+GT G Sbjct: 432 PWIEGDDKRTSRTIMVQNSRTPGTHVHGNQPDNSSLTIGRVTAGMEFSRPIRPKWSGTVG 491 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 +FQH+GARD+KGNP+I+DFYSSPLTASG T+D+ML+AK E+VYTGSGD SSMF FNM Sbjct: 492 LIFQHSGARDEKGNPIIKDFYSSPLTASGKTNDEMLIAKFESVYTGSGD-QGSSMFVFNM 550 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 +QG+PVWPEWL FNRVNARARKG+ IGP L SLSGGHVVGNF PHEAFAIGGTNSVRG Sbjct: 551 EQGLPVWPEWLFFNRVNARARKGVEIGPARLLLSLSGGHVVGNFSPHEAFAIGGTNSVRG 610 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFP+ GPVEGVIF+DYGTDLGSGP+VPGDPAGARLKPGSGY Sbjct: 611 YEEGAVGSGRSYVVGSGEISFPMLGPVEGVIFSDYGTDLGSGPSVPGDPAGARLKPGSGY 670 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIRVDSPLGPLRLEYA ND++ RFHFGVG RN Sbjct: 671 GYGFGIRVDSPLGPLRLEYAFNDKQAKRFHFGVGYRN 707 >ref|XP_002513472.1| sorting and assembly machinery (sam50) protein, putative [Ricinus communis] gi|223547380|gb|EEF48875.1| sorting and assembly machinery (sam50) protein, putative [Ricinus communis] Length = 700 Score = 912 bits (2356), Expect = 0.0 Identities = 474/706 (67%), Positives = 533/706 (75%), Gaps = 4/706 (0%) Frame = +3 Query: 102 MPRNDGVCFTSCSLKLTPPHPPLAQLSNLQFTPQILINCLXXXXXXXXXXXXXXXSITQF 281 MP+ND V FTS SLK+ PP Q Q PQ+ + I++ Sbjct: 1 MPQNDTVRFTSSSLKIPLLPPPQQQ----QQAPQLSYTKISFTNFIDSLITRSKIHISRS 56 Query: 282 LNNLRE---PQKFLNSIHFRPPXXXXXXXXXXXXXXXXNNGGNNNAESDSTGPTQKXXXX 452 +N+ R+ P S+ + + +S + Sbjct: 57 VNSPRKLTLPLLCFASLSLPQSKDTVISESHTQSPILCSASLSLTQPGESENIVTQQKGS 116 Query: 453 XXXXXXXXIDQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDV 632 D+ERVLISEV VRNKDGEELERKDLE+EA+ ALKA R NSALTVREVQEDV Sbjct: 117 GGGLSGSRHDEERVLISEVLVRNKDGEELERKDLEAEAVAALKACRANSALTVREVQEDV 176 Query: 633 HRIIASGYFSSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYG 812 HRII SGYF SC PVAVDTRDGIRLVFQVEPNQ+F GLVCEGA LP++F++DAFR+GYG Sbjct: 177 HRIIDSGYFCSCTPVAVDTRDGIRLVFQVEPNQEFHGLVCEGASVLPTKFLQDAFREGYG 236 Query: 813 KIINIRHLDEVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-T 989 K++NIRHLD+VI+SINGWYMERGLFG+VSGVEILSGG+LRLQV+EAEVNNI+IRFLDR T Sbjct: 237 KVVNIRHLDDVITSINGWYMERGLFGLVSGVEILSGGILRLQVAEAEVNNISIRFLDRKT 296 Query: 990 GEPTVGKTKPETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVD 1169 GEPT GKTKPETILRQLTTKKGQVYSM QGKRDVDT+LTMGIMEDVSIIPQPAGDTGKVD Sbjct: 297 GEPTKGKTKPETILRQLTTKKGQVYSMLQGKRDVDTVLTMGIMEDVSIIPQPAGDTGKVD 356 Query: 1170 LTLNIVERKXXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQID 1349 L +N+VER PL+GLIGS H+N+FG+NQKLN+SLERGQID Sbjct: 357 LVMNVVER-PSGGFSAGGGISSGITSGPLSGLIGSFTYSHRNVFGRNQKLNISLERGQID 415 Query: 1350 SLFRINYTDPWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPF 1529 S+FRINYTDPWI+GDDKRTSR+IMVQNSRTPG LVH QP NSSLTIGRVTAG+E+SRP Sbjct: 416 SIFRINYTDPWIQGDDKRTSRTIMVQNSRTPGNLVHSYQPGNSSLTIGRVTAGVEFSRPL 475 Query: 1530 RPKWNGTAGFVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPA 1709 RPKW+GTAG +FQHAGA D+KGNP+I+D YSSPLTASG THD+MLLAK E+VYTGSGD Sbjct: 476 RPKWSGTAGLIFQHAGAHDEKGNPIIKDHYSSPLTASGKTHDNMLLAKFESVYTGSGD-H 534 Query: 1710 ASSMFAFNMDQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFA 1889 SSMF N++QG+P+WPEWL FNRVNARARKG+ IGP SLSGGHVVGNF PHEAFA Sbjct: 535 GSSMFVLNVEQGLPLWPEWLFFNRVNARARKGVEIGPALFLLSLSGGHVVGNFSPHEAFA 594 Query: 1890 IGGTNSVRGYEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAG 2069 IGGTNSVRGYEE EISFPL GPVEGV+FADYGTDLGSGPTVPGDPAG Sbjct: 595 IGGTNSVRGYEEGAVGSARSYAVGSGEISFPLMGPVEGVLFADYGTDLGSGPTVPGDPAG 654 Query: 2070 ARLKPGSGYGYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 ARLKPGSGYGYG G+RVDSPLGPLRLEYA ND+ RFHFGVG RN Sbjct: 655 ARLKPGSGYGYGFGMRVDSPLGPLRLEYAFNDKHAKRFHFGVGHRN 700 >ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arabidopsis lyrata subsp. lyrata] gi|297317733|gb|EFH48155.1| hypothetical protein ARALYDRAFT_909999 [Arabidopsis lyrata subsp. lyrata] Length = 732 Score = 904 bits (2337), Expect = 0.0 Identities = 450/576 (78%), Positives = 494/576 (85%), Gaps = 1/576 (0%) Frame = +3 Query: 483 QERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYFS 662 +ERVLISEV VR KDGEELERKDLE EAL ALKA R NSALT+REVQEDVHRII SGYF Sbjct: 159 EERVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFC 218 Query: 663 SCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLDE 842 SC PVAVDTRDGIRL+FQVEPNQ+F+GLVCE A+ LPS+FI++AFRDG+GK+INI+ L+E Sbjct: 219 SCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFRDGFGKVINIKRLEE 278 Query: 843 VISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTKP 1019 I+SINGWYMERGLFG+VS ++ LSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT P Sbjct: 279 AITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSP 338 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGDTGKVDL +N VER Sbjct: 339 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLIMNCVERPS 398 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PL+GLIGS A H+NLFG+NQKLN+SLERGQIDS+FRINYTDP Sbjct: 399 GGFSAGGGISSGITSG-PLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDP 457 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIMVQNSRTPG LVHGNQPDNSSLTIGRVTAGIEYSRPFRPKW+GTAG Sbjct: 458 WIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWSGTAGL 517 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQHAGARD++GNP+I+DFYSSPLTASG THDD LLAK+E++YTGSGD S+MFAFNM+ Sbjct: 518 IFQHAGARDEQGNPIIKDFYSSPLTASGKTHDDTLLAKLESIYTGSGD-RGSTMFAFNME 576 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PV PEWL FNRV RARKGI IGP FSLSGGHVVGNF PHEAF IGGTNS+RGY Sbjct: 577 QGLPVLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGNFSPHEAFVIGGTNSIRGY 636 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE E+SFP+ GPVEGVIF DYGTDLGSG TVPGDPAGARLKPGSGYG Sbjct: 637 EEGAVGSGRSYVVGSGEMSFPVRGPVEGVIFTDYGTDLGSGSTVPGDPAGARLKPGSGYG 696 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 YGLG+RVDSPLGPLRLEYA NDQ GRFHFGVGLRN Sbjct: 697 YGLGVRVDSPLGPLRLEYAFNDQHAGRFHFGVGLRN 732 >gb|EOY32604.1| Outer envelope protein of 80 kDa isoform 2 [Theobroma cacao] gi|508785349|gb|EOY32605.1| Outer envelope protein of 80 kDa isoform 2 [Theobroma cacao] gi|508785351|gb|EOY32607.1| Outer envelope protein of 80 kDa isoform 2 [Theobroma cacao] Length = 715 Score = 902 bits (2331), Expect = 0.0 Identities = 453/601 (75%), Positives = 501/601 (83%), Gaps = 1/601 (0%) Frame = +3 Query: 408 AESDSTGPTQKXXXXXXXXXXXXIDQERVLISEVWVRNKDGEELERKDLESEALDALKAS 587 A +DST + D+ERVLISEV VRNKDGEELE KDLE EAL ALKA Sbjct: 117 ASTDSTQSGSELPQKGQSATAGRHDEERVLISEVLVRNKDGEELEMKDLEMEALTALKAC 176 Query: 588 RPNSALTVREVQEDVHRIIASGYFSSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADA 767 R NSALTVREVQEDVHRII SGYFSSCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ Sbjct: 177 RANSALTVREVQEDVHRIIDSGYFSSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANV 236 Query: 768 LPSRFIEDAFRDGYGKIINIRHLDEVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSE 947 LPS+F+EDAFRDG+GK++N++ LDEVI+SINGWYMERGLFG+VSGV+ILSGG++RLQV+E Sbjct: 237 LPSKFLEDAFRDGHGKVVNLKRLDEVINSINGWYMERGLFGLVSGVDILSGGIIRLQVAE 296 Query: 948 AEVNNIAIRFLDR-TGEPTVGKTKPETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMED 1124 AEVNNI+IRFLDR TGEP GKTKPETILRQLTTKKGQVYSM QGKRDVDT+ TMG+MED Sbjct: 297 AEVNNISIRFLDRKTGEPCKGKTKPETILRQLTTKKGQVYSMLQGKRDVDTVSTMGLMED 356 Query: 1125 VSIIPQPAGDTGKVDLTLNIVERKXXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFG 1304 VSIIPQPAGD GKVDL +N+VER PL+GLIGS A H+NLFG Sbjct: 357 VSIIPQPAGDAGKVDLIMNVVER-PSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNLFG 415 Query: 1305 KNQKLNLSLERGQIDSLFRINYTDPWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSL 1484 +NQKLN+SLERGQIDS+FRINYTDPWIEGDDKRTSR+I+VQNSRTPGTLVHGN DNSSL Sbjct: 416 RNQKLNISLERGQIDSIFRINYTDPWIEGDDKRTSRTIIVQNSRTPGTLVHGNLHDNSSL 475 Query: 1485 TIGRVTAGIEYSRPFRPKWNGTAGFVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDML 1664 +IGRVTAG+E+SRP RPKWNGTAG +FQHAGARD+KGNP+I+DFY SPLTASG +DDML Sbjct: 476 SIGRVTAGVEFSRPIRPKWNGTAGLIFQHAGARDEKGNPIIKDFYGSPLTASGKPYDDML 535 Query: 1665 LAKIETVYTGSGDPAASSMFAFNMDQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLS 1844 LAK E+VYTGSGD SSMFAFNM+QG+PV PEWL FNRVNARARKG+ IGP L SLS Sbjct: 536 LAKFESVYTGSGD-QGSSMFAFNMEQGLPVMPEWLFFNRVNARARKGVEIGPARLLLSLS 594 Query: 1845 GGHVVGNFPPHEAFAIGGTNSVRGYEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYG 2024 GGHVVGNF PHEAFAIGGTNSVRGYEE E+SFP+ GPVEGV+FADYG Sbjct: 595 GGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSYVVGSSEVSFPMVGPVEGVMFADYG 654 Query: 2025 TDLGSGPTVPGDPAGARLKPGSGYGYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLR 2204 DL SGP VPGDPAGAR KPGSGYGYG GIRV+SPLGPLRLEYA ND++ RFHFGVG R Sbjct: 655 HDLWSGPNVPGDPAGARFKPGSGYGYGFGIRVESPLGPLRLEYAFNDRQAKRFHFGVGHR 714 Query: 2205 N 2207 N Sbjct: 715 N 715 >ref|XP_006287138.1| hypothetical protein CARUB_v10000309mg [Capsella rubella] gi|482555844|gb|EOA20036.1| hypothetical protein CARUB_v10000309mg [Capsella rubella] Length = 735 Score = 902 bits (2330), Expect = 0.0 Identities = 448/576 (77%), Positives = 496/576 (86%), Gaps = 1/576 (0%) Frame = +3 Query: 483 QERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYFS 662 +ERVLISEV VR KDGEELERKDLE EAL ALKA R NSALT+REVQEDVHRII SGYF Sbjct: 162 EERVLISEVLVRTKDGEELERKDLEIEALAALKACRANSALTIREVQEDVHRIIESGYFC 221 Query: 663 SCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLDE 842 SC PVAVDTRDGIRL+FQVEPNQ+F+GLVCE A+ LPS+FI++AFRDG+GK+INI+ L+E Sbjct: 222 SCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFRDGFGKVINIKRLEE 281 Query: 843 VISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTKP 1019 I+SINGWYMERGLFG+VS ++ LSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT P Sbjct: 282 AITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSP 341 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGD+GKVDL +N VER Sbjct: 342 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCVERPS 401 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PL+GLIGS A H+NLFG+NQKLN+SLERGQIDS+FRINYTDP Sbjct: 402 GGFSAGGGISSGITSG-PLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDP 460 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIMVQNSRTPG LVHGNQPDNSSLTIGRVTAG+EYSRPFRPKW+GTAG Sbjct: 461 WIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPFRPKWSGTAGL 520 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQHAGARD++GNP+I+DFYSSPLTASG THD+ LLAK+E++YTGSGD S+MFAFNM+ Sbjct: 521 IFQHAGARDEQGNPIIKDFYSSPLTASGKTHDETLLAKLESIYTGSGD-RGSTMFAFNME 579 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PV PEWL FNRV ARARKGI IGP FSLSGGHVVGNF PHEAF IGGTNSVRGY Sbjct: 580 QGLPVLPEWLCFNRVTARARKGIHIGPGRFLFSLSGGHVVGNFSPHEAFGIGGTNSVRGY 639 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE E+SFP+ GPVEGVIF DYGTD+GSG TVPGDPAGARLKPGSGYG Sbjct: 640 EEGAVGSGRSYVVGSGEMSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGARLKPGSGYG 699 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 YGLG+RVDSPLGPLRLEYA NDQ+ GRFHFGVGLRN Sbjct: 700 YGLGVRVDSPLGPLRLEYAFNDQQAGRFHFGVGLRN 735 >ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa, chloroplastic [Vitis vinifera] Length = 673 Score = 901 bits (2329), Expect = 0.0 Identities = 446/577 (77%), Positives = 492/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 D+ERVLISEV VRNKDGEELERKDLE+EA+ ALKA RPNSALTVREVQEDVHRII SG F Sbjct: 98 DEERVLISEVLVRNKDGEELERKDLEAEAVAALKACRPNSALTVREVQEDVHRIIDSGLF 157 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+FQGLVCEGA+ LPS+F+EDAFRDGYGK++NIR LD Sbjct: 158 WSCMPVAVDTRDGIRLVFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGYGKVVNIRRLD 217 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 +VI+SIN WY ERGLFGMVSGVEILSGG++RL+VSEAEVN+I++RFLDR TGEPT+GKTK Sbjct: 218 DVITSINDWYNERGLFGMVSGVEILSGGIIRLKVSEAEVNDISVRFLDRKTGEPTIGKTK 277 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQLTTKKGQVYS+ QGKRD +T+LTMGIMEDVSII Q GD K+DL +N+VER Sbjct: 278 PETILRQLTTKKGQVYSLIQGKRDAETVLTMGIMEDVSIIHQSVGDRDKIDLVMNVVERV 337 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL+GLIGS A H+N+FG+NQKLN+SLERGQ+DS+FRINYTD Sbjct: 338 SGGFSAGGGISRGITTSRPLSGLIGSFAYSHRNVFGRNQKLNVSLERGQVDSIFRINYTD 397 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWIEGDDKRTSRSIM+QNSRTPG LVHG QP NSSLTIGRVTAGIE+SRPFRP W+GT G Sbjct: 398 PWIEGDDKRTSRSIMIQNSRTPGILVHGGQPANSSLTIGRVTAGIEFSRPFRPNWSGTVG 457 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 +FQHAGA D+ G P+I+DFYSSPLTASGNTHDD LLAK E+VYTGSGD SSMF FNM Sbjct: 458 LIFQHAGAHDEHGKPIIKDFYSSPLTASGNTHDDALLAKFESVYTGSGD-HGSSMFVFNM 516 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 +QG+PV PEWL FNRVNARARKG+ IGP CL SLSGGHVVGNF PHEAFAIGGTNSVRG Sbjct: 517 EQGLPVLPEWLFFNRVNARARKGVEIGPACLLLSLSGGHVVGNFSPHEAFAIGGTNSVRG 576 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFPL GP+ G +FADYGTDLGSGPTVPGDPAGARLKPGSGY Sbjct: 577 YEEGAVGSGRSHVVGSGEISFPLYGPLGGALFADYGTDLGSGPTVPGDPAGARLKPGSGY 636 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIR+DSPLGPLRLEYA NDQ+ RFHFGVG RN Sbjct: 637 GYGFGIRLDSPLGPLRLEYAFNDQQAQRFHFGVGHRN 673 >ref|NP_568378.1| outer envelope protein 80 [Arabidopsis thaliana] gi|75168961|sp|Q9C5J8.1|OEP80_ARATH RecName: Full=Outer envelope protein 80, chloroplastic; AltName: Full=Chloroplastic outer envelope protein of 80 kDa; Short=AtOEP80; AltName: Full=Protein TOC75-V; Short=AtToc75-V gi|13430586|gb|AAK25915.1|AF360205_1 unknown protein [Arabidopsis thaliana] gi|14532858|gb|AAK64111.1| unknown protein [Arabidopsis thaliana] gi|332005348|gb|AED92731.1| outer envelope protein 80 [Arabidopsis thaliana] Length = 732 Score = 897 bits (2317), Expect = 0.0 Identities = 445/576 (77%), Positives = 491/576 (85%), Gaps = 1/576 (0%) Frame = +3 Query: 483 QERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYFS 662 +ERVLISEV VR KDGEELERKDLE EAL ALKA R NSALT+REVQEDVHRII SGYF Sbjct: 159 EERVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFC 218 Query: 663 SCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLDE 842 SC PVAVDTRDGIRL+FQVEPNQ+F+GLVCE A+ LPS+FI +AFRDG+GK+INI+ L+E Sbjct: 219 SCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIHEAFRDGFGKVINIKRLEE 278 Query: 843 VISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTKP 1019 I+SINGWYMERGLFG+VS ++ LSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT P Sbjct: 279 AITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSP 338 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDVDT+L MGIMEDVSIIPQPAGD+GKVDL +N VER Sbjct: 339 ETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCVERPS 398 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PL+GLIGS A H+NLFG+NQKLN+SLERGQIDS+FRINYTDP Sbjct: 399 GGFSAGGGISSGITSG-PLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDP 457 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WIEGDDKRTSRSIMVQNSRTPG LVHGNQPDNSSLTIGRVTAG+EYSRPFRPKWNGTAG Sbjct: 458 WIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPFRPKWNGTAGL 517 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +FQHAGARD++GNP+I+DFYSSPLTASG HD+ +LAK+E++YTGSGD S+MFAFNM+ Sbjct: 518 IFQHAGARDEQGNPIIKDFYSSPLTASGKPHDETMLAKLESIYTGSGD-QGSTMFAFNME 576 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PV PEWL FNRV RARKGI IGP FSLSGGHVVG F PHEAF IGGTNSVRGY Sbjct: 577 QGLPVLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGKFSPHEAFVIGGTNSVRGY 636 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE E+SFP+ GPVEGVIF DYGTD+GSG TVPGDPAGARLKPGSGYG Sbjct: 637 EEGAVGSGRSYVVGSGELSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGARLKPGSGYG 696 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 YGLG+RVDSPLGPLRLEYA NDQ GRFHFGVGLRN Sbjct: 697 YGLGVRVDSPLGPLRLEYAFNDQHAGRFHFGVGLRN 732 >ref|XP_006437641.1| hypothetical protein CICLE_v10030987mg [Citrus clementina] gi|557539837|gb|ESR50881.1| hypothetical protein CICLE_v10030987mg [Citrus clementina] Length = 612 Score = 890 bits (2301), Expect = 0.0 Identities = 446/577 (77%), Positives = 491/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 D+ERVLISEV VRNKDGEELERKDLE+EAL ALKA R NSALTVREVQEDVHRII SGYF Sbjct: 52 DEERVLISEVLVRNKDGEELERKDLETEALTALKACRANSALTVREVQEDVHRIIDSGYF 111 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ LP++F+EDAFRDGYGK++NIR LD Sbjct: 112 CSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANVLPTKFVEDAFRDGYGKVVNIRRLD 171 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 EVI+SINGWYMERGLFGMVSGVEILSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT+ Sbjct: 172 EVITSINGWYMERGLFGMVSGVEILSGGIIRLQVAEAEVNNISIRFLDRKTGEPTKGKTR 231 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQLTTKKGQVYSM QGKRDV+T+LTMGIMEDVSIIPQPAGDTGKVDL +N+VER Sbjct: 232 PETILRQLTTKKGQVYSMLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVER- 290 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL+GLIGS A H+N+FG+NQKLN+SLERGQIDS+FRINYTD Sbjct: 291 PSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNVFGRNQKLNISLERGQIDSIFRINYTD 350 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWIEGDDKRTSR+IMVQNSRTPGT VHGNQPDNSSLTIGRVTAG+E+SRP RPKW+GT G Sbjct: 351 PWIEGDDKRTSRTIMVQNSRTPGTHVHGNQPDNSSLTIGRVTAGMEFSRPIRPKWSGTVG 410 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 +FQH+GARD+KGNP+I+DFYSSPLTASG T+D+ML+AK E+VYTGSGD +S Sbjct: 411 LIFQHSGARDEKGNPIIKDFYSSPLTASGKTNDEMLIAKFESVYTGSGDQGSSM------ 464 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 WL FNRVNARARKG+ IGP L SLSGGHVVGNF PHEAFAIGGTNSVRG Sbjct: 465 ---------WLFFNRVNARARKGVEIGPARLLLSLSGGHVVGNFSPHEAFAIGGTNSVRG 515 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFP+ GPVEGVIF+DYGTDLGSGP+VPGDPAGARLKPGSGY Sbjct: 516 YEEGAVGSGRSYVVGSGEISFPMLGPVEGVIFSDYGTDLGSGPSVPGDPAGARLKPGSGY 575 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIRVDSPLGPLRLEYA ND++ RFHFGVG RN Sbjct: 576 GYGFGIRVDSPLGPLRLEYAFNDKQAKRFHFGVGYRN 612 >ref|XP_006400523.1| hypothetical protein EUTSA_v10012770mg [Eutrema salsugineum] gi|557101613|gb|ESQ41976.1| hypothetical protein EUTSA_v10012770mg [Eutrema salsugineum] Length = 743 Score = 890 bits (2300), Expect = 0.0 Identities = 445/585 (76%), Positives = 495/585 (84%), Gaps = 10/585 (1%) Frame = +3 Query: 483 QERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYFS 662 +ERVLISEV VR KDGEELERKDLE EAL ALKA R NSALT+REVQEDVHRII SGYF Sbjct: 161 EERVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFC 220 Query: 663 SCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLDE 842 SC PVAVDTRDGIRL+FQVEPNQ+F+GLVCE A+ LPS+FI++AF+DG+GK+INI+ L+E Sbjct: 221 SCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFQDGFGKVINIKRLEE 280 Query: 843 VISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTKP 1019 I+SINGWYMERGLFG+VS ++ LSGG++RLQV+EAEVNNI+IRFLDR TGEPT GKT+ Sbjct: 281 AITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTRV 340 Query: 1020 ETILRQLTTKKGQV---------YSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDL 1172 ETILRQLTTKKGQV YSM QGKRDVDT+L MGIMEDVSIIPQPAGD+GKVDL Sbjct: 341 ETILRQLTTKKGQVFLESLSLDVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDL 400 Query: 1173 TLNIVERKXXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDS 1352 +N VER PL+GLIGS A H+N+ G+NQKLN+SLERGQIDS Sbjct: 401 IMNCVERPSGGFSAGGGISSGITSG-PLSGLIGSFAYSHRNILGRNQKLNVSLERGQIDS 459 Query: 1353 LFRINYTDPWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFR 1532 +FRINYTDPWIEGDDKRTSRSIMVQNSRTPG LVHGNQPDN++LTIGRVTAGIEYSRPFR Sbjct: 460 IFRINYTDPWIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNANLTIGRVTAGIEYSRPFR 519 Query: 1533 PKWNGTAGFVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAA 1712 PKW+GTAG +FQHAGARD++GNP+I+DFYSSPLTASG THDD LLAK E++YTGSGD Sbjct: 520 PKWSGTAGLIFQHAGARDEQGNPIIKDFYSSPLTASGKTHDDTLLAKFESIYTGSGD-HG 578 Query: 1713 SSMFAFNMDQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAI 1892 S+MFAFNM+QG+PV PEWL FNRVNAR RKGI IGPT FSLSGGHVVGNF PHEAFAI Sbjct: 579 STMFAFNMEQGLPVLPEWLFFNRVNARTRKGIHIGPTRFLFSLSGGHVVGNFSPHEAFAI 638 Query: 1893 GGTNSVRGYEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGA 2072 GGTNSVRGYEE E+SFP+ GPVEGV+F DYGTDLGSGPTVPGDPAGA Sbjct: 639 GGTNSVRGYEEGAVGSGRSYVVGSGEVSFPMRGPVEGVLFTDYGTDLGSGPTVPGDPAGA 698 Query: 2073 RLKPGSGYGYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 RLKPGSGYGYG G+RVDSPLGPLRLEYA ND+ TGRFHFGVG RN Sbjct: 699 RLKPGSGYGYGFGVRVDSPLGPLRLEYAFNDKHTGRFHFGVGHRN 743 >ref|XP_002304768.1| hypothetical protein POPTR_0003s20390g [Populus trichocarpa] gi|222842200|gb|EEE79747.1| hypothetical protein POPTR_0003s20390g [Populus trichocarpa] Length = 682 Score = 888 bits (2295), Expect = 0.0 Identities = 444/603 (73%), Positives = 493/603 (81%), Gaps = 1/603 (0%) Frame = +3 Query: 402 NNAESDSTGPTQKXXXXXXXXXXXXIDQERVLISEVWVRNKDGEELERKDLESEALDALK 581 ++ +SDS QK D+ERVLISEV VRNKDGEELERKDLE+EAL ALK Sbjct: 94 DSTQSDSVVAQQKSGGASGVHGPSRYDEERVLISEVLVRNKDGEELERKDLEAEALAALK 153 Query: 582 ASRPNSALTVREVQEDVHRIIASGYFSSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGA 761 A R NSALTVREVQEDVHR+I+SGYF SCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA Sbjct: 154 ACRANSALTVREVQEDVHRVISSGYFCSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGA 213 Query: 762 DALPSRFIEDAFRDGYGKIINIRHLDEVISSINGWYMERGLFGMVSGVEILSGGMLRLQV 941 LP++F++DAFR GYGK++NI+ LDEVISSIN WYMERGLFGMVS EILSGG++RLQ+ Sbjct: 214 SVLPTKFLQDAFRGGYGKVVNIKQLDEVISSINSWYMERGLFGMVSNAEILSGGIIRLQI 273 Query: 942 SEAEVNNIAIRFLDR-TGEPTVGKTKPETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIM 1118 +EAEVN+I+IRFLDR TGEPT GKTKPETILRQLTTKKGQVYSM QGKRDVDT+LTMGIM Sbjct: 274 AEAEVNDISIRFLDRKTGEPTKGKTKPETILRQLTTKKGQVYSMLQGKRDVDTVLTMGIM 333 Query: 1119 EDVSIIPQPAGDTGKVDLTLNIVERKXXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNL 1298 EDVS IPQPA DTGKVDL +N+VER G+ A H+N+ Sbjct: 334 EDVSFIPQPAEDTGKVDLIMNVVERPNGGFSAG-------------GGISSGFAYSHRNV 380 Query: 1299 FGKNQKLNLSLERGQIDSLFRINYTDPWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNS 1478 FG+NQKLN+SLERGQIDS+FRINYTDPWIEGDDKRTSR+IMVQNSRTPG LVHGNQP N+ Sbjct: 381 FGRNQKLNISLERGQIDSIFRINYTDPWIEGDDKRTSRTIMVQNSRTPGNLVHGNQPVNN 440 Query: 1479 SLTIGRVTAGIEYSRPFRPKWNGTAGFVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDD 1658 SLTIGRV AGIE+SRP RPKW+GT G +FQHAGAR++KG+P I+D Y+SPLTASG HDD Sbjct: 441 SLTIGRVAAGIEFSRPLRPKWSGTVGLIFQHAGARNEKGDPKIKDHYNSPLTASGKNHDD 500 Query: 1659 MLLAKIETVYTGSGDPAASSMFAFNMDQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFS 1838 MLLAK E+VYTGSGD SSMF FNM+QG+P+WPEWL FNRVN RARKG+ IGP S Sbjct: 501 MLLAKFESVYTGSGD-HGSSMFVFNMEQGLPLWPEWLFFNRVNTRARKGVEIGPALCLLS 559 Query: 1839 LSGGHVVGNFPPHEAFAIGGTNSVRGYEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFAD 2018 LSGGHV+GNF PHEAFAIGGTNSVRGYEE EISFP+ GPVEGV FAD Sbjct: 560 LSGGHVMGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSYAVGSGEISFPVLGPVEGVFFAD 619 Query: 2019 YGTDLGSGPTVPGDPAGARLKPGSGYGYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVG 2198 YGTDLGSGP+VPGDPAGARLKPGSGYGYG GIRVDSPLGPLRLEYA ND+ T RFHFGVG Sbjct: 620 YGTDLGSGPSVPGDPAGARLKPGSGYGYGFGIRVDSPLGPLRLEYAFNDRHTKRFHFGVG 679 Query: 2199 LRN 2207 RN Sbjct: 680 HRN 682 >ref|XP_003542049.2| PREDICTED: outer envelope protein 80, chloroplastic-like isoform X1 [Glycine max] Length = 685 Score = 884 bits (2285), Expect = 0.0 Identities = 443/577 (76%), Positives = 492/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VRNKDGEELERKDLE+EA ALKA RPNSALTVREVQEDVHRII SGYF Sbjct: 112 NEERVLISEVLVRNKDGEELERKDLEAEAAQALKACRPNSALTVREVQEDVHRIINSGYF 171 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SSCMPVAVDTRDGIRLVFQVEPNQ+FQGLVCEGA+ LP++F+ED+ RDGYGKIIN+R LD Sbjct: 172 SSCMPVAVDTRDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLEDSMRDGYGKIINLRRLD 231 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 E ISSIN WYMERGLF MVS VEILSGG+LRLQVSEAEV+NI+IRFLDR TGE T+GKTK Sbjct: 232 EAISSINNWYMERGLFAMVSAVEILSGGILRLQVSEAEVDNISIRFLDRKTGETTMGKTK 291 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQ+TTKKGQVYSM +GKRDV+T+LTMGIMEDVSIIPQPA DTGKVDL +N+VER Sbjct: 292 PETILRQITTKKGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVER- 349 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL GLIGS A H+N+FGKNQKLN+SLERGQIDS++RINYTD Sbjct: 350 PSGGFSAGGGISSGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTD 409 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWI+GDDKRTSR+IM+QNSRTPGT+VHGN N SLTIGR+T GIE+SRP RPKW+GTAG Sbjct: 410 PWIQGDDKRTSRTIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTAG 469 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 VFQHAG RD+KG P+I+D YSSPLTASGNTHDD LLAK+ETVYTGSGD SS+F NM Sbjct: 470 LVFQHAGVRDEKGIPIIKDCYSSPLTASGNTHDDTLLAKLETVYTGSGD-HGSSLFVLNM 528 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 ++G+P+ PEWL F RVNARARKG+ IGP LH S+SGGHVVGNF P+EAFAIGGTNSVRG Sbjct: 529 EKGLPLLPEWLSFTRVNARARKGVEIGPARLHLSISGGHVVGNFSPYEAFAIGGTNSVRG 588 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFP+ GPVEGVIF+DYGTDLGSGPTVPGDPAGAR KPGSGY Sbjct: 589 YEEGSVGSGRSYIVGSGEISFPMYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGY 648 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIRV+SPLGPLRLEYA ND++ RFHFGVG RN Sbjct: 649 GYGFGIRVESPLGPLRLEYAFNDKQDKRFHFGVGHRN 685 >gb|EOY32603.1| Outer envelope protein of 80 kDa isoform 1 [Theobroma cacao] Length = 755 Score = 884 bits (2284), Expect = 0.0 Identities = 444/589 (75%), Positives = 492/589 (83%), Gaps = 1/589 (0%) Frame = +3 Query: 408 AESDSTGPTQKXXXXXXXXXXXXIDQERVLISEVWVRNKDGEELERKDLESEALDALKAS 587 A +DST + D+ERVLISEV VRNKDGEELE KDLE EAL ALKA Sbjct: 117 ASTDSTQSGSELPQKGQSATAGRHDEERVLISEVLVRNKDGEELEMKDLEMEALTALKAC 176 Query: 588 RPNSALTVREVQEDVHRIIASGYFSSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADA 767 R NSALTVREVQEDVHRII SGYFSSCMPVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+ Sbjct: 177 RANSALTVREVQEDVHRIIDSGYFSSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANV 236 Query: 768 LPSRFIEDAFRDGYGKIINIRHLDEVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSE 947 LPS+F+EDAFRDG+GK++N++ LDEVI+SINGWYMERGLFG+VSGV+ILSGG++RLQV+E Sbjct: 237 LPSKFLEDAFRDGHGKVVNLKRLDEVINSINGWYMERGLFGLVSGVDILSGGIIRLQVAE 296 Query: 948 AEVNNIAIRFLDR-TGEPTVGKTKPETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMED 1124 AEVNNI+IRFLDR TGEP GKTKPETILRQLTTKKGQVYSM QGKRDVDT+ TMG+MED Sbjct: 297 AEVNNISIRFLDRKTGEPCKGKTKPETILRQLTTKKGQVYSMLQGKRDVDTVSTMGLMED 356 Query: 1125 VSIIPQPAGDTGKVDLTLNIVERKXXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFG 1304 VSIIPQPAGD GKVDL +N+VER PL+GLIGS A H+NLFG Sbjct: 357 VSIIPQPAGDAGKVDLIMNVVER-PSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNLFG 415 Query: 1305 KNQKLNLSLERGQIDSLFRINYTDPWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSL 1484 +NQKLN+SLERGQIDS+FRINYTDPWIEGDDKRTSR+I+VQNSRTPGTLVHGN DNSSL Sbjct: 416 RNQKLNISLERGQIDSIFRINYTDPWIEGDDKRTSRTIIVQNSRTPGTLVHGNLHDNSSL 475 Query: 1485 TIGRVTAGIEYSRPFRPKWNGTAGFVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDML 1664 +IGRVTAG+E+SRP RPKWNGTAG +FQHAGARD+KGNP+I+DFY SPLTASG +DDML Sbjct: 476 SIGRVTAGVEFSRPIRPKWNGTAGLIFQHAGARDEKGNPIIKDFYGSPLTASGKPYDDML 535 Query: 1665 LAKIETVYTGSGDPAASSMFAFNMDQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLS 1844 LAK E+VYTGSGD SSMFAFNM+QG+PV PEWL FNRVNARARKG+ IGP L SLS Sbjct: 536 LAKFESVYTGSGD-QGSSMFAFNMEQGLPVMPEWLFFNRVNARARKGVEIGPARLLLSLS 594 Query: 1845 GGHVVGNFPPHEAFAIGGTNSVRGYEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYG 2024 GGHVVGNF PHEAFAIGGTNSVRGYEE E+SFP+ GPVEGV+FADYG Sbjct: 595 GGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSYVVGSSEVSFPMVGPVEGVMFADYG 654 Query: 2025 TDLGSGPTVPGDPAGARLKPGSGYGYGLGIRVDSPLGPLRLEYALNDQK 2171 DL SGP VPGDPAGAR KPGSGYGYG GIRV+SPLGPLRLEYA ND++ Sbjct: 655 HDLWSGPNVPGDPAGARFKPGSGYGYGFGIRVESPLGPLRLEYAFNDRQ 703 >gb|ESW22375.1| hypothetical protein PHAVU_005G148500g [Phaseolus vulgaris] Length = 675 Score = 883 bits (2282), Expect = 0.0 Identities = 441/577 (76%), Positives = 493/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VRNKDGEE+ERKDLE+EA+ ALKA RPNSALTVREVQEDVHRII SGYF Sbjct: 102 NEERVLISEVLVRNKDGEEMERKDLEAEAVQALKACRPNSALTVREVQEDVHRIINSGYF 161 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SSCMPVAVDTRDGIRLVFQVEPNQ+FQGLVCEGA+ LP++F+E++ RDGYGKIIN+R LD Sbjct: 162 SSCMPVAVDTRDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLENSMRDGYGKIINLRRLD 221 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 E ISSIN WYMERGLF MVS VEILSGG+LRLQVSEAEVNNI+IRFLDR TGE T+GKTK Sbjct: 222 EAISSINNWYMERGLFAMVSAVEILSGGILRLQVSEAEVNNISIRFLDRKTGEITMGKTK 281 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQ+TTKKGQVYSM +GKRDV+T+LTMGIMEDVSIIPQP DTGKVDL +N+VER Sbjct: 282 PETILRQITTKKGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPE-DTGKVDLVMNVVER- 339 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL GLIGS A H+N+FGKNQKLN+SLERGQIDS++RINYTD Sbjct: 340 PSGGFSAGGGISSGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTD 399 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWI+GDD+RTSR+IM+QNSRTPGT+VHGN N SLTIGR+T GIE+SRP RPKW+GTAG Sbjct: 400 PWIQGDDRRTSRTIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTAG 459 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 VFQHAG RD+KG P+I+D +SSPLTASGNTHD+ LLAK+ETVYTGSGD SSMF NM Sbjct: 460 LVFQHAGVRDEKGIPIIKDCFSSPLTASGNTHDETLLAKLETVYTGSGD-HGSSMFVLNM 518 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 ++G+P+ PEWL F RVNARARKG+ IGP LH S+SGGHVVGNFPP+EAFAIGGTNSVRG Sbjct: 519 EKGLPLLPEWLSFTRVNARARKGVEIGPARLHLSISGGHVVGNFPPYEAFAIGGTNSVRG 578 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFP+ GPVEGVIF+DYGTDLGSGPTVPGDPAGAR KPGSGY Sbjct: 579 YEEGSVGSGRSYVVGSGEISFPMYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGY 638 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIRV+SPLGPLRLEYA ND+K RFHFGVG RN Sbjct: 639 GYGFGIRVESPLGPLRLEYAFNDKKERRFHFGVGHRN 675 >ref|XP_003547118.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Glycine max] Length = 677 Score = 881 bits (2276), Expect = 0.0 Identities = 440/577 (76%), Positives = 491/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 ++ERVLISEV VRNKDGEELERKDLE+EA ALKA RPNSALTVREVQEDVHRII SGYF Sbjct: 104 NEERVLISEVLVRNKDGEELERKDLEAEAAQALKACRPNSALTVREVQEDVHRIINSGYF 163 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SSCMPVAVDTRDGIRLVFQVEPNQ+FQGLVCEGA+ LP++F+ED+ RDGYGKIIN+R LD Sbjct: 164 SSCMPVAVDTRDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLEDSMRDGYGKIINLRRLD 223 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 E +SSIN WYMERGLF MVS VEILSGG+LRLQVSEAEV+NI+IRFLDR TGE T+GKTK Sbjct: 224 EALSSINNWYMERGLFAMVSAVEILSGGILRLQVSEAEVDNISIRFLDRKTGETTMGKTK 283 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQ+TTKKGQVYSM +GKRDV+T+LTMGIMEDVSIIPQPA DTGKVDL +N+VER Sbjct: 284 PETILRQITTKKGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVER- 341 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL GLIGS A H+N+FGKNQKLN+SLERGQIDS++RINYTD Sbjct: 342 PSGGFSAGGGISSGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTD 401 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWI+GDDKRTSR+IM+QNSRTPGT+VHGN N SLTIGR+T GIE+SRP RPKW+GT G Sbjct: 402 PWIQGDDKRTSRTIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTVG 461 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 VFQHAG RD++G P+I+D YSSPLTASGNTHDD LLAK+ETVYTGSGD SSMF NM Sbjct: 462 LVFQHAGVRDEQGIPIIKDCYSSPLTASGNTHDDTLLAKLETVYTGSGD-HGSSMFVLNM 520 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 ++G+P+ PEWL F RVNARARKG+ IGP LH S+SGGHVVGNF P+EAFAIGGTNSVRG Sbjct: 521 EKGLPLLPEWLSFTRVNARARKGVEIGPARLHLSISGGHVVGNFSPYEAFAIGGTNSVRG 580 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE E+SFP+ GPVEGVIF+DYGTDLGSGPTVPGDPAGAR KPGSGY Sbjct: 581 YEEGSVGSGRSYVVGSGEVSFPVYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGY 640 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIRV+SPLGPLRLEYA ND++ RFHFGVG RN Sbjct: 641 GYGFGIRVESPLGPLRLEYAFNDKQDKRFHFGVGHRN 677 >gb|EMJ09540.1| hypothetical protein PRUPE_ppa002070mg [Prunus persica] Length = 721 Score = 880 bits (2274), Expect = 0.0 Identities = 441/577 (76%), Positives = 493/577 (85%), Gaps = 1/577 (0%) Frame = +3 Query: 480 DQERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYF 659 D+ERVLISEV VRNKDGEELERKDLE+EAL ALKA RPNSALTV EVQEDV RI SGYF Sbjct: 148 DEERVLISEVLVRNKDGEELERKDLEAEALAALKACRPNSALTVSEVQEDVQRIFDSGYF 207 Query: 660 SSCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLD 839 SCMPVAVDTRDGIRL+FQV+PNQ+FQGLVCEGA+ LP++FI+DAF DGYGK+IN++ L+ Sbjct: 208 CSCMPVAVDTRDGIRLIFQVKPNQEFQGLVCEGANVLPAKFIKDAFCDGYGKVINLKRLN 267 Query: 840 EVISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTK 1016 EVISSIN WYM+RGLF MVS VE LSGG+L+LQVSEAEVNNI+IRFLDR TGEPTVGKTK Sbjct: 268 EVISSINDWYMDRGLFAMVSAVESLSGGVLKLQVSEAEVNNISIRFLDRKTGEPTVGKTK 327 Query: 1017 PETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERK 1196 PETILRQLTTKKGQVYSM QGKRDV+T+LTMG+MEDVSIIPQPA D GKVD+T+N+VER Sbjct: 328 PETILRQLTTKKGQVYSMLQGKRDVETVLTMGLMEDVSIIPQPA-DAGKVDITMNVVER- 385 Query: 1197 XXXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTD 1376 PL+GLIGS A H+NLFG+NQKL++SLERGQIDS+FRINY+D Sbjct: 386 PSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNLFGRNQKLHVSLERGQIDSIFRINYSD 445 Query: 1377 PWIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAG 1556 PWI GDD RTSR+IMVQNSRTPGTL+HGNQ D S+LTIGR+TAGIE+SRP RPK +GTAG Sbjct: 446 PWIAGDDMRTSRTIMVQNSRTPGTLIHGNQQDGSNLTIGRITAGIEFSRPIRPKLSGTAG 505 Query: 1557 FVFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNM 1736 +FQHAGARD++GNP+I+DF+SSPLTASGN HDDMLLAK+E+VYTGSGD SSM NM Sbjct: 506 LIFQHAGARDERGNPIIKDFFSSPLTASGNNHDDMLLAKLESVYTGSGD-HGSSMLVLNM 564 Query: 1737 DQGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRG 1916 +QG+PV PEWLVFNR+NARARK + +GP SLSGGHVVGNFPPHEAFAIGGTNSVRG Sbjct: 565 EQGLPVLPEWLVFNRINARARKDLELGPARFLLSLSGGHVVGNFPPHEAFAIGGTNSVRG 624 Query: 1917 YEEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 2096 YEE EISFP+ GPV GVIFADYGTDLGSGPTVPGDPAGARLKPGSGY Sbjct: 625 YEEGAVGSGRSYTVGSGEISFPVIGPVGGVIFADYGTDLGSGPTVPGDPAGARLKPGSGY 684 Query: 2097 GYGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 GYG GIR+DSPLGPLRLEYA ND+ T RFHFGVG RN Sbjct: 685 GYGFGIRLDSPLGPLRLEYAFNDKHTKRFHFGVGHRN 721 >ref|XP_004296333.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 680 Score = 877 bits (2265), Expect = 0.0 Identities = 432/576 (75%), Positives = 494/576 (85%), Gaps = 1/576 (0%) Frame = +3 Query: 483 QERVLISEVWVRNKDGEELERKDLESEALDALKASRPNSALTVREVQEDVHRIIASGYFS 662 +ERVLISEV +RNKDGEELERKDLE EAL ALKA R NSALTVREVQEDVHRII SGYF Sbjct: 107 EERVLISEVLIRNKDGEELERKDLELEALGALKACRANSALTVREVQEDVHRIIDSGYFC 166 Query: 663 SCMPVAVDTRDGIRLVFQVEPNQDFQGLVCEGADALPSRFIEDAFRDGYGKIINIRHLDE 842 CMPVA+DTRDGIRL+FQV+PNQ+FQGLVCEGA+ LP++F++DAF DGYGK+IN++ L+E Sbjct: 167 QCMPVAIDTRDGIRLIFQVKPNQEFQGLVCEGANVLPAKFLKDAFYDGYGKVINLKRLNE 226 Query: 843 VISSINGWYMERGLFGMVSGVEILSGGMLRLQVSEAEVNNIAIRFLDR-TGEPTVGKTKP 1019 VI+SIN WYM+RGLF MVS VE+LSGG+L+LQVSE EVNNIAIRFLDR TGEPT+GKTKP Sbjct: 227 VITSINDWYMDRGLFAMVSAVEVLSGGILKLQVSETEVNNIAIRFLDRKTGEPTIGKTKP 286 Query: 1020 ETILRQLTTKKGQVYSMFQGKRDVDTLLTMGIMEDVSIIPQPAGDTGKVDLTLNIVERKX 1199 ETILRQLTTKKGQVYSM QGKRDV+T+LTMG+MEDVSIIPQPAG++GKVD+ +N+VER Sbjct: 287 ETILRQLTTKKGQVYSMLQGKRDVETVLTMGLMEDVSIIPQPAGESGKVDIVMNVVER-P 345 Query: 1200 XXXXXXXXXXXXXXXXXPLAGLIGSCAIYHKNLFGKNQKLNLSLERGQIDSLFRINYTDP 1379 PL+GLIGS A H+NLFG+NQKL++SLERGQIDSLFRINY+DP Sbjct: 346 SGGFSAGGGISSGITSGPLSGLIGSFAYSHRNLFGRNQKLHVSLERGQIDSLFRINYSDP 405 Query: 1380 WIEGDDKRTSRSIMVQNSRTPGTLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWNGTAGF 1559 WI GDD RTSR+IMVQNSRTPGTL+HGNQ D S+LTIGR++AGI++SRP RPKW+GTAG Sbjct: 406 WISGDDMRTSRTIMVQNSRTPGTLIHGNQLDGSNLTIGRISAGIDFSRPIRPKWSGTAGL 465 Query: 1560 VFQHAGARDDKGNPVIRDFYSSPLTASGNTHDDMLLAKIETVYTGSGDPAASSMFAFNMD 1739 +QHAGARD++G+P+I+DF+SSPLTASGN++D+MLLAK+ETVYTGSGD SSM FNM+ Sbjct: 466 TYQHAGARDEEGSPIIKDFFSSPLTASGNSYDEMLLAKLETVYTGSGD-RGSSMLKFNME 524 Query: 1740 QGIPVWPEWLVFNRVNARARKGIVIGPTCLHFSLSGGHVVGNFPPHEAFAIGGTNSVRGY 1919 QG+PV P+WL FNR NARARK + IG L FS+SGGHV+GNFPPHEAF IGGTNSVRGY Sbjct: 525 QGLPVLPDWLFFNRTNARARKDLEIGLAHLLFSVSGGHVIGNFPPHEAFVIGGTNSVRGY 584 Query: 1920 EEXXXXXXXXXXXXXXEISFPLTGPVEGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 2099 EE EISFPL GPV GVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG Sbjct: 585 EEGAVGSGRSYAVGSGEISFPLVGPVGGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYG 644 Query: 2100 YGLGIRVDSPLGPLRLEYALNDQKTGRFHFGVGLRN 2207 YGLGIR+DSPLGPLRLEYA ND+ T RFHFGVG RN Sbjct: 645 YGLGIRLDSPLGPLRLEYAFNDKGTPRFHFGVGHRN 680