BLASTX nr result

ID: Paeonia22_contig00004478 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00004478
         (2541 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa,...   967   0.0  
ref|XP_002513472.1| sorting and assembly machinery (sam50) prote...   963   0.0  
ref|XP_006484493.1| PREDICTED: outer envelope protein 80, chloro...   962   0.0  
ref|XP_007014985.1| Outer envelope protein of 80 kDa isoform 2 [...   956   0.0  
ref|XP_006400523.1| hypothetical protein EUTSA_v10012770mg [Eutr...   941   0.0  
ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arab...   940   0.0  
ref|NP_568378.1| outer envelope protein 80 [Arabidopsis thaliana...   939   0.0  
ref|XP_007208341.1| hypothetical protein PRUPE_ppa002070mg [Prun...   938   0.0  
ref|XP_006287138.1| hypothetical protein CARUB_v10000309mg [Caps...   936   0.0  
ref|XP_007014984.1| Outer envelope protein of 80 kDa isoform 1 [...   935   0.0  
ref|XP_003542049.2| PREDICTED: outer envelope protein 80, chloro...   927   0.0  
ref|XP_002304768.1| hypothetical protein POPTR_0003s20390g [Popu...   927   0.0  
gb|EXB93281.1| Outer envelope protein 80 [Morus notabilis]            926   0.0  
ref|XP_003547118.1| PREDICTED: outer envelope protein 80, chloro...   915   0.0  
ref|XP_006437641.1| hypothetical protein CICLE_v10030987mg [Citr...   912   0.0  
ref|XP_007150381.1| hypothetical protein PHAVU_005G148500g [Phas...   911   0.0  
ref|XP_003597441.1| Outer envelope protein of 80 kDa [Medicago t...   905   0.0  
ref|XP_004296333.1| PREDICTED: outer envelope protein 80, chloro...   899   0.0  
ref|XP_004142120.1| PREDICTED: outer envelope protein 80, chloro...   893   0.0  
ref|XP_004161694.1| PREDICTED: LOW QUALITY PROTEIN: outer envelo...   887   0.0  

>ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa, chloroplastic [Vitis
            vinifera]
          Length = 673

 Score =  967 bits (2501), Expect = 0.0
 Identities = 519/745 (69%), Positives = 564/745 (75%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            M +N+ VRFT SSLK+P S  S             F SQ L SHLT+   S+I L+ SF+
Sbjct: 1    MSKNEDVRFTSSSLKIPLSPPS-------------FFSQTLGSHLTEATKSVIHLVNSFR 47

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
                 + R PL   AR +                      LLCS++L+ ++  ES+Q   
Sbjct: 48   -----NFRKPLNFLARPSP---------------------LLCSASLSLSQPAESTQ--- 78

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEVL 1884
             E   TQ KG                              Q ++RH REDEERVLISEVL
Sbjct: 79   LEVAATQPKG------------------------------QTVARHPREDEERVLISEVL 108

Query: 1883 VRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDTR 1704
            VRNKDGEELERKDLEAEAVAAL+ACRPNSALT REVQEDVHRII+SG F SCMPVAVDTR
Sbjct: 109  VRNKDGEELERKDLEAEAVAALKACRPNSALTVREVQEDVHRIIDSGLFWSCMPVAVDTR 168

Query: 1703 DGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWYM 1524
            DGIRL+FQVEPNQEFQGLVCEGANVLPSKFLEDAFRDG+GKVVNIRRLD+VITSIN WY 
Sbjct: 169  DGIRLVFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGYGKVVNIRRLDDVITSINDWYN 228

Query: 1523 ERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTTK 1344
            ERGLFG+VS VEILSGGII+L+VSEAEVN+IS+RFLDRK+GEPT+GKTKPETILRQLTTK
Sbjct: 229  ERGLFGMVSGVEILSGGIIRLKVSEAEVNDISVRFLDRKTGEPTIGKTKPETILRQLTTK 288

Query: 1343 KGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXXX 1164
            KGQVYSL+QGKRD ETVLTMGIMEDVSII Q  GD  K+DL MNVVERV           
Sbjct: 289  KGQVYSLIQGKRDAETVLTMGIMEDVSIIHQSVGDRDKIDLVMNVVERVSGGFSAGGGIS 348

Query: 1163 XXXXXXXXXXXXXS-FAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                           FAYSHRNVFGRNQKLN+SLERGQ+DSIFR+NYTDPWIEGDDKRTS
Sbjct: 349  RGITTSRPLSGLIGSFAYSHRNVFGRNQKLNVSLERGQVDSIFRINYTDPWIEGDDKRTS 408

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            RSIMIQNSRTPG LVHG QP  S+LTIGRVTAGIEFSRPF P WSGT GLIFQ AGA DE
Sbjct: 409  RSIMIQNSRTPGILVHGGQPANSSLTIGRVTAGIEFSRPFRPNWSGTVGLIFQHAGAHDE 468

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
             G PIIKD+YSSPLTASGNT+DD LLAK E VYTGSG HGS MFVFNMEQGLPVLPEWLF
Sbjct: 469  HGKPIIKDFYSSPLTASGNTHDDALLAKFESVYTGSGDHGSSMFVFNMEQGLPVLPEWLF 528

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNRVNAR R+GVEIGPA  LLS SGGHVVGNF+P+EAFAIGGTNSVRGYEE         
Sbjct: 529  FNRVNARARKGVEIGPACLLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSH 588

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GEISFPL GP+ GA+FADYGTDLGSG TVPGDPAGAR KPGSGYG GFGIR+DSPL
Sbjct: 589  VVGSGEISFPLYGPLGGALFADYGTDLGSGPTVPGDPAGARLKPGSGYGYGFGIRLDSPL 648

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+QA+RFHF VGHRN
Sbjct: 649  GPLRLEYAFNDQQAQRFHFGVGHRN 673


>ref|XP_002513472.1| sorting and assembly machinery (sam50) protein, putative [Ricinus
            communis] gi|223547380|gb|EEF48875.1| sorting and
            assembly machinery (sam50) protein, putative [Ricinus
            communis]
          Length = 700

 Score =  963 bits (2490), Expect = 0.0
 Identities = 512/746 (68%), Positives = 577/746 (77%), Gaps = 2/746 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLP--HSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITS 2250
            MP+ND VRFT SSLK+P     +     P  + + +SF +  + S +T++K  + + + S
Sbjct: 1    MPQNDTVRFTSSSLKIPLLPPPQQQQQAPQLSYTKISF-TNFIDSLITRSKIHISRSVNS 59

Query: 2249 FKTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQG 2070
             +  ++     PL  FA L+    +D  ++     E   +  +LCS++L+  +  ES   
Sbjct: 60   PRKLTL-----PLLCFASLSLPQSKDTVIS-----ESHTQSPILCSASLSLTQPGES--- 106

Query: 2069 ALEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISE 1890
               E  +TQ+KG                         GG++    SRH   DEERVLISE
Sbjct: 107  ---ENIVTQQKGS-----------------------GGGLSG---SRH---DEERVLISE 134

Query: 1889 VLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVD 1710
            VLVRNKDGEELERKDLEAEAVAAL+ACR NSALT REVQEDVHRII+SGYFCSC PVAVD
Sbjct: 135  VLVRNKDGEELERKDLEAEAVAALKACRANSALTVREVQEDVHRIIDSGYFCSCTPVAVD 194

Query: 1709 TRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGW 1530
            TRDGIRL+FQVEPNQEF GLVCEGA+VLP+KFL+DAFR+G+GKVVNIR LD+VITSINGW
Sbjct: 195  TRDGIRLVFQVEPNQEFHGLVCEGASVLPTKFLQDAFREGYGKVVNIRHLDDVITSINGW 254

Query: 1529 YMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLT 1350
            YMERGLFGLVS VEILSGGI++LQV+EAEVNNISIRFLDRK+GEPT GKTKPETILRQLT
Sbjct: 255  YMERGLFGLVSGVEILSGGILRLQVAEAEVNNISIRFLDRKTGEPTKGKTKPETILRQLT 314

Query: 1349 TKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXX 1170
            TKKGQVYS+LQGKRDV+TVLTMGIMEDVSIIPQPAGDTGKVDL MNVVER          
Sbjct: 315  TKKGQVYSMLQGKRDVDTVLTMGIMEDVSIIPQPAGDTGKVDLVMNVVERPSGGFSAGGG 374

Query: 1169 XXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRT 990
                           SF YSHRNVFGRNQKLNISLERGQIDSIFR+NYTDPWI+GDDKRT
Sbjct: 375  ISSGITSGPLSGLIGSFTYSHRNVFGRNQKLNISLERGQIDSIFRINYTDPWIQGDDKRT 434

Query: 989  SRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARD 810
            SR+IM+QNSRTPG LVH  QP  S+LTIGRVTAG+EFSRP  PKWSGTAGLIFQ AGA D
Sbjct: 435  SRTIMVQNSRTPGNLVHSYQPGNSSLTIGRVTAGVEFSRPLRPKWSGTAGLIFQHAGAHD 494

Query: 809  EKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWL 630
            EKG+PIIKD+YSSPLTASG T+D+ LLAK E VYTGSG HGS MFV N+EQGLP+ PEWL
Sbjct: 495  EKGNPIIKDHYSSPLTASGKTHDNMLLAKFESVYTGSGDHGSSMFVLNVEQGLPLWPEWL 554

Query: 629  FFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXX 450
            FFNRVNAR R+GVEIGPALFLLS SGGHVVGNF+P+EAFAIGGTNSVRGYEE        
Sbjct: 555  FFNRVNARARKGVEIGPALFLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSARS 614

Query: 449  XXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSP 270
                 GEISFPL+GPVEG +FADYGTDLGSG TVPGDPAGAR KPGSGYG GFG+RVDSP
Sbjct: 615  YAVGSGEISFPLMGPVEGVLFADYGTDLGSGPTVPGDPAGARLKPGSGYGYGFGMRVDSP 674

Query: 269  LGPLRLEYAFNDRQAKRFHFAVGHRN 192
            LGPLRLEYAFND+ AKRFHF VGHRN
Sbjct: 675  LGPLRLEYAFNDKHAKRFHFGVGHRN 700


>ref|XP_006484493.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Citrus
            sinensis]
          Length = 707

 Score =  962 bits (2487), Expect = 0.0
 Identities = 521/747 (69%), Positives = 570/747 (76%), Gaps = 5/747 (0%)
 Frame = -2

Query: 2417 RNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF--K 2244
            RND VRF  S LK+P         P      + F +Q     LTK+K+SL  LI S    
Sbjct: 4    RNDDVRFISSPLKIP---------PFRPEPPVPFFAQT----LTKSKNSLSHLIYSLNES 50

Query: 2243 TRSIFHHRSPLFSFAR--LNESTRQDDGVAQRRGR-EGRVKPTLLCSSTLAWNRSEESSQ 2073
            TRS       L SFA     +S R         G  +  V   LLCS++L+ N+S     
Sbjct: 51   TRSTEPFTRKLQSFAEHLYGKSVRICSTCLSMTGAVDTLVNFPLLCSASLSLNQSSAE-- 108

Query: 2072 GALEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLIS 1893
                                         +S+ STQL+   AQQ  S   R DEERVLIS
Sbjct: 109  --------------------------FPAQSELSTQLQ-QKAQQPHS-VSRSDEERVLIS 140

Query: 1892 EVLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAV 1713
            EVLVRNKDGEELERKDLE EA+ AL+ACR NSALT REVQEDVHRII+SGYFCSCMPVAV
Sbjct: 141  EVLVRNKDGEELERKDLETEALTALKACRANSALTVREVQEDVHRIIDSGYFCSCMPVAV 200

Query: 1712 DTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSING 1533
            DTRDGIRL+FQVEPNQEF GLVCEGANVLP+KF+EDAFRDG+GKVVNIRRLDEVITSING
Sbjct: 201  DTRDGIRLVFQVEPNQEFHGLVCEGANVLPTKFVEDAFRDGYGKVVNIRRLDEVITSING 260

Query: 1532 WYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQL 1353
            WYMERGLFG+VS VEILSGGII+LQV+EAEVNNISIRFLDRK+GEPT GKT+PETILRQL
Sbjct: 261  WYMERGLFGMVSGVEILSGGIIRLQVAEAEVNNISIRFLDRKTGEPTKGKTRPETILRQL 320

Query: 1352 TTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXX 1173
            TTKKGQVYS+LQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDL MNVVER         
Sbjct: 321  TTKKGQVYSMLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERPSGGFSAGG 380

Query: 1172 XXXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKR 993
                            SFAYSHRNVFGRNQKLNISLERGQIDSIFR+NYTDPWIEGDDKR
Sbjct: 381  GISSGITSGPLSGLIGSFAYSHRNVFGRNQKLNISLERGQIDSIFRINYTDPWIEGDDKR 440

Query: 992  TSRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGAR 813
            TSR+IM+QNSRTPGT VHG+QP+ S+LTIGRVTAG+EFSRP  PKWSGT GLIFQ +GAR
Sbjct: 441  TSRTIMVQNSRTPGTHVHGNQPDNSSLTIGRVTAGMEFSRPIRPKWSGTVGLIFQHSGAR 500

Query: 812  DEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEW 633
            DEKG+PIIKD+YSSPLTASG TND+ L+AK E VYTGSG  GS MFVFNMEQGLPV PEW
Sbjct: 501  DEKGNPIIKDFYSSPLTASGKTNDEMLIAKFESVYTGSGDQGSSMFVFNMEQGLPVWPEW 560

Query: 632  LFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXX 453
            LFFNRVNAR R+GVEIGPA  LLS SGGHVVGNF+P+EAFAIGGTNSVRGYEE       
Sbjct: 561  LFFNRVNARARKGVEIGPARLLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGR 620

Query: 452  XXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDS 273
                  GEISFP++GPVEG +F+DYGTDLGSG +VPGDPAGAR KPGSGYG GFGIRVDS
Sbjct: 621  SYVVGSGEISFPMLGPVEGVIFSDYGTDLGSGPSVPGDPAGARLKPGSGYGYGFGIRVDS 680

Query: 272  PLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            PLGPLRLEYAFND+QAKRFHF VG+RN
Sbjct: 681  PLGPLRLEYAFNDKQAKRFHFGVGYRN 707


>ref|XP_007014985.1| Outer envelope protein of 80 kDa isoform 2 [Theobroma cacao]
            gi|590583754|ref|XP_007014986.1| Outer envelope protein
            of 80 kDa isoform 2 [Theobroma cacao]
            gi|590583762|ref|XP_007014988.1| Outer envelope protein
            of 80 kDa isoform 2 [Theobroma cacao]
            gi|508785348|gb|EOY32604.1| Outer envelope protein of 80
            kDa isoform 2 [Theobroma cacao]
            gi|508785349|gb|EOY32605.1| Outer envelope protein of 80
            kDa isoform 2 [Theobroma cacao]
            gi|508785351|gb|EOY32607.1| Outer envelope protein of 80
            kDa isoform 2 [Theobroma cacao]
          Length = 715

 Score =  956 bits (2470), Expect = 0.0
 Identities = 513/745 (68%), Positives = 568/745 (76%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            M  NDGV FT SSLK+P          +P+SS     SQ L S L +T HS+ QLI S +
Sbjct: 1    MHPNDGVSFTSSSLKIP----------LPSSSPS--LSQALASQLARTGHSVFQLIDSLR 48

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
             RS +  R+PL   +R  EST+ D G++       R  P LL S +L+  RS + +Q   
Sbjct: 49   NRSNYV-RNPL---SRSTESTQSDLGISSLF----RSSP-LLFSLSLSLTRSTDPTQN-- 97

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRH-GREDEERVLISEV 1887
                       +               S +STQ    + Q+  S   GR DEERVLISEV
Sbjct: 98   -------HNIAKSPLLCSASLSLTQPASTDSTQSGSELPQKGQSATAGRHDEERVLISEV 150

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVRNKDGEELE KDLE EA+ AL+ACR NSALT REVQEDVHRII+SGYF SCMPVAVDT
Sbjct: 151  LVRNKDGEELEMKDLEMEALTALKACRANSALTVREVQEDVHRIIDSGYFSSCMPVAVDT 210

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQVEPNQEF GLVCEGANVLPSKFLEDAFRDGHGKVVN++RLDEVI SINGWY
Sbjct: 211  RDGIRLVFQVEPNQEFHGLVCEGANVLPSKFLEDAFRDGHGKVVNLKRLDEVINSINGWY 270

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            MERGLFGLVS V+ILSGGII+LQV+EAEVNNISIRFLDRK+GEP  GKTKPETILRQLTT
Sbjct: 271  MERGLFGLVSGVDILSGGIIRLQVAEAEVNNISIRFLDRKTGEPCKGKTKPETILRQLTT 330

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+LQGKRDV+TV TMG+MEDVSIIPQPAGD GKVDL MNVVER           
Sbjct: 331  KKGQVYSMLQGKRDVDTVSTMGLMEDVSIIPQPAGDAGKVDLIMNVVERPSGGFSAGGGI 390

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRN+FGRNQKLNISLERGQIDSIFR+NYTDPWIEGDDKRTS
Sbjct: 391  SSGITSGPLSGLIGSFAYSHRNLFGRNQKLNISLERGQIDSIFRINYTDPWIEGDDKRTS 450

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+I++QNSRTPGTLVHG+  + S+L+IGRVTAG+EFSRP  PKW+GTAGLIFQ AGARDE
Sbjct: 451  RTIIVQNSRTPGTLVHGNLHDNSSLSIGRVTAGVEFSRPIRPKWNGTAGLIFQHAGARDE 510

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            KG+PIIKD+Y SPLTASG   DD LLAK E VYTGSG  GS MF FNMEQGLPV+PEWLF
Sbjct: 511  KGNPIIKDFYGSPLTASGKPYDDMLLAKFESVYTGSGDQGSSMFAFNMEQGLPVMPEWLF 570

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNRVNAR R+GVEIGPA  LLS SGGHVVGNF+P+EAFAIGGTNSVRGYEE         
Sbjct: 571  FNRVNARARKGVEIGPARLLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSY 630

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                 E+SFP++GPVEG +FADYG DL SG  VPGDPAGAR KPGSGYG GFGIRV+SPL
Sbjct: 631  VVGSSEVSFPMVGPVEGVMFADYGHDLWSGPNVPGDPAGARFKPGSGYGYGFGIRVESPL 690

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFNDRQAKRFHF VGHRN
Sbjct: 691  GPLRLEYAFNDRQAKRFHFGVGHRN 715


>ref|XP_006400523.1| hypothetical protein EUTSA_v10012770mg [Eutrema salsugineum]
            gi|557101613|gb|ESQ41976.1| hypothetical protein
            EUTSA_v10012770mg [Eutrema salsugineum]
          Length = 743

 Score =  941 bits (2433), Expect = 0.0
 Identities = 501/758 (66%), Positives = 570/758 (75%), Gaps = 14/758 (1%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            M R+D V F+ SS+++ HS  SSH       ++L  CS+ L S L+ T+ SL +L+ S K
Sbjct: 1    MQRHDDVHFSSSSIRI-HS--SSHDQSF--LANLQSCSKTLASQLSTTRLSLGRLLKSLK 55

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
             R    H SP F+  R N  T+  + + Q    +  + P++  S +L        S    
Sbjct: 56   NR----HSSPRFTQNRPNSPTQMLNSITQLMIGKSSLAPSV--SLSLIHPAQSIWSDSGA 109

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGG--VAQQKLSRH---GREDEERVL 1899
            +  G+T                  L R  ESTQ   G  V QQ+L +     R  EERVL
Sbjct: 110  DNKGLT----AGINSPLLCCASLSLTRPSESTQSVEGKDVIQQQLQKGHSVSRNAEERVL 165

Query: 1898 ISEVLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPV 1719
            ISEVLVR KDGEELERKDLE EA+AAL+ACR NSALT REVQEDVHRII SGYFCSC PV
Sbjct: 166  ISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFCSCTPV 225

Query: 1718 AVDTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSI 1539
            AVDTRDGIRL+FQVEPNQEF+GLVCE ANVLPSKF+++AF+DG GKV+NI+RL+E ITSI
Sbjct: 226  AVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFQDGFGKVINIKRLEEAITSI 285

Query: 1538 NGWYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILR 1359
            NGWYMERGLFG+VSD++ LSGGI++LQV+EAEVNNISIRFLDRK+GEPT GKT+ ETILR
Sbjct: 286  NGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTRVETILR 345

Query: 1358 QLTTKKGQV---------YSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVV 1206
            QLTTKKGQV         YS+LQGKRDV+TVL MGIMEDVSIIPQPAGD+GKVDL MN V
Sbjct: 346  QLTTKKGQVFLESLSLDVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCV 405

Query: 1205 ERVXXXXXXXXXXXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNY 1026
            ER                         SFAYSHRN+ GRNQKLN+SLERGQIDSIFR+NY
Sbjct: 406  ERPSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNILGRNQKLNVSLERGQIDSIFRINY 465

Query: 1025 TDPWIEGDDKRTSRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGT 846
            TDPWIEGDDKRTSRSIM+QNSRTPG LVHG+QP+ +NLTIGRVTAGIE+SRPF PKWSGT
Sbjct: 466  TDPWIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNANLTIGRVTAGIEYSRPFRPKWSGT 525

Query: 845  AGLIFQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFN 666
            AGLIFQ AGARDE+G+PIIKD+YSSPLTASG T+DDTLLAK E +YTGSG HGS MF FN
Sbjct: 526  AGLIFQHAGARDEQGNPIIKDFYSSPLTASGKTHDDTLLAKFESIYTGSGDHGSTMFAFN 585

Query: 665  MEQGLPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVR 486
            MEQGLPVLPEWLFFNRVNAR R+G+ IGP  FL S SGGHVVGNF+P+EAFAIGGTNSVR
Sbjct: 586  MEQGLPVLPEWLFFNRVNARTRKGIHIGPTRFLFSLSGGHVVGNFSPHEAFAIGGTNSVR 645

Query: 485  GYEEXXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSG 306
            GYEE             GE+SFP+ GPVEG +F DYGTDLGSG TVPGDPAGAR KPGSG
Sbjct: 646  GYEEGAVGSGRSYVVGSGEVSFPMRGPVEGVLFTDYGTDLGSGPTVPGDPAGARLKPGSG 705

Query: 305  YGCGFGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            YG GFG+RVDSPLGPLRLEYAFND+   RFHF VGHRN
Sbjct: 706  YGYGFGVRVDSPLGPLRLEYAFNDKHTGRFHFGVGHRN 743


>ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arabidopsis lyrata subsp.
            lyrata] gi|297317733|gb|EFH48155.1| hypothetical protein
            ARALYDRAFT_909999 [Arabidopsis lyrata subsp. lyrata]
          Length = 732

 Score =  940 bits (2430), Expect = 0.0
 Identities = 493/744 (66%), Positives = 559/744 (75%), Gaps = 4/744 (0%)
 Frame = -2

Query: 2411 DGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFKTRSI 2232
            D VRF+ SS+++   +   HS      ++L  CS+   S L  T+ SL Q++ S + R  
Sbjct: 7    DDVRFSSSSIRIHSPSSKEHSL----LTNLKSCSKTFVSQLCNTRLSLTQMLESLRNR-- 60

Query: 2231 FHHRSPLFSFARLNESTRQDDGVAQRR-GREGRVKPTLLCSSTLAWNRSEESSQGALEEG 2055
                +P  S  R N  T+  + V Q   G+   +  +L+ S+ L W+         +  G
Sbjct: 61   ---HTPPRSVRRPNLPTQMLNSVTQLMIGKSSPISLSLIQSTQLNWSSGSGDENVEIIRG 117

Query: 2054 GMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGG---VAQQKLSRHGREDEERVLISEVL 1884
                                 L R +ESTQ   G   V QQK     R  EERVLISEVL
Sbjct: 118  ---------LNSPLLCCASLSLTRPNESTQSVEGKDIVQQQKGHSVSRNAEERVLISEVL 168

Query: 1883 VRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDTR 1704
            VR KDGEELERKDLE EA+AAL+ACR NSALT REVQEDVHRII SGYFCSC PVAVDTR
Sbjct: 169  VRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFCSCTPVAVDTR 228

Query: 1703 DGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWYM 1524
            DGIRL+FQVEPNQEF+GLVCE ANVLPSKF+++AFRDG GKV+NI+RL+E ITSINGWYM
Sbjct: 229  DGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFRDGFGKVINIKRLEEAITSINGWYM 288

Query: 1523 ERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTTK 1344
            ERGLFG+VSD++ LSGGI++LQV+EAEVNNISIRFLDRK+GEPT GKT PETILRQLTTK
Sbjct: 289  ERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSPETILRQLTTK 348

Query: 1343 KGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXXX 1164
            KGQVYS+LQGKRDV+TVL MGIMEDVSIIPQPAGDTGKVDL MN VER            
Sbjct: 349  KGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLIMNCVERPSGGFSAGGGIS 408

Query: 1163 XXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTSR 984
                         SFAYSHRN+FGRNQKLN+SLERGQIDSIFR+NYTDPWIEGDDKRTSR
Sbjct: 409  SGITSGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDPWIEGDDKRTSR 468

Query: 983  SIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDEK 804
            SIM+QNSRTPG LVHG+QP+ S+LTIGRVTAGIE+SRPF PKWSGTAGLIFQ AGARDE+
Sbjct: 469  SIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWSGTAGLIFQHAGARDEQ 528

Query: 803  GSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLFF 624
            G+PIIKD+YSSPLTASG T+DDTLLAK+E +YTGSG  GS MF FNMEQGLPVLPEWL F
Sbjct: 529  GNPIIKDFYSSPLTASGKTHDDTLLAKLESIYTGSGDRGSTMFAFNMEQGLPVLPEWLCF 588

Query: 623  NRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXXX 444
            NRV  R R+G+ IGPA FL S SGGHVVGNF+P+EAF IGGTNS+RGYEE          
Sbjct: 589  NRVTGRARKGIHIGPARFLFSLSGGHVVGNFSPHEAFVIGGTNSIRGYEEGAVGSGRSYV 648

Query: 443  XXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPLG 264
               GE+SFP+ GPVEG +F DYGTDLGSGSTVPGDPAGAR KPGSGYG G G+RVDSPLG
Sbjct: 649  VGSGEMSFPVRGPVEGVIFTDYGTDLGSGSTVPGDPAGARLKPGSGYGYGLGVRVDSPLG 708

Query: 263  PLRLEYAFNDRQAKRFHFAVGHRN 192
            PLRLEYAFND+ A RFHF VG RN
Sbjct: 709  PLRLEYAFNDQHAGRFHFGVGLRN 732


>ref|NP_568378.1| outer envelope protein 80 [Arabidopsis thaliana]
            gi|75168961|sp|Q9C5J8.1|OEP80_ARATH RecName: Full=Outer
            envelope protein 80, chloroplastic; AltName:
            Full=Chloroplastic outer envelope protein of 80 kDa;
            Short=AtOEP80; AltName: Full=Protein TOC75-V;
            Short=AtToc75-V gi|13430586|gb|AAK25915.1|AF360205_1
            unknown protein [Arabidopsis thaliana]
            gi|14532858|gb|AAK64111.1| unknown protein [Arabidopsis
            thaliana] gi|332005348|gb|AED92731.1| outer envelope
            protein 80 [Arabidopsis thaliana]
          Length = 732

 Score =  939 bits (2426), Expect = 0.0
 Identities = 491/745 (65%), Positives = 564/745 (75%), Gaps = 4/745 (0%)
 Frame = -2

Query: 2414 NDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFKTRS 2235
            ND VRF+ SS+++ HS        + T+  L  CS+   SHL+ T++SL Q++ S K R 
Sbjct: 5    NDDVRFSSSSIRI-HSPSPKEQHSLLTN--LQSCSKTFVSHLSNTRNSLNQMLQSLKNR- 60

Query: 2234 IFHHRSPLFSFARLNESTRQDDGVAQRR-GREGRVKPTLLCSSTLAWNRSEESSQGALEE 2058
               H  P  S  R N  T+  + V Q   G+   +  +L+ S+   W+ S + +   +  
Sbjct: 61   ---HTPPPRSVRRPNLPTQMLNSVTQLMIGKSSPISLSLIQSTQFNWSESRDENVETIR- 116

Query: 2057 GGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGG---VAQQKLSRHGREDEERVLISEV 1887
             G++                  L R +ESTQ   G   V QQK     R  EERVLISEV
Sbjct: 117  -GLSSP--------LLCCASLSLTRPNESTQSVEGKDTVQQQKGHSVSRNAEERVLISEV 167

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVR KDGEELERKDLE EA+AAL+ACR NSALT REVQEDVHRII SGYFCSC PVAVDT
Sbjct: 168  LVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFCSCTPVAVDT 227

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQVEPNQEF+GLVCE ANVLPSKF+ +AFRDG GKV+NI+RL+E ITSINGWY
Sbjct: 228  RDGIRLMFQVEPNQEFRGLVCENANVLPSKFIHEAFRDGFGKVINIKRLEEAITSINGWY 287

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            MERGLFG+VSD++ LSGGI++LQV+EAEVNNISIRFLDRK+GEPT GKT PETILRQLTT
Sbjct: 288  MERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSPETILRQLTT 347

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+LQGKRDV+TVL MGIMEDVSIIPQPAGD+GKVDL MN VER           
Sbjct: 348  KKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCVERPSGGFSAGGGI 407

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRN+FGRNQKLN+SLERGQIDSIFR+NYTDPWIEGDDKRTS
Sbjct: 408  SSGITSGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDPWIEGDDKRTS 467

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            RSIM+QNSRTPG LVHG+QP+ S+LTIGRVTAG+E+SRPF PKW+GTAGLIFQ AGARDE
Sbjct: 468  RSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPFRPKWNGTAGLIFQHAGARDE 527

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            +G+PIIKD+YSSPLTASG  +D+T+LAK+E +YTGSG  GS MF FNMEQGLPVLPEWL 
Sbjct: 528  QGNPIIKDFYSSPLTASGKPHDETMLAKLESIYTGSGDQGSTMFAFNMEQGLPVLPEWLC 587

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNRV  R R+G+ IGPA FL S SGGHVVG F+P+EAF IGGTNSVRGYEE         
Sbjct: 588  FNRVTGRARKGIHIGPARFLFSLSGGHVVGKFSPHEAFVIGGTNSVRGYEEGAVGSGRSY 647

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GE+SFP+ GPVEG +F DYGTD+GSGSTVPGDPAGAR KPGSGYG G G+RVDSPL
Sbjct: 648  VVGSGELSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGARLKPGSGYGYGLGVRVDSPL 707

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+ A RFHF VG RN
Sbjct: 708  GPLRLEYAFNDQHAGRFHFGVGLRN 732


>ref|XP_007208341.1| hypothetical protein PRUPE_ppa002070mg [Prunus persica]
            gi|462403983|gb|EMJ09540.1| hypothetical protein
            PRUPE_ppa002070mg [Prunus persica]
          Length = 721

 Score =  938 bits (2424), Expect = 0.0
 Identities = 501/745 (67%), Positives = 567/745 (76%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPS-SLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            MP ND VRFT S S+K+P   ++          DL F          +T++S  QLI S 
Sbjct: 1    MPPNDEVRFTSSPSVKVPRPPQNRQL-------DLPFL-------FARTRNSFAQLIDSL 46

Query: 2246 KTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGA 2067
            KTRS F    PL     L+    Q   V Q  GR   +   +LCS++L+  RS +S++  
Sbjct: 47   KTRSAFAQFPPLKWPPFLSTELNQCIAVTQN-GRSHSLP--ILCSASLSLTRSADSAESE 103

Query: 2066 LEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEV 1887
                     +               L R DESTQ     +QQK     R DEERVLISEV
Sbjct: 104  SRNRNADHSQF-VGKSPLLCSASLSLTRPDESTQ-----SQQKGHSSSRHDEERVLISEV 157

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVRNKDGEELERKDLEAEA+AAL+ACRPNSALT  EVQEDV RI +SGYFCSCMPVAVDT
Sbjct: 158  LVRNKDGEELERKDLEAEALAALKACRPNSALTVSEVQEDVQRIFDSGYFCSCMPVAVDT 217

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQV+PNQEFQGLVCEGANVLP+KF++DAF DG+GKV+N++RL+EVI+SIN WY
Sbjct: 218  RDGIRLIFQVKPNQEFQGLVCEGANVLPAKFIKDAFCDGYGKVINLKRLNEVISSINDWY 277

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            M+RGLF +VS VE LSGG++KLQVSEAEVNNISIRFLDRK+GEPTVGKTKPETILRQLTT
Sbjct: 278  MDRGLFAMVSAVESLSGGVLKLQVSEAEVNNISIRFLDRKTGEPTVGKTKPETILRQLTT 337

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+LQGKRDVETVLTMG+MEDVSIIPQPA D GKVD+TMNVVER           
Sbjct: 338  KKGQVYSMLQGKRDVETVLTMGLMEDVSIIPQPA-DAGKVDITMNVVERPSGGFSAGGGI 396

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRN+FGRNQKL++SLERGQIDSIFR+NY+DPWI GDD RTS
Sbjct: 397  SSGITSGPLSGLIGSFAYSHRNLFGRNQKLHVSLERGQIDSIFRINYSDPWIAGDDMRTS 456

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+IM+QNSRTPGTL+HG+Q + SNLTIGR+TAGIEFSRP  PK SGTAGLIFQ AGARDE
Sbjct: 457  RTIMVQNSRTPGTLIHGNQQDGSNLTIGRITAGIEFSRPIRPKLSGTAGLIFQHAGARDE 516

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            +G+PIIKD++SSPLTASGN +DD LLAK+E VYTGSG HGS M V NMEQGLPVLPEWL 
Sbjct: 517  RGNPIIKDFFSSPLTASGNNHDDMLLAKLESVYTGSGDHGSSMLVLNMEQGLPVLPEWLV 576

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNR+NAR R+ +E+GPA FLLS SGGHVVGNF P+EAFAIGGTNSVRGYEE         
Sbjct: 577  FNRINARARKDLELGPARFLLSLSGGHVVGNFPPHEAFAIGGTNSVRGYEEGAVGSGRSY 636

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GEISFP+IGPV G +FADYGTDLGSG TVPGDPAGAR KPGSGYG GFGIR+DSPL
Sbjct: 637  TVGSGEISFPVIGPVGGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYGYGFGIRLDSPL 696

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+  KRFHF VGHRN
Sbjct: 697  GPLRLEYAFNDKHTKRFHFGVGHRN 721


>ref|XP_006287138.1| hypothetical protein CARUB_v10000309mg [Capsella rubella]
            gi|482555844|gb|EOA20036.1| hypothetical protein
            CARUB_v10000309mg [Capsella rubella]
          Length = 735

 Score =  936 bits (2419), Expect = 0.0
 Identities = 488/746 (65%), Positives = 561/746 (75%), Gaps = 5/746 (0%)
 Frame = -2

Query: 2414 NDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFKTRS 2235
            +D V F+ SS+++   +   H    P  ++L  CS+ L S L+ T+HSL ++    K R 
Sbjct: 5    HDDVHFSSSSIRIHSPSFKEH----PLLTNLQSCSKTLVSQLSNTRHSLNRVFELIKNR- 59

Query: 2234 IFHHRSPLFS----FARLNESTRQDDGVAQRR-GREGRVKPTLLCSSTLAWNRSEESSQG 2070
               H  P F+      R N  T+    V Q   G+   +  +L+ S+ L W+ S      
Sbjct: 60   ---HSPPRFTQTRPVRRSNSHTQILSSVTQLMIGKSSPISLSLIQSTQLNWSNSGV---- 112

Query: 2069 ALEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISE 1890
               E   T +                   S++S + +  + QQK     R  EERVLISE
Sbjct: 113  ---EDIETTRGLSSPLLCCASLSLTRPNESNQSVEGKDMIQQQKGHSVSRNAEERVLISE 169

Query: 1889 VLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVD 1710
            VLVR KDGEELERKDLE EA+AAL+ACR NSALT REVQEDVHRII SGYFCSC PVAVD
Sbjct: 170  VLVRTKDGEELERKDLEIEALAALKACRANSALTIREVQEDVHRIIESGYFCSCTPVAVD 229

Query: 1709 TRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGW 1530
            TRDGIRL+FQVEPNQEF+GLVCE ANVLPSKF+++AFRDG GKV+NI+RL+E ITSINGW
Sbjct: 230  TRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFRDGFGKVINIKRLEEAITSINGW 289

Query: 1529 YMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLT 1350
            YMERGLFG+VSD++ LSGGI++LQV+EAEVNNISIRFLDRK+GEPT GKT PETILRQLT
Sbjct: 290  YMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSPETILRQLT 349

Query: 1349 TKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXX 1170
            TKKGQVYS+LQGKRDV+TVL MGIMEDVSIIPQPAGD+GKVDL MN VER          
Sbjct: 350  TKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCVERPSGGFSAGGG 409

Query: 1169 XXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRT 990
                           SFAYSHRN+FGRNQKLN+SLERGQIDSIFR+NYTDPWIEGDDKRT
Sbjct: 410  ISSGITSGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDPWIEGDDKRT 469

Query: 989  SRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARD 810
            SRSIM+QNSRTPG LVHG+QP+ S+LTIGRVTAG+E+SRPF PKWSGTAGLIFQ AGARD
Sbjct: 470  SRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPFRPKWSGTAGLIFQHAGARD 529

Query: 809  EKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWL 630
            E+G+PIIKD+YSSPLTASG T+D+TLLAK+E +YTGSG  GS MF FNMEQGLPVLPEWL
Sbjct: 530  EQGNPIIKDFYSSPLTASGKTHDETLLAKLESIYTGSGDRGSTMFAFNMEQGLPVLPEWL 589

Query: 629  FFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXX 450
             FNRV AR R+G+ IGP  FL S SGGHVVGNF+P+EAF IGGTNSVRGYEE        
Sbjct: 590  CFNRVTARARKGIHIGPGRFLFSLSGGHVVGNFSPHEAFGIGGTNSVRGYEEGAVGSGRS 649

Query: 449  XXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSP 270
                 GE+SFP+ GPVEG +F DYGTD+GSGSTVPGDPAGAR KPGSGYG G G+RVDSP
Sbjct: 650  YVVGSGEMSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGARLKPGSGYGYGLGVRVDSP 709

Query: 269  LGPLRLEYAFNDRQAKRFHFAVGHRN 192
            LGPLRLEYAFND+QA RFHF VG RN
Sbjct: 710  LGPLRLEYAFNDQQAGRFHFGVGLRN 735


>ref|XP_007014984.1| Outer envelope protein of 80 kDa isoform 1 [Theobroma cacao]
            gi|508785347|gb|EOY32603.1| Outer envelope protein of 80
            kDa isoform 1 [Theobroma cacao]
          Length = 755

 Score =  935 bits (2416), Expect = 0.0
 Identities = 504/735 (68%), Positives = 559/735 (76%), Gaps = 1/735 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            M  NDGV FT SSLK+P          +P+SS     SQ L S L +T HS+ QLI S +
Sbjct: 1    MHPNDGVSFTSSSLKIP----------LPSSSPS--LSQALASQLARTGHSVFQLIDSLR 48

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
             RS +  R+PL   +R  EST+ D G++       R  P LL S +L+  RS + +Q   
Sbjct: 49   NRSNYV-RNPL---SRSTESTQSDLGISSLF----RSSP-LLFSLSLSLTRSTDPTQN-- 97

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRH-GREDEERVLISEV 1887
                       +               S +STQ    + Q+  S   GR DEERVLISEV
Sbjct: 98   -------HNIAKSPLLCSASLSLTQPASTDSTQSGSELPQKGQSATAGRHDEERVLISEV 150

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVRNKDGEELE KDLE EA+ AL+ACR NSALT REVQEDVHRII+SGYF SCMPVAVDT
Sbjct: 151  LVRNKDGEELEMKDLEMEALTALKACRANSALTVREVQEDVHRIIDSGYFSSCMPVAVDT 210

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQVEPNQEF GLVCEGANVLPSKFLEDAFRDGHGKVVN++RLDEVI SINGWY
Sbjct: 211  RDGIRLVFQVEPNQEFHGLVCEGANVLPSKFLEDAFRDGHGKVVNLKRLDEVINSINGWY 270

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            MERGLFGLVS V+ILSGGII+LQV+EAEVNNISIRFLDRK+GEP  GKTKPETILRQLTT
Sbjct: 271  MERGLFGLVSGVDILSGGIIRLQVAEAEVNNISIRFLDRKTGEPCKGKTKPETILRQLTT 330

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+LQGKRDV+TV TMG+MEDVSIIPQPAGD GKVDL MNVVER           
Sbjct: 331  KKGQVYSMLQGKRDVDTVSTMGLMEDVSIIPQPAGDAGKVDLIMNVVERPSGGFSAGGGI 390

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRN+FGRNQKLNISLERGQIDSIFR+NYTDPWIEGDDKRTS
Sbjct: 391  SSGITSGPLSGLIGSFAYSHRNLFGRNQKLNISLERGQIDSIFRINYTDPWIEGDDKRTS 450

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+I++QNSRTPGTLVHG+  + S+L+IGRVTAG+EFSRP  PKW+GTAGLIFQ AGARDE
Sbjct: 451  RTIIVQNSRTPGTLVHGNLHDNSSLSIGRVTAGVEFSRPIRPKWNGTAGLIFQHAGARDE 510

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            KG+PIIKD+Y SPLTASG   DD LLAK E VYTGSG  GS MF FNMEQGLPV+PEWLF
Sbjct: 511  KGNPIIKDFYGSPLTASGKPYDDMLLAKFESVYTGSGDQGSSMFAFNMEQGLPVMPEWLF 570

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNRVNAR R+GVEIGPA  LLS SGGHVVGNF+P+EAFAIGGTNSVRGYEE         
Sbjct: 571  FNRVNARARKGVEIGPARLLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSY 630

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                 E+SFP++GPVEG +FADYG DL SG  VPGDPAGAR KPGSGYG GFGIRV+SPL
Sbjct: 631  VVGSSEVSFPMVGPVEGVMFADYGHDLWSGPNVPGDPAGARFKPGSGYGYGFGIRVESPL 690

Query: 266  GPLRLEYAFNDRQAK 222
            GPLRLEYAFNDRQAK
Sbjct: 691  GPLRLEYAFNDRQAK 705


>ref|XP_003542049.2| PREDICTED: outer envelope protein 80, chloroplastic-like isoform X1
            [Glycine max]
          Length = 685

 Score =  927 bits (2395), Expect = 0.0
 Identities = 499/745 (66%), Positives = 556/745 (74%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHST-PIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            M RND VR   SS+K+P  + S H T P+ T+           SH+    +S+ QLI SF
Sbjct: 9    MLRNDDVRIVSSSIKIPLPSISKHPTCPLRTAH----------SHIANATNSIAQLINSF 58

Query: 2246 KTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGA 2067
             + S           A L  S  Q              K +LLCS+TL+           
Sbjct: 59   TSHS-----------AELTRSVIQ--------------KSSLLCSATLSLT--------- 84

Query: 2066 LEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEV 1887
                G  ++K                IR   S  L    AQQK     R++EERVLISEV
Sbjct: 85   ----GDRKRK--------------CPIRRLASLSL-AEEAQQK----ARQNEERVLISEV 121

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVRNKDGEELERKDLEAEA  AL+ACRPNSALT REVQEDVHRIINSGYF SCMPVAVDT
Sbjct: 122  LVRNKDGEELERKDLEAEAAQALKACRPNSALTVREVQEDVHRIINSGYFSSCMPVAVDT 181

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQVEPNQEFQGLVCEGANVLP+KFLED+ RDG+GK++N+RRLDE I+SIN WY
Sbjct: 182  RDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLEDSMRDGYGKIINLRRLDEAISSINNWY 241

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            MERGLF +VS VEILSGGI++LQVSEAEV+NISIRFLDRK+GE T+GKTKPETILRQ+TT
Sbjct: 242  MERGLFAMVSAVEILSGGILRLQVSEAEVDNISIRFLDRKTGETTMGKTKPETILRQITT 301

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+L+GKRDVETVLTMGIMEDVSIIPQPA DTGKVDL MNVVER           
Sbjct: 302  KKGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVERPSGGFSAGGGI 360

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRNVFG+NQKLNISLERGQIDS++R+NYTDPWI+GDDKRTS
Sbjct: 361  SSGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTDPWIQGDDKRTS 420

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+IMIQNSRTPGT+VHG+     +LTIGR+T GIEFSRP  PKWSGTAGL+FQ AG RDE
Sbjct: 421  RTIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTAGLVFQHAGVRDE 480

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            KG PIIKD YSSPLTASGNT+DDTLLAK+E VYTGSG HGS +FV NME+GLP+LPEWL 
Sbjct: 481  KGIPIIKDCYSSPLTASGNTHDDTLLAKLETVYTGSGDHGSSLFVLNMEKGLPLLPEWLS 540

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            F RVNAR R+GVEIGPA   LS SGGHVVGNF+PYEAFAIGGTNSVRGYEE         
Sbjct: 541  FTRVNARARKGVEIGPARLHLSISGGHVVGNFSPYEAFAIGGTNSVRGYEEGSVGSGRSY 600

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GEISFP+ GPVEG +F+DYGTDLGSG TVPGDPAGAR KPGSGYG GFGIRV+SPL
Sbjct: 601  IVGSGEISFPMYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGYGYGFGIRVESPL 660

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+Q KRFHF VGHRN
Sbjct: 661  GPLRLEYAFNDKQDKRFHFGVGHRN 685


>ref|XP_002304768.1| hypothetical protein POPTR_0003s20390g [Populus trichocarpa]
            gi|222842200|gb|EEE79747.1| hypothetical protein
            POPTR_0003s20390g [Populus trichocarpa]
          Length = 682

 Score =  927 bits (2395), Expect = 0.0
 Identities = 502/754 (66%), Positives = 561/754 (74%), Gaps = 10/754 (1%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLK----LPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLI 2256
            M +ND V FT S+LK    L H T+ S          L F SQ + + LT         +
Sbjct: 1    MIKNDDVSFTSSALKIAPFLHHQTKPS----------LPFFSQFVQTKLT--------FL 42

Query: 2255 TSFKTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESS 2076
             S  TR+ F + SPL   A L+ +     G   +          +LCS++L+ ++S+   
Sbjct: 43   DSLLTRTRFPN-SPLLCSASLSLTRPSSPGPDPK-------SLPILCSASLSLSQSQLR- 93

Query: 2075 QGALEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSR----HG--RED 1914
                                             +STQ +  VAQQK       HG  R D
Sbjct: 94   ---------------------------------DSTQSDSVVAQQKSGGASGVHGPSRYD 120

Query: 1913 EERVLISEVLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFC 1734
            EERVLISEVLVRNKDGEELERKDLEAEA+AAL+ACR NSALT REVQEDVHR+I+SGYFC
Sbjct: 121  EERVLISEVLVRNKDGEELERKDLEAEALAALKACRANSALTVREVQEDVHRVISSGYFC 180

Query: 1733 SCMPVAVDTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDE 1554
            SCMPVAVDTRDGIRL+FQVEPNQEF GLVCEGA+VLP+KFL+DAFR G+GKVVNI++LDE
Sbjct: 181  SCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGASVLPTKFLQDAFRGGYGKVVNIKQLDE 240

Query: 1553 VITSINGWYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKP 1374
            VI+SIN WYMERGLFG+VS+ EILSGGII+LQ++EAEVN+ISIRFLDRK+GEPT GKTKP
Sbjct: 241  VISSINSWYMERGLFGMVSNAEILSGGIIRLQIAEAEVNDISIRFLDRKTGEPTKGKTKP 300

Query: 1373 ETILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVX 1194
            ETILRQLTTKKGQVYS+LQGKRDV+TVLTMGIMEDVS IPQPA DTGKVDL MNVVER  
Sbjct: 301  ETILRQLTTKKGQVYSMLQGKRDVDTVLTMGIMEDVSFIPQPAEDTGKVDLIMNVVER-- 358

Query: 1193 XXXXXXXXXXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPW 1014
                                    FAYSHRNVFGRNQKLNISLERGQIDSIFR+NYTDPW
Sbjct: 359  ----------PNGGFSAGGGISSGFAYSHRNVFGRNQKLNISLERGQIDSIFRINYTDPW 408

Query: 1013 IEGDDKRTSRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLI 834
            IEGDDKRTSR+IM+QNSRTPG LVHG+QP  ++LTIGRV AGIEFSRP  PKWSGT GLI
Sbjct: 409  IEGDDKRTSRTIMVQNSRTPGNLVHGNQPVNNSLTIGRVAAGIEFSRPLRPKWSGTVGLI 468

Query: 833  FQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQG 654
            FQ AGAR+EKG P IKD+Y+SPLTASG  +DD LLAK E VYTGSG HGS MFVFNMEQG
Sbjct: 469  FQHAGARNEKGDPKIKDHYNSPLTASGKNHDDMLLAKFESVYTGSGDHGSSMFVFNMEQG 528

Query: 653  LPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEE 474
            LP+ PEWLFFNRVN R R+GVEIGPAL LLS SGGHV+GNF+P+EAFAIGGTNSVRGYEE
Sbjct: 529  LPLWPEWLFFNRVNTRARKGVEIGPALCLLSLSGGHVMGNFSPHEAFAIGGTNSVRGYEE 588

Query: 473  XXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCG 294
                         GEISFP++GPVEG  FADYGTDLGSG +VPGDPAGAR KPGSGYG G
Sbjct: 589  GAVGSGRSYAVGSGEISFPVLGPVEGVFFADYGTDLGSGPSVPGDPAGARLKPGSGYGYG 648

Query: 293  FGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            FGIRVDSPLGPLRLEYAFNDR  KRFHF VGHRN
Sbjct: 649  FGIRVDSPLGPLRLEYAFNDRHTKRFHFGVGHRN 682


>gb|EXB93281.1| Outer envelope protein 80 [Morus notabilis]
          Length = 729

 Score =  926 bits (2393), Expect = 0.0
 Identities = 501/772 (64%), Positives = 560/772 (72%), Gaps = 28/772 (3%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            MP ND + FT  SLKLP  +        P   DLS          T+T++S  QL  S K
Sbjct: 1    MPANDDISFT--SLKLPFPSP-------PPPFDLS-------PLFTRTRNSFAQLFDSVK 44

Query: 2243 TRSIFHHRSPLF--------------------------SFARLNESTRQDDGVAQRRGRE 2142
            TRS     +P F                               + S  + D  +++  +E
Sbjct: 45   TRSGIERLAPKFLPLPSRPLFRNRFTAGGLASRCPKLPPLCWASLSLTRSDSESRKEDKE 104

Query: 2141 GRV-KPTLLCSSTLAWNRSEESSQGALEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQ 1965
              + K +LLCS++L+  R ++S+Q  LE   MT                           
Sbjct: 105  LPIGKSSLLCSASLSLTRPDDSTQSGLERREMTASAAAPQ-------------------- 144

Query: 1964 LEGGVAQQKLSRHGREDEERVLISEVLVRNKDGEELERKDLEAEAVAALRACRPNSALTA 1785
                  QQK     R DEERVLISEVLVRNKDG+ELERKDLE EA+AAL+ACRPNSALT 
Sbjct: 145  ------QQKGHSASRHDEERVLISEVLVRNKDGDELERKDLEMEALAALKACRPNSALTV 198

Query: 1784 REVQEDVHRIINSGYFCSCMPVAVDTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLED 1605
            REVQEDVHR+I SGYFCSCMPVAVDTRDGIRL+FQVEPNQEFQGLVCEGANVLP+KFLED
Sbjct: 199  REVQEDVHRVIGSGYFCSCMPVAVDTRDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLED 258

Query: 1604 AFRDGHGKVVNIRRLDEVITSINGWYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISI 1425
            +FRDG GKV+N+RRLD+ ITSIN WYMERGLF +VS VEILSGGI++LQVSEAEVNNISI
Sbjct: 259  SFRDGCGKVINLRRLDKAITSINDWYMERGLFAMVSAVEILSGGILRLQVSEAEVNNISI 318

Query: 1424 RFLDRKSGEPTVGKTKPETILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPA 1245
            RFLDRKSGEPT GKT+PETILRQLTTKKGQVYS+LQGKRDVETVLTMGIMEDVSIIPQPA
Sbjct: 319  RFLDRKSGEPTSGKTQPETILRQLTTKKGQVYSMLQGKRDVETVLTMGIMEDVSIIPQPA 378

Query: 1244 GDTGKVDLTMNVVERVXXXXXXXXXXXXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISL 1065
             DTGKVD+ MNVVER                         SFAYSHRN+FGRNQKL++SL
Sbjct: 379  -DTGKVDMVMNVVERPSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNLFGRNQKLHVSL 437

Query: 1064 ERGQIDSIFRVNYTDPWIEGDDKRTSRSIMIQNSRTPGTLVHGS-QPEQSNLTIGRVTAG 888
            ERGQIDSIFR+N TDPWI GDDKRTSR+IM+QNSRTPGTLVHG  Q E  + TIGRVTAG
Sbjct: 438  ERGQIDSIFRINCTDPWIAGDDKRTSRTIMVQNSRTPGTLVHGKVQDEDISPTIGRVTAG 497

Query: 887  IEFSRPFMPKWSGTAGLIFQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVY 708
            +EFS+P  PKWSGTAGLIFQ AGAR+EKG PIIKD + SPLTASG T+DDTLLAK+E VY
Sbjct: 498  VEFSQPLRPKWSGTAGLIFQHAGARNEKGEPIIKDCFGSPLTASGKTHDDTLLAKLETVY 557

Query: 707  TGSGAHGSPMFVFNMEQGLPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFA 528
            TGSG HGS MFVFN+EQGLPVLPEWLFFNRVNAR R+ +EIGPA  L S SGGHVVGNF+
Sbjct: 558  TGSGDHGSSMFVFNVEQGLPVLPEWLFFNRVNARARKDIEIGPARILFSLSGGHVVGNFS 617

Query: 527  PYEAFAIGGTNSVRGYEEXXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTV 348
            P+EAF IGGTNSVRGYEE             GEISFP++GPV G +FADYGTDLGSG TV
Sbjct: 618  PHEAFTIGGTNSVRGYEEGAVGSGRSYAVGSGEISFPMVGPVGGVIFADYGTDLGSGPTV 677

Query: 347  PGDPAGARHKPGSGYGCGFGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            PGDPAGAR KPGSGYG G GIR+DSPLGPLRLEYAF+D Q KRFHF VGHRN
Sbjct: 678  PGDPAGARLKPGSGYGYGVGIRLDSPLGPLRLEYAFSDSQNKRFHFGVGHRN 729


>ref|XP_003547118.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Glycine
            max]
          Length = 677

 Score =  915 bits (2366), Expect = 0.0
 Identities = 494/745 (66%), Positives = 550/745 (73%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHST-PIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            M RND V    SS+K+P    S   T P+ T+           SH+    +S+ QL+ SF
Sbjct: 1    MFRNDDVCIVSSSIKIPLPYISKRPTCPLRTAH----------SHIANATNSIAQLVNSF 50

Query: 2246 KTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGA 2067
             + S    RS L                          K +LLCS+TL       S  G 
Sbjct: 51   TSHSTELTRSVL-------------------------QKSSLLCSATL-------SLTGD 78

Query: 2066 LEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEV 1887
            LE                        IR   S  L    AQQK     R++EERVLISEV
Sbjct: 79   LER--------------------KCPIRRLASLSL-AEEAQQK----ARQNEERVLISEV 113

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            LVRNKDGEELERKDLEAEA  AL+ACRPNSALT REVQEDVHRIINSGYF SCMPVAVDT
Sbjct: 114  LVRNKDGEELERKDLEAEAAQALKACRPNSALTVREVQEDVHRIINSGYFSSCMPVAVDT 173

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQVEPNQEFQGLVCEGANVLP+KFLED+ RDG+GK++N+RRLDE ++SIN WY
Sbjct: 174  RDGIRLVFQVEPNQEFQGLVCEGANVLPAKFLEDSMRDGYGKIINLRRLDEALSSINNWY 233

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            MERGLF +VS VEILSGGI++LQVSEAEV+NISIRFLDRK+GE T+GKTKPETILRQ+TT
Sbjct: 234  MERGLFAMVSAVEILSGGILRLQVSEAEVDNISIRFLDRKTGETTMGKTKPETILRQITT 293

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+L+GKRDVETVLTMGIMEDVSIIPQPA DTGKVDL MNVVER           
Sbjct: 294  KKGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVERPSGGFSAGGGI 352

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRNVFG+NQKLNISLERGQIDS++R+NYTDPWI+GDDKRTS
Sbjct: 353  SSGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTDPWIQGDDKRTS 412

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+IMIQNSRTPGT+VHG+     +LTIGR+T GIEFSRP  PKWSGT GL+FQ AG RDE
Sbjct: 413  RTIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTVGLVFQHAGVRDE 472

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            +G PIIKD YSSPLTASGNT+DDTLLAK+E VYTGSG HGS MFV NME+GLP+LPEWL 
Sbjct: 473  QGIPIIKDCYSSPLTASGNTHDDTLLAKLETVYTGSGDHGSSMFVLNMEKGLPLLPEWLS 532

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            F RVNAR R+GVEIGPA   LS SGGHVVGNF+PYEAFAIGGTNSVRGYEE         
Sbjct: 533  FTRVNARARKGVEIGPARLHLSISGGHVVGNFSPYEAFAIGGTNSVRGYEEGSVGSGRSY 592

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GE+SFP+ GPVEG +F+DYGTDLGSG TVPGDPAGAR KPGSGYG GFGIRV+SPL
Sbjct: 593  VVGSGEVSFPVYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGYGYGFGIRVESPL 652

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+Q KRFHF VGHRN
Sbjct: 653  GPLRLEYAFNDKQDKRFHFGVGHRN 677


>ref|XP_006437641.1| hypothetical protein CICLE_v10030987mg [Citrus clementina]
            gi|557539837|gb|ESR50881.1| hypothetical protein
            CICLE_v10030987mg [Citrus clementina]
          Length = 612

 Score =  912 bits (2357), Expect = 0.0
 Identities = 468/598 (78%), Positives = 506/598 (84%)
 Frame = -2

Query: 1985 RSDESTQLEGGVAQQKLSRHGREDEERVLISEVLVRNKDGEELERKDLEAEAVAALRACR 1806
            +S+ STQL+   AQQ  S   R DEERVLISEVLVRNKDGEELERKDLE EA+ AL+ACR
Sbjct: 31   QSELSTQLQQK-AQQPHSV-SRSDEERVLISEVLVRNKDGEELERKDLETEALTALKACR 88

Query: 1805 PNSALTAREVQEDVHRIINSGYFCSCMPVAVDTRDGIRLLFQVEPNQEFQGLVCEGANVL 1626
             NSALT REVQEDVHRII+SGYFCSCMPVAVDTRDGIRL+FQVEPNQEF GLVCEGANVL
Sbjct: 89   ANSALTVREVQEDVHRIIDSGYFCSCMPVAVDTRDGIRLVFQVEPNQEFHGLVCEGANVL 148

Query: 1625 PSKFLEDAFRDGHGKVVNIRRLDEVITSINGWYMERGLFGLVSDVEILSGGIIKLQVSEA 1446
            P+KF+EDAFRDG+GKVVNIRRLDEVITSINGWYMERGLFG+VS VEILSGGII+LQV+EA
Sbjct: 149  PTKFVEDAFRDGYGKVVNIRRLDEVITSINGWYMERGLFGMVSGVEILSGGIIRLQVAEA 208

Query: 1445 EVNNISIRFLDRKSGEPTVGKTKPETILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDV 1266
            EVNNISIRFLDRK+GEPT GKT+PETILRQLTTKKGQVYS+LQGKRDVETVLTMGIMEDV
Sbjct: 209  EVNNISIRFLDRKTGEPTKGKTRPETILRQLTTKKGQVYSMLQGKRDVETVLTMGIMEDV 268

Query: 1265 SIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXXXXXXXXXXXXXXXXSFAYSHRNVFGRN 1086
            SIIPQPAGDTGKVDL MNVVER                         SFAYSHRNVFGRN
Sbjct: 269  SIIPQPAGDTGKVDLIMNVVERPSGGFSAGGGISSGITSGPLSGLIGSFAYSHRNVFGRN 328

Query: 1085 QKLNISLERGQIDSIFRVNYTDPWIEGDDKRTSRSIMIQNSRTPGTLVHGSQPEQSNLTI 906
            QKLNISLERGQIDSIFR+NYTDPWIEGDDKRTSR+IM+QNSRTPGT VHG+QP+ S+LTI
Sbjct: 329  QKLNISLERGQIDSIFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTHVHGNQPDNSSLTI 388

Query: 905  GRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLA 726
            GRVTAG+EFSRP  PKWSGT GLIFQ +GARDEKG+PIIKD+YSSPLTASG TND+ L+A
Sbjct: 389  GRVTAGMEFSRPIRPKWSGTVGLIFQHSGARDEKGNPIIKDFYSSPLTASGKTNDEMLIA 448

Query: 725  KIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGH 546
            K E VYTGSG  GS M              WLFFNRVNAR R+GVEIGPA  LLS SGGH
Sbjct: 449  KFESVYTGSGDQGSSM--------------WLFFNRVNARARKGVEIGPARLLLSLSGGH 494

Query: 545  VVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDL 366
            VVGNF+P+EAFAIGGTNSVRGYEE             GEISFP++GPVEG +F+DYGTDL
Sbjct: 495  VVGNFSPHEAFAIGGTNSVRGYEEGAVGSGRSYVVGSGEISFPMLGPVEGVIFSDYGTDL 554

Query: 365  GSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            GSG +VPGDPAGAR KPGSGYG GFGIRVDSPLGPLRLEYAFND+QAKRFHF VG+RN
Sbjct: 555  GSGPSVPGDPAGARLKPGSGYGYGFGIRVDSPLGPLRLEYAFNDKQAKRFHFGVGYRN 612


>ref|XP_007150381.1| hypothetical protein PHAVU_005G148500g [Phaseolus vulgaris]
            gi|561023645|gb|ESW22375.1| hypothetical protein
            PHAVU_005G148500g [Phaseolus vulgaris]
          Length = 675

 Score =  911 bits (2354), Expect = 0.0
 Identities = 486/744 (65%), Positives = 551/744 (74%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            M RND VR   S++K+P  ++   + P+ T+           SH+    +S+ QL+ SF 
Sbjct: 1    MLRNDDVRVVSSAIKIPLPSKRP-TCPMRTAH----------SHIANATNSIAQLVNSFA 49

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
            + S    RS L                          K +LLCS+TL+     + +    
Sbjct: 50   SHSTEFTRSVL-------------------------QKSSLLCSATLSLTGDRKRA---- 80

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEVL 1884
                                     IR   S  L    AQQK     R++EERVLISEVL
Sbjct: 81   -----------------------CPIRRMASLSLSEE-AQQK----ARQNEERVLISEVL 112

Query: 1883 VRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDTR 1704
            VRNKDGEE+ERKDLEAEAV AL+ACRPNSALT REVQEDVHRIINSGYF SCMPVAVDTR
Sbjct: 113  VRNKDGEEMERKDLEAEAVQALKACRPNSALTVREVQEDVHRIINSGYFSSCMPVAVDTR 172

Query: 1703 DGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWYM 1524
            DGIRL+FQVEPNQEFQGLVCEGANVLP+KFLE++ RDG+GK++N+RRLDE I+SIN WYM
Sbjct: 173  DGIRLVFQVEPNQEFQGLVCEGANVLPAKFLENSMRDGYGKIINLRRLDEAISSINNWYM 232

Query: 1523 ERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTTK 1344
            ERGLF +VS VEILSGGI++LQVSEAEVNNISIRFLDRK+GE T+GKTKPETILRQ+TTK
Sbjct: 233  ERGLFAMVSAVEILSGGILRLQVSEAEVNNISIRFLDRKTGEITMGKTKPETILRQITTK 292

Query: 1343 KGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXXX 1164
            KGQVYS+L+GKRDVETVLTMGIMEDVSIIPQP  DTGKVDL MNVVER            
Sbjct: 293  KGQVYSMLEGKRDVETVLTMGIMEDVSIIPQPE-DTGKVDLVMNVVERPSGGFSAGGGIS 351

Query: 1163 XXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTSR 984
                         SFAYSHRNVFG+NQKLNISLERGQIDS++R+NYTDPWI+GDD+RTSR
Sbjct: 352  SGITNGPLRGLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTDPWIQGDDRRTSR 411

Query: 983  SIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDEK 804
            +IMIQNSRTPGT+VHG+     +LTIGR+T GIEFSRP  PKWSGTAGL+FQ AG RDEK
Sbjct: 412  TIMIQNSRTPGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTAGLVFQHAGVRDEK 471

Query: 803  GSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLFF 624
            G PIIKD +SSPLTASGNT+D+TLLAK+E VYTGSG HGS MFV NME+GLP+LPEWL F
Sbjct: 472  GIPIIKDCFSSPLTASGNTHDETLLAKLETVYTGSGDHGSSMFVLNMEKGLPLLPEWLSF 531

Query: 623  NRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXXX 444
             RVNAR R+GVEIGPA   LS SGGHVVGNF PYEAFAIGGTNSVRGYEE          
Sbjct: 532  TRVNARARKGVEIGPARLHLSISGGHVVGNFPPYEAFAIGGTNSVRGYEEGSVGSGRSYV 591

Query: 443  XXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPLG 264
               GEISFP+ GPVEG +F+DYGTDLGSG TVPGDPAGAR KPGSGYG GFGIRV+SPLG
Sbjct: 592  VGSGEISFPMYGPVEGVIFSDYGTDLGSGPTVPGDPAGARKKPGSGYGYGFGIRVESPLG 651

Query: 263  PLRLEYAFNDRQAKRFHFAVGHRN 192
            PLRLEYAFND++ +RFHF VGHRN
Sbjct: 652  PLRLEYAFNDKKERRFHFGVGHRN 675


>ref|XP_003597441.1| Outer envelope protein of 80 kDa [Medicago truncatula]
            gi|355486489|gb|AES67692.1| Outer envelope protein of 80
            kDa [Medicago truncatula]
          Length = 672

 Score =  905 bits (2339), Expect = 0.0
 Identities = 486/744 (65%), Positives = 546/744 (73%)
 Frame = -2

Query: 2423 MPRNDGVRFTPSSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSFK 2244
            MP+ND +RF  SS+K+P  +    S P PTS       + L SH T   +S   LI SF 
Sbjct: 1    MPQNDDIRFISSSIKIPLPS----SKPKPTSP-----FKTLHSHFTNATNSFSHLIHSFT 51

Query: 2243 TRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGAL 2064
            T S            +L  S  Q              K   LCS++L+ N +  S   +L
Sbjct: 52   THS-----------TQLTRSVLQ--------------KSHSLCSTSLSLNAANRSPPLSL 86

Query: 2063 EEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEVL 1884
                                       S E TQL+            R++EERVLISEVL
Sbjct: 87   S--------------------------SAEETQLKT-----------RQNEERVLISEVL 109

Query: 1883 VRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDTR 1704
            VRNKDGEELERKDLEAEA  AL+ACRPNSALT REVQ+DVHRIINSGYFCSC+PVAVDTR
Sbjct: 110  VRNKDGEELERKDLEAEAAQALKACRPNSALTVREVQDDVHRIINSGYFCSCVPVAVDTR 169

Query: 1703 DGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWYM 1524
            DGIRL+FQVEPNQEFQGLVCEGANV+P+KFLE++FR+G+GKV+N+RRLDE I+SIN WYM
Sbjct: 170  DGIRLVFQVEPNQEFQGLVCEGANVIPAKFLENSFRNGYGKVINLRRLDEAISSINDWYM 229

Query: 1523 ERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTTK 1344
            ERGLF +VS VEILSGGI++LQVSEAEVNNISIRFLDRK+GE TVGKTKPETILRQ+TTK
Sbjct: 230  ERGLFAMVSAVEILSGGILRLQVSEAEVNNISIRFLDRKTGETTVGKTKPETILRQITTK 289

Query: 1343 KGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXXX 1164
            KGQVYS+ QGKRDVETVLTMGIMEDVSIIPQPA DTGKVDL MNVVER            
Sbjct: 290  KGQVYSMHQGKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVERPSGGFSAGGGIS 348

Query: 1163 XXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTSR 984
                         SFAYSHRNVFGRNQKLN+SLERGQ+D I R NYTDPWI+GDDKRTS 
Sbjct: 349  SGITSGPLRGLIGSFAYSHRNVFGRNQKLNVSLERGQVDLIVRANYTDPWIQGDDKRTSG 408

Query: 983  SIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDEK 804
            +IM+QNSRTPGT+VHG+    S+LTIGR+T G+E SRP  PKWSGTAGLIFQRAG  D  
Sbjct: 409  TIMVQNSRTPGTIVHGNLDGNSSLTIGRITGGVELSRPIRPKWSGTAGLIFQRAGVCDNN 468

Query: 803  GSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLFF 624
            G PII+D Y+SPLTASGNT+DDTLL KIE VYTGSG HGS MFV NMEQGLP+LP+WL F
Sbjct: 469  GVPIIRDRYNSPLTASGNTHDDTLLGKIETVYTGSGEHGSSMFVLNMEQGLPLLPDWLSF 528

Query: 623  NRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXXX 444
             RVNAR R+GVEIGP    LS SGGHVVGNF+PYEAFAIGGTNSVRGYEE          
Sbjct: 529  TRVNARARKGVEIGPTRLNLSLSGGHVVGNFSPYEAFAIGGTNSVRGYEEGGVGSGRSYV 588

Query: 443  XXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPLG 264
               GEISFP++ PVE  +F+DYGTDLGSGSTVPGDPAGAR+KPGSGYG G GIRVDSPLG
Sbjct: 589  VGSGEISFPMMKPVECVIFSDYGTDLGSGSTVPGDPAGARNKPGSGYGYGLGIRVDSPLG 648

Query: 263  PLRLEYAFNDRQAKRFHFAVGHRN 192
            PLRLEYAFND++ KRFHF VG+RN
Sbjct: 649  PLRLEYAFNDKKEKRFHFGVGYRN 672


>ref|XP_004296333.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Fragaria
            vesca subsp. vesca]
          Length = 680

 Score =  899 bits (2324), Expect = 0.0
 Identities = 478/745 (64%), Positives = 547/745 (73%), Gaps = 1/745 (0%)
 Frame = -2

Query: 2423 MPRNDGVRFTP-SSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            MP+ND VRF    SLKLPH        P P   DLS             ++SL QLI S 
Sbjct: 1    MPQNDDVRFISFPSLKLPHPPPP----PPPPRFDLSSLF---------ARNSLSQLIDSI 47

Query: 2246 KTRSIFHHRSPLFSFARLNESTRQDDGVAQRRGREGRVKPTLLCSSTLAWNRSEESSQGA 2067
            K+RS    RSP+   A L+   R        R    R  P LLCS++L+ +RS+ES++  
Sbjct: 48   KSRSK-QPRSPILCSASLS-LPRPRRSADDDRSWLVRKSP-LLCSASLSLSRSDESTRSG 104

Query: 2066 LEEGGMTQKKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQKLSRHGREDEERVLISEV 1887
                                                               EERVLISEV
Sbjct: 105  -------------------------------------------------SSEERVLISEV 115

Query: 1886 LVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQEDVHRIINSGYFCSCMPVAVDT 1707
            L+RNKDGEELERKDLE EA+ AL+ACR NSALT REVQEDVHRII+SGYFC CMPVA+DT
Sbjct: 116  LIRNKDGEELERKDLELEALGALKACRANSALTVREVQEDVHRIIDSGYFCQCMPVAIDT 175

Query: 1706 RDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGHGKVVNIRRLDEVITSINGWY 1527
            RDGIRL+FQV+PNQEFQGLVCEGANVLP+KFL+DAF DG+GKV+N++RL+EVITSIN WY
Sbjct: 176  RDGIRLIFQVKPNQEFQGLVCEGANVLPAKFLKDAFYDGYGKVINLKRLNEVITSINDWY 235

Query: 1526 MERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDRKSGEPTVGKTKPETILRQLTT 1347
            M+RGLF +VS VE+LSGGI+KLQVSE EVNNI+IRFLDRK+GEPT+GKTKPETILRQLTT
Sbjct: 236  MDRGLFAMVSAVEVLSGGILKLQVSETEVNNIAIRFLDRKTGEPTIGKTKPETILRQLTT 295

Query: 1346 KKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLTMNVVERVXXXXXXXXXX 1167
            KKGQVYS+LQGKRDVETVLTMG+MEDVSIIPQPAG++GKVD+ MNVVER           
Sbjct: 296  KKGQVYSMLQGKRDVETVLTMGLMEDVSIIPQPAGESGKVDIVMNVVERPSGGFSAGGGI 355

Query: 1166 XXXXXXXXXXXXXXSFAYSHRNVFGRNQKLNISLERGQIDSIFRVNYTDPWIEGDDKRTS 987
                          SFAYSHRN+FGRNQKL++SLERGQIDS+FR+NY+DPWI GDD RTS
Sbjct: 356  SSGITSGPLSGLIGSFAYSHRNLFGRNQKLHVSLERGQIDSLFRINYSDPWISGDDMRTS 415

Query: 986  RSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFSRPFMPKWSGTAGLIFQRAGARDE 807
            R+IM+QNSRTPGTL+HG+Q + SNLTIGR++AGI+FSRP  PKWSGTAGL +Q AGARDE
Sbjct: 416  RTIMVQNSRTPGTLIHGNQLDGSNLTIGRISAGIDFSRPIRPKWSGTAGLTYQHAGARDE 475

Query: 806  KGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSGAHGSPMFVFNMEQGLPVLPEWLF 627
            +GSPIIKD++SSPLTASGN+ D+ LLAK+E VYTGSG  GS M  FNMEQGLPVLP+WLF
Sbjct: 476  EGSPIIKDFFSSPLTASGNSYDEMLLAKLETVYTGSGDRGSSMLKFNMEQGLPVLPDWLF 535

Query: 626  FNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEAFAIGGTNSVRGYEEXXXXXXXXX 447
            FNR NAR R+ +EIG A  L S SGGHV+GNF P+EAF IGGTNSVRGYEE         
Sbjct: 536  FNRTNARARKDLEIGLAHLLFSVSGGHVIGNFPPHEAFVIGGTNSVRGYEEGAVGSGRSY 595

Query: 446  XXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDPAGARHKPGSGYGCGFGIRVDSPL 267
                GEISFPL+GPV G +FADYGTDLGSG TVPGDPAGAR KPGSGYG G GIR+DSPL
Sbjct: 596  AVGSGEISFPLVGPVGGVIFADYGTDLGSGPTVPGDPAGARLKPGSGYGYGLGIRLDSPL 655

Query: 266  GPLRLEYAFNDRQAKRFHFAVGHRN 192
            GPLRLEYAFND+   RFHF VGHRN
Sbjct: 656  GPLRLEYAFNDKGTPRFHFGVGHRN 680


>ref|XP_004142120.1| PREDICTED: outer envelope protein 80, chloroplastic-like [Cucumis
            sativus]
          Length = 757

 Score =  893 bits (2308), Expect = 0.0
 Identities = 487/768 (63%), Positives = 556/768 (72%), Gaps = 24/768 (3%)
 Frame = -2

Query: 2423 MPRNDGVRFTP-SSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            MP ND + FT  S+L++PH   SS      + S   FC + L S   ++  S+   I S 
Sbjct: 1    MPPNDDIVFTSRSTLRIPHFPPSS------SHSSFRFCFRNLASQFDQSCKSISHFIDSV 54

Query: 2246 KTRSIFHHRS--------PLFSFARLNESTRQDDGVAQRRGRE-GRV---KPTLLCSSTL 2103
            K  S   H +        P   F    + T+Q+  +++R     G V   K  L+CS+++
Sbjct: 55   KRGSKLSHFNHSFPHLWPPTLPFCSSKKVTQQESSISRRASWNWGSVFVEKYPLICSASM 114

Query: 2102 AWNRSEESSQGALEEGGMTQ-----KKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQK 1938
            +  +S+ SS+   E+ G  Q       G              L RSDES Q  GG   ++
Sbjct: 115  SLIQSDMSSKSESEDSGKRQGMEDMSTGLVGKSSLLCSASLALTRSDESNQ-SGGSESKE 173

Query: 1937 LSRHG----REDEERVLISEVLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQE 1770
            L + G    R DEERVLISEVLVRNKDGEELERKDLE E   AL+A RPNSALT REVQE
Sbjct: 174  LPQKGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQE 233

Query: 1769 DVHRIINSGYFCSCMPVAVDTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDG 1590
            DVHRIINSGYF SC+PVAVDTRDGIRL+FQVEPNQEFQGLVCEGANVLP+KFLE+AFRDG
Sbjct: 234  DVHRIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDG 293

Query: 1589 HGKVVNIRRLDEVITSINGWYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDR 1410
            +GKVVN+R LDEVI+SINGWY ERGLFG VS V+ILSGGI+ LQVSEAEVNNISIRFLD+
Sbjct: 294  YGKVVNLRHLDEVISSINGWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDK 353

Query: 1409 KSGEPTVGKTKPETILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGK 1230
            K+GEP  G T+PETILRQLTTKKGQVYS+LQGKRD ETVLTMGIMEDVSIIPQPA D GK
Sbjct: 354  KTGEPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGK 413

Query: 1229 VDLTMNVVERVXXXXXXXXXXXXXXXXXXXXXXXXS--FAYSHRNVFGRNQKLNISLERG 1056
            VD+ MNVVER                             AYSHRN+FGRNQKL++SLE+G
Sbjct: 414  VDILMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKG 473

Query: 1055 QIDSIFRVNYTDPWIEGDDKRTSRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFS 876
            Q+DS FR+NYTDPWIEGDDKRTSR++M+QNSRTPGTLVHG     SNLTI RVTAG+EF+
Sbjct: 474  QVDSTFRINYTDPWIEGDDKRTSRTMMVQNSRTPGTLVHGG----SNLTIVRVTAGLEFN 529

Query: 875  RPFMPKWSGTAGLIFQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSG 696
            RP  P WSGTAGL FQRAGA+DEKG PI+KD    PLTASGN  D+ LLAK+EGVYTGSG
Sbjct: 530  RPIRPTWSGTAGLYFQRAGAQDEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSG 589

Query: 695  AHGSPMFVFNMEQGLPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEA 516
             HGS MFV +MEQGLP LPEWL FNRVNAR R G+EIG +  LLS SGGHVVGNF P+EA
Sbjct: 590  DHGSSMFVLSMEQGLPFLPEWLCFNRVNARARTGMEIGFSQLLLSLSGGHVVGNFCPHEA 649

Query: 515  FAIGGTNSVRGYEEXXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDP 336
            FAIGGTNSVRGYEE             GE+SFPL GPVEG  FADYGTDLGSG++V GDP
Sbjct: 650  FAIGGTNSVRGYEEGAVGSGRSYAVGCGELSFPLFGPVEGVFFADYGTDLGSGASVLGDP 709

Query: 335  AGARHKPGSGYGCGFGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            AGAR K GSG+G GFGIR++SPLGPLRLEYAFND+  KRFHF VGHRN
Sbjct: 710  AGARMKTGSGFGYGFGIRLESPLGPLRLEYAFNDKSEKRFHFGVGHRN 757


>ref|XP_004161694.1| PREDICTED: LOW QUALITY PROTEIN: outer envelope protein 80,
            chloroplastic-like [Cucumis sativus]
          Length = 757

 Score =  887 bits (2292), Expect = 0.0
 Identities = 485/768 (63%), Positives = 554/768 (72%), Gaps = 24/768 (3%)
 Frame = -2

Query: 2423 MPRNDGVRFTP-SSLKLPHSTESSHSTPIPTSSDLSFCSQILTSHLTKTKHSLIQLITSF 2247
            MP ND + FT  S+L++PH   SS      + S   FC + L S   ++  S+   I S 
Sbjct: 1    MPPNDDIVFTSRSTLRIPHFPPSS------SHSSFRFCFRNLASQFDQSCKSISHFIDSV 54

Query: 2246 KTRSIFHHRS--------PLFSFARLNESTRQDDGVAQRRGRE-GRV---KPTLLCSSTL 2103
            K  S   H +        P   F    + T+Q+  +++R     G V   K  L+CS+++
Sbjct: 55   KRGSKLSHFNHSFPHLWPPTLPFCSSKKVTQQESSISRRASWNWGSVFVEKYPLICSASM 114

Query: 2102 AWNRSEESSQGALEEGGMTQ-----KKGGQXXXXXXXXXXXXLIRSDESTQLEGGVAQQK 1938
            +  +S+ SS+   E+ G  Q       G              L RSDES Q  GG   ++
Sbjct: 115  SLIQSDMSSKSESEDSGKRQGMEDMSTGLVGKSSLLCSASLALTRSDESNQ-SGGSESKE 173

Query: 1937 LSRHG----REDEERVLISEVLVRNKDGEELERKDLEAEAVAALRACRPNSALTAREVQE 1770
            L + G    R DEERVLISEVLVRNKDGEELERKDLE E   AL+A RPNSALT REVQE
Sbjct: 174  LPQKGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQE 233

Query: 1769 DVHRIINSGYFCSCMPVAVDTRDGIRLLFQVEPNQEFQGLVCEGANVLPSKFLEDAFRDG 1590
            DVHRIINSGYF SC+PVAVDTRDGIRL+FQVEPNQEFQGLVCEGANVLP+KFLE+AFRDG
Sbjct: 234  DVHRIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDG 293

Query: 1589 HGKVVNIRRLDEVITSINGWYMERGLFGLVSDVEILSGGIIKLQVSEAEVNNISIRFLDR 1410
            +GKVVN+R LDEVI+SINGWY ERGLFG VS V+ILSGGI+ LQVSEAEVNNISIRFLD+
Sbjct: 294  YGKVVNLRHLDEVISSINGWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDK 353

Query: 1409 KSGEPTVGKTKPETILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGK 1230
            K+GEP  G T+PETILRQLTTKKGQVYS+LQGKRD ETVLTMGIMEDVSIIPQPA D GK
Sbjct: 354  KTGEPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGK 413

Query: 1229 VDLTMNVVERVXXXXXXXXXXXXXXXXXXXXXXXXS--FAYSHRNVFGRNQKLNISLERG 1056
            VD+ MNVVER                             AYSHRN+FGRNQKL++SLE+G
Sbjct: 414  VDILMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKG 473

Query: 1055 QIDSIFRVNYTDPWIEGDDKRTSRSIMIQNSRTPGTLVHGSQPEQSNLTIGRVTAGIEFS 876
            Q+DS FR+NYTDPWIEGDDKRTSR++M+QNSRTPGTLVHG     SNLTI RVTAG+EF+
Sbjct: 474  QVDSTFRINYTDPWIEGDDKRTSRTMMVQNSRTPGTLVHGG----SNLTIVRVTAGLEFN 529

Query: 875  RPFMPKWSGTAGLIFQRAGARDEKGSPIIKDYYSSPLTASGNTNDDTLLAKIEGVYTGSG 696
            RP  P WSGTAGL FQRAGA+DEKG PI+KD    PLTASGN  D+ LLAK+EGVYTGSG
Sbjct: 530  RPIRPTWSGTAGLYFQRAGAQDEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSG 589

Query: 695  AHGSPMFVFNMEQGLPVLPEWLFFNRVNARFRQGVEIGPALFLLSASGGHVVGNFAPYEA 516
             HGS MFV +MEQGLP LPEWL FNRVNAR R G+EIG +  LLS SGGHVVGNF P+EA
Sbjct: 590  DHGSSMFVLSMEQGLPFLPEWLCFNRVNARARTGMEIGFSQLLLSLSGGHVVGNFCPHEA 649

Query: 515  FAIGGTNSVRGYEEXXXXXXXXXXXXXGEISFPLIGPVEGAVFADYGTDLGSGSTVPGDP 336
            FAIGGTNSVRGYEE             GE+SFPL GPVEG  FADYGTDLGSG++V GDP
Sbjct: 650  FAIGGTNSVRGYEEGAVGSGRSYAVGCGELSFPLFGPVEGVFFADYGTDLGSGASVLGDP 709

Query: 335  AGARHKPGSGYGCGFGIRVDSPLGPLRLEYAFNDRQAKRFHFAVGHRN 192
            AGAR K GSG+G GFGIR++SPLGPLRLEYAFND+  K F F VGHRN
Sbjct: 710  AGARMKTGSGFGYGFGIRLESPLGPLRLEYAFNDKSEKGFXFGVGHRN 757

