BLASTX nr result

ID: Sinomenium21_contig00001190 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00001190
         (2306 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007016346.1| Translocon at the outer envelope membrane of...  1201   0.0  
ref|XP_002280661.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1170   0.0  
emb|CAN81047.1| hypothetical protein VITISV_006765 [Vitis vinifera]  1166   0.0  
ref|XP_006290594.1| hypothetical protein CARUB_v10016683mg, part...  1162   0.0  
ref|XP_002299371.1| outer membrane family protein [Populus trich...  1162   0.0  
ref|XP_002520530.1| sorting and assembly machinery (sam50) prote...  1157   0.0  
gb|EXC10708.1| Protein TOC75-3 [Morus notabilis]                     1157   0.0  
ref|XP_002303729.2| hypothetical protein POPTR_0003s15670g [Popu...  1155   0.0  
ref|XP_006418866.1| hypothetical protein EUTSA_v10002403mg [Eutr...  1153   0.0  
gb|EYU45949.1| hypothetical protein MIMGU_mgv1a001440mg [Mimulus...  1151   0.0  
ref|NP_190258.1| protein TOC75-3 [Arabidopsis thaliana] gi|75207...  1147   0.0  
ref|XP_002877511.1| translocon outer membrane complex 75-III [Ar...  1147   0.0  
ref|XP_004241213.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1146   0.0  
ref|XP_006350787.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1145   0.0  
ref|XP_006424890.1| hypothetical protein CICLE_v10027835mg [Citr...  1139   0.0  
ref|XP_004301512.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1139   0.0  
ref|XP_007208340.1| hypothetical protein PRUPE_ppa002062mg [Prun...  1129   0.0  
ref|XP_006846297.1| hypothetical protein AMTR_s00012p00252410 [A...  1128   0.0  
ref|XP_004153150.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1123   0.0  
ref|XP_003547008.1| PREDICTED: protein TOC75-3, chloroplastic-li...  1115   0.0  

>ref|XP_007016346.1| Translocon at the outer envelope membrane of chloroplasts 75-III
            [Theobroma cacao] gi|508786709|gb|EOY33965.1| Translocon
            at the outer envelope membrane of chloroplasts 75-III
            [Theobroma cacao]
          Length = 813

 Score = 1201 bits (3108), Expect = 0.0
 Identities = 607/774 (78%), Positives = 658/774 (85%), Gaps = 20/774 (2%)
 Frame = +1

Query: 43   AFCGPGHLLPP--NLSLCRRRTATATVRSDL---DRKSS---------PQP---TKTLHF 171
            +F  P HLL P  NLS   RR    T  S      R SS         P+P   T  L  
Sbjct: 3    SFPAPSHLLSPSANLSSSTRRQLPPTSSSSSRASPRSSSIKCHLPFQNPKPQKRTSLLRS 62

Query: 172  LSK--TLAFSAVSSLISHLSPVPSQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX- 342
            LSK  TLA ++ ++L+  ++P+P+ L                                  
Sbjct: 63   LSKPLTLASASAATLLIRITPIPTLLAGGGGENFGGSGGLSGGGGGGGGSGGGDGSSGNF 122

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W K+FSPSPAIA D+ +Q+QEWDSHGLPANIVVQLNKLSGFK+YK+S+I FFD+RRWTTV
Sbjct: 123  WEKLFSPSPAIA-DDNNQTQEWDSHGLPANIVVQLNKLSGFKKYKLSDILFFDRRRWTTV 181

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGG+YTK  LQKELETLATCGMFEKVD+EGKT PDGT+G+TISFTES
Sbjct: 182  GTEDSFFEMVSLRPGGIYTKTQLQKELETLATCGMFEKVDMEGKTNPDGTLGLTISFTES 241

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM QSK IEMDSDMTDKE  EYYK+QEKDY+RRIERARPCLLPV +
Sbjct: 242  TWQSADRFRCINVGLMAQSKPIEMDSDMTDKEKLEYYKNQEKDYKRRIERARPCLLPVQV 301

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            HREVLQMLR+QG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 302  HREVLQMLRDQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 361

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQL V+RRELPKQLR G+VFNIEAGKQALRNINSLALFSNIE
Sbjct: 362  TQLVIQFQDKLGNVVEGNTQLPVVRRELPKQLRQGNVFNIEAGKQALRNINSLALFSNIE 421

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKEL+QK+AEVSTEWSIVPGR GRP LAS QPGGTV+ EHRN
Sbjct: 422  VNPRPDEKNEGGIIVEIKLKELDQKSAEVSTEWSIVPGRGGRPALASFQPGGTVSFEHRN 481

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS++GSLTT+NF +PQDDL+FK EYVHPYLDG++NPRNRTFRA+CFNSRKLSPVF
Sbjct: 482  LKGLNRSILGSLTTSNFFNPQDDLAFKLEYVHPYLDGVYNPRNRTFRASCFNSRKLSPVF 541

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVME+I TRDESSHISP GQR LP
Sbjct: 542  TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDESSHISPNGQRVLP 601

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTGVD+M F QANITRDNTKFVNGAIVGERNVFQ+DQGLGIGS+FP
Sbjct: 602  SGGISADGPPTTLSGTGVDRMAFLQANITRDNTKFVNGAIVGERNVFQVDQGLGIGSKFP 661

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT TRF+QLKQVEEGA+KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 662  FFNRHQLTFTRFLQLKQVEEGANKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 721

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGELGA RNI+EL AELR+PVRNTHVY FAEHGNDLGSSKDVKGNPTE YRRM
Sbjct: 722  NMGELGAARNIIELGAELRIPVRNTHVYAFAEHGNDLGSSKDVKGNPTEVYRRM 775


>ref|XP_002280661.1| PREDICTED: protein TOC75-3, chloroplastic-like [Vitis vinifera]
          Length = 808

 Score = 1170 bits (3027), Expect = 0.0
 Identities = 582/771 (75%), Positives = 650/771 (84%), Gaps = 10/771 (1%)
 Frame = +1

Query: 22   LIAPPTMAFCGPGHLLPPNLSLCRRRTAT--ATVRSDLDRKSSPQ-------PTKTLHFL 174
            L +PP      P   L  +  + +  ++T  +T++ DL   SS         P K    L
Sbjct: 10   LFSPPLTPSLRPRRRLASSTPILKAGSSTPASTIKCDLSSSSSSSQNPESLTPKKPFFSL 69

Query: 175  SKTLAFSA-VSSLISHLSPVPSQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWSK 351
            ++ LAFSA  + ++   SPV   +                                 WS+
Sbjct: 70   AQALAFSAGAAGILLRFSPVTPLVDSGDFSGGGGVGGTGDGGGGGGDGGF-------WSR 122

Query: 352  IFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTVGTD 531
            IFSP+ A+AKDEESQ  EWDSHGLPANIVVQLNKLSGFK+YK+SEI F+D+RR + VGT+
Sbjct: 123  IFSPA-AVAKDEESQ--EWDSHGLPANIVVQLNKLSGFKKYKISEILFYDRRRGSVVGTE 179

Query: 532  DSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTESTWQ 711
            DSFFEMV+++PGG+Y KA LQKELE LATCGMFEKVDLEGKT PDGT+GVTISF ESTWQ
Sbjct: 180  DSFFEMVTIRPGGIYNKAQLQKELENLATCGMFEKVDLEGKTNPDGTVGVTISFLESTWQ 239

Query: 712  SADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPIHRE 891
            SAD FRCINVGLM Q+K IEMD+DMTDKE  EY+++QEKDY+RRI+++RPCLLP+P++RE
Sbjct: 240  SADKFRCINVGLMPQTKPIEMDADMTDKEKMEYFRNQEKDYKRRIDKSRPCLLPMPVYRE 299

Query: 892  VLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDITQL 1071
            +LQMLR+QG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNT+EVVCEVVEGDITQL
Sbjct: 300  ILQMLRDQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTREVVCEVVEGDITQL 359

Query: 1072 AIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIEVNP 1251
             IQF DKLGNV EGNTQ  V+RRELPKQLR GHVFNIEAGKQALRNINSLALFSNIEVNP
Sbjct: 360  VIQFQDKLGNVVEGNTQFPVVRRELPKQLRQGHVFNIEAGKQALRNINSLALFSNIEVNP 419

Query: 1252 RPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRNLNG 1431
            RPDEK EGGIIVEIKLKELEQKTAEVS+EWSIVPGR GRPTLASIQPGGTV+ EHRN+ G
Sbjct: 420  RPDEKNEGGIIVEIKLKELEQKTAEVSSEWSIVPGRGGRPTLASIQPGGTVSFEHRNIKG 479

Query: 1432 LNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVFTGG 1611
            LNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++N RNRT RA+CFNSRKLSPVFTGG
Sbjct: 480  LNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNARNRTLRASCFNSRKLSPVFTGG 539

Query: 1612 PGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLPSGG 1791
            PGVDEVPPIWVDRAG+KANITENFTRQSKFTYGLVME+I TRDESSHISP GQR LPSGG
Sbjct: 540  PGVDEVPPIWVDRAGIKANITENFTRQSKFTYGLVMEEITTRDESSHISPNGQRVLPSGG 599

Query: 1792 ISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFPFFN 1971
            ISADGPPTTLSGTG+D+M F QANITRDNTKFVNGAIVGERNVFQ+DQGLG+GS FPFFN
Sbjct: 600  ISADGPPTTLSGTGIDRMAFAQANITRDNTKFVNGAIVGERNVFQVDQGLGVGSNFPFFN 659

Query: 1972 RHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGYNMG 2151
            RHQLT+TRFIQLKQVEEGA KPPPPVLVLHGHYGGCVGDLPSYDAF LGGPYSVRGYNMG
Sbjct: 660  RHQLTLTRFIQLKQVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFALGGPYSVRGYNMG 719

Query: 2152 ELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            ELGA RNILE+AAELR+PVRNTHVY FAEHGNDLGSSKDVKGNPTE YRRM
Sbjct: 720  ELGAARNILEVAAELRIPVRNTHVYAFAEHGNDLGSSKDVKGNPTEVYRRM 770


>emb|CAN81047.1| hypothetical protein VITISV_006765 [Vitis vinifera]
          Length = 784

 Score = 1166 bits (3016), Expect = 0.0
 Identities = 560/654 (85%), Positives = 613/654 (93%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            WS+IFSP+ A+AKDEESQ  EWDSHGLPANIVVQLNKLSGFK+YK+SEI F+D+RR + V
Sbjct: 96   WSRIFSPA-AVAKDEESQ--EWDSHGLPANIVVQLNKLSGFKKYKISEILFYDRRRGSVV 152

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMV+++PGG+Y KA LQKELE LATCGMFEKVDLEGKT PDGT+GVTISF ES
Sbjct: 153  GTEDSFFEMVTIRPGGIYNKAQLQKELENLATCGMFEKVDLEGKTNPDGTVGVTISFLES 212

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM Q+K IEMD+DMTDKE  EY+++QEKDY+RRI+++RPCLLP+P+
Sbjct: 213  TWQSADKFRCINVGLMPQTKPIEMDADMTDKEKMEYFRNQEKDYKRRIDKSRPCLLPMPV 272

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            +RE+LQMLR+QG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNT+EVVCEVVEGDI
Sbjct: 273  YREILQMLRDQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTREVVCEVVEGDI 332

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQ  V+RRELPKQLR GHVFNIEAGKQALRNINSLALFSNIE
Sbjct: 333  TQLVIQFQDKLGNVVEGNTQFPVVRRELPKQLRQGHVFNIEAGKQALRNINSLALFSNIE 392

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQKTAEVS+EWSIVPGR GRPTLASIQPGGTV+ EHRN
Sbjct: 393  VNPRPDEKNEGGIIVEIKLKELEQKTAEVSSEWSIVPGRGGRPTLASIQPGGTVSFEHRN 452

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++N RNRT RA+CFNSRKLSPVF
Sbjct: 453  IKGLNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNARNRTLRASCFNSRKLSPVF 512

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDRAG+KANITENFTRQSKFTYGLVME+I TRDESSHISP GQR LP
Sbjct: 513  TGGPGVDEVPPIWVDRAGIKANITENFTRQSKFTYGLVMEEITTRDESSHISPNGQRVLP 572

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTG+D+M F QANITRDNTKFVNGAIVGERNVFQ+DQGLG+GS FP
Sbjct: 573  SGGISADGPPTTLSGTGIDRMAFAQANITRDNTKFVNGAIVGERNVFQVDQGLGVGSNFP 632

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRFIQLKQVEEGA KPPPPVLVLHGHYGGCVGDLPSYDAF LGGPYSVRGY
Sbjct: 633  FFNRHQLTLTRFIQLKQVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFALGGPYSVRGY 692

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGELGA RNILE+AAELR+PVRNTHVY FAEHGNDLGSSKDVKGNPTE YRRM
Sbjct: 693  NMGELGAARNILEVAAELRIPVRNTHVYAFAEHGNDLGSSKDVKGNPTEVYRRM 746


>ref|XP_006290594.1| hypothetical protein CARUB_v10016683mg, partial [Capsella rubella]
            gi|482559301|gb|EOA23492.1| hypothetical protein
            CARUB_v10016683mg, partial [Capsella rubella]
          Length = 845

 Score = 1162 bits (3006), Expect = 0.0
 Identities = 588/787 (74%), Positives = 641/787 (81%), Gaps = 27/787 (3%)
 Frame = +1

Query: 22   LIAPPTMAFCGPGHLLPPNLS------LCRRRTATATVRSDLDRKSSPQPT--------- 156
            L +PP  AF   G L+P   S      L  RR   +   S L R SSP P          
Sbjct: 22   LPSPPMAAFSVNGKLIPAATSSTNPTPLSSRRKFLSPSTSRLPRISSPSPRVPSIKCSSS 81

Query: 157  ------------KTLHFLSKTLAFSAVSSLISHLSPVPSQLXXXXXXXXXXXXXXXXXXX 300
                          L  L+K LA ++VSS  S      S L                   
Sbjct: 82   LPSRDTEPSPKDSLLKNLAKPLAVASVSSAASFFLFRISNLPSVLSGGGGGGGDGNFGGF 141

Query: 301  XXXXXXXXXXXXXXWSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKV 480
                          W K+FSP+PA+A  +E QS +WDSHGLPANIVVQLNKLSGFK+YKV
Sbjct: 142  GGGGGGGDGNDGGFWGKLFSPAPAVA--DEEQSPDWDSHGLPANIVVQLNKLSGFKKYKV 199

Query: 481  SEIQFFDQRRWTTVGTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTK 660
            S+I FFD+RR TT+GT+DSFFEMVS++PGGVYTKA LQKELETLATCGMFEKVDLEGKTK
Sbjct: 200  SDIMFFDRRRQTTIGTEDSFFEMVSIRPGGVYTKAQLQKELETLATCGMFEKVDLEGKTK 259

Query: 661  PDGTIGVTISFTESTWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRR 840
            PDGT+GVTISF ESTWQSAD FRCINVGLM QSK IEMDSDMTDKE  EYY+S EKDY+R
Sbjct: 260  PDGTLGVTISFAESTWQSADRFRCINVGLMVQSKPIEMDSDMTDKEKLEYYRSLEKDYKR 319

Query: 841  RIERARPCLLPVPIHREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNL 1020
            RI+RARPCLLP P++ EV+QMLR+QG VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNL
Sbjct: 320  RIDRARPCLLPAPVYGEVMQMLRDQGKVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNL 379

Query: 1021 NTKEVVCEVVEGDITQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQA 1200
            NTKEVVCEVVEGDITQL IQF DKLGNV EGNTQ+ V+RRELPKQLR G+VFNIEAGKQA
Sbjct: 380  NTKEVVCEVVEGDITQLVIQFQDKLGNVVEGNTQVPVVRRELPKQLRQGYVFNIEAGKQA 439

Query: 1201 LRNINSLALFSNIEVNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLA 1380
            LRNINSL LFSNIEVNPRPDEK EGGIIVEIKLKELEQK+AEVSTEWSIVPGR G PTLA
Sbjct: 440  LRNINSLGLFSNIEVNPRPDEKNEGGIIVEIKLKELEQKSAEVSTEWSIVPGRGGAPTLA 499

Query: 1381 SIQPGGTVTIEHRNLNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTF 1560
            S QPGG+VT EHRNL GLNRSL+GS+TT+NF++PQDDLSFK EYVHPYLDG++NPRNRTF
Sbjct: 500  SFQPGGSVTFEHRNLQGLNRSLMGSVTTSNFLNPQDDLSFKLEYVHPYLDGVYNPRNRTF 559

Query: 1561 RANCFNSRKLSPVFTGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRD 1740
            + +CFNSRKLSPVFTGGPGV+EVPPIWVDRAG+KANITENFTRQSKFTYGLVME+I TRD
Sbjct: 560  KTSCFNSRKLSPVFTGGPGVEEVPPIWVDRAGLKANITENFTRQSKFTYGLVMEEITTRD 619

Query: 1741 ESSHISPYGQRQLPSGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNV 1920
            ESSHI+  GQR LPSGGISADGPPTTLSGTG+D+M F QANITRDNTKFVNGA+VGER V
Sbjct: 620  ESSHIAANGQRLLPSGGISADGPPTTLSGTGIDRMAFLQANITRDNTKFVNGAVVGERTV 679

Query: 1921 FQLDQGLGIGSQFPFFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSY 2100
            FQ+DQGLGIGS+FPFFNRHQLTMTRFIQL+QVEEGA KPPPPVLVLHGHYGGCVGDLPSY
Sbjct: 680  FQVDQGLGIGSKFPFFNRHQLTMTRFIQLRQVEEGAGKPPPPVLVLHGHYGGCVGDLPSY 739

Query: 2101 DAFTLGGPYSVRGYNMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGN 2280
            DAF LGGPYSVRGYNMGELGA RNI EL AE+R+PV+NTHVY F EHGNDLGSSKDVKGN
Sbjct: 740  DAFVLGGPYSVRGYNMGELGAARNIAELGAEIRIPVKNTHVYAFVEHGNDLGSSKDVKGN 799

Query: 2281 PTEFYRR 2301
            PT  YRR
Sbjct: 800  PTAVYRR 806


>ref|XP_002299371.1| outer membrane family protein [Populus trichocarpa]
            gi|222846629|gb|EEE84176.1| outer membrane family protein
            [Populus trichocarpa]
          Length = 813

 Score = 1162 bits (3005), Expect = 0.0
 Identities = 562/654 (85%), Positives = 606/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W  +FS + A A  +ESQSQ+WDSHGLPANIVVQLNKLSGFK+YK+SEI FFD+RRWTTV
Sbjct: 124  WKNLFSVASANA--DESQSQDWDSHGLPANIVVQLNKLSGFKKYKLSEILFFDRRRWTTV 181

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYTKA LQKELE+LATCGMFEKVD+EGKT PDGTIG+TISFTES
Sbjct: 182  GTEDSFFEMVSLRPGGVYTKAQLQKELESLATCGMFEKVDMEGKTNPDGTIGITISFTES 241

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLMQQSK IEMD DMTDKE  EYY+SQEKDYRRRIE+ARPCLLP  +
Sbjct: 242  TWQSADKFRCINVGLMQQSKPIEMDPDMTDKEKLEYYRSQEKDYRRRIEKARPCLLPTQV 301

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            HREVLQMLREQG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 302  HREVLQMLREQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 361

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQ+ DKLGNV EGNTQL V++RELPKQLR G VFNIEAGKQALRNINSLALFSNIE
Sbjct: 362  TQLVIQYQDKLGNVVEGNTQLPVVKRELPKQLRQGQVFNIEAGKQALRNINSLALFSNIE 421

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELE K+AEVSTEWSIVPGR GRPTLAS QPGGTV+ EHRN
Sbjct: 422  VNPRPDEKNEGGIIVEIKLKELEPKSAEVSTEWSIVPGRGGRPTLASFQPGGTVSFEHRN 481

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRS++GS+TT+NF   QDDLSFK EYVHPYLDG++NPRNRT R +CFNSRKLSPVF
Sbjct: 482  IKGLNRSILGSITTSNFFSAQDDLSFKLEYVHPYLDGVYNPRNRTLRGSCFNSRKLSPVF 541

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDRAG+KANITENFTRQSKFTYG+VME+I TRDESSHIS  GQR LP
Sbjct: 542  TGGPGVDEVPPIWVDRAGMKANITENFTRQSKFTYGIVMEEITTRDESSHISSNGQRVLP 601

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTG+D+M F QANITRDNTKFVNG +VG+RNVFQ+DQGLGIGS+FP
Sbjct: 602  SGGISADGPPTTLSGTGIDRMAFLQANITRDNTKFVNGTVVGDRNVFQVDQGLGIGSKFP 661

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRFIQLK+VEEGA KPPPPVLVL+GHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 662  FFNRHQLTLTRFIQLKEVEEGAGKPPPPVLVLNGHYGGCVGDLPSYDAFTLGGPYSVRGY 721

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGELGA RNILEL AE+R+PVRNTHVY FAEHGNDLG+SKDVKGNPTE YRRM
Sbjct: 722  NMGELGAARNILELGAEVRIPVRNTHVYAFAEHGNDLGTSKDVKGNPTEVYRRM 775


>ref|XP_002520530.1| sorting and assembly machinery (sam50) protein, putative [Ricinus
            communis] gi|223540372|gb|EEF41943.1| sorting and
            assembly machinery (sam50) protein, putative [Ricinus
            communis]
          Length = 815

 Score = 1157 bits (2993), Expect = 0.0
 Identities = 570/738 (77%), Positives = 638/738 (86%), Gaps = 3/738 (0%)
 Frame = +1

Query: 100  TATATVRSDLDRKSSPQPTKTL-HFLSKTLAFSAVSSLISHLSPVPSQLXXXXXXXXXXX 276
            +++++  S    + +P+P  +L   L+K LA ++ +SL   L+P+PS             
Sbjct: 42   SSSSSSSSSSSSQQNPKPHNSLLKTLTKPLAVASAASLFLRLTPIPSPFSGGGNNGGDWG 101

Query: 277  XXXXXXXXXXXXXXXXXXXXXX--WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLN 450
                                    W+K+F P+PAIA  +ESQS+++DSHGLPANIVVQLN
Sbjct: 102  GRGGGGGGGGENNFNGGEAGGDGFWNKLFQPAPAIA--DESQSKDFDSHGLPANIVVQLN 159

Query: 451  KLSGFKRYKVSEIQFFDQRRWTTVGTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMF 630
            KLSGFK+YK+S+I FFD+RR+TTVG+ DSFFEMVSL+PGG YTKA LQKELETLA+CGMF
Sbjct: 160  KLSGFKKYKLSDIVFFDRRRYTTVGSQDSFFEMVSLRPGGTYTKAQLQKELETLASCGMF 219

Query: 631  EKVDLEGKTKPDGTIGVTISFTESTWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEY 810
            EKVD+EGKT PDGT+G+TISFTESTWQSAD FRCINVGLMQQSK IEMD DMTDKE  EY
Sbjct: 220  EKVDMEGKTNPDGTLGITISFTESTWQSADKFRCINVGLMQQSKPIEMDPDMTDKEKLEY 279

Query: 811  YKSQEKDYRRRIERARPCLLPVPIHREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYA 990
            Y+SQEKDY+RRIE+ARPCLLP  ++REVLQMLR+QG VSARLLQ+IRDRVQKWYHDEGYA
Sbjct: 280  YRSQEKDYKRRIEKARPCLLPASVNREVLQMLRDQGKVSARLLQKIRDRVQKWYHDEGYA 339

Query: 991  CAQVVNFGNLNTKEVVCEVVEGDITQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGH 1170
            CAQVVNFGNLNTKEVVCEVVEGDITQ+ IQ+ DKLGNV EGNTQL V++RELPKQLR G 
Sbjct: 340  CAQVVNFGNLNTKEVVCEVVEGDITQMVIQYQDKLGNVVEGNTQLPVVKRELPKQLRQGQ 399

Query: 1171 VFNIEAGKQALRNINSLALFSNIEVNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIV 1350
            VFNIEAGKQALRNINSLALFSNIEVNPRPDEK EGGIIVEIKLKELE K+AEVSTEWSIV
Sbjct: 400  VFNIEAGKQALRNINSLALFSNIEVNPRPDEKNEGGIIVEIKLKELEPKSAEVSTEWSIV 459

Query: 1351 PGRNGRPTLASIQPGGTVTIEHRNLNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLD 1530
            PGR GRPTLAS QPGGTV+ EHRN+ GLNRS++GS+TT+NF  PQDDL+FK EYVHPYLD
Sbjct: 460  PGRGGRPTLASFQPGGTVSFEHRNIKGLNRSILGSITTSNFFLPQDDLAFKLEYVHPYLD 519

Query: 1531 GIFNPRNRTFRANCFNSRKLSPVFTGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYG 1710
            G++NPRNRT RA+CFNSRKLSPVFTGGPGVDEVPPIWVDRAG+KANITENFTRQSKFTYG
Sbjct: 520  GVYNPRNRTLRASCFNSRKLSPVFTGGPGVDEVPPIWVDRAGLKANITENFTRQSKFTYG 579

Query: 1711 LVMEKIITRDESSHISPYGQRQLPSGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFV 1890
            +VME+I TRDESSHIS  GQR LPSGGISADGPPTTLSGTG+D+M F QANITRDNTKFV
Sbjct: 580  IVMEEITTRDESSHISANGQRVLPSGGISADGPPTTLSGTGIDRMAFLQANITRDNTKFV 639

Query: 1891 NGAIVGERNVFQLDQGLGIGSQFPFFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHY 2070
            NGA+VGERNVFQ+DQGLGIGS+FPFFNRHQLT+TRFI L QVEEGA KPPPPVLVL+GHY
Sbjct: 640  NGAVVGERNVFQVDQGLGIGSKFPFFNRHQLTITRFIPLTQVEEGAGKPPPPVLVLNGHY 699

Query: 2071 GGCVGDLPSYDAFTLGGPYSVRGYNMGELGACRNILELAAELRVPVRNTHVYLFAEHGND 2250
            GGCVGDLPSYDAFTLGGPYSVRGYNMGELGA RNILEL AE+R+PVRNTHVY FAEHGND
Sbjct: 700  GGCVGDLPSYDAFTLGGPYSVRGYNMGELGAARNILELGAEIRIPVRNTHVYAFAEHGND 759

Query: 2251 LGSSKDVKGNPTEFYRRM 2304
            LG+SKDVKGNPTE YRRM
Sbjct: 760  LGTSKDVKGNPTEVYRRM 777


>gb|EXC10708.1| Protein TOC75-3 [Morus notabilis]
          Length = 826

 Score = 1157 bits (2992), Expect = 0.0
 Identities = 556/654 (85%), Positives = 605/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W ++F PS A A D  SQSQEWDSHGLPANIVVQLNKLSGFK+Y+VS+IQF D+RR  TV
Sbjct: 137  WRRLFCPSAAFADD--SQSQEWDSHGLPANIVVQLNKLSGFKKYRVSDIQFLDRRRGNTV 194

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            G +DSFFEMVSL+PGGVY KA LQKELETLATCGMFEKVD+EGKT PDGT+GVTISFTES
Sbjct: 195  GPEDSFFEMVSLRPGGVYAKAQLQKELETLATCGMFEKVDMEGKTNPDGTLGVTISFTES 254

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVG+M QSK IEMD DMTDKE  EY+K+QEKDY+RRIERARPCLLP+P+
Sbjct: 255  TWQSADKFRCINVGMMPQSKPIEMDPDMTDKEKLEYFKNQEKDYKRRIERARPCLLPMPV 314

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            +REV+QMLR+QG VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 315  YREVMQMLRDQGKVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 374

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQ+ V++RELP+QLRPG+VFNIEAGKQALRNINSLALFSNIE
Sbjct: 375  TQLVIQFQDKLGNVVEGNTQIPVVKRELPRQLRPGYVFNIEAGKQALRNINSLALFSNIE 434

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQKTAEVSTEWSIVPGR G PTLAS+QPGGTVT EHRN
Sbjct: 435  VNPRPDEKNEGGIIVEIKLKELEQKTAEVSTEWSIVPGRGGYPTLASLQPGGTVTFEHRN 494

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++NPRNR+ R +CFNSRKLSPVF
Sbjct: 495  IKGLNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNPRNRSLRVSCFNSRKLSPVF 554

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPG DEVPPIWVDRAGVKANITENFTRQSKFTYGLVME+I TRDESSHI P GQR LP
Sbjct: 555  TGGPGADEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDESSHICPNGQRILP 614

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGIS DGPPTTLSGTG+D+M F QANITRDNTKFVNG IVG+RNVFQ+DQGLGIGS+FP
Sbjct: 615  SGGISVDGPPTTLSGTGIDRMAFLQANITRDNTKFVNGTIVGQRNVFQVDQGLGIGSKFP 674

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+T+F QLKQVEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 675  FFNRHQLTLTKFFQLKQVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 734

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGE+GA RNI+ELAAELR+PV+ THVY F EHGNDLGSSKDVKGNPTE YRRM
Sbjct: 735  NMGEIGAARNIVELAAELRIPVQGTHVYAFVEHGNDLGSSKDVKGNPTEVYRRM 788


>ref|XP_002303729.2| hypothetical protein POPTR_0003s15670g [Populus trichocarpa]
            gi|550343261|gb|EEE78708.2| hypothetical protein
            POPTR_0003s15670g [Populus trichocarpa]
          Length = 816

 Score = 1155 bits (2989), Expect = 0.0
 Identities = 559/654 (85%), Positives = 607/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W K+F  +PA A  +ESQS++WDSHGLPANIVVQLNKLSGFK+YK+SEI FFD+RRWTTV
Sbjct: 127  WKKLFFVAPANA--DESQSEDWDSHGLPANIVVQLNKLSGFKKYKLSEILFFDRRRWTTV 184

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYTKA LQKELE+LATCGMFEKVD+EGKT PDGTIG+TISFTES
Sbjct: 185  GTEDSFFEMVSLRPGGVYTKAQLQKELESLATCGMFEKVDMEGKTNPDGTIGITISFTES 244

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLMQQSK IEMDSDMTDKE  EYY+SQEKDYRRRIERARPCLLP  +
Sbjct: 245  TWQSADKFRCINVGLMQQSKPIEMDSDMTDKEKLEYYRSQEKDYRRRIERARPCLLPTQV 304

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            HREVLQMLREQG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGD+
Sbjct: 305  HREVLQMLREQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDV 364

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQ+LDKLGNV EG+TQL V++RELPKQLR G VFNIEAGKQALRNINSLALFSNIE
Sbjct: 365  TQLVIQYLDKLGNVVEGHTQLPVVKRELPKQLRQGQVFNIEAGKQALRNINSLALFSNIE 424

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELE K+AEVSTEWSIVPGR GRPTLAS QPGGTV+ EHRN
Sbjct: 425  VNPRPDEKNEGGIIVEIKLKELEPKSAEVSTEWSIVPGRGGRPTLASFQPGGTVSFEHRN 484

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRS++GS+TT+NF   Q+DLSFK EYVHPYLDG+++ RN+T RA+CFN RKLSPVF
Sbjct: 485  IKGLNRSILGSITTSNFFSAQEDLSFKLEYVHPYLDGLYSSRNQTLRASCFNIRKLSPVF 544

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDR G+KANITENFTRQSKFTYG+VME+I T DESSHIS  GQR LP
Sbjct: 545  TGGPGVDEVPPIWVDRTGMKANITENFTRQSKFTYGIVMEEITTSDESSHISSNGQRVLP 604

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTGVD+M F QANITRDNTKFVNGA+VG+RNVFQ+DQGLGIGS+FP
Sbjct: 605  SGGISADGPPTTLSGTGVDRMAFLQANITRDNTKFVNGAVVGDRNVFQVDQGLGIGSKFP 664

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRFIQLK+VEEGA KPPPPVLVL+GHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 665  FFNRHQLTLTRFIQLKEVEEGAGKPPPPVLVLNGHYGGCVGDLPSYDAFTLGGPYSVRGY 724

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGELGA RNILEL AE+R+PVRNTHVY FAEHGNDLG+SKDVKGNPTE YRRM
Sbjct: 725  NMGELGAARNILELGAEIRIPVRNTHVYAFAEHGNDLGTSKDVKGNPTEVYRRM 778


>ref|XP_006418866.1| hypothetical protein EUTSA_v10002403mg [Eutrema salsugineum]
            gi|557096794|gb|ESQ37302.1| hypothetical protein
            EUTSA_v10002403mg [Eutrema salsugineum]
          Length = 818

 Score = 1153 bits (2982), Expect = 0.0
 Identities = 575/759 (75%), Positives = 639/759 (84%), Gaps = 6/759 (0%)
 Frame = +1

Query: 46   FCGPGHLLPPNLSLCRRRTATATVRSDLDRKSSPQPTKT--LHFLSKTLAFSAVSSLIS- 216
            F  P     P +S    R A+    S L  +     +K   L  L+K LA ++VSS  S 
Sbjct: 29   FLSPSSSRLPRISTPSPRVASIKCSSSLPTRDKEPSSKDNLLKSLAKPLAVASVSSAASF 88

Query: 217  ---HLSPVPSQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWSKIFSPSPAIAKDE 387
                +S +PS L                                 W ++FSP+ A+A  +
Sbjct: 89   FLFRISNLPSFLCGGGGGGDGNFGGFGGGGGGGDGNDGGF-----WRELFSPALAVA--D 141

Query: 388  ESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTVGTDDSFFEMVSLQPG 567
            E QS +WDSHGLPANIVVQLNKLSGFK+YKVS+I FFD+RR +T+GT+DSFFEMVS++PG
Sbjct: 142  EEQSPDWDSHGLPANIVVQLNKLSGFKKYKVSDIMFFDRRRQSTIGTEDSFFEMVSIRPG 201

Query: 568  GVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTESTWQSADSFRCINVGL 747
            GVYTKA LQKELETLATCGMFEKVDLEGKTKPDGT+GVTISF ESTWQSAD FRCINVGL
Sbjct: 202  GVYTKAQLQKELETLATCGMFEKVDLEGKTKPDGTLGVTISFAESTWQSADRFRCINVGL 261

Query: 748  MQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPIHREVLQMLREQGTVS 927
            M QSK +EMD+DMTDKE  EYY+S EKDY+RRI+RARPCLLP P++ EV+QMLR+QG VS
Sbjct: 262  MVQSKPVEMDADMTDKEKLEYYRSLEKDYKRRIDRARPCLLPAPVYGEVMQMLRDQGKVS 321

Query: 928  ARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDITQLAIQFLDKLGNVC 1107
            ARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDITQL IQF DKLGNV 
Sbjct: 322  ARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDITQLVIQFQDKLGNVV 381

Query: 1108 EGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIEVNPRPDEKKEGGIIV 1287
            EGNTQ+ V+RRELPKQLR G+VFNIEAGKQALRNINSL LFSNIEVNPRPDEK EGGIIV
Sbjct: 382  EGNTQVPVVRRELPKQLRQGYVFNIEAGKQALRNINSLGLFSNIEVNPRPDEKNEGGIIV 441

Query: 1288 EIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRNLNGLNRSLIGSLTTN 1467
            EIKLKELEQK+AEVSTEWSIVPGR G PTLAS QPGG+VT EHRN+ GLNRS++GS+TT+
Sbjct: 442  EIKLKELEQKSAEVSTEWSIVPGRGGAPTLASFQPGGSVTFEHRNIQGLNRSVMGSVTTS 501

Query: 1468 NFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVFTGGPGVDEVPPIWVD 1647
            NF++PQDDLSFK EYVHPYLDG++NPRNRTF+ +CFNSRKLSPVFTGGPGV+EVPPIWVD
Sbjct: 502  NFLNPQDDLSFKLEYVHPYLDGVYNPRNRTFKTSCFNSRKLSPVFTGGPGVEEVPPIWVD 561

Query: 1648 RAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLPSGGISADGPPTTLSG 1827
            RAGVKANITENFTRQSKFTYGLVME+I TRDESSHI+  GQR LPSGGISADGPPTTLSG
Sbjct: 562  RAGVKANITENFTRQSKFTYGLVMEEITTRDESSHIAANGQRLLPSGGISADGPPTTLSG 621

Query: 1828 TGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFPFFNRHQLTMTRFIQL 2007
            TG+D+M F QANITRDNTKFVNGA+VG+RNVFQ+DQGLGIGS+FPFFNRHQLT+TRFIQL
Sbjct: 622  TGIDRMAFLQANITRDNTKFVNGAVVGDRNVFQVDQGLGIGSKFPFFNRHQLTLTRFIQL 681

Query: 2008 KQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGYNMGELGACRNILELA 2187
            +QVEEGA KPPPPVLVLHGHYGGCVGDLPSYDAF LGGPYSVRGYNMGELGA RNILEL 
Sbjct: 682  QQVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFVLGGPYSVRGYNMGELGAARNILELG 741

Query: 2188 AELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            AE+R+PV+NTHVY FAEHGNDLGSSKDVKGNPT  YRRM
Sbjct: 742  AEIRIPVKNTHVYAFAEHGNDLGSSKDVKGNPTAVYRRM 780


>gb|EYU45949.1| hypothetical protein MIMGU_mgv1a001440mg [Mimulus guttatus]
          Length = 819

 Score = 1151 bits (2978), Expect = 0.0
 Identities = 583/782 (74%), Positives = 651/782 (83%), Gaps = 28/782 (3%)
 Frame = +1

Query: 43   AFCGPGHLLPPNLSLCRRR----TATATVRSDLDRKSS--------------PQPTKTLH 168
            +F  P HLLP   S   RR    T T  V S L   +S              P+P+K+ H
Sbjct: 3    SFAAPTHLLPLTSSTASRRSKFSTTTTAVASRLPSPNSIIQCNLPSLASSQNPKPSKSPH 62

Query: 169  -FL-----SKTLAFSAVSSLISHL-SPVPS---QLXXXXXXXXXXXXXXXXXXXXXXXXX 318
             FL     SK++A SA S ++ H+ SPV S    +                         
Sbjct: 63   SFLNSVISSKSIALSAASCILFHIASPVLSGNDSIFNFSGGGAGGNGGGGGGGGGGGGGS 122

Query: 319  XXXXXXXXWSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFF 498
                    W + FSP+ A+AKD++ Q QEWDSHGLPA+IVVQLNKLSGFK+YKVS+I FF
Sbjct: 123  SGGPGGNFW-RFFSPA-AVAKDDDPQ-QEWDSHGLPASIVVQLNKLSGFKKYKVSDILFF 179

Query: 499  DQRRWTTVGTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIG 678
            D+RR +TVGT+DSFFEMVSL+PGGVYTKA LQKELETLATCGMFEKVDLE KT PDGTI 
Sbjct: 180  DRRRGSTVGTEDSFFEMVSLRPGGVYTKAQLQKELETLATCGMFEKVDLETKTNPDGTIN 239

Query: 679  VTISFTESTWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERAR 858
            +TI F ESTWQSAD FRCINVGLMQQ+K IEMD DMT+KE  E+Y+SQEKDYRRRI+R+R
Sbjct: 240  ITIPFLESTWQSADKFRCINVGLMQQTKPIEMDPDMTEKERLEFYRSQEKDYRRRIDRSR 299

Query: 859  PCLLPVPIHREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVV 1038
            PCLLP P+ RE+LQMLR+QG VSARLLQRIRDRVQ+WYH+ GYACAQVVNFGNLNTKEVV
Sbjct: 300  PCLLPPPVQREILQMLRDQGKVSARLLQRIRDRVQQWYHENGYACAQVVNFGNLNTKEVV 359

Query: 1039 CEVVEGDITQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINS 1218
            CEVVEGDITQL IQF DKLGNVCEGNTQ  VI+RELPK LR G VFNIEAGKQALRNINS
Sbjct: 360  CEVVEGDITQLVIQFQDKLGNVCEGNTQFPVIKRELPKLLRQGQVFNIEAGKQALRNINS 419

Query: 1219 LALFSNIEVNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGG 1398
            L+LFSNIEVNPRPDEK EGGIIVEIKLKELEQK+AEVS+EWSIVPGR GRPTLA+IQPGG
Sbjct: 420  LSLFSNIEVNPRPDEKNEGGIIVEIKLKELEQKSAEVSSEWSIVPGRAGRPTLATIQPGG 479

Query: 1399 TVTIEHRNLNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFN 1578
            TV+ EHRN+NGLNRSL+GS+TT+NF++PQDDL+FK EYVHPY+DG++NPRNRTFRA+CFN
Sbjct: 480  TVSFEHRNINGLNRSLLGSVTTSNFLNPQDDLAFKLEYVHPYVDGVYNPRNRTFRASCFN 539

Query: 1579 SRKLSPVFTGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHIS 1758
            SRKLSPVFTGGPG++EVPPIWVDRAG+KAN TENFTRQSKFTYGLVME+I TRDESSHIS
Sbjct: 540  SRKLSPVFTGGPGIEEVPPIWVDRAGMKANFTENFTRQSKFTYGLVMEEITTRDESSHIS 599

Query: 1759 PYGQRQLPSGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQG 1938
              GQR LPSGG+SADGPPTTLSGTGVD+M F QANITRDNTKF+NGAIVG+RNVFQ+DQG
Sbjct: 600  ANGQRVLPSGGVSADGPPTTLSGTGVDRMAFLQANITRDNTKFLNGAIVGDRNVFQVDQG 659

Query: 1939 LGIGSQFPFFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLG 2118
            LGIGS+FPFFNRHQLT+TRF+QL  VEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLG
Sbjct: 660  LGIGSKFPFFNRHQLTVTRFLQLNDVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLG 719

Query: 2119 GPYSVRGYNMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYR 2298
            GPYSVRGYNMGELGA RNILELAAELR+P++NTH Y+FAEHGNDLGSSKDVKGNPTE YR
Sbjct: 720  GPYSVRGYNMGELGAARNILELAAELRIPIKNTHAYVFAEHGNDLGSSKDVKGNPTEVYR 779

Query: 2299 RM 2304
            RM
Sbjct: 780  RM 781


>ref|NP_190258.1| protein TOC75-3 [Arabidopsis thaliana]
            gi|75207662|sp|Q9STE8.1|TC753_ARATH RecName: Full=Protein
            TOC75-3, chloroplastic; AltName: Full=75 kDa translocon
            at the outer-envelope-membrane of chloroplasts 3;
            Short=AtTOC75-III; Flags: Precursor
            gi|5541685|emb|CAB51191.1| chloroplast import-associated
            channel homolog [Arabidopsis thaliana]
            gi|22022564|gb|AAM83239.1| AT3g46740/T6H20_230
            [Arabidopsis thaliana] gi|30102496|gb|AAP21166.1|
            At3g46740/T6H20_230 [Arabidopsis thaliana]
            gi|332644678|gb|AEE78199.1| protein TOC75-3 [Arabidopsis
            thaliana]
          Length = 818

 Score = 1147 bits (2967), Expect = 0.0
 Identities = 553/653 (84%), Positives = 603/653 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W K+FSPSPA+A  +E QS +WDSHGLPANIVVQLNKLSGFK+YKVS+I FFD+RR TT+
Sbjct: 129  WGKLFSPSPAVA--DEEQSPDWDSHGLPANIVVQLNKLSGFKKYKVSDIMFFDRRRQTTI 186

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVS++PGGVYTKA LQKELETLATCGMFEKVDLEGKTKPDGT+GVTISF ES
Sbjct: 187  GTEDSFFEMVSIRPGGVYTKAQLQKELETLATCGMFEKVDLEGKTKPDGTLGVTISFAES 246

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM QSK IEMDSDMTDKE  EYY+S EKDY+RRI+RARPCLLP P+
Sbjct: 247  TWQSADRFRCINVGLMVQSKPIEMDSDMTDKEKLEYYRSLEKDYKRRIDRARPCLLPAPV 306

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            + EV+QMLR+QG VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 307  YGEVMQMLRDQGKVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 366

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQ+ V+RRELPKQLR G+VFNIEAGK+AL NINSL LFSNIE
Sbjct: 367  TQLVIQFQDKLGNVVEGNTQVPVVRRELPKQLRQGYVFNIEAGKKALSNINSLGLFSNIE 426

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQK+AEVSTEWSIVPGR G PTLAS QPGG+VT EHRN
Sbjct: 427  VNPRPDEKNEGGIIVEIKLKELEQKSAEVSTEWSIVPGRGGAPTLASFQPGGSVTFEHRN 486

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRSL+GS+TT+NF++PQDDLSFK EYVHPYLDG++NPRNRTF+ +CFNSRKLSPVF
Sbjct: 487  LQGLNRSLMGSVTTSNFLNPQDDLSFKLEYVHPYLDGVYNPRNRTFKTSCFNSRKLSPVF 546

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGV+EVPPIWVDRAGVKANITENFTRQSKFTYGLVME+I TRDESSHI+  GQR LP
Sbjct: 547  TGGPGVEEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDESSHIAANGQRLLP 606

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTGVD+M F QANITRDNTKFVNGA+VG+R VFQ+DQGLGIGS+FP
Sbjct: 607  SGGISADGPPTTLSGTGVDRMAFLQANITRDNTKFVNGAVVGQRTVFQVDQGLGIGSKFP 666

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLTMT+FIQL++VE+GA K PPPVLVLHGHYGGCVGDLPSYDAF LGGPYSVRGY
Sbjct: 667  FFNRHQLTMTKFIQLREVEQGAGKSPPPVLVLHGHYGGCVGDLPSYDAFVLGGPYSVRGY 726

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRR 2301
            NMGELGA RNI E+ AE+R+PV+NTHVY F EHGNDLGSSKDVKGNPT  YRR
Sbjct: 727  NMGELGAARNIAEVGAEIRIPVKNTHVYAFVEHGNDLGSSKDVKGNPTAVYRR 779


>ref|XP_002877511.1| translocon outer membrane complex 75-III [Arabidopsis lyrata subsp.
            lyrata] gi|297323349|gb|EFH53770.1| translocon outer
            membrane complex 75-III [Arabidopsis lyrata subsp.
            lyrata]
          Length = 817

 Score = 1147 bits (2967), Expect = 0.0
 Identities = 583/784 (74%), Positives = 642/784 (81%), Gaps = 31/784 (3%)
 Frame = +1

Query: 43   AFCGPGHLLPP------NLSLCRRRTATATVRSDLDRKSSPQP---------------TK 159
            AF   G L+P       + SL  RR   +   S L R SSP P               T+
Sbjct: 3    AFSVNGQLIPTTTSSTSSTSLSSRRKFLSPSSSRLPRISSPSPRVPSIKCSSSLPNRDTE 62

Query: 160  T------LHFLSKTLAFSAVSSLIS----HLSPVPSQLXXXXXXXXXXXXXXXXXXXXXX 309
            T      L  L+K LA ++VSS  S     +S +PS L                      
Sbjct: 63   TSSKDSLLKNLAKPLAVASVSSAASFFLFRISNLPSVLTGGGGGGGGGDGNFGGFGGGDG 122

Query: 310  XXXXXXXXXXXWSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEI 489
                       W K+F+P+PA+A  +E QS +WDSHGLPANIVVQLNKLSGFK+YKVS+I
Sbjct: 123  NDGGF------WGKLFAPAPAVA--DEEQSPDWDSHGLPANIVVQLNKLSGFKKYKVSDI 174

Query: 490  QFFDQRRWTTVGTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDG 669
             FFD+RR TT+GT+DSFFEMVS++PGGVYTKA LQKELETLATCGMFEKVDLEGKTKPDG
Sbjct: 175  MFFDRRRQTTIGTEDSFFEMVSIRPGGVYTKAQLQKELETLATCGMFEKVDLEGKTKPDG 234

Query: 670  TIGVTISFTESTWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIE 849
            T+GVTISF ESTWQSAD FRCINVGLM QSK IEMDSDMTDKE  EYY+S EKDY+RRI+
Sbjct: 235  TLGVTISFAESTWQSADRFRCINVGLMVQSKPIEMDSDMTDKEKLEYYRSLEKDYKRRID 294

Query: 850  RARPCLLPVPIHREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTK 1029
            RARPCLLP P++ EV+QMLR+QG VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTK
Sbjct: 295  RARPCLLPAPVYGEVMQMLRDQGKVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTK 354

Query: 1030 EVVCEVVEGDITQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRN 1209
            EVVCEVVEGDITQL IQF DKLGNV EGNTQ+ V+RRELPKQLR G+VFNIEAGKQALRN
Sbjct: 355  EVVCEVVEGDITQLVIQFQDKLGNVVEGNTQVPVVRRELPKQLRQGYVFNIEAGKQALRN 414

Query: 1210 INSLALFSNIEVNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQ 1389
            INSL LFSNIEVNPRPDEK EGGIIVEIKLKELE K+AEVSTEWSIVPGR G PTLAS Q
Sbjct: 415  INSLGLFSNIEVNPRPDEKNEGGIIVEIKLKELEHKSAEVSTEWSIVPGRGGAPTLASFQ 474

Query: 1390 PGGTVTIEHRNLNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRAN 1569
            PGG+VT EHRNL GLNRSL+GS+TT+NF++PQDDLSFK EYVHPYLDG++NPRNRTF+ +
Sbjct: 475  PGGSVTFEHRNLQGLNRSLMGSVTTSNFLNPQDDLSFKLEYVHPYLDGVYNPRNRTFKTS 534

Query: 1570 CFNSRKLSPVFTGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESS 1749
            CFNSRKLSPVFTGGPGV+EVPPIWVDRAGVKANITENFTRQSKFTYGLVME+I TRDESS
Sbjct: 535  CFNSRKLSPVFTGGPGVEEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDESS 594

Query: 1750 HISPYGQRQLPSGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQL 1929
            HI+  GQR LPSGGISADGPPTTLSGTG+D+M F QANITRD TKFVNGA+VG+R VFQ+
Sbjct: 595  HIAANGQRLLPSGGISADGPPTTLSGTGIDRMAFLQANITRDTTKFVNGAVVGQRTVFQV 654

Query: 1930 DQGLGIGSQFPFFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAF 2109
            DQGLGIGS+FPFFNRHQLTMTRFIQL++VEEGA K PPPVLVLHGHYGGCVGDLPSYDAF
Sbjct: 655  DQGLGIGSKFPFFNRHQLTMTRFIQLREVEEGAGKSPPPVLVLHGHYGGCVGDLPSYDAF 714

Query: 2110 TLGGPYSVRGYNMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTE 2289
             LGGPYSVRGYNMGELGA RNI E+ AE+R+PV+NTHVY F EHGNDLGSSKDVKGNPT 
Sbjct: 715  VLGGPYSVRGYNMGELGAARNIAEVGAEIRIPVKNTHVYAFVEHGNDLGSSKDVKGNPTA 774

Query: 2290 FYRR 2301
             YRR
Sbjct: 775  VYRR 778


>ref|XP_004241213.1| PREDICTED: protein TOC75-3, chloroplastic-like [Solanum lycopersicum]
          Length = 812

 Score = 1146 bits (2964), Expect = 0.0
 Identities = 557/654 (85%), Positives = 608/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            WSKIFSP+ AIA +EESQ  EWDSHGLPANIVVQLNKLSGFK+YKVS+I FFD+RR +TV
Sbjct: 124  WSKIFSPA-AIAGEEESQ--EWDSHGLPANIVVQLNKLSGFKKYKVSDILFFDRRRGSTV 180

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEM+SL+PGGVYTKA LQKELETLAT GMFEKVDL+ KT PDGT+GVTISF ES
Sbjct: 181  GTEDSFFEMLSLRPGGVYTKAQLQKELETLATSGMFEKVDLDAKTNPDGTVGVTISFLES 240

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM QSK IEMD+DMT+KE  EY+ SQE+DYRRRIER+RPCLLPV +
Sbjct: 241  TWQSADKFRCINVGLMPQSKPIEMDADMTEKEKLEYFNSQEQDYRRRIERSRPCLLPVSV 300

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
             RE+LQ+LRE+GTVSARLLQ+IRD+VQ+WYHD GYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 301  QREILQLLREKGTVSARLLQKIRDKVQQWYHDNGYACAQVVNFGNLNTKEVVCEVVEGDI 360

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQ+ IQF DKLGNVCEGNTQ  V+RRELP+QLR G VFNIEAGKQALRNINSLALFSNIE
Sbjct: 361  TQMVIQFQDKLGNVCEGNTQYPVVRRELPRQLRQGKVFNIEAGKQALRNINSLALFSNIE 420

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQK+AEVSTEWSIVPGR GRPTLASIQPGGTV+ EHRN
Sbjct: 421  VNPRPDEKNEGGIIVEIKLKELEQKSAEVSTEWSIVPGRGGRPTLASIQPGGTVSFEHRN 480

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++NPRNRT R +CFNSRKLSPVF
Sbjct: 481  LYGLNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNPRNRTLRTSCFNSRKLSPVF 540

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDRAG+KANITENFTRQSKFTYGLVME+I TRDESSHIS  GQR LP
Sbjct: 541  TGGPGVDEVPPIWVDRAGLKANITENFTRQSKFTYGLVMEEITTRDESSHISARGQRVLP 600

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLS TG+D+M F QANITRDNTKF+NG IVGERNVFQ+DQGLG+G++FP
Sbjct: 601  SGGISADGPPTTLSETGIDRMAFLQANITRDNTKFINGTIVGERNVFQVDQGLGVGTKFP 660

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLTMT+FIQLKQVEEGA K PPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 661  FFNRHQLTMTQFIQLKQVEEGAGKAPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 720

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGE+GA RNI+ELAAELR+PVRNTHVY FAEHGNDLGSSKDVKGNPTE YRRM
Sbjct: 721  NMGEIGAARNIVELAAELRIPVRNTHVYAFAEHGNDLGSSKDVKGNPTEVYRRM 774


>ref|XP_006350787.1| PREDICTED: protein TOC75-3, chloroplastic-like [Solanum tuberosum]
          Length = 814

 Score = 1145 bits (2961), Expect = 0.0
 Identities = 556/654 (85%), Positives = 608/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            WSKIFSP+ AIA +EESQ  EWDSHGLPANIVVQLNKLSGFK+YKVS+I FFD+RR +TV
Sbjct: 126  WSKIFSPA-AIAGEEESQ--EWDSHGLPANIVVQLNKLSGFKKYKVSDILFFDRRRGSTV 182

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEM+SL+PGGVYTKA LQKELETLAT GMFEKVDL+ KT PDGT+GVTISF ES
Sbjct: 183  GTEDSFFEMLSLRPGGVYTKAQLQKELETLATSGMFEKVDLDAKTNPDGTVGVTISFLES 242

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM QSK IEMD+DMT+KE  EY+ SQE+DYRRRIER+RPCLLPV +
Sbjct: 243  TWQSADKFRCINVGLMPQSKPIEMDADMTEKEKLEYFNSQEQDYRRRIERSRPCLLPVSV 302

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
             RE+LQ+LRE+GTVSARLLQ+IRD+VQ+WYHD GYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 303  QREILQLLREKGTVSARLLQKIRDKVQQWYHDNGYACAQVVNFGNLNTKEVVCEVVEGDI 362

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQ+ IQF DKLGNVCEGNTQ  V+RRELP+QLR G VFNIEAGKQALRNINSLALFSNIE
Sbjct: 363  TQMVIQFQDKLGNVCEGNTQYPVVRRELPRQLRQGKVFNIEAGKQALRNINSLALFSNIE 422

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQK+AEVSTEWSIVPGR GRPTLASIQPGGTV+ EHRN
Sbjct: 423  VNPRPDEKNEGGIIVEIKLKELEQKSAEVSTEWSIVPGRGGRPTLASIQPGGTVSFEHRN 482

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++NPRNRT R +CFNSRKLSPVF
Sbjct: 483  LYGLNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNPRNRTLRTSCFNSRKLSPVF 542

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDRAG+KANITENFTRQSKFTYGLVME+I TRDESSHIS  GQR LP
Sbjct: 543  TGGPGVDEVPPIWVDRAGLKANITENFTRQSKFTYGLVMEEITTRDESSHISARGQRVLP 602

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLS TG+D+M F QANITRDNTKF+NG IVGERNVFQ+DQGLG+G++FP
Sbjct: 603  SGGISADGPPTTLSETGIDRMAFLQANITRDNTKFINGTIVGERNVFQVDQGLGVGTKFP 662

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLTMT+FIQLKQVEEGA K PPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 663  FFNRHQLTMTQFIQLKQVEEGAGKAPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 722

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGE+GA RNI+ELAAELR+PVRNTHVY FAEHGNDLG+SKDVKGNPTE YRRM
Sbjct: 723  NMGEIGAARNIVELAAELRIPVRNTHVYAFAEHGNDLGTSKDVKGNPTEVYRRM 776


>ref|XP_006424890.1| hypothetical protein CICLE_v10027835mg [Citrus clementina]
            gi|568870366|ref|XP_006488376.1| PREDICTED: protein
            TOC75-3, chloroplastic-like [Citrus sinensis]
            gi|557526824|gb|ESR38130.1| hypothetical protein
            CICLE_v10027835mg [Citrus clementina]
          Length = 814

 Score = 1139 bits (2946), Expect = 0.0
 Identities = 544/654 (83%), Positives = 605/654 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W K+F    AIA DE+++SQ+WD+HGLPANI VQL+KLSGF++YK+SEI FFD++R  TV
Sbjct: 124  WRKVFGTELAIA-DEDNESQDWDAHGLPANIAVQLSKLSGFRKYKLSEILFFDRQRGATV 182

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYT+  L KELETLATCGMFEKVD+E KTKPDGT+G+TISF ES
Sbjct: 183  GTEDSFFEMVSLRPGGVYTRTQLLKELETLATCGMFEKVDMEAKTKPDGTLGLTISFLES 242

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSA+  RCINVGLMQQSK IEMD+DMT+KE  EYY SQEKDY+RRI++ARPCLLP  +
Sbjct: 243  TWQSAERIRCINVGLMQQSKPIEMDADMTEKEKLEYYHSQEKDYKRRIDKARPCLLPQSV 302

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            H E+LQML++ G VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNT+EVVCEVVEGDI
Sbjct: 303  HNEILQMLKDHGKVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTREVVCEVVEGDI 362

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL +QF DKLGNV EGNTQLAV++RELPKQLR G+VFNIEAGKQALRNINSL+LFSNIE
Sbjct: 363  TQLVVQFQDKLGNVVEGNTQLAVVKRELPKQLRQGNVFNIEAGKQALRNINSLSLFSNIE 422

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKEL+QK+AEVS EWS+VPGR GRPT AS+QPGGTV+ EHRN
Sbjct: 423  VNPRPDEKNEGGIIVEIKLKELDQKSAEVSAEWSLVPGRGGRPTFASLQPGGTVSFEHRN 482

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS++GS+TT+NF++PQDDL+FK EYVHPYLDG++NPRNRTFRA+CFNSRKLSPVF
Sbjct: 483  LQGLNRSILGSVTTSNFLNPQDDLAFKLEYVHPYLDGVYNPRNRTFRASCFNSRKLSPVF 542

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVP IWVDRAG+KANITENFTRQSKFTYGLVME+I TRDESSHISP+GQR LP
Sbjct: 543  TGGPGVDEVPAIWVDRAGLKANITENFTRQSKFTYGLVMEEITTRDESSHISPHGQRVLP 602

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTG+D+M F Q NITRDNTKFVNGAIVGERNVFQ+DQGLGIGS+FP
Sbjct: 603  SGGISADGPPTTLSGTGIDRMAFLQGNITRDNTKFVNGAIVGERNVFQVDQGLGIGSKFP 662

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRF QLKQVEEGA+KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 663  FFNRHQLTLTRFFQLKQVEEGANKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 722

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGELGA RNILEL AE+R+PV+NTHVY F EHGNDLGSSKDVKGNPTE YRRM
Sbjct: 723  NMGELGAARNILELGAEIRIPVKNTHVYAFVEHGNDLGSSKDVKGNPTEVYRRM 776


>ref|XP_004301512.1| PREDICTED: protein TOC75-3, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 845

 Score = 1139 bits (2945), Expect = 0.0
 Identities = 548/654 (83%), Positives = 596/654 (91%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W  +F P   +A  +E+QS EWDSHGLPANIVVQLNKLSGFK+YKVSEI FFD+RRW+TV
Sbjct: 155  WRSLFGPQ-LVANADEAQSPEWDSHGLPANIVVQLNKLSGFKKYKVSEILFFDRRRWSTV 213

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYTKA LQKELETLA CGMFEKVDLEGKT PDGT+GVTISFTES
Sbjct: 214  GTEDSFFEMVSLRPGGVYTKAQLQKELETLANCGMFEKVDLEGKTNPDGTLGVTISFTES 273

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLM QSK IEMD DMTDKE  EY+++QEKDY+RRI+RARPCLLP P+
Sbjct: 274  TWQSADKFRCINVGLMAQSKPIEMDPDMTDKEKMEYFRNQEKDYKRRIDRARPCLLPPPV 333

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
             REVL MLREQG VSARLLQ+IRDRVQ+WY DEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 334  QREVLLMLREQGKVSARLLQKIRDRVQRWYQDEGYACAQVVNFGNLNTKEVVCEVVEGDI 393

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGN  EGNTQ+ V++RELP+QLRPG+VFNIEAGKQALRNINSLALFSNIE
Sbjct: 394  TQLVIQFQDKLGNFVEGNTQIPVVKRELPRQLRPGYVFNIEAGKQALRNINSLALFSNIE 453

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQKTAEVSTEWSIVPGR G PTLAS+QPGGTVT EHRN
Sbjct: 454  VNPRPDEKNEGGIIVEIKLKELEQKTAEVSTEWSIVPGRGGYPTLASLQPGGTVTFEHRN 513

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS++GS+TT+NF +PQDDL+FK EYVHPYLDG++NPRNR  R +CFNSRKLSPVF
Sbjct: 514  LKGLNRSILGSVTTSNFYNPQDDLAFKLEYVHPYLDGVYNPRNRALRVSCFNSRKLSPVF 573

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPG DEVPPIWVDRAG+KANITENF RQSKFTYGLVME+I TRDE SHI   GQR LP
Sbjct: 574  TGGPGADEVPPIWVDRAGMKANITENFNRQSKFTYGLVMEEITTRDERSHICSNGQRVLP 633

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGG+S DGPPTTLSGTG+D++ F Q+NITRDNTKFVNG IVGERNVFQ+DQGLGIGS+FP
Sbjct: 634  SGGVSEDGPPTTLSGTGIDRVAFIQSNITRDNTKFVNGTIVGERNVFQVDQGLGIGSKFP 693

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRF QLK+VEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 694  FFNRHQLTLTRFFQLKEVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 753

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGE+GA R ILELAAELR+PV+ THVY FAEHGNDLGSSKDVKGNPTE YRR+
Sbjct: 754  NMGEIGAARQILELAAELRIPVKGTHVYAFAEHGNDLGSSKDVKGNPTEVYRRV 807


>ref|XP_007208340.1| hypothetical protein PRUPE_ppa002062mg [Prunus persica]
            gi|462403982|gb|EMJ09539.1| hypothetical protein
            PRUPE_ppa002062mg [Prunus persica]
          Length = 723

 Score = 1129 bits (2920), Expect = 0.0
 Identities = 545/651 (83%), Positives = 594/651 (91%)
 Frame = +1

Query: 352  IFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTVGTD 531
            +F    A+A  +E QSQEWDSHGLPANIVVQLNKLSGFK+YKVSEI FFD+RRWT VG+D
Sbjct: 37   LFGSQTAMA--DEPQSQEWDSHGLPANIVVQLNKLSGFKKYKVSEIFFFDRRRWTAVGSD 94

Query: 532  DSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTESTWQ 711
            DSFFEMVSL+ G VYTKA LQKELE+LA CGMFEKVDLEGKT PDGT+GVTISFTESTWQ
Sbjct: 95   DSFFEMVSLRAGSVYTKAQLQKELESLANCGMFEKVDLEGKTNPDGTLGVTISFTESTWQ 154

Query: 712  SADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPIHRE 891
            SAD FRCINVGLM QSK  EMD DMTDKE  EY+++QEKDY+RRI+RARPCLLP P+ RE
Sbjct: 155  SADKFRCINVGLMPQSKPSEMDPDMTDKEKLEYFRNQEKDYKRRIDRARPCLLPAPVQRE 214

Query: 892  VLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDITQL 1071
            VL MLREQG VSARLLQ+IRDRVQKWY DEGYACAQVVNFGNLNTKEVVCEVVEGDITQL
Sbjct: 215  VLLMLREQGKVSARLLQKIRDRVQKWYQDEGYACAQVVNFGNLNTKEVVCEVVEGDITQL 274

Query: 1072 AIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIEVNP 1251
             IQF DKLGN  EGNTQ+ V++RELP+QLRPG+VFNIEAGKQALRNINSL+LFSNIEVNP
Sbjct: 275  LIQFQDKLGNFVEGNTQIPVLKRELPRQLRPGYVFNIEAGKQALRNINSLSLFSNIEVNP 334

Query: 1252 RPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRNLNG 1431
            RPDEK EGGIIVEIKLKELEQKTAEV+TEWSIVPGR G PTLAS+QPGGTVT EHRNLNG
Sbjct: 335  RPDEKNEGGIIVEIKLKELEQKTAEVNTEWSIVPGRGGYPTLASLQPGGTVTFEHRNLNG 394

Query: 1432 LNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVFTGG 1611
            LNRS++G++ T NF +PQDDL+FK EYVHPYLDG++NPRNRT R +CFNSRKLSPVFTGG
Sbjct: 395  LNRSILGTVNTTNFCNPQDDLAFKLEYVHPYLDGVYNPRNRTLRVSCFNSRKLSPVFTGG 454

Query: 1612 PGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLPSGG 1791
            PG DEVPPIWVDRAGVKANITENFTRQSKFTYGLVME+I TRDE SHI   GQR LPSGG
Sbjct: 455  PGADEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDERSHICSNGQRVLPSGG 514

Query: 1792 ISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFPFFN 1971
            +S DGPPTTLSGTG+D++ F Q+NITRDNTKFVNGAIVG+RNVFQ+DQGLG+GS+FPFFN
Sbjct: 515  VSEDGPPTTLSGTGIDRVAFLQSNITRDNTKFVNGAIVGQRNVFQVDQGLGVGSKFPFFN 574

Query: 1972 RHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGYNMG 2151
            RHQLT+TRF QLK+VEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY+MG
Sbjct: 575  RHQLTLTRFFQLKEVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGYSMG 634

Query: 2152 ELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            E+GA RNILELAAELR+PV+ THVY FAEHGNDLGSSKDVKGNPTE YRRM
Sbjct: 635  EIGAARNILELAAELRIPVKGTHVYAFAEHGNDLGSSKDVKGNPTEVYRRM 685


>ref|XP_006846297.1| hypothetical protein AMTR_s00012p00252410 [Amborella trichopoda]
            gi|548849067|gb|ERN07972.1| hypothetical protein
            AMTR_s00012p00252410 [Amborella trichopoda]
          Length = 859

 Score = 1128 bits (2918), Expect = 0.0
 Identities = 542/653 (83%), Positives = 601/653 (92%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            WS++ SP  A+AK+EE+Q  EWD HGLPANI+VQ+NKLSGFK+YK SEI FFD++R + V
Sbjct: 171  WSELLSPV-AVAKEEENQ--EWDPHGLPANILVQINKLSGFKKYKTSEILFFDKKRGSVV 227

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGG+YTKA LQKELETLATCGMFEKVDLE KTKPDGTI V ISF ES
Sbjct: 228  GTEDSFFEMVSLKPGGIYTKAHLQKELETLATCGMFEKVDLEAKTKPDGTIAVNISFAES 287

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD+F+CINVGL+ QSK ++MD DMTD+E +EY K QE +YR+R+ER+RPCLLP+ +
Sbjct: 288  TWQSADAFKCINVGLLPQSKPVDMDPDMTDREKFEYIKRQEAEYRKRMERSRPCLLPIRV 347

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
             +EVL MLREQG VSARLLQ+IRDRVQKWYHDEGYACAQVVNFGNLNT+EVVCEVVEGDI
Sbjct: 348  QKEVLAMLREQGKVSARLLQKIRDRVQKWYHDEGYACAQVVNFGNLNTREVVCEVVEGDI 407

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL +QF DKLGNVCEGNT+L +IRRELPKQLR GHVFNIEAGKQALRNINSLALFSNIE
Sbjct: 408  TQLVVQFQDKLGNVCEGNTELPIIRRELPKQLRQGHVFNIEAGKQALRNINSLALFSNIE 467

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKELEQKTAEVSTEWSIVPGR GRPTLASIQPGGTV+ EHRN
Sbjct: 468  VNPRPDEKNEGGIIVEIKLKELEQKTAEVSTEWSIVPGRQGRPTLASIQPGGTVSFEHRN 527

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRSL+GS+T+NN ++PQDDLSFKFEYVHPY+DG++NPRNRTFRA+ FNSRKLSPVF
Sbjct: 528  IKGLNRSLLGSVTSNNLLNPQDDLSFKFEYVHPYVDGVYNPRNRTFRASVFNSRKLSPVF 587

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPG++EVPPIWVDR+G KANI ENFTRQSKFTYGLV+E+I TRDE+S I   G R LP
Sbjct: 588  TGGPGMEEVPPIWVDRSGFKANIMENFTRQSKFTYGLVLEEITTRDETSSICTNGARALP 647

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGG+S DGPPTTLSGTGVD+M F QANITRDNTKFVNGAIVGERNVFQLDQGLGIG+ FP
Sbjct: 648  SGGLSMDGPPTTLSGTGVDRMAFVQANITRDNTKFVNGAIVGERNVFQLDQGLGIGTNFP 707

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQL+MTRFIQLK V+EGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 708  FFNRHQLSMTRFIQLKSVQEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 767

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRR 2301
            NMGELGACRNILELAAELR+P+RNTHVY FAEHGNDLGSSK+VKGNPTEF+RR
Sbjct: 768  NMGELGACRNILELAAELRIPIRNTHVYAFAEHGNDLGSSKEVKGNPTEFFRR 820


>ref|XP_004153150.1| PREDICTED: protein TOC75-3, chloroplastic-like [Cucumis sativus]
          Length = 809

 Score = 1123 bits (2905), Expect = 0.0
 Identities = 546/653 (83%), Positives = 598/653 (91%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            W K FS S A+A  +E Q+QEWDSHGLPANIVVQLNKLSGFK+YKVS+I FFD+RR  TV
Sbjct: 121  WRKFFS-SAALA--DERQNQEWDSHGLPANIVVQLNKLSGFKKYKVSDILFFDRRRGITV 177

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYTKA LQKELETLATCGMFE+VDL+  T  DGTIGV I FTES
Sbjct: 178  GTEDSFFEMVSLRPGGVYTKAQLQKELETLATCGMFERVDLDSNTNADGTIGVRILFTES 237

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSA+ FRCINVGLMQQ+K +EMD+DMTDKE  EYY+SQE+DY+RRIERARPC+LP  +
Sbjct: 238  TWQSAERFRCINVGLMQQTKPMEMDADMTDKEKMEYYRSQERDYKRRIERARPCMLPEAV 297

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            +R+VL MLR QG VSAR LQ+IRD VQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 298  YRDVLLMLRTQGKVSARSLQQIRDMVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 357

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQL+V+RRELPKQLRPG+VFNIEAGKQALRNINSLALFSNIE
Sbjct: 358  TQLVIQFQDKLGNVVEGNTQLSVVRRELPKQLRPGYVFNIEAGKQALRNINSLALFSNIE 417

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDEK EGGIIVEIKLKEL+QKTAEVS+EWSIVPGR GRPTLAS+QPGGTVT EHRN
Sbjct: 418  VNPRPDEKNEGGIIVEIKLKELDQKTAEVSSEWSIVPGRGGRPTLASLQPGGTVTFEHRN 477

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            + GLNRS++G++TT+NF +PQDDLSFK EYVHPYLDGI++PRNRT R +CFNSRKLSPVF
Sbjct: 478  IKGLNRSILGTITTSNFFNPQDDLSFKLEYVHPYLDGIYSPRNRTLRVSCFNSRKLSPVF 537

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPG DEVPPIWVDRAGVKANITENFTRQSKFTYG V+E+IITRDESS+I P GQR L 
Sbjct: 538  TGGPGADEVPPIWVDRAGVKANITENFTRQSKFTYGAVVEEIITRDESSNICPNGQRALL 597

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
             GGISADGPPTTLSGTG D+M F QANITRDNTKFVNGAIVG+RNVFQ+DQG+G+GS +P
Sbjct: 598  GGGISADGPPTTLSGTGTDRMAFLQANITRDNTKFVNGAIVGDRNVFQVDQGIGVGSNYP 657

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRF+QLK+VEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 658  FFNRHQLTLTRFLQLKEVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 717

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRR 2301
            NMGELGA RNILELAAELR+PV+ THVY FAEHGNDLGSSKDVKGNPTE YRR
Sbjct: 718  NMGELGAARNILELAAELRIPVKGTHVYAFAEHGNDLGSSKDVKGNPTEVYRR 770


>ref|XP_003547008.1| PREDICTED: protein TOC75-3, chloroplastic-like [Glycine max]
          Length = 794

 Score = 1115 bits (2883), Expect = 0.0
 Identities = 540/654 (82%), Positives = 590/654 (90%)
 Frame = +1

Query: 343  WSKIFSPSPAIAKDEESQSQEWDSHGLPANIVVQLNKLSGFKRYKVSEIQFFDQRRWTTV 522
            WS++F+     A  +ESQSQEWDSHGLPANIVVQLNK+SGFK+YKVS+I FFD+ R   V
Sbjct: 108  WSRMFA-----AVADESQSQEWDSHGLPANIVVQLNKMSGFKKYKVSDISFFDRNRKMKV 162

Query: 523  GTDDSFFEMVSLQPGGVYTKAVLQKELETLATCGMFEKVDLEGKTKPDGTIGVTISFTES 702
            GT+DSFFEMVSL+PGGVYTK  LQKELETLAT GMFEKVDLEGKT PDG+IGVTISF+ES
Sbjct: 163  GTEDSFFEMVSLRPGGVYTKGQLQKELETLATSGMFEKVDLEGKTNPDGSIGVTISFSES 222

Query: 703  TWQSADSFRCINVGLMQQSKAIEMDSDMTDKEAYEYYKSQEKDYRRRIERARPCLLPVPI 882
            TWQSAD FRCINVGLMQQ+K +EMD+DMTDKE  EYY SQE++Y+RRIERARPCLLP  +
Sbjct: 223  TWQSADGFRCINVGLMQQTKPVEMDADMTDKERLEYYLSQEREYKRRIERARPCLLPRYV 282

Query: 883  HREVLQMLREQGTVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 1062
            H E+L ML+  G VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI
Sbjct: 283  HNEILDMLKRHGMVSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVVCEVVEGDI 342

Query: 1063 TQLAIQFLDKLGNVCEGNTQLAVIRRELPKQLRPGHVFNIEAGKQALRNINSLALFSNIE 1242
            TQL IQF DKLGNV EGNTQ+ VI+RELP+QLRPG+ FNIEAGKQALRN+NSLALFSNIE
Sbjct: 343  TQLDIQFQDKLGNVVEGNTQVPVIQRELPRQLRPGYTFNIEAGKQALRNVNSLALFSNIE 402

Query: 1243 VNPRPDEKKEGGIIVEIKLKELEQKTAEVSTEWSIVPGRNGRPTLASIQPGGTVTIEHRN 1422
            VNPRPDE  EGGI+VEIKLKELEQK+AEVSTEWSIVPGR G PTLAS+QPGGTV+ EHRN
Sbjct: 403  VNPRPDETNEGGIVVEIKLKELEQKSAEVSTEWSIVPGRGGHPTLASLQPGGTVSFEHRN 462

Query: 1423 LNGLNRSLIGSLTTNNFIDPQDDLSFKFEYVHPYLDGIFNPRNRTFRANCFNSRKLSPVF 1602
            L GLNRS+ GS+TT+NF++PQDDL+FK EYVHPYLDG++  RNRT R +CFNSRKLSPVF
Sbjct: 463  LQGLNRSINGSITTSNFLNPQDDLAFKLEYVHPYLDGVYYSRNRTLRVSCFNSRKLSPVF 522

Query: 1603 TGGPGVDEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEKIITRDESSHISPYGQRQLP 1782
            TGGPGVDEVPPIWVDR GVKANITENFTRQSKFTYGLVME+I TRDESSHI   GQR LP
Sbjct: 523  TGGPGVDEVPPIWVDRTGVKANITENFTRQSKFTYGLVMEEITTRDESSHICANGQRVLP 582

Query: 1783 SGGISADGPPTTLSGTGVDQMVFGQANITRDNTKFVNGAIVGERNVFQLDQGLGIGSQFP 1962
            SGGISADGPPTTLSGTG+D M F QANITRDNT+FVNG +VG+RN+FQ+DQGLGIGSQFP
Sbjct: 583  SGGISADGPPTTLSGTGIDHMAFLQANITRDNTRFVNGTVVGDRNMFQVDQGLGIGSQFP 642

Query: 1963 FFNRHQLTMTRFIQLKQVEEGADKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 2142
            FFNRHQLT+TRFIQL  VEEGA KPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY
Sbjct: 643  FFNRHQLTLTRFIQLMAVEEGAGKPPPPVLVLHGHYGGCVGDLPSYDAFTLGGPYSVRGY 702

Query: 2143 NMGELGACRNILELAAELRVPVRNTHVYLFAEHGNDLGSSKDVKGNPTEFYRRM 2304
            NMGE+GA RNILELAAELR+PV+ THVY F EHGNDLGSSK VKGNPTE YRRM
Sbjct: 703  NMGEIGAARNILELAAELRIPVKGTHVYAFTEHGNDLGSSKGVKGNPTEVYRRM 756

