BLASTX nr result

ID: Forsythia22_contig00006001 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00006001
         (2020 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDO96870.1| unnamed protein product [Coffea canephora]            712   0.0  
ref|XP_009764895.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0  
ref|XP_006358091.1| PREDICTED: pentatricopeptide repeat-containi...   707   0.0  
ref|XP_009627120.1| PREDICTED: pentatricopeptide repeat-containi...   707   0.0  
ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containi...   686   0.0  
ref|XP_007025291.1| Pentatricopeptide repeat (PPR-like) superfam...   650   0.0  
ref|XP_012484752.1| PREDICTED: pentatricopeptide repeat-containi...   623   e-175
ref|XP_010529216.1| PREDICTED: pentatricopeptide repeat-containi...   612   e-172
emb|CBI30729.3| unnamed protein product [Vitis vinifera]              602   e-169
ref|XP_002867972.1| pentatricopeptide repeat-containing protein ...   602   e-169
ref|XP_010436490.1| PREDICTED: pentatricopeptide repeat-containi...   596   e-167
ref|XP_010439805.1| PREDICTED: pentatricopeptide repeat-containi...   595   e-167
ref|XP_006283500.1| hypothetical protein CARUB_v10004552mg [Caps...   595   e-167
ref|XP_006414048.1| hypothetical protein EUTSA_v10024877mg [Eutr...   594   e-167
ref|XP_009132265.1| PREDICTED: pentatricopeptide repeat-containi...   588   e-165
ref|XP_010943932.1| PREDICTED: pentatricopeptide repeat-containi...   568   e-159
ref|XP_010451379.1| PREDICTED: pentatricopeptide repeat-containi...   561   e-157
ref|NP_193619.1| pentatricopeptide repeat-containing protein [Ar...   546   e-152
ref|XP_008813633.1| PREDICTED: pentatricopeptide repeat-containi...   507   e-140
ref|XP_010665683.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139

>emb|CDO96870.1| unnamed protein product [Coffea canephora]
          Length = 516

 Score =  712 bits (1837), Expect = 0.0
 Identities = 339/515 (65%), Positives = 417/515 (80%)
 Frame = -3

Query: 1910 ATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISHHSLSYVESIFNHVHRPNSYIYN 1731
            ATSIS L Q HAYMLKTGLF   FAASRL+TAA + S  SLSY  +IF    +PN+Y+YN
Sbjct: 2    ATSISELHQAHAYMLKTGLFQQPFAASRLMTAAASSSIDSLSYAHTIFTQTPQPNTYMYN 61

Query: 1730 TMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIHGIVL 1551
            T+I  YATS TP+++  +FLKLL ++  +  DKYT+TF+LKACA +CR K GKQIHG V+
Sbjct: 62   TLIRGYATSPTPNVALFLFLKLLCDDQDLLPDKYTYTFVLKACASLCRVKHGKQIHGCVI 121

Query: 1550 KNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVELARE 1371
            KNGL  D YI NTL+HMYAKCGCFE AR++LDR+   D VSWNA+LSVY +MGLV+LA +
Sbjct: 122  KNGLSWDVYICNTLLHMYAKCGCFEAARHMLDRMPNRDVVSWNAVLSVYVEMGLVDLAFD 181

Query: 1370 LFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGAFGEV 1191
             F EMPV+N+ESWNFM+SGY N GL++EAR VFD M VKD+VSWNA+I+GYA SG + EV
Sbjct: 182  FFSEMPVKNLESWNFMLSGYANSGLLDEARRVFDEMSVKDVVSWNALITGYANSGRYNEV 241

Query: 1190 LVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATALVDM 1011
            L LF+DMQ  +V+PDN TLV +LSACAG+GAL QGKWVHAY+DRNGIE  GFLATALVDM
Sbjct: 242  LELFDDMQRARVKPDNHTLVTLLSACAGIGALEQGKWVHAYMDRNGIEANGFLATALVDM 301

Query: 1010 YSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPTEVTF 831
            YSKCGCIEKA+EVFD+  RKD+STWN+MI G S+HG G  ALK+F++M+  GF P +VTF
Sbjct: 302  YSKCGCIEKAVEVFDSASRKDVSTWNAMITGFSVHGFGEQALKVFSEMVENGFKPNDVTF 361

Query: 830  ISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELLNKLK 651
            +S+LSACSRAGL+ E   +FD+M  +YGI+P IEHYGCLVDLLGR GLL EA EL+ K+ 
Sbjct: 362  VSLLSACSRAGLLFESHEIFDNMFSIYGIKPKIEHYGCLVDLLGRFGLLKEAEELVEKMP 421

Query: 650  VKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRWNVVME 471
             K+  I+W+SLL+ACR HG+VELAE+IA KL EL P+D AGYVQLSN HAS GRW+ V++
Sbjct: 422  QKDVLIIWESLLSACRNHGNVELAEHIAGKLLELNPQDNAGYVQLSNIHASKGRWSDVVD 481

Query: 470  LRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVI 366
            +R +MREK + K+PG S+IE++G VHEFLAGEG+I
Sbjct: 482  IRRKMREKLVSKKPGGSVIEVNGVVHEFLAGEGMI 516


>ref|XP_009764895.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Nicotiana sylvestris] gi|698537670|ref|XP_009764896.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Nicotiana sylvestris]
          Length = 550

 Score =  711 bits (1836), Expect = 0.0
 Identities = 349/522 (66%), Positives = 418/522 (80%), Gaps = 4/522 (0%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTI----SHHSLSYVESIFNHVHRP 1749
            E A SIS L Q HA++LKTGLF + FAASRLLT ATT+    S  +LSY  SIF H+  P
Sbjct: 29   EMANSISELHQSHAFLLKTGLFRNPFAASRLLTKATTLPSSSSADTLSYPLSIFTHIEEP 88

Query: 1748 NSYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQ 1569
            NSY YNT+I AY+TS  P LS I+FLKLL    +V+ DKYTFTFI+KACA +   K+G+Q
Sbjct: 89   NSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHKVFPDKYTFTFIVKACATIGNAKQGEQ 148

Query: 1568 IHGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGL 1389
            +HG+V K GL +D Y+ NTLIHMYAKCGCF V+R ++D L + D ++WN LLSV+ + GL
Sbjct: 149  VHGLVTKIGLEEDVYVYNTLIHMYAKCGCFGVSRGMIDGLVEDDVIAWNGLLSVFAERGL 208

Query: 1388 VELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKS 1209
             ELARELFDEMPV+NVESWNFMVSGYVN GLV+EAR VFD MLVKD+VSWN MI+GY K+
Sbjct: 209  FELARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSWNVMITGYTKA 268

Query: 1208 GAFGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLA 1029
              F EVL LFEDM   KV+PDNCTLVNVLSACAG+G+LSQGKWVHAYI+RNGIEV  FLA
Sbjct: 269  DRFAEVLALFEDMLRAKVKPDNCTLVNVLSACAGVGSLSQGKWVHAYIERNGIEVHDFLA 328

Query: 1028 TALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFT 849
            TALVDMY KCGCIEKALEVF+  LRKDISTWN+MIAGLS HG    ALK F+++I+ G  
Sbjct: 329  TALVDMYCKCGCIEKALEVFNGTLRKDISTWNAMIAGLSNHGYLDDALKTFDELIADGIK 388

Query: 848  PTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATE 669
            P +VTF+S+LS CS+ GL+SEG  MFD M+  Y IQPT+ HYGC+VDLLGR GLL EA E
Sbjct: 389  PNKVTFVSVLSTCSQGGLLSEGRRMFDLMISEYRIQPTLVHYGCMVDLLGRFGLLEEAEE 448

Query: 668  LLNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGR 489
            LL++L VKE+P +W+SLL+A R+H DVELAE IA KL EL+P D+AGYVQLSN  AS+GR
Sbjct: 449  LLSRLPVKEAPAIWESLLSASRSHNDVELAERIATKLLELDPHDSAGYVQLSNVLASMGR 508

Query: 488  WNVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIV 363
            W+ V E+R +MR +G+ KEPGCS+IE+DG VHEFLAGEG+I+
Sbjct: 509  WDDVREVRRKMRSEGVTKEPGCSMIEVDGVVHEFLAGEGIIL 550


>ref|XP_006358091.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like
            [Solanum tuberosum]
          Length = 536

 Score =  707 bits (1826), Expect = 0.0
 Identities = 343/521 (65%), Positives = 417/521 (80%), Gaps = 3/521 (0%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTI---SHHSLSYVESIFNHVHRPN 1746
            E A SIS L Q HA MLKTGLF   FAASRLLT AT +   S  +LSY  S+F H+  PN
Sbjct: 16   EMANSISELHQAHAVMLKTGLFRDPFAASRLLTKATVLPISSPETLSYALSVFTHIEEPN 75

Query: 1745 SYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQI 1566
            SYIYNT+I AY+TS  P L+ I+FLK+L   ++V+ D+YTFTFI+KACA +   K+G+Q+
Sbjct: 76   SYIYNTIIRAYSTSPFPQLALIIFLKMLNSVNKVFPDRYTFTFIVKACATMENAKQGEQV 135

Query: 1565 HGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLV 1386
            HG+V K GL +D YI NTL+HMYAKCGCF ++R ++D L + D ++WNALLSVY + GL 
Sbjct: 136  HGLVTKIGLEEDVYIYNTLVHMYAKCGCFGISRGMIDGLIEDDVIAWNALLSVYAERGLF 195

Query: 1385 ELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSG 1206
            ELARELFDEMPV+NVESWNFMVSGYVN GLV+EAR VFD MLVKD+VSWN MI+GY K+ 
Sbjct: 196  ELARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSWNVMITGYTKAD 255

Query: 1205 AFGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
             F EVL LFEDM   KV+PD+CTLVNVLSACAG+G+LSQGKWVHA+I+RNGIEV  FLAT
Sbjct: 256  KFNEVLTLFEDMLRAKVKPDDCTLVNVLSACAGVGSLSQGKWVHAFIERNGIEVHNFLAT 315

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMY KCGCIEK LEVF+  LRKDISTWN+MIAG S HG    ALK FN++I+ G  P
Sbjct: 316  ALVDMYCKCGCIEKGLEVFNGTLRKDISTWNAMIAGFSNHGYLDDALKTFNELIADGIKP 375

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
             EVTF+S+LS CS+ GL+SEG  MF+ M+  Y IQPT+ HYGC+VDLLGR GLL EA EL
Sbjct: 376  NEVTFVSVLSTCSQGGLLSEGRRMFELMINEYRIQPTLVHYGCMVDLLGRFGLLEEAEEL 435

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            ++KL VKE+P +W+SLL+A R+H DVELAE IA KL E++P+D+AGYVQLSN  AS+GRW
Sbjct: 436  VSKLPVKEAPAIWESLLSASRSHNDVELAERIATKLLEVDPRDSAGYVQLSNVLASMGRW 495

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIV 363
            + V E+R +MR +G+ KEPGCS+IE+DG VHEFLAGEG+I+
Sbjct: 496  DDVREVRRKMRSEGITKEPGCSMIEVDGVVHEFLAGEGIIL 536


>ref|XP_009627120.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Nicotiana tomentosiformis]
          Length = 550

 Score =  707 bits (1825), Expect = 0.0
 Identities = 345/521 (66%), Positives = 415/521 (79%), Gaps = 4/521 (0%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTI----SHHSLSYVESIFNHVHRP 1749
            E A SIS L Q HA++LKTGLF + FAASRLLT ATT+    S  +LSY  SIF H+  P
Sbjct: 29   ETANSISELHQSHAFLLKTGLFRNPFAASRLLTKATTLPTSSSADTLSYALSIFTHIEEP 88

Query: 1748 NSYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQ 1569
            NSY YNT+I AY+TS  P LS I+FLKLL    +++ DKYTFTFI+KACA +   K+G+Q
Sbjct: 89   NSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHKIFPDKYTFTFIVKACATIGNAKQGQQ 148

Query: 1568 IHGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGL 1389
            +HG+V K GL +DEY+ NTLIHMYAKCGCF V+R ++D L + D ++WN LLSV+ + GL
Sbjct: 149  VHGLVTKIGLEEDEYVHNTLIHMYAKCGCFGVSRGMIDGLVEDDVIAWNGLLSVFAERGL 208

Query: 1388 VELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKS 1209
             ELARELFDEMPV+NVESWNFM+SGYVN GLV+EAR VFD M  KD+VSWN MI+GY K+
Sbjct: 209  FELARELFDEMPVKNVESWNFMISGYVNVGLVDEARKVFDEMSDKDVVSWNVMITGYTKA 268

Query: 1208 GAFGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLA 1029
              F EVL LFEDM   KV+PDNCTLVNVLSACAG+G+LSQGKWVHAYI+R GI+V  FLA
Sbjct: 269  DKFAEVLALFEDMLRAKVKPDNCTLVNVLSACAGVGSLSQGKWVHAYIERYGIQVHDFLA 328

Query: 1028 TALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFT 849
            TALVDMY KCGCIEKALEVF+  LRKDISTWN+MIAGLS HG    AL+ FN++I+ G  
Sbjct: 329  TALVDMYCKCGCIEKALEVFNGTLRKDISTWNAMIAGLSNHGFLDDALETFNELIADGIK 388

Query: 848  PTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATE 669
            P EVTF+S+LS CS+ GL+SEG  MFD M+  Y IQPT+ HYGC+VDLLGR GLL EA E
Sbjct: 389  PNEVTFVSVLSTCSQGGLLSEGRRMFDLMISEYRIQPTLVHYGCMVDLLGRFGLLEEAEE 448

Query: 668  LLNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGR 489
            LL++L VKE+P +W+SLL+A R+H DVELAE IA KL EL+P D+AGYVQLSN  AS+GR
Sbjct: 449  LLSRLPVKEAPAIWESLLSASRSHNDVELAERIATKLLELDPHDSAGYVQLSNVLASMGR 508

Query: 488  WNVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVI 366
            W+ V E+R +MR +G+ KEPGCS+IE+DG VHEFLAGEG+I
Sbjct: 509  WDDVREVRRKMRSEGVTKEPGCSMIEVDGVVHEFLAGEGII 549


>ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Vitis vinifera]
          Length = 536

 Score =  686 bits (1770), Expect = 0.0
 Identities = 332/518 (64%), Positives = 418/518 (80%), Gaps = 1/518 (0%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISH-HSLSYVESIFNHVHRPNSY 1740
            E ATSIS L Q HA++LK+GL H TFAASRL+ + +T SH  ++ Y  SIF+ +  PNSY
Sbjct: 15   EMATSISELHQAHAHILKSGLIHSTFAASRLIASVSTNSHAQAIPYAHSIFSRIPNPNSY 74

Query: 1739 IYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIHG 1560
            ++NT+I AYA S TP+ +  +F ++L+  + V  DKYTFTF LK+C      +EG+QIHG
Sbjct: 75   MWNTIIRAYANSPTPEAALTIFHQMLH--ASVLPDKYTFTFALKSCGSFSGVEEGRQIHG 132

Query: 1559 IVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVEL 1380
             VLK GLGDD +I+NTLIH+YA CGC E AR+LLDR+ + D VSWNALLS Y + GL+EL
Sbjct: 133  HVLKTGLGDDLFIQNTLIHLYASCGCIEDARHLLDRMLERDVVSWNALLSAYAERGLMEL 192

Query: 1379 ARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGAF 1200
            A  LFDEM  RNVESWNFM+SGYV  GL+EEAR VF    VK++VSWNAMI+GY+ +G F
Sbjct: 193  ACHLFDEMTERNVESWNFMISGYVGVGLLEEARRVFGETPVKNVVSWNAMITGYSHAGRF 252

Query: 1199 GEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATAL 1020
             EVLVLFEDMQ   V+PDNCTLV+VLSACA +GALSQG+WVHAYID+NGI ++GF+ATAL
Sbjct: 253  SEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEWVHAYIDKNGISIDGFVATAL 312

Query: 1019 VDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPTE 840
            VDMYSKCG IEKALEVF++ LRKDISTWNS+I+GLS HGSG HAL++F++M+  GF P E
Sbjct: 313  VDMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSGQHALQIFSEMLVEGFKPNE 372

Query: 839  VTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELLN 660
            VTF+ +LSACSRAGL+ EG  MF+ MV V+GIQPTIEHYGC+VDLLGR GLL EA EL+ 
Sbjct: 373  VTFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGCMVDLLGRVGLLEEAEELVQ 432

Query: 659  KLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRWNV 480
            K+  KE+ +VW+SLL ACR HG+VELAE +A+KL EL P++++ +VQLSN +AS+GRW  
Sbjct: 433  KMPQKEASVVWESLLGACRNHGNVELAERVAQKLLELSPQESSSFVQLSNMYASMGRWKD 492

Query: 479  VMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVI 366
            VME+R +MR +G+ K+PGCS+IE+DGTV+EFLAGEG++
Sbjct: 493  VMEVRQKMRAQGVRKDPGCSMIEVDGTVYEFLAGEGLV 530


>ref|XP_007025291.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative
            isoform 1 [Theobroma cacao]
            gi|590623325|ref|XP_007025292.1| Pentatricopeptide repeat
            (PPR-like) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590623329|ref|XP_007025293.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|590623333|ref|XP_007025294.1| Pentatricopeptide repeat
            (PPR-like) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590623336|ref|XP_007025295.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|508780657|gb|EOY27913.1| Pentatricopeptide repeat
            (PPR-like) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508780658|gb|EOY27914.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|508780659|gb|EOY27915.1| Pentatricopeptide repeat
            (PPR-like) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508780660|gb|EOY27916.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|508780661|gb|EOY27917.1| Pentatricopeptide repeat
            (PPR-like) superfamily protein, putative isoform 1
            [Theobroma cacao]
          Length = 535

 Score =  650 bits (1678), Expect = 0.0
 Identities = 316/522 (60%), Positives = 410/522 (78%), Gaps = 2/522 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFH-HTFAASRLLT-AATTISHHSLSYVESIFNHVHRPN 1746
            TE A S+S +QQ HA++LKTGLF+ H  A+++L++ A       +LSY  S+F H   PN
Sbjct: 14   TEMANSVSEIQQAHAHLLKTGLFYNHPLASNKLISFAVNNPDPKTLSYAHSVFTHTTNPN 73

Query: 1745 SYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQI 1566
            SY YN++I AYA S TP  +  +F ++L     V+ DKY+FTF+LKACA     +EG+QI
Sbjct: 74   SYSYNSLIRAYANSHTPQNALSLFRQML--QGPVFPDKYSFTFVLKACAGFGGVQEGRQI 131

Query: 1565 HGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLV 1386
            HG+VL+ G+G D ++ NTLIH+Y K G F VAR+LLDR+ K DAVSWNALLS Y + G +
Sbjct: 132  HGLVLRMGIGFDVFVANTLIHVYGKGGYFGVARSLLDRMPKRDAVSWNALLSAYIETGYI 191

Query: 1385 ELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSG 1206
             LA  LF+EM  RNVESWNFM+SGY++ GLVEEAR VF  M VK++VSWNA+I+GYA + 
Sbjct: 192  RLASGLFEEMEERNVESWNFMISGYLSAGLVEEARSVFYRMPVKNVVSWNALITGYAHTS 251

Query: 1205 AFGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
             FGEVLVLFEDMQ  KV+PDNCTLVNVLSACA LGAL QG+W+H+YID+N I + G++AT
Sbjct: 252  CFGEVLVLFEDMQREKVKPDNCTLVNVLSACAHLGALGQGEWIHSYIDKNAIGINGYIAT 311

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG I+KAL VF N  RKDISTWNS+I GL +HG G HAL++F++M+  GF P
Sbjct: 312  ALVDMYSKCGNIDKALYVFRNASRKDISTWNSIIVGLGMHGLGEHALEIFSEMLVNGFEP 371

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
             EVTFI +LSACSRAGL++EG  +F  MV  YGIQPTIEH+GC+VDLLG+ GLL EA +L
Sbjct: 372  NEVTFIGLLSACSRAGLLNEGHHIFQIMVDDYGIQPTIEHFGCMVDLLGQVGLLEEALDL 431

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            + K  +KE+P++W+SLL+AC+ HG+VE+AE++ARKL EL P+D+AGYVQLSN +A+L RW
Sbjct: 432  VKKRPLKEAPVLWESLLSACKKHGNVEMAEHVARKLLELNPQDSAGYVQLSNTYAALQRW 491

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            + VM +R++M+   ++KEPGCS+IE+DG VHEFL+GEG+I+E
Sbjct: 492  DDVMNVRSKMKALKIKKEPGCSMIEVDGVVHEFLSGEGMILE 533


>ref|XP_012484752.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Gossypium raimondii] gi|763767696|gb|KJB34911.1|
            hypothetical protein B456_006G090000 [Gossypium
            raimondii]
          Length = 534

 Score =  623 bits (1606), Expect = e-175
 Identities = 301/522 (57%), Positives = 402/522 (77%), Gaps = 2/522 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLF-HHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPN 1746
            TE A+SIS++ Q HA++LKTG+F ++TF +++L++ A +     +LSY  S+F H+  PN
Sbjct: 14   TEMASSISQIHQAHAHLLKTGVFPNNTFVSNKLISFAVSNPDPITLSYAHSVFTHITDPN 73

Query: 1745 SYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQI 1566
            S+ YN++I AYA S TP+ +  +F ++L +   V  DKY+FTF LKACA  C  +EG QI
Sbjct: 74   SFSYNSLIRAYANSRTPENALFLFRQML-KGGPVLPDKYSFTFALKACAGFCGVEEGMQI 132

Query: 1565 HGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLV 1386
            HG+ LK G+G D ++ NTLIH+Y K G F  AR+LLDR+   D VSWNALLS Y + G +
Sbjct: 133  HGLALKLGIGFDIFVANTLIHVYGKSGHFGFARSLLDRMADRDVVSWNALLSAYIETGFI 192

Query: 1385 ELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSG 1206
             LAR LFDEM  RNVESWNFM+SGY++ GL+EEA+ VFD M +KD+VSWNA+I+GYA + 
Sbjct: 193  RLARGLFDEMDERNVESWNFMISGYLSSGLLEEAKSVFDSMPLKDVVSWNAIITGYAHAS 252

Query: 1205 AFGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
             F EVL LFEDMQ  +VRPD CTLVNVLSACA LGAL QG+W+H YID+NGI+  GF+AT
Sbjct: 253  RFDEVLELFEDMQREEVRPDTCTLVNVLSACAHLGALGQGEWIHGYIDKNGIDTNGFIAT 312

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDM+SKCG I+KA+ VF N  +KDISTWNS+I GL +HG G  AL+ F++M+  GF P
Sbjct: 313  ALVDMHSKCGNIDKAVNVFRNASKKDISTWNSIIVGLGMHGYGETALETFSEMLMEGFEP 372

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
             EVTFI++L+ACSR+  ++EG  MF  MV  YGI+P IEHYGC+VDLLG+ GLL EA EL
Sbjct: 373  NEVTFIAVLTACSRSRFLNEGCKMFKLMVDDYGIEPAIEHYGCMVDLLGQVGLLEEALEL 432

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +   ++KE+ ++W+SLL+AC+ HG+V++AEY+ARKL EL P+D++GYVQLSN +A+L RW
Sbjct: 433  VETRQLKEAHVLWESLLSACKNHGNVKMAEYVARKLLELNPQDSSGYVQLSNTYAALKRW 492

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            + V+ +R +M+   + KEPGCS+IE++G VHEFLAGEG+I+E
Sbjct: 493  DDVLNVRKKMKALKVNKEPGCSMIEVNGVVHEFLAGEGMILE 534


>ref|XP_010529216.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Tarenaya hassleriana] gi|729308341|ref|XP_010529217.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Tarenaya hassleriana]
            gi|729308344|ref|XP_010529218.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18840
            [Tarenaya hassleriana] gi|729308347|ref|XP_010529219.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Tarenaya hassleriana]
            gi|729308350|ref|XP_010529220.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18840
            [Tarenaya hassleriana] gi|729308353|ref|XP_010529221.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Tarenaya hassleriana]
            gi|729308356|ref|XP_010529222.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18840
            [Tarenaya hassleriana] gi|729308359|ref|XP_010529224.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Tarenaya hassleriana]
            gi|729308362|ref|XP_010529225.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18840
            [Tarenaya hassleriana] gi|729308365|ref|XP_010529226.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18840 [Tarenaya hassleriana]
          Length = 534

 Score =  612 bits (1578), Expect = e-172
 Identities = 297/521 (57%), Positives = 392/521 (75%), Gaps = 1/521 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S+S +QQ HAYMLKTG+F   F+AS+L+  +A      ++SY  SI   V  PNS
Sbjct: 14   TEKAESLSEIQQAHAYMLKTGIFRDAFSASKLIAFSAVNPDPVTVSYARSILRRVENPNS 73

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + YN++I AYA S TP+ +  VF ++L     V  DKY+FTF+LKACA     +EG+QIH
Sbjct: 74   FSYNSVIRAYANSSTPETALDVFREMLL--GPVLPDKYSFTFVLKACAGFEGYEEGRQIH 131

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ LK G G D +++NTL+++Y + G FE+A  +LD++ + DAVSWN+LLSVY   GLVE
Sbjct: 132  GLFLKTGTGPDVFVENTLVNVYGRSGHFELAHKVLDKMPERDAVSWNSLLSVYLDKGLVE 191

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             ARELFDEM  RN+ESWNFM+SGY+  GLV+EA ++FD M  KD+VSWN M++GYA +G 
Sbjct: 192  TARELFDEMEERNLESWNFMISGYMASGLVKEAAELFDAMPCKDVVSWNVMVTGYAHAGL 251

Query: 1202 FGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATA 1023
            + EVL LF ++   +  PD CTLVNVLSACA LGAL+QG+WVH +ID+ GI ++GFLATA
Sbjct: 252  YSEVLELFREILNSEDEPDGCTLVNVLSACANLGALNQGEWVHVHIDKQGIIIDGFLATA 311

Query: 1022 LVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPT 843
            LVDMYSKCG I+KALEVF   LRKD+STWNSMI+GLSIHG G  AL +F++M+  GF P 
Sbjct: 312  LVDMYSKCGKIDKALEVFRATLRKDVSTWNSMISGLSIHGFGKVALGIFSEMLLEGFEPN 371

Query: 842  EVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELL 663
             VTF+ +LSACS AGL+ EG  +F  M RVYGI+PT+EHYGC+VDL GR G + EA EL+
Sbjct: 372  NVTFVGVLSACSHAGLLDEGRELFGMMKRVYGIEPTVEHYGCMVDLFGRMGKVEEAEELV 431

Query: 662  NKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRWN 483
            +K+  + +P++ +SLL AC+  G +E+AE IA +L EL P++++GYVQ+SN +AS GRW+
Sbjct: 432  SKIPPESAPVLLESLLGACKRFGHMEMAERIAMRLVELNPEESSGYVQMSNLYASSGRWD 491

Query: 482  VVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
             VME+R +MR K L K+PGCS+IE+DG VHEFLAGEG+I +
Sbjct: 492  EVMEVRRKMRAKRLSKKPGCSMIEVDGIVHEFLAGEGLIAD 532


>emb|CBI30729.3| unnamed protein product [Vitis vinifera]
          Length = 506

 Score =  602 bits (1553), Expect = e-169
 Identities = 303/519 (58%), Positives = 387/519 (74%), Gaps = 2/519 (0%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISH-HSLSYVESIFNHVHRPNSY 1740
            E ATSIS L Q HA++LK+GL H TFAASRL+ + +T SH  ++ Y  SIF+ +  PNSY
Sbjct: 15   EMATSISELHQAHAHILKSGLIHSTFAASRLIASVSTNSHAQAIPYAHSIFSRIPNPNSY 74

Query: 1739 IYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIHG 1560
            ++NT+I AYA S TP+ +  +F ++L+  + V  DKYTFTF LK+C      +EG+QIHG
Sbjct: 75   MWNTIIRAYANSPTPEAALTIFHQMLH--ASVLPDKYTFTFALKSCGSFSGVEEGRQIHG 132

Query: 1559 IVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVEL 1380
             VLK GLGDD +I+NTLIH+YA C                               G +E 
Sbjct: 133  HVLKTGLGDDLFIQNTLIHLYASC-------------------------------GCIED 161

Query: 1379 ARELFDEMPVRNVESWNFMVSGYVNCGLVEEA-RDVFDVMLVKDIVSWNAMISGYAKSGA 1203
            AR L D M  R+V SWN ++S Y   GL+E A R VF    VK++VSWNAMI+GY+ +G 
Sbjct: 162  ARHLLDRMLERDVVSWNALLSAYAERGLMELASRRVFGETPVKNVVSWNAMITGYSHAGR 221

Query: 1202 FGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATA 1023
            F EVLVLFEDMQ   V+PDNCTLV+VLSACA +GALSQG+WVHAYID+NGI ++GF+ATA
Sbjct: 222  FSEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEWVHAYIDKNGISIDGFVATA 281

Query: 1022 LVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPT 843
            LVDMYSKCG IEKALEVF++ LRKDISTWNS+I+GLS HGSG HAL++F++M+  GF P 
Sbjct: 282  LVDMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSGQHALQIFSEMLVEGFKPN 341

Query: 842  EVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELL 663
            EVTF+ +LSACSRAGL+ EG  MF+ MV V+GIQPTIEHYGC+VDLLGR GLL EA EL+
Sbjct: 342  EVTFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGCMVDLLGRVGLLEEAEELV 401

Query: 662  NKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRWN 483
             K+  KE+ +VW+SLL ACR HG+VELAE +A+KL EL P++++ +VQLSN +AS+GRW 
Sbjct: 402  QKMPQKEASVVWESLLGACRNHGNVELAERVAQKLLELSPQESSSFVQLSNMYASMGRWK 461

Query: 482  VVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVI 366
             VME+R +MR +G+ K+PGCS+IE+DGTV+EFLAGEG++
Sbjct: 462  DVMEVRQKMRAQGVRKDPGCSMIEVDGTVYEFLAGEGLV 500



 Score = 73.6 bits (179), Expect = 6e-10
 Identities = 60/244 (24%), Positives = 115/244 (47%), Gaps = 4/244 (1%)
 Frame = -3

Query: 1127 VLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATAL---VDMYSKCGCIEKALEVFDNVL 957
            +LS      ++S+    HA+I ++G+    F A+ L   V   S    I  A  +F  + 
Sbjct: 10   ILSFAEMATSISELHQAHAHILKSGLIHSTFAASRLIASVSTNSHAQAIPYAHSIFSRIP 69

Query: 956  RKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPTEVTFISILSACSRAGLVSEGLM 777
              +   WN++I   +   +   AL +F+QM+     P + TF   L +C     V EG  
Sbjct: 70   NPNSYMWNTIIRAYANSPTPEAALTIFHQMLHASVLPDKYTFTFALKSCGSFSGVEEGRQ 129

Query: 776  MFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELLNKLKVKESPIVWQSLLAACRTH 597
            +  H+++  G+   +     L+ L    G + +A  LL+++ ++   + W +LL+A    
Sbjct: 130  IHGHVLKT-GLGDDLFIQNTLIHLYASCGCIEDARHLLDRM-LERDVVSWNALLSAYAER 187

Query: 596  GDVELAEYIARKLF-ELEPKDTAGYVQLSNFHASLGRWNVVMELRTRMREKGLEKEPGCS 420
            G +ELA   +R++F E   K+   +  +   ++  GR++ V+ L   M+  G+ K   C+
Sbjct: 188  GLMELA---SRRVFGETPVKNVVSWNAMITGYSHAGRFSEVLVLFEDMQHAGV-KPDNCT 243

Query: 419  LIEI 408
            L+ +
Sbjct: 244  LVSV 247


>ref|XP_002867972.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313808|gb|EFH44231.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 535

 Score =  602 bits (1551), Expect = e-169
 Identities = 293/522 (56%), Positives = 389/522 (74%), Gaps = 2/522 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S+  +QQ HA+MLKTGLFH TF+AS+L+  AAT     ++SY  SI N +  PN 
Sbjct: 16   TERAKSLLEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPEPKTVSYAHSILNRIESPNG 75

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N++I AYA S TP+++  VF ++L     V+ DKY+FTF+LKACA  C  +EG+QIH
Sbjct: 76   FTHNSVIRAYANSSTPEIALTVFREMLL--GPVFPDKYSFTFVLKACAAFCGFEEGRQIH 133

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ +K+ L  D +++NTLI++Y + G FE+AR +LDR+   DAVSWN+LLS Y   GLVE
Sbjct: 134  GLFMKSDLVTDVFVENTLINVYGRSGYFEIARKVLDRMPVRDAVSWNSLLSAYLDKGLVE 193

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR LFDEM  RNVESWNFM+SGY   GLV+EAR+VFD M VKD+VSWNAM++ YA  G 
Sbjct: 194  EARALFDEMEERNVESWNFMISGYAAAGLVKEAREVFDSMPVKDVVSWNAMVTAYAHVGC 253

Query: 1202 FGEVLVLFEDMQTVKV-RPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            + EVL +F  M      RPD  TLVNVLSACA LG+LSQG+WVH YID++GIE+EGF+AT
Sbjct: 254  YNEVLEVFNMMLDDSAERPDGFTLVNVLSACASLGSLSQGEWVHVYIDKHGIEIEGFVAT 313

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG I+KALEVF +  ++D+STWNS+I GLS+HG G  AL++F++M+  GF P
Sbjct: 314  ALVDMYSKCGKIDKALEVFRDTSKRDVSTWNSIITGLSVHGLGKDALEIFSEMVYEGFKP 373

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
              +TFI +LSAC+  GL+ +   +F+ M  VYGI+PTIEHYGC+VDLLGR G   EA EL
Sbjct: 374  NGITFIGVLSACNHVGLLDQARKLFEMMNSVYGIEPTIEHYGCMVDLLGRMGKFEEAEEL 433

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +N++   E+ I+ +SLL AC+  G +E AE IA +L E  P++++GYVQ+SN +AS GRW
Sbjct: 434  VNEVPADEASILLESLLGACKRFGKLEQAERIANRLLESNPRESSGYVQMSNLYASHGRW 493

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            +  ME+R +MR + ++K PGCS+IE+DG VHEFLAGEG+ +E
Sbjct: 494  DEAMEVRGKMRAERVKKNPGCSMIEVDGVVHEFLAGEGLRIE 535


>ref|XP_010436490.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like
            [Camelina sativa]
          Length = 878

 Score =  596 bits (1537), Expect = e-167
 Identities = 283/524 (54%), Positives = 395/524 (75%), Gaps = 4/524 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISH---HSLSYVESIFNHVHRP 1749
            TE A S+S ++Q HA+MLKTGLF  T++AS+L+  A T ++    ++SY  SI N +   
Sbjct: 357  TERAKSLSEIKQAHAFMLKTGLFQDTYSASKLIAFAATQTNPEPKTVSYAHSILNRIESA 416

Query: 1748 NSYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQ 1569
            N + +N++I AYA S TP+++ +VF  +L     V+ DKY+FTF+LKACA  C  ++GKQ
Sbjct: 417  NGFTHNSVIRAYANSSTPEMALVVFRDMLL--GPVFPDKYSFTFVLKACAAFCGFEQGKQ 474

Query: 1568 IHGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGL 1389
            IHG+ +K+GL  D +++NTL+++Y + G FE+AR +LD +   DAVSWN+LLS Y + GL
Sbjct: 475  IHGLFMKSGLMTDVFVENTLVNVYGRSGYFEIARKVLDEMPVRDAVSWNSLLSAYLEKGL 534

Query: 1388 VELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKS 1209
            VE AR LFDEM  RNVESWNFM+SGY   GLV+EA+++FD M VKD+VSWNAM++ YA  
Sbjct: 535  VEEARALFDEMEERNVESWNFMISGYAAAGLVKEAKEIFDSMPVKDVVSWNAMVTAYAHV 594

Query: 1208 GAFGEVLVLFEDMQTVKV-RPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFL 1032
            G + +VL +F +M  V   +PD  TLVNVLSACA LG+LSQG+WVH YID++GIE+EGFL
Sbjct: 595  GCYDDVLEVFNEMLDVSTEKPDGFTLVNVLSACASLGSLSQGEWVHVYIDKHGIEIEGFL 654

Query: 1031 ATALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGF 852
            ATALVDMYSKCG I+KALEVF    ++D+STWNS+I+GLS+HG G  AL++F++M+  GF
Sbjct: 655  ATALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISGLSVHGLGKDALEIFSEMVYEGF 714

Query: 851  TPTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEAT 672
             P  +TF+ +LSAC+  GL+ +   +F+ M  VYG++PT+EHYGC+VDLLGR G + EA 
Sbjct: 715  KPNGITFVGVLSACNHVGLLDQARNLFEMMNSVYGVEPTVEHYGCMVDLLGRMGRIEEAE 774

Query: 671  ELLNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLG 492
            EL+N++   E+ I+ +SLL +C+  G +E AE IA +L EL P++++GYVQ+SN +AS G
Sbjct: 775  ELVNEIPADEASILLESLLGSCKRFGRLEQAERIANRLQELNPQESSGYVQMSNLYASNG 834

Query: 491  RWNVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            RW+ VME+R +MR + + K+PGCS+IE+DG VHEFLAGEG+ ++
Sbjct: 835  RWDEVMEVRRKMRAERVNKKPGCSMIEVDGVVHEFLAGEGLRID 878


>ref|XP_010439805.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like
            [Camelina sativa]
          Length = 539

 Score =  595 bits (1533), Expect = e-167
 Identities = 282/524 (53%), Positives = 395/524 (75%), Gaps = 4/524 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISH---HSLSYVESIFNHVHRP 1749
            TE A S+S ++Q HA+MLKTGLF  T++AS+L+  A T ++   +++SY  SI N +  P
Sbjct: 18   TERAKSLSEIKQAHAFMLKTGLFQDTYSASKLIAFAATQTNPEPNTVSYAHSILNRIDSP 77

Query: 1748 NSYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQ 1569
            N + +N++I AYA S TP+++ +VF  +L     V+ DKY+FTF LKACA  C  ++G+Q
Sbjct: 78   NGFTHNSVIRAYANSSTPEMALVVFRDMLL--GPVFPDKYSFTFALKACAAFCGFEQGRQ 135

Query: 1568 IHGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGL 1389
            IHG+ +K+GL  D +++NTL+++YA+ G F++AR +LD +   DAVSWN+LLS Y   GL
Sbjct: 136  IHGLFMKSGLMTDVFVENTLVNVYARSGYFQIARKVLDEMPVRDAVSWNSLLSAYLAKGL 195

Query: 1388 VELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKS 1209
            VE AR LFDEM  RNVESWNFM+SGY   GLV+EA+++FD M  KD+VSWNAM++ YA  
Sbjct: 196  VEEARALFDEMEERNVESWNFMISGYAAAGLVKEAKEIFDSMPGKDVVSWNAMVTAYAHV 255

Query: 1208 GAFGEVLVLFEDM-QTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFL 1032
            G + EVL +F +M  +    PD  TLVNVLSACA LG+LSQG+WVH YID++GIE+EGFL
Sbjct: 256  GCYDEVLEVFNEMLDSSTEEPDGFTLVNVLSACASLGSLSQGEWVHVYIDKHGIEIEGFL 315

Query: 1031 ATALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGF 852
            ATALVDMYSKCG I+KALEVF    ++D+STWNS+I+GLS+HG G+ AL++F++M+  GF
Sbjct: 316  ATALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISGLSVHGLGNDALEIFSEMVYEGF 375

Query: 851  TPTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEAT 672
             P  +TF+ +LSAC+  GL+ +   +F+ +  VYG++PTIEHYGC+VDLLGR G + EA 
Sbjct: 376  KPNGITFVGVLSACNHVGLLDQARKLFEMINSVYGVEPTIEHYGCMVDLLGRMGKIEEAE 435

Query: 671  ELLNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLG 492
            EL+N++  +E+ ++ +SLL AC+  G +E AE IA +L EL P++++GYVQ+SN +AS G
Sbjct: 436  ELVNEIPAEEASVLLESLLGACKRFGRLEQAERIANRLLELNPRESSGYVQMSNLYASNG 495

Query: 491  RWNVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            RW+ VME+R +MR   + K+PGCS+IE+DG VHEFLAGEG+ ++
Sbjct: 496  RWDEVMEVRRKMRAVNVNKKPGCSMIEVDGVVHEFLAGEGLRID 539


>ref|XP_006283500.1| hypothetical protein CARUB_v10004552mg [Capsella rubella]
            gi|482552205|gb|EOA16398.1| hypothetical protein
            CARUB_v10004552mg [Capsella rubella]
          Length = 537

 Score =  595 bits (1533), Expect = e-167
 Identities = 285/522 (54%), Positives = 391/522 (74%), Gaps = 2/522 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S+S +QQ HA+MLKTGL   T++AS+L+  A T     ++SY  SI N +  PN 
Sbjct: 18   TERAKSLSEIQQAHAFMLKTGLSQDTYSASKLIAFAVTNPEPKTVSYAHSILNRIESPNG 77

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N++I AYA S TP+++ +VF  +L     V+ DKY+FTF+LKACA     +EG+QIH
Sbjct: 78   FTHNSVIRAYANSSTPEMALVVFRDMLL--GPVFPDKYSFTFVLKACAAFSGFEEGRQIH 135

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ +K+GL  D +++NTL+++Y + G FE+AR +LD++   DAVSWN+LLS Y + GLVE
Sbjct: 136  GLFMKSGLMTDVFVENTLVNVYGRSGYFEIARKVLDKMPVRDAVSWNSLLSAYLEKGLVE 195

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR LFDEM  RNVESWNFM+SGY   GLV+EA+++FD M VKD+VSWNAM++ YA  G 
Sbjct: 196  EARALFDEMEERNVESWNFMISGYAAAGLVKEAKEIFDSMPVKDVVSWNAMVTAYAHVGC 255

Query: 1202 FGEVLVLFEDMQTVKV-RPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            + EVL +F +M      +PD  TLVNVLSACA LG+LSQG+WVH YID++GIE+EGFLAT
Sbjct: 256  YNEVLEVFNEMLDDSTEKPDGFTLVNVLSACASLGSLSQGEWVHVYIDKHGIEIEGFLAT 315

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG I+KALEVF    ++D+STWNS+I+GLS+HG G  AL++F++M+  GF P
Sbjct: 316  ALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISGLSVHGLGKDALEIFSEMVYEGFKP 375

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
              +TFI +LSAC+  GL+ +   +F+ M  VYGI+PT+EHYGC+VDLLGR G + EA EL
Sbjct: 376  NGITFIGVLSACNHVGLLDQARRLFEMMNSVYGIEPTVEHYGCMVDLLGRMGKIEEAEEL 435

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +N++   E+ ++ +SLL +C+  G +E AE IA +L EL P +++GYVQ+SN +AS GRW
Sbjct: 436  VNEIPADEASMLLESLLGSCKRFGKLEQAERIANRLLELNPHESSGYVQMSNLYASNGRW 495

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            + VME+R +MR + + K+PGCS+IE+DG VHEFLAGEG+ V+
Sbjct: 496  DEVMEVRRKMRAERVNKKPGCSMIEVDGVVHEFLAGEGLRVD 537



 Score =  103 bits (257), Expect = 5e-19
 Identities = 85/357 (23%), Positives = 165/357 (46%), Gaps = 16/357 (4%)
 Frame = -3

Query: 1616 ILKACARVCRGKEGKQIHGIVLKNGLGDDEYIKNTLIHMYA---KCGCFEVARNLLDRLQ 1446
            IL    R     E +Q H  +LK GL  D Y  + LI       +      A ++L+R++
Sbjct: 14   ILSFTERAKSLSEIQQAHAFMLKTGLSQDTYSASKLIAFAVTNPEPKTVSYAHSILNRIE 73

Query: 1445 KHDAVSWNALLSVYTQMGLVELARELFDEMPVRNV----ESWNFMVSGYVNCGLVEEARD 1278
              +  + N+++  Y      E+A  +F +M +  V     S+ F++         EE R 
Sbjct: 74   SPNGFTHNSVIRAYANSSTPEMALVVFRDMLLGPVFPDKYSFTFVLKACAAFSGFEEGRQ 133

Query: 1277 VFDVM----LVKDIVSWNAMISGYAKSGAFGEVLVLFEDMQTVKVRPDNCTLVNVLSACA 1110
            +  +     L+ D+   N +++ Y +SG F E+     D   V+   D  +  ++LSA  
Sbjct: 134  IHGLFMKSGLMTDVFVENTLVNVYGRSGYF-EIARKVLDKMPVR---DAVSWNSLLSAYL 189

Query: 1109 GLGALSQGKWVHAYIDRNGIEVEGFLATALVDMYSKCGCIEKALEVFDNVLRKDISTWNS 930
              G + + + +   ++   +E   F    ++  Y+  G +++A E+FD++  KD+ +WN+
Sbjct: 190  EKGLVEEARALFDEMEERNVESWNF----MISGYAAAGLVKEAKEIFDSMPVKDVVSWNA 245

Query: 929  MIAGLSIHGSGHHALKMFNQMISGGF-TPTEVTFISILSACSRAGLVSEG----LMMFDH 765
            M+   +  G  +  L++FN+M+      P   T +++LSAC+  G +S+G    + +  H
Sbjct: 246  MVTAYAHVGCYNEVLEVFNEMLDDSTEKPDGFTLVNVLSACASLGSLSQGEWVHVYIDKH 305

Query: 764  MVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELLNKLKVKESPIVWQSLLAACRTHG 594
             + + G   T      LVD+  + G +++A E+  +   K     W S+++    HG
Sbjct: 306  GIEIEGFLAT-----ALVDMYSKCGKIDKALEVF-RATSKRDVSTWNSIISGLSVHG 356


>ref|XP_006414048.1| hypothetical protein EUTSA_v10024877mg [Eutrema salsugineum]
            gi|557115218|gb|ESQ55501.1| hypothetical protein
            EUTSA_v10024877mg [Eutrema salsugineum]
          Length = 535

 Score =  594 bits (1532), Expect = e-167
 Identities = 284/522 (54%), Positives = 387/522 (74%), Gaps = 2/522 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S+S +QQ HA+MLKTGLF  TF+AS+L+  A       ++SY  SI N +  PN+
Sbjct: 16   TERAKSLSEIQQAHAFMLKTGLFRDTFSASKLIAFAVVNPEPKTVSYAHSILNRIESPNA 75

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N++I AYA S  P+ +   F ++L     V+ DKY+FTF+LKACA  C  +EG+QIH
Sbjct: 76   FTHNSVIRAYANSSAPESALTAFREMLL--GPVFPDKYSFTFVLKACAAFCGFEEGRQIH 133

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ LK+ L  D +++NTL+++Y + G FE+AR +LD + + D VSWN+LLS Y + GLVE
Sbjct: 134  GLFLKSDLISDVFVENTLVNVYGRSGYFEIARKVLDTMPERDVVSWNSLLSAYVEKGLVE 193

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR +FDEM  RNVESWNFM+SGY   GLV EA+++FD M VKD+VSWNAM+S YA  G 
Sbjct: 194  EARGVFDEMDERNVESWNFMISGYAAAGLVNEAKELFDSMPVKDVVSWNAMVSAYAHVGC 253

Query: 1202 FGEVLVLFEDMQTVKV-RPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            + EVL +F +M      +PD  TLVNVLSACA LG+LSQG+WVH Y D++GIE++GFLAT
Sbjct: 254  YSEVLEVFNEMLNSSTEKPDGFTLVNVLSACANLGSLSQGEWVHVYTDKHGIEIDGFLAT 313

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG ++KALEVF    +KD+STWNSMI+GLS+HG G+ AL++F++M+  GF P
Sbjct: 314  ALVDMYSKCGKVDKALEVFRATSKKDVSTWNSMISGLSVHGLGNDALEIFSEMVHEGFKP 373

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
              +TFI+ LSAC+  G++ +   +F+ M  VYG++PTIEHYGC+VDLLGR G   EA EL
Sbjct: 374  NSITFIATLSACNHVGMLDQARRLFETMNSVYGVEPTIEHYGCMVDLLGRMGKFEEAEEL 433

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +N+    E+ ++ +SLL AC+  G +E AE IA +L EL P +T+GYVQ+SN +AS GRW
Sbjct: 434  VNETPADEASVLLESLLGACKRFGRMEQAESIANRLLELNPGETSGYVQMSNLYASNGRW 493

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            + VME+R +MR + ++K+PGCS+IE+DG VHEFLAGEG+I++
Sbjct: 494  DQVMEVRRKMRAERVKKKPGCSMIEVDGVVHEFLAGEGLIID 535


>ref|XP_009132265.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Brassica rapa]
          Length = 535

 Score =  588 bits (1515), Expect = e-165
 Identities = 286/520 (55%), Positives = 386/520 (74%), Gaps = 2/520 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S+S +QQ HA+MLKTGL   TF+AS+L++ A       ++SY  SI N +  PN+
Sbjct: 15   TEKAKSLSEIQQAHAFMLKTGLSRDTFSASKLISFAVANPEPKTVSYAHSILNRIDTPNA 74

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N++I AYA S TP+++   F ++L     V  DKY+FTF LKACA     +EG+Q+H
Sbjct: 75   FTHNSLIRAYANSPTPEMALTAFREMLL-GGPVAPDKYSFTFALKACAAFRGVEEGRQLH 133

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ LK+GL  D +++NTL+++YA+ G FEVAR +LD + + D VSWN+LLS + + GLVE
Sbjct: 134  GLFLKSGLDSDVFVENTLVNVYARSGWFEVARKVLDEMPERDVVSWNSLLSAFVEKGLVE 193

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR LFDEM  RNVESWNFMVS Y   GLVEEAR VFD M VKD+VSWNAM+SGYA +G 
Sbjct: 194  EARGLFDEMEERNVESWNFMVSCYAAAGLVEEARGVFDEMPVKDLVSWNAMVSGYASAGC 253

Query: 1202 FGEVLVLFEDM-QTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            +GE L +F +M ++    PD  TLV+VLSACA LG+LSQG+WV  YID++G+E++GFLAT
Sbjct: 254  YGEALEVFNEMLKSCAEEPDGFTLVSVLSACANLGSLSQGEWVRVYIDKHGVEIDGFLAT 313

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG I+KA+EVF    +KD+STWNSMI GLS+HG G+ AL++F++M+  GF P
Sbjct: 314  ALVDMYSKCGRIDKAIEVFRGASKKDVSTWNSMITGLSVHGLGNDALEIFSEMVYEGFKP 373

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
              +TFI++LSAC+  GL+ +   +F+ M  VYG++P+IEHYGC+VDLLGR G   EA EL
Sbjct: 374  NGITFIAVLSACNHVGLLDQARKLFETMSSVYGVEPSIEHYGCMVDLLGRLGRFEEAEEL 433

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +NK+   E+ ++ +SLL AC+  G  E AE +A +L EL P +T+GYVQ+SN +AS GRW
Sbjct: 434  VNKVPPDEASVLLESLLGACKRFGRTEQAESLANRLLELNPGETSGYVQMSNLYASDGRW 493

Query: 485  NVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVI 366
            + V E+R +MR + + K+PGCS+IE+DG VHEFLAGEG+I
Sbjct: 494  DEVTEVRRKMRAERVNKKPGCSMIEVDGVVHEFLAGEGLI 533


>ref|XP_010943932.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Elaeis guineensis]
          Length = 528

 Score =  568 bits (1463), Expect = e-159
 Identities = 277/516 (53%), Positives = 372/516 (72%), Gaps = 1/516 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATTISHH-SLSYVESIFNHVHRPNS 1743
            T+ AT++S + Q HA M+KTGL H   AA+RLL++A       +L+Y +S+F  +  P S
Sbjct: 15   TDMATNLSEVHQSHANMIKTGLIHSPAAAARLLSSAIAADEPPALAYADSLFARLPAPTS 74

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N+MI A+A +  P  +  +F ++L+  S    D +TF FILKACA +    E  QIH
Sbjct: 75   FAWNSMIRAHARAPDPGPALQLFYRMLH--SPTRPDNFTFPFILKACAALPALSETLQIH 132

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
              ++K G G D ++ NTL+H YA  G  E A  L  R+ + D +SWNAL++     GL++
Sbjct: 133  ARIIKTGFGSDIFVLNTLLHTYAINGLTEEAFKLFGRMPQKDLISWNALINALVAHGLID 192

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR LFDEMP RNVE+WNFM+SGY++ GLV+++R++F++M V+DIVSWNAMI+G A +G 
Sbjct: 193  PARNLFDEMPERNVETWNFMISGYLDLGLVDQSRELFNLMPVRDIVSWNAMITGCAHAGR 252

Query: 1202 FGEVLVLFEDMQTVKVRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLATA 1023
            F EV+ LF++MQ   V PD CTLVNVLSACA +GAL QG+W+ AY+D+NGIE++GFLATA
Sbjct: 253  FDEVISLFQEMQYDNVWPDECTLVNVLSACARVGALGQGEWIRAYVDKNGIEIKGFLATA 312

Query: 1022 LVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTPT 843
             VDMYSKCG IEKAL+VF N  +KD+STWN+MI GLS HG G HAL++F  M   G  P 
Sbjct: 313  FVDMYSKCGSIEKALQVFSNASKKDVSTWNAMIDGLSSHGFGEHALRLFEDMPRNGLVPN 372

Query: 842  EVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATELL 663
             VTF+++LSACS  GL++EG  +FD M  VYGI+P IEHYGC+VDLLGR GLL EA ELL
Sbjct: 373  GVTFVNVLSACSHGGLLNEGCRIFDDMACVYGIEPDIEHYGCMVDLLGRAGLLVEAKELL 432

Query: 662  NKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRWN 483
             +  VK++P++W+SLL+ACR HGDVELAE  A++L EL P D++ Y+QLS  +A LGRW 
Sbjct: 433  KRAPVKDAPVLWRSLLSACREHGDVELAEIAAKQLLELCPFDSSCYIQLSKIYALLGRWE 492

Query: 482  VVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGE 375
                LR  M+ +G++KEPGCS IE+DG +HEF  G+
Sbjct: 493  DARMLREMMKVQGVKKEPGCSTIEVDGAIHEFFVGD 528


>ref|XP_010451379.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840,
            partial [Camelina sativa]
          Length = 490

 Score =  561 bits (1447), Expect = e-157
 Identities = 262/479 (54%), Positives = 365/479 (76%), Gaps = 1/479 (0%)
 Frame = -3

Query: 1793 SLSYVESIFNHVHRPNSYIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFI 1614
            ++SY  SI N +  PN + +N++I AYA S TP+++ +VF  +L     V+ DKY+FTF+
Sbjct: 14   TVSYAHSILNRIESPNGFTHNSVIRAYANSSTPEMALVVFRDMLL--GPVFPDKYSFTFV 71

Query: 1613 LKACARVCRGKEGKQIHGIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDA 1434
            LKACA  C  ++G+QIHG+ +K+ L  D +++NTL+++YA+ G FE+AR +LD +   DA
Sbjct: 72   LKACAAFCGFEQGRQIHGLFMKSDLMTDVFVENTLVNVYARSGYFEIARKVLDEMPVRDA 131

Query: 1433 VSWNALLSVYTQMGLVELARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVK 1254
            VSWN+LLS Y + GLVE AR LFDEM  RNVESWNFM+SGY   GLV+EA+++FD M VK
Sbjct: 132  VSWNSLLSAYLEKGLVEEARALFDEMEERNVESWNFMISGYAAAGLVKEAKEIFDSMPVK 191

Query: 1253 DIVSWNAMISGYAKSGAFGEVLVLFEDMQTVKVR-PDNCTLVNVLSACAGLGALSQGKWV 1077
            D+VSWNAM++ YA  G + +VL +F +M  V    PD  TLVNVLSACA LG+LSQG+WV
Sbjct: 192  DVVSWNAMVTAYAHVGCYDDVLEVFNEMLDVSTEEPDGVTLVNVLSACASLGSLSQGEWV 251

Query: 1076 HAYIDRNGIEVEGFLATALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSG 897
            H YID++GIE+EGFLATALVDMYSKCG I+KALEVF    ++D+STWNS+I+GLS+HG G
Sbjct: 252  HVYIDKHGIEIEGFLATALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISGLSVHGLG 311

Query: 896  HHALKMFNQMISGGFTPTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGC 717
              AL++F++M+  GF P  +TF+ +LSAC+  GL+ +   +F+ M  VYG++P+IEHYGC
Sbjct: 312  KDALEIFSEMVYEGFKPNGITFVGVLSACNHVGLLDQARNLFEMMNSVYGVEPSIEHYGC 371

Query: 716  LVDLLGRRGLLNEATELLNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKD 537
            +VDLLGR G + EA EL+N+++  E+ ++ +SLL AC+  G +E AE IA +L EL P++
Sbjct: 372  MVDLLGRMGKIEEAEELVNEIQADEASVLLESLLGACKRFGRLEQAERIANRLQELNPRE 431

Query: 536  TAGYVQLSNFHASLGRWNVVMELRTRMREKGLEKEPGCSLIEIDGTVHEFLAGEGVIVE 360
            ++GYVQ+SN +AS GRW+ VME+R +MR + + K+PGCS+IE+DG VHEFLAGEG+ ++
Sbjct: 432  SSGYVQMSNLYASNGRWDEVMEVRRKMRAENVNKKPGCSMIEVDGVVHEFLAGEGLRID 490


>ref|NP_193619.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75098703|sp|O49399.2|PP321_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18840 gi|5738365|emb|CAA16741.2| putative protein
            [Arabidopsis thaliana] gi|7268678|emb|CAB78886.1|
            putative protein [Arabidopsis thaliana]
            gi|332658697|gb|AEE84097.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 545

 Score =  546 bits (1407), Expect = e-152
 Identities = 265/497 (53%), Positives = 365/497 (73%), Gaps = 2/497 (0%)
 Frame = -3

Query: 1919 TEAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNS 1743
            TE A S++ +QQ HA+MLKTGLFH TF+AS+L+  AAT     ++SY  SI N +  PN 
Sbjct: 46   TERAKSLTEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPEPKTVSYAHSILNRIGSPNG 105

Query: 1742 YIYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIH 1563
            + +N++I AYA S TP+++  VF ++L     V+ DKY+FTF+LKACA  C  +EG+QIH
Sbjct: 106  FTHNSVIRAYANSSTPEVALTVFREMLL--GPVFPDKYSFTFVLKACAAFCGFEEGRQIH 163

Query: 1562 GIVLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
            G+ +K+GL  D +++NTL+++Y + G FE+AR +LDR+   DAVSWN+LLS Y + GLV+
Sbjct: 164  GLFIKSGLVTDVFVENTLVNVYGRSGYFEIARKVLDRMPVRDAVSWNSLLSAYLEKGLVD 223

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             AR LFDEM  RNVESWNFM+SGY   GLV+EA++VFD M V+D+VSWNAM++ YA  G 
Sbjct: 224  EARALFDEMEERNVESWNFMISGYAAAGLVKEAKEVFDSMPVRDVVSWNAMVTAYAHVGC 283

Query: 1202 FGEVLVLFEDMQTVKV-RPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            + EVL +F  M      +PD  TLV+VLSACA LG+LSQG+WVH YID++GIE+EGFLAT
Sbjct: 284  YNEVLEVFNKMLDDSTEKPDGFTLVSVLSACASLGSLSQGEWVHVYIDKHGIEIEGFLAT 343

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGFTP 846
            ALVDMYSKCG I+KALEVF    ++D+STWNS+I+ LS+HG G  AL++F++M+  GF P
Sbjct: 344  ALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISDLSVHGLGKDALEIFSEMVYEGFKP 403

Query: 845  TEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATEL 666
              +TFI +LSAC+  G++ +   +F+ M  VY ++PTIEHYGC+VDLLGR G + EA EL
Sbjct: 404  NGITFIGVLSACNHVGMLDQARKLFEMMSSVYRVEPTIEHYGCMVDLLGRMGKIEEAEEL 463

Query: 665  LNKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTAGYVQLSNFHASLGRW 486
            +N++   E+ I+ +SLL AC+  G +E AE IA +L EL  +D++GY Q+SN +AS GRW
Sbjct: 464  VNEIPADEASILLESLLGACKRFGQLEQAERIANRLLELNLRDSSGYAQMSNLYASDGRW 523

Query: 485  NVVMELRTRMREKGLEK 435
              V++ R  MR + + +
Sbjct: 524  EKVIDGRRNMRAERVNR 540


>ref|XP_008813633.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Phoenix dactylifera]
          Length = 426

 Score =  507 bits (1306), Expect = e-140
 Identities = 241/427 (56%), Positives = 319/427 (74%)
 Frame = -3

Query: 1655 NSQVYADKYTFTFILKACARVCRGKEGKQIHGIVLKNGLGDDEYIKNTLIHMYAKCGCFE 1476
            +S    DKYTF F+LKACA +    E  QIH  +LK G G D ++ NTL+  YA  G   
Sbjct: 3    HSPTRPDKYTFPFVLKACAAL---PEAFQIHARILKTGFGSDIFVLNTLLRTYAINGLTA 59

Query: 1475 VARNLLDRLQKHDAVSWNALLSVYTQMGLVELARELFDEMPVRNVESWNFMVSGYVNCGL 1296
             A  L  R+ + D +SWNA+++ +   GLV  AR+LFD+M  RNVE+WNFM+SGY N GL
Sbjct: 60   EALKLFARMPQKDVISWNAMINAFVTHGLVGQARKLFDKMSERNVETWNFMISGYSNLGL 119

Query: 1295 VEEARDVFDVMLVKDIVSWNAMISGYAKSGAFGEVLVLFEDMQTVKVRPDNCTLVNVLSA 1116
            V+++R++F++M V+DIVSWNAMI+G A++G F EV+ LF++MQ   V PD CT VNVLSA
Sbjct: 120  VDQSRELFNLMPVRDIVSWNAMITGCARAGRFDEVISLFQEMQYGNVWPDECTFVNVLSA 179

Query: 1115 CAGLGALSQGKWVHAYIDRNGIEVEGFLATALVDMYSKCGCIEKALEVFDNVLRKDISTW 936
            CA +G+L QG+W+ AY+D+NGIE++GFLATALVDMYSKCG IEKAL+VF+N  +KD+STW
Sbjct: 180  CARVGSLGQGEWIRAYVDKNGIEIKGFLATALVDMYSKCGSIEKALQVFNNTSKKDVSTW 239

Query: 935  NSMIAGLSIHGSGHHALKMFNQMISGGFTPTEVTFISILSACSRAGLVSEGLMMFDHMVR 756
            N+MI GLS HG G HAL++F +M   G  P  VTF+++LSACS  GL++EG  +F+ M R
Sbjct: 240  NAMIDGLSSHGLGEHALRLFEEMPRNGLVPNGVTFVNVLSACSHEGLLNEGCRIFNDMAR 299

Query: 755  VYGIQPTIEHYGCLVDLLGRRGLLNEATELLNKLKVKESPIVWQSLLAACRTHGDVELAE 576
            VYGI+P IEHYGC+VDLLGR GLL  A ELL +  VK++P++W+SLL+ACR HGD+ELAE
Sbjct: 300  VYGIEPEIEHYGCMVDLLGRAGLLVAAEELLRRAPVKDAPVLWRSLLSACREHGDLELAE 359

Query: 575  YIARKLFELEPKDTAGYVQLSNFHASLGRWNVVMELRTRMREKGLEKEPGCSLIEIDGTV 396
              A++L EL P D++ Y+QLSN +A LGRW     LR  M+ +G++KEPGCS IE+DG +
Sbjct: 360  IAAKQLLELSPFDSSCYIQLSNVYALLGRWEDARMLREMMKVQGVKKEPGCSTIEVDGAI 419

Query: 395  HEFLAGE 375
            HEF  G+
Sbjct: 420  HEFFVGD 426



 Score = 79.3 bits (194), Expect = 1e-11
 Identities = 82/344 (23%), Positives = 137/344 (39%), Gaps = 70/344 (20%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLTAATT----------------------- 1806
            +A  ++    Q+HA +LKTG     F  + LL                            
Sbjct: 18   KACAALPEAFQIHARILKTGFGSDIFVLNTLLRTYAINGLTAEALKLFARMPQKDVISWN 77

Query: 1805 ------ISHHSLSYVESIFNHVHRPNSYIYNTMIHAYATSETPDLS-------------- 1686
                  ++H  +     +F+ +   N   +N MI  Y+     D S              
Sbjct: 78   AMINAFVTHGLVGQARKLFDKMSERNVETWNFMISGYSNLGLVDQSRELFNLMPVRDIVS 137

Query: 1685 -----------------FIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIHGI 1557
                               +F ++ Y N  V+ D+ TF  +L ACARV    +G+ I   
Sbjct: 138  WNAMITGCARAGRFDEVISLFQEMQYGN--VWPDECTFVNVLSACARVGSLGQGEWIRAY 195

Query: 1556 VLKNGLGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVELA 1377
            V KNG+    ++   L+ MY+KCG  E A  + +   K D  +WNA++   +  GL E A
Sbjct: 196  VDKNGIEIKGFLATALVDMYSKCGSIEKALQVFNNTSKKDVSTWNAMIDGLSSHGLGEHA 255

Query: 1376 RELFDEMP----VRNVESWNFMVSGYVNCGLVEEARDVFDVM-----LVKDIVSWNAMIS 1224
              LF+EMP    V N  ++  ++S   + GL+ E   +F+ M     +  +I  +  M+ 
Sbjct: 256  LRLFEEMPRNGLVPNGVTFVNVLSACSHEGLLNEGCRIFNDMARVYGIEPEIEHYGCMVD 315

Query: 1223 GYAKSGAFGEVLVLFED-MQTVKVRPDNCTLVNVLSACAGLGAL 1095
               ++G    +LV  E+ ++   V+       ++LSAC   G L
Sbjct: 316  LLGRAG----LLVAAEELLRRAPVKDAPVLWRSLLSACREHGDL 355


>ref|XP_010665683.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840
            [Beta vulgaris subsp. vulgaris]
            gi|870843485|gb|KMS96647.1| hypothetical protein
            BVRB_8g201100 [Beta vulgaris subsp. vulgaris]
          Length = 481

 Score =  503 bits (1295), Expect = e-139
 Identities = 255/467 (54%), Positives = 341/467 (73%), Gaps = 5/467 (1%)
 Frame = -3

Query: 1916 EAATSISRLQQVHAYMLKTGLFHHTFAASRLLT-AATTISHHSLSYVESIFNHVHRPNSY 1740
            E  +SIS LQQ H  ++K GL + ++ ASRL+  A T  +  ++SY  SIF+H+  PNS+
Sbjct: 16   EQTSSISELQQAHGQLIKNGLINDSYTASRLIAFACTNPNLQTISYAHSIFSHLQNPNSF 75

Query: 1739 IYNTMIHAYATSETPDLSFIVFLKLLYENSQVYADKYTFTFILKACARVCRGKEGKQIHG 1560
             +N+M+ AYA S  P  + +VF ++L   + V  DKYT+ F++KAC+      EG+Q+H 
Sbjct: 76   TWNSMMRAYANSSNPQNALLVFTQML--ETSVVPDKYTYPFVIKACSAFGGLNEGQQVHA 133

Query: 1559 IVLKNG-LGDDEYIKNTLIHMYAKCGCFEVARNLLDRLQKHDAVSWNALLSVYTQMGLVE 1383
             V K   + DD+Y++NTLI MYA CG FE ARNLLD++ + D +SWNA+L+ YT+ GL++
Sbjct: 134  QVTKRREMVDDKYVQNTLISMYANCGYFESARNLLDKMPQRDVISWNAMLAAYTERGLMD 193

Query: 1382 LARELFDEMPVRNVESWNFMVSGYVNCGLVEEARDVFDVMLVKDIVSWNAMISGYAKSGA 1203
             A+ LF EM  RNVESWNFMVSGY   GLVEEAR +FD + VKD+VSWNA+ISGYA  G 
Sbjct: 194  AAQVLFCEMEERNVESWNFMVSGYARLGLVEEARLMFDDIPVKDVVSWNAIISGYADVGG 253

Query: 1202 FGEVLVLFEDMQTVK-VRPDNCTLVNVLSACAGLGALSQGKWVHAYIDRNGIEVEGFLAT 1026
            F EVL+LF++M     ++PD  TLV VLSAC+ LGALS+G+W+H YID+NGI +EGFLAT
Sbjct: 254  FNEVLLLFQNMVGENIIKPDKYTLVYVLSACSNLGALSRGEWIHLYIDKNGIGIEGFLAT 313

Query: 1025 ALVDMYSKCGCIEKALEVFDNVLRKDISTWNSMIAGLSIHGSGHHALKMFNQMISGGF-T 849
            ALVDMYSKCG   KALEVF    +KDI+TWNSMI GLSI+G G  A+ +F++M++  + T
Sbjct: 314  ALVDMYSKCGETRKALEVFGTTAQKDITTWNSMIGGLSINGLGQEAVGIFHKMLNDDYAT 373

Query: 848  PTEVTFISILSACSRAGLVSEGLMMFDHMVRVYGIQPTIEHYGCLVDLLGRRGLLNEATE 669
            P E+TF++ILSACS AGL+ +GL +F+ M+R Y IQPT+EH GC++DLLGR GLL EA  
Sbjct: 374  PNEITFMNILSACSHAGLLIDGLKIFNIMMRKYDIQPTVEHCGCIIDLLGRVGLLEEAKA 433

Query: 668  LL-NKLKVKESPIVWQSLLAACRTHGDVELAEYIARKLFELEPKDTA 531
            LL  +   KESP++WQSLL +C  +G +ELA+  A+KL EL P+D A
Sbjct: 434  LLKGEASAKESPVLWQSLLFSCINYGSLELAQDFAKKLLELNPQDNA 480


Top