BLASTX nr result
ID: Lithospermum23_contig00022707
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00022707 (1354 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

CDO96870.1 unnamed protein product [Coffea canephora]                526   0.0
KVH89454.1 Pentatricopeptide repeat-containing protein [Cynara c...  517   e-178
XP_009764895.1 PREDICTED: pentatricopeptide repeat-containing pr...  513   e-176
XP_015063884.1 PREDICTED: pentatricopeptide repeat-containing pr...  511   e-175
XP_016558299.1 PREDICTED: pentatricopeptide repeat-containing pr...  510   e-175
XP_006358091.1 PREDICTED: pentatricopeptide repeat-containing pr...  509   e-175
XP_016478659.1 PREDICTED: pentatricopeptide repeat-containing pr...  509   e-175
XP_009627120.1 PREDICTED: pentatricopeptide repeat-containing pr...  507   e-173
XP_019264683.1 PREDICTED: pentatricopeptide repeat-containing pr...  506   e-173
XP_016468628.1 PREDICTED: pentatricopeptide repeat-containing pr...  508   e-173
XP_002265079.1 PREDICTED: pentatricopeptide repeat-containing pr...  496   e-169
XP_017615748.1 PREDICTED: pentatricopeptide repeat-containing pr...  485   e-165
XP_016672141.1 PREDICTED: pentatricopeptide repeat-containing pr...  483   e-164
XP_012484752.1 PREDICTED: pentatricopeptide repeat-containing pr...  482   e-164
OMO65465.1 hypothetical protein COLO4_31216 [Corchorus olitorius]    469   e-159
XP_017979424.1 PREDICTED: pentatricopeptide repeat-containing pr...  458   e-154
EOY27913.1 Pentatricopeptide repeat (PPR-like) superfamily prote...  458   e-154
XP_010439805.2 PREDICTED: pentatricopeptide repeat-containing pr...  454   e-153
JAU51767.1 Pentatricopeptide repeat-containing protein, partial ...
452 e-152 XP_006414048.1 hypothetical protein EUTSA_v10024877mg [Eutrema s... 451 e-152 >CDO96870.1 unnamed protein product [Coffea canephora] Length = 516 Score = 526 bits (1356), Expect = 0.0 Identities = 256/401 (63%), Positives = 318/401 (79%), Gaps = 2/401 (0%) Frame = +2 Query: 158 TSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTIS-PSLSYPHSIFNNIHQPNSYIYNT 334 TS++++HQ HA MLKTG F P+ +SRL+TAA + S SLSY H+IF QPN+Y+YNT Sbjct: 3 TSISELHQAHAYMLKTGLFQQPFAASRLMTAAASSSIDSLSYAHTIFTQTPQPNTYMYNT 62 Query: 335 IIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFILKACASLCSIEYGQQIHGVLVK 511 +IR Y+TSPTP + +F+ +L D ++PDK+T+TF+LKACASLC +++G+QIHG ++K Sbjct: 63 LIRGYATSPTPNVALFLFLKLLCDDQDLLPDKYTYTFVLKACASLCRVKHGKQIHGCVIK 122 Query: 512 GGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELAREF 691 G DVY+ NTL+H+YAK G F AR +LD+M DV++WNAVLS YVEMGL++LA +F Sbjct: 123 NGLSWDVYICNTLLHMYAKCGCFEAARHMLDRMPNRDVVSWNAVLSVYVEMGLVDLAFDF 182 Query: 692 LNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSLITGYAKVGDYCEVL 871 +EMPVKN ESWNFM+SGY N GL+DEAR +FDEM KDVVSWN+LITGYA G Y EVL Sbjct: 183 FSEMPVKNLESWNFMLSGYANSGLLDEARRVFDEMSVKDVVSWNALITGYANSGRYNEVL 242 Query: 872 RLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKGIEVNGFLATALVDM 1051 LF+DMQR V PDN TLV +LSACAG+GAL QG WV AY+D+ GIE NGFLATALVDM Sbjct: 243 ELFDDMQRA-RVKPDNHTLVTLLSACAGIGALEQGKWVHAYMDRNGIEANGFLATALVDM 301 Query: 1052 YSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFNRMVADGYKPNDVTF 1231 YSKCGCIEKAVEVF+ SRKDVSTWN+MI+G SVHG+GE AL+ F+ MV +G+KPNDVTF Sbjct: 302 YSKCGCIEKAVEVFDSASRKDVSTWNAMITGFSVHGFGEQALKVFSEMVENGFKPNDVTF 361 Query: 1232 VSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 VS+L+ACSR L E E+F NM YGI+P +EHYGC+VD Sbjct: 362 VSLLSACSRAGLLFESHEIFDNMFSIYGIKPKIEHYGCLVD 402 Score = 89.0 bits (219), Expect = 2e-15 Identities = 69/257 (26%), Positives = 112/257 (43%), Gaps = 45/257 (17%) Frame = +2 Query: 539 YNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELAREFLNEMP---- 706 +N ++ YA G AR + D+MS DV++WNA+++ Y G E ++M Sbjct: 194 
WNFMLSGYANSGLLDEARRVFDEMSVKDVVSWNALITGYANSGRYNEVLELFDDMQRARV 253 Query: 707 ------------------VKNQESW-----------------NFMVSGYVNVGLVDEARS 781 Q W +V Y G +++A Sbjct: 254 KPDNHTLVTLLSACAGIGALEQGKWVHAYMDRNGIEANGFLATALVDMYSKCGCIEKAVE 313 Query: 782 IFDEMLEKDVVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVG 961 +FD KDV +WN++ITG++ G + L++F +M G P++ T V++LSAC+ G Sbjct: 314 VFDSASRKDVSTWNAMITGFSVHGFGEQALKVFSEMVENG-FKPNDVTFVSLLSACSRAG 372 Query: 962 ALGQGDWV-----QAYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDV-ST 1123 L + + Y K IE G LVD+ + G +++A E+ + +KDV Sbjct: 373 LLFESHEIFDNMFSIYGIKPKIEHYG----CLVDLLGRFGLLKEAEELVEKMPQKDVLII 428 Query: 1124 WNSMISGLSVHGYGESA 1174 W S++S HG E A Sbjct: 429 WESLLSACRNHGNVELA 445 >KVH89454.1 Pentatricopeptide repeat-containing protein [Cynara cardunculus var. scolymus] Length = 536 Score = 517 bits (1332), Expect = e-178 Identities = 257/422 (60%), Positives = 328/422 (77%), Gaps = 9/422 (2%) Frame = +2 Query: 116 MSSNIFPPP-SLLSLTSM----TQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISP---- 268 M++ IFPPP ++LS T M +++ Q+HA MLKTG H PY ++RL+++A ++SP Sbjct: 1 MAAIIFPPPPAILSFTEMATSISELRQSHAHMLKTGLIHDPYSAARLISSAASMSPTSSH 60 Query: 269 SLSYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFIL 448 SL YPHSIF I PNSY YNT+IRAY+ S TP SF +F +MLFD V+PDK+TFTF+L Sbjct: 61 SLLYPHSIFTYIQNPNSYSYNTLIRAYANSSTPESSFTLFRNMLFDDGVLPDKYTFTFVL 120 Query: 449 KACASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVI 628 KAC+ L S+ G+Q+HG +K G DVY+ NTL+H+YAKGG F AR LLD+MSE DVI Sbjct: 121 KACSVLNSVSVGKQVHGHAIKFGIERDVYICNTLIHMYAKGGFFEIARNLLDRMSERDVI 180 Query: 629 AWNAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKD 808 +WNA+LSAYV+MG+M LA+ +EMP +N ESWNFM+SG+V GL+ EAR IFD+M KD Sbjct: 181 SWNAILSAYVDMGMMGLAQGLFDEMPERNAESWNFMISGFVKDGLIIEARRIFDDMPVKD 240 Query: 809 VVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQ 988 VVSWN +ITGYA G + EV LFE+MQ G ++PD+ TLVNVLS+CA V AL QG+W+ Sbjct: 241 VVSWNVIITGYAHEGRFEEVFMLFEEMQNAG-MMPDDYTLVNVLSSCARVSALSQGEWIH 299 Query: 989 
AYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGE 1168 AYIDK IEV GFLATALVDMY+KCGC+EKA+EVF +TS+KD+STWNSMISGLS+HG GE Sbjct: 300 AYIDKNRIEVCGFLATALVDMYAKCGCLEKALEVFLKTSKKDISTWNSMISGLSLHGSGE 359 Query: 1169 SALETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCM 1348 SA++ F ++A+G+KPN+VTFVSVL+ACSR L+EG +MF M +GI+P +EHYGCM Sbjct: 360 SAIKLFYELLAEGFKPNEVTFVSVLSACSRSGLLDEGHKMFELMVHGHGIKPKIEHYGCM 419 Query: 1349 VD 1354 VD Sbjct: 420 VD 421 >XP_009764895.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Nicotiana sylvestris] XP_009764896.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Nicotiana sylvestris] Length = 550 Score = 513 bits (1322), Expect = e-176 Identities = 262/434 (60%), Positives = 322/434 (74%), Gaps = 16/434 (3%) Frame = +2 Query: 101 ILP*TMSSNIFPPPSLLS----------LTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTA 250 ++P T N P + LS S++++HQ+HA +LKTG F +P+ +SRLLT Sbjct: 3 VVPVTKIGNTMSPAATLSPEPIFSFIEMANSISELHQSHAFLLKTGLFRNPFAASRLLTK 62 Query: 251 ATTISPS-----LSYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY- 412 ATT+ S LSYP SIF +I +PNSY YNTIIRAYSTS P S IF+ +L + Sbjct: 63 ATTLPSSSSADTLSYPLSIFTHIEEPNSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHK 122 Query: 413 VVPDKFTFTFILKACASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNAR 592 V PDK+TFTFI+KACA++ + + G+Q+HG++ K G +DVYVYNTL+H+YAK G F +R Sbjct: 123 VFPDKYTFTFIVKACATIGNAKQGEQVHGLVTKIGLEEDVYVYNTLIHMYAKCGCFGVSR 182 Query: 593 VLLDKMSEPDVIAWNAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDE 772 ++D + E DVIAWN +LS + E GL ELARE +EMPVKN ESWNFMVSGYVNVGLVDE Sbjct: 183 GMIDGLVEDDVIAWNGLLSVFAERGLFELARELFDEMPVKNVESWNFMVSGYVNVGLVDE 242 Query: 773 ARSIFDEMLEKDVVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACA 952 AR +FDEML KDVVSWN +ITGY K + EVL LFEDM R V PDNCTLVNVLSACA Sbjct: 243 ARKVFDEMLVKDVVSWNVMITGYTKADRFAEVLALFEDMLRA-KVKPDNCTLVNVLSACA 301 Query: 953 GVGALGQGDWVQAYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNS 1132 GVG+L QG WV AYI++ GIEV+ FLATALVDMY KCGCIEKA+EVFN T RKD+STWN+ Sbjct: 302 
GVGSLSQGKWVHAYIERNGIEVHDFLATALVDMYCKCGCIEKALEVFNGTLRKDISTWNA 361 Query: 1133 MISGLSVHGYGESALETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKY 1312 MI+GLS HGY + AL+TF+ ++ADG KPN VTFVSVL+ CS+ L EGR MF M +Y Sbjct: 362 MIAGLSNHGYLDDALKTFDELIADGIKPNKVTFVSVLSTCSQGGLLSEGRRMFDLMISEY 421 Query: 1313 GIEPSVEHYGCMVD 1354 I+P++ HYGCMVD Sbjct: 422 RIQPTLVHYGCMVD 435 >XP_015063884.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Solanum pennellii] Length = 536 Score = 511 bits (1316), Expect = e-175 Identities = 261/417 (62%), Positives = 316/417 (75%), Gaps = 10/417 (2%) Frame = +2 Query: 134 PPPSLLS-----LTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTI---SP-SLSYPH 286 P PS LS S++++HQ HA MLKTG F P+ +SRLLT AT + SP +LSY Sbjct: 6 PTPSTLSSFLEMANSISELHQAHAVMLKTGLFRDPFAASRLLTKATVLPISSPETLSYAL 65 Query: 287 SIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFILKACAS 463 S+F +I +PNSYIYNTIIRAYSTSP P + IF+ ML V PDK+TFTFI+KACA+ Sbjct: 66 SVFTHIEEPNSYIYNTIIRAYSTSPFPQLALIIFLKMLNSVNKVFPDKYTFTFIVKACAT 125 Query: 464 LCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAV 643 + + + G+Q+HG++ K G +DVYVYNTLVH+YAK G F +R ++D + E DVIAWNA+ Sbjct: 126 MENAKQGEQVHGLVTKIGLEEDVYVYNTLVHMYAKCGCFGVSRGMIDGLIEDDVIAWNAL 185 Query: 644 LSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWN 823 LS Y E GL ELARE +EMPVKN ESWNFMVSGYVNVGLVDEAR +FDEML KDVVSWN Sbjct: 186 LSVYAERGLFELARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSWN 245 Query: 824 SLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDK 1003 +ITGY K + EVL LFEDM R V PD+CTLVNVLSACAGVG+L QG WV A+I++ Sbjct: 246 VMITGYTKADKFNEVLTLFEDMLR-AKVKPDDCTLVNVLSACAGVGSLSQGKWVHAFIER 304 Query: 1004 KGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALET 1183 GI V+ FLATALVDMY KCGCIEK +EVFN T RKD+STWN+MI+G S HGY + AL+T Sbjct: 305 NGIAVHNFLATALVDMYCKCGCIEKGLEVFNGTLRKDISTWNAMIAGFSNHGYLDDALKT 364 Query: 1184 FNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 FN ++ADG KPN+VTFVSVL+ CS+ L EGR MF M +Y I+P++ HYGCMVD Sbjct: 365 
FNELIADGIKPNEVTFVSVLSTCSQGGLLSEGRRMFELMINEYRIQPTLVHYGCMVD 421 >XP_016558299.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Capsicum annuum] Length = 535 Score = 510 bits (1313), Expect = e-175 Identities = 256/415 (61%), Positives = 318/415 (76%), Gaps = 8/415 (1%) Frame = +2 Query: 134 PPPSLLSL----TSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISPS----LSYPHS 289 P PS+LS S++Q+HQ HA MLKTG FH+P+ +SRLLT AT + S LSY S Sbjct: 6 PTPSILSFIETANSISQLHQAHAFMLKTGLFHNPFSASRLLTKATLLPISSPEVLSYALS 65 Query: 290 IFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLC 469 +F +I QPNSYIYNTIIRAYSTSP P + IF++ML V PDK+TFTF++KACA++ Sbjct: 66 VFTHIQQPNSYIYNTIIRAYSTSPFPQLALIIFLNMLNK--VSPDKYTFTFVVKACATME 123 Query: 470 SIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLS 649 + + G+Q+ G++ K G +DVYVYNTLVH+YAK G F +R ++D++ E DVIAWNA+LS Sbjct: 124 NAKQGEQVQGLVTKVGLEEDVYVYNTLVHMYAKCGCFGVSRGMIDRLVEDDVIAWNALLS 183 Query: 650 AYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSL 829 Y E GL+E ARE +EMPVKN ESWNFMVSGYVNVGLVDEAR +FDEML KDVVSWN + Sbjct: 184 VYAERGLIEYARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSWNVM 243 Query: 830 ITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKG 1009 +TGY K + EVL LFEDM R V PD+CTLVNVL+ACAGVG+L QG WV A+I++ G Sbjct: 244 VTGYTKADRFNEVLALFEDMLR-TKVKPDDCTLVNVLAACAGVGSLSQGKWVHAFIERNG 302 Query: 1010 IEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFN 1189 IEV+ FLATALVDMY KCGCIEK +EVF+ T RKD+STWN+MI+G S HGY + AL+TFN Sbjct: 303 IEVHNFLATALVDMYCKCGCIEKGLEVFSGTLRKDISTWNAMIAGFSNHGYLDDALKTFN 362 Query: 1190 RMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 ++ DG KPN+VTFVS+L+ CS+ L EGR MF M +Y I+P++ HYGCMVD Sbjct: 363 ELIVDGIKPNEVTFVSILSTCSQGGLLSEGRRMFDLMINEYRIQPTLVHYGCMVD 417 >XP_006358091.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Solanum tuberosum] Length = 536 Score = 509 bits (1312), Expect = e-175 Identities = 259/417 (62%), Positives = 316/417 (75%), Gaps = 10/417 (2%) Frame = +2 Query: 134 
PPPSLLS-----LTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTI---SP-SLSYPH 286 P P LS S++++HQ HA MLKTG F P+ +SRLLT AT + SP +LSY Sbjct: 6 PTPLTLSSFLEMANSISELHQAHAVMLKTGLFRDPFAASRLLTKATVLPISSPETLSYAL 65 Query: 287 SIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFILKACAS 463 S+F +I +PNSYIYNTIIRAYSTSP P + IF+ ML V PD++TFTFI+KACA+ Sbjct: 66 SVFTHIEEPNSYIYNTIIRAYSTSPFPQLALIIFLKMLNSVNKVFPDRYTFTFIVKACAT 125 Query: 464 LCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAV 643 + + + G+Q+HG++ K G +DVY+YNTLVH+YAK G F +R ++D + E DVIAWNA+ Sbjct: 126 MENAKQGEQVHGLVTKIGLEEDVYIYNTLVHMYAKCGCFGISRGMIDGLIEDDVIAWNAL 185 Query: 644 LSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWN 823 LS Y E GL ELARE +EMPVKN ESWNFMVSGYVNVGLVDEAR +FDEML KDVVSWN Sbjct: 186 LSVYAERGLFELARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSWN 245 Query: 824 SLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDK 1003 +ITGY K + EVL LFEDM R V PD+CTLVNVLSACAGVG+L QG WV A+I++ Sbjct: 246 VMITGYTKADKFNEVLTLFEDMLR-AKVKPDDCTLVNVLSACAGVGSLSQGKWVHAFIER 304 Query: 1004 KGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALET 1183 GIEV+ FLATALVDMY KCGCIEK +EVFN T RKD+STWN+MI+G S HGY + AL+T Sbjct: 305 NGIEVHNFLATALVDMYCKCGCIEKGLEVFNGTLRKDISTWNAMIAGFSNHGYLDDALKT 364 Query: 1184 FNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 FN ++ADG KPN+VTFVSVL+ CS+ L EGR MF M +Y I+P++ HYGCMVD Sbjct: 365 FNELIADGIKPNEVTFVSVLSTCSQGGLLSEGRRMFELMINEYRIQPTLVHYGCMVD 421 >XP_016478659.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like [Nicotiana tabacum] Length = 550 Score = 509 bits (1312), Expect = e-175 Identities = 261/434 (60%), Positives = 321/434 (73%), Gaps = 16/434 (3%) Frame = +2 Query: 101 ILP*TMSSNIFPPPSLLS----------LTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTA 250 ++P T N P + LS S++++HQ+HA +LKTG F +P+ +SRLLT Sbjct: 3 VVPVTKIGNTMSPAATLSPEPIFSFIEMANSISELHQSHAFLLKTGLFRNPFAASRLLTK 62 Query: 251 ATTISPS-----LSYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY- 412 ATT+ S LSY SIF +I +PNSY YNTIIRAYSTS P S IF+ 
+L + Sbjct: 63 ATTLPSSSSADTLSYALSIFTHIEEPNSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHK 122 Query: 413 VVPDKFTFTFILKACASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNAR 592 V PDK+TFTFI+KACA++ + + G+Q+HG++ K G +DVYVYNTL+H+YAK G F +R Sbjct: 123 VFPDKYTFTFIVKACATIGNAKQGEQVHGLVTKIGLEEDVYVYNTLIHMYAKCGCFGVSR 182 Query: 593 VLLDKMSEPDVIAWNAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDE 772 ++D + E DVIAWN +LS + E GL ELARE +EMPVKN ESWNFMVSGYVNVGLVDE Sbjct: 183 GMIDGLVEDDVIAWNGLLSVFSERGLFELARELFDEMPVKNVESWNFMVSGYVNVGLVDE 242 Query: 773 ARSIFDEMLEKDVVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACA 952 AR +FDEML KDVVSWN +ITGY K + EVL LFEDM R V PDNCTLVNVLSACA Sbjct: 243 ARKVFDEMLVKDVVSWNVMITGYTKADRFAEVLALFEDMLRA-KVKPDNCTLVNVLSACA 301 Query: 953 GVGALGQGDWVQAYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNS 1132 GVG+L QG WV AYI++ GIEV+ FLATALVDMY KCGCIEKA+EVFN T RKD+STWN+ Sbjct: 302 GVGSLSQGKWVHAYIERNGIEVHDFLATALVDMYCKCGCIEKALEVFNGTLRKDISTWNA 361 Query: 1133 MISGLSVHGYGESALETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKY 1312 MI+GLS HGY + AL+TF+ ++ADG KPN VTFVSVL+ CS+ L EGR MF M +Y Sbjct: 362 MIAGLSNHGYLDDALKTFDELIADGIKPNKVTFVSVLSTCSQGGLLSEGRRMFDLMISEY 421 Query: 1313 GIEPSVEHYGCMVD 1354 I+P++ HYGCMVD Sbjct: 422 RIQPTLVHYGCMVD 435 >XP_009627120.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Nicotiana tomentosiformis] Length = 550 Score = 507 bits (1305), Expect = e-173 Identities = 254/423 (60%), Positives = 321/423 (75%), Gaps = 9/423 (2%) Frame = +2 Query: 113 TMSSNIFPPPS---LLSLTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISPS---- 271 +M++ + P P + + S++++HQ+HA +LKTG F +P+ +SRLLT ATT+ S Sbjct: 14 SMAATLSPEPIFSFIETANSISELHQSHAFLLKTGLFRNPFAASRLLTKATTLPTSSSAD 73 Query: 272 -LSYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFI 445 LSY SIF +I +PNSY YNTIIRAYSTS P S IF+ +L + + PDK+TFTFI Sbjct: 74 TLSYALSIFTHIEEPNSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHKIFPDKYTFTFI 133 Query: 446 LKACASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDV 625 +KACA++ + + GQQ+HG++ K G +D YV+NTL+H+YAK G F +R ++D + E DV 
Sbjct: 134 VKACATIGNAKQGQQVHGLVTKIGLEEDEYVHNTLIHMYAKCGCFGVSRGMIDGLVEDDV 193 Query: 626 IAWNAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEK 805 IAWN +LS + E GL ELARE +EMPVKN ESWNFM+SGYVNVGLVDEAR +FDEM +K Sbjct: 194 IAWNGLLSVFAERGLFELARELFDEMPVKNVESWNFMISGYVNVGLVDEARKVFDEMSDK 253 Query: 806 DVVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWV 985 DVVSWN +ITGY K + EVL LFEDM R V PDNCTLVNVLSACAGVG+L QG WV Sbjct: 254 DVVSWNVMITGYTKADKFAEVLALFEDMLRA-KVKPDNCTLVNVLSACAGVGSLSQGKWV 312 Query: 986 QAYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYG 1165 AYI++ GI+V+ FLATALVDMY KCGCIEKA+EVFN T RKD+STWN+MI+GLS HG+ Sbjct: 313 HAYIERYGIQVHDFLATALVDMYCKCGCIEKALEVFNGTLRKDISTWNAMIAGLSNHGFL 372 Query: 1166 ESALETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGC 1345 + ALETFN ++ADG KPN+VTFVSVL+ CS+ L EGR MF M +Y I+P++ HYGC Sbjct: 373 DDALETFNELIADGIKPNEVTFVSVLSTCSQGGLLSEGRRMFDLMISEYRIQPTLVHYGC 432 Query: 1346 MVD 1354 MVD Sbjct: 433 MVD 435 >XP_019264683.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Nicotiana attenuata] OIT36246.1 pentatricopeptide repeat-containing protein [Nicotiana attenuata] Length = 550 Score = 506 bits (1303), Expect = e-173 Identities = 256/418 (61%), Positives = 316/418 (75%), Gaps = 10/418 (2%) Frame = +2 Query: 131 FPPPSLLSLTSM----TQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISPS-----LSYP 283 F P + S M +++HQ+HA +LK+G F +P+ +SRLLT ATT+ S LSY Sbjct: 19 FSPEPIFSFIEMANTISELHQSHAFLLKSGLFRNPFAASRLLTKATTLPTSSSVDTLSYA 78 Query: 284 HSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFILKACA 460 SIF +I +PNSY YNTIIRAYSTS P S IF+ ML + V PDK+TFTFI+KACA Sbjct: 79 LSIFTHIEEPNSYTYNTIIRAYSTSSFPQLSLIIFLKMLNAVHKVFPDKYTFTFIVKACA 138 Query: 461 SLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNA 640 ++ + + G+Q+HG++ K G +DVYVYNTL+H+YAK G F +R ++D + E DVIAWN Sbjct: 139 TIGNAKQGEQVHGLVTKIGLEEDVYVYNTLIHMYAKCGCFGVSRGMIDGLVEDDVIAWNG 198 Query: 641 VLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSW 820 +LS + E GL ELARE +EMPVKN 
ESWNFMVSGYVNVGLVDEAR +FDEML KDVVSW Sbjct: 199 LLSVFAERGLFELARELFDEMPVKNVESWNFMVSGYVNVGLVDEARKVFDEMLVKDVVSW 258 Query: 821 NSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYID 1000 N +ITGY K + EVL LFEDM R V PDNCTLVNVLSACAGVG+L QG W+ AYI+ Sbjct: 259 NVMITGYTKADRFAEVLALFEDMLR-AKVKPDNCTLVNVLSACAGVGSLSQGKWIHAYIE 317 Query: 1001 KKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALE 1180 + GIEV+ FLATALVDMY KCGCIEKA+EVF TSRKD+STWN+MI+GLS HGY + AL+ Sbjct: 318 RNGIEVHDFLATALVDMYCKCGCIEKALEVFIGTSRKDISTWNAMIAGLSNHGYLDDALK 377 Query: 1181 TFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 TF+ ++ADG KPN+VTFVSVL+ CS+ L EGR +F M +Y I+P++ HYGCMVD Sbjct: 378 TFDELIADGIKPNEVTFVSVLSTCSQGGLLSEGRRIFDLMISEYRIQPTLVHYGCMVD 435 >XP_016468628.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like, partial [Nicotiana tabacum] Length = 633 Score = 508 bits (1309), Expect = e-173 Identities = 255/423 (60%), Positives = 321/423 (75%), Gaps = 9/423 (2%) Frame = +2 Query: 113 TMSSNIFPPPS---LLSLTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISPS---- 271 +M++ + P P + + S++++HQ+HA +LKTG F +P+ +SRLLT ATT+ S Sbjct: 97 SMAATLSPEPIFSFIETANSISELHQSHAFLLKTGLFRNPFAASRLLTKATTLPTSSSAD 156 Query: 272 -LSYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAY-VVPDKFTFTFI 445 LSY SIF +I +PNSY YNTIIRAYSTS P S IF+ +L + + PDK+TFTFI Sbjct: 157 TLSYALSIFTHIEEPNSYTYNTIIRAYSTSSFPQLSLIIFLKLLNAVHKIFPDKYTFTFI 216 Query: 446 LKACASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDV 625 +KACA++ + + GQQ+HG++ K G +D YVYNTL+H+YAK G F +R ++D + E DV Sbjct: 217 VKACATIGNAKQGQQVHGLVTKIGLEEDEYVYNTLIHMYAKCGCFGVSRGMIDGLVEDDV 276 Query: 626 IAWNAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEK 805 IAWN +LS + E GL ELARE +EMPVKN ESWNFM+SGYVNVGLVDEAR +FDEM +K Sbjct: 277 IAWNGLLSVFAERGLFELARELFDEMPVKNVESWNFMISGYVNVGLVDEARKVFDEMSDK 336 Query: 806 DVVSWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWV 985 DVVSWN +ITGY K + EVL LFEDM R V PDNCTLVNVLSACAGVG+L QG WV Sbjct: 337 
DVVSWNVMITGYTKADRFAEVLALFEDMLRA-KVKPDNCTLVNVLSACAGVGSLSQGKWV 395 Query: 986 QAYIDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYG 1165 AYI++ GI+V+ FLATALVDMY KCGCIEKA+EVFN T RKD+STWN+MI+GLS HG+ Sbjct: 396 HAYIERYGIQVHDFLATALVDMYCKCGCIEKALEVFNGTLRKDISTWNAMIAGLSNHGFL 455 Query: 1166 ESALETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGC 1345 + ALETFN ++ADG KPN+VTFVSVL+ CS+ L EGR MF M +Y I+P++ HYGC Sbjct: 456 DDALETFNELIADGIKPNEVTFVSVLSTCSQGGLLSEGRRMFDLMISEYRIQPTLVHYGC 515 Query: 1346 MVD 1354 MVD Sbjct: 516 MVD 518 >XP_002265079.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Vitis vinifera] Length = 536 Score = 496 bits (1276), Expect = e-169 Identities = 246/417 (58%), Positives = 315/417 (75%), Gaps = 6/417 (1%) Frame = +2 Query: 122 SNIFPPPSLLSL----TSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTIS--PSLSYP 283 S+ FPPP +LS TS++++HQ HA +LK+G H + +SRL+ + +T S ++ Y Sbjct: 2 SSSFPPPPILSFAEMATSISELHQAHAHILKSGLIHSTFAASRLIASVSTNSHAQAIPYA 61 Query: 284 HSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACAS 463 HSIF+ I PNSY++NTIIRAY+ SPTP + IF ML A V+PDK+TFTF LK+C S Sbjct: 62 HSIFSRIPNPNSYMWNTIIRAYANSPTPEAALTIFHQMLH-ASVLPDKYTFTFALKSCGS 120 Query: 464 LCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAV 643 +E G+QIHG ++K G DD+++ NTL+H+YA G +AR LLD+M E DV++WNA+ Sbjct: 121 FSGVEEGRQIHGHVLKTGLGDDLFIQNTLIHLYASCGCIEDARHLLDRMLERDVVSWNAL 180 Query: 644 LSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWN 823 LSAY E GLMELA +EM +N ESWNFM+SGYV VGL++EAR +F E K+VVSWN Sbjct: 181 LSAYAERGLMELACHLFDEMTERNVESWNFMISGYVGVGLLEEARRVFGETPVKNVVSWN 240 Query: 824 SLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDK 1003 ++ITGY+ G + EVL LFEDMQ G V PDNCTLV+VLSACA VGAL QG+WV AYIDK Sbjct: 241 AMITGYSHAGRFSEVLVLFEDMQHAG-VKPDNCTLVSVLSACAHVGALSQGEWVHAYIDK 299 Query: 1004 KGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALET 1183 GI ++GF+ATALVDMYSKCG IEKA+EVFN RKD+STWNS+ISGLS HG G+ AL+ Sbjct: 300 
NGISIDGFVATALVDMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSGQHALQI 359 Query: 1184 FNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 F+ M+ +G+KPN+VTFV VL+ACSR L+EGREMF M +GI+P++EHYGCMVD Sbjct: 360 FSEMLVEGFKPNEVTFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGCMVD 416 Score = 62.4 bits (150), Expect = 9e-07 Identities = 55/263 (20%), Positives = 117/263 (44%), Gaps = 4/263 (1%) Frame = +2 Query: 161 SMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISPSLSYPHSIFNNIHQPNSYIYNTII 340 +++Q HA + K G ++++ L+ + S+ +FN+ + + +N+II Sbjct: 286 ALSQGEWVHAYIDKNGISIDGFVATALVDMYSKCG-SIEKALEVFNSCLRKDISTWNSII 344 Query: 341 RAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLCSIEYGQQIHGVLVKGGF 520 ST + + IF +ML + + P++ TF +L AC+ ++ G+++ Sbjct: 345 SGLSTHGSGQHALQIFSEMLVEGFK-PNEVTFVCVLSACSRAGLLDEGREM--------- 394 Query: 521 YDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELAREFLNE 700 +N +VHV+ +P + + ++ +GL+E A E + + Sbjct: 395 ------FNLMVHVHG---------------IQPTIEHYGCMVDLLGRVGLLEEAEELVQK 433 Query: 701 MPVKNQE-SWNFMVSGYVNVGLVDEARSIFDEMLE---KDVVSWNSLITGYAKVGDYCEV 868 MP K W ++ N G V+ A + ++LE ++ S+ L YA +G + +V Sbjct: 434 MPQKEASVVWESLLGACRNHGNVELAERVAQKLLELSPQESSSFVQLSNMYASMGRWKDV 493 Query: 869 LRLFEDMQRGGNVVPDNCTLVNV 937 + + + M+ G C+++ V Sbjct: 494 MEVRQKMRAQGVRKDPGCSMIEV 516 >XP_017615748.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Gossypium arboreum] Length = 551 Score = 485 bits (1248), Expect = e-165 Identities = 232/402 (57%), Positives = 309/402 (76%), Gaps = 3/402 (0%) Frame = +2 Query: 158 TSMTQVHQTHAQMLKTGFF-HHPYLSSRLLTAATTISP--SLSYPHSIFNNIHQPNSYIY 328 +S++Q+HQ HA +LKTG F ++ ++S++L++ A + +LSY HS+F +I PNS+ Y Sbjct: 35 SSISQIHQAHAHLLKTGVFPNNTFVSNKLISFAVSNPDPITLSYAHSVFTHITDPNSFSY 94 Query: 329 NTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLCSIEYGQQIHGVLV 508 N++IRAY+ S TP + +F ML V+PDK++FTF LKACA C +E G QIHG+ + Sbjct: 95 NSLIRAYANSRTPENALFLFRQMLEGGPVLPDKYSFTFALKACAGFCGVEEGMQIHGLAL 154 Query: 509 KGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELARE 688 K G D++V NTL+HVY + G 
F AR LLD+M++ DV++WNA+L+AY+E G M LAR Sbjct: 155 KLGIGFDIFVANTLIHVYGRSGHFGFARSLLDRMTDRDVVSWNALLTAYIETGFMRLARG 214 Query: 689 FLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSLITGYAKVGDYCEV 868 +EM +N ESWNFM+SGY++ GL++EA+S+FD M KDVVSWN++ITGYA + EV Sbjct: 215 LFDEMDERNVESWNFMISGYLSSGLLEEAKSVFDSMPLKDVVSWNAIITGYAHSSRFDEV 274 Query: 869 LRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKGIEVNGFLATALVD 1048 L LFEDMQR V PD CTLVNVLSACA +GALGQG+W+ YIDK GI+ NGF+ATALVD Sbjct: 275 LELFEDMQR-EEVRPDTCTLVNVLSACAHLGALGQGEWIHGYIDKNGIDTNGFIATALVD 333 Query: 1049 MYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFNRMVADGYKPNDVT 1228 MYSKCG I+KA+ VF S+KD+STWNS+I GL +HGYGE+ALETF+ M+ +G++PN+VT Sbjct: 334 MYSKCGNIDKALNVFRNASKKDISTWNSIIVGLGMHGYGETALETFSEMLMEGFEPNEVT 393 Query: 1229 FVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 F++VL ACSR FL EGR+MF+ M + YGIEP+VEHYGCMVD Sbjct: 394 FIAVLTACSRSRFLNEGRKMFKLMVDDYGIEPAVEHYGCMVD 435 >XP_016672141.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like [Gossypium hirsutum] Length = 534 Score = 483 bits (1244), Expect = e-164 Identities = 231/402 (57%), Positives = 308/402 (76%), Gaps = 3/402 (0%) Frame = +2 Query: 158 TSMTQVHQTHAQMLKTGFF-HHPYLSSRLLTAATTISP--SLSYPHSIFNNIHQPNSYIY 328 +S++Q+HQ HA +LKTG F ++ ++S++L++ A + +LSY HS+F +I PNS+ Y Sbjct: 18 SSISQIHQAHAHLLKTGVFPNNTFVSNKLISFAVSNPDPITLSYAHSVFTHITDPNSFSY 77 Query: 329 NTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLCSIEYGQQIHGVLV 508 N++IRAY+ S TP + +F ML V+ DK++FTF LKACA C +E G QIHG+ + Sbjct: 78 NSLIRAYANSRTPENALFLFRQMLEGGPVLTDKYSFTFALKACAGFCGVEEGMQIHGLAL 137 Query: 509 KGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELARE 688 K G D++V NTL+HVY K G F AR LLD+M++ DV++WNA+LSAY+E G + LAR Sbjct: 138 KLGIGFDIFVANTLIHVYGKSGHFGFARSLLDRMADRDVVSWNALLSAYIETGFIRLARG 197 Query: 689 FLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSLITGYAKVGDYCEV 868 +EM +N ESWNFM+SGY++ GL++EA+S+FD M KDVVSWN++ITGYA + EV Sbjct: 198 LFDEMDERNVESWNFMISGYLSSGLLEEAKSVFDSMPLKDVVSWNAIITGYAHASRFDEV 257 Query: 869 
LRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKGIEVNGFLATALVD 1048 L LFEDMQR V PD CTLVNVLSACA +GALGQG+W+ YIDK GI+ NGF+ATALVD Sbjct: 258 LELFEDMQR-EEVRPDTCTLVNVLSACAHLGALGQGEWIHGYIDKNGIDTNGFIATALVD 316 Query: 1049 MYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFNRMVADGYKPNDVT 1228 MYSKCG I+KA+ VF S+KD+STWNS+I GL +HGYGE+ALETF+ M+ +G++PN+VT Sbjct: 317 MYSKCGNIDKALNVFRNASKKDISTWNSIIVGLGMHGYGETALETFSEMLTEGFEPNEVT 376 Query: 1229 FVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 F++VL ACSR FL EGR+MF+ M + YGIEP++EHYGCMVD Sbjct: 377 FIAVLTACSRSRFLNEGRKMFKLMVDDYGIEPAIEHYGCMVD 418 >XP_012484752.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18840 [Gossypium raimondii] KJB34911.1 hypothetical protein B456_006G090000 [Gossypium raimondii] Length = 534 Score = 482 bits (1241), Expect = e-164 Identities = 231/402 (57%), Positives = 308/402 (76%), Gaps = 3/402 (0%) Frame = +2 Query: 158 TSMTQVHQTHAQMLKTGFF-HHPYLSSRLLTAATTISP--SLSYPHSIFNNIHQPNSYIY 328 +S++Q+HQ HA +LKTG F ++ ++S++L++ A + +LSY HS+F +I PNS+ Y Sbjct: 18 SSISQIHQAHAHLLKTGVFPNNTFVSNKLISFAVSNPDPITLSYAHSVFTHITDPNSFSY 77 Query: 329 NTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLCSIEYGQQIHGVLV 508 N++IRAY+ S TP + +F ML V+PDK++FTF LKACA C +E G QIHG+ + Sbjct: 78 NSLIRAYANSRTPENALFLFRQMLKGGPVLPDKYSFTFALKACAGFCGVEEGMQIHGLAL 137 Query: 509 KGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELARE 688 K G D++V NTL+HVY K G F AR LLD+M++ DV++WNA+LSAY+E G + LAR Sbjct: 138 KLGIGFDIFVANTLIHVYGKSGHFGFARSLLDRMADRDVVSWNALLSAYIETGFIRLARG 197 Query: 689 FLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSLITGYAKVGDYCEV 868 +EM +N ESWNFM+SGY++ GL++EA+S+FD M KDVVSWN++ITGYA + EV Sbjct: 198 LFDEMDERNVESWNFMISGYLSSGLLEEAKSVFDSMPLKDVVSWNAIITGYAHASRFDEV 257 Query: 869 LRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKGIEVNGFLATALVD 1048 L LFEDMQR V PD CTLVNVLSACA +GALGQG+W+ YIDK GI+ NGF+ATALVD Sbjct: 258 LELFEDMQR-EEVRPDTCTLVNVLSACAHLGALGQGEWIHGYIDKNGIDTNGFIATALVD 316 Query: 1049 MYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFNRMVADGYKPNDVT 1228 
M+SKCG I+KAV VF S+KD+STWNS+I GL +HGYGE+ALETF+ M+ +G++PN+VT Sbjct: 317 MHSKCGNIDKAVNVFRNASKKDISTWNSIIVGLGMHGYGETALETFSEMLMEGFEPNEVT 376 Query: 1229 FVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 F++VL ACSR FL EG +MF+ M + YGIEP++EHYGCMVD Sbjct: 377 FIAVLTACSRSRFLNEGCKMFKLMVDDYGIEPAIEHYGCMVD 418 >OMO65465.1 hypothetical protein COLO4_31216 [Corchorus olitorius] Length = 535 Score = 469 bits (1207), Expect = e-159 Identities = 236/419 (56%), Positives = 304/419 (72%), Gaps = 6/419 (1%) Frame = +2 Query: 116 MSSNIFPPPSLL---SLTSMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISP---SLS 277 MS + PP L S++++HQ HA +LKTG F++ +S L + +P +LS Sbjct: 1 MSGTLSQPPILSFAEMANSISEIHQAHAHLLKTGLFYNNTFASNKLISFAVSNPDPKTLS 60 Query: 278 YPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKAC 457 Y HS+ I PNS+ YN++IRAY+ S TP + IF +ML + V PDK++FTF+LKAC Sbjct: 61 YAHSVLTKITNPNSFSYNSLIRAYANSHTPQSALSIFHEML-EGPVFPDKYSFTFVLKAC 119 Query: 458 ASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWN 637 A ++ G+QIH +++K G D++V NTLVHVY K G F AR LLD+M DVI+WN Sbjct: 120 AGFGGVQEGRQIHSLVLKMGIGFDIFVANTLVHVYGKSGYFGVARSLLDRMPTRDVISWN 179 Query: 638 AVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVS 817 A+LSAY+E G + LAR +EM +N ESWNFM+SGY++ GLV+EARS+FD M KDVVS Sbjct: 180 ALLSAYIENGYIRLARGLFDEMDERNVESWNFMISGYLSAGLVEEARSVFDRMPVKDVVS 239 Query: 818 WNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYI 997 WN++ITGYA + EVL LFEDMQ+ V PDNCTLVNVLSACA +GALGQG+WV AYI Sbjct: 240 WNAMITGYAHTSCFDEVLVLFEDMQQQ-EVKPDNCTLVNVLSACAHLGALGQGEWVHAYI 298 Query: 998 DKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESAL 1177 DK I NGFLATALVDMY+KCG I+KA+ VF SRKD+STWNS+I GL +HG+GE AL Sbjct: 299 DKNEIGTNGFLATALVDMYAKCGNIDKALSVFKNASRKDISTWNSIIVGLGMHGFGEHAL 358 Query: 1178 ETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 + F+ M+ +G++PN+VTFV VL+ACSR L EGR MF+ M E YGI+P++EHYGCMVD Sbjct: 359 KIFSEMLVEGFQPNEVTFVGVLSACSRTGLLNEGRYMFQLMLEDYGIQPTIEHYGCMVD 417 >XP_017979424.1 PREDICTED: pentatricopeptide 
repeat-containing protein At4g18840 [Theobroma cacao] Length = 535 Score = 458 bits (1178), Expect = e-154 Identities = 227/419 (54%), Positives = 303/419 (72%), Gaps = 6/419 (1%) Frame = +2 Query: 116 MSSNIFPPPSLL---SLTSMTQVHQTHAQMLKTG-FFHHPYLSSRLLTAATTISP--SLS 277 MS + PP L S++++ Q HA +LKTG F++HP S++L++ A +LS Sbjct: 1 MSGTLSQPPILYFTEMANSVSEIQQAHAHLLKTGLFYNHPLASNKLISFAVNNPDPKTLS 60 Query: 278 YPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKAC 457 Y HS+F + PNS+ YN++IRAY+ S TP + +F ML V PDK++FTF+LKAC Sbjct: 61 YAHSVFTHTTNPNSFSYNSLIRAYANSHTPQNALSLFRQML-QGPVFPDKYSFTFVLKAC 119 Query: 458 ASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWN 637 A ++ G+QIHG++++ G DV+V NTL+HVY KGG F AR LLD+M + D ++WN Sbjct: 120 AGFGGVQEGRQIHGLVLRMGIGFDVFVANTLIHVYGKGGYFGVARSLLDRMPKRDAVSWN 179 Query: 638 AVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVS 817 A+LSAY+E G + LA EM +N ESWNFM+SGY++ GLV+EARSIFD M K+VVS Sbjct: 180 ALLSAYIETGYIRLASGLFEEMEERNVESWNFMISGYLSAGLVEEARSIFDRMPVKNVVS 239 Query: 818 WNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYI 997 WN+LI GYA + EVL LFEDMQR V PDNCTLVNVLSACA +GALGQG+W+ +YI Sbjct: 240 WNALIAGYAHTSCFGEVLVLFEDMQRE-KVKPDNCTLVNVLSACAHLGALGQGEWIHSYI 298 Query: 998 DKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESAL 1177 DK I +NG++ATALVDMYSKCG I+KA+ VF SRKD+STWNS+I GL +HG GE AL Sbjct: 299 DKNAIGINGYVATALVDMYSKCGNIDKALYVFRNASRKDISTWNSIIVGLGMHGLGEHAL 358 Query: 1178 ETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 E F+ M+ +G++PN+VTF+ +L+ACSR L EG +F+ M + YGI+P++EH+GCMVD Sbjct: 359 EIFSEMLVNGFEPNEVTFIGLLSACSRACLLNEGHHIFQIMVDDYGIQPTIEHFGCMVD 417 >EOY27913.1 Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] EOY27914.1 Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] EOY27915.1 Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] EOY27916.1 Pentatricopeptide repeat (PPR-like) 
superfamily protein, putative isoform 1 [Theobroma cacao] EOY27917.1 Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] Length = 535 Score = 458 bits (1178), Expect = e-154 Identities = 227/419 (54%), Positives = 303/419 (72%), Gaps = 6/419 (1%) Frame = +2 Query: 116 MSSNIFPPPSLL---SLTSMTQVHQTHAQMLKTG-FFHHPYLSSRLLTAATTISP--SLS 277 MS + PP L S++++ Q HA +LKTG F++HP S++L++ A +LS Sbjct: 1 MSGTLSQPPILYFTEMANSVSEIQQAHAHLLKTGLFYNHPLASNKLISFAVNNPDPKTLS 60 Query: 278 YPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKAC 457 Y HS+F + PNSY YN++IRAY+ S TP + +F ML V PDK++FTF+LKAC Sbjct: 61 YAHSVFTHTTNPNSYSYNSLIRAYANSHTPQNALSLFRQML-QGPVFPDKYSFTFVLKAC 119 Query: 458 ASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWN 637 A ++ G+QIHG++++ G DV+V NTL+HVY KGG F AR LLD+M + D ++WN Sbjct: 120 AGFGGVQEGRQIHGLVLRMGIGFDVFVANTLIHVYGKGGYFGVARSLLDRMPKRDAVSWN 179 Query: 638 AVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVS 817 A+LSAY+E G + LA EM +N ESWNFM+SGY++ GLV+EARS+F M K+VVS Sbjct: 180 ALLSAYIETGYIRLASGLFEEMEERNVESWNFMISGYLSAGLVEEARSVFYRMPVKNVVS 239 Query: 818 WNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYI 997 WN+LITGYA + EVL LFEDMQR V PDNCTLVNVLSACA +GALGQG+W+ +YI Sbjct: 240 WNALITGYAHTSCFGEVLVLFEDMQRE-KVKPDNCTLVNVLSACAHLGALGQGEWIHSYI 298 Query: 998 DKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESAL 1177 DK I +NG++ATALVDMYSKCG I+KA+ VF SRKD+STWNS+I GL +HG GE AL Sbjct: 299 DKNAIGINGYIATALVDMYSKCGNIDKALYVFRNASRKDISTWNSIIVGLGMHGLGEHAL 358 Query: 1178 ETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 E F+ M+ +G++PN+VTF+ +L+ACSR L EG +F+ M + YGI+P++EH+GCMVD Sbjct: 359 EIFSEMLVNGFEPNEVTFIGLLSACSRAGLLNEGHHIFQIMVDDYGIQPTIEHFGCMVD 417 >XP_010439805.2 PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like, partial [Camelina sativa] Length = 554 Score = 454 bits (1167), Expect = e-153 Identities = 223/402 (55%), Positives = 289/402 (71%), Gaps = 4/402 (0%) Frame = +2 Query: 
161 SMTQVHQTHAQMLKTGFFHHPYLSSRLLT-AATTISP---SLSYPHSIFNNIHQPNSYIY 328 S++++ Q HA MLKTG F Y +S+L+ AAT +P ++SY HSI N I PN + + Sbjct: 38 SLSEIKQAHAFMLKTGLFQDTYSASKLIAFAATQTNPEPNTVSYAHSILNRIDSPNGFTH 97 Query: 329 NTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKACASLCSIEYGQQIHGVLV 508 N++IRAY+ S TP + +F DML V PDK++FTF LKACA+ C E G+QIHG+ + Sbjct: 98 NSVIRAYANSSTPEMALVVFRDMLLGP-VFPDKYSFTFALKACAAFCGFEQGRQIHGLFM 156 Query: 509 KGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWNAVLSAYVEMGLMELARE 688 K G DV+V NTLV+VYA+ G F AR +LD+M D ++WN++LSAY+ GL+E AR Sbjct: 157 KSGLMTDVFVENTLVNVYARSGYFQIARKVLDEMPVRDAVSWNSLLSAYLAKGLVEEARA 216 Query: 689 FLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVSWNSLITGYAKVGDYCEV 868 +EM +N ESWNFM+SGY GLV EA+ IFD M KDVVSWN+++T YA VG Y EV Sbjct: 217 LFDEMEERNVESWNFMISGYAAAGLVKEAKEIFDSMPGKDVVSWNAMVTAYAHVGCYDEV 276 Query: 869 LRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYIDKKGIEVNGFLATALVD 1048 L +F +M PD TLVNVLSACA +G+L QG+WV YIDK GIE+ GFLATALVD Sbjct: 277 LEVFNEMLDSSTEEPDGFTLVNVLSACASLGSLSQGEWVHVYIDKHGIEIEGFLATALVD 336 Query: 1049 MYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESALETFNRMVADGYKPNDVT 1228 MYSKCG I+KA+EVF TS++DVSTWNS+ISGLSVHG G ALE F+ MV +G+KPN +T Sbjct: 337 MYSKCGKIDKALEVFRATSKRDVSTWNSIISGLSVHGLGNDALEIFSEMVYEGFKPNGIT 396 Query: 1229 FVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 FV VL+AC+ L++ R++F + YG+EP++EHYGCMVD Sbjct: 397 FVGVLSACNHVGLLDQARKLFEMINSVYGVEPTIEHYGCMVD 438 >JAU51767.1 Pentatricopeptide repeat-containing protein, partial [Noccaea caerulescens] Length = 539 Score = 452 bits (1163), Expect = e-152 Identities = 225/419 (53%), Positives = 295/419 (70%), Gaps = 6/419 (1%) Frame = +2 Query: 116 MSSNIFPPPSLLSLT----SMTQVHQTHAQMLKTGFFHHPYLSSRLLT--AATTISPSLS 277 MS+ P +LS T S++++ Q HA MLKTG + +S+L+ A ++S Sbjct: 6 MSACSSTPLPILSFTEGAKSLSEIQQAHAFMLKTGLSRDTFSASKLIAFAVANPEPETVS 65 Query: 278 YPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKAC 457 Y HSI N I PN+Y +N++IRAY+ S TP + F +ML V PDK++FTF+LKAC Sbjct: 66 
YAHSILNRIESPNAYTHNSVIRAYANSSTPGIALIPFREMLLGP-VFPDKYSFTFVLKAC 124 Query: 458 ASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAWN 637 A+ C E G+QIHG+ +K G DV+V NTLV+VY + G F AR +LD M E DV++WN Sbjct: 125 AAFCGFEEGRQIHGLFLKSGLMSDVFVENTLVNVYGRCGYFEIARKVLDGMPEKDVVSWN 184 Query: 638 AVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVVS 817 ++LSAYVE GL+E AR +EM +N ESWNFM+SGYV GLV+EA+ +FD M KDVVS Sbjct: 185 SLLSAYVEKGLVEEARGLFDEMEERNVESWNFMISGYVAAGLVEEAKELFDSMPVKDVVS 244 Query: 818 WNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAYI 997 WN+++T YA VG Y EVL +F +M PD TLVNVLSACA +G+L QG+WV Y+ Sbjct: 245 WNAMVTAYAHVGCYSEVLEVFNEMLNSSTERPDGFTLVNVLSACASLGSLSQGEWVHVYM 304 Query: 998 DKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESAL 1177 DK GI ++GFLATALVDMYSKCG I+KA+EVF TS KDVSTWNS+ISGL VHG+G AL Sbjct: 305 DKNGIVIDGFLATALVDMYSKCGKIDKALEVFRATSEKDVSTWNSIISGLGVHGHGNDAL 364 Query: 1178 ETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 E F+ MV +G+KPN +TF+ VL+AC+ L++ R++F M YG+EP++EHYGCMVD Sbjct: 365 EIFSEMVYEGFKPNGITFIGVLSACNHVGLLDQARKLFETMNSVYGVEPTIEHYGCMVD 423 >XP_006414048.1 hypothetical protein EUTSA_v10024877mg [Eutrema salsugineum] ESQ55501.1 hypothetical protein EUTSA_v10024877mg [Eutrema salsugineum] Length = 535 Score = 451 bits (1161), Expect = e-152 Identities = 223/420 (53%), Positives = 296/420 (70%), Gaps = 7/420 (1%) Frame = +2 Query: 116 MSSNIFPPPSLLSLT----SMTQVHQTHAQMLKTGFFHHPYLSSRLLTAATTISP---SL 274 +SS P +LS T S++++ Q HA MLKTG F + +S+L+ A ++P ++ Sbjct: 5 LSSTSLP---ILSFTERAKSLSEIQQAHAFMLKTGLFRDTFSASKLIAFAV-VNPEPKTV 60 Query: 275 SYPHSIFNNIHQPNSYIYNTIIRAYSTSPTPLQSFHIFIDMLFDAYVVPDKFTFTFILKA 454 SY HSI N I PN++ +N++IRAY+ S P + F +ML V PDK++FTF+LKA Sbjct: 61 SYAHSILNRIESPNAFTHNSVIRAYANSSAPESALTAFREMLLGP-VFPDKYSFTFVLKA 119 Query: 455 CASLCSIEYGQQIHGVLVKGGFYDDVYVYNTLVHVYAKGGSFCNARVLLDKMSEPDVIAW 634 CA+ C E G+QIHG+ +K DV+V NTLV+VY + G F AR +LD M E DV++W Sbjct: 120 CAAFCGFEEGRQIHGLFLKSDLISDVFVENTLVNVYGRSGYFEIARKVLDTMPERDVVSW 179 
Query: 635 NAVLSAYVEMGLMELAREFLNEMPVKNQESWNFMVSGYVNVGLVDEARSIFDEMLEKDVV 814 N++LSAYVE GL+E AR +EM +N ESWNFM+SGY GLV+EA+ +FD M KDVV Sbjct: 180 NSLLSAYVEKGLVEEARGVFDEMDERNVESWNFMISGYAAAGLVNEAKELFDSMPVKDVV 239 Query: 815 SWNSLITGYAKVGDYCEVLRLFEDMQRGGNVVPDNCTLVNVLSACAGVGALGQGDWVQAY 994 SWN++++ YA VG Y EVL +F +M PD TLVNVLSACA +G+L QG+WV Y Sbjct: 240 SWNAMVSAYAHVGCYSEVLEVFNEMLNSSTEKPDGFTLVNVLSACANLGSLSQGEWVHVY 299 Query: 995 IDKKGIEVNGFLATALVDMYSKCGCIEKAVEVFNRTSRKDVSTWNSMISGLSVHGYGESA 1174 DK GIE++GFLATALVDMYSKCG ++KA+EVF TS+KDVSTWNSMISGLSVHG G A Sbjct: 300 TDKHGIEIDGFLATALVDMYSKCGKVDKALEVFRATSKKDVSTWNSMISGLSVHGLGNDA 359 Query: 1175 LETFNRMVADGYKPNDVTFVSVLAACSREWFLEEGREMFRNMEEKYGIEPSVEHYGCMVD 1354 LE F+ MV +G+KPN +TF++ L+AC+ L++ R +F M YG+EP++EHYGCMVD Sbjct: 360 LEIFSEMVHEGFKPNSITFIATLSACNHVGMLDQARRLFETMNSVYGVEPTIEHYGCMVD 419