BLASTX nr result
ID: Atropa21_contig00011273
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00011273 (1214 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:                         Score (bits)  E Value

ref|XP_006362659.1| PREDICTED: pentatricopeptide repeat-containi...  486  e-135
ref|XP_004234195.1| PREDICTED: pentatricopeptide repeat-containi...  474  e-131
ref|XP_002274318.1| PREDICTED: pentatricopeptide repeat-containi...  386  e-104
gb|EOY23346.1| Pentatricopeptide repeat (PPR) superfamily protei...  382  e-103
gb|EOY23345.1| Pentatricopeptide repeat (PPR) superfamily protei...  382  e-103
ref|XP_004140747.1| PREDICTED: pentatricopeptide repeat-containi...  381  e-103
ref|XP_006422051.1| hypothetical protein CICLE_v10005483mg [Citr...  371  e-100
gb|EMJ19937.1| hypothetical protein PRUPE_ppa009149mg [Prunus pe...  367  4e-99
ref|XP_006372373.1| hypothetical protein POPTR_0017s00990g [Popu...  366  1e-98
ref|XP_002327501.1| predicted protein [Populus trichocarpa]          365  2e-98
ref|XP_006836534.1| hypothetical protein AMTR_s00131p00023460 [A...  362  2e-97
ref|XP_002513638.1| pentatricopeptide repeat-containing protein,...  362  2e-97
gb|EXC05953.1| hypothetical protein L484_014222 [Morus notabilis]    362  2e-97
ref|XP_004308043.1| PREDICTED: pentatricopeptide repeat-containi...  361  4e-97
ref|NP_567622.1| pentatricopeptide repeat protein EMBRYO DEFECTI...  347  4e-93
ref|XP_002867860.1| EMB1417 [Arabidopsis lyrata subsp. lyrata] g...  346  1e-92
ref|XP_006413807.1| hypothetical protein EUTSA_v10025765mg [Eutr...  343  8e-92
ref|XP_006284192.1| hypothetical protein CARUB_v10005340mg [Caps...  343  8e-92
gb|EEC71511.1| hypothetical protein OsI_03797 [Oryza sativa Indi...  342  2e-91
emb|CAA17538.1| putative protein [Arabidopsis thaliana] gi|72689...  341  3e-91

>ref|XP_006362659.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Solanum tuberosum]
Length = 308

Score = 486 bits (1252), Expect = e-135
Identities = 241/276 (87%), Positives = 251/276 (90%)
Frame = +3

Query: 339 FRRNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSF 518
           + RN V   VCEAKGPRPRYPRVWKTK+KIGTISKSLKLVECIKGLSNVKEEVYGALDSF
Sbjct: 25  YNRNVV---VCEAKGPRPRYPRVWKTKKKIGTISKSLKLVECIKGLSNVKEEVYGALDSF 81

Query: 519 IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRL 698
           IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYF LLNALAEDGRL
Sbjct: 82  IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFALLNALAEDGRL 141

Query: 699 EEAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTM 878
           EEAEELWLKLFSQNLESMPR+FFQKMI+IYYH+EMNEKMFEIFADMEELGIRPTVPVVTM
Sbjct: 142 EEAEELWLKLFSQNLESMPRIFFQKMIAIYYHKEMNEKMFEIFADMEELGIRPTVPVVTM 201

Query: 879 VGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXX 1058
           VGNVFQKL MLDKYQKL KKYPPPKWEYRYIKGKRVKIRTKDLDKS DHD
Sbjct: 202 VGNVFQKLEMLDKYQKLKKKYPPPKWEYRYIKGKRVKIRTKDLDKSQDHDVDSKSEEVDE 261

Query: 1059 XXFDENSEDQADAVDEDNLVQLKEVDECEPGEISTV 1166
              FDENS+DQAD VDED + Q+K+V+ECEPGEIS V
Sbjct: 262  SEFDENSQDQADEVDEDYVEQIKDVEECEPGEISIV 297

>ref|XP_004234195.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Solanum lycopersicum]
Length = 305

Score = 474 bits (1221), Expect = e-131
Identities = 235/276 (85%), Positives = 248/276 (89%)
Frame = +3

Query: 339 FRRNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSF 518
           + RN V   VCEAKGPRPRYPRVWKTK+KIGTISKSLKLVECIKGLSNVKEEVYGALDSF
Sbjct: 25  YNRNVV---VCEAKGPRPRYPRVWKTKKKIGTISKSLKLVECIKGLSNVKEEVYGALDSF 81

Query: 519 IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRL 698
           IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYF LLNALAEDGRL
Sbjct: 82  IAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFALLNALAEDGRL 141

Query: 699
EEAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTM 878 EEAEELWLKLFSQNLESMPR+FFQKMI+IYYH+EMNEKMFEIFADMEELGIRPTVPVV M Sbjct: 142 EEAEELWLKLFSQNLESMPRIFFQKMIAIYYHKEMNEKMFEIFADMEELGIRPTVPVVKM 201 Query: 879 VGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXX 1058 VGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHD Sbjct: 202 VGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDVESNSEEVDE 261 Query: 1059 XXFDENSEDQADAVDEDNLVQLKEVDECEPGEISTV 1166 FDENS+DQ +ED + Q+++ +ECEP E+S V Sbjct: 262 SEFDENSQDQE---NEDYVEQIEDAEECEPAEVSVV 294 >ref|XP_002274318.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190 [Vitis vinifera] gi|302143769|emb|CBI22630.3| unnamed protein product [Vitis vinifera] Length = 295 Score = 386 bits (991), Expect = e-104 Identities = 191/249 (76%), Positives = 212/249 (85%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC AKGPRPRYPRVWKT+++IGTISKS KLV+CIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 29 VCGAKGPRPRYPRVWKTRQRIGTISKSAKLVDCIKGLSNVKEEVYGALDSFIAWELEFPL 88 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKALK LE++KEWKRIIQVTKWMLSKGQGRTMGSYF LLNALAEDGRL+EAEELW K Sbjct: 89 ITVKKALKTLEDQKEWKRIIQVTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWTK 148 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFS+NLES+PRVF+ KMISIYY R+M+EKMFEIFADMEELGIRP +V MVG+VFQKLG Sbjct: 149 LFSENLESLPRVFYDKMISIYYRRDMHEKMFEIFADMEELGIRPNTSIVKMVGDVFQKLG 208 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXXXXFDENSED 1085 MLDKY+KL KKYPPPKWEYRYIKGKRV+IR K +S D + S+D Sbjct: 209 MLDKYEKLQKKYPPPKWEYRYIKGKRVRIRAKLTGESDDPG-------------EAESDD 255 Query: 1086 QADAVDEDN 1112 +AV+E N Sbjct: 256 PGEAVNEIN 264 >gb|EOY23346.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Theobroma cacao] Length = 238 Score = 382 bits (980), Expect = e-103 Identities = 179/216 (82%), Positives = 201/216 (93%) Frame = +3 Query: 366 
VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC AKGPRPRYPRVWK++R+IGT+SKS KLV C+K LSNVKEEVYGALDSFIAWELEFPL Sbjct: 3 VCAAKGPRPRYPRVWKSRRRIGTVSKSAKLVSCVKELSNVKEEVYGALDSFIAWELEFPL 62 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKALKIL+NE+EWKRIIQV KWMLSKGQGRTMG+YF LLNALAEDGRL+EAEELW K Sbjct: 63 ITVKKALKILQNEQEWKRIIQVVKWMLSKGQGRTMGTYFTLLNALAEDGRLDEAEELWAK 122 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFS NLES PR+FF KMISIYYH+ M++KMFE+FADMEELG++P+V VV+MVGNVFQ+LG Sbjct: 123 LFSDNLESTPRIFFDKMISIYYHKGMHDKMFEVFADMEELGVKPSVSVVSMVGNVFQQLG 182 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK 1013 MLDKY KLNKKYPPPKWEYRYIKGKRVKI+ K L++ Sbjct: 183 MLDKYDKLNKKYPPPKWEYRYIKGKRVKIKVKQLEE 218 >gb|EOY23345.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1 [Theobroma cacao] Length = 258 Score = 382 bits (980), Expect = e-103 Identities = 179/216 (82%), Positives = 201/216 (93%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC AKGPRPRYPRVWK++R+IGT+SKS KLV C+K LSNVKEEVYGALDSFIAWELEFPL Sbjct: 23 VCAAKGPRPRYPRVWKSRRRIGTVSKSAKLVSCVKELSNVKEEVYGALDSFIAWELEFPL 82 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKALKIL+NE+EWKRIIQV KWMLSKGQGRTMG+YF LLNALAEDGRL+EAEELW K Sbjct: 83 ITVKKALKILQNEQEWKRIIQVVKWMLSKGQGRTMGTYFTLLNALAEDGRLDEAEELWAK 142 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFS NLES PR+FF KMISIYYH+ M++KMFE+FADMEELG++P+V VV+MVGNVFQ+LG Sbjct: 143 LFSDNLESTPRIFFDKMISIYYHKGMHDKMFEVFADMEELGVKPSVSVVSMVGNVFQQLG 202 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK 1013 MLDKY KLNKKYPPPKWEYRYIKGKRVKI+ K L++ Sbjct: 203 MLDKYDKLNKKYPPPKWEYRYIKGKRVKIKVKQLEE 238 >ref|XP_004140747.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Cucumis sativus] Length = 331 Score = 381 bits (978), Expect = e-103 Identities = 187/266 (70%), Positives = 221/266 (83%) Frame = +3 Query: 360 
SHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEF 539 S VC AKGPRPRYPRVWKTK++IGTISK+ KLV+C+KGLSNVKEEVYGALDSFIAWELEF Sbjct: 45 SVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEF 104 Query: 540 PLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELW 719 PLITVKKALK LEN++EWKRIIQ+TKWMLSKGQGRTMGSYF LLNALAEDGRL+EAEELW Sbjct: 105 PLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELW 164 Query: 720 LKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQK 899 KLFSQ+LES+PR+FF KMIS+YY + M++K+FE+FADMEELG++P + +VT VGNVFQ+ Sbjct: 165 NKLFSQHLESIPRIFFHKMISLYYDQAMHDKLFEVFADMEELGVQPNMAIVTKVGNVFQE 224 Query: 900 LGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXXXXFDENS 1079 LGMLDKY+KL KKYPPPKWEYRYIKGKRVKIR K L ++ + + NS Sbjct: 225 LGMLDKYKKLMKKYPPPKWEYRYIKGKRVKIRAKYLSENGNSNNGLSEHAKMEHS-STNS 283 Query: 1080 EDQADAVDEDNLVQLKEVDECEPGEI 1157 D+A+ ED+ ++ E +P EI Sbjct: 284 IDEAEITSEDSSLEDDEDMSEDPDEI 309 >ref|XP_006422051.1| hypothetical protein CICLE_v10005483mg [Citrus clementina] gi|568875045|ref|XP_006490621.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Citrus sinensis] gi|557523924|gb|ESR35291.1| hypothetical protein CICLE_v10005483mg [Citrus clementina] Length = 310 Score = 371 bits (952), Expect = e-100 Identities = 175/212 (82%), Positives = 194/212 (91%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC A+GPRPRYPRVWK +++IGTISKS KLV CIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 29 VCAARGPRPRYPRVWKARKRIGTISKSAKLVTCIKGLSNVKEEVYGALDSFIAWELEFPL 88 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKALK LENEK+WKRIIQVTKWMLSKGQGRTMG+YF+LLNALAEDGRL+EAEELW K Sbjct: 89 ITVKKALKTLENEKDWKRIIQVTKWMLSKGQGRTMGTYFLLLNALAEDGRLDEAEELWTK 148 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 +F +LE PR+FF KMISIYY+R M+EKMFEIFADMEELG+RP V +V+M+GN FQKLG Sbjct: 149 IFLDHLEGTPRIFFDKMISIYYNRGMHEKMFEIFADMEELGVRPNVSIVSMMGNAFQKLG 208 Query: 906 
MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTK 1001 MLDKY+KL KKYPPPKWEYRYIKGKRV+I K Sbjct: 209 MLDKYEKLKKKYPPPKWEYRYIKGKRVRIPAK 240 >gb|EMJ19937.1| hypothetical protein PRUPE_ppa009149mg [Prunus persica] Length = 305 Score = 367 bits (943), Expect = 4e-99 Identities = 176/210 (83%), Positives = 193/210 (91%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 +C AKGPRPRYPRVWK ++IGTISKS+KLVE IKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 29 LCAAKGPRPRYPRVWKANKRIGTISKSIKLVESIKGLSNVKEEVYGALDSFIAWELEFPL 88 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKALK LEN+KEWKRIIQV+KWMLSKGQGRTMG+YF LLNALAEDGR+EEAEELW K Sbjct: 89 ITVKKALKTLENQKEWKRIIQVSKWMLSKGQGRTMGTYFTLLNALAEDGRVEEAEELWTK 148 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFSQ LESMPR+FF KMISIYY +++KMFEIFADMEELG++P V +VT VGNVFQ+LG Sbjct: 149 LFSQYLESMPRMFFDKMISIYYRHGIHDKMFEIFADMEELGVQPNVSIVTKVGNVFQELG 208 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIR 995 MLDKY KL +KYPPPKWEYRYIKGKRVKIR Sbjct: 209 MLDKYHKLKQKYPPPKWEYRYIKGKRVKIR 238 >ref|XP_006372373.1| hypothetical protein POPTR_0017s00990g [Populus trichocarpa] gi|550318992|gb|ERP50170.1| hypothetical protein POPTR_0017s00990g [Populus trichocarpa] Length = 336 Score = 366 bits (939), Expect = 1e-98 Identities = 177/245 (72%), Positives = 208/245 (84%) Frame = +3 Query: 288 RYTLFFSISSLLAFEVTFRRNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECI 467 RY+L + +F+ T R VC AKGPRPRYPRVWKTKR+IGTISKS KLV+CI Sbjct: 5 RYSLPLIPNRFQSFDTT-RNTKSCVVVCAAKGPRPRYPRVWKTKRRIGTISKSAKLVDCI 63 Query: 468 KGLSNVKEEVYGALDSFIAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRT 647 KGLSNVKEEVYGALDSF+AWELEFPLI VKKAL+ LE ++EWKRIIQVTKWMLSKGQGRT Sbjct: 64 KGLSNVKEEVYGALDSFVAWELEFPLIAVKKALRALEEQQEWKRIIQVTKWMLSKGQGRT 123 Query: 648 MGSYFMLLNALAEDGRLEEAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIF 827 MG+YF L+NALAEDGRL+E EELW KLFSQ LE PR+ F KMISIYY R+M++++FEIF Sbjct: 124 MGTYFTLMNALAEDGRLDEVEELWTKLFSQYLEGTPRMMFDKMISIYYKRDMHDQIFEIF 183 Query: 828 
ADMEELGIRPTVPVVTMVGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDL 1007 ADMEELG+RP+V +V MVGNVFQ+LGM+DKY+KL KKYPPPKW YRYIKGKRV++R K+ Sbjct: 184 ADMEELGLRPSVSIVNMVGNVFQRLGMMDKYEKLKKKYPPPKWIYRYIKGKRVRVRAKND 243 Query: 1008 DKSHD 1022 +++ D Sbjct: 244 NEAGD 248 >ref|XP_002327501.1| predicted protein [Populus trichocarpa] Length = 229 Score = 365 bits (937), Expect = 2e-98 Identities = 171/219 (78%), Positives = 198/219 (90%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC AKGPRPRYPRVWKTKR+IGTISKS KLV+CIKGLSNVKEEVYGALDSF+AWELEFPL Sbjct: 1 VCAAKGPRPRYPRVWKTKRRIGTISKSAKLVDCIKGLSNVKEEVYGALDSFVAWELEFPL 60 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 I VKKAL+ LE ++EWKRIIQVTKWMLSKGQGRTMG+YF L+NALAEDGRL+E EELW K Sbjct: 61 IAVKKALRALEEQQEWKRIIQVTKWMLSKGQGRTMGTYFTLMNALAEDGRLDEVEELWTK 120 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFSQ LE PR+ F KMISIYY R+M++++FEIFADMEELG+RP+V +V MVGNVFQ+LG Sbjct: 121 LFSQYLEGTPRMMFDKMISIYYKRDMHDQIFEIFADMEELGLRPSVSIVNMVGNVFQRLG 180 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHD 1022 M+DKY+KL KKYPPPKW YRYIKGKRV++R K+ +++ D Sbjct: 181 MMDKYEKLKKKYPPPKWIYRYIKGKRVRVRAKNDNEAGD 219 >ref|XP_006836534.1| hypothetical protein AMTR_s00131p00023460 [Amborella trichopoda] gi|548839073|gb|ERM99387.1| hypothetical protein AMTR_s00131p00023460 [Amborella trichopoda] Length = 317 Score = 362 bits (929), Expect = 2e-97 Identities = 170/216 (78%), Positives = 195/216 (90%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 +C AKGPRPRYPRVWKT+++IG+ISKS KLVECIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 20 ICVAKGPRPRYPRVWKTRKRIGSISKSEKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 79 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 I VKKALKIL+NEKEWKRIIQVTKWMLSKGQG+TMGSY+ LLNAL EDGRLEEAEELW K Sbjct: 80 IVVKKALKILQNEKEWKRIIQVTKWMLSKGQGKTMGSYYTLLNALIEDGRLEEAEELWTK 139 Query: 726 
LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 +FS+NLE +PR+FF +IS+YY M++KMFE+FADMEELG++P +V MVG+ FQKLG Sbjct: 140 IFSENLEGLPRIFFHLIISVYYKNNMHDKMFEVFADMEELGVKPNNAIVVMVGDEFQKLG 199 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK 1013 MLDKY+KL KKYPP KWEYRYIKGKRVKI +K+L + Sbjct: 200 MLDKYKKLKKKYPPLKWEYRYIKGKRVKILSKNLSQ 235 >ref|XP_002513638.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223547546|gb|EEF49041.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 359 Score = 362 bits (929), Expect = 2e-97 Identities = 186/272 (68%), Positives = 212/272 (77%), Gaps = 1/272 (0%) Frame = +3 Query: 327 FEVTFRRNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGA 506 FEV S V KG RPR PRVWKTK +IGTISKS KLVECIKGLSNVKEEVYGA Sbjct: 49 FEVIKFSKSTSSVVSALKGARPRAPRVWKTKPRIGTISKSAKLVECIKGLSNVKEEVYGA 108 Query: 507 LDSFIAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAE 686 LDS IAWELEFPLI VKKALK LENE+EWKRIIQV KWMLSKGQGRTMG+YF LLNALAE Sbjct: 109 LDSLIAWELEFPLIAVKKALKTLENEQEWKRIIQVIKWMLSKGQGRTMGTYFTLLNALAE 168 Query: 687 DGRLEEAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVP 866 D RL+EAEELW KLFS NLE PR FF KMISIYY REM+EKMFEIFADMEELG+RP+V Sbjct: 169 DERLDEAEELWTKLFSDNLEGTPRNFFDKMISIYYKREMHEKMFEIFADMEELGVRPSVS 228 Query: 867 VVTMVGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXX 1046 +V M+G+VFQKLGMLDKY+KL KKYPPPKWEYRYIKGKRV++R K +++ + Sbjct: 229 IVNMMGSVFQKLGMLDKYRKLKKKYPPPKWEYRYIKGKRVRLRAKQVNEFLGANESVNQN 288 Query: 1047 XXXXXXFDENSEDQADAVDEDNLVQ-LKEVDE 1139 ++ +E+ ++E N+ + L E+DE Sbjct: 289 AETPYISNKLNEEDNTKLNEANVEEDLNELDE 320 >gb|EXC05953.1| hypothetical protein L484_014222 [Morus notabilis] Length = 305 Score = 362 bits (928), Expect = 2e-97 Identities = 184/273 (67%), Positives = 212/273 (77%), Gaps = 7/273 (2%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC AKGPRPRY RVWKT ++IGT+SKS K V+ IK LSNVKEEVYGALDS IAWELEFPL Sbjct: 30 
VCAAKGPRPRYARVWKTNKRIGTVSKSAKFVQSIKELSNVKEEVYGALDSLIAWELEFPL 89 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 ITVKKA+K LE +KEWKRIIQVTKWMLSKGQG+TMG+YF+LLNALAEDGRLEEAEELW K Sbjct: 90 ITVKKAIKTLEEQKEWKRIIQVTKWMLSKGQGKTMGTYFILLNALAEDGRLEEAEELWTK 149 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LFS+NLES PR FF KMISIYYHR M+++MFEIFADMEELGIRP V +VTMVG VF +LG Sbjct: 150 LFSENLESTPRNFFNKMISIYYHRRMHDQMFEIFADMEELGIRPNVSIVTMVGKVFLELG 209 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDL-------DKSHDHDXXXXXXXXXXXX 1064 MLDK++KL +KYP PKWEYRYI+GKR++IR KDL D+ D Sbjct: 210 MLDKHKKLKRKYPLPKWEYRYIRGKRIRIRAKDLAKYDGDTDRGVSKDEESEHGSDEPLD 269 Query: 1065 FDENSEDQADAVDEDNLVQLKEVDECEPGEIST 1163 E+S + +DA E+ V + D E E+ST Sbjct: 270 IAESSPNGSDAESEE--VDPESNDVFEEAEMST 300 >ref|XP_004308043.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Fragaria vesca subsp. vesca] Length = 313 Score = 361 bits (926), Expect = 4e-97 Identities = 183/273 (67%), Positives = 213/273 (78%), Gaps = 6/273 (2%) Frame = +3 Query: 345 RNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIA 524 RN V+ VC KGPRPRYPRVWK+ +KIGTISKSLKLVECIKGLSNVKEEVYGALDSFIA Sbjct: 24 RNSVV--VCGLKGPRPRYPRVWKSNKKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIA 81 Query: 525 WELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEE 704 WELEFPLITVKKALK LEN+K++KRIIQV KWMLSKGQGRTMG+YF LLNALA DGRLEE Sbjct: 82 WELEFPLITVKKALKTLENQKDYKRIIQVAKWMLSKGQGRTMGTYFTLLNALAADGRLEE 141 Query: 705 AEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVG 884 AEELW KLF+Q L+SMPR+FF KMISIYY + +++KMFEIFADMEELGI+P + +V VG Sbjct: 142 AEELWTKLFTQYLDSMPRIFFDKMISIYYEKGLHDKMFEIFADMEELGIKPNMSIVNKVG 201 Query: 885 NVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTK---DLD---KSHDHDXXXXXX 1046 +VFQKLGM+DKY KL KKYPPP+WE RYIKGKRV+I+ +LD K + Sbjct: 202 DVFQKLGMMDKYTKLKKKYPPPRWEIRYIKGKRVRIQANKQGNLDGDVKMLSEEKETIHG 261 Query: 1047 XXXXXXFDENSEDQADAVDEDNLVQLKEVDECE 1145 D N ++Q +E N + +DE + Sbjct: 262 
SNEVLNADSNPDEQTVEAEEMNQILCNSLDEAD 294 >ref|NP_567622.1| pentatricopeptide repeat protein EMBRYO DEFECTIVE 1417 [Arabidopsis thaliana] gi|75246109|sp|Q8LG95.1|PP332_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21190; AltName: Full=Protein EMBRYO DEFECTIVE 1417 gi|21618230|gb|AAM67280.1| unknown [Arabidopsis thaliana] gi|51969238|dbj|BAD43311.1| putative protein [Arabidopsis thaliana] gi|51971351|dbj|BAD44340.1| putative protein [Arabidopsis thaliana] gi|51971365|dbj|BAD44347.1| putative protein [Arabidopsis thaliana] gi|332659017|gb|AEE84417.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 307 Score = 347 bits (891), Expect = 4e-93 Identities = 169/260 (65%), Positives = 207/260 (79%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC A+GPRPR PRVWKT+++IGTISK+ K++ CIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 32 VCAARGPRPRSPRVWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPL 91 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 + VKKAL ILE+EKEWK+IIQVTKWMLSKGQGRTMG+YF LLNALAED RL+EAEELW K Sbjct: 92 VIVKKALVILEDEKEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNK 151 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LF ++LE PR FF KMISIYY R+M++K+FE+FADMEELG++P V +V+MVG VF KL Sbjct: 152 LFMEHLEGTPRKFFNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLE 211 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXXXXFDENSED 1085 M DKY+KL KKYPPP+WE+RYIKG+RVK++ K L++ + + DE+ D Sbjct: 212 MKDKYEKLMKKYPPPQWEFRYIKGRRVKVKAKQLNELSEGEGGLSS--------DEDKID 263 Query: 1086 QADAVDEDNLVQLKEVDECE 1145 +E++ L E +E E Sbjct: 264 NEIESEEEDGEDLSEEEEDE 283 >ref|XP_002867860.1| EMB1417 [Arabidopsis lyrata subsp. lyrata] gi|297313696|gb|EFH44119.1| EMB1417 [Arabidopsis lyrata subsp. 
lyrata] Length = 317 Score = 346 bits (887), Expect = 1e-92 Identities = 160/216 (74%), Positives = 192/216 (88%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC A+GPRPR PRVWKT+++IGTISK+ K++ CIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 32 VCAARGPRPRSPRVWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPL 91 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 + VKKAL ILE+EKEWK+IIQVTKWMLSKGQGRTMG+YF LLNALAED RL+EAEELW K Sbjct: 92 VIVKKALVILEDEKEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNK 151 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LF ++LE PR FF KMISIYY R+M++K+FE+FADMEELG++P + +V+MVG VF KL Sbjct: 152 LFMEHLEGTPRKFFNKMISIYYKRDMHQKLFEVFADMEELGVKPNIAIVSMVGKVFVKLE 211 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK 1013 M DKY+KL KKYPPP+WE+RYIKG+RVK++ K L++ Sbjct: 212 MKDKYEKLMKKYPPPQWEFRYIKGRRVKVKAKQLNE 247 >ref|XP_006413807.1| hypothetical protein EUTSA_v10025765mg [Eutrema salsugineum] gi|567220358|ref|XP_006413808.1| hypothetical protein EUTSA_v10025765mg [Eutrema salsugineum] gi|557114977|gb|ESQ55260.1| hypothetical protein EUTSA_v10025765mg [Eutrema salsugineum] gi|557114978|gb|ESQ55261.1| hypothetical protein EUTSA_v10025765mg [Eutrema salsugineum] Length = 315 Score = 343 bits (880), Expect = 8e-92 Identities = 170/275 (61%), Positives = 208/275 (75%), Gaps = 9/275 (3%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC A+GPRPR+PRVWKTK++IG+ISK+ K++ CIK LSNVKEEVYGALDSFIAWELEFPL Sbjct: 30 VCAARGPRPRHPRVWKTKKRIGSISKAAKMLSCIKELSNVKEEVYGALDSFIAWELEFPL 89 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 + VKKAL ILE+E+EWK+IIQVTKWMLSKGQGRTMG+YF LLNALAED RL+EAEELW K Sbjct: 90 VIVKKALAILEDEREWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNK 149 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LF ++LE PR FF KMISIYY R+M+ K+FE+FADMEELG++P + +V+MVG VF KL Sbjct: 150 
LFMEHLEGTPRKFFNKMISIYYKRDMHHKLFEVFADMEELGVKPNIAIVSMVGKVFMKLE 209 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK--------SHDHDXXXXXXXXXXX 1061 M DKY+KL KKYPPP+WE+RYIKG+R+K++ K L + S D D Sbjct: 210 MKDKYEKLMKKYPPPQWEFRYIKGRRIKVKAKQLSELSEGEGGVSSDEDKTDSEIESKSE 269 Query: 1062 XFDENSEDQADAVD-EDNLVQLKEVDECEPGEIST 1163 F + +Q DA D +N E+ G+I T Sbjct: 270 MFSDEEANQ-DAEDLSENEEDENELFSGNQGQIGT 303 >ref|XP_006284192.1| hypothetical protein CARUB_v10005340mg [Capsella rubella] gi|482552897|gb|EOA17090.1| hypothetical protein CARUB_v10005340mg [Capsella rubella] Length = 304 Score = 343 bits (880), Expect = 8e-92 Identities = 170/272 (62%), Positives = 207/272 (76%), Gaps = 8/272 (2%) Frame = +3 Query: 366 VCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALDSFIAWELEFPL 545 VC A+GPRPR PRVWKT+++IG+ISK+ K++ CIKGLSNVKEEVYGALDSFIAWELEFPL Sbjct: 30 VCAARGPRPRSPRVWKTRKRIGSISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPL 89 Query: 546 ITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLEEAEELWLK 725 + VKKAL ILE+EKEWK+IIQVTKWMLSKGQGRTMG+YF LLNALAED RL+EAEELW K Sbjct: 90 VIVKKALVILEDEKEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNK 149 Query: 726 LFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMVGNVFQKLG 905 LF ++LE PR FF KMISIYY R+M+ K+FE+FADMEELG++P + +V+MVG VF KL Sbjct: 150 LFMEHLEGTPRKFFNKMISIYYKRDMHHKLFEVFADMEELGVKPNLAIVSMVGKVFVKLE 209 Query: 906 MLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK--------SHDHDXXXXXXXXXXX 1061 M DKY+KL KKYPPP+WE+RYIKG+RVK++ K L++ S D D Sbjct: 210 MQDKYEKLMKKYPPPQWEFRYIKGRRVKVKAKQLNELSEGEGGLSSDEDKVGNEIESKSN 269 Query: 1062 XFDENSEDQADAVDEDNLVQLKEVDECEPGEI 1157 + +Q D D + +E DE E E+ Sbjct: 270 MLSDKEANQ-DGEDLSEEEEEEEEDEDEEEEL 300 >gb|EEC71511.1| hypothetical protein OsI_03797 [Oryza sativa Indica Group] Length = 295 Score = 342 bits (877), Expect = 2e-91 Identities = 161/227 (70%), Positives = 190/227 (83%) Frame = +3 Query: 333 VTFRRNFVLSHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVECIKGLSNVKEEVYGALD 512 +T F VC A+GPRPRYPRVWKT+++IGT+SKS KLVEC+KGLSNVKEEVYGALD Sbjct: 20 
ITHPSKFSTLVVCGARGPRPRYPRVWKTRKRIGTVSKSQKLVECVKGLSNVKEEVYGALD 79 Query: 513 SFIAWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDG 692 SF+AWELEFPLI VKKALK LE+EKEWKRIIQV KWM +KGQG+TMGSY+ LLNAL EDG Sbjct: 80 SFVAWELEFPLIAVKKALKTLEDEKEWKRIIQVIKWMFNKGQGKTMGSYYTLLNALIEDG 139 Query: 693 RLEEAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVV 872 R+EEAEEL+ K FS+ LE +PR FF +MIS+YY E +KMFEIFADMEELG+RP ++ Sbjct: 140 RVEEAEELYGKTFSRYLEGLPRTFFMRMISLYYRLESYQKMFEIFADMEELGVRPDGSII 199 Query: 873 TMVGNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDK 1013 M+G VFQKLGMLDKY KL KKYPPPKWEYR+IKGKR++++ D+ Sbjct: 200 RMLGEVFQKLGMLDKYVKLKKKYPPPKWEYRHIKGKRIRVKVYPKDE 246 >emb|CAA17538.1| putative protein [Arabidopsis thaliana] gi|7268916|emb|CAB79119.1| putative protein [Arabidopsis thaliana] Length = 325 Score = 341 bits (875), Expect = 3e-91 Identities = 169/268 (63%), Positives = 208/268 (77%), Gaps = 6/268 (2%) Frame = +3 Query: 360 SHVCEAKGPRPRYPRVWKTKRKIGTISKSLKLVEC------IKGLSNVKEEVYGALDSFI 521 + VC A+GPRPR PRVWKT+++IGTISK+ K++ C IKGLSNVKEEVYGALDSFI Sbjct: 42 AQVCAARGPRPRSPRVWKTRKRIGTISKAAKMIACVMLSSYIKGLSNVKEEVYGALDSFI 101 Query: 522 AWELEFPLITVKKALKILENEKEWKRIIQVTKWMLSKGQGRTMGSYFMLLNALAEDGRLE 701 AWELEFPL+ VKKAL ILE+EKEWK+IIQVTKWMLSKGQGRTMG+YF LLNALAED RL+ Sbjct: 102 AWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLD 161 Query: 702 EAEELWLKLFSQNLESMPRVFFQKMISIYYHREMNEKMFEIFADMEELGIRPTVPVVTMV 881 EAEELW KLF ++LE PR FF KMISIYY R+M++K+FE+FADMEELG++P V +V+MV Sbjct: 162 EAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMV 221 Query: 882 GNVFQKLGMLDKYQKLNKKYPPPKWEYRYIKGKRVKIRTKDLDKSHDHDXXXXXXXXXXX 1061 G VF KL M DKY+KL KKYPPP+WE+RYIKG+RVK++ K L++ + + Sbjct: 222 GKVFVKLEMKDKYEKLMKKYPPPQWEFRYIKGRRVKVKAKQLNELSEGEGGLSS------ 275 Query: 1062 XFDENSEDQADAVDEDNLVQLKEVDECE 1145 DE+ D +E++ L E +E E Sbjct: 276 --DEDKIDNEIESEEEDGEDLSEEEEDE 301