BLASTX nr result
ID: Sinomenium22_contig00054992
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00054992
         (406 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_007047157.1| Pentatricopeptide repeat (PPR) superfamily p...   228   5e-58
ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containi...   223   2e-56
ref|XP_002522334.1| pentatricopeptide repeat-containing protein,...   217   1e-54
ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Popu...   217   1e-54
ref|XP_003610927.1| Pentatricopeptide repeat-containing protein ...   216   3e-54
ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containi...   216   3e-54
ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citr...   216   3e-54
ref|XP_007203614.1| hypothetical protein PRUPE_ppa002292mg [Prun...   216   3e-54
ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containi...   214   1e-53
ref|XP_007150279.1| hypothetical protein PHAVU_005G140500g [Phas...   212   4e-53
gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis]     209   2e-52
ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containi...   207   9e-52
ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containi...   203   2e-50
gb|EYU39396.1| hypothetical protein MIMGU_mgv1a026743mg, partial...   200   2e-49
ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containi...   200   2e-49
ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Caps...   199   3e-49
ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   198   6e-49
ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containi...
198 8e-49 ref|NP_195434.1| pentatricopeptide repeat-containing protein [Ar... 196 2e-48 ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutr... 192 4e-47 >ref|XP_007047157.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508699418|gb|EOX91314.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 684 Score = 228 bits (582), Expect = 5e-58 Identities = 106/135 (78%), Positives = 121/135 (89%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGFELFS+++ SGI+PNEFTFAGVLNAC+D AAE++GKQ+HG + R GF+P SFAA Sbjct: 291 GRWEEGFELFSELMKSGIRPNEFTFAGVLNACADHAAEEIGKQVHGCMTRLGFNPFSFAA 350 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGN ENAKRVF MP PDLVSWT++I+GYAQNGQPEEAL YFELLLK+GTK Sbjct: 351 SALVHMYSKCGNVENAKRVFNGMPLPDLVSWTSLITGYAQNGQPEEALEYFELLLKSGTK 410 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SACTH Sbjct: 411 PDHITFVGVLSACTH 425 Score = 105 bits (261), Expect = 9e-21 Identities = 54/133 (40%), Positives = 80/133 (60%), Gaps = 1/133 (0%) Frame = +1 Query: 4 RTREGFELFSDMLISGI-KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R +E EL+ +S + K N+FT + + A + GK+IHG I R+G D Sbjct: 190 RPKEALELYRMKEMSMVSKLNKFTVSSAIAASAAMGCLTTGKEIHGRITRAGLDLDEVVW 249 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL+ MY KCG+ E A+RVF+++ D+VSWTAMI Y ++G+ EE F L+K+G + Sbjct: 250 SALMDMYGKCGSIEEARRVFDKIVDRDIVSWTAMIDRYFEDGRWEEGFELFSELMKSGIR 309 Query: 361 PDHITFVGVISAC 399 P+ TF GV++AC Sbjct: 310 PNEFTFAGVLNAC 322 Score = 69.7 bits (169), Expect = 4e-10 Identities = 33/94 (35%), Positives = 53/94 (56%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP ++ ++ C A + GK +H +I SGF L+ MY+KCG+ +A+ Sbjct: 75 KPPASLYSTLIQLCCQNRALNEGKSVHQHIKISGFSAGLVICNRLLDMYAKCGSLADAQN 134 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 VF++M + DL SW ++SGYA+ G +EA F+ Sbjct: 135 VFDEMSERDLCSWNTLMSGYAKMGMLKEANKLFD 168 Score = 56.2 
bits (134), Expect = 5e-06 Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 2/118 (1%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMG-KQIHGYIMRSGFDPVSFA 177 G+ E E F +L SG KP+ TF GVL+AC+ D G + H R G + Sbjct: 392 GQPEEALEYFELLLKSGTKPDHITFVGVLSACTHAGLVDKGLEYFHSIKDRHGLTHTADH 451 Query: 178 AGALVHMYSKCGNTENAKRVFEQMP-QPDLVSWTAMISGYAQNGQPEEALHYFELLLK 348 ++ + ++ G + A+ + +MP +PD W +++ G +G E A E L + Sbjct: 452 YACIIDLLARSGRFQEAENIIVKMPMKPDKFLWASLLGGCRIHGNLELAEKAAEALFE 509 >ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like, partial [Vitis vinifera] Length = 621 Score = 223 bits (569), Expect = 2e-56 Identities = 102/135 (75%), Positives = 120/135 (88%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGF LFSD+L SGI PNEFTF+GVLNAC+D AAE++GKQ+HGY+ R GFDP SFAA Sbjct: 302 GRREEGFALFSDLLKSGIWPNEFTFSGVLNACADHAAEELGKQVHGYMTRIGFDPSSFAA 361 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 LVHMY+KCGN +NA+RVF MP+PDLVSWT++ISGYAQNGQP+EAL +FELLLK+GT+ Sbjct: 362 STLVHMYTKCGNIKNARRVFNGMPRPDLVSWTSLISGYAQNGQPDEALQFFELLLKSGTQ 421 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SACTH Sbjct: 422 PDHITFVGVLSACTH 436 Score = 96.3 bits (238), Expect = 4e-18 Identities = 53/130 (40%), Positives = 73/130 (56%), Gaps = 1/130 (0%) Frame = +1 Query: 13 EGFELFSDMLI-SGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGAL 189 E ELF M K N+FT + L A + + +GK+IHG+I+R G D AL Sbjct: 204 EALELFRAMQRHENFKCNKFTMSSALAASAAIQSLHLGKEIHGHILRIGLDLDGVVWSAL 263 Query: 190 VHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDH 369 MY KCG+ A+ +F++ D+VSWTAMI Y + G+ EE F LLK+G P+ Sbjct: 264 SDMYGKCGSIGEARHIFDKTVDRDVVSWTAMIDRYFKEGRREEGFALFSDLLKSGIWPNE 323 Query: 370 ITFVGVISAC 399 TF GV++AC Sbjct: 324 FTFSGVLNAC 333 Score = 77.0 bits (188), Expect = 3e-12 Identities = 39/113 (34%), Positives = 65/113 (57%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ T++ +L C A 
D G ++H + SGF P + ++ MY KC + NAKR Sbjct: 86 RPSAATYSTLLQLCLQLRALDEGMKVHAHTKTSGFVPGVVISNRILDMYIKCNSLVNAKR 145 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +F++M + DL SW MISGYA+ G+ +EA F+ + T+ D+ ++ + S Sbjct: 146 LFDEMAERDLCSWNIMISGYAKAGRLQEARKLFDQM----TERDNFSWTAMTS 194 >ref|XP_002522334.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538412|gb|EEF40018.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 507 Score = 217 bits (553), Expect = 1e-54 Identities = 99/135 (73%), Positives = 117/135 (86%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGFELF+++L SGIKPN+FTFAGVLNAC+D E +GKQ+HG++ R+ FDP SFAA Sbjct: 293 GRREEGFELFAELLRSGIKPNDFTFAGVLNACADLGVEGIGKQVHGHMTRADFDPFSFAA 352 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGN NA+RVF MPQPDLVSWT++I+GYAQNG P+EAL YFELLLK+GT+ Sbjct: 353 SALVHMYSKCGNMVNAERVFRGMPQPDLVSWTSLIAGYAQNGHPDEALQYFELLLKSGTR 412 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SAC H Sbjct: 413 PDHITFVGVLSACAH 427 Score = 108 bits (271), Expect = 6e-22 Identities = 56/133 (42%), Positives = 79/133 (59%), Gaps = 1/133 (0%) Frame = +1 Query: 4 RTREGFELFSDML-ISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R E EL+ M + N+FT + VL A + +GK+IHGYIMR+G D Sbjct: 192 RPHEALELYRLMKKCENLTSNKFTVSSVLAAAAAIPCLRIGKEIHGYIMRTGLDSDEVVW 251 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL MY KCG+ E A+ +F++M D+V+WTAMI Y ++G+ EE F LL++G K Sbjct: 252 SALSDMYGKCGSIEEARHIFDKMVNRDVVTWTAMIDRYFEDGRREEGFELFAELLRSGIK 311 Query: 361 PDHITFVGVISAC 399 P+ TF GV++AC Sbjct: 312 PNDFTFAGVLNAC 324 Score = 71.6 bits (174), Expect = 1e-10 Identities = 36/113 (31%), Positives = 67/113 (59%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ ++ +C A ++GK++H +I SGF P + L+ MY+KC + +A++ Sbjct: 77 RPSPSIYSSLIQSCLKNRALEVGKKVHDHIKLSGFIPGLVISNRLLDMYAKCNDLVDAQK 136 Query: 
235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +FE+M + DL SW +ISG A+ G +EA F+ T + D+ ++ +IS Sbjct: 137 LFEEMGERDLCSWNVLISGCAKMGLLKEARKLFD----TMPERDNFSWTAMIS 185 >ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Populus trichocarpa] gi|222867101|gb|EEF04232.1| hypothetical protein POPTR_0017s12720g [Populus trichocarpa] Length = 676 Score = 217 bits (553), Expect = 1e-54 Identities = 95/135 (70%), Positives = 122/135 (90%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR +EGF+LF+D+L SGI+PNEFTF+GVLNAC++Q +E++GK++HGY+ R GFDP SFAA Sbjct: 283 GRRKEGFDLFADLLRSGIRPNEFTFSGVLNACANQTSEELGKKVHGYMTRVGFDPFSFAA 342 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGN +A+RVF++ PQPDL SWT++I+GYAQNGQP+EA+ YFELL+K+GT+ Sbjct: 343 SALVHMYSKCGNMVSAERVFKETPQPDLFSWTSLIAGYAQNGQPDEAIRYFELLVKSGTQ 402 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SAC H Sbjct: 403 PDHITFVGVLSACAH 417 Score = 108 bits (271), Expect = 6e-22 Identities = 57/135 (42%), Positives = 80/135 (59%), Gaps = 1/135 (0%) Frame = +1 Query: 4 RTREGFELFSDMLIS-GIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R E ELF M S K N+FT + L A + +GK+IHGYIMR+G D Sbjct: 182 RPNEALELFRMMKRSDNSKSNKFTVSSALAAAAAVPCLRIGKEIHGYIMRTGLDSDEVVW 241 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL MY KCG+ E A+ +F++M D+V+WTAMI Y Q+G+ +E F LL++G + Sbjct: 242 SALSDMYGKCGSIEEARHIFDKMVDRDIVTWTAMIDRYFQDGRRKEGFDLFADLLRSGIR 301 Query: 361 PDHITFVGVISACTH 405 P+ TF GV++AC + Sbjct: 302 PNEFTFSGVLNACAN 316 Score = 74.7 bits (182), Expect = 1e-11 Identities = 36/113 (31%), Positives = 66/113 (58%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP+ ++ ++ +C GK++H +I SGF P F L+ MY+KC + ++++ Sbjct: 67 KPSASVYSTLIQSCIKSRLLQQGKKVHQHIKLSGFVPGLFILNRLLEMYAKCDSLMDSQK 126 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +F++MP+ DL SW +ISGYA+ G +EA F+ + + D+ ++ +IS Sbjct: 127 
LFDEMPERDLCSWNILISGYAKMGLLQEAKSLFDKM----PERDNFSWTAMIS 175 >ref|XP_003610927.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355512262|gb|AES93885.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 802 Score = 216 bits (550), Expect = 3e-54 Identities = 97/135 (71%), Positives = 118/135 (87%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR +EGF LF D++ SG++PNE+TFAGVLNAC+D AAE MGK++HGY+ R G+DP SFAA Sbjct: 276 GRKKEGFSLFRDLMGSGVRPNEYTFAGVLNACADLAAEQMGKEVHGYMTRVGYDPFSFAA 335 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVH+YSKCGNTE A+RVF QMP+PDLVSWT++I GYAQNGQP+ AL +FE LL++GTK Sbjct: 336 SALVHVYSKCGNTETARRVFNQMPRPDLVSWTSLIVGYAQNGQPDMALQFFESLLRSGTK 395 Query: 361 PDHITFVGVISACTH 405 PD ITFVGV+SACTH Sbjct: 396 PDEITFVGVLSACTH 410 Score = 91.7 bits (226), Expect = 1e-16 Identities = 43/113 (38%), Positives = 68/113 (60%) Frame = +1 Query: 61 NEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRVF 240 N FT + L A + ++ GK+IHGY++RSG + AL+ +Y KCG+ A+ +F Sbjct: 195 NMFTLSSALAAAAAISSLRRGKEIHGYLIRSGLELDEVVWTALLDLYGKCGSLNEARGIF 254 Query: 241 EQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISAC 399 +QM D+VSWT MI ++G+ +E F L+ +G +P+ TF GV++AC Sbjct: 255 DQMADKDIVSWTTMIHRCFEDGRKKEGFSLFRDLMGSGVRPNEYTFAGVLNAC 307 Score = 77.0 bits (188), Expect = 3e-12 Identities = 34/94 (36%), Positives = 58/94 (61%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ ++ AC ++GK++H + S F P + L+HMY+KCG+ +A+ Sbjct: 60 QPSPRLYSTLIAACLRHRKLELGKRVHAHTKASNFIPGIVISNRLIHMYAKCGSLVDAQM 119 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 +F+++PQ DL SW MISGYA G+ E+A F+ Sbjct: 120 LFDEIPQKDLCSWNTMISGYANVGRIEQARKLFD 153 >ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Citrus sinensis] Length = 695 Score = 216 bits (549), Expect = 3e-54 Identities = 96/135 (71%), Positives = 119/135 (88%) Frame = +1 
Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGF LFS+++ SGI+PN FTFAGVLNAC+D AAE++GKQ+HGY+ R G+DP SFAA Sbjct: 302 GRREEGFALFSELIKSGIRPNAFTFAGVLNACADHAAEELGKQVHGYMTRIGYDPYSFAA 361 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGN EN+K+VF MP+PDLVSWT++I+GYAQNG P++AL YFELLLK+GT+ Sbjct: 362 SALVHMYSKCGNVENSKKVFNGMPRPDLVSWTSLIAGYAQNGMPDKALEYFELLLKSGTQ 421 Query: 361 PDHITFVGVISACTH 405 PD+I FVGV++ACTH Sbjct: 422 PDNIVFVGVLTACTH 436 Score = 108 bits (271), Expect = 6e-22 Identities = 52/113 (46%), Positives = 72/113 (63%) Frame = +1 Query: 61 NEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRVF 240 N+FT + L+A S +GK+IHGYIMR+GFD AL MY KCG+ A+++F Sbjct: 221 NKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEARQIF 280 Query: 241 EQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISAC 399 ++M D+VSWTAMI Y Q G+ EE F L+K+G +P+ TF GV++AC Sbjct: 281 DKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNAC 333 Score = 74.7 bits (182), Expect = 1e-11 Identities = 35/112 (31%), Positives = 66/112 (58%) Frame = +1 Query: 58 PNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRV 237 P+ ++ ++ C A + GK++H ++ SGF P F + L+ MY+KCGN +A+ + Sbjct: 87 PSPSIYSSLIQFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTL 146 Query: 238 FEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 F++M + D+ S+ MISGY + G E+A + F+ + + D+ ++ +IS Sbjct: 147 FDEMHERDVCSYNTMISGYTKVGFLEQARNLFDEM----PQRDNFSWTAIIS 194 >ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citrus clementina] gi|557527815|gb|ESR39065.1| hypothetical protein CICLE_v10024955mg [Citrus clementina] Length = 759 Score = 216 bits (549), Expect = 3e-54 Identities = 96/135 (71%), Positives = 119/135 (88%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGF LFS+++ SGI+PN FTFAGVLNAC+D AAE++GKQ+HGY+ R G+DP SFAA Sbjct: 366 GRREEGFALFSELIKSGIRPNAFTFAGVLNACADHAAEELGKQVHGYMTRIGYDPYSFAA 425 Query: 181 
GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGN EN+K+VF MP+PDLVSWT++I+GYAQNG P++AL YFELLLK+GT+ Sbjct: 426 SALVHMYSKCGNVENSKKVFNGMPRPDLVSWTSLIAGYAQNGMPDKALEYFELLLKSGTQ 485 Query: 361 PDHITFVGVISACTH 405 PD+I FVGV++ACTH Sbjct: 486 PDNIVFVGVLTACTH 500 Score = 108 bits (271), Expect = 6e-22 Identities = 52/113 (46%), Positives = 72/113 (63%) Frame = +1 Query: 61 NEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRVF 240 N+FT + L+A S +GK+IHGYIMR+GFD AL MY KCG+ A+++F Sbjct: 285 NKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEARQIF 344 Query: 241 EQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISAC 399 ++M D+VSWTAMI Y Q G+ EE F L+K+G +P+ TF GV++AC Sbjct: 345 DKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNAC 397 Score = 74.7 bits (182), Expect = 1e-11 Identities = 35/112 (31%), Positives = 66/112 (58%) Frame = +1 Query: 58 PNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRV 237 P+ ++ ++ C A + GK++H ++ SGF P F + L+ MY+KCGN +A+ + Sbjct: 151 PSPSIYSSLIQFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTL 210 Query: 238 FEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 F++M + D+ S+ MISGY + G E+A + F+ + + D+ ++ +IS Sbjct: 211 FDEMHERDVCSYNTMISGYTKVGFLEQARNLFDEM----PQRDNFSWTAIIS 258 >ref|XP_007203614.1| hypothetical protein PRUPE_ppa002292mg [Prunus persica] gi|462399145|gb|EMJ04813.1| hypothetical protein PRUPE_ppa002292mg [Prunus persica] Length = 691 Score = 216 bits (549), Expect = 3e-54 Identities = 99/135 (73%), Positives = 117/135 (86%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 G+ EGF LFS+++ SGI+PNEFTFAGVLNAC+ AAE++GKQ+HGY+ R GFDP+SFA+ Sbjct: 298 GKREEGFALFSELMKSGIRPNEFTFAGVLNACAHHAAENLGKQVHGYMTRIGFDPLSFAS 357 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGNT NA VF+ MP PD+VSWT++I GYAQNGQP EAL FELLLK+GTK Sbjct: 358 SALVHMYSKCGNTVNANMVFKGMPHPDVVSWTSLIVGYAQNGQPYEALQLFELLLKSGTK 417 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SACTH Sbjct: 
418 PDHITFVGVLSACTH 432 Score = 112 bits (281), Expect = 4e-23 Identities = 57/135 (42%), Positives = 82/135 (60%), Gaps = 1/135 (0%) Frame = +1 Query: 4 RTREGFELFSDMLI-SGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R +E +L+ M K N+FT + L A + + +GK+IHG+IMR+G D Sbjct: 197 RPKEALQLYRMMQRHDNSKSNKFTVSSALAASAAIQSLRLGKEIHGFIMRTGLDSDEVVW 256 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL MY KCG+ E AKR+F++M D+VSWTAMI Y ++G+ EE F L+K+G + Sbjct: 257 SALSDMYGKCGSIEEAKRIFDKMVNRDVVSWTAMIDRYFEDGKREEGFALFSELMKSGIR 316 Query: 361 PDHITFVGVISACTH 405 P+ TF GV++AC H Sbjct: 317 PNEFTFAGVLNACAH 331 Score = 74.7 bits (182), Expect = 1e-11 Identities = 39/113 (34%), Positives = 64/113 (56%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ +L C Q A GK +H + SGF P F L+ +Y+KCG+ +A++ Sbjct: 82 RPSASIYSTLLQLCLQQRALVQGKLVHAHTKVSGFVPGLFICNRLIDLYAKCGSLVDAQK 141 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 VF++M + DL SW MISGYA+ G EA F+ + + D+ ++ +IS Sbjct: 142 VFDEMSERDLCSWNTMISGYAKVGLLGEARKLFDEM----PEKDNFSWTAMIS 190 >ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Glycine max] Length = 693 Score = 214 bits (545), Expect = 1e-53 Identities = 95/135 (70%), Positives = 118/135 (87%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGF LF D++ SG++PNE+TFAGVLNAC+D AAE +GK++HGY+M +G+DP SFA Sbjct: 300 GRREEGFLLFRDLMQSGVRPNEYTFAGVLNACADHAAEHLGKEVHGYMMHAGYDPGSFAI 359 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGNT A+RVF +M QPDLVSWT++I GYAQNGQP+EALH+FELLL++GTK Sbjct: 360 SALVHMYSKCGNTRVARRVFNEMHQPDLVSWTSLIVGYAQNGQPDEALHFFELLLQSGTK 419 Query: 361 PDHITFVGVISACTH 405 PD +T+VGV+SACTH Sbjct: 420 PDQVTYVGVLSACTH 434 Score = 91.7 bits (226), Expect = 1e-16 Identities = 48/131 (36%), Positives = 75/131 (57%), Gaps = 1/131 (0%) Frame = +1 Query: 10 
REGFELFSDMLI-SGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGA 186 RE ELF M N+FT + L A + +GK+IHGY++R+ + A Sbjct: 201 REALELFRVMQRHERSSSNKFTLSSALAASAAIPCLRLGKEIHGYLIRTELNLDEVVWSA 260 Query: 187 LVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPD 366 L+ +Y KCG+ + A+ +F+QM D+VSWT MI ++G+ EE F L+++G +P+ Sbjct: 261 LLDLYGKCGSLDEARGIFDQMKDRDVVSWTTMIHRCFEDGRREEGFLLFRDLMQSGVRPN 320 Query: 367 HITFVGVISAC 399 TF GV++AC Sbjct: 321 EYTFAGVLNAC 331 Score = 70.9 bits (172), Expect = 2e-10 Identities = 32/94 (34%), Positives = 57/94 (60%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ ++ AC A ++G+++H + S F P F + L+ MY+KCG+ +A+ Sbjct: 84 RPSARVYSTLIAACVRHRALELGRRVHAHTKASNFVPGVFISNRLLDMYAKCGSLVDAQM 143 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 +F++M DL SW MI GYA+ G+ E+A F+ Sbjct: 144 LFDEMGHRDLCSWNTMIVGYAKLGRLEQARKLFD 177 >ref|XP_007150279.1| hypothetical protein PHAVU_005G140500g [Phaseolus vulgaris] gi|561023543|gb|ESW22273.1| hypothetical protein PHAVU_005G140500g [Phaseolus vulgaris] Length = 681 Score = 212 bits (540), Expect = 4e-53 Identities = 95/135 (70%), Positives = 114/135 (84%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EG LF D++ SG++PNE+TFAGVLN C+D AAE +GK++HGY+MR G+DP SFA Sbjct: 288 GRKEEGLSLFRDLMWSGVRPNEYTFAGVLNECADHAAEHLGKEVHGYMMRVGYDPCSFAV 347 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGNT A+RVF MP DLVSWT++I GYAQNG+PEEALH+FELLL++GTK Sbjct: 348 SALVHMYSKCGNTRVARRVFNHMPHKDLVSWTSLIVGYAQNGEPEEALHFFELLLQSGTK 407 Query: 361 PDHITFVGVISACTH 405 PD ITFVGV+SACTH Sbjct: 408 PDQITFVGVLSACTH 422 Score = 92.0 bits (227), Expect = 8e-17 Identities = 49/133 (36%), Positives = 74/133 (55%), Gaps = 1/133 (0%) Frame = +1 Query: 4 RTREGFELFSDML-ISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R E ELF M N+FT + L A + +GK+IHGY+MR+ + Sbjct: 187 RPWEALELFRVMQRCERSNSNKFTLSSALAASAAIPCLRLGKEIHGYLMRTELNLDEVVW 246 Query: 181 
GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL+ +Y KCG+ + A+ +F+QM D+VSWT MI ++G+ EE L F L+ +G + Sbjct: 247 SALLDLYGKCGSLDEARGIFDQMKSKDVVSWTTMIHRCFEDGRKEEGLSLFRDLMWSGVR 306 Query: 361 PDHITFVGVISAC 399 P+ TF GV++ C Sbjct: 307 PNEYTFAGVLNEC 319 Score = 67.4 bits (163), Expect = 2e-09 Identities = 31/94 (32%), Positives = 56/94 (59%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ ++ AC A ++G+++H + S F F L+ MY+KCG+ +A+ Sbjct: 72 RPSARAYSTLIAACVRHRALELGRRVHAHTKGSNFVLGVFICNRLLDMYAKCGSLVDAQM 131 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 +F++M DL SW MI+GYA+ G+ E+A F+ Sbjct: 132 LFDEMGHRDLCSWNTMIAGYAKLGRLEQARKLFD 165 >gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis] Length = 742 Score = 209 bits (533), Expect = 2e-52 Identities = 94/135 (69%), Positives = 116/135 (85%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR+++GF LF +++ SG +PN FTF+GVLNAC+D AA D+GKQ+HGY+ R GFDP+SFAA Sbjct: 301 GRSKDGFSLFMELMSSGTRPNGFTFSGVLNACADHAAGDLGKQVHGYMTRIGFDPLSFAA 360 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMY+KCGN ENAKRVF+ MP+PDLVSWT++I GYAQ+GQP EAL FE L K+G K Sbjct: 361 SALVHMYAKCGNIENAKRVFKGMPKPDLVSWTSLIVGYAQHGQPNEALQMFESLHKSGIK 420 Query: 361 PDHITFVGVISACTH 405 PDH+TFVGV+SACTH Sbjct: 421 PDHVTFVGVLSACTH 435 Score = 104 bits (259), Expect = 1e-20 Identities = 52/131 (39%), Positives = 83/131 (63%), Gaps = 1/131 (0%) Frame = +1 Query: 10 REGFELFSDML-ISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGA 186 +EG EL+ M + ++FT + VL A + + +GK+IHGY+MR+G D A Sbjct: 202 KEGLELYRMMQRCEKSRCDKFTVSSVLAAAAAIPSLRVGKEIHGYVMRTGLDSDEVVLSA 261 Query: 187 LVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPD 366 L+ MY KCGN + A+RVF++M + D+V+WTAMI ++G+ ++ F L+ +GT+P+ Sbjct: 262 LLDMYGKCGNIDEARRVFDKMVERDVVTWTAMIDRCFRSGRSKDGFSLFMELMSSGTRPN 321 Query: 367 HITFVGVISAC 399 TF GV++AC Sbjct: 322 GFTFSGVLNAC 332 Score = 76.6 bits (187), Expect = 
3e-12 Identities = 38/113 (33%), Positives = 64/113 (56%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ +L C + A + GK +H + SG P F + + +Y+KCG +A++ Sbjct: 85 RPSALIYSTILCHCLHERALEEGKLVHAHTKASGLVPGLFISNRFIDLYAKCGCLGDARK 144 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 VF++MP DL SW MISGYA+ G+ +EA F+ + DH ++ +IS Sbjct: 145 VFDEMPDKDLCSWNTMISGYAKVGKLDEARRLFDEM----PDRDHYSWSAMIS 193 >ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Fragaria vesca subsp. vesca] Length = 757 Score = 207 bits (528), Expect = 9e-52 Identities = 94/135 (69%), Positives = 114/135 (84%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 G+ EG LFS+++ +GI+PNEFTFAGVLNAC+D A E++GKQ+HGY+ R FDP SFAA Sbjct: 364 GKREEGLALFSELMRTGIRPNEFTFAGVLNACADHAIENLGKQVHGYMTRIEFDPFSFAA 423 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCGNT NA +VF+ MP PDLVSWT++I GYAQNGQ +EAL FE LLK+GT+ Sbjct: 424 SALVHMYSKCGNTANANKVFKGMPSPDLVSWTSLIVGYAQNGQADEALQLFESLLKSGTR 483 Query: 361 PDHITFVGVISACTH 405 PDH+TFVGV+SACTH Sbjct: 484 PDHVTFVGVLSACTH 498 Score = 106 bits (264), Expect = 4e-21 Identities = 57/133 (42%), Positives = 81/133 (60%), Gaps = 1/133 (0%) Frame = +1 Query: 4 RTREGFELFSDMLIS-GIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R E EL+ M K ++FT + VL A + + MGK+IH YIMR+G D Sbjct: 263 RPDEALELYRVMRKEESSKCSKFTVSSVLVASAAVQSLRMGKEIHCYIMRTGLDSDEVVW 322 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL MY KCG+ E A+RVF++M D+V+WTAM+ Y ++G+ EE L F L++TG + Sbjct: 323 SALSDMYGKCGSIEEARRVFDKMVNRDVVTWTAMMGRYFEDGKREEGLALFSELMRTGIR 382 Query: 361 PDHITFVGVISAC 399 P+ TF GV++AC Sbjct: 383 PNEFTFAGVLNAC 395 Score = 72.8 bits (177), Expect = 5e-11 Identities = 33/93 (35%), Positives = 55/93 (59%) Frame = +1 Query: 58 PNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRV 237 P+ ++ +L+ C A D K +H + GFD F + +++Y+KCG+ +A++V 
Sbjct: 149 PSSSLYSTLLHHCLQHRALDQAKLVHSHTKLYGFDLGLFISNRFINLYAKCGSLVDAQKV 208 Query: 238 FEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 F++MP DL SW MISGYA+ G+ +A F+ Sbjct: 209 FDEMPDRDLCSWNTMISGYAKLGKLGDARKLFD 241 >ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Solanum tuberosum] Length = 695 Score = 203 bits (516), Expect = 2e-50 Identities = 90/135 (66%), Positives = 116/135 (85%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EG+ LFS ++ SGI+PN+FTFAGVLNAC+ Q E GKQ+HGY+ R GFDP+SFAA Sbjct: 302 GRWEEGYLLFSCLMESGIRPNDFTFAGVLNACAHQTTEHFGKQVHGYMTRIGFDPLSFAA 361 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 LVHMY+KCG+ ++A +VF+++P+PD+VSWT++I+GYAQNGQP EAL F+LLLK+GT+ Sbjct: 362 STLVHMYAKCGSVDSAYKVFKRLPRPDVVSWTSLINGYAQNGQPSEALQLFDLLLKSGTQ 421 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SACTH Sbjct: 422 PDHITFVGVLSACTH 436 Score = 102 bits (255), Expect = 4e-20 Identities = 47/118 (39%), Positives = 74/118 (62%) Frame = +1 Query: 52 IKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAK 231 +K N+FT + L A + + +GK+IHG+I+R+G D + AL MY KCG+ + A+ Sbjct: 218 VKCNKFTISSALAASASVQSLRLGKEIHGHIVRTGLDSDAVVWSALSDMYGKCGSVDEAR 277 Query: 232 RVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISACTH 405 +F++ D+VSWTAMI Y +G+ EE F L+++G +P+ TF GV++AC H Sbjct: 278 HIFDRTKDKDVVSWTAMIDRYFGDGRWEEGYLLFSCLMESGIRPNDFTFAGVLNACAH 335 Score = 67.0 bits (162), Expect = 3e-09 Identities = 35/113 (30%), Positives = 61/113 (53%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ F+ +L C D A + GK++H + SGF P + ++ Y KC +A Sbjct: 86 RPSATVFSTLLRICIDNRALEEGKRVHKSMKCSGFRPGVVISNRILDFYCKCDKPFDAHN 145 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +F +MP+ DL SW M+SG+A+ G +EA F+ + + D+ ++ +IS Sbjct: 146 LFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEM----PEKDNFSWTAMIS 194 >gb|EYU39396.1| hypothetical protein MIMGU_mgv1a026743mg, partial [Mimulus guttatus] Length 
= 670 Score = 200 bits (509), Expect = 2e-49 Identities = 92/135 (68%), Positives = 112/135 (82%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 G+ EG LFSD L SGIKPNEFTFAGVLNAC+ Q AE++G+Q+HG +MR GFDP SFAA Sbjct: 277 GKWEEGLSLFSDFLSSGIKPNEFTFAGVLNACAHQTAEELGRQVHGLMMRIGFDPSSFAA 336 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMY+KCG+ E A RVF +P+PDLVS+T++I+GYAQNGQP EAL F+ L+K+G K Sbjct: 337 SALVHMYTKCGSVERANRVFNWLPKPDLVSYTSLINGYAQNGQPHEALKLFDSLVKSGNK 396 Query: 361 PDHITFVGVISACTH 405 DH+TFVGV+SACTH Sbjct: 397 LDHVTFVGVLSACTH 411 Score = 94.0 bits (232), Expect = 2e-17 Identities = 46/115 (40%), Positives = 67/115 (58%) Frame = +1 Query: 61 NEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKRVF 240 N+FT + L A + + +GK+IH +I R G D + AL+ +Y KCG+ AK +F Sbjct: 196 NKFTISSALAASAAIQSLRLGKEIHAHITRMGLDSDAVVWSALLDVYGKCGSLNEAKYIF 255 Query: 241 EQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISACTH 405 ++ D+VSWT MI Y +G+ EE L F L +G KP+ TF GV++AC H Sbjct: 256 DRTVGNDIVSWTTMIDRYFGDGKWEEGLSLFSDFLSSGIKPNEFTFAGVLNACAH 310 Score = 79.0 bits (193), Expect = 7e-13 Identities = 38/113 (33%), Positives = 67/113 (59%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ +A VL C ++ A D GK++H +I SGF P F + ++ +Y KC + +A++ Sbjct: 61 RPSASLYAAVLQLCIEKRALDEGKRVHSHIKGSGFAPGVFISNKILDLYCKCESISDARK 120 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +F++M D+ SW +ISGYA+ G+ EA F+ + K D+ ++ +IS Sbjct: 121 LFDEMGDRDVCSWNTLISGYAKMGRVSEARKLFDEM----PKRDNFSWTAMIS 169 >ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Solanum lycopersicum] Length = 695 Score = 200 bits (508), Expect = 2e-49 Identities = 88/135 (65%), Positives = 115/135 (85%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EG+ LFS ++ SGI+PN+FTFAGVLNAC+ Q E GKQ+HGY+MR GFDP+SFAA Sbjct: 302 
GRWEEGYLLFSCLMYSGIRPNDFTFAGVLNACAHQTKEHFGKQVHGYMMRIGFDPLSFAA 361 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 LVHMY+KCG+ ++A +VF+++P+PD+VSWT++I+GYAQN QP EAL ++ LLK+GT+ Sbjct: 362 STLVHMYAKCGSVDSAYKVFKRLPKPDVVSWTSLINGYAQNSQPSEALQLYDSLLKSGTQ 421 Query: 361 PDHITFVGVISACTH 405 PDHITFVGV+SACTH Sbjct: 422 PDHITFVGVLSACTH 436 Score = 98.6 bits (244), Expect = 8e-19 Identities = 50/129 (38%), Positives = 77/129 (59%), Gaps = 1/129 (0%) Frame = +1 Query: 22 ELFSDMLIS-GIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHM 198 EL+ ML K N+FT + L A + + +GK+I+G+I+R+G D + AL M Sbjct: 207 ELYRVMLRDENFKCNKFTISSALAASASIQSLRLGKEIYGHIVRTGLDSDAVVWSALSDM 266 Query: 199 YSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITF 378 Y KCG+ + A+ +F++ D+VSWTAMI Y +G+ EE F L+ +G +P+ TF Sbjct: 267 YGKCGSVDEARHIFDRTKDKDVVSWTAMIDRYFGDGRWEEGYLLFSCLMYSGIRPNDFTF 326 Query: 379 VGVISACTH 405 GV++AC H Sbjct: 327 AGVLNACAH 335 Score = 67.8 bits (164), Expect = 2e-09 Identities = 35/113 (30%), Positives = 62/113 (54%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ F+ +L C D A + GK++H + SGF P + ++ Y KC +A+ Sbjct: 86 RPSATVFSTLLRICIDNRALEEGKRVHKIMKCSGFRPGVVISNRVLDFYCKCDKPFDAQN 145 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVIS 393 +F +MP+ DL SW M+SG+A+ G +EA F+ + + D+ ++ +IS Sbjct: 146 LFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEM----PEKDNFSWTAMIS 194 >ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Capsella rubella] gi|482551949|gb|EOA16142.1| hypothetical protein CARUB_v10004277mg [Capsella rubella] Length = 690 Score = 199 bits (507), Expect = 3e-49 Identities = 89/134 (66%), Positives = 113/134 (84%) Frame = +1 Query: 4 RTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAG 183 R REGF LFS+++ S +PNE+TFAG+LNAC+D ED+GKQ+HGY+ R GFDP SFA+ Sbjct: 298 RWREGFSLFSELIGSCERPNEYTFAGILNACADLTKEDLGKQVHGYMTRIGFDPYSFASS 357 Query: 184 ALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKP 363 +LV MY+KCGN E+AK V 
+ P+PDLVSWT++I GYAQNG+P+EAL YF+LLLK+GTKP Sbjct: 358 SLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGYAQNGKPDEALKYFDLLLKSGTKP 417 Query: 364 DHITFVGVISACTH 405 DH+TFV V+SACTH Sbjct: 418 DHVTFVNVLSACTH 431 Score = 84.0 bits (206), Expect = 2e-14 Identities = 43/130 (33%), Positives = 73/130 (56%), Gaps = 1/130 (0%) Frame = +1 Query: 13 EGFELFSDML-ISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGAL 189 E L+S M + +PN FT + + A + GK+IHG+I+R+G D +L Sbjct: 199 EALVLYSLMQRVPNSRPNIFTVSSAVAAAAAIPCIRRGKEIHGHIVRAGLDSDEVLWSSL 258 Query: 190 VHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDH 369 + MY KCG + A+ +F+++ D+VSWT+MI Y ++ + E F L+ + +P+ Sbjct: 259 IDMYGKCGCIDEARNIFDKILVKDVVSWTSMIDRYFKSRRWREGFSLFSELIGSCERPNE 318 Query: 370 ITFVGVISAC 399 TF G+++AC Sbjct: 319 YTFAGILNAC 328 Score = 78.6 bits (192), Expect = 9e-13 Identities = 36/94 (38%), Positives = 57/94 (60%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP T+ ++ CS A + GK++H +I SGF P L+ MY+KCG+ +A++ Sbjct: 81 KPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLGMYAKCGSLVDARK 140 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 VF+ MP+ D+ SW M++GYA+ G +EA F+ Sbjct: 141 VFDDMPKRDVCSWNLMVNGYAEVGLVDEARKLFD 174 >ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g37170-like [Cicer arietinum] Length = 700 Score = 198 bits (504), Expect = 6e-49 Identities = 90/135 (66%), Positives = 116/135 (85%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR G LF +++ SG++PNE+TFAGVLNAC+D A E +GK++HGY++R G++P SFAA Sbjct: 272 GRKEGGLSLFRNLMGSGVRPNEYTFAGVLNACADLAIERIGKEVHGYMIRVGYNPCSFAA 331 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVH+YSKCGNTE A+RVF +MP+PDLVS T++I GYAQNGQP+ AL++FELLL++GTK Sbjct: 332 SALVHLYSKCGNTEIARRVFNKMPRPDLVSCTSLIVGYAQNGQPDMALNFFELLLRSGTK 391 Query: 361 PDHITFVGVISACTH 405 PD ITFVGV+SACTH Sbjct: 392 PDEITFVGVLSACTH 406 Score = 95.1 bits (235), Expect = 9e-18 Identities = 
49/133 (36%), Positives = 76/133 (57%), Gaps = 1/133 (0%) Frame = +1 Query: 4 RTREGFELFSDMLI-SGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R RE +LF M N FT + L A + + +GK+IHGY++R+ + Sbjct: 171 RHREALDLFRTMQEHESSNSNMFTLSSALAAAAAIRSLRLGKEIHGYLVRTELNLDEVVW 230 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 AL+ +Y KCG+ + A+ +F+QM D+VSWT MI Y ++G+ E L F L+ +G + Sbjct: 231 SALLDLYGKCGSLDEARGIFDQMVDRDVVSWTTMIHRYFEDGRKEGGLSLFRNLMGSGVR 290 Query: 361 PDHITFVGVISAC 399 P+ TF GV++AC Sbjct: 291 PNEYTFAGVLNAC 303 Score = 74.7 bits (182), Expect = 1e-11 Identities = 33/94 (35%), Positives = 56/94 (59%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 +P+ ++ ++ AC + ++G+++H + S F P F + L+HMY KCG +A+ Sbjct: 56 QPSPRLYSNLIAACLHHRSLELGRKVHAHTKASNFIPGIFISNRLLHMYVKCGGLIDAQS 115 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 +F++M Q DL SW MI+GYA G E+A F+ Sbjct: 116 LFDEMSQKDLCSWNTMIAGYANLGHLEQARKLFD 149 >ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Cucumis sativus] gi|449484855|ref|XP_004156999.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Cucumis sativus] Length = 724 Score = 198 bits (503), Expect = 8e-49 Identities = 92/135 (68%), Positives = 111/135 (82%) Frame = +1 Query: 1 GRTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 GR EGF LF ++ S I PN+FTFAGVLNAC+D AAED+GKQIH Y++R GFD S AA Sbjct: 331 GRREEGFALFRHLMNSNIMPNDFTFAGVLNACADLAAEDLGKQIHAYMVRVGFDSFSSAA 390 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 ALVHMYSKCG+ ENAK VFE +PQPDL SWT+++ GYAQ+GQ ++ALH+FELLLK+GTK Sbjct: 391 SALVHMYSKCGDIENAKSVFEILPQPDLFSWTSLLVGYAQHGQHDKALHFFELLLKSGTK 450 Query: 361 PDHITFVGVISACTH 405 PD I F+GV+SAC H Sbjct: 451 PDGIAFIGVLSACAH 465 Score = 97.1 bits (240), Expect = 2e-18 Identities = 52/133 (39%), Positives = 75/133 (56%), Gaps = 1/133 (0%) Frame = +1 Query: 4 
RTREGFELFSDMLISGI-KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAA 180 R E EL+ M K N+ T + L A + + MGK+IHG+IMR G D Sbjct: 230 RPEEALELYRLMQKHDYSKSNKCTISSALAASAAIPSLHMGKKIHGHIMRMGLDSDEVVW 289 Query: 181 GALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTK 360 +L+ MY KCG+ E A+ +F++M + D+VSWT MI Y +NG+ EE F L+ + Sbjct: 290 CSLLDMYGKCGSIEEARYIFDKMEERDVVSWTTMIHTYLKNGRREEGFALFRHLMNSNIM 349 Query: 361 PDHITFVGVISAC 399 P+ TF GV++AC Sbjct: 350 PNDFTFAGVLNAC 362 Score = 73.6 bits (179), Expect = 3e-11 Identities = 40/115 (34%), Positives = 64/115 (55%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP + +L C Q A GKQ+H +I SG + + + L+ MY+KCG+ +A++ Sbjct: 116 KPYASIYLTLLKFCLKQRALKEGKQVHAHIKTSGSIGL-YISNRLLDMYAKCGSLVDAEK 174 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISAC 399 VF++M DL SW MISGY + G E+A + F+ + D+ ++ +IS C Sbjct: 175 VFDEMVHRDLCSWNIMISGYVKGGNFEKARNLFDKM----PNRDNFSWTAIISGC 225 >ref|NP_195434.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75097747|sp|O23169.1|PP353_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g37170 gi|2464864|emb|CAB16758.1| putative protein [Arabidopsis thaliana] gi|7270666|emb|CAB80383.1| putative protein [Arabidopsis thaliana] gi|332661361|gb|AEE86761.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 691 Score = 196 bits (499), Expect = 2e-48 Identities = 89/134 (66%), Positives = 112/134 (83%) Frame = +1 Query: 4 RTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAG 183 R REGF LFS+++ S +PNE+TFAGVLNAC+D E++GKQ+HGY+ R GFDP SFA+ Sbjct: 299 RWREGFSLFSELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASS 358 Query: 184 ALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKP 363 +LV MY+KCGN E+AK V + P+PDLVSWT++I G AQNGQP+EAL YF+LLLK+GTKP Sbjct: 359 SLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKP 418 Query: 364 DHITFVGVISACTH 405 DH+TFV V+SACTH Sbjct: 419 DHVTFVNVLSACTH 432 Score = 84.7 bits 
(208), Expect = 1e-14 Identities = 44/130 (33%), Positives = 74/130 (56%), Gaps = 1/130 (0%) Frame = +1 Query: 13 EGFELFSDML-ISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGAL 189 E L+S M + +PN FT + + A + GK+IHG+I+R+G D +L Sbjct: 200 EALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSL 259 Query: 190 VHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDH 369 + MY KCG + A+ +F+++ + D+VSWT+MI Y ++ + E F L+ + +P+ Sbjct: 260 MDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNE 319 Query: 370 ITFVGVISAC 399 TF GV++AC Sbjct: 320 YTFAGVLNAC 329 Score = 82.0 bits (201), Expect = 8e-14 Identities = 38/94 (40%), Positives = 57/94 (60%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP T+ ++ CS A + GK++H +I SGF P L+ MY+KCG+ +A++ Sbjct: 82 KPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLVDARK 141 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 VF++MP DL SW M++GYA+ G EEA F+ Sbjct: 142 VFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFD 175 >ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutrema salsugineum] gi|557113114|gb|ESQ53397.1| hypothetical protein EUTSA_v10026762mg [Eutrema salsugineum] Length = 694 Score = 192 bits (488), Expect = 4e-47 Identities = 86/134 (64%), Positives = 112/134 (83%) Frame = +1 Query: 4 RTREGFELFSDMLISGIKPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAG 183 R REGF LFS+++ S +PNE+TFAGVLNAC+D E++GKQ+HGY+ R G+DP SFA+ Sbjct: 302 RWREGFCLFSELVSSCERPNEYTFAGVLNACTDLTTEELGKQVHGYMTRIGYDPYSFASS 361 Query: 184 ALVHMYSKCGNTENAKRVFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKP 363 +LV MY+KCGN ++AK V + P+PDL SWT++I GYAQNG+PE+AL YF+LLL++GTKP Sbjct: 362 SLVDMYTKCGNIQSAKHVVDGCPKPDLFSWTSLIGGYAQNGEPEKALKYFDLLLESGTKP 421 Query: 364 DHITFVGVISACTH 405 DHITFV V+SACTH Sbjct: 422 DHITFVNVLSACTH 435 Score = 86.7 bits (213), Expect = 3e-15 Identities = 42/116 (36%), Positives = 67/116 (57%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KPN FT + + A + GK+IHG+I R+G D +L+ MY KCG + A+ 
Sbjct: 218 KPNIFTVSSAVAAAAAIPCIRRGKEIHGHIFRAGLDSDEVLWSSLMDMYGKCGCIDEARH 277 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFELLLKTGTKPDHITFVGVISACT 402 +F+++ D+VSWT+MI Y ++ + E F L+ + +P+ TF GV++ACT Sbjct: 278 IFDKIVDKDVVSWTSMIDRYFKSRRWREGFCLFSELVSSCERPNEYTFAGVLNACT 333 Score = 80.9 bits (198), Expect = 2e-13 Identities = 36/94 (38%), Positives = 58/94 (61%) Frame = +1 Query: 55 KPNEFTFAGVLNACSDQAAEDMGKQIHGYIMRSGFDPVSFAAGALVHMYSKCGNTENAKR 234 KP T+ ++ CS + A + GK++H +I SGF P L+ MY+KCG+ +A++ Sbjct: 85 KPPASTYCNLIQVCSQKRALEEGKKVHEHIKNSGFVPGVVICNRLLGMYAKCGSLIDARK 144 Query: 235 VFEQMPQPDLVSWTAMISGYAQNGQPEEALHYFE 336 +F++MP D+ SW M++GYA+ G EEA F+ Sbjct: 145 LFDEMPNKDVCSWNIMVNGYAEVGLLEEARKLFD 178