BLASTX nr result
ID: Rehmannia22_contig00032757
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00032757
         (553 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containi...   310   1e-82
ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containi...   308   7e-82
ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containi...   305   6e-81
gb|EMJ04813.1| hypothetical protein PRUPE_ppa002292mg [Prunus pe...   303   1e-80
gb|EOX91314.1| Pentatricopeptide repeat (PPR) superfamily protei...   301   9e-80
ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containi...   296   2e-78
ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citr...   296   2e-78
ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Popu...   293   2e-77
gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis]     290   1e-76
gb|EPS71377.1| hypothetical protein M569_03380, partial [Genlise...   290   1e-76
ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containi...   289   3e-76
ref|XP_003610927.1| Pentatricopeptide repeat-containing protein ...   288   8e-76
gb|ESW22273.1| hypothetical protein PHAVU_005G140500g [Phaseolus...   286   2e-75
ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containi...   285   4e-75
ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   283   3e-74
ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Caps...   273   2e-71
ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutr...   270   2e-70
ref|NP_195434.1| pentatricopeptide repeat-containing protein [Ar...
268 6e-70 ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containi... 265 4e-69 ref|XP_002869013.1| pentatricopeptide repeat-containing protein ... 264 1e-68 >ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Solanum tuberosum] Length = 695 Score = 310 bits (794), Expect = 1e-82 Identities = 144/182 (79%), Positives = 166/182 (91%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS + SGIRPN+FTFAGVLNACAHQT E G+QVHG+M R GFDPLSFAAS LVHMY Sbjct: 310 LFSCLMESGIRPNDFTFAGVLNACAHQTTEHFGKQVHGYMTRIGFDPLSFAASTLVHMYA 369 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCGSV+ A+KVFK LP+PD+VSWTSLINGYAQNGQP EAL+LFDLLL+SG QPDHITF+G Sbjct: 370 KCGSVDSAYKVFKRLPRPDVVSWTSLINGYAQNGQPSEALQLFDLLLKSGTQPDHITFVG 429 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLVDKG+EYF+SIK+KH L+HT+DHYACV+DLLSR GRF+EAE+II++MP+K Sbjct: 430 VLSACTHAGLVDKGLEYFYSIKDKHCLTHTSDHYACVIDLLSRFGRFKEAEEIISQMPMK 489 Query: 548 PD 553 PD Sbjct: 490 PD 491 Score = 106 bits (265), Expect = 3e-21 Identities = 59/173 (34%), Positives = 94/173 (54%) Frame = +2 Query: 35 IRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAH 214 ++ N+FT + L A A + LG+++HGH++RTG D + SAL MY KCGSV+ A Sbjct: 218 VKCNKFTISSALAASASVQSLRLGKEIHGHIVRTGLDSDAVVWSALSDMYGKCGSVDEAR 277 Query: 215 KVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAG 394 +F D+VSWT++I+ Y +G+ E LF L+ SG +P+ TF GVL+AC H Sbjct: 278 HIFDRTKDKDVVSWTAMIDRYFGDGRWEEGYLLFSCLMESGIRPNDFTFAGVLNACAHQT 337 Query: 395 LVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G + + +V + ++ G A + ++P +PD Sbjct: 338 TEHFG-KQVHGYMTRIGFDPLSFAASTLVHMYAKCGSVDSAYKVFKRLP-RPD 388 Score = 67.0 bits (162), Expect = 3e-09 Identities = 35/101 (34%), Positives = 54/101 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RP+ F+ +L C A E G++VH M +GF P ++ ++ Y KC AH Sbjct: 86 
RPSATVFSTLLRICIDNRALEEGKRVHKSMKCSGFRPGVVISNRILDFYCKCDKPFDAHN 145 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGN 340 +F +P+ DL SW +++G+A+ G EA KLFD + N Sbjct: 146 LFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEMPEKDN 186 >ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Solanum lycopersicum] Length = 695 Score = 308 bits (788), Expect = 7e-82 Identities = 144/182 (79%), Positives = 165/182 (90%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS + SGIRPN+FTFAGVLNACAHQT E G+QVHG+MMR GFDPLSFAAS LVHMY Sbjct: 310 LFSCLMYSGIRPNDFTFAGVLNACAHQTKEHFGKQVHGYMMRIGFDPLSFAASTLVHMYA 369 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCGSV+ A+KVFK LPKPD+VSWTSLINGYAQN QP EAL+L+D LL+SG QPDHITF+G Sbjct: 370 KCGSVDSAYKVFKRLPKPDVVSWTSLINGYAQNSQPSEALQLYDSLLKSGTQPDHITFVG 429 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLVDKG+EYF+SIK+KH L+HTADHYACV+DLLSR GRF+EAE+II++MP+K Sbjct: 430 VLSACTHAGLVDKGLEYFYSIKDKHCLTHTADHYACVIDLLSRFGRFKEAEEIISQMPMK 489 Query: 548 PD 553 PD Sbjct: 490 PD 491 Score = 102 bits (255), Expect = 5e-20 Identities = 59/170 (34%), Positives = 92/170 (54%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L A A + LG++++GH++RTG D + SAL MY KCGSV+ A +F Sbjct: 221 NKFTISSALAASASIQSLRLGKEIYGHIVRTGLDSDAVVWSALSDMYGKCGSVDEARHIF 280 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 D+VSWT++I+ Y +G+ E LF L+ SG +P+ TF GVL+AC H Sbjct: 281 DRTKDKDVVSWTAMIDRYFGDGRWEEGYLLFSCLMYSGIRPNDFTFAGVLNACAHQTKEH 340 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G + + +V + ++ G A + ++P KPD Sbjct: 341 FG-KQVHGYMMRIGFDPLSFAASTLVHMYAKCGSVDSAYKVFKRLP-KPD 388 Score = 67.4 bits (163), Expect = 2e-09 Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 32/198 (16%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKC-------- 193 RP+ F+ +L C A 
E G++VH M +GF P ++ ++ Y KC Sbjct: 86 RPSATVFSTLLRICIDNRALEEGKRVHKIMKCSGFRPGVVISNRVLDFYCKCDKPFDAQN 145 Query: 194 -----------------------GSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEA 304 G ++ A K+F +P+ D SWT++I+GY ++ +P A Sbjct: 146 LFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEMPEKDNFSWTAMISGYVRHNKPECA 205 Query: 305 LKLFDLLLRSGN-QPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L+L+ ++LR N + + T L+A + G E + I + GL A ++ + Sbjct: 206 LELYRVMLRDENFKCNKFTISSALAASASIQSLRLGKEIYGHI-VRTGLDSDAVVWSALS 264 Query: 482 DLLSRSGRFREAEDIINK 535 D+ + G EA I ++ Sbjct: 265 DMYGKCGSVDEARHIFDR 282 >ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like, partial [Vitis vinifera] Length = 621 Score = 305 bits (780), Expect = 6e-81 Identities = 142/182 (78%), Positives = 163/182 (89%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFSD L SGI PNEFTF+GVLNACA AEELG+QVHG+M R GFDP SFAAS LVHMYT Sbjct: 310 LFSDLLKSGIWPNEFTFSGVLNACADHAAEELGKQVHGYMTRIGFDPSSFAASTLVHMYT 369 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+++ A +VF +P+PDLVSWTSLI+GYAQNGQP EAL+ F+LLL+SG QPDHITF+G Sbjct: 370 KCGNIKNARRVFNGMPRPDLVSWTSLISGYAQNGQPDEALQFFELLLKSGTQPDHITFVG 429 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLVDKG+EYF SIKEKHGL+HTADHYAC++DLLSRSGR +EAEDII+KMPI+ Sbjct: 430 VLSACTHAGLVDKGLEYFDSIKEKHGLTHTADHYACLIDLLSRSGRLQEAEDIIDKMPIE 489 Query: 548 PD 553 PD Sbjct: 490 PD 491 Score = 104 bits (260), Expect = 1e-20 Identities = 59/170 (34%), Positives = 91/170 (53%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L A A + LG+++HGH++R G D SAL MY KCGS+ A +F Sbjct: 221 NKFTMSSALAASAAIQSLHLGKEIHGHILRIGLDLDGVVWSALSDMYGKCGSIGEARHIF 280 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 D+VSWT++I+ Y + G+ E LF LL+SG P+ TF GVL+AC + Sbjct: 281 DKTVDRDVVSWTAMIDRYFKEGRREEGFALFSDLLKSGIWPNEFTFSGVLNACADHAAEE 340 Query: 404 
KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G ++ + +V + ++ G + A + N MP +PD Sbjct: 341 LG-KQVHGYMTRIGFDPSSFAASTLVHMYTKCGNIKNARRVFNGMP-RPD 388 Score = 73.2 bits (178), Expect = 4e-11 Identities = 39/132 (29%), Positives = 71/132 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RP+ T++ +L C A + G +VH H +GF P ++ ++ MY KC S+ A + Sbjct: 86 RPSAATYSTLLQLCLQLRALDEGMKVHAHTKTSGFVPGVVISNRILDMYIKCNSLVNAKR 145 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + + DL SW +I+GYA+ G+ EA KLFD + + D+ ++ + S Sbjct: 146 LFDEMAERDLCSWNIMISGYAKAGRLQEARKLFDQM----TERDNFSWTAMTSGYVRHDQ 201 Query: 398 VDKGVEYFHSIK 433 ++ +E F +++ Sbjct: 202 HEEALELFRAMQ 213 >gb|EMJ04813.1| hypothetical protein PRUPE_ppa002292mg [Prunus persica] Length = 691 Score = 303 bits (777), Expect = 1e-80 Identities = 140/182 (76%), Positives = 164/182 (90%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS+ + SGIRPNEFTFAGVLNACAH AE LG+QVHG+M R GFDPLSFA+SALVHMY+ Sbjct: 306 LFSELMKSGIRPNEFTFAGVLNACAHHAAENLGKQVHGYMTRIGFDPLSFASSALVHMYS 365 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+ A+ VFK +P PD+VSWTSLI GYAQNGQP+EAL+LF+LLL+SG +PDHITF+G Sbjct: 366 KCGNTVNANMVFKGMPHPDVVSWTSLIVGYAQNGQPYEALQLFELLLKSGTKPDHITFVG 425 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLV+KG+EYFHSIK KHGL+HTADHYACVVDLL+R+GRF EAE+ IN+MP+K Sbjct: 426 VLSACTHAGLVEKGLEYFHSIKAKHGLAHTADHYACVVDLLARAGRFEEAENFINEMPMK 485 Query: 548 PD 553 PD Sbjct: 486 PD 487 Score = 109 bits (273), Expect = 4e-22 Identities = 59/168 (35%), Positives = 92/168 (54%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 + N+FT + L A A + LG+++HG +MRTG D SAL MY KCGS+E A + Sbjct: 215 KSNKFTVSSALAASAAIQSLRLGKEIHGFIMRTGLDSDEVVWSALSDMYGKCGSIEEAKR 274 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + D+VSWT++I+ Y ++G+ E LF 
L++SG +P+ TF GVL+AC H Sbjct: 275 IFDKMVNRDVVSWTAMIDRYFEDGKREEGFALFSELMKSGIRPNEFTFAGVLNACAHHAA 334 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMP 541 + G + H + G + + +V + S+ G A + MP Sbjct: 335 ENLG-KQVHGYMTRIGFDPLSFASSALVHMYSKCGNTVNANMVFKGMP 381 Score = 77.8 bits (190), Expect = 2e-12 Identities = 55/199 (27%), Positives = 89/199 (44%), Gaps = 32/199 (16%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RP+ ++ +L C Q A G+ VH H +GF P F + L+ +Y KCGS+ A K Sbjct: 82 RPSASIYSTLLQLCLQQRALVQGKLVHAHTKVSGFVPGLFICNRLIDLYAKCGSLVDAQK 141 Query: 218 VF---------KW----------------------LPKPDLVSWTSLINGYAQNGQPHEA 304 VF W +P+ D SWT++I+GY ++ +P EA Sbjct: 142 VFDEMSERDLCSWNTMISGYAKVGLLGEARKLFDEMPEKDNFSWTAMISGYVRHERPKEA 201 Query: 305 LKLFDLLLRSGN-QPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L+L+ ++ R N + + T L+A + G E H + GL ++ + Sbjct: 202 LQLYRMMQRHDNSKSNKFTVSSALAASAAIQSLRLGKE-IHGFIMRTGLDSDEVVWSALS 260 Query: 482 DLLSRSGRFREAEDIINKM 538 D+ + G EA+ I +KM Sbjct: 261 DMYGKCGSIEEAKRIFDKM 279 >gb|EOX91314.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 684 Score = 301 bits (770), Expect = 9e-80 Identities = 139/182 (76%), Positives = 163/182 (89%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS+ + SGIRPNEFTFAGVLNACA AEE+G+QVHG M R GF+P SFAASALVHMY+ Sbjct: 299 LFSELMKSGIRPNEFTFAGVLNACADHAAEEIGKQVHGCMTRLGFNPFSFAASALVHMYS 358 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+VE A +VF +P PDLVSWTSLI GYAQNGQP EAL+ F+LLL+SG +PDHITF+G Sbjct: 359 KCGNVENAKRVFNGMPLPDLVSWTSLITGYAQNGQPEEALEYFELLLKSGTKPDHITFVG 418 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLVDKG+EYFHSIK++HGL+HTADHYAC++DLL+RSGRF+EAE+II KMP+K Sbjct: 419 VLSACTHAGLVDKGLEYFHSIKDRHGLTHTADHYACIIDLLARSGRFQEAENIIVKMPMK 478 Query: 548 PD 553 PD Sbjct: 479 PD 480 Score = 106 bits (264), Expect = 4e-21 Identities = 58/170 (34%), Positives = 
94/170 (55%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + + A A G+++HG + R G D SAL+ MY KCGS+E A +VF Sbjct: 210 NKFTVSSAIAASAAMGCLTTGKEIHGRITRAGLDLDEVVWSALMDMYGKCGSIEEARRVF 269 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I+ Y ++G+ E +LF L++SG +P+ TF GVL+AC + Sbjct: 270 DKIVDRDIVSWTAMIDRYFEDGRWEEGFELFSELMKSGIRPNEFTFAGVLNACADHAAEE 329 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G + + + +V + S+ G A+ + N MP+ PD Sbjct: 330 IG-KQVHGCMTRLGFNPFSFAASALVHMYSKCGNVENAKRVFNGMPL-PD 377 Score = 67.8 bits (164), Expect = 2e-09 Identities = 34/94 (36%), Positives = 51/94 (54%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P ++ ++ C A G+ VH H+ +GF + L+ MY KCGS+ A Sbjct: 75 KPPASLYSTLIQLCCQNRALNEGKSVHQHIKISGFSAGLVICNRLLDMYAKCGSLADAQN 134 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFD 319 VF + + DL SW +L++GYA+ G EA KLFD Sbjct: 135 VFDEMSERDLCSWNTLMSGYAKMGMLKEANKLFD 168 >ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Citrus sinensis] Length = 695 Score = 296 bits (758), Expect = 2e-78 Identities = 136/182 (74%), Positives = 162/182 (89%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS+ + SGIRPN FTFAGVLNACA AEELG+QVHG+M R G+DP SFAASALVHMY+ Sbjct: 310 LFSELIKSGIRPNAFTFAGVLNACADHAAEELGKQVHGYMTRIGYDPYSFAASALVHMYS 369 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+VE + KVF +P+PDLVSWTSLI GYAQNG P +AL+ F+LLL+SG QPD+I F+G Sbjct: 370 KCGNVENSKKVFNGMPRPDLVSWTSLIAGYAQNGMPDKALEYFELLLKSGTQPDNIVFVG 429 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VL+ACTHAGLVDKG++YFHSIKEKHGL++TADHYAC+VDLL+RSGRF EAED+I+KMP+K Sbjct: 430 VLTACTHAGLVDKGLQYFHSIKEKHGLTYTADHYACIVDLLARSGRFHEAEDVISKMPMK 489 Query: 548 PD 553 PD Sbjct: 490 PD 491 Score = 111 bits (277), Expect = 1e-22 Identities = 60/170 (35%), Positives = 
94/170 (55%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L+A + LG+++HG++MRTGFD SAL MY KCGS+ A ++F Sbjct: 221 NKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEARQIF 280 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I Y Q G+ E LF L++SG +P+ TF GVL+AC + Sbjct: 281 DKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNACADHAAEE 340 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G + + +V + S+ G ++ + N MP +PD Sbjct: 341 LG-KQVHGYMTRIGYDPYSFAASALVHMYSKCGNVENSKKVFNGMP-RPD 388 Score = 75.9 bits (185), Expect = 6e-12 Identities = 51/198 (25%), Positives = 86/198 (43%), Gaps = 32/198 (16%) Frame = +2 Query: 41 PNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGS------- 199 P+ ++ ++ C A E G++VH H+ +GF P F ++ L+ MY KCG+ Sbjct: 87 PSPSIYSSLIQFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTL 146 Query: 200 ------------------------VERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEAL 307 +E+A +F +P+ D SWT++I+GY + QP EAL Sbjct: 147 FDEMHERDVCSYNTMISGYTKVGFLEQARNLFDEMPQRDNFSWTAIISGYVRYNQPIEAL 206 Query: 308 KLFDLLLRSGNQ-PDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVD 484 L+ ++ N + T LSA + + G E H + G ++ + D Sbjct: 207 DLYRMMQNFENSVSNKFTLSSALSAVSAIQCLRLGKE-IHGYIMRTGFDSDEVVWSALSD 265 Query: 485 LLSRSGRFREAEDIINKM 538 + + G EA I +KM Sbjct: 266 MYGKCGSINEARQIFDKM 283 >ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citrus clementina] gi|557527815|gb|ESR39065.1| hypothetical protein CICLE_v10024955mg [Citrus clementina] Length = 759 Score = 296 bits (758), Expect = 2e-78 Identities = 136/182 (74%), Positives = 162/182 (89%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS+ + SGIRPN FTFAGVLNACA AEELG+QVHG+M R G+DP SFAASALVHMY+ Sbjct: 374 LFSELIKSGIRPNAFTFAGVLNACADHAAEELGKQVHGYMTRIGYDPYSFAASALVHMYS 433 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+VE + KVF +P+PDLVSWTSLI GYAQNG P +AL+ F+LLL+SG QPD+I F+G 
Sbjct: 434 KCGNVENSKKVFNGMPRPDLVSWTSLIAGYAQNGMPDKALEYFELLLKSGTQPDNIVFVG 493 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VL+ACTHAGLVDKG++YFHSIKEKHGL++TADHYAC+VDLL+RSGRF EAED+I+KMP+K Sbjct: 494 VLTACTHAGLVDKGLQYFHSIKEKHGLTYTADHYACIVDLLARSGRFHEAEDVISKMPMK 553 Query: 548 PD 553 PD Sbjct: 554 PD 555 Score = 111 bits (277), Expect = 1e-22 Identities = 60/170 (35%), Positives = 94/170 (55%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L+A + LG+++HG++MRTGFD SAL MY KCGS+ A ++F Sbjct: 285 NKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEARQIF 344 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I Y Q G+ E LF L++SG +P+ TF GVL+AC + Sbjct: 345 DKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNACADHAAEE 404 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + G + + +V + S+ G ++ + N MP +PD Sbjct: 405 LG-KQVHGYMTRIGYDPYSFAASALVHMYSKCGNVENSKKVFNGMP-RPD 452 Score = 75.9 bits (185), Expect = 6e-12 Identities = 51/198 (25%), Positives = 86/198 (43%), Gaps = 32/198 (16%) Frame = +2 Query: 41 PNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGS------- 199 P+ ++ ++ C A E G++VH H+ +GF P F ++ L+ MY KCG+ Sbjct: 151 PSPSIYSSLIQFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTL 210 Query: 200 ------------------------VERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEAL 307 +E+A +F +P+ D SWT++I+GY + QP EAL Sbjct: 211 FDEMHERDVCSYNTMISGYTKVGFLEQARNLFDEMPQRDNFSWTAIISGYVRYNQPIEAL 270 Query: 308 KLFDLLLRSGNQ-PDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVD 484 L+ ++ N + T LSA + + G E H + G ++ + D Sbjct: 271 DLYRMMQNFENSVSNKFTLSSALSAVSAIQCLRLGKE-IHGYIMRTGFDSDEVVWSALSD 329 Query: 485 LLSRSGRFREAEDIINKM 538 + + G EA I +KM Sbjct: 330 MYGKCGSINEARQIFDKM 347 >ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Populus trichocarpa] gi|222867101|gb|EEF04232.1| hypothetical protein POPTR_0017s12720g [Populus trichocarpa] Length = 676 Score = 293 bits (750), Expect = 2e-77 
Identities = 134/182 (73%), Positives = 164/182 (90%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LF+D L SGIRPNEFTF+GVLNACA+QT+EELG++VHG+M R GFDP SFAASALVHMY+ Sbjct: 291 LFADLLRSGIRPNEFTFSGVLNACANQTSEELGKKVHGYMTRVGFDPFSFAASALVHMYS 350 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG++ A +VFK P+PDL SWTSLI GYAQNGQP EA++ F+LL++SG QPDHITF+G Sbjct: 351 KCGNMVSAERVFKETPQPDLFSWTSLIAGYAQNGQPDEAIRYFELLVKSGTQPDHITFVG 410 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSAC HAGLVDKG++YFHSIKE++GL+HTADHYAC++DLL+RSG+F EAE+II+KM +K Sbjct: 411 VLSACAHAGLVDKGLDYFHSIKEQYGLTHTADHYACIIDLLARSGQFDEAENIISKMSMK 470 Query: 548 PD 553 PD Sbjct: 471 PD 472 Score = 109 bits (272), Expect = 5e-22 Identities = 62/176 (35%), Positives = 96/176 (54%) Frame = +2 Query: 26 SSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVE 205 S + N+FT + L A A +G+++HG++MRTG D SAL MY KCGS+E Sbjct: 196 SDNSKSNKFTVSSALAAAAAVPCLRIGKEIHGYIMRTGLDSDEVVWSALSDMYGKCGSIE 255 Query: 206 RAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACT 385 A +F + D+V+WT++I+ Y Q+G+ E LF LLRSG +P+ TF GVL+AC Sbjct: 256 EARHIFDKMVDRDIVTWTAMIDRYFQDGRRKEGFDLFADLLRSGIRPNEFTFSGVLNACA 315 Query: 386 HAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 + + G + H + G + + +V + S+ G AE + + P +PD Sbjct: 316 NQTSEELG-KKVHGYMTRVGFDPFSFAASALVHMYSKCGNMVSAERVFKETP-QPD 369 Score = 77.4 bits (189), Expect = 2e-12 Identities = 53/199 (26%), Positives = 90/199 (45%), Gaps = 32/199 (16%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P+ ++ ++ +C + G++VH H+ +GF P F + L+ MY KC S+ + K Sbjct: 67 KPSASVYSTLIQSCIKSRLLQQGKKVHQHIKLSGFVPGLFILNRLLEMYAKCDSLMDSQK 126 Query: 218 VFKWLPKPDLVSWTSLINGYAQNG-------------------------------QPHEA 304 +F +P+ DL SW LI+GYA+ G +P+EA Sbjct: 127 LFDEMPERDLCSWNILISGYAKMGLLQEAKSLFDKMPERDNFSWTAMISGYVRHDRPNEA 186 Query: 305 
LKLFDLLLRSGN-QPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L+LF ++ RS N + + T L+A + G E H + GL ++ + Sbjct: 187 LELFRMMKRSDNSKSNKFTVSSALAAAAAVPCLRIGKE-IHGYIMRTGLDSDEVVWSALS 245 Query: 482 DLLSRSGRFREAEDIINKM 538 D+ + G EA I +KM Sbjct: 246 DMYGKCGSIEEARHIFDKM 264 >gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis] Length = 742 Score = 290 bits (743), Expect = 1e-76 Identities = 133/183 (72%), Positives = 162/183 (88%) Frame = +2 Query: 5 TLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMY 184 +LF + +SSG RPN FTF+GVLNACA A +LG+QVHG+M R GFDPLSFAASALVHMY Sbjct: 308 SLFMELMSSGTRPNGFTFSGVLNACADHAAGDLGKQVHGYMTRIGFDPLSFAASALVHMY 367 Query: 185 TKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFI 364 KCG++E A +VFK +PKPDLVSWTSLI GYAQ+GQP+EAL++F+ L +SG +PDH+TF+ Sbjct: 368 AKCGNIENAKRVFKGMPKPDLVSWTSLIVGYAQHGQPNEALQMFESLHKSGIKPDHVTFV 427 Query: 365 GVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPI 544 GVLSACTHAGLVDKG+EYFHSIK KHGL +TADHYAC+VD+L+R+GRF+EAE+IIN MPI Sbjct: 428 GVLSACTHAGLVDKGLEYFHSIKTKHGLGYTADHYACIVDILARAGRFKEAEEIINGMPI 487 Query: 545 KPD 553 +PD Sbjct: 488 RPD 490 Score = 104 bits (260), Expect = 1e-20 Identities = 57/172 (33%), Positives = 97/172 (56%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 R ++FT + VL A A + +G+++HG++MRTG D SAL+ MY KCG+++ A + Sbjct: 218 RCDKFTVSSVLAAAAAIPSLRVGKEIHGYVMRTGLDSDEVVLSALLDMYGKCGNIDEARR 277 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 VF + + D+V+WT++I+ ++G+ + LF L+ SG +P+ TF GVL+AC Sbjct: 278 VFDKMVERDVVTWTAMIDRCFRSGRSKDGFSLFMELMSSGTRPNGFTFSGVLNACADHAA 337 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 D G + H + G + + +V + ++ G A+ + MP KPD Sbjct: 338 GDLG-KQVHGYMTRIGFDPLSFAASALVHMYAKCGNIENAKRVFKGMP-KPD 387 Score = 82.4 bits (202), Expect = 7e-14 Identities = 42/132 (31%), Positives = 71/132 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RP+ ++ +L C 
H+ A E G+ VH H +G P F ++ + +Y KCG + A K Sbjct: 85 RPSALIYSTILCHCLHERALEEGKLVHAHTKASGLVPGLFISNRFIDLYAKCGCLGDARK 144 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 VF +P DL SW ++I+GYA+ G+ EA +LFD + DH ++ ++S Sbjct: 145 VFDEMPDKDLCSWNTMISGYAKVGKLDEARRLFDEM----PDRDHYSWSAMISGYVRQDW 200 Query: 398 VDKGVEYFHSIK 433 +G+E + ++ Sbjct: 201 AKEGLELYRMMQ 212 >gb|EPS71377.1| hypothetical protein M569_03380, partial [Genlisea aurea] Length = 639 Score = 290 bits (743), Expect = 1e-76 Identities = 140/184 (76%), Positives = 157/184 (85%), Gaps = 1/184 (0%) Frame = +2 Query: 5 TLFSDFLS-SGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHM 181 +LFS LS SG PN+FT +GVL AC TAEE+GRQVH MM TGF P SFAAS LVHM Sbjct: 256 SLFSHLLSCSGNEPNDFTISGVLKACTFCTAEEIGRQVHARMMLTGFSPDSFAASTLVHM 315 Query: 182 YTKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITF 361 YTKCGS+E A KVF +P+PDLVSWTSLINGYAQNGQ EAL+LFDLLL SGN+PDHITF Sbjct: 316 YTKCGSIESARKVFSMIPEPDLVSWTSLINGYAQNGQHREALRLFDLLLESGNRPDHITF 375 Query: 362 IGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMP 541 +GVLSACTHAGLV +G+EYFHSI EKHGLSHT DHYACVVDLLSR+GRF EAE++IN+MP Sbjct: 376 VGVLSACTHAGLVSEGLEYFHSITEKHGLSHTPDHYACVVDLLSRAGRFEEAENVINEMP 435 Query: 542 IKPD 553 +KPD Sbjct: 436 MKPD 439 Score = 92.4 bits (228), Expect = 6e-17 Identities = 58/171 (33%), Positives = 90/171 (52%), Gaps = 1/171 (0%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L A A + G+++HG ++RT D + SAL+ MY KCGSV A +F Sbjct: 168 NKFTISIALAASASLKSLCSGKEIHGRIIRTSRDSDAVVWSALLDMYGKCGSVNEARHIF 227 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLR-SGNQPDHITFIGVLSACTHAGLV 400 P D+VSWT++++ Y +G+ E LF LL SGN+P+ T GVL ACT Sbjct: 228 DTTPDKDIVSWTTMMDCYFGDGKWTEGFSLFSHLLSCSGNEPNDFTISGVLKACTFCTAE 287 Query: 401 DKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 + G H+ G S + + +V + ++ G A + + +P +PD Sbjct: 288 EIG-RQVHARMMLTGFSPDSFAASTLVHMYTKCGSIESARKVFSMIP-EPD 336 Score = 58.9 bits (141), Expect = 8e-07 
Identities = 51/198 (25%), Positives = 90/198 (45%), Gaps = 33/198 (16%) Frame = +2 Query: 53 TFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVFKWL 232 ++A VL C + A + G+ V + +GF P +F ++ ++ ++ KCGS+ A +F + Sbjct: 36 SYATVLQLCIEKKALDEGKIVVAQIKASGFVPGTFVSNRILDLFCKCGSLFEARTLFDEM 95 Query: 233 PKPDLVSWTSLINGYAQNGQPHEALKLFDLL----------LRSG----NQPDHITFI-- 364 DL SW +++GY + G +A +FD + L SG N+P+H + Sbjct: 96 NCRDLCSWNIMLSGYTKCGLISDARNMFDEMPQRDNFTWTALISGYVKHNEPEHALELYR 155 Query: 365 -----GVLSACTH------AGLVDKGVEYFHSIKEKHG----LSHTADH--YACVVDLLS 493 + SAC + A ++ S KE HG S +D ++ ++D+ Sbjct: 156 LMHEEEISSACDNKFTISIALAASASLKSLCSGKEIHGRIIRTSRDSDAVVWSALLDMYG 215 Query: 494 RSGRFREAEDIINKMPIK 547 + G EA I + P K Sbjct: 216 KCGSVNEARHIFDTTPDK 233 Score = 56.6 bits (135), Expect = 4e-06 Identities = 48/157 (30%), Positives = 72/157 (45%), Gaps = 7/157 (4%) Frame = +2 Query: 2 LTLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQ-VHGHMMRTGFDPLSFAASALVH 178 L LF L SG RP+ TF GVL+AC H G + H + G + +V Sbjct: 357 LRLFDLLLESGNRPDHITFVGVLSACTHAGLVSEGLEYFHSITEKHGLSHTPDHYACVVD 416 Query: 179 MYTKCGSVERAHKVFKWLP-KPDLVSWTSLINGYAQNGQPHEALKLFDLLLR--SGNQPD 349 + ++ G E A V +P KPD W SL+NG +G A + + LLR N Sbjct: 417 LLSRAGRFEEAENVINEMPMKPDRFIWGSLLNGCRIHGNYVLAKEAAEALLRLEPDNAAT 476 Query: 350 HITFIGVLSA---CTHAGLVDKGVEYFHSIKEKHGLS 451 ++T + ++ AG + K +E ++K K G+S Sbjct: 477 YVTLANIYASEGKWDEAGEMRKVMEEGKAVK-KPGMS 512 >ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Fragaria vesca subsp. 
vesca] Length = 757 Score = 289 bits (740), Expect = 3e-76 Identities = 135/183 (73%), Positives = 159/183 (86%) Frame = +2 Query: 2 LTLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHM 181 L LFS+ + +GIRPNEFTFAGVLNACA E LG+QVHG+M R FDP SFAASALVHM Sbjct: 370 LALFSELMRTGIRPNEFTFAGVLNACADHAIENLGKQVHGYMTRIEFDPFSFAASALVHM 429 Query: 182 YTKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITF 361 Y+KCG+ A+KVFK +P PDLVSWTSLI GYAQNGQ EAL+LF+ LL+SG +PDH+TF Sbjct: 430 YSKCGNTANANKVFKGMPSPDLVSWTSLIVGYAQNGQADEALQLFESLLKSGTRPDHVTF 489 Query: 362 IGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMP 541 +GVLSACTHAGLVD+G+EYFHSIKEKHGL HTADHYACVVDLL+R+G+F EAE+II++MP Sbjct: 490 VGVLSACTHAGLVDRGLEYFHSIKEKHGLKHTADHYACVVDLLARAGQFDEAENIISEMP 549 Query: 542 IKP 550 +KP Sbjct: 550 MKP 552 Score = 101 bits (252), Expect = 1e-19 Identities = 57/170 (33%), Positives = 93/170 (54%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 ++FT + VL A A + +G+++H ++MRTG D SAL MY KCGS+E A +VF Sbjct: 283 SKFTVSSVLVASAAVQSLRMGKEIHCYIMRTGLDSDEVVWSALSDMYGKCGSIEEARRVF 342 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+V+WT+++ Y ++G+ E L LF L+R+G +P+ TF GVL+AC + + Sbjct: 343 DKMVNRDVVTWTAMMGRYFEDGKREEGLALFSELMRTGIRPNEFTFAGVLNACADHAIEN 402 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G + H + + + +V + S+ G A + MP PD Sbjct: 403 LG-KQVHGYMTRIEFDPFSFAASALVHMYSKCGNTANANKVFKGMP-SPD 450 Score = 73.2 bits (178), Expect = 4e-11 Identities = 39/133 (29%), Positives = 72/133 (54%) Frame = +2 Query: 41 PNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKV 220 P+ ++ +L+ C A + + VH H GFD F ++ +++Y KCGS+ A KV Sbjct: 149 PSSSLYSTLLHHCLQHRALDQAKLVHSHTKLYGFDLGLFISNRFINLYAKCGSLVDAQKV 208 Query: 221 FKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLV 400 F +P DL SW ++I+GYA+ G+ +A KLFD + D+ ++ ++S Sbjct: 209 FDEMPDRDLCSWNTMISGYAKLGKLGDARKLFDEM----PHRDNFSWTAMISGYVWHERP 264 Query: 401 DKGVEYFHSIKEK 439 D+ +E 
+ ++++ Sbjct: 265 DEALELYRVMRKE 277 >ref|XP_003610927.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355512262|gb|AES93885.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 802 Score = 288 bits (736), Expect = 8e-76 Identities = 131/183 (71%), Positives = 159/183 (86%) Frame = +2 Query: 5 TLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMY 184 +LF D + SG+RPNE+TFAGVLNACA AE++G++VHG+M R G+DP SFAASALVH+Y Sbjct: 283 SLFRDLMGSGVRPNEYTFAGVLNACADLAAEQMGKEVHGYMTRVGYDPFSFAASALVHVY 342 Query: 185 TKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFI 364 +KCG+ E A +VF +P+PDLVSWTSLI GYAQNGQP AL+ F+ LLRSG +PD ITF+ Sbjct: 343 SKCGNTETARRVFNQMPRPDLVSWTSLIVGYAQNGQPDMALQFFESLLRSGTKPDEITFV 402 Query: 365 GVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPI 544 GVLSACTHAGLVD G+EYFHS+KEKHGL HTADHYACV+DLL+RSGRF+EAE+II+ MP+ Sbjct: 403 GVLSACTHAGLVDIGLEYFHSVKEKHGLVHTADHYACVIDLLARSGRFKEAENIIDNMPM 462 Query: 545 KPD 553 KPD Sbjct: 463 KPD 465 Score = 96.3 bits (238), Expect = 4e-18 Identities = 54/170 (31%), Positives = 91/170 (53%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N FT + L A A ++ G+++HG+++R+G + +AL+ +Y KCGS+ A +F Sbjct: 195 NMFTLSSALAAAAAISSLRRGKEIHGYLIRSGLELDEVVWTALLDLYGKCGSLNEARGIF 254 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I+ ++G+ E LF L+ SG +P+ TF GVL+AC Sbjct: 255 DQMADKDIVSWTTMIHRCFEDGRKKEGFSLFRDLMGSGVRPNEYTFAGVLNACADLAAEQ 314 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G E H + G + + +V + S+ G A + N+MP +PD Sbjct: 315 MGKE-VHGYMTRVGYDPFSFAASALVHVYSKCGNTETARRVFNQMP-RPD 362 Score = 82.4 bits (202), Expect = 7e-14 Identities = 44/140 (31%), Positives = 76/140 (54%) Frame = +2 Query: 17 DFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCG 196 D+L +P+ ++ ++ AC ELG++VH H + F P ++ L+HMY KCG Sbjct: 53 DYLHRIPQPSPRLYSTLIAACLRHRKLELGKRVHAHTKASNFIPGIVISNRLIHMYAKCG 112 Query: 197 
SVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLS 376 S+ A +F +P+ DL SW ++I+GYA G+ +A KLFD + D+ ++ V+S Sbjct: 113 SLVDAQMLFDEIPQKDLCSWNTMISGYANVGRIEQARKLFDEM----PHRDNFSWNAVIS 168 Query: 377 ACTHAGLVDKGVEYFHSIKE 436 G + ++ F ++E Sbjct: 169 GYVSQGWYMEALDLFRMMQE 188 >gb|ESW22273.1| hypothetical protein PHAVU_005G140500g [Phaseolus vulgaris] Length = 681 Score = 286 bits (732), Expect = 2e-75 Identities = 132/184 (71%), Positives = 157/184 (85%) Frame = +2 Query: 2 LTLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHM 181 L+LF D + SG+RPNE+TFAGVLN CA AE LG++VHG+MMR G+DP SFA SALVHM Sbjct: 294 LSLFRDLMWSGVRPNEYTFAGVLNECADHAAEHLGKEVHGYMMRVGYDPCSFAVSALVHM 353 Query: 182 YTKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITF 361 Y+KCG+ A +VF +P DLVSWTSLI GYAQNG+P EAL F+LLL+SG +PD ITF Sbjct: 354 YSKCGNTRVARRVFNHMPHKDLVSWTSLIVGYAQNGEPEEALHFFELLLQSGTKPDQITF 413 Query: 362 IGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMP 541 +GVLSACTHAGLVDKG+EYFHSI+EKHGL H+ADHYACV+DLL+RSGRF+EAE+II+ MP Sbjct: 414 VGVLSACTHAGLVDKGLEYFHSIREKHGLMHSADHYACVIDLLARSGRFKEAENIIDNMP 473 Query: 542 IKPD 553 IKPD Sbjct: 474 IKPD 477 Score = 99.8 bits (247), Expect = 4e-19 Identities = 61/174 (35%), Positives = 91/174 (52%), Gaps = 6/174 (3%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L A A LG+++HG++MRT + SAL+ +Y KCGS++ A +F Sbjct: 207 NKFTLSSALAASAAIPCLRLGKEIHGYLMRTELNLDEVVWSALLDLYGKCGSLDEARGIF 266 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I+ ++G+ E L LF L+ SG +P+ TF GVL+ C D Sbjct: 267 DQMKSKDVVSWTTMIHRCFEDGRKEEGLSLFRDLMWSGVRPNEYTFAGVLNEC-----AD 321 Query: 404 KGVEYFHSIKEKHGLSHTADHYAC------VVDLLSRSGRFREAEDIINKMPIK 547 E H KE HG + C +V + S+ G R A + N MP K Sbjct: 322 HAAE--HLGKEVHGYMMRVGYDPCSFAVSALVHMYSKCGNTRVARRVFNHMPHK 373 Score = 73.9 bits (180), Expect = 2e-11 Identities = 39/101 (38%), Positives = 57/101 (56%) Frame = +2 Query: 38 
RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RP+ ++ ++ AC A ELGR+VH H + F F + L+ MY KCGS+ A Sbjct: 72 RPSARAYSTLIAACVRHRALELGRRVHAHTKGSNFVLGVFICNRLLDMYAKCGSLVDAQM 131 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGN 340 +F + DL SW ++I GYA+ G+ +A KLFD + R N Sbjct: 132 LFDEMGHRDLCSWNTMIAGYAKLGRLEQARKLFDEMPRRDN 172 >ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Glycine max] Length = 693 Score = 285 bits (730), Expect = 4e-75 Identities = 131/182 (71%), Positives = 156/182 (85%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LF D + SG+RPNE+TFAGVLNACA AE LG++VHG+MM G+DP SFA SALVHMY+ Sbjct: 308 LFRDLMQSGVRPNEYTFAGVLNACADHAAEHLGKEVHGYMMHAGYDPGSFAISALVHMYS 367 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+ A +VF + +PDLVSWTSLI GYAQNGQP EAL F+LLL+SG +PD +T++G Sbjct: 368 KCGNTRVARRVFNEMHQPDLVSWTSLIVGYAQNGQPDEALHFFELLLQSGTKPDQVTYVG 427 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLVDKG+EYFHSIKEKHGL HTADHYACV+DLL+RSGRF+EAE+II+ MP+K Sbjct: 428 VLSACTHAGLVDKGLEYFHSIKEKHGLMHTADHYACVIDLLARSGRFKEAENIIDNMPVK 487 Query: 548 PD 553 PD Sbjct: 488 PD 489 Score = 94.7 bits (234), Expect = 1e-17 Identities = 58/171 (33%), Positives = 92/171 (53%), Gaps = 6/171 (3%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N+FT + L A A LG+++HG+++RT + SAL+ +Y KCGS++ A +F Sbjct: 219 NKFTLSSALAASAAIPCLRLGKEIHGYLIRTELNLDEVVWSALLDLYGKCGSLDEARGIF 278 Query: 224 KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I+ ++G+ E LF L++SG +P+ TF GVL+AC D Sbjct: 279 DQMKDRDVVSWTTMIHRCFEDGRREEGFLLFRDLMQSGVRPNEYTFAGVLNAC-----AD 333 Query: 404 KGVEYFHSIKEKHGLSHTADH------YACVVDLLSRSGRFREAEDIINKM 538 E H KE HG A + + +V + S+ G R A + N+M Sbjct: 334 HAAE--HLGKEVHGYMMHAGYDPGSFAISALVHMYSKCGNTRVARRVFNEM 382 Score = 77.8 bits (190), Expect = 2e-12 Identities = 55/199 (27%), Positives = 
89/199 (44%), Gaps = 32/199 (16%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGS------ 199 RP+ ++ ++ AC A ELGR+VH H + F P F ++ L+ MY KCGS Sbjct: 84 RPSARVYSTLIAACVRHRALELGRRVHAHTKASNFVPGVFISNRLLDMYAKCGSLVDAQM 143 Query: 200 -------------------------VERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEA 304 +E+A K+F +P+ D SW + I+GY + QP EA Sbjct: 144 LFDEMGHRDLCSWNTMIVGYAKLGRLEQARKLFDEMPQRDNFSWNAAISGYVTHNQPREA 203 Query: 305 LKLFDLLLR-SGNQPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L+LF ++ R + + T L+A + G E H + L+ ++ ++ Sbjct: 204 LELFRVMQRHERSSSNKFTLSSALAASAAIPCLRLGKE-IHGYLIRTELNLDEVVWSALL 262 Query: 482 DLLSRSGRFREAEDIINKM 538 DL + G EA I ++M Sbjct: 263 DLYGKCGSLDEARGIFDQM 281 >ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g37170-like [Cicer arietinum] Length = 700 Score = 283 bits (723), Expect = 3e-74 Identities = 130/184 (70%), Positives = 159/184 (86%) Frame = +2 Query: 2 LTLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHM 181 L+LF + + SG+RPNE+TFAGVLNACA E +G++VHG+M+R G++P SFAASALVH+ Sbjct: 278 LSLFRNLMGSGVRPNEYTFAGVLNACADLAIERIGKEVHGYMIRVGYNPCSFAASALVHL 337 Query: 182 YTKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITF 361 Y+KCG+ E A +VF +P+PDLVS TSLI GYAQNGQP AL F+LLLRSG +PD ITF Sbjct: 338 YSKCGNTEIARRVFNKMPRPDLVSCTSLIVGYAQNGQPDMALNFFELLLRSGTKPDEITF 397 Query: 362 IGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMP 541 +GVLSACTHAGLVDKG+EYFHS+KEKHGL HTADHYACV+DLL+RSGRF+EAE+II+ MP Sbjct: 398 VGVLSACTHAGLVDKGLEYFHSVKEKHGLMHTADHYACVIDLLARSGRFKEAENIIDNMP 457 Query: 542 IKPD 553 +KPD Sbjct: 458 MKPD 461 Score = 101 bits (251), Expect = 1e-19 Identities = 59/170 (34%), Positives = 94/170 (55%) Frame = +2 Query: 44 NEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHKVF 223 N FT + L A A + LG+++HG+++RT + SAL+ +Y KCGS++ A +F Sbjct: 191 NMFTLSSALAAAAAIRSLRLGKEIHGYLVRTELNLDEVVWSALLDLYGKCGSLDEARGIF 250 Query: 224 
KWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGLVD 403 + D+VSWT++I+ Y ++G+ L LF L+ SG +P+ TF GVL+AC + Sbjct: 251 DQMVDRDVVSWTTMIHRYFEDGRKEGGLSLFRNLMGSGVRPNEYTFAGVLNACADLAIER 310 Query: 404 KGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 G E H + G + + + +V L S+ G A + NKMP +PD Sbjct: 311 IGKE-VHGYMIRVGYNPCSFAASALVHLYSKCGNTEIARRVFNKMP-RPD 358 Score = 84.7 bits (208), Expect = 1e-14 Identities = 45/129 (34%), Positives = 69/129 (53%) Frame = +2 Query: 17 DFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCG 196 DFL +P+ ++ ++ AC H + ELGR+VH H + F P F ++ L+HMY KCG Sbjct: 49 DFLHRIHQPSPRLYSNLIAACLHHRSLELGRKVHAHTKASNFIPGIFISNRLLHMYVKCG 108 Query: 197 SVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLS 376 + A +F + + DL SW ++I GYA G +A KLFD + + N + G +S Sbjct: 109 GLIDAQSLFDEMSQKDLCSWNTMIAGYANLGHLEQARKLFDEMPQRDNFSWNAAISGYVS 168 Query: 377 ACTHAGLVD 403 H +D Sbjct: 169 HHRHREALD 177 >ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Capsella rubella] gi|482551949|gb|EOA16142.1| hypothetical protein CARUB_v10004277mg [Capsella rubella] Length = 690 Score = 273 bits (699), Expect = 2e-71 Identities = 126/182 (69%), Positives = 153/182 (84%) Frame = +2 Query: 5 TLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMY 184 +LFS+ + S RPNE+TFAG+LNACA T E+LG+QVHG+M R GFDP SFA+S+LV MY Sbjct: 304 SLFSELIGSCERPNEYTFAGILNACADLTKEDLGKQVHGYMTRIGFDPYSFASSSLVDMY 363 Query: 185 TKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFI 364 TKCG++E A V PKPDLVSWTSLI GYAQNG+P EALK FDLLL+SG +PDH+TF+ Sbjct: 364 TKCGNIESAKHVVDGCPKPDLVSWTSLIGGYAQNGKPDEALKYFDLLLKSGTKPDHVTFV 423 Query: 365 GVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPI 544 VLSACTHAGLV+KG+EYFHSI EKHGLSHT+DHY C+VDLL+RSGRF + + I ++MP+ Sbjct: 424 NVLSACTHAGLVEKGLEYFHSITEKHGLSHTSDHYTCLVDLLARSGRFEQLKSITSEMPM 483 Query: 545 KP 550 KP Sbjct: 484 KP 485 Score = 100 bits (250), Expect = 2e-19 Identities = 55/172 (31%), Positives = 91/172 (52%) Frame = +2 Query: 38 
RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RPN FT + + A A G+++HGH++R G D S+L+ MY KCG ++ A Sbjct: 214 RPNIFTVSSAVAAAAAIPCIRRGKEIHGHIVRAGLDSDEVLWSSLIDMYGKCGCIDEARN 273 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + D+VSWTS+I+ Y ++ + E LF L+ S +P+ TF G+L+AC Sbjct: 274 IFDKILVKDVVSWTSMIDRYFKSRRWREGFSLFSELIGSCERPNEYTFAGILNACADLTK 333 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 D G + H + G + + +VD+ ++ G A+ +++ P KPD Sbjct: 334 EDLG-KQVHGYMTRIGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCP-KPD 383 Score = 85.1 bits (209), Expect = 1e-14 Identities = 57/202 (28%), Positives = 93/202 (46%), Gaps = 32/202 (15%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P T+ ++ C+ A E G++VH H+ +GF P + L+ MY KCGS+ A K Sbjct: 81 KPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLGMYAKCGSLVDARK 140 Query: 218 VFKWLPKPDLVSWTSLINGYAQNG-------------------------------QPHEA 304 VF +PK D+ SW ++NGYA+ G QP EA Sbjct: 141 VFDDMPKRDVCSWNLMVNGYAEVGLVDEARKLFDEMPQRDSYSWTAMVAGYVKKDQPEEA 200 Query: 305 LKLFDLLLRSGN-QPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L L+ L+ R N +P+ T ++A + +G E H + GL ++ ++ Sbjct: 201 LVLYSLMQRVPNSRPNIFTVSSAVAAAAAIPCIRRGKE-IHGHIVRAGLDSDEVLWSSLI 259 Query: 482 DLLSRSGRFREAEDIINKMPIK 547 D+ + G EA +I +K+ +K Sbjct: 260 DMYGKCGCIDEARNIFDKILVK 281 >ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutrema salsugineum] gi|557113114|gb|ESQ53397.1| hypothetical protein EUTSA_v10026762mg [Eutrema salsugineum] Length = 694 Score = 270 bits (690), Expect = 2e-70 Identities = 126/182 (69%), Positives = 151/182 (82%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LFS+ +SS RPNE+TFAGVLNAC T EELG+QVHG+M R G+DP SFA+S+LV MYT Sbjct: 309 LFSELVSSCERPNEYTFAGVLNACTDLTTEELGKQVHGYMTRIGYDPYSFASSSLVDMYT 368 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG+++ A V PKPDL SWTSLI GYAQNG+P +ALK FDLLL SG +PDHITF+ Sbjct: 369 
KCGNIQSAKHVVDGCPKPDLFSWTSLIGGYAQNGEPEKALKYFDLLLESGTKPDHITFVN 428 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSACTHAGLV+KG+EYFHSI EKHGLSHT DHY C+VDLL+RSGRF + + II++MP+K Sbjct: 429 VLSACTHAGLVEKGLEYFHSITEKHGLSHTDDHYTCLVDLLARSGRFEQLKGIISEMPMK 488 Query: 548 PD 553 P+ Sbjct: 489 PN 490 Score = 99.0 bits (245), Expect = 7e-19 Identities = 55/172 (31%), Positives = 92/172 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +PN FT + + A A G+++HGH+ R G D S+L+ MY KCG ++ A Sbjct: 218 KPNIFTVSSAVAAAAAIPCIRRGKEIHGHIFRAGLDSDEVLWSSLMDMYGKCGCIDEARH 277 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + D+VSWTS+I+ Y ++ + E LF L+ S +P+ TF GVL+ACT Sbjct: 278 IFDKIVDKDVVSWTSMIDRYFKSRRWREGFCLFSELVSSCERPNEYTFAGVLNACTDLTT 337 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 + G + H + G + + +VD+ ++ G + A+ +++ P KPD Sbjct: 338 EELG-KQVHGYMTRIGYDPYSFASSSLVDMYTKCGNIQSAKHVVDGCP-KPD 387 Score = 79.3 bits (194), Expect = 6e-13 Identities = 37/94 (39%), Positives = 56/94 (59%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P T+ ++ C+ + A E G++VH H+ +GF P + L+ MY KCGS+ A K Sbjct: 85 KPPASTYCNLIQVCSQKRALEEGKKVHEHIKNSGFVPGVVICNRLLGMYAKCGSLIDARK 144 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFD 319 +F +P D+ SW ++NGYA+ G EA KLFD Sbjct: 145 LFDEMPNKDVCSWNIMVNGYAEVGLLEEARKLFD 178 >ref|NP_195434.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75097747|sp|O23169.1|PP353_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g37170 gi|2464864|emb|CAB16758.1| putative protein [Arabidopsis thaliana] gi|7270666|emb|CAB80383.1| putative protein [Arabidopsis thaliana] gi|332661361|gb|AEE86761.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 691 Score = 268 bits (685), Expect = 6e-70 Identities = 125/182 (68%), Positives = 152/182 (83%) Frame = +2 Query: 5 
TLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMY 184 +LFS+ + S RPNE+TFAGVLNACA T EELG+QVHG+M R GFDP SFA+S+LV MY Sbjct: 305 SLFSELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMY 364 Query: 185 TKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFI 364 TKCG++E A V PKPDLVSWTSLI G AQNGQP EALK FDLLL+SG +PDH+TF+ Sbjct: 365 TKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFV 424 Query: 365 GVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPI 544 VLSACTHAGLV+KG+E+F+SI EKH LSHT+DHY C+VDLL+RSGRF + + +I++MP+ Sbjct: 425 NVLSACTHAGLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPM 484 Query: 545 KP 550 KP Sbjct: 485 KP 486 Score = 100 bits (248), Expect = 3e-19 Identities = 55/172 (31%), Positives = 92/172 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 RPN FT + + A A G+++HGH++R G D S+L+ MY KCG ++ A Sbjct: 215 RPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARN 274 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + + D+VSWTS+I+ Y ++ + E LF L+ S +P+ TF GVL+AC Sbjct: 275 IFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLNACADLTT 334 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 + G + H + G + + +VD+ ++ G A+ +++ P KPD Sbjct: 335 EELG-KQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCP-KPD 384 Score = 82.0 bits (201), Expect = 9e-14 Identities = 54/198 (27%), Positives = 87/198 (43%), Gaps = 31/198 (15%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P T+ ++ C+ A E G++VH H+ +GF P + L+ MY KCGS+ A K Sbjct: 82 KPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLVDARK 141 Query: 218 VFKWLPKPDLVSWTSLINGYAQNG-------------------------------QPHEA 304 VF +P DL SW ++NGYA+ G QP EA Sbjct: 142 VFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAMVTGYVKKDQPEEA 201 Query: 305 LKLFDLLLRSGNQPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVD 484 L L+ L+ R N +I + + A A + + H + GL ++ ++D Sbjct: 202 
LVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSLMD 261 Query: 485 LLSRSGRFREAEDIINKM 538 + + G EA +I +K+ Sbjct: 262 MYGKCGCIDEARNIFDKI 279 >ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Cucumis sativus] gi|449484855|ref|XP_004156999.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Cucumis sativus] Length = 724 Score = 265 bits (678), Expect = 4e-69 Identities = 124/182 (68%), Positives = 150/182 (82%) Frame = +2 Query: 8 LFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYT 187 LF ++S I PN+FTFAGVLNACA AE+LG+Q+H +M+R GFD S AASALVHMY+ Sbjct: 339 LFRHLMNSNIMPNDFTFAGVLNACADLAAEDLGKQIHAYMVRVGFDSFSSAASALVHMYS 398 Query: 188 KCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIG 367 KCG +E A VF+ LP+PDL SWTSL+ GYAQ+GQ +AL F+LLL+SG +PD I FIG Sbjct: 399 KCGDIENAKSVFEILPQPDLFSWTSLLVGYAQHGQHDKALHFFELLLKSGTKPDGIAFIG 458 Query: 368 VLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIK 547 VLSAC HAGLVDKG+EYFHSIKEKHGL+ T DHYAC++DLL+R+G+F EAE IIN+MPIK Sbjct: 459 VLSACAHAGLVDKGLEYFHSIKEKHGLTRTIDHYACIIDLLARAGQFTEAESIINEMPIK 518 Query: 548 PD 553 PD Sbjct: 519 PD 520 Score = 104 bits (259), Expect = 2e-20 Identities = 56/172 (32%), Positives = 92/172 (53%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 + N+ T + L A A + +G+++HGH+MR G D +L+ MY KCGS+E A Sbjct: 248 KSNKCTISSALAASAAIPSLHMGKKIHGHIMRMGLDSDEVVWCSLLDMYGKCGSIEEARY 307 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + + D+VSWT++I+ Y +NG+ E LF L+ S P+ TF GVL+AC Sbjct: 308 IFDKMEERDVVSWTTMIHTYLKNGRREEGFALFRHLMNSNIMPNDFTFAGVLNACADLAA 367 Query: 398 VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 D G + H+ + G + + +V + S+ G A+ + +P +PD Sbjct: 368 EDLG-KQIHAYMVRVGFDSFSSAASALVHMYSKCGDIENAKSVFEILP-QPD 417 Score = 71.6 bits (174), Expect = 1e-10 Identities = 42/138 (30%), Positives = 71/138 (51%) Frame = +2 Query: 38 
RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P + +L C Q A + G+QVH H+ +G L + ++ L+ MY KCGS+ A K Sbjct: 116 KPYASIYLTLLKFCLKQRALKEGKQVHAHIKTSGSIGL-YISNRLLDMYAKCGSLVDAEK 174 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 VF + DL SW +I+GY + G +A LFD + D+ ++ ++S C Sbjct: 175 VFDEMVHRDLCSWNIMISGYVKGGNFEKARNLFDKM----PNRDNFSWTAIISGCVQHNR 230 Query: 398 VDKGVEYFHSIKEKHGLS 451 ++ +E + + +KH S Sbjct: 231 PEEALELYR-LMQKHDYS 247 >ref|XP_002869013.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314849|gb|EFH45272.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 693 Score = 264 bits (674), Expect = 1e-68 Identities = 121/182 (66%), Positives = 152/182 (83%) Frame = +2 Query: 5 TLFSDFLSSGIRPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMY 184 +LFS+ + S RPNE+TF+GVLNACA T EELGRQVHG+M R GFDP SFA+S+L+ MY Sbjct: 307 SLFSELIGSCERPNEYTFSGVLNACADLTTEELGRQVHGYMTRVGFDPYSFASSSLIDMY 366 Query: 185 TKCGSVERAHKVFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFI 364 TKCG++E A V PKPDLVS TSLI GYAQNG+P EALK FDLLL+SG +PDH+TF+ Sbjct: 367 TKCGNIESARHVVDGCPKPDLVSLTSLIGGYAQNGKPDEALKYFDLLLKSGTKPDHVTFV 426 Query: 365 GVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPI 544 VLSACTHAGLV+KG+E+F+SI EKH L+HT+DHY C+VDLL+RSGRF + + ++++MP+ Sbjct: 427 NVLSACTHAGLVEKGLEFFYSITEKHDLTHTSDHYTCLVDLLARSGRFEQLKSVLSEMPM 486 Query: 545 KP 550 KP Sbjct: 487 KP 488 Score = 97.8 bits (242), Expect = 2e-18 Identities = 53/172 (30%), Positives = 89/172 (51%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +PN FT + + A A G+++HGH++R G D S+L+ MY KCG ++ A Sbjct: 217 KPNIFTVSSAVAAAAAIKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARN 276 Query: 218 VFKWLPKPDLVSWTSLINGYAQNGQPHEALKLFDLLLRSGNQPDHITFIGVLSACTHAGL 397 +F + D+VSWTS+I+ Y ++ + E LF L+ S +P+ TF GVL+AC Sbjct: 277 IFDKIIDKDVVSWTSMIDRYFKSSRWREGFSLFSELIGSCERPNEYTFSGVLNACADLTT 336 Query: 398 
VDKGVEYFHSIKEKHGLSHTADHYACVVDLLSRSGRFREAEDIINKMPIKPD 553 + G H + G + + ++D+ ++ G A +++ P KPD Sbjct: 337 EELG-RQVHGYMTRVGFDPYSFASSSLIDMYTKCGNIESARHVVDGCP-KPD 386 Score = 80.5 bits (197), Expect = 2e-13 Identities = 54/199 (27%), Positives = 91/199 (45%), Gaps = 32/199 (16%) Frame = +2 Query: 38 RPNEFTFAGVLNACAHQTAEELGRQVHGHMMRTGFDPLSFAASALVHMYTKCGSVERAHK 217 +P T+ ++ C+ A E G++VH H+ +GF P + ++ MY KCGS+ A K Sbjct: 84 KPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRILGMYAKCGSLVDARK 143 Query: 218 VFKWLPKPDLVSWTSLINGYAQNG-------------------------------QPHEA 304 VF +P+ D+ SW ++NGYA+ G QP EA Sbjct: 144 VFDEMPERDVCSWNVMVNGYAEVGLLEEARNLFDEMPERDSYSWTAMVTGYVKKDQPEEA 203 Query: 305 LKLFDLLLRSGN-QPDHITFIGVLSACTHAGLVDKGVEYFHSIKEKHGLSHTADHYACVV 481 L L+ L+ R N +P+ T ++A + +G E H + GL ++ ++ Sbjct: 204 LVLYSLMQRVPNSKPNIFTVSSAVAAAAAIKCIRRGKE-IHGHIVRAGLDSDEVLWSSLM 262 Query: 482 DLLSRSGRFREAEDIINKM 538 D+ + G EA +I +K+ Sbjct: 263 DMYGKCGCIDEARNIFDKI 281