BLASTX nr result
ID: Akebia22_contig00018822
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00018822 (1836 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containi...   677   0.0
ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containi...   670   0.0
gb|EYU27896.1| hypothetical protein MIMGU_mgv1a005951mg [Mimulus...   669   0.0
ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [A...   652   0.0
ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-179
ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containi...   635   e-179
ref|XP_007030290.1| Pentatricopeptide repeat (PPR) superfamily p...   627   e-177
gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis]     622   e-175
gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlise...   604   e-170
ref|XP_007206771.1| hypothetical protein PRUPE_ppa025361mg [Prun...   603   e-169
ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containi...   593   e-167
ref|XP_002522032.1| pentatricopeptide repeat-containing protein,...   586   e-164
ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutr...   579   e-162
ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containi...   579   e-162
ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, part...   578   e-162
ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containi...   576   e-161
ref|NP_197340.1| pentatricopeptide repeat-containing protein [Ar...   576   e-161
ref|XP_002871814.1| pentatricopeptide repeat-containing protein ...   576   e-161
ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containi...
563 e-158 ref|XP_007159095.1| hypothetical protein PHAVU_002G208300g [Phas... 562 e-157 >ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X1 [Solanum tuberosum] gi|565385886|ref|XP_006358788.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X2 [Solanum tuberosum] gi|565385889|ref|XP_006358789.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X3 [Solanum tuberosum] Length = 472 Score = 677 bits (1747), Expect = 0.0 Identities = 323/420 (76%), Positives = 373/420 (88%), Gaps = 3/420 (0%) Frame = -1 Query: 1638 ISNDDYFAAIHHISNIVRRDFYLERTLQKMQIN--VTSELVYRVLRSCGKSGIESFRFFN 1465 + NDDYFA IHH+SNIVRRD YLERTL KM I+ V SELVYRVLRSC + GIESFRFFN Sbjct: 53 VPNDDYFATIHHVSNIVRRDIYLERTLNKMHISSIVNSELVYRVLRSCCQHGIESFRFFN 112 Query: 1464 WARNQ-PRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMIIESYGQ 1288 WAR Q P+Y+PTT+EFEEL+KTL RT +WETMWKV +QMK Q+ ++P S IIE YG+ Sbjct: 113 WARTQHPQYDPTTVEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFIIEHYGK 172 Query: 1287 NGFVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRT 1108 G +D+AVE+FNR+KNF+CPQTT VYN++LFALCEVKNFQGAYALIRRMIRKG VPDK+T Sbjct: 173 RGLIDQAVELFNRLKNFDCPQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGTVPDKQT 232 Query: 1107 FSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMT 928 +SILVNGWCSAGK+REAQEFLEEMSRKGFNPPVRGRDLLIDGLL+AGY ESAKGLVRKMT Sbjct: 233 YSILVNGWCSAGKMREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKGLVRKMT 292 Query: 927 KEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRID 748 KEGF+PD+ TFNSL EAICK E+DFCIDL +DVC+ GL PDI TYKI+I SK+GRID Sbjct: 293 KEGFVPDVGTFNSLAEAICKTGEIDFCIDLFNDVCRSGLFPDIETYKIVITAASKVGRID 352 Query: 747 EAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLI 568 EAF++LH SIE GH+PFPSLYAPILKA R G+FDDAFSFFS++K++GHPPNRP+YTMLI Sbjct: 353 EAFQILHRSIEAGHRPFPSLYAPILKAFFRRGQFDDAFSFFSEMKLKGHPPNRPLYTMLI 412 Query: 567 
KMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 KMC RGGRF+EA+NYLVEMTELNL P ++SFDMVTDGLKNCGKHDLAKR+E+LEIS++G+ Sbjct: 413 KMCSRGGRFVEASNYLVEMTELNLLPMSRSFDMVTDGLKNCGKHDLAKRIEQLEISVKGI 472 >ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like [Solanum lycopersicum] Length = 470 Score = 670 bits (1729), Expect = 0.0 Identities = 321/420 (76%), Positives = 368/420 (87%), Gaps = 3/420 (0%) Frame = -1 Query: 1638 ISNDDYFAAIHHISNIVRRDFYLERTLQKMQIN--VTSELVYRVLRSCGKSGIESFRFFN 1465 + NDDYFA IHH+SNIVRRD YLERTL KM I+ V SELVYRVLRSC + GIESFRFFN Sbjct: 51 VPNDDYFATIHHVSNIVRRDIYLERTLNKMHISRIVNSELVYRVLRSCCQHGIESFRFFN 110 Query: 1464 WARNQ-PRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMIIESYGQ 1288 WAR Q P+Y+PTT+EFEEL+KTL RT +WETMWKV +QMK Q+ ++P S IIE YG+ Sbjct: 111 WARTQHPQYDPTTVEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFIIEHYGK 170 Query: 1287 NGFVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRT 1108 G +D+AVE+FNR+KNF C QTT VYN++LFALCEVKNFQGAYALIRRMIRKG VPDK T Sbjct: 171 RGLIDQAVELFNRLKNFGCSQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGTVPDKLT 230 Query: 1107 FSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMT 928 +SILVNGWCSAGK+REAQEFLEEMSRKGFNPPVRGRDLLIDGLL+AGY ESAKGLVRKMT Sbjct: 231 YSILVNGWCSAGKMREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKGLVRKMT 290 Query: 927 KEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRID 748 KEGF+PD+ TFNSL EA+CK E+DFCIDL +DVC+LGL PD TYKI+I +K GRID Sbjct: 291 KEGFVPDVGTFNSLAEAVCKTGEIDFCIDLFNDVCRLGLCPDTETYKIVITAAAKAGRID 350 Query: 747 EAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLI 568 EAF++LH SIE GH+PFPSLYAPILKA R G+FDDAFSFFSD+KV+GHPPNRP+YTMLI Sbjct: 351 EAFQILHRSIEAGHRPFPSLYAPILKAFFRRGQFDDAFSFFSDMKVKGHPPNRPLYTMLI 410 Query: 567 KMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 KMC RGGRF+EA+NYLVEMTELNL P ++SFD VTDGLKNCGKHDLAKR+E+LEIS++G+ Sbjct: 411 KMCCRGGRFVEASNYLVEMTELNLLPMSRSFDTVTDGLKNCGKHDLAKRIEQLEISVKGI 470 >gb|EYU27896.1| hypothetical protein 
MIMGU_mgv1a005951mg [Mimulus guttatus] Length = 463 Score = 669 bits (1726), Expect = 0.0 Identities = 319/425 (75%), Positives = 369/425 (86%), Gaps = 3/425 (0%) Frame = -1 Query: 1653 TSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQIN--VTSELVYRVLRSCGKSGIES 1480 T+ G+ NDDYFA IHHISNIVRRD YLERTL KM+I+ V SELVYRVLRSC GIES Sbjct: 39 TTARGLPNDDYFATIHHISNIVRRDIYLERTLNKMRISNIVNSELVYRVLRSCCSCGIES 98 Query: 1479 FRFFNWAR-NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMII 1303 FRFFNWAR + P Y+PTTLEFEEL++ L +TR+WETMWKVA+ MK Q+F ++P S II Sbjct: 99 FRFFNWARTHHPNYDPTTLEFEELLRILAKTRHWETMWKVAQSMKTQNFPISPSVVSFII 158 Query: 1302 ESYGQNGFVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGV 1123 E Y ++G +D+AV++FN +KNF+CPQTT VYNSLLFALCEVKNFQGAYALIRRMIRKG V Sbjct: 159 EQYAKHGLIDQAVDLFNGLKNFDCPQTTEVYNSLLFALCEVKNFQGAYALIRRMIRKGSV 218 Query: 1122 PDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGL 943 PDK+T+S+LVN WCSAGK+REAQEFLEEMS+KG+ PPVRGRDLLIDGLLNAGY ESAKGL Sbjct: 219 PDKKTYSVLVNAWCSAGKMREAQEFLEEMSKKGYKPPVRGRDLLIDGLLNAGYLESAKGL 278 Query: 942 VRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSK 763 VRKM+K G++PD++TFN L EA+ K E+DFCIDL +DVC LG PD TYKIMI SK Sbjct: 279 VRKMSKGGYVPDVSTFNCLAEALSKSGEIDFCIDLFNDVCGLGFCPDTDTYKIMITVTSK 338 Query: 762 LGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPV 583 +GR+DE F++LH SIEDGHKPFPSLYAPILKALCR G+FDDAFSFF+D+KV+GHPPNRPV Sbjct: 339 IGRVDEGFKILHRSIEDGHKPFPSLYAPILKALCRRGQFDDAFSFFADMKVKGHPPNRPV 398 Query: 582 YTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEI 403 YTMLI+MC RGGR +EA NYL+EMTELNLSP +QSFDMV DGLKNCGKHDLAKRME+LEI Sbjct: 399 YTMLIRMCVRGGRCVEAGNYLMEMTELNLSPMSQSFDMVCDGLKNCGKHDLAKRMEQLEI 458 Query: 402 SLRGV 388 SLRG+ Sbjct: 459 SLRGI 463 >ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [Amborella trichopoda] gi|548837251|gb|ERM98077.1| hypothetical protein AMTR_s00210p00017530 [Amborella trichopoda] Length = 459 Score = 652 bits (1683), Expect = 0.0 Identities = 312/415 (75%), Positives = 361/415 (86%) Frame = 
-1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 S DDYFA +HHISNIVRRD++LERTLQK+ + +T ELVYRVLRSC K+GIESFRFFNWAR Sbjct: 44 SKDDYFAVVHHISNIVRRDYFLERTLQKLNLTLTPELVYRVLRSCNKNGIESFRFFNWAR 103 Query: 1455 NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMIIESYGQNGFV 1276 Y PTT+EFEELIKTLG+T+NWETMWKVA+ MK F L+PETFS +++SYG+ G + Sbjct: 104 THASYHPTTIEFEELIKTLGQTKNWETMWKVADHMKILGFPLSPETFSAVMDSYGKAGLL 163 Query: 1275 DRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFSIL 1096 DRAVEVFNRMK+F+CPQTT VYNSLL ALC VKNFQGAYALIRRMIRKGG PDK+T++IL Sbjct: 164 DRAVEVFNRMKHFDCPQTTGVYNSLLSALCMVKNFQGAYALIRRMIRKGGHPDKQTYAIL 223 Query: 1095 VNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKEGF 916 VNGWCS+GKL EA+EFLEEMS+KGFNPPVRGRDLLIDGLLNAGY ESAK LV+KMTKEGF Sbjct: 224 VNGWCSSGKLGEAREFLEEMSKKGFNPPVRGRDLLIDGLLNAGYLESAKELVKKMTKEGF 283 Query: 915 LPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEAFR 736 LPDI+TFNSLLEA+C E +FCI+LL V +L L DI TYKI+IP VSK G+IDEAFR Sbjct: 284 LPDISTFNSLLEALCNSGETEFCIELLRVVTELSLVLDIGTYKILIPAVSKSGQIDEAFR 343 Query: 735 LLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKMCG 556 LLH SIEDGHKPFPSLYAP+LK LC+ G+F DAFS F+D+K +GH PNRPVYTML++MC Sbjct: 344 LLHASIEDGHKPFPSLYAPLLKVLCKRGQFGDAFSLFADMKAEGHAPNRPVYTMLMRMCC 403 Query: 555 RGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRG 391 RGGR ++AANYLVEM E L+PR++SFDMV DGLKN GKHDLAKR++ +EISLRG Sbjct: 404 RGGRCVDAANYLVEMVERGLAPRSESFDMVIDGLKNAGKHDLAKRIDHMEISLRG 458 >ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Vitis vinifera] gi|296085168|emb|CBI28663.3| unnamed protein product [Vitis vinifera] Length = 454 Score = 636 bits (1640), Expect = e-179 Identities = 305/427 (71%), Positives = 364/427 (85%), Gaps = 1/427 (0%) Frame = -1 Query: 1665 EEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGI 1486 + ITS+ DDYFA +HHIS IVRRDFYLERTL K+ I+VTS+LVYRVLRSC SG Sbjct: 36 
QNTITSK----KDDYFAVVHHISAIVRRDFYLERTLNKLPISVTSDLVYRVLRSCPNSGT 91 Query: 1485 ESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMI 1306 ES RFFNWAR+ Y+PTTLE+EEL+KTL RT+ ++ MWK+A QM+ L+P S I Sbjct: 92 ESLRFFNWARSHLSYQPTTLEYEELLKTLARTKQFQPMWKIAHQMQ----TLSPTVVSSI 147 Query: 1305 IESYGQNGFVDRAVEVFNRMKN-FNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKG 1129 IE +G++G VD+AVEVFN+ K+ NCPQT VYNSLLFALCEVK F GAYALIRRMIRKG Sbjct: 148 IEEFGKHGLVDQAVEVFNKAKSALNCPQTIEVYNSLLFALCEVKYFHGAYALIRRMIRKG 207 Query: 1128 GVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAK 949 P+K+T+S+LVNGWC+AGK++EAQ+FLEEMSRKGFNPPVRGRDLL+DGLLNAGY E+AK Sbjct: 208 VTPNKQTYSVLVNGWCAAGKMKEAQDFLEEMSRKGFNPPVRGRDLLVDGLLNAGYLEAAK 267 Query: 948 GLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNV 769 +VRKMTKEG PD+ T NS+LEAICK E +FCID+ +DVC+LG++P++ TYKIMIP Sbjct: 268 EMVRKMTKEGCAPDVETLNSMLEAICKAGEAEFCIDIYNDVCRLGVSPNVGTYKIMIPAA 327 Query: 768 SKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNR 589 K GRIDEAFR+LH SIEDGH+PFPSLYAPI+KALCR G+FDDAF FFSD+KV+GHPPNR Sbjct: 328 CKEGRIDEAFRILHRSIEDGHRPFPSLYAPIIKALCRNGQFDDAFCFFSDMKVKGHPPNR 387 Query: 588 PVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEEL 409 PVYTMLI MCGRGGRF++AANYLVEMTELNL+P ++ FDMVTDGLKNCGKHDLA+++E+L Sbjct: 388 PVYTMLITMCGRGGRFVDAANYLVEMTELNLTPISRCFDMVTDGLKNCGKHDLARKIEQL 447 Query: 408 EISLRGV 388 E+SLRGV Sbjct: 448 EVSLRGV 454 >ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like [Cucumis sativus] gi|449518358|ref|XP_004166209.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like [Cucumis sativus] Length = 455 Score = 635 bits (1639), Expect = e-179 Identities = 301/432 (69%), Positives = 359/432 (83%), Gaps = 3/432 (0%) Frame = -1 Query: 1674 NNHEEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQI-NVTSELVYRVLRSCG 1498 +N ++ S DDYFAAIHHIS+IVRRDFY+ERTL K++I N+ SELV+RVLR+C Sbjct: 24 SNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLRACS 83 Query: 1497 
KSGIESFRFFNWA-RNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPE 1321 SG ESFRFFNWA + P Y+PTTLEFEEL+KTL RTR + TMWKV QMK Q+ ++PE Sbjct: 84 NSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPE 143 Query: 1320 TFSMIIESYGQNGFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRR 1144 T S II+ YG+ G VD AV +FN+ K+ +CPQT VYN+LLFALCEVK F GAYALIRR Sbjct: 144 TISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRR 203 Query: 1143 MIRKGGVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGY 964 MIRKG PDK+T+ LV GWCSAGK++EAQEFLEEMS+KGFNPP+RGRDLL++GLLNAGY Sbjct: 204 MIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGY 263 Query: 963 FESAKGLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKI 784 ESAK +VRKMTKEG +PDI TFNSL++ IC EVDFCI++ H+VCKLGL PDI+TYKI Sbjct: 264 LESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKI 323 Query: 783 MIPNVSKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQG 604 +IP SK+GRIDEAFRLLHC IEDGH PFPSLY PILK +C+ G+FDDAF FF D+K +G Sbjct: 324 LIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKG 383 Query: 603 HPPNRPVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAK 424 HPPNRPVYTMLI MCGRGGRF++AANYL+EM EL L P ++ FDMVTDGLKNCGKHDLAK Sbjct: 384 HPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHDLAK 443 Query: 423 RMEELEISLRGV 388 ++E+LE+S+RG+ Sbjct: 444 KIEQLEVSIRGI 455 >ref|XP_007030290.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508718895|gb|EOY10792.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 455 Score = 627 bits (1618), Expect = e-177 Identities = 303/430 (70%), Positives = 360/430 (83%) Frame = -1 Query: 1677 SNNHEEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCG 1498 +N+ + S +S DDYFAAIHHISN VRR+ + ERTL +M I+V SELV+RVLRSC Sbjct: 28 ANSLQIASVSTTAVSKDDYFAAIHHISNTVRREVHPERTLNRMNISVNSELVFRVLRSCS 87 Query: 1497 KSGIESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPET 1318 S ES RFF+WAR Y PT++EFEEL+K L R R +E+MWK +QM+KQ+ +L+ +T Sbjct: 88 
NSPTESLRFFSWAR--AHYVPTSVEFEELVKILIRHRKYESMWKTIQQMQKQNLSLSCDT 145 Query: 1317 FSMIIESYGQNGFVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMI 1138 S IIE YG+NG VD+AVEVFN+ + C QT VYNSLLFALCEVK F GAYALIRRMI Sbjct: 146 LSFIIEEYGKNGLVDQAVEVFNKSTSLGCKQTVSVYNSLLFALCEVKMFHGAYALIRRMI 205 Query: 1137 RKGGVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFE 958 RKG VPDKRT++ILVNGWCS GK+REAQEFLEEMS+ GFNPPVRGRDLL++GLLNAGY E Sbjct: 206 RKGEVPDKRTYAILVNGWCSGGKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLNAGYLE 265 Query: 957 SAKGLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMI 778 SAK +VR+MTKEGF+PDI TFNSL+E IC EVDFCI++ H VCKLGL PDI+TYKI+I Sbjct: 266 SAKEMVRRMTKEGFVPDIGTFNSLVETICSSGEVDFCINMYHSVCKLGLCPDINTYKILI 325 Query: 777 PNVSKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHP 598 P SK+GRIDEAFRLL+ S+EDG++PFPSLYAPI+KA+CR G+FDDAFSFF ++KV+GH Sbjct: 326 PAASKVGRIDEAFRLLNNSVEDGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMKVKGHS 385 Query: 597 PNRPVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRM 418 PNRPVYTMLI MCGRGGRF+EAANYLVEMTEL L+P ++ FDMV DGLKNCGKHDLAKR+ Sbjct: 386 PNRPVYTMLITMCGRGGRFVEAANYLVEMTELGLAPISRCFDMVIDGLKNCGKHDLAKRI 445 Query: 417 EELEISLRGV 388 E+LE+SLRGV Sbjct: 446 EQLEVSLRGV 455 >gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis] Length = 470 Score = 622 bits (1603), Expect = e-175 Identities = 301/420 (71%), Positives = 355/420 (84%), Gaps = 4/420 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQIN-VTSELVYRVLRSCGKSGIESFRFFNWA 1459 S D+YFAAIHHISNIV+RDFY+ERTL K++I V S+LV+RVLR+C K G ES RFFNWA Sbjct: 51 SKDNYFAAIHHISNIVQRDFYMERTLNKLRIAAVDSDLVFRVLRACHKFGPESLRFFNWA 110 Query: 1458 RN-QPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMK-KQDFALTPETFSMIIESYGQN 1285 R+ QP Y PT++E EEL K L RT+ +E+MWK+ +QMK + ++ ET IIE YG+ Sbjct: 111 RSHQPSYRPTSVELEELAKNLARTKKYESMWKILQQMKTNNNLIISSETLCFIIEEYGKQ 170 Query: 1284 GFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRT 1108 G VD+A EVFNR+ K FNC QT VYNSLLFALCEVK F GAYAL+RRMIRK VPDKRT Sbjct: 171 
GLVDQAAEVFNRVPKIFNCSQTVEVYNSLLFALCEVKLFHGAYALVRRMIRKEVVPDKRT 230 Query: 1107 FSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMT 928 +SILVN WCSAGK+REAQ FL EMS+KGFNPPVRGRDLLI+GLLNAGY ESAK +VRKM Sbjct: 231 YSILVNAWCSAGKMREAQNFLSEMSKKGFNPPVRGRDLLIEGLLNAGYIESAKEMVRKMV 290 Query: 927 KEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRID 748 KEGFLPD++TFNSL+E ICK EEV+FCIDL H VC LGL PDI+TYK++IP VSK G+ID Sbjct: 291 KEGFLPDVSTFNSLVEVICKSEEVEFCIDLYHQVCGLGLCPDINTYKVLIPAVSKAGQID 350 Query: 747 EAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLI 568 EAFRLLH SIEDGHKPFPSLYAPI+K +CR G+FDDA FF ++KV+GHPPNRPVYTMLI Sbjct: 351 EAFRLLHSSIEDGHKPFPSLYAPIIKGMCRKGQFDDALCFFGEMKVKGHPPNRPVYTMLI 410 Query: 567 KMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 MCGRGGRF++AANYLVEMTE+ L+P ++ FD+VTDGLKNCGKHDLA+R+E+LE+S RG+ Sbjct: 411 TMCGRGGRFVDAANYLVEMTEIGLTPISRCFDLVTDGLKNCGKHDLARRIEQLEVSARGM 470 >gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlisea aurea] Length = 419 Score = 604 bits (1557), Expect = e-170 Identities = 295/419 (70%), Positives = 343/419 (81%), Gaps = 5/419 (1%) Frame = -1 Query: 1629 DDYFAAIHHISNIVRRDFYLERTLQKMQIN--VTSELVYRVLRSCGKSGIESFRFFNWAR 1456 DDYFA IHHISNIVRRD YLERTL KM I+ V SELVYRV+ +C SGIESFRFFNWAR Sbjct: 1 DDYFATIHHISNIVRRDIYLERTLMKMNISSLVNSELVYRVINNCSSSGIESFRFFNWAR 60 Query: 1455 N-QPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMIIESYGQNGF 1279 + P YEPTTLEFE L+K L +T++WETMWKV MK Q ++P S IIE Y ++G Sbjct: 61 SCHPNYEPTTLEFEALLKVLAQTKHWETMWKVVHTMKSQQSPISPGIMSFIIEQYAKHGL 120 Query: 1278 VDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFSI 1099 +D+AVE+FN +KN +CPQT VYNSLLFALCEVKNFQGAYAL+RRMIRKG VPDKRT+SI Sbjct: 121 IDKAVELFNGLKNLDCPQTIEVYNSLLFALCEVKNFQGAYALVRRMIRKGNVPDKRTYSI 180 Query: 1098 LVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKEG 919 LVN WC AGKL EAQEFLEEMS+KG+NPPVRGRDLLIDGLLNAGY E AKGLVRKMTK G Sbjct: 181 LVNAWCRAGKLIEAQEFLEEMSKKGYNPPVRGRDLLIDGLLNAGYLECAKGLVRKMTKIG 240 Query: 918 
FLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEAF 739 +PDIATFNSL EA+CK E D CI D+C+LG P+ TYKIMI S+ GRIDEA Sbjct: 241 SIPDIATFNSLAEALCKNGETDACIASFDDICELGFCPNSDTYKIMITAASRDGRIDEAV 300 Query: 738 RLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKMC 559 R++ SIE+G +PFPSLYAPILK R G+FDDAFSFFS++KV+GH PNRP+YTML+K+C Sbjct: 301 RMIQRSIEEGQRPFPSLYAPILKGFIRRGQFDDAFSFFSEMKVKGHVPNRPIYTMLVKLC 360 Query: 558 GRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRME--ELEISLRGV 388 RGGRF+EAANYLVEM ELNL P ++SFDMV DGLK CGK+DLA+R++ E++ISLRG+ Sbjct: 361 VRGGRFVEAANYLVEMIELNLLPMSKSFDMVCDGLKKCGKYDLAQRIQRMEMDISLRGI 419 >ref|XP_007206771.1| hypothetical protein PRUPE_ppa025361mg [Prunus persica] gi|462402413|gb|EMJ07970.1| hypothetical protein PRUPE_ppa025361mg [Prunus persica] Length = 460 Score = 603 bits (1554), Expect = e-169 Identities = 289/418 (69%), Positives = 349/418 (83%), Gaps = 3/418 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 + DDYF+AI HI+NIVRRD ++ERTL K++I V SELVYRVLR+C +G ES RFFNWAR Sbjct: 42 TKDDYFSAIQHITNIVRRDHFMERTLNKLRITVDSELVYRVLRACSAAGTESLRFFNWAR 101 Query: 1455 -NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQD-FALTPETFSMIIESYGQNG 1282 + P Y PTTLE EEL+KTL RT+ +E+MWK+ + M+ L+ E+ +IE YG +G Sbjct: 102 THHPTYHPTTLELEELVKTLARTKKYESMWKLLQSMQTHHGLTLSQESLCFVIEEYGNHG 161 Query: 1281 FVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTF 1105 VD+AVE+FNR K FNC QT VYN+LLF+LC+ K F AYAL+RRMIRKG VPDKRT+ Sbjct: 162 LVDQAVELFNRAPKTFNCLQTVEVYNALLFSLCQAKLFHAAYALVRRMIRKGLVPDKRTY 221 Query: 1104 SILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTK 925 SILVN WCS GK+REAQ FLEEMS KGFNPPVRGRDLL++GLLNAGY E+AK +VRKM K Sbjct: 222 SILVNAWCSNGKMREAQLFLEEMSSKGFNPPVRGRDLLVEGLLNAGYIEAAKEMVRKMVK 281 Query: 924 EGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDE 745 EGF+PD++TFNSL+EAICK EV+FCIDL + LGL PDI+TYK++IP VSK+GRID+ Sbjct: 282 EGFVPDVSTFNSLMEAICKCGEVEFCIDLYWEANGLGLCPDINTYKVLIPAVSKVGRIDD 341 Query: 744 
AFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIK 565 AFRLLH SIEDGH+PFPSLYAPI+K +CR G+FDDAF FFS++KV+GHPPNRPVYTMLI Sbjct: 342 AFRLLHNSIEDGHRPFPSLYAPIIKGMCRRGQFDDAFCFFSEMKVKGHPPNRPVYTMLIT 401 Query: 564 MCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRG 391 M GRGGRF+EAANYLVEMTE+ L P ++ FD+VTDGLKNCGKHD+AKR+E+LE+SLRG Sbjct: 402 MSGRGGRFVEAANYLVEMTEMGLMPISRCFDLVTDGLKNCGKHDMAKRIEQLEVSLRG 459 >ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like [Citrus sinensis] Length = 445 Score = 593 bits (1529), Expect = e-167 Identities = 282/417 (67%), Positives = 344/417 (82%), Gaps = 1/417 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGK-SGIESFRFFNWA 1459 S DDYFAA++HI+NIVR D Y ERTL ++ + +TSELVYRVLR C S ES RFF WA Sbjct: 29 SKDDYFAAVNHIANIVRHDIYPERTLNRLNLTLTSELVYRVLRVCHTTSPSESLRFFTWA 88 Query: 1458 RNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMIIESYGQNGF 1279 R+QP+Y PT+LEFE LI TL + +++MWK E MK + +++P+T S+IIE +G++G Sbjct: 89 RSQPQYSPTSLEFEPLILTLAHHKRYQSMWKTIELMKPYNLSVSPQTLSLIIEEFGKHGL 148 Query: 1278 VDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFSI 1099 VD AVEVFN+ FNC Q +YNSLLFALCEVK F GAYALIRRMIRKG VPDKRT++I Sbjct: 149 VDNAVEVFNKCTAFNCQQCVLLYNSLLFALCEVKLFHGAYALIRRMIRKGFVPDKRTYAI 208 Query: 1098 LVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKEG 919 LVN WCS+ K+REAQEFL+EMS KGFNPPVRGRDLL+ GLLNAGY ESAK +V KM K+G Sbjct: 209 LVNAWCSSWKMREAQEFLQEMSDKGFNPPVRGRDLLVQGLLNAGYLESAKQMVNKMIKQG 268 Query: 918 FLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEAF 739 +PD+ TFNSL+E ICK EV+FC+++ + VCKLGL D+STYKI+IP VSK G IDEAF Sbjct: 269 SVPDLETFNSLIETICKSGEVEFCVEMYYSVCKLGLCADVSTYKILIPAVSKAGMIDEAF 328 Query: 738 RLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKMC 559 RLLH +EDGHKPFPSLYAPI+K + R G+FDDAF FFS++K++GHPPNRPVYTMLI MC Sbjct: 329 RLLHNLVEDGHKPFPSLYAPIIKGMFRRGQFDDAFCFFSEMKIKGHPPNRPVYTMLITMC 388 Query: 558 
GRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 GRGGRF+EAANYLVEMTE+ L+P ++ FD+VTDGLKNCGKHDLA+++E+LE+SLR V Sbjct: 389 GRGGRFVEAANYLVEMTEMGLTPISRCFDLVTDGLKNCGKHDLAEKIEQLEVSLRSV 445 >ref|XP_002522032.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538836|gb|EEF40436.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 451 Score = 586 bits (1511), Expect = e-164 Identities = 278/418 (66%), Positives = 346/418 (82%), Gaps = 2/418 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 + D YFA IHHI+NIVRRDFY ERTL K+ VTSELV+RVLR+C +S ES RFFNW+R Sbjct: 36 TKDAYFALIHHITNIVRRDFYPERTLNKLNAPVTSELVFRVLRACSRSPTESLRFFNWSR 95 Query: 1455 NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQD--FALTPETFSMIIESYGQNG 1282 Y PT++E+EELIK L +++ + +MWK+ QMK Q+ F+++ ET IIE YG++G Sbjct: 96 AY--YTPTSIEYEELIKILAKSKRYSSMWKLITQMKDQNPQFSISSETVRSIIEEYGRSG 153 Query: 1281 FVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFS 1102 +D+AVEVFN+ + NC Q +YNSLLFALCEVK F GAYAL+RR+IRKG P+K T+S Sbjct: 154 LIDQAVEVFNQCNSLNCEQNVDIYNSLLFALCEVKLFHGAYALVRRLIRKGLAPNKTTYS 213 Query: 1101 ILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKE 922 +LVNGWCS GK +EAQ FLEEMS+KGFNPPVRGRDLLI+GLLNAGYFESAK +V KM+KE Sbjct: 214 VLVNGWCSNGKFKEAQLFLEEMSKKGFNPPVRGRDLLIEGLLNAGYFESAKEMVFKMSKE 273 Query: 921 GFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEA 742 GF+PD+ TFN L+EAIC EVDFC+D+ + + KLG PDI++YKI+IP VSK+G+IDEA Sbjct: 274 GFVPDVNTFNCLIEAICNSGEVDFCVDMYYSLRKLGFCPDINSYKILIPAVSKVGKIDEA 333 Query: 741 FRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKM 562 F+LL+ SIEDGHKPFP LYAPI+K +CR G+FDDAF FF ++KV+GHPPNRPVYTMLI M Sbjct: 334 FKLLNNSIEDGHKPFPGLYAPIIKGMCRRGQFDDAFCFFGEMKVKGHPPNRPVYTMLITM 393 Query: 561 CGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 CGRGG+++EAANYLVEMTE+ L+P ++ FDMVTDGLKNCGKHDLAKR+E+LE+S+ V Sbjct: 394 CGRGGKYVEAANYLVEMTEMGLTPISRCFDMVTDGLKNCGKHDLAKRIEQLEVSVCSV 451 
>ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutrema salsugineum] gi|557101474|gb|ESQ41837.1| hypothetical protein EUTSA_v10013477mg [Eutrema salsugineum] Length = 461 Score = 579 bits (1492), Expect = e-162 Identities = 279/426 (65%), Positives = 344/426 (80%), Gaps = 1/426 (0%) Frame = -1 Query: 1665 EEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGI 1486 E + +S + DYFAAI+H+ NIVRR+ + ER+L ++++ VTSE V+RVLR+ +S Sbjct: 35 EPLQSSDSTPTKGDYFAAINHVVNIVRREVHPERSLNRLRLPVTSEFVFRVLRATSRSAN 94 Query: 1485 ESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMI 1306 +S RFFNWAR+ P Y PT++E+E+L K+L + +E+MWKV +QMK ++ ET I Sbjct: 95 DSLRFFNWARSSPNYTPTSIEYEQLAKSLASHKKYESMWKVLKQMKDLSLDISGETLCFI 154 Query: 1305 IESYGQNGFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKG 1129 IE YG+NG VD+AVE+FN + K C QT VYNSLL ALCEVK F GAYALIRRMIRKG Sbjct: 155 IEQYGKNGHVDQAVELFNGVPKTLGCQQTVEVYNSLLHALCEVKMFHGAYALIRRMIRKG 214 Query: 1128 GVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAK 949 PDKRT+S+LVNGWCSAGK++EAQEFL+EMSRKGFNPP RGRDLLI+GLLNAGY ESAK Sbjct: 215 LKPDKRTYSVLVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAK 274 Query: 948 GLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNV 769 +V+KMTK GF+PDI TFN+L+EAI K EVDFCI++ + CKLGL DI TYK +IP V Sbjct: 275 EMVKKMTKGGFVPDIHTFNTLIEAISKSGEVDFCIEMYYTACKLGLCVDIDTYKTLIPAV 334 Query: 768 SKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNR 589 SK+G+IDEAFRLL+ +EDGHKPFPSLYAPI+K +CR G FDDAFSFFSD+KV+ HPPNR Sbjct: 335 SKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNR 394 Query: 588 PVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEEL 409 PVYTMLI MCGRGG+F++AANYLVEMTE+ L P ++ FD+VTDGLKN GKHDLA R+E+L Sbjct: 395 PVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDIVTDGLKNSGKHDLAMRIEQL 454 Query: 408 EISLRG 391 E+ LRG Sbjct: 455 EVQLRG 460 >ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X1 [Cicer arietinum] gi|502142093|ref|XP_004504789.1| 
PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X2 [Cicer arietinum] Length = 453 Score = 579 bits (1492), Expect = e-162 Identities = 271/440 (61%), Positives = 353/440 (80%), Gaps = 6/440 (1%) Frame = -1 Query: 1689 KPYKSNNHEEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVL 1510 KP +H + +T+ S D+YFAAI H++NIVRRDFYLERTL K++I +T ELV+RVL Sbjct: 15 KPKPLLHHHKTLTT-ATTSKDEYFAAIQHVANIVRRDFYLERTLNKLRITITPELVFRVL 73 Query: 1509 RSCGKSGIESFRFFNWARNQ---PRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMK-KQ 1342 R+C S ES RFFNWAR+ P Y PT++EFE+++ L N++TMW + QM Sbjct: 74 RACSSSPTESLRFFNWARSHHHHPPYTPTSVEFEQIVTILANANNYQTMWSIIHQMTHNH 133 Query: 1341 DFALTPETFSMIIESYGQNGFVDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGA 1162 + +L+P S +IESYG++ +D++V++FN+ K FNCPQ ++YNSLLFALCE K F A Sbjct: 134 NLSLSPSAVSSLIESYGRHRHIDQSVQLFNKCKVFNCPQNLNLYNSLLFALCESKLFHAA 193 Query: 1161 YALIRRMIRKGGVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDG 982 YALIRRMIRKG PDKRT+++LVN WCS GK+REAQ+FL+EMS KGF PPVRGRDLLI+G Sbjct: 194 YALIRRMIRKGINPDKRTYALLVNAWCSTGKMREAQQFLKEMSDKGFTPPVRGRDLLIEG 253 Query: 981 LLNAGYFESAKGLVRKMTKEGFLPDIATFNSLLEAICKL--EEVDFCIDLLHDVCKLGLT 808 LLNAGY ESAKG+VRKM KEG +PD+ TFN+L+E+ICK +E+ FCIDL H++C LG+ Sbjct: 254 LLNAGYIESAKGMVRKMVKEGIIPDVGTFNALMESICKCGDDEIKFCIDLYHELCSLGMV 313 Query: 807 PDISTYKIMIPNVSKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSF 628 PD++TYKI++P VSK+G +DEAF+LL+ E+G++PFPSLYAP++K L + G+FDDAF F Sbjct: 314 PDVNTYKILVPAVSKIGLMDEAFKLLNNFTEEGNRPFPSLYAPVMKGLFKRGQFDDAFCF 373 Query: 627 FSDIKVQGHPPNRPVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKN 448 F+D+KV+GHPPNRP+YTMLI MCGRGGRF++AANYL EMTE+ P ++ FDMVTDGLKN Sbjct: 374 FADMKVKGHPPNRPLYTMLITMCGRGGRFVDAANYLFEMTEIGFVPISRCFDMVTDGLKN 433 Query: 447 CGKHDLAKRMEELEISLRGV 388 CGKHDLAKR+++LE+S+RGV Sbjct: 434 CGKHDLAKRVQQLEVSIRGV 453 >ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, partial [Capsella rubella] gi|482556356|gb|EOA20548.1| hypothetical protein CARUB_v10000860mg, partial [Capsella rubella] Length = 
477 Score = 578 bits (1489), Expect = e-162 Identities = 279/430 (64%), Positives = 345/430 (80%), Gaps = 1/430 (0%) Frame = -1 Query: 1674 NNHEEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGK 1495 N+ E + +S + DYFAAI+H+ NIVRR+ + ER+L +++ VTSE V+RVLR+ + Sbjct: 48 NSLEPLQSSDSTSTKGDYFAAINHVVNIVRREIHPERSLNSLRLPVTSEFVFRVLRATSR 107 Query: 1494 SGIESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETF 1315 S +S RFFNWAR+ P Y PT++E+E+L K+L + +E+MWK+ +QMK ++ ET Sbjct: 108 SANDSLRFFNWARSNPSYTPTSMEYEQLAKSLASHKKYESMWKILKQMKDLSLDISGETL 167 Query: 1314 SMIIESYGQNGFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMI 1138 IIE YG+NG VD+AVE+FN + K C QT VYNSLL ALC+VK F GAYALIRRMI Sbjct: 168 CFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMI 227 Query: 1137 RKGGVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFE 958 RKG PDKRT++ILVNGWCSAGK++EAQEFL+EMSRKGFNPP RGRDLLI+GLLNAGY E Sbjct: 228 RKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLE 287 Query: 957 SAKGLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMI 778 SAK +V KMTK GF+PDI TFN+L+EAI K EV+FCI++ + CKLGL DI TYK +I Sbjct: 288 SAKEMVSKMTKGGFVPDIQTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLI 347 Query: 777 PNVSKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHP 598 P VSK+G+IDEAFRLL+ +EDGHKPFPSLYAPI+K +CR G FDDAFSFFSD+KV+ HP Sbjct: 348 PAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIVKGMCRNGMFDDAFSFFSDMKVKAHP 407 Query: 597 PNRPVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRM 418 PNRPVYTMLI MCGRGG+F++AANYLVEMTE+ L P ++ FDMVTDGLKN GKHDLA R+ Sbjct: 408 PNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNSGKHDLAMRI 467 Query: 417 EELEISLRGV 388 E+LE+ LRGV Sbjct: 468 EQLEVQLRGV 477 >ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like [Fragaria vesca subsp. 
vesca] Length = 444 Score = 576 bits (1485), Expect = e-161 Identities = 275/418 (65%), Positives = 342/418 (81%), Gaps = 3/418 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 + DDYF+AIHHI+NIVRRD ++ERTL K++I + S+LV+RVLR+ S ES RFFNWAR Sbjct: 26 TKDDYFSAIHHITNIVRRDHFMERTLNKLRIPIDSDLVFRVLRASSSSPTESLRFFNWAR 85 Query: 1455 -NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQD-FALTPETFSMIIESYGQNG 1282 + P Y PT+LE EEL+KTL R++ +E+MWK+ + MK L+ T II+ YG++ Sbjct: 86 THHPSYHPTSLETEELVKTLARSKKYESMWKILDSMKTHHALTLSESTLCFIIQEYGKHA 145 Query: 1281 FVDRAVEVFNRMKN-FNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTF 1105 +D+AVE+FNR N FNC Q+ VYN+LLF+LCE K F GAYAL+RR+IRKG VP+K T+ Sbjct: 146 LIDQAVELFNRAPNTFNCLQSVQVYNALLFSLCETKLFHGAYALVRRLIRKGMVPNKMTY 205 Query: 1104 SILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTK 925 SILVN WCS GK++EAQ FLEEMS KGFNPPVRGRDLL++GLLNAGY E AK +VRKM K Sbjct: 206 SILVNAWCSNGKMKEAQLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYIEGAKDMVRKMVK 265 Query: 924 EGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDE 745 E +P+++TFN+LLEAICK EV+FCI L + LGL PDI+TYK+MIP VSK+GR+D+ Sbjct: 266 ENCVPEVSTFNALLEAICKSGEVEFCIALYWEATGLGLCPDINTYKVMIPAVSKIGRMDD 325 Query: 744 AFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIK 565 AFRLLH SIEDGH+PFPSLYAPI+K +CR G+FDDAF FFS++KV+GHPPNRPVYTMLI Sbjct: 326 AFRLLHNSIEDGHRPFPSLYAPIVKGMCRKGQFDDAFCFFSEMKVKGHPPNRPVYTMLIT 385 Query: 564 MCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRG 391 M GRGGRF+EAANYL+EMTE+ L P ++ FD VTDGLKNCGKHDLAKR+E++E+SLRG Sbjct: 386 MAGRGGRFVEAANYLIEMTEVGLVPISRCFDFVTDGLKNCGKHDLAKRIEQIEVSLRG 443 >ref|NP_197340.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635760|sp|Q94JX6.2|PP391_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g18390, mitochondrial; Flags: Precursor gi|332005166|gb|AED92549.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 459 Score = 576 bits (1484), Expect = e-161 Identities = 
279/430 (64%), Positives = 344/430 (80%), Gaps = 1/430 (0%) Frame = -1 Query: 1674 NNHEEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGK 1495 N+ E + +S + DYFAAI+H+ NIVRR+ + ER+L +++ VTSE V+RVLR+ + Sbjct: 30 NSLEPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLPVTSEFVFRVLRATSR 89 Query: 1494 SGIESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETF 1315 S +S RFFNWAR+ P Y PT++E+EEL K+L + +E+MWK+ +QMK ++ ET Sbjct: 90 SSNDSLRFFNWARSNPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETL 149 Query: 1314 SMIIESYGQNGFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMI 1138 IIE YG+NG VD+AVE+FN + K C QT VYNSLL ALC+VK F GAYALIRRMI Sbjct: 150 CFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMI 209 Query: 1137 RKGGVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFE 958 RKG PDKRT++ILVNGWCSAGK++EAQEFL+EMSR+GFNPP RGRDLLI+GLLNAGY E Sbjct: 210 RKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLNAGYLE 269 Query: 957 SAKGLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMI 778 SAK +V KMTK GF+PDI TFN L+EAI K EV+FCI++ + CKLGL DI TYK +I Sbjct: 270 SAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLI 329 Query: 777 PNVSKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHP 598 P VSK+G+IDEAFRLL+ +EDGHKPFPSLYAPI+K +CR G FDDAFSFFSD+KV+ HP Sbjct: 330 PAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHP 389 Query: 597 PNRPVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRM 418 PNRPVYTMLI MCGRGG+F++AANYLVEMTE+ L P ++ FDMVTDGLKN GKHDLA R+ Sbjct: 390 PNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKHDLAMRI 449 Query: 417 EELEISLRGV 388 E+LE+ LRGV Sbjct: 450 EQLEVQLRGV 459 >ref|XP_002871814.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297317651|gb|EFH48073.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 459 Score = 576 bits (1484), Expect = e-161 Identities = 278/427 (65%), Positives = 343/427 (80%), Gaps = 1/427 (0%) Frame = -1 Query: 1665 EEIITSRGGISNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGI 1486 E + +S + DYFAAI+H+ NIVRR+ + ER+L +++ VTSE V+RVLR+ +S Sbjct: 33 EPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLPVTSEFVFRVLRATSRSAN 92 Query: 1485 ESFRFFNWARNQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQDFALTPETFSMI 1306 +S RFFNWAR+ P Y PT++E+EEL K+L + +E+MWK+ +QMK ++ ET I Sbjct: 93 DSLRFFNWARSNPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFI 152 Query: 1305 IESYGQNGFVDRAVEVFNRM-KNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKG 1129 IE YG+NG VD+AVE+FN + K C QT VYN+LL ALC+VK F GAYALIRRMIRKG Sbjct: 153 IEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNALLHALCDVKMFHGAYALIRRMIRKG 212 Query: 1128 GVPDKRTFSILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAK 949 PDKRT++ILVNGWCSAGK++EAQEFL+EMSRKGFNPP RGRDLLI+GLLNAGY ESAK Sbjct: 213 LKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAK 272 Query: 948 GLVRKMTKEGFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNV 769 +V KMTK GF+PDI TFN+L+EAI K EV+FCI++ + CKLGL DI TYK +IP V Sbjct: 273 EIVDKMTKGGFVPDILTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAV 332 Query: 768 SKLGRIDEAFRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNR 589 SK+G+IDEAFRLL+ +EDGHKPFPSLYAPI+K +CR G FDDAFSFFSD+KV+ HPPNR Sbjct: 333 SKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNR 392 Query: 588 PVYTMLIKMCGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEEL 409 PVYTMLI MCGRGG+F++AANYLVEMTE+ L P ++ FDMVTDGLKN GKHDLA R+E+L Sbjct: 393 PVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNSGKHDLAMRIEQL 452 Query: 408 EISLRGV 388 E+ LRGV Sbjct: 453 EVQLRGV 459 >ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X1 [Glycine max] gi|571455122|ref|XP_006579993.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X2 [Glycine max] gi|571455124|ref|XP_006579994.1| PREDICTED: 
pentatricopeptide repeat-containing protein At5g18390, mitochondrial-like isoform X3 [Glycine max] Length = 450 Score = 563 bits (1452), Expect = e-158 Identities = 263/418 (62%), Positives = 334/418 (79%), Gaps = 2/418 (0%) Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 S D+YFA IHH+SNIVRRDFYLERTL K++I VT ELV+RVLR+C + ES RFFNWAR Sbjct: 34 SRDEYFAVIHHVSNIVRRDFYLERTLNKLRITVTPELVFRVLRACSNNPTESLRFFNWAR 93 Query: 1455 NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQD-FALTPETFSMIIESYGQNGF 1279 P Y PT+LEFE+++ TL R +++MW + Q+ +L+P + +IE+YG N Sbjct: 94 THPSYSPTSLEFEQIVTTLARANTYQSMWALIRQVTLHHRLSLSPSAVASVIEAYGDNRH 153 Query: 1278 VDRAVEVFNRMKNF-NCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFS 1102 VD++V+VFN+ NCPQT +YN+LL +LC K F GAYAL+RRM+RKG PDK T++ Sbjct: 154 VDQSVQVFNKSPLLLNCPQTLPLYNALLRSLCHNKLFHGAYALVRRMLRKGLRPDKTTYA 213 Query: 1101 ILVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKE 922 +LVN WCS GKLREA+ FLEEMS KGFNPPVRGRDLL++GLLNAGY ESAKG+VR M K+ Sbjct: 214 VLVNAWCSNGKLREAKLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYVESAKGMVRNMIKQ 273 Query: 921 GFLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEA 742 G +PD+ TFN+++E + K E+V FC+ L H+VC LG+ PD++TYKI++P VSK G +DEA Sbjct: 274 GSVPDVGTFNAVVETVSK-EDVQFCVGLYHEVCALGMAPDVNTYKILVPAVSKSGMVDEA 332 Query: 741 FRLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKM 562 FRLL+ IEDGHKPFPSLYAP++KALCR G+FDDAF FF D+K + HPPNRP+YTMLI M Sbjct: 333 FRLLNNFIEDGHKPFPSLYAPVIKALCRRGQFDDAFCFFGDMKAKAHPPNRPLYTMLITM 392 Query: 561 CGRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 CGR G+F+EAANY+ EMTE+ L P ++ FDMVTDGLKNCGKHDLA+R++ELE+S+RGV Sbjct: 393 CGRAGKFVEAANYIFEMTEMGLVPISRCFDMVTDGLKNCGKHDLARRVQELEVSIRGV 450 >ref|XP_007159095.1| hypothetical protein PHAVU_002G208300g [Phaseolus vulgaris] gi|561032510|gb|ESW31089.1| hypothetical protein PHAVU_002G208300g [Phaseolus vulgaris] Length = 448 Score = 562 bits (1449), Expect = e-157 Identities = 262/417 (62%), Positives = 335/417 (80%), Gaps = 1/417 (0%) 
Frame = -1 Query: 1635 SNDDYFAAIHHISNIVRRDFYLERTLQKMQINVTSELVYRVLRSCGKSGIESFRFFNWAR 1456 + D YFA IHHISNIVRRDFYLERTL K++I+VT ELV+RVLR+C + S RFFNWAR Sbjct: 33 ARDQYFAVIHHISNIVRRDFYLERTLNKLRIHVTPELVFRVLRACSTAPTPSLRFFNWAR 92 Query: 1455 NQPRYEPTTLEFEELIKTLGRTRNWETMWKVAEQMKKQD-FALTPETFSMIIESYGQNGF 1279 + P Y PT+LEFE+++ TL R N++TMW + Q+ +L+P + +I++YG + Sbjct: 93 SHPSYTPTSLEFEQIVTTLARANNYQTMWSLIRQVTLHHRLSLSPAAVATLIDAYGHHRH 152 Query: 1278 VDRAVEVFNRMKNFNCPQTTHVYNSLLFALCEVKNFQGAYALIRRMIRKGGVPDKRTFSI 1099 +D+AVEVFN+ NCPQT +YN+LL +LC + F GAYAL+RRM+RKG PDK T+++ Sbjct: 153 IDQAVEVFNKAPILNCPQTLPLYNALLKSLCHNRLFHGAYALLRRMLRKGLHPDKATYAV 212 Query: 1098 LVNGWCSAGKLREAQEFLEEMSRKGFNPPVRGRDLLIDGLLNAGYFESAKGLVRKMTKEG 919 LVN WCS+GKLREA+ FL EMS KGFNPP+RGRDLL++GLLNAGY ESAKG+VRKM KEG Sbjct: 213 LVNAWCSSGKLREAKLFLREMSEKGFNPPLRGRDLLVEGLLNAGYVESAKGMVRKMIKEG 272 Query: 918 FLPDIATFNSLLEAICKLEEVDFCIDLLHDVCKLGLTPDISTYKIMIPNVSKLGRIDEAF 739 +PD+ TFN+++E +CK EEV FC+DL H+VC LG+ PD++TYKI+IP VSK IDEAF Sbjct: 273 IVPDVETFNAVVETVCK-EEVQFCVDLYHEVCALGMVPDVNTYKILIPAVSKSDFIDEAF 331 Query: 738 RLLHCSIEDGHKPFPSLYAPILKALCRMGRFDDAFSFFSDIKVQGHPPNRPVYTMLIKMC 559 RLL+ +EDG++PFPSLYAP++KALCR G+FDDAF FF D+K + HPPNRP+YTMLI MC Sbjct: 332 RLLNNFVEDGNRPFPSLYAPVIKALCRRGQFDDAFCFFGDMKAKAHPPNRPLYTMLITMC 391 Query: 558 GRGGRFIEAANYLVEMTELNLSPRAQSFDMVTDGLKNCGKHDLAKRMEELEISLRGV 388 GR G+F+EAANYL EMTE+ L P ++ FDMVTDGLKN GKHDLA R+++LE+S+RGV Sbjct: 392 GRAGKFVEAANYLFEMTEMGLVPISRCFDMVTDGLKNSGKHDLASRVQQLEVSIRGV 448