BLASTX nr result
ID: Catharanthus23_contig00007587
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00007587
         (2519 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                          Score    E
Sequences producing significant alignments:                               (bits) Value

ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containi...        723   0.0
ref|XP_004237977.1| PREDICTED: pentatricopeptide repeat-containi...        704   0.0
ref|XP_006338085.1| PREDICTED: pentatricopeptide repeat-containi...        701   0.0
gb|EXC16677.1| hypothetical protein L484_007723 [Morus notabilis]          686   0.0
gb|EOY33044.1| Pentatricopeptide repeat superfamily protein isof...        679   0.0
ref|XP_006472504.1| PREDICTED: pentatricopeptide repeat-containi...        671   0.0
ref|XP_006433856.1| hypothetical protein CICLE_v10000867mg [Citr...        670   0.0
ref|XP_002532248.1| pentatricopeptide repeat-containing protein,...        668   0.0
ref|XP_004301354.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...        667   0.0
emb|CAN80524.1| hypothetical protein VITISV_030537 [Vitis vinifera]        657   0.0
ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containi...        653   0.0
ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containi...        642   0.0
ref|XP_003602939.1| Pentatricopeptide repeat-containing protein ...        637   e-180
ref|XP_002301082.1| hypothetical protein POPTR_0002s10380g [Popu...        637   e-180
ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containi...        629   e-177
gb|ESW09636.1| hypothetical protein PHAVU_009G143500g, partial [...        619   e-174
ref|XP_002873896.1| pentatricopeptide repeat-containing protein ...        548   e-153
ref|NP_974803.1| pentatricopeptide repeat-containing protein [Ar...        539   e-150
ref|XP_006287559.1| hypothetical protein CARUB_v10000770mg [Caps...
538 e-150 ref|XP_006403586.1| hypothetical protein EUTSA_v10010303mg [Eutr... 527 e-146 >ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Vitis vinifera] Length = 513 Score = 723 bits (1867), Expect = 0.0 Identities = 350/512 (68%), Positives = 425/512 (83%), Gaps = 3/512 (0%) Frame = +2 Query: 302 MKAFLRIRCFXXXXXXXXXXVKWISPLQYVKTQNL--DSPAKTLDTISNVPRKR-RYMSH 472 M F RCF + WISPLQY+ + D PA T PRK+ +++SH Sbjct: 1 MNPFHEYRCFSCSPSAPSSSLPWISPLQYLNATSPKPDPPATEATTTMVEPRKKPKFISH 60 Query: 473 EHAINLINREKDPEHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMS 652 E AINLI RE DP+ ALEIFN+ ++Q+GF+HNN+TY ILHKLAK KKF ID +LHQM+ Sbjct: 61 ESAINLIKRETDPQRALEIFNRVAEQRGFSHNNATYATILHKLAKSKKFQAIDAVLHQMT 120 Query: 653 YETCMFHEGIFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQI 832 YETC FHEGIF+NLMKHFSK +H+RV++MF AI+PIVREKPSLKAISTCLNLLVE+NQ+ Sbjct: 121 YETCKFHEGIFLNLMKHFSKLSLHERVVEMFDAIRPIVREKPSLKAISTCLNLLVESNQV 180 Query: 833 DLARTFLLNAQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYS 1012 DL R FLLN++K+L+L+PNTCIFNILVK+HC+ GD++SA E+V MK S VSYPNLITYS Sbjct: 181 DLTRKFLLNSKKSLNLEPNTCIFNILVKHHCKNGDIDSAFEVVEEMKKSHVSYPNLITYS 240 Query: 1013 TLMDGFCRCGRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKK 1192 TL++G C GRL+EAIE+FEEMVSKDQILPDALTYN LINGFC KVDRA KIMEFMKK Sbjct: 241 TLINGLCGSGRLKEAIELFEEMVSKDQILPDALTYNALINGFCHGEKVDRALKIMEFMKK 300 Query: 1193 NGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDE 1372 NGCNPNV NYSALMNG CK+GRLE+AKE+F+EMK+ G++PD VGYTTLI++ CR+ RVDE Sbjct: 301 NGCNPNVFNYSALMNGFCKEGRLEEAKEVFDEMKSLGLKPDTVGYTTLINFFCRAGRVDE 360 Query: 1373 AIELLKEMKETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLN 1552 A+ELLK+M+E +CRAD VTFNVILGGLCR RF+EA MLERLP++G+ LNKASYRIVLN Sbjct: 361 AMELLKDMRENKCRADTVTFNVILGGLCREGRFEEARGMLERLPYEGVYLNKASYRIVLN 420 Query: 1553 SLCKEGDLNKATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTP 1732 SLC+EG+L KAT+L+GLML R VLPHFA+SNELLV LCEAGKV +A + L GL+ELGF P Sbjct: 421 
SLCREGELQKATQLVGLMLGRGVLPHFATSNELLVHLCEAGKVGDAVMALLGLLELGFKP 480 Query: 1733 APDTWSLLVDVFCRERKLLPSFQLLDELIMQD 1828 P++W+LLV++ CRERKLLP+F+LLD+L++Q+ Sbjct: 481 EPNSWALLVELICRERKLLPAFELLDDLVIQE 512 >ref|XP_004237977.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Solanum lycopersicum] Length = 511 Score = 704 bits (1817), Expect = 0.0 Identities = 334/490 (68%), Positives = 409/490 (83%), Gaps = 2/490 (0%) Frame = +2 Query: 362 VKWISPLQYVKTQNL--DSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFN 535 V+WISPL Y +L +P + T VPRKR+Y+SHE A+NLI +EKD ALEIFN Sbjct: 22 VQWISPLDYQGRNSLRPGAPIERDGTSEQVPRKRKYISHESAVNLIKQEKDARRALEIFN 81 Query: 536 KASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKS 715 K SDQKGFNHNNSTY V+LH+LA KKF ++ I+HQM YETC FHEG+F NLMKH+S+S Sbjct: 82 KVSDQKGFNHNNSTYAVLLHRLAVCKKFETVEAIIHQMKYETCKFHEGVFTNLMKHYSRS 141 Query: 716 HMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTC 895 +H++VL+MF AI PIVREKPSL AISTCLNLLVEA QI+LA+ FLLN QK+L+LKPNTC Sbjct: 142 SLHEKVLEMFDAILPIVREKPSLNAISTCLNLLVEAKQIELAKEFLLNVQKHLYLKPNTC 201 Query: 896 IFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEE 1075 IFNILVKYHC+KGD+++A +V M+ S VS+PNLITYSTLMDG CRCGRL++A+++FE+ Sbjct: 202 IFNILVKYHCKKGDVDAAFVVVEEMRKSRVSHPNLITYSTLMDGLCRCGRLQDALDLFEK 261 Query: 1076 MVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQG 1255 M++KDQI PDALTYN+LIN FCR GKVDRA+ I+ FM+KNGC PN+VNY+ALMNG CK+G Sbjct: 262 MLAKDQIPPDALTYNILINAFCRAGKVDRARNIIGFMRKNGCQPNIVNYTALMNGFCKEG 321 Query: 1256 RLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFN 1435 R+EDAKE+F+EMK G++PD VGYTTLI+ CR+ +VDE IELL EMK+ C+AD VT Sbjct: 322 RVEDAKEVFHEMKGVGLKPDVVGYTTLINSFCRAGKVDEGIELLDEMKDKGCKADDVTIK 381 Query: 1436 VILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLAR 1615 +ILGGLCR+ R EA NMLERLP+DG+ L+K SYRIVLN LCKEG+L KA +LLGLMLAR Sbjct: 382 IILGGLCRASRSSEAFNMLERLPYDGVHLSKESYRIVLNFLCKEGELVKAMDLLGLMLAR 441 Query: 1616 RVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPS 1795 R 
+PHFA+SNEL+V LCEAGK A+AA+ LFGL+E+GF P P TWS+L+DV CRERKLLP+ Sbjct: 442 RFVPHFATSNELIVQLCEAGKAADAALALFGLLEMGFKPEPQTWSMLIDVICRERKLLPA 501 Query: 1796 FQLLDELIMQ 1825 FQLLDEL++Q Sbjct: 502 FQLLDELVLQ 511 >ref|XP_006338085.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Solanum tuberosum] Length = 511 Score = 701 bits (1809), Expect = 0.0 Identities = 332/489 (67%), Positives = 408/489 (83%), Gaps = 2/489 (0%) Frame = +2 Query: 362 VKWISPLQYVKTQNL--DSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFN 535 V+WISPL Y +L D+P K T +PRKR+Y+SHE A+NLI +E+D ALEIFN Sbjct: 22 VQWISPLHYQGRNSLRPDAPIKRDGTSEQLPRKRKYISHESAVNLIKQERDARRALEIFN 81 Query: 536 KASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKS 715 K SDQKGFNHNNSTY V+LH+LA KKF +D I+HQM YETC FHEG+F NLMKH+SKS Sbjct: 82 KVSDQKGFNHNNSTYAVLLHRLAVCKKFETVDAIIHQMKYETCKFHEGVFTNLMKHYSKS 141 Query: 716 HMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTC 895 +H++VL+MF+AI PIVREKPSL AISTCLNLL+EA QI+LA+ FLLN QK+L LKPNTC Sbjct: 142 SLHEKVLEMFNAILPIVREKPSLNAISTCLNLLIEAKQIELAKEFLLNVQKHLDLKPNTC 201 Query: 896 IFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEE 1075 IFNILVKYHCRKGD+E+A +V M+ S VS+PNLITYSTLMDG CRCGRL++A+++FE+ Sbjct: 202 IFNILVKYHCRKGDVEAAFVVVEEMRKSRVSHPNLITYSTLMDGLCRCGRLQDALDLFEK 261 Query: 1076 MVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQG 1255 M++KDQI PDALTYN+LIN FCR GKVDRA+ I+ FM+KNGC PN+VNY+ALMNG CK+G Sbjct: 262 MLAKDQIPPDALTYNILINAFCRAGKVDRARNIIGFMRKNGCQPNIVNYTALMNGFCKEG 321 Query: 1256 RLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFN 1435 R+ DAKE+F+EMK G++PD VGYTTLI+ CR+ +VD+ IELL+EMK+ C+AD VT Sbjct: 322 RVGDAKEVFHEMKGVGLKPDVVGYTTLINSFCRAGKVDKGIELLEEMKDKGCKADDVTIK 381 Query: 1436 VILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLAR 1615 +ILGGLCR+ R EA +MLERLP+DG+ L+K SYRIVLN LCKEG+L KA +LLGLMLAR Sbjct: 382 IILGGLCRASRSSEAFDMLERLPYDGVHLSKESYRIVLNFLCKEGELEKAMDLLGLMLAR 441 Query: 1616 RVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPS 1795 R 
+PHFA+SNEL+V LCEAGK A+AA+ LFGL+E+ F P P TWS+L+DV CRERKLLP+ Sbjct: 442 RFVPHFATSNELIVQLCEAGKAADAALALFGLLEMSFKPEPRTWSMLIDVICRERKLLPA 501 Query: 1796 FQLLDELIM 1822 FQLLDEL++ Sbjct: 502 FQLLDELVL 510 >gb|EXC16677.1| hypothetical protein L484_007723 [Morus notabilis] Length = 513 Score = 686 bits (1769), Expect = 0.0 Identities = 337/488 (69%), Positives = 402/488 (82%), Gaps = 2/488 (0%) Frame = +2 Query: 362 VKWISPLQYVKTQNL--DSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFN 535 ++WISP+Q K + D P +++ + RK +Y+SH+ AINLI RE+DP+ ALEIFN Sbjct: 25 IRWISPVQLSKASSKKPDPPTESIASSLEGRRKAKYISHDTAINLIKRERDPQRALEIFN 84 Query: 536 KASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKS 715 S+QKGFNHN TY ILHKLA KKFG ID IL QM YETC FHE IF+NLMKHFSK Sbjct: 85 SVSEQKGFNHNGDTYSTILHKLALSKKFGAIDAILRQMMYETCKFHEPIFLNLMKHFSKY 144 Query: 716 HMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTC 895 +H++VL+MFHAI+ I REKPSLKAISTCLNLLVEAN+IDLAR FL++++KNL LKPNTC Sbjct: 145 ALHEKVLEMFHAIRSIAREKPSLKAISTCLNLLVEANRIDLARQFLMHSRKNLSLKPNTC 204 Query: 896 IFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEE 1075 IFNILVK+HCR GDLESA E+V+ MK +++SYPNLITYSTL+DG C GRL+ AIE+FEE Sbjct: 205 IFNILVKHHCRNGDLESAFEVVKEMKKAKISYPNLITYSTLIDGLCVSGRLKGAIELFEE 264 Query: 1076 MVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQG 1255 M+SKDQILPDALT+N+LINGFCRDGKVDRA+KIMEFMK NGC+PNV NYSAL+NG K G Sbjct: 265 MISKDQILPDALTFNVLINGFCRDGKVDRARKIMEFMKSNGCSPNVFNYSALINGFFKVG 324 Query: 1256 RLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFN 1435 R E+A+EIF EMK+ G +PDKVGYTT+I+ CR+ R DEA+ELLKEMK ECRADVVTFN Sbjct: 325 RFEEAEEIFYEMKSFGPKPDKVGYTTIINCFCRTGRTDEAMELLKEMKGGECRADVVTFN 384 Query: 1436 VILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLAR 1615 VI GGLCR R +EAL MLERLP++G+ LNKASYRIVLN LC++G+L KAT LL LML R Sbjct: 385 VIFGGLCREGRLEEALRMLERLPYEGMHLNKASYRIVLNFLCQKGELKKATSLLDLMLGR 444 Query: 1616 RVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPS 1795 +PHFA+SNELLV LC AG +AA+ LFGL+E+GF P PD+W++LVD+ 
RERKLL S Sbjct: 445 GFVPHFATSNELLVRLCNAGMADDAAMALFGLLEMGFKPEPDSWAILVDLISRERKLLSS 504 Query: 1796 FQLLDELI 1819 FQLLDELI Sbjct: 505 FQLLDELI 512 Score = 134 bits (338), Expect = 1e-28 Identities = 98/386 (25%), Positives = 186/386 (48%), Gaps = 2/386 (0%) Frame = +2 Query: 686 INLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQ 865 INL+K + QR L++F+++ + ST L+ L + + A +L Sbjct: 67 INLIK---RERDPQRALEIFNSVSEQKGFNHNGDTYSTILHKLALSKKFG-AIDAILRQM 122 Query: 866 KNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGR 1045 K + IF L+K+ + E +E+ ++S P+L ST ++ R Sbjct: 123 MYETCKFHEPIFLNLMKHFSKYALHEKVLEMFHAIRSIAREKPSLKAISTCLNLLVEANR 182 Query: 1046 LEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCN-PNVVNY 1222 ++ A + + P+ +N+L+ CR+G ++ A ++++ MKK + PN++ Y Sbjct: 183 IDLARQFLMHSRKNLSLKPNTCIFNILVKHHCRNGDLESAFEVVKEMKKAKISYPNLITY 242 Query: 1223 SALMNGLCKQGRLEDAKEIFNEMKAAG-MQPDKVGYTTLIDYLCRSSRVDEAIELLKEMK 1399 S L++GLC GRL+ A E+F EM + + PD + + LI+ CR +VD A ++++ MK Sbjct: 243 STLIDGLCVSGRLKGAIELFEEMISKDQILPDALTFNVLINGFCRDGKVDRARKIMEFMK 302 Query: 1400 ETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLN 1579 C +V ++ ++ G + RF+EA + + G +K Y ++N C+ G + Sbjct: 303 SNGCSPNVFNYSALINGFFKVGRFEEAEEIFYEMKSFGPKPDKVGYTTIINCFCRTGRTD 362 Query: 1580 KATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLV 1759 +A ELL M + N + LC G++ A ML L G ++ +++ Sbjct: 363 EAMELLKEMKGGECRADVVTFNVIFGGLCREGRLEEALRMLERLPYEGMHLNKASYRIVL 422 Query: 1760 DVFCRERKLLPSFQLLDELIMQDW*P 1837 + C++ +L + LLD ++ + + P Sbjct: 423 NFLCQKGELKKATSLLDLMLGRGFVP 448 >gb|EOY33044.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] gi|508785789|gb|EOY33045.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 530 Score = 679 bits (1753), Expect = 0.0 Identities = 323/489 (66%), Positives = 406/489 (83%), Gaps = 2/489 (0%) Frame = +2 Query: 368 WISPLQYVK--TQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 541 WISPLQ++K +Q D P + T++ RK R++SHE AINLI RE+DP+ 
ALEIFN+ Sbjct: 26 WISPLQFLKANSQKRDPPPEIPYTLTESQRKPRFVSHETAINLIKRERDPQRALEIFNRV 85 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 S+QKGF+HNN+TYG ILHKL + KKF ID IL QM+YETC FHEG+F+NLMKHFSK + Sbjct: 86 SEQKGFSHNNATYGTILHKLVQSKKFQAIDSILRQMTYETCKFHEGVFLNLMKHFSKFSL 145 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H RVL+MF+AIQPIVREKPSLKAISTCLNLL+E+NQ+DLAR FLLN++K+L L+PNTCIF Sbjct: 146 HDRVLEMFYAIQPIVREKPSLKAISTCLNLLIESNQVDLARHFLLNSKKSLRLRPNTCIF 205 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVK+HC+ GDLESA E+V+ MK S VSYPNLITYSTLM G C GRL+EAIE+FEEMV Sbjct: 206 NILVKHHCKNGDLESAFEVVKEMKKSRVSYPNLITYSTLMGGLCESGRLKEAIELFEEMV 265 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 +KDQILPD LTYN+LINGFC GKVDRA+KIMEFMK NGCNPN+ NYS L+NG CK+GR Sbjct: 266 AKDQILPDVLTYNILINGFCCRGKVDRARKIMEFMKNNGCNPNLFNYSTLINGFCKEGRW 325 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 ++AKE+F EM++ G++PD +GYTTLI+ LCR+++++EA+ELLKEMKE EC+ADVVT NV+ Sbjct: 326 QEAKEVFVEMESIGLKPDTIGYTTLINCLCRAAQIEEAMELLKEMKEKECQADVVTLNVL 385 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 LGGLCR RF +AL MLE+LP++G+ LNKASYRIVLNSLC++ ++ KA +L+GLML R Sbjct: 386 LGGLCREGRFQDALQMLEKLPYEGVYLNKASYRIVLNSLCQKDEMEKAAKLVGLMLDRGF 445 Query: 1622 LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 +PH+A+SN+LL+ LC+AG V +A L GL E GF P P W L ++ C+ERKLL F+ Sbjct: 446 VPHYATSNDLLIRLCKAGMVDDAVTALVGLAETGFKPEPHCWEFLTELNCKERKLLSVFE 505 Query: 1802 LLDELIMQD 1828 LLDEL++++ Sbjct: 506 LLDELVIKE 514 Score = 129 bits (325), Expect = 5e-27 Identities = 92/386 (23%), Positives = 188/386 (48%), Gaps = 2/386 (0%) Frame = +2 Query: 686 INLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQ 865 INL+K + QR L++F+ + + T L+ LV++ + A +L Sbjct: 66 INLIK---RERDPQRALEIFNRVSEQKGFSHNNATYGTILHKLVQSKKFQ-AIDSILRQM 121 Query: 866 
KNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGR 1045 K + +F L+K+ + + +E+ ++ P+L ST ++ + Sbjct: 122 TYETCKFHEGVFLNLMKHFSKFSLHDRVLEMFYAIQPIVREKPSLKAISTCLNLLIESNQ 181 Query: 1046 LEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCN-PNVVNY 1222 ++ A ++ P+ +N+L+ C++G ++ A ++++ MKK+ + PN++ Y Sbjct: 182 VDLARHFLLNSKKSLRLRPNTCIFNILVKHHCKNGDLESAFEVVKEMKKSRVSYPNLITY 241 Query: 1223 SALMNGLCKQGRLEDAKEIFNEMKAAG-MQPDKVGYTTLIDYLCRSSRVDEAIELLKEMK 1399 S LM GLC+ GRL++A E+F EM A + PD + Y LI+ C +VD A ++++ MK Sbjct: 242 STLMGGLCESGRLKEAIELFEEMVAKDQILPDVLTYNILINGFCCRGKVDRARKIMEFMK 301 Query: 1400 ETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLN 1579 C ++ ++ ++ G C+ R+ EA + + G+ + Y ++N LC+ + Sbjct: 302 NNGCNPNLFNYSTLINGFCKEGRWQEAKEVFVEMESIGLKPDTIGYTTLINCLCRAAQIE 361 Query: 1580 KATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLV 1759 +A ELL M + + N LL LC G+ +A ML L G ++ +++ Sbjct: 362 EAMELLKEMKEKECQADVVTLNVLLGGLCREGRFQDALQMLEKLPYEGVYLNKASYRIVL 421 Query: 1760 DVFCRERKLLPSFQLLDELIMQDW*P 1837 + C++ ++ + +L+ ++ + + P Sbjct: 422 NSLCQKDEMEKAAKLVGLMLDRGFVP 447 >ref|XP_006472504.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like isoform X1 [Citrus sinensis] gi|568836969|ref|XP_006472505.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like isoform X2 [Citrus sinensis] Length = 521 Score = 671 bits (1730), Expect = 0.0 Identities = 325/512 (63%), Positives = 407/512 (79%), Gaps = 2/512 (0%) Frame = +2 Query: 299 SMKAFLRIRCFXXXXXXXXXXVKWISPLQYVK--TQNLDSPAKTLDTISNVPRKRRYMSH 472 S++ LR C + WISPL+ +K T D P +T DT + ++ R++SH Sbjct: 3 SVRFSLRKCCRFSSFSTSSSSLPWISPLEVIKANTPKADPPVETSDTCVDARKRSRFISH 62 Query: 473 EHAINLINREKDPEHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMS 652 AI+LI EK+P+ ALEIFN S+QKGFNHNN+TY IL KLA+YKKF +D +L QM+ Sbjct: 63 GAAISLIKCEKEPQCALEIFNTVSEQKGFNHNNATYATILDKLARYKKFEAVDAVLRQMT 122 Query: 653 YETCMFHEGIFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQI 832 YETC FHEGIF+NLMKHFS +H+RVL+MFH I PI 
REKPSLKAISTCLNLL+E+NQ+ Sbjct: 123 YETCKFHEGIFLNLMKHFSNCSLHERVLEMFHKIHPITREKPSLKAISTCLNLLIESNQV 182 Query: 833 DLARTFLLNAQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYS 1012 DLA+ FL + ++L LKPNTCIFNIL+K+HC++G LESA E+++ MK S++SYPNLITYS Sbjct: 183 DLAQNFLKYSNRHLRLKPNTCIFNILIKHHCKRGTLESAFEVLKEMKKSQMSYPNLITYS 242 Query: 1013 TLMDGFCRCGRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKK 1192 TL+DG C+ GR EAIE+FEEMVSKDQILPDALTYN+LI+GFC GKVDRAKKIMEFMK Sbjct: 243 TLIDGLCKNGRFREAIELFEEMVSKDQILPDALTYNVLIDGFCHGGKVDRAKKIMEFMKN 302 Query: 1193 NGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDE 1372 NGCNPNV NY+ LMNG CK+G+L++AKE+F+EMK ++PD +GYTTLI+ CR+ VDE Sbjct: 303 NGCNPNVFNYTTLMNGFCKEGKLQEAKEVFDEMKNFHLKPDTIGYTTLINCFCRAGGVDE 362 Query: 1373 AIELLKEMKETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLN 1552 A+ELLKEMKE C+AD+VTFN+ILGGLCR R +EAL MLE+L +DGI LNKASYRIVLN Sbjct: 363 ALELLKEMKERGCKADIVTFNIILGGLCREGRIEEALGMLEKLWYDGIYLNKASYRIVLN 422 Query: 1553 SLCKEGDLNKATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTP 1732 LC++G+L KA ELL LML R LPH+A+SNELLV LC+AG +AA+ LFGLVE+GF P Sbjct: 423 FLCQKGELEKAIELLRLMLCRGFLPHYATSNELLVRLCKAGMAEDAAIALFGLVEMGFKP 482 Query: 1733 APDTWSLLVDVFCRERKLLPSFQLLDELIMQD 1828 D+W+LLV++ CR RKLL +F LLDEL++++ Sbjct: 483 ESDSWALLVEMICRGRKLLFAFVLLDELVIKE 514 >ref|XP_006433856.1| hypothetical protein CICLE_v10000867mg [Citrus clementina] gi|567882597|ref|XP_006433857.1| hypothetical protein CICLE_v10000867mg [Citrus clementina] gi|557535978|gb|ESR47096.1| hypothetical protein CICLE_v10000867mg [Citrus clementina] gi|557535979|gb|ESR47097.1| hypothetical protein CICLE_v10000867mg [Citrus clementina] Length = 521 Score = 670 bits (1729), Expect = 0.0 Identities = 323/512 (63%), Positives = 407/512 (79%), Gaps = 2/512 (0%) Frame = +2 Query: 299 SMKAFLRIRCFXXXXXXXXXXVKWISPLQYVK--TQNLDSPAKTLDTISNVPRKRRYMSH 472 S++ LR C + WISPL+ +K T D P +T DT + ++ +++SH Sbjct: 3 SVRFSLRKCCRFSSFSTSSSSLPWISPLEVIKANTPKADPPVETSDTCVDARKRSKFISH 62 Query: 473 
EHAINLINREKDPEHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMS 652 AI+LI EK+P+ ALEIFN S+QKGFNHNN TY IL KL +YKKF +D +L QM+ Sbjct: 63 GAAISLIKCEKEPQRALEIFNTVSEQKGFNHNNGTYATILDKLVRYKKFQAVDAVLRQMT 122 Query: 653 YETCMFHEGIFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQI 832 YETC FHEGIF+NLMKHFS +H+RVL+MFH I PI REKPSLKAISTCLNLL+E+NQ+ Sbjct: 123 YETCKFHEGIFLNLMKHFSNCSLHERVLEMFHKIHPITREKPSLKAISTCLNLLIESNQV 182 Query: 833 DLARTFLLNAQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYS 1012 DLA+ FL + ++L LKPNTCIFNIL+K+HC++G LESA E+++ MK S++SYPNLITYS Sbjct: 183 DLAQNFLKYSNQHLRLKPNTCIFNILIKHHCKRGTLESAFEVLKEMKKSQMSYPNLITYS 242 Query: 1013 TLMDGFCRCGRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKK 1192 TL+DG C+ GR EAIE+FEEMVSKDQILPDALTYN+LI+GFCR GKVDRAKKIMEFMK Sbjct: 243 TLIDGLCKNGRFREAIELFEEMVSKDQILPDALTYNVLIDGFCRGGKVDRAKKIMEFMKN 302 Query: 1193 NGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDE 1372 NGCNPNV NY+ LMNG CK+G+L++AKE+F+EMK ++PD +GYTTLI+ CR+ RVDE Sbjct: 303 NGCNPNVFNYTTLMNGFCKEGKLQEAKEVFDEMKNFLLKPDTIGYTTLINCFCRAGRVDE 362 Query: 1373 AIELLKEMKETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLN 1552 A+ELLKEMKE C+AD+VTFN+ILGGLCR + +EAL MLE+L +DGI LNKASYRIVLN Sbjct: 363 ALELLKEMKERGCKADIVTFNIILGGLCREGKIEEALGMLEKLWYDGIYLNKASYRIVLN 422 Query: 1553 SLCKEGDLNKATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTP 1732 C++G+L KA ELL LML R LPH+A+SNELLV LC+AG +AA+ LFGLVE+GF P Sbjct: 423 FSCQKGELEKAIELLRLMLCRGFLPHYATSNELLVRLCKAGMAEDAAIALFGLVEMGFKP 482 Query: 1733 APDTWSLLVDVFCRERKLLPSFQLLDELIMQD 1828 D+W+LLV++ CR RKLL +F+LLDEL++++ Sbjct: 483 ESDSWALLVELICRGRKLLFAFELLDELVIKE 514 >ref|XP_002532248.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223528066|gb|EEF30142.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 521 Score = 668 bits (1724), Expect = 0.0 Identities = 320/489 (65%), Positives = 398/489 (81%), Gaps = 2/489 (0%) Frame = +2 Query: 368 WISPLQYVKTQNL--DSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 
541 WISPLQ+ K L DSP +T T+ RK +++SHE AINLI REKDP+HALEIFN Sbjct: 23 WISPLQFSKAAPLVPDSPTETSSTLVETGRKCKFISHESAINLIKREKDPQHALEIFNMV 82 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 +QKGFNHN++TY ++HKLA+ KKF +D +LHQM+YETC FHE IF+NLMKHF KS + Sbjct: 83 GEQKGFNHNHATYSTLIHKLAQTKKFHAVDALLHQMTYETCKFHENIFLNLMKHFYKSSL 142 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H+RVL+MF+AIQPIVREKPSLKAISTCLN+LVE+ QIDLA+ LL ++L ++PNTCIF Sbjct: 143 HERVLEMFYAIQPIVREKPSLKAISTCLNILVESKQIDLAQKCLLYVNEHLKVRPNTCIF 202 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVK+HC+ GDLESA+E++ MK S SYPN+ITYSTL+DG C GRL+EAIE+FEEMV Sbjct: 203 NILVKHHCKSGDLESALEVMHEMKKSRRSYPNVITYSTLIDGLCGNGRLKEAIELFEEMV 262 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 SKDQILPDALTY++LI GFC GK DRA+KIMEFM+ NGC+PNV NYS LMNG CK+GRL Sbjct: 263 SKDQILPDALTYSVLIKGFCHGGKADRARKIMEFMRSNGCDPNVFNYSVLMNGFCKEGRL 322 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 E+AKE+F+EMK++G++PD VGYTTLI+ C R+DEA+ELLKEM E +C+AD VTFNV+ Sbjct: 323 EEAKEVFDEMKSSGLKPDTVGYTTLINCFCGVGRIDEAMELLKEMTEMKCKADAVTFNVL 382 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 L GLCR RFDEAL MLE L ++G+ LNK SYRIVLN LC++G+L K+ LLGLML+R Sbjct: 383 LKGLCREGRFDEALRMLENLAYEGVYLNKGSYRIVLNFLCQKGELEKSCALLGLMLSRGF 442 Query: 1622 LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 +PH+A+SNELLV LCEAG V NA LFGL ++GFTP P +W+ L++ CRERKLL F+ Sbjct: 443 VPHYATSNELLVCLCEAGMVDNAVTALFGLTQMGFTPEPKSWAHLIEYICRERKLLFVFE 502 Query: 1802 LLDELIMQD 1828 L+DEL+ ++ Sbjct: 503 LVDELVEKE 511 >ref|XP_004301354.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g18475-like [Fragaria vesca subsp. 
vesca] Length = 568 Score = 667 bits (1720), Expect = 0.0 Identities = 318/490 (64%), Positives = 401/490 (81%) Frame = +2 Query: 362 VKWISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 541 V WISPL+ K N P DT + RK +Y+SH AINLI RE+DP+HALEIFN Sbjct: 79 VSWISPLKLSKL-NAHQPDPPPDTRTEARRKSKYISHNAAINLIKRERDPQHALEIFNMV 137 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 S+QKGFNHNN+TY IL+KL++ KKF +D +L+QM Y+TC FHEGIF+NLMKHFSK M Sbjct: 138 SEQKGFNHNNATYATILNKLSQSKKFKAVDAVLYQMKYDTCKFHEGIFLNLMKHFSKFSM 197 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H+RVL+MFHAIQPIVREKPSLK ISTCLNLL+EANQ+D+A+ FL++ +K+L+LK NTCI Sbjct: 198 HERVLEMFHAIQPIVREKPSLKCISTCLNLLIEANQVDMAQQFLMHLKKSLNLKLNTCIA 257 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVK++C+ GDLESA E+V+ MK S++SYPNLITYSTL+DG C+ G+L EA+++F+EM+ Sbjct: 258 NILVKHYCKNGDLESAFEVVKKMKKSKLSYPNLITYSTLIDGLCQSGKLTEAMDMFDEMI 317 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 SK+QILPD LTYN+L+ GFCR GKVDRA+KI++FMK GCNPN+ NYS LMNG CK+ RL Sbjct: 318 SKEQILPDVLTYNILMKGFCRAGKVDRARKILDFMKSKGCNPNIYNYSTLMNGFCKEVRL 377 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 ++A+E+ +EMK+ G++PD V YTTLID CR+ RVDEAIELLKEMKE C+AD VTFNVI Sbjct: 378 KEAQELLDEMKSFGIKPDTVVYTTLIDCHCRTGRVDEAIELLKEMKERRCKADTVTFNVI 437 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 LGGLCR CR ++AL ML+ LP++GI LNK SYRIVLNSL ++GDLNKA ELL LM+ R Sbjct: 438 LGGLCRECRIEDALKMLDELPYEGIYLNKGSYRIVLNSLYQKGDLNKAKELLRLMMGRGF 497 Query: 1622 LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 +PH+A+SN LLVSLCEAG + +A LFGLVE+GF P D+W+ V+ CRERKLLP+F+ Sbjct: 498 VPHYATSNGLLVSLCEAGMIDDATTALFGLVEMGFKPLLDSWAXFVESICRERKLLPAFE 557 Query: 1802 LLDELIMQDW 1831 LLDEL+ +++ Sbjct: 558 LLDELVNEEF 567 >emb|CAN80524.1| hypothetical protein VITISV_030537 [Vitis vinifera] Length = 714 Score = 657 bits (1694), Expect = 0.0 
Identities = 327/512 (63%), Positives = 394/512 (76%), Gaps = 3/512 (0%) Frame = +2 Query: 302 MKAFLRIRCFXXXXXXXXXXVKWISPLQYVKTQNL--DSPAKTLDTISNVPRKR-RYMSH 472 M F RCF + WISPLQY+ + D PA T PRK+ +++SH Sbjct: 71 MNPFXEYRCFSCSPSAPSSSLPWISPLQYLNATSPKPDPPATEATTTMVEPRKKPKFISH 130 Query: 473 EHAINLINREKDPEHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMS 652 E AINLI RE DP+ ALEIFN+ ++Q+GF+HNN+TY ILHKLAK KKF ID +LHQM+ Sbjct: 131 ESAINLIKRETDPQRALEIFNRVAEQRGFSHNNATYATILHKLAKSKKFQAIDAVLHQMT 190 Query: 653 YETCMFHEGIFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQI 832 YETC FHEGIF+NLMKHFSK +H+RV++MF AI PIVREKPSLKAISTCLNLLVE+NQ Sbjct: 191 YETCKFHEGIFLNLMKHFSKLSLHERVVEMFDAIXPIVREKPSLKAISTCLNLLVESNQS 250 Query: 833 DLARTFLLNAQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYS 1012 + + GD++SA E+V MK S VSYPNLITYS Sbjct: 251 SIT---------------------------AKNGDIDSAFEVVEEMKKSHVSYPNLITYS 283 Query: 1013 TLMDGFCRCGRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKK 1192 TL++G C GRL+EAIE+FEEMVSKDQILPDALTYN LINGFC KVDRA KIMEFMKK Sbjct: 284 TLINGLCGSGRLKEAIELFEEMVSKDQILPDALTYNALINGFCHGXKVDRALKIMEFMKK 343 Query: 1193 NGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDE 1372 NGCNPNV NYSALMNG CK+GRLE+AKE+F+EMK+ G++PD VGYTTLI++ CR+ RVDE Sbjct: 344 NGCNPNVFNYSALMNGFCKEGRLEEAKEVFDEMKSLGLKPDTVGYTTLINFFCRAGRVDE 403 Query: 1373 AIELLKEMKETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLN 1552 A+ELLK+M E +CRAD VTFNVILGGLCR RF+EA MLERLP++G+ LNKASYRIVLN Sbjct: 404 AMELLKDMXENKCRADTVTFNVILGGLCREGRFEEAXGMLERLPYEGVYLNKASYRIVLN 463 Query: 1553 SLCKEGDLNKATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTP 1732 SLC+EG+L KAT+L+GLML R VLPHFA+SNELLV LCEAGKV +A + L GL+ELGF P Sbjct: 464 SLCREGELQKATQLVGLMLGRGVLPHFATSNELLVHLCEAGKVGDAVMALLGLLELGFKP 523 Query: 1733 APDTWSLLVDVFCRERKLLPSFQLLDELIMQD 1828 P++W+LLV++ CRERKLLP+F+LLD+L++Q+ Sbjct: 524 EPNSWALLVELICRERKLLPAFELLDDLVIQE 555 >ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like isoform X1 [Cicer arietinum] 
gi|502133024|ref|XP_004501624.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like isoform X2 [Cicer arietinum] Length = 510 Score = 653 bits (1685), Expect = 0.0 Identities = 313/484 (64%), Positives = 394/484 (81%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKASD 547 WISPL + K + LD P + + RK +Y++H+ AINLI REKDP+HAL+IFN S+ Sbjct: 24 WISPLNFSKPK-LDPPPEITLPSNETRRKNKYITHDVAINLIKREKDPQHALKIFNMVSE 82 Query: 548 QKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMHQ 727 QKGFNHNN+TY ILHKLA++KKF +D +LHQM+YETC FHEGIFINLMKH+SK H+ Sbjct: 83 QKGFNHNNATYASILHKLAQFKKFQAVDRVLHQMTYETCQFHEGIFINLMKHYSKCSFHE 142 Query: 728 RVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFNI 907 +VLD F +IQPIVREKPS KAISTCLNLLV++NQ+DLAR LL+A+++L KPN CIFNI Sbjct: 143 KVLDAFFSIQPIVREKPSPKAISTCLNLLVDSNQVDLARQLLLHAKRSLIYKPNVCIFNI 202 Query: 908 LVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVSK 1087 LVKYHCR GD+ESA E+V M+ S+ SYPN+ITYST+MDG CR GRL+EA E+FEEMVSK Sbjct: 203 LVKYHCRNGDIESAFEVVEEMRKSKYSYPNVITYSTMMDGLCRNGRLKEAFELFEEMVSK 262 Query: 1088 DQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLED 1267 D+I+PD LTYN+LINGFCR GK DRA+ ++EFMK NGC PNV NYSAL++GLCK G+L+D Sbjct: 263 DRIVPDPLTYNVLINGFCRGGKPDRARNVIEFMKSNGCCPNVFNYSALVDGLCKVGKLQD 322 Query: 1268 AKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILG 1447 AK +F EMK++G++PD V YT+LI++ CR+ ++DEAIELLKEMKE EC+AD V FNVILG Sbjct: 323 AKGVFAEMKSSGLKPDTVTYTSLINFFCRNRKIDEAIELLKEMKENECQADTVAFNVILG 382 Query: 1448 GLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLP 1627 G+CR RF+EAL+M+E+LP G+ LNK SYRIVLNSL ++ +L KA +LL LML+R LP Sbjct: 383 GMCREGRFEEALDMIEKLPQQGVYLNKGSYRIVLNSLTQKCELRKAKKLLELMLSRGFLP 442 Query: 1628 HFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQLL 1807 H+A+SNELL+S C+ G V +AA LF LVE+GF P D W LL+++ CR+RKLL F+LL Sbjct: 443 HYATSNELLISFCKEGMVDDAAAALFDLVEMGFQPPLDCWELLIELICRDRKLLYVFELL 502 Query: 1808 DELI 1819 DEL+ Sbjct: 503 DELV 506 Score = 65.1 
bits (157), Expect = 1e-07 Identities = 42/179 (23%), Positives = 80/179 (44%), Gaps = 34/179 (18%) Frame = +2 Query: 866 KNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGR 1045 K+ LKP+T + L+ + CR ++ A+E+++ MK +E + + ++ ++ G CR GR Sbjct: 331 KSSGLKPDTVTYTSLINFFCRNRKIDEAIELLKEMKENECQ-ADTVAFNVILGGMCREGR 389 Query: 1046 LEEAIEVFE----------------------------------EMVSKDQILPDALTYNL 1123 EEA+++ E E++ LP T N Sbjct: 390 FEEALDMIEKLPQQGVYLNKGSYRIVLNSLTQKCELRKAKKLLELMLSRGFLPHYATSNE 449 Query: 1124 LINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAA 1300 L+ FC++G VD A + + + G P + + L+ +C+ +L E+ +E+ A Sbjct: 450 LLISFCKEGMVDDAAAALFDLVEMGFQPPLDCWELLIELICRDRKLLYVFELLDELVTA 508 >ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Cucumis sativus] gi|449497032|ref|XP_004160294.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Cucumis sativus] Length = 504 Score = 642 bits (1656), Expect = 0.0 Identities = 305/460 (66%), Positives = 382/460 (83%) Frame = +2 Query: 452 KRRYMSHEHAINLINREKDPEHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHID 631 K Y+SHE AI LI E+DP+HAL+IFN S+Q+GFNHN++TY I+ LAKYKKF ID Sbjct: 44 KSSYISHETAIKLIKNERDPQHALDIFNMVSEQQGFNHNHATYASIIQNLAKYKKFQAID 103 Query: 632 IILHQMSYETCMFHEGIFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNL 811 +LHQM+Y+TC HEGIF+NLMKHFSKS MH+RVLDMF+AI+ IVREKPSLKAISTCLNL Sbjct: 104 GVLHQMTYDTCKVHEGIFLNLMKHFSKSSMHERVLDMFYAIKSIVREKPSLKAISTCLNL 163 Query: 812 LVEANQIDLARTFLLNAQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSY 991 LVE++++DLAR L+NA+ L+L+PNTCIFNILVK+HCR GDL++A E+V+ MKS+ VSY Sbjct: 164 LVESDRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRNGDLQAAFEVVKEMKSARVSY 223 Query: 992 PNLITYSTLMDGFCRCGRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKK 1171 PNL+TYSTL+ G C G+L+EAIE FEEMVSKD ILPDALTYN+LINGFC+ GKVDRA+ Sbjct: 224 PNLVTYSTLIGGLCENGKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRART 283 Query: 1172 IMEFMKKNGCNPNVVNYSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLC 1351 I+EFMK NGC+PNV NYS LMNG CK+GRL++AKE+FNE+K+ 
GM+PD + YTTLI+ LC Sbjct: 284 ILEFMKSNGCSPNVFNYSVLMNGYCKEGRLQEAKEVFNEIKSLGMKPDTISYTTLINCLC 343 Query: 1352 RSSRVDEAIELLKEMKETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKA 1531 R+ RVDEA ELL++MK+ +CRAD VTFNV+LGGLCR RFDEAL+M+++LP++G LNK Sbjct: 344 RTGRVDEATELLQQMKDKDCRADTVTFNVMLGGLCREGRFDEALDMVQKLPFEGFYLNKG 403 Query: 1532 SYRIVLNSLCKEGDLNKATELLGLMLARRVLPHFASSNELLVSLCEAGKVANAAVMLFGL 1711 SYRIVLN L ++G+L KATELLGLML R +PH A+SN LL+ LC G V +A L GL Sbjct: 404 SYRIVLNFLTQKGELRKATELLGLMLNRGFVPHHATSNTLLLLLCNNGMVKDAVESLLGL 463 Query: 1712 VELGFTPAPDTWSLLVDVFCRERKLLPSFQLLDELIMQDW 1831 +E+GF P ++W LVD+ CRERK+LP F+LLD L+ Q++ Sbjct: 464 LEMGFKPEHESWFTLVDLICRERKMLPVFELLDVLVTQEY 503 Score = 146 bits (369), Expect = 4e-32 Identities = 91/315 (28%), Positives = 163/315 (51%) Frame = +2 Query: 680 IFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLN 859 IF L+KH ++ Q ++ ++ P+L ST + L E ++ A F Sbjct: 192 IFNILVKHHCRNGDLQAAFEVVKEMKSARVSYPNLVTYSTLIGGLCENGKLKEAIEFFEE 251 Query: 860 AQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRC 1039 ++ P+ +NIL+ C++G ++ A I+ MKS+ S PN+ YS LM+G+C+ Sbjct: 252 MVSKDNILPDALTYNILINGFCQRGKVDRARTILEFMKSNGCS-PNVFNYSVLMNGYCKE 310 Query: 1040 GRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVN 1219 GRL+EA EVF E+ S + PD ++Y LIN CR G+VD A ++++ MK C + V Sbjct: 311 GRLQEAKEVFNEIKSLG-MKPDTISYTTLINCLCRTGRVDEATELLQQMKDKDCRADTVT 369 Query: 1220 YSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMK 1399 ++ ++ GLC++GR ++A ++ ++ G +K Y ++++L + + +A ELL M Sbjct: 370 FNVMLGGLCREGRFDEALDMVQKLPFEGFYLNKGSYRIVLNFLTQKGELRKATELLGLML 429 Query: 1400 ETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLN 1579 T N +L LC + +A+ L L G S+ +++ +C+E + Sbjct: 430 NRGFVPHHATSNTLLLLLCNNGMVKDAVESLLGLLEMGFKPEHESWFTLVDLICRERKML 489 Query: 1580 KATELLGLMLARRVL 1624 ELL +++ + L Sbjct: 490 PVFELLDVLVTQEYL 504 >ref|XP_003602939.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355491987|gb|AES73190.1| Pentatricopeptide repeat-containing protein 
[Medicago truncatula] Length = 586 Score = 637 bits (1644), Expect = e-180 Identities = 304/486 (62%), Positives = 391/486 (80%), Gaps = 2/486 (0%) Frame = +2 Query: 368 WISPLQYVKT--QNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 541 WISPL + K LD P + + ++ +K +Y++H+ AINLI REKDP+HAL+IFN Sbjct: 99 WISPLNFTKPLEPKLDPPPEIV--VAETRKKSKYITHDVAINLIKREKDPQHALKIFNMV 156 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 S+QKGFNHNN+TY IL KLA++KKF +D +LHQM+YE C FHEG+FINLMKH+SK Sbjct: 157 SEQKGFNHNNATYATILQKLAQFKKFQAVDRVLHQMTYEACKFHEGVFINLMKHYSKCGF 216 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H++V D F +IQ IVREKPS KAIS+CLNLLV++NQ+DL R LL A+++L KPN CIF Sbjct: 217 HEKVFDAFLSIQTIVREKPSPKAISSCLNLLVDSNQVDLVRKLLLYAKRSLVYKPNVCIF 276 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVKYHCR+GD++SA E+V+ M++S+ SYPN+ITYSTLMDG CR GRL+EA E+FEEMV Sbjct: 277 NILVKYHCRRGDIDSAFEVVKEMRNSKYSYPNVITYSTLMDGLCRNGRLKEAFELFEEMV 336 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 SKDQI+PD LTYN+LINGFCR+GK DRA+ ++EFMK NGC PNV NYSAL++GLCK G+L Sbjct: 337 SKDQIVPDPLTYNVLINGFCREGKADRARNVIEFMKNNGCCPNVFNYSALVDGLCKAGKL 396 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 +DAK + EMK++G++PD + YT+LI++ R+ ++DEAIELL EMKE +C+AD VTFNVI Sbjct: 397 QDAKGVLAEMKSSGLKPDAITYTSLINFFSRNGQIDEAIELLTEMKENDCQADTVTFNVI 456 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 LGGLCR RFDEAL+M+E+LP G+ LNK SYRIVLNSL + +L KA +LLGLML+R Sbjct: 457 LGGLCREGRFDEALDMIEKLPQQGVYLNKGSYRIVLNSLTQNCELRKANKLLGLMLSRGF 516 Query: 1622 LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 +PH+A+SNELLV LC+ G +AA LF LV++GF P D+W LL+D+ CR+RKLL F+ Sbjct: 517 VPHYATSNELLVRLCKEGMANDAATALFDLVDMGFQPQHDSWELLIDLICRDRKLLYVFE 576 Query: 1802 LLDELI 1819 LLDEL+ Sbjct: 577 LLDELV 582 >ref|XP_002301082.1| hypothetical protein POPTR_0002s10380g [Populus trichocarpa] 
gi|222842808|gb|EEE80355.1| hypothetical protein POPTR_0002s10380g [Populus trichocarpa] Length = 509 Score = 637 bits (1643), Expect = e-180 Identities = 316/483 (65%), Positives = 383/483 (79%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKASD 547 WISPL ++ T LD P KTL RK +++SHE A+NLI E+DP+HALEIFN + Sbjct: 20 WISPLHFL-TPKLDPPPKTL---LEPRRKPKFISHETAVNLIKHERDPQHALEIFNLVVE 75 Query: 548 QKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMHQ 727 QKGFNHN++TY I+ KLA+ KKF +D +L QM YETC FHE +F+NLMK+F+KS + Sbjct: 76 QKGFNHNHATYSTIIDKLARAKKFQAVDALLRQMMYETCKFHESLFLNLMKYFAKSSEFE 135 Query: 728 RVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFNI 907 RV++MF+ IQPIVREKPSLKAISTCLNLLVE+ Q+DL R FLL+ K+ LKPNTCIFNI Sbjct: 136 RVVEMFNKIQPIVREKPSLKAISTCLNLLVESKQVDLLRGFLLDLNKDHMLKPNTCIFNI 195 Query: 908 LVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVSK 1087 +KYHC+ GDLESA +V+ MK S +SYPNLITYSTLMDG C GRL+EAIE+FEEMVSK Sbjct: 196 FIKYHCKSGDLESAFAVVKEMKKSSISYPNLITYSTLMDGLCESGRLKEAIELFEEMVSK 255 Query: 1088 DQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLED 1267 DQILPDALTYN+LINGF GKVDRAKKIMEFMK NGC+PNV NYSALM+G CK+GRLE+ Sbjct: 256 DQILPDALTYNVLINGFSCWGKVDRAKKIMEFMKSNGCSPNVFNYSALMSGFCKEGRLEE 315 Query: 1268 AKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILG 1447 A + F EMK G++ D VGYT LI+Y CR R+DEA+ LL+EMKET+C+AD+VT NV+L Sbjct: 316 AMDAFEEMKIFGLKQDTVGYTILINYFCRFGRIDEAMALLEEMKETKCKADIVTVNVLLR 375 Query: 1448 GLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLP 1627 G C R +EAL ML RL +GI LNKASYRIVLNSLC++GDL+KA ELLGL L+R +P Sbjct: 376 GFCGEGRTEEALGMLNRLSSEGIYLNKASYRIVLNSLCQKGDLDKALELLGLTLSRGFVP 435 Query: 1628 HFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQLL 1807 H A+SNELLV LC+AG +A V L+GL E+GF P D+W+LLV+ CRERKLL +F+LL Sbjct: 436 HHATSNELLVGLCKAGMADDAVVALYGLAEMGFKPEQDSWALLVEFVCRERKLLLAFELL 495 Query: 1808 DEL 1816 DEL Sbjct: 496 DEL 498 Score = 101 bits (252), Expect = 1e-18 Identities = 70/300 (23%), 
Positives = 132/300 (44%) Frame = +2 Query: 512 EHALEIFNKASDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFIN 691 + A+E+F + + + TY V+++ + + K I+ M C + + Sbjct: 243 KEAIELFEEMVSKDQILPDALTYNVLINGFSCWGKVDRAKKIMEFMKSNGCSPNVFNYSA 302 Query: 692 LMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKN 871 LM F K + +D F + K Sbjct: 303 LMSGFCKEGRLEEAMDAFEEM-------------------------------------KI 325 Query: 872 LHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLE 1051 LK +T + IL+ Y CR G ++ A+ ++ MK ++ +++T + L+ GFC GR E Sbjct: 326 FGLKQDTVGYTILINYFCRFGRIDEAMALLEEMKETKCK-ADIVTVNVLLRGFCGEGRTE 384 Query: 1052 EAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSAL 1231 EA+ + + S+ L A +Y +++N C+ G +D+A +++ G P+ + L Sbjct: 385 EALGMLNRLSSEGIYLNKA-SYRIVLNSLCQKGDLDKALELLGLTLSRGFVPHHATSNEL 443 Query: 1232 MNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETEC 1411 + GLCK G +DA + G +P++ + L++++CR ++ A ELL E+ EC Sbjct: 444 LVGLCKAGMADDAVVALYGLAEMGFKPEQDSWALLVEFVCRERKLLLAFELLDELTANEC 503 >ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Glycine max] Length = 546 Score = 629 bits (1623), Expect = e-177 Identities = 302/485 (62%), Positives = 390/485 (80%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKASD 547 WISPL++ K D P + L + PRKR+++SH+ AI+LI REKDP+HAL IFN S+ Sbjct: 65 WISPLKFTKA---DPPPEPLPS---PPRKRKHISHDSAIDLIKREKDPQHALNIFNMVSE 118 Query: 548 QKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMHQ 727 Q GF HNN+TY IL KLA+ F +D +LHQM+YETC FHEGIF+NLMKHFSKS +H+ Sbjct: 119 QNGFQHNNATYATILDKLARCNNFHAVDRVLHQMTYETCKFHEGIFVNLMKHFSKSSLHE 178 Query: 728 RVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFNI 907 ++L + +IQPIVREKPS KA+STCLNLL+++N++DLAR LL+A+++L KPN C+FNI Sbjct: 179 KLLHAYFSIQPIVREKPSPKALSTCLNLLLDSNRVDLARKLLLHAKRDLTRKPNVCVFNI 238 Query: 908 LVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVSK 1087 LVKYHC+ GDL+SA EIV M++SE SYPNL+TYSTLMDG CR GR++EA ++FEEMVS+ Sbjct: 239 
LVKYHCKNGDLDSAFEIVEEMRNSEFSYPNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSR 298 Query: 1088 DQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLED 1267 D I+PD LTYN+LINGFCR GK DRA+ +++FMK NGC PNV NYSAL++GLCK G+LED Sbjct: 299 DHIVPDPLTYNVLINGFCRGGKPDRARNVIQFMKSNGCYPNVYNYSALVDGLCKVGKLED 358 Query: 1268 AKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILG 1447 AK + E+K +G++PD V YT+LI++LCR+ + DEAIELL+EMKE C+AD VTFNV+LG Sbjct: 359 AKGVLAEIKGSGLKPDAVTYTSLINFLCRNGKSDEAIELLEEMKENGCQADSVTFNVLLG 418 Query: 1448 GLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLP 1627 GLCR +F+EAL+M+E+LP G+ LNK SYRIVLNSL ++ +L +A ELLGLML R P Sbjct: 419 GLCREGKFEEALDMVEKLPQQGVYLNKGSYRIVLNSLTQKCELKRAKELLGLMLRRGFQP 478 Query: 1628 HFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQLL 1807 H+A+SNELLV LC+AG V +AAV LF LVE+GF P +TW +L+ + CRERKLL F+LL Sbjct: 479 HYATSNELLVCLCKAGMVDDAAVALFDLVEMGFQPGLETWEVLIGLICRERKLLYVFELL 538 Query: 1808 DELIM 1822 DEL++ Sbjct: 539 DELVV 543 >gb|ESW09636.1| hypothetical protein PHAVU_009G143500g, partial [Phaseolus vulgaris] Length = 742 Score = 619 bits (1596), Expect = e-174 Identities = 299/474 (63%), Positives = 374/474 (78%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKASD 547 WISPL++ K P +T PRKR+++SH+ AINLI REKDP+ AL+IFN S Sbjct: 31 WISPLKFTKPAQ-PKPDPPPETAVEPPRKRKFISHDGAINLIKREKDPQLALKIFNMVSQ 89 Query: 548 QKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMHQ 727 QKGF HNN+TY IL KLA+ KF +D +LHQM+YETC FHEGIF+NLM HFSKS +H Sbjct: 90 QKGFQHNNATYATILEKLARCNKFHAVDRVLHQMTYETCKFHEGIFVNLMSHFSKSSLHD 149 Query: 728 RVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFNI 907 +VL F +IQPIVR+KPS KA++TCLNLL+++N++DLAR LL+A++ L KPN CIFNI Sbjct: 150 KVLQAFFSIQPIVRDKPSPKALTTCLNLLLDSNRVDLARKLLLHAKRGLTHKPNVCIFNI 209 Query: 908 LVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVSK 1087 LVKYHC+ GDLESA E+V+ M+SSE SYPNLITYSTLMDG CR GRL EA ++FEEMVS+ Sbjct: 210 LVKYHCKNGDLESAFEVVKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQLFEEMVSR 269 
Query: 1088 DQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLED 1267 D I+PD LTYN+LINGFCR+GK D A+ ++EFMK NGC PNV NYSAL+NGLC+ G+LED Sbjct: 270 DHIVPDPLTYNVLINGFCREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCRIGKLED 329 Query: 1268 AKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILG 1447 AK + EMK +G++PD V YT+LI+YLCR+ +V EAI+LL+EMKE + +AD V FN+ILG Sbjct: 330 AKGVLAEMKNSGLKPDAVTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADTVVFNLILG 389 Query: 1448 GLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLP 1627 GLCR RF+EAL+MLE+LP G+ LNK SYRIVLNSL + G+L A ELLGLML+R LP Sbjct: 390 GLCREDRFEEALDMLEKLPQQGVYLNKGSYRIVLNSLIQNGELKSAKELLGLMLSRGFLP 449 Query: 1628 HFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLL 1789 H+ASSNELLV LC+ G +AA LF LVE+GF P ++W +L+ + CR+RKLL Sbjct: 450 HYASSNELLVCLCKGGMADDAARALFDLVEMGFQPGLESWEILIGLICRDRKLL 503 Score = 118 bits (296), Expect = 1e-23 Identities = 65/248 (26%), Positives = 133/248 (53%), Gaps = 2/248 (0%) Frame = +2 Query: 1100 PDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCN-PNVVNYSALMNGLCKQGRLEDAKE 1276 P+ +N+L+ C++G ++ A ++++ M+ + + PN++ YS LM+GLC+ GRL +A + Sbjct: 202 PNVCIFNILVKYHCKNGDLESAFEVVKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQ 261 Query: 1277 IFNEMKAAG-MQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILGGL 1453 +F EM + + PD + Y LI+ CR + D A +++ MK C +V ++ ++ GL Sbjct: 262 LFEEMVSRDHIVPDPLTYNVLINGFCREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGL 321 Query: 1454 CRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLPHF 1633 CR + ++A +L + G+ + +Y ++N LC+ G + +A +LL M ++ Sbjct: 322 CRIGKLEDAKGVLAEMKNSGLKPDAVTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADT 381 Query: 1634 ASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQLLDE 1813 N +L LC + A ML L + G ++ ++++ + +L + +LL Sbjct: 382 VVFNLILGGLCREDRFEEALDMLEKLPQQGVYLNKGSYRIVLNSLIQNGELKSAKELLGL 441 Query: 1814 LIMQDW*P 1837 ++ + + P Sbjct: 442 MLSRGFLP 449 >ref|XP_002873896.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] gi|297319733|gb|EFH50155.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 507 Score = 548 bits (1413), Expect = e-153 Identities = 259/485 (53%), Positives = 358/485 (73%), Gaps = 2/485 (0%) Frame = +2 Query: 368 WISPLQYV--KTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 541 W+SP+ + K + LD P ++ + K +++SHE ++L+ RE+DP+ AL+IFNKA Sbjct: 21 WVSPICFSEKKKKKLDPPPESSISTMETNPKTKFISHESTVSLMKRERDPQRALDIFNKA 80 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 S QKGFNHNN+TY V+L L ++KKF +D ILHQM YETC F E +F+NLM+HFS+ + Sbjct: 81 SQQKGFNHNNATYSVLLDNLVRHKKFLAVDAILHQMKYETCRFQESLFLNLMRHFSRFDL 140 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H +V++MF+ IQ I R KPSL AISTCLNLL+++ ++DLAR LL A+ NL L+PNTCIF Sbjct: 141 HDKVMEMFNLIQVIARVKPSLNAISTCLNLLIDSGEVDLARKLLLYAKHNLALQPNTCIF 200 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVK+HC+ GD++SA +V MK S +SYPN ITYSTLMD R +EA+E+FE+M+ Sbjct: 201 NILVKHHCKNGDIDSAFRVVEEMKRSGISYPNSITYSTLMDCLFAQSRSKEAVELFEDMI 260 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 SK I PD + +N++INGFCR G+V+RAK I++FMKKNGCNPNV NYSALMNG CK+G++ Sbjct: 261 SKRGISPDPVIFNVMINGFCRSGEVERAKMILDFMKKNGCNPNVYNYSALMNGFCKEGKI 320 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 ++AK++F+E+K G++ D VGYTTL++ LCR+ +DEA++LL EMK + CRAD +T+NVI Sbjct: 321 QEAKQVFDEVKKTGLKLDTVGYTTLMNCLCRNGEIDEAMKLLGEMKASRCRADALTYNVI 380 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 L GL R +EAL ML++ +G+ LNK SYRI+LN+LC G+L KA + L +M R + Sbjct: 381 LRGLSSEGRSEEALQMLDQWGCEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSKRGI 440 Query: 1622 LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 PH A+ NEL+V LCE+G +L G + +G PAP +W +V+ C+ERKL+ F+ Sbjct: 441 WPHHATWNELVVRLCESGNTEIGVRVLIGFLGIGLIPAPKSWGAVVESICKERKLVHVFE 500 Query: 1802 LLDEL 1816 LLD L Sbjct: 501 LLDSL 505 >ref|NP_974803.1| 
pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122214363|sp|Q3E9F0.1|PP392_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g18475 gi|110737103|dbj|BAF00503.1| hypothetical protein [Arabidopsis thaliana] gi|332005185|gb|AED92568.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 506 Score = 539 bits (1388), Expect = e-150 Identities = 256/486 (52%), Positives = 356/486 (73%), Gaps = 2/486 (0%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPA--KTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKA 541 W+SP+ + + + SP ++ + P K +++SHE A++L+ RE+DP+ L+IFNKA Sbjct: 21 WVSPICFSEKKKKPSPPPESSISPVETNP-KTKFISHESAVSLMKRERDPQGVLDIFNKA 79 Query: 542 SDQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHM 721 S QKGFNHNN+TY V+L L ++KKF +D ILHQM YETC F E +F+NLM+HFS+S + Sbjct: 80 SQQKGFNHNNATYSVLLDNLVRHKKFLAVDAILHQMKYETCRFQESLFLNLMRHFSRSDL 139 Query: 722 HQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIF 901 H +V++MF+ IQ I R KPSL AISTCLNLL+++ +++L+R LL A+ NL L+PNTCIF Sbjct: 140 HDKVMEMFNLIQVIARVKPSLNAISTCLNLLIDSGEVNLSRKLLLYAKHNLGLQPNTCIF 199 Query: 902 NILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMV 1081 NILVK+HC+ GD+ A +V MK S +SYPN ITYSTLMD R +EA+E+FE+M+ Sbjct: 200 NILVKHHCKNGDINFAFLVVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMI 259 Query: 1082 SKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRL 1261 SK+ I PD +T+N++INGFCR G+V+RAKKI++FMKKNGCNPNV NYSALMNG CK G++ Sbjct: 260 SKEGISPDPVTFNVMINGFCRAGEVERAKKILDFMKKNGCNPNVYNYSALMNGFCKVGKI 319 Query: 1262 EDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVI 1441 ++AK+ F+E+K G++ D VGYTTL++ CR+ DEA++LL EMK + CRAD +T+NVI Sbjct: 320 QEAKQTFDEVKKTGLKLDTVGYTTLMNCFCRNGETDEAMKLLGEMKASRCRADTLTYNVI 379 Query: 1442 LGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRV 1621 L GL R +EAL ML++ +G+ LNK SYRI+LN+LC G+L KA + L +M R + Sbjct: 380 LRGLSSEGRSEEALQMLDQWGSEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSERGI 439 Query: 1622 
LPHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQ 1801 PH A+ NEL+V LCE+G +L G + +G P P +W +V+ C+ERKL+ F+ Sbjct: 440 WPHHATWNELVVRLCESGYTEIGVRVLIGFLRIGLIPGPKSWGAVVESICKERKLVHVFE 499 Query: 1802 LLDELI 1819 LLD L+ Sbjct: 500 LLDSLV 505 >ref|XP_006287559.1| hypothetical protein CARUB_v10000770mg [Capsella rubella] gi|565459122|ref|XP_006287560.1| hypothetical protein CARUB_v10000770mg [Capsella rubella] gi|482556265|gb|EOA20457.1| hypothetical protein CARUB_v10000770mg [Capsella rubella] gi|482556266|gb|EOA20458.1| hypothetical protein CARUB_v10000770mg [Capsella rubella] Length = 506 Score = 538 bits (1386), Expect = e-150 Identities = 258/485 (53%), Positives = 354/485 (72%), Gaps = 1/485 (0%) Frame = +2 Query: 368 WISPLQYV-KTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKAS 544 W+SP+ + K + +SP ++ + K +++SH AI L+ RE+DP+ +L+IFN+AS Sbjct: 21 WVSPICFSDKMKKPNSPPESSISPLETNPKTKFISHASAIELMRRERDPQRSLDIFNRAS 80 Query: 545 DQKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMH 724 QKGFNHNN+TY V+L L ++KKF +D ILHQM YETC F E +F+NLM+HFS+ +H Sbjct: 81 QQKGFNHNNATYSVLLDNLVRHKKFLAVDAILHQMRYETCRFEESLFLNLMRHFSRFDLH 140 Query: 725 QRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFN 904 +V+DMF+ IQ I R KPSLK+ISTCLNLL++A +I+LAR LL A+ NL L+PNTCIFN Sbjct: 141 DKVMDMFNLIQVIARVKPSLKSISTCLNLLIDAGEINLARNLLLYAKHNLGLQPNTCIFN 200 Query: 905 ILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVS 1084 ILVK+HC+ GD++SA +V MK S +SYPN ITYSTLMD R +EA+E+FE+M+S Sbjct: 201 ILVKHHCKNGDIDSAFRVVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMIS 260 Query: 1085 KDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLE 1264 K+ ILPD +T+N++INGFCR G+V RA+ I++FMKKNGCNPNV NYSALMNG CK+G ++ Sbjct: 261 KEGILPDPVTFNVMINGFCRSGEVKRAEMILDFMKKNGCNPNVYNYSALMNGFCKEGNIQ 320 Query: 1265 DAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVIL 1444 +AK IFNE+K G++ D VGYTTL++ LC++ +DEA++LL EMK + CR D +T NVIL Sbjct: 321 EAKRIFNEVKEVGLRLDTVGYTTLMNCLCKNGAIDEAMKLLGEMKASRCRVDALTCNVIL 380 Query: 1445 
GGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVL 1624 GL R +EAL ML++ +G+ L+K SYRI+LN LC G L KA + L +M R + Sbjct: 381 KGLSSEGRSEEALQMLDQWGCEGVHLDKGSYRIILNGLCHNGKLEKAVKFLSVMSERGMW 440 Query: 1625 PHFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQL 1804 PH A+ NEL+V LC +G +L G +++G P P +W +V+ CRERKL+ F+L Sbjct: 441 PHHATWNELVVRLCGSGNAEMGVRVLIGFLKIGLQPEPSSWRAVVESSCRERKLVHVFEL 500 Query: 1805 LDELI 1819 LD L+ Sbjct: 501 LDSLV 505 Score = 127 bits (320), Expect = 2e-26 Identities = 82/311 (26%), Positives = 155/311 (49%) Frame = +2 Query: 680 IFINLMKHFSKSHMHQRVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLN 859 IF L+KH K+ + ++ P+ ST ++ L ++ A + Sbjct: 198 IFNILVKHHCKNGDIDSAFRVVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFED 257 Query: 860 AQKNLHLKPNTCIFNILVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRC 1039 + P+ FN+++ CR G+++ A I+ MK + + PN+ YS LM+GFC+ Sbjct: 258 MISKEGILPDPVTFNVMINGFCRSGEVKRAEMILDFMKKNGCN-PNVYNYSALMNGFCKE 316 Query: 1040 GRLEEAIEVFEEMVSKDQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVN 1219 G ++EA +F E V + + D + Y L+N C++G +D A K++ MK + C + + Sbjct: 317 GNIQEAKRIFNE-VKEVGLRLDTVGYTTLMNCLCKNGAIDEAMKLLGEMKASRCRVDALT 375 Query: 1220 YSALMNGLCKQGRLEDAKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMK 1399 + ++ GL +GR E+A ++ ++ G+ DK Y +++ LC + ++++A++ L M Sbjct: 376 CNVILKGLSSEGRSEEALQMLDQWGCEGVHLDKGSYRIILNGLCHNGKLEKAVKFLSVMS 435 Query: 1400 ETECRADVVTFNVILGGLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLN 1579 E T+N ++ LC S + + +L G+ +S+R V+ S C+E L Sbjct: 436 ERGMWPHHATWNELVVRLCGSGNAEMGVRVLIGFLKIGLQPEPSSWRAVVESSCRERKLV 495 Query: 1580 KATELLGLMLA 1612 ELL ++A Sbjct: 496 HVFELLDSLVA 506 >ref|XP_006403586.1| hypothetical protein EUTSA_v10010303mg [Eutrema salsugineum] gi|557104705|gb|ESQ45039.1| hypothetical protein EUTSA_v10010303mg [Eutrema salsugineum] Length = 505 Score = 527 bits (1357), Expect = e-146 Identities = 250/484 (51%), Positives = 354/484 (73%) Frame = +2 Query: 368 WISPLQYVKTQNLDSPAKTLDTISNVPRKRRYMSHEHAINLINREKDPEHALEIFNKASD 547 W+SP+ + + D P ++ + K +++SHE 
A+NLI E+DP+ AL++FN S Sbjct: 21 WVSPICFTEKTKPDPPPESSISHVETNPKTKFISHESAVNLIKCERDPQCALDVFNILSR 80 Query: 548 QKGFNHNNSTYGVILHKLAKYKKFGHIDIILHQMSYETCMFHEGIFINLMKHFSKSHMHQ 727 QKGFNHN++TY V+L L ++KKF +D IL+QM YETC F EG+F+NLM+H+S+ +H+ Sbjct: 81 QKGFNHNSATYSVLLDNLVRHKKFQAVDAILNQMKYETCRFQEGVFLNLMRHYSRFDLHE 140 Query: 728 RVLDMFHAIQPIVREKPSLKAISTCLNLLVEANQIDLARTFLLNAQKNLHLKPNTCIFNI 907 +V++MF+ I I R KPSL AISTCLNLL+++ ++DLAR LL A+ +L L+PNTCIFNI Sbjct: 141 KVMEMFNLILMIARVKPSLNAISTCLNLLIDSGEVDLARKLLLYAKNHLGLQPNTCIFNI 200 Query: 908 LVKYHCRKGDLESAVEIVRGMKSSEVSYPNLITYSTLMDGFCRCGRLEEAIEVFEEMVSK 1087 LVK+HC+ GD++SA +V M+ +SYPNLITYSTL++ R +EA+E+FE+M+S Sbjct: 201 LVKHHCKNGDVDSAFRVVEEMRRFGISYPNLITYSTLIECLFAHSRSKEAMELFEDMISN 260 Query: 1088 DQILPDALTYNLLINGFCRDGKVDRAKKIMEFMKKNGCNPNVVNYSALMNGLCKQGRLED 1267 + I PD +T+N++INGFCR G+V+RAK I+EFMKKNGCNPNV NYSALMNG CK+G++++ Sbjct: 261 EGISPDPVTFNVMINGFCRAGQVERAKMIIEFMKKNGCNPNVFNYSALMNGFCKEGKIQE 320 Query: 1268 AKEIFNEMKAAGMQPDKVGYTTLIDYLCRSSRVDEAIELLKEMKETECRADVVTFNVILG 1447 AK IF+E+K G++ D VGYTTL++ LC++ ++DEA+ELL EMK + C+AD +T+NVIL Sbjct: 321 AKLIFDEVKETGLKLDTVGYTTLMNCLCKNGQIDEAMELLVEMKASGCKADALTYNVILR 380 Query: 1448 GLCRSCRFDEALNMLERLPWDGISLNKASYRIVLNSLCKEGDLNKATELLGLMLARRVLP 1627 GL R ++AL ML + +G+ LNK SYRI+LN+LCK G+L KA E L LM + V P Sbjct: 381 GLSSEGRAEQALEMLGQWGCEGVHLNKGSYRIILNALCKNGELEKAVEFLSLMSKKGVWP 440 Query: 1628 HFASSNELLVSLCEAGKVANAAVMLFGLVELGFTPAPDTWSLLVDVFCRERKLLPSFQLL 1807 H A+ NEL+V LC +G +L G + +GF P P +W +V C+ERKLL +L+ Sbjct: 441 HHATWNELVVQLCGSGNADIGVRVLKGFLGIGFKPEPQSWGAVVGSVCKERKLLHVIELV 500 Query: 1808 DELI 1819 D L+ Sbjct: 501 DSLV 504