BLASTX nr result
ID: Catharanthus22_contig00023775
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00023775
         (934 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat...   384   e-104
emb|CBI35029.3| unnamed protein product [Vitis vinifera]              382   e-104
ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat...   382   e-104
ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat...   379   e-102
ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containi...   370   e-100
gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [...   368   2e-99
gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis]     360   3e-97
ref|XP_002517451.1| pentatricopeptide repeat-containing protein,...   357   4e-96
gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily p...   356   8e-96
ref|NP_189507.2| pentatricopeptide repeat-containing protein [Ar...   354   2e-95
ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Caps...   353   7e-95
ref|XP_002877120.1| pentatricopeptide repeat-containing protein ...   351   2e-94
ref|NP_189505.2| putative pentatricopeptide repeat-containing pr...   350   4e-94
ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutr...   339   8e-91
gb|EMT01880.1| hypothetical protein F775_14784 [Aegilops tauschii]    305   2e-80
ref|NP_001168258.1| hypothetical protein [Zea mays] gi|223947079...   303   5e-80
gb|EMS68422.1| hypothetical protein TRIUR3_23855 [Triticum urartu]    303   6e-80
ref|XP_004981863.1| PREDICTED: putative pentatricopeptide repeat...   302   1e-79
ref|NP_001173276.1| Os03g0158900 [Oryza sativa Japonica Group] g...
300 5e-79 ref|XP_002439151.1| hypothetical protein SORBIDRAFT_09g001360 [S... 299 9e-79 >ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g28640-like [Solanum tuberosum] Length = 512 Score = 384 bits (987), Expect = e-104 Identities = 191/300 (63%), Positives = 227/300 (75%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 +F++ML RG+ PDE+C+TTAL ACAQLG L+QGKWIHE+V + L D F+G+ALVDM Sbjct: 211 IFQDMLGRGVGPDEYCVTTALGACAQLGALEQGKWIHEHVTKSEWLEY-DVFIGSALVDM 269 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+ A EVF MPKRNK SWA +I G+AVHG A CLERMQ DG+ PDG+ Sbjct: 270 YAKCGCINMASEVFESMPKRNKHSWATMIRGFAVHGRPELAISCLERMQVADGLKPDGVV 329 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +L VLAAC H+GL K+GQ LL+ MESLYG+ PEHEH+SCVVDLLCRAG+L +A++LIRRM Sbjct: 330 ILAVLAACAHSGLQKEGQGLLDEMESLYGVTPEHEHFSCVVDLLCRAGRLDDALKLIRRM 389 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKP ASVWGALLSGCR+HNNV AYVQLSNIYL ARQ Sbjct: 390 PMKPRASVWGALLSGCRNHNNVNLAELAVKEILLVEDGNEAEEDSAYVQLSNIYLAARQC 449 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSID 900 DDARRIRR IG++GL+KTPGYSA+EIDG+ +EFISGDVSH+ L +IH VL L L+ ID Sbjct: 450 DDARRIRRRIGDRGLRKTPGYSAIEIDGMINEFISGDVSHTCLADIHKVLDLTYLDPEID 509 >emb|CBI35029.3| unnamed protein product [Vitis vinifera] Length = 1596 Score = 382 bits (982), Expect = e-104 Identities = 188/297 (63%), Positives = 227/297 (76%) Frame = +1 Query: 4 FRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMY 183 FRNMLV G+EPDEFCLTTAL CAQLG L QGKWIHEYV RK L AD F+GTALVDMY Sbjct: 195 FRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE-ADVFIGTALVDMY 253 Query: 184 AKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAV 363 AKCG ID + EVF M KRN FSW+A+IGG+A+HG +A QCLERMQ EDG+ PDG+ + Sbjct: 254 AKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQVEDGLRPDGVVL 313 Query: 364 
LGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMP 543 LGV+ AC HAGL ++GQFLLENME+ YG++P+HEHYSC+VDLLCRAGQL EA++LIRRMP Sbjct: 314 LGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAGQLDEALKLIRRMP 373 Query: 544 MKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYD 723 MKP A+VWGALLSGCR+HNNV AYVQLSNIYL A++ + Sbjct: 374 MKPRAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAYVQLSNIYLAAQKCE 433 Query: 724 DARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVS 894 DA RIRRMIG+K +K PG S +E++G ++F+SGD+SH L +IH +L L+SL+ S Sbjct: 434 DACRIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIHEMLDLVSLQHS 490 Score = 78.2 bits (191), Expect = 4e-12 Identities = 49/193 (25%), Positives = 89/193 (46%) Frame = +1 Query: 28 IEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDT 207 + PD+ ++AC + GK IH +V + + +D + TALV YA+C + Sbjct: 101 VGPDQHTFPFIISACTNSLWMLLGKQIHNWVL-KNGVASSDRHVQTALVRFYAECCAMGD 159 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A ++F +P + W ++ GY G A +A M G+ PD + L C Sbjct: 160 ARKLFDEIPNLDVVQWNVLLNGYVRRGLAPEALNAFRNMLV-SGVEPDEFCLTTALKGCA 218 Query: 388 HAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVW 567 G ++QG+++ E + L + + +VD+ + G + +VE+ M + + S W Sbjct: 219 QLGALQQGKWIHEYVTKRKWLEADVFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFS-W 277 Query: 568 GALLSGCRSHNNV 606 A++ G H +V Sbjct: 278 SAMIGGFALHGHV 290 >ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g28640-like [Vitis vinifera] Length = 511 Score = 382 bits (982), Expect = e-104 Identities = 188/297 (63%), Positives = 227/297 (76%) Frame = +1 Query: 4 FRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMY 183 FRNMLV G+EPDEFCLTTAL CAQLG L QGKWIHEYV RK L AD F+GTALVDMY Sbjct: 195 FRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE-ADVFIGTALVDMY 253 Query: 184 AKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAV 363 AKCG ID + EVF M KRN FSW+A+IGG+A+HG +A QCLERMQ EDG+ PDG+ + Sbjct: 254 AKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQVEDGLRPDGVVL 
313 Query: 364 LGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMP 543 LGV+ AC HAGL ++GQFLLENME+ YG++P+HEHYSC+VDLLCRAGQL EA++LIRRMP Sbjct: 314 LGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAGQLDEALKLIRRMP 373 Query: 544 MKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYD 723 MKP A+VWGALLSGCR+HNNV AYVQLSNIYL A++ + Sbjct: 374 MKPRAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAYVQLSNIYLAAQKCE 433 Query: 724 DARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVS 894 DA RIRRMIG+K +K PG S +E++G ++F+SGD+SH L +IH +L L+SL+ S Sbjct: 434 DACRIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIHEMLDLVSLQHS 490 Score = 78.2 bits (191), Expect = 4e-12 Identities = 49/193 (25%), Positives = 89/193 (46%) Frame = +1 Query: 28 IEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDT 207 + PD+ ++AC + GK IH +V + + +D + TALV YA+C + Sbjct: 101 VGPDQHTFPFIISACTNSLWMLLGKQIHNWVL-KNGVASSDRHVQTALVRFYAECCAMGD 159 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A ++F +P + W ++ GY G A +A M G+ PD + L C Sbjct: 160 ARKLFDEIPNLDVVQWNVLLNGYVRRGLAPEALNAFRNMLV-SGVEPDEFCLTTALKGCA 218 Query: 388 HAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVW 567 G ++QG+++ E + L + + +VD+ + G + +VE+ M + + S W Sbjct: 219 QLGALQQGKWIHEYVTKRKWLEADVFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFS-W 277 Query: 568 GALLSGCRSHNNV 606 A++ G H +V Sbjct: 278 SAMIGGFALHGHV 290 >ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g28640-like [Solanum lycopersicum] Length = 487 Score = 379 bits (972), Expect = e-102 Identities = 189/300 (63%), Positives = 224/300 (74%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 +F++ML RG+ PDE+C+TTAL ACAQLG L+QGKWIHE+V + L D F+G+ALVDM Sbjct: 186 IFQDMLGRGVGPDEYCVTTALGACAQLGALEQGKWIHEHVTKSEWLEY-DVFIGSALVDM 244 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+ A EVF MP RNK SWA +I G+AVHG A CLERMQ DG+ PDG+ Sbjct: 245 
YAKCGSINLASEVFESMPTRNKHSWATMIRGFAVHGRPELALSCLERMQVADGLKPDGVV 304 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +L VLAAC H+GL K+GQ LL+ MESLYG+ PEHEH+SCVVDLLCRAG+L +A++LIRRM Sbjct: 305 ILAVLAACAHSGLQKEGQGLLDEMESLYGVTPEHEHFSCVVDLLCRAGRLDDALKLIRRM 364 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKP ASVWGALLSGCR+HNNV AYVQLSNIYL ARQ Sbjct: 365 PMKPRASVWGALLSGCRNHNNVNLAELAVKEILLVEDGNEAEEDSAYVQLSNIYLAARQC 424 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSID 900 DDARRIRR IG++GL+KTPGYSA+EIDG+ +EFISGDVSH L +IH VL L L+ D Sbjct: 425 DDARRIRRRIGDRGLRKTPGYSAIEIDGMVNEFISGDVSHICLADIHKVLDLTYLDPHFD 484 >ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containing protein At3g28660-like [Fragaria vesca subsp. vesca] Length = 501 Score = 370 bits (949), Expect = e-100 Identities = 181/296 (61%), Positives = 221/296 (74%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VF++MLVRG EPD FC+ T L ACA LG L QGKWIHEYV+ R+ L +D F+GTALVDM Sbjct: 202 VFQDMLVRGFEPDGFCVATGLAACAHLGALWQGKWIHEYVRKREGLN-SDVFIGTALVDM 260 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG ID A E F M KRN SW+A+IG Y VHG A +A CLERMQ +DG+ PDG+ Sbjct: 261 YAKCGCIDLAVEAFEGMGKRNVVSWSAMIGAYGVHGYATEAISCLERMQVDDGVKPDGVV 320 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVL AC H GL+++G+ LL+NM++ YG+VP+HEHYSCV+DLLC+AG+L +A ELIRRM Sbjct: 321 LLGVLTACNHGGLLEKGKALLDNMKAKYGIVPKHEHYSCVIDLLCKAGRLSDAFELIRRM 380 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALLSGCR HNNV AYVQLSNIYLGA++ Sbjct: 381 PMKPLASVWGALLSGCRIHNNVDLAEIAVEQLLQVANDDRGEEVGAYVQLSNIYLGAQRS 440 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLE 888 +DA RIR+MIGEKG+KKTPG S +E+DG +EF+SGDVSHS +I +L L+S + Sbjct: 441 EDALRIRKMIGEKGIKKTPGCSMLEVDGKVNEFVSGDVSHSHCVQICTMLDLISAD 496 Score = 98.2 bits (243), Expect = 4e-18 Identities = 59/190 
(31%), Positives = 94/190 (49%) Frame = +1 Query: 28 IEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDT 207 +EPD F A+ C G + G+ +H V + L AD + TA+V +Y +CG + Sbjct: 109 LEPDNFTFHFAILGCVNCGWIGPGRQMHCLVV-KNGLVAADAHVQTAVVRLYVECGVLGD 167 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A +VF +P+R+ W I+ GY G A++A + + M G PDG V LAAC Sbjct: 168 AHKVFDEIPERDMVQWNVIMNGYVKRGLASEALRVFQDMLVR-GFEPDGFCVATGLAACA 226 Query: 388 HAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVW 567 H G + QG+++ E + GL + + +VD+ + G + AVE M + + S W Sbjct: 227 HLGALWQGKWIHEYVRKREGLNSDVFIGTALVDMYAKCGCIDLAVEAFEGMGKRNVVS-W 285 Query: 568 GALLSGCRSH 597 A++ H Sbjct: 286 SAMIGAYGVH 295 >gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [Prunus persica] Length = 541 Score = 368 bits (945), Expect = 2e-99 Identities = 182/293 (62%), Positives = 218/293 (74%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR+MLV G EPD FC+ T L ACA LG L QGKWI EYVK R L+ +D F+GTALVDM Sbjct: 187 VFRDMLVTGFEPDNFCVATGLAACAHLGALRQGKWIDEYVKKRTGLK-SDVFIGTALVDM 245 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG ID A E F MPKRN SWAA+IGG+A HGCA A LERMQ +DG+ PDG+ Sbjct: 246 YAKCGCIDLAVEAFEGMPKRNVVSWAAMIGGFAAHGCATNAIHSLERMQVDDGLRPDGVV 305 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +L VL AC HAGL+++G+ LL+NM++ YG+VP+HEHYSCV+DLLC+AG+L EA++LIR+M Sbjct: 306 LLVVLMACTHAGLLEKGKLLLDNMKTQYGIVPKHEHYSCVIDLLCKAGRLNEALKLIRKM 365 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALLSGCR HNNV AYVQLSNIYLGAR+ Sbjct: 366 PMKPLASVWGALLSGCRIHNNVDLAELAVKELLQLENDVRGEEVGAYVQLSNIYLGARRG 425 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 879 +DA RIR+MIGE G+KKTPG S +E+DG +EF+SGDVSHS I A+L L+ Sbjct: 426 EDAIRIRKMIGESGIKKTPGCSMIEVDGKVNEFVSGDVSHSHQAWICAMLDLI 478 Score = 102 bits (255), Expect = 2e-19 Identities = 59/188 (31%), Positives = 98/188 (52%) Frame 
= +1 Query: 34 PDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAE 213 PD + + ACA L G+ IH +V + L + D + TALV +YA+C +D ++ Sbjct: 96 PDNYTFNFVILACANCSWLVSGRQIHNWVV-KNGLFLVDAHVQTALVRLYAECKVLDDSK 154 Query: 214 EVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHA 393 +VF +P+R+ W ++ GY G A++A + M G PD V LAAC H Sbjct: 155 KVFDEIPERDVIQWNVLMNGYVRCGLASEALKVFRDMLV-TGFEPDNFCVATGLAACAHL 213 Query: 394 GLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGA 573 G ++QG+++ E ++ GL + + +VD+ + G + AVE MP + + S W A Sbjct: 214 GALRQGKWIDEYVKKRTGLKSDVFIGTALVDMYAKCGCIDLAVEAFEGMPKRNVVS-WAA 272 Query: 574 LLSGCRSH 597 ++ G +H Sbjct: 273 MIGGFAAH 280 >gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis] Length = 487 Score = 360 bits (925), Expect = 3e-97 Identities = 180/294 (61%), Positives = 216/294 (73%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR+ML G+E DE C TALTACAQ G L GKWIHEY++ R+ +D F+GTALVDM Sbjct: 172 VFRDMLKFGVELDECCAVTALTACAQSGALWWGKWIHEYIEKREGFE-SDVFVGTALVDM 230 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 Y KCG +D A EVF +MP RN FSWAAIIGG+AVHG +A +CLERMQA+DG+ PDG+ Sbjct: 231 YTKCGCLDMAVEVFEKMPTRNAFSWAAIIGGFAVHGQVMEAIRCLERMQADDGLKPDGVV 290 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVL AC HAGL K+GQ LL NMES YG++P+HEHYSCVVDLLCRAG+L EA +LIRRM Sbjct: 291 LLGVLTACTHAGLQKEGQLLLHNMESQYGILPKHEHYSCVVDLLCRAGKLREAYQLIRRM 350 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALLSGCR N V YVQLSNIYL A++ Sbjct: 351 PMKPLASVWGALLSGCRIRNYVDLAELAVKELVLLENEDKRGQDGVYVQLSNIYLAAQRT 410 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMS 882 +DA IR+MIG+KG++KTPG S VE+DG +EF+SGD+ HS +I +LYL+S Sbjct: 411 EDAVLIRKMIGDKGIRKTPGCSTVEVDGRVNEFVSGDIVHSCQAKICVMLYLLS 464 >ref|XP_002517451.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223543462|gb|EEF44993.1| pentatricopeptide 
repeat-containing protein, putative [Ricinus communis] Length = 428 Score = 357 bits (916), Expect = 4e-96 Identities = 173/293 (59%), Positives = 218/293 (74%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR M V+G+EPDEFC+TTAL ACA+ G L QGKWIHEYVK K D F+GTALVDM Sbjct: 117 VFRFMFVKGVEPDEFCVTTALAACAKSGALWQGKWIHEYVK--KTTLGFDVFIGTALVDM 174 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG+I+ A +VF MPKR+ FSWAA+IGGYA+HG A +A LERM AEDG+ PDG+ Sbjct: 175 YAKCGWINMAVQVFEEMPKRSAFSWAAMIGGYAIHGYAREAIHYLERMHAEDGLRPDGVV 234 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVL AC HAGL ++G+FLL+NM++ YG+VP HEHYSCVVDLLCRAG+ EA+ LI+RM Sbjct: 235 LLGVLTACTHAGLQEEGRFLLDNMKARYGIVPRHEHYSCVVDLLCRAGRWDEALALIKRM 294 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGA+LS CR+H N A+VQL NIYL + Sbjct: 295 PMKPLASVWGAVLSSCRTHKNAELAEFAVQELLQLENGNGNEEDAAFVQLWNIYLSTGRG 354 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 879 +DA +I R+IGE+GLKKTPG S +E++G+ +EF+SGDVS+ + ++HA+L L+ Sbjct: 355 EDASKIHRLIGERGLKKTPGCSMIEVNGMVNEFVSGDVSNKDVAQMHAILELL 407 Score = 63.5 bits (153), Expect = 1e-07 Identities = 41/150 (27%), Positives = 73/150 (48%) Frame = +1 Query: 148 DEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQ 327 D + TA+V +YAKC + A ++F + + + W ++ GY ++A + R Sbjct: 63 DGHIQTAVVRLYAKCKIMSDAHKMFDEIHRPDVIQWNVLMNGYIESNLESEALRVF-RFM 121 Query: 328 AEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQ 507 G+ PD V LAAC +G + QG+++ E ++ L + + +VD+ + G Sbjct: 122 FVKGVEPDEFCVTTALAACAKSGALWQGKWIHEYVKKT-TLGFDVFIGTALVDMYAKCGW 180 Query: 508 LGEAVELIRRMPMKPLASVWGALLSGCRSH 597 + AV++ MP K A W A++ G H Sbjct: 181 INMAVQVFEEMP-KRSAFSWAAMIGGYAIH 209 >gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao] Length = 499 Score = 356 bits (913), Expect = 8e-96 Identities = 174/293 (59%), Positives = 211/293 (72%) Frame 
= +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VF+ +LV GI+PDEFCLTTALTACAQ G L +GKWIHEY++ R+ D F+GTALVDM Sbjct: 201 VFKELLVFGIQPDEFCLTTALTACAQNGSLREGKWIHEYLRKREKCLELDVFIGTALVDM 260 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG +D A EVF M KRN +SWAA+IGG+AVHG A +A C ERMQ DGI PDG+ Sbjct: 261 YAKCGCLDLAVEVFEGMSKRNVYSWAAMIGGFAVHGHARKAIHCFERMQ-NDGIRPDGVV 319 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVL AC HAGL ++G FLL NME Y +VP+HEHYSCVVDLLCR G+ EA++LIRRM Sbjct: 320 LLGVLTACTHAGLAEEGLFLLNNMEGQYRIVPKHEHYSCVVDLLCRTGKFDEALKLIRRM 379 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PM+PLASVWGALL+ CR +NNV A VQLSNIY A++ Sbjct: 380 PMRPLASVWGALLNSCRIYNNVQLAELAVKELLELEDCDGDEEDAALVQLSNIYFSAQKS 439 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 879 +D RIRRMIG++GLKK PG S +E+DG EF+SGD+SH +IH +L L+ Sbjct: 440 EDGHRIRRMIGDRGLKKAPGCSMIEVDGRMTEFVSGDISHPLHSQIHTILRLL 492 >ref|NP_189507.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75273574|sp|Q9LJI9.1|PP260_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g28660 gi|9294280|dbj|BAB02182.1| unnamed protein product [Arabidopsis thaliana] gi|20259531|gb|AAM13885.1| unknown protein [Arabidopsis thaliana] gi|24030460|gb|AAN41382.1| unknown protein [Arabidopsis thaliana] gi|332643950|gb|AEE77471.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 354 bits (909), Expect = 2e-95 Identities = 166/298 (55%), Positives = 226/298 (75%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VF+ MLVRGIEPDEF +TTALTACAQ+G L QGKWIHE+VK ++ + +D F+GTALVDM Sbjct: 205 VFKEMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKRWIE-SDVFVGTALVDM 263 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A CL+R++ EDGI PD + Sbjct: 264 
YAKCGCIETAVEVFEKLTRRNVFSWAALIGGYAAYGYAKKATTCLDRIEREDGIKPDSVV 323 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVLAAC H G +++G+ +LENME+ YG+ P+HEHYSC+VDL+CRAG+L +A++LI +M Sbjct: 324 LLGVLAACAHGGFLEEGRTMLENMEARYGITPKHEHYSCIVDLMCRAGRLDDALDLIEKM 383 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALL+GCR+H NV A VQLSNIY ++ Sbjct: 384 PMKPLASVWGALLNGCRTHKNVELGELAVQNLLDLEKGNVEEEEAALVQLSNIYFSVQRN 443 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVS 894 +A ++R MI ++G++KTPG+S +E+DG+ +F+SGDVSH L +IH +++L+S++ S Sbjct: 444 PEAFKVRGMIEQRGIRKTPGWSLLEVDGIVTKFVSGDVSHPNLLQIHTLIHLLSVDAS 501 >ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Capsella rubella] gi|482559645|gb|EOA23836.1| hypothetical protein CARUB_v10017051mg [Capsella rubella] Length = 507 Score = 353 bits (905), Expect = 7e-95 Identities = 169/296 (57%), Positives = 220/296 (74%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR MLVRGIEPDEF +TTALTACAQ+G L QGKWIHE+VK +K ++ +D F+GTALVDM Sbjct: 208 VFREMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKKWVK-SDVFVGTALVDM 266 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A CL+RM+ ED I PD + Sbjct: 267 YAKCGCIETAVEVFEKLTRRNVFSWAALIGGYAAYGYAREAIMCLDRMEREDAIKPDSVV 326 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVLAAC H G +++G+ +L+NMES YG+ P+HEHYSC+VDL+CRAG+L A++LI +M Sbjct: 327 LLGVLAACAHGGFLQEGRSMLDNMESRYGITPKHEHYSCIVDLMCRAGRLDGALDLIEKM 386 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALL+GCR+H NV A VQLSNIY ++ Sbjct: 387 PMKPLASVWGALLNGCRTHKNVELGELAVKNLLELEKGNVDEEEAALVQLSNIYFSVQRN 446 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLE 888 +A +IR MI +KG+KK PG S +E+DG F+SGD+SH L +IH V++L+S++ Sbjct: 447 PEASKIRGMIDQKGIKKAPGCSVLEVDGDVTRFVSGDLSHPNLLQIHTVIHLLSVD 502 >ref|XP_002877120.1| 
pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322958|gb|EFH53379.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 399 Score = 351 bits (901), Expect = 2e-94 Identities = 165/296 (55%), Positives = 222/296 (75%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR MLVRG+EPDEF +TTALTACAQ+G L QGKWIHE+VK ++ + +D F+GTALVDM Sbjct: 100 VFREMLVRGVEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKRWIE-SDVFVGTALVDM 158 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+ A EVF ++ +RN FSWAA+IGGYA +G A +A CL+RM+ EDGI PD + Sbjct: 159 YAKCGCIEMAVEVFEKLSRRNVFSWAALIGGYAAYGYAKKAMTCLDRMEREDGIKPDSVV 218 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVLAAC H G +++G+ +L NME+ YG+ P+HEHYSC+VDL+CRAG+L +A++LI +M Sbjct: 219 LLGVLAACAHGGFLQEGRAMLGNMEARYGITPKHEHYSCIVDLMCRAGRLDDALDLIEKM 278 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALL+GCR+H NV A VQLSNIY ++ Sbjct: 279 PMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNAEEEEAALVQLSNIYFSVQRN 338 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLE 888 +A ++R MI ++G++KTPG+S +E+DG +F+SGDVSH L +IH V++L+S++ Sbjct: 339 PEASKVRGMIEQRGIRKTPGWSVLEVDGNVTKFVSGDVSHPNLLQIHTVIHLLSVD 394 Score = 65.5 bits (158), Expect = 3e-08 Identities = 44/186 (23%), Positives = 86/186 (46%) Frame = +1 Query: 28 IEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDT 207 I P + AC + GK IH +V + + ++D + T ++ +Y + + Sbjct: 7 IAPSYLTFYFLIVACFKACLFSVGKQIHCWVV-KNGVFLSDGHVQTGILRIYVEDKVLLD 65 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A +VF +PK + W ++ GY G ++ + M G+ PD +V L AC Sbjct: 66 AHKVFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVR-GVEPDEFSVTTALTACA 124 Query: 388 HAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVW 567 G + QG+++ E ++ + + + +VD+ + G + AVE+ ++ + + S W Sbjct: 125 
QVGALAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNVFS-W 183 Query: 568 GALLSG 585 AL+ G Sbjct: 184 AALIGG 189 >ref|NP_189505.2| putative pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75273576|sp|Q9LJJ1.1|PP259_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At3g28640 gi|9294278|dbj|BAB02180.1| unnamed protein product [Arabidopsis thaliana] gi|332643948|gb|AEE77469.1| putative pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 350 bits (898), Expect = 4e-94 Identities = 166/296 (56%), Positives = 222/296 (75%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR MLV+G+EPDEF +TTALTACAQ+G L QGKWIHE+VK +K+ +D F+GTALVDM Sbjct: 205 VFREMLVKGLEPDEFSVTTALTACAQVGALAQGKWIHEFVK-KKSWIESDVFVGTALVDM 263 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A CLER++ EDGI PD + Sbjct: 264 YAKCGCIETAVEVFKKLTRRNVFSWAALIGGYAAYGYAKKAMTCLERLEREDGIKPDSVV 323 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +LGVLAAC H G +++G+ +LENME+ Y + P+HEHYSC+VDL+CRAG+L +A+ LI +M Sbjct: 324 LLGVLAACAHGGFLEEGRSMLENMEARYEITPKHEHYSCIVDLMCRAGRLDDALNLIEKM 383 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALL+GCR+H NV A VQLSNIY ++ Sbjct: 384 PMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNVEEEEAALVQLSNIYFSVQRN 443 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLE 888 +A ++R MI ++G++KTPG+S +E+DG +F+SGDVSH L +IH V++L+S++ Sbjct: 444 PEASKVRGMIEQRGVRKTPGWSVLEVDGNVTKFVSGDVSHPNLLQIHTVIHLLSVD 499 >ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutrema salsugineum] gi|557091992|gb|ESQ32639.1| hypothetical protein EUTSA_v10005682mg [Eutrema salsugineum] Length = 505 Score = 339 bits (870), Expect = 8e-91 Identities = 165/296 (55%), Positives = 217/296 (73%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 
VFR ML RG EPD+F +TTALTACAQ+G L QGK IH+ +K +K L +D ++GTALVDM Sbjct: 206 VFREMLARGTEPDKFSVTTALTACAQVGALAQGKLIHKLLKKKKLLE-SDIYVGTALVDM 264 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG I+TA EVF + +RN FSWA +IGGYA +G A +A CL++M+ EDGI PD + Sbjct: 265 YAKCGCIETALEVFENLSRRNVFSWAVLIGGYAAYGYAKKAIMCLDQMEREDGIKPDSVV 324 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 +L VLAAC H G +++G+ LL+NME+ YG+ P+HEHYSC+VDL+CRAG+L +AV+LI M Sbjct: 325 LLTVLAACAHGGFLQEGRALLDNMEARYGITPKHEHYSCIVDLICRAGRLDDAVDLIEGM 384 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PMKPLASVWGALL+GCR+H NV A VQLSNIYL A++ Sbjct: 385 PMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNADEEEAALVQLSNIYLIAQRN 444 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLE 888 +A IRRMIG+KG++K PG S +E+DG +F+SGDVSH L +IH +++L+S++ Sbjct: 445 TEASNIRRMIGQKGIRKAPGCSVLEVDGNVTKFVSGDVSHQNLLQIHTMIHLLSVD 500 >gb|EMT01880.1| hypothetical protein F775_14784 [Aegilops tauschii] Length = 426 Score = 305 bits (781), Expect = 2e-80 Identities = 151/303 (49%), Positives = 199/303 (65%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 +FR M V G+ PD LTTA+ ACAQ G L+ G+W+H YV++ +AD F+G+ALV M Sbjct: 101 LFRAMFVDGVAPDAVVLTTAVAACAQSGALECGEWVHRYVESNAPGLLADAFVGSALVSM 160 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG + A VF MP+RN++ W ++G +AVHG A +A CLERM EDG+ PDG+A Sbjct: 161 YAKCGCLQEAVRVFEGMPERNEYVWGTMVGAFAVHGMAREAVACLERMAGEDGVRPDGVA 220 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 VLG L+AC HAG V+ G LL+ M YG+ P HEHYSC VD+LCR G+L +AV LI M Sbjct: 221 VLGALSACAHAGKVEDGLRLLKEMRRRYGVTPGHEHYSCTVDMLCRVGRLEDAVGLIGTM 280 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PM PLASVWG+LL+GCR + NV YVQLSNIYL A + Sbjct: 281 PMTPLASVWGSLLAGCRMYGNV---KLAEVAAKELEKLGVGADEGVYVQLSNIYLDANRK 337 Query: 721 
DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSID 900 DDARR+R++IG +GLKK P YSAVE+DG F++ D +H R EI +L L++ ++ + Sbjct: 338 DDARRVRKLIGSRGLKKVPAYSAVEVDGELSSFVADDQAHPRRFEIWDLLGLLADQMGLK 397 Query: 901 QNK 909 ++ Sbjct: 398 SDE 400 Score = 68.2 bits (165), Expect = 4e-09 Identities = 57/191 (29%), Positives = 90/191 (47%), Gaps = 3/191 (1%) Frame = +1 Query: 34 PDEFCLTTALTACAQLGDLDQ-GKWIHEY-VKNRKNLRVADEFLGTALVDMYAKCGYIDT 207 PD AL+A A G +H VKN L +D ++ TAL+ ++A D Sbjct: 11 PDHLSFPFALSAAAAAPVAPPPGPQLHALLVKNA--LFPSDHYVTTALLQLHAPRP--DD 66 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A VF +P+R + +IG YA G A + L R DG+ PD + + +AAC Sbjct: 67 ARRVFDELPRREAIHYDLVIGAYARAGMAAEGL-ALFRAMFVDGVAPDAVVLTTAVAACA 125 Query: 388 HAGLVKQGQFLLENMES-LYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASV 564 +G ++ G+++ +ES GL+ + S +V + + G L EAV + MP + V Sbjct: 126 QSGALECGEWVHRYVESNAPGLLADAFVGSALVSMYAKCGCLQEAVRVFEGMPERN-EYV 184 Query: 565 WGALLSGCRSH 597 WG ++ H Sbjct: 185 WGTMVGAFAVH 195 >ref|NP_001168258.1| hypothetical protein [Zea mays] gi|223947079|gb|ACN27623.1| unknown [Zea mays] gi|413942312|gb|AFW74961.1| hypothetical protein ZEAMMB73_025514 [Zea mays] Length = 506 Score = 303 bits (777), Expect = 5e-80 Identities = 155/304 (50%), Positives = 200/304 (65%), Gaps = 1/304 (0%) Frame = +1 Query: 1 VFRNMLVR-GIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVD 177 VFR ML G+ PD LTTA+ ACAQ G LD G W+H YV+ +AD FLG+AL+ Sbjct: 193 VFRTMLDDDGVTPDAVVLTTAVAACAQAGALDLGAWVHRYVERAAPGLLADAFLGSALIG 252 Query: 178 MYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGI 357 MYAKCG ++ A VF MP+RN + WA ++G AVHG A +A CLERM EDG+ PDG+ Sbjct: 253 MYAKCGCLEEAVRVFDGMPERNAYVWATMVGALAVHGMAAEAVACLERMAREDGVRPDGV 312 Query: 358 AVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRR 537 AVLG L+AC HAG V++G LL M YG+VP HEHYSC VD+LCR G+L +AV L++ Sbjct: 313 AVLGALSACAHAGDVEEGLRLLGEMRPRYGVVPGHEHYSCAVDMLCRVGRLEDAVGLVKI 372 Query: 538 
MPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQ 717 MPM PLASVWG++L+GCRSH NV YVQLSNIYL A + Sbjct: 373 MPMAPLASVWGSVLAGCRSHGNV---ELAEVAARELERLGGSADEGVYVQLSNIYLDANR 429 Query: 718 YDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSI 897 DDARR+R++IG +G+KK P YSA+E+DG F++ D +H R EI VL L++ +++ Sbjct: 430 KDDARRVRKLIGGRGIKKLPAYSALEVDGEVSSFVADDQAHPRRFEIWEVLRLLADQMAQ 489 Query: 898 DQNK 909 N+ Sbjct: 490 KPNE 493 Score = 70.1 bits (170), Expect = 1e-09 Identities = 54/198 (27%), Positives = 87/198 (43%), Gaps = 10/198 (5%) Frame = +1 Query: 34 PDEFCLTTALTACAQL---------GDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYA 186 PD AL+A A L G +H + R L AD ++ TAL+ +Y+ Sbjct: 95 PDHLSFPFALSAAAALDANPSSSAAAGTAPGPQLHALLV-RNALFPADHYVTTALLQLYS 153 Query: 187 KCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVL 366 D A VF +P+R + +IG YA G + M +DG+ PD + + Sbjct: 154 PRP--DLARRVFDELPRREAIHYDLLIGSYARAGAPAEGLSVFRTMLDDDGVTPDAVVLT 211 Query: 367 GVLAACKHAGLVKQGQFLLENME-SLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMP 543 +AAC AG + G ++ +E + GL+ + S ++ + + G L EAV + MP Sbjct: 212 TAVAACAQAGALDLGAWVHRYVERAAPGLLADAFLGSALIGMYAKCGCLEEAVRVFDGMP 271 Query: 544 MKPLASVWGALLSGCRSH 597 + A VW ++ H Sbjct: 272 ERN-AYVWATMVGALAVH 288 >gb|EMS68422.1| hypothetical protein TRIUR3_23855 [Triticum urartu] Length = 425 Score = 303 bits (776), Expect = 6e-80 Identities = 152/310 (49%), Positives = 200/310 (64%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 +FR M G+ PD LTTA+ ACAQ G L+ G+W+H YV++ +AD F+G+ALV M Sbjct: 101 LFRAMFADGVAPDAVVLTTAIAACAQSGALECGEWVHRYVESNAPGLLADAFVGSALVSM 160 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG + A VF MP+RN++ W ++G +AVHG A +A CLERM EDG+ PDG+A Sbjct: 161 YAKCGCLQEAVRVFEGMPERNEYVWGTMVGAFAVHGMAREAVACLERMAGEDGVRPDGVA 220 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 VLG L+AC HAG V+ G LL+ M YG+ P HEH+SC VD+LCR G+L +AV LI M Sbjct: 221 
VLGALSACAHAGKVEDGLRLLKEMRRRYGVTPGHEHFSCTVDMLCRVGRLEDAVGLIGTM 280 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PM PLASVWG+LL+GCR + NV YVQLSNIYL A + Sbjct: 281 PMTPLASVWGSLLAGCRMYGNV---ELAEVAAKELEKLGMGADEGVYVQLSNIYLDANRK 337 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSID 900 DDARR+R++IG +GLKK P YSAVEIDG F++ D +H R EI +L L++ ++ Sbjct: 338 DDARRVRKLIGNRGLKKVPAYSAVEIDGELSSFVADDQAHPRRFEIWDLLGLLADQMGRK 397 Query: 901 QNK*RRHETS 930 ++ ET+ Sbjct: 398 PDEEEEEETT 407 Score = 68.2 bits (165), Expect = 4e-09 Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 1/152 (0%) Frame = +1 Query: 145 ADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERM 324 +D ++ TAL+ ++A D A VF +P+R + +IG YA G A + M Sbjct: 48 SDHYVTTALLQLHAPRP--DDARRVFDELPRREAIHYDLVIGAYARAGMAAEGLALFRAM 105 Query: 325 QAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMES-LYGLVPEHEHYSCVVDLLCRA 501 A DG+ PD + + +AAC +G ++ G+++ +ES GL+ + S +V + + Sbjct: 106 FA-DGVAPDAVVLTTAIAACAQSGALECGEWVHRYVESNAPGLLADAFVGSALVSMYAKC 164 Query: 502 GQLGEAVELIRRMPMKPLASVWGALLSGCRSH 597 G L EAV + MP + VWG ++ H Sbjct: 165 GCLQEAVRVFEGMPERN-EYVWGTMVGAFAVH 195 >ref|XP_004981863.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g28640-like [Setaria italica] Length = 495 Score = 302 bits (773), Expect = 1e-79 Identities = 151/293 (51%), Positives = 188/293 (64%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR M G+ PD LTTA+ ACAQ G LD G W+H YV+ + D F+G+ALV M Sbjct: 188 VFRAMFEDGVAPDAVVLTTAVAACAQAGALDCGAWVHRYVERAAPGLLGDAFVGSALVSM 247 Query: 181 YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG +D A VF MP+RN++ W ++G +AVHG A +A CLERM EDG+ PDG+A Sbjct: 248 YAKCGCLDEAVRVFDGMPERNEYVWGTMVGAFAVHGMAAEAVACLERMAREDGVRPDGVA 307 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 VLG L+AC HAG V G LL M YG+ P HEHYSC VD+LCR G+L +AV LI M Sbjct: 308 
VLGALSACAHAGKVDDGLRLLREMRGRYGVAPGHEHYSCTVDMLCRVGRLEDAVGLIGTM 367 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PM PL SVWG++L+GCRS+ NV YVQLSNIYL A + Sbjct: 368 PMAPLESVWGSVLAGCRSYGNV---ELAEVAARELEKLGGTADEGVYVQLSNIYLDANRK 424 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 879 DDARR+R++IG +G+KK P YSAVE+DG F++ D +H R EI VL L+ Sbjct: 425 DDARRVRKLIGSRGIKKAPAYSAVEVDGEVSSFVADDQAHPRCFEIWEVLRLL 477 Score = 68.6 bits (166), Expect = 3e-09 Identities = 55/195 (28%), Positives = 88/195 (45%), Gaps = 7/195 (3%) Frame = +1 Query: 34 PDEFCLTTALTACAQL------GDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCG 195 PD AL+A A + D G +H + R L D ++ TAL+ +YA Sbjct: 93 PDHLSFPFALSAAAAVDAPDSSSDAGAGAQLHALLV-RNALFPVDHYVTTALLQLYAPRP 151 Query: 196 YIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVL 375 + A VF +P+R + +IG YA G + + R EDG+ PD + + + Sbjct: 152 --ELARRVFDELPRREAIHYDLVIGAYARAGMPAEGL-AVFRAMFEDGVAPDAVVLTTAV 208 Query: 376 AACKHAGLVKQGQFLLENME-SLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKP 552 AAC AG + G ++ +E + GL+ + S +V + + G L EAV + MP + Sbjct: 209 AACAQAGALDCGAWVHRYVERAAPGLLGDAFVGSALVSMYAKCGCLDEAVRVFDGMPERN 268 Query: 553 LASVWGALLSGCRSH 597 VWG ++ H Sbjct: 269 -EYVWGTMVGAFAVH 282 >ref|NP_001173276.1| Os03g0158900 [Oryza sativa Japonica Group] gi|22773237|gb|AAN06843.1| Hypothetical protein [Oryza sativa Japonica Group] gi|108706287|gb|ABF94082.1| pentatricopeptide, putative, expressed [Oryza sativa Japonica Group] gi|125584986|gb|EAZ25650.1| hypothetical protein OsJ_09480 [Oryza sativa Japonica Group] gi|255674223|dbj|BAH92004.1| Os03g0158900 [Oryza sativa Japonica Group] Length = 490 Score = 300 bits (768), Expect = 5e-79 Identities = 145/294 (49%), Positives = 194/294 (65%) Frame = +1 Query: 1 VFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDM 180 VFR M V G+ PD LTTA+ ACAQ G L+ G+W+H YV+ + D F+G+ALV M Sbjct: 185 VFRAMFVDGVAPDAVVLTTAIAACAQAGALECGEWVHRYVEASAPWLLGDAFVGSALVSM 244 Query: 181 
YAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIA 360 YAKCG ++ A VF MP+RN + W ++G +AVHG A +A CL+RM EDG+ PDG+A Sbjct: 245 YAKCGCLEQAVRVFDGMPERNDYVWGTMVGAFAVHGMAEEAVSCLDRMAREDGVRPDGVA 304 Query: 361 VLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRM 540 VLG L+AC HAG V+ G LL+ M YG+ P HEHY+C VD+LCR G+L +AV LI M Sbjct: 305 VLGALSACAHAGKVEDGLRLLKEMRRRYGVAPGHEHYACTVDMLCRVGRLEDAVALIETM 364 Query: 541 PMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQY 720 PM PLASVWG++L+GCR++ NV YVQLSNIYL + + Sbjct: 365 PMAPLASVWGSVLTGCRTYANV-----ELAEVAAAELGKLGADEGVYVQLSNIYLDSNRK 419 Query: 721 DDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMS 882 DDARR+R++IG +G++K P YSAVE+DG+ F++ D +H + EI VL L++ Sbjct: 420 DDARRVRKLIGSRGIRKVPAYSAVEVDGVVRSFVADDQAHPQRVEIWEVLGLLA 473 Score = 65.5 bits (158), Expect = 3e-08 Identities = 49/189 (25%), Positives = 84/189 (44%), Gaps = 1/189 (0%) Frame = +1 Query: 34 PDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAE 213 PD AL+A A + G +H + + +D ++ TAL+ + D A Sbjct: 95 PDHLSFPFALSAAATVSP-SPGAQLHALLVKNGHFP-SDHYVTTALLQLQLHAARPDDAR 152 Query: 214 EVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHA 393 VF +P+R + +IG Y G A + M DG+ PD + + +AAC A Sbjct: 153 RVFDELPRREAIHYDLVIGAYTRTGMAGEGLGVFRAMFV-DGVAPDAVVLTTAIAACAQA 211 Query: 394 GLVKQGQFLLENME-SLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWG 570 G ++ G+++ +E S L+ + S +V + + G L +AV + MP + VWG Sbjct: 212 GALECGEWVHRYVEASAPWLLGDAFVGSALVSMYAKCGCLEQAVRVFDGMPERN-DYVWG 270 Query: 571 ALLSGCRSH 597 ++ H Sbjct: 271 TMVGAFAVH 279 >ref|XP_002439151.1| hypothetical protein SORBIDRAFT_09g001360 [Sorghum bicolor] gi|190688727|gb|ACE86390.1| pentatricopeptide (PPR) repeat-containing protein [Sorghum bicolor] gi|241944436|gb|EES17581.1| hypothetical protein SORBIDRAFT_09g001360 [Sorghum bicolor] Length = 517 Score = 299 bits (766), Expect = 9e-79 Identities = 148/285 (51%), Positives = 187/285 (65%) Frame = +1 Query: 28 IEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDT 
207 + PD LTTA+ ACAQ G LD G W+H YV+ +AD FLG+ALV MYAKCG +D Sbjct: 205 VVPDAVVLTTAVAACAQAGALDHGAWVHRYVERTAPGLLADAFLGSALVGMYAKCGCLDD 264 Query: 208 AEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACK 387 A VF MP+RN + W ++G AVHG A +A CL+RM EDG+ PDG+AVLG L+AC Sbjct: 265 AVRVFDGMPERNAYVWGTMVGALAVHGMAAEAVACLDRMAVEDGVRPDGVAVLGALSACA 324 Query: 388 HAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVW 567 HAG V++G LL M YG+VP HEHYSC VD+LCR G+L +AV L+ MPM PLASVW Sbjct: 325 HAGNVEEGLCLLREMRPRYGVVPGHEHYSCAVDMLCRVGRLEDAVGLVETMPMAPLASVW 384 Query: 568 GALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRM 747 G++L+GCRS+ NV YVQLSNIYL A + DDARR+R++ Sbjct: 385 GSVLAGCRSYGNV---ELAEVAVRELEKLGGTADEGVYVQLSNIYLDANRKDDARRVRKL 441 Query: 748 IGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMS 882 IG +G+KK P YSAVE+DG F++ D +H R EI VL L++ Sbjct: 442 IGSRGIKKVPAYSAVEVDGEVSSFVADDQAHPRRFEIWEVLRLLA 486 Score = 69.3 bits (168), Expect = 2e-09 Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 10/198 (5%) Frame = +1 Query: 34 PDEFCLTTALTACAQL-------GDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKC 192 PD AL+A A L G +H + R L AD ++ TAL+ +Y+ Sbjct: 97 PDHLSFPFALSAAAALDATTPSSSSYSTGPQLHALLV-RNALFPADHYVTTALLQLYSP- 154 Query: 193 GYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDG--IMPDGIAVL 366 + D A VF +P+R + +IG YA G + M +D ++PD + + Sbjct: 155 -HPDLARRVFDELPRREAIHYDLLIGSYARAGAPTEGLAVFRAMFDDDDGVVVPDAVVLT 213 Query: 367 GVLAACKHAGLVKQGQFLLENME-SLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMP 543 +AAC AG + G ++ +E + GL+ + S +V + + G L +AV + MP Sbjct: 214 TAVAACAQAGALDHGAWVHRYVERTAPGLLADAFLGSALVGMYAKCGCLDDAVRVFDGMP 273 Query: 544 MKPLASVWGALLSGCRSH 597 + A VWG ++ H Sbjct: 274 ERN-AYVWGTMVGALAVH 290