BLASTX nr result
ID: Sinomenium21_contig00027298
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00027298
         (1350 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002282464.1| PREDICTED: pentatricopeptide repeat-containi...  447   e-123
emb|CAN80315.1| hypothetical protein VITISV_020760 [Vitis vinifera]  444   e-122
ref|XP_007225613.1| hypothetical protein PRUPE_ppa003215mg [Prun...  431   e-118
ref|XP_006483319.1| PREDICTED: pentatricopeptide repeat-containi...  417   e-114
ref|XP_006450490.1| hypothetical protein CICLE_v10007521mg [Citr...  415   e-113
ref|XP_004289385.1| PREDICTED: pentatricopeptide repeat-containi...  409   e-111
ref|XP_007041021.1| Tetratricopeptide repeat-like superfamily pr...  407   e-111
gb|AAT66765.1| Putative selenium-binding protein, related [Solan...  404   e-110
ref|XP_004293010.1| PREDICTED: pentatricopeptide repeat-containi...  404   e-110
ref|XP_006355128.1| PREDICTED: pentatricopeptide repeat-containi...  402   e-109
ref|XP_004238807.1| PREDICTED: pentatricopeptide repeat-containi...  397   e-108
gb|EXB93974.1| hypothetical protein L484_015521 [Morus notabilis]    396   e-107
ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containi...  394   e-107
ref|XP_007159402.1| hypothetical protein PHAVU_002G235300g [Phas...  393   e-107
ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  390   e-106
ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containi...  388   e-105
ref|XP_003524191.2| PREDICTED: pentatricopeptide repeat-containi...  386   e-104
gb|EYU21282.1| hypothetical protein MIMGU_mgv11b020544mg [Mimulu...
382 e-103 ref|XP_002529936.1| pentatricopeptide repeat-containing protein,... 382 e-103 ref|XP_002320276.2| hypothetical protein POPTR_0014s11190g [Popu... 377 e-102 >ref|XP_002282464.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580 [Vitis vinifera] Length = 807 Score = 447 bits (1149), Expect = e-123 Identities = 246/469 (52%), Positives = 317/469 (67%), Gaps = 35/469 (7%) Frame = +3 Query: 48 NQNGFSGGNTGDFRQNQSDIY-TGEFQQNLF--------GQNRN-------------FNT 161 N NG+ G N G+ Q +D Y QN + QNRN N Sbjct: 231 NINGYCGQNYGESLQKSNDFYGQNRNVQNSYYSEGRAEVNQNRNGNCQQIISETLGDLNR 290 Query: 162 FHQENPKELYRYPVGFHQQNPIGH--SKNPQWQVSYGTYREDPRVAQRGPSMLCGQ---- 323 + EN ++ + P G+H++N + S+N ++ + G Y+++P V Q + GQ Sbjct: 291 TYGENIRQFQQSPSGYHRENLQQYQPSENMYYRENVGQYQQNPNVGQYQQNPNIGQYQQN 350 Query: 324 -NSLEFQRTQNG---YYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAGVFQESTN 491 N ++Q+ N NV+++ S FQ + SP+ S+N Sbjct: 351 PNVAQYQQNPNVAQYQQNPNVAQYQTN----------SNEFQNSMVGSPK-------SSN 393 Query: 492 VKLDSES---AETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKA 662 K D ES AE++Q+ GT+EE+ FCK+GKVKEA+EVLGLL+KQ ++L Y +LMKA Sbjct: 394 YKPDGESLEAAESSQYSGTLEEVDDFCKDGKVKEAIEVLGLLEKQHTPVDLPRYLRLMKA 453 Query: 663 CGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTW 842 CGEA +LQEAKAVH+ LI+S+ +KV+ YN+ILEMY+KC SM DA VF++MPERNLT+W Sbjct: 454 CGEAKALQEAKAVHESLIKSVSPLKVSTYNRILEMYSKCGSMDDAYAVFKKMPERNLTSW 513 Query: 843 DTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSK 1022 DTMIT AKN LGE+AIDLF QFKE+GLKPDGQMF GVF ACSVLGD+ EG+LHF SMSK Sbjct: 514 DTMITWFAKNDLGEEAIDLFIQFKESGLKPDGQMFIGVFMACSVLGDVIEGMLHFNSMSK 573 Query: 1023 VYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGD 1202 YGI PSM+HY S+VDMLG++GY+DEA+EF+EKMP EPSV+VWET+MN+CRV GN E+GD Sbjct: 574 DYGIVPSMKHYASMVDMLGNSGYLDEALEFVEKMPLEPSVDVWETLMNICRVQGNMEIGD 633 Query: 1203 HCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 CA+LVE+L+PSRLT QSKAGLVPVK SD+ KEKEKKKL+ NLLEVRS Sbjct: 634 RCAELVEHLEPSRLTEQSKAGLVPVKASDLEKEKEKKKLASQNLLEVRS 682 
>emb|CAN80315.1| hypothetical protein VITISV_020760 [Vitis vinifera] Length = 1148 Score = 444 bits (1142), Expect = e-122 Identities = 245/469 (52%), Positives = 316/469 (67%), Gaps = 35/469 (7%) Frame = +3 Query: 48 NQNGFSGGNTGDFRQNQSDIY-TGEFQQNLF--------GQNRN-------------FNT 161 N NG+ G N G+ Q +D Y QN + QNRN N Sbjct: 231 NINGYCGQNYGESLQKSNDFYGQNRNVQNSYYSEGRAEVNQNRNGNCQQIISETLGDLNR 290 Query: 162 FHQENPKELYRYPVGFHQQNPIGH--SKNPQWQVSYGTYREDPRVAQRGPSMLCGQ---- 323 + EN ++ + P G+H++N + S+N ++ + G Y+++P V Q + GQ Sbjct: 291 TYGENIRQFQQSPSGYHRENLQQYQPSENMYYRENVGQYQQNPNVGQYQQNPNIGQYQQN 350 Query: 324 -NSLEFQRTQNG---YYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAGVFQESTN 491 N ++Q+ N NV+++ S FQ + SP+ S+N Sbjct: 351 PNVAQYQQNPNVAQYQQNPNVAQYQTN----------SNEFQNSMVGSPK-------SSN 393 Query: 492 VKLDSES---AETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKA 662 K D ES AE++Q+ GT+EE+ FCK+GKVKEA+EVLGLL+KQ ++L Y +LMKA Sbjct: 394 YKPDGESLEAAESSQYSGTLEEVDDFCKDGKVKEAIEVLGLLEKQHTPVDLPRYLRLMKA 453 Query: 663 CGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTW 842 CGEA +LQEAKAVH+ LI+S+ +KV+ YN+ILEMY+KC SM DA VF++MPERNLT+W Sbjct: 454 CGEAKALQEAKAVHESLIKSVSPLKVSTYNRILEMYSKCGSMDDAYAVFKKMPERNLTSW 513 Query: 843 DTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSK 1022 DTMIT AKN LGE+AIDLF QFKE+GLKPD QMF GVF ACSVLGD+ EG+LHF SMSK Sbjct: 514 DTMITWFAKNDLGEEAIDLFIQFKESGLKPDXQMFIGVFMACSVLGDVIEGMLHFNSMSK 573 Query: 1023 VYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGD 1202 YGI PSM+HY S+VDMLG++GY+DEA+EF+EKMP EPSV+VWET+MN+CRV GN E+GD Sbjct: 574 DYGIVPSMKHYASMVDMLGNSGYLDEALEFVEKMPLEPSVDVWETLMNICRVQGNMEIGD 633 Query: 1203 HCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 CA+LVE+L+PSRLT QSKAGLVPVK SD+ KEKEKKKL+ NLLEVRS Sbjct: 634 RCAELVEHLEPSRLTEQSKAGLVPVKASDLEKEKEKKKLASQNLLEVRS 682 >ref|XP_007225613.1| hypothetical protein PRUPE_ppa003215mg [Prunus persica] gi|462422549|gb|EMJ26812.1| hypothetical protein PRUPE_ppa003215mg [Prunus persica] Length = 592 Score = 431 bits (1107), 
Expect = e-118 Identities = 250/460 (54%), Positives = 303/460 (65%), Gaps = 11/460 (2%) Frame = +3 Query: 3 ENPRAV-QGGPSEFPHNQNGFS-GGNTGDFRQNQSDIYTGEF---QQNLFGQ-NRNFNTF 164 ENP A Q G E N N F GN G F+ N + Y F QQNL G RN Sbjct: 39 ENPYASRQEGSIEVRQNPNAFGLQGNLG-FQGNLNQNYIQHFAQNQQNLNGYYTRNDVMR 97 Query: 165 HQENPKELYRY--PVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEF 338 HQ + Y+ G +QQNPI P SYG Y + P CGQ + Sbjct: 98 HQNSSYGQYQQNPSCGQYQQNPIYGQNQPN--PSYGKYHQAPS---------CGQ----Y 142 Query: 339 QRTQNGYYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPR---EAGVFQESTNVKLDSE 509 Q+ Y G S+ G +Q NP ++ V ES + + E Sbjct: 143 QQAPTSY--GQQSQHV-------------GQYQTNPDPFQNTIVDSQVASESKSERKLIE 187 Query: 510 SAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQE 689 ++E++ + GT+EEL FCKEGKVKEAVE+LG+L+KQ ++++L YFQLM+ACGEA +L+E Sbjct: 188 ASESSPYSGTLEELDKFCKEGKVKEAVEILGMLEKQQVQVDLHLYFQLMQACGEAKALEE 247 Query: 690 AKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAK 869 AK VH+++ R L + V+ YN+ILEMY+KC SM VF +MP RNLT+WD MI LAK Sbjct: 248 AKFVHENITRLLSPLNVSTYNRILEMYSKCGSMDSTFMVFNQMPNRNLTSWDIMIAWLAK 307 Query: 870 NGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSME 1049 NGLGEDAIDLFT+FK+AGLKPDGQMF GVF ACSVLGD EG+LHFESMSK YGI PSM+ Sbjct: 308 NGLGEDAIDLFTEFKKAGLKPDGQMFIGVFYACSVLGDTTEGLLHFESMSKDYGIVPSMD 367 Query: 1050 HYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYL 1229 HYVSVVDMLGSTGY++EA+EFIEKMP EP+V+VW+T+MNLCRVHG ELGD CA+LVE L Sbjct: 368 HYVSVVDMLGSTGYLEEALEFIEKMPLEPNVDVWKTLMNLCRVHGQLELGDRCAELVEQL 427 Query: 1230 DPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 D S L QSKAGLVPVK SD+ KEKEKKKL+ NLLEVRS Sbjct: 428 DASSLNEQSKAGLVPVKDSDLVKEKEKKKLAAQNLLEVRS 467 >ref|XP_006483319.1| PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Citrus sinensis] Length = 773 Score = 417 bits (1072), Expect = e-114 Identities = 239/483 (49%), Positives = 307/483 (63%), Gaps = 49/483 (10%) Frame = +3 Query: 48 NQNGFSGGNTGDFRQNQSDIYT-----------GEFQQNLFGQNRNFNTFHQENPKELYR 194 N 
NG G ++ F+QN ++ Y EFQ N F QN NFN + N + + Sbjct: 170 NSNGIYGESSRGFQQNSNEFYQHHAGVNSESHINEFQNNTFQQNGNFNDYGWYNNGQPHP 229 Query: 195 YPVGFH-QQNPIGHSKNPQWQVSYGTYREDPRV---------AQRGPSML------CGQN 326 G Q + G + N Q Q YG P +QR L C Q+ Sbjct: 230 NLSGAQPQMSRSGGNANVQNQ--YGPIHYGPGEVMQNRNGFNSQRFSESLGSFNGNCMQD 287 Query: 327 SLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXXSGG----------FQQNPTWSPREA--- 467 + + Q+ +G+Y GN +GG +QQNP ++ Sbjct: 288 TGQHQQALSGHYSGNFG--IHQNSPSFYQQDQNGGQYQWDQSRRQYQQNPNEGQYQSYSG 345 Query: 468 ----GVF--QESTNVKLDSESAE---TNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQG 620 G+ Q N K + + AE ++Q+ GT+E+L KEGKVKEA+EVLGLL+KQ Sbjct: 346 NIQNGMMASQVLNNCKHEDDFAEASRSSQNNGTLEQLDGLVKEGKVKEAIEVLGLLEKQC 405 Query: 621 IKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDAC 800 I ++L + QLM+ACG+A +L+EAKAVH+H+ R L ++V+ YN IL+MY++C SM DA Sbjct: 406 ISVDLPTFSQLMQACGDAKALEEAKAVHEHVERLLSPLRVSTYNGILKMYSECDSMDDAF 465 Query: 801 EVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLG 980 VF M ER+LT+WDTMITG AKNGLGEDA+D+F+QFK+AGLKPD Q+F GVFSACS LG Sbjct: 466 SVFSNMTERDLTSWDTMITGFAKNGLGEDAVDIFSQFKQAGLKPDDQIFIGVFSACSALG 525 Query: 981 DIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETM 1160 D+ EG+LHFESMSK YGI PSM+HYVS+VDMLGSTGY+DEA+EFIEKMP EP V+VWE + Sbjct: 526 DVVEGMLHFESMSKDYGIVPSMKHYVSIVDMLGSTGYLDEALEFIEKMPMEPDVDVWEKL 585 Query: 1161 MNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLE 1340 MNLCR+HGN ELGD CA++VE LDPSRL +SKAGLVPV S++AKEKE KKL+ NLLE Sbjct: 586 MNLCRMHGNLELGDRCAEIVEQLDPSRLNEKSKAGLVPVNASELAKEKENKKLASQNLLE 645 Query: 1341 VRS 1349 VRS Sbjct: 646 VRS 648 >ref|XP_006450490.1| hypothetical protein CICLE_v10007521mg [Citrus clementina] gi|557553716|gb|ESR63730.1| hypothetical protein CICLE_v10007521mg [Citrus clementina] Length = 773 Score = 415 bits (1066), Expect = e-113 Identities = 238/483 (49%), Positives = 307/483 (63%), Gaps = 49/483 (10%) Frame = +3 Query: 48 NQNGFSGGNTGDFRQNQSDIYT-----------GEFQQNLFGQNRNFNTFHQENPKELYR 194 + NG G ++ F+QN ++ Y EFQ N F QN NFN + N + + 
Sbjct: 170 HSNGIYGESSRGFQQNSNEFYQHHAGVNSESHINEFQNNTFQQNGNFNDYGWYNNGQPHP 229 Query: 195 YPVGFH-QQNPIGHSKNPQWQVSYGTYREDPRV---------AQRGPSML------CGQN 326 G Q + G + N Q Q YG P +QR L C Q+ Sbjct: 230 NLSGAQPQMSRSGGNANVQNQ--YGPIHYGPGEVMQNRNGFNSQRFSESLGSFNGNCMQD 287 Query: 327 SLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXXSGG----------FQQNPTWSPREA--- 467 + + Q+ +G+Y GN +GG +QQNP ++ Sbjct: 288 TGQHQQALSGHYSGNFG--IHQNSPSFYQQDQNGGQYQWDQSRRQYQQNPNEGQYQSYSG 345 Query: 468 ----GVF--QESTNVKLDSESAE---TNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQG 620 G+ Q N K + + AE ++Q+ GT+E+L KEGKVKEA+EVLGLL+KQ Sbjct: 346 NIQNGMMASQVLNNCKHEDDFAEASRSSQNNGTLEQLDGLVKEGKVKEAIEVLGLLEKQC 405 Query: 621 IKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDAC 800 I ++L + QLM+ACG+A +L+EAKAVH+H+ R L ++V+ YN IL+MY++C SM DA Sbjct: 406 ISVDLPTFSQLMQACGDAKALEEAKAVHEHVERLLSPLRVSTYNGILKMYSECDSMDDAF 465 Query: 801 EVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLG 980 VF M ER+LT+WDTMITG AKNGLGEDA+D+F+QFK+AGLKPD Q+F GVFSACS LG Sbjct: 466 SVFSNMTERDLTSWDTMITGFAKNGLGEDAVDIFSQFKQAGLKPDDQIFVGVFSACSALG 525 Query: 981 DIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETM 1160 D+ EG+LHFESMSK YGI PSM+HYVS+VDMLGSTGY+DEA+EFIEKMP EP V+VWE + Sbjct: 526 DVVEGMLHFESMSKDYGIVPSMKHYVSIVDMLGSTGYLDEALEFIEKMPMEPDVDVWEKL 585 Query: 1161 MNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLE 1340 MNLCR+HGN ELGD CA++VE LDPSRL +SKAGLVPV S++AKEKE KKL+ NLLE Sbjct: 586 MNLCRMHGNLELGDRCAEIVEQLDPSRLNEKSKAGLVPVNASELAKEKENKKLASQNLLE 645 Query: 1341 VRS 1349 VRS Sbjct: 646 VRS 648 >ref|XP_004289385.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like [Fragaria vesca subsp. 
vesca] Length = 684 Score = 409 bits (1050), Expect = e-111 Identities = 233/469 (49%), Positives = 297/469 (63%), Gaps = 29/469 (6%) Frame = +3 Query: 30 PSEFPHNQNGFSGGNTGDFRQNQSDIYT----GEFQQNLFGQNRNFNTFHQENPKELYRY 197 P + N +GF G TG +Q +T G QQN + HQE P+EL + Sbjct: 104 PVQQSGNFSGFCGPETGRLQQTTQQDWTHLGRGSIQQNSY-------VAHQEKPRELGQI 156 Query: 198 PVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNGYYGGNVS 377 P GF Q+ +G Q +++ + + + G +S + Q+ N YY GN Sbjct: 157 PYGFKSQSSLGS----QGSINHSSLQHSIDYQKNTIGYNIGSSSAQKQQNLNDYYKGNGQ 212 Query: 378 EFXXXXXXXXXXXXXSGGFQQNPT-----WSPREAGVFQESTNVKLDSESAETNQH---- 530 G +QQ P SP E G +Q+S+ V +S + Q+ Sbjct: 213 YQQSVGYGRFQPSPHVGQYQQIPQVGQYQQSP-EVGQYQQSSAVGQYQQSPQVGQYQQTP 271 Query: 531 ----------------KGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKA 662 KGT+EEL FCKEGKVKEAVEVLGLL KQ + + L YFQLM A Sbjct: 272 QGGEPGEASDSNPNSVKGTLEELDLFCKEGKVKEAVEVLGLLKKQHVHVNLDQYFQLMHA 331 Query: 663 CGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTW 842 CGEA++L+EAKAVH+++ R L ++V+ YN+ILEMY+KC SM +A VF MP+RNLT+W Sbjct: 332 CGEANALEEAKAVHENM-RLLSPLEVSTYNRILEMYSKCGSMENALMVFNDMPKRNLTSW 390 Query: 843 DTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSK 1022 D MIT AKNGLGEDAIDLFTQFK+AGLKPD QMF GV ACSV+GD +EG+LHFESMSK Sbjct: 391 DIMITWFAKNGLGEDAIDLFTQFKKAGLKPDDQMFNGVLYACSVVGDAEEGLLHFESMSK 450 Query: 1023 VYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGD 1202 YGI P+M+++V VVDMLGS G++DEA+EFIEKMP P+V+VW+T+M+ CRVHG ELGD Sbjct: 451 DYGIVPTMKNHVCVVDMLGSIGFLDEALEFIEKMPLGPNVDVWKTLMHYCRVHGYLELGD 510 Query: 1203 HCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 CA+LV+ LDPS L QSKAGL+PV SD+ KEKEKK+L+ NLLEVRS Sbjct: 511 CCAELVDQLDPSCLNEQSKAGLIPVNESDLVKEKEKKQLAAKNLLEVRS 559 >ref|XP_007041021.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] gi|508704956|gb|EOX96852.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] Length = 710 Score = 407 bits (1046), Expect = e-111 Identities = 203/346 
(58%), Positives = 262/346 (75%), Gaps = 3/346 (0%) Frame = +3 Query: 321 QNSLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXXSGGFQQN--PTWSPREAGVFQESTNV 494 QN+ +FQ++Q+ +G N S++ QQN ++ G+ ++N Sbjct: 257 QNNWQFQQSQSDQHGANFSQY-----------------QQNRQDIYNANPYGLVSATSNP 299 Query: 495 KLDS-ESAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGE 671 + +S E +ET+ + TVE L FC++G VKEAVEVLG ++KQG+ ++L QLMKACGE Sbjct: 300 EGESTEVSETSSNNATVETLDEFCRKGNVKEAVEVLGSMEKQGVHVDLPRMLQLMKACGE 359 Query: 672 ASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTM 851 +LQEAK VH+HLIRS +K+++ NKILE+Y+KC SM D+ EVF++M RNLT+WDTM Sbjct: 360 VKALQEAKTVHEHLIRSFSPLKISICNKILEIYSKCGSMDDSFEVFDKMRRRNLTSWDTM 419 Query: 852 ITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYG 1031 IT LAKNGLGEDA+DLF++FK+ GLKPDG+MF GVFSAC V+ D++EG+LHF SMS YG Sbjct: 420 ITWLAKNGLGEDALDLFSEFKKTGLKPDGKMFIGVFSACGVVSDVNEGMLHFASMSSEYG 479 Query: 1032 IAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCA 1211 I PSMEHYV VVDMLGSTG++DEA+EFIEKMP EPSV+VWET+MNLCRVHG+ ELGD CA Sbjct: 480 IVPSMEHYVGVVDMLGSTGHLDEALEFIEKMPLEPSVDVWETLMNLCRVHGHLELGDQCA 539 Query: 1212 KLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 +LVE LDPSRL QSKAGL+P+K SD+AK+ +KKKL + LEVRS Sbjct: 540 ELVEQLDPSRLNEQSKAGLIPLKDSDLAKQNDKKKLPSQSPLEVRS 585 >gb|AAT66765.1| Putative selenium-binding protein, related [Solanum demissum] Length = 741 Score = 404 bits (1039), Expect = e-110 Identities = 231/453 (50%), Positives = 289/453 (63%), Gaps = 14/453 (3%) Frame = +3 Query: 33 SEFPHNQNGFSGGNTGDF------------RQNQSDIYTGEFQQNLFGQNRNFNTFHQEN 176 S HN+N G N+ + + +Q+ +Y G +QQNL G N + ++N Sbjct: 180 SNLVHNRNDSGGENSSNLLRSSRFEGGLEAQPSQNGVY-GHYQQNLNGGNSETS---RQN 235 Query: 177 PKELYRYPVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNG 356 Y G QQN + V + P+ A G +M NS + R G Sbjct: 236 FSGNYTSNGGVPQQNLSNYDPGNVRNVQSENSEKYPQNAS-GYNMERHTNSSGYSREMMG 294 Query: 357 YYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAG--VFQESTNVKLDSESAETNQH 530 Y N+S F S G Q + + G + ST V+ +S +++ Sbjct: 295 
LYQQNLSGFNPS----------SAGHQASYQYQNGIVGHQEMRSSTPVEQSIDSDDSSSK 344 Query: 531 KGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDH 710 KG+V+EL CKEGKVKEAVEVL LLD+Q + ++L Y LM C E SL++AK++H+H Sbjct: 345 KGSVDELDDLCKEGKVKEAVEVLQLLDQQHVTVDLSRYIMLMDVCSEDKSLEDAKSIHEH 404 Query: 711 LIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDA 890 L+RS ++ + +YNKILEMY KC SM DA VF +MP+RNLT+WDTMIT L KNGLGEDA Sbjct: 405 LVRSHPHLDIKMYNKILEMYGKCGSMKDAFLVFRKMPQRNLTSWDTMITWLGKNGLGEDA 464 Query: 891 IDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVD 1070 I+LF +FKE G+KPDGQMF GVF ACSV+GDI EG+LHFESMSK Y I SME YV VVD Sbjct: 465 IELFGEFKETGMKPDGQMFLGVFHACSVVGDIVEGMLHFESMSKDYDIDLSMEQYVGVVD 524 Query: 1071 MLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTV 1250 MLGSTGY+DEAMEFIE+MP EPS+EVWETMMNLCR+HGN ELGD CA++VE LDPSRL Sbjct: 525 MLGSTGYLDEAMEFIERMPIEPSIEVWETMMNLCRIHGNLELGDRCAEIVELLDPSRLDE 584 Query: 1251 QSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 QSKAG + VK SDIAKEKEKK+ S +LLE RS Sbjct: 585 QSKAGFLAVKASDIAKEKEKKR-SAQSLLEARS 616 >ref|XP_004293010.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like [Fragaria vesca subsp. 
vesca] Length = 681 Score = 404 bits (1037), Expect = e-110 Identities = 230/468 (49%), Positives = 296/468 (63%), Gaps = 28/468 (5%) Frame = +3 Query: 30 PSEFPHNQNGFSGGNTGDFRQNQSDIYT----GEFQQNLFGQNRNFNTFHQENPKELYRY 197 P + N + F G G +Q +T G QQN + HQE P+EL + Sbjct: 101 PVQQSGNFSRFRGPENGRLQQTTQLDWTHLGHGSIQQNSY-------VAHQEKPRELGQI 153 Query: 198 PVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNGYYGGNVS 377 P GF Q+ S Q +++ + + + G +S + Q+ N YY GN Sbjct: 154 PYGFKSQS----SSGSQGSINHSSLQHSIDYQKNTIGYNIGSSSTQKQQNLNDYYKGNGQ 209 Query: 378 EFXXXXXXXXXXXXXSGGFQQNPT--------------WSPR--------EAGVFQESTN 491 G +QQ P SP+ E G +Q+S Sbjct: 210 YQQSVGYGRFQPSPHVGQYQQIPQVGQYQQSPQVGQHQQSPQVGQYQQSSEVGQYQQSPQ 269 Query: 492 VKLDSESAETNQH--KGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKAC 665 V +E++++N + KGT+EEL FCKEGKVKEAVEVLGLL KQ + + L YFQLM AC Sbjct: 270 VGEPAEASDSNPNSVKGTLEELDLFCKEGKVKEAVEVLGLLKKQHVHVNLDQYFQLMHAC 329 Query: 666 GEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWD 845 GEA++L+EAKAVH+++ + L ++V+ YN+ILEMY+KC SM +A VF MP+RNLT+WD Sbjct: 330 GEANALEEAKAVHENM-QLLSPLEVSTYNRILEMYSKCGSMENALMVFNDMPKRNLTSWD 388 Query: 846 TMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKV 1025 MIT AKNGLGEDAIDLFTQFK+AGLKPD QMF GV ACSV+GD +EG+LHFESMSK Sbjct: 389 IMITWFAKNGLGEDAIDLFTQFKKAGLKPDDQMFNGVLYACSVVGDAEEGLLHFESMSKD 448 Query: 1026 YGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDH 1205 YGI P+M++YV VVDM GS G++DEA+EFIEKMP P+V+VW+T+M+ CRVHG ELGD Sbjct: 449 YGIVPTMKNYVCVVDMFGSIGFLDEALEFIEKMPLGPNVDVWKTLMHYCRVHGYLELGDR 508 Query: 1206 CAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 CA+LV+ LDPS L QSKAGL+PV SD+ KEKEKK+L+ NLLEVRS Sbjct: 509 CAELVDQLDPSCLNEQSKAGLIPVNESDLVKEKEKKQLAAKNLLEVRS 556 >ref|XP_006355128.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like [Solanum tuberosum] Length = 741 Score = 402 bits (1033), Expect = e-109 Identities = 228/453 (50%), Positives = 287/453 (63%), Gaps = 14/453 (3%) Frame = +3 Query: 33 
SEFPHNQNGFSGGNTGDF------------RQNQSDIYTGEFQQNLFGQNRNFNTFHQEN 176 S HN+N G N+ + + +Q+ +Y G +QQNL G N + ++N Sbjct: 180 SNLVHNRNDSGGENSSNLLRSSRFEGGLEAQPSQNGVY-GHYQQNLNGGNSEMS---RQN 235 Query: 177 PKELYRYPVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNG 356 Y VG QQN + V P+ A G +M NS + R G Sbjct: 236 FSGNYTSNVGVPQQNLSNYDPGNVRNVQSENSETYPQNAS-GYNMERHTNSSGYSREMMG 294 Query: 357 YYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAG--VFQESTNVKLDSESAETNQH 530 Y N+S F S G Q + + G + ST V+ +S +++ Sbjct: 295 QYQQNLSGFNPS----------SAGHQASYQYQNGIVGHQEMRSSTPVEQSIDSDDSSSK 344 Query: 531 KGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDH 710 KG+V+EL CKEGKVKEAVE+L LL++Q + ++L Y LM C E SL++AK++H+H Sbjct: 345 KGSVDELDDLCKEGKVKEAVEILQLLEQQHVTVDLSRYIMLMDVCSEDKSLEDAKSIHEH 404 Query: 711 LIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDA 890 L+RS ++ + +YNKILEMY KC SM DA VF +MP+RNLT+WDTMIT L KNG GEDA Sbjct: 405 LVRSHPHLDIKMYNKILEMYGKCGSMKDAFLVFRKMPQRNLTSWDTMITWLGKNGFGEDA 464 Query: 891 IDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVD 1070 I+LF +FKE G+KPDGQMF GVF ACSV+GDI EG+LHFESMSK Y I SME YV VVD Sbjct: 465 IELFGEFKETGMKPDGQMFLGVFHACSVVGDIVEGMLHFESMSKDYDIDLSMEQYVGVVD 524 Query: 1071 MLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTV 1250 MLGSTGY+DEAMEFIE+MP EPS+EVWETMMNLCR+HGN ELGD CA++VE LDPSRL Sbjct: 525 MLGSTGYLDEAMEFIERMPIEPSIEVWETMMNLCRIHGNLELGDRCAEIVELLDPSRLDE 584 Query: 1251 QSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 QSK G + VK SDIAKEKEKK+ S +LLE RS Sbjct: 585 QSKTGFLAVKASDIAKEKEKKR-SAQSLLEARS 616 >ref|XP_004238807.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like [Solanum lycopersicum] Length = 727 Score = 397 bits (1021), Expect = e-108 Identities = 228/443 (51%), Positives = 287/443 (64%), Gaps = 4/443 (0%) Frame = +3 Query: 33 SEFPHNQNGFSGGNTG--DFRQNQSDIYTGEFQQNLFGQNRNFNTFHQENPKELYRYPVG 206 S+ HN+N S G + + +Q+ +Y G +QQNL G N + Q+N Y VG Sbjct: 180 SDLVHNRNDRSSRFEGGLEAQSSQNGVY-GHYQQNLNGGN---SVTSQQNFNGNYMRNVG 235 Query: 207 
FHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNGYYGGNVSEFX 386 QQN + V E P+ A G +M NS + R G Y N+S F Sbjct: 236 MPQQNISNYDPGNVRNVQ----SEYPQNAS-GYNMERHTNSSGYSREMMGRYQQNLSSFN 290 Query: 387 XXXXXXXXXXXXSGGFQQNPTWSPREAG--VFQESTNVKLDSESAETNQHKGTVEELVSF 560 S G Q + + G + +T V+ +S +++ KG+V+EL Sbjct: 291 PS----------SAGHQASYQYQNGIVGHQEMRSATPVEQLIDSDDSSSKKGSVDELDDL 340 Query: 561 CKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKV 740 CKEGKVKEAVEVL LL++Q + ++L Y LM C E SL++AK++H+HL+RS ++ + Sbjct: 341 CKEGKVKEAVEVLQLLEQQHVTVDLSRYIMLMDVCSEDKSLEDAKSIHEHLVRSHPHLDI 400 Query: 741 NVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEA 920 +YNKILEMY KC SM DA VF +MP+RNLT+WDTMIT L KNGLGEDAI+LF +FKE Sbjct: 401 KMYNKILEMYGKCGSMKDAFLVFRKMPQRNLTSWDTMITWLGKNGLGEDAIELFGEFKET 460 Query: 921 GLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDE 1100 G+KPDGQMF GVF ACSV+GDI EG+LHFESMSK Y I SME YV VDMLGSTGY+DE Sbjct: 461 GMKPDGQMFLGVFHACSVVGDIVEGMLHFESMSKDYDIDLSMEQYVGAVDMLGSTGYLDE 520 Query: 1101 AMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVK 1280 AM+FIE+MP EPS++VWETMMNLCR+HGN ELGD CA++VE LDP RL QSKAG + VK Sbjct: 521 AMDFIERMPIEPSIDVWETMMNLCRIHGNLELGDRCAEIVELLDPCRLDEQSKAGFLAVK 580 Query: 1281 PSDIAKEKEKKKLSDHNLLEVRS 1349 SDIA EKEKKK S +LLE RS Sbjct: 581 ASDIATEKEKKK-SAQSLLEARS 602 >gb|EXB93974.1| hypothetical protein L484_015521 [Morus notabilis] Length = 719 Score = 396 bits (1017), Expect = e-107 Identities = 226/484 (46%), Positives = 304/484 (62%), Gaps = 36/484 (7%) Frame = +3 Query: 6 NPRAVQGGPSEFPHNQNGFSGGNTG----DFRQNQSDIYT----GEFQQ-------NLFG 140 NP V+ E +QNG GG G + +QN + +Y G F+ N G Sbjct: 121 NPSGVRSQSHESDFSQNGNYGGVYGQSNNNLQQNSNGVYQNDSGGHFENSSNTSVNNSNG 180 Query: 141 QNRNFNTFH-------QENPKELYRYPVGFHQQNPI----GHSKNPQWQVSYGTYREDPR 287 +N NF+ ++ Q+N Y ++ QN G + + Q G + Sbjct: 181 KNGNFSGYYGQSNGVLQQNSNPNREYGGNWNMQNIYDSYQGRAVEMKQQNPSGFSSQGIS 240 Query: 288 VAQRGPSMLCGQ-NSLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPRE 464 + P+ Q +S +FQ++ N Y GNV G +QQN + 
P + Sbjct: 241 GVEGNPNRNYMQMHSSQFQQSLNNQYAGNVG--------MNQQSPSYGQYQQN--YQPNQ 290 Query: 465 AGVFQESTNVKLDS---------ESAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQ 617 G ++ S E++ ++ GT+EEL +FCKEGKVKEAV VLG L+K Sbjct: 291 NGFHNSMVAPQVPSSQIPEGAPAEASGSSPCGGTLEELDNFCKEGKVKEAVGVLGSLEKW 350 Query: 618 GIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDA 797 I ++L Y QLM+ACGEA +L+EAK VH+++++S V+ YN+ILEMY++C SM DA Sbjct: 351 HIPVDLPRYLQLMQACGEAKALEEAKVVHEYILKSESTPHVSTYNRILEMYSRCGSMEDA 410 Query: 798 CEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVL 977 VF++MPE+NLT+WD MIT LAKNG GEDAI++FT+FK+AG +PDGQMF GVF ACSV+ Sbjct: 411 FSVFDKMPEQNLTSWDIMITWLAKNGFGEDAIEMFTKFKQAGRRPDGQMFIGVFHACSVV 470 Query: 978 GDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWET 1157 GDI+EG+L F+SMSK +GI P+M+HY SVVDMLGSTGY+DEA+EFIEKMP +P+V+VWET Sbjct: 471 GDIEEGMLQFKSMSKDFGIIPTMDHYGSVVDMLGSTGYLDEALEFIEKMPLQPTVDVWET 530 Query: 1158 MMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLL 1337 +MN CRVHG+ +LGD CA+LVE L+PSRLT + KAGLVPVK SD+ K KEKK L N+L Sbjct: 531 LMNFCRVHGHMDLGDRCAELVEQLEPSRLTKELKAGLVPVKASDLGKGKEKKNLVAQNIL 590 Query: 1338 EVRS 1349 E RS Sbjct: 591 EARS 594 >ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like [Cucumis sativus] Length = 671 Score = 394 bits (1011), Expect = e-107 Identities = 215/412 (52%), Positives = 272/412 (66%), Gaps = 8/412 (1%) Frame = +3 Query: 138 GQNRNFNTFHQENPKELYRYPVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGP---- 305 G R+ + +Q +E++ G+ N N + VS ++P GP Sbjct: 156 GSKRSMSQNNQLGHREIFSAYNGYGYNNEATQQNN--YGVSGQNLHDNP---MSGPNNHI 210 Query: 306 --SMLCGQNSLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAGVFQ 479 S QNS+ Q Q Y+ G+ E +Q N + Q Sbjct: 211 PLSRQYEQNSIPLQHPQGQYHQGSSVE----------------QYQPNTDTNQNSMIGTQ 254 Query: 480 ESTNVKLDSESAETN--QHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQL 653 NV + E E Q G +E+L FCKEGK+KEAV++L +L+KQ I ++L Y L Sbjct: 255 LLNNVNANEEIGEPKDCQDGGPLEKLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDL 314 Query: 654 
MKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNL 833 M ACGEA SL+EAK V +++I+S +VKV+ YNKILEMY+KC SM DA +F +MP RN+ Sbjct: 315 MNACGEARSLEEAKVVCNYVIKSQTHVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNI 374 Query: 834 TTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFES 1013 T+WDTMIT LAKNGLGEDAIDLF +FK+AGL+PDG+MF GVFSACSVLGD DEG+LHFES Sbjct: 375 TSWDTMITWLAKNGLGEDAIDLFYEFKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFES 434 Query: 1014 MSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTE 1193 M+K YGI PSM HYVS+VDMLGS G++DEA+EFIEKMP EP V++WETMMN+ R HG E Sbjct: 435 MTKNYGITPSMHHYVSIVDMLGSIGFVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLME 494 Query: 1194 LGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 LGD C +LVE+LD SRL QSKAGL+PVK SD+ KE+EKKKL++ NLLEVRS Sbjct: 495 LGDRCFELVEHLDSSRLNEQSKAGLLPVKASDLEKEREKKKLANRNLLEVRS 546 >ref|XP_007159402.1| hypothetical protein PHAVU_002G235300g [Phaseolus vulgaris] gi|561032817|gb|ESW31396.1| hypothetical protein PHAVU_002G235300g [Phaseolus vulgaris] Length = 679 Score = 393 bits (1010), Expect = e-107 Identities = 220/428 (51%), Positives = 272/428 (63%), Gaps = 19/428 (4%) Frame = +3 Query: 123 QQNLFGQNRNFNTFHQENPKELYRYPVGFHQQNPIGHSKNPQWQVSYGTYREDPRVAQRG 302 + NL G N + N + ++ ++ + +G G +P V + +DP + Sbjct: 130 RNNLVGHNGSVNGYFGQDNMKMQQVELGIDNAQASG--MHPNAFVDNHGWPQDPGQRMQS 187 Query: 303 PSMLCGQNSLEFQRTQNGYYGGNVSEFXXXXXXXXXXXXX-------SGGFQQN------ 443 P+ LE Q G N+ F SG FQQ+ Sbjct: 188 PNAYSSPGPLESQGNLRGGLNKNIDGFQLPSTVPYRRGNEMRQQYASSGQFQQSLKDGQY 247 Query: 444 -PTWSPREAGVFQE--STNVKLDSESAETNQ---HKGTVEELVSFCKEGKVKEAVEVLGL 605 P ++ + V S N D ES E + ++GT+EEL SFC EGKVKEAVEVL L Sbjct: 248 SPNFNIAQRSVVGPHLSNNANPDGESTEASNDGPYRGTLEELDSFCTEGKVKEAVEVLEL 307 Query: 606 LDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRS 785 L+KQ I ++L QLM CGE SL+EAK VH H ++ L + V+ YN+ILEMY +C S Sbjct: 308 LEKQLIPVDLPRCLQLMHQCGETKSLEEAKIVHRHALQHLSPLHVSTYNRILEMYLECGS 367 Query: 786 MSDACEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSA 965 + DA +F MPERNLTTWDTMIT 
LAKNG ED+IDLFTQFK GLKPDGQMF GV SA Sbjct: 368 VDDALNIFNNMPERNLTTWDTMITQLAKNGFAEDSIDLFTQFKNLGLKPDGQMFIGVLSA 427 Query: 966 CSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVE 1145 CSVLGDIDEG+LHFESMS+ YGI PSM H+VSVVDM+GSTG++DEA EFIEKM EP+ + Sbjct: 428 CSVLGDIDEGMLHFESMSRDYGIMPSMAHFVSVVDMIGSTGHLDEAFEFIEKMTMEPNAD 487 Query: 1146 VWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSD 1325 +WET+MNLCRVHGNT LGD CA+L+E LD SRL+ QSKAGLVPVK D+ KE+E KKL+ Sbjct: 488 IWETLMNLCRVHGNTGLGDLCAELLEQLDSSRLSDQSKAGLVPVKALDLTKERE-KKLAS 546 Query: 1326 HNLLEVRS 1349 NLLEVRS Sbjct: 547 KNLLEVRS 554 >ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g25580-like [Cucumis sativus] Length = 731 Score = 390 bits (1001), Expect = e-106 Identities = 220/449 (48%), Positives = 280/449 (62%), Gaps = 16/449 (3%) Frame = +3 Query: 51 QNGFSGGNTGDFRQNQSDIYTGEFQQNLFGQNRNF-----NTFHQENP---KELYRYPVG 206 +NG+ GG D Y G +N N N + Q N +E++ G Sbjct: 184 ENGYKGGVAQDHNS-----YNGSTPRNFVDMNNNVVCGVDRSMSQNNQLGHREIFSAYNG 238 Query: 207 FHQQNPIGHSKNPQWQVSYGTYREDPRVAQRGP------SMLCGQNSLEFQRTQNGYYGG 368 + N N + VS ++P GP S QNS+ Q Q Y+ G Sbjct: 239 YGYNNEATQQNN--YGVSGQNLHDNP---MSGPNNHIPLSRQYEQNSIPLQHPQGQYHQG 293 Query: 369 NVSEFXXXXXXXXXXXXXSGGFQQNPTWSPREAGVFQESTNVKLDSESAETN--QHKGTV 542 + E +Q N + Q NV + E E Q G + Sbjct: 294 SSVE----------------QYQPNTDTNQNSMIGTQLLNNVNANEEIGEPKDCQDGGPL 337 Query: 543 EELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRS 722 E+L FCKEGK+KEAV++L +L+KQ I ++L Y LM ACGEA SL+EAK V +++I+S Sbjct: 338 EKLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYVIKS 397 Query: 723 LVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLF 902 +VKV+ YNKILEMY+KC SM DA +F +MP RN+T+WDTMIT LAKNGLGEDAIDLF Sbjct: 398 QTHVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLF 457 Query: 903 TQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGS 1082 +FK+AGL+PDG+MF GVFSACSVLGD DEG+LHFESM+K YGI PSM HYVS+VDMLGS Sbjct: 458 
YEFKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDMLGS 517 Query: 1083 TGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKA 1262 G++DEA+EFIEKMP EP V++WETMMN+ R HG ELGD C +LVE+LD SRL QSKA Sbjct: 518 IGFVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLMELGDRCFELVEHLDSSRLNEQSKA 577 Query: 1263 GLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 GL+PVK SD+ K + ++KL++ NLLEVRS Sbjct: 578 GLLPVKASDLXKREREEKLANRNLLEVRS 606 >ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containing protein At2g25580-like isoform X1 [Glycine max] Length = 664 Score = 388 bits (996), Expect = e-105 Identities = 221/436 (50%), Positives = 274/436 (62%), Gaps = 4/436 (0%) Frame = +3 Query: 54 NGFSGGNTGDFRQNQSDIYTGEFQQNLFGQNRNFNTFHQENPKELYRYPVGFHQQNPIGH 233 NG+ G GD + Q G N +G + N F +++ + G Q+P + Sbjct: 124 NGYFG--QGDMKMQQK---VGAGVDNAWGSGMHANPFVEKHD---WTQEPGQGMQSPNAY 175 Query: 234 SKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNGYY-GGNVSEFXXXXXXXXX 410 S +P S G R D QN FQ+ QN +Y G + Sbjct: 176 S-SPGPLESQGNLRGD-----------LNQNIDHFQQPQNVHYKGSHEMRPQYPGYGQSQ 223 Query: 411 XXXXSGGFQQNPTWSPREAGVFQESTNVKLDSESAETNQ---HKGTVEELVSFCKEGKVK 581 G + N + R S+N D ESA+ + ++GT+EEL +FC EG VK Sbjct: 224 QSLKDGQYLPNLNTAQRSVVGSHLSSNANPDGESAKASNDSPYRGTLEELDNFCIEGNVK 283 Query: 582 EAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKIL 761 EAVEVL LL+K I ++L Y QLM CGE SL+EAK VH H ++ L ++V+ YN+IL Sbjct: 284 EAVEVLELLEKLDIPVDLPRYLQLMHQCGENKSLEEAKNVHRHALQHLSPLQVSTYNRIL 343 Query: 762 EMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQ 941 EMY +C S+ DA +F MPERNLTTWDTMIT LAKNG ED+IDLFTQFK GLKPDGQ Sbjct: 344 EMYLECGSVDDALNIFNNMPERNLTTWDTMITQLAKNGFAEDSIDLFTQFKNLGLKPDGQ 403 Query: 942 MFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEK 1121 MF GV AC +LGDIDEG+ HFESM+K YGI PSM H+VSVVDM+GS G++DEA EFIEK Sbjct: 404 MFIGVLFACGMLGDIDEGMQHFESMNKDYGIVPSMTHFVSVVDMIGSIGHLDEAFEFIEK 463 Query: 1122 MPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKE 1301 MP +PS ++WET+MNLCRVHGNT LGD CA+LVE LD S L QSKAGLVPVK SD+ KE Sbjct: 464 
MPMKPSADIWETLMNLCRVHGNTGLGDCCAELVEQLDSSCLNEQSKAGLVPVKASDLTKE 523 Query: 1302 KEKKKLSDHNLLEVRS 1349 KEK+ L++ NLLEVRS Sbjct: 524 KEKRTLTNKNLLEVRS 539 >ref|XP_003524191.2| PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Glycine max] Length = 665 Score = 386 bits (991), Expect = e-104 Identities = 226/453 (49%), Positives = 265/453 (58%), Gaps = 20/453 (4%) Frame = +3 Query: 51 QNGFSGGNTGDFRQNQSDIYTGEFQQNLFGQNRNFNT-FHQENPKELYRYPVGFHQQNPI 227 QN + G D T NL G N + N F Q N + + G Sbjct: 93 QNSYGAGQIADGGHINM---TQNVHNNLVGYNGSVNGYFGQGNMRMQQKVRAGVGSAWGS 149 Query: 228 GHSKNPQWQVSYGTYREDPRVAQRGPSMLCGQNSLEFQRTQNGYYGGNVSEFXXXXXXXX 407 G NP V + +P + P+ LE Q G N+ F Sbjct: 150 GMHANPL--VEKHGWTHEPGQKMQSPNAYGSPRPLESQGNLRGDLNQNIDHFQQPENVHY 207 Query: 408 XXXXX-------SGGFQQ---------NPTWSPREAGVFQESTNVKLDSESAETNQ---H 530 SG FQQ N + R S N D ES + + + Sbjct: 208 KGSHEMRQQYPGSGQFQQSLKDGRYLPNLNIAQRSGVGSHLSNNANHDGESDKASNDSPY 267 Query: 531 KGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSLQEAKAVHDH 710 + T+EEL +FC EG VKEAV VL LL+K I ++L Y QLM C E SL+EAK VH H Sbjct: 268 RATLEELDNFCIEGNVKEAVNVLELLEKLHIPVDLPRYLQLMHQCAENKSLEEAKIVHRH 327 Query: 711 LIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGLAKNGLGEDA 890 + L ++V+ YN+ILEMY +C S+ DA +F MPERNLTTWDTMIT LAKNG ED+ Sbjct: 328 TSQHLSPLQVSTYNRILEMYLECGSVDDALNIFNNMPERNLTTWDTMITQLAKNGFAEDS 387 Query: 891 IDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPSMEHYVSVVD 1070 IDLFTQFK GLKPDGQMF GV ACSVLGDIDEG+LHFESMSK YGI PSM H+VSVVD Sbjct: 388 IDLFTQFKNLGLKPDGQMFIGVLFACSVLGDIDEGMLHFESMSKDYGIVPSMTHFVSVVD 447 Query: 1071 MLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVEYLDPSRLTV 1250 M+GS G++DEA EFIE+MP EPS E WET+MNLCRVHGNT LGD CA+LVE LD SRL Sbjct: 448 MIGSIGHLDEAFEFIERMPMEPSAETWETLMNLCRVHGNTGLGDRCAELVEQLDSSRLNE 507 Query: 1251 QSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 QSKAGLVPVK SD+ KEKEKK L+ NLLEVRS Sbjct: 508 QSKAGLVPVKASDLTKEKEKKNLASKNLLEVRS 540 >gb|EYU21282.1| hypothetical protein MIMGU_mgv11b020544mg [Mimulus 
guttatus] Length = 613 Score = 382 bits (982), Expect = e-103 Identities = 200/360 (55%), Positives = 258/360 (71%), Gaps = 3/360 (0%) Frame = +3 Query: 279 DPRVAQRGPSMLCGQNSLEFQRTQNGYYGGNVSE--FXXXXXXXXXXXXXSGGFQQNPTW 452 D V R P+ L GQN + R QN + G V F + +QQN W Sbjct: 136 DGGVQIRNPNPLYGQN-FDAPR-QNSEFIGYVGRGNFSGQSNNDGINYSNAENYQQN--W 191 Query: 453 SPREAGVFQESTNVKLDS-ESAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKL 629 + G T +++S E AE + + +E+L F KEGK+KEAVE+LG++D +G+ L Sbjct: 192 NGNRVGGI---TGSQIESAEVAEGTRFRSKIEDLDEFIKEGKLKEAVELLGVIDNEGLAL 248 Query: 630 ELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVF 809 +L Y LM+ACGE +L EAK+VH HLI+S+ N++V +N ILEMY+KC SM DA VF Sbjct: 249 DLPRYVSLMQACGENEALDEAKSVHQHLIKSMPNLEVKTHNSILEMYSKCGSMKDAFSVF 308 Query: 810 ERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDID 989 ++MP+RNLT+WD MI+ LAKNGL ED+IDLFT+FK +GLKPDGQMF GVFSAC L DI Sbjct: 309 DQMPQRNLTSWDIMISWLAKNGLWEDSIDLFTEFKNSGLKPDGQMFVGVFSACGSLCDIV 368 Query: 990 EGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNL 1169 EG+LHFESM+K YGIAP+MEHYVS+V+MLG+ G+++EA+EFIEKMP +P V +WET+M Sbjct: 369 EGMLHFESMTKDYGIAPTMEHYVSIVEMLGNAGFLNEALEFIEKMPIKPDVNIWETLMKF 428 Query: 1170 CRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 R+HGNTELGD CA+LVE +DPSRL QS+AGL+P+ SDIA+EKEKKKLS N L+VRS Sbjct: 429 SRLHGNTELGDRCAELVELMDPSRLNEQSRAGLIPINASDIAREKEKKKLSGQNPLDVRS 488 >ref|XP_002529936.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223530566|gb|EEF32444.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 548 Score = 382 bits (982), Expect = e-103 Identities = 212/425 (49%), Positives = 270/425 (63%), Gaps = 30/425 (7%) Frame = +3 Query: 165 HQENPKELYRYPVGFHQQNPIGHSKNPQWQVSYGTYR----------EDPRVAQRGPSML 314 H +NP + ++ PV F G+ NP W T + +V+Q G S Sbjct: 52 HLDNPDDHFQNPVNFD-----GNVLNPNWGFRETTGNVIGSNNNHNYNNNQVSQIGNSQN 106 Query: 315 CG--QN-SLEFQRTQNG------------YYGGNVSEFXXXXXXXXXXXXX-----SGGF 434 G QN S ++ + N YY GNV + 
+G + Sbjct: 107 SGGYQNFSGSYRESWNNNYTQHVNQLPPPYYAGNVGMYPPSYGAAAQYQQNLSVVNAGQY 166 Query: 435 QQNPTWSPREAGVFQESTNVKLDSESAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDK 614 N S N K++ + AE++ ++GT+EEL CKE KVKEAVEVL LL++ Sbjct: 167 MSNSNDVQNVMVASGVSNNPKVEDDLAESSSYRGTLEELDELCKEKKVKEAVEVLNLLEE 226 Query: 615 QGIKLELQAYFQLMKACGEASSLQEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSD 794 + + ++L + QLM+ CGEA + +EAK VHDHL+R + V+ +NKILEMY KC M Sbjct: 227 RRVLVDLPRFLQLMRICGEAKASEEAKVVHDHLVRLQSPLAVSTFNKILEMYGKCGDMDS 286 Query: 795 ACEVFERMPERNLTTWDTMITGLAKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSV 974 A VF +MP+RNLTTWDTMI LAKNGLGEDAIDLF+QFK+AGL PD Q+F GVFSAC V Sbjct: 287 AFAVFNKMPKRNLTTWDTMIAWLAKNGLGEDAIDLFSQFKQAGLVPDAQLFIGVFSACGV 346 Query: 975 LGDIDEGILHFESMSKVYGIAPSMEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWE 1154 +GD+ EG+LHFESM K YGI PSMEH+VS+VDMLG+ G++DEA+EFIEKMP EPS++VWE Sbjct: 347 VGDVIEGMLHFESMKKDYGIVPSMEHFVSIVDMLGTIGHLDEALEFIEKMPMEPSIDVWE 406 Query: 1155 TMMNLCRVHGNTELGDHCAKLVEYLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNL 1334 ++MNL R+HGN ELGD CAKLVE LD S L QSKAGLVP K S++ KEK+KK S NL Sbjct: 407 SLMNLSRIHGNLELGDRCAKLVELLDASHLNEQSKAGLVPAKVSNLTKEKDKKPAS-QNL 465 Query: 1335 LEVRS 1349 LEVRS Sbjct: 466 LEVRS 470 >ref|XP_002320276.2| hypothetical protein POPTR_0014s11190g [Populus trichocarpa] gi|566203726|ref|XP_006375426.1| hypothetical protein POPTR_0014s11190g [Populus trichocarpa] gi|550323972|gb|EEE98591.2| hypothetical protein POPTR_0014s11190g [Populus trichocarpa] gi|550323973|gb|ERP53223.1| hypothetical protein POPTR_0014s11190g [Populus trichocarpa] Length = 466 Score = 377 bits (969), Expect = e-102 Identities = 186/282 (65%), Positives = 227/282 (80%) Frame = +3 Query: 504 SESAETNQHKGTVEELVSFCKEGKVKEAVEVLGLLDKQGIKLELQAYFQLMKACGEASSL 683 +ES++++ ++G++EEL FCKEGKVKEAVE L LL KQ + ++L Y QLM+ACGEA +L Sbjct: 61 TESSKSSLNRGSMEELDEFCKEGKVKEAVEFLQLLQKQSVFVDLSRYLQLMQACGEAEAL 120 Query: 684 QEAKAVHDHLIRSLVNVKVNVYNKILEMYAKCRSMSDACEVFERMPERNLTTWDTMITGL 863 +EA+ +HD ++RS + V NKILEMY+KC +M +A VF+ M E NLT+W MIT L Sbjct: 121 
EEARVIHDCIVRSQSPLDVGTLNKILEMYSKCGAMDEAFSVFDNMQECNLTSWYIMITWL 180 Query: 864 AKNGLGEDAIDLFTQFKEAGLKPDGQMFFGVFSACSVLGDIDEGILHFESMSKVYGIAPS 1043 AKNG GEDAIDLF QFK+ GLKPD Q+F GVFSAC+VLGDI+EG+LHFESM K + I PS Sbjct: 181 AKNGYGEDAIDLFNQFKQGGLKPDAQIFVGVFSACNVLGDINEGLLHFESMWKEFSIVPS 240 Query: 1044 MEHYVSVVDMLGSTGYMDEAMEFIEKMPFEPSVEVWETMMNLCRVHGNTELGDHCAKLVE 1223 MEHYVS+VDMLGS GY+ EA+EFIEKMP EPSV+VWET+MNLCR HG+ ELGD CA+L+E Sbjct: 241 MEHYVSIVDMLGSNGYLVEALEFIEKMPMEPSVDVWETLMNLCRAHGHLELGDRCAELIE 300 Query: 1224 YLDPSRLTVQSKAGLVPVKPSDIAKEKEKKKLSDHNLLEVRS 1349 LDPSRL QS AGLVPVK SDIAKEK KKK + NLL+VRS Sbjct: 301 QLDPSRLNEQSNAGLVPVKASDIAKEK-KKKTASQNLLDVRS 341