BLASTX nr result
ID: Akebia25_contig00016407
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00016407
         (869 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002272339.1| PREDICTED: pentatricopeptide repeat-containi...   292   1e-76
ref|XP_007219476.1| hypothetical protein PRUPE_ppa021440mg, part...   252   1e-64
ref|XP_004308191.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   246   8e-63
gb|EXB44293.1| hypothetical protein L484_012212 [Morus notabilis]     238   3e-60
ref|XP_002525572.1| pentatricopeptide repeat-containing protein,...   218   3e-54
ref|XP_007010632.1| Pentatricopeptide repeat-containing protein,...   217   5e-54
ref|NP_179518.1| pentatricopeptide repeat-containing protein [Ar...   206   1e-50
gb|AAS99720.1| At2g19280 [Arabidopsis thaliana] gi|62319953|dbj|...   204   4e-50
ref|XP_002886049.1| pentatricopeptide repeat-containing protein ...   201   3e-49
ref|XP_004147131.1| PREDICTED: pentatricopeptide repeat-containi...   200   5e-49
ref|XP_006300135.1| hypothetical protein CARUB_v10016364mg [Caps...   192   1e-46
ref|XP_006409070.1| hypothetical protein EUTSA_v10023028mg, part...   182   2e-43
ref|XP_007010634.1| Pentatricopeptide repeat-containing protein,...   153   7e-35
ref|XP_002272603.2| PREDICTED: pentatricopeptide repeat-containi...    99   2e-18
emb|CBI18516.3| unnamed protein product [Vitis vinifera]               99   2e-18
emb|CAN75473.1| hypothetical protein VITISV_002797 [Vitis vinifera]    99   2e-18
ref|XP_006828912.1| hypothetical protein AMTR_s00001p00203780 [A...    92   3e-16
ref|XP_007131879.1| hypothetical protein PHAVU_011G049000g [Phas...    91   6e-16
ref|XP_002522775.1| pentatricopeptide repeat-containing protein,...    91   6e-16
ref|XP_007138443.1| hypothetical protein PHAVU_009G209400g [Phas...    90   1e-15

>ref|XP_002272339.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Vitis vinifera]
          Length = 644

 Score = 292 bits (748), Expect = 1e-76
 Identities = 151/275 (54%), Positives = 197/275 (71%), Gaps = 1/275 (0%)
 Frame = +3

Query: 48   VEHNGLYPSGKEISDFYCRESSIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDE 227
            +E NGL +I + ++S I E +D ME IK++L NRGW+LG +NG IDL +
Sbjct: 1    MEWNGLSSGENDIFAYVDKDSLISENEKAVDDEMEIIKVILTNRGWNLGSQNGYRIDLSQ 60

Query: 228  LNVIRILNDLFDDNSNAALAFYFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLL 407
            NV++ILNDLF+++++AALA YFFRW E KH++ S+CTMIHILVSGNMNH+A+DLL
Sbjct: 61   FNVMKILNDLFEESTDAALALYFFRWSEYCMGSKHTVESVCTMIHILVSGNMNHKAMDLL 120

Query: 408  RHLTRNKDGGEEWHGSVFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKM 587
            HL G E WH ++F + ET RVLETVY MLV+ YV+E M ALKL+ M+
Sbjct: 121  LHLISYNSGEEGWH-NIFLKIHETHTKRRVLETVYGMLVNCYVKENMTQVALKLICKMRH 179

Query: 588  HNIFPSIGVCNRLLKAILESKKMELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLAT 764
            NIFP IGVCN LLKA+LES+++ LAW+FL EM +G+ NA IISLFI YC++G++ T
Sbjct: 180  LNIFPLIGVCNSLLKALLESEQLNLAWDFLKEMKSQGLGLNASIISLFISGYCSQGNIDT 239

Query: 765  ACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLKE 869
            KLL+EM+ G +PDVV+YTIVID+ CKM LLKE
Sbjct: 240  GWKLLMEMKYLGIKPDVVAYTIVIDSLCKMSLLKE 274

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 38/121 (31%), Positives = 60/121 (49%), Gaps = 1/121 (0%)
 Frame = +3

Query: 510  YSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMH 689
            Y+ +++ Y + K IS ALK + M I PS+ L+ + + ME+A M
Sbjct: 361  YTTMMAGYCKVKDISNALKYLGKMLKRGIRPSVATYTLLIDSCCKPGNMEMAEYLFQRMI 420

Query: 690  IRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            G+ + + ++ Y KG L A +LL MR+ G PD+V+Y I+I K GL+
Sbjct: 421  TEGLVPDVVSYNTLMNGYGKKGHLQKAFELLSMMRSAGVSPDLVTYNILIHGLIKRGLVN 480

Query: 867  E 869
            E
Sbjct: 481  E 481

>ref|XP_007219476.1| hypothetical protein PRUPE_ppa021440mg, partial [Prunus persica]
 gi|462415938|gb|EMJ20675.1| hypothetical protein PRUPE_ppa021440mg, partial [Prunus
persica]
          Length = 675

 Score = 252 bits (643), Expect = 1e-64
 Identities = 137/289 (47%), Positives = 185/289 (64%), Gaps = 18/289 (6%)
 Frame = +3

Query: 57   NGLYPSGKE-------ISDFYCRESSIFEKLTN----------EEDGMERIKLVLVNRGW 185
            NG++ S K I++ YC E + E + +ED M+R+ L+L RGW
Sbjct: 56   NGIFLSAKSYPTDFRGINELYCGEDGVCEPVDTGFLFSINERPDEDEMKRLMLILAKRGW 115

Query: 186  DLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAFYFFRWCECYSEFKHSICSICTMIHI 365
            +LG +NG I L++LN I +LNDLF+++ +A L YFF+W EC S KH++ +IC MIHI
Sbjct: 116  NLGCQNGYNIYLNQLNTIELLNDLFEESFDAKLVLYFFKWSECCSGSKHTLQTICRMIHI 175

Query: 366  LVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVLKETSRDSRVLETVYSMLVSSYVREK 545
            LVSGN+NHRAVDL+ L RN G EE S+ VL ET + RVLET SMLV+ Y++E
Sbjct: 176  LVSGNLNHRAVDLILRLVRN-HGDEESCNSLLEVLDETHSEIRVLETTCSMLVNGYIQEG 234

Query: 546  MISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMHIRGMS-NACIIS 722
            M++ ALK+ MK NIFPS G + ELAW+FL M RGM NA ++S
Sbjct: 235  MVNMALKIACQMKHLNIFPSNG----------DQSSSELAWDFLEVMRTRGMGLNAAMMS 284

Query: 723  LFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLKE 869
            LFI++YC++G L + KLL+EM+NYG +PDVVS+TIVI++ CKM L E
Sbjct: 285  LFINKYCSEGDLESGWKLLLEMKNYGIQPDVVSFTIVINSLCKMSYLNE 333

>ref|XP_004308191.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g19280-like [Fragaria vesca subsp.
vesca]
          Length = 599

 Score = 246 bits (628), Expect = 8e-63
 Identities = 123/242 (50%), Positives = 167/242 (69%), Gaps = 1/242 (0%)
 Frame = +3

Query: 147  MERIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAFYFFRWCECYSEF 326
            M+RI L+L R W G +NG I ++ N++++LN LF+++ +A LA YFF+W EC +
Sbjct: 1    MKRIMLILAKRPWSRGCQNGYNIYRNQFNIVKVLNYLFEESLDANLALYFFKWSECCNGS 60

Query: 327  KHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVLKETSRDSRVLET 506
            KH + + C M+HILVSGN+NHRAVDL+RHL RN EE + VL T ++RVLET
Sbjct: 61   KHMVQAACRMVHILVSGNINHRAVDLVRHLVRNHT-EEETCNLLLEVLYGTHSETRVLET 119

Query: 507  VYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEM 686
            V SMLV Y++E M++ AL + K NIFPS GVCN LL+A+LES ++ AW+FL M
Sbjct: 120  VCSMLVDQYIKEGMVNMALNVTYETKGQNIFPSGGVCNTLLRALLESNQLNFAWDFLEVM 179

Query: 687  HIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLL 863
            RG+ N+ IISLFIH++C +G L + KLLV+M+ YG +PDVV Y IVID+ C+M L
Sbjct: 180  QTRGLGLNSTIISLFIHKFCREGDLGSGFKLLVDMKKYGIQPDVVXYAIVIDSLCRMSYL 239

Query: 864  KE 869
            KE
Sbjct: 240  KE 241

>gb|EXB44293.1| hypothetical protein L484_012212 [Morus notabilis]
          Length = 710

 Score = 238 bits (606), Expect = 3e-60
 Identities = 124/247 (50%), Positives = 168/247 (68%), Gaps = 4/247 (1%)
 Frame = +3

Query: 126  LTNEEDGME---RIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAFYF 296
            LTN++ + RI VL NRGWDL NG + L E+N+IRI++DLF+++S+A LA YF
Sbjct: 91   LTNQKAKVREVGRITRVLKNRGWDLTSPNGYRVKLSEVNIIRIMDDLFEESSDAELALYF 150

Query: 297  FRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVLKE 476
            F W E KH++ S+C MIHIL SGNM HRA+DL+ HL R + EE + + VL E
Sbjct: 151  FTWSESRIGSKHTVRSVCRMIHILASGNMKHRAMDLILHLVR-RYKEEESYSFLLEVLYE 209

Query: 477  TSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKM 656
            ET + + E V SMLV+ Y++EK ++ ALKL +K HNIFPS V N +L+ ++ SK++
Sbjct: 210  THTERMIFEIVCSMLVNCYIKEKCLNAALKLTCQLKQHNIFPSDRVSNAMLRELIGSKQL 269

Query: 657  ELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIV 833
            ELAW++L + RGM NA ISLFIH YC +G+ + KLL MR+YG +PDV+SYTI+
Sbjct: 270  ELAWDWLEIIQSRGMGLNASTISLFIHYYCKEGNFESGWKLLCRMRDYGVKPDVISYTII 329

Query: 834  IDAFCKM 854
            IDA CKM
Sbjct: 330  IDALCKM 336

>ref|XP_002525572.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223535151|gb|EEF36831.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 687

 Score = 218 bits (554), Expect = 3e-54
 Identities = 112/232 (48%), Positives = 156/232 (67%), Gaps = 4/232 (1%)
 Frame = +3

Query: 183  WDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAFYFFRWCECYSEFKHSICSICTMIH 362
            W LG DL +++V+ +LNDLF ++ NAA A YFFR +C S +H+I S+C +IH
Sbjct: 75   WSLGCSTRFITDLSQVSVLGVLNDLFGESFNAAFALYFFRLSQCCSGLEHTIRSLCRLIH 134

Query: 363  ILVSGNMNHRAVDLLRHLTRNKDGG---EEWHGSVFTVLKETSRDSRVLETVYSMLVSSY 533
            ILV G N+R +DL+ L RN G EE +F ++ +T ++ LETVYSMLV Y
Sbjct: 135  ILVYGKRNYRVMDLILFLVRNIGGAVGEEELCDLLFKLVYDTGFGTKDLETVYSMLVDCY 194

Query: 534  VREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMHIRGMS-NA 710
            V E +S AL L+ +K+ NIFPS+GVCN LLKA+L S +++LAW+ L M GM NA
Sbjct: 195  VTESKVSLALNLIHEIKLLNIFPSMGVCNSLLKALLRSHQLDLAWDILEGMQSFGMHLNA 254

Query: 711  CIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            I+SLFI YCA+G++ + K+L+EM+NYG + DV++YTIVIDA CK+ +K
Sbjct: 255  SILSLFIESYCAEGNIQSGWKILMEMKNYGIKADVIAYTIVIDALCKISCVK 306

>ref|XP_007010632.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao]
 gi|590567863|ref|XP_007010633.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao]
 gi|508727545|gb|EOY19442.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao]
 gi|508727546|gb|EOY19443.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao]
          Length = 661

 Score = 217 bits (552), Expect = 5e-54
 Identities = 116/239 (48%), Positives = 154/239 (64%), Gaps = 1/239 (0%)
 Frame = +3

Query: 156  IKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAFYFFRWCECYSEFKHS 335
            IK +L RGW++ N ID +E +VI IL LF+++ +A LA YFF+ E HS
Sbjct: 54   IKSILWKRGWNINPDNLCPIDFNESSVIGILTHLFEESLDAELALYFFKLSERCVGSLHS 113

Query: 336  ICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVLKETSRDSRVLETVYS 515
            + S+C MIHILVSGNMNHRAVD + L R + + + ET D VLETV S
Sbjct: 114  VKSVCKMIHILVSGNMNHRAVDFILRLVRISCSKDVSEDLLLKLFYETHSDRMVLETVCS 173

Query: 516  MLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMHIR 695
            MLV Y++E + AL+L MK N+ PSIGVCN LLKA+LE +++LAW+FL +M +
Sbjct: 174  MLVDCYIKENEVGLALELACKMKSFNMIPSIGVCNSLLKALLELNELDLAWDFLDQMLRQ 233

Query: 696  GMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLKE 869
            G N I+SLFI +YC KG L +A L+EM+NYG +PDVV+YTI+ID+ CK+ L E
Sbjct: 234  GSGLNVAIVSLFIDKYCRKGQLLSAWTFLMEMKNYGIKPDVVAYTIIIDSLCKVSCLGE 292

>ref|NP_179518.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|334184304|ref|NP_001189552.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|218546774|sp|Q6NKW7.2|PP164_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g19280
 gi|3135258|gb|AAC16458.1| putative salt-inducible protein [Arabidopsis thaliana]
 gi|330251769|gb|AEC06863.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|330251770|gb|AEC06864.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 693

 Score = 206 bits (523), Expect = 1e-50
 Identities = 115/254 (45%), Positives = 157/254 (61%), Gaps = 1/254 (0%)
 Frame = +3

Query: 111  SIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAF 290
            SI + + D +E I+ VLV W ++G + +LD+ VIRIL+DLF++ +A++
Sbjct: 71   SILKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVL 130

Query: 291  YFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVL 470
            YFFRW E + +HS SI MIHILVSGNMN+RAVD+L L + G E V L
Sbjct: 131  YFFRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDL 190

Query: 471  KETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESK 650
            ET D RVLETV+S+L+ +RE+ ++ ALKL + IFPS GVC LLK IL
Sbjct: 191  FETRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVH 250

Query: 651  KMELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYT 827
            +ELA EF+ M RG NA ++SLFI +YC+ G +LL+ M++YG PD+V++T
Sbjct: 251  GLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFT 310

Query: 828  IVIDAFCKMGLLKE 869
            + ID CK G LKE
Sbjct: 311  VFIDKLCKAGFLKE 324

>gb|AAS99720.1| At2g19280 [Arabidopsis thaliana]
 gi|62319953|dbj|BAD94048.1| putative salt-inducible protein [Arabidopsis thaliana]
 gi|110738808|dbj|BAF01327.1| putative salt-inducible protein [Arabidopsis thaliana]
          Length = 693

 Score = 204 bits (519), Expect = 4e-50
 Identities = 114/254 (44%), Positives = 157/254 (61%), Gaps = 1/254 (0%)
 Frame = +3

Query: 111  SIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAF 290
            SI + + D +E I+ VLV W ++G + +LD+ VIRIL+DLF++ +A++
Sbjct: 71   SILKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVL 130

Query: 291  YFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVL 470
            YFFRW E + +HS SI MIHILVSGNMN+RAVD+L L + G E V L
Sbjct: 131  YFFRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDL 190

Query: 471  KETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESK 650
            +T D RVLETV+S+L+ +RE+ ++ ALKL + IFPS GVC LLK IL
Sbjct: 191  FKTRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVH 250

Query: 651  KMELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYT 827
            +ELA EF+ M RG NA ++SLFI +YC+ G +LL+ M++YG PD+V++T
Sbjct: 251  GLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFT 310

Query: 828  IVIDAFCKMGLLKE 869
            + ID CK G LKE
Sbjct: 311  VFIDKLCKAGFLKE 324

>ref|XP_002886049.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331889|gb|EFH62308.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp.
lyrata]
          Length = 755

 Score = 201 bits (511), Expect = 3e-49
 Identities = 111/254 (43%), Positives = 156/254 (61%), Gaps = 1/254 (0%)
 Frame = +3

Query: 111  SIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAF 290
            +I + + D +E I+ VL W ++G + +LD+ NVIRIL+DLF++ +A++A
Sbjct: 138  TILKNIDVPSDCVETIRNVLTKHSWIQKYESGFSTELDQYNVIRILDDLFEETLDASIAL 197

Query: 291  YFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVL 470
            YFFRW E + HS SI MIHILVSGNMN+RAVD+L L + G E V L
Sbjct: 198  YFFRWSELWIGVAHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGKERSLCLVIKDL 257

Query: 471  KETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESK 650
            ET D RVLETV+ ML+ ++E+ + ALKL + IFPS GVC L++ IL +
Sbjct: 258  FETRIDRRVLETVFCMLIDCCIKERKVDMALKLTYKIDQFGIFPSRGVCISLVEEILRAH 317

Query: 651  KMELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYT 827
            +ELA EF+ M RG NA ++SLFI +YC+ G +LL+ M++YG PD+V++T
Sbjct: 318  GLELAREFVEHMLSRGRHLNAALLSLFIRKYCSDGYFDKGWELLMGMKDYGIRPDIVAFT 377

Query: 828  IVIDAFCKMGLLKE 869
            + ID CK G L+E
Sbjct: 378  VFIDKLCKAGFLRE 391

>ref|XP_004147131.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280-like [Cucumis sativus]
 gi|449503522|ref|XP_004162044.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280-like [Cucumis sativus]
          Length = 532

 Score = 200 bits (509), Expect = 5e-49
 Identities = 111/268 (41%), Positives = 160/268 (59%)
 Frame = +3

Query: 66   YPSGKEISDFYCRESSIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDELNVIRI 245
            Y + + E + + +ED ME IKL+L NRG++LG Q L +IRI
Sbjct: 45   YDVNSDERSYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQ------LEIIRI 98

Query: 246  LNDLFDDNSNAALAFYFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRN 425
            L+ LF+D+S+A L Y+F+W C S S+ SIC M HILV+GNMNHRAVDL+ HL +N
Sbjct: 99   LDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKN 158

Query: 426  KDGGEEWHGSVFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPS 605
            E + V ET + LET SM+V+ Y++E+M+++AL L+D MK NIFPS
Sbjct: 159  YGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPS 218

Query: 606  IGVCNRLLKAILESKKMELAWEFLGEMHIRGMSNACIISLFIHEYCAKGSLATACKLLVE 785
            I V ++KA+L++ + +AW+ L EMH + +G+L K+L+E
Sbjct: 219  IWVYKSVIKALLQTNQSGMAWDLLEEMHRQ-----------------EGNLGKGWKVLLE 261

Query: 786  MRNYGYEPDVVSYTIVIDAFCKMGLLKE 869
            +RN+G +PDVV YT VI++ CK+ LLKE
Sbjct: 262  LRNFGSKPDVVDYTTVINSLCKVSLLKE 289

>ref|XP_006300135.1| hypothetical protein CARUB_v10016364mg [Capsella rubella]
 gi|482568844|gb|EOA33033.1| hypothetical protein CARUB_v10016364mg [Capsella rubella]
          Length = 696

 Score = 192 bits (489), Expect = 1e-46
 Identities = 111/254 (43%), Positives = 153/254 (60%), Gaps = 1/254 (0%)
 Frame = +3

Query: 111  SIFEKLTNEEDGMERIKLVLVNRGWDLGIKNGQTIDLDELNVIRILNDLFDDNSNAALAF 290
            SI + D +E I+ VL+ W ++G + +LD+ +VIRIL+DLF++ +A++A
Sbjct: 79   SILRNIEVPNDCVETIRDVLMKHSWIQKHESGFSSELDQYSVIRILDDLFEETLDASIAL 138

Query: 291  YFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVL 470
            YFFRW E + +HS SI MIHILVSGNMN+RAVD+L L + G E V L
Sbjct: 139  YFFRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEESSLCLVMNDL 198

Query: 471  KETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESK 650
            ET D RVLETV+ +L+ V+E+ ALKL M IFPS GVC LL+ IL
Sbjct: 199  FETRIDRRVLETVFCILIDCCVKERKTDMALKLTYKMDQFGIFPSPGVCVSLLEDILRVH 258

Query: 651  KMELAWEFLGEMHIRGMS-NACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYT 827
            +ELA EF+ M RG NA ++SLF+ +YC+ G +LL+ M YG PD+V++T
Sbjct: 259  GLELAREFVELMLSRGRHLNASVLSLFVSKYCSDGYFDKGWELLMGMNYYGIRPDIVAFT 318

Query: 828  IVIDAFCKMGLLKE 869
            ++ + CK G LKE
Sbjct: 319  VLANKLCKAGFLKE 332

>ref|XP_006409070.1| hypothetical protein EUTSA_v10023028mg, partial [Eutrema salsugineum]
 gi|557110232|gb|ESQ50523.1| hypothetical protein EUTSA_v10023028mg, partial [Eutrema salsugineum]
          Length = 562

 Score = 182 bits (461), Expect = 2e-43
 Identities = 108/227 (47%), Positives = 138/227 (60%), Gaps = 7/227 (3%)
 Frame = +3

Query: 210  TIDLDELNVIRILNDLFDDNSNAALAFYFFRWCECYSEFKHSICSICTMIHILVSGNMNH 389
            +I+LDE VIRIL+DLF + S+A++A YFFRW E + +HS SI MIHILVSGNMN
Sbjct: 10   SIELDEYKVIRILDDLFKETSDASIALYFFRWSELWIGAEHSSRSISRMIHILVSGNMNF 69

Query: 390  RAVDLLRHLTRNKDGGEEWHGSVFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKL 569
            RAVD+L L + G E + + ET D RVLE V+SMLV V+E+ + ALKL
Sbjct: 70   RAVDMLLRLVKRCGGEERPLCLLMNDIFETRSDRRVLEAVFSMLVDCCVQERKVDMALKL 129

Query: 570  VDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEM-HIRGMSNACIISLFIHEYCA 746
            M IFPS GVC LLK IL +ELA EF+ M R NA ++SLFI +YC
Sbjct: 130  TYKMDQFGIFPSRGVCISLLKQILRIHGLELAHEFVEHMISGRRHLNAAVLSLFISKYCF 189

Query: 747  KGSLATACKLLVEMRNYGYEPDVVSYT------IVIDAFCKMGLLKE 869
            G +LL+ M+ YG PDVV++T VI+ FCK+G +E
Sbjct: 190  DGCFDKGWELLIGMKQYGIRPDVVAFTDSVSVSSVIEGFCKVGKPEE 236

>ref|XP_007010634.1| Pentatricopeptide repeat-containing protein, putative isoform 3 [Theobroma cacao]
 gi|508727547|gb|EOY19444.1| Pentatricopeptide repeat-containing protein, putative isoform 3 [Theobroma cacao]
          Length = 533

 Score = 153 bits (387), Expect = 7e-35
 Identities = 80/164 (48%), Positives = 106/164 (64%), Gaps = 1/164 (0%)
 Frame = +3

Query: 381  MNHRAVDLLRHLTRNKDGGEEWHGSVFTVLKETSRDSRVLETVYSMLVSSYVREKMISTA 560
            MNHRAVD + L R + + + ET D VLETV SMLV Y++E + A
Sbjct: 1    MNHRAVDFILRLVRISCSKDVSEDLLLKLFYETHSDRMVLETVCSMLVDCYIKENEVGLA 60

Query: 561  LKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMHIRGMS-NACIISLFIHE 737
            L+L MK N+ PSIGVCN LLKA+LE +++LAW+FL +M +G N I+SLFI +
Sbjct: 61   LELACKMKSFNMIPSIGVCNSLLKALLELNELDLAWDFLDQMLRQGSGLNVAIVSLFIDK 120

Query: 738  YCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLKE 869
            YC KG L +A L+EM+NYG +PDVV+YTI+ID+ CK+ L E
Sbjct: 121  YCRKGQLLSAWTFLMEMKNYGIKPDVVAYTIIIDSLCKVSCLGE 164

>ref|XP_002272603.2| PREDICTED: pentatricopeptide repeat-containing protein At5g55840-like [Vitis vinifera]
          Length = 2037

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 3/198 (1%)
 Frame = +3

Query: 282  LAFYFFRWC--ECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGS 455
            LA F +W + E KH C HILV M A +LRHL + G + S
Sbjct: 843  LALKFLKWVIKQPGLELKHLTHMYCLTAHILVKARMYDSAKSILRHLCQMGIGSK----S 898

Query: 456  VFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            +F L +T + +V+ +L+ Y++E MI A++ + + + PS+ CN +L +
Sbjct: 899  IFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVETFELVGLVGFKPSVYTCNMILAS 958

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            +++ K+ EL W EM +G+ N ++ I+ C +G+L A LL +M G+ P
Sbjct: 959  MVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCVEGNLKKAGNLLKQMEENGFVPT 1018

Query: 813  VVSYTIVIDAFCKMGLLK 866
            +V+Y +++ +CK G K
Sbjct: 1019 IVTYNTLLNWYCKKGRYK 1036

 Score = 57.4 bits (137), Expect = 7e-06
 Identities = 35/120 (29%), Positives = 59/120 (49%), Gaps = 1/120 (0%)
 Frame = +3

Query: 510  YSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMH 689
            Y+ L+ + AL+L+DHM+ + + LL + + +K ELA L M
Sbjct: 1127 YNALIGGHCHVGDFEEALRLLDHMEAAGLRLNEVTYGTLLNGLCKHEKFELAKRLLERMR 1186

Query: 690  IRGMSNACII-SLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            + M I ++ I C G L A +L+ M G PDV++Y+ +I+ FC++G +K
Sbjct: 1187 VNDMVVGHIAYTVLIDGLCKNGMLDEAVQLVGNMYKDGVNPDVITYSSLINGFCRVGNIK 1246

>emb|CBI18516.3| unnamed protein product [Vitis vinifera]
          Length = 967

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 3/198 (1%)
 Frame = +3

Query: 282  LAFYFFRWC--ECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGS 455
            LA F +W + E KH C HILV M A +LRHL + G + S
Sbjct: 92   LALKFLKWVIKQPGLELKHLTHMYCLTAHILVKARMYDSAKSILRHLCQMGIGSK----S 147

Query: 456  VFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            +F L +T + +V+ +L+ Y++E MI A++ + + + PS+ CN +L +
Sbjct: 148  IFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVETFELVGLVGFKPSVYTCNMILAS 207

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            +++ K+ EL W EM +G+ N ++ I+ C +G+L A LL +M G+ P
Sbjct: 208  MVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCVEGNLKKAGNLLKQMEENGFVPT 267

Query: 813  VVSYTIVIDAFCKMGLLK 866
            +V+Y +++ +CK G K
Sbjct: 268  IVTYNTLLNWYCKKGRYK 285

>emb|CAN75473.1| hypothetical protein VITISV_002797 [Vitis vinifera]
          Length = 1356

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 3/198 (1%)
 Frame = +3

Query: 282  LAFYFFRWC--ECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGS 455
            LA F +W + E KH C HILV M A +LRHL + G + S
Sbjct: 92   LALKFLKWVIKQPGLELKHLTHMYCLTAHILVKARMYDSAKSILRHLCQMGIGSK----S 147

Query: 456  VFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            +F L +T + +V+ +L+ Y++E MI A++ + + + PS+ CN +L +
Sbjct: 148  IFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVETFELVGLVGFKPSVYTCNMILAS 207

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            +++ K+ EL W EM +G+ N ++ I+ C +G+L A LL +M G+ P
Sbjct: 208  MVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCVEGNLKKAGNLLKQMEENGFVPT 267

Query: 813  VVSYTIVIDAFCKMGLLK 866
            +V+Y +++ +CK G K
Sbjct: 268  IVTYNTLLNWYCKKGRYK 285

 Score = 57.4 bits (137), Expect = 7e-06
 Identities = 35/120 (29%), Positives = 59/120 (49%), Gaps = 1/120 (0%)
 Frame = +3

Query: 510  YSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMH 689
            Y+ L+ + AL+L+DHM+ + + LL + + +K ELA L M
Sbjct: 376  YNALIGGHCHVGDFEEALRLLDHMEAAGLRLNEVTYGTLLNGLCKHEKFELAKRLLERMR 435

Query: 690  IRGMSNACII-SLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            + M I ++ I C G L A +L+ M G PDV++Y+ +I+ FC++G +K
Sbjct: 436  VNDMVVGHIAYTVLIDGLCKNGMLDEAVQLVGNMYKDGVNPDVITYSSLINGFCRVGNIK 495

>ref|XP_006828912.1| hypothetical protein AMTR_s00001p00203780 [Amborella trichopoda]
 gi|548833891|gb|ERM96328.1| hypothetical protein AMTR_s00001p00203780 [Amborella trichopoda]
          Length = 583

 Score = 92.0 bits (227), Expect = 3e-16
 Identities = 60/198 (30%), Positives = 90/198 (45%), Gaps = 7/198 (3%)
 Frame = +3

Query: 294  FFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGSVFTVLK 473
            FF+W F+H+I S C M H L M A+ LL+ + K G + SVF L
Sbjct: 73   FFKWVAAQKGFRHTIQSYCAMTHFLSLHRMVPEALSLLKTVVSRK--GRDSASSVFNALL 130

Query: 474  ETS------RDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            ET D R T + +L++ Y+ IS A++ + +K H CN L+
Sbjct: 131  ETQGEDVQCHDQRSRSTSFELLMNVYIDSGFISDAIQCLRLVKKHGFKLPFQACNYLMDC 190

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            I++S AW F E+ G N ++ +H + G + A L E+ N G P
Sbjct: 191  IMKSNTPAAAWAFYSEILDYGFPPNVYTFNMIMHSFSRIGKIKEAQLLFREIGNRGLTPS 250

Query: 813  VVSYTIVIDAFCKMGLLK 866
            VVS+ +I+ CK G L+
Sbjct: 251  VVSFNTLINGLCKKGDLE 268

 Score = 61.6 bits (148), Expect = 4e-07
 Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 1/120 (0%)
 Frame = +3

Query: 510  YSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMH 689
            YS+L++ RE+ I A L D M + P+ L+ + +E + +M
Sbjct: 289  YSVLINGLCRERKIEQACDLFDEMNERGLVPNSITFTTLIDGYCKEGNIEKGLQIYQKMM 348

Query: 690  IRGMSNACII-SLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            +G+ + + I+ +C G L A +L E+R G PD ++YT +ID FCK G +K
Sbjct: 349  KKGLKPDLVTYNSLIYGHCKVGELKEARELFFEIRKMGLRPDKITYTTLIDGFCKEGDIK 408

>ref|XP_007131879.1| hypothetical protein PHAVU_011G049000g [Phaseolus vulgaris]
 gi|561004879|gb|ESW03873.1| hypothetical protein PHAVU_011G049000g [Phaseolus vulgaris]
          Length = 439

 Score = 90.9 bits (224), Expect = 6e-16
 Identities = 57/200 (28%), Positives = 98/200 (49%), Gaps = 8/200 (4%)
 Frame = +3

Query: 282  LAFYFFRWCECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHG--- 452
            LA FF W + S H++ S +IH+L G ++ A ++R R+ D ++ +
Sbjct: 80   LALRFFLWTKSKSLCHHNLASYSAIIHLLARGRLSSDASHVIRTAIRDSDQTDDQNCRFA 139

Query: 453  ----SVFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCN 620
            ++F L +T RD V+ +L+ + + + + ++++V + I P + N
Sbjct: 140  SPPLNLFETLVKTYRDFGSAPFVFDLLIKACLDSRKVDPSVEIVRMLLSRGISPKVSTLN 199

Query: 621  RLLKAILESKKMELAWEFLG-EMHIRGMSNACIISLFIHEYCAKGSLATACKLLVEMRNY 797
            L+ + S+ ++ E L EM NA S+ + +C +G + A KL EMRN
Sbjct: 200  SLITGVCRSRGVDEGMEELWHEMRSNCKPNAYSYSVLMTAFCDEGRMGYAEKLWEEMRNE 259

Query: 798  GYEPDVVSYTIVIDAFCKMG 857
            EPDVVSY +I FCK+G
Sbjct: 260  KIEPDVVSYNTIIGGFCKIG 279

>ref|XP_002522775.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223538013|gb|EEF39626.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 1071

 Score = 90.9 bits (224), Expect = 6e-16
 Identities = 60/198 (30%), Positives = 95/198 (47%), Gaps = 3/198 (1%)
 Frame = +3

Query: 282  LAFYFFRWC--ECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGS 455
            LA F W + E +H + HILV + A +L+HL++ G + S
Sbjct: 27   LALKFLNWVIQQPGLELRHLTHMLSITTHILVRARLYENAKSILKHLSQMGVGSK----S 82

Query: 456  VFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            VF L T + +V+ +L+ Y+RE M+ AL+ M + PS+ CN LL
Sbjct: 83   VFGALMNTYPLCKSNPSVFDLLIRVYLREGMVGDALETFRLMGIRGFNPSVYTCNMLLGK 142

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            +++ +K+ W F EM R + + ++ I+ C +G L A LL +M GY P
Sbjct: 143  LVKERKVGAVWLFFKEMLARRVCPDVSTFNILINVLCVEGKLKKAGYLLKKMEESGYVPS 202

Query: 813  VVSYTIVIDAFCKMGLLK 866
            VV+Y V++ +CK G K
Sbjct: 203  VVTYNTVLNWYCKKGRYK 220

 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 1/120 (0%)
 Frame = +3

Query: 510  YSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKAILESKKMELAWEFLGEMH 689
            Y+ L+ + + AL +++ M+ P+ + LL + K EL+ L M
Sbjct: 311  YNALIDGHCHDGNFEQALTILEMMEATGPKPNEVSYSALLNGLCRHAKFELSKSILERMR 370

Query: 690  IRGMSNACII-SLFIHEYCAKGSLATACKLLVEMRNYGYEPDVVSYTIVIDAFCKMGLLK 866
            + GM CI + I C G L + KLL +M G PDVV+++++I+ FC++G +K
Sbjct: 371  MNGMIVGCIAYTAMIDGLCRNGLLNESVKLLDKMLKDGVVPDVVTFSVLINGFCRVGKIK 430

>ref|XP_007138443.1| hypothetical protein PHAVU_009G209400g [Phaseolus vulgaris]
 gi|561011530|gb|ESW10437.1| hypothetical protein PHAVU_009G209400g [Phaseolus vulgaris]
          Length = 1054

 Score = 90.1 bits (222), Expect = 1e-15
 Identities = 57/198 (28%), Positives = 95/198 (47%), Gaps = 3/198 (1%)
 Frame = +3

Query: 282  LAFYFFRWC--ECYSEFKHSICSICTMIHILVSGNMNHRAVDLLRHLTRNKDGGEEWHGS 455
            LA F W + E KH ICT HILV M + A L+H+ + G S
Sbjct: 38   LALKFLNWVIKQRNLELKHVTHIICTTTHILVRARMYNFAKTTLKHMLQLPIG----LNS 93

Query: 456  VFTVLKETSRDSRVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRLLKA 635
            VF L ET V+ +L+ +R+KM+ A++ M + PS+ CN +L +
Sbjct: 94   VFCALMETYPICNSNPAVFDLLIRVCLRDKMVGDAVQTFYLMGFRGLKPSVYTCNMVLGS 153

Query: 636  ILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGYEPD 812
            +++ +K+++ W F EM +G+ N ++ ++ C +G +A LL +M G P
Sbjct: 154  LVKEQKVDMFWSFFKEMLTKGICPNVATFNILLNALCQRGKFKSAGFLLRKMEESGVYPT 213

Query: 813  VVSYTIVIDAFCKMGLLK 866
            +Y +++ +CK G K
Sbjct: 214  AATYNTLLNWYCKKGRYK 231

 Score = 57.4 bits (137), Expect = 7e-06
 Identities = 37/142 (26%), Positives = 67/142 (47%), Gaps = 2/142 (1%)
 Frame = +3

Query: 450  GSVFTVLKETSRDS-RVLETVYSMLVSSYVREKMISTALKLVDHMKMHNIFPSIGVCNRL 626
            G V ++L+ D V YS ++ + + A++L+D M +++ P + + L
Sbjct: 371  GLVSSILERMRMDGVGVGHISYSAMIDGLCKNGRLEEAVQLLDDMLKNSVSPDVVTFSVL 430

Query: 627  LKAILESKKMELAWEFLGEMHIRGM-SNACIISLFIHEYCAKGSLATACKLLVEMRNYGY 803
            + K+ A E + +M+ G+ N+ + S I+ YC G L A M + GY
Sbjct: 431  INGFFRVGKINNAKEIMCKMYKTGLVPNSILYSTLIYNYCKMGYLKEALNAYAIMNHSGY 490

Query: 804  EPDVVSYTIVIDAFCKMGLLKE 869
            D + ++I AFC+ G L+E
Sbjct: 491  AADHFTCNVLIAAFCRCGRLEE 512