BLASTX nr result
ID: Catharanthus23_contig00010370
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010370
         (1244 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002272339.1| PREDICTED: pentatricopeptide repeat-containi...   223   2e-76
gb|EXB44293.1| hypothetical protein L484_012212 [Morus notabilis]     186   9e-65
gb|EOY19442.1| Pentatricopeptide repeat-containing protein, puta...   186   1e-56
ref|XP_004308191.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   187   1e-56
ref|XP_002525572.1| pentatricopeptide repeat-containing protein,...   196   4e-56
gb|EMJ20675.1| hypothetical protein PRUPE_ppa021440mg, partial [...   157   2e-51
gb|EOY19444.1| Pentatricopeptide repeat-containing protein, puta...   186   2e-44
ref|NP_179518.1| pentatricopeptide repeat-containing protein [Ar...   179   3e-42
ref|XP_004147131.1| PREDICTED: pentatricopeptide repeat-containi...   133   6e-42
gb|AAS99720.1| At2g19280 [Arabidopsis thaliana] gi|62319953|dbj|...   177   8e-42
ref|XP_002886049.1| pentatricopeptide repeat-containing protein ...   174   5e-41
ref|XP_006300135.1| hypothetical protein CARUB_v10016364mg [Caps...   168   5e-39
ref|XP_006409070.1| hypothetical protein EUTSA_v10023028mg, part...   145   3e-32
ref|XP_004144290.1| PREDICTED: pentatricopeptide repeat-containi...    88   6e-15
ref|XP_002272603.2| PREDICTED: pentatricopeptide repeat-containi...    81   1e-12
emb|CBI18516.3| unnamed protein product [Vitis vinifera]               81   1e-12
emb|CAN75473.1| hypothetical protein VITISV_002797 [Vitis vinifera]    81   1e-12
ref|XP_006842657.1| hypothetical protein AMTR_s00077p00196020 [A...    80   1e-12
ref|XP_006853118.1| hypothetical protein AMTR_s00038p00140720 [A... 
70 3e-12 ref|XP_004141186.1| PREDICTED: pentatricopeptide repeat-containi... 79 5e-12 >ref|XP_002272339.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Vitis vinifera] Length = 644 Score = 223 bits (569), Expect(2) = 2e-76 Identities = 113/195 (57%), Positives = 140/195 (71%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N++A+DL+LHL+ NSGE W N+ LK I ETHT R+VL TVY ML++CYVKEN++ VAL Sbjct: 113 NHKAMDLLLHLISYNSGEEGWHNIFLK-IHETHTKRRVLETVYGMLVNCYVKENMTQVAL 171 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 L +++ LNIFP IGVCNSLL++ L SEQ++LAW+FL+EM+SQ LN SII+LFI Y Sbjct: 172 KLICKMRHLNIFPLIGVCNSLLKALLESEQLNLAWDFLKEMKSQGLGLNASIISLFISGY 231 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXX 52 C +G+ GWKLLM M+ GI DVVAYT IDSLCKMSLLKEA S++FKM Q Sbjct: 232 CSQGNIDTGWKLLMEMKYLGIKPDVVAYTIVIDSLCKMSLLKEATSILFKMTQMGVFLDS 291 Query: 51 XXXXXXXXXLCKVGK 7 CKVGK Sbjct: 292 VSVSSVVDGYCKVGK 306 Score = 90.9 bits (224), Expect(2) = 2e-76 Identities = 43/83 (51%), Positives = 64/83 (77%) Frame = -2 Query: 841 NNELGRMRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEW 662 ++E+ ++V+L + GW LGS Y RI L+++ +++IL+DLF+++ DAALAL FFRW E+ Sbjct: 31 DDEMEIIKVILTNRGWNLGSQNGY-RIDLSQFNVMKILNDLFEESTDAALALYFFRWSEY 89 Query: 661 YMGSESTIRSTCTMTHILVAGNM 593 MGS+ T+ S CTM HILV+GNM Sbjct: 90 CMGSKHTVESVCTMIHILVSGNM 112 Score = 62.0 bits (149), Expect = 5e-07 Identities = 37/158 (23%), Positives = 68/158 (43%) Frame = -3 Query: 474 VTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLE 295 V YN L++ Y K+ A L ++ + P + N L+ ++ ++ A + L+ Sbjct: 428 VVSYNTLMNGYGKKGHLQKAFELLSMMRSAGVSPDLVTYNILIHGLIKRGLVNEAKDILD 487 Query: 294 EMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMS 115 E+ + F+ +V T I + KG+F+ + L M +H + DVV + ++ C+ Sbjct: 488 ELTRRGFSPDVVTFTNIIGGFSNKGNFEEAFLLFFYMSEHHLEPDVVTCSALLNGYCRTR 547 Query: 114 LLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKID 1 + EA L KM+ C +G ID Sbjct: 548 
CMAEANVLFHKMLDAGLKADVILYNSLIHGFCSLGNID 585 >gb|EXB44293.1| hypothetical protein L484_012212 [Morus notabilis] Length = 710 Score = 186 bits (473), Expect(2) = 9e-65 Identities = 97/196 (49%), Positives = 132/196 (67%) Frame = -3 Query: 588 YRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALN 409 +RA+DLILHLVR+ E ++ L L+V++ETHT R + V +ML++CY+KE N AL Sbjct: 181 HRAMDLILHLVRRYKEEESYSFL-LEVLYETHTERMIFEIVCSMLVNCYIKEKCLNAALK 239 Query: 408 LTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYC 229 LT QLKQ NIFPS V N++LR + S+Q++LAW++LE +QS+ LN S I+LFIH YC Sbjct: 240 LTCQLKQHNIFPSDRVSNAMLRELIGSKQLELAWDWLEIIQSRGMGLNASTISLFIHYYC 299 Query: 228 IKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXX 49 +G+F+ GWKLL RM+ +G+ DV++YT ID+LCKMS EA SL+FKM Q Sbjct: 300 KEGNFESGWKLLCRMRDYGVKPDVISYTIIIDALCKMSCPIEATSLVFKMTQLGISPDAV 359 Query: 48 XXXXXXXXLCKVGKID 1 KVG+ID Sbjct: 360 CVSSIVDGYSKVGRID 375 Score = 89.0 bits (219), Expect(2) = 9e-65 Identities = 44/82 (53%), Positives = 59/82 (71%) Frame = -2 Query: 835 ELGRMRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYM 656 E+GR+ VL++ GW L SP Y R+ L+E I+RI+DDLF+++ DA LAL FF W E + Sbjct: 100 EVGRITRVLKNRGWDLTSPNGY-RVKLSEVNIIRIMDDLFEESSDAELALYFFTWSESRI 158 Query: 655 GSESTIRSTCTMTHILVAGNMK 590 GS+ T+RS C M HIL +GNMK Sbjct: 159 GSKHTVRSVCRMIHILASGNMK 180 Score = 66.2 bits (160), Expect = 3e-08 Identities = 38/158 (24%), Positives = 70/158 (44%) Frame = -3 Query: 474 VTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLE 295 V VYN L+D Y ++ L +K N+ P + N+L+ S + ++ A + + Sbjct: 495 VVVYNSLMDGYGEKGHLQKVFELFDMMKSSNVCPDVVTYNTLIHSLVMRGFVNEAEDVFD 554 Query: 294 EMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMS 115 E+ + F +V T I + KG+F+ + + M +H + DVV + ++ C+ Sbjct: 555 ELTERGFCPDVVTFTTLIDGFSKKGNFEEAFLVWFYMSEHRVEPDVVTCSAILNGYCRRH 614 Query: 114 LLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKID 1 ++EA +L KM+ C VG +D Sbjct: 615 RMEEAKALFQKMLNIGLKPDLRLYNNLIYGFCSVGNMD 652 >gb|EOY19442.1| Pentatricopeptide repeat-containing 
protein, putative isoform 1 [Theobroma cacao] gi|508727546|gb|EOY19443.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao] Length = 661 Score = 186 bits (471), Expect(2) = 1e-56 Identities = 94/195 (48%), Positives = 125/195 (64%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N+RAVD IL LVR + + +L+LK+ +ETH+ R VL TV +ML+DCY+KEN +AL Sbjct: 130 NHRAVDFILRLVRISCSKDVSEDLLLKLFYETHSDRMVLETVCSMLVDCYIKENEVGLAL 189 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 L ++K N+ PSIGVCNSLL++ L ++DLAW+FL++M Q LNV+I++LFI Y Sbjct: 190 ELACKMKSFNMIPSIGVCNSLLKALLELNELDLAWDFLDQMLRQGSGLNVAIVSLFIDKY 249 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXX 52 C KG W LM M+ +GI DVVAYT IDSLCK+S L EA SL+FK+ + Sbjct: 250 CRKGQLLSAWTFLMEMKNYGIKPDVVAYTIIIDSLCKVSCLGEATSLLFKITRLGISPDS 309 Query: 51 XXXXXXXXXLCKVGK 7 CK GK Sbjct: 310 VLVSSVVEGHCKAGK 324 Score = 62.4 bits (150), Expect(2) = 1e-56 Identities = 35/86 (40%), Positives = 53/86 (61%) Frame = -2 Query: 850 AQSNNELGRMRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRW 671 +Q N L ++ +L GW + +P+N I NE ++ IL LF+++ DA LAL FF+ Sbjct: 45 SQVCNPLSLIKSILWKRGWNI-NPDNLCPIDFNESSVIGILTHLFEESLDAELALYFFKL 103 Query: 670 LEWYMGSESTIRSTCTMTHILVAGNM 593 E +GS +++S C M HILV+GNM Sbjct: 104 SERCVGSLHSVKSVCKMIHILVSGNM 129 >ref|XP_004308191.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g19280-like [Fragaria vesca subsp. 
vesca] Length = 599 Score = 187 bits (474), Expect(2) = 1e-56 Identities = 93/173 (53%), Positives = 130/173 (75%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N+RAVDL+ HLVR ++ E NL+L+V++ TH+ +VL TV +ML+D Y+KE + N+AL Sbjct: 80 NHRAVDLVRHLVRNHTEEET-CNLLLEVLYGTHSETRVLETVCSMLVDQYIKEGMVNMAL 138 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 N+T++ K NIFPS GVCN+LLR+ L S Q++ AW+FLE MQ++ LN +II+LFIH + Sbjct: 139 NVTYETKGQNIFPSGGVCNTLLRALLESNQLNFAWDFLEVMQTRGLGLNSTIISLFIHKF 198 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQ 73 C +GD G+KLL+ M+++GI DVV Y IDSLC+MS LKEA +L+FKM Q Sbjct: 199 CREGDLGSGFKLLVDMKKYGIQPDVVXYAIVIDSLCRMSYLKEATTLLFKMTQ 251 Score = 61.2 bits (147), Expect(2) = 1e-56 Identities = 31/78 (39%), Positives = 49/78 (62%) Frame = -2 Query: 826 RMRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSE 647 R+ ++L W G Y I N++ IV++L+ LF+++ DA LAL FF+W E GS+ Sbjct: 3 RIMLILAKRPWSRGCQNGYN-IYRNQFNIVKVLNYLFEESLDANLALYFFKWSECCNGSK 61 Query: 646 STIRSTCTMTHILVAGNM 593 +++ C M HILV+GN+ Sbjct: 62 HMVQAACRMVHILVSGNI 79 >ref|XP_002525572.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223535151|gb|EEF36831.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 687 Score = 196 bits (497), Expect(2) = 4e-56 Identities = 102/200 (51%), Positives = 134/200 (67%), Gaps = 3/200 (1%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSG---EGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSN 421 NYR +DLIL LVR G E +L+ K++++T K L TVY+ML+DCYV E+ + Sbjct: 142 NYRVMDLILFLVRNIGGAVGEEELCDLLFKLVYDTGFGTKDLETVYSMLVDCYVTESKVS 201 Query: 420 VALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFI 241 +ALNL ++K LNIFPS+GVCNSLL++ LRS Q+DLAW+ LE MQS LN SI++LFI Sbjct: 202 LALNLIHEIKLLNIFPSMGVCNSLLKALLRSHQLDLAWDILEGMQSFGMHLNASILSLFI 261 Query: 240 HSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXX 61 SYC +G+ Q GWK+LM M+ +GI ADV+AYT ID+LCK+S +K A SL+FKMI Sbjct: 
262 ESYCAEGNIQSGWKILMEMKNYGIKADVIAYTIVIDALCKISCVKVATSLLFKMIHCGIS 321 Query: 60 XXXXXXXXXXXXLCKVGKID 1 CK G+ D Sbjct: 322 VDSVSVSSVIDGYCKKGRSD 341 Score = 50.8 bits (120), Expect(2) = 4e-56 Identities = 28/68 (41%), Positives = 40/68 (58%) Frame = -2 Query: 802 CGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSESTIRSTCT 623 C W LG + L++ ++ +L+DLF ++ +AA AL FFR + G E TIRS C Sbjct: 73 CVWSLGCSTRFIT-DLSQVSVLGVLNDLFGESFNAAFALYFFRLSQCCSGLEHTIRSLCR 131 Query: 622 MTHILVAG 599 + HILV G Sbjct: 132 LIHILVYG 139 >gb|EMJ20675.1| hypothetical protein PRUPE_ppa021440mg, partial [Prunus persica] Length = 675 Score = 157 bits (398), Expect(2) = 2e-51 Identities = 85/197 (43%), Positives = 125/197 (63%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N+RAVDLIL LVR N G+ N +L+V+ ETH+ +VL T +ML++ Y++E + N+AL Sbjct: 182 NHRAVDLILRLVR-NHGDEESCNSLLEVLDETHSEIRVLETTCSMLVNGYIQEGMVNMAL 240 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 + Q+K LNIFPS G +S +LAW+FLE M+++ LN ++++LFI+ Y Sbjct: 241 KIACQMKHLNIFPSNGDQSSS----------ELAWDFLEVMRTRGMGLNAAMMSLFINKY 290 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXX 52 C +GD + GWKLL+ M+ +GI DVV++T I+SLCKMS L EA +L+FKM Q Sbjct: 291 CSEGDLESGWKLLLEMKNYGIQPDVVSFTIVINSLCKMSYLNEATALLFKMTQLGISPDP 350 Query: 51 XXXXXXXXXLCKVGKID 1 CK+G+ + Sbjct: 351 VLLSSIIDGHCKLGQTE 367 Score = 73.2 bits (178), Expect(2) = 2e-51 Identities = 57/192 (29%), Positives = 90/192 (46%), Gaps = 8/192 (4%) Frame = -2 Query: 1144 MRASTSIISFASSGFKLTFGRQRRSRLLSSCNLAL----LRXXXXXXXXXXXXXDNHIDI 977 M SII+ +S KL F R+ R SS N AL L DN I + Sbjct: 1 MTGLLSIINVYASQLKLIFRRRSTLRYYSSVNSALSSIILSEDETSTLEDTVAADNGIFL 60 Query: 976 DDQYVWQNLKHARDYECLFEERCLQL----YFDLNDNRNDTKNLGTAQSNNELGRMRVVL 809 + + + + C + C + F +N+ ++ +E+ R+ ++L Sbjct: 61 SAKSYPTDFRGINELYCGEDGVCEPVDTGFLFSINERPDE----------DEMKRLMLIL 110 Query: 808 ESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSESTIRST 629 GW LG Y I LN+ + +L+DLF+++ DA L L 
FF+W E GS+ T+++ Sbjct: 111 AKRGWNLGCQNGYN-IYLNQLNTIELLNDLFEESFDAKLVLYFFKWSECCSGSKHTLQTI 169 Query: 628 CTMTHILVAGNM 593 C M HILV+GN+ Sbjct: 170 CRMIHILVSGNL 181 >gb|EOY19444.1| Pentatricopeptide repeat-containing protein, putative isoform 3 [Theobroma cacao] Length = 533 Score = 186 bits (471), Expect = 2e-44 Identities = 94/195 (48%), Positives = 125/195 (64%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N+RAVD IL LVR + + +L+LK+ +ETH+ R VL TV +ML+DCY+KEN +AL Sbjct: 2 NHRAVDFILRLVRISCSKDVSEDLLLKLFYETHSDRMVLETVCSMLVDCYIKENEVGLAL 61 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 L ++K N+ PSIGVCNSLL++ L ++DLAW+FL++M Q LNV+I++LFI Y Sbjct: 62 ELACKMKSFNMIPSIGVCNSLLKALLELNELDLAWDFLDQMLRQGSGLNVAIVSLFIDKY 121 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXX 52 C KG W LM M+ +GI DVVAYT IDSLCK+S L EA SL+FK+ + Sbjct: 122 CRKGQLLSAWTFLMEMKNYGIKPDVVAYTIIIDSLCKVSCLGEATSLLFKITRLGISPDS 181 Query: 51 XXXXXXXXXLCKVGK 7 CK GK Sbjct: 182 VLVSSVVEGHCKAGK 196 >ref|NP_179518.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334184304|ref|NP_001189552.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546774|sp|Q6NKW7.2|PP164_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g19280 gi|3135258|gb|AAC16458.1| putative salt-inducible protein [Arabidopsis thaliana] gi|330251769|gb|AEC06863.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|330251770|gb|AEC06864.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 693 Score = 179 bits (453), Expect = 3e-42 Identities = 99/226 (43%), Positives = 137/226 (60%), Gaps = 5/226 (2%) Frame = -3 Query: 669 WNGIWVQRAQSVPHAQ*HISWLPEI*-----NYRAVDLILHLVRKNSGEGWWANLVLKVI 505 W+ +W+ V H+ IS + I NYRAVD++L LV+K SGE LV+K + Sbjct: 135 WSELWI----GVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDL 190 Query: 504 
FETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSE 325 FET R+VL TV+++L+DC ++E N+AL LT+++ Q IFPS GVC SLL+ LR Sbjct: 191 FETRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVH 250 Query: 324 QIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYT 145 ++LA F+E M S+ LN ++++LFI YC G F GW+LLM M+ +GI D+VA+T Sbjct: 251 GLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFT 310 Query: 144 TFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGK 7 FID LCK LKEA S++FK+ F CKVGK Sbjct: 311 VFIDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGK 356 Score = 70.1 bits (170), Expect = 2e-09 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 5/96 (5%) Frame = -2 Query: 823 MRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSES 644 +R VL W+ + L++Y ++RILDDLF++T DA++ L FFRW E ++G E Sbjct: 86 IRNVLVKHNWIQKYESGFST-ELDQYTVIRILDDLFEETLDASIVLYFFRWSELWIGVEH 144 Query: 643 TIRSTCTMTHILVAGNMKLQS-----C*SNTASGEE 551 + RS M HILV+GNM ++ C SGEE Sbjct: 145 SSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEE 180 >ref|XP_004147131.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280-like [Cucumis sativus] gi|449503522|ref|XP_004162044.1| PREDICTED: pentatricopeptide repeat-containing protein At2g19280-like [Cucumis sativus] Length = 532 Score = 133 bits (335), Expect(2) = 6e-42 Identities = 73/173 (42%), Positives = 105/173 (60%) Frame = -3 Query: 591 NYRAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVAL 412 N+RAVDLI HLV+ ++++LKV ETH RK L T +M+++CY+KE + AL Sbjct: 145 NHRAVDLISHLVKNYGCTEGSSSILLKVFCETHNGRKTLETTCSMMVNCYIKERMVTSAL 204 Query: 411 NLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSY 232 L Q+K LNIFPSI V S++++ L++ Q +AW+ LEEM Q+ Sbjct: 205 ILIDQMKHLNIFPSIWVYKSVIKALLQTNQSGMAWDLLEEMHRQE--------------- 249 Query: 231 CIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQ 73 G+ GWK+L+ ++ G DVV YTT I+SLCK+SLLKEA +L +M + Sbjct: 250 ---GNLGKGWKVLLELRNFGSKPDVVDYTTVINSLCKVSLLKEATALDVEMAE 299 Score = 65.9 bits (159), Expect(2) = 6e-42 Identities = 58/186 (31%), 
Positives = 86/186 (46%), Gaps = 2/186 (1%) Frame = -2 Query: 1144 MRASTSIISFASSGFKLTFGRQRRSRLLSSCNLALLRXXXXXXXXXXXXXDNHIDIDDQY 965 MR++ SIISF S KL F R+ R ++ N L NH+D D Sbjct: 1 MRSAFSIISFCS---KLNFRRKTPCRYSATANSEL-------------SSFNHMDED--- 41 Query: 964 VWQNLKHARDYECLFEERCLQLYFDLNDNRNDTK-NLGTAQSNNELGRMRVVLESCGWVL 788 +Y+ +ER N+ + + G +E+ ++++L + G+ L Sbjct: 42 -------CTNYDVNSDERSYV--------GNEVEVSKGQKTDEDEMETIKLILGNRGFNL 86 Query: 787 GS-PENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSESTIRSTCTMTHI 611 GS P+ E I+RILD LF+ + DA L L +F+W GS ++ S C M HI Sbjct: 87 GSCPKQLE--------IIRILDVLFEDSSDAGLCLYYFKWSGCLSGSNQSLESICRMAHI 138 Query: 610 LVAGNM 593 LVAGNM Sbjct: 139 LVAGNM 144 Score = 74.3 bits (181), Expect = 1e-10 Identities = 40/158 (25%), Positives = 74/158 (46%) Frame = -3 Query: 474 VTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLE 295 V VYN+L+D Y K+ + A L ++ N+ P + N+L+ + + A + L+ Sbjct: 314 VVVYNILMDAYGKKGYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILD 373 Query: 294 EMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMS 115 E+ + F+++V T IH Y +G+F+ + L M ++ ++ DVV + + C+ Sbjct: 374 ELIRRGFSVDVVTYTNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREK 433 Query: 114 LLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKID 1 + EA +L KM+ C VG +D Sbjct: 434 RMDEANALFCKMLDIGLKPDLILYNTLIHGFCSVGNVD 471 >gb|AAS99720.1| At2g19280 [Arabidopsis thaliana] gi|62319953|dbj|BAD94048.1| putative salt-inducible protein [Arabidopsis thaliana] gi|110738808|dbj|BAF01327.1| putative salt-inducible protein [Arabidopsis thaliana] Length = 693 Score = 177 bits (449), Expect = 8e-42 Identities = 98/226 (43%), Positives = 137/226 (60%), Gaps = 5/226 (2%) Frame = -3 Query: 669 WNGIWVQRAQSVPHAQ*HISWLPEI*-----NYRAVDLILHLVRKNSGEGWWANLVLKVI 505 W+ +W+ V H+ IS + I NYRAVD++L LV+K SGE LV+K + Sbjct: 135 WSELWI----GVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDL 190 Query: 504 FETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSE 325 F+T R+VL TV+++L+DC ++E N+AL LT+++ Q IFPS GVC SLL+ LR Sbjct: 191 
FKTRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVH 250 Query: 324 QIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYT 145 ++LA F+E M S+ LN ++++LFI YC G F GW+LLM M+ +GI D+VA+T Sbjct: 251 GLELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFT 310 Query: 144 TFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGK 7 FID LCK LKEA S++FK+ F CKVGK Sbjct: 311 VFIDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGK 356 Score = 70.1 bits (170), Expect = 2e-09 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 5/96 (5%) Frame = -2 Query: 823 MRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSES 644 +R VL W+ + L++Y ++RILDDLF++T DA++ L FFRW E ++G E Sbjct: 86 IRNVLVKHNWIQKYESGFST-ELDQYTVIRILDDLFEETLDASIVLYFFRWSELWIGVEH 144 Query: 643 TIRSTCTMTHILVAGNMKLQS-----C*SNTASGEE 551 + RS M HILV+GNM ++ C SGEE Sbjct: 145 SSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEE 180 >ref|XP_002886049.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297331889|gb|EFH62308.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 755 Score = 174 bits (442), Expect = 5e-41 Identities = 97/226 (42%), Positives = 136/226 (60%), Gaps = 5/226 (2%) Frame = -3 Query: 669 WNGIWVQRAQSVPHAQ*HISWLPEI*-----NYRAVDLILHLVRKNSGEGWWANLVLKVI 505 W+ +W+ V H+ IS + I NYRAVD++L LV+K SG+ LV+K + Sbjct: 202 WSELWI----GVAHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGKERSLCLVIKDL 257 Query: 504 FETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSE 325 FET R+VL TV+ ML+DC +KE ++AL LT+++ Q IFPS GVC SL+ LR+ Sbjct: 258 FETRIDRRVLETVFCMLIDCCIKERKVDMALKLTYKIDQFGIFPSRGVCISLVEEILRAH 317 Query: 324 QIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYT 145 ++LA F+E M S+ LN ++++LFI YC G F GW+LLM M+ +GI D+VA+T Sbjct: 318 GLELAREFVEHMLSRGRHLNAALLSLFIRKYCSDGYFDKGWELLMGMKDYGIRPDIVAFT 377 Query: 144 TFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGK 7 FID LCK L+EA S++FK+ F CKVGK Sbjct: 378 VFIDKLCKAGFLREATSVLFKLKLFGISQDSVSVSSVIDGFCKVGK 423 Score = 67.4 bits (163), Expect = 1e-08 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 5/96 (5%) Frame = -2 Query: 823 MRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSES 644 +R VL W+ + L++Y ++RILDDLF++T DA++AL FFRW E ++G Sbjct: 153 IRNVLTKHSWIQKYESGFST-ELDQYNVIRILDDLFEETLDASIALYFFRWSELWIGVAH 211 Query: 643 TIRSTCTMTHILVAGNMKLQS-----C*SNTASGEE 551 + RS M HILV+GNM ++ C SG+E Sbjct: 212 SSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGKE 247 >ref|XP_006300135.1| hypothetical protein CARUB_v10016364mg [Capsella rubella] gi|482568844|gb|EOA33033.1| hypothetical protein CARUB_v10016364mg [Capsella rubella] Length = 696 Score = 168 bits (425), Expect = 5e-39 Identities = 95/226 (42%), Positives = 132/226 (58%), Gaps = 5/226 (2%) Frame = -3 Query: 669 WNGIWVQRAQSVPHAQ*HISWLPEI*-----NYRAVDLILHLVRKNSGEGWWANLVLKVI 505 W+ +W+ V H+ IS + I NYRAVD++L LV+K SGE LV+ + Sbjct: 143 WSELWI----GVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEESSLCLVMNDL 198 Query: 504 FETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSE 325 FET R+VL TV+ +L+DC VKE +++AL LT+++ Q IFPS GVC SLL LR Sbjct: 199 
FETRIDRRVLETVFCILIDCCVKERKTDMALKLTYKMDQFGIFPSPGVCVSLLEDILRVH 258 Query: 324 QIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYT 145 ++LA F+E M S+ LN S+++LF+ YC G F GW+LLM M +GI D+VA+T Sbjct: 259 GLELAREFVELMLSRGRHLNASVLSLFVSKYCSDGYFDKGWELLMGMNYYGIRPDIVAFT 318 Query: 144 TFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGK 7 + LCK LKEA +++FK+ F CKVGK Sbjct: 319 VLANKLCKAGFLKEATAILFKLKHFGISLDSVSVSSVIDGFCKVGK 364 Score = 72.0 bits (175), Expect = 5e-10 Identities = 41/102 (40%), Positives = 60/102 (58%), Gaps = 5/102 (4%) Frame = -2 Query: 841 NNELGRMRVVLESCGWVLGSPENYERISLNEYIIVRILDDLFDKTGDAALALSFFRWLEW 662 N+ + +R VL W+ + L++Y ++RILDDLF++T DA++AL FFRW E Sbjct: 88 NDCVETIRDVLMKHSWIQKHESGFSS-ELDQYSVIRILDDLFEETLDASIALYFFRWSEL 146 Query: 661 YMGSESTIRSTCTMTHILVAGNMKLQS-----C*SNTASGEE 551 ++G E + RS M HILV+GNM ++ C SGEE Sbjct: 147 WIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEE 188 >ref|XP_006409070.1| hypothetical protein EUTSA_v10023028mg, partial [Eutrema salsugineum] gi|557110232|gb|ESQ50523.1| hypothetical protein EUTSA_v10023028mg, partial [Eutrema salsugineum] Length = 562 Score = 145 bits (366), Expect = 3e-32 Identities = 83/205 (40%), Positives = 125/205 (60%), Gaps = 11/205 (5%) Frame = -3 Query: 669 WNGIWVQRAQSVPHAQ*HISWLPEI*-----NYRAVDLILHLVRKNSGEGWWANLVLKVI 505 W+ +W+ H+ IS + I N+RAVD++L LV++ GE L++ I Sbjct: 41 WSELWI----GAEHSSRSISRMIHILVSGNMNFRAVDMLLRLVKRCGGEERPLCLLMNDI 96 Query: 504 FETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSE 325 FET + R+VL V++ML+DC V+E ++AL LT+++ Q IFPS GVC SLL+ LR Sbjct: 97 FETRSDRRVLEAVFSMLVDCCVQERKVDMALKLTYKMDQFGIFPSRGVCISLLKQILRIH 156 Query: 324 QIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYT 145 ++LA F+E M S + LN ++++LFI YC G F GW+LL+ M+Q+GI DVVA+T Sbjct: 157 GLELAHEFVEHMISGRRHLNAAVLSLFISKYCFDGCFDKGWELLIGMKQYGIRPDVVAFT 216 Query: 144 ------TFIDSLCKMSLLKEAVSLM 88 + I+ CK+ +EAV L+ Sbjct: 217 DSVSVSSVIEGFCKVGKPEEAVKLI 241 Score = 72.0 bits (175), Expect = 5e-10 Identities = 33/61 (54%), Positives = 46/61 
(75%) Frame = -2 Query: 763 ISLNEYIIVRILDDLFDKTGDAALALSFFRWLEWYMGSESTIRSTCTMTHILVAGNMKLQ 584 I L+EY ++RILDDLF +T DA++AL FFRW E ++G+E + RS M HILV+GNM + Sbjct: 11 IELDEYKVIRILDDLFKETSDASIALYFFRWSELWIGAEHSSRSISRMIHILVSGNMNFR 70 Query: 583 S 581 + Sbjct: 71 A 71 >ref|XP_004144290.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] gi|449522905|ref|XP_004168466.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] Length = 915 Score = 88.2 bits (217), Expect = 6e-15 Identities = 51/157 (32%), Positives = 75/157 (47%) Frame = -3 Query: 474 VTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLE 295 V YN L+D Y K+ LS AL + ++ N P+ N L+ F R + I A + L Sbjct: 379 VVTYNALIDGYCKKGLSASALEILSLMESNNCSPNARTYNELILGFCRGKNIHKAMSLLH 438 Query: 294 EMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMS 115 +M ++ NV + IH C +GD +KLL M + G+ D Y+ FID+LCK Sbjct: 439 KMLERKLQPNVVTYNILIHGQCKEGDLGSAYKLLSLMNESGLVPDEWTYSVFIDTLCKRG 498 Query: 114 LLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKI 4 L++EA SL + + CKVGK+ Sbjct: 499 LVEEARSLFESLKEKGIKANEVIYSTLIDGYCKVGKV 535 Score = 63.5 bits (153), Expect = 2e-07 Identities = 39/130 (30%), Positives = 65/130 (50%), Gaps = 1/130 (0%) Frame = -3 Query: 465 YNMLLDCYVKE-NLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEM 289 YN L+D Y KE N L + +K+ +I P+ L+ + L+ ++ D A + ++M Sbjct: 557 YNSLIDGYCKEKNFKEARLLVDIMIKR-DIEPAADTYTILIDNLLKDDEFDQAHDMFDQM 615 Query: 288 QSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLL 109 S +V I T FIH+YC G + L+ +M GI D + YT FID+ + + Sbjct: 616 LSTGSHPDVFIYTAFIHAYCSHGRLKDAEVLICKMNAKGIMPDTMLYTLFIDAYGRFGSI 675 Query: 108 KEAVSLMFKM 79 A ++ +M Sbjct: 676 DGAFGILKRM 685 >ref|XP_002272603.2| PREDICTED: pentatricopeptide repeat-containing protein At5g55840-like [Vitis vinifera] Length = 2037 Score = 80.9 bits (198), Expect = 1e-12 Identities = 45/170 (26%), Positives = 88/170 (51%), Gaps = 1/170 (0%) Frame = -3 Query: 582 
AVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLT 403 A ++ HL + G + + + +T+ + +V+++L+ Y+KE + + A+ T Sbjct: 882 AKSILRHLCQMGIG----SKSIFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVE-T 936 Query: 402 FQLKQLNIF-PSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCI 226 F+L L F PS+ CN +L S ++ ++ +L W+ EM + NV + I+ C+ Sbjct: 937 FELVGLVGFKPSVYTCNMILASMVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCV 996 Query: 225 KGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMI 76 +G+ + LL +M+++G +V Y T ++ CK K A+ L+ MI Sbjct: 997 EGNLKKAGNLLKQMEENGFVPTIVTYNTLLNWYCKKGRYKAAIELIDYMI 1046 Score = 62.4 bits (150), Expect = 4e-07 Identities = 40/154 (25%), Positives = 68/154 (44%) Frame = -3 Query: 465 YNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQ 286 YN L++ +VKE VA + ++ + ++ P+ N+L+ + A L+ M+ Sbjct: 1092 YNTLINGFVKEGKIGVAAQVFNEMSKFDLSPNCVTYNALIGGHCHVGDFEEALRLLDHME 1151 Query: 285 SQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLK 106 + LN ++ C F+ +LL RM+ + + +AYT ID LCK +L Sbjct: 1152 AAGLRLNEVTYGTLLNGLCKHEKFELAKRLLERMRVNDMVVGHIAYTVLIDGLCKNGMLD 1211 Query: 105 EAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKI 4 EAV L+ M + C+VG I Sbjct: 1212 EAVQLVGNMYKDGVNPDVITYSSLINGFCRVGNI 1245 >emb|CBI18516.3| unnamed protein product [Vitis vinifera] Length = 967 Score = 80.9 bits (198), Expect = 1e-12 Identities = 45/170 (26%), Positives = 88/170 (51%), Gaps = 1/170 (0%) Frame = -3 Query: 582 AVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLT 403 A ++ HL + G + + + +T+ + +V+++L+ Y+KE + + A+ T Sbjct: 131 AKSILRHLCQMGIG----SKSIFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVE-T 185 Query: 402 FQLKQLNIF-PSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCI 226 F+L L F PS+ CN +L S ++ ++ +L W+ EM + NV + I+ C+ Sbjct: 186 FELVGLVGFKPSVYTCNMILASMVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCV 245 Query: 225 KGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMI 76 +G+ + LL +M+++G +V Y T ++ CK K A+ L+ MI Sbjct: 246 EGNLKKAGNLLKQMEENGFVPTIVTYNTLLNWYCKKGRYKAAIELIDYMI 295 >emb|CAN75473.1| hypothetical protein VITISV_002797 [Vitis 
vinifera] Length = 1356 Score = 80.9 bits (198), Expect = 1e-12 Identities = 45/170 (26%), Positives = 88/170 (51%), Gaps = 1/170 (0%) Frame = -3 Query: 582 AVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLT 403 A ++ HL + G + + + +T+ + +V+++L+ Y+KE + + A+ T Sbjct: 131 AKSILRHLCQMGIG----SKSIFGALMDTYPLCNSIPSVFDLLIRVYLKEGMIDYAVE-T 185 Query: 402 FQLKQLNIF-PSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCI 226 F+L L F PS+ CN +L S ++ ++ +L W+ EM + NV + I+ C+ Sbjct: 186 FELVGLVGFKPSVYTCNMILASMVKDKRTELVWSLFREMSDKGICPNVGTFNILINGLCV 245 Query: 225 KGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKMI 76 +G+ + LL +M+++G +V Y T ++ CK K A+ L+ MI Sbjct: 246 EGNLKKAGNLLKQMEENGFVPTIVTYNTLLNWYCKKGRYKAAIELIDYMI 295 Score = 62.4 bits (150), Expect = 4e-07 Identities = 40/154 (25%), Positives = 68/154 (44%) Frame = -3 Query: 465 YNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQ 286 YN L++ +VKE VA + ++ + ++ P+ N+L+ + A L+ M+ Sbjct: 341 YNTLINGFVKEGKIGVAAQVFNEMSKFDLSPNCVTYNALIGGHCHVGDFEEALRLLDHME 400 Query: 285 SQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLK 106 + LN ++ C F+ +LL RM+ + + +AYT ID LCK +L Sbjct: 401 AAGLRLNEVTYGTLLNGLCKHEKFELAKRLLERMRVNDMVVGHIAYTVLIDGLCKNGMLD 460 Query: 105 EAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKI 4 EAV L+ M + C+VG I Sbjct: 461 EAVQLVGNMYKDGVNPDVITYSSLINGFCRVGNI 494 >ref|XP_006842657.1| hypothetical protein AMTR_s00077p00196020 [Amborella trichopoda] gi|548844743|gb|ERN04332.1| hypothetical protein AMTR_s00077p00196020 [Amborella trichopoda] Length = 504 Score = 80.5 bits (197), Expect = 1e-12 Identities = 50/166 (30%), Positives = 81/166 (48%), Gaps = 1/166 (0%) Frame = -3 Query: 585 RAVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNL 406 +A++LI+ L++ + + V + + ++ V +ML+ CYV+ + L Sbjct: 37 QAMELIIELIKTHECQ------VFENLIQSMDECNWNPVVLDMLIKCYVQLGRIDEGLES 90 Query: 405 TFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCI 226 ++ + PSI CNSLL LRS I+ W+ EEM N + + H+ C Sbjct: 91 
FKKVIGFGLVPSINACNSLLNGLLRSNAINTCWDIYEEMGRVGIPPNSYTLNILTHALCR 150 Query: 225 KGDFQGGWKLLMRMQQ-HGISADVVAYTTFIDSLCKMSLLKEAVSL 91 KGDF + L RM++ G+ D+V Y T ID CK L +A+ L Sbjct: 151 KGDFDRVTEFLERMEEREGLDLDLVTYNTLIDGYCKRDKLGDALYL 196 >ref|XP_006853118.1| hypothetical protein AMTR_s00038p00140720 [Amborella trichopoda] gi|548856757|gb|ERN14585.1| hypothetical protein AMTR_s00038p00140720 [Amborella trichopoda] Length = 855 Score = 70.5 bits (171), Expect(2) = 3e-12 Identities = 47/168 (27%), Positives = 77/168 (45%) Frame = -3 Query: 582 AVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLT 403 A +LI H + NS G A+ + + ET V++++L+ Y + +L Sbjct: 141 ARNLIKHSLSANSSIG--ASAFIDRLLETSERCNSHPRVFDLVLNGYTRYGSVTESLETY 198 Query: 402 FQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIK 223 +L +FPS+G N LL +R ID AW+ EM + L+ + +H+ Sbjct: 199 HRLVSNGVFPSVGCINLLLNKLVRLNFIDEAWDLYREMVERGVDLDCQTLDAMVHACSKG 258 Query: 222 GDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKM 79 G + L M+ G D V+YT I +LCK + K+A L+ +M Sbjct: 259 GKLEEAEGLFQEMRIRGCKLDSVSYTNIIQALCKKTCSKKACELLTEM 306 Score = 28.9 bits (63), Expect(2) = 3e-12 Identities = 13/42 (30%), Positives = 20/42 (47%) Frame = -2 Query: 733 ILDDLFDKTGDAALALSFFRWLEWYMGSESTIRSTCTMTHIL 608 +++ L D+ AL +FRW E G + C + HIL Sbjct: 91 VVEVLLSNQTDSKAALRYFRWAERQRGFIRGLEPLCVVLHIL 132 Score = 59.7 bits (143), Expect = 2e-06 Identities = 39/135 (28%), Positives = 69/135 (51%) Frame = -3 Query: 477 LVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFL 298 +VT ++ CY E++ A NL Q+++ + P++ NS+++ FL+ + A + Sbjct: 380 IVTFAVLIEGCYRNEDMVK-AHNLYGQMQERGLSPNVFTVNSMIKGFLKKGMFNEALEYF 438 Query: 297 EEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKM 118 EE + A NV + I C KG + L +M GI DVV+Y T + LC+ Sbjct: 439 EEAVESKVA-NVFTFDIIIFWLCKKGRVREASGLWEKMVSFGIIPDVVSYNTLLFGLCRE 497 Query: 117 SLLKEAVSLMFKMIQ 73 ++ A++L+ +M Q Sbjct: 498 GNIQGALNLLNQMTQ 512 >ref|XP_004141186.1| PREDICTED: pentatricopeptide repeat-containing protein At5g55840-like [Cucumis 
sativus] Length = 1079 Score = 78.6 bits (192), Expect = 5e-12 Identities = 42/168 (25%), Positives = 85/168 (50%) Frame = -3 Query: 582 AVDLILHLVRKNSGEGWWANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLT 403 A ++ HL +KNSG +N + V+ +T+ V+++L+ Y+++ + A+N Sbjct: 73 AKSILKHLAQKNSG----SNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTF 128 Query: 402 FQLKQLNIFPSIGVCNSLLRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIK 223 + PS+ CN ++ S +++ + L W+F ++M + + NVS + I C++ Sbjct: 129 SSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQ 188 Query: 222 GDFQGGWKLLMRMQQHGISADVVAYTTFIDSLCKMSLLKEAVSLMFKM 79 G + +L M+++G +V+Y T + CK K A+ L+ M Sbjct: 189 GKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHM 236 Score = 59.3 bits (142), Expect = 3e-06 Identities = 40/175 (22%), Positives = 77/175 (44%) Frame = -3 Query: 528 ANLVLKVIFETHTSRKVLVTVYNMLLDCYVKENLSNVALNLTFQLKQLNIFPSIGVCNSL 349 A V + E + S ++ YN+L++ Y AL + ++ ++ P+ +L Sbjct: 299 ATRVFNEMIELNLSPNLIT--YNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTL 356 Query: 348 LRSFLRSEQIDLAWNFLEEMQSQQFALNVSIITLFIHSYCIKGDFQGGWKLLMRMQQHGI 169 L +S + D+A N LE + +LN T+ I C G ++LL+ M + G+ Sbjct: 357 LNGLYKSAKFDVARNILERYCINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGV 416 Query: 168 SADVVAYTTFIDSLCKMSLLKEAVSLMFKMIQFXXXXXXXXXXXXXXXLCKVGKI 4 D++ ++ I+ CK+ + +A +M K+ + CKVG + Sbjct: 417 HPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNV 471