BLASTX nr result
ID: Cocculus23_contig00027391
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00027391
         (823 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CBI26162.3| unnamed protein product [Vitis vinifera]             186   9e-45
ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, part...  181   3e-43
ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containi...  179   8e-43
ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containi...  170   7e-40
ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containi...  166   1e-38
ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containi...  160   5e-37
ref|XP_003617724.1| Pentatricopeptide repeat-containing protein ...  159   9e-37
ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containi...  157   3e-36
ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containi...  153   7e-35
ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containi...  151   3e-34
ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containi...  151   3e-34
ref|XP_007035355.1| Tetratricopeptide repeat-like superfamily pr...  147   4e-33
ref|XP_007153452.1| hypothetical protein PHAVU_003G036500g [Phas...  145   2e-32
gb|ABD96889.1| hypothetical protein [Cleome spinosa]                 140   4e-31
ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Caps...  139   2e-30
ref|XP_002873920.1| pentatricopeptide repeat-containing protein ...  135   2e-29
ref|NP_197396.1| pentatricopeptide repeat-containing protein [Ar...  131   3e-28
ref|XP_006844767.1| hypothetical protein AMTR_s00016p00258280 [A...  105   2e-20
ref|XP_007035356.1| Tetratricopeptide repeat-like superfamily pr...   82   3e-13
ref|XP_007217442.1| hypothetical protein PRUPE_ppb015972mg [Prun...   75   2e-11

>emb|CBI26162.3| unnamed protein product [Vitis vinifera]
          Length = 636

 Score = 186 bits (472), Expect = 9e-45
 Identities = 102/245 (41%), Positives = 145/245 (59%)
 Frame = -1

Query: 736 LKTQ*LIMARTHQSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDG 557
           +KTQ + + SS+ + + +N + R + G+++ H+ T +
Sbjct: 40  VKTQTSMARPSSSSSVISFLRQNPNSRIRNLGVVSGNQY----GDVEGAHQHT----QQQ 91

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q +I ++V +I R+RPRWEQTLLS+FPS +P + ++HQ NAL+S+RFF+WL
Sbjct: 92  QHLEEIVKRVSDITRTRPRWEQTLLSDFPSFNFLDPTFLSHFVEHQKNALISLRFFHWLS 151

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           + GFSP+ S SCN LFD LV A A A S L S + F P+ SLE+ IRCLCK
Sbjct: 152 SQS---GFSPD--SSSCNVLFDALVEAGACNAAKSFLDSTN--FNPKPASLEAYIRCLCK 204

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
           G VEEAI VF ++K IGV +++ WNS L S+R + D W+LYGEM+ S +V D T
Sbjct: 205 GGLVEEAISVFGQLKGIGVCASIATWNSVLRGSVRAGRIDFVWELYGEMVESSVVADVHT 264

Query: 16  AGYLI 2
           GYL+
Sbjct: 265 VGYLV 269

>ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, partial [Citrus clementina]
 gi|557521640|gb|ESR33007.1| hypothetical protein CICLE_v10007051mg, partial [Citrus clementina]
          Length = 540

 Score = 181 bits (459), Expect = 3e-43
 Identities = 97/192 (50%), Positives = 126/192 (65%), Gaps = 2/192 (1%)
 Frame = -1

Query: 571 IVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRF 392
           I E Q +T+IA++VC+I R++PRWEQTLLS+FPS +P + +K QNN LLSIRF
Sbjct: 1   IKESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRF 60

Query: 391 FNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLI 212
           F WL +H GFSP+ SCN LFD LV A+A KVA L A F P SLE I
Sbjct: 61  FQWLHSHY---GFSPD--LDSCNVLFDSLVEARAFKVAKEFL--AITGFSPNPNSLELYI 113

Query: 211 RCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLV 32
           +CLC+ G +EEA VFS++K +GV ++ WNSAL ++V +TD+ W LY +M+ SG+V
Sbjct: 114 QCLCESGMIEEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIV 173

Query: 31  GDGD--TAGYLI 2
           D D T GYLI
Sbjct: 174 ADVDAETIGYLI 185

>ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Citrus sinensis]
          Length = 589

 Score = 179 bits (455), Expect = 8e-43
 Identities = 95/190 (50%), Positives = 124/190 (65%), Gaps = 2/190 (1%)
 Frame = -1

Query: 565 EDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFN 386
           E Q +T+IA++VC+I R++PRWEQTLLS+FPS +P + +K QNN LLSIRFF
Sbjct: 35  ESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRFFQ 94

Query: 385 WLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRC 206
           WL +H GFSP+ SCN LFD LV A+A KVA L F P SLE I+C
Sbjct: 95  WLHSHY---GFSPD--LDSCNVLFDSLVEARAFKVAMDFLDITG--FSPNPNSLELYIQC 147

Query: 205 LCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGD 26
           LC+ G +EEA VFS++K +GV ++ WNSAL ++V +TD+ W LY +M+ SG+V D
Sbjct: 148 LCESGMIEEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIVAD 207

Query: 25  GD--TAGYLI 2
           D T GYLI
Sbjct: 208 VDAETIGYLI 217

>ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like isoform X1 [Cicer arietinum]
 gi|502099479|ref|XP_004491489.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like isoform X2 [Cicer arietinum]
          Length = 598

 Score = 170 bits (430), Expect = 7e-40
 Identities = 89/187 (47%), Positives = 122/187 (65%), Gaps = 2/187 (1%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q+ TDI ++C+I R++PRWE TLLS++PS +P + HQNN+ LS+RF +WL
Sbjct: 51  QKLTDIVDEICKITRTKPRWENTLLSQYPSFNFSDPNFFLLYLNHQNNSFLSLRFLHWLS 110

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +H FSP+ SCN LFD LV A+A K A SLL + F P+ SLES IRCL
Sbjct: 111 SH---CSFSPDQ--SSCNVLFDALVDAEACKAAKSLLD--YPGFTPKPASLESYIRCLIN 163

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DG 23
           G VE+A+DVF +K +G +P+VS +N++L A L+V +TD+ W LY M+ SG+V D
Sbjct: 164 GGMVEDALDVFVTLKKVGFLPSVSTFNASLLACLKVGRTDLVWTLYERMLESGIVASIDV 223

Query: 22  DTAGYLI 2
           +T GYLI
Sbjct: 224 ETVGYLI 230

>ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Solanum tuberosum]
          Length = 601

 Score = 166 bits (420), Expect = 1e-38
 Identities = 81/185 (43%), Positives = 123/185 (66%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q F +IA+ VC+++R+RPRWEQ LLS+FP+ +P+ +++K Q N +LS+RF WL
Sbjct: 55  QSFAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNVMLSLRFHFWLS 114

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           + GFS + +S +F LV AKA+ A Q+ + FVP+ + LE+ I+CLC+
Sbjct: 115 SQN---GFSRDQFSDE--VIFSGLVQAKAASAAKCFRQNMN--FVPQPSCLEAYIQCLCE 167

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
           +G +E+A+DVF+ ++ +G P++ IWNSAL S+R +TDI W LY +M SG+V D DT
Sbjct: 168 NGLIEDALDVFTELRGVGHCPSLRIWNSALSDSIRAGRTDIVWKLYEDMTESGVVADVDT 227

Query: 16  AGYLI 2
           G+LI
Sbjct: 228 IGHLI 232

>ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Fragaria vesca subsp. vesca]
          Length = 382

 Score = 160 bits (405), Expect = 5e-37
 Identities = 94/238 (39%), Positives = 131/238 (55%)
 Frame = -1

Query: 715 MARTHQSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIA 536
           MAR SSI + +N H R D Q RH +V ET +D T +A
Sbjct: 1   MARA-SSSILTFLRQNPHARQKPDT--QIRHLSV----------ET----KDCSDLTQVA 43

Query: 535 RKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMG 356
           +++C ++R++PRWE TL SE+PS +P + +++K Q+N LS+RFF WL T + G
Sbjct: 44  QQICHVIRTKPRWENTLSSEYPSSNFSDPLFIREVVKQQSNVFLSVRFFLWLGTRE---G 100

Query: 355 FSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEA 176
           FSP+ S CNA+F LV A A S ++ H F PE LES RCL + G V+EA
Sbjct: 101 FSPDPIS--CNAVFGALVEGNACSAAKSFIK--HTGFSPEPVLLESYARCLWEAGRVKEA 156

Query: 175 IDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTAGYLI 2
           VF R+K GV P + WN+AL ++ +TD+ W LY EMM G+ D +T L+
Sbjct: 157 SSVFKRLKEAGVCPGIGTWNAALSGCIKARRTDMVWKLYQEMMEYGVAADVETVECLV 214

>ref|XP_003617724.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355519059|gb|AET00683.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
          Length = 861

 Score = 159 bits (403), Expect = 9e-37
 Identities = 89/188 (47%), Positives = 117/188 (62%), Gaps = 3/188 (1%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q FT ++C I RS+PRWE TL+S++PS NPK +KHQNN LS+RF +WL
Sbjct: 29  QNFTQTLNEICTITRSKPRWENTLISQYPSFNFSNPKFFLSYLKHQNNTFLSLRFLHWLT 88

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +H GF P+ SCNALFD LV A A K A SLL+ + FVP++ SLE +R L +
Sbjct: 89  SH---CGFKPD--QSSCNALFDALVDAGAVKAAKSLLE--YPDFVPKNDSLEGYVRLLGE 141

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG---D 26
           +G VEE DVF +K +G +P+ S +N L A L+V +TD+ W LY M+ SG VG D
Sbjct: 142 NGMVEEVFDVFVSLKKVGFLPSASSFNVCLLACLKVGRTDLVWKLYELMIESG-VGVNID 200

Query: 25  GDTAGYLI 2
           +T G LI
Sbjct: 201 VETVGCLI 208

>ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Solanum lycopersicum]
          Length = 601

 Score = 157 bits (398), Expect = 3e-36
 Identities = 78/183 (42%), Positives = 119/183 (65%)
 Frame = -1

Query: 550 FTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTH 371
           F +IA+ VC+++R+RPRWEQ LLS+FP+ +P+ +++K Q N +LS+RF WL +
Sbjct: 57  FAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNIMLSLRFHFWLSSQ 116

Query: 370 QDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDG 191
           GFS + +S +F LV AKA+ A Q+ FVP+ LE+ I+CLC++G
Sbjct: 117 N---GFSRDQFSDE--VIFSGLVQAKAASAAKCFRQNMI--FVPQPNCLEAYIQCLCENG 169

Query: 190 FVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTAG 11
           +E+A+DVF+ ++++G P++ IWNSAL S+R +TD W LY +M SG+V D T G
Sbjct: 170 LIEDALDVFTELRSVGHCPSLRIWNSALSDSIRAGRTDTVWKLYEDMTESGVVADVGTIG 229

Query: 10  YLI 2
           +LI
Sbjct: 230 HLI 232

>ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like isoform X1 [Glycine max]
 gi|571466579|ref|XP_006583703.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like isoform X2 [Glycine max]
          Length = 577

 Score = 153 bits (387), Expect = 7e-35
 Identities = 84/179 (46%), Positives = 113/179 (63%), Gaps = 2/179 (1%)
 Frame = -1

Query: 532 KVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGF 353
           ++C I R++PRWE TLLS++PS +P +KHQNNA LS+RFF+WLC+ GF
Sbjct: 53  EICRITRTKPRWEDTLLSQYPSFNFKDPSFFLLYLKHQNNAFLSLRFFHWLCS---SCGF 109

Query: 352 SPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAI 173
           SP+ SCN LF LV A A K+A SLL S F PE SLE I+CL G VE+A+
Sbjct: 110 SPD--QSSCNVLFQVLVDAGAGKLAKSLLDSP--GFTPEPASLEGYIQCLSGAGMVEDAV 165

Query: 172 DVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DGDTAGYLI 2
           D+ +K + P+V+ WN++L LR +TD+ W LY +MM SG+V + +T GYLI
Sbjct: 166 DM---LKRVVFCPSVATWNASLLGCLRARRTDLVWTLYEQMMESGVVASINVETVGYLI 221

>ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Cucumis sativus]
          Length = 638

 Score = 151 bits (381), Expect = 3e-34
 Identities = 84/210 (40%), Positives = 122/210 (58%), Gaps = 4/210 (1%)
 Frame = -1

Query: 619 TVGG--GEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPK 446
           T GG G + + E ++ + + ++IA +V +++RS+PRWEQ+LLS++PS +P
Sbjct: 68  TNGGNNGREEIESSEKLLNLTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPS 127

Query: 445 CVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLL 266
           +++K NN LS+RFF WL + + + + SCN LFD L+ AKA A S L
Sbjct: 128 FFSELLKQLNNVFLSLRFFLWLSSQPEFL-----PHPVSCNKLFDALLEAKACVPAKSFL 182

Query: 265 QSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVP 86
           S +F PE SLE+ IRC+C+ G VEEA+ F +K G P V WN A + L+
Sbjct: 183 YS--FEFSPEPASLENYIRCVCEGGLVEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFG 240

Query: 85  KTDIFWDLYGEMMASGLVGDGD--TAGYLI 2
           +TD+ W LY MM +G+ D D T GYLI
Sbjct: 241 RTDLIWKLYEGMMETGVQKDVDIETVGYLI 270

>ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18950-like [Cucumis sativus]
          Length = 602

 Score = 151 bits (381), Expect = 3e-34
 Identities = 84/210 (40%), Positives = 122/210 (58%), Gaps = 4/210 (1%)
 Frame = -1

Query: 619 TVGG--GEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPK 446
           T GG G + + E ++ + + ++IA +V +++RS+PRWEQ+LLS++PS +P
Sbjct: 32  TNGGNNGREEIESSEKLLNLTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPS 91

Query: 445 CVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLL 266
           +++K NN LS+RFF WL + + + + SCN LFD L+ AKA A S L
Sbjct: 92  FFSELLKQLNNVFLSLRFFLWLSSQPEFL-----PHPVSCNKLFDALLEAKACVPAKSFL 146

Query: 265 QSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVP 86
           S +F PE SLE+ IRC+C+ G VEEA+ F +K G P V WN A + L+
Sbjct: 147 YS--FEFSPEPASLENYIRCVCEGGLVEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFG 204

Query: 85  KTDIFWDLYGEMMASGLVGDGD--TAGYLI 2
           +TD+ W LY MM +G+ D D T GYLI
Sbjct: 205 RTDLIWKLYEGMMETGVQKDVDIETVGYLI 234

>ref|XP_007035355.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao]
 gi|508714384|gb|EOY06281.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 610

 Score = 147 bits (372), Expect = 4e-33
 Identities = 79/177 (44%), Positives = 111/177 (62%)
 Frame = -1

Query: 544 DIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQD 365
           DI ++VC+I R+ PRWE+ LLS+FPS +P ++++ Q N LS+ FF+WL + D
Sbjct: 63  DIVKQVCKITRTIPRWEENLLSKFPSFNFSDPVFFRELLRQQENVFLSLCFFHWLRSKYD 122

Query: 364 MMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFV 185
           FSP+ SCN LFD+LV A A K A + L+ F PE +LE +R LC+ G V
Sbjct: 123 ---FSPD--LDSCNVLFDKLVEANACKAARNFLEQTG--FSPEPRALELYLRRLCEVGLV 175

Query: 184 EEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTA 14
           EEA+++FS + IG P+V+ WN AL A L+V + D W LY +M+ SG+V D D A
Sbjct: 176 EEAVEMFSMLNKIGYRPSVATWNLALLAFLKVGRNDFVWKLYQDMIDSGVVVDIDVA 232

>ref|XP_007153452.1| hypothetical protein PHAVU_003G036500g [Phaseolus vulgaris]
 gi|561026806|gb|ESW25446.1| hypothetical protein PHAVU_003G036500g [Phaseolus vulgaris]
          Length = 593

 Score = 145 bits (365), Expect = 2e-32
 Identities = 81/179 (45%), Positives = 111/179 (62%), Gaps = 2/179 (1%)
 Frame = -1

Query: 532 KVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGF 353
           ++C I RS+PRWE LLS +PS +P + HQNNALLS+RFF+WLC+ GF
Sbjct: 54  EICRITRSKPRWEDNLLSLYPSFNFSDPSFFLLYLNHQNNALLSLRFFHWLCS---SCGF 110

Query: 352 SPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAI 173
           SP+ S NALF LV A A K A +LL PE SLE I+CL + G VE+A+
Sbjct: 111 SPD--QASYNALFCALVDAGACKAAKALLDCP--GLTPEPASLEGYIQCLSRTGMVEDAV 166

Query: 172 DVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DGDTAGYLI 2
           D+ +K +G P+V+ WN++L + LR +T++ W LY +MM SG+V + +T GYLI
Sbjct: 167 DM---LKQVGFCPSVTTWNASLLSCLRAGRTNLVWTLYEQMMESGVVASINVETVGYLI 222

>gb|ABD96889.1| hypothetical protein [Cleome spinosa]
          Length = 719

 Score = 140 bits (354), Expect = 4e-31
 Identities = 72/180 (40%), Positives = 112/180 (62%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           + +T++A+ V I R +PRWEQTL+S+FPS +P +++ QNN LLS+RFF WLC
Sbjct: 56  RNYTEMAKIVATITREKPRWEQTLVSDFPSFNFADPLFFRELVATQNNVLLSLRFFQWLC 115

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           T+ D +P+ S N LF+ L+ AKA + A + A F+P+S SLE ++CLC
Sbjct: 116 TNHDC---TPDPISS--NMLFEALLDAKAVRAAKMVRDIA--GFIPDSASLEQYVKCLCG 168

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
           GF+EEAI+V+ ++K G+ ++ NS L L+ KT++ ++ Y EM+ +G D +T
Sbjct: 169 VGFIEEAIEVYFQLKEAGIRISIVACNSILSGCLKAGKTELLFEFYQEMIKAGTASDANT 228

>ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Capsella rubella]
 gi|482558371|gb|EOA22563.1| hypothetical protein CARUB_v10003224mg [Capsella rubella]
          Length = 486

 Score = 139 bits (349), Expect = 2e-30
 Identities = 80/220 (36%), Positives = 126/220 (57%)
 Frame = -1

Query: 700 QSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCE 521
           QS + +L+ K + + + Q R + + + K +E G +T++A+ V
Sbjct: 5   QSYLISLVRKRIRQNPNA----QMRSLALESRDSESKPDEQKS-AGGGTTYTEMAKTVST 59

Query: 520 IVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPED 341
           ++R R RW+QTL+S+FPS +P +++K QNN L S+ FF WLC++ D ++P+
Sbjct: 60  VMRERQRWQQTLVSDFPSFNFADPLFFRELLKSQNNVLFSLWFFRWLCSNYD---YAPDP 116

Query: 340 YSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFS 161
           S S LF L+ AKA K A S L + F PE T LE ++CL +DG VEEAIDV++
Sbjct: 117 ASLSL--LFGALLDAKAVKAAKSFLDTTG--FKPEPTLLEQYVKCLSEDGLVEEAIDVYN 172

Query: 160 RIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMAS 41
           +K +G+ P++ NS L ++ K D FW+L+ +MM S
Sbjct: 173 VLKEMGISPSIVTCNSVLLGCVKARKLDCFWELHQKMMES 212

>ref|XP_002873920.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319757|gb|EFH50179.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 483

 Score = 135 bits (340), Expect = 2e-29
 Identities = 78/198 (39%), Positives = 116/198 (58%)
 Frame = -1

Query: 634 QSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLF 455
           Q R TV + + K +E V +T++A+ V I+R R RW+QTL+S+FPS
Sbjct: 23  QIRSLTVESRDSESKPDEQKSAVS----YTEMAKTVSTIMRQRQRWQQTLVSDFPSFDFA 78

Query: 454 NPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAH 275
           +P Q++K QNN + S+ FF WLC++ D ++P+ S S N LF L+ KA K A
Sbjct: 79  DPLFFRQLLKSQNNVMFSLWFFRWLCSNYD---YTPD--SVSLNLLFGALLDGKAVKAAK 133

Query: 274 SLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASL 95
           S L + F PE T LE ++CL ++G VEEAI+V++ +K +G+ +V NS L L
Sbjct: 134 SFLDTT--GFKPEPTLLEQYVKCLSEEGLVEEAIEVYNVLKEMGISSSVVTCNSVLLGCL 191

Query: 94  RVPKTDIFWDLYGEMMAS 41
           + K D FW+L+ EM+ S
Sbjct: 192 KARKLDRFWELHKEMIES 209

>ref|NP_197396.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|223635758|sp|Q8GYM2.2|PP393_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g18950
 gi|332005249|gb|AED92632.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 483

 Score = 131 bits (329), Expect = 3e-28
 Identities = 77/198 (38%), Positives = 114/198 (57%)
 Frame = -1

Query: 634 QSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLF 455
           Q R TV + + K +E V +T++A+ V I+R R RW+QTL+S+FPS
Sbjct: 23  QIRSLTVESRDCESKPDEQKSAVS----YTEMAKTVSTIMRERQRWQQTLVSDFPSFDFA 78

Query: 454 NPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAH 275
           +P +++K QNN L S+ FF WLC++ D ++P S N LF L+ KA K A
Sbjct: 79  DPLFFGELLKSQNNVLFSLWFFRWLCSNYD---YTPGPV--SLNILFGALLDGKAVKAAK 133

Query: 274 SLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASL 95
           S L + F PE T LE ++CL ++G VEEAI+V++ +K +G+ +V NS L L
Sbjct: 134 SFLDTT--GFKPEPTLLEQYVKCLSEEGLVEEAIEVYNVLKDMGISSSVVTCNSVLLGCL 191

Query: 94  RVPKTDIFWDLYGEMMAS 41
           + K D FW+L+ EM+ S
Sbjct: 192 KARKLDRFWELHKEMVES 209

>ref|XP_006844767.1| hypothetical protein AMTR_s00016p00258280 [Amborella trichopoda]
 gi|548847238|gb|ERN06442.1| hypothetical protein AMTR_s00016p00258280 [Amborella trichopoda]
          Length = 696

 Score = 105 bits (263), Expect = 2e-20
 Identities = 60/156 (38%), Positives = 88/156 (56%)
 Frame = -1

Query: 469 SQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKA 290
           S P CV +++ ++ +LS RFF LC + G+ P+D +SCN + + LV A+
Sbjct: 177 SHDFLKPNCVHKVLGILDDTILSFRFFKRLCY---LEGYKPDD--ESCNMVLETLVSAEE 231

Query: 289 SKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSA 110
           K A ++L A +F P+ LE+LI + + EA VF +K G P++ +WNS
Sbjct: 232 YKTAKTVLSLA--EFRPQPQLLEALILGFSRTTKISEAFGVFLEMKRHGFSPSLLVWNSM 289

Query: 109 LYASLRVPKTDIFWDLYGEMMASGLVGDGDTAGYLI 2
           L LR +TD W++YGEMM SG+ GD T GYLI
Sbjct: 290 LLGFLRDLRTDRLWEIYGEMMESGVAGDATTYGYLI 325

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 49/150 (32%), Positives = 85/150 (56%)
 Frame = -1

Query: 643 RFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQ 464
           R + H G K E + + +++A KVC+ +R+ WE++L+S+F S
Sbjct: 31  RSNNQCHEKEEGEASKSSFSEGQEKANEEAELSEVAEKVCQRLRTHRLWEKSLVSDF-SP 89

Query: 463 TLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASK 284
           +F C+ +++++QNN LS RFF+WLC + GF P+ ++SC+ +F LV+AKA K
Sbjct: 90  YIFETTCIHKVLRNQNNPFLSFRFFSWLCLQE---GFKPD--TQSCDTIFGVLVNAKAWK 144

Query: 283 VAHSLLQSAHHQFVPESTSLESLIRCLCKD 194
           A ++L F P+ + LE+++ C+D
Sbjct: 145 AAITVLNLV--DFRPQPSVLEAMVAGFCRD 172

>ref|XP_007035356.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 2, partial [Theobroma cacao]
 gi|508714385|gb|EOY06282.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 2, partial [Theobroma cacao]
          Length = 535

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 48/108 (44%), Positives = 65/108 (60%)
 Frame = -1

Query: 337 SKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSR 158
           SKS N+L V A A K A + L+ F PE +LE +R LC+ G VEEA+++FS
Sbjct: 56  SKSNNSL----VEANACKAARNFLEQTG--FSPEPRALELYLRRLCEVGLVEEAVEMFSM 109

Query: 157 IKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTA 14
           + IG P+V+ WN AL A L+V + D W LY +M+ SG+V D D A
Sbjct: 110 LNKIGYRPSVATWNLALLAFLKVGRNDFVWKLYQDMIDSGVVVDIDVA 157

>ref|XP_007217442.1| hypothetical protein PRUPE_ppb015972mg [Prunus persica]
 gi|462413592|gb|EMJ18641.1| hypothetical protein PRUPE_ppb015972mg [Prunus persica]
          Length = 221

 Score = 75.5 bits (184), Expect = 2e-11
 Identities = 52/140 (37%), Positives = 65/140 (46%), Gaps = 2/140 (1%)
 Frame = -1

Query: 415 NALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPE 236
           N LS+R F WL +H + FSP+ S CNAL V K A S L+ H F PE
Sbjct: 2   NVFLSLRCFFWLSSHNE---FSPDPIS--CNALVSAFVETKVCNPAKSFLE--HTSFSPE 54

Query: 235 STSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYG 56
           S L S +K GV P + W +AL L+V +TDI W LY
Sbjct: 55  LASFRKLYSV--------------SLLKEAGVCPAIMTWKAALSGCLKVGRTDIIWKLYQ 100

Query: 55  EMMASGLVGDGD--TAGYLI 2
           EM+ G+V D + GYLI
Sbjct: 101 EMIECGVVADVELRLLGYLI 120