BLASTX nr result
ID: Cocculus23_contig00019933
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus23_contig00019933 (892 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citr... 229 8e-58 ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containi... 228 2e-57 emb|CBI29222.3| unnamed protein product [Vitis vinifera] 228 2e-57 ref|XP_002303480.2| pentatricopeptide repeat-containing family p... 225 2e-56 ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prun... 224 5e-56 ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containi... 202 2e-49 ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containi... 201 2e-49 ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containi... 195 2e-47 ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containi... 184 2e-44 ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containi... 184 4e-44 ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutr... 183 7e-44 ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily pr... 182 1e-43 ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Caps... 177 6e-42 sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-c... 174 4e-41 emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|72687... 174 4e-41 ref|NP_567587.1| pentatricopeptide repeat-containing protein [Ar... 174 4e-41 ref|XP_002867936.1| hypothetical protein ARALYDRAFT_492917 [Arab... 174 5e-41 gb|AHB18408.1| pentatricopeptide repeat-containing protein [Goss... 172 2e-40 gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus... 171 3e-40 ref|XP_003628993.1| Pentatricopeptide repeat-containing protein ... 171 5e-40 >ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citrus clementina] gi|568835123|ref|XP_006471629.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X1 [Citrus sinensis] gi|568835125|ref|XP_006471630.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X2 [Citrus sinensis] gi|557534991|gb|ESR46109.1| hypothetical protein CICLE_v10000274mg [Citrus clementina] Length = 833 Score = 229 bits (585), Expect = 8e-58 Identities = 124/211 (58%), Positives = 152/211 (72%), Gaps = 2/211 (0%) Frame = +3 Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443 K V ++LS SLD SKC+ L +LSPQ FD +F + S+VNP+TAL FFYFA FRF Sbjct: 56 KWVSSVLSKQSLDPSKCKLFLPNLSPQEFDTLFFSIRSNVNPKTALKFFYFASQSCNFRF 115 Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623 TV SYC LIRLLL +NL++PARLLLIRLIDGK+ L+ + + RHI+IA + DL+ Sbjct: 116 TVRSYCLLIRLLLFSNLLSPARLLLIRLIDGKMPVLYASNPS-IRHIEIASQMVDLNVTS 174 Query: 624 GDLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797 +GV DL+VHVYCTQFK LG G DVF F+ + +FPSLKTCNFLL+SLVKANE+ Sbjct: 175 EPALGVQIADLLVHVYCTQFKNLGFGYAIDVFSIFSNKGIFPSLKTCNFLLNSLVKANEV 234 Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 ++ EVF +CR GVSPDV FST IN FCK Sbjct: 235 QKGIEVFETMCR-GVSPDVFLFSTAINAFCK 264 >ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Vitis vinifera] Length = 1022 Score = 228 bits (581), Expect = 2e-57 Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + K V +ILS+PSLD ++C+ L+ LSP +FD +F + +VNP+TALNFFYFA D Sbjct: 113 DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 172 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 GFRFT+ SYC L+R L+ + V+PARLLLIRLID KL LF + KN RHI+IA +AD Sbjct: 173 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN--RHIEIASAMAD 230 Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 L+ G G V +DL++HVYCTQF+ +G VFRF A + +FP++KTC FLLSSLV Sbjct: 231 LNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAIGVFRFLANKGVFPTVKTCTFLLSSLV 290 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 KANELE+S VF + R GVSPDV FST IN FCK Sbjct: 291 KANELEKSYWVFETM-RQGVSPDVYLFSTAINAFCK 325 >emb|CBI29222.3| unnamed protein product [Vitis vinifera] Length = 826 Score = 228 bits (581), Expect = 2e-57 Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + K V +ILS+PSLD ++C+ L+ LSP +FD +F + +VNP+TALNFFYFA D Sbjct: 46 DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 105 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 GFRFT+ SYC L+R L+ + V+PARLLLIRLID KL LF + KN RHI+IA +AD Sbjct: 106 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN--RHIEIASAMAD 163 Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 L+ G G V +DL++HVYCTQF+ +G VFRF A + +FP++KTC FLLSSLV Sbjct: 164 LNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAIGVFRFLANKGVFPTVKTCTFLLSSLV 223 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 KANELE+S VF + R GVSPDV FST IN FCK Sbjct: 224 KANELEKSYWVFETM-RQGVSPDVYLFSTAINAFCK 258 >ref|XP_002303480.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550342907|gb|EEE78459.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 842 Score = 225 bits (574), Expect = 2e-56 Identities = 122/211 (57%), Positives = 150/211 (71%), Gaps = 2/211 (0%) Frame = +3 Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443 K+V ILS+PSLD +KC+ L+ LSPQ FD F S+VNP+TALNFF+F + FRF Sbjct: 65 KRVSLILSNPSLDCAKCKELVPHLSPQEFDSCFLALKSNVNPKTALNFFHFVSETCKFRF 124 Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623 T SYC LI LL+GN+L++PARLLLIRLIDGK+ A + E RH +IA+ +AD + Sbjct: 125 TARSYCVLIHLLVGNDLLSPARLLLIRLIDGKVPAFYARNF-ESRHFEIAQIMADFNLVF 183 Query: 624 GDLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797 ++GV DL+VHVY TQFK LG G DVF A + LFPSLKTC FLLSSLVKANEL Sbjct: 184 EPVIGVKIADLLVHVYSTQFKHLGFGFAADVFSLLAKKGLFPSLKTCTFLLSSLVKANEL 243 Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 ++S EV+ IC GG+ PDV FST+IN FCK Sbjct: 244 KKSYEVYDFICLGGIIPDVHLFSTMINAFCK 274 >ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica] gi|462413303|gb|EMJ18352.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica] Length = 821 Score = 224 bits (570), Expect = 5e-56 Identities = 118/209 (56%), Positives = 153/209 (73%), Gaps = 2/209 (0%) Frame = +3 Query: 270 VIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTV 449 V +ILS PSLD SKC+ L+ LS FDR+F + S+VNP+TAL+FFYFA + F+FTV Sbjct: 57 VSSILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTV 116 Query: 450 GSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLS--GRE 623 S+C L+RLL+ +NLV+PARLLLIRLIDG + L+ N + RH++IA + DL+ + Sbjct: 117 RSFCVLVRLLILSNLVSPARLLLIRLIDGNVPVLYAN--HNQRHMEIAIAMLDLNTVSTQ 174 Query: 624 GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELER 803 G V +DL++HVYCTQFK +G G D F F+ + +FPSLKTCNFLLSSLVKANEL + Sbjct: 175 GLGVQALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHK 234 Query: 804 SCEVFSLICRGGVSPDVISFSTLINVFCK 890 S +VF ++CR GVSPDV F+T IN FCK Sbjct: 235 SYDVFEVMCR-GVSPDVYLFTTAINAFCK 262 >ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Solanum tuberosum] Length = 928 Score = 202 bits (513), Expect = 2e-49 Identities = 110/211 (52%), Positives = 144/211 (68%), Gaps = 2/211 (0%) Frame = +3 Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443 K V+++LS+P +D K ++LL L+PQ+FD IF SS+ P L FF+ A GF F Sbjct: 153 KWVVSVLSNPPVDSLKIKDLLTLLTPQQFDAIFLEIYSSLKPLNVLKFFHVASGTCGFSF 212 Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623 +V SYC L+RLL+ +N PARLLLIRLIDGKL ALF+ ++ +H+++A +A+LSG Sbjct: 213 SVRSYCTLLRLLVASNHDVPARLLLIRLIDGKLPALFDT--SQQKHVEVAVSLAELSGVS 270 Query: 624 --GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797 G V DL++H+ CTQFK +G DVFR A R ++PSLKTCNFLLSSLVK NEL Sbjct: 271 DFGVAVRTFDLLLHLCCTQFKNVGFDAALDVFRSLASRGVYPSLKTCNFLLSSLVKENEL 330 Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +S EVF ++ + GV PDV FST IN FCK Sbjct: 331 WKSYEVFGIL-KDGVEPDVYLFSTAINAFCK 360 >ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Solanum lycopersicum] Length = 839 Score = 201 bits (512), Expect = 2e-49 Identities = 111/211 (52%), Positives = 143/211 (67%), Gaps = 2/211 (0%) Frame = +3 Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443 K V+++LS P +D K ++LL L+PQ+FD IF SS+ P L FF+ A F F Sbjct: 64 KWVVSVLSDPPVDSLKIKDLLTLLNPQQFDAIFLEIHSSLKPLNVLKFFHVASGTCSFSF 123 Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623 TV SYC L+RLL+ +N APARLLLIRLIDGKL ALF++ + +H+++A +A+LSG Sbjct: 124 TVRSYCTLVRLLIASNHDAPARLLLIRLIDGKLPALFDS--LQQKHVEVAVSLAELSGVS 181 Query: 624 --GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797 G V DL++H+ CTQFK +G DVFR A R ++PSLKTCNFLLSSLVK NEL Sbjct: 182 DFGVAVRTFDLLLHLCCTQFKSVGFDAALDVFRSLASRGVYPSLKTCNFLLSSLVKENEL 241 Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +S EVF ++ + GV PDV FST IN FCK Sbjct: 242 WKSYEVFEIL-KDGVKPDVYLFSTAINAFCK 271 >ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucumis sativus] Length = 822 Score = 195 bits (495), Expect = 2e-47 Identities = 108/209 (51%), Positives = 133/209 (63%), Gaps = 2/209 (0%) Frame = +3 Query: 270 VIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTV 449 V ++LS SLD SKC LL LSP +FD++F + NP T LNFFYFA + FRFT+ Sbjct: 50 VSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTI 109 Query: 450 GSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREG- 626 SYC LI LL+ + + PARLLLIRLIDG L L N +E HI+IA + L+ G Sbjct: 110 HSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVL--NLDSEKFHIEIANALFGLTSVVGR 167 Query: 627 -DLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELER 803 + DL++HVY TQF+ LG DVF A + FPSLKTCNFLLSSLVKANE E+ Sbjct: 168 FEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEK 227 Query: 804 SCEVFSLICRGGVSPDVISFSTLINVFCK 890 CEVF ++ G PDV SF+ +IN CK Sbjct: 228 CCEVFRVMSE-GACPDVFSFTNVINALCK 255 >ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Glycine max] Length = 840 Score = 184 bits (467), Expect(2) = 2e-44 Identities = 102/208 (49%), Positives = 136/208 (65%), Gaps = 3/208 (1%) Frame = +3 Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455 +IL+S +LD SKC+++L L+P FDR+F + +VNP+T FF FA FRFTV S Sbjct: 65 SILTSKTLDSSKCKSILPHLTPHHFDRLFLSLHRTVNPKTTHEFFRFATRHCNFRFTVRS 124 Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNE--DRHIDIARGIADLS-GREG 626 YC L+R LL ++ V AR LL RLIDG + DR +IA + +L+ G + Sbjct: 125 YCLLLRSLLADSFVPRARFLLARLIDGHVPTWSSKTTTSFHDRLREIASSMLELNQGSDE 184 Query: 627 DLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERS 806 +G +DL++H+ C+QFK LG FD+F F+ R +FP LKTCN LLSSLVKANEL +S Sbjct: 185 QRLGELDLLLHILCSQFKCLGSRCAFDIFVMFSKRGVFPCLKTCNLLLSSLVKANELHKS 244 Query: 807 CEVFSLICRGGVSPDVISFSTLINVFCK 890 EVF L C+ GV+PDV +F+T IN FCK Sbjct: 245 YEVFDLACQ-GVAPDVFTFTTAINAFCK 271 Score = 22.7 bits (47), Expect(2) = 2e-44 Identities = 12/21 (57%), Positives = 13/21 (61%) Frame = +1 Query: 121 VPPESLYSPNPIASLSSQPSI 183 +PP S P P SLSS PSI Sbjct: 46 LPPPSPPPPPPHPSLSSIPSI 66 >ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X1 [Cicer arietinum] gi|502153968|ref|XP_004509526.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X2 [Cicer arietinum] gi|502153970|ref|XP_004509527.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X3 [Cicer arietinum] gi|502153972|ref|XP_004509528.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X4 [Cicer arietinum] gi|502153974|ref|XP_004509529.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X5 [Cicer arietinum] gi|502153976|ref|XP_004509530.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X6 [Cicer arietinum] gi|502153978|ref|XP_004509531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X7 [Cicer arietinum] gi|502153980|ref|XP_004509532.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X8 [Cicer arietinum] gi|502153982|ref|XP_004509533.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X9 [Cicer arietinum] gi|502153984|ref|XP_004509534.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X10 [Cicer arietinum] Length = 835 Score = 184 bits (467), Expect = 4e-44 Identities = 101/205 (49%), Positives = 133/205 (64%) Frame = +3 Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455 +ILS LD SKC+++L L+P +FD +F T S+VN +T L+FF FA + F FTV S Sbjct: 62 SILSHKILDSSKCKSILPHLTPHQFDTLFFTHHSTVNLKTTLDFFRFASNQFKFCFTVRS 121 Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635 YC LIRLLL +N + AR + RLIDG + N +DR ++A +LS Sbjct: 122 YCLLIRLLLCSNHLPRARFFMKRLIDGNVSTPLLNR--DDRLSEMASSFLELSRLTERSH 179 Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815 G +DL++H+ C+QF+ LG FD+F F +FPSLKTCNFLLSSLVK+NEL +S V Sbjct: 180 GELDLLLHILCSQFQHLGFHWAFDIFTLFTSNGVFPSLKTCNFLLSSLVKSNELHKSYRV 239 Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890 F ++CRGGVS DV +FST IN F K Sbjct: 240 FDVVCRGGVSLDVYTFSTAINAFSK 264 >ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum] gi|557115148|gb|ESQ55431.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum] Length = 837 Score = 183 bits (465), Expect = 7e-44 Identities = 100/216 (46%), Positives = 136/216 (62%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + +++ A LS SLD +C+ L+ LSP FDR+F S VNP+TAL+FF A D Sbjct: 70 DRHLRERLSAALSRRSLDYEQCKQLIATLSPHEFDRLFPDFRSKVNPKTALDFFRLASDS 129 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 F F++ SYC LI LLL +L++PARL+LIRLI+G + L + D + IA +A Sbjct: 130 FSFSFSLRSYCLLIGLLLDASLLSPARLVLIRLINGNVPVLPSANDSRDGRVAIADAMAS 189 Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 LS + + DL++ VYCTQFK G L D+F A + LFPS TCN LL+SLV Sbjct: 190 LSLCFDPEIRMRISDLLIEVYCTQFKRAGCYLALDIFPLLANKGLFPSRTTCNILLTSLV 249 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +ANE ++ CE F +C+ GVSPDV F+T+IN +CK Sbjct: 250 RANEFQKCCEAFEAVCK-GVSPDVYLFTTVINAYCK 284 >ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680604|ref|XP_007040907.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680608|ref|XP_007040908.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680612|ref|XP_007040909.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680616|ref|XP_007040910.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680620|ref|XP_007040911.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778151|gb|EOY25407.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778152|gb|EOY25408.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778153|gb|EOY25409.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778154|gb|EOY25410.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778155|gb|EOY25411.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778156|gb|EOY25412.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 845 Score = 182 bits (463), Expect = 1e-43 Identities = 107/210 (50%), Positives = 132/210 (62%), Gaps = 2/210 (0%) Frame = +3 Query: 267 KVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFT 446 ++ ILS SLD SKC+ LL LSP FDR FS S +NP+T L+FFY A FRFT Sbjct: 70 RLSCILSKSSLDSSKCKQLLPLLSPLDFDRFFSAISSHLNPKTTLHFFYLASQSFNFRFT 129 Query: 447 VGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREG 626 + SYC LI LLL N +PARLL IRLIDGKL N D HI I +ADL+ Sbjct: 130 LRSYCILILLLLLANHSSPARLLFIRLIDGKLPLSSPNNTTID-HIQITTALADLNTLSK 188 Query: 627 DLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELE 800 + V +D+++H+YCTQFK G DVF A + +FPS KTCNF LSSLVKANEL+ Sbjct: 189 GVPRVMGVDMLLHLYCTQFKNAGFTSAIDVFFTLADKGMFPSSKTCNFFLSSLVKANELQ 248 Query: 801 RSCEVFSLICRGGVSPDVISFSTLINVFCK 890 ++ +VF + R VS DV +T+IN FCK Sbjct: 249 KTYQVFETLSR-FVSLDVYLCTTMINAFCK 277 >ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Capsella rubella] gi|482551263|gb|EOA15456.1| hypothetical protein CARUB_v10004123mg [Capsella rubella] Length = 838 Score = 177 bits (448), Expect = 6e-42 Identities = 101/217 (46%), Positives = 135/217 (62%), Gaps = 3/217 (1%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + ++ ++LS SLD C+ L+ LSP FDR+F S VNP+TALNFF A D Sbjct: 70 DRHLHDRLSSVLSKRSLDYELCKQLITVLSPLEFDRLFPEFRSKVNPKTALNFFRLASDS 129 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKN-EDRHIDIARGIA 605 F F++ SYC LI LLL NL++PAR+ LIRLI+G + L + D + IA +A Sbjct: 130 FSFSFSLRSYCLLIGLLLDANLLSPARVTLIRLINGNVPVLPCGDGGLRDSRVAIADAMA 189 Query: 606 DLSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSL 779 LS + + + DL++ VYCTQFK G L DVF F A + +FPS TCN LL+SL Sbjct: 190 RLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPFLANKGMFPSKTTCNILLTSL 249 Query: 780 VKANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 V+ANE ++ CE F ++C+ GV PDV F+T IN FCK Sbjct: 250 VRANEFQKCCEAFEVVCK-GVFPDVYLFTTAINAFCK 285 >sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g19440, chloroplastic; Flags: Precursor Length = 838 Score = 174 bits (441), Expect = 4e-41 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + +++ ++LS SLD +C+ L+ LSP FDR+F S VNP+TAL+FF A D Sbjct: 73 DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 132 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 F F++ SYC LI LLL NL++ AR++LIRLI+G + L + D + IA +A Sbjct: 133 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 190 Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 LS + + + DL++ VYCTQFK G L DVF A + +FPS TCN LL+SLV Sbjct: 191 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 250 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +ANE ++ CE F ++C+ GVSPDV F+T IN FCK Sbjct: 251 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 285 >emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|7268739|emb|CAB78946.1| putative protein [Arabidopsis thaliana] Length = 814 Score = 174 bits (441), Expect = 4e-41 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + +++ ++LS SLD +C+ L+ LSP FDR+F S VNP+TAL+FF A D Sbjct: 49 DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 108 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 F F++ SYC LI LLL NL++ AR++LIRLI+G + L + D + IA +A Sbjct: 109 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 166 Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 LS + + + DL++ VYCTQFK G L DVF A + +FPS TCN LL+SLV Sbjct: 167 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 226 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +ANE ++ CE F ++C+ GVSPDV F+T IN FCK Sbjct: 227 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 261 >ref|NP_567587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186696|ref|NP_001190771.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|15810161|gb|AAL07224.1| unknown protein [Arabidopsis thaliana] gi|332658782|gb|AEE84182.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658783|gb|AEE84183.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 825 Score = 174 bits (441), Expect = 4e-41 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + +++ ++LS SLD +C+ L+ LSP FDR+F S VNP+TAL+FF A D Sbjct: 60 DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 119 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608 F F++ SYC LI LLL NL++ AR++LIRLI+G + L + D + IA +A Sbjct: 120 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 177 Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782 LS + + + DL++ VYCTQFK G L DVF A + +FPS TCN LL+SLV Sbjct: 178 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 237 Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 +ANE ++ CE F ++C+ GVSPDV F+T IN FCK Sbjct: 238 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 272 >ref|XP_002867936.1| hypothetical protein ARALYDRAFT_492917 [Arabidopsis lyrata subsp. lyrata] gi|297313772|gb|EFH44195.1| hypothetical protein ARALYDRAFT_492917 [Arabidopsis lyrata subsp. lyrata] Length = 817 Score = 174 bits (440), Expect = 5e-41 Identities = 99/217 (45%), Positives = 135/217 (62%), Gaps = 3/217 (1%) Frame = +3 Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428 D + +++ ++LS SLD +C+ L+ LSP FDR+F VNP+TAL+FF A D Sbjct: 49 DRHLHERLSSVLSKRSLDYEQCKQLITVLSPHEFDRLFPEFRFKVNPKTALDFFRLASDS 108 Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRAL-FENEKNEDRHIDIARGIA 605 F F++ SYC LI LLL NL +PAR++LIRLI+G + L N D + IA +A Sbjct: 109 FSFSFSLRSYCLLIGLLLDANLSSPARVVLIRLINGNVPVLPCGNGGLRDSRVAIADAMA 168 Query: 606 DLSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSL 779 LS + + + DL++ VYCTQFK G L DVF A + +FPS TCN LL+SL Sbjct: 169 SLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSL 228 Query: 780 VKANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 V+A E ++ CE F ++C+ GVSPDV F+T IN FCK Sbjct: 229 VRATEFQKCCEAFHVVCK-GVSPDVYLFTTAINAFCK 264 >gb|AHB18408.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 846 Score = 172 bits (436), Expect = 2e-40 Identities = 102/205 (49%), Positives = 128/205 (62%) Frame = +3 Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455 +ILS PSLD SK + LL LSP FDR F +P+T LNFF+ A FRFT+ S Sbjct: 85 SILSKPSLDSSKSKQLLPLLSPSDFDRFFIALSPRADPKTTLNFFHLASRCFNFRFTLRS 144 Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635 Y LI LLL +N + ARLLLIRLIDGKL N HI IA +ADL+ + Sbjct: 145 YYILILLLLLSNNSSAARLLLIRLIDGKLPLFSPNNPPTVNHIQIAIALADLNTSFKGVA 204 Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815 GV DL++H+YCTQFK +G DVF A + +FPS KTCNF L+SL+KANE+ ++ +V Sbjct: 205 GV-DLLLHLYCTQFKNVGFTYAIDVFFTLAYKGIFPSTKTCNFFLNSLLKANEVRKTYQV 263 Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890 F + R VS DV +T+IN FCK Sbjct: 264 FETLSR-SVSLDVYLCTTMINGFCK 287 >gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus guttatus] Length = 847 Score = 171 bits (434), Expect = 3e-40 Identities = 102/213 (47%), Positives = 136/213 (63%), Gaps = 4/213 (1%) Frame = +3 Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443 K + ++LS + + ++C+ L+ +SP++FD IF +++ P TAL FYFA D F F Sbjct: 74 KSLASVLSGSNFNSNQCKELISQISPRQFDSIFWEIHNNIEPSTALKLFYFAGDYCSFSF 133 Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLR-ALFENEKNEDRHIDIARGIAD-LSG 617 T+ SYC L LL+ NL + ARLLLIRLID KL +L +N N H +IA +AD SG Sbjct: 134 TLRSYCILFHLLVSKNLDSAARLLLIRLIDRKLPVSLRDNVVN--LHNEIAIVLADTFSG 191 Query: 618 REGDLVG--VIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKAN 791 E G D++VHVY T+FK LGL DVFR AGR L PS KTCNFL+S+LVKA+ Sbjct: 192 SEKFRSGNRGFDMLVHVYATEFKSLGLDAAMDVFRLLAGRRLVPSFKTCNFLMSTLVKAD 251 Query: 792 ELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890 E E+S E+F ++ R + PDV +ST IN CK Sbjct: 252 EHEKSYEIFLIVSRESL-PDVYLYSTAINALCK 283 >ref|XP_003628993.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355523015|gb|AET03469.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 819 Score = 171 bits (432), Expect = 5e-40 Identities = 92/205 (44%), Positives = 129/205 (62%) Frame = +3 Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455 +IL+ LD SKC+ L+ +L+P F+ F T ++VN +T L+FF FA FRFTV S Sbjct: 54 SILAHKVLDSSKCKTLIPNLTPHEFEHSFFTHHTTVNLKTTLDFFSFASKNFKFRFTVRS 113 Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635 YC LIRLLL +N + A+ L RLI+G + K + R +IA +L R Sbjct: 114 YCILIRLLLASNHIPRAKFTLKRLIEGNANTPLK--KTDARLSEIASAFLELGERSH--- 168 Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815 G +DL++++ C+QF+ LG FD F F + +FPSLK+CNFL+SSLVK+NEL +S V Sbjct: 169 GELDLLIYILCSQFQHLGFHWAFDTFMLFTSKGVFPSLKSCNFLMSSLVKSNELHKSFRV 228 Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890 F +CRGGV DV +++T IN +CK Sbjct: 229 FDAMCRGGVLIDVYTYATAINAYCK 253