BLASTX nr result
ID: Cephaelis21_contig00041523
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00041523 (724 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002285225.2| PREDICTED: pentatricopeptide repeat-containi... 323 2e-86 emb|CBI36234.3| unnamed protein product [Vitis vinifera] 317 2e-84 ref|NP_173004.1| pentatricopeptide repeat-containing protein [Ar... 306 3e-81 ref|XP_002303270.1| predicted protein [Populus trichocarpa] gi|2... 305 6e-81 ref|XP_002890108.1| pentatricopeptide repeat-containing protein ... 297 1e-78 >ref|XP_002285225.2| PREDICTED: pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like [Vitis vinifera] Length = 872 Score = 323 bits (829), Expect = 2e-86 Identities = 146/203 (71%), Positives = 176/203 (86%) Frame = -2 Query: 723 WGALLNSCRIYQRADLGELAARHIYEMDEEFVKYYILLCNFYSDCGKWDDVARLQRMLRE 544 WGALLN+CRIYQ +LGELAA+HI+EMD + V YYILLCN Y+D GKWD+VAR+++++RE Sbjct: 668 WGALLNACRIYQNVELGELAAQHIFEMDTKSVGYYILLCNLYADSGKWDEVARVRKIMRE 727 Query: 543 KGVTVDPGCSWVEVKGQVHAFLSGENFHPQIKEMNAVLEGFYDKMKEEGLCDPESSSMNE 364 +TVDPGCSWVEV GQVHAFL+G++FHPQIKE+NAVLEGFY+KM+ GL + S ++ Sbjct: 728 NRLTVDPGCSWVEVAGQVHAFLTGDDFHPQIKEINAVLEGFYEKMEATGLSMSKDSRRDD 787 Query: 363 LEGSKAEVFCGHSEKLAIAFALINTVPGMPISVTKNLYMCKNCHSTVKFISKIVRREISL 184 ++ SKAE+FCGHSE+LAIAF LINTVPG PI VTKNLYMC+NCH+TVKFISK+VRR IS+ Sbjct: 788 IDASKAEIFCGHSERLAIAFGLINTVPGTPIWVTKNLYMCENCHNTVKFISKVVRRGISV 847 Query: 183 RDTEHFHHFKDGKCSCRDEVYWG 115 RDTE FHHFKDG CSC DE YWG Sbjct: 848 RDTEQFHHFKDGVCSCGDEGYWG 870 >emb|CBI36234.3| unnamed protein product [Vitis vinifera] Length = 906 Score = 317 bits (812), Expect = 2e-84 Identities = 144/200 (72%), Positives = 174/200 (87%) Frame = -2 Query: 723 WGALLNSCRIYQRADLGELAARHIYEMDEEFVKYYILLCNFYSDCGKWDDVARLQRMLRE 544 WGALLN+CRIYQ +LGELAA+HI+EMD + V YYILLCN Y+D GKWD+VAR+++++RE Sbjct: 668 WGALLNACRIYQNVELGELAAQHIFEMDTKSVGYYILLCNLYADSGKWDEVARVRKIMRE 727 Query: 543 KGVTVDPGCSWVEVKGQVHAFLSGENFHPQIKEMNAVLEGFYDKMKEEGLCDPESSSMNE 364 +TVDPGCSWVEV GQVHAFL+G++FHPQIKE+NAVLEGFY+KM+ GL + S ++ Sbjct: 728 NRLTVDPGCSWVEVAGQVHAFLTGDDFHPQIKEINAVLEGFYEKMEATGLSMSKDSRRDD 787 Query: 363 LEGSKAEVFCGHSEKLAIAFALINTVPGMPISVTKNLYMCKNCHSTVKFISKIVRREISL 184 ++ SKAE+FCGHSE+LAIAF LINTVPG PI VTKNLYMC+NCH+TVKFISK+VRR IS+ Sbjct: 788 IDASKAEIFCGHSERLAIAFGLINTVPGTPIWVTKNLYMCENCHNTVKFISKVVRRGISV 847 Query: 183 RDTEHFHHFKDGKCSCRDEV 124 RDTE FHHFKDG CSC DEV Sbjct: 848 RDTEQFHHFKDGVCSCGDEV 867 >ref|NP_173004.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75191104|sp|Q9M9E2.1|PPR45_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g15510, chloroplastic; Flags: Precursor gi|8072389|gb|AAF71977.1|AC013453_2 Hypothetical protein [Arabidopsis thaliana] gi|300825685|gb|ADK35876.1| chloroplast vanilla cream 1 [Arabidopsis thaliana] gi|332191210|gb|AEE29331.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 866 Score = 306 bits (784), Expect = 3e-81 Identities = 141/199 (70%), Positives = 170/199 (85%), Gaps = 1/199 (0%) Frame = -2 Query: 723 WGALLNSCRIYQRADLGELAARHIYEMDEEFVKYYILLCNFYSDCGKWDDVARLQRMLRE 544 WGALLN+CRI+ + DLGEL+A+HI+E+D++ V YYILLCN Y+DCGKW +VA+++RM++E Sbjct: 668 WGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKE 727 Query: 543 KGVTVDPGCSWVEVKGQVHAFLSGENFHPQIKEMNAVLEGFYDKMKEEGLCD-PESSSMN 367 G+TVD GCSWVEVKG+VHAFLS + +HPQ KE+N VLEGFY+KM E GL ESSSM+ Sbjct: 728 NGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGLTKISESSSMD 787 Query: 366 ELEGSKAEVFCGHSEKLAIAFALINTVPGMPISVTKNLYMCKNCHSTVKFISKIVRREIS 187 E E S+ E+FCGHSE+ AIAF LINTVPGMPI VTKNL MC+NCH TVKFISK VRREIS Sbjct: 788 ETEISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKFISKTVRREIS 847 Query: 186 LRDTEHFHHFKDGKCSCRD 130 +RD EHFHHFKDG+CSC D Sbjct: 848 VRDAEHFHHFKDGECSCGD 866 >ref|XP_002303270.1| predicted protein [Populus trichocarpa] gi|222840702|gb|EEE78249.1| predicted protein [Populus trichocarpa] Length = 805 Score = 305 bits (781), Expect = 6e-81 Identities = 137/198 (69%), Positives = 171/198 (86%) Frame = -2 Query: 723 WGALLNSCRIYQRADLGELAARHIYEMDEEFVKYYILLCNFYSDCGKWDDVARLQRMLRE 544 WGALLN+CRI++ LGELAA+HI++ D E + YYILLCN Y+D GKWD+VA+++R ++E Sbjct: 608 WGALLNACRIHRHVLLGELAAQHIFKQDAESIGYYILLCNLYADSGKWDEVAKVRRTMKE 667 Query: 543 KGVTVDPGCSWVEVKGQVHAFLSGENFHPQIKEMNAVLEGFYDKMKEEGLCDPESSSMNE 364 +G+ VDPGCSWVEVKG+VHAFLSG+NFHPQ++E+N VLEGFY+KMK G E SSM+ Sbjct: 668 EGLIVDPGCSWVEVKGKVHAFLSGDNFHPQMQEINVVLEGFYEKMKTSGFNGQECSSMDG 727 Query: 363 LEGSKAEVFCGHSEKLAIAFALINTVPGMPISVTKNLYMCKNCHSTVKFISKIVRREISL 184 ++ SKA++FCGHSE+ AIA++LIN+ PGMPI VTKNLYMC++CHSTVKFISKIVRREIS+ Sbjct: 728 IQTSKADIFCGHSERQAIAYSLINSAPGMPIWVTKNLYMCQSCHSTVKFISKIVRREISV 787 Query: 183 RDTEHFHHFKDGKCSCRD 130 RDTE FHHFKDG CSC D Sbjct: 788 RDTEQFHHFKDGLCSCGD 805 >ref|XP_002890108.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297335950|gb|EFH66367.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 866 Score = 297 bits (761), Expect = 1e-78 Identities = 137/199 (68%), Positives = 168/199 (84%), Gaps = 1/199 (0%) Frame = -2 Query: 723 WGALLNSCRIYQRADLGELAARHIYEMDEEFVKYYILLCNFYSDCGKWDDVARLQRMLRE 544 WGALLN+CRI+ DLGEL+A+ I+E+D+ V YYILLCN Y+DCGKW +VA+++RM++E Sbjct: 668 WGALLNACRIHHNIDLGELSAQRIFELDKGSVGYYILLCNLYADCGKWREVAKVRRMMKE 727 Query: 543 KGVTVDPGCSWVEVKGQVHAFLSGENFHPQIKEMNAVLEGFYDKMKEEGL-CDPESSSMN 367 G+TVD GCSWVEVKG+VHAFLS + +HPQ KE+N VL+GFY+KM E GL ESSSM+ Sbjct: 728 NGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLDGFYEKMSEVGLTTSSESSSMD 787 Query: 366 ELEGSKAEVFCGHSEKLAIAFALINTVPGMPISVTKNLYMCKNCHSTVKFISKIVRREIS 187 E E S+ E+FCGHSE+ AIAF LIN+VPGMPI VTKNL MC++CH TVKFISK VRREIS Sbjct: 788 ETEISRDEIFCGHSERKAIAFGLINSVPGMPIWVTKNLNMCESCHDTVKFISKTVRREIS 847 Query: 186 LRDTEHFHHFKDGKCSCRD 130 +RD+EHFHHFKDG+CSC D Sbjct: 848 VRDSEHFHHFKDGECSCGD 866