BLASTX nr result
ID: Angelica22_contig00030744
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00030744 (2029 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   798   0.0
emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   792   0.0
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   753   0.0
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   721   0.0
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   718   0.0

>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Vitis vinifera]
Length = 582

Score = 798 bits (2060), Expect = 0.0
Identities = 395/574 (68%), Positives = 471/574 (82%), Gaps = 1/574 (0%)
Frame = +3

Query: 114 MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
MTIYS +FFP+C PF K T HS + +V C ++ P
Sbjct: 1 MTIYSTDFFPRCPPFNPQLKP-TSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPA 59

Query: 294 HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
HL SY+F+E+HL+KLLNRSCKAGK+NESLYFLECLV +GY PDVILCTKL+KGFFN K
Sbjct: 60 HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYT--PDVILCTKLIKGFFNFK 117

Query: 474 NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
N+EKA+RVMEILES PD+FAYNA+ISGFCK+NRIE A +VL RM+ RGF PD+VTYNI
Sbjct: 118 NIEKASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNI 177

Query: 654 MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
MIGSLC+R +L LALK+++QLL DNCMPTVITYTILIEATI++GGI+EAMKLL+EML+RG
Sbjct: 178 MIGSLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARG 237

Query: 834 LQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHGKWNDGE 1010
L PDMYTYNA+IRGMC+EGM+ERA E + L +GCKPDV+SYNILLR L GKW++GE
Sbjct: 238 LLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGE 297

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
LV EMFS GC+PN VTYSILIS LCR G+IDEAI++LK+M+EK LTPDTY++DPLISAL
Sbjct: 298 KLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISAL 357

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
CK+G+LD AI +MD+MISNGCLPDIVNYNTIL+A+ KNG A+QALEIF +L GC P+V
Sbjct: 358 CKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNV 417

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
++YNTMISALWS GDR+RALG+VP MI K +DPD ITYNSLISC CRDGLV+EAI LL D
Sbjct: 418 SSYNTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDD 477

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
ME S F PTVI+YNI+LLGLCK RIDDAI + +EM+EK +PNETTYILL+EGIGF+GW
Sbjct: 478 MEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGW 537

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDV 1832
R EAM+LANSL ++IS +SF+RL+KTFP LDV
Sbjct: 538 RTEAMELANSLFSRDVISQDSFKRLNKTFPMLDV 571

>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
Length = 592

Score = 792 bits (2046), Expect = 0.0
Identities = 393/574 (68%), Positives = 469/574 (81%), Gaps = 1/574 (0%)
Frame = +3

Query: 114 MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
MTIYS +FFP C PF K T HS + +V C ++ P
Sbjct: 11 MTIYSTDFFPHCPPFSPQLKP-TSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69

Query: 294 HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
HL SY+F+E+HL+KLLNRSCKAGK+NESLYFLECLV +GY PDVILCTKL+KGFFN K
Sbjct: 70 HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYT--PDVILCTKLIKGFFNFK 127

Query: 474 NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
N+EKA+RVMEILES PD+FAYNA+ISGFCK+N+IE A +VL RM+ RGF PD+VTYNI
Sbjct: 128 NIEKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNI 187

Query: 654 MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
MIGSLC+R +L LAL +++QLL DNCMPTVITYTILIEATI++GGI+EAMKLL+EML+RG
Sbjct: 188 MIGSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARG 247

Query: 834 LQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHGKWNDGE 1010
L PDMYTYNA+IRGMC+EGM+ERA E + L +GC+PDV+SYNILLR L GKW++GE
Sbjct: 248 LLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGE 307

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
LV EMFS GC+PN VTYSILIS LCR G+IDEAI++LK+M+EK LTPDTY++DPLISAL
Sbjct: 308 KLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISAL 367

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
CK+G+LD AI +MD+MISNGCLPDIVNYNTIL+A+ KNG A+QALEIF +L GC P+V
Sbjct: 368 CKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNV 427

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
++YNTMISALWS GDR+RALG+VP MI K IDPD ITYNSLISC CRDGLV+EAI LL D
Sbjct: 428 SSYNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDD 487

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
ME S F PTVI+YNI+LLGLCK RIDDAI + +EM+EK +PNETTYILL+EGIGF+GW
Sbjct: 488 MEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGW 547

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDV 1832
R EAM+LANSL ++IS +SF+RL+KTFP LDV
Sbjct: 548 RTEAMELANSLFSRDVISQDSFKRLNKTFPMLDV 581

>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
gi|223538784|gb|EEF40384.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
Length = 584

Score = 753 bits (1943), Expect = 0.0
Identities = 369/580 (63%), Positives = 458/580 (78%), Gaps = 1/580 (0%)
Frame = +3

Query: 114 MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
MT++S EF P + F T T +S ++ +V C ++ T
Sbjct: 1 MTLFSTEFLPHSISFTTQPLKPTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQT 60

Query: 294 HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
H+ S++F+E HL+KLLNRSC+AGKYNESLYFLEC+V +GY PDVILCTKL+KGFFNS+
Sbjct: 61 HVLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKGYT--PDVILCTKLIKGFFNSR 118

Query: 474 NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
N+ KATRVMEILE G PD+FAYNALISGF K N++E ANRVL+RM+ RGF PDVVTYNI
Sbjct: 119 NIGKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNI 178

Query: 654 MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
MIGS CSRG+LDLAL++ +LL+DNC PTVITYTILIEATILDGGID AMKLLDEMLS+G
Sbjct: 179 MIGSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKG 238

Query: 834 LQPDMYTYNALIRGMCREGMIERAFEFVRGLPE-GCKPDVVSYNILLRTLLFHGKWNDGE 1010
L+PD TYNA+IRGMC+E M+++AFE +R L GCKPD+++YNILLRTLL GKW++GE
Sbjct: 239 LEPDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGE 298

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
L+ EM SIGC PNVVT+SILI LCRDGK++EA+NLL+ M EKGL PD Y +DPLI+
Sbjct: 299 KLISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGF 358

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
C++G+LD A E +++MIS+GCLPDIVNYNTI++ + + GKADQALE+FE+L GC P+V
Sbjct: 359 CREGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNV 418

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
++YNT+ SALWSSGDR RAL ++ ++ + IDPD ITYNSLISC CRDG+VDEAIELL D
Sbjct: 419 SSYNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVD 478

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
M+S ++ P V++YNIILLGLCK +R +DAI+VL+ M EK QPNETTYILL+EGIGFSG
Sbjct: 479 MQSGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGL 538

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL*NLGY 1850
RAEAM+LANSL IS +SF RL+KTFP LDV +L +
Sbjct: 539 RAEAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTF 578

>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g04760, chloroplastic; Flags: Precursor
gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein [Arabidopsis thaliana]
gi|15810359|gb|AAL07067.1| unknown protein [Arabidopsis thaliana]
gi|22136960|gb|AAM91709.1| unknown protein [Arabidopsis thaliana]
gi|332640611|gb|AEE74132.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 602

Score = 721 bits (1860), Expect = 0.0
Identities = 345/521 (66%), Positives = 430/521 (82%), Gaps = 1/521 (0%)
Frame = +3

Query: 276 TDSTPTHLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMK 455
T+ H S F+++ ++K+ +RSC++G Y ESL+ LE +V +GYN PDVILCTKL+K
Sbjct: 75 TERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYN--PDVILCTKLIK 132

Query: 456 GFFNSKNVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPD 635
GFF +N+ KA RVMEILE G PD+FAYNALI+GFCK+NRI+ A RVL+RMR + F PD
Sbjct: 133 GFFTLRNIPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 192

Query: 636 VVTYNIMIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLD 815
VTYNIMIGSLCSRG+LDLALK++NQLL DNC PTVITYTILIEAT+L+GG+DEA+KL+D
Sbjct: 193 TVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMD 252

Query: 816 EMLSRGLQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHG 992
EMLSRGL+PDM+TYN +IRGMC+EGM++RAFE VR L +GC+PDV+SYNILLR LL G
Sbjct: 253 EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQG 312

Query: 993 KWNDGENLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFD 1172
KW +GE L+ +MFS CDPNVVTYSILI+ LCRDGKI+EA+NLLKLM EKGLTPD Y++D
Sbjct: 313 KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYD 372

Query: 1173 PLISALCKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMT 1352
PLI+A C++G+LD AIE ++ MIS+GCLPDIVNYNT+L+ + KNGKADQALEIF +L
Sbjct: 373 PLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEV 432

Query: 1353 GCQPDVTTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEA 1532
GC P+ ++YNTM SALWSSGD+ RAL ++ +M+ IDPD ITYNS+ISC CR+G+VDEA
Sbjct: 433 GCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEA 492

Query: 1533 IELLRDMESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEG 1712
ELL DM S +F P+V+TYNI+LLG CKAHRI+DAI VL MV +PNETTY +L+EG
Sbjct: 493 FELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEG 552

Query: 1713 IGFSGWRAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL 1835
IGF+G+RAEAM+LAN L+ + IS SF+RL +TFP L+VL
Sbjct: 553 IGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPLLNVL 593

>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297330308|gb|EFH60727.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 598

Score = 718 bits (1853), Expect = 0.0
Identities = 344/521 (66%), Positives = 429/521 (82%), Gaps = 1/521 (0%)
Frame = +3

Query: 276 TDSTPTHLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMK 455
T+ H S F+++ ++K+ +RSC++G Y ESL+ LE +V +GYN PDVILCTKL+K
Sbjct: 71 TERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYN--PDVILCTKLIK 128

Query: 456 GFFNSKNVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPD 635
GFF +NV KA RVMEILE G PD+FAYNALI+GFCK+NRI+ A RVL+RMR + F PD
Sbjct: 129 GFFTLRNVPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 188

Query: 636 VVTYNIMIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLD 815
VTYNIMIGSLCSRG+LDLALK+++QLL DNC PTVITYTILIEAT+L+GG+DEA+KLLD
Sbjct: 189 TVTYNIMIGSLCSRGKLDLALKVLDQLLSDNCQPTVITYTILIEATMLEGGVDEALKLLD 248

Query: 816 EMLSRGLQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHG 992
EMLSRGL+PDM+TYN +IRGMC+EGM++RAFE +R L +GC+PDV+SYNILLR LL G
Sbjct: 249 EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMIRNLELKGCEPDVISYNILLRALLNQG 308

Query: 993 KWNDGENLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFD 1172
KW +GE L+ +MFS CDPNVVTYSILI+ LCRDGKI+EA+NLLKLM EKGLTPD Y++D
Sbjct: 309 KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYD 368

Query: 1173 PLISALCKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMT 1352
PLI+A C++G+LD AIE ++ MIS+GCLPDIVNYNT+L+ + KNGKADQALEIF +L
Sbjct: 369 PLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEV 428

Query: 1353 GCQPDVTTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEA 1532
GC P+ ++YNTM SALWSSGD+ RAL ++ +M+ IDPD ITYNS+ISC CR+G+VD+A
Sbjct: 429 GCSPNSSSYNTMFSALWSSGDKIRALHMILEMVSNGIDPDEITYNSMISCLCREGMVDKA 488

Query: 1533 IELLRDMESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEG 1712
ELL DM S +F P+V+TYNI+LLG CKAHRI+DAI VL MV +PNETTY +L+EG
Sbjct: 489 FELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAIDVLDSMVGNGCRPNETTYTVLIEG 548

Query: 1713 IGFSGWRAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL 1835
IGF+G+RAEAM+LAN L+ IS SF+RL +TFP L+VL
Sbjct: 549 IGFAGYRAEAMELANDLVRINAISEYSFKRLHRTFPLLNVL 589