BLASTX nr result

ID: Angelica22_contig00030744 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00030744
         (2029 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   798   0.0  
emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   792   0.0  
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   753   0.0  
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   721   0.0  
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   718   0.0  

>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Vitis vinifera]
          Length = 582

 Score =  798 bits (2060), Expect = 0.0
 Identities = 395/574 (68%), Positives = 471/574 (82%), Gaps = 1/574 (0%)
 Frame = +3

Query: 114  MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
            MTIYS +FFP+C PF    K  T HS +  +V C                     ++ P 
Sbjct: 1    MTIYSTDFFPRCPPFNPQLKP-TSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPA 59

Query: 294  HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
            HL SY+F+E+HL+KLLNRSCKAGK+NESLYFLECLV +GY   PDVILCTKL+KGFFN K
Sbjct: 60   HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYT--PDVILCTKLIKGFFNFK 117

Query: 474  NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
            N+EKA+RVMEILES   PD+FAYNA+ISGFCK+NRIE A +VL RM+ RGF PD+VTYNI
Sbjct: 118  NIEKASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNI 177

Query: 654  MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
            MIGSLC+R +L LALK+++QLL DNCMPTVITYTILIEATI++GGI+EAMKLL+EML+RG
Sbjct: 178  MIGSLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARG 237

Query: 834  LQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHGKWNDGE 1010
            L PDMYTYNA+IRGMC+EGM+ERA E +  L  +GCKPDV+SYNILLR  L  GKW++GE
Sbjct: 238  LLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGE 297

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
             LV EMFS GC+PN VTYSILIS LCR G+IDEAI++LK+M+EK LTPDTY++DPLISAL
Sbjct: 298  KLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISAL 357

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
            CK+G+LD AI +MD+MISNGCLPDIVNYNTIL+A+ KNG A+QALEIF +L   GC P+V
Sbjct: 358  CKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNV 417

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
            ++YNTMISALWS GDR+RALG+VP MI K +DPD ITYNSLISC CRDGLV+EAI LL D
Sbjct: 418  SSYNTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDD 477

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
            ME S F PTVI+YNI+LLGLCK  RIDDAI + +EM+EK  +PNETTYILL+EGIGF+GW
Sbjct: 478  MEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGW 537

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDV 1832
            R EAM+LANSL   ++IS +SF+RL+KTFP LDV
Sbjct: 538  RTEAMELANSLFSRDVISQDSFKRLNKTFPMLDV 571


>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
          Length = 592

 Score =  792 bits (2046), Expect = 0.0
 Identities = 393/574 (68%), Positives = 469/574 (81%), Gaps = 1/574 (0%)
 Frame = +3

Query: 114  MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
            MTIYS +FFP C PF    K  T HS +  +V C                     ++ P 
Sbjct: 11   MTIYSTDFFPHCPPFSPQLKP-TSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69

Query: 294  HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
            HL SY+F+E+HL+KLLNRSCKAGK+NESLYFLECLV +GY   PDVILCTKL+KGFFN K
Sbjct: 70   HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYT--PDVILCTKLIKGFFNFK 127

Query: 474  NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
            N+EKA+RVMEILES   PD+FAYNA+ISGFCK+N+IE A +VL RM+ RGF PD+VTYNI
Sbjct: 128  NIEKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNI 187

Query: 654  MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
            MIGSLC+R +L LAL +++QLL DNCMPTVITYTILIEATI++GGI+EAMKLL+EML+RG
Sbjct: 188  MIGSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARG 247

Query: 834  LQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHGKWNDGE 1010
            L PDMYTYNA+IRGMC+EGM+ERA E +  L  +GC+PDV+SYNILLR  L  GKW++GE
Sbjct: 248  LLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGE 307

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
             LV EMFS GC+PN VTYSILIS LCR G+IDEAI++LK+M+EK LTPDTY++DPLISAL
Sbjct: 308  KLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISAL 367

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
            CK+G+LD AI +MD+MISNGCLPDIVNYNTIL+A+ KNG A+QALEIF +L   GC P+V
Sbjct: 368  CKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNV 427

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
            ++YNTMISALWS GDR+RALG+VP MI K IDPD ITYNSLISC CRDGLV+EAI LL D
Sbjct: 428  SSYNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDD 487

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
            ME S F PTVI+YNI+LLGLCK  RIDDAI + +EM+EK  +PNETTYILL+EGIGF+GW
Sbjct: 488  MEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGW 547

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDV 1832
            R EAM+LANSL   ++IS +SF+RL+KTFP LDV
Sbjct: 548  RTEAMELANSLFSRDVISQDSFKRLNKTFPMLDV 581


>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538784|gb|EEF40384.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  753 bits (1943), Expect = 0.0
 Identities = 369/580 (63%), Positives = 458/580 (78%), Gaps = 1/580 (0%)
 Frame = +3

Query: 114  MTIYSCEFFPQCLPFGTHFKLLTFHSQYNFVVHCTXXXXXXXXXXXXXXXXXXXTDSTPT 293
            MT++S EF P  + F T     T +S ++ +V C                     ++  T
Sbjct: 1    MTLFSTEFLPHSISFTTQPLKPTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQT 60

Query: 294  HLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMKGFFNSK 473
            H+ S++F+E HL+KLLNRSC+AGKYNESLYFLEC+V +GY   PDVILCTKL+KGFFNS+
Sbjct: 61   HVLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKGYT--PDVILCTKLIKGFFNSR 118

Query: 474  NVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPDVVTYNI 653
            N+ KATRVMEILE  G PD+FAYNALISGF K N++E ANRVL+RM+ RGF PDVVTYNI
Sbjct: 119  NIGKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNI 178

Query: 654  MIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLDEMLSRG 833
            MIGS CSRG+LDLAL++  +LL+DNC PTVITYTILIEATILDGGID AMKLLDEMLS+G
Sbjct: 179  MIGSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKG 238

Query: 834  LQPDMYTYNALIRGMCREGMIERAFEFVRGLPE-GCKPDVVSYNILLRTLLFHGKWNDGE 1010
            L+PD  TYNA+IRGMC+E M+++AFE +R L   GCKPD+++YNILLRTLL  GKW++GE
Sbjct: 239  LEPDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGE 298

Query: 1011 NLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFDPLISAL 1190
             L+ EM SIGC PNVVT+SILI  LCRDGK++EA+NLL+ M EKGL PD Y +DPLI+  
Sbjct: 299  KLISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGF 358

Query: 1191 CKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMTGCQPDV 1370
            C++G+LD A E +++MIS+GCLPDIVNYNTI++ + + GKADQALE+FE+L   GC P+V
Sbjct: 359  CREGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNV 418

Query: 1371 TTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEAIELLRD 1550
            ++YNT+ SALWSSGDR RAL ++  ++ + IDPD ITYNSLISC CRDG+VDEAIELL D
Sbjct: 419  SSYNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVD 478

Query: 1551 MESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEGIGFSGW 1730
            M+S ++ P V++YNIILLGLCK +R +DAI+VL+ M EK  QPNETTYILL+EGIGFSG 
Sbjct: 479  MQSGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGL 538

Query: 1731 RAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL*NLGY 1850
            RAEAM+LANSL     IS +SF RL+KTFP LDV  +L +
Sbjct: 539  RAEAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTF 578


>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g04760, chloroplastic; Flags: Precursor
            gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein
            [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown
            protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1|
            unknown protein [Arabidopsis thaliana]
            gi|332640611|gb|AEE74132.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  721 bits (1860), Expect = 0.0
 Identities = 345/521 (66%), Positives = 430/521 (82%), Gaps = 1/521 (0%)
 Frame = +3

Query: 276  TDSTPTHLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMK 455
            T+    H  S  F+++ ++K+ +RSC++G Y ESL+ LE +V +GYN  PDVILCTKL+K
Sbjct: 75   TERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYN--PDVILCTKLIK 132

Query: 456  GFFNSKNVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPD 635
            GFF  +N+ KA RVMEILE  G PD+FAYNALI+GFCK+NRI+ A RVL+RMR + F PD
Sbjct: 133  GFFTLRNIPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 192

Query: 636  VVTYNIMIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLD 815
             VTYNIMIGSLCSRG+LDLALK++NQLL DNC PTVITYTILIEAT+L+GG+DEA+KL+D
Sbjct: 193  TVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMD 252

Query: 816  EMLSRGLQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHG 992
            EMLSRGL+PDM+TYN +IRGMC+EGM++RAFE VR L  +GC+PDV+SYNILLR LL  G
Sbjct: 253  EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQG 312

Query: 993  KWNDGENLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFD 1172
            KW +GE L+ +MFS  CDPNVVTYSILI+ LCRDGKI+EA+NLLKLM EKGLTPD Y++D
Sbjct: 313  KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYD 372

Query: 1173 PLISALCKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMT 1352
            PLI+A C++G+LD AIE ++ MIS+GCLPDIVNYNT+L+ + KNGKADQALEIF +L   
Sbjct: 373  PLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEV 432

Query: 1353 GCQPDVTTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEA 1532
            GC P+ ++YNTM SALWSSGD+ RAL ++ +M+   IDPD ITYNS+ISC CR+G+VDEA
Sbjct: 433  GCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEA 492

Query: 1533 IELLRDMESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEG 1712
             ELL DM S +F P+V+TYNI+LLG CKAHRI+DAI VL  MV    +PNETTY +L+EG
Sbjct: 493  FELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEG 552

Query: 1713 IGFSGWRAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL 1835
            IGF+G+RAEAM+LAN L+  + IS  SF+RL +TFP L+VL
Sbjct: 553  IGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPLLNVL 593


>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297330308|gb|EFH60727.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  718 bits (1853), Expect = 0.0
 Identities = 344/521 (66%), Positives = 429/521 (82%), Gaps = 1/521 (0%)
 Frame = +3

Query: 276  TDSTPTHLHSYNFQESHLVKLLNRSCKAGKYNESLYFLECLVTRGYNYKPDVILCTKLMK 455
            T+    H  S  F+++ ++K+ +RSC++G Y ESL+ LE +V +GYN  PDVILCTKL+K
Sbjct: 71   TERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYN--PDVILCTKLIK 128

Query: 456  GFFNSKNVEKATRVMEILESIGNPDIFAYNALISGFCKLNRIEPANRVLERMRDRGFFPD 635
            GFF  +NV KA RVMEILE  G PD+FAYNALI+GFCK+NRI+ A RVL+RMR + F PD
Sbjct: 129  GFFTLRNVPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 188

Query: 636  VVTYNIMIGSLCSRGRLDLALKMMNQLLEDNCMPTVITYTILIEATILDGGIDEAMKLLD 815
             VTYNIMIGSLCSRG+LDLALK+++QLL DNC PTVITYTILIEAT+L+GG+DEA+KLLD
Sbjct: 189  TVTYNIMIGSLCSRGKLDLALKVLDQLLSDNCQPTVITYTILIEATMLEGGVDEALKLLD 248

Query: 816  EMLSRGLQPDMYTYNALIRGMCREGMIERAFEFVRGLP-EGCKPDVVSYNILLRTLLFHG 992
            EMLSRGL+PDM+TYN +IRGMC+EGM++RAFE +R L  +GC+PDV+SYNILLR LL  G
Sbjct: 249  EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMIRNLELKGCEPDVISYNILLRALLNQG 308

Query: 993  KWNDGENLVREMFSIGCDPNVVTYSILISWLCRDGKIDEAINLLKLMLEKGLTPDTYTFD 1172
            KW +GE L+ +MFS  CDPNVVTYSILI+ LCRDGKI+EA+NLLKLM EKGLTPD Y++D
Sbjct: 309  KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYD 368

Query: 1173 PLISALCKQGKLDSAIELMDFMISNGCLPDIVNYNTILSAMSKNGKADQALEIFEQLAMT 1352
            PLI+A C++G+LD AIE ++ MIS+GCLPDIVNYNT+L+ + KNGKADQALEIF +L   
Sbjct: 369  PLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEV 428

Query: 1353 GCQPDVTTYNTMISALWSSGDRTRALGLVPDMIQKRIDPDYITYNSLISCFCRDGLVDEA 1532
            GC P+ ++YNTM SALWSSGD+ RAL ++ +M+   IDPD ITYNS+ISC CR+G+VD+A
Sbjct: 429  GCSPNSSSYNTMFSALWSSGDKIRALHMILEMVSNGIDPDEITYNSMISCLCREGMVDKA 488

Query: 1533 IELLRDMESSKFSPTVITYNIILLGLCKAHRIDDAIQVLSEMVEKRYQPNETTYILLVEG 1712
             ELL DM S +F P+V+TYNI+LLG CKAHRI+DAI VL  MV    +PNETTY +L+EG
Sbjct: 489  FELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAIDVLDSMVGNGCRPNETTYTVLIEG 548

Query: 1713 IGFSGWRAEAMDLANSLLVFELISNESFRRLSKTFPPLDVL 1835
            IGF+G+RAEAM+LAN L+    IS  SF+RL +TFP L+VL
Sbjct: 549  IGFAGYRAEAMELANDLVRINAISEYSFKRLHRTFPLLNVL 589

