BLASTX nr result

ID: Angelica22_contig00040020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00040020
         (396 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]   221   5e-56
ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containi...   220   1e-55
ref|XP_002515835.1| pentatricopeptide repeat-containing protein,...   199   1e-49
ref|XP_003544373.1| PREDICTED: pentatricopeptide repeat-containi...   196   2e-48
ref|XP_004137054.1| PREDICTED: pentatricopeptide repeat-containi...   195   3e-48

>emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]
          Length = 871

 Score =  221 bits (563), Expect = 5e-56
 Identities = 105/130 (80%), Positives = 121/130 (93%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQVG 216
           +VHGYA+RSGL EDVFVGNA+VDMYAKC +M+EA+KVFERMKVKDVVSWNAMVTGYSQ+G
Sbjct: 271 QVHGYALRSGLFEDVFVGNAVVDMYAKCGMMEEANKVFERMKVKDVVSWNAMVTGYSQIG 330

Query: 215 RFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITLL 36
           RF+DAL LFE+++ EKI+LNVVTWSAVI  YAQRGLG+EALDVFRQM L GSEPN++TL+
Sbjct: 331 RFDDALGLFEKIREEKIELNVVTWSAVIAGYAQRGLGFEALDVFRQMLLCGSEPNVVTLV 390

Query: 35  SLLSGCASVG 6
           SLLSGCAS G
Sbjct: 391 SLLSGCASAG 400



 Score = 65.9 bits (159), Expect = 3e-09
 Identities = 35/108 (32%), Positives = 62/108 (57%), Gaps = 1/108 (0%)
 Frame = -2

Query: 395 EVHGYAVRSGL-CEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQV 219
           ++H Y +R+      +FV N ++DMY+K   +D A  VF+ M  ++ VSW +++TGY   
Sbjct: 519 QIHAYVLRNRFESAMLFVANCLIDMYSKSGDVDAARVVFDNMHQRNGVSWTSLMTGYGMH 578

Query: 218 GRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQM 75
           GR E+AL +F  M+   +  + VT+  V+ A +  G+  + ++ F  M
Sbjct: 579 GRGEEALQIFYEMQKVXLVPDGVTFVVVLYACSHSGMVDQGINYFNGM 626



 Score = 65.1 bits (157), Expect = 6e-09
 Identities = 42/133 (31%), Positives = 61/133 (45%), Gaps = 3/133 (2%)
 Frame = -2

Query: 392 VHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMK---VKDVVSWNAMVTGYSQ 222
           VH     SG   +VFVGN +V MY +C   + A +VF+ M+   V D+VSWN++V  Y Q
Sbjct: 167 VHAVVFASGFEWNVFVGNGLVSMYGRCGAWENARQVFDEMRERGVGDLVSWNSIVAAYMQ 226

Query: 221 VGRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIIT 42
            G    A+ +FERM  +                                   G  P+ ++
Sbjct: 227 GGDSIRAMKMFERMTED----------------------------------LGIRPDAVS 252

Query: 41  LLSLLSGCASVGA 3
           L+++L  CASVGA
Sbjct: 253 LVNVLPACASVGA 265


>ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Vitis vinifera]
          Length = 852

 Score =  220 bits (560), Expect = 1e-55
 Identities = 104/130 (80%), Positives = 121/130 (93%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQVG 216
           +VHGYA+RSGL EDVFVGNA+VDMYAKC +M+EA+KVFERMKVKDVVSWNAMVTGYSQ+G
Sbjct: 252 QVHGYALRSGLFEDVFVGNAVVDMYAKCGMMEEANKVFERMKVKDVVSWNAMVTGYSQIG 311

Query: 215 RFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITLL 36
           RF+DAL LFE+++ EKI+LNVVTWSAVI  YAQRGLG+EALDVFRQM+L GSEPN++TL+
Sbjct: 312 RFDDALGLFEKIREEKIELNVVTWSAVIAGYAQRGLGFEALDVFRQMRLCGSEPNVVTLV 371

Query: 35  SLLSGCASVG 6
           SLLSGCA  G
Sbjct: 372 SLLSGCALAG 381



 Score = 65.5 bits (158), Expect = 4e-09
 Identities = 35/108 (32%), Positives = 62/108 (57%), Gaps = 1/108 (0%)
 Frame = -2

Query: 395 EVHGYAVRSGL-CEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQV 219
           ++H Y +R+      +FV N ++DMY+K   +D A  VF+ M  ++ VSW +++TGY   
Sbjct: 500 QIHAYVLRNRFESAMLFVANCLIDMYSKSGDVDAARVVFDNMHQRNGVSWTSLMTGYGMH 559

Query: 218 GRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQM 75
           GR E+AL +F  M+   +  + VT+  V+ A +  G+  + ++ F  M
Sbjct: 560 GRGEEALQIFYEMQKVGLVPDGVTFVVVLYACSHSGMVDQGINYFNGM 607



 Score = 65.1 bits (157), Expect = 6e-09
 Identities = 42/133 (31%), Positives = 61/133 (45%), Gaps = 3/133 (2%)
 Frame = -2

Query: 392 VHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMK---VKDVVSWNAMVTGYSQ 222
           VH     SG   +VFVGN +V MY +C   + A +VF+ M+   V D+VSWN++V  Y Q
Sbjct: 148 VHAVVFASGFEWNVFVGNGLVSMYGRCGAWENARQVFDEMRERGVGDLVSWNSIVAAYMQ 207

Query: 221 VGRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIIT 42
            G    A+ +FERM  +                                   G  P+ ++
Sbjct: 208 GGDSIRAMKMFERMTED----------------------------------LGIRPDAVS 233

Query: 41  LLSLLSGCASVGA 3
           L+++L  CASVGA
Sbjct: 234 LVNVLPACASVGA 246


>ref|XP_002515835.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223544990|gb|EEF46504.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 655

 Score =  199 bits (507), Expect = 1e-49
 Identities = 97/131 (74%), Positives = 115/131 (87%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQVG 216
           +VHG+A+R GL EDVFV N++VDMYAKC LM  A+KVF+RM+ KDVVSWNAMVTGYSQ+G
Sbjct: 267 QVHGFAIRYGLFEDVFVANSLVDMYAKCGLMCIANKVFDRMQHKDVVSWNAMVTGYSQIG 326

Query: 215 RFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITLL 36
           +FEDAL LFE+M+ EKI L+VV+WSAVI  YAQRGLGYEAL+VFRQMQ+ G  PN +TL+
Sbjct: 327 KFEDALGLFEKMREEKIQLDVVSWSAVIAGYAQRGLGYEALNVFRQMQVCGLRPNEVTLV 386

Query: 35  SLLSGCASVGA 3
           SLLSGCASVGA
Sbjct: 387 SLLSGCASVGA 397



 Score = 77.8 bits (190), Expect = 8e-13
 Identities = 40/126 (31%), Positives = 71/126 (56%), Gaps = 1/126 (0%)
 Frame = -2

Query: 395 EVHGYAVRSGL-CEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQV 219
           ++H + +R    C+ ++V N ++DMY+K   MD A  VF+ MK ++ VSW +++TGY   
Sbjct: 516 QIHAFVLRDQYDCDVLYVANCLIDMYSKSGDMDAARLVFDNMKHRNTVSWTSLMTGYGMH 575

Query: 218 GRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITL 39
           G  E+A+ +F+ M+ E +  + +T+  V+ A +  G+  E +  F  M    SE N  + 
Sbjct: 576 GHGEEAIKVFDEMRREGLVSDGITFLVVLYACSHSGMVDEGIKYFHDMCKEFSEKNENSA 635

Query: 38  LSLLSG 21
           L +  G
Sbjct: 636 LEIKPG 641



 Score = 58.5 bits (140), Expect = 5e-07
 Identities = 37/105 (35%), Positives = 56/105 (53%), Gaps = 12/105 (11%)
 Frame = -2

Query: 395 EVHGYAV-------RSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKD--VVSWNA 243
           E H Y++       RS   +++ V NAI+DMY KC+ ++    +F  +  KD  VV+W A
Sbjct: 403 ETHCYSIKCVLNFDRSDPRDELLVVNAIIDMYTKCKDINVGRAIFNSIPPKDRNVVTWTA 462

Query: 242 MVTGYSQVGRFEDALSLFERMKME---KIDLNVVTWSAVIGAYAQ 117
           M+ GY+Q G   DAL LF +M  +    +  N  T S  + A A+
Sbjct: 463 MIGGYAQHGEANDALELFSQMLKQYNRSVKPNAFTISCALMACAR 507



 Score = 56.6 bits (135), Expect = 2e-06
 Identities = 34/100 (34%), Positives = 55/100 (55%), Gaps = 6/100 (6%)
 Frame = -2

Query: 392 VHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERM---KVKDVVSWNAMVTGYSQ 222
           +H     +G   +VFV NA+V MY +C     A ++F+ +   +V D+VSWN+M+  Y Q
Sbjct: 161 IHAIVCSTGFDSNVFVCNAVVAMYGRCGASSYARQMFDELLMGEVFDLVSWNSMIAVYLQ 220

Query: 221 VGRFEDALSLFERM-KMEKIDL--NVVTWSAVIGAYAQRG 111
            G  +  + LF RM K+ + D+  + V+   V+ A A  G
Sbjct: 221 SGDLKSGIELFRRMWKVGEFDIVPDAVSLVNVLPACASMG 260


>ref|XP_003544373.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Glycine max]
          Length = 986

 Score =  196 bits (497), Expect = 2e-48
 Identities = 96/131 (73%), Positives = 111/131 (84%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQVG 216
           +VHG+++RSGL +DVFVGNA+VDMYAKC  M+EA+KVF+RMK KDVVSWNAMVTGYSQ G
Sbjct: 385 QVHGFSIRSGLVDDVFVGNAVVDMYAKCGKMEEANKVFQRMKFKDVVSWNAMVTGYSQAG 444

Query: 215 RFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITLL 36
           R E ALSLFERM  E I+L+VVTW+AVI  YAQRG G EALDVFRQM   GS PN++TL+
Sbjct: 445 RLEHALSLFERMTEENIELDVVTWTAVITGYAQRGQGCEALDVFRQMCDCGSRPNVVTLV 504

Query: 35  SLLSGCASVGA 3
           SLLS C SVGA
Sbjct: 505 SLLSACVSVGA 515



 Score = 68.9 bits (167), Expect = 4e-10
 Identities = 36/108 (33%), Positives = 63/108 (58%), Gaps = 1/108 (0%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDV-FVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQV 219
           +VH Y +R+     + FV N ++DMY+K   +D A  VF+ M  ++ VSW +++TGY   
Sbjct: 634 QVHAYVLRNFYGSVMLFVANCLIDMYSKSGDVDTAQIVFDNMPQRNAVSWTSLMTGYGMH 693

Query: 218 GRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQM 75
           GR EDAL +F+ M+   +  + +T+  V+ A +  G+    ++ F +M
Sbjct: 694 GRGEDALRVFDEMRKVPLVPDGITFLVVLYACSHSGMVDHGINFFNRM 741



 Score = 60.5 bits (145), Expect = 1e-07
 Identities = 41/105 (39%), Positives = 60/105 (57%), Gaps = 12/105 (11%)
 Frame = -2

Query: 395 EVHGYAVRSGL--------CEDVFVGNAIVDMYAKCELMDEASKVFERM--KVKDVVSWN 246
           E H YA++  L         +D+ V N ++DMYAKC+  + A K+F+ +  K +DVV+W 
Sbjct: 521 ETHCYAIKFILNLDGPDPGADDLKVINGLIDMYAKCQSTEVARKMFDSVSPKDRDVVTWT 580

Query: 245 AMVTGYSQVGRFEDALSLFERM-KMEK-IDLNVVTWSAVIGAYAQ 117
            M+ GY+Q G   +AL LF  M KM+K I  N  T S  + A A+
Sbjct: 581 VMIGGYAQHGDANNALQLFSGMFKMDKSIKPNDFTLSCALVACAR 625



 Score = 57.8 bits (138), Expect = 9e-07
 Identities = 36/133 (27%), Positives = 59/133 (44%), Gaps = 3/133 (2%)
 Frame = -2

Query: 392 VHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERM---KVKDVVSWNAMVTGYSQ 222
           +H    RSG   +VFV NA+V MY KC  +  A  +F+ +    ++D+VSWN++V+ Y  
Sbjct: 281 LHATVSRSGFASNVFVCNAVVSMYGKCGALRHAHNMFDDLCHRGIQDLVSWNSVVSAYMW 340

Query: 221 VGRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIIT 42
                 AL+LF +M    +                                    P++I+
Sbjct: 341 ASDANTALALFHKMTTRHL----------------------------------MSPDVIS 366

Query: 41  LLSLLSGCASVGA 3
           L+++L  CAS+ A
Sbjct: 367 LVNILPACASLAA 379


>ref|XP_004137054.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Cucumis sativus]
           gi|449479088|ref|XP_004155501.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g16860-like [Cucumis sativus]
          Length = 855

 Score =  195 bits (496), Expect = 3e-48
 Identities = 94/131 (71%), Positives = 113/131 (86%)
 Frame = -2

Query: 395 EVHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQVG 216
           +VHG++VR+GL +DVFVGNA+V MYAKC  M+EA+KVFE +K KDVVSWNAMVTGYSQ+G
Sbjct: 255 QVHGFSVRNGLVDDVFVGNALVSMYAKCSKMNEANKVFEGIKKKDVVSWNAMVTGYSQIG 314

Query: 215 RFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQMQLSGSEPNIITLL 36
            F+ ALSLF+ M+ E I L+V+TWSAVI  YAQ+G G+EALDVFRQMQL G EPN++TL 
Sbjct: 315 SFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVFRQMQLYGLEPNVVTLA 374

Query: 35  SLLSGCASVGA 3
           SLLSGCASVGA
Sbjct: 375 SLLSGCASVGA 385



 Score = 77.8 bits (190), Expect = 8e-13
 Identities = 39/108 (36%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
 Frame = -2

Query: 395 EVHGYAVRS-GLCEDVFVGNAIVDMYAKCELMDEASKVFERMKVKDVVSWNAMVTGYSQV 219
           ++H YA+R+    E ++VGN ++DMY+K   +D A  VF+ MK+++VVSW +++TGY   
Sbjct: 503 QLHAYALRNENESEVLYVGNCLIDMYSKSGDIDAARAVFDNMKLRNVVSWTSLMTGYGMH 562

Query: 218 GRFEDALSLFERMKMEKIDLNVVTWSAVIGAYAQRGLGYEALDVFRQM 75
           GR E+AL LF++M+     ++ +T+  V+ A +  G+  + +  F  M
Sbjct: 563 GRGEEALHLFDQMQKLGFAVDGITFLVVLYACSHSGMVDQGMIYFHDM 610



 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 11/106 (10%)
 Frame = -2

Query: 395 EVHGYAVRSGLC-------EDVFVGNAIVDMYAKCELMDEASKVFERMKVKD--VVSWNA 243
           + H Y +++ L        +D+ V N ++DMYAKC+    A  +F+ ++ KD  VV+W  
Sbjct: 391 QTHAYVIKNILNLNWNDKEDDLLVLNGLIDMYAKCKSYRVARSIFDSIEGKDKNVVTWTV 450

Query: 242 MVTGYSQVGRFEDALSLFERMKMEKIDL--NVVTWSAVIGAYAQRG 111
           M+ GY+Q G   DAL LF ++  +K  L  N  T S  + A A+ G
Sbjct: 451 MIGGYAQHGEANDALKLFAQIFKQKTSLKPNAFTLSCALMACARLG 496



 Score = 59.3 bits (142), Expect = 3e-07
 Identities = 28/74 (37%), Positives = 47/74 (63%), Gaps = 3/74 (4%)
 Frame = -2

Query: 392 VHGYAVRSGLCEDVFVGNAIVDMYAKCELMDEASKVFERM---KVKDVVSWNAMVTGYSQ 222
           VH     +GL  +VF+ N+IV MY +C  +D+A ++F+ +   K++D+VSWN+++  Y Q
Sbjct: 149 VHAIVCANGLGSNVFICNSIVAMYGRCGALDDAHQMFDEVLERKIEDIVSWNSILAAYVQ 208

Query: 221 VGRFEDALSLFERM 180
            G+   AL +  RM
Sbjct: 209 GGQSRTALRIAFRM 222


Top