BLASTX nr result

ID: Dioscorea21_contig00038729 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00038729
         (443 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containi...   183   1e-44
emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]   183   1e-44
gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical ...   181   4e-44
ref|NP_172899.1| pentatricopeptide repeat-containing protein [Ar...   181   4e-44
ref|XP_002890056.1| hypothetical protein ARALYDRAFT_888825 [Arab...   173   1e-41

>ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g14470-like [Vitis vinifera]
          Length = 729

 Score =  183 bits (465), Expect = 1e-44
 Identities = 87/149 (58%), Positives = 115/149 (77%), Gaps = 2/149 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSD--RGIVDWNSMLSGFWKWSSKEDAC 174
           LG  SD ++RNAV+  YA+ GP   A  +FDE+ D  R + DWN+M+SG+WKW S+  A 
Sbjct: 124 LGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEGQAQ 183

Query: 175 KVFDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
            +FD MPERNV++WT MV+G A+  +LE ARR F+ MPERSVVSWNA+LSGY +NGL +E
Sbjct: 184 WLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEE 243

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACAS 441
            L LF++M+N+GI P+ET+WVTVISAC+S
Sbjct: 244 ALRLFDEMVNAGIEPDETTWVTVISACSS 272



 Score = 85.1 bits (209), Expect = 5e-15
 Identities = 52/174 (29%), Positives = 86/174 (49%), Gaps = 40/174 (22%)
 Frame = +1

Query: 34  AVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPERNV-- 207
           A++  YAK      A   FD M +R +V WN+MLSG+ +    E+A ++FDEM    +  
Sbjct: 199 AMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEALRLFDEMVNAGIEP 258

Query: 208 --VSWTVMVSGL-----------------------------------ARAGELEEARRVF 276
              +W  ++S                                     A+ G+L+ AR++F
Sbjct: 259 DETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTALLDMYAKFGDLDSARKLF 318

Query: 277 ELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNS-GIRPNETSWVTVISAC 435
             MP R+VV+WN++++GY +NG     + LF +M+ +  + P+E + V+VISAC
Sbjct: 319 NTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISAC 372



 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 53/179 (29%), Positives = 86/179 (48%), Gaps = 40/179 (22%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKVFDEM--- 192
           ++R A+L  YAK+G    A  LF+ M  R +V WNSM++G+ +      A ++F EM   
Sbjct: 296 FVRTALLDMYAKFGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITA 355

Query: 193 -----PERNVVS---------------WTV-----------------MVSGLARAGELEE 261
                 E  +VS               W V                 M+   +R G +E+
Sbjct: 356 KKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYSRCGSMED 415

Query: 262 ARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVTVISACA 438
           A+RVF+ M  R VVS+N ++SG+  +G   E + L + M   GI P+  +++ V++AC+
Sbjct: 416 AKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIGVLTACS 474


>emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]
          Length = 761

 Score =  183 bits (464), Expect = 1e-44
 Identities = 87/149 (58%), Positives = 115/149 (77%), Gaps = 2/149 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSD--RGIVDWNSMLSGFWKWSSKEDAC 174
           LG  SD ++RNAV+  YA+ GP   A  +FDE+ D  R + DWN+M+SG+WKW S+  A 
Sbjct: 124 LGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEGQAQ 183

Query: 175 KVFDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
            +FD MPERNV++WT MV+G A+  +LE ARR F+ MPERSVVSWNA+LSGY +NGL +E
Sbjct: 184 WLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEE 243

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACAS 441
            L LF++M+N+GI P+ET+WVTVISAC+S
Sbjct: 244 VLRLFDEMVNAGIEPDETTWVTVISACSS 272



 Score = 81.3 bits (199), Expect = 8e-14
 Identities = 57/179 (31%), Positives = 86/179 (48%), Gaps = 44/179 (24%)
 Frame = +1

Query: 31  NAVLCYYAKYGPFLCALSLFDEMSDRGI----VDWNSMLSGFWKWSSKEDAC-------- 174
           NA+L  YA+ G     L LFDEM + GI      W +++S     SS+ D C        
Sbjct: 229 NAMLSGYAQNGLAEEVLRLFDEMVNAGIEPDETTWVTVISAC---SSRGDPCLAASLVRT 285

Query: 175 ------------------------------KVFDEMPE-RNVVSWTVMVSGLARAGELEE 261
                                         ++FDE+   RN V+W  M+S   R G L+ 
Sbjct: 286 LHQKQIQLNCFVRTALLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYTRVGNLDS 345

Query: 262 ARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNS-GIRPNETSWVTVISAC 435
           AR +F  MP R+VV+WN++++GY +NG     + LF +M+ +  + P+E + V+VISAC
Sbjct: 346 ARELFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISAC 404



 Score = 75.5 bits (184), Expect = 4e-12
 Identities = 52/186 (27%), Positives = 85/186 (45%), Gaps = 40/186 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   +    NA++  Y + G    A  LF+ M  R +V WNSM++G+ +      A ++
Sbjct: 321 LGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVTWNSMIAGYAQNGQSAMAIEL 380

Query: 181 FDEM--------PERNVVS---------------WTV-----------------MVSGLA 240
           F EM         E  +VS               W V                 M+   +
Sbjct: 381 FKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYS 440

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G +E+A+RVF+ M  R VVS+N ++SG+  +G   E + L + M   GI P+  +++ 
Sbjct: 441 RCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIG 500

Query: 421 VISACA 438
           V++AC+
Sbjct: 501 VLTACS 506


>gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical protein from Arabidopsis
           thaliana gb|AC004044.1 and contains two domains PF|01535
           of unknown function [Arabidopsis thaliana]
          Length = 455

 Score =  181 bits (460), Expect = 4e-44
 Identities = 80/146 (54%), Positives = 108/146 (73%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   D Y+RN ++  Y K+     A  +FD++S R   DWN M+SG+WKW +KE+ACK+
Sbjct: 45  LGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMISGYWKWGNKEEACKL 104

Query: 181 FDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECL 360
           FD MPE +VVSWTVM++G A+  +LE AR+ F+ MPE+SVVSWNA+LSGY +NG  ++ L
Sbjct: 105 FDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDAL 164

Query: 361 FLFNKMMNSGIRPNETSWVTVISACA 438
            LFN M+  G+RPNET+WV VISAC+
Sbjct: 165 RLFNDMLRLGVRPNETTWVIVISACS 190



 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 36/93 (38%), Positives = 65/93 (69%), Gaps = 2/93 (2%)
 Frame = +1

Query: 163 EDACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKN 339
           + A ++F+E+  +RN+V+W  M+SG  R G++  AR++F+ MP+R+VVSWN++++GY  N
Sbjct: 231 QSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHN 290

Query: 340 GLPDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           G     +  F  M++ G  +P+E + ++V+SAC
Sbjct: 291 GQAALAIEFFEDMIDYGDSKPDEVTMISVLSAC 323



 Score = 74.7 bits (182), Expect = 7e-12
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   +    NA++  Y + G    A  LFD M  R +V WNS+++G+        A + 
Sbjct: 240 LGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEF 299

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M       P+                                  N   +  ++   A
Sbjct: 300 FEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYA 359

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + + +  NG   E L L +KM + GI P+  ++ +
Sbjct: 360 RGGNLWEAKRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTS 419

Query: 421 VISAC 435
           V++AC
Sbjct: 420 VLTAC 424



 Score = 63.5 bits (153), Expect = 2e-08
 Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEM-SDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPE 198
           +++ A+L  +AK      A  +F+E+ + R +V WN+M+SG+ +      A ++FD MP+
Sbjct: 215 FVKTALLDMHAKCRDIQSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPK 274

Query: 199 RNVVSWTVMVSGLARAGELEEARRVFELM 285
           RNVVSW  +++G A  G+   A   FE M
Sbjct: 275 RNVVSWNSLIAGYAHNGQAALAIEFFEDM 303


>ref|NP_172899.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|193806395|sp|Q9M9R6.2|PPR43_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g14470 gi|332191047|gb|AEE29168.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 540

 Score =  181 bits (460), Expect = 4e-44
 Identities = 80/146 (54%), Positives = 108/146 (73%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   D Y+RN ++  Y K+     A  +FD++S R   DWN M+SG+WKW +KE+ACK+
Sbjct: 130 LGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMISGYWKWGNKEEACKL 189

Query: 181 FDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECL 360
           FD MPE +VVSWTVM++G A+  +LE AR+ F+ MPE+SVVSWNA+LSGY +NG  ++ L
Sbjct: 190 FDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDAL 249

Query: 361 FLFNKMMNSGIRPNETSWVTVISACA 438
            LFN M+  G+RPNET+WV VISAC+
Sbjct: 250 RLFNDMLRLGVRPNETTWVIVISACS 275



 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 36/93 (38%), Positives = 65/93 (69%), Gaps = 2/93 (2%)
 Frame = +1

Query: 163 EDACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKN 339
           + A ++F+E+  +RN+V+W  M+SG  R G++  AR++F+ MP+R+VVSWN++++GY  N
Sbjct: 316 QSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHN 375

Query: 340 GLPDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           G     +  F  M++ G  +P+E + ++V+SAC
Sbjct: 376 GQAALAIEFFEDMIDYGDSKPDEVTMISVLSAC 408



 Score = 74.7 bits (182), Expect = 7e-12
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   +    NA++  Y + G    A  LFD M  R +V WNS+++G+        A + 
Sbjct: 325 LGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEF 384

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M       P+                                  N   +  ++   A
Sbjct: 385 FEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYA 444

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + + +  NG   E L L +KM + GI P+  ++ +
Sbjct: 445 RGGNLWEAKRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTS 504

Query: 421 VISAC 435
           V++AC
Sbjct: 505 VLTAC 509



 Score = 63.5 bits (153), Expect = 2e-08
 Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEM-SDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPE 198
           +++ A+L  +AK      A  +F+E+ + R +V WN+M+SG+ +      A ++FD MP+
Sbjct: 300 FVKTALLDMHAKCRDIQSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPK 359

Query: 199 RNVVSWTVMVSGLARAGELEEARRVFELM 285
           RNVVSW  +++G A  G+   A   FE M
Sbjct: 360 RNVVSWNSLIAGYAHNGQAALAIEFFEDM 388


>ref|XP_002890056.1| hypothetical protein ARALYDRAFT_888825 [Arabidopsis lyrata subsp.
           lyrata] gi|297335898|gb|EFH66315.1| hypothetical protein
           ARALYDRAFT_888825 [Arabidopsis lyrata subsp. lyrata]
          Length = 790

 Score =  173 bits (439), Expect = 1e-41
 Identities = 80/148 (54%), Positives = 110/148 (74%), Gaps = 2/148 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           +G+  D Y+RN ++  YAK+     A  +FD+++ R   DWN M+SG+WK+ +KE+ACK+
Sbjct: 130 MGIFKDPYVRNVIMDMYAKHESVESARKVFDQITHRKGSDWNVMISGYWKYGNKEEACKL 189

Query: 181 FDEMPER--NVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
           FD MPE   +VVSWTVM++G A+  +LE ARR F+ MPE+SVVSWNA+LSGY +NG  +E
Sbjct: 190 FDMMPEGKIDVVSWTVMITGFAKLKDLENARRCFDCMPEKSVVSWNAMLSGYSQNGFTEE 249

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACA 438
            L LFN M+  G+RPNET+WV VISAC+
Sbjct: 250 TLRLFNDMLRLGVRPNETTWVIVISACS 277



 Score = 84.0 bits (206), Expect = 1e-14
 Identities = 37/91 (40%), Positives = 65/91 (71%), Gaps = 2/91 (2%)
 Frame = +1

Query: 169 ACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGL 345
           A ++F+E+  ++N+V+W  M+SG  R G++  AR++F+ MP+R+VVSWN+V++GY  NG 
Sbjct: 320 ARRIFNELGTQKNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSVIAGYAHNGQ 379

Query: 346 PDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           P   +  F  M++ G  +P+E + ++V+SAC
Sbjct: 380 PALAIEFFEDMIDYGDSKPDEVTMISVLSAC 410



 Score = 74.3 bits (181), Expect = 1e-11
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG   +    NA++  Y + G    A  LFD M  R +V WNS+++G+        A + 
Sbjct: 327 LGTQKNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSVIAGYAHNGQPALAIEF 386

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M       P+                                  N   +  ++   A
Sbjct: 387 FEDMIDYGDSKPDEVTMISVLSACGHMGDLELGDCIVDYIGKKQIKLNDSGYRSLIFMYA 446

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + S +  NG   + L L +KM + GI P+  ++ +
Sbjct: 447 RCGNLWEAKRVFDEMKERDVVSYNTLFSAFAANGDGVKTLNLLSKMKDEGIEPDRVTYTS 506

Query: 421 VISAC 435
           V++AC
Sbjct: 507 VLTAC 511

