BLASTX nr result

ID: Dioscorea21_contig00026344

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00026344
         (877 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275085.1| PREDICTED: pentatricopeptide repeat-containi...   240   3e-61
ref|NP_187126.1| pentatricopeptide repeat-containing protein [Ar...   228   1e-57
ref|XP_002884467.1| pentatricopeptide repeat-containing protein ...   226   5e-57
ref|XP_003559839.1| PREDICTED: pentatricopeptide repeat-containi...   226   7e-57
gb|EAY91821.1| hypothetical protein OsI_13463 [Oryza sativa Indi...   219   5e-55

>ref|XP_002275085.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04750,
           mitochondrial-like [Vitis vinifera]
          Length = 654

 Score =  240 bits (613), Expect = 3e-61
 Identities = 125/292 (42%), Positives = 182/292 (62%)
 Frame = +2

Query: 2   SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181
           SAI HP               +PNL+I+NTMISALS S+NQS AFY S+L  C+ P+  T
Sbjct: 73  SAITHPENLDMAVLLFRHHTPHPNLYIYNTMISALSLSLNQSFAFYNSLLSSCIYPNRST 132

Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDK 361
                           QIH   II G  ++ Y+ N+L+K+YLEN+++G+  ++ +     
Sbjct: 133 FLFLLQASKFLSQVM-QIHCHAIITGSFYYGYLQNTLMKIYLENEKMGLAYQVFQQMA-A 190

Query: 362 KDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSV 541
            D V FN ++  Y KKG  ++AL+   E++G G++PD++T++ LL+CCG+L     GKSV
Sbjct: 191 PDAVSFNIMIFGYAKKGHNIEALKFLHEMVGLGLKPDEFTMLGLLICCGRLGDAQLGKSV 250

Query: 542 HGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVG 721
           H W+ RR   +   L+L NALLDMYVKC+E+  A  IF+ I  KD++SWN MI G+  VG
Sbjct: 251 HAWIERRGLIKSSNLILNNALLDMYVKCKELRIAQSIFNVIVRKDTISWNTMIAGYAKVG 310

Query: 722 EFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877
             ++A+  FE+MP +DLVSWNS+++GY QKG+   V  LF  M+++N + PD
Sbjct: 311 NLEIAHNFFEDMPCRDLVSWNSIIAGYAQKGDCLMVQRLFENMVAEN-IWPD 361



 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 74/311 (23%), Positives = 134/311 (43%), Gaps = 42/311 (13%)
 Frame = +2

Query: 68   PNLFIFNTMISALSFSVN--QSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXXXX--QI 235
            P+   FN MI   +   +  ++L F   M+ L + PDE T                   +
Sbjct: 191  PDAVSFNIMIFGYAKKGHNIEALKFLHEMVGLGLKPDEFTMLGLLICCGRLGDAQLGKSV 250

Query: 236  HAQVIIFGF--SFHAYVHNSLVKMYLENDEIGVVEKLVR--------------------- 346
            HA +   G   S +  ++N+L+ MY++  E+ + + +                       
Sbjct: 251  HAWIERRGLIKSSNLILNNALLDMYVKCKELRIAQSIFNVIVRKDTISWNTMIAGYAKVG 310

Query: 347  ------------PCGDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVS 490
                        PC   +D+V +N++++ Y +KG  L    +F+ ++   + PD  TI++
Sbjct: 311  NLEIAHNFFEDMPC---RDLVSWNSIIAGYAQKGDCLMVQRLFENMVAENIWPDFVTIIN 367

Query: 491  LLVCCGQLKGVLAGKSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISE 670
            L+    ++  +  G+ +HGWVVR          L +A +DMY KC  I  A  +F  ++E
Sbjct: 368  LVSAAAEIGALHHGRWIHGWVVRMQ--MKIDAFLGSAFIDMYWKCGSIKRACMVFREVTE 425

Query: 671  KDSVSWNIMIMGFNNVGEFDLAYKAFEEMPE---KDLVSWNSLLSGYLQKGNYKRVIELF 841
            KD   W  MI GF   G    A + F EM E    + V++ ++L+     G   + + +F
Sbjct: 426  KDVTVWTTMITGFAFHGYGSKALQLFYEMQEYVMPNQVTFVAVLTACSHSGFVSQGLRIF 485

Query: 842  HFMLSQNDVKP 874
            + M  +  ++P
Sbjct: 486  NSMKERYGIEP 496


>ref|NP_187126.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75207287|sp|Q9SR01.1|PP212_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g04750, mitochondrial; Flags: Precursor
           gi|6175175|gb|AAF04901.1|AC011437_16 hypothetical
           protein [Arabidopsis thaliana]
           gi|332640610|gb|AEE74131.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 661

 Score =  228 bits (581), Expect = 1e-57
 Identities = 124/294 (42%), Positives = 171/294 (58%), Gaps = 2/294 (0%)
 Frame = +2

Query: 2   SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181
           SAI +P               NPN+F++NTMISA+S S N+    Y SM+   V PD  T
Sbjct: 76  SAITYPENLDLAKLLFLNFTPNPNVFVYNTMISAVSSSKNECFGLYSSMIRHRVSPDRQT 135

Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFG-FSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGD 358
                           QIH  +I+ G  S   Y+ NSLVK Y+E    GV EK+      
Sbjct: 136 FLYLMKASSFLSEVK-QIHCHIIVSGCLSLGNYLWNSLVKFYMELGNFGVAEKVFARM-P 193

Query: 359 KKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKS 538
             D+  FN ++  Y K+G  L+AL+++ +++  G+EPD+YT++SLLVCCG L  +  GK 
Sbjct: 194 HPDVSSFNVMIVGYAKQGFSLEALKLYFKMVSDGIEPDEYTVLSLLVCCGHLSDIRLGKG 253

Query: 539 VHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNV 718
           VHGW+ RR       L+L NALLDMY KC+E G A + FD + +KD  SWN M++GF  +
Sbjct: 254 VHGWIERRGPVYSSNLILSNALLDMYFKCKESGLAKRAFDAMKKKDMRSWNTMVVGFVRL 313

Query: 719 GEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVI-ELFHFMLSQNDVKPD 877
           G+ + A   F++MP++DLVSWNSLL GY +KG  +R + ELF+ M     VKPD
Sbjct: 314 GDMEAAQAVFDQMPKRDLVSWNSLLFGYSKKGCDQRTVRELFYEMTIVEKVKPD 367



 Score = 73.2 bits (178), Expect = 8e-11
 Identities = 54/161 (33%), Positives = 83/161 (51%), Gaps = 2/161 (1%)
 Frame = +2

Query: 359 KKDIVLFNTLMSWYVKKGCYLDAL-EVFDEL-MGSGVEPDQYTIVSLLVCCGQLKGVLAG 532
           K+D+V +N+L+  Y KKGC    + E+F E+ +   V+PD+ T+VSL+        +  G
Sbjct: 328 KRDLVSWNSLLFGYSKKGCDQRTVRELFYEMTIVEKVKPDRVTMVSLISGAANNGELSHG 387

Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712
           + VHG V+R +  +G    L +AL+DMY KC  I  A  +F   +EKD   W  MI G  
Sbjct: 388 RWVHGLVIR-LQLKG-DAFLSSALIDMYCKCGIIERAFMVFKTATEKDVALWTSMITGLA 445

Query: 713 NVGEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVIE 835
             G    A + F  M E+ +   N  L   L   ++  ++E
Sbjct: 446 FHGNGQQALQLFGRMQEEGVTPNNVTLLAVLTACSHSGLVE 486


>ref|XP_002884467.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297330307|gb|EFH60726.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 657

 Score =  226 bits (576), Expect = 5e-57
 Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 4/296 (1%)
 Frame = +2

Query: 2   SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181
           SAI +P               NPN+F++NTMISA+S S N+    Y SM+   V PD  T
Sbjct: 75  SAITYPENLDLAKLLFLDFTPNPNVFVYNTMISAVSSSKNECFGLYSSMIRYRVSPDRQT 134

Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFG-FSFHAYVHNSLVKMYLENDEIGVVEKL--VRPC 352
                           QIH  +I+ G  S   Y+ NSLVK Y+E   +G  EK+  + P 
Sbjct: 135 FLHLMKASSFLSEVK-QIHCHIIVSGCLSLGNYLWNSLVKFYMELGSLGFAEKVFAIMP- 192

Query: 353 GDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAG 532
             + D+  FN ++  Y K+G  L+ALE++ +++  G+EPD+YT++ LLVCCG L  +  G
Sbjct: 193 --QPDVSSFNVMIVGYAKQGFGLEALELYYKMVSDGIEPDEYTLLGLLVCCGHLSDIRLG 250

Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712
           K VHGW+ RR       L+L NALLDMY KC+E G A + FD + +KD  SWN M++GF 
Sbjct: 251 KGVHGWIERRGPVYSSNLILRNALLDMYFKCKESGLAKRAFDALKKKDMRSWNTMVVGFV 310

Query: 713 NVGEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVI-ELFHFMLSQNDVKPD 877
            +G+ + A   F++MP++DLVSWNSLL  Y +KG  +R + ELF+ ML    VKPD
Sbjct: 311 RLGDMEAAQAVFDQMPQRDLVSWNSLLFCYSKKGCDQRAVRELFYEMLIVEKVKPD 366



 Score = 76.6 bits (187), Expect = 7e-12
 Identities = 58/179 (32%), Positives = 95/179 (53%), Gaps = 6/179 (3%)
 Frame = +2

Query: 359 KKDIVLFNTLMSWYVKKGCYLDAL-EVFDE-LMGSGVEPDQYTIVSLLVCCGQLKGVLAG 532
           ++D+V +N+L+  Y KKGC   A+ E+F E L+   V+PD+ T+VSL+        +  G
Sbjct: 327 QRDLVSWNSLLFCYSKKGCDQRAVRELFYEMLIVEKVKPDRVTMVSLISGAANNGELSHG 386

Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712
           + VHG ++R +  +G    L +AL+DMY KC  I  A  +F   +EKD   W  MI GF 
Sbjct: 387 RWVHGLMIR-LQLEG-DAFLSSALIDMYCKCGLIERAFMVFKTATEKDVPLWTSMITGFA 444

Query: 713 NVGEFDLAYKAFEEMPEKDL----VSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877
             G    A + F+ M E+D+    V+  ++L+     G  +  + +F+ M  +    P+
Sbjct: 445 FHGYGQQALQLFKRMQEEDVTPNKVTLLAVLTACSHSGLVEEGLHVFYHMKEKFGFHPE 503


>ref|XP_003559839.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04750,
           mitochondrial-like [Brachypodium distachyon]
          Length = 601

 Score =  226 bits (575), Expect = 7e-57
 Identities = 122/270 (45%), Positives = 167/270 (61%)
 Frame = +2

Query: 68  PNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXXXXQIHAQV 247
           PNL+ +N ++SALS S ++S+A YKSML     PDE T                Q+HA V
Sbjct: 85  PNLYCYNLVLSALSSSQSRSVALYKSMLASSASPDEKTFLSLLKSVGCASVGK-QVHAHV 143

Query: 248 IIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDKKDIVLFNTLMSWYVKKGCYLDA 427
           ++ G     Y+ NSL+KMYL+  +    E + +      D+V  N ++S YVK GC ++A
Sbjct: 144 LVNGLHSRVYLRNSLIKMYLDAGDAETAEAMFQSV-PVPDVVSCNIMLSGYVKGGCVVNA 202

Query: 428 LEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSVHGWVVRRMGYQGWGLVLCNALL 607
           L++F ++    +  DQY  V+LL CCG+LK  L G+SVHG VVRRM  +  GL+L NALL
Sbjct: 203 LQLFRDMASREIGVDQYAAVALLSCCGRLKNALLGRSVHGVVVRRMDIKDRGLILSNALL 262

Query: 608 DMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVGEFDLAYKAFEEMPEKDLVSWNS 787
           DMY KC E+ TAM++F    EKD +SWN MI GF N G  DLA K F + P +DL+SWN+
Sbjct: 263 DMYAKCGEMNTAMRVFGEAKEKDDISWNTMIAGFANDGMLDLASKFFFDAPCRDLISWNT 322

Query: 788 LLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877
           LL+GY +   +  V+ELF+ MLS   V+PD
Sbjct: 323 LLAGYGRCREFAAVMELFNDMLSSR-VRPD 351



 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 4/180 (2%)
 Frame = +2

Query: 347 PCGDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVL 526
           PC   +D++ +NTL++ Y +   +   +E+F++++ S V PD+ T V+L+        + 
Sbjct: 313 PC---RDLISWNTLLAGYGRCREFAAVMELFNDMLSSRVRPDKVTAVTLISAAVSKGALN 369

Query: 527 AGKSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMG 706
            GKSVHGWV++  G Q     L + L+DMY KC  +  A  +F++  +KD   W  MI G
Sbjct: 370 LGKSVHGWVLKEHGTQ--DAFLASTLVDMYCKCGNVKLAYAVFEKALDKDVTLWTAMISG 427

Query: 707 FNNVGEFDLAYKAFEEMPEKDL----VSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKP 874
               G    A   F  M  + +    V+  ++LS     G      E+F+ M  + +++P
Sbjct: 428 LAFHGHGTEALDLFWNMQNEGVAPNGVTLVTVLSACSHAGLLDEGCEIFYTMQKRFNIEP 487


>gb|EAY91821.1| hypothetical protein OsI_13463 [Oryza sativa Indica Group]
          Length = 468

 Score =  219 bits (559), Expect = 5e-55
 Identities = 120/279 (43%), Positives = 161/279 (57%), Gaps = 9/279 (3%)
 Frame = +2

Query: 68  PNLFIFNTMIS--------ALSFSVNQSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXX 223
           PNL+I+N M+S        A S    +  A Y SML   + PDE T              
Sbjct: 86  PNLYIYNLMLSSAAAAAAAASSSPSRRPAALYMSMLASSIHPDEQTFLSLLKSVDAERRS 145

Query: 224 XX-QIHAQVIIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDKKDIVLFNTLMSWY 400
              Q+HA V++ G     Y+ NSL+KMYL+  ++   E + R C    D V  N ++S Y
Sbjct: 146 VGKQVHAHVVVTGLHSRVYLRNSLIKMYLDAGDVEAAEAMFR-CAPTADAVSCNIMLSGY 204

Query: 401 VKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSVHGWVVRRMGYQGW 580
           VK GC   AL  F  +   G+  DQYT V+LL CCG+LK  + G+SVHG VVRR+G    
Sbjct: 205 VKGGCSGKALRFFRGMASRGIGVDQYTAVALLACCGRLKKAVLGRSVHGVVVRRIGVADR 264

Query: 581 GLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVGEFDLAYKAFEEMP 760
           GL+L NALLDMY KC E+ TAM++FD   E+D +SWN M+ GF N G  DLA K F E+P
Sbjct: 265 GLILSNALLDMYAKCGEMNTAMRVFDEAGERDGISWNTMVAGFANAGLLDLASKYFGEVP 324

Query: 761 EKDLVSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877
            +D++SWN+LL+GY +   +   + LFH ML+ + V PD
Sbjct: 325 ARDIISWNALLAGYARYEEFSATMILFHDMLA-SSVIPD 362

