BLASTX nr result

ID: Dioscorea21_contig00026342

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00026342
         (957 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002532904.1| pentatricopeptide repeat-containing protein,...   397   e-108
ref|XP_004155878.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_004134328.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_003552546.1| PREDICTED: pentatricopeptide repeat-containi...   385   e-105
ref|XP_002278886.1| PREDICTED: pentatricopeptide repeat-containi...   385   e-105

>ref|XP_002532904.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223527338|gb|EEF29484.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 604

 Score =  397 bits (1020), Expect = e-108
 Identities = 190/318 (59%), Positives = 248/318 (77%)
 Frame = +3

Query: 3    KFNLYDDPFVAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVL 182
            K NL++D +VAPKLIS++S CH + LAVNVFNQ++ PN  L+NT+IR +  +SQ   A  
Sbjct: 48   KRNLHNDLYVAPKLISAFSLCHQMNLAVNVFNQIQDPNVHLYNTLIRAHVQNSQSLKAFA 107

Query: 183  AFLKMQKDGVFPDNFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYS 362
             F  MQK+G+F DNFT+PFLLKAC+G   L  V+M+H  + K G   D+FVPNSLIDSYS
Sbjct: 108  TFFDMQKNGLFADNFTYPFLLKACNGKGWLPTVQMIHCHVEKYGFFGDLFVPNSLIDSYS 167

Query: 363  KVGRCGLEFAKRVFDEMPERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWNSML 542
            K G  G+ +A ++F EM E+D+VSWNSMI GLV+AG+L  AR++F++M  RD VSWN++L
Sbjct: 168  KCGLLGVNYAMKLFMEMGEKDLVSWNSMIGGLVKAGDLGRARKLFDEMAERDAVSWNTIL 227

Query: 543  DGYVKTGDMDEAFALFERMPERNVVSWCSLVSGYCENGDMDMARILFDRMPSKNLVSWTV 722
            DGYVK G+M +AF LFE+MPERNVVSW ++VSGYC+ GDM+MAR+LFD+MP KNLV+WT+
Sbjct: 228  DGYVKAGEMSQAFNLFEKMPERNVVSWSTMVSGYCKTGDMEMARMLFDKMPFKNLVTWTI 287

Query: 723  MISGYAEKGLAGEAYRLFTQMKEAGLEADEAAIVSILAACTESGLIGFGKKVHAYVEGTE 902
            +ISG+AEKGLA EA  L+ QM+ AGL+ D+  ++SILAAC ESGL+  GKKVHA ++   
Sbjct: 288  IISGFAEKGLAKEATTLYNQMEAAGLKPDDGTLISILAACAESGLLVLGKKVHASIKKIR 347

Query: 903  LKFVIRVCNALVDMYAKC 956
            +K  + V NALVDMYAKC
Sbjct: 348  IKCSVNVSNALVDMYAKC 365



 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 74/303 (24%), Positives = 135/303 (44%), Gaps = 44/303 (14%)
 Frame = +3

Query: 42   LISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVLAFLKMQKDGVFPD 221
            ++S Y     + +A  +F+++   N + +  II  +       +A   + +M+  G+ PD
Sbjct: 257  MVSGYCKTGDMEMARMLFDKMPFKNLVTWTIIISGFAEKGLAKEATTLYNQMEAAGLKPD 316

Query: 222  NFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYSKVGRCGLEFAKRV 401
            + T   +L AC+    L   K +HA I K+ +   + V N+L+D Y+K GR  ++ A  +
Sbjct: 317  DGTLISILAACAESGLLVLGKKVHASIKKIRIKCSVNVSNALVDMYAKCGR--VDKALSI 374

Query: 402  FDEMPERDVVSWNSMIAGLVRAGELMEAREVFNQMP------------------------ 509
            F+EM  RD+VSWN M+ GL   G   +A ++F++M                         
Sbjct: 375  FNEMSMRDLVSWNCMLQGLAMHGHGEKAIQLFSKMQQEGFKPDKVTLIAILCACTHAGFV 434

Query: 510  ----------NRD------IVSWNSMLDGYVKTGDMDEAFALFERMP-ERNVVSWCSLVS 638
                       RD      I  +  M+D   + G ++EAF L + MP E N V W +L+ 
Sbjct: 435  DQGLSYFNSMERDHGIVPHIEHYGCMIDLLGRGGRLEEAFRLVQSMPMEPNDVIWGTLLG 494

Query: 639  GYCENGDMDMARILFDR---MPSKNLVSWTVMISGYAEKGLAGEAYRLFTQMKEAGLEAD 809
                +  + +A  + DR   +   +  +++++ + +A  G       +  QMK  G++  
Sbjct: 495  ACRVHNAVPLAEKVLDRLITLEQSDPGNYSMLSNIFAAAGDWNSVANMRLQMKSTGVQKP 554

Query: 810  EAA 818
              A
Sbjct: 555  SGA 557


>ref|XP_004155878.1| PREDICTED: pentatricopeptide repeat-containing protein At3g29230-like
            [Cucumis sativus]
          Length = 561

 Score =  391 bits (1005), Expect = e-106
 Identities = 188/319 (58%), Positives = 242/319 (75%), Gaps = 1/319 (0%)
 Frame = +3

Query: 3    KFNLYDDPFVAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVL 182
            K NL+ D FV PKLIS++S C  + LA N FNQV++PN  L+NT+IR + ++SQ S A  
Sbjct: 71   KSNLHVDLFVVPKLISAFSLCRQMLLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA 130

Query: 183  AFLKMQKDGVFPDNFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYS 362
             F  MQ+DG + DNFTFPFLLK C+G+  L  ++ +HAQI K G +SD+FVPNSLIDSYS
Sbjct: 131  TFFAMQRDGHYADNFTFPFLLKVCTGNVWLPVIESVHAQIEKFGFMSDVFVPNSLIDSYS 190

Query: 363  KVGRCGLEFAKRVFDEM-PERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWNSM 539
            K G CG+  AK++F  M   RDVVSWNSMI+GL + G   EAR+VF++MP +D +SWN+M
Sbjct: 191  KCGSCGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPEKDGISWNTM 250

Query: 540  LDGYVKTGDMDEAFALFERMPERNVVSWCSLVSGYCENGDMDMARILFDRMPSKNLVSWT 719
            LDGYVK G MD+AF LF+ MPERNVVSW ++V GYC+ GDM+MAR+LFD+MP KNLVSWT
Sbjct: 251  LDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWT 310

Query: 720  VMISGYAEKGLAGEAYRLFTQMKEAGLEADEAAIVSILAACTESGLIGFGKKVHAYVEGT 899
            +++SG+AEKGLA EA  LF QM++A L+ D   ++SILAAC ESGL+G G+K+HA ++  
Sbjct: 311  IIVSGFAEKGLAREAISLFDQMEKACLKLDNGTVMSILAACAESGLLGLGEKIHASIKNN 370

Query: 900  ELKFVIRVCNALVDMYAKC 956
              K    + NALVDMYAKC
Sbjct: 371  NFKCTTEISNALVDMYAKC 389


>ref|XP_004134328.1| PREDICTED: pentatricopeptide repeat-containing protein At3g29230-like
            [Cucumis sativus]
          Length = 601

 Score =  391 bits (1005), Expect = e-106
 Identities = 188/319 (58%), Positives = 242/319 (75%), Gaps = 1/319 (0%)
 Frame = +3

Query: 3    KFNLYDDPFVAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVL 182
            K NL+ D FV PKLIS++S C  + LA N FNQV++PN  L+NT+IR + ++SQ S A  
Sbjct: 45   KSNLHVDLFVVPKLISAFSLCRQMLLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA 104

Query: 183  AFLKMQKDGVFPDNFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYS 362
             F  MQ+DG + DNFTFPFLLK C+G+  L  ++ +HAQI K G +SD+FVPNSLIDSYS
Sbjct: 105  TFFAMQRDGHYADNFTFPFLLKVCTGNVWLPVIESVHAQIEKFGFMSDVFVPNSLIDSYS 164

Query: 363  KVGRCGLEFAKRVFDEM-PERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWNSM 539
            K G CG+  AK++F  M   RDVVSWNSMI+GL + G   EAR+VF++MP +D +SWN+M
Sbjct: 165  KCGSCGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPEKDGISWNTM 224

Query: 540  LDGYVKTGDMDEAFALFERMPERNVVSWCSLVSGYCENGDMDMARILFDRMPSKNLVSWT 719
            LDGYVK G MD+AF LF+ MPERNVVSW ++V GYC+ GDM+MAR+LFD+MP KNLVSWT
Sbjct: 225  LDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWT 284

Query: 720  VMISGYAEKGLAGEAYRLFTQMKEAGLEADEAAIVSILAACTESGLIGFGKKVHAYVEGT 899
            +++SG+AEKGLA EA  LF QM++A L+ D   ++SILAAC ESGL+G G+K+HA ++  
Sbjct: 285  IIVSGFAEKGLAREAISLFDQMEKACLKLDNGTVMSILAACAESGLLGLGEKIHASIKNN 344

Query: 900  ELKFVIRVCNALVDMYAKC 956
              K    + NALVDMYAKC
Sbjct: 345  NFKCTTEISNALVDMYAKC 363


>ref|XP_003552546.1| PREDICTED: pentatricopeptide repeat-containing protein At3g29230-like
            [Glycine max]
          Length = 604

 Score =  385 bits (989), Expect = e-105
 Identities = 187/319 (58%), Positives = 240/319 (75%), Gaps = 1/319 (0%)
 Frame = +3

Query: 3    KFNLYDDPFVAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVL 182
            K NL+ D FVAPKLI+++S C  L  AVNVFN V HPN  L+N+IIR + ++S H     
Sbjct: 46   KANLHQDLFVAPKLIAAFSLCRHLASAVNVFNHVPHPNVHLYNSIIRAHAHNSSHRSLPF 105

Query: 183  -AFLKMQKDGVFPDNFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSY 359
             AF +MQK+G+FPDNFT+PFLLKACSG  +L  V+M+HA + K+G   DIFVPNSLIDSY
Sbjct: 106  NAFFQMQKNGLFPDNFTYPFLLKACSGPSSLPLVRMIHAHVEKIGFYGDIFVPNSLIDSY 165

Query: 360  SKVGRCGLEFAKRVFDEMPERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWNSM 539
            S+ G  GL+ A  +F  M ERDVV+WNSMI GLVR GEL  A ++F++MP+RD+VSWN+M
Sbjct: 166  SRCGNAGLDGAMSLFLAMEERDVVTWNSMIGGLVRCGELQGACKLFDEMPDRDMVSWNTM 225

Query: 540  LDGYVKTGDMDEAFALFERMPERNVVSWCSLVSGYCENGDMDMARILFDRMPSKNLVSWT 719
            LDGY K G+MD AF LFERMP RN+VSW ++V GY + GDMDMAR+LFDR P KN+V WT
Sbjct: 226  LDGYAKAGEMDTAFELFERMPWRNIVSWSTMVCGYSKGGDMDMARMLFDRCPVKNVVLWT 285

Query: 720  VMISGYAEKGLAGEAYRLFTQMKEAGLEADEAAIVSILAACTESGLIGFGKKVHAYVEGT 899
             +I+GYAEKGLA EA  L+ +M+EAG+  D+  ++SILAAC ESG++G GK++HA +   
Sbjct: 286  TIIAGYAEKGLAREATELYGKMEEAGMRPDDGFLLSILAACAESGMLGLGKRIHASMRRW 345

Query: 900  ELKFVIRVCNALVDMYAKC 956
              +   +V NA +DMYAKC
Sbjct: 346  RFRCGAKVLNAFIDMYAKC 364


>ref|XP_002278886.1| PREDICTED: pentatricopeptide repeat-containing protein At3g29230
           [Vitis vinifera]
          Length = 594

 Score =  385 bits (989), Expect = e-105
 Identities = 189/318 (59%), Positives = 240/318 (75%)
 Frame = +3

Query: 3   KFNLYDDPFVAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVL 182
           K NL+ + FV  KLI+++S C  + LAVNVFNQ++ P+ LL+NT+IR +  +S+   A  
Sbjct: 42  KANLHRESFVGQKLIAAFSLCRQMTLAVNVFNQIQDPDVLLYNTLIRAHVRNSEPLLAFS 101

Query: 183 AFLKMQKDGVFPDNFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYS 362
            F +MQ  GV  DNFT+PFLLKACSG   +  V+M+HAQ+ K+G   DIFVPNSLIDSY 
Sbjct: 102 VFFEMQDSGVCADNFTYPFLLKACSGKVWVRVVEMIHAQVEKMGFCLDIFVPNSLIDSYF 161

Query: 363 KVGRCGLEFAKRVFDEMPERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWNSML 542
           K G  G+  A++VF+ M ERD VSWNSMI GLV+ GEL EAR +F++MP RD VSWN++L
Sbjct: 162 KCGLDGVAAARKVFEVMAERDTVSWNSMIGGLVKVGELGEARRLFDEMPERDTVSWNTIL 221

Query: 543 DGYVKTGDMDEAFALFERMPERNVVSWCSLVSGYCENGDMDMARILFDRMPSKNLVSWTV 722
           DGYVK G+M+ AF LFE+MP RNVVSW ++V GY + GDMDMARILFD+MP KNLV WT+
Sbjct: 222 DGYVKAGEMNAAFELFEKMPARNVVSWSTMVLGYSKAGDMDMARILFDKMPVKNLVPWTI 281

Query: 723 MISGYAEKGLAGEAYRLFTQMKEAGLEADEAAIVSILAACTESGLIGFGKKVHAYVEGTE 902
           MISGYAEKGLA +A  L+ QM+EAGL+ D+  ++SIL+AC  SGL+G GK+VHA +E T 
Sbjct: 282 MISGYAEKGLAKDAINLYNQMEEAGLKFDDGTVISILSACAVSGLLGLGKRVHASIERTR 341

Query: 903 LKFVIRVCNALVDMYAKC 956
            K    V NAL+DMYAKC
Sbjct: 342 FKCSTPVSNALIDMYAKC 359



 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 70/264 (26%), Positives = 119/264 (45%), Gaps = 11/264 (4%)
 Frame = +3

Query: 42   LISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVLAFLKMQKDGVFPD 221
            ++  YS    + +A  +F+++   N + +  +I  Y       DA+  + +M++ G+  D
Sbjct: 251  MVLGYSKAGDMDMARILFDKMPVKNLVPWTIMISGYAEKGLAKDAINLYNQMEEAGLKFD 310

Query: 222  NFTFPFLLKACSGHCALSFVKMMHAQIVKLGVLSDIFVPNSLIDSYSKVGRCGLEFAKRV 401
            + T   +L AC+    L   K +HA I +        V N+LID Y+K G   LE A  +
Sbjct: 311  DGTVISILSACAVSGLLGLGKRVHASIERTRFKCSTPVSNALIDMYAKCG--SLENALSI 368

Query: 402  FDEMPERDVVSWNSMIAGLVRAGELMEAREVFNQMPNR----DIVSWNSMLDGYVKTGDM 569
            F  M  +DVVSWN++I GL   G   +A ++F++M       D V++  +L      G +
Sbjct: 369  FHGMVRKDVVSWNAIIQGLAMHGHGEKALQLFSRMKGEGFVPDKVTFVGVLCACTHAGFV 428

Query: 570  DEAFALFERMPERN------VVSWCSLVSGYCENGDMDMARILFDRMP-SKNLVSWTVMI 728
            DE    F  M ER+      V  +  +V      G +  A  L   MP   N + W  ++
Sbjct: 429  DEGLHYFHAM-ERDYGVPPEVEHYGCMVDLLGRGGRLKEAFRLVHSMPLEPNAIIWGTLL 487

Query: 729  SGYAEKGLAGEAYRLFTQMKEAGL 800
                     G A  +F ++ ++ L
Sbjct: 488  GACRMHSATGLAEEVFDRLVKSEL 511



 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 51/201 (25%), Positives = 89/201 (44%), Gaps = 6/201 (2%)
 Frame = +3

Query: 30  VAPKLISSYSNCHLLPLAVNVFNQVRHPNALLFNTIIREYGNHSQHSDAVLAFLKMQKDG 209
           V+  LI  Y+ C  L  A+++F+ +   + + +N II+    H     A+  F +M+ +G
Sbjct: 348 VSNALIDMYAKCGSLENALSIFHGMVRKDVVSWNAIIQGLAMHGHGEKALQLFSRMKGEG 407

Query: 210 VFPDNFTFPFLLKACSGHCAL--SFVKMMHAQIVKLGVLSDIFVPNSLIDSYSKVGRCGL 383
             PD  TF  +L AC+ H       +   HA     GV  ++     ++D   + GR  L
Sbjct: 408 FVPDKVTFVGVLCACT-HAGFVDEGLHYFHAMERDYGVPPEVEHYGCMVDLLGRGGR--L 464

Query: 384 EFAKRVFDEMP-ERDVVSWNSMIAGLVRAGELMEAREVFNQMPNRDIVSWN--SMLDG-Y 551
           + A R+   MP E + + W +++           A EVF+++   ++      SML   Y
Sbjct: 465 KEAFRLVHSMPLEPNAIIWGTLLGACRMHSATGLAEEVFDRLVKSELSDSGNLSMLSNIY 524

Query: 552 VKTGDMDEAFALFERMPERNV 614
              GD D    +  RM   ++
Sbjct: 525 AAAGDWDNFANIRLRMKSTSI 545

