BLASTX nr result

ID: Dioscorea21_contig00008444 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00008444
         (2667 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276432.2| PREDICTED: pentatricopeptide repeat-containi...   746   0.0  
emb|CBI37948.3| unnamed protein product [Vitis vinifera]              746   0.0  
ref|XP_002534048.1| pentatricopeptide repeat-containing protein,...   714   0.0  
gb|AAL76193.1|AC092173_5 Putative crp1 protein [Oryza sativa Jap...   680   0.0  
gb|EEE50658.1| hypothetical protein OsJ_30888 [Oryza sativa Japo...   680   0.0  

>ref|XP_002276432.2| PREDICTED: pentatricopeptide repeat-containing protein At4g34830,
            chloroplastic-like [Vitis vinifera]
          Length = 1115

 Score =  746 bits (1927), Expect = 0.0
 Identities = 377/591 (63%), Positives = 458/591 (77%), Gaps = 2/591 (0%)
 Frame = +2

Query: 899  SKFTLQDGN-VSHSQLRSSQKRADVLVKASISSADYSEAPVPVACTKEVSMNKEKHMTGT 1075
            S  +L DGN VS     ++ K A++  + S SSADY E  + ++C KE S  K   +   
Sbjct: 329  SNASLLDGNGVSFQMRNATSKEAELSAQNSHSSADYVEGKMSLSCYKEGSSGKRNDLVKG 388

Query: 1076 RGFTKDSGKRLTDKS-HNKKPGFPHSNGSLVKDALDLPAYLRAYTSLLRESRLRDCMDLL 1252
            +GF +D   RL   S H     FP SNG  VK+         AY  LL E RL DC+ LL
Sbjct: 389  KGFPRDKNGRLPPLSDHRNLSQFPLSNGMTVKEKYHDSEKFSAYNRLLSEGRLSDCIQLL 448

Query: 1253 ESVDRKSLLDMDKINPVKFLNVCKKQKALKEAFRFVKLIEKPTLSTFNMLLSVCASSQDF 1432
            E +++  LLDMDK+   KF  +C+ QKA+ EAFRF KLI  PTLSTFNML+SVCA+SQD 
Sbjct: 449  EDMEKMGLLDMDKVYHAKFFKICRSQKAVTEAFRFAKLIPTPTLSTFNMLMSVCATSQDS 508

Query: 1433 EGAFQVMLLVKEAGLKPDCKLYTTLISTCAKCGKVDAMFEVFHEMVNAGVEPNVNTYGAL 1612
             GAFQV+ LV+EAGLK DCKLYTTLISTCAK GKVDAMFEVFHEMVNA VEPNV+TYGAL
Sbjct: 509  AGAFQVLQLVREAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAEVEPNVHTYGAL 568

Query: 1613 IDGCARAGQVPKAFGAYGIMRSKRVQPDRVVFNALITACGQSGAVDRAFDVLAEMRAEPK 1792
            IDGC RAGQV KAFGAYGIMRSK+V+PDRVVFNALITACGQSGAVDRAFDVLAEMRAE +
Sbjct: 569  IDGCGRAGQVAKAFGAYGIMRSKKVEPDRVVFNALITACGQSGAVDRAFDVLAEMRAETQ 628

Query: 1793 PIDPDHVTVGALMRTCIQAGQVERAHEVYKMLHEYNIKGTPVVYTIAVSSCSQTGDLDFA 1972
            PIDPDH+TVGAL++ C  AGQV+RA EVYKM+ +YNIKGTP VYTIAVSS SQ GD +FA
Sbjct: 629  PIDPDHITVGALIKACTNAGQVDRAREVYKMIDQYNIKGTPEVYTIAVSSHSQIGDWEFA 688

Query: 1973 LSVYDDMKKKGVMPDEMFFSTLIDVAGHAGKVEAAFEILQDAKSKGVRIGIMSYSSLMGA 2152
             SVY DM +KGV+PDEMF S LIDVAGHAGK++AAFE++Q+A+ +G+ +GI+SYSSLMGA
Sbjct: 689  YSVYTDMTRKGVVPDEMFLSALIDVAGHAGKLDAAFEVIQEARIQGIPLGIVSYSSLMGA 748

Query: 2153 CCNGKNWQKALELYEDIKAVQLLPTVSTLNALLTSLCDGGQLLKSIEVLDELRDAGVQPN 2332
            C N KNWQKALELY DIK+++L PTVST+NAL+T+LC+G QL K++EVL +++ AG+ PN
Sbjct: 749  CSNAKNWQKALELYVDIKSMKLNPTVSTMNALITALCEGEQLEKAMEVLSDMKRAGLCPN 808

Query: 2333 EITYSILIVACEKKDEAELGFMLLSKAKEDRILPNXXXXXXXXXXXXRSFKKAYSIGEPV 2512
             ITYSIL+VA EKKD+ ++G M+LS+A++D + PN            R F+KA ++GEPV
Sbjct: 809  TITYSILLVASEKKDDIDVGLMILSQARKDSVAPNLVMCRCLVGMCLRRFEKACALGEPV 868

Query: 2513 ISFDGGRPHIDNKWTSRAIMAYRETIAAGVIPTIEVFSQVLGCLQFPRDTA 2665
            +SF+ GRP IDNKWTS A+M YRET++AGVIPT+E+ SQVLGCLQFPRD +
Sbjct: 869  LSFNSGRPQIDNKWTSSALMVYRETVSAGVIPTMELLSQVLGCLQFPRDVS 919


>emb|CBI37948.3| unnamed protein product [Vitis vinifera]
          Length = 1550

 Score =  746 bits (1927), Expect = 0.0
 Identities = 377/591 (63%), Positives = 458/591 (77%), Gaps = 2/591 (0%)
 Frame = +2

Query: 899  SKFTLQDGN-VSHSQLRSSQKRADVLVKASISSADYSEAPVPVACTKEVSMNKEKHMTGT 1075
            S  +L DGN VS     ++ K A++  + S SSADY E  + ++C KE S  K   +   
Sbjct: 764  SNASLLDGNGVSFQMRNATSKEAELSAQNSHSSADYVEGKMSLSCYKEGSSGKRNDLVKG 823

Query: 1076 RGFTKDSGKRLTDKS-HNKKPGFPHSNGSLVKDALDLPAYLRAYTSLLRESRLRDCMDLL 1252
            +GF +D   RL   S H     FP SNG  VK+         AY  LL E RL DC+ LL
Sbjct: 824  KGFPRDKNGRLPPLSDHRNLSQFPLSNGMTVKEKYHDSEKFSAYNRLLSEGRLSDCIQLL 883

Query: 1253 ESVDRKSLLDMDKINPVKFLNVCKKQKALKEAFRFVKLIEKPTLSTFNMLLSVCASSQDF 1432
            E +++  LLDMDK+   KF  +C+ QKA+ EAFRF KLI  PTLSTFNML+SVCA+SQD 
Sbjct: 884  EDMEKMGLLDMDKVYHAKFFKICRSQKAVTEAFRFAKLIPTPTLSTFNMLMSVCATSQDS 943

Query: 1433 EGAFQVMLLVKEAGLKPDCKLYTTLISTCAKCGKVDAMFEVFHEMVNAGVEPNVNTYGAL 1612
             GAFQV+ LV+EAGLK DCKLYTTLISTCAK GKVDAMFEVFHEMVNA VEPNV+TYGAL
Sbjct: 944  AGAFQVLQLVREAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAEVEPNVHTYGAL 1003

Query: 1613 IDGCARAGQVPKAFGAYGIMRSKRVQPDRVVFNALITACGQSGAVDRAFDVLAEMRAEPK 1792
            IDGC RAGQV KAFGAYGIMRSK+V+PDRVVFNALITACGQSGAVDRAFDVLAEMRAE +
Sbjct: 1004 IDGCGRAGQVAKAFGAYGIMRSKKVEPDRVVFNALITACGQSGAVDRAFDVLAEMRAETQ 1063

Query: 1793 PIDPDHVTVGALMRTCIQAGQVERAHEVYKMLHEYNIKGTPVVYTIAVSSCSQTGDLDFA 1972
            PIDPDH+TVGAL++ C  AGQV+RA EVYKM+ +YNIKGTP VYTIAVSS SQ GD +FA
Sbjct: 1064 PIDPDHITVGALIKACTNAGQVDRAREVYKMIDQYNIKGTPEVYTIAVSSHSQIGDWEFA 1123

Query: 1973 LSVYDDMKKKGVMPDEMFFSTLIDVAGHAGKVEAAFEILQDAKSKGVRIGIMSYSSLMGA 2152
             SVY DM +KGV+PDEMF S LIDVAGHAGK++AAFE++Q+A+ +G+ +GI+SYSSLMGA
Sbjct: 1124 YSVYTDMTRKGVVPDEMFLSALIDVAGHAGKLDAAFEVIQEARIQGIPLGIVSYSSLMGA 1183

Query: 2153 CCNGKNWQKALELYEDIKAVQLLPTVSTLNALLTSLCDGGQLLKSIEVLDELRDAGVQPN 2332
            C N KNWQKALELY DIK+++L PTVST+NAL+T+LC+G QL K++EVL +++ AG+ PN
Sbjct: 1184 CSNAKNWQKALELYVDIKSMKLNPTVSTMNALITALCEGEQLEKAMEVLSDMKRAGLCPN 1243

Query: 2333 EITYSILIVACEKKDEAELGFMLLSKAKEDRILPNXXXXXXXXXXXXRSFKKAYSIGEPV 2512
             ITYSIL+VA EKKD+ ++G M+LS+A++D + PN            R F+KA ++GEPV
Sbjct: 1244 TITYSILLVASEKKDDIDVGLMILSQARKDSVAPNLVMCRCLVGMCLRRFEKACALGEPV 1303

Query: 2513 ISFDGGRPHIDNKWTSRAIMAYRETIAAGVIPTIEVFSQVLGCLQFPRDTA 2665
            +SF+ GRP IDNKWTS A+M YRET++AGVIPT+E+ SQVLGCLQFPRD +
Sbjct: 1304 LSFNSGRPQIDNKWTSSALMVYRETVSAGVIPTMELLSQVLGCLQFPRDVS 1354


>ref|XP_002534048.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223525928|gb|EEF28334.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1129

 Score =  714 bits (1842), Expect = 0.0
 Identities = 360/582 (61%), Positives = 440/582 (75%), Gaps = 1/582 (0%)
 Frame = +2

Query: 923  NVSHSQLRSSQKRADVLVKASISSADYSEAPVPVACTKEVSMNKEKHMTGTRGFTKDSGK 1102
            N+S  ++    K A++L   S   A+  E  V +A  +  +  K +H+ G RGF ++  K
Sbjct: 352  NISSLKVNGVGKEAELLSPQSPQFAETVERKVHLARYERGASRKNEHIGGRRGFPREKEK 411

Query: 1103 -RLTDKSHNKKPGFPHSNGSLVKDALDLPAYLRAYTSLLRESRLRDCMDLLESVDRKSLL 1279
              +    H   P FP+ NG    +       +  Y  LLR+ RL +C+DLLE ++R+ LL
Sbjct: 412  GHVIQDEHTNLPEFPYPNGVHSTNKDHKAEQVHGYNRLLRDGRLAECVDLLEDMERRGLL 471

Query: 1280 DMDKINPVKFLNVCKKQKALKEAFRFVKLIEKPTLSTFNMLLSVCASSQDFEGAFQVMLL 1459
            DM KI   KF  +CK QKA+KEAFRF KL+  P+LSTFNML+SVC+SSQD +GAF+V+ L
Sbjct: 472  DMSKIYHAKFFKICKIQKAVKEAFRFCKLVPNPSLSTFNMLMSVCSSSQDSDGAFEVLRL 531

Query: 1460 VKEAGLKPDCKLYTTLISTCAKCGKVDAMFEVFHEMVNAGVEPNVNTYGALIDGCARAGQ 1639
             + AGLK DCKLYTTLISTCAK GKVDAMFEVFHEMVNAGVEPNV+TYG+LIDGCA+AGQ
Sbjct: 532  AQGAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEPNVHTYGSLIDGCAKAGQ 591

Query: 1640 VPKAFGAYGIMRSKRVQPDRVVFNALITACGQSGAVDRAFDVLAEMRAEPKPIDPDHVTV 1819
            + KAFGAYGI+RSK V+PDRVVFNALITACGQSGAVDRAFDVLAEM AE  PIDPDH+TV
Sbjct: 592  MAKAFGAYGILRSKNVKPDRVVFNALITACGQSGAVDRAFDVLAEMGAETHPIDPDHITV 651

Query: 1820 GALMRTCIQAGQVERAHEVYKMLHEYNIKGTPVVYTIAVSSCSQTGDLDFALSVYDDMKK 1999
            GALM+ C +AGQV+RA EVY MLH+YNIKGTP VYTIAV+ CSQTGD +FA SVYDDM +
Sbjct: 652  GALMKACAKAGQVDRAKEVYNMLHKYNIKGTPEVYTIAVNFCSQTGDWEFARSVYDDMTR 711

Query: 2000 KGVMPDEMFFSTLIDVAGHAGKVEAAFEILQDAKSKGVRIGIMSYSSLMGACCNGKNWQK 2179
            KGV PDEMF S L+DVAGHAG V+ AFE LQ+A+++G ++GI+ YSSLMGAC N KNWQK
Sbjct: 712  KGVAPDEMFLSALVDVAGHAGLVDIAFETLQEARTQGTQLGIVPYSSLMGACSNAKNWQK 771

Query: 2180 ALELYEDIKAVQLLPTVSTLNALLTSLCDGGQLLKSIEVLDELRDAGVQPNEITYSILIV 2359
            ALELYEDIKA++L PTVST+NAL+T+LCDG QL K++E L E++  G+ PN +TYSIL+V
Sbjct: 772  ALELYEDIKAIKLKPTVSTMNALMTALCDGDQLQKALETLSEMKSFGLCPNIVTYSILLV 831

Query: 2360 ACEKKDEAELGFMLLSKAKEDRILPNXXXXXXXXXXXXRSFKKAYSIGEPVISFDGGRPH 2539
            A E+KD+ + G MLLS+AKED I P             R +KKA S+GE ++SFD GRP 
Sbjct: 832  ASERKDDLDAGDMLLSQAKEDCITPTFLMYKCIIGMCLRRYKKACSLGESILSFDSGRPQ 891

Query: 2540 IDNKWTSRAIMAYRETIAAGVIPTIEVFSQVLGCLQFPRDTA 2665
            I N+WTSRA+  YRETIAAG  PT+EV SQVLGCLQ P D +
Sbjct: 892  IKNEWTSRALTVYRETIAAGEKPTMEVVSQVLGCLQLPCDAS 933


>gb|AAL76193.1|AC092173_5 Putative crp1 protein [Oryza sativa Japonica Group]
          Length = 1089

 Score =  680 bits (1755), Expect = 0.0
 Identities = 341/555 (61%), Positives = 426/555 (76%), Gaps = 7/555 (1%)
 Frame = +2

Query: 1016 VPVACTKEVSMNKEKHMTGTRGFTKDSGKRLTDKSHNKKP-GFPHSNGSLVKDALDLPAY 1192
            VPVAC ++  ++K+K         K  G  +++    + P     +N + ++   D+P Y
Sbjct: 338  VPVACLRDGPVSKQKKAMNDHDDAKLIGWSISNLLSKENPENSTSANRAGLRGTKDVPDY 397

Query: 1193 LRAYTSLLRESRLRDCMDLLESVDRKSLLDMDKINPVKFLNVCKKQKALKEAFRFVKLIE 1372
            LR Y SLL + RL+D +DLLES+++K LLDM+KI+   FLN CKKQ+A+ EA RF KLI 
Sbjct: 398  LRRYNSLLIDGRLKDSVDLLESMEQKGLLDMNKIHHASFLNACKKQRAVPEAVRFCKLIN 457

Query: 1373 KPTLSTFNMLLSVCASSQDFEGAFQVMLLVKEAGLKPDCKLYTTLISTCAKCGKVDAMFE 1552
             P +STFNMLLSVCA+SQDF+GA QVM+L+KEAGLKPDCKLYTTLISTCAKCGKVDAMFE
Sbjct: 458  NPKMSTFNMLLSVCANSQDFDGALQVMVLLKEAGLKPDCKLYTTLISTCAKCGKVDAMFE 517

Query: 1553 VFHEMVNAGVEPNVNTYGALIDGCARAGQVPKAFGAYGIMRSKRVQPDRVVFNALITACG 1732
            VFHEMV+AG+EPNVNTY ALIDGCA+AGQV KAFGAYGIM SK+V+PDRVVFNALI+ACG
Sbjct: 518  VFHEMVSAGIEPNVNTYSALIDGCAKAGQVAKAFGAYGIMSSKKVKPDRVVFNALISACG 577

Query: 1733 QSGAVDRAFDVLAEMRAEP------KPIDPDHVTVGALMRTCIQAGQVERAHEVYKMLHE 1894
            +SGAV RAFDVL+EM AE       KPI PDHVTVGALM+TCIQAGQ +RA EVYKML E
Sbjct: 578  ESGAVARAFDVLSEMTAEASESKGSKPILPDHVTVGALMKTCIQAGQADRAREVYKMLQE 637

Query: 1895 YNIKGTPVVYTIAVSSCSQTGDLDFALSVYDDMKKKGVMPDEMFFSTLIDVAGHAGKVEA 2074
            YNIKGTP VYTIA+ SCS TGDL FAL +Y+DM K GV PDEMF S L+DVAGHA + +A
Sbjct: 638  YNIKGTPEVYTIALRSCSLTGDLGFALKIYEDMNKIGVQPDEMFLSALVDVAGHARRADA 697

Query: 2075 AFEILQDAKSKGVRIGIMSYSSLMGACCNGKNWQKALELYEDIKAVQLLPTVSTLNALLT 2254
            AFEI++DA++KG ++G ++YSSLMGACCN K+W+KAL+L+E+IK+++L+PTVS +NAL+T
Sbjct: 698  AFEIMKDARAKGYQVGTIAYSSLMGACCNAKDWKKALQLFEEIKSIKLMPTVSMMNALIT 757

Query: 2255 SLCDGGQLLKSIEVLDELRDAGVQPNEITYSILIVACEKKDEAELGFMLLSKAKEDRILP 2434
            +LCDG Q+LKS EVL E++  GV PN ITYS+L VACE+  EA+LG  L  + K D I  
Sbjct: 758  ALCDGDQVLKSFEVLSEMKRLGVCPNMITYSVLFVACERNAEAQLGLDLFEQLKIDSIDL 817

Query: 2435 NXXXXXXXXXXXXRSFKKAYSIGEPVISFDGGRPHIDNKWTSRAIMAYRETIAAGVIPTI 2614
            N            + F    S+G  V++F+ G+P I+NKWTS AI  YRE I+ G++P+ 
Sbjct: 818  NPTIVGCLTGLCLQMFDNDLSLGNIVVTFNLGKPQIENKWTSSAIKVYREAISTGLLPSS 877

Query: 2615 EVFSQVLGCLQFPRD 2659
            +V SQVLGCL+FP D
Sbjct: 878  DVLSQVLGCLRFPHD 892


>gb|EEE50658.1| hypothetical protein OsJ_30888 [Oryza sativa Japonica Group]
          Length = 869

 Score =  680 bits (1755), Expect = 0.0
 Identities = 341/555 (61%), Positives = 426/555 (76%), Gaps = 7/555 (1%)
 Frame = +2

Query: 1016 VPVACTKEVSMNKEKHMTGTRGFTKDSGKRLTDKSHNKKP-GFPHSNGSLVKDALDLPAY 1192
            VPVAC ++  ++K+K         K  G  +++    + P     +N + ++   D+P Y
Sbjct: 118  VPVACLRDGPVSKQKKAMNDHDDAKLIGWSISNLLSKENPENSTSANRAGLRGTKDVPDY 177

Query: 1193 LRAYTSLLRESRLRDCMDLLESVDRKSLLDMDKINPVKFLNVCKKQKALKEAFRFVKLIE 1372
            LR Y SLL + RL+D +DLLES+++K LLDM+KI+   FLN CKKQ+A+ EA RF KLI 
Sbjct: 178  LRRYNSLLIDGRLKDSVDLLESMEQKGLLDMNKIHHASFLNACKKQRAVPEAVRFCKLIN 237

Query: 1373 KPTLSTFNMLLSVCASSQDFEGAFQVMLLVKEAGLKPDCKLYTTLISTCAKCGKVDAMFE 1552
             P +STFNMLLSVCA+SQDF+GA QVM+L+KEAGLKPDCKLYTTLISTCAKCGKVDAMFE
Sbjct: 238  NPKMSTFNMLLSVCANSQDFDGALQVMVLLKEAGLKPDCKLYTTLISTCAKCGKVDAMFE 297

Query: 1553 VFHEMVNAGVEPNVNTYGALIDGCARAGQVPKAFGAYGIMRSKRVQPDRVVFNALITACG 1732
            VFHEMV+AG+EPNVNTY ALIDGCA+AGQV KAFGAYGIM SK+V+PDRVVFNALI+ACG
Sbjct: 298  VFHEMVSAGIEPNVNTYSALIDGCAKAGQVAKAFGAYGIMSSKKVKPDRVVFNALISACG 357

Query: 1733 QSGAVDRAFDVLAEMRAEP------KPIDPDHVTVGALMRTCIQAGQVERAHEVYKMLHE 1894
            +SGAV RAFDVL+EM AE       KPI PDHVTVGALM+TCIQAGQ +RA EVYKML E
Sbjct: 358  ESGAVARAFDVLSEMTAEASESKGSKPILPDHVTVGALMKTCIQAGQADRAREVYKMLQE 417

Query: 1895 YNIKGTPVVYTIAVSSCSQTGDLDFALSVYDDMKKKGVMPDEMFFSTLIDVAGHAGKVEA 2074
            YNIKGTP VYTIA+ SCS TGDL FAL +Y+DM K GV PDEMF S L+DVAGHA + +A
Sbjct: 418  YNIKGTPEVYTIALRSCSLTGDLGFALKIYEDMNKIGVQPDEMFLSALVDVAGHARRADA 477

Query: 2075 AFEILQDAKSKGVRIGIMSYSSLMGACCNGKNWQKALELYEDIKAVQLLPTVSTLNALLT 2254
            AFEI++DA++KG ++G ++YSSLMGACCN K+W+KAL+L+E+IK+++L+PTVS +NAL+T
Sbjct: 478  AFEIMKDARAKGYQVGTIAYSSLMGACCNAKDWKKALQLFEEIKSIKLMPTVSMMNALIT 537

Query: 2255 SLCDGGQLLKSIEVLDELRDAGVQPNEITYSILIVACEKKDEAELGFMLLSKAKEDRILP 2434
            +LCDG Q+LKS EVL E++  GV PN ITYS+L VACE+  EA+LG  L  + K D I  
Sbjct: 538  ALCDGDQVLKSFEVLSEMKRLGVCPNMITYSVLFVACERNAEAQLGLDLFEQLKIDSIDL 597

Query: 2435 NXXXXXXXXXXXXRSFKKAYSIGEPVISFDGGRPHIDNKWTSRAIMAYRETIAAGVIPTI 2614
            N            + F    S+G  V++F+ G+P I+NKWTS AI  YRE I+ G++P+ 
Sbjct: 598  NPTIVGCLTGLCLQMFDNDLSLGNIVVTFNLGKPQIENKWTSSAIKVYREAISTGLLPSS 657

Query: 2615 EVFSQVLGCLQFPRD 2659
            +V SQVLGCL+FP D
Sbjct: 658  DVLSQVLGCLRFPHD 672

