BLASTX nr result
ID: Catharanthus22_contig00047011
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00047011 (249 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006429754.1| hypothetical protein CICLE_v10011150mg [Citr... 107 2e-21 ref|XP_002271968.2| PREDICTED: pentatricopeptide repeat-containi... 105 6e-21 emb|CBI33393.3| unnamed protein product [Vitis vinifera] 105 6e-21 gb|EMJ18188.1| hypothetical protein PRUPE_ppa002640mg [Prunus pe... 105 8e-21 ref|XP_004964184.1| PREDICTED: pentatricopeptide repeat-containi... 104 1e-20 ref|XP_004305889.1| PREDICTED: pentatricopeptide repeat-containi... 102 4e-20 ref|NP_001141436.1| hypothetical protein [Zea mays] gi|194704572... 101 1e-19 ref|XP_003577575.1| PREDICTED: pentatricopeptide repeat-containi... 100 3e-19 ref|XP_002322556.1| hypothetical protein POPTR_0016s02110g [Popu... 99 6e-19 ref|XP_002442501.1| hypothetical protein SORBIDRAFT_08g020970 [S... 99 8e-19 ref|NP_001066597.1| Os12g0289800 [Oryza sativa Japonica Group] g... 97 2e-18 ref|XP_006664504.1| PREDICTED: pentatricopeptide repeat-containi... 97 2e-18 gb|EMT26142.1| hypothetical protein F775_09158 [Aegilops tauschii] 96 6e-18 gb|EXC35004.1| hypothetical protein L484_017705 [Morus notabilis] 93 3e-17 gb|EOX93361.1| Tetratricopeptide repeat-like superfamily protein... 93 3e-17 gb|EEE53064.1| hypothetical protein OsJ_35805 [Oryza sativa Japo... 93 3e-17 ref|XP_006364895.1| PREDICTED: pentatricopeptide repeat-containi... 92 9e-17 ref|XP_003538894.1| PREDICTED: pentatricopeptide repeat-containi... 91 2e-16 gb|ESW06839.1| hypothetical protein PHAVU_010G081100g [Phaseolus... 90 3e-16 gb|EOY29048.1| Tetratricopeptide repeat-like superfamily protein... 90 4e-16 >ref|XP_006429754.1| hypothetical protein CICLE_v10011150mg [Citrus clementina] gi|568855508|ref|XP_006481346.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Citrus sinensis] gi|557531811|gb|ESR42994.1| hypothetical protein CICLE_v10011150mg [Citrus clementina] Length = 740 Score = 107 bits (267), Expect = 2e-21 Identities = 52/83 (62%), Positives = 60/83 (72%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE+V MKIKP A WG LLGACR+H+N+KL A +KLSE+EP KTS LLSN Sbjct: 610 EAFEMVKGMKIKPNAGIWGTLLGACRMHQNIKLGRIAVEKLSELEPQKTSRYALLSNMHA 669 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRWD VE VR S+ SG +KQ Sbjct: 670 EAGRWDEVEKVRVSMEGSGAQKQ 692 >ref|XP_002271968.2| PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Vitis vinifera] Length = 788 Score = 105 bits (262), Expect = 6e-21 Identities = 53/83 (63%), Positives = 60/83 (72%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAF+LV MKI A WGALLGACR+H NL+LA FAA+KL E EPHKTS+ VLLSN Sbjct: 609 EAFQLVRGMKINANAGIWGALLGACRIHGNLELAKFAAEKLLEFEPHKTSNYVLLSNMQA 668 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRWD V VR + + G EKQ Sbjct: 669 EAGRWDEVARVRRLMKEKGAEKQ 691 >emb|CBI33393.3| unnamed protein product [Vitis vinifera] Length = 752 Score = 105 bits (262), Expect = 6e-21 Identities = 53/83 (63%), Positives = 60/83 (72%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAF+LV MKI A WGALLGACR+H NL+LA FAA+KL E EPHKTS+ VLLSN Sbjct: 371 EAFQLVRGMKINANAGIWGALLGACRIHGNLELAKFAAEKLLEFEPHKTSNYVLLSNMQA 430 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRWD V VR + + G EKQ Sbjct: 431 EAGRWDEVARVRRLMKEKGAEKQ 453 >gb|EMJ18188.1| hypothetical protein PRUPE_ppa002640mg [Prunus persica] Length = 649 Score = 105 bits (261), Expect = 8e-21 Identities = 51/83 (61%), Positives = 62/83 (74%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE+VS MKIK TA WGAL+GA R+H+NLK +A+KKL E+EP K S+ VLLSN Sbjct: 532 EAFEMVSNMKIKATARIWGALIGASRIHRNLKFGKYASKKLLEVEPDKASNYVLLSNMHA 591 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRWD+VE VR + +S EKQ Sbjct: 592 EAGRWDKVEKVRVLMKESSMEKQ 614 >ref|XP_004964184.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Setaria italica] Length = 701 Score = 104 bits (259), Expect = 1e-20 Identities = 51/82 (62%), Positives = 60/82 (73%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFELV M+I+P A WGALLGACRLHKN +LA AA+KL E+EP KTS+ VLLSN Sbjct: 612 EAFELVQGMQIQPNAGVWGALLGACRLHKNDELARLAAEKLFELEPRKTSNYVLLSNISA 671 Query: 69 EAGRWDRVEIVRASVNDSGKEK 4 EAG+WD E RAS+ + G K Sbjct: 672 EAGKWDEAEKTRASIKEKGVHK 693 >ref|XP_004305889.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Fragaria vesca subsp. vesca] Length = 739 Score = 102 bits (255), Expect = 4e-20 Identities = 51/83 (61%), Positives = 60/83 (72%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE+V MKIK TA WGALLGA R+H+NLK +A KKL E+EP KTS+ VLLSN Sbjct: 607 EAFEMVRDMKIKATARVWGALLGASRIHRNLKFGKYATKKLLELEPDKTSNYVLLSNMNA 666 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRWD VE VR + +S +KQ Sbjct: 667 EAGRWDEVERVRVLMKESDTDKQ 689 >ref|NP_001141436.1| hypothetical protein [Zea mays] gi|194704572|gb|ACF86370.1| unknown [Zea mays] gi|414877969|tpg|DAA55100.1| TPA: hypothetical protein ZEAMMB73_905907 [Zea mays] Length = 700 Score = 101 bits (251), Expect = 1e-19 Identities = 48/82 (58%), Positives = 59/82 (71%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFELV M+I+P A WGALLGAC +HKN +LA AA++LSE+EP K S+ VLLSN Sbjct: 611 EAFELVQGMQIQPNAGVWGALLGACHMHKNHELAQLAAERLSELEPRKASNYVLLSNISA 670 Query: 69 EAGRWDRVEIVRASVNDSGKEK 4 EAG+WD E RAS+ + G K Sbjct: 671 EAGKWDESEKARASIKEKGVNK 692 >ref|XP_003577575.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Brachypodium distachyon] Length = 694 Score = 100 bits (248), Expect = 3e-19 Identities = 44/82 (53%), Positives = 60/82 (73%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE++ M+++P A WGA+LGACR+HKN +LA AA+KL E+EPHKTS+ VLLSN Sbjct: 605 EAFEIIQGMQVQPNAGVWGAMLGACRVHKNHELAQLAAEKLYELEPHKTSNYVLLSNITA 664 Query: 69 EAGRWDRVEIVRASVNDSGKEK 4 EAG+WD + +R + + G K Sbjct: 665 EAGKWDEAQNMRVFIKERGVHK 686 >ref|XP_002322556.1| hypothetical protein POPTR_0016s02110g [Populus trichocarpa] gi|222867186|gb|EEF04317.1| hypothetical protein POPTR_0016s02110g [Populus trichocarpa] Length = 702 Score = 99.0 bits (245), Expect = 6e-19 Identities = 50/83 (60%), Positives = 58/83 (69%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE+V MK+K TA WGALLGACR H NL+L AA KLSE EPHKTS+ VLLSN Sbjct: 571 EAFEIVRGMKVKATAGVWGALLGACRAHGNLELGRLAAHKLSEFEPHKTSNYVLLSNIHA 630 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EA RW+ V+ VR +N S K+ Sbjct: 631 EANRWNEVQEVRMLMNASSTVKE 653 >ref|XP_002442501.1| hypothetical protein SORBIDRAFT_08g020970 [Sorghum bicolor] gi|241943194|gb|EES16339.1| hypothetical protein SORBIDRAFT_08g020970 [Sorghum bicolor] Length = 701 Score = 98.6 bits (244), Expect = 8e-19 Identities = 47/82 (57%), Positives = 60/82 (73%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFELV M+I+P A WGALLGAC+++KN +LA AA+KLSE+EP + S+ VLLSN Sbjct: 612 EAFELVQGMQIQPNAGVWGALLGACQMYKNHELARLAAEKLSELEPCRASNYVLLSNISA 671 Query: 69 EAGRWDRVEIVRASVNDSGKEK 4 EAG+WD E RAS+ + G K Sbjct: 672 EAGKWDEAEKARASIKEKGANK 693 >ref|NP_001066597.1| Os12g0289800 [Oryza sativa Japonica Group] gi|77554360|gb|ABA97156.1| pentatricopeptide, putative, expressed [Oryza sativa Japonica Group] gi|113649104|dbj|BAF29616.1| Os12g0289800 [Oryza sativa Japonica Group] Length = 756 Score = 97.4 bits (241), Expect = 2e-18 Identities = 47/78 (60%), Positives = 59/78 (75%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFELV M+I+P A WGALLGACR+HKN ++A AA+KL E+EP K S+ VLLSN CV Sbjct: 605 EAFELVQGMQIQPNAGVWGALLGACRVHKNHEIAWLAAEKLFELEPCKASNYVLLSNICV 664 Query: 69 EAGRWDRVEIVRASVNDS 16 EAG+WD + VR + +S Sbjct: 665 EAGKWDDADKVRVLMKES 682 >ref|XP_006664504.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Oryza brachyantha] Length = 799 Score = 97.1 bits (240), Expect = 2e-18 Identities = 46/77 (59%), Positives = 59/77 (76%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFEL+ M+I+P A WGALLGACR+HKN +LA AA+KL E+EP KTS+ V+LSN CV Sbjct: 607 EAFELIQGMQIQPNAGIWGALLGACRVHKNHELAWLAAEKLFELEPCKTSNYVMLSNICV 666 Query: 69 EAGRWDRVEIVRASVND 19 EAG+WD + VR + + Sbjct: 667 EAGKWDDADKVRVLMKE 683 >gb|EMT26142.1| hypothetical protein F775_09158 [Aegilops tauschii] Length = 700 Score = 95.5 bits (236), Expect = 6e-18 Identities = 45/77 (58%), Positives = 58/77 (75%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAF+LV M+I+P A WGALLGACR+HKN +LA FAA+KL E+EP KTS+ VLLSN Sbjct: 611 EAFKLVQGMQIQPNAGVWGALLGACRVHKNDELARFAAEKLFELEPRKTSNYVLLSNISA 670 Query: 69 EAGRWDRVEIVRASVND 19 E+G+WD E +R + + Sbjct: 671 ESGKWDAAENMRTLIKE 687 >gb|EXC35004.1| hypothetical protein L484_017705 [Morus notabilis] Length = 745 Score = 93.2 bits (230), Expect = 3e-17 Identities = 44/83 (53%), Positives = 59/83 (71%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 E F++VS+M+IK TA WGALLGA R+H+N +L +AA+KL E+EPHK S+ VLLSN Sbjct: 607 EGFKMVSEMRIKATAGIWGALLGAARIHRNFELGKYAAEKLLELEPHKASNYVLLSNIHA 666 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 +AGRW + VR + + EKQ Sbjct: 667 DAGRWSEAQRVRMVMAERRTEKQ 689 >gb|EOX93361.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] Length = 747 Score = 93.2 bits (230), Expect = 3e-17 Identities = 46/83 (55%), Positives = 57/83 (68%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFE+V +KIK A WGALL AC++H NL+L A+K+L E EPHKTS VLLSN Sbjct: 609 EAFEVVRGLKIKANAGIWGALLSACKIHGNLELGKIASKELLEFEPHKTSSSVLLSNMQA 668 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRW VE +R + ++ EKQ Sbjct: 669 EAGRWHEVENMRLMMKENEAEKQ 691 >gb|EEE53064.1| hypothetical protein OsJ_35805 [Oryza sativa Japonica Group] Length = 841 Score = 93.2 bits (230), Expect = 3e-17 Identities = 44/66 (66%), Positives = 53/66 (80%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAFELV M+I+P A WGALLGACR+HKN ++A AA+KL E+EP K S+ VLLSN CV Sbjct: 605 EAFELVQGMQIQPNAGVWGALLGACRVHKNHEIAWLAAEKLFELEPCKASNYVLLSNICV 664 Query: 69 EAGRWD 52 EAG+WD Sbjct: 665 EAGKWD 670 >ref|XP_006364895.1| PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Solanum tuberosum] Length = 731 Score = 91.7 bits (226), Expect = 9e-17 Identities = 41/82 (50%), Positives = 55/82 (67%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EA ++ M+IKP A WG+LLG+CR+HKNL+L ++AAK L E+EP VLLSN Sbjct: 514 EAMTMIESMEIKPDGAIWGSLLGSCRIHKNLELGEYAAKNLFELEPENPGAYVLLSNIYA 573 Query: 69 EAGRWDRVEIVRASVNDSGKEK 4 AG WD+V +R +ND G +K Sbjct: 574 GAGNWDKVASIRTFLNDQGMKK 595 >ref|XP_003538894.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Glycine max] Length = 748 Score = 90.9 bits (224), Expect = 2e-16 Identities = 44/83 (53%), Positives = 55/83 (66%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAF V MK+K A WG+LLGACR+HKNL+L FAA++L E+EPH S+ + LSN Sbjct: 611 EAFNTVRGMKVKANAGLWGSLLGACRVHKNLELGRFAAERLFELEPHNASNYITLSNMHA 670 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRW+ VE VR + KQ Sbjct: 671 EAGRWEEVERVRMLMRGKRAGKQ 693 >gb|ESW06839.1| hypothetical protein PHAVU_010G081100g [Phaseolus vulgaris] Length = 748 Score = 90.1 bits (222), Expect = 3e-16 Identities = 42/83 (50%), Positives = 56/83 (67%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 EAF +V +MK++ A WG+LLGACR+HKNL+L FAA++L E+EP S+ + LSN Sbjct: 611 EAFNIVREMKVQANAGLWGSLLGACRVHKNLELGIFAARRLFELEPDNASNYITLSNMHA 670 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 EAGRW VE +R + D KQ Sbjct: 671 EAGRWKEVERLRMLMRDKSARKQ 693 >gb|EOY29048.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] gi|508781793|gb|EOY29049.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] gi|508781794|gb|EOY29050.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] Length = 675 Score = 89.7 bits (221), Expect = 4e-16 Identities = 40/83 (48%), Positives = 58/83 (69%) Frame = -1 Query: 249 EAFELVSKMKIKPTAATWGALLGACRLHKNLKLADFAAKKLSEIEPHKTSDIVLLSNTCV 70 +A + +M IKPTAA WGALLGACR+HKN++L +AA+++ E++PH + +VLLSN Sbjct: 458 KAERFIREMPIKPTAAVWGALLGACRMHKNMELGTYAAERVFELDPHDSGPLVLLSNIYA 517 Query: 69 EAGRWDRVEIVRASVNDSGKEKQ 1 AGRW VR + +SG +K+ Sbjct: 518 SAGRWSDAAKVRKMMKESGVKKE 540