BLASTX nr result
ID: Catharanthus22_contig00007824
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00007824 (612 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006424508.1| hypothetical protein CICLE_v10029423mg [Citr... 121 1e-25 ref|XP_006590282.1| PREDICTED: uncharacterized protein LOC100779... 100 2e-25 ref|XP_006590285.1| PREDICTED: uncharacterized protein LOC100779... 100 2e-25 ref|XP_002273853.2| PREDICTED: uncharacterized protein LOC100258... 120 4e-25 emb|CBI17170.3| unnamed protein product [Vitis vinifera] 120 4e-25 gb|EOY34451.1| Uncharacterized protein TCM_042130 [Theobroma cacao] 117 3e-24 ref|XP_006360442.1| PREDICTED: uncharacterized protein LOC102589... 115 1e-23 ref|XP_006590284.1| PREDICTED: uncharacterized protein LOC100779... 114 2e-23 ref|XP_004237009.1| PREDICTED: uncharacterized protein LOC101262... 114 2e-23 ref|NP_001241314.1| uncharacterized protein LOC100779715 [Glycin... 114 2e-23 ref|XP_004294004.1| PREDICTED: uncharacterized protein LOC101314... 113 5e-23 ref|XP_003539782.1| PREDICTED: uncharacterized protein LOC100778... 112 8e-23 ref|XP_006419075.1| hypothetical protein EUTSA_v10002688mg [Eutr... 111 2e-22 ref|XP_002314295.1| hypothetical protein POPTR_0009s01400g [Popu... 111 2e-22 ref|XP_004506050.1| PREDICTED: uncharacterized protein LOC101504... 110 3e-22 ref|XP_002523690.1| conserved hypothetical protein [Ricinus comm... 109 5e-22 gb|EMJ07771.1| hypothetical protein PRUPE_ppa020143mg, partial [... 108 9e-22 ref|XP_004147283.1| PREDICTED: uncharacterized protein LOC101218... 108 2e-21 ref|XP_006590283.1| PREDICTED: uncharacterized protein LOC100779... 98 1e-20 ref|XP_002875709.1| hypothetical protein ARALYDRAFT_484900 [Arab... 102 1e-19 >ref|XP_006424508.1| hypothetical protein CICLE_v10029423mg [Citrus clementina] gi|568869665|ref|XP_006488040.1| PREDICTED: uncharacterized protein LOC102612708 [Citrus sinensis] gi|557526442|gb|ESR37748.1| hypothetical protein CICLE_v10029423mg [Citrus clementina] Length = 164 Score = 121 bits (304), Expect = 1e-25 Identities = 61/112 (54%), Positives = 78/112 (69%) Frame = -1 Query: 438 SYSPFLLTSKQPASKMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKD 259 ++S + S++ K+ +SS+LPE TLLV+RT+ S++ I K Sbjct: 32 AFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKT 91 Query: 258 CDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 C+DCGGSGIC ECKGEGFVL+KLS+E AE+AR+TAKN ATRYTAGLPKKWSY Sbjct: 92 CEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSY 143 >ref|XP_006590282.1| PREDICTED: uncharacterized protein LOC100779715 isoform X1 [Glycine max] Length = 186 Score = 100 bits (248), Expect(2) = 2e-25 Identities = 53/98 (54%), Positives = 67/98 (68%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 ++ +SS+L E TLLVKR+++S+ I K C+DCGGSGICSECK Sbjct: 56 RLSTVSSALAETAASMAVAVTVVGAAATLLVKRSKTSESSQIQFKACEDCGGSGICSECK 115 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 GEGFVLRK SDE+AEKAR+ AKN ATR+TAGL + +Y Sbjct: 116 GEGFVLRKRSDESAEKARVQAKNMATRFTAGLAIQRNY 153 Score = 42.4 bits (98), Expect(2) = 2e-25 Identities = 16/27 (59%), Positives = 21/27 (77%) Frame = -3 Query: 127 RASQKMELLHKMLFWPFL*ILWWNWQA 47 + S++MELLHK+LFW L LWW W+A Sbjct: 156 QTSKEMELLHKVLFWSILFNLWWQWKA 182 >ref|XP_006590285.1| PREDICTED: uncharacterized protein LOC100779715 isoform X4 [Glycine max] Length = 160 Score = 100 bits (248), Expect(2) = 2e-25 Identities = 53/98 (54%), Positives = 67/98 (68%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 ++ +SS+L E TLLVKR+++S+ I K C+DCGGSGICSECK Sbjct: 30 RLSTVSSALAETAASMAVAVTVVGAAATLLVKRSKTSESSQIQFKACEDCGGSGICSECK 89 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 GEGFVLRK SDE+AEKAR+ AKN ATR+TAGL + +Y Sbjct: 90 GEGFVLRKRSDESAEKARVQAKNMATRFTAGLAIQRNY 127 Score = 42.4 bits (98), Expect(2) = 2e-25 Identities = 16/27 (59%), Positives = 21/27 (77%) Frame = -3 Query: 127 RASQKMELLHKMLFWPFL*ILWWNWQA 47 + S++MELLHK+LFW L LWW W+A Sbjct: 130 QTSKEMELLHKVLFWSILFNLWWQWKA 156 >ref|XP_002273853.2| PREDICTED: uncharacterized protein LOC100258439 [Vitis vinifera] Length = 176 Score = 120 bits (300), Expect = 4e-25 Identities = 64/130 (49%), Positives = 81/130 (62%) Frame = -1 Query: 492 LSTMRQFRHKLDNPFRENSYSPFLLTSKQPASKMFKISSSLPEXXXXXXXXXXXXXXXXT 313 L T R F+ + N F + S + + K+ ++S +LPE T Sbjct: 26 LKTSRDFKPHIHNAFDPDPISSTDVKLQAQRRKICRVSYALPETAASVAIAATVVGAAAT 85 Query: 312 LLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATRY 133 LLV+R+ S+ I K C+DCGGSGICSEC GEGFVL+KLS+ +AEKAR+TAKN ATRY Sbjct: 86 LLVRRSRPSEATEIPLKICEDCGGSGICSECNGEGFVLKKLSEASAEKARLTAKNMATRY 145 Query: 132 TAGLPKKWSY 103 TAGLPKKWSY Sbjct: 146 TAGLPKKWSY 155 >emb|CBI17170.3| unnamed protein product [Vitis vinifera] Length = 927 Score = 120 bits (300), Expect = 4e-25 Identities = 64/130 (49%), Positives = 81/130 (62%) Frame = -1 Query: 492 LSTMRQFRHKLDNPFRENSYSPFLLTSKQPASKMFKISSSLPEXXXXXXXXXXXXXXXXT 313 L T R F+ + N F + S + + K+ ++S +LPE T Sbjct: 777 LKTSRDFKPHIHNAFDPDPISSTDVKLQAQRRKICRVSYALPETAASVAIAATVVGAAAT 836 Query: 312 LLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATRY 133 LLV+R+ S+ I K C+DCGGSGICSEC GEGFVL+KLS+ +AEKAR+TAKN ATRY Sbjct: 837 LLVRRSRPSEATEIPLKICEDCGGSGICSECNGEGFVLKKLSEASAEKARLTAKNMATRY 896 Query: 132 TAGLPKKWSY 103 TAGLPKKWSY Sbjct: 897 TAGLPKKWSY 906 >gb|EOY34451.1| Uncharacterized protein TCM_042130 [Theobroma cacao] Length = 164 Score = 117 bits (293), Expect = 3e-24 Identities = 59/104 (56%), Positives = 73/104 (70%) Frame = -1 Query: 414 SKQPASKMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSG 235 SK+ K+ +SS+LPE T LV+RT+SS + + C+DC GSG Sbjct: 40 SKEHRRKLSILSSALPETAASVAIAATVVGAAATFLVRRTKSSDATEVPLRPCEDCEGSG 99 Query: 234 ICSECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 ICSECKGEGFVL+K+S+++AEKARMTAKN ATRYTAGLPKKWSY Sbjct: 100 ICSECKGEGFVLKKMSEDSAEKARMTAKNMATRYTAGLPKKWSY 143 >ref|XP_006360442.1| PREDICTED: uncharacterized protein LOC102589280 [Solanum tuberosum] Length = 166 Score = 115 bits (287), Expect = 1e-23 Identities = 64/146 (43%), Positives = 91/146 (62%), Gaps = 2/146 (1%) Frame = -1 Query: 534 MGLRLPTTTLQIPNLSTMRQFRHKLD-NPFRENSYSPFLLTSKQPASK-MFKISSSLPEX 361 MGL L ++ I + T+ + D N + Y SK+PA++ + K+SS++ E Sbjct: 1 MGLSLHMLSVHISSSITITETLKSRDCNALCTSIYPCGSTKSKEPAARRVLKVSSAVLET 60 Query: 360 XXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGFVLRKLSDE 181 T+LV+R ++S+ + C+DCGGSG+C+EC+GEGFVL+K+SDE Sbjct: 61 AASIAVAATVVGAAATILVRRNKASEPIEAPLRICEDCGGSGVCAECRGEGFVLKKMSDE 120 Query: 180 NAEKARMTAKNAATRYTAGLPKKWSY 103 NAE+AR+ AKNAATRYTAGLPKKW Y Sbjct: 121 NAERARLMAKNAATRYTAGLPKKWGY 146 >ref|XP_006590284.1| PREDICTED: uncharacterized protein LOC100779715 isoform X3 [Glycine max] Length = 174 Score = 114 bits (285), Expect = 2e-23 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 ++ +SS+L E TLLVKR+++S+ I K C+DCGGSGICSECK Sbjct: 56 RLSTVSSALAETAASMAVAVTVVGAAATLLVKRSKTSESSQIQFKACEDCGGSGICSECK 115 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 GEGFVLRK SDE+AEKAR+ AKN ATR+TAGLPKKWSY Sbjct: 116 GEGFVLRKRSDESAEKARVQAKNMATRFTAGLPKKWSY 153 >ref|XP_004237009.1| PREDICTED: uncharacterized protein LOC101262408 [Solanum lycopersicum] Length = 150 Score = 114 bits (285), Expect = 2e-23 Identities = 55/102 (53%), Positives = 72/102 (70%) Frame = -1 Query: 408 QPASKMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGIC 229 +PA ++ K+SS+L E T+LVKR +SS+ + C+DC GSG+C Sbjct: 29 KPARRVLKVSSALVETAASIAVAATLVGAATTILVKRNKSSEAIEAPVRICEDCDGSGVC 88 Query: 228 SECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 +EC+GEGFVL+K+SDENAE+AR+ AKNAATRYTAGLPKKW Y Sbjct: 89 AECRGEGFVLKKMSDENAERARLMAKNAATRYTAGLPKKWGY 130 >ref|NP_001241314.1| uncharacterized protein LOC100779715 [Glycine max] gi|255640844|gb|ACU20705.1| unknown [Glycine max] Length = 148 Score = 114 bits (285), Expect = 2e-23 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 ++ +SS+L E TLLVKR+++S+ I K C+DCGGSGICSECK Sbjct: 30 RLSTVSSALAETAASMAVAVTVVGAAATLLVKRSKTSESSQIQFKACEDCGGSGICSECK 89 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 GEGFVLRK SDE+AEKAR+ AKN ATR+TAGLPKKWSY Sbjct: 90 GEGFVLRKRSDESAEKARVQAKNMATRFTAGLPKKWSY 127 >ref|XP_004294004.1| PREDICTED: uncharacterized protein LOC101314352 [Fragaria vesca subsp. vesca] Length = 146 Score = 113 bits (282), Expect = 5e-23 Identities = 53/94 (56%), Positives = 69/94 (73%) Frame = -1 Query: 384 ISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGF 205 +SS+LPE T+LV+R ++++ + KDC+ CGGSGIC ECKGEGF Sbjct: 31 VSSALPETVASVAIAATVVGAAATILVRRNKAAEAAEVPMKDCESCGGSGICPECKGEGF 90 Query: 204 VLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 VL++LSDE+AE+AR+ +KNAATRYTAGLPKKWSY Sbjct: 91 VLKRLSDESAERARLASKNAATRYTAGLPKKWSY 124 >ref|XP_003539782.1| PREDICTED: uncharacterized protein LOC100778669 [Glycine max] Length = 148 Score = 112 bits (280), Expect = 8e-23 Identities = 53/70 (75%), Positives = 61/70 (87%) Frame = -1 Query: 312 LLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATRY 133 LLVKR+++S+ I K C+DCGGSGICSECKGEGFVLRK SDE+AEKAR+ AKN ATR+ Sbjct: 58 LLVKRSKTSESTQIQFKVCEDCGGSGICSECKGEGFVLRKRSDESAEKARVQAKNMATRF 117 Query: 132 TAGLPKKWSY 103 TAGLPKKWSY Sbjct: 118 TAGLPKKWSY 127 >ref|XP_006419075.1| hypothetical protein EUTSA_v10002688mg [Eutrema salsugineum] gi|557097003|gb|ESQ37511.1| hypothetical protein EUTSA_v10002688mg [Eutrema salsugineum] Length = 169 Score = 111 bits (277), Expect = 2e-22 Identities = 53/71 (74%), Positives = 63/71 (88%), Gaps = 1/71 (1%) Frame = -1 Query: 312 LLVKR-TESSKKPLILTKDCDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATR 136 LLV+R T+++++ K+C+ CGGSGICSECKGEGFVL+KLSDENAEKAR+TAKN ATR Sbjct: 78 LLVRRNTKANEEAEASMKECEVCGGSGICSECKGEGFVLKKLSDENAEKARLTAKNMATR 137 Query: 135 YTAGLPKKWSY 103 YTAGLPKKWSY Sbjct: 138 YTAGLPKKWSY 148 >ref|XP_002314295.1| hypothetical protein POPTR_0009s01400g [Populus trichocarpa] gi|222850703|gb|EEE88250.1| hypothetical protein POPTR_0009s01400g [Populus trichocarpa] Length = 163 Score = 111 bits (277), Expect = 2e-22 Identities = 56/103 (54%), Positives = 69/103 (66%) Frame = -1 Query: 411 KQPASKMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGI 232 ++ K + SLPE TLLV+R + S+ I K C+DCGGSGI Sbjct: 40 QEQGRKTSTVCYSLPETAASVAIAATAVGAAITLLVRRNKPSEADEIPLKTCEDCGGSGI 99 Query: 231 CSECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 CSEC GEGFVL+KLS+E+AE+AR++AKN ATRYTAGLPKKWSY Sbjct: 100 CSECSGEGFVLKKLSEESAERARLSAKNMATRYTAGLPKKWSY 142 >ref|XP_004506050.1| PREDICTED: uncharacterized protein LOC101504193 [Cicer arietinum] Length = 145 Score = 110 bits (275), Expect = 3e-22 Identities = 57/98 (58%), Positives = 67/98 (68%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 + + ISS+LPE TLLVKR++ S+ I K C+DCGGSGIC C+ Sbjct: 27 RSYTISSALPETAVSVAVAATFVGAAATLLVKRSKPSESTEIQFKVCEDCGGSGICPACR 86 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 GEGFVLRK SDE+AEKAR AKN ATR+TAGLPKKWSY Sbjct: 87 GEGFVLRKRSDESAEKARNLAKNMATRFTAGLPKKWSY 124 >ref|XP_002523690.1| conserved hypothetical protein [Ricinus communis] gi|223536994|gb|EEF38630.1| conserved hypothetical protein [Ricinus communis] Length = 213 Score = 109 bits (273), Expect = 5e-22 Identities = 54/94 (57%), Positives = 68/94 (72%) Frame = -1 Query: 384 ISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGF 205 ISS+LPE T+LV+RT++++ I K C+DC GSG+CSECKGEGF Sbjct: 99 ISSALPETTASLAIAAAVVGTAATVLVRRTKATETNEIQLKTCEDCEGSGLCSECKGEGF 158 Query: 204 VLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 VL+KLS+E+AE+AR+ AKN ATRYTA LPKKWSY Sbjct: 159 VLKKLSEESAERARLNAKNMATRYTAALPKKWSY 192 >gb|EMJ07771.1| hypothetical protein PRUPE_ppa020143mg, partial [Prunus persica] Length = 129 Score = 108 bits (271), Expect = 9e-22 Identities = 52/94 (55%), Positives = 68/94 (72%) Frame = -1 Query: 384 ISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECKGEGF 205 +S++LPE T+LV+RT++S+ + K C+ CGGSGIC ECKGEGF Sbjct: 14 VSAALPETAASIAIAAAVVGTAATILVRRTKASEAAEVPMKICEACGGSGICPECKGEGF 73 Query: 204 VLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 VL++LSDE+AE+AR+ +KN ATRYTAGLPKKWSY Sbjct: 74 VLKRLSDESAERARLASKNMATRYTAGLPKKWSY 107 >ref|XP_004147283.1| PREDICTED: uncharacterized protein LOC101218484 [Cucumis sativus] gi|449501230|ref|XP_004161313.1| PREDICTED: uncharacterized LOC101218484 [Cucumis sativus] Length = 164 Score = 108 bits (269), Expect = 2e-21 Identities = 54/102 (52%), Positives = 67/102 (65%) Frame = -1 Query: 408 QPASKMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGIC 229 Q + I+SSLPE T L +R ++S+ + C+DCGGSG+C Sbjct: 42 QGGRRTLTIASSLPETAASVAIAATVVGAAATFLSRRNKNSEAVEVPLITCEDCGGSGLC 101 Query: 228 SECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 SECKGEGFVL+KLSDENAE+AR+ AKN ATR+TA LPKKWSY Sbjct: 102 SECKGEGFVLKKLSDENAERARLAAKNMATRFTAALPKKWSY 143 >ref|XP_006590283.1| PREDICTED: uncharacterized protein LOC100779715 isoform X2 [Glycine max] Length = 180 Score = 97.8 bits (242), Expect(2) = 1e-20 Identities = 51/91 (56%), Positives = 63/91 (69%) Frame = -1 Query: 396 KMFKISSSLPEXXXXXXXXXXXXXXXXTLLVKRTESSKKPLILTKDCDDCGGSGICSECK 217 ++ +SS+L E TLLVKR+++S+ I K C+DCGGSGICSECK Sbjct: 56 RLSTVSSALAETAASMAVAVTVVGAAATLLVKRSKTSESSQIQFKACEDCGGSGICSECK 115 Query: 216 GEGFVLRKLSDENAEKARMTAKNAATRYTAG 124 GEGFVLRK SDE+AEKAR+ AKN ATR+TAG Sbjct: 116 GEGFVLRKRSDESAEKARVQAKNMATRFTAG 146 Score = 28.1 bits (61), Expect(2) = 1e-20 Identities = 14/25 (56%), Positives = 17/25 (68%) Frame = -2 Query: 122 FPKNGVIAQNALLAVLVDPVVELAS 48 F +NG IAQ+ALL V PVV + S Sbjct: 156 FQRNGAIAQSALLVDPVQPVVAVES 180 >ref|XP_002875709.1| hypothetical protein ARALYDRAFT_484900 [Arabidopsis lyrata subsp. lyrata] gi|297321547|gb|EFH51968.1| hypothetical protein ARALYDRAFT_484900 [Arabidopsis lyrata subsp. lyrata] Length = 163 Score = 102 bits (253), Expect = 1e-19 Identities = 46/55 (83%), Positives = 50/55 (90%) Frame = -1 Query: 267 TKDCDDCGGSGICSECKGEGFVLRKLSDENAEKARMTAKNAATRYTAGLPKKWSY 103 TK+C+ C GSGIC ECKGEGFVL+KLSD NAEKAR+TAKN ATRYTAGLPKKWSY Sbjct: 88 TKECEACLGSGICPECKGEGFVLKKLSDANAEKARLTAKNMATRYTAGLPKKWSY 142