BLASTX nr result

ID: Catharanthus23_contig00004256 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00004256
         (1251 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006845507.1| hypothetical protein AMTR_s00019p00157740 [A...    99   3e-18
ref|XP_006845508.1| hypothetical protein AMTR_s00019p00158300 [A...    85   7e-14
ref|XP_002315385.2| hypothetical protein POPTR_0010s25680g [Popu...    82   4e-13
emb|CAN79598.1| hypothetical protein VITISV_020993 [Vitis vinifera]    78   7e-12
ref|XP_002533213.1| conserved hypothetical protein [Ricinus comm...    72   5e-10
ref|XP_006453484.1| hypothetical protein CICLE_v10008947mg [Citr...    69   5e-09
gb|EOY31620.1| Uncharacterized protein TCM_038592 [Theobroma cacao]    64   1e-07
ref|XP_006453473.1| hypothetical protein CICLE_v10008957mg [Citr...    62   6e-07
gb|EXB49712.1| hypothetical protein L484_006262 [Morus notabilis]      59   3e-06

>ref|XP_006845507.1| hypothetical protein AMTR_s00019p00157740 [Amborella trichopoda]
           gi|548848079|gb|ERN07182.1| hypothetical protein
           AMTR_s00019p00157740 [Amborella trichopoda]
          Length = 321

 Score = 99.4 bits (246), Expect = 3e-18
 Identities = 74/300 (24%), Positives = 126/300 (42%), Gaps = 2/300 (0%)
 Frame = +3

Query: 102 EEKQSALGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKY 281
           +EK   LG+ GILK+A ++ +KN N++LF++    +   LL  ++ L+L P+L D     
Sbjct: 8   DEKTEPLGMLGILKDALRLPLKNKNLMLFVTVSSILPLSLLLLSHQLLLRPLLVDLLIHM 67

Query: 282 AINPDIISLSKTNPKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYK 461
            +       SK   +   +I KD++  L F                    +    AY  K
Sbjct: 68  TLLSGERKNSKEALETMTQIRKDVKLILAFEVAYLLVVSIAALFALATTVYTAATAYAGK 127

Query: 462 VLTLKDIFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLL 641
            LT K +       WK+P +T +                            ++      +
Sbjct: 128 HLTKKQLLSRVFATWKRPLLTWVFIFLLCLSYFILLMICFGVLSLFVPSKAMAGFSIAFI 187

Query: 642 IFGAFFCVYFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASV 821
             G    +Y              E+N  G++A+E+A  ++KG +++QGF L ++L+    
Sbjct: 188 FMGLGGLIYLVTVWALAMVVSIVEDNCYGVEALEKAIALIKG-RRVQGFFLSLVLVVLEG 246

Query: 822 AISESLALLEKNTTK--SRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKKGHSVEDK 995
            +S  L ++EK   K        V+ I  +   CL KL+  +V+T+FY ECKK H  + K
Sbjct: 247 VMSSLLRVVEKGRGKMSGEIEVGVVLIIGS---CLVKLYADMVYTVFYFECKKRHGEKVK 303


>ref|XP_006845508.1| hypothetical protein AMTR_s00019p00158300 [Amborella trichopoda]
           gi|548848080|gb|ERN07183.1| hypothetical protein
           AMTR_s00019p00158300 [Amborella trichopoda]
          Length = 314

 Score = 84.7 bits (208), Expect = 7e-14
 Identities = 71/300 (23%), Positives = 122/300 (40%), Gaps = 7/300 (2%)
 Frame = +3

Query: 102 EEKQSALGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKY 281
           +EK   LGI GILKEA K+ +KN ++++F++ F  +   LL  ++ L+L P    FA   
Sbjct: 3   DEKTEPLGILGILKEALKLPLKNKSLMVFVTIFTILPLSLLLLSHQLLLLP----FAKVL 58

Query: 282 AINPDIISLSKTNPK----LYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEA 449
             +  ++S  + N K       +I KD+R  + F                    +     
Sbjct: 59  LFHTTLLSRERENSKEALETMTQIRKDVRLIVAFEVAFLLLISIATIFSLATTIYTVATT 118

Query: 450 YTYKVLTLKDIFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSI-- 623
           Y  K LT K +       W +P +T                           GP   +  
Sbjct: 119 YVGKHLTKKQLLSRVFATWMRPLLTWFFIFLLCVAFSVLLMLSLGVLSLFV-GPKAIVGY 177

Query: 624 -LEGGLLIFGAFFCVYFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQGFLLMV 800
            +   L++ G    +Y              E++  G  A+ +A  ++KG K++QG+ L +
Sbjct: 178 SIAVSLMVLGVL--IYLVTVWALAMVVSIVEDDCYGFDALCKASALIKG-KRMQGYFLTL 234

Query: 801 ILIFASVAISESLALLEKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKKGH 980
           +L+     I     + E+   +     +  S+    + CL KL+ ++ FT+FY ECKK H
Sbjct: 235 VLVVLGGFIGSLFRIDERG--RKIGGQIAFSVILVIVSCLLKLYNYVAFTVFYFECKKWH 292


>ref|XP_002315385.2| hypothetical protein POPTR_0010s25680g [Populus trichocarpa]
           gi|550330606|gb|EEF01556.2| hypothetical protein
           POPTR_0010s25680g [Populus trichocarpa]
          Length = 306

 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 70/290 (24%), Positives = 110/290 (37%), Gaps = 5/290 (1%)
 Frame = +3

Query: 120 LGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYAINPDI 299
           L + GIL+EA  I  +NG  +L +   +   F L+   ++L+    +      Y  N  +
Sbjct: 6   LNVIGILREAITILARNGKFMLQVMLTILFPFSLIGLLHYLLAGFFIERVEDSYEKNSPL 65

Query: 300 ISLSKTNPKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKVLTLKD 479
                          KD+R  +                      HA+  +Y  K + L D
Sbjct: 66  GQ-------------KDVRTLIGLELALFAAFFFVCFFGIMLTIHASASSYLGKNMGLND 112

Query: 480 IFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLL-IFGAF 656
           +  +    WKKP +T +                      +        L G  L I  A 
Sbjct: 113 LISSIHYAWKKPLITWLCVSLFTLTYAVLAIVLIKLVSLLDPNSYAIYLWGWFLTILAAL 172

Query: 657 FCVYFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASVAISES 836
           F +Y              E +S G K ++R+  +++G+K +QGFLLM IL    V I   
Sbjct: 173 FYLYLDASWTLALVISVLENDSCGTKGLKRSEKLIRGRK-IQGFLLMFILTALVVPIYVL 231

Query: 837 LALL----EKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKK 974
           L +     + +     F         T LFCL K F+ +VFT+FY+ECK+
Sbjct: 232 LYVTATDDDDDDELGPFAQFAFRFVATVLFCLSKFFVSVVFTVFYYECKQ 281


>emb|CAN79598.1| hypothetical protein VITISV_020993 [Vitis vinifera]
          Length = 322

 Score = 78.2 bits (191), Expect = 7e-12
 Identities = 71/294 (24%), Positives = 111/294 (37%), Gaps = 4/294 (1%)
 Frame = +3

Query: 120 LGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYAINPDI 299
           + + GI +EA K   +NG ++L I   +     LL   +HL   P++      Y      
Sbjct: 26  INVIGIFREAIKTPARNGKLMLQIMLLVVSPCTLLALLHHLFAXPLMEKVEDNY------ 79

Query: 300 ISLSKTNPKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKVLTLKD 479
                  P ++ E   D+R  L                      +A    Y  + + LKD
Sbjct: 80  -----NKPTVHWE---DLRALLGIEVPFLVGFWXVSMFGITITIYAAAMTYARRSVCLKD 131

Query: 480 IFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLLIFGAFF 659
           +     ++WKKP +TS++                     IT         G   +  A  
Sbjct: 132 LLSC--IQWKKPIITSLYVSFIPVVYAILVIGLIKSINLITREDAGQAWRGATAVMAALL 189

Query: 660 CVYFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASVAISES- 836
            +Y              ++   G KA+E+A  +  G+K LQGF LM+IL   S+ I    
Sbjct: 190 YIYLTSVSTLGLVVSVMDDECYGAKALEKAVKLSXGRK-LQGFFLMLILELLSIPIYILF 248

Query: 837 -LALLEKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKK--GHSVE 989
            +A  + +      T        T LFCL  +  ++VF +FY ECKK  G  +E
Sbjct: 249 YVASTDDDDEIGAVTLFGFGFLATVLFCLVNMLSYVVFAVFYSECKKNSGEGIE 302


>ref|XP_002533213.1| conserved hypothetical protein [Ricinus communis]
           gi|223526970|gb|EEF29166.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 332

 Score = 72.0 bits (175), Expect = 5e-10
 Identities = 72/306 (23%), Positives = 125/306 (40%), Gaps = 14/306 (4%)
 Frame = +3

Query: 105 EKQSALGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYA 284
           E    LG  G+L++A K+  KNG I+  ++ F  +   +L  +      P+++D      
Sbjct: 2   ESLMLLGFAGVLRDALKVFCKNGRIMASVALFTLLTKSILYLSITFSTKPLITDLL---- 57

Query: 285 INPDIISLSKTNPKLYKEILKDIR--FFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTY 458
           +  +++ ++  N   +  IL  IR  F + +G                        A  +
Sbjct: 58  VERNLLHVTTPNTPEFTNILAHIRKDFKIFYGLECIYVILDAVTFLLSATATILAAAIIH 117

Query: 459 ---KVLTLKDIFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLS--- 620
                L+LK++ L     WK+P VTSI+                     I +  ++S   
Sbjct: 118 GGKDDLSLKNLLLRTTRSWKRPLVTSIYTTLFGLVYLFLYAAILFGITRIIKTLIISPVT 177

Query: 621 -----ILEGGLLIFGAFFCVYFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQG 785
                +    L + G  F VY              EE   G++A+ +A ++ KG   LQG
Sbjct: 178 VFFLGVASVFLSVSGIVFFVYLSAIWTLAIVVSAVEE-IRGIEAVIKATEISKGMN-LQG 235

Query: 786 FLLMVILIFASVAISESLALL-EKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYH 962
             L ++ I +S  +S  L +L + + T  R   +VI   +  L+    ++LF  FT+FY+
Sbjct: 236 ISLKLLFIISSCLLSGILMMLKDPSLTLHRIVALVIINSHGLLW----MYLFAAFTVFYY 291

Query: 963 ECKKGH 980
            CKK H
Sbjct: 292 RCKKTH 297


>ref|XP_006453484.1| hypothetical protein CICLE_v10008947mg [Citrus clementina]
           gi|567922958|ref|XP_006453485.1| hypothetical protein
           CICLE_v10008947mg [Citrus clementina]
           gi|568840291|ref|XP_006474103.1| PREDICTED:
           uncharacterized protein LOC102611566 isoform X1 [Citrus
           sinensis] gi|568840293|ref|XP_006474104.1| PREDICTED:
           uncharacterized protein LOC102611566 isoform X2 [Citrus
           sinensis] gi|557556710|gb|ESR66724.1| hypothetical
           protein CICLE_v10008947mg [Citrus clementina]
           gi|557556711|gb|ESR66725.1| hypothetical protein
           CICLE_v10008947mg [Citrus clementina]
          Length = 318

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 65/297 (21%), Positives = 119/297 (40%), Gaps = 2/297 (0%)
 Frame = +3

Query: 105 EKQSALGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYA 284
           E    LG  GIL+E  KI  KN  ++  ++    + F LL  +N     P + D  +K  
Sbjct: 6   ESDMMLGFVGILRETPKIFSKNVRLMASLTLLNLLLFSLLFLSNVFSTKPFIPDLVTKAF 65

Query: 285 INPDIISLSKTNPKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKV 464
           + P     S     L   +++D+R F+                       A+   +  + 
Sbjct: 66  LIPVTDPKSTEFAYLLIGLMQDLRVFIGLEWTYAIVITATSLFLSTATIIASAAMHGGES 125

Query: 465 LTLKDIFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLLI 644
           L  K++ L+     K+PF T  +                     I + P+   L+  L+ 
Sbjct: 126 LFFKELLLSTLKSLKRPFFTWFYITLLVLGYSFLVLENLLPMMLIIQQPIA--LKALLIT 183

Query: 645 FGAFFCVYFXXXXXXXXXXXXXE--ENSSGMKAIERARDVMKGKKKLQGFLLMVILIFAS 818
            G    + +                E   G++A+ +A  ++KG + L GF L ++   +S
Sbjct: 184 IGILASILYNYLAVIWALAFVISVLEEKCGIEALGKAAQIVKGMQLL-GFGLNIVFTVSS 242

Query: 819 VAISESLALLEKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKKGHSVE 989
             + ++L L+     +S    +VI +      CL K+F ++ +T+FY++CK+ H VE
Sbjct: 243 SILFQALRLM--TIKQSMVLPIVIGLLVVNSICLVKMFWWMAYTVFYYQCKETHGVE 297


>gb|EOY31620.1| Uncharacterized protein TCM_038592 [Theobroma cacao]
          Length = 318

 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 60/289 (20%), Positives = 117/289 (40%), Gaps = 4/289 (1%)
 Frame = +3

Query: 135 ILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYAINPDIISLSK 314
           +L + +KI +KNG ++ FI+  +     +L   N   +  +++D  +K +    +I  + 
Sbjct: 14  LLADTYKIYLKNGRLMGFIAALVISLHTVLYLLNVFSVKSLITDLITKQS---HLIPTTP 70

Query: 315 TNPKLYKEIL---KDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKVLTLKDIF 485
             P+L   ++   KDI+ +                       HA+   +  K +++KD+ 
Sbjct: 71  GTPELTNLLIGMQKDIKIYAGVEWIFLLIIAVASLFLAISTTHASALIHGGKKISIKDLV 130

Query: 486 LNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLLIFGAFFCV 665
           L      K+PFVT  +                     I    V S +    L   A    
Sbjct: 131 LRAVRSLKRPFVTCFYITLFGLGYIFLCLVTLLPLVLILGSEVTSSVFAIPLFISAMVFY 190

Query: 666 YFXXXXXXXXXXXXXEENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASVAISESLAL 845
            +              E + G++A+ +A  ++KG  KLQGF+L ++L      + + L +
Sbjct: 191 SYLSVVWNLSLVISVLEETFGIEALGKAAQIVKG-MKLQGFILNLLLTILPPLLLQCLRM 249

Query: 846 LE-KNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKKGHSVE 989
           +  K +   R    ++ + + WL    ++F    +T+ Y++CKK H  E
Sbjct: 250 ITLKQSEAIRIVITLLLLNSIWLV---RMFGHTAYTVLYYQCKKTHGEE 295


>ref|XP_006453473.1| hypothetical protein CICLE_v10008957mg [Citrus clementina]
           gi|568840315|ref|XP_006474115.1| PREDICTED:
           uncharacterized protein LOC102614717 [Citrus sinensis]
           gi|557556699|gb|ESR66713.1| hypothetical protein
           CICLE_v10008957mg [Citrus clementina]
          Length = 316

 Score = 61.6 bits (148), Expect = 6e-07
 Identities = 63/296 (21%), Positives = 118/296 (39%), Gaps = 4/296 (1%)
 Frame = +3

Query: 120 LGIFGILKEAFKITIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYAINPDI 299
           LG  GIL+E  KI  KN  ++  ++    +   L   +N     P +SD  +K  + P  
Sbjct: 9   LGFAGILRETPKIFSKNVRLMASLTLLNLLLLSLFFLSNVFCTKPFISDLVTKAFLIPVT 68

Query: 300 ISLSKTNPKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKVLTLKD 479
              S     L   +++D+R F+                       A+   +  + L  K+
Sbjct: 69  DPKSTEFAYLLIGLMQDLRVFIGLEWTYAIVITATSLFLSTATIIASAATHGGESLFFKE 128

Query: 480 IFLNFKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLLIFGAFF 659
           + L+     K+PF T  +                     I + P+ S  +  L+  G   
Sbjct: 129 LLLSTLKSLKRPFFTWFYITLLGLGYGFLVLENLLPMMLIIQQPIAS--KALLITIGILA 186

Query: 660 CVYFXXXXXXXXXXXXXE--ENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASVAISE 833
            + +                E   G++A+ +A  ++KG + L GF L ++   +S  + +
Sbjct: 187 SILYNYLAVIWALAFVISVLEEKCGIEALGKAAQIVKGMRLL-GFGLNIVFTVSSSILFQ 245

Query: 834 SLALLEKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECK--KGHSVEDK 995
           +L L+     +S    +VI +      CL ++F +I +T+FY++CK  +G  VE +
Sbjct: 246 ALRLM--TIKQSTVLPIVIGLLVVNSICLVRMFWWIAYTVFYYQCKETQGEEVESQ 299


>gb|EXB49712.1| hypothetical protein L484_006262 [Morus notabilis]
          Length = 322

 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 67/287 (23%), Positives = 113/287 (39%), Gaps = 3/287 (1%)
 Frame = +3

Query: 138 LKEAFKI-TIKNGNIVLFISFFMFIYFYLLEFANHLILAPILSDFASKYAINPDIISLSK 314
           L+++ K+ + KNG +V  +     +   +L  AN   + P L+DF  K  ++  ++S S 
Sbjct: 14  LQKSLKVFSRKNGKLVRSVILIFLLLISILLVANIFSVKPYLADFVFK--LSTILLSPSS 71

Query: 315 TN-PKLYKEILKDIRFFLLFGXXXXXXXXXXXXXXXXXXXHATHEAYTYKVLTLKDIFLN 491
            +   L   I  D+R                          ++  A+  K L L+ +   
Sbjct: 72  VDFANLLIPIKYDLRIIATIEWINAVASSTTSLLFATATILSSSAAHRGKDLHLRHLVSC 131

Query: 492 FKMKWKKPFVTSIHXXXXXXXXXXXXXXXXXXXXXITEGPVLSILEGGLLIFGAFFCVYF 671
               WK+PFVT  +                     I E  +       +LI      +YF
Sbjct: 132 VVKLWKRPFVTFFYTTLLDLGYVLFVLTFVAPFVLIFEHELTMHFVLPILILIPIALLYF 191

Query: 672 XXXXXXXXXXXXXE-ENSSGMKAIERARDVMKGKKKLQGFLLMVILIFASVAISESLALL 848
                          E  SG++A+ +A  +++G  KL+GFLL   L F ++  +  + L 
Sbjct: 192 YLAVVWTLAIVVSVLEEKSGIEALGKAGQIVRG-LKLKGFLLK--LFFGALYCALLILLR 248

Query: 849 EKNTTKSRFTWMVISIPNTWLFCLKKLFLFIVFTLFYHECKKGHSVE 989
             N  +S  T + + +      C  K+F  I +T+FYHEC K H  E
Sbjct: 249 MTNERQSVGTKLCVFLLFVNSICFLKMFSLIAYTVFYHECMKMHGEE 295

