BLASTX nr result

ID: Cephaelis21_contig00006757

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00006757
         (1987 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254...   473   e-131
ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214...   418   e-114
ref|XP_002519590.1| conserved hypothetical protein [Ricinus comm...   374   e-101
ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana] ...   355   3e-95
ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arab...   351   4e-94

>ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254610 [Vitis vinifera]
          Length = 457

 Score =  473 bits (1217), Expect = e-131
 Identities = 256/455 (56%), Positives = 317/455 (69%), Gaps = 3/455 (0%)
 Frame = -1

Query: 1915 RCEALQDRIQTSIIPSSSSNWKTTXXXXXXXXXXXLNRLSLLRHNPSSLSFNIGHLESVV 1736
            RC  + +R++       +++ K T           L+   L  H    LS NI HLE+VV
Sbjct: 10   RCTRVMERVERLDTSKITASCKGTLLKLASSELNFLSSTHL--HQSLPLSVNISHLEAVV 67

Query: 1735 HILQQPFITGVSRVCKTLPLSSSKDFRPKA-ASSVNNIYVDIVCCLNGSPVWFIVSDRNP 1559
            HIL+QPFITGVSRVCK  PLS +     K+   +   +Y+DIVC LN +PVWFIVSDRNP
Sbjct: 68   HILEQPFITGVSRVCKLFPLSPTIGNGEKSDCGAAKGVYLDIVCTLNRNPVWFIVSDRNP 127

Query: 1558 KYISWHGSSGNKGLRARIVEILDAAQTSETLRPSSVILFFANGLNDIVHKKLQDEFGAAD 1379
            KY+SW   SGNKGLR RI ++LDAA++S TL+PSSVILFF+NGL+  + +KLQ EFGA +
Sbjct: 128  KYVSWDECSGNKGLRTRIQQVLDAARSSLTLKPSSVILFFSNGLDQCICEKLQGEFGAYE 187

Query: 1378 VGLIFSSFDCSFSNEPEDEWINVLAKSYQAASVLKIEIS-TSMNLVLTSAKECQVRLPAL 1202
              + F      F  EPE EWINV A+SY+ A +L+I++   S ++++   K+        
Sbjct: 188  CAVEFPDCSFDFLEEPESEWINVFARSYRGACILEIKVDHVSPSVLVYDVKDSPPDAVGT 247

Query: 1201 PSPEDHVDAILALGESLCSLIAGMK-CCLQDVELNQLENCSSGHIQLVNFDTTALIAIVS 1025
              PE H+D  ++LG S  SLI GMK CCL    +  L     G   L+NFDTTALIA+VS
Sbjct: 248  QIPEKHID--ISLGASFSSLILGMKFCCLHAEGVETL----LGQDDLINFDTTALIAVVS 301

Query: 1024 GISNGGINWLLSDPESKLRSKFKCNYEFVIAQVNSEILNSIHVELIGAVSGKGGILCQSV 845
            GISNGG   LL+ PE+++R +FK NY+FVIAQV SEI N IHVEL G  SGK GI+C++V
Sbjct: 302  GISNGGTEKLLAAPETEMRLRFKGNYKFVIAQVLSEIQNPIHVELSGLTSGKRGIICETV 361

Query: 844  YSEFQELVSMCGGPKEKLRASYLLKRLRVVPDCPSSRLMSLPNTRKLALKNKVVFGTGDY 665
            +SEF+ELVSMCGGP EKLRA  LLK L VVPD PS+R+M LP TRKLALKNKVVFGTGDY
Sbjct: 362  HSEFKELVSMCGGPNEKLRADQLLKCLMVVPDSPSARMMGLPTTRKLALKNKVVFGTGDY 421

Query: 664  WHAPTLTANMGFIRAVAQTGMSLFTIEHRPRALIG 560
            WHAPTLTANM F+RA++QTGMSLFTIEHRPRAL G
Sbjct: 422  WHAPTLTANMAFVRAISQTGMSLFTIEHRPRALTG 456


>ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214095 [Cucumis sativus]
            gi|449494348|ref|XP_004159521.1| PREDICTED:
            uncharacterized LOC101214095 [Cucumis sativus]
          Length = 458

 Score =  418 bits (1075), Expect = e-114
 Identities = 232/459 (50%), Positives = 308/459 (67%), Gaps = 6/459 (1%)
 Frame = -1

Query: 1915 RCEALQDRIQTSIIPSSSSNWKTTXXXXXXXXXXXLNRLSLLRHNPSS-LSFNIGHLESV 1739
            RC+A+ D IQT  +PSS++   +            LN LS    + S+ LS NIGHLE++
Sbjct: 13   RCKAIMDIIQT--LPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSAPLSLNIGHLEAI 70

Query: 1738 VHILQQPFITGVSRVCKTLPLSSSKDFRPKAASSVNNIYVDIVCCLNGSPVWFIVSDRNP 1559
            VHILQ P +TG+SRVCK +P SSS             +YVDI+C LN +PVW IVSDR P
Sbjct: 71   VHILQHPSVTGISRVCKPIPSSSSSQA----------VYVDIICTLNRNPVWVIVSDRKP 120

Query: 1558 KYISWHGSSGNKGLRARIVEILDAAQTSETLRPSSVILFFANGLNDIVHKKLQDEFGAAD 1379
            +YISW+    +KGL++R+ E++DAA++   L P S+ILFF++GL+  + ++L+DEF A +
Sbjct: 121  RYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATE 180

Query: 1378 VGLIFSSFDCSFSNEPEDEWINVLAKSYQAASVLKIEISTSMNLVLTSAKECQVRLPALP 1199
                FS FD +FS E + +WINVL +SY+ A VL+I+++     V +S    +V    + 
Sbjct: 181  FHFNFSDFDFAFS-EIDGDWINVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVD 239

Query: 1198 SPED-HVDAILALGESLCSLIAGMKCC----LQDVELNQLENCSSGHIQLVNFDTTALIA 1034
             PE  + +  +  G+S CS++  MK      ++D+E    E    G   L+NFDTTALIA
Sbjct: 240  EPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEKLLGGDSDLINFDTTALIA 299

Query: 1033 IVSGISNGGINWLLSDPESKLRSKFKCNYEFVIAQVNSEILNSIHVELIGAVSGKGGILC 854
            +VSGISNG    LLS PE++LR K+K NY+FVI Q  SEI   I VEL   +SGK GI+C
Sbjct: 300  LVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIIC 359

Query: 853  QSVYSEFQELVSMCGGPKEKLRASYLLKRLRVVPDCPSSRLMSLPNTRKLALKNKVVFGT 674
            QS +SEF+EL++MCGGP EK RA++LLK + VV D  S R+  LP TRKLALKNKVVFGT
Sbjct: 360  QSAHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLALKNKVVFGT 419

Query: 673  GDYWHAPTLTANMGFIRAVAQTGMSLFTIEHRPRALIGE 557
            GDYW+APTLTANM F+RAV+QTGMSLFT EHRPRAL G+
Sbjct: 420  GDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 458


>ref|XP_002519590.1| conserved hypothetical protein [Ricinus communis]
            gi|223541248|gb|EEF42801.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 425

 Score =  374 bits (960), Expect = e-101
 Identities = 223/459 (48%), Positives = 285/459 (62%), Gaps = 6/459 (1%)
 Frame = -1

Query: 1915 RCEALQDRIQTSIIPSSSSNWKTTXXXXXXXXXXXLNRLSLLRHNPS-SLSFNIGHLESV 1739
            RCE + DRI    + +S ++  T               LS     PS  LS NIGHLE+V
Sbjct: 18   RCERVIDRIHRLPLHTSINHSCTRTLLKLAHSELAF--LSRTCPQPSLPLSVNIGHLEAV 75

Query: 1738 VHILQQPFITGVSRVCKTLPLSSSKDFRPKAASSVNNIYVDIVCCLNGSPVWFIVSDRNP 1559
            +H+L+ PF++GVSRVCK++          K   S   I+VD+VC  N +PVW IVSDRNP
Sbjct: 76   IHLLEHPFVSGVSRVCKSI----------KTTHSSKTIHVDVVCIFNKNPVWIIVSDRNP 125

Query: 1558 KYISWHGSSGNKGLRARIVEILDAAQTSETLRPSSVILFFANGLNDIVHKKLQDEFGAAD 1379
            KYISWH        + RI  +L  A++S+ ++P+S+++FFA GL+D V +KL+ EFGA +
Sbjct: 126  KYISWHDC-----FKLRIERLLAEARSSQIIKPTSILVFFARGLDDFVFEKLKYEFGAFE 180

Query: 1378 VGLIFSSFDCSFSNEPEDEWINVLAKSYQAASVLKIEI---STSMNLVLTSAKECQV--R 1214
            + L F         + ED WINV    YQ +  ++I++   ++S N VL    EC    +
Sbjct: 181  IELGF---------DLEDGWINVTDTPYQDSMFIEIKVDGTTSSRNAVL----ECAFVEK 227

Query: 1213 LPALPSPEDHVDAILALGESLCSLIAGMKCCLQDVELNQLENCSSGHIQLVNFDTTALIA 1034
               L   E+         +S  SLI+G +                    LVNFDTTALIA
Sbjct: 228  FDGLELQEEDT-----ADDSFTSLISGFRY----------------DGDLVNFDTTALIA 266

Query: 1033 IVSGISNGGINWLLSDPESKLRSKFKCNYEFVIAQVNSEILNSIHVELIGAVSGKGGILC 854
            IVSGISNG    LL+ PE +LR +FK N+EFV+ QV SEI N IHVE+   + GKGGI+C
Sbjct: 267  IVSGISNGCREKLLAAPEIQLRQRFKGNFEFVVGQVLSEIQNPIHVEMADIIHGKGGIIC 326

Query: 853  QSVYSEFQELVSMCGGPKEKLRASYLLKRLRVVPDCPSSRLMSLPNTRKLALKNKVVFGT 674
            +SV SEF+ELVS+CGGP EKLRA  +LK L VVPD PS R+M LP TRKLALKNKVVFGT
Sbjct: 327  ESVLSEFKELVSLCGGPNEKLRADKILKSLMVVPDSPSERMMCLPTTRKLALKNKVVFGT 386

Query: 673  GDYWHAPTLTANMGFIRAVAQTGMSLFTIEHRPRALIGE 557
            GD+W APTLTANM F+RAV+QTGMSL TIEHRPRAL G+
Sbjct: 387  GDHWRAPTLTANMAFVRAVSQTGMSLLTIEHRPRALTGD 425


>ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana]
            gi|11120791|gb|AAG30971.1|AC012396_7 hypothetical protein
            [Arabidopsis thaliana] gi|14334538|gb|AAK59677.1| unknown
            protein [Arabidopsis thaliana] gi|21436329|gb|AAM51334.1|
            unknown protein [Arabidopsis thaliana]
            gi|332197331|gb|AEE35452.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 434

 Score =  355 bits (910), Expect = 3e-95
 Identities = 208/414 (50%), Positives = 267/414 (64%), Gaps = 5/414 (1%)
 Frame = -1

Query: 1783 NPSSLSFNIGHLESVVHILQQPFITGVSRVCKTLPLSSSKDFRPKAASSVNNIYVDIVCC 1604
            +P  LS NIGH+ESVV ILQ P ITGVSRVCK +PL             +  ++VD+VC 
Sbjct: 56   SPKPLSVNIGHIESVVRILQLPSITGVSRVCKPIPLP------------IGGVHVDLVCT 103

Query: 1603 LNGSPVWFIVSDRNPKYISWHGSS-GNKGLRARIVEILDAAQTSETLRPSSVILFFANGL 1427
            L   PVW IVSDRNP+YISW+G   G+KGLR+RI +IL AA ++ TL+PSSVILFFANGL
Sbjct: 104  LGKVPVWIIVSDRNPRYISWNGDRHGSKGLRSRIEQILAAANSTTTLKPSSVILFFANGL 163

Query: 1426 NDIVHKKLQDEFGAADVGLIFSS---FDCSFSNEPEDEWINVL-AKSYQAASVLKIEIST 1259
               V++KL+DEFGA      F S    D S  ++ + EW+NV+  +SY+ A  ++I++  
Sbjct: 164  PSSVYEKLKDEFGAVYFDFGFDSDSDSDISMLDDFDCEWVNVVRTRSYKEAVSIEIKLID 223

Query: 1258 SMNLVLTSAKECQVRLPALPSPEDHVDAILALGESLCSLIAGMKCCLQDVELNQLENCSS 1079
              + + +   E  V+              L+  ++  ++I+ M+   +D           
Sbjct: 224  QCDSLASPETEVLVQAEVTE---------LSQKDAFSTVISSMRLLGEDC---------- 264

Query: 1078 GHIQLVNFDTTALIAIVSGISNGGINWLLSDPESKLRSKFKCNYEFVIAQVNSEILNSIH 899
                L+NFDTTAL+A+VSGISNG    L+  PE +L  KFK N  FVIAQ  SEI     
Sbjct: 265  ----LINFDTTALVALVSGISNGCAERLVDMPEIELEEKFKGNTVFVIAQARSEIEKPGL 320

Query: 898  VELIGAVSGKGGILCQSVYSEFQELVSMCGGPKEKLRASYLLKRLRVVPDCPSSRLMSLP 719
            V++   +SGK GI+C+SV+SEF+ELVSM  GP EKLRA  LLK L VV D PS R+MSLP
Sbjct: 321  VKVGTVLSGKRGIVCKSVFSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLP 380

Query: 718  NTRKLALKNKVVFGTGDYWHAPTLTANMGFIRAVAQTGMSLFTIEHRPRALIGE 557
             TRKLA+KNK VFGTGD W APTLTANM F+RAVAQ+GMSL TI+H PRAL G+
Sbjct: 381  TTRKLAMKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 434


>ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arabidopsis lyrata subsp.
            lyrata] gi|297333321|gb|EFH63739.1| hypothetical protein
            ARALYDRAFT_895197 [Arabidopsis lyrata subsp. lyrata]
          Length = 433

 Score =  351 bits (901), Expect = 4e-94
 Identities = 208/413 (50%), Positives = 268/413 (64%), Gaps = 4/413 (0%)
 Frame = -1

Query: 1783 NPSSLSFNIGHLESVVHILQQPFITGVSRVCKTLPLSSSKDFRPKAASSVNNIYVDIVCC 1604
            +P  LS NIGH+ESVV ILQ P +TGVSRVCK +PL             +  ++VD+VC 
Sbjct: 56   SPQPLSVNIGHIESVVRILQLPSVTGVSRVCKPIPLP------------IGGVHVDLVCT 103

Query: 1603 LNGSPVWFIVSDRNPKYISWHGSS-GNKGLRARIVEILDAAQTSETLRPSSVILFFANGL 1427
            L   PVW IVSDRNP+YISW G   G+KGLR+RI +IL AA ++ TL+PSSVILFFANGL
Sbjct: 104  LGKVPVWIIVSDRNPRYISWSGDRHGSKGLRSRIEQILAAANSTTTLKPSSVILFFANGL 163

Query: 1426 NDIVHKKLQDEFGAA--DVGLIFSSFDCSFSNEPEDEWINVL-AKSYQAASVLKIEISTS 1256
               +++KL+DEFGAA  D   + S  D S  ++ + EW+NV+  +SY+ A  ++I++   
Sbjct: 164  PCSIYEKLKDEFGAAHFDFFGLDSDSDISMLDDFDCEWVNVVRTRSYKEAVSVEIKLIDQ 223

Query: 1255 MNLVLTSAKECQVRLPALPSPEDHVDAILALGESLCSLIAGMKCCLQDVELNQLENCSSG 1076
             + + +   E  V+       ED  +  L+  +   S+I+ M+   +D            
Sbjct: 224  CDSLASPETEVLVQ-------EDVTE--LSQKDVFSSVISSMRLLGEDC----------- 263

Query: 1075 HIQLVNFDTTALIAIVSGISNGGINWLLSDPESKLRSKFKCNYEFVIAQVNSEILNSIHV 896
               L+NFDTTAL+A+VSGISNG    ++  PE +L  KFK N  FVIAQ  SEI     V
Sbjct: 264  ---LINFDTTALVALVSGISNGCAERIVHTPEIELEEKFKGNTVFVIAQARSEIEKPGLV 320

Query: 895  ELIGAVSGKGGILCQSVYSEFQELVSMCGGPKEKLRASYLLKRLRVVPDCPSSRLMSLPN 716
            ++   +SGK GI+C+SV SEF+ELVSM  GP EKLRA  LLK L VV D PS R+MSLP 
Sbjct: 321  KMGSVLSGKRGIVCKSVLSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPT 380

Query: 715  TRKLALKNKVVFGTGDYWHAPTLTANMGFIRAVAQTGMSLFTIEHRPRALIGE 557
            TRKLA+KNK VFGTGD W APTLTANM F+RAVAQ+GMSL T +H PRAL G+
Sbjct: 381  TRKLAMKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTNDHSPRALTGD 433

