BLASTX nr result

ID: Angelica23_contig00020798 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00020798
         (1648 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254...   446   e-123
ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214...   417   e-114
ref|XP_002519590.1| conserved hypothetical protein [Ricinus comm...   397   e-108
ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana] ...   352   2e-94
ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arab...   344   3e-92

>ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254610 [Vitis vinifera]
          Length = 457

 Score =  446 bits (1148), Expect = e-123
 Identities = 250/457 (54%), Positives = 304/457 (66%), Gaps = 6/457 (1%)
 Frame = +2

Query: 155  CRGVEKSVEK-SLKVLNTCCKQTLLRLIRSELSFLCRHNNHPTSPLSVNIGHLEAIVHIL 331
            C  V + VE+     +   CK TLL+L  SEL+FL   + H + PLSVNI HLEA+VHIL
Sbjct: 11   CTRVMERVERLDTSKITASCKGTLLKLASSELNFLSSTHLHQSLPLSVNISHLEAVVHIL 70

Query: 332  KHPCISGVSRVCKTIPQSSLGGDRDKVSDNA-KGVYVDIVCNLNGDPVWFIVSDRNPKYI 508
            + P I+GVSRVCK  P S   G+ +K    A KGVY+DIVC LN +PVWFIVSDRNPKY+
Sbjct: 71   EQPFITGVSRVCKLFPLSPTIGNGEKSDCGAAKGVYLDIVCTLNRNPVWFIVSDRNPKYV 130

Query: 509  YWHGNASCRNKGLKMRIEHVLDAARSSVSLRPSSIILFFSNGVEDDVSEGLRSEFAAIDC 688
             W  +    NKGL+ RI+ VLDAARSS++L+PSS+ILFFSNG++  + E L+ EF A +C
Sbjct: 131  SW--DECSGNKGLRTRIQQVLDAARSSLTLKPSSVILFFSNGLDQCICEKLQGEFGAYEC 188

Query: 689  KMXXXXXXXXXXXXLEGEWVDILPRSYERASVLEIKVERLGNSSSLC----SKATLCGVA 856
             +             E EW+++  RSY  A +LEIKV+ +  S  +     S     G  
Sbjct: 189  AVEFPDCSFDFLEEPESEWINVFARSYRGACILEIKVDHVSPSVLVYDVKDSPPDAVGTQ 248

Query: 857  RPEISAFGEEMFGNFESFHSFVSLMDLSFINAKNPGPALLDNSLSTSGIINFDTTALIAI 1036
             PE     +   G   SF S +  M    ++A+      ++  L    +INFDTTALIA+
Sbjct: 249  IPEKHI--DISLG--ASFSSLILGMKFCCLHAEG-----VETLLGQDDLINFDTTALIAV 299

Query: 1037 VSGISNGSIEKLLATPENELRSRFKGNYDFVITQVMSEVQNPIHMEXXXXXXXXXXXXCK 1216
            VSGISNG  EKLLA PE E+R RFKGNY FVI QV+SE+QNPIH+E            C+
Sbjct: 300  VSGISNGGTEKLLAAPETEMRLRFKGNYKFVIAQVLSEIQNPIHVELSGLTSGKRGIICE 359

Query: 1217 TICSEFNELVSMCGGPNEKSRAKHLLEHLIVVPDSPSTRMMSLATTRKLALKNKVVFGTG 1396
            T+ SEF ELVSMCGGPNEK RA  LL+ L+VVPDSPS RMM L TTRKLALKNKVVFGTG
Sbjct: 360  TVHSEFKELVSMCGGPNEKLRADQLLKCLMVVPDSPSARMMGLPTTRKLALKNKVVFGTG 419

Query: 1397 DSWHAPTLSANMAFVRAVSQTGMSLLTIEHRPRALTG 1507
            D WHAPTL+ANMAFVRA+SQTGMSL TIEHRPRALTG
Sbjct: 420  DYWHAPTLTANMAFVRAISQTGMSLFTIEHRPRALTG 456


>ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214095 [Cucumis sativus]
            gi|449494348|ref|XP_004159521.1| PREDICTED:
            uncharacterized LOC101214095 [Cucumis sativus]
          Length = 458

 Score =  417 bits (1071), Expect = e-114
 Identities = 237/444 (53%), Positives = 299/444 (67%), Gaps = 6/444 (1%)
 Frame = +2

Query: 197  LNTCCKQTLLRLIRSELSFLCRHNNHPTSPLSVNIGHLEAIVHILKHPCISGVSRVCKTI 376
            ++  C QTL +L   EL+FL R ++  ++PLS+NIGHLEAIVHIL+HP ++G+SRVCK I
Sbjct: 30   ISVSCTQTLHKLALRELNFLSRCSSSSSAPLSLNIGHLEAIVHILQHPSVTGISRVCKPI 89

Query: 377  PQSSLGGDRDKVSDNAKGVYVDIVCNLNGDPVWFIVSDRNPKYIYWHGNASCRNKGLKMR 556
            P SS          +++ VYVDI+C LN +PVW IVSDR P+YI W+     R+KGLK R
Sbjct: 90   PSSS----------SSQAVYVDIICTLNRNPVWVIVSDRKPRYISWYKGH--RSKGLKSR 137

Query: 557  IEHVLDAARSSVSLRPSSIILFFSNGVEDDVSEGLRSEFAAIDCKMXXXXXXXXXXXXLE 736
            +E V+DAARS  +L P SIILFFS+G++  + E LR EF A +               ++
Sbjct: 138  LEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFAFSE-ID 196

Query: 737  GEWVDILPRSYERASVLEIKVERLG---NSSSLCSKATLCGVARPEISAFGEEM-FGNFE 904
            G+W+++LPRSYE A VLEIKV        SS+  SK    GV  PEI     E+ FG  +
Sbjct: 197  GDWINVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFG--D 254

Query: 905  SFHSFVSLMDLSFINA-KNPGPALLDNSLS-TSGIINFDTTALIAIVSGISNGSIEKLLA 1078
            SF S V  M  + +N  ++   A  +  L   S +INFDTTALIA+VSGISNG   KLL+
Sbjct: 255  SFCSVVMAMKPNPMNGIEDMESANFEKLLGGDSDLINFDTTALIALVSGISNGCAAKLLS 314

Query: 1079 TPENELRSRFKGNYDFVITQVMSEVQNPIHMEXXXXXXXXXXXXCKTICSEFNELVSMCG 1258
             PENELR ++K NYDFVI Q MSE++ PI +E            C++  SEF EL++MCG
Sbjct: 315  IPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSAHSEFKELITMCG 374

Query: 1259 GPNEKSRAKHLLEHLIVVPDSPSTRMMSLATTRKLALKNKVVFGTGDSWHAPTLSANMAF 1438
            GPNEKSRA HLL+H++VV D  S RM  L TTRKLALKNKVVFGTGD W+APTL+ANM+F
Sbjct: 375  GPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSF 434

Query: 1439 VRAVSQTGMSLLTIEHRPRALTGE 1510
            VRAVSQTGMSL T EHRPRALTG+
Sbjct: 435  VRAVSQTGMSLFTFEHRPRALTGD 458


>ref|XP_002519590.1| conserved hypothetical protein [Ricinus communis]
            gi|223541248|gb|EEF42801.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 425

 Score =  397 bits (1020), Expect = e-108
 Identities = 223/438 (50%), Positives = 287/438 (65%)
 Frame = +2

Query: 197  LNTCCKQTLLRLIRSELSFLCRHNNHPTSPLSVNIGHLEAIVHILKHPCISGVSRVCKTI 376
            +N  C +TLL+L  SEL+FL R    P+ PLSVNIGHLEA++H+L+HP +SGVSRVCK+I
Sbjct: 35   INHSCTRTLLKLAHSELAFLSRTCPQPSLPLSVNIGHLEAVIHLLEHPFVSGVSRVCKSI 94

Query: 377  PQSSLGGDRDKVSDNAKGVYVDIVCNLNGDPVWFIVSDRNPKYIYWHGNASCRNKGLKMR 556
                      K + ++K ++VD+VC  N +PVW IVSDRNPKYI WH    C     K+R
Sbjct: 95   ----------KTTHSSKTIHVDVVCIFNKNPVWIIVSDRNPKYISWH---DC----FKLR 137

Query: 557  IEHVLDAARSSVSLRPSSIILFFSNGVEDDVSEGLRSEFAAIDCKMXXXXXXXXXXXXLE 736
            IE +L  ARSS  ++P+SI++FF+ G++D V E L+ EF A + ++            LE
Sbjct: 138  IERLLAEARSSQIIKPTSILVFFARGLDDFVFEKLKYEFGAFEIEL---------GFDLE 188

Query: 737  GEWVDILPRSYERASVLEIKVERLGNSSSLCSKATLCGVARPEISAFGEEMFGNFESFHS 916
              W+++    Y+ +  +EIKV+  G +SS  +      V + +    G E+     +  S
Sbjct: 189  DGWINVTDTPYQDSMFIEIKVD--GTTSSRNAVLECAFVEKFD----GLELQEEDTADDS 242

Query: 917  FVSLMDLSFINAKNPGPALLDNSLSTSGIINFDTTALIAIVSGISNGSIEKLLATPENEL 1096
            F S               L+        ++NFDTTALIAIVSGISNG  EKLLA PE +L
Sbjct: 243  FTS---------------LISGFRYDGDLVNFDTTALIAIVSGISNGCREKLLAAPEIQL 287

Query: 1097 RSRFKGNYDFVITQVMSEVQNPIHMEXXXXXXXXXXXXCKTICSEFNELVSMCGGPNEKS 1276
            R RFKGN++FV+ QV+SE+QNPIH+E            C+++ SEF ELVS+CGGPNEK 
Sbjct: 288  RQRFKGNFEFVVGQVLSEIQNPIHVEMADIIHGKGGIICESVLSEFKELVSLCGGPNEKL 347

Query: 1277 RAKHLLEHLIVVPDSPSTRMMSLATTRKLALKNKVVFGTGDSWHAPTLSANMAFVRAVSQ 1456
            RA  +L+ L+VVPDSPS RMM L TTRKLALKNKVVFGTGD W APTL+ANMAFVRAVSQ
Sbjct: 348  RADKILKSLMVVPDSPSERMMCLPTTRKLALKNKVVFGTGDHWRAPTLTANMAFVRAVSQ 407

Query: 1457 TGMSLLTIEHRPRALTGE 1510
            TGMSLLTIEHRPRALTG+
Sbjct: 408  TGMSLLTIEHRPRALTGD 425


>ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana]
            gi|11120791|gb|AAG30971.1|AC012396_7 hypothetical protein
            [Arabidopsis thaliana] gi|14334538|gb|AAK59677.1| unknown
            protein [Arabidopsis thaliana] gi|21436329|gb|AAM51334.1|
            unknown protein [Arabidopsis thaliana]
            gi|332197331|gb|AEE35452.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 434

 Score =  352 bits (902), Expect = 2e-94
 Identities = 211/459 (45%), Positives = 283/459 (61%), Gaps = 7/459 (1%)
 Frame = +2

Query: 155  CRGVEKSVEKS--LKVLNTCCKQTLLRLIRSELSFLCRHNNHPT-SPLSVNIGHLEAIVH 325
            C  V +++E       +   C++TLL+L  SELSFL   ++ P+  PLSVNIGH+E++V 
Sbjct: 13   CESVIRTIENLPLSTAITASCRRTLLKLASSELSFLSSLSSDPSPKPLSVNIGHIESVVR 72

Query: 326  ILKHPCISGVSRVCKTIPQSSLGGDRDKVSDNAKGVYVDIVCNLNGDPVWFIVSDRNPKY 505
            IL+ P I+GVSRVCK IP                GV+VD+VC L   PVW IVSDRNP+Y
Sbjct: 73   ILQLPSITGVSRVCKPIPLP------------IGGVHVDLVCTLGKVPVWIIVSDRNPRY 120

Query: 506  IYWHGNASCRNKGLKMRIEHVLDAARSSVSLRPSSIILFFSNGVEDDVSEGLRSEFAAID 685
            I W+G+    +KGL+ RIE +L AA S+ +L+PSS+ILFF+NG+   V E L+ EF A+ 
Sbjct: 121  ISWNGDRH-GSKGLRSRIEQILAAANSTTTLKPSSVILFFANGLPSSVYEKLKDEFGAVY 179

Query: 686  CKMXXXXXXXXXXXXLEG---EWVDIL-PRSYERASVLEIKVERLGNSSSLCSKATLCGV 853
                           L+    EWV+++  RSY+ A  +EIK+  +    SL S  T   +
Sbjct: 180  FDFGFDSDSDSDISMLDDFDCEWVNVVRTRSYKEAVSIEIKL--IDQCDSLASPETEV-L 236

Query: 854  ARPEISAFGEEMFGNFESFHSFVSLMDLSFINAKNPGPALLDNSLSTSGIINFDTTALIA 1033
             + E++   ++     ++F + +S M L                L    +INFDTTAL+A
Sbjct: 237  VQAEVTELSQK-----DAFSTVISSMRL----------------LGEDCLINFDTTALVA 275

Query: 1034 IVSGISNGSIEKLLATPENELRSRFKGNYDFVITQVMSEVQNPIHMEXXXXXXXXXXXXC 1213
            +VSGISNG  E+L+  PE EL  +FKGN  FVI Q  SE++ P  ++            C
Sbjct: 276  LVSGISNGCAERLVDMPEIELEEKFKGNTVFVIAQARSEIEKPGLVKVGTVLSGKRGIVC 335

Query: 1214 KTICSEFNELVSMCGGPNEKSRAKHLLEHLIVVPDSPSTRMMSLATTRKLALKNKVVFGT 1393
            K++ SEF ELVSM  GPNEK RA+ LL+ L+VV D+PS R+MSL TTRKLA+KNK VFGT
Sbjct: 336  KSVFSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPTTRKLAMKNKTVFGT 395

Query: 1394 GDSWHAPTLSANMAFVRAVSQTGMSLLTIEHRPRALTGE 1510
            GD W APTL+ANMAFVRAV+Q+GMSL TI+H PRALTG+
Sbjct: 396  GDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 434


>ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arabidopsis lyrata subsp.
            lyrata] gi|297333321|gb|EFH63739.1| hypothetical protein
            ARALYDRAFT_895197 [Arabidopsis lyrata subsp. lyrata]
          Length = 433

 Score =  344 bits (883), Expect = 3e-92
 Identities = 208/458 (45%), Positives = 280/458 (61%), Gaps = 6/458 (1%)
 Frame = +2

Query: 155  CRGVEKSVEKS--LKVLNTCCKQTLLRLIRSELSFLCRHNNHPT-SPLSVNIGHLEAIVH 325
            C  V +++E       +   C++TLL+L  SELSFL   ++ P+  PLSVNIGH+E++V 
Sbjct: 13   CESVIRTIENLPLSTAITASCRRTLLKLASSELSFLSSLSSVPSPQPLSVNIGHIESVVR 72

Query: 326  ILKHPCISGVSRVCKTIPQSSLGGDRDKVSDNAKGVYVDIVCNLNGDPVWFIVSDRNPKY 505
            IL+ P ++GVSRVCK IP                GV+VD+VC L   PVW IVSDRNP+Y
Sbjct: 73   ILQLPSVTGVSRVCKPIPLP------------IGGVHVDLVCTLGKVPVWIIVSDRNPRY 120

Query: 506  IYWHGNASCRNKGLKMRIEHVLDAARSSVSLRPSSIILFFSNGVEDDVSEGLRSEFAA-- 679
            I W G+    +KGL+ RIE +L AA S+ +L+PSS+ILFF+NG+   + E L+ EF A  
Sbjct: 121  ISWSGDRH-GSKGLRSRIEQILAAANSTTTLKPSSVILFFANGLPCSIYEKLKDEFGAAH 179

Query: 680  IDCKMXXXXXXXXXXXXLEGEWVDIL-PRSYERASVLEIKVERLGNSSSLCSKATLCGVA 856
             D                + EWV+++  RSY+ A  +EIK+  +    SL S  T   + 
Sbjct: 180  FDFFGLDSDSDISMLDDFDCEWVNVVRTRSYKEAVSVEIKL--IDQCDSLASPETEV-LV 236

Query: 857  RPEISAFGEEMFGNFESFHSFVSLMDLSFINAKNPGPALLDNSLSTSGIINFDTTALIAI 1036
            + +++   ++     + F S +S M L                L    +INFDTTAL+A+
Sbjct: 237  QEDVTELSQK-----DVFSSVISSMRL----------------LGEDCLINFDTTALVAL 275

Query: 1037 VSGISNGSIEKLLATPENELRSRFKGNYDFVITQVMSEVQNPIHMEXXXXXXXXXXXXCK 1216
            VSGISNG  E+++ TPE EL  +FKGN  FVI Q  SE++ P  ++            CK
Sbjct: 276  VSGISNGCAERIVHTPEIELEEKFKGNTVFVIAQARSEIEKPGLVKMGSVLSGKRGIVCK 335

Query: 1217 TICSEFNELVSMCGGPNEKSRAKHLLEHLIVVPDSPSTRMMSLATTRKLALKNKVVFGTG 1396
            ++ SEF ELVSM  GPNEK RA+ LL+ L+VV D+PS R+MSL TTRKLA+KNK VFGTG
Sbjct: 336  SVLSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPTTRKLAMKNKTVFGTG 395

Query: 1397 DSWHAPTLSANMAFVRAVSQTGMSLLTIEHRPRALTGE 1510
            D W APTL+ANMAFVRAV+Q+GMSL T +H PRALTG+
Sbjct: 396  DRWGAPTLTANMAFVRAVAQSGMSLSTNDHSPRALTGD 433


Top