BLASTX nr result

ID: Angelica23_contig00020212 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00020212
         (3043 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002518861.1| transcription factor, putative [Ricinus comm...   137   2e-29
ref|NP_179306.2| RWP-RK domain-containing protein [Arabidopsis t...   105   1e-19
ref|NP_001031361.2| RWP-RK domain-containing protein [Arabidopsi...   104   1e-19
ref|NP_195253.4| RWP-RK domain-containing protein [Arabidopsis t...   104   2e-19
ref|XP_002279578.2| PREDICTED: protein NLP2-like [Vitis vinifera]     103   3e-19

>ref|XP_002518861.1| transcription factor, putative [Ricinus communis]
            gi|223541848|gb|EEF43394.1| transcription factor,
            putative [Ricinus communis]
          Length = 1003

 Score =  137 bits (344), Expect = 2e-29
 Identities = 103/299 (34%), Positives = 143/299 (47%), Gaps = 27/299 (9%)
 Frame = -1

Query: 2716 SYDNEVEGQREGGVISRVFWTGSPEMSPNISYYSEDEYPHKDFALSCGIRASFCFPFFNI 2537
            S D E +G  E G+  RVF    PE +PN+ YYS  EY  +D AL+  ++ +   P F  
Sbjct: 215  SADGESDG--ELGLPGRVFRQKLPEWTPNVQYYSSKEYSRRDHALNYNVQGTLALPVFEP 272

Query: 2536 NHDSGWPDGVIEIVSTCNQ--GVGSVRKICESFEGLGLCLPGFI---STNI----RKYAI 2384
            +  S    GVIE++ T  +      V K+C++ E + L     +   ST I    RK A+
Sbjct: 273  SGQSCV--GVIELIMTSQKINYAPEVDKVCKALEAVNLRSSEILDHPSTQICNEGRKNAL 330

Query: 2383 PEMDYLLDVVCRTFQFPLAQYWVAPDLFEALSMVHQYSYRNFE------------NLAPW 2240
             E+  +L VVC T++  LAQ W+          +H+ S  +F+            +LA +
Sbjct: 331  AEILEILTVVCETYKLALAQTWIP--------CMHRSSCTSFDGSCNGQVCMSTTDLASY 382

Query: 2239 SQ------FKDACWHKSLTVGEGPLGNSCTSHEAFFCKDIAALSITNYPFAHYARNCGSI 2078
                    F+DAC    L  G+G  G +  SH A FC+DI     T YP  HYAR  G  
Sbjct: 383  VVDPHMWGFRDACLEHHLQKGQGVAGRAFLSHNACFCQDITQFCKTEYPLVHYARLFGLT 442

Query: 2077 SCFTIYLCYNSLPCTTCVLEFFLPAQEMDIYYPQTLLNSLWATMKERLPNHMLASGKQL 1901
             CF I L  +       VLEFFLP    D Y  ++LL SL ATMK+   +  +ASG  L
Sbjct: 443  GCFAICLRSSYTGDDDYVLEFFLPPTISDSYEQKSLLGSLLATMKQHFQSLNVASGMDL 501



 Score = 87.8 bits (216), Expect = 2e-14
 Identities = 44/86 (51%), Positives = 60/86 (69%)
 Frame = -1

Query: 427  ACSGQDLNKMTVKVTYKGINIRFKLAGLSGIAELENNVIERLHLERKSFSIKYQDDEGDW 248
            A S Q++  +T+K TY+   IRF+++  SGI EL+  V +RL LE  +F IKY DD+ +W
Sbjct: 899  AISLQEIKSVTIKATYREDIIRFRISLSSGIVELKEEVAKRLKLEVGTFDIKYLDDDHEW 958

Query: 247  VLIACDKDVQECIEISRSLKKTTIKL 170
            VLIACD D+QECI+ISRS     I+L
Sbjct: 959  VLIACDADLQECIDISRSSGSNIIRL 984


>ref|NP_179306.2| RWP-RK domain-containing protein [Arabidopsis thaliana]
            gi|75151861|sp|Q8H111.1|NLP1_ARATH RecName: Full=Protein
            NLP1; Short=AtNLP1; AltName: Full=NIN-like protein 1;
            AltName: Full=Nodule inception protein-like protein 1
            gi|24030277|gb|AAN41311.1| unknown protein [Arabidopsis
            thaliana] gi|330251497|gb|AEC06591.1| RWP-RK
            domain-containing protein [Arabidopsis thaliana]
          Length = 909

 Score =  105 bits (261), Expect = 1e-19
 Identities = 80/271 (29%), Positives = 124/271 (45%), Gaps = 24/271 (8%)
 Frame = -1

Query: 2680 GVISRVFWTGSPEMSPNISYYSEDEYPHKDFALSCGIRASFCFPFFNINHDSGWPDGVIE 2501
            G+  RVF    PE +P++ ++  DEYP    A  C +R S   P F     SG   GV+E
Sbjct: 199  GLPGRVFLQKFPEWTPDVRFFRRDEYPRIKEAQKCDVRGSLALPVFE--RGSGTCLGVVE 256

Query: 2500 IVSTCNQGV--GSVRKICESFEGLGLCLPGFISTNIRKY----------AIPEMDYLLDV 2357
            IV+T  +      + K+C++ E + L     ++T   ++          A+PE+   L  
Sbjct: 257  IVTTTQKMNYRQELEKMCKALEAVDLRSSSNLNTPSSEFLQVYSDFYCAALPEIKDFLAT 316

Query: 2356 VCRTFQFPLAQYWVAPDLFEALSMVHQYSYRNFENLAPW------------SQFKDACWH 2213
            +CR++ FPLA  W AP   +   +  ++S  NF                    F +AC  
Sbjct: 317  ICRSYDFPLALSW-APCARQG-KVGSRHSDENFSECVSTIDSACSVPDEQSKSFWEACSE 374

Query: 2212 KSLTVGEGPLGNSCTSHEAFFCKDIAALSITNYPFAHYARNCGSISCFTIYLCYNSLPCT 2033
              L  GEG +G +  + + FF  ++A  S TNYP AH+A+  G  +   + L   S    
Sbjct: 375  HHLLQGEGIVGKAFEATKLFFVPEVATFSKTNYPLAHHAKISGLHAALAVPLKSKS-GLV 433

Query: 2032 TCVLEFFLPAQEMDIYYPQTLLNSLWATMKE 1940
              VLEFF P   +D    Q +L SL  T+++
Sbjct: 434  EFVLEFFFPKACLDTEAQQEMLKSLCVTLQQ 464



 Score = 63.2 bits (152), Expect = 4e-07
 Identities = 38/111 (34%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
 Frame = -1

Query: 481  SRRNFSHPNVTAVHDTVGACSGQDLNK---MTVKVTYKGINIRFKLAGLSGIAELENNVI 311
            S + F  P V      +   S   L     + VK T+    IRF L    G AEL+  + 
Sbjct: 782  SHKTFKEPLVLDNSSPLTGSSNTSLRARGAIKVKATFGEARIRFTLLPSWGFAELKQEIA 841

Query: 310  ERLHLERKS-FSIKYQDDEGDWVLIACDKDVQECIEISRSLKKTTIKLLLD 161
             R +++  S F +KY DD+ +WVL+ C+ D+ ECI+I R  +  TIK+ L+
Sbjct: 842  RRFNIDDISWFDLKYLDDDKEWVLLTCEADLVECIDIYRLTQTHTIKISLN 892


>ref|NP_001031361.2| RWP-RK domain-containing protein [Arabidopsis thaliana]
            gi|62320801|dbj|BAD93733.1| hypothetical protein
            [Arabidopsis thaliana] gi|330251498|gb|AEC06592.1| RWP-RK
            domain-containing protein [Arabidopsis thaliana]
          Length = 654

 Score =  104 bits (260), Expect = 1e-19
 Identities = 80/268 (29%), Positives = 123/268 (45%), Gaps = 21/268 (7%)
 Frame = -1

Query: 2680 GVISRVFWTGSPEMSPNISYYSEDEYPHKDFALSCGIRASFCFPFFNINHDSGWPDGVIE 2501
            G+  RVF    PE +P++ ++  DEYP    A  C +R S   P F     SG   GV+E
Sbjct: 199  GLPGRVFLQKFPEWTPDVRFFRRDEYPRIKEAQKCDVRGSLALPVFE--RGSGTCLGVVE 256

Query: 2500 IVSTCNQGV--GSVRKICESFEGLGLCLPGFISTNIRKY-------AIPEMDYLLDVVCR 2348
            IV+T  +      + K+C++ E + L     ++T   +        A+PE+   L  +CR
Sbjct: 257  IVTTTQKMNYRQELEKMCKALEAVDLRSSSNLNTPSSEVYSDFYCAALPEIKDFLATICR 316

Query: 2347 TFQFPLAQYWVAPDLFEALSMVHQYSYRNFENLAPW------------SQFKDACWHKSL 2204
            ++ FPLA  W AP   +   +  ++S  NF                    F +AC    L
Sbjct: 317  SYDFPLALSW-APCARQG-KVGSRHSDENFSECVSTIDSACSVPDEQSKSFWEACSEHHL 374

Query: 2203 TVGEGPLGNSCTSHEAFFCKDIAALSITNYPFAHYARNCGSISCFTIYLCYNSLPCTTCV 2024
              GEG +G +  + + FF  ++A  S TNYP AH+A+  G  +   + L   S      V
Sbjct: 375  LQGEGIVGKAFEATKLFFVPEVATFSKTNYPLAHHAKISGLHAALAVPLKSKS-GLVEFV 433

Query: 2023 LEFFLPAQEMDIYYPQTLLNSLWATMKE 1940
            LEFF P   +D    Q +L SL  T+++
Sbjct: 434  LEFFFPKACLDTEAQQEMLKSLCVTLQQ 461


>ref|NP_195253.4| RWP-RK domain-containing protein [Arabidopsis thaliana]
            gi|374095497|sp|Q7X9B9.3|NLP2_ARATH RecName: Full=Protein
            NLP2; Short=AtNLP2; AltName: Full=NIN-like protein 2;
            AltName: Full=Nodule inception protein-like protein 2
            gi|332661088|gb|AEE86488.1| RWP-RK domain-containing
            protein [Arabidopsis thaliana]
          Length = 963

 Score =  104 bits (259), Expect = 2e-19
 Identities = 83/284 (29%), Positives = 129/284 (45%), Gaps = 24/284 (8%)
 Frame = -1

Query: 2680 GVISRVFWTGSPEMSPNISYYSEDEYPHKDFALSCGIRASFCFPFFNINHDSGWPDGVIE 2501
            G+  RVF    PE +P++ ++  +EYP    A  C +R S   P F     SG   GV+E
Sbjct: 228  GLPGRVFLKKLPEWTPDVRFFRSEEYPRIKEAEQCDVRGSLALPVFE--RGSGTCLGVVE 285

Query: 2500 IVSTCNQGV--GSVRKICESFEGLGLCLPGFISTNIRKY----------AIPEMDYLLDV 2357
            IV+T  +      +  IC++ E + L     ++   R++          A+PE+   L +
Sbjct: 286  IVTTTQKMNYRPELDNICKALESVNLRSSRSLNPPSREFLQVYNEFYYAALPEVSEFLTL 345

Query: 2356 VCRTFQFPLAQYWVAPDLFEALSMVHQYSYRNFEN---------LAPWSQ---FKDACWH 2213
            VCR +  PLA  W AP   +   +  ++S  NF           + P  Q   F +AC  
Sbjct: 346  VCRVYDLPLALTW-APCARQG-KVGSRHSDENFSECVSTVDDACIVPDHQSRHFLEACSE 403

Query: 2212 KSLTVGEGPLGNSCTSHEAFFCKDIAALSITNYPFAHYARNCGSISCFTIYLCYNSLPCT 2033
              L  GEG +G +  + + FF  ++   S TNYP AH+A+  G  +   + L        
Sbjct: 404  HHLLQGEGIVGKAFNATKLFFVPEVTTFSKTNYPLAHHAKISGLHAALAVPLKNKFNSSV 463

Query: 2032 TCVLEFFLPAQEMDIYYPQTLLNSLWATMKERLPNHMLASGKQL 1901
              VLEFF P   +D    Q +L SL AT+++   +  L   K+L
Sbjct: 464  EFVLEFFFPKACLDTEAQQDMLKSLSATLQQDFRSLNLFIDKEL 507



 Score = 63.9 bits (154), Expect = 2e-07
 Identities = 29/76 (38%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
 Frame = -1

Query: 394  VKVTYKGINIRFKLAGLSGIAELENNVIERLHLERKS-FSIKYQDDEGDWVLIACDKDVQ 218
            VK T+    +RF L    G  EL++ +  R +++  + F +KY DD+ +WVL+ C+ D++
Sbjct: 865  VKATFGEAKVRFTLLPTWGFRELQHEIARRFNIDNIAPFDLKYLDDDKEWVLLTCEADLE 924

Query: 217  ECIEISRSLKKTTIKL 170
            ECI+I RS +  TIK+
Sbjct: 925  ECIDIYRSSQSRTIKI 940


>ref|XP_002279578.2| PREDICTED: protein NLP2-like [Vitis vinifera]
          Length = 895

 Score =  103 bits (257), Expect = 3e-19
 Identities = 103/375 (27%), Positives = 168/375 (44%), Gaps = 28/375 (7%)
 Frame = -1

Query: 2704 EVEGQREGGVISRVFWTGSPEMSPNISYYSEDEYPHKDFALSCGIRASFCFPFFNINHDS 2525
            E + + + G+  RVF    PE +P++ ++  +EYP  ++A    +R S   P F     S
Sbjct: 160  EEDSKEQVGLPGRVFLGKVPEWTPDVRFFKSEEYPRINYAQRYNVRGSLALPVFE--RGS 217

Query: 2524 GWPDGVIEIVSTCNQGVG---SVRKICESFEGLGL-----CLPGFISTN-IRKYAIPEMD 2372
            G   GVIEIV+T  Q +     +  +C++ E + L      +P   + N + + A+PE+ 
Sbjct: 218  GVCLGVIEIVTT-TQKINYRPELENVCKALEAVDLRSSEVLIPPVKACNELYQAALPEIL 276

Query: 2371 YLLDVVCRTFQFPLAQYWVAPDLFEALSMVHQYSYRNFENLAP------------WSQFK 2228
             +L  VCRT + PLAQ W AP + +      ++S +N+                 +  F 
Sbjct: 277  KVLARVCRTHRLPLAQTW-APCIQQGKGGC-RHSDKNYALFLSTVDHAYYVTDPKFKGFN 334

Query: 2227 DACWHKSLTVGEGPLGNSCTSHEAFFCKDIAALSITNYPFAHYARNCGSISCFTIYL--C 2054
            +AC+   L  G+G +G + T+++  F  DI A S T YP +H+AR  G  +   I L   
Sbjct: 335  EACFDHHLFRGQGVVGRALTTNQPCFESDITAFSKTEYPLSHHARMFGLRAAVAIRLKSI 394

Query: 2053 YNSLPCTTCVLEFFLPAQEMDIYYPQTLLNSLWATMKERLPNHMLASGKQLGHG--LSVG 1880
            YN       +LEFFLP    +    + +LNSL   +++      + + K L     L VG
Sbjct: 395  YNG--SADFILEFFLPTDCQETEEQKQVLNSLSIVIQQTCQIFRVVTEKDLEKESILPVG 452

Query: 1879 VINSSSSH--NEPKSFEIGQPDRSLPHHEGSGYTCTSNMFENCTAQRINSNSYEEAATEE 1706
             I  +S     +  S ++  P    P  E S +       +        S  Y++   EE
Sbjct: 453  EILFASDERVKQEGSVKLLSPPIKEPSREESSWIAHMMEAQKKGKGVSVSLEYQKEEPEE 512

Query: 1705 TLKTIPN-EVTEISL 1664
              K   N + TE+ L
Sbjct: 513  EFKVTTNWDNTEVEL 527



 Score = 59.3 bits (142), Expect = 6e-06
 Identities = 28/78 (35%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
 Frame = -1

Query: 394  VKVTYKGINIRFKLAGLSGIAELENNVIERLHLER-KSFSIKYQDDEGDWVLIACDKDVQ 218
            +K T+   N+RF L       +L+  +  R  ++   S  +KY DD+ +WVL+ CD D++
Sbjct: 800  IKATFGEENVRFSLQLNWSFKDLQQEIARRFGIDNMNSIDLKYLDDDCEWVLLTCDADLE 859

Query: 217  ECIEISRSLKKTTIKLLL 164
            ECI++ RS +   IKL L
Sbjct: 860  ECIDVYRSCQSRKIKLSL 877


Top