BLASTX nr result

ID: Angelica23_contig00002983 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00002983
         (4874 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282000.1| PREDICTED: uncharacterized protein LOC100255...   234   1e-58
ref|XP_002532258.1| hypothetical protein RCOM_0624800 [Ricinus c...   191   2e-45
ref|XP_003606507.1| hypothetical protein MTR_4g061110 [Medicago ...   139   8e-30
ref|XP_002864670.1| hypothetical protein ARALYDRAFT_919258 [Arab...   136   6e-29
ref|NP_200823.1| uncharacterized protein [Arabidopsis thaliana] ...   134   3e-28

>ref|XP_002282000.1| PREDICTED: uncharacterized protein LOC100255365 [Vitis vinifera]
          Length = 1413

 Score =  234 bits (598), Expect = 1e-58
 Identities = 223/696 (32%), Positives = 318/696 (45%), Gaps = 61/696 (8%)
 Frame = +2

Query: 41   MDSDLPLLEISGEDDSLIQQTPTLRNDAVKVTDAFFSISPLQL----------PISPINQ 190
            M+SDL ++EI+GEDDSL+QQ P     AV  T  +FS SPLQ+           + P  +
Sbjct: 1    MESDLSIIEIAGEDDSLLQQIPANEIAAVN-TSNYFSCSPLQIFRPQRSAEASRLRP-TE 58

Query: 191  SMKNNVAPKRPGCSSPKI-SNKENMDFNKAEGPRLT--PHQMKRRKKG--YNLRKSIAWD 355
             +K+N    +P CSSP   SNKEN++ NK   P+L   P QMK+RKKG  YNLR+S+AWD
Sbjct: 59   DLKSNWDADKPTCSSPGDGSNKENVNENKFGVPKLCVEPQQMKKRKKGGGYNLRQSLAWD 118

Query: 356  KAFFTEEGVLDPDELSLISGTYGNSRGEALSAISEERTKSPYSSSKYTFASENFQAL--- 526
            +AFFTEEGVL+P ELS++SG +    GEALS I+EE TKS      Y   SE  QAL   
Sbjct: 119  RAFFTEEGVLNPLELSVVSGNFDKFSGEALSVINEEATKSVSDDLDYPNDSEELQALEGK 178

Query: 527  --------SASKNRNNGGLLMPQDNLLAADSKGVLSN------SGSSCNRSGAKXXXXXX 664
                    +++  ++ GG   P+ +    D++           S    NRSG+K      
Sbjct: 179  LFKELPVSTSNGGKSIGGCTQPKQDSSTCDNEAPARAVRRKVLSVHDANRSGSKRGGAQA 238

Query: 665  XXXXXXXXXXAHVHNTKSAMKVSHLQKFPDHKSGPHSLLKASKSSMLGTSQLKHNQIAHP 844
                          NTK   + + L K P  K  P  L   +KS+ L TS LK N IA P
Sbjct: 239  LTFCTQKRQAN--ANTKVVSRDAKLPKIPVSKPAPCLLNTTNKSASLSTSHLKRNHIAQP 296

Query: 845  INT-HKNISLESSCKNVKSTQNKEKSGPKCAQLPAKASAQPSRENM-----ANSSLEVNL 1006
             +T  K+  L+ S     S QN  K  P CA    K+S   +R N+     AN  LE+N 
Sbjct: 297  GSTVPKSAGLKGSSSKFISAQNNAKLRPDCASFSVKSSVLHARRNVVPTPQANPPLEINS 356

Query: 1007 LTNPPSSQIRKPNRSSEA--NLK-PIIPSGTELSGDGLY--RPKAVAGLPPRSTSVTVGN 1171
              N     + K N S +   ++K P + +  E      Y    K    LP  +  +  GN
Sbjct: 357  SGNLQPQSVTKVNSSLKVIPDIKLPAVSANREHPSIAGYSGSNKTAVSLPQNARHIG-GN 415

Query: 1172 --ESQLQKVKPSGLRMPSPSLGYFAQNNAQ-----------PSNIPKCTAIGSRKSVRDF 1312
               +Q Q  KPSGLRMPSPSLG F+Q  A             S++P      +  S+   
Sbjct: 416  LQYAQPQTTKPSGLRMPSPSLGLFSQPKASASQRNQICNLLKSSVPNYQKFHASNSIHAP 475

Query: 1313 RALHAPKNTSM---DIPENSVTVGLSSSMQCSELAASKAAMHKATKSDLMVSNIKNGNI- 1480
            R LH     S    DI    +T  L+ S +CS  +A+K  + +  K ++ V N     + 
Sbjct: 476  RPLHESGKISRVVNDIQTGDIT-ALTLSARCSISSAAKPVLPETVKLNVEVDNFNKSELK 534

Query: 1481 -PLLPDKSSEPKRSKHKVHGDIGTEENIGRHGKHENTEEIAYELDANVQITNDLKLLESG 1657
             P  P  S   K S          ++    H + +N  EI  + D+ +QI +   LL+S 
Sbjct: 535  FPCAPKSSEVIKNSILDC-----VDKKYHEHAEPQNF-EIPRKEDSELQINDHKLLLQSA 588

Query: 1658 TCKKSQEDNHGRRVYTQCQNQKIDLKISLCKPENISTSSDLIDGQLVNRHNITNIKKELS 1837
            + K   E++H     +  QN  + +K  +C  + I+    + + Q  +  ++TN      
Sbjct: 589  SSK--MENSH----TSSQQNLAVQVK-GVCGTDGITEHQFVEEKQ--DNMSVTNQHDTSE 639

Query: 1838 TSSYNLEPQNHCLGFEQTFQCTPAICSEESQGKEVN 1945
            + S              T  C+    S  S G+ VN
Sbjct: 640  SES------------RDTINCSKPSTSVVSDGQLVN 663


>ref|XP_002532258.1| hypothetical protein RCOM_0624800 [Ricinus communis]
            gi|223528046|gb|EEF30124.1| hypothetical protein
            RCOM_0624800 [Ricinus communis]
          Length = 1044

 Score =  191 bits (484), Expect = 2e-45
 Identities = 173/545 (31%), Positives = 255/545 (46%), Gaps = 47/545 (8%)
 Frame = +2

Query: 41   MDSDLPLLEISGEDDSLIQQTPTLRNDAVKVTDAFFSISPL---------QLPISPINQS 193
            M++DL L+EISGEDDSLIQQ+P           ++FS SPL          +P SPI  S
Sbjct: 1    METDLSLIEISGEDDSLIQQSPNANVSISNSPHSYFSCSPLLRIPRSTTTSIP-SPIAGS 59

Query: 194  --MKNNVAPK---RPGCSSPKISN----KENMDFNKAEGPRLT--PHQMKRRKKG--YNL 334
              +  N+A +   +P CS+ + SN    KEN++ NK EG +L+  P QMKR+KKG  YNL
Sbjct: 60   FVIGANLAEEDTSKPSCSNCEDSNSNMNKENVNLNKEEGTKLSIEPQQMKRKKKGGGYNL 119

Query: 335  RKSIAWDKAFFTEEGVLDPDELSLISGTYGNSRGEALSAISEERTKSPYSSSKYTFASEN 514
            RKS+AWD+AFFTEEGVLDP ELS++SG  G S GE LS I E R      S      + N
Sbjct: 120  RKSLAWDRAFFTEEGVLDPLELSMLSGNLGKSSGEMLSVIHEGRESLSGDSPNMHTPNNN 179

Query: 515  FQALSASKNRNNG--------GLLMPQDNLLAADS---KGVLSNSGSSCNRSGAKXXXXX 661
                S S+  N G         L  P  + +A+ S   + VL+    + + S        
Sbjct: 180  LFKESPSRTPNQGRKVAALLPKLASPARHKMASASVAKRKVLATHDINRSASKRSGCPLP 239

Query: 662  XXXXXXXXXXXAHVHNTKSAMKVSHLQKFPDHKSGPHSLLKASKSSMLGTSQLKHNQIAH 841
                       + ++ TK   K S + K    KS P  +  +S+ S+    +  HN I+ 
Sbjct: 240  PAPSSYPFRYLSFINTTKVVSKDSRVSKLSAPKSDPPVVSTSSRGSITSGGRQNHNVISQ 299

Query: 842  PINTHKNISLESSCKNVKSTQNKEKSGPKCAQLPAKASAQPSRENMANSSLEVNLLTNPP 1021
            P N  +N+ L+ +  N K+T+N  KS     +L  +++   ++ N+A  S    + +   
Sbjct: 300  PGNAQRNVGLKGNSNNTKTTRNDAKSS-SVGKLITRSTTLQAKRNVAKVSSVPEIHS--- 355

Query: 1022 SSQIRKPNR-SSEANLKPIIPSGTELSG-DGLYRPKAVAGLPPRSTSVTVGNESQLQKVK 1195
            SS ++   + S EA    ++P      G D   R  AV+       +      +Q Q  K
Sbjct: 356  SSNVQSQTKVSLEAVPDSVVPVTRPAYGPDSNTRKIAVSFSQNACYNGANMQPTQPQTAK 415

Query: 1196 PSGLRMPSPSLGYFAQN-----------NAQPSNIPKCTAIGSRKSVRDFRALHAPKNTS 1342
            PSGLRMPSPSL +F Q+           + Q  N+     +   K+         P   S
Sbjct: 416  PSGLRMPSPSLRFFGQSKLSDSHSLLERSTQACNLANSNILNFSKAGALNPVQQRPPRPS 475

Query: 1343 MDIPENSVTVGLSSSMQCSELAASKAAMHKATKSDLMVSNIKNGNIPL-LPDKSSEPKRS 1519
             +IP  S TV          +A++ AA     KS+L ++N +   + L    KS +    
Sbjct: 476  GNIPVYSSTV--------HSVASTAAASSGKIKSNLGLNNRQKVALQLQYNPKSYDTVNH 527

Query: 1520 KHKVH 1534
            K ++H
Sbjct: 528  KQQLH 532


>ref|XP_003606507.1| hypothetical protein MTR_4g061110 [Medicago truncatula]
            gi|355507562|gb|AES88704.1| hypothetical protein
            MTR_4g061110 [Medicago truncatula]
          Length = 1527

 Score =  139 bits (350), Expect = 8e-30
 Identities = 166/564 (29%), Positives = 245/564 (43%), Gaps = 53/564 (9%)
 Frame = +2

Query: 29   SLPEMDSDLPLLEISGEDDSLIQQ-TPTLRNDAVKVTDAFFSISPLQLPIS---PINQSM 196
            S+ +  SDL LLEISGEDDSL+   TPT+    V      FS SPL    S    I  S 
Sbjct: 3    SIVDSISDLSLLEISGEDDSLLSDSTPTVTAANV------FSCSPLVSARSRPRQIKGSE 56

Query: 197  KNNVAPKRPGCSSPKISNKENMDFNKAEGPRLTPHQMKRRKK---GYNLRKSIAWDKAFF 367
             +NV  +         +NKEN  ++K EG  L   Q K+RKK   G+NLRKS+AWD+AFF
Sbjct: 57   IDNVDSEDLDSLRNDSANKENAKWSKPEG--LDSSQKKKRKKKLGGFNLRKSLAWDRAFF 114

Query: 368  TEEGVLDPDELSLISGTY-GNSRGEA-LSAISEERTKS-------------PYSSSKYTF 502
            TE+GVL+P ELS+ISGT   NS+    L AI EE   +              +SS   + 
Sbjct: 115  TEQGVLNPLELSMISGTVTPNSKSNLNLEAIEEEEPATTSLALQEIEENLFKHSSGGASI 174

Query: 503  ASENFQALSASKNRNNGGLLMPQDNLLAADSKGVLSNS-GSSCNRSGAKXXXXXXXXXXX 679
             +    A+SA   +    +         A  K +  N  G+   RS              
Sbjct: 175  RNRKISAVSALSPKPVSSIKKTIPVASLAKRKILAVNDVGNKYKRSAC------------ 222

Query: 680  XXXXXAHVHNTKSAMKVSHLQKFPDHKSGPHSLLKASKSSMLGTSQLKHNQIAHPI-NTH 856
                       K+  K S + + P  KS   +    ++S ML +   K NQIA+P+ N  
Sbjct: 223  -----PRTVTVKTPSKESKITRIPVPKSS--ATATTTRSGMLSSGSSKRNQIANPVTNVP 275

Query: 857  KNISLESSCKNVKSTQN--KEKSGPKCAQLPAKASAQPSRENMANSSLEVNLLTNPPSSQ 1030
            K   ++   KN ++  +  K     KC+   ++   +P+ +++ NS   +     PPS  
Sbjct: 276  KYAGVKGPSKNPRTVASIPKVDLADKCS--VSRTLTKPAGKHLDNSVSAI----RPPSRM 329

Query: 1031 IRKPNRSSEANLKPIIPSGTELSGDGLYRPKAVAGLPPRSTSVTVGNESQ--LQKVKPSG 1204
             +  N +++ +     PS    +G G    KA     P   S T   + Q   Q  K SG
Sbjct: 330  NQTGNGANKVSEAGFPPSKMHQTGSG--ANKASEACLPHGISDTDKKKQQTLFQTSKSSG 387

Query: 1205 LRMPSPSLGYFAQNNA----------------QPSNIPKCTAIGSRKSVRDFR------- 1315
            LRMPSPS+G+F+Q  A                  SNIPK   +G+  SV D R       
Sbjct: 388  LRMPSPSIGFFSQAKASSSNGQLQKSSIPCKPSESNIPKLRKLGT-SSVNDARSKTVQGA 446

Query: 1316 ALHAPKNTSMDIPENSVTVGLSSSMQCSELAASKAAMHKATKSDLMVSNIKNGNIPLLPD 1495
            A    K  S+   ++ + V + +  + +E+    ++  K +K       +KN    +L D
Sbjct: 447  AKIRTKELSLSDVKSEIVVQIDNK-KMAEVECDSSSFEKISKQ----PEVKN----ILED 497

Query: 1496 --KSSEPKRSKHKVHGDIGTEENI 1561
                S+ +R  H+   D G E  +
Sbjct: 498  VMLKSQEQRELHENDHDSGIENMV 521


>ref|XP_002864670.1| hypothetical protein ARALYDRAFT_919258 [Arabidopsis lyrata subsp.
            lyrata] gi|297310505|gb|EFH40929.1| hypothetical protein
            ARALYDRAFT_919258 [Arabidopsis lyrata subsp. lyrata]
          Length = 1179

 Score =  136 bits (342), Expect = 6e-29
 Identities = 137/444 (30%), Positives = 205/444 (46%), Gaps = 36/444 (8%)
 Frame = +2

Query: 41   MDSDLPLLEISGEDDS---LIQQTPTLRNDAVKVTDAFFSISPLQLP-----------IS 178
            MD DL  L+ISGEDD    L++ TP   +   +   ++ + SPLQ+P            S
Sbjct: 1    MDKDL--LDISGEDDEDNWLLKNTPKKVHSGRE--KSYLNCSPLQIPRSSRIVPTRPPFS 56

Query: 179  PINQSMKNNVAPKRPGCSSPKIS-NKENMDFNKAEGPRLTPH--QMKRRKK--GYNLRKS 343
            PI +    +   ++P  S    S  KEN   +K E P+L+    QMK++KK  G+NLRKS
Sbjct: 57   PIGRVTGTSNNMEQPCASVDTDSVGKEN---SKVELPKLSVERQQMKKKKKNAGFNLRKS 113

Query: 344  IAWDKAFFTEEGVLDPDELSLISGTYGNSRGEALSAISEERTKSPYSSSKYTFASENFQA 523
            +AWD+AF TEEGVLD  ELS I+GT     G+ L AI EE  +S  S+SK T  S   QA
Sbjct: 114  LAWDRAFSTEEGVLDSSELSKITGTACQFGGDRLPAIQEEFRES-MSASKCTSVSPGLQA 172

Query: 524  LSA---------SKNRNNGGLLMPQDNLLAADSKGVLSNSGSSCNRSGAKXXXXXXXXXX 676
            L           SKNR        +  LL+A     LS                      
Sbjct: 173  LEENLFNDLPVNSKNR--------EKKLLSASRSRELS---------------------- 202

Query: 677  XXXXXXAHVHNTKSAMKVSHLQKFPDHKSGPHSLLKASKSSMLGTSQLKHNQIAHPINTH 856
                                + K P  K  P ++    K +    S+ K +Q     N+ 
Sbjct: 203  --------------------ISKVPTTKPEPLTVANNMKRTTPSPSKAKKSQPTQLKNSQ 242

Query: 857  KNISLESSCKNVKSTQNKEKSGPKCAQLPAKASAQPSRENMANSSLEVNLLTNPPSSQIR 1036
            +++  E   KN  ST++K KS         K S + +R N+ + + E+  ++N   S + 
Sbjct: 243  RSLGSEGFSKNTSSTKSKTKSSLASKSSIPKPSLKQARRNVISKTSEIPSVSNSQHSVVA 302

Query: 1037 KPNRSSEANLKPIIPSGTELSGDGLYRPKAVAGLPPRSTSVT------VGN-ESQLQKV- 1192
            K      +N+ P+  S   + G     P+  + +    TS+T      +GN +S + ++ 
Sbjct: 303  K------SNVGPMTASDVAMFGPASDIPE--SNVITLGTSLTQSSCNRLGNTQSAVSRLG 354

Query: 1193 KPSGLRMPSPSLGYFAQNNAQPSN 1264
            KPSGLR+P PS+GYF+Q+++QPS+
Sbjct: 355  KPSGLRVPKPSIGYFSQSDSQPSH 378


>ref|NP_200823.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332009903|gb|AED97286.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 1195

 Score =  134 bits (336), Expect = 3e-28
 Identities = 135/440 (30%), Positives = 198/440 (45%), Gaps = 33/440 (7%)
 Frame = +2

Query: 41   MDSDLPLLEISGEDDS---LIQQTPTLRNDAVKVTDAFFSISPLQLP-----------IS 178
            MD DL  L+ISGEDD    L++ TP   N +     ++   SPLQ+P            S
Sbjct: 1    MDKDL--LDISGEDDEDNWLLKNTPKKTNSSRG--KSYLKCSPLQIPRSSRIVPTRPPFS 56

Query: 179  PINQSMKNNVAPKRPGCSSPKIS-NKENMDFNKAEGPRLTPH--QMKRRKK--GYNLRKS 343
            PI +    +   ++P  S    S  KEN    K E P+L+    QMK++KK  G+NLRKS
Sbjct: 57   PIGRVTGTSNNREQPCASVDTDSVGKENA---KVELPKLSVERQQMKKKKKNAGFNLRKS 113

Query: 344  IAWDKAFFTEEGVLDPDELSLISGTYGNSRGEALSAISEERTKSPYSSSKYTFASENFQA 523
            +AWD+AF TEEGVLD  ELS I+GT  +  G+ L+AI EE  +S                
Sbjct: 114  LAWDRAFSTEEGVLDSSELSKITGTACHLGGDRLAAIQEEYRES---------------- 157

Query: 524  LSASKNRNNGGLLMPQDNLLAADSKGVLSNSGSSCNRSGAKXXXXXXXXXXXXXXXXAHV 703
            +SASK   + GL   ++NL   +   V S +      SG                     
Sbjct: 158  MSASKCNVSPGLQALEENLF--NDLPVNSKNREKKLVSGIMP------------------ 197

Query: 704  HNTKSAMKVSHLQKFPDHKSGPHSLLKASKSSMLGTSQLKHNQIAHPINTHKNISLESSC 883
                   K   + K P  KS P ++    K +     + K++Q     N+ +++  ES  
Sbjct: 198  -------KELSISKVPTTKSDPVTVGNNMKRTTQSPIKAKNSQPTQLKNSQRSLGSESFS 250

Query: 884  KNVKSTQNKEKSGPKCAQLPAKASAQPSRENMANSSLEVNLLTNPPSSQIRKPNRSSEAN 1063
            KN  ST++K KS         K S + +R N+ + S E+  ++    S + K      +N
Sbjct: 251  KNTSSTKSKTKSSLASKSSIPKPSLKQARRNVISKSSEIPTVSYSQHSVVAK------SN 304

Query: 1064 LKPIIPSGTELSGDGLYRPKAVAGLPPRSTSVTVGN---ESQLQKV-----------KPS 1201
            + P+  S   + G         A + P S  +T+G    +S   K            KPS
Sbjct: 305  VGPMTASDVAMLGH--------ASVIPDSNVITLGTSLAQSSCNKAGSTQSAVSRLGKPS 356

Query: 1202 GLRMPSPSLGYFAQNNAQPS 1261
            GLR P PS+GYF+Q+++QPS
Sbjct: 357  GLRAPKPSIGYFSQSDSQPS 376


Top