BLASTX nr result

ID: Angelica22_contig00015816 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00015816
         (1604 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002264392.1| PREDICTED: protein CHUP1, chloroplastic [Vit...   237   6e-60
emb|CAN72920.1| hypothetical protein VITISV_022322 [Vitis vinifera]   237   6e-60
ref|XP_002519412.1| conserved hypothetical protein [Ricinus comm...   207   8e-51
ref|XP_003590414.1| Protein CHUP1 [Medicago truncatula] gi|35547...   201   3e-49
ref|XP_003553573.1| PREDICTED: protein CHUP1, chloroplastic-like...   192   2e-46

>ref|XP_002264392.1| PREDICTED: protein CHUP1, chloroplastic [Vitis vinifera]
            gi|297740023|emb|CBI30205.3| unnamed protein product
            [Vitis vinifera]
          Length = 440

 Score =  237 bits (605), Expect = 6e-60
 Identities = 160/440 (36%), Positives = 238/440 (54%), Gaps = 12/440 (2%)
 Frame = +1

Query: 178  MKPVLLKAGIPIAISVAGFVLARFTFGRKNSGLKDSENELKFNNPP-----DVCNDQESC 342
            MKP +LKAG+P+A+SVA F++A+    R+N   K S  E + ++PP     +  +   S 
Sbjct: 1    MKPKILKAGLPLALSVAAFIIAKI-MERRNLVPKASSFENQVDSPPANSMVESVDGLNSA 59

Query: 343  CDDLDIIEGHQYLANIHDMNSLDTLHAQYIHELEEEIVCLKGKVQDLEGRELELKLRFFH 522
            C  L+  EG Q + N    + +++   Q   + EEEI+ L+ +++ L+ RE EL +RF  
Sbjct: 60   CVLLEEGEG-QIITN---RSLVESSEIQDPPDHEEEILALRRQIEHLQEREWELAMRFLC 115

Query: 523  YLEMKDQEIELTELENRLSLDILKSEFLDRELLSMEAETMRFEGMVIDFLKLATQLQISI 702
            Y E+K+QE  L EL +RL L+I + EFL+ E+  MEAE  R E +V+++L++  QL+   
Sbjct: 116  YCEIKEQESRLLELRSRLLLEIARVEFLNWEVSLMEAENKRHEDLVVEYLRVVEQLEFWK 175

Query: 703  FENKVLYKINRRLLSKAKEMPDVMQSKNKEIEAGQAEMSRNNEELSRRANRISVLENELL 882
             EN++L++  ++L  K ++   V++  N +IE  + E+SRN EEL RR   IS L+NE+ 
Sbjct: 176  LENRLLHREVKKLAKKTRQQSRVIRDCNLKIEGIEKEISRNQEELERRTTAISKLDNEVR 235

Query: 883  EMR---HITEKQXXXXXXXXXXXXXXXXSASLKTEERAALEGYNQLVNEFEQLQRDRAAE 1053
            E++   +  +++                S S    E  A E YNQLVNE E+L +DRAAE
Sbjct: 236  ELQATLNQVQEEKHQLSDKLKLAEKSAPSTSKSEAEGIAKEDYNQLVNELERLHKDRAAE 295

Query: 1054 VKELVYLRWCNACLRHELIRKNQQGENVEAKNSHQVXXXXXXXXXXXXXXXXVILDDHGI 1233
            VKELVYLRW NACLRHEL+R  +Q E  +     ++                    +H +
Sbjct: 296  VKELVYLRWSNACLRHELMRNQKQPEQNQESCQSELDFEPKGETGEH-------ASEHEL 348

Query: 1234 SSLECTTPSRD----XXXXXXXSKRAKLIEKFKRWVEGSEKTKRKSEHKNHGEVVKCIGK 1401
                   PS             SKR K+++K +RWV+GSEK K  SE     E +KC GK
Sbjct: 349  EGTVLEPPSEPCLGVSSGSHISSKRPKILQKLRRWVDGSEKIKPTSEEGEEHE-IKCFGK 407

Query: 1402 KMAVSDGAEEIQVSARNSCS 1461
               V   AEE  V   N+ S
Sbjct: 408  H-CVLHKAEEHHVHKNNALS 426


>emb|CAN72920.1| hypothetical protein VITISV_022322 [Vitis vinifera]
          Length = 1303

 Score =  237 bits (605), Expect = 6e-60
 Identities = 160/440 (36%), Positives = 238/440 (54%), Gaps = 12/440 (2%)
 Frame = +1

Query: 178  MKPVLLKAGIPIAISVAGFVLARFTFGRKNSGLKDSENELKFNNPP-----DVCNDQESC 342
            MKP +LKAG+P+A+SVA F++A+    R+N   K S  E + ++PP     +  +   S 
Sbjct: 1    MKPKILKAGLPLALSVAAFIIAKI-MERRNLVPKASSFENQVDSPPANSMVESVDGLNSA 59

Query: 343  CDDLDIIEGHQYLANIHDMNSLDTLHAQYIHELEEEIVCLKGKVQDLEGRELELKLRFFH 522
            C  L+  EG Q + N    + +++   Q   + EEEI+ L+ +++ L+ RE EL +RF  
Sbjct: 60   CVLLEEGEG-QIITN---RSLVESSEIQDPPDHEEEILALRRQIEHLQEREWELAMRFLC 115

Query: 523  YLEMKDQEIELTELENRLSLDILKSEFLDRELLSMEAETMRFEGMVIDFLKLATQLQISI 702
            Y E+K+QE  L EL +RL L+I + EFL+ E+  MEAE  R E +V+++L++  QL+   
Sbjct: 116  YCEIKEQESRLLELRSRLLLEIARVEFLNWEVSLMEAENKRHEDLVVEYLRVVEQLEFWK 175

Query: 703  FENKVLYKINRRLLSKAKEMPDVMQSKNKEIEAGQAEMSRNNEELSRRANRISVLENELL 882
             EN++L++  ++L  K ++   V++  N +IE  + E+SRN EEL RR   IS L+NE+ 
Sbjct: 176  LENRLLHREVKKLAKKTRQQSRVIRDCNLKIEGIEKEISRNQEELERRTTAISKLDNEVR 235

Query: 883  EMR---HITEKQXXXXXXXXXXXXXXXXSASLKTEERAALEGYNQLVNEFEQLQRDRAAE 1053
            E++   +  +++                S S    E  A E YNQLVNE E+L +DRAAE
Sbjct: 236  ELQATLNQVQEEKHQLSDKLKLAEKSAPSTSKSEAEGIAKEDYNQLVNELERLHKDRAAE 295

Query: 1054 VKELVYLRWCNACLRHELIRKNQQGENVEAKNSHQVXXXXXXXXXXXXXXXXVILDDHGI 1233
            VKELVYLRW NACLRHEL+R  +Q E  +     ++                    +H +
Sbjct: 296  VKELVYLRWSNACLRHELMRNQKQPEQNQESCQSELDFEPKGETGEH-------ASEHEL 348

Query: 1234 SSLECTTPSRD----XXXXXXXSKRAKLIEKFKRWVEGSEKTKRKSEHKNHGEVVKCIGK 1401
                   PS             SKR K+++K +RWV+GSEK K  SE     E +KC GK
Sbjct: 349  EGTVLEPPSEPCLGVSSGSHISSKRPKILQKLRRWVDGSEKIKPTSEEGEEHE-IKCFGK 407

Query: 1402 KMAVSDGAEEIQVSARNSCS 1461
               V   AEE  V   N+ S
Sbjct: 408  H-CVLHKAEEHHVHKNNALS 426


>ref|XP_002519412.1| conserved hypothetical protein [Ricinus communis]
            gi|223541275|gb|EEF42826.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 412

 Score =  207 bits (526), Expect = 8e-51
 Identities = 141/449 (31%), Positives = 226/449 (50%), Gaps = 10/449 (2%)
 Frame = +1

Query: 151  DTARTKIELMKPVLLKAGIPIAISVAGFVLARFTFGRKNSGLKDSENELKFNNPPDVC-- 324
            +++R+K E+MKP+ LKAGIP+A+SVA F+ AR    R     +     L+ NN       
Sbjct: 2    ESSRSKTEVMKPLFLKAGIPLALSVASFIYARIISRRVVISDRPKVCSLEANNGSIEAFR 61

Query: 325  NDQESCCDDLDII------EGHQYLANIHDMNSLDTLHAQYIHELEEEIVCLKGKVQDLE 486
            ++QES    L  +      +    + + HD++S +          +E+++ L+ +V++L+
Sbjct: 62   DNQESSIRGLSSLTSPIKDDEEAMITSSHDLSSTENSDIT----AQEQVLGLRSRVEELQ 117

Query: 487  GRELELKLRFFHYLEMKDQEIELTELENRLSLDILKSEFLDRELLSMEAETMRFEGMVID 666
             REL+L+++F  Y  MK+QE+ L EL+N L L+  + E LDRE+ S+EAE  RF+ +V D
Sbjct: 118  KRELDLEMKFLRYHVMKEQELVLMELKNMLVLEAARLESLDREISSIEAEKERFQNLVAD 177

Query: 667  FLKLATQLQISIFENKVLYKINRRLLSKAKEMPDVMQSKNKEIEAGQAEMSRNNEELSRR 846
            +  +  Q++    EN++L +  +RL  K  E   +++ KN +I+A ++E+     E+  R
Sbjct: 178  YFGVLEQIECVKLENRLLRRKAKRLSKKTMEQSRIIREKNSKIDAAESEILSFCNEIETR 237

Query: 847  ANRISVLENELLEMRHITEKQXXXXXXXXXXXXXXXXSASLKTEERAALEGYNQLVNEFE 1026
            +N I  LE+++                                 E   +E YNQL NE E
Sbjct: 238  SNVIKKLEDDI-------------------------------DAEFVPIEDYNQLANELE 266

Query: 1027 QLQRDRAAEVKELVYLRWCNACLRHELIRKNQQGENVEAKNSHQVXXXXXXXXXXXXXXX 1206
            QL++DRA+E  EL+YL+W NAC +HEL+R  +  E  E + + ++               
Sbjct: 267  QLRKDRASENAELIYLKWANACSKHELMRIQEHEEFDEKQENLELELEASEENRDCGSEQ 326

Query: 1207 XVILDDHGISSLECTTPSRDXXXXXXXSKRAKLIEKFKRWVEGS--EKTKRKSEHKNHGE 1380
                +   +   E  T           SKR KL+ K KRWVEGS     K K E K   E
Sbjct: 327  QE--NKSNLVRKEVDTDVATSSHDQGCSKRKKLLHKLKRWVEGSGDHMMKPKLEEKEK-E 383

Query: 1381 VVKCIGKKMAVSDGAEEIQVSARNSCSSS 1467
             +KC G+     +  E+  + AR SCSS+
Sbjct: 384  EIKCFGRLSLSEEKEEDHIIHARRSCSSA 412


>ref|XP_003590414.1| Protein CHUP1 [Medicago truncatula] gi|355479462|gb|AES60665.1|
            Protein CHUP1 [Medicago truncatula]
          Length = 411

 Score =  201 bits (512), Expect = 3e-49
 Identities = 140/436 (32%), Positives = 222/436 (50%), Gaps = 16/436 (3%)
 Frame = +1

Query: 151  DTARTKIELMKPVLLKAGIPIAISVAGFVLARFTFGR---KNSGLKDSENELKFNNPPDV 321
            + +  K E +KP++LKAG+PIA+S AG + A     +   K S   +S++     N  DV
Sbjct: 2    ENSTLKAENLKPIILKAGVPIAVSFAGLIYAWIITKKSLSKVSSFSESDSHTPEINSHDV 61

Query: 322  CNDQESCCDDLDIIEGHQYLANIHDMNSLDTLHAQYIHELEEEIVCLKGKVQDLEGRELE 501
               +ES  D+   +E           NS+D+        LE+EI CL+ K++ ++ REL 
Sbjct: 62   TQHEESF-DNFSSMEDE---GKEEYTNSIDSSVVSGSFGLEQEITCLRSKIEGMQMRELA 117

Query: 502  LKLRFFHYLEMKDQEIELTELENRLSLDILKSEFLDRELLSMEAETMRFEGMVIDFLKLA 681
            L L+F  Y EMK++E  L E++N LSL+  + EF DRE+  +E ETMR E  VI +LK+ 
Sbjct: 118  LTLQFDKYCEMKEKESMLREMKNMLSLETSRVEFFDREISFIEKETMRLENFVIQYLKII 177

Query: 682  TQLQISIFENKVLYKINRRLLSKAKEMPDVMQSKNKEIEAGQAEMSRNNEELSRRANRIS 861
             +L+    EN++L+K  ++LL K+K    +++ +   I+ G+ E+ RN +EL +RA+ I 
Sbjct: 178  EKLEYWKSENRLLHKKVQKLLKKSKAQSHLIKEQTLMIKEGEEEILRNYDELKKRASMIH 237

Query: 862  VLENELLEMRHITEK------------QXXXXXXXXXXXXXXXXSASLKTEERAALE-GY 1002
             LE+E+ EM+ I +             +                   L+ E +  +E  Y
Sbjct: 238  KLEDEIREMKRILDDFQDEKNELVKKLETSEEYGCKEELHKKPLKYYLQIESKDVMEEDY 297

Query: 1003 NQLVNEFEQLQRDRAAEVKELVYLRWCNACLRHELIRKNQQGENVEAKNSHQVXXXXXXX 1182
            N+++NE EQ++++   E++EL+YLR  N CL  EL+             +H+        
Sbjct: 298  NKVLNELEQVKKEHENEIEELIYLRKINVCLSQELM-------------NHEFHCP---- 340

Query: 1183 XXXXXXXXXVILDDHGISSLECTTPSRDXXXXXXXSKRAKLIEKFKRWVEGSEKTKRKSE 1362
                       LD   +SS+  +T   D       SK+ KLI+K K WV+GSEK + K E
Sbjct: 341  ----------FLDHQNVSSIGSSTFHGD----PSSSKKGKLIKKLKNWVDGSEKVRVKPE 386

Query: 1363 HKNHGEVVKCIGKKMA 1410
             K+  E +KC G   A
Sbjct: 387  GKSSNE-IKCFGMNSA 401


>ref|XP_003553573.1| PREDICTED: protein CHUP1, chloroplastic-like [Glycine max]
          Length = 445

 Score =  192 bits (488), Expect = 2e-46
 Identities = 137/463 (29%), Positives = 229/463 (49%), Gaps = 22/463 (4%)
 Frame = +1

Query: 145  MEDTARTKIELMKPVLLKAGIPIAISVAGFVLARFTFGR---KNSGLKDSENELKFNNPP 315
            ME+T      L+KP++LKAG+P+A+S AG + A F   +   K S L  +E      N  
Sbjct: 1    MENTTSKPEVLIKPIILKAGVPLAVSFAGCIYAWFVAKKSLSKTSSLSLNEGSSHETNSH 60

Query: 316  DVCNDQESC-CDDLDIIEGHQYLANIHDMNSLDTLHAQYIHELEEEIVCLKGKVQDLEGR 492
               N +ESC    L  +E   +   I      ++        LEEEI  L+  ++ +  +
Sbjct: 61   LEPNYEESCHSHSLSCLEDEGHSTTIDQSVVAESSMINDTPCLEEEINGLRSMIEGMHMK 120

Query: 493  ELELKLRFFHYLEMKDQEIELTELENRLSLDILKSEFLDRELLSMEAETMRFEGMVIDFL 672
            EL L+L+F  Y +MK+QE  + E++N LSL+  +  FLDRE+ SME +  R E  V  +L
Sbjct: 121  ELALRLQFGRYCDMKEQETVVGEIKNMLSLETARVGFLDREISSMEMQNRRLESFVAQYL 180

Query: 673  KLATQLQISIFENKVLYKINRRLLSKAKEMPDVMQSKNKEIEAGQAEMSRNNEELSRRAN 852
            ++  Q++    EN++L +  ++L+ K+K    + + +  +++  + E+ R+ + L  + +
Sbjct: 181  RVVEQIERWKSENRMLRRKFQKLMRKSKAQTRLAKEQASKLKLEEEEILRSRDALETKID 240

Query: 853  RISVLENELLEMRHITEKQXXXXXXXXXXXXXXXXSASLKTEERA--------------- 987
             I  LE+++ E++   ++                 S + K   ++               
Sbjct: 241  VIGKLEDKMEELQRALDQLQDEKNELLKKLDTAEKSYASKVTSKSLQFKVFHEQIEAGDV 300

Query: 988  ALEGYNQLVNEFEQLQRDRAAEVKELVYLRWCNACLRHELIRKNQQGENVEAKNSHQVXX 1167
            + E Y +L++E EQ +++RA E KEL+YLRW NACLRH+L+R ++Q +N + KN  ++  
Sbjct: 301  SREEYTKLLDELEQAKKERADEAKELIYLRWTNACLRHDLVRHHEQQQN-QDKNHLELEF 359

Query: 1168 XXXXXXXXXXXXXXV---ILDDHGISSLECTTPSRDXXXXXXXSKRAKLIEKFKRWVEGS 1338
                          +   +L+ H   S +  T   D       SKR KL+E+ KRWV+GS
Sbjct: 360  GRNDVLIHYDSEHELHNSLLEHHSDPSFDEHTRGHD-HSDSACSKRTKLLERLKRWVDGS 418

Query: 1339 EKTKRKSEHKNHGEVVKCIGKKMAVSDGAEEIQVSARNSCSSS 1467
            EK +                 + +VS GAEE  V  R SCSS+
Sbjct: 419  EKAR----------------VRHSVSKGAEEHLVPRRKSCSSA 445

