BLASTX nr result

ID: Angelica23_contig00018043

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00018043
         (1435 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002310948.1| predicted protein [Populus trichocarpa] gi|2...   346   8e-93
ref|XP_002523014.1| conserved hypothetical protein [Ricinus comm...   334   4e-89
ref|XP_001764984.1| predicted protein [Physcomitrella patens sub...   285   2e-74
gb|AFW85506.1| putative uncharacterized protein hypro4 [Zea mays]     277   6e-72
gb|EEC80161.1| hypothetical protein OsI_21977 [Oryza sativa Indi...   275   3e-71

>ref|XP_002310948.1| predicted protein [Populus trichocarpa] gi|222850768|gb|EEE88315.1|
            predicted protein [Populus trichocarpa]
          Length = 331

 Score =  346 bits (888), Expect = 8e-93
 Identities = 179/333 (53%), Positives = 217/333 (65%), Gaps = 19/333 (5%)
 Frame = +2

Query: 161  MNLNDLNKVWEIKTLKKVRENEAKEILEKVAKQVEPIMRKRRWKVHVLSEFCPANPXXXX 340
            M+LNDLNKVWEIK LKK+ E +A+++LE+VAKQV+PIM+KR+WKV +LSEFCPANP    
Sbjct: 1    MDLNDLNKVWEIKPLKKIGEEDARKVLERVAKQVQPIMKKRKWKVKILSEFCPANPALLG 60

Query: 341  XXXXXXAQVKIRLRSPFNELEFLPYNQILDTMLHELCHNVHGPHNADFYNLLDEIRKECE 520
                  A+VK+RLR P NE +F PY Q+LDTMLHELCHN +GPHN+ FYNLLDEIRKE E
Sbjct: 61   LNIGGGAEVKLRLRRPNNEWDFFPYEQVLDTMLHELCHNEYGPHNSGFYNLLDEIRKESE 120

Query: 521  -----------QGFDLPGRRLGGYTRQPPXXXXXXXXXXXXENRAKRGSLLPSGPRRIGG 667
                       +GFDLPGRRLGG++RQPP            ENRA+R +LLPSGP+R+GG
Sbjct: 121  ELMAKGITGTGEGFDLPGRRLGGFSRQPPLSLLRQSALAATENRARRDALLPSGPKRVGG 180

Query: 668  DSNMKSALSPIXXXXXXXXXXXXXXLWCASRSSGS----NGVPE----TSKSSESVGVPE 823
            DSN+K+ALSPI              LWC S+SS S    NG  E    +S S  S G+  
Sbjct: 181  DSNIKAALSPIQAAAMAAEKRLQDDLWCGSKSSDSVVTVNGNIERPEGSSTSISSKGIAT 240

Query: 824  TIKPSIIIAEKSPETSVAPTSSDAGVMWQCGVCTLSNQSLALICEACGNPKHDSHATKKQ 1003
             I P   +  + P     PT       WQC  CTL NQ +AL+CEACG  +    A  K 
Sbjct: 241  QISPGTSMNAREP-IHDHPT-------WQCNTCTLLNQPMALVCEACGTQRLKDVAKFKS 292

Query: 1004 NVWSCKFCTLNNSTEVEKCLACGEWRYSYGAPA 1102
              WSCKFCTL NS E+++C+ACGEWRYSYG PA
Sbjct: 293  --WSCKFCTLENSVELDRCMACGEWRYSYGPPA 323


>ref|XP_002523014.1| conserved hypothetical protein [Ricinus communis]
            gi|223537736|gb|EEF39356.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 326

 Score =  334 bits (856), Expect = 4e-89
 Identities = 170/327 (51%), Positives = 211/327 (64%), Gaps = 14/327 (4%)
 Frame = +2

Query: 161  MNLNDLNKVWEIKTLK-KVRENEAKEILEKVAKQVEPIMRKRRWKVHVLSEFCPANPXXX 337
            M+LNDLNK+WE+K LK K+ E +A  +LEKVAKQV+PIMR   WKV +LSEFCP+NP   
Sbjct: 1    MDLNDLNKIWEVKPLKNKIGEEDAMILLEKVAKQVQPIMRNHHWKVRILSEFCPSNPSLM 60

Query: 338  XXXXXXXAQVKIRLRSPFNELEFLPYNQILDTMLHELCHNVHGPHNADFYNLLDEIRKEC 517
                   A++K+RLR P  E +F PY Q+LDTMLHELCHN +GPHNADFYNLLD+IRKEC
Sbjct: 61   GLNIGGGAEIKLRLRRPNCEWDFFPYEQVLDTMLHELCHNQYGPHNADFYNLLDQIRKEC 120

Query: 518  E-----------QGFDLPGRRLGGYTRQPPXXXXXXXXXXXXENRAKRGSLLPSGPRRIG 664
            E           QGFDLPGR LGG++RQPP            ENRA+RG++LPSGP+R+G
Sbjct: 121  EELIAKGITGTGQGFDLPGRCLGGFSRQPPLSSMRQTALAAAENRARRGAVLPSGPQRVG 180

Query: 665  GDSNMKSALSPIXXXXXXXXXXXXXXLWCASRS-SGSNGVPETSKSSESVGVPETIKPSI 841
            GD N+K+ALSP+              LWC S+S  G + + E  ++S    +  T +   
Sbjct: 181  GDGNIKTALSPVQAAAMAAERRLHDDLWCGSKSLEGISDLKENVEASSKSNISITFEG-- 238

Query: 842  IIAEKSPE-TSVAPTSSDAGVMWQCGVCTLSNQSLALICEACGNPKHDSHATKKQNVWSC 1018
            + +  SP   +      D    WQC +CTL NQ L LICEACG  +  S A  K  VWSC
Sbjct: 239  VSSRTSPRGQTTGQKPVDDHPQWQCHMCTLLNQPLVLICEACGPERSKSIANFK--VWSC 296

Query: 1019 KFCTLNNSTEVEKCLACGEWRYSYGAP 1099
            KFCTL NS E+E+C+ACGEWRYSYG P
Sbjct: 297  KFCTLENSVELERCIACGEWRYSYGPP 323


>ref|XP_001764984.1| predicted protein [Physcomitrella patens subsp. patens]
            gi|162683793|gb|EDQ70200.1| predicted protein
            [Physcomitrella patens subsp. patens]
          Length = 331

 Score =  285 bits (729), Expect = 2e-74
 Identities = 153/333 (45%), Positives = 194/333 (58%), Gaps = 6/333 (1%)
 Frame = +2

Query: 149  PDSEMNLNDLNKVWEIKTLKKVRENEAKEILEKVAKQVEPIMRKRRWKVHVLSEFCPANP 328
            P   ++  DL+KVWEI+TLKK +++ A+ +LE  AKQV+PIMRKR+W+V +LSEFCP NP
Sbjct: 2    PLHSISKGDLDKVWEIRTLKKEKDDVARRLLEMAAKQVQPIMRKRKWQVKLLSEFCPRNP 61

Query: 329  XXXXXXXXXXAQVKIRLRSPFNELEFLPYNQILDTMLHELCHNVHGPHNADFYNLLDEIR 508
                       +V++RLR    E EF PY  +L T+LHEL HN  GPH+A FY LLD I 
Sbjct: 62   GLLGLNIDQGREVRVRLRPYGRENEFFPYESVLGTLLHELVHNDCGPHDAKFYGLLDVIT 121

Query: 509  KECE---QGFDLPGRRLGGYTRQPPXXXXXXXXXXXXENRAKRGSLLPSGPRRIGGDSNM 679
            K      QGFD  G+RLGGYT  PP            E RAK  S +PSGP+R+GGDS +
Sbjct: 122  KGISGTGQGFDARGQRLGGYTLNPPPTNMRAVALAAAEKRAKAASFMPSGPQRLGGDSEI 181

Query: 680  KSALSPIXXXXXXXXXXXXXXLWCASRSSGSNGVPETSKSSESVGVPETIKPSIIIAEKS 859
              ALSP+              +WCA+ ++      E +K  E       +  +    E S
Sbjct: 182  MRALSPLQAAAMAAERRLRDDVWCAAPTTTGGDGLEKAKEREDSTCAHPLGHTPTDTEPS 241

Query: 860  PETSVAPTSSDAG---VMWQCGVCTLSNQSLALICEACGNPKHDSHATKKQNVWSCKFCT 1030
              + V  T SD+G     W C VCTL N SLAL C ACGN K    +TK+   WSCKFCT
Sbjct: 242  KVSVVDLTLSDSGDSISEWPCSVCTLYNTSLALACAACGNRKEQPTSTKE---WSCKFCT 298

Query: 1031 LNNSTEVEKCLACGEWRYSYGAPAFSRGPYVGT 1129
            L NS  ++ C ACG+WRYSYGAP+ +R P VGT
Sbjct: 299  LANSDLLDTCEACGQWRYSYGAPSATRAPNVGT 331


>gb|AFW85506.1| putative uncharacterized protein hypro4 [Zea mays]
          Length = 346

 Score =  277 bits (708), Expect = 6e-72
 Identities = 150/344 (43%), Positives = 193/344 (56%), Gaps = 25/344 (7%)
 Frame = +2

Query: 161  MNLNDLNKVWEIKTLK-KVRENEAKEILEKVAKQVEPIMRKRRWKVHVLSEFCPANPXXX 337
            M + DL+KVWE++ LK K     A+  L++VA+QV+PIMR+ +W+V VLSEF P NP   
Sbjct: 1    MEVGDLHKVWEVRALKIKPDATAARATLDRVARQVQPIMRRHKWRVKVLSEFSPRNPRLL 60

Query: 338  XXXXXXXAQVKIRLRSPFNELEFLPYNQILDTMLHELCHNVHGPHNADFYNLLDEIRKEC 517
                    +VK+RLR    + +F+PY ++LDTMLHELCHN  GPH+A FY L DE+RKEC
Sbjct: 61   GLNVGAGVEVKLRLRRAGRDHDFIPYEEVLDTMLHELCHNERGPHDAQFYKLWDELRKEC 120

Query: 518  E-----------QGFDLPGRRLGGYTRQPPXXXXXXXXXXXXENRAKRGSLLPSGPRRIG 664
            E           QGFD  GRR+GG+T  PP            + RA+ G+LLPSGPR++G
Sbjct: 121  EELVSKGITGTGQGFDGTGRRVGGFTVHPPPPSLRQATLAAAQKRARNGALLPSGPRKLG 180

Query: 665  GDSNMKSALSPIXXXXXXXXXXXXXXLWCASRSSGSNGVPE---TSKSSESVGVPETIKP 835
            G+S + SALSP+              LWC S    +    +     + S ++   E  K 
Sbjct: 181  GNSEIMSALSPVQAAAMAAERRMYDDLWCGSHDQSAIDDSDDVIILQESPNLTRDEKDKG 240

Query: 836  SIIIAEKSPETSVA--------PTSSDA--GVMWQCGVCTLSNQSLALICEACGNPKHDS 985
            S       P TS           T+SDA     W+CG CTL NQ LA ICE CG  K   
Sbjct: 241  SCSNTSAQPSTSSRIHIAARDDRTTSDALDSSKWECGACTLLNQPLAPICEVCGTTK-PK 299

Query: 986  HATKKQNVWSCKFCTLNNSTEVEKCLACGEWRYSYGAPAFSRGP 1117
             A  K   WSCKFCTL NST+++KC AC +WRYSYG P  + GP
Sbjct: 300  IAKAKYTTWSCKFCTLENSTKLDKCSACDQWRYSYGPPVATYGP 343


>gb|EEC80161.1| hypothetical protein OsI_21977 [Oryza sativa Indica Group]
          Length = 352

 Score =  275 bits (702), Expect = 3e-71
 Identities = 149/350 (42%), Positives = 201/350 (57%), Gaps = 31/350 (8%)
 Frame = +2

Query: 161  MNLNDLNKVWEIKTLK-KVRENEAKEILEKVAKQVEPIMRKRRWKVHVLSEFCPANPXXX 337
            M + DL+KVWEI+ LK K  E  A+ +L++VAKQV+PIMR+R+W+V VLSEF P NP   
Sbjct: 1    MEVGDLHKVWEIRALKRKPDEPAARALLDRVAKQVQPIMRRRKWRVKVLSEFSPKNPRLL 60

Query: 338  XXXXXXXAQVKIRLRSPFNELEFLPYNQILDTMLHELCHNVHGPHNADFYNLLDEIRKEC 517
                    +VK+RLR    + +F+PY ++LDTMLHELCH   GPH+A FY L DE+RKEC
Sbjct: 61   GLNVGGGVEVKLRLRRAGRDYDFIPYEEVLDTMLHELCHIERGPHDAQFYKLWDELRKEC 120

Query: 518  E-----------QGFDLPGRRLGGYTRQPPXXXXXXXXXXXXENRAKRGSLLPSGPRRIG 664
            E           QGFD  GRRLGG+T  PP            + RA+ G+LLPSGPR++G
Sbjct: 121  EELVAMGITGSGQGFDGTGRRLGGFTVHPPPPSLRQATLAAAQKRARNGALLPSGPRKLG 180

Query: 665  GDSNMKSALSPIXXXXXXXXXXXXXXLWCASR-SSGSNGVPETSKSSESVGVP----ETI 829
            G++ + SALSPI              LWC S   SG +   +     ++  +P    ++ 
Sbjct: 181  GNNEIMSALSPIQAAAMAAERRMYDDLWCGSHDQSGIDDSEDVVILEDTPNLPTQLGKST 240

Query: 830  KPSIIIAEKSPETSVA-PTSSDAG-------------VMWQCGVCTLSNQSLALICEACG 967
            K     + ++P TS+  PT++ +G              +W+C  CTL NQ LA ICE C 
Sbjct: 241  KDGFSSSSENPSTSLGFPTAAQSGSSSCRITTDAGDSSLWECVACTLLNQPLAPICEVCS 300

Query: 968  NPKHDSHATKKQNVWSCKFCTLNNSTEVEKCLACGEWRYSYGAPAFSRGP 1117
              K  +    K   WSCKFCTL NST+++KC AC +WRYS+G PA +  P
Sbjct: 301  AAKPKT-TKAKYATWSCKFCTLENSTKIDKCSACDQWRYSHGPPAATYCP 349

