BLASTX nr result

ID: Angelica22_contig00025114 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00025114
         (1425 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003541595.1| PREDICTED: uncharacterized protein LOC100787...    96   2e-17
ref|XP_002510568.1| hypothetical protein RCOM_1598630 [Ricinus c...    95   4e-17
ref|XP_002301900.1| predicted protein [Populus trichocarpa] gi|2...    93   2e-16
dbj|BAC42517.1| myb like protein [Arabidopsis thaliana]                77   1e-11
dbj|BAJ53106.1| JHL20J20.13 [Jatropha curcas]                          76   2e-11

>ref|XP_003541595.1| PREDICTED: uncharacterized protein LOC100787956 [Glycine max]
          Length = 466

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 104/396 (26%), Positives = 162/396 (40%), Gaps = 49/396 (12%)
 Frame = -1

Query: 1416 CPLVVHEGCLGFEAKFDEVGNFTCPYCVYKRGFQDFCQAQRKAVFAEKNLYKYLNXXXXX 1237
            CP+ VH  CL    KFD  GNF CPYC YKR      + + KA+ A+ +L ++L+     
Sbjct: 60   CPVAVHATCLATGPKFDGSGNFCCPYCWYKRAVDTCRRLREKALEAKGDLSRFLDNHDHA 119

Query: 1236 XXXXXXXXXXXXXXVRDQGVERGPECSMRDQGADVFLVNNEGVVLLHKVGD--------- 1084
                             +  E G +   +D        + EG   +++V D         
Sbjct: 120  RAAAHVDLVVQDSEELME--ETGTQAQSKDNK------DEEGEARVNQVHDREETETEPE 171

Query: 1083 -----DGGLNDEFDRVKFVEKES-DECCSGEGNVYQEKDVDMNKRSHCGDTIQNDATRTE 922
                 +G + D  + V+  E+++  E  S E    ++K  D ++     + +    T TE
Sbjct: 172  GNKEKEGKVRDNEELVEERERKTVTEAQSQENKAEEDKFQDDSE-----ELVVETETETE 226

Query: 921  SFPEYHQGAGHYEAHDFIXXXXXXXXXXEGRMDEVKNQ---EQPHKTIEKTVCK---ESV 760
               E ++  G     +            E + +E K++       K +E+T  +   +S 
Sbjct: 227  VQCEENKEEGKVRDSEEHVEEMETETGAEAQPEEKKDEGKVRDSEKLVEETQTETEGQSE 286

Query: 759  PEVHVEAKRRVNETTASTCMDTDTISEEVNGARPPGVDIPKGSSRKS------------- 619
             +   E K  V  ++ S   D+D+++  +   +     +   S+RKS             
Sbjct: 287  EKKDEEGKVAVMSSSVSETYDSDSVAVSMKKRKDKKKKVT--SARKSLSLQQEHKNKHYK 344

Query: 618  --SKVAKPPAI--FK-----------KLPTPKMKRKRSTWXXXXXXXXXXXXXKYSKVVN 484
               KVA    +  FK           K  +   KRKR  W             K+S   N
Sbjct: 345  TRGKVANEEEVTSFKTTSLGQQPQRMKQSSLAAKRKRLLWTAEEEKVLKEGVSKFS-TEN 403

Query: 483  KNIPWQKILEDGRAVFDETRTPADLKDKWKNILSKE 376
            +NIPW+KILE G  VFDETRTP DLKDKWKNI+SK+
Sbjct: 404  QNIPWRKILEFGCRVFDETRTPVDLKDKWKNIISKK 439


>ref|XP_002510568.1| hypothetical protein RCOM_1598630 [Ricinus communis]
            gi|223551269|gb|EEF52755.1| hypothetical protein
            RCOM_1598630 [Ricinus communis]
          Length = 422

 Score = 95.1 bits (235), Expect = 4e-17
 Identities = 99/390 (25%), Positives = 155/390 (39%), Gaps = 43/390 (11%)
 Frame = -1

Query: 1416 CPLVVHEGCLGFEAKFDEVGNFTCPYCVYKRGFQDFCQAQRKAVFAEKNLYKYLNXXXXX 1237
            C + +H  C+  + K+DE GNF CPYC YK       + ++ A+ A+K L  +++     
Sbjct: 41   CAICLHVECIPRKPKYDEEGNFHCPYCWYKLQQARAQEWKKMALLAKKALSDFMDSRQVE 100

Query: 1236 XXXXXXXXXXXXXXVRDQGVERGPEC-----------SMRDQGADVF------------- 1129
                            D  V     C            +R++  +V              
Sbjct: 101  VGNDKAKLNDRRINGADTSVGPERNCCEHFTKMDVDDEVRNETGEVEEDQNEKNVKISDG 160

Query: 1128 -----LVNNEGVVLLHK---VGDDGGLNDEFDRVKFVEKESDECCSGEGNVYQEKDVDMN 973
                 +V +E V  +H+   + +D G   E D  + +++       GE     E++   N
Sbjct: 161  CRSTEVVEHENVSKIHEFEVLHNDEGTEKEKDNEQVIDQWEAGILEGE-----EQEDPFN 215

Query: 972  KRSHCGDTIQNDATR------TESFPEYHQGAGHYEAHDFIXXXXXXXXXXEGRM--DEV 817
                  +T+ +DA R      +E+           E  + +           G +  D  
Sbjct: 216  TNCIEEETLVDDALRGSAELKSEALKVSEGNQARKEEEEGVHEDAPAANCTGGDVVADVP 275

Query: 816  KNQEQPHKTIEKTVCKESVPEVHVEAKRRVNETTASTCMDT---DTISEEVNGARPPGVD 646
            K  +  ++T+   +           AK+R N+   ST   +   D IS E    +   V 
Sbjct: 276  KMSDSDNETLAARLSW---------AKQRANQKANSTKKSSHHPDNISVEKARNQNEKV- 325

Query: 645  IPKGSSRKSSKVAKPPAIFKKLPTPKMKRKRSTWXXXXXXXXXXXXXKYSKVVNKNIPWQ 466
            IP   SR++   AK       L  P  KRKR  W             K+S  VNKN+PW+
Sbjct: 326  IPLKKSRQTQAPAKK---LTNLSFPHEKRKRLHWKPEEEEMLREGVQKFSTTVNKNLPWK 382

Query: 465  KILEDGRAVFDETRTPADLKDKWKNILSKE 376
            KILE G  VFD +RTPADLKDKW+NI++K+
Sbjct: 383  KILEFGHHVFDGSRTPADLKDKWRNIVAKD 412


>ref|XP_002301900.1| predicted protein [Populus trichocarpa] gi|222843626|gb|EEE81173.1|
            predicted protein [Populus trichocarpa]
          Length = 472

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 97/390 (24%), Positives = 146/390 (37%), Gaps = 47/390 (12%)
 Frame = -1

Query: 1416 CPLVVHEGCLGFEAKFDEVGNFTCPYCVYKRGFQDFCQAQRKAVFAEKNLYKYLNXXXXX 1237
            CP+ +HE C  F+  FD+ G F CPYC YKR      +  RKA+ A+K L  +++     
Sbjct: 87   CPVSIHEKCANFKLAFDDSGRFCCPYCSYKREVGRAKELFRKAMLAKKALLGFIDPEMVG 146

Query: 1236 XXXXXXXXXXXXXXV---RDQGVERGPECSMRDQGADVFLVNNEGVVLLHKVGDDGGLND 1066
                              RD  VE G + S  D+   +     +G +     G D G   
Sbjct: 147  GEAKRNGGERAEFDGAENRDALVEDGLKVSDCDRCEVMVDDEMDGALPGAVDGSDNGHKS 206

Query: 1065 EFDRVKFVEKESDECCS---GEGNVYQEKDVDMNKRSHCGDTIQNDATRTESFPEYHQGA 895
            + ++++ +E   D   +    E N+ +  + +  +        + D    E         
Sbjct: 207  QEEKIQGIESLEDSISNEIRDERNISETHEFETLEGEEGKQEREKDGRILEGGERAESSK 266

Query: 894  GHYEAHDFIXXXXXXXXXXEGRMDEVKNQEQPHKT----IEKTVCKESVPEVHVEAKRR- 730
             HY     +              +E K QE+ H+      E+  C     +VH +A+   
Sbjct: 267  DHY-----VEKEQKQMQQDGCDDEEQKEQEEKHQDGCDDKEQGQCVGE-EQVHHDAREAN 320

Query: 729  ----VNETTASTCMDTDTISEEVNGARPPGVDIPKGSS------------------RKSS 616
                V    A    D+DT    V   R   +   K +                    K +
Sbjct: 321  SGGGVAAPKAPHVSDSDTGKSVVLRRRVKHIGKKKIAESLDAKLSKEAPPQRHTIDEKEA 380

Query: 615  KVAKPPAIFKKLP-----TPKM---------KRKRSTWXXXXXXXXXXXXXKYSKVVNKN 478
            K+ K   I  K P     +PK+         KR+R  W             K++   NKN
Sbjct: 381  KIQKKKVILSKEPRQRLESPKISSNLYPRNEKRQRLNWTADEEDTLKEGVEKFAIPGNKN 440

Query: 477  IPWQKILEDGRAVFDETRTPADLKDKWKNI 388
             PW+KILE G  VFD TRTP DLKDKW+N+
Sbjct: 441  TPWRKILEFGHRVFDSTRTPTDLKDKWRNM 470


>dbj|BAC42517.1| myb like protein [Arabidopsis thaliana]
          Length = 420

 Score = 76.6 bits (187), Expect = 1e-11
 Identities = 89/374 (23%), Positives = 137/374 (36%), Gaps = 29/374 (7%)
 Frame = -1

Query: 1419 DCPLVVHEGCL---------GFEAKFDEVGNFTCPYCVYKRGFQDFCQAQRKAVFAEKNL 1267
            DC L  H  CL            +  ++V N  CPYC  K         + K V AEK +
Sbjct: 90   DCLLSFHGECLYADLGSTSSSSSSSSEDVSNPFCPYCWLKIVALKSKTLREKTVEAEKAV 149

Query: 1266 YKYLNXXXXXXXXXXXXXXXXXXXVRDQGVE-RGPECSMRDQGADVFLVN---------- 1120
             KYL+                    RD+G+   G E   ++Q  D+   +          
Sbjct: 150  CKYLDKEMKS---------------RDEGITLSGDEIGNQEQSTDIVSDHELQGEKDGCS 194

Query: 1119 -----NEGVVLLHKVGDDGGLNDEFDRVKFVEKESDECCSGEGNVYQEKDVDMNKRSHCG 955
                 ++G V   KV D+ G +++    KF + E DE    +G           +     
Sbjct: 195  SKPDADQGKVGTGKVIDEVGASEKVATEKFQDAEDDETAKDQGTRILNTGAGKKRE---- 250

Query: 954  DTIQNDATRTESFPEYHQGAGHYEAHDFIXXXXXXXXXXEGRMDEVKNQEQPHKTIEKTV 775
              + +  +  ESF    Q                         D+V+  E+  +   K +
Sbjct: 251  --VSSFLSMQESFSAKEQ-------------------------DQVQQNEKRRRRGLKII 283

Query: 774  CKESVPEVHVEAKRRVNETTASTCMDTDTISEEVNGARPPGVDIPKGSSRK---SSKVAK 604
              +      + +K   NE       +  T S +V          P G  R    ++KV  
Sbjct: 284  DSD------ISSKGSSNERNGEDVTEQVTSSVQVTS--------PSGRMRNQQATTKVVA 329

Query: 603  PPAIFKKLPTPKM-KRKRSTWXXXXXXXXXXXXXKYSKVVNKNIPWQKILEDGRAVFDET 427
                 + +   KM +R+R  W             K++   NKN+PW+KILE G  VF ET
Sbjct: 330  KSKTVRDISFFKMDQRRRLLWTYEEEEMLKVGVEKFAAEANKNMPWRKILEMGEKVFHET 389

Query: 426  RTPADLKDKWKNIL 385
            RTPADLKDKW++++
Sbjct: 390  RTPADLKDKWRSMV 403


>dbj|BAJ53106.1| JHL20J20.13 [Jatropha curcas]
          Length = 531

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 36/70 (51%), Positives = 46/70 (65%)
 Frame = -1

Query: 585 KLPTPKMKRKRSTWXXXXXXXXXXXXXKYSKVVNKNIPWQKILEDGRAVFDETRTPADLK 406
           K+P    KRKR  W             K+S  VNKN+PW+KILE GR VFD +R+P+DLK
Sbjct: 460 KMPFSHEKRKRLLWRPEEEEMLREGVQKFSSKVNKNLPWRKILEFGRHVFDASRSPSDLK 519

Query: 405 DKWKNILSKE 376
           DKW+N+L+KE
Sbjct: 520 DKWRNLLAKE 529

