BLASTX nr result

ID: Angelica22_contig00001552 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001552
         (2605 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268213.1| PREDICTED: protein CHLOROPLAST IMPORT APPARA...   447   e-123
emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis vinifera]   444   e-122
emb|CBI40095.3| unnamed protein product [Vitis vinifera]              413   e-112
ref|XP_002518229.1| CIL, putative [Ricinus communis] gi|22354257...   350   8e-94
ref|XP_004149511.1| PREDICTED: protein CHLOROPLAST IMPORT APPARA...   332   2e-88

>ref|XP_002268213.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Vitis
            vinifera]
          Length = 392

 Score =  447 bits (1150), Expect = e-123
 Identities = 243/411 (59%), Positives = 282/411 (68%), Gaps = 7/411 (1%)
 Frame = +2

Query: 971  MSSCISGGGRTYGFDLDMVKXXXXXXXXXXXXXXXXXXXXXXXX-PIAISHRKPRTPRKR 1147
            MSSC+SG GRTYGF+L++VK                         PIAIS RKPRTPRKR
Sbjct: 1    MSSCLSGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPRTPRKR 60

Query: 1148 PNQTYNEAAVLLSTAYPKIFPTKHLTKPCKSSKLCNTFLDXXXXXXXXFRTIDNSGFLLH 1327
            PNQTYNEAA LLSTAYP IF TK+L  PCK +K  ++FL+        FR  D SGFLLH
Sbjct: 61   PNQTYNEAAALLSTAYPNIFSTKNLKNPCKFTKSHDSFLEDSSELLFPFRAFDASGFLLH 120

Query: 1328 QPILEKPIVAIEPKVVNSCERACQSPPGSDLYGNSAEFCDGQQEDFDAESILDEEMEEGI 1507
            QP+ EKP   + PKVVN CE+ CQS   S+  G S E CDG +EDFDAESILDEE+E GI
Sbjct: 121  QPVQEKPSFQMLPKVVNCCEKPCQSSVESEFPGKSPELCDGFEEDFDAESILDEEIEGGI 180

Query: 1508 DSIMGNLSVNNDNIVEEATNNTCI------GYPLGLGFSANFDFGFGMRRGVRALRNVDE 1669
            DSIMGNLSV+N+ + +EATN  C       G P+GLGF   F+FGFGMRRGVRALR+VDE
Sbjct: 181  DSIMGNLSVDNE-MSDEATNPVCFNSYYGNGIPMGLGFGGKFEFGFGMRRGVRALRHVDE 239

Query: 1670 SNWWSFPTVDFVDITPKFKKISPSATQKKKKRVEKIVELKSLDCPKDNKNSIVXXXXXXX 1849
             +WW FPTVD ++I+PKF K+S    +KKKK+VEK  EL+S + PK N            
Sbjct: 240  GDWWRFPTVDILEISPKFNKVS---AEKKKKKVEKAQELRSWESPKGNS----------- 285

Query: 1850 XXXXXXLPELSAGPRLKLNYGKVMEAWSDRGSPFPDEISGSESSGNDIYARLAQIDLFSE 2029
                  +P+ ++   LKLNY  V+ AWSDRGSPF  E   +E  GND  ARLAQIDLFSE
Sbjct: 286  ------IPKSNSSLLLKLNYDDVLSAWSDRGSPFSRE---TEFPGNDTAARLAQIDLFSE 336

Query: 2030 VGGVREASVQRYKEKRRNRLFSKKIRYQVRKLNADRRPRMKGRFVRRSNSD 2182
             GGVREASV RYKEKRR RLFSKKIRYQVRK+NADRRPRMKGRFVRR NS+
Sbjct: 337  CGGVREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSN 387


>emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis vinifera]
          Length = 392

 Score =  444 bits (1142), Expect = e-122
 Identities = 243/411 (59%), Positives = 282/411 (68%), Gaps = 7/411 (1%)
 Frame = +2

Query: 971  MSSCISGGGRTYGFDLDMVKXXXXXXXXXXXXXXXXXXXXXXXX-PIAISHRKPRTPRKR 1147
            MSSC+SG GRTYGF+L++VK                         PIAIS RK RTPRKR
Sbjct: 1    MSSCLSGAGRTYGFELEIVKXPSSTSPRTSHSSSPSSTISESSNSPIAISTRKXRTPRKR 60

Query: 1148 PNQTYNEAAVLLSTAYPKIFPTKHLTKPCKSSKLCNTFLDXXXXXXXXFRTIDNSGFLLH 1327
            PNQTYNEAA LLSTAYP IF TK+L  PCK +K  ++FL+        FR  D SGFLLH
Sbjct: 61   PNQTYNEAAALLSTAYPNIFSTKNLKNPCKFTKSHDSFLEDSSELLFPFRAFDASGFLLH 120

Query: 1328 QPILEKPIVAIEPKVVNSCERACQSPPGSDLYGNSAEFCDGQQEDFDAESILDEEMEEGI 1507
            QP+ EKP   + PKVVN CE+ CQS   S+  G S E CDG +EDFDAESILDEE+E GI
Sbjct: 121  QPVQEKPSFQMLPKVVNCCEKPCQSSVESEFPGKSPELCDGFEEDFDAESILDEEIEGGI 180

Query: 1508 DSIMGNLSVNNDNIVEEATNNTCI------GYPLGLGFSANFDFGFGMRRGVRALRNVDE 1669
            DSIMGNLSV+N+ + +EATN  C       G P+GLGF   F+FGFGMRRGVRALR+VDE
Sbjct: 181  DSIMGNLSVDNE-MSDEATNPVCFNSYYGNGIPMGLGFGGKFEFGFGMRRGVRALRHVDE 239

Query: 1670 SNWWSFPTVDFVDITPKFKKISPSATQKKKKRVEKIVELKSLDCPKDNKNSIVXXXXXXX 1849
             +WW FPTVD ++I+PKF K+S    +KKKK+VEK  EL+S + PK N            
Sbjct: 240  GDWWRFPTVDILEISPKFNKVS---AEKKKKKVEKAQELRSWESPKGNS----------- 285

Query: 1850 XXXXXXLPELSAGPRLKLNYGKVMEAWSDRGSPFPDEISGSESSGNDIYARLAQIDLFSE 2029
                  +P+ ++   LKLNY  V+ AWSDRGSPF  E   +E  GND  ARLAQIDLFSE
Sbjct: 286  ------IPKSNSSLLLKLNYDDVLSAWSDRGSPFSRE---TEFPGNDTAARLAQIDLFSE 336

Query: 2030 VGGVREASVQRYKEKRRNRLFSKKIRYQVRKLNADRRPRMKGRFVRRSNSD 2182
             GGVREASV RYKEKRR RLFSKKIRYQVRK+NADRRPRMKGRFVRR NS+
Sbjct: 337  CGGVREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSN 387


>emb|CBI40095.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  413 bits (1061), Expect = e-112
 Identities = 232/412 (56%), Positives = 268/412 (65%), Gaps = 7/412 (1%)
 Frame = +2

Query: 968  KMSSCISGGGRTYGFDLDMVKXXXXXXXXXXXXXXXXXXXXXXXX-PIAISHRKPRTPRK 1144
            KMSSC+SG GRTYGF+L++VK                         PIAIS RKPRTPRK
Sbjct: 25   KMSSCLSGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPRTPRK 84

Query: 1145 RPNQTYNEAAVLLSTAYPKIFPTKHLTKPCKSSKLCNTFLDXXXXXXXXFRTIDNSGFLL 1324
            RPNQTYNEAA LLSTAYP IF TK+L  PCK +K  ++FL+        FR  D SGFLL
Sbjct: 85   RPNQTYNEAAALLSTAYPNIFSTKNLKNPCKFTKSHDSFLEDSSELLFPFRAFDASGFLL 144

Query: 1325 HQPILEKPIVAIEPKVVNSCERACQSPPGSDLYGNSAEFCDGQQEDFDAESILDEEMEEG 1504
            HQP+ EKP                           S E CDG +EDFDAESILDEE+E G
Sbjct: 145  HQPVQEKP-------------------------RKSPELCDGFEEDFDAESILDEEIEGG 179

Query: 1505 IDSIMGNLSVNNDNIVEEATNNTCI------GYPLGLGFSANFDFGFGMRRGVRALRNVD 1666
            IDSIMGNLSV+N+ + +EATN  C       G P+GLGF   F+FGFGMRRGVRALR+VD
Sbjct: 180  IDSIMGNLSVDNE-MSDEATNPVCFNSYYGNGIPMGLGFGGKFEFGFGMRRGVRALRHVD 238

Query: 1667 ESNWWSFPTVDFVDITPKFKKISPSATQKKKKRVEKIVELKSLDCPKDNKNSIVXXXXXX 1846
            E +WW FPTVD ++I+PKF K+S    +KKKK+VEK  EL+S + PK N           
Sbjct: 239  EGDWWRFPTVDILEISPKFNKVS---AEKKKKKVEKAQELRSWESPKGNS---------- 285

Query: 1847 XXXXXXXLPELSAGPRLKLNYGKVMEAWSDRGSPFPDEISGSESSGNDIYARLAQIDLFS 2026
                   +P+ ++   LKLNY  V+ AWSDRGSPF  E   +E  GND  ARLAQIDLFS
Sbjct: 286  -------IPKSNSSLLLKLNYDDVLSAWSDRGSPFSRE---TEFPGNDTAARLAQIDLFS 335

Query: 2027 EVGGVREASVQRYKEKRRNRLFSKKIRYQVRKLNADRRPRMKGRFVRRSNSD 2182
            E GGVREASV RYKEKRR RLFSKKIRYQVRK+NADRRPRMKGRFVRR NS+
Sbjct: 336  ECGGVREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSN 387


>ref|XP_002518229.1| CIL, putative [Ricinus communis] gi|223542576|gb|EEF44115.1| CIL,
            putative [Ricinus communis]
          Length = 397

 Score =  350 bits (899), Expect = 8e-94
 Identities = 216/418 (51%), Positives = 261/418 (62%), Gaps = 25/418 (5%)
 Frame = +2

Query: 974  SSCISGGGRTYGFDLDMVKXXXXXXXXXXXXXXXXXXXXXXXXPIAISHRKPRTPRKRPN 1153
            S C+SGGGR YGFDL++VK                        P+AIS RKPRT RKRPN
Sbjct: 3    SPCLSGGGRAYGFDLEIVKSPSTSTRTSHTSSPSSTLSESSNSPLAISTRKPRTHRKRPN 62

Query: 1154 QTYNEAAVLLSTAYPKIFPT---KHLTKPCKSSKLCNTFLDXXXXXXXXFRTID-NSGFL 1321
            Q YNEAA LLSTAYP IF T   +  TKP + + L +  L         FR  D +S FL
Sbjct: 63   QIYNEAAALLSTAYPNIFSTTNPRKFTKPHQDTLLLDESLSSSELLWP-FRVFDEDSSFL 121

Query: 1322 LHQPI-LEKPIVAIEPKV---VNSCER---ACQSPPGSDLYGNSAEFCDGQQEDFDAESI 1480
            LHQ +  EKP   IEPK+   +NSC++   +CQS    D  GNS E CDG +ED DAESI
Sbjct: 122  LHQTVESEKPSFLIEPKISNLMNSCDKYSFSCQS----DSQGNSMELCDGYEEDLDAESI 177

Query: 1481 LDEEMEEGIDSIMGNLSVNNDNIVEEATN--NTCIGYPLGLGFSANFDFGFGMRRGVRAL 1654
            LDEE+EEGIDSIMGNLSV+ D   + +    N+  G P+G  FS N     GMR+GVRAL
Sbjct: 178  LDEEIEEGIDSIMGNLSVSKDKTDDGSIKDVNSWYGNPMGFHFSGN-----GMRKGVRAL 232

Query: 1655 RNVDESNWWSFPTVDFVDITPKF--------KKISPSATQKK----KKRVEKIVELKSLD 1798
            R+  ESN W+FP VD + I+P+         KKI+     KK    +K+ +K+VELK+L+
Sbjct: 233  RHGHESNLWNFPIVDMLQISPRLSSNNSSSSKKITSDFKSKKCKSDEKKKKKVVELKNLE 292

Query: 1799 CPKDNKNSIVXXXXXXXXXXXXXLPELSAGPRLKLNYGKVMEAWSDRGSPFPDEISGSES 1978
              K   +                +P+ S+G  LKLNY  V+ AWSD+GSPF +EISGSE 
Sbjct: 293  MAKKESS----------------VPQSSSGLLLKLNYDGVLNAWSDKGSPFSEEISGSEG 336

Query: 1979 SGNDIYARLAQIDLFSEVGGVREASVQRYKEKRRNRLFSKKIRYQVRKLNADRRPRMK 2152
             GND+ ARLAQIDLFSE GGVREASV RYKEKRR RLFSKKIRYQVRK+NAD+RPRMK
Sbjct: 337  -GNDVSARLAQIDLFSENGGVREASVLRYKEKRRTRLFSKKIRYQVRKVNADQRPRMK 393


>ref|XP_004149511.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucumis
            sativus]
          Length = 430

 Score =  332 bits (852), Expect = 2e-88
 Identities = 211/436 (48%), Positives = 258/436 (59%), Gaps = 43/436 (9%)
 Frame = +2

Query: 974  SSCISGGGRTYGFDLDMVKXXXXXXXXXXXXXXXXXXXXXXXX---PIAISHRKPRTPRK 1144
            S CISGGGR Y FDL+++K                            +AIS RK RTPRK
Sbjct: 3    SPCISGGGRAYNFDLEILKSPSSSWTRTSQTSSPSSTLSESSNNTTQLAISTRKLRTPRK 62

Query: 1145 RPNQTYNEAAVLLSTAYPKIFPTKHLTKPCKSSKL---CNTFLDXXXXXXXXFRTIDNSG 1315
            RPNQTYNEA VLLSTAYP +F TKHLT P K +K     ++           FR ID+SG
Sbjct: 63   RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDDSSSLFCESAELLLPFRVIDSSG 122

Query: 1316 FLLHQPIL-EKPIVAIEPKVVNSCE-RACQSPPGSDLYGNSAEFCDGQQEDFDAESILDE 1489
            FLLHQP+L EKP   I  K+ N  E R C SP   D   NS E    + EDFDAESILDE
Sbjct: 123  FLLHQPLLEEKPNSQIHSKLTNLWENRPCSSPGEIDFQPNSMEI--EEIEDFDAESILDE 180

Query: 1490 EMEEGIDSIMGNLSVNNDNIVE-EATNNTCIG----------YPLGLGFSANFDFGFGMR 1636
            E+EEGIDSIMGNLSV  DN+ +  +T ++C+            P+GLGF+  F+ GFG R
Sbjct: 181  EIEEGIDSIMGNLSV--DNLEKGNSTQDSCVNANNHPRNWNWNPIGLGFNQKFESGFGFR 238

Query: 1637 RGVR--ALRNVDESNWWSFPTVDFVDITPKFKKISPS--------------ATQKKKKRV 1768
            +G+   A+R VD  NWW FPTVD ++I+PK     P+              +T+KKKK+V
Sbjct: 239  KGIERTAIRGVDNGNWWRFPTVDVIEISPKLNPKPPAPAPTPTPTPTPAAVSTKKKKKKV 298

Query: 1769 EK--IVELKSLDCPKDNKNSIVXXXXXXXXXXXXXLPELS-AGPRLKLNYGKVMEAWSDR 1939
            EK  ++E K    P   + S               +P+L   G  LKLNY  V +AWS R
Sbjct: 299  EKLTVIESKKAAIPLQKEKS------------EKPIPKLKPTGLLLKLNYEAVADAWSSR 346

Query: 1940 GS----PFPDEISGSESSGNDIYARLAQIDLFSEVGG-VREASVQRYKEKRRNRLFSKKI 2104
            GS    PF DEI  S+++G+D+ AR+A IDLF+E GG +REASV RYKEKRR RLFSKKI
Sbjct: 347  GSPFSDPFSDEIPSSDTAGSDVNARVANIDLFTEGGGLLREASVLRYKEKRRTRLFSKKI 406

Query: 2105 RYQVRKLNADRRPRMK 2152
            RYQVRK+NAD RPRMK
Sbjct: 407  RYQVRKVNADGRPRMK 422


Top