BLASTX nr result
ID: Angelica22_contig00006320
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00006320
         (2937 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002281077.1| PREDICTED: cytochrome c biogenesis protein C...   753   0.0
ref|XP_002519098.1| conserved hypothetical protein [Ricinus comm...   743   0.0
ref|XP_003537900.1| PREDICTED: cytochrome c biogenesis protein C...   743   0.0
ref|XP_004162745.1| PREDICTED: cytochrome c biogenesis protein C...   712   0.0
ref|XP_002313066.1| predicted protein [Populus trichocarpa] gi|2...   707   0.0

>ref|XP_002281077.1| PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Vitis vinifera]
 gi|297741415|emb|CBI32546.3| unnamed protein product [Vitis vinifera]
          Length = 557

 Score = 753 bits (1945), Expect = 0.0
 Identities = 386/560 (68%), Positives = 454/560 (81%), Gaps = 6/560 (1%)
 Frame = -1

Query: 2904 METLALSLNTTHTSMYLSSRPETHLIKSPFFPYKTTKYGLCNTKNFNFFTLKVSCKLTKT 2725
            M TL S + T ++L S K FFPY T C++ +F ++CKL KT
Sbjct: 1    MYTLKPSFSKT---LFLKSPLLRSSFKPQFFPYTTQISSPCSSTPLSF---SITCKL-KT 53

Query: 2724 SKDSVKTKNLI--VGPADKAPLLSEKSSGNPESDSQTKPGKKKATGKM----LLKRYXXX 2563
            S+D +K+L + ++ AP +SE +GN E+ Q KP K G L+KR+
Sbjct: 54   SEDGKSSKSLAKKIVLSEGAPAVSEDGAGNGEA--QPKPASKGGGGGGGFGGLVKRFPRK 111

Query: 2562 XXXXXXXXXLAIGEMFTIAGLMAIGTFIDQGEAPGYYFQKYPEENPVFGFLTWRWILPLG 2383
            LAIGEMFT+A LMA+GT IDQGEAP YYFQK+PE+NPV GF TWRW+L LG
Sbjct: 112  VLSRLSNLPLAIGEMFTVAALMALGTAIDQGEAPDYYFQKFPEDNPVLGFFTWRWVLTLG 171

Query: 2382 FDHMFTSPIFLGILVLLGASLMACTSTTQIPIVKVARRWSFVHSPDSIKKQDFSDTLPSA 2203
            FDHMF+SPIFLG+L LL SLMACT TTQIP+VKVARRW+F+HS ++I+KQ+FS++LP A
Sbjct: 172  FDHMFSSPIFLGMLALLATSLMACTYTTQIPLVKVARRWNFLHSAEAIRKQEFSESLPKA 231

Query: 2202 SVQDLGVILMGAGYEVFLKGPSLYAFKGLAGRFAPIGVHFALLLIMAGGTLSAAGSFRGS 2023
            SV+DLGV+LMGAGYEVFLKGPSLYAFKGLAGRFAPIGVH A+LLIM GGTLSA GSFRGS
Sbjct: 232  SVRDLGVVLMGAGYEVFLKGPSLYAFKGLAGRFAPIGVHLAMLLIMVGGTLSATGSFRGS 291

Query: 2022 VTVPQGLNFVAGDVLEPSGFLSTPSNAFSTEVHVNKFYMEYYDSGEVSQFYTDLSLYDLE 1843
            VTVPQGLNFV GDVL PSGFLSTP+ AF TEVHVN+FYM+YYDSGEV QF+TDLSL+DL
Sbjct: 292  VTVPQGLNFVMGDVLSPSGFLSTPTKAFDTEVHVNRFYMDYYDSGEVLQFHTDLSLFDLN 351

Query: 1842 GKEVLRKTIKVNDPLRYGGITIYQTDWSISALQVLKDDEGPFNLAMAPLQMNGGDKKLYG 1663
            GKEV+RKTI VNDPLR+ GITIYQTDWS SALQ+ KDDEGPFNLAMAPL++N GDKKL+G
Sbjct: 352  GKEVMRKTISVNDPLRFDGITIYQTDWSFSALQIRKDDEGPFNLAMAPLKLN-GDKKLFG 410

Query: 1662 TILPIGSVDSPNVKGISMLARDLQSVVLYDQEGKFAGIRRPSSKLPIEIDGSKIVVLDAI 1483
            T LP+G DSPNVKGISMLARDLQS+VLYD+EGKFAG+RRP+S LPI+IDG++IV+ DAI
Sbjct: 411  TFLPVGDSDSPNVKGISMLARDLQSIVLYDKEGKFAGVRRPNSNLPIDIDGTRIVIEDAI 470

Query: 1482 GSSGLNLKTDPGVPVVYAGFGALMLTTCISFLSHSQIWALQDGTSVVVGGKTNRAKGEFP 1303
            GSSGL+LKTDPGVP+VYAGFGALMLTTCIS+LSH+QIWALQDGT+VV+GGKTNRAK EFP
Sbjct: 471  GSSGLDLKTDPGVPIVYAGFGALMLTTCISYLSHTQIWALQDGTTVVIGGKTNRAKLEFP 530

Query: 1302 DAINRLLDQVPEIVASSQSK 1243
            D +N+LLD+VPE+V SS SK
Sbjct: 531  DEMNQLLDRVPELVESSLSK 550

>ref|XP_002519098.1| conserved hypothetical protein [Ricinus communis]
 gi|223541761|gb|EEF43309.1| conserved hypothetical protein [Ricinus communis]
          Length = 556

 Score = 743 bits (1919), Expect = 0.0
 Identities = 379/537 (70%), Positives = 433/537 (80%), Gaps = 7/537 (1%)
 Frame = -1

Query: 2841 ETHLIKSPFFPYKTTKYG-----LCNTKNFNFFTLKVSCKLTKTSKDSVKTKNLI--VGP 2683
            +TH I F T K LCN + + L VSCKL + + K KN+ +
Sbjct: 13   KTHFINFHPFINSTIKLNPQIHILCNRRALS---LSVSCKLKTSKEVENKDKNVSRKILL 69

Query: 2682 ADKAPLLSEKSSGNPESDSQTKPGKKKATGKMLLKRYXXXXXXXXXXXXLAIGEMFTIAG 2503
            ++ AP +SE+ + K K KR LAIGEMF IAG
Sbjct: 70   SNSAPPVSEEGGAGNNGEIPDKAAKGGGGPLRFFKRLPRKVLSVLSNLPLAIGEMFAIAG 129

Query: 2502 LMAIGTFIDQGEAPGYYFQKYPEENPVFGFLTWRWILPLGFDHMFTSPIFLGILVLLGAS 2323
            LMA+GT IDQG+AP YFQ YPEENPV GF TWRWIL LGFDHMF+SP+FLG+L LLG S
Sbjct: 130  LMALGTVIDQGQAPEIYFQNYPEENPVLGFFTWRWILTLGFDHMFSSPVFLGMLALLGLS 189

Query: 2322 LMACTSTTQIPIVKVARRWSFVHSPDSIKKQDFSDTLPSASVQDLGVILMGAGYEVFLKG 2143
            LMACT TTQIP+VKVARRW+F+HS ++I+KQ+F+DTLP AS+QD+GVILMGAGYEVFLKG
Sbjct: 190  LMACTYTTQIPLVKVARRWNFLHSAEAIRKQEFADTLPQASIQDVGVILMGAGYEVFLKG 249

Query: 2142 PSLYAFKGLAGRFAPIGVHFALLLIMAGGTLSAAGSFRGSVTVPQGLNFVAGDVLEPSGF 1963
            PSLYAFKGLAGRFAPIGVH A+LLIMAG TL+A GSFRGSVTVPQGLNFV GDVL PSGF
Sbjct: 250  PSLYAFKGLAGRFAPIGVHLAMLLIMAGATLTATGSFRGSVTVPQGLNFVVGDVLGPSGF 309

Query: 1962 LSTPSNAFSTEVHVNKFYMEYYDSGEVSQFYTDLSLYDLEGKEVLRKTIKVNDPLRYGGI 1783
            LSTP+ AF+TEVHVNKFYM+YYDSGEVSQFY+DLSLYD++GKEVLRKTI VN+PLRYGG
Sbjct: 310  LSTPTEAFNTEVHVNKFYMDYYDSGEVSQFYSDLSLYDIDGKEVLRKTISVNNPLRYGGF 369

Query: 1782 TIYQTDWSISALQVLKDDEGPFNLAMAPLQMNGGDKKLYGTILPIGSVDSPNVKGISMLA 1603
            TIYQTDWS SALQ+ K+DEGPFNLAMAPL++N GDKKL+GT LP+G V+SPNVKGISMLA
Sbjct: 370  TIYQTDWSFSALQIRKNDEGPFNLAMAPLKIN-GDKKLFGTFLPVGDVNSPNVKGISMLA 428

Query: 1602 RDLQSVVLYDQEGKFAGIRRPSSKLPIEIDGSKIVVLDAIGSSGLNLKTDPGVPVVYAGF 1423
            RDLQS+VLYDQEGKF G+RRP+SKLPI+IDG++IV+ DAIGS+GL LKTDPGVPVVYAGF
Sbjct: 429  RDLQSIVLYDQEGKFVGVRRPNSKLPIDIDGTRIVIEDAIGSTGLELKTDPGVPVVYAGF 488

Query: 1422 GALMLTTCISFLSHSQIWALQDGTSVVVGGKTNRAKGEFPDAINRLLDQVPEIVASS 1252
            GALMLTTCIS+LSHSQIWALQDGTS+VVGGKTNRAKG F D +NRLLDQVPEIV SS
Sbjct: 489  GALMLTTCISYLSHSQIWALQDGTSMVVGGKTNRAKGVFEDEVNRLLDQVPEIVESS 545

>ref|XP_003537900.1| PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic-like [Glycine max]
          Length = 563

 Score = 743 bits (1917), Expect = 0.0
 Identities = 375/557 (67%), Positives = 448/557 (80%), Gaps = 7/557 (1%)
 Frame = -1

Query: 2892 ALSLNTTHTSMYLSSRPETHLIK----SPFFPYKTTKYGLCNTKNFNFFTLKVSCKLTKT 2725
            ALSL+ +TS P+TH +K PFF +K K + L +SCKL +
Sbjct: 3    ALSLSAINTSN--PCLPQTHFLKVPIFHPFFRFKPFSNSSHGAKRAHTLPLTISCKLKNS 60

Query: 2724 SKDSVKTKNL---IVGPADKAPLLSEKSSGNPESDSQTKPGKKKATGKMLLKRYXXXXXX 2554
            + K K++ IV P L+E N +S + KKK +
Sbjct: 61   QEMKNKGKSVSQKIVLSEASPPPLTEDDKNNGDSKDVPESSKKKGGLSGVANMLRRKTLQ 120

Query: 2553 XXXXXXLAIGEMFTIAGLMAIGTFIDQGEAPGYYFQKYPEENPVFGFLTWRWILPLGFDH 2374
            LAIGEMF +A LMA+GTFIDQGEAP +YFQKYPE++PVFGF TWRW+L LGFDH
Sbjct: 121  ILSNLPLAIGEMFAVASLMALGTFIDQGEAPDFYFQKYPEDHPVFGFFTWRWVLTLGFDH 180

Query: 2373 MFTSPIFLGILVLLGASLMACTSTTQIPIVKVARRWSFVHSPDSIKKQDFSDTLPSASVQ 2194
            MFTSPIFLG+L LLGASLMACT TTQ+P++KV+RRWSF+HS ++I+KQ+FS++LP AS+Q
Sbjct: 181  MFTSPIFLGVLALLGASLMACTYTTQLPLIKVSRRWSFLHSAEAIRKQEFSESLPRASIQ 240

Query: 2193 DLGVILMGAGYEVFLKGPSLYAFKGLAGRFAPIGVHFALLLIMAGGTLSAAGSFRGSVTV 2014
            D+G ILMGAGYEVFLKGPSLYAF+GLAGR AP+GVH ALLLIMAGGTLSA GSFRGSVTV
Sbjct: 241  DVGTILMGAGYEVFLKGPSLYAFQGLAGRLAPVGVHIALLLIMAGGTLSATGSFRGSVTV 300

Query: 2013 PQGLNFVAGDVLEPSGFLSTPSNAFSTEVHVNKFYMEYYDSGEVSQFYTDLSLYDLEGKE 1834
            PQGLNFV GDVL P GFLS+P++AF+TE+HVNKF M+YY+SGEVSQF+TDLSL +++GKE
Sbjct: 301  PQGLNFVVGDVLAPFGFLSSPTDAFNTEIHVNKFSMDYYESGEVSQFHTDLSLRNMDGKE 360

Query: 1833 VLRKTIKVNDPLRYGGITIYQTDWSISALQVLKDDEGPFNLAMAPLQMNGGDKKLYGTIL 1654
            V+RKTI VNDPLRYGGITIYQTDWSISALQ+LKD+EGP+NLAMAPLQ+N GDKKL+GT L
Sbjct: 361  VMRKTISVNDPLRYGGITIYQTDWSISALQILKDNEGPYNLAMAPLQIN-GDKKLFGTFL 419

Query: 1653 PIGSVDSPNVKGISMLARDLQSVVLYDQEGKFAGIRRPSSKLPIEIDGSKIVVLDAIGSS 1474
            P+G ++SP+VKGISMLARDLQS+VLYD+EGKFAG+RRP+SKLPI IDGS+IV++DAIGSS
Sbjct: 420  PVGDINSPDVKGISMLARDLQSIVLYDKEGKFAGVRRPNSKLPINIDGSEIVIVDAIGSS 479

Query: 1473 GLNLKTDPGVPVVYAGFGALMLTTCISFLSHSQIWALQDGTSVVVGGKTNRAKGEFPDAI 1294
            GL LKTDPGVPVVYAGFGALM+TTCIS+LSHSQIWALQDGT+V VGGKTNRAK EFP+ +
Sbjct: 480  GLELKTDPGVPVVYAGFGALMITTCISYLSHSQIWALQDGTTVFVGGKTNRAKMEFPEEM 539

Query: 1293 NRLLDQVPEIVASSQSK 1243
            +RLLD+VPEIV S+ K
Sbjct: 540  SRLLDKVPEIVESNLPK 556

>ref|XP_004162745.1| PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic-like [Cucumis sativus]
          Length = 560

 Score = 712 bits (1838), Expect = 0.0
 Identities = 364/559 (65%), Positives = 438/559 (78%), Gaps = 3/559 (0%)
 Frame = -1

Query: 2910 SSMETLALS-LNTTHTSMYLSSRPETHLIKSPFFPYKTTKYGLCNTKNFNFFTLKVSCKL 2734
            S+ME L L+ LN + + +L H + S +T + + V+CK
Sbjct: 9    STMERLILNNLNPSLSKPFLLKTSFLHSVFSHQITLRTAQIR----------SFSVTCK- 57

Query: 2733 TKTSKDSV--KTKNLIVGPADKAPLLSEKSSGNPESDSQTKPGKKKATGKMLLKRYXXXX 2560
            K S+D N IV PL E+S + ++++ KPG + K L+KR
Sbjct: 58   NKASQDKKLKNASNKIVLSEAAPPLAEEESDKSGNAEAEVKPGNGSRSMK-LVKRLPKRI 116

Query: 2559 XXXXXXXXLAIGEMFTIAGLMAIGTFIDQGEAPGYYFQKYPEENPVFGFLTWRWILPLGF 2380
            LAIGEMFTIA LMA+GT IDQGEAP +YFQKYPE+NP++GF WRWIL LGF
Sbjct: 117  LGALSNLPLAIGEMFTIAALMALGTVIDQGEAPDFYFQKYPEDNPLWGFFNWRWILTLGF 176

Query: 2379 DHMFTSPIFLGILVLLGASLMACTSTTQIPIVKVARRWSFVHSPDSIKKQDFSDTLPSAS 2200
            DHM++S IFLG+L LLG SLMACT TTQIP+VKVARRW+F+ S ++I+K + SD LP AS
Sbjct: 177  DHMYSSTIFLGMLALLGISLMACTYTTQIPLVKVARRWNFLQSGETIRKLECSDILPRAS 236

Query: 2199 VQDLGVILMGAGYEVFLKGPSLYAFKGLAGRFAPIGVHFALLLIMAGGTLSAAGSFRGSV 2020
            VQDLGV+LMGAGYEVF+KGP+LYAFKGLAGRFAPIGVH A+LLIMAG TLSA GSFRGSV
Sbjct: 237  VQDLGVVLMGAGYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMAGATLSATGSFRGSV 296

Query: 2019 TVPQGLNFVAGDVLEPSGFLSTPSNAFSTEVHVNKFYMEYYDSGEVSQFYTDLSLYDLEG 1840
            TVPQGLNFV GDVL PSGFL+ P+ AF+TEVHVNKFYM YYDSGE+ QFY+DLSL+DL G
Sbjct: 297  TVPQGLNFVVGDVLNPSGFLAKPTEAFNTEVHVNKFYMNYYDSGEIKQFYSDLSLFDLNG 356

Query: 1839 KEVLRKTIKVNDPLRYGGITIYQTDWSISALQVLKDDEGPFNLAMAPLQMNGGDKKLYGT 1660
            KEV+RKTI VN+PLRYGG TIYQTDW SALQ+LK+DEGPFNLA+APL++N GDKKLYGT
Sbjct: 357  KEVMRKTISVNNPLRYGGFTIYQTDWGFSALQILKNDEGPFNLAVAPLKIN-GDKKLYGT 415

Query: 1659 ILPIGSVDSPNVKGISMLARDLQSVVLYDQEGKFAGIRRPSSKLPIEIDGSKIVVLDAIG 1480
            LP+G V+SP+VKGISMLARDLQS+VLYDQEGKF G+RRPSS+LPI+I+G KI ++DAIG
Sbjct: 416  FLPVGDVNSPDVKGISMLARDLQSIVLYDQEGKFVGVRRPSSRLPIDINGIKIEIVDAIG 475

Query: 1479 SSGLNLKTDPGVPVVYAGFGALMLTTCISFLSHSQIWALQDGTSVVVGGKTNRAKGEFPD 1300
            S+GL LKTDPGVP+VYAGFGALMLTTC+S+LSHSQ+WA+QDGT V+VGGKTNRAK EFP+
Sbjct: 476  STGLELKTDPGVPIVYAGFGALMLTTCVSYLSHSQVWAIQDGTVVIVGGKTNRAKVEFPE 535

Query: 1299 AINRLLDQVPEIVASSQSK 1243
            ++RLLD+VPEI+ S +K
Sbjct: 536  EMDRLLDKVPEIIEPSYNK 554

>ref|XP_002313066.1| predicted protein [Populus trichocarpa]
 gi|222849474|gb|EEE87021.1| predicted protein [Populus trichocarpa]
          Length = 432

 Score = 707 bits (1824), Expect = 0.0
 Identities = 341/426 (80%), Positives = 387/426 (90%)
 Frame = -1

Query: 2520 MFTIAGLMAIGTFIDQGEAPGYYFQKYPEENPVFGFLTWRWILPLGFDHMFTSPIFLGIL 2341
            MF+IA LMA+GT IDQGEAP +YFQK+PEENP+ GF TW+W+L LGFDHM++SP+FLG+L
Sbjct: 1    MFSIAVLMALGTLIDQGEAPEFYFQKFPEENPLLGFFTWKWVLTLGFDHMYSSPVFLGML 60

Query: 2340 VLLGASLMACTSTTQIPIVKVARRWSFVHSPDSIKKQDFSDTLPSASVQDLGVILMGAGY 2161
            LLG SLMACT TTQIP+ KVARRW+++HS D+I+KQ+FSD LP ASVQDLGVILMG+GY
Sbjct: 61   ALLGVSLMACTYTTQIPLAKVARRWNYLHSADAIRKQEFSDNLPRASVQDLGVILMGSGY 120

Query: 2160 EVFLKGPSLYAFKGLAGRFAPIGVHFALLLIMAGGTLSAAGSFRGSVTVPQGLNFVAGDV 1981
            EVFLKGPSLYAFKGLAGRF+PIGVH A+LLIMAG TLSA GSFRGSVTVPQGLNFV GDV
Sbjct: 121  EVFLKGPSLYAFKGLAGRFSPIGVHLAMLLIMAGATLSATGSFRGSVTVPQGLNFVVGDV 180

Query: 1980 LEPSGFLSTPSNAFSTEVHVNKFYMEYYDSGEVSQFYTDLSLYDLEGKEVLRKTIKVNDP 1801
            L PSGFLSTP+ AF+TEVHVN+FYM+YYD G+V QF+TDLSL+DL GKEV+RKTI VNDP
Sbjct: 181  LGPSGFLSTPTEAFNTEVHVNRFYMDYYDGGDVKQFHTDLSLFDLNGKEVMRKTISVNDP 240

Query: 1800 LRYGGITIYQTDWSISALQVLKDDEGPFNLAMAPLQMNGGDKKLYGTILPIGSVDSPNVK 1621
            LRYGGIT+YQTDWSISALQV KDDEGPFNLAMAPL+++ GD KLYGT LP+G V+SPNVK
Sbjct: 241  LRYGGITMYQTDWSISALQVRKDDEGPFNLAMAPLKIS-GDNKLYGTFLPVGDVNSPNVK 299

Query: 1620 GISMLARDLQSVVLYDQEGKFAGIRRPSSKLPIEIDGSKIVVLDAIGSSGLNLKTDPGVP 1441
            GISMLARDLQS+VLYDQEGKF G+RRP+SKLPI+IDG KI++ DAIGSSGL LKTDPGVP
Sbjct: 300  GISMLARDLQSIVLYDQEGKFVGVRRPNSKLPIDIDGMKIIIEDAIGSSGLELKTDPGVP 359

Query: 1440 VVYAGFGALMLTTCISFLSHSQIWALQDGTSVVVGGKTNRAKGEFPDAINRLLDQVPEIV 1261
            VVYAGFGALMLTTC+S+LSHSQIWALQDGT+V+VGGKTNRAK EF IN LLD+VPEIV
Sbjct: 360  VVYAGFGALMLTTCLSYLSHSQIWALQDGTAVIVGGKTNRAKAEFQYEINFLLDKVPEIV 419

Query: 1260 ASSQSK 1243
            SS SK
Sbjct: 420  ESSLSK 425