BLASTX nr result
ID: Cnidium21_contig00035941
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00035941
         (1493 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...     524   e-146
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...     475   e-131
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]             462   e-127
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...     462   e-127
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...     462   e-127

>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera]
 gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score = 524 bits (1349), Expect = e-146
 Identities = 269/449 (59%), Positives = 338/449 (75%)
 Frame = +2

Query: 23   LQLSVPHPWCSNKTHLQLGKSLRSLRLLPSCALSKQGQRFFTSLATSVTSDPLVTDRLIR 202
            LQ+S P PW LL CALSKQGQ F +S+A DP ++RLI
Sbjct: 5    LQVSRPQPWNHRSP------------LLIQCALSKQGQLFLSSVAR----DPSASNRLIC 48

Query: 203  KFIASSSKPVALNALSHLLTPNSSHPHLSSLAFPFYSLLTRAPWFSWNPKLVADVIAMLY 382
            KFIASSSK +ALNALSHLL+P ++HP+LSSLA P YS ++ A WFSWNPKL+ADVIA+LY
Sbjct: 49   KFIASSSKSIALNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIALLY 108

Query: 383  KQERFREADALISETVSKLDIRERELCTFYCCLIEFHSKHKSKQGVFDSYTSLKQILSCS 562
            KQ + +EA+ L+SET+ KL RER+L +FYC LI+ HSKH S QGVFD + L +I+S S
Sbjct: 109  KQGQLKEAETLVSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSES 168

Query: 563  SSVYLKRRAYESMICSLCVIDLPREAENMMEEMKVQGITSKPSLFELRSIVYAYGRVGLF 742
            SSVY+K RAY+SMI SLC + LP EAEN++EEM+V+G+ KPS+FE RS+VY YGRVGL
Sbjct: 169  SSVYVKERAYKSMISSLCAVGLPLEAENLIEEMRVKGL--KPSVFEFRSVVYGYGRVGLS 226

Query: 743  KDMKRLLFEMESEGIKMDTVCFNMVLSSLGAHAELLEMVSWLQRMKYLNVPLSIRTYNTV 922
            +DM+R+L +M +EG ++DTV NMVLSS GA+ + EMVSWLQRMK ++P SIRTYN+V
Sbjct: 227  EDMQRILLQMGNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSV 286

Query: 923  LNSCPSIMSRLQDPKSIPLSLNELQEGLCDNESMLVQELIVSSVLVEAMGWTSLELKLDL 1102
            LNSCP IMS LQD K+ P +++EL E L +E++LV+ELI S VL E M W E KLDL
Sbjct: 287  LNSCPMIMSILQDLKTFPPTIDELMETLKGDEALLVKELIGSMVLAELMEWDCSEGKLDL 346

Query: 1103 HGMHLSSAYLIILQWFDELRTRFQPVEADIPAEVRIICGSGKHSTVRGDSPVKHLVKVLM 1282
            HGMHL SAYLI+LQW +ELR R E +P E+ ++CGSGKHS+VRG+SPVK +V+ +M
Sbjct: 347  HGMHLGSAYLIMLQWREELRYRLNAAEYVMPVEITVVCGSGKHSSVRGESPVKRMVREMM 406

Query: 1283 VRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            RT SP++IDRKNIGCFVAK KV K+WLC
Sbjct: 407  TRTRSPMKIDRKNIGCFVAKAKVVKNWLC 435

>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223539507|gb|EEF41095.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score = 475 bits (1222), Expect = e-131
 Identities = 237/455 (52%), Positives = 321/455 (70%), Gaps = 7/455 (1%)
 Frame = +2

Query: 26   QLSVPHPWCSNKTHLQLGKSLRSLRLLPS------CALSKQGQRFFTSLATSVTS-DPLV 184
            QL + PW H + ++LP ALSKQGQRF +SLA + T D +
Sbjct: 8    QLRIKLPWSEQLRHRDHHRHRNQQQVLPMKWVFRCAALSKQGQRFLSSLAIATTKGDTVA 67

Query: 185  TDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSLAFPFYSLLTRAPWFSWNPKLVAD 364
            T+RLI+KF+A+S K +AL+ALSHLL P+SSH HLSSLAF Y + A WF WNPKLVAD
Sbjct: 68   TNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSSLAFTLYLKIAEARWFQWNPKLVAD 127

Query: 365  VIAMLYKQERFREADALISETVSKLDIRERELCTFYCCLIEFHSKHKSKQGVFDSYTSLK 544
            V+A L KQ R+ E+ L+S+++SKL ++ER+L FYC L+E SK S +G +S SL
Sbjct: 128  VVAFLDKQGRYDESATLVSDSISKLQVKERDLARFYCNLVESQSKQNSIRGFDNSVASLM 187

Query: 545  QILSCSSSVYLKRRAYESMICSLCVIDLPREAENMMEEMKVQGITSKPSLFELRSIVYAY 724
            Q++ S+SVY+KR+ Y+SM+ LC + PREAE ++EEM +G+ +PS+FE + +VYAY
Sbjct: 188  QLVCNSNSVYVKRQGYKSMVNGLCEMGRPREAETLIEEMGKEGV--RPSMFEFKCVVYAY 245

Query: 725  GRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGAHAELLEMVSWLQRMKYLNVPLSI 904
            G +G F++M + L +ME G ++DTVC NM+L+S GAH L EMV WLQ+MK L +P S+
Sbjct: 246  GSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAHNALPEMVLWLQKMKDLGIPFSL 305

Query: 905  RTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDNESMLVQELIVSSVLVEAMGWTSL 1084
            RT N+ LNSCP+IMS +Q+ P+S+++L + L ++E++LV+E++ SSVL EAM W
Sbjct: 306  RTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSEDEALLVKEIVTSSVLDEAMKWDVA 365

Query: 1085 ELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIPAEVRIICGSGKHSTVRGDSPVKH 1264
            E KLDLHG HL SAYLIIL W +E+R RF+ V P E+ ++CGSG HS VRG+SPVK
Sbjct: 366  EAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNPTEITVVCGSGNHSIVRGESPVKC 425

Query: 1265 LVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            +VK MVR SP+RIDR+NIGCF+AKGKV + WLC
Sbjct: 426  MVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWLC 460

>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score = 462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++ DP +R I+KF+A+S K VALN LSHLL+ +SHPHLS
Sbjct: 85   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A YS +T A WF WNPKL+A++IA+L KQERF E++ L+S VS+L ER+ F C
Sbjct: 145  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
            L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E
Sbjct: 205  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 264

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA
Sbjct: 265  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 322

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+S L+D S P+SL+EL+ L ++
Sbjct: 323  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 382

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP
Sbjct: 383  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK K WLC
Sbjct: 443  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500

>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana]
 gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana]
 gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score = 462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++ DP +R I+KF+A+S K VALN LSHLL+ +SHPHLS
Sbjct: 88   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A YS +T A WF WNPKL+A++IA+L KQERF E++ L+S VS+L ER+ F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
            L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E
Sbjct: 208  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA
Sbjct: 268  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 325

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+S L+D S P+SL+EL+ L ++
Sbjct: 326  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 385

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP
Sbjct: 386  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK K WLC
Sbjct: 446  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503

>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033
 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana]
 gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 505

 Score = 462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++ DP +R I+KF+A+S K VALN LSHLL+ +SHPHLS
Sbjct: 89   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A YS +T A WF WNPKL+A++IA+L KQERF E++ L+S VS+L ER+ F C
Sbjct: 149  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
            L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E
Sbjct: 209  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA
Sbjct: 269  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 326

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+S L+D S P+SL+EL+ L ++
Sbjct: 327  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 386

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP
Sbjct: 387  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 446

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK K WLC
Sbjct: 447  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504