BLASTX nr result
ID: Cephaelis21_contig00030586
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00030586
         (780 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AAS79604.1| putative pentatricopeptide repeat-containing prot...   287   2e-75
ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containi...   272   5e-71
ref|XP_003603974.1| Pentatricopeptide repeat-containing protein ...   241   1e-61
ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containi...   240   3e-61
ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containi...   183   4e-44

>gb|AAS79604.1| putative pentatricopeptide repeat-containing protein [Ipomoea trifida]
 gi|118562903|dbj|BAF37793.1| hypothetical protein [Ipomoea trifida]
          Length = 575

 Score = 287 bits (735), Expect = 2e-75
 Identities = 146/251 (58%), Positives = 183/251 (72%), Gaps = 2/251 (0%)
 Frame = +1

Query: 34  MEITTMGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSYAR 213
           MEIT+M QAMQLHA+ LKSG ++ + G Q+ KLFTFSALSPSGDL+YAR
Sbjct: 1   MEITSMTQAMQLHARILKSGAYDSNHG-----------QDFHKLFTFSALSPSGDLNYAR 49

Query: 214 LILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKIS--RPDKFTFPAVLK 387
           IL +L TPNS+YYNTMIRAYSD P +A +LF+ M +P ++ RPD FT+P VLK
Sbjct: 50  HILRTLHTPNSFYYNTMIRAYSDSTDPTRAFTLFLYMQNPDDASVAVPRPDHFTYPFVLK 109

Query: 388 ACSKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVVS 567
           ACSK + GKQ+HGL K GSDRYI+NALIH+YS G P+ A KVF++M +RDVVS
Sbjct: 110 ACSKSGHARFGKQIHGLVFKSGVGSDRYINNALIHLYSVSGEPNLAYKVFDKMPDRDVVS 169

Query: 568 WTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIGRKVHGLIE 747
           WTS+IDG VDN++PI+AI LF M+EN IE NE T+ SVLRACAD GAL+ G ++H ++
Sbjct: 170 WTSIIDGFVDNDRPIEAIRLFTHMIENGIEPNEVTVASVLRACADTGALNTGERIHSFVK 229

Query: 748 EKKFNLNSKVS 780
           EK F+ N+ VS
Sbjct: 230 EKNFSSNANVS 240

 Score = 101 bits (252), Expect = 2e-19
 Identities = 60/180 (33%), Positives = 99/180 (55%)
 Frame = +1

Query: 184 SPSGDLSYARLILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDK 363
           S SG+ + A + + + + + ++I + D PI+AI LF M + P++
Sbjct: 147 SVSGEPNLAYKVFDKMPDRDVVSWTSIIDGFVDNDRPIEAIRLFTHMIENG----IEPNE 202

Query: 364 FTFPAVLKACSKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEE 543
           T +VL+AC+ G+++H + NF S+ +S ALI MY+ G AL+VF+E
Sbjct: 203 VTVASVLRACADTGALNTGERIHSFVKEKNFSSNANVSTALIDMYAKCGCIDGALEVFDE 262

Query: 544 MLERDVVSWTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIG 723
           LE+DV WT++I GL + +KAI FE M ++D++ +E I +VL A +AG + G
Sbjct: 263 TLEKDVYVWTAIIAGLASHGLCMKAIEFFENMKKSDVKMDERAITAVLSAYRNAGLVSEG 322

>ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065 [Vitis vinifera]
 gi|297744424|emb|CBI37686.3| unnamed protein product [Vitis vinifera]
          Length = 571

 Score = 272 bits (696), Expect = 5e-71
 Identities = 141/248 (56%), Positives = 182/248 (73%)
 Frame = +1

Query: 34  MEITTMGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSYAR 213
           MEIT++ QAMQLHAQ LKS + +NL+ LFTF+ALSP+GDL+YA
Sbjct: 1   MEITSLSQAMQLHAQILKSP------------DPKKQTRNLTPLFTFAALSPAGDLTYAH 48

Query: 214 LILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDKFTFPAVLKAC 393
           LILNSL T NS+++NTMIRAYS P P QA+ LF++M P RPDKFT+P +LK+C
Sbjct: 49  LILNSLSTQNSFFHNTMIRAYSQTPDPTQALHLFLSMLC--QPTSPRPDKFTYPFLLKSC 106

Query: 394 SKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVVSWT 573
           ++L+Q ++GKQLHGL K SDRY+SN LIHMYS+ G A KVF +M +RDVVSWT
Sbjct: 107 ARLKQPRVGKQLHGLIYKSGLESDRYVSNGLIHMYSSCGKSGRAYKVFGKMRDRDVVSWT 166

Query: 574 SMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIGRKVHGLIEEK 753
           SMIDG VD+++ ++AI LFE MVE+ +E NEAT++SVLRACADAGA+ +GR+V G+IEE+
Sbjct: 167 SMIDGFVDDDRALEAIRLFEEMVEDGVEPNEATVVSVLRACADAGAVGMGRRVQGVIEER 226

Query: 754 KFNLNSKV 777
           K L + V
Sbjct: 227 KIGLEANV 234

 Score = 83.2 bits (204), Expect = 6e-14
 Identities = 51/175 (29%), Positives = 92/175 (52%)
 Frame = +1

Query: 184 SPSGDLSYARLILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDK 363
           S G A + ++ + + +MI + D ++AI LF M + P++
Sbjct: 142 SSCGKSGRAYKVFGKMRDRDVVSWTSMIDGFVDDDRALEAIRLFEEMVEDG----VEPNE 197

Query: 364 FTFPAVLKACSKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEE 543
           T +VL+AC+ +G+++ G+ + G + + ALI MY+ G SA KVF+
Sbjct: 198 ATVVSVLRACADAGAVGMGRRVQGVIEERKIGLEANVRTALIDMYAKCGSIGSARKVFDG 257

Query: 544 MLERDVVSWTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAG 708
           ++ +DV +WT+MI GL ++ +A+ LF++M + +E T+ +VL AC +AG
Sbjct: 258 IVNKDVFAWTAMISGLANHGLCEEAVTLFDQMESFGLRPDERTMTAVLSACRNAG 312

>ref|XP_003603974.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355493022|gb|AES74225.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
          Length = 566

 Score = 241 bits (615), Expect = 1e-61
 Identities = 129/246 (52%), Positives = 167/246 (67%), Gaps = 3/246 (1%)
 Frame = +1

Query: 49  MGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSYARLILNS 228
           M QA+QLHAQ +KS +Q+N SKLFTF+A SPSGDL+YARL+LN+
Sbjct: 1   MSQALQLHAQFIKS----------------QNQRNFSKLFTFAAQSPSGDLNYARLLLNT 44

Query: 229 LQTPNSYYYNTMIRAYSDLPHP---IQAISLFMAMHDPQHPKISRPDKFTFPAVLKACSK 399
           + NSYYYNT+IRAYS +P QA+SLF+ M P H + +PD FT+ LK+C +
Sbjct: 45  NPSLNSYYYNTIIRAYSHTSNPTHHFQALSLFIFMLQP-HTNVPKPDTFTYSFALKSCGR 103

Query: 400 LRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVVSWTSM 579
           L+ TQ KQLHG K FG D YI NALIHMYS G A +VF+ M RDVVSWTSM
Sbjct: 104 LKLTQQAKQLHGFINKMGFGFDLYIQNALIHMYSEIGELVIARQVFDRMSHRDVVSWTSM 163

Query: 580 IDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIGRKVHGLIEEKKF 759
           I G V+++ ++AI LF+RM+E ++ NEAT+ISVLR CAD+GAL +GRKVHG+++EK
Sbjct: 164 IAGFVNHHLTVEAIQLFQRMLEVGVDVNEATVISVLRGCADSGALSVGRKVHGIVKEKGI 223

Query: 760 NLNSKV 777
           + + V
Sbjct: 224 DFKANV 229

 Score = 81.3 bits (199), Expect = 2e-13
 Identities = 67/232 (28%), Positives = 112/232 (48%)
 Frame = +1

Query: 28  GKMEITTMGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSY 207
           G++++T QA QLH K G D + L H S+ G+L
Sbjct: 102 GRLKLTQ--QAKQLHGFINKMG-FGFDLYIQNALIHMYSE--------------IGELVI 144

Query: 208 ARLILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDKFTFPAVLK 387
           AR + + + + + +MI + + ++AI LF M + ++ T +VL+
Sbjct: 145 ARQVFDRMSHRDVVSWTSMIAGFVNHHLTVEAIQLFQRMLEVGVDV----NEATVISVLR 200

Query: 388 ACSKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVVS 567
           C+ +G+++HG+ + + ALIHMYS G SA +VF+++L+RDV
Sbjct: 201 GCADSGALSVGRKVHGIVKEKGIDFKANVCTALIHMYSKCGCLESAREVFDDVLDRDVFV 260

Query: 568 WTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIG 723
           WT+MI GL + +AI LF M +++ +E TI+ VL A +AG + G
Sbjct: 261 WTAMIYGLACHGMCKEAIELFLEMETCNVKPDERTIMVVLSAYRNAGLVREG 312

>ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max]
          Length = 572

 Score = 240 bits (612), Expect = 3e-61
 Identities = 127/250 (50%), Positives = 169/250 (67%), Gaps = 3/250 (1%)
 Frame = +1

Query: 34  MEITTMGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSYAR 213
           ME+ +M +A+Q+H Q +K G + H+D+ + LSK+FTF+ALSP GDL+YAR
Sbjct: 1   MEVRSMWEALQVHGQVVKLG-----------MGHKDASRKLSKVFTFAALSPFGDLNYAR 49

Query: 214 LILNSLQTPNSYYYNTMIRAYSDLP---HPIQAISLFMAMHDPQHPKISRPDKFTFPAVL 384
           L+L++ T NSYYYNT++RA+S P P A+SLF++M P PD FTFP +L
Sbjct: 50  LLLSTNPTLNSYYYNTLLRAFSQTPLPTPPFHALSLFLSMPSP-------PDNFTFPFLL 102

Query: 385 KACSKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVV 564
           K CS+ + LGKQLH L K F D YI N L+HMYS G A +F+ M RDVV
Sbjct: 103 KCCSRSKLPPLGKQLHALLTKLGFAPDLYIQNVLLHMYSEFGDLLLARSLFDRMPHRDVV 162

Query: 565 SWTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIGRKVHGLI 744
           SWTSMI GLV+++ P++AI LFERM++ +E NEAT+ISVLRACAD+GAL +GRKVH +
Sbjct: 163 SWTSMIGGLVNHDLPVEAINLFERMLQCGVEVNEATVISVLRACADSGALSMGRKVHANL 222

Query: 745 EEKKFNLNSK 774
           EE ++SK
Sbjct: 223 EEWGIEIHSK 232

 Score = 83.2 bits (204), Expect = 6e-14
 Identities = 56/182 (30%), Positives = 96/182 (52%), Gaps = 2/182 (1%)
 Frame = +1

Query: 184 SPSGDLSYARLILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDK 363
           S GDL AR + + + + + +MI + P++AI+LF M ++
Sbjct: 141 SEFGDLLLARSLFDRMPHRDVVSWTSMIGGLVNHDLPVEAINLFERMLQCG----VEVNE 196

Query: 364 FTFPAVLKACSKLRQTQLGKQLHGLACKFNFG--SDRYISNALIHMYSAGGVPSSALKVF 537
           T +VL+AC+ +G+++H ++ S +S AL+ MY+ GG +SA KVF
Sbjct: 197 ATVISVLRACADSGALSMGRKVHANLEEWGIEIHSKSNVSTALVDMYAKGGCIASARKVF 256

Query: 538 EEMLERDVVSWTSMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALD 717
           ++++ RDV WT+MI GL + AI +F M + ++ +E T+ +VL AC +AG +
Sbjct: 257 DDVVHRDVFVWTAMISGLASHGLCKDAIDMFVDMESSGVKPDERTVTAVLTACRNAGLIR 316

Query: 718 IG 723
           G
Sbjct: 317 EG 318

>ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max]
          Length = 542

 Score = 183 bits (464), Expect = 4e-44
 Identities = 109/247 (44%), Positives = 144/247 (58%)
 Frame = +1

Query: 34  MEITTMGQAMQLHAQALKSGGHNHDSGTEQMLQHRDSQQNLSKLFTFSALSPSGDLSYAR 213
           ME+ +M +A+QLH Q +D+ +NLSK+F+F+ALSP GDL+YAR
Sbjct: 1   MEVRSMWEALQLHGQ-------------------KDASRNLSKVFSFAALSPFGDLNYAR 41

Query: 214 LILNSLQTPNSYYYNTMIRAYSDLPHPIQAISLFMAMHDPQHPKISRPDKFTFPAVLKAC 393
           L+L++ N S P P P P P FTFP +LK C
Sbjct: 42  LLLST---------NPSTTTLSFAPSP-----------KPPTP----PYNFTFPFLLKCC 77

Query: 394 SKLRQTQLGKQLHGLACKFNFGSDRYISNALIHMYSAGGVPSSALKVFEEMLERDVVSWT 573
           + + LGKQLH L K F D YI N L+HMYS G A +F+ M RDVVSWT
Sbjct: 78  APSKLPPLGKQLHALLTKLGFAPDLYIQNVLVHMYSEFGDLVLARSLFDRMPHRDVVSWT 137

Query: 574 SMIDGLVDNNKPIKAIALFERMVENDIEFNEATIISVLRACADAGALDIGRKVHGLIEEK 753
           SMI GLV+++ P++AI+LFERM++ +E NEAT+ISVLRA AD+GAL +GRKVH +EE
Sbjct: 138 SMISGLVNHDLPVEAISLFERMLQCGVEVNEATVISVLRARADSGALSMGRKVHANLEEW 197

Query: 754 KFNLNSK 774
           ++SK
Sbjct: 198 GIEIHSK 204