BLASTX nr result
ID: Cephaelis21_contig00045784
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00045784 (669 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi... 336 3e-90 ref|XP_002299667.1| predicted protein [Populus trichocarpa] gi|2... 325 6e-87 ref|XP_002529286.1| pentatricopeptide repeat-containing protein,... 324 8e-87 ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi... 315 4e-84 ref|XP_003531588.1| PREDICTED: pentatricopeptide repeat-containi... 314 1e-83 >ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic [Vitis vinifera] gi|298204537|emb|CBI23812.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 336 bits (861), Expect = 3e-90 Identities = 166/222 (74%), Positives = 193/222 (86%) Frame = +2 Query: 2 RQGLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQY 181 + GL+PD VTYSTLL GC KVK GYS+ALELVQE++ + L MDSVIYGTLL+VCASN++ Sbjct: 198 QDGLRPDAVTYSTLLAGCMKVKHGYSKALELVQEMERSRLPMDSVIYGTLLAVCASNNRC 257 Query: 182 KEAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTL 361 KEAE YF +MK EGH PNVFHYSSLLNAYS DG+Y+KAD L+ +MKSAGLV NKVILTTL Sbjct: 258 KEAENYFNQMKDEGHLPNVFHYSSLLNAYSADGDYKKADMLVQDMKSAGLVPNKVILTTL 317 Query: 362 LKVYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNV 541 LKVYVRG LFEKSRELL ELE LGYAEDEMPYC+LMDGL K+ +I EAKS+F++MK K V Sbjct: 318 LKVYVRGGLFEKSRELLAELEDLGYAEDEMPYCLLMDGLAKSRRILEAKSIFEEMKKKQV 377 Query: 542 KTNGYCYSIMISAFCRVGLLQDAKQLASSFEAEHDKYDVVIL 667 K++GYCYSIMISAFCR GLL++AKQLA FEA +DKYD+V+L Sbjct: 378 
KSDGYCYSIMISAFCRSGLLKEAKQLARDFEATYDKYDLVML 419 Score = 78.2 bits (191), Expect = 1e-12 Identities = 46/186 (24%), Positives = 93/186 (50%), Gaps = 1/186 (0%) Frame = +2 Query: 80 RALELVQELKFNELRMDSVIYGTLLSVCASNDQYKEAEQYFEEMKSEGHSPNVFHYSSLL 259 +ALE+ ++ +R + + ++LS N +++ + + F +MK +G P+ YS+LL Sbjct: 153 KALEIYNSIQDESVRNNVSVCNSVLSCLIRNGKFENSLKLFHQMKQDGLRPDAVTYSTLL 212 Query: 260 -NAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLLKVYVRGSLFEKSRELLNELEALGY 436 V Y KA EL+ EM+ + L ++ VI TLL V + +++ N+++ G+ Sbjct: 213 AGCMKVKHGYSKALELVQEMERSRLPMDSVIYGTLLAVCASNNRCKEAENYFNQMKDEGH 272 Query: 437 AEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVKTNGYCYSIMISAFCRVGLLQDAKQ 616 + Y L++ G +A + DMK+ + N + ++ + R GL + +++ Sbjct: 273 LPNVFHYSSLLNAYSADGDYKKADMLVQDMKSAGLVPNKVILTTLLKVYVRGGLFEKSRE 332 Query: 617 LASSFE 634 L + E Sbjct: 333 LLAELE 338 >ref|XP_002299667.1| predicted protein [Populus trichocarpa] gi|222846925|gb|EEE84472.1| predicted protein [Populus trichocarpa] Length = 562 Score = 325 bits (832), Expect = 6e-87 Identities = 154/220 (70%), Positives = 191/220 (86%) Frame = +2 Query: 8 GLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYKE 187 GL PD +TYSTLL GC K+K GYS+AL+LVQEL +N L+MDS++YGTLL+VCASN++ +E Sbjct: 103 GLTPDAITYSTLLAGCMKIKDGYSKALDLVQELNYNGLQMDSIMYGTLLAVCASNNRCEE 162 Query: 188 AEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLLK 367 A+ YF +MK EGHSPN+FHYSSLLNAYS DGNY+KA+EL+ +MKS+GLV NKVILTTLLK Sbjct: 163 AQSYFNQMKDEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSSGLVPNKVILTTLLK 222 Query: 368 VYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVKT 547 VYVRG LFEKSR+LL EL+ LG+A++EMPYC+LMDGL K G + EA+SVF++MK K VK+ Sbjct: 223 VYVRGGLFEKSRDLLVELDTLGFAKNEMPYCLLMDGLAKNGLLDEARSVFNEMKEKRVKS 282 Query: 548 NGYCYSIMISAFCRVGLLQDAKQLASSFEAEHDKYDVVIL 667 GY YSIMIS+FCR GL ++AK+LA FEA++DKYDVVIL Sbjct: 283 GGYSYSIMISSFCRGGLFEEAKELAEEFEAKYDKYDVVIL 322 Score = 56.6 bits (135), Expect = 4e-06 Identities = 38/167 (22%), Positives = 80/167 (47%), Gaps = 1/167 (0%) Frame = +2 Query: 140 
YGTLLSVCASNDQYKEAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMK 319 Y + + ++ +A + + + E NVF +SLL + ++ + + H+MK Sbjct: 41 YSSYIKFMGTSLNPAKALEIYHSIPDESTKTNVFICNSLLRCLVRNTKFDSSMKFFHKMK 100 Query: 320 SAGLVLNKVILTTLLKVYVR-GSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQI 496 + GL + + +TLL ++ + K+ +L+ EL G D + Y L+ + Sbjct: 101 NNGLTPDAITYSTLLAGCMKIKDGYSKALDLVQELNYNGLQMDSIMYGTLLAVCASNNRC 160 Query: 497 PEAKSVFDDMKNKNVKTNGYCYSIMISAFCRVGLLQDAKQLASSFEA 637 EA+S F+ MK++ N + YS +++A+ G + A++L ++ Sbjct: 161 EEAQSYFNQMKDEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKS 207 >ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223531275|gb|EEF33118.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 672 Score = 324 bits (831), Expect = 8e-87 Identities = 156/222 (70%), Positives = 191/222 (86%) Frame = +2 Query: 2 RQGLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQY 181 + GL PD +TYSTLL+GC K K GYS+ L+ VQELK+N L+MD+VIYGT+L+VCAS+++ Sbjct: 209 QNGLTPDTITYSTLLSGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCASHNRC 268 Query: 182 KEAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTL 361 +EAE YF +MK+EGH PNVFHYSSLLNAY+ GNY+KA+EL+ +MKS GLV NKVI TTL Sbjct: 269 EEAESYFSQMKNEGHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSLGLVPNKVIWTTL 328 Query: 362 LKVYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNV 541 LKVYVRG LFEKS++LL ELE LGYAEDEMPYC+LMDGL KAG++ EA+S FD+MK KNV Sbjct: 329 LKVYVRGGLFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNV 388 Query: 542 KTNGYCYSIMISAFCRVGLLQDAKQLASSFEAEHDKYDVVIL 667 K++GY YSIMISA+CR LL++AKQLA FEA++DKYDVVIL Sbjct: 389 KSDGYAYSIMISAYCRGRLLEEAKQLAKEFEAKYDKYDVVIL 430 Score = 78.2 bits (191), Expect = 1e-12 Identities = 50/195 (25%), Positives = 93/195 (47%) Frame = +2 Query: 5 QGLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYK 184 +G P++ YS+LL A G Y +A ELVQ++K L + VI+ TLL V ++ Sbjct: 281 EGHLPNVFHYSSLLNAYAS-SGNYKKAEELVQDMKSLGLVPNKVIWTTLLKVYVRGGLFE 339 Query: 185 
EAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLL 364 +++Q E+++ G++ + Y L++ S G ++A EMK + + + ++ Sbjct: 340 KSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNVKSDGYAYSIMI 399 Query: 365 KVYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVK 544 Y RG L E++++L E EA D + ++ +AG + M + Sbjct: 400 SAYCRGRLLEEAKQLAKEFEAKYDKYDVVILNTMLCAYCRAGDMESVMQTMRKMDELAIS 459 Query: 545 TNGYCYSIMISAFCR 589 + + I+I FC+ Sbjct: 460 PSYCTFHILIKYFCK 474 Score = 58.9 bits (141), Expect = 8e-07 Identities = 48/190 (25%), Positives = 88/190 (46%), Gaps = 1/190 (0%) Frame = +2 Query: 8 GLKPDIVTYSTLLTGCAKVKGG-YSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYK 184 GL P+ V ++TLL V+GG + ++ +L+ EL+ D + Y L+ + + Sbjct: 317 GLVPNKVIWTTLLK--VYVRGGLFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVD 374 Query: 185 EAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLL 364 EA +F+EMK + + + YS +++AY E+A +L E ++ + VIL T+L Sbjct: 375 EARSFFDEMKEKNVKSDGYAYSIMISAYCRGRLLEEAKQLAKEFEAKYDKYDVVILNTML 434 Query: 365 KVYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVK 544 Y R E + + +++ L + + IL+ K A +DM K + Sbjct: 435 CAYCRAGDMESVMQTMRKMDELAISPSYCTFHILIKYFCKQKLYLLAYQTMEDMHRKGHQ 494 Query: 545 TNGYCYSIMI 574 S++I Sbjct: 495 PEEELCSMLI 504 Score = 58.2 bits (139), Expect = 1e-06 Identities = 37/152 (24%), Positives = 75/152 (49%), Gaps = 1/152 (0%) Frame = +2 Query: 185 EAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLL 364 +A + + + E NVF +S+L+ G ++ + +L H+MK GL + + +TLL Sbjct: 164 KALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLFHKMKQNGLTPDTITYSTLL 223 Query: 365 KVYVRG-SLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNV 541 ++ + K+ + + EL+ G D + Y ++ + EA+S F MKN+ Sbjct: 224 SGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCASHNRCEEAESYFSQMKNEGH 283 Query: 542 KTNGYCYSIMISAFCRVGLLQDAKQLASSFEA 637 N + YS +++A+ G + A++L ++ Sbjct: 284 LPNVFHYSSLLNAYASSGNYKKAEELVQDMKS 315 >ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Cucumis sativus] Length = 668 Score = 315 bits 
(808), Expect = 4e-84 Identities = 151/220 (68%), Positives = 192/220 (87%) Frame = +2 Query: 8 GLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYKE 187 GL PD VTYST+LTGC +VK GY++A+EL++EL+ N L MD V YGTL+++CAS+++ ++ Sbjct: 202 GLCPDTVTYSTMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLED 261 Query: 188 AEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLLK 367 AE++F +M++EGHSPN+FHY SLLNAYS++G+Y+KADELI +MK GLV NKVILTTLLK Sbjct: 262 AERFFNQMRAEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLK 321 Query: 368 VYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVKT 547 VYVRG LFEKSR+LL+ELE+LGY E+EMPYC+LMDGL KAG I EAK+VFD+MK KNVKT Sbjct: 322 VYVRGGLFEKSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREAKTVFDEMKAKNVKT 381 Query: 548 NGYCYSIMISAFCRVGLLQDAKQLASSFEAEHDKYDVVIL 667 +GY +SIMISAFCR GLL++AK LA FEA +D+YD+VIL Sbjct: 382 DGYAHSIMISAFCRGGLLEEAKLLAKDFEATYDRYDIVIL 421 Score = 73.9 bits (180), Expect = 3e-11 Identities = 44/187 (23%), Positives = 92/187 (49%), Gaps = 1/187 (0%) Frame = +2 Query: 80 RALELVQELKFNELRMDSVIYGTLLSVCASNDQYKEAEQYFEEMKSEGHSPNVFHYSSLL 259 +ALE+ ++ ++ I ++L+ N ++ + + F +MK++G P+ YS++L Sbjct: 155 KALEVYNNIEEVSIKNSIFICNSILNCLVRNGKFDTSVKLFHQMKNDGLCPDTVTYSTML 214 Query: 260 -NAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLLKVYVRGSLFEKSRELLNELEALGY 436 V Y KA EL+ E++ GL ++ V TL+ + + E + N++ A G+ Sbjct: 215 TGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLEDAERFFNQMRAEGH 274 Query: 437 AEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVKTNGYCYSIMISAFCRVGLLQDAKQ 616 + + Y L++ G +A + +DMK + N + ++ + R GL + +++ Sbjct: 275 SPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFEKSRK 334 Query: 617 LASSFEA 637 L S E+ Sbjct: 335 LLSELES 341 Score = 68.6 bits (166), Expect = 1e-09 Identities = 43/193 (22%), Positives = 92/193 (47%) Frame = +2 Query: 5 QGLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYK 184 +G P++ Y +LL + + G Y +A EL++++K L + VI TLL V ++ Sbjct: 272 EGHSPNMFHYGSLLNAYS-INGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 330 Query: 185 
EAEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLL 364 ++ + E++S G+ N Y L++ + G+ +A + EMK+ + + + ++ Sbjct: 331 KSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREAKTVFDEMKAKNVKTDGYAHSIMI 390 Query: 365 KVYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVK 544 + RG L E+++ L + EA D + ++ +AG++ + M + + Sbjct: 391 SAFCRGGLLEEAKLLAKDFEATYDRYDIVILNTMLCAYCRAGEMESVMQMLRKMDDLAIS 450 Query: 545 TNGYCYSIMISAF 583 + + I+I F Sbjct: 451 PDYNTFHILIKYF 463 >ref|XP_003531588.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Glycine max] Length = 630 Score = 314 bits (804), Expect = 1e-83 Identities = 150/220 (68%), Positives = 186/220 (84%) Frame = +2 Query: 8 GLKPDIVTYSTLLTGCAKVKGGYSRALELVQELKFNELRMDSVIYGTLLSVCASNDQYKE 187 GL PD+VTY+TLL GC K++ GY++ALEL+QEL+ N+L+MD VIYGT+++VCASN +++E Sbjct: 170 GLLPDLVTYTTLLAGCIKIENGYAKALELIQELQHNKLQMDGVIYGTIMAVCASNTKWEE 229 Query: 188 AEQYFEEMKSEGHSPNVFHYSSLLNAYSVDGNYEKADELIHEMKSAGLVLNKVILTTLLK 367 AE YF +MK EGH+PNV+HYSSL+NAYS GNY+KAD LI +MKS GLV NKVILTTLLK Sbjct: 230 AEYYFNQMKDEGHTPNVYHYSSLINAYSACGNYKKADMLIQDMKSEGLVPNKVILTTLLK 289 Query: 368 VYVRGSLFEKSRELLNELEALGYAEDEMPYCILMDGLVKAGQIPEAKSVFDDMKNKNVKT 547 VYV+G LFEKSRELL EL++LGYAEDEMPYCI MDGL KAGQI EAK +FD+M +V++ Sbjct: 290 VYVKGGLFEKSRELLAELKSLGYAEDEMPYCIFMDGLAKAGQIHEAKLIFDEMMKNHVRS 349 Query: 548 NGYCYSIMISAFCRVGLLQDAKQLASSFEAEHDKYDVVIL 667 +GY +SIMISAFCR L ++AKQLA FE +KYD+VIL Sbjct: 350 DGYAHSIMISAFCRAKLFREAKQLAKDFETTSNKYDLVIL 389
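The per-HSP statistics above (Score, Expect, Identities, Positives, Gaps, Frame) can be pulled out of the flattened report text and cross-checked against the query coordinates. The following is a minimal sketch, assuming the report has been read into a single string. The names `HSP_STATS`, `parse_hsp_stats`, `query_frame` and `codons_spanned` are hypothetical helpers and not part of BLAST. The E-value pattern only covers the forms that occur in this report (for example `3e-90`), and the frame helper assumes a forward (+) frame. The percentages printed in the report look truncated (166/222 is shown as 74%), so the sketch recomputes them from the counts.

```python
import re

# One HSP statistics header, tolerant of collapsed whitespace.
# "Gaps" is optional because ungapped HSPs in this report omit it.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.]+(?:e[+-]?\d+)?)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsp_stats(report_text):
    """Return one dict per HSP statistics header found in report_text."""
    hsps = []
    for m in HSP_STATS.finditer(report_text):
        identities, length = int(m["ident"]), int(m["length"])
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(m["evalue"]),
            "identities": identities,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"] or 0),   # absent group means no gaps
            "length": length,              # alignment columns, gaps included
            "frame": int(m["frame"]),
            "pct_identity": 100.0 * identities / length,
        })
    return hsps


def query_frame(start):
    """Forward reading frame (1, 2 or 3) implied by a 1-based query start."""
    return (start - 1) % 3 + 1


def codons_spanned(start, end):
    """Number of whole codons covered by 1-based nucleotide range start..end."""
    return (end - start + 1) // 3


if __name__ == "__main__":
    # Two headers copied from the first hit of this report.
    sample = (
        "Score = 336 bits (861), Expect = 3e-90 "
        "Identities = 166/222 (74%), Positives = 193/222 (86%) Frame = +2 "
        "Score = 78.2 bits (191), Expect = 1e-12 "
        "Identities = 46/186 (24%), Positives = 93/186 (50%), "
        "Gaps = 1/186 (0%) Frame = +2"
    )
    for hsp in parse_hsp_stats(sample):
        print(hsp)
    print(query_frame(2), codons_spanned(2, 667))  # prints: 2 222
```

For the top HSP, query nucleotides 2 to 667 span 222 codons in frame +2, which matches the reported alignment length of 222. For the second HSP of the same hit, query nucleotides 80 to 634 span 185 codons, while the reported length is 186 because alignment length counts the one gap column.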