BLASTX nr result
ID: Angelica23_contig00004226
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00004226 (2197 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002270788.1| PREDICTED: pentatricopeptide repeat-containi... 545 e-152 ref|XP_002524769.1| pentatricopeptide repeat-containing protein,... 532 e-148 ref|XP_004147968.1| PREDICTED: pentatricopeptide repeat-containi... 472 e-130 ref|XP_003520548.1| PREDICTED: pentatricopeptide repeat-containi... 452 e-124 ref|XP_003553441.1| PREDICTED: pentatricopeptide repeat-containi... 444 e-122 >ref|XP_002270788.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780 [Vitis vinifera] gi|296086664|emb|CBI32299.3| unnamed protein product [Vitis vinifera] Length = 494 Score = 545 bits (1404), Expect = e-152 Identities = 262/449 (58%), Positives = 344/449 (76%) Frame = +2 Query: 257 NIIGLFSDRLSHNEMQAKEDLIHMASVLRGELIRVDDCRQNLIFKVLDEKGSSWFKCYDD 436 ++IGLFSD+ S +++A+E+L S LR EL+ D +++ +VL+EKG S F+ Y + Sbjct: 42 SMIGLFSDKHSELDLRAREELRGKVSQLRDELVPSGD-DSDMVVRVLEEKGESLFRSYSN 100 Query: 437 GAAFVELLRQLEFLPRCALQVLNWRRKQSDNGFPMTAEEYAKSIRLAGRSKNIDLATELF 616 G+AFVELL+QL P ALQV NWRR Q+D PMT+EEYAK I +AGR+KN+DLA ELF Sbjct: 101 GSAFVELLKQLSSWPYLALQVFNWRRNQTDYSIPMTSEEYAKGISVAGRTKNVDLAVELF 160 Query: 617 TEAANRRIKTAATYNALLGAYMYNDCTDKCQLLYGDFKKDISCKPTVVTYNILISVFGRL 796 TEAAN++IKT +TYNAL+GAYM N +KCQ L+ D K++ SC PT+VTYNILISVFGRL Sbjct: 161 TEAANKQIKTTSTYNALMGAYMCNGHAEKCQALFRDLKREASCSPTIVTYNILISVFGRL 220 Query: 797 MLIKHMEAALQEMNNSNIKPNLNTYNMLLGGYVTAWMWDTMEKTYKVMEASGIRPDIHTH 976 ML+ HMEA +E+ + PN++TYN ++ GYVTAWMW+ ME T++ M+ I+PDI+TH Sbjct: 221 MLVDHMEATFREIKELELSPNISTYNNIIAGYVTAWMWNRMEDTFRTMKEDNIQPDINTH 280 Query: 977 LLLLRGYAHSGNLKKMEEIYKLVNYHVIEHEEFPLIRAMICAYCKSTSSERVQKVEELLR 1156 LL+LRGYAHSGNL+KMEE Y+L+ HV +E PLIRAMICAYCKS+ ++RV+K+ L++ Sbjct: 281 LLMLRGYAHSGNLQKMEETYELIKGHV-NDKEIPLIRAMICAYCKSSITDRVEKIGALMK 339 Query: 1157 LIPKHEYKSWLNVMLIKLYAQEDIMDVMEKYIDEAFAHNISVKTAGIMRCIITSYFRANA 1336 LIP++EY+ WLNVMLI++YAQED ++ ME I+EAF H SVKT +MR II +YFR NA Sbjct: 340 LIPENEYRPWLNVMLIRVYAQEDWVEEMENSINEAFEHKTSVKTMRVMRSIIATYFRCNA 399 Query: 1337 SDKLTNFVKRAEHAEWKICRSLYHCRMVMYSSENRLAEMEGVLSEMGNFNLQPTKQTFLI 1516 D+L NFVKRAE W ICRSLYHC+MVMY+S+ RL EME VL+EM + N TK+TF I Sbjct: 400 VDRLANFVKRAECGGWHICRSLYHCKMVMYASQKRLEEMESVLNEMESSNFCCTKKTFWI 459 Query: 1517 MYKAYLTWGQKRKLDQILGVMCTQGFGFP 1603 +YKAY W Q+ K++Q+ G+MC G+G P Sbjct: 460 LYKAYSMWDQRHKVEQVKGLMCKHGYGIP 488 >ref|XP_002524769.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223535953|gb|EEF37612.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 509 Score = 532 bits (1371), Expect = e-148 Identities = 280/519 (53%), Positives = 368/519 (70%) Frame = +2 Query: 56 MIRVWKLSESAKSELIITLRTFITKPKNIVPQTPYLITNNPITTSTHRFSLKDSFYSHPN 235 M RV K+S+ A +++L F P TPY++T T + + L + S P Sbjct: 1 MKRVSKISDLAVQAELLSLNKF---PSITQTLTPYILT----LTKSPIYKLARAHTSEP- 52 Query: 236 LNSNPFPNIIGLFSDRLSHNEMQAKEDLIHMASVLRGELIRVDDCRQNLIFKVLDEKGSS 415 + FPNII LFS R + +A EDL S LR EL++ + + F+VL+E+G S Sbjct: 53 --VSFFPNIISLFSRRFPVDN-KAIEDLSKTVSHLRDELVQHAE-DSDKFFRVLEEQGDS 108 Query: 416 WFKCYDDGAAFVELLRQLEFLPRCALQVLNWRRKQSDNGFPMTAEEYAKSIRLAGRSKNI 595 F+ D +A VELLRQL LP A++V NWRRKQ++ PMT EEYAK I +AGR+KN+ Sbjct: 109 LFRMRSDRSALVELLRQLVSLPHLAVEVFNWRRKQTEWSTPMTHEEYAKGITIAGRAKNV 168 Query: 596 DLATELFTEAANRRIKTAATYNALLGAYMYNDCTDKCQLLYGDFKKDISCKPTVVTYNIL 775 DLA E+F EA ++R K YNAL+GAYMYN DKCQ L+ DFKK+ + P+VVTYNIL Sbjct: 169 DLAIEIFAEACSKRRKKTCIYNALMGAYMYNGHYDKCQSLFLDFKKEANIGPSVVTYNIL 228 Query: 776 ISVFGRLMLIKHMEAALQEMNNSNIKPNLNTYNMLLGGYVTAWMWDTMEKTYKVMEASGI 955 ISVFGR ML+ HMEA +E+ N NI PN++TYN L+ GYVTAWMWD ME+ +++M+ I Sbjct: 229 ISVFGRSMLVDHMEATFRELMNLNISPNVSTYNNLIAGYVTAWMWDDMEQVFQLMKEGPI 288 Query: 956 RPDIHTHLLLLRGYAHSGNLKKMEEIYKLVNYHVIEHEEFPLIRAMICAYCKSTSSERVQ 1135 P + T+LL+LRGYAHSGN++KMEE+YKLV HV E PLIR MICAYCKS+ ++R++ Sbjct: 289 YPHLDTYLLMLRGYAHSGNIEKMEEMYKLVQDHV-NVNEVPLIRTMICAYCKSSITDRIK 347 Query: 1136 KVEELLRLIPKHEYKSWLNVMLIKLYAQEDIMDVMEKYIDEAFAHNISVKTAGIMRCIIT 1315 K+EELLRLIP+ EY+ WLNV+LIK+YAQ+++++ ME IDEAF H ++ T GIMR II Sbjct: 348 KIEELLRLIPEEEYRPWLNVLLIKVYAQQNLLEAMENKIDEAFKHETTITTVGIMRTIIA 407 Query: 1316 SYFRANASDKLTNFVKRAEHAEWKICRSLYHCRMVMYSSENRLAEMEGVLSEMGNFNLQP 1495 SYFR NA D+L +FVKRAE + W+ICRSLYHC+MVMY+SE RL EME VL++M NFNL Sbjct: 408 SYFRCNAVDRLADFVKRAECSGWRICRSLYHCKMVMYASEKRLDEMESVLNDMENFNLGR 467 Query: 1496 TKQTFLIMYKAYLTWGQKRKLDQILGVMCTQGFGFPGSA 1612 TK+TF+I+YKAYL G+K K++Q+LG+M G+ P A Sbjct: 468 TKKTFVILYKAYLMCGKKYKVEQVLGLMYKHGYEVPEGA 506 >ref|XP_004147968.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Cucumis sativus] gi|449494249|ref|XP_004159492.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Cucumis sativus] Length = 514 Score = 472 bits (1215), Expect = e-130 Identities = 244/493 (49%), Positives = 332/493 (67%), Gaps = 3/493 (0%) Frame = +2 Query: 134 KNIVPQTPYLITNNPITTSTHRFSLKDSFYSHPNLNSNPFP---NIIGLFSDRLSHNEMQ 304 +NI + +TN H + S P + + F N+I LFS++ +E+ Sbjct: 17 RNIFASFAFSLTNTNYCFLPHGLHTQSRSSSSPPPSISSFSLVANLIELFSNKPDDHEIL 76 Query: 305 AKEDLIHMASVLRGELIRVDDCRQNLIFKVLDEKGSSWFKCYDDGAAFVELLRQLEFLPR 484 A + + ELI ++ N I K+L++ + DG+AFVELL+QL P Sbjct: 77 AICIMERKVLEVSHELI-LNSEDSNKIVKILEDSKDLLLWKHTDGSAFVELLKQLGSQPN 135 Query: 485 CALQVLNWRRKQSDNGFPMTAEEYAKSIRLAGRSKNIDLATELFTEAANRRIKTAATYNA 664 AL+V NWRR+Q + FP+T EEYAK I +AG+SK+IDLA LF EA+N+R+K +TYNA Sbjct: 136 LALEVFNWRRRQGGS-FPLTVEEYAKGIAVAGKSKHIDLAVGLFNEASNKRVKATSTYNA 194 Query: 665 LLGAYMYNDCTDKCQLLYGDFKKDISCKPTVVTYNILISVFGRLMLIKHMEAALQEMNNS 844 L+G +M+N DKC ++ D K+D C P +VTYNILISVFGRLML+ HMEA ++E++N Sbjct: 195 LMGVFMFNGLADKCNSVFRDLKRDAGCVPNIVTYNILISVFGRLMLVDHMEATMREIHNL 254 Query: 845 NIKPNLNTYNMLLGGYVTAWMWDTMEKTYKVMEASGIRPDIHTHLLLLRGYAHSGNLKKM 1024 N+ PN+NTYN L+ GY+TAWMW ME+ + M+AS I P+ T LL+LRGYAHS NL+KM Sbjct: 255 NLSPNVNTYNSLIAGYITAWMWKRMEQAFMKMKASSITPNTETFLLMLRGYAHSDNLEKM 314 Query: 1025 EEIYKLVNYHVIEHEEFPLIRAMICAYCKSTSSERVQKVEELLRLIPKHEYKSWLNVMLI 1204 EE++ + HV FPLIRAMI AY +S+ +++V K++ LL+LIP+ EY+ WLNV LI Sbjct: 315 EEMHHFLKDHV-NKNNFPLIRAMIYAYSRSSITDKVHKIDALLKLIPEEEYRPWLNVKLI 373 Query: 1205 KLYAQEDIMDVMEKYIDEAFAHNISVKTAGIMRCIITSYFRANASDKLTNFVKRAEHAEW 1384 ++YAQ D ++ ME I+EAF H SV T +MR II SYFR NA DKL NF+ RAE + W Sbjct: 374 RVYAQADCLERMENSINEAFEHGTSVYTVHVMRSIIASYFRCNAVDKLINFISRAESSGW 433 Query: 1385 KICRSLYHCRMVMYSSENRLAEMEGVLSEMGNFNLQPTKQTFLIMYKAYLTWGQKRKLDQ 1564 +ICRSLYHC+MVM++S+NRL EME VL EM NFNL +K+TF I+YKAY T G + K +Q Sbjct: 434 RICRSLYHCKMVMFASQNRLEEMECVLDEMKNFNLDWSKKTFYILYKAYSTSGCRYKANQ 493 Query: 1565 ILGVMCTQGFGFP 1603 ++ MC G+G P Sbjct: 494 VVCRMCKLGYGVP 506 >ref|XP_003520548.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Glycine max] Length = 442 Score = 452 bits (1163), Expect = e-124 Identities = 222/433 (51%), Positives = 308/433 (71%) Frame = +2 Query: 314 DLIHMASVLRGELIRVDDCRQNLIFKVLDEKGSSWFKCYDDGAAFVELLRQLEFLPRCAL 493 DLI+ ++L+ ELIR D C + +LD+ + F+ + +G+A ++L+ QL P AL Sbjct: 10 DLINKVTILKNELIR-DSCDSARVQTILDDSFDTLFRRHPNGSALLKLMNQLNSNPSLAL 68 Query: 494 QVLNWRRKQSDNGFPMTAEEYAKSIRLAGRSKNIDLATELFTEAANRRIKTAATYNALLG 673 QV +WRRK+S+ PM A EY+K I+ AGRS N+DLA +LF EAA + IKT TYNAL+G Sbjct: 69 QVFSWRRKRSNAENPMDAYEYSKGIKAAGRSGNVDLAVKLFKEAAVKGIKTTGTYNALMG 128 Query: 674 AYMYNDCTDKCQLLYGDFKKDISCKPTVVTYNILISVFGRLMLIKHMEAALQEMNNSNIK 853 A+M+N D CQ L+ D K+D++C P++ TYNIL+SV+GRLML+ HMEA E+ N+ Sbjct: 129 AFMFNGLPDNCQSLFCDLKRDLTCDPSIATYNILLSVYGRLMLVDHMEATFSEIQRLNLA 188 Query: 854 PNLNTYNMLLGGYVTAWMWDTMEKTYKVMEASGIRPDIHTHLLLLRGYAHSGNLKKMEEI 1033 N+ TYN L+ GY+TAWMWD MEK +++++ S + P++ THLL+LRGYA+SGNL+KMEE+ Sbjct: 189 MNICTYNHLIAGYITAWMWDDMEKVFQMLKLSSVEPNMKTHLLMLRGYANSGNLEKMEEM 248 Query: 1034 YKLVNYHVIEHEEFPLIRAMICAYCKSTSSERVQKVEELLRLIPKHEYKSWLNVMLIKLY 1213 Y + HV +E LIR MICAYC+S+ ++R++K+E LL+ IP+ EY+ WLNV+LIKLY Sbjct: 249 YSFIRDHV-NIKEISLIRCMICAYCRSSHADRLKKIELLLKFIPQKEYRPWLNVLLIKLY 307 Query: 1214 AQEDIMDVMEKYIDEAFAHNISVKTAGIMRCIITSYFRANASDKLTNFVKRAEHAEWKIC 1393 A+ED + ME I+EAF H S+ T GI+R II +Y+R NA +KL NFV+RAE + W IC Sbjct: 308 AKEDWLAKMENAINEAFEHGTSITTKGILRRIIATYYRCNAVEKLENFVRRAEISGWSIC 367 Query: 1394 RSLYHCRMVMYSSENRLAEMEGVLSEMGNFNLQPTKQTFLIMYKAYLTWGQKRKLDQILG 1573 RSLYHC++VMY S+ L EM VL EM N L+ +K+T IMYKAY++ GQ + + LG Sbjct: 368 RSLYHCKLVMYGSQMTLLEMHNVLEEMENVKLKCSKKTLWIMYKAYMSRGQISMVLKTLG 427 Query: 1574 VMCTQGFGFPGSA 1612 M G+ P SA Sbjct: 428 QMFKHGYEVPLSA 440 >ref|XP_003553441.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Glycine max] Length = 504 Score = 444 bits (1143), Expect = e-122 Identities = 241/523 (46%), Positives = 340/523 (65%), Gaps = 4/523 (0%) Frame = +2 Query: 56 MIRVWKLSESA--KSELIITLRTFITKPKNIVPQTPYLITNNPITTSTHRFSLKDSFYSH 229 M RVW+ S A +++L++ L++ + +NP TT SL + H Sbjct: 1 MRRVWRFSSDATRRAQLLLLLQSHY-----------FHGFSNPKTTP----SLATALSHH 45 Query: 230 PNLNS--NPFPNIIGLFSDRLSHNEMQAKEDLIHMASVLRGELIRVDDCRQNLIFKVLDE 403 NLNS N ++I LF+ + + AKEDLI+ AS+L+ ELIR + + +LD+ Sbjct: 46 RNLNSVPNACSDLIPLFTAKSCSS---AKEDLINKASILKNELIR-ESSDSARVLSILDD 101 Query: 404 KGSSWFKCYDDGAAFVELLRQLEFLPRCALQVLNWRRKQSDNGFPMTAEEYAKSIRLAGR 583 + + + DG+ + L+ QL P ALQV +WRRK+S+ PM A EY+K I+ AGR Sbjct: 102 NSDTLIQRHPDGSVLLRLMNQLNSNPSLALQVFSWRRKRSNVENPMDAYEYSKGIKAAGR 161 Query: 584 SKNIDLATELFTEAANRRIKTAATYNALLGAYMYNDCTDKCQLLYGDFKKDISCKPTVVT 763 S N+DLA +LF EAA + IKT +TYNAL+GA M N D CQ L+ D K+D +C P++ T Sbjct: 162 SGNVDLAVKLFKEAAVKGIKTTSTYNALMGACMSNGLADNCQSLFCDLKRDPTCDPSIAT 221 Query: 764 YNILISVFGRLMLIKHMEAALQEMNNSNIKPNLNTYNMLLGGYVTAWMWDTMEKTYKVME 943 YNIL+SVFGRLML+ HMEA +E+ N+ TYN ++ GY+TAWMWD ME +++++ Sbjct: 222 YNILLSVFGRLMLVDHMEATFREIQKLTFTMNICTYNHMIAGYITAWMWDDMENVFQMLK 281 Query: 944 ASGIRPDIHTHLLLLRGYAHSGNLKKMEEIYKLVNYHVIEHEEFPLIRAMICAYCKSTSS 1123 S + P++ T++L+LRGYA+SGNL+KMEEIY + V + +E LIR MICAY +S+ + Sbjct: 282 RSPVEPNMKTYMLMLRGYANSGNLEKMEEIYSFITDRV-DIKEISLIRCMICAYSRSSDA 340 Query: 1124 ERVQKVEELLRLIPKHEYKSWLNVMLIKLYAQEDIMDVMEKYIDEAFAHNISVKTAGIMR 1303 +R++K+E LL+ IP EY+ WLNV+LIKLYA+ED ++ ME I+EAF H S+ T GI+R Sbjct: 341 DRLKKIELLLKFIPGKEYRPWLNVLLIKLYAKEDWLEKMENAINEAFEHGTSITTKGILR 400 Query: 1304 CIITSYFRANASDKLTNFVKRAEHAEWKICRSLYHCRMVMYSSENRLAEMEGVLSEMGNF 1483 CI+ +Y+R NA +KL NFV+RAE + W ICRS YHC++VMY S+ L M VL EM Sbjct: 401 CIVATYYRYNAVEKLENFVRRAEISGWSICRSAYHCKLVMYGSQ--LPLMHNVLEEMEMV 458 Query: 1484 NLQPTKQTFLIMYKAYLTWGQKRKLDQILGVMCTQGFGFPGSA 1612 NL+ TK+T IMYKAY+ GQ + + LG M G+ P SA Sbjct: 459 NLECTKKTLWIMYKAYMRNGQSSMVLKTLGQMFKHGYEVPLSA 501