BLASTX nr result
ID: Catharanthus22_contig00038520
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00038520
         (906 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EOX94675.1| 2-oxoglutarate and Fe(II)-dependent oxygenase sup...   198   3e-48
ref|XP_004158909.1| PREDICTED: uncharacterized protein LOC101226...   195   2e-47
ref|XP_004146972.1| PREDICTED: uncharacterized protein LOC101222...   195   2e-47
ref|XP_006349711.1| PREDICTED: uncharacterized protein LOC102597...   195   2e-47
ref|XP_004247194.1| PREDICTED: uncharacterized protein LOC101264...   192   1e-46
ref|XP_002302100.2| hypothetical protein POPTR_0002s05010g [Popu...   189   9e-46
ref|XP_003632344.1| PREDICTED: uncharacterized protein LOC100853...   189   1e-45
ref|XP_006575061.1| PREDICTED: uncharacterized protein LOC100786...   184   3e-44
ref|XP_006444000.1| hypothetical protein CICLE_v10023787mg [Citr...   182   1e-43
gb|EMJ01304.1| hypothetical protein PRUPE_ppa019227mg [Prunus pe...   182   1e-43
ref|XP_002524730.1| hypothetical protein RCOM_0646070 [Ricinus c...   179   1e-42
ref|XP_006402290.1| hypothetical protein EUTSA_v10006021mg [Eutr...   176   8e-42
ref|XP_006838976.1| hypothetical protein AMTR_s00002p00271300 [A...   172   2e-40
ref|XP_004292581.1| PREDICTED: uncharacterized protein LOC101308...   169   1e-39
emb|CAB86430.1| putative protein [Arabidopsis thaliana]               169   1e-39
ref|NP_191888.2| 2-oxoglutarate (2OG) and Fe(II)-dependent oxyge...   169   1e-39
ref|XP_002876716.1| hypothetical protein ARALYDRAFT_486835 [Arab...   164   4e-38
gb|ESW16652.1| hypothetical protein PHAVU_007G174300g [Phaseolus...   162   2e-37
ref|XP_006293017.1| hypothetical protein CARUB_v10019295mg [Caps...
158 3e-36 ref|XP_004495174.1| PREDICTED: uncharacterized protein LOC101496... 157 5e-36 >gb|EOX94675.1| 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein, putative isoform 1 [Theobroma cacao] Length = 484 Score = 198 bits (503), Expect = 3e-48 Identities = 117/242 (48%), Positives = 143/242 (59%), Gaps = 27/242 (11%) Frame = -2 Query: 659 IMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSDVPLKN 480 IMENLG GPGL+AI +VP AS L + RK IL+EHNLGSDVPLKN Sbjct: 75 IMENLGPTGPGLLAITNVPDASLFRRKLLPLASKLALLGPEDRKRILREHNLGSDVPLKN 134 Query: 479 LDRTVSSFAMQLKY----EKLGSERSEGQETL---------PMDDSSDAEFKDLEHCFKV 339 DR VSSFAMQLKY E + ++ S G +L + D D EF DLE+ FK Sbjct: 135 PDRNVSSFAMQLKYSQGLESIETKPSHGVGSLLNLENENICRISDFEDDEFDDLENMFKA 194 Query: 338 XXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGK 159 LAR CDRAIGG E+E+SLLESC+AKGRLIHYHS +D+ ++++ +RKG Sbjct: 195 LGFCMMELGLCLARICDRAIGGNELEQSLLESCAAKGRLIHYHSIVDSLVLREAGRRKGS 254 Query: 158 IRDGFRANGMKKPEQ--------------LENANNQAELWQQWHYDYGIFTVLTAPMFMS 21 + AN + EQ + + + QA LWQQWHYDYGIFTVLT PMF+ Sbjct: 255 SKR--HANNYSRSEQRLSKVANLDTNVNEVRSYDMQANLWQQWHYDYGIFTVLTDPMFLL 312 Query: 20 AS 15 AS Sbjct: 313 AS 314 >ref|XP_004158909.1| PREDICTED: uncharacterized protein LOC101226432 [Cucumis sativus] Length = 446 Score = 195 bits (496), Expect = 2e-47 Identities = 114/252 (45%), Positives = 144/252 (57%), Gaps = 27/252 (10%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 QR++SITK I+E LG GPGL+AI VP +S LN DHRK ILK+HN Sbjct: 35 QRIESITKSILEALGPNGPGLLAITGVPNSSVLRRALLPLARKLALLNPDHRKQILKDHN 94 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSS----------------D 375 LGSDVPL+N +R+VSSFAMQLKY + Q + SS D Sbjct: 95 LGSDVPLRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSGSELDSFCHSIENKLKD 154 Query: 374 AEFKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDN 195 EF+ L + FK +AR CDR IGGRE+EESLLESC+AKGRLIHYHS +D Sbjct: 155 NEFEHLGNSFKELGSCMMELGLRIARICDREIGGRELEESLLESCTAKGRLIHYHSALDA 214 Query: 194 
AIIKQVSKRKGKIRDGFRANGMKKPEQLENA-----------NNQAELWQQWHYDYGIFT 48 ++++ + KG R+ +A+ + EQ + + LWQQWHYDYGIFT Sbjct: 215 QLLRKPANSKGTARN--QASSRRNREQSIQSRHDPSDRKGLCQSSTNLWQQWHYDYGIFT 272 Query: 47 VLTAPMFMSASD 12 VLT PMF+S S+ Sbjct: 273 VLTTPMFLSPSN 284 >ref|XP_004146972.1| PREDICTED: uncharacterized protein LOC101222496 [Cucumis sativus] Length = 446 Score = 195 bits (496), Expect = 2e-47 Identities = 114/252 (45%), Positives = 144/252 (57%), Gaps = 27/252 (10%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 QR++SITK I+E LG GPGL+AI VP +S LN DHRK ILK+HN Sbjct: 35 QRIESITKSILEALGPNGPGLLAITGVPNSSVLRRALLPLARKLALLNPDHRKQILKDHN 94 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSS----------------D 375 LGSDVPL+N +R+VSSFAMQLKY + Q + SS D Sbjct: 95 LGSDVPLRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSGSELDSFCHSIENKLKD 154 Query: 374 AEFKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDN 195 EF+ L + FK +AR CDR IGGRE+EESLLESC+AKGRLIHYHS +D Sbjct: 155 NEFEHLGNSFKELGSCMMELGLRIARICDREIGGRELEESLLESCTAKGRLIHYHSALDA 214 Query: 194 AIIKQVSKRKGKIRDGFRANGMKKPEQLENA-----------NNQAELWQQWHYDYGIFT 48 ++++ + KG R+ +A+ + EQ + + LWQQWHYDYGIFT Sbjct: 215 QLLRKPANSKGTARN--QASSRRNREQSIQSRHDPSDRKGLCQSSTNLWQQWHYDYGIFT 272 Query: 47 VLTAPMFMSASD 12 VLT PMF+S S+ Sbjct: 273 VLTTPMFLSPSN 284 >ref|XP_006349711.1| PREDICTED: uncharacterized protein LOC102597865 [Solanum tuberosum] Length = 441 Score = 195 bits (495), Expect = 2e-47 Identities = 114/240 (47%), Positives = 149/240 (62%), Gaps = 12/240 (5%) Frame = -2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 IQRL+S+T+ +MENLG GPGL+AI VP AS LN+D RK +LKE Sbjct: 31 IQRLESVTRSVMENLGPEGPGLLAITGVPEASNLRRTLLPLARKLALLNNDDRKRLLKEQ 90 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDA----EFKDLEHCFK 342 NLGSDV LKN +R VSSF+MQLKYE+ + L +D+ EFK L FK Sbjct: 91 NLGSDVSLKNPNRDVSSFSMQLKYEQCYERSGCQVDDLDVDNRDGEVDQNEFKKLGCTFK 150 Query: 341 VXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKG 
162 LA+ CD+AIGG+E+++SLLES +AKGRLIHYHS +DN I+++ +KR G Sbjct: 151 ELGYCMMDLGLRLAQICDKAIGGQELQQSLLESGTAKGRLIHYHSAVDNDIVREDAKRNG 210 Query: 161 --KIRDG---FRANGMKKPEQLENANNQAE---LWQQWHYDYGIFTVLTAPMFMSASDQE 6 K R+G K + +E++ +Q+ LWQQWHYDYGIFT+LT PMF+ +S QE Sbjct: 211 QSKARNGKVNKNEQSSLKQQGIESSKDQSNDYGLWQQWHYDYGIFTLLTVPMFLLSSHQE 270 >ref|XP_004247194.1| PREDICTED: uncharacterized protein LOC101264669 [Solanum lycopersicum] Length = 442 Score = 192 bits (489), Expect = 1e-46 Identities = 115/241 (47%), Positives = 153/241 (63%), Gaps = 14/241 (5%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 QRL+S T+ +M+NLG GPGL+AI VP AS LN++ RK +LKE N Sbjct: 32 QRLKSATRSVMKNLGPEGPGLLAITGVPEASNLRRTLLPLARKLALLNNEDRKRLLKEQN 91 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDS-----SDAEFKDLEHCFK 342 LGSDV LKN +R VSSF+MQLKYE+ + L +D+ + EFK+L FK Sbjct: 92 LGSDVSLKNPNRDVSSFSMQLKYEQCYERSGCQVDDLDVDNRDRGEVNQDEFKNLGCTFK 151 Query: 341 VXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKG 162 LA+ CD+AIGG+E+++SLLES +AKGRLIHYHS +DN I+++ +KR G Sbjct: 152 ELGYCMMDLGLRLAQICDKAIGGQELQQSLLESGTAKGRLIHYHSAVDNDIVREDAKRNG 211 Query: 161 --KIRDGFRAN-----GMKKP--EQLENANNQAELWQQWHYDYGIFTVLTAPMFMSASDQ 9 K R+G +AN G+K+ E L++ +N LWQQWHYDYGIFT+LT PMF+ +S Q Sbjct: 212 QSKGRNG-KANKNEQLGLKQQGIESLKDQSNDYGLWQQWHYDYGIFTLLTVPMFLLSSHQ 270 Query: 8 E 6 E Sbjct: 271 E 271 >ref|XP_002302100.2| hypothetical protein POPTR_0002s05010g [Populus trichocarpa] gi|550344311|gb|EEE81373.2| hypothetical protein POPTR_0002s05010g [Populus trichocarpa] Length = 460 Score = 189 bits (481), Expect = 9e-46 Identities = 112/259 (43%), Positives = 142/259 (54%), Gaps = 35/259 (13%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 +R + I K IME LG GPGL++I VP AS L+HD RK ILKEHN Sbjct: 32 ERAERIKKTIMETLGPTGPGLLSITGVPKASILRQRLLPLASKLALLDHDRRKHILKEHN 91 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEKL----------GSERSEGQETLPMDDSSDA----- 372 +GSDVPLKN DR VSSFAMQLKY + + + E+ +DD+ D 
Sbjct: 92 MGSDVPLKNPDRNVSSFAMQLKYAQALESAPGKTNNRARSNSNLESAHLDDNDDEVTDSP 151 Query: 371 --EFKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTID 198 EF +L F+ +A+ CD AIGG+E+E SLLES +AKGRLIHYHS++D Sbjct: 152 EDEFANLSDIFRELGYCMMELGLRVAQICDMAIGGQELERSLLESGTAKGRLIHYHSSLD 211 Query: 197 NAIIKQVSKRKGKI------------------RDGFRANGMKKPEQLENANNQAELWQQW 72 N +IK +RKG + R N + ++ ++ NQ LWQQW Sbjct: 212 NLLIKASGRRKGSTKKQAYCEKNQVLLSRSEQKQSERCNLVANVNEVGSSGNQGNLWQQW 271 Query: 71 HYDYGIFTVLTAPMFMSAS 15 HYDYGIFTVLTAPMF+ S Sbjct: 272 HYDYGIFTVLTAPMFLLPS 290 >ref|XP_003632344.1| PREDICTED: uncharacterized protein LOC100853989 [Vitis vinifera] Length = 548 Score = 189 bits (480), Expect = 1e-45 Identities = 118/258 (45%), Positives = 147/258 (56%), Gaps = 36/258 (13%) Frame = -2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 + RL+SI+ IME LG GPGL+A+ VP S LN R ILKEH Sbjct: 38 LSRLESISTSIMEALGPSGPGLLAVTGVPNTSTLRRSLLPLARKLALLNPQDRNRILKEH 97 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAE------------- 369 +LGSDVPLKNLDR+VSSFAMQLKYE+ GS+ ++ + ++DS + E Sbjct: 98 SLGSDVPLKNLDRSVSSFAMQLKYEQ-GSKSTQSGPSHKVNDSGNQEQDRNDVYGLSKIQ 156 Query: 368 ---FKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTID 198 FK+L FK LAR CDRAI E+E+SLLESCSAKGRLIHYHST+D Sbjct: 157 NEEFKNLGSTFKDLGFCMMELGLHLARICDRAIHREELEQSLLESCSAKGRLIHYHSTLD 216 Query: 197 NAIIKQVSKRKGKIRDGFRANGMKKPEQ-LENANNQAE-------------------LWQ 78 + IIK++ +RKG + +AN + E + N AE LWQ Sbjct: 217 SLIIKEMGRRKGFSKQ--KANHKRDQEHPIRNEQTAAEFPNLGKTGDAGSYCCDPSNLWQ 274 Query: 77 QWHYDYGIFTVLTAPMFM 24 QWHYDYGIFTVLTAP+F+ Sbjct: 275 QWHYDYGIFTVLTAPLFI 292 >ref|XP_006575061.1| PREDICTED: uncharacterized protein LOC100786614 [Glycine max] Length = 420 Score = 184 bits (468), Expect = 3e-44 Identities = 110/229 (48%), Positives = 137/229 (59%), Gaps = 9/229 (3%) Frame = -2 Query: 674 SITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSD 495 SI IME LG GPGL+A+ +VP AS L+ + RK +LKEHNLGSD Sbjct: 27 
SIVDSIMEALGPTGPGLLAVTNVPNASNLRSHLLPLARNLALLDRESRKLVLKEHNLGSD 86 Query: 494 VPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAEFKDLEHCFKVXXXXXXXX 315 VPL+N DRTVSSFAMQLKY K + E M EF++L FK Sbjct: 87 VPLRNPDRTVSSFAMQLKYAKSQHVQQTVSECYGM------EFENLGSSFKELGLCMMEL 140 Query: 314 XXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGKIRDGFRAN 135 LAR CD+AIGG E+E+SLL+SC+AKGRLIHYHS +D ++KQ+ + K + RA Sbjct: 141 GLCLARICDKAIGGNELEQSLLDSCAAKGRLIHYHSHLDALLLKQLERSKATSKR--RAG 198 Query: 134 GMKKPEQLE------NANN---QAELWQQWHYDYGIFTVLTAPMFMSAS 15 +K E LE +AN+ + LWQQWHYDYGIFTVLT P+F+ S Sbjct: 199 NIKPLEGLESNSIAHDANSGGIHSNLWQQWHYDYGIFTVLTTPLFILPS 247 >ref|XP_006444000.1| hypothetical protein CICLE_v10023787mg [Citrus clementina] gi|557546262|gb|ESR57240.1| hypothetical protein CICLE_v10023787mg [Citrus clementina] Length = 448 Score = 182 bits (462), Expect = 1e-43 Identities = 111/263 (42%), Positives = 146/263 (55%), Gaps = 34/263 (12%) Frame = -2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 I+RL+++ +MENLG GPGL++I SVP AS LN D RK +LKEH Sbjct: 33 IKRLETVRTSVMENLGPGGPGLLSITSVPNASIHRRNLLPLARKLALLNPDDRKRLLKEH 92 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYE--------KLGSERSEGQETLPMDDSSDAEFKDLE 354 +LGSDV LKN +R VSSFAMQL+Y+ K S + + + D EFK+L Sbjct: 93 HLGSDVSLKNPERNVSSFAMQLRYKQGLESTQCKFSSRADDNVKDQDLGQLPDNEFKNLG 152 Query: 353 HCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQV- 177 + FK LAR CD+AIGG+E+E+SLLES AKGRLIHYHST+D+ ++K+ Sbjct: 153 NMFKELGFCMIELGLCLARICDKAIGGQELEQSLLESSVAKGRLIHYHSTLDSVVLKEAG 212 Query: 176 -----SKRKGKIRDGFRANGMKKPEQLENAN------------NQAELWQQWHYDYGIFT 48 SK+KG + + ++ +Q E N + LWQQWHYDYG+FT Sbjct: 213 RKGRSSKKKGNPKSD-QGQCIRSEKQTECTNVDGDSDEAGISGTHSNLWQQWHYDYGVFT 271 Query: 47 VLTAPMFM--------SASDQEC 3 VLT P F+ SDQ C Sbjct: 272 VLTDPFFILPYYSSESRGSDQGC 294 >gb|EMJ01304.1| hypothetical protein PRUPE_ppa019227mg [Prunus persica] Length = 414 Score = 182 bits (462), Expect = 1e-43 Identities = 107/226 (47%), Positives = 134/226 (59%), Gaps = 4/226 (1%) Frame = 
-2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 + +LQS +K IME LG GPGL++I VP A+ LN +HRK ILK+H Sbjct: 33 LDKLQSTSKAIMEALGPVGPGLLSITGVPNAAALRRDLLPLARKLALLNPNHRKTILKDH 92 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAEFKDLEHCFKVXXX 330 LGSDVPLKN +R VSSFAMQ+KY E E S EF++L + F+ Sbjct: 93 KLGSDVPLKNPERNVSSFAMQIKYSHDFDETHSNSE-----HGSTIEFENLGNGFRELGF 147 Query: 329 XXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAI-IKQVSKRKGKIR 153 LAR CDRAIGG E+E+SLLESC+AK RLIHYHS ID I +K+ K + Sbjct: 148 CMMELGLQLARVCDRAIGGNELEQSLLESCTAKARLIHYHSPIDKTILVKEAMSTKRTSK 207 Query: 152 DGFRANGMK---KPEQLENANNQAELWQQWHYDYGIFTVLTAPMFM 24 ++G + + +QL + LWQQWHYDYGIFTVLTAPMF+ Sbjct: 208 RPLNSSGKQIGDEHKQLSGIGSD-NLWQQWHYDYGIFTVLTAPMFL 252 >ref|XP_002524730.1| hypothetical protein RCOM_0646070 [Ricinus communis] gi|223535914|gb|EEF37573.1| hypothetical protein RCOM_0646070 [Ricinus communis] Length = 444 Score = 179 bits (455), Expect = 1e-42 Identities = 114/261 (43%), Positives = 149/261 (57%), Gaps = 35/261 (13%) Frame = -2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 + RL+ I IME LG +GPGL++I +VP AS L+ D+RK +LKEH Sbjct: 32 VSRLEKIRTAIMETLGPKGPGLLSITAVPNASLLRRNLLRLAPKLALLHPDNRKRLLKEH 91 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYEK-----LGSE------RSEGQET-LPMDDS---SD 375 NLG+DV LKN R VSSFAMQLKY + LG S + T L +D+ D Sbjct: 92 NLGTDVSLKNPCRKVSSFAMQLKYAEALESVLGKPSHVIHPHSNSEPTYLDVDEVRNFQD 151 Query: 374 AEFKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDN 195 EF++L + FK LA+ CD+ IGGRE+E SLLES +AKGRLIHYHS +DN Sbjct: 152 DEFENLSNVFKDLGYCMMDLGLRLAQICDKFIGGRELERSLLESGTAKGRLIHYHSVLDN 211 Query: 194 AIIKQVSKRKGKIRDGFRANGMK--------KPEQLENAN------------NQAELWQQ 75 ++++ + KG ++ +AN K K + L+ N NQA+LWQ+ Sbjct: 212 LLLRETGRSKGSSKN--QANSKKDCEHSLNTKQDHLQGPNSVITGNKIDSYKNQADLWQE 269 Query: 74 WHYDYGIFTVLTAPMFMSASD 12 WHYDYGIFTVLTAPMF S+ Sbjct: 270 WHYDYGIFTVLTAPMFFVQSN 290 >ref|XP_006402290.1| hypothetical protein EUTSA_v10006021mg [Eutrema salsugineum] 
gi|557103389|gb|ESQ43743.1| hypothetical protein EUTSA_v10006021mg [Eutrema salsugineum] Length = 401 Score = 176 bits (447), Expect = 8e-42 Identities = 99/224 (44%), Positives = 131/224 (58%), Gaps = 1/224 (0%) Frame = -2 Query: 671 ITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSDV 492 I++ +ME LG GPGL+ I V G++ L+ D R ILKEH+LGSDV Sbjct: 33 ISRNVMEALGPTGPGLLCITGVLGSALLRRKLLPLARKLALLDPDKRNRILKEHHLGSDV 92 Query: 491 PLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAEFKDLEHCFKVXXXXXXXXX 312 PLKN +R VSSFAMQL Y++ + G + ++ D EFK+L FK Sbjct: 93 PLKNPERHVSSFAMQLNYDRTSFDEPIGAKLSLKEEDDDDEFKNLGGAFKELGFCMMELG 152 Query: 311 XXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGKIRDGFRANG 132 +AR CDR IGG +EE+LL+SC+AKGRLIHYHS D+ + S+R+ K+ G R + Sbjct: 153 LSIARLCDREIGGGLLEETLLDSCTAKGRLIHYHSAADHQFLLTESQRR-KLSSGNRVSR 211 Query: 131 MKKPEQLENANNQAELWQQWHYDYGIFTVLTAPMFMSA-SDQEC 3 + LWQQWHYDYGIFT+LT PMF+S+ S +EC Sbjct: 212 NHRNGTCFGGTRHFNLWQQWHYDYGIFTILTDPMFLSSYSYEEC 255 >ref|XP_006838976.1| hypothetical protein AMTR_s00002p00271300 [Amborella trichopoda] gi|548841482|gb|ERN01545.1| hypothetical protein AMTR_s00002p00271300 [Amborella trichopoda] Length = 452 Score = 172 bits (435), Expect = 2e-40 Identities = 101/246 (41%), Positives = 138/246 (56%), Gaps = 24/246 (9%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 +RL ++ K +ME LG GPGLIAI VP A LN+ R CILKEH Sbjct: 48 ERLDAVFKTVMETLGPEGPGLIAITGVPNAGAMRRRLLPLARKLALLNNKDRHCILKEHG 107 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEK--------LGSERSEGQ-------ETLPMDDSSDA 372 LGSD LK+LDR+VSSF L+Y++ +GS+ + + E P + + Sbjct: 108 LGSDFSLKDLDRSVSSFVFPLRYQQDFVPKLMHIGSKPGDSEDPDIYSLEQQPHETGN-- 165 Query: 371 EFKDLEHCFKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNA 192 EFKDL + FK AR CD+ IGG E+EES+L S +AKGRLIHYHS +DN Sbjct: 166 EFKDLGNAFKELGFCMVVIGLLFARICDKGIGGGELEESILHSGTAKGRLIHYHSILDNF 225 Query: 191 IIKQVSKRKGKIRDGFRANGMKKPE------QLENANNQ---AELWQQWHYDYGIFTVLT 39 ++K+ ++ +G + R+ + + Q ++Q + LWQQWHYDYG+FTVLT Sbjct: 226 
VLKEAARSRGDKKQRNRSGQILVEDSNVSSLQYSVISSQILPSNLWQQWHYDYGLFTVLT 285 Query: 38 APMFMS 21 PMF+S Sbjct: 286 TPMFLS 291 >ref|XP_004292581.1| PREDICTED: uncharacterized protein LOC101308545 [Fragaria vesca subsp. vesca] Length = 404 Score = 169 bits (429), Expect = 1e-39 Identities = 101/236 (42%), Positives = 139/236 (58%), Gaps = 8/236 (3%) Frame = -2 Query: 689 IQRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEH 510 ++R++ +K IME LG GPGL++I VP A+ ++ +HRK ILK+H Sbjct: 27 LERVELSSKAIMEALGPMGPGLLSIIGVPKAAALRWNLLPLARKLALMDPNHRKLILKDH 86 Query: 509 NLGSDVPLKNLDRTVSSFAMQLKYEK-LGSERSEGQETLPMDDSSDAEFKDLEHCFKVXX 333 LGSDVPLKN DR VSSFAMQ+KY + R + L + F +L + F+ Sbjct: 87 KLGSDVPLKNPDRKVSSFAMQIKYSNDIEDTRVNSEHELV------SGFDNLGNGFRELG 140 Query: 332 XXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAII-------KQVS 174 LAR CDRAIGG+E+E+SLLES +AK RLIHYHS ++ I+ K VS Sbjct: 141 ICMMELGLRLARICDRAIGGQELEQSLLESGTAKARLIHYHSVLEKTILVQEARPKKAVS 200 Query: 173 KRKGKIRDGFRANGMKKPEQLENANNQAELWQQWHYDYGIFTVLTAPMFMSASDQE 6 ++ +I D + +G ++ + LWQQWHYDYGIFTVLTAP+F+ AS+ + Sbjct: 201 SKRIRIGDEVKRSG---------GDDSSNLWQQWHYDYGIFTVLTAPLFVLASNAQ 247 >emb|CAB86430.1| putative protein [Arabidopsis thaliana] Length = 433 Score = 169 bits (428), Expect = 1e-39 Identities = 104/239 (43%), Positives = 132/239 (55%), Gaps = 18/239 (7%) Frame = -2 Query: 683 RLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNL 504 R Q I+K +M+ LG GPGL+ I V G++ L+ D RK IL EH+L Sbjct: 24 RSQWISKNVMDALGPTGPGLLCITGVLGSAFLRRKLLPMARKLALLDPDKRKLILMEHHL 83 Query: 503 GSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQ-------ETLPMDDSSDAEFKDLEHCF 345 GSDVPLKN +R VSSFAMQL YE+ + S G+ L + + D F +L F Sbjct: 84 GSDVPLKNPERDVSSFAMQLNYERTTYKSSLGKLWFDEAGSKLDLQEDDDDAFTNLGGAF 143 Query: 344 KVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRK 165 K +AR CDR IGG +EESLL+SC+AKGRLIHYHS D +++ +R Sbjct: 144 KELGFCMRELGLSIARLCDREIGGGLLEESLLDSCTAKGRLIHYHSAADKYALRESQRRN 203 Query: 164 GKIRDGFRANGMKK----PEQLENANNQA-------ELWQQWHYDYGIFTVLTAPMFMS 21 + G R + ++ EQ N N A 
LWQQWHYDYGIFTVLT PMF+S Sbjct: 204 ---QSGNRVSSKRRVQNAAEQELNRRNGAGLSGSHFNLWQQWHYDYGIFTVLTDPMFLS 259 >ref|NP_191888.2| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [Arabidopsis thaliana] gi|18176035|gb|AAL59972.1| unknown protein [Arabidopsis thaliana] gi|22136904|gb|AAM91796.1| unknown protein [Arabidopsis thaliana] gi|332646941|gb|AEE80462.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [Arabidopsis thaliana] Length = 403 Score = 169 bits (428), Expect = 1e-39 Identities = 104/239 (43%), Positives = 132/239 (55%), Gaps = 18/239 (7%) Frame = -2 Query: 683 RLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNL 504 R Q I+K +M+ LG GPGL+ I V G++ L+ D RK IL EH+L Sbjct: 24 RSQWISKNVMDALGPTGPGLLCITGVLGSAFLRRKLLPMARKLALLDPDKRKLILMEHHL 83 Query: 503 GSDVPLKNLDRTVSSFAMQLKYEKLGSERSEGQ-------ETLPMDDSSDAEFKDLEHCF 345 GSDVPLKN +R VSSFAMQL YE+ + S G+ L + + D F +L F Sbjct: 84 GSDVPLKNPERDVSSFAMQLNYERTTYKSSLGKLWFDEAGSKLDLQEDDDDAFTNLGGAF 143 Query: 344 KVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRK 165 K +AR CDR IGG +EESLL+SC+AKGRLIHYHS D +++ +R Sbjct: 144 KELGFCMRELGLSIARLCDREIGGGLLEESLLDSCTAKGRLIHYHSAADKYALRESQRRN 203 Query: 164 GKIRDGFRANGMKK----PEQLENANNQA-------ELWQQWHYDYGIFTVLTAPMFMS 21 + G R + ++ EQ N N A LWQQWHYDYGIFTVLT PMF+S Sbjct: 204 ---QSGNRVSSKRRVQNAAEQELNRRNGAGLSGSHFNLWQQWHYDYGIFTVLTDPMFLS 259 >ref|XP_002876716.1| hypothetical protein ARALYDRAFT_486835 [Arabidopsis lyrata subsp. lyrata] gi|297322554|gb|EFH52975.1| hypothetical protein ARALYDRAFT_486835 [Arabidopsis lyrata subsp. 
lyrata] Length = 417 Score = 164 bits (415), Expect = 4e-38 Identities = 106/243 (43%), Positives = 134/243 (55%), Gaps = 20/243 (8%) Frame = -2 Query: 671 ITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSDV 492 I++ +M+ LG GPGL+ I V G++ L+ D RK LKEH+LGSD+ Sbjct: 33 ISRNVMDALGPTGPGLLCITGVLGSALLRRKLLPMARKLALLDPDKRKRFLKEHHLGSDL 92 Query: 491 PLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDS--------SDAEFKDLEHCFKVX 336 PLKN +R VSSFAMQL YE+ S E L D++ D EF +L FK Sbjct: 93 PLKNPERDVSSFAMQLNYERTTCISS--LEKLWFDEAVAKLDLHQEDDEFTNLGGAFKEL 150 Query: 335 XXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGKI 156 +AR CDR IGG +EESLLESC+AKGRLIHYHS D +++ R Sbjct: 151 GFCMRELGLSIARICDRDIGGGLLEESLLESCTAKGRLIHYHSAADKCALREAESRN--- 207 Query: 155 RDGFRANGMKK----PEQLENANNQA-------ELWQQWHYDYGIFTVLTAPMFMSA-SD 12 + G R + ++ EQ N + A LWQQWHYDYGIFTVLT PMF+S+ S Sbjct: 208 QSGKRVSSKRRVQNAAEQEGNHRSGAGLSGSHFNLWQQWHYDYGIFTVLTDPMFLSSYSY 267 Query: 11 QEC 3 QEC Sbjct: 268 QEC 270 >gb|ESW16652.1| hypothetical protein PHAVU_007G174300g [Phaseolus vulgaris] Length = 422 Score = 162 bits (409), Expect = 2e-37 Identities = 102/233 (43%), Positives = 125/233 (53%), Gaps = 10/233 (4%) Frame = -2 Query: 674 SITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSD 495 S IME LG GPGL+AI VP AS L + RK +LKEHNLG D Sbjct: 26 STVDSIMEALGPTGPGLLAITGVPNASNLRSHLLPLARSLALLPRETRKIVLKEHNLGGD 85 Query: 494 VPLKNLDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAEFKDLEHCFKVXXXXXXXX 315 VPL N DR+VSSFAMQLKY K + D EF++L F+ Sbjct: 86 VPLLNPDRSVSSFAMQLKYAKSPLVEKT------VSDCCGTEFENLGSYFQELGFCMMEL 139 Query: 314 XXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGKIRDGFRAN 135 LAR CD+AIGG E+E SLL+S AKGRLIHYHS +D ++K+ + + + R Sbjct: 140 GLCLARICDKAIGGNELELSLLDSRGAKGRLIHYHSHLDALLLKKHERSRTTSK---RRA 196 Query: 134 GMKKPEQLENANN----------QAELWQQWHYDYGIFTVLTAPMFMSASDQE 6 G KP + N+ + LWQQWHYDYGIFTVLT+PMF+ S E Sbjct: 197 GNVKPLEGSELNSIACDVNPGGIHSNLWQQWHYDYGIFTVLTSPMFILPSYSE 249 >ref|XP_006293017.1| hypothetical protein CARUB_v10019295mg [Capsella 
rubella] gi|482561724|gb|EOA25915.1| hypothetical protein CARUB_v10019295mg [Capsella rubella] Length = 431 Score = 158 bits (399), Expect = 3e-36 Identities = 100/245 (40%), Positives = 132/245 (53%), Gaps = 17/245 (6%) Frame = -2 Query: 686 QRLQSITKRIMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHN 507 +R Q I++ +M LG GPGL+ I V G++ L D R ILKEH+ Sbjct: 33 KRCQCISRNVMSALGPSGPGLLCITGVLGSALLRRQLLPMARKLALLVPDKRIRILKEHH 92 Query: 506 LGSDVPLKNLDRTVSSFAMQLKYEKLGSERS------EGQETLPM-DDSSDAEFKDLEHC 348 LGSDV LKN R VSSFAMQL +E+ E TL + ++ D EF +L Sbjct: 93 LGSDVSLKNPLRDVSSFAMQLNFERTSKSSQGKLWFHEASPTLDLKEEGDDDEFTNLGAA 152 Query: 347 FKVXXXXXXXXXXXLARACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQV--S 174 FK +AR CDR IGG +E+SLLESC+AK RLIHYHS D +++ S Sbjct: 153 FKGLGFCMRELGLSIARICDREIGGGFLEDSLLESCTAKARLIHYHSAADKRALREAERS 212 Query: 173 KRKGK-IRDGFRANGMKKPEQLENANNQA------ELWQQWHYDYGIFTVLTAPMFMSA- 18 + GK + R + + +++ N LWQQWHYDYGIFT+LT PMF+S+ Sbjct: 213 NQSGKRVSSKTRVHNAAEQQEVNRRNGDGLSGSHFNLWQQWHYDYGIFTLLTDPMFLSSY 272 Query: 17 SDQEC 3 S Q+C Sbjct: 273 SYQDC 277 >ref|XP_004495174.1| PREDICTED: uncharacterized protein LOC101496515 [Cicer arietinum] Length = 395 Score = 157 bits (397), Expect = 5e-36 Identities = 94/225 (41%), Positives = 123/225 (54%), Gaps = 6/225 (2%) Frame = -2 Query: 659 IMENLGRRGPGLIAIKSVPGASXXXXXXXXXXXXXXXLNHDHRKCILKEHNLGSDVPLKN 480 IME LG GPGL+A+ +P + L+ R ILKEHNLGSDVPLK Sbjct: 28 IMEALGASGPGLLAVTGIPNVTNLRSYLLPLARKLALLDRQTRNRILKEHNLGSDVPLKI 87 Query: 479 LDRTVSSFAMQLKYEKLGSERSEGQETLPMDDSSDAEFKDLEHCFKVXXXXXXXXXXXLA 300 R+VSSFAM+L Y K S+ +G + F++L + F+ LA Sbjct: 88 PHRSVSSFAMKLNYAKTCSQDKDGTQCYGNG------FENLGNAFQELGFCMMEVGLCLA 141 Query: 299 RACDRAIGGREVEESLLESCSAKGRLIHYHSTIDNAIIKQVSKRKGKIRDGFRANGMKKP 120 R CD+AIGG E+E+SLLES +AKGRLIHYHS D+ ++Q+ K + ++ N +K Sbjct: 142 RVCDKAIGGNELEQSLLESNAAKGRLIHYHSHFDSIFLQQLDINKRRAKN----NNIKSL 197 Query: 119 EQLENANNQA------ELWQQWHYDYGIFTVLTAPMFMSASDQEC 3 E+ + A LWQQWHYDYGIFTVLT P F + C Sbjct: 198 
EEGPCLKSTACDAVHSNLWQQWHYDYGIFTVLTTPFFTTQDSSTC 242