BLASTX nr result
ID: Catharanthus23_contig00022554
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00022554
         (1561 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_006361793.1| PREDICTED: uncharacterized protein LOC102587...   389   e-105
ref|XP_004246921.1| PREDICTED: uncharacterized protein LOC101247...   379   e-102
ref|XP_006361794.1| PREDICTED: uncharacterized protein LOC102587...   376   e-101
gb|EOY07780.1| Retinitis pigmentosa 1-like 1 protein, putative [...   287   1e-74
ref|XP_002275174.1| PREDICTED: uncharacterized protein LOC100255...   284   8e-74
ref|XP_006428834.1| hypothetical protein CICLE_v10011834mg [Citr...   281   7e-73
ref|XP_006480646.1| PREDICTED: putative leucine-rich repeat-cont...   278   3e-72
ref|XP_004306237.1| PREDICTED: uncharacterized protein LOC101307...   273   1e-70
emb|CAN68658.1| hypothetical protein VITISV_015697 [Vitis vinifera]   271   7e-70
ref|XP_002525171.1| conserved hypothetical protein [Ricinus comm...   268   5e-69
ref|XP_006381484.1| hypothetical protein POPTR_0006s13270g [Popu...   262   3e-67
ref|XP_006361795.1| PREDICTED: uncharacterized protein LOC102587...   235   2e-65
ref|XP_003541404.1| PREDICTED: uncharacterized protein LOC100799...   231   8e-58
gb|EXC16256.1| hypothetical protein L484_024430 [Morus notabilis]     230   1e-57
ref|XP_006588767.1| PREDICTED: uncharacterized protein LOC102659...   222   4e-55
gb|ESW16537.1| hypothetical protein PHAVU_007G164700g [Phaseolus...   220   1e-54
ref|NP_001189849.1| uncharacterized protein [Arabidopsis thalian...   211   5e-52
gb|AAF23302.1|AC016661_27 hypothetical protein [Arabidopsis thal...
210 2e-51 ref|NP_187584.2| uncharacterized protein [Arabidopsis thaliana] ... 210 2e-51 ref|XP_002882635.1| hypothetical protein ARALYDRAFT_317774 [Arab... 204 6e-50 >ref|XP_006361793.1| PREDICTED: uncharacterized protein LOC102587347 isoform X1 [Solanum tuberosum] Length = 486 Score = 389 bits (1000), Expect = e-105 Identities = 238/492 (48%), Positives = 303/492 (61%), Gaps = 52/492 (10%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVSVSIQNDQKC--LNE------SLQSQDSVF 1389 MW L+ AA AGSG LAK+ IFN + T+P+S S +D KC LN+ S Q +DS+F Sbjct: 1 MWPALVVAAAAGSGFLAKK-IFNQNATEPISGSTASDSKCDKLNDPEEFMTSFQHKDSIF 59 Query: 1388 WG------NDDGVQESTL-------------------KEQS-----------------DG 1335 N G ++S K+ S DG Sbjct: 60 TSICDKPFNPQGHKDSIFISNFDKPFDPEGLNTPFQHKDSSFVCNLGCNIQEKCEGFFDG 119 Query: 1334 SIFRFSSTSGAEKSSNNKNVRKKIRGV-SSRGKMEGLKENSKGNAAKKCG-VSGCEKRGW 1161 SIFRFSS SG+E K +K + G ++G + K S G KCG V EK Sbjct: 120 SIFRFSSASGSEMGF-RKLRKKNVEGSRKTKGNVMEWKGKSGGKLGGKCGNVRSGEKELV 178 Query: 1160 VVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMYMMSAGKAEI 981 +DQR NGK+ VCLKKRRT+K+ SGKC SC SK NS F +GL +G+M MMSAGK+EI Sbjct: 179 RLDQRKRRNGKRFYVCLKKRRTNKVPSGKCDSCASKGNSFFGYGLGIGMMCMMSAGKSEI 238 Query: 980 SKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVAQSVVKKPIS 801 ++LN MDET+K V+ELKAELSR++ N A+ + ++ KNNRE ++ + Sbjct: 239 NRLNTTMDETAKAVEELKAELSRKRVAHNLCASK--NEGDIDEKNNRECRIHAIAESNNE 296 Query: 800 EKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQKLPWSTMECLGFE 621 + + + AEEGEC+SSVVTEE QPE EMDQLEAELE+EL KLPW + E Sbjct: 297 NRNI-YRALDLQVAEEGECASSVVTEELQPEVMEMDQLEAELESELLKLPWCSTEDTDLN 355 Query: 620 DRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQQEGQIMELESKLQ 441 D +D+ +EF++ DD N ++Q SGVLPS LDQKLCHLLIEQQEGQI+ELES+L+ Sbjct: 356 GGRDPCQDDFLEKEFNQADDRNAETYQCSGVLPSELDQKLCHLLIEQQEGQIVELESELR 415 Query: 440 RTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTSLKDQEMELGTELGS 261 +THSKL+EKE ELQ LKDCV+RLTEFSLGNASDEETD ++EDE DQE ++G E G Sbjct: 416 QTHSKLHEKETELQALKDCVRRLTEFSLGNASDEETDGKMEDEIIGGGDQEKKIGPESG- 474 
Query: 260 RSMVGMKRAMDF 225 +S+VGMKR+M F Sbjct: 475 KSIVGMKRSMIF 486 >ref|XP_004246921.1| PREDICTED: uncharacterized protein LOC101247361 [Solanum lycopersicum] Length = 478 Score = 379 bits (974), Expect = e-102 Identities = 231/491 (47%), Positives = 300/491 (61%), Gaps = 51/491 (10%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVSVSIQNDQKC--LNE------SLQSQDSVF 1389 MW L+AAA AGSG LAK+ I N + T+P+S S ++D KC LN+ S Q +DS+F Sbjct: 1 MWPALVAAAAAGSGFLAKK-ILNQNATEPISGSTESDSKCDKLNDPEEFMTSFQHKDSIF 59 Query: 1388 ----------WGNDDGVQESTLKEQ--------------------------------SDG 1335 G+ D + ST + SDG Sbjct: 60 TSICDKPFNPQGHKDSIFISTFDKHFDPEGTNTPFQHKDSNFICNLGYSIQEKSEGFSDG 119 Query: 1334 SIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKCG-VSGCEKRGWV 1158 SIFRFSS +E +N+RKK V K +G KG + KCG V EK + Sbjct: 120 SIFRFSSACDSEMGF--RNLRKK--NVEGSRKTKGNVMEWKGKSGGKCGNVRSGEKELFR 175 Query: 1157 VDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMYMMSAGKAEIS 978 +D+R GNGK+ VCLKKRRT+K+ SGKC SC SK NS F +GL +G+M MMSAGK+EI+ Sbjct: 176 LDERKRGNGKRFYVCLKKRRTNKVPSGKCDSCASKGNSFFGYGLGIGMMCMMSAGKSEIN 235 Query: 977 KLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVAQSVVKKPISE 798 +LN MDET+K V+ELKAELSR+K N A+ ++++ KNNRE + + Sbjct: 236 RLNTTMDETAKAVEELKAELSRKKVAHNLCASK--NEVDMDEKNNRECRIHVIAENNNEN 293 Query: 797 KGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQKLPWSTMECLGFED 618 + + + AEEGEC+SSV+TEE QPE EMDQLEAELE+EL KLPW E + Sbjct: 294 RNI-YRALDLQVAEEGECASSVITEEPQPEVMEMDQLEAELESELLKLPWCATEVMDLNG 352 Query: 617 RTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQQEGQIMELESKLQR 438 D +DE +EF++ DD N ++ +GVLPS LDQKLCHLLIEQQEGQI+ELES+L++ Sbjct: 353 GRDPCQDEFLEKEFNQADDRNAETYLCNGVLPSELDQKLCHLLIEQQEGQIVELESELRQ 412 Query: 437 THSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTSLKDQEMELGTELGSR 258 THSKL+EKE ELQ LKDCV+RLTEFSL DEETD ++EDE DQE ++ E+G + Sbjct: 413 THSKLHEKEAELQALKDCVRRLTEFSL----DEETDGKMEDEIIVGGDQEKKIEPEVG-K 467 Query: 257 SMVGMKRAMDF 225 S++GMKR+M F Sbjct: 468 SIIGMKRSMIF 478 >ref|XP_006361794.1| PREDICTED: 
uncharacterized protein LOC102587347 isoform X2 [Solanum tuberosum] Length = 482 Score = 376 bits (965), Expect = e-101 Identities = 234/492 (47%), Positives = 299/492 (60%), Gaps = 52/492 (10%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVSVSIQNDQKC--LNE------SLQSQDSVF 1389 MW L+ AA AGSG LAK+ IFN + T+P+S S +D KC LN+ S Q +DS+F Sbjct: 1 MWPALVVAAAAGSGFLAKK-IFNQNATEPISGSTASDSKCDKLNDPEEFMTSFQHKDSIF 59 Query: 1388 WG------NDDGVQESTL-------------------KEQS-----------------DG 1335 N G ++S K+ S DG Sbjct: 60 TSICDKPFNPQGHKDSIFISNFDKPFDPEGLNTPFQHKDSSFVCNLGCNIQEKCEGFFDG 119 Query: 1334 SIFRFSSTSGAEKSSNNKNVRKKIRGV-SSRGKMEGLKENSKGNAAKKCG-VSGCEKRGW 1161 SIFRFSS SG+E K +K + G ++G + K S G KCG V EK Sbjct: 120 SIFRFSSASGSEMGF-RKLRKKNVEGSRKTKGNVMEWKGKSGGKLGGKCGNVRSGEKELV 178 Query: 1160 VVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMYMMSAGKAEI 981 +DQR NGK+ VCLKKRRT+K+ SGKC SC SK NS F +GL +G+M MMSAGK+EI Sbjct: 179 RLDQRKRRNGKRFYVCLKKRRTNKVPSGKCDSCASKGNSFFGYGLGIGMMCMMSAGKSEI 238 Query: 980 SKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVAQSVVKKPIS 801 ++LN MDET+K V+ELKAELSR++ N A+ + ++ KNNRE ++ + Sbjct: 239 NRLNTTMDETAKAVEELKAELSRKRVAHNLCASK--NEGDIDEKNNRECRIHAIAESNNE 296 Query: 800 EKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQKLPWSTMECLGFE 621 + + + AEEGEC+SSVVTEE QPE EMDQLEAELE+EL KLPW + E Sbjct: 297 NRNI-YRALDLQVAEEGECASSVVTEELQPEVMEMDQLEAELESELLKLPWCSTEDTDLN 355 Query: 620 DRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQQEGQIMELESKLQ 441 D +D+ +EF++ DD N ++Q SGVLPS LDQKLCHLLIEQQEGQI+ELES+L+ Sbjct: 356 GGRDPCQDDFLEKEFNQADDRNAETYQCSGVLPSELDQKLCHLLIEQQEGQIVELESELR 415 Query: 440 RTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTSLKDQEMELGTELGS 261 +THSKL+EKE ELQ LKDCV+RLTEFSL DEETD ++EDE DQE ++G E G Sbjct: 416 QTHSKLHEKETELQALKDCVRRLTEFSL----DEETDGKMEDEIIGGGDQEKKIGPESG- 470 Query: 260 RSMVGMKRAMDF 225 +S+VGMKR+M F Sbjct: 471 KSIVGMKRSMIF 482 >gb|EOY07780.1| Retinitis pigmentosa 1-like 1 protein, putative [Theobroma cacao] Length = 414 Score = 287 bits (734), Expect = 1e-74 
Identities = 195/460 (42%), Positives = 264/460 (57%), Gaps = 21/460 (4%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIF-NSDGTQPVSV---SIQNDQKCLNESLQSQDSVFWGN 1380 MWQ+LL AAVAGS G+LAK L N + P+S + NDQ+ + LQ Q+ Sbjct: 1 MWQILLGAAVAGSTGLLAKHLFNPNPNPNNPISQGNPNSDNDQE--KQDLQFQNGFL--- 55 Query: 1379 DDGVQESTLKEQSDGSIFRFSSTSGAEKS---SNNKNVRKKIRGVSSRGKMEGLKENSKG 1209 + G + + + IFRFSS+ A K+ + ++N+RKK+ LK+ K Sbjct: 56 ESGCESNGEDKGKQDGIFRFSSSESAGKTGVKTKDRNLRKKVV----------LKKAEK- 104 Query: 1208 NAAKKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKA------- 1050 + G G E + +K++VCLKKRRT+K + KC SC SK Sbjct: 105 ---RSSGAGGVEV-----------SRRKLAVCLKKRRTAKNVAYKCGSCPSKGEEKEGCW 150 Query: 1049 --NSVFSWGLSVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANV 876 +SVF WGL GIMYMMSAGKAEI+KLN+ MDET+K+VQ+LK EL +RK+ N A+N Sbjct: 151 WDSSVFRWGLGFGIMYMMSAGKAEINKLNSTMDETAKVVQDLKTELCKRKSSCNVRASNS 210 Query: 875 PTKLELQPKNNREMVAQSVVKKP--ISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPE-- 708 ++ K Q ++ K ++ +KV + P ++GE +SSV+TEE +PE Sbjct: 211 ANEVTTGSKKFSGKNTQLLLDKSGTVNRDDNEIKVCSLPVIDDGEYASSVLTEEPEPEPE 270 Query: 707 AFEMDQLEAELETELQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGV 528 EMDQLEAELE+ELQKL E EVS + H + + S+Q GV Sbjct: 271 VGEMDQLEAELESELQKLS----------------ETEVSTKSLHESVGQRSDSYQCQGV 314 Query: 527 LPSVLDQKLCHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNA 348 LPS LDQKLCHLLIEQQE QI ELES+L SKL +KE ELQ LKDCV+RLT FSL Sbjct: 315 LPSELDQKLCHLLIEQQENQIEELESELSSAQSKLRDKEAELQALKDCVRRLTNFSLSTV 374 Query: 347 SDEETDIQVEDEKTSLKDQEMELGTELGSRSMVGMKRAMD 228 SD++T+ Q E + + +D ++ G E +S+VGMKR ++ Sbjct: 375 SDDDTEAQGEQARMNDQDCPIKSGLET-RKSLVGMKRPIE 413 >ref|XP_002275174.1| PREDICTED: uncharacterized protein LOC100255082 [Vitis vinifera] Length = 396 Score = 284 bits (726), Expect = 8e-74 Identities = 191/450 (42%), Positives = 243/450 (54%), Gaps = 8/450 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVS-----VSIQNDQKCLNESLQSQDSVFWGN 1380 MWQVLLAAAVAGSGI AK L N++ VS +Q KC Q S Sbjct: 1 
MWQVLLAAAVAGSGIFAKNLFSNNNADPTVSPPPPQAELQTHDKCDQNHQQRSSSRSASK 60 Query: 1379 DDGVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAA 1200 + + IFRFSS S +K R + RG + ++EG Sbjct: 61 EVSQSNQSGVVGEAEEIFRFSS------SGVSKKPRTRPRGFKKKVEVEG---------- 104 Query: 1199 KKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSV 1020 +S+ KKRRT K AS K +S S F WGL V Sbjct: 105 -------------------------ISLSFKKRRTGKAASVK----SSTDGSSFVWGLGV 135 Query: 1019 GIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNR 840 G+MYMMSAGKAEISKLN +MDET+K+VQELK EL +RK+ N ++ ++ + PK R Sbjct: 136 GMMYMMSAGKAEISKLNTSMDETAKVVQELKTELYKRKSSRNLQVSSFSSEADTSPKKIR 195 Query: 839 EMVAQSVVKKPISEKGVP--VKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETE 666 V+ K + P + + +FP ++GE +SSV+TEE +PE EMDQLEAELE+E Sbjct: 196 GKHTAQVLAKSSTGNQDPNEINISSFPVIDDGEYASSVLTEEPRPEVLEMDQLEAELESE 255 Query: 665 LQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLI 486 LQKLP + E+ D ++ +FH GVLP+ LDQKLCH+LI Sbjct: 256 LQKLPSCATDAPDCEEIRPDLGDNSNSYQFH-------------GVLPAELDQKLCHVLI 302 Query: 485 EQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKT 306 EQQE QI++LES+L SKL+EKE ELQ LKDCVKRLTEFSL SD+E + QVE ++ Sbjct: 303 EQQENQIVDLESELHLAQSKLHEKEAELQALKDCVKRLTEFSLSTVSDDEAESQVEQKR- 361 Query: 305 SLKDQEMELGTELGS-RSMVGMKRAMDFES 219 L D T GS RS VGMKRA+D E+ Sbjct: 362 -LIDGNSNNDTGNGSKRSAVGMKRALDSEA 390 >ref|XP_006428834.1| hypothetical protein CICLE_v10011834mg [Citrus clementina] gi|568854043|ref|XP_006480645.1| PREDICTED: putative leucine-rich repeat-containing protein DDB_G0290503-like isoform X1 [Citrus sinensis] gi|557530891|gb|ESR42074.1| hypothetical protein CICLE_v10011834mg [Citrus clementina] Length = 419 Score = 281 bits (718), Expect = 7e-73 Identities = 192/454 (42%), Positives = 256/454 (56%), Gaps = 13/454 (2%) Frame = -3 Query: 1544 MWQVLLAAAVA-GSGILAKRLIFNSDGTQ------PVSVSIQNDQKCLNESLQSQDSVFW 1386 MWQ+LLAAA A GS L + +F S G Q P V ++ + + L + V Sbjct: 1 MWQLLLAAAAAAGSTSLVAKYLFGSQGDQNEKERNPFDVDVKYHSHVVQKGLNASGCV-- 58 Query: 
1385 GNDDGVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGN 1206 S ++Q IFRFSS + +S +K +RK+ + V S + G Sbjct: 59 --------SNCEKQE--GIFRFSSPESSGGNSGSKILRKEKKKVKSEKRSAG-------- 100 Query: 1205 AAKKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGL 1026 G KR + V+VCLKKRRTSK + + + +SK +S+F+WGL Sbjct: 101 -----GGVELNKR------------RAVAVCLKKRRTSKNGAAERGASSSKDSSLFNWGL 143 Query: 1025 SVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKN 846 VG+MYM+SA KAEISKLN MDET+K+VQELK+EL RRK +S + V T + +N Sbjct: 144 GVGMMYMISASKAEISKLNTTMDETAKVVQELKSELHRRK---SSCSVLVHTSANEETEN 200 Query: 845 NREMVA---QSVVKKPISEKG--VPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEA 681 ++ + Q V+ K S KV P ++ EC+SSV+TEE+ E EMD+LEA Sbjct: 201 LEKITSNKTQQVLFKSRSGNRDLSDQKVLGLPLIDDSECTSSVLTEERDAEVLEMDKLEA 260 Query: 680 ELETELQKLPWSTMECLGFED-RTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQK 504 ELE+ELQKLPW + E ED + + E +VS EFH + ++ +Q GV PS LDQK Sbjct: 261 ELESELQKLPWYSTESSYHEDMKLNLHETKVSTMEFHEAEGQSSDFYQSHGVSPSELDQK 320 Query: 503 LCHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQ 324 L HLLI+QQE QIM+LES+L SKL EKE ELQ LKDCVKRLTEFSL + +E IQ Sbjct: 321 LSHLLIKQQENQIMDLESELHSAQSKLGEKENELQALKDCVKRLTEFSLSSIPGDE--IQ 378 Query: 323 VEDEKTSLKDQEMELGTELGSRSMVGMKRAMDFE 222 +E+ + EL SRS+VGMKR + E Sbjct: 379 AREEQDCASKWDCSNEVELQSRSLVGMKRPVGTE 412 >ref|XP_006480646.1| PREDICTED: putative leucine-rich repeat-containing protein DDB_G0290503-like isoform X2 [Citrus sinensis] Length = 417 Score = 278 bits (712), Expect = 3e-72 Identities = 192/453 (42%), Positives = 253/453 (55%), Gaps = 12/453 (2%) Frame = -3 Query: 1544 MWQVLLAAAVA-GSGILAKRLIFNSDGTQ------PVSVSIQNDQKCLNESLQSQDSVFW 1386 MWQ+LLAAA A GS L + +F S G Q P V ++ + + L + V Sbjct: 1 MWQLLLAAAAAAGSTSLVAKYLFGSQGDQNEKERNPFDVDVKYHSHVVQKGLNASGCV-- 58 Query: 1385 GNDDGVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGN 1206 S ++Q IFRFSS + +S +K +RK+ + V S + G Sbjct: 59 --------SNCEKQE--GIFRFSSPESSGGNSGSKILRKEKKKVKSEKRSAG-------- 100 Query: 1205 
AAKKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGL 1026 G KR + V+VCLKKRRTSK + + + +SK +S+F+WGL Sbjct: 101 -----GGVELNKR------------RAVAVCLKKRRTSKNGAAERGASSSKDSSLFNWGL 143 Query: 1025 SVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKN 846 VG+MYM+SA KAEISKLN MDET+K+VQELK+EL RRK +S + V T + +N Sbjct: 144 GVGMMYMISASKAEISKLNTTMDETAKVVQELKSELHRRK---SSCSVLVHTSANEETEN 200 Query: 845 NREMVA---QSVVKKPISEKG--VPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEA 681 ++ + Q V+ K S KV P ++ EC+SSV+TEE+ E EMD+LEA Sbjct: 201 LEKITSNKTQQVLFKSRSGNRDLSDQKVLGLPLIDDSECTSSVLTEERDAEVLEMDKLEA 260 Query: 680 ELETELQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKL 501 ELE+ELQKLPW + E ED EVS EFH + ++ +Q GV PS LDQKL Sbjct: 261 ELESELQKLPWYSTESSYHEDMKLNLH-EVSTMEFHEAEGQSSDFYQSHGVSPSELDQKL 319 Query: 500 CHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQV 321 HLLI+QQE QIM+LES+L SKL EKE ELQ LKDCVKRLTEFSL + +E IQ Sbjct: 320 SHLLIKQQENQIMDLESELHSAQSKLGEKENELQALKDCVKRLTEFSLSSIPGDE--IQA 377 Query: 320 EDEKTSLKDQEMELGTELGSRSMVGMKRAMDFE 222 +E+ + EL SRS+VGMKR + E Sbjct: 378 REEQDCASKWDCSNEVELQSRSLVGMKRPVGTE 410 >ref|XP_004306237.1| PREDICTED: uncharacterized protein LOC101307916 [Fragaria vesca subsp. 
vesca] Length = 411 Score = 273 bits (698), Expect = 1e-70 Identities = 186/451 (41%), Positives = 260/451 (57%), Gaps = 9/451 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQ-DSVFWGNDDG 1371 MW L AAAVA S G+LAK + + + QND N + +S SV D Sbjct: 1 MWPALFAAAVAASTGLLAKNHL-SKRASVITDSDPQNDAVVENATPRSPVGSVSRPPWDS 59 Query: 1370 VQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKC 1191 E +KE IFRFSS G+ + +R K K N A++ Sbjct: 60 DCEDQVKE----GIFRFSSDCGSGEKKKKTKLRLK----------------KKKNVAEE- 98 Query: 1190 GVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKAN--SVFSWGLSVG 1017 Q+ +G++V VCLK+R+T KC S ++ + S+F+WGL++G Sbjct: 99 ------------QQQKKKSGRRVGVCLKRRKT---VPSKCGSSSNPKDTTSLFNWGLNIG 143 Query: 1016 IMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNRE 837 IMYMM+AGKAEI+KLN MDET+K+VQELK+EL++RK + + ++ Q ++ Sbjct: 144 IMYMMTAGKAEINKLNTTMDETAKVVQELKSELNKRKASQSQQVSCSESEANTQYQSPSC 203 Query: 836 MVAQSVVKKPISEKGVP--VKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETEL 663 +Q+ + K +E G P ++ +F ++ EC SSV+TE+ +PE +MDQLEAELE+EL Sbjct: 204 KRSQTELTKSSAEYGEPNYMRASSFQISDV-ECPSSVLTEDPEPEVMDMDQLEAELESEL 262 Query: 662 QKLPWSTMECL---GFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHL 492 QKLPW + E GF + F + SA++FH + + Q GVLP+ LDQKLCH+ Sbjct: 263 QKLPWCSTEAPQQEGFRNLEKGFVSDDSAQKFHGQAAKGIVIEQFQGVLPAELDQKLCHV 322 Query: 491 LIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDE 312 LIEQQE QI+ELES LQ SKL EKE ELQ LKDCVKRLT+ +L SD+E + E E Sbjct: 323 LIEQQESQIVELESGLQSAQSKLQEKETELQALKDCVKRLTQLNLSTVSDDENEAHNEQE 382 Query: 311 KTSLKDQEMELGTELGSRSMVGMKRAMDFES 219 +T+ + ME G+E +S+VGMKR MD+E+ Sbjct: 383 QTTHWNYNME-GSE-SLKSVVGMKRPMDYEA 411 >emb|CAN68658.1| hypothetical protein VITISV_015697 [Vitis vinifera] Length = 743 Score = 271 bits (692), Expect = 7e-70 Identities = 177/409 (43%), Positives = 223/409 (54%), Gaps = 9/409 (2%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVS-----VSIQNDQKCLNESLQSQDSVFWGN 1380 MWQVLLAAAVAGSGI AK L N++ VS +Q KC Q S Sbjct: 1 
MWQVLLAAAVAGSGIFAKNLFSNNNADPTVSPPPPQAELQTHDKCDQNHQQRSSSRSASK 60 Query: 1379 DDGVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAA 1200 + + IFRFSS S +K R + RG + ++EG Sbjct: 61 EVSQSNQSGVVGEAEEIFRFSS------SGVSKKPRTRPRGFKKKVEVEG---------- 104 Query: 1199 KKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSV 1020 +S+ KKRRT K AS K +S S F WGL V Sbjct: 105 -------------------------ISLSFKKRRTGKAASVK----SSTDGSSFVWGLGV 135 Query: 1019 GIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNR 840 G+MYMMSAGKAEISKLN +MDET+K+VQELK EL +RK+ N ++ ++ + PK R Sbjct: 136 GMMYMMSAGKAEISKLNTSMDETAKVVQELKTELYKRKSSRNLQVSSFSSEADTSPKKIR 195 Query: 839 EMVAQSVVKKPISEKGVP--VKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETE 666 V+ K + P + + + P ++GE +SSV+TEE +PE EMDQLEAELE+E Sbjct: 196 GKHTAQVLAKSSTGNQDPNEINISSXPVIDDGEYASSVLTEEPRPEVLEMDQLEAELESE 255 Query: 665 LQKLPWSTMECLGFEDRTDTFED--EVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHL 492 LQKLP + E+ D EVSA+ FH + +N S+Q GVLP LDQKLCH+ Sbjct: 256 LQKLPCCATDAPDSEEIRPDLGDTREVSAKGFHELEGQNSNSYQFHGVLPXELDQKLCHV 315 Query: 491 LIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNAS 345 LIEQQE QI++LES+L SKL+EKE ELQ LKDCVKRLTEFSL S Sbjct: 316 LIEQQENQIVDLESELHLAQSKLHEKEAELQALKDCVKRLTEFSLSTVS 364 >ref|XP_002525171.1| conserved hypothetical protein [Ricinus communis] gi|223535468|gb|EEF37137.1| conserved hypothetical protein [Ricinus communis] Length = 381 Score = 268 bits (685), Expect = 5e-69 Identities = 188/443 (42%), Positives = 249/443 (56%), Gaps = 4/443 (0%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MWQ+LLAAAVAGS G +AK L+ + D ++ Q+ +S+ + D V +D Sbjct: 1 MWQLLLAAAVAGSTGFVAKHLLHHHDEHS------KDKQEPPQDSIATPDVV----NDAN 50 Query: 1367 QESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKCG 1188 ++ D S+FRFSS+ A SS +K +RK KG K C Sbjct: 51 KQCGFGTSFDQSVFRFSSS--ASSSSASKKIRKN--------------SAIKGRRLKFCT 94 Query: 1187 VSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMY 1008 ++ E + Q G+ ++ +VCLKKRRT+K ++K ++F WGL 
VGIMY Sbjct: 95 LNEKENQRSTASQ---GSARRFAVCLKKRRTAKCGP---TVSSNKETTLFGWGLGVGIMY 148 Query: 1007 MMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVA 828 MMSAGK+EISKL+NAMDETSK+VQEL+ EL +RK+ + + K+N++MV Sbjct: 149 MMSAGKSEISKLSNAMDETSKVVQELRTELYKRKSARSE---------GISFKHNQQMVN 199 Query: 827 QSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQKLPW 648 S + K + V ++ EC SSV+TEE P+ MDQLEAEL +ELQKLPW Sbjct: 200 GSGTEDR-DPKDIKV--------DDIECPSSVLTEEPDPQVLNMDQLEAELASELQKLPW 250 Query: 647 STMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQQEGQ 468 S E G E + VS +Q G+LPS LDQKLCHLLIEQQE Q Sbjct: 251 SDTETSGNEGGPSMVKSPVS--------------YQCHGILPSELDQKLCHLLIEQQENQ 296 Query: 467 IMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASD--EETDIQVEDEKTSLKD 294 I ELES+L SKLNEKE ELQ LKDCV+RLTEFSL S +E +IQV + S D Sbjct: 297 IEELESELHTAQSKLNEKEAELQALKDCVRRLTEFSLSTVSGKYDEPEIQVAQDCISEWD 356 Query: 293 QEMELGTELGSR-SMVGMKRAMD 228 ++ ++ R S+VGMKR +D Sbjct: 357 NNNKIESQSEPRKSVVGMKRPID 379 >ref|XP_006381484.1| hypothetical protein POPTR_0006s13270g [Populus trichocarpa] gi|550336187|gb|ERP59281.1| hypothetical protein POPTR_0006s13270g [Populus trichocarpa] Length = 395 Score = 262 bits (669), Expect = 3e-67 Identities = 184/444 (41%), Positives = 248/444 (55%), Gaps = 5/444 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MW +LL AA+AGS G +AK + N S S + +Q+ + + DS + Sbjct: 1 MWNLLLTAAIAGSTGFIAKHVFTNHH-----SPSEKYEQEENLQDSPAFDSPLVTKECHG 55 Query: 1367 QESTLKEQSDGSIFRFSST-SGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKC 1191 ES ++ IFRFSS+ SG+ + K+ K+ GVS R +++ EN K + K Sbjct: 56 YESNCDQEG---IFRFSSSASGSRGKNKKKSCLKEKSGVSCR-RLKFATENVKRSGGLK- 110 Query: 1190 GVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIM 1011 G +G+K CLKK+RT +S+F GL VGIM Sbjct: 111 ----------------GTSGRKSCACLKKKRTE--------------SSLFGRGLGVGIM 140 Query: 1010 YMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMV 831 YMMSAGKAEISKL+ AMDETSK + EL+ EL +RK+ T+L +N Sbjct: 141 
YMMSAGKAEISKLSTAMDETSKTIHELRTELYKRKS----------TQLATSSENISTEQ 190 Query: 830 AQSVVKK--PISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQK 657 Q VV + + +K+ A++ EC SSV+TEE +P EMDQLEAE E+ELQK Sbjct: 191 MQLVVNRISMVDRDPNDMKLCGLTMADDVECPSSVLTEEPEPAVLEMDQLEAEFESELQK 250 Query: 656 LPWSTMECLGFE-DRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQ 480 LPWS+ E G E R + + EVS+ F + + S+Q GVLPS LD KLCHLLIEQ Sbjct: 251 LPWSSTETSGHEITRLNLGKAEVSSEGFCELEGADDVSYQRDGVLPSELDNKLCHLLIEQ 310 Query: 479 QEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTSL 300 QE QI LES+L S+L+EKE ELQ LKDCV+RLTEFSL SD+E ++Q E + Sbjct: 311 QENQITGLESELHLAQSQLHEKEAELQALKDCVRRLTEFSLSTISDDEVEVQPELGCNTE 370 Query: 299 KDQEMELGTELGSRSMVGMKRAMD 228 D+ ++G+E +S+VGMKR +D Sbjct: 371 WDKNSQVGSE-SRKSVVGMKRPID 393 >ref|XP_006361795.1| PREDICTED: uncharacterized protein LOC102587347 isoform X3 [Solanum tuberosum] Length = 390 Score = 235 bits (600), Expect(2) = 2e-65 Identities = 156/375 (41%), Positives = 205/375 (54%), Gaps = 52/375 (13%) Frame = -3 Query: 1544 MWQVLLAAAVAGSGILAKRLIFNSDGTQPVSVSIQNDQKC--LNE------SLQSQDSVF 1389 MW L+ AA AGSG LAK+ IFN + T+P+S S +D KC LN+ S Q +DS+F Sbjct: 1 MWPALVVAAAAGSGFLAKK-IFNQNATEPISGSTASDSKCDKLNDPEEFMTSFQHKDSIF 59 Query: 1388 WG------NDDGVQESTL-------------------KEQS-----------------DG 1335 N G ++S K+ S DG Sbjct: 60 TSICDKPFNPQGHKDSIFISNFDKPFDPEGLNTPFQHKDSSFVCNLGCNIQEKCEGFFDG 119 Query: 1334 SIFRFSSTSGAEKSSNNKNVRKKIRGV-SSRGKMEGLKENSKGNAAKKCG-VSGCEKRGW 1161 SIFRFSS SG+E K +K + G ++G + K S G KCG V EK Sbjct: 120 SIFRFSSASGSEMGF-RKLRKKNVEGSRKTKGNVMEWKGKSGGKLGGKCGNVRSGEKELV 178 Query: 1160 VVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMYMMSAGKAEI 981 +DQR NGK+ VCLKKRRT+K+ SGKC SC SK NS F +GL +G+M MMSAGK+EI Sbjct: 179 RLDQRKRRNGKRFYVCLKKRRTNKVPSGKCDSCASKGNSFFGYGLGIGMMCMMSAGKSEI 238 Query: 980 SKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVAQSVVKKPIS 801 ++LN MDET+K V+ELKAELSR++ N A+ + ++ KNNRE ++ + Sbjct: 239 NRLNTTMDETAKAVEELKAELSRKRVAHNLCASK--NEGDIDEKNNRECRIHAIAESNNE 296 Query: 
800 EKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQKLPWSTMECLGFE 621 + + + AEEGEC+SSVVTEE QPE EMDQLEAELE+EL KLPW + E Sbjct: 297 NRNI-YRALDLQVAEEGECASSVVTEELQPEVMEMDQLEAELESELLKLPWCSTEDTDLN 355 Query: 620 DRTDTFEDEVSAREF 576 D + ++A E+ Sbjct: 356 GGRDPCQKPINAVEY 370 Score = 43.1 bits (100), Expect(2) = 2e-65 Identities = 16/27 (59%), Positives = 22/27 (81%) Frame = -1 Query: 547 PINPVEYCRLCWIRNYAICSLNSRKVK 467 PIN VEYC L WI+++ ICSLN++K + Sbjct: 364 PINAVEYCHLNWIKSFVICSLNNKKAR 390 >ref|XP_003541404.1| PREDICTED: uncharacterized protein LOC100799571 [Glycine max] Length = 411 Score = 231 bits (588), Expect = 8e-58 Identities = 167/451 (37%), Positives = 239/451 (52%), Gaps = 5/451 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MW +LL AVAGS G KR + + +N + N +L + + + + + Sbjct: 1 MWHLLLLVAVAGSTGFATKRFLTHH----------RNTGEGENVNLHDPNELDFSGSESI 50 Query: 1367 QESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKCG 1188 ++ SDG +F FSS+ +S ++ K R +S+ + K ++ Sbjct: 51 SQT---HHSDG-VFTFSSSKS--ESLTQRDGPKSRRSRASKNGVRAPKVEARS------- 97 Query: 1187 VSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMY 1008 ++R GG ++ LKK +K + K +SK + +F WG GIMY Sbjct: 98 -----------EKRSGGT--RLHFRLKKGEITKNVAAKAPVRSSKDDCLFGWGFCFGIMY 144 Query: 1007 MMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMVA 828 MMSAGKAEI+KLN MDET+K+VQELK+EL+RRK+ S+A + + KN+ ++ Sbjct: 145 MMSAGKAEINKLNKTMDETAKLVQELKSELNRRKS---SHALQILDSVGNGVKNSCKISG 201 Query: 827 QSVV---KKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQK 657 ++ V I + V VK+ + + GEC SS +TEE +P+ EMDQLEAELE ELQK Sbjct: 202 RNEVMLKNTNIELRDVDVKICSPCVNDCGECGSSALTEEPEPQVLEMDQLEAELEFELQK 261 Query: 656 LPWSTMECLGFEDRTDTFEDEVSARE-FHRTDDENLYSHQPSGVLPSVLDQKLCHLLIEQ 480 L + E ++ + E +H TDD NL GV S L QKL HLLI+Q Sbjct: 262 LSGCATDGPCNEKIKPNLDELEAPNEGYHGTDDWNLNYSNSHGVSASELHQKLSHLLIKQ 321 Query: 479 QEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTSL 300 QE QI ELES+L + S L+EKE ELQ LKDCVK LTE SL SD+ET + 
+ TS Sbjct: 322 QENQIAELESELHQAQSNLHEKEAELQALKDCVKCLTELSLSTVSDDETQALTDPKGTS- 380 Query: 299 KDQEMELGTELGSRSMVGMKRAMDFESYHCF 207 D + + S++G KR +D ES+ C+ Sbjct: 381 -DYGNKNIHSVVKHSIIGTKRPLDSESFSCY 410 >gb|EXC16256.1| hypothetical protein L484_024430 [Morus notabilis] Length = 373 Score = 230 bits (587), Expect = 1e-57 Identities = 157/399 (39%), Positives = 218/399 (54%), Gaps = 14/399 (3%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLI----------FNSDGTQPVSVSIQNDQKCLNESLQSQD 1398 MWQ L+AAAVAG+ GI+AK + +N D ++ +++ + L+ S Sbjct: 1 MWQALVAAAVAGTTGIVAKHFLKPASDHAHRPYNGDDLGDQPINREDESRFLSASAL--- 57 Query: 1397 SVFWGNDDGVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKEN 1218 G++ + +++ E+ D IFRFSST G G + Sbjct: 58 ----GSELDLPRASVPEEED-EIFRFSST----------------------GSRVGRRNL 90 Query: 1217 SKGNAAKKCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVF 1038 SK A + GV G G QR G ++V VCLK+R+TSKI +GK +SC+S S+F Sbjct: 91 SKNKGALRKGVRGERSAG--SKQRSGE--RRVGVCLKRRKTSKIGAGKRESCSSNDTSLF 146 Query: 1037 SWGLSVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLEL 858 WGL VGIMYM+S GKAE KLN AM+ET+K+V ELKAEL +RK+ N + ++ L Sbjct: 147 GWGLGVGIMYMVSTGKAEFCKLNAAMNETAKVVDELKAELHKRKSSQNLQVSGCASEAML 206 Query: 857 QPKNNREMVAQSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAE 678 R + +K F+ P+ + EC+SSV+TEE P EMDQLEAE Sbjct: 207 ----TRHTTSYKDDHPDHKTGPNDIKSFSSPSMNDVECASSVLTEEPAPGMQEMDQLEAE 262 Query: 677 LETELQKLPWSTMECLGFEDRTDT-FEDEVSAREFHRTDDENLYSHQ--PSGVLPSVLDQ 507 LE ELQ+LPW T++ ++ + E E SA+ F D ++ HQ +G+LP+ LDQ Sbjct: 263 LEFELQQLPWCTLDASSNQEGSKALIETEFSAQAFPEQDRQDSDIHQLGGNGILPAELDQ 322 Query: 506 KLCHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLK 390 KLCHLLIE+QE QI ELES+L LNEKE + +K Sbjct: 323 KLCHLLIERQENQIEELESELHLAQHTLNEKEALGRAMK 361 >ref|XP_006588767.1| PREDICTED: uncharacterized protein LOC102659621 isoform X1 [Glycine max] gi|571481772|ref|XP_006588768.1| PREDICTED: uncharacterized protein LOC102659621 isoform X2 [Glycine max] gi|571481774|ref|XP_006588769.1| PREDICTED: uncharacterized protein LOC102659621 
isoform X3 [Glycine max] gi|571481776|ref|XP_006588770.1| PREDICTED: uncharacterized protein LOC102659621 isoform X4 [Glycine max] Length = 401 Score = 222 bits (565), Expect = 4e-55 Identities = 168/454 (37%), Positives = 234/454 (51%), Gaps = 8/454 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MW +LL AVAGS G KR + + + +V++ + S F D Sbjct: 1 MWPILLLVAVAGSTGFATKRFLTRRNTIEGENVNLHD------------PSAF---DFSS 45 Query: 1367 QESTLKEQSDGSIFRFSSTSGAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKCG 1188 EST + ++ +F FSS+ + + G SR + ++ G A K Sbjct: 46 SESTSQMHNNDGVFTFSSSKSESLTQQD--------GPESRRS----RASNDGVRAPKVE 93 Query: 1187 VSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIMY 1008 ++ G G K+ +KKR ++ + K +SK +S+F GL GIM Sbjct: 94 ARSEKRNG----------GTKLHFRMKKREITRNVAAKAPVRSSKDDSLFGRGLCSGIMC 143 Query: 1007 MMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPP-----NSYAANVPTKLELQPKNN 843 MMSAGKAEI+KLN MDET+K+VQELK+EL+RRK+ +S V ++ +N Sbjct: 144 MMSAGKAEINKLNKTMDETAKLVQELKSELNRRKSSHALQNLDSVGNGVTNSCKISGRN- 202 Query: 842 REMVAQSVVKKPISE-KGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETE 666 + ++KK SE + V VK+++ + GEC SS +TEE +P+ EMDQLEAELE E Sbjct: 203 -----EVMLKKTNSELRDVDVKIWSPCVNDCGECGSSALTEEPEPQVLEMDQLEAELEFE 257 Query: 665 LQKLPWSTMECLGFED-RTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLL 489 LQKL + E + + E E +H TD GV S L QKL HLL Sbjct: 258 LQKLSGCATDGPCDEKIKPNLDELEAPGEGYHGTD---------HGVSASELHQKLSHLL 308 Query: 488 IEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEK 309 I+QQE QIMELES+L + S L+EKE ELQ LKDCVKRLTE SL SD+E + + + Sbjct: 309 IKQQENQIMELESELHQAQSNLHEKEAELQALKDCVKRLTELSLSTVSDDEAQVLSDPKG 368 Query: 308 TSLKDQEMELGTELGSRSMVGMKRAMDFESYHCF 207 TS D S++G KR +D ES+ C+ Sbjct: 369 TS--DYGNNNMHSESKHSVIGTKRPLDSESFSCY 400 >gb|ESW16537.1| hypothetical protein PHAVU_007G164700g [Phaseolus vulgaris] Length = 399 Score = 220 bits (561), Expect = 1e-54 Identities = 162/449 (36%), Positives = 233/449 (51%), Gaps = 6/449 (1%) Frame = -3 Query: 1544 
MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MW +LL AVAGS G KR + N +N + N ++ S + Sbjct: 1 MWHLLLVVAVAGSTGFATKRFLSNH----------RNAGEVENANIDDPSSFTFATS--- 47 Query: 1367 QESTLKEQSDGSIFRFSSTS-GAEKSSNNKNVRKKIRGVSSRGKMEGLKENSKGNAAKKC 1191 EST Q DG +F FSS+ G KS ++ + ++R K+E E G Sbjct: 48 -EST--SQRDG-VFTFSSSQQGGPKSRRSRASKIRVRAP----KVEVRPEQMNG------ 93 Query: 1190 GVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRTSKIASGKCQSCTSKANSVFSWGLSVGIM 1011 G+++ LKKR +K + K K NS+F WGL +GIM Sbjct: 94 -------------------GRRLRFHLKKREIAKNVAAKAPFPCCKDNSLFDWGLFLGIM 134 Query: 1010 YMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQPKNNREMV 831 YMM AGKA+I+KLN ++ET+ +VQELK+E+++RK+ + + + +N+ ++ Sbjct: 135 YMMFAGKADINKLNKTLNETANLVQELKSEVNKRKSSCDLQNLD---SVGNGARNSSKIR 191 Query: 830 AQSVVKKPISE---KGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETELQ 660 ++V + + +K+++ + GEC SS +TEE +P+ EMDQLEAELE ELQ Sbjct: 192 GRNVAMHNNTNSELRDTDLKIWSPAVNDCGECGSSALTEEPEPQVLEMDQLEAELEFELQ 251 Query: 659 KLPWSTMECLGFEDRTDTFED-EVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLIE 483 KL T +E+ ++ E +H TDD+N + GV S L QKL HLLI+ Sbjct: 252 KLSGCTTGAPCYEETKPCLDEFEAPGEGYHGTDDQNFNYSESHGVSASELHQKLSHLLIK 311 Query: 482 QQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKTS 303 QQE QIMELES+L + S L+EKE ELQ LK+CVK LTE SL SD+ET + D K + Sbjct: 312 QQENQIMELESELHQAQSNLHEKEAELQALKNCVKHLTELSLSTVSDDETQ-ALSDPKGA 370 Query: 302 LKDQEMELGTELGSRSMVGMKRAMDFESY 216 + E S+VG KR +D ES+ Sbjct: 371 SDCGNNNIDFE-SKHSVVGTKRPLDSESW 398 >ref|NP_001189849.1| uncharacterized protein [Arabidopsis thaliana] gi|332641284|gb|AEE74805.1| uncharacterized protein AT3G09730 [Arabidopsis thaliana] Length = 405 Score = 211 bits (538), Expect = 5e-52 Identities = 172/454 (37%), Positives = 235/454 (51%), Gaps = 12/454 (2%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLI--FNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDD 1374 MWQV+L AA+AGS G +AKRL F+ D P + + + + S+ DS Sbjct: 1 MWQVILGAAIAGSTGFVAKRLFNPFSRDSPTPENYAEEQEPVTPPVSIGFLDSPC----- 55 Query: 1373 
GVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVR---KKIRGVSSRGKMEGLKENSKGNA 1203 ++++G +FRFSS+ S + +K GV R ++ GL + K Sbjct: 56 --------DKTNG-VFRFSSSGSTVNSGSGSGSSPGFRKSSGVKCRVRVRGLMKKKK--- 103 Query: 1202 AKKCGVSGCEKRGWVVDQRGGGNGK---KVSVCLKKRRTSKIASGKCQSCTSKANSVFSW 1032 K GCE +++R G G K VC KK +T AS SK +S FS Sbjct: 104 --KKNSGGCE-----IEKRSGNVGAFEMKSEVCSKKTKTLGAASA------SKHHSSFSS 150 Query: 1031 GLSVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQP 852 L V +MYMMSA K EISKL+ A +ET+K++QELK ELSR K+ E Sbjct: 151 ALGVCMMYMMSAEKGEISKLHAATEETTKVIQELKDELSRIKSLQGFKFRGCAASSEKSG 210 Query: 851 K---NNREMVAQSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEA 681 + N EMV S + +K + +GE +SSV+TEE + EA EM+QLE Sbjct: 211 QIDLNRSEMV---------SRVSLDIK-----SGNDGEYASSVLTEEPEQEAVEMEQLEM 256 Query: 680 ELETELQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKL 501 ELE+ELQKL + D + +D V+ E S+Q G+ S LD+KL Sbjct: 257 ELESELQKLNLAETS-----DVMEECKDLVNGAE----------SYQCGGISASELDKKL 301 Query: 500 CHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQV 321 HLLIEQQEGQI ELE++LQ T SKL EKE ELQ LK CV+RLTEF L + SD+E + + Sbjct: 302 SHLLIEQQEGQINELEAELQTTQSKLQEKEAELQALKVCVRRLTEFPLLDRSDDEHEEDL 361 Query: 320 EDEKTSLKDQEMELGTELGSRSMVGMKRAMDFES 219 + + Q + E + ++GMKR M+F S Sbjct: 362 NQDLSVSWSQHNKTDHE-ARKQIIGMKRPMEFVS 394 >gb|AAF23302.1|AC016661_27 hypothetical protein [Arabidopsis thaliana] Length = 405 Score = 210 bits (534), Expect = 2e-51 Identities = 173/458 (37%), Positives = 236/458 (51%), Gaps = 14/458 (3%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLI--FNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDD 1374 MWQV+L AA+AGS G +AKRL F+ D P + + + + S+ DS Sbjct: 1 MWQVILGAAIAGSTGFVAKRLFNPFSRDSPTPENYAEEQEPVTPPVSIGFLDSPC----- 55 Query: 1373 GVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVR---KKIRGVSSRGKMEGLKENSKGNA 1203 ++++G +FRFSS+ S + +K GV R ++ GL + K Sbjct: 56 --------DKTNG-VFRFSSSGSTVNSGSGSGSSPGFRKSSGVKCRVRVRGLMKKKK--- 103 Query: 1202 AKKCGVSGCEKRGWVVDQRGGGNGK---KVSVCLKKRRTSKIASG-KCQSCTSKAN-SVF 1038 K GCE +++R G G K VC KK +T AS K SC S + S F 
Sbjct: 104 --KKNSGGCE-----IEKRSGNVGAFEMKSEVCSKKTKTLGAASASKRGSCYSNQDHSSF 156 Query: 1037 SWGLSVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLEL 858 S L V +MYMMSA K EISKL+ A +ET+K++QELK ELSR K+ E Sbjct: 157 SSALGVCMMYMMSAEKGEISKLHAATEETTKVIQELKDELSRIKSLQGFKFRGCAASSEK 216 Query: 857 QPK---NNREMVAQSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQL 687 + N EMV S + +K + +GE +SSV+TEE + EA EM+QL Sbjct: 217 SGQIDLNRSEMV---------SRVSLDIK-----SGNDGEYASSVLTEEPEQEAVEMEQL 262 Query: 686 EAELETELQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQ 507 E ELE+ELQKL + D + +D V+ E S+Q G+ S LD+ Sbjct: 263 EMELESELQKLNLAETS-----DVMEECKDLVNGAE----------SYQCGGISASELDK 307 Query: 506 KLCHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDI 327 KL HLLIEQQEGQI ELE++LQ T SKL EKE ELQ LK CV+RLTEF L + SD+E + Sbjct: 308 KLSHLLIEQQEGQINELEAELQTTQSKLQEKEAELQALKVCVRRLTEFPLLDRSDDEHEE 367 Query: 326 QVEDEKTSLKDQEMELGTELGSRSMVGMKRAMDFESYH 213 + + + Q + E + ++GMKR M+ H Sbjct: 368 DLNQDLSVSWSQHNKTDHE-ARKQIIGMKRPMESSCIH 404 >ref|NP_187584.2| uncharacterized protein [Arabidopsis thaliana] gi|332641283|gb|AEE74804.1| uncharacterized protein AT3G09730 [Arabidopsis thaliana] Length = 397 Score = 210 bits (534), Expect = 2e-51 Identities = 171/456 (37%), Positives = 234/456 (51%), Gaps = 12/456 (2%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLI--FNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDD 1374 MWQV+L AA+AGS G +AKRL F+ D P + + + + S+ DS Sbjct: 1 MWQVILGAAIAGSTGFVAKRLFNPFSRDSPTPENYAEEQEPVTPPVSIGFLDSPC----- 55 Query: 1373 GVQESTLKEQSDGSIFRFSSTSGAEKSSNNKNVR---KKIRGVSSRGKMEGLKENSKGNA 1203 ++++G +FRFSS+ S + +K GV R ++ GL + K Sbjct: 56 --------DKTNG-VFRFSSSGSTVNSGSGSGSSPGFRKSSGVKCRVRVRGLMKKKK--- 103 Query: 1202 AKKCGVSGCEKRGWVVDQRGGGNGK---KVSVCLKKRRTSKIASGKCQSCTSKANSVFSW 1032 K GCE +++R G G K VC KK +T AS SK +S FS Sbjct: 104 --KKNSGGCE-----IEKRSGNVGAFEMKSEVCSKKTKTLGAASA------SKHHSSFSS 150 Query: 1031 GLSVGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPNSYAANVPTKLELQP 852 L V +MYMMSA K EISKL+ A +ET+K++QELK ELSR K+ E Sbjct: 
151 ALGVCMMYMMSAEKGEISKLHAATEETTKVIQELKDELSRIKSLQGFKFRGCAASSEKSG 210 Query: 851 K---NNREMVAQSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEA 681 + N EMV S + +K + +GE +SSV+TEE + EA EM+QLE Sbjct: 211 QIDLNRSEMV---------SRVSLDIK-----SGNDGEYASSVLTEEPEQEAVEMEQLEM 256 Query: 680 ELETELQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKL 501 ELE+ELQKL + D + +D V+ E S+Q G+ S LD+KL Sbjct: 257 ELESELQKLNLAETS-----DVMEECKDLVNGAE----------SYQCGGISASELDKKL 301 Query: 500 CHLLIEQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQV 321 HLLIEQQEGQI ELE++LQ T SKL EKE ELQ LK CV+RLTEF L + SD+E + + Sbjct: 302 SHLLIEQQEGQINELEAELQTTQSKLQEKEAELQALKVCVRRLTEFPLLDRSDDEHEEDL 361 Query: 320 EDEKTSLKDQEMELGTELGSRSMVGMKRAMDFESYH 213 + + Q + E + ++GMKR M+ H Sbjct: 362 NQDLSVSWSQHNKTDHE-ARKQIIGMKRPMESSCIH 396 >ref|XP_002882635.1| hypothetical protein ARALYDRAFT_317774 [Arabidopsis lyrata subsp. lyrata] gi|297328475|gb|EFH58894.1| hypothetical protein ARALYDRAFT_317774 [Arabidopsis lyrata subsp. lyrata] Length = 392 Score = 204 bits (520), Expect = 6e-50 Identities = 165/446 (36%), Positives = 232/446 (52%), Gaps = 7/446 (1%) Frame = -3 Query: 1544 MWQVLLAAAVAGS-GILAKRLIFNSDGTQPVSVSIQNDQKCLNESLQSQDSVFWGNDDGV 1368 MWQVLL AA+AGS G++AKR FN D L + + Q+ + G Sbjct: 1 MWQVLLGAAIAGSTGLVAKRF-FNP---------FSRDSPTLEDYTEEQEPIAPPVSIGF 50 Query: 1367 QESTLKEQSDGSIFRFSSTSGAEKSSNNKNVR---KKIRGVSSRGKMEGLKENSKGNAAK 1197 +S + + +FRFSS+ S + +K GV R ++ GL + K + Sbjct: 51 LDSPFDKTN--GVFRFSSSGSTVNSGSGSGSSPGFRKSSGVKCRVRVRGLMKKKKKIS-- 106 Query: 1196 KCGVSGCEKRGWVVDQRGGGNGKKVSVCLKKRRT--SKIASGKCQSCTSKANSVFSWGLS 1023 GCE +K VC KK +T + AS + S +++ +S FS L Sbjct: 107 ----EGCEI-------------EKREVCSKKTKTLGAASASKRGSSYSNQDHSSFSSALG 149 Query: 1022 VGIMYMMSAGKAEISKLNNAMDETSKIVQELKAELSRRKTPPN-SYAANVPTKLELQPKN 846 V +MYMMSA K+EISKL+ A +ET+K++QELK ELSR K+ + +AA ++ Q Sbjct: 150 VCMMYMMSAEKSEISKLHAATEETAKVIQELKDELSRIKSLQSFKFAATASSEKSGQINL 209 Query: 845 NREMVAQSVVKKPISEKGVPVKVFAFPTAEEGECSSSVVTEEQQPEAFEMDQLEAELETE 666 +R +A I +GE 
+SSV+TEE + EA EM+QLE ELE+E Sbjct: 210 SRSEMASRECLDIIK------------AGNDGEYASSVLTEEPEQEAVEMEQLEMELESE 257 Query: 665 LQKLPWSTMECLGFEDRTDTFEDEVSAREFHRTDDENLYSHQPSGVLPSVLDQKLCHLLI 486 LQKL + D + +D V+ E S+Q G+ S LD+KL HLLI Sbjct: 258 LQKLNLAETS-----DVMEECKDLVNGAE----------SYQCGGISASELDKKLSHLLI 302 Query: 485 EQQEGQIMELESKLQRTHSKLNEKEVELQTLKDCVKRLTEFSLGNASDEETDIQVEDEKT 306 EQQEGQI ELE++LQ T SKL EKE ELQ LK CV+RLTEF L + SD+E + + + + Sbjct: 303 EQQEGQINELEAELQTTQSKLQEKEAELQALKVCVRRLTEFPLLDRSDDEHEEDLNQDLS 362 Query: 305 SLKDQEMELGTELGSRSMVGMKRAMD 228 Q + E+ + +VGMKR M+ Sbjct: 363 VSWSQHNKTDNEV-RKPIVGMKRPME 387