BLASTX nr result
ID: Mentha22_contig00021696
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00021696 (679 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU18758.1| hypothetical protein MIMGU_mgv1a000625mg [Mimulus...   191   2e-46
ref|XP_002526720.1| conserved hypothetical protein [Ricinus comm...   166   6e-39
ref|XP_004292072.1| PREDICTED: uncharacterized protein LOC101315...   162   7e-38
ref|XP_007199692.1| hypothetical protein PRUPE_ppa000620mg [Prun...   162   7e-38
ref|XP_006349559.1| PREDICTED: uncharacterized protein LOC102595...   159   6e-37
ref|XP_006349558.1| PREDICTED: uncharacterized protein LOC102595...   159   6e-37
ref|XP_002277702.2| PREDICTED: uncharacterized protein LOC100241...   159   1e-36
emb|CBI29872.3| unnamed protein product [Vitis vinifera]              158   1e-36
gb|EXB53137.1| hypothetical protein L484_006957 [Morus notabilis]     157   3e-36
ref|XP_002876425.1| binding protein [Arabidopsis lyrata subsp. l...   154   3e-35
ref|XP_007028825.1| ARM repeat superfamily protein, putative [Th...   153   6e-35
ref|NP_001190119.1| armadillo/beta-catenin-like repeat-containin...   153   6e-35
ref|NP_191316.5| armadillo/beta-catenin-like repeat-containing p...   153   6e-35
emb|CAB66114.1| hypothetical protein [Arabidopsis thaliana]           153   6e-35
ref|XP_007162242.1| hypothetical protein PHAVU_001G135900g [Phas...   148   2e-33
ref|XP_002308042.2| hypothetical protein POPTR_0006s05290g [Popu...   147   4e-33
ref|XP_006604370.1| PREDICTED: uncharacterized protein LOC100800...   146   5e-33
ref|XP_006604369.1| PREDICTED: uncharacterized protein LOC100800...
146 5e-33 emb|CAB41198.1| hypothetical protein [Arabidopsis thaliana] 145 9e-33 ref|XP_004160826.1| PREDICTED: uncharacterized LOC101210197 [Cuc... 145 1e-32 >gb|EYU18758.1| hypothetical protein MIMGU_mgv1a000625mg [Mimulus guttatus] Length = 1041 Score = 191 bits (484), Expect = 2e-46 Identities = 103/173 (59%), Positives = 128/173 (73%), Gaps = 25/173 (14%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTKG----------TAISSDSVLAYVINQLTD-------E 131 QHGCID LALMLC E+Q+P+ +KG TAI+ DSVLAYV+NQLT E Sbjct: 823 QHGCIDCLALMLCTEIQSPKSSKGKYPYVTKFAGTAIARDSVLAYVMNQLTGDKKDSSFE 882 Query: 132 SNG--------VRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETR 287 S G RL+FRLCMANVLISACQKI ++GKKS+ K+I P +IRSI + +P+ R Sbjct: 883 SEGSDRVTDATARLSFRLCMANVLISACQKISDTGKKSFVKKIVPCVIRSIGEVVEPDIR 942 Query: 288 AACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 AAC+Q+LF+VAY+LKS I +SNDLL+VAL SLR GS+KE+M+GAKL+MCLMA Sbjct: 943 AACVQILFSVAYHLKSSIFSHSNDLLSVALKSLRDGSQKERMAGAKLVMCLMA 995 >ref|XP_002526720.1| conserved hypothetical protein [Ricinus communis] gi|223533909|gb|EEF35634.1| conserved hypothetical protein [Ricinus communis] Length = 1054 Score = 166 bits (420), Expect = 6e-39 Identities = 91/179 (50%), Positives = 117/179 (65%), Gaps = 31/179 (17%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK---------------GTAISSDSVLAYVINQLTDESN 137 QHGCID LALM+CAELQA E K G + + +S LAYVI+QL ++ N Sbjct: 830 QHGCIDCLALMICAELQATESLKDSSNKFRIAGKIIDSGKSTAGNSALAYVIHQLANDKN 889 Query: 138 GVRLT----------------FRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAI 269 V ++ RLCMAN LISACQKI +SGKKS+A+R P +I S+E I Sbjct: 890 EVSVSSLNIENCEFEATIPCSLRLCMANALISACQKISDSGKKSFARRSLPNLIHSVEMI 949 Query: 270 SDPETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 S PE RAACIQV+F+ Y+LKS ++PYS DLL ++L LR+GS+KE+M+GAKL+ LMA Sbjct: 950 SHPEIRAACIQVMFSAVYHLKSAVVPYSADLLKLSLKFLRKGSDKERMAGAKLMASLMA 1008 >ref|XP_004292072.1| PREDICTED: uncharacterized protein LOC101315407 [Fragaria vesca subsp. 
vesca] Length = 1057 Score = 162 bits (411), Expect = 7e-38 Identities = 89/173 (51%), Positives = 116/173 (67%), Gaps = 25/173 (14%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK--------GTAISSDSVLAYVINQLTDESNG------ 140 QHGCID +ALM+CAELQ P + G +SVL YVINQLT++ + Sbjct: 840 QHGCIDCMALMICAELQDPISSNIVGTKKYLGDGTLKNSVLTYVINQLTEDKDTPVSKSN 899 Query: 141 -----------VRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETR 287 V ++F LCMANVLISACQKI +SGKK +A+R PR+IR++E I+ E R Sbjct: 900 LDDVKCTTEVPVPISFYLCMANVLISACQKISDSGKKPFARRSLPRLIRAVEVITKSEIR 959 Query: 288 AACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 AAC QVLF+ Y+LKS+ILPYS DLL V++ +L++GSEKE+M+ AKL+ LMA Sbjct: 960 AACTQVLFSAVYHLKSIILPYSMDLLKVSIKALQKGSEKERMASAKLMGSLMA 1012 >ref|XP_007199692.1| hypothetical protein PRUPE_ppa000620mg [Prunus persica] gi|462395092|gb|EMJ00891.1| hypothetical protein PRUPE_ppa000620mg [Prunus persica] Length = 1068 Score = 162 bits (411), Expect = 7e-38 Identities = 91/170 (53%), Positives = 117/170 (68%), Gaps = 22/170 (12%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEY-----TKGTAISSDSVLAYVINQLTDESNG--------- 140 QHGCIDSLALM+CAELQ PE KG A S +SVL VIN+L +++ Sbjct: 854 QHGCIDSLALMICAELQDPESFSIVGKKGDASSGNSVLTCVINKLIQDNHQPVLLSNLDD 913 Query: 141 --------VRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETRAAC 296 V L+F +CMANVLISACQKI +SGKK + ++ P +I S++ +++ E RAAC Sbjct: 914 VKCSSEVPVPLSFYMCMANVLISACQKILDSGKKPFVRKTLPCLIHSVKVMTNSEIRAAC 973 Query: 297 IQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 IQVLF+ Y+LKS +LPYS DLL V+L +LR+GSEKE+M+GAKLL LMA Sbjct: 974 IQVLFSSVYHLKSTVLPYSADLLEVSLKALRKGSEKERMAGAKLLGSLMA 1023 >ref|XP_006349559.1| PREDICTED: uncharacterized protein LOC102595225 isoform X2 [Solanum tuberosum] Length = 1096 Score = 159 bits (403), Expect = 6e-37 Identities = 88/175 (50%), Positives = 116/175 (66%), Gaps = 27/175 (15%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------------GTAISSDSVLAYVINQLT-DE 131 QHGCID LALMLC ELQA + K G +++ SV +YVI+ L E Sbjct: 876 
QHGCIDCLALMLCTELQATKAVKNSISIEVCFEQSIVSSGDSLTKGSVCSYVIHHLVCGE 935 Query: 132 SNGVRL----------TFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 V L +FRLCMANVLISACQK+P + KK + +I PR++ S+E I++ E Sbjct: 936 DISVMLGRNEVVKAHHSFRLCMANVLISACQKVPCASKKPFVSKILPRVLHSVEEIANSE 995 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 R+ACIQV F++ Y+LKSL+LPYS+DLL V++ SLR+GSEKE+++GAKLL LMA Sbjct: 996 VRSACIQVFFSMVYHLKSLVLPYSSDLLKVSIKSLREGSEKERIAGAKLLASLMA 1050 >ref|XP_006349558.1| PREDICTED: uncharacterized protein LOC102595225 isoform X1 [Solanum tuberosum] Length = 1097 Score = 159 bits (403), Expect = 6e-37 Identities = 88/175 (50%), Positives = 116/175 (66%), Gaps = 27/175 (15%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------------GTAISSDSVLAYVINQLT-DE 131 QHGCID LALMLC ELQA + K G +++ SV +YVI+ L E Sbjct: 877 QHGCIDCLALMLCTELQATKAVKNSISIEVCFEQSIVSSGDSLTKGSVCSYVIHHLVCGE 936 Query: 132 SNGVRL----------TFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 V L +FRLCMANVLISACQK+P + KK + +I PR++ S+E I++ E Sbjct: 937 DISVMLGRNEVVKAHHSFRLCMANVLISACQKVPCASKKPFVSKILPRVLHSVEEIANSE 996 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 R+ACIQV F++ Y+LKSL+LPYS+DLL V++ SLR+GSEKE+++GAKLL LMA Sbjct: 997 VRSACIQVFFSMVYHLKSLVLPYSSDLLKVSIKSLREGSEKERIAGAKLLASLMA 1051 >ref|XP_002277702.2| PREDICTED: uncharacterized protein LOC100241927 [Vitis vinifera] Length = 1106 Score = 159 bits (401), Expect = 1e-36 Identities = 87/175 (49%), Positives = 114/175 (65%), Gaps = 27/175 (15%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTKGTAISS----------DSVLAYVINQLTDES------ 134 QHGCID LALM+C ELQAP+ G+ DSV+ YVI+QL+ ++ Sbjct: 886 QHGCIDCLALMICTELQAPKSFIGSVSDKISIIGKNFHPDSVVTYVIHQLSLDAVEAAST 945 Query: 135 -----------NGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 V L+FRLCMANVLISACQKI +SGKK++A+RI P +I ++ I D E Sbjct: 946 SMLCSDNCASEPSVPLSFRLCMANVLISACQKISDSGKKAFARRILPYLIHFVQVIKDSE 1005 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 R AC+QVLF+ 
Y+LKS+ILPYS++LL ++L SL SEKE+M+G KL+ LMA Sbjct: 1006 IRVACVQVLFSAVYHLKSMILPYSSELLKLSLKSLEGNSEKERMAGVKLMASLMA 1060 >emb|CBI29872.3| unnamed protein product [Vitis vinifera] Length = 1112 Score = 158 bits (400), Expect = 1e-36 Identities = 87/181 (48%), Positives = 115/181 (63%), Gaps = 33/181 (18%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPE----------------YTKGTAISSDSVLAYVINQLTDES 134 QHGCID LALM+C ELQAP+ + G + DSV+ YVI+QL+ ++ Sbjct: 886 QHGCIDCLALMICTELQAPKSFIGSVSDKISIIGKNFHPGDSALGDSVVTYVIHQLSLDA 945 Query: 135 -----------------NGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIE 263 V L+FRLCMANVLISACQKI +SGKK++A+RI P +I ++ Sbjct: 946 VEAASTSMLCSDNCASEPSVPLSFRLCMANVLISACQKISDSGKKAFARRILPYLIHFVQ 1005 Query: 264 AISDPETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLM 443 I D E R AC+QVLF+ Y+LKS+ILPYS++LL ++L SL SEKE+M+G KL+ LM Sbjct: 1006 VIKDSEIRVACVQVLFSAVYHLKSMILPYSSELLKLSLKSLEGNSEKERMAGVKLMASLM 1065 Query: 444 A 446 A Sbjct: 1066 A 1066 >gb|EXB53137.1| hypothetical protein L484_006957 [Morus notabilis] Length = 1077 Score = 157 bits (397), Expect = 3e-36 Identities = 85/165 (51%), Positives = 111/165 (67%), Gaps = 18/165 (10%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPE-YTKGTAISSDSVLAYVINQLTDESNG------------- 140 QHGCID LALM+CA+LQ E T + VL YVI+QLT + Sbjct: 866 QHGCIDCLALMICADLQVSESITDSNQEKNGPVLDYVISQLTSDKKEPVSTSQFGGQMRM 925 Query: 141 ----VRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETRAACIQVL 308 + L+FRLCMANVLISACQKIP+SGKK AK+ PR+I S+EAI++ + RAAC+QVL Sbjct: 926 FGAPLPLSFRLCMANVLISACQKIPDSGKKRLAKKALPRLISSVEAITESDIRAACLQVL 985 Query: 309 FTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLM 443 F+ Y+LKS + Y+ DLL ++L +L +GSEKEKM+GAK++ LM Sbjct: 986 FSAVYHLKSAVRTYACDLLKLSLKALEKGSEKEKMAGAKMMASLM 1030 >ref|XP_002876425.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297322263|gb|EFH52684.1| binding protein [Arabidopsis lyrata subsp. 
lyrata] Length = 1082 Score = 154 bits (389), Expect = 3e-35 Identities = 86/171 (50%), Positives = 112/171 (65%), Gaps = 23/171 (13%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTKGTA----------ISSDSVLAYVINQLTDE------- 131 QHGCID LALM+CAELQ + K + S +SVL Y I+ L ++ Sbjct: 866 QHGCIDCLALMICAELQDLKSLKTSGGEQMRTTEEDASGNSVLDYTIHCLVEDRSNCSSI 925 Query: 132 ------SNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETRAA 293 N + + FRLCMANV+ISACQKIPES KK++A++ P ++ S++ IS PE RAA Sbjct: 926 PKLSTGENPLPIPFRLCMANVIISACQKIPESTKKTFARKALPPLVHSLKVISVPEVRAA 985 Query: 294 CIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 CIQVLF+ Y LKS +LP S+DLL ++L L QGSEKEK++GAKL+ LMA Sbjct: 986 CIQVLFSAMYYLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMA 1036 >ref|XP_007028825.1| ARM repeat superfamily protein, putative [Theobroma cacao] gi|508717430|gb|EOY09327.1| ARM repeat superfamily protein, putative [Theobroma cacao] Length = 1114 Score = 153 bits (386), Expect = 6e-35 Identities = 89/180 (49%), Positives = 111/180 (61%), Gaps = 33/180 (18%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------------GTAISSDSVLAYVINQLTDES 134 QHGCID LALM+CAELQAPE K G A S +L +VI+QL ++ Sbjct: 888 QHGCIDCLALMICAELQAPELFKDRTSLRSNIVGKKGNPGDAASRPYILRHVIHQLINDK 947 Query: 135 NGVRL-----------------TFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIE 263 + ++ +FRLCMANVLISACQKI + GK AK I P +I S+E Sbjct: 948 SELKPVLKLRDENCETKAPIPHSFRLCMANVLISACQKISDYGKNLLAKTILPCLIDSVE 1007 Query: 264 AISDPETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLM 443 I PE RAACIQVLF+ Y+LKS +LPYS DLL ++L SL +GSE E+M+GAKL+ LM Sbjct: 1008 VIMQPEIRAACIQVLFSAVYHLKSAVLPYSCDLLKLSLKSLGKGSEMERMAGAKLMASLM 1067 >ref|NP_001190119.1| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana] gi|332646153|gb|AEE79674.1| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana] Length = 1096 Score = 153 bits (386), Expect = 6e-35 Identities = 87/175 (49%), Positives = 112/175 (64%), Gaps = 27/175 (15%) Frame = +3 Query: 3 
QHGCIDSLALMLCAELQAPEYTK----------GTAISSDSVLAYVINQLTDE------- 131 QHGCID LALM+CAELQ + +K G S SVL Y I+ L ++ Sbjct: 876 QHGCIDCLALMICAELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSI 935 Query: 132 ----------SNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 N + + FRLCMANV+ISACQK PES KK++A++ P +I S++ IS PE Sbjct: 936 PKLSTDILTCENPLPIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPE 995 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 RAACIQVLF+ Y+LKS +LP S+DLL ++L L QGSEKEK++GAKL+ LMA Sbjct: 996 VRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMA 1050 >ref|NP_191316.5| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana] gi|332646152|gb|AEE79673.1| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana] Length = 1092 Score = 153 bits (386), Expect = 6e-35 Identities = 87/175 (49%), Positives = 112/175 (64%), Gaps = 27/175 (15%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------GTAISSDSVLAYVINQLTDE------- 131 QHGCID LALM+CAELQ + +K G S SVL Y I+ L ++ Sbjct: 872 QHGCIDCLALMICAELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSI 931 Query: 132 ----------SNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 N + + FRLCMANV+ISACQK PES KK++A++ P +I S++ IS PE Sbjct: 932 PKLSTDILTCENPLPIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPE 991 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 RAACIQVLF+ Y+LKS +LP S+DLL ++L L QGSEKEK++GAKL+ LMA Sbjct: 992 VRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMA 1046 >emb|CAB66114.1| hypothetical protein [Arabidopsis thaliana] Length = 1057 Score = 153 bits (386), Expect = 6e-35 Identities = 87/175 (49%), Positives = 112/175 (64%), Gaps = 27/175 (15%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------GTAISSDSVLAYVINQLTDE------- 131 QHGCID LALM+CAELQ + +K G S SVL Y I+ L ++ Sbjct: 837 QHGCIDCLALMICAELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSI 896 Query: 132 ----------SNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPE 281 N + + FRLCMANV+ISACQK PES 
KK++A++ P +I S++ IS PE Sbjct: 897 PKLSTDILTCENPLPIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPE 956 Query: 282 TRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 RAACIQVLF+ Y+LKS +LP S+DLL ++L L QGSEKEK++GAKL+ LMA Sbjct: 957 VRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMA 1011 >ref|XP_007162242.1| hypothetical protein PHAVU_001G135900g [Phaseolus vulgaris] gi|561035706|gb|ESW34236.1| hypothetical protein PHAVU_001G135900g [Phaseolus vulgaris] Length = 1102 Score = 148 bits (373), Expect = 2e-33 Identities = 82/174 (47%), Positives = 109/174 (62%), Gaps = 26/174 (14%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQA--------PEYTKGTAISSDSVLAYVINQLTDESN------- 137 QHGCID LALM+CAELQA P+ TK SV++YV+NQ + N Sbjct: 883 QHGCIDCLALMICAELQAKESITTSMPDKTKAVGKEGKSVVSYVLNQFFNNKNERTSTPE 942 Query: 138 -----------GVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPET 284 V L+FRLCM NVLIS CQKI ES KK +A ++ P ++ S+E + E Sbjct: 943 FGDENSEFVAAAVSLSFRLCMGNVLISTCQKISESCKKPFAAQVLPFLLHSLEFETMSEI 1002 Query: 285 RAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 RAAC QVLF+ Y+L+S +LPY++DLL AL +LR+ S+KE+++GAKL+ LMA Sbjct: 1003 RAACTQVLFSAVYHLRSAVLPYASDLLRSALKALRKESDKERIAGAKLIASLMA 1056 >ref|XP_002308042.2| hypothetical protein POPTR_0006s05290g [Populus trichocarpa] gi|550335511|gb|EEE91565.2| hypothetical protein POPTR_0006s05290g [Populus trichocarpa] Length = 481 Score = 147 bits (370), Expect = 4e-33 Identities = 85/180 (47%), Positives = 109/180 (60%), Gaps = 32/180 (17%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK---------------GTAISSDSVLAYVINQLTDESN 137 QHGCID LALM+CA+LQ P K G A S + VL YVIN L ++ N Sbjct: 264 QHGCIDCLALMICAKLQVPSSFKESSKNLGAARKTSYCGNAASGNCVLLYVINLLINDEN 323 Query: 138 GV-----------------RLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEA 266 + +FR+CMANVLISACQKI +SGKK +AK+ P +++ I Sbjct: 324 ALVSASMSGSENSAFEAPTTHSFRVCMANVLISACQKISDSGKKRFAKKTVPHLLQDI-- 381 Query: 267 ISDPETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 RAACIQVLF+ Y+LKS +LPYS+DLLN++L L +GSEKE+M+ 
AKL+ LMA Sbjct: 382 ------RAACIQVLFSAVYHLKSAVLPYSSDLLNLSLKFLSRGSEKERMASAKLIASLMA 435 >ref|XP_006604370.1| PREDICTED: uncharacterized protein LOC100800773 isoform X2 [Glycine max] Length = 1099 Score = 146 bits (369), Expect = 5e-33 Identities = 81/173 (46%), Positives = 108/173 (62%), Gaps = 25/173 (14%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQA--------PEYTKGTAISSDSVLAYVINQLTDESN------- 137 QHGCID LALM+CAELQA P+ + +SV+ YVINQ + N Sbjct: 881 QHGCIDCLALMICAELQAKESINNSIPDTVRALGKKGNSVVTYVINQFFNNKNEQTSTPE 940 Query: 138 ----------GVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETR 287 V L+F LCM NVLIS CQKI ES KK +A ++ P ++ S+E + E R Sbjct: 941 FGDENSEFVAAVSLSFCLCMGNVLISTCQKISESCKKPFAAQVIPFLLHSLEFETKSEIR 1000 Query: 288 AACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 AAC QVLF+ Y+L+S +LPY++DLL +AL +LR+ S+KE+M+GAKL+ LMA Sbjct: 1001 AACTQVLFSAVYHLRSAVLPYASDLLRMALKALRKESDKERMAGAKLIASLMA 1053 >ref|XP_006604369.1| PREDICTED: uncharacterized protein LOC100800773 isoform X1 [Glycine max] Length = 1101 Score = 146 bits (369), Expect = 5e-33 Identities = 81/173 (46%), Positives = 108/173 (62%), Gaps = 25/173 (14%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQA--------PEYTKGTAISSDSVLAYVINQLTDESN------- 137 QHGCID LALM+CAELQA P+ + +SV+ YVINQ + N Sbjct: 883 QHGCIDCLALMICAELQAKESINNSIPDTVRALGKKGNSVVTYVINQFFNNKNEQTSTPE 942 Query: 138 ----------GVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISDPETR 287 V L+F LCM NVLIS CQKI ES KK +A ++ P ++ S+E + E R Sbjct: 943 FGDENSEFVAAVSLSFCLCMGNVLISTCQKISESCKKPFAAQVIPFLLHSLEFETKSEIR 1002 Query: 288 AACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 AAC QVLF+ Y+L+S +LPY++DLL +AL +LR+ S+KE+M+GAKL+ LMA Sbjct: 1003 AACTQVLFSAVYHLRSAVLPYASDLLRMALKALRKESDKERMAGAKLIASLMA 1055 >emb|CAB41198.1| hypothetical protein [Arabidopsis thaliana] Length = 305 Score = 145 bits (367), Expect = 9e-33 Identities = 87/185 (47%), Positives = 112/185 (60%), Gaps = 37/185 (20%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEYTK----------GTAISSDSVLAYVINQLTDE------- 131 QHGCID 
LALM+CAELQ + +K G S SVL Y I+ L ++ Sbjct: 75 QHGCIDCLALMICAELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSI 134 Query: 132 ----------SNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRS-------- 257 N + + FRLCMANV+ISACQK PES KK++A++ P +I S Sbjct: 135 PKLSTDILTCENPLPIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKFSTKFL 194 Query: 258 --IEAISDPETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLL 431 ++ IS PE RAACIQVLF+ Y+LKS +LP S+DLL ++L L QGSEKEK++GAKL+ Sbjct: 195 NFVQVISVPEVRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLM 254 Query: 432 MCLMA 446 LMA Sbjct: 255 ASLMA 259 >ref|XP_004160826.1| PREDICTED: uncharacterized LOC101210197 [Cucumis sativus] Length = 382 Score = 145 bits (366), Expect = 1e-32 Identities = 79/178 (44%), Positives = 110/178 (61%), Gaps = 30/178 (16%) Frame = +3 Query: 3 QHGCIDSLALMLCAELQAPEY------------TKGTAISSDSVLAYVINQLTD------ 128 +HGCID +ALM+C ELQAP KG A S+L YVI +L + Sbjct: 158 KHGCIDCIALMICTELQAPNSWSASKFEKIDIDEKGHASLKGSILDYVIGRLINGTKEQG 217 Query: 129 -----------ESNGVRLTFRLCMANVLISACQKIPESGKKSYAKRITPRIIRSIEAISD 275 +N L+ RLCMANVL SACQK+ +SGKK +A ++ PR+I +E S Sbjct: 218 AAYDLDNNDNPSNNSTPLSLRLCMANVLTSACQKLSDSGKKQFAWKVLPRLISFVEVTST 277 Query: 276 -PETRAACIQVLFTVAYNLKSLILPYSNDLLNVALNSLRQGSEKEKMSGAKLLMCLMA 446 + RA CI ++F+V Y+LKS ILPYSND+ V+LN+L+ G E+E+++GAKL++CLM+ Sbjct: 278 WVDIRAPCIGIIFSVVYHLKSAILPYSNDIFRVSLNALKNGQEQERIAGAKLMVCLMS 335