BLASTX nr result
ID: Mentha29_contig00018835
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00018835 (1998 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU28349.1| hypothetical protein MIMGU_mgv1a010624mg [Mimulus... 172 7e-40 gb|EYU25881.1| hypothetical protein MIMGU_mgv1a019896mg, partial... 154 1e-34 gb|EYU36394.1| hypothetical protein MIMGU_mgv1a022334mg [Mimulus... 153 3e-34 ref|XP_004243724.1| PREDICTED: uncharacterized protein LOC101257... 145 7e-32 ref|XP_006342356.1| PREDICTED: uncharacterized protein LOC102599... 136 3e-29 ref|XP_004165423.1| PREDICTED: uncharacterized protein LOC101228... 131 1e-27 ref|XP_004141159.1| PREDICTED: uncharacterized protein LOC101210... 129 5e-27 ref|XP_006392704.1| hypothetical protein EUTSA_v10012317mg [Eutr... 126 4e-26 ref|XP_002514069.1| conserved hypothetical protein [Ricinus comm... 123 3e-25 ref|NP_188014.1| uncharacterized protein [Arabidopsis thaliana] ... 122 6e-25 ref|XP_006451567.1| hypothetical protein CICLE_v10008758mg [Citr... 115 6e-23 ref|XP_002299596.2| hypothetical protein POPTR_0001s17050g [Popu... 115 6e-23 ref|XP_006299874.1| hypothetical protein CARUB_v10016082mg [Caps... 115 6e-23 ref|XP_002885001.1| hypothetical protein ARALYDRAFT_897656 [Arab... 115 7e-23 emb|CAN69469.1| hypothetical protein VITISV_042556 [Vitis vinifera] 115 7e-23 ref|XP_006407142.1| hypothetical protein EUTSA_v10022002mg [Eutr... 114 2e-22 ref|XP_007012696.1| Uncharacterized protein TCM_037572 [Theobrom... 112 5e-22 ref|XP_004303677.1| PREDICTED: uncharacterized protein LOC101294... 112 5e-22 ref|XP_002891795.1| hypothetical protein ARALYDRAFT_474551 [Arab... 112 6e-22 ref|XP_007024590.1| Uncharacterized protein TCM_029108 [Theobrom... 111 1e-21 >gb|EYU28349.1| hypothetical protein MIMGU_mgv1a010624mg [Mimulus guttatus] Length = 307 Score = 172 bits (435), Expect = 7e-40 Identities = 129/348 (37%), Positives = 171/348 (49%), Gaps = 41/348 (11%) Frame = +1 Query: 883 MAMWERHTQNVCPKGPSFSSSLLDSIYRSIDETRPLPTDNRKPP---LHRRNKPE---DD 1044 MA+WE+ +N K PSFSSSLLDSIYRSIDE P + + LHRRN ++ Sbjct: 1 MAVWEKQAKN---KEPSFSSSLLDSIYRSIDENGA-PNEQKMEDNFLLHRRNNAAAKVEE 56 Query: 1045 DVESLRRAIMIEKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXX 1224 ++ESLR+AIMIEKW++N+ +T H P Sbjct: 57 EIESLRKAIMIEKWMENYKPINT---------TTTTTTMHFPSNSGSSTDSSIFSSSET- 106 Query: 1225 XXXXXXXXXXXXLKITPKKLDPPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTF 1404 + + + P E + + K+RAQK+YGDLKK +EP+SPGG+I++F Sbjct: 107 -------------ESSVSRSYKIPQLENKPATKKSRAQKIYGDLKKVREPVSPGGRIASF 153 Query: 1405 FNSIFSPR-----------NXXXXXXXXXXXXXXXXKEAATCSLVASRSCLSKNASSRGS 1551 NSIFSPR TCSL SRSCLSKN SS Sbjct: 154 LNSIFSPRKTNELCSMRKSKSMKDTTTTTTTGTTTTASTKTCSL-ESRSCLSKNRSSGDC 212 Query: 1552 KSKRTVRFCPVT-----------VIVDQDE-------------IIKRNDFRVFEGKSLRK 1659 KSKR+VRFC V + D+ E IK+N K+ RK Sbjct: 213 KSKRSVRFCIVDGDCQPCGNKSGLFYDEKEDLVNTTVANIGSYFIKKNT------KNARK 266 Query: 1660 HQVKDFYRDEKDFDDMSCASSDLFELENIGRVEYEQELPLYETTSIQI 1803 +Q + +D D++SC SSDLFELENIGR YE+ELP+YETT++++ Sbjct: 267 NQ------EVRDLDELSCGSSDLFELENIGR--YEEELPVYETTNLEL 306 >gb|EYU25881.1| hypothetical protein MIMGU_mgv1a019896mg, partial [Mimulus guttatus] Length = 313 Score = 154 bits (389), Expect = 1e-34 Identities = 120/339 (35%), Positives = 164/339 (48%), Gaps = 23/339 (6%) Frame = +1 Query: 871 KKVKMAMWERHTQNVCPKGPSFSSSLLDSIYRSIDE------TRPLPTDNRKPPLHRRNK 1032 ++ MA+ ER TQ K PSFSSSLLD+IY SIDE P + +H R+ Sbjct: 3 REAAMAVGERQTQKHRRKTPSFSSSLLDAIYLSIDEPGCAAAAASQPHQQSEDFVHLRSN 62 Query: 1033 PE-------DDDVESLRRAIMIEKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXX 1191 +D++ SLRRAI +EKW++NH A + PR Sbjct: 63 TRRNNAAHFEDEIASLRRAITVEKWMENHTTTTTVVAAA----AAAAVPRR--------- 109 Query: 1192 XXXXXXXXXXXXXXXXXXXXXXXLKITPKKLDPPPLSEGRLS-------KTKTRAQKLYG 1350 + + + D LS S +TK++A K+YG Sbjct: 110 ----------RISSNSGTSSDSSILSSSSETDSSVLSRSSSSSKNHLYTRTKSKAMKIYG 159 Query: 1351 DLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXXXXKEAATCS--LVASRSCL 1524 +LKK KEPISPGGKI FFNSIFSPRN K + +S+SCL Sbjct: 160 ELKKVKEPISPGGKIVNFFNSIFSPRNPKQKQTTVEEWSSSIRKSRSMRDPPTTSSKSCL 219 Query: 1525 SKN-ASSRGSKSKRTVRFCPVTVIVDQDEIIKRNDFRVFEGKSLRKHQVKDFYRDEKDFD 1701 KN +SS +KSKR+V+F D++E + + + ++K + + +E D D Sbjct: 220 IKNPSSSICNKSKRSVKF-------DENEYSVNSSMPSVKSRLIKKS--VELFENESDGD 270 Query: 1702 DMSCASSDLFELENIGRVEYEQELPLYETTSIQISGKIS 1818 DMSCASSDLFELENIG E ELP+Y TTSI+++ I+ Sbjct: 271 DMSCASSDLFELENIGSYGGE-ELPVYGTTSIKMNRAIA 308 >gb|EYU36394.1| hypothetical protein MIMGU_mgv1a022334mg [Mimulus guttatus] Length = 166 Score = 153 bits (386), Expect = 3e-34 Identities = 89/168 (52%), Positives = 105/168 (62%), Gaps = 2/168 (1%) Frame = -1 Query: 711 SQQNVFVRIIRSPYRALCKARDFYVRSMLDCANSNAVGLHGAAQGPNLPRSFSAVSSTSY 532 SQQN F+RII +P+RAL KARDFYV+ ++DCA SN +GL +Q P LPRSFS SS++ Sbjct: 6 SQQNKFLRIITTPFRALGKARDFYVKRIMDCAGSNVIGLQATSQAPGLPRSFSTASSSAR 65 Query: 531 EDN-EDYRELVRAASA-RSIGGGVDLDAYIKQEXXXXXXXXXXXXXXXXXRSASVAMGRI 358 DN EDYRELVRA SA RSIGGGVDL+AY+ + RS +V+MGRI Sbjct: 66 SDNDEDYRELVRATSAGRSIGGGVDLEAYLMRREMGMRGGSGGGPATMPPRSLTVSMGRI 125 Query: 357 DEERPSAYFVADTKISXXXXXXXXXXXXXNQELRYGRSKSHAVARTSF 214 DEERP YF D EL+Y RSKSHAVART F Sbjct: 126 DEERPCCYFRED-------FYGRRSIVNSKNELKYPRSKSHAVARTPF 166 >ref|XP_004243724.1| PREDICTED: uncharacterized protein LOC101257581 [Solanum lycopersicum] Length = 309 Score = 145 bits (366), Expect = 7e-32 Identities = 114/328 (34%), Positives = 162/328 (49%), Gaps = 16/328 (4%) Frame = +1 Query: 883 MAMWERHTQNVCP--KGPSFSSSLLDSIYRSIDETRPLPTDNRKPPLHRRNKPEDDDVES 1056 M+ WE+ V K PSFSSSLL+SIY SIDE++ +++ P +R++ +++++ S Sbjct: 1 MSSWEKPIIRVQQRRKTPSFSSSLLESIYHSIDESKEEEEKHQQVP-NRKSNNKEEEIVS 59 Query: 1057 LRRAIMIEKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXXXXX 1236 LRRAI+IEKW++++ + + Sbjct: 60 LRRAILIEKWMESYKYSQSSGHFSSDSSSSAESSMFSSSETESRSINTLPKTTTQVRRPD 119 Query: 1237 XXXXXXXXLKITPKKLDPPPLSEG--RLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFN 1410 K TPK EG R +TK+RA LK K+PISPGGKI+ F N Sbjct: 120 RVVTFSTETK-TPK-------CEGGGRFMRTKSRA------LKMVKQPISPGGKIANFLN 165 Query: 1411 SIFSPRNXXXXXXXXXXXXXXXXKEAATCSLVASRSCLSKNASSR--GSKSKRTVRFCPV 1584 SIF+ +N + ++ S SCL+K SS +KSKR+VRFCPV Sbjct: 166 SIFNSKNIKKNHQEDWSSVRKSRSVNDSTTMTTS-SCLNKTPSSTSISNKSKRSVRFCPV 224 Query: 1585 TVIVDQD------EIIKRNDFRVFEGKSLRKHQVKDFYRDEKDFDDMSCASSDLFELENI 1746 TVIVD+D + I RN+ KHQ + FY+DE + D SCASSDLFELENI Sbjct: 225 TVIVDEDSQPCGHKSIYRNE--------EPKHQYRGFYQDEDEDDGRSCASSDLFELENI 276 Query: 1747 GRVEY----EQELPLYETTSIQISGKIS 1818 + + LP+Y TTS +++ I+ Sbjct: 277 SMIGHVHANRDGLPVYGTTSFKMNQAIA 304 >ref|XP_006342356.1| PREDICTED: uncharacterized protein LOC102599331 [Solanum tuberosum] Length = 301 Score = 136 bits (343), Expect = 3e-29 Identities = 107/328 (32%), Positives = 160/328 (48%), Gaps = 16/328 (4%) Frame = +1 Query: 883 MAMWER-----HTQNVCPKGPSFSSSLLDSIYRSIDETRPLPTDNRKPPLHRRNKPEDDD 1047 M+ WE+ H + K PSFSSSLL++IY SIDE++ + + + + +++ Sbjct: 1 MSSWEKPIIRPHQRR---KTPSFSSSLLEAIYHSIDESKE---EEKHEEVPNKKSNNEEE 54 Query: 1048 VESLRRAIMIEKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXX 1227 + SLRRAI+IEKW++++ + + Sbjct: 55 IVSLRRAILIEKWMESYKYTQSSGHFSSDSSSSTESSMFSSSETESRSINTLPRPTRQVR 114 Query: 1228 XXXXXXXXXXXLKITPKKLDPPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFF 1407 TPK GR +TK+RA LK K+PISPGGKI+ F Sbjct: 115 RSDKIVTFSTDTDTTPKCE-----GGGRFMRTKSRA------LKMVKQPISPGGKIANFL 163 Query: 1408 NSIFSPRNXXXXXXXXXXXXXXXXKEAATCSLVASRSCLSK--NASSRGSKSKRTVRFCP 1581 NSIF+ RN +++ + + + SCL+K +++SR +KSKR+VRFCP Sbjct: 164 NSIFNSRN-------MKKNHHEEWRKSRSVNDSTTSSCLNKTPSSTSRSNKSKRSVRFCP 216 Query: 1582 VTVIVDQ---DEIIKRNDFRVFEGKSLRKHQVKDFYR--DEKDFDDMSCASSDLFELENI 1746 TVIVD+ + I +N+ KHQ + FY+ DE + D SCASSDLFELENI Sbjct: 217 DTVIVDEHCGHKSIYKNE--------EPKHQYRGFYKDGDEDEDDGRSCASSDLFELENI 268 Query: 1747 GRVEY----EQELPLYETTSIQISGKIS 1818 G + + LP+Y TTS +++ I+ Sbjct: 269 GMIGHVHATRDGLPVYGTTSFKMNQAIA 296 >ref|XP_004165423.1| PREDICTED: uncharacterized protein LOC101228582 [Cucumis sativus] Length = 364 Score = 131 bits (329), Expect = 1e-27 Identities = 111/352 (31%), Positives = 159/352 (45%), Gaps = 55/352 (15%) Frame = +1 Query: 928 PSFSSSLLDSIYRSIDETRPLPTDN-----RKPPLHRRN----KPEDDDVESLRRAIMIE 1080 PSFSSSLLD+IYRSIDE+ P ++ K L ++ + D ESL MI+ Sbjct: 23 PSFSSSLLDAIYRSIDESNSQPEEHLIFYSHKTTLTTKHSILPRTVTQDPESLNFR-MID 81 Query: 1081 KWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1260 W+D + S+ Sbjct: 82 SWMDKKQLRNIRDFTLSSSSSSESSST-------------AGRRFSSSETEFLSRPLHRP 128 Query: 1261 LKITPKKLD-----------PPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFF 1407 K+ PK + P P E K+K++A K+Y DLKK K+PISPG ++++F Sbjct: 129 TKLKPKPIKTNTWAQTEIITPNPKHENGFVKSKSKASKIYHDLKKVKQPISPGARLASFL 188 Query: 1408 NSIF---SPR------------NXXXXXXXXXXXXXXXXKEAATCSLVA--SRSCLSKNA 1536 NS+F SP+ N +TCS + SRSCLSK Sbjct: 189 NSLFNGGSPKTKQKISSSTCSINSTKFDYDMSRKSKSQQGSTSTCSSASSFSRSCLSKTP 248 Query: 1537 SSRGSKSKRTVRFCPVTVIVDQD--------------EIIKRNDFRVFEGKSLRKHQVKD 1674 SSRG+ KR+VRFCPV+VIVD+D I+K+ + + G ++ + + Sbjct: 249 SSRGN-IKRSVRFCPVSVIVDEDCRPCGHKFLHKSEEPIMKKGNLKKVTGDMMKMNYEDE 307 Query: 1675 FYRDEKDFDD-MSCASSDLFELEN---IGRVEYEQELPLYETTSIQISGKIS 1818 D++D DD +SC+SSDLFEL+N IG Y +ELP+YETT + + I+ Sbjct: 308 EEDDDEDDDDALSCSSSDLFELDNLSVIGIERYREELPVYETTHFKTNCAIA 359 >ref|XP_004141159.1| PREDICTED: uncharacterized protein LOC101210356 [Cucumis sativus] Length = 375 Score = 129 bits (324), Expect = 5e-27 Identities = 115/363 (31%), Positives = 160/363 (44%), Gaps = 66/363 (18%) Frame = +1 Query: 928 PSFSSSLLDSIYRSIDETRPLPTDN-----RKPPLHRRN----KPEDDDVESLRRAIMIE 1080 PSFSSSLLD+IYRSIDE+ P ++ K L ++ + D ESL MI+ Sbjct: 23 PSFSSSLLDAIYRSIDESNSQPEEHLIFYSHKTTLTTKHSILPRTVTQDPESLNFR-MID 81 Query: 1081 KWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1260 W+D + S+ Sbjct: 82 SWMDKKQLRNIRDFTLSSSSSSESSST-------------AGRRFSSSETEFLSRPLHRP 128 Query: 1261 LKITPKKLD-----------PPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFF 1407 K+ PK + P P E K+K++A K+Y DLKK K+PISPG ++++F Sbjct: 129 TKLKPKPIKTNTWAQTEIITPNPKHENGFVKSKSKASKIYHDLKKVKQPISPGARLASFL 188 Query: 1408 NSIF---SPR------------NXXXXXXXXXXXXXXXXKEAATCSLVA--SRSCLSKNA 1536 NS+F SP+ N +TCS + SRSCLSK Sbjct: 189 NSLFNGGSPKTKQKISSSTCSINSTKFDYDMSRKSKSQQGSTSTCSSASSFSRSCLSKTP 248 Query: 1537 SSRGSKSKRTVRFCPVTVIVDQD--------------EIIKRNDFRVFEGKSLRKHQVKD 1674 SSRG+ KR+VRFCPV+VIVD+D I+K+ + + G R +VK Sbjct: 249 SSRGN-IKRSVRFCPVSVIVDEDCRPCGHKFLHKSEEPIMKKGNLKKSYGNLKRDQKVKT 307 Query: 1675 ------FYRDEKDFDD------MSCASSDLFELEN---IGRVEYEQELPLYETTSIQISG 1809 Y DE++ DD +SC+SSDLFEL+N IG Y +ELP+YETT + + Sbjct: 308 EDMMKMNYEDEEEDDDEDDDDALSCSSSDLFELDNLSVIGIERYREELPVYETTHFKTNC 367 Query: 1810 KIS 1818 I+ Sbjct: 368 AIA 370 >ref|XP_006392704.1| hypothetical protein EUTSA_v10012317mg [Eutrema salsugineum] gi|557089282|gb|ESQ29990.1| hypothetical protein EUTSA_v10012317mg [Eutrema salsugineum] Length = 372 Score = 126 bits (316), Expect = 4e-26 Identities = 112/372 (30%), Positives = 163/372 (43%), Gaps = 60/372 (16%) Frame = +1 Query: 883 MAMWERHTQNVCP---KGPSFSSSLLDSIYRSIDE--TRPLPTDNRKPPLHRRNKPEDDD 1047 M W++H+ ++ + PSFSSSLLD IYRSID+ T + +K H + D+D Sbjct: 1 MDPWDKHSIDLHRHRHRHPSFSSSLLDQIYRSIDDSSTNSDVSMRKKQQHHHHHHRLDED 60 Query: 1048 VESLRRAIMIEKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXX 1227 L + ++ + I + S+ Sbjct: 61 RVCLDKILVNRREIADDFVRSRNPKTVEPVFFKHSSSSSSDSSGFSSSGSDSFYKRSRSS 120 Query: 1228 XXXXXXXXXXXLKITPKKLDPPPL--------SEGRLSKTKTRAQKLYGDLKKAKEPISP 1383 ++ T ++ + P G +TK++A K+Y DLKK K+PISP Sbjct: 121 RSPPAIGHPKPIRTTVERFERSPQIHRPNNKQEHGSFLRTKSKALKIYSDLKKVKQPISP 180 Query: 1384 GGKISTFFNSIFS-PRNXXXXXXXXXXXXXXXXKEAATCSLVA--SRSCLSKNASSRGSK 1554 GG+++TF NS+F+ N + TCS + SRSCLSK SS K Sbjct: 181 GGRLATFLNSLFTGAGNTKKPNKINTTVPVAAAASSTTCSSASSFSRSCLSKTPSS-SEK 239 Query: 1555 SKRTVRFCPVTVIVDQDEIIKRNDFRVF---EGKSLRKHQ-------------------V 1668 SKR+VRFCPV VI+D+D ++ +++ E +S R HQ Sbjct: 240 SKRSVRFCPVNVILDEDSKRNGHNNKLYGSNERESTRHHQNFDTLEIRVVEENRRVIEAA 299 Query: 1669 KDFYR------------------DEKDFDD-MSCASSDLFELEN---IGRVEYEQELPLY 1782 K+ R DE D DD SCASSDLFEL+N IG +Y +ELP+Y Sbjct: 300 KELLRTYHKNKDVVNISGEEEEEDEDDEDDAASCASSDLFELDNLSAIGIEKYREELPVY 359 Query: 1783 ETTSIQISGKIS 1818 ETT ++ + IS Sbjct: 360 ETTRLKTNRVIS 371 >ref|XP_002514069.1| conserved hypothetical protein [Ricinus communis] gi|223546525|gb|EEF48023.1| conserved hypothetical protein [Ricinus communis] Length = 394 Score = 123 bits (309), Expect = 3e-25 Identities = 85/228 (37%), Positives = 117/228 (51%), Gaps = 59/228 (25%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRN-----XXXXXXXXX 1458 P EG S+TK +A K+YG+LKK K+PISPGG+I++F NSIFSP + Sbjct: 156 PKCEGGFSRTKLKAMKIYGELKKVKQPISPGGRIASFLNSIFSPGSAKKVKMCSIGAMDD 215 Query: 1459 XXXXXXXKEAATCSLVA--SRSCLSKNASSR-----GSKSKRTVRFCPVTVIVDQD---- 1605 K + CS V SRSCLSK SR G+KSKR+VRFCPV+VIVD+D Sbjct: 216 VSTATDRKSKSACSSVTSFSRSCLSKTPPSRGKASNGNKSKRSVRFCPVSVIVDEDSRPC 275 Query: 1606 -------------------EIIKRNDFRVFEGKS---LRKHQVK---------------- 1671 +++K + F+ GK +R +Q K Sbjct: 276 GHKCIYEDDPGLMPTPVPPKLVKSSSFKEDAGKGAKYIRNYQKKNISEFDFRGFHSYIQD 335 Query: 1672 ----DFYRDEKDFDDMSCASSDLFELENI-GRVEYEQELPLYETTSIQ 1800 D ++D D+ SC+SSDLFEL+++ G Y +ELP+YETT+ + Sbjct: 336 RDAVDDEDSDEDEDNQSCSSSDLFELDHLMGIGRYREELPVYETTNFK 383 >ref|NP_188014.1| uncharacterized protein [Arabidopsis thaliana] gi|11994369|dbj|BAB02328.1| unnamed protein product [Arabidopsis thaliana] gi|332641926|gb|AEE75447.1| uncharacterized protein AT3G13980 [Arabidopsis thaliana] Length = 357 Score = 122 bits (306), Expect = 6e-25 Identities = 85/210 (40%), Positives = 113/210 (53%), Gaps = 36/210 (17%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIF--SPRNXXXXXXXXXXXX 1467 P G +TK++A K+Y DLKK K+PISPGG+++TF NS+F + N Sbjct: 142 PKELGGFLRTKSKALKIYSDLKKVKQPISPGGRLATFLNSLFTNAATNPKKHKKTTTVAV 201 Query: 1468 XXXXKEAATCSLVA--SRSCLSKNASSRGSKSKRTVRFCPVTVIVDQDEIIKR----NDF 1629 ++TCS + SRSCLSK SS G KSKR+VRFCPV VI+D+D N+ Sbjct: 202 VEEPHSSSTCSSASSFSRSCLSKTPSSSG-KSKRSVRFCPVNVILDEDSSFTMPYAYNNE 260 Query: 1630 RVF---EGKSLRKHQ-----VKDFYR----------------DEKDFDD-MSCASSDLFE 1734 R++ E K + +H+ KD R +E D DD SCASSDLFE Sbjct: 261 RLYDNNEAKRVEEHRRVIQAAKDLLRTYHNKNKVTTTNINNVEEDDEDDAASCASSDLFE 320 Query: 1735 LEN---IGRVEYEQELPLYETTSIQISGKI 1815 LEN IG Y +ELP+YETT + ++ Sbjct: 321 LENLSAIGIERYREELPVYETTRLDNMNRV 350 >ref|XP_006451567.1| hypothetical protein CICLE_v10008758mg [Citrus clementina] gi|568875514|ref|XP_006490839.1| PREDICTED: uncharacterized protein LOC102628009 [Citrus sinensis] gi|557554793|gb|ESR64807.1| hypothetical protein CICLE_v10008758mg [Citrus clementina] Length = 357 Score = 115 bits (289), Expect = 6e-23 Identities = 99/347 (28%), Positives = 151/347 (43%), Gaps = 56/347 (16%) Frame = +1 Query: 928 PSFSSSLLDSIYRSIDETRPLPTDNRKPPL----------HRRNKPEDDDVESLRRAIMI 1077 PSFSSSLLD+IY SIDE+ + RK + +RN + + +LRRAIM+ Sbjct: 8 PSFSSSLLDAIYHSIDESNN--NNKRKEEMGFCCDKALMTRQRNSRVEQEAHTLRRAIMV 65 Query: 1078 EKWIDNHXXXXXXXXXXXXXXACRSTPRHLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1257 E W++ + S+ Sbjct: 66 ENWMEKQSSLDSILNLKNNSISNSSSDSTSSACADSRSFYKERSRRSKPVQSVKCMQFDR 125 Query: 1258 XL--KITPKKLDPPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRN 1431 + + PK+ P EG +KTK +A K+YG+LKK K+PISPGG+ ++F NSIFS N Sbjct: 126 RIIDEENPKQKQKP---EGGFTKTKLKALKIYGELKKVKQPISPGGRFTSFLNSIFSSGN 182 Query: 1432 XXXXXXXXXXXXXXXXKEAATCSLVASRSCLSKNASSRGSKS--------KRTVRFCPVT 1587 + + S SRSC+S + SK KR+VRF PV+ Sbjct: 183 AKKVNVCTVKAMEDVNFDRKSKS-TRSRSCVSNANTPPPSKGNIINNYGIKRSVRFYPVS 241 Query: 1588 VIVDQD-----------------------EIIKRNDFRVFEGKSLRKHQVKDFYR----- 1683 ++VD+ ++K+N+ V G+ L ++ +F Sbjct: 242 IVVDEHCRPCGHKCIYEDDPSLLPSLKDRVLMKKNEIGV--GRFLSNKEISEFVSRDFSP 299 Query: 1684 -------DEKDFDDMSCASSDLFELENIGRV-EYEQELPLYETTSIQ 1800 DE + D S +SSDLFEL+++ + Y +ELP+YETT+++ Sbjct: 300 NHDNDDDDEDEDDAFSYSSSDLFELDHLNGIGRYREELPVYETTNLK 346 >ref|XP_002299596.2| hypothetical protein POPTR_0001s17050g [Populus trichocarpa] gi|550347516|gb|EEE84401.2| hypothetical protein POPTR_0001s17050g [Populus trichocarpa] Length = 380 Score = 115 bits (289), Expect = 6e-23 Identities = 80/211 (37%), Positives = 112/211 (53%), Gaps = 46/211 (21%) Frame = +1 Query: 1303 EGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXXXXK 1482 EG KTK++A K+YGDLKK K+PISPG ++++F NS+F+ N K Sbjct: 160 EGSFVKTKSKALKIYGDLKKVKQPISPGRRLASFLNSLFTTGNAKKAKITTPGGSYEERK 219 Query: 1483 ----EAATCSLVA--SRSCLSKNASSRGSK------SKRTVRFCPVTVIVDQD------- 1605 +A+TCS + SRSCLSK SSRG K +KR+VRF PV+VIVD+D Sbjct: 220 LKSEQASTCSSASSFSRSCLSKTPSSRGGKLSSNNGAKRSVRFYPVSVIVDEDCRPCGHK 279 Query: 1606 ------------------------EIIKRNDFRVFEGKSLRKHQVKDFYRDEKDFDDMSC 1713 E + R+ + ++ K +H+ ++ D+ D D SC Sbjct: 280 NLYGSDRQEMSKLKLHVMNENRRIEEVARDLLKNYQ-KKKEEHEEEEEESDDDD-DIASC 337 Query: 1714 ASSDLFELEN---IGRVEYEQELPLYETTSI 1797 ASSDLFEL+N +G Y +ELP+YETT + Sbjct: 338 ASSDLFELDNLSVVGIERYREELPVYETTHL 368 >ref|XP_006299874.1| hypothetical protein CARUB_v10016082mg [Capsella rubella] gi|482568583|gb|EOA32772.1| hypothetical protein CARUB_v10016082mg [Capsella rubella] Length = 371 Score = 115 bits (289), Expect = 6e-23 Identities = 80/216 (37%), Positives = 110/216 (50%), Gaps = 48/216 (22%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIF------SPRNXXXXXXXX 1455 P G +TK++A K+Y DLKK K+PISPGG+++TF NS+F +P+ Sbjct: 144 PKELGGFLRTKSKALKIYSDLKKVKQPISPGGRLATFLNSLFTNAAAANPKKQRKKTTSP 203 Query: 1456 XXXXXXXXKEAATCSLVA--SRSCLSKNASSRGSKSKRTVRFCPVTVIVDQD-------- 1605 + TCS + SRSCLSK SS G KSKR+VRFCPV VI+D+D Sbjct: 204 VRVGETVHSSSTTCSSASSFSRSCLSKTPSSSG-KSKRSVRFCPVNVILDEDSSTIPMPY 262 Query: 1606 --------------EIIKRNDFRVFEGKSL-----RKHQVKDFY----------RDEKDF 1698 ++++ + + K L K++V Y +++D Sbjct: 263 TYNNRLYGSNEDKRDVMEEHRRVIQAAKDLLRTYHNKNKVTTTYDEDIKYIANVEEDEDD 322 Query: 1699 DDMSCASSDLFELEN---IGRVEYEQELPLYETTSI 1797 D SCASSDLFELEN IG Y +ELP+YETT + Sbjct: 323 DAASCASSDLFELENLSAIGIERYREELPVYETTRL 358 >ref|XP_002885001.1| hypothetical protein ARALYDRAFT_897656 [Arabidopsis lyrata subsp. lyrata] gi|297330841|gb|EFH61260.1| hypothetical protein ARALYDRAFT_897656 [Arabidopsis lyrata subsp. lyrata] Length = 358 Score = 115 bits (288), Expect = 7e-23 Identities = 81/214 (37%), Positives = 112/214 (52%), Gaps = 40/214 (18%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXX 1473 P G +TK++A K+Y DLKK K+PISPGG+++TF NS+F+ N Sbjct: 141 PKELGGFLRTKSKALKIYSDLKKVKQPISPGGRLATFLNSLFT--NAAATNPKKHKKTTT 198 Query: 1474 XXKE---AATCSLVA--SRSCLSKNASSRGSKSKRTVRFCPVTVIVDQDEIIKR----ND 1626 +E ++TCS + SRSCLSK SS G KSKR+VRFCPV VI+D+D I N+ Sbjct: 199 TVEEPHSSSTCSSASSYSRSCLSKTPSSSG-KSKRSVRFCPVNVILDEDSSIHMPYAYNN 257 Query: 1627 FRVFEGKSLRKHQVKDFYR---------------------------DEKDFDD-MSCASS 1722 ++ ++ +++ R +E D DD SCASS Sbjct: 258 NSLYGSNEAKRDVIEEHRRVIEAAKDLLRTYHNKNKVTTTTNITNVEEDDEDDAASCASS 317 Query: 1723 DLFELEN---IGRVEYEQELPLYETTSIQISGKI 1815 DLFELEN IG Y +ELP+YETT + ++ Sbjct: 318 DLFELENLSAIGIDRYREELPVYETTRLDNMNRV 351 >emb|CAN69469.1| hypothetical protein VITISV_042556 [Vitis vinifera] Length = 403 Score = 115 bits (288), Expect = 7e-23 Identities = 88/223 (39%), Positives = 114/223 (51%), Gaps = 48/223 (21%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXX 1473 P EG KTK+RA K+YGDLKK K+PISPGG++++F NS+F+ Sbjct: 176 PKHEGGFVKTKSRALKIYGDLKKVKQPISPGGRLASFLNSLFTTGTAKKAKISSSEDSTP 235 Query: 1474 XXK----EAATCSLVA--SRSCLSKNASSRGSKS---KRTVRFCPVTVIVDQD------- 1605 K +TCS + SRSCLSK SSRG S KR+VRF PV+VIVD+D Sbjct: 236 ERKSKSGHTSTCSSASSFSRSCLSKTPSSRGKLSNGTKRSVRFYPVSVIVDEDCRPCGHK 295 Query: 1606 -----------------EIIKRN---DFRVFEGKS--LRKHQVKD-------FYRDEKDF 1698 E IK N + RV E L+ +Q K+ ++ D Sbjct: 296 CLYEDGKPIRTVTNFINEDIKLNIDHNRRVEEAARDLLKNYQKKNESYLREAANHEDSDD 355 Query: 1699 DDMSCASSDLFELEN---IGRVEYEQELPLYETTSIQISGKIS 1818 D SCASSDLFEL+N IG Y +ELP+YETT + + I+ Sbjct: 356 DAASCASSDLFELDNLSAIGIDRYREELPVYETTRMDTNRAIA 398 >ref|XP_006407142.1| hypothetical protein EUTSA_v10022002mg [Eutrema salsugineum] gi|557108288|gb|ESQ48595.1| hypothetical protein EUTSA_v10022002mg [Eutrema salsugineum] Length = 367 Score = 114 bits (285), Expect = 2e-22 Identities = 78/201 (38%), Positives = 105/201 (52%), Gaps = 37/201 (18%) Frame = +1 Query: 1306 GRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXXXXKE 1485 G +TK++A K+Y DLKK K PISPGG+++ F NS+F+ Sbjct: 155 GGFLRTKSKALKIYTDLKKVKHPISPGGRLAAFLNSLFTNAASSNPRKPKKTSEPAVSSS 214 Query: 1486 AATCSLVA--SRSCLSKNASSRGSKSKRTVRFCPVTVIVDQD---------------EII 1614 + TCS + SRSCLSK SS G KSKR+VRFCPV VI+D+D ++ Sbjct: 215 STTCSSASSFSRSCLSKTPSSSG-KSKRSVRFCPVNVILDEDSSSIHHIPYGYNNERHVM 273 Query: 1615 KRNDFRVFEG-----KSLRKHQVKDFYR------------DEKDFDDMSCASSDLFELEN 1743 + + RV E ++ +K + KD +E D D S ASSDLFELEN Sbjct: 274 EEENRRVIEAAKDLIRTYQKMKNKDHLAMHADDVTNVDDVEEDDDDAASYASSDLFELEN 333 Query: 1744 ---IGRVEYEQELPLYETTSI 1797 IG Y++ELP+YETT + Sbjct: 334 LSAIGIERYQEELPVYETTRL 354 >ref|XP_007012696.1| Uncharacterized protein TCM_037572 [Theobroma cacao] gi|508783059|gb|EOY30315.1| Uncharacterized protein TCM_037572 [Theobroma cacao] Length = 393 Score = 112 bits (281), Expect = 5e-22 Identities = 83/232 (35%), Positives = 115/232 (49%), Gaps = 61/232 (26%) Frame = +1 Query: 1306 GRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIF--SPRNXXXXXXXXXXXXXXXX 1479 G SKTK +A K+YG+LKK K+PISPGG+I+ F NSIF + + Sbjct: 157 GGFSKTKLKALKIYGELKKVKQPISPGGRITNFLNSIFNANAKKVKMCSVGVSDDVSFDR 216 Query: 1480 KEAATCSLVA--SRSCLSKNASSRGSK----SKRTVRFCPVTVIVDQD------------ 1605 K TCS + SRSCLSK SSRG+K KR+VRFCPV+VIVD+D Sbjct: 217 KSKTTCSSASSFSRSCLSKTPSSRGNKYSNGKKRSVRFCPVSVIVDEDCRPCGHKCIYED 276 Query: 1606 --EIIKRNDFRVFEGKSLRKHQVKDFYR-------------------------------- 1683 ++ + + S RK ++K+F + Sbjct: 277 DPSLMPTSTVQKNVKSSSRKEELKNFVKEKESGVSNKARDYLRSYQRRGTGKLDLRGFVD 336 Query: 1684 -----DEKDFDD-MSCASSDLFELEN-IGRVEYEQELPLYETTSIQISGKIS 1818 DE++ DD +S +SSDLFEL++ IG Y +ELP+YETTS++ I+ Sbjct: 337 DYEDDDEEEEDDALSYSSSDLFELDHLIGIGRYREELPVYETTSLKTKQAIA 388 >ref|XP_004303677.1| PREDICTED: uncharacterized protein LOC101294280 [Fragaria vesca subsp. vesca] Length = 391 Score = 112 bits (281), Expect = 5e-22 Identities = 84/208 (40%), Positives = 110/208 (52%), Gaps = 35/208 (16%) Frame = +1 Query: 1273 PKKLDPPPLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXX 1452 P+ P E KTK++A K+YGDLKK K+PISPGGK+++F NSIF+ N Sbjct: 171 PENHHQKPKQENGFVKTKSKALKIYGDLKKVKQPISPGGKLASFLNSIFNGGN-GKKHKI 229 Query: 1453 XXXXXXXXXKEAATCSLVA--SRSCLSKNASSRGSKS-------KRTVRFCPVTVIVDQD 1605 + CS + SRSCL K + SRG S KR+VRF PV+VIVD+D Sbjct: 230 NIDDLGSKSTNGSNCSSASSFSRSCLRKTSVSRGELSNGGGDSAKRSVRFYPVSVIVDED 289 Query: 1606 ---------EIIKR------NDFRVFEGKSLRKHQVKDFYR---------DEKDFDD-MS 1710 E IKR +D R + R + +K++ + DE D DD S Sbjct: 290 CRPCGHKTLEEIKRSFVMDEDDHRRQVEQVARSYLMKNYQKKTHDVEMEEDEDDDDDAAS 349 Query: 1711 CASSDLFELENIGRVE-YEQELPLYETT 1791 ASSDLFEL+ +G E Y +ELP+YETT Sbjct: 350 YASSDLFELDTVGVEERYREELPVYETT 377 >ref|XP_002891795.1| hypothetical protein ARALYDRAFT_474551 [Arabidopsis lyrata subsp. lyrata] gi|297337637|gb|EFH68054.1| hypothetical protein ARALYDRAFT_474551 [Arabidopsis lyrata subsp. lyrata] Length = 351 Score = 112 bits (280), Expect = 6e-22 Identities = 81/208 (38%), Positives = 104/208 (50%), Gaps = 44/208 (21%) Frame = +1 Query: 1306 GRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIF-----SPRNXXXXXXXXXXXXX 1470 G KTK++A K+Y DLKK K+PISPGG+++TF NSIF + + Sbjct: 143 GSFLKTKSKALKIYSDLKKVKQPISPGGRLATFLNSIFTGAGNTKKLNKINTTVTSATVA 202 Query: 1471 XXXKEAATCSLVA--SRSCLSKNASSRGSKSKRTVRFCPVTVIVDQDEIIKRN------- 1623 TCS + SRSCLSK SS KSKR+VRFCPV VI D+D K N Sbjct: 203 ATASSTTTCSSASSFSRSCLSKTPSS-SEKSKRSVRFCPVNVIFDEDSSSKYNKNKLYGN 261 Query: 1624 ------------DFRVFE---------GKSLRKHQVKD------FYRDEKDFDDMSCASS 1722 + RV E + LR +Q K+ D+ + D +SC SS Sbjct: 262 NEREYESTRHTLEIRVMEENRRVIEAAKELLRTYQKKNKDVVEISGEDDDNDDALSCTSS 321 Query: 1723 DLFELEN---IGRVEYEQELPLYETTSI 1797 DLFEL+N IG Y +ELP+YETT + Sbjct: 322 DLFELDNLSAIGIDRYREELPVYETTRL 349 >ref|XP_007024590.1| Uncharacterized protein TCM_029108 [Theobroma cacao] gi|508779956|gb|EOY27212.1| Uncharacterized protein TCM_029108 [Theobroma cacao] Length = 425 Score = 111 bits (278), Expect = 1e-21 Identities = 85/231 (36%), Positives = 115/231 (49%), Gaps = 56/231 (24%) Frame = +1 Query: 1294 PLSEGRLSKTKTRAQKLYGDLKKAKEPISPGGKISTFFNSIFSPRNXXXXXXXXXXXXXX 1473 P EG +TK++A K+Y DLKK K+PISPGG++++F NS+F+ N Sbjct: 190 PKHEGGFVRTKSKALKIYSDLKKVKQPISPGGRLASFLNSLFTAGNAKKAKISSSGYEER 249 Query: 1474 XXKE---AATCSLVA--SRSCLSKNASSRGSKS----KRTVRFCPVTVIVDQD------- 1605 K ++TCS + SRSCLSK SSRG S KR+VRFCPV+VI+D+D Sbjct: 250 KLKSEQTSSTCSSASSFSRSCLSKTPSSRGKLSSNGTKRSVRFCPVSVILDEDSRPCGHK 309 Query: 1606 ------------------EIIKRN---DFRVFEGKS--LRKHQVK--------------D 1674 E+ RN + RV E L+ +Q K D Sbjct: 310 SIHYENDQTSMIRKPSNKELEFRNLEENRRVVEAAKDLLKSYQKKKEEYDMRDVRNGNGD 369 Query: 1675 FYRDEKDFDDMSCASSDLFELEN---IGRVEYEQELPLYETTSIQISGKIS 1818 D+ + D S ASSDLFEL+N IG Y +ELP+YETT + + I+ Sbjct: 370 SSEDDDEEDAASYASSDLFELDNLSAIGIERYREELPVYETTHLDTNRAIA 420