BLASTX nr result
ID: Forsythia21_contig00030688
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00030688
         (1570 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160...  582   e-163
ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971...  561   e-157
ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264...  546   e-152
ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264...  539   e-150
ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citr...  528   e-147
ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267...  526   e-146
ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584...  521   e-145
ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119...  520   e-144
ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma ca...  518   e-144
emb|CDP05152.1| unnamed protein product [Coffea canephora]           516   e-143
ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595...  513   e-142
ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333...  513   e-142
ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139...  512   e-142
ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595...  512   e-142
ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Popu...  511   e-142
ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648...  509   e-141
ref|XP_002513660.1| conserved hypothetical protein [Ricinus comm...  509   e-141
gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]           506   e-140
ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798...
506 e-140 ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prun... 505 e-140 >ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160444 [Sesamum indicum] Length = 455 Score = 582 bits (1501), Expect = e-163 Identities = 326/455 (71%), Positives = 350/455 (76%), Gaps = 9/455 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRN--------IPLFQDSSRLTSYEYKGKVL-ISP 1316 MA KLL P + PPP R S + PL QD S LTS K + L +SP Sbjct: 1 MALKLLFSQPINCHPPPLQRSRFASHQKPSQIPTARSPLIQDFSLLTSSSNKDRSLNLSP 60 Query: 1315 FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVST 1136 PK +RSV+A SQLNFPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVS Sbjct: 61 NTNPKNVARSVVAKSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSI 120 Query: 1135 LVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 956 LVGLAASNLGIIASEAPAY VV+ F LYRADMRR+IRSTGTLLLAFLLGSVA Sbjct: 121 LVGLAASNLGIIASEAPAYKVVLEFLLPLAVPLLLYRADMRRIIRSTGTLLLAFLLGSVA 180 Query: 955 TTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 776 TT GT VAFL+VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+V+PSVLAAGLAAD Sbjct: 181 TTAGTAVAFLLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALEVTPSVLAAGLAAD 240 Query: 775 NVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTA 596 NVICA+YFTTLFALASKIPAE +TST+D LNEES S NKLPVLQTATALA SF ICK+A Sbjct: 241 NVICAIYFTTLFALASKIPAESATSTTDGGLNEESESSNKLPVLQTATALAVSFIICKSA 300 Query: 595 SFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 416 SFLT Y GIQG +LP +TAIVVILAT P QFAYLAPSGEAMA+ILMQVFF V+GASGSI Sbjct: 301 SFLTNYLGIQGATLPTITAIVVILATMLPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 360 Query: 415 RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236 R+VI+TAPSIFLF LVQI VHLAIILG GKL RFDLKLLL+ASNANV Sbjct: 361 RSVISTAPSIFLFALVQIGVHLAIILGLGKLLRFDLKLLLLASNANVGGPTTACGMATAK 420 Query: 235 GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 GWSSLVVP GQAVLKFM Sbjct: 421 GWSSLVVPGILAGIFGIAIATFLGIAFGQAVLKFM 455 >ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971807 [Erythranthe guttatus] gi|604306080|gb|EYU25137.1| 
hypothetical protein MIMGU_mgv1a006291mg [Erythranthe guttata] Length = 449 Score = 561 bits (1447), Expect = e-157 Identities = 311/455 (68%), Positives = 350/455 (76%), Gaps = 9/455 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNI---------PLFQDSSRLTSYEYKGKVLISP 1316 MA K+L PT Y+PPP R I + RN P FQ+S T K + L + Sbjct: 1 MAGKILLFHPT-YIPPPPARRSIVASRNAASQIPDTHTPSFQNSPLSTFSSDKFRTLKT- 58 Query: 1315 FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVST 1136 + + +RSV+A SQLNFPIISP D WGTWTALFA GAFGIWSEKT+IGSALSGALVST Sbjct: 59 --ISRNPARSVVARSQLNFPIISPHDQWGTWTALFAAGAFGIWSEKTKIGSALSGALVST 116 Query: 1135 LVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 956 LVGLAASNLGIIASE AYNVV+ F LYRADMRRVI+STGTLLLAFLLGSVA Sbjct: 117 LVGLAASNLGIIASETAAYNVVLEFLLPLAVPLLLYRADMRRVIKSTGTLLLAFLLGSVA 176 Query: 955 TTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 776 TT+GT+VA+ +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL VSPSVLAAGLAAD Sbjct: 177 TTVGTLVAYFLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVSPSVLAAGLAAD 236 Query: 775 NVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTA 596 NVICA+YFTTLFALASKIP+E S+ T + NEES S NKLPVLQTATA+A SF ICK A Sbjct: 237 NVICAIYFTTLFALASKIPSESSSPTPGI--NEESESDNKLPVLQTATAVAVSFIICKIA 294 Query: 595 SFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 416 + LT++FGIQGG+LPA+TAIVV+LAT+FP QFAYLAPSGEAMA+ILMQVFF V+GASGSI Sbjct: 295 TVLTKHFGIQGGTLPAITAIVVVLATSFPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 354 Query: 415 RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236 RNVITTAPSIFLF L+QI VHLA+ILG GKLFRFDL+LLL+ASNANV Sbjct: 355 RNVITTAPSIFLFALIQIGVHLAVILGLGKLFRFDLRLLLLASNANVGGPTTACGMATAK 414 Query: 235 GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 GW+SL+VP GQAVL+FM Sbjct: 415 GWTSLIVPGILAGIFGIAIATFLGIAFGQAVLRFM 449 >ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264478 isoform X1 [Vitis vinifera] gi|302143806|emb|CBI22667.3| unnamed protein product [Vitis vinifera] Length = 449 Score = 546 bits (1407), Expect = e-152 Identities = 
309/453 (68%), Positives = 338/453 (74%), Gaps = 7/453 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YEYKGKVLISPFNMP 1304 MASK L++ +P +P S +N P SS T + K + +SP P Sbjct: 1 MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56 Query: 1303 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1130 K S RSV S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV Sbjct: 57 KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116 Query: 1129 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 950 GLAASNLGII+ EAPAY+VV+ F L+RAD+RRVI+STG LL+AFL+GSVATT Sbjct: 117 GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176 Query: 949 IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 770 IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV Sbjct: 177 IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236 Query: 769 ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 590 ICAVYFTTLFALASKIP E STS +D +NE+ GNK PVL TATALA SFAICK F Sbjct: 237 ICAVYFTTLFALASKIPPEDSTSANDTGMNEQPEPGNKPPVLLTATALAVSFAICKAGIF 296 Query: 589 LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 410 LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N Sbjct: 297 LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 356 Query: 409 VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 230 V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV GW Sbjct: 357 VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 416 Query: 229 SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 SSLVVP G VLKFM Sbjct: 417 SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 449 >ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264478 isoform X2 [Vitis vinifera] Length = 447 Score = 539 bits (1389), Expect = e-150 Identities = 308/453 (67%), Positives = 337/453 (74%), Gaps = 7/453 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YEYKGKVLISPFNMP 1304 MASK L++ +P +P S +N P SS T + K + +SP P Sbjct: 1 
MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56 Query: 1303 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1130 K S RSV S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV Sbjct: 57 KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116 Query: 1129 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 950 GLAASNLGII+ EAPAY+VV+ F L+RAD+RRVI+STG LL+AFL+GSVATT Sbjct: 117 GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176 Query: 949 IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 770 IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV Sbjct: 177 IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236 Query: 769 ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 590 ICAVYFTTLFALASKIP E STS + +NE+ GNK PVL TATALA SFAICK F Sbjct: 237 ICAVYFTTLFALASKIPPEDSTSANG--MNEQPEPGNKPPVLLTATALAVSFAICKAGIF 294 Query: 589 LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 410 LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N Sbjct: 295 LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 354 Query: 409 VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 230 V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV GW Sbjct: 355 VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 414 Query: 229 SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 SSLVVP G VLKFM Sbjct: 415 SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 447 >ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citrus clementina] gi|568875109|ref|XP_006490652.1| PREDICTED: uncharacterized protein LOC102608862 [Citrus sinensis] gi|557523884|gb|ESR35251.1| hypothetical protein CICLE_v10004922mg [Citrus clementina] Length = 466 Score = 528 bits (1359), Expect = e-147 Identities = 279/393 (70%), Positives = 318/393 (80%), Gaps = 1/393 (0%) Frame = -1 Query: 1387 NIPLFQDSSRLTSYEYKGKVLISPFNMPKKSSRSVIASSQL-NFPIISPQDHWGTWTALF 1211 +IP Q S+ S+ L F P +RSV A SQL NFP+ISP D WGTWTALF Sbjct: 47 
SIPQHQSSASYLSHSRTNTFLSPQFPHPSNRTRSVTARSQLPNFPLISPHDKWGTWTALF 106 Query: 1210 ATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXL 1031 ATGAFGIWSE+T+IGSALSGALVSTL+GLAASNLG+++ E+PAY++V+ F L Sbjct: 107 ATGAFGIWSERTKIGSALSGALVSTLIGLAASNLGVVSCESPAYSIVLEFLLPLAVPLLL 166 Query: 1030 YRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGG 851 +RAD+RRVI+STGTLLLAFL+GSVATT+GT +A+L+VPM+SLGQD WKIAAALMGRHIGG Sbjct: 167 FRADLRRVIKSTGTLLLAFLIGSVATTVGTALAYLLVPMRSLGQDSWKIAAALMGRHIGG 226 Query: 850 AVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEES 671 AVNYVAIS+AL VS SVLAAGLAADNVICAVYFTTLFALAS IPAE STS DV +NE S Sbjct: 227 AVNYVAISDALGVSSSVLAAGLAADNVICAVYFTTLFALASNIPAESSTSVDDVSMNEGS 286 Query: 670 GSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYL 491 G+K PVLQ ATALA +FAICK +FLT+YFGIQGGSLPA+TAIVV LATTFP QF L Sbjct: 287 VRGDKPPVLQFATALAVAFAICKAGTFLTKYFGIQGGSLPAITAIVVTLATTFPTQFNKL 346 Query: 490 APSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFD 311 AP+GEAMA+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQIA+HLA+ILG GKLFRFD Sbjct: 347 APAGEAMALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQIAIHLAVILGLGKLFRFD 406 Query: 310 LKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212 KLLLIASNANV GWSSL+VP Sbjct: 407 QKLLLIASNANVGGPTTACGMATAKGWSSLIVP 439 >ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267717 [Solanum lycopersicum] Length = 462 Score = 526 bits (1356), Expect = e-146 Identities = 300/452 (66%), Positives = 334/452 (73%), Gaps = 6/452 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEY----RPFIHSGRNIPLFQDSSRLTSYEYKGKVLISPFNMPK 1301 MA K L L Y+P P R + + + Q L+ K K L P N + Sbjct: 13 MALKQLLFLHNPYIPSPASYSCRRKNASAATSSTVLQHPMLLSMNIDKFKPLDFPKNSTR 72 Query: 1300 KSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGL 1124 K +RSV SQLNFPIISPQD WGTWT LFATGAFGIWSEKT+IG+ALSG+LVS LVGL Sbjct: 73 KLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKIGAALSGSLVSVLVGL 132 Query: 1123 AASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIG 944 AASNLGIIASEAPAY +V F L+RADMRRV++STGTLL+AFLLGSVATTIG 
Sbjct: 133 AASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLMAFLLGSVATTIG 192 Query: 943 TVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVIC 764 TVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN+IC Sbjct: 193 TVVAFFIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADNLIC 252 Query: 763 AVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLT 584 AVYFTTLFALASKIPAE + S SD ++ ES SGNKLPVLQTATALA SFAICK LT Sbjct: 253 AVYFTTLFALASKIPAEAAQSVSDDKV--ESESGNKLPVLQTATALAVSFAICKAGELLT 310 Query: 583 RYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSIRNV 407 ++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI NV Sbjct: 311 KHFGIQGGLLPIITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSISNV 370 Query: 406 ITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWS 227 + TAPSIFLF L+QIAVHLA+ILG GKL R +LK LLIASNANV GW Sbjct: 371 LNTAPSIFLFALIQIAVHLAVILGVGKLLRLELKELLIASNANVGGPTTACGMATAKGWI 430 Query: 226 SLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 SLVVP GQ VLKF+ Sbjct: 431 SLVVPGILAGIFGIAIATFLGIAFGQTVLKFI 462 >ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584987 [Solanum tuberosum] Length = 453 Score = 521 bits (1343), Expect = e-145 Identities = 296/455 (65%), Positives = 333/455 (73%), Gaps = 9/455 (1%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPP-------EYRPFIHSGRNIPLFQDSSRLTSYEYKGKVLISPFN 1310 MA K L L Y+P P + S + + Q L+ K K L P N Sbjct: 1 MALKQLLFLHNPYIPSPASCSSRRKNASAATSSTSNSILQHPMLLSKDIDKFKPLDFPKN 60 Query: 1309 MPKKSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTL 1133 +K +RSV SQLNFPIISPQD WGTWT LFATGAFGIWSEKT++G+ALSG+LVS L Sbjct: 61 STRKLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKVGAALSGSLVSVL 120 Query: 1132 VGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVAT 953 VGLAASNLGIIASEAPAY +V F L+RADMRRV++STGTLLLAFLLGSVAT Sbjct: 121 VGLAASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLLAFLLGSVAT 180 Query: 952 TIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADN 773 TIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN Sbjct: 181 
TIGTVVAFCIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADN 240 Query: 772 VICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTAS 593 +ICAVYFTTLFAL SKIPAE + S +D +++ E SGNKLPVLQTATALA SFAICK Sbjct: 241 LICAVYFTTLFALTSKIPAEATQSATDDKVDSE--SGNKLPVLQTATALAVSFAICKAGE 298 Query: 592 FLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSI 416 LT++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI Sbjct: 299 LLTKHFGIQGGLLPTITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSI 358 Query: 415 RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236 NV+ TAPSIFLF +QIAVHLA+ILG GKL + +LK LLIASNANV Sbjct: 359 SNVLNTAPSIFLFAFIQIAVHLAVILGVGKLLQLELKELLIASNANVGGPTTACGMATAK 418 Query: 235 GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 GW S+VVP GQAVLKFM Sbjct: 419 GWISMVVPGILAGIFGIAIATFLGIAFGQAVLKFM 453 >ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119150 [Nicotiana tomentosiformis] Length = 452 Score = 520 bits (1339), Expect = e-144 Identities = 296/458 (64%), Positives = 339/458 (74%), Gaps = 12/458 (2%) Frame = -1 Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTSYE--YKGKVLIS------PF 1313 MASKL + + PP Y P +N+P +S +TS + +L+S P Sbjct: 1 MASKLWFLHNLYIPPPASYSP---RRQNVPA---ASAITSANTILQHPMLLSNIDKYTPL 54 Query: 1312 NMPKKS---SRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGAL 1145 + PK S +RSV SQLNFPIISPQD WGTWTALFATGAFGIWSEKT++G ALSGAL Sbjct: 55 DFPKSSKKLNRSVTTIRSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKVGGALSGAL 114 Query: 1144 VSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLG 965 VSTLVGLAASNLGIIA EAPAY +V F L+RADMRRV++STGTLLLAFLLG Sbjct: 115 VSTLVGLAASNLGIIACEAPAYKIVTGFLLPLAVPLLLFRADMRRVLQSTGTLLLAFLLG 174 Query: 964 SVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGL 785 SVATTIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+ SPSV+ AGL Sbjct: 175 SVATTIGTVVAFWIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALETSPSVVTAGL 234 Query: 784 AADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAIC 605 AADN+ICAVYFTTLFALASKIPAE + S ++ +++ ES SGN LPVLQ+ATALA SFAIC Sbjct: 
235 AADNLICAVYFTTLFALASKIPAEATPSAAEDKIDGESESGNTLPVLQSATALAVSFAIC 294 Query: 604 KTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS 425 K FLT++F IQGG+LP +TAIVVILAT+FP QFA LAPSGEAMA+ILMQVFF +GA+ Sbjct: 295 KAGDFLTKHFVIQGGTLPIITAIVVILATSFPTQFADLAPSGEAMALILMQVFFAFIGAN 354 Query: 424 GSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXX 245 GSI NV+ TAPSIF+F LVQI VHLA+ILG GKL RF+L+ LLIASNANV Sbjct: 355 GSILNVMNTAPSIFVFVLVQIGVHLAVILGVGKLLRFELEQLLIASNANVGGPTTACGMA 414 Query: 244 XXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 GW SLVVP GQ +LKFM Sbjct: 415 TAKGWISLVVPGILAGIFGITIATFLGIAFGQVILKFM 452 >ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma cacao] gi|508776038|gb|EOY23294.1| Keratin-associated protein 5-4 [Theobroma cacao] Length = 466 Score = 518 bits (1333), Expect = e-144 Identities = 272/362 (75%), Positives = 303/362 (83%) Frame = -1 Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118 ++R V SQLNFP+ISP D WGTWTALFA GAFGIWSEKT+IGSALSGALVSTL+GLAA Sbjct: 78 ANRPVTVKSQLNFPLISPNDQWGTWTALFAIGAFGIWSEKTKIGSALSGALVSTLIGLAA 137 Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938 SNLGII+ EA AY+ V+ F L+RAD+RRVI+STG LLLAFLLGSVATT+GT Sbjct: 138 SNLGIISCEAKAYSTVLEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 197 Query: 937 VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758 +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL VSPSVLAAGLAADNVICAV Sbjct: 198 LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALGVSPSVLAAGLAADNVICAV 257 Query: 757 YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578 YFTTLFALASK+P E STS DV + E S SG+KLPVLQ ATALA SF+ICK ++LT+Y Sbjct: 258 YFTTLFALASKVPPETSTSPEDVAMVEGSESGSKLPVLQIATALAVSFSICKLGAYLTKY 317 Query: 577 FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398 FGI GGSLPAVTAIVVILAT FP QF LAP+GEAMA+ILMQVFFTVVGASG+I NVI T Sbjct: 318 FGIPGGSLPAVTAIVVILATVFPTQFGRLAPAGEAMALILMQVFFTVVGASGNIWNVINT 377 Query: 397 APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 
218 APSIF+F LVQIA+HLA+ILG GKLFRFDLKLLLIASNANV GWSS+V Sbjct: 378 APSIFMFALVQIAIHLALILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGWSSMV 437 Query: 217 VP 212 VP Sbjct: 438 VP 439 >emb|CDP05152.1| unnamed protein product [Coffea canephora] Length = 459 Score = 516 bits (1330), Expect = e-143 Identities = 273/355 (76%), Positives = 300/355 (84%), Gaps = 1/355 (0%) Frame = -1 Query: 1273 SQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIAS 1094 SQL++PIISPQDHWGTWTALFATGAFGIWSE+T+IGS LSGALVS LVGLAASNLGII Sbjct: 78 SQLSYPIISPQDHWGTWTALFATGAFGIWSERTKIGSTLSGALVSILVGLAASNLGIIPC 137 Query: 1093 EAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPM 914 +APAY +V+ L+RAD+RRVI+STGTLLLAFLLGSVATT+GT VAFL+VPM Sbjct: 138 DAPAYKIVLQILLPMAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTLGTAVAFLLVPM 197 Query: 913 QSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFAL 734 +SLGQDGWKIAAALMGRHIGGAVNYVAISEAL V+PSVLAAGLAADNVICA+YFTTLFAL Sbjct: 198 RSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVTPSVLAAGLAADNVICAIYFTTLFAL 257 Query: 733 ASKIPAEPSTSTSDVELNEE-SGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGS 557 AS IP E ST+T+D + + S SGNKLPVL TATALA SFAICK S +YFGI GGS Sbjct: 258 ASGIPPEASTATTDADAGYDISESGNKLPVLPTATALAVSFAICKAGSSFAKYFGISGGS 317 Query: 556 LPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLF 377 LPA+TAIVVILAT FP+ FA+LAPSGEAMA+ILMQVFFTVVGASGS+ NVI TAPSI LF Sbjct: 318 LPAITAIVVILATVFPRLFAHLAPSGEAMALILMQVFFTVVGASGSMWNVINTAPSILLF 377 Query: 376 CLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212 LVQIAVHLA+ILG GKLFRFDLKLLL+ASNANV GWSSLVVP Sbjct: 378 ALVQIAVHLAVILGLGKLFRFDLKLLLLASNANVGGPTTACGMATAKGWSSLVVP 432 >ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595990 [Nelumbo nucifera] Length = 457 Score = 513 bits (1321), Expect = e-142 Identities = 267/378 (70%), Positives = 309/378 (81%), Gaps = 2/378 (0%) Frame = -1 Query: 1339 KGKVLISPFNMPKKSSR--SVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIG 1166 + K LISP +PK S+ +QLNFP+ISP+DHWGTWTALFAT AFGIWSEKT+IG Sbjct: 53 
RSKTLISPLTIPKNHGPVPSLKTRAQLNFPLISPKDHWGTWTALFATSAFGIWSEKTKIG 112 Query: 1165 SALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTL 986 SALSG+LVS LVGLAASN+GII+ EAPAY+VVM + L+RAD+RRVI STGTL Sbjct: 113 SALSGSLVSILVGLAASNIGIISCEAPAYSVVMEYLLPMAVPLLLFRADLRRVIMSTGTL 172 Query: 985 LLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSP 806 LLAFLLGSVATTIGT+VA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL V+P Sbjct: 173 LLAFLLGSVATTIGTLVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVTP 232 Query: 805 SVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATAL 626 SVLAAGLAADNVICA+YFT+LFALAS IP E S ST D ++ +S GNKLPVLQTA A+ Sbjct: 233 SVLAAGLAADNVICAIYFTSLFALASNIPPEASKSTEDGVIDAKSEPGNKLPVLQTAIAI 292 Query: 625 AASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVF 446 A SF+ICKTA++LT+ GIQGGSLP +TA+VVILAT FP QF YLAP+GEA+A+ILMQVF Sbjct: 293 AVSFSICKTATYLTKLLGIQGGSLPCITALVVILATIFPAQFGYLAPAGEAVALILMQVF 352 Query: 445 FTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXX 266 F VVGA+GSI NVI TAPS+F+F L+QI +HLA+ILG GKL RFD KLLL+ASNANV Sbjct: 353 FAVVGANGSIWNVINTAPSVFMFALLQITIHLAVILGVGKLLRFDQKLLLLASNANVGGP 412 Query: 265 XXXXXXXXXXGWSSLVVP 212 GW SLV+P Sbjct: 413 TTACGMATAKGWGSLVIP 430 >ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333733 isoform X1 [Prunus mume] Length = 463 Score = 513 bits (1320), Expect = e-142 Identities = 272/399 (68%), Positives = 312/399 (78%), Gaps = 1/399 (0%) Frame = -1 Query: 1324 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1148 +SP P RSV QLN P+IS D WGTWTALFATGAFGIWSEK T++G+ALSGA Sbjct: 65 LSPPAPPDLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124 Query: 1147 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 968 LVSTL+GLAASNLGII+S APA+++V+ F LYRAD+RRVI+STG LLLAFLL Sbjct: 125 LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184 Query: 967 GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 788 GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG Sbjct: 185 
GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244 Query: 787 LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 608 LAADNVICAVYF+TLFALASK+P EPSTS +E + S GNKLP++QTATAL+ S AI Sbjct: 245 LAADNVICAVYFSTLFALASKVPPEPSTSDDGIEKDASSEPGNKLPLIQTATALSVSLAI 304 Query: 607 CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 428 CK+ +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF+VVGA Sbjct: 305 CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFSVVGA 364 Query: 427 SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 248 SG+I NVI TAPSIF F L+QIAVHLA+ILG GKL FDLKLLLIASNANV Sbjct: 365 SGNIWNVINTAPSIFFFALIQIAVHLAVILGLGKLMGFDLKLLLIASNANVGGPTTACGM 424 Query: 247 XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 W+S++VP G AVLK+M Sbjct: 425 ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463 >ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139195 isoform X1 [Populus euphratica] gi|743901093|ref|XP_011043860.1| PREDICTED: uncharacterized protein LOC105139195 isoform X1 [Populus euphratica] Length = 452 Score = 512 bits (1319), Expect = e-142 Identities = 284/422 (67%), Positives = 326/422 (77%), Gaps = 7/422 (1%) Frame = -1 Query: 1456 LLSVLPTHYLPP-PEYRPFIHSGRNIPLFQDSSR---LTSYEYKGKV-LISPFNMPK--K 1298 + S LP + P P RP S +N P + L S Y + +SP P + Sbjct: 1 MASRLPLLHSPVVPFRRPCFVSRQNSPTTTANPTRRTLLSANYGNQTSFLSPQKNPNLIR 60 Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118 SS +V ++ LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSGALVSTLVGLAA Sbjct: 61 SSVTVRSNMILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSGALVSTLVGLAA 120 Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938 SNLGII+ E+PAY+ V+ F L+RAD+RRVI+STGTLLLAFLLGSVATT+GTV Sbjct: 121 SNLGIISCESPAYSTVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTVGTV 180 Query: 937 VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758 +A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL VSPSVLAAGLAADNVICAV Sbjct: 181 LAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAV 240 Query: 
757 YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578 YFT+LFALASKIPAE S S ++ S SGNKLPVLQTATALA SFAICK ++T++ Sbjct: 241 YFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFAICKAGEYITKF 300 Query: 577 FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398 F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVGASG++RNVI T Sbjct: 301 FAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVGASGNVRNVINT 360 Query: 397 APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218 APSIF+F LVQIA+HLA+ILG GKLFRFD KLLLIASNANV GWSSLV Sbjct: 361 APSIFMFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACGMATAKGWSSLV 420 Query: 217 VP 212 VP Sbjct: 421 VP 422 >ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595989 [Nelumbo nucifera] Length = 458 Score = 512 bits (1318), Expect = e-142 Identities = 269/377 (71%), Positives = 309/377 (81%), Gaps = 3/377 (0%) Frame = -1 Query: 1333 KVLISPFNMPKKS---SRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGS 1163 K +SP PK + +RSV +QL+FP+ISP+DHWGTWTALF + AFGIWSEKT++GS Sbjct: 55 KTFLSPSTFPKGNPDLNRSVKTKAQLSFPLISPKDHWGTWTALFVSSAFGIWSEKTKVGS 114 Query: 1162 ALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLL 983 ALSGALVSTLVGL ASNLGII+ EAPAY++VM + L+RAD+RRVI STGTLL Sbjct: 115 ALSGALVSTLVGLGASNLGIISCEAPAYSLVMEYLLPMAVPLLLFRADLRRVILSTGTLL 174 Query: 982 LAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPS 803 AFLLGSVATTIGT+VA+LMVPM+SLG D WKIAAALMGRHIGGAVNYVAISEAL VSPS Sbjct: 175 SAFLLGSVATTIGTIVAYLMVPMRSLGHDNWKIAAALMGRHIGGAVNYVAISEALAVSPS 234 Query: 802 VLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALA 623 VLAAGLAADNVICA+YFT+LFALAS+IP E +T T+D ++ ES GNKLPVLQTATALA Sbjct: 235 VLAAGLAADNVICAIYFTSLFALASQIPPESTTPTNDDVIDTESQIGNKLPVLQTATALA 294 Query: 622 ASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFF 443 SFAICKT ++L++ GIQGG+LP +TAIVVILAT FP QF YLAP+GEA+A+ILMQVFF Sbjct: 295 VSFAICKTGTYLSKLLGIQGGNLPCITAIVVILATIFPAQFGYLAPAGEAVALILMQVFF 354 Query: 442 TVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXX 263 
VVGA+GSI NVI TAPSIF+F LVQIAVHLA+ILG GKL +FD KLLL+ASNANV Sbjct: 355 AVVGANGSIWNVINTAPSIFMFSLVQIAVHLAVILGVGKLMQFDQKLLLLASNANVGGPA 414 Query: 262 XXXXXXXXXGWSSLVVP 212 GW SLVVP Sbjct: 415 TACGMASTKGWGSLVVP 431 >ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Populus trichocarpa] gi|550340557|gb|EEE85755.2| hypothetical protein POPTR_0004s07750g [Populus trichocarpa] Length = 452 Score = 511 bits (1315), Expect = e-142 Identities = 271/373 (72%), Positives = 309/373 (82%), Gaps = 2/373 (0%) Frame = -1 Query: 1324 ISPFNMPK--KSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSG 1151 +SP P +SS +V ++ LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSG Sbjct: 50 LSPQKNPNLIRSSVTVRSNLILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSG 109 Query: 1150 ALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFL 971 ALVSTLVGLAASNLGII+ E+PAY++V+ F L+RAD+RRVI+STGTLLLAFL Sbjct: 110 ALVSTLVGLAASNLGIISCESPAYSIVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFL 169 Query: 970 LGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAA 791 LGSVATT+GTV+A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL+VSPSVLAA Sbjct: 170 LGSVATTVGTVLAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALRVSPSVLAA 229 Query: 790 GLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFA 611 GLAADNVICAVYFT+LFALASKIPAE S S ++ S SGNKLPVLQTATALA SFA Sbjct: 230 GLAADNVICAVYFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFA 289 Query: 610 ICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVG 431 ICK ++T++F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVG Sbjct: 290 ICKAGEYITKFFAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVG 349 Query: 430 ASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXX 251 ASG++ NVI TAPSIFLF LVQIA+HLA+ILG GKLFRFD KLLLIASNANV Sbjct: 350 ASGNVWNVINTAPSIFLFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACG 409 Query: 250 XXXXXGWSSLVVP 212 GWSSLVVP Sbjct: 410 MATAKGWSSLVVP 422 >ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648428 [Jatropha curcas] gi|643706105|gb|KDP22237.1| hypothetical protein JCGZ_26068 [Jatropha curcas] 
Length = 459 Score = 509 bits (1310), Expect = e-141 Identities = 275/392 (70%), Positives = 311/392 (79%), Gaps = 2/392 (0%) Frame = -1 Query: 1381 PLFQDSSRLTSYEYKGKVLISP--FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFA 1208 P Q SS S + +SP + S RSV S LNFP+ISP D WGTWTALFA Sbjct: 43 PALQSSS--ISLGNRSHTFLSPELYTEDSSSLRSVAVRSNLNFPLISPGDRWGTWTALFA 100 Query: 1207 TGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLY 1028 TGAFGIWSEKT+IGSALSGALVSTLVGLAASNLGII+ E+PAY +V+ F L+ Sbjct: 101 TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIISCESPAYPIVLEFLLPLAVPLLLF 160 Query: 1027 RADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGA 848 RAD+RRVI+STGTLLLAFL+GSVATT+GT+VA+ +VPM+SLGQD WKIAAALMGRHIGGA Sbjct: 161 RADLRRVIQSTGTLLLAFLIGSVATTVGTLVAYWIVPMRSLGQDSWKIAAALMGRHIGGA 220 Query: 847 VNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESG 668 VNYVAIS+AL VS SVLA+GLAADNVICAVYFTTLFALASKIP E S ST+D + E+ Sbjct: 221 VNYVAISDALGVSSSVLASGLAADNVICAVYFTTLFALASKIPPESSVSTNDGAIESETE 280 Query: 667 SGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLA 488 +KLPVL+ ATA+A SFAICK SF+T+ FGIQGG LPAVTAIVVILAT FP QF LA Sbjct: 281 PSDKLPVLKIATAIAVSFAICKAGSFVTKLFGIQGGILPAVTAIVVILATAFPTQFNQLA 340 Query: 487 PSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDL 308 PSGEA+A+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQI VHLA+ILG GKLFRFDL Sbjct: 341 PSGEAIALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQITVHLAVILGLGKLFRFDL 400 Query: 307 KLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212 KLLL+ASNANV GW+SLVVP Sbjct: 401 KLLLLASNANVGGPTTACGMATAKGWNSLVVP 432 >ref|XP_002513660.1| conserved hypothetical protein [Ricinus communis] gi|223547568|gb|EEF49063.1| conserved hypothetical protein [Ricinus communis] Length = 965 Score = 509 bits (1310), Expect = e-141 Identities = 281/409 (68%), Positives = 316/409 (77%), Gaps = 3/409 (0%) Frame = -1 Query: 1429 LPPPEYRPFIHSGRNIPL-FQDSSRLTSYEYKGKVLISPFNMP--KKSSRSVIASSQLNF 1259 + P Y+ F + PL F + S + + +SP P S RS+ S LNF Sbjct: 31 MSPQSYQSF----KIYPLHFHSNDNDNSNNNRNQTFLSPQLYPGDPSSRRSLAVRSNLNF 86 Query: 1258 
PIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAY 1079 P+IS D WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLVGLA SNLGII+ E+PAY Sbjct: 87 PLISSNDRWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAGSNLGIISCESPAY 146 Query: 1078 NVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQ 899 VV+ F L+RAD+RRVIRSTGTLLLAFLLGSVATT+GTVVA+ +VPM+SLGQ Sbjct: 147 AVVLEFLLPLAVPLLLFRADLRRVIRSTGTLLLAFLLGSVATTVGTVVAYWIVPMRSLGQ 206 Query: 898 DGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIP 719 D WKIAAALMGRHIGGAVNYVAI++AL VS SVLA+GLAADNVICAVYFTTLFALASKIP Sbjct: 207 DSWKIAAALMGRHIGGAVNYVAIADALGVSSSVLASGLAADNVICAVYFTTLFALASKIP 266 Query: 718 AEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTA 539 AE STS+++ + S SG KLPVLQ AT+LA S AICK S++T+ FGIQGG LPAVTA Sbjct: 267 AETSTSSNEDGMESGSVSGEKLPVLQLATSLAVSLAICKAGSYVTKLFGIQGGILPAVTA 326 Query: 538 IVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIA 359 IVVILAT FP QF LAPSGEAMA+ILMQVFFTVVGASG+I NV+ TAPSIF+F LVQIA Sbjct: 327 IVVILATAFPTQFNGLAPSGEAMALILMQVFFTVVGASGNIWNVVKTAPSIFMFALVQIA 386 Query: 358 VHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212 VHL IILG GKLFRFD KLLL+ASNANV GWSSLVVP Sbjct: 387 VHLVIILGLGKLFRFDQKLLLLASNANVGGPTTACGMATAKGWSSLVVP 435 Score = 470 bits (1209), Expect = e-129 Identities = 254/410 (61%), Positives = 302/410 (73%), Gaps = 11/410 (2%) Frame = -1 Query: 1408 PFIHSGRNIPLFQDSSRLTSYEYKGKV--------LISPFNMPKKSS---RSVIASSQLN 1262 P +HS + L S L+ + + K+ SP + ++ R + SQL Sbjct: 529 PLLHSSCSPSLRISSRHLSPFSSRHKLSHPNINEAAFSPSTISLNNTSLIRQIKLRSQLR 588 Query: 1261 FPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPA 1082 FP+ISP DHWGTWTALFATGAFGIWSE T++GS +S ALVSTLVGLAASN+GII E A Sbjct: 589 FPLISPDDHWGTWTALFATGAFGIWSEGTKVGSMVSAALVSTLVGLAASNIGIIPYETAA 648 Query: 1081 YNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLG 902 Y++V+ F L+RAD+R VIRSTG L LAFLLGSVAT IGT VAFLMVPM+SLG Sbjct: 649 YSLVLEFLLPLTVPLLLFRADLRNVIRSTGKLFLAFLLGSVATIIGTTVAFLMVPMRSLG 708 Query: 901 
QDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKI 722 D WKIAAALMG +IGG+VNYVAISEAL SPSV+AAG+AADNVICA YF LFALASKI Sbjct: 709 PDNWKIAAALMGSYIGGSVNYVAISEALGTSPSVVAAGIAADNVICATYFMALFALASKI 768 Query: 721 PAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVT 542 PAE S ST+ VE++ ES S K+PVLQ A ALA SF IC+TA++LT+ +QGG+LPA+T Sbjct: 769 PAENSASTNGVEMDVESSSTGKIPVLQMAAALAISFMICRTATYLTQLCKVQGGNLPAIT 828 Query: 541 AIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQI 362 AIVV LAT+FP QF LAP+G+ +A++LMQVFF VVGASGSI NVI TAPSIFLF LVQ+ Sbjct: 829 AIVVFLATSFPVQFGRLAPAGDTIALVLMQVFFAVVGASGSIWNVIKTAPSIFLFALVQL 888 Query: 361 AVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212 VHLA++LG G+LF FDLKLLL+ASNAN+ GW SLVVP Sbjct: 889 TVHLAVVLGLGRLFDFDLKLLLLASNANIGGPTTACGMATAKGWKSLVVP 938 >gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum] Length = 464 Score = 506 bits (1304), Expect = e-140 Identities = 263/362 (72%), Positives = 302/362 (83%) Frame = -1 Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118 ++R++I SQLN P+ISP D WGTWTALFATGAFG+WSE T+ GSALSGALVSTL+GLAA Sbjct: 76 ATRTLIVKSQLNSPLISPNDQWGTWTALFATGAFGLWSENTKAGSALSGALVSTLIGLAA 135 Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938 SNLGII+SEA AY++V F L+RAD+RRVI+STG LLLAFLLGSVATT+GT Sbjct: 136 SNLGIISSEAKAYSIVKEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 195 Query: 937 VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758 +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL+ S SVLAAGLAADNVICAV Sbjct: 196 LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALETSESVLAAGLAADNVICAV 255 Query: 757 YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578 YFTTLFALASK+PAE STS DV + E S S KLPVL+ ATALA SFAICK ++LT+Y Sbjct: 256 YFTTLFALASKVPAETSTSPEDVAMGEGSISDGKLPVLKIATALAVSFAICKLGAYLTKY 315 Query: 577 FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398 FGI GG LPAVTAIVVILAT FP QF +LAPSGEAMA+ILMQVFFTVVGASG+I +VI T Sbjct: 316 
FGIPGGILPAVTAIVVILATVFPAQFGHLAPSGEAMALILMQVFFTVVGASGNIWSVIRT 375 Query: 397 APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218 APSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIASNANV GWSS++ Sbjct: 376 APSIFMFALVQISIHLALILGLGKLFKFDLKLLLIASNANVGGPTTASGMATAKGWSSMI 435 Query: 217 VP 212 +P Sbjct: 436 IP 437 >ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798567 [Gossypium raimondii] gi|763766946|gb|KJB34161.1| hypothetical protein B456_006G051100 [Gossypium raimondii] Length = 464 Score = 506 bits (1302), Expect = e-140 Identities = 262/362 (72%), Positives = 301/362 (83%) Frame = -1 Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118 ++R++I SQLN P+ISP D WGTWTALFATGAFG+WSE T+ GSALSGALVSTL+GLAA Sbjct: 76 ATRTLIVKSQLNCPLISPNDQWGTWTALFATGAFGLWSENTKAGSALSGALVSTLIGLAA 135 Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938 SNLGII+SEA Y++V F L+RAD+RRVI+STG LLLAFLLGSVATT+GT Sbjct: 136 SNLGIISSEAKVYSIVKEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 195 Query: 937 VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758 +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL+ S SVLAAGLAADNVICAV Sbjct: 196 LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALETSESVLAAGLAADNVICAV 255 Query: 757 YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578 YFTTLFALASK+PAE STS DV + E S S KLPVL+ ATALA SFAICK ++LT+Y Sbjct: 256 YFTTLFALASKVPAETSTSPEDVAMGEGSKSDGKLPVLKIATALAVSFAICKLGAYLTKY 315 Query: 577 FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398 FGI GG LPAVTAIVVILAT FP QF +LAPSGEAMA+ILMQVFFTVVGASG+I +VI T Sbjct: 316 FGIPGGILPAVTAIVVILATVFPTQFGHLAPSGEAMALILMQVFFTVVGASGNIWSVIRT 375 Query: 397 APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218 APSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIASNANV GWSS++ Sbjct: 376 APSIFMFALVQISIHLALILGLGKLFKFDLKLLLIASNANVGGPTTASGMATAKGWSSMI 435 Query: 217 VP 212 +P Sbjct: 436 IP 437 >ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prunus persica] gi|462414436|gb|EMJ19173.1| 
hypothetical protein PRUPE_ppa005389mg [Prunus persica] Length = 463 Score = 505 bits (1301), Expect = e-140 Identities = 268/399 (67%), Positives = 308/399 (77%), Gaps = 1/399 (0%) Frame = -1 Query: 1324 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1148 +SP P RSV QLN P+IS D WGTWTALFATGAFGIWSEK T++G+ALSGA Sbjct: 65 LSPPAPPNLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124 Query: 1147 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 968 LVSTL+GLAASNLGII+S APA+++V+ F LYRAD+RRVI+STG LLLAFLL Sbjct: 125 LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184 Query: 967 GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 788 GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG Sbjct: 185 GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244 Query: 787 LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 608 LAADNVICAVYF+TLFALASK+P EPSTS + + S GNKLP++QTA AL+ S AI Sbjct: 245 LAADNVICAVYFSTLFALASKVPPEPSTSDDGIRKDASSEPGNKLPLIQTAAALSVSLAI 304 Query: 607 CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 428 CK+ +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF VVGA Sbjct: 305 CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFAVVGA 364 Query: 427 SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 248 SG+I +VI TAPSIF F L+QIAVHL +ILG GKL FDLKLLLIASNANV Sbjct: 365 SGNIWSVINTAPSIFFFALIQIAVHLVVILGLGKLLGFDLKLLLIASNANVGGPTTACGM 424 Query: 247 XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131 W+S++VP G AVLK+M Sbjct: 425 ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463