BLASTX nr result
ID: Atropa21_contig00003787
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00003787 (1589 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                         Score    E
Sequences producing significant alignments:                              (bits) Value

ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum]          720   0.0
ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252...       690   0.0
gb|EOY24313.1| G2484-1 protein, putative isoform 5 [Theobroma ca...       355   4e-95
ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627...       341   5e-91
gb|EOY24314.1| G2484-1 protein, putative isoform 6 [Theobroma ca...       341   6e-91
ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266...       338   5e-90
gb|EOY24309.1| G2484-1 protein, putative isoform 1 [Theobroma ca...       337   9e-90
ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citr...       337   1e-89
gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma ca...       323   1e-85
gb|EXC02129.1| hypothetical protein L484_024094 [Morus notabilis]         317   9e-84
ref|XP_006369017.1| hypothetical protein POPTR_0001s15740g [Popu...       311   5e-82
ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max]                308   3e-81
ref|XP_006573716.1| PREDICTED: uncharacterized protein LOC100792...       306   1e-80
ref|XP_004147256.1| PREDICTED: uncharacterized protein LOC101211...       304   6e-80
gb|EMJ10269.1| hypothetical protein PRUPE_ppa000035mg [Prunus pe...       304   8e-80
emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera]       301   4e-79
ref|XP_003611322.1| Agenet domain containing protein expressed [...       301   7e-79
ref|XP_004511695.1| PREDICTED: serine-rich adhesin for platelets...       299   2e-78
ref|XP_006385540.1| agenet domain-containing family protein [Pop...
294 6e-77 ref|XP_006385539.1| hypothetical protein POPTR_0003s07530g [Popu... 294 6e-77 >ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum] Length = 2181 Score = 720 bits (1859), Expect = 0.0 Identities = 392/538 (72%), Positives = 416/538 (77%), Gaps = 9/538 (1%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNTSS ASVLFHQPFTDLQQVQLRAQIFVYGSLIQGT P+EACMVSAFGT+DG Sbjct: 1000 TSSLPDLNTSS-ASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTAPEEACMVSAFGTADGC 1058 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 RSLWD AWRACVERI GQRS S NN TPSHPRSGPRTPDQANKQ VHQNKVTTSAAGRAG Sbjct: 1059 RSLWDPAWRACVERIHGQRSRSVNNETPSHPRSGPRTPDQANKQAVHQNKVTTSAAGRAG 1118 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSARAAVIDYKALPSMHP---PPSRNFVGH 1057 GKA+NSPAVS MIPLSSPLW M TPSRDGLSSAR A+IDYKALPSMHP PP+RNFVGH Sbjct: 1119 GKASNSPAVSPMIPLSSPLWNMATPSRDGLSSARGALIDYKALPSMHPYQTPPARNFVGH 1178 Query: 1056 TVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANNATLG 889 T S PQAPFPGPWVASPQ S FDISAQ PALP VKLTPVKESSLSISA A +A G Sbjct: 1179 TASWLPQAPFPGPWVASPQNSPFDISAQPPALPVTESVKLTPVKESSLSISAGAKHAPPG 1238 Query: 888 LVAHAGDSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPV 709 VAHAGDSGI SGASPHDNKKA VLPAQ SADQKSRKRKKAS EDR+QKSKLGTS E V Sbjct: 1239 SVAHAGDSGIQSGASPHDNKKAPVLPAQCSADQKSRKRKKASGTEDRIQKSKLGTSFESV 1298 Query: 708 TAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXX 529 TAP ICT LS+ ASDD GQLSS+ VAPLVA SQTGPTSVPIIGGHF Sbjct: 1299 TAPVICTQLSNKAPASDDFGQLSSIAVAPLVAHSQTGPTSVPIIGGHFSTSVVIEPPSSS 1358 Query: 528 XXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE--SKVXXXXXXXXXXXXXXXXAVSHC 355 +NSDI ITSAPSST+LSKRELDLGKK PT E SKV AVSHC Sbjct: 1359 APKNNSDIPITSAPSSTELSKRELDLGKKTPTLEYLSKVEEAKLQAEEAAANATAAVSHC 1418 Query: 354 QDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMADE 175 QDVWS+LDKHK+SDLASD+E K MADE Sbjct: 1419 QDVWSQLDKHKHSDLASDVEFKLTSAAVAVAAATSVAKAAAAAAKLASNAALQAKLMADE 1478 Query: 174 
ALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 A+ ++ +SNPS+T+ SFPNIVNNLGSATP+SVLKSQDV NGSSSII+AA+EASRRRI Sbjct: 1479 AMKSFGVSNPSKTHAASFPNIVNNLGSATPSSVLKSQDVDNGSSSIIYAAREASRRRI 1536 >ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252108 [Solanum lycopersicum] Length = 2155 Score = 690 bits (1781), Expect = 0.0 Identities = 377/538 (70%), Positives = 405/538 (75%), Gaps = 9/538 (1%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNT+S ASVLFHQPFTDLQQVQLRAQIFVYGSLIQGT+P+EACMVSAFGTSDG Sbjct: 979 TSSLPDLNTTS-ASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTSPEEACMVSAFGTSDGC 1037 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 RSLWD AWRACVERI GQRS +GNN TPSH RSGPRTPDQANKQVVHQ+KVTTS AGRAG Sbjct: 1038 RSLWDPAWRACVERIHGQRSRAGNNETPSHSRSGPRTPDQANKQVVHQDKVTTSTAGRAG 1097 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSARAAVIDYKALPSMHP---PPSRNFVGH 1057 GK++NS AVS MIPLSSPLW M TPSRD LSSAR A+IDYKALPSMHP PP+RNFVGH Sbjct: 1098 GKSSNSLAVSPMIPLSSPLWNMATPSRDVLSSARGALIDYKALPSMHPYQTPPARNFVGH 1157 Query: 1056 TVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANNATLG 889 T S P APFPGPWVASPQ S FD SAQ PALP VKLTPVKESSLS +A A +A G Sbjct: 1158 TASWLPPAPFPGPWVASPQNSPFDTSAQLPALPVTESVKLTPVKESSLS-TASAKHAPPG 1216 Query: 888 LVAHAGDSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPV 709 VAHAGDSGI SGA PHDN K VLPAQ+SADQKSRKRKKAS +DR QKSK+GTSSE + Sbjct: 1217 SVAHAGDSGIQSGAFPHDNTKTPVLPAQFSADQKSRKRKKASGTDDRTQKSKIGTSSESI 1276 Query: 708 TAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXX 529 T P ICT LS+ ASDD G LSSV VAPLVA SQTGPTSVPIIGGHF Sbjct: 1277 TTPVICTQLSNKAPASDDFGLLSSVAVAPLVAHSQTGPTSVPIIGGHFSTSVVIEPPSSS 1336 Query: 528 XXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE--SKVXXXXXXXXXXXXXXXXAVSHC 355 +NSDI I SAPSST+LSKR LDLGKK PT E SKV AVSHC Sbjct: 1337 VPKNNSDIPIASAPSSTELSKRVLDLGKKTPTLEYLSKVEEAKLQAEEAAANATAAVSHC 1396 Query: 354 QDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMADE 175 QDVWS+LDKHKNS LASD+EVK MADE 
Sbjct: 1397 QDVWSQLDKHKNSGLASDVEVKLTSAAVAVAAATSVAKAAAAAAKLASNAALQAKLMADE 1456 Query: 174 ALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 A+IA+ +SNPSQT FPNIVNN GSATPASVLKSQDVGNGSSS+++AA+EASRRRI Sbjct: 1457 AMIAFGVSNPSQTQAGFFPNIVNNFGSATPASVLKSQDVGNGSSSVLYAAREASRRRI 1514 >gb|EOY24313.1| G2484-1 protein, putative isoform 5 [Theobroma cacao] Length = 2151 Score = 355 bits (910), Expect = 4e-95 Identities = 226/539 (41%), Positives = 302/539 (56%), Gaps = 11/539 (2%) Frame = -3 Query: 1584 SNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGR 1405 S+LPDLNTS+ +S +FHQPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG DGGR Sbjct: 976 SSLPDLNTSASSSAVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGR 1035 Query: 1404 SLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGG 1225 S+W++AWRAC+ER+ GQ+SH + TP R G + DQA K Q KVT+S A R+ Sbjct: 1036 SIWENAWRACIERVHGQKSHLVSPETPLQSRIGAKPSDQAIKLNAVQGKVTSSPASRSTS 1095 Query: 1224 KATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDYK-ALPSMHPPPSRNFVGH 1057 K T + V+ MIPLSSPLW++PTPS D L + R AV+DY+ AL +HPPP RNFVG Sbjct: 1096 KGTPTTIVNPMIPLSSPLWSIPTPSGDPLQPSGIPRGAVMDYQQALSPLHPPPMRNFVGP 1155 Query: 1056 TVSRPPQAPFPGPWVASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANNATLG 889 S Q+PF GPWV PQTSAFD +A+FP LP+ LTPV+E+S+ S + + Sbjct: 1156 NASWMSQSPFRGPWV--PQTSAFDGNARFPVLPITETANLTPVREASVPSSGMKPVSPVP 1213 Query: 888 LVAHAGDSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPV 709 +V + + +G D+KK +V Q+SAD K RKRKK+++ ED Q L + E + Sbjct: 1214 MVQSGSPANVFAGTPLLDSKKTTVTAGQHSADPKPRKRKKSTASEDPGQ-IMLHSQKESL 1272 Query: 708 TAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXX 529 A A T +S P A A +V++S T Sbjct: 1273 LATA-ATGHASTPAAVS--------TPATIVSKSST------------------------ 1299 Query: 528 XXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXXXXXXAVSH 358 D ITS S+ L K + DL ++A SE SK+ AVSH Sbjct: 1300 ------DKFITSV-SADHLKKGDQDLDQRATISEETLSKLKESQKQAEDAAAFAAAAVSH 1352 Query: 357 CQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMAD 178 Q++W++L++H+NS LA D+E K MAD 
Sbjct: 1353 NQEIWNKLNRHQNSGLAPDVETKLTSAAVAIAAAAAVAKAAAAAANVASNAALQAKLMAD 1412 Query: 177 EALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 EAL++ N T+ +S + V LG+ATPAS+L+ +D S+S+I AA+EA+RRR+ Sbjct: 1413 EALVSSGYRNSIPTDAISSSDSVKKLGNATPASILRGEDATISSNSVIVAAREAARRRV 1471 >ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627454 isoform X1 [Citrus sinensis] gi|568846679|ref|XP_006477175.1| PREDICTED: uncharacterized protein LOC102627454 isoform X2 [Citrus sinensis] gi|568846681|ref|XP_006477176.1| PREDICTED: uncharacterized protein LOC102627454 isoform X3 [Citrus sinensis] Length = 2155 Score = 341 bits (875), Expect = 5e-91 Identities = 226/546 (41%), Positives = 292/546 (53%), Gaps = 18/546 (3%) Frame = -3 Query: 1584 SNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGR 1405 S LPDLNTSSP ++F QPFTDLQQVQLRAQIFVYG+LIQG PDEA M+SAFG DGGR Sbjct: 960 SALPDLNTSSP--LMFQQPFTDLQQVQLRAQIFVYGALIQGIAPDEAYMISAFGGPDGGR 1017 Query: 1404 SLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGG 1225 +W++AWR C ER+ GQ+ N TP RSG R PDQA K +KV +S GRA Sbjct: 1018 IMWETAWRGCTERLHGQKPLLNNAETPLQSRSGTRAPDQATKHGAIPSKVASSPLGRAIS 1077 Query: 1224 KATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDYK-ALPSMHP---PPSRNF 1066 K T SP ++ +IPLSSPLW++PTPS D + S+ R+AV+DY+ AL +H P RNF Sbjct: 1078 KGTPSPTLNPIIPLSSPLWSIPTPSADTVQSSGMPRSAVMDYQQALSPLHAHQTPSIRNF 1137 Query: 1065 VGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANNA 898 G S QAPF WVASPQTS FD A+FP LP V+LTP KE SL S+ + Sbjct: 1138 AGQNTSWMSQAPFRTTWVASPQTSGFDAGARFPVLPITETVQLTPAKEPSLPHSSGIKHV 1197 Query: 897 TLG-LVAHAGDSGILSGASPH-DNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGT 724 + G ++ + + G SP D KK S P+Q+S D K RKRKK Sbjct: 1198 SSGPMIQSMSPATVFPGTSPMLDPKKMSSSPSQHSTDPKPRKRKKTP------------- 1244 Query: 723 SSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXX 544 AS+DLGQ+ L +QSQT P S PI+ H Sbjct: 1245 -------------------ASEDLGQIM------LHSQSQTEPVSAPIVSSHTYTSVSFA 1279 Query: 543 
XXXXXXXXSNSDILITSAPS-STDLSKR-ELDLGKKAPTSE---SKVXXXXXXXXXXXXX 379 ++++ + +P+ S DL + + KA SE +K+ Sbjct: 1280 TPASLVSKASTEKEMPVSPAASADLIRGGNKEAQPKASLSEETLTKLKQAKTQAEDAATF 1339 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AVSH Q++W+++DK KNS L SD+E K Sbjct: 1340 AAAAVSHSQEIWNQMDKQKNSRLVSDVESKLASAAVAIAAAAAVAKAAAAAANVASSAAL 1399 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEAL + D N S N S + V ++G ATPAS+LK ++ +GSSSIIFAA+E Sbjct: 1400 QAKLMADEALDSSDYGNSSLINGTSLSDSVKDMGKATPASILKVENAMSGSSSIIFAARE 1459 Query: 18 ASRRRI 1 A+RR++ Sbjct: 1460 AARRQV 1465 >gb|EOY24314.1| G2484-1 protein, putative isoform 6 [Theobroma cacao] Length = 2138 Score = 341 bits (874), Expect = 6e-91 Identities = 221/539 (41%), Positives = 297/539 (55%), Gaps = 11/539 (2%) Frame = -3 Query: 1584 SNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGR 1405 S+LPDLNTS+ +S +FHQPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG DGGR Sbjct: 976 SSLPDLNTSASSSAVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGR 1035 Query: 1404 SLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGG 1225 S+W++AWRAC+ER+ GQ+SH + TP R + Q KVT+S A R+ Sbjct: 1036 SIWENAWRACIERVHGQKSHLVSPETPLQSR-------------IVQGKVTSSPASRSTS 1082 Query: 1224 KATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDYK-ALPSMHPPPSRNFVGH 1057 K T + V+ MIPLSSPLW++PTPS D L + R AV+DY+ AL +HPPP RNFVG Sbjct: 1083 KGTPTTIVNPMIPLSSPLWSIPTPSGDPLQPSGIPRGAVMDYQQALSPLHPPPMRNFVGP 1142 Query: 1056 TVSRPPQAPFPGPWVASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANNATLG 889 S Q+PF GPWV PQTSAFD +A+FP LP+ LTPV+E+S+ S + + Sbjct: 1143 NASWMSQSPFRGPWV--PQTSAFDGNARFPVLPITETANLTPVREASVPSSGMKPVSPVP 1200 Query: 888 LVAHAGDSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPV 709 +V + + +G D+KK +V Q+SAD K RKRKK+++ ED Q L + E + Sbjct: 1201 MVQSGSPANVFAGTPLLDSKKTTVTAGQHSADPKPRKRKKSTASEDPGQ-IMLHSQKESL 1259 Query: 708 TAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXX 529 A A T +S P A A +V++S T Sbjct: 1260 
LATA-ATGHASTPAAVS--------TPATIVSKSST------------------------ 1286 Query: 528 XXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXXXXXXAVSH 358 D ITS S+ L K + DL ++A SE SK+ AVSH Sbjct: 1287 ------DKFITSV-SADHLKKGDQDLDQRATISEETLSKLKESQKQAEDAAAFAAAAVSH 1339 Query: 357 CQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMAD 178 Q++W++L++H+NS LA D+E K MAD Sbjct: 1340 NQEIWNKLNRHQNSGLAPDVETKLTSAAVAIAAAAAVAKAAAAAANVASNAALQAKLMAD 1399 Query: 177 EALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 EAL++ N T+ +S + V LG+ATPAS+L+ +D S+S+I AA+EA+RRR+ Sbjct: 1400 EALVSSGYRNSIPTDAISSSDSVKKLGNATPASILRGEDATISSNSVIVAAREAARRRV 1458 >ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266068 [Vitis vinifera] Length = 2292 Score = 338 bits (866), Expect = 5e-90 Identities = 233/546 (42%), Positives = 290/546 (53%), Gaps = 17/546 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TSNLPDLNTS+ S +F QPFTDLQQVQLRAQIFVYGSLIQGT PDEACM SAFGT DGG Sbjct: 1110 TSNLPDLNTSASPSAIFQQPFTDLQQVQLRAQIFVYGSLIQGTAPDEACMASAFGTPDGG 1169 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQAN-KQVVHQNKVTTSAAGRA 1231 RSLW++AW A VER++GQ+SH N TP RSG RTPDQA+ +Q Q KV S GRA Sbjct: 1170 RSLWENAWHASVERLQGQKSHPSNPETPLQSRSGARTPDQASIQQGALQGKVIPSPVGRA 1229 Query: 1230 GGKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA--RAAVIDYK-ALPSMHP---PPSRN 1069 K T S V+ M+PL SPLW++ T SS R ++D+ AL +HP PP RN Sbjct: 1230 SSKGTPSTIVNPMMPLPSPLWSISTQGDVMQSSGLPRGGLMDHHPALSPLHPYQTPPVRN 1289 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANN 901 FVGH S Q FPGPWV S QTS D S +FPALPV KLTPV+ES++ S+ + Sbjct: 1290 FVGHNTSWISQPTFPGPWVPS-QTSGLDASVRFPALPVTETVKLTPVRESTVPHSSSVKH 1348 Query: 900 ATLGLVAHAGD-SGILSGASPH-DNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLG 727 + G + H+G + + +G SP D KKA+ P Q S D K RKRKK Sbjct: 1349 VSSGPMGHSGGPTSVFAGTSPLLDAKKATASPGQPSTDPKPRKRKKTP------------ 1396 Query: 726 TSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXX 547 AS+ Q+S L +QSQT P +P++ HF Sbjct: 
1397 --------------------ASEGPSQIS------LPSQSQTEP--IPVVTSHFSTSVSI 1428 Query: 546 XXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSES----KVXXXXXXXXXXXXX 379 SN+ L+ +A S T LS ++ LG + S + Sbjct: 1429 TTPASLVSKSNTGKLVAAA-SPTFLSD-QMKLGSRDAEQRSVLTEETLGKVKEAKLQAED 1486 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AVSH Q VWS LDK KNS L SD++ K Sbjct: 1487 AAAAVSHSQGVWSELDKQKNSGLISDVQAKIASAAVAIAAAASVAKAAAAAARIASNAAL 1546 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 M DEAL++ +P Q++ + V+ LG ATPAS+LK D N SSSI+ AA+E Sbjct: 1547 QAKLMVDEALVSSANIHPGQSS-----DGVSILGKATPASILKGDDGTNCSSSILVAARE 1601 Query: 18 ASRRRI 1 A+RRR+ Sbjct: 1602 AARRRV 1607 >gb|EOY24309.1| G2484-1 protein, putative isoform 1 [Theobroma cacao] gi|508777054|gb|EOY24310.1| G2484-1 protein, putative isoform 1 [Theobroma cacao] gi|508777055|gb|EOY24311.1| G2484-1 protein, putative isoform 1 [Theobroma cacao] Length = 2123 Score = 337 bits (864), Expect = 9e-90 Identities = 217/525 (41%), Positives = 290/525 (55%), Gaps = 11/525 (2%) Frame = -3 Query: 1542 LFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGRSLWDSAWRACVERI 1363 +FHQPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG DGGRS+W++AWRAC+ER+ Sbjct: 962 VFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRSIWENAWRACIERV 1021 Query: 1362 RGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGGKATNSPAVSHMIPL 1183 GQ+SH + TP R G + DQA K Q KVT+S A R+ K T + V+ MIPL Sbjct: 1022 HGQKSHLVSPETPLQSRIGAKPSDQAIKLNAVQGKVTSSPASRSTSKGTPTTIVNPMIPL 1081 Query: 1182 SSPLWTMPTPSRDGLSSA---RAAVIDYK-ALPSMHPPPSRNFVGHTVSRPPQAPFPGPW 1015 SSPLW++PTPS D L + R AV+DY+ AL +HPPP RNFVG S Q+PF GPW Sbjct: 1082 SSPLWSIPTPSGDPLQPSGIPRGAVMDYQQALSPLHPPPMRNFVGPNASWMSQSPFRGPW 1141 Query: 1014 VASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANNATLGLVAHAGDSGILSGA 847 V PQTSAFD +A+FP LP+ LTPV+E+S+ S + + +V + + +G Sbjct: 1142 V--PQTSAFDGNARFPVLPITETANLTPVREASVPSSGMKPVSPVPMVQSGSPANVFAGT 1199 Query: 846 SPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPVTAPAICTLLSSMPL 667 D+KK +V Q+SAD K RKRKK+++ ED Q L + 
E + A A T +S P Sbjct: 1200 PLLDSKKTTVTAGQHSADPKPRKRKKSTASEDPGQ-IMLHSQKESLLATA-ATGHASTPA 1257 Query: 666 ASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXXXXXSNSDILITSAP 487 A A +V++S T D ITS Sbjct: 1258 AVS--------TPATIVSKSST------------------------------DKFITSV- 1278 Query: 486 SSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXXXXXXAVSHCQDVWSRLDKHKNS 316 S+ L K + DL ++A SE SK+ AVSH Q++W++L++H+NS Sbjct: 1279 SADHLKKGDQDLDQRATISEETLSKLKESQKQAEDAAAFAAAAVSHNQEIWNKLNRHQNS 1338 Query: 315 DLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMADEALIAYDMSNPSQT 136 LA D+E K MADEAL++ N T Sbjct: 1339 GLAPDVETKLTSAAVAIAAAAAVAKAAAAAANVASNAALQAKLMADEALVSSGYRNSIPT 1398 Query: 135 NTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 + +S + V LG+ATPAS+L+ +D S+S+I AA+EA+RRR+ Sbjct: 1399 DAISSSDSVKKLGNATPASILRGEDATISSNSVIVAAREAARRRV 1443 >ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|567895620|ref|XP_006440298.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|567895622|ref|XP_006440299.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|557542559|gb|ESR53537.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|557542560|gb|ESR53538.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|557542561|gb|ESR53539.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] Length = 2155 Score = 337 bits (863), Expect = 1e-89 Identities = 224/546 (41%), Positives = 286/546 (52%), Gaps = 18/546 (3%) Frame = -3 Query: 1584 SNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGR 1405 S LPDLNTSSP ++F QPFTDLQQVQLRAQIFVYG+LIQG PDEA M+SAFG DGGR Sbjct: 960 SALPDLNTSSP--LMFQQPFTDLQQVQLRAQIFVYGALIQGIAPDEAYMISAFGGPDGGR 1017 Query: 1404 SLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGG 1225 +W++AWR C ER+ GQ+ N TP RSG R PDQA K +KV +S GRA Sbjct: 1018 IMWETAWRGCTERLHGQKPLLNNAETPLQSRSGTRAPDQATKHGAIPSKVASSPLGRAIS 1077 Query: 1224 KATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMH---PPPSRNF 1066 K T SP ++ +IPLSSPLW++PTPS D + S+ R+AV+DY 
+AL +H P RNF Sbjct: 1078 KGTPSPTLNPIIPLSSPLWSIPTPSADTVQSSGMPRSAVMDYQQALSPLHAHQTPSIRNF 1137 Query: 1065 VGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANNA 898 G S QAPF WVASPQTS FD A+FP LP V+LTP KE SL S+ + Sbjct: 1138 AGQNTSWMSQAPFRTTWVASPQTSGFDAGARFPVLPITETVQLTPAKEPSLPHSSGIKHV 1197 Query: 897 TLG-LVAHAGDSGILSGASPH-DNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGT 724 + G ++ + + G SP D KK S P+Q+S D K RKRKK + ED Sbjct: 1198 SSGPMIQSMSPATVFPGTSPMLDPKKMSSSPSQHSTDPKPRKRKKTPASEDS-------- 1249 Query: 723 SSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGH-FXXXXXX 547 GQ+ L +QSQT P S PI+ H + Sbjct: 1250 ------------------------GQIM------LHSQSQTEPVSAPIVSSHTYTSVSFA 1279 Query: 546 XXXXXXXXXSNSDILITSAPSSTDLSK-RELDLGKKAPTSE---SKVXXXXXXXXXXXXX 379 + S +S DL + + KA SE +K+ Sbjct: 1280 TPASLVSKAFTEKEMPVSPVASADLIRGGNKEAQPKASLSEETLTKLKQAKTQAEDAATF 1339 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AVSH Q++W+++DK KNS L SD+E K Sbjct: 1340 AAAAVSHSQEIWNQMDKQKNSRLVSDVESKLASAAVAIAAAAAVAKAAAAAANVASSAAL 1399 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEAL + D N S N S + V ++G ATPAS+LK ++ +GSSSIIFAA+E Sbjct: 1400 QAKLMADEALDSSDYGNSSLINGTSLSDSVKDMGKATPASILKGENAMSGSSSIIFAARE 1459 Query: 18 ASRRRI 1 A+RR++ Sbjct: 1460 AARRQV 1465 >gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma cacao] Length = 2110 Score = 323 bits (828), Expect = 1e-85 Identities = 212/525 (40%), Positives = 285/525 (54%), Gaps = 11/525 (2%) Frame = -3 Query: 1542 LFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGGRSLWDSAWRACVERI 1363 +FHQPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG DGGRS+W++AWRAC+ER+ Sbjct: 962 VFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRSIWENAWRACIERV 1021 Query: 1362 RGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAGGKATNSPAVSHMIPL 1183 GQ+SH + TP R + Q KVT+S A R+ K T + V+ MIPL Sbjct: 1022 HGQKSHLVSPETPLQSR-------------IVQGKVTSSPASRSTSKGTPTTIVNPMIPL 1068 Query: 1182 SSPLWTMPTPSRDGLSSA---RAAVIDYK-ALPSMHPPPSRNFVGHTVSRPPQAPFPGPW 1015 SSPLW++PTPS D L + 
R AV+DY+ AL +HPPP RNFVG S Q+PF GPW Sbjct: 1069 SSPLWSIPTPSGDPLQPSGIPRGAVMDYQQALSPLHPPPMRNFVGPNASWMSQSPFRGPW 1128 Query: 1014 VASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANNATLGLVAHAGDSGILSGA 847 V PQTSAFD +A+FP LP+ LTPV+E+S+ S + + +V + + +G Sbjct: 1129 V--PQTSAFDGNARFPVLPITETANLTPVREASVPSSGMKPVSPVPMVQSGSPANVFAGT 1186 Query: 846 SPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLGTSSEPVTAPAICTLLSSMPL 667 D+KK +V Q+SAD K RKRKK+++ ED Q L + E + A A T +S P Sbjct: 1187 PLLDSKKTTVTAGQHSADPKPRKRKKSTASEDPGQ-IMLHSQKESLLATA-ATGHASTPA 1244 Query: 666 ASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXXXXXXXXXXXSNSDILITSAP 487 A A +V++S T D ITS Sbjct: 1245 AVS--------TPATIVSKSST------------------------------DKFITSV- 1265 Query: 486 SSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXXXXXXAVSHCQDVWSRLDKHKNS 316 S+ L K + DL ++A SE SK+ AVSH Q++W++L++H+NS Sbjct: 1266 SADHLKKGDQDLDQRATISEETLSKLKESQKQAEDAAAFAAAAVSHNQEIWNKLNRHQNS 1325 Query: 315 DLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQMADEALIAYDMSNPSQT 136 LA D+E K MADEAL++ N T Sbjct: 1326 GLAPDVETKLTSAAVAIAAAAAVAKAAAAAANVASNAALQAKLMADEALVSSGYRNSIPT 1385 Query: 135 NTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEASRRRI 1 + +S + V LG+ATPAS+L+ +D S+S+I AA+EA+RRR+ Sbjct: 1386 DAISSSDSVKKLGNATPASILRGEDATISSNSVIVAAREAARRRV 1430 >gb|EXC02129.1| hypothetical protein L484_024094 [Morus notabilis] Length = 2214 Score = 317 bits (812), Expect = 9e-84 Identities = 212/551 (38%), Positives = 290/551 (52%), Gaps = 22/551 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLN S+ S +F QPFTD QQVQLRAQIFVYGSLIQGT P+EA M+SAF SDGG Sbjct: 1023 TSSLPDLNASASPSTVFQQPFTDFQQVQLRAQIFVYGSLIQGTAPEEAYMLSAFAGSDGG 1082 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPR---SGPRTPDQANKQVV--HQNKVTTSA 1243 RS+W +AW+ACVER++ Q+S+ N TP H R + DQ +KQ Q+K ++ Sbjct: 1083 RSMWGNAWQACVERLQSQKSNPINPETPLHSRQTSTATTKLDQVSKQSAPQTQSKGLSTP 1142 Query: 1242 AGRAGGKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMHP--- 1084 R+ K++ + VS MIPLSSPLW++PTP DG+ S R +V+DY +A+ MHP Sbjct: 1143 
VSRSSTKSSQT-IVSPMIPLSSPLWSLPTPVGDGMQSGVMPRGSVMDYQQAVTPMHPFQT 1201 Query: 1083 PPSRNFVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPAL----PVKLTPVKESSLSIS 916 PP RN +GH S Q PF GPWV SPQ S + S +F A PV+LTPVK++++ S Sbjct: 1202 PPIRNLLGHNTSWMSQVPFRGPWVPSPQPSVPEASIRFTAFPNTEPVQLTPVKDTTVPHS 1261 Query: 915 ACANNATLGLVAHAGD-SGILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIEDRVQ 742 + + + + G + + + A+P D KK + P Q+SAD K RKRKK + E Q Sbjct: 1262 SGTKHVSSSPMVQTGALASVFTTAAPVVDLKKVTSSPGQHSADTKPRKRKKNQASEQTSQ 1321 Query: 741 KSKLGTSS-EPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHF 565 S E + AP + + L++ S + +P SQ P + + Sbjct: 1322 VILQSQSKPEALFAPVVFSNLTT-----------SVAITSPASFVSQAMPEKLVVSA--- 1367 Query: 564 XXXXXXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXX 394 T PSS L K + D+ +KA SE SK+ Sbjct: 1368 ----------------------TPTPSSDSLRKADHDVVQKAILSEETHSKIKEASKQAE 1405 Query: 393 XXXXXXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXX 214 AV + Q++W +L+K K S L SD+E K Sbjct: 1406 DAAAPAAAAVGYSQEIWGQLEKRKTSGLVSDVEAKLASAAVAVAAAAAVAKAAAAVANVA 1465 Query: 213 XXXXXXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSII 34 MADEA +++ NPSQ+ +SF VN G ATPAS+L+ +D N SSSII Sbjct: 1466 SNAALQAKLMADEAFVSHSFENPSQSTRISFSERVNEFGKATPASILRGEDGANSSSSII 1525 Query: 33 FAAKEASRRRI 1 AA+EA+RR++ Sbjct: 1526 TAAREAARRKV 1536 >ref|XP_006369017.1| hypothetical protein POPTR_0001s15740g [Populus trichocarpa] gi|550347376|gb|ERP65586.1| hypothetical protein POPTR_0001s15740g [Populus trichocarpa] Length = 2057 Score = 311 bits (797), Expect = 5e-82 Identities = 216/547 (39%), Positives = 293/547 (53%), Gaps = 18/547 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 +S+LPDLN+S+ SV+F QPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG SDGG Sbjct: 944 SSSLPDLNSSASPSVMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGSDGG 1003 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 +++W++A R+ +ER+ GQ+ + + TP R G R PDQA KQ Q+KV +S GR+ Sbjct: 1004 
KTIWENALRSSIERLHGQKPNLTSPETPLQSRPGVRAPDQAIKQSTVQSKVISSPIGRS- 1062 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMHP---PPSRN 1069 K T + V+ M+PLSSPLW++PTP+ D S+ R ++D+ +AL MHP P RN Sbjct: 1063 SKGTPT-IVNPMVPLSSPLWSVPTPAGDTFQSSSMPRGPIMDHQRALSPMHPHQTPQIRN 1121 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPAL-----PVKLTPVKESSLSISACAN 904 F G+ QAPF GPW SPQT A D S F A PV+LTPVK+ S+ I + A Sbjct: 1122 FAGNPWL--SQAPFCGPWATSPQTPALDTSGHFSAQLPITEPVQLTPVKDLSMPIISGAK 1179 Query: 903 NATLGLVAHAGDS-GILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIEDRVQK-SK 733 + + G VA +G S + +G P D KKA+V +Q AD K RKRKK S E Q Sbjct: 1180 HVSPGPVAQSGASTSVFTGTFPVPDAKKAAVSSSQPPADPKPRKRKKNSVSESPGQNILP 1239 Query: 732 LGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXX 553 +E V+AP + + LS+ S + P++ S+ PT Sbjct: 1240 PHLRTESVSAPVVTSHLST-----------SVAITTPVIFVSK-APT------------- 1274 Query: 552 XXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXX 382 + + +P+ TD+ + ++ SE KV Sbjct: 1275 -------------EKFVTSVSPTPTDIRNGNQNAEQRNILSEETLDKVKAARVQAEDAAT 1321 Query: 381 XXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 202 AVSH ++W++LDK +NS L+ DIE K Sbjct: 1322 LAAAAVSHSLEMWNQLDKQRNSGLSPDIETKLASAAVAIAAAAAVAKAAAAAAKVASSAA 1381 Query: 201 XXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAK 22 +ADEA+ + SNPSQ NT+S + NLG ATPAS+LK D N SSSI+ A+ Sbjct: 1382 LQAKLLADEAVNSGGYSNPSQDNTISVSEGMKNLGKATPASILKGDDGTNSSSSILIVAR 1441 Query: 21 EASRRRI 1 EA+RRR+ Sbjct: 1442 EAARRRV 1448 >ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max] Length = 2135 Score = 308 bits (790), Expect = 3e-81 Identities = 217/546 (39%), Positives = 275/546 (50%), Gaps = 17/546 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNTS+ +LFHQPFTD QQVQLRAQIFVYG+LIQGT PDEA M+SAFG SDGG Sbjct: 1035 TSSLPDLNTSASPPILFHQPFTDQQQVQLRAQIFVYGALIQGTVPDEAYMISAFGGSDGG 1094 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 RSLW++AWR C+ER GQ+SH N TP RS RT D 
+KQ Q K +S GR Sbjct: 1095 RSLWENAWRTCMERQHGQKSHPANPETPLQSRSVARTSDLPHKQSAAQGKGISSPLGRTS 1154 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPS--RDGLSS---ARAAVIDY-KALPSMHP---PPS 1075 KAT P V+ +IPLSSPLW++ T D L S AR +V+DY +A+ +HP P Sbjct: 1155 SKAT-PPIVNPLIPLSSPLWSLSTLGLGSDSLQSSAIARGSVVDYPQAITPLHPYQTTPV 1213 Query: 1074 RNFVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACA 907 RNF+GH Q P GPW+ASP T D S Q A P +KL VK SL S+ Sbjct: 1214 RNFLGHNTPWMSQTPLRGPWIASP-TPVTDNSPQISASPASDTIKLGSVK-GSLPPSSGI 1271 Query: 906 NNATLGL-VAHAGDSGILSG-ASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSK 733 N T G+ + G I +G AS D +V PAQ+++D K +KRKK Sbjct: 1272 KNVTSGVSTSSTGLQSIFTGTASLLDANNVTVSPAQHNSDPKPKKRKKV----------- 1320 Query: 732 LGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXX 553 + S+DLGQ + +AP V + P +V G+ Sbjct: 1321 ---------------------VVSEDLGQRALQSLAPGVGSHTSTPVAVVAPVGNVPITT 1359 Query: 552 XXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSES--KVXXXXXXXXXXXXX 379 + S D SK + ++ K+ + ES KV Sbjct: 1360 IEKS-------------VLSVSPLADQSKNDRNVEKRIMSDESLMKVKEARVHAEEASAL 1406 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AV+H ++W++LDKHKNS L DIE K Sbjct: 1407 SAAAVNHSLELWNQLDKHKNSGLMPDIEAKLASAAVAVAAAATIAKAAAAAANVASNAAL 1466 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEAL++ N SQ+N +S NNLG ATPAS+LK + N SII AAKE Sbjct: 1467 QAKLMADEALLSSGYDNSSQSNQISLSEGTNNLGKATPASILKGANGINSPGSIIVAAKE 1526 Query: 18 ASRRRI 1 A +RR+ Sbjct: 1527 AVKRRV 1532 >ref|XP_006573716.1| PREDICTED: uncharacterized protein LOC100792961 isoform X1 [Glycine max] gi|571436299|ref|XP_006573717.1| PREDICTED: uncharacterized protein LOC100792961 isoform X2 [Glycine max] gi|571436301|ref|XP_006573718.1| PREDICTED: uncharacterized protein LOC100792961 isoform X3 [Glycine max] gi|571436303|ref|XP_006573719.1| PREDICTED: uncharacterized protein LOC100792961 isoform X4 [Glycine max] gi|571436305|ref|XP_006573720.1| PREDICTED: uncharacterized protein LOC100792961 isoform X5 [Glycine max] 
gi|571436307|ref|XP_006573721.1| PREDICTED: uncharacterized protein LOC100792961 isoform X6 [Glycine max] Length = 2142 Score = 306 bits (785), Expect = 1e-80 Identities = 217/546 (39%), Positives = 273/546 (50%), Gaps = 17/546 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 T ++PDLNTS+ VLFHQPFTD QQVQLRAQIFVYG+LIQG PDEA M+SAFG SDGG Sbjct: 1043 TYSIPDLNTSASPPVLFHQPFTDQQQVQLRAQIFVYGALIQGMVPDEAYMISAFGGSDGG 1102 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 RSLWD+AWRAC+ER GQ+SH N TP RS RT D +KQ Q K +S GR Sbjct: 1103 RSLWDNAWRACMERQHGQKSHPANPETPLQSRSVARTSDLPHKQSAAQAKGISSPLGRTS 1162 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPS--RDGLSS---ARAAVIDY-KALPSMHP---PPS 1075 KAT P V+ +IPLSSPLW++ T D L S AR +V+DY +A+ +HP P Sbjct: 1163 SKAT-PPIVNPLIPLSSPLWSLSTLGLGSDSLQSSAIARGSVMDYPQAITPLHPYQTTPV 1221 Query: 1074 RNFVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACA 907 RNF+GH Q P GPW+ SP T A D S A P +KL VK SL S+ Sbjct: 1222 RNFLGHNTPWMSQTPLRGPWIGSP-TPAPDNSTHISASPASDTIKLGSVK-GSLPPSSVI 1279 Query: 906 NNATLGL-VAHAGDSGILSG-ASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSK 733 N T L + G I +G AS D +V PAQ+S+D K RKRKK Sbjct: 1280 KNITSSLPTSSTGLQSIFAGTASLLDANNVTVSPAQHSSDPKPRKRKKV----------- 1328 Query: 732 LGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXX 553 + S+DLGQ + +AP V + P +V + G+ Sbjct: 1329 ---------------------VVSEDLGQRAFQSLAPAVGSHTSTPVAVVVPVGNVPITT 1367 Query: 552 XXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSES--KVXXXXXXXXXXXXX 379 + S D SK + ++ K+ + ES KV Sbjct: 1368 IEKS-------------VVSVSPLADQSKNDQNVEKRIMSDESLLKVKEARVHAEEASAL 1414 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AV+H ++W++LDKHKNS L DIE K Sbjct: 1415 SAAAVNHSLELWNQLDKHKNSGLMPDIEAKLASAAVAVAAAAAIAKAAAAAANVASNAAL 1474 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEAL++ +N SQ+N + NNLG ATPAS+LK + N SII AAKE Sbjct: 1475 QAKLMADEALLSSGYNNSSQSNQICLSEGTNNLGKATPASILKGANGTNSPGSIIVAAKE 1534 
Query: 18 ASRRRI 1 A +RR+ Sbjct: 1535 AVKRRV 1540 >ref|XP_004147256.1| PREDICTED: uncharacterized protein LOC101211275 [Cucumis sativus] gi|449505004|ref|XP_004162351.1| PREDICTED: uncharacterized LOC101211275 [Cucumis sativus] Length = 2150 Score = 304 bits (779), Expect = 6e-80 Identities = 215/545 (39%), Positives = 284/545 (52%), Gaps = 16/545 (2%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLN S+ S +F QPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG DGG Sbjct: 983 TSSLPDLNNSASPSPMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMLSAFGGPDGG 1042 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 +LW++AWR CV+R G++S + N TPS +SG R+ +QA+KQ Q+K+ + R Sbjct: 1043 TNLWENAWRMCVDRFNGKKSQTINPETPSQSQSGGRSTEQASKQSTLQSKIISPPVSRVS 1102 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSS--ARAAVIDY-KALPSMHP---PPSRNF 1066 K+T S ++ MIPLSSPLW++ TPS SS R+ VIDY +AL +HP PP RNF Sbjct: 1103 SKST-STVLNPMIPLSSPLWSISTPSNALQSSIVPRSPVIDYQQALTPLHPYQTPPVRNF 1161 Query: 1065 VGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPAL----PVKLTPVKESSLSISACANNA 898 +GH +S QAPF WVA+ QTS D SA+F L PV LTPVKESS+ S+ + Sbjct: 1162 IGHNLSWFSQAPFHSTWVAT-QTSTPDSSARFSGLPITEPVHLTPVKESSVPQSSAMKPS 1220 Query: 897 TLGLVAHAGDSG-ILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIED-RVQKSKLG 727 G + H+G+ G + +GASP H+ K+ SV Q + K R+RKK S ED + ++ Sbjct: 1221 --GSLVHSGNPGNVFTGASPLHELKQVSVTTGQNPTESKMRRRKKNSVSEDPGLITMQVQ 1278 Query: 726 TSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXX 547 +PV A T+ + + S L S V+ ++ PT+ P G Sbjct: 1279 PHLKPVPAVVTTTISTLVTSPSVHLKATSENVI---LSPPPLCPTAHPKAAGQ------- 1328 Query: 546 XXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXXXXX 376 DL K SE KV Sbjct: 1329 ------------------------------DLRGKPMFSEETLGKVREAKQLAEDAALFA 1358 Query: 375 XXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 196 AV H +VWS+L + KNS+L SD+E K Sbjct: 1359 SEAVKHSAEVWSQLGRQKNSELVSDVEAKLASAAVAIAAAAAVAKAAAAAANVASNAACQ 1418 Query: 195 
XXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKEA 16 MADEA + Q+N S +G ATPAS+L+ +D GNGSSSII AA+EA Sbjct: 1419 AKLMADEAFSSSSPELSCQSNEFSVHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREA 1478 Query: 15 SRRRI 1 +R+R+ Sbjct: 1479 ARKRV 1483 >gb|EMJ10269.1| hypothetical protein PRUPE_ppa000035mg [Prunus persica] Length = 2263 Score = 304 bits (778), Expect = 8e-80 Identities = 218/548 (39%), Positives = 277/548 (50%), Gaps = 19/548 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNTS+P SV+F QPFTDLQQVQLRAQIFVYG+LIQG P+EA MVSAFG DGG Sbjct: 1113 TSSLPDLNTSAPQSVIFQQPFTDLQQVQLRAQIFVYGALIQGIAPEEAYMVSAFGGPDGG 1172 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 R +W++AWR C+ER+ GQ+S N TP RSG R DQ KQ NK +S GRA Sbjct: 1173 RGMWENAWRVCIERLHGQKSTPINPETPLQSRSGSRASDQVIKQGALHNKGLSSPVGRAS 1232 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMHP---PPSRN 1069 K T A S MIP+SSPLW++ TP +GL + R +V+DY + +HP P +N Sbjct: 1233 TKGTPQTA-SPMIPISSPLWSISTPVCEGLQYSVIPRGSVMDYQQGFNPLHPFQTPSVKN 1291 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANN 901 VGH + PQ+ F GPW+ SPQ+SA + S F A P V+LTP+KE SL + Sbjct: 1292 LVGHNTTWMPQSSFRGPWLPSPQSSA-EASMHFSAFPSTEAVQLTPIKEVSLPQLPTVKH 1350 Query: 900 ATLGLVAHAGDS-GILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLG 727 G A G +G SP D KK S P Q+SAD K RKRKK S E+ Q S L Sbjct: 1351 VPSGPSAQTGGPISAFAGPSPLLDPKKVSASPGQHSADPKPRKRKKISPSEELGQIS-LQ 1409 Query: 726 TSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXX 547 S+P +A + + S+ P LSS + Sbjct: 1410 AQSQPESALTVAVVSSTTP------STLSSKAM--------------------------- 1436 Query: 546 XXXXXXXXXSNSDILITSAP---SSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXX 385 D LI S P SS L K +LDL ++A SE +KV Sbjct: 1437 -----------PDKLIMSVPPMSSSDQLKKADLDLEQRATLSEETLAKVKEARQQAEEAS 1485 Query: 384 XXXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXX 205 AVSH Q +W++L+K KNS L SD E K Sbjct: 1486 
SLAAAAVSHSQAIWNQLEKQKNSKLISDGEAKLASAAVAVAAAAAVAKAAAAAANVASNA 1545 Query: 204 XXXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAA 25 MA+EAL Y+ +PS + ATP S+L+ +D N SSSI+ AA Sbjct: 1546 ALQAKLMAEEALDNYENPSPS-------------MRMATPVSILRGEDGTNSSSSILVAA 1592 Query: 24 KEASRRRI 1 +EA+RR++ Sbjct: 1593 REAARRKV 1600 >emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera] Length = 2321 Score = 301 bits (772), Expect = 4e-79 Identities = 221/550 (40%), Positives = 280/550 (50%), Gaps = 21/550 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TSNLPDLNTS+ S +F QPFTDLQQVQLRAQIFVYGSL+ ++ SDGG Sbjct: 1110 TSNLPDLNTSASPSAIFQQPFTDLQQVQLRAQIFVYGSLMP-----HMLLILDLLCSDGG 1164 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQAN-KQVVHQNKVTTSAAGRA 1231 RSLW++AW A VER++GQ+SH N TP RSG RTPDQA+ +Q Q KV S GRA Sbjct: 1165 RSLWENAWHASVERLQGQKSHPSNPETPLQSRSGARTPDQASIQQGALQGKVIPSPVGRA 1224 Query: 1230 GGKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA--RAAVIDYK-ALPSMHP---PPSRN 1069 K T S V+ M+PL SPLW++ T SS R ++D+ AL +HP PP RN Sbjct: 1225 SSKGTPSTIVNPMMPLPSPLWSISTQGDVMQSSGLPRGGLMDHHPALSPLHPYQTPPVRN 1284 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALPV----KLTPVKESSLSISACANN 901 FVGH S Q FPGPWV S QTS D S +FPALPV KLTPV+ES++ S+ + Sbjct: 1285 FVGHNTSWISQPTFPGPWVPS-QTSGLDASVRFPALPVTETVKLTPVRESTVPHSSSVKH 1343 Query: 900 ATLGLVAHAGD-SGILSGASPH-DNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLG 727 + G + H+G + + +G SP D KKA+ P Q S D K RKRKK Sbjct: 1344 VSSGPMGHSGGPTSVFAGTSPLLDAKKATASPGQPSTDPKPRKRKKTP------------ 1391 Query: 726 TSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXXXXX 547 AS+ Q+S L +QSQT P +P++ HF Sbjct: 1392 --------------------ASEGPSQIS------LPSQSQTEP--IPVVTSHFSTSVSI 1423 Query: 546 XXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSES--------KVXXXXXXXXX 391 SN+ L+ +A S T LS ++ LG + S KV Sbjct: 1424 TTPASLVSKSNTGKLVAAA-SPTFLSD-QMKLGSRDAEQRSXLTEETLGKVKEAKLQAED 1481 Query: 390 XXXXXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXX 211 AVSH Q VWS LDK KNS L 
SD++ K Sbjct: 1482 AAALAAAAVSHSQGVWSELDKQKNSGLISDVQAKIASAAVAIAAAASVAKAAAAAARIAS 1541 Query: 210 XXXXXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIF 31 M DEAL++ +P Q++ + V+ LG ATPAS+LK D N SSSI+ Sbjct: 1542 NAALQAKLMVDEALVSSANIHPGQSS-----DGVSILGKATPASILKGDDGTNCSSSILV 1596 Query: 30 AAKEASRRRI 1 AA+EA+RRR+ Sbjct: 1597 AAREAARRRV 1606 >ref|XP_003611322.1| Agenet domain containing protein expressed [Medicago truncatula] gi|355512657|gb|AES94280.1| Agenet domain containing protein expressed [Medicago truncatula] Length = 2242 Score = 301 bits (770), Expect = 7e-79 Identities = 216/546 (39%), Positives = 275/546 (50%), Gaps = 17/546 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNTS+ + VLFHQPF+DLQQVQLRAQI VYG+LIQGTTPDEA M+SA+G +DGG Sbjct: 1118 TSSLPDLNTSASSPVLFHQPFSDLQQVQLRAQILVYGALIQGTTPDEAHMISAYGGTDGG 1177 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 R+LW++ WR C+ER R Q+SH TP RS RT D KQ V Q K +S GRA Sbjct: 1178 RNLWENVWRVCMERQRSQKSHPNTPETPLQSRSAARTSDSTVKQSVLQGKGISSPLGRAS 1237 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSS---ARAAVIDY-KALPSMHP---PPSRN 1069 KAT + A + +IPLSSPLW++PT S D L S AR +V+DY +AL +HP P RN Sbjct: 1238 SKATPTIA-NPLIPLSSPLWSLPTLSADSLQSSALARGSVVDYSQALTPLHPYQSPSPRN 1296 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANN 901 F+GH+ S QAP GPW+ SP T A D + A P +KL VK SL S+ + Sbjct: 1297 FLGHSTSWISQAPLRGPWIGSP-TPAPDNNTHLSASPSSDTIKLASVK-GSLPPSSSIKD 1354 Query: 900 ATLGLVAHAG--DSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKSKLG 727 T G A + S + S D +V PAQ S+ K++KRKK ED QK Sbjct: 1355 VTPGPPASSSGLQSTFVGTDSQLDANNVTVPPAQQSSGPKAKKRKKDVLSEDHGQKLLQS 1414 Query: 726 ----TSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXX 559 +S T+ + T + ++P++S + S V V+PL Q + T Sbjct: 1415 LTPAVASRASTSVSAATPVGNVPMSSVEK---SVVSVSPLADQPKNDQT----------- 1460 Query: 558 XXXXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSESKVXXXXXXXXXXXXX 379 + KR L + S KV Sbjct: 1461 
----------------------------VEKRIL-----SDESLMKVKEARVHAEEASAL 1487 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AV+H ++W++LDKHKNS SDIE K Sbjct: 1488 SAAAVNHSLELWNQLDKHKNSGFMSDIEAKLASAAVAIAAAAAVAKAAAAAANVASNAAF 1547 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEALI+ N SQ N P +NLG ATPAS+LK + N S I AAKE Sbjct: 1548 QAKLMADEALISSGYENTSQGNNTFLPEGTSNLGQATPASILKGANGPNSPGSFIVAAKE 1607 Query: 18 ASRRRI 1 A RRR+ Sbjct: 1608 AIRRRV 1613 >ref|XP_004511695.1| PREDICTED: serine-rich adhesin for platelets-like isoform X4 [Cicer arietinum] Length = 2151 Score = 299 bits (766), Expect = 2e-78 Identities = 218/546 (39%), Positives = 274/546 (50%), Gaps = 17/546 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 TS+LPDLNTS+ + VLFHQPFTDLQQVQLRAQIFVYG+LIQGTTPDEA M+SAFG +DGG Sbjct: 1015 TSSLPDLNTSTSSPVLFHQPFTDLQQVQLRAQIFVYGALIQGTTPDEAHMISAFGGTDGG 1074 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 RS+W++ WR C+ER Q+SH N TP RS RT D KQ Q K +S GR Sbjct: 1075 RSIWENVWRVCIERQHSQKSHPINPETPLQSRSAARTSDSTVKQSALQGKGISSPLGRGC 1134 Query: 1227 GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSS---ARAAVIDYK----ALPSMHPPPSRN 1069 KAT + + +IPLSSPLW++PT S D L S AR +V+DY L PP RN Sbjct: 1135 SKATPT-ITTPLIPLSSPLWSLPTLSCDSLQSSALARGSVVDYSQAHTPLHHYQSPPPRN 1193 Query: 1068 FVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPALP----VKLTPVKESSLSISACANN 901 F+GH S QAP GPW+ S T A D S A P VKL VK SSL S+ N Sbjct: 1194 FLGHNTSWISQAPLRGPWIGS-ATPAPDNSTHLSASPASDTVKLGSVKGSSLPPSSSIKN 1252 Query: 900 ATLGLVAHA-GDSGILSGASPHDNKKASVLPAQYSADQKSRKRKKASSIEDRVQKS---- 736 T G A + G IL G S D +V PAQ+S+D K +KRKKA ED QK Sbjct: 1253 VTPGPPASSTGLQSILVGTSQLDANIVTVPPAQHSSDPKPKKRKKAVPYEDLGQKPLQSL 1312 Query: 735 KLGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXX 556 +S T+ A+ T + ++P+++ + S V V+PL Q + Sbjct: 1313 TPAVASRASTSVAVVTPVHNVPIST---VEKSVVSVSPLADQPK---------------- 1353 Query: 555 
XXXXXXXXXXXXSNSDILITSAPSSTDLSK-RELDLGKKAPTSESKVXXXXXXXXXXXXX 379 N + S L K +E L + ++ S Sbjct: 1354 -------------NDQSVENRILSDESLMKVKEARLHAEEASAHSAA------------- 1387 Query: 378 XXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 199 AV+H ++WS+LDKHK+S L DIE K Sbjct: 1388 ---AVNHSLELWSQLDKHKSSGLMPDIEAKLANAAVAVAAAAAVAKAAAAAANVASNAAF 1444 Query: 198 XXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAAKE 19 MADEALI+ N SQ+ + +G ATPAS+LK + N SII AKE Sbjct: 1445 QAKLMADEALISSGCENSSQSKNF-LTEGTSKVGQATPASILKGTNGTNSPGSIIVVAKE 1503 Query: 18 ASRRRI 1 A RRR+ Sbjct: 1504 AIRRRV 1509 >ref|XP_006385540.1| agenet domain-containing family protein [Populus trichocarpa] gi|566161399|ref|XP_002304281.2| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342637|gb|ERP63337.1| agenet domain-containing family protein [Populus trichocarpa] gi|550342638|gb|EEE79260.2| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] Length = 2107 Score = 294 bits (753), Expect = 6e-77 Identities = 211/548 (38%), Positives = 285/548 (52%), Gaps = 19/548 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 +SNLPDLN+S S++F QPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG SDGG Sbjct: 947 SSNLPDLNSSVSPSLMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGSDGG 1006 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 +S+W++A R+ +ER+ GQ+ H TP R G R PDQA KQ Q+KV +S GR Sbjct: 1007 KSIWENALRSSIERLHGQKPHLTTLETPLLSRPGARAPDQAIKQSNVQSKVISSPIGRTS 1066 Query: 1227 -GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMH---PPPSR 1072 G T V+ M+PLSSPLW++P PS D S+ R +D+ +AL +H P R Sbjct: 1067 MGTPT---IVNPMVPLSSPLWSVPNPSSDTFQSSSMPRGPFMDHQRALSPLHLHQTPQIR 1123 Query: 1071 NFVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPAL-----PVKLTPVKESSLSISACA 907 NF G+ Q+PF GPWV SPQT A D S +F A PV+LTPVK+ S I++ A Sbjct: 1124 NFAGNPWI--SQSPFCGPWVTSPQTLALDTSGRFSAQLPITEPVQLTPVKDLSKPITSGA 1181 Query: 906 NNATLGLVAHAGDS-GILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIEDRVQK-S 736 + + G V +G S + 
+G P D KK + +Q D K RKRKKAS E Q Sbjct: 1182 KHVSPGPVVQSGTSASVFTGNFPVPDAKKVTASSSQPLTDPKPRKRKKASVSESPSQNIL 1241 Query: 735 KLGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXX 556 + +E V P ++S P S + P+V S++ PT Sbjct: 1242 HIHPRTESVPGP-----VTSYP-------STSIAMTTPIVFVSKS-PT------------ 1276 Query: 555 XXXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXX 385 + + +P+ TD+ K++ + ++ SE KV Sbjct: 1277 --------------EKFVTSVSPTPTDIRKQDQNAEQRNILSEETLDKVKAARVQAEDAA 1322 Query: 384 XXXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXX 205 AVS Q++W++LDK +NS L+ D+E K Sbjct: 1323 NLAAAAVSQRQEIWNQLDKQRNSGLSPDVETKLASAAVAIAAAAAVAKAAAAAANVASNA 1382 Query: 204 XXXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAA 25 MADEA+++ SNPSQ N +S + +LG TP VLK D N SSSI+ AA Sbjct: 1383 ALQAKLMADEAVVSGGYSNPSQDNAISVSEGMESLGRTTPDFVLKGDDGTNSSSSILVAA 1442 Query: 24 KEASRRRI 1 +EA+RRR+ Sbjct: 1443 REAARRRV 1450 >ref|XP_006385539.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342636|gb|ERP63336.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] Length = 2105 Score = 294 bits (753), Expect = 6e-77 Identities = 211/548 (38%), Positives = 285/548 (52%), Gaps = 19/548 (3%) Frame = -3 Query: 1587 TSNLPDLNTSSPASVLFHQPFTDLQQVQLRAQIFVYGSLIQGTTPDEACMVSAFGTSDGG 1408 +SNLPDLN+S S++F QPFTDLQQVQLRAQIFVYG+LIQGT PDEA M+SAFG SDGG Sbjct: 926 SSNLPDLNSSVSPSLMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGSDGG 985 Query: 1407 RSLWDSAWRACVERIRGQRSHSGNNGTPSHPRSGPRTPDQANKQVVHQNKVTTSAAGRAG 1228 +S+W++A R+ +ER+ GQ+ H TP R G R PDQA KQ Q+KV +S GR Sbjct: 986 KSIWENALRSSIERLHGQKPHLTTLETPLLSRPGARAPDQAIKQSNVQSKVISSPIGRTS 1045 Query: 1227 -GKATNSPAVSHMIPLSSPLWTMPTPSRDGLSSA---RAAVIDY-KALPSMH---PPPSR 1072 G T V+ M+PLSSPLW++P PS D S+ R +D+ +AL +H P R Sbjct: 1046 MGTPT---IVNPMVPLSSPLWSVPNPSSDTFQSSSMPRGPFMDHQRALSPLHLHQTPQIR 1102 Query: 1071 NFVGHTVSRPPQAPFPGPWVASPQTSAFDISAQFPAL-----PVKLTPVKESSLSISACA 907 NF G+ Q+PF GPWV SPQT A D S +F A PV+LTPVK+ S I++ A Sbjct: 1103 
NFAGNPWI--SQSPFCGPWVTSPQTLALDTSGRFSAQLPITEPVQLTPVKDLSKPITSGA 1160 Query: 906 NNATLGLVAHAGDS-GILSGASP-HDNKKASVLPAQYSADQKSRKRKKASSIEDRVQK-S 736 + + G V +G S + +G P D KK + +Q D K RKRKKAS E Q Sbjct: 1161 KHVSPGPVVQSGTSASVFTGNFPVPDAKKVTASSSQPLTDPKPRKRKKASVSESPSQNIL 1220 Query: 735 KLGTSSEPVTAPAICTLLSSMPLASDDLGQLSSVVVAPLVAQSQTGPTSVPIIGGHFXXX 556 + +E V P ++S P S + P+V S++ PT Sbjct: 1221 HIHPRTESVPGP-----VTSYP-------STSIAMTTPIVFVSKS-PT------------ 1255 Query: 555 XXXXXXXXXXXXSNSDILITSAPSSTDLSKRELDLGKKAPTSE---SKVXXXXXXXXXXX 385 + + +P+ TD+ K++ + ++ SE KV Sbjct: 1256 --------------EKFVTSVSPTPTDIRKQDQNAEQRNILSEETLDKVKAARVQAEDAA 1301 Query: 384 XXXXXAVSHCQDVWSRLDKHKNSDLASDIEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXX 205 AVS Q++W++LDK +NS L+ D+E K Sbjct: 1302 NLAAAAVSQRQEIWNQLDKQRNSGLSPDVETKLASAAVAIAAAAAVAKAAAAAANVASNA 1361 Query: 204 XXXXXQMADEALIAYDMSNPSQTNTVSFPNIVNNLGSATPASVLKSQDVGNGSSSIIFAA 25 MADEA+++ SNPSQ N +S + +LG TP VLK D N SSSI+ AA Sbjct: 1362 ALQAKLMADEAVVSGGYSNPSQDNAISVSEGMESLGRTTPDFVLKGDDGTNSSSSILVAA 1421 Query: 24 KEASRRRI 1 +EA+RRR+ Sbjct: 1422 REAARRRV 1429