BLASTX nr result
ID: Akebia25_contig00017827
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00017827 (1992 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera] 284 1e-73 ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao... 278 7e-72 emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera] 255 5e-65 ref|XP_002301045.1| VQ motif-containing family protein [Populus ... 240 2e-60 ref|XP_002307093.1| VQ motif-containing family protein [Populus ... 219 5e-54 ref|XP_002307385.1| VQ motif-containing family protein [Populus ... 213 2e-52 ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao... 211 8e-52 ref|XP_002534310.1| conserved hypothetical protein [Ricinus comm... 207 1e-50 ref|XP_002310570.2| VQ motif-containing family protein [Populus ... 206 2e-50 ref|XP_002513906.1| conserved hypothetical protein [Ricinus comm... 202 6e-49 gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis] 182 5e-43 ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich prote... 165 8e-38 ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citr... 165 8e-38 ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citr... 158 8e-36 ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [A... 158 1e-35 ref|XP_006596337.1| PREDICTED: probable myosin light chain kinas... 153 3e-34 ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phas... 152 7e-34 gb|EPS60571.1| hypothetical protein M569_14232 [Genlisea aurea] 144 1e-31 ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferas... 144 2e-31 ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phas... 144 2e-31 >emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera] Length = 422 Score = 284 bits (726), Expect = 1e-73 Identities = 182/402 (45%), Positives = 220/402 (54%), Gaps = 74/402 (18%) Frame = +3 Query: 741 LFDPLTSYLEVFSRS-----TPTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXX---- 893 +FDPL++Y + SRS P NLD+VW +T RS+PNCT+ Sbjct: 25 MFDPLSNYFDPLSRSPTQLQNPNSLLNLDMVWSKTLRSDPNCTEIGGILASSSSTPPFSG 84 Query: 894 ----------AQLDRIPFP---------NASTSADPN---RNTKKRSRASRRAPTTVLTT 1007 + L +PFP AS S D RN KKRSRASRRAPTTVLTT Sbjct: 85 AQGQIRATFPSSLPSMPFPPAPENAARATASASNDQTNVARNPKKRSRASRRAPTTVLTT 144 Query: 1008 DTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGH-XXXXXXXXXXXFAQKV 1184 DT+NFRAMVQEFTGIPA PF++SPF RSR DLF + ST+RSGH FAQK+ Sbjct: 145 DTTNFRAMVQEFTGIPAQPFTSSPFPRSRLDLFGTASTMRSGHLDHAPPSYLLRPFAQKL 204 Query: 1185 QQPPFV-----------SSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGL 1331 Q PPF SS+M+DAIA YQLPS+LGL Sbjct: 205 QPPPFASPPPSSSSSFSSSSMVDAIA----STTNITSGSASNTSSNSTSINYQLPSDLGL 260 Query: 1332 HRQSPNLL--NTENPMLTFQSLLQSP-----PNKYPFVAKSQAP---PSIDSRLKVRVLE 1481 +Q NLL N +NP+L+ QS LQ+P PN +K Q PS DS +K+ LE Sbjct: 261 VKQPQNLLNMNVQNPILSIQSFLQTPLKYPHPNSAIMGSKPQGSLEIPSTDSHIKMGGLE 320 Query: 1482 EFGTSHGHVNANVGGLPN---------------------MGNSDGDRNHLRSFNGNYGNS 1598 +FG SHGHVN ++ GLPN +G+S G+ L NGNY NS Sbjct: 321 DFGLSHGHVNTHLSGLPNLVSSDRTASRSDNNPPSWNDGLGSSGGNHGQLGPLNGNYNNS 380 Query: 1599 QRVSSCKMNFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 1724 QRV++ KMN+SASSSDFH DK ENVS+R EGMV+SWICSSD Sbjct: 381 QRVTNGKMNYSASSSDFHGDKVPENVSTRSEGMVESWICSSD 422 >ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao] gi|508700245|gb|EOX92141.1| VQ motif-containing protein [Theobroma cacao] Length = 472 Score = 278 bits (711), Expect = 7e-72 Identities = 189/409 (46%), Positives = 221/409 (54%), Gaps = 81/409 (19%) Frame = +3 Query: 741 LFDPLTSYLE-VFSRST-----PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQL 902 +FDPL++Y + SRS P NLD+VW + RSEPNCT L Sbjct: 65 MFDPLSNYFDHPLSRSPQLTTIPNSLLNLDVVWSKNLRSEPNCTDLGGFIASSSPTQQLL 124 Query: 903 ------DRIPFPN---------------ASTSADPN-------RNTKKRSRASRRAPTTV 998 R FP+ + T PN RN KKRSRASRRAPTTV Sbjct: 125 TNQQAQSRATFPSMQIPQGPESATKSSISGTGDQPNNNNSNMVRNPKKRSRASRRAPTTV 184 Query: 999 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGH-XXXXXXXXXXXFA 1175 LTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF + ST+RS FA Sbjct: 185 LTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFGTPSTMRSTPLDPSPPHYLLRPFA 244 Query: 1176 QKVQQPPFV----------SSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSEL 1325 QK+ PPFV SS+M+DAIA YQL SEL Sbjct: 245 QKIHPPPFVSSSTASSSFPSSSMVDAIASTPSTNITSASASNNNTTSSSTSINYQLSSEL 304 Query: 1326 GLHRQSPNLL--NTENPMLTFQSLLQSPPNKYPF---------VAKSQAPPSIDSRLKVR 1472 GL +Q NLL N +NP+L FQSLLQ+PP KYP + S PS DS LK+ Sbjct: 305 GLLKQPQNLLNINMQNPILNFQSLLQAPP-KYPLPNSTILGTKLQGSLDIPSNDSSLKMG 363 Query: 1473 VLEEFGTSHGHVNANVGGLPNMGNSDG-----------------------DRNHLRSFNG 1583 VLEEFG SHGHVN N+ GL NM +SDG D++ LRS NG Sbjct: 364 VLEEFGLSHGHVNTNLSGLQNMVSSDGALPRNDSSTNPPSWGEGTGSQEHDQSLLRSING 423 Query: 1584 NY-GNSQRVSSCKM-NFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 1724 Y NSQRVS+ K+ NFSASSSDFH DKG ENV++R EGMV+SWICSSD Sbjct: 424 GYNSNSQRVSNGKVSNFSASSSDFHGDKGPENVAARSEGMVESWICSSD 472 >emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera] Length = 449 Score = 255 bits (652), Expect = 5e-65 Identities = 169/395 (42%), Positives = 217/395 (54%), Gaps = 67/395 (16%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPN----LDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQLDR 908 LFDP ++Y++ FS+S+ P N LD VW R RSEPNCT + + Sbjct: 58 LFDPRSNYVDAFSQSSANPNANSLLNLDTVWSRGLRSEPNCTDFGNLTGLSSSSTSSSGQ 117 Query: 909 ----IPFP------NASTSADPN------RNTKKRSRASRRAPTTVLTTDTSNFRAMVQE 1040 + P AS+++ P+ R++KKR+RASRRAPTTVLTTDTSNFRAMVQE Sbjct: 118 SMLGVQGPVHENGGRASSASLPSDQTNVVRSSKKRTRASRRAPTTVLTTDTSNFRAMVQE 177 Query: 1041 FTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXXFA-QKVQ---------- 1187 FTGIPAPPFSASP++R R DLF + S+++ GH + KVQ Sbjct: 178 FTGIPAPPFSASPYSR-RLDLFGAGSSIKPGHLEPLGPLYPLRPSPHKVQPNLFVSSSSS 236 Query: 1188 -QPPFVSSTMIDAI------AXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLHRQSP 1346 P F +ST+ D+I A YQLPS+ G +Q Sbjct: 237 PSPSFFNSTIGDSIVSTTNIATTSTNNIITTSMAAATNAINSGSNTYQLPSDPGFPKQPQ 296 Query: 1347 NLLNTENPMLTFQSLLQSPPN---KYPF----VAKSQAPPSIDSRLKVRVLEEFGTSHGH 1505 N+L +NP+L+FQSLLQSPP+ KYP V +++P S+ L + EE G HGH Sbjct: 297 NVLGMQNPILSFQSLLQSPPSHPLKYPLADVPVFGTKSPASLT--LPLPSFEELGVPHGH 354 Query: 1506 VNANVGGLPN----------------------MGNSDGDRNHLRSFNGNYGNSQRVSSCK 1619 VNAN+ GLP+ G+++G R LR FNGNYG+S +VSS K Sbjct: 355 VNANISGLPSHATSGGSRRLRTDDNGTCWRDGAGSNEGSREQLRPFNGNYGDSPQVSSFK 414 Query: 1620 MNFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 1724 +N SASSS FH +KGS+NVSSRGEG VDSWIC SD Sbjct: 415 LNCSASSSAFHPEKGSDNVSSRGEGTVDSWICPSD 449 >ref|XP_002301045.1| VQ motif-containing family protein [Populus trichocarpa] gi|222842771|gb|EEE80318.1| VQ motif-containing family protein [Populus trichocarpa] Length = 423 Score = 240 bits (613), Expect = 2e-60 Identities = 162/375 (43%), Positives = 198/375 (52%), Gaps = 47/375 (12%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXX--------- 893 LFDP S VFS+S P P +V R RS+PNCT Sbjct: 50 LFDPTPSLFHVFSQSQPNPI----MVQSRGLRSDPNCTDLGINLPDSLSSSQSAVLGVQG 105 Query: 894 -------AQLDRIPFPNASTSADPN--------RNTKKRSRASRRAPTTVLTTDTSNFRA 1028 ++ R + S+ P+ RN KKR+RASRRAPTTVLTTDTSNFR Sbjct: 106 SSQALPSSKQLRSVHDDGGRSSSPSHDQTHGIARNPKKRTRASRRAPTTVLTTDTSNFRQ 165 Query: 1029 MVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXXFAQKV--QQPPFV 1202 MVQEFTGIPAPPFS SPFTR R DLF S LRSGH AQKV QQ PF+ Sbjct: 166 MVQEFTGIPAPPFSGSPFTR-RLDLFGPGSGLRSGHLEPLYPLRPT--AQKVHHQQTPFL 222 Query: 1203 SSTMIDAI-----------AXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLHRQSPN 1349 SS+ + + YQLP ++GLH+Q+ N Sbjct: 223 SSSFPSLLNNNIVHTTNIASTSTTANNNNTISTAATSTFNPSSLNYQLPDDIGLHKQTRN 282 Query: 1350 LLNTENPMLTFQSLLQSPPNKYPFVAKSQAPPSIDSR--LKVRVLEEFGTSHGHVNANVG 1523 LLN +N ML+ LL PP P + +SR L + LEE G HG+VNAN+ Sbjct: 283 LLNMQNQMLSIHPLLHPPPPPPPQQLPNVPGLGANSRASLPLPSLEELGMGHGYVNANLS 342 Query: 1524 GLPNMG-------NSDGDRNHLRSFNGNYGNSQRVSSCKMNFSASSSDFHADKGSENVSS 1682 GL + ++DG ++LRS NGNYGN QRV+SCK+N+S++SSDFH +KG ENVSS Sbjct: 343 GLTSHVTTEEMRLSNDGSHHNLRSLNGNYGNMQRVNSCKLNYSSASSDFHHEKGLENVSS 402 Query: 1683 RG-EGMVDSWICSSD 1724 RG EG VDSWIC S+ Sbjct: 403 RGTEGTVDSWICPSE 417 >ref|XP_002307093.1| VQ motif-containing family protein [Populus trichocarpa] gi|222856542|gb|EEE94089.1| VQ motif-containing family protein [Populus trichocarpa] Length = 510 Score = 219 bits (557), Expect = 5e-54 Identities = 169/439 (38%), Positives = 211/439 (48%), Gaps = 112/439 (25%) Frame = +3 Query: 744 FDPLTSYLEVFSRST------------PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXX 887 FDP ++Y + + S+ P NLD+VW + RS+PNCT Sbjct: 73 FDPFSNYFDPLAPSSSSSRSPLQSLTNPNSLNNLDMVWSKNLRSDPNCTDLGGFISSSLP 132 Query: 888 XX------------------AQLDRIPFPNASTSADPN---------RNTKKRSRASRRA 986 Q P + + + N RN KKRSRASRRA Sbjct: 133 TQQFTNQTQNRTTFQSLPSHGQESATRVPGSGSVSGTNDQVSNTAGIRNPKKRSRASRRA 192 Query: 987 PTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLF-NSTSTLRSG----HXXXXX 1151 PTTVL+TDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF + STLRS Sbjct: 193 PTTVLSTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAASTLRSAVSQHLDPSPP 252 Query: 1152 XXXXXXFAQKVQ---QPPFVSS---------TMIDAIA-XXXXXXXXXXXXXXXXXXXXX 1292 FA+K Q PPFVSS +M+DAIA Sbjct: 253 PYLLGPFAKKFQPPPPPPFVSSGSAASSFSASMVDAIASTTATNINGTCTNTTISNNIPL 312 Query: 1293 XXXXYQLPSELGLHRQSPNL--LNTENPMLTFQSLLQSPPNKYPF-----VAKSQAP--- 1442 YQLPS+LGL +Q NL LN +NP+L F LLQ+PP KYP + + P Sbjct: 313 TSINYQLPSDLGLLKQPHNLLNLNVQNPILNFHPLLQAPP-KYPLPDSPNILGTTKPQQG 371 Query: 1443 ----PSIDSRLKVRVLEEFGTSHGHVNANVGGLPNM------------------------ 1538 P S LK+ VLEEFG +HGHVN N+ GL N+ Sbjct: 372 SLEIPLNVSHLKMVVLEEFGLNHGHVNTNLSGLQNIVSSSSPSADVTLVRRSDHSNSLTN 431 Query: 1539 -----GNSDGDRNH---------LRSFNGNYGNS-QRVSSCKMNFSASSSDFHADK--GS 1667 G+++ D +H LRS NG+Y NS QRV++ K+NF ASSSDF D G Sbjct: 432 WGDGAGSNEVDHHHHQQQQQQGLLRSINGDYNNSTQRVTNGKVNFLASSSDFCGDHKLGQ 491 Query: 1668 ENVSSRGEGMVDSWICSSD 1724 ENV++R EG ++SWICSSD Sbjct: 492 ENVATRSEGTMESWICSSD 510 >ref|XP_002307385.1| VQ motif-containing family protein [Populus trichocarpa] gi|222856834|gb|EEE94381.1| VQ motif-containing family protein [Populus trichocarpa] Length = 437 Score = 213 bits (543), Expect = 2e-52 Identities = 163/389 (41%), Positives = 197/389 (50%), Gaps = 61/389 (15%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPN-----LDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQLD 905 +FDP + FS+S PN LD+V R RSE +CT+ Sbjct: 50 IFDPSPALFHAFSQSQSITNPNSSMLNLDMVHSRGLRSEHSCTRLGINLPDSLSSSQSAP 109 Query: 906 ----------------RIPFPNASTSADPN-------RNTKKRSRASRRAPTTVLTTDTS 1016 R N S+ P+ RN KKR+RASRRAPTTVLTTDTS Sbjct: 110 LGAQGSSQALPSSMQLRSVHDNGVRSSSPSDQTHGVARNPKKRTRASRRAPTTVLTTDTS 169 Query: 1017 NFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXX-FAQKV--Q 1187 NFR MVQEFTGIPAPPF+ S FTR R DLF S LRSGH AQKV Q Sbjct: 170 NFRQMVQEFTGIPAPPFTGSSFTR-RLDLFGPGSGLRSGHLEPIGSLYPLRPSAQKVHHQ 228 Query: 1188 QPPFVSST--------MIDAI---AXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLH 1334 Q P +SS+ ++D + YQL + LGLH Sbjct: 229 QTPLLSSSSPSFFNNDIVDGTNIASTSTTANNNNTITTATTSTFNPSSVNYQLSAHLGLH 288 Query: 1335 RQSPNLLNTENPMLTFQSLLQSPPNKYPFVA-------KSQAPPSIDSRLKVRVLEEFGT 1493 +Q NLLN +N ML+ LLQ P + +A KSQA + S EE G Sbjct: 289 KQPQNLLNMQNQMLSIHPLLQPPAPPFQSLANVPGLGAKSQASFPLPS------FEELGM 342 Query: 1494 SHG--HVNANVGGLPNMG-------NSDGDRNH-LRSFNGNYGNSQRVSSCKMNF-SASS 1640 HG HVNA++GGL + +SDGD++H LRS +GNYGN +RV+SCK+N+ SASS Sbjct: 343 GHGDGHVNAHLGGLTSHVTTEGMRLSSDGDQDHNLRSLDGNYGNMKRVNSCKLNYSSASS 402 Query: 1641 SDFHADKGSENVSSRG-EGMVDSWICSSD 1724 S FH DK ENVSSRG EG VDSWIC S+ Sbjct: 403 SGFHHDKVLENVSSRGAEGTVDSWICPSE 431 >ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao] gi|508724130|gb|EOY16027.1| VQ motif-containing protein [Theobroma cacao] Length = 551 Score = 211 bits (538), Expect = 8e-52 Identities = 159/374 (42%), Positives = 200/374 (53%), Gaps = 67/374 (17%) Frame = +3 Query: 744 FDPLTSYLEVFSRSTPTPTP-NLDI-VWPRTQRSEPNCTQXXXXXXXXXXXXAQ-----L 902 FDP ++YL FS+S P + NLD V PR RSEPNCT + L Sbjct: 119 FDPSSNYLNPFSQSQPNNSLLNLDGGVRPRGLRSEPNCTDLGNLPGSSSSSQSMLGAQGL 178 Query: 903 DRIPFPNAST----SADPN--------------RNTKKRSRASRRAPTTVLTTDTSNFRA 1028 ++ FP++S+ A N +N KKR+RASRRAPTTVLTTDT+NFRA Sbjct: 179 NQGSFPSSSSMQSRPAHDNGARSLAQSDQTSVVKNPKKRTRASRRAPTTVLTTDTTNFRA 238 Query: 1029 MVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGH-XXXXXXXXXXXFAQKVQQPPFVS 1205 MVQEFTGIPAPPFS S ++R R DLF S S +RS H A++VQ PFVS Sbjct: 239 MVQEFTGIPAPPFSGSSYSR-RLDLFGSGSGMRSSHLEPLGSLYPLRPSAKRVQPTPFVS 297 Query: 1206 ST--------MIDA--IAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLHRQSPNLL 1355 S+ ++DA I YQLPS+L L +Q N+L Sbjct: 298 SSSPSLLNNPLVDAANITNTTSNSTIPTSIAATTNAFNPTSSNYQLPSDLSLLKQPQNML 357 Query: 1356 NTEN--PMLTFQSLLQSPPNKYP------FVAKSQAPPSIDSRLKVRVLEEFGTSHGHVN 1511 N +N P+L+FQS LQ PP +P F KSQ ++ S L+E G SHGHVN Sbjct: 358 NLQNQSPVLSFQSFLQ-PPTLHPSLNLPGFGVKSQGSSAMPS------LDELGMSHGHVN 410 Query: 1512 ANVGGLPN------------------MGNSDGDRNHLRSFNGNYG----NSQRV-SSCKM 1622 AN+GGL + +G +DG+++HLR +GNYG NSQRV +SCK+ Sbjct: 411 ANLGGLQSHVTPDGPRARSDSNWRDGIGLNDGNQDHLRPLDGNYGNDHHNSQRVNNSCKL 470 Query: 1623 NFSASSSDFHADKG 1664 NFSASSSDFH DKG Sbjct: 471 NFSASSSDFHHDKG 484 >ref|XP_002534310.1| conserved hypothetical protein [Ricinus communis] gi|223525518|gb|EEF28072.1| conserved hypothetical protein [Ricinus communis] Length = 498 Score = 207 bits (527), Expect = 1e-50 Identities = 171/440 (38%), Positives = 208/440 (47%), Gaps = 112/440 (25%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTP-------NLDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQ 899 +FDPL++Y + S S P P NLD+VW + RS+ NCT Q Sbjct: 66 MFDPLSNYFDPLSSSRPPPPLTHPNSLLNLDMVWSKNLRSDTNCTDLGGFIATSSSPTQQ 125 Query: 900 L------------DRIPFP-------------------------NASTSADPNRNTKKRS 968 I P N +T+ + RN KKRS Sbjct: 126 FFTNQTQTGPTYNPSIQIPPVQETTAPSRGPGSASASGSNGHQTNNTTTTNIVRNPKKRS 185 Query: 969 RASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFN--STSTLRS--GH 1136 RASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF + S+LRS H Sbjct: 186 RASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAAASSLRSVVSH 245 Query: 1137 -XXXXXXXXXXXFAQKVQQPPFV----SSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXX 1301 FAQK+ QPPF+ SS+MIDAIA Sbjct: 246 LEPSHPSYLLRPFAQKI-QPPFLSSSSSSSMIDAIASSTTTSTNINSNATTNTTTNTTSN 304 Query: 1302 XYQLPSELGLHRQSPNLLNT---ENPMLTFQSLLQ-----SPPNKYPFVAKSQA----PP 1445 S+L L + NLLN +PML F SL Q S PN K Q P Sbjct: 305 -----SDLSLLKYPQNLLNINMHNSPMLNFHSLFQPSPKYSLPNSSILATKPQEGSLDTP 359 Query: 1446 SIDSRLKVRVLEEFGTSHGHVNANVGGLPNM----------------------------- 1538 S D LK+ VLEEFG SHGHV+ N+ GL N+ Sbjct: 360 SNDPHLKMGVLEEFGLSHGHVSTNLTGLHNLVSSSDTTLRRSDHNSSSSSNNNNNNSGNW 419 Query: 1539 -----GNSDGDRNHLRSFNGNY------GNSQRV--SSCKMNFSASSSDF-HADKGSEN- 1673 G+++GD + LRS NGNY N+QRV ++ K+N+SASSSDF H DKG E Sbjct: 420 GDRRVGSNEGD-HLLRSINGNYNNNNSSSNTQRVVANNGKVNYSASSSDFNHGDKGPETN 478 Query: 1674 ---VSSRGEGMVDSWICSSD 1724 ++R EGMV+SWICSSD Sbjct: 479 VVVANTRSEGMVESWICSSD 498 >ref|XP_002310570.2| VQ motif-containing family protein [Populus trichocarpa] gi|550334197|gb|EEE91020.2| VQ motif-containing family protein [Populus trichocarpa] Length = 527 Score = 206 bits (525), Expect = 2e-50 Identities = 172/432 (39%), Positives = 208/432 (48%), Gaps = 111/432 (25%) Frame = +3 Query: 741 LFDPLTSYLEVFS----RSTPTPTPN------LDIVWPRTQRSEPNCTQXXXXXXXXXXX 890 LFDPL++Y + S RS P P N LD+VW + RSEPNCT Sbjct: 69 LFDPLSNYFDPLSSASSRSPPPPFTNPNSLLNLDMVWSKNLRSEPNCTDLGGFISSSSPT 128 Query: 891 XAQLD-----RIPF----PNASTSADPN---------------RNTKKRSRASRRAPTTV 998 R F P+ SA RN KKRSRASRRAPTTV Sbjct: 129 QQLFTNQTQTRTTFQSLPPHGHESATRGPVSGTNDQVSNTAGVRNPKKRSRASRRAPTTV 188 Query: 999 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLF-NSTSTLRSG----HXXXXXXXXX 1163 LTTDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF + STLRS Sbjct: 189 LTTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAASTLRSAVSHHLDPSPPPYLL 248 Query: 1164 XXFAQKVQ-----QPPFVSS---------TMIDAIA---XXXXXXXXXXXXXXXXXXXXX 1292 FAQ+ Q PPF SS +M+DAIA Sbjct: 249 RPFAQRFQPPPPPAPPFASSGSTASSFSTSMVDAIASTTTTNINNSGACTNSTTTSNISS 308 Query: 1293 XXXXYQLPSELGLHRQSPNLL--NTENPMLTFQSLLQSPPNKYPF--------VAKSQAP 1442 YQLPS+LGL +Q +LL N +NP+L F L Q+ P+KYP K+Q Sbjct: 309 TSINYQLPSDLGLLKQPHHLLNINVQNPILNFHPLFQA-PHKYPLPNSTNILGTTKAQQG 367 Query: 1443 -----PSIDSRLKVRVLEEFGTSHGHVNANVGGLPNMGNSD------------GDRNH-- 1565 PS DS LK+ VLEEFG SHGHV+ N+ GL N+ +S GD N+ Sbjct: 368 SSLEIPSNDSHLKMGVLEEFGMSHGHVSTNLTGLQNIVSSSSSPSADATLMRRGDHNNNL 427 Query: 1566 -----------------------LRSFNGNYGNS-QRVSSCKMNF-SASSSDFHAD-KGS 1667 LRS NGNY NS QRV++ K+NF ++SSSDF D KG Sbjct: 428 ANWGDGVGSNGGGHHHHQQQQGLLRSINGNYNNSTQRVTNGKVNFLASSSSDFRGDNKGQ 487 Query: 1668 ENVSSRGEGMVD 1703 ENV++R E +V+ Sbjct: 488 ENVATRSEEVVN 499 >ref|XP_002513906.1| conserved hypothetical protein [Ricinus communis] gi|223546992|gb|EEF48489.1| conserved hypothetical protein [Ricinus communis] Length = 446 Score = 202 bits (513), Expect = 6e-49 Identities = 158/396 (39%), Positives = 189/396 (47%), Gaps = 69/396 (17%) Frame = +3 Query: 744 FDPLTSYLEVFSRSTPTPTPN-----LDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQLDR 908 FDP + FS+S+P P N LD+V PR RS+P+CT A Sbjct: 67 FDPSPNLFHAFSQSSPNPNLNSSLLNLDVVRPRGLRSDPDCTDLRSNLPGSSSSSATAPA 126 Query: 909 I------------------PFP---------NASTSADPN-------RNTKKRSRASRRA 986 P P N + P+ RN KKR+RASRRA Sbjct: 127 AAPSGQSSVLGAQGSGQGAPLPSMQLRSVQDNGGRCSSPSDQTHVVTRNPKKRTRASRRA 186 Query: 987 PTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNST-STLRSGHXXXXXXXXX 1163 PTTVLTTDTSNFRAMVQEFTGIPAPPFS SP++R R DLF S S +RS H Sbjct: 187 PTTVLTTDTSNFRAMVQEFTGIPAPPFSGSPYSRCRLDLFGSVGSGMRSSHLEQMGSLYP 246 Query: 1164 XX-FAQKVQ--QPPFVSS---------TMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXX- 1304 AQKVQ Q PF S TM+DA Sbjct: 247 LHPSAQKVQHQQSPFSFSSSSSLLNTNTMVDATNIASTTTTTNDNNTITSSSIPAGTTST 306 Query: 1305 --------YQLPSELG-LHRQSPNLLNTENPMLTFQSLLQSPP------NKYPFVAKSQA 1439 YQLPS+LG L +Q N+LN +N ML+FQSLLQ PP N + AKSQA Sbjct: 307 FNPSSINNYQLPSDLGQLSKQPQNMLNMQNQMLSFQSLLQPPPLHHSSLNVHGLGAKSQA 366 Query: 1440 PPSIDSRLKVRVLEEFGTSHGHVNANVGGLPNMGNSDGDRNHLRSFNGNYGNSQRVSSCK 1619 + S L++ G SH NAN+ G+P+ + + LR N+ +SCK Sbjct: 367 SMPLPS------LDDLGMSH---NANLSGIPSHNVTTAEGMRLR-------NNDHNNSCK 410 Query: 1620 MNFSASSSDFHA-DKGSENVSSRGEGMVDSWICSSD 1724 N+SASSSDFH DKG E V RGEG VDSWIC S+ Sbjct: 411 FNYSASSSDFHHHDKGLEIVPPRGEGAVDSWICPSE 446 >gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis] Length = 443 Score = 182 bits (462), Expect = 5e-43 Identities = 150/436 (34%), Positives = 197/436 (45%), Gaps = 108/436 (24%) Frame = +3 Query: 741 LFDPLTSYLEVFSRST------PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXX--- 893 +FDPL+++ + S S+ P P NLD+VWP+ RS+PN Sbjct: 54 MFDPLSNFFDPVSSSSSSRSLNPNPFLNLDMVWPKPVRSDPNPNSSELVSLIPSSSQPNF 113 Query: 894 --------------------------------AQLDRIPFPNASTSADPN---------- 947 +Q+ R+P +S+SAD N Sbjct: 114 FSSNNSIQLGQTAVAGASNFPAIQIAPEIQTRSQIHRLP---SSSSADRNDAVNITPNGG 170 Query: 948 ---RNTKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTS 1118 RN KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF++SPF R+R DLF S S Sbjct: 171 AAPRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFGSGS 230 Query: 1119 TLRSG---------HXXXXXXXXXXXFAQKVQQ-PPFVSSTMIDAIAXXXXXXXXXXXXX 1268 +RS FAQK+QQ PFV+++ + + Sbjct: 231 GIRSAPLDPHHHHPSTGTSSYNLLRPFAQKIQQTTPFVNTSASSSSS------------- 277 Query: 1269 XXXXXXXXXXXXYQLPSELGLHRQSPNLLNTE-NPMLTFQSLLQSPPNKYPFVAKSQAPP 1445 PS S +LLN + NP+L+F SLLQ+ P K+ + + A Sbjct: 278 ---------------PST----TTSNSLLNIQTNPVLSFHSLLQNAPPKFAKMGSTSAS- 317 Query: 1446 SIDSRLKVRVLEEFGTSHGH---VNANVGGLPN--------------------MGNSDGD 1556 ++FG SHGH VN +GG+PN MG++D + Sbjct: 318 ----------ADQFGLSHGHHVNVNPQLGGIPNPPTTMATTTATNWGITTDHGMGSNDNN 367 Query: 1557 RNH------------LRSFNGNYGNSQRVSSC-------KMNFSASSS-DFHADKGSENV 1676 + LRS NG Y + +S K+N+SASSS DFH K NV Sbjct: 368 NGNNGNNSNVDEGLLLRSINGGYTANTTAASAAAVSNGHKVNYSASSSTDFHGSKTEINV 427 Query: 1677 SSRGEGMVDSWICSSD 1724 ++R EGMV+SWICSSD Sbjct: 428 AARSEGMVESWICSSD 443 >ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich protein C215.13-like [Citrus sinensis] Length = 429 Score = 165 bits (417), Expect = 8e-38 Identities = 143/395 (36%), Positives = 179/395 (45%), Gaps = 70/395 (17%) Frame = +3 Query: 750 PLTSYLEVFSRSTPTPTP------------NLDIVWPRTQRS-------EPNCTQXXXXX 872 P +SYL+ S+S P NLD+V RT RS EP+CT Sbjct: 60 PSSSYLQAHSQSQSQSQPQPQHNSNPSSFLNLDLVGSRTTRSCSVFRSSEPSCTDSSTVA 119 Query: 873 XXXXXXXAQLDRIPFPN---------ASTSADPN---RNTKKRSRASRRAPTTVLTTDTS 1016 P + + N +N KKR+R SRRAPTTVLTTDTS Sbjct: 120 HQGLINHGSFSTAPSSSHMQQQSRLLVNDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTS 179 Query: 1017 NFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRS------------GHXXXXXXXX 1160 NFRAMVQEFTGIP+ PFS R DLF S ++S G Sbjct: 180 NFRAMVQEFTGIPSQPFSVGSSYSRRLDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPA 239 Query: 1161 XXXFAQKVQQPPFVSSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLH-- 1334 F + P SS+MIDAIA Y + SELGL+ Sbjct: 240 TQKFQPNLFSSPSSSSSMIDAIA-----------AAAAASTSHNTTSNYHVLSELGLNSN 288 Query: 1335 -----RQSPNLLN--------TENPMLTFQSLLQ-SPPNKYPFVA-----KSQA----PP 1445 ++ N LN +P+++FQS+LQ S P P + KSQA P Sbjct: 289 NNNNTKEPQNTLNNIMQSQNLNHHPVVSFQSILQHSSPLNNPSLTFGANNKSQASHFGAP 348 Query: 1446 SIDSRLKVRVLEEFGTSHGHVNANVGGLPNMGNSDGDRNHLRSFNGNYGN-SQRVSSCKM 1622 S + + G+ HV GLPN +++ RSF+GN+ N SQR +SCK+ Sbjct: 349 SFEDHDHHLAMSHHGS---HV-----GLPN------NQDQFRSFDGNFANSSQRATSCKL 394 Query: 1623 NFSASSSDFHADKGSENVSSRG-EGMVDSWICSSD 1724 N+SASSSDFH +K ENVSSRG EG VDSWIC SD Sbjct: 395 NYSASSSDFHHNKNLENVSSRGTEGTVDSWICPSD 429 >ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citrus clementina] gi|557536131|gb|ESR47249.1| hypothetical protein CICLE_v10001250mg [Citrus clementina] Length = 426 Score = 165 bits (417), Expect = 8e-38 Identities = 140/388 (36%), Positives = 176/388 (45%), Gaps = 63/388 (16%) Frame = +3 Query: 750 PLTSYLEVFSRSTPTPTP----------NLDIVWPRTQRS-------EPNCTQXXXXXXX 878 P +SYL+ S+S P NLD+V RT RS EP+CT Sbjct: 61 PSSSYLQAHSQSQSQSQPQHNSNPSSFLNLDLVGSRTTRSCSLIRSSEPSCTDSSTVAHQ 120 Query: 879 XXXXXAQLDRIPFPN---------ASTSADPN---RNTKKRSRASRRAPTTVLTTDTSNF 1022 P + + N +N KKR+R SRRAPTTVLTTDTSNF Sbjct: 121 GLINHGSFSTAPSSSHMQQQSRLLVNDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTSNF 180 Query: 1023 RAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRS------------GHXXXXXXXXXX 1166 RAMVQEFTGIP+ PFS R DLF S ++S G Sbjct: 181 RAMVQEFTGIPSQPFSVGSSYSRRLDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPATQ 240 Query: 1167 XFAQKVQQPPFVSSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLH---- 1334 F + P SS+MIDAIA Y + SELGL+ Sbjct: 241 KFQPNLFSSPSSSSSMIDAIA-------------AAASTSHNTTSNYHVLSELGLNSNNN 287 Query: 1335 ---RQSPNLLN--------TENPMLTFQSLLQ-SPPNKYPFVAKSQAPPSIDSRLKVRVL 1478 ++ N LN +P+++FQS+LQ S P P + S S Sbjct: 288 NNTKEPQNTLNNIMQSQNLNHHPVVSFQSILQHSSPLNNPSLTFGANNKSQASHFGAPSF 347 Query: 1479 EE----FGTSHGHVNANVGGLPNMGNSDGDRNHLRSFNGNYGN-SQRVSSCKMNFSASSS 1643 E+ SH +A+ GLPN +++ RSF+GN+ N SQR +SCK+N+SASSS Sbjct: 348 EDHDHHLAMSH---HASHVGLPN------NQDQFRSFDGNFANSSQRATSCKLNYSASSS 398 Query: 1644 DFHADKGSENVSSRG-EGMVDSWICSSD 1724 DFH +K ENVSSRG EG VDSWIC SD Sbjct: 399 DFHHNKNLENVSSRGTEGTVDSWICPSD 426 >ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citrus clementina] gi|568819356|ref|XP_006464221.1| PREDICTED: uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like [Citrus sinensis] gi|557530164|gb|ESR41414.1| hypothetical protein CICLE_v10025465mg [Citrus clementina] Length = 491 Score = 158 bits (400), Expect = 8e-36 Identities = 143/416 (34%), Positives = 185/416 (44%), Gaps = 92/416 (22%) Frame = +3 Query: 753 LTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXXAQ----------- 899 L+S L + + P NLD+VW ++ RSEPNCT A Sbjct: 86 LSSSLPLNTNHHPNSLLNLDMVWSKSLRSEPNCTDLGGLFVPSSSSSATAFPALHVSPRE 145 Query: 900 -------------LDRIPFP---------NASTSADPN-----RNTKKRSRASRRAPTTV 998 L P P N + S++ + RN KKRSRASRRAPTTV Sbjct: 146 GGTESVSSKAPSFLATAPGPGLNIDQSHINLNNSSNQHSTMMVRNPKKRSRASRRAPTTV 205 Query: 999 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSG--------------- 1133 LTTDT+NFRAMVQEFTGIPAPPF++S F R+R DLF ++S+ S Sbjct: 206 LTTDTTNFRAMVQEFTGIPAPPFTSSHFPRTRLDLFGNSSSTSSSLMMRSTIGSHLDSSL 265 Query: 1134 HXXXXXXXXXXXFAQKVQQP-PFV----SSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXX 1298 FAQK+ P PF SS IDAIA Sbjct: 266 PQPPSSYNLLRPFAQKLINPIPFSTSNNSSIFIDAIA-----SASTSTPNATATTTSTNI 320 Query: 1299 XXYQLPSELGLHRQSPNLLNTE--NPMLTFQSLLQSPPNKYPFVAK---SQAPPSIDSRL 1463 QLPS H+Q+ +N + NP+L SLLQ PP KYP PP Sbjct: 321 NYQQLPS----HQQNLFGMNMQHNNPILNLHSLLQVPP-KYPLANSPILETKPPPPPPPP 375 Query: 1464 KVRVLEEFGTSH------GHVNAN-VGGLPNMGNSDGDRNHLRSFNGNYGNSQRVSSCKM 1622 + LEE G SH H+N N + GL ++ +S + N S++G+ + ++ Sbjct: 376 QGSSLEELGLSHAAAVSASHLNTNLMSGLQSLVSSSDNNNSPTSWHGHGTGTGATAAVGS 435 Query: 1623 N-----------------FSASSSDFHADKGSENV-----SSRGEGMVDSWICSSD 1724 N FS +SS+FH DKG E+V ++R EGMV+SWICSSD Sbjct: 436 NENEEATAGLFANGKLSRFSQASSEFHRDKGQESVNVAAATTRTEGMVESWICSSD 491 >ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [Amborella trichopoda] gi|548831358|gb|ERM94166.1| hypothetical protein AMTR_s00010p00175790 [Amborella trichopoda] Length = 326 Score = 158 bits (399), Expect = 1e-35 Identities = 116/286 (40%), Positives = 139/286 (48%), Gaps = 30/286 (10%) Frame = +3 Query: 957 KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTR--SRFDLFNSTSTLRS 1130 KKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIP PPFS+SPF R +RFD RS Sbjct: 47 KKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPNPPFSSSPFQRASTRFDFIGGGGGSRS 106 Query: 1131 GHXXXXXXXXXXXFAQKVQQPPFVSSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQ 1310 F QK PP SS I + Sbjct: 107 ---EPAPPFLLRPFPQK-PSPPLSSSNSISGSSSLNIVSSNADIVMPNYLAASSSSQNVP 162 Query: 1311 LPSELGLHRQSPNLLNTENPMLTFQSLLQSPPNKYPFVAKSQAPPSIDSRLKVRVLEEFG 1490 +P +L + Q P +P+L+ + SP P + DSRLK VLE FG Sbjct: 163 VP-QLPIQMQGPPSFVNFHPVLSHNAKFMSPMAPMPGFLAGKGQIPADSRLKSGVLEGFG 221 Query: 1491 ---------TSHGH-------------VNANVGGLPNMGNSDGDRNHLRSFN-----GNY 1589 + HGH + GG G + + +RS + NY Sbjct: 222 SDSGQIGGASGHGHGQTGGPRPDFVSGGGGSRGGDLGYGGDEEEEGFMRSSSSSVAANNY 281 Query: 1590 GNSQRVSSCKMNFS-ASSSDFHADKGSENVSSRGEGMVDSWICSSD 1724 +QR SSCK+N+S +SSSDFH +KGSENV RGEGMVDSWICSSD Sbjct: 282 FGNQRNSSCKLNYSVSSSSDFHVEKGSENV-GRGEGMVDSWICSSD 326 >ref|XP_006596337.1| PREDICTED: probable myosin light chain kinase DDB_G0279831-like [Glycine max] Length = 429 Score = 153 bits (386), Expect = 3e-34 Identities = 139/382 (36%), Positives = 173/382 (45%), Gaps = 54/382 (14%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQ--RSEPNCTQXXXXXXXXXXXX------- 893 LFD SYL S+ P NLD +Q RSEP+CT Sbjct: 65 LFDLSPSYLHALSQ--PNSFLNLDTTTSSSQPRRSEPDCTLNVTSSPPPPTTTNIDQCLL 122 Query: 894 ----------AQLDRIPFPNASTSADPNRNTKKRSRASRRAPTTVLTTDTSNFRAMVQEF 1043 A+ D I F + TS + RN+KKR+RASRRAPTTVLTTDTSNFRAMVQEF Sbjct: 123 GSQGGLNVDNARRDTILFESGKTS-NLGRNSKKRTRASRRAPTTVLTTDTSNFRAMVQEF 181 Query: 1044 TGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXXF---AQKV-------QQP 1193 TGIPAPPFSAS R DL +S+LRS QKV Q P Sbjct: 182 TGIPAPPFSASSSYSRRLDLLTGSSSLRSFSHLDTTTGPFYPLRPSPQKVHHHHHHHQNP 241 Query: 1194 PFVSST------MIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGL--HRQSPN 1349 +SS+ M+DAIA QLP +LGL H N Sbjct: 242 LLLSSSSSPYNNMVDAIA-STTTTNNSSSNNNNNNNNPINFQQQQLPPDLGLPYHHNPQN 300 Query: 1350 LLNT--ENPMLTFQSLLQSPPNKYPFVAKSQAPPSIDSRLKVRVLEEFGTSHGHVNANVG 1523 ++ + ++P L F PP +PF ++ P +E+ G SHG VN N Sbjct: 301 IMLSMQDHPTLAFH---PPPPPLHPFGFSAKLPS----------IEDLGMSHGQVNNNNP 347 Query: 1524 GLPNMGNSDGDRNHLRSFNGNYGNSQRVS-------SCKMNF---SASSSDFHADKGSEN 1673 G+ + LRS N + G ++ VS SCK+NF SAS+S H +N Sbjct: 348 NFVASGHVTSEGVPLRSVNNDGGGARDVSLRSLDGGSCKLNFSVASASTSLNHEKSTLQN 407 Query: 1674 -----VSSRGEGMVDSWICSSD 1724 +RGEG VDSWICSS+ Sbjct: 408 NNNASTGTRGEGTVDSWICSSE 429 >ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phaseolus vulgaris] gi|561037106|gb|ESW35636.1| hypothetical protein PHAVU_001G251600g [Phaseolus vulgaris] Length = 479 Score = 152 bits (383), Expect = 7e-34 Identities = 139/436 (31%), Positives = 188/436 (43%), Gaps = 108/436 (24%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTP----NLDIVWPRTQRSEP----------------NCTQX 860 +FDPL++YL+ RS P NLD+VW RSEP N Sbjct: 67 VFDPLSNYLDPTQRSQSHPNATQILNLDMVWNTVARSEPDLAGLMPSSSSPSPHNNSNNQ 126 Query: 861 XXXXXXXXXXXAQLDRIPFPNASTSADPN-------------------------RNTKKR 965 +Q NA+ SA P RN KKR Sbjct: 127 GFLLSQLGAGQSQTRGGGAVNAAVSAFPTSLAPESGSPRGGFEQNSGNANTNVVRNPKKR 186 Query: 966 SRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTST-----LRS 1130 SRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF S++ LRS Sbjct: 187 SRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFASSNASSSVLLRS 246 Query: 1131 GH----------XXXXXXXXXXXFAQKVQ-QPPFVS-----STMIDAIAXXXXXXXXXXX 1262 FA KVQ QP + S+M++ +A Sbjct: 247 ASSHLEQPSSQTHTQTPSYLLRPFAHKVQAQPSSIPHNNSFSSMLNTLASNNNSG----- 301 Query: 1263 XXXXXXXXXXXXXXYQLPSELGLHRQSPNLLNTENPMLTFQSLLQSPPNKYPFVAKSQ-- 1436 +H Q + LN NP+L+ QS+L + + +K+Q Sbjct: 302 -----------------SGSASIHYQQ-HSLNMHNPILSLQSILGNNDSSVLVGSKTQQQ 343 Query: 1437 ------APPSIDSRLKVRVLEEFGTSHGHV--------NANV------GGLPNMGNSDGD 1556 P ++DS LK+ LEE G H HV N N+ G L + N+ Sbjct: 344 QPSLEITPGTVDSHLKMSGLEELGLRHAHVGGHHHHHQNMNMVSSSSDGALSRVNNNISI 403 Query: 1557 RNHLRS-FNGNYGNSQRVSSCK----------------MNFSASSSDFHADKGSEN--VS 1679 N++R + ++ +QR+ +N+ +S SDFH +KG+ + V+ Sbjct: 404 NNNMRGPSSADWAQAQRIGGSNDGGVLRSLSGGTATGTLNYRSSVSDFHGEKGAPDCAVA 463 Query: 1680 SRGEGMVDSWI-CSSD 1724 +R EGMV+SWI CSSD Sbjct: 464 ARSEGMVESWINCSSD 479 >gb|EPS60571.1| hypothetical protein M569_14232 [Genlisea aurea] Length = 349 Score = 144 bits (364), Expect = 1e-31 Identities = 120/360 (33%), Positives = 157/360 (43%), Gaps = 32/360 (8%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQ---RSEPNCTQXXXXXXXXXXXXAQLDRI 911 +FDPLT+Y++ + PT + W R RS+P+ Q Sbjct: 39 MFDPLTNYMQQIQYDSMNPT----MAWTRPVIPVRSDPDGEMIRQRPPVGLIPSFQFQAS 94 Query: 912 PFPNASTSADP---------------NRNTKKRSRASRRAPTTVLTTDTSNFRAMVQEFT 1046 A+ S P RN KKRSRASRRAPTTVLTTDT+NF+AMVQEFT Sbjct: 95 ATATAAESTKPIVAGQNLYQNPNQNGTRNPKKRSRASRRAPTTVLTTDTTNFKAMVQEFT 154 Query: 1047 GIPAPPFSASPFTRSRFDLFNSTSTLRSGH---XXXXXXXXXXXFAQKVQ--------QP 1193 GIP+PPFS S F R+RFDLF S S G FAQK++ P Sbjct: 155 GIPSPPFSTSSFMRNRFDLFGSRSAAVDGSVHAPQHLPPYLRRPFAQKLEPSAAAPFTTP 214 Query: 1194 PFVSSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXXXYQLPSELGLHRQSPNLLNTENPM 1373 P +S A + QLP Q+PN N +NP+ Sbjct: 215 PATNSNNNTAAS----------------SSSSPLINYQQLPL-----AQNPNPFNVQNPL 253 Query: 1374 LTFQSLLQSPPNKYPFVAKSQAPPSIDSRLKVRVLEEFGTSHGHVN---ANVGGLPNMGN 1544 L S+LQ P F+ S + P D +++ L+EF GHVN N+ LP++ N Sbjct: 254 L--NSVLQQNPK---FIFSSPSIPPSDGEIRIGSLDEFMLGLGHVNHAAMNLTDLPSLVN 308 Query: 1545 SDGDRNHLRSFNGNYGNSQRVSSCKMNFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 1724 R + FNGNYG ++ H +K EN+ R + SWICSS+ Sbjct: 309 ----RVNECEFNGNYG---------------ANLLHGEKAPENIVGREGTVESSWICSSE 349 >ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79 specific-like [Glycine max] Length = 486 Score = 144 bits (362), Expect = 2e-31 Identities = 137/434 (31%), Positives = 186/434 (42%), Gaps = 106/434 (24%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCT---------------------- 854 +FDPL++YL+ ++S+ T NLD++W +T RSE N T Sbjct: 64 MFDPLSNYLDPITQSS-TSLLNLDVMWSKTGRSESNQTDLVGLIPCSSSSVPSPHNEAFV 122 Query: 855 -----------------QXXXXXXXXXXXXAQLDRIPFPNASTSADPNRNTKKRSRASRR 983 + A D+I N + + + RN KKRSRASRR Sbjct: 123 SSQTRGNNSGAFPTLPPESGSRGLMLSVSAANNDQIQTHNNNNNCNVVRNPKKRSRASRR 182 Query: 984 APTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTS--TLRSG------HX 1139 APTTVLTTDT+NFRAMVQEFTGIPAPPF++S F R+R DLF ST+ TLRS Sbjct: 183 APTTVLTTDTTNFRAMVQEFTGIPAPPFTSSSFPRTRLDLFASTATPTLRSNVNVNPFDP 242 Query: 1140 XXXXXXXXXXFAQKVQQ------PPFVSSTMIDAIAXXXXXXXXXXXXXXXXXXXXXXXX 1301 FAQK+Q PP S+T+ Sbjct: 243 PTQPPYLLRPFAQKLQLRSLHPFPPSFSNTL----------PPPSTNSPTNSTSINYHQQ 292 Query: 1302 XYQLPSELGLHRQSPNLLNTENPMLTFQSLLQSP----PNKYPFVAKSQAPPSID--SRL 1463 QL GL +Q N NT T ++ + V+++Q S++ L Sbjct: 293 QQQLSEHFGLAKQPFNFNNTTPDTSTLEAYHHPKYTLGNSSSVLVSRTQQQHSLEIPPNL 352 Query: 1464 KVRVLEEFGTSHGHVNANVG-----------------------GLPNMGNS--------- 1547 K+ + EE H HVN ++G L N NS Sbjct: 353 KMGLYEELELRHDHVNTDLGCLHQNMVSSTSVGVGALSSDNNNNLSNATNSSTEWAQRTG 412 Query: 1548 ---DGDRNHLR-----SFNGNY---GNSQRVSSCKMNFSASSSDFHADKGSE---NVSSR 1685 + D +H R S NY G V++ K+++SASSSDFH +KG + ++R Sbjct: 413 TITNNDCDHGRGGGALSGTVNYNDIGEGAVVTNGKVHYSASSSDFHGEKGPDFTVTTAAR 472 Query: 1686 GEGMVDSWI-CSSD 1724 +GMV+SWI CSSD Sbjct: 473 TQGMVESWINCSSD 486 >ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phaseolus vulgaris] gi|561033055|gb|ESW31634.1| hypothetical protein PHAVU_002G254700g [Phaseolus vulgaris] Length = 493 Score = 144 bits (362), Expect = 2e-31 Identities = 143/439 (32%), Positives = 190/439 (43%), Gaps = 111/439 (25%) Frame = +3 Query: 741 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCT---------------------- 854 +FDPL+ YL+ ++S+ T NLD++W + RSEPN T Sbjct: 67 MFDPLSGYLDPLTQSS-TSLLNLDVMWSKPGRSEPNQTTLANLIPCSSSSPSPHNQAFLS 125 Query: 855 ----------------QXXXXXXXXXXXXAQLDRIPFPNASTSADPN-----RNTKKRSR 971 + A D+I + + S + N RN KKRSR Sbjct: 126 SQTRGGNTGAFPTLLPESGSRGLMLSVSAANNDQIQTHSTTNSTNNNNSNVVRNPKKRSR 185 Query: 972 ASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNS--TSTLRSG---- 1133 ASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF S T TLRS Sbjct: 186 ASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFASAATPTLRSNLNVN 245 Query: 1134 ----HXXXXXXXXXXXFAQKVQ------QPPFVSSTMIDAIAXXXXXXXXXXXXXXXXXX 1283 FAQK+Q PP +S+T+ + Sbjct: 246 VNPLDPPTPPPYLLRPFAQKLQFRSLHPFPPSLSNTLSPS-------TNSTTNSTSINYH 298 Query: 1284 XXXXXXXYQLPSELGLHRQSPNLLNTENPMLTFQSLLQSP-PNKYPFVAKSQAPPSID-- 1454 L GL +Q N NT P L + P N V++ Q S D Sbjct: 299 QQQQQQQQNLSEHFGLMKQPHNFNNT--PSLEAYHHPKYPLGNSSVLVSRPQQQSSFDIP 356 Query: 1455 SRLKVRVLEEFG-TSHGHVNAN---------------VGGLPNMGNSDGDRNHLRSFN-- 1580 LK+ V EE G GHVN + VG L + N+ + N+L + N Sbjct: 357 PSLKMGVFEELGLRPDGHVNTDLRCLHQNMVSSTSVGVGALSSGNNN--NNNNLSNANPS 414 Query: 1581 -------GNYGN----------------------SQRVSSCKMNFSASSSDFHADKGSE- 1670 G N ++RVS+ K+++SASSSDFH +K + Sbjct: 415 TEWVQRTGTITNDDCDHGGGGGGGLSGTVSYSDIAERVSNGKVHYSASSSDFHGEKVPDF 474 Query: 1671 NVSSRGEGMVDSWI-CSSD 1724 +V++R +GMV+SWI CSSD Sbjct: 475 SVTARSQGMVESWINCSSD 493