BLASTX nr result
ID: Glycyrrhiza30_contig00007846
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza30_contig00007846 (2732 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value XP_004516825.1 PREDICTED: uncharacterized protein LOC101493421 [... 707 0.0 GAU24488.1 hypothetical protein TSUD_156000 [Trifolium subterran... 691 0.0 XP_019434392.1 PREDICTED: uncharacterized protein LOC109341052 [... 686 0.0 XP_003525566.1 PREDICTED: uncharacterized protein LOC100814892 i... 669 0.0 KHN04358.1 Putative ATP-dependent RNA helicase DDX59 [Glycine soja] 668 0.0 XP_015961901.1 PREDICTED: uncharacterized protein LOC107485872 [... 666 0.0 XP_006579628.1 PREDICTED: uncharacterized protein LOC100814892 i... 662 0.0 XP_016188375.1 PREDICTED: uncharacterized protein LOC107629935 [... 659 0.0 XP_006579629.1 PREDICTED: uncharacterized protein LOC100814892 i... 598 0.0 XP_014630978.1 PREDICTED: uncharacterized protein LOC100814892 i... 567 0.0 OIV89552.1 hypothetical protein TanjilG_19368 [Lupinus angustifo... 546 0.0 XP_008238551.1 PREDICTED: uncharacterized protein LOC103337177 [... 543 0.0 XP_007039867.2 PREDICTED: uncharacterized protein LOC18606276 [T... 543 0.0 XP_010662430.1 PREDICTED: uncharacterized protein LOC100255681 [... 542 0.0 EOY24364.1 P-loop containing nucleoside triphosphate hydrolases ... 540 0.0 ONI06631.1 hypothetical protein PRUPE_5G071400 [Prunus persica] 539 0.0 OMO59651.1 Zinc finger, CCHC-type [Corchorus capsularis] 534 e-179 XP_011464425.1 PREDICTED: uncharacterized protein LOC101306820 [... 531 e-178 XP_016670649.1 PREDICTED: uncharacterized protein LOC107890644 i... 528 e-177 XP_017609926.1 PREDICTED: uncharacterized protein LOC108455869 [... 526 e-176 >XP_004516825.1 PREDICTED: uncharacterized protein LOC101493421 [Cicer arietinum] Length = 444 Score = 707 bits (1826), Expect = 0.0 Identities = 347/446 (77%), Positives = 365/446 (81%), Gaps = 4/446 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPSI+YNKHFSLSSVLQNLQAYNI TGNV TA Sbjct: 1 MGTRSNFYKNPSITYNKHFSLSSVLQNLQAYNIVTGNVTSDDQSHPAPTASLKRRRHSQP 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 PSSMSHRDYIQKRRKEV SS+N +RVELTEDVLGNPNSA+ LV Sbjct: 61 QKSRHNHLKDDVEDV--PSSMSHRDYIQKRRKEVDSSKNSERVELTEDVLGNPNSAISLV 118 Query: 1584 DYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICNET 1763 DYASDE+ SSE EETHTLPNSGH+ +F+GIKSRNEQRFPVSGEPVCLICGRYGEYICNET Sbjct: 119 DYASDESASSECEETHTLPNSGHKIEFNGIKSRNEQRFPVSGEPVCLICGRYGEYICNET 178 Query: 1764 DDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDA---LAVPFFSDDTWDYNRHRW 1934 DDDVCSM GSSHDQ K+ SSSGISD+ L VP FSDDTWDYNRHRW Sbjct: 179 DDDVCSMECKNELLEILKLNEGSSHDQAKNFSSSGISDSLPLLPVPVFSDDTWDYNRHRW 238 Query: 1935 SKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITV-GSNRSSSIPKDLLGLYRRCHQIGK 2111 SKKRSSLSTYECWKCQRPGHLAEDCLV CS+ TV GSNRSSSIPKDLLGLYRRC + GK Sbjct: 239 SKKRSSLSTYECWKCQRPGHLAEDCLVKGCSETTVGGSNRSSSIPKDLLGLYRRCKEFGK 298 Query: 2112 DLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKS 2291 DLLA+NCNAC SSSNLATCIDCS+VLCDGAGHL+DHIRTHPSHQKYYSHKLKRLVKCCKS Sbjct: 299 DLLASNCNACRSSSNLATCIDCSVVLCDGAGHLDDHIRTHPSHQKYYSHKLKRLVKCCKS 358 Query: 2292 TCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNA 2471 TCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAG SII GSICCEDHFTWHRMNCLNA Sbjct: 359 TCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGFSIILGSICCEDHFTWHRMNCLNA 418 Query: 2472 GVEESASIVQRNGHKGKPKQLSDFIF 2549 E SA IV+ NGHKGK QLSDFIF Sbjct: 419 DAEGSAYIVKSNGHKGKRTQLSDFIF 444 >GAU24488.1 hypothetical protein TSUD_156000 [Trifolium subterraneum] Length = 444 Score = 691 bits (1783), Expect = 0.0 Identities = 342/446 (76%), Positives = 360/446 (80%), Gaps = 4/446 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+N YKNPSI+YNKHFSLSSVLQNL AYNIATGNV TA Sbjct: 1 MGTRSNLYKNPSITYNKHFSLSSVLQNLHAYNIATGNVTSDEQPPPAPTA--SRKRRRPL 58 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 PSSMSH DYI KRRKEV SS+N +R++LTEDVLGNPNSA+ LV Sbjct: 59 HPPQSRQNHHKNEVDDDPSSMSHYDYILKRRKEVDSSKNAERIQLTEDVLGNPNSAISLV 118 Query: 1584 DYASDENTSSE-REETHT--LPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYIC 1754 DYASDE+ SSE EETHT LPNSG +++F+G+KSRNEQRFPVSGEPVCLICGRYGEYIC Sbjct: 119 DYASDESASSECEEETHTDILPNSGPKEEFNGVKSRNEQRFPVSGEPVCLICGRYGEYIC 178 Query: 1755 NETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRW 1934 NET DDVCSM GSSHDQ KD SSGIS AL P FSDDTWD NRHRW Sbjct: 179 NETGDDVCSMECKNELLEIFKLNEGSSHDQAKDFPSSGISYALPAPVFSDDTWDCNRHRW 238 Query: 1935 SKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVG-SNRSSSIPKDLLGLYRRCHQIGK 2111 SKKRSSLSTYECWKCQRPGHLAEDCLV S ++ITVG SNRSSSIPKDLLGLYRRCHQ+GK Sbjct: 239 SKKRSSLSTYECWKCQRPGHLAEDCLVKSSNEITVGRSNRSSSIPKDLLGLYRRCHQLGK 298 Query: 2112 DLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKS 2291 DLLAANCN C SSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKS Sbjct: 299 DLLAANCNTCRSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKS 358 Query: 2292 TCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNA 2471 TCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAG SII GSICCEDHFTWHRMNCLNA Sbjct: 359 TCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGFSIICGSICCEDHFTWHRMNCLNA 418 Query: 2472 GVEESASIVQRNGHKGKPKQLSDFIF 2549 G EESA IV NGH+GK QLSDFIF Sbjct: 419 GAEESAYIVNSNGHEGKRTQLSDFIF 444 >XP_019434392.1 PREDICTED: uncharacterized protein LOC109341052 [Lupinus angustifolius] Length = 442 Score = 686 bits (1771), Expect = 0.0 Identities = 336/444 (75%), Positives = 356/444 (80%), Gaps = 2/444 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYN+ATGNV T Sbjct: 1 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNVATGNVLPDDQPQPDLTTASDNKPPSLK 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 P SMSHRDYI+KRRKEVASS++HDRVELTEDVLGNPNSAV LV Sbjct: 61 RRRPPQSHDGDDDGNGAPFSMSHRDYIEKRRKEVASSRSHDRVELTEDVLGNPNSAVALV 120 Query: 1584 DYAS--DENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICN 1757 DYAS DE+T SE +ET+ NSG +FDGIKSRNEQRFPVSGEPVCLICGRYGEYICN Sbjct: 121 DYASESDEDTPSECQETYN-QNSGLTNEFDGIKSRNEQRFPVSGEPVCLICGRYGEYICN 179 Query: 1758 ETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRWS 1937 ETDDDVCSM GS H+QV++ SS GISDA V F DDTWDYNRHRWS Sbjct: 180 ETDDDVCSMECKHELLEILKLNEGSIHNQVRNFSS-GISDASPVAVFGDDTWDYNRHRWS 238 Query: 1938 KKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKDL 2117 KK SSLSTYECWKC RPGH+AEDC+VNSCS+I V SNRSSSIPKDLLGLYRRCH+ GKDL Sbjct: 239 KKISSLSTYECWKCHRPGHIAEDCIVNSCSEIIVPSNRSSSIPKDLLGLYRRCHEFGKDL 298 Query: 2118 LAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKSTC 2297 LAANCN C SSSNLATC+DCSIV CD AGHLN HIR +PSHQKYYSHKLKRLVKCCKSTC Sbjct: 299 LAANCNTCRSSSNLATCLDCSIVFCDSAGHLNGHIRAYPSHQKYYSHKLKRLVKCCKSTC 358 Query: 2298 KVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGV 2477 KVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGV Sbjct: 359 KVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGV 418 Query: 2478 EESASIVQRNGHKGKPKQLSDFIF 2549 EESA IV+ +GHK K QLSDFIF Sbjct: 419 EESAYIVKNSGHKSKLTQLSDFIF 442 >XP_003525566.1 PREDICTED: uncharacterized protein LOC100814892 isoform X2 [Glycine max] KRH57389.1 hypothetical protein GLYMA_05G058500 [Glycine max] Length = 446 Score = 669 bits (1726), Expect = 0.0 Identities = 331/452 (73%), Positives = 353/452 (78%), Gaps = 10/452 (2%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPSISY K S+SSVLQNL AYNIA GNV TA Sbjct: 1 MGTRTNFYKNPSISYKKRLSISSVLQNLHAYNIAAGNVPPADPPQATSTANVPPGNPPHL 60 Query: 1404 XXXXXXXXXXXXXXXX--------GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGN 1559 PSSMSH DYIQ RRKEV SS+NHDRVELTE+VLGN Sbjct: 61 TPSARLKRSRNPEPPEFLREYRDDAPSSMSHHDYIQNRRKEVVSSRNHDRVELTEEVLGN 120 Query: 1560 PNSAVPLVDY-ASDENTSSEREETHTLPNSGHEKQFDGIK-SRNEQRFPVSGEPVCLICG 1733 NS +PLVDY ASDE+T SE EETHTL NSG +++FDG+K SRNEQRFPVSGEPVCLICG Sbjct: 121 SNSTLPLVDYDASDEDTPSECEETHTLLNSGQQEEFDGVKKSRNEQRFPVSGEPVCLICG 180 Query: 1734 RYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTW 1913 RYGEYICNETDDDVCSM GSSH+QV+DISSSGIS A+ VP F DDTW Sbjct: 181 RYGEYICNETDDDVCSMECKSELLEILKLNEGSSHNQVRDISSSGISAAVPVPVFGDDTW 240 Query: 1914 DYNRHRWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRR 2093 DYN+H WSKK SLSTYECWKCQRPGHLAEDC+V T GSNRSSSIPKDLL LYRR Sbjct: 241 DYNQHHWSKKTCSLSTYECWKCQRPGHLAEDCMV------TDGSNRSSSIPKDLLQLYRR 294 Query: 2094 CHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRL 2273 CHQIGKDLLAANCN C SSNLATC+DCSIVLCDGAGHL +HIRTHPSHQKYYSHKLKRL Sbjct: 295 CHQIGKDLLAANCNVCRRSSNLATCLDCSIVLCDGAGHLIEHIRTHPSHQKYYSHKLKRL 354 Query: 2274 VKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHR 2453 VKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAGL+I+WGSICCEDHFTWHR Sbjct: 355 VKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGLAIMWGSICCEDHFTWHR 414 Query: 2454 MNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 MNCLNA VEESA I++ +GHKGK QLSDFIF Sbjct: 415 MNCLNANVEESAYILKPDGHKGKRTQLSDFIF 446 >KHN04358.1 Putative ATP-dependent RNA helicase DDX59 [Glycine soja] Length = 446 Score = 668 bits (1724), Expect = 0.0 Identities = 330/452 (73%), Positives = 353/452 (78%), Gaps = 10/452 (2%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPSISY K S+SSVLQNL AYNIA GNV TA Sbjct: 1 MGTRTNFYKNPSISYKKRLSISSVLQNLHAYNIAAGNVPPADPPQATSTANVPPSNPPHL 60 Query: 1404 XXXXXXXXXXXXXXXX--------GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGN 1559 PSSMSH DYIQ RRKEV SS+NHDRVELTE+VLGN Sbjct: 61 TPSARLKRSRNPEPPEFLREYRDDAPSSMSHHDYIQNRRKEVVSSRNHDRVELTEEVLGN 120 Query: 1560 PNSAVPLVDY-ASDENTSSEREETHTLPNSGHEKQFDGIK-SRNEQRFPVSGEPVCLICG 1733 NS +PLVDY ASDE+T SE EETHTL NSG +++FDG+K SRN+QRFPVSGEPVCLICG Sbjct: 121 SNSTLPLVDYDASDEDTPSECEETHTLLNSGQQEEFDGVKKSRNQQRFPVSGEPVCLICG 180 Query: 1734 RYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTW 1913 RYGEYICNETDDDVCSM GSSH+QV+DISSSGIS A+ VP F DDTW Sbjct: 181 RYGEYICNETDDDVCSMECKSELLEILKLNEGSSHNQVRDISSSGISAAVPVPVFGDDTW 240 Query: 1914 DYNRHRWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRR 2093 DYN+H WSKK SLSTYECWKCQRPGHLAEDC+V T GSNRSSSIPKDLL LYRR Sbjct: 241 DYNQHHWSKKTCSLSTYECWKCQRPGHLAEDCMV------TEGSNRSSSIPKDLLQLYRR 294 Query: 2094 CHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRL 2273 CHQIGKDLLAANCN C SSNLATC+DCSIVLCDGAGHL +HIRTHPSHQKYYSHKLKRL Sbjct: 295 CHQIGKDLLAANCNVCRRSSNLATCLDCSIVLCDGAGHLIEHIRTHPSHQKYYSHKLKRL 354 Query: 2274 VKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHR 2453 VKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAGL+I+WGSICCEDHFTWHR Sbjct: 355 VKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGLAIMWGSICCEDHFTWHR 414 Query: 2454 MNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 MNCLNA VEESA I++ +GHKGK QLSDFIF Sbjct: 415 MNCLNANVEESAYILKPDGHKGKRTQLSDFIF 446 >XP_015961901.1 PREDICTED: uncharacterized protein LOC107485872 [Arachis duranensis] Length = 446 Score = 666 bits (1719), Expect = 0.0 Identities = 327/447 (73%), Positives = 354/447 (79%), Gaps = 5/447 (1%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPSISYNKHFSLSSVLQNLQAYN ATGN +A Sbjct: 1 MGTRSNFYKNPSISYNKHFSLSSVLQNLQAYNTATGNAPPSDEPPSI-SASSSVENTRSL 59 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVL----GNPNSA 1571 PSSMSHRDY+QKRR EV SS+N DRVELTEDVL GNPNSA Sbjct: 60 KRHRDPKSPKSQPDCDVPSSMSHRDYVQKRRDEVDSSRNSDRVELTEDVLLHCRGNPNSA 119 Query: 1572 VPLVDYASDENTSSEREETHTLPNS-GHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEY 1748 V LVDYASDE + S+R+E HT P++ EK+FDGIKSR+EQRFPVSGEPVCLICGR GEY Sbjct: 120 VALVDYASDEGSPSDRQEAHTPPDTTDQEKEFDGIKSRSEQRFPVSGEPVCLICGRCGEY 179 Query: 1749 ICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRH 1928 ICNET+DDVCSM GSSH QV+ SSSGIS+ L P FSDD WDY+RH Sbjct: 180 ICNETNDDVCSMECKKELLETLKLNEGSSHIQVRHFSSSGISNVLLEPVFSDDNWDYDRH 239 Query: 1929 RWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIG 2108 RW+KKRSSLSTYECWKC RPGHLAEDCLV SCSQIT SNRSSSIPKDLLGLYRRCHQIG Sbjct: 240 RWTKKRSSLSTYECWKCHRPGHLAEDCLVKSCSQITPSSNRSSSIPKDLLGLYRRCHQIG 299 Query: 2109 KDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCK 2288 KDLLAANCN C +SSNLATC+DCS+VLCD AGHL++HI+THPSHQK YSHKLKRLVKCCK Sbjct: 300 KDLLAANCNVCRNSSNLATCLDCSVVLCDRAGHLDEHIKTHPSHQKCYSHKLKRLVKCCK 359 Query: 2289 STCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLN 2468 STC VTDIKDLLICHYCFDKAFEKFYDMYTATWK AGLSIIWGSICCEDHFTWHRMNCLN Sbjct: 360 STCNVTDIKDLLICHYCFDKAFEKFYDMYTATWKEAGLSIIWGSICCEDHFTWHRMNCLN 419 Query: 2469 AGVEESASIVQRNGHKGKPKQLSDFIF 2549 A VE +A I + +G +GKP+QLSDFIF Sbjct: 420 ADVEVNAYIFKNDGRRGKPRQLSDFIF 446 >XP_006579628.1 PREDICTED: uncharacterized protein LOC100814892 isoform X1 [Glycine max] Length = 454 Score = 662 bits (1707), Expect = 0.0 Identities = 331/460 (71%), Positives = 353/460 (76%), Gaps = 18/460 (3%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPSISY K S+SSVLQNL AYNIA GNV TA Sbjct: 1 MGTRTNFYKNPSISYKKRLSISSVLQNLHAYNIAAGNVPPADPPQATSTANVPPGNPPHL 60 Query: 1404 XXXXXXXXXXXXXXXX--------GPSSMSHRDYIQKR--------RKEVASSQNHDRVE 1535 PSSMSH DYIQ R RKEV SS+NHDRVE Sbjct: 61 TPSARLKRSRNPEPPEFLREYRDDAPSSMSHHDYIQNRSRDILFLTRKEVVSSRNHDRVE 120 Query: 1536 LTEDVLGNPNSAVPLVDY-ASDENTSSEREETHTLPNSGHEKQFDGIK-SRNEQRFPVSG 1709 LTE+VLGN NS +PLVDY ASDE+T SE EETHTL NSG +++FDG+K SRNEQRFPVSG Sbjct: 121 LTEEVLGNSNSTLPLVDYDASDEDTPSECEETHTLLNSGQQEEFDGVKKSRNEQRFPVSG 180 Query: 1710 EPVCLICGRYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAV 1889 EPVCLICGRYGEYICNETDDDVCSM GSSH+QV+DISSSGIS A+ V Sbjct: 181 EPVCLICGRYGEYICNETDDDVCSMECKSELLEILKLNEGSSHNQVRDISSSGISAAVPV 240 Query: 1890 PFFSDDTWDYNRHRWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPK 2069 P F DDTWDYN+H WSKK SLSTYECWKCQRPGHLAEDC+V T GSNRSSSIPK Sbjct: 241 PVFGDDTWDYNQHHWSKKTCSLSTYECWKCQRPGHLAEDCMV------TDGSNRSSSIPK 294 Query: 2070 DLLGLYRRCHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKY 2249 DLL LYRRCHQIGKDLLAANCN C SSNLATC+DCSIVLCDGAGHL +HIRTHPSHQKY Sbjct: 295 DLLQLYRRCHQIGKDLLAANCNVCRRSSNLATCLDCSIVLCDGAGHLIEHIRTHPSHQKY 354 Query: 2250 YSHKLKRLVKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICC 2429 YSHKLKRLVKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAGL+I+WGSICC Sbjct: 355 YSHKLKRLVKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGLAIMWGSICC 414 Query: 2430 EDHFTWHRMNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 EDHFTWHRMNCLNA VEESA I++ +GHKGK QLSDFIF Sbjct: 415 EDHFTWHRMNCLNANVEESAYILKPDGHKGKRTQLSDFIF 454 >XP_016188375.1 PREDICTED: uncharacterized protein LOC107629935 [Arachis ipaensis] Length = 446 Score = 659 bits (1700), Expect = 0.0 Identities = 323/447 (72%), Positives = 352/447 (78%), Gaps = 5/447 (1%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPSISYNKHFSLSSVLQNLQAYN ATGN +A Sbjct: 1 MGTRSNFYKNPSISYNKHFSLSSVLQNLQAYNTATGNAPPSDELPSI-SATSSVENTRSL 59 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVL----GNPNSA 1571 PSSMSHRDY+QKRR EV SS+N DRVELTEDVL GNPNSA Sbjct: 60 KRHRDPKPPKSQPDCDVPSSMSHRDYVQKRRDEVDSSRNCDRVELTEDVLLHCRGNPNSA 119 Query: 1572 VPLVDYASDENTSSEREETHTLPNS-GHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEY 1748 V LVDYASDE + S+R+E HT P++ EK+F GIKSR+EQRFPVSGEP+CLICGRYGEY Sbjct: 120 VALVDYASDEGSPSDRQEAHTPPDTTDQEKEFHGIKSRSEQRFPVSGEPICLICGRYGEY 179 Query: 1749 ICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRH 1928 ICNET+DDVCSM GSSH QV+ SSSGIS+ P FSDD WDY+RH Sbjct: 180 ICNETNDDVCSMECKKELLETLKLNEGSSHIQVRHFSSSGISNVSLEPVFSDDNWDYDRH 239 Query: 1929 RWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIG 2108 RW+KKRSSLSTYECWKC RPGHLAEDCLV SCSQIT SNRSSSIPKDLLGLYRRCHQIG Sbjct: 240 RWTKKRSSLSTYECWKCHRPGHLAEDCLVKSCSQITPSSNRSSSIPKDLLGLYRRCHQIG 299 Query: 2109 KDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCK 2288 +DLLAANCN C +SSNLATC+DCS+VLCD AGHL++HI+THPSHQK YSHKLKRLVKCCK Sbjct: 300 RDLLAANCNVCRNSSNLATCLDCSVVLCDRAGHLDEHIKTHPSHQKCYSHKLKRLVKCCK 359 Query: 2289 STCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLN 2468 STC VTDIKDLLICHYCFDKAFEKFYDMYTA WK AGLSIIWGSICCEDHFTWHRMNCLN Sbjct: 360 STCNVTDIKDLLICHYCFDKAFEKFYDMYTAMWKEAGLSIIWGSICCEDHFTWHRMNCLN 419 Query: 2469 AGVEESASIVQRNGHKGKPKQLSDFIF 2549 A VE +A I + +G +GKP+QLSDFIF Sbjct: 420 ADVEVNAYIFKNDGLRGKPRQLSDFIF 446 >XP_006579629.1 PREDICTED: uncharacterized protein LOC100814892 isoform X4 [Glycine max] Length = 416 Score = 598 bits (1543), Expect = 0.0 Identities = 285/358 (79%), Positives = 308/358 (86%), Gaps = 2/358 (0%) Frame = +3 Query: 1482 IQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLVDY-ASDENTSSEREETHTLPNSGHEK 1658 I + +EV SS+NHDRVELTE+VLGN NS +PLVDY ASDE+T SE EETHTL NSG ++ Sbjct: 65 ITSKTEEVVSSRNHDRVELTEEVLGNSNSTLPLVDYDASDEDTPSECEETHTLLNSGQQE 124 Query: 1659 QFDGIK-SRNEQRFPVSGEPVCLICGRYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSS 1835 +FDG+K SRNEQRFPVSGEPVCLICGRYGEYICNETDDDVCSM GSS Sbjct: 125 EFDGVKKSRNEQRFPVSGEPVCLICGRYGEYICNETDDDVCSMECKSELLEILKLNEGSS 184 Query: 1836 HDQVKDISSSGISDALAVPFFSDDTWDYNRHRWSKKRSSLSTYECWKCQRPGHLAEDCLV 2015 H+QV+DISSSGIS A+ VP F DDTWDYN+H WSKK SLSTYECWKCQRPGHLAEDC+V Sbjct: 185 HNQVRDISSSGISAAVPVPVFGDDTWDYNQHHWSKKTCSLSTYECWKCQRPGHLAEDCMV 244 Query: 2016 NSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKDLLAANCNACHSSSNLATCIDCSIVLCD 2195 T GSNRSSSIPKDLL LYRRCHQIGKDLLAANCN C SSNLATC+DCSIVLCD Sbjct: 245 ------TDGSNRSSSIPKDLLQLYRRCHQIGKDLLAANCNVCRRSSNLATCLDCSIVLCD 298 Query: 2196 GAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMY 2375 GAGHL +HIRTHPSHQKYYSHKLKRLVKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMY Sbjct: 299 GAGHLIEHIRTHPSHQKYYSHKLKRLVKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMY 358 Query: 2376 TATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 TATWKGAGL+I+WGSICCEDHFTWHRMNCLNA VEESA I++ +GHKGK QLSDFIF Sbjct: 359 TATWKGAGLAIMWGSICCEDHFTWHRMNCLNANVEESAYILKPDGHKGKRTQLSDFIF 416 >XP_014630978.1 PREDICTED: uncharacterized protein LOC100814892 isoform X3 [Glycine max] Length = 427 Score = 567 bits (1462), Expect = 0.0 Identities = 269/334 (80%), Positives = 288/334 (86%), Gaps = 2/334 (0%) Frame = +3 Query: 1554 GNPNSAVPLVDY-ASDENTSSEREETHTLPNSGHEKQFDGIK-SRNEQRFPVSGEPVCLI 1727 GN NS +PLVDY ASDE+T SE EETHTL NSG +++FDG+K SRNEQRFPVSGEPVCLI Sbjct: 100 GNSNSTLPLVDYDASDEDTPSECEETHTLLNSGQQEEFDGVKKSRNEQRFPVSGEPVCLI 159 Query: 1728 CGRYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDD 1907 CGRYGEYICNETDDDVCSM GSSH+QV+DISSSGIS A+ VP F DD Sbjct: 160 CGRYGEYICNETDDDVCSMECKSELLEILKLNEGSSHNQVRDISSSGISAAVPVPVFGDD 219 Query: 1908 TWDYNRHRWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLY 2087 TWDYN+H WSKK SLSTYECWKCQRPGHLAEDC+V T GSNRSSSIPKDLL LY Sbjct: 220 TWDYNQHHWSKKTCSLSTYECWKCQRPGHLAEDCMV------TDGSNRSSSIPKDLLQLY 273 Query: 2088 RRCHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLK 2267 RRCHQIGKDLLAANCN C SSNLATC+DCSIVLCDGAGHL +HIRTHPSHQKYYSHKLK Sbjct: 274 RRCHQIGKDLLAANCNVCRRSSNLATCLDCSIVLCDGAGHLIEHIRTHPSHQKYYSHKLK 333 Query: 2268 RLVKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTW 2447 RLVKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMYTATWKGAGL+I+WGSICCEDHFTW Sbjct: 334 RLVKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMYTATWKGAGLAIMWGSICCEDHFTW 393 Query: 2448 HRMNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 HRMNCLNA VEESA I++ +GHKGK QLSDFIF Sbjct: 394 HRMNCLNANVEESAYILKPDGHKGKRTQLSDFIF 427 >OIV89552.1 hypothetical protein TanjilG_19368 [Lupinus angustifolius] Length = 400 Score = 546 bits (1407), Expect = 0.0 Identities = 279/417 (66%), Positives = 296/417 (70%), Gaps = 29/417 (6%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYN+ATGNV T Sbjct: 1 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNVATGNVLPDDQPQPDLTTASDNKPPSLK 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRR------------KEVASSQNHDRVELTED 1547 P SMSHRDYI+KRR KEVASS++HDRVELTED Sbjct: 61 RRRPPQSHDGDDDGNGAPFSMSHRDYIEKRRQINFLSTYFLNRKEVASSRSHDRVELTED 120 Query: 1548 VLGNPNSAVPLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLI 1727 VLGNPNSAV LVDYAS+ T+ +FDGIKSRNEQRFPVSGEPVCLI Sbjct: 121 VLGNPNSAVALVDYASESLTN----------------EFDGIKSRNEQRFPVSGEPVCLI 164 Query: 1728 CGRYGEYICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDD 1907 CGRYGEYICNETDDDVCSM GS H+QV++ SS GISDA V F DD Sbjct: 165 CGRYGEYICNETDDDVCSMECKHELLEILKLNEGSIHNQVRNFSS-GISDASPVAVFGDD 223 Query: 1908 TWDYNRHRWSKKRSSLSTYE-----------------CWKCQRPGHLAEDCLVNSCSQIT 2036 TWDYNRHRWSKK SSLSTYE WKC RPGH+AEDC+VNSCS+I Sbjct: 224 TWDYNRHRWSKKISSLSTYEWMAFKLTPESCGGMGINSWKCHRPGHIAEDCIVNSCSEII 283 Query: 2037 VGSNRSSSIPKDLLGLYRRCHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLND 2216 V SNRSSSIPKDLLGLYRRCH+ GKDLLAANCN C SSSNLATC+DCSIV CD AGHLN Sbjct: 284 VPSNRSSSIPKDLLGLYRRCHEFGKDLLAANCNTCRSSSNLATCLDCSIVFCDSAGHLNG 343 Query: 2217 HIRTHPSHQKYYSHKLKRLVKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATW 2387 HIR +PSHQKYYSHKLKRLVKCCKSTCKVTDIKDLL+CHYCFDKAFEKFYDMYTATW Sbjct: 344 HIRAYPSHQKYYSHKLKRLVKCCKSTCKVTDIKDLLVCHYCFDKAFEKFYDMYTATW 400 >XP_008238551.1 PREDICTED: uncharacterized protein LOC103337177 [Prunus mume] Length = 442 Score = 543 bits (1400), Expect = 0.0 Identities = 273/447 (61%), Positives = 320/447 (71%), Gaps = 5/447 (1%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGN---VXXXXXXXXXXTAXXXXXXX 1394 MGTRTNFYKNPSI+Y K SLSSVLQNL+AYNIATGN + TA Sbjct: 1 MGTRTNFYKNPSIAYKKDLSLSSVLQNLKAYNIATGNASPIEERQPAADGKTACRKRQRN 60 Query: 1395 XXXXXXXXXXXXXXXXXXX-GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPN-S 1568 GP MSH+DYI KRRKEV++SQ ++ ELT DVLG P S Sbjct: 61 PELPPPPRRQTQNREIEGNDGP--MSHQDYIDKRRKEVSASQAYE--ELTADVLGKPGTS 116 Query: 1569 AVPLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEY 1748 + LV Y SD++TS E E P+SGH + D +KSR+EQRFP GEPVC+ICG+YGEY Sbjct: 117 CLKLVQYDSDDSTS-ECELKQDSPSSGHINESDRVKSRSEQRFPHPGEPVCVICGKYGEY 175 Query: 1749 ICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRH 1928 IC++T+DD+CSM S +Q +D+SS G L + F +DTWDY RH Sbjct: 176 ICDKTNDDICSMECKADLLEALKVVKEPSSNQRQDVSSYGSKFTLPMHDFGEDTWDYERH 235 Query: 1929 RWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIG 2108 RWSKK SSLSTYECWKC+RPGHLAEDCLV + +Q+T+G + +SIP DLL LYRRCHQIG Sbjct: 236 RWSKKISSLSTYECWKCRRPGHLAEDCLVMTSNQVTLGQGKPNSIPADLLALYRRCHQIG 295 Query: 2109 KDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCK 2288 K++ AA CN C+SS NLATC+ CSI LCD AGHLN+HI+ +PSH++YYSHKL RLVKCCK Sbjct: 296 KNMSAAKCNECYSSLNLATCLHCSIPLCDNAGHLNEHIQANPSHRQYYSHKLSRLVKCCK 355 Query: 2289 STCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLN 2468 STCKVTDIKDLL CHYCFDKAF+KFYDMYTATWKG GLSII GSICCEDHF WHRMNC+N Sbjct: 356 STCKVTDIKDLLTCHYCFDKAFDKFYDMYTATWKGTGLSIISGSICCEDHFAWHRMNCMN 415 Query: 2469 AGVEESASIVQRNGHKGKPKQLSDFIF 2549 A VEESA I+ ++ K K QLSDFIF Sbjct: 416 ANVEESAYIISKSSQKDKRVQLSDFIF 442 >XP_007039867.2 PREDICTED: uncharacterized protein LOC18606276 [Theobroma cacao] Length = 441 Score = 543 bits (1398), Expect = 0.0 Identities = 263/445 (59%), Positives = 318/445 (71%), Gaps = 3/445 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPS+SY K SLSS LQNL+AYNIATG+ Sbjct: 1 MGTRSNFYKNPSLSYKKDLSLSSALQNLKAYNIATGDAPPSVELEAYPPVDDKIACKKRS 60 Query: 1404 XXXXXXXXXXXXXXXX---GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAV 1574 GP MSH+DYI KRR+EV SS ++ EL+ D+L +S+V Sbjct: 61 RERKPFSMPDRRREIEENDGP--MSHQDYILKRRREVISSHGYE--ELSVDILQASSSSV 116 Query: 1575 PLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYIC 1754 LVDY SD N SSE +E+ P+SGH + D +KSR+EQRFP+ GEP+C++CGRYGEYIC Sbjct: 117 NLVDYGSDGNASSECKESQDPPDSGHVNEVDQVKSRSEQRFPLPGEPICVVCGRYGEYIC 176 Query: 1755 NETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRW 1934 ++TDDD+CSM S +Q +SSS + +P ++DTWDYN HRW Sbjct: 177 DKTDDDICSMECKSDLLQSLQITEKSLSNQNSLLSSSEPTSISLLPELAEDTWDYNNHRW 236 Query: 1935 SKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKD 2114 SKK SSL TY+CWKCQRPGHLAEDCLV + Q+T+ ++ +SI +DLL LYRRCHQIGK+ Sbjct: 237 SKKSSSLCTYKCWKCQRPGHLAEDCLVTTTEQVTMRQSKLTSISRDLLELYRRCHQIGKN 296 Query: 2115 LLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKST 2294 L +A+CNAC SS LATC+DCS VLCD AGHLN+HI+THPSHQ+YYSHKLKRLVKCCKST Sbjct: 297 LSSASCNACRSSIALATCLDCSTVLCDNAGHLNEHIQTHPSHQQYYSHKLKRLVKCCKST 356 Query: 2295 CKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAG 2474 CKVT+ +DLL+CHYCFDKAF+KFYDMYTATWKGAGLSIIWGSICC+DHFTWHRMNCLNA Sbjct: 357 CKVTNFRDLLVCHYCFDKAFDKFYDMYTATWKGAGLSIIWGSICCDDHFTWHRMNCLNAD 416 Query: 2475 VEESASIVQRNGHKGKPKQLSDFIF 2549 VE+ A I+ R+ + QLSDFIF Sbjct: 417 VEDRAYIISRDTERETHVQLSDFIF 441 >XP_010662430.1 PREDICTED: uncharacterized protein LOC100255681 [Vitis vinifera] CBI31917.3 unnamed protein product, partial [Vitis vinifera] Length = 448 Score = 542 bits (1396), Expect = 0.0 Identities = 268/454 (59%), Positives = 316/454 (69%), Gaps = 12/454 (2%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTRTNFYKNPS SYN+ FSLSSVLQNL+AYNIATG+ Sbjct: 1 MGTRTNFYKNPSFSYNRDFSLSSVLQNLKAYNIATGSASPTDESPPANEKKVNRKRRPDR 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 GP MSH+D+I+KRRKEV+S Q + ELT D+LG NS + LV Sbjct: 61 RSPPCQNPELKETD--GP--MSHQDFIKKRRKEVSSGQVYQ--ELTPDILGTSNSGLHLV 114 Query: 1584 DYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICNET 1763 +Y SD++TSSE PN GH + + +KSR EQRFP+ GEPVC++CG YGEYICNET Sbjct: 115 EYESDKSTSSESGAEQDPPNPGHINEVEQVKSRREQRFPLPGEPVCVVCGLYGEYICNET 174 Query: 1764 DDDVCSMXXXXXXXXXXXXXXGS-SHDQVKDISSSGISDALAVPFFSDDTWDYNRHRWSK 1940 DDDVCSM S S++ +SSSG+ AL VP +DTWDY HRWSK Sbjct: 175 DDDVCSMDCKAELLKNLRLSEESLSNEGCPTVSSSGLKCALPVPELGEDTWDYVHHRWSK 234 Query: 1941 KRSSLSTYECWKCQRPGHLAEDCLV-----------NSCSQITVGSNRSSSIPKDLLGLY 2087 KRSSL TYECWKCQRPGHLA+DCLV +C+++ +G N+S+ I +DLLGLY Sbjct: 235 KRSSLCTYECWKCQRPGHLADDCLVMTSNSQSPCLSQTCNKVPMGQNKSTFISRDLLGLY 294 Query: 2088 RRCHQIGKDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLK 2267 +RCHQIGK+L A CN C SSS LATC+DCS V+CD AGHL +HI HPSHQK +S+KLK Sbjct: 295 KRCHQIGKNLTTAKCNLCCSSSTLATCLDCSTVICDNAGHLKEHIIAHPSHQKIFSYKLK 354 Query: 2268 RLVKCCKSTCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTW 2447 RLVKCCKSTC+VTD+KDLL+CHYC DKAF+KFYDMYTATWKG GLSIIWGSICCE+HF W Sbjct: 355 RLVKCCKSTCEVTDLKDLLVCHYCLDKAFDKFYDMYTATWKGNGLSIIWGSICCEEHFAW 414 Query: 2448 HRMNCLNAGVEESASIVQRNGHKGKPKQLSDFIF 2549 HRMNCLNA VE+SA I +R+ K QLSDFIF Sbjct: 415 HRMNCLNADVEDSAYIFRRHAQKNNSIQLSDFIF 448 >EOY24364.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] EOY24365.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] EOY24366.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] EOY24368.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] Length = 441 Score = 540 bits (1390), Expect = 0.0 Identities = 262/445 (58%), Positives = 318/445 (71%), Gaps = 3/445 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPS+SY K SLSS LQNL+AYNIATG+ Sbjct: 1 MGTRSNFYKNPSLSYKKDLSLSSALQNLKAYNIATGDAPPSVELEAYPPVDDKIACKKRS 60 Query: 1404 XXXXXXXXXXXXXXXX---GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAV 1574 GP MSH+DYI KRR+EV+SS ++ EL+ D+L +S+V Sbjct: 61 RERKPFSMPDRRREIEENDGP--MSHQDYILKRRREVSSSHGYE--ELSVDILQASSSSV 116 Query: 1575 PLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYIC 1754 LVDY SD N SSE +E+ P+SGH + D +KSR+EQRF + GEP+C++CGRYGEYIC Sbjct: 117 NLVDYGSDGNASSECKESQDPPDSGHVNEVDQVKSRSEQRFSLPGEPICVVCGRYGEYIC 176 Query: 1755 NETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRW 1934 ++TDDD+CSM S +Q +SSS + +P ++DTWDYN HRW Sbjct: 177 DKTDDDICSMECKSDLLQSLQITEKSLSNQNSLLSSSEPTSISLLPELAEDTWDYNNHRW 236 Query: 1935 SKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKD 2114 SKK SSL TY+CWKCQRPGHLAEDCLV + Q+T+ ++ +SI +DLL LYRRCHQIGK+ Sbjct: 237 SKKSSSLCTYKCWKCQRPGHLAEDCLVTTTEQVTMRQSKLTSISRDLLELYRRCHQIGKN 296 Query: 2115 LLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKST 2294 L +A+CNAC SS LATC+DCS VLCD AGHLN+HI+THPSHQ+YYSHKLKRLVKCCKST Sbjct: 297 LSSASCNACRSSIALATCLDCSTVLCDNAGHLNEHIQTHPSHQQYYSHKLKRLVKCCKST 356 Query: 2295 CKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAG 2474 CKVT+ +DLL+CHYCFDKAF+KFYDMYTATWKGAGLSIIWGSICC+DHFTWHRMNCLNA Sbjct: 357 CKVTNFRDLLVCHYCFDKAFDKFYDMYTATWKGAGLSIIWGSICCDDHFTWHRMNCLNAD 416 Query: 2475 VEESASIVQRNGHKGKPKQLSDFIF 2549 VE+ A I+ R+ + QLSDFIF Sbjct: 417 VEDRAYIMSRDTERETHVQLSDFIF 441 >ONI06631.1 hypothetical protein PRUPE_5G071400 [Prunus persica] Length = 442 Score = 539 bits (1389), Expect = 0.0 Identities = 271/447 (60%), Positives = 319/447 (71%), Gaps = 5/447 (1%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGN---VXXXXXXXXXXTAXXXXXXX 1394 MGTRTNFYKNPSI+Y K SLSSVLQNL+AYNIATGN + TA Sbjct: 1 MGTRTNFYKNPSIAYKKDLSLSSVLQNLKAYNIATGNTPPIEEHPPAADGKTACRKRQRD 60 Query: 1395 XXXXXXXXXXXXXXXXXXX-GPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPN-S 1568 GP MSH+DYI KRRKEV++SQ ++ ELT DVLG P S Sbjct: 61 PELPPPPRRQTQSREIEENDGP--MSHQDYIDKRRKEVSASQAYE--ELTADVLGKPGTS 116 Query: 1569 AVPLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEY 1748 ++ LV Y SDE+TS E E P+SGH + D +KSR+EQ FP GEPVC+ICG+YGEY Sbjct: 117 SLKLVQYDSDESTS-ECELKQDSPSSGHIHESDRVKSRSEQHFPHPGEPVCVICGKYGEY 175 Query: 1749 ICNETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRH 1928 IC+ET+DD+CSM S +Q +D+SSSG +L +P F +DTWDY RH Sbjct: 176 ICDETNDDICSMECKADLLEALKVVKEPSSNQRQDVSSSGPKFSLPMPDFGEDTWDYERH 235 Query: 1929 RWSKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIG 2108 RWSKK SSLSTYECWKC+RPGHLAEDCLV + +Q+T+ + +SIP DLL LYRRCHQIG Sbjct: 236 RWSKKISSLSTYECWKCRRPGHLAEDCLVMTSNQVTLVQGKPNSIPADLLALYRRCHQIG 295 Query: 2109 KDLLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCK 2288 K++ AA CN C+SS NLATC+ CSI LCD AGHLN+HI+ +PSH++YYSHKL RLVKCCK Sbjct: 296 KNMSAAKCNECYSSLNLATCLHCSIPLCDNAGHLNEHIQANPSHRQYYSHKLSRLVKCCK 355 Query: 2289 STCKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLN 2468 STC VTD+KDLL C YCFDKAF+KFYDMYTATWKG GLSII GSICCEDHF WHRMNC+N Sbjct: 356 STCNVTDLKDLLTCQYCFDKAFDKFYDMYTATWKGTGLSIISGSICCEDHFAWHRMNCMN 415 Query: 2469 AGVEESASIVQRNGHKGKPKQLSDFIF 2549 A EESA I+ ++ K K QLSDFIF Sbjct: 416 ANAEESAYIISKSSQKDKRVQLSDFIF 442 >OMO59651.1 Zinc finger, CCHC-type [Corchorus capsularis] Length = 435 Score = 534 bits (1376), Expect = e-179 Identities = 261/443 (58%), Positives = 310/443 (69%), Gaps = 1/443 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPSISY K SLSS LQNLQAYNIATGN Sbjct: 1 MGTRSNFYKNPSISYKKDLSLSSALQNLQAYNIATGNAPSPAELEAQPPLDNKSACKKRS 60 Query: 1404 XXXXXXXXXXXXXXXXGPSS-MSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPL 1580 MSH+DYI KRR+E+ S ++ EL+ D+L +S V L Sbjct: 61 RERKPLPPPDIGREIEEKDGHMSHQDYILKRRREICPSPGYE--ELSTDILRASSSNVNL 118 Query: 1581 VDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICNE 1760 VDY SDE++SSE +E+ P+SGH Q D +KSR EQRFP+ GEPVC++CGRYGEYIC++ Sbjct: 119 VDYESDESSSSECKESQNPPDSGHLNQVDQVKSRIEQRFPLPGEPVCVVCGRYGEYICDK 178 Query: 1761 TDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRWSK 1940 TDDD+CSM SS +Q + SS ++ P +DTWDYN HRWSK Sbjct: 179 TDDDICSMECKSALLQSLQITEKSSSNQNPSLLSSELAYISPPPELGEDTWDYNNHRWSK 238 Query: 1941 KRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKDLL 2120 K S+L TY+CWKCQRPGHLAEDCLV VG ++ +SI +DLL LYRRCHQ+GK+L Sbjct: 239 KISNLCTYKCWKCQRPGHLAEDCLV------AVGHSKVTSISRDLLELYRRCHQMGKNLS 292 Query: 2121 AANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKSTCK 2300 +A+CN C SS LATC+DCS V CD AGHLN+H+R+HPSHQ+YYSHKLKRLVKCCKSTCK Sbjct: 293 SASCNICRSSIGLATCLDCSTVFCDNAGHLNEHLRSHPSHQQYYSHKLKRLVKCCKSTCK 352 Query: 2301 VTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGVE 2480 VT+IKDL++CHYCFDKAF+KFYDMYTATWKGAGLSIIWGSICC+DHF WHRMNCLNA VE Sbjct: 353 VTEIKDLMVCHYCFDKAFDKFYDMYTATWKGAGLSIIWGSICCDDHFAWHRMNCLNADVE 412 Query: 2481 ESASIVQRNGHKGKPKQLSDFIF 2549 + A I+ RN K K Q+SDFIF Sbjct: 413 DWAYIINRNTQKEKHVQISDFIF 435 >XP_011464425.1 PREDICTED: uncharacterized protein LOC101306820 [Fragaria vesca subsp. vesca] Length = 439 Score = 531 bits (1367), Expect = e-178 Identities = 262/445 (58%), Positives = 308/445 (69%), Gaps = 3/445 (0%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 M TRTNFYKNPSI+Y K SLSSVLQNL+AYN ATGN A Sbjct: 1 MTTRTNFYKNPSITYKKDLSLSSVLQNLKAYNTATGNAPPPEEQTPTSDAKTTGRKRQRD 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPN---SAV 1574 GP MSH+DYI KRRKE + + ELT DVLG S + Sbjct: 61 PKPPPRGGKREIEERDGP--MSHQDYIDKRRKEAFAKPAFE--ELTADVLGKQGPSGSCL 116 Query: 1575 PLVDYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYIC 1754 LV Y SDE++SSE EE PNSG + DG+KSR+EQR+P GEPVC++CG+YGEYIC Sbjct: 117 NLVQYDSDESSSSECEEKQDPPNSGQANESDGVKSRSEQRYPHPGEPVCVMCGKYGEYIC 176 Query: 1755 NETDDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRW 1934 NETDDD+CSM ++ D+SSSG S L +P F +DTWDY RHRW Sbjct: 177 NETDDDICSMDCKAELLESLKAVKDPLSNERPDVSSSGPSYTLPMPDFGEDTWDYERHRW 236 Query: 1935 SKKRSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKD 2114 SKK S+L TYECWKC+RPGHLA+DCLV + +Q+T+G +S+SIP DL+ LYRRCHQIGK+ Sbjct: 237 SKKTSNLCTYECWKCKRPGHLAQDCLVMTRNQMTLG--QSNSIPADLVALYRRCHQIGKN 294 Query: 2115 LLAANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKST 2294 + A CN C+SS +LATC+DCS LCD AGHL++HI+ HPSH++YYSHKL RLVKCCKST Sbjct: 295 MSVAKCNECYSSLSLATCLDCSTALCDNAGHLHEHIQRHPSHRQYYSHKLSRLVKCCKST 354 Query: 2295 CKVTDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAG 2474 CKVTDIKDLL C YCFDKAF+KFYDMYTATWKG GLSII GS+CCEDHF WHRMNC NAG Sbjct: 355 CKVTDIKDLLACQYCFDKAFDKFYDMYTATWKGTGLSIISGSVCCEDHFDWHRMNCFNAG 414 Query: 2475 VEESASIVQRNGHKGKPKQLSDFIF 2549 E+S I+ R K K Q+SDFIF Sbjct: 415 AEDSGYIISRTSLKEKHIQISDFIF 439 >XP_016670649.1 PREDICTED: uncharacterized protein LOC107890644 isoform X1 [Gossypium hirsutum] Length = 438 Score = 528 bits (1360), Expect = e-177 Identities = 254/442 (57%), Positives = 308/442 (69%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPS+SY K SLSS LQNL+AYNIATGN Sbjct: 1 MGTRSNFYKNPSLSYKKDLSLSSALQNLKAYNIATGNAPPLVEEKSQVDDKSACRKRSRE 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 MSH DYI KRR+EV+SSQ +D EL+ D+L NS+V LV Sbjct: 61 REPLSQLPHRSREIEHNDGPMSHHDYILKRRREVSSSQGYD--ELSADILQASNSSVNLV 118 Query: 1584 DYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICNET 1763 DY SD + SS +ET P+SG + D +KSR+EQRFP+ GEPVC++CGRYGEYIC++T Sbjct: 119 DYESDGSASSNDKETQDPPDSGDANEVDRVKSRSEQRFPLPGEPVCVVCGRYGEYICDKT 178 Query: 1764 DDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRWSKK 1943 DDD+CSM S ++ SSS ++ +P ++DTWDYN HRWS++ Sbjct: 179 DDDICSMECKSALLQSLQITEKSMSNRNPSHSSSDLTSISHLPELAEDTWDYNNHRWSQR 238 Query: 1944 RSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKDLLA 2123 SSL +Y+CWKC+RPGHLA+DCLV + Q S +S + +DLL LYRRCH+IG++L Sbjct: 239 GSSLCSYKCWKCKRPGHLADDCLVTTPEQAV--SKQSKPVARDLLELYRRCHRIGENLSH 296 Query: 2124 ANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKSTCKV 2303 A+CNAC S LATC+DCS V+CD GHLN+HI THPSH++YYSHKLKRLVKCCKSTC+V Sbjct: 297 ASCNACRGSIGLATCLDCSTVVCDNEGHLNEHIHTHPSHKQYYSHKLKRLVKCCKSTCEV 356 Query: 2304 TDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGVEE 2483 T+I DLLICHYCFDKAF+KFYDMYTATWKGAGLS+IWGSICCEDHF WHRMNCLNA +E+ Sbjct: 357 TNINDLLICHYCFDKAFDKFYDMYTATWKGAGLSMIWGSICCEDHFAWHRMNCLNADIED 416 Query: 2484 SASIVQRNGHKGKPKQLSDFIF 2549 A I+ RN KGK QLSDFIF Sbjct: 417 RAYIIGRNTGKGKHVQLSDFIF 438 >XP_017609926.1 PREDICTED: uncharacterized protein LOC108455869 [Gossypium arboreum] Length = 438 Score = 526 bits (1354), Expect = e-176 Identities = 254/442 (57%), Positives = 306/442 (69%) Frame = +3 Query: 1224 MGTRTNFYKNPSISYNKHFSLSSVLQNLQAYNIATGNVXXXXXXXXXXTAXXXXXXXXXX 1403 MGTR+NFYKNPS+SY K SLSS LQNL+AYNIATGN Sbjct: 1 MGTRSNFYKNPSLSYKKDLSLSSALQNLKAYNIATGNAPPLVEEKSQVDDKSACRKRSRE 60 Query: 1404 XXXXXXXXXXXXXXXXGPSSMSHRDYIQKRRKEVASSQNHDRVELTEDVLGNPNSAVPLV 1583 MSH DYI KRR+EV+ SQ +D EL+ D+L NS+V LV Sbjct: 61 REPLSQLPHRSREIEDNDGPMSHHDYILKRRREVSLSQGYD--ELSADILQASNSSVNLV 118 Query: 1584 DYASDENTSSEREETHTLPNSGHEKQFDGIKSRNEQRFPVSGEPVCLICGRYGEYICNET 1763 DY SD + SS +ET P+SG + D +KSR+EQRFP+ GEPVC++CGRYGEYIC++T Sbjct: 119 DYESDGSASSNDKETQDPPDSGDANEVDQVKSRSEQRFPLPGEPVCVVCGRYGEYICDKT 178 Query: 1764 DDDVCSMXXXXXXXXXXXXXXGSSHDQVKDISSSGISDALAVPFFSDDTWDYNRHRWSKK 1943 DDD+CSM S ++ SSS ++ +P ++DTWDYN HRWS++ Sbjct: 179 DDDICSMECKSALLQSLQITEKSMSNRNPSQSSSDLTSISHLPELAEDTWDYNNHRWSQR 238 Query: 1944 RSSLSTYECWKCQRPGHLAEDCLVNSCSQITVGSNRSSSIPKDLLGLYRRCHQIGKDLLA 2123 SSL +Y+CWKC+RPGHLA+DCLV + Q S S I +DLL LYRRCH+IG++L Sbjct: 239 GSSLCSYKCWKCKRPGHLADDCLVTTPEQAV--SKHSKPIARDLLELYRRCHRIGENLSH 296 Query: 2124 ANCNACHSSSNLATCIDCSIVLCDGAGHLNDHIRTHPSHQKYYSHKLKRLVKCCKSTCKV 2303 A+CNAC S LATC+DCS V+CD GHLN+HI THPSH++YYSHKLKRLVKCCKSTC+V Sbjct: 297 ASCNACRGSIGLATCLDCSTVVCDNEGHLNEHIHTHPSHKQYYSHKLKRLVKCCKSTCEV 356 Query: 2304 TDIKDLLICHYCFDKAFEKFYDMYTATWKGAGLSIIWGSICCEDHFTWHRMNCLNAGVEE 2483 TDI DLLICHYCFDKAF+KFYDMYTATWKGAGLS+IWGSICCEDHF WHRM+CLNA +E+ Sbjct: 357 TDINDLLICHYCFDKAFDKFYDMYTATWKGAGLSMIWGSICCEDHFAWHRMDCLNADIED 416 Query: 2484 SASIVQRNGHKGKPKQLSDFIF 2549 A I+ RN KGK QLSDFIF Sbjct: 417 RAYIIGRNTGKGKHVQLSDFIF 438