BLASTX nr result
ID: Astragalus23_contig00029829
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00029829
         (889 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte...   199   4e-54
gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca...   181   7e-48
gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family ...   167   1e-43
dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt...   155   5e-41
ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanu...   151   4e-40
dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt...   158   7e-40
dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt...   153   2e-39
dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt...   150   3e-39
ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanu...   147   2e-38
dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subt...   150   7e-37
dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subt...   135   8e-35
dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subt...   144   9e-35
ref|XP_020225309.1| uncharacterized protein LOC109807197 [Cajanu...   132   3e-33
dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subt...   134   4e-33
gb|PNX68200.1| pentatricopeptide repeat-containing protein, part...   129   9e-33
dbj|GAU27275.1| hypothetical protein TSUD_125560 [Trifolium subt...
130 6e-32 gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan] 133 9e-32 gb|ABD28710.1| Polynucleotidyl transferase, Ribonuclease H fold ... 131 1e-31 dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subt... 131 3e-31 gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ... 129 5e-31 >dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum] Length = 1601 Score = 199 bits (507), Expect = 4e-54 Identities = 110/327 (33%), Positives = 167/327 (51%), Gaps = 39/327 (11%) Frame = -2 Query: 864 AMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPT 691 AM+ LI+N ++ G WM IW+L IPQ+VK+ LWR+ G R S V C + CP Sbjct: 1266 AMDTLINNEQYKIPGDWMLIWKLSIPQRVKIFLWRIAIGCLPTRDRLQSRGVQCTDLCPH 1325 Query: 690 CESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIF 511 CE+ E++WH+F SC A +W L EV ++ + L L+ R F Sbjct: 1326 CETTYENDWHLFVSCNKAHEVWREANLWDEVCSVVETVSCIKDFIFAALAALAEPRRSEF 1385 Query: 510 AMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELW 331 M++W LWK RN+K WE V+P V +A D L +W AR R + + Sbjct: 1386 VMMLWCLWKCRNDKIWEDKVQPVRVGMQLARDMLYQWRNARRREDTTGHHDSHN---VIQ 1442 Query: 330 WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E--------------------------- 235 W+ P IG +KCN+DA +F E++K+G+G CIR + Sbjct: 1443 WQPPPIGKVKCNIDAALFNEQHKFGLGMCIRDDHGIFVKARTKWFHGSPPPVEAEAWALK 1502 Query: 234 ---------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVR 82 +++ VVIE+DCL ++N I ++S ++FG + C ++ + ++ IS+V+ Sbjct: 1503 EAITWMGELELSRVVIELDCLLVVNAIKSNSNNQSEFGHIISDCHRLLENYPNFEISFVK 1562 Query: 81 RQTNLVAHTLVRVSRSYASSCVHDFSP 1 RQ N VAH+L R S+SYAS+ + P Sbjct: 1563 RQANFVAHSLARASKSYASTHTFNLIP 1589 >gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan] Length = 816 Score = 181 bits (458), Expect = 7e-48 Identities = 99/326 (30%), Positives = 157/326 (48%), Gaps = 39/326 (11%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688 ME +I N+ LRVQG WM +W LKIP ++ LWR+ G R VPC CP C Sbjct: 484 
MEHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHC 543 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 + E+EWH+F+SCP A ++W +G ++ ++ SF LL L+ + F Sbjct: 544 SANQENEWHLFYSCPAALSIWIDSGCWPRIAHIVEQGISFIDTTWKLLGHLTGSDLTSFT 603 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328 +++W +W+ RN+K W+ P + S + W RS + A P + W Sbjct: 604 LMLWCIWRRRNDKVWKEGAPPPKTSIQLTEQHFHAW-----RSAHRNLAQTASPVVNHRW 658 Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ--------------------------- 232 KP CNVDA++F + + +G G C+R + Sbjct: 659 TKPPADTFTCNVDAVLFNDSSTFGFGICVRDTRGLFQTAISGWKHGLPPPHEAEAAAMLE 718 Query: 231 ---------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRR 79 +V +E DC + + +N+ L++++G++ CR+++ HQ+ ++ ++RR Sbjct: 719 AIQYLIHSPYDNVCVETDCKQVADHLNSTQVLHSEYGIIINQCRSLLRSHQNLQVRFIRR 778 Query: 78 QTNLVAHTLVRVSRSYASSCVHDFSP 1 Q N VAHTL RV+RS AS DF P Sbjct: 779 QANRVAHTLARVARSSASHHFFDFIP 804 >gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 606 Score = 167 bits (422), Expect = 1e-43 Identities = 102/331 (30%), Positives = 155/331 (46%), Gaps = 41/331 (12%) Frame = -2 Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697 + ME L N LR+ G W +W +K P K+ LWRV G R +VPC C Sbjct: 270 YQLMEHLTPNVDLRIPGNWSMLWSMKAPNTKKIFLWRVLRGCLPTRLNLQRRHVPCTMLC 329 Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517 PTC +G+E+EWHIFF C A +W +G ++S+ + D+ LL+ LS Sbjct: 330 PTCSAGIENEWHIFFECVEAKDIWAASGFWPKISQIIADSDGIQQAIFQLLQCLSPSEAL 389 Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIE 337 ++W +W+ RN+ W V P +A ++ EW AR + + + + P Sbjct: 390 DLLCLMWGIWRKRNDILWNNKVTPSHTVIFLARQRISEWMSAR-ETQQIPKVARNDP--- 445 Query: 336 LWWRKPSIGGLKCNVDAMIF*EENKYG-------------------------------IG 250 + W KP +KCNVD IF + N G + Sbjct: 446 ICWFKPPPEYMKCNVDVTIFTDSNCCGFAFYIRDDLGRFKAATTGWYNGSLPPNEAEAMA 505 Query: 249 CIR*EQIT--------SVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRI 94 C+ E IT V+IE+DC 
+++ + + ++L +++G L Y R+++ H++ + Sbjct: 506 CL--EAITWLANSHYEKVLIELDCKKVVDDLYDSTSLFSEYGRLSYKGRSLLALHKNLEV 563 Query: 93 SYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1 +VRRQ N VA TL RVSR YAS DF P Sbjct: 564 RFVRRQANHVARTLARVSRLYASPHYFDFIP 594 >dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum] Length = 395 Score = 155 bits (393), Expect = 5e-41 Identities = 96/319 (30%), Positives = 145/319 (45%), Gaps = 40/319 (12%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688 +++LID SHLRV G W +W++K P KVK L+WR+ R+ V C C C Sbjct: 59 VQELIDTSHLRVNGDWNLLWKIKAPPKVKNLIWRICRRCVSTRARLQDKGVNCPNLCALC 118 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 ED H+FF CP + +W T VS + + + +L QLS + +FA Sbjct: 119 NIEGEDSLHVFFKCPSSQNVWSMTSFFQVVSSVINNENEASAIVFQILRQLSKEDAALFA 178 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328 I+W++WK RN + W + A + L+EW +R+ + S +P W Sbjct: 179 CILWSIWKQRNNQIWNNVTDAQSFVFSRANNMLQEWN--TVRNVAATPVSNQQPGAACIW 236 Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ--------------------------- 232 RKPS G +KCNVDA NK GIG CIR +Q Sbjct: 237 RKPSAGHVKCNVDASFLPHNNKVGIGICIRDDQGAFILAKTEWFSPKSEVHTGEALGLLA 296 Query: 231 ---------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTII-VQHQSYRISYVR 82 + V E+D +++ ++ +FGV+ HC++I +++ + +VR Sbjct: 297 ALNWVHELNLGPVEFELDSKRVVDSFHSSKRDFTEFGVIVEHCKSIFSTYYRNSSVEFVR 356 Query: 81 RQTNLVAHTLVRVSRSYAS 25 RQ N VAH L + + AS Sbjct: 357 RQANEVAHKLAKAATLSAS 375 >ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanus cajan] Length = 307 Score = 151 bits (381), Expect = 4e-40 Identities = 89/300 (29%), Positives = 144/300 (48%), Gaps = 40/300 (13%) Frame = -2 Query: 780 VKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHATALWDYTGLE 607 +K+ LWR+ G R +VPC C +C S +E+EWH+FF+C A +W +G+ Sbjct: 1 MKIFLWRLLRGCLPTRINLQRKHVPCTTLCVSCNSELENEWHVFFTCAAAKDIWTSSGMW 60 Query: 606 LEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGIVKPYEVSAI 427 ++ ++ LL L ++W +W+ RN+K W + P VS Sbjct: 61 
DKIKNIVEQGEGTTDTVFQLLNHLDTKEATELLALLWCIWRRRNDKLWNDVSSPIGVSIF 120 Query: 426 IAMDQLREWEQARLRSTSLVRS-SIAKPSIELWWRKPSIGGLKCNVDAMIF*EENKYG-- 256 +A +L EW A R+T+L S +A+P+ +W KP G +KCN DA IF + N Y Sbjct: 121 LARQRLEEWLAA--RTTNLAPSPRVAEPN---YWVKPQPGFMKCNTDAAIFKDTNSYSFA 175 Query: 255 -----------------------------IGCIR------*EQITSVVIEMDCLSILNGI 181 I CI +V+IE+DC ++++ + Sbjct: 176 FCLRDNHGRFKAATTGWYHGLSPRHEAEVIACIEAMSWLTNSSYENVLIELDCKTVVDDL 235 Query: 180 NNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1 + + L +++G+L R+I+ H++ + ++RRQ N VAH+L R +RSYAS DF P Sbjct: 236 HGSNQLLSEYGLLIQKGRSILASHKNLSVRFIRRQANHVAHSLARAARSYASPHTFDFIP 295 >dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum] Length = 1012 Score = 158 bits (400), Expect = 7e-40 Identities = 82/211 (38%), Positives = 119/211 (56%), Gaps = 3/211 (1%) Frame = -2 Query: 864 AMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPT 691 +ME LIDN ++ G WM IW LKIPQ+VK +WRV G R V C + CP Sbjct: 758 SMETLIDNEGYKLPGDWMQIWNLKIPQRVKKFMWRVLRGCLPTRDKLQRKGVQCTDLCPH 817 Query: 690 CESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIF 511 CE+ E+EWH+F C A +W GL ++++ + A SFN+ + F Sbjct: 818 CETTYENEWHVFLGCEKAKRIWIEAGLWDDIAQLVVAANSFNSLVFSFMTVNLEQKCSDF 877 Query: 510 AMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELW 331 MI+W LWK RNEK WEG+ KP +S A + L +W + + R ++ ++I + ++ Sbjct: 878 VMIMWCLWKRRNEKIWEGVEKPVHLSINTAREYLVQWREIKARQENVRPAAI---NTQVV 934 Query: 330 WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241 W+ P+ G KCNVDA +F EE ++G+G CIR Sbjct: 935 WQPPADGEFKCNVDAALFNEEQQFGLGMCIR 965 >dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum] Length = 479 Score = 153 bits (387), Expect = 2e-39 Identities = 100/328 (30%), Positives = 149/328 (45%), Gaps = 49/328 (14%) Frame = -2 Query: 846 DNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVE 673 DNS + G W IWR KIP KVK LLWR+G V R+ S V C C C E Sbjct: 150 DNSG--IAGNWHQIWRAKIPPKVKNLLWRIGRNVLPTRATLNSRSVQCLVHCAVCNDSAE 207 Query: 672 
DEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWA 493 D HI F CP +T W GL ++ + + + ++L+ L+ + IF++++W+ Sbjct: 208 DSIHILFLCPRSTECWQQAGLWNQIDAGLNTSNNIADILLFILQSLNKEQQEIFSVLLWS 267 Query: 492 LWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQA-RLRSTSLVRSSIAKPSIELWWRKPS 316 +WK RN K W+ I + A L W+QA + RS + I + + W KPS Sbjct: 268 IWKRRNAKVWDNITESNTNVYERAQHLLTSWKQAQQTRSYANTPQPIQQRTN---WEKPS 324 Query: 315 IGGLKCNVDAMIF*EENKYGIG-CIR*EQ------------------------------- 232 G KCN+DA NK GIG CIR +Q Sbjct: 325 QGRYKCNIDASFSSTHNKVGIGMCIRDDQGRYVAAKTEWLEPILDVEIGEAMGLFSAVKW 384 Query: 231 -----ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQH-QSYRISYVRRQTN 70 ++ V EMDC +++ +++ T N+ G + CR I+ + + + ++RRQ N Sbjct: 385 VDELRLSDVDFEMDCKRVVDCLHSSRTYNSDLGDILRDCRVILATNLVNSHVKFIRRQAN 444 Query: 69 LVAHTLVRVSRSYAS--------SCVHD 10 VAH L R + AS +C++D Sbjct: 445 EVAHRLAREATCLASFHIFIDIPTCIYD 472 >dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum] Length = 372 Score = 150 bits (380), Expect = 3e-39 Identities = 79/213 (37%), Positives = 114/213 (53%), Gaps = 11/213 (5%) Frame = -2 Query: 813 MNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPH 640 M IW +KIPQK+K+ LWR G R + V C + C CE E++WH+FF C Sbjct: 1 MQIWNMKIPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNK 60 Query: 639 ATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWE 460 +W GL + ++ A F F LLE LS HN ++FAM +W++WK RN+K W Sbjct: 61 VEEVWAEAGLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWSIWKRRNDKLWN 120 Query: 459 GIVKPYEVSAIIAMDQLREWE---QARLRSTSLVRS-----SIAKPSIELWWRKPSIGGL 304 GI VS ++A D L +W+ Q R + ++ S ++ S + WRKP G + Sbjct: 121 GIETRPTVSIMLARDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSSNTIRWRKPGTGEV 180 Query: 303 KCNVDAMIF*EENKYGIG-CIR*EQITSVVIEM 208 KCNVDA IF + YG+G C+R + + +M Sbjct: 181 KCNVDAAIFKDHGCYGVGICLRGDNCEFIAAKM 213 >ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanus cajan] Length = 319 Score = 147 bits (371), Expect = 2e-38 Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 39/308 (12%) Frame = -2 
Query: 807 IWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHAT 634 +W L IP +K+ LWR+ R VPC CP CE+ E+ WHIFF C A Sbjct: 4 LWALPIPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQ 63 Query: 633 ALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGI 454 +W TG+ + + LL +S + + + +W+ RN K W+ Sbjct: 64 TVWQATGIWQHIKSLVDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQG 123 Query: 453 VKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRKPSIGGLKCNVDAMIF* 274 P V+ A R+W+ A+ RS+ + P +L W+KP G CN+DA +F Sbjct: 124 APPSGVATSQAKQYFRDWQAAQARSS----TQRTPPVHDLQWKKPHAGTFTCNIDAALFQ 179 Query: 273 EENKYGIG-CIR*E------------------------------------QITSVVIEMD 205 + + +G CIR + +T V IE D Sbjct: 180 DSSYFGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQWIVTLSLTHVTIESD 239 Query: 204 CLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYAS 25 C S+L+ ++ + ++++G L CR ++ H + + ++ RQ N VAH L RVSR YAS Sbjct: 240 CKSVLDALSGTQSHHSEYGSLLNKCRGLLHNHPNLSLKFIPRQANRVAHCLARVSRCYAS 299 Query: 24 SCVHDFSP 1 S + +F P Sbjct: 300 SHIFEFIP 307 >dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subterraneum] Length = 1688 Score = 150 bits (378), Expect = 7e-37 Identities = 83/233 (35%), Positives = 118/233 (50%), Gaps = 23/233 (9%) Frame = -2 Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697 + ME+L+DN+ LRV+G W IW LKIPQK+K+ LWR G R V C C Sbjct: 1398 YYTMENLVDNTGLRVEGNWGKIWELKIPQKMKVFLWRAARGCLPTRYRLQQKGVNCPHTC 1457 Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517 C++ E++WH+FF C A +W+ GL + + F + F LLE LS H Sbjct: 1458 AYCQNNFENDWHVFFGCVKAQEIWEEAGLWSFIEGMFESTEGFVSLFFSLLELLSQHKII 1517 Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQAR------------LRSTS 373 +F W +WK RN+K WE I VS +A D + +W+ A+ L ++ Sbjct: 1518 LFVAAFWCIWKRRNQKIWEDIELHPSVSLQLASDIIYQWKTAQTSHQRQQTSAAILPHSA 1577 Query: 372 LVRS--------SIAKPSIELWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241 R+ S+ ++ + W P G LKCNVDA IF E+N +G G C+R Sbjct: 1578 ATRNASGEERSVSVTTSAVRVIWTPPVQGMLKCNVDAAIFKEQNCFGAGMCLR 1630 
>dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subterraneum] Length = 249 Score = 135 bits (341), Expect = 8e-35 Identities = 73/197 (37%), Positives = 104/197 (52%), Gaps = 11/197 (5%) Frame = -2 Query: 798 LKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHATALW 625 +KIPQKVK+ LWR G R + V C + C CE E++WH+FF C +W Sbjct: 1 MKIPQKVKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVW 60 Query: 624 DYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGIVKP 445 L + ++ A F F LLE LS HN ++FAM +W +WK RN+K W GI Sbjct: 61 AEARLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWCIWKRRNDKLWNGIETR 120 Query: 444 YEVSAIIAMDQLREWE---QARLRSTSLVRSSIAKPSIE-----LWWRKPSIGGLKCNVD 289 VS ++A D L +W+ Q R + ++ S + ++ + WRKP G +KCNVD Sbjct: 121 PTVSIMLACDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSNNTIRWRKPGTGEVKCNVD 180 Query: 288 AMIF*EENKYGIG-CIR 241 A IF + G+G C+R Sbjct: 181 AAIFKDHGCCGVGICLR 197 >dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subterraneum] Length = 1229 Score = 144 bits (362), Expect = 9e-35 Identities = 89/311 (28%), Positives = 139/311 (44%), Gaps = 40/311 (12%) Frame = -2 Query: 852 LIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVYRS---VAISGYVPCQEECPTCES 682 L + V G W IW ++IP K+K WR+ + + I G V CQ C C + Sbjct: 900 LTSHDSFNVSGDWRKIWTMQIPPKLKHFCWRMLRYCLPTRLKLHIRG-VNCQTTCAVCSN 958 Query: 681 GVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMI 502 EDE H+FF CPHA + W L + + M + SF++ +L L + F I Sbjct: 959 ATEDELHLFFDCPHAISCWKELNLWQRLEQKMHQSGSFSSIIFAILADLDADTQARFVAI 1018 Query: 501 IWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRK 322 +W++W++RN+ WE + +A D + ++ +++ ++ + P + W+K Sbjct: 1019 LWSIWRTRNDCLWEHKQPSTVTTCRLATDIVSDYTWC----CNMLDTTQSSPPVHR-WKK 1073 Query: 321 PSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ----------------------------- 232 P LKCNVD IF E K+GIG C R +Q Sbjct: 1074 PEANWLKCNVDGAIFSTEGKFGIGICFRNDQGILVQAHTMYFPFEVTVNECEASALKYAL 1133 Query: 231 -------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQT 73 V+ E D +++N I N N+ G L C++++ SY +++VRRQ Sbjct: 1134 
LIALSSGFERVIFESDSQTVVNSILNDYRYENELGSLLSACKSLLSVIASYNVAFVRRQA 1193 Query: 72 NLVAHTLVRVS 40 N VAH L R S Sbjct: 1194 NRVAHNLARAS 1204 >ref|XP_020225309.1| uncharacterized protein LOC109807197 [Cajanus cajan] Length = 273 Score = 132 bits (332), Expect = 3e-33 Identities = 76/264 (28%), Positives = 120/264 (45%), Gaps = 37/264 (14%) Frame = -2 Query: 681 GVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMI 502 G+E+EWH+FF C A A+W+ +G+ +S + + F +LL LS N + Sbjct: 2 GLENEWHLFFDCAEAQAIWNASGIWTLISHAVNNGNDFKETLGHLLNSLSHENIVKMVVS 61 Query: 501 IWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRK 322 +W +W+ N K W P + +M + EW+ AR + S + S W + Sbjct: 62 LWCIWQRHNNKIWSNTTTPPHLVISQSMQKFEEWQHARAKEHPPPTQSSSPGS----WTR 117 Query: 321 PSIGGLKCNVDAMIF*EENKYGIG-CIR-------------------------------- 241 P +G +K NVDA IF E+NK G G C+R Sbjct: 118 PQVGFIKGNVDATIFKEDNKVGFGICLRDATGSLIKAKSGWLYGVAPPHEEEATTLLESI 177 Query: 240 ----*EQITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQT 73 + T V++E D ++ I N + +++G + C +++ H + + ++RRQ Sbjct: 178 RWVCDQGYTRVILESDSKQVVEDILNSNIYYSEYGHTLHRCHSLLNSHPNLLVRFIRRQA 237 Query: 72 NLVAHTLVRVSRSYASSCVHDFSP 1 N VAH+L R SR YASS V F P Sbjct: 238 NHVAHSLTRTSRYYASSHVFYFIP 261 >dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subterraneum] Length = 343 Score = 134 bits (336), Expect = 4e-33 Identities = 72/213 (33%), Positives = 110/213 (51%), Gaps = 3/213 (1%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688 +++LID S+LRV G W +W +K+P KVK L+WR+ R V C + C C Sbjct: 124 VQELIDTSYLRVNGNWNLVWNIKVPPKVKNLIWRICRRCLPTRVRLRDKGVECTQTCALC 183 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 ED HIFF CP + +W TG VS + + + ++L+QLS + +FA Sbjct: 184 NEENEDSEHIFFKCPSSRNVWSMTGFFHVVSNAINNNNNAQDIIFHILQQLSKDDSTVFA 243 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328 I+W++WK RN + W + A++ L+EW+ + +++ S + + W Sbjct: 244 CILWSIWKQRNNQIWNNVTDAQNFVLSRAVNMLQEWKAVCIVASN--PDSKTQEPLARKW 
301 Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ 232 RKP G +KCN+DA + GIG CIR EQ Sbjct: 302 RKPMAGRVKCNIDASFPANSDIVGIGICIRDEQ 334 >gb|PNX68200.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense] Length = 220 Score = 129 bits (325), Expect = 9e-33 Identities = 63/164 (38%), Positives = 89/164 (54%), Gaps = 2/164 (1%) Frame = -2 Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697 + ME+L+DN+ LRV+G W IW LKIPQK+K+ LWR G R V C C Sbjct: 23 YYTMENLVDNTGLRVEGNWGKIWGLKIPQKMKVFLWRAARGCLPTRYRLQRKGVNCPHTC 82 Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517 C++ E++WH+FF C A +W+ GL + + A F + F LLE LS HN Sbjct: 83 AYCQNNFENDWHVFFGCVKAQEIWEEAGLWSLIEGMFESAEGFVSLFFSLLELLSQHNII 142 Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARL 385 +F W +WK RN+K WE I VS +A D + +W+ ++ Sbjct: 143 LFVAAFWCIWKRRNQKIWEDIELRPSVSLQLATDIIYQWKTTQI 186 >dbj|GAU27275.1| hypothetical protein TSUD_125560 [Trifolium subterraneum] Length = 330 Score = 130 bits (327), Expect = 6e-32 Identities = 86/322 (26%), Positives = 143/322 (44%), Gaps = 47/322 (14%) Frame = -2 Query: 825 QGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFF 652 Q W ++W++ P K K LLWR+ +G R+ +VPC CP C+ ED+WH+FF Sbjct: 7 QEDWSSLWKIHAPPKAKHLLWRICKGCIPTRTRLHERFVPCPLICPVCDQCNEDDWHVFF 66 Query: 651 SCPHATALWDYTGLELEVSRPMQDATS-----FNTCFCYLLEQLSVHNRYI---FAMIIW 496 +C + GLE +S +Q + FN C +R I FA+++W Sbjct: 67 TCNDSIHARQAAGLEHVISTRLQQLRTTQEVIFNIC--------KGEDRMIAGQFAVLLW 118 Query: 495 ALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRKPS 316 LW +RN+K W P I A EW + R++ + I+ W KP Sbjct: 119 TLWNNRNDKVWNESRTPGRSLGIKASQFWHEWFAIQKVQQQSPRAAQQQQFIK--WEKPP 176 Query: 315 IGGLKCNVDAMIF*EENKYGIG-CIR*EQ------------------------------- 232 +G KCNVDA + ++ G C+R Q Sbjct: 177 MGWHKCNVDAGFYHNLHRTTAGWCLRDHQGSFVRAGTSWSNGNYYIAEGEAAAVLDAMKA 236 Query: 231 -----ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNL 67 +T V+ E D S+++ I N +++F + + + ++ + ++ + +++RQ N+ Sbjct: 237 
VENQGVTHVIFETDSKSVVDAIYNFHGGSSEFSSIICNIKNALLSNPNFVVKFIKRQANM 296 Query: 66 VAHTLVRVSRSYASSCVHDFSP 1 VAHTL R + S+++ C D P Sbjct: 297 VAHTLARAAISWSNRCTFDLLP 318 >gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan] Length = 536 Score = 133 bits (335), Expect = 9e-32 Identities = 68/210 (32%), Positives = 104/210 (49%), Gaps = 3/210 (1%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVYRSVAISGY--VPCQEECPTC 688 ME +I N+ LRVQG WM +W LKIP ++ LWR+ G + V C CP C Sbjct: 329 MEHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHC 388 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 + E+EWH+F+SCP A ++W +G ++R ++ SF LL L+ + F Sbjct: 389 SANQENEWHLFYSCPAAISIWIDSGCWPRIARIVEQGISFIDTTWKLLGHLTSSDLTSFT 448 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328 +++W +W+ RN+K W+ P S + W+ A T A P + W Sbjct: 449 LMLWCIWRWRNDKVWKESAPPPRTSIQLTEQHFHAWQSAHRNLT-----QNASPVVNHRW 503 Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241 KP CNVDA +F + + +G+ C+R Sbjct: 504 TKPPANTFTCNVDAALFKDSSTFGLSICVR 533 >gb|ABD28710.1| Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula] Length = 393 Score = 131 bits (329), Expect = 1e-31 Identities = 99/344 (28%), Positives = 146/344 (42%), Gaps = 57/344 (16%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688 +ED+++N+HLR G W IWRLK+P +VK L+WRV + R IS V C C C Sbjct: 41 VEDVVNNAHLRKPGYWSGIWRLKVPPRVKKLVWRVCRECFPTRVRLISRGVNCPSACVKC 100 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 E ED +HIFF C A +W+ + ++ + + LL++LS Sbjct: 101 EDPHEDCYHIFFHCRTAIDVWNTANVWHLIAPSLSQFDNAPDIIFNLLQKLSASQMESIV 160 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVR---SSIAKPSI- 340 I+W++WKSRN K W+ + + A L W +A + L + + ++P Sbjct: 161 TIMWSIWKSRNLKLWQQVSESSVTILERAKHLLEGWRKANHKQGLLGQVHSPTNSRPQTH 220 Query: 339 ----------ELWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E-------------- 235 + WRKP G LKCNVDA NK GIG CIR Sbjct: 221 
DSQNTDNRYGNIRWRKPKSGRLKCNVDASFSTSSNKVGIGMCIRDSEGNHVRSKTMWFSP 280 Query: 234 ----------------------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYH---- 133 Q+T+V E+D +I + N N +FG + + Sbjct: 281 LCPVNIGEALGLYHATRWINELQLTNVDFEVDSKTIADYFNKARGDNTEFGSIMENTIQF 340 Query: 132 CRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1 C + + + + RRQ N VAH L + + S + D SP Sbjct: 341 CNIFLT---NSHVEFTRRQANEVAHELAKAATLGPSFHIFDESP 381 >dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subterraneum] Length = 474 Score = 131 bits (329), Expect = 3e-31 Identities = 85/320 (26%), Positives = 144/320 (45%), Gaps = 41/320 (12%) Frame = -2 Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688 +++ +D SHL++ G W IW+LK+P +VK L+WRV R+ + V C C C Sbjct: 139 VQEELDTSHLKMTGDWNLIWKLKVPPRVKNLVWRVCRQCIPTRTNLQNRGVNCTTVCALC 198 Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508 ED HIFF C ++ +W ++ +Q +L+QL+V + A Sbjct: 199 NEYDEDSGHIFFDCLSSSNIWSMCTFNHVITAGLQHYAGVTELIFAVLQQLNVDEAALMA 258 Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQAR-LRSTSLVRSSIAKPSIELW 331 IIW++WK RN + W + V A+ L +W + +R+ + + I IE Sbjct: 259 CIIWSIWKQRNNQIWNNVTDAQSVVFSRAVTTLHDWCVVQVIRNDTREQQRI----IEHK 314 Query: 330 WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E--------------------------- 235 W+KP+ G +KCN+DA N+ GIG CIR E Sbjct: 315 WKKPNNGRVKCNIDASFSRNLNRVGIGICIRDEYGIYVMAKYDQFSPICDVRIGEALGLL 374 Query: 234 ---------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTII-VQHQSYRISYV 85 V E+D +++ ++ +++FG + HCR + + + + + ++ Sbjct: 375 SALRWVHELNFGPVDFELDSKLVVDSFRSNKYNDSEFGEIIAHCRRLFSLLYNNSSVEFI 434 Query: 84 RRQTNLVAHTLVRVSRSYAS 25 RRQ N + H+L + + AS Sbjct: 435 RRQANKIVHSLSKAATYVAS 454 >gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 406 Score = 129 bits (325), Expect = 5e-31 Identities = 73/213 (34%), Positives = 105/213 (49%), Gaps = 3/213 (1%) Frame = -2 Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697 + ME LI N+HL V G W IW LK+ +K+ LWR+ R +P C Sbjct: 165 
YYVMESLISNTHLHVPGNWKQIWSLKVLNTMKIFLWRIARRCLPSRMNLQQRGIPRTSLC 224 Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517 C E+EWHIFF C A ++W GL + + + F L+ L Sbjct: 225 AHCSLNQENEWHIFFGCQTAESIWMTFGLWPSTNAYIDNGEDFKDTIFSLISNLHHDIAC 284 Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIE 337 +I+W++W++RN+K W P ++ AM + EW+ A+++ S +S +P + Sbjct: 285 KVIIILWSIWRNRNDKVWSDTTTPPGIAVHKAMQRYSEWQFAKVKDKS---TSQQQPHVN 341 Query: 336 LWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241 W KP G LKCNVDA +F EEN G G CIR Sbjct: 342 T-WTKPLPGLLKCNVDAAVFKEENIMGFGLCIR 373