BLASTX nr result
ID: Astragalus22_contig00033023
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00033023 (887 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte... 199 6e-54 dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subt... 167 4e-47 ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanu... 167 4e-46 dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt... 167 9e-46 ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanu... 165 2e-45 dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt... 172 1e-44 dbj|GAU18498.1| hypothetical protein TSUD_366810 [Trifolium subt... 156 5e-42 gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family ... 160 2e-41 dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt... 152 8e-40 dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subt... 154 9e-40 dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt... 154 1e-39 gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca... 156 3e-39 dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subt... 148 3e-37 dbj|GAU10454.1| hypothetical protein TSUD_423510, partial [Trifo... 140 3e-36 gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ... 142 8e-36 gb|PNY17850.1| ribonuclease H [Trifolium pratense] 140 2e-35 dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subt... 142 2e-35 dbj|GAU47271.1| hypothetical protein TSUD_280940 [Trifolium subt... 143 1e-34 gb|KYP36545.1| hypothetical protein KK1_042329 [Cajanus cajan] 135 2e-34 gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family ... 140 3e-34 >dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum] Length = 1601 Score = 199 bits (506), Expect = 6e-54 Identities = 109/269 (40%), Positives = 143/269 (53%) Frame = -2 Query: 859 SIPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWE 680 SIP +VK+ LWR+A GCLPTR LQ R V C ++ P C E++ HLF C +A +W Sbjct: 1289 SIPQRVKIFLWRIAIGCLPTRDRLQSRGVQCTDLCPHCETTYENDWHLFVSCNKAHEVWR 1348 Query: 679 RCRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPV 500 F +L E + E ++ W +WK N K+WED ++PV Sbjct: 1349 EANLWDEVCSVVETVSCIKDFIFAALAALAEPRRSEFVMMLWCLWKCRNDKIWEDKVQPV 1408 Query: 499 AVSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEG 320 V A D L W R R +Q WQ P G +KCNID AL +++ Sbjct: 1409 RVGMQLARDMLYQWRNARR--REDTTGHHDSHNVIQ--WQPPPIGKVKCNIDAALFNEQH 1464 Query: 319 KYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLL 140 K+G+ CIRD++GIF++A+T WF G P P E EA L +A+ W EL L VVIE DCLL Sbjct: 1465 KFGLGMCIRDDHGIFVKARTKWFHGSPPPVEAEAWALKEAITWMGELELSRVVIELDCLL 1524 Query: 139 VVNAVNKASILNTEFDVIISHCKIRILLN 53 VVNA+ S +EF IIS C R+L N Sbjct: 1525 VVNAIKSNSNNQSEFGHIISDCH-RLLEN 1552 >dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subterraneum] Length = 249 Score = 167 bits (424), Expect = 4e-47 Identities = 89/243 (36%), Positives = 122/243 (50%), Gaps = 7/243 (2%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP KVK+ LWR ARGCLPTR L+ R V C + C E++ H+FF C + + +W Sbjct: 3 IPQKVKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVWAE 62 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 R F FF L + L++ ++ W IWKR N K+W + Sbjct: 63 ARLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWCIWKRRNDKLWNGIETRPT 122 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQ-------QWQRPEPGVLKCNIDPA 338 VS A D L W+L+R +H A +W++P G +KCN+D A Sbjct: 123 VSIMLACDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSNNTIRWRKPGTGEVKCNVDAA 182 Query: 337 LIDQEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVI 158 + G GV C+R +NG F+ AKT WF G+P+PQE EA GL + + W + GL AV I Sbjct: 183 IFKDHGCCGVGICLRGDNGEFIAAKTAWFYGLPQPQEAEACGLRETILWLGDRGLTAVSI 242 Query: 157 ETD 149 E D Sbjct: 243 ELD 245 >ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanus cajan] Length = 307 Score = 167 bits (422), Expect = 4e-46 Identities = 86/254 (33%), Positives = 124/254 (48%) Frame = -2 Query: 844 VKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCRXX 665 +K+ LWR+ RGCLPTR+NLQR+HVPC + C +E+E H+FF C AK +W Sbjct: 1 MKIFLWRLLRGCLPTRINLQRKHVPCTTLCVSCNSELENEWHVFFTCAAAKDIWTSSGMW 60 Query: 664 XXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAVSQH 485 F L + L + E + W IW+R N K+W DV P+ VS Sbjct: 61 DKIKNIVEQGEGTTDTVFQLLNHLDTKEATELLALLWCIWRRRNDKLWNDVSSPIGVSIF 120 Query: 484 AALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKYGVA 305 A L +W R A W +P+PG +KCN D A+ Y A Sbjct: 121 LARQRLEEWLAART-----TNLAPSPRVAEPNYWVKPQPGFMKCNTDAAIFKDTNSYSFA 175 Query: 304 FCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVVNAV 125 FC+RD +G F A T W+ G+ E E + +A+ W E V+IE DC VV+ + Sbjct: 176 FCLRDNHGRFKAATTGWYHGLSPRHEAEVIACIEAMSWLTNSSYENVLIELDCKTVVDDL 235 Query: 124 NKASILNTEFDVII 83 + ++ L +E+ ++I Sbjct: 236 HGSNQLLSEYGLLI 249 >dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum] Length = 372 Score = 167 bits (424), Expect = 9e-46 Identities = 91/254 (35%), Positives = 128/254 (50%), Gaps = 7/254 (2%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP K+K+ LWR ARGCLPTR L+ R V C + C E++ H+FF C + + +W Sbjct: 8 IPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVWAE 67 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F FF L + L++ ++ W+IWKR N K+W + Sbjct: 68 AGLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWSIWKRRNDKLWNGIETRPT 127 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQ-------QWQRPEPGVLKCNIDPA 338 VS A D L W+L+R +H A +W++P G +KCN+D A Sbjct: 128 VSIMLARDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSSNTIRWRKPGTGEVKCNVDAA 187 Query: 337 LIDQEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVI 158 + G YGV C+R +N F+ AK WF G+P+PQE EA GL +A+ W + GL AV I Sbjct: 188 IFKDHGCYGVGICLRGDNCEFIAAKMAWFYGLPQPQEAEACGLREAILWLGDRGLTAVSI 247 Query: 157 ETDCLLVVNAVNKA 116 E D L V+ V K+ Sbjct: 248 ELDYLCGVSLVAKS 261 >ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanus cajan] Length = 319 Score = 165 bits (418), Expect = 2e-45 Identities = 83/262 (31%), Positives = 129/262 (49%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP+ +K+ LWR+ R CLP+R LQ++ VPC + P C E+ H+FF C EA+ +W+ Sbjct: 9 IPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQTVWQA 68 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F+L S+++ E V IW+R N KVW+ P Sbjct: 69 TGIWQHIKSLVDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQGAPPSG 128 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 V+ A Y DW+ + QW++P G CNID AL Sbjct: 129 VATSQAKQYFRDWQAAQA-----RSSTQRTPPVHDLQWKKPHAGTFTCNIDAALFQDSSY 183 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 +G + CIR+++G F+ AKT W G+P E EA L A++W + L L V IE+DC V Sbjct: 184 FGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQWIVTLSLTHVTIESDCKSV 243 Query: 136 VNAVNKASILNTEFDVIISHCK 71 ++A++ ++E+ +++ C+ Sbjct: 244 LDALSGTQSHHSEYGSLLNKCR 265 >dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum] Length = 1012 Score = 172 bits (436), Expect = 1e-44 Identities = 89/231 (38%), Positives = 121/231 (52%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP +VK +WRV RGCLPTR LQR+ V C ++ P C E+E H+F C +AK +W Sbjct: 782 IPQRVKKFMWRVLRGCLPTRDKLQRKGVQCTDLCPHCETTYENEWHVFLGCEKAKRIWIE 841 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F F+ E KC + +I W +WKR N K+WE V KPV Sbjct: 842 AGLWDDIAQLVVAANSFNSLVFSFMTVNLEQKCSDFVMIMWCLWKRRNEKIWEGVEKPVH 901 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 +S + A +YL W ++ + Q WQ P G KCN+D AL ++E + Sbjct: 902 LSINTAREYLVQWREIKARQENVRPAAIN----TQVVWQPPADGEFKCNVDAALFNEEQQ 957 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAV 164 +G+ CIR +G F++A+TM FEG P P E EA L +AL W ELG+ V Sbjct: 958 FGLGMCIRGAHGTFVKARTMVFEGTPPPLEAEAYALKEALIWLEELGISRV 1008 >dbj|GAU18498.1| hypothetical protein TSUD_366810 [Trifolium subterraneum] Length = 319 Score = 156 bits (395), Expect = 5e-42 Identities = 89/261 (34%), Positives = 121/261 (46%) Frame = -2 Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674 P K+K LLWR+ R C PTR+ LQ + + C +C ED HLFFKC + +W + Sbjct: 51 PPKIKNLLWRICRHCCPTRVRLQDKGIECPTDCVLCEDHDEDSFHLFFKCRNSLNIWNQT 110 Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494 A F L + + K ++I W+IW + N KVW + P Sbjct: 111 NIAQAVLQASEEQSDAAAVIFTLLQQVDKDKTGIFAIIIWSIWNQRNDKVWRNKDTPQQT 170 Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314 A+++L DW+ + +W++P PG +KCNID A Sbjct: 171 VILRAMNFLNDWK--NIISVQTSTSVDMQAETTLTKWKKPSPGRIKCNIDVAFPSNTNLI 228 Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134 G+ CIRDE G F+ AKT WFE E EA+G AL+W EL L V E D L+V Sbjct: 229 GIGICIRDETGAFVRAKTEWFEPKCEVHVGEALGFLSALRWVHELNLGPVDFELDSKLMV 288 Query: 133 NAVNKASILNTEFDVIISHCK 71 ++ TEF II HCK Sbjct: 289 DSSRYHRKDFTEFGAIIQHCK 309 >gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 606 Score = 160 bits (406), Expect = 2e-41 Identities = 86/253 (33%), Positives = 120/253 (47%) Frame = -2 Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674 PN K+ LWRV RGCLPTR+NLQRRHVPC + P C GIE+E H+FF+CVEAK +W Sbjct: 297 PNTKKIFLWRVLRGCLPTRLNLQRRHVPCTMLCPTCSAGIENEWHIFFECVEAKDIWAAS 356 Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494 F L L+ + + + W IW++ N +W + + P Sbjct: 357 GFWPKISQIIADSDGIQQAIFQLLQCLSPSEALDLLCLMWGIWRKRNDILWNNKVTPSHT 416 Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314 A + +W R + W +P P +KCN+D + Sbjct: 417 VIFLARQRISEWMSARETQQIPKVARNDPIC-----WFKPPPEYMKCNVDVTIFTDSNCC 471 Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134 G AF IRD+ G F A T W+ G P E EAM +A+ W E V+IE DC VV Sbjct: 472 GFAFYIRDDLGRFKAATTGWYNGSLPPNEAEAMACLEAITWLANSHYEKVLIELDCKKVV 531 Query: 133 NAVNKASILNTEF 95 + + ++ L +E+ Sbjct: 532 DDLYDSTSLFSEY 544 >dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum] Length = 395 Score = 152 bits (385), Expect = 8e-40 Identities = 82/261 (31%), Positives = 124/261 (47%) Frame = -2 Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674 P KVK L+WR+ R C+ TR LQ + V C + +C + ED +H+FFKC ++ +W Sbjct: 83 PPKVKNLIWRICRRCVSTRARLQDKGVNCPNLCALCNIEGEDSLHVFFKCPSSQNVWSMT 142 Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494 + F + L++ + I W+IWK+ N ++W +V + Sbjct: 143 SFFQVVSSVINNENEASAIVFQILRQLSKEDAALFACILWSIWKQRNNQIWNNVTDAQSF 202 Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314 A + L +W +R + W++P G +KCN+D + + K Sbjct: 203 VFSRANNMLQEWNTVRNVAATPVSNQQPGAACI---WRKPSAGHVKCNVDASFLPHNNKV 259 Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134 G+ CIRD+ G F+ AKT WF E EA+GL AL W EL L V E D VV Sbjct: 260 GIGICIRDDQGAFILAKTEWFSPKSEVHTGEALGLLAALNWVHELNLGPVEFELDSKRVV 319 Query: 133 NAVNKASILNTEFDVIISHCK 71 ++ + + TEF VI+ HCK Sbjct: 320 DSFHSSKRDFTEFGVIVEHCK 340 >dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subterraneum] Length = 474 Score = 154 bits (389), Expect = 9e-40 Identities = 86/272 (31%), Positives = 133/272 (48%), Gaps = 2/272 (0%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 +P +VK L+WRV R C+PTR NLQ R V C V +C ED H+FF C+ + +W Sbjct: 162 VPPRVKNLVWRVCRQCIPTRTNLQNRGVNCTTVCALCNEYDEDSGHIFFDCLSSSNIWSM 221 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 C F + L + + I W+IWK+ N ++W +V + Sbjct: 222 CTFNHVITAGLQHYAGVTELIFAVLQQLNVDEAALMACIIWSIWKQRNNQIWNNVTDAQS 281 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 V A+ L DW ++++ ++ +W++P G +KCNID + + Sbjct: 282 VVFSRAVTTLHDWCVVQV----IRNDTREQQRIIEHKWKKPNNGRVKCNIDASFSRNLNR 337 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 G+ CIRDE GI++ AK F + + + EA+GL AL+W EL V E D LV Sbjct: 338 VGIGICIRDEYGIYVMAKYDQFSPICDVRIGEALGLLSALRWVHELNFGPVDFELDSKLV 397 Query: 136 VNAVNKASILNTEFDVIISHCK--IRILLNSS 47 V++ ++EF II+HC+ +L N+S Sbjct: 398 VDSFRSNKYNDSEFGEIIAHCRRLFSLLYNNS 429 >dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum] Length = 479 Score = 154 bits (389), Expect = 1e-39 Identities = 86/270 (31%), Positives = 128/270 (47%), Gaps = 2/270 (0%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP KVK LLWR+ R LPTR L R V C +C ED IH+ F C + W++ Sbjct: 166 IPPKVKNLLWRIGRNVLPTRATLNSRSVQCLVHCAVCNDSAEDSIHILFLCPRSTECWQQ 225 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 A + SL + + + SV+ W+IWKR N KVW+++ + Sbjct: 226 AGLWNQIDAGLNTSNNIADILLFILQSLNKEQQEIFSVLLWSIWKRRNAKVWDNITESNT 285 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQ--WQRPEPGVLKCNIDPALIDQE 323 A L W+ + +QQ+ W++P G KCNID + Sbjct: 286 NVYERAQHLLTSWKQAQ-----QTRSYANTPQPIQQRTNWEKPSQGRYKCNIDASFSSTH 340 Query: 322 GKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCL 143 K G+ CIRD+ G ++ AKT W E + + + EAMGL+ A+KW EL L V E DC Sbjct: 341 NKVGIGMCIRDDQGRYVAAKTEWLEPILDVEIGEAMGLFSAVKWVDELRLSDVDFEMDCK 400 Query: 142 LVVNAVNKASILNTEFDVIISHCKIRILLN 53 VV+ ++ + N++ I+ C++ + N Sbjct: 401 RVVDCLHSSRTYNSDLGDILRDCRVILATN 430 >gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan] Length = 816 Score = 156 bits (395), Expect = 3e-39 Identities = 76/262 (29%), Positives = 121/262 (46%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP+ ++ LWR+ RGC+PTR+NLQ++ VPC P C E+E HLF+ C A +W Sbjct: 507 IPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHCSANQENEWHLFYSCPAALSIWID 566 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F + L LT +++ W IW+R N KVW++ P Sbjct: 567 SGCWPRIAHIVEQGISFIDTTWKLLGHLTGSDLTSFTLMLWCIWRRRNDKVWKEGAPPPK 626 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 S + W V +W +P CN+D L + Sbjct: 627 TSIQLTEQHFHAWRSAH------RNLAQTASPVVNHRWTKPPADTFTCNVDAVLFNDSST 680 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 +G C+RD G+F A + W G+P P E EA + +A+++ + + V +ETDC V Sbjct: 681 FGFGICVRDTRGLFQTAISGWKHGLPPPHEAEAAAMLEAIQYLIHSPYDNVCVETDCKQV 740 Query: 136 VNAVNKASILNTEFDVIISHCK 71 + +N +L++E+ +II+ C+ Sbjct: 741 ADHLNSTQVLHSEYGIIINQCR 762 >dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subterraneum] Length = 539 Score = 148 bits (374), Expect = 3e-37 Identities = 79/261 (30%), Positives = 125/261 (47%) Frame = -2 Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674 P +V+ LLWR+ R C+PTR+NL+ R + C V +C ED H+FF C ++ +W C Sbjct: 228 PPRVRNLLWRICRRCVPTRVNLRSRGMNCTTVCSLCNDQDEDSRHIFFDCPSSRNVWSMC 287 Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494 +Y F+L L+ + + W+IWK+ N ++W +V+ Sbjct: 288 CFGNKIIAALHNDYAASYLIFDLLQQLSNEDASLMACVIWSIWKQRNSRIWNNVIDAQNF 347 Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314 A+ + DW ++ + +W +P G +KCNID + + Sbjct: 348 VLSRAVALINDWCDVQ----QARPDAMGQHTTTEIKWNKPANGRVKCNIDASFSSHNNRV 403 Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134 G++ CIRDE G ++ AK F + + + EA+G AL W EL L V E D LV+ Sbjct: 404 GISVCIRDEKGAYVSAKLDQFSPICDVRVGEALGFLSALSWIHELNLGPVDFELDSKLVI 463 Query: 133 NAVNKASILNTEFDVIISHCK 71 + + + TEF IISHC+ Sbjct: 464 DGFHSNNHDITEFREIISHCR 484 >dbj|GAU10454.1| hypothetical protein TSUD_423510, partial [Trifolium subterraneum] Length = 280 Score = 140 bits (353), Expect = 3e-36 Identities = 82/270 (30%), Positives = 122/270 (45%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 +P++VK LWR+A CLPTR L R + C++ +C +E ++H FF C +A WE+ Sbjct: 3 VPSRVKSFLWRMAHNCLPTRDQLATRGIHCDDTCVVCEQLMETQMHTFFACSKAVKCWEK 62 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F FF LFD L + ++ W++WK N K+WE + Sbjct: 63 INMDGLVRELLLVANNFTTMFFTLFDRLAINQQAIVAMTLWSLWKCRNMKLWEGIDTSPH 122 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 + A D L +W ++ +H W +P +KCN+D A + Sbjct: 123 MIITRAKDALYEWSTIQT-AKHPVHKGTNHDI----SWTKPPLNTVKCNVDCAFFNNNTI 177 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 G C RD G FM ++ W + E EA L ++K S G + V ETDC LV Sbjct: 178 MGYGLCFRDATGQFMHGESSWKQCFMTTAEAEATALLASIKASFAQGYQKVFFETDCKLV 237 Query: 136 VNAVNKASILNTEFDVIISHCKIRILLNSS 47 V+A+ S E IIS CK + N++ Sbjct: 238 VDALYSHSAPQNELGDIISLCKNLLSTNNN 267 >gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 406 Score = 142 bits (358), Expect = 8e-36 Identities = 77/224 (34%), Positives = 115/224 (51%), Gaps = 6/224 (2%) Frame = -2 Query: 850 NKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCR 671 N +K+ LWR+AR CLP+RMNLQ+R +P + C + E+E H+FF C A+ +W Sbjct: 193 NTMKIFLWRIARRCLPSRMNLQQRGIPRTSLCAHCSLNQENEWHIFFGCQTAESIWMTFG 252 Query: 670 XXXXXXXXXXXXXVFAYCFFNLFDSL-TEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494 F F+L +L +I CK +I W+IW+ N KVW D P + Sbjct: 253 LWPSTNAYIDNGEDFKDTIFSLISNLHHDIACK-VIIILWSIWRNRNDKVWSDTTTPPGI 311 Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQ-----WQRPEPGVLKCNIDPALID 329 + H A+ +W+ ++ + QQQ W +P PG+LKCN+D A+ Sbjct: 312 AVHKAMQRYSEWQFAKVKDKSTS----------QQQPHVNTWTKPLPGLLKCNVDAAVFK 361 Query: 328 QEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQAL 197 +E G CIR+ +G F++AK+ W G QE EA+ L +AL Sbjct: 362 EENIMGFGLCIRNADGSFIKAKSGWQHGFINFQEAEALTLLEAL 405 >gb|PNY17850.1| ribonuclease H [Trifolium pratense] Length = 363 Score = 140 bits (353), Expect = 2e-35 Identities = 76/270 (28%), Positives = 120/270 (44%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 +P KV+ L+WR+AR CLPTR+ L RHVPC +C +E + HL F+C + W+ Sbjct: 50 VPPKVRSLIWRIARNCLPTRLRLNERHVPCPINCEICNDSVESDWHLLFQCDTSIQSWQT 109 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 ++ E +I W +W N +W Sbjct: 110 EGLWPQIRDRVQRMNSAIEVVLDICSREVEAVVNRFMIIVWGLWHNRNEWIWNQKQMNPD 169 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 H +W + + +V ++W +P G LKCN+D A K Sbjct: 170 QINHWTKARWSEWNAAQ---QRRVTADATEYSSVHRRWVKPITGELKCNVDAAFHHSIDK 226 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 C+RD NG F++A + W E EA+G+WQ + W LG V+ E+D + Sbjct: 227 TSYGCCLRDSNGDFIQALSGWCNPELSVCEGEALGMWQVMSWVQNLGWSKVIFESDSKTL 286 Query: 136 VNAVNKASILNTEFDVIISHCKIRILLNSS 47 V+AVN S+ +EF V++S+ + + LN++ Sbjct: 287 VDAVNSKSVGGSEFHVLVSNIRTLLSLNNN 316 >dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subterraneum] Length = 452 Score = 142 bits (357), Expect = 2e-35 Identities = 78/242 (32%), Positives = 111/242 (45%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 +P K+K WR+ RGCLPTR NL RR V C+ + +C EDE+H F C A W+ Sbjct: 203 LPPKLKHFCWRLLRGCLPTRFNLHRRGVQCQTICALCNNATEDELHPFTDCAHAILCWKE 262 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F+ F++ S+ E K + W+IW+ N +WE+ Sbjct: 263 VNLWQSLEPQFLQSGSFSSIIFSIISSMEETKQSVFVAVLWSIWRARNECIWENKQANPV 322 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 S D + D+ H V W++P LKCN+D A+ EGK Sbjct: 323 ASCRLDFDLIRDFNWC-----HNMLNADHMPTHV-HTWEKPPTSWLKCNVDGAIFMTEGK 376 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 +G+ C RD +G F++A TM F E EA + AL ++ G E V+ E+DC V Sbjct: 377 FGIGICFRDSSGSFVQAHTMTFPFEVTAAECEATAMKHALALALSNGFERVLFESDCKQV 436 Query: 136 VN 131 VN Sbjct: 437 VN 438 >dbj|GAU47271.1| hypothetical protein TSUD_280940 [Trifolium subterraneum] Length = 780 Score = 143 bits (360), Expect = 1e-34 Identities = 78/244 (31%), Positives = 114/244 (46%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 +P K+K WR+ RGCLPTR NL RR V C+ + +C EDE+HLF C A W+ Sbjct: 521 LPPKLKHFCWRLLRGCLPTRFNLHRRGVQCQTICALCNNATEDELHLFTDCANAILCWKE 580 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F+ F++ S+ E K + + W+IW+ N +WE+ Sbjct: 581 VNLWQSLEHQFLQSGSFSSIIFSIISSMEETKQSLFAAVLWSIWRARNECIWENKQANPV 640 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 S A D + D+ H V W++P LKCN+D A+ E K Sbjct: 641 ASCRLAFDLIRDFNWC-----HNMLNAYHMPTHV-HTWEKPLVNWLKCNVDGAIFTTEAK 694 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137 +G+ C RD +G F++A TM F E EA + AL ++ E V+ E+DC V Sbjct: 695 FGIGICFRDSSGSFVQAHTMTFPFEVTAVECEATAMKHALALALSNAFERVLFESDCQQV 754 Query: 136 VNAV 125 +NA+ Sbjct: 755 MNAL 758 >gb|KYP36545.1| hypothetical protein KK1_042329 [Cajanus cajan] Length = 291 Score = 135 bits (341), Expect = 2e-34 Identities = 77/265 (29%), Positives = 121/265 (45%), Gaps = 7/265 (2%) Frame = -2 Query: 844 VKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCRXX 665 +K+ LWR+ R CLP+R LQ++ VPC +C C EA+ +W+ Sbjct: 1 MKIFLWRLLRDCLPSRQRLQQKGVPCTS---LC-------------CQEAQTVWQATGIW 44 Query: 664 XXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAVSQH 485 F+L S+++ E V IW+R N KVW+ P V+ Sbjct: 45 QHIKSFVDVGEGIVEVIFSLLGSISQSHIVEVVVTLGCIWRRRNAKVWDQGAPPSGVAIS 104 Query: 484 AALDYLCDWELL-------RL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQ 326 A + DW+ R+ P H QW++P G CNID AL Sbjct: 105 QAKQHFRDWQAAQARSSTQRIPPVHDL------------QWKKPHVGTFTCNIDAALFQD 152 Query: 325 EGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDC 146 +G + CIR+++G F+ AKT W +P E EA L A++W L L V IE+DC Sbjct: 153 SSYFGYSMCIRNDHGQFLTAKTGWAHSLPPVHEAEATALLTAIQWIENLSLTHVTIESDC 212 Query: 145 LLVVNAVNKASILNTEFDVIISHCK 71 V++A+++ ++E+ +++ C+ Sbjct: 213 KSVLDALSRTQSQHSEYGSLLNKCR 237 >gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 507 Score = 140 bits (352), Expect = 3e-34 Identities = 71/221 (32%), Positives = 104/221 (47%) Frame = -2 Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677 IP+ +K+ LWR+ R CLP+R LQ++ VPC + P C E+ H+FF C EA+ +W+ Sbjct: 292 IPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQTVWQA 351 Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497 F+L S+++ E V IW+R N KVW+ P Sbjct: 352 TGIWQHIKSLIDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQGAPPSG 411 Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317 V+ A Y DW+ + QW++P G CNID AL Sbjct: 412 VATSQAKQYFRDWQAAQ-----ARSSTQRTPPVHDLQWKKPHAGTFTCNIDAALFQDSSY 466 Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALK 194 +G + CIR+++G F+ AKT W G+P E EA L A++ Sbjct: 467 FGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQ 507