BLASTX nr result
ID: Astragalus23_contig00017214
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00017214 (448 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt... 77 2e-13 gb|AIK35195.1| LINE-type retrotransposon LIb DNA [Ipomoea batatas] 75 6e-13 dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] 74 2e-12 dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] 74 2e-12 dbj|BAE79384.1| unnamed protein product [Ipomoea batatas] 74 2e-12 ref|XP_019150780.1| PREDICTED: uncharacterized protein LOC109147... 65 2e-09 ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa c... 65 3e-09 ref|XP_019163505.1| PREDICTED: uncharacterized protein LOC109159... 64 5e-09 ref|XP_019168955.1| PREDICTED: uncharacterized protein LOC109164... 64 9e-09 gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family ... 63 9e-09 dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subte... 63 1e-08 gb|KYP46130.1| Putative ribonuclease H protein At1g65750 family ... 62 2e-08 gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family,... 62 4e-08 ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanu... 62 4e-08 gb|KYP33975.1| Putative ribonuclease H protein At1g65750 family ... 60 1e-07 gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus... 60 1e-07 ref|XP_019162015.1| PREDICTED: uncharacterized protein LOC109158... 60 1e-07 gb|KYP55672.1| Putative ribonuclease H protein At1g65750 family,... 60 1e-07 gb|PRQ57683.1| putative RNA-directed DNA polymerase [Rosa chinen... 60 1e-07 dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ... 59 2e-07 >dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum] Length = 1250 Score = 76.6 bits (187), Expect = 2e-13 Identities = 40/110 (36%), Positives = 61/110 (55%), Gaps = 1/110 (0%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N ER RR + +C C + E+L H+F+DCN + + N ++F +W L Sbjct: 948 NMERLRRKMTASKVCSRCNLQDESLLHVFRDCNFSKSIWQNLNVQNRRSFFHENDWHQWL 1007 Query: 232 CSNLRKVQGV-DHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKA 378 +NL + G D + W L FA ILD IW SRN F+F++K +N +I+ +A Sbjct: 1008 LTNLSGMVGSKDEATWSLKFAIILDKIWYSRNSFIFSHKEINIFTIIAQA 1057 >gb|AIK35195.1| LINE-type retrotransposon LIb DNA [Ipomoea batatas] Length = 1836 Score = 75.5 bits (184), Expect = 6e-13 Identities = 43/120 (35%), Positives = 69/120 (57%), Gaps = 7/120 (5%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQ-AFCTSYN---- 216 N ER RR +++ CP CG ET+ HLF+ C+ + NC + + P AF S++ Sbjct: 1534 NSERRRRGLLEAATCPSCGTNDETIDHLFRSCDVAV---NCWEAAAPPTAFMYSFHLPVT 1590 Query: 217 -WQDRLC-SNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQS 390 W ++ C SN +G+ +W LIF +IL N+W+ RN+ VF N C N +I+ A++++ Sbjct: 1591 VWMEKSCASNQTNGRGI---SWRLIFPYILWNMWKGRNNQVFNNVCTNGNAIVKIAEQEA 1647 >dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 74.3 bits (181), Expect = 2e-12 Identities = 48/123 (39%), Positives = 67/123 (54%), Gaps = 7/123 (5%) Frame = +1 Query: 49 INYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMS-NPQAFCTS----- 210 +N ER+RR + D CP+CG E ETL HLF+ C L+ C + P F TS Sbjct: 1063 VNVERKRRGLADAASCPVCGEEDETLDHLFRRC---LLAEACWDSAVPPLTFQTSNHLHM 1119 Query: 211 YNWQDRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKA-KEQ 387 ++W CS+ +K G +NW LIF +IL N+W++RN VF N IL ++ E Sbjct: 1120 HSWMKAACSSQQK-DGYS-TNWSLIFPYILWNLWKARNRLVFDNNITAPSDILNRSFMES 1177 Query: 388 SQA 396 S+A Sbjct: 1178 SEA 1180 >dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 73.9 bits (180), Expect = 2e-12 Identities = 48/123 (39%), Positives = 67/123 (54%), Gaps = 7/123 (5%) Frame = +1 Query: 49 INYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMS-NPQAFCTS----- 210 +N ER+RR + D CP+CG E ETL HLF+ C L+ C + P F TS Sbjct: 1063 VNVERKRRGLADAASCPVCGEEDETLDHLFRRC---LLAEACWDSAVPPLTFQTSNHLHM 1119 Query: 211 YNWQDRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKA-KEQ 387 ++W CS+ +K G +NW LIF +IL N+W++RN VF N IL ++ E Sbjct: 1120 HSWMKAACSSQQK-DGYG-TNWSLIFPYILWNLWKARNRLVFDNNITAPSDILNRSFMES 1177 Query: 388 SQA 396 S+A Sbjct: 1178 SEA 1180 >dbj|BAE79384.1| unnamed protein product [Ipomoea batatas] Length = 1898 Score = 73.9 bits (180), Expect = 2e-12 Identities = 48/123 (39%), Positives = 67/123 (54%), Gaps = 7/123 (5%) Frame = +1 Query: 49 INYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMS-NPQAFCTS----- 210 +N ER+RR + D CP+CG E ETL HLF+ C L+ C + P F TS Sbjct: 1595 VNVERKRRGLADAASCPVCGEEDETLDHLFRRC---LLAEACWDSAVPPLTFQTSNHLHM 1651 Query: 211 YNWQDRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKA-KEQ 387 ++W CS+ +K G +NW LIF +IL N+W++RN VF N IL ++ E Sbjct: 1652 HSWMKAACSSQQK-DGYG-TNWSLIFPYILWNLWKARNRLVFDNNITAPSDILNRSFMES 1709 Query: 388 SQA 396 S+A Sbjct: 1710 SEA 1712 >ref|XP_019150780.1| PREDICTED: uncharacterized protein LOC109147628 [Ipomoea nil] Length = 1232 Score = 65.1 bits (157), Expect = 2e-09 Identities = 39/116 (33%), Positives = 58/116 (50%), Gaps = 5/116 (4%) Frame = +1 Query: 49 INYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYN---- 216 +N ER +R + CP CG E E+L HL + C + +C S+P S + Sbjct: 930 VNAERAKRGLTSDASCPRCGLEEESLDHLLRRCR---LTNDCWNSSSPPVLAISNHLPLS 986 Query: 217 -WQDRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAK 381 W ++ CS K+Q + W L+F IL NIW++RN+ VF N IL +A+ Sbjct: 987 QWIEKACSG--KMQSSMNDRWHLLFPHILWNIWKARNEVVFDNFWPTTTEILKRAR 1040 >ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa chinensis] Length = 1296 Score = 64.7 bits (156), Expect = 3e-09 Identities = 45/126 (35%), Positives = 63/126 (50%), Gaps = 5/126 (3%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N R +R++ D D CPIC +E+L HLFKDC L + N + P F S +W+ L Sbjct: 985 NAHRVKRNLTDDDTCPICRCNSESLSHLFKDCPAALNVWNSFTLPQPVKFTFSMSWEGWL 1044 Query: 232 CSNLRKVQGVDHSN-WPLIFAFILDNIWRSRNDFVFTN--KCVNFRSILLKAK--EQSQA 396 +NL + N W FAFI IW+ RN +F + N +++ A E S A Sbjct: 1045 QANLFCKAKCNAGNPWCSTFAFICWFIWKWRNKHIFEAHFQIPNHPGMVINAAIFEWSNA 1104 Query: 397 YLHSNL 414 L S+L Sbjct: 1105 QLKSDL 1110 >ref|XP_019163505.1| PREDICTED: uncharacterized protein LOC109159849 [Ipomoea nil] Length = 1316 Score = 64.3 bits (155), Expect = 5e-09 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 10/126 (7%) Frame = +1 Query: 49 INYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYN---- 216 +N ER+RR + CP CG E ETL HLF+ C RNC +++P + N Sbjct: 1013 VNTERKRRGLTIDSSCPRCGAEEETLDHLFRQCED---SRNCWSITSPLRNFNASNHLPI 1069 Query: 217 --WQDRLCSNLRKVQGVDHS---NWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKA- 378 W ++ C+ GV +S W +F ++L NIW++RN+ F N+ I+ +A Sbjct: 1070 GCWIEQKCAG-----GVGYSPNLKWRSLFPYVLWNIWKARNNVTFNNQITPSPVIIKRAC 1124 Query: 379 KEQSQA 396 E S+A Sbjct: 1125 SEASEA 1130 >ref|XP_019168955.1| PREDICTED: uncharacterized protein LOC109164864 [Ipomoea nil] Length = 1694 Score = 63.5 bits (153), Expect = 9e-09 Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 3/116 (2%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMS---NPQAFCTSYNWQ 222 N ER RR + +C C G E+L H+F+ C+ L +Q N AF W Sbjct: 1392 NSERSRRGLSTDAICQRCDGVDESLDHIFRRCDFALDCWANSQAPASFNATAFTPLSLWI 1451 Query: 223 DRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQS 390 C +R + +W IF ++L +IWR+RND +F N+ V +L+AK+++ Sbjct: 1452 KDNCDMMRNA--TNQGSWTTIFPYLLWSIWRARNDIIFNNRMVQVTETVLRAKKEA 1505 >gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 289 Score = 62.8 bits (151), Expect = 9e-09 Identities = 34/112 (30%), Positives = 56/112 (50%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N R RR + C+LCP+C + +T FH+ +DC +L + + F + Q + Sbjct: 100 NENRSRRRMAKCNLCPVCQSQPKTTFHVLRDCPPTELLWRKLLFQSHETFFDDMDIQLWI 159 Query: 232 CSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQ 387 NL + +W + F+ ++D IWR RN+ VF +K SI+L K + Sbjct: 160 LHNLDD-YSIKRGSWNIDFSVMVDLIWRRRNELVFLDKW-ELNSIILTKKSR 209 >dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subterraneum] Length = 482 Score = 63.2 bits (152), Expect = 1e-08 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 10/135 (7%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSY-----N 216 N ER R++ + DLCP C E++ H +DC T + NP+ + + N Sbjct: 205 NEERRHRNMTNSDLCPRCQDYPESIMHCLRDCEDAREF--WTNIINPEVWSKFFSIGLNN 262 Query: 217 WQDRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQ--S 390 W D SN G D +NW + F ++ +W+ RN VF+N R++L K Q S Sbjct: 263 WLDWNLSNDNI--GNDGNNWSIFFGVAVNELWKDRNSLVFSNISGIDRNLLFKINTQVSS 320 Query: 391 QAYLHS---NLVLKQ 426 LHS NLV +Q Sbjct: 321 IINLHSFQKNLVTRQ 335 >gb|KYP46130.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 262 Score = 61.6 bits (148), Expect = 2e-08 Identities = 34/112 (30%), Positives = 54/112 (48%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N R RR + C+LCP+C + ET FH+ +DC +L + + F + Q + Sbjct: 48 NENRSRRRMAQCNLCPVCQSQPETTFHVLRDCPPTELLWRKLLFQSHETFFGDMDIQLWI 107 Query: 232 CSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQ 387 N + +W + F+ ++D IWR RN+ VF K SI+L K + Sbjct: 108 LHNF-DGYSIKRGSWNIDFSVMVDLIWRRRNELVFLEKW-ELNSIILTNKSR 157 >gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 930 Score = 61.6 bits (148), Expect = 4e-08 Identities = 34/112 (30%), Positives = 54/112 (48%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N R RR + C+LCP+C + ET FH+ +DC +L + + F + Q + Sbjct: 716 NENRSRRRMAQCNLCPVCQSQPETTFHVLRDCPPTELLWRKLLFQSHETFFGDMDIQLWI 775 Query: 232 CSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQ 387 N + +W + F+ ++D IWR RN+ VF K SI+L K + Sbjct: 776 LHNF-DGYSIKRGSWNIDFSVMVDLIWRRRNELVFLEKW-ELNSIILTNKSR 825 >ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanus cajan] Length = 1032 Score = 61.6 bits (148), Expect = 4e-08 Identities = 34/112 (30%), Positives = 54/112 (48%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYNWQDRL 231 N R RR + C+LCP+C + ET FH+ +DC +L + + F + Q + Sbjct: 818 NENRSRRRMAQCNLCPVCQSQPETTFHVLRDCPPTELLWRKLLFQSHETFFGDMDIQLWI 877 Query: 232 CSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCVNFRSILLKAKEQ 387 N + +W + F+ ++D IWR RN+ VF K SI+L K + Sbjct: 878 LHNF-DGYSIKRGSWNIDFSVMVDLIWRRRNELVFLEKW-ELNSIILTNKSR 927 >gb|KYP33975.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 300 Score = 59.7 bits (143), Expect = 1e-07 Identities = 34/109 (31%), Positives = 59/109 (54%), Gaps = 3/109 (2%) Frame = +1 Query: 94 CPICGGETETLFHLFKDCNRILMLRNCTQMSN-PQAFCTSYNWQDRLCSNLRKVQGVDHS 270 CP+C ETE+ FH +DC + + T + P++F N D + +NL++ + H Sbjct: 17 CPVCMQETESNFHALRDCKFAAEIWSRTSGGSLPRSFAED-NIHDWVHANLKE-RRPSHV 74 Query: 271 NWPLIFAFILDNIWRSRNDFVFTNKCVNFRSIL--LKAKEQSQAYLHSN 411 NWP++FA LD++W RN VF N + ++ + A+ + +H+N Sbjct: 75 NWPILFAVTLDSLWIRRNKMVFDNSFSSSEQVVKEINARVTTIVSIHTN 123 >gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus officinalis] Length = 320 Score = 59.7 bits (143), Expect = 1e-07 Identities = 37/98 (37%), Positives = 49/98 (50%), Gaps = 3/98 (3%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSY--NWQD 225 N ER RRH+ + D CP C E E HLF+DC + + T++ P ++ Y N+ Sbjct: 85 NAERWRRHLTEDDACPCCSSEPELALHLFRDCGVVTDI--WTKLKPPFSWTEFYGSNYAQ 142 Query: 226 RLCSN-LRKVQGVDHSNWPLIFAFILDNIWRSRNDFVF 336 L N L V W IFA L NIW+ RN +VF Sbjct: 143 WLRLNLLHSGPSVKDKAWASIFAVALWNIWKWRNSWVF 180 >ref|XP_019162015.1| PREDICTED: uncharacterized protein LOC109158583 [Ipomoea nil] Length = 1138 Score = 60.1 bits (144), Expect = 1e-07 Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 3/103 (2%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSY---NWQ 222 N ER RRH+ D CP C E E+L HLF DC+ I + T F + +W Sbjct: 841 NAERFRRHLADSGRCPCCDLEEESLKHLFWDCSAIQTTWHLTDTPTCFGFPVHWSIEHWI 900 Query: 223 DRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVFTNKCV 351 + CS + DHS WP FAF L + W++RN + F N+ V Sbjct: 901 ETNCSIKQ-----DHS-WPSRFAFTLWSRWKNRNAWTFQNQKV 937 >gb|KYP55672.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 327 Score = 59.7 bits (143), Expect = 1e-07 Identities = 34/109 (31%), Positives = 59/109 (54%), Gaps = 3/109 (2%) Frame = +1 Query: 94 CPICGGETETLFHLFKDCNRILMLRNCTQMSN-PQAFCTSYNWQDRLCSNLRKVQGVDHS 270 CP+C ETE+ FH +DC + + T + P++F N D + +NL++ + H Sbjct: 44 CPVCMQETESNFHALRDCKFAAEIWSRTSGGSLPRSFAED-NIHDWVHANLKE-RRPSHV 101 Query: 271 NWPLIFAFILDNIWRSRNDFVFTNKCVNFRSIL--LKAKEQSQAYLHSN 411 NWP++FA LD++W RN VF N + ++ + A+ + +H+N Sbjct: 102 NWPILFAVTLDSLWIRRNKMVFDNSFSSSEQVVKEINARVTTIVSIHTN 150 >gb|PRQ57683.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1643 Score = 60.1 bits (144), Expect = 1e-07 Identities = 36/98 (36%), Positives = 51/98 (52%), Gaps = 3/98 (3%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRILMLRNCTQMSNPQAFCTSYN--WQD 225 N +R RRH+ D CPIC +E+L HLFKDC + + N S P++ ++N W+ Sbjct: 1332 NVQRARRHLTLDDSCPICHTTSESLSHLFKDCPAVYRIWN--SFSLPESVGNTFNMDWEG 1389 Query: 226 RLCSNLRKVQGVDHS-NWPLIFAFILDNIWRSRNDFVF 336 L ++L W +F FI IW+ RN FVF Sbjct: 1390 WLNAHLHCTTKTSVGVQWCSVFVFICWYIWKWRNKFVF 1427 >dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 676 Score = 59.3 bits (142), Expect = 2e-07 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 3/98 (3%) Frame = +1 Query: 52 NYERERRHVIDCDLCPICGGETETLFHLFKDCNRI--LMLRNCTQMSNPQAFCTS-YNWQ 222 N ER RRH+ D D+CP+C G +E+L H+ +DC + + +R M + F TS W Sbjct: 371 NAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVMEQRRFFETSLLEW- 429 Query: 223 DRLCSNLRKVQGVDHSNWPLIFAFILDNIWRSRNDFVF 336 + NL++ + +WP +FA + W+ R +VF Sbjct: 430 --MYGNLKERSDSERRSWPTLFALTVWWGWKWRCGYVF 465