BLASTX nr result
ID: Astragalus23_contig00031492
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00031492
         (519 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte...   157   4e-41
gb|PNX68200.1| pentatricopeptide repeat-containing protein, part...   144   6e-40
dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt...   153   9e-40
dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subt...   146   2e-37
dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt...   136   2e-35
dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subt...   134   9e-35
gb|KYP66685.1| hypothetical protein KK1_012990 [Cajanus cajan]        126   7e-34
gb|PNX93528.1| pentatricopeptide repeat-containing protein, part...   125   1e-32
gb|KHN20416.1| Putative ribonuclease H protein [Glycine soja]         125   3e-32
gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca...   131   3e-32
dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subt...   124   6e-32
dbj|GAU38127.1| hypothetical protein TSUD_318140 [Trifolium subt...   125   1e-31
dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt...   125   4e-31
gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]        127   5e-31
gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]        126   1e-30
dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt...   125   2e-30
dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subt...   124   3e-30
gb|PNX70097.1| pentatricopeptide repeat-containing protein, part...   119   3e-30
dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subt...   124   5e-30
gb|PNX73669.1| ribonuclease H [Trifolium pratense]                    118   2e-29

>dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum]
          Length = 1601

 Score = 157 bits (396), Expect = 4e-41
 Identities = 80/173 (46%), Positives = 108/173 (62%), Gaps = 1/173 (0%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            + LI+N K+ GDWM IW+L+IPQ+VK+ LWRI GCLP + L +R V C DLCP CE
Sbjct: 1268 DTLINNEQYKIPGDWMLIWKLSIPQRVKIFLWRIAIGCLPTRDRLQSRGVQCTDLCPHCE 1327

Query: 337  SGVEDELHLFINCPRVAEVWQHAGL-REVISKAATYDVSNFKDVFFMLMDSL*IQQRSIF 161
            + E++ HLF++C + EVW+ A L EV S T VS KD F + +L +RS F
Sbjct: 1328 TTYENDWHLFVSCNKAHEVWREANLWDEVCSVVET--VSCIKDFIFAALAALAEPRRSEF 1385

Query: 160  AMVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQAVRETTT 2
            M+ W +WK RN+K+WE ++P V +Q A D+L+QW AR RE TT
Sbjct: 1386 VMMLWCLWKCRNDKIWEDKVQPVRVGMQLARDMLYQWRNARR------REDTT 1432

>gb|PNX68200.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense]
          Length = 220

 Score = 144 bits (362), Expect = 6e-40
 Identities = 64/165 (38%), Positives = 100/165 (60%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E L+DN+ L+V+G+W IW L IPQK+K+ LWR RGCLP ++ L + V C C C+
Sbjct: 27   ENLVDNTGLRVEGNWGKIWGLKIPQKMKVFLWRAARGCLPTRYRLQRKGVNCPHTCAYCQ 86

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E++ H+F C + E+W+ AGL +I + F +FF L++ L +F
Sbjct: 87   NNFENDWHVFFGCVKAQEIWEEAGLWSLI-EGMFESAEGFVSLFFSLLELLSQHNIILFV 145

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQ 23
            FW +WKRRN+K+WE + +VS+Q A D+++QW+ + S Q+
Sbjct: 146  AAFWCIWKRRNQKIWEDIELRPSVSLQLATDIIYQWKTTQISHQK 190

>dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum]
          Length = 1012

 Score = 153 bits (386), Expect = 9e-40
 Identities = 73/168 (43%), Positives = 105/168 (62%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E LIDN K+ GDWM IW L IPQ+VK +WR++RGCLP + L + V C DLCP CE
Sbjct: 760  ETLIDNEGYKLPGDWMQIWNLKIPQRVKKFMWRVLRGCLPTRDKLQRKGVQCTDLCPHCE 819

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E+E H+F+ C + +W AGL + I++ ++F + F M Q+ S F
Sbjct: 820  TTYENEWHVFLGCEKAKRIWIEAGLWDDIAQLVV-AANSFNSLVFSFMTVNLEQKCSDFV 878

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQAVR 14
            M+ W +WKRRNEK+WEGV KP ++S+ +A + L QW + ++Q+ VR
Sbjct: 879  MIMWCLWKRRNEKIWEGVEKPVHLSINTAREYLVQWREIK-ARQENVR 925

>dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subterraneum]
          Length = 1688

 Score = 146 bits (369), Expect = 2e-37
 Identities = 65/165 (39%), Positives = 101/165 (61%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E L+DN+ L+V+G+W IW L IPQK+K+ LWR RGCLP ++ L + V C C C+
Sbjct: 1402 ENLVDNTGLRVEGNWGKIWELKIPQKMKVFLWRAARGCLPTRYRLQQKGVNCPHTCAYCQ 1461

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E++ H+F C + E+W+ AGL I + F +FF L++ L + +F
Sbjct: 1462 NNFENDWHVFFGCVKAQEIWEEAGLWSFI-EGMFESTEGFVSLFFSLLELLSQHKIILFV 1520

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQ 23
            FW +WKRRN+K+WE + +VS+Q A D+++QW+ A+ S Q+
Sbjct: 1521 AAFWCIWKRRNQKIWEDIELHPSVSLQLASDIIYQWKTAQTSHQR 1565

>dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum]
          Length = 372

 Score = 136 bits (342), Expect = 2e-35
 Identities = 66/155 (42%), Positives = 85/155 (54%)
 Frame = -3

Query: 472  MSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCESGVEDELHLFINCPR 293
            M IW + IPQK+K+ LWR RGCLP + L TR V C D C CE E++ H+F C +
Sbjct: 1    MQIWNMKIPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNK 60

Query: 292  VAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFAMVFWSVWKRRNEKVW 113
            V EVW AGL I F +FF L++ L +FAM WS+WKRRN+K+W
Sbjct: 61   VEEVWAEAGLWSFIRDKLEI-ADGFVALFFQLLELLSQHNLHMFAMTMWSIWKRRNDKLW 119

Query: 112  EGVIKPCNVSVQSALDLLHQWE*ARNSKQQAVRET 8
Sbjct: 120  NGIETRPTVSIMLARDSLHQWQLIRQKRQHTAAVT 154

>dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subterraneum]
          Length = 343

 Score = 134 bits (336), Expect = 9e-35
 Identities = 58/157 (36%), Positives = 89/157 (56%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            +ELID S+L+V G+W +W + +P KVK L+WRI R CLP + L + V C C LC
Sbjct: 125  QELIDTSYLRVNGNWNLVWNIKVPPKVKNLIWRICRRCLPTRVRLRDKGVECTQTCALCN 184

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            ED H+F CP VW G V+S A + +N +D+ F ++ L ++FA
Sbjct: 185  EENEDSEHIFFKCPSSRNVWSMTGFFHVVSNAINNN-NNAQDIIFHILQQLSKDDSTVFA 243

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE 47
            + WS+WK+RN ++W V N + A+++L +W+
Sbjct: 244  CILWSIWKQRNNQIWNNVTDAQNFVLSRAVNMLQEWK 280

>gb|KYP66685.1| hypothetical protein KK1_012990 [Cajanus cajan]
          Length = 149

 Score = 126 bits (316), Expect = 7e-34
 Identities = 56/147 (38%), Positives = 84/147 (57%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E +I N+ L+VQGDWM +W L IP ++ LWR++RGC+P + NL + VPC CP C
Sbjct: 2    EHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHCS 61

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E+E HLF +CP +W +G I+ +S F D + L+ L + F
Sbjct: 62   ANQENEWHLFYSCPAALSIWIDSGCWPRIAHIVEQGIS-FIDTTWKLLGHLTGSDLTSFT 120

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQ 77
            ++ W +W+RRN+KVW+ P S+Q
Sbjct: 121  LMLWCIWRRRNDKVWKEGAPPPKTSIQ 147

>gb|PNX93528.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense]
          Length = 231

 Score = 125 bits (314), Expect = 1e-32
 Identities = 59/165 (35%), Positives = 88/165 (53%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            EELID +HL++ W +WRL +P KVK L+WR RGC P + + + C C +C
Sbjct: 60   EELIDTNHLRISSFWAGVWRLKVPPKVKNLIWRSCRGCFPTRVRHRDKGIDCPSNCVVCN 119

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            ED H+F CP A +W+ +GL + +AA + + FML+ +L Q + A
Sbjct: 120  DNFEDTSHVFCLCPFAASIWRDSGLWNHV-EAAVNSSNTVAETIFMLLQNLEEQNSARLA 178

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQ 23
            ++ WS+WK RN K+W V + + A LL W A+N+ Q
Sbjct: 179  VIMWSIWKHRNMKLWNRVTETKEQVLNRADHLLEDWRAAKNTTTQ 223

>gb|KHN20416.1| Putative ribonuclease H protein [Glycine soja]
          Length = 249

 Score = 125 bits (313), Expect = 3e-32
 Identities = 59/156 (37%), Positives = 88/156 (56%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E + ++SHLKV G W +W+L +P KVK+ +WR VRGCLP + L T+ V C +CPLC
Sbjct: 2    ESVFESSHLKVAGRWKDLWKLQVPNKVKVFIWRAVRGCLPTRLRLQTKGVVCTGICPLCL 61

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + +E+E H + CP W+ AG VI + +F D+ F L+ + + S
Sbjct: 62   NNLENEWHCLVACPSNLVCWKLAGFWNVI-RVQVDSADSFDDLIFRLLARISKAKISQVV 120

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQW 50
            M+ W +W R N KVWE + +V+V+ A + L W
Sbjct: 121  MLMWVLWWRSNGKVWEDADRSPSVTVRRATNCLTDW 156

>gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan]
          Length = 816

 Score = 131 bits (330), Expect = 3e-32
 Identities = 63/167 (37%), Positives = 91/167 (54%), Gaps = 1/167 (0%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E +I N+ L+VQGDWM +W L IP ++ LWR++RGC+P + NL + VPC CP C
Sbjct: 485  EHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHCS 544

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E+E HLF +CP +W +G I+ +S F D + L+ L + F
Sbjct: 545  ANQENEWHLFYSCPAALSIWIDSGCWPRIAHIVEQGIS-FIDTTWKLLGHLTGSDLTSFT 603

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*A-RNSKQQA 20
            ++ W +W+RRN+KVW+ P S+Q H W A RN Q A
Sbjct: 604  LMLWCIWRRRNDKVWKEGAPPPKTSIQLTEQHFHAWRSAHRNLAQTA 650

>dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subterraneum]
          Length = 249

 Score = 124 bits (311), Expect = 6e-32
 Identities = 62/150 (41%), Positives = 80/150 (53%)
 Frame = -3

Query: 457  LTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCESGVEDELHLFINCPRVAEVW 278
            + IPQKVK+ LWR RGCLP + L TR V C D C CE E++ H+F C +V EVW
Sbjct: 1    MKIPQKVKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVW 60

Query: 277  QHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFAMVFWSVWKRRNEKVWEGVIK 98
            A L I F +FF L++ L +FAM W +WKRRN+K+W G+
Sbjct: 61   AEARLWSFIRDKLEI-ADGFVALFFQLLELLSQHNLHMFAMTMWCIWKRRNDKLWNGIET 119

Query: 97   PCNVSVQSALDLLHQWE*ARNSKQQAVRET 8
            VS+ A D LHQW+ R +Q T
Sbjct: 120  RPTVSIMLACDSLHQWQLIRQKRQHTAAVT 149

>dbj|GAU38127.1| hypothetical protein TSUD_318140 [Trifolium subterraneum]
          Length = 359

 Score = 125 bits (315), Expect = 1e-31
 Identities = 56/167 (33%), Positives = 94/167 (56%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E+ +D SHL+VQG+W IW++ P K+K +WR+ R C+P + L + V C C LC+
Sbjct: 128  EDTLDISHLQVQGNWNLIWQIQAPPKIKNFIWRLCRNCIPTRTRLLQKGVNCPCNCVLCD 187

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            ED LH+F+ C V + W + GL +I + T D +N ++ F ++ L +Q SIF
Sbjct: 188  DETEDSLHVFMFCDTVKQAWYNTGLWPIIQQRLTGD-NNMAELVFSILQVLASEQMSIFV 246

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQAV 17
            V WS+W+ RN K+W + + A +L +W+ + + +++
Sbjct: 247  TVLWSMWQSRNNKLWRNQTETASAVYDRACTVLTEWQSVQAEQTESI 293

>dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum]
          Length = 395

 Score = 125 bits (314), Expect = 4e-31
 Identities = 57/161 (35%), Positives = 87/161 (54%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            +ELID SHL+V GDW +W++ P KVK L+WRI R C+ + L + V C +LC LC
Sbjct: 60   QELIDTSHLRVNGDWNLLWKIKAPPKVKNLIWRICRRCVSTRARLQDKGVNCPNLCALCN 119

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            ED LH+F CP VW +V+S + + + F ++ L + ++FA
Sbjct: 120  IEGEDSLHVFFKCPSSQNVWSMTSFFQVVSSVINNE-NEASAIVFQILRQLSKEDAALFA 178

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARN 35
            + WS+WK+RN ++W V + A ++L +W RN
Sbjct: 179  CILWSIWKQRNNQIWNNVTDAQSFVFSRANNMLQEWNTVRN 219

>gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]
          Length = 536

 Score = 127 bits (318), Expect = 5e-31
 Identities = 61/167 (36%), Positives = 90/167 (53%), Gaps = 1/167 (0%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E +I N+ L+VQGDWM +W L IP ++ LWR++RGC+P NL + V C CP C
Sbjct: 330  EHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHCS 389

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E+E HLF +CP +W +G I++ +S F D + L+ L + F
Sbjct: 390  ANQENEWHLFYSCPAAISIWIDSGCWPRIARIVEQGIS-FIDTTWKLLGHLTSSDLTSFT 448

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*A-RNSKQQA 20
            ++ W +W+ RN+KVW+ P S+Q H W+ A RN Q A
Sbjct: 449  LMLWCIWRWRNDKVWKESAPPPRTSIQLTEQHFHAWQSAHRNLTQNA 495

>gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]
          Length = 547

 Score = 126 bits (316), Expect = 1e-30
 Identities = 61/167 (36%), Positives = 90/167 (53%), Gaps = 1/167 (0%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E +I N+ L+VQGDWM +W L IP ++ LWR++RGC+P NL + V C CP C
Sbjct: 341  EHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHCS 400

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            + E+E HLF +CP +W +G I++ +S F D + L+ L + F
Sbjct: 401  ANQENEWHLFYSCPAAISIWIDSGCWPRIARIVEQGIS-FIDTTWKLLGHLTGSDLTSFT 459

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*A-RNSKQQA 20
            ++ W +W+ RN+KVW+ P S+Q H W+ A RN Q A
Sbjct: 460  LMLWCIWRWRNDKVWKESAPPPRTSIQLTEQHFHAWQSAHRNLTQNA 506

>dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum]
          Length = 479

 Score = 125 bits (313), Expect = 2e-30
 Identities = 64/162 (39%), Positives = 89/162 (54%)
 Frame = -3

Query: 505  DNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCESGVE 326
            DNS + G+W IWR IP KVK LLWRI R LP + L +R V C C +C E
Sbjct: 150  DNSG--IAGNWHQIWRAKIPPKVKNLLWRIGRNVLPTRATLNSRSVQCLVHCAVCNDSAE 207

Query: 325  DELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFAMVFW 146
            D +H+ CPR E WQ AGL I A +N D+ ++ SL +Q+ IF+++ W
Sbjct: 208  DSIHILFLCPRSTECWQQAGLWNQID-AGLNTSNNIADILLFILQSLNKEQQEIFSVLLW 266

Query: 145  SVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQA 20
            S+WKRRN KVW+ + + + A LL W+ A+ ++ A
Sbjct: 267  SIWKRRNAKVWDNITESNTNVYERAQHLLTSWKQAQQTRSYA 308

>dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subterraneum]
          Length = 474

 Score = 124 bits (311), Expect = 3e-30
 Identities = 54/156 (34%), Positives = 86/156 (55%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            +E +D SHLK+ GDW IW+L +P +VK L+WR+ R C+P + NL R V C +C LC
Sbjct: 140  QEELDTSHLKMTGDWNLIWKLKVPPRVKNLVWRVCRQCIPTRTNLQNRGVNCTTVCALCN 199

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            ED H+F +C + +W VI+ A + ++ F ++ L + + ++ A
Sbjct: 200  EYDEDSGHIFFDCLSSSNIWSMCTFNHVIT-AGLQHYAGVTELIFAVLQQLNVDEAALMA 258

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQW 50
            + WS+WK+RN ++W V +V A+ LH W
Sbjct: 259  CIIWSIWKQRNNQIWNNVTDAQSVVFSRAVTTLHDW 294

>gb|PNX70097.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense]
          Length = 221

 Score = 119 bits (297), Expect = 3e-30
 Identities = 59/156 (37%), Positives = 82/156 (52%)
 Frame = -3

Query: 487  VQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCESGVEDELHLF 308
            V G W +IWR IP KVK L+WRI R LP + L +R V C C +C G ED +H+
Sbjct: 46   VAGQWNNIWRAKIPPKVKNLIWRIGRDVLPTRKKLISRGVQCPTHCDVCNDGDEDSMHVL 105

Query: 307  INCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFAMVFWSVWKRR 128
            +C R + WQ AGL I + +N + F ++ L Q+ F ++ WS+WKRR
Sbjct: 106  FSCTRSIQCWQRAGLWTHIGTGLVAN-NNIAENLFSILHRLNKDQQEFFCVLVWSIWKRR 164

Query: 127  NEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQA 20
            N KVWE V ++ A L+ W A+ +Q A
Sbjct: 165  NNKVWENVTDSDQTVIERAKHLITSWRNAQQIRQSA 200

>dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subterraneum]
          Length = 539

 Score = 124 bits (311), Expect = 5e-30
 Identities = 61/171 (35%), Positives = 92/171 (53%)
 Frame = -3

Query: 514  ELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCES 335
            EL+D SHL++ G W IW+L P +V+ LLWRI R C+P + NL +R + C +C LC
Sbjct: 206  ELLDTSHLRMDGTWNLIWKLNAPPRVRNLLWRICRRCVPTRVNLRSRGMNCTTVCSLCND 265

Query: 334  GVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFAM 155
            ED H+F +CP VW I A D + + F L+ L + S+ A
Sbjct: 266  QDEDSRHIFFDCPSSRNVWSMCCFGNKIIAALHNDYA-ASYLIFDLLQQLSNEDASLMAC 324

Query: 154  VFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQWE*ARNSKQQAVRETTT 2
            V WS+WK+RN ++W VI N + A+ L++ W + ++ A+ + TT
Sbjct: 325  VIWSIWKQRNSRIWNNVIDAQNFVLSRAVALINDWCDVQQARPDAMGQHTT 375

>gb|PNX73669.1| ribonuclease H [Trifolium pratense]
          Length = 275

 Score = 118 bits (296), Expect = 2e-29
 Identities = 55/156 (35%), Positives = 86/156 (55%)
 Frame = -3

Query: 517  EELIDNSHLKVQGDWMSIWRLTIPQKVKLLLWRIVRGCLPCKFNLCTRRVPCDDLCPLCE 338
            E +IDN+HL+ +GD M IW+L +P +VK+ +WR + GCLP + L + V C+ CP C
Sbjct: 93   EAIIDNTHLRFEGDSMKIWKLKVPNRVKIFIWRTLGGCLPVR--LLQKGVQCEPNCPCCA 150

Query: 337  SGVEDELHLFINCPRVAEVWQHAGLREVISKAATYDVSNFKDVFFMLMDSL*IQQRSIFA 158
            S E+E H FI C EVW+ G E + + ++ + ++FF L+ L ++ +
Sbjct: 151  SATENECHCFIGCDVAQEVWREMGDWETMEQ-YVWNAQGYVELFFTLLQDLDSERMARNV 209

Query: 157  MVFWSVWKRRNEKVWEGVIKPCNVSVQSALDLLHQW 50
            M W +W RRN+K W + + Q A + L W
Sbjct: 210  MTLWMIWWRRNQKCWHDNLHSTSEVKQRATese L DW 245