BLASTX nr result
ID: Akebia26_contig00032986
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00032986 (529 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_004141415.1| PREDICTED: uncharacterized protein LOC101212...  110  3e-22
ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211...  108  6e-22
emb|CAN76658.1| hypothetical protein VITISV_025162 [Vitis vinifera]  96  7e-18
emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera]  95  9e-18
ref|XP_006589931.1| PREDICTED: uncharacterized protein LOC102669...  90  4e-16
gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana]  89  8e-16
emb|CBI24911.3| unnamed protein product [Vitis vinifera]  88  1e-15
ref|XP_002278097.1| PREDICTED: uncharacterized protein LOC100260...  88  1e-15
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...  88  1e-15
dbj|BAF00783.1| hypothetical protein [Arabidopsis thaliana]  87  2e-15
ref|XP_006584195.1| PREDICTED: uncharacterized protein LOC102662...  87  3e-15
ref|XP_006587971.1| PREDICTED: uncharacterized protein LOC102669...  86  5e-15
gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]  86  7e-15
ref|XP_004161031.1| PREDICTED: uncharacterized protein LOC101230...  84  2e-14
dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis t...  82  6e-14
gb|AAC62795.1| contains similarity to retroviral aspartyl protea...  82  8e-14
ref|XP_007023091.1| Uncharacterized protein TCM_027093 [Theobrom...  82  1e-13
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...  81  1e-13
emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsi...
80 3e-13 ref|XP_006589830.1| PREDICTED: uncharacterized protein LOC102661... 80 4e-13 >ref|XP_004141415.1| PREDICTED: uncharacterized protein LOC101212632 [Cucumis sativus] gi|449449869|ref|XP_004142687.1| PREDICTED: uncharacterized protein LOC101213831 [Cucumis sativus] Length = 440 Score = 110 bits (274), Expect = 3e-22 Identities = 62/161 (38%), Positives = 91/161 (56%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKYWIINSGATS 191 LN QY L+ ML THL+ + G + +G C S ++N P WII+SGA+S Sbjct: 273 LNSDQYTQLLDMLQTHLNTPQNGENFKNETTHIAGTCLSNSLND----PLTWIIDSGASS 328 Query: 192 HICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRFNLISV 371 HIC ++F F + +N F L R+ V V +++ L+L+DVLYIP F++NL+SV Sbjct: 329 HICHDKFMFTNLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSV 388 Query: 372 SSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVL 494 S+L K I F +C++QD LK IGK++ Y+L Sbjct: 389 STLLKDDKFAISFADSNCLIQDKWLLKTIGKAELTNGLYLL 429 >ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211618 [Cucumis sativus] Length = 2085 Score = 108 bits (271), Expect = 6e-22 Identities = 62/161 (38%), Positives = 90/161 (55%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKYWIINSGATS 191 LN QY L+ ML THL + G + +G C S ++N P WII+SGA+S Sbjct: 1617 LNSDQYTQLLGMLQTHLHTPQNGENFKNETTHIAGTCLSNSLND----PLTWIIDSGASS 1672 Query: 192 HICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRFNLISV 371 HIC ++F F + +N F L R+ V V +++ L+L+DVLYIP F++NL+SV Sbjct: 1673 HICHDKFMFTNLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSV 1732 Query: 372 SSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVL 494 S+L K I F +C++QD LK IGK++ Y+L Sbjct: 1733 STLLKDDKFAISFADSNCLIQDKWLLKTIGKAELTNGLYLL 1773 >emb|CAN76658.1| hypothetical protein VITISV_025162 [Vitis vinifera] Length = 645 Score = 95.5 bits (236), Expect = 7e-18 Identities = 55/168 (32%), Positives = 96/168 (57%), Gaps = 6/168 (3%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAK-IGIEADVISAQASG-----ICFSAAINSIFNFPKYWII 173 L +Q Q L+ +L++ L + + +E+ A + FS++ + P 
W++ Sbjct: 253 LASNQCQQLIALLSSQLHRSTTVTLESQEQGASSGSNFLGKYYFSSSFSHNSIPPNSWVL 312 Query: 174 NSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFR 353 ++GAT H+C + F T +NS TL N + +P++ +++L + L++ DVLY+ QFR Sbjct: 313 DTGATHHVCISLHLFKTSLLSRNSNVTLPNGHFVPINRIGSIELFTGLVVDDVLYVLQFR 372 Query: 354 FNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 FNL S+S+LT+ H + FL++SC++QD KMIG + Y+LD Sbjct: 373 FNLFSISALTQFHHCSVHFLSESCLIQDRMQEKMIGMGSCFXNLYILD 420 >emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera] Length = 1031 Score = 95.1 bits (235), Expect = 9e-18 Identities = 58/169 (34%), Positives = 88/169 (52%), Gaps = 5/169 (2%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQAS-----GICFSAAINSIFNFPKYWIIN 176 L Q+ L+ +L+ H S D Q S GI + +S N P WI++ Sbjct: 319 LTHDQHNQLLALLSLHSSSGSSTSFGDSNPLQQSISNFTGILSLSPSSSTLN-PSIWILD 377 Query: 177 SGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRF 356 SGAT H+C N FH++ ++ TL +IP+ T+ L+ L+L+ VLYIP F+F Sbjct: 378 SGATHHVCTNSSMFHSIHSFSSNTVTLPTGTKIPITGIGTIHLSPHLVLEHVLYIPTFQF 437 Query: 357 NLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLDAN 503 NLIS+S+LT+ F C +QD K+IG +R + Y+LD++ Sbjct: 438 NLISISALTQTNCFSFDFTAHFCFIQDHSQGKLIGMGRRQGNLYLLDSS 486 >ref|XP_006589931.1| PREDICTED: uncharacterized protein LOC102669127 [Glycine max] Length = 656 Score = 89.7 bits (221), Expect = 4e-16 Identities = 55/160 (34%), Positives = 89/160 (55%), Gaps = 5/160 (3%) Frame = +3 Query: 6 QMLNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGI----CFSAAINSIFNFPKYWII 173 Q + Q Q ++ +L + A++ + S+ +GI ++ +NS+F P WI+ Sbjct: 316 QSWSQQQCQQVLALLQAQM--AQLPSASQQESSDTTGIEIIGMSNSTLNSLFQAPNAWIV 373 Query: 174 NSGATSHI-CFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQF 350 +SGAT+HI CF +F F+ K LKN F L N R+ V + +V++NS ++L +VLYIP F Sbjct: 374 DSGATTHITCFPEFLFN-FKFLKNKFVLLSNGTRVQVVGTGSVRINSRIVLHNVLYIPSF 432 Query: 351 RFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSK 470 NLISV +T + + F + QD + K IG+ + Sbjct: 433 HVNLISVPKITPNCKIGVLFEDTFVLFQDTQTHKTIGRGE 472 
>gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana] Length = 1436 Score = 88.6 bits (218), Expect = 8e-16 Identities = 41/118 (34%), Positives = 73/118 (61%) Frame = +3 Query: 165 WIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIP 344 W+I+SGA+ H+ + +HT K L +F L N + + + + ++L L L +VL+IP Sbjct: 421 WVIDSGASHHVTHERNLYHTYKALDRTFVRLPNGHTVKIEGTGFIQLTDALSLHNVLFIP 480 Query: 345 QFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLDANFSILN 518 +F+FNL+SVS LTK + F +D C++Q L M+GK ++ + Y+L+ + S+++ Sbjct: 481 EFKFNLLSVSVLTKTLQSKVSFTSDECMIQALTKELMLGKGSQVGNLYILNLDKSLVD 538 >emb|CBI24911.3| unnamed protein product [Vitis vinifera] Length = 382 Score = 88.2 bits (217), Expect = 1e-15 Identities = 46/129 (35%), Positives = 77/129 (59%) Frame = +3 Query: 111 SGICFSAAINSIFNFPKYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFS 290 SG+ + I S N WI++SGAT H+C+++ +F P NSF L N + +PV + Sbjct: 222 SGLLSGSTIPSSSNSSSLWILDSGATHHVCYSRASFEPFTPTFNSFVALPNGHTVPVGGT 281 Query: 291 RTVKLNSCLLLQDVLYIPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSK 470 +V+L + L LQ+VL++PQF NL+S+S+LT+ + F +D +++D + IG + Sbjct: 282 GSVRLCNDLTLQNVLFVPQFHCNLLSISALTQQHPYVVSFHSDHYVIEDPTQGRKIGIGR 341 Query: 471 RIADHYVLD 497 + + Y LD Sbjct: 342 QANNLYTLD 350 >ref|XP_002278097.1| PREDICTED: uncharacterized protein LOC100260149 [Vitis vinifera] Length = 359 Score = 88.2 bits (217), Expect = 1e-15 Identities = 46/129 (35%), Positives = 77/129 (59%) Frame = +3 Query: 111 SGICFSAAINSIFNFPKYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFS 290 SG+ + I S N WI++SGAT H+C+++ +F P NSF L N + +PV + Sbjct: 222 SGLLSGSTIPSSSNSSSLWILDSGATHHVCYSRASFEPFTPTFNSFVALPNGHTVPVGGT 281 Query: 291 RTVKLNSCLLLQDVLYIPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSK 470 +V+L + L LQ+VL++PQF NL+S+S+LT+ + F +D +++D + IG + Sbjct: 282 GSVRLCNDLTLQNVLFVPQFHCNLLSISALTQQHPYVVSFHSDHYVIEDPTQGRKIGIGR 341 Query: 471 RIADHYVLD 497 + + Y LD Sbjct: 342 QANNLYTLD 350 >gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1454 Score = 87.8 bits (216), Expect 
= 1e-15 Identities = 56/173 (32%), Positives = 90/173 (52%), Gaps = 11/173 (6%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQAS--GICFSAAINSIFNF---------P 158 L+ Q Q + M ++ L A ++Q+ GICFS + S Sbjct: 369 LSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVARHTLSS 428 Query: 159 KYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLY 338 W+I+SGAT H+ ++ F ++ S L + + T+KLN +LL++VL+ Sbjct: 429 ATWVIDSGATHHVSHDRSLFSSLDTSVLSAVNLPTGPTVKISGVGTLKLNDDILLKNVLF 488 Query: 339 IPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 IP+FR NLIS+SSLT + F +SC +QDL +M+G+ +R+A+ Y+LD Sbjct: 489 IPEFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLLD 541 >dbj|BAF00783.1| hypothetical protein [Arabidopsis thaliana] Length = 556 Score = 87.4 bits (215), Expect = 2e-15 Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 9/173 (5%) Frame = +3 Query: 6 QMLNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKY------- 164 ++L Q ++ N+ + + I + GI FS++ K Sbjct: 328 KVLTKDQINGVVAYFNSQMQNSSIASSSGATITALPGIAFSSSTLGFIGVLKATVNVLSS 387 Query: 165 --WIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLY 338 WII+SGAT H+C ++ + NS TL + + TVKLN L+L +VLY Sbjct: 388 ETWIIDSGATHHVCHDKNLLMRLSETMNSSVTLPTGFGVKITCIGTVKLNEFLVLNNVLY 447 Query: 339 IPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 IP FR NL+ VS LTK + F D C++QD MIG+ ++I + YVLD Sbjct: 448 IPDFRLNLLCVSQLTKDLGYRVTFDEDYCLIQDHVKGLMIGRGEQINNLYVLD 500 >ref|XP_006584195.1| PREDICTED: uncharacterized protein LOC102662902 [Glycine max] Length = 490 Score = 86.7 bits (213), Expect = 3e-15 Identities = 59/187 (31%), Positives = 98/187 (52%), Gaps = 13/187 (6%) Frame = +3 Query: 6 QMLNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGI----CFSAAINSIFNFPKYWII 173 Q + Q Q ++ +L ++ + D S+ +GI ++ +NS F P WI+ Sbjct: 232 QSWSQQQCQQVLALLQAQMAQLPGASQQD--SSDTAGIETIGMSNSTLNSPFQAPNAWIV 289 Query: 174 NSGATSHI-CFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQF 350 +SGA +HI CF +F F+ K LKN F L N R+ V + +V+++S ++L +VLYIP F Sbjct: 290 
DSGAITHITCFPEFLFN-FKFLKNKFVLLPNETRVQVVGTGSVRIDSRIVLHNVLYIPSF 348 Query: 351 RFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVL--------DANF 506 NLISV +T + + F + QD + K IG+ D ++L A+ Sbjct: 349 HVNLISVPKITPNCKIGVLFEDTFVLFQDTQTHKTIGRGDLHLDLFLLYTEHTLTFAASR 408 Query: 507 SILNNHS 527 +++N+HS Sbjct: 409 TLVNSHS 415 >ref|XP_006587971.1| PREDICTED: uncharacterized protein LOC102669567 [Glycine max] Length = 453 Score = 85.9 bits (211), Expect = 5e-15 Identities = 50/154 (32%), Positives = 84/154 (54%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKYWIINSGATS 191 L+ +Q Q L++ L L+ AD ++ GIC + + +S + YWI++SG TS Sbjct: 236 LSTAQCQQLISFLTKQLNTEN---NADTLATNVLGICMNTSFDSNESC-HYWILDSGETS 291 Query: 192 HICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRFNLISV 371 HIC ++ F++ K L S L N ++ V +KLN + L ++L+IP FRFNL+S+ Sbjct: 292 HICCSKEQFNSFKSLHVSHVLLPNSTKVKVEGIGRIKLNDDIFLHNMLFIPTFRFNLLSL 351 Query: 372 SSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKR 473 SL +S ++QDL +L+ I +++ Sbjct: 352 VSLINDNSFQFIMQPNSFVLQDLKTLRRIDTARQ 385 >gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula] Length = 1157 Score = 85.5 bits (210), Expect = 7e-15 Identities = 56/177 (31%), Positives = 85/177 (48%), Gaps = 16/177 (9%) Frame = +3 Query: 12 LNPSQYQHLMTML----------------NTHLSVAKIGIEADVISAQASGICFSAAINS 143 L +Y HL+ +L + H+ + I D S + S AI+S Sbjct: 327 LTQEKYDHLVALLQQANLLSSVSPPTGPISNHVHTSTISSVPDTQQTGISSVV-SCAIDS 385 Query: 144 IFNFPKYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLL 323 + +WI++SGA +HIC + F + +K + L N + V ++ V +S L L Sbjct: 386 ASH---HWILDSGANNHICSSSLCFTSFYKIKPTNVNLPNKTTVLVQYAGNVSFSSSLYL 442 Query: 324 QDVLYIPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVL 494 +VLY P F NLISVS + K + F TDSCI+Q+L + +MIG + I Y L Sbjct: 443 TNVLYCPTFNLNLISVSKMCKSLSCFVNFSTDSCIIQELSTKRMIGLGENIHGLYRL 499 >ref|XP_004161031.1| PREDICTED: uncharacterized protein LOC101230271 [Cucumis sativus] Length = 457 Score = 84.0 bits (206), Expect = 2e-14 Identities = 45/119 (37%), Positives = 68/119 
(57%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKYWIINSGATS 191 LN QY L+ ML THL+ + G + +G C S ++N P WII+SGA+S Sbjct: 332 LNSDQYTQLLGMLQTHLNTPQNGENFKNETTHIAGTCLSNSLND----PLTWIIDSGASS 387 Query: 192 HICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRFNLIS 368 HIC ++F F + +N F L R+ V V +++ L+L+DVLYIP F++NL++ Sbjct: 388 HICHDKFMFTNLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLT 446 >dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1475 Score = 82.4 bits (202), Expect = 6e-14 Identities = 44/113 (38%), Positives = 67/113 (59%) Frame = +3 Query: 165 WIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIP 344 WII+SGAT H+ +++ F ++ ++ TL + + + +KLNS L L++VLYIP Sbjct: 446 WIIDSGATHHVSYDRNLFESLSDGLSNEVTLPTGSNVKIAGIGVIKLNSNLTLKNVLYIP 505 Query: 345 QFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLDAN 503 +FR NL+SVS TK I F D C++QD + IG+ +I YVLD + Sbjct: 506 EFRLNLLSVSQQTKDMKCKIYFDEDCCVIQDPIKEQKIGRGNQIGGLYVLDTS 558 >gb|AAC62795.1| contains similarity to retroviral aspartyl proteases (Pfam: rvp.hmm, score: 11.80) [Arabidopsis thaliana] Length = 1244 Score = 82.0 bits (201), Expect = 8e-14 Identities = 54/178 (30%), Positives = 93/178 (52%), Gaps = 16/178 (8%) Frame = +3 Query: 12 LNPSQYQHLMTMLNTHLSVAKIGIEADVISAQA----SGICFSAAINSIFNF-------- 155 L+ Q Q+ + + ++ L +D +++ +GI FS NS + F Sbjct: 354 LSNDQLQNFIALFSSQLKSQPTASSSDAGISRSPIDYTGISFS---NSTYYFVGILNVSQ 410 Query: 156 ----PKYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLL 323 + W+I+SGAT H+C ++ F ++ S+ L +R+ + +V++N +LL Sbjct: 411 HTLSTETWVIDSGATHHVCHDKSLFVSLDHSVVSYVNLPTGSRVKISGVGSVQINENILL 470 Query: 324 QDVLYIPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 ++VL++P+FR NLIS+SSLT + F C +QDL IG+ +RI + YVLD Sbjct: 471 RNVLFLPEFRLNLISISSLTSDIGSRVIFDPSCCEIQDLTKDLRIGRGRRIGNLYVLD 528 >ref|XP_007023091.1| Uncharacterized protein TCM_027093 [Theobroma cacao] gi|508778457|gb|EOY25713.1| Uncharacterized protein TCM_027093 [Theobroma cacao] Length = 994 Score 
= 81.6 bits (200), Expect = 1e-13 Identities = 41/110 (37%), Positives = 65/110 (59%) Frame = +3 Query: 168 IINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQ 347 I++SGA+ HI ++ F + +P+ NSF L N+ R V VKL S L L++V +P Sbjct: 174 IMDSGASDHIAYSLNKFISARPVTNSFVQLPNNKRAIVTHVGVVKLTSLLTLKNVFCVPS 233 Query: 348 FRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 FRFNL+SV LT+ + + F+ C++QD HS +IG ++ Y ++ Sbjct: 234 FRFNLVSVGQLTRTKNTSVLFIDKYCVVQDTHSWTVIGVARTFLGLYAME 283 >gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1461 Score = 81.3 bits (199), Expect = 1e-13 Identities = 57/178 (32%), Positives = 88/178 (49%), Gaps = 17/178 (9%) Frame = +3 Query: 15 NPSQYQHLMTMLNTHL-----SVAKIGIEADVISAQA---SGICFSAAINSIFNF----- 155 +P Q Q+L+ + ++ L S + + S+Q+ SGI FS + Sbjct: 375 SPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVAPSGILFSPSTYCFIGILAVSH 434 Query: 156 ----PKYWIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLL 323 W+I+SGAT H+ ++ F T+ SF L + + TV +N ++L Sbjct: 435 NSLSSDTWVIDSGATHHVSHDRKLFQTLDTSIVSFVNLPTGPNVRISGVGTVLINKDIIL 494 Query: 324 QDVLYIPQFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 Q+VL+IP+FR NLIS+SSLT + F C +QDL +G+ KRI + YVLD Sbjct: 495 QNVLFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRIGNLYVLD 552 >emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsis thaliana] gi|7267797|emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsis thaliana] Length = 1203 Score = 80.1 bits (196), Expect = 3e-13 Identities = 41/111 (36%), Positives = 62/111 (55%) Frame = +3 Query: 165 WIINSGATSHICFNQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIP 344 WII+SGA+SH+C + F + + TL N R+ + + T+ + S L+L +VL +P Sbjct: 99 WIIDSGASSHVCSDLTMFRELIHVSGVTVTLPNGTRVAITHTGTICITSTLILHNVLLVP 158 Query: 345 QFRFNLISVSSLTKGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 F+FNLISV L K F D C +Q+L MIG+ K + Y+L+ Sbjct: 159 DFKFNLISVCCLVKTLSYSAHFFADCCYIQELTRGLMIGRGKTYNNLYILE 209 >ref|XP_006589830.1| PREDICTED: uncharacterized protein LOC102661486 [Glycine max] 
Length = 451 Score = 79.7 bits (195), Expect = 4e-13 Identities = 55/158 (34%), Positives = 86/158 (54%) Frame = +3 Query: 24 QYQHLMTMLNTHLSVAKIGIEADVISAQASGICFSAAINSIFNFPKYWIINSGATSHICF 203 QY LM +L +H + + I V SA SG+ S + ++ + W+++SGA+ HI Sbjct: 294 QYSQLMNLLESHAT--NVVIPHGVSSA--SGMILSTSTSNFLH--DCWLLDSGASIHITC 347 Query: 204 NQFAFHTMKPLKNSFFTLLNHNRIPVHFSRTVKLNSCLLLQDVLYIPQFRFNLISVSSLT 383 + F + + + + TL N + IP+ +V L + L+L +V YIP+F+FNLISVS L Sbjct: 348 SLHHFLSYQLVYDKIVTLPNSDIIPILAIGSVCLTNTLVLHNVAYIPKFKFNLISVSVLL 407 Query: 384 KGAHLDIRFLTDSCIMQDLHSLKMIGKSKRIADHYVLD 497 +L I F + +Q+ + K IGK I YVLD Sbjct: 408 TNPNLSISFSQNEFDIQEKQACKRIGKGDLIQGLYVLD 445