BLASTX nr result
ID: Rehmannia27_contig00022268
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia27_contig00022268 (1413 letters) Database: ./nr 84,704,028 sequences; 31,038,470,784 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAH66391.1| OSIGBa0134J07.9 [Oryza sativa Indica Group] 369 e-112 emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera] 370 e-112 gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ... 365 e-111 gb|KYP41330.1| Retrovirus-related Pol polyprotein from transposo... 346 e-110 gb|AAP46257.1| putative polyprotein [Oryza sativa Japonica Group... 361 e-109 emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera] 356 e-109 emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera] 358 e-108 ref|XP_007033616.1| Uncharacterized protein TCM_019778 [Theobrom... 342 e-107 ref|XP_010675350.1| PREDICTED: uncharacterized protein LOC104891... 308 1e-98 gb|KYP75006.1| Retrovirus-related Pol polyprotein from transposo... 311 2e-96 gb|KYP58997.1| Retrovirus-related Pol polyprotein from transposo... 304 1e-90 gb|KYP65916.1| Retrovirus-related Pol polyprotein from transposo... 291 2e-90 ref|XP_007030765.1| Uncharacterized protein TCM_026511 [Theobrom... 276 1e-78 ref|XP_007014929.1| Uncharacterized protein TCM_040529 [Theobrom... 273 1e-77 gb|KYP45601.1| Retrovirus-related Pol polyprotein from transposo... 262 2e-77 gb|KYP77007.1| Retrovirus-related Pol polyprotein from transposo... 271 4e-77 gb|AAT38797.2| Polyprotein, putative [Solanum demissum] 268 2e-75 ref|XP_015940984.1| PREDICTED: DNA-directed RNA polymerase II su... 261 3e-73 emb|CAN83567.1| hypothetical protein VITISV_030380 [Vitis vinifera] 248 1e-72 emb|CAN69620.1| hypothetical protein VITISV_008603 [Vitis vinife... 246 8e-72 >emb|CAH66391.1| OSIGBa0134J07.9 [Oryza sativa Indica Group] Length = 1314 Score = 369 bits (946), Expect = e-112 Identities = 199/472 (42%), Positives = 291/472 (61%), Gaps = 4/472 (0%) Frame = +1 Query: 7 MHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQ 186 + QGV++++ PRI G K SKEAW+ LK++F G +KV+++KLQ L R F NL MK++E V+ Sbjct: 69 IQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKVLAVKLQTLRRQFQNLLMKESEKVK 128 Query: 187 EFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMG 366 ++FSRV +VNQ+R + + I D+K+VEK+L SLP K+E+IVAAIEESKDLSTL++ +LM Sbjct: 129 DYFSRVIEIVNQMRLYGEDINDQKVVEKILISLPEKYEYIVAAIEESKDLSTLTIQQLMS 188 Query: 367 SLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXX 546 SLE+HE+R + S+E AFQ K S +N R N + Sbjct: 189 SLESHEERKLQREGSSIENAFQSKLSF-RPQNSRFRGN-FQKNGFPMRDRGYFQKNGFST 246 Query: 547 XXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDC--RFKCKRCRNANHSQRDCWHKDDGDT 720 +L+C I +K++H T C + C +C+ H + C ++ Sbjct: 247 QKEDGQERREKSTSSSNLWCDISQKSSHTTDMCWKKMTCNKCKRKGHIAKYCRTRE---I 303 Query: 721 NEANLSEEKEPNQ-VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKL 897 N AN S+EKE ++ + FSC +Q++ +++W +DSGC+NHM + +F +D Y +++ + Sbjct: 304 NRANFSQEKEKSEEMVFSCHTAQEEKDDVWVIDSGCTNHMAADPNLFREMDSLYHAKIHM 363 Query: 898 GDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCI 1077 G+G + GKG +AVQT +G IKDV VPDL QNLLS+GQLLE Y V F + C Sbjct: 364 GNGSIAQSEGKGTVAVQTADGP-KFIKDVLLVPDLKQNLLSIGQLLEHGYAVYFEDFSCK 422 Query: 1078 IIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLK 1257 I+D++ N +AKI M N+ F L + T ++AL+S + +LWH R GHL+Y+ L LL+ Sbjct: 423 ILDRKNNRLVAKINMEKNRNFLLRMNHTTQMALRSEVDIS-DLWHKRMGHLNYRALKLLR 481 Query: 1258 KKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKT-SYRAKAPLEIVHADICG 1410 K MV GLP I + CEGC+FGK + FP + ++RA AP E+VH DI G Sbjct: 482 TKGMVQGLPFITLKSDPCEGCVFGKQIQASFPHSGAWRASAPFELVHTDIVG 533 >emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera] Length = 1472 Score = 370 bits (950), Expect = e-112 Identities = 203/472 (43%), Positives = 278/472 (58%), Gaps = 2/472 (0%) Frame = +1 Query: 4 FMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENV 183 F+ Q V ++I +I T+KEAW LK F G KVI++KLQ+L RDF+ L MK+ E+V Sbjct: 70 FIQQAVHESIFSKIAAXTTAKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMKNGESV 129 Query: 184 QEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELM 363 Q+F SRV+ +VNQ+R + + I D+ +V KVLRSL KF+H+VAAIEESKDLST S DELM Sbjct: 130 QDFLSRVAAIVNQMRSYGEDILDQTVVAKVLRSLTPKFDHVVAAIEESKDLSTYSFDELM 189 Query: 364 GSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXX 543 GSL++HE R+SR ++ E+ F K S+++N + A Sbjct: 190 GSLQSHEVRLSRTEEKNEEKXFYTKGETSDQKNGGREATG-------------------- 229 Query: 544 XXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTN 723 R C + + R R +Q +CW K+ + Sbjct: 230 ---------------------RGCGRGGAHGRG-----GRGRGRGDAQXECWKKERQE-K 262 Query: 724 EANLSEEKEPNQVFFSCLNSQQ-QIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLG 900 +AN E++E F N + NIW++DSGCSNHM G K +F LDES+ +VKLG Sbjct: 263 QANYVEQEEDQVKLFMAYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 322 Query: 901 DGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCII 1080 D K ++ GKG AV G + ++ +V+++P LTQNLLSVGQL+ Y + F+ C+I Sbjct: 323 DDKQVQVEGKGTXAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILFDGATCVI 382 Query: 1081 IDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 DK+ + + ++M++NK+FPL + EK AL +E NLWHLRYGHL+ KGL LL K Sbjct: 383 KDKKSDQIIVBVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSK 442 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKAPLEIVHADICGP 1413 K MVFGLP I D CEGCI+GK + PFPK S RA + LEI+HAD+CGP Sbjct: 443 KEMVFGLPKI-DSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGP 493 >gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1333 Score = 365 bits (938), Expect = e-111 Identities = 199/471 (42%), Positives = 277/471 (58%), Gaps = 2/471 (0%) Frame = +1 Query: 7 MHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQ 186 + Q + I PRI+ V+TSK+AWEILK+++ G +KVI++KLQ L RDF+ L M +NE+VQ Sbjct: 68 IQQALDDEIFPRISAVETSKQAWEILKQEYFGDDKVITVKLQTLRRDFETLFMNENESVQ 127 Query: 187 EFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMG 366 + SR S +VN++R + + I+++ +V KVLRSL KFEH+V AIEESKDLST S DELM Sbjct: 128 GYLSRTSAIVNRMRSYGEKIDNQIVVSKVLRSLTTKFEHVVTAIEESKDLSTYSFDELMS 187 Query: 367 SLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXX 546 SL AHE R++R + E+AFQVK S + E A + Sbjct: 188 SLLAHEDRLNRSREKVQEKAFQVKGEFSYKGKAENSAGRGHGRGN-------FRGRGRGG 240 Query: 547 XXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTNE 726 ++ CR C+K H + DCW K + + Sbjct: 241 SGRGRNQVGEFRQYKSNIQCRYCKK-----------------FGHKEVDCWTKQKDEQKD 283 Query: 727 ANLSEE-KEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGD 903 AN ++ +E +++F + + +W++DSGCSNHM +K +F +LDES SEV+LGD Sbjct: 284 ANFTQNVEEESKLFMASSQITESANAVWFIDSGCSNHMSSSKSLFRDLDESQKSEVRLGD 343 Query: 904 GKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIII 1083 K I GKG + ++T +G + + DV YVP L NLLSVGQL+ Y V F ++ C I Sbjct: 344 DKQVHIEGKGTVEIKTVQGNVKFLYDVQYVPTLAHNLLSVGQLMTSGYSVVFYDNACDIK 403 Query: 1084 DKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKKK 1263 DK+ T+A++ M+ NK+FPL++ AL + E NLWHLRYGHL+ L LL +K Sbjct: 404 DKESGRTIARVPMTQNKMFPLDISNVGNSALVVKEKNETNLWHLRYGHLNVNWLKLLVQK 463 Query: 1264 NMVFGLPNIPDRDKTCEGCIFGKMHRLPFP-KTSYRAKAPLEIVHADICGP 1413 +MV GLPNI + D CEGCI+GK R FP S+RA LE+VHAD+CGP Sbjct: 464 DMVIGLPNIKELD-LCEGCIYGKQTRKSFPVGKSWRATTCLELVHADLCGP 513 >gb|KYP41330.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 604 Score = 346 bits (887), Expect = e-110 Identities = 187/472 (39%), Positives = 286/472 (60%), Gaps = 7/472 (1%) Frame = +1 Query: 19 VSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQEFFS 198 ++ I PRI G T+KEAW L+++F G EKV ++KLQ L R+F+ L+MK++E V++++S Sbjct: 1 MTDTIFPRIMGATTAKEAWTTLQEEFEGSEKVRAVKLQTLRRNFELLNMKESETVKDYYS 60 Query: 199 RVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMGSLEA 378 ++ +VNQ+R H + I DKKIVEKVL S+P K++ IV IE++KDLSTLS+ ELMGSLEA Sbjct: 61 KIKEIVNQMRAHGENILDKKIVEKVLISVPRKYDPIVTIIEQTKDLSTLSVTELMGSLEA 120 Query: 379 HEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXXX 558 +E+R++R S E AFQ K + +N++KR N Sbjct: 121 YEQRLNRHDEDSTENAFQSKLKL-RSQNKDKRNNGETSRNKENCRNFSRNYQNEYPP--- 176 Query: 559 XXXXXXXXXXXXDLYCRICRKNNHDTKDCRFK----CKRCRNANHSQRDCWHKDDGDTNE 726 C IC++ NH KDCR++ C+ C+ H ++ C +K+ ++ Sbjct: 177 ---------------CGICKRTNHAEKDCRYRGKPQCRHCKKFGHVEKYCRNKNK---HQ 218 Query: 727 ANLSEEKEPNQ-VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGD 903 AN +EEK Q +F++ +S + WY+DSGCSNHM + +F ++DES +V++G+ Sbjct: 219 ANFAEEKNGEQHLFYATQDSNSETSGSWYLDSGCSNHMAKDASIFKDIDESVKVKVRMGN 278 Query: 904 GKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIII 1083 + GKG + V+T++G +I DV VP+L +NLLS+GQ++EK Y + F D C I Sbjct: 279 DIVVESKGKGTVMVETKKGT-RLITDVLLVPNLKENLLSIGQMMEKGYTLHFEGDTCKIY 337 Query: 1084 DKQKNLTMAKIKMSS-NKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 D +K L + ++KM N+ FP++L IA+++ + + LWH R+GH + L LL + Sbjct: 338 DNKK-LEIGRVKMEKRNRSFPISLRQGPNIAMKAEVD-DSWLWHRRFGHFNTHALKLLYQ 395 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKAPLEIVHADICGP 1413 KNM+ LP + + + CEGC+ GK HRLPF ++R K LE++H DICGP Sbjct: 396 KNMMRDLPCLKENSEACEGCLLGKQHRLPFSTGKAWRVKDLLELIHIDICGP 447 >gb|AAP46257.1| putative polyprotein [Oryza sativa Japonica Group] gi|108711922|gb|ABF99717.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1335 Score = 361 bits (926), Expect = e-109 Identities = 199/472 (42%), Positives = 287/472 (60%), Gaps = 4/472 (0%) Frame = +1 Query: 7 MHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQ 186 + QGV++++ PRI G K SKEAW+ LK++F G +KV+++KLQ L R F NL MK++E V+ Sbjct: 69 IQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKVLAVKLQTLRRQFQNLLMKESEKVK 128 Query: 187 EFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMG 366 ++FSRV +VNQ+R + + I D+K+VEK+L SLP K+E+IVAA EESKDLS Sbjct: 129 DYFSRVIEIVNQMRLYGEDINDQKVVEKILISLPEKYEYIVAATEESKDLSK-------D 181 Query: 367 SLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXX 546 SLE+HE+R + S+E AFQ K S +N R N + Sbjct: 182 SLESHEERKLQREGSSIENAFQSKLSF-RPQNSRFRGN-FQKNGFPMRDRGYFQKNGFSR 239 Query: 547 XXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDC--RFKCKRCRNANHSQRDCWHKDDGDT 720 +L+C IC+K++H T C + C +C+ H + C ++ Sbjct: 240 QKEDGQERREKGTSSSNLWCDICQKSSHTTDMCWKKMTCNKCKRKGHIAKYCRTRE---I 296 Query: 721 NEANLSEEKEPNQ-VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKL 897 N AN S+EKE ++ + FSC +Q++ +++W +DSGC+NHM + +F +D SY +++ + Sbjct: 297 NRANFSQEKEKSEEMVFSCHTAQEEKDDVWVIDSGCTNHMAADPNLFREMDSSYHAKIHM 356 Query: 898 GDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCI 1077 G+G + GKG +AVQT +G IKDV VPDL QNLLS+GQLLE Y V F + C Sbjct: 357 GNGSIAQSEGKGTVAVQTADGP-KFIKDVLLVPDLKQNLLSIGQLLEHGYAVYFEDFSCK 415 Query: 1078 IIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLK 1257 I+D++ N +AKI M N+ F L + T ++AL+S + +LWH R GHL+Y+ L LL+ Sbjct: 416 ILDRKNNRLVAKINMEKNRNFLLRMNHTTQMALRSEVDIS-DLWHKRMGHLNYRALKLLR 474 Query: 1258 KKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKT-SYRAKAPLEIVHADICG 1410 K MV GLP I + CEGC+FGK R FP + ++RA APLE+VHADI G Sbjct: 475 TKGMVQGLPFITLKSDPCEGCVFGKQIRASFPHSGAWRASAPLELVHADIVG 526 >emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera] Length = 1109 Score = 356 bits (914), Expect = e-109 Identities = 194/454 (42%), Positives = 267/454 (58%), Gaps = 2/454 (0%) Frame = +1 Query: 58 TSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQEFFSRVSTVVNQIRGHS 237 T K++ + Q G KVI++KLQ+L RDF+ L MK+ E+VQ+F SRV+ +VNQ+R + Sbjct: 61 TKKDSKALFFIQQAGSSKVITVKLQSLRRDFETLHMKNGESVQDFLSRVAAIVNQMRSYG 120 Query: 238 DTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMGSLEAHEKRMSRFSSQSL 417 + I D+ +V KVLRSL KF+H+VAAIEESKDLST S DELMGSL++HE R+SR ++ Sbjct: 121 EDILDQTVVAKVLRSLTPKFDHVVAAIEESKDLSTYSFDELMGSLQSHEXRLSRTXEKNE 180 Query: 418 EQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXD 597 E+ F K S+++N + A D Sbjct: 181 EKTFYTKGETSDQKNGGREATGRGRGRGGAHGRGG------------------RXRGRGD 222 Query: 598 LYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTNEANLSEEKEPNQVFFSCL 777 + +C C+ H Q +CW K+ + +AN E++E F Sbjct: 223 AQGDQRQSTEKSRNKSNIQCYYCKRFGHVQAECWKKERQE-KQANYVEQEEDQVKLFMAY 281 Query: 778 NSQQQIEN-IWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGDGKSRKITGKGEIAVQTR 954 N + N IW++DSGCSNHM G K +F LDES+ +VKLGD K + GKG +AV Sbjct: 282 NEEVVXSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLGDDKQVXVEGKGTVAVNNG 341 Query: 955 EGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIIIDKQKNLTMAKIKMSSNK 1134 G + ++ +V+++P LTQNLLSVGQL+ Y + F+ C+I DK+ + + ++M++NK Sbjct: 342 HGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILFDGSTCVIKDKKFDQIIVDVRMAANK 401 Query: 1135 IFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKKKNMVFGLPNIPDRDKTCE 1314 +FPL + EK AL +E NLWHLRYGHL+ KGL LL KK MVFGLP I D CE Sbjct: 402 LFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLPKI-DSVNVCE 460 Query: 1315 GCIFGKMHRLPFPK-TSYRAKAPLEIVHADICGP 1413 GCI+GK + PFPK S RA + LEI+HAD+CGP Sbjct: 461 GCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGP 494 >emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera] Length = 1274 Score = 358 bits (918), Expect = e-108 Identities = 199/472 (42%), Positives = 273/472 (57%), Gaps = 2/472 (0%) Frame = +1 Query: 4 FMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENV 183 F+ Q V ++I +I T+KEAW LK F G KVI++KLQ+L RDF+ L MK+ E++ Sbjct: 70 FIQQAVHESIFSKIAAATTAKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMKNGESM 129 Query: 184 QEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELM 363 Q+FF + I D+ +V KVLRSL KF+H+VAAIEESKDLST S DELM Sbjct: 130 QDFFVK-------------NILDQTVVAKVLRSLTPKFDHVVAAIEESKDLSTYSFDELM 176 Query: 364 GSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXX 543 GSL++HE R+SR ++ E+AF K S+++N + A Sbjct: 177 GSLQSHEVRLSRTEEKNEEKAFYTKGETSDQKNGGREATGRGRGRGGAHGRGGRGRGRGD 236 Query: 544 XXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTN 723 Y R + N + + +C C+ H Q +CW K+ + Sbjct: 237 AQG----------------YQRQSTEKNRNKSN--IQCYYCKRFGHVQXECWKKERQE-K 277 Query: 724 EANLSEEKEPNQVFFSCLNSQQ-QIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLG 900 +AN E++E F N + NIW++DSGCSNHM G K +F LDES+ +VKLG Sbjct: 278 QANYVEQEEDQVKLFMXYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 337 Query: 901 DGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCII 1080 D K + GKG +AV G + ++ +V+++P LTQNLLSVGQL+ Y + F+ C+I Sbjct: 338 DDKQVXVEGKGIMAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILFDGATCVI 397 Query: 1081 IDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 DK+ + + ++M++NK+FPL + EK AL +E NLWHLRYGHL+ KGL LL K Sbjct: 398 KDKKSDQIIVNVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSK 457 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKAPLEIVHADICGP 1413 K MVFGLP I D CEGCI+GK + PFPK S RA + LEI+HAD+CGP Sbjct: 458 KEMVFGLPKI-DSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGP 508 >ref|XP_007033616.1| Uncharacterized protein TCM_019778 [Theobroma cacao] gi|508712645|gb|EOY04542.1| Uncharacterized protein TCM_019778 [Theobroma cacao] Length = 704 Score = 342 bits (878), Expect = e-107 Identities = 182/472 (38%), Positives = 273/472 (57%), Gaps = 2/472 (0%) Frame = +1 Query: 4 FMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENV 183 F+ Q V + I RI TS EAW+ILKK+F G KVI++KLQ R+F+ LSMK NE V Sbjct: 32 FIQQAVHETIFSRIAAATTSLEAWQILKKKFQGSSKVITVKLQTYRREFETLSMKSNEFV 91 Query: 184 QEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELM 363 Q + SRVS++VNQ++ + + I ++ +V KVLRSL KFEHIVAAIEE+ DLS S DELM Sbjct: 92 QTYLSRVSSLVNQMKSYGEDISEETVVAKVLRSLTPKFEHIVAAIEEAHDLSNYSFDELM 151 Query: 364 GSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXX 543 SL+AHE+R+ R ++ E+AFQV + +E E Sbjct: 152 SSLQAHEERLFRSHEKNEEKAFQVNEESNLKETLENSTGGGRGRVGFRGKGHGRG----- 206 Query: 544 XXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTN 723 R ++N + ++ F+C C+ H CW K + N Sbjct: 207 ---------------------RSRGRSNEERQNKTFQCYYCKKPGHRAAYCWQKQKDENN 245 Query: 724 EANLSEEKEPN-QVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLG 900 +A+ E+ + ++F + ++Q ++W++DSGCSNHM G + +F LDES ++V LG Sbjct: 246 QASFVEKSDEEIRLFMAFFYEKEQSNDVWFLDSGCSNHMSGTRSLFKELDESNKTDVTLG 305 Query: 901 DGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCII 1080 + K ++ G+G I+++T +G I++ V VPDL+ NLLS+ QL+ Y + F++ C I Sbjct: 306 NSKKIRVEGRGTISIKTSQGNAKILQYVMLVPDLSHNLLSIVQLMISGYSILFDDGFCTI 365 Query: 1081 IDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 +K+ + K+ M+ NK+FPL + + E A+ + ++E L HL YGHL+ GL LL + Sbjct: 366 KNKKFKQIITKVPMAKNKMFPLEVSMIENYAMVANGDSEARLSHLHYGHLNINGLKLLSQ 425 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPF-PKTSYRAKAPLEIVHADICGP 1413 K MVFGLP + + CEGC++GK + PF ++R LE+VHAD+CGP Sbjct: 426 KEMVFGLPKLENLG-FCEGCVYGKQSKKPFLVGKAWRVSKCLELVHADLCGP 476 >ref|XP_010675350.1| PREDICTED: uncharacterized protein LOC104891365 [Beta vulgaris subsp. vulgaris] Length = 326 Score = 308 bits (788), Expect = 1e-98 Identities = 154/342 (45%), Positives = 210/342 (61%), Gaps = 4/342 (1%) Frame = +1 Query: 391 MSRFSSQSLEQAFQVKASIS---EEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXXXX 561 M RF+ QSLEQAFQ K S E +N + Y Sbjct: 1 MRRFTEQSLEQAFQAKLKFSNNGENKNGYDKNFQRGTSYNRGRRRGNYKNQVTRENNQSG 60 Query: 562 XXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDTNEANLSE 741 YC +C+KN H+T+DCR KC RCR H ++D W + EAN +E Sbjct: 61 S------------YCNLCKKNGHNTQDCRSKCNRCRKHTHFEKDRWFRQK---EEANFAE 105 Query: 742 EKEP-NQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGDGKSRK 918 KEP +Q+F++CLN+ Q+ ++WY+DSGCSNHM GNK FV+LDE+ +++ LGDG +++ Sbjct: 106 NKEPKDQLFYTCLNAHQESNDLWYIDSGCSNHMTGNKNSFVSLDENIKTQITLGDGSNQE 165 Query: 919 ITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIIIDKQKN 1098 + GKG I V+ R I VFYVP L QNLLSVGQL+++ Y V+F+ + C+I DK+K Sbjct: 166 LAGKGTIVVRARNDSSKFIHKVFYVPRLAQNLLSVGQLMQRSYMVKFDANKCLIFDKRKG 225 Query: 1099 LTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKKKNMVFG 1278 + +M+ KIFPL + L +AL S + E LWHLR+GHL++ L LLK+K MV G Sbjct: 226 QLITNTQMAPTKIFPLRMQLEHNVALSSMVD-ESILWHLRFGHLNFNSLKLLKRKEMVTG 284 Query: 1279 LPNIPDRDKTCEGCIFGKMHRLPFPKTSYRAKAPLEIVHADI 1404 LP I + K CEGCI+G+MHRLPFP +S+RA+APLE+VHAD+ Sbjct: 285 LPPISNERKICEGCIYGEMHRLPFPTSSWRARAPLELVHADL 326 >gb|KYP75006.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 617 Score = 311 bits (798), Expect = 2e-96 Identities = 178/486 (36%), Positives = 267/486 (54%), Gaps = 17/486 (3%) Frame = +1 Query: 7 MHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQ 186 +++ V ++ +I K+SKEAW+IL+K G E+V ++LQ L + +N+ MK++E V Sbjct: 49 LYRAVDESGFEKIANAKSSKEAWDILEKAKKGDERVKQVRLQTLRGELENMRMKESEGVS 108 Query: 187 EFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMG 366 EF +RV TV N++ + + + ++VEK+LRSL FE+IV AIEESKDLSTL+++EL G Sbjct: 109 EFITRVETVANKLNRNGENLPSSRVVEKILRSLTDDFENIVCAIEESKDLSTLTVEELTG 168 Query: 367 SLEAHEKRMS--RFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXX 540 SLEA+E+R + +SLEQA Q KA+I EE+ + R + Sbjct: 169 SLEAYEQRKKNKKEKGESLEQALQAKATIKEEKARGGRGQAWGGRGQTWGGRSN------ 222 Query: 541 XXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDC-RFKCKRCRNANHSQRDCWHKDDGD 717 ++ C C K H K+C KC C N H +DC + + Sbjct: 223 ---------------TNNNIECYNCGKYGHVAKECYSIKCYNCGNLGHISKDCRSEKKRE 267 Query: 718 TNEANLSEEKEPNQVFFSCL-------NSQQQIE------NIWYVDSGCSNHMCGNKKMF 858 L+EE++ + + + + +I+ ++WY+D+G SNHMCG++ +F Sbjct: 268 EPTNFLAEEEDEGLLLVTTIPEVEIKPSCSSEIKPSCSDNSVWYLDTGASNHMCGDEHLF 327 Query: 859 VNLDESYSSEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLE 1038 L + V GD + G+G I Q R GK+ I DV+YVPDL N+LS+GQL+E Sbjct: 328 KMLSKEEFGSVSFGDASKVVVKGRGTIWYQQRNGKIGEIGDVYYVPDLKSNILSMGQLME 387 Query: 1039 KDYKVEFNNDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLR 1218 K Y V + + DK L +A+++M N+++ L L + + +Q E E WHLR Sbjct: 388 KGYSVLMKDRELQLKDKLGRL-IAQVEMKKNRMYKLELKIVQDECMQLDLEDEAMKWHLR 446 Query: 1219 YGHLSYKGLNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTS-YRAKAPLEIVH 1395 +GHL + GL L KK MVFGLP + K CE C+ GK R FP++S YRAK L ++H Sbjct: 447 FGHLHFGGLTELVKKEMVFGLPKMEFEKKFCEECVIGKHARTSFPRSSEYRAKEQLGLIH 506 Query: 1396 ADICGP 1413 D+CGP Sbjct: 507 TDLCGP 512 >gb|KYP58997.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 946 Score = 304 bits (779), Expect = 1e-90 Identities = 167/470 (35%), Positives = 256/470 (54%), Gaps = 1/470 (0%) Frame = +1 Query: 1 SFMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNEN 180 S +HQGV +I +I KT+KEAW ILK + G EK KLQ+L R+++ M ++E+ Sbjct: 61 SQIHQGVDYSIFGKIANAKTAKEAWNILKLSYKGVEKAQKSKLQSLRREYERYEMSNSES 120 Query: 181 VQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDEL 360 V+++FSRV+ +VN++R + + I + K+VEK+LR++P KF+H+V AI ES D +++ +L Sbjct: 121 VEQYFSRVTDLVNKMRVYGEDIPESKVVEKILRTMPMKFDHVVTAIIESHDTDIMTVAKL 180 Query: 361 MGSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXX 540 GS+E+H R+ + + E+A + + + + +R +S Sbjct: 181 QGSIESHVSRILEKTEKGNEEALKSQVNFTNIAEPSRREDSKGREGGNINFRGR------ 234 Query: 541 XXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFKCKRCRNANHSQRDCWHKDDGDT 720 + C C K H DCRFK + AN ++ H + Sbjct: 235 -----GRGRGSFTNQERTNFNCYHCGKFGHRAADCRFK----QQANIAENQYKHTGESSD 285 Query: 721 NEANLSEEKEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLG 900 N Q N+ IWY+D+GCSNH+CG K++F +LDE+ S VK G Sbjct: 286 NP----------QTLLLVANNFSGDGAIWYLDTGCSNHLCGKKELFFSLDETVKSTVKFG 335 Query: 901 DGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCII 1080 + + I GKG +A++ ++G N I DVFY L NLLS+GQL +K Y ++ ++ +C + Sbjct: 336 NNSNIPILGKGRVAIRLKDGSQNFISDVFYARGLHHNLLSMGQLSKKGYNMKIHHGYCTL 395 Query: 1081 IDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 IDK +AK+KM+ N++FPL + + L S + LWH+R+GH + GLN L + Sbjct: 396 IDKSGRF-IAKVKMTPNRLFPLKICHEKFTCLSSIIPNDDWLWHMRFGHFHFSGLNYLSR 454 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKAPLEIVHADIC 1407 K V GLP + + CE C GK HR FP S+RAK LEIVH+D+C Sbjct: 455 KEYVSGLPIVNIPNGVCETCEIGKKHRESFPTGVSWRAKKLLEIVHSDLC 504 >gb|KYP65916.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 464 Score = 291 bits (746), Expect = 2e-90 Identities = 157/424 (37%), Positives = 248/424 (58%), Gaps = 7/424 (1%) Frame = +1 Query: 163 MKDNENVQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLST 342 MK++E V++++S++ +VNQ+R + + I DKKIVEK++ S+P K++ I+ IE+ KDLST Sbjct: 1 MKESETVKDYYSKIKEIVNQMRAYGENILDKKIVEKIIISVPRKYDPIMTTIEQIKDLST 60 Query: 343 LSLDELMGSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXX 522 LS+ ELMGSLEA+E+R++R S+E AFQ K + ++N + + Sbjct: 61 LSVTELMGSLEAYEQRLNRHDEDSIENAFQSKLKLRNKDNSRNFSRNYQNEYPP------ 114 Query: 523 YXXXXXXXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRF----KCKRCRNANHSQR 690 C IC++ N KDCR+ +C+ C H ++ Sbjct: 115 ---------------------------CDICKRTNLAEKDCRYHGKPQCRHCNKVRHVKK 147 Query: 691 DCWHKDDGDTNEANLSEEKEPNQ-VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNL 867 C +K+ ++AN +EEK Q +F++ +S + WY+DSGCSNHM + +F ++ Sbjct: 148 YCRNKNK---HQANFAEEKNGEQHLFYATQDSNSETSGNWYLDSGCSNHMAKDASIFKDI 204 Query: 868 DESYSSEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDY 1047 DES +V++G+ + GKG + V+T++G M +I DV VP+L +NLLS+GQ++EK Y Sbjct: 205 DESVKVKVRMGNDTVVESKGKGTVMVETKKG-MRLITDVILVPNLKENLLSIGQMMEKGY 263 Query: 1048 KVEFNNDHCIIIDKQKNLTMAKIKMSS-NKIFPLNLPLTEKIALQSTTEAEHNLWHLRYG 1224 + F D C I D +K L + ++KM N+ FP++L A+++ + + LWH R+G Sbjct: 264 TLHFEGDSCKIYDNKK-LEIGRVKMEKRNRSFPISLRQGLNFAMKAEVD-DSWLWHQRFG 321 Query: 1225 HLSYKGLNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKAPLEIVHAD 1401 H + LNLL KKNM+ LP + + + CEGC GK HRLPF ++ AK LE++H + Sbjct: 322 HFNTHALNLLYKKNMMRDLPCLKENSEACEGCFLGKQHRLPFSTGKAWSAKDLLELIHTN 381 Query: 1402 ICGP 1413 ICGP Sbjct: 382 ICGP 385 >ref|XP_007030765.1| Uncharacterized protein TCM_026511 [Theobroma cacao] gi|508719370|gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] Length = 1318 Score = 276 bits (705), Expect = 1e-78 Identities = 161/478 (33%), Positives = 269/478 (56%), Gaps = 7/478 (1%) Frame = +1 Query: 1 SFMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNEN 180 S +H V+ AI RI +++KEAW+ +K++F G ++ I++ NL R+F+ L MKD E Sbjct: 72 SCIHSAVTDAIFVRIMACESAKEAWDKIKEEFHGSDRTRQIQILNLLREFEVLKMKDEET 131 Query: 181 VQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDEL 360 ++++ +V VVNQ+R + I ++++V K L SLP KFE ++++E+SKDL+T+S+ EL Sbjct: 132 MKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKFESKISSLEDSKDLTTMSVSEL 191 Query: 361 MGSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXX 540 + +L+A E+R ++L Q V+A+++ +KR +S Sbjct: 192 INALQAQEQR------RALRQEDHVEAALAARR-VDKRTSSGSHKKSEYEKKDKDKRYEE 244 Query: 541 XXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRF----KCKRCRNANHSQRDCWHKD 708 C C+K NH + C + KC+ C H ++ C +K+ Sbjct: 245 KKQGKKWQFPP----------CSYCKKKNHIERYCWYRPHVKCRACNQKGHVEKVCKNKE 294 Query: 709 DGDTNEANLSEEKEPNQ--VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYS 882 + +A + E+KE + +F ++ + ++IW +DS CS H+ G K F++L+++Y Sbjct: 295 NRVEEKAAIVEQKEDAEETLFMVIESNDSKKDSIWLIDSACSTHITGKIKNFLDLNKAYK 354 Query: 883 SEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFN 1062 S V++GDG KI G+G + + T++G M I +V + P++TQNLLSVGQL+++ + F Sbjct: 355 STVEIGDGNLLKIAGRGTVGITTKKG-MKTIANVCFAPEVTQNLLSVGQLVKEKNSLLFK 413 Query: 1063 NDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKG 1242 ++ C I D +A +KM NK FPL+L +A + + E LWH R GH++Y+ Sbjct: 414 DELCTIFD-PSGREIATVKM-RNKCFPLDLNEAGHMAYKCVSN-EARLWHRRLGHINYQF 470 Query: 1243 LNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTSY-RAKAPLEIVHADICGP 1413 + + N+V +P I + +KTCE C+ GK R PFPK S R L+++H DICGP Sbjct: 471 IKNMGSLNLVNDMPIITEVEKTCEVCLQGKQSRHPFPKQSQTRTANRLQLIHTDICGP 528 >ref|XP_007014929.1| Uncharacterized protein TCM_040529 [Theobroma cacao] gi|508785292|gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao] Length = 1266 Score = 273 bits (698), Expect = 1e-77 Identities = 161/478 (33%), Positives = 268/478 (56%), Gaps = 7/478 (1%) Frame = +1 Query: 1 SFMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNEN 180 S +H V+ AI RI +++KEAW+ +K++F G ++ I++ NL R+F+ L MKD E Sbjct: 72 SCIHSAVTDAIFVRIMACESAKEAWDKIKEEFHGSDRTRQIQILNLLREFEVLKMKDEET 131 Query: 181 VQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDEL 360 ++++ +V VVNQ+R + I ++++V K L SLP KFE ++++E+SKDL+T+S+ EL Sbjct: 132 MKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKFESKISSLEDSKDLTTMSVSEL 191 Query: 361 MGSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXX 540 + L+A E+R ++L Q V+A+++ +KR +S Sbjct: 192 INVLQAQEQR------RALRQEDHVEAALAARR-VDKRTSSGSHKKSEYEKKDKDKRYEE 244 Query: 541 XXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRF----KCKRCRNANHSQRDCWHKD 708 C C+K NH + C + KC+ C H ++ C +K+ Sbjct: 245 KKQGKKGQFPP----------CSYCKKKNHIERYCWYRPHVKCRACNQKGHVEKVCKNKE 294 Query: 709 DGDTNEANLSEEKEPNQ--VFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYS 882 + + + E+KE + +F ++ + ++IW +DS CS H+ G K F++L+++Y Sbjct: 295 NRVEEKVAIVEQKEDAEETLFMVIESNDSKKDSIWLIDSACSTHITGKIKNFLDLNKAYK 354 Query: 883 SEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFN 1062 S V++GDG KI G+G I + T++G + I +V + P++TQNLLSVGQL+++ + F Sbjct: 355 STVEIGDGNLLKIEGRGTIGITTKKG-IKTIANVCFAPEVTQNLLSVGQLVKEKNSLLFK 413 Query: 1063 NDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKG 1242 ++ C I D +A +KM NK FPL+L +A + + E LWH R GH++Y+ Sbjct: 414 DELCTIFD-PSGREIATVKM-RNKCFPLDLNEAGHMAYKCVSN-EARLWHRRLGHINYQF 470 Query: 1243 LNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTSY-RAKAPLEIVHADICGP 1413 + + N+V +P I + +KTCE C+ GK R PFPK S RA L+++H DICGP Sbjct: 471 IKNMGSLNLVNDMPVITEVEKTCEVCLQGKQSRHPFPKQSQTRATNRLQLIHTDICGP 528 >gb|KYP45601.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 599 Score = 262 bits (669), Expect = 2e-77 Identities = 154/471 (32%), Positives = 250/471 (53%), Gaps = 5/471 (1%) Frame = +1 Query: 16 GVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQEFF 195 GVS+AI RI +K++KE W+ LK+++ G E++ S+K+ NL R+F+ MK+ E ++E+ Sbjct: 52 GVSQAIFTRIKTLKSAKEIWDYLKEEYAGDERIRSMKVLNLMREFELQRMKEYEKIKEYS 111 Query: 196 SRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMGSLE 375 ++ + N+IR D IVEK+L ++ K+E A++E ++DLS ++ E++ + + Sbjct: 112 DKLLGIANKIRLLGSNFPDSIIVEKILVTVSEKYEASTASLENTRDLSKITFAEVLHAFQ 171 Query: 376 AHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXX 555 A E+R ++E A VK+ ++ + A+S Sbjct: 172 AQEQRSLMREDHAVEGALLVKSQQAKNYKKNYPASSYDKGKGGKKSYPP----------- 220 Query: 556 XXXXXXXXXXXXXDLYCRICRKNNHDTKDC----RFKCKRCRNANHSQRDCWHKDDGDTN 723 C+ C K H C KC +C H C K+ Sbjct: 221 ----------------CQHCGKMGHAPFRCWQRPDAKCNKCNQMGHEAIICKSKNQQQEE 264 Query: 724 EANLSEEKEPNQVFFS-CLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLG 900 EA +++KE +Q+F + C S + E+ W +DSGC+NHM NK +F +L + ++V++G Sbjct: 265 EAKAADQKEEDQLFVATCFLSSESSES-WLIDSGCTNHMTFNKALFRDLRPTNVTKVRIG 323 Query: 901 DGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCII 1080 +G + GKG IA+ + G I D+F VP++ QNLLSVGQL+EK +KV F + +C+I Sbjct: 324 NGDHISVKGKGTIAITSCTGT-KFIHDIFLVPEIDQNLLSVGQLIEKGFKVVFEDKYCLI 382 Query: 1081 IDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKK 1260 D M K+KM K F LN PL E+ + S E +WH R GH ++GL + + Sbjct: 383 KDAAGQ-DMFKVKMKG-KSFALN-PLEEEQVVFSLKENVTEIWHKRLGHYHHQGLLQMSE 439 Query: 1261 KNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTSYRAKAPLEIVHADICGP 1413 K + +P + D+ C+ C FGK +R PF KT++RA L+++H D+ GP Sbjct: 440 KGLALDIPVLEDQTSNCKACQFGKQNRKPFSKTAWRASRKLQLIHTDVAGP 490 >gb|KYP77007.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1228 Score = 271 bits (693), Expect = 4e-77 Identities = 148/430 (34%), Positives = 230/430 (53%), Gaps = 15/430 (3%) Frame = +1 Query: 163 MKDNENVQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLST 342 M ++E+V+++FSRV+ +VN++R + + I + K+VEK+LR++P KF+H+V I ES D+ Sbjct: 1 MTNSESVEQYFSRVTDLVNKMRVYGEDIPESKVVEKILRTMPMKFDHVVTTIIESHDIEI 60 Query: 343 LSLDELMGSLEAHEKRMSRFSSQSLEQAFQVKASISE----EENQEKRANSXXXXXXXXX 510 +++ EL GS+E+H R+ + + E+A + + + + N++ R Sbjct: 61 MTVAELQGSIESHVSRILEKTEKINEEALKSQVNFTNIAEPSRNEDSRGREGGNINFKGR 120 Query: 511 XXXXYXXXXXXXXXXXXXXXXXXXXXXXDLYCRICRKNNH-------DTKDCRFKCKRCR 669 + + N + + F C C Sbjct: 121 GRGSFRGRGRCNFNQQWRDNNFRPPNQGRGGYNVRSTNRGRGRGSFTNQERTNFNCYHCG 180 Query: 670 NANHSQRDCWHKDDGDTNEANLSEEKEPN---QVFFSCLNSQQQIENIWYVDSGCSNHMC 840 H DC K + E + E + Q N+ E IWY+D+GCSNHMC Sbjct: 181 KFGHRAADCRFKQQANIAENQYEQTGEISDNPQTLLLATNNFSGNEAIWYLDTGCSNHMC 240 Query: 841 GNKKMFVNLDESYSSEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLS 1020 G K++F +LDE+ S VK G+ + I GKG++A++ ++G N I DVFY P L NLLS Sbjct: 241 GKKELFSSLDETVKSTVKFGNNSNIPILGKGQVAIRLKDGTQNFISDVFYAPGLHHNLLS 300 Query: 1021 VGQLLEKDYKVEFNNDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEH 1200 +GQL EK Y ++ ++ +C++IDK + L +AK+KMS N++FPLN+ + L S + + Sbjct: 301 LGQLSEKGYNIQIHDGYCMLIDKNRRL-IAKVKMSPNRLFPLNVQYDKIPCLSSIIQNDD 359 Query: 1201 NLWHLRYGHLSYKGLNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPK-TSYRAKA 1377 LWH+R+GH + GLN L +K V GLP I CE C GK HR FP S+RA+ Sbjct: 360 WLWHMRFGHYHFSGLNFLSRKEYVSGLPVINIPKGICETCEIGKKHRESFPTGKSWRARK 419 Query: 1378 PLEIVHADIC 1407 PLEIVH+D+C Sbjct: 420 PLEIVHSDLC 429 >gb|AAT38797.2| Polyprotein, putative [Solanum demissum] Length = 1793 Score = 268 bits (684), Expect = 2e-75 Identities = 150/479 (31%), Positives = 259/479 (54%), Gaps = 8/479 (1%) Frame = +1 Query: 1 SFMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNEN 180 S M V+ ++ RI KT+KEAW+ LK+++ G ++ +++ NL R+F+ L+M+D+E Sbjct: 184 SLMQNAVADSVFYRIMACKTAKEAWDRLKEEYQGSDRTRQMQVLNLKREFECLNMQDDET 243 Query: 181 VQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDEL 360 + ++ R+S +VN IR + DK+IVEKVL +LP +FE +++ E+SKDL LSL EL Sbjct: 244 ISKYADRISLIVNNIRLLGEEFTDKRIVEKVLVTLPERFESKISSFEKSKDLGKLSLGEL 303 Query: 361 MGSLEAHEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXX 540 MG+L+A E+R + + E A V+ I + Q+ N+ Sbjct: 304 MGALQAQEQRRNMRRDKFTEGAVSVQKQIFGKGKQQVNQNNKVKHDGGNNSGDVKKKFPP 363 Query: 541 XXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDCRFK----CKRCRNANHSQRDCWHKD 708 C+ C++ H K C ++ C C+ H + C + Sbjct: 364 ---------------------CKYCKRTTHLEKYCWWRVDAICGNCKQTGHISKVCKSRA 402 Query: 709 DGDTN-EANLSE--EKEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESY 879 + + +A +++ + +Q+F S + + W +DSGC++H+C + +MF LD++Y Sbjct: 403 NASGSLQAQVADAADAHEDQLFAVSYFSINESSDSWILDSGCTHHLCNDAEMFKFLDDTY 462 Query: 880 SSEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEF 1059 S+VK+G+G++ ++ G+G +++ G + I D+ Y PD++QNLLSVGQ+LE +Y + F Sbjct: 463 KSKVKVGNGEAVEVKGRGTMSISIISG-IKTIPDILYTPDMSQNLLSVGQMLENNYSLHF 521 Query: 1060 NNDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYK 1239 N C++ D + + +KM SN +F ++ + A T + NLWH R+GH + + Sbjct: 522 KNHECVVSD-PSGVELFYVKM-SNIMFSVDWEKITEQAYTITLQTCTNLWHKRFGHFNLR 579 Query: 1240 GLNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTS-YRAKAPLEIVHADICGP 1413 + +KKK +V +P + CE C GK +LPF +RA L+++H D+CGP Sbjct: 580 SIAEMKKKELVENMPEFLSNAQVCETCQQGKQTKLPFQANQVWRANQKLQLIHTDVCGP 638 >ref|XP_015940984.1| PREDICTED: DNA-directed RNA polymerase II subunit 1-like [Arachis duranensis] Length = 3020 Score = 261 bits (667), Expect = 3e-73 Identities = 150/478 (31%), Positives = 256/478 (53%), Gaps = 7/478 (1%) Frame = +1 Query: 1 SFMHQGVSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNEN 180 S ++ V+ I RI ++++K+ W+ LKK++ G +K+ +K NL R+ + + MK+NE+ Sbjct: 511 SSLYAAVTPIIFNRIMSLESAKDIWDFLKKEYEGNKKIKGMKAMNLKRELERVQMKENES 570 Query: 181 VQEFFSRVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDEL 360 VQEF +++ + N++ + ++++VEK+L S+P +FE +A++E ++++S + E Sbjct: 571 VQEFANKLLDLTNKLALLGTDLGEERLVEKLLCSVPERFEATIASLENTREISEMPFAEA 630 Query: 361 MGSLEAHE-KRMSRFSSQSLEQAFQ--VKASISEEENQEKRANSXXXXXXXXXXXXXYXX 531 + +L+A E +R+ R + +E A Q VK E++ Q K+ + Sbjct: 631 VSALQAQEQRRVLRRGDEPVEGAMQARVKHGSGEKKKQGKQFGAQQTSFSSSDAQVR--- 687 Query: 532 XXXXXXXXXXXXXXXXXXXXXDLYCRICRKNNHDTKDC----RFKCKRCRNANHSQRDCW 699 D C+ C K H C C++C H +R C Sbjct: 688 ------------------KGSDEACKHCGKKGHPFFRCWRRPNVVCRKCGKMGHIERICK 729 Query: 700 HKDDGDTNEANLSEEKEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESY 879 K + E E Q+F + Q + W VDSGCSNHM GN+ +F N+D S Sbjct: 730 EKSSQQGEAQAAAGEVE--QLFAASGFVSQVSRDGWLVDSGCSNHMSGNEAIFTNIDRSV 787 Query: 880 SSEVKLGDGKSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEF 1059 ++ V++G+G ++ G+G + ++ G + ++ DV+YVP + QNLLSVGQL+EK K+ F Sbjct: 788 NTRVRIGNGDHLEVEGRGNVLLEG-PGGVKLMSDVYYVPKIDQNLLSVGQLVEKGMKIVF 846 Query: 1060 NNDHCIIIDKQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYK 1239 + + C I D++ L + +I M NK F + P +++ + + + LWH R GH YK Sbjct: 847 DKNECAIADEEGKL-LFRIPMQ-NKTFAFD-PASKESKAMACVQGQEELWHRRLGHFHYK 903 Query: 1240 GLNLLKKKNMVFGLPNIPDRDKTCEGCIFGKMHRLPFPKTSYRAKAPLEIVHADICGP 1413 GL +++ +V LP + CE C+ GK+ R PF KT +RAK L++VH+D+CGP Sbjct: 904 GLQFMQRHGLVEDLPQLGSEVSDCEVCLQGKLVRKPFQKTRWRAKKKLQLVHSDVCGP 961 >emb|CAN83567.1| hypothetical protein VITISV_030380 [Vitis vinifera] Length = 567 Score = 248 bits (633), Expect = 1e-72 Identities = 149/469 (31%), Positives = 253/469 (53%), Gaps = 5/469 (1%) Frame = +1 Query: 19 VSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQEFFS 198 VS +I +I + ++ E WE LK+++ G E++ ++++ NL R+F+ M++++ V+++ + Sbjct: 53 VSPSIFIKIMKIDSAAEIWEYLKEEYKGXERIKNMQVMNLIREFEMKKMRESDAVKDYAA 112 Query: 199 RVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMGSLEA 378 ++ ++ +++R ++KIV+K+L +LP K+E ++++E SKDLST+SL EL+ SLEA Sbjct: 113 QLLSIADKVRLLGKEFSNEKIVQKILVTLPEKYEATISSLENSKDLSTISLTELLHSLEA 172 Query: 379 HEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXXX 558 E+R + E AFQ + + EK N+ Sbjct: 173 VEQRRLMRQGDTAEGAFQARMQKNAGHKNEKMNNNKPCSNNQKNGVFPP----------- 221 Query: 559 XXXXXXXXXXXXDLYCRICRKNNHDTKDCRF----KCKRCRNANHSQRDCWHKDDGDTNE 726 C C+K NH + C + KC +C H +R C ++ +T+ Sbjct: 222 ---------------CPHCKKTNHSPQKCWWRPDVKCNKCGKQGHVERICKNQQQEETSA 266 Query: 727 ANLSEEKEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGDG 906 A + + Q+F + + + W VDSGC+NHM N+ +F LD + S+V++G+G Sbjct: 267 A--VDYCQEEQLFAATCFANKSTSKSWLVDSGCTNHMTNNQDLFRELDRTTISKVRIGNG 324 Query: 907 KSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIIID 1086 + + GKG +A++++ G + +I DV +VPD+ QNLLSVGQL+EK++KV F + +CII D Sbjct: 325 EYIPVKGKGTVAIESQTG-LKLIYDVLFVPDIDQNLLSVGQLVEKEFKVYFEDRNCIIKD 383 Query: 1087 KQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKKKN 1266 + + IKM K F LNL E A+ ++ W R H + + +KK Sbjct: 384 AE-GKEVFNIKM-KGKSFALNLLEDEHTAILQ-QDSTTMFWDRRVEHFHHDDVLYMKKNQ 440 Query: 1267 MVFGLPNIPDRDKTCEGCIFGKMHRLPFP-KTSYRAKAPLEIVHADICG 1410 +V GLP++ C C +GK +LPFP K S+RA L++VH D+ G Sbjct: 441 IVEGLPDLEKDLPICATCQYGKQTKLPFPKKISWRATQKLQLVHTDVGG 489 >emb|CAN69620.1| hypothetical protein VITISV_008603 [Vitis vinifera] gi|147841281|emb|CAN62417.1| hypothetical protein VITISV_038422 [Vitis vinifera] Length = 570 Score = 246 bits (628), Expect = 8e-72 Identities = 148/469 (31%), Positives = 252/469 (53%), Gaps = 5/469 (1%) Frame = +1 Query: 19 VSKAILPRITGVKTSKEAWEILKKQFGGYEKVISIKLQNLWRDFDNLSMKDNENVQEFFS 198 VS +I +I + ++ E WE LK+++ G E++ ++++ NL R+F+ M++++ V+++ + Sbjct: 77 VSPSIFIKIMKIDSAAEIWEYLKEEYKGDERIKNMQVMNLIREFEMKKMRESDAVKDYAA 136 Query: 199 RVSTVVNQIRGHSDTIEDKKIVEKVLRSLPAKFEHIVAAIEESKDLSTLSLDELMGSLEA 378 ++ ++ +++R ++KIV+K+L +LP K+E ++++E SKDLST+SL EL+ SLEA Sbjct: 137 QLLSIADKVRLLGKEFSNEKIVQKILVTLPEKYEATISSLENSKDLSTISLTELLHSLEA 196 Query: 379 HEKRMSRFSSQSLEQAFQVKASISEEENQEKRANSXXXXXXXXXXXXXYXXXXXXXXXXX 558 E+R + E AFQ + + EK N+ Sbjct: 197 VEQRRLMRQGDTAEGAFQARMQKNAGHKNEKMNNNKPCSNNQKNGVFPP----------- 245 Query: 559 XXXXXXXXXXXXDLYCRICRKNNHDTKDCRF----KCKRCRNANHSQRDCWHKDDGDTNE 726 C C+K NH + C + KC +C H +R C ++ +T+ Sbjct: 246 ---------------CPHCKKTNHSPQKCWWRPDVKCNKCGKQGHVERICKNQQQEETSA 290 Query: 727 ANLSEEKEPNQVFFSCLNSQQQIENIWYVDSGCSNHMCGNKKMFVNLDESYSSEVKLGDG 906 A + + Q+F + + + W VDSGC+NHM N+ +F LD + S+V++G+G Sbjct: 291 A--VDYCQEEQLFAATCFANKSTSESWLVDSGCTNHMTNNQDLFRELDRTIISKVRIGNG 348 Query: 907 KSRKITGKGEIAVQTREGKMNIIKDVFYVPDLTQNLLSVGQLLEKDYKVEFNNDHCIIID 1086 + + GKG +A++++ G + +I DV +VPD+ QNLLSVGQL+EK++KV F + +CII D Sbjct: 349 EYIPVKGKGTVAIESQTG-LKLIYDVLFVPDIDQNLLSVGQLVEKEFKVYFEDRNCIIKD 407 Query: 1087 KQKNLTMAKIKMSSNKIFPLNLPLTEKIALQSTTEAEHNLWHLRYGHLSYKGLNLLKKKN 1266 + + IKM K F LNL E A+ ++ W R H + + +KK Sbjct: 408 AE-GKEVFNIKM-KGKSFALNLLEDEHTAILQ-QDSTTMFWDRRVEHFHHDDVLYMKKNQ 464 Query: 1267 MVFGLPNIPDRDKTCEGCIFGKMHRLPFP-KTSYRAKAPLEIVHADICG 1410 + GLP++ C C +GK +LPFP K S+RA L++VH D+ G Sbjct: 465 IAEGLPDLEKDLPICATCQYGKQTKLPFPKKISWRATQKLQLVHTDVGG 513