BLASTX nr result
ID: Rehmannia28_contig00020769
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia28_contig00020769 (1961 letters) Database: ./nr 84,704,028 sequences; 31,038,470,784 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699... 394 e-127 ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697... 391 e-126 ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom... 280 9e-86 ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom... 265 5e-80 gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly... 251 1e-72 gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly... 250 2e-72 ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798... 243 7e-71 ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955... 243 2e-69 gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo... 241 1e-68 gb|KYP75905.1| Retrovirus-related Pol polyprotein from transposo... 243 2e-68 ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954... 242 1e-67 ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662... 236 2e-67 gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family ... 234 9e-67 gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposo... 231 1e-65 gb|KYP46603.1| hypothetical protein KK1_031768 [Cajanus cajan] 232 2e-65 ref|XP_009774775.1| PREDICTED: uncharacterized protein LOC104224... 229 1e-64 ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom... 224 2e-64 ref|XP_012453130.1| PREDICTED: uncharacterized protein LOC105775... 228 9e-64 gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposo... 226 9e-64 ref|XP_008350470.1| PREDICTED: uncharacterized protein LOC103413... 226 1e-63 >ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix dactylifera] Length = 490 Score = 394 bits (1011), Expect = e-127 Identities = 212/447 (47%), Positives = 286/447 (63%), Gaps = 46/447 (10%) Frame = -2 Query: 1207 AAQITQATA---NRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSIS 1037 A+ I+ A+A N P+EDPN PFFL +DN+ T+ + PPL G+NY +WSR+FSL+IS Sbjct: 4 ASHISSASATSPNHVFTPSEDPNSPFFLHHTDNAQTVIVTPPLVGSNYLSWSRSFSLAIS 63 Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857 +KNK GFLDG+I TP+ +DPLYIPWLRCNNLIL WL+NS++KEIAS+++++ SAK+VW+ Sbjct: 64 IKNKLGFLDGSISTPEVTDPLYIPWLRCNNLILAWLLNSISKEIASNVLFIKSAKEVWNK 123 Query: 856 LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTC 677 LK R++QPD+VRI S+YFT LN IWEELRNYRP+P+CSCG C C Sbjct: 124 LKSRFAQPDNVRIYQLKQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCIC 183 Query: 676 QAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL 497 A+K VGE D+ F+FLMGLN+TYD+ RGQI+LM+P+PSLD ++++LQEERQR+AR Sbjct: 184 DALKGVGEDLELDHIFQFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEERQRQARA 243 Query: 496 SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGK--N 323 P+ ESSALA A +K K + +I C HCGK GH+ +KC+RLIGFPPNFKFTK K + Sbjct: 244 IIFPAPESSALA--AVLNKSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKFTKTKFPS 301 Query: 322 AAGKGIGQNHSANCIPPPEIPAASSDK---TKHFSFTQEQVQKLMTLLNGDPMEVSQPS- 155 K + HSAN ++ +++ K S +Q Q+Q+L+ L+N ++S S Sbjct: 302 VNNKSVAP-HSAN-----QVISSTQGKGLSAPQLSLSQTQIQQLLALVNSGIPQMSLNSA 355 Query: 154 ------------PAPDNPSNTSHFSNMAG------NITLNSQFKS--------------- 74 P + +N++ SNMAG NIT S Sbjct: 356 STQQEPILPMVTPTTETGNNSAPSSNMAGIDLCLSNITHVPDSSSTKHYSHLAYIMDHRP 415 Query: 73 ----KFSWIIDTGASDHIVCCSSLFTS 5 + WIIDTGA+DH+VC + TS Sbjct: 416 HKIHEVPWIIDTGATDHMVCSTKFLTS 442 >ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix dactylifera] Length = 514 Score = 391 bits (1005), Expect = e-126 Identities = 209/445 (46%), Positives = 277/445 (62%), Gaps = 44/445 (9%) Frame = -2 Query: 1207 AAQITQATA---NRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSIS 1037 A+ I+ A A N P+EDPN PFFL +DN+ T+ + PPL G+NY +WSR+FSL+IS Sbjct: 4 ASHISSALATSPNHVFTPSEDPNSPFFLHRTDNAQTVIVTPPLIGSNYLSWSRSFSLAIS 63 Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857 +KNK GFLDG+IPTP+ +DPLY+PWLRCNNLIL WL+NS++KEIAS+++++ S K+VW+ Sbjct: 64 IKNKLGFLDGSIPTPEVTDPLYVPWLRCNNLILAWLLNSISKEIASNVLFIKSTKEVWNK 123 Query: 856 LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTC 677 LK R++QPD+VRI S+YFT LN IWEELRNYRP+P+CSCG C C Sbjct: 124 LKSRFAQPDNVRIYQLKQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCIC 183 Query: 676 QAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL 497 A+K VGE DY F+FLM LN T+DS RGQI+LM+P+PSLD ++++LQEERQR+AR Sbjct: 184 DALKGVGENLELDYIFQFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEERQRQARA 243 Query: 496 SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAA 317 P+ ESSALA A +K K K I C HCGKPGH+ +KC+RLIGFPPNFKFTK K+ + Sbjct: 244 IIFPAPESSALA--AVLNKPKNKAKITCYHCGKPGHTREKCYRLIGFPPNFKFTKTKSPS 301 Query: 316 --GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEV------SQ 161 K + +HSAN + P S +Q QVQ+L L+N ++ SQ Sbjct: 302 VNNKSVA-SHSANQVISP--TQGKGLAAPQLSLSQAQVQQLFALVNSGITQLNLNSASSQ 358 Query: 160 PSPAP-------DNPSN--------------------------TSHFSNMAGNITLNSQF 80 P P + SN T H S++ + Sbjct: 359 QEPIPPMMKPITETGSNSTSTNMADIDLCLSSITRVPDTSLCSTKHHSHLTYLMDHRPHR 418 Query: 79 KSKFSWIIDTGASDHIVCCSSLFTS 5 + WI+DTGA+DH+VC ++ TS Sbjct: 419 THEVPWIVDTGATDHMVCSTTFLTS 443 >ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao] gi|508779769|gb|EOY27025.1| Uncharacterized protein TCM_028976 [Theobroma cacao] Length = 318 Score = 280 bits (716), Expect = 9e-86 Identities = 133/302 (44%), Positives = 188/302 (62%) Frame = -2 Query: 1213 EVAAQITQATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISV 1034 ++ +QI+QA DP P++L +D+ ++ + P L NY AWSR+F L++S+ Sbjct: 17 QLTSQISQAN---------DPPSPYYLHHTDHLGSVVVNPKLTTNNYVAWSRSFLLALSI 67 Query: 1033 KNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTL 854 +NK GF++G+IP P +D L+ W RCNNLI++WL+NS+++ IAS+I +M S ++W+TL Sbjct: 68 RNKVGFINGSIPKPSITDDLHPIWNRCNNLIVSWLLNSISQPIASTIFFMESVAEIWNTL 127 Query: 853 KLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQ 674 KL Y+QPD+ + YF L IWEELRNYRP+PHC CG+C Sbjct: 128 KLNYAQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCECGKCNAN 187 Query: 673 AIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLS 494 K + D F+FL GLNE++ + R QI+LM+PIPSLD VY+M+L+EE Q+ L Sbjct: 188 CFKKFSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQKNMFLQ 247 Query: 493 FMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAG 314 P ES A+ + KK K D+ C HCGK GH +KC+R+I FP +FKFTKGK Sbjct: 248 SQPFLESLAMLAATNVKKKPMK-DLTCTHCGKKGHVKEKCYRIIRFPEDFKFTKGKPYVK 306 Query: 313 KG 308 KG Sbjct: 307 KG 308 >ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao] gi|508722464|gb|EOY14361.1| Uncharacterized protein TCM_033758 [Theobroma cacao] Length = 328 Score = 265 bits (678), Expect = 5e-80 Identities = 133/294 (45%), Positives = 183/294 (62%), Gaps = 3/294 (1%) Frame = -2 Query: 1057 AFSLSISVKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNS 878 +F L++S++NK F+DG+IP PD SD L++P RCN+LIL WL+ S++ IAS++ Y+ Sbjct: 23 SFLLALSIQNKSRFIDGSIPEPDVSDKLFVPCTRCNSLILAWLLESISPPIASTVFYIRK 82 Query: 877 AKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHC 698 A +VW+TLK R+SQPD RI YFT LN IWEELRNYRP+PHC Sbjct: 83 AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142 Query: 697 SCGQCTCQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEE 518 SCG C ++ + D F+FL GLNE++ + R QIL+M P PSL+ Y +++++E Sbjct: 143 SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202 Query: 517 RQREARLSFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKF 338 QR L MP ESSA+A K K K+D++C +C K GH+ DKC+RLIGFPP+FKF Sbjct: 203 SQRNLYLHTMPIIESSAMATMTE-GKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261 Query: 337 TKGKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFS---FTQEQVQKLMTLLN 185 KGK+ K G S N + P + TK S ++ Q+QKLM+L+N Sbjct: 262 LKGKSPLKK--GNVWSINNVGPVTSKEECDESTKSLSSLTLSKHQIQKLMSLIN 313 >gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja] Length = 484 Score = 251 bits (641), Expect = 1e-72 Identities = 139/407 (34%), Positives = 215/407 (52%), Gaps = 19/407 (4%) Frame = -2 Query: 1168 FPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPD 989 F T N P++L P++N + + P L NY WSR+ +++ KNK F+DG++P P Sbjct: 1 FSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPP 59 Query: 988 FSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXX 809 SDPLY PW+RCN ++L W+ S++ IA S++++++A VW L++R+SQ D RI Sbjct: 60 VSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDL 119 Query: 808 XXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYT 632 SDYFT L W+EL NYRPIPHC C C+C I SV + DY Sbjct: 120 QEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYV 179 Query: 631 FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAV 458 +FL GLN+ + ++ QI++MNP+P +D V+++++Q+ER+ S ++ SA+A+ Sbjct: 180 IRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAM 239 Query: 457 GAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKN 323 + ++ F + +C HCGK H +D CF IG+PP +K K KN Sbjct: 240 QVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKN 299 Query: 322 AAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAP 146 ++ N S A+ + E S F FTQE Q ++ L + SQP Sbjct: 300 SSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG-SQPKA-- 353 Query: 145 DNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 N TS F+ + + N K+ WI+DT ++++ C FT+ Sbjct: 354 -NSVTTSPFA--LHSPSSNPNGKNPSLWILDTASTNN---CHLSFTT 394 >gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja] Length = 484 Score = 250 bits (639), Expect = 2e-72 Identities = 139/407 (34%), Positives = 215/407 (52%), Gaps = 19/407 (4%) Frame = -2 Query: 1168 FPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPD 989 F T N P++L P++N + + P L NY WSR+ +++ KNK F+DG++P P Sbjct: 1 FSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPP 59 Query: 988 FSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXX 809 SDPLY PW+RCN ++L W+ S++ IA S++++++A VW L++R+SQ D RI Sbjct: 60 VSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDL 119 Query: 808 XXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYT 632 SDYFT L W+EL NYRPIPHC C C+C I SV + DY Sbjct: 120 QEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYV 179 Query: 631 FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAV 458 +FL GLN+ + ++ QI++MNP+P +D V+++++Q+ER+ S ++ SA+A+ Sbjct: 180 IRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAM 239 Query: 457 GAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKN 323 + ++ F + +C HCGK H +D CF IG+PP +K K KN Sbjct: 240 QVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKN 299 Query: 322 AAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAP 146 ++ N S A+ + E S F FTQE Q ++ L + SQP Sbjct: 300 SSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG-SQPKA-- 353 Query: 145 DNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 N TS F+ + + N K+ WI+DT ++++ C FT+ Sbjct: 354 -NLVTTSPFA--LHSPSSNPNGKNPSLWILDTASTNN---CHLSFTT 394 >ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max] Length = 389 Score = 243 bits (621), Expect = 7e-71 Identities = 133/389 (34%), Positives = 202/389 (51%), Gaps = 19/389 (4%) Frame = -2 Query: 1189 ATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLD 1010 A N F T N P++L P++N + + P L NY WS + +++ KNK F+D Sbjct: 2 ALQNFVDFSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSHSMHIALISKNKDKFID 60 Query: 1009 GTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 830 G++P P SDPLY PW+RCN ++L W+ S++ IA S++++++A VW L++R+SQ D Sbjct: 61 GSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSD 120 Query: 829 SVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGE 653 RI SDYFT L W+EL NYRPIPHC C C+C I SV Sbjct: 121 IFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRV 180 Query: 652 IQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSS 479 + DY +FL GLN+ + ++ QI++MNP+P +D V+++++Q+ER+ S ++ Sbjct: 181 YREQDYVVRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEAT 240 Query: 478 ESSALAVGAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNF 344 SA+A+ + ++ F + +C HCGK H +D CF IG+PP + Sbjct: 241 SDSAMAMQVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGY 300 Query: 343 KFTKGKNAAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEV 167 K K KN++ N S A+ + E S F FTQE Q ++ L + Sbjct: 301 KTNKSKNSSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG- 356 Query: 166 SQPSPAPDNPSNTSHFSNMAGNITLNSQF 80 SQP N TS F+ + + N F Sbjct: 357 SQPKA---NSVTTSPFALHSPSSNPNESF 382 >ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955841, partial [Erythranthe guttata] Length = 514 Score = 243 bits (621), Expect = 2e-69 Identities = 146/407 (35%), Positives = 214/407 (52%), Gaps = 22/407 (5%) Frame = -2 Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986 P D + P FL PSD + I + NY +WSRA ++S++VKNK GF+DGTI P Sbjct: 8 PLGDVSHPMFLHPSDGPNLILVSQLFTEDNYASWSRAMTISLTVKNKIGFIDGTISEPA- 66 Query: 985 SDPLYI--PWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXX 812 +D L + W+R NN++++W+INSV+K+I SI+Y NS+K++WD LK R+SQ + RI Sbjct: 67 ADELVMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126 Query: 811 XXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYT 632 + YFT + IW+EL NYRP CSCG+C C + + +Y Sbjct: 127 LRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184 Query: 631 FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGA 452 FLMGLNE+ STRGQILLM+P+P + V+A + QEERQR S + SS S +V Sbjct: 185 MSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEERQRSVVSSHVESS-GSVFSVKN 243 Query: 451 HPSKK-----------KFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGI 305 K+ K K C HC GH+++KC++L G+PP++K K + ++ Sbjct: 244 EGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSPANQ 303 Query: 304 GQNHSANCIPPPEIPAASSDKTKHF--SFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSN 131 ++ SS + S T Q Q+ M++ + Q S A P + Sbjct: 304 VSGFDSSLDSHSSDSGVSSQHVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSAASAQPQS 363 Query: 130 TSHFSNMA------GNITLN-SQFKSKFSWIIDTGASDHIVCCSSLF 11 ++H ++ A G L+ + S WI+D+GAS HI LF Sbjct: 364 SAHGADTATVSCVTGTCALSGAPSLSSTDWILDSGASKHICHDKQLF 410 >gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 495 Score = 241 bits (615), Expect = 1e-68 Identities = 128/398 (32%), Positives = 201/398 (50%), Gaps = 18/398 (4%) Frame = -2 Query: 1144 PFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDFSDPLYIP 965 P++L P++N + + P L NY WSR+ +++ KNK F+DG++P P SDPLY P Sbjct: 5 PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAP 64 Query: 964 WLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXX 785 W+RCN ++L W+ S++ IA S++++++A VW L++R+S D RI Sbjct: 65 WIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRISDLQEDLYRFR 124 Query: 784 XXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYTFKFLMGLN 608 SDYFT L W+EL NYRPIP+C C C+C I SV + DY +FL GLN Sbjct: 125 QGTLDVSDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLN 184 Query: 607 ETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAVGAHPSKKK 434 + + ++ QI++MNP+P +D V+++++Q+ER+ S ++ SA+A+ + ++ Sbjct: 185 DRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSN 244 Query: 433 FK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQ 299 F + +C HCGK H +D CF IG+PP +K K KN++ Sbjct: 245 FNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 304 Query: 298 NHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHF 119 N S A++ + T+ S Q + P + PS P+ Sbjct: 305 NTS---------NASALESTQQGSSAQS--------ITTSPFALHSPSSNPNG------- 340 Query: 118 SNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 K+ WI+DTGA+DHI S T+ Sbjct: 341 -------------KNPSLWILDTGATDHITFDLSSLTT 365 >gb|KYP75905.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 594 Score = 243 bits (620), Expect = 2e-68 Identities = 142/407 (34%), Positives = 214/407 (52%), Gaps = 14/407 (3%) Frame = -2 Query: 1189 ATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLD 1010 +T N S PT+D P+FL PSDN + PL+G NY +WSRA +++ KNK GF+D Sbjct: 3 STNNSSSLPTDDYANPYFLHPSDNPGAFIVSQPLNGDNYNSWSRAILMALGEKNKIGFVD 62 Query: 1009 GTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 830 GTIP P +D Y W R NN++ +WL+N ++K++ +S+IY +SA +W+ L++R+ Q + Sbjct: 63 GTIPKPLPTDKSYHSWQRNNNIVASWLLNFISKDLQASVIYSSSATAIWNDLRIRFQQHN 122 Query: 829 SVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEI 650 R+ + YFT + +WEEL Y+P C+CG IK + Sbjct: 123 GPRVFQLRRDLVTLKQGSLNITHYFTKIKALWEELAEYQPSHACTCG-----GIKPWIDH 177 Query: 649 QLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL--------- 497 S+Y FLMGLNE Y RGQILLM+PIP ++ ++++LQEE+Q+E + Sbjct: 178 HQSEYAMLFLMGLNEGYSHIRGQILLMDPIPPIEKGFSLVLQEEKQQELGIPTNSNDTPT 237 Query: 496 SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAA 317 +F S + A + +P+K++ K C+HCGK GH DKCF+L G+P + K Sbjct: 238 AFAYKSGNDAKSRTNNPTKERPK----CEHCGKLGHIKDKCFKLHGYPTHLK-------- 285 Query: 316 GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP 137 Q +S + S K F FT +Q ++++LL + P Sbjct: 286 -----QGNSNT-----NVNQVSDKSAKAFQFTTDQYHQILSLLQ-------------NQP 322 Query: 136 SNTSHFSNMAGN---ITLNSQFKS--KFSWIIDTGASDHIVCCSSLF 11 S+ SN N +++ F S WI+D+GAS H+ C SLF Sbjct: 323 SSNCIESNPIVNGLLLSIRPSFNSIPSTKWILDSGASTHVACSLSLF 369 >ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata] Length = 659 Score = 242 bits (618), Expect = 1e-67 Identities = 145/406 (35%), Positives = 215/406 (52%), Gaps = 21/406 (5%) Frame = -2 Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986 P +D + P FL PSD + I + L NY +WSRA ++S++VKNK GF+DGTI P Sbjct: 8 PLDDVSHPMFLHPSDGPNLILVSQLLTEDNYASWSRAMTISLTVKNKIGFIDGTISEPP- 66 Query: 985 SDPLYI--PWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXX 812 +D L + W+R NN++++W+INSV+K+I SI+Y NS+K++WD LK R+SQ + RI Sbjct: 67 ADELIMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126 Query: 811 XXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYT 632 + YFT + IW+EL NYRP CSCG+C C + + +Y Sbjct: 127 LRRDLANLTQGSQSVNVYFTKVKAIWDELANYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184 Query: 631 FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGA 452 FLMGLN++ STRGQILLM+P+P + V+A + QEERQR S + SS S +V Sbjct: 185 MSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEERQRSVVSSHVDSS-GSVFSVKN 243 Query: 451 HPSKK-----------KFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGI 305 K+ K + C HC GH+++KC++L G+PP++K K + ++ Sbjct: 244 EGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSHVNQ 303 Query: 304 GQNHSANCIPPPEIPAASSDKTKHF--SFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP-- 137 ++ SS + + S T Q Q+ M++ + Q S A P Sbjct: 304 VSGFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSTASIQPQS 363 Query: 136 ---SNTSHFSNMAGNITLNS-QFKSKFSWIIDTGASDHIVCCSSLF 11 ++T+ S + G L+ S WI+D+GAS HI LF Sbjct: 364 AHGADTATVSCVTGICALSGVPSLSSADWILDSGASKHICHDKQLF 409 >ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max] Length = 424 Score = 236 bits (601), Expect = 2e-67 Identities = 136/416 (32%), Positives = 214/416 (51%), Gaps = 26/416 (6%) Frame = -2 Query: 1174 SPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPT 995 S F T +P+ P+++ P++N I ++P LD NY W R+ +++ KNK F+DGT+ Sbjct: 6 SDFAT-NPSNPYYMHPNENPSLILVQPVLDNKNYQIWCRSMKVALISKNKVKFVDGTLSP 64 Query: 994 PDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIX 815 P SDPLY PWLRCNNL+L+WL S ++EIA S+++ + A VW +L+ R+SQ D R+ Sbjct: 65 PPISDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVA 124 Query: 814 XXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSD 638 S YFT L T+WEE+ N+RPI C+C C+C A + + + D Sbjct: 125 DIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQD 184 Query: 637 YTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSE--SSAL 464 KFL GL + Y R QI+LM+P+P+LD + ++LQ+ERQ + S E SS Sbjct: 185 KVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVN 244 Query: 463 AVGAHPSKKKFKLDI--------------ICQHCGKPGHSIDKCFRLIGFPPNFKFTKGK 326 PS+ +C HC + H+++ CF G+PP F+ K Sbjct: 245 HFSQTPSRPSNNSGCGRGRGYSSGGRGNRLCTHCNRTNHTVETCFIKHGYPPGFQHRKSN 304 Query: 325 NAAGKGIGQN----HSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQP 158 ++ + + SA+ +++ + S QEQ +++ LL ++ + P Sbjct: 305 SSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQLLQQSNLQSTSP 364 Query: 157 SP-----APDNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 S A ++ S+TS + N++ N + WI+DTGA+DHI F+S Sbjct: 365 SSVNSVFATNSVSHTSPSPSSGKNLSNN----TSHWWIVDTGATDHITHIFDSFSS 416 >gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family [Cajanus cajan] Length = 437 Score = 234 bits (597), Expect = 9e-67 Identities = 135/392 (34%), Positives = 207/392 (52%), Gaps = 14/392 (3%) Frame = -2 Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986 P+ DP P FL SD PLD NYT WSRA +++ VKNK F+DG++P P Sbjct: 8 PSSDPTNPLFLHHSDGPGLFLTSQPLDNKNYTTWSRAMLVALGVKNKIPFVDGSLPRPAA 67 Query: 985 SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806 DP Y W+ NN++++WL NSV+KEI +SI++ N AK++WD LK R+S+ + RI Sbjct: 68 DDPTYAAWIHGNNVVISWLYNSVSKEIITSILFANIAKEIWDDLKSRFSRKNGPRIFQLR 127 Query: 805 XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYTFK 626 S Y+T L +IWE+L Y+P C+CG Q ++ ++ +Y Sbjct: 128 RQLTSLQQGTDDVSTYYTKLKSIWEDLSGYKPSFPCTCG--GLQHLQVYNDL---EYVMS 182 Query: 625 FLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQRE--ARLSFMPSSESSALAVGA 452 FLMGLN+++ RGQILL +P+P + V++++LQEE QRE ++ PS S +A Sbjct: 183 FLMGLNDSFSQIRGQILLSDPLPPIGNVFSLVLQEETQREIGTAVTHTPSINSDNMAFDV 242 Query: 451 HPSKKKFKLDII---------CQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQ 299 + S K D C +CG GH+ DKC++L+G+PPN+ F + + + Sbjct: 243 NSSTKSSAADHYKFNRRERPKCAYCGLLGHTKDKCYKLVGYPPNYNFKNRQTPVANQVLE 302 Query: 298 NHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHF 119 + P P ++ K + T Q Q+L+ L + M++ P A P+N + Sbjct: 303 S------PEP------LNQNKPDNLTPAQCQQLINFLT-NQMKLDNPDEAV--PTNVT-- 345 Query: 118 SNMAGNITLNSQF---KSKFSWIIDTGASDHI 32 I +N+ F + W+ID+GA+ HI Sbjct: 346 -----GICMNTHFLLHNITYRWVIDSGATSHI 372 >gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 444 Score = 231 bits (590), Expect = 1e-65 Identities = 130/401 (32%), Positives = 217/401 (54%), Gaps = 8/401 (1%) Frame = -2 Query: 1183 ANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGT 1004 A+++ P++D + P FL SD + PLD NYT WSRA +++ VKNK F+DGT Sbjct: 2 ADQAKDPSQDVSNPLFLHHSDGPGLVLTSQPLDHKNYTTWSRAMQVALFVKNKLAFIDGT 61 Query: 1003 IPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSV 824 +P P +D ++ W NN++++WL NSV+K+I +SI++ ++A+++W LK R+S+ + Sbjct: 62 LPKPASTDSTFVAWNHANNVVISWLYNSVSKDIITSILFASTAQEIWHDLKTRFSKKNGS 121 Query: 823 RIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQL 644 RI S Y+T L +IWEEL Y+P QCTC ++ + Sbjct: 122 RIFQLRRQLMSLHQGMDDISTYYTKLKSIWEELSGYKP-----TFQCTCGGLQQLQSFTE 176 Query: 643 SDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMP---SSES 473 S+Y FLMGLN++ RGQILL +P+PS+ V++++LQ+E QRE ++ P +S++ Sbjct: 177 SEYVMSFLMGLNDSISQIRGQILLSDPLPSIGNVFSLVLQDEAQREIAVTSSPPVANSDN 236 Query: 472 SALAVGAH---PSKKKF--KLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKG 308 V + S+ +F K C HC GH+ D C++L+G+PPN+ Sbjct: 237 IVFTVNSSQPATSRNRFTKKERPRCAHCNILGHTKDTCYKLVGYPPNY------------ 284 Query: 307 IGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNT 128 +NH+ N + + + ++ + T +Q Q+L+ L +Q + T Sbjct: 285 -FKNHTTNTVNQVTGSSDNVLTSQSSNLTPDQRQQLINFL------TNQMQADTTLDAIT 337 Query: 127 SHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 ++ + + N+ L++ + +WIID+GA+ HI C LF S Sbjct: 338 TNVTGICMNVALDNNY---HTWIIDSGATSHICCFKHLFHS 375 >gb|KYP46603.1| hypothetical protein KK1_031768 [Cajanus cajan] Length = 483 Score = 232 bits (591), Expect = 2e-65 Identities = 134/409 (32%), Positives = 213/409 (52%), Gaps = 10/409 (2%) Frame = -2 Query: 1204 AQITQATANRSPFPTEDPNK---PFFLPPSDNSHTIEIRPPLDG-TNYTAWSRAFSLSIS 1037 A +Q ++ DPN +F+ P++N + L G +NY W+RA ++ Sbjct: 7 ASSSQNASSSQGADLSDPNNRLSEYFIHPNENPSASLVAKLLIGLSNYHIWARAMRRNLI 66 Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857 KNK F+DG+ PD DPLY W RCNNL+ +W+++SV+ IA SI YM A DVW Sbjct: 67 TKNKFRFVDGSNLVPDRFDPLYGAWERCNNLVNSWILSSVSPTIADSIDYMEYASDVWKD 126 Query: 856 LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSC-GQCT 680 L+ R++Q D VRI +YFT L T+WEEL NY P+P+C C +C Sbjct: 127 LRERFAQSDLVRISELQYEIFSHKQGNFSVIEYFTHLKTLWEELENYIPVPYCPCRTKCA 186 Query: 679 CQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQ---- 512 C A++ + + DY +FL GLN+ Y++ + QILL + +PSL+ ++M++Q ERQ Sbjct: 187 CPALRDIKSYRDEDYVIRFLQGLNDDYNALKSQILLKDNLPSLNKAFSMVVQHERQYGLE 246 Query: 511 REARLSFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTK 332 E + + +S G++ K + C HC K GH+I+ C++ G PPN +F Sbjct: 247 PENDNQVLVNYSNSRRGKGSYSGSSKSYNERYCTHCKKHGHTIEVCYQKHGLPPNLRFK- 305 Query: 331 GKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLL-NGDPMEVSQPS 155 N++ + Q+ + N A + K + +FT+E+ + L+ LL N + + Sbjct: 306 -TNSSANVVSQDGNQNESEDEITDATGTGKDEVPTFTKEEYKSLLALLHNSQSQGIHVAN 364 Query: 154 PAPDNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFT 8 + S +G + + S+ ++ WI+D GA+DHI C LF+ Sbjct: 365 QFKTVSISALSESAESGKLLMFSKCSNEVLWILDFGATDHICCSLDLFS 413 >ref|XP_009774775.1| PREDICTED: uncharacterized protein LOC104224769 [Nicotiana sylvestris] Length = 446 Score = 229 bits (583), Expect = 1e-64 Identities = 135/397 (34%), Positives = 208/397 (52%), Gaps = 10/397 (2%) Frame = -2 Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986 PT D + P+FL PSD+ + DG Y W R+ +++S K K GF+DG+ P F Sbjct: 18 PTIDASHPYFLYPSDSPGMTLVTSVFDGWGYGGWRRSLLIALSTKYKLGFIDGSCSAPAF 77 Query: 985 SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806 + W RCN++I +WL+NS++KEI +S +Y SA+ +W L+ R+ Q + ++ Sbjct: 78 DSTSFSLWTRCNDMITSWLLNSLSKEIVASALYSKSAQALWTDLEDRFGQSNGAKLYHLQ 137 Query: 805 XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQA-IKSVGEIQLSDYTF 629 + YFT L W+EL SC CTC +K V +Q ++ Sbjct: 138 KEISDLMQGSSDIAGYFTKLKLSWDELDAIYTTVTYSCA-CTCSGKVKLVKSLQ-NERLI 195 Query: 628 KFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLS-FMPSSESSALAVGA 452 +FLMGLN+TY R IL+M+P+PS++ Y++L+Q+E+QRE ++ P SS LA Sbjct: 196 QFLMGLNDTYSPVRSNILMMSPLPSINIAYSLLVQDEKQREVYVNPQFPGDFSSFLATHQ 255 Query: 451 HPSKKKF--------KLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQN 296 + S +K K ++IC HC KPGHS+DKC+R+IGFP +FKFTK G + Sbjct: 256 NISGQKSQSSDFKGRKNNLICSHCKKPGHSVDKCYRIIGFPSDFKFTKTPKLHG-----S 310 Query: 295 HSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHFS 116 +N I A + T TQ+Q +L+ LLN + + N + Sbjct: 311 VKSNAI--LSFHAQPTGNTGGNPITQDQFSQLIHLLNNAQLGHTGSPTTKVNANVVQCVG 368 Query: 115 NMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 N+ N ++ + + SWIID+GAS+H+ + F + Sbjct: 369 NIFNNPSIYLTYANTHSWIIDSGASEHMSYDTKFFAT 405 >ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao] gi|508708772|gb|EOY00669.1| Uncharacterized protein TCM_010591 [Theobroma cacao] Length = 336 Score = 224 bits (572), Expect = 2e-64 Identities = 123/328 (37%), Positives = 186/328 (56%), Gaps = 1/328 (0%) Frame = -2 Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986 P E+ +++ SD ++ I P L NY +WSRAF L++S+ K+GF+DGTI P Sbjct: 12 PAENLLSSYYIHHSDLHGSVVINPKLAVANYMSWSRAFLLALSICKKRGFIDGTIKKPSE 71 Query: 985 SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806 ++ L+ W RCN LI+TWL+ S+T +IAS+++ M+SAK++ +TLK R+SQP I Sbjct: 72 ANSLFEDWSRCNILIVTWLLESLTPKIASNVLDMDSAKEILETLKNRFSQPYETIICNLQ 131 Query: 805 XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYTFK 626 + YFT LN++W+EL+N+RP+P C K + Q D F Sbjct: 132 FQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQCDYEGRKNNCYKKYADQQNKDAVFC 191 Query: 625 FLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGAHP 446 FL GLNE++ R IL++ P S+D Y++++++ QR L E+S +A Sbjct: 192 FLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKMLQRS--LILQSPVENSTMATVITE 249 Query: 445 SKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQNHSA-NCIPPP 269 K+K +++C HCGK GHS +K + +IGFP NFKFTK K KG +SA + Sbjct: 250 EKRK-NTNLVCSHCGKKGHSKEKYYCIIGFPENFKFTKLKRNMRKGGSSVNSAISGSEQD 308 Query: 268 EIPAASSDKTKHFSFTQEQVQKLMTLLN 185 E ++ S T+ Q+QKLMTL++ Sbjct: 309 EYDETVTNSISQLSLTKAQIQKLMTLIS 336 >ref|XP_012453130.1| PREDICTED: uncharacterized protein LOC105775144 [Gossypium raimondii] Length = 513 Score = 228 bits (582), Expect = 9e-64 Identities = 129/404 (31%), Positives = 207/404 (51%), Gaps = 21/404 (5%) Frame = -2 Query: 1153 PNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDFSDPL 974 P+ P+FL P++N + + P L NY +WSRA +++ KNK F+DG+I P +D + Sbjct: 9 PSSPYFLHPNENPSLVLVTPTLTSLNYNSWSRAMRMALLSKNKLKFVDGSILPPATTDSI 68 Query: 973 YIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXXXXXX 794 Y W RCNN++++WL +S+++ I +SI+++++A D+W L R+SQ D RI Sbjct: 69 YPAWERCNNMVISWLHHSISQSIVNSILWIDTAHDIWRDLHKRFSQGDVFRISDLQDEIS 128 Query: 793 XXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYTFKFLM 617 +DYFT L +W+EL N+RP+P CSC QC+C A ++ + +DY +FL Sbjct: 129 VFKQEERSVTDYFTELKVLWDELLNFRPLPSCSCRVQCSCGAFTTIRKYHNNDYVIRFLK 188 Query: 616 GLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFM--------PSSESSALA 461 GLNE Y S R QI+L++P+P+++ ++M++Q+ RQ A S + PS S + Sbjct: 189 GLNERYASIRSQIMLLDPLPTINKAFSMVIQQGRQLLAPSSTVFASNAVRQPSKRPSQAS 248 Query: 460 VGAHPSKKKFKLDI-ICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQNHSAN 284 K+D C CG H++D C+ GFPP + K N+ + + Sbjct: 249 SQVSSRSSDSKIDTRKCTFCGGLRHTVDTCYHKNGFPPGY---KSHNSTSRVHNMFEEID 305 Query: 283 CIPPPEIPAASSDKTKHFS---FTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSH--- 122 S T S TQEQ+ +L+ LL + + P+ + P T+ Sbjct: 306 ADTVDSFTGYSQSVTSQGSGVTLTQEQITQLLALLPSSSNQSTNPTHSQPTPHLTNQVLA 365 Query: 121 -----FSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5 ++ G + F S + WI+DT A+DHI + F S Sbjct: 366 TPSLTLASTEGIFSTPISFHSPY-WIVDTSATDHITHTLTSFAS 408 >gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 434 Score = 226 bits (576), Expect = 9e-64 Identities = 125/357 (35%), Positives = 194/357 (54%), Gaps = 11/357 (3%) Frame = -2 Query: 1048 LSISVKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKD 869 ++++VKNK F+DGT+P PD DP ++PW R NN++++W+ NSV+KEI +SI++ +AK+ Sbjct: 3 VALTVKNKLSFIDGTLPKPDIEDPTFVPWNRENNVVISWIYNSVSKEIITSILFATTAKE 62 Query: 868 VWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG 689 WD LK R+S+ + RI S Y+T L +IWEEL Y+P Sbjct: 63 NWDDLKTRFSRKNGPRIFHLKRQLMSLQQGSDDVSTYYTKLKSIWEELAGYKP-----NF 117 Query: 688 QCTCQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR 509 QCTC ++S+ + S+Y FLMGLN+++ RGQILL +P+PS+ V++++LQEE Q+ Sbjct: 118 QCTCGGLESLHKHTQSEYVMSFLMGLNDSFSQIRGQILLSDPLPSIGNVFSLILQEETQK 177 Query: 508 EARLSFMPSSESSALAVGAHP--------SKKKF--KLDIICQHCGKPGHSIDKCFRLIG 359 E ++ S+ S +A + +K KF K + C HC GH+ DKC++L+G Sbjct: 178 EIAVTHATSAHSDDMAFAVNQCSKTNFDNNKGKFVKKDRLKCAHCEMFGHTKDKCYKLVG 237 Query: 358 FPPNFKFTKGKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGD 179 +PPN+ +N + +I SS + T Q Q+LMTLLN Sbjct: 238 YPPNY-------------FKNRQPQVVNQVDISHESSTSNTALNLTPAQCQQLMTLLNNQ 284 Query: 178 PMEVSQPSPAPDNPSNTSHFSNMAGNITLNSQFKSK-FSWIIDTGASDHIVCCSSLF 11 + +N + + I +N F K +WIID+GA+ HI C +L+ Sbjct: 285 ----------IQSDNNLNAIATNVTGICMNVDFSDKNHTWIIDSGATSHICCSKTLY 331 >ref|XP_008350470.1| PREDICTED: uncharacterized protein LOC103413804 [Malus domestica] Length = 451 Score = 226 bits (577), Expect = 1e-63 Identities = 134/396 (33%), Positives = 197/396 (49%), Gaps = 10/396 (2%) Frame = -2 Query: 1189 ATANRSPFPT-EDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFL 1013 +T ++P T D + PF L PSD I + L G NY W RA +S+S KNK G + Sbjct: 9 STDTQNPMETIXDVSNPFILHPSDQPGNILVSKTLQGDNYNTWXRAMRISLSAKNKLGMV 68 Query: 1012 DGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQP 833 DGTI P +D + W RCN+++L W++NSV +IASS+ Y +A DVW L+ R+SQ Sbjct: 69 DGTIDPPSETDKQFASWXRCNDMVLAWILNSVHDDIASSVSYYTTATDVWADLRDRFSQG 128 Query: 832 DSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGE 653 + RI S Y+T L +W+EL +Y P C+CG +K + + Sbjct: 129 NDSRIYQIKREIVEHRQEQQSISVYYTKLKALWDELASYNETPTCTCG-----GLKKIND 183 Query: 652 IQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSES 473 + +FLMGLN++Y + RGQILLM P+P Y+++LQ+E+Q E L+ + Sbjct: 184 RDEKERVMQFLMGLNDSYAAVRGQILLMQPLPDTRRAYSLVLQQEKQVEVSLNRNNINLH 243 Query: 472 SALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKF--------TKGKNAA 317 + + + C +C H++D+CF L GFPP K+ K K AA Sbjct: 244 AMNITRNRXTAAPKGNTJQCSYCDXKYHTVDRCFYLYGFPPGHKYHGKSVKPPNKRKPAA 303 Query: 316 GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP 137 + + + + A SSD K FT E+ +LM +L + Sbjct: 304 NQVTVETETTKGVDSRH-KATSSDGPK---FTTEEYNQLMAMLK-----------KSNXD 348 Query: 136 SNTSHFSNMAGNITLNSQFKSK-FSWIIDTGASDHI 32 N HF+N G IT +S K WIID+GA+DH+ Sbjct: 349 GNPQHFANATGTITPSSBLSEKTLYWIIDSGATDHV 384