BLASTX nr result
ID: Forsythia22_contig00056955
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia22_contig00056955 (564 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, part... 192 8e-47 ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prun... 188 1e-45 emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera] 187 3e-45 emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera] 184 2e-44 emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera] 181 2e-43 ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobrom... 179 7e-43 gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare] 177 2e-42 ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [The... 177 3e-42 ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom... 176 5e-42 gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare] 176 5e-42 ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobrom... 176 8e-42 ref|XP_008779530.1| PREDICTED: uncharacterized protein LOC103699... 174 2e-41 ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The... 173 4e-41 gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum... 173 5e-41 ref|XP_007023480.1| Uncharacterized protein TCM_027505 [Theobrom... 173 5e-41 gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja... 172 7e-41 dbj|BAA89466.1| gag-pol polyprotein, partial [Oryza sativa Indic... 172 9e-41 gb|AAM94350.1| gag-pol polyprotein [Zea mays] 171 2e-40 ref|XP_007037024.1| Uncharacterized protein TCM_013224 [Theobrom... 171 2e-40 gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa... 170 3e-40 >ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, partial [Prunus persica] gi|462403623|gb|EMJ09180.1| hypothetical protein PRUPE_ppa015715mg, partial [Prunus persica] Length = 1445 Score = 192 bits (488), Expect = 8e-47 Identities = 98/190 (51%), Positives = 124/190 (65%), Gaps = 5/190 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T+L FSSA+HPQTDGQTEVVNRSLG+LLRCLVG+ W +L AEFA+N ++NRSTG Sbjct: 1143 TTLKFSSAFHPQTDGQTEVVNRSLGDLLRCLVGDKPGNWDLLLPVAEFAYNNSVNRSTGK 1202 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEVV+G PR P+DLV+LP ++ SA EHI+Q+H + D YKLAA+ Sbjct: 1203 SPFEVVHGFSPRSPVDLVALPVAARTSDSATSFAEHIRQLHDDVRRQISMHTDTYKLAAN 1262 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EF GD V + +RFP H + KL A RI+KK+ SNAY +ELP M Sbjct: 1263 AHRRQQEFREGDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPADMHI 1322 Query: 32 ADVFNVKHLA 3 + +FNV L+ Sbjct: 1323 SPIFNVSDLS 1332 >ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica] gi|462402465|gb|EMJ08022.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica] Length = 1274 Score = 188 bits (478), Expect = 1e-45 Identities = 96/190 (50%), Positives = 122/190 (64%), Gaps = 5/190 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T+L FSSA+HPQTDGQTEVVNRSLG+LL CLVG+ W +L AEF +N ++NRSTG Sbjct: 972 TTLKFSSAFHPQTDGQTEVVNRSLGDLLHCLVGDKPGNWDLLLPVAEFTYNNSVNRSTGK 1031 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEVV+G PR P+DLV+LP ++ SA EHI+Q+H + D YKLAA+ Sbjct: 1032 SPFEVVHGFSPRSPVDLVALPVAARSSDSATSFAEHIRQLHDDVRRQISMHTDTYKLAAN 1091 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EF GD V + +RFP H + KL A RI+KK+ SNAY +ELP M Sbjct: 1092 AHRRQQEFREGDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPANMHI 1151 Query: 32 ADVFNVKHLA 3 + +FNV L+ Sbjct: 1152 SPIFNVSDLS 1161 >emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera] Length = 1323 Score = 187 bits (474), Expect = 3e-45 Identities = 92/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FSS++HPQTDGQTEVVNRSLGNLLRC+V + ++ W VL QAEFA N + NR+TGY Sbjct: 1024 TQLKFSSSFHPQTDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTGY 1083 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEV YGL P+ P+DL+ LP S HI+ +H+ +K +N+ YK A D Sbjct: 1084 LPFEVAYGLKPKQPVDLIPLPTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAXD 1143 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR+++F+ G LV L +RF Y KL+A R+LK++ NAY LELP + Sbjct: 1144 AHRRYIQFQEGGLVMVRLRPERFHPSTYQKLQAKKAGPFRVLKRLGENAYLLELPSNLXF 1203 Query: 32 ADVFNVKHL 6 + +FNVK L Sbjct: 1204 SPIFNVKDL 1212 >emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera] Length = 354 Score = 184 bits (467), Expect = 2e-44 Identities = 91/189 (48%), Positives = 122/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FSS++HPQTDGQ EVVNRSLGNLLRC+V + ++ W VL QAEFA N + NR+TGY Sbjct: 56 TQLKFSSSFHPQTDGQIEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTGY 115 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEV YGL P+ +DL+ LP S HIQ +H++ +K +N+ YK AAD Sbjct: 116 SPFEVAYGLKPKQLVDLIPLPTSVHTSQDGDAFTRHIQDIHENVREKIKISNENYKEAAD 175 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR+++F+ GDLV L +RF Y KL+A ++LK++ NAY LELP + Sbjct: 176 AHRRYIQFQEGDLVMVRLRPERFHPSTYQKLQAKKAGPFQVLKRLGENAYLLELPSNLHF 235 Query: 32 ADVFNVKHL 6 + +FNV+ L Sbjct: 236 SPIFNVEDL 244 >emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera] Length = 895 Score = 181 bits (458), Expect = 2e-43 Identities = 90/187 (48%), Positives = 120/187 (64%), Gaps = 5/187 (2%) Frame = -2 Query: 551 LDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGYCP 372 L FSS++HPQTDGQTEVVNRSLGNLLRC+V + ++ W VL QAEFA N + NR+ G+ P Sbjct: 599 LKFSSSFHPQTDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTIGHSP 658 Query: 371 FEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAADGS 192 FEV YGL P+ P+DL+ L S HI+ +H+ +K +N+ YK AAD Sbjct: 659 FEVAYGLKPKQPIDLIPLSTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAADAH 718 Query: 191 RRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKTAD 27 RR+++F+ GDLV A L +RF Y KL+A R+LK + NAY LELP + + Sbjct: 719 RRYIQFQEGDLVMARLRPERFHPSTYQKLQAKKAGPFRVLKWLGENAYLLELPSNLHFSP 778 Query: 26 VFNVKHL 6 +FNV+ L Sbjct: 779 IFNVEDL 785 >ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobroma cacao] gi|508724940|gb|EOY16837.1| Uncharacterized protein TCM_035725 [Theobroma cacao] Length = 499 Score = 179 bits (454), Expect = 7e-43 Identities = 92/189 (48%), Positives = 124/189 (65%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTDGQTEVVNRSLGN+LRCL+ + K W V+ QAEFA+N ++NRS Sbjct: 221 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 280 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEV YGL P+ LDLV LP + V + +HI+++H+ A LK +N +Y A+ Sbjct: 281 TPFEVAYGLKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEEVKAALKASNAEYSFTAN 340 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 341 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 400 Query: 32 ADVFNVKHL 6 + +FN+ L Sbjct: 401 SHIFNILDL 409 >gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare] Length = 1720 Score = 177 bits (450), Expect = 2e-42 Identities = 90/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNRSL +LR ++ +IK W + L EFA+N++L+ +T Sbjct: 1409 TKLLFSTTCHPQTDGQTEVVNRSLSTMLRAVLKNNIKLWEECLPHIEFAYNRSLHSTTKM 1468 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPFE+VYG LPR P+DL+ +P+S V+ A E E I ++H+ T +++ N +YKLA D Sbjct: 1469 CPFEIVYGFLPRAPIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGD 1528 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33 R+H+ F GDLVW L KDRFP +KL ++L+KIN NAYRLELP Sbjct: 1529 KGRKHVVFAPGDLVWLHLRKDRFPDLRKSKLMPRAGGPFKVLEKINDNAYRLELPADFGV 1588 Query: 32 ADVFNVKHL 6 + FN+ L Sbjct: 1589 SPTFNIADL 1597 >ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508700944|gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 647 Score = 177 bits (449), Expect = 3e-42 Identities = 92/189 (48%), Positives = 123/189 (65%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTDGQTEVVNRSLGN+LRCL+ + K W V+ QAEFA+N ++NRS Sbjct: 442 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 501 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFEV YGL P+ LDLV LP + V + HI+++H+ A LK +N +Y A+ Sbjct: 502 TPFEVAYGLKPQHVLDLVPLPQEARVSNEGELFAYHIRKIHEEVKAALKASNAEYSFTAN 561 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ +++KKI+SNAY +ELPP ++ Sbjct: 562 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVIKKISSNAYLIELPPELQI 621 Query: 32 ADVFNVKHL 6 + +FNV L Sbjct: 622 SPIFNVLDL 630 >ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao] gi|508727408|gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao] Length = 1306 Score = 176 bits (447), Expect = 5e-42 Identities = 92/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTD QTEVVNRSLGN+LRCL+ + K W V QAEFA+N ++NRS Sbjct: 1070 TELKYSSTCHPQTDSQTEVVNRSLGNILRCLIQNNPKTWDLVKPQAEFAYNNSVNRSIKK 1129 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE YGL P+ LDLV LP + V + +HIQ++H+ A LK +N +Y A+ Sbjct: 1130 TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADHIQKIHEEVKAALKASNAEYSFTAN 1189 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 1190 QHRRKQEFEEGDQVLVYLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1249 Query: 32 ADVFNVKHL 6 + +FNV L Sbjct: 1250 SHIFNVLDL 1258 >gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare] Length = 1717 Score = 176 bits (447), Expect = 5e-42 Identities = 89/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNRSL +LR ++ ++K W + L EFA+N++L+ +T Sbjct: 1406 TKLLFSTTCHPQTDGQTEVVNRSLSTMLRAVLKTNLKLWEECLPHIEFAYNRSLHSTTKM 1465 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPFE+VYG LPR P+DL+ +P+S V+ A E E I ++H+ T +++ N +YKLA D Sbjct: 1466 CPFEIVYGFLPRAPIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGD 1525 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33 R+H+ F GDLVW L KDRFP +KL ++L+KIN NAYRLELP Sbjct: 1526 KGRKHVVFAPGDLVWLHLRKDRFPDLRKSKLMPRAGGPFKVLEKINDNAYRLELPXDFGV 1585 Query: 32 ADVFNVKHL 6 + FN+ L Sbjct: 1586 SPTFNIADL 1594 >ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobroma cacao] gi|508724802|gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao] Length = 1392 Score = 176 bits (445), Expect = 8e-42 Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTDGQTEVVNRSLGN+LRCL+ + K W V+ QAEFA+N ++NRS Sbjct: 1114 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 1173 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE YGL P+ LDLV LP V + +HI+++H+ LK +N +Y A+ Sbjct: 1174 TPFEAAYGLKPQHVLDLVPLPQEPRVSNEGELFADHIRKIHEEVKTALKASNAQYSFTAN 1233 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 1234 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1293 Query: 32 ADVFNVKHL 6 + +FNV L Sbjct: 1294 SPIFNVLDL 1302 >ref|XP_008779530.1| PREDICTED: uncharacterized protein LOC103699270, partial [Phoenix dactylifera] Length = 1140 Score = 174 bits (441), Expect = 2e-41 Identities = 87/189 (46%), Positives = 124/189 (65%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +S+AYHPQTDGQTEVVNRSLGNLLRCLVG+H W +LS AEFA+N ++NR++G Sbjct: 816 TKLKYSTAYHPQTDGQTEVVNRSLGNLLRCLVGDHPGNWDLLLSTAEFAYNSSVNRTSGL 875 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE+V G +PR P+DL+ + ++ + +A +H+Q +H+ + ++ N +YK+A D Sbjct: 876 SPFEIVLGYVPRKPVDLIPVAPNNRISETAESFAQHMQNLHKEINKKIEINNARYKMAVD 935 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR+ EF VGD V + +RFP KL A +ILK++ SNAY +++P Sbjct: 936 LRRRYQEFRVGDDVMIRIRPERFPPGTVRKLHARSMGPYKILKRVGSNAYVVDIPSDFGI 995 Query: 32 ADVFNVKHL 6 VFNV+ L Sbjct: 996 NPVFNVEDL 1004 >ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508703673|gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 173 bits (439), Expect = 4e-41 Identities = 90/189 (47%), Positives = 121/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTDGQTEVVNRSLGN+LRCL+ + K W V+ QAEFA+N ++NRS Sbjct: 1174 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 1233 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE YGL P+ LDLV LP + V + + I+++H+ A LK +N +Y A+ Sbjct: 1234 TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADQIRKIHEEVKAALKASNAEYSFTAN 1293 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 1294 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1353 Query: 32 ADVFNVKHL 6 +FN+ L Sbjct: 1354 NPIFNILDL 1362 >gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum subsp. aegilopoides] Length = 1704 Score = 173 bits (438), Expect = 5e-41 Identities = 87/185 (47%), Positives = 119/185 (64%), Gaps = 5/185 (2%) Frame = -2 Query: 545 FSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGYCPFE 366 FS+ HPQTDGQTEVVNR+L +LR ++ + K W + L EFA+N++L+ +T CPFE Sbjct: 1398 FSTTCHPQTDGQTEVVNRTLSTMLRAVLKNNKKMWEECLPHIEFAYNRSLHSTTKMCPFE 1457 Query: 365 VVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAADGSRR 186 +VYG LPR P+DL+ LP+S V+ A E E I ++H+ T +++ N KYKLA D R+ Sbjct: 1458 IVYGFLPRAPIDLLPLPSSEKVNFDAKERSELILKIHELTKENIERMNAKYKLARDKGRK 1517 Query: 185 HLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKTADVF 21 H+ F GDLVW L KDRFP +KL ++L+KIN NAY+LELP + F Sbjct: 1518 HVVFAPGDLVWLHLRKDRFPNLRKSKLMPRADGPFKVLEKINDNAYKLELPADFGVSPTF 1577 Query: 20 NVKHL 6 N+ L Sbjct: 1578 NIADL 1582 >ref|XP_007023480.1| Uncharacterized protein TCM_027505 [Theobroma cacao] gi|508778846|gb|EOY26102.1| Uncharacterized protein TCM_027505 [Theobroma cacao] Length = 292 Score = 173 bits (438), Expect = 5e-41 Identities = 90/189 (47%), Positives = 120/189 (63%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L + S HPQTDGQTEVVNRSLGN+LRCL+ + K W V+ QAEFA+N +NRS Sbjct: 14 TELKYFSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNFVNRSIKK 73 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE YGL P+ LDLV LP + V + +HI+++H+ A LK +N +Y A+ Sbjct: 74 TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEEVKAALKASNAEYSFTAN 133 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EFE GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 134 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 193 Query: 32 ADVFNVKHL 6 +FN+ L Sbjct: 194 NPIFNILDL 202 >gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group] gi|31431012|gb|AAP52850.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 2447 Score = 172 bits (437), Expect = 7e-41 Identities = 86/189 (45%), Positives = 123/189 (65%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNR+L +LR ++ ++IK W + L EFA+N++L+ +T Sbjct: 1350 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAYNRSLHSTTKM 1409 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPF++VYGLLPR P+DL+ LP+S ++ A + E + ++H++T +++ N KYK A D Sbjct: 1410 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDAKQRAELMLKLHETTKENIERMNAKYKFAGD 1469 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33 RR L FE GDLVW L K+RFP +KL ++L KIN NAY+++LP Sbjct: 1470 KGRRELTFEPGDLVWLHLRKERFPDLRKSKLMPRADGPFKVLAKINENAYKIDLPADFGV 1529 Query: 32 ADVFNVKHL 6 + FNV L Sbjct: 1530 SPTFNVADL 1538 Score = 95.1 bits (235), Expect = 2e-17 Identities = 69/192 (35%), Positives = 98/192 (51%), Gaps = 8/192 (4%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L+FS+AYHPQTDGQTE +N+ L ++L V + K W + L AEF++N + S Sbjct: 2172 TRLNFSTAYHPQTDGQTERLNQILEDMLHACVLDFGKTWDKSLPYAEFSYNNSYQASIQM 2231 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 P+E +YG R PL L S V + ++L + ++ +LK A + K AD Sbjct: 2232 APYEALYGRKCRTPL-LWDQVGESQVFGT--DILREAEAKVRTIWDNLKVAQSRQKSYAD 2288 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKAV-------RILKKINSNAYRLELPPGM 39 RR+LEF V D V+ +T R K K RI+ + AY+LELP + Sbjct: 2289 NRRRNLEFAVDDFVYLRVTPLRGVHRFQTKGKLAPRFVGPFRIIARRGEVAYQLELPASL 2348 Query: 38 -KTADVFNVKHL 6 DVF+V L Sbjct: 2349 GNVHDVFHVSQL 2360 >dbj|BAA89466.1| gag-pol polyprotein, partial [Oryza sativa Indica Group] Length = 1587 Score = 172 bits (436), Expect = 9e-41 Identities = 87/189 (46%), Positives = 122/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNR+L +LR ++ ++IK W + L EFA+N++ + +T Sbjct: 1378 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKK 1437 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPFE+VYGLLPR P+DL+ LP S V+ A E + ++H++T +++ N KYKLA Sbjct: 1438 CPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYRAELMLKLHETTKENIERMNIKYKLAGS 1497 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33 ++H+ FE GDLVW L KDRFP +KL ++L+KIN NAY+LELP Sbjct: 1498 KGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLPRADGPFKVLQKINDNAYKLELPADFGV 1557 Query: 32 ADVFNVKHL 6 + FN+ L Sbjct: 1558 SPTFNIADL 1566 >gb|AAM94350.1| gag-pol polyprotein [Zea mays] Length = 1618 Score = 171 bits (434), Expect = 2e-40 Identities = 84/189 (44%), Positives = 123/189 (65%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNR+L +LR ++ ++IK W L EFA+N++L+ +T Sbjct: 1353 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFAYNRSLHSTTKM 1412 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPF++VYGLLPR P+DL+ LP+S ++ A E + ++H++T +++ N +YK A+D Sbjct: 1413 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDATRRAELMLKLHETTKENIERMNARYKFASD 1472 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33 R+ + FE GDLVW L K+RFP +KL ++L+KIN NAYRL+LP Sbjct: 1473 KGRKEINFEPGDLVWLHLRKERFPELRKSKLLPRADGPFKVLEKINDNAYRLDLPADFGV 1532 Query: 32 ADVFNVKHL 6 + FN+ L Sbjct: 1533 SPTFNIADL 1541 >ref|XP_007037024.1| Uncharacterized protein TCM_013224 [Theobroma cacao] gi|508774269|gb|EOY21525.1| Uncharacterized protein TCM_013224 [Theobroma cacao] Length = 412 Score = 171 bits (433), Expect = 2e-40 Identities = 89/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L +SS HPQTDGQT+VVNRSLGN+LR L+ + K W V+ QAEFA+N ++NRS Sbjct: 134 TELKYSSTCHPQTDGQTKVVNRSLGNMLRYLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 193 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 PFE YGL P+ LDLV LP + V + +HI+++H+ A LK +N +Y A+ Sbjct: 194 TPFEAAYGLKPQHVLDLVPLPQEARVSNKGELFADHIRKIHEEVKAALKASNAEYSFTAN 253 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33 RR EF+ GD V L ++RFP Y+KLK+ ++LKKI+SNAY +ELPP ++ Sbjct: 254 QHRRKQEFDEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 313 Query: 32 ADVFNVKHL 6 + +FNV L Sbjct: 314 SPIFNVLDL 322 >gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group] gi|15217296|gb|AAK92640.1|AC079634_1 Putative retroelement [Oryza sativa Japonica Group] gi|31431373|gb|AAP53161.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1708 Score = 170 bits (431), Expect = 3e-40 Identities = 86/189 (45%), Positives = 121/189 (64%), Gaps = 5/189 (2%) Frame = -2 Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378 T L FS+ HPQTDGQTEVVNR+L +LR ++ ++IK W + L EFA+N++ + +T Sbjct: 1378 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKK 1437 Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198 CPFE+VYGLLPR P+DL+ LP S V+ A E + ++H++T +++ N KYKLA Sbjct: 1438 CPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYHAELMLKLHETTKENIERMNIKYKLAGS 1497 Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33 ++H+ FE GDLVW L KDRFP +KL ++L+KIN N Y+LELP Sbjct: 1498 KGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLPRADGPFKVLQKINDNTYKLELPADFGV 1557 Query: 32 ADVFNVKHL 6 + FN+ L Sbjct: 1558 SPTFNIADL 1566