BLASTX nr result
ID: Atropa21_contig00007252
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atropa21_contig00007252 (1076 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD99221.1| polypepetide with reverse transcriptase and RNas... 256 2e-92 ref|XP_004231415.1| PREDICTED: uncharacterized protein LOC101266... 214 3e-69 gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi... 181 5e-57 ref|XP_006365584.1| PREDICTED: uncharacterized protein LOC102598... 220 7e-55 emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera] 172 2e-54 emb|CAN68148.1| hypothetical protein VITISV_035665 [Vitis vinifera] 177 7e-54 emb|CAN61658.1| hypothetical protein VITISV_040310 [Vitis vinifera] 163 2e-53 ref|XP_006584292.1| PREDICTED: uncharacterized protein LOC102668... 163 2e-52 gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi... 170 5e-52 gb|AAM98191.1| unknown protein [Arabidopsis thaliana] gi|3860380... 170 5e-52 emb|CAN82171.1| hypothetical protein VITISV_040546 [Vitis vinifera] 157 6e-52 emb|CAN61207.1| hypothetical protein VITISV_015446 [Vitis vinifera] 162 1e-51 dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t... 168 1e-51 gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha... 174 2e-51 dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis t... 174 2e-51 emb|CAA18107.1| LTR retrotransposon like protein [Arabidopsis th... 173 3e-51 gb|ABI34329.1| Integrase core domain containing protein [Solanum... 161 1e-50 gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi... 161 4e-50 gb|EOY10155.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) ... 156 5e-50 emb|CAN72018.1| hypothetical protein VITISV_001841 [Vitis vinifera] 155 7e-50 >dbj|BAD99221.1| polypepetide with reverse transcriptase and RNaseH domains [Petunia x hybrida] Length = 389 Score = 256 bits (654), Expect(2) = 2e-92 Identities = 137/275 (49%), Positives = 181/275 (65%), Gaps = 39/275 (14%) Frame = +2 Query: 23 MRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYSLFHKSSG 202 MR PG+T P+ SH CR+RKSLYG KQAS Q Y RLSS+LGT GF SS+NDYSLF+K S Sbjct: 1 MRFPPGVTPPTPSHVCRLRKSLYGLKQASRQWYARLSSALGTRGFSSSMNDYSLFYKGSA 60 Query: 203 SFVTILAVYVDDILLT--------------------------------------DGLIVT 268 +TIL VYVDDIL+T DGLIVT Sbjct: 61 GLITILVVYVDDILITGNNHTEISALKSFLDSEFRIKDLGEVSYFLGMEILHEPDGLIVT 120 Query: 269 Q*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPTL-RHLLG*PNFLTQTRPN 445 Q KFA+ LL E+ R ++P+DP+LKL+ SGELL +PT R L+G N+LT TRP+ Sbjct: 121 QRKFAMDLLSEYSSLSPRTVTTPLDPSLKLSSISGELLQDPTFYRRLVGKLNYLTHTRPD 180 Query: 446 LSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCDADWASCS 625 +S+ V++LS +MQ+P SGH +AA + LRY++ +P LG+F++S S + G+CDADWASC Sbjct: 181 ISFAVQTLSQHMQSPRSGHLEAAYHTLRYIRNNPGLGIFLSSEQSFQLTGFCDADWASCP 240 Query: 626 DTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 D+R+S+SG+++ +GG P+S K K Q VVSLS AE+ Sbjct: 241 DSRRSISGFFLTMGGCPLSWKSKKQQVVSLSSAEA 275 Score = 111 bits (278), Expect(2) = 2e-92 Identities = 60/114 (52%), Positives = 74/114 (64%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y S+R LVA+I+WI+ L + F SL VS+HCDNQ AIHIAK+PVFHE TKHIE D Sbjct: 277 YRSLRRLVAEIAWIV-RLLHDLSVDF-SLPVSVHCDNQSAIHIAKNPVFHERTKHIELDC 334 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQSPDTNLRG 1071 HFV E LLDG Q+ D+FTK+L G HR +GKL ++ + LRG Sbjct: 335 HFVREKLLDGLISLSFVPSSSQIADIFTKSLTGPLHRSLMGKLGVRKSGSTLRG 388 >ref|XP_004231415.1| PREDICTED: uncharacterized protein LOC101266877 [Solanum lycopersicum] Length = 1537 Score = 214 bits (546), Expect(2) = 3e-69 Identities = 119/282 (42%), Positives = 172/282 (60%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL EEV+M+L PGL P+ C+++KSLYG KQAS Q Y++L+ +L + G+ S D+S Sbjct: 1146 DLHEEVYMKLPPGLEVPNSELVCKLKKSLYGLKQASRQWYSKLTEALSSRGYSHSQFDHS 1205 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 LF++ GS +AVYVDD++LT Sbjct: 1206 LFYRKEGSLAVFVAVYVDDVILTGTDTTEIAQLKVYLDNTFKIKDLGRLHYFLGLEILDT 1265 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPTL-RHLLG*PNF 424 DG++++Q KF L LL E+ P SSP+DPT+KL + G LP+PT R L+G NF Sbjct: 1266 TDGVLISQRKFTLDLLKEYDCFNYTPLSSPLDPTVKLKAKEGVPLPDPTFYRKLIGKLNF 1325 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TR +++++V+ LS +MQ+P + H QAA + LRYL++D +LG+ ++ + + +CD Sbjct: 1326 LTNTRLDIAFSVQHLSQFMQDPRAPHLQAAYHLLRYLKQDTTLGVQLSKNPDCTVQAFCD 1385 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DWASCSD+R+SVSGY + LG SP+S K K Q VSLS AE+ Sbjct: 1386 SDWASCSDSRKSVSGYLVLLGNSPISWKSKKQETVSLSSAEA 1427 Score = 75.5 bits (184), Expect(2) = 3e-69 Identities = 44/107 (41%), Positives = 61/107 (57%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y S+R +V +++W+ + S +S +S++CD+Q AIHIA++PVFHE TKHIE D Sbjct: 1429 YRSLRKVVGELTWL-HRLITELTVSISSP-ISVYCDSQSAIHIARNPVFHERTKHIEVDC 1486 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQS 1050 HFV L + QL D+ TKAL G H L KL +S Sbjct: 1487 HFVRNKLQEELISLHHVSTSNQLADILTKALTGIKHSAILRKLAAKS 1533 >gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1501 Score = 181 bits (460), Expect(2) = 5e-57 Identities = 106/282 (37%), Positives = 150/282 (53%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M+L PG H CR+RKSLYG KQA + +LS SL GF+ S DYS Sbjct: 1109 DLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDSLLRFGFVQSYEDYS 1168 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 LF + + + +YVDD+L+ Sbjct: 1169 LFSYTRNNIELRVLIYVDDLLICGNDGYMLQKFKDYLSRCFSMKDLGKLKYFLGIEVSRG 1228 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 +G+ ++Q K+AL ++ + + SRPA +P++ L + G LL +P R L+G + Sbjct: 1229 PEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNHHLASDDGPLLSDPKPYRRLVGRLLY 1288 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 L TRP LSY+V L+ +MQNP HF AAL +RYL+ P G+ +N+ + YCD Sbjct: 1289 LLHTRPELSYSVHVLAQFMQNPREAHFDAALRVVRYLKGSPGQGILLNADPDLTLEVYCD 1348 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DW SC TR+S+S Y + LGGSP+S K K Q VS S AE+ Sbjct: 1349 SDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTVSHSSAEA 1390 Score = 67.8 bits (164), Expect(2) = 5e-57 Identities = 41/110 (37%), Positives = 57/110 (51%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + +I W+ L+ S L+CD++ AIHIA +PVFHE TKHIE D Sbjct: 1392 YRAMSYALKEIKWL-RKLLKELGIE-QSTPARLYCDSKAAIHIAANPVFHERTKHIESDC 1449 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQSPDT 1059 H V + + DG QL D+FTKAL + + KL +Q+ T Sbjct: 1450 HSVRDAVRDGIITTQHVRTTEQLADVFTKALGRNQFLYLMSKLGVQNLHT 1499 >ref|XP_006365584.1| PREDICTED: uncharacterized protein LOC102598606 [Solanum tuberosum] Length = 1453 Score = 220 bits (561), Expect = 7e-55 Identities = 124/282 (43%), Positives = 168/282 (59%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M + PGL P C++ KSLYG KQAS Q Y +L+++L + G+ SL+DYS Sbjct: 812 DLNEEVYMDVPPGLDVPHTGLVCKLNKSLYGLKQASRQWYEKLTAALNSRGYTHSLHDYS 871 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 L + G L VYVDDIL+T Sbjct: 872 LLFRKKGHSTVFLGVYVDDILITGTNTEEITSLKNFLNDQFKIKDLGKLHYFLGLEILYK 931 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPTL-RHLLG*PNF 424 DG++++Q KF LL E + P +SP+DPT+KL GE LP+PTL R L+G NF Sbjct: 932 NDGILISQRKFVQDLLKEFHVNGLTPVTSPLDPTVKLKAHEGEPLPDPTLYRKLVGKLNF 991 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TR +++Y V+ LS YMQ+P H +AA + LRYLQ+DP+LG+FM++S I +CD Sbjct: 992 LTHTRLDITYGVQHLSQYMQDPREPHLKAAFHMLRYLQKDPTLGIFMSASPDFGIQAFCD 1051 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DWASC D+R+SVSGY + LG SP+S K K Q +SLS AE+ Sbjct: 1052 SDWASCPDSRRSVSGYIVLLGTSPISWKSKKQDTISLSSAEA 1093 Score = 159 bits (401), Expect(2) = 8e-52 Identities = 83/161 (51%), Positives = 113/161 (70%), Gaps = 1/161 (0%) Frame = +2 Query: 251 DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPTL-RHLLG*PNFL 427 DG++++Q KF LL E + P +SP+DPT+KL GE LP+PTL R L+G NFL Sbjct: 1183 DGILISQRKFVQDLLKEFHVNGLTPVTSPLDPTVKLKAHEGEPLPDPTLYRKLVGKLNFL 1242 Query: 428 TQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCDA 607 T TR +++Y V+ LS YMQ+P H +AA + LRYLQ+DP+LG+FM++S I +CD+ Sbjct: 1243 THTRLDITYGVQHLSQYMQDPREPHLKAAFHMLRYLQKDPTLGIFMSASPDFGIQAFCDS 1302 Query: 608 DWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 DWASC D+R+SVSGY + LG SP+S K K Q +SLS AE+ Sbjct: 1303 DWASCPDSRRSVSGYIVLLGTSPISWKSKKQDTISLSSAEA 1343 Score = 73.2 bits (178), Expect(2) = 8e-52 Identities = 44/111 (39%), Positives = 57/111 (51%), Gaps = 6/111 (5%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVS------LHCDNQYAIHIAKHPVFHEHTK 891 Y S+R +V ++ W+ + F L V ++CD+Q A+HIAK PVFHE TK Sbjct: 1345 YRSLRKVVGELVWL--------HRLFTELTVPPIGPSPVYCDSQAALHIAKKPVFHERTK 1396 Query: 892 HIEFDYHFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDI 1044 HIE D HFV L +G QL D+ TKAL G H L KL + Sbjct: 1397 HIEVDCHFVRSKLQEGLISLYHINTCDQLADILTKALTGIKHTAMLNKLAV 1447 >emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera] Length = 1316 Score = 172 bits (435), Expect(2) = 2e-54 Identities = 103/281 (36%), Positives = 154/281 (54%), Gaps = 39/281 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M+L G + C+++KSLYG KQAS Q + +L+++L +GF SL DYS Sbjct: 924 DLEEEVYMKLPEGFKATGKNKVCKLQKSLYGLKQASRQWFAKLTTALKEYGFQQSLADYS 983 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 LF G+ V L VYVDD++L Sbjct: 984 LFTYRRGNIVMNLLVYVDDLILAGNDNKVCEAFKNFLDRKFGIKNLGQLKYILGIEVARG 1043 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPTL-RHLLG*PNF 424 DGL ++Q K+AL+++ E +RP PM+ KL + +G LL +P + R L+G + Sbjct: 1044 KDGLFLSQRKYALNIIKECGLLGARPVEFPMEENHKLALANGRLLNDPGMYRRLVGRLIY 1103 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TRP+L+Y V LS +MQ+P H AA +RYL++ P G+ + + + ++ Y D Sbjct: 1104 LTVTRPDLTYXVHVLSQFMQSPREEHLDAAYRVVRYLKKGPGQGIVLKADNDLQLYCYSD 1163 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAE 727 +DW SC TR+S+SG ++LG SP+S + K Q +S S AE Sbjct: 1164 SDWXSCPLTRRSISGCCVKLGTSPISWRCKKQGTISRSSAE 1204 Score = 68.6 bits (166), Expect(2) = 2e-54 Identities = 40/100 (40%), Positives = 55/100 (55%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y SM ++++W+ SL + + + L+CDN+ A+HIA +PVFHE TKHIE D Sbjct: 1207 YXSMAMAASELTWL--KSLLASLGVLHDKPMKLYCDNKAALHIAANPVFHERTKHIEIDC 1264 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFL 1029 HFV E + G Q D+FTKAL +QFL Sbjct: 1265 HFVREKVQSGEIVTTYLPSKLQXADMFTKALG---RQQFL 1301 >emb|CAN68148.1| hypothetical protein VITISV_035665 [Vitis vinifera] Length = 1813 Score = 177 bits (449), Expect(2) = 7e-54 Identities = 107/282 (37%), Positives = 156/282 (55%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 +L EEV+M PGL ++ CR+RKS+YG KQAS ++ ++++ + G+I S DYS Sbjct: 1337 NLQEEVYMTPPPGLRRQGENLVCRLRKSIYGLKQASRNWFSTFTATVKSAGYIQSKADYS 1396 Query: 182 LFHKSSGSFVTILAVYVDDILLTD------------------------------------ 253 LF KS G+ T + +YVDDILLT Sbjct: 1397 LFTKSQGNKFTAILIYVDDILLTGNDLHEIKMLKTHLLKRFFIKDLGELKYFLGIEFSRS 1456 Query: 254 --GLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 G+ ++Q K+ L +L + +P PM+ LKLT E GELL +P+ R L+G + Sbjct: 1457 KKGIFMSQRKYTLDILQDTGLTGVKPEKFPMEQNLKLTNEDGELLHDPSRYRRLVGRLIY 1516 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TRP++ Y+VR+LS +M P H++AAL LRY++ P GLF+ S ++ + +CD Sbjct: 1517 LTVTRPDIVYSVRTLSQFMNTPRKPHWEAALRVLRYIKGSPGQGLFLPSENNLTLSAFCD 1576 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DW C +R+SVSGY + LG S +S K K Q VS S AE+ Sbjct: 1577 SDWGGCRMSRRSVSGYCVFLGSSLISWKSKKQTNVSRSSAEA 1618 Score = 61.6 bits (148), Expect(2) = 7e-54 Identities = 32/59 (54%), Positives = 36/59 (61%) Frame = +1 Query: 826 LHCDNQYAIHIAKHPVFHEHTKHIEFDYHFVFEILLDGFXXXXXXXXXXQLDDLFTKAL 1002 L CDNQ A++IA +PVFHE TKHIE D H V E L G QL D+FTKAL Sbjct: 1650 LFCDNQAALYIAANPVFHERTKHIEIDCHIVREKLQAGVIRPCYVSTKMQLADVFTKAL 1708 >emb|CAN61658.1| hypothetical protein VITISV_040310 [Vitis vinifera] Length = 1461 Score = 163 bits (412), Expect(2) = 2e-53 Identities = 98/282 (34%), Positives = 153/282 (54%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M+L PG + CR+RKSLYG KQA + +L ++L +GF+ S +DYS Sbjct: 1069 DLEEEVYMKLPPGFERSDPNLVCRLRKSLYGLKQAPRCWFAKLVTALKGYGFLQSYSDYS 1128 Query: 182 LFHKSSGSFVTILAVYVDDILLTD------------------------------------ 253 LF + G+ + VYVDD++++ Sbjct: 1129 LFTYTKGNVQINVLVYVDDLIISGNDSAALKTFKAYLSDCFKMKDLGVLKYFLGIEVARS 1188 Query: 254 --GLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPN-PTLRHLLG*PNF 424 GL + Q K+ L ++ E ++P P++ +L + +GELL N + R L+G + Sbjct: 1189 SAGLFLCQRKYTLDIVSEAGLLGAKPCGFPIEQNHRLGLANGELLSNLESYRRLVGRLIY 1248 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 L TRP+L+Y+V LS +MQ P H++AAL + YL+ P G+ + + S + G+CD Sbjct: 1249 LAVTRPDLAYSVHILSQFMQEPRIEHWEAALRVVHYLKGTPGQGILLRADSDLSLQGWCD 1308 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DWA+C T++S+SG+ + LG SP+S K K Q VS S AE+ Sbjct: 1309 SDWAACPVTKRSLSGWLVFLGQSPISWKTKKQHTVSRSSAEA 1350 Score = 74.3 bits (181), Expect(2) = 2e-53 Identities = 41/108 (37%), Positives = 55/108 (50%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + ++ W+ L + + L CD+Q A+H+AK+PVFHE TKHIE D Sbjct: 1352 YRAMAAVTCELKWL--KGLLLSLGMHHPKAIKLFCDSQSALHMAKNPVFHERTKHIEVDC 1409 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQSP 1053 HFV + + DG QL D+FTKAL L KL I P Sbjct: 1410 HFVRDAITDGLIAPSYVPTVTQLADIFTKALGKKQFDYLLAKLGIFEP 1457 >ref|XP_006584292.1| PREDICTED: uncharacterized protein LOC102668041 [Glycine max] Length = 1176 Score = 163 bits (413), Expect(2) = 2e-52 Identities = 103/285 (36%), Positives = 148/285 (51%), Gaps = 42/285 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSD-SHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDY 178 DL+E+++M PG + C++R+SLYG KQ+ + + S + G S D+ Sbjct: 782 DLEEDIYMEQPPGFVAQGEYGLVCKLRRSLYGLKQSPRAWFGKFSHVVQMFGLKRSEADH 841 Query: 179 SLF--HKSSGSFVTILAVYVDDILLT---------------------------------- 250 S+F H S G V ++ VYVDDI++T Sbjct: 842 SVFYYHTSPGKCVYLM-VYVDDIVITGNDTTKIVQLKEHLFSHFQTKDLGSLKYFLGIEV 900 Query: 251 ----DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG* 415 DG++++Q K+AL +L E RP SPMDP LKL + E P+P R L+G Sbjct: 901 AQSGDGIVISQKKYALDILEETGMQNCRPVESPMDPNLKLMADQSEAYPDPERYRRLVGK 960 Query: 416 PNFLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFG 595 +LT TRP++S+ V +S +MQNP H+ A + LRY++R P GL S + G Sbjct: 961 LIYLTITRPDISFAVGVISQFMQNPHLDHWNAVMRILRYVKRAPGQGLLYEDKGSTQLSG 1020 Query: 596 YCDADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 YCDADWA C R+S SGYY+ +GG+ +S K K Q VV+ S AE+ Sbjct: 1021 YCDADWAGCPMDRRSTSGYYVFIGGNLISWKSKKQTVVAWSSAEA 1065 Score = 70.9 bits (172), Expect(2) = 2e-52 Identities = 40/93 (43%), Positives = 50/93 (53%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y SM + ++ WI LQ F L + L+CDNQ A+HIA +PVFHE TKHIE D Sbjct: 1067 YRSMAMVTCELMWI-KQFLQELRFC-EELQMKLYCDNQAALHIASNPVFHERTKHIEIDC 1124 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAG 1008 HF+ E LL Q D+ TK+L G Sbjct: 1125 HFIREKLLSKEIVTEFIGSNDQPADILTKSLRG 1157 >gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1496 Score = 170 bits (431), Expect(2) = 5e-52 Identities = 107/282 (37%), Positives = 151/282 (53%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+MRL PG S CR+RKSLYG KQA +++LS++L GF S DYS Sbjct: 1104 DLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSYEDYS 1163 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 LF +G + + VYVDD+++ Sbjct: 1164 LFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLEVSRG 1223 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 DG ++Q K+AL ++ E +P++ P+ KL +G + NP R L+G + Sbjct: 1224 PDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVGRFIY 1283 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TRP+LSY V LS +MQ P H++AAL +RYL+ P+ G+F+ S SS I YCD Sbjct: 1284 LTITRPDLSYAVHILSQFMQAPLVAHWEAALRLVRYLKGSPAQGIFLRSDSSLIINAYCD 1343 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +D+ +C TR+S+S Y + LG SP+S K K Q VS S AE+ Sbjct: 1344 SDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEA 1385 Score = 62.4 bits (150), Expect(2) = 5e-52 Identities = 35/91 (38%), Positives = 50/91 (54%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + ++ W+ +L +S + LHCD++ AIHIA +PVFHE TKHIE D Sbjct: 1387 YRAMAYTLKELKWL--KALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 1444 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKAL 1002 H V + +LD Q+ DL TK+L Sbjct: 1445 HKVRDAVLDKLITTEHIYTEDQVADLLTKSL 1475 >gb|AAM98191.1| unknown protein [Arabidopsis thaliana] gi|38603804|gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|110742535|dbj|BAE99183.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 776 Score = 170 bits (431), Expect(2) = 5e-52 Identities = 107/282 (37%), Positives = 151/282 (53%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+MRL PG S CR+RKSLYG KQA +++LS++L GF S DYS Sbjct: 384 DLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSYEDYS 443 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 LF +G + + VYVDD+++ Sbjct: 444 LFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLEVSRG 503 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 DG ++Q K+AL ++ E +P++ P+ KL +G + NP R L+G + Sbjct: 504 PDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVGRFIY 563 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 LT TRP+LSY V LS +MQ P H++AAL +RYL+ P+ G+F+ S SS I YCD Sbjct: 564 LTITRPDLSYAVHILSQFMQAPLVAHWEAALRLVRYLKGSPAQGIFLRSDSSLIINAYCD 623 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +D+ +C TR+S+S Y + LG SP+S K K Q VS S AE+ Sbjct: 624 SDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEA 665 Score = 62.4 bits (150), Expect(2) = 5e-52 Identities = 35/91 (38%), Positives = 50/91 (54%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + ++ W+ +L +S + LHCD++ AIHIA +PVFHE TKHIE D Sbjct: 667 YRAMAYTLKELKWL--KALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 724 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKAL 1002 H V + +LD Q+ DL TK+L Sbjct: 725 HKVRDAVLDKLITTEHIYTEDQVADLLTKSL 755 >emb|CAN82171.1| hypothetical protein VITISV_040546 [Vitis vinifera] Length = 1129 Score = 157 bits (398), Expect(2) = 6e-52 Identities = 97/282 (34%), Positives = 150/282 (53%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M+L PG + CR+ KSLYG KQA + +L ++L +GF+ S +DYS Sbjct: 737 DLEEEVYMKLPPGFESSDPNLVCRLWKSLYGLKQAPRCWFAKLVTALKGYGFLQSYSDYS 796 Query: 182 LFHKSSGSFVTILAVYVDDILLTD------------------------------------ 253 LF + G+ + VYVDD++++ Sbjct: 797 LFTYTKGNVQINVLVYVDDLIISGNDSAALKTFKAYLSDCFKMKDLGVLKYFLGIEVARS 856 Query: 254 --GLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNP-TLRHLLG*PNF 424 GL + Q K+ L ++ E ++P ++ +L + +GELL NP + R L+G + Sbjct: 857 SAGLFLCQRKYTLDIVSEAGLLGAKPCGFSIEQNHRLGLANGELLSNPESYRRLVGRLIY 916 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 L TRPNL+Y+V LS +MQ P H++ AL + YL+ P G+ + + S + G+CD Sbjct: 917 LAVTRPNLAYSVHILSQFMQEPRIEHWEVALRVVHYLKGTPGQGILLRADSDLSLQGWCD 976 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DWA+C TR+S+SG+ + LG S +S K K Q VS S AE+ Sbjct: 977 SDWAACPVTRRSLSGWLVFLGQSHISWKTKKQHTVSRSSAEA 1018 Score = 74.7 bits (182), Expect(2) = 6e-52 Identities = 41/109 (37%), Positives = 55/109 (50%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + ++ W+ L + + L CD+Q A+H+ K+PVFHE TKHIE D Sbjct: 1020 YRAMAAVTCELKWL--KGLLLSLGVHHPKAIKLFCDSQSALHMTKNPVFHERTKHIEVDC 1077 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQSPD 1056 HFV + + DG QL D+FTKAL L KL I PD Sbjct: 1078 HFVRDAITDGLIAPSYVPTVTQLADIFTKALGKKQFDYLLAKLGIFEPD 1126 >emb|CAN61207.1| hypothetical protein VITISV_015446 [Vitis vinifera] Length = 900 Score = 162 bits (409), Expect(2) = 1e-51 Identities = 100/269 (37%), Positives = 147/269 (54%), Gaps = 26/269 (9%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSH-ACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDY 178 DL EEV+M PG +S CR+R+SLYG KQ+ ++R SS + G + S D+ Sbjct: 522 DLAEEVYMEQPPGFVAQGESGLVCRLRRSLYGLKQSPRAWFSRFSSVVQEFGMLRSTADH 581 Query: 179 SLF--HKSSGSFVTILAVYVDDILLT----------------------DGLIVTQ*KFAL 286 S+F H S G + L VYVDDI++T G++++Q K+AL Sbjct: 582 SVFYHHNSLGQCI-YLVVYVDDIVITGSDQDDLGKLKYFLGIEIAQSSSGVVLSQRKYAL 640 Query: 287 HLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNP-TLRHLLG*PNFLTQTRPNLSYTVR 463 +L E +P +PMDP +KL GE L +P R L+G N+LT TRP++S+ V Sbjct: 641 DILEETGMLDCKPVDTPMDPNVKLVPGQGEPLGDPGRYRRLVGKLNYLTITRPDISFPVS 700 Query: 464 SLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCDADWASCSDTRQSV 643 +S ++Q+PC H+ A + LRY++ P G+ + + GY DADWA R+S Sbjct: 701 VVSQFLQSPCDSHWDAVIRILRYIKSTPGQGVLYENRGHTQVVGYTDADWAGSPTDRRST 760 Query: 644 SGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 SGY + +GG+ +S K K Q VV+ S AE+ Sbjct: 761 SGYCVFIGGNLISWKSKKQDVVARSSAEA 789 Score = 69.7 bits (169), Expect(2) = 1e-51 Identities = 42/111 (37%), Positives = 57/111 (51%), Gaps = 3/111 (2%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M ++ W+ LQ F + + L CDNQ A+HIA +PVFHE TKHIE D Sbjct: 791 YRAMALATCELIWL-RHLLQELRFGKDEQM-KLICDNQAALHIASNPVFHERTKHIEVDC 848 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQF---LGKLDIQSP 1053 HF+ E + G QL D+FTK+L G + LG D+ +P Sbjct: 849 HFIREKIASGCVATSFVNSNDQLADIFTKSLRGPRIKYICNKLGAYDVYAP 899 >dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1491 Score = 168 bits (426), Expect(2) = 1e-51 Identities = 102/282 (36%), Positives = 146/282 (51%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL+EEV+M+L PG H CR+RKSLYG KQA + +LS +L GFI DYS Sbjct: 1099 DLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDALKRFGFIQGYEDYS 1158 Query: 182 LFHKSSGSFVTILAVYVDDILLT------------------------------------- 250 F S + VYVDD+++ Sbjct: 1159 FFSYSCKGIELRVLVYVDDLIICGNDEYMVQKFKEYLGRCFSMKDLGKLKYFLGIEVSRG 1218 Query: 251 -DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 DG+ ++Q K+AL ++ + +RPA +P++ L + G LL +P R L+G + Sbjct: 1219 PDGIFLSQRKYALDIISDSGTLGARPAYTPLEQNHHLASDDGPLLQDPKPFRRLVGRLLY 1278 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 L TRP LSY+V LS +MQ P H +AA+ +RYL+ P G+ ++S+ + YCD Sbjct: 1279 LLHTRPELSYSVHVLSQFMQAPREAHLEAAMRIVRYLKGSPGQGILLSSNKDLTLEVYCD 1338 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +D+ SC TR+S+S Y + LGGSP+S K K Q VS S AE+ Sbjct: 1339 SDFQSCPLTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEA 1380 Score = 62.8 bits (151), Expect(2) = 1e-51 Identities = 40/110 (36%), Positives = 56/110 (50%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M + +I W+ L+ + + L CD++ AI IA +PVFHE TKHIE D Sbjct: 1382 YRAMSVALKEIKWL-NKLLKELGITL-AAPTRLFCDSKAAISIAANPVFHERTKHIERDC 1439 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQSPDT 1059 H V + + DG QL D+FTKAL + + KL IQ+ T Sbjct: 1440 HSVRDAVRDGIITTHHVRTSEQLADIFTKALGRNQFIYLMSKLGIQNLHT 1489 >gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana] Length = 1468 Score = 174 bits (440), Expect(2) = 2e-51 Identities = 104/283 (36%), Positives = 164/283 (57%), Gaps = 40/283 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL EEV+M+L G S CR+ KSLYG KQA +++LSS+L +GF SL+DYS Sbjct: 1076 DLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYS 1135 Query: 182 LF-HKSSGSFVTILAVYVDDILLT------------------------------------ 250 LF + + G FV +L VYVDD++++ Sbjct: 1136 LFSYNNDGIFVHVL-VYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSR 1194 Query: 251 --DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PN 421 G ++Q K+ L ++ E +RP++ P++ KL++ + LL + + R L+G Sbjct: 1195 NAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLI 1254 Query: 422 FLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYC 601 +L TRP LSY+V +L+ +MQNP H+ AA+ +RYL+ +P G+ ++S+S+ I G+C Sbjct: 1255 YLVVTRPELSYSVHTLAQFMQNPRQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWC 1314 Query: 602 DADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 D+D+A+C TR+S++GY+++LG +P+S K K QP VS S AE+ Sbjct: 1315 DSDYAACPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEA 1357 Score = 56.6 bits (135), Expect(2) = 2e-51 Identities = 36/105 (34%), Positives = 52/105 (49%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M L ++ W+ L S + + D++ AI ++ +PV HE TKH+E D Sbjct: 1359 YRAMAFLTQELMWL-KRVLYDLGVSHVQAM-RIFSDSKSAIALSVNPVQHERTKHVEVDC 1416 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDI 1044 HF+ + +LDG QL D+ TKAL R FL KL I Sbjct: 1417 HFIRDAILDGIIATSFVPSHKQLADILTKALGEKEVRYFLRKLGI 1461 >dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1109 Score = 174 bits (440), Expect(2) = 2e-51 Identities = 104/283 (36%), Positives = 164/283 (57%), Gaps = 40/283 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL EEV+M+L G S CR+ KSLYG KQA +++LSS+L +GF SL+DYS Sbjct: 717 DLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYS 776 Query: 182 LF-HKSSGSFVTILAVYVDDILLT------------------------------------ 250 LF + + G FV +L VYVDD++++ Sbjct: 777 LFSYNNDGVFVHVL-VYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSR 835 Query: 251 --DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PN 421 G ++Q K+ L ++ E +RP++ P++ KL++ + LL + + R L+G Sbjct: 836 NAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLI 895 Query: 422 FLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYC 601 +L TRP LSY+V +L+ +MQNP H+ AA+ +RYL+ +P G+ ++S+S+ I G+C Sbjct: 896 YLAVTRPELSYSVHTLAQFMQNPRQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWC 955 Query: 602 DADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 D+D+A+C TR+S++GY+++LG +P+S K K QP VS S AE+ Sbjct: 956 DSDYAACPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEA 998 Score = 56.6 bits (135), Expect(2) = 2e-51 Identities = 36/105 (34%), Positives = 52/105 (49%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M L ++ W+ L S + + D++ AI ++ +PV HE TKH+E D Sbjct: 1000 YRAMAFLTQELMWL-KRVLYDLGVSHVQAM-RIFSDSKSAIALSVNPVQHERTKHVEVDC 1057 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDI 1044 HF+ + +LDG QL D+ TKAL R FL KL I Sbjct: 1058 HFIRDAILDGIIATSFVPSHKQLADILTKALGEKEVRYFLRKLGI 1102 >emb|CAA18107.1| LTR retrotransposon like protein [Arabidopsis thaliana] gi|7269049|emb|CAB79159.1| LTR retrotransposon like protein [Arabidopsis thaliana] Length = 1109 Score = 173 bits (439), Expect(2) = 3e-51 Identities = 103/283 (36%), Positives = 164/283 (57%), Gaps = 40/283 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL EEV+M+L G S CR+ KSLYG KQA +++LSS+L +GF SL+DYS Sbjct: 717 DLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYS 776 Query: 182 LF-HKSSGSFVTILAVYVDDILLT------------------------------------ 250 LF + + G FV +L VYVDD++++ Sbjct: 777 LFSYNNDGVFVHVL-VYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSR 835 Query: 251 --DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PN 421 G ++Q K+ L ++ E +RP++ P++ KL++ + LL + + R L+G Sbjct: 836 NAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLI 895 Query: 422 FLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYC 601 +L TRP LSY+V +L+ +MQNP H+ AA+ +RYL+ +P G+ ++S+S+ I G+C Sbjct: 896 YLAVTRPELSYSVHTLAQFMQNPRQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWC 955 Query: 602 DADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 D+D+A+C TR+S++GY+++LG +P+S K K QP +S S AE+ Sbjct: 956 DSDYAACPLTRRSLTGYFVQLGDTPISWKTKKQPTISRSSAEA 998 Score = 56.6 bits (135), Expect(2) = 3e-51 Identities = 36/105 (34%), Positives = 52/105 (49%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M L ++ W+ L S + + D++ AI ++ +PV HE TKH+E D Sbjct: 1000 YRAMAFLTQELMWL-KRVLYDLGVSHVQAM-RIFSDSKSAIALSVNPVQHERTKHVEVDC 1057 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDI 1044 HF+ + +LDG QL D+ TKAL R FL KL I Sbjct: 1058 HFIRDAILDGIIATSFVPSHKQLADILTKALGEKEVRYFLRKLGI 1102 >gb|ABI34329.1| Integrase core domain containing protein [Solanum demissum] Length = 1775 Score = 161 bits (407), Expect(2) = 1e-50 Identities = 103/285 (36%), Positives = 146/285 (51%), Gaps = 42/285 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHA--CRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLND 175 DL+EEV+M PG +S + CR+R+SLYG KQ+ + + S+ + G S D Sbjct: 991 DLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSPRAWFGKFSTVIQEFGMTRSGAD 1050 Query: 176 YSLFHKSSG-SFVTILAVYVDDILLT---------------------------------- 250 +S+F++ S S L VYVDDI++T Sbjct: 1051 HSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQHLFKHFQTKDLGRLKYFLGIEV 1110 Query: 251 ----DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG* 415 G++++Q K+AL +L E RP +PMDP +KL GE L NP R L+G Sbjct: 1111 AQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDPNVKLLPGQGEPLSNPERYRRLVGK 1170 Query: 416 PNFLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFG 595 N+LT TRP++S+ V +S +M +PC H++A + LRY++ P GL I G Sbjct: 1171 LNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRILRYIKSAPGKGLLFEDQGHEHIIG 1230 Query: 596 YCDADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 Y DADWA R+S SGY + +GG+ VS K K Q VV+ S AES Sbjct: 1231 YTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNVVARSSAES 1275 Score = 67.0 bits (162), Expect(2) = 1e-50 Identities = 33/61 (54%), Positives = 39/61 (63%) Frame = +1 Query: 820 VSLHCDNQYAIHIAKHPVFHEHTKHIEFDYHFVFEILLDGFXXXXXXXXXXQLDDLFTKA 999 + L CDNQ A+HIA +PVFHE TKHIE D HFV E +L G QL D+FTK+ Sbjct: 1305 MELVCDNQAALHIASNPVFHERTKHIEIDCHFVREKILSGDIVTKFVKSNDQLADIFTKS 1364 Query: 1000 L 1002 L Sbjct: 1365 L 1365 >gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1333 Score = 161 bits (408), Expect(2) = 4e-50 Identities = 99/282 (35%), Positives = 150/282 (53%), Gaps = 39/282 (13%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSHACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDYS 181 DL EEV+M+L PG + CR+RK+LYG KQA + +L+++L +GF SL DYS Sbjct: 941 DLREEVYMKLPPGFEASHPNKVCRLRKALYGLKQAPRCWFEKLTTALKRYGFQQSLADYS 1000 Query: 182 LFHKSSGSFVTILAVYVDDILLTD------------------------------------ 253 LF GS + +YVDD+++T Sbjct: 1001 LFTLVKGSVRIKILIYVDDLIITGNSQRATQQFKEYLASCFHMKDLGPLKYFLGIEVARS 1060 Query: 254 --GLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNPT-LRHLLG*PNF 424 G+ + Q K+AL ++ E +PA+ P++ KL + + LL +P R L+G + Sbjct: 1061 TTGIYICQRKYALDIISETGLLGVKPANFPLEQNHKLGLSTSPLLTDPQRYRRLVGRLIY 1120 Query: 425 LTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCD 604 L TR +L+++V L+ +MQ P H+ AAL +RYL+ DP G+F+ S I G+CD Sbjct: 1121 LAVTRLDLAFSVHILARFMQEPREDHWAAALRVVRYLKADPGQGVFLRRSGDFQITGWCD 1180 Query: 605 ADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 +DWA +R+SV+GY+++ G SP+S K K Q VS S AE+ Sbjct: 1181 SDWAGDPMSRRSVTGYFVQFGDSPISWKTKKQDTVSKSSAEA 1222 Score = 64.7 bits (156), Expect(2) = 4e-50 Identities = 40/107 (37%), Positives = 57/107 (53%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M L +++ W+ L S S ++ + CD++ AI+IA +PVFHE TKHIE DY Sbjct: 1224 YRAMSFLASELLWL-KQLLFSLGVSHVQPMI-MCCDSKSAIYIATNPVFHERTKHIEIDY 1281 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQFLGKLDIQS 1050 HFV + + G QL D+FTK L F KL I++ Sbjct: 1282 HFVRDEFVKGVITPRHVGTTSQLADIFTKPLGRDCFSAFRIKLGIRN 1328 >gb|EOY10155.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao] Length = 721 Score = 156 bits (395), Expect(2) = 5e-50 Identities = 103/259 (39%), Positives = 146/259 (56%), Gaps = 16/259 (6%) Frame = +2 Query: 2 DLDEEVFMRLLPGLT----HPSDSH-ACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISS 166 DLDEEV+M + G HPS S C++ KSLYG KQA Q +L+S + +GF S Sbjct: 353 DLDEEVYMDIPKGYIVKREHPSGSKLVCKLHKSLYGLKQALRQWNAKLTSCIIHYGFKQS 412 Query: 167 LNDYSLF--HKSSGSFVTILAVYVDDILLTDGLIVTQ*KFALHLL-------LEHPDHPS 319 ++DYSLF + + G F+ +L YVDDIL+ + HL LE+ + Sbjct: 413 MSDYSLFTMNTTDGEFIALLT-YVDDILIGNTSTQVAAVVKEHLSSQFKLKDLEYGLLGA 471 Query: 320 RPASSPMDPTLKLTMES--GELLPNPTLRHLLG*PNFLTQTRPNLSYTVRSLS*YMQNPC 493 +P S+P+D +KL S EL+ + R L+G +LT TRP++SY V++LS +M P Sbjct: 472 KPVSTPIDYNVKLAKASKEDELVDSFKYRQLVGKLLYLTFTRPDISYAVQTLSQFMDKPG 531 Query: 494 SGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFGYCDADWASCSDTRQSVSGYYIRLGGS 673 H+ AA L+YL+ P G+ M + S I YCD DWA C DTR+ ++GY + +G S Sbjct: 532 RKHYMAASKVLKYLKAAPGQGILMKAESDLKISAYCDNDWAGCLDTRKFITGYCVFIGNS 591 Query: 674 PVSQKLK*QPVVSLSLAES 730 VS K K Q VV+ S E+ Sbjct: 592 LVSWKCKKQQVVARSSTEA 610 Score = 69.3 bits (168), Expect(2) = 5e-50 Identities = 35/77 (45%), Positives = 47/77 (61%) Frame = +1 Query: 820 VSLHCDNQYAIHIAKHPVFHEHTKHIEFDYHFVFEILLDGFXXXXXXXXXXQLDDLFTKA 999 V L+CDNQ AI+I+K+PV HE TKHIE D HF+ E +L G Q+ D FTKA Sbjct: 640 VKLYCDNQSAIYISKNPVLHERTKHIEIDCHFIREKILSGVIKPVHISTDSQVTDAFTKA 699 Query: 1000 LAGH*HRQFLGKLDIQS 1050 L ++ L K++I + Sbjct: 700 LQPGQFKKLLCKMNIHN 716 >emb|CAN72018.1| hypothetical protein VITISV_001841 [Vitis vinifera] Length = 1225 Score = 155 bits (393), Expect(2) = 7e-50 Identities = 100/285 (35%), Positives = 147/285 (51%), Gaps = 42/285 (14%) Frame = +2 Query: 2 DLDEEVFMRLLPGLTHPSDSH-ACRMRKSLYGFKQASHQ*YTRLSSSLGTHGFISSLNDY 178 DL EEV+M PG +S CR+R+SLYG KQ+ ++R SS + G + S D+ Sbjct: 831 DLAEEVYMEQPPGFVAQGESGLVCRLRRSLYGLKQSPRAWFSRFSSVVQEFGMLRSTADH 890 Query: 179 SLF--HKSSGSFVTILAVYVDDILLT---------------------------------- 250 S+F H S G + L VYVDDI++T Sbjct: 891 SVFYHHNSLGQCI-YLVVYVDDIVITGSDQDGIQKLKQHLFTHFQTKDLGKLKYFLGIEI 949 Query: 251 ----DGLIVTQ*KFALHLLLEHPDHPSRPASSPMDPTLKLTMESGELLPNP-TLRHLLG* 415 G++++Q K+AL +L E +P +PMDP +KL GE L +P R L+G Sbjct: 950 AQSSSGVVLSQRKYALDILEETGMLDCKPVDTPMDPNVKLVPGQGEPLGDPGRYRRLVGK 1009 Query: 416 PNFLTQTRPNLSYTVRSLS*YMQNPCSGHFQAALNALRYLQRDPSLGLFMNSSSSH*IFG 595 N+LT TRP++S+ V +S ++Q+PC H+ A + LRY++ P G+ + + G Sbjct: 1010 LNYLTITRPDISFPVSVVSQFLQSPCDSHWDAVIRILRYIKSTPGQGVLYENRGHTQVVG 1069 Query: 596 YCDADWASCSDTRQSVSGYYIRLGGSPVSQKLK*QPVVSLSLAES 730 Y DADWA R+S SGY + +GG+ +S K K Q VV+ S AE+ Sbjct: 1070 YTDADWAGSPTDRRSTSGYCVFIGGNLISWKSKKQDVVARSSAEA 1114 Score = 69.7 bits (169), Expect(2) = 7e-50 Identities = 42/111 (37%), Positives = 57/111 (51%), Gaps = 3/111 (2%) Frame = +1 Query: 730 YHSMRCLVAKISWIIPSSLQSCNFSFNSLLVSLHCDNQYAIHIAKHPVFHEHTKHIEFDY 909 Y +M ++ W+ LQ F + + L CDNQ A+HIA +PVFHE TKHIE D Sbjct: 1116 YRAMALATCELIWL-RHLLQELRFGKDEQM-KLICDNQAALHIASNPVFHERTKHIEVDC 1173 Query: 910 HFVFEILLDGFXXXXXXXXXXQLDDLFTKALAGH*HRQF---LGKLDIQSP 1053 HF+ E + G QL D+FTK+L G + LG D+ +P Sbjct: 1174 HFIREKIASGCVATSFVNSNDQLADIFTKSLRGPRIKYICNKLGAYDVYAP 1224