BLASTX nr result
ID: Cephaelis21_contig00022622
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00022622
         (1238 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              510   e-150
ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800...   497   e-146
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     483   e-142
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   459   e-135
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         457   e-135

>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score = 510 bits (1313), Expect(2) = e-150
 Identities = 250/372 (67%), Positives = 298/372 (80%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL GA LNY TY+KELYA+VRAL+T QHYL PKEFVI TDHESLK+L Q KL+K+HA+
Sbjct: 1233 EKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKGQQKLNKRHAR 1292

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIETFPYVIKYKKGKDN+VADALSR VLL+SL+ KLLGFE +K LY ND DF IY
Sbjct: 1293 WVEFIETFPYVIKYKKGKDNVVADALSRRYVLLSSLDAKLLGFEHIKSLYANDSDFEKIY 1352

Query: 757  EACEVGAFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLIE 578
            +CE AF KY+R DG+LF +NRLCIP+ SLREL +RE+H GGLMGHFGV+KT +V+ +
Sbjct: 1353 SSCEKFAFGKYYRHDGFLFYDNRLCIPNS-SLRELFIREAHGGGLMGHFGVSKTIKVMQD 1411

Query: 577  YFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQT 398
            +F+WP+MK+ VE+IC +C C +AK+ S PHG YTPLP+P P DISM+FV+GLPR++T
Sbjct: 1412 HFHWPHMKRDVERICERCPTCKQAKAKSQPHGLYTPLPIPSHPWNDISMDFVVGLPRTRT 1471

Query: 397  GKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSYF 218
            GKDSIFVVVDRFSKMAHFIP K++ A H+A+LFFREVVRLHG+PK+IVSDRD KFLSYF
Sbjct: 1472 GKDSIFVVVDRFSKMAHFIPCHKTDDAIHIANLFFREVVRLHGMPKTIVSDRDTKFLSYF 1531

Query: 217  XXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAYN 38
            LFSTT HPQTD QTEV NRTL +LLR LI+ NLK E+ LP EFAYN
Sbjct: 1532 WKTLWSKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRALIKKNLKTWEDCLPHVEFAYN 1591

Query: 37   RTIHSTTSYSPF 2
            ++HS + +SPF
Sbjct: 1592 HSMHSASKFSPF 1603

 Score = 50.8 bits (120), Expect(2) = e-150
 Identities = 22/37 (59%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            APVL+LPDF TF++ECDAS I VL+Q +P+AY
Sbjct: 1194 APVLSLPDFLKTFEIECDASGVGIGVVLMQDKKPIAY 1230

>ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800881 [Glycine max]
          Length = 1746

 Score = 497 bits (1280), Expect(2) = e-146
 Identities = 244/372 (65%), Positives = 290/372 (77%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL A LNYSTY+KELYA+VRAL+T QHYLLPKEFVI +DHESLKYL QGKL+K+HAK
Sbjct: 970  EKLGAAALNYSTYDKELYALVRALQTWQHYLLPKEFVIHSDHESLKYLKGQGKLNKRHAK 1029

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV F+E FPYVIK+KKGK N+VADALSR LL L TKL G E LK++YV+D DF I+
Sbjct: 1030 WVEFLEQFPYVIKHKKGKGNVVADALSRRHALLAMLETKLFGLESLKDMYVHDVDFAEIF 1089

Query: 757  EACEVGAFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLIE 578
            ACE + Y+R +G+LF N+LC+P C S+RELLV ESH GGLMGHFGV KT E+L+E
Sbjct: 1090 AACEKFSENGYYRHNGFLFKANKLCVPKC-SIRELLVSESHEGGLMGHFGVQKTLEILLE 1148

Query: 577  YFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQT 398
            +F+WP+M++ V K C CI C +AKS PHG YTPLPVP P TDISM+FVLGLP+++
Sbjct: 1149 HFFWPHMRRDVHKFCGHCIVCKQAKSKVKPHGLYTPLPVPEYPWTDISMDFVLGLPKTKN 1208

Query: 397  GKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSYF 218
            GKDS+FVVVDRFSKMAHFIP +K + A HVADLFF+E+VRLHG+P+SIVSDRD KFLS+F
Sbjct: 1209 GKDSVFVVVDRFSKMAHFIPCKKVDDACHVADLFFKEIVRLHGLPRSIVSDRDAKFLSHF 1268

Query: 217  XXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAYN 38
            LFSTT HPQTD QTEV NRTLG+LLR +++ NLK E LP EFAYN
Sbjct: 1269 WRTLWGKIGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRTVLKKNLKSWEACLPHVEFAYN 1328

Query: 37   RTIHSTTSYSPF 2
            R +HSTT+ SPF
Sbjct: 1329 RAVHSTTNCSPF 1340

 Score = 50.4 bits (119), Expect(2) = e-146
 Identities = 21/37 (56%), Positives = 30/37 (81%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+LA+P+F+ +F++ECDAS+ I AVLLQ P+AY
Sbjct: 931  APILAMPNFAKSFEIECDASNVGIGAVLLQEGHPIAY 967

>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score = 483 bits (1242), Expect(2) = e-142
 Identities = 238/373 (63%), Positives = 286/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL GA LNYS Y+KELYA+VRAL+T QHYL PKEFVI +DHE+LKYL Q KL+++HAK
Sbjct: 1044 EKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQAKLNRRHAK 1103

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIETFPYV+KYKKGK+NIVADALSR +VLL L K+ G E +KELY D DF Y
Sbjct: 1104 WVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVTGIESIKELYSADLDFSEPY 1163

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
            C G +EKY DG+LF N+LC+P C S+R LL++E+H+GGLMGHFG KTY++L
Sbjct: 1164 AKCTAGKGWEKYHIHDGFLFRANKLCVPHC-SVRLLLLQETHAGGLMGHFGWRKTYDMLA 1222

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
            ++FYWP M++ V+++ +C+ C KAKS +PHG YTPLPVP +P DISM+FVLGLPR++
Sbjct: 1223 DHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTK 1282

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
            G+DSIFVVVDRFSKMAHFIP KS+ A H+A LFF E+VRLHG+PK+IVSDRD KFLSY
Sbjct: 1283 RGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSY 1342

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F LFSTT HPQTD QTEV NRTL LLR LI+ NLK+ EE LP EFAY
Sbjct: 1343 FWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAY 1402

Query: 40   NRTIHSTTSYSPF 2
            NR +HSTT+ PF
Sbjct: 1403 NRAVHSTTNMCPF 1415

 Score = 49.3 bits (116), Expect(2) = e-142
 Identities = 21/36 (58%), Positives = 27/36 (75%)
 Frame = -3

Query: 1230 PVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            P+L LPDF+ TF++ECDAS I VL+Q +PVAY
Sbjct: 1006 PLLVLPDFTKTFEVECDASGIGIGGVLMQNGQPVAY 1041

>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
 gi|31431012|gb|AAP52850.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group]
          Length = 2447

 Score = 459 bits (1182), Expect(2) = e-135
 Identities = 224/373 (60%), Positives = 284/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL+G +LNYSTY+KELYA+VR L+T QHYL PKEFVI +DHESLK++ SQGKL+++HAK
Sbjct: 1041 EKLSGPVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQGKLNRRHAK 1100

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIE+FPYVIK+KKGK+NI+ADALSR LLT L+ K+ G E +K+ Y +D DF +
Sbjct: 1101 WVEFIESFPYVIKHKKGKENIIADALSRRYTLLTQLDYKIFGLETIKDQYAHDADFNDVL 1160

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
            C+ G + K+ DG++F N+LCIP+ S+R LL++E+H GGLMGHFG KT+++L
Sbjct: 1161 LHCKDGRTWNKFVINDGFVFRANKLCIPAS-SVRLLLLQEAHGGGLMGHFGAKKTHDILA 1219

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
            +F+WP M++ V + ++C C KAKS HPHG Y PLPVP P DISM+FVLGLPR++
Sbjct: 1220 SHFFWPQMRRDVGRFVARCATCQKAKSRLHPHGLYMPLPVPTVPWEDISMDFVLGLPRTK 1279

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
            G+DSIFVVVDRFSKMAHFIP K++ A H+ADLFFRE+VRLHGVP +IVSDRD KFLS+
Sbjct: 1280 RGRDSIFVVVDRFSKMAHFIPCHKTDDASHIADLFFREIVRLHGVPNTIVSDRDTKFLSH 1339

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F LFSTT HPQTD QTEV NRTL ++LR +++ N+K EE LP EFAY
Sbjct: 1340 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAY 1399

Query: 40   NRTIHSTTSYSPF 2
            NR++HSTT PF
Sbjct: 1400 NRSLHSTTKMCPF 1412

 Score = 51.2 bits (121), Expect(2) = e-135
 Identities = 23/37 (62%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+L LPDF+ TF+LECDAS + VLLQ +PVAY
Sbjct: 1002 APLLQLPDFNKTFELECDASGIGLGGVLLQEGKPVAY 1038

 Score = 236 bits (602), Expect(2) = 4e-64
 Identities = 140/394 (35%), Positives = 210/394 (53%), Gaps = 30/394 (7%)
 Frame = -2

Query: 1093 NYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAKWVTFIETF 914
            NY T++ EL A+V ALK +HYL+ I TDH+SLKY+ +Q L+ + +W+ I+ +
Sbjct: 1841 NYPTHDLELAAVVHALKIWRHYLIGNRCEIYTDHKSLKYIFTQSDLNLRQRRWLELIKDY 1900

Query: 913  PYVIKYKKGKDNIVADALSRTSVLLT---------------SLNTKLLGFEFL------- 800
            I Y GK N+VADALSR S T +LN ++ FL
Sbjct: 1901 DVGIHYHPGKANVVADALSRKSHCNTLGVRGIPPELNQQMEALNLSIVSRGFLATLEAKP 1960

Query: 799  ------KELYVNDPDFRSIYEACEVGAFEKYFRQD-GYLFMNNRLCIPSCCSLRELLVRE 641
            +E NDPD R + + + G + + G L+ NR+C+P L++L+++E
Sbjct: 1961 TLLDQIREAQKNDPDMRGLLKNMKQGKAAGFIEDEHGTLWNRNRVCVPDVRELKQLILQE 2020

Query: 640  SHSGGLMGHFGVNKTYEVLIEYFYWPNMKKVVEKICSKCIACLKAKST-SHPHGSYTPLP 464
            +H H G K Y L E ++W +MK+ + + + C C + K+ P G PL
Sbjct: 2021 AHESPYSIHPGSTKMYLDLKEKYWWVSMKREIAEFVALCDVCQRVKAEHQRPAGLLQPLQ 2080

Query: 463  VP*SPLTDISMNFVLGLPRSQTGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREV 284
            VP +I M+F+ GLP++Q G DSI+VVVDR +K+A FIP + + +A+L+F +
Sbjct: 2081 VPEWKWDEIGMDFITGLPKTQGGYDSIWVVVDRLTKVARFIPVKTTYGGNKLAELYFARI 2140

Query: 283  VRLHGVPKSIVSDRDVKFLSYFXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLR 104
            V LHGVPK IVSDR+ +F S+F FST HPQTD QTE N+ L +L
Sbjct: 2141 VSLHGVPKKIVSDRESQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLH 2200

Query: 103  VLIQHNLKD*EEILPIAEFAYNRTIHSTTSYSPF 2
            + K ++ LP AEF+YN + ++ +P+
Sbjct: 2201 ACVLDFGKTWDKSLPYAEFSYNNSYQASIQMAPY 2234

 Score = 36.6 bits (83), Expect(2) = 4e-64
 Identities = 18/41 (43%), Positives = 24/41 (58%)
 Frame = -3

Query: 1236 TAPVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAYSRR 1114
            ++PVL LPD F + CDAS + VL+Q VAY+ R
Sbjct: 1793 SSPVLILPDTRKDFMVYCDASPQGLGCVLMQEGHVVAYASR 1833

>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score = 457 bits (1177), Expect(2) = e-135
 Identities = 222/373 (59%), Positives = 287/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL+G++LNYSTY+KELYA+VR L+T QHYL PKEFVI +DHESLK++ SQGKL+++HAK
Sbjct: 1044 EKLSGSVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQGKLNRRHAK 1103

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIE+FPYVIK+KKGK+NI+ADALSR LL L+ K+ G E +K+ YV+D DF+ +
Sbjct: 1104 WVEFIESFPYVIKHKKGKENIIADALSRRYTLLNQLDYKIFGLETIKDQYVHDADFKDVL 1163

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
            C+ G + KY DG++F N+LCIP+ S+R LL++E+H GGLMGHFG KT ++L
Sbjct: 1164 LHCKDGKGWNKYIVSDGFVFRANKLCIPAS-SVRLLLLQEAHGGGLMGHFGAKKTEDILA 1222

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
            +F+WP M++ V ++ ++C C KAKS +PHG Y PLPVP +P DISM+FVLGLPR++
Sbjct: 1223 GHFFWPKMRRDVVRLVARCTTCQKAKSRLNPHGLYLPLPVPSAPWEDISMDFVLGLPRTR 1282

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
            G+DS+FVVVDRFSKMAHFIP K++ A H+ADLFFRE+VRLHGVP +IVSDRD KFLS+
Sbjct: 1283 KGRDSVFVVVDRFSKMAHFIPCHKTDDATHIADLFFREIVRLHGVPNTIVSDRDAKFLSH 1342

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F LFSTT HPQTD QTEV NRTL ++LR +++ N+K E+ LP EFAY
Sbjct: 1343 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFAY 1402

Query: 40   NRTIHSTTSYSPF 2
            NR++HSTT PF
Sbjct: 1403 NRSLHSTTKMCPF 1415

 Score = 51.2 bits (121), Expect(2) = e-135
 Identities = 23/37 (62%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+L LPDF+ TF+LECDAS + VLLQ +PVAY
Sbjct: 1005 APLLQLPDFNKTFELECDASGIGLGGVLLQEGKPVAY 1041