BLASTX nr result

ID: Cephaelis21_contig00022622 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00022622
         (1238 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              510   e-150
ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800...   497   e-146
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     483   e-142
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   459   e-135
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         457   e-135

>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score =  510 bits (1313), Expect(2) = e-150
 Identities = 250/372 (67%), Positives = 298/372 (80%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL GA LNY TY+KELYA+VRAL+T QHYL PKEFVI TDHESLK+L  Q KL+K+HA+
Sbjct: 1233 EKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKGQQKLNKRHAR 1292

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIETFPYVIKYKKGKDN+VADALSR  VLL+SL+ KLLGFE +K LY ND DF  IY
Sbjct: 1293 WVEFIETFPYVIKYKKGKDNVVADALSRRYVLLSSLDAKLLGFEHIKSLYANDSDFEKIY 1352

Query: 757  EACEVGAFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLIE 578
             +CE  AF KY+R DG+LF +NRLCIP+  SLREL +RE+H GGLMGHFGV+KT +V+ +
Sbjct: 1353 SSCEKFAFGKYYRHDGFLFYDNRLCIPNS-SLRELFIREAHGGGLMGHFGVSKTIKVMQD 1411

Query: 577  YFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQT 398
            +F+WP+MK+ VE+IC +C  C +AK+ S PHG YTPLP+P  P  DISM+FV+GLPR++T
Sbjct: 1412 HFHWPHMKRDVERICERCPTCKQAKAKSQPHGLYTPLPIPSHPWNDISMDFVVGLPRTRT 1471

Query: 397  GKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSYF 218
            GKDSIFVVVDRFSKMAHFIP  K++ A H+A+LFFREVVRLHG+PK+IVSDRD KFLSYF
Sbjct: 1472 GKDSIFVVVDRFSKMAHFIPCHKTDDAIHIANLFFREVVRLHGMPKTIVSDRDTKFLSYF 1531

Query: 217  XXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAYN 38
                        LFSTT HPQTD QTEV NRTL +LLR LI+ NLK  E+ LP  EFAYN
Sbjct: 1532 WKTLWSKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRALIKKNLKTWEDCLPHVEFAYN 1591

Query: 37   RTIHSTTSYSPF 2
             ++HS + +SPF
Sbjct: 1592 HSMHSASKFSPF 1603



 Score = 50.8 bits (120), Expect(2) = e-150
 Identities = 22/37 (59%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            APVL+LPDF  TF++ECDAS   I  VL+Q  +P+AY
Sbjct: 1194 APVLSLPDFLKTFEIECDASGVGIGVVLMQDKKPIAY 1230


>ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800881 [Glycine max]
          Length = 1746

 Score =  497 bits (1280), Expect(2) = e-146
 Identities = 244/372 (65%), Positives = 290/372 (77%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL  A LNYSTY+KELYA+VRAL+T QHYLLPKEFVI +DHESLKYL  QGKL+K+HAK
Sbjct: 970  EKLGAAALNYSTYDKELYALVRALQTWQHYLLPKEFVIHSDHESLKYLKGQGKLNKRHAK 1029

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV F+E FPYVIK+KKGK N+VADALSR   LL  L TKL G E LK++YV+D DF  I+
Sbjct: 1030 WVEFLEQFPYVIKHKKGKGNVVADALSRRHALLAMLETKLFGLESLKDMYVHDVDFAEIF 1089

Query: 757  EACEVGAFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLIE 578
             ACE  +   Y+R +G+LF  N+LC+P C S+RELLV ESH GGLMGHFGV KT E+L+E
Sbjct: 1090 AACEKFSENGYYRHNGFLFKANKLCVPKC-SIRELLVSESHEGGLMGHFGVQKTLEILLE 1148

Query: 577  YFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQT 398
            +F+WP+M++ V K C  CI C +AKS   PHG YTPLPVP  P TDISM+FVLGLP+++ 
Sbjct: 1149 HFFWPHMRRDVHKFCGHCIVCKQAKSKVKPHGLYTPLPVPEYPWTDISMDFVLGLPKTKN 1208

Query: 397  GKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSYF 218
            GKDS+FVVVDRFSKMAHFIP +K + A HVADLFF+E+VRLHG+P+SIVSDRD KFLS+F
Sbjct: 1209 GKDSVFVVVDRFSKMAHFIPCKKVDDACHVADLFFKEIVRLHGLPRSIVSDRDAKFLSHF 1268

Query: 217  XXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAYN 38
                        LFSTT HPQTD QTEV NRTLG+LLR +++ NLK  E  LP  EFAYN
Sbjct: 1269 WRTLWGKIGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRTVLKKNLKSWEACLPHVEFAYN 1328

Query: 37   RTIHSTTSYSPF 2
            R +HSTT+ SPF
Sbjct: 1329 RAVHSTTNCSPF 1340



 Score = 50.4 bits (119), Expect(2) = e-146
 Identities = 21/37 (56%), Positives = 30/37 (81%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+LA+P+F+ +F++ECDAS+  I AVLLQ   P+AY
Sbjct: 931  APILAMPNFAKSFEIECDASNVGIGAVLLQEGHPIAY 967


>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score =  483 bits (1242), Expect(2) = e-142
 Identities = 238/373 (63%), Positives = 286/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL GA LNYS Y+KELYA+VRAL+T QHYL PKEFVI +DHE+LKYL  Q KL+++HAK
Sbjct: 1044 EKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQAKLNRRHAK 1103

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIETFPYV+KYKKGK+NIVADALSR +VLL  L  K+ G E +KELY  D DF   Y
Sbjct: 1104 WVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVTGIESIKELYSADLDFSEPY 1163

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
              C  G  +EKY   DG+LF  N+LC+P C S+R LL++E+H+GGLMGHFG  KTY++L 
Sbjct: 1164 AKCTAGKGWEKYHIHDGFLFRANKLCVPHC-SVRLLLLQETHAGGLMGHFGWRKTYDMLA 1222

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
            ++FYWP M++ V+++  +C+ C KAKS  +PHG YTPLPVP +P  DISM+FVLGLPR++
Sbjct: 1223 DHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTK 1282

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
             G+DSIFVVVDRFSKMAHFIP  KS+ A H+A LFF E+VRLHG+PK+IVSDRD KFLSY
Sbjct: 1283 RGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSY 1342

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F            LFSTT HPQTD QTEV NRTL  LLR LI+ NLK+ EE LP  EFAY
Sbjct: 1343 FWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAY 1402

Query: 40   NRTIHSTTSYSPF 2
            NR +HSTT+  PF
Sbjct: 1403 NRAVHSTTNMCPF 1415



 Score = 49.3 bits (116), Expect(2) = e-142
 Identities = 21/36 (58%), Positives = 27/36 (75%)
 Frame = -3

Query: 1230 PVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            P+L LPDF+ TF++ECDAS   I  VL+Q  +PVAY
Sbjct: 1006 PLLVLPDFTKTFEVECDASGIGIGGVLMQNGQPVAY 1041


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  459 bits (1182), Expect(2) = e-135
 Identities = 224/373 (60%), Positives = 284/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL+G +LNYSTY+KELYA+VR L+T QHYL PKEFVI +DHESLK++ SQGKL+++HAK
Sbjct: 1041 EKLSGPVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQGKLNRRHAK 1100

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIE+FPYVIK+KKGK+NI+ADALSR   LLT L+ K+ G E +K+ Y +D DF  + 
Sbjct: 1101 WVEFIESFPYVIKHKKGKENIIADALSRRYTLLTQLDYKIFGLETIKDQYAHDADFNDVL 1160

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
              C+ G  + K+   DG++F  N+LCIP+  S+R LL++E+H GGLMGHFG  KT+++L 
Sbjct: 1161 LHCKDGRTWNKFVINDGFVFRANKLCIPAS-SVRLLLLQEAHGGGLMGHFGAKKTHDILA 1219

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
             +F+WP M++ V +  ++C  C KAKS  HPHG Y PLPVP  P  DISM+FVLGLPR++
Sbjct: 1220 SHFFWPQMRRDVGRFVARCATCQKAKSRLHPHGLYMPLPVPTVPWEDISMDFVLGLPRTK 1279

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
             G+DSIFVVVDRFSKMAHFIP  K++ A H+ADLFFRE+VRLHGVP +IVSDRD KFLS+
Sbjct: 1280 RGRDSIFVVVDRFSKMAHFIPCHKTDDASHIADLFFREIVRLHGVPNTIVSDRDTKFLSH 1339

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F            LFSTT HPQTD QTEV NRTL ++LR +++ N+K  EE LP  EFAY
Sbjct: 1340 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAY 1399

Query: 40   NRTIHSTTSYSPF 2
            NR++HSTT   PF
Sbjct: 1400 NRSLHSTTKMCPF 1412



 Score = 51.2 bits (121), Expect(2) = e-135
 Identities = 23/37 (62%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+L LPDF+ TF+LECDAS   +  VLLQ  +PVAY
Sbjct: 1002 APLLQLPDFNKTFELECDASGIGLGGVLLQEGKPVAY 1038



 Score =  236 bits (602), Expect(2) = 4e-64
 Identities = 140/394 (35%), Positives = 210/394 (53%), Gaps = 30/394 (7%)
 Frame = -2

Query: 1093 NYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAKWVTFIETF 914
            NY T++ EL A+V ALK  +HYL+     I TDH+SLKY+ +Q  L+ +  +W+  I+ +
Sbjct: 1841 NYPTHDLELAAVVHALKIWRHYLIGNRCEIYTDHKSLKYIFTQSDLNLRQRRWLELIKDY 1900

Query: 913  PYVIKYKKGKDNIVADALSRTSVLLT---------------SLNTKLLGFEFL------- 800
               I Y  GK N+VADALSR S   T               +LN  ++   FL       
Sbjct: 1901 DVGIHYHPGKANVVADALSRKSHCNTLGVRGIPPELNQQMEALNLSIVSRGFLATLEAKP 1960

Query: 799  ------KELYVNDPDFRSIYEACEVGAFEKYFRQD-GYLFMNNRLCIPSCCSLRELLVRE 641
                  +E   NDPD R + +  + G    +   + G L+  NR+C+P    L++L+++E
Sbjct: 1961 TLLDQIREAQKNDPDMRGLLKNMKQGKAAGFIEDEHGTLWNRNRVCVPDVRELKQLILQE 2020

Query: 640  SHSGGLMGHFGVNKTYEVLIEYFYWPNMKKVVEKICSKCIACLKAKST-SHPHGSYTPLP 464
            +H      H G  K Y  L E ++W +MK+ + +  + C  C + K+    P G   PL 
Sbjct: 2021 AHESPYSIHPGSTKMYLDLKEKYWWVSMKREIAEFVALCDVCQRVKAEHQRPAGLLQPLQ 2080

Query: 463  VP*SPLTDISMNFVLGLPRSQTGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREV 284
            VP     +I M+F+ GLP++Q G DSI+VVVDR +K+A FIP + +     +A+L+F  +
Sbjct: 2081 VPEWKWDEIGMDFITGLPKTQGGYDSIWVVVDRLTKVARFIPVKTTYGGNKLAELYFARI 2140

Query: 283  VRLHGVPKSIVSDRDVKFLSYFXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLR 104
            V LHGVPK IVSDR+ +F S+F             FST  HPQTD QTE  N+ L  +L 
Sbjct: 2141 VSLHGVPKKIVSDRESQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLH 2200

Query: 103  VLIQHNLKD*EEILPIAEFAYNRTIHSTTSYSPF 2
              +    K  ++ LP AEF+YN +  ++   +P+
Sbjct: 2201 ACVLDFGKTWDKSLPYAEFSYNNSYQASIQMAPY 2234



 Score = 36.6 bits (83), Expect(2) = 4e-64
 Identities = 18/41 (43%), Positives = 24/41 (58%)
 Frame = -3

Query: 1236 TAPVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAYSRR 1114
            ++PVL LPD    F + CDAS   +  VL+Q    VAY+ R
Sbjct: 1793 SSPVLILPDTRKDFMVYCDASPQGLGCVLMQEGHVVAYASR 1833


>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score =  457 bits (1177), Expect(2) = e-135
 Identities = 222/373 (59%), Positives = 287/373 (76%), Gaps = 1/373 (0%)
 Frame = -2

Query: 1117 EKLNGAMLNYSTYNKELYAMVRALKT*QHYLLPKEFVIRTDHESLKYLHSQGKLSKKHAK 938
            EKL+G++LNYSTY+KELYA+VR L+T QHYL PKEFVI +DHESLK++ SQGKL+++HAK
Sbjct: 1044 EKLSGSVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQGKLNRRHAK 1103

Query: 937  WVTFIETFPYVIKYKKGKDNIVADALSRTSVLLTSLNTKLLGFEFLKELYVNDPDFRSIY 758
            WV FIE+FPYVIK+KKGK+NI+ADALSR   LL  L+ K+ G E +K+ YV+D DF+ + 
Sbjct: 1104 WVEFIESFPYVIKHKKGKENIIADALSRRYTLLNQLDYKIFGLETIKDQYVHDADFKDVL 1163

Query: 757  EACEVG-AFEKYFRQDGYLFMNNRLCIPSCCSLRELLVRESHSGGLMGHFGVNKTYEVLI 581
              C+ G  + KY   DG++F  N+LCIP+  S+R LL++E+H GGLMGHFG  KT ++L 
Sbjct: 1164 LHCKDGKGWNKYIVSDGFVFRANKLCIPAS-SVRLLLLQEAHGGGLMGHFGAKKTEDILA 1222

Query: 580  EYFYWPNMKKVVEKICSKCIACLKAKSTSHPHGSYTPLPVP*SPLTDISMNFVLGLPRSQ 401
             +F+WP M++ V ++ ++C  C KAKS  +PHG Y PLPVP +P  DISM+FVLGLPR++
Sbjct: 1223 GHFFWPKMRRDVVRLVARCTTCQKAKSRLNPHGLYLPLPVPSAPWEDISMDFVLGLPRTR 1282

Query: 400  TGKDSIFVVVDRFSKMAHFIPSRKSEYAKHVADLFFREVVRLHGVPKSIVSDRDVKFLSY 221
             G+DS+FVVVDRFSKMAHFIP  K++ A H+ADLFFRE+VRLHGVP +IVSDRD KFLS+
Sbjct: 1283 KGRDSVFVVVDRFSKMAHFIPCHKTDDATHIADLFFREIVRLHGVPNTIVSDRDAKFLSH 1342

Query: 220  FXXXXXXXXXXXXLFSTTSHPQTDCQTEVTNRTLGSLLRVLIQHNLKD*EEILPIAEFAY 41
            F            LFSTT HPQTD QTEV NRTL ++LR +++ N+K  E+ LP  EFAY
Sbjct: 1343 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFAY 1402

Query: 40   NRTIHSTTSYSPF 2
            NR++HSTT   PF
Sbjct: 1403 NRSLHSTTKMCPF 1415



 Score = 51.2 bits (121), Expect(2) = e-135
 Identities = 23/37 (62%), Positives = 28/37 (75%)
 Frame = -3

Query: 1233 APVLALPDFSNTFKLECDASDFEISAVLLQGSRPVAY 1123
            AP+L LPDF+ TF+LECDAS   +  VLLQ  +PVAY
Sbjct: 1005 APLLQLPDFNKTFELECDASGIGLGGVLLQEGKPVAY 1041

