BLASTX nr result

ID: Angelica22_contig00047334

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00047334
         (480 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABI34329.1| Integrase core domain containing protein [Solanum...    85   7e-15
gb|ABI34342.1| Polyprotein, putative [Solanum demissum]                85   7e-15
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...    84   1e-14
emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]    84   1e-14
emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsi...    84   1e-14

>gb|ABI34329.1| Integrase core domain containing protein [Solanum demissum]
          Length = 1775

 Score = 84.7 bits (208), Expect = 7e-15
 Identities = 44/93 (47%), Positives = 57/93 (61%)
 Frame = -1

Query: 480  LDFFETFCTCGQNVNSKSSLSLATVSQWSVTQLDVTNAFLYGELEEEVYMSIPHG*CLPA 301
            LD+ +TF    +  + +  LS+A V  W + QLD+ NAFL+G+LEEEVYM  P G     
Sbjct: 949  LDYSDTFAPVAKIASVRLFLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPG----- 1003

Query: 300  EFATYSSSVPLVCKLIKSLYGMCQSPRKWFLKF 202
             F     S  LVC+L +SLYG+ QSPR WF KF
Sbjct: 1004 -FVAQGESSSLVCRLRRSLYGLKQSPRAWFGKF 1035



 Score = 63.2 bits (152), Expect = 2e-08
 Identities = 31/73 (42%), Positives = 43/73 (58%)
 Frame = -2

Query: 224  HENGFSNSDADWGGCPLTRQSLTGYCVTXXXXXXXXXXXKQHMVSRSSAEAEY*ALADVC 45
            HE+    +DADW G P  R+S +GYCV            KQ++V+RSSAE+EY A+A   
Sbjct: 1225 HEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNVVARSSAESEYRAMATAT 1284

Query: 44   CEINWLLNMFHEL 6
            CE+ W+  +  EL
Sbjct: 1285 CELVWIKQLLGEL 1297


>gb|ABI34342.1| Polyprotein, putative [Solanum demissum]
          Length = 1054

 Score = 84.7 bits (208), Expect = 7e-15
 Identities = 44/93 (47%), Positives = 57/93 (61%)
 Frame = -1

Query: 480 LDFFETFCTCGQNVNSKSSLSLATVSQWSVTQLDVTNAFLYGELEEEVYMSIPHG*CLPA 301
           LD+ +TF    +  + +  LS+A V  W + QLD+ NAFL+G+LEEEVYM  P G     
Sbjct: 710 LDYSDTFAPVAKIASIRLFLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPG----- 764

Query: 300 EFATYSSSVPLVCKLIKSLYGMCQSPRKWFLKF 202
            F     S  LVC+L +SLYG+ QSPR WF KF
Sbjct: 765 -FVAQGESSSLVCRLRRSLYGLKQSPRAWFGKF 796


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana]
          Length = 1633

 Score = 84.0 bits (206), Expect = 1e-14
 Identities = 41/90 (45%), Positives = 60/90 (66%)
 Frame = -1

Query: 480  LDFFETFCTCGQNVNSKSSLSLATVSQWSVTQLDVTNAFLYGELEEEVYMSIPHG*CLPA 301
            +D+ ETF    +  + K  L LA  + WS+TQ+DV+NAFL+GEL+EE+YMS+P G   P 
Sbjct: 1014 IDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPT 1073

Query: 300  EFATYSSSVPLVCKLIKSLYGMCQSPRKWF 211
              +  S     VC+L+KSLYG+ Q+ R+W+
Sbjct: 1074 GISLPSKP---VCRLLKSLYGLKQASRQWY 1100



 Score = 60.5 bits (145), Expect = 1e-07
 Identities = 31/76 (40%), Positives = 42/76 (55%)
 Frame = -2

Query: 233  VSHHENGFSNSDADWGGCPLTRQSLTGYCVTXXXXXXXXXXXKQHMVSRSSAEAEY*ALA 54
            V  +  G    DADWG C  +R+S+TG+C+            KQ +VSRSS E+EY +LA
Sbjct: 1273 VLRYLKGNPGQDADWGTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLA 1332

Query: 53   DVCCEINWLLNMFHEL 6
               CEI WL  +  +L
Sbjct: 1333 QATCEIIWLQQLLKDL 1348


>emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]
          Length = 970

 Score = 84.0 bits (206), Expect = 1e-14
 Identities = 45/93 (48%), Positives = 59/93 (63%)
 Frame = -1

Query: 480 LDFFETFCTCGQNVNSKSSLSLATVSQWSVTQLDVTNAFLYGELEEEVYMSIPHG*CLPA 301
           +D+ +TF    + V  K  L+LA V  WS+TQLDV N FL+G+L E+VYMS+P G     
Sbjct: 626 VDYLDTFSPVAKLVTVKVLLTLAAVHGWSLTQLDVNNTFLHGDLHEKVYMSLPPGLYHEG 685

Query: 300 EFATYSSSVPLVCKLIKSLYGMCQSPRKWFLKF 202
           E    S  +  VCKL KSLYG+ Q+ R+WF KF
Sbjct: 686 E----SLPINTVCKLHKSLYGLKQASRQWFSKF 714



 Score = 67.0 bits (162), Expect = 1e-09
 Identities = 29/66 (43%), Positives = 44/66 (66%)
 Frame = -2

Query: 203  SDADWGGCPLTRQSLTGYCVTXXXXXXXXXXXKQHMVSRSSAEAEY*ALADVCCEINWLL 24
            +D+DW  CP T++S++G+CV            KQH VSRSSAEAEY ++A+  CE+ W+ 
Sbjct: 834  ADSDWAACPDTKRSISGFCVFIGDSLVSWKSKKQHTVSRSSAEAEYRSMANATCELMWMF 893

Query: 23   NMFHEL 6
            ++F +L
Sbjct: 894  SLFKDL 899


>emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsis thaliana]
           gi|7267797|emb|CAB81200.1| putative retrotransposon
           polyprotein [Arabidopsis thaliana]
          Length = 1203

 Score = 84.0 bits (206), Expect = 1e-14
 Identities = 41/90 (45%), Positives = 60/90 (66%)
 Frame = -1

Query: 480 LDFFETFCTCGQNVNSKSSLSLATVSQWSVTQLDVTNAFLYGELEEEVYMSIPHG*CLPA 301
           +D+ ETF    +  + K  L LA  + WS+TQ+DV+NAFL+GEL+EE+YMS+P G   P 
Sbjct: 600 IDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPT 659

Query: 300 EFATYSSSVPLVCKLIKSLYGMCQSPRKWF 211
             +  S     VC+L+KSLYG+ Q+ R+W+
Sbjct: 660 GISLPSKP---VCRLLKSLYGLKQASRQWY 686



 Score = 62.8 bits (151), Expect = 3e-08
 Identities = 33/71 (46%), Positives = 43/71 (60%)
 Frame = -2

Query: 218  NGFSNSDADWGGCPLTRQSLTGYCVTXXXXXXXXXXXKQHMVSRSSAEAEY*ALADVCCE 39
            NGFS  DADWG C  +R+S+TG+C+            KQ +VSRSS E+EY +LA   CE
Sbjct: 882  NGFS--DADWGTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLAQATCE 939

Query: 38   INWLLNMFHEL 6
            I WL  +  +L
Sbjct: 940  IIWLQQLLKDL 950

