BLASTX nr result

ID: Angelica22_contig00022637

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00022637
         (1892 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN59949.1| hypothetical protein VITISV_043423 [Vitis vinifera]   201   9e-75
dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis t...   214   3e-71
emb|CAN81016.1| hypothetical protein VITISV_025518 [Vitis vinifera]   233   3e-70
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...   197   5e-70
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   196   5e-70

>emb|CAN59949.1| hypothetical protein VITISV_043423 [Vitis vinifera]
          Length = 1059

 Score =  201 bits (512), Expect(2) = 9e-75
 Identities = 160/492 (32%), Positives = 233/492 (47%), Gaps = 29/492 (5%)
 Frame = +1

Query: 502  SITAYFTVFRELMDELDTLSPIPRCICTNSNCACGNSQKIDQYEQMNKLSQFLMGLNDQF 681
            ++ AY+T  ++L DEL + +        ++ C+CG   K        +L QFLMGLN+ +
Sbjct: 143  TVXAYYTRLKKLWDELGSYN--------DTVCSCGADHK------RRRLMQFLMGLNESY 188

Query: 682  INTRGQILLMSPLPDLGHANDMLLQEENQR---------EFSKEMTGINENMAMNVRLLQ 834
               RGQILLM+PLPD+  A   ++QE  QR         E S  +    E MA+ VR  Q
Sbjct: 189  NAIRGQILLMNPLPDVAKAYSSIVQEXKQRSLGAIRETTENSAMVVQRAEPMALXVRXGQ 248

Query: 835  NSVDKNKNKFQKKSSDSTALCDYCNLTGHTMDKCFALHGYPEWHRMYGQPKPKP-----K 999
             S  ++ N   +KS      C YC+   H  + C+ L+GYP  H  +   +        K
Sbjct: 249  GSSSRS-NPSNRKSLH----CTYCDRDHHVRETCWKLNGYPPEHPKHASNRSNHGSTHFK 303

Query: 1000 FNSRKQTANANVVHGISGVESEASVNTNTNLTEKQCQQIISMLQAQLNTQSSVINASANA 1179
             N+  Q++  NV      V  E    TN  L++ Q QQI+S++Q +  TQS+  N  ANA
Sbjct: 304  RNNSHQSSANNVKE--RPVMQEVPSMTN-GLSDLQIQQILSIMQGKGTTQST--NPKANA 358

Query: 1180 SWTQSNDITSSISPVAGNTFTSHTNSAILVGQVQFHNPTNIWIIDSGATNHITPYLSLLA 1359
            +                        S +L   +  H      IIDSGAT+HIT   +LL 
Sbjct: 359  A-----------------------ASGLLQTLLHLHR----LIIDSGATDHITSSPTLLV 391

Query: 1360 NVE--TLNSELHLPNGQSTQVTHVGDIALTSELSLKKVLYVPQFQCNLLYISKFAKDNAC 1533
            N    T    + +P+G+   +T +G++ L S  +LK VL VP F+ +L+ +S+  KD  C
Sbjct: 392  NSRKNTFLPPVAMPSGEQAPITSIGNLPLNSAATLKNVLGVPSFKVDLMSVSRVTKDLNC 451

Query: 1534 TIHFSASKCIMQDHALQRMKEIGELDDGMY----------KLHTDKVLYSSTMTVVHD-T 1680
            ++ F    CI+QD   +    +GE  DG+Y          K  T     +S  +     T
Sbjct: 452  SVTFFPHWCILQDLTTRTTIGLGEQRDGLYYLVALASEKPKTQTPSXXATSCRSPSSQVT 511

Query: 1681 NQVLQWHNRLGHPSSVVLSHIPS--LSSTDTSIITCDICHKSKQHRPSFSNSTSKSLAYF 1854
            +    WH RLGH SS  L  +    L+    S   CD+C  +KQ R  FS S+  S+  F
Sbjct: 512  SSTALWHXRLGHLSSSRLDFMAKHLLNFPFQSNNACDVCXLAKQRRLPFSVSSISSVRPF 571

Query: 1855 NLIHCDLWGPYR 1890
             LIHCD WGPY+
Sbjct: 572  ELIHCDXWGPYK 583



 Score =  107 bits (267), Expect(2) = 9e-75
 Identities = 48/116 (41%), Positives = 76/116 (65%), Gaps = 2/116 (1%)
 Frame = +2

Query: 164 IDINHPYYLSSSDHPGLALVTEVLTEQNYHHWSRSIKIALSAKLKLGFIDGTQLKPAITS 343
           +D  +PY+L  SDHPG+ LV++ L   NY  W R++ I+L+AK KLGFIDGT   P+ T 
Sbjct: 12  LDAANPYFLHHSDHPGMVLVSKPLNGDNYSTWCRAMTISLNAKSKLGFIDGTTTMPSATD 71

Query: 344 --SQYVLSMCSNDLVISWLLNSISTEIRKSVVYMQTTKQIWDDLAARFSQTKVPSL 505
              ++      ND+++SW+LNS+S ++  SV++  T +++W+DL   FSQ+  P +
Sbjct: 72  KPBEHASWKKCNDMILSWILNSLSQDLADSVIFSTTAQEVWEDLRDHFSQSNAPRI 127


>dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1475

 Score =  214 bits (546), Expect(2) = 3e-71
 Identities = 149/488 (30%), Positives = 234/488 (47%), Gaps = 23/488 (4%)
 Frame = +1

Query: 493  GTKSITAYFTVFRELMDELDTLSPIPRCICTNSNCAC--GNSQKIDQYEQMNKLSQFLMG 666
            GT S++ Y+T  + L D+LD  S    C+ T  NC C    +  I+     +K+ +FL G
Sbjct: 184  GTMSLSDYYTALKTLWDDLDGAS----CVSTCKNCTCCIATASMIEH----SKIVKFLSG 235

Query: 667  LNDQFINTRGQILLMSPLPDLGHANDMLLQEENQREFSKEMTGINENMAMNVRLLQNS-V 843
            LN+ +   R QI++   +PDL    ++L Q+ +QR      T  +     NV   Q+   
Sbjct: 236  LNESYSTIRSQIIMKKTIPDLAEIYNLLDQDHSQRNIVTMPTNAS---TFNVSAPQSDQF 292

Query: 844  DKNKNKFQKKSSDSTALCDYCNLTGHTMDKCFALHGYPEWHRMYGQPKPKPKFNSRKQTA 1023
              N  K           C +C  TGH  D C+ +HGYP   +   +    P    +   A
Sbjct: 293  AVNLAKSFGTQPKPKVQCSHCGYTGHNADTCYKIHGYPVGFKHKDKKTVTPSEKPKSVVA 352

Query: 1024 NANVVHG----ISGVESEASVNTNTNLTEKQCQQIISMLQAQLNTQSSVINASANASWTQ 1191
            N  +  G      G+  +  V    ++++ Q Q +I+    QL+  +  I  ++ AS   
Sbjct: 353  NLALTDGKVSVTQGIGPDGIVELVGSMSKSQIQDVIAYFSTQLHNPAKPITVASFASTNN 412

Query: 1192 SNDITS---SISPVAGNTFTSHTNSAILVGQVQFHNPTNIWIIDSGATNHITPYLSLLAN 1362
             N  T    S SP       S T+S  ++         N WIIDSGAT+H++   +L  +
Sbjct: 413  DNGSTFTGISFSPSTLRLLCSLTSSKKVLS-------LNTWIIDSGATHHVSYDRNLFES 465

Query: 1363 V-ETLNSELHLPNGQSTQVTHVGDIALTSELSLKKVLYVPQFQCNLLYISKFAKDNACTI 1539
            + + L++E+ LP G + ++  +G I L S L+LK VLY+P+F+ NLL +S+  KD  C I
Sbjct: 466  LSDGLSNEVTLPTGSNVKIAGIGVIKLNSNLTLKNVLYIPEFRLNLLSVSQQTKDMKCKI 525

Query: 1540 HFSASKCIMQDHALQRMKEIGELDDGMYKLHTDKVL-----YSSTMTVVHDTNQVLQ--- 1695
            +F    C++QD   ++    G    G+Y L T  V       +S++T     N V+    
Sbjct: 526  YFDEDCCVIQDPIKEQKIGRGNQIGGLYVLDTSSVECTSVDINSSVTEKQYCNAVVDSAL 585

Query: 1696 WHNRLGHPS---SVVLSHIPSLSSTD-TSIITCDICHKSKQHRPSFSNSTSKSLAYFNLI 1863
            WH+RLGHPS   + VL  +  L   +   ++ C IC K+KQ   SF +  + S   F+LI
Sbjct: 586  WHSRLGHPSYEKNDVLHDVLGLPKRNKEDLVHCSICQKAKQKHLSFPSKNNMSENKFDLI 645

Query: 1864 HCDLWGPY 1887
            H D WGP+
Sbjct: 646  HIDTWGPF 653



 Score = 82.4 bits (202), Expect(2) = 3e-71
 Identities = 40/107 (37%), Positives = 63/107 (58%)
 Frame = +2

Query: 179 PYYLSSSDHPGLALVTEVLTEQNYHHWSRSIKIALSAKLKLGFIDGTQLKPAITSSQYVL 358
           P+ LS+ D PG  LV+EVL   N+  W  ++ ++L AK K+ F+DGT  +P  +   + +
Sbjct: 63  PFALSNGDSPGNTLVSEVLDGTNFSSWKIAMFVSLYAKNKIAFVDGTLPRPPESDPSFRV 122

Query: 359 SMCSNDLVISWLLNSISTEIRKSVVYMQTTKQIWDDLAARFSQTKVP 499
               N +V SW+LNS++ +I KS++      +IW DL  RF  T +P
Sbjct: 123 WSRCNSMVKSWILNSVTKQIYKSILRFNDAAEIWKDLDTRFHITNLP 169


>emb|CAN81016.1| hypothetical protein VITISV_025518 [Vitis vinifera]
          Length = 1461

 Score =  233 bits (593), Expect(2) = 3e-70
 Identities = 151/473 (31%), Positives = 236/473 (49%), Gaps = 7/473 (1%)
 Frame = +1

Query: 493  GTKSITAYFTVFRELMDELDTLSPIPRCICTNSNCACGNSQKIDQYEQMNKLSQFLMGLN 672
            G +SI+ Y+T  +   DEL +   +        +C+CG  +K+ + ++  ++ QFLMGLN
Sbjct: 176  GQQSISVYYTKLKAFWDELSSYHEV-------LSCSCGGLEKLKERDEKERVMQFLMGLN 228

Query: 673  DQFINTRGQILLMSPLPDLGHANDMLLQEENQREFSKEMTGINENMAMNVRLLQNSVDKN 852
            D +   RGQILLM PLPD      ++LQ+E Q E S     +N     +  +L +  +K 
Sbjct: 229  DSYAAIRGQILLMHPLPDTRRVYSLVLQQEKQVEVS-----LNNGNKNHYAMLADRDNKA 283

Query: 853  KNKFQKKSSDSTALCDYCNLTGHTMDKCFALHGYPEWHRMYGQPKPKPKFNSRKQTANAN 1032
             +  Q +   +   C YC+   H+++KC+ LHG+P  H+++G+    P  N R   AN  
Sbjct: 284  TSAHQVQKQKTPLHCSYCDRDYHSIEKCYYLHGFPIGHKLHGKNVKPP--NQRHSNANNV 341

Query: 1033 VVHGISGVESEASV---NTNTNLTEKQCQQIISMLQAQLNTQSSVINASANASWTQSNDI 1203
             V     VE+EA +   N    LT ++  Q+++M++          N   +  +  +  I
Sbjct: 342  KVETNKAVETEAKLLPTNDGPRLTTEEYNQLMAMIRKN--------NGGNSQHFANATGI 393

Query: 1204 TSSISPVAGNTFTSHTNSAILVGQVQFHNPTNIWIIDSGATNHITPYLSLLANVE-TLNS 1380
              S S +  N    H+N                WIIDSGAT+H+T    LL        +
Sbjct: 394  NMSSSKIIPN--CPHSNMC--------------WIIDSGATDHVTSSAELLDPKNLPKTT 437

Query: 1381 ELHLPNGQSTQVTHVGDIALTSELSLKKVLYVPQFQCNLLYISKFAKDNACTIHFSASKC 1560
             + LPNG    +  +G + +T  + L  VL VPQFQ NLL +SK  +   C + F    C
Sbjct: 438  TISLPNGGQAHIESIGSLHVTPHIKLDDVLKVPQFQVNLLSVSKLTRALQCIVMFFFDFC 497

Query: 1561 IMQDHALQRMKEIGELDDGMYKLHTDKVLYSSTMTVVHDTNQVLQWHNRLGHPSS---VV 1731
            ++QD   ++   +G+  +G+Y L  D+    +    +H  + +  WH RLGHPSS    V
Sbjct: 498  VVQDATTRKTIGLGKQHNGLYYLAQDQ--NPALAYAIHKHSDL--WHQRLGHPSSGPLQV 553

Query: 1732 LSHIPSLSSTDTSIITCDICHKSKQHRPSFSNSTSKSLAYFNLIHCDLWGPYR 1890
            L+ +      D+  + CDIC  +KQ R SF +S   S A F+LIHCD+WGP+R
Sbjct: 554  LAKVNPKIYFDSKHV-CDICPLAKQTRLSFPSSFISSHAPFDLIHCDIWGPHR 605



 Score = 61.2 bits (147), Expect(2) = 3e-70
 Identities = 28/67 (41%), Positives = 43/67 (64%)
 Frame = +2

Query: 179 PYYLSSSDHPGLALVTEVLTEQNYHHWSRSIKIALSAKLKLGFIDGTQLKPAITSSQYVL 358
           P+ L  SDHPG+ LV++V+   NY  WSR+++I+LSAK K+GF+ G+   P+ T   +  
Sbjct: 77  PFSLHHSDHPGMVLVSKVIEGDNYSTWSRAMRISLSAKDKIGFVTGSIKPPSSTDDSFPS 136

Query: 359 SMCSNDL 379
               ND+
Sbjct: 137 WQRCNDM 143


>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score =  197 bits (502), Expect(2) = 5e-70
 Identities = 142/502 (28%), Positives = 221/502 (44%), Gaps = 38/502 (7%)
 Frame = +1

Query: 493  GTKSITAYFTVFRELMDELDTLSPIPRCICTNSNCACGNSQKIDQYEQMNKLSQFLMGLN 672
            GT ++T YFT  R + DE++   P P C C N  C+C     I Q +  ++  QFL GLN
Sbjct: 480  GTLTVTEYFTCLRVIWDEIENFRPDPICSC-NIRCSCNAFTIIAQRKLEDRAMQFLRGLN 538

Query: 673  DQFINTRGQILLMSPLPDLGHANDMLLQEENQR------EFSKEMTGINENMAMNVRLLQ 834
            +Q+ N R  +LLM P+P +      + Q+E Q         + E   I+ N A  V    
Sbjct: 539  EQYANIRSHVLLMDPIPTISKIFSYVAQQERQLLGNTGPGINFEPKDISINAAKTVCDFC 598

Query: 835  NSVDKNKNKFQKK-----------SSDSTALCDYCNLTGHTMDKCFALHGYPEWHRMYGQ 981
              +   ++   KK            S+    C +C   GHT+D C+  HGYP  ++ Y  
Sbjct: 599  GRIGHVESTCYKKHGVPSNYDARNKSNGRKACTHCGKIGHTVDVCYRKHGYPPGYKPY-- 656

Query: 982  PKPKPKFNSRKQTANANVVHGISGVESEASVNTNTNLTEKQCQQIISMLQAQLNTQSSVI 1161
                         +    V+ +  VES+A   T+      +  + +     Q     ++I
Sbjct: 657  -------------SGRTTVNNVVAVESKA---TDDQAQHHESHEFVRFSPEQYKALLALI 700

Query: 1162 NASANASWTQSNDITSSISPVAGNTFTSHTNSAILVGQVQFHNPTN------------IW 1305
                               P AGNT  +       +     +NPTN             W
Sbjct: 701  Q-----------------EPSAGNTALTQPKQVASISSCTVNNPTNPGMSLSLSASLTSW 743

Query: 1306 IIDSGATNHITPYLSLLANVETLNS-ELHLPNGQSTQVTHVGDIALTSELSLKKVLYVPQ 1482
            I+DSGAT+H+T  L  L + + +N   + LPNGQ    TH G + L+S ++L  VLY+P 
Sbjct: 744  ILDSGATDHVTCSLHNLHSHKRINPITVKLPNGQYVHATHSGTVQLSSNITLHDVLYIPS 803

Query: 1483 FQCNLLYISKFAKDNACTIHFSASKCIMQDHALQRMKEIGELDDGMYKLHTDKVLYSS-T 1659
            F  NL+ ISK      C + FS++ C++Q+        I E   G+Y L  +++   +  
Sbjct: 804  FTFNLISISKLVSSINCELIFSSTSCVLQEMNNHMKIGIVEAKHGLYHLIPNQLTTKAVN 863

Query: 1660 MTVVHDTNQVLQ---WHNRLGHPSS----VVLSHIPSLSSTDTSIITCDICHKSKQHRPS 1818
             T+ H    V+    WH RLGHPS+     + ++ P L +    +  C+ CH +K  +  
Sbjct: 864  STITHPRCNVIPIDLWHFRLGHPSAERIQCMKTYYPLLRNNKNFV--CNTCHYAKHKKMP 921

Query: 1819 FSNSTSKSLAYFNLIHCDLWGP 1884
            FS S S +   F+L+H D+ GP
Sbjct: 922  FSLSNSHASHAFDLLHMDIRGP 943



 Score = 95.5 bits (236), Expect(2) = 5e-70
 Identities = 45/101 (44%), Positives = 69/101 (68%)
 Frame = +2

Query: 185 YLSSSDHPGLALVTEVLTEQNYHHWSRSIKIALSAKLKLGFIDGTQLKPAITSSQYVLSM 364
           YL  S++P  ALV+ VL   NYH WSRS+  ALSAK K+ FIDG+  +P  T   +    
Sbjct: 361 YLHPSENPATALVSPVLDSTNYHSWSRSMVTALSAKNKVEFIDGSAPEPLKTDRMHGAWC 420

Query: 365 CSNDLVISWLLNSISTEIRKSVVYMQTTKQIWDDLAARFSQ 487
             N++V+SW+++S++T IR+S+++M   ++IW DL +R+SQ
Sbjct: 421 RCNNMVVSWIVHSVATSIRQSILWMDKAEEIWRDLKSRYSQ 461


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1454

 Score =  196 bits (499), Expect(2) = 5e-70
 Identities = 142/486 (29%), Positives = 239/486 (49%), Gaps = 21/486 (4%)
 Frame = +1

Query: 493  GTKSITAYFTVFRELMDELDTLSPIPRCICTNSNCACGNSQKIDQYEQMNKLSQFLMGLN 672
            GT S++ Y+T  + L D+LD+   +      +  C CG + ++ Q  +  K+ +FL GLN
Sbjct: 182  GTLSLSEYYTRLKTLWDQLDSTEAL------DEPCTCGKAMRLQQKAEQAKIVKFLAGLN 235

Query: 673  DQFINTRGQILLMSPLPDLGHANDMLLQEENQREFSK--------EMTGINENMAMN--V 822
            + +   R QI+    LP LG    +L Q+ +Q+ FS         +++ I ++ +M+  V
Sbjct: 236  ESYAIVRRQIIAKKALPSLGEVYHILDQDNSQQSFSNVVAPPAAFQVSEITQSPSMDPTV 295

Query: 823  RLLQNSVDKNKNKFQKKSSDSTALCDYCNLTGHTMDKCFALHGYPEWHRMYGQP-----K 987
              +QN  +K +            +C + N  GH  ++C+  HG+P      G+      K
Sbjct: 296  CYVQNGPNKGR-----------PICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQK 344

Query: 988  PKPKFNSRKQTANANVVHGISGVESEASVNTNTNLTEKQCQQIISMLQAQLNTQSSVINA 1167
            PKP   +  +++  N     + +ES        NL+++Q QQ I+M  +QL  Q++  + 
Sbjct: 345  PKPLAANVAESSEVN-----TSLESMVG-----NLSKEQLQQFIAMFSSQL--QNTPPST 392

Query: 1168 SANASWTQSNDITSSISPVAGNTFTSHTNSAILVGQVQFHNPTNI-WIIDSGATNHITPY 1344
             A AS +QS+++        G  F+  T S I +  V  H  ++  W+IDSGAT+H++  
Sbjct: 393  YATASTSQSDNL--------GICFSPSTYSFIGILTVARHTLSSATWVIDSGATHHVSHD 444

Query: 1345 LSLLANVET-LNSELHLPNGQSTQVTHVGDIALTSELSLKKVLYVPQFQCNLLYISKFAK 1521
             SL ++++T + S ++LP G + +++ VG + L  ++ LK VL++P+F+ NL+ IS    
Sbjct: 445  RSLFSSLDTSVLSAVNLPTGPTVKISGVGTLKLNDDILLKNVLFIPEFRLNLISISSLTD 504

Query: 1522 DNACTIHFSASKCIMQDHALQRMKEIGELDDGMYKLHTDKVLYSSTMTVVHDTNQVLQWH 1701
            D    + F  + C +QD    RM   G     +Y L       S    V      +  WH
Sbjct: 505  DIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLLDVGDQSISVNAVV-----DISMWH 559

Query: 1702 NRLGHPSSVVLSHI-PSLSST---DTSIITCDICHKSKQHRPSFSNSTSKSLAYFNLIHC 1869
             RLGH S   L  I  SL +T   +     C +CH +KQ + SF  S       F+L+H 
Sbjct: 560  RRLGHASLQRLDAISDSLGTTRHKNKGSDFCHVCHLAKQRKLSFPTSNKVCKEIFDLLHI 619

Query: 1870 DLWGPY 1887
            D+WGP+
Sbjct: 620  DVWGPF 625



 Score = 96.7 bits (239), Expect(2) = 5e-70
 Identities = 47/107 (43%), Positives = 68/107 (63%)
 Frame = +2

Query: 179 PYYLSSSDHPGLALVTEVLTEQNYHHWSRSIKIALSAKLKLGFIDGTQLKPAITSSQYVL 358
           P++L S+DHPGL +++  L E NY  WS ++ I+L AK K GFIDGT  +P  +   + L
Sbjct: 61  PFFLHSADHPGLNIISHRLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLNFRL 120

Query: 359 SMCSNDLVISWLLNSISTEIRKSVVYMQTTKQIWDDLAARFSQTKVP 499
               N +V SWLLNS+S +I +S++ M     IW DL +RF+ T +P
Sbjct: 121 WSRCNSMVKSWLLNSVSPQIYRSILRMNDASDIWRDLNSRFNVTNLP 167

