BLASTX nr result

ID: Angelica22_contig00009954 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00009954
         (2016 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hy...   282   2e-73
dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis t...   259   2e-66
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...   249   3e-63
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   246   2e-62
emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]   238   4e-60

>dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hybrida]
          Length = 463

 Score =  282 bits (722), Expect = 2e-73
 Identities = 155/426 (36%), Positives = 241/426 (56%), Gaps = 30/426 (7%)
 Frame = -3

Query: 1315 DSDNPLHLQNFDSPDMRLVSEVFDGTDFGNWKRSMLIALSARNKLCFVDGSLPKPTLIDP 1136
            D ++P +L + D+P M L++  FDG+ +GNWKR +LI+LSA+NKL F+ G+  KP   D 
Sbjct: 32   DINHPYYLASSDAPGMNLINTSFDGSSYGNWKRGVLISLSAKNKLGFITGAYKKPDKEDL 91

Query: 1135 TYKSWCRCNDMVISWILSALSKSIGRSVIYYSSSHQMWLELEERYGVSKGAQLFGLHKEL 956
             ++ W RC+DMV++W+L++LSK I  SV+Y  ++ ++W ELE+RYG   G ++F L +EL
Sbjct: 92   LFEQWRRCSDMVLAWLLNSLSKEIAESVLYSQTAQELWQELEQRYGQIDGTKMFQLQREL 151

Query: 955  TEVSQGNHNISTYFTKIKMLWDDIDSLCLLPVCTCGCKCGATSKLVQFQQDQRVIQFLMG 776
              VSQG ++++ YF K+K +WD +  L    VC+C C C A     + Q+DQ++IQFLMG
Sbjct: 152  NNVSQGTNDVAAYFNKLKRIWDQMKVLNTFMVCSCECNCEAKGHNAKMQEDQQLIQFLMG 211

Query: 775  LNESYGITRGSILMRSPLPTLGHVYSLLLQEEAQREINFTSHFIADSSSLNVHSSKPTSQ 596
            LNE Y   RG+ILM  PLP+    YS++  EE QR I   ++   DS++ N  +    +Q
Sbjct: 212  LNEVYSGIRGNILMMKPLPSTAQAYSIISHEETQRGIAAGNNVSTDSAAFNASTQNWNNQ 271

Query: 595  FNKFKPSAGD--------------VKKSSIHCNY*KKPGHLIDKCYKLHGFPSDFK---F 467
             N +   A +                +++ +C Y +  GH+ ++C KL+G  +  +    
Sbjct: 272  RNNYDNRARNQNYNNRRNNYEGRRSNQNNSYCTYCRTSGHVREECRKLNGRDNRNRRPAG 331

Query: 466  TKTKKIAASVEGPSTDTDGST-----ASTVGLPVITPEFCTQLLQMLK-------TQACF 323
                +  A+ EG +   +G+T     A +V     T E C QL+QML+       T +C 
Sbjct: 332  NPNTQANAAYEGNTMQRNGNTPENTSAGSVNAQGFTKEQCEQLIQMLQSAHSISSTASCS 391

Query: 322  TLWLNSFFCKFAGTFTAS-THIACFTTSNKTDWIIASGASDHMCSNKSLFSTFCTLPQNL 146
               +N+    F G + A+ T+  CF+T     WII SGAS HM  +KS+      LP  +
Sbjct: 392  E--VNNSAANFVGKYAANHTYTNCFSTLTTNFWIIDSGASQHMTHDKSILHNIRELPVPI 449

Query: 145  NISLPN 128
             ++LPN
Sbjct: 450  LVNLPN 455


>dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1475

 Score =  259 bits (661), Expect = 2e-66
 Identities = 157/466 (33%), Positives = 250/466 (53%), Gaps = 33/466 (7%)
 Frame = -3

Query: 1306 NPLHLQNFDSPDMRLVSEVFDGTDFGNWKRSMLIALSARNKLCFVDGSLPKPTLIDPTYK 1127
            +P  L N DSP   LVSEV DGT+F +WK +M ++L A+NK+ FVDG+LP+P   DP+++
Sbjct: 62   SPFALSNGDSPGNTLVSEVLDGTNFSSWKIAMFVSLYAKNKIAFVDGTLPRPPESDPSFR 121

Query: 1126 SWCRCNDMVISWILSALSKSIGRSVIYYSSSHQMWLELEERYGVSKGAQLFGLHKELTEV 947
             W RCN MV SWIL++++K I +S++ ++ + ++W +L+ R+ ++   + + L +++  +
Sbjct: 122  VWSRCNSMVKSWILNSVTKQIYKSILRFNDAAEIWKDLDTRFHITNLPRSYQLTQQIWSL 181

Query: 946  SQGNHNISTYFTKIKMLWDDIDSLCLLPVC-TCGCKCGATSKLVQFQQDQRVIQFLMGLN 770
             QG  ++S Y+T +K LWDD+D    +  C  C C C AT+ ++   +  ++++FL GLN
Sbjct: 182  QQGTMSLSDYYTALKTLWDDLDGASCVSTCKNCTC-CIATASMI---EHSKIVKFLSGLN 237

Query: 769  ESYGITRGSILMRSPLPTLGHVYSLLLQEEAQREINFTSHFIADSSSLNVHSSKPTSQFN 590
            ESY   R  I+M+  +P L  +Y+LL Q+ +QR I        ++S+ NV + +      
Sbjct: 238  ESYSTIRSQIIMKKTIPDLAEIYNLLDQDHSQRNI---VTMPTNASTFNVSAPQSDQFAV 294

Query: 589  KFKPSAGDVKKSSIHCNY*KKPGHLIDKCYKLHGFPSDFKFTKTKKIAASVEGPSTD--- 419
                S G   K  + C++    GH  D CYK+HG+P  FK  K KK     E P +    
Sbjct: 295  NLAKSFGTQPKPKVQCSHCGYTGHNADTCYKIHGYPVGFKH-KDKKTVTPSEKPKSVVAN 353

Query: 418  ---TDGSTASTVG-----------------LPVITPEFCTQLLQMLK--TQACFTLWLNS 305
               TDG  + T G                 +  +   F TQL    K  T A F    N 
Sbjct: 354  LALTDGKVSVTQGIGPDGIVELVGSMSKSQIQDVIAYFSTQLHNPAKPITVASFASTNND 413

Query: 304  FFCKFAG-TFTAST-HIACFTTSNK-----TDWIIASGASDHMCSNKSLFSTFCTLPQNL 146
                F G +F+ ST  + C  TS+K       WII SGA+ H+  +++LF +      N 
Sbjct: 414  NGSTFTGISFSPSTLRLLCSLTSSKKVLSLNTWIIDSGATHHVSYDRNLFESLSDGLSN- 472

Query: 145  NISLPNGQIIAITHAGTVPLLKDIILYNVLFVPLFKYNLISIPKLT 8
             ++LP G  + I   G + L  ++ L NVL++P F+ NL+S+ + T
Sbjct: 473  EVTLPTGSNVKIAGIGVIKLNSNLTLKNVLYIPEFRLNLLSVSQQT 518


>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score =  249 bits (635), Expect = 3e-63
 Identities = 144/462 (31%), Positives = 246/462 (53%), Gaps = 32/462 (6%)
 Frame = -3

Query: 1300 LHLQNFDSPDMRLVSEVFDGTDFGNWKRSMLIALSARNKLCFVDGSLPKPTLIDPTYKSW 1121
            L+L   ++P   LVS V D T++ +W RSM+ ALSA+NK+ F+DGS P+P   D  + +W
Sbjct: 360  LYLHPSENPATALVSPVLDSTNYHSWSRSMVTALSAKNKVEFIDGSAPEPLKTDRMHGAW 419

Query: 1120 CRCNDMVISWILSALSKSIGRSVIYYSSSHQMWLELEERYGVSKGAQLFGLHKELTEVSQ 941
            CRCN+MV+SWI+ +++ SI +S+++   + ++W +L+ RY      ++  L +E + + Q
Sbjct: 420  CRCNNMVVSWIVHSVATSIRQSILWMDKAEEIWRDLKSRYSQGDLLRISDLQQEASTMKQ 479

Query: 940  GNHNISTYFTKIKMLWDDIDSLCLLPVCTCG--CKCGATSKLVQFQQDQRVIQFLMGLNE 767
            G   ++ YFT ++++WD+I++    P+C+C   C C A + + Q + + R +QFL GLNE
Sbjct: 480  GTLTVTEYFTCLRVIWDEIENFRPDPICSCNIRCSCNAFTIIAQRKLEDRAMQFLRGLNE 539

Query: 766  SYGITRGSILMRSPLPTLGHVYSLLLQEEAQREINFTSHFIADSSSLNVHSSKPTSQF-- 593
             Y   R  +L+  P+PT+  ++S + Q+E Q   N       +   ++++++K    F  
Sbjct: 540  QYANIRSHVLLMDPIPTISKIFSYVAQQERQLLGNTGPGINFEPKDISINAAKTVCDFCG 599

Query: 592  -----------------NKFKPSAGDVKKSSIHCNY*KKPGHLIDKCYKLHGFPSDFK-- 470
                             N    +  + +K+  HC    K GH +D CY+ HG+P  +K  
Sbjct: 600  RIGHVESTCYKKHGVPSNYDARNKSNGRKACTHCG---KIGHTVDVCYRKHGYPPGYKPY 656

Query: 469  -FTKTKKIAASVEGPSTDTDGSTASTVGLPVITPEFCTQLLQMLKTQACFTLWLNSFFCK 293
                T     +VE  +TD       +      +PE    LL +++  +     L     K
Sbjct: 657  SGRTTVNNVVAVESKATDDQAQHHESHEFVRFSPEQYKALLALIQEPSAGNTALTQ--PK 714

Query: 292  FAGTFTAST-------HIACFTTSNKTDWIIASGASDHM-CSNKSLFSTFCTLPQNLNIS 137
               + ++ T        ++   +++ T WI+ SGA+DH+ CS  +L S     P  + + 
Sbjct: 715  QVASISSCTVNNPTNPGMSLSLSASLTSWILDSGATDHVTCSLHNLHSHKRINP--ITVK 772

Query: 136  LPNGQIIAITHAGTVPLLKDIILYNVLFVPLFKYNLISIPKL 11
            LPNGQ +  TH+GTV L  +I L++VL++P F +NLISI KL
Sbjct: 773  LPNGQYVHATHSGTVQLSSNITLHDVLYIPSFTFNLISISKL 814


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1454

 Score =  246 bits (628), Expect = 2e-62
 Identities = 152/466 (32%), Positives = 246/466 (52%), Gaps = 31/466 (6%)
 Frame = -3

Query: 1312 SDNPLHLQNFDSPDMRLVSEVFDGTDFGNWKRSMLIALSARNKLCFVDGSLPKPTLIDPT 1133
            + +P  L + D P + ++S   D T++G+W  +MLI+L A+NK  F+DG+L +P   D  
Sbjct: 58   TQSPFFLHSADHPGLNIISHRLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLN 117

Query: 1132 YKSWCRCNDMVISWILSALSKSIGRSVIYYSSSHQMWLELEERYGVSKGAQLFGLHKELT 953
            ++ W RCN MV SW+L+++S  I RS++  + +  +W +L  R+ V+   + + L +E+ 
Sbjct: 118  FRLWSRCNSMVKSWLLNSVSPQIYRSILRMNDASDIWRDLNSRFNVTNLPRTYNLTQEIQ 177

Query: 952  EVSQGNHNISTYFTKIKMLWDDIDSLCLLPVCTCGCKCGATSKLVQFQQDQRVIQFLMGL 773
            +  QG  ++S Y+T++K LWD +DS   L      C CG   +L Q  +  ++++FL GL
Sbjct: 178  DFRQGTLSLSEYYTRLKTLWDQLDSTEALDE---PCTCGKAMRLQQKAEQAKIVKFLAGL 234

Query: 772  NESYGITRGSILMRSPLPTLGHVYSLLLQEEAQREINFTSHFIADSSSLNVH------SS 611
            NESY I R  I+ +  LP+LG VY +L Q+ +Q+     S+ +A  ++  V       S 
Sbjct: 235  NESYAIVRRQIIAKKALPSLGEVYHILDQDNSQQSF---SNVVAPPAAFQVSEITQSPSM 291

Query: 610  KPTSQFNKFKPSAGDVKKSSIHCNY*KKPGHLIDKCYKLHGFPSDF--------KFTKTK 455
             PT  + +  P+     K    C++  + GH+ ++CYK HGFP  F        K  K K
Sbjct: 292  DPTVCYVQNGPN-----KGRPICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQKPK 346

Query: 454  KIAASVEGPSTDTDGSTASTVGLPVITPEFCTQLLQMLKTQACFT--------------- 320
             +AA+V   S++ + S  S VG   ++ E   Q + M  +Q   T               
Sbjct: 347  PLAANV-AESSEVNTSLESMVG--NLSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDN 403

Query: 319  --LWLNSFFCKFAGTFTASTHIACFTTSNKTDWIIASGASDHMCSNKSLFSTFCTLPQNL 146
              +  +     F G  T + H     T +   W+I SGA+ H+  ++SLFS+  T   + 
Sbjct: 404  LGICFSPSTYSFIGILTVARH-----TLSSATWVIDSGATHHVSHDRSLFSSLDTSVLSA 458

Query: 145  NISLPNGQIIAITHAGTVPLLKDIILYNVLFVPLFKYNLISIPKLT 8
             ++LP G  + I+  GT+ L  DI+L NVLF+P F+ NLISI  LT
Sbjct: 459  -VNLPTGPTVKISGVGTLKLNDDILLKNVLFIPEFRLNLISISSLT 503


>emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]
          Length = 970

 Score =  238 bits (607), Expect = 4e-60
 Identities = 125/331 (37%), Positives = 193/331 (58%), Gaps = 3/331 (0%)
 Frame = -3

Query: 1315 DSDNPLHLQNFDSPDMRLVSEVFDGTDFGNWKRSMLIALSARNKLCFVDGSLPKPTLIDP 1136
            DS +P  L N D P + LVS    G ++  W R+M++AL+A+NK+ F+DGS+P P   D 
Sbjct: 23   DSTSPYFLHNLDHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPCPESDDL 82

Query: 1135 TYKSWCRCNDMVISWILSALSKSIGRSVIYYSSSHQMWLELEERYGVSKGAQLFGLHKEL 956
             + +W RCN MVISWIL+++ K I  S++Y+ ++  +W +L +R+  S G ++F + K L
Sbjct: 83   LFGTWIRCNSMVISWILNSVHKDIADSLLYFDTAVGIWNDLRDRFCQSNGPRIFQIKKHL 142

Query: 955  TEVSQGNHNISTYFTKIKMLWDDIDSLCLLPVCTCGCKCGATSKLVQFQQDQRVIQFLMG 776
              +SQG+ ++STY+T++K+LWD++     LP C     CG     ++FQQ + V+QFLMG
Sbjct: 143  IALSQGSLDVSTYYTRLKILWDELKGFQPLPECA----CGTMKTWMEFQQQEYVMQFLMG 198

Query: 775  LNESYGITRGSILMRSPLPTLGHVYSLLLQEEAQREINFTSHFIADSSSLNVHSSKPTSQ 596
            LNES+  TR  ILM  PLP +  V+SL+ Q+E Q  IN+  +   DS + N  +S     
Sbjct: 199  LNESFVQTRSQILMMEPLPPIAKVFSLVAQDERQCSINYGLYTPPDSVAANDSNSTVAIS 258

Query: 595  FNKFKPSAGDVKKSSIHCNY*KKPGHLIDKCYKLHGFPSDFKFTKTKKIAASVEGPSTD- 419
              +        + +  HC      GH +DKCYKL+G+P  +KF K+K   A  +   T  
Sbjct: 259  AARLNSKPKKDRPTCSHCGI---LGHTVDKCYKLYGYPPGYKF-KSKNPHAKAQANQTSS 314

Query: 418  --TDGSTASTVGLPVITPEFCTQLLQMLKTQ 332
              T+ S  +   L  ++P  C QL+ +L +Q
Sbjct: 315  RTTEASATADSPLVSLSPAQCQQLIALLSSQ 345


Top