BLASTX nr result

ID: Angelica23_contig00017754 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00017754
         (1491 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|2...   173   1e-40
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...   150   6e-34
ref|XP_002513116.1| pentatricopeptide repeat-containing protein,...   147   5e-33
ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797...   140   1e-30
ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago ...   140   1e-30

>ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|222832370|gb|EEE70847.1|
            predicted protein [Populus trichocarpa]
          Length = 394

 Score =  173 bits (438), Expect = 1e-40
 Identities = 115/375 (30%), Positives = 180/375 (48%), Gaps = 27/375 (7%)
 Frame = +3

Query: 222  NYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXXX 395
            +Y   +  P     +  +P WE+KFC L G +PW KV+ AKKY+YCH N+L W       
Sbjct: 21   SYDYPESPPHSSFVDDGIPSWEKKFCSLIGSVPWRKVVDAKKYMYCHGNILNWDDSAGEE 80

Query: 396  XXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFNPDEAE--- 566
                 ++RF + I  +      PDP+++ID+I WN  IDPE++ DL+++ F PDE +   
Sbjct: 81   AFHNAKKRFWAEINGVSCGISPPDPNLFIDEIKWNAYIDPEVIKDLEQDLFVPDEGDTGG 140

Query: 567  ----------NLSSNEIPDCNNKNKSTLDNPWES-HHLENNVDIKDLAQSWNKWGDSLES 713
                      N  S     C  +N   + NPWES ++ ++++ + D A+SWN+W   +  
Sbjct: 141  KVGRKNKKRRNFVSIPSNGC-YENTDDVKNPWESNNNTQSSLSLIDKAKSWNQWDSDINK 199

Query: 714  KDAMNL----WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQ 881
               +N     WE+    + EA K K WG   NK +GWN   N   +S+   +++ N W  
Sbjct: 200  SSNLNKVDNPWERGFSQESEAVKGKTWGVCGNKSWGWNHSGNHVDQSNDW-NNNSNPWQH 258

Query: 882  GALHFKLPNEKGWGDASKNSCGWNCGNSR--SNEWGNAGNVDSWKPRPGGTSYCRQFTNY 1055
                    N+KGWG+   +S G+N   SR  +N+  ++GN      +  G S  R++ + 
Sbjct: 259  SRQGVDPANDKGWGNLRDSSRGYNQHESRKWNNDCKSSGNGFF---QGSGASKDRKWEDN 315

Query: 1056 GNNSRNSKWLTNQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHEGA-----DFEAH 1220
            G+NS+  K   N   +T+  D   H G    R E         SW+ EGA      +E+ 
Sbjct: 316  GSNSQGWKQWDNYGKNTKGLDFRKHGGGWETRNE--------GSWQREGAHQHITGYEST 367

Query: 1221 QFWGKASPHGQYYRG 1265
            +F G     G  + G
Sbjct: 368  RFQGDGFQTGHSWSG 382


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score =  150 bits (380), Expect = 6e-34
 Identities = 105/308 (34%), Positives = 140/308 (45%), Gaps = 37/308 (12%)
 Frame = +3

Query: 261  QNSSLPWEQKFCLLSGIPWYKVLAAKKYIYCHDNVLKWXXXXXXXXXXXXQERFCSMIYS 440
            QNS   WE++FC   GIPW KV+ AKKYI+ H +VL W            + RF + I  
Sbjct: 17   QNSVPSWEKRFCTSVGIPWGKVVDAKKYIHYHVDVLNWNDLAGEEAFHNAKRRFWAEING 76

Query: 441  LPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFNPDEAENLSSNEIPDCNNKNKSTL 620
            +P     PDPD+YID IDWNP IDPELM +LDKE+F+PDE E        DC  KN ++ 
Sbjct: 77   IPCSISQPDPDIYIDNIDWNPXIDPELMRNLDKEFFSPDEREQ-------DC--KNPASG 127

Query: 621  DNPWESHHLENNVDIKDLAQSWNKWG---DSLESKDAMN--------------------- 728
            DNPWE   L     +KD A +W+KWG     L + D  N                     
Sbjct: 128  DNPWE---LNMPKTLKDRAWAWDKWGGCKTELRNLDKTNSQVSGYATEGHYRKPDGGDSP 184

Query: 729  --LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 902
               W  + +  ++A    +WG ++N+    ++ LN     D +   H  +W  GA+    
Sbjct: 185  WEYWNVQGVLKEKARVRNQWGGNINE----SRNLN----GDDNRWKHSCTWASGAV---- 232

Query: 903  PNEKGWGDASKNSCG-WNCGNSRSNEWGNAGN-VDSWK---------PRPGGTSYCRQFT 1049
              +  WG+   NS   WN      N+  N  N VD+W           R        Q  
Sbjct: 233  -RDDSWGNCEGNSWRMWNEVPKPINQLSNLDNGVDNWNSSCNQANAAQRDNACGGWSQGW 291

Query: 1050 NYGNNSRN 1073
            NY N SRN
Sbjct: 292  NYQNKSRN 299


>ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548127|gb|EEF49619.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1128

 Score =  147 bits (372), Expect = 5e-33
 Identities = 101/342 (29%), Positives = 159/342 (46%), Gaps = 31/342 (9%)
 Frame = +3

Query: 201  HPRRSPPNYYQTKVDPQE----VLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNV 365
            H  R P + Y    +P        ++    WE+KFC L G +PW K++  KK++YCHD V
Sbjct: 13   HQYRDPASSYYNHQEPPPPYPGFAEDGVPSWEKKFCSLIGSVPWQKIVNVKKFMYCHDIV 72

Query: 366  LKWXXXXXXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEY 545
            + W            + RF + I  LP +  LPDPDMY+D I+W+P+IDPEL+ +L++ +
Sbjct: 73   INWNDSAGADAFQNAKNRFWADINRLPCQISLPDPDMYVDDINWHPDIDPELVKELERAF 132

Query: 546  FNPDEAENLSSNEIPDCNNK---------------NKSTLDNPWE-SHHLENNVDIKDLA 677
            F P+E EN   N+  +C NK               N   +  PWE      +NV +++  
Sbjct: 133  FAPEEGEN---NDNVECKNKKARHFLSVPSEGWNRNPDEVRIPWECEDEGGSNVAVEEKT 189

Query: 678  QSWNKWGDSLESKDAMN----LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRE-- 839
            + WN+W  S  S   +N     W       +EA + K WG+  +K  GW+  +++  +  
Sbjct: 190  RGWNQWRISTNSSRNVNNGDTPWVSHFTQGNEAVEGKTWGNCADKLQGWSGSVHNQAKDW 249

Query: 840  SDKHESDHVNSWNQGALHFKLPNEKG-WGDASKNSCGW-NCGN--SRSNEWGNAGNVDSW 1007
               + ++    W    L       KG W ++S    GW + GN  ++S EW + GN   W
Sbjct: 250  GSCNLTNDDKPWGHSYL-------KGTWRESSGKLWGWSHKGNQVNQSKEWDSGGN--PW 300

Query: 1008 KPRPGGTSYCRQFTNYGNNSRNSKWLTNQNIDTEKFDSGIHS 1133
            +    G    +   N   NS +  W  N+ +   KF  G +S
Sbjct: 301  EHSSQGVVLVKD--NVWGNSNHISWGKNKQVG--KFSHGENS 338


>ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797066 [Glycine max]
          Length = 387

 Score =  140 bits (352), Expect = 1e-30
 Identities = 95/312 (30%), Positives = 142/312 (45%), Gaps = 9/312 (2%)
 Frame = +3

Query: 216  PPNYYQTKVDPQEVLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXX 392
            PP +Y       E  Q+    WE+K+C + G +PW K++ +K ++YCH NV  W      
Sbjct: 22   PPTFYDINAPLPEYWQDGIPLWEKKYCTIVGLVPWQKIVDSKMFVYCHSNVFDWNDSAAE 81

Query: 393  XXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFN-PDEAEN 569
                  +  + + I SLP +  LPDPD Y D+IDWNP IDP+++ ++DK +F  PDE + 
Sbjct: 82   EALQNAKNHYWAKINSLPCDISLPDPDTYNDQIDWNPYIDPDMIKEIDKAFFTVPDEEQE 141

Query: 570  LSSNEIPDCNNKNKSTLDNPWE------SHHLENNVDIKDLAQSWNKWGDSLESKDAMNL 731
             +   I +   K     +NP E      S  LENN       Q WN+ G+S +  +  N 
Sbjct: 142  TA---IKNKRTKTSVNDENPLECSDTPLSRALENNE-----VQRWNQ-GNSGDVDNTDNP 192

Query: 732  WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKLPNE 911
            WE    + +    D  W     K +GWN+G          + +    WN   L      +
Sbjct: 193  WECSVTHGNGRLTDNAWEGGPVKSWGWNEG---------RDHNQCKDWNSENL-----QD 238

Query: 912  KGWGDASKNSCGWNCGNSRSNEWGNAGNVDSWKPRPGGTSYCRQFTNYGNNSRN-SKWLT 1088
            KGWG A  +S  W C   +SN   N GN  SW+ +    +     T + N+  N S W  
Sbjct: 239  KGWGKARDSS--W-C-QQQSNNLANFGN-SSWQCKSSQQNVTPLKTGWRNSGANGSGWKQ 293

Query: 1089 NQNIDTEKFDSG 1124
             +  D  + + G
Sbjct: 294  QEKADVSRRNYG 305


>ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago truncatula]
            gi|355487974|gb|AES69177.1| hypothetical protein
            MTR_3g023510 [Medicago truncatula]
          Length = 365

 Score =  140 bits (352), Expect = 1e-30
 Identities = 105/369 (28%), Positives = 163/369 (44%), Gaps = 34/369 (9%)
 Frame = +3

Query: 213  SPPNYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXX 386
            +PP+ +     P    ++  +P WE+K+C LSG +PW K++ +K+ IYCH NVL W    
Sbjct: 18   NPPSIFYDIRAPLPEFRHDGIPVWEKKYCTLSGCVPWQKIVDSKELIYCHHNVLDWKDSG 77

Query: 387  XXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFN-PDEA 563
                    ++R+ + + +LP +  LPDPD YI++IDWNP ID EL+ +LD  +F  PDE 
Sbjct: 78   AEEAFQNAKKRYWANVNNLPCDISLPDPDAYIEQIDWNPCIDAELIKELDNAFFTVPDEE 137

Query: 564  ENLSSNEIPDCNNKNKSTLDNPWE------SHHLENNVDIKDLAQSWNKWGDSLESKDAM 725
            E    N I     K     +NPWE         LENN   +   Q+   + D+ E+    
Sbjct: 138  E--QENAIQYKRTKISVDGENPWECAATSVGRGLENN---EVQGQNQGDYHDNSENVGTT 192

Query: 726  -NLWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 902
             N W    +  ++   D  W     K  GWN+G        +  ++  + WN G L    
Sbjct: 193  DNPWVSSAVCGNQGLTDNAWEGGHVKSRGWNEG--------RDHNNQCSGWNSGCLQ--- 241

Query: 903  PNEKGWGDASKNSC-------------GWNCGNSRSN------EWGNAGN-VDSWKPRPG 1022
              +KGWG    NS               W C +S+ N       W N+G  V  WK    
Sbjct: 242  -TDKGWGKVRDNSWCHQKSNNLANSGNSWGCKSSQQNVIPMNTGWRNSGTIVPRWKQHE- 299

Query: 1023 GTSYCRQFTNYGNNSRNSKWLT-NQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHE 1199
              +Y    + +  N  N  W + NQ+    +  S  H+ +    +  ++++     W+ E
Sbjct: 300  -NAYVTSDSQFRRN--NGGWNSGNQSYHQMRGGSNRHNPSYNGSQPQRDDSQTGHYWRRE 356

Query: 1200 GA---DFEA 1217
             +   DF A
Sbjct: 357  QSRKRDFRA 365

