BLASTX nr result

ID: Angelica22_contig00001300 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001300
         (1468 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|2...   173   1e-40
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...   150   6e-34
ref|XP_002513116.1| pentatricopeptide repeat-containing protein,...   146   1e-32
ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago ...   141   5e-31
ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797...   140   8e-31

>ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|222832370|gb|EEE70847.1|
            predicted protein [Populus trichocarpa]
          Length = 394

 Score =  173 bits (438), Expect = 1e-40
 Identities = 115/375 (30%), Positives = 179/375 (47%), Gaps = 27/375 (7%)
 Frame = +3

Query: 201  NYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXXX 374
            +Y   +  P     +  +P WE+KFC L G +PW KV+ AKKY+YCH N+L W       
Sbjct: 21   SYDYPESPPHSSFVDDGIPSWEKKFCSLIGSVPWRKVVDAKKYMYCHGNILNWDDSAGEE 80

Query: 375  XXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPKIDPELMSDLDKEYFNPDEAE--- 545
                 ++RF + I  +      PDP+++ID+I WN  IDPE++ DL+++ F PDE +   
Sbjct: 81   AFHNAKKRFWAEINGVSCGISPPDPNLFIDEIKWNAYIDPEVIKDLEQDLFVPDEGDTGG 140

Query: 546  ----------NLSSNEIPDCNNKNKSTLDNPWES-HRLENNVDIKDLAQSWNKWGDSLES 692
                      N  S     C  +N   + NPWES +  ++++ + D A+SWN+W   +  
Sbjct: 141  KVGRKNKKRRNFVSIPSNGC-YENTDDVKNPWESNNNTQSSLSLIDKAKSWNQWDSDINK 199

Query: 693  KDAMNL----WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQ 860
               +N     WE+    + EA K K WG   NK +GWN   N   +S+   +++ N W  
Sbjct: 200  SSNLNKVDNPWERGFSQESEAVKGKTWGVCGNKSWGWNHSGNHVDQSNDW-NNNSNPWQH 258

Query: 861  GALHFKLPNEKGWGDASKNSGGWNCGNSR--SNEWGNAGNVDSWKPRPGGTSYCRQFTNY 1034
                    N+KGWG+   +S G+N   SR  +N+  ++GN      +  G S  R++ + 
Sbjct: 259  SRQGVDPANDKGWGNLRDSSRGYNQHESRKWNNDCKSSGNGFF---QGSGASKDRKWEDN 315

Query: 1035 GNNSRNSKWLTNQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHEGA-----DFEAH 1199
            G+NS+  K   N   +T+  D   H G    R E         SW+ EGA      +E+ 
Sbjct: 316  GSNSQGWKQWDNYGKNTKGLDFRKHGGGWETRNE--------GSWQREGAHQHITGYEST 367

Query: 1200 QFWGKASPHGQYYRG 1244
            +F G     G  + G
Sbjct: 368  RFQGDGFQTGHSWSG 382


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score =  150 bits (380), Expect = 6e-34
 Identities = 105/308 (34%), Positives = 140/308 (45%), Gaps = 37/308 (12%)
 Frame = +3

Query: 240  QNSSLPWEQKFCLLSGIPWYKVLAAKKYIYCHDNVLKWXXXXXXXXXXXXQERFCSMIYS 419
            QNS   WE++FC   GIPW KV+ AKKYI+ H +VL W            + RF + I  
Sbjct: 17   QNSVPSWEKRFCTSVGIPWGKVVDAKKYIHYHVDVLNWNDLAGEEAFHNAKRRFWAEING 76

Query: 420  LPHEPPLPDPDMYIDKIDWNPKIDPELMSDLDKEYFNPDEAENLSSNEIPDCNNKNKSTL 599
            +P     PDPD+YID IDWNP IDPELM +LDKE+F+PDE E        DC  KN ++ 
Sbjct: 77   IPCSISQPDPDIYIDNIDWNPXIDPELMRNLDKEFFSPDEREQ-------DC--KNPASG 127

Query: 600  DNPWESHRLENNVDIKDLAQSWNKWG---DSLESKDAMN--------------------- 707
            DNPWE   L     +KD A +W+KWG     L + D  N                     
Sbjct: 128  DNPWE---LNMPKTLKDRAWAWDKWGGCKTELRNLDKTNSQVSGYATEGHYRKPDGGDSP 184

Query: 708  --LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 881
               W  + +  ++A    +WG ++N+    ++ LN     D +   H  +W  GA+    
Sbjct: 185  WEYWNVQGVLKEKARVRNQWGGNINE----SRNLN----GDDNRWKHSCTWASGAV---- 232

Query: 882  PNEKGWGDASKNSGG-WNCGNSRSNEWGNAGN-VDSWK---------PRPGGTSYCRQFT 1028
              +  WG+   NS   WN      N+  N  N VD+W           R        Q  
Sbjct: 233  -RDDSWGNCEGNSWRMWNEVPKPINQLSNLDNGVDNWNSSCNQANAAQRDNACGGWSQGW 291

Query: 1029 NYGNNSRN 1052
            NY N SRN
Sbjct: 292  NYQNKSRN 299


>ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548127|gb|EEF49619.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1128

 Score =  146 bits (369), Expect = 1e-32
 Identities = 101/342 (29%), Positives = 158/342 (46%), Gaps = 31/342 (9%)
 Frame = +3

Query: 180  HPRRSPPNYYQTKVDPQE----VLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNV 344
            H  R P + Y    +P        ++    WE+KFC L G +PW K++  KK++YCHD V
Sbjct: 13   HQYRDPASSYYNHQEPPPPYPGFAEDGVPSWEKKFCSLIGSVPWQKIVNVKKFMYCHDIV 72

Query: 345  LKWXXXXXXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPKIDPELMSDLDKEY 524
            + W            + RF + I  LP +  LPDPDMY+D I+W+P IDPEL+ +L++ +
Sbjct: 73   INWNDSAGADAFQNAKNRFWADINRLPCQISLPDPDMYVDDINWHPDIDPELVKELERAF 132

Query: 525  FNPDEAENLSSNEIPDCNNK---------------NKSTLDNPWE-SHRLENNVDIKDLA 656
            F P+E EN   N+  +C NK               N   +  PWE      +NV +++  
Sbjct: 133  FAPEEGEN---NDNVECKNKKARHFLSVPSEGWNRNPDEVRIPWECEDEGGSNVAVEEKT 189

Query: 657  QSWNKWGDSLESKDAMN----LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRE-- 818
            + WN+W  S  S   +N     W       +EA + K WG+  +K  GW+  +++  +  
Sbjct: 190  RGWNQWRISTNSSRNVNNGDTPWVSHFTQGNEAVEGKTWGNCADKLQGWSGSVHNQAKDW 249

Query: 819  SDKHESDHVNSWNQGALHFKLPNEKG-WGDASKNSGGW-NCGN--SRSNEWGNAGNVDSW 986
               + ++    W    L       KG W ++S    GW + GN  ++S EW + GN   W
Sbjct: 250  GSCNLTNDDKPWGHSYL-------KGTWRESSGKLWGWSHKGNQVNQSKEWDSGGN--PW 300

Query: 987  KPRPGGTSYCRQFTNYGNNSRNSKWLTNQNIDTEKFDSGIHS 1112
            +    G    +   N   NS +  W  N+ +   KF  G +S
Sbjct: 301  EHSSQGVVLVKD--NVWGNSNHISWGKNKQVG--KFSHGENS 338


>ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago truncatula]
            gi|355487974|gb|AES69177.1| hypothetical protein
            MTR_3g023510 [Medicago truncatula]
          Length = 365

 Score =  141 bits (355), Expect = 5e-31
 Identities = 105/369 (28%), Positives = 163/369 (44%), Gaps = 34/369 (9%)
 Frame = +3

Query: 192  SPPNYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXX 365
            +PP+ +     P    ++  +P WE+K+C LSG +PW K++ +K+ IYCH NVL W    
Sbjct: 18   NPPSIFYDIRAPLPEFRHDGIPVWEKKYCTLSGCVPWQKIVDSKELIYCHHNVLDWKDSG 77

Query: 366  XXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPKIDPELMSDLDKEYFN-PDEA 542
                    ++R+ + + +LP +  LPDPD YI++IDWNP ID EL+ +LD  +F  PDE 
Sbjct: 78   AEEAFQNAKKRYWANVNNLPCDISLPDPDAYIEQIDWNPCIDAELIKELDNAFFTVPDEE 137

Query: 543  ENLSSNEIPDCNNKNKSTLDNPWE------SHRLENNVDIKDLAQSWNKWGDSLESKDAM 704
            E    N I     K     +NPWE         LENN   +   Q+   + D+ E+    
Sbjct: 138  E--QENAIQYKRTKISVDGENPWECAATSVGRGLENN---EVQGQNQGDYHDNSENVGTT 192

Query: 705  -NLWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 881
             N W    +  ++   D  W     K  GWN+G        +  ++  + WN G L    
Sbjct: 193  DNPWVSSAVCGNQGLTDNAWEGGHVKSRGWNEG--------RDHNNQCSGWNSGCLQ--- 241

Query: 882  PNEKGWGDASKNS-------------GGWNCGNSRSN------EWGNAGN-VDSWKPRPG 1001
              +KGWG    NS               W C +S+ N       W N+G  V  WK    
Sbjct: 242  -TDKGWGKVRDNSWCHQKSNNLANSGNSWGCKSSQQNVIPMNTGWRNSGTIVPRWKQHE- 299

Query: 1002 GTSYCRQFTNYGNNSRNSKWLT-NQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHE 1178
              +Y    + +  N  N  W + NQ+    +  S  H+ +    +  ++++     W+ E
Sbjct: 300  -NAYVTSDSQFRRN--NGGWNSGNQSYHQMRGGSNRHNPSYNGSQPQRDDSQTGHYWRRE 356

Query: 1179 GA---DFEA 1196
             +   DF A
Sbjct: 357  QSRKRDFRA 365


>ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797066 [Glycine max]
          Length = 387

 Score =  140 bits (353), Expect = 8e-31
 Identities = 95/312 (30%), Positives = 142/312 (45%), Gaps = 9/312 (2%)
 Frame = +3

Query: 195  PPNYYQTKVDPQEVLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXX 371
            PP +Y       E  Q+    WE+K+C + G +PW K++ +K ++YCH NV  W      
Sbjct: 22   PPTFYDINAPLPEYWQDGIPLWEKKYCTIVGLVPWQKIVDSKMFVYCHSNVFDWNDSAAE 81

Query: 372  XXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPKIDPELMSDLDKEYFN-PDEAEN 548
                  +  + + I SLP +  LPDPD Y D+IDWNP IDP+++ ++DK +F  PDE + 
Sbjct: 82   EALQNAKNHYWAKINSLPCDISLPDPDTYNDQIDWNPYIDPDMIKEIDKAFFTVPDEEQE 141

Query: 549  LSSNEIPDCNNKNKSTLDNPWE------SHRLENNVDIKDLAQSWNKWGDSLESKDAMNL 710
             +   I +   K     +NP E      S  LENN       Q WN+ G+S +  +  N 
Sbjct: 142  TA---IKNKRTKTSVNDENPLECSDTPLSRALENNE-----VQRWNQ-GNSGDVDNTDNP 192

Query: 711  WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKLPNE 890
            WE    + +    D  W     K +GWN+G          + +    WN   L      +
Sbjct: 193  WECSVTHGNGRLTDNAWEGGPVKSWGWNEG---------RDHNQCKDWNSENL-----QD 238

Query: 891  KGWGDASKNSGGWNCGNSRSNEWGNAGNVDSWKPRPGGTSYCRQFTNYGNNSRN-SKWLT 1067
            KGWG A  +S  W C   +SN   N GN  SW+ +    +     T + N+  N S W  
Sbjct: 239  KGWGKARDSS--W-C-QQQSNNLANFGN-SSWQCKSSQQNVTPLKTGWRNSGANGSGWKQ 293

Query: 1068 NQNIDTEKFDSG 1103
             +  D  + + G
Sbjct: 294  QEKADVSRRNYG 305


Top