BLASTX nr result

ID: Catharanthus22_contig00045781 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00045781
         (425 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006345691.1| PREDICTED: pentatricopeptide repeat-containi...    93   4e-17
ref|XP_002323442.1| hypothetical protein POPTR_0016s08300g [Popu...    75   1e-11
ref|XP_002273989.2| PREDICTED: pentatricopeptide repeat-containi...    74   3e-11
emb|CBI32643.3| unnamed protein product [Vitis vinifera]               74   3e-11
gb|EOY07667.1| Tetratricopeptide repeat-like superfamily protein...    67   3e-09
ref|XP_006428957.1| hypothetical protein CICLE_v10011437mg [Citr...    64   3e-08
ref|XP_006410915.1| hypothetical protein EUTSA_v10017783mg [Eutr...    61   2e-07
ref|XP_006294030.1| hypothetical protein CARUB_v10023022mg [Caps...    60   3e-07
ref|XP_003624377.1| Pentatricopeptide repeat-containing protein ...    59   5e-07
ref|XP_002879661.1| pentatricopeptide repeat-containing protein ...    59   7e-07
ref|XP_004492962.1| PREDICTED: pentatricopeptide repeat-containi...    58   1e-06

>ref|XP_006345691.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Solanum tuberosum]
          Length = 483

 Score = 92.8 bits (229), Expect = 4e-17
 Identities = 48/130 (36%), Positives = 80/130 (61%), Gaps = 1/130 (0%)
 Frame = -1

Query: 389 NSLPFNSAVQSSKPFSALQELGQSAKTIQFKGKRVLDIVASKPNVANNCQSHLRLVQDFL 210
           N+L F   ++ + P + ++           KG+R LDI+A K NV  N Q+ ++L++DFL
Sbjct: 2   NNLLFRYGMRKNLPLTKIKR--------PLKGQRFLDILAPKQNVIKNHQNCVKLIEDFL 53

Query: 209 RTNPIQLKETQIDNE-TFHPDQNENSISILYKLHKRGLSIDPSVLSNALSICGSERALRM 33
           +T  +Q  +  +D + +F   + E+S+S+L++ HK G  I  S++SNA+S+C S+R   +
Sbjct: 54  QTGSVQNCKHPLDKDNSFSYSKEEDSVSLLFRFHKEGCKIGVSLVSNAMSLCASKRVFNV 113

Query: 32  GIQFHCLVFV 3
           GIQ HCLV V
Sbjct: 114 GIQVHCLVIV 123


>ref|XP_002323442.1| hypothetical protein POPTR_0016s08300g [Populus trichocarpa]
           gi|222868072|gb|EEF05203.1| hypothetical protein
           POPTR_0016s08300g [Populus trichocarpa]
          Length = 526

 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 43/127 (33%), Positives = 69/127 (54%), Gaps = 13/127 (10%)
 Frame = -1

Query: 353 KPFSALQELGQSAKTIQF-KGKRVLDIVASKPNVANNCQSHLRLVQDFLRTNPIQLKETQ 177
           +PFS+  +  ++ +T +  K  R+LDI+  K     N Q+HLR++QDF + +  +  E +
Sbjct: 28  RPFSS-HKFRRTTQTKRLDKALRILDIITPKTTAPTNGQNHLRVIQDFFQAHSNRTSEQR 86

Query: 176 IDNETFHPDQNENSISILYKL------------HKRGLSIDPSVLSNALSICGSERALRM 33
           + N+   P+ +    S+  ++            +   LS D SVLSNA+S C S R LR 
Sbjct: 87  LSNDFISPNSDNGDFSVFDEILESSFINNDEDSNATSLSFDASVLSNAVSSCASTRDLRG 146

Query: 32  GIQFHCL 12
           GIQ+HCL
Sbjct: 147 GIQYHCL 153


>ref|XP_002273989.2| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Vitis vinifera]
          Length = 510

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 50/140 (35%), Positives = 79/140 (56%), Gaps = 2/140 (1%)
 Frame = -1

Query: 425 LRRFWDKSSHCNNSLPFNSAVQSSKPFSALQELGQSAKTIQFKGKRVLDIVASKPNVANN 246
           LR+F  K      S+PF+S   S   FS  Q + +  KT +  G  +L++V+   + A N
Sbjct: 6   LRQFIRKKPSSPISVPFSSYGSS---FS--QNIKRKTKTHKQLG--ILNLVSPTTDCAEN 58

Query: 245 CQSHLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK--LHKRGLSIDPSVLSN 72
            Q+HLRL+QDFL     Q  + +  ++  H +  E + ++L    L++    +D S LS+
Sbjct: 59  RQTHLRLIQDFLPIPTNQFAQKRASDDFAHSNSLEETSNMLETNFLNEEESIVDASALSH 118

Query: 71  ALSICGSERALRMGIQFHCL 12
           ALS+C S R+L+ G+QFHCL
Sbjct: 119 ALSLCASSRSLKSGVQFHCL 138


>emb|CBI32643.3| unnamed protein product [Vitis vinifera]
          Length = 1400

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 50/140 (35%), Positives = 79/140 (56%), Gaps = 2/140 (1%)
 Frame = -1

Query: 425  LRRFWDKSSHCNNSLPFNSAVQSSKPFSALQELGQSAKTIQFKGKRVLDIVASKPNVANN 246
            LR+F  K      S+PF+S   S   FS  Q + +  KT +  G  +L++V+   + A N
Sbjct: 867  LRQFIRKKPSSPISVPFSSYGSS---FS--QNIKRKTKTHKQLG--ILNLVSPTTDCAEN 919

Query: 245  CQSHLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK--LHKRGLSIDPSVLSN 72
             Q+HLRL+QDFL     Q  + +  ++  H +  E + ++L    L++    +D S LS+
Sbjct: 920  RQTHLRLIQDFLPIPTNQFAQKRASDDFAHSNSLEETSNMLETNFLNEEESIVDASALSH 979

Query: 71   ALSICGSERALRMGIQFHCL 12
            ALS+C S R+L+ G+QFHCL
Sbjct: 980  ALSLCASSRSLKSGVQFHCL 999


>gb|EOY07667.1| Tetratricopeptide repeat-like superfamily protein, putative
           [Theobroma cacao]
          Length = 516

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 48/142 (33%), Positives = 71/142 (50%), Gaps = 16/142 (11%)
 Frame = -1

Query: 389 NSLPFNSAVQSSKPFSALQE-LGQSAKTIQFKGK--------RVLDIVASKPNVANNCQS 237
           N+L   S +Q +  F  L   L QS     +K K        R++DI++ KP      Q+
Sbjct: 2   NTLLLKSIIQRNSCFKPLLFFLSQSIPFSSYKLKLNPPSKVLRIMDIMSPKPTPTPR-QN 60

Query: 236 HLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK-------LHKRGLSIDPSVL 78
           HLRL+QDFL+++  Q       N+  + D    ++ I +         +K     +P VL
Sbjct: 61  HLRLIQDFLQSDSDQFTAQHFVNDFVYSDSPTENLPIFFNEILAPPVTNKDISKFNPIVL 120

Query: 77  SNALSICGSERALRMGIQFHCL 12
           SNA+S CGS+R L  GIQ+HCL
Sbjct: 121 SNAISSCGSKRNLYGGIQYHCL 142


>ref|XP_006428957.1| hypothetical protein CICLE_v10011437mg [Citrus clementina]
           gi|568853804|ref|XP_006480530.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g37320-like [Citrus sinensis]
           gi|557531014|gb|ESR42197.1| hypothetical protein
           CICLE_v10011437mg [Citrus clementina]
          Length = 539

 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 26/146 (17%)
 Frame = -1

Query: 371 SAVQSSKPFSAL------QELGQ--SAKTIQFKGKRVLDIVASKPNVANNCQSHLRLVQD 216
           SA+QS K  S +      Q+L Q  S+K++  K  RVLDI++ +   +   + HLRL+QD
Sbjct: 16  SALQSLKTLSIIYRSFCSQKLKQISSSKSLH-KALRVLDIISPRTRDSTKTEVHLRLIQD 74

Query: 215 FLRTNPIQLKETQID------NETFHPDQN------------ENSISILYKLHKRGLSID 90
           FL+T+   L   + +      N TF                 E  IS+ + LH+  L +D
Sbjct: 75  FLQTDSKHLDSQKFNHDFTGTNSTFGSSNVFDQLLDTPVVDVEKLISMHHDLHRERLKVD 134

Query: 89  PSVLSNALSICGSERALRMGIQFHCL 12
            S LS A++ CGS R +R G  + CL
Sbjct: 135 ASFLSTAVTSCGSTRNIRGGAPYQCL 160


>ref|XP_006410915.1| hypothetical protein EUTSA_v10017783mg [Eutrema salsugineum]
           gi|557112084|gb|ESQ52368.1| hypothetical protein
           EUTSA_v10017783mg [Eutrema salsugineum]
          Length = 502

 Score = 60.8 bits (146), Expect = 2e-07
 Identities = 39/103 (37%), Positives = 58/103 (56%), Gaps = 10/103 (9%)
 Frame = -1

Query: 290 RVLDIVASKPNVANNCQSHLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK-- 117
           RVLDI++SK    +N Q+HL +V++FL+T+  Q K+  I +E F   + +N IS + +  
Sbjct: 44  RVLDIISSKSCGGSNRQNHLGIVKEFLQTDSRQFKDQAI-SEGFDLSRTKNRISSVLEQV 102

Query: 116 --------LHKRGLSIDPSVLSNALSICGSERALRMGIQFHCL 12
                     + G S D   LS+A+S CGS R  R G  FHC+
Sbjct: 103 LLEDSSSIQQRDGWSFDAYGLSSAVSSCGSNRDFRSGSGFHCV 145


>ref|XP_006294030.1| hypothetical protein CARUB_v10023022mg [Capsella rubella]
           gi|482562738|gb|EOA26928.1| hypothetical protein
           CARUB_v10023022mg [Capsella rubella]
          Length = 514

 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 15/110 (13%)
 Frame = -1

Query: 290 RVLDIVASKPNVANNCQSHLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK-- 117
           RVLDI++SK   A+N Q+H   VQ+FL+T+  Q   + I N+ F   + +N +S + +  
Sbjct: 53  RVLDIISSKSGGASNRQNHFGFVQEFLQTDSRQFIGSAISND-FDLSRTKNGVSSVLEEV 111

Query: 116 ------------LHKR-GLSIDPSVLSNALSICGSERALRMGIQFHCLVF 6
                       +H+R G S D   LS+A+  CGS    R G  FHCL F
Sbjct: 112 LLEDFTSFVNGGMHQRDGWSFDAYGLSSAVRSCGSSGDFRTGSGFHCLAF 161


>ref|XP_003624377.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355499392|gb|AES80595.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 487

 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 47/137 (34%), Positives = 68/137 (49%), Gaps = 11/137 (8%)
 Frame = -1

Query: 389 NSLP-FNSAVQSSKPFSALQELGQSAKTIQFKGKRVLDIVASKPNVAN--NCQSHLRLVQ 219
           N LP F+   Q  +PFS  +   ++         R+L +V+ K  V +  N +SHLRLV+
Sbjct: 11  NMLPTFHHKNQIFRPFSFYRLGPKNTNKDLTNALRILKLVSPKKTVTDIENRRSHLRLVE 70

Query: 218 DFLR---TNPI-----QLKETQIDNETFHPDQNENSISILYKLHKRGLSIDPSVLSNALS 63
           D L    TNP+      + ET +++     +Q              GL ID   LS+ALS
Sbjct: 71  DILENTTTNPLGSNLKTVTETILESSVLEMEQ--------------GLGIDVCFLSHALS 116

Query: 62  ICGSERALRMGIQFHCL 12
           +CGS+R    GIQ+HCL
Sbjct: 117 LCGSKRDFYGGIQYHCL 133


>ref|XP_002879661.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297325500|gb|EFH55920.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 500

 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 37/103 (35%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
 Frame = -1

Query: 290 RVLDIVASKPNVANNCQSHLRLVQDFLRTNPIQLKETQIDNETFHPDQNENSISILYK-- 117
           RVLDI++SK    +N Q+H   VQ+FL+T+  Q +   I +E F   + +N +S + +  
Sbjct: 44  RVLDIISSKSGGVSNRQNHFGFVQEFLQTDSRQFRGQAI-SEDFDLSRTKNGVSSVLEEV 102

Query: 116 --------LHKRGLSIDPSVLSNALSICGSERALRMGIQFHCL 12
                   + + G S D   LS+A+  CGS R  R G  FHCL
Sbjct: 103 MLEDSSSSVKRDGWSFDAYGLSSAVRSCGSNRDFRTGSGFHCL 145


>ref|XP_004492962.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Cicer arietinum]
          Length = 512

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 49/134 (36%), Positives = 68/134 (50%), Gaps = 8/134 (5%)
 Frame = -1

Query: 389 NSLP--FNSAVQSSKPFSALQELGQSAKTIQFKGK-RVLDIVASKPNVAN--NCQSHLRL 225
           N +P  F+S     +PFS   +LG            RVL +V+ + +V +  N +SHLRL
Sbjct: 16  NRIPTYFHSKNHFRRPFS-FHKLGPKRSNKDLTNALRVLKLVSPQKSVTDIENRRSHLRL 74

Query: 224 VQDFL---RTNPIQLKETQIDNETFHPDQNENSISILYKLHKRGLSIDPSVLSNALSICG 54
           V+D L    TNP+      ++  T      E +I       ++GL ID   LS ALS CG
Sbjct: 75  VEDILDNTATNPLCGNSATLNTVT------ETTIQSSVLEMEQGLGIDVCFLSLALSSCG 128

Query: 53  SERALRMGIQFHCL 12
           S+R L  GIQ+HCL
Sbjct: 129 SKRDLYGGIQYHCL 142


Top