BLASTX nr result

ID: Cephaelis21_contig00003489 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00003489
         (3423 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307797.1| predicted protein [Populus trichocarpa] gi|2...   527   e-147
ref|XP_002510687.1| pentatricopeptide repeat-containing protein,...   504   e-140
ref|XP_002272601.1| PREDICTED: pentatricopeptide repeat-containi...   493   e-136
emb|CAN74746.1| hypothetical protein VITISV_012024 [Vitis vinifera]   492   e-136
ref|XP_003528528.1| PREDICTED: pentatricopeptide repeat-containi...   477   e-131

>ref|XP_002307797.1| predicted protein [Populus trichocarpa] gi|222857246|gb|EEE94793.1|
            predicted protein [Populus trichocarpa]
          Length = 475

 Score =  527 bits (1358), Expect = e-147
 Identities = 275/476 (57%), Positives = 343/476 (72%)
 Frame = +2

Query: 629  SDTRVYHRKPPKNLRFPRRTKLPLDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDDV 808
            S TR+ +RK PKN+R+PRR+KLP D    GV+   +    D ++D            DD+
Sbjct: 4    SSTRICYRKIPKNIRYPRRSKLPPD---FGVNLFLKKPQTDSVQDHS----------DDL 50

Query: 809  NGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXXX 988
              +    EEE   +VN   NGEIVWE +E+EAISSLF+GRIPQKPG              
Sbjct: 51   TEE----EEEEEIEVN---NGEIVWESEEIEAISSLFRGRIPQKPGKLGRERPLPLPVPY 103

Query: 989  XXXXXGFPTQKRMLRKSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNRW 1168
                 G P  K+ + K V+++R S+SSQ+YKNP+FL GLA+EIK L P ++ VS+VL+  
Sbjct: 104  KLRPLGLPAPKKHVNKQVSLSRASISSQIYKNPSFLIGLAKEIKRLSP-DQDVSVVLDNC 162

Query: 1169 ARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELRL 1348
            +R+L KGSLS+TIRELGH+GLPE AL  FCW QKQP L+PDDR+LASTVEVLAR+ +L++
Sbjct: 163  SRYLHKGSLSLTIRELGHLGLPERALQTFCWVQKQPRLFPDDRVLASTVEVLARNHDLKV 222

Query: 1349 PFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILEL 1528
            PF L  ++F +L S+ V EA+VKG I+GGSL +  KL+S A+D KR++D  VY K+ILEL
Sbjct: 223  PFNL--EKFTNLASRRVIEAMVKGLIRGGSLKLSWKLISVAKDGKRMLDPSVYAKIILEL 280

Query: 1529 GKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGVP 1708
            GKNPDK           A REDLNL+ QDCTAVMKVCI+LGKFE VE  ++WF++SG  P
Sbjct: 281  GKNPDKHVLAEALLDELAEREDLNLSQQDCTAVMKVCIKLGKFEAVESLFNWFRQSGHEP 340

Query: 1709 SVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRYF 1888
            SVVMYTT+IHSRYSE KYREALAVVWEME S+CLFD  AYRV IKLFVAL DLPRAVRYF
Sbjct: 341  SVVMYTTLIHSRYSESKYREALAVVWEMEGSDCLFDLTAYRVVIKLFVALNDLPRAVRYF 400

Query: 1889 SKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRVQLFEL 2056
            SKLKEAG SPT+D+Y  +I +Y+ SGR+AKCKEVW EAEMAGFK  +++   L +L
Sbjct: 401  SKLKEAGLSPTYDIYRNLITLYMVSGRLAKCKEVWKEAEMAGFKFSKEMAAGLLQL 456


>ref|XP_002510687.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223551388|gb|EEF52874.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 469

 Score =  504 bits (1298), Expect = e-140
 Identities = 274/479 (57%), Positives = 336/479 (70%), Gaps = 1/479 (0%)
 Frame = +2

Query: 629  SDTRVYHRKP-PKNLRFPRRTKLPLDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDD 805
            S+T+ Y+RK  PKNL+ PRR+KLP D    GV+   +     +  D    DS   VDD  
Sbjct: 4    SNTKTYYRKKLPKNLQSPRRSKLPPD---FGVNLFLKKPTTGV--DPFQMDSFL-VDD-- 55

Query: 806  VNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXX 985
              G  +L +EE        +NG+IVWE DE+EAISSLF+GRIPQ+PGN            
Sbjct: 56   --GDGDLNQEEKE------QNGDIVWESDEIEAISSLFQGRIPQRPGNLNRERPLPLPLP 107

Query: 986  XXXXXXGFPTQKRMLRKSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNR 1165
                  G P+ K+  R  V   R  + +++YKNP+FL  LA++IK L P +  VS VL+ 
Sbjct: 108  HKLRPLGPPSPKKHNRNVVVSLRSPICNKVYKNPSFLISLAKQIKCLNPDD-DVSAVLDD 166

Query: 1166 WARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELR 1345
             ARFLRKGSLS+TIRELGHMG P+ AL  FCWAQKQP LYPDDRILASTVE+LAR+++L+
Sbjct: 167  CARFLRKGSLSLTIRELGHMGFPDRALQTFCWAQKQPQLYPDDRILASTVEILARNQDLK 226

Query: 1346 LPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILE 1525
            +P      +F SL S+ V EA+++GF+KGG L +  KLL+ A+  KR++D+ +Y +LILE
Sbjct: 227  VPIDWQ--KFTSLASRGVIEAMIRGFLKGGRLKLAWKLLAVAKHDKRMLDASLYARLILE 284

Query: 1526 LGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGV 1705
            LGKNPDK             REDLNL+ QDCTA+MKVCIRL KFE VE  + WFK+SG  
Sbjct: 285  LGKNPDKYMLVEQLLDELGEREDLNLSHQDCTAIMKVCIRLQKFEFVECLFTWFKQSGHE 344

Query: 1706 PSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRY 1885
            PSVVMYTT+IHSRYSEKKYREALA VWEME S+ LFD PAYRV IKLFVAL DLPRAVRY
Sbjct: 345  PSVVMYTTLIHSRYSEKKYREALAGVWEMEGSSFLFDLPAYRVVIKLFVALNDLPRAVRY 404

Query: 1886 FSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRVQLFELGK 2062
            FSKLKEAG SPT+D+Y  +IKIYL SGR+AKCKE+W EAEMAGFKLD+QI++ L    K
Sbjct: 405  FSKLKEAGLSPTYDIYRNLIKIYLVSGRLAKCKEIWKEAEMAGFKLDEQIKMDLLHQEK 463


>ref|XP_002272601.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01860-like
            [Vitis vinifera]
          Length = 514

 Score =  493 bits (1270), Expect = e-136
 Identities = 271/487 (55%), Positives = 330/487 (67%), Gaps = 10/487 (2%)
 Frame = +2

Query: 632  DTRVYHRKPPKNLRFPRRTKLPLDPCLT-----GVSTVSRPFANDII--RDDDNFDSLTA 790
            + RV HRKP KNL  PRR KLP +P ++     G S   +     ++    D N D    
Sbjct: 46   NARVNHRKPTKNLPHPRRAKLPPEPEISTFLKGGNSGTEQSEMGTVLDKEPDANDDGFLV 105

Query: 791  VDDDDVNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXX 970
               D + G+KE               GEIVW+ DE+EAISSLF GRIPQKPG        
Sbjct: 106  ---DGIEGRKE---------------GEIVWDSDEIEAISSLFMGRIPQKPGKLNRERPL 147

Query: 971  XXXXXXXXXXXGFPTQKRMLRKSVAI---ARQSVSSQLYKNPTFLGGLAREIKALPPGER 1141
                       G PT KR +R + ++   +R S+S Q+YKNP FL  +AREI+ LP  E 
Sbjct: 148  PLPLPYKLRPMGLPTTKRHVRAASSMPYASRASLSKQVYKNPDFLISIAREIRKLPL-ED 206

Query: 1142 SVSLVLNRWARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEV 1321
             VS VLN+W RFLRKGSLS+TIRELGHMGLPE AL  F WAQKQP L+PDDRILASTVEV
Sbjct: 207  DVSPVLNKWVRFLRKGSLSLTIRELGHMGLPERALQTFFWAQKQPQLFPDDRILASTVEV 266

Query: 1322 LARSRELRLPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSG 1501
            LAR+ +L++PF L  ++F  L S+SV EA+ +GFI+ GSL++  KLL  A+DSKR++   
Sbjct: 267  LARTHKLKVPFSL--EKFTGLASRSVIEALARGFIRRGSLSLAWKLLLVAKDSKRMLGPS 324

Query: 1502 VYVKLILELGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYD 1681
            +Y KLI ELGKNPDK             REDL L+ QDCTAVMKVCIRLGKFEIVE  ++
Sbjct: 325  IYAKLIFELGKNPDKHSLVQALLDELGEREDLKLSHQDCTAVMKVCIRLGKFEIVESLFN 384

Query: 1682 WFKKSGGVPSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALT 1861
            W+K+S   PSVVMYTT+IHSRY+EKKYREALAVVWEMEAS+C+FD PAYRV IKLF+AL 
Sbjct: 385  WYKQSENSPSVVMYTTLIHSRYTEKKYREALAVVWEMEASDCVFDLPAYRVVIKLFIALN 444

Query: 1862 DLPRAVRYFSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRV 2041
            DL R  RYFSKLKEAGFSPT+D+Y  ++KIY+   R+AKC+EV  E EM+GFKLD+    
Sbjct: 445  DLSRTGRYFSKLKEAGFSPTYDIYRDMLKIYMVFRRLAKCREVCKELEMSGFKLDKGTLS 504

Query: 2042 QLFELGK 2062
            QL +L K
Sbjct: 505  QLLQLEK 511


>emb|CAN74746.1| hypothetical protein VITISV_012024 [Vitis vinifera]
          Length = 514

 Score =  492 bits (1267), Expect = e-136
 Identities = 271/485 (55%), Positives = 329/485 (67%), Gaps = 10/485 (2%)
 Frame = +2

Query: 632  DTRVYHRKPPKNLRFPRRTKLPLDP----CLTGVSTVSRPFANDIIRD---DDNFDSLTA 790
            + RV HRKP KNL  PRR KLP +P     L G ++ +       + D   D N D    
Sbjct: 46   NARVNHRKPTKNLPHPRRAKLPPEPEISTFLKGGNSGTEQSEMGTVLDKELDPNDDGFLV 105

Query: 791  VDDDDVNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXX 970
               D + G+KE               GEIVW+ DE+EAISSLF GRIPQKPG        
Sbjct: 106  ---DGIEGRKE---------------GEIVWDSDEIEAISSLFMGRIPQKPGKLNRERPL 147

Query: 971  XXXXXXXXXXXGFPTQKRMLRKSVAI---ARQSVSSQLYKNPTFLGGLAREIKALPPGER 1141
                       G PT KR +R + ++   +R S+S Q+YKNP FL  +AREI+ LP  E 
Sbjct: 148  PLPLPYKIRPMGLPTTKRHVRAASSMPYASRASLSKQVYKNPDFLISIAREIRNLPL-ED 206

Query: 1142 SVSLVLNRWARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEV 1321
             VS VLN+W RFLRKGSLS+TIRELGHMGLPE AL  F WAQKQP L+PDDRILASTVEV
Sbjct: 207  DVSPVLNKWVRFLRKGSLSLTIRELGHMGLPERALQTFFWAQKQPQLFPDDRILASTVEV 266

Query: 1322 LARSRELRLPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSG 1501
            LAR+ +L++PF L  ++F  L ++SV EA+ +GFI+ GSL++  KLL  A+DSKR++   
Sbjct: 267  LARTHKLKVPFSL--EKFTGLATRSVIEALARGFIRRGSLSLAWKLLLVAKDSKRMLGPS 324

Query: 1502 VYVKLILELGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYD 1681
            +Y KLI ELGKNPDK             REDL L+ QDCTAVMKVCIRLGKFEIVE  ++
Sbjct: 325  IYAKLIFELGKNPDKHSLVQALLDELGEREDLKLSHQDCTAVMKVCIRLGKFEIVESLFN 384

Query: 1682 WFKKSGGVPSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALT 1861
            W+K+S   PSVVMYTT+IHSRY+EKKYREALAVVWEMEAS+CLFD PAYRV IKLF+AL 
Sbjct: 385  WYKQSENSPSVVMYTTLIHSRYTEKKYREALAVVWEMEASDCLFDLPAYRVVIKLFIALN 444

Query: 1862 DLPRAVRYFSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRV 2041
            DL R  RYFSKLKEAGFSPT+D+Y  ++KIY+   R+AKC+EV  E EM+GFKLD+    
Sbjct: 445  DLSRTGRYFSKLKEAGFSPTYDIYRDMLKIYMVFRRLAKCREVCKELEMSGFKLDKGTLS 504

Query: 2042 QLFEL 2056
            QL +L
Sbjct: 505  QLLQL 509


>ref|XP_003528528.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01860-like
            [Glycine max]
          Length = 478

 Score =  477 bits (1227), Expect = e-131
 Identities = 257/469 (54%), Positives = 323/469 (68%), Gaps = 7/469 (1%)
 Frame = +2

Query: 650  RKPPKNLRFPRRTKLP----LDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDDVNGQ 817
            R+PPK+ R+  R K P    ++  L   ST S+P                   DDD++  
Sbjct: 34   RRPPKDNRYRPRPKQPPEFGVNLFLKKPSTASKP------------------TDDDMDSN 75

Query: 818  KELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXXXXXX 997
            +E  EEE      DG  G +VWE DELEAISSLF+GRIPQKPG                 
Sbjct: 76   EENDEEE------DGNIG-VVWESDELEAISSLFQGRIPQKPGKLDRERPLPLPVPFKLR 128

Query: 998  XXGFPTQKRMLR---KSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNRW 1168
                PT K  ++    +V  +R S++ ++YK+P+FL GLAR+I  L P +  VS +L +W
Sbjct: 129  PLRLPTPKTQVKLTAPAVVSSRASMAKKVYKSPSFLVGLARQISRLGP-DADVSKILGKW 187

Query: 1169 ARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELRL 1348
             +FLRKGSLS+TIRELGHMG PE AL  F WAQ QPHL+PDD ILASTVEVLAR+ ELR+
Sbjct: 188  VQFLRKGSLSLTIRELGHMGFPERALQTFLWAQNQPHLFPDDWILASTVEVLARNHELRI 247

Query: 1349 PFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILEL 1528
            PF L   ++  L S++V EA++KGFIKGG+L    K+L  AR  KR++DS +Y KLILEL
Sbjct: 248  PFNL--GQYTGLASRAVLEAMIKGFIKGGNLRFAWKVLIVARRDKRMLDSSIYAKLILEL 305

Query: 1529 GKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGVP 1708
            GKNPD+             R++LNL+ QDCTA+MKVC+++GKFE+VE  + WFK+SG  P
Sbjct: 306  GKNPDRHRHVLPLLDELGERDELNLSQQDCTAIMKVCVKMGKFEVVESLFSWFKQSGYQP 365

Query: 1709 SVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRYF 1888
            S+VM+T++IHSRY+EKKYREALAVVWEMEASNCLFD PAYRV IKLFVAL DL RA RYF
Sbjct: 366  SIVMFTSVIHSRYTEKKYREALAVVWEMEASNCLFDLPAYRVVIKLFVALNDLSRATRYF 425

Query: 1889 SKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQI 2035
            SKLKEAGFSP+F LY  +++IY+ASGRIAKCKE+  EAE+AGFKLD+ +
Sbjct: 426  SKLKEAGFSPSFGLYKDMLQIYMASGRIAKCKELCREAEIAGFKLDKYL 474


Top