BLASTX nr result

ID: Cephaelis21_contig00032785 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00032785
         (1770 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241...   598   e-168
ref|XP_004142596.1| PREDICTED: uncharacterized protein LOC101209...   570   e-160
ref|XP_002522027.1| pentatricopeptide repeat-containing protein,...   558   e-156
ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807...   555   e-155
ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802...   547   e-153

>ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241547 [Vitis vinifera]
            gi|296085161|emb|CBI28656.3| unnamed protein product
            [Vitis vinifera]
          Length = 884

 Score =  598 bits (1543), Expect = e-168
 Identities = 309/434 (71%), Positives = 337/434 (77%), Gaps = 4/434 (0%)
 Frame = -2

Query: 1769 RKRWVPRRGKTPLDPDAEGFAYSNPMETSFKQQCLEESKIYHRKLLKVLHNEGPAVLGDL 1590
            RKRWVPRRGKTPLDPDA GF YSNPMETSFKQ+CLE+ K+YHRKLLK L NEG A LG++
Sbjct: 440  RKRWVPRRGKTPLDPDALGFIYSNPMETSFKQRCLEDWKMYHRKLLKTLRNEGLAALGEV 499

Query: 1589 SESEYFRVVERLKKIIKGPEQNALKPKAASKMLVSELKEELEAQGLPTDGTRNVLYQRVQ 1410
            SES+Y RV ERL+KIIKGP+QNALKPKAASKM+VSELKEELEAQGLPTDGTRNVLYQRVQ
Sbjct: 500  SESDYIRVEERLRKIIKGPDQNALKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQ 559

Query: 1409 KARRINRSRGRPLWVPPXXXXXXXXXXXXXXLILRIKLQEGNTEFWKRRFLGEGLQENYG 1230
            KARRINRSRGRPLWVPP              LI RIKLQEGNTEFWKRRFLGE L    G
Sbjct: 560  KARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLQEGNTEFWKRRFLGEDLTVGRG 619

Query: 1229 KQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLNDSQVVDRIKAKEAETAKP 1050
            K +                                    +  +SQV DR+K KE E AKP
Sbjct: 620  KPMDKENSELPDVLDDADIGEDTAKEVEDDEADEEEEEVEPTESQVADRVKDKEVEAAKP 679

Query: 1049 FPMIGVQLLKDSDQTT----KSRREISRVPAEYDADLDWFPLDIHEAMEEMRKRKVFDVD 882
              MIGVQLLKDSDQTT    KSRR++SR   E   D DWFPLDIHEA +EMR+RK+FDV 
Sbjct: 680  LQMIGVQLLKDSDQTTPATRKSRRKLSRASMEDSDDDDWFPLDIHEAFKEMRERKIFDVS 739

Query: 881  DMYTIADAWGWTWERELKNKAPQRWSQEWEVELAIKVMNKVIELGGTPTIGDCAMILRAA 702
            DMYTIAD WGWTWE+ELKNK P+ W+QEWEVELAIKVM KVIELGGTPTIGDCAMILRAA
Sbjct: 740  DMYTIADVWGWTWEKELKNKPPRSWTQEWEVELAIKVMLKVIELGGTPTIGDCAMILRAA 799

Query: 701  IRDPIPSAFLTILQTSHGLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLETSGIKVPD 522
            IR P+PSAFL +LQT+H LGYVFGSPLY+E+I LCLDLGELDAAIAIVAD+ETSGI VPD
Sbjct: 800  IRAPLPSAFLKVLQTTHKLGYVFGSPLYNEVIILCLDLGELDAAIAIVADMETSGIAVPD 859

Query: 521  ETLDRVISARQIKD 480
            ETLDRVISARQ+ D
Sbjct: 860  ETLDRVISARQMID 873


>ref|XP_004142596.1| PREDICTED: uncharacterized protein LOC101209618 [Cucumis sativus]
          Length = 1177

 Score =  570 bits (1468), Expect = e-160
 Identities = 297/434 (68%), Positives = 325/434 (74%), Gaps = 4/434 (0%)
 Frame = -2

Query: 1769 RKRWVPRRGKTPLDPDAEGFAYSNPMETSFKQQCLEESKIYHRKLLKVLHNEGPAVLGDL 1590
            RKRWVPR+GKTPLDPDA+GF YSNPMETSFKQ+CLE+ K+YHRK+LK L NEG   L D 
Sbjct: 721  RKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDA 780

Query: 1589 SESEYFRVVERLKKIIKGPEQNALKPKAASKMLVSELKEELEAQGLPTDGTRNVLYQRVQ 1410
            SE++Y RVVERL+KIIKGP+QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQ
Sbjct: 781  SEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQ 840

Query: 1409 KARRINRSRGRPLWVPPXXXXXXXXXXXXXXLILRIKLQEGNTEFWKRRFLGEGLQENYG 1230
            KARRINRSRGRPLWVPP              LI RIKL EGNTEFWKRRFLGEGL  N  
Sbjct: 841  KARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLYSNNV 900

Query: 1229 KQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLNDSQVVDRIKAKEAETAKP 1050
            K                                         ++Q  +R+  KE E  KP
Sbjct: 901  KPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQ-TENQDGERVIKKEVEAKKP 959

Query: 1049 FPMIGVQLLKDSDQTT----KSRREISRVPAEYDADLDWFPLDIHEAMEEMRKRKVFDVD 882
              MIGVQLLKD DQ T    KSRR  SR   E D D DWFP DI EA +E++KRKVFDV 
Sbjct: 960  LQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVS 1019

Query: 881  DMYTIADAWGWTWERELKNKAPQRWSQEWEVELAIKVMNKVIELGGTPTIGDCAMILRAA 702
            DMYTIAD WGWTWERELKN+ P+RWSQEWEVELAIK+M+KVIELGG PTIGDCAMILRAA
Sbjct: 1020 DMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA 1079

Query: 701  IRDPIPSAFLTILQTSHGLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLETSGIKVPD 522
            I+ P+PSAFL ILQT+HGLGYVFGSPLYDE+I+LCLDLGELDAAIAIVADLET+GI V D
Sbjct: 1080 IKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHD 1139

Query: 521  ETLDRVISARQIKD 480
            ETLDRVISARQ  D
Sbjct: 1140 ETLDRVISARQTND 1153


>ref|XP_002522027.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538831|gb|EEF40431.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 889

 Score =  558 bits (1437), Expect = e-156
 Identities = 293/442 (66%), Positives = 329/442 (74%), Gaps = 10/442 (2%)
 Frame = -2

Query: 1769 RKRWVPRRGKTPLDPDAEGFAYSNPMETSFKQQCLEESKIYHRKLLKVLHNEGPAVLGDL 1590
            RKRWVPRRGKTPLDPDA GF YSNPMETSFKQ+C+E+ K++HRKLL+ L NEG A LG+ 
Sbjct: 442  RKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCIEDWKVHHRKLLRTLLNEGLAALGEA 501

Query: 1589 SESEYFRVVERLKKIIKGPEQNALKPKAASKMLVSELKEELEAQGLPTDGTRNVLYQRVQ 1410
            SES+Y RVVERLKKIIKGP+QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQ
Sbjct: 502  SESDYLRVVERLKKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQ 561

Query: 1409 KARRINRSRGRPLWVPPXXXXXXXXXXXXXXLILRIKLQEGNTEFWKRRFLGEGL----- 1245
            KARRINRSRGRPLWVPP              +I RIKL+EGNTEFWKRRFLGEGL     
Sbjct: 562  KARRINRSRGRPLWVPPVEEEEEEVDEELDEIISRIKLEEGNTEFWKRRFLGEGLNGSNL 621

Query: 1244 QENYGKQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLNDSQVVDRI-KAKE 1068
            Q     +                                     +  ++Q VDR+ K KE
Sbjct: 622  QPMSVAKSELPDVLDDVDAIEDADKEVEDEEADDEEEAEAEVEVEQTENQDVDRVVKEKE 681

Query: 1067 AETAKPFPMIGVQLLKDSDQTT----KSRREISRVPAEYDADLDWFPLDIHEAMEEMRKR 900
             E  KP  MIGVQLLKDSD  T    KS+R  +R   E DAD DWFP D  EA +E+R+R
Sbjct: 682  VEAKKPLQMIGVQLLKDSDHLTTRSKKSKRRSARASVEDDADDDWFPEDPFEAFKELRER 741

Query: 899  KVFDVDDMYTIADAWGWTWERELKNKAPQRWSQEWEVELAIKVMNKVIELGGTPTIGDCA 720
            KVFDV+DMYTIAD WGWTWERE+KN+ PQ+WSQEWEVELAIK+M K  +L GTPTIGDCA
Sbjct: 742  KVFDVEDMYTIADVWGWTWEREIKNRPPQKWSQEWEVELAIKLMLKA-QLSGTPTIGDCA 800

Query: 719  MILRAAIRDPIPSAFLTILQTSHGLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLETS 540
            MILRAAIR P+PSAFL ILQT+H LGY FGSPLYDE+ISLCLD+GELDAAIAIVADLE++
Sbjct: 801  MILRAAIRAPMPSAFLKILQTTHSLGYTFGSPLYDEVISLCLDIGELDAAIAIVADLEST 860

Query: 539  GIKVPDETLDRVISARQIKDDP 474
            GI VPD+TLDRVISARQ  D+P
Sbjct: 861  GITVPDQTLDRVISARQAADNP 882


>ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807191 [Glycine max]
          Length = 887

 Score =  555 bits (1429), Expect = e-155
 Identities = 292/442 (66%), Positives = 326/442 (73%), Gaps = 11/442 (2%)
 Frame = -2

Query: 1769 RKRWVPRRGKTPLDPDAEGFAYSNPMETSFKQQCLEESKIYHRKLLKVLHNEGPAVLGD- 1593
            RKRWVPRRGKTPLDPDA GF YSNPMETSFKQ+CLEE K++++KLLK L NEG A LGD 
Sbjct: 435  RKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCLEELKLHNKKLLKTLQNEGLAALGDG 494

Query: 1592 LSESEYFRVVERLKKIIKGPEQNALKPKAASKMLVSELKEELEAQGLPTDGTRNVLYQRV 1413
            +SES+Y RV ERLKK+IKGPEQN LKPKAASKMLVSELKEEL+AQGLP DG RNVLYQRV
Sbjct: 495  VSESDYIRVQERLKKLIKGPEQNVLKPKAASKMLVSELKEELDAQGLPIDGNRNVLYQRV 554

Query: 1412 QKARRINRSRGRPLWVPPXXXXXXXXXXXXXXLILRIKLQEGNTEFWKRRFLGEGLQENY 1233
            QKARRINRSRGRPLWVPP              LI  IKL+EGNTEFWKRRFLGEGL  + 
Sbjct: 555  QKARRINRSRGRPLWVPPVEEEEEEVDEELDALISHIKLEEGNTEFWKRRFLGEGLNGDQ 614

Query: 1232 G-------KQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLNDSQVVDRIKA 1074
                     ++                                    +  ++Q V+RIK 
Sbjct: 615  EMPTDAAESEVPEVLDDVDAIEDAAKEVEDDEADDDEEEAEQAEEEVEPAENQDVNRIKE 674

Query: 1073 KEAETAKPFPMIGVQLLKDSDQTTKSRREISR---VPAEYDADLDWFPLDIHEAMEEMRK 903
            KE E  +P  MIGVQLLKD DQ T + ++  R   V  E D D DW PLD+ EA EEMRK
Sbjct: 675  KEVEAKRPLQMIGVQLLKDIDQPTATSKKFKRSRKVQVEDDDDDDWLPLDLFEAFEEMRK 734

Query: 902  RKVFDVDDMYTIADAWGWTWERELKNKAPQRWSQEWEVELAIKVMNKVIELGGTPTIGDC 723
            RK+FDV DMYT+ADAWGWTWERELK K P+RWSQEWEVELAIKVM KVIELGG PTIGDC
Sbjct: 735  RKIFDVSDMYTLADAWGWTWERELKKKPPRRWSQEWEVELAIKVMQKVIELGGRPTIGDC 794

Query: 722  AMILRAAIRDPIPSAFLTILQTSHGLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLET 543
            AMILRAAIR P+PSAFLTILQT+H LG+ FGSPLYDEIISLC+DLGELDAA+A+VADLET
Sbjct: 795  AMILRAAIRAPLPSAFLTILQTTHSLGFKFGSPLYDEIISLCVDLGELDAAVAVVADLET 854

Query: 542  SGIKVPDETLDRVISARQIKDD 477
            +GI V D TLDRVISA+Q  D+
Sbjct: 855  TGISVSDLTLDRVISAKQRIDN 876


>ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802355 [Glycine max]
          Length = 887

 Score =  547 bits (1410), Expect = e-153
 Identities = 289/442 (65%), Positives = 326/442 (73%), Gaps = 11/442 (2%)
 Frame = -2

Query: 1769 RKRWVPRRGKTPLDPDAEGFAYSNPMETSFKQQCLEESKIYHRKLLKVLHNEGPAVLGD- 1593
            RKRWVPRRGKTPLDPDA GF YSNPMETSFKQ+C+EE K++++KLLK L NEG A LGD 
Sbjct: 435  RKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCMEELKLHNKKLLKTLQNEGLAALGDD 494

Query: 1592 LSESEYFRVVERLKKIIKGPEQNALKPKAASKMLVSELKEELEAQGLPTDGTRNVLYQRV 1413
            +SE +Y RV ERLKK++KGPEQN LKPKAASKMLVSELKEEL+AQGLP DGTRNVLYQRV
Sbjct: 495  VSEFDYIRVQERLKKLMKGPEQNVLKPKAASKMLVSELKEELDAQGLPIDGTRNVLYQRV 554

Query: 1412 QKARRINRSRGRPLWVPPXXXXXXXXXXXXXXLILRIKLQEGNTEFWKRRFLGEGLQ--- 1242
            QKARRINRSRGRPLWVPP              LI RIKL+EGNTEFWKRRFLGEGL    
Sbjct: 555  QKARRINRSRGRPLWVPPVEEEEEEVDEELDALISRIKLEEGNTEFWKRRFLGEGLNGDQ 614

Query: 1241 ----ENYGKQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLNDSQVVDRIKA 1074
                +     +                                    +  ++Q V+RIK 
Sbjct: 615  EMPTDAVQSDVPEVLDDVDAIEDAAKEVEDDEADDEEEEAEQAEEEVEPAENQDVNRIKE 674

Query: 1073 KEAETAKPFPMIGVQLLKDSDQ---TTKSRREISRVPAEYDADLDWFPLDIHEAMEEMRK 903
            KE E  +P  MIGVQLLKD DQ   T+K  +   RV  E D D DW PL++ EA +EMRK
Sbjct: 675  KEVEAKRPLQMIGVQLLKDIDQPTATSKKFKRSRRVQVEDDDDDDWLPLNLFEAFKEMRK 734

Query: 902  RKVFDVDDMYTIADAWGWTWERELKNKAPQRWSQEWEVELAIKVMNKVIELGGTPTIGDC 723
            RK+FDV DMYT+ADAWGWTWERELKNK P+RWSQE EVELAIKVM+KVIELGG PTIGDC
Sbjct: 735  RKIFDVSDMYTLADAWGWTWERELKNKPPRRWSQEREVELAIKVMHKVIELGGRPTIGDC 794

Query: 722  AMILRAAIRDPIPSAFLTILQTSHGLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLET 543
            AMILRAAIR P+PSAFLTILQT+H LG+ FGSPLYDE ISLC+DLGELDAA+A+VADLET
Sbjct: 795  AMILRAAIRAPLPSAFLTILQTTHALGFKFGSPLYDETISLCVDLGELDAAVAVVADLET 854

Query: 542  SGIKVPDETLDRVISARQIKDD 477
            +GI V D TLDRVISA+Q  D+
Sbjct: 855  TGISVSDHTLDRVISAKQRIDN 876

