BLASTX nr result

ID: Coptis24_contig00018067

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00018067
         (1467 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002303105.1| predicted protein [Populus trichocarpa] gi|2...   461   e-127
ref|XP_002269637.1| PREDICTED: tetratricopeptide repeat protein ...   461   e-127
ref|NP_563702.1| tetratricopeptide repeat-containing protein [Ar...   441   e-121
ref|XP_002889492.1| binding protein [Arabidopsis lyrata subsp. l...   437   e-120
ref|XP_003541377.1| PREDICTED: tetratricopeptide repeat protein ...   436   e-120

>ref|XP_002303105.1| predicted protein [Populus trichocarpa] gi|222844831|gb|EEE82378.1|
            predicted protein [Populus trichocarpa]
          Length = 370

 Score =  461 bits (1187), Expect = e-127
 Identities = 241/372 (64%), Positives = 287/372 (77%), Gaps = 21/372 (5%)
 Frame = +2

Query: 35   MALMMGGGSEALNESERADLDAIEALKVSGAHELKEQGNQYVKKGKKHYSDAIDCYTRAI 214
            MAL M  GSE   ESE ADL AI ALK S A ELKE+GN+YVK GKKHYSDAI+CYTRAI
Sbjct: 1    MALWMEAGSEPKTESEIADLQAISALKHSTALELKEKGNEYVKMGKKHYSDAIECYTRAI 60

Query: 215  NQKALSESDTSVLFANRAHVNLLLGNSRRALTDAEEAIKLSPSNFKAHYRAAKAALSLDL 394
            NQ ALS+SD S++++NRAHVNLLLGN RRALTDA+EAIKL P+N KA YRAAKA+LSL L
Sbjct: 61   NQDALSDSDNSIVYSNRAHVNLLLGNYRRALTDAQEAIKLCPTNVKAMYRAAKASLSLSL 120

Query: 395  LTEATSFCQAGIEQSVDNDELKKLLKQISLRQSQRENRDAQAAKALVASKDLASAIEDRG 574
            L EA SF + G+EQ  DN+ELKKL KQI+L + + + R+A+ +KA+  +KDL SAIEDRG
Sbjct: 121  LVEAKSFSENGLEQDPDNEELKKLAKQINLVKVEHDKREAEVSKAVSEAKDLLSAIEDRG 180

Query: 575  LKLGRAMYQELTGLKKPILDVNNILHWPVLLLYAEVMSSDIIEDFCETETFSFHLDM--- 745
            LK+G+AM+ EL GL+KP+LD N ILHWPVLLLYAEVMSSD IEDFCET+ F  HLDM   
Sbjct: 181  LKVGKAMFGELVGLRKPVLDKNKILHWPVLLLYAEVMSSDFIEDFCETDMFLAHLDMISI 240

Query: 746  --------MLSEGCPPLPWDNENAYTRDAVELYYETG-GVPSTKADVLQFLLEGTAGSLA 898
                    M SE CPPLPWD EN YTR+AVELYYE G GVP +K  +L +LL+GT+G+  
Sbjct: 241  FLAFFLSNMFSESCPPLPWDTENNYTREAVELYYEAGSGVPLSKKKILHYLLDGTSGANV 300

Query: 899  ES---------SHDEIKGNLPTKWVKVDERKILCDVLRQPDFVIPRIPVFYVVSKCSSFY 1051
            ES         SH   KG+  +KWVKV+E+++LCDVL++PDF+I  IPVFYVVSK SSFY
Sbjct: 301  ESVDEEKDAIESHGSGKGS--SKWVKVNEKRMLCDVLKEPDFIISGIPVFYVVSKRSSFY 358

Query: 1052 KDFKAGKWVPPP 1087
            K+FKAGKW  PP
Sbjct: 359  KEFKAGKWSLPP 370


>ref|XP_002269637.1| PREDICTED: tetratricopeptide repeat protein 4 homolog [Vitis
            vinifera] gi|297744240|emb|CBI37210.3| unnamed protein
            product [Vitis vinifera]
          Length = 364

 Score =  461 bits (1185), Expect = e-127
 Identities = 238/363 (65%), Positives = 283/363 (77%), Gaps = 13/363 (3%)
 Frame = +2

Query: 35   MALMMGGGSEALNESERADLDAIEALKVSGAHELKEQGNQYVKKGKKHYSDAIDCYTRAI 214
            MAL M  GSE   +SE ADLDAI ALK S A ELKE+GNQYVK GKKHY+DAIDCYT+AI
Sbjct: 1    MALWMETGSEPNTQSEIADLDAITALKESAALELKEKGNQYVKLGKKHYADAIDCYTKAI 60

Query: 215  NQKALSESDTSVLFANRAHVNLLLGNSRRALTDAEEAIKLSPSNFKAHYRAAKAALSLDL 394
            NQKALS+ + SV++ANRAHVNLLLGN RRAL DA+EAIKL P+N KA YRA KA+LSLDL
Sbjct: 61   NQKALSDPENSVIYANRAHVNLLLGNYRRALMDAQEAIKLCPTNVKAFYRAVKASLSLDL 120

Query: 395  LTEATSFCQAGIEQSVDNDELKKLLKQISLRQSQRENRDAQAAKALVASKDLASAIEDRG 574
            L EA S+C+ G+E+  +N+ELKKL +QI  + S+RE+ +AQ +KA+  +K L SAIE+RG
Sbjct: 121  LGEAKSYCENGLERDPNNEELKKLARQIDAQNSEREHHEAQVSKAVATAKHLVSAIENRG 180

Query: 575  LKLGRAMYQELTGLKKPILDVNNILHWPVLLLYAEVMSSDIIEDFCETETFSFHLDMMLS 754
            LK+G+A++QELTGL+KPILD NNILHWPVLLLYAEVMSSD IEDFCET+ FS HLD+M S
Sbjct: 181  LKIGKAVFQELTGLRKPILDTNNILHWPVLLLYAEVMSSDFIEDFCETDIFSAHLDIMFS 240

Query: 755  EGCPPLPWDNENAYTRDAVELYYETG-GVPSTKADVLQFLLEGTAGSLAESSHDEIK--- 922
            E CPPLPWD EN YTR+AVELYYE G GV   KA  L +LLEGT GS  ES  DE K   
Sbjct: 241  ESCPPLPWDKENNYTREAVELYYEAGSGVCLPKAKFLSYLLEGTVGSHVESIGDEEKDVI 300

Query: 923  ---------GNLPTKWVKVDERKILCDVLRQPDFVIPRIPVFYVVSKCSSFYKDFKAGKW 1075
                     G   +KWVKV+E++ L DVL++P+ +IP IPVFYVVSK SSFYK+F+ GKW
Sbjct: 301  ECSLDVTSAGKGSSKWVKVNEKRTLNDVLKEPNLMIPGIPVFYVVSKRSSFYKEFRDGKW 360

Query: 1076 VPP 1084
              P
Sbjct: 361  SLP 363


>ref|NP_563702.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332189539|gb|AEE27660.1| tetratricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 360

 Score =  441 bits (1134), Expect = e-121
 Identities = 220/361 (60%), Positives = 282/361 (78%), Gaps = 11/361 (3%)
 Frame = +2

Query: 35   MALMMGGGSEALNESERADLDAIEALKVSGAHELKEQGNQYVKKGKKHYSDAIDCYTRAI 214
            MAL M  G+    E E+ADL+AI ALK S A E KE+GN+ V+KGKKHYS+AIDCYT+AI
Sbjct: 1    MALWMDAGATPTTEKEKADLEAISALKESTAIEFKEEGNECVRKGKKHYSEAIDCYTKAI 60

Query: 215  NQKALSESDTSVLFANRAHVNLLLGNSRRALTDAEEAIKLSPSNFKAHYRAAKAALSLDL 394
            +Q  LS+S+TS+LF+NR+HVNLLLGN RRALTDAEE+++LSP N KA YRAAKA++SLDL
Sbjct: 61   SQGVLSDSETSILFSNRSHVNLLLGNYRRALTDAEESMRLSPHNVKAVYRAAKASMSLDL 120

Query: 395  LTEATSFCQAGIEQSVDNDELKKLLKQISLRQSQRENRDAQAAKALVASKDLASAIEDRG 574
            L EA S+C+ GIE    N+++KKLLK ++ ++ ++E  +AQA++A+V +K   SAIE+RG
Sbjct: 121  LNEAKSYCEKGIENDPSNEDMKKLLKLVNSKKQEKEQHEAQASQAVVEAKACLSAIENRG 180

Query: 575  LKLGRAMYQELTGLKKPILDVNNILHWPVLLLYAEVMSSDIIEDFCETETFSFHLDMMLS 754
            +K+G+AMY+ELTGLKKP+LD NNILHWPVLLLYAE M+SD +EDFCET+ F+ HLDMM S
Sbjct: 181  VKIGKAMYRELTGLKKPMLDKNNILHWPVLLLYAEAMTSDFVEDFCETDMFATHLDMMFS 240

Query: 755  EGCPPLPWDNENAYTRDAVELYYE-TGGVPSTKADVLQFLLEGTAGSLAESSHDE----- 916
            E  PPLPWD  N Y+RD +ELYYE + G P  ++ VLQ+LLEGT GS AE++ +E     
Sbjct: 241  EDSPPLPWDKNNEYSRDVIELYYEASSGTPLPRSRVLQYLLEGTKGSQAETTGEEDTSAT 300

Query: 917  -----IKGNLPTKWVKVDERKILCDVLRQPDFVIPRIPVFYVVSKCSSFYKDFKAGKWVP 1081
                 +KG+  +  VKV+ER+ L DVL++P FVIP IPVFY+VSK S FYKDF AGKW P
Sbjct: 301  KTPSYLKGS--SGMVKVNERRTLHDVLKEPKFVIPEIPVFYIVSKRSKFYKDFTAGKWTP 358

Query: 1082 P 1084
            P
Sbjct: 359  P 359


>ref|XP_002889492.1| binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297335334|gb|EFH65751.1| binding protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 360

 Score =  437 bits (1124), Expect = e-120
 Identities = 218/361 (60%), Positives = 280/361 (77%), Gaps = 11/361 (3%)
 Frame = +2

Query: 35   MALMMGGGSEALNESERADLDAIEALKVSGAHELKEQGNQYVKKGKKHYSDAIDCYTRAI 214
            MAL M  G+  + E+E+ADL+AI ALK S A E KEQGN  V+KGKKHYS+AIDCYT+AI
Sbjct: 1    MALWMDAGATPITENEKADLEAISALKESAAIEFKEQGNDCVRKGKKHYSEAIDCYTKAI 60

Query: 215  NQKALSESDTSVLFANRAHVNLLLGNSRRALTDAEEAIKLSPSNFKAHYRAAKAALSLDL 394
            NQ  LS+S+TS+LF+NR+HVNLLLGN RRALTDAEE+++L P N KA YRAAKA++SLDL
Sbjct: 61   NQGVLSDSETSILFSNRSHVNLLLGNYRRALTDAEESMRLCPHNVKAVYRAAKASMSLDL 120

Query: 395  LTEATSFCQAGIEQSVDNDELKKLLKQISLRQSQRENRDAQAAKALVASKDLASAIEDRG 574
            L EA S+C+ GIE    N+++KKLLK ++ ++ ++E  +AQ ++A+V +K   SAIE+RG
Sbjct: 121  LNEAKSYCEKGIENDPSNEDMKKLLKLVNSKKQEKEQHEAQVSRAVVEAKACLSAIENRG 180

Query: 575  LKLGRAMYQELTGLKKPILDVNNILHWPVLLLYAEVMSSDIIEDFCETETFSFHLDMMLS 754
            +K+G+AMY+ELTGLKKP+LD NNILHWPVLLLYAE M+SD +EDFCET+ F+ HLDMM S
Sbjct: 181  VKIGKAMYRELTGLKKPMLDKNNILHWPVLLLYAEAMTSDFVEDFCETDMFATHLDMMFS 240

Query: 755  EGCPPLPWDNENAYTRDAVELYYE-TGGVPSTKADVLQFLLEGTAGSLAESSHDE----- 916
            E  PPLPWD  N Y+RD +ELYYE + G P  ++ VLQ+LLE T GS AE++ +E     
Sbjct: 241  EDSPPLPWDKNNEYSRDVIELYYEASSGTPLPRSRVLQYLLESTKGSQAETTGEEDTSVT 300

Query: 917  -----IKGNLPTKWVKVDERKILCDVLRQPDFVIPRIPVFYVVSKCSSFYKDFKAGKWVP 1081
                 +KG+  +  VKV+ER+ L DVL++P FVIP IPVFY++SK S FYKDF AGKW P
Sbjct: 301  KTPSYMKGS--SGMVKVNERRTLHDVLKEPKFVIPEIPVFYILSKRSKFYKDFIAGKWSP 358

Query: 1082 P 1084
            P
Sbjct: 359  P 359


>ref|XP_003541377.1| PREDICTED: tetratricopeptide repeat protein 4 homolog [Glycine max]
          Length = 360

 Score =  436 bits (1122), Expect = e-120
 Identities = 228/359 (63%), Positives = 278/359 (77%), Gaps = 9/359 (2%)
 Frame = +2

Query: 35   MALMMGGGSEALNESERADLDAIEALKVSGAHELKEQGNQYVKKGKKHYSDAIDCYTRAI 214
            MAL M  GSE L E+E+ADL+AI ALK S A E KE+GNQYVK GKKHYSDAID YTRAI
Sbjct: 1    MALWMEKGSEPLTETEKADLEAIAALKESAAFEFKEKGNQYVKMGKKHYSDAIDYYTRAI 60

Query: 215  NQKALSESDTSVLFANRAHVNLLLGNSRRALTDAEEAIKLSPSNFKAHYRAAKAALSLDL 394
            +QKALS+S+TS+LFANRAHVNLLLGN RRALTD+ EA+KL PSN KA YRAAKA+LSL++
Sbjct: 61   DQKALSDSETSILFANRAHVNLLLGNLRRALTDSNEALKLCPSNIKAIYRAAKASLSLNM 120

Query: 395  LTEATSFCQAGIEQSVDNDELKKLLKQISLRQSQRENRDAQAAKALVASKDLASAIEDRG 574
            L EA  +C  G++   +N++LKKL +QI L+ S++E  +A+A+KA+  +K L SAIE+RG
Sbjct: 121  LAEAREYCLKGLQFDPNNEDLKKLDRQIGLKISEKEKHEAEASKAVAETKKLVSAIENRG 180

Query: 575  LKLGRAMYQELTGLKKPILDVNNILHWPVLLLYAEVMSSDIIEDFCETETFSFHLDMMLS 754
            LK+G+AMY ELTGL+KP+LD +NILHWPVLLLYAEVMSSD IEDFCET+ FS HLDM+ S
Sbjct: 181  LKIGKAMYLELTGLRKPVLDKSNILHWPVLLLYAEVMSSDFIEDFCETDMFSVHLDMIFS 240

Query: 755  EGCPPLPWDNENAYTRDAVELYYETG-GVPSTKADVLQFLLEGTAGSLAESSHDEIKGNL 931
            E   PL WD EN Y R+ +ELYYETG G+  +K  +L  LLEGTA +  E   DE K  +
Sbjct: 241  ED-QPLSWDVENNYKREFIELYYETGSGLCLSKEKLLHCLLEGTAAAHREGVGDEEKDTV 299

Query: 932  --------PTKWVKVDERKILCDVLRQPDFVIPRIPVFYVVSKCSSFYKDFKAGKWVPP 1084
                      KW+KV+ER+ L DVL++P+F+IP IPVFYVVSK SSFY  FKAGKW PP
Sbjct: 300  EDYKQHMGSPKWIKVNERRTLHDVLKEPNFIIPGIPVFYVVSKRSSFYGKFKAGKWAPP 358

