BLASTX nr result

ID: Dioscorea21_contig00007447 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00007447
         (1541 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation spec...   956   0.0  
emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]   956   0.0  
ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation spec...   941   0.0  
ref|XP_002323824.1| predicted protein [Populus trichocarpa] gi|2...   905   0.0  
ref|NP_176297.1| cleavage and polyadenylation specificity factor...   902   0.0  

>ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            3-I-like [Vitis vinifera]
          Length = 694

 Score =  956 bits (2472), Expect = 0.0
 Identities = 468/496 (94%), Positives = 489/496 (98%)
 Frame = -1

Query: 1490 KRRESTVTREGDQLVITPLGAGNEVGRSCVHMTYRGKTILFDCGIHPAYSGMAALPYFDE 1311
            KR +S++TREGDQL+ITPLGAGNEVGRSCV+M+Y+GKTILFDCGIHPAYSGMAALPYFDE
Sbjct: 11   KRPDSSLTREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDE 70

Query: 1310 IDPSTVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 1131
            IDPST+DVLL+THFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV
Sbjct: 71   IDPSTIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 130

Query: 1130 EDMLYDEQDILRSMDRIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 951
            EDMLYDEQDILRSMD+IEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG
Sbjct: 131  EDMLYDEQDILRSMDKIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 190

Query: 950  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRLVREKRFTDVIHKTIAEGGRVLI 771
            DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPR VREKRFTDVIH TI++GGRVLI
Sbjct: 191  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFTDVIHSTISQGGRVLI 250

Query: 770  PAFALGRAQELLLILDEYWARNPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 591
            PAFALGRAQELLLILDEYW+ +PELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA
Sbjct: 251  PAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 310

Query: 590  NSNPFDFKHISPLKSIENFDDVGPSVVMASPSGLQSGLSRQLFDKWCADKRNACVIPGYV 411
            NSNPFDFKHISPLKSIENF+DVGPSVVMASPSGLQSGLSRQLFD WC+DK+NACVIPGYV
Sbjct: 311  NSNPFDFKHISPLKSIENFNDVGPSVVMASPSGLQSGLSRQLFDMWCSDKKNACVIPGYV 370

Query: 410  VEGTLAKTIITEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 231
            VEGTLAKTII EPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH
Sbjct: 371  VEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 430

Query: 230  GEANEMGRLKQKLITQFADKNTKIITPKNCQSVEMYFSSEKMAKTIGRLADKTPDVGETV 51
            GEANEMGRLKQKLITQFAD+NTKII+PKNCQSVEMYF+SEKMAKTIGRLA+KTP VGETV
Sbjct: 431  GEANEMGRLKQKLITQFADRNTKIISPKNCQSVEMYFNSEKMAKTIGRLAEKTPGVGETV 490

Query: 50   SGLLVKKGFTYQIMAP 3
            SGLLVKKGFTYQIMAP
Sbjct: 491  SGLLVKKGFTYQIMAP 506


>emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]
          Length = 687

 Score =  956 bits (2472), Expect = 0.0
 Identities = 468/496 (94%), Positives = 489/496 (98%)
 Frame = -1

Query: 1490 KRRESTVTREGDQLVITPLGAGNEVGRSCVHMTYRGKTILFDCGIHPAYSGMAALPYFDE 1311
            KR +S++TREGDQL+ITPLGAGNEVGRSCV+M+Y+GKTILFDCGIHPAYSGMAALPYFDE
Sbjct: 4    KRPDSSLTREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDE 63

Query: 1310 IDPSTVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 1131
            IDPST+DVLL+THFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV
Sbjct: 64   IDPSTIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 123

Query: 1130 EDMLYDEQDILRSMDRIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 951
            EDMLYDEQDILRSMD+IEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG
Sbjct: 124  EDMLYDEQDILRSMDKIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 183

Query: 950  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRLVREKRFTDVIHKTIAEGGRVLI 771
            DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPR VREKRFTDVIH TI++GGRVLI
Sbjct: 184  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFTDVIHSTISQGGRVLI 243

Query: 770  PAFALGRAQELLLILDEYWARNPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 591
            PAFALGRAQELLLILDEYW+ +PELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA
Sbjct: 244  PAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 303

Query: 590  NSNPFDFKHISPLKSIENFDDVGPSVVMASPSGLQSGLSRQLFDKWCADKRNACVIPGYV 411
            NSNPFDFKHISPLKSIENF+DVGPSVVMASPSGLQSGLSRQLFD WC+DK+NACVIPGYV
Sbjct: 304  NSNPFDFKHISPLKSIENFNDVGPSVVMASPSGLQSGLSRQLFDMWCSDKKNACVIPGYV 363

Query: 410  VEGTLAKTIITEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 231
            VEGTLAKTII EPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH
Sbjct: 364  VEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 423

Query: 230  GEANEMGRLKQKLITQFADKNTKIITPKNCQSVEMYFSSEKMAKTIGRLADKTPDVGETV 51
            GEANEMGRLKQKLITQFAD+NTKII+PKNCQSVEMYF+SEKMAKTIGRLA+KTP VGETV
Sbjct: 424  GEANEMGRLKQKLITQFADRNTKIISPKNCQSVEMYFNSEKMAKTIGRLAEKTPGVGETV 483

Query: 50   SGLLVKKGFTYQIMAP 3
            SGLLVKKGFTYQIMAP
Sbjct: 484  SGLLVKKGFTYQIMAP 499


>ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            3-I-like [Vitis vinifera]
          Length = 693

 Score =  941 bits (2433), Expect = 0.0
 Identities = 462/496 (93%), Positives = 485/496 (97%), Gaps = 1/496 (0%)
 Frame = -1

Query: 1490 KRRESTVTREGDQLVITPLGAGNEVGRSCVHMTYRGKTILFDCGIHPAYSGMAALPYFDE 1311
            KR +S++TR GDQL+ITPLGAGNEVGRSCV+M+Y+GKTILFDCGIHPAYSGMAALPYFDE
Sbjct: 11   KRPDSSLTR-GDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDE 69

Query: 1310 IDPSTVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 1131
            IDPST+DVLL+THFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV
Sbjct: 70   IDPSTIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 129

Query: 1130 EDMLYDEQDILRSMDRIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 951
            EDMLYDEQDILRSMD+IEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG
Sbjct: 130  EDMLYDEQDILRSMDKIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 189

Query: 950  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRLVREKRFTDVIHKTIAEGGRVLI 771
            DYSREEDRHLRAAEIPQF PDICIIESTYGVQLHQPR VREKRFTDVIH TI++GGRVLI
Sbjct: 190  DYSREEDRHLRAAEIPQFCPDICIIESTYGVQLHQPRHVREKRFTDVIHSTISQGGRVLI 249

Query: 770  PAFALGRAQELLLILDEYWARNPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 591
            PA+ALGRAQELLLILDEYW+ +PELHN+PIYYASPLAKRCMAVYQTYINSMNERIRNQFA
Sbjct: 250  PAYALGRAQELLLILDEYWSNHPELHNVPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 309

Query: 590  NSNPFDFKHISPLKSIENFDDVGPSVVMASPSGLQSGLSRQLFDKWCADKRNACVIPGYV 411
            NSNPFDFKHISPLKSIENF+DVGPSVVMASP GLQSGLSRQLFD WC+DK+NACVIPGYV
Sbjct: 310  NSNPFDFKHISPLKSIENFNDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNACVIPGYV 369

Query: 410  VEGTLAKTIITEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 231
            V GTLAKTII EPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH
Sbjct: 370  VGGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 429

Query: 230  GEANEMGRLKQKLITQFADKNTKIITPKNCQSVEMYFSSEKMAKTIGRLADKTPDVGETV 51
            GEANEMGRLKQKLITQFAD NTKII+PKNCQSVEMYF+SEKMAKTIGRLA+KTP+VGETV
Sbjct: 430  GEANEMGRLKQKLITQFADCNTKIISPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETV 489

Query: 50   SGLLVKKGFTYQIMAP 3
            SGLLVKKGFTYQIMAP
Sbjct: 490  SGLLVKKGFTYQIMAP 505


>ref|XP_002323824.1| predicted protein [Populus trichocarpa] gi|222866826|gb|EEF03957.1|
            predicted protein [Populus trichocarpa]
          Length = 699

 Score =  905 bits (2339), Expect = 0.0
 Identities = 438/497 (88%), Positives = 477/497 (95%), Gaps = 1/497 (0%)
 Frame = -1

Query: 1490 KRRESTVTREG-DQLVITPLGAGNEVGRSCVHMTYRGKTILFDCGIHPAYSGMAALPYFD 1314
            KRR++ VTREG DQL +TPLGAGNEVGRSCV+M+++GKT+LFDCGIHPAYSGMAALPYFD
Sbjct: 11   KRRDAPVTREGGDQLTLTPLGAGNEVGRSCVYMSFKGKTVLFDCGIHPAYSGMAALPYFD 70

Query: 1313 EIDPSTVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVS 1134
            EIDPST+DVLL+THFHLDHAASLPYFLEKTTF+GRVFMTHATKAIYKLLL+DYVKVSKVS
Sbjct: 71   EIDPSTIDVLLVTHFHLDHAASLPYFLEKTTFRGRVFMTHATKAIYKLLLTDYVKVSKVS 130

Query: 1133 VEDMLYDEQDILRSMDRIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYT 954
            VEDML+DE+DI RSMD+IEVIDFHQT++VNGI+FWCYTAGHVLGAAMFMVDIAGVRVLYT
Sbjct: 131  VEDMLFDEKDINRSMDKIEVIDFHQTVDVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYT 190

Query: 953  GDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRLVREKRFTDVIHKTIAEGGRVL 774
            GDYSREEDRHLRAAE+PQFSPDICIIESTYGVQLHQPR +REKRFTDVIH TI+ GGRVL
Sbjct: 191  GDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHIREKRFTDVIHSTISLGGRVL 250

Query: 773  IPAFALGRAQELLLILDEYWARNPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQF 594
            IPAFALGRAQELLLILDEYW+ +PELHNIP+YYASPLAK+CM VYQTYI SMNERIRNQF
Sbjct: 251  IPAFALGRAQELLLILDEYWSNHPELHNIPVYYASPLAKKCMTVYQTYILSMNERIRNQF 310

Query: 593  ANSNPFDFKHISPLKSIENFDDVGPSVVMASPSGLQSGLSRQLFDKWCADKRNACVIPGY 414
            A+SNPF FKHISPL SIE+F DVGPSVVMA+P GLQSGLSRQLFD WC+DK+NACVIPG+
Sbjct: 311  ADSNPFKFKHISPLNSIEDFTDVGPSVVMATPGGLQSGLSRQLFDMWCSDKKNACVIPGF 370

Query: 413  VVEGTLAKTIITEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILV 234
            +VEGTLAKTII EPKEV LMNGLTAPLNMQVHYISFSAHAD+AQTSTFLKELMPPNIILV
Sbjct: 371  LVEGTLAKTIINEPKEVQLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILV 430

Query: 233  HGEANEMGRLKQKLITQFADKNTKIITPKNCQSVEMYFSSEKMAKTIGRLADKTPDVGET 54
            HGEANEMGRLKQKLIT+F D NTKIITPKNCQSVEMYF+SEKMAKT G+LA++TPDVGET
Sbjct: 431  HGEANEMGRLKQKLITEFTDGNTKIITPKNCQSVEMYFNSEKMAKTTGKLAERTPDVGET 490

Query: 53   VSGLLVKKGFTYQIMAP 3
            VSG+LVKKGFTYQIMAP
Sbjct: 491  VSGILVKKGFTYQIMAP 507


>ref|NP_176297.1| cleavage and polyadenylation specificity factor subunit 3-I
            [Arabidopsis thaliana] gi|30696512|ref|NP_849835.1|
            cleavage and polyadenylation specificity factor subunit
            3-I [Arabidopsis thaliana]
            gi|79320389|ref|NP_001031215.1| cleavage and
            polyadenylation specificity factor subunit 3-I
            [Arabidopsis thaliana]
            gi|75262219|sp|Q9C952.1|CPSF3_ARATH RecName:
            Full=Cleavage and polyadenylation specificity factor
            subunit 3-I; AltName: Full=Cleavage and polyadenylation
            specificity factor 73 kDa subunit I; Short=AtCPSF73-I;
            Short=CPSF 73 kDa subunit I
            gi|12323330|gb|AAG51638.1|AC018908_4 putative cleavage
            and polyadenylation specificity factor; 72745-70039
            [Arabidopsis thaliana] gi|23297661|gb|AAN13003.1|
            putative cleavage and polyadenylation specificity factor
            [Arabidopsis thaliana] gi|24415578|gb|AAN41458.1|
            putative cleavage and polyadenylation specificity factor
            73 kDa subunit [Arabidopsis thaliana]
            gi|222422865|dbj|BAH19419.1| AT1G61010 [Arabidopsis
            thaliana] gi|222423059|dbj|BAH19511.1| AT1G61010
            [Arabidopsis thaliana] gi|332195645|gb|AEE33766.1|
            cleavage and polyadenylation specificity factor subunit
            3-I [Arabidopsis thaliana] gi|332195646|gb|AEE33767.1|
            cleavage and polyadenylation specificity factor subunit
            3-I [Arabidopsis thaliana] gi|332195647|gb|AEE33768.1|
            cleavage and polyadenylation specificity factor subunit
            3-I [Arabidopsis thaliana]
          Length = 693

 Score =  902 bits (2331), Expect = 0.0
 Identities = 432/496 (87%), Positives = 477/496 (96%)
 Frame = -1

Query: 1490 KRRESTVTREGDQLVITPLGAGNEVGRSCVHMTYRGKTILFDCGIHPAYSGMAALPYFDE 1311
            KRRE  ++R+GDQL++TPLGAG+EVGRSCV+M++RGK ILFDCGIHPAYSGMAALPYFDE
Sbjct: 9    KRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDE 68

Query: 1310 IDPSTVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLSDYVKVSKVSV 1131
            IDPS++DVLLITHFH+DHAASLPYFLEKTTF GRVFMTHATKAIYKLLL+DYVKVSKVSV
Sbjct: 69   IDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKVSV 128

Query: 1130 EDMLYDEQDILRSMDRIEVIDFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTG 951
            EDML+DEQDI +SMD+IEVIDFHQT+EVNGI+FWCYTAGHVLGAAMFMVDIAGVR+LYTG
Sbjct: 129  EDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILYTG 188

Query: 950  DYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRLVREKRFTDVIHKTIAEGGRVLI 771
            DYSREEDRHLRAAE+PQFSPDICIIEST GVQLHQ R +REKRFTDVIH T+A+GGRVLI
Sbjct: 189  DYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLI 248

Query: 770  PAFALGRAQELLLILDEYWARNPELHNIPIYYASPLAKRCMAVYQTYINSMNERIRNQFA 591
            PAFALGRAQELLLILDEYWA +P+LHNIPIYYASPLAK+CMAVYQTYI SMN+RIRNQFA
Sbjct: 249  PAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFA 308

Query: 590  NSNPFDFKHISPLKSIENFDDVGPSVVMASPSGLQSGLSRQLFDKWCADKRNACVIPGYV 411
            NSNPF FKHISPL SI++F+DVGPSVVMA+P GLQSGLSRQLFD WC+DK+NAC+IPGY+
Sbjct: 309  NSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYM 368

Query: 410  VEGTLAKTIITEPKEVTLMNGLTAPLNMQVHYISFSAHADFAQTSTFLKELMPPNIILVH 231
            VEGTLAKTII EPKEVTLMNGLTAPLNMQVHYISFSAHAD+AQTSTFLKELMPPNIILVH
Sbjct: 369  VEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVH 428

Query: 230  GEANEMGRLKQKLITQFADKNTKIITPKNCQSVEMYFSSEKMAKTIGRLADKTPDVGETV 51
            GEANEM RLKQKL+T+F D NTKI+TPKNC+SVEMYF+SEK+AKTIGRLA+KTPDVG+TV
Sbjct: 429  GEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGDTV 488

Query: 50   SGLLVKKGFTYQIMAP 3
            SG+LVKKGFTYQIMAP
Sbjct: 489  SGILVKKGFTYQIMAP 504

