BLASTX nr result

ID: Cephaelis21_contig00002117

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002117
         (2145 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]   736   0.0  
ref|XP_003548683.1| PREDICTED: U2 snRNP-associated SURP motif-co...   724   0.0  
ref|XP_003531944.1| PREDICTED: U2 snRNP-associated SURP motif-co...   713   0.0  
emb|CBI21155.3| unnamed protein product [Vitis vinifera]              711   0.0  
ref|XP_002324341.1| predicted protein [Populus trichocarpa] gi|2...   711   0.0  

>emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]
          Length = 1384

 Score =  736 bits (1901), Expect = 0.0
 Identities = 379/506 (74%), Positives = 427/506 (84%), Gaps = 3/506 (0%)
 Frame = +2

Query: 2    KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPALKGPESEKEAGSTYAAG 181
            KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRW+PPPLP ++ PE EKE+G+T+AAG
Sbjct: 534  KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAG 593

Query: 182  KSRRVELERTLTDAQRDEFEDMLRALTLERSQIKDAMGFALDNAEAAGEVVEVLTESLTL 361
            +SRRVELERTLTD QRDEFEDMLRALTLERSQIK+AMGFALDNA+AAGE+VEVLTESLTL
Sbjct: 594  RSRRVELERTLTDPQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTL 653

Query: 362  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITA 541
            KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYR +TGRITA
Sbjct: 654  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITA 713

Query: 542  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRAGNSGVIPFHSICGDAPELEQNTSPD 721
            EALKERV+KVLQVWADWFLFSDAYVNGLRATFLR+GNSGV PFHSICGDAPE+E+ TS +
Sbjct: 714  EALKERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSE 773

Query: 722  DRGDGEKINQDAALAIGKGAAMKELLNLPLSELERRCRHNGLSLVGGREMMVARLLYLEE 901
            D G+G K NQDAALA+GKGAAMKELL+LP++ELERRCRHNGLSLVGGRE+MVARLL LEE
Sbjct: 774  DTGEGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEE 833

Query: 902  AEKQRVYDVDDELKXXXXXXXXXXXXXXKKGTSGDTDPVGLSGWNSY-EDGMQLKGKVSV 1078
            AEKQR YD+DD+LK              +K    +T+ VGLSGWN Y ED +Q +GK SV
Sbjct: 834  AEKQRGYDLDDDLKYAQSHSNSGRYPSSRKEIGVETESVGLSGWNRYGEDEIQSQGKGSV 893

Query: 1079 PLPQTNPV-QHDLSSYSSEAENNSILPASKWARXXXXXXXXXXKSAGDLGLTYSSSGSEN 1255
            PL  T P+ Q +L +++++ + + +LPASKWAR          +SA  LGL+YSSSGSEN
Sbjct: 894  PLAPTIPIPQPELKAFTNKGKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSEN 953

Query: 1256 AADGFSRNHEPDFTAVASNSLHLDTG-LNEEQRQKLRRLEVALIEYRESLEERGIKSPEE 1432
            A DG  +  E +F   +S     D+G +NEE RQKLRRLEVALIEYRESLEERGIKS EE
Sbjct: 954  AGDGPXKADEMEFATESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEE 1013

Query: 1433 IEKKVEIHRRRLQSEYGLLDFNEDGS 1510
            IE+KV IHR+RLQSEYGL D NED S
Sbjct: 1014 IERKVAIHRKRLQSEYGLSDSNEDVS 1039


>ref|XP_003548683.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Glycine max]
          Length = 971

 Score =  724 bits (1870), Expect = 0.0
 Identities = 373/506 (73%), Positives = 420/506 (83%), Gaps = 2/506 (0%)
 Frame = +2

Query: 2    KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPALKGPESEKEAGSTYAAG 181
            KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLP  K PE EKE G T+A G
Sbjct: 373  KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPMSKSPEHEKEPGPTHAGG 432

Query: 182  KSRRVELERTLTDAQRDEFEDMLRALTLERSQIKDAMGFALDNAEAAGEVVEVLTESLTL 361
            +SRRVE ERTLTDAQRDEFEDMLRALTLERSQIK+AMGF+LDNA+AAGEVVEVLTESLTL
Sbjct: 433  RSRRVEPERTLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEVVEVLTESLTL 492

Query: 362  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITA 541
            KETPIPTK+ARLMLVSD+LHNSSAPV+NASAYRTKFEATLPDIMESFNDLYR I GRITA
Sbjct: 493  KETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITA 552

Query: 542  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRAGNSGVIPFHSICGDAPELEQNTSPD 721
            EALKERVLKVLQVWADWFLFSDAYVNGLRATFLR GNSGVIPFHSICGDAPE+EQ T+ +
Sbjct: 553  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTASE 612

Query: 722  DRGDGEKINQDAALAIGKGAAMKELLNLPLSELERRCRHNGLSLVGGREMMVARLLYLEE 901
            D   G K NQDAALA+G+GAAMKEL++LPL+ELERRCRHNGLSLVGGREMMVARLL LEE
Sbjct: 613  DMVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEE 672

Query: 902  AEKQRVYDVDDELKXXXXXXXXXXXXXXKKGTSGDTDPVGLSGWNSY-EDGMQLKGKVSV 1078
            AEKQ+ +++DDELK              ++ TS + DPVGLS WN Y ++ +Q +G+ SV
Sbjct: 673  AEKQKGFELDDELKYAHNQVSSGKYSSNQRETSAELDPVGLSAWNHYGDEDIQSQGRSSV 732

Query: 1079 PLPQTNPV-QHDLSSYSSEAENNSILPASKWARXXXXXXXXXXKSAGDLGLTYSSSGSEN 1255
            PL  T P+ Q  L +++ + +N+ +LPASKWAR          +S  +LGL+YSSSGSEN
Sbjct: 733  PLAPTLPIPQPKLKAFTKKEKNDPVLPASKWAR-EDDESDDEQRSGKNLGLSYSSSGSEN 791

Query: 1256 AADGFSRNHEPDFTAVASNSLHLDTGLNEEQRQKLRRLEVALIEYRESLEERGIKSPEEI 1435
              DG  +  E +  A  S S H D+G+NEEQRQKLRRLEVALIEY ESLEERGIK+ EEI
Sbjct: 792  VDDGLVKADESESAADRSFSAHADSGMNEEQRQKLRRLEVALIEYGESLEERGIKNLEEI 851

Query: 1436 EKKVEIHRRRLQSEYGLLDFNEDGSG 1513
            EKKV++HR+RLQ EYGL D  EDG G
Sbjct: 852  EKKVQLHRKRLQVEYGLSDSGEDGQG 877


>ref|XP_003531944.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Glycine max]
          Length = 972

 Score =  713 bits (1840), Expect = 0.0
 Identities = 366/506 (72%), Positives = 417/506 (82%), Gaps = 2/506 (0%)
 Frame = +2

Query: 2    KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPALKGPESEKEAGSTYAAG 181
            KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPP LP  K PE EKE+GST+A G
Sbjct: 374  KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPQLPMSKSPEHEKESGSTHAGG 433

Query: 182  KSRRVELERTLTDAQRDEFEDMLRALTLERSQIKDAMGFALDNAEAAGEVVEVLTESLTL 361
            +SRRVE +RTLTDAQRDEFEDMLRALTLERSQIK+AMGF+LDNA+AAGE+VEVLTESLTL
Sbjct: 434  RSRRVEPDRTLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTL 493

Query: 362  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITA 541
            KETPIPTK+ARLMLVSD+LHNSSAPV+NASAYRTKFEATLPDIMESFNDLYR I GRITA
Sbjct: 494  KETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITA 553

Query: 542  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRAGNSGVIPFHSICGDAPELEQNTSPD 721
            EALKERVLKVLQVWADWFLFSDAYVNGLRATFLR GNSGVIPFHSICGDAPE+EQNT+  
Sbjct: 554  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSK 613

Query: 722  DRGDGEKINQDAALAIGKGAAMKELLNLPLSELERRCRHNGLSLVGGREMMVARLLYLEE 901
            D   G K NQDAALA+G+GAAMKEL++LPL+ELERRCRHNGLSLVGGREMMVARLL LEE
Sbjct: 614  DMVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEE 673

Query: 902  AEKQRVYDVDDELKXXXXXXXXXXXXXXKKGTSGDTDPVGLSGWNSY-EDGMQLKGKVSV 1078
            AEKQR +++D+ELK              ++ TS + DPV    WN Y ++ +Q +G+ SV
Sbjct: 674  AEKQRGFELDEELKYAHNQVSSGKYSSNQRETSEEPDPV----WNHYGDEDLQSQGRSSV 729

Query: 1079 PLPQTNPV-QHDLSSYSSEAENNSILPASKWARXXXXXXXXXXKSAGDLGLTYSSSGSEN 1255
            PL  T P+ Q +L +++ + +N+ +LPASKWA           +S  ++GL+YSSSGSEN
Sbjct: 730  PLSPTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSEN 789

Query: 1256 AADGFSRNHEPDFTAVASNSLHLDTGLNEEQRQKLRRLEVALIEYRESLEERGIKSPEEI 1435
              DG  +  E +  A    S H D+G+NEEQRQKLRRLEVALIEYRESLEERG+K+ EEI
Sbjct: 790  VGDGLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEI 849

Query: 1436 EKKVEIHRRRLQSEYGLLDFNEDGSG 1513
            EKKV+ HR+RLQ EYGL D  EDG G
Sbjct: 850  EKKVQSHRKRLQVEYGLSDSGEDGHG 875


>emb|CBI21155.3| unnamed protein product [Vitis vinifera]
          Length = 941

 Score =  711 bits (1834), Expect = 0.0
 Identities = 370/505 (73%), Positives = 416/505 (82%), Gaps = 2/505 (0%)
 Frame = +2

Query: 2    KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPALKGPESEKEAGSTYAAG 181
            KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRW+PPPLP ++ PE EKE+G+T+AAG
Sbjct: 374  KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAG 433

Query: 182  KSRRVELERTLTDAQRDEFEDMLRALTLERSQIKDAMGFALDNAEAAGEVVEVLTESLTL 361
            +SRRVELERTLTD QRDEFEDMLRALTLERSQIK+AMGFALDNA+AAGE+VEVLTESLTL
Sbjct: 434  RSRRVELERTLTDPQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTL 493

Query: 362  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITA 541
            KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYR +TGRITA
Sbjct: 494  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITA 553

Query: 542  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRAGNSGVIPFHSICGDAPELEQNTSPD 721
            EALKERV+KVLQVWADWFLFSDAYVNGLRATFLR+GNSGV PFHSICGDAPE+E+ TS +
Sbjct: 554  EALKERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSE 613

Query: 722  DRGDGEKINQDAALAIGKGAAMKELLNLPLSELERRCRHNGLSLVGGREMMVARLLYLEE 901
            D G+G K NQDAALA+GKGAAMKELL+LP++ELERRCRHNGLSLVGGRE+MVARLL LEE
Sbjct: 614  DTGEGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEE 673

Query: 902  AEKQRVYDVDDELKXXXXXXXXXXXXXXKKGTSGDTDPVGLSGWNSYEDGMQLKGKVSVP 1081
            AEKQR YD+DD+LK                           S    Y + +Q +GK SVP
Sbjct: 674  AEKQRGYDLDDDLKYAQSH----------------------SNSGRYPNEIQSQGKGSVP 711

Query: 1082 LPQTNPV-QHDLSSYSSEAENNSILPASKWARXXXXXXXXXXKSAGDLGLTYSSSGSENA 1258
            L  T P+ Q +L +++++ + + +LPASKWAR          +SA  LGL+YSSSGSENA
Sbjct: 712  LAPTIPIPQPELKAFTNKGKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENA 771

Query: 1259 ADGFSRNHEPDFTAVASNSLHLDTG-LNEEQRQKLRRLEVALIEYRESLEERGIKSPEEI 1435
             DG S+  E +F   +S     D+G +NEE RQKLRRLEVALIEYRESLEERGIKS EEI
Sbjct: 772  GDGPSKADEMEFATESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEI 831

Query: 1436 EKKVEIHRRRLQSEYGLLDFNEDGS 1510
            E+KV IHR+RLQSEYGL D NED S
Sbjct: 832  ERKVAIHRKRLQSEYGLSDSNEDVS 856


>ref|XP_002324341.1| predicted protein [Populus trichocarpa] gi|222865775|gb|EEF02906.1|
            predicted protein [Populus trichocarpa]
          Length = 955

 Score =  711 bits (1834), Expect = 0.0
 Identities = 368/503 (73%), Positives = 414/503 (82%), Gaps = 2/503 (0%)
 Frame = +2

Query: 2    KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWIPPPLPALKGPESEKEAGSTYAAG 181
            KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRW+PPPLP  K PE EKE+GSTYAAG
Sbjct: 374  KEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWVPPPLPTAKSPEHEKESGSTYAAG 433

Query: 182  KSRRVELERTLTDAQRDEFEDMLRALTLERSQIKDAMGFALDNAEAAGEVVEVLTESLTL 361
            +SRRV+ ERTLTD QRDEFEDMLRALTLERSQIKDAMGF+LDNA+AAGEVVEVLTESLTL
Sbjct: 434  RSRRVDSERTLTDPQRDEFEDMLRALTLERSQIKDAMGFSLDNADAAGEVVEVLTESLTL 493

Query: 362  KETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITA 541
            KETPIPTKVARLMLVSD+LHNSSAPVKNASAYRTKFEA LPDIMESFNDLYR ITGRITA
Sbjct: 494  KETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEAALPDIMESFNDLYRSITGRITA 553

Query: 542  EALKERVLKVLQVWADWFLFSDAYVNGLRATFLRAGNSGVIPFHSICGDAPELEQNTSPD 721
            EALKERVLKVLQVW+DWFLFSDAYVNGLRATFLR+ NSGVIPFHSICGDAPE+E+ +S +
Sbjct: 554  EALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSICGDAPEIEKKSSSE 613

Query: 722  DRGDGEKINQDAALAIGKGAAMKELLNLPLSELERRCRHNGLSLVGGREMMVARLLYLEE 901
            D  +G KINQDAALA+GKGAA+KEL+NLPL+ELERRCRHNGLSLVGGREMMVARLL LEE
Sbjct: 614  DAVEGAKINQDAALAMGKGAAVKELMNLPLAELERRCRHNGLSLVGGREMMVARLLSLEE 673

Query: 902  AEKQRVYDVDDELKXXXXXXXXXXXXXXKKGTSGDTDPVGLSGWNSY-EDGMQLKGKVSV 1078
            AE+QR Y++DD+LK                    ++ PVG +GWN Y ED M  + K SV
Sbjct: 674  AERQRGYELDDDLKI----------------AQSNSKPVGSTGWNVYGEDEMPSQNKGSV 717

Query: 1079 PLPQTNPV-QHDLSSYSSEAENNSILPASKWARXXXXXXXXXXKSAGDLGLTYSSSGSEN 1255
             +  T  + Q +L +++ + +N+ +LPASKWAR          +SA DLGL+YSSSGSEN
Sbjct: 718  SVASTLLIKQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSARDLGLSYSSSGSEN 777

Query: 1256 AADGFSRNHEPDFTAVASNSLHLDTGLNEEQRQKLRRLEVALIEYRESLEERGIKSPEEI 1435
            A DG  +  E +F   A+     D+G+NEEQRQKLRRLEVALIEYRESLEERG+KS  EI
Sbjct: 778  AGDGQGKADEMEFATDANIPTQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSSVEI 837

Query: 1436 EKKVEIHRRRLQSEYGLLDFNED 1504
            E KV IHR+ L+SEYGL   NED
Sbjct: 838  EGKVAIHRKWLESEYGLSSSNED 860

