BLASTX nr result

ID: Angelica22_contig00010721 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00010721
         (2036 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276914.1| PREDICTED: U4/U6 small nuclear ribonucleopro...   736   0.0  
emb|CAN64891.1| hypothetical protein VITISV_016440 [Vitis vinifera]   736   0.0  
ref|XP_004162536.1| PREDICTED: U4/U6 small nuclear ribonucleopro...   724   0.0  
ref|XP_004146749.1| PREDICTED: U4/U6 small nuclear ribonucleopro...   724   0.0  
ref|XP_002308919.1| predicted protein [Populus trichocarpa] gi|2...   722   0.0  

>ref|XP_002276914.1| PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like
            protein-like [Vitis vinifera]
          Length = 577

 Score =  736 bits (1901), Expect = 0.0
 Identities = 360/472 (76%), Positives = 411/472 (87%)
 Frame = -2

Query: 1609 TTAEYEISEESRLVRERQEKAMQEFLMKRRAAALAVPTNDMAVRARLRRLGEPMTLFGER 1430
            +  EYEISEESR  RERQEKA QEFLMKRRA+ALAVPTNDMAVR RLRRLGEP+TLFGER
Sbjct: 109  SAVEYEISEESRQFRERQEKAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGER 168

Query: 1429 EMERRDRLRTLMGKLDSEGELERLMRVHXXXXXXXAVSISDGAEDENVQYPFYTEGPKNL 1250
            EMERRDRLR +M KLD+EG+LE+LM+ H        V++ +  E+E +QYPFYTEG K+L
Sbjct: 169  EMERRDRLRMIMAKLDAEGQLEKLMKAHEEEEAAAPVAMEE-VEEETLQYPFYTEGSKSL 227

Query: 1249 LEARKEIAKHSLVRASMRLQQAKRKRDDPDEDLDAEINCALEQAASLTLNCDEIGDDRPL 1070
            LEAR EIAK+S+ RA+ RL +A+RKRDDPDEDLDAE++  L++A SL L+C EIGDDRPL
Sbjct: 228  LEARVEIAKYSIKRAASRLYRARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPL 287

Query: 1069 LGCSFSHDGEMLATCALNGIAKIWSMPQVRRVCTLKGHTVHATDVAFSPINNTLATASAD 890
             GCSFSHDG++LA CAL+G+AKIWSMPQV +V  LKGHT  ATDVAFSP  N LATASAD
Sbjct: 288  SGCSFSHDGKLLAACALSGVAKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASAD 347

Query: 889  RTAKLWNSEGVLLRTFEGHLNRLARIAFHPSGKYFGTTSFDKTWRLWDVETGEELLLQEG 710
            RTA+LWNSEG LL+TFEGHL+RLARIAFHPSGKY GT SFDKTWRLWDVETGEELLLQEG
Sbjct: 348  RTARLWNSEGSLLKTFEGHLDRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEG 407

Query: 709  HSRSVYGLSFNKDGSLAASCGLDSLARVWDLRTGRSVLALEGHVKPIYGISFSPNGYHLA 530
            HSRSVYG+SF++DGSLAASCGLD+L RVWDLR+GRS+LALEGHVKP+ GI FSPNGYHLA
Sbjct: 408  HSRSVYGISFHRDGSLAASCGLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLA 467

Query: 529  TGGDDNTCRIWDLRKKKSLYIIPAHSKLISQVKFEPQDGYFLLTASHDMTAKVWSSRDFK 350
            TG +DNTCRIWDLRKKKSLY+IPAHS L+SQVKFEPQ+GYFL+TAS+DMTAKVWS+RDFK
Sbjct: 468  TGAEDNTCRIWDLRKKKSLYVIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFK 527

Query: 349  PVKTLSGHEDKLTSLDVVADGQYVATVSHDRHIKLWSCRDMEKAAGKGMDID 194
            PVKTLSGHE K+TSLD+  DG  +ATVSHDR IKLWS  ++EK   K MDID
Sbjct: 528  PVKTLSGHEAKVTSLDITEDGHCIATVSHDRTIKLWSSAEIEKE--KAMDID 577


>emb|CAN64891.1| hypothetical protein VITISV_016440 [Vitis vinifera]
          Length = 1088

 Score =  736 bits (1901), Expect = 0.0
 Identities = 360/472 (76%), Positives = 411/472 (87%)
 Frame = -2

Query: 1609 TTAEYEISEESRLVRERQEKAMQEFLMKRRAAALAVPTNDMAVRARLRRLGEPMTLFGER 1430
            +  EYEISEESR  RERQEKA QEFLMKRRA+ALAVPTNDMAVR RLRRLGEP+TLFGER
Sbjct: 620  SAVEYEISEESRQFRERQEKAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGER 679

Query: 1429 EMERRDRLRTLMGKLDSEGELERLMRVHXXXXXXXAVSISDGAEDENVQYPFYTEGPKNL 1250
            EMERRDRLR +M KLD+EG+LE+LM+ H        V++ +  E+E +QYPFYTEG K+L
Sbjct: 680  EMERRDRLRMIMAKLDAEGQLEKLMKAHEEEEAAAPVAMEE-VEEETLQYPFYTEGSKSL 738

Query: 1249 LEARKEIAKHSLVRASMRLQQAKRKRDDPDEDLDAEINCALEQAASLTLNCDEIGDDRPL 1070
            LEAR EIAK+S+ RA+ RL +A+RKRDDPDEDLDAE++  L++A SL L+C EIGDDRPL
Sbjct: 739  LEARVEIAKYSIKRAASRLYRARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPL 798

Query: 1069 LGCSFSHDGEMLATCALNGIAKIWSMPQVRRVCTLKGHTVHATDVAFSPINNTLATASAD 890
             GCSFSHDG++LA CAL+G+AKIWSMPQV +V  LKGHT  ATDVAFSP  N LATASAD
Sbjct: 799  SGCSFSHDGKLLAACALSGVAKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASAD 858

Query: 889  RTAKLWNSEGVLLRTFEGHLNRLARIAFHPSGKYFGTTSFDKTWRLWDVETGEELLLQEG 710
            RTA+LWNSEG LL+TFEGHL+RLARIAFHPSGKY GT SFDKTWRLWDVETGEELLLQEG
Sbjct: 859  RTARLWNSEGSLLKTFEGHLDRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEG 918

Query: 709  HSRSVYGLSFNKDGSLAASCGLDSLARVWDLRTGRSVLALEGHVKPIYGISFSPNGYHLA 530
            HSRSVYG+SF++DGSLAASCGLD+L RVWDLR+GRS+LALEGHVKP+ GI FSPNGYHLA
Sbjct: 919  HSRSVYGISFHRDGSLAASCGLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLA 978

Query: 529  TGGDDNTCRIWDLRKKKSLYIIPAHSKLISQVKFEPQDGYFLLTASHDMTAKVWSSRDFK 350
            TG +DNTCRIWDLRKKKSLY+IPAHS L+SQVKFEPQ+GYFL+TAS+DMTAKVWS+RDFK
Sbjct: 979  TGAEDNTCRIWDLRKKKSLYVIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFK 1038

Query: 349  PVKTLSGHEDKLTSLDVVADGQYVATVSHDRHIKLWSCRDMEKAAGKGMDID 194
            PVKTLSGHE K+TSLD+  DG  +ATVSHDR IKLWS  ++EK   K MDID
Sbjct: 1039 PVKTLSGHEAKVTSLDITEDGHCIATVSHDRTIKLWSSAEIEKE--KAMDID 1088


>ref|XP_004162536.1| PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like
            protein-like [Cucumis sativus]
          Length = 568

 Score =  724 bits (1869), Expect = 0.0
 Identities = 358/496 (72%), Positives = 418/496 (84%), Gaps = 4/496 (0%)
 Frame = -2

Query: 1669 NGDIRXXXXXXXXXXXXXQRT----TAEYEISEESRLVRERQEKAMQEFLMKRRAAALAV 1502
            NG++R              RT    TAEYEISEESR  RER EKAMQEFLMKRRA+ALAV
Sbjct: 77   NGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHEKAMQEFLMKRRASALAV 136

Query: 1501 PTNDMAVRARLRRLGEPMTLFGEREMERRDRLRTLMGKLDSEGELERLMRVHXXXXXXXA 1322
            PTNDMAVRARLRRLGEP+TLFGEREMERRDRLR++M +LD+EG+LE+LM+VH        
Sbjct: 137  PTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLMKVHEEEEAAAT 196

Query: 1321 VSISDGAEDENVQYPFYTEGPKNLLEARKEIAKHSLVRASMRLQQAKRKRDDPDEDLDAE 1142
                + AE+E +QYPFYTEG K LL+AR +IAK+S++RAS RL++AKRKRDDPDED++AE
Sbjct: 197  GGTEE-AEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLERAKRKRDDPDEDVEAE 255

Query: 1141 INCALEQAASLTLNCDEIGDDRPLLGCSFSHDGEMLATCALNGIAKIWSMPQVRRVCTLK 962
            ++ AL QA SL L+C EIGDDRPL GCSFS DG+ LAT +L+G+AK+WSMPQVR+V   K
Sbjct: 256  MDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFK 315

Query: 961  GHTVHATDVAFSPINNTLATASADRTAKLWNSEGVLLRTFEGHLNRLARIAFHPSGKYFG 782
            GHT   TDV FSP+N  LATASADRTA+LW++EG LL+TFEGHL+RLARIAFHPSGKY G
Sbjct: 316  GHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLG 375

Query: 781  TTSFDKTWRLWDVETGEELLLQEGHSRSVYGLSFNKDGSLAASCGLDSLARVWDLRTGRS 602
            TTSFDKTWRLWDVETG ELLLQEGHSRSVYG++F+ DGSL +SCGLD+LARVWDLRTGRS
Sbjct: 376  TTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRS 435

Query: 601  VLALEGHVKPIYGISFSPNGYHLATGGDDNTCRIWDLRKKKSLYIIPAHSKLISQVKFEP 422
            VLALEGHVKP+ G+SFSPNGYHLATGG+DNTCRIWDLRKKKSLYIIPAHS L+SQVK+EP
Sbjct: 436  VLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEP 495

Query: 421  QDGYFLLTASHDMTAKVWSSRDFKPVKTLSGHEDKLTSLDVVADGQYVATVSHDRHIKLW 242
            Q+GYFL+TAS DMTAK+WS+RDFKPVKTLSGHE K+TSLD+++DGQ +ATVSHDR IKLW
Sbjct: 496  QEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 555

Query: 241  SCRDMEKAAGKGMDID 194
            S    +    + MD+D
Sbjct: 556  SVNSKDI---QTMDVD 568


>ref|XP_004146749.1| PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like
            protein-like [Cucumis sativus]
          Length = 569

 Score =  724 bits (1869), Expect = 0.0
 Identities = 358/496 (72%), Positives = 418/496 (84%), Gaps = 4/496 (0%)
 Frame = -2

Query: 1669 NGDIRXXXXXXXXXXXXXQRT----TAEYEISEESRLVRERQEKAMQEFLMKRRAAALAV 1502
            NG++R              RT    TAEYEISEESR  RER EKAMQEFLMKRRA+ALAV
Sbjct: 78   NGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHEKAMQEFLMKRRASALAV 137

Query: 1501 PTNDMAVRARLRRLGEPMTLFGEREMERRDRLRTLMGKLDSEGELERLMRVHXXXXXXXA 1322
            PTNDMAVRARLRRLGEP+TLFGEREMERRDRLR++M +LD+EG+LE+LM+VH        
Sbjct: 138  PTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLMKVHEEEEAAAT 197

Query: 1321 VSISDGAEDENVQYPFYTEGPKNLLEARKEIAKHSLVRASMRLQQAKRKRDDPDEDLDAE 1142
                + AE+E +QYPFYTEG K LL+AR +IAK+S++RAS RL++AKRKRDDPDED++AE
Sbjct: 198  GGTEE-AEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLERAKRKRDDPDEDVEAE 256

Query: 1141 INCALEQAASLTLNCDEIGDDRPLLGCSFSHDGEMLATCALNGIAKIWSMPQVRRVCTLK 962
            ++ AL QA SL L+C EIGDDRPL GCSFS DG+ LAT +L+G+AK+WSMPQVR+V   K
Sbjct: 257  MDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFK 316

Query: 961  GHTVHATDVAFSPINNTLATASADRTAKLWNSEGVLLRTFEGHLNRLARIAFHPSGKYFG 782
            GHT   TDV FSP+N  LATASADRTA+LW++EG LL+TFEGHL+RLARIAFHPSGKY G
Sbjct: 317  GHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLG 376

Query: 781  TTSFDKTWRLWDVETGEELLLQEGHSRSVYGLSFNKDGSLAASCGLDSLARVWDLRTGRS 602
            TTSFDKTWRLWDVETG ELLLQEGHSRSVYG++F+ DGSL +SCGLD+LARVWDLRTGRS
Sbjct: 377  TTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRS 436

Query: 601  VLALEGHVKPIYGISFSPNGYHLATGGDDNTCRIWDLRKKKSLYIIPAHSKLISQVKFEP 422
            VLALEGHVKP+ G+SFSPNGYHLATGG+DNTCRIWDLRKKKSLYIIPAHS L+SQVK+EP
Sbjct: 437  VLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEP 496

Query: 421  QDGYFLLTASHDMTAKVWSSRDFKPVKTLSGHEDKLTSLDVVADGQYVATVSHDRHIKLW 242
            Q+GYFL+TAS DMTAK+WS+RDFKPVKTLSGHE K+TSLD+++DGQ +ATVSHDR IKLW
Sbjct: 497  QEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 556

Query: 241  SCRDMEKAAGKGMDID 194
            S    +    + MD+D
Sbjct: 557  SVNSKDI---QTMDVD 569


>ref|XP_002308919.1| predicted protein [Populus trichocarpa] gi|222854895|gb|EEE92442.1|
            predicted protein [Populus trichocarpa]
          Length = 548

 Score =  722 bits (1864), Expect = 0.0
 Identities = 350/465 (75%), Positives = 408/465 (87%)
 Frame = -2

Query: 1609 TTAEYEISEESRLVRERQEKAMQEFLMKRRAAALAVPTNDMAVRARLRRLGEPMTLFGER 1430
            +T  YEISE SRLVRERQ+KAMQEF+MK+RAAALAVPTNDMAVR RLRRLGEP+TLFGER
Sbjct: 81   STGGYEISEASRLVRERQQKAMQEFMMKKRAAALAVPTNDMAVRTRLRRLGEPITLFGER 140

Query: 1429 EMERRDRLRTLMGKLDSEGELERLMRVHXXXXXXXAVSISDGAEDENVQYPFYTEGPKNL 1250
            EMERRDRLR LM KLDSEG+LE+LM+VH         +  D AE+E VQYPFYTEG K L
Sbjct: 141  EMERRDRLRMLMAKLDSEGQLEKLMKVHEEEEAASTAAAED-AEEEFVQYPFYTEGSKEL 199

Query: 1249 LEARKEIAKHSLVRASMRLQQAKRKRDDPDEDLDAEINCALEQAASLTLNCDEIGDDRPL 1070
            L+AR +IAK+S+ +A++RLQ+A+RKRDDPDED DAEI+ +L QA SL+LNC E+GDDRPL
Sbjct: 200  LDARIDIAKYSISKAALRLQRARRKRDDPDEDEDAEIDWSLNQAESLSLNCSELGDDRPL 259

Query: 1069 LGCSFSHDGEMLATCALNGIAKIWSMPQVRRVCTLKGHTVHATDVAFSPINNTLATASAD 890
             GCSFS DGEMLATC+L+G+AKIWS+PQV +V  LKGH   ATDVAFSP++N LATASAD
Sbjct: 260  SGCSFSCDGEMLATCSLSGVAKIWSVPQVTKVSNLKGHMERATDVAFSPVHNHLATASAD 319

Query: 889  RTAKLWNSEGVLLRTFEGHLNRLARIAFHPSGKYFGTTSFDKTWRLWDVETGEELLLQEG 710
            RTA+LWN++G LL  FEGHL+RLAR+AFHPSGKY GTTSFDKTWRLWD+++G ELLLQEG
Sbjct: 320  RTARLWNTDGSLLMKFEGHLDRLARVAFHPSGKYLGTTSFDKTWRLWDIDSGVELLLQEG 379

Query: 709  HSRSVYGLSFNKDGSLAASCGLDSLARVWDLRTGRSVLALEGHVKPIYGISFSPNGYHLA 530
            HSRS+YG++F+ DGSLAASCGLD+LARVWDLRTGRS++A EGHVKP+ GISFSPNGYHLA
Sbjct: 380  HSRSIYGIAFHHDGSLAASCGLDALARVWDLRTGRSIMAFEGHVKPLLGISFSPNGYHLA 439

Query: 529  TGGDDNTCRIWDLRKKKSLYIIPAHSKLISQVKFEPQDGYFLLTASHDMTAKVWSSRDFK 350
            TGG+DNTCRIWDLRKKKSLY+IPAHS L+SQVKFEPQ+GY+L+T+S+DMTAKVWS RDFK
Sbjct: 440  TGGEDNTCRIWDLRKKKSLYVIPAHSNLVSQVKFEPQEGYYLVTSSYDMTAKVWSGRDFK 499

Query: 349  PVKTLSGHEDKLTSLDVVADGQYVATVSHDRHIKLWSCRDMEKAA 215
             VKTLS HE K+TSLD+ ADG+ +ATVSHDR IKLWS R  EK A
Sbjct: 500  HVKTLSAHEAKVTSLDISADGRLIATVSHDRTIKLWSSRSNEKDA 544


Top