BLASTX nr result

ID: Angelica23_contig00032878 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00032878
         (1645 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21267.3| unnamed protein product [Vitis vinifera]              420   e-115
ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus c...   396   e-107
ref|XP_002320420.1| predicted protein [Populus trichocarpa] gi|2...   372   e-100
ref|XP_003519934.1| PREDICTED: uncharacterized protein LOC100776...   367   6e-99
dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]        325   3e-86

>emb|CBI21267.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score =  420 bits (1079), Expect = e-115
 Identities = 230/455 (50%), Positives = 305/455 (67%), Gaps = 8/455 (1%)
 Frame = +1

Query: 298  QRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKEQVGI 477
            Q+KKRL++A+ VG +  + +  KRK L S +   +++ +I L WDD  K VVAK+EQ+ I
Sbjct: 3    QQKKRLSAASIVGCSSHQPSRAKRKSLGSTQCGLNMRSHISLNWDDNKKRVVAKREQIAI 62

Query: 478  AQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELLTQFL 657
            + R+LSPF+++VP C NILAD+  IP E FE++ LTEVLS+EVW+THLSEKER+LLTQFL
Sbjct: 63   SWRDLSPFINSVPHCPNILADIWAIPPEIFELKGLTEVLSFEVWQTHLSEKERDLLTQFL 122

Query: 658  PKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTYYSEL 837
            P G D QQ+VQ LL GDNFHFGNP+LKWGASLCSG LHPD VL+KE+ LK +KK YY EL
Sbjct: 123  PSGLDGQQVVQALLAGDNFHFGNPFLKWGASLCSGDLHPDAVLSKEQCLKTNKKAYYLEL 182

Query: 838  QNYHYDMIRDLQILKESWS-QKDLENDIEHNLWRSGKHAEQTLAAHANKSALHNLDDDL- 1011
            Q YH D I +LQ  KE W+  KD E +I  N+WRS KHA++        S  H+ +++L 
Sbjct: 183  QKYHNDNIANLQKWKERWAICKDPEKEIVQNIWRSKKHADE--------SGFHDSEENLA 234

Query: 1012 -TSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKG--FMEDTYDN---ASDGVKVV 1173
             TSESCS AAD+KAC+SD       ++NS R D   +KG   M+D   +   AS+G+KVV
Sbjct: 235  ATSESCSWAADEKACSSD-------NQNSSRKDGELQKGKDLMKDKCKSPVAASNGLKVV 287

Query: 1174 PRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLGDLG 1353
             R R   K    N   GDGAKYMSYIK+SK+QHQ VK SMK   NSIQ RSLN VLGDL 
Sbjct: 288  TRTRKRVKFSKLNIHYGDGAKYMSYIKISKKQHQLVK-SMKQSGNSIQPRSLNRVLGDLD 346

Query: 1354 TYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREMVAXXX 1533
            +++I+PY VFEEEE++K H HWS+LA  D+ AA  N   +Q++++Q+  +L  EM     
Sbjct: 347  SFHIRPYEVFEEEEKRKFHEHWSQLATRDLPAAFANRGKKQLQRRQMTQSLALEMEERLK 406

Query: 1534 XXXXXXXXXTASNLSSDQVVDEDSNPEPSTSTEDE 1638
                        ++  +Q  +  ++ EP+   +D+
Sbjct: 407  PLVEDDEKEGPDSILQEQEDNGATDHEPTMDDDDK 441


>ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus communis]
            gi|223547837|gb|EEF49329.1| hypothetical protein
            RCOM_1445020 [Ricinus communis]
          Length = 858

 Score =  396 bits (1017), Expect = e-107
 Identities = 211/428 (49%), Positives = 281/428 (65%), Gaps = 15/428 (3%)
 Frame = +1

Query: 280  LEMAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAK 459
            + M A  R+KRLN  +  G +  EQ   K+KKL SPK + + K +I LEWD   + VVAK
Sbjct: 1    MPMVADHRRKRLNGVSIAGCSSWEQYKTKKKKLESPKNELNTKSHISLEWDGNKRRVVAK 60

Query: 460  KEQVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERE 639
            +EQ+G+ Q++L  FVD  P CH+ LADV+ IP E FE++NLTE+LSYEVW+THLSE ER+
Sbjct: 61   REQIGLRQKDLREFVDPSPQCHSFLADVLAIPQEIFEVDNLTEILSYEVWKTHLSESERK 120

Query: 640  LLTQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKW------------GASLCSGYLHPDIV 783
             L QFLP+G D  ++VQ LL GDNFHFGNPYLKW            GAS+CSG LHPD V
Sbjct: 121  YLMQFLPRGSDGDKVVQALLTGDNFHFGNPYLKWQVLKYDDSITLEGASVCSGKLHPDAV 180

Query: 784  LNKERLLKASKKTYYSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQT 960
            +++E+ +KA KK YYSE+QNYH DMIR LQ LKE+W S KD E ++   LWRS +  ++ 
Sbjct: 181  VHQEQCIKADKKAYYSEIQNYHNDMIRYLQKLKETWESSKDPEKEVLQKLWRSRRDVDKQ 240

Query: 961  LAAHANKSALHNLDDD--LTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFME 1134
              +HAN+S  H+ ++    TSESCS  A++KAC+SD  N  S+ +  +    I  K F+E
Sbjct: 241  NFSHANESRFHDPEETSAATSESCSLVAEEKACSSDNQN-SSITKGGEVQRRIYEKRFIE 299

Query: 1135 DTYDNASDGVKVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSI 1314
            +     S       R + GEKLQ  N    DG KYMSY+K+SK+QH+ VK SMK    SI
Sbjct: 300  EKRRKPSVSSDDA-RFKRGEKLQKHNIHHTDGVKYMSYLKISKKQHELVK-SMKQSGKSI 357

Query: 1315 QSRSLNHVLGDLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQV 1494
            QS+ LN VLG+  T  +QPY  F +EEQKKL  HW +LAN D+ AA+ N ++RQ ++ ++
Sbjct: 358  QSKCLNRVLGNFDTLQVQPYEKFVKEEQKKLREHWLQLANKDLPAAYENWQNRQFQRCEI 417

Query: 1495 IHALGREM 1518
              +L  +M
Sbjct: 418  AKSLECDM 425


>ref|XP_002320420.1| predicted protein [Populus trichocarpa] gi|222861193|gb|EEE98735.1|
            predicted protein [Populus trichocarpa]
          Length = 912

 Score =  372 bits (954), Expect = e-100
 Identities = 208/418 (49%), Positives = 270/418 (64%), Gaps = 7/418 (1%)
 Frame = +1

Query: 286  MAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKE 465
            MAA QR+KRLN A+  G + +E   +KR K    K   + K  I LEWD   K VVAKKE
Sbjct: 1    MAADQRRKRLNGASLAGCSSREPYRMKRNK---SKNGLNAKSLISLEWDGNRKKVVAKKE 57

Query: 466  QVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELL 645
            Q+GI+QR+L PFVD+V   HN LADV  +P E FE++NL EVLSYE W+ HLSE ER  L
Sbjct: 58   QIGISQRDLMPFVDSVLHYHNPLADVFAVPREIFELQNLAEVLSYETWQNHLSEDERNFL 117

Query: 646  TQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTY 825
             QFLP G   +++V+ LL GDNFHFGNP L+WGASLCSG LHPD+VL +E+ LKA KK +
Sbjct: 118  KQFLPTGLGTEEVVEALLAGDNFHFGNPLLRWGASLCSGNLHPDVVLCQEQHLKADKKAF 177

Query: 826  YSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLW-RSGKHAEQTLAAHANKSALHNL 999
            YS+LQ+YH DMI  LQ LK++W S KD E +I   +W RS   A++ ++    +S  H  
Sbjct: 178  YSKLQDYHIDMITYLQKLKDTWESSKDPEKEILQKIWRRSRSDADKRISPCDTESKFHGT 237

Query: 1000 --DDDLTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFMEDTYDN---ASDGV 1164
              ++  TS SCS  A++K  +SD  N   + ++ +    IC KG M++       ASD  
Sbjct: 238  GENESATSGSCSLVAEEKTSSSDTQN-SHVTKSGEVQKRICEKGSMKEKLRKSLLASDDA 296

Query: 1165 KVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
                R   G+KL+  N    DGAKYMSY+K+SK+QHQ VKN MK    SIQS+SLN VLG
Sbjct: 297  ----RPGKGDKLRKRNIHRSDGAKYMSYLKISKKQHQLVKN-MKQSGKSIQSKSLNCVLG 351

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            DL T ++QPY  F +EEQKKL  HW +LAN D+  AH   R RQ ++Q++  +L  E+
Sbjct: 352  DLDTLHVQPYEEFVKEEQKKLQEHWMQLANKDLPVAHAIWRERQFQRQEITKSLEEEI 409


>ref|XP_003519934.1| PREDICTED: uncharacterized protein LOC100776137 [Glycine max]
          Length = 944

 Score =  367 bits (941), Expect = 6e-99
 Identities = 197/418 (47%), Positives = 275/418 (65%), Gaps = 7/418 (1%)
 Frame = +1

Query: 286  MAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKE 465
            MAA QR+KR+N AN  GY  +EQ+ +KRK L   + D +++P+I +EWD   K VVAK E
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQHRIKRKNLGLVQNDLNMRPHISVEWDGNHKKVVAKWE 60

Query: 466  QVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELL 645
            Q+GI+ R++ PF++ V + H ILADV  +P E FE++NL+EVLSYEVW+THLSE ER LL
Sbjct: 61   QIGISWRQMKPFINLVSNDHKILADVFAVPQEIFELDNLSEVLSYEVWKTHLSENERNLL 120

Query: 646  TQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTY 825
              FLP GF++ Q+V++LL G NF+FGNP+ KWGASLC G LHPD+++++E+ LK  ++ Y
Sbjct: 121  MNFLPSGFESHQVVEELLGGINFNFGNPFSKWGASLCLGSLHPDMIVDQEQHLKTERREY 180

Query: 826  YSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQTLAAHA--NKSALHN 996
            YS + NYH DMI  L  LK+SW S KD E +I   +WR+ KH E+ + +    ++   HN
Sbjct: 181  YSHIHNYHNDMIGFLSKLKKSWQSCKDPEKEIVQKIWRT-KHVEKRMLSKVIESRGYDHN 239

Query: 997  LDDDLTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFMEDTYDNASDGVKVVP 1176
             +   TSESCS  A++KAC+SD     SL ++ +    +  K  ++    N  D +  +P
Sbjct: 240  GNVTGTSESCSWDAEEKACSSDN-QISSLRKDDKLQRRVLEKCIVKGKSRNLMDSLDNMP 298

Query: 1177 ----RHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
                + + G+KL   +  S D  KYMS IK+SK+QH+ VKN MK    SIQSRSLN VLG
Sbjct: 299  NVGEKPKTGDKLPKHSIHSSDSDKYMSCIKISKQQHELVKN-MKQAGKSIQSRSLNRVLG 357

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            +L   ++QPY  F +EEQKKL  HW  L N D+ AA+ N   R+I++  V ++L  EM
Sbjct: 358  NLEKIHVQPYNTFVKEEQKKLQEHWLLLVNKDLPAAYLNWTERRIQRHAVRNSLVAEM 415


>dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]
          Length = 978

 Score =  325 bits (832), Expect = 3e-86
 Identities = 191/418 (45%), Positives = 256/418 (61%), Gaps = 5/418 (1%)
 Frame = +1

Query: 280  LEMAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAK 459
            L MAA QR+KR+NSAN +G + +E    KRKK  SP        +I LEWD     VV+K
Sbjct: 38   LRMAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSK 97

Query: 460  KEQVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERE 639
            KEQVG++ R L  FVD VP   N+LA V  +P ETF++ENL+EVLS EVWR+ LS+ ER 
Sbjct: 98   KEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERN 157

Query: 640  LLTQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKK 819
             L QFLP+G D +Q+VQ LL+G+NFHFGNP L WG ++CSG  HPD ++++E  L+A K+
Sbjct: 158  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKR 217

Query: 820  TYYSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQTLAAHANKSALHN 996
             YYS L+ YH D+I  LQ LKE W S KD E DI   +W   +       A  N S    
Sbjct: 218  RYYSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGN----AQVNGSC--- 270

Query: 997  LDDDLTSESCSSA--ADDKACTSDELNFRSLH--ENSQRSDSICRKGFMEDTYDNASDGV 1164
                LT+ S SS+   DDK  +SD +    +   E  +RS    R G  ++   N  +GV
Sbjct: 271  --QGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSK---RSGLEKEKTQN--NGV 323

Query: 1165 KVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
             V  + R    L   + Q  DGAKYMSY+K+SK+QHQ +  SMK    SIQSR+LN + G
Sbjct: 324  NVGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQ-IVTSMKQSGKSIQSRALNRIFG 382

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            ++ +  +QPY VF EEEQKKL+ HW  L   D+ AA+   +  Q++K+ +I ++GRE+
Sbjct: 383  NIDSLDVQPYGVFVEEEQKKLNAHWLHLVK-DLPAAYAIWKRLQLQKRDIISSMGREL 439


Top