BLASTX nr result
ID: Angelica23_contig00032878
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00032878
         (1645 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI21267.3| unnamed protein product [Vitis vinifera]              420   e-115
ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus c...   396   e-107
ref|XP_002320420.1| predicted protein [Populus trichocarpa] gi|2...   372   e-100
ref|XP_003519934.1| PREDICTED: uncharacterized protein LOC100776...   367   6e-99
dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]        325   3e-86

>emb|CBI21267.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 420 bits (1079), Expect = e-115
 Identities = 230/455 (50%), Positives = 305/455 (67%), Gaps = 8/455 (1%)
 Frame = +1

Query: 298  QRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKEQVGI 477
            Q+KKRL++A+ VG + + + KRK L S + +++ +I L WDD K VVAK+EQ+ I
Sbjct: 3    QQKKRLSAASIVGCSSHQPSRAKRKSLGSTQCGLNMRSHISLNWDDNKKRVVAKREQIAI 62

Query: 478  AQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELLTQFL 657
            + R+LSPF+++VP C NILAD+ IP E FE++ LTEVLS+EVW+THLSEKER+LLTQFL
Sbjct: 63   SWRDLSPFINSVPHCPNILADIWAIPPEIFELKGLTEVLSFEVWQTHLSEKERDLLTQFL 122

Query: 658  PKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTYYSEL 837
            P G D QQ+VQ LL GDNFHFGNP+LKWGASLCSG LHPD VL+KE+ LK +KK YY EL
Sbjct: 123  PSGLDGQQVVQALLAGDNFHFGNPFLKWGASLCSGDLHPDAVLSKEQCLKTNKKAYYLEL 182

Query: 838  QNYHYDMIRDLQILKESWS-QKDLENDIEHNLWRSGKHAEQTLAAHANKSALHNLDDDL- 1011
            Q YH D I +LQ KE W+ KD E +I N+WRS KHA++ S H+ +++L
Sbjct: 183  QKYHNDNIANLQKWKERWAICKDPEKEIVQNIWRSKKHADE--------SGFHDSEENLA 234

Query: 1012 -TSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKG--FMEDTYDN---ASDGVKVV 1173
            TSESCS AAD+KAC+SD ++NS R D +KG M+D + AS+G+KVV
Sbjct: 235  ATSESCSWAADEKACSSD-------NQNSSRKDGELQKGKDLMKDKCKSPVAASNGLKVV 287

Query: 1174 PRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLGDLG 1353
            R R K N GDGAKYMSYIK+SK+QHQ VK SMK NSIQ RSLN VLGDL
Sbjct: 288  TRTRKRVKFSKLNIHYGDGAKYMSYIKISKKQHQLVK-SMKQSGNSIQPRSLNRVLGDLD 346

Query: 1354 TYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREMVAXXX 1533
            +++I+PY VFEEEE++K H HWS+LA D+ AA N +Q++++Q+ +L EM
Sbjct: 347  SFHIRPYEVFEEEEKRKFHEHWSQLATRDLPAAFANRGKKQLQRRQMTQSLALEMEERLK 406

Query: 1534 XXXXXXXXXTASNLSSDQVVDEDSNPEPSTSTEDE 1638
            ++ +Q + ++ EP+ +D+
Sbjct: 407  PLVEDDEKEGPDSILQEQEDNGATDHEPTMDDDDK 441

>ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus communis]
 gi|223547837|gb|EEF49329.1| hypothetical protein RCOM_1445020 [Ricinus communis]
          Length = 858

 Score = 396 bits (1017), Expect = e-107
 Identities = 211/428 (49%), Positives = 281/428 (65%), Gaps = 15/428 (3%)
 Frame = +1

Query: 280  LEMAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAK 459
            + M A R+KRLN + G + EQ K+KKL SPK + + K +I LEWD + VVAK
Sbjct: 1    MPMVADHRRKRLNGVSIAGCSSWEQYKTKKKKLESPKNELNTKSHISLEWDGNKRRVVAK 60

Query: 460  KEQVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERE 639
            +EQ+G+ Q++L FVD P CH+ LADV+ IP E FE++NLTE+LSYEVW+THLSE ER+
Sbjct: 61   REQIGLRQKDLREFVDPSPQCHSFLADVLAIPQEIFEVDNLTEILSYEVWKTHLSESERK 120

Query: 640  LLTQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKW------------GASLCSGYLHPDIV 783
            L QFLP+G D ++VQ LL GDNFHFGNPYLKW GAS+CSG LHPD V
Sbjct: 121  YLMQFLPRGSDGDKVVQALLTGDNFHFGNPYLKWQVLKYDDSITLEGASVCSGKLHPDAV 180

Query: 784  LNKERLLKASKKTYYSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQT 960
            +++E+ +KA KK YYSE+QNYH DMIR LQ LKE+W S KD E ++ LWRS + ++
Sbjct: 181  VHQEQCIKADKKAYYSEIQNYHNDMIRYLQKLKETWESSKDPEKEVLQKLWRSRRDVDKQ 240

Query: 961  LAAHANKSALHNLDDD--LTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFME 1134
Sbjct: 241  NFSHANESRFHDPEETSAATSESCSLVAEEKACSSDNQN-SSITKGGEVQRRIYEKRFIE 299

Query: 1135 DTYDNASDGVKVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSI 1314
            + S R + GEKLQ N DG KYMSY+K+SK+QH+ VK SMK SI
Sbjct: 300  EKRRKPSVSSDDA-RFKRGEKLQKHNIHHTDGVKYMSYLKISKKQHELVK-SMKQSGKSI 357

Query: 1315 QSRSLNHVLGDLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQV 1494
            QS+ LN VLG+ T +QPY F +EEQKKL HW +LAN D+ AA+ N ++RQ ++ ++
Sbjct: 358  QSKCLNRVLGNFDTLQVQPYEKFVKEEQKKLREHWLQLANKDLPAAYENWQNRQFQRCEI 417

Query: 1495 IHALGREM 1518
            +L +M
Sbjct: 418  AKSLECDM 425

>ref|XP_002320420.1| predicted protein [Populus trichocarpa]
 gi|222861193|gb|EEE98735.1| predicted protein [Populus trichocarpa]
          Length = 912

 Score = 372 bits (954), Expect = e-100
 Identities = 208/418 (49%), Positives = 270/418 (64%), Gaps = 7/418 (1%)
 Frame = +1

Query: 286  MAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKE 465
            MAA QR+KRLN A+ G + +E +KR K K + K I LEWD K VVAKKE
Sbjct: 1    MAADQRRKRLNGASLAGCSSREPYRMKRNK---SKNGLNAKSLISLEWDGNRKKVVAKKE 57

Query: 466  QVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELL 645
            Q+GI+QR+L PFVD+V HN LADV +P E FE++NL EVLSYE W+ HLSE ER L
Sbjct: 58   QIGISQRDLMPFVDSVLHYHNPLADVFAVPREIFELQNLAEVLSYETWQNHLSEDERNFL 117

Query: 646  TQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTY 825
            QFLP G +++V+ LL GDNFHFGNP L+WGASLCSG LHPD+VL +E+ LKA KK +
Sbjct: 118  KQFLPTGLGTEEVVEALLAGDNFHFGNPLLRWGASLCSGNLHPDVVLCQEQHLKADKKAF 177

Query: 826  YSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLW-RSGKHAEQTLAAHANKSALHNL 999
            YS+LQ+YH DMI LQ LK++W S KD E +I +W RS A++ ++ +S H
Sbjct: 178  YSKLQDYHIDMITYLQKLKDTWESSKDPEKEILQKIWRRSRSDADKRISPCDTESKFHGT 237

Query: 1000 --DDDLTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFMEDTYDN---ASDGV 1164
            ++ TS SCS A++K +SD N + ++ + IC KG M++ ASD
Sbjct: 238  GENESATSGSCSLVAEEKTSSSDTQN-SHVTKSGEVQKRICEKGSMKEKLRKSLLASDDA 296

Query: 1165 KVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
Sbjct: 297  ----RPGKGDKLRKRNIHRSDGAKYMSYLKISKKQHQLVKN-MKQSGKSIQSKSLNCVLG 351

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            DL T ++QPY F +EEQKKL HW +LAN D+ AH R RQ ++Q++ +L E+
Sbjct: 352  DLDTLHVQPYEEFVKEEQKKLQEHWMQLANKDLPVAHAIWRERQFQRQEITKSLEEEI 409

>ref|XP_003519934.1| PREDICTED: uncharacterized protein LOC100776137 [Glycine max]
          Length = 944

 Score = 367 bits (941), Expect = 6e-99
 Identities = 197/418 (47%), Positives = 275/418 (65%), Gaps = 7/418 (1%)
 Frame = +1

Query: 286  MAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAKKE 465
            MAA QR+KR+N AN GY +EQ+ +KRK L + D +++P+I +EWD K VVAK E
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQHRIKRKNLGLVQNDLNMRPHISVEWDGNHKKVVAKWE 60

Query: 466  QVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERELL 645
            Q+GI+ R++ PF++ V + H ILADV +P E FE++NL+EVLSYEVW+THLSE ER LL
Sbjct: 61   QIGISWRQMKPFINLVSNDHKILADVFAVPQEIFELDNLSEVLSYEVWKTHLSENERNLL 120

Query: 646  TQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKKTY 825
            FLP GF++ Q+V++LL G NF+FGNP+ KWGASLC G LHPD+++++E+ LK ++ Y
Sbjct: 121  MNFLPSGFESHQVVEELLGGINFNFGNPFSKWGASLCLGSLHPDMIVDQEQHLKTERREY 180

Query: 826  YSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQTLAAHA--NKSALHN 996
            YS + NYH DMI L LK+SW S KD E +I +WR+ KH E+ + + ++ HN
Sbjct: 181  YSHIHNYHNDMIGFLSKLKKSWQSCKDPEKEIVQKIWRT-KHVEKRMLSKVIESRGYDHN 239

Query: 997  LDDDLTSESCSSAADDKACTSDELNFRSLHENSQRSDSICRKGFMEDTYDNASDGVKVVP 1176
            + TSESCS A++KAC+SD SL ++ + + K ++ N D + +P
Sbjct: 240  GNVTGTSESCSWDAEEKACSSDN-QISSLRKDDKLQRRVLEKCIVKGKSRNLMDSLDNMP 298

Query: 1177 ----RHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
            + + G+KL + S D KYMS IK+SK+QH+ VKN MK SIQSRSLN VLG
Sbjct: 299  NVGEKPKTGDKLPKHSIHSSDSDKYMSCIKISKQQHELVKN-MKQAGKSIQSRSLNRVLG 357

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            +L ++QPY F +EEQKKL HW L N D+ AA+ N R+I++ V ++L EM
Sbjct: 358  NLEKIHVQPYNTFVKEEQKKLQEHWLLLVNKDLPAAYLNWTERRIQRHAVRNSLVAEM 415

>dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]
          Length = 978

 Score = 325 bits (832), Expect = 3e-86
 Identities = 191/418 (45%), Positives = 256/418 (61%), Gaps = 5/418 (1%)
 Frame = +1

Query: 280  LEMAAGQRKKRLNSANGVGYTHQEQNNLKRKKLVSPKYDSSLKPNIILEWDDKGKNVVAK 459
            L MAA QR+KR+NSAN +G + +E KRKK SP +I LEWD VV+K
Sbjct: 38   LRMAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSK 97

Query: 460  KEQVGIAQRELSPFVDAVPSCHNILADVVNIPWETFEIENLTEVLSYEVWRTHLSEKERE 639
            KEQVG++ R L FVD VP N+LA V +P ETF++ENL+EVLS EVWR+ LS+ ER
Sbjct: 98   KEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERN 157

Query: 640  LLTQFLPKGFDAQQIVQDLLEGDNFHFGNPYLKWGASLCSGYLHPDIVLNKERLLKASKK 819
            L QFLP+G D +Q+VQ LL+G+NFHFGNP L WG ++CSG HPD ++++E L+A K+
Sbjct: 158  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKR 217

Query: 820  TYYSELQNYHYDMIRDLQILKESW-SQKDLENDIEHNLWRSGKHAEQTLAAHANKSALHN 996
            YYS L+ YH D+I LQ LKE W S KD E DI +W + A N S
Sbjct: 218  RYYSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGN----AQVNGSC--- 270

Query: 997  LDDDLTSESCSSA--ADDKACTSDELNFRSLH--ENSQRSDSICRKGFMEDTYDNASDGV 1164
            LT+ S SS+ DDK +SD + + E +RS R G ++ N +GV
Sbjct: 271  --QGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSK---RSGLEKEKTQN--NGV 323

Query: 1165 KVVPRHRLGEKLQNGNHQSGDGAKYMSYIKVSKEQHQRVKNSMKHFSNSIQSRSLNHVLG 1344
            V + R L + Q DGAKYMSY+K+SK+QHQ + SMK SIQSR+LN + G
Sbjct: 324  NVGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQ-IVTSMKQSGKSIQSRALNRIFG 382

Query: 1345 DLGTYYIQPYRVFEEEEQKKLHVHWSKLANVDIAAAHTNLRSRQIKKQQVIHALGREM 1518
            ++ + +QPY VF EEEQKKL+ HW L D+ AA+ + Q++K+ +I ++GRE+
Sbjct: 383  NIDSLDVQPYGVFVEEEQKKLNAHWLHLVK-DLPAAYAIWKRLQLQKRDIISSMGREL 439