BLASTX nr result
ID: Angelica23_contig00018044
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00018044 (1806 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002267329.1| PREDICTED: uncharacterized protein LOC100263... 369 2e-99 ref|XP_003552484.1| PREDICTED: uncharacterized protein LOC100810... 321 5e-85 ref|NP_197743.1| smr (Small MutS Related) domain-containing prot... 306 1e-80 ref|XP_003624285.1| hypothetical protein MTR_7g081260 [Medicago ... 306 1e-80 gb|ABN05922.1| Smr protein/MutS2 C-terminal [Medicago truncatula] 306 1e-80 >ref|XP_002267329.1| PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera] Length = 435 Score = 369 bits (946), Expect = 2e-99 Identities = 212/434 (48%), Positives = 283/434 (65%), Gaps = 9/434 (2%) Frame = -2 Query: 1484 MSARKFKNAGWAAYNSKQQQSAGLKSEAGDESYPPISSTTIPKHQTGFVKNFNNSGQSFS 1305 MS+ K+ GWAA++ KQ+Q GL+ E E YPPI S+ + SG+SFS Sbjct: 1 MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLRPCRNSASNGCSGRSFS 60 Query: 1304 SVLLPSANFPTLGANKDIKKSLLIGTSRTNEVILEKCHNNVYQAYEKLKELYPWAEKGLM 1125 S+L+PS NFPTL NKD KK + G S + +N+ A+ KLKELY WA+ L+ Sbjct: 61 SLLVPSVNFPTLEENKDCKKPMQGGNSGNKQQTKVAEVSNLVIAFNKLKELYSWADNSLI 120 Query: 1124 EDIMEAVNNDIDKASELLKEMVSSTSLHDKKETEVMDFKYNAEKFYSENNTFLADIDLTL 945 EDIM AV+NDIDKAS LL MVS+ S + KET +++ + Y EN AD + L Sbjct: 121 EDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSGNPY-ENCKLQADNGVFL 179 Query: 944 RETQDLVRLSYAHQNGIVNKNKKQTDVNAALEEGHPDHSTSEMRSLHSLSNIPVEPEWEE 765 L LS + +++ NK TD + + D + L + +IP+EPEWEE Sbjct: 180 GNGTVLSELSSTIGDLLIDNNKGLTDECGSSGKNLFDDAADMTLILGRMKSIPIEPEWEE 239 Query: 764 DDVYLINRKDAVRTMRLASRHSKAASDAYLRGDHLSAQQYSTKAREEWKEAEKYNAVAAK 585 DDVYL +RKDA+R MR AS+HS+AA++A+LRGDH+SA+Q+S KA++EW +AE+ N+ AA Sbjct: 240 DDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAERLNSKAAN 299 Query: 584 EILRKRNCENDLWKLDLHGLHASEAVQALKEHLQKIESQ------VAPSTNTT--NMLRS 429 EIL RN NDLWKLDLHGLHA+EAVQAL+EHL KIE+Q V+P+ T +LRS Sbjct: 300 EILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKTKVGILRS 359 Query: 428 VSLDSLG-MDREKCNKQKASFRPRSTVLEVITGIGLHSRGQAALPSAIETFLSENRYHYD 252 SL+S +D E+ +KQ R R T L+VITG G HSRGQAALP+A+ +FL+E+ Y ++ Sbjct: 360 PSLESFSCVDNEELDKQWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSFLNEHGYRFE 419 Query: 251 KARPGMIAVRPKFR 210 +ARPG+IAVRPKFR Sbjct: 420 EARPGVIAVRPKFR 433 >ref|XP_003552484.1| PREDICTED: uncharacterized protein LOC100810197 [Glycine max] Length = 432 Score = 321 bits (822), Expect = 5e-85 Identities = 192/438 (43%), Positives = 273/438 (62%), Gaps = 11/438 (2%) Frame = -2 Query: 1487 KMSARKFKNAGWAAYNSKQQQSAGLKSEAGDESYPPISSTTIPKHQTGFVKNFNNSGQSF 1308 KMS K +++GW A++ KQ+++ +SE D+ +P I T +K + + F Sbjct: 11 KMSWAKGQSSGWTAFDLKQRKNKDFESEVDDDPFPAIGPTD------PIIKKNHVPAKPF 64 Query: 1307 SSVLLPSANFPTLGANKDIKKSLLIGTSRTNEVILEKCHNNVYQAYEKLKELYPWAEKGL 1128 SSVLLP+ NFP L + + KK++L G+ + +V A +KL+E + WAE L Sbjct: 65 SSVLLPTKNFPPLNEDGNSKKAML-GSDSDGKYCGATTQEDVNLAIKKLREQHLWAEHSL 123 Query: 1127 MEDIMEAVNNDIDKASELLKEMVSSTSLHDKKETEVMDFKYNAEKFYSENNTFL--ADID 954 ++DI AVNN+IDKA+ LL+ M + + + K + N S++ + D Sbjct: 124 IDDIFTAVNNNIDKATSLLETMAPAVNFEESKVS------INPRSTTSDDTPCMDKTDDS 177 Query: 953 LTLRETQDLVRLSYAHQNGIVNKNKKQTDVNAALEE--GHPDHSTSEMRSLHSLSNIPVE 780 LT + +D + Y + + + +K D NA + D+ +M+ L+S +PVE Sbjct: 178 LTSEKVEDDIPFDYNLVDNLQDNDKDLEDRNAPSGQKLSGVDYLRCKMKLLNS---VPVE 234 Query: 779 PEWEEDDVYLINRKDAVRTMRLASRHSKAASDAYLRGDHLSAQQYSTKAREEWKEAEKYN 600 PEWE+DD+Y+ NRKDA+RTMRLASRHSKAAS A+LRGDH SAQ +S KAR EW AE+ N Sbjct: 235 PEWEDDDIYISNRKDALRTMRLASRHSKAASSAFLRGDHFSAQHHSMKARAEWHTAEELN 294 Query: 599 AVAAKEILRKRNCENDLWKLDLHGLHASEAVQALKEHLQKIESQ-VAPSTNTTNMLRSVS 423 + AAK+IL RN END+W+LDLHGLHA+EA+QAL+EHL +IE Q + S+ T+N ++ Sbjct: 295 SDAAKKILSIRNNENDIWRLDLHGLHATEAIQALQEHLYRIECQGFSKSSATSNGVKENG 354 Query: 422 L--DSLG----MDREKCNKQKASFRPRSTVLEVITGIGLHSRGQAALPSAIETFLSENRY 261 L +LG MDREK + Q A R R L VITGIG HSRG AALP+A+ +FL+ENRY Sbjct: 355 LGHSTLGSFNFMDREKLDTQ-APLRLRPLALHVITGIGNHSRGLAALPAAVRSFLNENRY 413 Query: 260 HYDKARPGMIAVRPKFRR 207 +++ RPG+I V PKFR+ Sbjct: 414 RFEEMRPGVITVWPKFRQ 431 >ref|NP_197743.1| smr (Small MutS Related) domain-containing protein [Arabidopsis thaliana] gi|8809708|dbj|BAA97249.1| unnamed protein product [Arabidopsis thaliana] gi|22531192|gb|AAM97100.1| unknown protein [Arabidopsis thaliana] gi|23198016|gb|AAN15535.1| unknown protein [Arabidopsis thaliana] gi|332005795|gb|AED93178.1| smr (Small MutS Related) domain-containing protein [Arabidopsis thaliana] Length = 435 Score = 306 bits (784), Expect = 1e-80 Identities = 186/440 (42%), Positives = 260/440 (59%), Gaps = 15/440 (3%) Frame = -2 Query: 1484 MSARKFKNAGWAAYNSKQQQSAGLKSEAGDESYPPISSTTIPKH--QTGFVKNFNNSGQS 1311 MS K K++GW A++ KQ+Q GL+SE + +PP+S++ + +N S +S Sbjct: 1 MSWMKGKSSGWTAFDLKQRQKQGLESEVEGDPFPPVSTSVNASFGVRGRLRRNHEPSEKS 60 Query: 1310 FSSVLLPSANFPTLGANKDIKKSLLIGTSRTNEVILEKCHNNVYQAYEKLKELYPWAEKG 1131 FSSVLLP + FP L NKD G R L N+ A+ KLKE+ WA+ Sbjct: 61 FSSVLLPPSRFPALTENKDCGNQERGGCCRRKPDTLSLPVNSHDLAFTKLKEMNSWADDN 120 Query: 1130 LMEDIMEAVNNDIDKASELLKEMVSSTSLHDKKETEVMDFKYNAEKFYSENNTF----LA 963 L+ D++ + +D + A LK MVSS D++ T ++ Y+++ SE TF + Sbjct: 121 LIRDVLLSTEDDFEMALAFLKGMVSSGK-EDEEPTSKIE-GYSSDNRRSEYRTFEKTVTS 178 Query: 962 DIDLTLRETQDLVRLSYAHQNGIVNKNKKQTDVNAALEEGHPDHSTSEMRSLHSLSNIPV 783 + + R T A + + N + VNA+ E PD + + L +IP+ Sbjct: 179 SVKMAARST-----FEDAGKYDLENSDGSSFLVNASDNEKFPDDISELDSIIQRLQSIPI 233 Query: 782 EPEWEEDDVYLINRKDAVRTMRLASRHSKAASDAYLRGDHLSAQQYSTKAREEWKEAEKY 603 EPEWEEDD+YL +RKDA++ MR AS HS+AA +A+ R DH SA+Q+S KARE+W AEK Sbjct: 234 EPEWEEDDLYLSHRKDALKVMRSASNHSRAAQNAFQRYDHASAKQHSDKAREDWLAAEKL 293 Query: 602 NAVAAKEILRKRNCENDLWKLDLHGLHASEAVQALKEHLQKIES------QVAPSTNTTN 441 NA AAK+I+ N +ND+WKLDLHGLHA+EAVQAL+E LQ IE V+P+ + Sbjct: 294 NAEAAKKIIGITNKDNDIWKLDLHGLHATEAVQALQERLQMIEGHFTVNRSVSPNRGRSK 353 Query: 440 --MLRSVSLDSLG-MDREKCNKQKASFRPRSTVLEVITGIGLHSRGQAALPSAIETFLSE 270 LRS S + G +D E + Q+ S R L+VITGIG HSRGQA+LP A++TF + Sbjct: 354 NAALRSASQEPFGRLDEEGMHCQRTSSRELRNSLQVITGIGKHSRGQASLPLAVKTFFED 413 Query: 269 NRYHYDKARPGMIAVRPKFR 210 NRY +D+ RPG+I VRPKFR Sbjct: 414 NRYRFDETRPGVITVRPKFR 433 >ref|XP_003624285.1| hypothetical protein MTR_7g081260 [Medicago truncatula] gi|355499300|gb|AES80503.1| hypothetical protein MTR_7g081260 [Medicago truncatula] Length = 431 Score = 306 bits (784), Expect = 1e-80 Identities = 188/445 (42%), Positives = 273/445 (61%), Gaps = 7/445 (1%) Frame = -2 Query: 1520 VSYQAVSSLHVKMSARKFKNAGWAAYNSKQQQSAGLKSEAGDESYPPISSTTIPKHQTGF 1341 ++ A SSL +R + +GW A++ KQ+ + SE + +PPI S++ +H F Sbjct: 1 MTVSAESSLKKMSWSRGKQPSGWTAFDLKQKMKNSIDSEVDKDPFPPIGSSSSMRHGDKF 60 Query: 1340 VKNFNNSGQSFSSVLLPSANFPTLGANKDIKKSLLIGTSRTNEVILEKCHNNVYQAYEKL 1161 VK + + FSSVL+P+ NFP L + +K++L G+ E +V + L Sbjct: 61 VKKKHVPLKPFSSVLVPNVNFPPLKEAGNGQKAVL-GSDSCGTTAQE----DVNGPTKML 115 Query: 1160 KELYPWAEKGLMEDIMEAVNNDIDKASELLKEMVSSTSLHDKKETEVMDFKYNAEKFYSE 981 KE +PWAE L++DI+ AVNN++DKA LL+ M S+ + + K + S+ Sbjct: 116 KEQHPWAENSLIDDILAAVNNNVDKAVALLETMASAVNFEEHKVLS----NPHPRPLISD 171 Query: 980 NNTFLADIDLTLRETQDLVRLSYAHQNGIVNKNKKQTDVNAALEEGHPDHSTSEMRSLHS 801 + T + +L ++V+ + IV + + D N LE + + Sbjct: 172 DVTRVVKTGESL--ALEMVKDDILFHSNIVGQLQ---DNNKDLENRYAFSGQKFSDVMDL 226 Query: 800 LSNIPVEPEWEEDDVYLINRKDAVRTMRLASRHSKAASDAYLRGDHLSAQQYSTKAREEW 621 L+++PVEPEWEEDD+YL +RKDA++TMR ASRHSKAA++A+L+G+H SAQQ+S +AREEW Sbjct: 227 LNSVPVEPEWEEDDIYLSHRKDALKTMRSASRHSKAAANAFLKGEHFSAQQHSARAREEW 286 Query: 620 KEAEKYNAVAAKEILRKRNCENDLWKLDLHGLHASEAVQALKEHLQKIESQ-----VAPS 456 A+K N+ AA +IL RN +ND+ +LDLHGLHA+EAVQAL+EHL++IESQ +APS Sbjct: 287 HNADKLNSEAATKILSIRNSDNDISRLDLHGLHAAEAVQALQEHLRRIESQGFSKSLAPS 346 Query: 455 TNT-TNMLRSVSLDSLG-MDREKCNKQKASFRPRSTVLEVITGIGLHSRGQAALPSAIET 282 N N +L SL MD E +KQ R RS + VITG+G HSRGQAALP+A+ + Sbjct: 347 NNAKKNGDAHSTLGSLNLMDWENLDKQ-VPLRLRSLAVHVITGVGNHSRGQAALPTAVRS 405 Query: 281 FLSENRYHYDKARPGMIAVRPKFRR 207 FLSENRY +++ RPG+I V PKFR+ Sbjct: 406 FLSENRYRFEEMRPGVITVWPKFRQ 430 >gb|ABN05922.1| Smr protein/MutS2 C-terminal [Medicago truncatula] Length = 432 Score = 306 bits (784), Expect = 1e-80 Identities = 188/445 (42%), Positives = 273/445 (61%), Gaps = 7/445 (1%) Frame = -2 Query: 1520 VSYQAVSSLHVKMSARKFKNAGWAAYNSKQQQSAGLKSEAGDESYPPISSTTIPKHQTGF 1341 ++ A SSL +R + +GW A++ KQ+ + SE + +PPI S++ +H F Sbjct: 2 MTVSAESSLKKMSWSRGKQPSGWTAFDLKQKMKNSIDSEVDKDPFPPIGSSSSMRHGDKF 61 Query: 1340 VKNFNNSGQSFSSVLLPSANFPTLGANKDIKKSLLIGTSRTNEVILEKCHNNVYQAYEKL 1161 VK + + FSSVL+P+ NFP L + +K++L G+ E +V + L Sbjct: 62 VKKKHVPLKPFSSVLVPNVNFPPLKEAGNGQKAVL-GSDSCGTTAQE----DVNGPTKML 116 Query: 1160 KELYPWAEKGLMEDIMEAVNNDIDKASELLKEMVSSTSLHDKKETEVMDFKYNAEKFYSE 981 KE +PWAE L++DI+ AVNN++DKA LL+ M S+ + + K + S+ Sbjct: 117 KEQHPWAENSLIDDILAAVNNNVDKAVALLETMASAVNFEEHKVLS----NPHPRPLISD 172 Query: 980 NNTFLADIDLTLRETQDLVRLSYAHQNGIVNKNKKQTDVNAALEEGHPDHSTSEMRSLHS 801 + T + +L ++V+ + IV + + D N LE + + Sbjct: 173 DVTRVVKTGESL--ALEMVKDDILFHSNIVGQLQ---DNNKDLENRYAFSGQKFSDVMDL 227 Query: 800 LSNIPVEPEWEEDDVYLINRKDAVRTMRLASRHSKAASDAYLRGDHLSAQQYSTKAREEW 621 L+++PVEPEWEEDD+YL +RKDA++TMR ASRHSKAA++A+L+G+H SAQQ+S +AREEW Sbjct: 228 LNSVPVEPEWEEDDIYLSHRKDALKTMRSASRHSKAAANAFLKGEHFSAQQHSARAREEW 287 Query: 620 KEAEKYNAVAAKEILRKRNCENDLWKLDLHGLHASEAVQALKEHLQKIESQ-----VAPS 456 A+K N+ AA +IL RN +ND+ +LDLHGLHA+EAVQAL+EHL++IESQ +APS Sbjct: 288 HNADKLNSEAATKILSIRNSDNDISRLDLHGLHAAEAVQALQEHLRRIESQGFSKSLAPS 347 Query: 455 TNT-TNMLRSVSLDSLG-MDREKCNKQKASFRPRSTVLEVITGIGLHSRGQAALPSAIET 282 N N +L SL MD E +KQ R RS + VITG+G HSRGQAALP+A+ + Sbjct: 348 NNAKKNGDAHSTLGSLNLMDWENLDKQ-VPLRLRSLAVHVITGVGNHSRGQAALPTAVRS 406 Query: 281 FLSENRYHYDKARPGMIAVRPKFRR 207 FLSENRY +++ RPG+I V PKFR+ Sbjct: 407 FLSENRYRFEEMRPGVITVWPKFRQ 431