BLASTX nr result
ID: Angelica23_contig00018228
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00018228 (1366 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi... 298 2e-78 ref|XP_002328150.1| predicted protein [Populus trichocarpa] gi|2... 291 3e-76 ref|XP_002868835.1| pentatricopeptide repeat-containing protein ... 290 6e-76 ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containi... 289 1e-75 emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] 288 2e-75 >ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Glycine max] Length = 388 Score = 298 bits (763), Expect = 2e-78 Identities = 175/345 (50%), Positives = 218/345 (63%), Gaps = 23/345 (6%) Frame = +3 Query: 216 VRTRSFIVDSLARSFSSRG--GDFVPKYNKDGARSSRTDQEGRRMPPDPIPNRPLRADKS 389 VR SF D RS G DF + + S + + E + +PIP+RPLR+ K Sbjct: 40 VRHFSFTDDCSGRSKQPVGESDDF---FLQQSDSSFKDNGESDQSLSEPIPSRPLRSRKP 96 Query: 390 THQRKTFNQ--DRAS-------VGGGTGATRMD----FSKSGNAFNSRALQQSSQPQAAN 530 +Q Q DR S G +D SK AF + + ++++ + Sbjct: 97 VNQPPPRFQEYDRGSHSFPPRFYDNHGGPDELDQTNKSSKIDLAFQNTNVAKTNRDAGQS 156 Query: 531 LD-FLEKFKLGFD-------KGGEKKEPPKMVNNNSNQPTELPQPEDADEIFKKMKETGL 686 D FL KFKLGFD + K+ + +N NQP + P+DADEIFKKMKETGL Sbjct: 157 GDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIFKKMKETGL 216 Query: 687 IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 866 IPNAV+ML GLC+DGLVQEA+KLF LM EKGTIPE++IYTAV+EG+ KAHK DDAKRIFR Sbjct: 217 IPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFR 276 Query: 867 KMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEK 1046 KMQ +G++PNA +Y +LIQGL + L DA EF VEMLEAGHSPN+ TF GLVD FC EK Sbjct: 277 KMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEK 336 Query: 1047 GLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1181 G+EEA+ I TL +K F +KAVR++LDKK PFS VWEAI GK Sbjct: 337 GVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381 >ref|XP_002328150.1| predicted protein [Populus trichocarpa] gi|222837665|gb|EEE76030.1| predicted protein [Populus trichocarpa] Length = 347 Score = 291 bits (745), Expect = 3e-76 Identities = 163/340 (47%), Positives = 207/340 (60%), Gaps = 16/340 (4%) Frame = +3 Query: 210 LRVRTRSFI-VDSLARSFSS--RGGDFVPKYNKDGARSSRTDQEGRRMPPDPIPNRPLRA 380 L++ + S I + + R FSS +G +N D + R + PP+PIPNRPLR Sbjct: 12 LKLHSHSRISLSQILRRFSSSIKGSTAGAGFNFDDEKERRLQNQN---PPEPIPNRPLRG 68 Query: 381 DKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQSSQPQAANLD-FLEKFKL 557 K T R T + FN + Q+ + D FL+KFKL Sbjct: 69 PKPNFNNNTNRPARPQPSHHPSTT--------SPFNLQPQTQTHDFNRISDDAFLDKFKL 120 Query: 558 GFDKGGEKKE------------PPKMVNNNSNQPTELPQPEDADEIFKKMKETGLIPNAV 701 D + PP N ++ + +DA++IF KMKETGLIPNAV Sbjct: 121 HPDHNNNVNKDAAAADTKAAAAPPPPKNEQASSASTSEPSQDAEQIFNKMKETGLIPNAV 180 Query: 702 SMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRKMQGN 881 +ML GLC+DGLVQEA+KLF M EKGTIPEV+IYTAV++GFCKAHKLDDAKRIFRKMQ N Sbjct: 181 AMLDGLCKDGLVQEALKLFGTMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQSN 240 Query: 882 GITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEKGLEEA 1061 GITPNA +Y +LIQGL + DDA++F EMLE GHSPN+ TF GL+D C+EKG+EEA Sbjct: 241 GITPNAFSYAVLIQGLSKCNLFDDAIDFCFEMLELGHSPNVTTFVGLIDGLCREKGVEEA 300 Query: 1062 QKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1181 + +IGTL++K F DKAVR++LDK P S VW+AI GK Sbjct: 301 RTVIGTLRQKGFHVHDKAVRDFLDKNKPLSSSVWDAIFGK 340 >ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 301 Score = 290 bits (742), Expect = 6e-76 Identities = 150/285 (52%), Positives = 190/285 (66%) Frame = +3 Query: 327 QEGRRMPPDPIPNRPLRADKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQ 506 QE ++ PP+P+PNRPLR ++S++ + +A D K N + Sbjct: 38 QEKQQNPPEPLPNRPLRGERSSNSHREPPARQAH----------DLGKIDNTLSDDG--- 84 Query: 507 SSQPQAANLDFLEKFKLGFDKGGEKKEPPKMVNNNSNQPTELPQPEDADEIFKKMKETGL 686 FLE+FKLG ++ ++ P+ + P PED+DEIFKKMKE GL Sbjct: 85 ----------FLEQFKLGVNQDSQETPKPEQYPQDPLLP-----PEDSDEIFKKMKEGGL 129 Query: 687 IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 866 IPNAV+ML GLC+DGLVQEAMKLF LM +KGTIPEV+IYTAV+EGFCKAHK++DAKRIFR Sbjct: 130 IPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFR 189 Query: 867 KMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEK 1046 KMQ NGITPNA +YG+L+QGL LDDA+ F EMLE+GHSPN+ TF GLVD C+EK Sbjct: 190 KMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALCREK 249 Query: 1047 GLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1181 G+E+AQ I L +K F KAV+E++DK+ PF L WEAI K Sbjct: 250 GVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294 >ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Glycine max] Length = 397 Score = 289 bits (739), Expect = 1e-75 Identities = 171/354 (48%), Positives = 215/354 (60%), Gaps = 32/354 (9%) Frame = +3 Query: 216 VRTRSFIVDSLARSFSSRG--GDFVPK-----YNKDGARSSRTDQEGRRMPPDPIPNRPL 374 VR SF D RS G DF + + +G+ ++ + +PIP+RPL Sbjct: 40 VRHFSFTDDRSGRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 99 Query: 375 RADKSTHQR--KTFNQDRASVGGGTGATRMDFSKSG---------NAFNSRALQQSSQPQ 521 R K +Q + DR G + R D + G ++ A Q ++ Sbjct: 100 RGKKPINQPPPRFREYDR---GSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQGTTNVA 156 Query: 522 AANLD-------FLEKFKLGFD-------KGGEKKEPPKMVNNNSNQPTELPQPEDADEI 659 N D FL+KFKLGFD + K+ + +N NQP + P+DA+EI Sbjct: 157 ETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEI 216 Query: 660 FKKMKETGLIPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHK 839 FKKMKETGLIPNAV+ML GLC+DGLVQEA+KLF L+ EKGTIPE++IYTAV+EG+ KAHK Sbjct: 217 FKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHK 276 Query: 840 LDDAKRIFRKMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTG 1019 DDAKRIFRKMQ +GI+PNA +Y +LIQGL + L DA EF VEMLEAGHSPN+ F G Sbjct: 277 ADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVG 336 Query: 1020 LVDCFCQEKGLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1181 LVD FC EKG+EEA+ I TL EK F +KAV ++LDKK PFS VWEAI GK Sbjct: 337 LVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 390 >emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] Length = 381 Score = 288 bits (738), Expect = 2e-75 Identities = 157/284 (55%), Positives = 198/284 (69%), Gaps = 5/284 (1%) Frame = +3 Query: 345 PPDPIPNRPLRADKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQSSQPQA 524 PP+PIPNRPLR ++ ++ R G +D + + FN + ++ Sbjct: 97 PPNPIPNRPLRGEQRMNRPPPHIPQRKL--GLPKDEGVDRASQASPFNQPS---PAEKVG 151 Query: 525 ANLD--FLEKFKLGFDKGGEKKEPPKMV---NNNSNQPTELPQPEDADEIFKKMKETGLI 689 A L+ FLE+FKLG K +E ++N E P P++ADEIF+KMKE+GLI Sbjct: 152 ATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQP-PQNADEIFRKMKESGLI 210 Query: 690 PNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRK 869 PNAV+ML GLC+DGLVQEAMKLF LM EKGTIPEV+IYTAV+EGFCKA +LDDA RIFRK Sbjct: 211 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRK 270 Query: 870 MQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEKG 1049 MQ NGI+PNA +Y +LI+G+ + LD A++F VEMLEAGHSPN+AT L+ FC+EKG Sbjct: 271 MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 330 Query: 1050 LEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1181 +EEA+ +I TLK+K F +DKAVREYLDKKGP S LVWEA GK Sbjct: 331 VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 374