BLASTX nr result
ID: Angelica22_contig00011495
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00011495
         (1377 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...    298   2e-78
ref|XP_002328150.1| predicted protein [Populus trichocarpa] gi|2...    291   3e-76
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...    290   6e-76
ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containi...    289   1e-75
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]    288   2e-75

>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein
 At4g38150-like [Glycine max]
          Length = 388

 Score = 298 bits (763), Expect = 2e-78
 Identities = 175/345 (50%), Positives = 218/345 (63%), Gaps = 23/345 (6%)
 Frame = +3

Query: 225  VRTRSFIVDSLARSFSSRG--GDFVPKYNKDGARSSRTDQEGRRMPPDPIPNRPLRADKS 398
            VR SF D RS G DF + + S + + E + +PIP+RPLR+ K
Sbjct: 40   VRHFSFTDDCSGRSKQPVGESDDF---FLQQSDSSFKDNGESDQSLSEPIPSRPLRSRKP 96

Query: 399  THQRKTFNQ--DRAS-------VGGGTGATRMD----FSKSGNAFNSRALQQSSQPQAAN 539
            +Q Q DR S G +D SK AF + + ++++ +
Sbjct: 97   VNQPPPRFQEYDRGSHSFPPRFYDNHGGPDELDQTNKSSKIDLAFQNTNVAKTNRDAGQS 156

Query: 540  LD-FLEKFKLGFD-------KGGEKKEPPKMVNNNSNQPTELPQPEDADEIFKKMKETGL 695
            D FL KFKLGFD + K+ + +N NQP + P+DADEIFKKMKETGL
Sbjct: 157  GDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIFKKMKETGL 216

Query: 696  IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 875
            IPNAV+ML GLC+DGLVQEA+KLF LM EKGTIPE++IYTAV+EG+ KAHK DDAKRIFR
Sbjct: 217  IPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFR 276

Query: 876  KMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEK 1055
            KMQ +G++PNA +Y +LIQGL + L DA EF VEMLEAGHSPN+ TF GLVD FC EK
Sbjct: 277  KMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEK 336

Query: 1056 GLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1190
            G+EEA+ I TL +K F +KAVR++LDKK PFS VWEAI GK
Sbjct: 337  GVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381

>ref|XP_002328150.1| predicted protein [Populus trichocarpa]
 gi|222837665|gb|EEE76030.1| predicted protein [Populus trichocarpa]
          Length = 347

 Score = 291 bits (745), Expect = 3e-76
 Identities = 163/340 (47%), Positives = 207/340 (60%), Gaps = 16/340 (4%)
 Frame = +3

Query: 219  LRVRTRSFI-VDSLARSFSS--RGGDFVPKYNKDGARSSRTDQEGRRMPPDPIPNRPLRA 389
            L++ + S I + + R FSS +G +N D + R + PP+PIPNRPLR
Sbjct: 12   LKLHSHSRISLSQILRRFSSSIKGSTAGAGFNFDDEKERRLQNQN---PPEPIPNRPLRG 68

Query: 390  DKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQSSQPQAANLD-FLEKFKL 566
            K T R T + FN + Q+ + D FL+KFKL
Sbjct: 69   PKPNFNNNTNRPARPQPSHHPSTT--------SPFNLQPQTQTHDFNRISDDAFLDKFKL 120

Query: 567  GFDKGGEKKE------------PPKMVNNNSNQPTELPQPEDADEIFKKMKETGLIPNAV 710
            D + PP N ++ + +DA++IF KMKETGLIPNAV
Sbjct: 121  HPDHNNNVNKDAAAADTKAAAAPPPPKNEQASSASTSEPSQDAEQIFNKMKETGLIPNAV 180

Query: 711  SMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRKMQGN 890
            +ML GLC+DGLVQEA+KLF M EKGTIPEV+IYTAV++GFCKAHKLDDAKRIFRKMQ N
Sbjct: 181  AMLDGLCKDGLVQEALKLFGTMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQSN 240

Query: 891  GITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEKGLEEA 1070
            GITPNA +Y +LIQGL + DDA++F EMLE GHSPN+ TF GL+D C+EKG+EEA
Sbjct: 241  GITPNAFSYAVLIQGLSKCNLFDDAIDFCFEMLELGHSPNVTTFVGLIDGLCREKGVEEA 300

Query: 1071 QKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1190
            + +IGTL++K F DKAVR++LDK P S VW+AI GK
Sbjct: 301  RTVIGTLRQKGFHVHDKAVRDFLDKNKPLSSSVWDAIFGK 340

>ref|XP_002868835.1| pentatricopeptide repeat-containing protein
 [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1|
 pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp.
 lyrata]
          Length = 301

 Score = 290 bits (742), Expect = 6e-76
 Identities = 150/285 (52%), Positives = 190/285 (66%)
 Frame = +3

Query: 336  QEGRRMPPDPIPNRPLRADKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQ 515
            QE ++ PP+P+PNRPLR ++S++ + +A D K N +
Sbjct: 38   QEKQQNPPEPLPNRPLRGERSSNSHREPPARQAH----------DLGKIDNTLSDDG--- 84

Query: 516  SSQPQAANLDFLEKFKLGFDKGGEKKEPPKMVNNNSNQPTELPQPEDADEIFKKMKETGL 695
            FLE+FKLG ++ ++ P+ + P PED+DEIFKKMKE GL
Sbjct: 85   ----------FLEQFKLGVNQDSQETPKPEQYPQDPLLP-----PEDSDEIFKKMKEGGL 129

Query: 696  IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 875
            IPNAV+ML GLC+DGLVQEAMKLF LM +KGTIPEV+IYTAV+EGFCKAHK++DAKRIFR
Sbjct: 130  IPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFR 189

Query: 876  KMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEK 1055
            KMQ NGITPNA +YG+L+QGL LDDA+ F EMLE+GHSPN+ TF GLVD C+EK
Sbjct: 190  KMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALCREK 249

Query: 1056 GLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1190
            G+E+AQ I L +K F KAV+E++DK+ PF L WEAI K
Sbjct: 250  GVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294

>ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containing protein
 At4g38150-like [Glycine max]
          Length = 397

 Score = 289 bits (739), Expect = 1e-75
 Identities = 171/354 (48%), Positives = 215/354 (60%), Gaps = 32/354 (9%)
 Frame = +3

Query: 225  VRTRSFIVDSLARSFSSRG--GDFVPK-----YNKDGARSSRTDQEGRRMPPDPIPNRPL 383
            VR SF D RS G DF + + +G+ ++ + +PIP+RPL
Sbjct: 40   VRHFSFTDDRSGRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 99

Query: 384  RADKSTHQR--KTFNQDRASVGGGTGATRMDFSKSG---------NAFNSRALQQSSQPQ 530
            R K +Q + DR G + R D + G ++ A Q ++
Sbjct: 100  RGKKPINQPPPRFREYDR---GSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQGTTNVA 156

Query: 531  AANLD-------FLEKFKLGFD-------KGGEKKEPPKMVNNNSNQPTELPQPEDADEI 668
            N D FL+KFKLGFD + K+ + +N NQP + P+DA+EI
Sbjct: 157  ETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEI 216

Query: 669  FKKMKETGLIPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHK 848
            FKKMKETGLIPNAV+ML GLC+DGLVQEA+KLF L+ EKGTIPE++IYTAV+EG+ KAHK
Sbjct: 217  FKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHK 276

Query: 849  LDDAKRIFRKMQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTG 1028
            DDAKRIFRKMQ +GI+PNA +Y +LIQGL + L DA EF VEMLEAGHSPN+ F G
Sbjct: 277  ADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVG 336

Query: 1029 LVDCFCQEKGLEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1190
            LVD FC EKG+EEA+ I TL EK F +KAV ++LDKK PFS VWEAI GK
Sbjct: 337  LVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 390

>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score = 288 bits (738), Expect = 2e-75
 Identities = 157/284 (55%), Positives = 198/284 (69%), Gaps = 5/284 (1%)
 Frame = +3

Query: 354  PPDPIPNRPLRADKSTHQRKTFNQDRASVGGGTGATRMDFSKSGNAFNSRALQQSSQPQA 533
            PP+PIPNRPLR ++ ++ R G +D + + FN + ++
Sbjct: 97   PPNPIPNRPLRGEQRMNRPPPHIPQRKL--GLPKDEGVDRASQASPFNQPS---PAEKVG 151

Query: 534  ANLD--FLEKFKLGFDKGGEKKEPPKMV---NNNSNQPTELPQPEDADEIFKKMKETGLI 698
            A L+ FLE+FKLG K +E ++N E P P++ADEIF+KMKE+GLI
Sbjct: 152  ATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQP-PQNADEIFRKMKESGLI 210

Query: 699  PNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRK 878
            PNAV+ML GLC+DGLVQEAMKLF LM EKGTIPEV+IYTAV+EGFCKA +LDDA RIFRK
Sbjct: 211  PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRK 270

Query: 879  MQGNGITPNAITYGILIQGLIRKKSLDDALEFSVEMLEAGHSPNLATFTGLVDCFCQEKG 1058
            MQ NGI+PNA +Y +LI+G+ + LD A++F VEMLEAGHSPN+AT L+ FC+EKG
Sbjct: 271  MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 330

Query: 1059 LEEAQKMIGTLKEKSFFFEDKAVREYLDKKGPFSQLVWEAILGK 1190
            +EEA+ +I TLK+K F +DKAVREYLDKKGP S LVWEA GK
Sbjct: 331  VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 374