BLASTX nr result
ID: Angelica22_contig00006997
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00006997
         (2253 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0
ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2...   690   0.0
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   682   0.0
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   671   0.0
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   669   0.0

>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Vitis vinifera]
          Length = 511

 Score = 711 bits (1836), Expect = 0.0
 Identities = 364/514 (70%), Positives = 421/514 (81%), Gaps = 8/514 (1%)
 Frame = +3

Query: 333  GFLAGIDHNSKLGFT--NSFYFPQYKFVGARFCTKLKTPNSRSWF------SGTLCSLKT 488
            GF + + ++LGFT +SF + + + +F SRS+ + T+C+ +
Sbjct: 6    GFASSLMSPTELGFTLSSSFSIQRPRLIVPKF--------SRSFLGEYCSRATTICNHQN 57

Query: 489  HNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMAREPSDVLEEMN 668
            VV R K+REF KSVELD+F+TSDDEDEMSEGFFEAIEELERM REPSDVLEEMN
Sbjct: 58   PRFVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEMN 117

Query: 669  DRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLIEE 848
            DRLS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMVS+MCSWVKKLIE
Sbjct: 118  DRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEG 177

Query: 849  KNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSDDD 1028
            +++ KP FSMIEKVISLY+EM E++ AVLFVKE+LRR I+YS+DD
Sbjct: 178  EHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSEDD 237

Query: 1029 GQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNELAK 1208
            G GH GGPTGYLAWKMM EGNY+ A+KLVI +ES LKPEVYSYLIAMTAVVKELNE AK
Sbjct: 238  GDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFAK 297

Query: 1209 ALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLERLL 1388
            ALRKLK F K+GLI+ELD +NV+LIE+YQSDL+ADGVRLS WVIQ+G L GVV ERLL
Sbjct: 298  ALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERLL 357

Query: 1389 AMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATSSL 1568
            AMYICAG+GLEAERQLWEMKLVGKEAD LYD+VLAICAS+ E++AI+RLLT +E TSS+
Sbjct: 358  AMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSSI 417

Query: 1569 RKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNVET 1748
            R+KKTLSWLLRGYIKG H+ DA+ET+IKML+LGL P++LDRAAVLQGLR RIQQ+GNVET
Sbjct: 418  RRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVET 477

Query: 1749 YLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            YLKLCKHLSDA+LIGPCLVY+Y K+YKLWI K I
Sbjct: 478  YLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511

>ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1| predicted protein [Populus trichocarpa]
          Length = 500

 Score = 690 bits (1780), Expect = 0.0
 Identities = 336/456 (73%), Positives = 395/456 (86%)
 Frame = +3

Query: 483  KTHNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMAREPSDVLEE 662
            K N VV KVREF KSVELD+++TSDDE+EM EGFFEAIEELERM REPSD+LEE
Sbjct: 45   KRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEE 104

Query: 663  MNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLI 842
            MNDRLS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMVS+MCSWVKKLI
Sbjct: 105  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLI 164

Query: 843  EEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSD 1022
            E + + KPSFSMIEKVISLY++M +++GAV FVKE+LRRGI+YS
Sbjct: 165  EGEQDVGDVVDLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSG 224

Query: 1023 DDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNEL 1202
            DDG+G GGPTGYL WKMM +GNY++A+KLVI +ES LKPE+Y+YLIAMTAVVKELNE
Sbjct: 225  DDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEF 284

Query: 1203 AKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLER 1382
            +KALRKLK + ++G+++ELD +NV+L+E+YQSDL+ADGV LS WVIQ+G P+L GVV ER
Sbjct: 285  SKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHER 344

Query: 1383 LLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATS 1562
            LLAMYICAG+GL+AERQLWEMKLVGKEADG+LYD+VLAICASQ E++A+ARLLTR+E S
Sbjct: 345  LLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVAS 404

Query: 1563 SLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNV 1742
            S+RKKK+LSWLLRGYIKGGHY +AAET+IKML+LGLSPD+LDR AV+QGLRKRIQQ GNV
Sbjct: 405  SMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNV 464

Query: 1743 ETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            E+YLKLCK LSD +LIGP LVY+Y K+YKLWI K++
Sbjct: 465  ESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMKLL 500

>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis] gi|223539607|gb|EEF41193.1| conserved hypothetical protein [Ricinus communis]
          Length = 499

 Score = 682 bits (1761), Expect = 0.0
 Identities = 344/476 (72%), Positives = 400/476 (84%), Gaps = 2/476 (0%)
 Frame = +3

Query: 429  KLKTPNSRSWFSGTLCSLKTHNSVVLNRGKVR--EFGFLKSVELDKFITSDDEDEMSEGF 602
            + K N R + ++ K+ N VV + K R EF LKSVELD++I SDDE+EMSEGF
Sbjct: 24   RYKLLNPRFFQLSSIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIASDDEEEMSEGF 83

Query: 603  FEAIEELERMAREPSDVLEEMNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRV 782
            FEAIEELERM REPSDVLEEMND+LS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRV
Sbjct: 84   FEAIEELERMTREPSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRV 143

Query: 783  DKETMELMVSLMCSWVKKLIEEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRE 962
            DKETMELMVS+MCSW+KKLIE ++E KPSFSMIEKVISLY+E+ E
Sbjct: 144  DKETMELMVSIMCSWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGE 203

Query: 963  RDGAVLFVKEILRRGISYSDDDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLK 1142
            ++ +V FVKE+LRR ++Y +DDG+G GGPTGYLAWKMM +GNY+DA+KLVI F+ES LK
Sbjct: 204  KEKSVSFVKEVLRREVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLK 263

Query: 1143 PEVYSYLIAMTAVVKELNELAKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVR 1322
            PEVYSYLIAMTAVVKELNE AKALRKLK F K+GLI+ELD +N +LIE+YQSDLIADGV
Sbjct: 264  PEVYSYLIAMTAVVKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEKYQSDLIADGVC 323

Query: 1323 LSDWVIQQGGPSLQGVVLERLLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAIC 1502
            LS WVIQ+G PSL GVV ERLLAMYICAG+GL+AERQLWEMKLVGK ADG+LYD+VLAIC
Sbjct: 324  LSSWVIQEGSPSLYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHADGDLYDIVLAIC 383

Query: 1503 ASQNESNAIARLLTRVEATSSLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDF 1682
            ASQ E++A++RLLTRVE TSSL+KKKTLSWLLRGY+KGG Y +AAE ++KML++GL PD+
Sbjct: 384  ASQKEASAVSRLLTRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDY 443

Query: 1683 LDRAAVLQGLRKRIQQSGNVETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            LDR AVLQGLRKRIQQ GNVE+YL LCK LSD +LIGP LVY+Y K+YKLWI KM+
Sbjct: 444  LDRVAVLQGLRKRIQQWGNVESYLNLCKRLSDENLIGPSLVYLYIKKYKLWIMKML 499

>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max]
          Length = 510

 Score = 671 bits (1731), Expect = 0.0
 Identities = 343/511 (67%), Positives = 408/511 (79%), Gaps = 4/511 (0%)
 Frame = +3

Query: 330  MGFLAGIDHNSKLGFTNSFYFPQYKFVGARFCTKLKTPNSRSWFSGTLC--SLKTHNSVV 503
            M + G KLGF S P K F S ++ G L S K N
Sbjct: 1    MAYAHGFAPIFKLGFVFSSVSPSQKRHPLVFPAS-HCGYSLKFYDGVLSARSCKFKNPSF 59

Query: 504  LNRGKVREFGFLKSVELDKFITSDDE-DEMSEGFFEAIEELERMAREPSDVLEEMNDRLS 680
            + +G +R F LKSVELD+++TSDDE DEMS+GFFEAIEELERM REPSDVLEEMNDRLS
Sbjct: 60   VKQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLEEMNDRLS 119

Query: 681  VRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLIEEKNEX 860
            RELQLVLVYFSQ+GRDSWCALEVFDWLRKENRVDKETMELMV++MC WVKKLI+E +
Sbjct: 120  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEHHGV 179

Query: 861  XXXXXXXXXXXXXXX-KPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSDDDGQG 1037
            +P FSMIEKVISLY+EM E++GAVLFV+E+LRRGI Y ++D +G
Sbjct: 180  VGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLEEDEEG 239

Query: 1038 HNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNELAKALR 1217
            H GGPTGYLAWKMM EG+Y A++LVI F ES LKPEVYSYL+AMTAVVKELNELAKALR
Sbjct: 240  HKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNELAKALR 299

Query: 1218 KLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLERLLAMY 1397
            KLKSF + GL++ELD ++V+L E+YQSDL+ DGVRLS+W IQ G PSL G++ ERLLAMY
Sbjct: 300  KLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHERLLAMY 359

Query: 1398 ICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATSSLRKK 1577
            ICAG G+EAE+QLWEMKLVGKEADG+LYD+VLAICASQ ESNA ARLLTR+E SS +KK
Sbjct: 360  ICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVASSPQKK 419

Query: 1578 KTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNVETYLK 1757
            K+LSWLLRGYIKGGH+++AAET++KML+LG P++LDRAAVLQGLRKRIQQ GN++TY++
Sbjct: 420  KSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVR 479

Query: 1758 LCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            LCK LSDA+LIGPCLV++Y ++YKLW+ KM+
Sbjct: 480  LCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510

>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max]
          Length = 508

 Score = 669 bits (1727), Expect = 0.0
 Identities = 327/463 (70%), Positives = 393/463 (84%)
 Frame = +3

Query: 462  SGTLCSLKTHNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMARE 641
            S C K + V G +R F LKSVE+D+++TS+DE MS+GFFEAIEELERM RE
Sbjct: 48   SARSCKFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTRE 105

Query: 642  PSDVLEEMNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMC 821
            PSDVLEEMNDRLS RELQLVLVYFSQ+GRDSWCALEVFDWLRKENRVDKETMELMV++MC
Sbjct: 106  PSDVLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMC 165

Query: 822  SWVKKLIEEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILR 1001
            WVKKLI++++ +P FSMIEKVISLY+EM E++GAVLFV+E+LR
Sbjct: 166  GWVKKLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLR 225

Query: 1002 RGISYSDDDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAV 1181
            RGI Y ++D +GH GGPTGYLAWKMM EG+Y++A++LVI F+ES LKPE+YSYL+AMTAV
Sbjct: 226  RGIPYVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAV 285

Query: 1182 VKELNELAKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSL 1361
            VKELNE AKALRKLK F +AGL++ELD ++V+L E+YQSD +ADGVRLS+WVIQ G PSL
Sbjct: 286  VKELNEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSL 345

Query: 1362 QGVVLERLLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLL 1541
            G+V ERLLAMYICAG G+EAERQLWEMKLVGKEADG+LYD+VLAICASQ ESNA ARLL
Sbjct: 346  HGIVHERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLL 405

Query: 1542 TRVEATSSLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKR 1721
            TR+E SS +KKK+LSWLLRGYIKGGH+++AAET++KML LG P++LDRAAVLQGLRKR
Sbjct: 406  TRLEVVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKR 465

Query: 1722 IQQSGNVETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            IQQ GN++TY++LCK LSDA+LIGPCLV++Y ++YKLW+ KM+
Sbjct: 466  IQQYGNLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 508