Adaptive fully-discrete finite element methods for nonlinear quadratic parabolic boundary optimal control

Lu, Zuliang

doi:10.1186/1687-2770-2013-72

Research
Open access
Published: 04 April 2013

Boundary Value Problems volume 2013, Article number: 72 (2013)

Abstract

The aim of this work is to study adaptive fully-discrete finite element methods for quadratic boundary optimal control problems governed by nonlinear parabolic equations. We derive a posteriori error estimates for the state and control approximations. Such estimates can be used to construct reliable adaptive finite element approximations for nonlinear quadratic parabolic boundary optimal control problems. Finally, we present a numerical example to illustrate the theoretical results.

1 Introduction

In this paper, we study the fully-discrete finite element approximation for quadratic boundary optimal control problems governed by nonlinear parabolic equations. Optimal control problems are very important models in engineering numerical simulation. They have various physical backgrounds in many practical applications. Finite element approximation of optimal control problems plays a very important role in the numerical methods for these problems. The finite element approximation of a linear elliptic optimal control problem is well investigated by Falk [1] and Geveci [2]. The discretization for semilinear elliptic optimal control problems is discussed by Arada, Casas, and Tröltzsch in [3]. Systematic introductions of the finite element method for optimal control problems can be found in [4–6].

As an important class of optimal control problems, boundary optimal control is widely used in scientific and engineering computing. The literature on this topic is extensive; see, e.g., [7–10]. For some quadratic boundary optimal control problems, Liu and Yan [11, 12] investigated a posteriori error estimates and adaptive finite element methods. Alt and Mackenroth [13] were concerned with error estimates of finite element approximations to state constrained convex parabolic boundary optimal control problems. Arada et al. discussed the numerical approximation of boundary optimal control problems governed by semilinear elliptic equations with pointwise constraints on the control in [14]. Although a priori and a posteriori error estimates for finite element approximations are widely used in numerical simulations, they have not yet been developed for nonlinear parabolic boundary optimal control problems.

Adaptive finite element approximation is among the most important means of improving the accuracy of finite element discretizations. Guided by a posteriori error indicators, it places a higher density of nodes in those parts of the domain where the solution is discontinuous or otherwise difficult to approximate. A posteriori error estimates are computable quantities, expressed in terms of the discrete solution, that measure the actual discretization error without knowledge of the exact solution. They are essential in designing mesh-refinement algorithms that equidistribute the computational effort and optimize the computation. Recently, in [15–18], we derived a priori error estimates, a posteriori error estimates and superconvergence for optimal control problems using mixed finite element methods.

In this paper, we adopt the standard notation $W^{m, p} (Ω)$ for Sobolev spaces on Ω with a norm ${∥ \cdot ∥}_{m, p}$ given by ${∥ v ∥}_{m, p}^{p} = \sum_{| α | \leq m} {∥ D^{α} v ∥}_{L^{p} (Ω)}^{p}$ and a semi-norm $| \cdot |_{m, p}$ given by $| v |_{m, p}^{p} = \sum_{| α | = m} {∥ D^{α} v ∥}_{L^{p} (Ω)}^{p}$. We set $W_{0}^{m, p} (Ω) = \{ v \in W^{m, p} (Ω) : v |_{\partial Ω} = 0 \}$. For $p = 2$, we denote $H^{m} (Ω) = W^{m, 2} (Ω)$, $H_{0}^{m} (Ω) = W_{0}^{m, 2} (Ω)$, and ${∥ \cdot ∥}_{m} = {∥ \cdot ∥}_{m, 2}$, $∥ \cdot ∥ = {∥ \cdot ∥}_{0, 2}$. We denote by $L^{s} (J; W^{m, p} (Ω))$ the Banach space of all $L^{s}$ integrable functions from the time interval $J = (0, T]$ into $W^{m, p} (Ω)$ with the norm ${∥ v ∥}_{L^{s} (J; W^{m, p} (Ω))} = {(\int_{0}^{T} {∥ v ∥}_{W^{m, p} (Ω)}^{s} d t)}^{\frac{1}{s}}$ for $s \in [1, \infty)$, and the standard modification for $s = \infty$. The details can be found in [19].

In this paper, we derive a posteriori error estimates for a class of boundary optimal control problems governed by a nonlinear parabolic equation. To our best knowledge, in the context of nonlinear parabolic boundary optimal control problems, these estimates are new. The problem that we are interested in is the following nonlinear quadratic parabolic boundary optimal control problem:

\min_{u (t) \in K} \{ \int_{0}^{T} (\frac{1}{2} {∥ y - y_{0} ∥}^{2} + \frac{α}{2} {∥ u ∥}^{2}) d t \}

(1)

subject to the state equations

y_{t} (x, t) - \nabla \cdot (A \nabla y (x, t)) + ϕ (y (x, t)) = f (x, t), x \in Ω, t \in J,

(2)

(A \nabla y (x, t)) \cdot n = B u (x, t) + z_{b}, x \in \partial Ω, t \in J,

(3)

y (x, 0) = y_{0} (x), x \in Ω,

(4)

where the bounded open set $Ω \subset R^{2}$ is a regular convex polygon with boundary ∂ Ω, $J = (0, T]$, $f \in L^{2} (J; L^{2} (Ω))$, $y_{0} \in H^{1} (Ω)$, $z_{b} \in L^{2} (\partial Ω)$, and α is a positive constant. For any $I > 0$, the function $ϕ (\cdot) \in W^{2, \infty} (- I, I)$, $ϕ^{'} (y) \in L^{2} (Ω)$ for any $y \in L^{2} (J; H^{1} (Ω))$, and $ϕ^{'} (y) \geq 0$. We assume the coefficient matrix $A (x) = {(a_{i, j} (x))}_{2 \times 2} \in {(W^{1, \infty} (Ω))}^{2 \times 2}$ is a symmetric positive definite matrix, and there is a constant $c > 0$ such that $X^{t} A X \geq c {∥ X ∥}_{R^{2}}^{2}$ for any vector $X \in R^{2}$. Here, K denotes the admissible set of the control variable defined by

K = \{ u (x, t) \in L^{2} (J; L^{2} (\partial Ω)) : u (x, t) \geq 0 \text{ a.e. } x \in \partial Ω, t \in J \} .

(5)

The plan of this paper is as follows. In the next section, we present a finite element discretization for nonlinear quadratic parabolic boundary optimal control problems. A posteriori error estimates are established for the finite element approximation solutions in Section 3. In Section 4, we give a numerical example to illustrate the theoretical results.

2 Finite element methods for parabolic boundary optimal control

We shall now describe a finite element discretization of nonlinear quadratic parabolic boundary optimal control problem (1)-(4). Let $V = H^{1} (Ω)$ , $W = L^{2} (Ω)$ , $U = L^{2} (\partial Ω)$ . Let

a (y, w) = \int_{Ω} (A \nabla y) \cdot \nabla w, \forall y, w \in V,

(6)

(f_{1}, f_{2}) = \int_{Ω} f_{1} f_{2}, \forall (f_{1}, f_{2}) \in W \times W,

(7)

{(u, v)}_{U} = \int_{\partial Ω} u v, \forall (u, v) \in U \times U .

(8)

Then the nonlinear quadratic parabolic boundary optimal control problem (1)-(4) can be restated as

\min_{u (t) \in K} \{ \int_{0}^{T} (\frac{1}{2} {∥ y - y_{0} ∥}^{2} + \frac{α}{2} {∥ u ∥}^{2}) d t \}

(9)

subject to

(y_{t}, w) + a (y, w) + (ϕ (y), w) = (f, w) + {(B u + z_{b}, w)}_{U}, \forall w \in V, t \in J,

(10)

y (x, 0) = y_{0} (x), x \in Ω,

(11)

where the inner product in $L^{2} (Ω)$ or ${(L^{2} (Ω))}^{2}$ is denoted by $(\cdot, \cdot)$, and B is a continuous linear operator from U to $L^{2} (\partial Ω)$.

It is well known (see, e.g., [12]) that the optimal control problem has at least one solution $(y, u)$, and that if a pair $(y, u)$ is a solution of (9)-(11), then there is a co-state $p \in V$ such that the triplet $(y, p, u)$ satisfies the following optimality conditions:

(y_{t}, w) + a (y, w) + (ϕ (y), w) = (f, w) + {(B u + z_{b}, w)}_{U}, \forall w \in V = H^{1} (Ω),

(12)

y (x, 0) = y_{0} (x), x \in Ω,

(13)

- (p_{t}, q) + a (q, p) + (ϕ^{'} (y) p, q) = (y - y_{0}, q), \forall q \in V = H^{1} (Ω),

(14)

p (x, T) = 0, x \in Ω,

(15)

\int_{0}^{T} {(α u + B^{*} p, v - u)}_{U} d t \geq 0, \forall v \in K \subset U = L^{2} (\partial Ω),

(16)

where $B^{*}$ is the adjoint operator of B. In the rest of the paper, we shall simply write the product as $(\cdot, \cdot)$ whenever no confusion can arise.

Let us consider the finite element approximation of control problem (9)-(11). Here we consider only conforming finite elements on n-simplices.

Let $T^{h}$ be a regular partition of Ω. Associated with $T^{h}$ is a finite dimensional subspace $V^{h}$ of $C (\bar{Ω})$ such that $χ |_{τ}$ is a polynomial of order m ($m \geq 1$) for all $χ \in V^{h}$ and $τ \in T^{h}$. It is easy to see that $V^{h} \subset V$. Let $E^{h}$ be a partition of ∂ Ω into disjoint regular $(n - 1)$-simplices s, so that $\partial Ω = \bigcup_{s \in E^{h}} \bar{s}$. Associated with $E^{h}$ is another finite dimensional subspace $U^{h}$ of $L^{2} (\partial Ω)$ such that $χ |_{s}$ is a polynomial of order m ($m \geq 0$) for all $χ \in U^{h}$ and $s \in E^{h}$. Let $h_{τ}$ ($h_{s}$) denote the diameter of the element τ (s) in $T^{h}$ ($E^{h}$), $h = \max_{τ \in T^{h}} \{ h_{τ} \}$, and $h_{U} = \max_{s \in E^{h}} \{ h_{s} \}$. In addition, C or c denotes a general positive constant independent of h.

By the definition of a finite element subspace, the finite element discretization of (9)-(11) is as follows: compute $(y_{h}, u_{h}) \in V^{h} \times K^{h}$ such that

\min_{u_{h} \in K^{h}} \{ \int_{0}^{T} (\frac{1}{2} {∥ y_{h} - y_{0} ∥}^{2} + \frac{α}{2} {∥ u_{h} ∥}^{2}) d t \}

(17)

(y_{h t}, w_{h}) + a (y_{h}, w_{h}) + (ϕ (y_{h}), w_{h}) = (f, w_{h}) + {(B u_{h} + z_{b}, w_{h})}_{U}, \forall w_{h} \in V^{h},

(18)

y_{h} (x, 0) = y_{0}^{h} (x), x \in Ω,

(19)

where $K^{h} = K \cap U^{h}$ , $y_{0}^{h} \in V^{h}$ is an approximation of $y_{0}$ .

Again, it follows that optimal control problem (17)-(19) has at least one solution $(y_{h}, u_{h})$, and that if a pair $(y_{h}, u_{h})$ is a solution of (17)-(19), then there is a co-state $p_{h} \in V^{h}$ such that the triplet $(y_{h}, p_{h}, u_{h})$ satisfies the following optimality conditions:

(y_{h t}, w_{h}) + a (y_{h}, w_{h}) + (ϕ (y_{h}), w_{h}) = (f, w_{h}) + {(B u_{h} + z_{b}, w_{h})}_{U}, \forall w_{h} \in V^{h},

(20)

y_{h} (x, 0) = y_{0}^{h} (x), x \in Ω,

(21)

- (p_{h t}, q_{h}) + a (q_{h}, p_{h}) + (ϕ^{'} (y_{h}) p_{h}, q_{h}) = (y_{h} - y_{0}, q_{h}), \forall q_{h} \in V^{h},

(22)

p_{h} (x, T) = 0, x \in Ω,

(23)

\int_{0}^{T} {(α u_{h} + B^{*} p_{h}, v_{h} - u_{h})}_{U} d t \geq 0, \forall v_{h} \in K^{h} .

(24)

We now consider the fully discrete approximation of the semidiscrete problem above. Let $Δ t > 0$, $N = T / Δ t \in Z$, and let $t^{i} = i Δ t$ (also written $t_{i}$), $i = 0, 1, \dots, N$. Also, let

ψ^{i} = ψ^{i} (x) = ψ (x, t^{i}), d_{t} ψ^{i} = \frac{ψ^{i} - ψ^{i - 1}}{Δ t} .
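
The backward difference quotient $d_{t} ψ^{i}$ can be illustrated with a minimal Python sketch. The array layout (`psi[i]` holding the time-level value $ψ^{i}$) and the function name `backward_difference` are assumptions made for this illustration and are not part of the original text.

```python
import numpy as np

def backward_difference(psi, dt):
    """Return d_t psi^i = (psi^i - psi^{i-1}) / dt for i = 1, ..., N.

    psi[i] is assumed to hold the time-level value psi^i, i = 0, ..., N.
    """
    psi = np.asarray(psi, dtype=float)
    return (psi[1:] - psi[:-1]) / dt

T, N = 1.0, 4
dt = T / N
t = dt * np.arange(N + 1)      # time levels t^i = i * dt
psi = t ** 2                   # sample values psi^i = (t^i)^2
d_psi = backward_difference(psi, dt)
```

For $ψ = t^{2}$ the quotient equals $t^{i} + t^{i - 1}$, a first-order approximation of the exact derivative $2 t^{i}$.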

For $i = 1, 2, \dots, N$, we construct the finite element spaces $V_{i}^{h} \subset V$ on the mesh $T_{h}^{i}$ (similarly to $V^{h}$). Likewise, we construct the finite element spaces $U_{i}^{h} \subset L^{2} (\partial Ω)$ on the boundary mesh ${(E^{h})}^{i}$ (similarly to $U^{h}$). Let $h_{τ^{i}}$ ($h_{s^{i}}$) denote the diameter of the element $τ^{i}$ ($s^{i}$) in $T_{h}^{i}$ (${(E^{h})}^{i}$). Define mesh functions $τ (\cdot)$, $s (\cdot)$ and mesh size functions $h_{τ} (\cdot)$, $h_{s} (\cdot)$ such that $τ (t) |_{t \in (t_{i - 1}, t_{i}]} = τ^{i}$, $s (t) |_{t \in (t_{i - 1}, t_{i}]} = s^{i}$, $h_{τ} (t) |_{t \in (t_{i - 1}, t_{i}]} = h_{τ^{i}}$, $h_{s} (t) |_{t \in (t_{i - 1}, t_{i}]} = h_{s^{i}}$. For ease of exposition, we denote $τ (t)$, $s (t)$, $h_{τ} (t)$, and $h_{s} (t)$ by τ, s, $h_{τ}$, and $h_{s}$, respectively.

Then the fully discrete finite element approximation of (17)-(19) is as follows. Compute $(y_{h}^{i}, u_{h}^{i}) \in V_{i}^{h} \times K_{i}^{h}$ , $i = 1, 2, \dots, N$ , such that

\min_{u_{h}^{i} \in K_{i}^{h}} \{ \sum_{i = 1}^{N} Δ t (\frac{1}{2} {∥ y_{h}^{i} - y_{0} ∥}^{2} + \frac{α}{2} {∥ u_{h}^{i} ∥}^{2}) \}

(25)

(d_{t} y_{h}^{i}, w_{h}) + a (y_{h}^{i}, w_{h}) + (ϕ (y_{h}^{i}), w_{h}) = (f (x, t_{i}), w_{h}) + {(B u_{h}^{i} + z_{b}, w_{h})}_{U},

(26)

\forall w_{h} \in V_{i}^{h}, i = 1, 2, \dots, N, y_{h}^{0} (x) = y_{0}^{h} (x), x \in Ω,

(27)

where $K_{i}^{h} = K \cap U_{i}^{h}$ , $y_{0}^{h} \in V^{h}$ is an approximation of $y_{0}$ .
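
To illustrate how one implicit time step of (26) might be solved, the following minimal sketch treats a scalar model problem in which the bilinear form $a (\cdot, \cdot)$ is replaced by multiplication with a constant `a >= 0` and the whole right-hand side is lumped into a number `rhs`. These simplifications, the function name, and the use of Newton's method are assumptions of this illustration; the text above does not prescribe a nonlinear solver.

```python
def backward_euler_step(y_prev, dt, a, phi, dphi, rhs, tol=1e-12, max_iter=50):
    """Solve (y - y_prev)/dt + a*y + phi(y) = rhs for y by Newton's method.

    With a >= 0 and dphi >= 0 the derivative 1/dt + a + dphi(y) is positive,
    so every Newton update is well defined.
    """
    y = y_prev
    for _ in range(max_iter):
        residual = (y - y_prev) / dt + a * y + phi(y) - rhs
        if abs(residual) < tol:
            break
        y -= residual / (1.0 / dt + a + dphi(y))
    return y

phi = lambda y: y ** 3          # sample monotone nonlinearity, phi' >= 0
dphi = lambda y: 3.0 * y ** 2
y1 = backward_euler_step(y_prev=1.0, dt=0.1, a=1.0, phi=phi, dphi=dphi, rhs=0.0)
```

In a finite element setting the same iteration would act on coefficient vectors, with the scalar derivative replaced by a Jacobian matrix.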

Now, it follows that optimal control problem (25)-(27) has at least one solution $(Y_{h}^{i}, U_{h}^{i})$, $i = 1, 2, \dots, N$, and that if a pair $(Y_{h}^{i}, U_{h}^{i})$, $i = 1, 2, \dots, N$, is a solution of (25)-(27), then there is a co-state $P_{h}^{i - 1} \in V_{i}^{h}$, $i = 1, 2, \dots, N$, such that the triplet $(Y_{h}^{i}, P_{h}^{i - 1}, U_{h}^{i})$ satisfies the following optimality conditions:

(d_{t} Y_{h}^{i}, w_{h}) + a (Y_{h}^{i}, w_{h}) + (ϕ (Y_{h}^{i}), w_{h}) = (f (x, t_{i}), w_{h}) + {(B U_{h}^{i} + z_{b}, w_{h})}_{U}, \forall w_{h} \in V_{i}^{h},

(28)

i = 1, 2, \dots, N, Y_{h}^{0} (x) = y_{0}^{h} (x), x \in Ω,

(29)

- (d_{t} P_{h}^{i}, q_{h}) + a (q_{h}, P_{h}^{i - 1}) + (ϕ^{'} (Y_{h}^{i - 1}) P_{h}^{i - 1}, q_{h}) = (Y_{h}^{i} - y_{0}, q_{h}), \forall q_{h} \in V_{i}^{h},

(30)

i = N, \dots, 2, 1, P_{h}^{N} (x) = 0, x \in Ω,

(31)

(α U_{h}^{i} + B^{*} P_{h}^{i - 1}, v_{h} - U_{h}^{i}) \geq 0, \forall v_{h} \in K_{i}^{h}, i = 1, 2, \dots, N .

(32)
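
In a common special case the discrete variational inequality (32) can be solved in closed form. The sketch below assumes that B is the identity, that $U_{i}^{h}$ consists of piecewise constant functions on the boundary edges ($m = 0$), and that the edge averages of the co-state appearing in (32) have already been computed. Under these assumptions, which go beyond what is stated above, the inequality decouples edge by edge into $U |_{s} = \max (0, - \bar{P}_{s} / α)$, where $\bar{P}_{s}$ is the edge average. All names in the code are hypothetical.

```python
import numpy as np

def control_from_costate(p_edge_mean, alpha):
    """Edge-wise solution of (alpha*U + P, v - U) >= 0 for all v >= 0."""
    return np.maximum(0.0, -np.asarray(p_edge_mean, dtype=float) / alpha)

def vi_residual(u, p_edge_mean, v, edge_len, alpha):
    """Left-hand side of the variational inequality for a test control v."""
    return float(np.sum(edge_len * (alpha * u + p_edge_mean) * (v - u)))

alpha = 0.5
p_mean = np.array([-2.0, 0.5, 0.0])      # sample edge averages of the co-state
edge_len = np.array([0.25, 0.25, 0.5])   # sample edge lengths
U = control_from_costate(p_mean, alpha)
```

The helper `vi_residual` can be used to confirm that the computed control satisfies the inequality for any nonnegative piecewise constant test function.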

For $i = 1, 2, \dots, N$ , let

Y_{h} |_{(t_{i - 1}, t_{i}]} = ((t_{i} - t) Y_{h}^{i - 1} + (t - t_{i - 1}) Y_{h}^{i}) / Δ t,

(33)

P_{h} |_{(t_{i - 1}, t_{i}]} = ((t_{i} - t) P_{h}^{i - 1} + (t - t_{i - 1}) P_{h}^{i}) / Δ t,

(34)

U_{h} |_{(t_{i - 1}, t_{i}]} = U_{h}^{i} .

(35)

For any function $w \in C (0, T; L^{2} (Ω))$ , let $\hat{w} (x, t) |_{t \in (t_{i - 1}, t_{i}]} = w (x, t_{i})$ , $\tilde{w} (x, t) |_{t \in (t_{i - 1}, t_{i}]} = w (x, t_{i - 1})$ . Then the optimality conditions (28)-(32) can be restated as follows:

(Y_{h t}, w_{h}) + a ({\hat{Y}}_{h}, w_{h}) + (ϕ ({\hat{Y}}_{h}), w_{h}) = (\hat{f}, w_{h}) + {(B U_{h} + z_{b}, w_{h})}_{U}, \forall w_{h} \in V_{i}^{h},

(36)

i = 1, 2, \dots, N, Y_{h}^{0} (x) = y_{0}^{h} (x), x \in Ω,

(37)

- (P_{h t}, q_{h}) + a (q_{h}, {\tilde{P}}_{h}) + (ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h}, q_{h}) = ({\hat{Y}}_{h} - y_{0}, q_{h}), \forall q_{h} \in V_{i}^{h},

(38)

i = N, \dots, 2, 1, P_{h} (x, T) = 0, x \in Ω,

(39)

(α U_{h} + B^{*} {\tilde{P}}_{h}, v_{h} - U_{h}) \geq 0, \forall v_{h} \in K_{i}^{h}, i = 1, 2, \dots, N .

(40)
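
The time interpolants (33)-(34) and the piecewise constant operators $\hat{w}$ and $\tilde{w}$ can be sketched as follows. Scalar nodal values stand in for the finite element functions at each time level; this simplification and all function names are assumptions of the illustration.

```python
import math

def interval_index(t, dt, n_steps):
    """Return i in {1, ..., N} such that t lies in (t_{i-1}, t_i]."""
    i = math.ceil(t / dt - 1e-12)   # small shift guards against round-off at t = t_i
    return min(max(i, 1), n_steps)

def linear_in_time(nodal, t, dt):
    """Piecewise linear interpolant in time, as in (33)-(34)."""
    i = interval_index(t, dt, len(nodal) - 1)
    t_prev, t_i = (i - 1) * dt, i * dt
    return ((t_i - t) * nodal[i - 1] + (t - t_prev) * nodal[i]) / dt

def hat(nodal, t, dt):
    """Right-endpoint value w(t_i) on (t_{i-1}, t_i]."""
    return nodal[interval_index(t, dt, len(nodal) - 1)]

def tilde(nodal, t, dt):
    """Left-endpoint value w(t_{i-1}) on (t_{i-1}, t_i]."""
    return nodal[interval_index(t, dt, len(nodal) - 1) - 1]

dt = 0.25
Y = [0.0, 1.0, 4.0, 9.0, 16.0]   # sample nodal values Y^0, ..., Y^4
```

At the midpoint of the second interval, `linear_in_time(Y, 0.375, dt)` averages `Y[1]` and `Y[2]`, while `hat` and `tilde` return `Y[2]` and `Y[1]`, respectively.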

In the rest of the paper, we shall use some intermediate variables. For any control function $U_{h} \in K$, we define the intermediate state and co-state $(y (U_{h}), p (U_{h}))$ as the solution of

(y_{t} (U_{h}), w) + a (y (U_{h}), w) + (ϕ (y (U_{h})), w) = (f, w) + (B U_{h} + z_{b}, w), \forall w \in V,

(41)

y (U_{h}) (x, 0) = y_{0} (x), x \in Ω,

(42)

- (p_{t} (U_{h}), q) + a (q, p (U_{h})) + (ϕ^{'} (y (U_{h})) p (U_{h}), q) = (y (U_{h}) - y_{0}, q), \forall q \in V,

(43)

p (U_{h}) (x, T) = 0, x \in Ω .

(44)

Now we restate the following well-known estimates in [19].

Lemma 2.1 Let ${\hat{π}}_{h}$ be the Clément-type interpolation operator defined in [19]. Then for any $v \in H^{1} (Ω)$ and every element τ,

{∥ v - {\hat{π}}_{h} v ∥}_{L^{2} (τ)} + h_{τ} {∥ \nabla (v - {\hat{π}}_{h} v) ∥}_{L^{2} (τ)} \leq C h_{τ} \sum_{{\bar{τ}}^{'} \cap \bar{τ} \neq \emptyset} {| v |}_{H^{1} (τ^{'})},

(45)

{∥ v - {\hat{π}}_{h} v ∥}_{L^{2} (l)} \leq C h_{l}^{1 / 2} \sum_{l \subset {\bar{τ}}^{'}} {| \nabla v |}_{L^{2} (τ^{'})},

(46)

where l is the edge of the element.

For $φ \in W_{h}$ , we write

ϕ (φ) - ϕ (ρ) = - {\tilde{ϕ}}^{'} (φ) (ρ - φ) = - ϕ^{'} (ρ) (ρ - φ) + {\tilde{ϕ}}^{''} (φ) {(ρ - φ)}^{2},

(47)

where

\begin{matrix} {\tilde{ϕ}}^{'} (φ) = \int_{0}^{1} ϕ^{'} (φ + s (ρ - φ)) d s, \\ {\tilde{ϕ}}^{''} (φ) = \int_{0}^{1} (1 - s) ϕ^{''} (ρ + s (φ - ρ)) d s \end{matrix}

are bounded functions in $\bar{Ω}$ [20].
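
Identity (47) can be checked numerically for a concrete nonlinearity. The sketch below uses $ϕ (y) = y^{3}$, chosen only for illustration, and evaluates the two integrals by Gauss-Legendre quadrature, which is exact for the polynomial integrands that arise here. The function names and sample values are assumptions of this example.

```python
import numpy as np

def unit_interval_quadrature(n=8):
    """Gauss-Legendre nodes and weights mapped from [-1, 1] to [0, 1]."""
    x, w = np.polynomial.legendre.leggauss(n)
    return 0.5 * (x + 1.0), 0.5 * w

def phi(y):
    return y ** 3

def dphi(y):
    return 3.0 * y ** 2

def d2phi(y):
    return 6.0 * y

def tilde_dphi(varphi, rho):
    """Integral over s in [0, 1] of phi'(varphi + s*(rho - varphi))."""
    s, w = unit_interval_quadrature()
    return float(np.sum(w * dphi(varphi + s * (rho - varphi))))

def tilde_d2phi(varphi, rho):
    """Integral over s in [0, 1] of (1 - s) * phi''(rho + s*(varphi - rho))."""
    s, w = unit_interval_quadrature()
    return float(np.sum(w * (1.0 - s) * d2phi(rho + s * (varphi - rho))))

varphi, rho = 0.5, 2.0
lhs = phi(varphi) - phi(rho)
mean_value_form = -tilde_dphi(varphi, rho) * (rho - varphi)
taylor_form = (-dphi(rho) * (rho - varphi)
               + tilde_d2phi(varphi, rho) * (rho - varphi) ** 2)
```

Both `mean_value_form` and `taylor_form` agree with `lhs` up to round-off, matching the two equalities in (47).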

3 A posteriori error estimates

In this section we obtain a posteriori error estimates for nonlinear quadratic parabolic boundary optimal control problems. Firstly, we estimate the error ${∥ y (U_{h}) - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}$ .

Theorem 3.1 Let $(y (U_{h}), p (U_{h}))$ and $(Y_{h}, P_{h})$ be the solutions of (41)-(44) and (36)-(40), respectively. Then

{∥ y (U_{h}) - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C \sum_{i = 1}^{6} η_{i}^{2},

(48)

where

\begin{matrix} η_{1}^{2} = \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {(\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h}))}^{2}, \\ η_{2}^{2} = \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A \nabla {\hat{Y}}_{h} \cdot n]}^{2}, \\ η_{3}^{2} = \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b})}^{2}, \\ η_{4}^{2} = {∥ Y_{h} - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \\ η_{5}^{2} = {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2}, \\ η_{6}^{2} = {∥ f - \hat{f} ∥}_{L^{2} (J; L^{2} (Ω))}^{2}, \end{matrix}

where l is a face of an element τ, $h_{l}$ is the size of the face l, and $[A \nabla {\hat{Y}}_{h} \cdot n]$ is the A-normal derivative jump over the interior face l defined by

{[A \nabla {\hat{Y}}_{h} \cdot n]}_{l} = (A \nabla {\hat{Y}}_{h} |_{τ_{l}^{1}} - A \nabla {\hat{Y}}_{h} |_{τ_{l}^{2}}) \cdot n,

where n is the unit normal vector on $l = {\bar{τ}}_{l}^{1} \cap {\bar{τ}}_{l}^{2}$ pointing outward from $τ_{l}^{1}$.

Proof Let $e^{y} = y (U_{h}) - Y_{h}$ , and let $e_{I}^{y}$ be the Clément-type interpolator of $e^{y}$ defined in Lemma 2.1. Note that

\begin{array}{rcl} \int_{0}^{T} (y_{t} (U_{h}) - Y_{h t}, e^{y}) d t & = & \int_{0}^{T} \int_{Ω} (y_{t} (U_{h}) - Y_{h t}) e^{y} d x d t \\ = & \frac{1}{2} \int_{Ω} {((y (U_{h}) - Y_{h}) (x, T))}^{2} d x - \frac{1}{2} {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2} . \end{array}

Thus

\int_{0}^{T} (y_{t} (U_{h}) - Y_{h t}, e^{y}) d t + \frac{1}{2} {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2} \geq 0 .

Using equations (36) and (41), we infer that

\begin{matrix} c {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \\ \leq \int_{0}^{T} (A \nabla (y (U_{h}) - Y_{h}), \nabla e^{y}) d t + \int_{0}^{T} (ϕ (y (U_{h})) - ϕ (Y_{h}), e^{y}) d t \\ = \int_{0}^{T} (A \nabla (y (U_{h}) - Y_{h}), \nabla (e^{y} - e_{I}^{y})) d t + \int_{0}^{T} (ϕ (y (U_{h})) - ϕ (Y_{h}), e^{y} - e_{I}^{y}) d t \\ + \int_{0}^{T} (A \nabla (y (U_{h}) - Y_{h}), \nabla (e_{I}^{y})) d t + \int_{0}^{T} (ϕ (y (U_{h})) - ϕ (Y_{h}), e_{I}^{y}) d t \\ \leq \int_{0}^{T} (A \nabla (y (U_{h}) - Y_{h}), \nabla (e^{y} - e_{I}^{y})) d t + \int_{0}^{T} (ϕ (y (U_{h})) - ϕ (Y_{h}), e^{y} - e_{I}^{y}) d t \\ + \int_{0}^{T} (y_{t} (U_{h}) - Y_{h t}, e^{y} - e_{I}^{y}) d t + \frac{1}{2} {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2} \\ + \int_{0}^{T} (A \nabla (y (U_{h}) - Y_{h}), \nabla (e_{I}^{y})) d t + \int_{0}^{T} (ϕ (y (U_{h})) - ϕ (Y_{h}), e_{I}^{y}) d t \\ + \int_{0}^{T} (y_{t} (U_{h}) - Y_{h t}, e_{I}^{y}) d t \\ = \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} (\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h})) (e^{y} - e_{I}^{y}) d t \\ + \int_{0}^{T} \sum_{τ \in T^{h}} \int_{\partial τ} (A \nabla {\hat{Y}}_{h} \cdot n) (e^{y} - e_{I}^{y}) d s d t \\ + \int_{0}^{T} \int_{\partial Ω} (A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b}) (e^{y} - e_{I}^{y}) d s d t \\ + \int_{0}^{T} (A \nabla (Y_{h} - {\hat{Y}}_{h}), \nabla (e^{y})) d t + \int_{0}^{T} (ϕ (Y_{h}) - ϕ ({\hat{Y}}_{h}), e^{y}) d t \\ + \int_{0}^{T} (f - \hat{f}, e^{y}) d t + \frac{1}{2} {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2} \\ = \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} (\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h})) (e^{y} - e_{I}^{y}) d t \\ + \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} \int_{l} [A \nabla {\hat{Y}}_{h} \cdot n] (e^{y} - e_{I}^{y}) d s d t \\ + \int_{0}^{T} \sum_{l \subset \partial Ω} \int_{l} (A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b}) (e^{y} - e_{I}^{y}) d s d t \\ + \int_{0}^{T} (A \nabla (Y_{h} - {\hat{Y}}_{h}), \nabla (e^{y})) d t + \int_{0}^{T} (ϕ (Y_{h}) - ϕ ({\hat{Y}}_{h}), e^{y}) d t \\ + \int_{0}^{T} (f - \hat{f}, e^{y}) d t + \frac{1}{2} {∥ Y_{h} (x, 0) - y_{0} (x) ∥}_{L^{2} (Ω)}^{2} \\ \equiv K_{1} + K_{2} + K_{3} + K_{4} + K_{5} + K_{6} + K_{7} . \end{matrix}

(49)

Let us bound each of the terms on the right-hand side of (49). By Lemma 2.1 we have

\begin{array}{rcl} K_{1} & = & \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} (\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h})) (e^{y} - e_{I}^{y}) d t \\ \leq & C \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {(\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h}))}^{2} d t \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{y} - e_{I}^{y} |}^{2} d t \\ \leq & C \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {(\hat{f} - Y_{h t} + div (A \nabla {\hat{Y}}_{h}) - ϕ ({\hat{Y}}_{h}))}^{2} d t + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(50)

Next, using Lemma 2.1, we get

\begin{array}{rcl} K_{2} & = & \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} \int_{l} [A \nabla {\hat{Y}}_{h} \cdot n] (e^{y} - e_{I}^{y}) d s d t \\ \leq & C \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A \nabla {\hat{Y}}_{h} \cdot n]}^{2} + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{y} - e_{I}^{y} |}^{2} \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} {| \nabla (e^{y} - e_{I}^{y}) |}^{2} \\ \leq & C \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A \nabla {\hat{Y}}_{h} \cdot n]}^{2} + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \end{array}

(51)

and

\begin{array}{rcl} K_{3} & = & \int_{0}^{T} \sum_{l \subset \partial Ω} \int_{l} (A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b}) (e^{y} - e_{I}^{y}) \\ \leq & C \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b})}^{2} \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{y} - e_{I}^{y} |}^{2} + C δ \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} {| \nabla (e^{y} - e_{I}^{y}) |}^{2} \\ \leq & C \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A \nabla {\hat{Y}}_{h} \cdot n - B U_{h} - z_{b})}^{2} + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(52)

For $K_{4}$ - $K_{6}$ , the Schwarz inequality implies

\begin{array}{rcl} K_{4} & = & \int_{0}^{T} (A \nabla (Y_{h} - {\hat{Y}}_{h}), \nabla (e^{y})) d t \\ \leq & C {∥ Y_{h} - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \end{array}

(53)

and

\begin{array}{rcl} K_{5} & = & \int_{0}^{T} (ϕ (Y_{h}) - ϕ ({\hat{Y}}_{h}), e^{y}) d t \\ = & \int_{0}^{T} ({\tilde{ϕ}}^{'} (Y_{h}) (Y_{h} - {\hat{Y}}_{h}), e^{y}) d t \\ \leq & C {∥ Y_{h} - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \end{array}

(54)

and

\begin{array}{rcl} K_{6} & = & \int_{0}^{T} (f - \hat{f}, e^{y}) d t \\ \leq & C {∥ f - \hat{f} ∥}_{L^{2} (J; L^{2} (Ω))}^{2} + C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(55)

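Finally, we choose δ sufficiently small so that the terms $C δ {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}$ can be absorbed into the left-hand side of (49), and use the triangle inequality ${∥ y (U_{h}) - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))} \leq {∥ e^{y} ∥}_{L^{2} (J; H^{1} (Ω))} + {∥ Y_{h} - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}$. Combining (49)-(55) then gives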

{∥ y (U_{h}) - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C \sum_{i = 1}^{6} η_{i}^{2} .

(56)

This completes the proof. □
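
The time-discretization indicator $η_{4}$ of Theorem 3.1 admits a simple closed form. By (33), on $(t_{i - 1}, t_{i}]$ we have $Y_{h} - {\hat{Y}}_{h} = \frac{t_{i} - t}{Δ t} (Y_{h}^{i - 1} - Y_{h}^{i})$, so integrating the squared norm in time gives $η_{4}^{2} = \frac{Δ t}{3} \sum_{i = 1}^{N} {∥ Y_{h}^{i} - Y_{h}^{i - 1} ∥}_{1}^{2}$. The sketch below checks this formula against Simpson's rule, which is exact here because the integrand is quadratic in t. A Euclidean norm on coefficient vectors stands in for the $H^{1} (Ω)$ norm; this substitution and the function names are assumptions of the illustration.

```python
import numpy as np

def eta4_sq_closed(Y, dt, norm_sq):
    """Closed form: (dt/3) * sum_i ||Y^i - Y^{i-1}||^2."""
    Y = np.asarray(Y, dtype=float)
    return sum(dt / 3.0 * norm_sq(Y[i] - Y[i - 1]) for i in range(1, len(Y)))

def eta4_sq_simpson(Y, dt, norm_sq):
    """Simpson's rule applied to ||Y_h(t) - hat{Y}_h(t)||^2 on each interval."""
    Y = np.asarray(Y, dtype=float)
    total = 0.0
    for i in range(1, len(Y)):
        def integrand(theta):
            # theta = (t - t_{i-1}) / dt in [0, 1]; Y_h(t) is linear in theta
            Yh = (1.0 - theta) * Y[i - 1] + theta * Y[i]
            return norm_sq(Yh - Y[i])
        total += dt / 6.0 * (integrand(0.0) + 4.0 * integrand(0.5) + integrand(1.0))
    return total

norm_sq = lambda v: float(v @ v)                 # stand-in for the squared H^1 norm
Y = [[0.0, 0.0], [1.0, 2.0], [1.0, 0.0]]         # sample coefficient vectors Y^0, Y^1, Y^2
closed = eta4_sq_closed(Y, 0.5, norm_sq)
simpson = eta4_sq_simpson(Y, 0.5, norm_sq)
```

The closed form avoids any quadrature in time: only the differences of consecutive time levels are needed.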

Analogously to Theorem 3.1, we show the following estimates.

Theorem 3.2 Let $(y (U_{h}), p (U_{h}))$ and $(Y_{h}, P_{h})$ be the solutions of (41)-(44) and (36)-(40), respectively. Then

{∥ p (U_{h}) - {\tilde{P}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C \sum_{i = 1}^{11} η_{i}^{2},

(57)

where

\begin{matrix} η_{7}^{2} = \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {({\hat{Y}}_{h} - y_{0} + P_{h t} + div (A^{*} \nabla {\tilde{P}}_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h})}^{2}, \\ η_{8}^{2} = \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A^{*} \nabla {\tilde{P}}_{h} \cdot n]}^{2}, \\ η_{9}^{2} = \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A^{*} \nabla {\tilde{P}}_{h} \cdot n)}^{2}, \\ η_{10}^{2} = {∥ Y_{h} - {\tilde{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \\ η_{11}^{2} = {∥ P_{h} - {\tilde{P}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2}, \end{matrix}

where $η_{1}$ - $η_{6}$ are defined in Theorem 3.1, l is a face of an element τ, $[A^{*} \nabla {\tilde{P}}_{h} \cdot n]$ is the A-normal derivative jump over the interior face l defined by

{[A^{*} \nabla {\tilde{P}}_{h} \cdot n]}_{l} = (A^{*} \nabla {\tilde{P}}_{h} |_{τ_{l}^{1}} - A^{*} \nabla {\tilde{P}}_{h} |_{τ_{l}^{2}}) \cdot n,

where n is the unit normal vector on $l = {\bar{τ}}_{l}^{1} \cap {\bar{τ}}_{l}^{2}$ outwards $τ_{l}^{1}$ .

Proof Let $e^{p} = p (U_{h}) - P_{h}$, and let $e_{I}^{p} = {\hat{π}}_{h} e^{p}$, where ${\hat{π}}_{h}$ is the Clément-type interpolator defined in Lemma 2.1. Since $(p (U_{h}) - P_{h}) (x, T) = 0$, we obtain

- \int_{0}^{T} (p_{t} (U_{h}) - P_{h t}, e^{p}) d t \geq 0 .

Using equations (38) and (43), we obtain

\begin{matrix} c {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \\ \leq \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla (p (U_{h}) - P_{h})) d t + \int_{0}^{T} (ϕ^{'} (y (U_{h})) (p (U_{h}) - P_{h}), e^{p}) d t \\ \leq \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla (p (U_{h}) - {\tilde{P}}_{h})) d t + \int_{0}^{T} (ϕ^{'} (y (U_{h})) p (U_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h}, e^{p}) d t \\ - \int_{0}^{T} (p_{t} (U_{h}) - P_{h t}, e^{p}) d t + \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h} - ϕ^{'} (y (U_{h})) P_{h}, e^{p}) d t \\ + \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla ({\tilde{P}}_{h} - P_{h})) d t \\ = \int_{0}^{T} (\nabla (e^{p} - e_{I}^{p}), A^{*} \nabla (p (U_{h}) - {\tilde{P}}_{h})) d t - \int_{0}^{T} (p_{t} (U_{h}) - P_{h t}, e^{p} - e_{I}^{p}) d t \\ + \int_{0}^{T} (ϕ^{'} (y (U_{h})) p (U_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h}, e^{p} - e_{I}^{p}) d t + \int_{0}^{T} (y (U_{h}) - {\hat{Y}}_{h}, e_{I}^{p}) d t \\ + \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla ({\tilde{P}}_{h} - P_{h})) d t + \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h} - ϕ^{'} (y (U_{h})) P_{h}, e^{p}) d t \\ = \int_{0}^{T} ({\hat{Y}}_{h} - y_{0} + P_{h t} + div (A^{*} \nabla {\tilde{P}}_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h}, e^{p} - e_{I}^{p}) d t \\ + \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} \int_{l} [A^{*} \nabla {\tilde{P}}_{h} \cdot n] (e^{p} - e_{I}^{p}) d s d t \\ + \int_{0}^{T} \sum_{l \subset \partial Ω} \int_{l} (A^{*} \nabla {\tilde{P}}_{h} \cdot n) (e^{p} - e_{I}^{p}) d s d t + \int_{0}^{T} (y (U_{h}) - {\hat{Y}}_{h}, e^{p}) d t \\ + \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla ({\tilde{P}}_{h} - P_{h})) d t + \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h} - ϕ^{'} (y (U_{h})) P_{h}, e^{p}) d t \\ \equiv L_{1} + L_{2} + L_{3} + L_{4} + L_{5} + L_{6} . \end{matrix}

(58)

Now let us bound each of the terms on the right-hand side of (58). By Lemma 2.1 we have

\begin{array}{rcl} L_{1} & = & \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} ({\hat{Y}}_{h} - y_{0} + P_{h t} + div (A^{*} \nabla {\tilde{P}}_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h}) (e^{p} - e_{I}^{p}) d t \\ \leq & C \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {({\hat{Y}}_{h} - y_{0} + P_{h t} + div (A^{*} \nabla {\tilde{P}}_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h})}^{2} d t \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{p} - e_{I}^{p} |}^{2} d t \\ \leq & C \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{2} \int_{τ} {({\hat{Y}}_{h} - y_{0} + P_{h t} + div (A^{*} \nabla {\tilde{P}}_{h}) - ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h})}^{2} d t \\ + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(59)

Next, using Lemma 2.1, we get

\begin{array}{rcl} L_{2} & = & \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} \int_{l} [A^{*} \nabla {\tilde{P}}_{h} \cdot n] (e^{p} - e_{I}^{p}) d s d t \\ \leq & C \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A^{*} \nabla {\tilde{P}}_{h} \cdot n]}^{2} + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{p} - e_{I}^{p} |}^{2} \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} {| \nabla (e^{p} - e_{I}^{p}) |}^{2} \\ \leq & C \int_{0}^{T} \sum_{l \cap \partial Ω = \emptyset} h_{l} \int_{l} {[A^{*} \nabla {\tilde{P}}_{h} \cdot n]}^{2} + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \end{array}

(60)

and

\begin{array}{rcl} L_{3} & = & \int_{0}^{T} \sum_{l \subset \partial Ω} \int_{l} (A^{*} \nabla {\tilde{P}}_{h} \cdot n) (e^{p} - e_{I}^{p}) \\ \leq & C \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A^{*} \nabla {\tilde{P}}_{h} \cdot n)}^{2} \\ + C δ \int_{0}^{T} \sum_{τ \in T^{h}} h_{τ}^{- 2} \int_{τ} {| e^{p} - e_{I}^{p} |}^{2} + C δ \int_{0}^{T} \sum_{τ \in T^{h}} \int_{τ} {| \nabla (e^{p} - e_{I}^{p}) |}^{2} \\ \leq & C \int_{0}^{T} \sum_{l \subset \partial Ω} h_{l} \int_{l} {(A^{*} \nabla {\tilde{P}}_{h} \cdot n)}^{2} + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(61)

The Schwarz inequality implies

\begin{array}{rcl} L_{4} & = & \int_{0}^{T} (y (U_{h}) - {\hat{Y}}_{h}, e^{p}) d t \\ = & \int_{0}^{T} ((y (U_{h}) - Y_{h}) + (Y_{h} - {\hat{Y}}_{h}), e^{p}) d t \\ \leq & C {∥ y (U_{h}) - Y_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C {∥ Y_{h} - {\hat{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \\ + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \end{array}

(62)

and

\begin{array}{rcl} L_{5} & = & \int_{0}^{T} (\nabla e^{p}, A^{*} \nabla ({\tilde{P}}_{h} - P_{h})) d t \\ \leq & C {∥ {\tilde{P}}_{h} - P_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(63)

Next, for $L_{6}$ , we obtain

\begin{array}{rcl} L_{6} & = & \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) {\tilde{P}}_{h} - ϕ^{'} (y (U_{h})) P_{h}, e^{p}) d t \\ = & \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) ({\tilde{P}}_{h} - P_{h}), e^{p}) d t + \int_{0}^{T} ((ϕ^{'} ({\tilde{Y}}_{h}) - ϕ^{'} (Y_{h})) P_{h}, e^{p}) d t \\ + \int_{0}^{T} ((ϕ^{'} (Y_{h}) - ϕ^{'} (y (U_{h}))) P_{h}, e^{p}) d t \\ = & \int_{0}^{T} (ϕ^{'} ({\tilde{Y}}_{h}) ({\tilde{P}}_{h} - P_{h}), e^{p}) d t + \int_{0}^{T} ((ϕ^{'} ({\tilde{Y}}_{h}) - ϕ^{'} (Y_{h})) P_{h}, e^{p}) d t \\ + \int_{0}^{T} ({\tilde{ϕ}}^{''} (Y_{h}) (Y_{h} - y (U_{h})) P_{h}, e^{p}) d t \\ \leq & C {∥ y (U_{h}) - Y_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C {∥ Y_{h} - {\tilde{Y}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \\ + C {∥ P_{h} - {\tilde{P}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + C δ {∥ e^{p} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} . \end{array}

(64)

Finally, choosing δ sufficiently small and combining inequalities (58)-(64) with Theorem 3.1, we obtain

{∥ p (U_{h}) - {\tilde{P}}_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C \sum_{i = 1}^{11} η_{i}^{2} .

(65)

This completes the proof. □

For given $u \in K$ , let M be the inverse operator of the state equation (12) such that $y (u) = M B u$ is the solution of the state equation (12). Similarly, for given $U_{h} \in K^{h}$ , $Y_{h} (U_{h}) = M_{h} B U_{h}$ is the solution of the discrete state equation (36). Let

\begin{matrix} S (u) = \frac{1}{2} {∥ M B u - y_{0} ∥}^{2} + \frac{α}{2} {∥ u ∥}^{2}, \\ S_{h} (U_{h}) = \frac{1}{2} {∥ M_{h} B U_{h} - y_{0} ∥}^{2} + \frac{α}{2} {∥ U_{h} ∥}^{2} . \end{matrix}

It is clear that S and $S_{h}$ are well defined and continuous on K and $K^{h}$, respectively. Also, the functional $S_{h}$ can be naturally extended to K. Then (9) and (25) can be represented as

\min_{u \in K} \{ S (u) \},

(66)

\min_{U_{h} \in K^{h}} \{ S_{h} (U_{h}) \} .

(67)

It can be shown that

\begin{matrix} (S^{'} (u), v) = (α u + B^{*} p, v), \\ (S^{'} (U_{h}), v) = (α U_{h} + B^{*} p (U_{h}), v), \\ (S_{h}^{'} (U_{h}), v) = (α U_{h} + B^{*} {\tilde{P}}_{h}, v), \end{matrix}

where $p (U_{h})$ is the solution of equations (41)-(44).

In many applications, $S (\cdot)$ is uniformly convex near the solution u (see, e.g., [21]). The convexity of $S (\cdot)$ is closely related to the second-order sufficient optimality conditions of the control problem, which are assumed in many studies of numerical methods for such problems. If $S (\cdot)$ is uniformly convex, then there is a $c > 0$ such that

\int_{0}^{T} (S^{'} (u) - S^{'} (U_{h}), u - U_{h}) \geq c {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2},

(68)

where u and $U_{h}$ are the solutions of (66) and (67), respectively. We assume the above inequality throughout this paper.

In order to obtain sharp a posteriori error estimates, we divide $\partial Ω$ into three subsets at each time level $t_{i}$ :

\begin{matrix} \partial Ω_{i}^{-} = \{ x \in \partial Ω : (B^{*} {\tilde{P}}_{h}) (x, t_{i}) \leq 0 \}, \\ \partial Ω_{i} = \{ x \in \partial Ω : (B^{*} {\tilde{P}}_{h}) (x, t_{i}) > 0, U_{h}^{i} = 0 \}, \\ \partial Ω_{i}^{+} = \{ x \in \partial Ω : (B^{*} {\tilde{P}}_{h}) (x, t_{i}) > 0, U_{h}^{i} > 0 \} . \end{matrix}

It is clear that the three subsets are pairwise disjoint and that $\partial Ω = \partial Ω_{i}^{-} \cup \partial Ω_{i} \cup \partial Ω_{i}^{+}$ , $i = 1, 2, \dots, N$ .
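The partition above can be illustrated by a minimal sketch. It assumes that, at a fixed time level $t_{i}$ , the values of $B^{*} {\tilde{P}}_{h}$ and $U_{h}^{i}$ have already been sampled at the same boundary points (for instance edge midpoints) into arrays; the names `classify_boundary`, `bp` and `u` are hypothetical and do not come from the paper or from AFEPACK.

```python
import numpy as np

def classify_boundary(bp, u):
    """Return boolean masks for the three boundary subsets at one time level.

    bp : sampled values of B^* P_h at boundary points (hypothetical input)
    u  : sampled values of the discrete control U_h^i, assumed u >= 0
    """
    bp = np.asarray(bp, dtype=float)
    u = np.asarray(u, dtype=float)
    minus = bp <= 0.0                 # points in the subset with B^* P_h <= 0
    zero = (bp > 0.0) & (u == 0.0)    # B^* P_h > 0 and inactive control U_h = 0
    plus = (bp > 0.0) & (u > 0.0)     # B^* P_h > 0 and U_h > 0
    return minus, zero, plus

# Illustrative data only.
minus, zero, plus = classify_boundary([-0.3, 0.2, 0.5, 0.0], [0.4, 0.0, 0.1, 0.0])
```

Because $U_{h}^{i} \geq 0$ in $K^{h}$ , every sample point falls into exactly one of the three masks, which mirrors the disjoint decomposition of $\partial Ω$ .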

Let $p (U_{h})$ be the solution of (41)-(44). We establish the following error estimate, which can be proved similarly to the proofs given in [22].

Theorem 3.3 Let u and $U_{h}$ be the solutions of (66) and (67), respectively. Then

{∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \leq C (η_{12}^{2} + {∥ {\tilde{P}}_{h} - p (U_{h}) ∥}_{L^{2} (J; H^{1} (\partial Ω))}^{2}),

(69)

where

η_{12}^{2} = \sum_{i = 1}^{N} \int_{t_{i - 1}}^{t_{i}} \int_{\partial Ω_{i}^{-}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} .
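As a rough illustration of how $η_{12}^{2}$ might be evaluated, the following sketch assumes a one-point quadrature rule per boundary edge and per time step, so that $B^{*} {\tilde{P}}_{h}$ and $U_{h}$ are each represented by one value per edge and time level. The function name and the array layout are assumptions made for this example only.

```python
import numpy as np

def eta12_squared(bp, u, edge_len, dt, alpha):
    """Approximate eta_12^2 with one quadrature point per edge and time step.

    bp, u    : arrays of shape (N, E), values of B^* P_h and U_h on time
               level i and boundary edge e (hypothetical layout)
    edge_len : array of shape (E,), lengths of the boundary edges
    dt       : array of shape (N,), time step sizes t_i - t_{i-1}
    alpha    : regularization parameter of the cost functional
    """
    bp = np.asarray(bp, dtype=float)
    u = np.asarray(u, dtype=float)
    residual = bp + alpha * u
    on_minus = bp <= 0.0                      # only the "-" subset contributes
    per_edge = np.where(on_minus, residual**2, 0.0) * np.asarray(edge_len, dtype=float)
    return float(np.sum(per_edge.sum(axis=1) * np.asarray(dt, dtype=float)))

# Illustrative data only: two time levels, two boundary edges.
value = eta12_squared(bp=[[-1.0, 2.0], [-0.5, -0.5]],
                      u=[[0.5, 0.0], [0.0, 1.0]],
                      edge_len=[0.5, 0.5], dt=[0.1, 0.1], alpha=1.0)
```

In an adaptive loop the per-edge contributions, rather than their sum, would typically be kept so that individual boundary elements can be marked for refinement.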

Proof It follows from the inequality (68) that

\begin{matrix} c {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \\ \leq \int_{0}^{T} (S^{'} (u), u - U_{h}) - (S^{'} (U_{h}), u - U_{h}) d t \\ \leq - \int_{0}^{T} (S^{'} (U_{h}), u - U_{h}) d t \\ = \int_{0}^{T} (S_{h}^{'} (U_{h}), U_{h} - u) d t + \int_{0}^{T} (S_{h}^{'} (U_{h}) - S^{'} (U_{h}), u - U_{h}) d t . \end{matrix}

(70)

Note that

\begin{matrix} \int_{0}^{T} (S_{h}^{'} (U_{h}), U_{h} - u) d t \\ = \sum_{i = 1}^{N} \int_{t_{i - 1}}^{t_{i}} \int_{\partial Ω_{i}^{-}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - u) \\ + \sum_{i = 1}^{N} \int_{t_{i - 1}}^{t_{i}} \int_{\partial Ω_{i}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (- u) \\ + \sum_{i = 1}^{N} \int_{t_{i - 1}}^{t_{i}} \int_{\partial Ω_{i}^{+}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - u) . \end{matrix}

(71)

It is easy to see that

\begin{matrix} \int_{\partial Ω_{i}^{-}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - u) \\ \leq C \int_{\partial Ω_{i}^{-}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \\ \leq C η_{12}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} . \end{matrix}

(72)
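The step behind (72), which is used again in (76) and (78), is Young's inequality with a weight δ. For real numbers a, b and any $δ > 0$ ,

a b \leq \frac{1}{4 δ} a^{2} + δ b^{2},

which follows from ${(\frac{a}{2 \sqrt{δ}} - \sqrt{δ} b)}^{2} \geq 0$ . Taking $a = B^{*} {\tilde{P}}_{h} + α U_{h}$ and $b = U_{h} - u$ and integrating gives the first inequality in (72) with $C = \frac{1}{4 δ}$ . The parameter δ is arbitrary, so it can be chosen small enough that the terms $δ {∥ u - U_{h} ∥}^{2}$ are absorbed into the left-hand side of (70).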

Since $U_{h}$ is piecewise constant, $U_{h} |_{s} > 0$ if $s \cap \partial Ω_{i}^{+}$ is not empty. If $U_{h} |_{s} > 0$ , there exist $ε > 0$ and $β \in U^{h}$ such that $β \geq 0$ , ${∥ β ∥}_{L^{\infty} (s)} = 1$ and $(U_{h} - ε β) |_{s} \geq 0$ . For example, one can always find such a β among the shape functions on s. Hence, ${\hat{u}}_{h} \in K^{h}$ , where ${\hat{u}}_{h} = U_{h} - ε β$ for $x \in s$ and ${\hat{u}}_{h} = U_{h}$ otherwise. Then it follows from (40) that

\begin{matrix} \int_{s} (B^{*} {\tilde{P}}_{h} + α U_{h}) β \\ = ε^{- 1} \int_{s} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - (U_{h} - ε β)) \\ \leq ε^{- 1} \int_{\partial Ω} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - (U_{h} - ε β)) \leq 0 . \end{matrix}

(73)

Note that on $\partial Ω_{i}^{+}$ , $B^{*} {\tilde{P}}_{h} + α U_{h} \geq B^{*} {\tilde{P}}_{h} > 0$ , and from (73) we have that

\begin{array}{rcl} \int_{s \cap \partial Ω_{i}^{+}} | B^{*} {\tilde{P}}_{h} + α U_{h} | β & = & \int_{s \cap \partial Ω_{i}^{+}} (B^{*} {\tilde{P}}_{h} + α U_{h}) β \\ & \leq & - \int_{s \cap \partial Ω_{i}^{-}} (B^{*} {\tilde{P}}_{h} + α U_{h}) β \leq \int_{s \cap \partial Ω_{i}^{-}} | B^{*} {\tilde{P}}_{h} + α U_{h} | . \end{array}

(74)

Let $\hat{s}$ be the reference element of s, $s^{0} = s \cap \partial Ω_{i}^{+}$ , and let ${\hat{s}}^{0} \subset \hat{s}$ be the part mapped from $s^{0}$ . Note that ${(\int_{s} {| \cdot |}^{2})}^{1 / 2}$ and $\int_{s} | \cdot | β$ are both norms on $L^{2} (s)$ . In such a case, for the function β fixed above, it follows from the equivalence of norms in a finite-dimensional space that

\begin{matrix} \int_{s \cap \partial Ω_{i}^{+}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} \\ = \int_{s^{0}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} \leq C h_{s}^{2} \int_{{\hat{s}}^{0}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} \\ \leq C h_{s}^{2} {(\int_{{\hat{s}}^{0}} | B^{*} {\tilde{P}}_{h} + α U_{h} | β)}^{2} \leq C h_{s}^{- 2} {(\int_{s \cap \partial Ω_{i}^{+}} | B^{*} {\tilde{P}}_{h} + α U_{h} | β)}^{2} \\ \leq C h_{s}^{- 2} {(\int_{s \cap \partial Ω_{i}^{-}} | B^{*} {\tilde{P}}_{h} + α U_{h} |)}^{2} \leq C \int_{s \cap \partial Ω_{i}^{-}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2}, \end{matrix}

(75)

where the constant C can be made independent of β, since it is always possible to find the required β among the shape functions on s. Consequently,

\begin{matrix} \int_{\partial Ω_{i}^{+}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (U_{h} - u) \\ \leq C \int_{\partial Ω_{i}^{+}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \\ \leq C \int_{\partial Ω_{i}^{-}} {| B^{*} {\tilde{P}}_{h} + α U_{h} |}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \\ \leq C η_{12}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} . \end{matrix}

(76)

It follows from the definition of $\partial Ω_{i}$ that $B^{*} {\tilde{P}}_{h} + α U_{h} > 0$ on $\partial Ω_{i}$ . Since $- u \leq 0$ , we have that

\int_{\partial Ω_{i}} (B^{*} {\tilde{P}}_{h} + α U_{h}) (- u) \leq 0 .

(77)

It is easy to show that

\begin{matrix} (S_{h}^{'} (U_{h}) - S^{'} (U_{h}), u - U_{h}) \\ = (B^{*} {\tilde{P}}_{h} + α U_{h}, u - U_{h}) - (B^{*} p (U_{h}) + α U_{h}, u - U_{h}) \\ = (B^{*} ({\tilde{P}}_{h} - p (U_{h})), u - U_{h}) \\ \leq C {∥ {\tilde{P}}_{h} - p (U_{h}) ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} \\ \leq C {∥ {\tilde{P}}_{h} - p (U_{h}) ∥}_{L^{2} (J; H^{1} (\partial Ω))}^{2} + δ {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} . \end{matrix}

(78)

Therefore, (69) follows from (70)-(72) and (76)-(78). □

Hence, we combine Theorems 3.1-3.3 to conclude the following.

Theorem 3.4 Let $(y, p, u)$ and $(Y_{h}, P_{h}, U_{h})$ be the solutions of (12)-(16) and (36)-(40), respectively. Then

\begin{matrix} {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} + {∥ y - Y_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} + {∥ p - P_{h} ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \\ \leq C \sum_{i = 1}^{12} η_{i}^{2}, \end{matrix}

(79)

where $η_{1}, η_{2}, \dots, η_{12}$ are defined in Theorems 3.1-3.3.

Proof From (12)-(15) and (41)-(44), we obtain the error equations

(y_{t} - y_{t} (U_{h}), w) + a (y - y (U_{h}), w) + (ϕ (y) - ϕ (y (U_{h})), w) = (B (u - U_{h}), w),

(80)

\begin{matrix} - (p_{t} - p_{t} (U_{h}), q) + a (q, p - p (U_{h})) + (ϕ^{'} (y) p - ϕ^{'} (y (U_{h})) p (U_{h}), q) \\ = (y - y (U_{h}), q), \end{matrix}

(81)

for all $w \in V$ and $q \in V$ . Thus it follows from (80)-(81) that

(y_{t} - y_{t} (U_{h}), w) + a (y - y (U_{h}), w) + (ϕ^{'} (y) (y - y (U_{h})), w) = (B (u - U_{h}), w),

(82)

\begin{matrix} - (p_{t} - p_{t} (U_{h}), q) + a (q, p - p (U_{h})) + (ϕ^{'} (y (U_{h})) (p - p (U_{h})), q) \\ = ({\tilde{ϕ}}^{''} (y (U_{h})) (y (U_{h}) - y) p, q) . \end{matrix}

(83)

By using the stability results in [23], we can prove that

{∥ y - y (U_{h}) ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2}

(84)

and

{∥ p - p (U_{h}) ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C {∥ y - y (U_{h}) ∥}_{L^{2} (J; H^{1} (Ω))}^{2} \leq C {∥ u - U_{h} ∥}_{L^{2} (J; L^{2} (\partial Ω))}^{2} .

(85)

Finally, combining Theorems 3.1-3.3 and (84)-(85) leads to (79). □

4 Numerical example

In this section, we use the a posteriori error estimates derived above as indicators for the adaptive finite element approximation. The optimization problem is solved numerically by a preconditioned projection algorithm, with codes developed based on AFEPACK. The optimal control problem is

\begin{matrix} min_{u (t) \in K} {\int_{0}^{T} (\frac{1}{2} {∥ y - y_{0} ∥}^{2} + \frac{1}{2} {∥ u ∥}^{2}) d t} \\ y_{t} - Δ y + y^{3} = f, x \in Ω; \nabla y \cdot n = u, x \in \partial Ω, y (x, 0) = 0, x \in Ω . \end{matrix}

In the example, we choose the domain $Ω = [0, 1] \times [0, 1]$ and $K = \{ u \in L^{2} (J; L^{2} (\partial Ω)) : u \geq 0 \}$ . Let Ω be partitioned into $T_{h}$ as described in Section 2. We use $η_{12}$ as the refinement indicator for the control mesh and $η_{1}$ - $η_{11}$ as the refinement indicators for the state and co-state meshes.

For the constrained optimization problem ${min}_{u \in K} S (u)$ , where $S (u)$ is a convex functional on U, the iterative scheme reads ( $n = 0, 1, 2, \dots$ )

b (u_{n + \frac{1}{2}}, v) = b (u_{n}, v) - ρ_{n} (S^{'} (u_{n}), v), u_{n + 1} = P_{K}^{b} (u_{n + \frac{1}{2}}), \forall v \in K,

(86)

where $b (\cdot, \cdot)$ is a symmetric and positive definite bilinear form such that there exist constants $c_{0}$ and $c_{1}$ satisfying

b (u, u) \geq c_{0} {∥ u ∥}_{U}^{2}, | b (u, v) | \leq c_{1} {∥ u ∥}_{U} {∥ v ∥}_{U}, \forall u, v \in U,

(87)

and the projection operator $P_{K}^{b} : U \to K$ is defined as follows. For given $w \in U$ , find $P_{K}^{b} w \in K$ such that

b (P_{K}^{b} w - w, P_{K}^{b} w - w) = min_{u \in K} b (u - w, u - w) .

(88)
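The iteration (86) can be sketched in a finite-dimensional setting. The example below is a toy stand-in, not the solver used in the paper: it assumes that $b (\cdot, \cdot)$ is the Euclidean inner product, so that $P_{K}^{b}$ for $K = \{ u \geq 0 \}$ reduces to componentwise clipping, and it replaces the control-to-state map by a small hypothetical matrix `A`.

```python
import numpy as np

def projected_gradient(grad, u0, rho=0.8, tol=1e-10, max_iter=500):
    """Projected gradient iteration for min S(u) subject to u >= 0.

    grad : callable returning S'(u) as a vector
    rho  : fixed step size (rho_n = rho for all n)
    """
    u = np.asarray(u0, dtype=float)
    for _ in range(max_iter):
        u_half = u - rho * grad(u)        # intermediate iterate u_{n+1/2}
        u_new = np.maximum(u_half, 0.0)   # projection onto K = {u >= 0}
        if np.linalg.norm(u_new - u) <= tol:
            return u_new
        u = u_new
    return u

# Toy quadratic S(u) = 0.5*|A u - y0|^2 + 0.5*alpha*|u|^2 (illustrative data).
A = np.array([[0.5, 0.1], [0.1, 0.5]])
y0 = np.array([1.0, -1.0])
alpha = 1.0
grad = lambda u: A.T @ (A @ u - y0) + alpha * u
u_star = projected_gradient(grad, np.zeros(2))
```

In this toy problem the constraint is active in the second component, so the computed minimizer has $u_{2} = 0$ , while the first component satisfies the reduced stationarity condition. A different choice of $b (\cdot, \cdot)$ would change both the update for $u_{n + \frac{1}{2}}$ and the projection.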

The bilinear form $b (\cdot, \cdot)$ provides suitable preconditioning for the projection algorithm. An application of (86) to the discretized nonlinear parabolic boundary optimal control problem yields the following algorithm:

b (u_{n + \frac{1}{2}}^{i}, v_{h}) = b (u_{n}^{i}, v_{h}) - ρ_{n} (u_{n}^{i} + p_{n}^{i}, v_{h}), \forall v_{h} \in K_{i}^{h},

(89)

(\frac{y_{n}^{i} - y_{n}^{i - 1}}{Δ t}, w_{h}) + a (y_{n}^{i}, w_{h}) + ({(y_{n}^{i})}^{3}, w_{h}) = (f, w_{h}) + {(u_{n}^{i}, w_{h})}_{U}, \forall w_{h} \in V_{i}^{h},

(90)

(\frac{p_{n}^{i - 1} - p_{n}^{i}}{Δ t}, q_{h}) + a (q_{h}, p_{n}^{i}) + (3 {(y_{n}^{i})}^{2} p_{n}^{i}, q_{h}) = (y_{n}^{i} - y_{0}, q_{h}), \forall q_{h} \in V_{i}^{h},

(91)

u_{n + 1}^{i} = P_{K}^{b} (u_{n + \frac{1}{2}}^{i}), u_{n + \frac{1}{2}}^{i}, u_{n}^{i} \in K_{i}^{h} .

(92)
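Each time level of the state equation (90) requires a nonlinear solve because of the cubic term. The following sketch shows one backward Euler step solved by Newton's method. It is a simplified stand-in under stated assumptions: the mass matrix is taken as the identity, `A` is a hypothetical stiffness-type matrix, `rhs` collects the source and control contributions, and a dense direct solve replaces the algebraic multigrid solver mentioned in the text.

```python
import numpy as np

def implicit_euler_step(y_prev, A, rhs, dt, tol=1e-10, max_newton=20):
    """Solve (y - y_prev)/dt + A y + y**3 = rhs for y by Newton's method."""
    y = np.array(y_prev, dtype=float)
    n = y.size
    for _ in range(max_newton):
        residual = (y - y_prev) / dt + A @ y + y**3 - rhs
        if np.linalg.norm(residual) <= tol:
            break
        # Jacobian of the residual: I/dt + A + diag(3 y^2)
        jacobian = np.eye(n) / dt + A + np.diag(3.0 * y**2)
        y = y - np.linalg.solve(jacobian, residual)
    return y

# Illustrative 1D finite-difference Laplacian with 5 interior nodes.
n = 5
h = 1.0 / (n + 1)
A = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2
y1 = implicit_euler_step(np.zeros(n), A, np.ones(n), dt=0.1)
```

The linearized term $3 {(y_{n}^{i})}^{2}$ that appears in the Jacobian is the same coefficient that multiplies $p_{n}^{i}$ in the co-state equation (91), which is linear in $p_{n}^{i}$ once $y_{n}^{i}$ is known.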

The main computational effort lies in solving the state and co-state equations and in computing the projection $P_{K}^{b} u_{n + \frac{1}{2}}^{i}$ . In this paper we use a fast algebraic multigrid solver for the state and co-state equations, so the key to saving computing time is to compute $P_{K}^{b} u_{n + \frac{1}{2}}^{i}$ efficiently. For piecewise constant elements with $K^{h} = \{ u_{h} \in K : u_{h} \geq 0 \}$ and $b (u, v) = {(u, v)}_{U}$ , we have

P_{K}^{b} u_{n + \frac{1}{2}}^{i} |_{T} = max (0, avg (u_{n + \frac{1}{2}}^{i}) |_{T}),

where $avg (u_{n + \frac{1}{2}}^{i}) |_{T}$ is the average of $u_{n + \frac{1}{2}}^{i}$ over T. In solving our discretized optimal control problem, we use the preconditioned projection gradient method (89)-(92) with $b (u, v) = {(u, v)}_{K^{h}}$ and a fixed step size $ρ = 0.8$ . In the numerical simulation, we use a piecewise linear finite element space for the approximation of y and p, and a piecewise constant space for u.
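The projection formula above can be sketched as follows, assuming that $u_{n + \frac{1}{2}}^{i}$ is available at quadrature points together with quadrature weights and the index of the element T containing each point. The function name and the input layout are assumptions made for this illustration.

```python
import numpy as np

def project_piecewise_constant(values, weights, element_of):
    """Return max(0, average over T) for each element T.

    values     : samples of u_{n+1/2} at quadrature points
    weights    : quadrature weights of these points
    element_of : index of the element containing each point
    """
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    element_of = np.asarray(element_of, dtype=int)
    n_elem = int(element_of.max()) + 1
    # Element measures and element integrals via weighted bin counts.
    measure = np.bincount(element_of, weights=weights, minlength=n_elem)
    integral = np.bincount(element_of, weights=weights * values, minlength=n_elem)
    return np.maximum(0.0, integral / measure)

# Illustrative data only: two elements with two quadrature points each.
projected = project_piecewise_constant(values=[1.0, -3.0, 2.0, 4.0],
                                       weights=[0.5, 0.5, 0.25, 0.75],
                                       element_of=[0, 0, 1, 1])
```

The cost is linear in the number of quadrature points, which is consistent with the remark that the projection is cheap for piecewise constant controls.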

It can be seen from Table 1 that on the adaptive meshes fewer degrees of freedom are needed to achieve a given reduction of the control error. This indicates that the a posteriori error estimates are effective refinement indicators for the parabolic boundary optimal control problem and that the adaptive finite element method is more efficient than uniform refinement.

Table 1 Comparison of uniform mesh and adaptive mesh


Author’s contributions

ZL participated in the design of the study and drafted the manuscript.

References

1. Falk RS: Approximation of a class of optimal control problems with order of convergence estimates. J. Math. Anal. Appl. 1973, 44: 28-47. 10.1016/0022-247X(73)90022-X
2. Geveci T: On the approximation of the solution of an optimal control problem governed by an elliptic equation. RAIRO. Anal. Numér. 1979, 13: 313-328.
3. Arada N, Casas E, Tröltzsch F: Error estimates for the numerical approximation of a semilinear elliptic control problem. Comput. Optim. Appl. 2002, 23: 201-229. 10.1023/A:1020576801966
4. French DA, King JT: Approximation of an elliptic control problem by the finite element method. Numer. Funct. Anal. Optim. 1991, 12: 299-315. 10.1080/01630569108816430
5. Gunzburger MD, Hou S: Finite dimensional approximation of a class of constrained nonlinear control problems. SIAM J. Control Optim. 1996, 34: 1001-1043. 10.1137/S0363012994262361
6. Neittaanmaki P, Tiba D: Optimal Control of Nonlinear Parabolic Systems: Theory, Algorithms and Applications. Dekker, New York; 1994.
7. Gong W, Yan N: A posteriori error estimates for boundary control problems governed by the parabolic partial differential equations. J. Comput. Math. 2009, 27: 68-88.
8. Liu H, Yan N: Superconvergence and a posteriori error estimates for boundary control by Stokes equations. J. Comput. Math. 2006, 24: 343-356.
9. Chen Y, Lu Z: A posteriori error estimates for semilinear boundary control problems. Lect. Notes Comput. Sci. Eng. 2011, 78: 455-462. 10.1007/978-3-642-11304-8_53
10. Lu Z: A posteriori error estimates of finite element methods for nonlinear quadratic boundary optimal control problem. Numer. Anal. Appl. 2011, 4: 210-222. 10.1134/S1995423911030037
11. Liu W, Yan N: A posteriori error estimates for some model boundary control problems. J. Comput. Appl. Math. 2000, 120: 159-173. 10.1016/S0377-0427(00)00308-3
12. Liu W, Yan N: A posteriori error estimates for convex boundary control problems. SIAM J. Numer. Anal. 2001, 39: 73-99. 10.1137/S0036142999352187
13. Alt W, Mackenroth U: Convergence of finite element approximations to state constrained convex parabolic boundary control problems. SIAM J. Control Optim. 1989, 27: 718-736. 10.1137/0327038
14. Arada N, Casas E, Tröltzsch F: Error estimates for the numerical approximation of a boundary semilinear elliptic control problem. Comput. Optim. Appl. 2005, 31: 193-219. 10.1007/s10589-005-2180-2
15. Chen Y, Lu Z: Error estimates of fully discrete mixed finite element methods for semilinear quadratic parabolic optimal control problems. Comput. Methods Appl. Mech. Eng. 2010, 199: 1415-1423. 10.1016/j.cma.2009.11.009
16. Chen Y, Lu Z: Error estimates for parabolic optimal control problem by fully discrete mixed finite element methods. Finite Elem. Anal. Des. 2010, 46: 957-965. 10.1016/j.finel.2010.06.011
17. Lu Z, Chen Y: $L^{\infty}$ -error estimates of triangular mixed finite element methods for optimal control problem govern by semilinear elliptic equation. Numer. Anal. Appl. 2009, 2: 74-86. 10.1134/S1995423909010078
18. Lu Z, Chen Y: A posteriori error estimates of triangular mixed finite element methods for semilinear optimal control problems. Adv. Appl. Math. Mech. 2009, 1: 242-256.
19. Lions JL: Optimal Control of Systems Governed by Partial Differential Equations. Springer, Berlin; 1971.
20. Milner FA: Mixed finite element methods for quasilinear second-order elliptic problems. Math. Comput. 1985, 44: 303-320. 10.1090/S0025-5718-1985-0777266-1
21. Liu W, Yan N: A posteriori error estimates for control problems governed by nonlinear elliptic equation. Adv. Comput. Math. 2001, 15: 285-309. 10.1023/A:1014239012739
22. Chen Y, Liu W: A posteriori error estimates for mixed finite element solutions of convex optimal control problems. J. Comput. Appl. Math. 2008, 211: 76-89. 10.1016/j.cam.2006.11.015
23. Thomée V: Galerkin Finite Element Methods for Parabolic Problems. Springer, Berlin; 1997.


Acknowledgements

This work is supported by National Science Foundation of China (11201510), Mathematics TianYuan Special Funds of the National Natural Science Foundation of China (11126329), China Postdoctoral Science Foundation funded project (2011M500968), Natural Science Foundation Project of CQ CSTC (cstc2012jjA00003), and Natural Science Foundation of Chongqing Municipal Education Commission (KJ121113). The author expresses his thanks to the referees for their helpful suggestions, which led to improvements of the presentation.

Author information

Authors and Affiliations

School of Mathematics and Statistics, Chongqing Three Gorges University, Chongqing, 404000, P.R. China
Zuliang Lu
College of Civil Engineering and Mechanics, Xiangtan University, Xiangtan, 411105, P.R. China
Zuliang Lu


Corresponding author

Correspondence to Zuliang Lu.

Additional information

Competing interests

The author declares that he has no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


About this article

Cite this article

Lu, Z. Adaptive fully-discrete finite element methods for nonlinear quadratic parabolic boundary optimal control. Bound Value Probl 2013, 72 (2013). https://doi.org/10.1186/1687-2770-2013-72


Received: 18 January 2013
Accepted: 14 March 2013
Published: 04 April 2013
DOI: https://doi.org/10.1186/1687-2770-2013-72
