License: arXiv.org perpetual non-exclusive license
arXiv:2603.26806v1 [math.DS] 26 Mar 2026

Lagrangian chaos for the 2D Navier-Stokes equations driven by mildly degenerate noise

Dengdi Chen1, Yan Zheng1
1College of Sciences, National University of Defense Technology,
Changsha, 410073, People’s Republic of China
E-mail addresses: yanzhengyl@163.com
Abstract

We consider the 2D incompressible Navier–Stokes equations driven by mildly degenerate noise acting only on finitely many low Fourier modes, corresponding to large-scale stirring. For this system, we prove that the top Lyapunov exponent of the associated Lagrangian flow is strictly positive, thereby establishing Lagrangian chaos. This result is obtained within the framework of random dynamical systems, combining the multiplicative ergodic theorem with the refined Furstenberg criterion from [26]. Unlike the method in [26] for handling highly degenerate noise, this paper develops a unified analytical framework that combines low-mode control, finite-dimensional Malliavin calculus, and dissipation in the high modes. By constructing a finite-dimensional partial Malliavin matrix and proving its non-degeneracy, we avoid the technical complexity of performing Malliavin analysis on the full phase space and simultaneously overcome the degeneracy introduced by the manifold variables. Furthermore, the setup of mildly degenerate noise ensures natural controllability in the low-frequency subsystem, while controllability in the manifold directions requires only first order Lie brackets, significantly simplifying the Lie algebraic computations.
Keywords: Lagrangian chaos; mildly degenerate noise; Furstenberg’s criterion; stochastic Navier-Stokes equations.
Mathematics Subject Classifications: 37A25; 37H15; 37L30; 35R60

1 Introduction

The study of chaotic behavior in dynamical systems has been an important topic in recent decades, playing a significant role in fluid dynamics and providing key insights into turbulence phenomena [1]. Chaos is generally characterized by sensitive dependence on initial conditions, topological transitivity, and the density of periodic points. Among these, the study of sensitive dependence focuses on a core issue: whether a positive top Lyapunov exponent exists on a subset of the phase space with positive measure. This question remains a central concern for both mathematicians and physicists.

In this paper, we study the stochastic flow of diffeomorphisms 𝐱t:𝕋2𝕋2\mathbf{x}_{t}:\mathbb{T}^{2}\to\mathbb{T}^{2}, t0t\geq 0, defined by the random ODE

ddt𝐱t=ut(𝐱t),𝐱0=x.\frac{d}{dt}\mathbf{x}_{t}=u_{t}(\mathbf{x}_{t}),\quad\mathbf{x}_{0}=x. (1.1)

Here, the random velocity field 𝐮t:𝕋2𝕋2\mathbf{u}_{t}:\mathbb{T}^{2}\to\mathbb{T}^{2} at time t0t\geq 0 evolves according to the following the incompressible stochastic Navier-Stokes equations

{u˙t=νΔututut+k𝒵0qkΔ1ek(x)dWtk,ut=0,u0,\displaystyle\begin{cases}\dot{u}_{t}=\nu\Delta u_{t}-u_{t}\cdot\nabla u_{t}+\sum_{k\in\mathcal{Z}_{0}}q_{k}\nabla^{\perp}\Delta^{-1}e_{k}(x)\,dW^{k}_{t},\\ \nabla\cdot u_{t}=0,\\ u_{0}\in\mathcal{H},\end{cases} (1.2)

where 𝒵02{0}\mathcal{Z}_{0}\subseteq\mathbb{Z}^{2}\setminus\{0\} is the finite collection of forced modes, {Wtk}k𝒵0\{W^{k}_{t}\}_{k\in\mathcal{Z}_{0}} are a family of independent one-dimensional Wiener processes on a common canonical filtered probability space (Ω,,(t),)(\Omega,\mathcal{F},(\mathcal{F}_{t}),\mathbb{P}), ν>0\nu>0, qk0q_{k}\neq 0 for each k𝒵0k\in\mathcal{Z}_{0}, {ek}k2\{e_{k}\}_{k\in\mathbb{Z}^{2}} is the real-valued Fourier basis defined in (1.5), and note that k𝒵0qkΔ1ek(x)WtkL02(𝕋2;2)\sum_{k\in\mathcal{Z}_{0}}q_{k}\nabla^{\perp}\Delta^{-1}e_{k}(x)\,W^{k}_{t}\in L^{2}_{0}(\mathbb{T}^{2};\mathbb{R}^{2}). Here, :=H05(𝕋2;2)\mathcal{H}:=H^{5}_{0}(\mathbb{T}^{2};\mathbb{R}^{2}) and L02(𝕋2;2)L^{2}_{0}(\mathbb{T}^{2};\mathbb{R}^{2}) denote the divergence-free, mean-zero vector fields in H5(𝕋2;2)H^{5}(\mathbb{T}^{2};\mathbb{R}^{2}) and L2(𝕋2;2)L^{2}(\mathbb{T}^{2};\mathbb{R}^{2}) respectively. The spatial variable x=(x1,x2)x=(x_{1},x_{2}) belongs to a two-dimensional torus 𝕋2\mathbb{T}^{2}. That is, we impose periodic boundary conditions in space. We let constants freely depend on 𝒵0,ν,maxk|qk|+|qk|1\mathcal{Z}_{0},\nu,\max_{k}|q_{k}|+|q_{k}|^{-1}.

The pair (ut,𝐱t)(u_{t},\mathbf{x}_{t}), which comprises the velocity and position information of fluid particles at time tt, is referred to as the Lagrangian process. In the context of the system (1.1)-(1.2), the chaoticity of the Lagrangian flow 𝐱t\mathbf{x}_{t} (or Lagrangian chaos) generally corresponds to the property that its top Lyapunov exponent is strictly positive.

In this paper, we consider mildly degenerate noise which, at the very least, acts on all unstable directions in the sense that 𝒵0:={k2, 0<|k|N}\mathcal{Z}_{0}:=\{k\in\mathbb{Z}^{2},\,0<|k|\leq N_{*}\}, with N:=N(0,ν)N_{*}:=N_{*}(\mathcal{E}_{0},\nu) to be specified subsequently, where 0=kqk2\mathcal{E}_{0}:=\sum_{k}q_{k}^{2}. We prove that the dynamical system defined via (1.1) possesses a strictly positive top Lyapunov exponent. More precisely, we establish:

Theorem 1.1.

For system (1.1)-(1.2), there exist an N:=N(0,ν)N_{*}:=N_{*}(\mathcal{E}_{0},\nu) and a deterministic constant λ+>0\lambda_{+}>0 such that the following limit holds:

limt1tlog|Dx𝐱t|=λ+forμ1×a.e.(u0,x,ω),\lim\limits_{t\to\infty}\frac{1}{t}\mathop{\rm log}|D_{x}\mathbf{x}_{t}|=\lambda_{+}\,\,\,\,\text{for}\,\,\,\,\mu^{1}\times\mathbb{P}-a.e.\,(u_{0},x,\omega), (1.3)

where Dx𝐱tD_{x}\mathbf{x}_{t} refers to the Jacobian matrix of 𝐱t:𝕋2𝕋2\mathbf{x}_{t}:\mathbb{T}^{2}\to\mathbb{T}^{2} taken at xx and |Dx𝐱t||D_{x}\mathbf{x}_{t}| denotes its norm, μ1\mu^{1} denotes the unique stationary measure of Lagrangian process (ut,𝐱t)(u_{t},\mathbf{x}_{t}). Here, the positivity of λ+\lambda_{+} indicates exponential sensitivity of Lagrangian trajectories with respect to the initial data, which is the essence of Lagrangian chaos.

Remark 1.2.

In fact, we can present the following stronger result, which helps support findings on scalar turbulence (c.f. [2, 3]):

limt1tlog|Dx𝐱tν|=λ+>0forμP×a.e.(u0,x,v,ω),\lim\limits_{t\to\infty}\frac{1}{t}\mathop{\rm log}|D_{x}\mathbf{x}_{t}\nu|=\lambda_{+}>0\,\,\text{for}\,\,\,\mu^{P}\times\mathbb{P}-a.e.\,(u_{0},x,v,\omega), (1.4)

where μP\mu^{P} denotes the unique stationary measure for projective process (c.f. Definition 1.5).

1.1 Background and motivations

The stochastic two-dimensional incompressible Navier–Stokes equations, defined on a periodic domain, serves an approximation for three-dimensional turbulence in thin domains or as a simplified model of the three-dimensional Navier-Stokes equations. Consequently, this system is widely recognized in mathematics, physics, and engineering as an important fluid model for studying the statistical properties of randomly forced turbulence (see reference [4] and the references therein). To characterize the long-time behavior of fluids under random perturbations, the theory of ergodicity and mixing for stochastic Navier–Stokes equations has been relatively well developed (see [8, 5, 6, 7, 9, 10, 11, 12] and references therein). However, ergodicity and mixing represent only the starting point of research in statistical fluid mechanics. To gain a deeper understanding of the nature of turbulence, further studies from other perspectives are still required.

In recent years, some progress has been made in studying fluid turbulence from the perspective of chaos. Related research has primarily developed along two lines: one based on the velocity field, known as Eulerian chaos, and the other based on particle trajectories, referred to as Lagrangian chaos. Both concepts themselves, as well as the relationship between them, have been extensively discussed in the physics literature (see [17, 1, 18, 19, 20]). Regarding Eulerian chaos, Bedrossian et al. investigated a Galerkin approximations of the two-dimensional Navier–Stokes system driven by degenerate noise, establishing Eulerian chaos and providing quantitative lower bounds [22, 21]. However, the Eulerian chaos of the full stochastic Navier–Stokes system remains an open problem. In terms of Lagrangian chaos, Bedrossian et al. [2] first gave a rigorous proof of Lagrangian chaos induced by a continuous-time stochastic fluid model. Specifically, they established the result for the two-dimensional Navier–Stokes equations driven by time-white noise with non-degenerate excitation acting on the high-frequency modes. Building on this noise setup and the established results, their subsequent work derived a series of important conclusions, including almost sure exponential mixing of passive scalars [23], enhanced dissipation [24], and a rigorous version of Batchelor law [25]. Following [2], Cooperman et al. [26] obtained Lagrangian chaos for the two-dimensional incompressible Navier–Stokes equations driven by highly degenerate noise. Recently, [27] extended this framework to the case of highly degenerate non-Gaussian bounded noise, though conclusions are currently mainly obtained at the discrete-time level.

Overall, the two-dimensional stochastic Navier–Stokes equations have become a fundamental model for studying Lagrangian chaos within a rigorous mathematical framework. At the same time, existing proofs often rely strongly on the noise structure and the corresponding technical apparatus: the argument in [2] depends on the strong Feller property of the Markov semigroup, which typically requires the noise to be non-degenerately excited in high-frequency modes, while the treatment of highly degenerate noise in [26] requires introducing a rather intricate chain of technical steps, including the verification of Hörmander-type conditions and Malliavin calculus. These limitations result in a higher barrier to understanding and, when extending the study to more complex fluid models, necessitate repeating lengthy Lie algebraic computations to reconstruct key steps such as the regularization of the Markov semigroup. Consequently, a natural research direction is to seek to establish Lagrangian chaos under a noise structure that reflects the ‘large-scale stirring mechanism’ –where the noise is non-degenerate primarily in low-frequency directions–while remaining sufficiently non-degenerate to allow for a more transferable proof mechanism.

Based on the above discussion, this paper takes the two-dimensional stochastic incompressible Navier–Stokes equations as the core model and aims to establish a Lagrangian chaos analysis framework that is physically reasonable and technically more transparent. This framework is intended to promote the extension of the relevant theory to more general multi-physics coupled fluid models. The random forcing adopted here is a mildly degenerate noise: it acts only on the first N(ν)N_{*}(\nu) directions, where N(ν)N_{*}(\nu) varies with the viscosity coefficient ν\nu and covers all unstable directions of the system. To our knowledge, this is the first work that studies and establishes Lagrangian chaos for a fluid equation under such a mildly degenerate noise setting. This noise structure combines an intuitive physical interpretation aligned with large-scale stirring with a sufficient number of randomly forced directions. It allows the analysis to proceed with significantly reduced reliance on technical devices such as Hörmander-type Lie algebraic computations, thereby leading to a more direct and transparent line of argument and contributing to the understanding of the chaos generation mechanism.

1.2 Setup and notations

Let 𝕋2=[0,2π]2\mathbb{T}^{2}={[0,2\pi]}^{2} denote the period box. We define the following real-valued Fourier basis on this space.

ek(x):={sin(kx),k+2,cos(kx),k2,e_{k}(x):=\begin{cases}\sin(k\cdot x),&k\in\mathbb{Z}_{+}^{2},\\ \cos(k\cdot x),&k\in\mathbb{Z}_{-}^{2},\end{cases} (1.5)

where +2:={(k1,k2)2:k2>0}{(k1,k2)2:k1>0,k2=0}\mathbb{Z}_{+}^{2}:=\{(k_{1},k_{2})\in\mathbb{Z}^{2}:k_{2}>0\}\bigcup\{(k_{1},k_{2})\in\mathbb{Z}^{2}:k_{1}>0,k_{2}=0\} and 2=+2\mathbb{Z}_{-}^{2}=-\mathbb{Z}_{+}^{2}. For convenience, we rewrite

k𝒵0qkΔ1ek(x)Wtk\sum_{k\in\mathcal{Z}_{0}}q_{k}\nabla^{\perp}\Delta^{-1}e_{k}(x)\,W^{k}_{t}

as

k𝒵0qkγkek(x)Wtk,\sum_{k\in\mathcal{Z}_{0}}q_{k}\gamma_{k}e_{k}(x)\,W^{k}_{t},

where γk:=k/|k|2\gamma_{k}:=-k^{\perp}/{|k|^{2}}. Under our noise assumption, it holds that qk0q_{k}\neq 0 for any k𝒵0k\in\mathcal{Z}_{0}. In other words, we have

qk|k|αfor  0<|k|<N,\displaystyle q_{k}\approx|k|^{-\alpha}\;\;\;\;\text{for}\;\;0<|k|<N_{*}, (1.6)

where α(5,)\alpha\in(5,\infty) is as in the classical coloring assumption (c.f. [2]).

Next, we first present the definition of the base process and recall the well-posedness and ergodicity results required in the subsequent analysis.

Definition 1.3.

The base process refers to the Markov process {ut}t0\{u_{t}\}_{t\geq 0} on \mathcal{H} which solves the incompressible stochastic Navier-Stokes equation (1.2) with initial data given by u0u_{0}.

Proposition 1.4 (Well-posedness and ergodicity for base process,[11]).

For the system (1.2), the following holds for any T>0T>0:

  1. (a)

    For all functions uu\in\mathcal{H} and with probability 1, there exists a unique mild solution utC([0,T];)u_{t}\in C\big([0,T];\mathcal{H}\big) with u0=uu_{0}=u. As a function of the noise sample ωΩ\omega\in\Omega, the solution utu_{t} is measurable, t\mathcal{F}_{t}-adapted, and belongs to Lp(Ω;C([0,T];))L^{p}\big(\Omega;C([0,T];\mathcal{H})\big) for all p1p\geq 1. Lastly, (ut)(u_{t}) itself is a Feller Markov process on \mathcal{H}.

  2. (b)

    The Markov process utu_{t} admits a unique Borel stationary measure μ\mu in \mathcal{H}.

Then, we introduce the Lagrangian process and the projective process studied in this work. Under the assumption that the velocity field uu is sufficiently regular, these processes are well-defined and possess the Markov property, which is ensured by verifying the independent-increments hypothesis (H1) in Subsection 2.1.

Definition 1.5.

Given a geodesically complete Riemannian manifold (M,g^)(M,\hat{g}) and an associated vector field map Θ:H05(𝕋2;2)𝒳loc1(M)\Theta:H^{5}_{0}(\mathbb{T}^{2};\mathbb{R}^{2})\to\mathcal{X}_{\mathrm{loc}}^{1}(M) (where 𝒳loc1(M)\mathcal{X}_{\mathrm{loc}}^{1}({M}) denotes the set of Cloc1C_{\mathrm{loc}}^{1} vector fields on M{M})–associating to any velocity field utu_{t} a vector field Θut\Theta_{u_{t}} on MM–we define the associated Markov process {ut,𝐩t}t0\{u_{t},\mathbf{p}_{t}\}_{t\geq 0} where (ut)(u_{t}) is the base process and 𝐩t\mathbf{p}_{t} solves

𝐩˙t=Θut(𝐩t),𝐩0M.\dot{\mathbf{p}}_{t}=\Theta_{u_{t}}(\mathbf{p}_{t}),\quad\mathbf{p}_{0}\in M. (1.7)

In particular, the Lagrangian process (ut,𝐱t)(u_{t},\mathbf{x}_{t}) and the projective process (ut,𝐱t,vt)(u_{t},\mathbf{x}_{t},v_{t}) correspond to the choices M=T2M=T^{2} and M=T2×S1M=T^{2}\times S^{1}, respectively, with

Θut1(𝐱)=ut(𝐱t),ΘutP(𝐱,v)=(ut(𝐱t),ΠvtDut(𝐱t)vt),\displaystyle\Theta_{u_{t}}^{1}(\mathbf{x})=u_{t}(\mathbf{x}_{t}),\quad\Theta_{u_{t}}^{P}(\mathbf{x},v)=\bigl(u_{t}(\mathbf{x}_{t}),\,\Pi_{v_{t}}Du_{t}(\mathbf{x}_{t})v_{t}\bigr), (1.8)

where Πv=Idvv\Pi_{v}=\text{Id}-v\otimes v is the orthogonal projection from 2\mathbb{R}^{2} onto the tangent space of S1S^{1} (viewing vv as a unit vector in 2\mathbb{R}^{2}).

Remark 1.6.

Note that the flow 𝐩t\mathbf{p}^{t} is a well-defined diffeomorphism since the velocity field utu_{t} belongs to \mathcal{H} (so it is at least C2C^{2} by Sobolev embedding). This gives rise to an t\mathcal{F}_{t}-adapted, Feller Markov process (ut,𝐩t)(u_{t},\mathbf{p}_{t}) on ×M\mathcal{H}\times M defined by (1.7).

1.3 Outline of the proof and contributions

Given that the Lagrangian system associated with the two-dimensional incompressible Navier–Stokes equations is conservative and the forcing is driven by mildly degenerate noise, the analysis in this paper follows the framework of continuous random dynamical systems, the multiplicative ergodic theorem, and the Furstenberg’s criterion. The specific form of the Furstenberg’s criterion adopted here is taken from [26]. Within this framework, we establish a criterion for Lagrangian chaos applicable to the case of mildly degenerate noise (c.f. Theorem 2.6): it suffices to verify that the derivative cocycle 𝒜\mathcal{A} of system (1.1) satisfies the integrability condition (A1), that the Lagrangian process and its projective process satisfy the asymptotic strong Feller property (A2), and that the approximate controllability condition (A3) holds. Here, (A1) follows from regularity estimates for the base process, while (A3) has been established in [2, Section 7] by constructing smooth controls via shear and cellular flows, together with a stability analysis and the positivity of Wiener measure. Consequently, the proof of our main result ultimately reduces to establishing short-time asymptotic gradient estimates for the Markov transition semigroups of the Lagrangian and projective processes under mildly degenerate noise.(c.f. Theorem 3.1).

The key to establishing the asymptotic gradient estimates for the transition semigroup lies in the application of the Malliavin calculus that was employed in [11, 12]. The main idea is to construct a small perturbation gtg_{{}_{t}} of the driving Wiener path such that, at time tt, the variation of the state (ut,𝐩t)(u_{t},\mathbf{p}_{{}_{t}}) induced by this perturbation cancels as much as possible the variation generated by a small perturbation 𝔥×TM\mathfrak{h}\in\mathcal{H}\times TM of the initial condition, thereby achieving an effective matching between the Malliavin derivative and the Fréchet derivative. Consequently, the problem reduces to constructing a smooth control together with bounds on its cost, and to estimating the associated error terms (c.f. (3.1)).

1.3.1 Discussion of control construction and error estimates

For the case driven by mildly degenerate noise, a standard strategy for obtaining asymptotic gradient estimates of the transition semigroup for the base process is the low-high frequency splitting method adopted in [11, Subsection 4.5]. One imposes a control on the low-frequency component and uses finite-time stabilization to eliminate the low-frequency error after a finite time. No control is applied to the high-frequency component, and the corresponding error is allowed to decay due to high-mode dissipation. Combined with error inversion for control design, this ensures bounded control cost and allows the total error to satisfy the required estimates. It is important to stress that the viability of this scheme ultimately hinges on the invertibility of the low-frequency diffusion coefficient.

Unlike [11], which treats only the base process, we must also handle the additional degrees of freedom 𝐱t\mathbf{x}_{t} and vtv_{t} in the extended system. These variables are not directly driven by the noise. Instead, randomness reaches them through the noise acting on utu_{t} and is then transmitted via (1.7)–(1.8). As a result, the degeneracy of the extended system is substantially enhanced compared to the system that only considers the velocity field. A direct consequence is the following. If one naturally includes (𝐱t,vt)(\mathbf{x}_{t},v_{t}) in the low-frequency subsystem by taking Πl(u,𝐱,v)=(ul,𝐱,v)\Pi_{l}(u,\mathbf{x},v)=(u^{l},\mathbf{x},v), then the associated low-frequency diffusion coefficient Q^l:=ΠlQ^\hat{Q}_{l}:=\Pi_{l}\hat{Q} (cf. (3.6)) is necessarily non-invertible. Consequently, the error-inversion mechanism used in [11, Section 4.5] can no longer be applied. A seemingly plausible alternative is turn to place the manifold components (𝐱t,vt)(\mathbf{x}_{t},v_{t}) into the high-frequency subsystem in order to preserve invertibility of the low-frequency diffusion coefficient. However, 𝐱t\mathbf{x}_{t} and vtv_{t} lack the dissipative structure of the high-frequency velocity modes, placing them on the high-frequency side would necessitate imposing additional control on them. This ultimately forces the analysis back into the infinite-dimensional setting, which is precisely one of the difficulties encountered in [26, 12, 28] for highly degenerate noise.

For this reason, and exploiting the finite-dimensional nature of the low-frequency subsystem, we use the decomposition Πl(u,𝐱,v)=(ul,𝐱,v)\Pi_{l}(u,\mathbf{x},v)=(u^{l},\mathbf{x},v) and Πh(u,𝐱,v)=uh\Pi_{h}(u,\mathbf{x},v)=u^{h}. We implement controls only on the low-frequency subsystem and fully exploit high-mode dissipation to obtain the desired estimates. To overcome the obstruction caused by the non-invertibility of Q^l\hat{Q}_{l}, we abandon the traditional error-inversion design and instead introduce a new finite-dimensional Malliavin framework. We construct an invertible partial (finite-dimensional) Malliavin matrix for the low-frequency subsystem and use it to build an explicit smooth control. This provides representations and estimates for the Malliavin derivative, together with corresponding representations and bounds for the error terms. Notably, the invertibility result for the partial Malliavin matrix here is stronger than the non-degeneracy condition in [26, 28, 12], which holds only on a cone. As a result, the control can be directly constructed without the need to use a regularized version of the Malliavin matrix, and the error estimates are simplified accordingly.

More specifically, for a short time T0(<1)T_{0}(<1), we aim to construct a control gt=(gtl,0)g_{{}_{t}}=(g^{l}_{{}_{t}},0) over the interval [0,T0][0,T_{0}] such that the low-frequency part of the corresponding perturbed Malliavin derivative satisfies, as closely as possible, the condition

Πl(𝒟g(uT0,𝐩T0))=:𝒟gl(uT0l,𝐩T0)=Πl(𝒥0,T0𝔥),\displaystyle\Pi_{l}\big(\mathcal{D}^{g}(u_{{}_{T_{0}}},\mathbf{p}_{{}_{T_{0}}})\big)=:\mathcal{D}^{g^{l}}(u^{l}_{{}_{T_{0}}},\mathbf{p}_{{}_{T_{0}}})=\Pi_{l}\big(\mathcal{J}_{0,T_{0}}\mathfrak{h}\big), (1.9)

where the full Jacobian 𝒥s,t\mathcal{J}_{s,t} is the linear map from ×T𝐩M\mathcal{H}\times T_{\mathbf{p}}M to ×T𝐩M\mathcal{H}\times T_{\mathbf{p}}M that satisfies (3.8). Note that if there were no coupling terms between the high and low frequencies, then in the ideal scenario, the Duhamels’s formula would yield

𝒟gl(uT0l,𝐩T0)=0T0Rs,T0lQ^lgsl𝑑s,\displaystyle\mathcal{D}^{g^{l}}(u^{l}_{{}_{T_{0}}},\mathbf{p}_{{}_{T_{0}}})=\int_{0}^{T_{0}}R_{s,T_{0}}^{l}\hat{Q}_{l}g_{{}_{s}}^{l}ds, (1.10)

where the finite-dimensional matrix Rs,tlR_{s,t}^{l} is a linear map from l×T𝐩M\mathcal{H}_{l}\times T_{\mathbf{p}}M to l×T𝐩M\mathcal{H}_{l}\times T_{\mathbf{p}}M that satisfies (3.13). Naturally, our goal is to use a least-squares approach to design a control gslg_{{}_{s}}^{l} such that

0T0Rs,T0lQ^lgsl𝑑s=Πl(𝒥0,T0𝔥).\displaystyle\int_{0}^{T_{0}}R_{s,T_{0}}^{l}\hat{Q}_{l}g_{{}_{s}}^{l}ds=\Pi_{l}\big(\mathcal{J}_{0,T_{0}}\mathfrak{h}\big). (1.11)

To address the fact that Rs,tlR_{s,t}^{l} is not adapted, we introduce its (adapted) inverse matrix Ss,tlS_{s,t}^{l}. After a time τ0(0,1)\tau_{0}\in(0,1), we then define the finite-dimensional partial Malliavin matrix

𝒩T0l=τ0T0Sτ0,slQ^l(Sτ0,slQ^l)𝑑s,\mathcal{N}_{T_{0}}^{l}=\int_{\tau_{0}}^{T_{0}}S_{\tau_{0},s}^{l}\hat{Q}^{l}\big(S_{\tau_{0},s}^{l}\hat{Q}^{l}\big)^{\top}\,ds,

where τ0(0,1)\tau_{0}\in(0,1). The reason for setting the lower limit of integration to τ0\tau_{0} is to fully exploit the dissipation effect of the high-frequency modes in the system, ensuring that the high-frequency part of the error satisfies the required estimates. A detailed explanation of this point will be provided at the end of this subsection. Correspondingly, we employ a piecewise control construction: on [0,τ0][0,\tau_{0}], we set gtl0g^{l}_{{}_{t}}\equiv 0, and only activate the control on [τ0,T0][\tau_{0},T_{0}]. More specifically, once the invertibility of 𝒩T0l\mathcal{N}_{T_{0}}^{l} is established, we can construct on [τ0,T0][\tau_{0},T_{0}] the following smooth low-frequency control gtlg^{l}_{{}_{t}}:

gtl=(Sτ0,tlQ^l)(𝒩T0l)1Sτ0,T0lΠl(𝒥0,T0𝔥)g^{l}_{{}_{t}}=(S_{{}_{\tau_{0},t}}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{\tau_{0},T_{0}}}^{l}\Pi_{l}(\mathcal{J}_{0,T_{0}}\mathfrak{h})

which yields a global control gtlg_{t}^{l} defined on [0,T0][0,T_{0}] and ensures that the required matching relation (1.11) holds. To maintain the flow of the exposition, the proof of the invertibility of 𝒩T0l\mathcal{N}_{T_{0}}^{l} is postponed to the next subsection. Finally, the cost of this non-adapted control can be bounded using classical estimates for the Skorohod integral together with standard techniques.

We now turn to the error estimate, beginning with the low-frequency component. In this work we adopt a frequency-splitting framework and build the Malliavin structure only on a finite-dimensional low-frequency subsystem. Compared with the approach in [26, 28, 12], which is based on the full infinite-dimensional Malliavin matrix, this setting has two consequences. the low-frequency equation inevitably generates high–low frequency coupling terms that depend on the high-frequency Malliavin derivative, and these terms constitute the main new source of the low-frequency error. Second, since we can prove that the partial Malliavin matrix 𝒩T0l\mathcal{N}_{T_{0}}^{l} is invertible, the matching relation (1.11) can be enforced exactly. As a result, no additional error term arises from a least-squares construction as in [26, 28, 12], and many computations become substantially simpler.

For the high-frequency component of the error, we exploit the dissipative structure of the high-frequency modes and decompose it into two terms, namely the purely high-frequency dissipative term and the high–low frequency coupling term. The key point is that the purely dissipative term can be controlled directly by the decay of the high-frequency dynamics, yielding a bound consistent with the target error estimate of the form (3.11). It is important to emphasize that, in order for this dissipative mechanism to be fully effective, the low-frequency control should not remain nonzero on the whole interval [0,T0][0,T_{0}] starting from t=0t=0. Otherwise, when estimating the purely dissipative term one inevitably obtains a factor of the form (1+q(T0))eηV(u0)(1+q(T_{0}))e^{\eta V(u_{0})}, where q(T0)0q(T_{0})\to 0 as T00T_{0}\to 0, which is incompatible with (3.11). To circumvent this difficulty, [2, Section 6] introduced an auxiliary process ztz_{t} and a truncated process wtρw_{t}^{\rho}. In contrast, we avoid the additional auxiliary and truncation procedures by adopting a more direct time-splitting strategy. We first let the system evolve freely on [0,τ0][0,\tau_{0}], that is, we set the control to zero, so that the high-frequency component decays sufficiently fast due to dissipation. We then activate the low-frequency control only on [τ0,T0][\tau_{0},T_{0}]. Consequently, the relevant key term can be compressed into

(N2+q(T~0))eηV(u0),T~0:=T0τ0,\bigl(N_{*}^{-2}+q(\widetilde{T}_{0})\bigr)e^{\eta V(u_{0})},\qquad\widetilde{T}_{0}:=T_{0}-\tau_{0},

which meets the requirement of (3.11).

1.3.2 Discussion of the nondegeneracy of the partial Malliavin matrix

We now address the non-degeneracy of the partial Malliavin matrix 𝒩T0l\mathcal{N}_{T_{0}}^{l}. This property plays a central role in both the construction of the control and the ensuing error estimates. Let us emphasize that 𝒩T0l\mathcal{N}_{T_{0}}^{l} is defined on the finite-dimensional space l×TM\mathcal{H}_{l}\times TM and is a finite-dimensional adapted matrix. In contrast to the high degenerate setting–where one must deal with the full Malliavin matrix \mathcal{M}–we can obtain a stronger conclusion here: 𝒩T0l\mathcal{N}_{T_{0}}^{l} is invertible. Moreover, the non-degeneracy bound in Proposition 3.6 essentially reduces to establishing the probabilistic spectral bound (4.1) for the Malliavin matrix 𝒩T0l\mathcal{N}_{T_{0}}^{l}.

A standard approach to such a probabilistic spectral bound consists of two steps. The first is to establish a Hörmander-type spanning condition, which requires that the Lie brackets of Hörmander type cover all unstable directions of the system. More precisely, one proves the following quantitative lower bound only for those vectors hh that lie in a certain cone: for every N>0N>0, there is a n>0n>0 and a finite set 𝔅n\mathfrak{B}\subset\mathcal{B}_{n} such that for each α>0\alpha>0, we define the cone

𝒦N,α:={h:PNh2>αh2},\mathcal{K}_{N,\alpha}:=\Bigl\{h\in\mathcal{H}:\ \|P_{N}h\|^{2}>\alpha\|h\|^{2}\Bigr\},

where {PN}\{P_{N}\} is a sequence of projection operators onto successively larger spaces. On this cone, the following lower bound holds:

b(u)𝔅h,b(u)2αCh2,hKN,α,\displaystyle\sum_{b(u)\in\mathfrak{B}}\langle h,\,b(u)\rangle^{2}\geq\alpha C\|h\|^{2},\quad\forall\,h\in\mathcal{\mathcal{}}{K}_{N,\alpha}, (1.12)

where C>0C>0 is a constant independent of α\alpha, 0:=span{ek,k𝒵0}\mathcal{B}_{0}:=\text{span}\{e_{k},\,k\in\mathcal{Z}_{0}\}, n:=span{[E,F],[E,ek],E:En1,k𝒵0}\mathcal{B}_{n}:=\text{span}\{[E,F],[E,e_{k}],\,E:E\in\mathcal{B}_{n-1},\,k\in\mathcal{Z}_{0}\} and FF is the drift term. The second step is to differentiate 𝒥s,tek,h\langle\mathcal{J}_{s,t}e_{k},h\rangle in time and derive, on a high-probability event, an ‘implication estimate’: if Th,h\langle\mathcal{M}_{T}h,h\rangle is sufficiently small, then it necessarily forces the projections onto the directions in 𝔅\mathfrak{B} to become small simultaneously, i.e.

Th,h<εh2b(u)𝔅h,b(u)2εqh2.\displaystyle\langle\mathcal{M}_{T}h,h\rangle<\varepsilon\|h\|^{2}\ \Rightarrow\ \sum_{b(u)\in\mathfrak{B}}\langle h,\,b(u)\rangle^{2}\leq\varepsilon^{q}\|h\|^{2}. (1.13)

Combining this upper implication with the Hörmander lower bound (1.12), and choosing ε\varepsilon appropriately, one obtains the probabilistic spectral bound.

Unlike the classical setting where only the velocity field is involved, our extended system additionally includes degrees of freedom (𝐱t,vt)(\mathbf{x}_{t},v_{t}) on the manifold. Consequently, the non-degeneracy argument must also incorporate the vector fields on the tangent bundle TMTM. To establish the analogue (1.14) of the lower bound (1.12) required in the first step, we need to exploit the Lie brackets [F¯,ekγk][\overline{F},e_{k}\gamma_{k}] to generate new directions in both the position and the projection components, where F¯\overline{F} is the manifold vector field defined in (4.4). This leads to the following spanning condition on the tangent bundle:

span{[F¯,ekγk](𝐱,v):k𝒵0}=T𝐱𝕋2×TvS1.\operatorname{span}\{[\overline{F},e_{k}\gamma_{k}](\mathbf{x},v):k\in\mathcal{Z}_{0}\}=T_{\mathbf{x}}\mathbb{T}^{2}\times T_{v}S^{1}.

Building on this, the mildly degenerate noise structure considered in this work allows the above first step to be completed along a substantially simpler route than in the highly degenerate case treated in [26]. More specifically, in [26] one typically has to perform repeated time differentiations in order to successively generate new Lie bracket directions on PN×TMP_{N}\mathcal{H}\times TM for sufficiently large N>0N>0. In contrast, we work with the finite-dimensional partial Malliavin matrix associated with the low-frequency subsystem, so that the Hörmander condition only needs to be verified on l×TM\mathcal{H}_{l}\times TM. On the one hand, the spanning property on l\mathcal{H}_{l} follows directly from the structure of the mildly degenerate noise. On the other hand, the verification on TMTM relies on an explicit and tractable representation of the low-frequency object Sτ0,tlekγkS_{\tau_{0},t}^{l}e_{k}\gamma_{k}. More precisely, since Sτ0,tlS_{\tau_{0},t}^{l} is adapted, we may apply Itô’s formula to Sτ0,tlekγkS_{\tau_{0},t}^{l}e_{k}\gamma_{k} and expand it explicitly. In this expansion, terms involving the bracket [F¯,ekγk][\overline{F},e_{k}\gamma_{k}] appear naturally. Combining this with the above spanning condition on TMTM, we obtain the following spectral-type lower bound:

max{|ekγk,𝔥l|,|ΥLekγk,𝔥l|:k𝒵0}C(N,ν,β)(1βull)𝔥ll.\displaystyle\max\Big\{\big|\langle e_{k}\gamma_{k},\mathfrak{h}^{l}\rangle\big|,\ \big|\langle\Upsilon_{L}e_{k}\gamma_{k},\mathfrak{h}^{l}\rangle\big|:k\in\mathcal{Z}_{0}\Big\}\geq C(N_{*},\nu,\beta)\big(1-\beta\|u^{l}\|_{\mathcal{H}_{l}}\big)\|\mathfrak{h}^{l}\|_{l}. (1.14)

where ΥLekγk\Upsilon_{L}e_{k}\gamma_{k} is defined in (4.3).

In the second step, we need to establish the counterpart, in our present setting, of the general implication (1.13), namely (4.5). This is obtained by differentiating in time the quantity Sτ0,tlekγk,𝔥l\langle S_{\tau_{0},t}^{l}e_{k}\gamma_{k},\mathfrak{h}^{l}\rangle. We then combine (4.5) with the spectral-type lower bound to deduce the desired probabilistic spectral bound.

Finally, we emphasize that this work develops a unified analytical framework: on the low-frequency component, a control mechanism is used to capture the effective action of the stochastic forcing; on a finite-dimensional level, Malliavin calculus is employed to extract the required non-degeneracy; and high-frequency dissipation is leveraged to suppress small-scale instabilities. This framework is broadly applicable to the systematic study of Lagrangian chaos in incompressible fluid systems driven by mildly degenerate noise. In subsequent work, we will formulate more explicit abstract criteria tailored to this class of mildly degenerate noise and apply them to the three-dimensional hyperviscous Navier–Stokes equations as well as to multiphysics coupled fluid models, such as the two-dimensional Boussinesq and magnetohydrodynamics equations. Our aim is to further deepen the understanding of the mechanisms responsible for the emergence of chaos in statistical fluid mechanics.

It is worth noting that, in our recent work on the 2D Boussinesq equation [36], the noise acts only on the temperature component and is highly degenerate; consequently, first-order Lie brackets [F^,ekγk][\hat{F},e_{k}\gamma_{k}] do not generate new directions on the tangent bundle, and higher-order brackets become necessary. Although the underlying principle remains the same–namely, achieving controllability by effectively ‘projecting’ infinite-dimensional directions onto tangent-bundle vector fields–the Lie-bracket computations required both to verify the manifold spanning condition and to derive the relevant upper bounds are technically cumbersome. By contrast, the present framework preserves the same core mechanism while substantially reducing the computational overhead, thereby providing a more streamlined and practical route for extensions to the above classes of equations and beyond.

The paper is organized as follows. Section 2 reviews several basic results from the theory of random dynamical systems and incorporates the refined Furstenberg’s criterion from [26] into a positivity criterion for the top Lyapunov exponent of Lagrangian systems associated with incompressible fluid models driven by mildly degenerate noise. The problem is then reduced to establishing short-time asymptotic gradient estimates for the Lagrangian process and projective process. In Section 3, these gradient estimates for the extended system are reformulated in terms of the construction of controls for the low-frequency subsystem, together with corresponding error estimates, and the main result is proved under the assumption that a partial Malliavin matrix is invertible and satisfies suitable non-degeneracy bounds. Section 4 establishes the required non-degeneracy estimates for the partial Malliavin matrix and proves its invertibility. Finally, the Appendix collects proofs of several auxiliary results used in the paper.

2 Fundamental mathematical framework and Furstenberg criterion

Our objective is to prove that the top Lyapunov exponent is almost everywhere a positive constant. The proof proceeds in two stages: first, within the framework of random dynamical systems (RDS), we employ the Multiplicative ergodic theorem (MET) to establish the almost-everywhere existence and constancy of the top Lyapunov exponent; second, we apply the Furstenberg’s criterion to verify its positivity.

To this end, this section first introduces the necessary notions and results concerning random dynamical systems and the Multiplicative ergodic theorem, then states the classical Furstenberg’s criterion, and finally provides a criterion for the positivity of the top Lyapunov exponent adapted to the noise assumptions of this paper. It should be noted that this part essentially reformulates and synthesizes results from [26], without introducing fundamentally new ideas.

2.1 RDS framework and Multiplicative ergodic theorem

Let (Z,)(Z,\mathcal{B}) be a measurable space, and (Ω,,,(θt)t[0,))(\Omega,\mathcal{F},\mathbb{P},(\theta^{t})_{t\in[0,\infty)}) be a metric dynamical system with time index set [0,)[0,\infty). The mapping

𝒯:[0,)×Ω×ZZ,(t,ω,z)𝒯ωtz\mathcal{T}:[0,\infty)\times\Omega\times Z\to Z,\quad(t,\omega,z)\mapsto\mathcal{T}_{\omega}^{t}z

denotes a measurable random dynamical system covering this metric system. If for every ωΩ\omega\in\Omega, the mapping 𝒯ω:[0,)×ZZ\mathcal{T}_{\omega}:[0,\infty)\times Z\to Z belongs to the space Cu,b([0,)×Z,Z)C_{u,b}([0,\infty)\times Z,Z), then the random dynamical system is called continuous RDS. Here, for metric spaces V,WV,W, the notation Cu,b(V,W)C_{u,b}(V,W) denotes the set of all continuous maps E:VWE:V\to W such that for every bounded set OVO\subseteq V, the restriction E|OE|_{O} is uniformly continuous and the image E(O)E(O) is a bounded subset of WW.

To ensure that the process ztz_{t} as above is Markovian, it is often further required that the continuous random dynamical system 𝒯\mathcal{T} satisfies the usual independent increments assumption.
(H) For all s,t>0s,t>0, we have that 𝒯ωt\mathcal{T}_{\omega}^{t} is independent of 𝒯θtωs\mathcal{T}_{\theta^{t}\omega}^{s}. That is, the σ\sigma-subalgebra σ(𝒯ωt)\sigma(\mathcal{T}_{\omega}^{t})\subset\mathcal{F} generated by the Cu,b(Z,Z)C_{u,b}(Z,Z)-valued random variable ω𝒯ωt\omega\mapsto\mathcal{T}_{\omega}^{t} is independent of the σ\sigma-subalgebra σ(𝒯θtωs)\sigma(\mathcal{T}_{\theta^{t}\omega}^{s}) generated by ω𝒯θtωs\omega\mapsto\mathcal{T}_{\theta^{t}\omega}^{s}.

As can be seen from the regularity results for solutions to the Navier-Stokes equations in [2, Appendix A], we have the following fundamental result.

Proposition 2.1 (RDS for base, Lagrangian and projective processes).

Let 𝒰:[0,)×Ω×,(t,ω,u)𝒰ωt(u)\mathcal{U}:[0,\infty)\times\Omega\times\mathcal{H}\to\mathcal{H},(t,\omega,u)\mapsto\mathcal{U}^{t}_{\omega}(u) denote the mapping sending, for a given t0t\geq 0 and \mathbb{P}-generic ωΩ\omega\in\Omega, a given uu\in\mathcal{H} to the time-tt vector field utu_{t} conditioned on u0=uu_{0}=u. Then 𝒰\mathcal{U} is a continuous RDS on the space Z=Z=\mathcal{H} satisfying condition (H).

Similarly, the random ODE (1.8) defining the auxiliary process 𝐱t=𝐱ω,u0t𝐱0\mathbf{x}_{t}=\mathbf{x}^{t}_{\omega,u_{0}}\mathbf{x}_{0} and vt=vω,u0,𝐱0tv0v_{t}=v^{t}_{\omega,u_{0},\mathbf{x}_{0}}v_{0} are well posed, and we conclude as before that the corresponding mapping 𝔏:[0,)×Ω××T2×T2\mathfrak{L}:[0,\infty)\times\Omega\times\mathcal{H}\times T^{2}\to\mathcal{H}\times T^{2} and 𝔓:[0,)×Ω××T2×S1×T2×S1\mathfrak{P}:[0,\infty)\times\Omega\times\mathcal{H}\times T^{2}\times S^{1}\to\mathcal{H}\times T^{2}\times S^{1} for the Lagrangian process (ut,𝐱t)(u_{t},\mathbf{x}_{t}) and projective process (ut,𝐱t,vt)(u_{t},\mathbf{x}_{t},v_{t}) are continuous RDS satisfying (H) on the space Z=×T2Z=\mathcal{H}\times T^{2} and Z=×T2×S1Z=\mathcal{H}\times T^{2}\times S^{1} respectively.

Now, the process (ut,𝐩t)(u_{t},\mathbf{p}_{t}) is Markovian. We write Pt:×M𝒫(×M)P_{t}:\mathcal{H}\times M\to\mathcal{P}(\mathcal{H}\times M) to denote the time tt transition kernel, where 𝒫(X)\mathcal{P}(X) denotes the space of probability measures on XX. We also let PtP_{t} act adjointly on observables φ:×M\varphi:\mathcal{H}\times M\to\mathbb{R} by pulling back:

Ptφ(u0,𝐩0)=φ(ut,𝐩t)Pt(u0,𝐩0;dut,d𝐩t).P_{t}\varphi(u_{0},\mathbf{p}_{0})=\int\varphi(u_{t},\mathbf{p}_{t})\,P_{t}(u_{0},\mathbf{p}_{0};\mathrm{d}u_{t},\mathrm{d}\mathbf{p}_{t}).

Next, we introduce linear cocycles over random dynamical systems and the classical Multiplicative ergodic theorem.

Definition 2.2.

Let 𝒯\mathcal{T} be a continuous RDS as above, and let (τt)(\tau^{t}) be its associated skew product. A dd-dimensional linear cocycle 𝒜\mathcal{A} over the base RDS 𝒯\mathcal{T} is a mapping 𝒜:ΩCu,b([0,)×Z,Md×d())\mathcal{A}:\Omega\to C_{u,b}([0,\infty)\times Z,\,M_{d\times d}(\mathbb{R})) with the following properties:

  • (i)

    The evaluation mapping Ω×[0,)×ZMd×d()\Omega\times[0,\infty)\times Z\to M_{d\times d}(\mathbb{R}) sending (ω,t,z)𝒜w,zt(\omega,t,z)\mapsto\mathcal{A}_{w,z}^{t} is Bor([0,))Bor(Z)\mathcal{F}\otimes\text{Bor}([0,\infty))\otimes\text{Bor}(Z)-measurable.

  • (ii)

    The mapping 𝒜\mathcal{A} satisfies the cocycle property: for any zZ,wΩz\in Z,w\in\Omega we have 𝒜w,z0=Idd\mathcal{A}_{w,z}^{0}=\mathrm{Id}_{\mathbb{R}^{d}}, the d×dd\times d identity matrix, and for s,t0s,t\geq 0 we have

    𝒜w,zs+t=𝒜τt(w,z)s𝒜w,zt.\mathcal{A}_{w,z}^{s+t}=\mathcal{A}_{\tau^{t}(w,z)}^{s}\circ\mathcal{A}_{w,z}^{t}.
Remark 2.3.

The main focus of this paper is the derivative cocycle 𝒜\mathcal{A} of Lagrangian particle trajectories xtx_{t}, where

𝒜:[0,)×Ω××𝕋2M2×2(),𝒜ω,u,𝐱t=Dx𝐱ω,ut,\displaystyle\mathcal{A}:[0,\infty)\times\Omega\times\mathcal{H}\times\mathbb{T}^{2}\rightarrow M_{2\times 2}(\mathbb{R}),\;\;\mathcal{A}_{\omega,u,\mathbf{x}}^{t}=D_{x}\mathbf{x}_{\scriptstyle{\omega,u}}^{t}\,, (2.1)

which corresponds to taking 𝒯=𝔏\mathcal{T}=\mathfrak{L} and Z=×𝕋2Z=\mathcal{H}\times\mathbb{T}^{2}.

The following is a version of the Multiplicative ergodic theorem in [29].

Theorem 2.4.

Let 𝒯\mathcal{T} be a continuous RDS as above satisfying condition (H). Let μ𝒫(Z)\mu\in\mathcal{P}(Z) be an ergodic stationary measure for 𝒯\mathcal{T}, and let 𝒜\mathcal{A} be a linear cocycle over 𝒯\mathcal{T} satisfying the following integrability condition:

𝔼Z(log+|𝒜w,zt|+log+|(𝒜w,zt)1|)𝑑μ(z)<,\mathbb{E}\int_{Z}\left(\log^{+}|\mathcal{A}_{w,z}^{t}|+\log^{+}|(\mathcal{A}_{w,z}^{t})^{-1}|\right)d\mu(z)<\infty, (2.2)

where log+(x):=max{0,log(x)}\log^{+}(x):=\max\{0,\log(x)\} for x>0x>0 and 𝔼\mathbb{E} is the expectation with respect to \mathbb{P}. Then there are r{1,,d}r\in\{1,\ldots,d\} deterministic real numbers

λr<<λ1\lambda_{r}<\cdots<\lambda_{1}

and for ×μ\mathbb{P}\times\mu-a.e. (w,z)(w,z), a flag of subspaces

{0}=:Fr+1Fr(w,z)F2(w,z)F1:=d\{0\}=:F_{r+1}\subsetneq F_{r}(w,z)\subsetneq\cdots\subsetneq F_{2}(w,z)\subsetneq F_{1}:=\mathbb{R}^{d}

such that

λi=limt1tlog|𝒜w,ztv|,vFi(w,z)Fi+1(w,z).\lambda_{i}=\lim_{t\to\infty}\frac{1}{t}\log|\mathcal{A}_{w,z}^{t}v|,\quad v\in F_{i}(w,z)\setminus F_{i+1}(w,z).

Moreover, for any i{1,,r}i\in\{1,\ldots,r\}, the mapping (w,z)Fi(w,z)(w,z)\mapsto F_{i}(w,z) is measurable and dimFi(w,z)\dim F_{i}(w,z) is constant for ×μ\mathbb{P}\times\mu-a.e. (w,z)(w,z).

Here, the numbers λi\lambda_{i} are called Lyapunov exponents and mi:=dimFi(ω,z)dimFi+1(ω,z)m_{i}:=\dim F_{i}(\omega,z)-\dim F_{i+1}(\omega,z) are their multiplicities. In particular, λ1\lambda_{1} is precisely the top Lyapunov exponent of interest in this paper.

If the Lagrangian process admits a unique stationary measure μ1\mu^{1} (in which case ×μ1\mathbb{P}\times\mu^{1} is an ergodic invariant measure of the random dynamical system 𝔏\mathfrak{L}), then under the assumed integrability condition, the Multiplicative ergodic theorem implies that the top Lyapunov exponent of this linear cocycle exists and is almost surely constant with respect to ×μ1\mathbb{P}\times\mu^{1}. This completes the first part of the proof.

2.2 Furstenberg criterion and results

We review the Furstenberg’s criterion to decide positivity of the top Lyapunov exponent for conservative systems, and then provide conditions specialized to the noise assumptions of this work. We first present a standard formulation of the Furstenberg’s criterion. This formulation is directly adapted from [30] and is further developed in subsequent work published in [2].

Theorem 2.5.

If λ1=λr\lambda_{1}=\lambda_{r}, then for each zZz\in Z there is a Borel measure νz\nu_{z} on S1S^{1} such that

  1. (i)

    the assignment zνzz\mapsto\nu_{z} is measurable, and

  2. (ii)

    for each t[0,)t\in[0,\infty) and (×μ)(\mathbb{P}\times\mu)-almost all (w,z)Ω×Z(w,z)\in\Omega\times Z (perhaps depending on tt), we have that

    (𝒜w,zt)νz=ν𝒯w,zt.(\mathcal{A}_{w,z}^{t})_{*}\nu_{z}=\nu_{\mathcal{T}_{w,z}^{t}}. (2.3)

The classical Furstenberg criterion asserts that the top Lyapunov exponent is positive once one rules out every projective family of measures {νz}zsuppμ\{\nu_{z}\}_{z\in\operatorname{supp}\mu} satisfying the invariance relation (2.3). In general, however, measurability of such a family alone does not suffice to exclude these invariant structures. To obtain an effective exclusion, existing arguments typically impose stronger regularity to ensure a weakly continuous selection of {νz}\{\nu_{z}\}: [31, Section 4] and [32, Proposition 2.10] assume that the stationary measure μ\mu is mixing in total variation, while related work in [2] requires the Lagrangian process and its projective process to be strong Feller and the associated Markov semigroup to be C0C^{0}-continuous.

For Navier–Stokes Lagrangian trajectories driven by mildly degenerate noise, neither set of assumptions is available: mixing holds only with respect to the weaker dual Lipschitz metric (hence not in total variation), and the strong Feller property typically fails as well. We therefore adopt the refined Furstenberg’s criterion of [26, Proposition 3.4] and derive below a criterion tailored to the noise setting of this work.

To formulate sufficient conditions ensuring the positivity of the top Lyapunov exponent for the linear cocycle (2.1) over the random dynamical system 𝔏\mathfrak{L}, we impose the following three assumptions.

  1. (A1)

    Integrability. The linear cocycle 𝒜\mathcal{A} satisfies

    𝔼Z(log+|𝒜w,zt|+log+|(𝒜w,zt)1|)𝑑μ(z)<.\mathbb{E}\int_{Z}\left(\log^{+}|\mathcal{A}_{w,z}^{t}|+\log^{+}|(\mathcal{A}_{w,z}^{t})^{-1}|\right)d\mu(z)<\infty.
  2. (A2)

    Asymptotic strong Feller property of the extended system. For system (1.2) and (1.7)-(1.8), there exists an N=N(0,ν)N_{*}=N_{*}(\mathcal{E}_{0},\nu) such that if 𝒵0={k2, 0<|k|N}\mathcal{Z}_{0}=\{k\in\mathbb{Z}^{2},\,0<|k|\leq N_{*}\}, then the Markov semigroups associated with the Lagerange process (ut,xt)(u_{t},x_{t}) and projective process (ut,xt,vt)(u_{t},x_{t},v_{t}) are all asymptotic strong Feller in ×𝕋2\mathcal{H}\times\mathbb{T}^{2} and ×𝕋2×S1\mathcal{H}\times\mathbb{T}^{2}\times S^{1} respectively.

  3. (A3)

    Approximate controllability of the extended system. There exist z,zsuppμz,z^{\prime}\in\operatorname{supp}\mu such that zz^{\prime} belongs to the support of the measure Pt0(z,)P_{t_{0}}(z,\cdot) for some t0>0t_{0}>0, where μ\mu is the unique stationary measure for base process, and we have each of the following for all t>0t>0.

    1. (a)

      For any 𝐱𝕋2,ε,M>0\mathbf{x}\in\mathbb{T}^{2},\,\varepsilon,M>0,

      ((ut,𝐱t,At)Bε(z)×Bε(𝐱)×{ASL2():|A|>M}|(u0,𝐱0,A0)=(z,𝐱,Id))>0.\mathbb{P}\left((u_{t},\mathbf{x}_{t},A_{t})\in B_{\varepsilon}(z^{\prime})\times B_{\varepsilon}(\mathbf{x}^{\prime})\times\{A\in\mathrm{SL}_{2}(\mathbb{R}):|A|>M\}|(u_{0},\mathbf{x}_{0},A_{0})=(z,\mathbf{x},\mathrm{Id})\right)>0.
    2. (b)

      For any (𝐱,v)𝕋2×S1(\mathbf{x},v)\in\mathbb{T}^{2}\times S^{1}, open set VS1V\subset S^{1}, and ε>0\varepsilon>0,

      ((ut,𝐱t,vt)Bε(z)×Bε(𝐱)×V|(u0,𝐱0,v0)=(z,𝐱,v))>0.\mathbb{P}\left((u_{t},\mathbf{x}_{t},v_{t})\in B_{\varepsilon}(z^{\prime})\times B_{\varepsilon}(\mathbf{x}^{\prime})\times V|(u_{0},\mathbf{x}_{0},v_{0})=(z,\mathbf{x},v)\right)>0.
Theorem 2.6.

Suppose that Assumptions (A1)-(A3) are satisfied. Then there exists a deterministic constant λ+>0\lambda_{+}>0 such that

λ1=λ+=limt1tlog|𝒜u0,xt|>0forμ1×a.e.(u0,x,ω),\lambda_{1}=\lambda_{+}=\lim_{t\to\infty}\frac{1}{t}\log|\mathcal{A}^{t}_{{}_{u_{0},x}}|>0\,\,\,\,\text{for}\,\,\,\,\mu^{1}\times\mathbb{P}-a.e.\,(u_{0},x,\omega), (2.4)

where μ1\mu^{1} is the unique stationary measure for the Lagrangian process.

Remark 2.7.

By the asymptotic strong Feller theory of Mattingly–Hairer and collaborators, assumptions (A2)(A3) already imply the uniqueness of the stationary measure for the extended system. On the other hand, the compactness of the manifold MM together with a super-Lyapunov condition (c.f. Corollary A.3) yields the existence of an invariant measure. See [26, Corollary 2.1] for details.

Moreover, we also observe that the Lagrangian system induced by the incompressible stochastic Navier–Stokes equation is conservative. Hence λ:=iλi=0,\lambda_{\scriptscriptstyle\sum}:=\sum_{i}\lambda_{i}=0, and therefore

dλ1λ=0λ10.d\,\lambda_{1}\geq\lambda_{\scriptscriptstyle\sum}=0\quad\Rightarrow\quad\lambda_{1}\geq 0.

Thus, to obtain positivity of the top Lyapunov exponent, it suffices to rule out λ1=0\lambda_{1}=0, equivalently the spectral degeneracy λ1=λr\lambda_{1}=\lambda_{r}. This is precisely the obstruction addressed by the Furstenberg criterion. We verify an approximate controllability property for the extended system, which excludes the continuous invariant structures in the refined Furstenberg’s criterion of [26, Theorems 3.5-3.6]; the required continuity of these invariant structures is ensured by the asymptotic strong Feller property of the extended system. For further details, see [26, Section 3] and [2, Section 4]. This completes the second part of the proof.

Proof of the Remark 1.2.

By utilizing the existence of a unique stationary measure for the projective process (c.f. Assumption (A2)) and the random multiplicative Ergodic Theorem [33, Theorem III.1.2], we can complete the proof. ∎

Concerning the verification of assumptions (A1)(A3): assumption (A1) follows directly from regularity estimates for the solution; see [2, Appendix A]. Assumption (A3) is also established in [2, Scetion 7], where it is obtained by constructing the required smooth controls via shear and cellular flows, and then combining a stability argument with the positivity of Wiener measure. Hence, the remaining key task is to prove the asymptotic strong Feller property for the extended system under mildly degenerate noise, i.e., (A2); this is the main novelty of the present work, and the proof is deferred to the next two sections.

Although the results of [26] cover our setting, our approach avoids Malliavin analysis on the full phase space. By working with a partial Malliavin matrix we obtain a stronger statement, and we also bypass additional, intricate Lie bracket computations, which simplifies the calculations at several points.

3 Asymptotic gradient estimate

In this section we provide a sufficient condition for the asymptotic strong Feller property of the extended system, namely a long-time asymptotic gradient estimate (3.2). Owing to the compactness of the manifold MM and the super-Lyapunov property (c.f. Corollary A.3), it suffices to establish the corresponding short-time asymptotic gradient estimate (3.1), from which (3.2) follows.

Theorem 3.1 (Short-time asymptotic gradient estimate).

Fix T0(0,1)T_{0}\in(0,1). Consider either the base, Lagrange, or projective process. There exists an N=N(0,ν)N_{*}=N_{*}(\mathcal{E}_{0},\nu) such that if 𝒵0={k2, 0<|k|N}\mathcal{Z}_{0}=\{k\in\mathbb{Z}^{2},\,0<|k|\leq N_{*}\}, then for all η>0\eta>0, 𝐩0M\mathbf{p}_{{}_{0}}\in M, γ(0,1)\gamma\in(0,1), there exists C(η,γ,N,𝐩0)>0C(\eta,\gamma,N_{*},\mathbf{p}_{{}_{0}})>0, bounded locally uniformly in 𝐩0\mathbf{p}_{{}_{0}}, such that for all Fréchet differentiable observables φ:×M\varphi:\mathcal{H}\times M\to\mathbb{R} with φ\|\varphi\|_{\infty} and φ\|\nabla\varphi\|_{\infty} finite,

𝒫T0φ(u,𝐩)×T𝐩MCexp(ηV(u0))(PT0|φ|2(u0,𝐩0)+γPT0φ×T𝐩M2(u0,𝐩0)),\displaystyle\|\nabla\mathcal{P}_{{}_{T_{0}}}\varphi(u,\mathbf{p})\|_{\mathcal{H}\times T_{\mathbf{p}}M}\leq C\exp{(\eta V(u_{0}))}\bigg(\sqrt{P_{{}_{T_{0}}}|\varphi|^{2}(u_{0},\mathbf{p}_{{}_{0}})}+\gamma\sqrt{P_{{}_{T_{0}}}\|\nabla\varphi\|^{2}_{\mathcal{H}\times T_{\mathbf{p}}M}(u_{0},\mathbf{p}_{{}_{0}})}\bigg), (3.1)

where V(u)V(u) denotes the super-Lyapunov function introduced in Definition A.2, it is associated with the process utu_{t} and satisfies the super-Lyapunov property stated in Corollary A.3.

Remark 3.2.

From [26, Corollary 2.10] we know that when the manifold MM is compact, utilizing the super-Lyapunov property (c.f. Corollary A.3) of an super-Lyapunov function enables us to translate the short-time asymptotic gradient estimate into the following long-time asymptotic gradient estimate:

PtφηVCφηV+γt/T0φηV,\|\nabla P_{t}\varphi\|_{\eta V}\leq C\|\varphi\|_{\eta V}+\gamma^{\lceil t/T_{0}\rceil}\|\nabla\varphi\|_{\eta V}, (3.2)

for any t1t\geq 1, where

φηV:=supuH05exp(ηV(u))|φ(u,𝐩)|,φηV:=supuH05exp(ηV(u))|φ(u,𝐩)|×T𝐩M,\|\varphi\|_{\eta V}:=\mathop{\rm sup}\limits_{u\in H_{0}^{5}}\exp(-\eta V(u))|\varphi(u,\mathbf{p})|,\;\;\|\nabla\varphi\|_{\eta V}:=\mathop{\rm sup}\limits_{u\in H_{0}^{5}}\exp(-\eta V(u))|\nabla\varphi(u,\mathbf{p})|_{\mathcal{H}\times T_{\mathbf{p}}M},

and C=C(γ,η,N)C=C(\gamma,\eta,N_{*}) is independent of tt and 𝐩0\mathbf{p}_{0}.

3.1 Malliavin calculus and proof strategy

The extended system we consider is as follows:

u˙t\displaystyle\dot{u}_{t} =utut+νΔut+k𝒵0qkΔ1ek(x)dWtk,\displaystyle=-u_{t}\cdot\nabla u_{t}+\nu\Delta u_{t}+\sum_{k\in\mathcal{Z}_{0}}q_{k}\nabla^{\perp}\Delta^{-1}e_{k}(x)\,dW^{k}_{t}, (3.3)
x˙t\displaystyle\dot{x}_{t} =ut(xt),\displaystyle=u_{t}(x_{t}), (3.4)
v˙t\displaystyle\dot{v}_{t} =ΠvtDut(xt)vt.\displaystyle=\Pi_{v_{t}}Du_{t}(x_{t})v_{t}. (3.5)

Next, we reformulate the above system as the following abstract stochastic evolution equation on ×𝕋2×S1\mathcal{H}\times\mathbb{T}^{2}\times S^{1}:

t(ut,xt,vt)=F^(ut,xt,vt)A^(ut,xt,vt)+Q^W˙t,\displaystyle\partial_{t}(u_{t},x_{t},v_{t})=\hat{F}(u_{t},x_{t},v_{t})-\hat{A}(u_{t},x_{t},v_{t})+\hat{Q}\dot{W}_{t}, (3.6)

where F^\hat{F}, A^\hat{A} and Q^W˙t\hat{Q}\dot{W}_{t} are given by

F^(u,x,v)=(B(u,u)u(x)ΠvDu(x)v),A^(u,x,v)=(νΔu00),Q^W˙=(k𝒵0qkγkek(x)dWtk00)\hat{F}(u,x,v)=\begin{pmatrix}-B(u,u)\\ u(x)\\ \Pi_{v}Du(x)v\end{pmatrix},\quad\hat{A}(u,x,v)=\begin{pmatrix}-\nu\Delta u\\ 0\\ 0\end{pmatrix},\quad\hat{Q}\dot{W}=\begin{pmatrix}\sum_{k\in\mathcal{Z}_{0}}q_{k}\gamma_{k}e_{k}(x)\,dW^{k}_{t}\\ 0\\ 0\end{pmatrix}

with B(ut,ut):=ututB(u_{t},u_{t}):=u_{t}\cdot\nabla u_{t}.

Before presenting the proof idea of the asymptotic gradient estimates for the extended system, we briefly review the Malliavin calculus preliminaries needed for the subsequent derivations.

Consider the map Yt:C([0,t];m)××M×MY_{t}:C([0,t];\mathbb{R}^{m})\times\mathcal{H}\times M\to\mathcal{H}\times M such that (ut,𝐩t)=Yt(W,u0,𝐩0)(u_{t},\mathbf{p}_{t})=Y_{t}(W,u_{0},\mathbf{p}_{0}) for initial data u0,𝐩0u_{0},\,\mathbf{p}_{0} and noise realization WW, where m:=|𝒵0|m:=|\mathcal{Z}_{0}|. Take a direction gL2([0,T0];m)g\in L^{2}([0,T_{0}];\mathbb{R}^{m}), and let

G(t)=0tgs𝑑s.G(t)=\int_{0}^{t}g_{{}_{s}}\,ds.

For an ×M\mathcal{H}\times M-valued random variable (ut,𝐩t)(u_{t},\mathbf{p}_{t}), its Malliavin derivative in the direction gg is defined as

𝒟g(ut,𝐩t)=limε0Yt(W+εG,u0,𝐩0)Yt(W,u0,𝐩0)ε,\mathcal{D}^{g}(u_{t},\mathbf{p}_{t})=\lim_{\varepsilon\to 0}\frac{Y_{t}(W+\varepsilon G,u_{0},\mathbf{p}_{0})-Y_{t}(W,u_{0},\mathbf{p}_{0})}{\varepsilon}\,,

where the limit holds almost surely with respect to the Wiener measure. Note that we allow gg to be random and possibly nonadapted to the filtration generated by the increments of WW. The Fréchet differentiability of YtY_{t} further yields

𝒟g(ut,𝐩t)=0T0𝒟s(ut,𝐩t)gs𝑑s,\displaystyle\mathcal{D}^{g}(u_{t},\mathbf{p}_{t})=\int_{0}^{T_{0}}\mathcal{D}_{s}(u_{t},\mathbf{p}_{t})g_{{}_{s}}\,ds, (3.7)

where 𝒟s(ut,𝐩t)\mathcal{D}_{s}(u_{t},\mathbf{p}_{t}) denotes the Malliavin derivative of (ut,𝐩t)(u_{t},\mathbf{p}_{t}) at time ss. We also write 𝒥s,t\mathcal{J}_{s,t} (sts\leq t) for the derivative flow from ss to tt: for any 𝔥×T𝐩sM\mathfrak{h}\in\mathcal{H}\times T_{\mathbf{p}_{s}}M, 𝒥s,t𝔥\mathcal{J}_{s,t}\mathfrak{h} is the solution of

t𝒥s,t𝔥=DF^(ut,𝐩t)𝒥s,t𝔥A^𝒥s,t𝔥,𝒥s,s𝔥=𝔥.\displaystyle\partial_{t}\mathcal{J}_{s,t}\mathfrak{h}=D\hat{F}(u_{t},\mathbf{p}_{t})\mathcal{J}_{s,t}\mathfrak{h}-\hat{A}\mathcal{J}_{s,t}\mathfrak{h},\quad\mathcal{J}_{s,s}\mathfrak{h}=\mathfrak{h}. (3.8)

By the Duhamel’s formula, we obtain

𝒟g(ut,𝐩t)=0T0𝒥s,tQ^gs𝑑s.\mathcal{D}^{g}(u_{t},\mathbf{p}_{t})=\int_{0}^{T_{0}}\mathcal{J}_{s,t}\hat{Q}g_{{}_{s}}\,ds.

Combining this with (3.7), we arrive at the following observation

𝒟s(ut,𝐩t)={𝒥s,tQ^,s<t,0,s>t.\displaystyle\mathcal{D}_{s}(u_{t},\mathbf{p}_{t})=\begin{cases}\mathcal{J}_{s,t}\hat{Q},&s<t,\\ 0,&s>t.\end{cases} (3.9)

We now outline the proof idea for the asymptotic gradient estimates.

For 𝔥×T𝐩M\mathfrak{h}\in\mathcal{H}\times T_{\mathbf{p}}M, we use a bracket to denote the inner product with respect to ×T𝐩M\mathcal{H}\times T_{\mathbf{p}}M, that is

(φ𝐩),(ψ𝐪):=g^(𝐩,𝐪)+φ,ψ,\left\langle{\begin{pmatrix}\varphi\\ \mathbf{p}\end{pmatrix}},{\begin{pmatrix}\psi\\ \mathbf{q}\end{pmatrix}}\right\rangle:=\hat{g}(\mathbf{p},\mathbf{q})+\left\langle{\varphi},{\psi}\right\rangle_{\mathcal{H}},

then 𝔥:=𝔥×T𝐩M=𝔥,𝔥\|\mathfrak{h}\|:=\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}}M}=\sqrt{\left\langle{\mathfrak{h}},\mathfrak{h}\right\rangle}. Now let 𝔥×T𝐩M\mathfrak{h}\in\mathcal{H}\times T_{\mathbf{p}}M with 𝔥=1\|\mathfrak{h}\|=1 arbitrary. Set ρt=𝒥0,t𝔥𝒟g\rho_{{}_{t}}=\mathcal{J}_{0,t}\mathfrak{h}-\mathcal{D}^{g}, we then have the fundamental approximate integration by parts computation,

PT0φ(u0,𝐩0),𝔥\displaystyle\left\langle\nabla P_{{}_{T_{0}}}\varphi(u_{0},\mathbf{p}_{{}_{0}}),\mathfrak{h}\right\rangle
=𝔼(u0,𝐩0)((φ)(ut,𝐩t)𝒥0,T0𝔥)\displaystyle=\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}((\nabla\varphi)(u_{t},\mathbf{p}_{t})\mathcal{J}_{0,T_{0}}\mathfrak{h})
=𝔼(u0,𝐩0)(𝒟gφ(ut,𝐩t))+𝔼(u0,𝐩0)((φ)(ut,𝐩t)ρT0)\displaystyle=\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}(\mathcal{D}^{g}\varphi(u_{t},\mathbf{p}_{t}))+\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}((\nabla\varphi)(u_{t},\mathbf{p}_{t})\rho_{{}_{T_{0}}})
=𝔼(u0,𝐩0)(φ(uT0,𝐩T0)0T0gs𝑑Ws)+𝔼(u0,𝐩0)((φ)(ut,𝐩t)ρT0)\displaystyle=\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}(\varphi(u_{{}_{T_{0}}},\mathbf{p}_{{}_{T_{0}}})\int_{0}^{T_{0}}g_{{}_{s}}dW_{s})+\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}((\nabla\varphi)(u_{t},\mathbf{p}_{t})\rho_{{}_{T_{0}}})
(𝔼(u0,𝐩0)(0T0gs𝑑Ws)2)1/2PT0|φ|2(u0,𝐩0)+(𝔼(u0,𝐩0)ρT02)1/2PT0φ×T𝐩M2(u0,𝐩0).\displaystyle\leq\left(\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}\left(\int_{0}^{T_{0}}g_{{}_{s}}dW_{s}\right)^{2}\right)^{1/2}\sqrt{P_{{}_{T_{0}}}|\varphi|^{2}(u_{0},\mathbf{p}_{{}_{0}})}+\left(\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}\|\rho_{{}_{T_{0}}}\|_{\mathcal{H}}^{2}\right)^{1/2}\sqrt{P_{{}_{T_{0}}}\|\nabla\varphi\|^{2}_{\mathcal{H}\times T_{\mathbf{p}}M}(u_{0},\mathbf{p}_{{}_{0}})}. (3.10)

The core idea of the proof is to construct an infinitesimal perturbation gtg_{t} of the Wiener process path, such that its induced effect on the system state (ut,𝐩t)(u_{t},\mathbf{p}_{{}_{t}}) compensates for, as much as possible, the effect caused by an infinitesimal perturbation 𝔥\mathfrak{h} of the initial condition at time tt. Subsequently, our focus will be on identifying a suitable control gtg_{{}_{t}} such that for any γ(0,1)\gamma\in(0,1), the associated control cost and the resulting error satisfy

sup𝔥×T𝐩M=1𝔼(u0,𝐩0)ρT02γexp(ηV(u0))\sup_{\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}}M}=1}\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}\|\rho_{{}_{T_{0}}}\|_{\mathcal{H}}^{2}\leq\gamma\exp(\eta V(u_{0})) (3.11)

and

sup𝔥×T𝐩M=1𝔼(u0,𝐩0)|0T0gs𝑑Ws|2Cexp(ηV(u0)).\sup_{\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}}M}=1}\mathbb{E}_{(u_{0},\mathbf{p}_{{}_{0}})}\left|\int_{0}^{T_{0}}g_{{}_{s}}dW_{s}\right|^{2}\leq C\exp(\eta V(u_{0})). (3.12)

3.2 Splitting of extended system

Here, exploiting the mild degeneracy of the noise, we implement a natural splitting of the above extended system into high and low frequencies. This splitting will subsequently allow us to construct controls within the finite-dimensional low-frequency subsystem, while fully exploiting the inherent dissipative effects present in the high-frequency component. Here, 𝒵0:={k2, 0<|k|N}\mathcal{Z}_{0}:=\{k\in\mathbb{Z}^{2},\,0<|k|\leq N_{*}\} is the set of low (frequency) modes. Let Πl:\Pi_{l}:\mathcal{H}\to\mathcal{H} denote the corresponding orthogonal projection onto the ‘low modes’ belonging to 𝒵0\mathcal{Z}_{0} and let Πh=IΠl\Pi_{h}=I-\Pi_{l} be the complementary projection onto the ‘high modes’ belonging to 2𝒵0\mathbb{Z}^{2}\setminus\mathcal{Z}_{0}. Let l(Rang(Q))\mathcal{H}_{l}(\supseteq Rang(Q)) and h\mathcal{H}_{h} denote the ranges of Πl\Pi_{l} and Πh\Pi_{h}, respectively, so that we have the orthogonal decomposition

=lh.\mathcal{H}=\mathcal{H}_{l}\oplus\mathcal{H}_{h}.

Given (u,x,v)×M(u,x,v)\in\mathcal{H}\times M, we will extend the definition of Πl\Pi_{l} and Πh\Pi_{h} to ×M\mathcal{H}\times M so that MM is included with the low modes by

(u,x,v)l=Πl(u,x,v)=(Πlu,x,v)=(ul,x,v)and(u,x,v)h=Πh(u,x,v)=Πhu=uh.(u,x,v)^{l}=\Pi_{l}(u,x,v)=(\Pi_{l}u,x,v)=(u^{l},x,v)\quad\text{and}\quad(u,x,v)^{h}=\Pi_{h}(u,x,v)=\Pi_{h}u=u^{h}.

Naturally this defines low and high processes (u,x,v)l(u,x,v)^{l} and (u,x,v)h(u,x,v)^{h}, which satisfy (note of course they are coupled)

t(ut,xt,vt)l\displaystyle\partial_{t}(u_{t},x_{t},v_{t})^{l} =F^l(ut,xt,vt)A^l(ut,xt,vt)l+Q^lW˙tl,\displaystyle=\hat{F}_{l}(u_{t},x_{t},v_{t})-\hat{A}_{l}(u_{t},x_{t},v_{t})^{l}+\hat{Q}_{l}\dot{W}_{t}^{l},
t(ut,xt,vt)h\displaystyle\partial_{t}(u_{t},x_{t},v_{t})^{h} =F^h(ut,xt,vt)A^h(ut,xt,vt)h+Q^hW˙th,\displaystyle=\hat{F}_{h}(u_{t},x_{t},v_{t})-\hat{A}_{h}(u_{t},x_{t},v_{t})^{h}+\hat{Q}_{h}\dot{W}_{t}^{h},

where

F^l(ut,xt,vt)=ΠlF^(ut,xt,vt),F^h(ut,xt,vt)=ΠhF^(ut,xt,vt),A^l(ut,xt,vt)=ΠlA^(ut,xt,vt),\hat{F}_{l}(u_{t},x_{t},v_{t})=\Pi_{l}\hat{F}(u_{t},x_{t},v_{t}),\quad\hat{F}_{h}(u_{t},x_{t},v_{t})=\Pi_{h}\hat{F}(u_{t},x_{t},v_{t}),\quad\hat{A}_{l}(u_{t},x_{t},v_{t})=\Pi_{l}\hat{A}(u_{t},x_{t},v_{t}),
A^h(ut,xt,vt)=ΠhA^(ut,xt,vt),Q^l=ΠlQ^,Q^h=ΠhQ^.\hat{A}_{h}(u_{t},x_{t},v_{t})=\Pi_{h}\hat{A}(u_{t},x_{t},v_{t}),\quad\hat{Q}_{l}=\Pi_{l}\hat{Q},\quad\hat{Q}_{h}=\Pi_{h}\hat{Q}.

Recall that for any 𝔥×TM\mathfrak{h}\in\mathcal{H}\times TM, we denote its norm by 𝔥:=𝔥×TM\|\mathfrak{h}\|:=\|\mathfrak{h}\|_{\mathcal{H}\times TM}. Similarly, for any 𝔥ll×TM\mathfrak{h}^{l}\in\mathcal{H}_{l}\times TM and 𝔥hh\mathfrak{h}^{h}\in\mathcal{H}_{h}, we define 𝔥ll:=𝔥ll×TM\|\mathfrak{h}^{l}\|_{l}:=\|\mathfrak{h}^{l}\|_{\mathcal{H}_{l}\times TM} and 𝔥hh:=𝔥hh\|\mathfrak{h}^{h}\|_{h}:=\|\mathfrak{h}^{h}\|_{\mathcal{H}_{h}}. In addition, we use ,l\langle\cdot,\cdot\rangle_{l} to denote the inner product on l×TM\mathcal{H}_{l}\times TM, and ,h\langle\cdot,\cdot\rangle_{h} to denote the inner product on h\mathcal{H}_{h}.

Remark 3.3.

To fully exploit the dissipative effect of the high modes, we incorporate the manifold component (xt,vt)(x_{t},v_{t}) into the low-frequency subsystem, utilizing its finite-dimensional structure to design the control strategy and thereby effectively overcoming the non-invertibility of Q^l\hat{Q}_{l}. This design achieves a logical separation and coordination between control and dissipation, ensuring that controllability and stability are simultaneously realized within a unified analytical framework.

3.3 Construction and cost of control

We introduce two operators: the finite-dimensional matrix Rs,tlR_{s,t}^{l}, viewed as a linear map from l×T𝐩M\mathcal{H}_{l}\times T_{\mathbf{p}}M to l×T𝐩M\mathcal{H}_{l}\times T_{\mathbf{p}}M, and the bounded linear operator Rs,thR_{s,t}^{h} from h\mathcal{H}_{h} to h\mathcal{H}_{h}. These operators are defined by the following evolution equations:

tRs,tl=A^lRs,tl+DlF^l(ut,𝐩t)Rs,tl,Rs,sl=Id,\displaystyle\partial_{t}R_{s,t}^{l}=-\hat{A}_{l}R_{s,t}^{l}+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{t})R_{s,t}^{l},\quad R_{s,s}^{l}=\text{Id}, (3.13)

and, for 0st0\leq s\leq t,

tRs,th=A^hRs,th+DhF^h(ut,𝐩t)Rs,th,Rs,sh=Id.\displaystyle\partial_{t}R_{s,t}^{h}=-\hat{A}_{h}R_{s,t}^{h}+D_{h}\hat{F}_{h}(u_{t},\mathbf{p}_{t})R_{s,t}^{h},\quad R_{s,s}^{h}=\text{Id}. (3.14)

More specifically, the above evolution equation can also be written as:

{tRs,tl=L~tlRs,tl,Rs,sl=Id,and{tRs,th=L~thRs,th,Rs,sh=Id,\begin{cases}\partial_{t}R_{s,t}^{l}=\tilde{L}_{t}^{l}R_{s,t}^{l},\\ R_{s,s}^{l}=\text{Id},\end{cases}\quad\text{and}\quad\begin{cases}\partial_{t}R_{s,t}^{h}=\tilde{L}_{t}^{h}R_{s,t}^{h},\\ R_{s,s}^{h}=\text{Id},\end{cases} (3.15)

where L~tl:=L~l(ut,xt,vt)=L~l(ut,𝐩t):Hl7×T𝐩tMHl5×T𝐩tM\tilde{L}_{t}^{l}:=\tilde{L}^{l}(u_{t},x_{t},v_{t})=\tilde{L}^{l}(u_{t},\mathbf{p}_{{}_{t}})\colon H^{7}_{l}\times T_{\mathbf{p}_{{}_{t}}}M\to H^{5}_{l}\times T_{\mathbf{p}_{{}_{t}}}M is given by

L~tl(𝔥ul𝔥x𝔥v):=(νΠl(Δ𝔥ul)Πl(𝔥ulut+ut𝔥ul)Du(x)𝔥x+𝔥ul(x)D[ΠvDu(x)v]𝔥v+ΠvDu(x)𝔥v+Πv[D2u(x)𝔥x]v+ΠvD𝔥ul(x)v),\tilde{L}_{t}^{l}\begin{pmatrix}\mathfrak{h}_{u}^{l}\\ \mathfrak{h}_{x}\\ \mathfrak{h}_{v}\end{pmatrix}:=\begin{pmatrix}\nu\Pi_{l}(\Delta\mathfrak{h}_{u}^{l})-\Pi_{l}(\mathfrak{h}_{u}^{l}\cdot\nabla u_{t}+u_{t}\cdot\nabla\mathfrak{h}_{u}^{l})\\ Du(x)\mathfrak{h}_{x}+\mathfrak{h}_{u}^{l}(x)\\ D[\Pi_{v}Du(x)v]\mathfrak{h}_{v}+\Pi_{v}Du(x)\mathfrak{h}_{v}+\Pi_{v}[D^{2}u(x)\mathfrak{h}_{x}]v+\Pi_{v}D\mathfrak{h}_{u}^{l}(x)v\end{pmatrix},

and L~th:=L~h(ut):Hh7Hh5\tilde{L}_{t}^{h}:=\tilde{L}^{h}(u_{t})\colon H^{7}_{h}\to H^{5}_{h} is given by

L~th(𝔥h):=νΠh(Δ𝔥h)Πh(𝔥hut+ut𝔥h).\tilde{L}_{t}^{h}(\mathfrak{h}^{h}):=\nu\Pi_{h}(\Delta\mathfrak{h}^{h})-\Pi_{h}(\mathfrak{h}^{h}\cdot\nabla u_{t}+u_{t}\cdot\nabla\mathfrak{h}^{h}).

Both Rs,tlR_{s,t}^{l} and Rs,thR_{s,t}^{h} provide approximations to the projection of the full Jacobian Js,tJ_{s,t} (the derivative of the process (ut,xt,vt)(u_{t},x_{t},v_{t})) onto the low and high modes, respectively, when the time tt is sufficiently small. Observe that Rs,tlR_{s,t}^{l} is invertible (due to its satisfaction of a finite-dimensional linear evolution equation), then we denote its inverse by

Ss,tl=(Rs,tl)1.S_{s,t}^{l}=(R_{s,t}^{l})^{-1}.

Set T0=T~0+τ0(<1)T_{0}=\widetilde{T}_{0}+\tau_{0}(<1), where T~01\widetilde{T}_{0}\ll 1 and τ0(0,1)\tau_{0}\in(0,1) are to be determined later. We simplify the notation by writing Rtl=Rτ0,tlR_{t}^{l}=R_{\tau_{0},t}^{l} and Stl=Sτ0,tlS_{t}^{l}=S_{\tau_{0},t}^{l}. When τ0s<tT0\tau_{0}\leq s<t\leq T_{0}, by virtue of the invertibility of RtlR_{t}^{l}, we can express Rs,tlR_{s,t}^{l} as Rs,tl=RtlSslR_{s,t}^{l}=R_{t}^{l}S_{s}^{l}.

We first introduce the key notion of the partial Malliavin matrix, which will play a central role in the subsequent control design.

Definition 3.4.

Define the partial Malliavin matrix 𝒩T0l:l×T𝐩T0Ml×T𝐩T0M\mathcal{N}_{{}_{T_{0}}}^{l}:\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M by

𝒩T0l=τ0T0SslQ^l(SslQ^l)𝑑s.\mathcal{N}_{{}_{T_{0}}}^{l}=\int_{\tau_{0}}^{T_{0}}\,S_{s}^{l}\hat{Q}^{l}\,(S_{s}^{l}\hat{Q}^{l})^{\top}\,ds.
Remark 3.5.

The matrix 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l} corresponds to a reduced Malliavin matrix, originally introduced by Norris [34] to streamline the Malliavin’s proof of Hörmander’s theorem. The term ‘partial Malliavin matrix’ is adopted from [6] and further employed in [11, 12]; it refers specifically to the finite-dimensional Malliavin matrix associated with the low modes. Here, we note that the usual definition of the Malliavin matrix would be

𝒩T0l=τ0T0Rs,T0lQ^l(Rs,T0lQ^l)𝑑s,\mathcal{N}_{{}_{T_{0}}}^{l}=\int_{\tau_{0}}^{T_{0}}\,R_{s,T_{0}}^{l}\hat{Q}^{l}\,(R_{s,T_{0}}^{l}\hat{Q}^{l})^{\top}\,ds,

but for such a reduced matrix, we exploit the invertibility of Rs,T0lR_{s,T_{0}}^{l} to define it here as in Definition 3.4.

The main advantage of this formulation is that, unlike the original Malliavin matrix, 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l} is adapted, which greatly simplifies calculations. For instance, the Itó’s formula can be applied directly to generate Lie brackets, as detailed in [2, Lemma 6.15]. Moreover, the invertibility of Rs,T0lR_{s,T_{0}}^{l} further justifies the frequency separation between high and low modes.

Indeed, we can later prove the non-degeneracy of 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l}, which allows us to construct the low‑frequency component of the control gt,t[τ0,T0]g_{{}_{t}},\,t\in[\tau_{0},T_{0}]. More precisely, we have the following result; the proof is somewhat involved and is postponed to Section 4 below.

Proposition 3.6.

The matrix 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l} is almost surely invertible on l×T𝐩T0M\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M. Furthermore, for all p1p\geq 1, η(0,1)\eta\in(0,1) and 𝐩0M\mathbf{p}_{{}_{0}}\in M there exists C(N,ν,𝐩0,η,p)>0C(N_{*},\nu,\mathbf{p}_{{}_{0}},\eta,p)>0, locally bounded in 𝐩0\mathbf{p}_{{}_{0}}, such that

𝔼|(𝒩T0l)1|pC(N,ν,𝐩0,η,p)exp(pηV(u0)).\mathbb{E}|(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}|^{p}\leq C(N_{*},\nu,\mathbf{p}_{{}_{0}},\eta,p)\exp{(p\eta V(u_{0}))}.

We will then use Proposition 3.6 to construct a perturbation glg^{l} which is given by 0 on all intervals of the type [0,τ0][0,\tau_{0}], and by gtlL2([τ0,T0];m)g_{t}^{l}\in L^{2}([\tau_{0},T_{0}];{\mathbb{R}^{m}}) on the remaining intervals.

We define the infinitesimal variation gtlg_{t}^{l} by

gtl=(StlQ^l)(𝒩T0l)1ST0lΠl(Jτ0,T0(J0,τ0𝔥))=(StlQ^l)(𝒩T0l)1ST0lΠl(J0,T0𝔥).g_{t}^{l}=(S_{t}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{T_{0}}}^{l}\Pi_{l}(J_{\tau_{0},T_{0}}(J_{0,\tau_{0}}\mathfrak{h}))=(S_{t}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{T_{0}}}^{l}\Pi_{l}(J_{0,T_{0}}\mathfrak{h}). (3.16)

Thereafter, with a slightly abuse notation we will use gtlg_{t}^{l} to denote both the perturbation of the Wiener path on [τ0,T0][\tau_{0},T_{0}] and its extension (by zero) to the interval [0,τ0][0,\tau_{0}]. Note that here we leave the high-frequency part uncontrolled, hence the overall control is given by

gt=(gtl, 0)=((StlQ^l)(𝒩T0l)1ST0lΠl(J0,T0𝔥), 0).\displaystyle g_{t}=(g_{t}^{l},\,0)=\left((S_{t}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{T_{0}}}^{l}\Pi_{l}(J_{0,T_{0}}\mathfrak{h}),\,0\right). (3.17)

We next derive an estimate for the control cost.

Proposition 3.7 (Cost of control).

There exists a constant C(N,ν,𝐩0)C(N_{*},\nu,\mathbf{p}_{0}), locally bounded in 𝐩0\mathbf{p}_{0}, such that

𝔼|τ0T0gs𝑑Ws|2Cexp(ηV(u0))𝔥×T𝐩M.\mathbb{E}\left|\int_{\tau_{0}}^{T_{0}}g_{{}_{s}}dW_{s}\right|^{2}\leq C\exp(\eta V(u_{0}))\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}}M}. (3.18)

This argument is highly similar to [11, Section 4.6] and requires no new ideas. Here, the control gtg_{t} is not adapted, estimating it requires appealing to the fundamental L2L^{2}-isometry of the Skorokhod integral.

Proposition 3.8.

[35, Proposition 1.3.1] Let g𝕎1,2(ΩT;L2([0,T];m))g\in\mathbb{W}^{1,2}(\Omega_{T};L^{2}([0,T];{\mathbb{R}^{m}})). Then the following identity holds:

𝔼(0Tgt,δWtm)2\displaystyle\mathbb{E}\left(\int_{0}^{T}\langle g_{{}_{t}},\delta W_{t}\rangle_{{\mathbb{R}^{m}}}\right)^{2} =𝔼0Tgsm2𝑑s+𝔼0T0Ttrm(𝒟sgt𝒟tgs)𝑑s𝑑t\displaystyle=\mathbb{E}\int_{0}^{T}\|g_{{}_{s}}\|_{{\mathbb{R}^{m}}}^{2}\,ds+\mathbb{E}\int_{0}^{T}\int_{0}^{T}\operatorname{tr}_{{}_{\mathbb{R}^{m}}}(\mathcal{D}_{s}g_{{}_{t}}\mathcal{D}_{t}g_{{}_{s}})\,ds\,dt
𝔼0Tgsm2𝑑s+𝔼0T0T𝒟tgsmm2𝑑s𝑑t\displaystyle\leq\mathbb{E}\int_{0}^{T}\|g_{{}_{s}}\|_{{\mathbb{R}^{m}}}^{2}\,ds+\mathbb{E}\int_{0}^{T}\int_{0}^{T}\|\mathcal{D}_{t}g_{{}_{s}}\|_{{\mathbb{R}^{m}}\to{\mathbb{R}^{m}}}^{2}\,ds\,dt
=g𝕎1,2(ΩT;L2([0,T];m))2,\displaystyle=\|g\|_{\mathbb{W}^{1,2}(\Omega_{T};L^{2}([0,T];{\mathbb{R}^{m}}))}^{2},

where 𝒟s\mathcal{D}_{s} is the Malliavin derivative of the noise at time ss as in (3.7).

Proof of Proposition 3.7.

By Proposition 3.8, we immediately obtain

𝔼|τ0T0gs𝑑Ws|2𝔼gtL2([τ0,T0];m)2+𝔼τ0T0𝒟sgtL2([τ0,T0];m)𝑑s.\mathbb{E}\left|\int_{\tau_{0}}^{T_{0}}g_{{}_{s}}dW_{s}\right|^{2}\leq\mathbb{E}\|g_{{}_{t}}\|_{L^{2}([\tau_{0},T_{0}];{\mathbb{R}^{m}})}^{2}+\mathbb{E}\int_{\tau_{0}}^{T_{0}}\|\mathcal{D}_{s}g_{{}_{t}}\|_{L^{2}([\tau_{0},T_{0}];{\mathbb{R}^{m}})}\,ds.

Firstly, we note that

gtL2([τ0,T0])\displaystyle\|g_{{}_{t}}\|_{L^{2}([\tau_{0},T_{0}])} =gtlL2([τ0,T0])=(StlQ^l)(𝒩T0l)1ST0lΠl(J0,T0𝔥)L2([τ0,T0])\displaystyle=\|g_{{}_{t}}^{l}\|_{L^{2}([\tau_{0},T_{0}])}=\|(S_{t}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{T_{0}}}^{l}\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\|_{L^{2}([\tau_{0},T_{0}])}
(StlQ^l)l×TML2([τ0,T0])|𝒩T0l|1ST0ll×TMl×TMΠl(J0,T0𝔥)l×TM.\displaystyle\leq\|(S_{t}^{l}\hat{Q}_{l})^{\top}\|_{\mathcal{H}_{l}\times TM\to L^{2}([\tau_{0},T_{0}])}\,\,|\mathcal{N}_{{}_{T_{0}}}^{l}|^{-1}\,\,\|S_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\,\,\|\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\|_{\mathcal{H}_{l}\times TM}.

Note also that by the product rule, we have

𝒟sgtL2([τ0,T0])\displaystyle\|\mathcal{D}_{s}g_{{}_{t}}\|_{L^{2}([\tau_{0},T_{0}])}
=𝒟s((StlQ^l)(𝒩T0l)1ST0lΠl(J0,T0𝔥))L2([τ0,T0])\displaystyle=\|\mathcal{D}_{s}\left((S_{t}^{l}\hat{Q}_{l})^{\top}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}S_{{}_{T_{0}}}^{l}\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\right)\|_{L^{2}([\tau_{0},T_{0}])}
𝒟s((StlQ^l))l×TML2([τ0,T0])|𝒩T0l|1ST0ll×TMl×TMΠl(J0,T0𝔥)l×TM\displaystyle\leq\|\mathcal{D}_{s}\big((S_{t}^{l}\hat{Q}_{l})^{\top}\big)\|_{\mathcal{H}_{l}\times TM\to L^{2}([\tau_{0},T_{0}])}\,\,|\mathcal{N}_{{}_{T_{0}}}^{l}|^{-1}\,\,\|S_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\,\,\|\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\|_{\mathcal{H}_{l}\times TM}
+(StlQ^l)l×TML2([τ0,T0])𝒟s(𝒩T0l)1L2([τ0,T0])l×TMST0ll×TMl×TMΠl(J0,T0𝔥)l×TM\displaystyle\quad+\|(S_{t}^{l}\hat{Q}_{l})^{\top}\big\|_{\mathcal{H}_{l}\times TM\to L^{2}([\tau_{0},T_{0}])}\,\,\|\mathcal{D}_{s}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}\|_{L^{2}([\tau_{0},T_{0}])\to\mathcal{H}_{l}\times TM}\,\,\|S_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\,\,\|\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\|_{\mathcal{H}_{l}\times TM}
+(StlQ^l)l×TML2([τ0,T0])|𝒩T0l|1𝒟sST0ll×TMl×TMΠl(J0,T0𝔥)l×TM\displaystyle\quad+\|(S_{t}^{l}\hat{Q}_{l})^{\top}\big\|_{\mathcal{H}_{l}\times TM\to L^{2}([\tau_{0},T_{0}])}\,\,|\mathcal{N}_{{}_{T_{0}}}^{l}|^{-1}\,\,\|\mathcal{D}_{s}S_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\,\,\|\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\|_{\mathcal{H}_{l}\times TM}
+(StlQ^l)l×TML2([τ0,T0])|𝒩T0l|1ST0ll×TMl×TM𝒟s(Πl(J0,T0𝔥))l×TM\displaystyle\quad+\|(S_{t}^{l}\hat{Q}_{l})^{\top}\big\|_{\mathcal{H}_{l}\times TM\to L^{2}([\tau_{0},T_{0}])}\,\,|\mathcal{N}_{{}_{T_{0}}}^{l}|^{-1}\,\,\|S_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\,\,\|\mathcal{D}_{s}\big(\Pi_{l}(J_{0,T_{0}}\mathfrak{h})\big)\|_{\mathcal{H}_{l}\times TM}

Note also that the following holds:

𝒟s(𝒩T0l)1f=(𝒩T0l)1(𝒟s𝒩T0lf)(𝒩T0l)1,𝒟sST0lf=ST0l(𝒟sRT0lf)ST0l.\displaystyle\mathcal{D}_{s}(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}f=-(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}(\mathcal{D}_{s}\mathcal{N}_{{}_{T_{0}}}^{l}f)(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1},\qquad\mathcal{D}_{s}S_{{}_{T_{0}}}^{l}f=-S_{{}_{T_{0}}}^{l}(\mathcal{D}_{s}R_{{}_{T_{0}}}^{l}f)S_{{}_{T_{0}}}^{l}. (3.19)

Thus Proposition 3.7 follows from the Proposition 3.6, (3.19), (A.7), (A.9) together with the following moment bounds, where C(N,ν,𝐩0,η)>0C(N_{*},\nu,\mathbf{p}_{0},\eta)>0 is locally bounded in 𝐩0\mathbf{p}_{0},

𝔼supτ0r<tT0(𝒟sRr,tl𝔥lL2([τ0,T0])l×T𝐩M)\displaystyle\mathbb{E}\sup_{\tau_{0}\leq r<t\leq T_{0}}\left(\|\mathcal{D}_{s}R_{r,t}^{l}\mathfrak{h}^{l}\|_{L^{2}([\tau_{0},T_{0}])\to\mathcal{H}_{l}\times T_{\mathbf{p}}M}\right) CT~0eηV(𝐮0)𝔥ll,\displaystyle\leq C\,\widetilde{T}_{0}\,e^{\eta V(\mathbf{u}_{0})}\|\mathfrak{h}^{l}\|_{l}, (3.20)
𝔼supτ0r<tT0𝒟sJr,t𝔥L2([τ0,T0])×T𝐩M\displaystyle\mathbb{E}\sup_{\tau_{0}\leq r<t\leq T_{0}}\|\mathcal{D}_{s}J_{r,t}\mathfrak{h}\|_{L^{2}([\tau_{0},T_{0}])\to\mathcal{H}\times T_{\mathbf{p}}M} CT~01/2eηV(𝐮0)𝔥.\displaystyle\leq C\,\widetilde{T}_{0}^{1/2}\,e^{\eta V(\mathbf{u}_{0})}\|\mathfrak{h}\|. (3.21)

It follows from (3.13) that Rr,tl𝔥lR_{r,t}^{l}\mathfrak{h}^{l} satisfies

tRr,tl𝔥=A^lRr,tl𝔥+DlF^l(ut,𝐩t)Rr,tl𝔥,Rr,rl𝔥=𝔥.\displaystyle\partial_{t}R_{r,t}^{l}\mathfrak{h}=-\hat{A}_{l}R_{r,t}^{l}\mathfrak{h}+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{t})R_{r,t}^{l}\mathfrak{h},\quad R_{r,r}^{l}\mathfrak{h}=\mathfrak{h}.

Differentiating the above equation in the Wiener path for fixed 𝔥\mathfrak{h} and applying the chain rule gives:

t(𝒟s(Rr,tl𝔥))\displaystyle\partial_{t}(\mathcal{D}_{s}(R_{r,t}^{l}\mathfrak{h})) =(A^l+DlF^l(ut,𝐩t))𝒟s(Rr,tl𝔥)+D¯l2F^l(𝒟s(ut,𝐩t),Rr,tl𝔥)\displaystyle=(-\hat{A}_{l}+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{t}))\mathcal{D}_{s}(R_{r,t}^{l}\mathfrak{h})+\overline{D}^{2}_{l}\hat{F}_{l}(\mathcal{D}_{s}(u_{t},\mathbf{p}_{t}),R_{r,t}^{l}\mathfrak{h})
=(A^l+DlF^l(ut,𝐩t))𝒟s(Rr,tl𝔥)+D¯2F^l(Js,tQ^,Rr,tl𝔥),\displaystyle=(-\hat{A}_{l}+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{t}))\mathcal{D}_{s}(R_{r,t}^{l}\mathfrak{h})+\overline{D}^{2}\hat{F}_{l}(J_{s,t}\hat{Q},R_{r,t}^{l}\mathfrak{h}),

where D¯2F^\overline{D}^{2}\hat{F} denotes the full second variation of FF extended to the linear space l×2d\mathcal{H}_{l}\times\mathbb{R}^{2d}, and the last inequality employs the observation (3.9). Then, by the variation of constants formula, we can write it in the following integral form:

𝒟s(Rr,tl𝔥)f=rtRτ,tl(D¯2F^l(Js,τQ^f,Rr,τl𝔥))𝑑τ.\displaystyle\mathcal{D}_{s}(R_{r,t}^{l}\mathfrak{h})f=\int_{r}^{t}R_{\tau,t}^{l}\left(\overline{D}^{2}\hat{F}_{l}(J_{s,\tau}\hat{Q}f,R_{r,\tau}^{l}\mathfrak{h})\right)\,d\tau.

Analogously, we obtain

𝒟s(Jr,t𝔥)f=rtJτ,t(D¯2F^(Jr,τQ^f,Jr,τ𝔥))𝑑τ.\displaystyle\mathcal{D}_{s}(J_{r,t}\mathfrak{h})f=\int_{r}^{t}J_{\tau,t}\left(\overline{D}^{2}\hat{F}(J_{r,\tau}\hat{Q}f,J_{r,\tau}\mathfrak{h})\right)\,d\tau.

To prove (3.20)–(3.21), we only need to show

𝔼supτ0tT0D¯2F^(ut,𝐩t)(l×2d)(l×2d)C(N,ν)eηV(u0).\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}\|\overline{D}^{2}\hat{F}(u_{t},\mathbf{p}_{{}_{t}})\|_{(\mathcal{H}_{l}\times\mathbb{R}^{2d})\bigotimes(\mathcal{H}_{l}\times\mathbb{R}^{2d})}\leq C(N_{*},\nu)e^{\eta V(u_{0})}.

Fix (u,x,v)(u,x,v), with 𝔥=(𝔥u,𝔥x,𝔥v)\mathfrak{h}=(\mathfrak{h}^{u},\mathfrak{h}^{x},\mathfrak{h}^{v}) and 𝔨=(𝔨u,𝔨x,𝔨v)\mathfrak{k}=(\mathfrak{k}^{u},\mathfrak{k}^{x},\mathfrak{k}^{v}) being tangent vectors, if we denote F^u=B(u,u)\hat{F}^{u}=B(u,u), it follows that

D¯2F^u(u)[𝔥,𝔨]lB(𝔥u,𝔨u)l+B(𝔨u,𝔥u)lC(N)𝔥ul𝔨ul,\displaystyle\|\overline{D}^{2}\hat{F}^{u}(u)[\mathfrak{h},\mathfrak{k}]\|_{\mathcal{H}_{l}}\leq\|B(\mathfrak{h}^{u},\mathfrak{k}^{u})\|_{\mathcal{H}_{l}}+\|B(\mathfrak{k}^{u},\mathfrak{h}^{u})\|_{\mathcal{H}_{l}}\leq C(N_{*})\|\mathfrak{h}^{u}\|_{\mathcal{H}_{l}}\|\mathfrak{k}^{u}\|_{\mathcal{H}_{l}},

then

D¯2F^u(u)2(l,l)C(N).\displaystyle\|\overline{D}^{2}\hat{F}^{u}(u)\|_{\mathcal{L}^{2}(\mathcal{H}_{l},\mathcal{H}_{l})}\leq C(N_{*}).

Set F^x=u(x)\hat{F}^{x}=u(x), computation yields

D¯2F^x(u,x)[𝔥,𝔨]=D¯𝔥u(x)𝔨x+D𝔨u(x)𝔥x+D¯2u(x)[𝔥x,𝔨x].\overline{D}^{2}\hat{F}^{x}(u,x)[\mathfrak{h},\mathfrak{k}]=\overline{D}\mathfrak{h}^{u}(x)\mathfrak{k}^{x}+D\mathfrak{k}^{u}(x)\mathfrak{h}^{x}+\overline{D}^{2}u(x)[\mathfrak{h}^{x},\mathfrak{k}^{x}].

Since the space is finite-dimensional, combined with (A.2) we obtain

𝔼supτ0tT0D¯2F^x(ut,xt)2(l×d,l×d)C(N,ν)eηV(u0).\displaystyle\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}\|\overline{D}^{2}\hat{F}^{x}(u_{t},x_{t})\|_{\mathcal{L}^{2}(\mathcal{H}_{l}\times\mathbb{R}^{d},\mathcal{H}_{l}\times\mathbb{R}^{d})}\leq C(N_{*},\nu)e^{\eta V(u_{0})}.

Similarly, setting F^v=ΠvDu(x)v\hat{F}^{v}=\Pi_{v}Du(x)v, we have

𝔼supτ0tT0D¯2F^v(ut,xt,vt)2(l×2d,l×2d)C(N,ν)eηV(u0).\displaystyle\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}\|\overline{D}^{2}\hat{F}^{v}(u_{t},x_{t},v_{t})\|_{\mathcal{L}^{2}(\mathcal{H}_{l}\times\mathbb{R}^{2d},\mathcal{H}_{l}\times\mathbb{R}^{2d})}\leq C(N_{*},\nu)e^{\eta V(u_{0})}.

This completes the proof. ∎

3.4 Error representation and estimates

Due to the coupling between high and low frequencies present in both the nonlinear terms and the vector fields on the manifold, the high and low-frequency components of the Malliavin derivative are themselves typically coupled. To address this, we introduce two auxiliary processes, ξt\xi_{t} and ζt\zeta_{t}, intended to precisely correspond to the high and low-frequency parts of the Malliavin derivative, respectively. First, based on the evolution equation for the Malliavin derivative 𝒟g(ut,𝐩t)\mathcal{D}^{g}(u_{t},\mathbf{p}_{{}_{t}}), we have

t(𝒟g(ut,𝐩t))=A^𝒟g(ut,𝐩t)+DF^(ut,𝐩t)𝒟g(ut,𝐩t)+Q^gt,𝒟gt(u0,𝐩0)=0.\displaystyle\partial_{t}(\mathcal{D}^{g}(u_{t},\mathbf{p}_{{}_{t}}))=-\hat{A}\mathcal{D}^{g}(u_{t},\mathbf{p}_{{}_{t}})+D\hat{F}(u_{t},\mathbf{p}_{{}_{t}})\mathcal{D}^{g}(u_{t},\mathbf{p}_{{}_{t}})+\hat{Q}g_{{}_{t}},\,\,\,\,\,\,\,\,\,\mathcal{D}^{g_{{}_{t}}}(u_{0},\mathbf{p}_{{}_{0}})=0. (3.22)

Our goal is

𝒟g(ut,𝐩t)(=(𝒟g(utl,𝐩t),𝒟g(uth)))=(ζt,ξt).\displaystyle\mathcal{D}^{g}(u_{t},\mathbf{p}_{{}_{t}})\left(=\left(\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}}),\,\mathcal{D}^{g}(u_{t}^{h})\right)\right)=(\zeta_{t},\,\xi_{t}). (3.23)

For each t[τ0,T0]t\in[\tau_{0},T_{0}], (ζt,ξt)(\zeta_{t},\,\xi_{t}) solve the following system:

{ζ˙t=A^lζt+DlF^l(ut,𝐩t)ζt+DhF^l(ut,𝐩t)ξt+Q^lgtl,ξ˙t=A^hξt+DlF^h(ut,𝐩t)ζt+DhF^h(ut,𝐩t)ξt,\begin{cases}\dot{\zeta}_{t}=-\hat{A}_{l}\zeta_{t}+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{{}_{t}})\zeta_{t}+D_{h}\hat{F}_{l}(u_{t},\mathbf{p}_{{}_{t}})\xi_{t}+\hat{Q}_{l}g^{l}_{{}_{t}},\\ \dot{\xi}_{t}=-\hat{A}_{h}\xi_{t}+D_{l}\hat{F}_{h}(u_{t},\mathbf{p}_{{}_{t}})\zeta_{t}+D_{h}\hat{F}_{h}(u_{t},\mathbf{p}_{{}_{t}})\xi_{t},\end{cases} (3.24)

with ζτ0=0\zeta_{\tau_{0}}=0 and ξτ0=0\xi_{\tau_{0}}=0. Note that ζt\zeta_{t} and ξt\xi_{t} are defined in terms of each other, so it is necessary to verify the existence and uniqueness of their solution afterwards. If the above system admits a unique solution ηt=(ζt,ξt)\eta_{t}=(\zeta_{t},\xi_{t}), then (3.23) follows naturally, and one also obtains the expression for the error given in the following lemma.

Lemma 3.9.

Assume that gtg_{t} is defined as above and that there exists a unique solution (ζt,ξt)(\zeta_{t},\xi_{t}) to system (3.24) in the space L2(Ω;C([τ0,T0];×T𝐩M))L^{2}(\Omega;C([\tau_{0},T_{0}];\mathcal{H}\times T_{\mathbf{p}}M)), then the remainder

ρT0=J0,T0𝔥𝒟g(uT0,𝐩T0)\rho_{{}_{T_{0}}}=J_{0,T_{0}}\mathfrak{h}-\mathcal{D}^{g}(u_{{}_{T_{0}}},\mathbf{p}_{{}_{T_{0}}})

satisfies

ρT0l\displaystyle\rho_{{}_{T_{0}}}^{l} =τ0T0Rs,T0lDhF^l(us,𝐩s)ξs𝑑s,\displaystyle=-\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{l}D_{h}\hat{F}_{l}(u_{s},\mathbf{p}_{{}_{s}})\xi_{s}\,ds, (3.25)
ρT0h\displaystyle\rho_{{}_{T_{0}}}^{h} =Rτ0,T0h(Πh(J0,τ0𝔥))+τ0T0Rs,T0hDlF^h(us,𝐩s)Πl(J0,s𝔥)𝑑sτ0T0Rs,T0hDlF^h(us,𝐩s)ζs𝑑s.\displaystyle=R_{\tau_{0},T_{0}}^{h}(\Pi_{h}(J_{0,\tau_{0}}\mathfrak{h}))+\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{h}D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\Pi_{l}(J_{0,s}\mathfrak{h})\,ds-\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{h}D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\zeta_{s}\,ds. (3.26)
Proof.

Using (3.22) and (3.24), we obtain that in fact 𝒟g(utl,𝐩t)=ζt\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}})=\zeta_{t} and 𝒟g(uth)=ξt\mathcal{D}^{g}(u^{h}_{t})=\xi_{t} and we obtain the following formulas for the Malliavin derivatives at time T0T_{0}:

{t𝒟g(utl,𝐩t)=A^l𝒟g(utl,𝐩t)+DlF^l(ut,𝐩t)𝒟g(utl,𝐩t)+DhF^l(ut,𝐩t)𝒟g(uth)+Q^lgtl,t𝒟g(uth)=A^h𝒟g(uth)+DlF^h(ut,𝐩t)𝒟g(utl,𝐩t)+DhF^h(ut,𝐩t)𝒟g(uth),\begin{cases}\partial_{t}\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}})=-\hat{A}_{l}\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}})+D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{{}_{t}})\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}})+D_{h}\hat{F}_{l}(u_{t},\mathbf{p}_{{}_{t}})\mathcal{D}^{g}(u^{h}_{t})+\hat{Q}_{l}g^{l}_{{}_{t}},\\ \partial_{t}\mathcal{D}^{g}(u^{h}_{t})=-\hat{A}_{h}\mathcal{D}^{g}(u^{h}_{t})+D_{l}\hat{F}_{h}(u_{t},\mathbf{p}_{{}_{t}})\mathcal{D}^{g}(u^{l}_{t},\mathbf{p}_{{}_{t}})+D_{h}\hat{F}_{h}(u_{t},\mathbf{p}_{{}_{t}})\mathcal{D}^{g}(u^{h}_{t}),\end{cases}

with 𝒟g(uτ0l,𝐩τ0)=0\mathcal{D}^{g}(u^{l}_{{\tau_{0}}},\mathbf{p}_{{}_{\tau_{0}}})=0 and 𝒟g(uτ0h)=0\mathcal{D}^{g}(u^{h}_{{\tau_{0}}})=0. By the variation of constants formula and the invertibility of 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l}, we obtain

𝒟g(uT0l,𝐩T0)\displaystyle\mathcal{D}^{g}(u^{l}_{T_{0}},\mathbf{p}_{{}_{T_{0}}}) =Πl(J0,T0𝔥)+τ0T0Rs,T0lDhF^l(us,𝐩s)ξs𝑑s,\displaystyle=\Pi_{l}({J_{0,T_{0}}\mathfrak{h}})+\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{l}D_{h}\hat{F}_{l}(u_{s},\mathbf{p}_{{}_{s}})\xi_{s}\,ds,
𝒟g(uT0h)\displaystyle\mathcal{D}^{g}(u^{h}_{T_{0}}) =τ0T0Rs,T0hDlF^h(us,𝐩s)ζs𝑑s.\displaystyle=\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{h}D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\zeta_{s}\,ds.

Using this relation, we now write

{Πl(J0,T0𝔥)=𝒟g(uT0l,𝐩T0)+ρT0l,Πh(J0,T0𝔥)=𝒟g(uT0h)+ρT0h,\begin{cases}\Pi_{l}({J_{0,T_{0}}\mathfrak{h}})=\mathcal{D}^{g}(u^{l}_{T_{0}},\mathbf{p}_{{}_{T_{0}}})+\rho_{{}_{T_{0}}}^{l},\\ \Pi_{h}({J_{0,T_{0}}\mathfrak{h}})=\mathcal{D}^{g}(u^{h}_{T_{0}})+\rho_{{}_{T_{0}}}^{h},\end{cases}

where ρT0l\rho_{{}_{T_{0}}}^{l} and ρT0h\rho_{{}_{T_{0}}}^{h} are given by (3.25) and (3.26). Since Js,tJ_{s,t} satisfies

tJs,t=A^Js,t+DF^(ut,𝐩t)Js,t,Js,s=Id,\partial_{t}J_{s,t}=-\hat{A}J_{s,t}+D\hat{F}(u_{t},\mathbf{p}_{t})J_{s,t},\quad J_{s,s}=\text{Id},

we can use the variation of constants formula to write its high-frequency projection as

Rs,th(Πh(J0,s𝔥))+stRr,thDlF^h(ur,𝐩r)Πl(J0,r𝔥)𝑑s.R_{s,t}^{h}(\Pi_{h}(J_{0,s}\mathfrak{h}))+\int_{s}^{t}R_{r,t}^{h}D_{l}\hat{F}_{h}(u_{r},\mathbf{p}_{{}_{r}})\Pi_{l}(J_{0,r}\mathfrak{h})\,ds.

This completes the proof of the lemma. ∎

We will prove the existence and uniqueness of solutions to system (3.24) and establish quantitative estimates for them. This result, on the one hand, ensures the validity of the representation for the Malliavin derivative and the error, and on the other hand, provides a basis for the subsequent error estimation.

Lemma 3.10.

Fix τ0>0\tau_{0}>0, for all T~0>0\widetilde{T}_{0}>0 sufficiently small, and all p2,η(0,1)p\geq 2,\eta\in(0,1), there exist a unique solution ηt=(ζt,ξt)×T𝐩T0M\eta_{t}=(\zeta_{t},\xi_{t})\in\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M on [τ0,T0][\tau_{0},T_{0}] to system (3.24) in Lp(Ω;C([τ0,T0];×T𝐩T0M))L^{p}(\Omega;C([\tau_{0},T_{0}];\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M)) and a constant C:=C(N,ν,η,𝐩0)>0C:=C(N_{*},\nu,\eta,\mathbf{p}_{0})>0, locally bounded in 𝐩0\mathbf{p}_{0} satisfying

(𝔼supt[τ0,T0]ηt×T𝐩T0Mp)1pCT~0exp(ηV(u0)).\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\eta_{t}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{p}\right)^{\frac{1}{p}}\leq C\,\widetilde{T}_{0}\exp{(\eta V(u_{0}))}.

Note that ηt\eta_{t} is not adapted to the filtration (t)(\mathcal{F}_{t}).

Proof.

Define the space 𝐗T0:=Lp(Ω;C([τ0,T0];×T𝐩T0M))\mathbf{X}_{{}_{T_{0}}}:=L^{p}(\Omega;C([\tau_{0},T_{0}];\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M)) and endow it with the norm

η𝐗T0:=(𝔼supt[τ0,T0]ηt×T𝐩T0Mp)1p,\|\eta\|_{\mathbf{X}_{{}_{T_{0}}}}:=\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\eta_{t}\|^{p}_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}\right)^{\frac{1}{p}},

which makes it a Banach space. We prove the lemma by finding a fixed point of the following system in the aforementioned Banach space.

ζt\displaystyle\zeta_{t} =τ0T0Rs,tlDhF^l(𝐮s,𝐩s)ξs𝑑s+τ0T0Rs,tlQ^lgsl𝑑s,\displaystyle=\int_{\tau_{0}}^{T_{0}}R_{s,t}^{l}D_{h}\hat{F}_{l}(\mathbf{u}_{s},\mathbf{p}_{{}_{s}})\xi_{s}\,ds+\int_{\tau_{0}}^{T_{0}}R_{s,t}^{l}\hat{Q}^{l}g_{{}_{s}}^{l}\,ds, (3.27)
ξt\displaystyle\xi_{t} =τ0T0Rs,thDlF^h(𝐮s,𝐩s)ζs𝑑s.\displaystyle=\int_{\tau_{0}}^{T_{0}}R_{s,t}^{h}D_{l}\hat{F}_{h}(\mathbf{u}_{s},\mathbf{p}_{{}_{s}})\zeta_{s}\,ds. (3.28)

We proceed to estimate each term separately. First, by Jensen’s inequality we have

τ0T0Rs,tlQ^lgsl𝑑sl\displaystyle\left\|\int_{\tau_{0}}^{T_{0}}R_{s,t}^{l}\hat{Q}^{l}g_{{}_{s}}^{l}\,ds\right\|_{l} τ0T0Rs,tlQ^lgsll𝑑s\displaystyle\leq\int_{\tau_{0}}^{T_{0}}\|R_{s,t}^{l}\hat{Q}^{l}g_{{}_{s}}^{l}\|_{l}\,ds
C(τ0T0Rs,tll×T𝐩Ml×T𝐩M)1/2(τ0T0gslL2)1/2.\displaystyle\leq C\left(\int_{\tau_{0}}^{T_{0}}\|R_{s,t}^{l}\|_{\mathcal{H}_{l}\times T_{\mathbf{p}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}}M}\right)^{1/2}\left(\int_{\tau_{0}}^{T_{0}}\|g_{{}_{s}}^{l}\|_{L^{2}}\right)^{1/2}.

Then by (A.6) and the Proposition 3.7, we obtain that

(𝔼supt[τ0,T0]τ0T0Rs,tlQ^lgsl𝑑sl×T𝐩T0Mp)1pC(N,ν,η,𝐩0)T~0exp(ηV(u0))𝔥.\displaystyle\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\int_{\tau_{0}}^{T_{0}}R_{s,t}^{l}\hat{Q}^{l}g_{{}_{s}}^{l}\,ds\|^{p}_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}\right)^{\frac{1}{p}}\leq C(N_{*},\nu,\eta,\mathbf{p}_{0})\,\widetilde{T}_{0}\,\exp{(\eta V(u_{0}))}\|\mathfrak{h}\|. (3.29)

Similarly, using (A.5)-(A.6) and (A.11)-(A.12) we have

(𝔼supt[τ0,T0]τ0T0Rs,tlDhF^l(𝐮s,𝐩s)ξs𝑑sl×T𝐩T0Mp)1pC(N,ν,η,𝐩0)T~0exp(ηV(u0))ηt𝐗T0,\displaystyle\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\int_{\tau_{0}}^{T_{0}}R_{s,t}^{l}D_{h}\hat{F}_{l}(\mathbf{u}_{s},\mathbf{p}_{{}_{s}})\xi_{s}\,ds\|^{p}_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}\right)^{\frac{1}{p}}\leq C(N_{*},\nu,\eta,\mathbf{p}_{0})\,\widetilde{T}_{0}\,\exp{(\eta V(u_{0}))}\|\eta_{t}\|_{\mathbf{X}_{{}_{T_{0}}}}, (3.30)

and

(𝔼supt[τ0,T0]τ0T0Rs,thDlF^h(𝐮s,𝐩s)ζs𝑑shp)1p\displaystyle\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\int_{\tau_{0}}^{T_{0}}R_{s,t}^{h}D_{l}\hat{F}_{h}(\mathbf{u}_{s},\mathbf{p}_{{}_{s}})\zeta_{s}\,ds\|^{p}_{\mathcal{H}_{h}}\right)^{\frac{1}{p}}
C(N,ν,η,𝐩0)T~03/41νN2(eνN2τ0eνN2T0)exp(ηV(u0))ηt𝐗T0.\displaystyle\leq C(N_{*},\nu,\eta,\mathbf{p}_{0})\,\widetilde{T}_{0}^{3/4}\,\frac{1}{\nu N_{*}^{2}}(e^{-\nu N_{*}^{2}\tau_{0}}-e^{-\nu N_{*}^{2}T_{0}})\exp{(\eta V(u_{0}))}\|\eta_{t}\|_{\mathbf{X}_{{}_{T_{0}}}}. (3.31)

Combining (3.29) -(3.4), we have

(ζt,ξt)𝐗T0C(N,ν,η,𝐩0)T~0exp(ηV(u0))𝔥+δηt𝐗T0,\displaystyle\|(\zeta_{t},\xi_{t})\|_{\mathbf{X}_{{}_{T_{0}}}}\leq C(N_{*},\nu,\eta,\mathbf{p}_{0})\,\widetilde{T}_{0}\,\exp{(\eta V(u_{0}))}\|\mathfrak{h}\|+\delta\|\eta_{t}\|_{\mathbf{X}_{{}_{T_{0}}}},

where δ:=max{T~0,T~03/41νN2(eνN2τ0eνN2T0)}C(N,ν,η,𝐩0)exp(ηV(u0))\delta:=\max\left\{\widetilde{T}_{0},\widetilde{T}_{0}^{3/4}\,\frac{1}{\nu N_{*}^{2}}(e^{-\nu N_{*}^{2}\tau_{0}}-e^{-\nu N_{*}^{2}T_{0}})\right\}C(N_{*},\nu,\eta,\mathbf{p}_{0})\exp{(\eta V(u_{0}))}. By choosing T~0\widetilde{T}_{0} sufficiently small and NN_{*} appropriately, we can ensure that δ<1\delta<1. Then, by taking RR sufficiently large, the mapping Φt:(ζ,ξ)(ζt,ξt)\Phi_{t}:(\zeta,\xi)\mapsto(\zeta_{t},\xi_{t}) becomes a contraction on the ball {(ζt,ξt)𝐗T0R}\{\|(\zeta_{t},\xi_{t})\|_{\mathbf{X}_{{}_{T_{0}}}}\leq R\}. Consequently, by the Contraction Mapping Principle, there exists a unique fixed point (ζt,ξt)(\zeta_{t},\xi_{t}) on [τ0,T0][\tau_{0},T_{0}] which satisfies

(𝔼supt[τ0,T0]ηt×T𝐩T0Mp)1pC(N,ν,η,𝐩0)T~0exp(ηV(u0)).\left(\mathbb{E}\sup_{t\in[\tau_{0},T_{0}]}\|\eta_{t}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{p}\right)^{\frac{1}{p}}\leq C(N_{*},\nu,\eta,\mathbf{p}_{0})\,\widetilde{T}_{0}\exp{(\eta V(u_{0}))}.

We now present the bounds satisfied by the error. Note that, owing to the invertibility of 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l}, the least-squares error arising from the construction of the control vanishes. Consequently, it suffices to consider the influence of the coupling between high and low frequencies as well as the effect of the high-frequency system itself.

Proposition 3.11.

For every η,γ(0,1)\eta,\gamma\in(0,1), there exist constants T~0\widetilde{T}_{0} and τ0\tau_{0}, with 0<T~0,τ0<10<\widetilde{T}_{0},\tau_{0}<1 such that

𝔼ρT0×T𝐩T0M2γexp(ηV(u0))𝔥×T𝐩T0M2.\displaystyle\mathbb{E}\|\rho_{{}_{T_{0}}}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}\leq\,\gamma\,\exp{(\eta V(u_{0}))}\,\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}. (3.32)
Proof.

Using Lemma 3.9, we have that

𝔼ρT0×T𝐩T0M2𝔼ρT0ll×T𝐩T0M2+𝔼ρT0hh2.\mathbb{E}\|\rho_{{}_{T_{0}}}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}\leq\mathbb{E}\|\rho_{{}_{T_{0}}}^{l}\|_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}+\mathbb{E}\|\rho_{{}_{T_{0}}}^{h}\|_{\mathcal{H}_{h}}^{2}.

Next, we proceed to estimate term by term. Regarding 𝔼ρT0l×T𝐩T0M2\mathbb{E}\|\rho_{{}_{T_{0}}}^{l}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}, an application of (A.6), (A.11) and Lemma 3.10 yields

𝔼ρT0l×T𝐩T0M2T~02eCT~0exp(ηV(u0))𝔼ξt𝐗T0C(N,ν,𝐩0,T0)T~03exp(ηV(u0)).\mathbb{E}\|\rho_{{}_{T_{0}}}^{l}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}\leq\widetilde{T}_{0}^{2}e^{C\widetilde{T}_{0}}\exp{(\eta V(u_{0}))}\mathbb{E}\|\xi_{t}\|_{\mathbf{X}_{{}_{T_{0}}}}\leq C(N_{*},\nu,\mathbf{p}_{0},T_{0})\,\widetilde{T}_{0}^{3}\,\exp{(\eta V(u_{0}))}.

Set

ρT0h,cpl:=τ0T0Rs,T0hDlF^h(us,𝐩s)ζs𝑑s,\rho_{{}_{T_{0}}}^{h,cpl}:=\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{h}D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\zeta_{s}\,ds,

then we have

ρT0h,cplh(τ0T0Rs,T0hhh4𝑑s)1/4(τ0T0DlF^h(us,𝐩s)lh4𝑑s)1/4(τ0T0ζsl2𝑑s)1/2.\displaystyle\left\|\rho_{{}_{T_{0}}}^{h,cpl}\right\|_{\mathcal{H}_{h}}\leq\left(\int_{\tau_{0}}^{T_{0}}\|R_{s,T_{0}}^{h}\|_{\mathcal{H}_{h}\to\mathcal{H}_{h}}^{4}\,ds\right)^{1/4}\left(\int_{\tau_{0}}^{T_{0}}\|D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\|_{\mathcal{H}_{l}\to\mathcal{H}_{h}}^{4}\,ds\right)^{1/4}\left(\int_{\tau_{0}}^{T_{0}}\|\zeta_{s}\|_{\mathcal{H}_{l}}^{2}\,ds\right)^{1/2}.

Furthermore, employing Lemma 3.10 and equations (A.5), (A.12), we can obtain that

𝔼ρT0h,cplh2\displaystyle\mathbb{E}\left\|\rho_{{}_{T_{0}}}^{h,cpl}\right\|_{\mathcal{H}_{h}}^{2} C(N,ν)eCT~0(14νN2(e4νN2τ0e4νN2T0))T~05/2exp(ηV(u0))\displaystyle\leq C(N_{*},\nu)\,e^{C\widetilde{T}_{0}}\left(\frac{1}{4\nu N_{*}^{2}}(e^{-4\nu N_{*}^{2}\tau_{0}}-e^{-4\nu N_{*}^{2}T_{0}})\right)\,\widetilde{T}_{0}^{5/2}\,\exp{(\eta V(u_{0}))}
:=q1(T~0)exp(ηV(u0)),\displaystyle:=q_{1}(\widetilde{T}_{0})\,\exp{(\eta V(u_{0}))},

where q1(T~0)q_{1}(\widetilde{T}_{0}) vanishes as T~0\widetilde{T}_{0} goes to zero. Set

ρT0h,hf:=Rτ0,T0h(Πh(J0,τ0𝔥))+τ0T0Rs,T0hDlF^h(us,𝐩s)Πl(J0,s𝔥)𝑑s.\rho_{{}_{T_{0}}}^{h,hf}:=R_{\tau_{0},T_{0}}^{h}(\Pi_{h}(J_{0,\tau_{0}}\mathfrak{h}))+\int_{\tau_{0}}^{T_{0}}R_{s,T_{0}}^{h}D_{l}\hat{F}_{h}(u_{s},\mathbf{p}_{{}_{s}})\Pi_{l}(J_{0,s}\mathfrak{h})\,ds.

Then

ρT0h,hfh2\displaystyle\left\|\rho_{{}_{T_{0}}}^{h,hf}\right\|_{\mathcal{H}_{h}}^{2} CRτ0,T0hhh2Πh(J0,τ0𝔥)h2\displaystyle\leq C\|R_{\tau_{0},T_{0}}^{h}\|_{\mathcal{H}_{h}\to\mathcal{H}_{h}}^{2}\|\Pi_{h}(J_{0,\tau_{0}}\mathfrak{h})\|_{\mathcal{H}_{h}}^{2}
+C(τ0T0Rs,T0hhh4𝑑s)1/2(τ0T0DlF^hlh4𝑑s)1/2(τ0T0Πl(J0,s𝔥)l2𝑑s).\displaystyle+C\left(\int_{\tau_{0}}^{T_{0}}\|R_{s,T_{0}}^{h}\|_{\mathcal{H}_{h}\to\mathcal{H}_{h}}^{4}\,ds\right)^{1/2}\left(\int_{\tau_{0}}^{T_{0}}\|D_{l}\hat{F}_{h}\|_{\mathcal{H}_{l}\to\mathcal{H}_{h}}^{4}\,ds\right)^{1/2}\left(\int_{\tau_{0}}^{T_{0}}\|\Pi_{l}(J_{0,s}\mathfrak{h})\|_{\mathcal{H}_{l}}^{2}\,ds\right).

Furthermore, using the high-frequency dissipation property (A.10) together with estimates (A.5),(A.12) and (A.9), we have

𝔼ρT0h,hfh2\displaystyle\mathbb{E}\left\|\rho_{{}_{T_{0}}}^{h,hf}\right\|_{\mathcal{H}_{h}}^{2} CeνN2T~0eCτ0N2eηV(u0)𝔥2\displaystyle\leq Ce^{-\nu N_{*}^{2}\widetilde{T}_{0}}e^{C\tau_{0}}N_{*}^{-2}\,e^{\eta V(u_{0})}\|\mathfrak{h}\|^{2}
+CeCT~0(14νN2(e4νN2τ0e4νN2T0))T~03/2eηV(u0)𝔥2\displaystyle\quad+C\,e^{C\widetilde{T}_{0}}\left(\frac{1}{4\nu N_{*}^{2}}(e^{-4\nu N_{*}^{2}\tau_{0}}-e^{-4\nu N_{*}^{2}T_{0}})\right)\,\widetilde{T}_{0}^{3/2}e^{\eta V(u_{0})}\|\mathfrak{h}\|^{2}
C(eCτ0N2+q2(T~0))eηV(u0)𝔥2,\displaystyle\leq C\,(e^{C\tau_{0}}N_{*}^{-2}+q_{2}(\widetilde{T}_{0}))e^{\eta V(u_{0})}\|\mathfrak{h}\|^{2},

where q2(T~0)q_{2}(\widetilde{T}_{0}) vanishes as T~0\widetilde{T}_{0} goes to zero.

Note that once T~0\widetilde{T}_{0} and τ0\tau_{0} are chosen, T0T_{0} becomes fixed. Consequently, the constant can be written as C(N,ν,η,𝐩0)C(N_{*},\nu,\eta,\mathbf{p}_{0}). Combining the estimates from the three parts above, then for any γ(0,1)\gamma\in(0,1), we can select suitable T~0\widetilde{T}_{0} and τ0\tau_{0} such that the following holds:

𝔼ρT0×T𝐩T0M2γexp(ηV(u0))𝔥×T𝐩T0M2.\mathbb{E}\|\rho_{{}_{T_{0}}}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}\leq\,\gamma\,\exp{(\eta V(u_{0}))}\,\|\mathfrak{h}\|_{\mathcal{H}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}^{2}.

4 Nondegeneracy of the partial Malliavin matrix

The basic idea of our proof in this section follows the classical framework of [2, 11, 28]. Since we are considering the part Malliavin matrix for a finite-dimensional system, the argument is primarily adapted from [2]. However, through a few technical modifications, we successfully avoid introducing the auxiliary process ztz_{t} and the truncation process wtρ:=(ut,xt,vt,zt)ρw_{t}^{\rho}:=(u_{t},x_{t},v_{t},z_{t})^{\rho}. This not only streamlines the derivation but also makes the overall proof more direct and concise.

Unlike the case of highly degenerate white noise, since StlS_{t}^{l} is adapted, we can use Itô’s formula to write it explicitly. This allows us to relate the time derivative of certain quantities to the corresponding Lie brackets, thereby generating a new vector field on the tangent bundle.

Recall that 𝒵0:={k2, 0<|k|N}\mathcal{Z}_{0}:=\{k\in\mathbb{Z}^{2},\,0<|k|\leq N_{*}\}, and

Q^Wt˙:=k𝒵0qkek(x)γkdWtk.\hat{Q}\dot{W_{t}}:=\sum_{k\in\mathcal{Z}_{0}}q_{k}e_{k}(x)\gamma_{k}\,dW^{k}_{t}.

Then, the operator Q^\hat{Q} on L2L^{2} gives rise to a family of vector fields {Qk}k𝒵0\{Q^{k}\}_{k\in\mathcal{Z}_{0}} on ×M\mathcal{H}\times M defined by

Qk={qkekγkif k𝒵0,0otherwise .Q^{k}=\begin{cases}q_{k}e_{k}\gamma_{k}&\text{if }k\in\mathcal{Z}_{0},\\ 0&\text{otherwise }.\end{cases}

We now present a lemma that is crucial for proving Proposition 3.6, which establishes a probabilistic spectral bound for the partial Malliavin matrix 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l}.

Lemma 4.1 (Probabilistic spectral bound).

For all p1,η(0,1),T~0<1,ε>0p\geq 1,\,\eta\in(0,1),\,\widetilde{T}_{0}<1,\,\varepsilon>0 , any 𝐩0M\mathbf{p}_{0}\in M, and (u,𝐩)×M(u,\mathbf{p})\in\mathcal{H}\times M, there exists a constant C(N,η,𝐩0,p)>0C(N_{*},\eta,\mathbf{p}_{0},p)>0, locally uniformly in 𝐩0\mathbf{p}_{0}, such that

sup𝔥ll×T𝐩M,𝔥ll=1(𝒩T0l𝔥l,𝔥ll<ε)C(N,η,𝐩0,p)εpeηV(u0),\displaystyle\sup_{\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M,\,\|\mathfrak{h}^{l}\|_{l}=1}\mathbb{P}\left(\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}<\varepsilon\right)\leq C(N_{*},\eta,\mathbf{p}_{0},p)\varepsilon^{p}e^{\eta V(u_{0})}, (4.1)

where the constant is independent of ε\varepsilon.

Proof of Proposition 3.6.

Since l×T𝐩M\mathcal{H}_{l}\times T_{\mathbf{p}}M is finite-dimensional with d=dim(l×T𝐩M)d=\dim(\mathcal{H}_{l}\times T_{\mathbf{p}}M), consider a δ\delta-net {𝔥1l,,𝔥Nl}\{\mathfrak{h}_{1}^{l},\dots,\mathfrak{h}_{N}^{l}\} (with NCdδdN\leq C_{d}\delta^{-d}) on the unit sphere. Let λmin(𝒩T0l)\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l}) be the smallest eigenvalue of 𝒩T0l\mathcal{N}_{{}_{T_{0}}}^{l}. Fix ωΩ\omega\in\Omega, whenever λmin(𝒩T0l)ε\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})\leq\varepsilon for some ε>0\varepsilon>0, there exists 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M (with 𝔥l=1\|\mathfrak{h}^{l}\|=1) satisfying

𝒩T0l𝔥l,𝔥ll<ε.\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}<\varepsilon.

Select 𝔥il\mathfrak{h}_{i}^{l} such that 𝔥l𝔥il<δ\|\mathfrak{h}^{l}-\mathfrak{h}_{i}^{l}\|<\delta, it follows that

𝒩T0l𝔥il,𝔥ill\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}_{i}^{l},\mathfrak{h}_{i}^{l}\rangle_{l} =𝒩T0l𝔥l,𝔥ll+𝒩T0l(𝔥il𝔥l),𝔥ill+𝒩T0l𝔥l,𝔥il𝔥ll\displaystyle=\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}+\langle\mathcal{N}_{{}_{T_{0}}}^{l}(\mathfrak{h}_{i}^{l}-\mathfrak{h}^{l}),\mathfrak{h}_{i}^{l}\rangle_{l}+\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}_{i}^{l}-\mathfrak{h}^{l}\rangle_{l}
𝒩T0l𝔥l,𝔥ll+𝒩T0l𝔥il𝔥l+𝒩T0l𝔥l𝔥il𝔥l\displaystyle\leq\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}+\|\mathcal{N}_{{}_{T_{0}}}^{l}\|\cdot\|\mathfrak{h}_{i}^{l}-\mathfrak{h}^{l}\|+\|\mathcal{N}_{{}_{T_{0}}}^{l}\|\cdot\|\mathfrak{h}^{l}\|\cdot\|\mathfrak{h}_{i}^{l}-\mathfrak{h}^{l}\|
ε+2MT~0(ω)δ,\displaystyle\leq\varepsilon+2M_{{\widetilde{T}_{0}}}(\omega)\cdot\delta,

where

MT~0(ω):=T~0supτ0sT0Ssll×T𝐩τ0Ml×T𝐩T0M|Q^l|2τ0T0|SslQ^l|2𝑑s𝒩T0l.\displaystyle M_{{\widetilde{T}_{0}}}(\omega):=\widetilde{T}_{0}\cdot\sup_{\tau_{0}\leq s\leq T_{0}}\|S_{s}^{l}\|_{\mathcal{H}_{l}\times T_{\mathbf{p}_{\tau_{0}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{T_{0}}}}M}|\hat{Q}_{l}|^{2}\geq\int_{\tau_{0}}^{T_{0}}|S_{s}^{l}\hat{Q}_{l}|^{2}\,ds\geq\|\mathcal{N}_{{}_{T_{0}}}^{l}\|.

Fix R>0R>0, set AR:={MT~0(ω)R}A_{R}:=\{M_{{\widetilde{T}_{0}}}(\omega)\leq R\}. Then, we have that

(λmin(𝒩T0l)<ε)=({λmin(𝒩T0l)<ε}AR)+({λmin(𝒩T0l)<ε}ARc).\mathbb{P}\left(\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\right)=\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\cap A_{R}\right)+\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\cap A_{R}^{c}\right).

On ARA_{R}, setting δ=ε4R>0\delta=\frac{\varepsilon}{4R}>0 yields

𝒩T0l𝔥il,𝔥illε+2MT~0(ω)δ<2ε.\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}_{i}^{l},\mathfrak{h}_{i}^{l}\rangle_{l}\leq\varepsilon+2M_{{\widetilde{T}_{0}}}(\omega)\cdot\delta<2\varepsilon.

From this we further obtain

({λmin(𝒩T0l)<ε}AR)\displaystyle\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\cap A_{R}\right) ({mini𝒩T0l𝔥il,𝔥il<2ε}AR)\displaystyle\leq\mathbb{P}\left(\{\min_{i}\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l}_{i},\,\mathfrak{h}^{l}_{i}\rangle<2\varepsilon\}\cap A_{R}\right)
i=1N({𝒩T0l𝔥il,𝔥il<2ε}AR)\displaystyle\leq\sum_{i=1}^{N}\mathbb{P}\left(\{\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l}_{i},\,\mathfrak{h}^{l}_{i}\rangle<2\varepsilon\}\cap A_{R}\right)
CRdεpdeηV(u0).\displaystyle\leq C\,R^{d}\varepsilon^{p-d}e^{\eta V(u_{0})}.

Next, we estimate the truncated remainder term. By Markov’s inequality, for any q1q\geq 1, we have

(ARc)=(MT~0(ω)>R)𝔼[MT~0(ω)q]RqC(T~0R)qeηV(u0).\mathbb{P}(A_{R}^{c})=\mathbb{P}(M_{{\widetilde{T}_{0}}}(\omega)>R)\leq\frac{\mathbb{E}[M_{{\widetilde{T}_{0}}}(\omega)^{q}]}{R^{q}}\leq C\left(\frac{\widetilde{T}_{0}}{R}\right)^{q}e^{\eta V(u_{0})}.

Take R=εαR=\varepsilon^{-\alpha}, with α(>0)\alpha(>0) to be fixed later. It follows that

({λmin(𝒩T0l)<ε})CεpdαdeηV(u0)+T~0qεαqeηV(u0).\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\right)\leq C\varepsilon^{p-d-\alpha d}e^{\eta V(u_{0})}+\widetilde{T}_{0}^{q}\varepsilon^{\alpha q}e^{\eta V(u_{0})}.

Set α=pdd+q\alpha=\frac{p-d}{d+q}. Then we have pdαd=αqp-d-\alpha d=\alpha q and

({λmin(𝒩T0l)<ε})C(T~0q+1)εq(pd)d+peηV(u0).\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\right)\leq C(\widetilde{T}_{0}^{q}+1)\varepsilon^{\frac{q(p-d)}{d+p}}e^{\eta V(u_{0})}.

Since p(>d)p(>d) and qq can both be taken arbitrarily large, we set p=qp=q to simplify the calculations. This yields

({λmin(𝒩T0l)<ε})C(T~0p+1)εp(pd)d+peηV(u0).\mathbb{P}\left(\{\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l})<\varepsilon\}\right)\leq C(\widetilde{T}_{0}^{p}+1)\varepsilon^{\frac{p(p-d)}{d+p}}e^{\eta V(u_{0})}.

Now, set X:=λmin(𝒩T0l)X:=\lambda_{\min}(\mathcal{N}_{{}_{T_{0}}}^{l}), then for any r>0r>0, we compute 𝔼[Xr]\mathbb{E}[X^{-r}]. Note that

𝔼[Xr]\displaystyle\mathbb{E}[X^{-r}] =01(X<a)rar1𝑑a+1(X<a)rar1𝑑a\displaystyle=\int_{0}^{1}\mathbb{P}(X<a)ra^{-r-1}\,da+\int_{1}^{\infty}\mathbb{P}(X<a)ra^{-r-1}\,da
C01(T~0p+1)ap(pd)d+peηV(u0)rar1𝑑a.\displaystyle\leq C\int_{0}^{1}(\widetilde{T}_{0}^{p}+1)a^{\frac{p(p-d)}{d+p}}e^{\eta V(u_{0})}ra^{-r-1}\,da.

The above integral converges if and only if p(pd)d+pr>0\frac{p(p-d)}{d+p}-r>0, which means r<p(pd)d+pr<\frac{p(p-d)}{d+p}. By choosing pp sufficiently large (p>r+d+1p>r+d+1), we have

𝔼[Xr]C(N,ν,𝐩0,η)(T~0p+d+1+1)eηV(u0)\mathbb{E}[X^{-r}]\leq C(N_{*},\nu,\mathbf{p}_{0},\eta)(\widetilde{T}_{0}^{p+d+1}+1)e^{\eta V(u_{0})}

Furthermore, when T~0\widetilde{T}_{0} is sufficiently small, it follows that

𝔼|(𝒩T0l)1|pC(N,ν,𝐩0,η,p)exp(pηV(u0)).\mathbb{E}|(\mathcal{N}_{{}_{T_{0}}}^{l})^{-1}|^{p}\leq C(N_{*},\nu,\mathbf{p}_{{}_{0}},\eta,p)\exp{(p\eta V(u_{0}))}.

We now prove the probabilistic spectral bounds for the partial Malliavin matrix, stated in Lemma 4.1. The proof is divided into two main steps. First, we establish a statement analogous to the generalized Hörmander condition (Lemma 4.3). Then, by differentiating in time, we derive a series of implications (Lemma 4.5); combining these two parts completes the proof. Here, the Lie brackets associated with the velocity field and the vector fields on the manifold, which appear in the generalized Hörmander condition, are naturally captured by the integral expression StlQkS_{t}^{l}Q^{k}. Using Itô’s formula, the integral expression given in [2, Proposition 6.12] is as follows.

Lemma 4.2.

Let EE be a bounded vector field on ×M\mathcal{H}\times M whose range belongs to l×TM\mathcal{H}_{l}\times TM and with two bounded derivatives. Then the following formula holds:

StLE(ut,𝐩t)\displaystyle S_{t}^{L}E(u_{t},\mathbf{p}_{{}_{t}}) =E(u0,𝐩0)+τ0T0Ssl([F^,E]l(us,𝐩s)[A^,E]l(us,𝐩s))𝑑s\displaystyle=E(u_{0},\mathbf{p}_{{}_{0}})+\int_{\tau_{0}}^{T_{0}}S_{s}^{l}\,\bigl([\hat{F},E]_{l}(u_{s},\mathbf{p}_{{}_{s}})-[\hat{A},E]_{l}(u_{s},\mathbf{p}_{{}_{s}})\bigr)\,ds
+12k𝒵0τ0T0SslD2E(us,𝐩s)[Qk,Qk]𝑑s\displaystyle\quad+\frac{1}{2}\sum_{k\in\mathcal{Z}_{0}}\int_{\tau_{0}}^{T_{0}}S_{s}^{l}\,D^{2}E(u_{s},\mathbf{p}_{{}_{s}})[Q^{k},Q^{k}]\,ds
+τ0T0SslDE(us,𝐩s)Q^𝑑Ws,\displaystyle\quad+\int_{\tau_{0}}^{T_{0}}S_{s}^{l}\,DE(u_{s},\mathbf{p}_{{}_{s}})\hat{Q}\,dW_{s}, (4.2)

where [Qk,Qk][Q^{k},Q^{k}] denotes the quadratic covariation of QkQ^{k} and QkQ^{k}, and for any two differentiable vector fields F,EF,E over ×M\mathcal{H}\times M, we denote

[F^,E]l(DEl)(u,𝐩)F^(u,𝐩)(DF^l)(u,𝐩)E(u,𝐩)=(DE)(u,𝐩)F^(u,𝐩)(DlF^l)(u,𝐩)E(u,𝐩)[\hat{F},E]_{l}\equiv(DE_{l})(u,\mathbf{p})\hat{F}(u,\mathbf{p})-(D\hat{F}_{l})(u,\mathbf{p})E(u,\mathbf{p})=(DE)(u,\mathbf{p})\hat{F}(u,\mathbf{p})-(D_{l}\hat{F}_{l})(u,\mathbf{p})E(u,\mathbf{p})

and

[A^,E]l(u,𝐩)DlEl(u,𝐩)A^l(u,𝐩)A^lE(u,𝐩)=DE(u,𝐩)A^(u,𝐩)A^lE(u,𝐩).[\hat{A},E]_{l}(u,\mathbf{p})\equiv D_{l}E_{l}(u,\mathbf{p})\hat{A}_{l}(u,\mathbf{p})-\hat{A}_{l}E(u,\mathbf{p})=DE(u,\mathbf{p})\hat{A}(u,\mathbf{p})-\hat{A}_{l}E(u,\mathbf{p}).

For convenience, we define the following operator Υl\Upsilon_{l} that maps smooth vector fields on ×M\mathcal{H}\times M to smooth vector fields on ×M\mathcal{H}\times M with range in l×TM\mathcal{H}_{l}\times TM, defined by

ΥlE:=[F^,E]l[A^,E]l+12k𝒵0D2E[Qk,Qk].\displaystyle\Upsilon_{l}E:=[\hat{F},E]_{l}-[\hat{A},E]_{l}+\frac{1}{2}\sum_{k\in\mathcal{Z}_{0}}D^{2}E[Q^{k},Q^{k}]. (4.3)

We first present a lower bound result, which is essentially analogous to the generalized Hörmander condition (c.f. [28]).

Lemma 4.3.

For any (utl,𝐩t)l×M(u_{t}^{l},\mathbf{p}_{{}_{t}})\in\mathcal{H}_{l}\times M and any β(0,1)\beta\in(0,1), there exists a constant C(N,ν,β)>0C(N_{*},\nu,\beta)>0 such that the lower-bound

max{|Qk,𝔥ll|,|ΥlQk,𝔥ll|:k𝒵0}C(N,ν,β)(1βull)𝔥ll\max\left\{|\langle Q^{k},\mathfrak{h}^{l}\rangle_{l}|,\;|\langle\Upsilon_{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}|:k\in\mathcal{Z}_{0}\right\}\geq C(N_{*},\nu,\beta)(1-\beta\|u^{l}\|_{\mathcal{H}_{l}})\|\mathfrak{h}^{l}\|_{l}

holds for every 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M.

Before proving the above lemma, we first state a spanning condition on the tangent bundle of the manifold. This condition appears in different forms in the literature [2, 26]; here we present the version given in [2]. To this end, we rewrite equations (3.4)–(3.5) as follows:

ddt(xtvt)=F¯(ut,xt,vt),\frac{d}{dt}\begin{pmatrix}x_{t}\\ v_{t}\end{pmatrix}=\overline{F}(u_{t},x_{t},v_{t}),

where F¯(u,x,v)\overline{F}(u,x,v) is the vector field defined for each (u,x,v)×𝕋2×S1(u,x,v)\in\mathcal{H}\times\mathbb{T}^{2}\times S^{1} by

F¯(u,x,v)=k𝒵0((u)kek(x)γk(u)k(kv)ek(x)(Πvγk))Tx𝕋2×TvS1,\displaystyle\overline{F}(u,x,v)=\sum_{k\in\mathcal{Z}_{0}}\begin{pmatrix}(u)_{k}e_{k}(x)\gamma_{k}\\ (u)_{k}(k\cdot v)e_{-k}(x)(\Pi_{v}\gamma_{k})\end{pmatrix}\in T_{x}\mathbb{T}^{2}\times T_{v}{S}^{1}, (4.4)

where (u)k=1π(2π)d1u,ekγkL2(u)_{k}=\frac{1}{\pi(2\pi)^{d-1}}\langle u,e_{k}\gamma_{k}\rangle_{L^{2}}. Moreover, it directly follows that

[ekγk,F¯](x,v)=(ek(x)γk(kv)ek(x)(Πvγk)).[e_{k}\gamma_{k},\overline{F}](x,v)=\begin{pmatrix}e_{k}(x)\gamma_{k}\\ (k\cdot v)e_{-k}(x)(\Pi_{v}\gamma_{k})\end{pmatrix}.
Lemma 4.4.

[2, Lemma 5.3] Let k1,k2k^{1},k^{2} be linearly independent elements of 02\mathbb{Z}_{0}^{2} and define

K={k1,k2}{k1,k2}02.K=\{k^{1},k^{2}\}\cup\{-k^{1},-k^{2}\}\subseteq\mathbb{Z}_{0}^{2}.

Then at each point (x,v)𝕋2×S1(x,v)\in\mathbb{T}^{2}\times{S}^{1}, we have

span{[ekγk,F¯](x,v):kK}=Tx𝕋2×TvS1.\operatorname{span}\{[e_{k}\gamma_{k},\overline{F}](x,v):k\in K\}=T_{x}\mathbb{T}^{2}\times T_{v}{S}^{1}.
Proof of Lemma 4.3.

Setting E=QkE=Q^{k}, we first have

ΥlQk=qk[F¯,ekγk]qk[B(u,u),ekγk]lqk[A^,ekγk]l.\displaystyle\Upsilon_{l}Q^{k}=q_{k}[\overline{F},e_{k}\gamma_{k}]-q_{k}[B(u,u),e_{k}\gamma_{k}]_{l}-q_{k}[\hat{A},e_{k}\gamma_{k}]_{l}. (4.5)

Noted that A(ek)=λkek,λk=ν|k|2A(e_{k})=\lambda_{k}e_{k},\,\lambda_{k}=\nu|k|^{2} and

[A^,ekγk]l=D(ekγk)A^A^l(ekγk)=γkA^l(ek),[\hat{A},e_{k}\gamma_{k}]_{l}=D(e_{k}\gamma_{k})\hat{A}-\hat{A}_{l}(e_{k}\gamma_{k})=-\gamma_{k}\hat{A}_{l}(e_{k}),

then we have

|[A^,ekγk]l,𝔥ll|=λk|ekγk,𝔥ll|C(N,ν)|ekγk,𝔥ll|.\displaystyle|\langle[\hat{A},e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\rangle_{l}|=\lambda_{k}|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq C(N_{*},\nu)\,|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|. (4.6)

Simultaneously, observe that

[B(u,u),ekγk]l=Πl[B(u,ekγk)+B(ekγk,u)]-[B(u,u),e_{k}\gamma_{k}]_{l}=\Pi_{l}[B(u,e_{k}\gamma_{k})+B(e_{k}\gamma_{k},u)]

Let {ϕ1,,ϕm}\{\phi_{1},\ldots,\phi_{m}\} be an orthonormal basis of l\mathcal{H}_{l}, where each ϕk\phi_{k} is of the form ekγke_{k}\gamma_{k}. Since l\mathcal{H}_{l} is finite-dimensional, the linear operator D(B(u,u)):lD(B(u,u)):\mathcal{H}_{l}\to\mathcal{H} admits a matrix representation. Consequently, for any 𝔥l,ul\mathfrak{h}^{l,u}\in\mathcal{H}_{l}, we have

Πl(D(B(u,u))𝔥l,u)=k=1makΠl(D(B(u,u))ϕk),where ak=𝔥l,u,ϕkl,u.\Pi_{l}(D(B(u,u))\cdot\mathfrak{h}^{l,u})=\sum_{k=1}^{m}a_{k}\,\Pi_{l}(D(B(u,u))\,\phi_{k}),\quad\text{where }\,a_{k}=\langle\mathfrak{h}^{l,u},\phi_{k}\rangle_{l,u}.

Noted that Πl(D(B(u,u))ϕk)l\Pi_{l}(D(B(u,u))\,\phi_{k})\in\mathcal{H}_{l}, then we have that

Πl(D(B(u,u))ϕk)=j=1mCk,j(u)ϕj,where Ck,j(u)=Πl(D(B(u,u))ϕk,ϕjl.\Pi_{l}(D(B(u,u))\,\phi_{k})=\sum_{j=1}^{m}C_{k,j}(u)\phi_{j},\quad\text{where }\,C_{k,j}(u)=\langle\Pi_{l}(D(B(u,u))\,\phi_{k},\,\phi_{j}\rangle_{l}.

Moreover, we obtain that

[B(u,u),ekγk]l=Πl(D(B(u,u))ϕk)=j=1mCk,j(u)ϕj.[B(u,u),e_{k}\gamma_{k}]_{l}=\Pi_{l}(D(B(u,u))\cdot\phi_{k})=\sum_{j=1}^{m}C_{k,j}(u)\phi_{j}.

We now establish a uniform estimate for Ck,j(u)C_{k,j}(u) with respect to (k,j)(k,j).

|Ck,j(u)||Πl(B(ϕk,u)),ϕjl|+|Πl(B(u,ϕk)),ϕjl|C(N)ull.\displaystyle|C_{k,j}(u)|\leq|\langle\Pi_{l}(B(\phi_{k},u)),\,\phi_{j}\rangle_{l}|\,+\,|\langle\Pi_{l}(B(u,\phi_{k})),\,\phi_{j}\rangle_{l}|\leq C(N_{*})\|u^{l}\|_{\mathcal{H}_{l}}.

Therefore,

|[B(u,u),ekγk]l,𝔥ll|C(N)ullk=1m|ekγk,𝔥ll|.\displaystyle|\langle[B(u,u),e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\rangle_{l}|\leq C(N_{*})\,\|u^{l}\|_{\mathcal{H}_{l}}\,\sum_{k=1}^{m}|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|. (4.7)

Putting together (4.5)–(4.7) yields

|[F¯,ekγk]l,𝔥ll|C(N,ν)(|ΥlQk,𝔥ll|+ullk=1m|ekγk,𝔥ll|).\displaystyle|\langle[\overline{F},e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\rangle_{l}|\leq C(N_{*},\nu)\,\Big(|\langle\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|+\|u^{l}\|_{\mathcal{H}_{l}}\,\sum_{k=1}^{m}|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|\Big). (4.8)

By Lemma 4.4 (the manifold spanning condition), for any 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M , we have

max{|ekγk,𝔥ll|,|[F¯,ekγk]l,𝔥ll|:k𝒵0}C𝔥ll>0.\max\bigl\{\bigl|\bigl\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|,\,\bigl|\bigl\langle[\overline{F},e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|:k\in\mathcal{Z}_{0}\bigr\}\geq C\|\mathfrak{h}^{l}\|_{l}>0.

Then

max{|Qk,𝔥ll|,|ΥlQk,𝔥ll|+ullk=1m|ekγk,𝔥ll|:k𝒵0}\displaystyle\max\bigl\{\bigl|\bigl\langle Q^{k},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|,\,\bigl|\bigl\langle\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|+\|u^{l}\|_{\mathcal{H}_{l}}\,\sum_{k=1}^{m}|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|:k\in\mathcal{Z}_{0}\bigr\}
C(N,ν)max{|ekγk,𝔥ll|,|[F¯,ekγk]l,𝔥ll|:k𝒵0}\displaystyle\geq C(N_{*},\nu)\max\bigl\{\bigl|\bigl\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|,\,\bigl|\bigl\langle[\overline{F},e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|:k\in\mathcal{Z}_{0}\bigr\}
C(N,ν)𝔥ll.\displaystyle\geq C(N_{*},\nu)\|\mathfrak{h}^{l}\|_{l}.

If there exists k𝒵0k\in\mathcal{Z}_{0} such that |ekγk,𝔥ll|C𝔥ll|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|\geq C\|\mathfrak{h}^{l}\|_{l}, then we have that

|Qk,𝔥ll|=|qk||ekγk,𝔥ll|C𝔥ll.|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|=|q_{k}|\cdot|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|\geq C\|\mathfrak{h}^{l}\|_{l}.

Furthermore, we obtain

max{|Qk,𝔥ll|,|ΥlQk,𝔥ll|:k𝒵0}C𝔥ll.\max\left\{|\langle Q^{k},\mathfrak{h}^{l}\rangle_{l}|,\;|\langle\Upsilon_{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}|:k\in\mathcal{Z}_{0}\right\}\geq C\|\mathfrak{h}^{l}\|_{l}.

If for every k𝒵0k\in\mathcal{Z}_{0}, |ekγk,𝔥ll|C𝔥ll|\langle e_{k}\gamma_{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq C\|\mathfrak{h}^{l}\|_{l}, then there exists k𝒵0k\in\mathcal{Z}_{0} such that

|[F¯,ekγk]l,𝔥ll|C𝔥ll.\bigl|\bigl\langle[\overline{F},e_{k}\gamma_{k}]_{l},\,\mathfrak{h}^{l}\bigr\rangle_{l}\bigr|\geq C\|\mathfrak{h}^{l}\|_{l}.

Moreover, for any β(0,1)\beta\in(0,1), there exists a constant C(N,ν,β)>0C(N_{*},\nu,\beta)>0 such that

|ΥlQk,𝔥ll|C(N,ν,β)(1βull)𝔥ll.|\langle\Upsilon_{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}|\geq C(N_{*},\nu,\beta)(1-\beta\|u^{l}\|_{\mathcal{H}_{l}})\|\mathfrak{h}^{l}\|_{l}.

The proof of the lemma is thereby concluded. ∎

In what follows, we proceed to present a ‘upper bound’ for |Qk,𝔥ll|,|ΥlQk,𝔥ll||\langle Q^{k},\mathfrak{h}^{l}\rangle_{l}|,\,|\langle\Upsilon_{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}|. This is achieved through the operation of differentiation in time.

Lemma 4.5.

Fix τ0,T~0>0\tau_{0},\widetilde{T}_{0}>0. There is a positive constant (T~0)>0\mathcal{E}(\widetilde{T}_{0})>0 such that the following holds. Fix any η>0\eta>0, then for every ε(0,(T~0))\varepsilon\in(0,\mathcal{E}(\widetilde{T}_{0})) there are a set Ωε\Omega_{\varepsilon}^{*} and a constant C(η,N,T~0,𝐩0)>0C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})>0, locally bounded in 𝐩0M\mathbf{p}_{0}\in M, such that

((Ωε)c)C(η,N,T~0,𝐩0)εeηV(u0)\mathbb{P}((\Omega_{\varepsilon}^{*})^{c})\leq C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})\varepsilon e^{\eta V{(u_{0})}}

and on Ωε\Omega_{\varepsilon}^{*} one has

𝒩T0l𝔥l,𝔥llε𝔥l|l2{supk𝒵0|Qk,𝔥ll|ε1/8𝔥l|l,supk𝒵0supt[τ0,T0]|StlΥlQk,𝔥ll|ε1/80𝔥ll.\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\|\mathfrak{h}^{l}|_{l}^{2}\Longrightarrow\begin{cases}\sup\limits_{k\in\mathcal{Z}_{0}}|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/8}\|\mathfrak{h}^{l}|_{l},\\ \sup\limits_{k\in\mathcal{Z}_{0}}\sup\limits_{t\in[\tau_{0},T_{0}]}|\langle S_{t}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/{80}}\|\mathfrak{h}^{l}\|_{l}.\end{cases} (4.9)

which is valid for any 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M.

Before proceeding to the proof of Lemma 4.5, we first state a very useful lemma. It was developed from [12, Lemma 6.14], and its final form is given in [28, Lemma 6.2].

Lemma 4.6.

[28, Lemma 6.2] Fix τ0,T~0>0\tau_{0},\widetilde{T}_{0}>0, α(0,1]\alpha\in(0,1] and an index set \mathcal{I}. Consider a collection of random functions g𝔥lg_{{}_{\mathfrak{h}^{l}}} taking values in C1,α([τ0,T0])C^{1,\alpha}([\tau_{0},{T}_{0}]) and indexed by 𝔥l\mathfrak{h}^{l}\in\mathcal{I}. Define, for each ε>0\varepsilon>0,

Λε,α:=𝔥lΛε,α𝔥l,\Lambda_{\varepsilon,\alpha}:=\bigcup_{\mathfrak{h}^{l}\in\mathcal{I}}\Lambda_{\varepsilon,\alpha}^{\mathfrak{h}^{l}},

where

Λε,α𝔥l:={supt[τ0,T0]|g𝔥l(t)|εandsupt[τ0,T0]|g𝔥l(t)|>εα2(1+α)}.\displaystyle\Lambda_{\varepsilon,\alpha}^{\mathfrak{h}^{l}}:=\left\{\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}(t)|\leq\varepsilon\quad{\emph{\text{and}}}\quad\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)|>\varepsilon^{\frac{\alpha}{2(1+\alpha)}}\right\}. (4.10)

Then, there exists ε0=ε0(α,T~0)(:=(T~0/4)2(1+α)/(2+α))>0\varepsilon_{0}=\varepsilon_{0}(\alpha,\widetilde{T}_{0})\big(:=({\widetilde{T}_{0}}/4)^{2(1+\alpha)/(2+\alpha)}\big)>0 such that for every ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}),

(Λε,α)Cε𝔼[sup𝔥lIg𝔥lC1,α([τ0,T0]) 2/α].\displaystyle\mathbb{P}\bigl(\Lambda_{\varepsilon,\alpha}\bigr)\leq C\,\varepsilon\;\mathbb{E}\left[\sup\limits_{\mathfrak{h}^{l}\in I}\,\|g_{{}_{\mathfrak{h}^{l}}}\|_{C^{1,\alpha}([\tau_{0},{T}_{0}])}^{\,2/\alpha}\right]. (4.11)
Remark 4.7.

Observe that

Λε,αc=𝔥l{supt[τ0,T0]|g𝔥l(t)|>ε or supt[τ0,T0]|g𝔥l(t)|εα2(1+α)}.\Lambda_{\varepsilon,\alpha}^{c}=\bigcap_{\mathfrak{h}^{l}\in\mathcal{I}}\left\{\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}(t)|>\varepsilon\,{\emph{\text{ or }}}\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)|\leq\varepsilon^{\frac{\alpha}{2(1+\alpha)}}\right\}.

Thus, on Λε,αc\Lambda_{\varepsilon,\alpha}^{c},

supt[τ0,T0]|g𝔥l(t)|<εsupt[τ0,T0]|g𝔥l(t)|εα2(1+α)\displaystyle\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}(t)|<\varepsilon\implies\sup_{t\in[\tau_{0},{T}_{0}]}|g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)|\leq\varepsilon^{\frac{\alpha}{2(1+\alpha)}} (4.12)

for every 𝔥l\mathfrak{h}^{l}\in\mathcal{I}.

Proof of Lemma 4.5.

Our proof proceeds in two steps, consisting of two operations of time differentiation.
Step 1. For any 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M with 𝔥l=1\|\mathfrak{h}^{l}\|=1, and any k𝒵0)k\in\mathcal{Z}_{0}), we define

g𝔥l(t):=τ0T0SslQk,𝔥ll2𝑑sk𝒵0τ0T0SslQk,𝔥ll2𝑑s=𝒩T0l𝔥l,𝔥ll.g_{{}_{\mathfrak{h}^{l}}}(t):=\int_{\tau_{0}}^{T_{0}}\langle S_{s}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}^{2}ds\leq\sum_{k\in\mathcal{Z}_{0}}\int_{\tau_{0}}^{T_{0}}\langle S_{s}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}^{2}ds=\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\,\mathfrak{h}^{l}\rangle_{l}.

Note that

g𝔥l(t)=StlQk,𝔥ll2,g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)=\langle S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}^{2},
g𝔥l′′(t)=2StlQk,𝔥lltStlQk,𝔥ll=2StlQk,𝔥llL~tlStlQk,𝔥ll.g_{{}_{\mathfrak{h}^{l}}}^{\prime\prime}(t)=2\langle S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}\,\langle\partial_{t}S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}=-2\langle S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}\,\langle\widetilde{L}_{t}^{l}S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}.

Let Ω~ε:=Λ~ε,1c\widetilde{\Omega}_{\varepsilon}:=\widetilde{\Lambda}_{\varepsilon,1}^{c}, where Λ~ε,α\widetilde{\Lambda}_{\varepsilon,\alpha} is as in (4.10) with :={𝔥ll×T𝐩M:𝔥ll=1}\mathcal{I}:=\{\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M:\|\mathfrak{h}^{l}\|_{l}=1\}, i.e.,

Λ~ε,1=𝔥l{supt[τ0,T0]|g𝔥l(t)|εandsupt[τ0,T0]|g𝔥l(t)|>ε14}.\widetilde{\Lambda}_{\varepsilon,1}=\bigcup_{\mathfrak{h}^{l}\in\mathcal{I}}\left\{\sup_{t\in[\tau_{0},T_{0}]}|g_{{}_{\mathfrak{h}^{l}}}(t)|\leq\varepsilon\quad\text{and}\sup_{t\in[\tau_{0},T_{0}]}|g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)|>\varepsilon^{\frac{1}{4}}\right\}.

Then by Lemma 4.6 with α=1\alpha=1, one has

(Ω~εc)Cϵ𝔼(supt[τ0,T0]𝔥ll=1|2StlQk,𝔥llL~tlStlQk,𝔥ll|2).\mathbb{P}(\widetilde{\Omega}_{\varepsilon}^{c})\leq C\epsilon\mathbb{E}\bigg(\sup_{\begin{subarray}{c}t\in[\tau_{0},T_{0}]\\ \|\mathfrak{h}^{l}\|_{l}=1\end{subarray}}\big|2\langle S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}\,\langle\widetilde{L}_{t}^{l}S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}\big|^{2}\bigg).

Note that

|L~tlStlQk,𝔥ll|L~tlQkStl12(CL~tlHln+2×TMl×TM2+Stll×TMl×TM2).\left|\langle\widetilde{L}_{t}^{l}S_{t}^{l}Q^{k},\mathfrak{h}^{l}\rangle_{l}\right|\leqslant\|\widetilde{L}_{t}^{l}Q^{k}\|\|S_{t}^{l}\|\leqslant\frac{1}{2}\left(C\|\widetilde{L}_{t}^{l}\|^{2}_{{H}^{n+2}_{l}\times TM\to\mathcal{H}_{l}\times TM}+\|S_{t}^{l}\|^{2}_{\mathcal{H}_{l}\times TM\to\mathcal{H}_{l}\times TM}\right).

Then by (A.7)-(A.8), we obtain

(Ω~εc)Cexp(ηV(u0))ε\mathbb{P}(\widetilde{\Omega}_{\varepsilon}^{c})\leq C\exp(\eta V(u_{0}))\varepsilon

for any ε<ε0(T~0)\varepsilon<\varepsilon_{0}(\widetilde{T}_{0}), where C=C(η,T~0,𝐩0)C=C(\eta,\widetilde{T}_{0},\mathbf{p}_{0}) that is locally bounded in 𝐩0M\mathbf{p}_{0}\in M. Finally, on Ω~ε\widetilde{\Omega}_{\varepsilon} we have, cf. (4.11), that

𝒩T0l𝔥l,𝔥llε𝔥ll2supt[τ0,T0]|StlQk,𝔥ll|ε1/8𝔥ll,\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\,\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\|\mathfrak{h}^{l}\|_{l}^{2}\Longrightarrow\sup_{t\in[\tau_{0},T_{0}]}|\langle S_{t}^{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/8}\|\mathfrak{h}^{l}\|_{l}, (4.13)

for each k𝒵0k\in\mathcal{Z}_{0} and any 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M.
Step 2. Using the Lemma 4.2, for any k𝒵0k\in\mathcal{Z}_{0}, we obtain that

StlQk=Qk+τ0T0StlΥlQk𝑑s.S_{t}^{l}Q^{k}=Q^{k}+\int_{\tau_{0}}^{T_{0}}S_{t}^{l}\Upsilon_{l}Q^{k}\,ds.

Then by (4.13), on Ω~ε\widetilde{\Omega}_{\varepsilon} we have that

𝒩T0l𝔥l,𝔥llε𝔥ll2{supk𝒵0|Qk,𝔥ll|ε1/8𝔥l|l,supk𝒵0supt[τ0,T0]τ0t|SslΥlQk,𝔥ll|𝑑sε1/8𝔥l|l,\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\,\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\|\mathfrak{h}^{l}\|_{l}^{2}\Longrightarrow\begin{cases}\sup\limits_{k\in\mathcal{Z}_{0}}|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/8}\|\mathfrak{h}^{l}|_{l},\\ \sup\limits_{k\in\mathcal{Z}_{0}}\sup\limits_{t\in[\tau_{0},T_{0}]}\int_{\tau_{0}}^{t}|\langle S_{s}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\,ds\leq\varepsilon^{1/{8}}\|\mathfrak{h}^{l}|_{l},\end{cases}

for each 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M.

For fixed 𝔥ll×T𝐩M\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M, let

g𝔥l(t):=τ0t|SslΥlQk,𝔥ll|𝑑s.g_{{}_{\mathfrak{h}^{l}}}(t):=\int_{\tau_{0}}^{t}|\langle S_{s}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\,ds.

Then

g𝔥l(t)=|StlΥlQk,𝔥ll|,g𝔥l′′(t)=|t(StlΥlQk),𝔥ll|.g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)=|\langle S_{t}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|,\;\;g_{{}_{\mathfrak{h}^{l}}}^{\prime\prime}(t)=|\partial_{t}\big(\langle S_{t}^{l}\Upsilon_{l}Q^{k}\big),\,\mathfrak{h}^{l}\rangle_{l}|.

Let Ω¯ε=Λ¯ε,1/4c\overline{\Omega}_{\varepsilon}=\overline{\Lambda}_{\varepsilon,1/4}^{c} (with α=1/4\alpha=1/4), where Λ¯ϵ,α\overline{\Lambda}_{\epsilon,\alpha} is as in (4.10) over with :={𝔥ll×T𝐩M:𝔥ll=1}\mathcal{I}:=\{\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M:\|\mathfrak{h}^{l}\|_{l}=1\}. Then, on Ω¯ε\overline{\Omega}_{\varepsilon} one has, in view of (4.12),

supt[τ0,T0]τ0t|SslΥlQk,𝔥ll|𝑑sε𝔥llsupt[τ0,T0]|StlΥlQk,𝔥ll|ϵ1/10𝔥ll.\sup_{t\in[\tau_{0},T_{0}]}\int_{\tau_{0}}^{t}|\langle S_{s}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\,ds\leq\varepsilon\|\mathfrak{h}^{l}\|_{l}\Longrightarrow\sup_{t\in[\tau_{0},T_{0}]}|\langle S_{t}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\epsilon^{1/10}\|\mathfrak{h}^{l}\|_{l}. (4.14)

By (4.11), we have that

((Ω¯ε)c)\displaystyle\mathbb{P}((\overline{\Omega}_{\varepsilon})^{c}) Cϵ𝔼(sup𝔥lg𝔥l(t)C1/4([τ0,T0])8).\displaystyle\leq C\epsilon\mathbb{E}\left(\sup_{\mathfrak{h}^{l}\in\mathcal{I}}\|g_{{}_{\mathfrak{h}^{l}}}^{\prime}(t)\|_{C^{1/4}([\tau_{0},T_{0}])}^{8}\right). (4.15)

Set Ytk:=StlΥlQk,𝔥llY_{t}^{k}:=\langle S_{t}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}, it now suffices to estimate 𝔼(sup𝔥lYtkC1/4([τ0,T0])8)\mathbb{E}\big(\sup_{\mathfrak{h}^{l}\in\mathcal{I}}\|Y_{t}^{k}\|_{C^{1/4}([\tau_{0},T_{0}])}^{8}\big). Noted that

Ytk=Y0k+τ0tSslΥl(ΥlQk),𝔥ll𝑑s+τ0tSslD(ΥlQk)Q^dWs,𝔥ll.\displaystyle Y_{t}^{k}=Y_{0}^{k}+\int_{\tau_{0}}^{t}\langle S_{s}^{l}\Upsilon_{l}(\Upsilon_{l}Q^{k}),\,\mathfrak{h}^{l}\rangle_{l}\,ds+\int_{\tau_{0}}^{t}\langle S_{s}^{l}D(\Upsilon_{l}Q^{k})\hat{Q}\,dW_{s},\,\mathfrak{h}^{l}\rangle_{l}. (4.16)

Setting αsk:=SslΥl(ΥlQk),𝔥ll\alpha^{k}_{s}:=\langle S_{s}^{l}\Upsilon_{l}(\Upsilon_{l}Q^{k}),\,\mathfrak{h}^{l}\rangle_{l}, and γsk:=SslD(ΥlQk)Q^,𝔥ll\gamma_{s}^{k}:=\langle S_{s}^{l}D(\Upsilon_{l}Q^{k})\hat{Q},\,\mathfrak{h}^{l}\rangle_{l} as a linear operator from m{\mathbb{R}^{m}} to \mathbb{R}, we then rewrite (4.16) as

Ytk=Y0k+τ0tαsk𝑑s+τ0tγsk𝑑Ws.Y_{t}^{k}=Y_{0}^{k}+\int_{\tau_{0}}^{t}\alpha^{k}_{s}\,ds+\int_{\tau_{0}}^{t}\gamma_{s}^{k}\,dW_{s}.

We first estimate αSk\alpha_{S}^{k}. From (4.3) we immediately have

Υl(ΥlQk)=qk[F¯,ΥlQk]qk[B(u,u),ΥlQk]lqk[A^,ΥlQk]l=:qk[F^,ΥlQk]lqk[A^,ΥlQk]l,\Upsilon_{l}(\Upsilon_{l}Q^{k})=q_{k}[\overline{F},\,\Upsilon_{l}Q^{k}]-q_{k}[B(u,u),\,\Upsilon_{l}Q^{k}]_{l}-q_{k}[\hat{A},\,\Upsilon_{l}Q^{k}]_{l}=:\,q_{k}[\hat{F},\,\Upsilon_{l}Q^{k}]_{l}-q_{k}[\hat{A},\,\Upsilon_{l}Q^{k}]_{l},

where

F^=(B(u,u)u(x)ΠvDu(x)v),ΥlQk=(Πl(B(u,ekγk)+B(ekγk,u))A^lekγkekγk(kv)ek(x)(Πvγk)).\hat{F}=\begin{pmatrix}-B(u,u)\\ u(x)\\ \Pi_{v}Du(x)v\end{pmatrix},\quad\Upsilon_{l}Q^{k}=\begin{pmatrix}-\Pi_{l}\big(B(u,e_{k}\gamma_{k})+B(e_{k}\gamma_{k},u)\big)-\hat{A}_{l}e_{k}\gamma_{k}\\ e_{k}\gamma_{k}\\ (k\cdot v)e{-k}(x)(\Pi_{v}\gamma_{k})\end{pmatrix}.

A direct calculation gives

Πl(D(ΥlQk)F^)=(Πl(B(B(u))(ekγk))Πl(x(B(u)(ekγk)))uA^lxekuγkxek(x)uγk(kv)ek(x)u(Πvγk)+(kΠvxu(x)v)ek(x)(Πvγk)),\Pi_{l}\big(D(\Upsilon_{l}Q^{k})\hat{F}\big)=\begin{pmatrix}\Pi_{l}\big(\nabla B(B(u))(e_{k}\gamma_{k})\big)-\Pi_{l}\Big(\nabla_{x}\big(\nabla B(u)(e_{k}\gamma_{k})\big)\Big)u-\hat{A}_{l}\nabla_{x}e_{k}u\gamma_{k}\\ \nabla_{x}e_{k}(x)u\gamma_{k}\\ (k\cdot v)\nabla e_{-k}(x)u(\Pi_{v}\gamma_{k})+(k\Pi_{v}\nabla_{x}u(x)v)e_{-k}(x)(\Pi_{v}\gamma_{k})\end{pmatrix},

and

Πl(DF^(ΥlQk))=(Πl(B(u)(V(ekγk)))xB(u)ekγku(x)ekγkΠl((Πvxu(x)v)(V(ekγk)))+Πvx2u(x)ekγkv+Πvxu(x)(kv)ek(x)(Πvγk)),\Pi_{l}\big(D\hat{F}(\Upsilon_{l}Q^{k})\big)=\begin{pmatrix}\Pi_{l}\left(\nabla B(u)\big(\nabla V(e_{k}\gamma_{k})\big)\right)-\nabla_{x}B(u)e_{k}\gamma_{k}\\ \nabla u(x)e_{k}\gamma_{k}\\ -\Pi_{l}\left(\nabla(\Pi_{v}\nabla_{x}u(x)v)\cdot\big(\nabla V(e_{k}\gamma_{k})\big)\right)+\Pi_{v}\nabla_{x}^{2}u(x)\cdot e_{k}\gamma_{k}v+\Pi_{v}\nabla_{x}u(x)(k\cdot v)e_{-k}(x)(\Pi_{v}\gamma_{k})\end{pmatrix},

where B(u):=B(u,u)B(u):=B(u,u), B(u)ϕ:=B(u,ϕ)+B(ϕ,u)\nabla B(u)\phi:=B(u,\phi)+B(\phi,u) and V(ekγk):=B(u)(ekγk)A^lekγk\nabla V(e_{k}\gamma_{k}):=\nabla B(u)(e_{k}\gamma_{k})-\hat{A}_{l}e_{k}\gamma_{k}.

Noted that

[F^,ΥlQk]l=Πl(D(ΥlQk)F^DF^(ΥlQk)).[\hat{F},\,\Upsilon_{l}Q^{k}]_{l}=\Pi_{l}\big(D(\Upsilon_{l}Q^{k})\hat{F}-D\hat{F}(\Upsilon_{l}Q^{k})\big).

Then, using the fact that l×TM\mathcal{H}_{l}\times TM is finite-dimensional, we obtain

𝔼supτ0s<tT0|Υl(ΥlQk)|C1𝔼supτ0tT0|ulHln+2C2(N)𝔼supτ0tT0|ullCeηV(u0),\displaystyle\mathbb{E}\sup_{\tau_{0}\leq s<t\leq T_{0}}|\Upsilon_{l}(\Upsilon_{l}Q^{k})|\leq C_{1}\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}|\|u^{l}\|_{H^{n+2}_{l}}\leq C_{2}(N_{*})\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}|\|u^{l}\|_{\mathcal{H}_{l}}\leq Ce^{\eta V(u_{0})},

where C=C(η,N,T~0,𝐩0)C=C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0}) that is locally bounded in 𝐩0M\mathbf{p}_{0}\in M. Moreover, we have that

𝔼(αskL([τ0,T0];))C(η,N,T~0,𝐩0)eηV(u0)<.\displaystyle\mathbb{E}\big(\|\alpha_{s}^{k}\|_{L^{\infty}([\tau_{0},T_{0}];\mathbb{R})}\big)\leq C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})e^{\eta V(u_{0})}<\infty. (4.17)

We now estimate γsk\gamma_{s}^{k}. Note that for any fmf\in{\mathbb{R}^{m}}, γskf=SslD(ΥlQk)Q^f,𝔥ll\gamma_{s}^{k}f=\langle S_{s}^{l}D(\Upsilon_{l}Q^{k})\hat{Q}f,\,\mathfrak{h}^{l}\rangle_{l}, its Hilbert-Schmidt norm is given by

γsk2(m,)=(m|SslD(ΥlQk)Q^gm,𝔥ll|2)1/2,\|\gamma_{s}^{k}\|_{\mathcal{L}^{2}({\mathbb{R}^{m}},\mathbb{R})}=\bigg(\sum_{m}\big|\langle S_{s}^{l}D(\Upsilon_{l}Q^{k})\hat{Q}g_{m},\,\mathfrak{h}^{l}\rangle_{l}\big|^{2}\bigg)^{1/2},

where {gm}\{g_{m}\} is an orthonormal basis of m{\mathbb{R}^{m}}. Furthermore, we have that

𝔼supτ0s<tT0γsk2(m,)\displaystyle\mathbb{E}\sup_{\tau_{0}\leq s<t\leq T_{0}}\|\gamma_{s}^{k}\|_{\mathcal{L}^{2}({\mathbb{R}^{m}},\mathbb{R})} 𝔼supτ0s<tT0Ssll×T𝐩τ0Ml×T𝐩sM\displaystyle\leq\mathbb{E}\sup_{\tau_{0}\leq s<t\leq T_{0}}\|S_{s}^{l}\|_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{\tau_{0}}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{s}}}M}
𝔼supτ0s<tT0D(ΥlQk)×Ml×T𝐩MQ^2(m,×M)\displaystyle\quad\cdot\mathbb{E}\sup_{\tau_{0}\leq s<t\leq T_{0}}\|D(\Upsilon_{l}Q^{k})\|_{\mathcal{H}\times M\to\mathcal{H}_{l}\times T_{\mathbf{p}}M}\cdot\|\hat{Q}\|_{\mathcal{L}^{2}({\mathbb{R}^{m}},\mathcal{H}\times M)}
CeηV(u0),\displaystyle\leq Ce^{\eta V(u_{0})},

where C=C(η,N,T~0,𝐩0)C=C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0}) that is locally bounded in 𝐩0M\mathbf{p}_{0}\in M.

Finally, we estimate the moments of the Hölder semi-norm of YtkY_{t}^{k}, our target quantity YtkC1/4([τ0,T0])\|Y_{t}^{k}\|_{C^{1/4}([\tau_{0},T_{0}])}. For any s,t[τ0,T0]s,t\in[\tau_{0},T_{0}], we have

|YtkYsk||stαrk𝑑r|+|stγrk𝑑Wr|.|Y_{t}^{k}-Y_{s}^{k}|\leq|\int_{s}^{t}\alpha_{r}^{k}\,dr|+|\int_{s}^{t}\gamma_{r}^{k}\,dW_{r}|.

By the Burkholder–Davis–Gundy inequality, for any p2p\geq 2, we have

𝔼|stγrk𝑑Wr|pC(p)𝔼(stγrk2(m,)2𝑑r)p/2C(p,η,N,T~0,𝐩0)|ts|p/2eηV(u0).\mathbb{E}|\int_{s}^{t}\gamma_{r}^{k}\,dW_{r}|^{p}\leq C(p)\mathbb{E}\Big(\int_{s}^{t}\|\gamma_{r}^{k}\|^{2}_{\mathcal{L}^{2}({\mathbb{R}^{m}},\mathbb{R})}\,dr\Big)^{p/2}\leq C(p,\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})|t-s|^{p/2}e^{\eta V(u_{0})}.

Moreover, we obtain that

𝔼|YtkYsk|pC(p,η,N,T~0,𝐩0)|ts|p/2eηV(u0)C|ts|p/2.\mathbb{E}|Y_{t}^{k}-Y_{s}^{k}|^{p}\leq C(p,\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})|t-s|^{p/2}e^{\eta V(u_{0})}\leq C|t-s|^{p/2}.

Then, by the Kolmogorov continuity theorem, we know that

𝔼(sup𝔥lYtkC1/4([τ0,T0])8)CeηV(u0),\mathbb{E}\Big(\sup_{\mathfrak{h}^{l}\in\mathcal{I}}\|Y^{k}_{t}\|_{C^{1/4}([\tau_{0},T_{0}])}^{8}\Big)\leq Ce^{\eta V(u_{0})},

where C=C(η,N,T~0,𝐩0)C=C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0}) that is locally bounded in 𝐩0M\mathbf{p}_{0}\in M. Combining with (4.15), we have

((Ω¯ε)c)\displaystyle\mathbb{P}((\overline{\Omega}_{\varepsilon})^{c}) C(η,N,T~0,𝐩0)ϵeηV(u0).\displaystyle\leq C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})\epsilon e^{\eta V(u_{0})}.

Combining Step 1, setting (T~0):=min{(T~04)10/9,(T~04)4/3}\mathcal{E}(\widetilde{T}_{0}):=\min\big\{(\frac{\widetilde{T}_{0}}{4})^{10/9},(\frac{\widetilde{T}_{0}}{4})^{4/3}\big\}. Then for every ε(0,(T~0))\varepsilon\in(0,\mathcal{E}(\widetilde{T}_{0})), setting Ωε:=Ω¯εΩ~ε\Omega^{*}_{\varepsilon}:=\overline{\Omega}_{\varepsilon}\bigcap\widetilde{\Omega}_{\varepsilon}, we have on Ωε\Omega^{*}_{\varepsilon} that

𝒩T0l𝔥l,𝔥llε𝔥l|l2{supk𝒵0|Qk,𝔥ll|ε1/8𝔥l|l,supk𝒵0supt[τ0,T0]|StlΥlQk,𝔥ll|ε1/80𝔥ll,\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\|\mathfrak{h}^{l}|_{l}^{2}\Longrightarrow\begin{cases}\sup\limits_{k\in\mathcal{Z}_{0}}|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/8}\|\mathfrak{h}^{l}|_{l},\\ \sup\limits_{k\in\mathcal{Z}_{0}}\sup\limits_{t\in[\tau_{0},T_{0}]}|\langle S_{t}^{l}\Upsilon_{l}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|\leq\varepsilon^{1/{80}}\|\mathfrak{h}^{l}\|_{l},\end{cases}

and there exist a constant C(η,N,T~0,𝐩0)>0C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})>0 (locally bounded in 𝐩0M\mathbf{p}_{0}\in M) such that

((Ωε)c)C(η,N,T~0,𝐩0)εeηV(u0).\mathbb{P}((\Omega_{\varepsilon}^{*})^{c})\leq C(\eta,N_{*},\widetilde{T}_{0},\mathbf{p}_{0})\varepsilon e^{\eta V({u_{0}})}.

The proof of Lemma 4.1 now follows straightforwardly.

Proof of Lemma 4.1.

Taking t=0t=0 in Lemma 4.5, then for any ε(0,0)\varepsilon\in(0,\mathcal{E}_{0}), on the set Ωε\Omega_{\varepsilon}^{*} we have

𝒩T0l𝔥l,𝔥llε𝔥l|l2max{|Qk,𝔥ll|,|Υl0Qk,𝔥ll|:k𝒵0}ε1/8𝔥ll.\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\|\mathfrak{h}^{l}|_{l}^{2}\Longrightarrow\max\Big\{|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|,\,|\langle\Upsilon_{l}^{0}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|:\,k\in\mathcal{Z}_{0}\Big\}\leq\varepsilon^{1/{8}}\|\mathfrak{h}^{l}\|_{l}.

Furthermore, combining with Lemma 4.3, we obtain on Ωε\Omega_{\varepsilon}^{*} that

𝒩T0l𝔥l,𝔥llεC(N,ν,β)(1βu0ll)max{|Qk,𝔥ll|,|Υl0Qk,𝔥ll|:k𝒵0}ε1/8.\displaystyle\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}\leq\varepsilon\Longrightarrow C(N_{*},\nu,\beta)(1-\beta\|u^{l}_{0}\|_{\mathcal{H}_{l}})\leq\max\Big\{|\langle Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|,\,|\langle\Upsilon_{l}^{0}Q^{k},\,\mathfrak{h}^{l}\rangle_{l}|:\,k\in\mathcal{Z}_{0}\Big\}\leq\varepsilon^{1/{8}}.

Take β(0,1)\beta\in(0,1) sufficiently small so that 1βu0ll>01-\beta\|u^{l}_{0}\|_{\mathcal{H}_{l}}>0. Then there exist constant C1(N,ν,β)>0C_{1}(N_{*},\nu,\beta)>0 and C(N,η,𝐩0,p)>0C(N_{*},\eta,\mathbf{p}_{0},p)>0, locally uniformly in 𝐩0\mathbf{p}_{0}, such that for any ε(0,~0)\varepsilon\in(0,\widetilde{\mathcal{E}}_{0}) (~0:=min{0,C1(N,ν,β)\widetilde{\mathcal{E}}_{0}:=\min\{\mathcal{E}_{0},\,C_{1}(N_{*},\nu,\beta)
(1βu0ll)}(1-\beta\|u^{l}_{0}\|_{\mathcal{H}_{l}})\}), we have

sup𝔥ll×T𝐩M,𝔥l=1(𝒩T0l𝔥l,𝔥ll<ε)C(N,η,𝐩0,p)εpeηV(u0).\sup_{\mathfrak{h}^{l}\in\mathcal{H}_{l}\times T_{\mathbf{p}}M,\,\|\mathfrak{h}^{l}\|=1}\mathbb{P}\left(\langle\mathcal{N}_{{}_{T_{0}}}^{l}\mathfrak{h}^{l},\mathfrak{h}^{l}\rangle_{l}<\varepsilon\right)\leq C(N_{*},\eta,\mathbf{p}_{0},p)\varepsilon^{p}e^{\eta V(u_{0})}.

Appendix A Basic estimates

This section first collects several basic tools used throughout the paper: moment estimates for the solution, the super-Lyapunov function and its key properties. We then present Jacobian derivative estimates that will be repeatedly invoked in Sections 3 and 4.

The moment estimates for the baseline process utu_{t} in this section are obtained by classical arguments; the corresponding results and full proofs can be found in [26, Appendix A.1]. To avoid repetition, we only state the results needed here.

Proposition A.1.

Suppose that utu_{t} is a solution to (1.2). Then there exists C>0C>0 such that for all ηC1\eta\leqslant C^{-1} and for all 0st0\leqslant s\leqslant t,

𝔼exp(ηsupsrt(urL22+νsruaH22da))CeC(ts)exp(eC1sηu0L22).\displaystyle\mathbb{E}\exp\Bigl(\eta\sup_{s\leqslant r\leqslant t}\Bigl(\|\nabla u_{r}\|^{2}_{L^{2}}+\nu\int_{s}^{r}\|u_{a}\|^{2}_{H^{2}}\,\mathrm{d}a\Bigr)\Bigr)\leqslant Ce^{C(t-s)}\exp\bigl(e^{-C^{-1}s}\eta\|\nabla u_{0}\|^{2}_{L^{2}}\bigr). (A.1)

Additionally, there exists C(n)>0C(n)>0 such that for all 0ηC10\leqslant\eta\leqslant C^{-1} and 0st0\leqslant s\leqslant t,

𝔼exp(ηsupsrt(urHn+12+νsruaHn+22da)1n+2)CeCtexp(eC1sηu0Hn+12n+2+Cηu0L22),\displaystyle\mathbb{E}\exp\Bigl(\eta\sup_{s\leqslant r\leqslant t}\Bigl(\|u_{r}\|^{2}_{H^{n+1}}+\nu\int_{s}^{r}\|u_{a}\|^{2}_{H^{n+2}}\,\mathrm{d}a\Bigr)^{\frac{1}{n+2}}\Bigr)\leqslant Ce^{Ct}\exp\Bigl(e^{-C^{-1}s}\eta\|u_{0}\|^{\frac{2}{n+2}}_{H^{n+1}}+C\eta\|\nabla u_{0}\|^{2}_{L^{2}}\Bigr), (A.2)

and

𝔼exp(ηsupsrt(urHn+12+νsruaHn+22da)1n+2)CeCs1eCtexp(Cηu0L22).\displaystyle\mathbb{E}\exp\Bigl(\eta\sup_{s\leqslant r\leqslant t}\Bigl(\|u_{r}\|^{2}_{H^{n+1}}+\nu\int_{s}^{r}\|u_{a}\|^{2}_{H^{n+2}}\,\mathrm{d}a\Bigr)^{\frac{1}{n+2}}\Bigr)\leqslant Ce^{Cs^{-1}}e^{Ct}\exp\Bigl(C\eta\|\nabla u_{0}\|^{2}_{L^{2}}\Bigr). (A.3)
Definition A.2 (Super-Lyapunov function for the base process).

For uH05(𝕋2,2)u\in H_{0}^{5}(\mathbb{T}^{2},\mathbb{R}^{2}), we define

V(u):=σ(uH012+αuH051/3),\displaystyle V(u):=\sigma\bigl(\|u\|^{2}_{H_{0}^{1}}+\alpha\|u\|^{1/3}_{H_{0}^{5}}\bigr), (A.4)

where α,σ>0\alpha,\sigma>0 are some fixed small constants, chosen according to Proposition A.1 so that Corollary A.3 holds for all η(0,2)\eta\in(0,2).

Corollary A.3 (Super-Lyapunov property).

There exist δ(0,1)\delta\in(0,1) and a constant C>0C>0 such that for all η(0,2)\eta\in(0,2) and t1t\geqslant 1,

𝔼exp(ηV(ut))Cexp(δηV(u0)).\mathbb{E}\exp(\eta V(u_{t}))\leqslant C\exp(\delta\eta V(u_{0})).

Now, we provide several Jacobian estimates needed in this paper.

Lemma A.4.

For any q>0q>0, η(0,1)\eta\in(0,1) there is C(𝐩0,η,q,N)>0C(\mathbf{p}_{0},\eta,q,N_{*})>0, locally bounded in 𝐩0M\mathbf{p}_{0}\in M, such that

𝔼sup0stRs,thhh2Ce(CνN2)(ts)exp(ηV(u0)),\displaystyle\mathbb{E}\sup\limits_{0\leq s\leq t}\|R_{s,t}^{h}\|^{2}_{\mathcal{H}_{h}\to\mathcal{H}_{h}}\leq Ce^{(C-\nu N_{*}^{2})(t-s)}\exp(\eta V(u_{0})), (A.5)
𝔼sup0stRs,tll×T𝐩sMl×T𝐩tMqCeC(ts)exp(ηV(u0)),\displaystyle\mathbb{E}\sup\limits_{0\leq s\leq t}\|R_{s,t}^{l}\|^{q}_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{s}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{t}}}M}\leq Ce^{C(t-s)}\exp(\eta V(u_{0})), (A.6)
𝔼sup0stSs,tll×T𝐩sMl×T𝐩tMqCeC(ts)exp(ηV(u0)),\displaystyle\mathbb{E}\sup\limits_{0\leq s\leq t}\|S_{s,t}^{l}\|^{q}_{\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{s}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{t}}}M}\leq Ce^{C(t-s)}\exp(\eta V(u_{0})), (A.7)
𝔼supτ0tT0L~tlHln+2×T𝐩tMl×T𝐩tMqCeCT~0exp(ηV(u0)).\displaystyle\mathbb{E}\sup\limits_{\tau_{0}\leq t\leq T_{0}}\|\widetilde{L}_{t}^{l}\|^{q}_{{H}_{l}^{n+2}\times T_{\mathbf{p}_{{}_{t}}}M\to\mathcal{H}_{l}\times T_{\mathbf{p}_{{}_{t}}}M}\leq Ce^{C\widetilde{T}_{0}}\exp(\eta V(u_{0})). (A.8)
Proof.

By making slight modifications and refinements to the proof of [26, (5.2),(5.5)], we obtain the validity of (A.6) and (A.8). Furthermore, note that Ss,tlS_{s,t}^{l} satisfies

tSs,tl=A^lSs,tlDlF^l(ut,𝐩t)Ss,tl,Ss,sl=Id.\displaystyle\partial_{t}S_{s,t}^{l}=\hat{A}_{l}S_{s,t}^{l}-D_{l}\hat{F}_{l}(u_{t},\mathbf{p}_{t})S_{s,t}^{l},\quad S_{s,s}^{l}=\text{Id}.

Hence, (A.7) follows in a similar manner. It now remains to prove (A.5). First, observe that for any 𝔥hh\mathfrak{h}^{h}\in\mathcal{H}_{h}, Rs,th𝔥hR_{s,t}^{h}\mathfrak{h}^{h} satisfies

tRs,th𝔥h=νΠh(ΔRs,th𝔥h)Πh(Rs,th𝔥hut+ut(Rs,th𝔥h)),Rs,sh𝔥h=𝔥h.\displaystyle\partial_{t}R_{s,t}^{h}\mathfrak{h}^{h}=\nu\Pi_{h}(\Delta R_{s,t}^{h}\mathfrak{h}^{h})-\Pi_{h}(R_{s,t}^{h}\mathfrak{h}^{h}\cdot\nabla u_{t}+u_{t}\cdot\nabla(R_{s,t}^{h}\mathfrak{h}^{h})),\quad R_{s,s}^{h}\mathfrak{h}^{h}=\mathfrak{h}^{h}.

Then,

tRs,th𝔥hh2\displaystyle\partial_{t}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{\mathcal{H}_{h}}^{2} =2Rs,th𝔥h,tRs,th𝔥hh\displaystyle=2\langle R_{s,t}^{h}\mathfrak{h}^{h},\,\partial_{t}R_{s,t}^{h}\mathfrak{h}^{h}\rangle_{h}
=2ν(Rs,th𝔥h)h22Rs,th𝔥h,Πh(Rs,th𝔥hut+utRs,th𝔥h)h.\displaystyle=-2\nu\|\nabla(R_{s,t}^{h}\mathfrak{h}^{h})\|_{\mathcal{H}_{h}}^{2}-2\big\langle R_{s,t}^{h}\mathfrak{h}^{h},\,\Pi_{h}\big(R_{s,t}^{h}\mathfrak{h}^{h}\cdot\nabla u_{t}+u_{t}\cdot\nabla R_{s,t}^{h}\mathfrak{h}^{h}\big)\big\rangle_{h}.

Noted that

2Rs,th𝔥h,Πh(utRs,th𝔥h+Rs,th𝔥hut)h\displaystyle 2\big\langle R_{s,t}^{h}\mathfrak{h}^{h},\,\Pi_{h}\big(u_{t}\cdot\nabla R_{s,t}^{h}\mathfrak{h}^{h}+R_{s,t}^{h}\mathfrak{h}^{h}\cdot\nabla u_{t}\big)\big\rangle_{h}
=2ΠhnRs,th𝔥h,j=0n(nj)(njΔ1wtj(Rs,th𝔥h)+Δ1j(Rs,th𝔥h)njwt)L2.\displaystyle=2\Pi_{h}\bigg\langle\nabla^{n}R_{s,t}^{h}\mathfrak{h}^{h},\,\sum_{j=0}^{n}\binom{n}{j}\left(\nabla^{n-j}\nabla^{\perp}\Delta^{-1}w_{t}\cdot\nabla\nabla^{j}(R_{s,t}^{h}\mathfrak{h}^{h})+\nabla^{\perp}\Delta^{-1}\nabla^{j}(R_{s,t}^{h}\mathfrak{h}^{h})\cdot\nabla\nabla^{n-j}w_{t}\right)\bigg\rangle_{L^{2}}.

Using [26, Lemma A.8], a direct calculation gives

Πh|n(Rs,th𝔥h):j=0n(nj)(njΔ1wtj(Rs,th𝔥h)+Δ1j(Rs,th𝔥h)njwt)dx|\displaystyle\Pi_{h}\left|\int\nabla^{n}(R_{s,t}^{h}\mathfrak{h}^{h}):\sum_{j=0}^{n}\binom{n}{j}\left(\nabla^{n-j}\nabla^{\perp}\Delta^{-1}w_{t}\cdot\nabla\nabla^{j}(R_{s,t}^{h}\mathfrak{h}^{h})+\nabla^{\perp}\Delta^{-1}\nabla^{j}(R_{s,t}^{h}\mathfrak{h}^{h})\cdot\nabla\nabla^{n-j}w_{t}\right)\mathrm{d}x\right|
ν2Rs,th𝔥hHhn+12+CwtHn2n+2Rs,th𝔥hLh22ν2Rs,th𝔥hHhn+12+CwtHn2n+2Rs,th𝔥hHhn2.\displaystyle\leq\frac{\nu}{2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n+1}_{h}}^{2}+C\|w_{t}\|_{H^{n}}^{2n+2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{L^{2}_{h}}^{2}\leq\frac{\nu}{2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n+1}_{h}}^{2}+C\|w_{t}\|_{H^{n}}^{2n+2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}_{h}}^{2}.

Thus, we have that

tRs,th𝔥hHn2νRs,th𝔥hHn+12+CwtHn2n+2Rs,th𝔥hHn2.\displaystyle\partial_{t}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}}^{2}\leq-\nu\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n+1}}^{2}+C\|w_{t}\|_{H^{n}}^{2n+2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}}^{2}.

Owing to the fact that Rs,th𝔥hR_{s,t}^{h}\mathfrak{h}^{h} comprises exclusively high modes and contains all unstable direction, the following holds:

Rs,th𝔥hHn+12N2Rs,th𝔥hHn2.\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n+1}}^{2}\geq N_{*}^{2}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}}^{2}.

Then,

tRs,th𝔥hHn2(νN2CwtHn2n+2)Rs,th𝔥hHn2.\partial_{t}\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}}^{2}\leq-\big(\nu N_{*}^{2}-C\|w_{t}\|_{H^{n}}^{2n+2}\big)\|R_{s,t}^{h}\mathfrak{h}^{h}\|_{H^{n}}^{2}.

Together, Grönwall’s inequality and (A.2) yield inequality (A.5). ∎

In addition to the estimates for the Jacobian derivatives of both low and high frequencies, we also need some estimates for the full linear operator.

Lemma A.5.

For any q>0q>0, η(0,1)\eta\in(0,1) there are C(𝐩0,η,q,N)>0C(\mathbf{p}_{0},\eta,q,N_{*})>0, locally bounded in 𝐩0M\mathbf{p}_{0}\in M, and β:=β(N)>0\beta:=\beta(N_{*})>0 such that

𝔼sup0stJs,t×T𝐩sM×T𝐩tMqCeC(ts)exp(ηV(u0)),\displaystyle\mathbb{E}\sup\limits_{0\leq s\leq t}\|J_{s,t}\|^{q}_{\mathcal{H}\times T_{\mathbf{p}_{{}_{s}}}M\to\mathcal{H}\times T_{\mathbf{p}_{{}_{t}}}M}\leq Ce^{C(t-s)}\exp(\eta V(u_{0})), (A.9)
𝔼Πh(J0,τ0𝔥)×T𝐩τ0MqCeCτ0Nqexp(ηV(u0)).\displaystyle\mathbb{E}\|\Pi_{h}(J_{0,\tau_{0}}\mathfrak{h})\|^{q}_{\mathcal{H}\times T_{\mathbf{p}_{{}_{\tau_{0}}}}M}\leq Ce^{C\tau_{0}}N_{*}^{-q}\exp(\eta V(u_{0})). (A.10)
Proof.

By making refinements to the proof of [26, (5.2)], we obtain the validity of (A.9). At the same time, we observe that

ΠhJ0,τ0Hn×T𝐩MHn×T𝐩MqNqJ0,τ0L2×T𝐩MHn+1×T𝐩Mq.\|\Pi_{h}J_{0,\tau_{0}}\|_{H^{n}\times T_{\mathbf{p}}M\to H^{n}\times T_{\mathbf{p}}M}^{q}\leq N_{*}^{-q}\|J_{0,\tau_{0}}\|_{L^{2}\times T_{\mathbf{p}}M\to H^{n+1}\times T_{\mathbf{p}}M}^{q}.

Then making refinements to the proof of [26, (5.2)-(5.3)], we obtain the bound on ΠhJ0,τ0𝔥Hn×T𝐩Mq\|\Pi_{h}J_{0,\tau_{0}}\mathfrak{h}\|_{H^{n}\times T_{\mathbf{p}}M}^{q}. ∎

Finally, we also need estimates for the low and high-frequency derivatives of the drift term F^\hat{F}, which will be used to estimate the low-high frequency coupling terms.

Lemma A.6.

For any q>0q>0, η(0,1)\eta\in(0,1) there is C(𝐩0,η,q,N)>0C(\mathbf{p}_{0},\eta,q,N_{*})>0, locally bounded in 𝐩0M\mathbf{p}_{0}\in M such that

𝔼supτ0tT0DhF^lhl×T𝐩MqCeCT~0exp(ηV(u0)),\displaystyle\mathbb{E}\sup\limits_{\tau_{0}\leq t\leq T_{0}}\|D_{h}\hat{F}_{l}\|^{q}_{\mathcal{H}_{h}\to\mathcal{H}_{l}\times T_{\mathbf{p}}M}\leq Ce^{C\widetilde{T}_{0}}\exp(\eta V(u_{0})), (A.11)
𝔼supτ0tT0DlF^hl×T𝐩MhqCeCT~0exp(ηV(u0)).\displaystyle\mathbb{E}\sup\limits_{\tau_{0}\leq t\leq T_{0}}\|D_{l}\hat{F}_{h}\|^{q}_{\mathcal{H}_{l}\times T_{\mathbf{p}}M\to\mathcal{H}_{h}}\leq Ce^{C\widetilde{T}_{0}}\exp(\eta V(u_{0})). (A.12)
Proof.

For any 𝔥hh\mathfrak{h}^{h}\in\mathcal{H}_{h}, we have that

Dh(ΠlB(u,u))𝔥h\displaystyle D_{h}\big(-\Pi_{l}B(u,u)\big)\cdot\mathfrak{h}^{h} =Πl(B(u,𝔥h)+B(𝔥h,u)),\displaystyle=-\Pi_{l}\big(B(u,\mathfrak{h}^{h})+B(\mathfrak{h}^{h},u)\big),
Dhu(x)𝔥h\displaystyle D_{h}u(x)\cdot\mathfrak{h}^{h} =𝔥h,\displaystyle=\mathfrak{h}^{h},
Dh(ΠvDu(x)v)𝔥h\displaystyle D_{h}(\Pi_{v}Du(x)v)\cdot\mathfrak{h}^{h} =ΠvD𝔥h(x)v.\displaystyle=\Pi_{v}D\mathfrak{h}^{h}(x)v.

Note that here we take n=5n=5, and after projecting via Πl\Pi_{l} onto the finite-dimensional subspace, all norms are equivalent. Combining with (A.2), we have

𝔼supτ0tT0Πl(DhB(u,u)𝔥h)l\displaystyle\mathbb{E}\sup_{\tau_{0}\leq t\leq T_{0}}\|\Pi_{l}\big(D_{h}B(u,u)\mathfrak{h}^{h})\|_{\mathcal{H}_{l}} CeCT~0exp(ηV(u0))𝔥hh,\displaystyle\leq Ce^{C\widetilde{T}_{0}}\exp(\eta V(u_{0}))\|\mathfrak{h}^{h}\|_{h},
|Dhu(x)𝔥h|\displaystyle|D_{h}u(x)\cdot\mathfrak{h}^{h}| 𝔥hC0C𝔥hh,\displaystyle\leq\|\mathfrak{h}^{h}\|_{C^{0}}\leq C\|\mathfrak{h}^{h}\|_{h},
|D𝔥h(x)v|\displaystyle|D\mathfrak{h}^{h}(x)v| C𝔥hh.\displaystyle\leq C\|\mathfrak{h}^{h}\|_{h}.

We have thereby established (A.11). Similarly, (A.12) can be proved. ∎

Declarations

Availability of data: No new data were generated or analysed in support of this search.

Conflict of interests: The authors declare that there are no conflict of interests, we do not have any possible conflicts of interest.

Funding: The manuscript is supported by National Natural Science Foundation of China (No. 12571189).

References

  • [1] Bohr T, Jensen M H, Paladin G, et al. Dynamical systems approach to turbulence[M]. 1998.
  • [2] Bedrossian J, Blumenthal A, Punshon-Smith S. Lagrangian chaos and scalar advection in stochastic fluid mechanics[J]. Journal of the European Mathematical Society, 2022, 24(6): 1893-1990.
  • [3] Bedrossian J, Blumenthal A, Punshon-Smith S. Almost-sure exponential mixing of passive scalars by the stochastic Navier–Stokes equations[J]. The Annals of Probability, 2022, 50(1): 241-303.
  • [4] Kuksin, Sergei, Armen Shirikyan. Mathematics of two-dimensional turbulence[M]. Cambridge University Press, 2012.
  • [5] Mattingly J C, Ya Sinai. Gibbsian Dynamics and Ergodicity for the Stochastically Forced Navier–Stokes Equation[J]. Communications in Mathematical Physics. 2001, 224(1): 83-106.
  • [6] Eckmann J P, Martin Hairer. Uniqueness of the invariant measure for a stochastic pde driven by degenerate noise[J]. Communications in Mathematical Physics 2001, 229(3): 523-565.
  • [7] Flandoli F, Maslowski B. Ergodicity of the 2-D Navier-Stokes equation under random perturbations[J]. Communications in mathematical physics, 1995, 172(1): 119-141.
  • [8] Bricmont J, Kupiainen A, Lefevere R. Ergodicity of the 2D Navier-Stokes Equations with Random Forcing[J]. Communications in Mathematical Physics, 2001, 224(1): 65-81.
  • [9] Kuksin S, Shirikyan A. A Coupling Approach to Randomly Forced Nonlinear PDE’s. I[J]. Communications in Mathematical Physics, 2001, 221(2): 351-366.
  • [10] Mattingly J C. Exponential convergence for the stochastically forced Navier-Stokes equations and other partially dissipative dynamics[J]. Communications in mathematical physics, 2002, 230(3): 421-462.
  • [11] Hairer M, Mattingly J C. Ergodicity of the 2D Navier-Stokes equations with degenerate stochastic forcing[J]. Annals of Mathematics, 2006: 993-1032.
  • [12] Hairer M, Mattingly J C. A theory of hypoellipticity and unique ergodicity for semilinear stochastic PDEs[J]. Electronic Journal of Probability, 2011, 16: 23.
  • [13] Eyink G L. Exact results on stationary turbulence in 2D: consequences of vorticity conservation[J]. Physica D: Nonlinear Phenomena, 1996, 91(1-2): 97-142.
  • [14] Novikov E A. Functionals and the random-force method in turbulence theory[J]. Sov. Phys. JETP, 1965, 20(5): 1290-1294.
  • [15] Young L S. Mathematical theory of Lyapunov exponents[J]. Journal of Physics A: Mathematical and Theoretical, 2013, 46(25): 254001.
  • [16] Bedrossian J, Blumenthal A, Punshon-Smith S. A regularity method for lower bounds on the Lyapunov exponent for stochastic differential equations[J]. Inventiones mathematicae, 2022, 227(2): 429-516.
  • [17] Amon C H, Guzmán A M, Morel B. Lagrangian chaos, Eulerian chaos, and mixing enhancement in converging-diverging channel flows[J]. Physics of Fluids, 1996, 8(5): 1192-1206.
  • [18] Crisanti A, Falcioni M, Vulpiani A, et al. Lagrangian chaos: transport, mixing and diffusion in fluids[J]. La Rivista del Nuovo Cimento (1978-1999), 1991, 14(12): 1-80.
  • [19] Finn J M, del-Castillo-Negrete D. Lagrangian chaos and Eulerian chaos in shear flow dynamics[J]. Chaos: An Interdisciplinary Journal of Nonlinear Science, 2001, 11(4): 816-832.
  • [20] Galluccio S, Vulpiani A. Stretching of material lines and surfaces in systems with Lagrangian chaos[J]. Physica A: Statistical Mechanics and its Applications, 1994, 212(1-2): 75-98.
  • [21] Bedrossian J, Punshon-Smith S. Chaos in Stochastic 2d Galerkin-Navier-Stokes[J]. Communications in Mathematical Physics, 2024, 405(4): 107.
  • [22] Bedrossian J, Blumenthal A, Punshon-Smith S. A regularity method for lower bounds on the Lyapunov exponent for stochastic differential equations[J]. Inventiones mathematicae, 2022, 227(2): 429-516.
  • [23] Bedrossian J, Blumenthal A, Punshon-Smith S. Almost-sure exponential mixing of passive scalars by the stochastic Navier-Stokes equations[J]. The Annals of Probability, 2022, 50(1): 241-303.
  • [24] Bedrossian J, Blumenthal A, Punshon-Smith S. Almost-sure enhanced dissipation and uniform-in-diffusivity exponential mixing for advection-diffusion by stochastic Navier-Stokes[J]. Probability Theory and Related Fields, 2021, 179(3): 777-834.
  • [25] Bedrossian J, Blumenthal A, Punshon-Smith S. The Batchelor spectrum of passive scalar turbulence in stochastic fluid mechanics at fixed Reynolds number[J]. Communications on Pure and Applied Mathematics, 2022, 75(6): 1237-1291.
  • [26] Cooperman W, Rowan K. Exponential scalar mixing for the 2D Navier-Stokes equations with degenerate stochastic forcing[J]. arXiv preprint arXiv:2408.02459, 2024.
  • [27] Nersesyan V, Zhang D, Zhou C. On the chaotic behavior of the Lagrangian flow of the 2D Navier-Stokes system with bounded degenerate noise[J]. arXiv preprint arXiv:2406.17612, 2024.
  • [28] Földes J, Glatt-Holtz N, Richards G, et al. Ergodic and mixing properties of the Boussinesq equations with a degenerate random forcing[J]. Journal of Functional Analysis, 2015, 269(8): 2427-2504.
  • [29] Oseledets V I. A multiplicative ergodic theorem. Characteristic Ljapunov, exponents of dynamical systems[J]. Trudy Moskovskogo Matematicheskogo Obshchestva, 1968, 19: 179-210.
  • [30] Ledrappier F. Positivity of the exponent for stationary sequences of matrices[C]//Lyapunov Exponents: Proceedings of a Workshop held in Bremen, November 12–15, 1984. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006: 56-73.
  • [31] Bougerol P. Comparaison des exposants de Lyapounov des processus markoviens multiplicatifs[C]//Annales de l’IHP Probabilités et statistiques. 1988, 24(4): 439-489.
  • [32] Blumenthal A, Coti Zelati M, Gvalani R S. Exponential mixing for random dynamical systems and an example of Pierrehumbert[J]. The Annals of Probability, 2023, 51(4): 1559-1601.
  • [33] Kifer Y. Ergodic theory of random transformations[M]. Springer Science & Business Media, 2012.
  • [34] Norris J. Simplified malliavin calculus[M]//Séminaire de Probabilités XX 1984/85: Proceedings. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006: 101-130.
  • [35] Nualart D. The Malliavin calculus and related topics[M]. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006.
  • [36] Chen D, Zheng Y. Lagrangian chaos for the 2D Boussinesq equations with a degenerate random forcing. Preprint, 2025.
BETA